assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000166055.1_ASM16605v1	NC_014664	Rhodomicrobium vannielii ATCC 17100, complete sequence	1	1352288-1357227	1,1,1	CRISPRCasFinder,CRT,PILER-CR	no	cas3,cas5,cas8c,cas7,cas4,cas1,cas2	csa3,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,WYL,csb2gr5,RT,csm3gr7,csx19,cas10,csm6,csx1,csx16,DEDDh	Type I-C,Type I-U, Type I-U?	GTCGCTCCCCGTGCGGGAGCGTGGATCGAAAC,GTCGCTCCCCGTGCGGGAGCGTGGATCGAAAC,GTCGCTCCCCGTGCGGGAGCGTGGATCGAAAC	32,32,32	0	0	NA	NA	I-C:I-C:I-C	74,74,71	74	TypeI-C,TypeI-U,TypeI-U?	csa3,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,WYL,csb2gr5,RT,csm3gr7,csx19,cas10,csm6,csx1,csx16,DEDDh	NA|171aa|up_8|NC_014664.1_1339927_1340440_-,NA|274aa|down_8|NC_014664.1_1369717_1370539_+,NA|54aa|down_9|NC_014664.1_1370521_1370683_-	NA|166aa|up_9|NC_014664.1_1339152_1339650_+	pfam13463, HTH_27, Winged helix DNA-binding domain	NA|171aa|up_8|NC_014664.1_1339927_1340440_-	NA	NA|783aa|up_7|NC_014664.1_1341026_1343375_+	pfam08707, PriCT_2, Primase C terminal 2 (PriCT-2)	cas3|744aa|up_6|NC_014664.1_1344395_1346627_+	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	cas5|233aa|up_5|NC_014664.1_1346642_1347341_+	cd09752, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas8c|590aa|up_4|NC_014664.1_1347337_1349107_+	pfam09709, Cas_Csd1, CRISPR-associated protein (Cas_Csd1)	cas7|320aa|up_3|NC_014664.1_1349103_1350063_+	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas4|247aa|up_2|NC_014664.1_1349990_1350731_+	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas1|345aa|up_1|NC_014664.1_1350727_1351762_+	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas2|97aa|up_0|NC_014664.1_1351771_1352062_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|265aa|down_0|NC_014664.1_1357529_1358324_+	COG1120, FepC, ABC-type cobalamin/Fe3+-siderophores transport systems, ATPase components [Inorganic ion transport and metabolism / Coenzyme metabolism]	NA|338aa|down_1|NC_014664.1_1358320_1359334_+	pfam01032, FecCD, FecCD transport family	NA|353aa|down_2|NC_014664.1_1359330_1360389_+	pfam01032, FecCD, FecCD transport family	NA|317aa|down_3|NC_014664.1_1360523_1361474_-	COG0614, FepB, ABC-type Fe3+-hydroxamate transport system, periplasmic component [Inorganic ion transport and metabolism]	NA|696aa|down_4|NC_014664.1_1361592_1363680_-	cd01347, ligand_gated_channel, TonB dependent/Ligand-Gated channels are created by a monomeric 22 strand (22,24) anti-parallel beta-barrel	NA|359aa|down_5|NC_014664.1_1364036_1365113_-	cd06193, siderophore_interacting, Siderophore interacting proteins share the domain structure of the ferredoxin reductase like family	NA|402aa|down_6|NC_014664.1_1365464_1366670_-	COG1858, MauG, Cytochrome c peroxidase [Inorganic ion transport and metabolism]	NA|956aa|down_7|NC_014664.1_1366683_1369551_-	smart00965, STN, Secretin and TonB N terminus short domain	NA|274aa|down_8|NC_014664.1_1369717_1370539_+	NA	NA|54aa|down_9|NC_014664.1_1370521_1370683_-	NA
GCF_000166055.1_ASM16605v1	NC_014664	Rhodomicrobium vannielii ATCC 17100, complete sequence	2	1636014-1636645	2,2,2	PILER-CR,CRISPRCasFinder,CRT	no		csa3,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,WYL,csb2gr5,RT,csm3gr7,csx19,cas10,csm6,csx1,csx16,DEDDh	Orphan	GTCGCTCCCCGTGCGGGAGCGTGGATCGAAAC,GTCGCTCCCCGTGCGGGAGCGTGGATCGAAAC,NNNNNNNNNNGTCGCTCCCCGTGCGGGAGCGTGGATCGAAAC	32,32,42	0	0	NA	NA	I-C:I-C:NA	9,9,9	9	Orphan	csa3,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,WYL,csb2gr5,RT,csm3gr7,csx19,cas10,csm6,csx1,csx16,DEDDh	NA|65aa|up_9|NC_014664.1_1626294_1626489_+,NA|191aa|up_7|NC_014664.1_1627040_1627613_-,NA|674aa|up_5|NC_014664.1_1628438_1630460_+,NA|73aa|up_1|NC_014664.1_1635069_1635288_-,NA|119aa|up_0|NC_014664.1_1635631_1635988_-,NA|95aa|down_4|NC_014664.1_1642537_1642822_+	NA|65aa|up_9|NC_014664.1_1626294_1626489_+	NA	NA|127aa|up_8|NC_014664.1_1626571_1626952_-	COG5499, COG5499, Predicted transcription regulator containing HTH domain [Transcription]	NA|191aa|up_7|NC_014664.1_1627040_1627613_-	NA	NA|64aa|up_6|NC_014664.1_1628166_1628358_+	pfam07927, HicA_toxin, HicA toxin of bacterial toxin-antitoxin,	NA|674aa|up_5|NC_014664.1_1628438_1630460_+	NA	NA|544aa|up_4|NC_014664.1_1630969_1632601_+	COG3593, COG3593, Predicted ATP-dependent endonuclease of the OLD family [DNA replication, recombination, and repair]	NA|380aa|up_3|NC_014664.1_1632597_1633737_+	cd17932, DEXQc_UvrD, DEXQD-box helicase domain of UvrD	NA|257aa|up_2|NC_014664.1_1634302_1635073_-	COG3617, COG3617, Prophage antirepressor [Transcription]	NA|73aa|up_1|NC_014664.1_1635069_1635288_-	NA	NA|119aa|up_0|NC_014664.1_1635631_1635988_-	NA	NA|322aa|down_0|NC_014664.1_1637311_1638277_-	pfam00665, rve, Integrase core domain	NA|196aa|down_1|NC_014664.1_1638405_1638993_+	cd07185, OmpA_C-like, Peptidoglycan binding domains similar to the C-terminal domain of outer-membrane protein OmpA	NA|136aa|down_2|NC_014664.1_1639862_1640270_+	pfam01527, HTH_Tnp_1, Transposase	NA|528aa|down_3|NC_014664.1_1640711_1642295_+	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|95aa|down_4|NC_014664.1_1642537_1642822_+	NA	NA|52aa|down_5|NC_014664.1_1643518_1643674_-	pfam00556, LHC, Antenna complex alpha/beta subunit	NA|68aa|down_6|NC_014664.1_1644218_1644422_-	pfam00556, LHC, Antenna complex alpha/beta subunit	NA|52aa|down_7|NC_014664.1_1644444_1644600_-	pfam00556, LHC, Antenna complex alpha/beta subunit	NA|65aa|down_8|NC_014664.1_1644712_1644907_-	pfam00556, LHC, Antenna complex alpha/beta subunit	NA|51aa|down_9|NC_014664.1_1644931_1645084_-	pfam00556, LHC, Antenna complex alpha/beta subunit
GCF_000166055.1_ASM16605v1	NC_014664	Rhodomicrobium vannielii ATCC 17100, complete sequence	3	2048647-2049436	3,3,3	PILER-CR,CRISPRCasFinder,CRT	no	csb2gr5,RT,cas2,cas1,csm3gr7,csx19,cas10,csm6,csx1,csx16	csa3,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,WYL,csb2gr5,RT,csm3gr7,csx19,cas10,csm6,csx1,csx16,DEDDh	Type III-D,Type III-B,Type III-C,Type III-A	CTCTCAGGCCGCTTCGGCGGCTTGCCCGCGTGAAAC,CTCTCAGGCCGCTTCGGCGGCTTGCCCGCGTGAAAC,CTCTCAGGCCGCTTCGGCGGCTTGCCCGCGTGAAAC	36,36,36	0	0	NA	NA	NA:NA:NA	10,11,11	11	TypeIII-D,TypeIII-B,TypeIII-C,TypeIII-A	csa3,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,WYL,csb2gr5,RT,csm3gr7,csx19,cas10,csm6,csx1,csx16,DEDDh	NA|97aa|up_9|NC_014664.1_2042745_2043036_+,NA|57aa|up_8|NC_014664.1_2043123_2043294_-,NA|49aa|up_7|NC_014664.1_2043376_2043523_+,NA|94aa|up_6|NC_014664.1_2043624_2043906_+,NA|152aa|down_5|NC_014664.1_2059368_2059824_+,NA|124aa|down_8|NC_014664.1_2062012_2062384_-	NA|97aa|up_9|NC_014664.1_2042745_2043036_+	NA	NA|57aa|up_8|NC_014664.1_2043123_2043294_-	NA	NA|49aa|up_7|NC_014664.1_2043376_2043523_+	NA	NA|94aa|up_6|NC_014664.1_2043624_2043906_+	NA	NA|278aa|up_5|NC_014664.1_2043902_2044736_+	pfam06414, Zeta_toxin, Zeta toxin	NA|75aa|up_4|NC_014664.1_2044940_2045165_-	TIGR02613, cell_filamentation_protein, mobile mystery protein B	csb2gr5|251aa|up_3|NC_014664.1_2045203_2045956_-	TIGR02165, CRISPR-associated_protein_GSU0054_family, CRISPR-associated protein GSU0054/csb2, Dpsyc system	RT|294aa|up_2|NC_014664.1_2045952_2046834_-	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	cas2|98aa|up_1|NC_014664.1_2046848_2047142_-	pfam09827, CRISPR_Cas2, CRISPR associated protein Cas2	cas1|367aa|up_0|NC_014664.1_2047138_2048239_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	csm3gr7|767aa|down_0|NC_014664.1_2049501_2051802_-	TIGR03986, CRISPR-associated_protein, CRISPR-associated protein	csx19|214aa|down_1|NC_014664.1_2051798_2052440_-	TIGR03984, hypothetical_protein_FrEUN1fDRAFT_5778, CRISPR-associated protein, TIGR03984 family	csm3gr7|563aa|down_2|NC_014664.1_2052436_2054125_-	COG1337, COG1337, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	csm3gr7|798aa|down_3|NC_014664.1_2054121_2056515_-	pfam03787, RAMPs, RAMP superfamily	cas10|578aa|down_4|NC_014664.1_2056511_2058245_-	COG1353, COG1353, Predicted CRISPR-associated polymerase [Defense mechanisms]	NA|152aa|down_5|NC_014664.1_2059368_2059824_+	NA	csm6|361aa|down_6|NC_014664.1_2059759_2060842_-	cd09742, Csm6_III-A, CRISPR/Cas system-associated protein Csm6	csx1|389aa|down_7|NC_014664.1_2060853_2062020_-	cd09741, Csx1_III-U, CRISPR/Cas system-associated protein Csx1	NA|124aa|down_8|NC_014664.1_2062012_2062384_-	NA	csx16|99aa|down_9|NC_014664.1_2062431_2062728_-	pfam09652, Cas_VVA1548, Putative CRISPR-associated protein (Cas_VVA1548)
GCF_000166055.1_ASM16605v1	NC_014664	Rhodomicrobium vannielii ATCC 17100, complete sequence	4	2058649-2059300	4,4,4	PILER-CR,CRISPRCasFinder,CRT	no	csb2gr5,RT,cas2,cas1,csm3gr7,csx19,cas10,csm6,csx1,csx16,WYL	csa3,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,WYL,csb2gr5,RT,csm3gr7,csx19,cas10,csm6,csx1,csx16,DEDDh	Type III-D,Type III-B,Type III-C,Type III-A	CTTTCAGGCCGCTTCGGCGGCTTGCCCGCGTGAAAC,CTTTCAGGCCGCTTCGGCGGCTTGCCCGCGTGAAAC,CTTTCAGGCCGCTTCGGCGGCTTGCCCGCGTGAAAC	36,36,36	0	0	NA	NA	NA:NA:NA	8,9,9	9	TypeIII-D,TypeIII-B,TypeIII-C,TypeIII-A	csa3,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,WYL,csb2gr5,RT,csm3gr7,csx19,cas10,csm6,csx1,csx16,DEDDh	NA,NA|152aa|down_0|NC_014664.1_2059368_2059824_+,NA|124aa|down_3|NC_014664.1_2062012_2062384_-,NA|612aa|down_7|NC_014664.1_2064018_2065854_-,NA|316aa|down_8|NC_014664.1_2066082_2067030_-	NA|75aa|up_9|NC_014664.1_2044940_2045165_-	TIGR02613, cell_filamentation_protein, mobile mystery protein B	csb2gr5|251aa|up_8|NC_014664.1_2045203_2045956_-	TIGR02165, CRISPR-associated_protein_GSU0054_family, CRISPR-associated protein GSU0054/csb2, Dpsyc system	RT|294aa|up_7|NC_014664.1_2045952_2046834_-	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	cas2|98aa|up_6|NC_014664.1_2046848_2047142_-	pfam09827, CRISPR_Cas2, CRISPR associated protein Cas2	cas1|367aa|up_5|NC_014664.1_2047138_2048239_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	csm3gr7|767aa|up_4|NC_014664.1_2049501_2051802_-	TIGR03986, CRISPR-associated_protein, CRISPR-associated protein	csx19|214aa|up_3|NC_014664.1_2051798_2052440_-	TIGR03984, hypothetical_protein_FrEUN1fDRAFT_5778, CRISPR-associated protein, TIGR03984 family	csm3gr7|563aa|up_2|NC_014664.1_2052436_2054125_-	COG1337, COG1337, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	csm3gr7|798aa|up_1|NC_014664.1_2054121_2056515_-	pfam03787, RAMPs, RAMP superfamily	cas10|578aa|up_0|NC_014664.1_2056511_2058245_-	COG1353, COG1353, Predicted CRISPR-associated polymerase [Defense mechanisms]	NA|152aa|down_0|NC_014664.1_2059368_2059824_+	NA	csm6|361aa|down_1|NC_014664.1_2059759_2060842_-	cd09742, Csm6_III-A, CRISPR/Cas system-associated protein Csm6	csx1|389aa|down_2|NC_014664.1_2060853_2062020_-	cd09741, Csx1_III-U, CRISPR/Cas system-associated protein Csx1	NA|124aa|down_3|NC_014664.1_2062012_2062384_-	NA	csx16|99aa|down_4|NC_014664.1_2062431_2062728_-	pfam09652, Cas_VVA1548, Putative CRISPR-associated protein (Cas_VVA1548)	NA|89aa|down_5|NC_014664.1_2063019_2063286_+	COG2929, COG2929, Uncharacterized protein conserved in bacteria [Function unknown]	NA|100aa|down_6|NC_014664.1_2063263_2063563_+	pfam14384, BrnA_antitoxin, BrnA antitoxin of type II toxin-antitoxin system	NA|612aa|down_7|NC_014664.1_2064018_2065854_-	NA	NA|316aa|down_8|NC_014664.1_2066082_2067030_-	NA	NA|601aa|down_9|NC_014664.1_2067319_2069122_-	PRK13878, PRK13878, conjugal transfer relaxase TraI; Provisional
