assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002250075.1_ASM225007v1	NZ_CP016893	Thermoanaerobacterium thermosaccharolyticum strain TG57 chromosome, complete genome	1	1017078-1021895	1,1,1	CRISPRCasFinder,PILER-CR,CRT	no	cas2,cas1,cas4,cas3,cas5,cas7,cas8b1,cas6,csm2gr11,csm3gr7,csx10gr5,cas10,csx1,csx20	Cas9_archaeal,csa3,cas14j,cas2,cas1,cas4,cas3,cas5,cas7,cas8b1,cas6,csm2gr11,csm3gr7,csx10gr5,cas10,csx1,csx20,WYL,DinG,cas7b,DEDDh	Type I-B,Type III-A,Type III-C,Type III-B,Type III-D	ATTTCAATTCCTGATAGGTAGGCTAAAAAC,TTTCAATTCCTGATAGGTAGGCTAAAAAC,TTTCAATTCCTGATAGGTAGGCTAAAAAC	30,29,29	0	0	NA	NA	NA:NA:NA	72,72,72	72	TypeI-B,TypeIII-A,TypeIII-C,TypeIII-B,TypeIII-D	Cas9_archaeal,csa3,cas14j,cas2,cas1,cas4,cas3,cas5,cas7,cas8b1,cas6,csm2gr11,csm3gr7,csx10gr5,cas10,csx1,csx20,WYL,DinG,cas7b,DEDDh	NA,csm2gr11|122aa|down_5|NZ_CP016893.1_1028838_1029204_-,csm2gr11|150aa|down_9|NZ_CP016893.1_1031607_1032057_-	NA|742aa|up_9|NZ_CP016893.1_1006751_1008977_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|148aa|up_8|NZ_CP016893.1_1009009_1009453_-	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|522aa|up_7|NZ_CP016893.1_1009586_1011152_-	cd01031, EriC, ClC chloride channel EriC	NA|207aa|up_6|NZ_CP016893.1_1011358_1011979_+	COG0490, COG0490, Putative regulatory, ligand-binding protein related to C-terminal domains of K+ channels [Inorganic ion transport and metabolism]	NA|370aa|up_5|NZ_CP016893.1_1012099_1013209_+	COG1125, OpuBA, ABC-type proline/glycine betaine transport systems, ATPase components [Amino acid transport and metabolism]	NA|213aa|up_4|NZ_CP016893.1_1013221_1013860_+	COG1174, OpuBB, ABC-type proline/glycine betaine transport systems, permease component [Amino acid transport and metabolism]	NA|297aa|up_3|NZ_CP016893.1_1013874_1014765_+	cd13609, PBP2_Opu_like_1, Substrate-binding domain of putative ABC-type osmoprotectant uptake system; the type 2 periplasmic-binding protein fold	cas2|88aa|up_2|NZ_CP016893.1_1014794_1015058_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|331aa|up_1|NZ_CP016893.1_1015070_1016063_-	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas4|166aa|up_0|NZ_CP016893.1_1016059_1016557_-	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas3|788aa|down_0|NZ_CP016893.1_1022063_1024427_-	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	cas5|238aa|down_1|NZ_CP016893.1_1024404_1025118_-	TIGR02592, hypothetical_protein_CTC_01466, CRISPR-associated protein Cas5, subtype I-B/HMARI	cas7|304aa|down_2|NZ_CP016893.1_1025193_1026105_-	TIGR02590, hypothetical_protein_MM_0563, CRISPR-associated protein Cas7/Csh2, subtype I-B/HMARI	cas8b1|587aa|down_3|NZ_CP016893.1_1026101_1027862_-	pfam09484, Cas_TM1802, CRISPR-associated protein TM1802 (cas_TM1802)	cas6|247aa|down_4|NZ_CP016893.1_1027877_1028618_-	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	csm2gr11|122aa|down_5|NZ_CP016893.1_1028838_1029204_-	NA	csm3gr7|285aa|down_6|NZ_CP016893.1_1029217_1030072_-	TIGR03986, CRISPR-associated_protein, CRISPR-associated protein	csm3gr7|260aa|down_7|NZ_CP016893.1_1030058_1030838_-	cd09683, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm3gr7|250aa|down_8|NZ_CP016893.1_1030830_1031580_-	cd09683, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm2gr11|150aa|down_9|NZ_CP016893.1_1031607_1032057_-	NA
GCF_002250075.1_ASM225007v1	NZ_CP016893	Thermoanaerobacterium thermosaccharolyticum strain TG57 chromosome, complete genome	2	1469250-1469413	2,2	PILER-CR,CRISPRCasFinder	no		Cas9_archaeal,csa3,cas14j,cas2,cas1,cas4,cas3,cas5,cas7,cas8b1,cas6,csm2gr11,csm3gr7,csx10gr5,cas10,csx1,csx20,WYL,DinG,cas7b,DEDDh	Orphan	GGATTTAGTCTACCTATTGATTGATTATGCCC,ATTTAGTCTACCTATTGATTGATTATGCCC	32,30	0	0	NA	NA	NA:NA	2,2	2	Orphan	Cas9_archaeal,csa3,cas14j,cas2,cas1,cas4,cas3,cas5,cas7,cas8b1,cas6,csm2gr11,csm3gr7,csx10gr5,cas10,csx1,csx20,WYL,DinG,cas7b,DEDDh	NA|89aa|up_2|NZ_CP016893.1_1465756_1466023_+,NA|238aa|down_0|NZ_CP016893.1_1469854_1470568_+,NA|77aa|down_5|NZ_CP016893.1_1474950_1475181_+,NA|69aa|down_7|NZ_CP016893.1_1475887_1476094_-	NA|120aa|up_9|NZ_CP016893.1_1458462_1458822_+	COG2945, COG2945, Predicted hydrolase of the alpha/beta superfamily [General function prediction only]	NA|150aa|up_8|NZ_CP016893.1_1458796_1459246_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|364aa|up_7|NZ_CP016893.1_1459710_1460802_+	COG1397, DraG, ADP-ribosylglycohydrolase [Posttranslational modification, protein turnover, chaperones]	NA|374aa|up_6|NZ_CP016893.1_1460851_1461973_-	pfam06782, UPF0236, Uncharacterized protein family (UPF0236)	NA|178aa|up_5|NZ_CP016893.1_1462100_1462634_+	pfam14690, zf-ISL3, zinc-finger of transposase IS204/IS1001/IS1096/IS1165	NA|267aa|up_4|NZ_CP016893.1_1464156_1464957_+	COG3267, ExeA, Type II secretory pathway, component ExeA (predicted ATPase) [Intracellular trafficking and secretion]	NA|159aa|up_3|NZ_CP016893.1_1465074_1465551_-	pfam06782, UPF0236, Uncharacterized protein family (UPF0236)	NA|89aa|up_2|NZ_CP016893.1_1465756_1466023_+	NA	NA|121aa|up_1|NZ_CP016893.1_1466567_1466930_-	pfam06114, Peptidase_M78, IrrE N-terminal-like domain	NA|473aa|up_0|NZ_CP016893.1_1467117_1468536_-	pfam06782, UPF0236, Uncharacterized protein family (UPF0236)	NA|238aa|down_0|NZ_CP016893.1_1469854_1470568_+	NA	NA|255aa|down_1|NZ_CP016893.1_1470695_1471460_+	TIGR01391, DNA_primase, DNA primase, catalytic core	NA|276aa|down_2|NZ_CP016893.1_1471452_1472280_+	pfam12146, Hydrolase_4, Serine aminopeptidase, S33	NA|77aa|down_3|NZ_CP016893.1_1473287_1473518_+	TIGR03789, pdsO, proteobacterial sortase system peptidoglycan-associated protein	NA|399aa|down_4|NZ_CP016893.1_1473665_1474862_+	pfam13546, DDE_5, DDE superfamily endonuclease	NA|77aa|down_5|NZ_CP016893.1_1474950_1475181_+	NA	NA|86aa|down_6|NZ_CP016893.1_1475516_1475774_+	pfam03683, UPF0175, Uncharacterized protein family (UPF0175)	NA|69aa|down_7|NZ_CP016893.1_1475887_1476094_-	NA	NA|367aa|down_8|NZ_CP016893.1_1480288_1481389_+	pfam02317, Octopine_DH, NAD/NADP octopine/nopaline dehydrogenase, alpha-helical domain	NA|551aa|down_9|NZ_CP016893.1_1481385_1483038_+	pfam02310, B12-binding, B12 binding domain
