assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001267405.1_ASM126740v1	NZ_CP012369	Moorella thermoacetica strain DSM 521 chromosome, complete genome	1	420191-421882	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cas4,cas7,cas8c,cas5,cas3,WYL	cas3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5,WYL,DinG,cas6,RT	 Type I-U?,Type I-U,Type I-C	GTTTCAACCCTCGCCCGGCATGGAAGCCGGGCGCGAC,GTTTCAACCCTCGCCCGGCATGGAAGCCGGGCGCGAC,GTTTCAACCCTCGCCCGGCATGGAAGCCGGGCGCGAC	37,37,37	0	0	NA	NA	I-C,III-B:I-C,III-B:I-C,III-B	23,23,23	23	TypeI-U?,TypeI-U,TypeI-C	cas3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5,WYL,DinG,cas6,RT	NA|272aa|up_4|NZ_CP012369.1_411764_412580_+,NA|486aa|up_3|NZ_CP012369.1_413124_414582_+,NA|55aa|down_9|NZ_CP012369.1_431178_431343_-	NA|1152aa|up_9|NZ_CP012369.1_402860_406316_+	TIGR02773, ATP-dependent_helicase/deoxyribonuclease_subunit_B, helicase-exonuclease AddAB, AddB subunit	NA|1406aa|up_8|NZ_CP012369.1_406325_410543_+	TIGR02785, ATP-dependent_helicase/nuclease_subunit_A, helicase-exonuclease AddAB, AddA subunit, Firmicutes type	NA|74aa|up_7|NZ_CP012369.1_410735_410957_+	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|79aa|up_6|NZ_CP012369.1_410953_411190_+	COG1724, COG1724, Predicted RNA binding protein (dsRBD-like fold), HicA family    [General function prediction only]	NA|102aa|up_5|NZ_CP012369.1_411239_411545_+	pfam06114, Peptidase_M78, IrrE N-terminal-like domain	NA|272aa|up_4|NZ_CP012369.1_411764_412580_+	NA	NA|486aa|up_3|NZ_CP012369.1_413124_414582_+	NA	NA|204aa|up_2|NZ_CP012369.1_414674_415286_+	pfam13835, DUF4194, Domain of unknown function (DUF4194)	NA|1208aa|up_1|NZ_CP012369.1_415254_418878_+	pfam13558, SbcCD_C, Putative exonuclease SbcCD, C subunit	NA|444aa|up_0|NZ_CP012369.1_418810_420142_+	cd00223, TOPRIM_TopoIIB_SPO, TOPRIM_TopoIIB_SPO: topoisomerase-primase (TOPRIM) nucleotidyl transferase/hydrolase domain of the type found in the type IIB family of DNA topoisomerases and Spo11	cas2|97aa|down_0|NZ_CP012369.1_422053_422344_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|344aa|down_1|NZ_CP012369.1_422376_423408_-	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas4|215aa|down_2|NZ_CP012369.1_423479_424124_-	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas7|298aa|down_3|NZ_CP012369.1_424207_425101_-	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas8c|561aa|down_4|NZ_CP012369.1_425193_426876_-	cd09757, Cas8c_I-C, CRISPR/Cas system-associated protein Cas8c	cas5|234aa|down_5|NZ_CP012369.1_426885_427587_-	cd09752, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas3|725aa|down_6|NZ_CP012369.1_427648_429823_-	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	WYL|230aa|down_7|NZ_CP012369.1_430028_430718_+	COG2378, COG2378, Predicted transcriptional regulator [Transcription]	WYL|58aa|down_8|NZ_CP012369.1_430677_430851_+	pfam13280, WYL, WYL domain	NA|55aa|down_9|NZ_CP012369.1_431178_431343_-	NA
GCF_001267405.1_ASM126740v1	NZ_CP012369	Moorella thermoacetica strain DSM 521 chromosome, complete genome	2	616943-617034	2	CRISPRCasFinder	no	WYL	cas3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5,WYL,DinG,cas6,RT	Unclear	GGGGCGCATGCTCTATGGCCTCTCCGGC	28	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5,WYL,DinG,cas6,RT	NA|90aa|up_8|NZ_CP012369.1_607528_607798_+,NA|191aa|down_3|NZ_CP012369.1_620535_621108_+	WYL|207aa|up_9|NZ_CP012369.1_606219_606840_+	pfam13280, WYL, WYL domain	NA|90aa|up_8|NZ_CP012369.1_607528_607798_+	NA	NA|159aa|up_7|NZ_CP012369.1_607769_608246_+	cd18738, PIN_VapC4-5_FitB-like, uncharacterized subgroup of the PIN_VapC4-5_FitB-like subfamily of the PIN domain superfamily	NA|183aa|up_6|NZ_CP012369.1_610047_610596_+	COG1087, GalE, UDP-glucose 4-epimerase [Cell envelope biogenesis, outer membrane]	NA|154aa|up_5|NZ_CP012369.1_610927_611389_-	cd18738, PIN_VapC4-5_FitB-like, uncharacterized subgroup of the PIN_VapC4-5_FitB-like subfamily of the PIN domain superfamily	NA|82aa|up_4|NZ_CP012369.1_611376_611622_-	smart00966, SpoVT_AbrB, SpoVT / AbrB like domain	NA|91aa|up_3|NZ_CP012369.1_612109_612382_+	smart00966, SpoVT_AbrB, SpoVT / AbrB like domain	NA|133aa|up_2|NZ_CP012369.1_612369_612768_+	cd18680, PIN_MtVapC20-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC20 and related proteins	NA|837aa|up_1|NZ_CP012369.1_613016_615527_-	COG2206, COG2206, c-di-GMP phosphodiesterase class II (HD-GYP domain) [Signal transduction mechanisms]	NA|377aa|up_0|NZ_CP012369.1_615530_616661_-	COG3287, COG3287, Uncharacterized conserved protein [Function unknown]	NA|271aa|down_0|NZ_CP012369.1_617397_618210_+	pfam00072, Response_reg, Response regulator receiver domain	NA|307aa|down_1|NZ_CP012369.1_618621_619542_+	pfam04294, VanW, VanW like protein	NA|185aa|down_2|NZ_CP012369.1_619733_620288_+	PLN02915, PLN02915, cellulose synthase A [UDP-forming], catalytic subunit	NA|191aa|down_3|NZ_CP012369.1_620535_621108_+	NA	NA|529aa|down_4|NZ_CP012369.1_621157_622744_+	cd06061, PurM-like1, AIR synthase (PurM) related protein, subgroup 1 of unknown function	NA|267aa|down_5|NZ_CP012369.1_623029_623830_+	COG1349, GlpR, Transcriptional regulators of sugar metabolism [Transcription / Carbohydrate transport and metabolism]	NA|359aa|down_6|NZ_CP012369.1_623888_624965_+	cd01536, PBP1_ABC_sugar_binding-like, periplasmic sugar-binding domain of active transport systems that are members of the type 1 periplasmic binding protein (PBP1) superfamily	NA|493aa|down_7|NZ_CP012369.1_625025_626504_+	COG1129, MglA, ABC-type sugar transport system, ATPase component [Carbohydrate transport and metabolism]	NA|331aa|down_8|NZ_CP012369.1_626531_627524_+	cd06579, TM_PBP1_transp_AraH_like, Transmembrane subunit (TM) of Escherichia coli AraH and related proteins	NA|422aa|down_9|NZ_CP012369.1_627565_628831_+	PRK09550, mtnK, methylthioribose kinase; Reviewed
GCF_001267405.1_ASM126740v1	NZ_CP012369	Moorella thermoacetica strain DSM 521 chromosome, complete genome	3	1667560-1670187	2,3,2	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cas4,cas3,cas5,cas7,cas6	cas3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5,WYL,DinG,cas6,RT	Unclear	GTTCAAATTCCTCTATGGTCGATGGTCAC,GTTCAAATTCCTCTATGGTCGATGGTCAC,GTTCAAATTCCTCTATGGTCGATGGTCAC	29,29,29	0	0	NA	NA	NA:NA:NA	40,40,40	40	Unclear	cas3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5,WYL,DinG,cas6,RT	NA|60aa|up_3|NZ_CP012369.1_1663941_1664121_-,NA|63aa|up_1|NZ_CP012369.1_1666110_1666299_-,NA	NA|620aa|up_9|NZ_CP012369.1_1655746_1657606_-	COG1894, NuoF, NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit [Energy production and conversion]	NA|158aa|up_8|NZ_CP012369.1_1657641_1658115_-	pfam01257, 2Fe-2S_thioredx, Thioredoxin-like [2Fe-2S] ferredoxin	NA|365aa|up_7|NZ_CP012369.1_1658590_1659685_-	pfam16864, dimerization2, dimerization domain	NA|362aa|up_6|NZ_CP012369.1_1659681_1660767_-	pfam02277, DBI_PRT, Phosphoribosyltransferase	NA|427aa|up_5|NZ_CP012369.1_1660792_1662073_-	PRK13352, PRK13352, phosphomethylpyrimidine synthase ThiC	NA|433aa|up_4|NZ_CP012369.1_1662085_1663384_-	PRK13352, PRK13352, phosphomethylpyrimidine synthase ThiC	NA|60aa|up_3|NZ_CP012369.1_1663941_1664121_-	NA	NA|403aa|up_2|NZ_CP012369.1_1664835_1666044_+	COG5421, COG5421, Transposase [DNA replication, recombination, and repair]	NA|63aa|up_1|NZ_CP012369.1_1666110_1666299_-	NA	NA|127aa|up_0|NZ_CP012369.1_1667132_1667513_-	cd01126, TraG_VirD4, The TraG/TraD/VirD4 family are bacterial conjugation proteins involved in type IV secretion	cas2|94aa|down_0|NZ_CP012369.1_1670268_1670550_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|322aa|down_1|NZ_CP012369.1_1670552_1671518_-	cd09722, Cas1_I-B, CRISPR/Cas system-associated protein Cas1	cas4|178aa|down_2|NZ_CP012369.1_1671534_1672068_-	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas3|918aa|down_3|NZ_CP012369.1_1672057_1674811_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|295aa|down_4|NZ_CP012369.1_1674810_1675695_-	TIGR01895, conserved_hypothetical_protein, CRISPR-associated protein Cas5, subtype I-B/TNEAP	cas7|355aa|down_5|NZ_CP012369.1_1675694_1676759_-	pfam01905, DevR, CRISPR-associated negative auto-regulator DevR/Csa2	NA|493aa|down_6|NZ_CP012369.1_1676785_1678264_-	TIGR01908, Uncharacterized_protein_aq_372, CRISPR-associated protein Cas8b1/Cst1, subtype I-B/TNEAP	cas6|227aa|down_7|NZ_CP012369.1_1678267_1678948_-	cd09652, Cas6-I-III, CRISPR/Cas system-associated RAMP superfamily protein Cas6	NA|468aa|down_8|NZ_CP012369.1_1679246_1680650_-	PRK09613, thiH, thiamine biosynthesis protein ThiH; Reviewed	NA|351aa|down_9|NZ_CP012369.1_1680764_1681817_-	PRK07094, PRK07094, biotin synthase; Provisional
