assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001484725.1_ASM148472v1	NZ_CP012395	Clostridium autoethanogenum DSM 10061 chromosome, complete genome	1	646171-646367	1	PILER-CR	no		cas8b1,DinG,PD-DExK,csa3,DEDDh,cas2,cas1,cas4,cas3,cas5,cas7b,cas6,WYL,cas3HD,c2c10_CAS-V-U3	Orphan	AATGAATAAGTGATTCACACCAAATCAAAGATTTGGGTTCTAATTTAGGTAACTC	55	1	1	646297-646312	NZ_CP012395.1_646368-646383	NA	2	2	Orphan	cas8b1,DinG,PD-DExK,csa3,DEDDh,cas2,cas1,cas4,cas3,cas5,cas7b,cas6,WYL,cas3HD,c2c10_CAS-V-U3	NA|262aa|up_8|NZ_CP012395.1_638512_639298_-,NA|153aa|down_7|NZ_CP012395.1_657712_658171_-,NA|99aa|down_8|NZ_CP012395.1_658256_658553_+	NA|228aa|up_9|NZ_CP012395.1_637367_638051_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|262aa|up_8|NZ_CP012395.1_638512_639298_-	NA	NA|232aa|up_7|NZ_CP012395.1_639297_639993_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|123aa|up_6|NZ_CP012395.1_639995_640364_-	COG1725, COG1725, Predicted transcriptional regulators [Transcription]	NA|151aa|up_5|NZ_CP012395.1_640550_641003_-	cd06262, metallo-hydrolase-like_MBL-fold, mainly hydrolytic enzymes and related proteins which carry out various biological functions; MBL-fold metallohydrolase domain	NA|224aa|up_4|NZ_CP012395.1_641184_641856_+	COG1182, AcpD, Acyl carrier protein phosphodiesterase [Lipid metabolism]	NA|125aa|up_3|NZ_CP012395.1_642035_642410_+	pfam03965, Penicillinase_R, Penicillinase repressor	NA|509aa|up_2|NZ_CP012395.1_642421_643948_+	pfam05569, Peptidase_M56, BlaR1 peptidase M56	NA|368aa|up_1|NZ_CP012395.1_644024_645128_-	pfam12671, Amidase_6, Putative amidase domain	NA|302aa|up_0|NZ_CP012395.1_645247_646153_-	pfam13539, Peptidase_M15_4, D-alanyl-D-alanine carboxypeptidase	NA|379aa|down_0|NZ_CP012395.1_646882_648019_+	COG1453, COG1453, Predicted oxidoreductases of the aldo/keto reductase family [General function prediction only]	NA|1069aa|down_1|NZ_CP012395.1_648249_651456_-	PRK05294, carB, carbamoyl-phosphate synthase large subunit	NA|355aa|down_2|NZ_CP012395.1_651499_652564_-	PRK12564, PRK12564, carbamoyl-phosphate synthase small subunit	NA|334aa|down_3|NZ_CP012395.1_652578_653580_-	PRK02102, PRK02102, ornithine carbamoyltransferase; Validated	NA|301aa|down_4|NZ_CP012395.1_653871_654774_-	COG0179, MhpD, 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) [Secondary metabolites biosynthesis, transport, and catabolism]	NA|208aa|down_5|NZ_CP012395.1_654820_655444_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|723aa|down_6|NZ_CP012395.1_655495_657664_-	TIGR01389, recQ, ATP-dependent DNA helicase RecQ	NA|153aa|down_7|NZ_CP012395.1_657712_658171_-	NA	NA|99aa|down_8|NZ_CP012395.1_658256_658553_+	NA	NA|172aa|down_9|NZ_CP012395.1_658629_659145_-	pfam03802, CitX, Apo-citrate lyase phosphoribosyl-dephospho-CoA transferase
GCF_001484725.1_ASM148472v1	NZ_CP012395	Clostridium autoethanogenum DSM 10061 chromosome, complete genome	2	1245721-1245812	1	CRISPRCasFinder	no		cas8b1,DinG,PD-DExK,csa3,DEDDh,cas2,cas1,cas4,cas3,cas5,cas7b,cas6,WYL,cas3HD,c2c10_CAS-V-U3	Orphan	ACCAATGATTGCTATAATAAATA	23	0	0	NA	NA	NA	1	1	Orphan	cas8b1,DinG,PD-DExK,csa3,DEDDh,cas2,cas1,cas4,cas3,cas5,cas7b,cas6,WYL,cas3HD,c2c10_CAS-V-U3	NA|252aa|up_7|NZ_CP012395.1_1238547_1239303_+,NA|99aa|up_1|NZ_CP012395.1_1244090_1244387_+,NA|57aa|up_0|NZ_CP012395.1_1244603_1244774_+,NA|147aa|down_1|NZ_CP012395.1_1247627_1248068_+,NA|74aa|down_4|NZ_CP012395.1_1249994_1250216_+	NA|327aa|up_9|NZ_CP012395.1_1236826_1237807_+	cd12185, HGDH_LDH_like, Putative Lactate dehydrogenase and (R)-2-Hydroxyglutarate Dehydrogenase-like proteins, NAD-binding and catalytic domains	NA|76aa|up_8|NZ_CP012395.1_1237927_1238155_-	pfam13031, DUF3892, Protein of unknown function (DUF3892)	NA|252aa|up_7|NZ_CP012395.1_1238547_1239303_+	NA	NA|204aa|up_6|NZ_CP012395.1_1239358_1239970_-	pfam06161, DUF975, Protein of unknown function (DUF975)	NA|407aa|up_5|NZ_CP012395.1_1240163_1241384_+	cd17391, MFS_MdtG_MDR_like, Multidrug resistance protein MdtG and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|187aa|up_4|NZ_CP012395.1_1241429_1241990_-	cd03395, PAP2_like_4, PAP2_like_4 proteins	NA|119aa|up_3|NZ_CP012395.1_1242372_1242729_+	COG1733, COG1733, Predicted transcriptional regulators [Transcription]	NA|139aa|up_2|NZ_CP012395.1_1242817_1243234_+	COG3871, COG3871, Uncharacterized stress protein (general stress protein 26) [General function prediction only]	NA|99aa|up_1|NZ_CP012395.1_1244090_1244387_+	NA	NA|57aa|up_0|NZ_CP012395.1_1244603_1244774_+	NA	NA|101aa|down_0|NZ_CP012395.1_1246806_1247109_+	pfam07261, DnaB_2, Replication initiation and membrane attachment	NA|147aa|down_1|NZ_CP012395.1_1247627_1248068_+	NA	NA|143aa|down_2|NZ_CP012395.1_1248152_1248581_+	PRK10562, PRK10562, putative acetyltransferase; Provisional	NA|257aa|down_3|NZ_CP012395.1_1248621_1249392_+	smart00650, rADc, Ribosomal RNA adenine dimethylases	NA|74aa|down_4|NZ_CP012395.1_1249994_1250216_+	NA	NA|55aa|down_5|NZ_CP012395.1_1250433_1250598_+	pfam12841, YvrJ, YvrJ protein family	NA|75aa|down_6|NZ_CP012395.1_1250681_1250906_+	pfam07872, DUF1659, Protein of unknown function (DUF1659)	NA|102aa|down_7|NZ_CP012395.1_1251213_1251519_-	pfam11667, DUF3267, Putative zincin peptidase	NA|300aa|down_8|NZ_CP012395.1_1252212_1253112_+	TIGR00950, Uncharacterized_inner_membrane_transporter_YicL, Carboxylate/Amino Acid/Amine Transporter	NA|113aa|down_9|NZ_CP012395.1_1253826_1254165_-	COG4823, AbiF, Abortive infection bacteriophage resistance protein [Defense mechanisms]
GCF_001484725.1_ASM148472v1	NZ_CP012395	Clostridium autoethanogenum DSM 10061 chromosome, complete genome	3	1489404-1490822	2,2,1	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cas4,cas3,cas5,cas7b,cas8b1,cas6	cas8b1,DinG,PD-DExK,csa3,DEDDh,cas2,cas1,cas4,cas3,cas5,cas7b,cas6,WYL,cas3HD,c2c10_CAS-V-U3	Type I-B	ATTTAAATACATCTCATGTTGAGGTTCAAC,ATTTAAATACATCTCATGTTGAGGTTCAAC,ATTTAAATACATCTCATGTTGAGGTTCAAC	30,30,30	0	0	NA	NA	II-B:II-B:II-B	21,21,21	21	TypeI-B	cas8b1,DinG,PD-DExK,csa3,DEDDh,cas2,cas1,cas4,cas3,cas5,cas7b,cas6,WYL,cas3HD,c2c10_CAS-V-U3	NA|130aa|up_3|NZ_CP012395.1_1486205_1486595_-,NA|172aa|down_9|NZ_CP012395.1_1503925_1504441_-	NA|331aa|up_9|NZ_CP012395.1_1481057_1482050_-	cd06309, PBP1_galactofuranose_YtfQ-like, periplasmic binding domain of ABC-type galactofuranose YtfQ-like transport systems	NA|204aa|up_8|NZ_CP012395.1_1482526_1483138_-	COG0558, PgsA, Phosphatidylglycerophosphate synthase [Lipid metabolism]	NA|181aa|up_7|NZ_CP012395.1_1483143_1483686_-	PRK06242, PRK06242, flavodoxin; Provisional	NA|198aa|up_6|NZ_CP012395.1_1483710_1484304_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|243aa|up_5|NZ_CP012395.1_1484548_1485277_-	COG1664, CcmA, Integral membrane protein CcmA involved in cell shape determination [Cell envelope biogenesis, outer membrane]	NA|210aa|up_4|NZ_CP012395.1_1485296_1485926_-	pfam13171, DUF4004, Protein of unknown function (DUF4004)	NA|130aa|up_3|NZ_CP012395.1_1486205_1486595_-	NA	NA|251aa|up_2|NZ_CP012395.1_1486615_1487368_-	cd05333, BKR_SDR_c, beta-Keto acyl carrier protein reductase (BKR), involved in Type II FAS, classical (c) SDRs	NA|269aa|up_1|NZ_CP012395.1_1487509_1488316_-	cd07713, DHPS-like_MBL-fold, Methanocaldococcus jannaschii dihydropteroate synthase, Thermoanaerobacter tengcongensis Tflp, and related proteins; MBL-fold metallo hydrolase domain	NA|72aa|up_0|NZ_CP012395.1_1488472_1488688_-	pfam08765, Mor, Mor transcription activator family	cas2|95aa|down_0|NZ_CP012395.1_1491000_1491285_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|335aa|down_1|NZ_CP012395.1_1491301_1492306_-	cd09722, Cas1_I-B, CRISPR/Cas system-associated protein Cas1	cas4|165aa|down_2|NZ_CP012395.1_1492323_1492818_-	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas3|856aa|down_3|NZ_CP012395.1_1492836_1495404_-	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	cas5|249aa|down_4|NZ_CP012395.1_1495416_1496163_-	TIGR02592, hypothetical_protein_CTC_01466, CRISPR-associated protein Cas5, subtype I-B/HMARI	cas7b|310aa|down_5|NZ_CP012395.1_1496166_1497096_-	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas8b1|585aa|down_6|NZ_CP012395.1_1497096_1498851_-	TIGR02591, cas_Csh1, CRISPR-associated protein Cas8b/Csh1, subtype I-B/HMARI	cas6|237aa|down_7|NZ_CP012395.1_1498869_1499580_-	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	NA|254aa|down_8|NZ_CP012395.1_1503134_1503896_-	pfam02517, Abi, CAAX protease self-immunity	NA|172aa|down_9|NZ_CP012395.1_1503925_1504441_-	NA
GCF_001484725.1_ASM148472v1	NZ_CP012395	Clostridium autoethanogenum DSM 10061 chromosome, complete genome	4	1499998-1502801	3,2,3,4	CRISPRCasFinder,CRT,PILER-CR,PILER-CR	no	cas2,cas1,cas4,cas3,cas5,cas7b,cas8b1,cas6	cas8b1,DinG,PD-DExK,csa3,DEDDh,cas2,cas1,cas4,cas3,cas5,cas7b,cas6,WYL,cas3HD,c2c10_CAS-V-U3	Type I-B	ATTTAAATACATCTCATGTTGAGGTTCAAC,ATTTAAATACATCTCATGTTGAGGTTCAAC,TTAATTTAAATACATCCCATGTTAAGGTTCAACT,ATTTAAATACATCTCATGTTGAGGTTCAAC	30,30,34,30	0	0	NA	NA	II-B:II-B:III-B:II-B	42,42,38,38	42	TypeI-B	cas8b1,DinG,PD-DExK,csa3,DEDDh,cas2,cas1,cas4,cas3,cas5,cas7b,cas6,WYL,cas3HD,c2c10_CAS-V-U3	NA,NA|172aa|down_1|NZ_CP012395.1_1503925_1504441_-	NA|269aa|up_9|NZ_CP012395.1_1487509_1488316_-	cd07713, DHPS-like_MBL-fold, Methanocaldococcus jannaschii dihydropteroate synthase, Thermoanaerobacter tengcongensis Tflp, and related proteins; MBL-fold metallo hydrolase domain	NA|72aa|up_8|NZ_CP012395.1_1488472_1488688_-	pfam08765, Mor, Mor transcription activator family	cas2|95aa|up_7|NZ_CP012395.1_1491000_1491285_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|335aa|up_6|NZ_CP012395.1_1491301_1492306_-	cd09722, Cas1_I-B, CRISPR/Cas system-associated protein Cas1	cas4|165aa|up_5|NZ_CP012395.1_1492323_1492818_-	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas3|856aa|up_4|NZ_CP012395.1_1492836_1495404_-	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	cas5|249aa|up_3|NZ_CP012395.1_1495416_1496163_-	TIGR02592, hypothetical_protein_CTC_01466, CRISPR-associated protein Cas5, subtype I-B/HMARI	cas7b|310aa|up_2|NZ_CP012395.1_1496166_1497096_-	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas8b1|585aa|up_1|NZ_CP012395.1_1497096_1498851_-	TIGR02591, cas_Csh1, CRISPR-associated protein Cas8b/Csh1, subtype I-B/HMARI	cas6|237aa|up_0|NZ_CP012395.1_1498869_1499580_-	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	NA|254aa|down_0|NZ_CP012395.1_1503134_1503896_-	pfam02517, Abi, CAAX protease self-immunity	NA|172aa|down_1|NZ_CP012395.1_1503925_1504441_-	NA	NA|133aa|down_2|NZ_CP012395.1_1507049_1507448_+	cd07824, SRPBCC_6, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins	NA|169aa|down_3|NZ_CP012395.1_1507462_1507969_-	pfam13238, AAA_18, AAA domain	NA|181aa|down_4|NZ_CP012395.1_1507996_1508539_-	cd02139, nitroreductase, nitroreductase family protein	NA|155aa|down_5|NZ_CP012395.1_1508742_1509207_+	pfam07853, DUF1648, Protein of unknown function (DUF1648)	NA|293aa|down_6|NZ_CP012395.1_1509221_1510100_-	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|485aa|down_7|NZ_CP012395.1_1510255_1511710_+	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily	NA|146aa|down_8|NZ_CP012395.1_1511845_1512283_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|190aa|down_9|NZ_CP012395.1_1512724_1513294_+	pfam09512, ThiW, Thiamine-precursor transporter protein (ThiW)
GCF_001484725.1_ASM148472v1	NZ_CP012395	Clostridium autoethanogenum DSM 10061 chromosome, complete genome	5	1504570-1506776	4,3,5	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,cas4,cas3,cas5,cas7b,cas8b1,cas6	cas8b1,DinG,PD-DExK,csa3,DEDDh,cas2,cas1,cas4,cas3,cas5,cas7b,cas6,WYL,cas3HD,c2c10_CAS-V-U3	Type I-B	ATTTAAATACATCTTATGTTGAGGTTCAAC,ATTTAAATACATCTTATGTTGAGGTTCAAC,ATTTAAATACATCTTATGTTGAGGTTCAAC	30,30,30	0	0	NA	NA	II-B:II-B:II-B	33,33,31	33	TypeI-B	cas8b1,DinG,PD-DExK,csa3,DEDDh,cas2,cas1,cas4,cas3,cas5,cas7b,cas6,WYL,cas3HD,c2c10_CAS-V-U3	NA|172aa|up_0|NZ_CP012395.1_1503925_1504441_-,NA	cas2|95aa|up_9|NZ_CP012395.1_1491000_1491285_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|335aa|up_8|NZ_CP012395.1_1491301_1492306_-	cd09722, Cas1_I-B, CRISPR/Cas system-associated protein Cas1	cas4|165aa|up_7|NZ_CP012395.1_1492323_1492818_-	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas3|856aa|up_6|NZ_CP012395.1_1492836_1495404_-	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	cas5|249aa|up_5|NZ_CP012395.1_1495416_1496163_-	TIGR02592, hypothetical_protein_CTC_01466, CRISPR-associated protein Cas5, subtype I-B/HMARI	cas7b|310aa|up_4|NZ_CP012395.1_1496166_1497096_-	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas8b1|585aa|up_3|NZ_CP012395.1_1497096_1498851_-	TIGR02591, cas_Csh1, CRISPR-associated protein Cas8b/Csh1, subtype I-B/HMARI	cas6|237aa|up_2|NZ_CP012395.1_1498869_1499580_-	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	NA|254aa|up_1|NZ_CP012395.1_1503134_1503896_-	pfam02517, Abi, CAAX protease self-immunity	NA|172aa|up_0|NZ_CP012395.1_1503925_1504441_-	NA	NA|133aa|down_0|NZ_CP012395.1_1507049_1507448_+	cd07824, SRPBCC_6, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins	NA|169aa|down_1|NZ_CP012395.1_1507462_1507969_-	pfam13238, AAA_18, AAA domain	NA|181aa|down_2|NZ_CP012395.1_1507996_1508539_-	cd02139, nitroreductase, nitroreductase family protein	NA|155aa|down_3|NZ_CP012395.1_1508742_1509207_+	pfam07853, DUF1648, Protein of unknown function (DUF1648)	NA|293aa|down_4|NZ_CP012395.1_1509221_1510100_-	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|485aa|down_5|NZ_CP012395.1_1510255_1511710_+	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily	NA|146aa|down_6|NZ_CP012395.1_1511845_1512283_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|190aa|down_7|NZ_CP012395.1_1512724_1513294_+	pfam09512, ThiW, Thiamine-precursor transporter protein (ThiW)	NA|266aa|down_8|NZ_CP012395.1_1513271_1514069_+	cd01170, THZ_kinase, 4-methyl-5-beta-hydroxyethylthiazole (Thz) kinase catalyzes the phosphorylation of the hydroxylgroup of Thz	NA|472aa|down_9|NZ_CP012395.1_1514116_1515532_-	smart00422, HTH_MERR, helix_turn_helix, mercury resistance
