assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000577895.1_M2_40	NZ_HG917868	Clostridium bornimense strain M2/40 chromosome M2/40_rep1	1	217843-223120	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas14j,WYL,cas6,cas8b2,cas7,cas5,cas3,cas4,cas1,cas2	cas14j,WYL,cas6,cas8b2,cas7,cas5,cas3,cas4,cas1,cas2,csa3,DinG,DEDDh	Unclear	CTTTTATCTTAACTATGAGGAATGTAAAT,CTTTTATCTTAACTATGAGGAATGTAAAT,CTTTTATCTTAACTATGAGGAATGTAAAT	29,29,29	0	0	NA	NA	I-A:I-A:I-A	80,80,80	80	TypeV	cas14j,WYL,cas6,cas8b2,cas7,cas5,cas3,cas4,cas1,cas2,csa3,DinG,DEDDh	NA|143aa|up_9|NZ_HG917868.1_207881_208310_+,NA|72aa|down_1|NZ_HG917868.1_223539_223755_-,NA|183aa|down_4|NZ_HG917868.1_225806_226355_+	NA|143aa|up_9|NZ_HG917868.1_207881_208310_+	NA	WYL|314aa|up_8|NZ_HG917868.1_208475_209417_+	pfam13280, WYL, WYL domain	cas6|246aa|up_7|NZ_HG917868.1_209512_210250_+	cd09652, Cas6-I-III, CRISPR/Cas system-associated RAMP superfamily protein Cas6	cas8b2|600aa|up_6|NZ_HG917868.1_210262_212062_+	cd09665, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1	cas7|302aa|up_5|NZ_HG917868.1_212062_212968_+	cd09687, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas5|221aa|up_4|NZ_HG917868.1_212972_213635_+	cd09658, Cas5_I-B, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas3|731aa|up_3|NZ_HG917868.1_213684_215877_+	TIGR01587, CRISPR-associated_endonuclease/helicase_Cas3, CRISPR-associated helicase Cas3	cas4|173aa|up_2|NZ_HG917868.1_215889_216408_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas1|328aa|up_1|NZ_HG917868.1_216407_217391_+	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas2|88aa|up_0|NZ_HG917868.1_217393_217657_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|71aa|down_0|NZ_HG917868.1_223309_223522_-	pfam11148, DUF2922, Protein of unknown function (DUF2922)	NA|72aa|down_1|NZ_HG917868.1_223539_223755_-	NA	NA|277aa|down_2|NZ_HG917868.1_223991_224822_+	pfam04931, DNA_pol_phi, DNA polymerase phi	NA|239aa|down_3|NZ_HG917868.1_224901_225618_-	pfam13529, Peptidase_C39_2, Peptidase_C39 like family	NA|183aa|down_4|NZ_HG917868.1_225806_226355_+	NA	NA|143aa|down_5|NZ_HG917868.1_226355_226784_-	COG4970, FimT, Tfp pilus assembly protein FimT [Cell motility and secretion / Intracellular trafficking and secretion]	NA|371aa|down_6|NZ_HG917868.1_226972_228085_+	cd00009, AAA, The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold	NA|952aa|down_7|NZ_HG917868.1_228244_231100_+	COG0553, HepA, Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair]	NA|2006aa|down_8|NZ_HG917868.1_231277_237295_+	cd01951, lectin_L-type, legume lectins	NA|1211aa|down_9|NZ_HG917868.1_237366_240999_+	pfam01841, Transglut_core, Transglutaminase-like superfamily
GCF_000577895.1_M2_40	NZ_HG917868	Clostridium bornimense strain M2/40 chromosome M2/40_rep1	2	379114-379206	2	CRISPRCasFinder	no		cas14j,WYL,cas6,cas8b2,cas7,cas5,cas3,cas4,cas1,cas2,csa3,DinG,DEDDh	Orphan	TTATTCTACAGATGACATAGTTAATCCACA	30	0	0	NA	NA	NA	1	1	Orphan	cas14j,WYL,cas6,cas8b2,cas7,cas5,cas3,cas4,cas1,cas2,csa3,DinG,DEDDh	NA|101aa|up_7|NZ_HG917868.1_370398_370701_+,NA|273aa|down_1|NZ_HG917868.1_380999_381818_+	NA|192aa|up_9|NZ_HG917868.1_368571_369147_+	cd04301, NAT_SF, N-Acyltransferase superfamily: Various enzymes that characteristically catalyze the transfer of an acyl group to a substrate	NA|310aa|up_8|NZ_HG917868.1_369271_370201_-	TIGR00990, Mitochondrial_import_receptor_subunit_TOM70, mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70)	NA|101aa|up_7|NZ_HG917868.1_370398_370701_+	NA	NA|643aa|up_6|NZ_HG917868.1_371088_373017_+	PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed	NA|255aa|up_5|NZ_HG917868.1_373545_374310_+	pfam11738, DUF3298, Protein of unknown function (DUF3298)	NA|98aa|up_4|NZ_HG917868.1_374360_374654_+	pfam06949, DUF1292, Protein of unknown function (DUF1292)	NA|257aa|up_3|NZ_HG917868.1_374711_375482_+	COG0084, TatD, Mg-dependent DNase [DNA replication, recombination, and repair]	NA|349aa|up_2|NZ_HG917868.1_375644_376691_+	cd14667, 3D_containing_proteins, Non-mltA associated 3D domain containing proteins, named for 3 conserved aspartate residues	NA|183aa|up_1|NZ_HG917868.1_376893_377442_+	TIGR00334, Ribonuclease_M5, ribonuclease M5	NA|285aa|up_0|NZ_HG917868.1_377468_378323_+	PRK00274, ksgA, 16S rRNA (adenine(1518)-N(6)/adenine(1519)-N(6))-dimethyltransferase RsmA	NA|264aa|down_0|NZ_HG917868.1_380207_380999_+	pfam01790, LGT, Prolipoprotein diacylglyceryl transferase	NA|273aa|down_1|NZ_HG917868.1_380999_381818_+	NA	NA|285aa|down_2|NZ_HG917868.1_381951_382806_+	pfam10609, ParA, NUBPL iron-transfer P-loop NTPase	NA|150aa|down_3|NZ_HG917868.1_382822_383272_-	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|285aa|down_4|NZ_HG917868.1_383273_384128_-	TIGR00762, DegV, EDD domain protein, DegV family	NA|283aa|down_5|NZ_HG917868.1_384380_385229_+	COG1284, COG1284, Uncharacterized conserved protein [Function unknown]	NA|187aa|down_6|NZ_HG917868.1_385406_385967_+	PRK05618, PRK05618, 50S ribosomal protein L25/general stress protein Ctc; Reviewed	NA|398aa|down_7|NZ_HG917868.1_386602_387796_-	PRK13902, alaS, alanyl-tRNA synthetase; Provisional	NA|371aa|down_8|NZ_HG917868.1_387893_389006_+	COG1251, NirB, NAD(P)H-nitrite reductase [Energy production and conversion]	NA|246aa|down_9|NZ_HG917868.1_389075_389813_+	PRK12434, PRK12434, tRNA pseudouridine(38-40) synthase TruA
GCF_000577895.1_M2_40	NZ_HG917868	Clostridium bornimense strain M2/40 chromosome M2/40_rep1	3	1653092-1653183	3	CRISPRCasFinder	no		cas14j,WYL,cas6,cas8b2,cas7,cas5,cas3,cas4,cas1,cas2,csa3,DinG,DEDDh	Orphan	TTCCACCATTACCACCATCGCCT	23	0	0	NA	NA	NA	1	1	Orphan	cas14j,WYL,cas6,cas8b2,cas7,cas5,cas3,cas4,cas1,cas2,csa3,DinG,DEDDh	NA,NA	NA|87aa|up_9|NZ_HG917868.1_1641456_1641717_+	pfam01649, Ribosomal_S20p, Ribosomal protein S20	NA|339aa|up_8|NZ_HG917868.1_1641749_1642766_-	PRK05574, holA, DNA polymerase III subunit delta; Reviewed	NA|540aa|up_7|NZ_HG917868.1_1642768_1644388_-	pfam03772, Competence, Competence protein	NA|792aa|up_6|NZ_HG917868.1_1644439_1646815_-	COG0474, MgtA, Cation transport ATPase [Inorganic ion transport and metabolism]	NA|209aa|up_5|NZ_HG917868.1_1646845_1647472_-	TIGR01259, ComE_operon_protein_1, comEA protein	NA|298aa|up_4|NZ_HG917868.1_1647528_1648422_-	COG0564, RluA, Pseudouridylate synthases, 23S RNA-specific [Translation, ribosomal structure and biogenesis]	NA|129aa|up_3|NZ_HG917868.1_1648487_1648874_-	TIGR00004, RutC_family_protein, reactive intermediate/imine deaminase	NA|455aa|up_2|NZ_HG917868.1_1649042_1650407_-	COG1316, LytR, Transcriptional regulator [Transcription]	NA|392aa|up_1|NZ_HG917868.1_1650418_1651594_-	PRK07152, nadD, nicotinate-nucleotide adenylyltransferase	NA|97aa|up_0|NZ_HG917868.1_1651629_1651920_-	pfam01985, CRS1_YhbY, CRS1 / YhbY (CRM) domain	NA|101aa|down_0|NZ_HG917868.1_1653338_1653641_-	PRK05435, rpmA, 50S ribosomal protein L27; Validated	NA|109aa|down_1|NZ_HG917868.1_1653645_1653972_-	pfam04327, Peptidase_Prp, Cysteine protease Prp	NA|104aa|down_2|NZ_HG917868.1_1653974_1654286_-	PRK05573, rplU, 50S ribosomal protein L21; Validated	NA|480aa|down_3|NZ_HG917868.1_1654398_1655838_-	pfam10150, RNase_E_G, Ribonuclease E/G family	NA|237aa|down_4|NZ_HG917868.1_1656004_1656715_-	pfam10105, DUF2344, Uncharacterized protein conserved in bacteria (DUF2344)	NA|620aa|down_5|NZ_HG917868.1_1656692_1658552_-	TIGR03960, radical_SAM_domain_protein, radical SAM family uncharacterized protein	NA|274aa|down_6|NZ_HG917868.1_1658620_1659442_-	cd06161, S2P-M50_SpoIVFB, SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation	NA|378aa|down_7|NZ_HG917868.1_1659841_1660975_-	TIGR02210, Rod_shape-determining_protein_RodA, rod shape-determining protein RodA	NA|88aa|down_8|NZ_HG917868.1_1661106_1661370_-	PRK13987, PRK13987, cell division topological specificity factor MinE; Provisional	NA|265aa|down_9|NZ_HG917868.1_1661387_1662182_-	TIGR01968, Septum_site-determining_protein_MinD, septum site-determining protein MinD
GCF_000577895.1_M2_40	NZ_HG917868	Clostridium bornimense strain M2/40 chromosome M2/40_rep1	4	2299388-2299478	4	CRISPRCasFinder	no		cas14j,WYL,cas6,cas8b2,cas7,cas5,cas3,cas4,cas1,cas2,csa3,DinG,DEDDh	Orphan	TAGTTAAGTAGTGGTGACAAGCC	23	0	0	NA	NA	NA	1	1	Orphan	cas14j,WYL,cas6,cas8b2,cas7,cas5,cas3,cas4,cas1,cas2,csa3,DinG,DEDDh	NA,NA	NA|443aa|up_9|NZ_HG917868.1_2284265_2285594_-	cd05387, BY-kinase, bacterial tyrosine-kinase	NA|449aa|up_8|NZ_HG917868.1_2285846_2287193_-	COG1004, Ugd, Predicted UDP-glucose 6-dehydrogenase [Cell envelope biogenesis, outer membrane]	NA|220aa|up_7|NZ_HG917868.1_2287205_2287865_-	pfam02397, Bac_transf, Bacterial sugar transferase	NA|461aa|up_6|NZ_HG917868.1_2287894_2289277_-	cd05387, BY-kinase, bacterial tyrosine-kinase	NA|188aa|up_5|NZ_HG917868.1_2289646_2290210_-	PRK05618, PRK05618, 50S ribosomal protein L25/general stress protein Ctc; Reviewed	NA|490aa|up_4|NZ_HG917868.1_2290364_2291834_+	COG1982, LdcC, Arginine/lysine/ornithine decarboxylases [Amino acid transport and metabolism]	NA|292aa|up_3|NZ_HG917868.1_2291990_2292866_-	COG3677, COG3677, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|970aa|up_2|NZ_HG917868.1_2293154_2296064_-	COG1026, COG1026, Predicted Zn-dependent peptidases, insulinase-like [General function prediction only]	NA|145aa|up_1|NZ_HG917868.1_2296442_2296877_-	pfam04463, DUF523, Protein of unknown function (DUF523)	NA|780aa|up_0|NZ_HG917868.1_2296881_2299221_-	PRK00409, PRK00409, recombination and DNA strand exchange inhibitor protein; Reviewed	NA|779aa|down_0|NZ_HG917868.1_2299508_2301845_-	COG0826, COG0826, Collagenase and related proteases [Posttranslational modification, protein turnover, chaperones]	NA|378aa|down_1|NZ_HG917868.1_2301897_2303031_-	pfam05164, ZapA, Cell division protein ZapA	NA|132aa|down_2|NZ_HG917868.1_2303211_2303607_-	pfam06935, DUF1284, Protein of unknown function (DUF1284)	NA|181aa|down_3|NZ_HG917868.1_2303593_2304136_-	COG1268, BioY, Uncharacterized conserved protein [General function prediction only]	NA|791aa|down_4|NZ_HG917868.1_2304514_2306887_-	PRK00629, pheT, phenylalanyl-tRNA synthetase subunit beta; Reviewed	NA|341aa|down_5|NZ_HG917868.1_2306906_2307929_-	PRK00488, pheS, phenylalanyl-tRNA synthetase subunit alpha; Validated	NA|265aa|down_6|NZ_HG917868.1_2308311_2309106_-	COG0566, SpoU, rRNA methylases [Translation, ribosomal structure and biogenesis]	NA|227aa|down_7|NZ_HG917868.1_2309107_2309788_-	COG0569, TrkA, K+ transport systems, NAD-binding component [Inorganic ion transport and metabolism]	NA|456aa|down_8|NZ_HG917868.1_2309796_2311164_-	TIGR00933, Trk_system_potassium_uptake_protein_trkH	NA|119aa|down_9|NZ_HG917868.1_2311299_2311656_-	PRK05185, rplT, 50S ribosomal protein L20; Provisional
GCF_000577895.1_M2_40	NZ_HG917869	Clostridium bornimense strain M2/40 chromosome M2/40_rep2	1	90443-91519	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no		DEDDh,WYL,csa3	Orphan	GTTGAACAATAACATGTGATGTTTTTAAAT,GTTGAACAATAACATGTGATGTTTTTAAAT,GTTGAACAATAACATGTGATGTTTTTAAAT	30,30,30	0	0	NA	NA	III-B:III-B:III-B	15,16,16	16	Orphan	cas14j,WYL,cas6,cas8b2,cas7,cas5,cas3,cas4,cas1,cas2,csa3,DinG,DEDDh	NA|150aa|up_6|NZ_HG917869.1_83679_84129_+,NA|56aa|up_3|NZ_HG917869.1_86182_86350_-,NA	NA|375aa|up_9|NZ_HG917869.1_79935_81060_+	pfam14286, DHHW, DHHW protein	NA|197aa|up_8|NZ_HG917869.1_81173_81764_-	pfam14270, DUF4358, Domain of unknown function (DUF4358)	NA|466aa|up_7|NZ_HG917869.1_81946_83344_+	COG1696, DltB, Predicted membrane protein involved in D-alanine export [Cell envelope biogenesis, outer membrane]	NA|150aa|up_6|NZ_HG917869.1_83679_84129_+	NA	NA|257aa|up_5|NZ_HG917869.1_84144_84915_+	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|290aa|up_4|NZ_HG917869.1_85135_86005_+	cd07197, nitrilase, Nitrilase superfamily, including nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes	NA|56aa|up_3|NZ_HG917869.1_86182_86350_-	NA	NA|132aa|up_2|NZ_HG917869.1_86671_87067_-	COG1959, COG1959, Predicted transcriptional regulator [Transcription]	NA|147aa|up_1|NZ_HG917869.1_87346_87787_+	PRK13289, PRK13289, NO-inducible flavohemoprotein	NA|662aa|up_0|NZ_HG917869.1_88216_90202_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|217aa|down_0|NZ_HG917869.1_92210_92861_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|253aa|down_1|NZ_HG917869.1_92892_93651_+	pfam05857, TraX, TraX protein	NA|260aa|down_2|NZ_HG917869.1_93690_94470_+	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|345aa|down_3|NZ_HG917869.1_94877_95912_+	COG1613, Sbp, ABC-type sulfate transport system, periplasmic component [Inorganic ion transport and metabolism]	NA|277aa|down_4|NZ_HG917869.1_95924_96755_+	TIGR02139, permease_CysT, sulfate ABC transporter, permease protein CysT	NA|274aa|down_5|NZ_HG917869.1_96765_97587_+	COG4208, CysW, ABC-type sulfate transport system, permease component [Inorganic ion transport and metabolism]	NA|354aa|down_6|NZ_HG917869.1_97603_98665_+	COG3842, PotA, ABC-type spermidine/putrescine transport systems, ATPase components [Amino acid transport and metabolism]	NA|564aa|down_7|NZ_HG917869.1_98674_100366_+	PRK06854, PRK06854, adenylyl-sulfate reductase subunit alpha	NA|105aa|down_8|NZ_HG917869.1_100349_100664_+	TIGR02060, adenylylsulfate_reductase_beta_subunit, adenosine phosphosulphate reductase, beta subunit	NA|300aa|down_9|NZ_HG917869.1_100681_101581_+	PRK05253, PRK05253, sulfate adenylyltransferase subunit CysD
