assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000389635.1_ASM38963v1	NC_021182	Clostridium pasteurianum BC1, complete sequence	1	88057-88345	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	c2c10_CAS-V-U3	c2c10_CAS-V-U3,RT,WYL,csa3,cas14j,PD-DExK,DinG,c2c9_V-U4,cas3,DEDDh	Type V-U3	GCTTAAACATAACCTATGTTAATAGTTAAC,GCTTAAACATAACCTATGTTAATAGTTAAC,TAAACATAACCTATGTTAATAGTTAAC	30,30,27	0	0	NA	NA	NA:NA:NA	4,4,4	4	TypeV-U3	c2c10_CAS-V-U3,RT,WYL,csa3,cas14j,PD-DExK,DinG,c2c9_V-U4,cas3,DEDDh	NA|139aa|up_8|NC_021182.1_82418_82835_+,NA|46aa|up_7|NC_021182.1_82818_82956_+,NA|105aa|up_6|NC_021182.1_83491_83806_+,NA|51aa|up_5|NC_021182.1_83816_83969_+,NA|204aa|up_3|NC_021182.1_85064_85676_+,NA|66aa|up_2|NC_021182.1_85688_85886_+,NA|60aa|down_0|NC_021182.1_88632_88812_+,NA|344aa|down_1|NC_021182.1_88913_89945_+,NA|147aa|down_3|NC_021182.1_91031_91472_+,NA|161aa|down_5|NC_021182.1_93022_93505_+,NA|70aa|down_8|NC_021182.1_95333_95543_+	NA|736aa|up_9|NC_021182.1_79932_82140_+	TIGR01613, putative_primase, phage/plasmid primase, P4 family, C-terminal domain	NA|139aa|up_8|NC_021182.1_82418_82835_+	NA	NA|46aa|up_7|NC_021182.1_82818_82956_+	NA	NA|105aa|up_6|NC_021182.1_83491_83806_+	NA	NA|51aa|up_5|NC_021182.1_83816_83969_+	NA	NA|139aa|up_4|NC_021182.1_84639_85056_+	pfam13518, HTH_28, Helix-turn-helix domain	NA|204aa|up_3|NC_021182.1_85064_85676_+	NA	NA|66aa|up_2|NC_021182.1_85688_85886_+	NA	NA|238aa|up_1|NC_021182.1_85938_86652_+	COG0568, RpoD, DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) [Transcription]	c2c10_CAS-V-U3|380aa|up_0|NC_021182.1_86648_87788_-	pfam07282, OrfB_Zn_ribbon, Putative transposase DNA-binding domain	NA|60aa|down_0|NC_021182.1_88632_88812_+	NA	NA|344aa|down_1|NC_021182.1_88913_89945_+	NA	NA|157aa|down_2|NC_021182.1_90554_91025_+	COG3600, GepA, Uncharacterized phage-associated protein [Function unknown]	NA|147aa|down_3|NC_021182.1_91031_91472_+	NA	NA|148aa|down_4|NC_021182.1_92134_92578_+	COG0590, CumB, Cytosine/adenosine deaminases [Nucleotide transport and metabolism / Translation, ribosomal structure and biogenesis]	NA|161aa|down_5|NC_021182.1_93022_93505_+	NA	NA|160aa|down_6|NC_021182.1_93652_94132_+	cd00077, HDc, Metal dependent phosphohydrolases with conserved 'HD' motif	NA|262aa|down_7|NC_021182.1_94335_95121_-	PRK12437, PRK12437, prolipoprotein diacylglyceryl transferase; Reviewed	NA|70aa|down_8|NC_021182.1_95333_95543_+	NA	NA|93aa|down_9|NC_021182.1_95565_95844_+	pfam07875, Coat_F, Coat F domain
GCF_000389635.1_ASM38963v1	NC_021182	Clostridium pasteurianum BC1, complete sequence	2	366595-366672	2	CRISPRCasFinder	no		c2c10_CAS-V-U3,RT,WYL,csa3,cas14j,PD-DExK,DinG,c2c9_V-U4,cas3,DEDDh	Orphan	GCACTCTGTGAAAGTGATTCACA	23	0	0	NA	NA	NA	1	1	Orphan	c2c10_CAS-V-U3,RT,WYL,csa3,cas14j,PD-DExK,DinG,c2c9_V-U4,cas3,DEDDh	NA|105aa|up_5|NC_021182.1_361126_361441_+,NA|124aa|up_3|NC_021182.1_362465_362837_+,NA|254aa|up_1|NC_021182.1_365029_365791_+,NA	NA|781aa|up_9|NC_021182.1_353896_356239_+	cd02850, E_set_Cellulase_N, N-terminal Early set domain associated with the catalytic domain of cellulase	NA|359aa|up_8|NC_021182.1_356317_357394_+	COG3936, COG3936, Protein involved in polysaccharide intercellular adhesin (PIA) synthesis/biofilm formation [Carbohydrate transport and metabolism]	NA|407aa|up_7|NC_021182.1_357618_358839_+	cd08023, GH16_laminarinase_like, Laminarinase, member of the glycosyl hydrolase family 16	NA|656aa|up_6|NC_021182.1_359109_361077_+	pfam06605, Prophage_tail, Prophage endopeptidase tail	NA|105aa|up_5|NC_021182.1_361126_361441_+	NA	NA|320aa|up_4|NC_021182.1_361446_362406_+	cd06525, GH25_Lyc-like, Lyc muramidase is an autolytic lysozyme (autolysin) from Clostridium acetobutylicum encoded by the lyc gene	NA|124aa|up_3|NC_021182.1_362465_362837_+	NA	NA|507aa|up_2|NC_021182.1_363118_364639_+	cd00737, lyz_endolysin_autolysin, endolysin and autolysin	NA|254aa|up_1|NC_021182.1_365029_365791_+	NA	NA|170aa|up_0|NC_021182.1_365841_366351_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|79aa|down_0|NC_021182.1_367041_367278_+	COG4466, Veg, Uncharacterized protein conserved in bacteria [Function unknown]	NA|521aa|down_1|NC_021182.1_367718_369281_+	pfam12673, DUF3794, Domain of unknown function (DUF3794)	NA|281aa|down_2|NC_021182.1_369480_370323_+	PRK00128, ipk, 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase; Provisional	NA|141aa|down_3|NC_021182.1_370955_371378_-	cd00118, LysM, Lysin Motif is a small domain involved in binding peptidoglycan	NA|234aa|down_4|NC_021182.1_371658_372360_+	pfam09551, Spore_II_R, Stage II sporulation protein R (spore_II_R)	NA|385aa|down_5|NC_021182.1_372431_373586_-	cd17507, GT28_Beta-DGS-like, beta-diglucosyldiacylglycerol synthase and similar proteins	NA|458aa|down_6|NC_021182.1_373879_375253_+	TIGR02889, Sporulation_protein_YpeB, germination protein YpeB	NA|344aa|down_7|NC_021182.1_375437_376469_+	PRK01966, ddl, D-alanine--D-alanine ligase	NA|139aa|down_8|NC_021182.1_376503_376920_+	pfam09148, DUF1934, Domain of unknown function (DUF1934)	NA|533aa|down_9|NC_021182.1_377122_378721_-	PRK05380, pyrG, CTP synthetase; Validated
GCF_000389635.1_ASM38963v1	NC_021182	Clostridium pasteurianum BC1, complete sequence	3	1220463-1220573	3	CRISPRCasFinder	no		c2c10_CAS-V-U3,RT,WYL,csa3,cas14j,PD-DExK,DinG,c2c9_V-U4,cas3,DEDDh	Orphan	TAATTCAACTACAAACAGTAATAGTAA	27	0	0	NA	NA	NA	1	1	Orphan	c2c10_CAS-V-U3,RT,WYL,csa3,cas14j,PD-DExK,DinG,c2c9_V-U4,cas3,DEDDh	NA|320aa|up_7|NC_021182.1_1209879_1210839_-,NA|455aa|up_4|NC_021182.1_1214522_1215887_+,NA|64aa|down_2|NC_021182.1_1231423_1231615_-	NA|394aa|up_9|NC_021182.1_1206781_1207963_-	COG0053, MMT1, Predicted Co/Zn/Cd cation transporters [Inorganic ion transport and metabolism]	NA|434aa|up_8|NC_021182.1_1208241_1209543_+	PRK02813, PRK02813, putative aminopeptidase 2; Provisional	NA|320aa|up_7|NC_021182.1_1209879_1210839_-	NA	NA|400aa|up_6|NC_021182.1_1211541_1212741_+	PRK06635, PRK06635, aspartate kinase; Reviewed	NA|434aa|up_5|NC_021182.1_1212894_1214196_+	cd06828, PLPDE_III_DapDC, Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme Diaminopimelate Decarboxylase	NA|455aa|up_4|NC_021182.1_1214522_1215887_+	NA	NA|258aa|up_3|NC_021182.1_1215933_1216707_+	COG0300, DltE, Short-chain dehydrogenases of various substrate specificities [General function prediction only]	NA|205aa|up_2|NC_021182.1_1217097_1217712_+	pfam00455, DeoRC, DeoR C terminal sensor domain	NA|252aa|up_1|NC_021182.1_1217745_1218501_+	COG1349, GlpR, Transcriptional regulators of sugar metabolism [Transcription / Carbohydrate transport and metabolism]	NA|271aa|up_0|NC_021182.1_1218533_1219346_+	cd07516, HAD_Pase, phosphatase, similar to Escherichia coli Cof and Thermotoga maritima TM0651; belongs to the haloacid dehalogenase-like superfamily	NA|297aa|down_0|NC_021182.1_1221609_1222500_+	cd17313, MFS_SLC45_SUC, Solute carrier family 45 and similar sugar transporters of the Major Facilitator Superfamily of transporters	NA|2885aa|down_1|NC_021182.1_1222692_1231347_+	COG3459, COG3459, Cellobiose phosphorylase [Carbohydrate transport and metabolism]	NA|64aa|down_2|NC_021182.1_1231423_1231615_-	NA	NA|125aa|down_3|NC_021182.1_1231960_1232335_+	TIGR00320, Desulfoferrodoxin_homolog, desulfoferrodoxin	NA|335aa|down_4|NC_021182.1_1232542_1233547_+	PRK00927, PRK00927, tryptophanyl-tRNA synthetase; Reviewed	NA|553aa|down_5|NC_021182.1_1233739_1235398_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|639aa|down_6|NC_021182.1_1235498_1237415_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|269aa|down_7|NC_021182.1_1237674_1238481_-	PRK10513, PRK10513, sugar phosphate phosphatase; Provisional	NA|530aa|down_8|NC_021182.1_1239289_1240879_+	COG4108, PrfC, Peptide chain release factor RF-3 [Translation, ribosomal structure and biogenesis]	NA|323aa|down_9|NC_021182.1_1241833_1242802_+	PRK09283, PRK09283, porphobilinogen synthase
GCF_000389635.1_ASM38963v1	NC_021182	Clostridium pasteurianum BC1, complete sequence	4	2220738-2220865	4	CRISPRCasFinder	no		c2c10_CAS-V-U3,RT,WYL,csa3,cas14j,PD-DExK,DinG,c2c9_V-U4,cas3,DEDDh	Orphan	AAAAATGAAAAGTGTTATACAATTAAAATGAATTAA	36	0	0	NA	NA	NA	1	1	Orphan	c2c10_CAS-V-U3,RT,WYL,csa3,cas14j,PD-DExK,DinG,c2c9_V-U4,cas3,DEDDh	NA,NA	NA|148aa|up_9|NC_021182.1_2212371_2212815_+	pfam09424, YqeY, Yqey-like protein	NA|94aa|up_8|NC_021182.1_2212932_2213214_+	TIGR02856, Uncharacterized_protein_YqfC, sporulation protein YqfC	NA|381aa|up_7|NC_021182.1_2213215_2214358_+	pfam06898, YqfD, Putative stage IV sporulation protein YqfD	NA|685aa|up_6|NC_021182.1_2214408_2216463_+	COG1480, COG1480, Predicted membrane-associated HD superfamily hydrolase [General function prediction only]	NA|168aa|up_5|NC_021182.1_2216477_2216981_+	PRK00016, PRK00016, metal-binding heat shock protein; Provisional	NA|233aa|up_4|NC_021182.1_2217004_2217703_+	cd14266, UDPK_IM_PAP2_like, Integral membrane undecaprenol kinase domain co-occurring with type 2 phosphatidic acid phosphatase-like domains	NA|132aa|up_3|NC_021182.1_2217729_2218125_+	PRK05578, PRK05578, cytidine deaminase; Validated	NA|298aa|up_2|NC_021182.1_2218345_2219239_+	PRK00089, era, GTPase Era; Reviewed	NA|249aa|up_1|NC_021182.1_2219285_2220032_+	PRK00085, recO, DNA repair protein RecO; Reviewed	NA|201aa|up_0|NC_021182.1_2220033_2220636_+	pfam14242, DUF4342, Domain of unknown function (DUF4342)	NA|211aa|down_0|NC_021182.1_2220910_2221543_+	cd04617, CBS_pair_CcpN, Two tandem repeats of the cystathionine beta-synthase (CBS pair) domains of CcpN repressor	NA|877aa|down_1|NC_021182.1_2221617_2224248_+	PRK09279, PRK09279, pyruvate phosphate dikinase; Provisional	NA|351aa|down_2|NC_021182.1_2224359_2225412_-	TIGR02906, Spore_coat_protein_S, spore coat protein, CotS family	NA|343aa|down_3|NC_021182.1_2225727_2226756_+	PRK01286, PRK01286, deoxyguanosinetriphosphate triphosphohydrolase-like protein; Provisional	NA|591aa|down_4|NC_021182.1_2227072_2228845_+	PRK05667, dnaG, DNA primase; Validated	NA|374aa|down_5|NC_021182.1_2228866_2229988_+	PRK09210, PRK09210, RNA polymerase sigma factor RpoD; Validated	NA|297aa|down_6|NC_021182.1_2230092_2230983_+	pfam10882, bPH_5, Bacterial PH domain	NA|230aa|down_7|NC_021182.1_2231207_2231897_+	pfam12847, Methyltransf_18, Methyltransferase domain	NA|272aa|down_8|NC_021182.1_2231887_2232703_+	pfam01784, NIF3, NIF3 (NGG1p interacting factor 3)	NA|245aa|down_9|NC_021182.1_2232817_2233552_+	COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
GCF_000389635.1_ASM38963v1	NC_021182	Clostridium pasteurianum BC1, complete sequence	5	4416526-4416624	5	CRISPRCasFinder	no		c2c10_CAS-V-U3,RT,WYL,csa3,cas14j,PD-DExK,DinG,c2c9_V-U4,cas3,DEDDh	Orphan	TATAACAAAATCGTGTAGATAAAATAAAGATTTTGTT	37	0	0	NA	NA	NA	1	1	Orphan	c2c10_CAS-V-U3,RT,WYL,csa3,cas14j,PD-DExK,DinG,c2c9_V-U4,cas3,DEDDh	NA,NA	NA|359aa|up_9|NC_021182.1_4405319_4406396_-	PRK00045, hemA, glutamyl-tRNA reductase; Reviewed	NA|611aa|up_8|NC_021182.1_4406795_4408628_-	COG2895, CysN, GTPases - Sulfate adenylate transferase subunit 1 [Inorganic ion transport and metabolism]	NA|267aa|up_7|NC_021182.1_4408643_4409444_-	PRK05253, PRK05253, sulfate adenylyltransferase subunit CysD	NA|251aa|up_6|NC_021182.1_4409566_4410319_-	COG0600, TauC, ABC-type nitrate/sulfonate/bicarbonate transport system, permease component [Inorganic ion transport and metabolism]	NA|280aa|up_5|NC_021182.1_4410275_4411115_-	COG1116, TauB, ABC-type nitrate/sulfonate/bicarbonate transport system, ATPase component [Inorganic ion transport and metabolism]	NA|352aa|up_4|NC_021182.1_4411333_4412389_-	TIGR01728, Putative_aliphatic_sulfonates-binding_protein, ABC transporter, substrate-binding protein, aliphatic sulfonates family	NA|105aa|up_3|NC_021182.1_4412513_4412828_-	COG1146, COG1146, Ferredoxin [Energy production and conversion]	NA|562aa|up_2|NC_021182.1_4412811_4414497_-	PRK06854, PRK06854, adenylyl-sulfate reductase subunit alpha	NA|197aa|up_1|NC_021182.1_4414573_4415164_-	PRK03846, PRK03846, adenylylsulfate kinase; Provisional	NA|409aa|up_0|NC_021182.1_4415243_4416470_-	COG2873, MET17, O-acetylhomoserine sulfhydrylase [Amino acid transport and metabolism]	NA|768aa|down_0|NC_021182.1_4416646_4418950_-	COG0155, CysI, Sulfite reductase, beta subunit (hemoprotein) [Inorganic ion transport and metabolism]	NA|136aa|down_1|NC_021182.1_4418981_4419389_-	cd08070, MPN_like, Mpr1p, Pad1p N-terminal (MPN) domains with catalytic isopeptidase activity (metal-binding)	NA|277aa|down_2|NC_021182.1_4419451_4420282_-	cd00757, ThiF_MoeB_HesA_family, ThiF_MoeB_HesA	NA|72aa|down_3|NC_021182.1_4420271_4420487_-	cd00565, Ubl_ThiS, ubiquitin-like (Ubl) domain found in sulfur carrier protein ThiS	NA|401aa|down_4|NC_021182.1_4420568_4421771_-	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]	NA|218aa|down_5|NC_021182.1_4422010_4422664_-	COG1648, CysG, Siroheme synthase (precorrin-2 oxidase/ferrochelatase domain) [Coenzyme metabolism]	NA|144aa|down_6|NC_021182.1_4422982_4423414_+	cd02598, HAD_BPGM, beta-phosphoglucomutase, similar to Lactococcus lactis beta-phosphoglucomutase (beta-PGM)	NA|339aa|down_7|NC_021182.1_4423516_4424533_-	PRK05589, PRK05589, peptide chain release factor 2; Provisional	NA|836aa|down_8|NC_021182.1_4424640_4427148_-	PRK12904, PRK12904, preprotein translocase subunit SecA; Reviewed	NA|179aa|down_9|NC_021182.1_4427467_4428004_-	COG1544, COG1544, Ribosome-associated protein Y (PSrp-1) [Translation, ribosomal structure and biogenesis]
GCF_000389635.1_ASM38963v1	NC_021182	Clostridium pasteurianum BC1, complete sequence	6	4648580-4648672	6	CRISPRCasFinder	no	cas3	c2c10_CAS-V-U3,RT,WYL,csa3,cas14j,PD-DExK,DinG,c2c9_V-U4,cas3,DEDDh	Unclear	TTTTCCATTTCTAATTACTTTACATG	26	0	0	NA	NA	NA	1	1	Unclear	c2c10_CAS-V-U3,RT,WYL,csa3,cas14j,PD-DExK,DinG,c2c9_V-U4,cas3,DEDDh	NA,NA|107aa|down_9|NC_021182.1_4661599_4661920_+	NA|135aa|up_9|NC_021182.1_4637336_4637741_-	TIGR02893, Spore_protein_YabQ, spore cortex biosynthesis protein YabQ	NA|94aa|up_8|NC_021182.1_4637746_4638028_-	TIGR02892, conserved_hypothetical_protein, sporulation protein YabP	NA|86aa|up_7|NC_021182.1_4638115_4638373_-	COG1188, COG1188, Ribosome-associated heat shock protein implicated in the recycling of the 50S subunit (S4 paralog) [Translation, ribosomal structure and biogenesis]	NA|125aa|up_6|NC_021182.1_4638470_4638845_-	cd13831, HU, histone-like DNA-binding protein HU	NA|484aa|up_5|NC_021182.1_4638848_4640300_-	COG3956, COG3956, Protein containing tetrapyrrole methyltransferase domain and MazG-like (predicted pyrophosphatase) domain [General function prediction only]	NA|514aa|up_4|NC_021182.1_4640328_4641870_-	cd13124, MATE_SpoVB_like, Stage V sporulation protein B, also known as Stage III sporulation protein F, and related proteins	NA|185aa|up_3|NC_021182.1_4642053_4642608_-	TIGR02851, stage_V_sporulation_protein_T, stage V sporulation protein T	NA|336aa|up_2|NC_021182.1_4642932_4643940_-	PRK00059, prsA, peptidylprolyl isomerase; Provisional	cas3|1173aa|up_1|NC_021182.1_4644020_4647539_-	COG1197, Mfd, Transcription-repair coupling factor (superfamily II helicase) [DNA replication, recombination, and repair / Transcription]	NA|188aa|up_0|NC_021182.1_4647557_4648121_-	pfam01195, Pept_tRNA_hydro, Peptidyl-tRNA hydrolase	NA|474aa|down_0|NC_021182.1_4649845_4651267_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|229aa|down_1|NC_021182.1_4651270_4651957_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|325aa|down_2|NC_021182.1_4652131_4653106_-	PRK01259, PRK01259, ribose-phosphate diphosphokinase	NA|457aa|down_3|NC_021182.1_4653128_4654499_-	PRK14354, glmU, bifunctional UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase GlmU	NA|96aa|down_4|NC_021182.1_4654656_4654944_-	PRK13259, PRK13259, septation regulator SpoVG	NA|272aa|down_5|NC_021182.1_4655048_4655864_-	PRK09213, PRK09213, pur operon repressor; Provisional	NA|459aa|down_6|NC_021182.1_4656130_4657507_+	PRK00421, murC, UDP-N-acetylmuramate--L-alanine ligase; Provisional	NA|579aa|down_7|NC_021182.1_4657908_4659645_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|595aa|down_8|NC_021182.1_4659644_4661429_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|107aa|down_9|NC_021182.1_4661599_4661920_+	NA
