assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002003385.1_ASM200338v1	NZ_CP016092	Clostridium saccharobutylicum strain NCP 195, complete genome	1	148425-148574	1	CRISPRCasFinder	no	cas3	csx1,cas3,cas14j,DEDDh,WYL,c2c9_V-U4,csa3,RT,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,DinG	Unclear	AAATATCGCGGGGTGGAGCAGTTGGTAGCTCGTCGGGCTCATAACCCGAAGGTCG	55	0	0	NA	NA	NA	1	1	Unclear	csx1,cas3,cas14j,DEDDh,WYL,c2c9_V-U4,csa3,RT,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,DinG	NA,NA	NA|174aa|up_9|NZ_CP016092.1_140493_141015_+	TIGR01843, Hemolysin_secretion_protein_D_plasmid, type I secretion membrane fusion protein, HlyD family	NA|184aa|up_8|NZ_CP016092.1_141452_142004_+	TIGR02851, stage_V_sporulation_protein_T, stage V sporulation protein T	NA|513aa|up_7|NZ_CP016092.1_142113_143652_+	cd13124, MATE_SpoVB_like, Stage V sporulation protein B, also known as Stage III sporulation protein F, and related proteins	NA|484aa|up_6|NZ_CP016092.1_143668_145120_+	COG3956, COG3956, Protein containing tetrapyrrole methyltransferase domain and MazG-like (predicted pyrophosphatase) domain [General function prediction only]	NA|92aa|up_5|NZ_CP016092.1_145338_145614_+	cd13831, HU, histone-like DNA-binding protein HU	NA|87aa|up_4|NZ_CP016092.1_145685_145946_+	COG1188, COG1188, Ribosome-associated heat shock protein implicated in the recycling of the 50S subunit (S4 paralog) [Translation, ribosomal structure and biogenesis]	NA|98aa|up_3|NZ_CP016092.1_146192_146486_+	TIGR02892, conserved_hypothetical_protein, sporulation protein YabP	NA|131aa|up_2|NZ_CP016092.1_146495_146888_+	TIGR02893, Spore_protein_YabQ, spore cortex biosynthesis protein YabQ	NA|98aa|up_1|NZ_CP016092.1_146967_147261_+	pfam04977, DivIC, Septum formation initiator	NA|135aa|up_0|NZ_CP016092.1_147417_147822_+	PRK05807, PRK05807, RNA-binding protein S1	NA|799aa|down_0|NZ_CP016092.1_149457_151854_+	TIGR02865, Stage_II_sporulation_protein_E, stage II sporulation protein E	NA|470aa|down_1|NZ_CP016092.1_152062_153472_+	cd01992, PP-ATPase, N-terminal domain of predicted ATPase of the PP-loop faimly implicated in cell cycle control [Cell division and chromosome partitioning]	NA|180aa|down_2|NZ_CP016092.1_153473_154013_+	COG0634, Hpt, Hypoxanthine-guanine phosphoribosyltransferase [Nucleotide transport and metabolism]	NA|603aa|down_3|NZ_CP016092.1_154084_155893_+	TIGR01241, ATP-dependent_zinc_metalloprotease_FtsH, ATP-dependent metalloprotease FtsH	NA|557aa|down_4|NZ_CP016092.1_156370_158041_+	pfam01268, FTHFS, Formate--tetrahydrofolate ligase	NA|272aa|down_5|NZ_CP016092.1_158170_158986_+	PRK13318, PRK13318, type III pantothenate kinase	NA|322aa|down_6|NZ_CP016092.1_158960_159926_+	TIGR00737, Probable_tRNA-dihydrouridine_synthase, putative TIM-barrel protein, nifR3 family	NA|161aa|down_7|NZ_CP016092.1_160376_160859_+	PRK00226, greA, transcription elongation factor GreA; Reviewed	NA|502aa|down_8|NZ_CP016092.1_160872_162378_+	PRK00484, lysS, lysyl-tRNA synthetase; Reviewed	NA|464aa|down_9|NZ_CP016092.1_163758_165150_+	PRK04173, PRK04173, glycyl-tRNA synthetase; Provisional
GCF_002003385.1_ASM200338v1	NZ_CP016092	Clostridium saccharobutylicum strain NCP 195, complete genome	2	1658957-1659039	2	CRISPRCasFinder	no	DEDDh	csx1,cas3,cas14j,DEDDh,WYL,c2c9_V-U4,csa3,RT,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,DinG	Unclear	ATGGATATATGAAAACAGGTTGG	23	0	0	NA	NA	NA	1	1	Orphan	csx1,cas3,cas14j,DEDDh,WYL,c2c9_V-U4,csa3,RT,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,DinG	NA|109aa|up_8|NZ_CP016092.1_1647581_1647908_-,NA|378aa|up_0|NZ_CP016092.1_1657284_1658418_+,NA|104aa|down_0|NZ_CP016092.1_1659658_1659970_+	NA|525aa|up_9|NZ_CP016092.1_1645675_1647250_+	cd13609, PBP2_Opu_like_1, Substrate-binding domain of putative ABC-type osmoprotectant uptake system; the type 2 periplasmic-binding protein fold	NA|109aa|up_8|NZ_CP016092.1_1647581_1647908_-	NA	NA|181aa|up_7|NZ_CP016092.1_1647944_1648487_-	cd01046, Rubrerythrin_like, rubrerythrin-like, diiron-binding domain	NA|180aa|up_6|NZ_CP016092.1_1648756_1649296_+	cd02139, nitroreductase, nitroreductase family protein	NA|218aa|up_5|NZ_CP016092.1_1649464_1650118_+	cd04302, HAD_5NT, haloacid dehalogenase (HAD)-like 5'-nucleotidases similar to the Pseudomonas aeruginosa PA0065	NA|358aa|up_4|NZ_CP016092.1_1650335_1651409_+	pfam01841, Transglut_core, Transglutaminase-like superfamily	NA|783aa|up_3|NZ_CP016092.1_1651837_1654186_+	PRK02471, PRK02471, bifunctional glutamate--cysteine ligase GshA/glutathione synthetase GshB	NA|266aa|up_2|NZ_CP016092.1_1654721_1655519_+	COG0790, COG0790, FOG: TPR repeat, SEL1 subfamily [General function prediction only]	NA|392aa|up_1|NZ_CP016092.1_1655776_1656952_-	pfam01270, Glyco_hydro_8, Glycosyl hydrolases family 8	NA|378aa|up_0|NZ_CP016092.1_1657284_1658418_+	NA	NA|104aa|down_0|NZ_CP016092.1_1659658_1659970_+	NA	NA|354aa|down_1|NZ_CP016092.1_1660108_1661170_+	cd08174, G1PDH-like, Glycerol-1-phosphate dehydrogenase-like	NA|413aa|down_2|NZ_CP016092.1_1661334_1662573_+	pfam00872, Transposase_mut, Transposase, Mutator family	NA|625aa|down_3|NZ_CP016092.1_1662775_1664650_-	TIGR01702, Carbon_monoxide_dehydrogenase, carbon-monoxide dehydrogenase, catalytic subunit	NA|138aa|down_4|NZ_CP016092.1_1664766_1665180_-	COG1959, COG1959, Predicted transcriptional regulator [Transcription]	DEDDh|252aa|down_5|NZ_CP016092.1_1665641_1666397_-	cd06133, ERI-1_3'hExo_like, DEDDh 3'-5' exonuclease domain of Caenorhabditis elegans ERI-1, human 3' exonuclease, and similar proteins	NA|177aa|down_6|NZ_CP016092.1_1666676_1667207_+	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|468aa|down_7|NZ_CP016092.1_1668207_1669611_+	cd16913, YkuD_like, L,D-transpeptidases/carboxypeptidases similar to Bacillus YkuD	NA|462aa|down_8|NZ_CP016092.1_1670011_1671397_+	cd16913, YkuD_like, L,D-transpeptidases/carboxypeptidases similar to Bacillus YkuD	NA|279aa|down_9|NZ_CP016092.1_1671980_1672817_+	PRK01060, PRK01060, endonuclease IV; Provisional
GCF_002003385.1_ASM200338v1	NZ_CP016092	Clostridium saccharobutylicum strain NCP 195, complete genome	3	2252451-2253592	3,1,1	CRISPRCasFinder,CRT,PILER-CR	no	RT,cas6,cas8b1,cas7b,cas5,cas3	csx1,cas3,cas14j,DEDDh,WYL,c2c9_V-U4,csa3,RT,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,DinG	Type I-B	GTTGAAGATTAACATTATATGTTTTGAAAT,GTTGAAGATTAACATTATATGTTTTGAAAT,GTTGAAGATTAACATTATATGTTTTGAAAT	30,30,30	1	1	2253135-2253170	NZ_CP016092.1_3308494-3308529	NA:NA:NA	17,17,14	17	TypeI-B	csx1,cas3,cas14j,DEDDh,WYL,c2c9_V-U4,csa3,RT,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,DinG	NA|603aa|up_9|NZ_CP016092.1_2239881_2241690_+,NA|183aa|down_2|NZ_CP016092.1_2257503_2258052_+	NA|603aa|up_9|NZ_CP016092.1_2239881_2241690_+	NA	RT|502aa|up_8|NZ_CP016092.1_2241686_2243192_+	cd03487, RT_Bac_retron_II, RT_Bac_retron_II: Reverse transcriptases (RTs) in bacterial retrotransposons or retrons	NA|342aa|up_7|NZ_CP016092.1_2243779_2244805_+	pfam12395, DUF3658, Protein of unknown function	NA|176aa|up_6|NZ_CP016092.1_2245023_2245551_+	TIGR02954, RNA_polymerase_ECF-type_sigma_factor, RNA polymerase sigma-70 factor, TIGR02954 family	NA|331aa|up_5|NZ_CP016092.1_2245573_2246566_+	pfam13786, DUF4179, Domain of unknown function (DUF4179)	NA|262aa|up_4|NZ_CP016092.1_2246848_2247634_+	pfam00457, Glyco_hydro_11, Glycosyl hydrolases family 11	cas6|232aa|up_3|NZ_CP016092.1_2248002_2248698_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	cas8b1|593aa|up_2|NZ_CP016092.1_2248712_2250491_+	TIGR02591, cas_Csh1, CRISPR-associated protein Cas8b/Csh1, subtype I-B/HMARI	cas7b|328aa|up_1|NZ_CP016092.1_2250483_2251467_+	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas5|261aa|up_0|NZ_CP016092.1_2251469_2252252_+	TIGR02592, hypothetical_protein_CTC_01466, CRISPR-associated protein Cas5, subtype I-B/HMARI	cas3|859aa|down_0|NZ_CP016092.1_2253644_2256221_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|279aa|down_1|NZ_CP016092.1_2256381_2257218_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|183aa|down_2|NZ_CP016092.1_2257503_2258052_+	NA	NA|195aa|down_3|NZ_CP016092.1_2258144_2258729_+	COG0655, WrbA, Multimeric flavodoxin WrbA [General function prediction only]	NA|187aa|down_4|NZ_CP016092.1_2260104_2260665_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|146aa|down_5|NZ_CP016092.1_2261000_2261438_+	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|454aa|down_6|NZ_CP016092.1_2261522_2262884_+	cd13143, MATE_MepA_like, Subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins similar to Streptococcus aureus MepA	NA|269aa|down_7|NZ_CP016092.1_2263459_2264266_+	smart00138, MeTrc, Methyltransferase, chemotaxis proteins	NA|684aa|down_8|NZ_CP016092.1_2264285_2266337_+	COG0643, CheA, Chemotaxis protein histidine kinase and related kinases [Cell motility and secretion / Signal transduction mechanisms]	NA|349aa|down_9|NZ_CP016092.1_2266371_2267418_+	PRK00742, PRK00742, chemotaxis-specific protein-glutamate methyltransferase CheB
GCF_002003385.1_ASM200338v1	NZ_CP016092	Clostridium saccharobutylicum strain NCP 195, complete genome	4	2818091-2820275	2,2,4	CRT,PILER-CR,CRISPRCasFinder	no	cas2,cas1,cas4,cas3,cas5,cas7b,cas8b1,cas6,c2c9_V-U4	csx1,cas3,cas14j,DEDDh,WYL,c2c9_V-U4,csa3,RT,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,DinG	Type I-B	NNNNNNATTTAAATACATCTCATGTTAATGTTAATC,ATTTAAATACATCTCATGTTAATGTTAATC,ATTTAAATACATCTCATGTTAATGTTAATC	36,30,30	0	0	NA	NA	II-B:II-B:II-B	33,31,32	33	TypeI-B	csx1,cas3,cas14j,DEDDh,WYL,c2c9_V-U4,csa3,RT,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,DinG	NA,NA	NA|313aa|up_9|NZ_CP016092.1_2804995_2805934_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|268aa|up_8|NZ_CP016092.1_2806003_2806807_+	pfam13556, HTH_30, PucR C-terminal helix-turn-helix domain	NA|333aa|up_7|NZ_CP016092.1_2806823_2807822_+	pfam11728, ArAE_1_C, Putative aromatic acid exporter C-terminal domain	NA|530aa|up_6|NZ_CP016092.1_2808342_2809932_+	PRK09431, asnB, asparagine synthetase B; Provisional	NA|636aa|up_5|NZ_CP016092.1_2810129_2812037_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|203aa|up_4|NZ_CP016092.1_2812604_2813213_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|574aa|up_3|NZ_CP016092.1_2813822_2815544_-	COG0840, Tar, Methyl-accepting chemotaxis protein [Cell motility and secretion / Signal transduction mechanisms]	NA|172aa|up_2|NZ_CP016092.1_2815773_2816289_-	pfam08020, DUF1706, Protein of unknown function (DUF1706)	NA|174aa|up_1|NZ_CP016092.1_2816316_2816838_-	cd01014, nicotinamidase_related, Nicotinamidase_ related amidohydrolases	NA|243aa|up_0|NZ_CP016092.1_2816968_2817697_-	pfam07007, LprI, Lysozyme inhibitor LprI	cas2|97aa|down_0|NZ_CP016092.1_2820450_2820741_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|333aa|down_1|NZ_CP016092.1_2820740_2821739_-	cd09722, Cas1_I-B, CRISPR/Cas system-associated protein Cas1	cas4|165aa|down_2|NZ_CP016092.1_2821748_2822243_-	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas3|864aa|down_3|NZ_CP016092.1_2822425_2825017_-	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	cas5|267aa|down_4|NZ_CP016092.1_2825095_2825896_-	TIGR02592, hypothetical_protein_CTC_01466, CRISPR-associated protein Cas5, subtype I-B/HMARI	cas7b|326aa|down_5|NZ_CP016092.1_2825899_2826877_-	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas8b1|593aa|down_6|NZ_CP016092.1_2826876_2828655_-	TIGR02591, cas_Csh1, CRISPR-associated protein Cas8b/Csh1, subtype I-B/HMARI	cas6|231aa|down_7|NZ_CP016092.1_2828670_2829363_-	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	NA|525aa|down_8|NZ_CP016092.1_2830492_2832067_-	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily	NA|134aa|down_9|NZ_CP016092.1_2832642_2833044_-	COG5015, COG5015, Uncharacterized conserved protein [Function unknown]
GCF_002003385.1_ASM200338v1	NZ_CP016092	Clostridium saccharobutylicum strain NCP 195, complete genome	5	4329882-4329982	5	CRISPRCasFinder	no		csx1,cas3,cas14j,DEDDh,WYL,c2c9_V-U4,csa3,RT,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,DinG	Orphan	TAGCACATTATAATCTTCTCACATT	25	0	0	NA	NA	NA	1	1	Orphan	csx1,cas3,cas14j,DEDDh,WYL,c2c9_V-U4,csa3,RT,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,DinG	NA|143aa|up_1|NZ_CP016092.1_4325884_4326313_+,NA|62aa|down_3|NZ_CP016092.1_4333527_4333713_-,NA|176aa|down_4|NZ_CP016092.1_4333721_4334249_-	NA|74aa|up_9|NZ_CP016092.1_4317777_4317999_-	COG1918, FeoA, Fe2+ transport system protein A [Inorganic ion transport and metabolism]	NA|70aa|up_8|NZ_CP016092.1_4318012_4318222_-	pfam04023, FeoA, FeoA domain	NA|355aa|up_7|NZ_CP016092.1_4318928_4319993_-	cd10944, CE4_SmPgdA_like, Catalytic NodB homology domain of Streptococcus mutans polysaccharide deacetylase PgdA, Bacillus subtilis YheN, and similar proteins	NA|338aa|up_6|NZ_CP016092.1_4320524_4321538_-	PRK12282, PRK12282, tryptophanyl-tRNA synthetase II; Reviewed	NA|81aa|up_5|NZ_CP016092.1_4322095_4322338_-	TIGR04540, CLB_0814_fam, conserved hypothetical protein	NA|144aa|up_4|NZ_CP016092.1_4322533_4322965_-	cd00293, USP_Like, Usp: Universal stress protein family	NA|349aa|up_3|NZ_CP016092.1_4323245_4324292_-	PRK14133, PRK14133, DNA polymerase IV; Provisional	NA|318aa|up_2|NZ_CP016092.1_4324528_4325482_-	pfam13170, DUF4003, Protein of unknown function (DUF4003)	NA|143aa|up_1|NZ_CP016092.1_4325884_4326313_+	NA	NA|851aa|up_0|NZ_CP016092.1_4327078_4329631_-	cd02089, P-type_ATPase_Ca_prok, prokaryotic P-type Ca(2+)-ATPase similar to Synechococcus elongatus sp	NA|549aa|down_0|NZ_CP016092.1_4330080_4331727_-	PRK14869, PRK14869, putative manganese-dependent inorganic diphosphatase	NA|260aa|down_1|NZ_CP016092.1_4331953_4332733_-	PRK12817, flgG, flagellar basal body rod protein FlgG; Reviewed	NA|259aa|down_2|NZ_CP016092.1_4332749_4333526_-	PRK12818, flgG, flagellar basal body rod protein FlgG; Reviewed	NA|62aa|down_3|NZ_CP016092.1_4333527_4333713_-	NA	NA|176aa|down_4|NZ_CP016092.1_4333721_4334249_-	NA	NA|243aa|down_5|NZ_CP016092.1_4334300_4335029_-	TIGR02479, RNA_polymerase_sigma_factor_WhiG, RNA polymerase sigma factor, FliA/WhiG family	NA|214aa|down_6|NZ_CP016092.1_4335067_4335709_-	COG5581, COG5581, c-di-GMP-binding protein [Signal transduction mechanisms]	NA|291aa|down_7|NZ_CP016092.1_4335720_4336593_-	cd02038, FlhG-like, MinD-like ATPase FlhG	NA|431aa|down_8|NZ_CP016092.1_4336586_4337879_-	PRK05703, flhF, flagellar biosynthesis protein FlhF	NA|689aa|down_9|NZ_CP016092.1_4337879_4339946_-	PRK06012, flhA, flagellar type III secretion system protein FlhA
GCF_002003385.1_ASM200338v1	NZ_CP016092	Clostridium saccharobutylicum strain NCP 195, complete genome	6	4536586-4536759	3	PILER-CR	no		csx1,cas3,cas14j,DEDDh,WYL,c2c9_V-U4,csa3,RT,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,DinG	Orphan	CCTATACTTGTTATACTGTTTGGGATTGTTATTGAC	36	0	0	NA	NA	NA	2	2	Orphan	csx1,cas3,cas14j,DEDDh,WYL,c2c9_V-U4,csa3,RT,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,DinG	NA,NA|260aa|down_6|NZ_CP016092.1_4546459_4547239_-	NA|272aa|up_9|NZ_CP016092.1_4522307_4523123_-	cd01948, EAL, EAL domain	NA|200aa|up_8|NZ_CP016092.1_4523329_4523929_+	pfam00589, Phage_integrase, Phage integrase family	NA|119aa|up_7|NZ_CP016092.1_4524045_4524402_-	cd08026, DUF326, Cysteine-rich 4 helical bundle widely conserved in bacteria	NA|138aa|up_6|NZ_CP016092.1_4524527_4524941_-	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|380aa|up_5|NZ_CP016092.1_4525156_4526296_-	cd01158, SCAD_SBCAD, Short chain acyl-CoA dehydrogenases and eukaryotic short/branched chain acyl-CoA dehydrogenases	NA|523aa|up_4|NZ_CP016092.1_4526382_4527951_-	COG4670, COG4670, Acyl CoA:acetate/3-ketoacid CoA transferase [Lipid metabolism]	NA|259aa|up_3|NZ_CP016092.1_4528061_4528838_-	PRK05809, PRK05809, short-chain-enoyl-CoA hydratase	NA|817aa|up_2|NZ_CP016092.1_4529759_4532210_+	PRK00390, leuS, leucyl-tRNA synthetase; Validated	NA|45aa|up_1|NZ_CP016092.1_4532306_4532441_+	COG4191, COG4191, Signal transduction histidine kinase regulating C4-dicarboxylate transport system [Signal transduction mechanisms]	NA|1213aa|up_0|NZ_CP016092.1_4532618_4536257_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|391aa|down_0|NZ_CP016092.1_4539038_4540211_-	COG3947, COG3947, Response regulator containing CheY-like receiver and SARP domains [Signal transduction mechanisms]	NA|511aa|down_1|NZ_CP016092.1_4540240_4541773_-	pfam16927, HisKA_7TM, N-terminal 7TM region of histidine kinase	NA|249aa|down_2|NZ_CP016092.1_4542236_4542983_-	pfam01564, Spermine_synth, Spermine/spermidine synthase domain	NA|111aa|down_3|NZ_CP016092.1_4543194_4543527_+	TIGR01911, conserved_protein, HesB-like selenoprotein	NA|445aa|down_4|NZ_CP016092.1_4543971_4545306_-	COG3864, COG3864, Uncharacterized protein conserved in bacteria [Function unknown]	NA|361aa|down_5|NZ_CP016092.1_4545359_4546442_-	pfam07728, AAA_5, AAA domain (dynein-related subfamily)	NA|260aa|down_6|NZ_CP016092.1_4546459_4547239_-	NA	NA|264aa|down_7|NZ_CP016092.1_4547329_4548121_-	COG2357, COG2357, PpGpp synthetase catalytic domain [General function prediction only]	NA|225aa|down_8|NZ_CP016092.1_4548287_4548962_+	pfam09986, DUF2225, Uncharacterized protein conserved in bacteria (DUF2225)	NA|314aa|down_9|NZ_CP016092.1_4548992_4549934_+	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)
GCF_002003385.1_ASM200338v1	NZ_CP016092	Clostridium saccharobutylicum strain NCP 195, complete genome	7	4537679-4538003	4	PILER-CR	no		csx1,cas3,cas14j,DEDDh,WYL,c2c9_V-U4,csa3,RT,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,DinG	Orphan	CGATTCCTATACTTGTTACACTACTTGGTATTGTTATTGACGTTAAGCC	49	2	4	4537866-4537885|4537866-4537885|4537935-4537954|4537935-4537954	NZ_CP016092.1_4537659-4537678|NZ_CP016092.1_4538004-4538023|NZ_CP016092.1_4537659-4537678|NZ_CP016092.1_4538004-4538023	NA	4	4	Orphan	csx1,cas3,cas14j,DEDDh,WYL,c2c9_V-U4,csa3,RT,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,DinG	NA,NA|260aa|down_6|NZ_CP016092.1_4546459_4547239_-	NA|200aa|up_9|NZ_CP016092.1_4523329_4523929_+	pfam00589, Phage_integrase, Phage integrase family	NA|119aa|up_8|NZ_CP016092.1_4524045_4524402_-	cd08026, DUF326, Cysteine-rich 4 helical bundle widely conserved in bacteria	NA|138aa|up_7|NZ_CP016092.1_4524527_4524941_-	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|380aa|up_6|NZ_CP016092.1_4525156_4526296_-	cd01158, SCAD_SBCAD, Short chain acyl-CoA dehydrogenases and eukaryotic short/branched chain acyl-CoA dehydrogenases	NA|523aa|up_5|NZ_CP016092.1_4526382_4527951_-	COG4670, COG4670, Acyl CoA:acetate/3-ketoacid CoA transferase [Lipid metabolism]	NA|259aa|up_4|NZ_CP016092.1_4528061_4528838_-	PRK05809, PRK05809, short-chain-enoyl-CoA hydratase	NA|817aa|up_3|NZ_CP016092.1_4529759_4532210_+	PRK00390, leuS, leucyl-tRNA synthetase; Validated	NA|45aa|up_2|NZ_CP016092.1_4532306_4532441_+	COG4191, COG4191, Signal transduction histidine kinase regulating C4-dicarboxylate transport system [Signal transduction mechanisms]	NA|1213aa|up_1|NZ_CP016092.1_4532618_4536257_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|192aa|up_0|NZ_CP016092.1_4536389_4536965_-	pfam13306, LRR_5, Leucine rich repeats (6 copies)	NA|391aa|down_0|NZ_CP016092.1_4539038_4540211_-	COG3947, COG3947, Response regulator containing CheY-like receiver and SARP domains [Signal transduction mechanisms]	NA|511aa|down_1|NZ_CP016092.1_4540240_4541773_-	pfam16927, HisKA_7TM, N-terminal 7TM region of histidine kinase	NA|249aa|down_2|NZ_CP016092.1_4542236_4542983_-	pfam01564, Spermine_synth, Spermine/spermidine synthase domain	NA|111aa|down_3|NZ_CP016092.1_4543194_4543527_+	TIGR01911, conserved_protein, HesB-like selenoprotein	NA|445aa|down_4|NZ_CP016092.1_4543971_4545306_-	COG3864, COG3864, Uncharacterized protein conserved in bacteria [Function unknown]	NA|361aa|down_5|NZ_CP016092.1_4545359_4546442_-	pfam07728, AAA_5, AAA domain (dynein-related subfamily)	NA|260aa|down_6|NZ_CP016092.1_4546459_4547239_-	NA	NA|264aa|down_7|NZ_CP016092.1_4547329_4548121_-	COG2357, COG2357, PpGpp synthetase catalytic domain [General function prediction only]	NA|225aa|down_8|NZ_CP016092.1_4548287_4548962_+	pfam09986, DUF2225, Uncharacterized protein conserved in bacteria (DUF2225)	NA|314aa|down_9|NZ_CP016092.1_4548992_4549934_+	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)
GCF_002003385.1_ASM200338v1	NZ_CP016092	Clostridium saccharobutylicum strain NCP 195, complete genome	8	4652297-4652389	6	CRISPRCasFinder	no		csx1,cas3,cas14j,DEDDh,WYL,c2c9_V-U4,csa3,RT,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,DinG	Orphan	TTTATTACATATATAGTTATATATGTA	27	0	0	NA	NA	NA	1	1	Orphan	csx1,cas3,cas14j,DEDDh,WYL,c2c9_V-U4,csa3,RT,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,DinG	NA|95aa|up_8|NZ_CP016092.1_4640398_4640683_+,NA|259aa|up_2|NZ_CP016092.1_4648506_4649283_+,NA|263aa|down_1|NZ_CP016092.1_4654283_4655072_-	NA|837aa|up_9|NZ_CP016092.1_4637599_4640110_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|95aa|up_8|NZ_CP016092.1_4640398_4640683_+	NA	NA|321aa|up_7|NZ_CP016092.1_4640938_4641901_+	pfam00688, TGFb_propeptide, TGF-beta propeptide	NA|532aa|up_6|NZ_CP016092.1_4642181_4643777_-	PRK00741, prfC, peptide chain release factor 3; Provisional	NA|489aa|up_5|NZ_CP016092.1_4644541_4646008_-	COG3391, COG3391, Uncharacterized conserved protein [Function unknown]	NA|90aa|up_4|NZ_CP016092.1_4646221_4646491_-	pfam11823, DUF3343, Protein of unknown function (DUF3343)	NA|558aa|up_3|NZ_CP016092.1_4646581_4648255_-	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	NA|259aa|up_2|NZ_CP016092.1_4648506_4649283_+	NA	NA|199aa|up_1|NZ_CP016092.1_4649557_4650154_+	pfam14879, DUF4489, Domain of unknown function (DUF4489)	NA|580aa|up_0|NZ_CP016092.1_4650538_4652278_-	COG1231, COG1231, Monoamine oxidase [Amino acid transport and metabolism]	NA|380aa|down_0|NZ_CP016092.1_4652595_4653735_-	TIGR01977, am_tr_V_EF2568, cysteine desulfurase family protein	NA|263aa|down_1|NZ_CP016092.1_4654283_4655072_-	NA	NA|751aa|down_2|NZ_CP016092.1_4655380_4657633_+	PRK01156, PRK01156, chromosome segregation protein; Provisional	NA|325aa|down_3|NZ_CP016092.1_4658070_4659045_-	cd08563, GDPD_TtGDE_like, Glycerophosphodiester phosphodiesterase domain of Thermoanaerobacter tengcongensis and similar proteins	NA|383aa|down_4|NZ_CP016092.1_4659241_4660390_+	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|349aa|down_5|NZ_CP016092.1_4660821_4661868_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|965aa|down_6|NZ_CP016092.1_4661964_4664859_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|543aa|down_7|NZ_CP016092.1_4665428_4667057_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|706aa|down_8|NZ_CP016092.1_4667437_4669555_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|136aa|down_9|NZ_CP016092.1_4669781_4670189_-	pfam01934, DUF86, Protein of unknown function DUF86
GCF_002003385.1_ASM200338v1	NZ_CP016092	Clostridium saccharobutylicum strain NCP 195, complete genome	9	4733677-4733772	7	CRISPRCasFinder	no		csx1,cas3,cas14j,DEDDh,WYL,c2c9_V-U4,csa3,RT,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,DinG	Orphan	TAGATCCTTCTTCAGGTGCTATGAAAACAGGAT	33	0	0	NA	NA	NA	1	1	Orphan	csx1,cas3,cas14j,DEDDh,WYL,c2c9_V-U4,csa3,RT,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,DinG	NA,NA|203aa|down_5|NZ_CP016092.1_4740718_4741327_-,NA|86aa|down_9|NZ_CP016092.1_4747384_4747642_-	NA|302aa|up_9|NZ_CP016092.1_4721527_4722433_-	TIGR01207, Glucose-1-phosphate_thymidylyltransferase_1, glucose-1-phosphate thymidylyltransferase, short form	NA|405aa|up_8|NZ_CP016092.1_4722533_4723748_-	pfam04230, PS_pyruv_trans, Polysaccharide pyruvyl transferase	NA|479aa|up_7|NZ_CP016092.1_4723947_4725384_-	cd13128, MATE_Wzx_like, Wzx, a subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins	NA|415aa|up_6|NZ_CP016092.1_4725534_4726779_-	pfam13425, O-antigen_lig, O-antigen ligase like membrane protein	NA|367aa|up_5|NZ_CP016092.1_4727018_4728119_-	cd03820, GT4_AmsD-like, amylovoran biosynthesis glycosyltransferase AmsD and similar proteins	NA|294aa|up_4|NZ_CP016092.1_4728209_4729091_-	COG1216, COG1216, Predicted glycosyltransferases [General function prediction only]	NA|413aa|up_3|NZ_CP016092.1_4729083_4730322_-	TIGR03087, stp1, sugar transferase, PEP-CTERM/EpsH1 system associated	NA|246aa|up_2|NZ_CP016092.1_4730366_4731104_-	COG1922, WecG, Teichoic acid biosynthesis proteins [Cell envelope biogenesis, outer membrane]	NA|349aa|up_1|NZ_CP016092.1_4731134_4732181_-	COG0836, {ManC}, Mannose-1-phosphate guanylyltransferase [Cell envelope biogenesis, outer membrane]	NA|214aa|up_0|NZ_CP016092.1_4732251_4732893_-	pfam02397, Bac_transf, Bacterial sugar transferase	NA|256aa|down_0|NZ_CP016092.1_4735009_4735777_-	COG4464, CapC, Capsular polysaccharide biosynthesis protein [Carbohydrate transport and metabolism / Cell envelope biogenesis, outer membrane]	NA|248aa|down_1|NZ_CP016092.1_4735794_4736538_-	cd05387, BY-kinase, bacterial tyrosine-kinase	NA|227aa|down_2|NZ_CP016092.1_4736550_4737231_-	COG3944, COG3944, Capsular polysaccharide biosynthesis protein [Cell envelope biogenesis, outer membrane]	NA|325aa|down_3|NZ_CP016092.1_4737844_4738819_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|364aa|down_4|NZ_CP016092.1_4739423_4740515_+	pfam01757, Acyl_transf_3, Acyltransferase family	NA|203aa|down_5|NZ_CP016092.1_4740718_4741327_-	NA	NA|396aa|down_6|NZ_CP016092.1_4741936_4743124_-	pfam07907, YibE_F, YibE/F-like protein	NA|553aa|down_7|NZ_CP016092.1_4743269_4744928_-	cd00839, MPP_PAPs, purple acid phosphatases of the metallophosphatase superfamily, metallophosphatase domain	NA|572aa|down_8|NZ_CP016092.1_4745331_4747047_-	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|86aa|down_9|NZ_CP016092.1_4747384_4747642_-	NA
