assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002003365.1_ASM200336v1	NZ_CP016091	Clostridium saccharobutylicum strain NCP 258, complete genome	1	160092-160236	1	CRISPRCasFinder	no	cas3	csx1,cas3,cas14j,RT,DEDDh,WYL,c2c9_V-U4,csa3,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,cas7,cas8b2,DinG	Unclear	AAATATCGCGGGGTGGAGCAGTTGGTAGCTCGTCGGGCTCATAACCCGAA	50	0	0	NA	NA	NA	1	1	Unclear	csx1,cas3,cas14j,RT,DEDDh,WYL,c2c9_V-U4,csa3,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,cas7,cas8b2,DinG	NA,NA	NA|174aa|up_9|NZ_CP016091.1_152160_152682_+	TIGR01843, Hemolysin_secretion_protein_D_plasmid, type I secretion membrane fusion protein, HlyD family	NA|184aa|up_8|NZ_CP016091.1_153119_153671_+	TIGR02851, stage_V_sporulation_protein_T, stage V sporulation protein T	NA|513aa|up_7|NZ_CP016091.1_153780_155319_+	cd13124, MATE_SpoVB_like, Stage V sporulation protein B, also known as Stage III sporulation protein F, and related proteins	NA|484aa|up_6|NZ_CP016091.1_155335_156787_+	COG3956, COG3956, Protein containing tetrapyrrole methyltransferase domain and MazG-like (predicted pyrophosphatase) domain [General function prediction only]	NA|92aa|up_5|NZ_CP016091.1_157005_157281_+	cd13831, HU, histone-like DNA-binding protein HU	NA|87aa|up_4|NZ_CP016091.1_157352_157613_+	COG1188, COG1188, Ribosome-associated heat shock protein implicated in the recycling of the 50S subunit (S4 paralog) [Translation, ribosomal structure and biogenesis]	NA|98aa|up_3|NZ_CP016091.1_157859_158153_+	TIGR02892, conserved_hypothetical_protein, sporulation protein YabP	NA|131aa|up_2|NZ_CP016091.1_158162_158555_+	TIGR02893, Spore_protein_YabQ, spore cortex biosynthesis protein YabQ	NA|98aa|up_1|NZ_CP016091.1_158634_158928_+	pfam04977, DivIC, Septum formation initiator	NA|135aa|up_0|NZ_CP016091.1_159084_159489_+	PRK05807, PRK05807, RNA-binding protein S1	NA|799aa|down_0|NZ_CP016091.1_161124_163521_+	TIGR02865, Stage_II_sporulation_protein_E, stage II sporulation protein E	NA|470aa|down_1|NZ_CP016091.1_163729_165139_+	cd01992, PP-ATPase, N-terminal domain of predicted ATPase of the PP-loop faimly implicated in cell cycle control [Cell division and chromosome partitioning]	NA|180aa|down_2|NZ_CP016091.1_165140_165680_+	COG0634, Hpt, Hypoxanthine-guanine phosphoribosyltransferase [Nucleotide transport and metabolism]	NA|603aa|down_3|NZ_CP016091.1_165751_167560_+	TIGR01241, ATP-dependent_zinc_metalloprotease_FtsH, ATP-dependent metalloprotease FtsH	NA|557aa|down_4|NZ_CP016091.1_168037_169708_+	pfam01268, FTHFS, Formate--tetrahydrofolate ligase	NA|272aa|down_5|NZ_CP016091.1_169837_170653_+	PRK13318, PRK13318, type III pantothenate kinase	NA|322aa|down_6|NZ_CP016091.1_170627_171593_+	TIGR00737, Probable_tRNA-dihydrouridine_synthase, putative TIM-barrel protein, nifR3 family	NA|161aa|down_7|NZ_CP016091.1_172043_172526_+	PRK00226, greA, transcription elongation factor GreA; Reviewed	NA|502aa|down_8|NZ_CP016091.1_172539_174045_+	PRK00484, lysS, lysyl-tRNA synthetase; Reviewed	NA|464aa|down_9|NZ_CP016091.1_175425_176817_+	PRK04173, PRK04173, glycyl-tRNA synthetase; Provisional
GCF_002003365.1_ASM200336v1	NZ_CP016091	Clostridium saccharobutylicum strain NCP 258, complete genome	2	1681317-1681399	2	CRISPRCasFinder	no	DEDDh	csx1,cas3,cas14j,RT,DEDDh,WYL,c2c9_V-U4,csa3,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,cas7,cas8b2,DinG	Unclear	ATGGATATATGAAAACAGGTTGG	23	0	0	NA	NA	NA	1	1	Orphan	csx1,cas3,cas14j,RT,DEDDh,WYL,c2c9_V-U4,csa3,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,cas7,cas8b2,DinG	NA|109aa|up_8|NZ_CP016091.1_1669941_1670268_-,NA|378aa|up_0|NZ_CP016091.1_1679644_1680778_+,NA|104aa|down_0|NZ_CP016091.1_1682018_1682330_+,NA|58aa|down_9|NZ_CP016091.1_1693846_1694020_+	NA|525aa|up_9|NZ_CP016091.1_1668035_1669610_+	cd13609, PBP2_Opu_like_1, Substrate-binding domain of putative ABC-type osmoprotectant uptake system; the type 2 periplasmic-binding protein fold	NA|109aa|up_8|NZ_CP016091.1_1669941_1670268_-	NA	NA|181aa|up_7|NZ_CP016091.1_1670304_1670847_-	cd01046, Rubrerythrin_like, rubrerythrin-like, diiron-binding domain	NA|180aa|up_6|NZ_CP016091.1_1671116_1671656_+	cd02139, nitroreductase, nitroreductase family protein	NA|218aa|up_5|NZ_CP016091.1_1671824_1672478_+	cd04302, HAD_5NT, haloacid dehalogenase (HAD)-like 5'-nucleotidases similar to the Pseudomonas aeruginosa PA0065	NA|358aa|up_4|NZ_CP016091.1_1672695_1673769_+	pfam01841, Transglut_core, Transglutaminase-like superfamily	NA|783aa|up_3|NZ_CP016091.1_1674197_1676546_+	PRK02471, PRK02471, bifunctional glutamate--cysteine ligase GshA/glutathione synthetase GshB	NA|266aa|up_2|NZ_CP016091.1_1677081_1677879_+	COG0790, COG0790, FOG: TPR repeat, SEL1 subfamily [General function prediction only]	NA|391aa|up_1|NZ_CP016091.1_1678136_1679309_-	pfam01270, Glyco_hydro_8, Glycosyl hydrolases family 8	NA|378aa|up_0|NZ_CP016091.1_1679644_1680778_+	NA	NA|104aa|down_0|NZ_CP016091.1_1682018_1682330_+	NA	NA|354aa|down_1|NZ_CP016091.1_1682468_1683530_+	cd08174, G1PDH-like, Glycerol-1-phosphate dehydrogenase-like	NA|625aa|down_2|NZ_CP016091.1_1683603_1685478_-	TIGR01702, Carbon_monoxide_dehydrogenase, carbon-monoxide dehydrogenase, catalytic subunit	NA|138aa|down_3|NZ_CP016091.1_1685594_1686008_-	COG1959, COG1959, Predicted transcriptional regulator [Transcription]	DEDDh|252aa|down_4|NZ_CP016091.1_1686469_1687225_-	cd06133, ERI-1_3'hExo_like, DEDDh 3'-5' exonuclease domain of Caenorhabditis elegans ERI-1, human 3' exonuclease, and similar proteins	NA|177aa|down_5|NZ_CP016091.1_1687504_1688035_+	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|468aa|down_6|NZ_CP016091.1_1689035_1690439_+	cd16913, YkuD_like, L,D-transpeptidases/carboxypeptidases similar to Bacillus YkuD	NA|462aa|down_7|NZ_CP016091.1_1690839_1692225_+	cd16913, YkuD_like, L,D-transpeptidases/carboxypeptidases similar to Bacillus YkuD	NA|279aa|down_8|NZ_CP016091.1_1692807_1693644_+	PRK01060, PRK01060, endonuclease IV; Provisional	NA|58aa|down_9|NZ_CP016091.1_1693846_1694020_+	NA
GCF_002003365.1_ASM200336v1	NZ_CP016091	Clostridium saccharobutylicum strain NCP 258, complete genome	3	2338128-2338615	1,3,1	PILER-CR,CRISPRCasFinder,CRT	no	RT,cas6,cas8b1,cas7b,cas5,cas3	csx1,cas3,cas14j,RT,DEDDh,WYL,c2c9_V-U4,csa3,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,cas7,cas8b2,DinG	Type I-B	GTTGAAGATTAACATTATATGTTTTGAAAT,GTTGAAGATTAACATTATATGTTTTGAAAT,GTTGAAGATTAACATTATATGTTTTGAAAT	30,30,30	1	1	2338484-2338519	NZ_CP016091.1_2948576-2948611	NA:NA:NA	7,7,7	7	TypeI-B	csx1,cas3,cas14j,RT,DEDDh,WYL,c2c9_V-U4,csa3,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,cas7,cas8b2,DinG	NA|603aa|up_9|NZ_CP016091.1_2325558_2327367_+,NA|183aa|down_2|NZ_CP016091.1_2342526_2343075_+	NA|603aa|up_9|NZ_CP016091.1_2325558_2327367_+	NA	RT|502aa|up_8|NZ_CP016091.1_2327363_2328869_+	cd03487, RT_Bac_retron_II, RT_Bac_retron_II: Reverse transcriptases (RTs) in bacterial retrotransposons or retrons	NA|342aa|up_7|NZ_CP016091.1_2329456_2330482_+	pfam12395, DUF3658, Protein of unknown function	NA|176aa|up_6|NZ_CP016091.1_2330700_2331228_+	TIGR02954, RNA_polymerase_ECF-type_sigma_factor, RNA polymerase sigma-70 factor, TIGR02954 family	NA|331aa|up_5|NZ_CP016091.1_2331250_2332243_+	pfam13786, DUF4179, Domain of unknown function (DUF4179)	NA|262aa|up_4|NZ_CP016091.1_2332525_2333311_+	pfam00457, Glyco_hydro_11, Glycosyl hydrolases family 11	cas6|232aa|up_3|NZ_CP016091.1_2333679_2334375_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	cas8b1|593aa|up_2|NZ_CP016091.1_2334389_2336168_+	TIGR02591, cas_Csh1, CRISPR-associated protein Cas8b/Csh1, subtype I-B/HMARI	cas7b|328aa|up_1|NZ_CP016091.1_2336160_2337144_+	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas5|261aa|up_0|NZ_CP016091.1_2337146_2337929_+	TIGR02592, hypothetical_protein_CTC_01466, CRISPR-associated protein Cas5, subtype I-B/HMARI	cas3|859aa|down_0|NZ_CP016091.1_2338667_2341244_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|279aa|down_1|NZ_CP016091.1_2341404_2342241_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|183aa|down_2|NZ_CP016091.1_2342526_2343075_+	NA	NA|195aa|down_3|NZ_CP016091.1_2343167_2343752_+	COG0655, WrbA, Multimeric flavodoxin WrbA [General function prediction only]	NA|178aa|down_4|NZ_CP016091.1_2344222_2344756_-	pfam13787, HXXEE, Protein of unknown function with HXXEE motif	NA|187aa|down_5|NZ_CP016091.1_2345128_2345689_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|146aa|down_6|NZ_CP016091.1_2346024_2346462_+	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|454aa|down_7|NZ_CP016091.1_2346546_2347908_+	cd13143, MATE_MepA_like, Subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins similar to Streptococcus aureus MepA	NA|269aa|down_8|NZ_CP016091.1_2348482_2349289_+	smart00138, MeTrc, Methyltransferase, chemotaxis proteins	NA|684aa|down_9|NZ_CP016091.1_2349308_2351360_+	COG0643, CheA, Chemotaxis protein histidine kinase and related kinases [Cell motility and secretion / Signal transduction mechanisms]
GCF_002003365.1_ASM200336v1	NZ_CP016091	Clostridium saccharobutylicum strain NCP 258, complete genome	4	2367042-2367233	2,4	PILER-CR,CRISPRCasFinder	no		csx1,cas3,cas14j,RT,DEDDh,WYL,c2c9_V-U4,csa3,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,cas7,cas8b2,DinG	Orphan	TAAATTAAAAAGTATAACAATACCAGATAGTGTAACAAATATAGGAGAATATGCATTT,AAAGTATAACAATACCAGATAGTGTAACAAATATAG	58,36	0	0	NA	NA	NA:NA	2,2	2	Orphan	csx1,cas3,cas14j,RT,DEDDh,WYL,c2c9_V-U4,csa3,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,cas7,cas8b2,DinG	NA,NA|131aa|down_2|NZ_CP016091.1_2369841_2370234_+,NA|254aa|down_3|NZ_CP016091.1_2370361_2371123_+,NA|242aa|down_7|NZ_CP016091.1_2374456_2375182_+,NA|245aa|down_8|NZ_CP016091.1_2375197_2375932_+	NA|153aa|up_9|NZ_CP016091.1_2352621_2353080_+	cd00732, CheW, CheW, a small regulator protein, unique to the chemotaxis signalling in prokaryotes and archea	NA|538aa|up_8|NZ_CP016091.1_2353105_2354719_+	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|182aa|up_7|NZ_CP016091.1_2355226_2355772_+	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|426aa|up_6|NZ_CP016091.1_2355830_2357108_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|831aa|up_5|NZ_CP016091.1_2357432_2359925_+	pfam02687, FtsX, FtsX-like permease family	NA|280aa|up_4|NZ_CP016091.1_2360324_2361164_+	COG0348, NapH, Polyferredoxin [Energy production and conversion]	NA|224aa|up_3|NZ_CP016091.1_2361266_2361938_+	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|831aa|up_2|NZ_CP016091.1_2361940_2364433_+	pfam02687, FtsX, FtsX-like permease family	NA|227aa|up_1|NZ_CP016091.1_2364564_2365245_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|308aa|up_0|NZ_CP016091.1_2365346_2366270_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|187aa|down_0|NZ_CP016091.1_2367708_2368269_+	PRK06811, PRK06811, RNA polymerase factor sigma-70; Validated	NA|373aa|down_1|NZ_CP016091.1_2368270_2369389_+	pfam13786, DUF4179, Domain of unknown function (DUF4179)	NA|131aa|down_2|NZ_CP016091.1_2369841_2370234_+	NA	NA|254aa|down_3|NZ_CP016091.1_2370361_2371123_+	NA	NA|324aa|down_4|NZ_CP016091.1_2371491_2372463_+	pfam02517, Abi, CAAX protease self-immunity	NA|228aa|down_5|NZ_CP016091.1_2372598_2373282_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|311aa|down_6|NZ_CP016091.1_2373540_2374473_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|242aa|down_7|NZ_CP016091.1_2374456_2375182_+	NA	NA|245aa|down_8|NZ_CP016091.1_2375197_2375932_+	NA	NA|294aa|down_9|NZ_CP016091.1_2375955_2376837_+	COG2205, KdpD, Osmosensitive K+ channel histidine kinase [Signal transduction mechanisms]
GCF_002003365.1_ASM200336v1	NZ_CP016091	Clostridium saccharobutylicum strain NCP 258, complete genome	5	2725111-2726540	5,2,3	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,cas4,cas3,cas5,cas7,cas8b2,cas6,WYL	csx1,cas3,cas14j,RT,DEDDh,WYL,c2c9_V-U4,csa3,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,cas7,cas8b2,DinG	Unclear	ATTTACATTCCTCTAGTTAAGATAAAAC,ATTTACATTCCTCTAGTTAAGATAAAAC,ATTTACATTCCTCTAGTTAAGATAAAAC	28,28,28	0	0	NA	NA	NA:NA:NA	22,22,21	22	Unclear	csx1,cas3,cas14j,RT,DEDDh,WYL,c2c9_V-U4,csa3,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,cas7,cas8b2,DinG	NA,NA|104aa|down_9|NZ_CP016091.1_2736443_2736755_-	NA|318aa|up_9|NZ_CP016091.1_2710931_2711885_-	COG0655, WrbA, Multimeric flavodoxin WrbA [General function prediction only]	NA|423aa|up_8|NZ_CP016091.1_2712113_2713382_+	pfam13556, HTH_30, PucR C-terminal helix-turn-helix domain	NA|530aa|up_7|NZ_CP016091.1_2714916_2716506_+	PRK09431, asnB, asparagine synthetase B; Provisional	NA|636aa|up_6|NZ_CP016091.1_2716703_2718611_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|203aa|up_5|NZ_CP016091.1_2719165_2719774_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|574aa|up_4|NZ_CP016091.1_2720403_2722125_-	COG0840, Tar, Methyl-accepting chemotaxis protein [Cell motility and secretion / Signal transduction mechanisms]	NA|172aa|up_3|NZ_CP016091.1_2722354_2722870_-	pfam08020, DUF1706, Protein of unknown function (DUF1706)	NA|174aa|up_2|NZ_CP016091.1_2722897_2723419_-	cd01014, nicotinamidase_related, Nicotinamidase_ related amidohydrolases	NA|243aa|up_1|NZ_CP016091.1_2723549_2724278_-	pfam07007, LprI, Lysozyme inhibitor LprI	NA|43aa|up_0|NZ_CP016091.1_2724514_2724643_+	pfam13701, DDE_Tnp_1_4, Transposase DDE domain group 1	cas2|88aa|down_0|NZ_CP016091.1_2726723_2726987_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|328aa|down_1|NZ_CP016091.1_2726988_2727972_-	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas4|172aa|down_2|NZ_CP016091.1_2727977_2728493_-	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas3|762aa|down_3|NZ_CP016091.1_2728509_2730795_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|245aa|down_4|NZ_CP016091.1_2730833_2731568_-	cd09658, Cas5_I-B, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas7|294aa|down_5|NZ_CP016091.1_2731576_2732458_-	TIGR02585, conserved_protein, CRISPR-associated protein Cas7/Cst2/DevR, subtype I-B/TNEAP	cas8b2|563aa|down_6|NZ_CP016091.1_2732478_2734167_-	cd09665, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1	cas6|247aa|down_7|NZ_CP016091.1_2734181_2734922_-	COG1583, COG1583, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	WYL|313aa|down_8|NZ_CP016091.1_2735092_2736031_-	pfam13280, WYL, WYL domain	NA|104aa|down_9|NZ_CP016091.1_2736443_2736755_-	NA
GCF_002003365.1_ASM200336v1	NZ_CP016091	Clostridium saccharobutylicum strain NCP 258, complete genome	6	4379494-4379666	4	PILER-CR	no		csx1,cas3,cas14j,RT,DEDDh,WYL,c2c9_V-U4,csa3,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,cas7,cas8b2,DinG	Orphan	TCAATAACAATCCCAAACAGTATAACAAGTATAGG	35	0	0	NA	NA	NA	2	2	Orphan	csx1,cas3,cas14j,RT,DEDDh,WYL,c2c9_V-U4,csa3,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,cas7,cas8b2,DinG	NA,NA|260aa|down_6|NZ_CP016091.1_4389367_4390147_-	NA|272aa|up_9|NZ_CP016091.1_4365048_4365864_-	cd01948, EAL, EAL domain	NA|200aa|up_8|NZ_CP016091.1_4366237_4366837_+	pfam00589, Phage_integrase, Phage integrase family	NA|119aa|up_7|NZ_CP016091.1_4366953_4367310_-	cd08026, DUF326, Cysteine-rich 4 helical bundle widely conserved in bacteria	NA|138aa|up_6|NZ_CP016091.1_4367435_4367849_-	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|380aa|up_5|NZ_CP016091.1_4368064_4369204_-	cd01158, SCAD_SBCAD, Short chain acyl-CoA dehydrogenases and eukaryotic short/branched chain acyl-CoA dehydrogenases	NA|523aa|up_4|NZ_CP016091.1_4369290_4370859_-	COG4670, COG4670, Acyl CoA:acetate/3-ketoacid CoA transferase [Lipid metabolism]	NA|259aa|up_3|NZ_CP016091.1_4370969_4371746_-	PRK05809, PRK05809, short-chain-enoyl-CoA hydratase	NA|817aa|up_2|NZ_CP016091.1_4372667_4375118_+	PRK00390, leuS, leucyl-tRNA synthetase; Validated	NA|45aa|up_1|NZ_CP016091.1_4375214_4375349_+	COG4191, COG4191, Signal transduction histidine kinase regulating C4-dicarboxylate transport system [Signal transduction mechanisms]	NA|40aa|up_0|NZ_CP016091.1_4375526_4375646_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|391aa|down_0|NZ_CP016091.1_4381946_4383119_-	COG3947, COG3947, Response regulator containing CheY-like receiver and SARP domains [Signal transduction mechanisms]	NA|511aa|down_1|NZ_CP016091.1_4383148_4384681_-	pfam16927, HisKA_7TM, N-terminal 7TM region of histidine kinase	NA|249aa|down_2|NZ_CP016091.1_4385144_4385891_-	pfam01564, Spermine_synth, Spermine/spermidine synthase domain	NA|111aa|down_3|NZ_CP016091.1_4386102_4386435_+	TIGR01911, conserved_protein, HesB-like selenoprotein	NA|445aa|down_4|NZ_CP016091.1_4386879_4388214_-	COG3864, COG3864, Uncharacterized protein conserved in bacteria [Function unknown]	NA|361aa|down_5|NZ_CP016091.1_4388267_4389350_-	pfam07728, AAA_5, AAA domain (dynein-related subfamily)	NA|260aa|down_6|NZ_CP016091.1_4389367_4390147_-	NA	NA|264aa|down_7|NZ_CP016091.1_4390237_4391029_-	COG2357, COG2357, PpGpp synthetase catalytic domain [General function prediction only]	NA|225aa|down_8|NZ_CP016091.1_4391195_4391870_+	pfam09986, DUF2225, Uncharacterized protein conserved in bacteria (DUF2225)	NA|314aa|down_9|NZ_CP016091.1_4391900_4392842_+	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)
GCF_002003365.1_ASM200336v1	NZ_CP016091	Clostridium saccharobutylicum strain NCP 258, complete genome	7	4380587-4380911	5	PILER-CR	no		csx1,cas3,cas14j,RT,DEDDh,WYL,c2c9_V-U4,csa3,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,cas7,cas8b2,DinG	Orphan	GGCTTAACGTCAATAACAATACCAAGTAGTGTAACAAGTATAGGA--------ATCG	57	2	4	4380774-4380793|4380774-4380793|4380843-4380862|4380843-4380862	NZ_CP016091.1_4380567-4380586|NZ_CP016091.1_4380912-4380931|NZ_CP016091.1_4380567-4380586|NZ_CP016091.1_4380912-4380931	NA	4	4	Orphan	csx1,cas3,cas14j,RT,DEDDh,WYL,c2c9_V-U4,csa3,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,cas7,cas8b2,DinG	NA,NA|260aa|down_6|NZ_CP016091.1_4389367_4390147_-	NA|272aa|up_9|NZ_CP016091.1_4365048_4365864_-	cd01948, EAL, EAL domain	NA|200aa|up_8|NZ_CP016091.1_4366237_4366837_+	pfam00589, Phage_integrase, Phage integrase family	NA|119aa|up_7|NZ_CP016091.1_4366953_4367310_-	cd08026, DUF326, Cysteine-rich 4 helical bundle widely conserved in bacteria	NA|138aa|up_6|NZ_CP016091.1_4367435_4367849_-	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|380aa|up_5|NZ_CP016091.1_4368064_4369204_-	cd01158, SCAD_SBCAD, Short chain acyl-CoA dehydrogenases and eukaryotic short/branched chain acyl-CoA dehydrogenases	NA|523aa|up_4|NZ_CP016091.1_4369290_4370859_-	COG4670, COG4670, Acyl CoA:acetate/3-ketoacid CoA transferase [Lipid metabolism]	NA|259aa|up_3|NZ_CP016091.1_4370969_4371746_-	PRK05809, PRK05809, short-chain-enoyl-CoA hydratase	NA|817aa|up_2|NZ_CP016091.1_4372667_4375118_+	PRK00390, leuS, leucyl-tRNA synthetase; Validated	NA|45aa|up_1|NZ_CP016091.1_4375214_4375349_+	COG4191, COG4191, Signal transduction histidine kinase regulating C4-dicarboxylate transport system [Signal transduction mechanisms]	NA|40aa|up_0|NZ_CP016091.1_4375526_4375646_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|391aa|down_0|NZ_CP016091.1_4381946_4383119_-	COG3947, COG3947, Response regulator containing CheY-like receiver and SARP domains [Signal transduction mechanisms]	NA|511aa|down_1|NZ_CP016091.1_4383148_4384681_-	pfam16927, HisKA_7TM, N-terminal 7TM region of histidine kinase	NA|249aa|down_2|NZ_CP016091.1_4385144_4385891_-	pfam01564, Spermine_synth, Spermine/spermidine synthase domain	NA|111aa|down_3|NZ_CP016091.1_4386102_4386435_+	TIGR01911, conserved_protein, HesB-like selenoprotein	NA|445aa|down_4|NZ_CP016091.1_4386879_4388214_-	COG3864, COG3864, Uncharacterized protein conserved in bacteria [Function unknown]	NA|361aa|down_5|NZ_CP016091.1_4388267_4389350_-	pfam07728, AAA_5, AAA domain (dynein-related subfamily)	NA|260aa|down_6|NZ_CP016091.1_4389367_4390147_-	NA	NA|264aa|down_7|NZ_CP016091.1_4390237_4391029_-	COG2357, COG2357, PpGpp synthetase catalytic domain [General function prediction only]	NA|225aa|down_8|NZ_CP016091.1_4391195_4391870_+	pfam09986, DUF2225, Uncharacterized protein conserved in bacteria (DUF2225)	NA|314aa|down_9|NZ_CP016091.1_4391900_4392842_+	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)
GCF_002003365.1_ASM200336v1	NZ_CP016091	Clostridium saccharobutylicum strain NCP 258, complete genome	8	4494981-4495073	6	CRISPRCasFinder	no		csx1,cas3,cas14j,RT,DEDDh,WYL,c2c9_V-U4,csa3,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,cas7,cas8b2,DinG	Orphan	TTTATTACATATATAGTTATATATGTA	27	0	0	NA	NA	NA	1	1	Orphan	csx1,cas3,cas14j,RT,DEDDh,WYL,c2c9_V-U4,csa3,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,cas7,cas8b2,DinG	NA|95aa|up_8|NZ_CP016091.1_4483082_4483367_+,NA|259aa|up_2|NZ_CP016091.1_4491190_4491967_+,NA|263aa|down_1|NZ_CP016091.1_4496966_4497755_-	NA|837aa|up_9|NZ_CP016091.1_4480283_4482794_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|95aa|up_8|NZ_CP016091.1_4483082_4483367_+	NA	NA|321aa|up_7|NZ_CP016091.1_4483622_4484585_+	pfam00688, TGFb_propeptide, TGF-beta propeptide	NA|532aa|up_6|NZ_CP016091.1_4484865_4486461_-	PRK00741, prfC, peptide chain release factor 3; Provisional	NA|489aa|up_5|NZ_CP016091.1_4487225_4488692_-	COG3391, COG3391, Uncharacterized conserved protein [Function unknown]	NA|90aa|up_4|NZ_CP016091.1_4488905_4489175_-	pfam11823, DUF3343, Protein of unknown function (DUF3343)	NA|558aa|up_3|NZ_CP016091.1_4489265_4490939_-	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	NA|259aa|up_2|NZ_CP016091.1_4491190_4491967_+	NA	NA|199aa|up_1|NZ_CP016091.1_4492241_4492838_+	pfam14879, DUF4489, Domain of unknown function (DUF4489)	NA|580aa|up_0|NZ_CP016091.1_4493222_4494962_-	COG1231, COG1231, Monoamine oxidase [Amino acid transport and metabolism]	NA|380aa|down_0|NZ_CP016091.1_4495278_4496418_-	TIGR01977, am_tr_V_EF2568, cysteine desulfurase family protein	NA|263aa|down_1|NZ_CP016091.1_4496966_4497755_-	NA	NA|751aa|down_2|NZ_CP016091.1_4498063_4500316_+	PRK01156, PRK01156, chromosome segregation protein; Provisional	NA|325aa|down_3|NZ_CP016091.1_4500753_4501728_-	cd08563, GDPD_TtGDE_like, Glycerophosphodiester phosphodiesterase domain of Thermoanaerobacter tengcongensis and similar proteins	NA|382aa|down_4|NZ_CP016091.1_4501924_4503070_+	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|349aa|down_5|NZ_CP016091.1_4503501_4504548_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|965aa|down_6|NZ_CP016091.1_4504644_4507539_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|543aa|down_7|NZ_CP016091.1_4508108_4509737_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|706aa|down_8|NZ_CP016091.1_4510117_4512235_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|136aa|down_9|NZ_CP016091.1_4512461_4512869_-	pfam01934, DUF86, Protein of unknown function DUF86
GCF_002003365.1_ASM200336v1	NZ_CP016091	Clostridium saccharobutylicum strain NCP 258, complete genome	9	4576357-4576452	7	CRISPRCasFinder	no		csx1,cas3,cas14j,RT,DEDDh,WYL,c2c9_V-U4,csa3,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,cas7,cas8b2,DinG	Orphan	TAGATCCTTCTTCAGGTGCTATGAAAACAGGAT	33	0	0	NA	NA	NA	1	1	Orphan	csx1,cas3,cas14j,RT,DEDDh,WYL,c2c9_V-U4,csa3,cas6,cas8b1,cas7b,cas5,cas2,cas1,cas4,cas7,cas8b2,DinG	NA,NA|203aa|down_5|NZ_CP016091.1_4583398_4584007_-,NA|86aa|down_9|NZ_CP016091.1_4590064_4590322_-	NA|194aa|up_9|NZ_CP016091.1_4563616_4564198_-	pfam00908, dTDP_sugar_isom, dTDP-4-dehydrorhamnose 3,5-epimerase	NA|302aa|up_8|NZ_CP016091.1_4564208_4565114_-	TIGR01207, Glucose-1-phosphate_thymidylyltransferase_1, glucose-1-phosphate thymidylyltransferase, short form	NA|405aa|up_7|NZ_CP016091.1_4565214_4566429_-	pfam04230, PS_pyruv_trans, Polysaccharide pyruvyl transferase	NA|479aa|up_6|NZ_CP016091.1_4566627_4568064_-	cd13128, MATE_Wzx_like, Wzx, a subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins	NA|415aa|up_5|NZ_CP016091.1_4568214_4569459_-	pfam13425, O-antigen_lig, O-antigen ligase like membrane protein	NA|367aa|up_4|NZ_CP016091.1_4569698_4570799_-	cd03820, GT4_AmsD-like, amylovoran biosynthesis glycosyltransferase AmsD and similar proteins	NA|294aa|up_3|NZ_CP016091.1_4570889_4571771_-	COG1216, COG1216, Predicted glycosyltransferases [General function prediction only]	NA|413aa|up_2|NZ_CP016091.1_4571763_4573002_-	TIGR03087, stp1, sugar transferase, PEP-CTERM/EpsH1 system associated	NA|246aa|up_1|NZ_CP016091.1_4573046_4573784_-	COG1922, WecG, Teichoic acid biosynthesis proteins [Cell envelope biogenesis, outer membrane]	NA|214aa|up_0|NZ_CP016091.1_4574931_4575573_-	pfam02397, Bac_transf, Bacterial sugar transferase	NA|256aa|down_0|NZ_CP016091.1_4577689_4578457_-	COG4464, CapC, Capsular polysaccharide biosynthesis protein [Carbohydrate transport and metabolism / Cell envelope biogenesis, outer membrane]	NA|248aa|down_1|NZ_CP016091.1_4578474_4579218_-	cd05387, BY-kinase, bacterial tyrosine-kinase	NA|227aa|down_2|NZ_CP016091.1_4579230_4579911_-	COG3944, COG3944, Capsular polysaccharide biosynthesis protein [Cell envelope biogenesis, outer membrane]	NA|325aa|down_3|NZ_CP016091.1_4580524_4581499_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|364aa|down_4|NZ_CP016091.1_4582103_4583195_+	pfam01757, Acyl_transf_3, Acyltransferase family	NA|203aa|down_5|NZ_CP016091.1_4583398_4584007_-	NA	NA|396aa|down_6|NZ_CP016091.1_4584616_4585804_-	pfam07907, YibE_F, YibE/F-like protein	NA|553aa|down_7|NZ_CP016091.1_4585949_4587608_-	cd00839, MPP_PAPs, purple acid phosphatases of the metallophosphatase superfamily, metallophosphatase domain	NA|572aa|down_8|NZ_CP016091.1_4588011_4589727_-	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|86aa|down_9|NZ_CP016091.1_4590064_4590322_-	NA
