assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002234575.2_ASM223457v2	NZ_CP022464	Enterocloster bolteae strain ATCC BAA-613 chromosome, complete genome	1	145113-145213	1	CRISPRCasFinder	no		DEDDh,csa3,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,WYL,PD-DExK,DinG,RT	Orphan	TAAAAAAGTTGTTGACAAACAGT	23	0	0	NA	NA	NA	1	1	Orphan	DEDDh,csa3,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,WYL,PD-DExK,DinG,RT	NA,NA|386aa|down_2|NZ_CP022464.2_153311_154469_+,NA|311aa|down_3|NZ_CP022464.2_154479_155412_+,NA|131aa|down_5|NZ_CP022464.2_156535_156928_+,NA|340aa|down_7|NZ_CP022464.2_159246_160266_+	NA|1095aa|up_9|NZ_CP022464.2_128959_132244_+	cd10789, GH38N_AMII_ER_cytosolic, N-terminal catalytic domain of endoplasmic reticulum(ER)/cytosolic class II alpha-mannosidases; glycoside hydrolase family 38 (GH38)	NA|571aa|up_8|NZ_CP022464.2_132286_133999_+	cd11333, AmyAc_SI_OligoGlu_DGase, Alpha amylase catalytic domain found in Sucrose isomerases, oligo-1,6-glucosidase (also called isomaltase; sucrase-isomaltase; alpha-limit dextrinase), dextran glucosidase (also called glucan 1,6-alpha-glucosidase), and related proteins	NA|741aa|up_7|NZ_CP022464.2_134016_136239_+	COG3345, GalA, Alpha-galactosidase [Carbohydrate transport and metabolism]	NA|360aa|up_6|NZ_CP022464.2_136878_137958_+	PRK03202, PRK03202, ATP-dependent 6-phosphofructokinase	NA|561aa|up_5|NZ_CP022464.2_138035_139718_+	PRK05563, PRK05563, DNA polymerase III subunits gamma and tau; Validated	NA|119aa|up_4|NZ_CP022464.2_139760_140117_+	PRK00153, PRK00153, YbaB/EbfC family nucleoid-associated protein	NA|199aa|up_3|NZ_CP022464.2_140116_140713_+	PRK00076, recR, recombination protein RecR; Reviewed	NA|843aa|up_2|NZ_CP022464.2_140759_143288_+	PTZ00395, PTZ00395, Sec24-related protein; Provisional	NA|276aa|up_1|NZ_CP022464.2_143284_144112_+	cd07516, HAD_Pase, phosphatase, similar to Escherichia coli Cof and Thermotoga maritima TM0651; belongs to the haloacid dehalogenase-like superfamily	NA|241aa|up_0|NZ_CP022464.2_144130_144853_+	pfam02674, Colicin_V, Colicin V production protein	NA|394aa|down_0|NZ_CP022464.2_151213_152395_-	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|88aa|down_1|NZ_CP022464.2_152486_152750_-	pfam12728, HTH_17, Helix-turn-helix domain	NA|386aa|down_2|NZ_CP022464.2_153311_154469_+	NA	NA|311aa|down_3|NZ_CP022464.2_154479_155412_+	NA	NA|346aa|down_4|NZ_CP022464.2_155497_156535_+	COG5377, COG5377, Phage-related protein, predicted endonuclease [DNA replication, recombination, and repair]	NA|131aa|down_5|NZ_CP022464.2_156535_156928_+	NA	NA|754aa|down_6|NZ_CP022464.2_156911_159173_+	TIGR01448, recD_rel, helicase, putative, RecD/TraA family	NA|340aa|down_7|NZ_CP022464.2_159246_160266_+	NA	NA|416aa|down_8|NZ_CP022464.2_160333_161581_+	TIGR01391, DNA_primase, DNA primase, catalytic core	NA|306aa|down_9|NZ_CP022464.2_161666_162584_+	cd10227, ParM_like, Plasmid segregation protein ParM and similar proteins
GCF_002234575.2_ASM223457v2	NZ_CP022464	Enterocloster bolteae strain ATCC BAA-613 chromosome, complete genome	2	601611-602453	1,2,1	PILER-CR,CRISPRCasFinder,CRT	no	cas3,cas5,cas8c,cas7,cas4,cas1,cas2	DEDDh,csa3,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,WYL,PD-DExK,DinG,RT	 Type I-U?,Type I-U,Type I-C	GTCTCCGTCCTCGCGGGCGGAGTGGGTTGAAAT,GTCTCCGTCCTCGCGGGCGGAGTGGGTTGAAAT,GTCTCCGTCCTCGCGGGCGGAGTGGGTTGAAAT	33,33,33	0	0	NA	NA	NA:NA:NA	11,12,12	12	TypeI-U?,TypeI-U,TypeI-C	DEDDh,csa3,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,WYL,PD-DExK,DinG,RT	NA|60aa|up_7|NZ_CP022464.2_593516_593696_+,NA|58aa|down_5|NZ_CP022464.2_608267_608441_+,NA|107aa|down_9|NZ_CP022464.2_611405_611726_+	NA|365aa|up_9|NZ_CP022464.2_590396_591491_-	pfam10282, Lactonase, Lactonase, 7-bladed beta-propeller	NA|496aa|up_8|NZ_CP022464.2_591799_593287_-	cd07117, ALDH_StaphAldA1, Uncharacterized Staphylococcus aureus AldA1 (SACOL0154) aldehyde dehydrogenase-like	NA|60aa|up_7|NZ_CP022464.2_593516_593696_+	NA	cas3|726aa|up_6|NZ_CP022464.2_593836_596014_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|220aa|up_5|NZ_CP022464.2_596057_596717_+	cd09752, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas8c|592aa|up_4|NZ_CP022464.2_596713_598489_+	pfam09709, Cas_Csd1, CRISPR-associated protein (Cas_Csd1)	cas7|297aa|up_3|NZ_CP022464.2_598490_599381_+	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas4|221aa|up_2|NZ_CP022464.2_599380_600043_+	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas1|344aa|up_1|NZ_CP022464.2_600039_601071_+	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas2|97aa|up_0|NZ_CP022464.2_601109_601400_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|268aa|down_0|NZ_CP022464.2_602667_603471_-	pfam02633, Creatininase, Creatinine amidohydrolase	NA|203aa|down_1|NZ_CP022464.2_603509_604118_-	COG3601, COG3601, Predicted membrane protein [Function unknown]	NA|469aa|down_2|NZ_CP022464.2_604671_606078_+	PRK01096, PRK01096, deoxyguanosinetriphosphate triphosphohydrolase-like protein; Provisional	NA|129aa|down_3|NZ_CP022464.2_606089_606476_+	pfam10990, DUF2809, Protein of unknown function (DUF2809)	NA|591aa|down_4|NZ_CP022464.2_606472_608245_+	pfam07929, PRiA4_ORF3, Plasmid pRiA4b ORF-3-like protein	NA|58aa|down_5|NZ_CP022464.2_608267_608441_+	NA	NA|310aa|down_6|NZ_CP022464.2_608480_609410_+	COG2390, DeoR, Transcriptional regulator, contains sigma factor-related N-terminal domain [Transcription]	NA|207aa|down_7|NZ_CP022464.2_609583_610204_+	PRK14479, PRK14479, dihydroxyacetone kinase; Provisional	NA|333aa|down_8|NZ_CP022464.2_610334_611333_+	pfam02733, Dak1, Dak1 domain	NA|107aa|down_9|NZ_CP022464.2_611405_611726_+	NA
GCF_002234575.2_ASM223457v2	NZ_CP022464	Enterocloster bolteae strain ATCC BAA-613 chromosome, complete genome	3	1322942-1323037	3	CRISPRCasFinder	no	WYL	DEDDh,csa3,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,WYL,PD-DExK,DinG,RT	Unclear	GTCCAGGAATTCCGGAGAGTCCAAGGCATCCAG	33	0	0	NA	NA	NA	1	1	Orphan	DEDDh,csa3,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,WYL,PD-DExK,DinG,RT	NA,NA|218aa|down_8|NZ_CP022464.2_1333286_1333940_+	NA|252aa|up_9|NZ_CP022464.2_1314212_1314968_+	PRK10621, PRK10621, hypothetical protein; Provisional	NA|319aa|up_8|NZ_CP022464.2_1315008_1315965_+	COG2264, PrmA, Ribosomal protein L11 methylase [Translation, ribosomal structure and biogenesis]	NA|250aa|up_7|NZ_CP022464.2_1315995_1316745_+	pfam04452, Methyltrans_RNA, RNA methyltransferase	NA|385aa|up_6|NZ_CP022464.2_1316758_1317913_+	COG1104, NifS, Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes [Amino acid transport and metabolism]	NA|395aa|up_5|NZ_CP022464.2_1317980_1319165_+	PRK01565, PRK01565, thiamine biosynthesis protein ThiI; Provisional	NA|78aa|up_4|NZ_CP022464.2_1319337_1319571_-	COG1925, FruB, Phosphotransferase system, HPr-related proteins [Carbohydrate transport and metabolism]	NA|454aa|up_3|NZ_CP022464.2_1319776_1321138_+	COG0621, MiaB, 2-methylthioadenine synthetase [Translation, ribosomal structure and biogenesis]	NA|87aa|up_2|NZ_CP022464.2_1321287_1321548_+	PRK05473, IreB-like, IreB family regulatory phosphoprotein	NA|142aa|up_1|NZ_CP022464.2_1321768_1322194_+	PRK00109, PRK00109, Holliday junction resolvase RuvX	NA|95aa|up_0|NZ_CP022464.2_1322264_1322549_+	pfam06949, DUF1292, Protein of unknown function (DUF1292)	NA|547aa|down_0|NZ_CP022464.2_1323143_1324784_+	COG0595, COG0595, mRNA degradation ribonucleases J1/J2 (metallo-beta-lactamase superfamily) [Translation, ribosomal structure and biogenesis; Replication, recombination and repair]	NA|237aa|down_1|NZ_CP022464.2_1324877_1325588_+	pfam02618, YceG, YceG-like family	NA|220aa|down_2|NZ_CP022464.2_1325584_1326244_+	COG4122, COG4122, Predicted O-methyltransferase [General function prediction only]	NA|412aa|down_3|NZ_CP022464.2_1326240_1327476_+	COG0826, COG0826, Collagenase and related proteases [Posttranslational modification, protein turnover, chaperones]	NA|276aa|down_4|NZ_CP022464.2_1327582_1328410_+	cd06259, YdcF-like, YdcF-like	NA|990aa|down_5|NZ_CP022464.2_1328521_1331491_+	COG1026, COG1026, Predicted Zn-dependent peptidases, insulinase-like [General function prediction only]	NA|304aa|down_6|NZ_CP022464.2_1331512_1332424_+	PRK00089, era, GTPase Era; Reviewed	NA|217aa|down_7|NZ_CP022464.2_1332541_1333192_+	PRK00085, recO, DNA repair protein RecO; Reviewed	NA|218aa|down_8|NZ_CP022464.2_1333286_1333940_+	NA	NA|463aa|down_9|NZ_CP022464.2_1333996_1335385_+	PRK04173, PRK04173, glycyl-tRNA synthetase; Provisional
GCF_002234575.2_ASM223457v2	NZ_CP022464	Enterocloster bolteae strain ATCC BAA-613 chromosome, complete genome	4	2403801-2404008	4	CRISPRCasFinder	no		DEDDh,csa3,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,WYL,PD-DExK,DinG,RT	Orphan	CAGCGGCGCCATGCAGACCGGCTGGCAG	28	0	0	NA	NA	NA	3	3	Orphan	DEDDh,csa3,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,WYL,PD-DExK,DinG,RT	NA,NA|310aa|down_2|NZ_CP022464.2_2406685_2407615_-,NA|97aa|down_6|NZ_CP022464.2_2410251_2410542_+	NA|278aa|up_9|NZ_CP022464.2_2388421_2389255_+	COG2996, COG2996, Predicted RNA-bindining protein (contains S1 and HTH domains) [General function prediction only]	NA|304aa|up_8|NZ_CP022464.2_2389321_2390233_+	COG1284, COG1284, Uncharacterized conserved protein [Function unknown]	NA|371aa|up_7|NZ_CP022464.2_2390498_2391611_+	PRK11650, ugpC, sn-glycerol-3-phosphate ABC transporter ATP-binding protein UgpC	NA|363aa|up_6|NZ_CP022464.2_2391752_2392841_+	COG3835, CdaR, Sugar diacid utilization regulator [Transcription / Signal transduction mechanisms]	NA|224aa|up_5|NZ_CP022464.2_2392855_2393527_+	COG2884, FtsE, Predicted ATPase involved in cell division [Cell division and chromosome partitioning]	NA|303aa|up_4|NZ_CP022464.2_2393596_2394505_+	COG2177, FtsX, Cell division protein [Cell division and chromosome partitioning]	NA|403aa|up_3|NZ_CP022464.2_2394520_2395729_+	pfam01551, Peptidase_M23, Peptidase family M23	NA|470aa|up_2|NZ_CP022464.2_2395771_2397181_+	COG0793, Prc, Periplasmic protease [Cell envelope biogenesis, outer membrane]	NA|929aa|up_1|NZ_CP022464.2_2397389_2400176_+	PRK11107, PRK11107, hybrid sensory histidine kinase BarA; Provisional	NA|661aa|up_0|NZ_CP022464.2_2400802_2402785_+	PRK05298, PRK05298, excinuclease ABC subunit UvrB	NA|79aa|down_0|NZ_CP022464.2_2404309_2404546_+	pfam04023, FeoA, FeoA domain	NA|700aa|down_1|NZ_CP022464.2_2404542_2406642_+	COG0370, FeoB, Fe2+ transport system protein B [Inorganic ion transport and metabolism]	NA|310aa|down_2|NZ_CP022464.2_2406685_2407615_-	NA	NA|341aa|down_3|NZ_CP022464.2_2407847_2408870_+	pfam06738, ThrE, Putative threonine/serine exporter	NA|147aa|down_4|NZ_CP022464.2_2408863_2409304_+	pfam12821, ThrE_2, Threonine/Serine exporter, ThrE	NA|60aa|down_5|NZ_CP022464.2_2409608_2409788_-	TIGR01764, Probable_excisionase, DNA binding domain, excisionase family	NA|97aa|down_6|NZ_CP022464.2_2410251_2410542_+	NA	NA|173aa|down_7|NZ_CP022464.2_2410541_2411060_+	pfam03432, Relaxase, Relaxase/Mobilisation nuclease domain	NA|88aa|down_8|NZ_CP022464.2_2411148_2411412_+	pfam08765, Mor, Mor transcription activator family	NA|178aa|down_9|NZ_CP022464.2_2411555_2412089_+	pfam13353, Fer4_12, 4Fe-4S single cluster domain
GCF_002234575.2_ASM223457v2	NZ_CP022464	Enterocloster bolteae strain ATCC BAA-613 chromosome, complete genome	5	3935097-3935237	5	CRISPRCasFinder	no		DEDDh,csa3,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,WYL,PD-DExK,DinG,RT	Orphan	CGCTTACGTGCGATACATGTCACTGGCACTTAGCGAGGTCTTTTACG	47	0	0	NA	NA	NA	1	1	Orphan	DEDDh,csa3,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,WYL,PD-DExK,DinG,RT	NA,NA	NA|386aa|up_9|NZ_CP022464.2_3924499_3925657_-	PRK10657, PRK10657, isoaspartyl dipeptidase; Provisional	NA|380aa|up_8|NZ_CP022464.2_3925792_3926932_-	COG4225, COG4225, Predicted unsaturated glucuronyl hydrolase involved in regulation of bacterial surface properties, and related proteins [General function prediction only]	NA|436aa|up_7|NZ_CP022464.2_3926949_3928257_-	COG1593, DctQ, TRAP-type C4-dicarboxylate transport system, large permease component [Carbohydrate transport and metabolism]	NA|157aa|up_6|NZ_CP022464.2_3928258_3928729_-	COG3090, DctM, TRAP-type C4-dicarboxylate transport system, small permease component [Carbohydrate transport and metabolism]	NA|341aa|up_5|NZ_CP022464.2_3928745_3929768_-	cd13671, PBP2_TRAP_SBP_like_3, Uncharacterized substrate-binding protein of the Tripartite ATP-independent  Periplasmic transporter family; the type 2 periplasmic-binding protein fold	NA|310aa|up_4|NZ_CP022464.2_3929984_3930914_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|276aa|up_3|NZ_CP022464.2_3931043_3931871_-	PRK03634, PRK03634, rhamnulose-1-phosphate aldolase; Provisional	NA|417aa|up_2|NZ_CP022464.2_3931959_3933210_-	PRK01076, PRK01076, L-rhamnose isomerase; Provisional	NA|105aa|up_1|NZ_CP022464.2_3933258_3933573_-	TIGR02625, L-rhamnose_mutarotase, L-rhamnose mutarotase	NA|474aa|up_0|NZ_CP022464.2_3933630_3935052_-	cd07771, FGGY_RhuK, L-rhamnulose kinases; a subfamily of the FGGY family of carbohydrate kinases	NA|323aa|down_0|NZ_CP022464.2_3935375_3936344_-	cd06996, cupin_Lmo2851-like_N, AraC/XylS family transcriptional regulators similar to Listeria monocytogenes Lmo2851 protein, N-terminal cupin domain	NA|323aa|down_1|NZ_CP022464.2_3936612_3937581_-	cd12162, 2-Hacid_dh_4, Putative D-isomer specific 2-hydroxyacid dehydrogenases	NA|212aa|down_2|NZ_CP022464.2_3937599_3938235_-	COG1802, GntR, Transcriptional regulators [Transcription]	NA|355aa|down_3|NZ_CP022464.2_3938339_3939404_-	PRK08194, PRK08194, tartrate dehydrogenase; Provisional	NA|315aa|down_4|NZ_CP022464.2_3939426_3940371_-	COG0679, COG0679, Predicted permeases [General function prediction only]	NA|469aa|down_5|NZ_CP022464.2_3940372_3941779_-	COG3051, CitF, Citrate lyase, alpha subunit [Energy production and conversion]	NA|293aa|down_6|NZ_CP022464.2_3941771_3942650_-	TIGR01588, Citrate_lyase_subunit_beta, citrate lyase, beta subunit	NA|92aa|down_7|NZ_CP022464.2_3942646_3942922_-	PRK13253, PRK13253, citrate lyase subunit gamma; Provisional	NA|300aa|down_8|NZ_CP022464.2_3942992_3943892_-	cd00408, DHDPS-like, Dihydrodipicolinate synthase family	NA|204aa|down_9|NZ_CP022464.2_3943911_3944523_-	PRK08228, PRK08228, L(+)-tartrate dehydratase subunit beta; Validated
GCF_002234575.2_ASM223457v2	NZ_CP022464	Enterocloster bolteae strain ATCC BAA-613 chromosome, complete genome	6	3975348-3975466	6	CRISPRCasFinder	no		DEDDh,csa3,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,WYL,PD-DExK,DinG,RT	Orphan	GCAGCATCGCTTACAGTATCACC	23	0	0	NA	NA	NA	2	2	Orphan	DEDDh,csa3,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,WYL,PD-DExK,DinG,RT	NA|133aa|up_9|NZ_CP022464.2_3965305_3965704_-,NA|95aa|down_8|NZ_CP022464.2_3984739_3985024_-	NA|133aa|up_9|NZ_CP022464.2_3965305_3965704_-	NA	NA|198aa|up_8|NZ_CP022464.2_3965688_3966282_-	PRK06029, PRK06029, UbiX family flavin prenyltransferase	NA|496aa|up_7|NZ_CP022464.2_3966278_3967766_-	pfam01977, UbiD, 3-octaprenyl-4-hydroxybenzoate carboxy-lyase	NA|293aa|up_6|NZ_CP022464.2_3968055_3968934_+	COG0583, LysR, Transcriptional regulator [Transcription]	NA|256aa|up_5|NZ_CP022464.2_3968921_3969689_-	COG1878, COG1878, Kynurenine formamidase [Amino acid transport and metabolism]	NA|197aa|up_4|NZ_CP022464.2_3969807_3970398_-	cd07983, LPLAT_DUF374-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: DUF374	NA|423aa|up_3|NZ_CP022464.2_3970336_3971605_-	cd03817, GT4_UGDG-like, UDP-Glc:1,2-diacylglycerol 3-a-glucosyltransferase and similar proteins	NA|170aa|up_2|NZ_CP022464.2_3971616_3972126_-	COG0586, DedA, Uncharacterized membrane-associated protein [Function unknown]	NA|431aa|up_1|NZ_CP022464.2_3972413_3973706_+	cd17355, MFS_YcxA_like, MFS-type transporter YcxA and similar proteins of the Major Facilitator Superfamily of transporters	NA|520aa|up_0|NZ_CP022464.2_3973739_3975299_-	COG2944, COG2944, Predicted transcriptional regulator [Transcription]	NA|245aa|down_0|NZ_CP022464.2_3975725_3976460_+	cd01015, CSHase, N-carbamoylsarcosine amidohydrolase (CSHase) hydrolyzes N-carbamoylsarcosine to sarcosine, carbon dioxide and ammonia	NA|462aa|down_1|NZ_CP022464.2_3976553_3977939_+	PRK08323, PRK08323, phenylhydantoinase; Validated	NA|184aa|down_2|NZ_CP022464.2_3978065_3978617_-	pfam12675, DUF3795, Protein of unknown function (DUF3795)	NA|342aa|down_3|NZ_CP022464.2_3978869_3979895_+	cd13671, PBP2_TRAP_SBP_like_3, Uncharacterized substrate-binding protein of the Tripartite ATP-independent  Periplasmic transporter family; the type 2 periplasmic-binding protein fold	NA|168aa|down_4|NZ_CP022464.2_3979898_3980402_+	COG3090, DctM, TRAP-type C4-dicarboxylate transport system, small permease component [Carbohydrate transport and metabolism]	NA|436aa|down_5|NZ_CP022464.2_3980398_3981706_+	COG1593, DctQ, TRAP-type C4-dicarboxylate transport system, large permease component [Carbohydrate transport and metabolism]	NA|403aa|down_6|NZ_CP022464.2_3981830_3983039_-	pfam00375, SDF, Sodium:dicarboxylate symporter family	NA|474aa|down_7|NZ_CP022464.2_3983194_3984616_-	COG0823, TolB, Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]	NA|95aa|down_8|NZ_CP022464.2_3984739_3985024_-	NA	NA|92aa|down_9|NZ_CP022464.2_3985291_3985567_+	COG5566, COG5566, Uncharacterized conserved protein [Function unknown]
GCF_002234575.2_ASM223457v2	NZ_CP022464	Enterocloster bolteae strain ATCC BAA-613 chromosome, complete genome	7	4783951-4785390	7,2,2	CRISPRCasFinder,CRT,PILER-CR	no		DEDDh,csa3,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,WYL,PD-DExK,DinG,RT	Orphan	ATTTCAATCCACAAGGCTCTCGCGAGCCTCGAC,ATTTCAATCCACAAGGCTCTCGCGAGCCTCGAC,ATTTCAATCCACAAGGCTCTCGCGAGCCTCGAC	33,33,33	0	0	NA	NA	NA:NA:NA	21,21,20	21	Orphan	DEDDh,csa3,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,WYL,PD-DExK,DinG,RT	NA|232aa|up_0|NZ_CP022464.2_4783033_4783729_-,NA|235aa|down_2|NZ_CP022464.2_4791454_4792159_-	NA|330aa|up_9|NZ_CP022464.2_4772908_4773898_-	COG0444, DppD, ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|564aa|up_8|NZ_CP022464.2_4773958_4775650_-	cd08513, PBP2_thermophilic_Hb8_like, The substrate-binding component of ABC-type thermophilic oligopeptide-binding protein Hb8-like import systems, contains the type 2 periplasmic binding fold	NA|296aa|up_7|NZ_CP022464.2_4775712_4776600_-	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|320aa|up_6|NZ_CP022464.2_4776619_4777579_-	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|221aa|up_5|NZ_CP022464.2_4777702_4778365_+	COG1802, GntR, Transcriptional regulators [Transcription]	NA|325aa|up_4|NZ_CP022464.2_4778383_4779358_-	cd03465, URO-D_like, The URO-D _like protein superfamily includes bacterial and eukaryotic uroporphyrinogen decarboxylases (URO-D), coenzyme M methyltransferases and other putative bacterial methyltransferases	NA|369aa|up_3|NZ_CP022464.2_4779581_4780688_+	COG0006, PepP, Xaa-Pro aminopeptidase [Amino acid transport and metabolism]	NA|439aa|up_2|NZ_CP022464.2_4780751_4782068_-	PRK13342, PRK13342, recombination factor protein RarA; Reviewed	NA|158aa|up_1|NZ_CP022464.2_4782190_4782664_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|232aa|up_0|NZ_CP022464.2_4783033_4783729_-	NA	NA|1402aa|down_0|NZ_CP022464.2_4785597_4789803_-	COG1201, Lhr, Lhr-like helicases [General function prediction only]	NA|443aa|down_1|NZ_CP022464.2_4790108_4791437_+	cd06545, GH18_3CO4_chitinase, The Bacteroides thetaiotaomicron protein represented by pdb structure 3CO4 is an uncharacterized bacterial member of the family 18 glycosyl hydrolases with homologs found in Flavobacterium, Stigmatella, and Pseudomonas	NA|235aa|down_2|NZ_CP022464.2_4791454_4792159_-	NA	NA|547aa|down_3|NZ_CP022464.2_4792396_4794037_-	pfam00920, ILVD_EDD, Dehydratase family	NA|456aa|down_4|NZ_CP022464.2_4794020_4795388_-	COG3775, GatC, Phosphotransferase system, galactitol-specific IIC component [Carbohydrate transport and metabolism]	NA|93aa|down_5|NZ_CP022464.2_4795476_4795755_-	cd05566, PTS_IIB_galactitol, PTS_IIB_galactitol: subunit IIB of enzyme II (EII) of the galactitol-specific phosphoenolpyruvate:carbohydrate phosphotransferase system (PTS)	NA|158aa|down_6|NZ_CP022464.2_4795782_4796256_-	pfam00359, PTS_EIIA_2, Phosphoenolpyruvate-dependent sugar phosphotransferase system, EIIA 2	NA|213aa|down_7|NZ_CP022464.2_4796252_4796891_-	cd00452, KDPG_aldolase, KDPG and KHG aldolase	NA|993aa|down_8|NZ_CP022464.2_4796937_4799916_-	COG1221, PspF, Transcriptional regulators containing an AAA-type ATPase domain and a DNA-binding domain [Transcription / Signal transduction mechanisms]	NA|317aa|down_9|NZ_CP022464.2_4800014_4800965_-	COG2267, PldB, Lysophospholipase [Lipid metabolism]
GCF_002234575.2_ASM223457v2	NZ_CP022464	Enterocloster bolteae strain ATCC BAA-613 chromosome, complete genome	8	6339864-6339969	8	CRISPRCasFinder	no		DEDDh,csa3,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,WYL,PD-DExK,DinG,RT	Orphan	AGGATAGAGCGTCCGCCTCCTAAG	24	0	0	NA	NA	NA	1	1	Orphan	DEDDh,csa3,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,WYL,PD-DExK,DinG,RT	NA,NA|179aa|down_4|NZ_CP022464.2_6343370_6343907_-,NA|173aa|down_5|NZ_CP022464.2_6344014_6344533_-,NA|224aa|down_8|NZ_CP022464.2_6346839_6347511_-	NA|300aa|up_9|NZ_CP022464.2_6328544_6329444_+	cd10938, CE4_HpPgdA_like, Catalytic domain of Helicobacter pylori peptidoglycan deacetylase (HpPgdA) and similar proteins	NA|278aa|up_8|NZ_CP022464.2_6329440_6330274_+	cd10938, CE4_HpPgdA_like, Catalytic domain of Helicobacter pylori peptidoglycan deacetylase (HpPgdA) and similar proteins	NA|217aa|up_7|NZ_CP022464.2_6330437_6331088_-	pfam02586, SRAP, SOS response associated peptidase (SRAP)	NA|485aa|up_6|NZ_CP022464.2_6331214_6332669_-	COG1249, Lpd, Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide dehydrogenase (E3) component, and related enzymes [Energy production and conversion]	NA|343aa|up_5|NZ_CP022464.2_6332704_6333733_-	PRK03822, lplA, lipoate-protein ligase A; Provisional	NA|478aa|up_4|NZ_CP022464.2_6333857_6335291_-	PRK04366, PRK04366, aminomethyl-transferring glycine dehydrogenase subunit GcvPB	NA|456aa|up_3|NZ_CP022464.2_6335287_6336655_-	PRK00451, PRK00451, aminomethyl-transferring glycine dehydrogenase subunit GcvPA	NA|127aa|up_2|NZ_CP022464.2_6336707_6337088_-	PRK01202, PRK01202, glycine cleavage system protein GcvH	NA|370aa|up_1|NZ_CP022464.2_6337156_6338266_-	PRK00389, gcvT, glycine cleavage system aminomethyltransferase GcvT	NA|291aa|up_0|NZ_CP022464.2_6338862_6339735_+	pfam12997, DUF3881, Domain of unknown function, E	NA|487aa|down_0|NZ_CP022464.2_6340237_6341698_-	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|67aa|down_1|NZ_CP022464.2_6341717_6341918_-	pfam12728, HTH_17, Helix-turn-helix domain	NA|80aa|down_2|NZ_CP022464.2_6342456_6342696_-	pfam12645, HTH_16, Helix-turn-helix domain	NA|140aa|down_3|NZ_CP022464.2_6342692_6343112_-	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|179aa|down_4|NZ_CP022464.2_6343370_6343907_-	NA	NA|173aa|down_5|NZ_CP022464.2_6344014_6344533_-	NA	NA|473aa|down_6|NZ_CP022464.2_6344658_6346077_-	cd07341, M56_BlaR1_MecR1_like, Peptidase M56-like including those in BlaR1 and MecR1, integral membrane metallopeptidase	NA|138aa|down_7|NZ_CP022464.2_6346057_6346471_-	pfam03965, Penicillinase_R, Penicillinase repressor	NA|224aa|down_8|NZ_CP022464.2_6346839_6347511_-	NA	NA|404aa|down_9|NZ_CP022464.2_6347804_6349016_+	pfam01548, DEDD_Tnp_IS110, Transposase
