assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_900066015.1_Cfx-K	NZ_LN890655	Candidatus Promineofilum breve strain Cfx-K chromosome I	1	51589-51690	1	CRISPRCasFinder	no	csa3	csa3,cas4,DinG,cas2,cas1,WYL,cas3,cas8a4,cas7,cas5,cas6,cas10,csm3gr7,csx10gr5,csx19,csx15	Type I-A	TTCTTCCTCCCCTCCCGCCGGGAGGGGTTGGGGGAGGG	38	0	0	NA	NA	NA	1	1	Orphan	csa3,cas4,DinG,cas2,cas1,WYL,cas3,cas8a4,cas7,cas5,cas6,cas10,csm3gr7,csx10gr5,csx19,csx15	NA|474aa|up_8|NZ_LN890655.1_40711_42133_-,NA|427aa|up_0|NZ_LN890655.1_50301_51582_+,NA|130aa|down_1|NZ_LN890655.1_52277_52667_+	NA|402aa|up_9|NZ_LN890655.1_39473_40679_-	cd07207, Pat_ExoU_VipD_like, ExoU and VipD-like proteins; homologus to patatin, cPLA2, and iPLA2	NA|474aa|up_8|NZ_LN890655.1_40711_42133_-	NA	NA|374aa|up_7|NZ_LN890655.1_42144_43266_-	cd07487, Peptidases_S8_1, Peptidase S8 family domain, uncharacterized subfamily 1	NA|400aa|up_6|NZ_LN890655.1_43698_44898_+	PRK04073, rocD, ornithine--oxo-acid transaminase; Provisional	NA|418aa|up_5|NZ_LN890655.1_44894_46148_+	PRK09440, avtA, valine--pyruvate transaminase; Provisional	csa3|357aa|up_4|NZ_LN890655.1_46411_47482_-	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|447aa|up_3|NZ_LN890655.1_47561_48902_-	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|254aa|up_2|NZ_LN890655.1_49103_49865_-	pfam08241, Methyltransf_11, Methyltransferase domain	NA|121aa|up_1|NZ_LN890655.1_49861_50224_-	cd07245, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|427aa|up_0|NZ_LN890655.1_50301_51582_+	NA	NA|168aa|down_0|NZ_LN890655.1_51746_52250_+	pfam04306, DUF456, Protein of unknown function (DUF456)	NA|130aa|down_1|NZ_LN890655.1_52277_52667_+	NA	NA|1102aa|down_2|NZ_LN890655.1_52660_55966_-	TIGR02966, Phosphate_regulon_sensor_protein_PhoR, phosphate regulon sensor kinase PhoR	NA|164aa|down_3|NZ_LN890655.1_56097_56589_+	COG3889, COG3889, Predicted solute binding protein [General function prediction only]	NA|377aa|down_4|NZ_LN890655.1_56615_57746_+	cd05656, M42_Frv, M42 Peptidase, endoglucanases	NA|348aa|down_5|NZ_LN890655.1_57742_58786_+	cd05656, M42_Frv, M42 Peptidase, endoglucanases	NA|335aa|down_6|NZ_LN890655.1_58816_59821_+	cd05656, M42_Frv, M42 Peptidase, endoglucanases	NA|311aa|down_7|NZ_LN890655.1_59812_60745_-	COG0564, RluA, Pseudouridylate synthases, 23S RNA-specific [Translation, ribosomal structure and biogenesis]	NA|206aa|down_8|NZ_LN890655.1_60758_61376_-	pfam01252, Peptidase_A8, Signal peptidase (SPase) II	NA|223aa|down_9|NZ_LN890655.1_61417_62086_-	PRK08317, PRK08317, hypothetical protein; Provisional
GCF_900066015.1_Cfx-K	NZ_LN890655	Candidatus Promineofilum breve strain Cfx-K chromosome I	2	149365-149451	2	CRISPRCasFinder	no		csa3,cas4,DinG,cas2,cas1,WYL,cas3,cas8a4,cas7,cas5,cas6,cas10,csm3gr7,csx10gr5,csx19,csx15	Orphan	GGAGTGGAGCTGGAAGCTCTACC	23	0	0	NA	NA	NA	1	1	Orphan	csa3,cas4,DinG,cas2,cas1,WYL,cas3,cas8a4,cas7,cas5,cas6,cas10,csm3gr7,csx10gr5,csx19,csx15	NA|139aa|up_1|NZ_LN890655.1_147564_147981_-,NA	NA|136aa|up_9|NZ_LN890655.1_137351_137759_+	PRK00070, acpS, 4'-phosphopantetheinyl transferase; Provisional	NA|302aa|up_8|NZ_LN890655.1_137751_138657_+	TIGR03446, mycothiol_Mca, mycothiol conjugate amidase Mca	NA|312aa|up_7|NZ_LN890655.1_138707_139643_+	COG1506, DAP2, Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism]	NA|668aa|up_6|NZ_LN890655.1_139697_141701_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|630aa|up_5|NZ_LN890655.1_141791_143681_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|190aa|up_4|NZ_LN890655.1_143664_144234_-	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|254aa|up_3|NZ_LN890655.1_145294_146056_+	cd05350, SDR_c6, classical (c) SDR, subgroup 6	NA|491aa|up_2|NZ_LN890655.1_146070_147543_+	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]	NA|139aa|up_1|NZ_LN890655.1_147564_147981_-	NA	NA|367aa|up_0|NZ_LN890655.1_148061_149162_-	PRK11650, ugpC, sn-glycerol-3-phosphate ABC transporter ATP-binding protein UgpC	NA|113aa|down_0|NZ_LN890655.1_149457_149796_-	COG3677, COG3677, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|345aa|down_1|NZ_LN890655.1_149798_150833_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|213aa|down_2|NZ_LN890655.1_150915_151554_+	pfam00498, FHA, FHA domain	NA|452aa|down_3|NZ_LN890655.1_151619_152975_-	pfam14353, CpXC, CpXC protein	NA|536aa|down_4|NZ_LN890655.1_153042_154650_-	NF033092, HK_WalK, cell wall metabolism sensor histidine kinase WalK	NA|273aa|down_5|NZ_LN890655.1_154812_155631_+	TIGR02427, b-ketoadipate_enol-lactone_hydrolase, 3-oxoadipate enol-lactonase	NA|320aa|down_6|NZ_LN890655.1_155653_156613_+	cd02252, nylC_like, nylC-like family; composed of proteins with similarity to Flavobacterium endo-type 6-aminohexanoate-oligomer hydrolase (EIII), the product of the nylon oligomer degradation gene, nylC	NA|184aa|down_7|NZ_LN890655.1_156612_157164_+	COG1611, COG1611, Predicted Rossmann fold nucleotide-binding protein [General function prediction only]	NA|266aa|down_8|NZ_LN890655.1_157192_157990_+	cd04241, AAK_FomA-like, AAK_FomA-like: This CD includes a fosfomycin biosynthetic gene product, FomA, and similar proteins found in a wide range of organisms	NA|173aa|down_9|NZ_LN890655.1_158297_158816_+	PRK07571, PRK07571, bidirectional hydrogenase complex protein HoxE; Reviewed
GCF_900066015.1_Cfx-K	NZ_LN890655	Candidatus Promineofilum breve strain Cfx-K chromosome I	3	1317495-1317581	3	CRISPRCasFinder	no		csa3,cas4,DinG,cas2,cas1,WYL,cas3,cas8a4,cas7,cas5,cas6,cas10,csm3gr7,csx10gr5,csx19,csx15	Orphan	CCACTTCCGAAGTGGGTTGCACCTAA	26	0	0	NA	NA	NA	1	1	Orphan	csa3,cas4,DinG,cas2,cas1,WYL,cas3,cas8a4,cas7,cas5,cas6,cas10,csm3gr7,csx10gr5,csx19,csx15	NA|189aa|up_9|NZ_LN890655.1_1302897_1303464_+,NA|206aa|up_8|NZ_LN890655.1_1303460_1304078_+,NA|378aa|up_6|NZ_LN890655.1_1307359_1308493_+,NA|336aa|up_2|NZ_LN890655.1_1310985_1311993_+,NA|883aa|up_0|NZ_LN890655.1_1314738_1317387_+,NA|367aa|down_3|NZ_LN890655.1_1321629_1322730_+,NA|346aa|down_4|NZ_LN890655.1_1322868_1323906_+	NA|189aa|up_9|NZ_LN890655.1_1302897_1303464_+	NA	NA|206aa|up_8|NZ_LN890655.1_1303460_1304078_+	NA	NA|918aa|up_7|NZ_LN890655.1_1304483_1307237_+	PRK00252, alaS, alanyl-tRNA synthetase; Reviewed	NA|378aa|up_6|NZ_LN890655.1_1307359_1308493_+	NA	NA|162aa|up_5|NZ_LN890655.1_1308693_1309179_+	cd16964, YqgF, putative pre-16S rRNA nuclease YqgF and RuvX family	NA|395aa|up_4|NZ_LN890655.1_1309168_1310353_+	pfam02618, YceG, YceG-like family	NA|208aa|up_3|NZ_LN890655.1_1310333_1310957_+	cd06342, PBP1_ABC_LIVBP-like, type 1 periplasmic ligand-binding domain of ABC (Atpase Binding Cassette)-type active transport systems involved in the transport of all three branched chain aliphatic amino acids (leucine, isoleucine and valine)	NA|336aa|up_2|NZ_LN890655.1_1310985_1311993_+	NA	NA|858aa|up_1|NZ_LN890655.1_1311979_1314553_+	cd13401, Slt70-like, 70kDa soluble lytic transglycosylase (Slt70) and similar proteins	NA|883aa|up_0|NZ_LN890655.1_1314738_1317387_+	NA	NA|478aa|down_0|NZ_LN890655.1_1317590_1319024_+	PRK14338, PRK14338, (dimethylallyl)adenosine tRNA methylthiotransferase; Provisional	NA|435aa|down_1|NZ_LN890655.1_1319109_1320414_-	COG1721, COG1721, Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) [General function prediction only]	NA|322aa|down_2|NZ_LN890655.1_1320413_1321379_-	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|367aa|down_3|NZ_LN890655.1_1321629_1322730_+	NA	NA|346aa|down_4|NZ_LN890655.1_1322868_1323906_+	NA	NA|647aa|down_5|NZ_LN890655.1_1323909_1325850_+	PRK05644, gyrB, DNA gyrase subunit B; Validated	NA|505aa|down_6|NZ_LN890655.1_1325950_1327465_-	cd07808, FGGY_D-XK_EcXK-like, Escherichia coli xylulokinase-like D-xylulose kinases; a subgroup of the FGGY family of carbohydrate kinases	NA|389aa|down_7|NZ_LN890655.1_1327479_1328646_-	PRK12677, PRK12677, xylose isomerase; Provisional	NA|225aa|down_8|NZ_LN890655.1_1328862_1329537_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|462aa|down_9|NZ_LN890655.1_1329549_1330935_+	TIGR01386, Probable_sensor_protein_PcoS, heavy metal sensor kinase
GCF_900066015.1_Cfx-K	NZ_LN890655	Candidatus Promineofilum breve strain Cfx-K chromosome I	4	2109976-2110075	4	CRISPRCasFinder	no		csa3,cas4,DinG,cas2,cas1,WYL,cas3,cas8a4,cas7,cas5,cas6,cas10,csm3gr7,csx10gr5,csx19,csx15	Orphan	GAACTTCCCAGTTCCACCTTCCCCT	25	0	0	NA	NA	NA	1	1	Orphan	csa3,cas4,DinG,cas2,cas1,WYL,cas3,cas8a4,cas7,cas5,cas6,cas10,csm3gr7,csx10gr5,csx19,csx15	NA,NA	NA|186aa|up_9|NZ_LN890655.1_2100764_2101322_+	pfam03608, EII-GUT, PTS system enzyme II sorbitol-specific factor	NA|181aa|up_8|NZ_LN890655.1_2101351_2101894_+	COG3732, SrlE, Phosphotransferase system sorbitol-specific component IIBC [Carbohydrate transport and metabolism]	NA|128aa|up_7|NZ_LN890655.1_2101980_2102364_+	pfam03612, EIIBC-GUT_N, Sorbitol phosphotransferase enzyme II N-terminus	NA|120aa|up_6|NZ_LN890655.1_2102369_2102729_+	pfam03829, PTSIIA_gutA, PTS system glucitol/sorbitol-specific IIA component	NA|435aa|up_5|NZ_LN890655.1_2102735_2104040_+	COG4091, COG4091, Predicted homoserine dehydrogenase [Amino acid transport and metabolism]	NA|236aa|up_4|NZ_LN890655.1_2104058_2104766_+	pfam00596, Aldolase_II, Class II Aldolase and Adducin N-terminal domain	NA|336aa|up_3|NZ_LN890655.1_2104758_2105766_+	cd02000, TPP_E1_PDC_ADC_BCADC, Thiamine pyrophosphate (TPP) family, E1 of PDC_ADC_BCADC subfamily, TPP-binding module; composed of proteins similar to the E1 components of the human pyruvate dehydrogenase complex (PDC), the acetoin dehydrogenase complex (ADC) and the branched chain alpha-keto acid dehydrogenase/2-oxoisovalerate dehydrogenase complex (BCADC)	NA|329aa|up_2|NZ_LN890655.1_2105758_2106745_+	COG0022, AcoB, Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit [Energy production and conversion]	NA|462aa|up_1|NZ_LN890655.1_2106797_2108183_+	PRK11856, PRK11856, branched-chain alpha-keto acid dehydrogenase subunit E2; Reviewed	NA|571aa|up_0|NZ_LN890655.1_2108259_2109972_+	COG1080, PtsA, Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) [Carbohydrate transport and metabolism]	NA|276aa|down_0|NZ_LN890655.1_2110112_2110940_+	cd01639, IMPase, IMPase, inositol monophosphatase and related domains	NA|661aa|down_1|NZ_LN890655.1_2110944_2112927_+	PRK11388, PRK11388, DNA-binding transcriptional regulator DhaR; Provisional	NA|242aa|down_2|NZ_LN890655.1_2113344_2114070_+	COG0580, GlpF, Glycerol uptake facilitator and related permeases (Major Intrinsic Protein Family) [Carbohydrate transport and metabolism]	NA|359aa|down_3|NZ_LN890655.1_2114153_2115230_+	PRK11468, PRK11468, dihydroxyacetone kinase subunit DhaK; Provisional	NA|210aa|down_4|NZ_LN890655.1_2115233_2115863_+	PRK10005, PRK10005, dihydroxyacetone kinase ADP-binding subunit DhaL	NA|503aa|down_5|NZ_LN890655.1_2115974_2117483_+	cd07789, FGGY_CsGK_like, Cellulomonas sp	NA|387aa|down_6|NZ_LN890655.1_2117606_2118767_+	TIGR03377, glycerol3P_GlpA, glycerol-3-phosphate dehydrogenase, anaerobic, A subunit	NA|241aa|down_7|NZ_LN890655.1_2118814_2119537_+	COG0479, FrdB, Succinate dehydrogenase/fumarate reductase, Fe-S protein subunit [Energy production and conversion]	NA|135aa|down_8|NZ_LN890655.1_2119538_2119943_+	cd03501, SQR_TypeA_SdhC_like, Succinate:quinone oxidoreductase (SQR) Type A subfamily, Succinate dehydrogenase C (SdhC)-like subunit; SQR catalyzes the oxidation of succinate to fumarate coupled to the reduction of quinone to quinol	NA|132aa|down_9|NZ_LN890655.1_2119972_2120368_+	cd03500, SQR_TypeA_SdhD_like, Succinate:quinone oxidoreductase (SQR) Type A subfamily, Succinate dehydrogenase D (SdhD)-like subunit; SQR catalyzes the oxidation of succinate to fumarate coupled to the reduction of quinone to quinol
GCF_900066015.1_Cfx-K	NZ_LN890655	Candidatus Promineofilum breve strain Cfx-K chromosome I	5	2312626-2312793	5	CRISPRCasFinder	no		csa3,cas4,DinG,cas2,cas1,WYL,cas3,cas8a4,cas7,cas5,cas6,cas10,csm3gr7,csx10gr5,csx19,csx15	Orphan	GATTTAAACATGGATAAGTTCGTGAAAATCCGTTTTGTCCGTTTTAATCCGTAGC	55	0	0	NA	NA	NA	1	1	Orphan	csa3,cas4,DinG,cas2,cas1,WYL,cas3,cas8a4,cas7,cas5,cas6,cas10,csm3gr7,csx10gr5,csx19,csx15	NA,NA|48aa|down_5|NZ_LN890655.1_2317534_2317678_+,NA|55aa|down_6|NZ_LN890655.1_2317678_2317843_+	NA|166aa|up_9|NZ_LN890655.1_2302111_2302609_+	pfam03918, CcmH, Cytochrome C biogenesis protein	NA|195aa|up_8|NZ_LN890655.1_2302715_2303300_+	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|191aa|up_7|NZ_LN890655.1_2303320_2303893_+	pfam13240, zinc_ribbon_2, zinc-ribbon domain	NA|105aa|up_6|NZ_LN890655.1_2305911_2306226_+	TIGR02118, TIGR02118, conserved hypothetical protein	NA|169aa|up_5|NZ_LN890655.1_2306422_2306929_+	pfam02675, AdoMet_dc, S-adenosylmethionine decarboxylase	NA|331aa|up_4|NZ_LN890655.1_2307010_2308003_+	cd11592, Agmatinase_PAH, Agmatinase-like family includes proclavaminic acid amidinohydrolase	NA|326aa|up_3|NZ_LN890655.1_2307983_2308961_+	pfam01916, DS, Deoxyhypusine synthase	NA|382aa|up_2|NZ_LN890655.1_2309000_2310146_-	cd13590, PBP2_PotD_PotF_like, The periplasmic-binding component of ABC transporters involved in uptake of polyamines; possess the type 2 periplasmic binding fold	NA|244aa|up_1|NZ_LN890655.1_2310276_2311008_-	COG1177, PotC, ABC-type spermidine/putrescine transport system, permease component II [Amino acid transport and metabolism]	NA|313aa|up_0|NZ_LN890655.1_2311136_2312075_-	COG1176, PotB, ABC-type spermidine/putrescine transport system, permease component I [Amino acid transport and metabolism]	NA|402aa|down_0|NZ_LN890655.1_2312799_2314005_-	COG3842, PotA, ABC-type spermidine/putrescine transport systems, ATPase components [Amino acid transport and metabolism]	NA|654aa|down_1|NZ_LN890655.1_2314417_2316379_+	COG2414, COG2414, Aldehyde:ferredoxin oxidoreductase [Energy production and conversion]	NA|106aa|down_2|NZ_LN890655.1_2316441_2316759_+	pfam06296, RelE, RelE toxin of RelE / RelB toxin-antitoxin system	NA|104aa|down_3|NZ_LN890655.1_2316770_2317082_+	COG2944, COG2944, Predicted transcriptional regulator [Transcription]	NA|129aa|down_4|NZ_LN890655.1_2317124_2317511_+	cd05403, NT_KNTase_like, Nucleotidyltransferase (NT) domain of Staphylococcus aureus kanamycin nucleotidyltransferase, and similar proteins	NA|48aa|down_5|NZ_LN890655.1_2317534_2317678_+	NA	NA|55aa|down_6|NZ_LN890655.1_2317678_2317843_+	NA	NA|439aa|down_7|NZ_LN890655.1_2318044_2319361_+	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|925aa|down_8|NZ_LN890655.1_2319424_2322199_+	PRK14950, PRK14950, DNA polymerase III subunits gamma and tau; Provisional	NA|218aa|down_9|NZ_LN890655.1_2322336_2322990_+	COG0637, COG0637, Predicted phosphatase/phosphohexomutase [General function prediction only]
GCF_900066015.1_Cfx-K	NZ_LN890655	Candidatus Promineofilum breve strain Cfx-K chromosome I	6	2634317-2644769	1,6,1,2	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no	cas4,cas2,cas1,WYL,cas3,cas8a4,cas7,cas5,cas6	csa3,cas4,DinG,cas2,cas1,WYL,cas3,cas8a4,cas7,cas5,cas6,cas10,csm3gr7,csx10gr5,csx19,csx15	Unclear	GTTTCAGTACCCTCTATCGGGTCGAAGGCTGTGCAGC,GTTTCAGTACCCTCTATCGGGTCGAAGGCTGTGCAGC,GTTTCAGTACCCTCTATCGGGTCGAAGGCTGTGCAGC,GTTTCAGTACCCTCTATCGGGTCGAAGGCTGTGCAGC	37,37,37,37	1	1	2639771-2639805	NZ_LN890657.1_19293-19259	NA:NA:NA:NA	136,142,142,136	142	Unclear	csa3,cas4,DinG,cas2,cas1,WYL,cas3,cas8a4,cas7,cas5,cas6,cas10,csm3gr7,csx10gr5,csx19,csx15	NA|417aa|up_5|NZ_LN890655.1_2618280_2619531_+,NA|1326aa|up_4|NZ_LN890655.1_2619552_2623530_+,NA|730aa|up_2|NZ_LN890655.1_2625198_2627388_+,NA|868aa|up_0|NZ_LN890655.1_2631601_2634205_+,NA|79aa|down_3|NZ_LN890655.1_2647522_2647759_+,cas8a4|596aa|down_6|NZ_LN890655.1_2651748_2653536_+,cas5|241aa|down_8|NZ_LN890655.1_2654533_2655256_+,cas6|344aa|down_9|NZ_LN890655.1_2655286_2656318_+	NA|579aa|up_9|NZ_LN890655.1_2611330_2613067_+	COG0661, AarF, Predicted unusual protein kinase [General function prediction only]	NA|714aa|up_8|NZ_LN890655.1_2613120_2615262_+	pfam04020, Phage_holin_4_2, Mycobacterial 4 TMS phage holin, superfamily IV	NA|511aa|up_7|NZ_LN890655.1_2615258_2616791_+	COG4803, COG4803, Predicted membrane protein [Function unknown]	NA|315aa|up_6|NZ_LN890655.1_2616787_2617732_+	cd05271, NDUFA9_like_SDR_a, NADH dehydrogenase (ubiquinone) 1 alpha subcomplex, subunit 9, 39 kDa, (NDUFA9) -like, atypical (a) SDRs	NA|417aa|up_5|NZ_LN890655.1_2618280_2619531_+	NA	NA|1326aa|up_4|NZ_LN890655.1_2619552_2623530_+	NA	NA|518aa|up_3|NZ_LN890655.1_2623560_2625114_+	COG4675, MdpB, Microcystin-dependent protein [Function unknown]	NA|730aa|up_2|NZ_LN890655.1_2625198_2627388_+	NA	NA|1384aa|up_1|NZ_LN890655.1_2627416_2631568_+	PRK14951, PRK14951, DNA polymerase III subunits gamma and tau; Provisional	NA|868aa|up_0|NZ_LN890655.1_2631601_2634205_+	NA	cas4|204aa|down_0|NZ_LN890655.1_2644982_2645594_-	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas2|92aa|down_1|NZ_LN890655.1_2645593_2645869_-	pfam09827, CRISPR_Cas2, CRISPR associated protein Cas2	cas1|348aa|down_2|NZ_LN890655.1_2646120_2647164_-	cd09634, Cas1_I-II-III, CRISPR/Cas system-associated protein Cas1	NA|79aa|down_3|NZ_LN890655.1_2647522_2647759_+	NA	WYL|352aa|down_4|NZ_LN890655.1_2647806_2648862_+	pfam13280, WYL, WYL domain	cas3|818aa|down_5|NZ_LN890655.1_2648861_2651315_+	TIGR01587, CRISPR-associated_endonuclease/helicase_Cas3, CRISPR-associated helicase Cas3	cas8a4|596aa|down_6|NZ_LN890655.1_2651748_2653536_+	NA	cas7|326aa|down_7|NZ_LN890655.1_2653549_2654527_+	pfam01905, DevR, CRISPR-associated negative auto-regulator DevR/Csa2	cas5|241aa|down_8|NZ_LN890655.1_2654533_2655256_+	NA	cas6|344aa|down_9|NZ_LN890655.1_2655286_2656318_+	NA
GCF_900066015.1_Cfx-K	NZ_LN890655	Candidatus Promineofilum breve strain Cfx-K chromosome I	7	3485761-3487940	7,2,3	CRISPRCasFinder,CRT,PILER-CR	no	csm3gr7,csx19,csx15,cas6	csa3,cas4,DinG,cas2,cas1,WYL,cas3,cas8a4,cas7,cas5,cas6,cas10,csm3gr7,csx10gr5,csx19,csx15	Type III-A	CGCGGCCGCAGCCTGTTTGCCTTTGAGGGATTGAAAC,CGCGGCCGNNGCCTGTTTGCCTTTGAGGGATTGAAAC,GCCTGTTTGCCTTTGAGGGATTGAAAC	37,37,27	0	0	NA	NA	NA:NA:NA	29,29,28	29	TypeIII-A	csa3,cas4,DinG,cas2,cas1,WYL,cas3,cas8a4,cas7,cas5,cas6,cas10,csm3gr7,csx10gr5,csx19,csx15	NA|145aa|up_8|NZ_LN890655.1_3478336_3478771_+,NA|220aa|down_1|NZ_LN890655.1_3488739_3489399_+,NA|635aa|down_4|NZ_LN890655.1_3496661_3498566_+,NA|1031aa|down_6|NZ_LN890655.1_3500508_3503601_+,NA|130aa|down_8|NZ_LN890655.1_3504787_3505177_-	NA|417aa|up_9|NZ_LN890655.1_3476820_3478071_-	PRK02862, glgC, glucose-1-phosphate adenylyltransferase; Provisional	NA|145aa|up_8|NZ_LN890655.1_3478336_3478771_+	NA	NA|345aa|up_7|NZ_LN890655.1_3478791_3479826_+	COG2382, Fes, Enterochelin esterase and related enzymes [Inorganic ion transport and metabolism]	NA|351aa|up_6|NZ_LN890655.1_3479943_3480996_-	cd06853, GT_WecA_like, This subfamily contains Escherichia coli WecA, Bacillus subtilis TagO and related proteins	NA|207aa|up_5|NZ_LN890655.1_3481014_3481635_-	PRK00129, upp, uracil phosphoribosyltransferase; Reviewed	NA|355aa|up_4|NZ_LN890655.1_3481799_3482864_+	PRK00080, ruvB, Holliday junction branch migration DNA helicase RuvB	NA|76aa|up_3|NZ_LN890655.1_3482870_3483098_+	pfam11146, DUF2905, Protein of unknown function (DUF2905)	NA|278aa|up_2|NZ_LN890655.1_3483141_3483975_+	COG1922, WecG, Teichoic acid biosynthesis proteins [Cell envelope biogenesis, outer membrane]	NA|323aa|up_1|NZ_LN890655.1_3484045_3485014_+	cd05256, UDP_AE_SDR_e, UDP-N-acetylglucosamine 4-epimerase, extended (e) SDRs	NA|151aa|up_0|NZ_LN890655.1_3485014_3485467_-	PRK10860, PRK10860, tRNA-specific adenosine deaminase; Provisional	NA|213aa|down_0|NZ_LN890655.1_3488059_3488698_+	COG1428, COG1428, Deoxynucleoside kinases [Nucleotide transport and metabolism]	NA|220aa|down_1|NZ_LN890655.1_3488739_3489399_+	NA	NA|634aa|down_2|NZ_LN890655.1_3489550_3491452_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|1577aa|down_3|NZ_LN890655.1_3491771_3496502_+	cd11304, Cadherin_repeat, Cadherin tandem repeat domain	NA|635aa|down_4|NZ_LN890655.1_3496661_3498566_+	NA	NA|567aa|down_5|NZ_LN890655.1_3498715_3500416_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|1031aa|down_6|NZ_LN890655.1_3500508_3503601_+	NA	NA|326aa|down_7|NZ_LN890655.1_3503666_3504644_+	PRK00236, xerC, site-specific tyrosine recombinase XerC; Reviewed	NA|130aa|down_8|NZ_LN890655.1_3504787_3505177_-	NA	cas3|1016aa|down_9|NZ_LN890655.1_3505192_3508240_-	COG1197, Mfd, Transcription-repair coupling factor (superfamily II helicase) [DNA replication, recombination, and repair / Transcription]
GCF_900066015.1_Cfx-K	NZ_LN890656	Candidatus Promineofilum breve strain Cfx-K chromosome II	1	127459-127602	1	CRISPRCasFinder	no		csa3	Orphan	GGATTGACACGGATTCGACGGATTAAAACGGATAGCTCGATCCGTGTT	48	0	0	NA	NA	NA	1	1	Orphan	csa3,cas4,DinG,cas2,cas1,WYL,cas3,cas8a4,cas7,cas5,cas6,cas10,csm3gr7,csx10gr5,csx19,csx15	NA|72aa|up_9|NZ_LN890656.1_104122_104338_-,NA	NA|72aa|up_9|NZ_LN890656.1_104122_104338_-	NA	NA|1161aa|up_8|NZ_LN890656.1_105020_108503_+	COG0317, SpoT, Guanosine polyphosphate pyrophosphohydrolases/synthetases [Signal transduction mechanisms / Transcription]	NA|479aa|up_7|NZ_LN890656.1_108531_109968_+	smart00862, Trans_reg_C, Transcriptional regulatory protein, C terminal	NA|142aa|up_6|NZ_LN890656.1_110029_110455_-	pfam02657, SufE, Fe-S metabolism associated domain	NA|1235aa|up_5|NZ_LN890656.1_110833_114538_-	cd05819, NHL, NHL repeat unit of beta-propeller proteins	NA|149aa|up_4|NZ_LN890656.1_114646_115093_-	pfam13490, zf-HC2, Putative zinc-finger	NA|202aa|up_3|NZ_LN890656.1_115375_115981_-	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|691aa|up_2|NZ_LN890656.1_116407_118480_-	TIGR03663, TIGR03663, TIGR03663 family protein	NA|989aa|up_1|NZ_LN890656.1_118482_121449_-	TIGR03662, Chlor_Arch_YYY, Chlor_Arch_YYY domain	NA|1798aa|up_0|NZ_LN890656.1_121926_127320_-	TIGR03662, Chlor_Arch_YYY, Chlor_Arch_YYY domain	NA|238aa|down_0|NZ_LN890656.1_127671_128385_-	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|1049aa|down_1|NZ_LN890656.1_128512_131659_-	COG4889, COG4889, Predicted helicase [General function prediction only]	NA|341aa|down_2|NZ_LN890656.1_131754_132777_-	pfam01061, ABC2_membrane, ABC-2 type transporter	NA|241aa|down_3|NZ_LN890656.1_132773_133496_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|194aa|down_4|NZ_LN890656.1_133602_134184_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|1167aa|down_5|NZ_LN890656.1_134339_137840_-	COG1807, ArnT, 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family [Cell envelope biogenesis, outer membrane]	NA|261aa|down_6|NZ_LN890656.1_137983_138766_+	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|307aa|down_7|NZ_LN890656.1_138860_139781_+	cd01558, D-AAT_like, D-Alanine aminotransferase (D-AAT_like): D-amino acid aminotransferase catalyzes transamination between D-amino acids and their respective alpha-keto acids	NA|199aa|down_8|NZ_LN890656.1_139991_140588_+	PRK05500, PRK05500, bifunctional orotidine-5'-phosphate decarboxylase/orotate phosphoribosyltransferase	NA|917aa|down_9|NZ_LN890656.1_140662_143413_+	COG1042, COG1042, Acyl-CoA synthetase (NDP forming) [Energy production and conversion]
GCF_900066015.1_Cfx-K	NZ_LN890656	Candidatus Promineofilum breve strain Cfx-K chromosome II	2	286565-286677	2	CRISPRCasFinder	no		csa3	Orphan	ATTCTTAACGTAACCAAATTGGCTACGGA	29	0	0	NA	NA	NA	1	1	Orphan	csa3,cas4,DinG,cas2,cas1,WYL,cas3,cas8a4,cas7,cas5,cas6,cas10,csm3gr7,csx10gr5,csx19,csx15	NA|715aa|up_5|NZ_LN890656.1_276998_279143_-,NA|209aa|up_4|NZ_LN890656.1_279453_280080_-,NA|241aa|up_1|NZ_LN890656.1_284970_285693_-,NA|204aa|up_0|NZ_LN890656.1_285863_286475_-,NA|92aa|down_1|NZ_LN890656.1_290009_290285_+,NA|142aa|down_3|NZ_LN890656.1_291794_292220_+	NA|224aa|up_9|NZ_LN890656.1_274115_274787_+	pfam10881, DUF2726, Protein of unknown function (DUF2726)	NA|448aa|up_8|NZ_LN890656.1_274896_276240_-	pfam00067, p450, Cytochrome P450	NA|89aa|up_7|NZ_LN890656.1_276242_276509_-	pfam15738, YafQ_toxin, Bacterial toxin of type II toxin-antitoxin system, YafQ	NA|87aa|up_6|NZ_LN890656.1_276508_276769_-	TIGR02384, Putative_antitoxin_RelB, addiction module antitoxin, RelB/DinJ family	NA|715aa|up_5|NZ_LN890656.1_276998_279143_-	NA	NA|209aa|up_4|NZ_LN890656.1_279453_280080_-	NA	NA|900aa|up_3|NZ_LN890656.1_281071_283771_-	TIGR03311, Se_dep_XDH, selenium-dependent xanthine dehydrogenase	NA|168aa|up_2|NZ_LN890656.1_284225_284729_-	pfam01797, Y1_Tnp, Transposase IS200 like	NA|241aa|up_1|NZ_LN890656.1_284970_285693_-	NA	NA|204aa|up_0|NZ_LN890656.1_285863_286475_-	NA	NA|1048aa|down_0|NZ_LN890656.1_286738_289882_-	COG0587, DnaE, DNA polymerase III, alpha subunit [DNA replication, recombination, and repair]	NA|92aa|down_1|NZ_LN890656.1_290009_290285_+	NA	NA|446aa|down_2|NZ_LN890656.1_290319_291657_-	pfam13546, DDE_5, DDE superfamily endonuclease	NA|142aa|down_3|NZ_LN890656.1_291794_292220_+	NA	NA|319aa|down_4|NZ_LN890656.1_292317_293274_+	COG0142, IspA, Geranylgeranyl pyrophosphate synthase [Coenzyme metabolism]	NA|450aa|down_5|NZ_LN890656.1_293353_294703_-	COG2239, MgtE, Mg/Co/Ni transporter MgtE (contains CBS domain) [Inorganic ion transport and metabolism]	NA|287aa|down_6|NZ_LN890656.1_295041_295902_-	pfam12679, ABC2_membrane_2, ABC-2 family transporter protein	NA|297aa|down_7|NZ_LN890656.1_295916_296807_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|96aa|down_8|NZ_LN890656.1_296796_297084_-	COG2442, COG2442, Uncharacterized conserved protein [Function unknown]	NA|139aa|down_9|NZ_LN890656.1_297143_297560_-	COG1832, COG1832, Predicted CoA-binding protein [General function prediction only]
GCF_900066015.1_Cfx-K	NZ_LN890656	Candidatus Promineofilum breve strain Cfx-K chromosome II	3	404623-404736	3	CRISPRCasFinder	no		csa3	Orphan	CCTCTCCCAGGGGGAGAGGGACCAGAG	27	1	1	404650-404709	NZ_LN890656.1_404607-404666	NA	1	1	Orphan	csa3,cas4,DinG,cas2,cas1,WYL,cas3,cas8a4,cas7,cas5,cas6,cas10,csm3gr7,csx10gr5,csx19,csx15	NA|177aa|up_7|NZ_LN890656.1_397706_398237_-,NA|211aa|up_0|NZ_LN890656.1_403925_404558_+,NA	NA|270aa|up_9|NZ_LN890656.1_395756_396566_-	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|294aa|up_8|NZ_LN890656.1_396640_397522_-	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|177aa|up_7|NZ_LN890656.1_397706_398237_-	NA	NA|243aa|up_6|NZ_LN890656.1_398452_399181_+	pfam04029, 2-ph_phosp, 2-phosphosulpholactate phosphatase	NA|164aa|up_5|NZ_LN890656.1_399185_399677_+	PRK13291, PRK13291, putative metal-dependent hydrolase	NA|297aa|up_4|NZ_LN890656.1_399673_400564_+	COG0456, RimI, Acetyltransferases [General function prediction only]	NA|137aa|up_3|NZ_LN890656.1_400595_401006_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|414aa|up_2|NZ_LN890656.1_401017_402259_+	cd17507, GT28_Beta-DGS-like, beta-diglucosyldiacylglycerol synthase and similar proteins	NA|457aa|up_1|NZ_LN890656.1_402258_403629_+	TIGR03356, BGL, beta-galactosidase	NA|211aa|up_0|NZ_LN890656.1_403925_404558_+	NA	NA|716aa|down_0|NZ_LN890656.1_404946_407094_+	COG3889, COG3889, Predicted solute binding protein [General function prediction only]	NA|256aa|down_1|NZ_LN890656.1_407169_407937_-	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|292aa|down_2|NZ_LN890656.1_407954_408830_-	COG0765, HisM, ABC-type amino acid transport system, permease component [Amino acid transport and metabolism]	NA|286aa|down_3|NZ_LN890656.1_408903_409761_-	cd13713, PBP2_Cystine_like_1, Substrate binding domain of putative ABC transporters involved in cystine import; the type 2 periplasmic binding protein fold	NA|222aa|down_4|NZ_LN890656.1_409991_410657_-	cd00429, RPE, Ribulose-5-phosphate 3-epimerase (RPE)	NA|329aa|down_5|NZ_LN890656.1_410789_411776_-	PRK09479, glpX, fructose 1,6-bisphosphatase II; Reviewed	NA|238aa|down_6|NZ_LN890656.1_411914_412628_-	COG2020, STE14, Putative protein-S-isoprenylcysteine methyltransferase [Posttranslational modification, protein turnover, chaperones]	NA|237aa|down_7|NZ_LN890656.1_412747_413458_-	COG2020, STE14, Putative protein-S-isoprenylcysteine methyltransferase [Posttranslational modification, protein turnover, chaperones]	NA|369aa|down_8|NZ_LN890656.1_413480_414587_-	COG1108, ZnuB, ABC-type Mn2+/Zn2+ transport systems, permease components [Inorganic ion transport and metabolism]	NA|379aa|down_9|NZ_LN890656.1_414583_415720_-	pfam00950, ABC-3, ABC 3 transport family
GCF_900066015.1_Cfx-K	NZ_LN890656	Candidatus Promineofilum breve strain Cfx-K chromosome II	4	480507-480635	4	CRISPRCasFinder	no		csa3	Orphan	AGTAGCCAATTCTTCATCGTCTAAGAATTGGCTACGGA	38	0	0	NA	NA	NA	1	1	Orphan	csa3,cas4,DinG,cas2,cas1,WYL,cas3,cas8a4,cas7,cas5,cas6,cas10,csm3gr7,csx10gr5,csx19,csx15	NA|362aa|up_7|NZ_LN890656.1_469524_470610_+,NA|216aa|up_5|NZ_LN890656.1_471863_472511_+,NA|815aa|up_4|NZ_LN890656.1_473291_475736_+,NA|257aa|up_1|NZ_LN890656.1_478414_479185_+,NA|172aa|down_3|NZ_LN890656.1_482998_483514_-	NA|304aa|up_9|NZ_LN890656.1_467100_468012_-	cd09084, EEP-2, Exonuclease-Endonuclease-Phosphatase (EEP) domain superfamily; uncharacterized family 2	NA|328aa|up_8|NZ_LN890656.1_468174_469158_-	cd12827, EcCorA_ZntB-like_u2, uncharacterized bacterial subfamily of the Escherichia coli CorA-Salmonella typhimurium ZntB family	NA|362aa|up_7|NZ_LN890656.1_469524_470610_+	NA	NA|380aa|up_6|NZ_LN890656.1_470697_471837_+	PRK09435, PRK09435, methylmalonyl Co-A mutase-associated GTPase MeaB	NA|216aa|up_5|NZ_LN890656.1_471863_472511_+	NA	NA|815aa|up_4|NZ_LN890656.1_473291_475736_+	NA	NA|283aa|up_3|NZ_LN890656.1_475953_476802_+	cd03378, beta_CA_cladeC, Carbonic anhydrases (CA) are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism in which the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide is followed by the regeneration of an active site by ionization of the zinc-bound water molecule and removal of a proton from the active site	NA|404aa|up_2|NZ_LN890656.1_476954_478166_-	pfam14399, BtrH_N, Butirosin biosynthesis protein H, N-terminal	NA|257aa|up_1|NZ_LN890656.1_478414_479185_+	NA	NA|388aa|up_0|NZ_LN890656.1_479221_480385_+	cd01158, SCAD_SBCAD, Short chain acyl-CoA dehydrogenases and eukaryotic short/branched chain acyl-CoA dehydrogenases	NA|271aa|down_0|NZ_LN890656.1_480709_481522_-	cd05233, SDR_c, classical (c) SDRs	NA|88aa|down_1|NZ_LN890656.1_481695_481959_-	pfam02325, YGGT, YGGT family	NA|248aa|down_2|NZ_LN890656.1_482060_482804_-	cd00635, PLPDE_III_YBL036c_like, Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzymes, YBL036c-like proteins	NA|172aa|down_3|NZ_LN890656.1_482998_483514_-	NA	NA|480aa|down_4|NZ_LN890656.1_483689_485129_-	COG2133, COG2133, Glucose/sorbosone dehydrogenases [Carbohydrate transport and metabolism]	NA|146aa|down_5|NZ_LN890656.1_485671_486109_-	PRK05273, PRK05273, D-tyrosyl-tRNA(Tyr) deacylase; Provisional	NA|356aa|down_6|NZ_LN890656.1_486188_487256_-	TIGR01208, rmlA_long, glucose-1-phosphate thymidylylransferase, long form	NA|216aa|down_7|NZ_LN890656.1_487452_488100_-	cd05387, BY-kinase, bacterial tyrosine-kinase	NA|217aa|down_8|NZ_LN890656.1_488192_488843_-	COG3944, COG3944, Capsular polysaccharide biosynthesis protein [Cell envelope biogenesis, outer membrane]	NA|245aa|down_9|NZ_LN890656.1_488881_489616_-	COG3944, COG3944, Capsular polysaccharide biosynthesis protein [Cell envelope biogenesis, outer membrane]
GCF_900066015.1_Cfx-K	NZ_LN890656	Candidatus Promineofilum breve strain Cfx-K chromosome II	5	565967-566143	5	CRISPRCasFinder	no		csa3	Orphan	GTTGCCGGAGAGGGTGCTATTGA	23	0	0	NA	NA	NA	2	2	Orphan	csa3,cas4,DinG,cas2,cas1,WYL,cas3,cas8a4,cas7,cas5,cas6,cas10,csm3gr7,csx10gr5,csx19,csx15	NA|65aa|up_6|NZ_LN890656.1_555845_556040_+,NA|753aa|up_5|NZ_LN890656.1_556277_558536_+,NA|521aa|up_3|NZ_LN890656.1_559040_560603_+,NA|62aa|down_0|NZ_LN890656.1_567013_567199_+	NA|229aa|up_9|NZ_LN890656.1_551234_551921_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|543aa|up_8|NZ_LN890656.1_552019_553648_-	cd07099, ALDH_DDALDH, Methylomonas sp	NA|625aa|up_7|NZ_LN890656.1_553661_555536_-	COG2303, BetA, Choline dehydrogenase and related flavoproteins [Amino acid transport and metabolism]	NA|65aa|up_6|NZ_LN890656.1_555845_556040_+	NA	NA|753aa|up_5|NZ_LN890656.1_556277_558536_+	NA	NA|129aa|up_4|NZ_LN890656.1_558657_559044_+	cd07246, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|521aa|up_3|NZ_LN890656.1_559040_560603_+	NA	NA|751aa|up_2|NZ_LN890656.1_560871_563124_+	COG2208, RsbU, Serine phosphatase RsbU, regulator of sigma subunit [Signal transduction mechanisms / Transcription]	NA|87aa|up_1|NZ_LN890656.1_563193_563454_+	pfam02604, PhdYeFM_antitox, Antitoxin Phd_YefM, type II toxin-antitoxin system	NA|86aa|up_0|NZ_LN890656.1_563456_563714_+	COG2026, RelE, Cytotoxic translational repressor of toxin-antitoxin stability system [Translation, ribosomal structure and biogenesis / Cell division and chromosome partitioning]	NA|62aa|down_0|NZ_LN890656.1_567013_567199_+	NA	NA|344aa|down_1|NZ_LN890656.1_567175_568207_-	cd09084, EEP-2, Exonuclease-Endonuclease-Phosphatase (EEP) domain superfamily; uncharacterized family 2	NA|182aa|down_2|NZ_LN890656.1_568414_568960_-	COG2318, DinB, Uncharacterized protein conserved in bacteria [Function unknown]	NA|385aa|down_3|NZ_LN890656.1_569129_570284_-	COG1506, DAP2, Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism]	NA|256aa|down_4|NZ_LN890656.1_570598_571366_-	cd03145, GAT1_cyanophycinase, Type 1 glutamine amidotransferase (GATase1)-like domain found in cyanophycinase	NA|620aa|down_5|NZ_LN890656.1_571355_573215_-	COG2208, RsbU, Serine phosphatase RsbU, regulator of sigma subunit [Signal transduction mechanisms / Transcription]	NA|145aa|down_6|NZ_LN890656.1_573231_573666_-	pfam13581, HATPase_c_2, Histidine kinase-like ATPase domain	NA|707aa|down_7|NZ_LN890656.1_573668_575789_-	cd11326, AmyAc_Glg_debranch, Alpha amylase catalytic domain found in glycogen debranching enzymes	NA|470aa|down_8|NZ_LN890656.1_575827_577237_-	cd16943, HATPase_AtoS-like, Histidine kinase-like ATPase domain of two-component sensor histidine kinases similar to Escherichia coli K-12 AtoS	NA|338aa|down_9|NZ_LN890656.1_577377_578391_-	COG0456, RimI, Acetyltransferases [General function prediction only]
GCF_900066015.1_Cfx-K	NZ_LN890656	Candidatus Promineofilum breve strain Cfx-K chromosome II	6	566273-566373	6	CRISPRCasFinder	no		csa3	Orphan	GTTGCCGGAGAGGGTGCTATTGA	23	0	0	NA	NA	NA	1	1	Orphan	csa3,cas4,DinG,cas2,cas1,WYL,cas3,cas8a4,cas7,cas5,cas6,cas10,csm3gr7,csx10gr5,csx19,csx15	NA|65aa|up_6|NZ_LN890656.1_555845_556040_+,NA|753aa|up_5|NZ_LN890656.1_556277_558536_+,NA|521aa|up_3|NZ_LN890656.1_559040_560603_+,NA|62aa|down_0|NZ_LN890656.1_567013_567199_+	NA|229aa|up_9|NZ_LN890656.1_551234_551921_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|543aa|up_8|NZ_LN890656.1_552019_553648_-	cd07099, ALDH_DDALDH, Methylomonas sp	NA|625aa|up_7|NZ_LN890656.1_553661_555536_-	COG2303, BetA, Choline dehydrogenase and related flavoproteins [Amino acid transport and metabolism]	NA|65aa|up_6|NZ_LN890656.1_555845_556040_+	NA	NA|753aa|up_5|NZ_LN890656.1_556277_558536_+	NA	NA|129aa|up_4|NZ_LN890656.1_558657_559044_+	cd07246, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|521aa|up_3|NZ_LN890656.1_559040_560603_+	NA	NA|751aa|up_2|NZ_LN890656.1_560871_563124_+	COG2208, RsbU, Serine phosphatase RsbU, regulator of sigma subunit [Signal transduction mechanisms / Transcription]	NA|87aa|up_1|NZ_LN890656.1_563193_563454_+	pfam02604, PhdYeFM_antitox, Antitoxin Phd_YefM, type II toxin-antitoxin system	NA|86aa|up_0|NZ_LN890656.1_563456_563714_+	COG2026, RelE, Cytotoxic translational repressor of toxin-antitoxin stability system [Translation, ribosomal structure and biogenesis / Cell division and chromosome partitioning]	NA|62aa|down_0|NZ_LN890656.1_567013_567199_+	NA	NA|344aa|down_1|NZ_LN890656.1_567175_568207_-	cd09084, EEP-2, Exonuclease-Endonuclease-Phosphatase (EEP) domain superfamily; uncharacterized family 2	NA|182aa|down_2|NZ_LN890656.1_568414_568960_-	COG2318, DinB, Uncharacterized protein conserved in bacteria [Function unknown]	NA|385aa|down_3|NZ_LN890656.1_569129_570284_-	COG1506, DAP2, Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism]	NA|256aa|down_4|NZ_LN890656.1_570598_571366_-	cd03145, GAT1_cyanophycinase, Type 1 glutamine amidotransferase (GATase1)-like domain found in cyanophycinase	NA|620aa|down_5|NZ_LN890656.1_571355_573215_-	COG2208, RsbU, Serine phosphatase RsbU, regulator of sigma subunit [Signal transduction mechanisms / Transcription]	NA|145aa|down_6|NZ_LN890656.1_573231_573666_-	pfam13581, HATPase_c_2, Histidine kinase-like ATPase domain	NA|707aa|down_7|NZ_LN890656.1_573668_575789_-	cd11326, AmyAc_Glg_debranch, Alpha amylase catalytic domain found in glycogen debranching enzymes	NA|470aa|down_8|NZ_LN890656.1_575827_577237_-	cd16943, HATPase_AtoS-like, Histidine kinase-like ATPase domain of two-component sensor histidine kinases similar to Escherichia coli K-12 AtoS	NA|338aa|down_9|NZ_LN890656.1_577377_578391_-	COG0456, RimI, Acetyltransferases [General function prediction only]
GCF_900066015.1_Cfx-K	NZ_LN890656	Candidatus Promineofilum breve strain Cfx-K chromosome II	7	624008-624160	7	CRISPRCasFinder	no		csa3	Orphan	CAATCCGTTTTCATCCGTCCCATCCGTGTTAATCCGTGGCTAAATGATTAA	51	0	0	NA	NA	NA	1	1	Orphan	csa3,cas4,DinG,cas2,cas1,WYL,cas3,cas8a4,cas7,cas5,cas6,cas10,csm3gr7,csx10gr5,csx19,csx15	NA|72aa|up_6|NZ_LN890656.1_617602_617818_-,NA|92aa|up_2|NZ_LN890656.1_622005_622281_-,NA|140aa|up_0|NZ_LN890656.1_623491_623911_-,NA|215aa|down_3|NZ_LN890656.1_629534_630179_-	NA|468aa|up_9|NZ_LN890656.1_615050_616454_+	cd17332, MFS_MelB_like, Salmonella enterica Na+/melibiose symporter MelB and similar transporters of the Major Facilitator Superfamily	NA|266aa|up_8|NZ_LN890656.1_616530_617328_-	PRK00346, surE, 5'(3')-nucleotidase/polyphosphatase; Provisional	NA|89aa|up_7|NZ_LN890656.1_617339_617606_-	COG2026, RelE, Cytotoxic translational repressor of toxin-antitoxin stability system [Translation, ribosomal structure and biogenesis / Cell division and chromosome partitioning]	NA|72aa|up_6|NZ_LN890656.1_617602_617818_-	NA	NA|627aa|up_5|NZ_LN890656.1_617905_619786_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|597aa|up_4|NZ_LN890656.1_619782_621573_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|127aa|up_3|NZ_LN890656.1_621612_621993_-	pfam14079, DUF4260, Domain of unknown function (DUF4260)	NA|92aa|up_2|NZ_LN890656.1_622005_622281_-	NA	NA|300aa|up_1|NZ_LN890656.1_622297_623197_-	TIGR03709, PPK2_rel_1, polyphosphate:nucleotide phosphotransferase, PPK2 family	NA|140aa|up_0|NZ_LN890656.1_623491_623911_-	NA	NA|806aa|down_0|NZ_LN890656.1_624168_626586_-	pfam03724, META, META domain	NA|623aa|down_1|NZ_LN890656.1_626768_628637_-	pfam03724, META, META domain	NA|252aa|down_2|NZ_LN890656.1_628771_629527_-	COG1611, COG1611, Predicted Rossmann fold nucleotide-binding protein [General function prediction only]	NA|215aa|down_3|NZ_LN890656.1_629534_630179_-	NA	NA|809aa|down_4|NZ_LN890656.1_630448_632875_+	pfam13229, Beta_helix, Right handed beta helix region	NA|469aa|down_5|NZ_LN890656.1_633137_634544_+	PRK04208, rbcL, ribulose bisophosphate carboxylase; Reviewed	NA|323aa|down_6|NZ_LN890656.1_634721_635690_+	PRK07429, PRK07429, phosphoribulokinase; Provisional	NA|356aa|down_7|NZ_LN890656.1_636835_637903_+	smart00089, PKD, Repeats in polycystic kidney disease 1 (PKD1) and other proteins	NA|317aa|down_8|NZ_LN890656.1_638397_639348_+	pfam01116, F_bP_aldolase, Fructose-bisphosphate aldolase class-II	NA|330aa|down_9|NZ_LN890656.1_639625_640615_-	pfam02371, Transposase_20, Transposase IS116/IS110/IS902 family
GCF_900066015.1_Cfx-K	NZ_LN890656	Candidatus Promineofilum breve strain Cfx-K chromosome II	8	748231-748461	1	PILER-CR	no		csa3	Orphan	ATTTCGCGCCCGCCCTTCAGTATCGGCCCGTAGGTGGAGCTTCCAGCTCCACC	53	1	1	748284-748322	NZ_LN890656.1_748472-748510	NA	2	2	Orphan	csa3,cas4,DinG,cas2,cas1,WYL,cas3,cas8a4,cas7,cas5,cas6,cas10,csm3gr7,csx10gr5,csx19,csx15	NA,NA|746aa|down_1|NZ_LN890656.1_750631_752869_-,NA|527aa|down_7|NZ_LN890656.1_758908_760489_-	NA|198aa|up_9|NZ_LN890656.1_736092_736686_-	cd06587, VOC, vicinal oxygen chelate (VOC) family	NA|597aa|up_8|NZ_LN890656.1_736975_738766_+	COG4585, COG4585, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|586aa|up_7|NZ_LN890656.1_738842_740600_+	COG4585, COG4585, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|223aa|up_6|NZ_LN890656.1_740592_741261_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|130aa|up_5|NZ_LN890656.1_741414_741804_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|226aa|up_4|NZ_LN890656.1_741777_742455_-	COG0400, COG0400, Predicted esterase [General function prediction only]	NA|160aa|up_3|NZ_LN890656.1_742643_743123_-	pfam09835, DUF2062, Uncharacterized protein conserved in bacteria (DUF2062)	NA|705aa|up_2|NZ_LN890656.1_743523_745638_+	cd16147, G6S, glucosamine (N-acetyl)-6-sulfatase(G6S, GNS) AND sulfatase 1(SULF1)	NA|291aa|up_1|NZ_LN890656.1_745752_746625_-	PRK11689, PRK11689, aromatic amino acid efflux DMT transporter YddG	NA|468aa|up_0|NZ_LN890656.1_746766_748170_-	cd15482, Sialidase_non-viral, Non-viral sialidases	NA|622aa|down_0|NZ_LN890656.1_748594_750460_+	COG0531, PotE, Amino acid transporters [Amino acid transport and metabolism]	NA|746aa|down_1|NZ_LN890656.1_750631_752869_-	NA	NA|456aa|down_2|NZ_LN890656.1_752887_754255_-	sd00006, TPR, Tetratricopeptide repeat	NA|259aa|down_3|NZ_LN890656.1_754711_755488_+	pfam10547, P22_AR_N, P22_AR N-terminal domain	NA|261aa|down_4|NZ_LN890656.1_755658_756441_+	pfam13614, AAA_31, AAA domain	NA|379aa|down_5|NZ_LN890656.1_756509_757646_+	TIGR04285, parB-like_partition_protein, nucleoid occlusion protein	NA|396aa|down_6|NZ_LN890656.1_757717_758905_-	sd00038, Kelch, Kelch repeat	NA|527aa|down_7|NZ_LN890656.1_758908_760489_-	NA	NA|972aa|down_8|NZ_LN890656.1_760882_763798_-	PRK15319, PRK15319, fibronectin-binding autotransporter adhesin ShdA	NA|43aa|down_9|NZ_LN890656.1_763911_764040_-	pfam04255, DUF433, Protein of unknown function (DUF433)
GCF_900066015.1_Cfx-K	NZ_LN890656	Candidatus Promineofilum breve strain Cfx-K chromosome II	9	932595-932835	2	PILER-CR	no		csa3	Orphan	GCCAAATCTTCAAGCATTGGCTACGGATTACCACGGAAGAAACGGATTTATTTCTTTCCTACGG	64	1	1	932773-932804	NZ_LN890656.1_932836-932867	NA	2	2	Orphan	csa3,cas4,DinG,cas2,cas1,WYL,cas3,cas8a4,cas7,cas5,cas6,cas10,csm3gr7,csx10gr5,csx19,csx15	NA|170aa|up_9|NZ_LN890656.1_917640_918150_+,NA|637aa|down_1|NZ_LN890656.1_934935_936846_+,NA|177aa|down_4|NZ_LN890656.1_940549_941080_-,NA|73aa|down_6|NZ_LN890656.1_943305_943524_+	NA|170aa|up_9|NZ_LN890656.1_917640_918150_+	NA	NA|304aa|up_8|NZ_LN890656.1_918256_919168_-	PRK11272, PRK11272, putative DMT superfamily transporter inner membrane protein; Provisional	NA|116aa|up_7|NZ_LN890656.1_919298_919646_-	COG0662, {ManC}, Mannose-6-phosphate isomerase [Carbohydrate transport and metabolism]	NA|171aa|up_6|NZ_LN890656.1_919997_920510_+	COG2335, COG2335, Secreted and surface protein containing fasciclin-like repeats [Cell envelope biogenesis, outer membrane]	NA|128aa|up_5|NZ_LN890656.1_920735_921119_+	cd06587, VOC, vicinal oxygen chelate (VOC) family	NA|998aa|up_4|NZ_LN890656.1_921322_924316_+	COG3903, COG3903, Predicted ATPase [General function prediction only]	NA|80aa|up_3|NZ_LN890656.1_924519_924759_+	pfam13711, DUF4160, Domain of unknown function (DUF4160)	NA|88aa|up_2|NZ_LN890656.1_924718_924982_+	pfam10387, DUF2442, Protein of unknown function (DUF2442)	NA|1943aa|up_1|NZ_LN890656.1_925157_930986_-	pfam08757, CotH, CotH kinase protein	NA|407aa|up_0|NZ_LN890656.1_931300_932521_-	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|481aa|down_0|NZ_LN890656.1_933109_934552_+	cd00190, Tryp_SPc, Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms	NA|637aa|down_1|NZ_LN890656.1_934935_936846_+	NA	NA|272aa|down_2|NZ_LN890656.1_937202_938018_+	cd07247, SgaA_N_like, N-terminal domain of Streptomyces griseus SgaA and similar domains	NA|599aa|down_3|NZ_LN890656.1_938199_939996_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|177aa|down_4|NZ_LN890656.1_940549_941080_-	NA	NA|554aa|down_5|NZ_LN890656.1_941319_942981_-	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|73aa|down_6|NZ_LN890656.1_943305_943524_+	NA	NA|446aa|down_7|NZ_LN890656.1_943558_944896_-	pfam13546, DDE_5, DDE superfamily endonuclease	NA|91aa|down_8|NZ_LN890656.1_945121_945394_-	COG2442, COG2442, Uncharacterized conserved protein [Function unknown]	NA|365aa|down_9|NZ_LN890656.1_945672_946767_-	pfam18854, baeRF_family10, Bacterial archaeo-eukaryotic release factor family 10
GCF_900066015.1_Cfx-K	NZ_LN890656	Candidatus Promineofilum breve strain Cfx-K chromosome II	10	968765-968894	8	CRISPRCasFinder	no		csa3	Orphan	TCGCAATCGCAATCGAAAAGCGAAGAATCGATAGCGATTGCGAT	44	0	0	NA	NA	NA	1	1	Orphan	csa3,cas4,DinG,cas2,cas1,WYL,cas3,cas8a4,cas7,cas5,cas6,cas10,csm3gr7,csx10gr5,csx19,csx15	NA|75aa|up_1|NZ_LN890656.1_965663_965888_+,NA	NA|330aa|up_9|NZ_LN890656.1_954421_955411_-	pfam02371, Transposase_20, Transposase IS116/IS110/IS902 family	NA|207aa|up_8|NZ_LN890656.1_955906_956527_-	pfam01797, Y1_Tnp, Transposase IS200 like	NA|569aa|up_7|NZ_LN890656.1_956820_958527_-	COG2189, COG2189, Adenine specific DNA methylase Mod [DNA replication, recombination, and repair]	NA|343aa|up_6|NZ_LN890656.1_958641_959670_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|568aa|up_5|NZ_LN890656.1_959801_961505_+	PRK15098, PRK15098, beta-glucosidase BglX	NA|143aa|up_4|NZ_LN890656.1_961598_962027_-	cd04623, CBS_pair_bac_euk, Two tandem repeats of the cystathionine beta-synthase (CBS pair) domains present in bacteria and eukaryotes	NA|247aa|up_3|NZ_LN890656.1_962236_962977_+	pfam13614, AAA_31, AAA domain	NA|840aa|up_2|NZ_LN890656.1_963180_965700_+	COG0568, RpoD, DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) [Transcription]	NA|75aa|up_1|NZ_LN890656.1_965663_965888_+	NA	NA|904aa|up_0|NZ_LN890656.1_966022_968734_+	pfam13229, Beta_helix, Right handed beta helix region	NA|859aa|down_0|NZ_LN890656.1_969119_971696_+	pfam13229, Beta_helix, Right handed beta helix region	NA|418aa|down_1|NZ_LN890656.1_972391_973645_+	COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA replication, recombination, and repair / Transcription / Translation, ribosomal structure and biogenesis]	NA|235aa|down_2|NZ_LN890656.1_973969_974674_+	cd07729, AHL_lactonase_MBL-fold, quorum-quenching N-acyl-homoserine lactonase, MBL-fold metallo-hydrolase domain	NA|330aa|down_3|NZ_LN890656.1_974727_975717_-	pfam02371, Transposase_20, Transposase IS116/IS110/IS902 family	NA|269aa|down_4|NZ_LN890656.1_975937_976744_+	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|189aa|down_5|NZ_LN890656.1_977056_977623_+	COG0262, FolA, Dihydrofolate reductase [Coenzyme metabolism]	NA|125aa|down_6|NZ_LN890656.1_977785_978160_+	pfam09966, DUF2200, Uncharacterized protein conserved in bacteria (DUF2200)	NA|408aa|down_7|NZ_LN890656.1_978361_979585_-	pfam10117, McrBC, McrBC 5-methylcytosine restriction system component	NA|396aa|down_8|NZ_LN890656.1_979798_980986_-	cd17256, RMtype1_S_EcoJA65PI-TRD1-CR1_like, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR), similar to S	NA|496aa|down_9|NZ_LN890656.1_980982_982470_-	COG0286, HsdM, Type I restriction-modification system methyltransferase subunit [Defense mechanisms]
