assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	1	43400-43478	1	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	AACAAAGCGAGCAAACGAGCAAA	23	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|341aa|up_9|NZ_CP053452.1_28985_30008_+,NA|66aa|up_7|NZ_CP053452.1_32218_32416_+,NA|736aa|up_3|NZ_CP053452.1_38056_40264_-,NA|161aa|up_2|NZ_CP053452.1_40935_41418_-,NA|261aa|up_1|NZ_CP053452.1_41461_42244_-,NA|311aa|down_1|NZ_CP053452.1_46574_47507_+,NA|134aa|down_2|NZ_CP053452.1_47606_48008_+,NA|74aa|down_9|NZ_CP053452.1_66006_66228_+	NA|341aa|up_9|NZ_CP053452.1_28985_30008_+	NA	NA|544aa|up_8|NZ_CP053452.1_30461_32093_-	COG5610, COG5610, Predicted hydrolase (HAD superfamily) [General function prediction only]	NA|66aa|up_7|NZ_CP053452.1_32218_32416_+	NA	NA|473aa|up_6|NZ_CP053452.1_32466_33885_-	cd03823, GT4_ExpE7-like, glycosyltransferase ExpE7 and similar proteins	NA|278aa|up_5|NZ_CP053452.1_34074_34908_-	TIGR02198, Uncharacterized_sugar_kinase_BU060, rfaE bifunctional protein, domain I	NA|436aa|up_4|NZ_CP053452.1_35407_36715_-	sd00006, TPR, Tetratricopeptide repeat	NA|736aa|up_3|NZ_CP053452.1_38056_40264_-	NA	NA|161aa|up_2|NZ_CP053452.1_40935_41418_-	NA	NA|261aa|up_1|NZ_CP053452.1_41461_42244_-	NA	NA|344aa|up_0|NZ_CP053452.1_42283_43315_-	PRK01287, xerC, site-specific tyrosine recombinase XerC; Reviewed	NA|886aa|down_0|NZ_CP053452.1_43566_46224_-	COG0358, DnaG, DNA primase (bacterial type) [DNA replication, recombination, and repair]	NA|311aa|down_1|NZ_CP053452.1_46574_47507_+	NA	NA|134aa|down_2|NZ_CP053452.1_47606_48008_+	NA	NA|837aa|down_3|NZ_CP053452.1_48086_50597_+	pfam09250, Prim-Pol, Bifunctional DNA primase/polymerase, N-terminal	NA|201aa|down_4|NZ_CP053452.1_50901_51504_+	pfam00239, Resolvase, Resolvase, N terminal domain	NA|203aa|down_5|NZ_CP053452.1_52215_52824_-	pfam09346, SMI1_KNR4, SMI1 / KNR4 family (SUKH-1)	NA|3785aa|down_6|NZ_CP053452.1_52825_64180_-	NF012181, MSCRAMM_SdrD, MSCRAMM family adhesin SdrD	NA|235aa|down_7|NZ_CP053452.1_64853_65558_+	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|154aa|down_8|NZ_CP053452.1_65557_66019_+	TIGR01444, 2-O-methyltransferase_NoeI, methyltransferase, FkbM family	NA|74aa|down_9|NZ_CP053452.1_66006_66228_+	NA
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	2	824998-825081	2	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	TCGGAACGGCGTCAGGATCGGCTA	24	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|83aa|up_9|NZ_CP053452.1_817527_817776_+,NA|172aa|up_6|NZ_CP053452.1_820202_820718_-,NA|203aa|up_5|NZ_CP053452.1_820823_821432_-,NA|62aa|up_1|NZ_CP053452.1_823484_823670_+,NA|172aa|up_0|NZ_CP053452.1_823650_824166_+,NA|123aa|down_0|NZ_CP053452.1_825402_825771_-,NA|91aa|down_7|NZ_CP053452.1_837110_837383_-,NA|109aa|down_8|NZ_CP053452.1_837847_838174_-	NA|83aa|up_9|NZ_CP053452.1_817527_817776_+	NA	NA|355aa|up_8|NZ_CP053452.1_818006_819071_+	pfam13369, Transglut_core2, Transglutaminase-like superfamily	NA|254aa|up_7|NZ_CP053452.1_819096_819858_+	TIGR03696, tRNA_nuclease_WapA, RHS repeat-associated core domain	NA|172aa|up_6|NZ_CP053452.1_820202_820718_-	NA	NA|203aa|up_5|NZ_CP053452.1_820823_821432_-	NA	NA|195aa|up_4|NZ_CP053452.1_821434_822019_-	pfam13578, Methyltransf_24, Methyltransferase domain	NA|184aa|up_3|NZ_CP053452.1_822040_822592_+	COG4725, IME4, Transcriptional activator, adenine-specific DNA methyltransferase [Signal transduction mechanisms / Transcription]	NA|245aa|up_2|NZ_CP053452.1_822760_823495_+	TIGR02197, heptose_epim, ADP-L-glycero-D-manno-heptose-6-epimerase	NA|62aa|up_1|NZ_CP053452.1_823484_823670_+	NA	NA|172aa|up_0|NZ_CP053452.1_823650_824166_+	NA	NA|123aa|down_0|NZ_CP053452.1_825402_825771_-	NA	NA|839aa|down_1|NZ_CP053452.1_826441_828958_+	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|294aa|down_2|NZ_CP053452.1_829013_829895_-	pfam08305, NPCBM, NPCBM/NEW2 domain	NA|731aa|down_3|NZ_CP053452.1_830082_832275_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|727aa|down_4|NZ_CP053452.1_832529_834710_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|497aa|down_5|NZ_CP053452.1_834791_836282_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|133aa|down_6|NZ_CP053452.1_836608_837007_+	pfam00072, Response_reg, Response regulator receiver domain	NA|91aa|down_7|NZ_CP053452.1_837110_837383_-	NA	NA|109aa|down_8|NZ_CP053452.1_837847_838174_-	NA	NA|428aa|down_9|NZ_CP053452.1_838436_839720_-	TIGR01926, peroxid_rel, uncharacterized peroxidase-related enzyme
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	3	1427540-1427652	3	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	CAAGGCCGAGGCCGAGGCCAAGGCCGCACGCAAGGCGGAAG	41	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|47aa|up_9|NZ_CP053452.1_1419820_1419961_-,NA|81aa|up_8|NZ_CP053452.1_1419981_1420224_-,NA|136aa|up_7|NZ_CP053452.1_1420888_1421296_-,NA|84aa|up_6|NZ_CP053452.1_1421406_1421658_-,NA|147aa|up_5|NZ_CP053452.1_1422887_1423328_-,NA|135aa|up_4|NZ_CP053452.1_1423452_1423857_-,NA|138aa|up_3|NZ_CP053452.1_1423986_1424400_-,NA|157aa|up_2|NZ_CP053452.1_1424503_1424974_-,NA|115aa|up_1|NZ_CP053452.1_1425068_1425413_-,NA|107aa|down_2|NZ_CP053452.1_1430625_1430946_-,NA|89aa|down_3|NZ_CP053452.1_1431968_1432235_-,NA|116aa|down_5|NZ_CP053452.1_1432984_1433332_-,NA|111aa|down_6|NZ_CP053452.1_1433716_1434049_-,NA|166aa|down_7|NZ_CP053452.1_1434455_1434953_-,NA|567aa|down_8|NZ_CP053452.1_1435325_1437026_-	NA|47aa|up_9|NZ_CP053452.1_1419820_1419961_-	NA	NA|81aa|up_8|NZ_CP053452.1_1419981_1420224_-	NA	NA|136aa|up_7|NZ_CP053452.1_1420888_1421296_-	NA	NA|84aa|up_6|NZ_CP053452.1_1421406_1421658_-	NA	NA|147aa|up_5|NZ_CP053452.1_1422887_1423328_-	NA	NA|135aa|up_4|NZ_CP053452.1_1423452_1423857_-	NA	NA|138aa|up_3|NZ_CP053452.1_1423986_1424400_-	NA	NA|157aa|up_2|NZ_CP053452.1_1424503_1424974_-	NA	NA|115aa|up_1|NZ_CP053452.1_1425068_1425413_-	NA	NA|182aa|up_0|NZ_CP053452.1_1425557_1426103_+	pfam07638, Sigma70_ECF, ECF sigma factor	NA|515aa|down_0|NZ_CP053452.1_1428701_1430246_-	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|143aa|down_1|NZ_CP053452.1_1430203_1430632_-	pfam05717, TnpB_IS66, IS66 Orf2 like protein	NA|107aa|down_2|NZ_CP053452.1_1430625_1430946_-	NA	NA|89aa|down_3|NZ_CP053452.1_1431968_1432235_-	NA	NA|126aa|down_4|NZ_CP053452.1_1432613_1432991_-	pfam05717, TnpB_IS66, IS66 Orf2 like protein	NA|116aa|down_5|NZ_CP053452.1_1432984_1433332_-	NA	NA|111aa|down_6|NZ_CP053452.1_1433716_1434049_-	NA	NA|166aa|down_7|NZ_CP053452.1_1434455_1434953_-	NA	NA|567aa|down_8|NZ_CP053452.1_1435325_1437026_-	NA	NA|164aa|down_9|NZ_CP053452.1_1437293_1437785_-	cd14505, CDKN3-like, cyclin-dependent kinase inhibitor 3 and similar proteins
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	4	1460920-1461020	4	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	CCGCACCCGGCCCCGCCGTCACC	23	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|56aa|up_9|NZ_CP053452.1_1451409_1451577_-,NA|59aa|up_8|NZ_CP053452.1_1451776_1451953_+,NA|105aa|up_6|NZ_CP053452.1_1452264_1452579_+,NA|149aa|up_5|NZ_CP053452.1_1452819_1453266_+,NA|276aa|down_6|NZ_CP053452.1_1470903_1471731_+	NA|56aa|up_9|NZ_CP053452.1_1451409_1451577_-	NA	NA|59aa|up_8|NZ_CP053452.1_1451776_1451953_+	NA	NA|48aa|up_7|NZ_CP053452.1_1451949_1452093_+	pfam10905, DUF2695, Protein of unknown function (DUF2695)	NA|105aa|up_6|NZ_CP053452.1_1452264_1452579_+	NA	NA|149aa|up_5|NZ_CP053452.1_1452819_1453266_+	NA	NA|327aa|up_4|NZ_CP053452.1_1453293_1454274_-	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|791aa|up_3|NZ_CP053452.1_1454962_1457335_+	pfam07631, PSD4, Protein of unknown function (DUF1592)	NA|122aa|up_2|NZ_CP053452.1_1457514_1457880_+	TIGR04256, conserved_hypothetical_protein, GxxExxY protein	NA|447aa|up_1|NZ_CP053452.1_1458038_1459379_+	pfam07586, HXXSHH, Protein of unknown function (DUF1552)	NA|452aa|up_0|NZ_CP053452.1_1459477_1460833_-	COG3069, DcuC, C4-dicarboxylate transporter [Energy production and conversion]	NA|738aa|down_0|NZ_CP053452.1_1462445_1464659_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|191aa|down_1|NZ_CP053452.1_1464655_1465228_-	TIGR02984, Sig-70_plancto1, RNA polymerase sigma-70 factor, Planctomycetaceae-specific subfamily 1	NA|238aa|down_2|NZ_CP053452.1_1465480_1466194_-	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|412aa|down_3|NZ_CP053452.1_1466200_1467436_-	PRK13837, PRK13837, two-component system VirA-like sensor kinase	NA|406aa|down_4|NZ_CP053452.1_1467738_1468956_-	sd00044, HEAT, HEAT repeats	NA|466aa|down_5|NZ_CP053452.1_1469391_1470789_-	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|276aa|down_6|NZ_CP053452.1_1470903_1471731_+	NA	NA|366aa|down_7|NZ_CP053452.1_1471772_1472870_+	PRK12858, PRK12858, tagatose 1,6-diphosphate aldolase; Reviewed	NA|395aa|down_8|NZ_CP053452.1_1472975_1474160_-	pfam13098, Thioredoxin_2, Thioredoxin-like domain	NA|300aa|down_9|NZ_CP053452.1_1474267_1475167_-	PRK00236, xerC, site-specific tyrosine recombinase XerC; Reviewed
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	5	1575127-1575226	5	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	GCGAGCGGCGCGGCGTGAGCCCGCCGGTGCTA	32	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|216aa|up_6|NZ_CP053452.1_1566762_1567410_-,NA|290aa|up_5|NZ_CP053452.1_1567530_1568400_-,NA|312aa|up_4|NZ_CP053452.1_1568598_1569534_-,NA|106aa|up_3|NZ_CP053452.1_1570021_1570339_+,NA|100aa|down_6|NZ_CP053452.1_1586517_1586817_-	NA|242aa|up_9|NZ_CP053452.1_1562558_1563284_+	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|458aa|up_8|NZ_CP053452.1_1563457_1564831_+	COG0790, COG0790, FOG: TPR repeat, SEL1 subfamily [General function prediction only]	NA|530aa|up_7|NZ_CP053452.1_1565140_1566730_+	PRK13557, PRK13557, histidine kinase; Provisional	NA|216aa|up_6|NZ_CP053452.1_1566762_1567410_-	NA	NA|290aa|up_5|NZ_CP053452.1_1567530_1568400_-	NA	NA|312aa|up_4|NZ_CP053452.1_1568598_1569534_-	NA	NA|106aa|up_3|NZ_CP053452.1_1570021_1570339_+	NA	NA|293aa|up_2|NZ_CP053452.1_1570428_1571307_-	sd00006, TPR, Tetratricopeptide repeat	NA|682aa|up_1|NZ_CP053452.1_1571303_1573349_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|359aa|up_0|NZ_CP053452.1_1573999_1575076_-	pfam11199, DUF2891, Protein of unknown function (DUF2891)	NA|395aa|down_0|NZ_CP053452.1_1575348_1576533_-	pfam00144, Beta-lactamase, Beta-lactamase	NA|491aa|down_1|NZ_CP053452.1_1576787_1578260_+	COG1520, COG1520, FOG: WD40-like repeat [Function unknown]	NA|1104aa|down_2|NZ_CP053452.1_1578617_1581929_+	COG0841, AcrB, Cation/multidrug efflux pump [Defense mechanisms]	NA|511aa|down_3|NZ_CP053452.1_1583002_1584535_-	cd00366, FGGY, FGGY family of carbohydrate kinases	NA|421aa|down_4|NZ_CP053452.1_1584703_1585966_+	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|125aa|down_5|NZ_CP053452.1_1585962_1586337_+	cd19919, REC_NtrC, phosphoacceptor receiver (REC) domain of DNA-binding transcriptional regulator NtrC	NA|100aa|down_6|NZ_CP053452.1_1586517_1586817_-	NA	NA|285aa|down_7|NZ_CP053452.1_1587032_1587887_+	PRK00450, dapF, diaminopimelate epimerase; Provisional	NA|600aa|down_8|NZ_CP053452.1_1587883_1589683_+	pfam09594, GT87, Glycosyltransferase family 87	NA|261aa|down_9|NZ_CP053452.1_1589696_1590479_-	PRK00278, trpC, indole-3-glycerol phosphate synthase TrpC
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	6	1578295-1578441	6	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	CGCAGGCAACGCAGATCCGAAGAACAGAATTCTTCTTGTCTTCATC	46	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|216aa|up_8|NZ_CP053452.1_1566762_1567410_-,NA|290aa|up_7|NZ_CP053452.1_1567530_1568400_-,NA|312aa|up_6|NZ_CP053452.1_1568598_1569534_-,NA|106aa|up_5|NZ_CP053452.1_1570021_1570339_+,NA|100aa|down_4|NZ_CP053452.1_1586517_1586817_-,NA|93aa|down_8|NZ_CP053452.1_1590678_1590957_+	NA|530aa|up_9|NZ_CP053452.1_1565140_1566730_+	PRK13557, PRK13557, histidine kinase; Provisional	NA|216aa|up_8|NZ_CP053452.1_1566762_1567410_-	NA	NA|290aa|up_7|NZ_CP053452.1_1567530_1568400_-	NA	NA|312aa|up_6|NZ_CP053452.1_1568598_1569534_-	NA	NA|106aa|up_5|NZ_CP053452.1_1570021_1570339_+	NA	NA|293aa|up_4|NZ_CP053452.1_1570428_1571307_-	sd00006, TPR, Tetratricopeptide repeat	NA|682aa|up_3|NZ_CP053452.1_1571303_1573349_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|359aa|up_2|NZ_CP053452.1_1573999_1575076_-	pfam11199, DUF2891, Protein of unknown function (DUF2891)	NA|395aa|up_1|NZ_CP053452.1_1575348_1576533_-	pfam00144, Beta-lactamase, Beta-lactamase	NA|491aa|up_0|NZ_CP053452.1_1576787_1578260_+	COG1520, COG1520, FOG: WD40-like repeat [Function unknown]	NA|1104aa|down_0|NZ_CP053452.1_1578617_1581929_+	COG0841, AcrB, Cation/multidrug efflux pump [Defense mechanisms]	NA|511aa|down_1|NZ_CP053452.1_1583002_1584535_-	cd00366, FGGY, FGGY family of carbohydrate kinases	NA|421aa|down_2|NZ_CP053452.1_1584703_1585966_+	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|125aa|down_3|NZ_CP053452.1_1585962_1586337_+	cd19919, REC_NtrC, phosphoacceptor receiver (REC) domain of DNA-binding transcriptional regulator NtrC	NA|100aa|down_4|NZ_CP053452.1_1586517_1586817_-	NA	NA|285aa|down_5|NZ_CP053452.1_1587032_1587887_+	PRK00450, dapF, diaminopimelate epimerase; Provisional	NA|600aa|down_6|NZ_CP053452.1_1587883_1589683_+	pfam09594, GT87, Glycosyltransferase family 87	NA|261aa|down_7|NZ_CP053452.1_1589696_1590479_-	PRK00278, trpC, indole-3-glycerol phosphate synthase TrpC	NA|93aa|down_8|NZ_CP053452.1_1590678_1590957_+	NA	NA|256aa|down_9|NZ_CP053452.1_1591015_1591783_-	COG3967, DltE, Short-chain dehydrogenase involved in D-alanine esterification of lipoteichoic acid and wall teichoic acid (D-alanine transfer protein) [Cell envelope biogenesis, outer membrane]
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	7	1730272-1730384	7	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	CCGCCCGCCCGACCACGGGCGCCCCGGCC	29	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|416aa|up_2|NZ_CP053452.1_1726119_1727367_+,NA|178aa|down_4|NZ_CP053452.1_1744407_1744941_+,NA|273aa|down_9|NZ_CP053452.1_1751772_1752591_+	NA|494aa|up_9|NZ_CP053452.1_1713867_1715349_-	pfam00756, Esterase, Putative esterase	NA|840aa|up_8|NZ_CP053452.1_1715473_1717993_-	pfam07587, PSD1, Protein of unknown function (DUF1553)	NA|445aa|up_7|NZ_CP053452.1_1719121_1720456_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|123aa|up_6|NZ_CP053452.1_1720456_1720825_-	COG4372, COG4372, Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown]	NA|500aa|up_5|NZ_CP053452.1_1721431_1722931_-	cd17319, MFS_ExuT_GudP_like, Hexuronate transporter, Glucarate transporter, and similar transporters of the Major Facilitator Superfamily	NA|323aa|up_4|NZ_CP053452.1_1723141_1724110_+	PRK05035, PRK05035, electron transport complex protein RnfC; Provisional	NA|666aa|up_3|NZ_CP053452.1_1724109_1726107_+	pfam00493, MCM, MCM2/3/5 family	NA|416aa|up_2|NZ_CP053452.1_1726119_1727367_+	NA	NA|289aa|up_1|NZ_CP053452.1_1727462_1728329_+	PRK07544, PRK07544, branched-chain amino acid aminotransferase; Validated	NA|271aa|up_0|NZ_CP053452.1_1728416_1729229_+	COG1082, IolE, Sugar phosphate isomerases/epimerases [Carbohydrate transport and metabolism]	NA|945aa|down_0|NZ_CP053452.1_1730998_1733833_+	TIGR02517, Putative_type_II_secretion_system_protein_D, type II secretion system protein D	NA|573aa|down_1|NZ_CP053452.1_1734153_1735872_+	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|1400aa|down_2|NZ_CP053452.1_1736045_1740245_-	sd00033, LRR_RI, leucine-rich repeats, ribonuclease inhibitor (RI)-like subfamily	NA|1165aa|down_3|NZ_CP053452.1_1740422_1743917_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|178aa|down_4|NZ_CP053452.1_1744407_1744941_+	NA	NA|685aa|down_5|NZ_CP053452.1_1745053_1747108_+	COG2319, COG2319, FOG: WD40 repeat [General function prediction only]	NA|92aa|down_6|NZ_CP053452.1_1747239_1747515_-	COG3422, COG3422, Uncharacterized conserved protein [Function unknown]	NA|537aa|down_7|NZ_CP053452.1_1748022_1749633_+	PRK09776, PRK09776, putative diguanylate cyclase; Provisional	NA|672aa|down_8|NZ_CP053452.1_1749566_1751582_+	PRK13557, PRK13557, histidine kinase; Provisional	NA|273aa|down_9|NZ_CP053452.1_1751772_1752591_+	NA
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	8	1830542-1830642	8	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	AGAGGATCGCCTCCCGGAGCCGG	23	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|81aa|up_8|NZ_CP053452.1_1816726_1816969_+,NA|204aa|up_7|NZ_CP053452.1_1817091_1817703_+,NA|147aa|up_6|NZ_CP053452.1_1818210_1818651_-,NA|127aa|down_9|NZ_CP053452.1_1841050_1841431_+	NA|880aa|up_9|NZ_CP053452.1_1813777_1816417_-	TIGR00975, precursor_PBP-3_PstS-3_Antigen_Ag88	NA|81aa|up_8|NZ_CP053452.1_1816726_1816969_+	NA	NA|204aa|up_7|NZ_CP053452.1_1817091_1817703_+	NA	NA|147aa|up_6|NZ_CP053452.1_1818210_1818651_-	NA	NA|644aa|up_5|NZ_CP053452.1_1818920_1820852_+	pfam07583, PSCyt2, Protein of unknown function (DUF1549)	NA|516aa|up_4|NZ_CP053452.1_1821077_1822625_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|225aa|up_3|NZ_CP053452.1_1822660_1823335_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|174aa|up_2|NZ_CP053452.1_1823382_1823904_-	pfam13565, HTH_32, Homeodomain-like domain	NA|591aa|up_1|NZ_CP053452.1_1823963_1825736_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|418aa|up_0|NZ_CP053452.1_1826399_1827653_+	cd06433, GT_2_WfgS_like, WfgS and WfeV are involved in O-antigen biosynthesis	NA|155aa|down_0|NZ_CP053452.1_1831527_1831992_-	PRK00061, ribH, 6,7-dimethyl-8-ribityllumazine synthase; Provisional	NA|111aa|down_1|NZ_CP053452.1_1832104_1832437_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|365aa|down_2|NZ_CP053452.1_1832585_1833680_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|485aa|down_3|NZ_CP053452.1_1833768_1835223_-	PRK09376, rho, transcription termination factor Rho; Provisional	NA|121aa|down_4|NZ_CP053452.1_1835170_1835533_-	PRK07994, PRK07994, DNA polymerase III subunits gamma and tau; Validated	NA|219aa|down_5|NZ_CP053452.1_1835783_1836440_-	PRK00081, coaE, dephospho-CoA kinase; Reviewed	NA|231aa|down_6|NZ_CP053452.1_1836436_1837129_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|970aa|down_7|NZ_CP053452.1_1837181_1840091_-	TIGR00593, DNA_polymerase_I, DNA polymerase I	NA|133aa|down_8|NZ_CP053452.1_1840448_1840847_-	cd06154, YjgF_YER057c_UK114_like_6, This group of proteins belong to a large family of YjgF/YER057c/UK114-like proteins present in bacteria, archaea, and eukaryotes with no definitive function	NA|127aa|down_9|NZ_CP053452.1_1841050_1841431_+	NA
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	9	1903897-1907936	1,9,1	PILER-CR,CRISPRCasFinder,CRT	no	cas1,cas2,cas3,cas8u1,cas7,csb2gr5	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Unclear	GTTCTCCACGGCTCACCGCCGTGGCCGAATTGAAGG,GTTCTCCACGGCTCACCGCCGTGGCCGAATTGAAGG,GTTCTCCACGGCTCACCGCCGTGGCCGAATTGAAGG	36,36,36	0	0	NA	NA	NA:NA:NA	55,55,55	55	Unclear	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|70aa|up_7|NZ_CP053452.1_1893151_1893361_+,cas8u1|272aa|up_0|NZ_CP053452.1_1902839_1903655_+,NA|116aa|down_2|NZ_CP053452.1_1909966_1910314_-,NA|116aa|down_3|NZ_CP053452.1_1913240_1913588_+,NA|401aa|down_7|NZ_CP053452.1_1917740_1918943_+,NA|256aa|down_9|NZ_CP053452.1_1921071_1921839_+	NA|225aa|up_9|NZ_CP053452.1_1890385_1891060_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|658aa|up_8|NZ_CP053452.1_1891079_1893053_+	COG0419, SbcC, ATPase involved in DNA repair [DNA replication, recombination, and repair]	NA|70aa|up_7|NZ_CP053452.1_1893151_1893361_+	NA	cas1|574aa|up_6|NZ_CP053452.1_1893402_1895124_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas2|95aa|up_5|NZ_CP053452.1_1895123_1895408_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas3|798aa|up_4|NZ_CP053452.1_1895404_1897798_+	TIGR02621, CRISPR-associated_helicase_Cas3, CRISPR-associated helicase Cas3, subtype Dpsyc	cas8u1|725aa|up_3|NZ_CP053452.1_1897794_1899969_+	TIGR04113, hypothetical_protein_AaLAA1DRAFT_1703, CRISPR-associated protein Csx17, subtype Dpsyc	cas7|362aa|up_2|NZ_CP053452.1_1899974_1901060_+	pfam09617, Cas_GSU0053, CRISPR-associated protein GSU0053 (Cas_GSU0053)	csb2gr5|537aa|up_1|NZ_CP053452.1_1901066_1902677_+	cd09734, Csb2_I-U, CRISPR/Cas system-associated protein Csb2	cas8u1|272aa|up_0|NZ_CP053452.1_1902839_1903655_+	NA	NA|519aa|down_0|NZ_CP053452.1_1908025_1909582_-	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|126aa|down_1|NZ_CP053452.1_1909595_1909973_-	pfam05717, TnpB_IS66, IS66 Orf2 like protein	NA|116aa|down_2|NZ_CP053452.1_1909966_1910314_-	NA	NA|116aa|down_3|NZ_CP053452.1_1913240_1913588_+	NA	NA|126aa|down_4|NZ_CP053452.1_1913581_1913959_+	pfam05717, TnpB_IS66, IS66 Orf2 like protein	NA|519aa|down_5|NZ_CP053452.1_1913972_1915529_+	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|519aa|down_6|NZ_CP053452.1_1915812_1917369_+	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|401aa|down_7|NZ_CP053452.1_1917740_1918943_+	NA	NA|171aa|down_8|NZ_CP053452.1_1920466_1920979_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|256aa|down_9|NZ_CP053452.1_1921071_1921839_+	NA
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	10	1910415-1912691	2,10,2	PILER-CR,CRISPRCasFinder,CRT	no	cas1,cas2,cas3,cas8u1,cas7,csb2gr5	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Unclear	GTTCTCCACGGCTCACCGCCGTGGCCGAATTGAAGG,GTTCTCCACGGCTCACCGCCGTGGCCGAATTGAAGG,GTTCTCCACGGCTCACCGCCGTGGCCGAATTGAAGG	36,36,36	0	0	NA	NA	NA:NA:NA	31,31,31	31	Unclear	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	cas8u1|272aa|up_3|NZ_CP053452.1_1902839_1903655_+,NA|116aa|up_0|NZ_CP053452.1_1909966_1910314_-,NA|116aa|down_0|NZ_CP053452.1_1913240_1913588_+,NA|401aa|down_4|NZ_CP053452.1_1917740_1918943_+,NA|256aa|down_6|NZ_CP053452.1_1921071_1921839_+,NA|88aa|down_8|NZ_CP053452.1_1922795_1923059_+,NA|72aa|down_9|NZ_CP053452.1_1923085_1923301_+	cas1|574aa|up_9|NZ_CP053452.1_1893402_1895124_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas2|95aa|up_8|NZ_CP053452.1_1895123_1895408_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas3|798aa|up_7|NZ_CP053452.1_1895404_1897798_+	TIGR02621, CRISPR-associated_helicase_Cas3, CRISPR-associated helicase Cas3, subtype Dpsyc	cas8u1|725aa|up_6|NZ_CP053452.1_1897794_1899969_+	TIGR04113, hypothetical_protein_AaLAA1DRAFT_1703, CRISPR-associated protein Csx17, subtype Dpsyc	cas7|362aa|up_5|NZ_CP053452.1_1899974_1901060_+	pfam09617, Cas_GSU0053, CRISPR-associated protein GSU0053 (Cas_GSU0053)	csb2gr5|537aa|up_4|NZ_CP053452.1_1901066_1902677_+	cd09734, Csb2_I-U, CRISPR/Cas system-associated protein Csb2	cas8u1|272aa|up_3|NZ_CP053452.1_1902839_1903655_+	NA	NA|519aa|up_2|NZ_CP053452.1_1908025_1909582_-	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|126aa|up_1|NZ_CP053452.1_1909595_1909973_-	pfam05717, TnpB_IS66, IS66 Orf2 like protein	NA|116aa|up_0|NZ_CP053452.1_1909966_1910314_-	NA	NA|116aa|down_0|NZ_CP053452.1_1913240_1913588_+	NA	NA|126aa|down_1|NZ_CP053452.1_1913581_1913959_+	pfam05717, TnpB_IS66, IS66 Orf2 like protein	NA|519aa|down_2|NZ_CP053452.1_1913972_1915529_+	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|519aa|down_3|NZ_CP053452.1_1915812_1917369_+	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|401aa|down_4|NZ_CP053452.1_1917740_1918943_+	NA	NA|171aa|down_5|NZ_CP053452.1_1920466_1920979_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|256aa|down_6|NZ_CP053452.1_1921071_1921839_+	NA	NA|270aa|down_7|NZ_CP053452.1_1921956_1922766_+	TIGR00180, Probable_chromosome-partitioning_protein_ParB, ParB/RepB/Spo0J family partition protein	NA|88aa|down_8|NZ_CP053452.1_1922795_1923059_+	NA	NA|72aa|down_9|NZ_CP053452.1_1923085_1923301_+	NA
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	11	1983020-1983092	11	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	TCGCGAGCGGCGCGGCGTGAGCC	23	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|544aa|up_7|NZ_CP053452.1_1973816_1975448_+,NA|90aa|up_6|NZ_CP053452.1_1975514_1975784_-,NA|269aa|down_5|NZ_CP053452.1_1988546_1989353_-	NA|143aa|up_9|NZ_CP053452.1_1972114_1972543_+	PLN03098, LPA1, LOW PSII ACCUMULATION1; Provisional	NA|330aa|up_8|NZ_CP053452.1_1972741_1973731_+	COG1788, AtoD, Acyl CoA:acetate/3-ketoacid CoA transferase, alpha subunit [Lipid metabolism]	NA|544aa|up_7|NZ_CP053452.1_1973816_1975448_+	NA	NA|90aa|up_6|NZ_CP053452.1_1975514_1975784_-	NA	NA|263aa|up_5|NZ_CP053452.1_1976414_1977203_-	pfam04337, DUF480, Protein of unknown function, DUF480	NA|495aa|up_4|NZ_CP053452.1_1977632_1979117_-	pfam03150, CCP_MauG, Di-haem cytochrome c peroxidase	NA|193aa|up_3|NZ_CP053452.1_1979608_1980187_-	cd02970, PRX_like2, Peroxiredoxin (PRX)-like 2 family; hypothetical proteins that show sequence similarity to PRXs	NA|343aa|up_2|NZ_CP053452.1_1980183_1981212_-	cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the biosynthesis of succinoglycan	NA|245aa|up_1|NZ_CP053452.1_1981427_1982162_-	pfam02397, Bac_transf, Bacterial sugar transferase	NA|217aa|up_0|NZ_CP053452.1_1982360_1983011_-	TIGR02914, hypothetical_protein, EpsI family protein	NA|754aa|down_0|NZ_CP053452.1_1983137_1985399_-	cd05387, BY-kinase, bacterial tyrosine-kinase	NA|209aa|down_1|NZ_CP053452.1_1985524_1986151_-	pfam09721, Exosortase_EpsH, Transmembrane exosortase (Exosortase_EpsH)	NA|140aa|down_2|NZ_CP053452.1_1986036_1986456_-	pfam09721, Exosortase_EpsH, Transmembrane exosortase (Exosortase_EpsH)	NA|224aa|down_3|NZ_CP053452.1_1986891_1987563_-	pfam04468, PSP1, PSP1 C-terminal conserved region	NA|211aa|down_4|NZ_CP053452.1_1987758_1988391_-	sd00006, TPR, Tetratricopeptide repeat	NA|269aa|down_5|NZ_CP053452.1_1988546_1989353_-	NA	NA|192aa|down_6|NZ_CP053452.1_1990079_1990655_-	PRK00137, rplI, 50S ribosomal protein L9; Reviewed	NA|184aa|down_7|NZ_CP053452.1_1990759_1991311_-	pfam00436, SSB, Single-strand binding protein family	NA|172aa|down_8|NZ_CP053452.1_1991568_1992084_-	cd00473, bS6, Bacterial ribosomal protein S6	NA|213aa|down_9|NZ_CP053452.1_1992385_1993024_-	PRK05426, PRK05426, peptidyl-tRNA hydrolase; Provisional
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	12	2624952-2625118	12	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	AGCCCGGCGTCCGTCAGCGCGGT	23	0	0	NA	NA	NA	2	2	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|178aa|up_8|NZ_CP053452.1_2612070_2612604_-,NA|130aa|up_2|NZ_CP053452.1_2619290_2619680_-,NA|137aa|down_3|NZ_CP053452.1_2634369_2634780_+,NA|320aa|down_5|NZ_CP053452.1_2635923_2636883_+	NA|256aa|up_9|NZ_CP053452.1_2611093_2611861_-	TIGR03000, plancto_dom_1, Planctomycetes uncharacterized domain TIGR03000	NA|178aa|up_8|NZ_CP053452.1_2612070_2612604_-	NA	NA|968aa|up_7|NZ_CP053452.1_2613082_2615986_-	pfam01835, A2M_N, MG2 domain	NA|150aa|up_6|NZ_CP053452.1_2615909_2616359_-	pfam13490, zf-HC2, Putative zinc-finger	NA|206aa|up_5|NZ_CP053452.1_2616450_2617068_-	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|410aa|up_4|NZ_CP053452.1_2617357_2618587_+	pfam13304, AAA_21, AAA domain, putative AbiEii toxin, Type IV TA system	NA|228aa|up_3|NZ_CP053452.1_2618583_2619267_+	pfam14279, HNH_5, HNH endonuclease	NA|130aa|up_2|NZ_CP053452.1_2619290_2619680_-	NA	NA|887aa|up_1|NZ_CP053452.1_2619844_2622505_-	PRK13557, PRK13557, histidine kinase; Provisional	NA|501aa|up_0|NZ_CP053452.1_2622745_2624248_-	pfam07394, DUF1501, Protein of unknown function (DUF1501)	NA|255aa|down_0|NZ_CP053452.1_2628201_2628966_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|901aa|down_1|NZ_CP053452.1_2629987_2632690_+	cd03809, GT4_MtfB-like, glycosyltransferases MtfB, WbpX, and similar proteins	NA|378aa|down_2|NZ_CP053452.1_2632786_2633920_+	cd02932, OYE_YqiM_FMN, Old yellow enzyme (OYE) YqjM-like FMN binding domain	NA|137aa|down_3|NZ_CP053452.1_2634369_2634780_+	NA	NA|172aa|down_4|NZ_CP053452.1_2634999_2635515_+	PRK00522, tpx, thiol peroxidase	NA|320aa|down_5|NZ_CP053452.1_2635923_2636883_+	NA	NA|554aa|down_6|NZ_CP053452.1_2637026_2638688_-	pfam07642, BBP2, Putative beta-barrel porin-2, OmpL-like	NA|99aa|down_7|NZ_CP053452.1_2638886_2639183_-	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|56aa|down_8|NZ_CP053452.1_2639790_2639958_+	PRK14900, valS, valyl-tRNA synthetase; Provisional	NA|1148aa|down_9|NZ_CP053452.1_2640022_2643466_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	13	2764786-2764874	13	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	GTGGGCGACGCCGGCGCGGCGGC	23	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|323aa|up_5|NZ_CP053452.1_2757234_2758203_+,NA|262aa|up_3|NZ_CP053452.1_2759450_2760236_+,NA|103aa|down_4|NZ_CP053452.1_2773832_2774141_-	NA|631aa|up_9|NZ_CP053452.1_2751473_2753366_-	PRK07431, PRK07431, aspartate kinase; Provisional	NA|245aa|up_8|NZ_CP053452.1_2753681_2754416_-	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|511aa|up_7|NZ_CP053452.1_2754501_2756034_+	cd07116, ALDH_ACDHII-AcoD, Ralstonia eutrophus NAD+-dependent acetaldehyde dehydrogenase II-like	NA|343aa|up_6|NZ_CP053452.1_2756076_2757105_+	cd08297, CAD3, Cinnamyl alcohol dehydrogenases (CAD)	NA|323aa|up_5|NZ_CP053452.1_2757234_2758203_+	NA	NA|334aa|up_4|NZ_CP053452.1_2758313_2759315_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|262aa|up_3|NZ_CP053452.1_2759450_2760236_+	NA	NA|99aa|up_2|NZ_CP053452.1_2760250_2760547_-	pfam13240, zinc_ribbon_2, zinc-ribbon domain	NA|435aa|up_1|NZ_CP053452.1_2760783_2762088_-	pfam06245, DUF1015, Protein of unknown function (DUF1015)	NA|372aa|up_0|NZ_CP053452.1_2762248_2763364_-	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|388aa|down_0|NZ_CP053452.1_2765079_2766243_+	TIGR02996, rpt_mate_G_obs, repeat-companion domain TIGR02996	NA|422aa|down_1|NZ_CP053452.1_2766360_2767626_+	TIGR02996, rpt_mate_G_obs, repeat-companion domain TIGR02996	NA|422aa|down_2|NZ_CP053452.1_2767839_2769105_+	TIGR02996, rpt_mate_G_obs, repeat-companion domain TIGR02996	NA|894aa|down_3|NZ_CP053452.1_2771055_2773737_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|103aa|down_4|NZ_CP053452.1_2773832_2774141_-	NA	NA|168aa|down_5|NZ_CP053452.1_2774137_2774641_-	pfam14280, DUF4365, Domain of unknown function (DUF4365)	NA|290aa|down_6|NZ_CP053452.1_2775080_2775950_-	COG0179, MhpD, 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) [Secondary metabolites biosynthesis, transport, and catabolism]	NA|393aa|down_7|NZ_CP053452.1_2776087_2777266_+	COG1104, NifS, Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes [Amino acid transport and metabolism]	NA|119aa|down_8|NZ_CP053452.1_2777237_2777594_-	cd07043, STAS_anti-anti-sigma_factors, Sulphate Transporter and Anti-Sigma factor antagonist) domain of anti-anti-sigma factors, key regulators of anti-sigma factors by phosphorylation	NA|487aa|down_9|NZ_CP053452.1_2777657_2779118_-	cd05242, SDR_a8, atypical (a) SDRs, subgroup 8
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	14	3003589-3003662	14	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	TCGTCCTCATAGCGGCGCCGGCG	23	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|427aa|up_6|NZ_CP053452.1_2994365_2995646_+,NA|156aa|up_0|NZ_CP053452.1_3002887_3003355_-,NA|72aa|down_2|NZ_CP053452.1_3008344_3008560_+,NA|229aa|down_5|NZ_CP053452.1_3015088_3015775_+,NA|292aa|down_7|NZ_CP053452.1_3017084_3017960_-,NA|116aa|down_9|NZ_CP053452.1_3020810_3021158_+	NA|362aa|up_9|NZ_CP053452.1_2985582_2986668_+	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|1125aa|up_8|NZ_CP053452.1_2986709_2990084_+	TIGR00915, Probable_aminoglycoside_efflux_pump, The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family	NA|1082aa|up_7|NZ_CP053452.1_2990981_2994227_+	cd03143, A4_beta-galactosidase_middle_domain, A4 beta-galactosidase middle domain: a type 1 glutamine amidotransferase (GATase1)-like domain	NA|427aa|up_6|NZ_CP053452.1_2994365_2995646_+	NA	NA|789aa|up_5|NZ_CP053452.1_2995642_2998009_+	cd14840, D-Ala-D-Ala_dipeptidase_Aad, D-Ala-D-Ala dipeptidase (includes Lactobacillus plantarum Aad peptidase)	NA|354aa|up_4|NZ_CP053452.1_2998174_2999236_-	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|481aa|up_3|NZ_CP053452.1_2999680_3001123_-	cd05673, M20_Acy1L2_AbgB, M20 Peptidase Aminoacylase 1-like protein 2 aminobenzoyl-glutamate utilization protein B subfamily	NA|181aa|up_2|NZ_CP053452.1_3001307_3001850_+	pfam11026, DUF2721, Protein of unknown function (DUF2721)	NA|300aa|up_1|NZ_CP053452.1_3001979_3002879_+	COG1864, NUC1, DNA/RNA endonuclease G, NUC1 [Nucleotide transport and metabolism]	NA|156aa|up_0|NZ_CP053452.1_3002887_3003355_-	NA	NA|927aa|down_0|NZ_CP053452.1_3004210_3006991_-	PRK09277, PRK09277, aconitate hydratase AcnA	NA|305aa|down_1|NZ_CP053452.1_3007350_3008265_+	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|72aa|down_2|NZ_CP053452.1_3008344_3008560_+	NA	NA|396aa|down_3|NZ_CP053452.1_3008616_3009804_-	COG0654, UbiH, 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases [Coenzyme metabolism / Energy production and conversion]	NA|395aa|down_4|NZ_CP053452.1_3013809_3014994_-	PRK05382, PRK05382, chorismate synthase; Validated	NA|229aa|down_5|NZ_CP053452.1_3015088_3015775_+	NA	NA|341aa|down_6|NZ_CP053452.1_3015941_3016964_+	PLN02721, PLN02721, threonine aldolase	NA|292aa|down_7|NZ_CP053452.1_3017084_3017960_-	NA	NA|830aa|down_8|NZ_CP053452.1_3018200_3020690_+	pfam16313, DUF4953, Met-zincin	NA|116aa|down_9|NZ_CP053452.1_3020810_3021158_+	NA
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	15	3094576-3094743	15	CRISPRCasFinder	no	csa3	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Type I-A	GAAGTTGCCGTCGCTCGATGTGCCGAA	27	0	0	NA	NA	NA	3	3	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|180aa|up_6|NZ_CP053452.1_3087934_3088474_-,NA|394aa|up_5|NZ_CP053452.1_3088772_3089954_-,NA	NA|437aa|up_9|NZ_CP053452.1_3085331_3086642_+	TIGR02644, Thymidine_phosphorylase, pyrimidine-nucleoside phosphorylase	NA|134aa|up_8|NZ_CP053452.1_3086638_3087040_+	PRK05578, PRK05578, cytidine deaminase; Validated	NA|210aa|up_7|NZ_CP053452.1_3087214_3087844_+	PRK00129, upp, uracil phosphoribosyltransferase; Reviewed	NA|180aa|up_6|NZ_CP053452.1_3087934_3088474_-	NA	NA|394aa|up_5|NZ_CP053452.1_3088772_3089954_-	NA	NA|267aa|up_4|NZ_CP053452.1_3089973_3090774_-	PRK10847, PRK10847, DedA family protein	NA|146aa|up_3|NZ_CP053452.1_3090906_3091344_-	TIGR03067, Planc_TIGR03067, Planctomycetes uncharacterized domain TIGR03067	NA|146aa|up_2|NZ_CP053452.1_3091424_3091862_-	TIGR03067, Planc_TIGR03067, Planctomycetes uncharacterized domain TIGR03067	NA|155aa|up_1|NZ_CP053452.1_3091935_3092400_-	TIGR03067, Planc_TIGR03067, Planctomycetes uncharacterized domain TIGR03067	NA|475aa|up_0|NZ_CP053452.1_3092531_3093956_-	COG4099, COG4099, Predicted peptidase [General function prediction only]	NA|603aa|down_0|NZ_CP053452.1_3095378_3097187_-	cd17393, MFS_MosC_like, Membrane protein MosC and similar proteins of the Major Facilitator Superfamily of transporters	NA|255aa|down_1|NZ_CP053452.1_3097292_3098057_+	COG1349, GlpR, Transcriptional regulators of sugar metabolism [Transcription / Carbohydrate transport and metabolism]	NA|294aa|down_2|NZ_CP053452.1_3098757_3099639_+	PRK12886, ubiA, prenyltransferase; Reviewed	NA|1051aa|down_3|NZ_CP053452.1_3099787_3102940_+	TIGR02604, Piru_Ver_Nterm, putative membrane-bound dehydrogenase domain	NA|239aa|down_4|NZ_CP053452.1_3103000_3103717_-	cd00635, PLPDE_III_YBL036c_like, Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzymes, YBL036c-like proteins	NA|115aa|down_5|NZ_CP053452.1_3103719_3104064_-	cd01276, PKCI_related, Protein Kinase C Interacting protein related (PKCI): PKCI and related proteins belong to the ubiquitous HIT family of hydrolases that act on alpha-phosphates of ribonucleotides	NA|314aa|down_6|NZ_CP053452.1_3104332_3105274_+	PRK00015, rnhB, ribonuclease HII; Validated	NA|307aa|down_7|NZ_CP053452.1_3105379_3106300_-	COG2319, COG2319, FOG: WD40 repeat [General function prediction only]	NA|377aa|down_8|NZ_CP053452.1_3106723_3107854_+	TIGR02996, rpt_mate_G_obs, repeat-companion domain TIGR02996	NA|475aa|down_9|NZ_CP053452.1_3107865_3109290_-	cd13137, MATE_NorM_like, Subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins similar to Thermotoga marina NorM
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	16	3136328-3136432	16	CRISPRCasFinder	no	csa3	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Type I-A	CCCCCTCCCTGCAGGGAGGGGGTG	24	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|372aa|up_3|NZ_CP053452.1_3130863_3131979_-,NA|386aa|up_2|NZ_CP053452.1_3131975_3133133_-,NA|168aa|down_3|NZ_CP053452.1_3142813_3143317_-,NA|380aa|down_8|NZ_CP053452.1_3150096_3151236_-	NA|750aa|up_9|NZ_CP053452.1_3124130_3126380_+	PRK05306, infB, translation initiation factor IF-2; Validated	NA|109aa|up_8|NZ_CP053452.1_3126428_3126755_+	pfam04456, DUF503, Protein of unknown function (DUF503)	NA|177aa|up_7|NZ_CP053452.1_3126891_3127422_+	PRK00521, rbfA, 30S ribosome-binding factor RbfA	NA|266aa|up_6|NZ_CP053452.1_3127517_3128315_+	TIGR01084, A/G-specific_adenine_glycosylase, A/G-specific adenine glycosylase	NA|420aa|up_5|NZ_CP053452.1_3128954_3130214_+	COG0156, BioF, 7-keto-8-aminopelargonate synthetase and related enzymes [Coenzyme metabolism]	NA|109aa|up_4|NZ_CP053452.1_3130323_3130650_+	COG2076, EmrE, Membrane transporters of cations and cationic drugs [Inorganic ion transport and metabolism]	NA|372aa|up_3|NZ_CP053452.1_3130863_3131979_-	NA	NA|386aa|up_2|NZ_CP053452.1_3131975_3133133_-	NA	NA|517aa|up_1|NZ_CP053452.1_3133151_3134702_-	COG1696, DltB, Predicted membrane protein involved in D-alanine export [Cell envelope biogenesis, outer membrane]	NA|448aa|up_0|NZ_CP053452.1_3134976_3136320_-	TIGR02982, heterocyst_DevA, ABC exporter ATP-binding subunit, DevA family	NA|407aa|down_0|NZ_CP053452.1_3136471_3137692_-	TIGR01185, membrane_spanning_subunit, DevC protein	NA|657aa|down_1|NZ_CP053452.1_3139461_3141432_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|275aa|down_2|NZ_CP053452.1_3141563_3142388_-	PRK05254, PRK05254, uracil-DNA glycosylase; Provisional	NA|168aa|down_3|NZ_CP053452.1_3142813_3143317_-	NA	NA|313aa|down_4|NZ_CP053452.1_3143867_3144806_-	pfam10134, RPA, Replication initiator protein A	NA|636aa|down_5|NZ_CP053452.1_3145516_3147424_+	pfam13589, HATPase_c_3, Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase	NA|419aa|down_6|NZ_CP053452.1_3147918_3149175_+	pfam01555, N6_N4_Mtase, DNA methylase	NA|307aa|down_7|NZ_CP053452.1_3149174_3150095_+	pfam09549, RE_Bpu10I, Bpu10I restriction endonuclease	NA|380aa|down_8|NZ_CP053452.1_3150096_3151236_-	NA	NA|386aa|down_9|NZ_CP053452.1_3151287_3152444_+	PHA02517, PHA02517, putative transposase OrfB; Reviewed
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	17	3452060-3452162	17	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	CGGTTGAAGTGGCGCCGCACCCGCGCCCGGT	31	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|173aa|up_8|NZ_CP053452.1_3441967_3442486_+,NA|692aa|down_2|NZ_CP053452.1_3454776_3456852_-,NA|781aa|down_3|NZ_CP053452.1_3457165_3459508_+,NA|195aa|down_8|NZ_CP053452.1_3465056_3465641_+	NA|369aa|up_9|NZ_CP053452.1_3439908_3441015_-	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|173aa|up_8|NZ_CP053452.1_3441967_3442486_+	NA	NA|177aa|up_7|NZ_CP053452.1_3442626_3443157_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|96aa|up_6|NZ_CP053452.1_3443201_3443489_-	sd00006, TPR, Tetratricopeptide repeat	NA|124aa|up_5|NZ_CP053452.1_3443720_3444092_+	PRK00051, hisI, phosphoribosyl-AMP cyclohydrolase; Reviewed	NA|1039aa|up_4|NZ_CP053452.1_3444211_3447328_+	TIGR02604, Piru_Ver_Nterm, putative membrane-bound dehydrogenase domain	NA|369aa|up_3|NZ_CP053452.1_3447670_3448777_+	cd13573, PBP2_PnhD_3, Substrate binding domain of uncharacterized ABC-type phosphonate-like transporter; contains the type 2 periplasmic binding fold	NA|323aa|up_2|NZ_CP053452.1_3448952_3449921_+	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|156aa|up_1|NZ_CP053452.1_3449993_3450461_+	pfam10670, DUF4198, Domain of unknown function (DUF4198)	NA|119aa|up_0|NZ_CP053452.1_3450875_3451232_+	PRK06995, flhF, flagellar biosynthesis protein FlhF	NA|446aa|down_0|NZ_CP053452.1_3452314_3453652_-	pfam07394, DUF1501, Protein of unknown function (DUF1501)	NA|249aa|down_1|NZ_CP053452.1_3453759_3454506_-	pfam13267, DUF4058, Protein of unknown function (DUF4058)	NA|692aa|down_2|NZ_CP053452.1_3454776_3456852_-	NA	NA|781aa|down_3|NZ_CP053452.1_3457165_3459508_+	NA	NA|197aa|down_4|NZ_CP053452.1_3459999_3460590_+	TIGR03000, plancto_dom_1, Planctomycetes uncharacterized domain TIGR03000	NA|231aa|down_5|NZ_CP053452.1_3460649_3461342_+	PRK14376, PRK14376, membrane protein insertion efficiency factor YidD	NA|512aa|down_6|NZ_CP053452.1_3461430_3462966_+	cd07345, M48A_Ste24p-like, Peptidase M48 subfamily A-like, putative CaaX prenyl protease	NA|612aa|down_7|NZ_CP053452.1_3463132_3464968_+	PLN02919, PLN02919, haloacid dehalogenase-like hydrolase family protein	NA|195aa|down_8|NZ_CP053452.1_3465056_3465641_+	NA	NA|737aa|down_9|NZ_CP053452.1_3465774_3467985_+	COG4232, COG4232, Thiol:disulfide interchange protein [Posttranslational modification, protein turnover, chaperones / Energy production and conversion]
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	18	3825750-3825860	18	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	AGCGGACATCACGCGGAGCGTGATGACTACGGTGCG	36	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|117aa|up_9|NZ_CP053452.1_3817682_3818033_-,NA|204aa|up_5|NZ_CP053452.1_3819802_3820414_+,NA|157aa|up_1|NZ_CP053452.1_3824448_3824919_-,NA|95aa|down_3|NZ_CP053452.1_3830337_3830622_-,NA|86aa|down_5|NZ_CP053452.1_3832476_3832734_+,NA|179aa|down_6|NZ_CP053452.1_3833082_3833619_+	NA|117aa|up_9|NZ_CP053452.1_3817682_3818033_-	NA	NA|160aa|up_8|NZ_CP053452.1_3818036_3818516_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|171aa|up_7|NZ_CP053452.1_3818619_3819132_+	PRK02304, PRK02304, adenine phosphoribosyltransferase; Provisional	NA|179aa|up_6|NZ_CP053452.1_3819160_3819697_+	pfam07564, DUF1542, Domain of Unknown Function (DUF1542)	NA|204aa|up_5|NZ_CP053452.1_3819802_3820414_+	NA	NA|350aa|up_4|NZ_CP053452.1_3820462_3821512_-	pfam13485, Peptidase_MA_2, Peptidase MA superfamily	NA|523aa|up_3|NZ_CP053452.1_3821887_3823456_+	COG1477, ApbE, Membrane-associated lipoprotein involved in thiamine biosynthesis [Coenzyme metabolism]	NA|228aa|up_2|NZ_CP053452.1_3823590_3824274_+	pfam16357, PepSY_TM_like_2, Putative PepSY_TM-like	NA|157aa|up_1|NZ_CP053452.1_3824448_3824919_-	NA	NA|186aa|up_0|NZ_CP053452.1_3825078_3825636_+	pfam00472, RF-1, RF-1 domain	NA|488aa|down_0|NZ_CP053452.1_3825969_3827433_+	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|163aa|down_1|NZ_CP053452.1_3827638_3828127_+	cd14504, DUSP23, dual specificity phosphatase 23	NA|583aa|down_2|NZ_CP053452.1_3828232_3829981_+	cd16016, AP-SPAP, SPAP is a subclass of alkaline phosphatase (AP)	NA|95aa|down_3|NZ_CP053452.1_3830337_3830622_-	NA	NA|484aa|down_4|NZ_CP053452.1_3830901_3832353_+	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|86aa|down_5|NZ_CP053452.1_3832476_3832734_+	NA	NA|179aa|down_6|NZ_CP053452.1_3833082_3833619_+	NA	NA|402aa|down_7|NZ_CP053452.1_3833778_3834984_-	pfam08241, Methyltransf_11, Methyltransferase domain	NA|529aa|down_8|NZ_CP053452.1_3835394_3836981_+	pfam13598, DUF4139, Domain of unknown function (DUF4139)	NA|223aa|down_9|NZ_CP053452.1_3837632_3838301_-	pfam10670, DUF4198, Domain of unknown function (DUF4198)
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	19	3913371-3913560	19	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	ACGGTGACGGTCTTCTGCACCGGCACC	27	0	0	NA	NA	NA	2	2	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|143aa|up_9|NZ_CP053452.1_3903113_3903542_+,NA|77aa|up_6|NZ_CP053452.1_3904926_3905157_+,NA|80aa|up_5|NZ_CP053452.1_3905170_3905410_+,NA|82aa|down_8|NZ_CP053452.1_3927382_3927628_+	NA|143aa|up_9|NZ_CP053452.1_3903113_3903542_+	NA	NA|134aa|up_8|NZ_CP053452.1_3903631_3904033_+	PRK12323, PRK12323, DNA polymerase III subunit gamma/tau	NA|304aa|up_7|NZ_CP053452.1_3904052_3904964_+	COG3000, ERG3, Sterol desaturase [Lipid metabolism]	NA|77aa|up_6|NZ_CP053452.1_3904926_3905157_+	NA	NA|80aa|up_5|NZ_CP053452.1_3905170_3905410_+	NA	NA|538aa|up_4|NZ_CP053452.1_3905419_3907033_-	PRK00149, dnaA, chromosomal replication initiator protein DnaA	NA|296aa|up_3|NZ_CP053452.1_3907407_3908295_-	pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain	NA|285aa|up_2|NZ_CP053452.1_3908430_3909285_+	PRK00847, thyX, FAD-dependent thymidylate synthase; Reviewed	NA|430aa|up_1|NZ_CP053452.1_3909416_3910706_+	cd01834, SGNH_hydrolase_like_2, SGNH_hydrolase subfamily	NA|601aa|up_0|NZ_CP053452.1_3910879_3912682_+	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|623aa|down_0|NZ_CP053452.1_3914856_3916725_+	COG1620, LldP, L-lactate permease [Energy production and conversion]	NA|232aa|down_1|NZ_CP053452.1_3916773_3917469_-	cd14527, DSP_bac, unknown subfamily of bacterial and plant dual specificity protein phosphatases	NA|343aa|down_2|NZ_CP053452.1_3917468_3918497_-	pfam14100, PmoA, Methane oxygenase PmoA	NA|393aa|down_3|NZ_CP053452.1_3918806_3919985_+	cd01854, YjeQ_EngC, Ribosomal interacting GTPase YjeQ/EngC, a circularly permuted subfamily of the Ras GTPases	NA|932aa|down_4|NZ_CP053452.1_3920185_3922981_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|147aa|down_5|NZ_CP053452.1_3923518_3923959_+	TIGR04320, hypothetical_protein, SEC10/PgrA surface exclusion domain	NA|509aa|down_6|NZ_CP053452.1_3923940_3925467_-	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|369aa|down_7|NZ_CP053452.1_3925699_3926806_+	PRK12564, PRK12564, carbamoyl-phosphate synthase small subunit	NA|82aa|down_8|NZ_CP053452.1_3927382_3927628_+	NA	NA|354aa|down_9|NZ_CP053452.1_3927698_3928760_+	pfam13391, HNH_2, HNH endonuclease
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	20	3913866-3914024	20	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	ACGGTGACGGTCTTCTGCACCGGCACC	27	0	0	NA	NA	NA	2	2	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|143aa|up_9|NZ_CP053452.1_3903113_3903542_+,NA|77aa|up_6|NZ_CP053452.1_3904926_3905157_+,NA|80aa|up_5|NZ_CP053452.1_3905170_3905410_+,NA|82aa|down_8|NZ_CP053452.1_3927382_3927628_+	NA|143aa|up_9|NZ_CP053452.1_3903113_3903542_+	NA	NA|134aa|up_8|NZ_CP053452.1_3903631_3904033_+	PRK12323, PRK12323, DNA polymerase III subunit gamma/tau	NA|304aa|up_7|NZ_CP053452.1_3904052_3904964_+	COG3000, ERG3, Sterol desaturase [Lipid metabolism]	NA|77aa|up_6|NZ_CP053452.1_3904926_3905157_+	NA	NA|80aa|up_5|NZ_CP053452.1_3905170_3905410_+	NA	NA|538aa|up_4|NZ_CP053452.1_3905419_3907033_-	PRK00149, dnaA, chromosomal replication initiator protein DnaA	NA|296aa|up_3|NZ_CP053452.1_3907407_3908295_-	pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain	NA|285aa|up_2|NZ_CP053452.1_3908430_3909285_+	PRK00847, thyX, FAD-dependent thymidylate synthase; Reviewed	NA|430aa|up_1|NZ_CP053452.1_3909416_3910706_+	cd01834, SGNH_hydrolase_like_2, SGNH_hydrolase subfamily	NA|601aa|up_0|NZ_CP053452.1_3910879_3912682_+	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|623aa|down_0|NZ_CP053452.1_3914856_3916725_+	COG1620, LldP, L-lactate permease [Energy production and conversion]	NA|232aa|down_1|NZ_CP053452.1_3916773_3917469_-	cd14527, DSP_bac, unknown subfamily of bacterial and plant dual specificity protein phosphatases	NA|343aa|down_2|NZ_CP053452.1_3917468_3918497_-	pfam14100, PmoA, Methane oxygenase PmoA	NA|393aa|down_3|NZ_CP053452.1_3918806_3919985_+	cd01854, YjeQ_EngC, Ribosomal interacting GTPase YjeQ/EngC, a circularly permuted subfamily of the Ras GTPases	NA|932aa|down_4|NZ_CP053452.1_3920185_3922981_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|147aa|down_5|NZ_CP053452.1_3923518_3923959_+	TIGR04320, hypothetical_protein, SEC10/PgrA surface exclusion domain	NA|509aa|down_6|NZ_CP053452.1_3923940_3925467_-	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|369aa|down_7|NZ_CP053452.1_3925699_3926806_+	PRK12564, PRK12564, carbamoyl-phosphate synthase small subunit	NA|82aa|down_8|NZ_CP053452.1_3927382_3927628_+	NA	NA|354aa|down_9|NZ_CP053452.1_3927698_3928760_+	pfam13391, HNH_2, HNH endonuclease
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	21	4485190-4485275	21	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	GGTCGTTCTGCCGAGATCGGACCCGC	26	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|198aa|up_9|NZ_CP053452.1_4472492_4473086_-,NA|65aa|up_8|NZ_CP053452.1_4473254_4473449_-,NA|279aa|up_7|NZ_CP053452.1_4473541_4474378_-,NA|184aa|up_6|NZ_CP053452.1_4474551_4475103_+,NA|164aa|up_4|NZ_CP053452.1_4476239_4476731_+,NA|332aa|up_3|NZ_CP053452.1_4476778_4477774_+,NA|85aa|down_0|NZ_CP053452.1_4485492_4485747_-	NA|198aa|up_9|NZ_CP053452.1_4472492_4473086_-	NA	NA|65aa|up_8|NZ_CP053452.1_4473254_4473449_-	NA	NA|279aa|up_7|NZ_CP053452.1_4473541_4474378_-	NA	NA|184aa|up_6|NZ_CP053452.1_4474551_4475103_+	NA	NA|366aa|up_5|NZ_CP053452.1_4474991_4476089_+	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|164aa|up_4|NZ_CP053452.1_4476239_4476731_+	NA	NA|332aa|up_3|NZ_CP053452.1_4476778_4477774_+	NA	NA|190aa|up_2|NZ_CP053452.1_4477815_4478385_-	PLN00072, PLN00072, 3-isopropylmalate isomerase/dehydratase small subunit; Provisional	NA|658aa|up_1|NZ_CP053452.1_4478748_4480722_+	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|722aa|up_0|NZ_CP053452.1_4481050_4483216_-	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|85aa|down_0|NZ_CP053452.1_4485492_4485747_-	NA	NA|104aa|down_1|NZ_CP053452.1_4485964_4486276_-	TIGR02813, omega-3_polyunsaturated_fatty_acid_synthase_PfaA, polyketide-type polyunsaturated fatty acid synthase PfaA	NA|200aa|down_2|NZ_CP053452.1_4486494_4487094_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|571aa|down_3|NZ_CP053452.1_4487339_4489052_+	pfam08450, SGL, SMP-30/Gluconolaconase/LRE-like region	NA|315aa|down_4|NZ_CP053452.1_4489358_4490303_+	COG1506, DAP2, Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism]	NA|301aa|down_5|NZ_CP053452.1_4490474_4491377_+	sd00044, HEAT, HEAT repeats	NA|538aa|down_6|NZ_CP053452.1_4491406_4493020_-	cd07099, ALDH_DDALDH, Methylomonas sp	NA|417aa|down_7|NZ_CP053452.1_4493075_4494326_-	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|126aa|down_8|NZ_CP053452.1_4494624_4495002_+	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|1177aa|down_9|NZ_CP053452.1_4495118_4498649_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	22	4489107-4489187	22	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	GTAAGCCCCCCGAGGAAACGAGGAGC	26	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|164aa|up_9|NZ_CP053452.1_4476239_4476731_+,NA|332aa|up_8|NZ_CP053452.1_4476778_4477774_+,NA|85aa|up_3|NZ_CP053452.1_4485492_4485747_-,NA	NA|164aa|up_9|NZ_CP053452.1_4476239_4476731_+	NA	NA|332aa|up_8|NZ_CP053452.1_4476778_4477774_+	NA	NA|190aa|up_7|NZ_CP053452.1_4477815_4478385_-	PLN00072, PLN00072, 3-isopropylmalate isomerase/dehydratase small subunit; Provisional	NA|658aa|up_6|NZ_CP053452.1_4478748_4480722_+	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|722aa|up_5|NZ_CP053452.1_4481050_4483216_-	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|615aa|up_4|NZ_CP053452.1_4483588_4485433_+	PRK12323, PRK12323, DNA polymerase III subunit gamma/tau	NA|85aa|up_3|NZ_CP053452.1_4485492_4485747_-	NA	NA|104aa|up_2|NZ_CP053452.1_4485964_4486276_-	TIGR02813, omega-3_polyunsaturated_fatty_acid_synthase_PfaA, polyketide-type polyunsaturated fatty acid synthase PfaA	NA|200aa|up_1|NZ_CP053452.1_4486494_4487094_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|571aa|up_0|NZ_CP053452.1_4487339_4489052_+	pfam08450, SGL, SMP-30/Gluconolaconase/LRE-like region	NA|315aa|down_0|NZ_CP053452.1_4489358_4490303_+	COG1506, DAP2, Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism]	NA|301aa|down_1|NZ_CP053452.1_4490474_4491377_+	sd00044, HEAT, HEAT repeats	NA|538aa|down_2|NZ_CP053452.1_4491406_4493020_-	cd07099, ALDH_DDALDH, Methylomonas sp	NA|417aa|down_3|NZ_CP053452.1_4493075_4494326_-	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|126aa|down_4|NZ_CP053452.1_4494624_4495002_+	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|1177aa|down_5|NZ_CP053452.1_4495118_4498649_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|204aa|down_6|NZ_CP053452.1_4498728_4499340_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|459aa|down_7|NZ_CP053452.1_4499611_4500988_+	pfam07394, DUF1501, Protein of unknown function (DUF1501)	NA|353aa|down_8|NZ_CP053452.1_4501120_4502179_-	cd00955, Transaldolase_like, Transaldolase-like proteins from plants and bacteria	NA|121aa|down_9|NZ_CP053452.1_4502354_4502717_-	pfam13620, CarboxypepD_reg, Carboxypeptidase regulatory-like domain
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	23	5089777-5089878	23	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	CAGCCCCTCCCTGAAGGGAGGGG	23	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|116aa|up_8|NZ_CP053452.1_5079854_5080202_-,NA|150aa|up_6|NZ_CP053452.1_5081611_5082061_-,NA|179aa|up_4|NZ_CP053452.1_5083154_5083691_-,NA|178aa|up_3|NZ_CP053452.1_5083852_5084386_-,NA|230aa|down_8|NZ_CP053452.1_5108832_5109522_+,NA|359aa|down_9|NZ_CP053452.1_5109719_5110796_+	NA|126aa|up_9|NZ_CP053452.1_5079483_5079861_-	pfam05717, TnpB_IS66, IS66 Orf2 like protein	NA|116aa|up_8|NZ_CP053452.1_5079854_5080202_-	NA	NA|228aa|up_7|NZ_CP053452.1_5080170_5080854_-	pfam13400, Tad, Putative Flp pilus-assembly TadE/G-like	NA|150aa|up_6|NZ_CP053452.1_5081611_5082061_-	NA	NA|276aa|up_5|NZ_CP053452.1_5082238_5083066_+	cd07421, MPP_Rhilphs, Rhilph phosphatases, metallophosphatase domain	NA|179aa|up_4|NZ_CP053452.1_5083154_5083691_-	NA	NA|178aa|up_3|NZ_CP053452.1_5083852_5084386_-	NA	NA|412aa|up_2|NZ_CP053452.1_5084521_5085757_+	cd06177, MFS_NHS, Nucleoside:H(+) symporter family of the Major Facilitator Superfamily of transporters	NA|256aa|up_1|NZ_CP053452.1_5085937_5086705_+	COG0300, DltE, Short-chain dehydrogenases of various substrate specificities [General function prediction only]	NA|883aa|up_0|NZ_CP053452.1_5086920_5089569_+	pfam04151, PPC, Bacterial pre-peptidase C-terminal domain	NA|498aa|down_0|NZ_CP053452.1_5089944_5091438_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|1109aa|down_1|NZ_CP053452.1_5092001_5095328_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|283aa|down_2|NZ_CP053452.1_5095456_5096305_+	TIGR04179, hypothetical_protein_GM18DRAFT_2302, rhombotail lipoprotein	NA|954aa|down_3|NZ_CP053452.1_5096515_5099377_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|1112aa|down_4|NZ_CP053452.1_5099580_5102916_+	COG2319, COG2319, FOG: WD40 repeat [General function prediction only]	NA|536aa|down_5|NZ_CP053452.1_5103138_5104746_-	pfam13646, HEAT_2, HEAT repeats	NA|641aa|down_6|NZ_CP053452.1_5105045_5106968_+	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|256aa|down_7|NZ_CP053452.1_5107154_5107922_-	pfam13646, HEAT_2, HEAT repeats	NA|230aa|down_8|NZ_CP053452.1_5108832_5109522_+	NA	NA|359aa|down_9|NZ_CP053452.1_5109719_5110796_+	NA
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	24	5496032-5496559	24	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	GGTGACGGACGCCGGGTTAAAGGA	24	0	0	NA	NA	NA	7	7	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|297aa|up_7|NZ_CP053452.1_5485260_5486151_-,NA|76aa|down_0|NZ_CP053452.1_5497055_5497283_-,NA|148aa|down_1|NZ_CP053452.1_5497526_5497970_-,NA|401aa|down_6|NZ_CP053452.1_5508422_5509625_-	NA|219aa|up_9|NZ_CP053452.1_5484115_5484772_+	COG2226, UbiE, Methylase involved in ubiquinone/menaquinone biosynthesis [Coenzyme metabolism]	NA|103aa|up_8|NZ_CP053452.1_5484895_5485204_+	TIGR02176, pyruvate_flavodoxin/ferrodoxin_oxidoreductase, pyruvate:ferredoxin (flavodoxin) oxidoreductase, homodimeric	NA|297aa|up_7|NZ_CP053452.1_5485260_5486151_-	NA	NA|165aa|up_6|NZ_CP053452.1_5486379_5486874_+	TIGR03345, VI_ClpV1, type VI secretion ATPase, ClpV1 family	NA|107aa|up_5|NZ_CP053452.1_5487037_5487358_-	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|942aa|up_4|NZ_CP053452.1_5487504_5490330_-	cd07328, M48_Ste24p_like, M48 Ste24 endopeptidase-like, integral membrane metallopeptidase	NA|438aa|up_3|NZ_CP053452.1_5490464_5491778_+	COG0520, csdA, Selenocysteine lyase/Cysteine desulfurase [Posttranslational modification, protein turnover, chaperones]	NA|334aa|up_2|NZ_CP053452.1_5491826_5492828_-	pfam03929, PepSY_TM, PepSY-associated TM region	NA|58aa|up_1|NZ_CP053452.1_5493171_5493345_+	pfam10636, hemP, Hemin uptake protein hemP	NA|294aa|up_0|NZ_CP053452.1_5493644_5494526_-	PRK09328, PRK09328, N5-glutamine S-adenosyl-L-methionine-dependent methyltransferase; Provisional	NA|76aa|down_0|NZ_CP053452.1_5497055_5497283_-	NA	NA|148aa|down_1|NZ_CP053452.1_5497526_5497970_-	NA	NA|1048aa|down_2|NZ_CP053452.1_5498186_5501330_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|1162aa|down_3|NZ_CP053452.1_5502201_5505687_+	COG2319, COG2319, FOG: WD40 repeat [General function prediction only]	NA|579aa|down_4|NZ_CP053452.1_5505926_5507663_-	pfam13372, Alginate_exp, Alginate export	NA|159aa|down_5|NZ_CP053452.1_5507910_5508387_+	cd15457, NADAR, Escherichia coli swarming motility protein YbiA and related proteins	NA|401aa|down_6|NZ_CP053452.1_5508422_5509625_-	NA	NA|175aa|down_7|NZ_CP053452.1_5509921_5510446_-	pfam00719, Pyrophosphatase, Inorganic pyrophosphatase	NA|340aa|down_8|NZ_CP053452.1_5510608_5511628_+	COG2017, GalM, Galactose mutarotase and related enzymes [Carbohydrate transport and metabolism]	NA|146aa|down_9|NZ_CP053452.1_5511693_5512131_-	COG0071, IbpA, Molecular chaperone (small heat shock protein) [Posttranslational modification, protein turnover, chaperones]
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	25	5863324-5863455	25	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	GAGACCAAGGAAGCCGCGCTGGTGGCGCGCGAGGCCGAAATCGCG	45	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|115aa|up_9|NZ_CP053452.1_5852972_5853317_+,NA|57aa|up_1|NZ_CP053452.1_5861451_5861622_-,NA|148aa|down_8|NZ_CP053452.1_5875932_5876376_-	NA|115aa|up_9|NZ_CP053452.1_5852972_5853317_+	NA	NA|71aa|up_8|NZ_CP053452.1_5854195_5854408_+	pfam02599, CsrA, Global regulator protein family	NA|323aa|up_7|NZ_CP053452.1_5854751_5855720_+	cd04513, Glycosylasparaginase, Glycosylasparaginase and similar proteins	NA|361aa|up_6|NZ_CP053452.1_5855917_5857000_+	PRK12757, PRK12757, cell division protein FtsN; Provisional	NA|424aa|up_5|NZ_CP053452.1_5857121_5858393_+	pfam03692, CxxCxxCC, Putative zinc- or iron-chelating domain	NA|168aa|up_4|NZ_CP053452.1_5858540_5859044_+	cd06121, cupin_YML079wp, Saccharomyces cerevisiae YML079wp and related proteins, cupin domain	NA|399aa|up_3|NZ_CP053452.1_5859053_5860250_-	pfam10092, DUF2330, Uncharacterized protein conserved in bacteria (DUF2330)	NA|387aa|up_2|NZ_CP053452.1_5860294_5861455_-	pfam10092, DUF2330, Uncharacterized protein conserved in bacteria (DUF2330)	NA|57aa|up_1|NZ_CP053452.1_5861451_5861622_-	NA	NA|329aa|up_0|NZ_CP053452.1_5861618_5862605_-	COG0657, Aes, Esterase/lipase [Lipid metabolism]	NA|475aa|down_0|NZ_CP053452.1_5863759_5865184_-	pfam07394, DUF1501, Protein of unknown function (DUF1501)	NA|395aa|down_1|NZ_CP053452.1_5865413_5866598_-	pfam08668, HDOD, HDOD domain	NA|651aa|down_2|NZ_CP053452.1_5866758_5868711_-	PRK13557, PRK13557, histidine kinase; Provisional	NA|734aa|down_3|NZ_CP053452.1_5868898_5871100_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|249aa|down_4|NZ_CP053452.1_5871145_5871892_-	PRK12693, flgG, flagellar basal body rod protein FlgG; Provisional	NA|328aa|down_5|NZ_CP053452.1_5872570_5873554_-	cd19079, AKR_EcYajO-like, Escherichia coli YajO and similar proteins	NA|345aa|down_6|NZ_CP053452.1_5873591_5874626_-	cd19091, AKR_PsAKR, Polaromonas Sp	NA|341aa|down_7|NZ_CP053452.1_5874913_5875936_-	pfam12770, CHAT, CHAT domain	NA|148aa|down_8|NZ_CP053452.1_5875932_5876376_-	NA	NA|733aa|down_9|NZ_CP053452.1_5876450_5878649_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	26	6434624-6435603	3,3,26	CRT,PILER-CR,CRISPRCasFinder	no	cas3,csb2gr5,cas7,cas2,cas1	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Unclear	GCTTCAATTCGGCCACGGCGTTGAGCCGTGGAGAAC,GTTCTCCACGGCTCAACGCCGTGGCCGAATTGAAGC,GCTTCAATTCGGCCACGGCGTTGAGCCGTGGAGAAC	36,36,36	0	0	NA	NA	NA:NA:NA	13,12,12	13	Unclear	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|71aa|up_0|NZ_CP053452.1_6434207_6434420_-,NA|70aa|down_5|NZ_CP053452.1_6445339_6445549_-,NA|407aa|down_9|NZ_CP053452.1_6448851_6450072_-	NA|199aa|up_9|NZ_CP053452.1_6421999_6422596_+	cd07909, YciF, YciF bacterial stress response protein, ferritin-like iron-binding domain	NA|848aa|up_8|NZ_CP053452.1_6422808_6425352_-	COG0178, UvrA, Excinuclease ATPase subunit [DNA replication, recombination, and repair]	NA|355aa|up_7|NZ_CP053452.1_6425422_6426487_-	cd05283, CAD1, Cinnamyl alcohol dehydrogenases (CAD)	NA|285aa|up_6|NZ_CP053452.1_6426980_6427835_-	cd05269, TMR_SDR_a, triphenylmethane reductase (TMR)-like proteins, NMRa-like, atypical (a) SDRs	NA|154aa|up_5|NZ_CP053452.1_6428235_6428697_+	cd17580, REC_2_DhkD-like, second phosphoacceptor receiver (REC) domain of Dictyostelium discoideum hybrid signal transduction histidine kinase D and similar domains	NA|340aa|up_4|NZ_CP053452.1_6428787_6429807_+	cd19100, AKR_unchar, uncharacterized aldo-keto reductase (AKR) superfamily protein	NA|344aa|up_3|NZ_CP053452.1_6429855_6430887_+	cd19100, AKR_unchar, uncharacterized aldo-keto reductase (AKR) superfamily protein	NA|477aa|up_2|NZ_CP053452.1_6431158_6432589_-	COG0665, DadA, Glycine/D-amino acid oxidases (deaminating) [Amino acid transport and metabolism]	NA|59aa|up_1|NZ_CP053452.1_6434038_6434215_+	pfam05717, TnpB_IS66, IS66 Orf2 like protein	NA|71aa|up_0|NZ_CP053452.1_6434207_6434420_-	NA	cas3|1279aa|down_0|NZ_CP053452.1_6435844_6439681_-	TIGR02621, CRISPR-associated_helicase_Cas3, CRISPR-associated helicase Cas3, subtype Dpsyc	csb2gr5|544aa|down_1|NZ_CP053452.1_6439677_6441309_-	TIGR02165, CRISPR-associated_protein_GSU0054_family, CRISPR-associated protein GSU0054/csb2, Dpsyc system	cas7|407aa|down_2|NZ_CP053452.1_6441312_6442533_-	pfam09617, Cas_GSU0053, CRISPR-associated protein GSU0053 (Cas_GSU0053)	cas2|95aa|down_3|NZ_CP053452.1_6443295_6443580_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|574aa|down_4|NZ_CP053452.1_6443579_6445301_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	NA|70aa|down_5|NZ_CP053452.1_6445339_6445549_-	NA	NA|79aa|down_6|NZ_CP053452.1_6446325_6446562_+	COG2442, COG2442, Uncharacterized conserved protein [Function unknown]	NA|112aa|down_7|NZ_CP053452.1_6446585_6446921_+	pfam18480, DUF5615, Domain of unknown function (DUF5615)	NA|632aa|down_8|NZ_CP053452.1_6446927_6448823_-	COG0433, COG0433,  HerA helicase [Replication, recombination, and repair]	NA|407aa|down_9|NZ_CP053452.1_6448851_6450072_-	NA
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	27	6554993-6555085	27	CRISPRCasFinder	no	RT	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Unclear	GGTCAACCCGCAAGGGGAGGAAT	23	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|170aa|up_6|NZ_CP053452.1_6540143_6540653_+,NA|53aa|up_5|NZ_CP053452.1_6540682_6540841_-,NA|116aa|down_1|NZ_CP053452.1_6556556_6556904_+,NA|146aa|down_8|NZ_CP053452.1_6561195_6561633_-	NA|208aa|up_9|NZ_CP053452.1_6538412_6539036_+	TIGR03696, tRNA_nuclease_WapA, RHS repeat-associated core domain	NA|172aa|up_8|NZ_CP053452.1_6539058_6539574_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|115aa|up_7|NZ_CP053452.1_6539597_6539942_-	pfam13592, HTH_33, Winged helix-turn helix	NA|170aa|up_6|NZ_CP053452.1_6540143_6540653_+	NA	NA|53aa|up_5|NZ_CP053452.1_6540682_6540841_-	NA	NA|195aa|up_4|NZ_CP053452.1_6540878_6541463_+	pfam13592, HTH_33, Winged helix-turn helix	NA|507aa|up_3|NZ_CP053452.1_6541386_6542907_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|290aa|up_2|NZ_CP053452.1_6543076_6543946_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|2580aa|up_1|NZ_CP053452.1_6544450_6552190_-	TIGR03696, tRNA_nuclease_WapA, RHS repeat-associated core domain	RT|308aa|up_0|NZ_CP053452.1_6553662_6554586_+	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	RT|418aa|down_0|NZ_CP053452.1_6555087_6556341_+	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	NA|116aa|down_1|NZ_CP053452.1_6556556_6556904_+	NA	NA|126aa|down_2|NZ_CP053452.1_6556897_6557275_+	pfam05717, TnpB_IS66, IS66 Orf2 like protein	NA|519aa|down_3|NZ_CP053452.1_6557288_6558845_+	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|157aa|down_4|NZ_CP053452.1_6558997_6559468_-	pfam13356, Arm-DNA-bind_3, Arm DNA-binding domain	NA|155aa|down_5|NZ_CP053452.1_6559757_6560222_-	cd09874, PIN_MT3492-like, VapC-like PIN domain of the hypothetical protein MT3492 of Mycobacterium tuberculosis CDC1551 and other uncharacterized, annotated PilT protein domain proteins	NA|79aa|down_6|NZ_CP053452.1_6560238_6560475_-	pfam01954, DUF104, Protein of unknown function DUF104	NA|68aa|down_7|NZ_CP053452.1_6560843_6561047_-	pfam05930, Phage_AlpA, Prophage CP4-57 regulatory protein (AlpA)	NA|146aa|down_8|NZ_CP053452.1_6561195_6561633_-	NA	NA|86aa|down_9|NZ_CP053452.1_6561637_6561895_-	pfam02604, PhdYeFM_antitox, Antitoxin Phd_YefM, type II toxin-antitoxin system
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	28	6759779-6760014	4	PILER-CR	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	CCCGCGTCCGTCACCTGCGTG	21	0	0	NA	NA	NA	3	3	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|81aa|up_9|NZ_CP053452.1_6747174_6747417_-,NA|308aa|up_6|NZ_CP053452.1_6749438_6750362_-,NA|97aa|up_4|NZ_CP053452.1_6753488_6753779_-,NA|183aa|up_2|NZ_CP053452.1_6755255_6755804_-,NA|217aa|down_1|NZ_CP053452.1_6761215_6761866_+,NA|107aa|down_2|NZ_CP053452.1_6762162_6762483_-,NA|104aa|down_5|NZ_CP053452.1_6766388_6766700_+,NA|104aa|down_6|NZ_CP053452.1_6766729_6767041_+,NA|124aa|down_8|NZ_CP053452.1_6768161_6768533_-,NA|357aa|down_9|NZ_CP053452.1_6768501_6769572_-	NA|81aa|up_9|NZ_CP053452.1_6747174_6747417_-	NA	NA|420aa|up_8|NZ_CP053452.1_6747731_6748991_+	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|108aa|up_7|NZ_CP053452.1_6748987_6749311_+	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|308aa|up_6|NZ_CP053452.1_6749438_6750362_-	NA	NA|636aa|up_5|NZ_CP053452.1_6750653_6752561_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|97aa|up_4|NZ_CP053452.1_6753488_6753779_-	NA	NA|444aa|up_3|NZ_CP053452.1_6753920_6755252_+	pfam13546, DDE_5, DDE superfamily endonuclease	NA|183aa|up_2|NZ_CP053452.1_6755255_6755804_-	NA	NA|363aa|up_1|NZ_CP053452.1_6756018_6757107_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|144aa|up_0|NZ_CP053452.1_6757536_6757968_-	pfam14317, YcxB, YcxB-like protein	NA|63aa|down_0|NZ_CP053452.1_6760603_6760792_-	pfam12975, DUF3859, Domain of unknown function (DUF3859)	NA|217aa|down_1|NZ_CP053452.1_6761215_6761866_+	NA	NA|107aa|down_2|NZ_CP053452.1_6762162_6762483_-	NA	NA|147aa|down_3|NZ_CP053452.1_6762586_6763027_-	pfam10990, DUF2809, Protein of unknown function (DUF2809)	NA|936aa|down_4|NZ_CP053452.1_6763551_6766359_+	pfam05136, Phage_portal_2, Phage portal protein, lambda family	NA|104aa|down_5|NZ_CP053452.1_6766388_6766700_+	NA	NA|104aa|down_6|NZ_CP053452.1_6766729_6767041_+	NA	NA|385aa|down_7|NZ_CP053452.1_6767059_6768214_-	cd09008, MTAN, 5'-methylthioadenosine/S-adenosylhomocysteine nucleosidases	NA|124aa|down_8|NZ_CP053452.1_6768161_6768533_-	NA	NA|357aa|down_9|NZ_CP053452.1_6768501_6769572_-	NA
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	29	6776307-6776408	28	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	GGCTTTCCAGCCTGTGTACCTGGAA	25	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|104aa|up_9|NZ_CP053452.1_6766729_6767041_+,NA|124aa|up_7|NZ_CP053452.1_6768161_6768533_-,NA|357aa|up_6|NZ_CP053452.1_6768501_6769572_-,NA|182aa|up_5|NZ_CP053452.1_6769871_6770417_+,NA|184aa|up_4|NZ_CP053452.1_6770893_6771445_-,NA|390aa|up_3|NZ_CP053452.1_6771567_6772737_+,NA|258aa|up_2|NZ_CP053452.1_6772830_6773604_+,NA|138aa|up_1|NZ_CP053452.1_6773810_6774224_+,NA|124aa|down_0|NZ_CP053452.1_6776479_6776851_+,NA|359aa|down_1|NZ_CP053452.1_6776872_6777949_+,NA|71aa|down_2|NZ_CP053452.1_6777962_6778175_+,NA|115aa|down_3|NZ_CP053452.1_6778178_6778523_+,NA|150aa|down_4|NZ_CP053452.1_6778530_6778980_+,NA|79aa|down_6|NZ_CP053452.1_6780231_6780468_+,NA|162aa|down_7|NZ_CP053452.1_6780471_6780957_+,NA|125aa|down_8|NZ_CP053452.1_6780959_6781334_+,NA|159aa|down_9|NZ_CP053452.1_6781340_6781817_+	NA|104aa|up_9|NZ_CP053452.1_6766729_6767041_+	NA	NA|385aa|up_8|NZ_CP053452.1_6767059_6768214_-	cd09008, MTAN, 5'-methylthioadenosine/S-adenosylhomocysteine nucleosidases	NA|124aa|up_7|NZ_CP053452.1_6768161_6768533_-	NA	NA|357aa|up_6|NZ_CP053452.1_6768501_6769572_-	NA	NA|182aa|up_5|NZ_CP053452.1_6769871_6770417_+	NA	NA|184aa|up_4|NZ_CP053452.1_6770893_6771445_-	NA	NA|390aa|up_3|NZ_CP053452.1_6771567_6772737_+	NA	NA|258aa|up_2|NZ_CP053452.1_6772830_6773604_+	NA	NA|138aa|up_1|NZ_CP053452.1_6773810_6774224_+	NA	NA|349aa|up_0|NZ_CP053452.1_6774281_6775328_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|124aa|down_0|NZ_CP053452.1_6776479_6776851_+	NA	NA|359aa|down_1|NZ_CP053452.1_6776872_6777949_+	NA	NA|71aa|down_2|NZ_CP053452.1_6777962_6778175_+	NA	NA|115aa|down_3|NZ_CP053452.1_6778178_6778523_+	NA	NA|150aa|down_4|NZ_CP053452.1_6778530_6778980_+	NA	NA|349aa|down_5|NZ_CP053452.1_6779094_6780141_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|79aa|down_6|NZ_CP053452.1_6780231_6780468_+	NA	NA|162aa|down_7|NZ_CP053452.1_6780471_6780957_+	NA	NA|125aa|down_8|NZ_CP053452.1_6780959_6781334_+	NA	NA|159aa|down_9|NZ_CP053452.1_6781340_6781817_+	NA
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	30	7237472-7237583	29	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	ACGAACTTGGGGGCCACGGCCCGCGGGGCGGGCGCGAGGG	40	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|238aa|up_5|NZ_CP053452.1_7231095_7231809_+,NA|160aa|up_4|NZ_CP053452.1_7231840_7232320_+,NA|64aa|up_2|NZ_CP053452.1_7234037_7234229_+,NA|46aa|down_7|NZ_CP053452.1_7246165_7246303_-,NA|318aa|down_8|NZ_CP053452.1_7246593_7247547_+	NA|830aa|up_9|NZ_CP053452.1_7222492_7224982_+	cd04300, GT35_Glycogen_Phosphorylase, glycogen phosphorylase and similar proteins	NA|547aa|up_8|NZ_CP053452.1_7225238_7226879_+	cd05801, PGM_like3, This bacterial PGM-like (phosphoglucomutase-like) protein of unknown function belongs to the alpha-D-phosphohexomutase superfamily	NA|230aa|up_7|NZ_CP053452.1_7227083_7227773_+	pfam01182, Glucosamine_iso, Glucosamine-6-phosphate isomerases/6-phosphogluconolactonase	NA|938aa|up_6|NZ_CP053452.1_7227895_7230709_-	pfam16313, DUF4953, Met-zincin	NA|238aa|up_5|NZ_CP053452.1_7231095_7231809_+	NA	NA|160aa|up_4|NZ_CP053452.1_7231840_7232320_+	NA	NA|470aa|up_3|NZ_CP053452.1_7232442_7233852_+	COG3733, TynA, Cu2+-containing amine oxidase [Secondary metabolites biosynthesis, transport, and catabolism]	NA|64aa|up_2|NZ_CP053452.1_7234037_7234229_+	NA	NA|433aa|up_1|NZ_CP053452.1_7234289_7235588_-	pfam13360, PQQ_2, PQQ-like domain	NA|69aa|up_0|NZ_CP053452.1_7235757_7235964_-	PRK12309, PRK12309, transaldolase	NA|453aa|down_0|NZ_CP053452.1_7237760_7239119_-	pfam07394, DUF1501, Protein of unknown function (DUF1501)	NA|117aa|down_1|NZ_CP053452.1_7239150_7239501_-	TIGR02436, S23_ribosomal_protein, four helix bundle protein	NA|552aa|down_2|NZ_CP053452.1_7239657_7241313_-	pfam07583, PSCyt2, Protein of unknown function (DUF1549)	NA|155aa|down_3|NZ_CP053452.1_7241674_7242139_-	TIGR03067, Planc_TIGR03067, Planctomycetes uncharacterized domain TIGR03067	NA|874aa|down_4|NZ_CP053452.1_7242412_7245034_+	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|99aa|down_5|NZ_CP053452.1_7245138_7245435_+	smart00834, CxxC_CXXC_SSSS, Putative regulatory protein	NA|183aa|down_6|NZ_CP053452.1_7245595_7246144_+	PRK00222, PRK00222, peptide-methionine (R)-S-oxide reductase MsrB	NA|46aa|down_7|NZ_CP053452.1_7246165_7246303_-	NA	NA|318aa|down_8|NZ_CP053452.1_7246593_7247547_+	NA	NA|318aa|down_9|NZ_CP053452.1_7249347_7250301_+	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	31	7494200-7494313	30	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	CCCGGCGGACAGGTGCTCGCGACCGGG	27	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|106aa|up_8|NZ_CP053452.1_7481418_7481736_+,NA|59aa|down_0|NZ_CP053452.1_7494710_7494887_-,NA|634aa|down_2|NZ_CP053452.1_7497963_7499865_-,NA|1150aa|down_8|NZ_CP053452.1_7509569_7513019_-	NA|177aa|up_9|NZ_CP053452.1_7480722_7481253_-	pfam01882, DUF58, Protein of unknown function DUF58	NA|106aa|up_8|NZ_CP053452.1_7481418_7481736_+	NA	NA|323aa|up_7|NZ_CP053452.1_7481866_7482835_-	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|548aa|up_6|NZ_CP053452.1_7482952_7484596_-	PRK06676, rpsA, 30S ribosomal protein S1; Reviewed	NA|326aa|up_5|NZ_CP053452.1_7484746_7485724_+	pfam03706, LPG_synthase_TM, Lysylphosphatidylglycerol synthase TM region	NA|327aa|up_4|NZ_CP053452.1_7485716_7486697_+	cd04187, DPM1_like_bac, Bacterial DPM1_like enzymes are related to eukaryotic DPM1	NA|506aa|up_3|NZ_CP053452.1_7486686_7488204_+	COG1807, ArnT, 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family [Cell envelope biogenesis, outer membrane]	NA|384aa|up_2|NZ_CP053452.1_7488240_7489392_+	pfam02517, Abi, CAAX protease self-immunity	NA|899aa|up_1|NZ_CP053452.1_7489502_7492199_+	pfam07583, PSCyt2, Protein of unknown function (DUF1549)	NA|478aa|up_0|NZ_CP053452.1_7492324_7493758_+	pfam07394, DUF1501, Protein of unknown function (DUF1501)	NA|59aa|down_0|NZ_CP053452.1_7494710_7494887_-	NA	NA|381aa|down_1|NZ_CP053452.1_7495217_7496360_+	pfam13421, Band_7_1, SPFH domain-Band 7 family	NA|634aa|down_2|NZ_CP053452.1_7497963_7499865_-	NA	NA|437aa|down_3|NZ_CP053452.1_7500191_7501502_-	COG3291, COG3291, FOG: PKD repeat [General function prediction only]	NA|470aa|down_4|NZ_CP053452.1_7502414_7503824_-	cd01126, TraG_VirD4, The TraG/TraD/VirD4 family are bacterial conjugation proteins involved in type IV secretion	NA|146aa|down_5|NZ_CP053452.1_7504013_7504451_-	pfam01322, Cytochrom_C_2, Cytochrome C'	NA|901aa|down_6|NZ_CP053452.1_7505734_7508437_+	sd00006, TPR, Tetratricopeptide repeat	NA|184aa|down_7|NZ_CP053452.1_7508769_7509321_-	COG1943, COG1943, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|1150aa|down_8|NZ_CP053452.1_7509569_7513019_-	NA	NA|292aa|down_9|NZ_CP053452.1_7513157_7514033_-	COG0190, FolD, 5,10-methylene-tetrahydrofolate dehydrogenase/Methenyl tetrahydrofolate cyclohydrolase [Coenzyme metabolism]
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	32	7736654-7736751	31	CRISPRCasFinder	no	RT	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Unclear	ACTCGGGGGGCTTACGCCCCCCGCTCGCCTGC	32	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|282aa|up_7|NZ_CP053452.1_7726391_7727237_-,NA	NA|946aa|up_9|NZ_CP053452.1_7722881_7725719_+	pfam07583, PSCyt2, Protein of unknown function (DUF1549)	NA|118aa|up_8|NZ_CP053452.1_7725779_7726133_+	pfam05685, Uma2, Putative restriction endonuclease	NA|282aa|up_7|NZ_CP053452.1_7726391_7727237_-	NA	NA|654aa|up_6|NZ_CP053452.1_7727245_7729207_-	pfam12679, ABC2_membrane_2, ABC-2 family transporter protein	NA|316aa|up_5|NZ_CP053452.1_7729295_7730243_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|336aa|up_4|NZ_CP053452.1_7730335_7731343_-	cd12172, PGDH_like_2, Putative D-3-Phosphoglycerate Dehydrogenases, NAD-binding and catalytic domains	NA|140aa|up_3|NZ_CP053452.1_7731781_7732201_+	pfam02410, RsfS, Ribosomal silencing factor during starvation	NA|646aa|up_2|NZ_CP053452.1_7732335_7734273_+	COG0018, ArgS, Arginyl-tRNA synthetase [Translation, ribosomal structure and biogenesis]	NA|374aa|up_1|NZ_CP053452.1_7734370_7735492_+	PRK12297, obgE, GTPase CgtA; Reviewed	NA|256aa|up_0|NZ_CP053452.1_7735488_7736256_+	pfam03309, Pan_kinase, Type III pantothenate kinase	RT|441aa|down_0|NZ_CP053452.1_7737541_7738864_-	cd03487, RT_Bac_retron_II, RT_Bac_retron_II: Reverse transcriptases (RTs) in bacterial retrotransposons or retrons	NA|622aa|down_1|NZ_CP053452.1_7738992_7740858_-	pfam04434, SWIM, SWIM zinc finger	NA|116aa|down_2|NZ_CP053452.1_7740862_7741210_-	pfam10112, Halogen_Hydrol, 5-bromo-4-chloroindolyl phosphate hydrolysis protein	NA|2284aa|down_3|NZ_CP053452.1_7741206_7748058_-	PRK13800, PRK13800, fumarate reductase/succinate dehydrogenase flavoprotein subunit	NA|1094aa|down_4|NZ_CP053452.1_7748059_7751341_-	cd17748, BRCT_DNA_ligase_like, BRCT domain of bacterial NAD-dependent DNA ligase (LigA) and similar proteins	NA|326aa|down_5|NZ_CP053452.1_7751917_7752895_-	pfam08450, SGL, SMP-30/Gluconolaconase/LRE-like region	NA|264aa|down_6|NZ_CP053452.1_7752984_7753776_+	PRK07231, FabG-like, SDR family oxidoreductase	NA|840aa|down_7|NZ_CP053452.1_7754167_7756687_-	TIGR01840, poly3-hydroxybutyrate_depolymerase_A_precursor, esterase, PHB depolymerase family	NA|136aa|down_8|NZ_CP053452.1_7756857_7757265_+	PRK14951, PRK14951, DNA polymerase III subunits gamma and tau; Provisional	NA|345aa|down_9|NZ_CP053452.1_7757492_7758527_+	pfam12006, DUF3500, Protein of unknown function (DUF3500)
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	33	8588351-8588447	32	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	CCGACCACCGCCGCCGAGCTGTTCG	25	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|105aa|up_7|NZ_CP053452.1_8581703_8582018_-,NA|188aa|up_6|NZ_CP053452.1_8582834_8583398_+,NA|349aa|up_5|NZ_CP053452.1_8583504_8584551_+,NA|67aa|up_4|NZ_CP053452.1_8584654_8584855_+,NA|136aa|up_2|NZ_CP053452.1_8585253_8585661_+,NA|80aa|down_1|NZ_CP053452.1_8592182_8592422_-,NA|236aa|down_9|NZ_CP053452.1_8601489_8602197_+	NA|386aa|up_9|NZ_CP053452.1_8579423_8580580_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|236aa|up_8|NZ_CP053452.1_8580774_8581482_+	pfam09731, Mitofilin, Mitochondrial inner membrane protein	NA|105aa|up_7|NZ_CP053452.1_8581703_8582018_-	NA	NA|188aa|up_6|NZ_CP053452.1_8582834_8583398_+	NA	NA|349aa|up_5|NZ_CP053452.1_8583504_8584551_+	NA	NA|67aa|up_4|NZ_CP053452.1_8584654_8584855_+	NA	NA|110aa|up_3|NZ_CP053452.1_8584847_8585177_+	pfam12728, HTH_17, Helix-turn-helix domain	NA|136aa|up_2|NZ_CP053452.1_8585253_8585661_+	NA	NA|67aa|up_1|NZ_CP053452.1_8586138_8586339_+	cd01029, TOPRIM_primases, TOPRIM_primases: The topoisomerase-primase (TORPIM) nucleotidyl transferase/hydrolase domain found in the active site regions of bacterial DnaG-type primases and their homologs	NA|630aa|up_0|NZ_CP053452.1_8586404_8588294_+	TIGR01613, putative_primase, phage/plasmid primase, P4 family, C-terminal domain	NA|279aa|down_0|NZ_CP053452.1_8588754_8589591_+	COG0338, Dam, Site-specific DNA methylase [DNA replication, recombination, and repair]	NA|80aa|down_1|NZ_CP053452.1_8592182_8592422_-	NA	NA|399aa|down_2|NZ_CP053452.1_8592656_8593853_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|386aa|down_3|NZ_CP053452.1_8594364_8595522_-	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|430aa|down_4|NZ_CP053452.1_8595950_8597240_-	PRK00197, proA, gamma-glutamyl phosphate reductase; Provisional	NA|283aa|down_5|NZ_CP053452.1_8597250_8598099_-	TIGR01496, Dihydropteroate_synthase, dihydropteroate synthase	NA|192aa|down_6|NZ_CP053452.1_8598181_8598757_-	cd00340, GSH_Peroxidase, Glutathione (GSH) peroxidase family; tetrameric selenoenzymes that catalyze the reduction of a variety of hydroperoxides including lipid peroxidases, using GSH as a specific electron donor substrate	NA|213aa|down_7|NZ_CP053452.1_8599002_8599641_-	PRK05679, PRK05679, pyridoxal 5'-phosphate synthase	NA|320aa|down_8|NZ_CP053452.1_8599912_8600872_-	COG3591, COG3591, V8-like Glu-specific endopeptidase [Amino acid transport and metabolism]	NA|236aa|down_9|NZ_CP053452.1_8601489_8602197_+	NA
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	34	9114408-9114648	33	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	CGTGACGGACGCGGGGCTGAAGGA	24	0	0	NA	NA	NA	3	3	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|373aa|up_3|NZ_CP053452.1_9106548_9107667_+,NA|199aa|down_1|NZ_CP053452.1_9117196_9117793_-,NA|161aa|down_4|NZ_CP053452.1_9119988_9120471_-,NA|246aa|down_5|NZ_CP053452.1_9120552_9121290_-	NA|333aa|up_9|NZ_CP053452.1_9093341_9094340_-	cd01049, RNRR2, Ribonucleotide Reductase, R2/beta subunit, ferritin-like diiron-binding domain	NA|770aa|up_8|NZ_CP053452.1_9094512_9096822_-	PLN02437, PLN02437, ribonucleoside--diphosphate reductase large subunit	NA|138aa|up_7|NZ_CP053452.1_9097330_9097744_-	PRK05222, PRK05222, 5-methyltetrahydropteroyltriglutamate--homocysteine S-methyltransferase; Provisional	NA|999aa|up_6|NZ_CP053452.1_9098150_9101147_+	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|582aa|up_5|NZ_CP053452.1_9101909_9103655_-	COG1520, COG1520, FOG: WD40-like repeat [Function unknown]	NA|773aa|up_4|NZ_CP053452.1_9103683_9106002_-	pfam13360, PQQ_2, PQQ-like domain	NA|373aa|up_3|NZ_CP053452.1_9106548_9107667_+	NA	NA|386aa|up_2|NZ_CP053452.1_9107951_9109109_+	pfam01070, FMN_dh, FMN-dependent dehydrogenase	NA|351aa|up_1|NZ_CP053452.1_9109258_9110311_+	PRK02492, PRK02492, deoxyhypusine synthase	NA|834aa|up_0|NZ_CP053452.1_9110633_9113135_+	TIGR03396, PC_PLC, phospholipase C, phosphocholine-specific, Pseudomonas-type	NA|568aa|down_0|NZ_CP053452.1_9115336_9117040_+	pfam12831, FAD_oxidored, FAD dependent oxidoreductase	NA|199aa|down_1|NZ_CP053452.1_9117196_9117793_-	NA	NA|135aa|down_2|NZ_CP053452.1_9118051_9118456_-	pfam00903, Glyoxalase, Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily	NA|262aa|down_3|NZ_CP053452.1_9118980_9119766_-	pfam12975, DUF3859, Domain of unknown function (DUF3859)	NA|161aa|down_4|NZ_CP053452.1_9119988_9120471_-	NA	NA|246aa|down_5|NZ_CP053452.1_9120552_9121290_-	NA	NA|202aa|down_6|NZ_CP053452.1_9121509_9122115_-	pfam04336, ACP_PD, Acyl carrier protein phosphodiesterase	NA|433aa|down_7|NZ_CP053452.1_9122396_9123695_-	COG2208, RsbU, Serine phosphatase RsbU, regulator of sigma subunit [Signal transduction mechanisms / Transcription]	NA|477aa|down_8|NZ_CP053452.1_9124305_9125736_+	cd06572, Histidinol_dh, Histidinol dehydrogenase, HisD, E	NA|407aa|down_9|NZ_CP053452.1_9125738_9126959_+	PRK05387, PRK05387, histidinol-phosphate aminotransferase; Provisional
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	35	9114768-9114863	34	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	CGTGACGGACGCGGGGCTGAAGGA	24	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|373aa|up_3|NZ_CP053452.1_9106548_9107667_+,NA|199aa|down_1|NZ_CP053452.1_9117196_9117793_-,NA|161aa|down_4|NZ_CP053452.1_9119988_9120471_-,NA|246aa|down_5|NZ_CP053452.1_9120552_9121290_-	NA|333aa|up_9|NZ_CP053452.1_9093341_9094340_-	cd01049, RNRR2, Ribonucleotide Reductase, R2/beta subunit, ferritin-like diiron-binding domain	NA|770aa|up_8|NZ_CP053452.1_9094512_9096822_-	PLN02437, PLN02437, ribonucleoside--diphosphate reductase large subunit	NA|138aa|up_7|NZ_CP053452.1_9097330_9097744_-	PRK05222, PRK05222, 5-methyltetrahydropteroyltriglutamate--homocysteine S-methyltransferase; Provisional	NA|999aa|up_6|NZ_CP053452.1_9098150_9101147_+	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|582aa|up_5|NZ_CP053452.1_9101909_9103655_-	COG1520, COG1520, FOG: WD40-like repeat [Function unknown]	NA|773aa|up_4|NZ_CP053452.1_9103683_9106002_-	pfam13360, PQQ_2, PQQ-like domain	NA|373aa|up_3|NZ_CP053452.1_9106548_9107667_+	NA	NA|386aa|up_2|NZ_CP053452.1_9107951_9109109_+	pfam01070, FMN_dh, FMN-dependent dehydrogenase	NA|351aa|up_1|NZ_CP053452.1_9109258_9110311_+	PRK02492, PRK02492, deoxyhypusine synthase	NA|834aa|up_0|NZ_CP053452.1_9110633_9113135_+	TIGR03396, PC_PLC, phospholipase C, phosphocholine-specific, Pseudomonas-type	NA|568aa|down_0|NZ_CP053452.1_9115336_9117040_+	pfam12831, FAD_oxidored, FAD dependent oxidoreductase	NA|199aa|down_1|NZ_CP053452.1_9117196_9117793_-	NA	NA|135aa|down_2|NZ_CP053452.1_9118051_9118456_-	pfam00903, Glyoxalase, Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily	NA|262aa|down_3|NZ_CP053452.1_9118980_9119766_-	pfam12975, DUF3859, Domain of unknown function (DUF3859)	NA|161aa|down_4|NZ_CP053452.1_9119988_9120471_-	NA	NA|246aa|down_5|NZ_CP053452.1_9120552_9121290_-	NA	NA|202aa|down_6|NZ_CP053452.1_9121509_9122115_-	pfam04336, ACP_PD, Acyl carrier protein phosphodiesterase	NA|433aa|down_7|NZ_CP053452.1_9122396_9123695_-	COG2208, RsbU, Serine phosphatase RsbU, regulator of sigma subunit [Signal transduction mechanisms / Transcription]	NA|477aa|down_8|NZ_CP053452.1_9124305_9125736_+	cd06572, Histidinol_dh, Histidinol dehydrogenase, HisD, E	NA|407aa|down_9|NZ_CP053452.1_9125738_9126959_+	PRK05387, PRK05387, histidinol-phosphate aminotransferase; Provisional
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	36	9325241-9325353	35	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	AGGGCGCGACTCGCGGAGCGCGTCGCCTACTTT	33	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|915aa|up_6|NZ_CP053452.1_9312107_9314852_+,NA|377aa|down_0|NZ_CP053452.1_9325380_9326511_-,NA|106aa|down_7|NZ_CP053452.1_9337938_9338256_-	NA|202aa|up_9|NZ_CP053452.1_9309456_9310062_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|303aa|up_8|NZ_CP053452.1_9310561_9311470_+	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|148aa|up_7|NZ_CP053452.1_9311499_9311943_+	pfam13620, CarboxypepD_reg, Carboxypeptidase regulatory-like domain	NA|915aa|up_6|NZ_CP053452.1_9312107_9314852_+	NA	NA|136aa|up_5|NZ_CP053452.1_9314984_9315392_-	pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain	NA|616aa|up_4|NZ_CP053452.1_9315479_9317327_+	pfam05960, DUF885, Bacterial protein of unknown function (DUF885)	NA|785aa|up_3|NZ_CP053452.1_9317493_9319848_+	pfam12810, Gly_rich, Glycine rich protein	NA|309aa|up_2|NZ_CP053452.1_9319990_9320917_-	pfam01261, AP_endonuc_2, Xylose isomerase-like TIM barrel	NA|785aa|up_1|NZ_CP053452.1_9321413_9323768_+	cd16025, PAS_like, Bacterial Arylsulfatase of Pseudomonas aeruginosa and related proteins	NA|384aa|up_0|NZ_CP053452.1_9324035_9325187_+	PRK09105, PRK09105, pyridoxal phosphate-dependent aminotransferase	NA|377aa|down_0|NZ_CP053452.1_9325380_9326511_-	NA	NA|618aa|down_1|NZ_CP053452.1_9326507_9328361_-	pfam07593, UnbV_ASPIC, ASPIC and UnbV	NA|1171aa|down_2|NZ_CP053452.1_9328357_9331870_-	pfam07593, UnbV_ASPIC, ASPIC and UnbV	NA|500aa|down_3|NZ_CP053452.1_9331895_9333395_-	COG0644, FixC, Dehydrogenases (flavoproteins) [Energy production and conversion]	NA|301aa|down_4|NZ_CP053452.1_9333391_9334294_-	COG1244, COG1244, Predicted Fe-S oxidoreductase [General function prediction only]	NA|442aa|down_5|NZ_CP053452.1_9334430_9335756_-	cd01991, Asn_Synthase_B_C, The C-terminal domain of Asparagine Synthase B	NA|629aa|down_6|NZ_CP053452.1_9336017_9337904_+	pfam07593, UnbV_ASPIC, ASPIC and UnbV	NA|106aa|down_7|NZ_CP053452.1_9337938_9338256_-	NA	NA|317aa|down_8|NZ_CP053452.1_9338597_9339548_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|136aa|down_9|NZ_CP053452.1_9340057_9340465_+	cd17580, REC_2_DhkD-like, second phosphoacceptor receiver (REC) domain of Dictyostelium discoideum hybrid signal transduction histidine kinase D and similar domains
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	37	9827214-9827363	36	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	CCCGCACCACCTACGCCTACGAC	23	1	1	9827300-9827339	NZ_CP053452.1_9826922-9826961	NA	2	2	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|85aa|up_8|NZ_CP053452.1_9808962_9809217_+,NA|187aa|up_4|NZ_CP053452.1_9814656_9815217_-,NA|70aa|up_2|NZ_CP053452.1_9816167_9816377_+,NA|91aa|up_1|NZ_CP053452.1_9817893_9818166_-,NA	NA|563aa|up_9|NZ_CP053452.1_9807281_9808970_-	COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ chaperones [Posttranslational modification, protein turnover, chaperones]	NA|85aa|up_8|NZ_CP053452.1_9808962_9809217_+	NA	NA|497aa|up_7|NZ_CP053452.1_9809600_9811091_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|399aa|up_6|NZ_CP053452.1_9811800_9812997_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|260aa|up_5|NZ_CP053452.1_9813009_9813789_-	pfam13704, Glyco_tranf_2_4, Glycosyl transferase family 2	NA|187aa|up_4|NZ_CP053452.1_9814656_9815217_-	NA	NA|183aa|up_3|NZ_CP053452.1_9815226_9815775_-	COG3306, COG3306, Glycosyltransferase involved in LPS biosynthesis [Cell envelope biogenesis, outer membrane]	NA|70aa|up_2|NZ_CP053452.1_9816167_9816377_+	NA	NA|91aa|up_1|NZ_CP053452.1_9817893_9818166_-	NA	NA|271aa|up_0|NZ_CP053452.1_9818855_9819668_+	cd03789, GT9_LPS_heptosyltransferase, lipopolysaccharide heptosyltransferase and similar proteins	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	38	9827466-9827615	37	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	CCCGCACCACCTACGCCTACGAC	23	0	0	NA	NA	NA	2	2	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|85aa|up_8|NZ_CP053452.1_9808962_9809217_+,NA|187aa|up_4|NZ_CP053452.1_9814656_9815217_-,NA|70aa|up_2|NZ_CP053452.1_9816167_9816377_+,NA|91aa|up_1|NZ_CP053452.1_9817893_9818166_-,NA	NA|563aa|up_9|NZ_CP053452.1_9807281_9808970_-	COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ chaperones [Posttranslational modification, protein turnover, chaperones]	NA|85aa|up_8|NZ_CP053452.1_9808962_9809217_+	NA	NA|497aa|up_7|NZ_CP053452.1_9809600_9811091_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|399aa|up_6|NZ_CP053452.1_9811800_9812997_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|260aa|up_5|NZ_CP053452.1_9813009_9813789_-	pfam13704, Glyco_tranf_2_4, Glycosyl transferase family 2	NA|187aa|up_4|NZ_CP053452.1_9814656_9815217_-	NA	NA|183aa|up_3|NZ_CP053452.1_9815226_9815775_-	COG3306, COG3306, Glycosyltransferase involved in LPS biosynthesis [Cell envelope biogenesis, outer membrane]	NA|70aa|up_2|NZ_CP053452.1_9816167_9816377_+	NA	NA|91aa|up_1|NZ_CP053452.1_9817893_9818166_-	NA	NA|271aa|up_0|NZ_CP053452.1_9818855_9819668_+	cd03789, GT9_LPS_heptosyltransferase, lipopolysaccharide heptosyltransferase and similar proteins	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	39	9827718-9827804	38	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	CCCGCACCACCTACGCCTACGAC	23	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|85aa|up_8|NZ_CP053452.1_9808962_9809217_+,NA|187aa|up_4|NZ_CP053452.1_9814656_9815217_-,NA|70aa|up_2|NZ_CP053452.1_9816167_9816377_+,NA|91aa|up_1|NZ_CP053452.1_9817893_9818166_-,NA	NA|563aa|up_9|NZ_CP053452.1_9807281_9808970_-	COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ chaperones [Posttranslational modification, protein turnover, chaperones]	NA|85aa|up_8|NZ_CP053452.1_9808962_9809217_+	NA	NA|497aa|up_7|NZ_CP053452.1_9809600_9811091_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|399aa|up_6|NZ_CP053452.1_9811800_9812997_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|260aa|up_5|NZ_CP053452.1_9813009_9813789_-	pfam13704, Glyco_tranf_2_4, Glycosyl transferase family 2	NA|187aa|up_4|NZ_CP053452.1_9814656_9815217_-	NA	NA|183aa|up_3|NZ_CP053452.1_9815226_9815775_-	COG3306, COG3306, Glycosyltransferase involved in LPS biosynthesis [Cell envelope biogenesis, outer membrane]	NA|70aa|up_2|NZ_CP053452.1_9816167_9816377_+	NA	NA|91aa|up_1|NZ_CP053452.1_9817893_9818166_-	NA	NA|271aa|up_0|NZ_CP053452.1_9818855_9819668_+	cd03789, GT9_LPS_heptosyltransferase, lipopolysaccharide heptosyltransferase and similar proteins	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	40	9827907-9828056	39	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	CCCGCACCACCTACGCCTACGAC	23	0	0	NA	NA	NA	2	2	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|85aa|up_8|NZ_CP053452.1_9808962_9809217_+,NA|187aa|up_4|NZ_CP053452.1_9814656_9815217_-,NA|70aa|up_2|NZ_CP053452.1_9816167_9816377_+,NA|91aa|up_1|NZ_CP053452.1_9817893_9818166_-,NA	NA|563aa|up_9|NZ_CP053452.1_9807281_9808970_-	COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ chaperones [Posttranslational modification, protein turnover, chaperones]	NA|85aa|up_8|NZ_CP053452.1_9808962_9809217_+	NA	NA|497aa|up_7|NZ_CP053452.1_9809600_9811091_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|399aa|up_6|NZ_CP053452.1_9811800_9812997_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|260aa|up_5|NZ_CP053452.1_9813009_9813789_-	pfam13704, Glyco_tranf_2_4, Glycosyl transferase family 2	NA|187aa|up_4|NZ_CP053452.1_9814656_9815217_-	NA	NA|183aa|up_3|NZ_CP053452.1_9815226_9815775_-	COG3306, COG3306, Glycosyltransferase involved in LPS biosynthesis [Cell envelope biogenesis, outer membrane]	NA|70aa|up_2|NZ_CP053452.1_9816167_9816377_+	NA	NA|91aa|up_1|NZ_CP053452.1_9817893_9818166_-	NA	NA|271aa|up_0|NZ_CP053452.1_9818855_9819668_+	cd03789, GT9_LPS_heptosyltransferase, lipopolysaccharide heptosyltransferase and similar proteins	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	41	9828156-9828506	40	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	CCCGCACCACCTACGCCTACGAC	23	2	2	9828368-9828419|9828443-9828482	NZ_CP053452.1_18420-18471|NZ_CP053452.1_18494-18533	NA	5	5	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|85aa|up_8|NZ_CP053452.1_9808962_9809217_+,NA|187aa|up_4|NZ_CP053452.1_9814656_9815217_-,NA|70aa|up_2|NZ_CP053452.1_9816167_9816377_+,NA|91aa|up_1|NZ_CP053452.1_9817893_9818166_-,NA	NA|563aa|up_9|NZ_CP053452.1_9807281_9808970_-	COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ chaperones [Posttranslational modification, protein turnover, chaperones]	NA|85aa|up_8|NZ_CP053452.1_9808962_9809217_+	NA	NA|497aa|up_7|NZ_CP053452.1_9809600_9811091_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|399aa|up_6|NZ_CP053452.1_9811800_9812997_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|260aa|up_5|NZ_CP053452.1_9813009_9813789_-	pfam13704, Glyco_tranf_2_4, Glycosyl transferase family 2	NA|187aa|up_4|NZ_CP053452.1_9814656_9815217_-	NA	NA|183aa|up_3|NZ_CP053452.1_9815226_9815775_-	COG3306, COG3306, Glycosyltransferase involved in LPS biosynthesis [Cell envelope biogenesis, outer membrane]	NA|70aa|up_2|NZ_CP053452.1_9816167_9816377_+	NA	NA|91aa|up_1|NZ_CP053452.1_9817893_9818166_-	NA	NA|271aa|up_0|NZ_CP053452.1_9818855_9819668_+	cd03789, GT9_LPS_heptosyltransferase, lipopolysaccharide heptosyltransferase and similar proteins	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA
GCF_013128195.1_ASM1312819v1	NZ_CP053452	Gemmataceae bacterium strain PL17 chromosome, complete genome	42	9828609-9828691	41	CRISPRCasFinder	no		csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	Orphan	CCCGCACCACCTACGCCTACGAC	23	1	1	9828632-9828668	NZ_CP053452.1_18683-18719	NA	1	1	Orphan	csa3,cas3,cas1,cas2,cas8u1,cas7,csb2gr5,RT,PD-DExK,DEDDh,DinG	NA|85aa|up_8|NZ_CP053452.1_9808962_9809217_+,NA|187aa|up_4|NZ_CP053452.1_9814656_9815217_-,NA|70aa|up_2|NZ_CP053452.1_9816167_9816377_+,NA|91aa|up_1|NZ_CP053452.1_9817893_9818166_-,NA	NA|563aa|up_9|NZ_CP053452.1_9807281_9808970_-	COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ chaperones [Posttranslational modification, protein turnover, chaperones]	NA|85aa|up_8|NZ_CP053452.1_9808962_9809217_+	NA	NA|497aa|up_7|NZ_CP053452.1_9809600_9811091_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|399aa|up_6|NZ_CP053452.1_9811800_9812997_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|260aa|up_5|NZ_CP053452.1_9813009_9813789_-	pfam13704, Glyco_tranf_2_4, Glycosyl transferase family 2	NA|187aa|up_4|NZ_CP053452.1_9814656_9815217_-	NA	NA|183aa|up_3|NZ_CP053452.1_9815226_9815775_-	COG3306, COG3306, Glycosyltransferase involved in LPS biosynthesis [Cell envelope biogenesis, outer membrane]	NA|70aa|up_2|NZ_CP053452.1_9816167_9816377_+	NA	NA|91aa|up_1|NZ_CP053452.1_9817893_9818166_-	NA	NA|271aa|up_0|NZ_CP053452.1_9818855_9819668_+	cd03789, GT9_LPS_heptosyltransferase, lipopolysaccharide heptosyltransferase and similar proteins	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA
