assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001610855.1_ASM161085v1	NZ_CP011271	Gemmata sp. SH-PL17 chromosome, complete genome	1	400270-400384	1	CRISPRCasFinder	no		cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	Orphan	GGCGAACGGCCGGTGTGAGCCGGCCGGTG	29	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	NA|67aa|up_7|NZ_CP011271.1_390425_390626_-,NA|200aa|up_4|NZ_CP011271.1_392964_393564_-,NA|165aa|up_3|NZ_CP011271.1_393560_394055_-,NA	NA|78aa|up_9|NZ_CP011271.1_388697_388931_+	TIGR04137, hypothetical_protein, Chlam_Verruc_Plancto small basic protein	NA|435aa|up_8|NZ_CP011271.1_389018_390323_+	COG0801, FolK, 7,8-dihydro-6-hydroxymethylpterin-pyrophosphokinase [Coenzyme metabolism]	NA|67aa|up_7|NZ_CP011271.1_390425_390626_-	NA	NA|358aa|up_6|NZ_CP011271.1_390700_391774_-	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|270aa|up_5|NZ_CP011271.1_392158_392968_-	pfam18299, R2K_2, ATP-grasp domain, R2K clade family 2	NA|200aa|up_4|NZ_CP011271.1_392964_393564_-	NA	NA|165aa|up_3|NZ_CP011271.1_393560_394055_-	NA	NA|620aa|up_2|NZ_CP011271.1_394082_395942_-	cd10170, HSP70_NBD, Nucleotide-binding domain of the HSP70 family	NA|904aa|up_1|NZ_CP011271.1_396221_398933_+	COG0421, SpeE, Spermidine synthase [Amino acid transport and metabolism]	NA|396aa|up_0|NZ_CP011271.1_399072_400260_-	COG1092, COG1092, Predicted SAM-dependent methyltransferases [General function prediction only]	NA|143aa|down_0|NZ_CP011271.1_400586_401015_-	cd17538, REC_D1_PleD-like, first (D1) phosphoacceptor receiver (REC) domain of response regulator PleD and similar domains	NA|342aa|down_1|NZ_CP011271.1_401193_402219_+	cd10001, HDAC_classII_APAH, Histone deacetylase class IIa	NA|290aa|down_2|NZ_CP011271.1_402237_403107_+	PRK09506, mrcB, bifunctional glycosyl transferase/transpeptidase; Reviewed	NA|116aa|down_3|NZ_CP011271.1_403294_403642_-	cd13834, HU_like, DNA-binding proteins similar to HU domains	NA|386aa|down_4|NZ_CP011271.1_404027_405185_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|280aa|down_5|NZ_CP011271.1_405333_406173_-	pfam00263, Secretin, Bacterial type II and III secretion system protein	NA|239aa|down_6|NZ_CP011271.1_406443_407160_+	PTZ00146, PTZ00146, fibrillarin; Provisional	NA|115aa|down_7|NZ_CP011271.1_407590_407935_+	pfam02599, CsrA, Global regulator protein family	NA|40aa|down_8|NZ_CP011271.1_408761_408881_+	pfam04945, YHS, YHS domain	NA|550aa|down_9|NZ_CP011271.1_409071_410721_+	pfam08811, DUF1800, Protein of unknown function (DUF1800)
GCF_001610855.1_ASM161085v1	NZ_CP011271	Gemmata sp. SH-PL17 chromosome, complete genome	2	1048641-1048726	2	CRISPRCasFinder	no		cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	Orphan	CGCGTTGGTATCGTTGATTGCGT	23	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	NA|152aa|up_6|NZ_CP011271.1_1037412_1037868_+,NA|566aa|up_2|NZ_CP011271.1_1040734_1042432_-,NA|639aa|up_1|NZ_CP011271.1_1042428_1044345_-,NA|141aa|down_2|NZ_CP011271.1_1050265_1050688_+	NA|370aa|up_9|NZ_CP011271.1_1031568_1032678_+	pfam07394, DUF1501, Protein of unknown function (DUF1501)	NA|767aa|up_8|NZ_CP011271.1_1032725_1035026_+	COG1506, DAP2, Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism]	NA|305aa|up_7|NZ_CP011271.1_1036478_1037393_+	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|152aa|up_6|NZ_CP011271.1_1037412_1037868_+	NA	NA|319aa|up_5|NZ_CP011271.1_1037991_1038948_+	cd07396, MPP_Nbla03831, Homo sapiens Nbla03831 and related proteins, metallophosphatase domain	NA|220aa|up_4|NZ_CP011271.1_1039048_1039708_+	cd16841, RraA_family, ribonuclease activity regulator RraA family	NA|288aa|up_3|NZ_CP011271.1_1039825_1040689_-	cd07197, nitrilase, Nitrilase superfamily, including nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes	NA|566aa|up_2|NZ_CP011271.1_1040734_1042432_-	NA	NA|639aa|up_1|NZ_CP011271.1_1042428_1044345_-	NA	NA|1293aa|up_0|NZ_CP011271.1_1044716_1048595_+	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	NA|151aa|down_0|NZ_CP011271.1_1048820_1049273_-	pfam01797, Y1_Tnp, Transposase IS200 like	NA|209aa|down_1|NZ_CP011271.1_1049648_1050275_+	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|141aa|down_2|NZ_CP011271.1_1050265_1050688_+	NA	NA|1440aa|down_3|NZ_CP011271.1_1050900_1055220_+	pfam13490, zf-HC2, Putative zinc-finger	NA|262aa|down_4|NZ_CP011271.1_1055318_1056104_+	TIGR03000, plancto_dom_1, Planctomycetes uncharacterized domain TIGR03000	NA|471aa|down_5|NZ_CP011271.1_1056200_1057613_-	pfam07394, DUF1501, Protein of unknown function (DUF1501)	NA|572aa|down_6|NZ_CP011271.1_1057862_1059578_-	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|185aa|down_7|NZ_CP011271.1_1059696_1060251_-	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|325aa|down_8|NZ_CP011271.1_1061608_1062583_-	PRK10856, PRK10856, cytoskeleton protein RodZ	NA|278aa|down_9|NZ_CP011271.1_1063015_1063849_+	PRK14289, PRK14289, molecular chaperone DnaJ
GCF_001610855.1_ASM161085v1	NZ_CP011271	Gemmata sp. SH-PL17 chromosome, complete genome	3	1088588-1088677	3	CRISPRCasFinder	no		cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	Orphan	GCGGCCACCGCCACCACCGTACCCGCCGCCACC	33	1	2	1088621-1088644|1088621-1088644	NZ_CP011271.1_1088660-1088683|NZ_CP011271.1_6599389-6599412	NA	1	1	Orphan	cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	NA|86aa|up_8|NZ_CP011271.1_1080934_1081192_+,NA|132aa|up_3|NZ_CP011271.1_1085920_1086316_+,NA|130aa|up_2|NZ_CP011271.1_1086483_1086873_+,NA|160aa|up_1|NZ_CP011271.1_1087223_1087703_+,NA|85aa|down_0|NZ_CP011271.1_1089095_1089350_-,NA|87aa|down_2|NZ_CP011271.1_1091075_1091336_+,NA|181aa|down_6|NZ_CP011271.1_1093878_1094421_-	NA|260aa|up_9|NZ_CP011271.1_1079757_1080537_-	COG0622, COG0622, Predicted phosphoesterase [General function prediction only]	NA|86aa|up_8|NZ_CP011271.1_1080934_1081192_+	NA	NA|308aa|up_7|NZ_CP011271.1_1081435_1082359_-	pfam04972, BON, BON domain	NA|575aa|up_6|NZ_CP011271.1_1082761_1084486_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|231aa|up_5|NZ_CP011271.1_1084607_1085300_-	pfam13517, VCBS, Repeat domain in Vibrio, Colwellia, Bradyrhizobium and Shewanella	NA|110aa|up_4|NZ_CP011271.1_1085309_1085639_-	pfam13517, VCBS, Repeat domain in Vibrio, Colwellia, Bradyrhizobium and Shewanella	NA|132aa|up_3|NZ_CP011271.1_1085920_1086316_+	NA	NA|130aa|up_2|NZ_CP011271.1_1086483_1086873_+	NA	NA|160aa|up_1|NZ_CP011271.1_1087223_1087703_+	NA	NA|207aa|up_0|NZ_CP011271.1_1087816_1088437_+	cd07379, MPP_239FB, Homo sapiens 239FB and related proteins, metallophosphatase domain	NA|85aa|down_0|NZ_CP011271.1_1089095_1089350_-	NA	NA|371aa|down_1|NZ_CP011271.1_1089377_1090490_-	cd03506, Delta6-FADS-like, The Delta6 Fatty Acid Desaturase (Delta6-FADS)-like CD includes the integral-membrane enzymes: delta-4, delta-5, delta-6, delta-8, delta-8-sphingolipid, and delta-11 desaturases found in vertebrates, higher plants, fungi, and bacteria	NA|87aa|down_2|NZ_CP011271.1_1091075_1091336_+	NA	NA|100aa|down_3|NZ_CP011271.1_1091480_1091780_-	pfam01844, HNH, HNH endonuclease	NA|344aa|down_4|NZ_CP011271.1_1091889_1092921_+	TIGR02800, Protein_TolB, tol-pal system beta propeller repeat protein TolB	NA|108aa|down_5|NZ_CP011271.1_1093433_1093757_+	pfam07238, PilZ, PilZ domain	NA|181aa|down_6|NZ_CP011271.1_1093878_1094421_-	NA	NA|565aa|down_7|NZ_CP011271.1_1094541_1096236_-	COG1538, TolC, Outer membrane protein [Cell envelope biogenesis, outer membrane / Intracellular trafficking and secretion]	NA|918aa|down_8|NZ_CP011271.1_1096410_1099164_+	pfam00924, MS_channel, Mechanosensitive ion channel	NA|1192aa|down_9|NZ_CP011271.1_1099181_1102757_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins
GCF_001610855.1_ASM161085v1	NZ_CP011271	Gemmata sp. SH-PL17 chromosome, complete genome	4	1334356-1334542	4,1	CRISPRCasFinder,PILER-CR	no		cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	Orphan	ACCCGCGGCCCTTGCGGGCCGGGTTC,CGCGGCCCTTGCGGGCCGGGTTC	26,23	0	0	NA	NA	NA:NA	3,2	3	Orphan	cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	NA|116aa|up_4|NZ_CP011271.1_1329365_1329713_+,NA|162aa|up_3|NZ_CP011271.1_1329910_1330396_+,NA|348aa|down_1|NZ_CP011271.1_1335444_1336488_+,NA|249aa|down_3|NZ_CP011271.1_1338514_1339261_+,NA|252aa|down_4|NZ_CP011271.1_1339435_1340191_+,NA|81aa|down_6|NZ_CP011271.1_1342156_1342399_-,NA|162aa|down_8|NZ_CP011271.1_1345735_1346221_-,NA|186aa|down_9|NZ_CP011271.1_1346259_1346817_-	NA|168aa|up_9|NZ_CP011271.1_1323326_1323830_+	pfam05991, NYN_YacP, YacP-like NYN domain	NA|521aa|up_8|NZ_CP011271.1_1323841_1325404_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|464aa|up_7|NZ_CP011271.1_1325475_1326867_-	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|346aa|up_6|NZ_CP011271.1_1326976_1328014_+	pfam12006, DUF3500, Protein of unknown function (DUF3500)	NA|349aa|up_5|NZ_CP011271.1_1328201_1329248_+	cd02696, MurNAc-LAA, N-acetylmuramoyl-L-alanine amidase or MurNAc-LAA (also known as peptidoglycan aminohydrolase, NAMLA amidase, NAMLAA, Amidase 3, and peptidoglycan amidase; EC 3	NA|116aa|up_4|NZ_CP011271.1_1329365_1329713_+	NA	NA|162aa|up_3|NZ_CP011271.1_1329910_1330396_+	NA	NA|513aa|up_2|NZ_CP011271.1_1330521_1332060_-	COG0606, COG0606, Predicted ATPase with chaperone activity [Posttranslational modification, protein turnover, chaperones]	NA|243aa|up_1|NZ_CP011271.1_1332362_1333091_-	pfam13719, zinc_ribbon_5, zinc-ribbon domain	NA|266aa|up_0|NZ_CP011271.1_1333462_1334260_+	cd06442, DPM1_like, DPM1_like represents putative enzymes similar to eukaryotic DPM1	NA|221aa|down_0|NZ_CP011271.1_1334718_1335381_+	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|348aa|down_1|NZ_CP011271.1_1335444_1336488_+	NA	NA|523aa|down_2|NZ_CP011271.1_1336749_1338318_+	COG2206, COG2206, c-di-GMP phosphodiesterase class II (HD-GYP domain) [Signal transduction mechanisms]	NA|249aa|down_3|NZ_CP011271.1_1338514_1339261_+	NA	NA|252aa|down_4|NZ_CP011271.1_1339435_1340191_+	NA	NA|613aa|down_5|NZ_CP011271.1_1340272_1342111_+	cd02969, PRX_like1, Peroxiredoxin (PRX)-like 1 family; hypothetical proteins that show sequence similarity to PRXs	NA|81aa|down_6|NZ_CP011271.1_1342156_1342399_-	NA	NA|304aa|down_7|NZ_CP011271.1_1342800_1343712_-	COG3568, ElsH, Metal-dependent hydrolase [General function prediction only]	NA|162aa|down_8|NZ_CP011271.1_1345735_1346221_-	NA	NA|186aa|down_9|NZ_CP011271.1_1346259_1346817_-	NA
GCF_001610855.1_ASM161085v1	NZ_CP011271	Gemmata sp. SH-PL17 chromosome, complete genome	5	1792507-1792620	5	CRISPRCasFinder	no		cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	Orphan	GTGGCTCGTCTCCGCAACCCGGGTTGAAA	29	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	NA|224aa|up_4|NZ_CP011271.1_1784705_1785377_-,NA|293aa|down_2|NZ_CP011271.1_1796018_1796897_-,NA|312aa|down_3|NZ_CP011271.1_1796916_1797852_-,NA|71aa|down_4|NZ_CP011271.1_1798406_1798619_+,NA|166aa|down_8|NZ_CP011271.1_1808076_1808574_-	NA|377aa|up_9|NZ_CP011271.1_1771969_1773100_+	cd01406, SIR2-like, Sir2-like: Prokaryotic group of uncharacterized Sir2-like proteins which lack certain key catalytic residues and conserved zinc binding cysteines; and are members of the SIR2 superfamily of proteins, silent information regulator 2 (Sir2) enzymes which catalyze NAD+-dependent protein/histone deacetylation	NA|318aa|up_8|NZ_CP011271.1_1773292_1774246_+	cd09084, EEP-2, Exonuclease-Endonuclease-Phosphatase (EEP) domain superfamily; uncharacterized family 2	NA|375aa|up_7|NZ_CP011271.1_1774590_1775715_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|444aa|up_6|NZ_CP011271.1_1775758_1777090_-	COG1004, Ugd, Predicted UDP-glucose 6-dehydrogenase [Cell envelope biogenesis, outer membrane]	NA|2317aa|up_5|NZ_CP011271.1_1777431_1784382_-	NF012181, MSCRAMM_SdrD, MSCRAMM family adhesin SdrD	NA|224aa|up_4|NZ_CP011271.1_1784705_1785377_-	NA	NA|705aa|up_3|NZ_CP011271.1_1786256_1788371_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|499aa|up_2|NZ_CP011271.1_1788415_1789912_+	COG1807, ArnT, 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family [Cell envelope biogenesis, outer membrane]	NA|448aa|up_1|NZ_CP011271.1_1790037_1791381_+	cd04179, DPM_DPG-synthase_like, DPM_DPG-synthase_like is a member of the Glycosyltransferase 2 superfamily	NA|472aa|up_0|NZ_CP011271.1_1790947_1792363_+	TIGR00284, Uncharacterized_protein_MJ0107, dihydropteroate synthase-related protein	NA|386aa|down_0|NZ_CP011271.1_1792999_1794157_+	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|130aa|down_1|NZ_CP011271.1_1795556_1795946_+	COG2378, COG2378, Predicted transcriptional regulator [Transcription]	NA|293aa|down_2|NZ_CP011271.1_1796018_1796897_-	NA	NA|312aa|down_3|NZ_CP011271.1_1796916_1797852_-	NA	NA|71aa|down_4|NZ_CP011271.1_1798406_1798619_+	NA	NA|849aa|down_5|NZ_CP011271.1_1799132_1801679_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|638aa|down_6|NZ_CP011271.1_1801899_1803813_-	PRK13557, PRK13557, histidine kinase; Provisional	NA|1313aa|down_7|NZ_CP011271.1_1803841_1807780_-	PRK13560, PRK13560, hypothetical protein; Provisional	NA|166aa|down_8|NZ_CP011271.1_1808076_1808574_-	NA	NA|910aa|down_9|NZ_CP011271.1_1809121_1811851_+	pfam03200, Glyco_hydro_63, Glycosyl hydrolase family 63 C-terminal domain
GCF_001610855.1_ASM161085v1	NZ_CP011271	Gemmata sp. SH-PL17 chromosome, complete genome	6	2560494-2560584	6	CRISPRCasFinder	no	csa3	cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	Type I-A	CCCGCAAGGGCGGCGGGTGACGCGC	25	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	NA|196aa|up_7|NZ_CP011271.1_2549191_2549779_-,NA|106aa|up_4|NZ_CP011271.1_2553173_2553491_-,NA|76aa|up_3|NZ_CP011271.1_2553672_2553900_-,NA|103aa|up_2|NZ_CP011271.1_2554014_2554323_-,NA|185aa|up_1|NZ_CP011271.1_2554587_2555142_-,NA	NA|535aa|up_9|NZ_CP011271.1_2546437_2548042_-	PRK12344, PRK12344, putative alpha-isopropylmalate/homocitrate synthase family transferase; Provisional	NA|188aa|up_8|NZ_CP011271.1_2548381_2548945_-	pfam05685, Uma2, Putative restriction endonuclease	NA|196aa|up_7|NZ_CP011271.1_2549191_2549779_-	NA	NA|540aa|up_6|NZ_CP011271.1_2549850_2551470_-	COG0029, NadB, Aspartate oxidase [Coenzyme metabolism]	NA|328aa|up_5|NZ_CP011271.1_2551953_2552937_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|106aa|up_4|NZ_CP011271.1_2553173_2553491_-	NA	NA|76aa|up_3|NZ_CP011271.1_2553672_2553900_-	NA	NA|103aa|up_2|NZ_CP011271.1_2554014_2554323_-	NA	NA|185aa|up_1|NZ_CP011271.1_2554587_2555142_-	NA	NA|1590aa|up_0|NZ_CP011271.1_2555505_2560275_-	PTZ00121, PTZ00121, MAEBL; Provisional	NA|302aa|down_0|NZ_CP011271.1_2560915_2561821_+	COG0705, COG0705, Membrane associated serine protease [Amino acid transport and metabolism]	NA|76aa|down_1|NZ_CP011271.1_2561913_2562141_+	pfam09723, Zn-ribbon_8, Zinc ribbon domain	csa3|104aa|down_2|NZ_CP011271.1_2562316_2562628_+	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|309aa|down_3|NZ_CP011271.1_2562673_2563600_-	COG1597, LCB5, Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase [Lipid metabolism / General function prediction only]	NA|145aa|down_4|NZ_CP011271.1_2563624_2564059_+	PRK03624, PRK03624, putative acetyltransferase; Provisional	NA|289aa|down_5|NZ_CP011271.1_2564100_2564967_+	cd02573, PseudoU_synth_EcTruB, Pseudouridine synthase, Escherichia coli TruB like	NA|161aa|down_6|NZ_CP011271.1_2565103_2565586_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|253aa|down_7|NZ_CP011271.1_2565873_2566632_-	cd04723, HisA_HisF, Phosphoribosylformimino-5-aminoimidazole carboxamide ribonucleotide (ProFAR) isomerase (HisA) and the cyclase subunit of imidazoleglycerol phosphate synthase (HisF)	NA|300aa|down_8|NZ_CP011271.1_2566845_2567745_+	PRK05299, rpsB, 30S ribosomal protein S2; Provisional	NA|286aa|down_9|NZ_CP011271.1_2567880_2568738_+	PRK09377, tsf, elongation factor Ts; Provisional
GCF_001610855.1_ASM161085v1	NZ_CP011271	Gemmata sp. SH-PL17 chromosome, complete genome	7	2703388-2703482	7	CRISPRCasFinder	no		cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	Orphan	ACCGAACCGGCGGAACAACCGGC	23	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	NA|70aa|up_5|NZ_CP011271.1_2699137_2699347_+,NA|103aa|down_3|NZ_CP011271.1_2707152_2707461_-,NA|208aa|down_4|NZ_CP011271.1_2707543_2708167_-	NA|127aa|up_9|NZ_CP011271.1_2695495_2695876_+	TIGR02541, Peptidoglycan_hydrolase_FlgJ, flagellar rod assembly protein/muramidase FlgJ	NA|162aa|up_8|NZ_CP011271.1_2695820_2696306_+	pfam05130, FlgN, FlgN protein	NA|564aa|up_7|NZ_CP011271.1_2696331_2698023_+	COG1256, FlgK, Flagellar hook-associated protein [Cell motility and secretion]	NA|314aa|up_6|NZ_CP011271.1_2698034_2698976_+	TIGR02550, Flagellar_hook-associated_protein_3, flagellar hook-associated protein 3	NA|70aa|up_5|NZ_CP011271.1_2699137_2699347_+	NA	NA|287aa|up_4|NZ_CP011271.1_2699401_2700262_+	TIGR01843, Hemolysin_secretion_protein_D_plasmid, type I secretion membrane fusion protein, HlyD family	NA|273aa|up_3|NZ_CP011271.1_2700258_2701077_+	pfam11684, DUF3280, Protein of unknown function (DUF2380)	NA|167aa|up_2|NZ_CP011271.1_2701252_2701753_+	pfam03748, FliL, Flagellar basal body-associated protein FliL	NA|284aa|up_1|NZ_CP011271.1_2701749_2702601_+	pfam01052, FliMN_C, Type III flagellar switch regulator (C-ring) FliN C-term	NA|121aa|up_0|NZ_CP011271.1_2702626_2702989_+	pfam01052, FliMN_C, Type III flagellar switch regulator (C-ring) FliN C-term	NA|419aa|down_0|NZ_CP011271.1_2703690_2704947_-	PRK05682, flgE, flagellar hook protein FlgE; Validated	NA|119aa|down_1|NZ_CP011271.1_2705032_2705389_-	PRK06655, flgD, flagellar hook assembly protein FlgD	NA|569aa|down_2|NZ_CP011271.1_2705430_2707137_-	cd17470, T3SS_Flik_C, C-terminal domain of flagellar hook-length control protein FliK and similar domains	NA|103aa|down_3|NZ_CP011271.1_2707152_2707461_-	NA	NA|208aa|down_4|NZ_CP011271.1_2707543_2708167_-	NA	NA|189aa|down_5|NZ_CP011271.1_2708255_2708822_-	PRK09041, motB, motility protein MotB	NA|282aa|down_6|NZ_CP011271.1_2708872_2709718_-	TIGR03818, MotA1, flagellar motor stator protein MotA	NA|274aa|down_7|NZ_CP011271.1_2709933_2710755_+	PRK13800, PRK13800, fumarate reductase/succinate dehydrogenase flavoprotein subunit	NA|462aa|down_8|NZ_CP011271.1_2710846_2712232_-	pfam07593, UnbV_ASPIC, ASPIC and UnbV	NA|1229aa|down_9|NZ_CP011271.1_2713004_2716691_+	pfam00930, DPPIV_N, Dipeptidyl peptidase IV (DPP IV) N-terminal region
GCF_001610855.1_ASM161085v1	NZ_CP011271	Gemmata sp. SH-PL17 chromosome, complete genome	8	2951571-2951669	8	CRISPRCasFinder	no		cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	Orphan	TCTGGCGCACCCCGCGCACCTTCTTCTT	28	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	NA|174aa|up_9|NZ_CP011271.1_2929013_2929535_-,NA|373aa|up_7|NZ_CP011271.1_2935615_2936734_-,NA|427aa|up_5|NZ_CP011271.1_2938717_2939998_+,NA|1826aa|up_4|NZ_CP011271.1_2940165_2945643_+,NA|70aa|down_4|NZ_CP011271.1_2958468_2958678_+,NA|133aa|down_7|NZ_CP011271.1_2960872_2961271_+,NA|312aa|down_8|NZ_CP011271.1_2961685_2962621_+,NA|105aa|down_9|NZ_CP011271.1_2962750_2963065_+	NA|174aa|up_9|NZ_CP011271.1_2929013_2929535_-	NA	NA|347aa|up_8|NZ_CP011271.1_2934578_2935619_-	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|373aa|up_7|NZ_CP011271.1_2935615_2936734_-	NA	NA|392aa|up_6|NZ_CP011271.1_2937506_2938682_+	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|427aa|up_5|NZ_CP011271.1_2938717_2939998_+	NA	NA|1826aa|up_4|NZ_CP011271.1_2940165_2945643_+	NA	NA|352aa|up_3|NZ_CP011271.1_2945786_2946842_+	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|275aa|up_2|NZ_CP011271.1_2946881_2947706_+	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|164aa|up_1|NZ_CP011271.1_2947780_2948272_+	pfam13592, HTH_33, Winged helix-turn helix	NA|181aa|up_0|NZ_CP011271.1_2948288_2948831_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|146aa|down_0|NZ_CP011271.1_2952543_2952981_-	COG1846, MarR, Transcriptional regulators [Transcription]	NA|1055aa|down_1|NZ_CP011271.1_2953024_2956189_-	COG0841, AcrB, Cation/multidrug efflux pump [Defense mechanisms]	NA|372aa|down_2|NZ_CP011271.1_2956185_2957301_-	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|145aa|down_3|NZ_CP011271.1_2957957_2958392_+	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|70aa|down_4|NZ_CP011271.1_2958468_2958678_+	NA	NA|164aa|down_5|NZ_CP011271.1_2958723_2959215_+	pfam11149, DUF2924, Protein of unknown function (DUF2924)	NA|541aa|down_6|NZ_CP011271.1_2959211_2960834_+	smart00857, Resolvase, Resolvase, N terminal domain	NA|133aa|down_7|NZ_CP011271.1_2960872_2961271_+	NA	NA|312aa|down_8|NZ_CP011271.1_2961685_2962621_+	NA	NA|105aa|down_9|NZ_CP011271.1_2962750_2963065_+	NA
GCF_001610855.1_ASM161085v1	NZ_CP011271	Gemmata sp. SH-PL17 chromosome, complete genome	9	3686478-3687133	9	CRISPRCasFinder	no		cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	Orphan	ACGCTGACGGTCACGAACAGCACCGT	26	0	0	NA	NA	NA	8	8	Orphan	cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	NA|498aa|up_4|NZ_CP011271.1_3679440_3680934_-,NA|50aa|up_3|NZ_CP011271.1_3681042_3681192_+,NA|397aa|up_0|NZ_CP011271.1_3684415_3685606_+,NA|224aa|down_0|NZ_CP011271.1_3688399_3689071_+,NA|502aa|down_3|NZ_CP011271.1_3692651_3694157_-,NA|471aa|down_9|NZ_CP011271.1_3700321_3701734_+	NA|292aa|up_9|NZ_CP011271.1_3672742_3673618_-	COG0190, FolD, 5,10-methylene-tetrahydrofolate dehydrogenase/Methenyl tetrahydrofolate cyclohydrolase [Coenzyme metabolism]	NA|253aa|up_8|NZ_CP011271.1_3673625_3674384_-	PRK00481, PRK00481, NAD-dependent deacetylase; Provisional	NA|268aa|up_7|NZ_CP011271.1_3674533_3675337_-	pfam00459, Inositol_P, Inositol monophosphatase family	NA|742aa|up_6|NZ_CP011271.1_3675586_3677812_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|431aa|up_5|NZ_CP011271.1_3677977_3679270_-	COG1277, NosY, ABC-type transport system involved in multi-copper enzyme maturation, permease component [General function prediction only]	NA|498aa|up_4|NZ_CP011271.1_3679440_3680934_-	NA	NA|50aa|up_3|NZ_CP011271.1_3681042_3681192_+	NA	NA|338aa|up_2|NZ_CP011271.1_3681384_3682398_+	PRK12323, PRK12323, DNA polymerase III subunit gamma/tau	NA|443aa|up_1|NZ_CP011271.1_3682675_3684004_+	PRK09284, PRK09284, thiamine biosynthesis protein ThiC; Provisional	NA|397aa|up_0|NZ_CP011271.1_3684415_3685606_+	NA	NA|224aa|down_0|NZ_CP011271.1_3688399_3689071_+	NA	NA|646aa|down_1|NZ_CP011271.1_3689332_3691270_-	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|373aa|down_2|NZ_CP011271.1_3691378_3692497_-	pfam02517, Abi, CAAX protease self-immunity	NA|502aa|down_3|NZ_CP011271.1_3692651_3694157_-	NA	NA|337aa|down_4|NZ_CP011271.1_3694156_3695167_-	cd04187, DPM1_like_bac, Bacterial DPM1_like enzymes are related to eukaryotic DPM1	NA|341aa|down_5|NZ_CP011271.1_3695296_3696319_-	pfam03706, LPG_synthase_TM, Lysylphosphatidylglycerol synthase TM region	NA|518aa|down_6|NZ_CP011271.1_3696422_3697976_+	PRK06676, rpsA, 30S ribosomal protein S1; Reviewed	NA|323aa|down_7|NZ_CP011271.1_3698179_3699148_+	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|379aa|down_8|NZ_CP011271.1_3699144_3700281_+	COG1721, COG1721, Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) [General function prediction only]	NA|471aa|down_9|NZ_CP011271.1_3700321_3701734_+	NA
GCF_001610855.1_ASM161085v1	NZ_CP011271	Gemmata sp. SH-PL17 chromosome, complete genome	10	3903615-3912171	2,10,1,3	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no	cas1,cas2,cas3,cas8u2,cas7,cas5u	cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	Unclear	GTCTCTCCCCAGATACATCTGGGGCCGAATTGAAGC,GTCTCTCCCCAGATACATCTGGGGCCGAATTGAAGC,GTCTCTCCCCAGATACATCTGGGGCCGAATTGAAGC,GTCTCTCCCCAGATACATCTGGGGCCGAATTGAAGC	36,36,36,36	5	6	3908372-3908408|3908735-3908770|3908371-3908407|3908734-3908769|3910131-3910166|3910131-3910166	NZ_CP011271.1_2197005-2196969|NZ_CP011271.1_2162335-2162300|NZ_CP011271.1_2197005-2196969|NZ_CP011271.1_2162335-2162300|NZ_CP011271.1_2168360-2168325|NZ_CP011271.1_3275897-3275862	NA:NA:NA:NA	115,117,117,115	117	Unclear	cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	NA|73aa|up_7|NZ_CP011271.1_3892793_3893012_+,NA|455aa|down_1|NZ_CP011271.1_3912452_3913817_-	NA|120aa|up_9|NZ_CP011271.1_3892063_3892423_-	pfam18480, DUF5615, Domain of unknown function (DUF5615)	NA|79aa|up_8|NZ_CP011271.1_3892419_3892656_-	pfam04255, DUF433, Protein of unknown function (DUF433)	NA|73aa|up_7|NZ_CP011271.1_3892793_3893012_+	NA	cas1|570aa|up_6|NZ_CP011271.1_3893042_3894752_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas2|95aa|up_5|NZ_CP011271.1_3895124_3895409_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas3|753aa|up_4|NZ_CP011271.1_3895421_3897680_+	cd09696, Cas3_I, CRISPR/Cas system-associated protein Cas3; Distinct Cas3 family with HD domain fused to C-termus of Helicase domain	NA|429aa|up_3|NZ_CP011271.1_3897685_3898972_-	pfam13546, DDE_5, DDE superfamily endonuclease	cas8u2|196aa|up_2|NZ_CP011271.1_3899891_3900479_+	TIGR04106, hypothetical_protein_GobsU_11505, CRISPR-associated protein GSU0052/csb3, Dpsyc system	cas7|366aa|up_1|NZ_CP011271.1_3900471_3901569_+	cd09678, Csb1_I-U, CRISPR/Cas system-associated protein Csb1	cas5u|552aa|up_0|NZ_CP011271.1_3901581_3903237_+	cd09667, Csb2_I-U, CRISPR/Cas system-associated protein Csb2	NA|74aa|down_0|NZ_CP011271.1_3912209_3912431_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|455aa|down_1|NZ_CP011271.1_3912452_3913817_-	NA	NA|122aa|down_2|NZ_CP011271.1_3913846_3914212_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|414aa|down_3|NZ_CP011271.1_3914869_3916111_+	pfam00589, Phage_integrase, Phage integrase family	NA|169aa|down_4|NZ_CP011271.1_3916381_3916888_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|162aa|down_5|NZ_CP011271.1_3916932_3917418_-	COG5433, COG5433, Transposase [DNA replication, recombination, and repair]	NA|225aa|down_6|NZ_CP011271.1_3917414_3918089_-	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	NA|185aa|down_7|NZ_CP011271.1_3918127_3918682_-	pfam13592, HTH_33, Winged helix-turn helix	NA|446aa|down_8|NZ_CP011271.1_3918818_3920156_+	pfam13546, DDE_5, DDE superfamily endonuclease	NA|263aa|down_9|NZ_CP011271.1_3920480_3921269_-	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease
GCF_001610855.1_ASM161085v1	NZ_CP011271	Gemmata sp. SH-PL17 chromosome, complete genome	11	4213049-4213159	11	CRISPRCasFinder	no		cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	Orphan	GCCAAAAACATATGCCCTATACCCCCATCCTTGCGAC	37	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	NA|92aa|up_0|NZ_CP011271.1_4212658_4212934_+,NA|83aa|down_1|NZ_CP011271.1_4214829_4215078_-,NA|639aa|down_2|NZ_CP011271.1_4215386_4217303_+,NA|153aa|down_5|NZ_CP011271.1_4219992_4220451_-,NA|151aa|down_7|NZ_CP011271.1_4222586_4223039_-,NA|49aa|down_9|NZ_CP011271.1_4225668_4225815_+	NA|269aa|up_9|NZ_CP011271.1_4197737_4198544_+	COG1116, TauB, ABC-type nitrate/sulfonate/bicarbonate transport system, ATPase component [Inorganic ion transport and metabolism]	NA|378aa|up_8|NZ_CP011271.1_4198559_4199693_-	COG4658, RnfD, Predicted NADH:ubiquinone oxidoreductase, subunit RnfD [Energy production and conversion]	NA|593aa|up_7|NZ_CP011271.1_4199689_4201468_-	pfam07593, UnbV_ASPIC, ASPIC and UnbV	NA|1168aa|up_6|NZ_CP011271.1_4201509_4205013_-	sd00006, TPR, Tetratricopeptide repeat	NA|502aa|up_5|NZ_CP011271.1_4205037_4206543_-	COG0644, FixC, Dehydrogenases (flavoproteins) [Energy production and conversion]	NA|309aa|up_4|NZ_CP011271.1_4206558_4207485_-	COG1244, COG1244, Predicted Fe-S oxidoreductase [General function prediction only]	NA|448aa|up_3|NZ_CP011271.1_4207456_4208800_-	cd01991, Asn_Synthase_B_C, The C-terminal domain of Asparagine Synthase B	NA|712aa|up_2|NZ_CP011271.1_4209206_4211342_+	COG2268, COG2268, Uncharacterized protein conserved in bacteria [Function unknown]	NA|300aa|up_1|NZ_CP011271.1_4211657_4212557_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|92aa|up_0|NZ_CP011271.1_4212658_4212934_+	NA	NA|337aa|down_0|NZ_CP011271.1_4213801_4214812_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|83aa|down_1|NZ_CP011271.1_4214829_4215078_-	NA	NA|639aa|down_2|NZ_CP011271.1_4215386_4217303_+	NA	NA|550aa|down_3|NZ_CP011271.1_4217304_4218954_+	COG1277, NosY, ABC-type transport system involved in multi-copper enzyme maturation, permease component [General function prediction only]	NA|322aa|down_4|NZ_CP011271.1_4218981_4219947_-	cd01935, Ntn_CGH_like, Choloylglycine hydrolase (CGH)_like	NA|153aa|down_5|NZ_CP011271.1_4219992_4220451_-	NA	NA|676aa|down_6|NZ_CP011271.1_4220408_4222436_-	pfam13578, Methyltransf_24, Methyltransferase domain	NA|151aa|down_7|NZ_CP011271.1_4222586_4223039_-	NA	NA|678aa|down_8|NZ_CP011271.1_4223147_4225181_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|49aa|down_9|NZ_CP011271.1_4225668_4225815_+	NA
GCF_001610855.1_ASM161085v1	NZ_CP011271	Gemmata sp. SH-PL17 chromosome, complete genome	12	4786708-4786813	12	CRISPRCasFinder	no		cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	Orphan	ACCTCCATCGAGATGCCACGCGGCTATTGAGCAACC	36	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	NA|443aa|up_9|NZ_CP011271.1_4779119_4780448_-,NA|93aa|up_2|NZ_CP011271.1_4784927_4785206_+,NA|51aa|up_1|NZ_CP011271.1_4786003_4786156_+,NA|286aa|down_4|NZ_CP011271.1_4798566_4799424_+,NA|160aa|down_7|NZ_CP011271.1_4802594_4803074_+	NA|443aa|up_9|NZ_CP011271.1_4779119_4780448_-	NA	NA|169aa|up_8|NZ_CP011271.1_4780649_4781156_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|162aa|up_7|NZ_CP011271.1_4781200_4781686_-	COG5433, COG5433, Transposase [DNA replication, recombination, and repair]	NA|225aa|up_6|NZ_CP011271.1_4781682_4782357_-	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	NA|185aa|up_5|NZ_CP011271.1_4782395_4782950_-	pfam13592, HTH_33, Winged helix-turn helix	NA|235aa|up_4|NZ_CP011271.1_4783212_4783917_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|160aa|up_3|NZ_CP011271.1_4783670_4784150_-	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	NA|93aa|up_2|NZ_CP011271.1_4784927_4785206_+	NA	NA|51aa|up_1|NZ_CP011271.1_4786003_4786156_+	NA	NA|87aa|up_0|NZ_CP011271.1_4786213_4786474_-	TIGR02890, conserved_hypothetical_protein, regulatory protein, yteA family	NA|623aa|down_0|NZ_CP011271.1_4786836_4788705_-	PRK12305, thrS, threonyl-tRNA synthetase; Reviewed	NA|118aa|down_1|NZ_CP011271.1_4788887_4789241_+	cd06981, cupin_reut_a1446, Cupriavidus pinatubonensis reut_a1446 and related proteins, cupin domain	NA|1666aa|down_2|NZ_CP011271.1_4789400_4794398_+	TIGR02226, hypothetical_protein-transmembrane_prediction, N-terminal double-transmembrane domain	NA|1349aa|down_3|NZ_CP011271.1_4794479_4798526_+	PTZ00121, PTZ00121, MAEBL; Provisional	NA|286aa|down_4|NZ_CP011271.1_4798566_4799424_+	NA	NA|383aa|down_5|NZ_CP011271.1_4799504_4800653_+	cd00688, ISOPREN_C2_like, This group contains class II terpene cyclases, protein prenyltransferases beta subunit, two broadly specific proteinase inhibitors alpha2-macroglobulin (alpha (2)-M) and pregnancy zone protein (PZP) and, the C3 C4 and C5 components of vertebrate complement	NA|559aa|down_6|NZ_CP011271.1_4800842_4802519_+	cd07560, Peptidase_S41_CPP, C-terminal processing peptidase; serine protease family S41	NA|160aa|down_7|NZ_CP011271.1_4802594_4803074_+	NA	NA|292aa|down_8|NZ_CP011271.1_4803168_4804044_+	cd01572, QPRTase, Quinolinate phosphoribosyl transferase (QAPRTase or QPRTase), also called nicotinate-nucleotide pyrophosphorylase, is involved in the de novo synthesis of NAD in both prokaryotes and eukaryotes	NA|266aa|down_9|NZ_CP011271.1_4804040_4804838_+	cd16442, BPL, biotin protein ligase
GCF_001610855.1_ASM161085v1	NZ_CP011271	Gemmata sp. SH-PL17 chromosome, complete genome	13	5194645-5194745	13	CRISPRCasFinder	no		cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	Orphan	ACATCCACCGCCCTTGCGGGCGGGG	25	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	NA|266aa|up_3|NZ_CP011271.1_5190812_5191610_+,NA|320aa|down_4|NZ_CP011271.1_5200258_5201218_+,NA|470aa|down_5|NZ_CP011271.1_5201228_5202638_+,NA|245aa|down_8|NZ_CP011271.1_5208015_5208750_+,NA|394aa|down_9|NZ_CP011271.1_5208760_5209942_+	NA|183aa|up_9|NZ_CP011271.1_5182162_5182711_+	pfam14024, DUF4240, Protein of unknown function (DUF4240)	NA|176aa|up_8|NZ_CP011271.1_5182745_5183273_+	pfam14024, DUF4240, Protein of unknown function (DUF4240)	NA|254aa|up_7|NZ_CP011271.1_5183313_5184075_+	pfam08241, Methyltransf_11, Methyltransferase domain	NA|863aa|up_6|NZ_CP011271.1_5184085_5186674_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|840aa|up_5|NZ_CP011271.1_5187062_5189582_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|360aa|up_4|NZ_CP011271.1_5189584_5190664_-	PRK00927, PRK00927, tryptophanyl-tRNA synthetase; Reviewed	NA|266aa|up_3|NZ_CP011271.1_5190812_5191610_+	NA	NA|244aa|up_2|NZ_CP011271.1_5191790_5192522_-	PRK00173, rph, ribonuclease PH; Reviewed	NA|212aa|up_1|NZ_CP011271.1_5192811_5193447_-	COG3124, COG3124, Uncharacterized protein conserved in bacteria [Function unknown]	NA|303aa|up_0|NZ_CP011271.1_5193560_5194469_-	pfam06283, ThuA, Trehalose utilisation	NA|433aa|down_0|NZ_CP011271.1_5194802_5196101_-	COG2208, RsbU, Serine phosphatase RsbU, regulator of sigma subunit [Signal transduction mechanisms / Transcription]	NA|475aa|down_1|NZ_CP011271.1_5196714_5198139_+	pfam00815, Histidinol_dh, Histidinol dehydrogenase	NA|355aa|down_2|NZ_CP011271.1_5198297_5199362_+	PRK05387, PRK05387, histidinol-phosphate aminotransferase; Provisional	NA|196aa|down_3|NZ_CP011271.1_5199409_5199997_+	PRK00951, hisB, imidazoleglycerol-phosphate dehydratase HisB	NA|320aa|down_4|NZ_CP011271.1_5200258_5201218_+	NA	NA|470aa|down_5|NZ_CP011271.1_5201228_5202638_+	NA	NA|518aa|down_6|NZ_CP011271.1_5203030_5204584_+	PRK12323, PRK12323, DNA polymerase III subunit gamma/tau	NA|650aa|down_7|NZ_CP011271.1_5205221_5207171_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|245aa|down_8|NZ_CP011271.1_5208015_5208750_+	NA	NA|394aa|down_9|NZ_CP011271.1_5208760_5209942_+	NA
GCF_001610855.1_ASM161085v1	NZ_CP011271	Gemmata sp. SH-PL17 chromosome, complete genome	14	5529625-5529729	14	CRISPRCasFinder	no		cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	Orphan	CCTCGCGTCCGAGACCACGATGAAGCGACTCAC	33	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	NA|76aa|up_7|NZ_CP011271.1_5520238_5520466_-,NA|81aa|up_6|NZ_CP011271.1_5520462_5520705_-,NA|386aa|up_5|NZ_CP011271.1_5521144_5522302_-,NA|141aa|up_2|NZ_CP011271.1_5527237_5527660_+,NA|117aa|up_1|NZ_CP011271.1_5527786_5528137_+,NA|153aa|down_0|NZ_CP011271.1_5530464_5530923_+,NA|287aa|down_3|NZ_CP011271.1_5533780_5534641_+,NA|169aa|down_4|NZ_CP011271.1_5534712_5535219_+,NA|421aa|down_5|NZ_CP011271.1_5535294_5536557_+,NA|72aa|down_6|NZ_CP011271.1_5536656_5536872_+,NA|121aa|down_7|NZ_CP011271.1_5536868_5537231_+,NA|99aa|down_8|NZ_CP011271.1_5537234_5537531_+,NA|227aa|down_9|NZ_CP011271.1_5537530_5538211_+	NA|42aa|up_9|NZ_CP011271.1_5516521_5516647_-	COG3829, RocR, Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains [Transcription / Signal transduction mechanisms]	NA|876aa|up_8|NZ_CP011271.1_5517558_5520186_-	cd02077, P-type_ATPase_Mg, magnesium transporting ATPase (MgtA), similar to Escherichia coli MgtA and Salmonella typhimurium MgtA	NA|76aa|up_7|NZ_CP011271.1_5520238_5520466_-	NA	NA|81aa|up_6|NZ_CP011271.1_5520462_5520705_-	NA	NA|386aa|up_5|NZ_CP011271.1_5521144_5522302_-	NA	NA|876aa|up_4|NZ_CP011271.1_5522664_5525292_-	PRK00629, pheT, phenylalanyl-tRNA synthetase subunit beta; Reviewed	NA|398aa|up_3|NZ_CP011271.1_5525687_5526881_+	TIGR02224, Tyrosine_recombinase_XerC, tyrosine recombinase XerC	NA|141aa|up_2|NZ_CP011271.1_5527237_5527660_+	NA	NA|117aa|up_1|NZ_CP011271.1_5527786_5528137_+	NA	NA|422aa|up_0|NZ_CP011271.1_5528133_5529399_+	pfam13481, AAA_25, AAA domain	NA|153aa|down_0|NZ_CP011271.1_5530464_5530923_+	NA	NA|435aa|down_1|NZ_CP011271.1_5530894_5532199_+	COG5410, COG5410, Uncharacterized protein conserved in bacteria [Function unknown]	NA|503aa|down_2|NZ_CP011271.1_5532272_5533781_+	pfam05136, Phage_portal_2, Phage portal protein, lambda family	NA|287aa|down_3|NZ_CP011271.1_5533780_5534641_+	NA	NA|169aa|down_4|NZ_CP011271.1_5534712_5535219_+	NA	NA|421aa|down_5|NZ_CP011271.1_5535294_5536557_+	NA	NA|72aa|down_6|NZ_CP011271.1_5536656_5536872_+	NA	NA|121aa|down_7|NZ_CP011271.1_5536868_5537231_+	NA	NA|99aa|down_8|NZ_CP011271.1_5537234_5537531_+	NA	NA|227aa|down_9|NZ_CP011271.1_5537530_5538211_+	NA
GCF_001610855.1_ASM161085v1	NZ_CP011271	Gemmata sp. SH-PL17 chromosome, complete genome	15	5649880-5649990	15	CRISPRCasFinder	no		cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	Orphan	GGAAGCCAACGAGACGCGCGCCGACGCCGG	30	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	NA|131aa|up_8|NZ_CP011271.1_5640567_5640960_-,NA|70aa|up_7|NZ_CP011271.1_5641672_5641882_+,NA|97aa|up_6|NZ_CP011271.1_5642032_5642323_+,NA|270aa|up_5|NZ_CP011271.1_5642749_5643559_-,NA|78aa|up_4|NZ_CP011271.1_5644184_5644418_+,NA|81aa|up_3|NZ_CP011271.1_5644877_5645120_+,NA|84aa|up_2|NZ_CP011271.1_5645552_5645804_+,NA|262aa|down_0|NZ_CP011271.1_5651590_5652376_-,NA|101aa|down_1|NZ_CP011271.1_5652497_5652800_-,NA|186aa|down_3|NZ_CP011271.1_5653579_5654137_-,NA|55aa|down_4|NZ_CP011271.1_5654447_5654612_-,NA|152aa|down_6|NZ_CP011271.1_5655890_5656346_+,NA|78aa|down_8|NZ_CP011271.1_5657750_5657984_+,NA|217aa|down_9|NZ_CP011271.1_5658023_5658674_-	NA|158aa|up_9|NZ_CP011271.1_5639763_5640237_-	smart00871, AraC_E_bind, Bacterial transcription activator, effector binding domain	NA|131aa|up_8|NZ_CP011271.1_5640567_5640960_-	NA	NA|70aa|up_7|NZ_CP011271.1_5641672_5641882_+	NA	NA|97aa|up_6|NZ_CP011271.1_5642032_5642323_+	NA	NA|270aa|up_5|NZ_CP011271.1_5642749_5643559_-	NA	NA|78aa|up_4|NZ_CP011271.1_5644184_5644418_+	NA	NA|81aa|up_3|NZ_CP011271.1_5644877_5645120_+	NA	NA|84aa|up_2|NZ_CP011271.1_5645552_5645804_+	NA	NA|188aa|up_1|NZ_CP011271.1_5646261_5646825_-	TIGR02996, rpt_mate_G_obs, repeat-companion domain TIGR02996	NA|356aa|up_0|NZ_CP011271.1_5647069_5648137_+	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|262aa|down_0|NZ_CP011271.1_5651590_5652376_-	NA	NA|101aa|down_1|NZ_CP011271.1_5652497_5652800_-	NA	NA|147aa|down_2|NZ_CP011271.1_5653006_5653447_-	sd00034, LRR_AMN1, leucine-rich repeats, antagonist of mitotic exit network protein 1-like subfamily	NA|186aa|down_3|NZ_CP011271.1_5653579_5654137_-	NA	NA|55aa|down_4|NZ_CP011271.1_5654447_5654612_-	NA	NA|87aa|down_5|NZ_CP011271.1_5655271_5655532_-	TIGR03066, Gem_osc_para_1, Gemmata obscuriglobus paralogous family TIGR03066	NA|152aa|down_6|NZ_CP011271.1_5655890_5656346_+	NA	NA|429aa|down_7|NZ_CP011271.1_5656355_5657642_-	pfam13546, DDE_5, DDE superfamily endonuclease	NA|78aa|down_8|NZ_CP011271.1_5657750_5657984_+	NA	NA|217aa|down_9|NZ_CP011271.1_5658023_5658674_-	NA
GCF_001610855.1_ASM161085v1	NZ_CP011271	Gemmata sp. SH-PL17 chromosome, complete genome	16	5693618-5693734	16	CRISPRCasFinder	no		cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	Orphan	CCCCGGGCTATTCCCGGTGACCCCTTCGGGGTCC	34	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	NA|97aa|up_9|NZ_CP011271.1_5683400_5683691_-,NA|270aa|up_8|NZ_CP011271.1_5684252_5685062_+,NA|90aa|up_6|NZ_CP011271.1_5686096_5686366_-,NA|87aa|up_2|NZ_CP011271.1_5690105_5690366_-,NA|373aa|down_1|NZ_CP011271.1_5695018_5696137_+,NA|204aa|down_3|NZ_CP011271.1_5697283_5697895_+,NA|525aa|down_6|NZ_CP011271.1_5699916_5701491_+,NA|257aa|down_7|NZ_CP011271.1_5701692_5702463_-,NA|79aa|down_9|NZ_CP011271.1_5706610_5706847_-	NA|97aa|up_9|NZ_CP011271.1_5683400_5683691_-	NA	NA|270aa|up_8|NZ_CP011271.1_5684252_5685062_+	NA	NA|184aa|up_7|NZ_CP011271.1_5685476_5686028_-	pfam09346, SMI1_KNR4, SMI1 / KNR4 family (SUKH-1)	NA|90aa|up_6|NZ_CP011271.1_5686096_5686366_-	NA	NA|224aa|up_5|NZ_CP011271.1_5687396_5688068_-	cd00884, beta_CA_cladeB, Carbonic anhydrases (CA) are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism in which the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide is followed by the regeneration of an active site by ionization of the zinc-bound water molecule and removal of a proton from the active site	NA|183aa|up_4|NZ_CP011271.1_5688266_5688815_-	pfam13173, AAA_14, AAA domain	NA|184aa|up_3|NZ_CP011271.1_5689485_5690037_-	pfam09346, SMI1_KNR4, SMI1 / KNR4 family (SUKH-1)	NA|87aa|up_2|NZ_CP011271.1_5690105_5690366_-	NA	NA|459aa|up_1|NZ_CP011271.1_5691012_5692389_-	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|381aa|up_0|NZ_CP011271.1_5692437_5693580_-	TIGR02996, rpt_mate_G_obs, repeat-companion domain TIGR02996	NA|224aa|down_0|NZ_CP011271.1_5693917_5694589_+	cd17320, MFS_MdfA_MDR_like, Multidrug transporter MdfA and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|373aa|down_1|NZ_CP011271.1_5695018_5696137_+	NA	NA|319aa|down_2|NZ_CP011271.1_5696133_5697090_+	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|204aa|down_3|NZ_CP011271.1_5697283_5697895_+	NA	NA|112aa|down_4|NZ_CP011271.1_5698297_5698633_+	cd00552, RaiA, RaiA ("ribosome-associated inhibitor A", also known as Protein Y (PY), YfiA, and SpotY,  is a stress-response protein that binds the ribosomal subunit interface and arrests translation by interfering with aminoacyl-tRNA binding to the ribosomal A site	NA|112aa|down_5|NZ_CP011271.1_5698812_5699148_-	PRK13874, PRK13874, conjugal transfer protein TrbJ; Provisional	NA|525aa|down_6|NZ_CP011271.1_5699916_5701491_+	NA	NA|257aa|down_7|NZ_CP011271.1_5701692_5702463_-	NA	NA|927aa|down_8|NZ_CP011271.1_5703396_5706177_-	COG3378, COG3378, Phage associated DNA primase [General function prediction only]	NA|79aa|down_9|NZ_CP011271.1_5706610_5706847_-	NA
GCF_001610855.1_ASM161085v1	NZ_CP011271	Gemmata sp. SH-PL17 chromosome, complete genome	17	5706172-5706256	17	CRISPRCasFinder	no		cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	Orphan	GCGCATCGATGCCCGAGGTGGAGG	24	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	NA|373aa|up_6|NZ_CP011271.1_5695018_5696137_+,NA|204aa|up_4|NZ_CP011271.1_5697283_5697895_+,NA|525aa|up_1|NZ_CP011271.1_5699916_5701491_+,NA|257aa|up_0|NZ_CP011271.1_5701692_5702463_-,NA|79aa|down_0|NZ_CP011271.1_5706610_5706847_-,NA|53aa|down_1|NZ_CP011271.1_5707027_5707186_-,NA|83aa|down_2|NZ_CP011271.1_5707248_5707497_-,NA|505aa|down_9|NZ_CP011271.1_5714143_5715658_+	NA|459aa|up_9|NZ_CP011271.1_5691012_5692389_-	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|381aa|up_8|NZ_CP011271.1_5692437_5693580_-	TIGR02996, rpt_mate_G_obs, repeat-companion domain TIGR02996	NA|224aa|up_7|NZ_CP011271.1_5693917_5694589_+	cd17320, MFS_MdfA_MDR_like, Multidrug transporter MdfA and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|373aa|up_6|NZ_CP011271.1_5695018_5696137_+	NA	NA|319aa|up_5|NZ_CP011271.1_5696133_5697090_+	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|204aa|up_4|NZ_CP011271.1_5697283_5697895_+	NA	NA|112aa|up_3|NZ_CP011271.1_5698297_5698633_+	cd00552, RaiA, RaiA ("ribosome-associated inhibitor A", also known as Protein Y (PY), YfiA, and SpotY,  is a stress-response protein that binds the ribosomal subunit interface and arrests translation by interfering with aminoacyl-tRNA binding to the ribosomal A site	NA|112aa|up_2|NZ_CP011271.1_5698812_5699148_-	PRK13874, PRK13874, conjugal transfer protein TrbJ; Provisional	NA|525aa|up_1|NZ_CP011271.1_5699916_5701491_+	NA	NA|257aa|up_0|NZ_CP011271.1_5701692_5702463_-	NA	NA|79aa|down_0|NZ_CP011271.1_5706610_5706847_-	NA	NA|53aa|down_1|NZ_CP011271.1_5707027_5707186_-	NA	NA|83aa|down_2|NZ_CP011271.1_5707248_5707497_-	NA	NA|204aa|down_3|NZ_CP011271.1_5707688_5708300_+	PRK00215, PRK00215, transcriptional repressor LexA	NA|141aa|down_4|NZ_CP011271.1_5708479_5708902_+	COG4775, COG4775, Outer membrane protein/protective antigen OMA87 [Cell envelope biogenesis, outer membrane]	NA|238aa|down_5|NZ_CP011271.1_5708963_5709677_+	COG2258, COG2258, Uncharacterized protein conserved in bacteria [Function unknown]	NA|648aa|down_6|NZ_CP011271.1_5710108_5712052_+	PRK13557, PRK13557, histidine kinase; Provisional	NA|149aa|down_7|NZ_CP011271.1_5712140_5712587_-	TIGR03067, Planc_TIGR03067, Planctomycetes uncharacterized domain TIGR03067	NA|144aa|down_8|NZ_CP011271.1_5713728_5714160_+	pfam03592, Terminase_2, Terminase small subunit	NA|505aa|down_9|NZ_CP011271.1_5714143_5715658_+	NA
GCF_001610855.1_ASM161085v1	NZ_CP011271	Gemmata sp. SH-PL17 chromosome, complete genome	18	5835070-5835171	18	CRISPRCasFinder	no		cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	Orphan	ACCGCGGCGTGCGCGACGCGGCGG	24	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	NA|178aa|up_0|NZ_CP011271.1_5834135_5834669_-,NA|169aa|down_1|NZ_CP011271.1_5838655_5839162_-,NA|312aa|down_6|NZ_CP011271.1_5845283_5846219_-,NA|383aa|down_8|NZ_CP011271.1_5847737_5848886_+	NA|67aa|up_9|NZ_CP011271.1_5821745_5821946_+	pfam04405, ScdA_N, Domain of Unknown function (DUF542)	NA|110aa|up_8|NZ_CP011271.1_5821975_5822305_+	cd02230, cupin_HP0902-like, Helicobacter pylori HP0902 and related proteins, cupin domain	NA|213aa|up_7|NZ_CP011271.1_5822652_5823291_-	pfam13385, Laminin_G_3, Concanavalin A-like lectin/glucanases superfamily	NA|919aa|up_6|NZ_CP011271.1_5823772_5826529_-	cd06240, M14-like, Peptidase M14-like domain; uncharacterized subgroup	NA|937aa|up_5|NZ_CP011271.1_5826682_5829493_-	COG1506, DAP2, Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism]	NA|349aa|up_4|NZ_CP011271.1_5829779_5830826_+	COG4977, COG4977, Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain [Transcription]	NA|234aa|up_3|NZ_CP011271.1_5830940_5831642_+	cd03139, GATase1_PfpI_2, Type 1 glutamine amidotransferase (GATase1)-like domain found in a subgroup of proteins similar to PfpI from Pyrococcus furiosus	NA|326aa|up_2|NZ_CP011271.1_5832282_5833260_+	TIGR02996, rpt_mate_G_obs, repeat-companion domain TIGR02996	NA|192aa|up_1|NZ_CP011271.1_5833405_5833981_-	smart00530, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|178aa|up_0|NZ_CP011271.1_5834135_5834669_-	NA	NA|581aa|down_0|NZ_CP011271.1_5836916_5838659_-	sd00044, HEAT, HEAT repeats	NA|169aa|down_1|NZ_CP011271.1_5838655_5839162_-	NA	NA|998aa|down_2|NZ_CP011271.1_5839272_5842266_+	TIGR02604, Piru_Ver_Nterm, putative membrane-bound dehydrogenase domain	NA|180aa|down_3|NZ_CP011271.1_5842271_5842811_-	pfam11026, DUF2721, Protein of unknown function (DUF2721)	NA|480aa|down_4|NZ_CP011271.1_5842917_5844357_+	cd05673, M20_Acy1L2_AbgB, M20 Peptidase Aminoacylase 1-like protein 2 aminobenzoyl-glutamate utilization protein B subfamily	NA|301aa|down_5|NZ_CP011271.1_5844362_5845265_-	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|312aa|down_6|NZ_CP011271.1_5845283_5846219_-	NA	NA|139aa|down_7|NZ_CP011271.1_5846793_5847210_-	cd17580, REC_2_DhkD-like, second phosphoacceptor receiver (REC) domain of Dictyostelium discoideum hybrid signal transduction histidine kinase D and similar domains	NA|383aa|down_8|NZ_CP011271.1_5847737_5848886_+	NA	NA|468aa|down_9|NZ_CP011271.1_5849131_5850535_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins
GCF_001610855.1_ASM161085v1	NZ_CP011271	Gemmata sp. SH-PL17 chromosome, complete genome	19	6489590-6489664	19	CRISPRCasFinder	no		cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	Orphan	CCCCCCTCCCTGAAGGGAAGGGGG	24	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	NA|195aa|up_8|NZ_CP011271.1_6477423_6478008_+,NA|135aa|up_3|NZ_CP011271.1_6485136_6485541_-,NA|78aa|down_2|NZ_CP011271.1_6491430_6491664_+	NA|101aa|up_9|NZ_CP011271.1_6476979_6477282_+	pfam13532, 2OG-FeII_Oxy_2, 2OG-Fe(II) oxygenase superfamily	NA|195aa|up_8|NZ_CP011271.1_6477423_6478008_+	NA	NA|239aa|up_7|NZ_CP011271.1_6478172_6478889_-	PRK01305, PRK01305, arginyl-tRNA-protein transferase; Provisional	NA|746aa|up_6|NZ_CP011271.1_6479146_6481384_+	COG0483, SuhB, Archaeal fructose-1,6-bisphosphatase and related enzymes of inositol monophosphatase family [Carbohydrate transport and metabolism]	NA|371aa|up_5|NZ_CP011271.1_6481509_6482622_+	pfam05076, SUFU, Suppressor of fused protein (SUFU)	NA|414aa|up_4|NZ_CP011271.1_6483415_6484657_-	cd07208, Pat_hypo_Ecoli_yjju_like, Hypothetical patatin similar to yjju protein of Escherichia coli	NA|135aa|up_3|NZ_CP011271.1_6485136_6485541_-	NA	NA|397aa|up_2|NZ_CP011271.1_6485687_6486878_+	cd15482, Sialidase_non-viral, Non-viral sialidases	NA|342aa|up_1|NZ_CP011271.1_6487061_6488087_-	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|385aa|up_0|NZ_CP011271.1_6488091_6489246_-	PRK00770, PRK00770, deoxyhypusine synthase	NA|318aa|down_0|NZ_CP011271.1_6490174_6491128_+	pfam08668, HDOD, HDOD domain	NA|84aa|down_1|NZ_CP011271.1_6491203_6491455_+	PRK11749, PRK11749, dihydropyrimidine dehydrogenase subunit A; Provisional	NA|78aa|down_2|NZ_CP011271.1_6491430_6491664_+	NA	NA|129aa|down_3|NZ_CP011271.1_6491830_6492217_+	pfam12762, DDE_Tnp_IS1595, ISXO2-like transposase domain	NA|388aa|down_4|NZ_CP011271.1_6492400_6493564_-	pfam06782, UPF0236, Uncharacterized protein family (UPF0236)	NA|389aa|down_5|NZ_CP011271.1_6493988_6495155_+	COG3147, DedD, Uncharacterized protein conserved in bacteria [Function unknown]	NA|306aa|down_6|NZ_CP011271.1_6495164_6496082_+	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|110aa|down_7|NZ_CP011271.1_6496090_6496420_+	PRK10089, PRK10089, chaperone CsaA	NA|340aa|down_8|NZ_CP011271.1_6496439_6497459_-	TIGR01128, DNA_polymerase_III_subunit_delta, DNA polymerase III, delta subunit	NA|219aa|down_9|NZ_CP011271.1_6497516_6498173_-	COG0062, COG0062, Uncharacterized conserved protein [Function unknown]
GCF_001610855.1_ASM161085v1	NZ_CP011271	Gemmata sp. SH-PL17 chromosome, complete genome	20	6621299-6621387	20	CRISPRCasFinder	no		cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	Orphan	CTACGAACGCCGGCCCCTCCGGGGCG	26	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	NA|196aa|up_5|NZ_CP011271.1_6608612_6609200_+,NA|147aa|up_4|NZ_CP011271.1_6609203_6609644_+,NA|309aa|down_2|NZ_CP011271.1_6623936_6624863_-,NA|97aa|down_3|NZ_CP011271.1_6624997_6625288_-,NA|234aa|down_8|NZ_CP011271.1_6630641_6631343_+	NA|289aa|up_9|NZ_CP011271.1_6605011_6605878_-	cd06164, S2P-M50_SpoIVFB_CBS, SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation	NA|157aa|up_8|NZ_CP011271.1_6606156_6606627_-	pfam02724, CDC45, CDC45-like protein	NA|352aa|up_7|NZ_CP011271.1_6606732_6607788_-	cd12822, TmCorA-like, Thermotoga maritima CorA-like family	NA|198aa|up_6|NZ_CP011271.1_6608022_6608616_+	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|196aa|up_5|NZ_CP011271.1_6608612_6609200_+	NA	NA|147aa|up_4|NZ_CP011271.1_6609203_6609644_+	NA	NA|1253aa|up_3|NZ_CP011271.1_6609800_6613559_-	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|178aa|up_2|NZ_CP011271.1_6613573_6614107_-	COG1778, COG1778, Low specificity phosphatase (HAD superfamily) [General function prediction only]	NA|349aa|up_1|NZ_CP011271.1_6614111_6615158_-	PRK10892, PRK10892, arabinose-5-phosphate isomerase KdsD	NA|1674aa|up_0|NZ_CP011271.1_6616236_6621258_+	PRK15319, PRK15319, fibronectin-binding autotransporter adhesin ShdA	NA|436aa|down_0|NZ_CP011271.1_6621569_6622877_-	pfam13360, PQQ_2, PQQ-like domain	NA|198aa|down_1|NZ_CP011271.1_6623191_6623785_-	pfam13488, Gly-zipper_Omp, Glycine zipper	NA|309aa|down_2|NZ_CP011271.1_6623936_6624863_-	NA	NA|97aa|down_3|NZ_CP011271.1_6624997_6625288_-	NA	NA|285aa|down_4|NZ_CP011271.1_6625620_6626475_+	COG1234, ElaC, Metal-dependent hydrolases of the beta-lactamase superfamily III [General function prediction only]	NA|379aa|down_5|NZ_CP011271.1_6626512_6627649_+	TIGR01185, membrane_spanning_subunit, DevC protein	NA|335aa|down_6|NZ_CP011271.1_6627768_6628773_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|370aa|down_7|NZ_CP011271.1_6628846_6629956_-	COG4948, COG4948, L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily [Cell envelope biogenesis, outer membrane / General function prediction only]	NA|234aa|down_8|NZ_CP011271.1_6630641_6631343_+	NA	NA|486aa|down_9|NZ_CP011271.1_6631468_6632926_-	pfam07394, DUF1501, Protein of unknown function (DUF1501)
GCF_001610855.1_ASM161085v1	NZ_CP011271	Gemmata sp. SH-PL17 chromosome, complete genome	21	6667878-6667988	21	CRISPRCasFinder	no		cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	Orphan	CGGGACCAAGATCCACGGCCGCGTGAATGCGGCTCG	36	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	NA|81aa|up_9|NZ_CP011271.1_6657818_6658061_+,NA|69aa|up_1|NZ_CP011271.1_6666054_6666261_-,NA|79aa|down_0|NZ_CP011271.1_6668482_6668719_+,NA|204aa|down_3|NZ_CP011271.1_6674404_6675016_-	NA|81aa|up_9|NZ_CP011271.1_6657818_6658061_+	NA	NA|233aa|up_8|NZ_CP011271.1_6658081_6658780_-	pfam05685, Uma2, Putative restriction endonuclease	NA|425aa|up_7|NZ_CP011271.1_6658838_6660113_-	PRK00045, hemA, glutamyl-tRNA reductase; Reviewed	NA|279aa|up_6|NZ_CP011271.1_6660109_6660946_-	pfam01578, Cytochrom_C_asm, Cytochrome C assembly protein	NA|404aa|up_5|NZ_CP011271.1_6661180_6662392_-	COG0523, COG0523, Putative GTPases (G3E family) [General function prediction only]	NA|406aa|up_4|NZ_CP011271.1_6662503_6663721_-	COG4102, COG4102, Uncharacterized protein conserved in bacteria [Function unknown]	NA|523aa|up_3|NZ_CP011271.1_6663732_6665301_-	pfam08811, DUF1800, Protein of unknown function (DUF1800)	NA|223aa|up_2|NZ_CP011271.1_6665334_6666003_-	pfam10670, DUF4198, Domain of unknown function (DUF4198)	NA|69aa|up_1|NZ_CP011271.1_6666054_6666261_-	NA	NA|327aa|up_0|NZ_CP011271.1_6666273_6667254_-	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|79aa|down_0|NZ_CP011271.1_6668482_6668719_+	NA	NA|1211aa|down_1|NZ_CP011271.1_6668827_6672460_-	TIGR02604, Piru_Ver_Nterm, putative membrane-bound dehydrogenase domain	NA|511aa|down_2|NZ_CP011271.1_6672682_6674215_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|204aa|down_3|NZ_CP011271.1_6674404_6675016_-	NA	NA|144aa|down_4|NZ_CP011271.1_6675311_6675743_+	PRK09216, rplM, 50S ribosomal protein L13; Reviewed	NA|211aa|down_5|NZ_CP011271.1_6675826_6676459_+	pfam00380, Ribosomal_S9, Ribosomal protein S9/S16	NA|413aa|down_6|NZ_CP011271.1_6676616_6677855_-	pfam08305, NPCBM, NPCBM/NEW2 domain	NA|326aa|down_7|NZ_CP011271.1_6678469_6679447_-	pfam13557, Phenol_MetA_deg, Putative MetA-pathway of phenol degradation	NA|659aa|down_8|NZ_CP011271.1_6679683_6681660_+	PRK15041, PRK15041, methyl-accepting chemotaxis protein	NA|771aa|down_9|NZ_CP011271.1_6681942_6684255_+	PRK15048, PRK15048, methyl-accepting chemotaxis protein II; Provisional
GCF_001610855.1_ASM161085v1	NZ_CP011271	Gemmata sp. SH-PL17 chromosome, complete genome	22	7042173-7042273	22	CRISPRCasFinder	no		cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	Orphan	AACAGGTGAGCCGGCCCGGAGTTCACAC	28	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	NA|253aa|up_5|NZ_CP011271.1_7035313_7036072_-,NA|135aa|down_2|NZ_CP011271.1_7047089_7047494_+,NA|178aa|down_8|NZ_CP011271.1_7054904_7055438_-,NA|58aa|down_9|NZ_CP011271.1_7055440_7055614_-	NA|172aa|up_9|NZ_CP011271.1_7031063_7031579_+	TIGR02996, rpt_mate_G_obs, repeat-companion domain TIGR02996	NA|429aa|up_8|NZ_CP011271.1_7031588_7032875_-	pfam13546, DDE_5, DDE superfamily endonuclease	NA|328aa|up_7|NZ_CP011271.1_7033114_7034098_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|306aa|up_6|NZ_CP011271.1_7034105_7035023_-	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|253aa|up_5|NZ_CP011271.1_7035313_7036072_-	NA	NA|347aa|up_4|NZ_CP011271.1_7036364_7037405_+	PRK13800, PRK13800, fumarate reductase/succinate dehydrogenase flavoprotein subunit	NA|71aa|up_3|NZ_CP011271.1_7037878_7038091_+	pfam02599, CsrA, Global regulator protein family	NA|219aa|up_2|NZ_CP011271.1_7038274_7038931_-	pfam06439, DUF1080, Domain of Unknown Function (DUF1080)	NA|472aa|up_1|NZ_CP011271.1_7039278_7040694_+	cd17371, MFS_MucK, Cis,cis-muconate transport protein and similar proteins of the Major Facilitator Superfamily	NA|411aa|up_0|NZ_CP011271.1_7040934_7042167_+	pfam12576, DUF3754, Protein of unknown function (DUF3754)	NA|379aa|down_0|NZ_CP011271.1_7042535_7043672_+	PRK05764, PRK05764, aspartate aminotransferase; Provisional	NA|871aa|down_1|NZ_CP011271.1_7044176_7046789_+	COG0793, Prc, Periplasmic protease [Cell envelope biogenesis, outer membrane]	NA|135aa|down_2|NZ_CP011271.1_7047089_7047494_+	NA	NA|209aa|down_3|NZ_CP011271.1_7047574_7048201_-	pfam13899, Thioredoxin_7, Thioredoxin-like	NA|421aa|down_4|NZ_CP011271.1_7048652_7049915_+	TIGR03300, assembly_YfgL, outer membrane assembly lipoprotein YfgL	NA|890aa|down_5|NZ_CP011271.1_7050114_7052784_+	PRK13557, PRK13557, histidine kinase; Provisional	NA|330aa|down_6|NZ_CP011271.1_7052931_7053921_+	pfam13546, DDE_5, DDE superfamily endonuclease	NA|273aa|down_7|NZ_CP011271.1_7053931_7054750_-	COG0568, RpoD, DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) [Transcription]	NA|178aa|down_8|NZ_CP011271.1_7054904_7055438_-	NA	NA|58aa|down_9|NZ_CP011271.1_7055440_7055614_-	NA
GCF_001610855.1_ASM161085v1	NZ_CP011271	Gemmata sp. SH-PL17 chromosome, complete genome	23	7164131-7164249	23	CRISPRCasFinder	no		cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	Orphan	GGGGATTCAGTGCTGGTACTGATGCCACCCT	31	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	NA|298aa|up_6|NZ_CP011271.1_7156272_7157166_+,NA|102aa|up_0|NZ_CP011271.1_7163384_7163690_+,NA|54aa|down_2|NZ_CP011271.1_7167491_7167653_+,NA|807aa|down_4|NZ_CP011271.1_7169061_7171482_-,NA|374aa|down_6|NZ_CP011271.1_7172900_7174022_-,NA|116aa|down_7|NZ_CP011271.1_7174025_7174373_-,NA|175aa|down_9|NZ_CP011271.1_7176919_7177444_-	NA|356aa|up_9|NZ_CP011271.1_7152773_7153841_+	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|311aa|up_8|NZ_CP011271.1_7154078_7155011_-	cd19163, AKR_galDH, L-galactose dehydrogenase (L-galDH) and similar proteins	NA|327aa|up_7|NZ_CP011271.1_7155140_7156121_-	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|298aa|up_6|NZ_CP011271.1_7156272_7157166_+	NA	NA|236aa|up_5|NZ_CP011271.1_7157221_7157929_-	cd06561, AlkD_like, A new structural DNA glycosylase	NA|365aa|up_4|NZ_CP011271.1_7158386_7159481_+	COG1104, NifS, Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes [Amino acid transport and metabolism]	NA|260aa|up_3|NZ_CP011271.1_7159487_7160267_-	cd04221, MauL, Methylamine utilization protein MauL	NA|541aa|up_2|NZ_CP011271.1_7160300_7161923_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|332aa|up_1|NZ_CP011271.1_7162138_7163134_-	cd07010, cupin_PMI_type_I_N_bac, Phosphomannose isomerase in bacteria and archaea, N-terminal cupin domain	NA|102aa|up_0|NZ_CP011271.1_7163384_7163690_+	NA	NA|405aa|down_0|NZ_CP011271.1_7165426_7166641_-	COG0402, SsnA, Cytosine deaminase and related metal-dependent hydrolases [Nucleotide transport and metabolism / General function prediction only]	NA|95aa|down_1|NZ_CP011271.1_7167106_7167391_-	cd10456, GIY-YIG_UPF0213, The GIY-YIG domain of uncharacterized protein family UPF0213 related to structure-specific endonuclease SLX1	NA|54aa|down_2|NZ_CP011271.1_7167491_7167653_+	NA	NA|387aa|down_3|NZ_CP011271.1_7167745_7168906_-	pfam05762, VWA_CoxE, VWA domain containing CoxE-like protein	NA|807aa|down_4|NZ_CP011271.1_7169061_7171482_-	NA	NA|393aa|down_5|NZ_CP011271.1_7171644_7172823_-	pfam07728, AAA_5, AAA domain (dynein-related subfamily)	NA|374aa|down_6|NZ_CP011271.1_7172900_7174022_-	NA	NA|116aa|down_7|NZ_CP011271.1_7174025_7174373_-	NA	NA|801aa|down_8|NZ_CP011271.1_7174504_7176907_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|175aa|down_9|NZ_CP011271.1_7176919_7177444_-	NA
GCF_001610855.1_ASM161085v1	NZ_CP011271	Gemmata sp. SH-PL17 chromosome, complete genome	24	7198507-7198686	4,24	PILER-CR,CRISPRCasFinder	no		cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	Orphan	GTATCTCCGTGGGGCAACTCACGGCCGAATTGAAGC,GTATCTCCGTGGGGCAACTCACGGCCGAATTGAAGC	36,36	0	0	NA	NA	NA:NA	2,2	2	Orphan	cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	NA|393aa|up_9|NZ_CP011271.1_7191616_7192795_-,NA|417aa|up_8|NZ_CP011271.1_7192794_7194045_-,NA|173aa|up_7|NZ_CP011271.1_7194235_7194754_-,NA|133aa|up_6|NZ_CP011271.1_7194795_7195194_-,NA|145aa|up_5|NZ_CP011271.1_7195190_7195625_-,NA|113aa|up_4|NZ_CP011271.1_7195621_7195960_-,NA|339aa|up_3|NZ_CP011271.1_7196071_7197088_+,NA|94aa|up_2|NZ_CP011271.1_7197084_7197366_-,NA|59aa|up_0|NZ_CP011271.1_7198155_7198332_+,NA|70aa|down_1|NZ_CP011271.1_7199561_7199771_-,NA|101aa|down_2|NZ_CP011271.1_7199829_7200132_-,NA|400aa|down_4|NZ_CP011271.1_7202047_7203247_-,NA|162aa|down_5|NZ_CP011271.1_7203315_7203801_-,NA|330aa|down_6|NZ_CP011271.1_7203857_7204847_-,NA|106aa|down_8|NZ_CP011271.1_7211744_7212062_-,NA|137aa|down_9|NZ_CP011271.1_7212130_7212541_-	NA|393aa|up_9|NZ_CP011271.1_7191616_7192795_-	NA	NA|417aa|up_8|NZ_CP011271.1_7192794_7194045_-	NA	NA|173aa|up_7|NZ_CP011271.1_7194235_7194754_-	NA	NA|133aa|up_6|NZ_CP011271.1_7194795_7195194_-	NA	NA|145aa|up_5|NZ_CP011271.1_7195190_7195625_-	NA	NA|113aa|up_4|NZ_CP011271.1_7195621_7195960_-	NA	NA|339aa|up_3|NZ_CP011271.1_7196071_7197088_+	NA	NA|94aa|up_2|NZ_CP011271.1_7197084_7197366_-	NA	NA|122aa|up_1|NZ_CP011271.1_7197553_7197919_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|59aa|up_0|NZ_CP011271.1_7198155_7198332_+	NA	NA|87aa|down_0|NZ_CP011271.1_7199230_7199491_-	COG0365, Acs, Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases [Lipid metabolism]	NA|70aa|down_1|NZ_CP011271.1_7199561_7199771_-	NA	NA|101aa|down_2|NZ_CP011271.1_7199829_7200132_-	NA	NA|409aa|down_3|NZ_CP011271.1_7200765_7201992_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|400aa|down_4|NZ_CP011271.1_7202047_7203247_-	NA	NA|162aa|down_5|NZ_CP011271.1_7203315_7203801_-	NA	NA|330aa|down_6|NZ_CP011271.1_7203857_7204847_-	NA	NA|1883aa|down_7|NZ_CP011271.1_7204972_7210621_-	cd03673, Ap6A_hydrolase, Diadenosine hexaphosphate (Ap6A) hydrolase is a member of the Nudix hydrolase superfamily	NA|106aa|down_8|NZ_CP011271.1_7211744_7212062_-	NA	NA|137aa|down_9|NZ_CP011271.1_7212130_7212541_-	NA
GCF_001610855.1_ASM161085v1	NZ_CP011271	Gemmata sp. SH-PL17 chromosome, complete genome	25	7198794-7198904	25	CRISPRCasFinder	no		cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	Orphan	GTATCTCCGTGGGGCAACTCACGGCCGAATTGAAGC	36	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	NA|393aa|up_9|NZ_CP011271.1_7191616_7192795_-,NA|417aa|up_8|NZ_CP011271.1_7192794_7194045_-,NA|173aa|up_7|NZ_CP011271.1_7194235_7194754_-,NA|133aa|up_6|NZ_CP011271.1_7194795_7195194_-,NA|145aa|up_5|NZ_CP011271.1_7195190_7195625_-,NA|113aa|up_4|NZ_CP011271.1_7195621_7195960_-,NA|339aa|up_3|NZ_CP011271.1_7196071_7197088_+,NA|94aa|up_2|NZ_CP011271.1_7197084_7197366_-,NA|59aa|up_0|NZ_CP011271.1_7198155_7198332_+,NA|70aa|down_1|NZ_CP011271.1_7199561_7199771_-,NA|101aa|down_2|NZ_CP011271.1_7199829_7200132_-,NA|400aa|down_4|NZ_CP011271.1_7202047_7203247_-,NA|162aa|down_5|NZ_CP011271.1_7203315_7203801_-,NA|330aa|down_6|NZ_CP011271.1_7203857_7204847_-,NA|106aa|down_8|NZ_CP011271.1_7211744_7212062_-,NA|137aa|down_9|NZ_CP011271.1_7212130_7212541_-	NA|393aa|up_9|NZ_CP011271.1_7191616_7192795_-	NA	NA|417aa|up_8|NZ_CP011271.1_7192794_7194045_-	NA	NA|173aa|up_7|NZ_CP011271.1_7194235_7194754_-	NA	NA|133aa|up_6|NZ_CP011271.1_7194795_7195194_-	NA	NA|145aa|up_5|NZ_CP011271.1_7195190_7195625_-	NA	NA|113aa|up_4|NZ_CP011271.1_7195621_7195960_-	NA	NA|339aa|up_3|NZ_CP011271.1_7196071_7197088_+	NA	NA|94aa|up_2|NZ_CP011271.1_7197084_7197366_-	NA	NA|122aa|up_1|NZ_CP011271.1_7197553_7197919_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|59aa|up_0|NZ_CP011271.1_7198155_7198332_+	NA	NA|87aa|down_0|NZ_CP011271.1_7199230_7199491_-	COG0365, Acs, Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases [Lipid metabolism]	NA|70aa|down_1|NZ_CP011271.1_7199561_7199771_-	NA	NA|101aa|down_2|NZ_CP011271.1_7199829_7200132_-	NA	NA|409aa|down_3|NZ_CP011271.1_7200765_7201992_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|400aa|down_4|NZ_CP011271.1_7202047_7203247_-	NA	NA|162aa|down_5|NZ_CP011271.1_7203315_7203801_-	NA	NA|330aa|down_6|NZ_CP011271.1_7203857_7204847_-	NA	NA|1883aa|down_7|NZ_CP011271.1_7204972_7210621_-	cd03673, Ap6A_hydrolase, Diadenosine hexaphosphate (Ap6A) hydrolase is a member of the Nudix hydrolase superfamily	NA|106aa|down_8|NZ_CP011271.1_7211744_7212062_-	NA	NA|137aa|down_9|NZ_CP011271.1_7212130_7212541_-	NA
GCF_001610855.1_ASM161085v1	NZ_CP011271	Gemmata sp. SH-PL17 chromosome, complete genome	26	8927100-8927232	26	CRISPRCasFinder	no		cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	Orphan	ACCATGCGGCAGGTGGTGTAGCACACCTGCCGAACG	36	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	NA|66aa|up_7|NZ_CP011271.1_8919266_8919464_+,NA|89aa|up_4|NZ_CP011271.1_8921139_8921406_-,NA|256aa|up_1|NZ_CP011271.1_8923832_8924600_-,NA|285aa|down_0|NZ_CP011271.1_8929251_8930106_+,NA|81aa|down_1|NZ_CP011271.1_8930658_8930901_+,NA|149aa|down_7|NZ_CP011271.1_8939866_8940313_+	NA|230aa|up_9|NZ_CP011271.1_8917717_8918407_+	PRK14951, PRK14951, DNA polymerase III subunits gamma and tau; Provisional	NA|240aa|up_8|NZ_CP011271.1_8918547_8919267_+	PRK14950, PRK14950, DNA polymerase III subunits gamma and tau; Provisional	NA|66aa|up_7|NZ_CP011271.1_8919266_8919464_+	NA	NA|330aa|up_6|NZ_CP011271.1_8919559_8920549_-	PRK00050, PRK00050, 16S rRNA (cytosine(1402)-N(4))-methyltransferase RsmH	NA|190aa|up_5|NZ_CP011271.1_8920579_8921149_-	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|89aa|up_4|NZ_CP011271.1_8921139_8921406_-	NA	NA|497aa|up_3|NZ_CP011271.1_8921489_8922980_+	pfam00144, Beta-lactamase, Beta-lactamase	NA|243aa|up_2|NZ_CP011271.1_8923069_8923798_-	cd03266, ABC_NatA_sodium_exporter, ATP-binding cassette domain of the Na+ transporter	NA|256aa|up_1|NZ_CP011271.1_8923832_8924600_-	NA	NA|280aa|up_0|NZ_CP011271.1_8925832_8926672_-	cd13634, PBP2_Sco4506, The conserved hypothetical protein SCO4506 exhibits the type 2 periplasmic-binidng protein fold	NA|285aa|down_0|NZ_CP011271.1_8929251_8930106_+	NA	NA|81aa|down_1|NZ_CP011271.1_8930658_8930901_+	NA	NA|187aa|down_2|NZ_CP011271.1_8931028_8931589_+	pfam14595, Thioredoxin_9, Thioredoxin	NA|636aa|down_3|NZ_CP011271.1_8931723_8933631_+	PRK10811, rne, ribonuclease E; Reviewed	NA|202aa|down_4|NZ_CP011271.1_8933537_8934143_-	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|592aa|down_5|NZ_CP011271.1_8934251_8936027_+	pfam13360, PQQ_2, PQQ-like domain	NA|706aa|down_6|NZ_CP011271.1_8936713_8938831_-	cd01920, cyclophilin_EcCYP_like, cyclophilin_EcCYP_like: cyclophilin-type A-like peptidylprolyl cis- trans isomerase (PPIase) domain similar to the cytosolic E	NA|149aa|down_7|NZ_CP011271.1_8939866_8940313_+	NA	NA|321aa|down_8|NZ_CP011271.1_8940402_8941365_+	cd05242, SDR_a8, atypical (a) SDRs, subgroup 8	NA|137aa|down_9|NZ_CP011271.1_8941423_8941834_-	cd00156, REC, phosphoacceptor receiver (REC) domain of response regulators (RRs) and pseudo response regulators (PRRs)
GCF_001610855.1_ASM161085v1	NZ_CP011271	Gemmata sp. SH-PL17 chromosome, complete genome	27	8927388-8927519	27	CRISPRCasFinder	no		cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	Orphan	ACCATGCGGCAGGTGGTGTAGCACACCTGCCGAACG	36	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,PD-DExK,csf3gr5,csf2gr7,csf4gr11,csf1gr8,DinG,cas1,cas2,cas8u2,cas7,cas5u,cas8u1,csm6,csm4gr5,csm3gr7,cas10,DEDDh	NA|66aa|up_7|NZ_CP011271.1_8919266_8919464_+,NA|89aa|up_4|NZ_CP011271.1_8921139_8921406_-,NA|256aa|up_1|NZ_CP011271.1_8923832_8924600_-,NA|285aa|down_0|NZ_CP011271.1_8929251_8930106_+,NA|81aa|down_1|NZ_CP011271.1_8930658_8930901_+,NA|149aa|down_7|NZ_CP011271.1_8939866_8940313_+	NA|230aa|up_9|NZ_CP011271.1_8917717_8918407_+	PRK14951, PRK14951, DNA polymerase III subunits gamma and tau; Provisional	NA|240aa|up_8|NZ_CP011271.1_8918547_8919267_+	PRK14950, PRK14950, DNA polymerase III subunits gamma and tau; Provisional	NA|66aa|up_7|NZ_CP011271.1_8919266_8919464_+	NA	NA|330aa|up_6|NZ_CP011271.1_8919559_8920549_-	PRK00050, PRK00050, 16S rRNA (cytosine(1402)-N(4))-methyltransferase RsmH	NA|190aa|up_5|NZ_CP011271.1_8920579_8921149_-	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|89aa|up_4|NZ_CP011271.1_8921139_8921406_-	NA	NA|497aa|up_3|NZ_CP011271.1_8921489_8922980_+	pfam00144, Beta-lactamase, Beta-lactamase	NA|243aa|up_2|NZ_CP011271.1_8923069_8923798_-	cd03266, ABC_NatA_sodium_exporter, ATP-binding cassette domain of the Na+ transporter	NA|256aa|up_1|NZ_CP011271.1_8923832_8924600_-	NA	NA|280aa|up_0|NZ_CP011271.1_8925832_8926672_-	cd13634, PBP2_Sco4506, The conserved hypothetical protein SCO4506 exhibits the type 2 periplasmic-binidng protein fold	NA|285aa|down_0|NZ_CP011271.1_8929251_8930106_+	NA	NA|81aa|down_1|NZ_CP011271.1_8930658_8930901_+	NA	NA|187aa|down_2|NZ_CP011271.1_8931028_8931589_+	pfam14595, Thioredoxin_9, Thioredoxin	NA|636aa|down_3|NZ_CP011271.1_8931723_8933631_+	PRK10811, rne, ribonuclease E; Reviewed	NA|202aa|down_4|NZ_CP011271.1_8933537_8934143_-	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|592aa|down_5|NZ_CP011271.1_8934251_8936027_+	pfam13360, PQQ_2, PQQ-like domain	NA|706aa|down_6|NZ_CP011271.1_8936713_8938831_-	cd01920, cyclophilin_EcCYP_like, cyclophilin_EcCYP_like: cyclophilin-type A-like peptidylprolyl cis- trans isomerase (PPIase) domain similar to the cytosolic E	NA|149aa|down_7|NZ_CP011271.1_8939866_8940313_+	NA	NA|321aa|down_8|NZ_CP011271.1_8940402_8941365_+	cd05242, SDR_a8, atypical (a) SDRs, subgroup 8	NA|137aa|down_9|NZ_CP011271.1_8941423_8941834_-	cd00156, REC, phosphoacceptor receiver (REC) domain of response regulators (RRs) and pseudo response regulators (PRRs)
