assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000265425.1_ASM26542v1	NC_018014	Terriglobus roseus DSM 18391, complete sequence	1	120943-121019	1	CRISPRCasFinder	no		csa3,Cas9_archaeal,cas3,WYL,DinG	Orphan	TCTTCACTTTGATGACCTCAGGA	23	0	0	NA	NA	NA	1	1	Orphan	csa3,Cas9_archaeal,cas3,WYL,DinG	NA|258aa|up_3|NC_018014.1_116899_117673_-,NA|260aa|up_1|NC_018014.1_118206_118986_-,NA|273aa|down_5|NC_018014.1_126746_127565_+	NA|245aa|up_9|NC_018014.1_109491_110226_+	pfam06439, DUF1080, Domain of Unknown Function (DUF1080)	NA|300aa|up_8|NC_018014.1_110232_111132_-	pfam06719, AraC_N, AraC-type transcriptional regulator N-terminus	NA|278aa|up_7|NC_018014.1_111278_112112_+	PRK06180, PRK06180, short chain dehydrogenase; Provisional	NA|415aa|up_6|NC_018014.1_112177_113422_-	COG0025, NhaP, NhaP-type Na+/H+ and K+/H+ antiporters [Inorganic ion transport and metabolism]	NA|793aa|up_5|NC_018014.1_113626_116005_+	pfam14067, LssY_C, LssY C-terminus	NA|279aa|up_4|NC_018014.1_116059_116896_-	TIGR03435, Soli_TIGR03435, soil-associated protein, TIGR03435 family	NA|258aa|up_3|NC_018014.1_116899_117673_-	NA	NA|117aa|up_2|NC_018014.1_117669_118020_-	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|260aa|up_1|NC_018014.1_118206_118986_-	NA	NA|243aa|up_0|NC_018014.1_119111_119840_-	COG1556, COG1556, Uncharacterized conserved protein [Function unknown]	NA|265aa|down_0|NC_018014.1_121362_122157_-	COG0247, GlpC, Fe-S oxidoreductase [Energy production and conversion]	NA|256aa|down_1|NC_018014.1_122314_123082_+	cd05233, SDR_c, classical (c) SDRs	NA|284aa|down_2|NC_018014.1_123188_124040_+	PLN02220, PLN02220, delta-9 acyl-lipid desaturase	NA|541aa|down_3|NC_018014.1_124190_125813_+	pfam12543, DUF3738, Protein of unknown function (DUF3738)	NA|127aa|down_4|NC_018014.1_126239_126620_+	cd00156, REC, phosphoacceptor receiver (REC) domain of response regulators (RRs) and pseudo response regulators (PRRs)	NA|273aa|down_5|NC_018014.1_126746_127565_+	NA	NA|304aa|down_6|NC_018014.1_127707_128619_+	pfam06439, DUF1080, Domain of Unknown Function (DUF1080)	NA|505aa|down_7|NC_018014.1_128678_130193_-	COG0659, SUL1, Sulfate permease and related transporters (MFS superfamily) [Inorganic ion transport and metabolism]	NA|285aa|down_8|NC_018014.1_130504_131359_-	COG0179, MhpD, 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) [Secondary metabolites biosynthesis, transport, and catabolism]	NA|736aa|down_9|NC_018014.1_131620_133828_+	PRK08324, PRK08324, bifunctional aldolase/short-chain dehydrogenase
GCF_000265425.1_ASM26542v1	NC_018014	Terriglobus roseus DSM 18391, complete sequence	2	242626-242715	2	CRISPRCasFinder	no		csa3,Cas9_archaeal,cas3,WYL,DinG	Orphan	GTTGGCGCGGGCCTTCAGCCCGC	23	0	0	NA	NA	NA	1	1	Orphan	csa3,Cas9_archaeal,cas3,WYL,DinG	NA,NA|210aa|down_6|NC_018014.1_251451_252081_+	NA|412aa|up_9|NC_018014.1_230098_231334_-	COG1668, NatB, ABC-type Na+ efflux pump, permease component [Energy production and conversion / Inorganic ion transport and metabolism]	NA|308aa|up_8|NC_018014.1_231375_232299_-	COG4152, COG4152, ABC-type uncharacterized transport system, ATPase component [General function prediction only]	NA|68aa|up_7|NC_018014.1_232303_232507_-	COG1942, COG1942, Uncharacterized protein, 4-oxalocrotonate tautomerase homolog [General function prediction only]	NA|775aa|up_6|NC_018014.1_232588_234913_+	PRK10917, PRK10917, ATP-dependent DNA helicase RecG; Provisional	NA|151aa|up_5|NC_018014.1_234978_235431_-	cd00156, REC, phosphoacceptor receiver (REC) domain of response regulators (RRs) and pseudo response regulators (PRRs)	NA|304aa|up_4|NC_018014.1_235808_236720_-	COG0384, COG0384, Predicted epimerase, PhzC/PhzF homolog [General function prediction only]	NA|219aa|up_3|NC_018014.1_236915_237572_+	cd03016, PRX_1cys, Peroxiredoxin (PRX) family, 1-cys PRX subfamily; composed of PRXs containing only one conserved cysteine, which serves as the peroxidatic cysteine	NA|243aa|up_2|NC_018014.1_237952_238681_+	pfam03544, TonB_C, Gram-negative bacterial TonB protein C-terminal	NA|221aa|up_1|NC_018014.1_239091_239754_-	pfam01027, Bax1-I, Inhibitor of apoptosis-promoting Bax1	NA|801aa|up_0|NC_018014.1_240142_242545_-	TIGR01970, ATP-dependent_RNA_helicase_HrpB, ATP-dependent helicase HrpB	NA|941aa|down_0|NC_018014.1_242970_245793_-	TIGR02063, Ribonuclease_R, ribonuclease R	NA|354aa|down_1|NC_018014.1_245832_246894_-	cd07731, ComA-like_MBL-fold, Competence protein ComA, ComEC and related proteins; MBL-fold metallo hydrolase domain	NA|266aa|down_2|NC_018014.1_247085_247883_-	pfam13640, 2OG-FeII_Oxy_3, 2OG-Fe(II) oxygenase superfamily	NA|297aa|down_3|NC_018014.1_248053_248944_-	COG1216, COG1216, Predicted glycosyltransferases [General function prediction only]	NA|451aa|down_4|NC_018014.1_249028_250381_+	cd00424, PolY, Y-family of DNA polymerases	NA|183aa|down_5|NC_018014.1_250783_251332_-	COG1321, TroR, Mn-dependent transcriptional regulator [Transcription]	NA|210aa|down_6|NC_018014.1_251451_252081_+	NA	NA|835aa|down_7|NC_018014.1_252159_254664_-	pfam00930, DPPIV_N, Dipeptidyl peptidase IV (DPP IV) N-terminal region	NA|245aa|down_8|NC_018014.1_254710_255445_-	cd18093, SpoU-like_TrmJ, SAM-dependent tRNA methylase related to TrmJ	NA|228aa|down_9|NC_018014.1_255596_256280_+	cd07721, yflN-like_MBL-fold, uncharacterized subgroup which includes Bacillus subtilis yflN; MBL-fold metallo hydrolase domain
GCF_000265425.1_ASM26542v1	NC_018014	Terriglobus roseus DSM 18391, complete sequence	3	1418825-1419178	1	CRT	no		csa3,Cas9_archaeal,cas3,WYL,DinG	Orphan	GGCTTTAGCCACNGNGATAGA	21	0	0	NA	NA	NA	8	8	Orphan	csa3,Cas9_archaeal,cas3,WYL,DinG	NA|412aa|up_4|NC_018014.1_1408938_1410174_-,NA	NA|535aa|up_9|NC_018014.1_1399239_1400844_-	cd16371, DMSOR_beta_like, uncharacterized subfamily of DMSO Reductase beta subunit family	NA|586aa|up_8|NC_018014.1_1400871_1402629_-	TIGR01931, Sulfite_reductase_flavoprotein_alpha-component, sulfite reductase [NADPH] flavoprotein, alpha-component	NA|600aa|up_7|NC_018014.1_1402625_1404425_-	PRK09567, nirA, NirA family protein	NA|557aa|up_6|NC_018014.1_1404851_1406522_+	pfam02602, HEM4, Uroporphyrinogen-III synthase HemD	NA|752aa|up_5|NC_018014.1_1406593_1408849_+	COG2183, Tex, Transcriptional accessory protein [Transcription]	NA|412aa|up_4|NC_018014.1_1408938_1410174_-	NA	NA|1165aa|up_3|NC_018014.1_1410477_1413972_+	pfam13620, CarboxypepD_reg, Carboxypeptidase regulatory-like domain	NA|429aa|up_2|NC_018014.1_1414123_1415410_+	cd17355, MFS_YcxA_like, MFS-type transporter YcxA and similar proteins of the Major Facilitator Superfamily of transporters	NA|392aa|up_1|NC_018014.1_1415444_1416620_+	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|271aa|up_0|NC_018014.1_1416623_1417436_+	COG1082, IolE, Sugar phosphate isomerases/epimerases [Carbohydrate transport and metabolism]	NA|267aa|down_0|NC_018014.1_1419199_1420000_+	PRK12745, PRK12745, 3-ketoacyl-(acyl-carrier-protein) reductase; Provisional	NA|404aa|down_1|NC_018014.1_1420474_1421686_+	pfam05426, Alginate_lyase, Alginate lyase	NA|822aa|down_2|NC_018014.1_1421728_1424194_+	PRK10150, PRK10150, beta-D-glucuronidase; Provisional	NA|824aa|down_3|NC_018014.1_1424229_1426701_-	cd06604, GH31_glucosidase_II_MalA, Alpha-glucosidase II-like	NA|733aa|down_4|NC_018014.1_1426810_1429009_-	COG3710, CadC, DNA-binding winged-HTH domains [Transcription]	NA|1122aa|down_5|NC_018014.1_1429231_1432597_+	pfam13620, CarboxypepD_reg, Carboxypeptidase regulatory-like domain	NA|615aa|down_6|NC_018014.1_1432685_1434530_+	cd11340, AmyAc_bac_CMD_like_3, Alpha amylase catalytic domain found in bacterial cyclomaltodextrinases and related proteins	NA|139aa|down_7|NC_018014.1_1434656_1435073_-	COG0432, COG0432, Uncharacterized conserved protein [Function unknown]	NA|224aa|down_8|NC_018014.1_1435439_1436111_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|462aa|down_9|NC_018014.1_1436107_1437493_-	COG4191, COG4191, Signal transduction histidine kinase regulating C4-dicarboxylate transport system [Signal transduction mechanisms]
GCF_000265425.1_ASM26542v1	NC_018014	Terriglobus roseus DSM 18391, complete sequence	4	1609963-1610053	3	CRISPRCasFinder	no		csa3,Cas9_archaeal,cas3,WYL,DinG	Orphan	TTCGCGGGCTGTAGGCACACGAC	23	0	0	NA	NA	NA	1	1	Orphan	csa3,Cas9_archaeal,cas3,WYL,DinG	NA|146aa|up_7|NC_018014.1_1599159_1599597_-,NA|120aa|up_4|NC_018014.1_1603986_1604346_-,NA	NA|281aa|up_9|NC_018014.1_1597407_1598250_+	PRK02090, PRK02090, phosphoadenylyl-sulfate reductase	NA|255aa|up_8|NC_018014.1_1598322_1599087_+	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|146aa|up_7|NC_018014.1_1599159_1599597_-	NA	NA|326aa|up_6|NC_018014.1_1599735_1600713_+	cd04187, DPM1_like_bac, Bacterial DPM1_like enzymes are related to eukaryotic DPM1	NA|859aa|up_5|NC_018014.1_1601303_1603880_-	pfam13620, CarboxypepD_reg, Carboxypeptidase regulatory-like domain	NA|120aa|up_4|NC_018014.1_1603986_1604346_-	NA	NA|623aa|up_3|NC_018014.1_1604740_1606609_-	pfam13520, AA_permease_2, Amino acid permease	NA|248aa|up_2|NC_018014.1_1606766_1607510_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|463aa|up_1|NC_018014.1_1607522_1608911_+	TIGR02966, Phosphate_regulon_sensor_protein_PhoR, phosphate regulon sensor kinase PhoR	NA|221aa|up_0|NC_018014.1_1609083_1609746_+	TIGR02135, Uncharacterized_protein, phosphate transport system regulatory protein PhoU	NA|1216aa|down_0|NC_018014.1_1610187_1613835_-	COG1074, RecB, ATP-dependent exoDNAse (exonuclease V) beta subunit (contains helicase and exonuclease domains) [DNA replication, recombination, and repair]	NA|919aa|down_1|NC_018014.1_1613828_1616585_-	TIGR03623, TIGR03623, probable DNA repair protein	NA|406aa|down_2|NC_018014.1_1616603_1617821_-	cd17333, MFS_FucP_MFSD4_like, Bacterial fucose permease, eukaryotic Major facilitator superfamily domain-containing protein 4, and similar proteins	NA|296aa|down_3|NC_018014.1_1617904_1618792_+	cd02968, SCO, SCO (an acronym for Synthesis of Cytochrome c Oxidase) family; composed of proteins similar to Sco1, a membrane-anchored protein possessing a soluble domain with a TRX fold	NA|628aa|down_4|NC_018014.1_1618902_1620786_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|703aa|down_5|NC_018014.1_1620804_1622913_-	COG1770, PtrB, Protease II [Amino acid transport and metabolism]	NA|171aa|down_6|NC_018014.1_1623116_1623629_+	pfam13628, DUF4142, Domain of unknown function (DUF4142)	NA|240aa|down_7|NC_018014.1_1623842_1624562_+	COG1512, COG1512, Beta-propeller domains of methanol dehydrogenase type [General function prediction only]	NA|190aa|down_8|NC_018014.1_1624618_1625188_+	pfam04011, LemA, LemA family	NA|335aa|down_9|NC_018014.1_1625284_1626289_-	COG0142, IspA, Geranylgeranyl pyrophosphate synthase [Coenzyme metabolism]
GCF_000265425.1_ASM26542v1	NC_018014	Terriglobus roseus DSM 18391, complete sequence	5	1711179-1711271	4	CRISPRCasFinder	no		csa3,Cas9_archaeal,cas3,WYL,DinG	Orphan	TCCTTTCGAATCTGATTTTTGCG	23	0	0	NA	NA	NA	1	1	Orphan	csa3,Cas9_archaeal,cas3,WYL,DinG	NA,NA	NA|407aa|up_9|NC_018014.1_1699733_1700954_+	pfam13975, gag-asp_proteas, gag-polyprotein putative aspartyl protease	NA|356aa|up_8|NC_018014.1_1701093_1702161_+	pfam03706, LPG_synthase_TM, Lysylphosphatidylglycerol synthase TM region	NA|664aa|up_7|NC_018014.1_1702188_1704180_+	COG0018, ArgS, Arginyl-tRNA synthetase [Translation, ribosomal structure and biogenesis]	NA|139aa|up_6|NC_018014.1_1704202_1704619_-	cd17580, REC_2_DhkD-like, second phosphoacceptor receiver (REC) domain of Dictyostelium discoideum hybrid signal transduction histidine kinase D and similar domains	NA|514aa|up_5|NC_018014.1_1704594_1706136_-	PRK00913, PRK00913, multifunctional aminopeptidase A; Provisional	NA|176aa|up_4|NC_018014.1_1706210_1706738_-	pfam01668, SmpB, SmpB protein	NA|829aa|up_3|NC_018014.1_1706927_1709414_+	COG1450, PulD, Type II secretory pathway, component PulD [Cell motility and secretion / Intracellular trafficking and secretion]	NA|141aa|up_2|NC_018014.1_1709506_1709929_+	COG2165, PulG, Type II secretory pathway, pseudopilin PulG [Cell motility and secretion / Intracellular trafficking and secretion]	NA|154aa|up_1|NC_018014.1_1709934_1710396_+	TIGR01710, Type_II_secretion_system_protein_G, type II secretion system protein G	NA|216aa|up_0|NC_018014.1_1710432_1711080_+	PRK05414, PRK05414, urocanate hydratase; Provisional	NA|214aa|down_0|NC_018014.1_1711322_1711964_-	PRK00454, engB, GTP-binding protein YsxC; Reviewed	NA|447aa|down_1|NC_018014.1_1712088_1713429_-	PRK09441, PRK09441, cytoplasmic alpha-amylase; Reviewed	NA|172aa|down_2|NC_018014.1_1713574_1714090_+	pfam01594, AI-2E_transport, AI-2E family transporter	NA|596aa|down_3|NC_018014.1_1714113_1715901_-	COG4986, COG4986, ABC-type anion transport system, duplicated permease component [Inorganic ion transport and metabolism]	NA|447aa|down_4|NC_018014.1_1715897_1717238_-	COG1116, TauB, ABC-type nitrate/sulfonate/bicarbonate transport system, ATPase component [Inorganic ion transport and metabolism]	NA|124aa|down_5|NC_018014.1_1717412_1717784_+	pfam00072, Response_reg, Response regulator receiver domain	NA|276aa|down_6|NC_018014.1_1717932_1718760_-	cd02511, Beta4Glucosyltransferase, UDP-glucose LOS-beta-1,4 glucosyltransferase is required for biosynthesis of lipooligosaccharide	NA|389aa|down_7|NC_018014.1_1718821_1719988_-	cd03821, GT4_Bme6-like, Brucella melitensis Bme6 and similar proteins	NA|411aa|down_8|NC_018014.1_1719995_1721228_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|330aa|down_9|NC_018014.1_1721284_1722274_-	TIGR03028, EpsE, polysaccharide export protein EpsE
GCF_000265425.1_ASM26542v1	NC_018014	Terriglobus roseus DSM 18391, complete sequence	6	2133077-2133170	5	CRISPRCasFinder	no		csa3,Cas9_archaeal,cas3,WYL,DinG	Orphan	CTGGGTACAGCACAAGCGATGTCCC	25	0	0	NA	NA	NA	1	1	Orphan	csa3,Cas9_archaeal,cas3,WYL,DinG	NA|495aa|up_8|NC_018014.1_2119589_2121074_-,NA|445aa|down_1|NC_018014.1_2137729_2139064_+,NA|119aa|down_2|NC_018014.1_2139136_2139493_+	NA|397aa|up_9|NC_018014.1_2118285_2119476_-	cd02858, E_set_Esterase_N, N-terminal Early set domain associated with the catalytic domain of esterase	NA|495aa|up_8|NC_018014.1_2119589_2121074_-	NA	NA|682aa|up_7|NC_018014.1_2121070_2123116_-	pfam13620, CarboxypepD_reg, Carboxypeptidase regulatory-like domain	NA|424aa|up_6|NC_018014.1_2123747_2125019_+	COG2382, Fes, Enterochelin esterase and related enzymes [Inorganic ion transport and metabolism]	NA|660aa|up_5|NC_018014.1_2125008_2126988_+	COG3534, AbfA, Alpha-L-arabinofuranosidase [Carbohydrate transport and metabolism]	NA|628aa|up_4|NC_018014.1_2127527_2129411_-	pfam00263, Secretin, Bacterial type II and III secretion system protein	NA|97aa|up_3|NC_018014.1_2129446_2129737_-	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|201aa|up_2|NC_018014.1_2129748_2130351_-	pfam02674, Colicin_V, Colicin V production protein	NA|298aa|up_1|NC_018014.1_2130356_2131250_-	PRK13961, PRK13961, phosphoribosylaminoimidazole-succinocarboxamide synthase; Provisional	NA|442aa|up_0|NC_018014.1_2131426_2132752_+	cd17394, MFS_FucP_like, Fucose permease and similar proteins of the Major Facilitator Superfamily of transporters	NA|1291aa|down_0|NC_018014.1_2133400_2137273_-	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|445aa|down_1|NC_018014.1_2137729_2139064_+	NA	NA|119aa|down_2|NC_018014.1_2139136_2139493_+	NA	NA|265aa|down_3|NC_018014.1_2139496_2140291_-	pfam13277, YmdB, YmdB-like protein	NA|195aa|down_4|NC_018014.1_2140879_2141464_-	pfam03692, CxxCxxCC, Putative zinc- or iron-chelating domain	NA|440aa|down_5|NC_018014.1_2141556_2142876_-	cd01360, Adenylsuccinate_lyase_1, Adenylsuccinate lyase (ASL)_subgroup 1	NA|393aa|down_6|NC_018014.1_2142941_2144120_+	sd00006, TPR, Tetratricopeptide repeat	NA|579aa|down_7|NC_018014.1_2144116_2145853_+	cd00400, Voltage_gated_ClC, CLC voltage-gated chloride channel	NA|86aa|down_8|NC_018014.1_2145912_2146170_-	cd02976, NrdH, NrdH-redoxin (NrdH) family; NrdH is a small monomeric protein with a conserved redox active CXXC motif within a TRX fold, characterized by a glutaredoxin (GRX)-like sequence and TRX-like activity profile	NA|259aa|down_9|NC_018014.1_2146193_2146970_-	cd05358, GlcDH_SDR_c, glucose 1 dehydrogenase (GlcDH), classical (c) SDRs
GCF_000265425.1_ASM26542v1	NC_018014	Terriglobus roseus DSM 18391, complete sequence	7	2140774-2140863	6	CRISPRCasFinder	no		csa3,Cas9_archaeal,cas3,WYL,DinG	Orphan	GAGGCGGGCTGAAGGCCCGCGAC	23	0	0	NA	NA	NA	1	1	Orphan	csa3,Cas9_archaeal,cas3,WYL,DinG	NA|445aa|up_2|NC_018014.1_2137729_2139064_+,NA|119aa|up_1|NC_018014.1_2139136_2139493_+,NA	NA|660aa|up_9|NC_018014.1_2125008_2126988_+	COG3534, AbfA, Alpha-L-arabinofuranosidase [Carbohydrate transport and metabolism]	NA|628aa|up_8|NC_018014.1_2127527_2129411_-	pfam00263, Secretin, Bacterial type II and III secretion system protein	NA|97aa|up_7|NC_018014.1_2129446_2129737_-	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|201aa|up_6|NC_018014.1_2129748_2130351_-	pfam02674, Colicin_V, Colicin V production protein	NA|298aa|up_5|NC_018014.1_2130356_2131250_-	PRK13961, PRK13961, phosphoribosylaminoimidazole-succinocarboxamide synthase; Provisional	NA|442aa|up_4|NC_018014.1_2131426_2132752_+	cd17394, MFS_FucP_like, Fucose permease and similar proteins of the Major Facilitator Superfamily of transporters	NA|1291aa|up_3|NC_018014.1_2133400_2137273_-	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|445aa|up_2|NC_018014.1_2137729_2139064_+	NA	NA|119aa|up_1|NC_018014.1_2139136_2139493_+	NA	NA|265aa|up_0|NC_018014.1_2139496_2140291_-	pfam13277, YmdB, YmdB-like protein	NA|195aa|down_0|NC_018014.1_2140879_2141464_-	pfam03692, CxxCxxCC, Putative zinc- or iron-chelating domain	NA|440aa|down_1|NC_018014.1_2141556_2142876_-	cd01360, Adenylsuccinate_lyase_1, Adenylsuccinate lyase (ASL)_subgroup 1	NA|393aa|down_2|NC_018014.1_2142941_2144120_+	sd00006, TPR, Tetratricopeptide repeat	NA|579aa|down_3|NC_018014.1_2144116_2145853_+	cd00400, Voltage_gated_ClC, CLC voltage-gated chloride channel	NA|86aa|down_4|NC_018014.1_2145912_2146170_-	cd02976, NrdH, NrdH-redoxin (NrdH) family; NrdH is a small monomeric protein with a conserved redox active CXXC motif within a TRX fold, characterized by a glutaredoxin (GRX)-like sequence and TRX-like activity profile	NA|259aa|down_5|NC_018014.1_2146193_2146970_-	cd05358, GlcDH_SDR_c, glucose 1 dehydrogenase (GlcDH), classical (c) SDRs	NA|477aa|down_6|NC_018014.1_2147066_2148497_-	COG0174, GlnA, Glutamine synthetase [Amino acid transport and metabolism]	NA|349aa|down_7|NC_018014.1_2148788_2149835_+	COG2008, GLY1, Threonine aldolase [Amino acid transport and metabolism]	NA|304aa|down_8|NC_018014.1_2149859_2150771_+	cd07573, CPA, N-carbamoylputrescine amidohydrolase (CPA) (class 11 nitrilases)	NA|161aa|down_9|NC_018014.1_2150782_2151265_+	pfam11937, DUF3455, Protein of unknown function (DUF3455)
GCF_000265425.1_ASM26542v1	NC_018014	Terriglobus roseus DSM 18391, complete sequence	8	3773094-3773410	7,1	CRISPRCasFinder,PILER-CR	no		csa3,Cas9_archaeal,cas3,WYL,DinG	Orphan	CTCATTTCCAGCCAGGTCCGGGA,TTTCCAGCCAGGTTCGGG	23,18	0	0	NA	NA	NA:NA	5,3	5	Orphan	csa3,Cas9_archaeal,cas3,WYL,DinG	NA|69aa|up_9|NC_018014.1_3761755_3761962_-,NA|562aa|up_4|NC_018014.1_3766635_3768321_-,NA|183aa|up_2|NC_018014.1_3769458_3770007_+,NA|512aa|up_0|NC_018014.1_3771435_3772971_-,NA|137aa|down_1|NC_018014.1_3775052_3775463_+,NA|101aa|down_7|NC_018014.1_3785378_3785681_+,NA|71aa|down_9|NC_018014.1_3787851_3788064_+	NA|69aa|up_9|NC_018014.1_3761755_3761962_-	NA	NA|322aa|up_8|NC_018014.1_3762065_3763031_+	cd05400, NT_2-5OAS_ClassI-CCAase, Nucleotidyltransferase (NT) domain of 2'5'-oligoadenylate (2-5A)synthetase (2-5OAS) and class I CCA-adding enzyme	NA|173aa|up_7|NC_018014.1_3763043_3763562_+	pfam18138, bacHORMA_1, Bacterial HORMA domain family 1	NA|312aa|up_6|NC_018014.1_3763564_3764500_+	COG1223, COG1223, Predicted ATPase (AAA+ superfamily) [General function prediction only]	NA|512aa|up_5|NC_018014.1_3764496_3766032_+	pfam18145, SAVED, SMODS-associated and fused to various effectors sensor domain	NA|562aa|up_4|NC_018014.1_3766635_3768321_-	NA	NA|313aa|up_3|NC_018014.1_3768396_3769335_-	cd00397, DNA_BRE_C, DNA breaking-rejoining enzymes, C-terminal catalytic domain	NA|183aa|up_2|NC_018014.1_3769458_3770007_+	NA	NA|233aa|up_1|NC_018014.1_3770430_3771129_-	cd02042, ParAB_family, partition proteins ParAB family	NA|512aa|up_0|NC_018014.1_3771435_3772971_-	NA	NA|79aa|down_0|NC_018014.1_3773852_3774089_-	NF033450, BREX_PglZ_1_B, BREX-1 system phosphatase PglZ type B	NA|137aa|down_1|NC_018014.1_3775052_3775463_+	NA	NA|605aa|down_2|NC_018014.1_3775612_3777427_+	pfam10412, TrwB_AAD_bind, Type IV secretion-system coupling protein DNA-binding domain	NA|324aa|down_3|NC_018014.1_3777475_3778447_-	cd00797, INT_RitB_C_like, C-terminal catalytic domain of recombinase RitB, a component of the recombinase trio	NA|293aa|down_4|NC_018014.1_3778446_3779325_-	cd01188, INT_RitA_C_like, C-terminal catalytic domain of recombinase RitA, a component of the recombinase trio	NA|625aa|down_5|NC_018014.1_3779760_3781635_-	COG1964, COG1964, Predicted Fe-S oxidoreductases [General function prediction only]	NA|919aa|down_6|NC_018014.1_3782027_3784784_+	pfam08751, TrwC, TrwC relaxase	NA|101aa|down_7|NC_018014.1_3785378_3785681_+	NA	NA|504aa|down_8|NC_018014.1_3785816_3787328_-	COG0210, UvrD, Superfamily I DNA and RNA helicases [DNA replication, recombination, and repair]	NA|71aa|down_9|NC_018014.1_3787851_3788064_+	NA
GCF_000265425.1_ASM26542v1	NC_018014	Terriglobus roseus DSM 18391, complete sequence	9	4025654-4025875	8	CRISPRCasFinder	no		csa3,Cas9_archaeal,cas3,WYL,DinG	Orphan	CACATAGGCGGCTAAGTGGTGCACCGCATCCCCCCTGCCGTAGCGAACAATGGT	54	0	0	NA	NA	NA	1	1	Orphan	csa3,Cas9_archaeal,cas3,WYL,DinG	NA|65aa|up_7|NC_018014.1_4019095_4019290_-,NA|190aa|up_4|NC_018014.1_4022072_4022642_-,NA|60aa|up_2|NC_018014.1_4023505_4023685_-,NA|92aa|up_1|NC_018014.1_4023759_4024035_+,NA|96aa|up_0|NC_018014.1_4024117_4024405_+,NA|344aa|down_1|NC_018014.1_4027255_4028287_-	NA|525aa|up_9|NC_018014.1_4016033_4017608_+	pfam17389, Bac_rhamnosid6H, Bacterial alpha-L-rhamnosidase 6 hairpin glycosidase domain	NA|302aa|up_8|NC_018014.1_4018055_4018961_-	PRK00104, scpA, segregation and condensation protein A; Reviewed	NA|65aa|up_7|NC_018014.1_4019095_4019290_-	NA	NA|520aa|up_6|NC_018014.1_4019321_4020881_-	PRK12283, PRK12283, tryptophanyl-tRNA synthetase; Reviewed	NA|235aa|up_5|NC_018014.1_4021333_4022038_-	cd06158, S2P-M50_like_1, Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms	NA|190aa|up_4|NC_018014.1_4022072_4022642_-	NA	NA|243aa|up_3|NC_018014.1_4022776_4023505_-	COG2968, COG2968, Uncharacterized conserved protein [Function unknown]	NA|60aa|up_2|NC_018014.1_4023505_4023685_-	NA	NA|92aa|up_1|NC_018014.1_4023759_4024035_+	NA	NA|96aa|up_0|NC_018014.1_4024117_4024405_+	NA	NA|392aa|down_0|NC_018014.1_4026162_4027338_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|344aa|down_1|NC_018014.1_4027255_4028287_-	NA	NA|283aa|down_2|NC_018014.1_4028480_4029329_+	cd19088, AKR_AKR13B1, AKR13B family of aldo-keto reductase (AKR)	NA|495aa|down_3|NC_018014.1_4029642_4031127_-	pfam03544, TonB_C, Gram-negative bacterial TonB protein C-terminal	NA|1050aa|down_4|NC_018014.1_4031283_4034433_-	COG3696, COG3696, Putative silver efflux pump [Inorganic ion transport and metabolism]	NA|395aa|down_5|NC_018014.1_4034435_4035620_-	pfam16576, HlyD_D23, Barrel-sandwich domain of CusB or HlyD membrane-fusion	NA|514aa|down_6|NC_018014.1_4035619_4037161_-	COG1538, TolC, Outer membrane protein [Cell envelope biogenesis, outer membrane / Intracellular trafficking and secretion]	NA|382aa|down_7|NC_018014.1_4037224_4038370_+	PRK11360, PRK11360, two-component system sensor histidine kinase AtoS	NA|457aa|down_8|NC_018014.1_4038366_4039737_+	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|286aa|down_9|NC_018014.1_4039777_4040635_-	COG5430, COG5430, Uncharacterized secreted protein [Function unknown]
GCF_000265425.1_ASM26542v1	NC_018014	Terriglobus roseus DSM 18391, complete sequence	10	4157153-4157217	9	CRISPRCasFinder	no		csa3,Cas9_archaeal,cas3,WYL,DinG	Orphan	AAAAATCAGATTCTAAAGGAGTT	23	1	1	4157176-4157194	NC_018014.1_4737090-4737108	NA	1	1	Orphan	csa3,Cas9_archaeal,cas3,WYL,DinG	NA|68aa|up_1|NC_018014.1_4156320_4156524_+,NA|196aa|down_3|NC_018014.1_4162032_4162620_+,NA|68aa|down_8|NC_018014.1_4166804_4167008_+	NA|223aa|up_9|NC_018014.1_4143473_4144142_-	sd00010, SLR, Sel1-like repeat	NA|868aa|up_8|NC_018014.1_4144148_4146752_-	COG4774, Fiu, Outer membrane receptor for monomeric catechols [Inorganic ion transport and metabolism]	NA|182aa|up_7|NC_018014.1_4147106_4147652_+	cd07820, SRPBCC_3, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins	NA|630aa|up_6|NC_018014.1_4147796_4149686_-	PRK05444, PRK05444, 1-deoxy-D-xylulose-5-phosphate synthase; Provisional	NA|471aa|up_5|NC_018014.1_4149775_4151188_-	COG1301, GltP, Na+/H+-dicarboxylate symporters [Energy production and conversion]	NA|179aa|up_4|NC_018014.1_4151248_4151785_+	pfam14534, DUF4440, Domain of unknown function (DUF4440)	NA|807aa|up_3|NC_018014.1_4151784_4154205_+	NF033092, HK_WalK, cell wall metabolism sensor histidine kinase WalK	NA|658aa|up_2|NC_018014.1_4154289_4156263_-	PRK00174, PRK00174, acetyl-CoA synthetase; Provisional	NA|68aa|up_1|NC_018014.1_4156320_4156524_+	NA	NA|191aa|up_0|NC_018014.1_4156520_4157093_-	COG2323, COG2323, Predicted membrane protein [Function unknown]	NA|324aa|down_0|NC_018014.1_4157362_4158334_-	COG1087, GalE, UDP-glucose 4-epimerase [Cell envelope biogenesis, outer membrane]	NA|375aa|down_1|NC_018014.1_4158722_4159847_+	pfam03544, TonB_C, Gram-negative bacterial TonB protein C-terminal	NA|629aa|down_2|NC_018014.1_4159950_4161837_+	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|196aa|down_3|NC_018014.1_4162032_4162620_+	NA	NA|201aa|down_4|NC_018014.1_4162704_4163307_-	pfam13505, OMP_b-brl, Outer membrane protein beta-barrel domain	NA|308aa|down_5|NC_018014.1_4163424_4164348_-	CHL00180, rbcR, LysR transcriptional regulator; Provisional	NA|388aa|down_6|NC_018014.1_4164460_4165624_+	PRK00915, PRK00915, 2-isopropylmalate synthase; Validated	NA|367aa|down_7|NC_018014.1_4165707_4166808_+	PRK00772, PRK00772, 3-isopropylmalate dehydrogenase; Provisional	NA|68aa|down_8|NC_018014.1_4166804_4167008_+	NA	NA|489aa|down_9|NC_018014.1_4167021_4168488_+	PRK05478, PRK05478, 3-isopropylmalate dehydratase large subunit
GCF_000265425.1_ASM26542v1	NC_018014	Terriglobus roseus DSM 18391, complete sequence	11	4276908-4277007	10	CRISPRCasFinder	no		csa3,Cas9_archaeal,cas3,WYL,DinG	Orphan	GGTCGCGCTGCGCGCGATTGAGCG	24	0	0	NA	NA	NA	1	1	Orphan	csa3,Cas9_archaeal,cas3,WYL,DinG	NA|137aa|up_2|NC_018014.1_4275469_4275880_-,NA|57aa|up_0|NC_018014.1_4276339_4276510_-,NA|206aa|down_2|NC_018014.1_4280402_4281020_-,NA|202aa|down_3|NC_018014.1_4281016_4281622_-,NA|263aa|down_4|NC_018014.1_4281657_4282446_-,NA|347aa|down_9|NC_018014.1_4285324_4286365_-	NA|132aa|up_9|NC_018014.1_4268538_4268934_-	cd04766, HTH_HspR, Helix-Turn-Helix DNA binding domain of the HspR transcription regulator	NA|123aa|up_8|NC_018014.1_4268948_4269317_-	pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain	NA|347aa|up_7|NC_018014.1_4269470_4270511_+	cd05332, 11beta-HSD1_like_SDR_c, 11beta-hydroxysteroid dehydrogenase type 1 (11beta-HSD1)-like, classical (c) SDRs	NA|404aa|up_6|NC_018014.1_4270631_4271843_-	COG0484, DnaJ, DnaJ-class molecular chaperone with C-terminal Zn finger domain [Posttranslational modification, protein turnover, chaperones]	NA|326aa|up_5|NC_018014.1_4271902_4272880_-	COG0266, Nei, Formamidopyrimidine-DNA glycosylase [DNA replication, recombination, and repair]	NA|501aa|up_4|NC_018014.1_4272987_4274490_-	COG0659, SUL1, Sulfate permease and related transporters (MFS superfamily) [Inorganic ion transport and metabolism]	NA|225aa|up_3|NC_018014.1_4274546_4275221_-	cd00884, beta_CA_cladeB, Carbonic anhydrases (CA) are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism in which the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide is followed by the regeneration of an active site by ionization of the zinc-bound water molecule and removal of a proton from the active site	NA|137aa|up_2|NC_018014.1_4275469_4275880_-	NA	NA|105aa|up_1|NC_018014.1_4275951_4276266_-	pfam05336, rhaM, L-rhamnose mutarotase	NA|57aa|up_0|NC_018014.1_4276339_4276510_-	NA	NA|637aa|down_0|NC_018014.1_4277078_4278989_-	PRK00290, dnaK, molecular chaperone DnaK; Provisional	NA|396aa|down_1|NC_018014.1_4279218_4280406_-	PTZ00473, PTZ00473, Plasmodium Vir superfamily; Provisional	NA|206aa|down_2|NC_018014.1_4280402_4281020_-	NA	NA|202aa|down_3|NC_018014.1_4281016_4281622_-	NA	NA|263aa|down_4|NC_018014.1_4281657_4282446_-	NA	NA|356aa|down_5|NC_018014.1_4282447_4283515_-	COG4972, PilM, Tfp pilus assembly protein, ATPase PilM [Cell motility and secretion / Intracellular trafficking and secretion]	NA|114aa|down_6|NC_018014.1_4283588_4283930_-	TIGR02607, Virulence-associated_protein_I, addiction module antidote protein, HigA family	NA|93aa|down_7|NC_018014.1_4283939_4284218_-	COG3549, HigB, Plasmid maintenance system killer protein [General function prediction only]	NA|300aa|down_8|NC_018014.1_4284397_4285297_+	pfam01784, NIF3, NIF3 (NGG1p interacting factor 3)	NA|347aa|down_9|NC_018014.1_4285324_4286365_-	NA
