assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCA_004295585.1_ASM429558v1	AP019400	Cohnella sp. HS21 DNA, complete genome	2	513740-513815	2	CRISPRCasFinder	no	DEDDh	cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	Unclear	TTATAGTTGCATTTTGCGACGATGG	25	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	NA|347aa|up_3|AP019400.1_508884_509925_+,NA|432aa|up_1|AP019400.1_510979_512275_+,NA|427aa|down_4|AP019400.1_517365_518646_+,NA|225aa|down_7|AP019400.1_521592_522267_-,NA|138aa|down_8|AP019400.1_522378_522792_-,NA|225aa|down_9|AP019400.1_522904_523579_-	NA|230aa|up_9|AP019400.1_501941_502631_+	cd14845, L-Ala-D-Glu_peptidase_like, L-Ala-D-Glu peptidase, also known as L-alanyl-D-glutamate endopeptidase	NA|278aa|up_8|AP019400.1_502622_503456_-	cd07516, HAD_Pase, phosphatase, similar to Escherichia coli Cof and Thermotoga maritima TM0651; belongs to the haloacid dehalogenase-like superfamily	NA|438aa|up_7|AP019400.1_503517_504831_-	cd01068, globin_sensor, Globin sensor domain of globin-coupled-sensors (GCSs), protoglobins (Pgbs), and sensor single-domain globins (SSDgbs); S family	NA|694aa|up_6|AP019400.1_504837_506919_-	PRK10060, PRK10060, cyclic di-GMP phosphodiesterase	NA|96aa|up_5|AP019400.1_506960_507248_-	cd07043, STAS_anti-anti-sigma_factors, Sulphate Transporter and Anti-Sigma factor antagonist) domain of anti-anti-sigma factors, key regulators of anti-sigma factors by phosphorylation	NA|397aa|up_4|AP019400.1_507468_508659_-	PRK13656, PRK13656, enoyl-[acyl-carrier-protein] reductase FabV	NA|347aa|up_3|AP019400.1_508884_509925_+	NA	NA|242aa|up_2|AP019400.1_510087_510813_+	pfam06271, RDD, RDD family	NA|432aa|up_1|AP019400.1_510979_512275_+	NA	NA|255aa|up_0|AP019400.1_512363_513128_-	cd07721, yflN-like_MBL-fold, uncharacterized subgroup which includes Bacillus subtilis yflN; MBL-fold metallo hydrolase domain	DEDDh|257aa|down_0|AP019400.1_513910_514681_-	cd06133, ERI-1_3'hExo_like, DEDDh 3'-5' exonuclease domain of Caenorhabditis elegans ERI-1, human 3' exonuclease, and similar proteins	NA|252aa|down_1|AP019400.1_514862_515618_+	pfam08241, Methyltransf_11, Methyltransferase domain	NA|319aa|down_2|AP019400.1_515693_516650_-	pfam16472, DUF5050, Domain of unknown function (DUF5050)	NA|171aa|down_3|AP019400.1_516863_517376_+	TIGR02954, RNA_polymerase_ECF-type_sigma_factor, RNA polymerase sigma-70 factor, TIGR02954 family	NA|427aa|down_4|AP019400.1_517365_518646_+	NA	NA|639aa|down_5|AP019400.1_519009_520926_+	PRK00413, thrS, threonyl-tRNA synthetase; Reviewed	NA|188aa|down_6|AP019400.1_521015_521579_+	COG5663, COG5663, Uncharacterized conserved protein [Function unknown]	NA|225aa|down_7|AP019400.1_521592_522267_-	NA	NA|138aa|down_8|AP019400.1_522378_522792_-	NA	NA|225aa|down_9|AP019400.1_522904_523579_-	NA
GCA_004295585.1_ASM429558v1	AP019400	Cohnella sp. HS21 DNA, complete genome	3	534579-534666	3	CRISPRCasFinder	no		cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	Orphan	CTAAGTTACATGAGTGCTCACCCA	24	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	NA|61aa|up_2|AP019400.1_532369_532552_+,NA|428aa|down_6|AP019400.1_541646_542930_+,NA|166aa|down_9|AP019400.1_544491_544989_-	NA|717aa|up_9|AP019400.1_524649_526800_+	COG0840, Tar, Methyl-accepting chemotaxis protein [Cell motility and secretion / Signal transduction mechanisms]	NA|150aa|up_8|AP019400.1_526884_527334_-	cd08893, SRPBCC_CalC_Aha1-like_GntR-HTH, Putative hydrophobic ligand-binding SRPBCC domain of an uncharacterized subgroup of CalC- and Aha1-like proteins; some contain an N-terminal GntR family winged HTH DNA-binding domain	NA|269aa|up_7|AP019400.1_527479_528286_+	pfam12833, HTH_18, Helix-turn-helix domain	NA|280aa|up_6|AP019400.1_528286_529126_-	pfam04439, Adenyl_transf, Streptomycin adenylyltransferase	NA|144aa|up_5|AP019400.1_529209_529641_-	pfam00582, Usp, Universal stress protein family	NA|169aa|up_4|AP019400.1_529767_530274_+	pfam07853, DUF1648, Protein of unknown function (DUF1648)	NA|642aa|up_3|AP019400.1_530409_532335_+	COG0488, Uup, ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only]	NA|61aa|up_2|AP019400.1_532369_532552_+	NA	NA|126aa|up_1|AP019400.1_533113_533491_-	PRK13315, PRK13315, heme oxygenase	NA|323aa|up_0|AP019400.1_533538_534507_-	TIGR03944, ornithine_cyclodeaminase, 2,3-diaminopropionate biosynthesis protein SbnB	NA|407aa|down_0|AP019400.1_535690_536911_-	COG1541, PaaK, Coenzyme F390 synthetase [Coenzyme metabolism]	NA|227aa|down_1|AP019400.1_537189_537870_+	pfam14398, ATPgrasp_YheCD, YheC/D like ATP-grasp	NA|113aa|down_2|AP019400.1_538008_538347_+	pfam13453, zf-TFIIB, Transcription factor zinc-finger	NA|291aa|down_3|AP019400.1_538607_539480_+	COG1120, FepC, ABC-type cobalamin/Fe3+-siderophores transport systems, ATPase components [Inorganic ion transport and metabolism / Coenzyme metabolism]	NA|321aa|down_4|AP019400.1_539584_540547_+	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|366aa|down_5|AP019400.1_540549_541647_+	COG1721, COG1721, Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) [General function prediction only]	NA|428aa|down_6|AP019400.1_541646_542930_+	NA	NA|242aa|down_7|AP019400.1_543227_543953_+	smart00342, HTH_ARAC, helix_turn_helix, arabinose operon control protein	NA|127aa|down_8|AP019400.1_544025_544406_+	cd08349, BLMA_like, Bleomycin binding protein (BLMA) and similar proteins	NA|166aa|down_9|AP019400.1_544491_544989_-	NA
GCA_004295585.1_ASM429558v1	AP019400	Cohnella sp. HS21 DNA, complete genome	4	619091-619165	4	CRISPRCasFinder	no		cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	Orphan	CTCCCCTGCGCAAGACTGCGCTTCT	25	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	NA|76aa|up_3|AP019400.1_615182_615410_+,NA	NA|299aa|up_9|AP019400.1_610480_611377_+	PRK05416, PRK05416, RNase adapter RapZ	NA|329aa|up_8|AP019400.1_611381_612368_+	TIGR01826, Putative_gluconeogenesis_factor, conserved hypothetical protein, cofD-related	NA|312aa|up_7|AP019400.1_612373_613309_+	TIGR00647, DNA_bind_WhiA, DNA-binding protein WhiA	NA|90aa|up_6|AP019400.1_613412_613682_+	pfam00381, PTS-HPr, PTS HPr component phosphorylation site	NA|254aa|up_5|AP019400.1_613800_614562_+	COG2968, COG2968, Uncharacterized conserved protein [Function unknown]	NA|87aa|up_4|AP019400.1_614718_614979_-	pfam08970, Sda, Sporulation inhibitor A	NA|76aa|up_3|AP019400.1_615182_615410_+	NA	NA|195aa|up_2|AP019400.1_615618_616203_-	PRK00277, clpP, ATP-dependent Clp protease proteolytic subunit; Reviewed	NA|341aa|up_1|AP019400.1_616700_617723_+	COG2390, DeoR, Transcriptional regulator, contains sigma factor-related N-terminal domain [Transcription]	NA|337aa|up_0|AP019400.1_617844_618855_+	COG0057, GapA, Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase [Carbohydrate transport and metabolism]	NA|394aa|down_0|AP019400.1_619271_620453_+	PRK00073, pgk, phosphoglycerate kinase; Provisional	NA|252aa|down_1|AP019400.1_620464_621220_+	PRK00042, tpiA, triosephosphate isomerase; Provisional	NA|511aa|down_2|AP019400.1_621221_622754_+	PRK05434, PRK05434, 2,3-bisphosphoglycerate-independent phosphoglycerate mutase	NA|431aa|down_3|AP019400.1_622788_624081_+	PRK00077, eno, enolase; Provisional	NA|301aa|down_4|AP019400.1_624173_625076_-	COG0583, LysR, Transcriptional regulator [Transcription]	NA|559aa|down_5|AP019400.1_625265_626942_+	cd08492, PBP2_NikA_DppA_OppA_like_15, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|309aa|down_6|AP019400.1_626964_627891_+	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|280aa|down_7|AP019400.1_627905_628745_+	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|677aa|down_8|AP019400.1_628761_630792_+	PRK10261, PRK10261, glutathione transporter ATP-binding protein; Provisional	NA|287aa|down_9|AP019400.1_630830_631691_+	COG1788, AtoD, Acyl CoA:acetate/3-ketoacid CoA transferase, alpha subunit [Lipid metabolism]
GCA_004295585.1_ASM429558v1	AP019400	Cohnella sp. HS21 DNA, complete genome	6	1351687-1351807	5	CRISPRCasFinder	no		cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	Orphan	TGGCAGACAGGTATGTGTAGTTCACTCAGTAAGTCTGGAGAAGAGA	46	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	NA|223aa|up_2|AP019400.1_1350209_1350878_+,NA|70aa|up_1|AP019400.1_1350935_1351145_+,NA|66aa|up_0|AP019400.1_1351152_1351350_+,NA|730aa|down_8|AP019400.1_1367066_1369256_+	NA|698aa|up_9|AP019400.1_1339922_1342016_+	cd11576, GH99_GH71_like_2, Uncharacterized glycoside hydrolase family 99-like domain	NA|204aa|up_8|AP019400.1_1342362_1342974_+	cd04084, CBM6_xylanase-like, Carbohydrate Binding Module 6 (CBM6); many are appended to glycoside hydrolase (GH) family 11 and GH43 xylanase domains	NA|258aa|up_7|AP019400.1_1343116_1343890_+	cd05826, Sortase_B, Sortase domain found in class B sortases	NA|312aa|up_6|AP019400.1_1344066_1345002_+	cd03267, ABC_NatA_like, ATP-binding cassette domain of an uncharacterized transporter similar in sequence to NatA	NA|209aa|up_5|AP019400.1_1345183_1345810_+	COG4587, COG4587, ABC-type uncharacterized transport system, permease component [General function prediction only]	NA|269aa|up_4|AP019400.1_1345811_1346618_+	pfam06182, ABC2_membrane_6, ABC-2 family transporter protein	NA|1038aa|up_3|AP019400.1_1346734_1349848_+	cd00257, Fascin, Fascin-like domain; members include actin-bundling/crosslinking proteins facsin, histoactophilin and singed;  identified in sea urchin, Drosophila, Xenopus, rodents, and humans; The fascin-like domain adopts a beta-trefoil topology and contains an internal threefold repeat; the fascin subgroup contains four copies of the domain; Structurally similar to fibroblast  growth factor (FGF)	NA|223aa|up_2|AP019400.1_1350209_1350878_+	NA	NA|70aa|up_1|AP019400.1_1350935_1351145_+	NA	NA|66aa|up_0|AP019400.1_1351152_1351350_+	NA	NA|1032aa|down_0|AP019400.1_1352018_1355114_+	pfam06283, ThuA, Trehalose utilisation	NA|882aa|down_1|AP019400.1_1355225_1357871_-	PRK15098, PRK15098, beta-glucosidase BglX	NA|284aa|down_2|AP019400.1_1358008_1358860_-	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|378aa|down_3|AP019400.1_1359081_1360215_+	PRK11650, ugpC, sn-glycerol-3-phosphate ABC transporter ATP-binding protein UgpC	NA|981aa|down_4|AP019400.1_1360211_1363154_+	cd13585, PBP2_TMBP_like, The periplasmic-binding component of ABC transport systems specific for trehalose/maltose and similar oligosaccharides; possess type 2 periplasmic binding fold	NA|324aa|down_5|AP019400.1_1363150_1364122_+	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|286aa|down_6|AP019400.1_1364126_1364984_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|672aa|down_7|AP019400.1_1365010_1367026_+	cd05819, NHL, NHL repeat unit of beta-propeller proteins	NA|730aa|down_8|AP019400.1_1367066_1369256_+	NA	NA|298aa|down_9|AP019400.1_1369252_1370146_+	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]
GCA_004295585.1_ASM429558v1	AP019400	Cohnella sp. HS21 DNA, complete genome	7	1399313-1399799	1,2	CRT,PILER-CR	no	csa3	cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	Type I-A	GAAAAAATCGTACAACACC,AGCCCAGTTACTAGAAAAAATCGTACAACACCT	19,33	0	0	NA	NA	NA:NA	9,2	9	Orphan	cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	NA|305aa|up_0|AP019400.1_1397972_1398887_-,NA	NA|186aa|up_9|AP019400.1_1389561_1390119_-	cd03015, PRX_Typ2cys, Peroxiredoxin (PRX) family, Typical 2-Cys PRX subfamily; PRXs are thiol-specific antioxidant (TSA) proteins, which confer a protective role in cells through its peroxidase activity by reducing hydrogen peroxide, peroxynitrite, and organic hydroperoxides	NA|254aa|up_8|AP019400.1_1390250_1391012_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|135aa|up_7|AP019400.1_1390981_1391386_-	cd07153, Fur_like, Ferric uptake regulator(Fur) and related metalloregulatory proteins; typically iron-dependent, DNA-binding repressors and activators	NA|318aa|up_6|AP019400.1_1391513_1392467_+	cd12831, TmCorA-like_u2, Uncharacterized bacterial subfamily of the Thermotoga maritima CorA-like family	NA|222aa|up_5|AP019400.1_1392503_1393169_+	cd16423, HAD_BPGM-like, uncharacterized subfamily of beta-phosphoglucomutase-like family, similar to uncharacterized Bacillus subtilis YhcW	NA|365aa|up_4|AP019400.1_1393269_1394364_+	cd01541, PBP1_AraR, ligand-binding domain of DNA transcription repressor specific for arabinose (AraR) which is a member of the LacI-GalR family of bacterial transcription regulators	NA|235aa|up_3|AP019400.1_1394470_1395175_+	PRK08193, araD, L-ribulose-5-phosphate 4-epimerase AraD	NA|558aa|up_2|AP019400.1_1395229_1396903_+	PRK04123, PRK04123, ribulokinase; Provisional	NA|216aa|up_1|AP019400.1_1397028_1397676_+	COG0398, COG0398, Uncharacterized conserved protein [Function unknown]	NA|305aa|up_0|AP019400.1_1397972_1398887_-	NA	NA|343aa|down_0|AP019400.1_1400070_1401099_+	COG1609, PurR, Transcriptional regulators [Transcription]	NA|453aa|down_1|AP019400.1_1401231_1402590_+	cd13585, PBP2_TMBP_like, The periplasmic-binding component of ABC transport systems specific for trehalose/maltose and similar oligosaccharides; possess type 2 periplasmic binding fold	NA|298aa|down_2|AP019400.1_1402816_1403710_+	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|276aa|down_3|AP019400.1_1403699_1404527_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|364aa|down_4|AP019400.1_1404560_1405652_+	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|241aa|down_5|AP019400.1_1405660_1406383_+	COG4813, ThuA, Trehalose utilization protein [Carbohydrate transport and metabolism]	NA|1089aa|down_6|AP019400.1_1406854_1410121_+	cd06597, GH31_transferase_CtsY, CtsY (cyclic tetrasaccharide-synthesizing enzyme Y)-like	NA|1284aa|down_7|AP019400.1_1410209_1414061_+	COG1501, COG1501, Alpha-glucosidases, family 31 of glycosyl hydrolases [Carbohydrate transport and metabolism]	NA|301aa|down_8|AP019400.1_1414220_1415123_+	COG1506, DAP2, Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism]	csa3|113aa|down_9|AP019400.1_1415436_1415775_+	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors
GCA_004295585.1_ASM429558v1	AP019400	Cohnella sp. HS21 DNA, complete genome	8	2010763-2011523	2,3	CRT,PILER-CR	no	DEDDh	cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	Unclear	CCCCAGTNGNNGNAATTAGTCG,CCGTCTCAAACGAGCACCAGTAGCAGCAATTAGTCGACCACCCCGC	22,46	0	0	NA	NA	NA:NA	14,2	14	Orphan	cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	NA|94aa|up_9|AP019400.1_1995408_1995690_+,NA|222aa|up_1|AP019400.1_2008430_2009096_+,NA	NA|94aa|up_9|AP019400.1_1995408_1995690_+	NA	NA|781aa|up_8|AP019400.1_1995940_1998283_+	COG3973, COG3973, Superfamily I DNA and RNA helicases [General function prediction only]	NA|605aa|up_7|AP019400.1_1998353_2000168_+	TIGR01389, recQ, ATP-dependent DNA helicase RecQ	DEDDh|203aa|up_6|AP019400.1_2000258_2000867_-	cd06127, DEDDh, DEDDh 3'-5' exonuclease domain family	NA|1499aa|up_5|AP019400.1_2001026_2005523_+	COG5492, COG5492, Bacterial surface proteins containing Ig-like domains [Cell motility and secretion]	NA|260aa|up_4|AP019400.1_2005732_2006512_+	COG1349, GlpR, Transcriptional regulators of sugar metabolism [Transcription / Carbohydrate transport and metabolism]	NA|333aa|up_3|AP019400.1_2006514_2007513_+	PRK06740, PRK06740, histidinol phosphate phosphatase domain-containing protein	NA|260aa|up_2|AP019400.1_2007509_2008289_+	COG1285, SapB, Uncharacterized membrane protein [Function unknown]	NA|222aa|up_1|AP019400.1_2008430_2009096_+	NA	NA|476aa|up_0|AP019400.1_2009252_2010680_+	COG0659, SUL1, Sulfate permease and related transporters (MFS superfamily) [Inorganic ion transport and metabolism]	NA|276aa|down_0|AP019400.1_2011645_2012473_+	cd00519, Lipase_3, Lipase (class 3)	NA|308aa|down_1|AP019400.1_2012695_2013619_+	COG0583, LysR, Transcriptional regulator [Transcription]	NA|189aa|down_2|AP019400.1_2013650_2014217_+	TIGR03567, FMN_reductase_NADPH, FMN reductase, SsuE family	NA|251aa|down_3|AP019400.1_2014589_2015342_+	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|388aa|down_4|AP019400.1_2015394_2016558_+	cd05669, M20_Acy1_YxeP-like, M20 Peptidase aminoacyclase-1 YxeP-like proteins, including YxeP, YtnL, YjiB and HipO2	NA|319aa|down_5|AP019400.1_2016758_2017715_+	PRK11259, solA, N-methyl-L-tryptophan oxidase	NA|165aa|down_6|AP019400.1_2017728_2018223_+	pfam12867, DinB_2, DinB superfamily	NA|179aa|down_7|AP019400.1_2018260_2018797_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|440aa|down_8|AP019400.1_2018815_2020135_+	TIGR03860, FMN_nitrolo, FMN-dependent oxidoreductase, nitrilotriacetate monooxygenase family	NA|323aa|down_9|AP019400.1_2020329_2021298_+	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]
GCA_004295585.1_ASM429558v1	AP019400	Cohnella sp. HS21 DNA, complete genome	9	2133696-2133833	6	CRISPRCasFinder	no		cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	Orphan	TATAGCCGTCCTAAGGATGGCAAAGCCGTTTTACTTGTA	39	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	NA|187aa|up_7|AP019400.1_2126063_2126624_+,NA|206aa|up_5|AP019400.1_2127622_2128240_+,NA|69aa|down_3|AP019400.1_2135361_2135568_-,NA|167aa|down_5|AP019400.1_2138718_2139219_+,NA|480aa|down_7|AP019400.1_2140828_2142268_-	NA|452aa|up_9|AP019400.1_2123431_2124787_+	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|345aa|up_8|AP019400.1_2124822_2125857_+	TIGR02872, UPF0118_membrane_protein_YtvI, sporulation integral membrane protein YtvI	NA|187aa|up_7|AP019400.1_2126063_2126624_+	NA	NA|103aa|up_6|AP019400.1_2126819_2127128_-	pfam12978, DUF3862, Domain of Unknown Function with PDB structure (DUF3862)	NA|206aa|up_5|AP019400.1_2127622_2128240_+	NA	NA|215aa|up_4|AP019400.1_2128626_2129271_+	pfam13624, SurA_N_3, SurA N-terminal domain	NA|270aa|up_3|AP019400.1_2129593_2130403_-	cd03293, ABC_NrtD_SsuB_transporters, ATP-binding cassette domain of the nitrate and sulfonate transporters	NA|347aa|up_2|AP019400.1_2130415_2131456_-	cd13560, PBP2_taurine, Taurine-binding periplasmic protein; the type 2 periplasmic binding protein fold	NA|256aa|up_1|AP019400.1_2131481_2132249_-	COG0600, TauC, ABC-type nitrate/sulfonate/bicarbonate transport system, permease component [Inorganic ion transport and metabolism]	NA|379aa|up_0|AP019400.1_2132300_2133437_-	PRK00719, PRK00719, alkanesulfonate monooxygenase; Provisional	NA|78aa|down_0|AP019400.1_2133931_2134165_+	pfam11007, CotJA, Spore coat associated protein JA (CotJA)	NA|167aa|down_1|AP019400.1_2134157_2134658_+	pfam12652, CotJB, CotJB protein	NA|190aa|down_2|AP019400.1_2134673_2135243_+	pfam05067, Mn_catalase, Manganese containing catalase	NA|69aa|down_3|AP019400.1_2135361_2135568_-	NA	NA|756aa|down_4|AP019400.1_2136158_2138426_+	PRK15098, PRK15098, beta-glucosidase BglX	NA|167aa|down_5|AP019400.1_2138718_2139219_+	NA	NA|491aa|down_6|AP019400.1_2139247_2140720_+	COG2268, COG2268, Uncharacterized protein conserved in bacteria [Function unknown]	NA|480aa|down_7|AP019400.1_2140828_2142268_-	NA	NA|222aa|down_8|AP019400.1_2142460_2143126_+	cd03392, PAP2_like_2, PAP2_like_2 proteins	NA|248aa|down_9|AP019400.1_2143216_2143960_+	pfam14398, ATPgrasp_YheCD, YheC/D like ATP-grasp
GCA_004295585.1_ASM429558v1	AP019400	Cohnella sp. HS21 DNA, complete genome	10	2316753-2317461	3	CRT	no	DEDDh	cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	Unclear	TGAGATAGANTANAGCGGNNNNACCGACTAATT	33	0	0	NA	NA	NA	13	13	Orphan	cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	NA|48aa|up_0|AP019400.1_2316332_2316476_-,NA|79aa|down_5|AP019400.1_2323238_2323475_-	NA|249aa|up_9|AP019400.1_2307959_2308706_+	COG5578, COG5578, Predicted integral membrane protein [Function unknown]	NA|127aa|up_8|AP019400.1_2309012_2309393_+	pfam07386, DUF1499, Protein of unknown function (DUF1499)	NA|171aa|up_7|AP019400.1_2309468_2309981_-	PRK00522, tpx, thiol peroxidase	NA|301aa|up_6|AP019400.1_2310091_2310994_+	cd08434, PBP2_GltC_like, The substrate binding domain of LysR-type transcriptional regulator GltC, which activates gltA expression of glutamate synthase operon, contains type 2 periplasmic binding fold	NA|228aa|up_5|AP019400.1_2311070_2311754_-	COG2738, COG2738, Predicted Zn-dependent protease [General function prediction only]	NA|133aa|up_4|AP019400.1_2311813_2312212_-	cd04779, HTH_MerR-like_sg4, Helix-Turn-Helix DNA binding domain of putative transcription regulators from the MerR superfamily	NA|455aa|up_3|AP019400.1_2312653_2314018_+	COG0004, AmtB, Ammonia permease [Inorganic ion transport and metabolism]	NA|363aa|up_2|AP019400.1_2314075_2315164_+	COG2905, COG2905, Predicted signal-transduction protein containing cAMP-binding and CBS domains [Signal transduction mechanisms]	DEDDh|248aa|up_1|AP019400.1_2315160_2315904_+	PRK07740, PRK07740, hypothetical protein; Provisional	NA|48aa|up_0|AP019400.1_2316332_2316476_-	NA	NA|434aa|down_0|AP019400.1_2317569_2318871_-	cd17474, MFS_YfmO_like, Bacillus subtilis multidrug efflux protein YfmO and similar transporters of the Major Facilitator Superfamily	NA|194aa|down_1|AP019400.1_2319069_2319651_+	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|538aa|down_2|AP019400.1_2319800_2321414_-	PRK12344, PRK12344, putative alpha-isopropylmalate/homocitrate synthase family transferase; Provisional	NA|430aa|down_3|AP019400.1_2321562_2322852_+	PRK01810, PRK01810, DNA polymerase IV; Validated	NA|79aa|down_4|AP019400.1_2322971_2323208_+	COG1141, Fer, Ferredoxin [Energy production and conversion]	NA|79aa|down_5|AP019400.1_2323238_2323475_-	NA	NA|610aa|down_6|AP019400.1_2323407_2325237_+	cd07302, CHD, cyclase homology domain	NA|885aa|down_7|AP019400.1_2325307_2327962_+	NF033165, lipo_LipL45, lipoprotein LipL45	NA|524aa|down_8|AP019400.1_2328001_2329573_+	cd14953, NHL_like_1, Uncharacterized NHL-repeat domain in bacterial proteins	NA|487aa|down_9|AP019400.1_2329589_2331050_+	cd16913, YkuD_like, L,D-transpeptidases/carboxypeptidases similar to Bacillus YkuD
GCA_004295585.1_ASM429558v1	AP019400	Cohnella sp. HS21 DNA, complete genome	13	2783427-2783520	8	CRISPRCasFinder	no		cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	Orphan	TGAACTACACATATGGGTTGGCAGACAG	28	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	NA|176aa|up_3|AP019400.1_2778519_2779047_+,NA|50aa|down_0|AP019400.1_2783563_2783713_+	NA|554aa|up_9|AP019400.1_2770252_2771914_-	cd01138, FeuA, Periplasmic binding protein FeuA	NA|339aa|up_8|AP019400.1_2772035_2773052_-	cd01138, FeuA, Periplasmic binding protein FeuA	NA|352aa|up_7|AP019400.1_2773375_2774431_+	pfam01032, FecCD, FecCD transport family	NA|345aa|up_6|AP019400.1_2774433_2775468_+	pfam01032, FecCD, FecCD transport family	NA|572aa|up_5|AP019400.1_2775508_2777224_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|351aa|up_4|AP019400.1_2777258_2778311_+	COG0492, TrxB, Thioredoxin reductase [Posttranslational modification, protein turnover, chaperones]	NA|176aa|up_3|AP019400.1_2778519_2779047_+	NA	NA|509aa|up_2|AP019400.1_2779286_2780813_+	COG0672, FTR1, High-affinity Fe2+/Pb2+ permease [Inorganic ion transport and metabolism]	NA|392aa|up_1|AP019400.1_2780831_2782007_+	COG2822, COG2822, Predicted periplasmic lipoprotein involved in iron transport [Inorganic ion transport and metabolism]	NA|413aa|up_0|AP019400.1_2782030_2783269_+	TIGR01412, Probable_deferrochelatase/peroxidase_EfeN, Tat-translocated enzyme	NA|50aa|down_0|AP019400.1_2783563_2783713_+	NA	NA|284aa|down_1|AP019400.1_2784014_2784866_-	cd07385, MPP_YkuE_C, Bacillus subtilis YkuE and related proteins, C-terminal metallophosphatase domain	NA|425aa|down_2|AP019400.1_2785009_2786284_+	COG0840, Tar, Methyl-accepting chemotaxis protein [Cell motility and secretion / Signal transduction mechanisms]	NA|180aa|down_3|AP019400.1_2786461_2787001_+	cd00431, cysteine_hydrolases, Cysteine hydrolases; This family contains amidohydrolases, like CSHase (N-carbamoylsarcosine amidohydrolase), involved in creatine metabolism and nicotinamidase, converting nicotinamide to nicotinic acid and ammonia in the pyridine nucleotide cycle	NA|481aa|down_4|AP019400.1_2787050_2788493_+	PRK09243, PRK09243, nicotinate phosphoribosyltransferase; Validated	NA|281aa|down_5|AP019400.1_2788574_2789417_+	TIGR00762, DegV, EDD domain protein, DegV family	NA|172aa|down_6|AP019400.1_2789517_2790033_+	COG4682, COG4682, Predicted membrane protein [Function unknown]	NA|80aa|down_7|AP019400.1_2790153_2790393_+	COG0384, COG0384, Predicted epimerase, PhzC/PhzF homolog [General function prediction only]	NA|158aa|down_8|AP019400.1_2790368_2790842_+	TIGR00654, Uncharacterized_isomerase_YddE, phenazine biosynthesis protein PhzF family	NA|389aa|down_9|AP019400.1_2791029_2792196_+	cd17325, MFS_MdtG_SLC18_like, bacterial MdtG-like and eukaryotic solute carrier 18 (SLC18) family of the Major Facilitator Superfamily of transporters
GCA_004295585.1_ASM429558v1	AP019400	Cohnella sp. HS21 DNA, complete genome	14	2812519-2812691	5	PILER-CR	no		cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	Orphan	TTCGAAATAATCGAGCAAC	19	1	1	2812590-2812622	AP019400.1_2812380-2812412	NA	3	3	Orphan	cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	NA|102aa|up_8|AP019400.1_2803773_2804079_-,NA|107aa|down_6|AP019400.1_2819460_2819781_-	NA|219aa|up_9|AP019400.1_2803073_2803730_+	cd01834, SGNH_hydrolase_like_2, SGNH_hydrolase subfamily	NA|102aa|up_8|AP019400.1_2803773_2804079_-	NA	NA|309aa|up_7|AP019400.1_2804184_2805111_+	COG0679, COG0679, Predicted permeases [General function prediction only]	NA|375aa|up_6|AP019400.1_2805294_2806419_+	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|358aa|up_5|AP019400.1_2806431_2807505_+	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|291aa|up_4|AP019400.1_2807536_2808409_+	COG1082, IolE, Sugar phosphate isomerases/epimerases [Carbohydrate transport and metabolism]	NA|286aa|up_3|AP019400.1_2808401_2809259_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|256aa|up_2|AP019400.1_2809255_2810023_+	PRK08277, PRK08277, D-mannonate oxidoreductase; Provisional	NA|354aa|up_1|AP019400.1_2810057_2811119_+	PRK03906, PRK03906, mannonate dehydratase; Provisional	NA|215aa|up_0|AP019400.1_2811274_2811919_+	cd00452, KDPG_aldolase, KDPG and KHG aldolase	NA|360aa|down_0|AP019400.1_2812716_2813796_-	PRK09358, PRK09358, adenosine deaminase; Provisional	NA|218aa|down_1|AP019400.1_2814048_2814702_+	PRK01362, PRK01362, fructose-6-phosphate aldolase	NA|173aa|down_2|AP019400.1_2814879_2815398_+	cd04617, CBS_pair_CcpN, Two tandem repeats of the cystathionine beta-synthase (CBS pair) domains of CcpN repressor	NA|269aa|down_3|AP019400.1_2815409_2816216_+	pfam03618, Kinase-PPPase, Kinase/pyrophosphorylase	NA|578aa|down_4|AP019400.1_2816297_2818031_-	COG4176, ProW, ABC-type proline/glycine betaine transport system, permease component [Amino acid transport and metabolism]	NA|403aa|down_5|AP019400.1_2818023_2819232_-	COG4175, ProV, ABC-type proline/glycine betaine transport system, ATPase component [Amino acid transport and metabolism]	NA|107aa|down_6|AP019400.1_2819460_2819781_-	NA	NA|319aa|down_7|AP019400.1_2820152_2821109_+	COG0564, RluA, Pseudouridylate synthases, 23S RNA-specific [Translation, ribosomal structure and biogenesis]	NA|94aa|down_8|AP019400.1_2821163_2821445_+	pfam12823, DUF3817, Domain of unknown function (DUF3817)	NA|580aa|down_9|AP019400.1_2821682_2823422_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]
GCA_004295585.1_ASM429558v1	AP019400	Cohnella sp. HS21 DNA, complete genome	16	3696238-3696343	10	CRISPRCasFinder	no		cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	Orphan	ATATATTAAAATAAAAAGCACTACATTATTTC	32	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	NA|54aa|up_5|AP019400.1_3688033_3688195_-,NA|199aa|up_4|AP019400.1_3688160_3688757_+,NA|91aa|up_3|AP019400.1_3688801_3689074_-,NA|135aa|down_0|AP019400.1_3696754_3697159_-,NA|91aa|down_1|AP019400.1_3697386_3697659_-,NA|189aa|down_2|AP019400.1_3697854_3698421_+,NA|232aa|down_5|AP019400.1_3699756_3700452_-,NA|213aa|down_6|AP019400.1_3700448_3701087_-,NA|283aa|down_7|AP019400.1_3701079_3701928_-,NA|77aa|down_9|AP019400.1_3703371_3703602_+	NA|197aa|up_9|AP019400.1_3679071_3679662_-	pfam13302, Acetyltransf_3, Acetyltransferase (GNAT) domain	NA|1700aa|up_8|AP019400.1_3679990_3685090_+	pfam09479, Flg_new, Listeria-Bacteroides repeat domain (List_Bact_rpt)	NA|313aa|up_7|AP019400.1_3685182_3686121_-	COG3947, COG3947, Response regulator containing CheY-like receiver and SARP domains [Signal transduction mechanisms]	NA|608aa|up_6|AP019400.1_3686117_3687941_-	COG3275, LytS, Putative regulator of cell autolysis [Signal transduction mechanisms]	NA|54aa|up_5|AP019400.1_3688033_3688195_-	NA	NA|199aa|up_4|AP019400.1_3688160_3688757_+	NA	NA|91aa|up_3|AP019400.1_3688801_3689074_-	NA	NA|316aa|up_2|AP019400.1_3689459_3690407_-	pfam01636, APH, Phosphotransferase enzyme family	NA|1051aa|up_1|AP019400.1_3691179_3694332_-	COG2409, COG2409, Predicted drug exporters of the RND superfamily [General function prediction only]	NA|521aa|up_0|AP019400.1_3694632_3696195_-	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily	NA|135aa|down_0|AP019400.1_3696754_3697159_-	NA	NA|91aa|down_1|AP019400.1_3697386_3697659_-	NA	NA|189aa|down_2|AP019400.1_3697854_3698421_+	NA	NA|156aa|down_3|AP019400.1_3698554_3699022_-	COG1670, RimL, Acetyltransferases, including N-acetylases of ribosomal proteins [Translation, ribosomal structure and biogenesis]	NA|227aa|down_4|AP019400.1_3699086_3699767_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|232aa|down_5|AP019400.1_3699756_3700452_-	NA	NA|213aa|down_6|AP019400.1_3700448_3701087_-	NA	NA|283aa|down_7|AP019400.1_3701079_3701928_-	NA	NA|256aa|down_8|AP019400.1_3702311_3703079_-	pfam15595, Imm51, Immunity protein 51	NA|77aa|down_9|AP019400.1_3703371_3703602_+	NA
GCA_004295585.1_ASM429558v1	AP019400	Cohnella sp. HS21 DNA, complete genome	18	5213856-5214007	11	CRISPRCasFinder	no		cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	Orphan	CAAGTAAAACGGCTTCACCGTCCTTGGGACGGTAGTATACGTTTGTATCGAGC	53	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	NA|104aa|up_4|AP019400.1_5207329_5207641_+,NA|119aa|up_3|AP019400.1_5207739_5208096_-,NA|220aa|down_2|AP019400.1_5217255_5217915_-	NA|247aa|up_9|AP019400.1_5203877_5204618_-	cd11614, SAF_CpaB_FlgA_like, SAF domains of the flagella basal body P-ring formation protein FlgA and the flp pilus assembly CpaB	NA|209aa|up_8|AP019400.1_5204753_5205380_-	cd06262, metallo-hydrolase-like_MBL-fold, mainly hydrolytic enzymes and related proteins which carry out various biological functions; MBL-fold metallohydrolase domain	NA|189aa|up_7|AP019400.1_5205393_5205960_-	pfam14595, Thioredoxin_9, Thioredoxin	NA|204aa|up_6|AP019400.1_5205996_5206608_-	COG0586, DedA, Uncharacterized membrane-associated protein [Function unknown]	NA|154aa|up_5|AP019400.1_5206814_5207276_+	pfam13473, Cupredoxin_1, Cupredoxin-like domain	NA|104aa|up_4|AP019400.1_5207329_5207641_+	NA	NA|119aa|up_3|AP019400.1_5207739_5208096_-	NA	NA|743aa|up_2|AP019400.1_5208219_5210448_-	COG3307, RfaL, Lipid A core - O-antigen ligase and related enzymes [Cell envelope biogenesis, outer membrane]	NA|234aa|up_1|AP019400.1_5211035_5211737_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|236aa|up_0|AP019400.1_5213113_5213821_-	pfam07238, PilZ, PilZ domain	NA|552aa|down_0|AP019400.1_5214258_5215914_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|388aa|down_1|AP019400.1_5216092_5217256_-	pfam13679, Methyltransf_32, Methyltransferase domain	NA|220aa|down_2|AP019400.1_5217255_5217915_-	NA	NA|266aa|down_3|AP019400.1_5217916_5218714_-	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|173aa|down_4|AP019400.1_5218736_5219255_-	pfam06866, DUF1256, Protein of unknown function (DUF1256)	NA|74aa|down_5|AP019400.1_5219458_5219680_+	pfam06569, DUF1128, Protein of unknown function (DUF1128)	NA|157aa|down_6|AP019400.1_5219768_5220239_-	pfam14275, DUF4362, Domain of unknown function (DUF4362)	NA|532aa|down_7|AP019400.1_5220429_5222025_-	PRK05270, PRK05270, UDP-glucose--hexose-1-phosphate uridylyltransferase	NA|329aa|down_8|AP019400.1_5222040_5223027_-	COG1087, GalE, UDP-glucose 4-epimerase [Cell envelope biogenesis, outer membrane]	NA|394aa|down_9|AP019400.1_5223030_5224212_-	PRK05322, PRK05322, galactokinase; Provisional
GCA_004295585.1_ASM429558v1	AP019400	Cohnella sp. HS21 DNA, complete genome	20	5492168-5492244	13	CRISPRCasFinder	no		cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	Orphan	TGAGTTTGTCGGTCTCTCCTCTTAT	25	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	NA|62aa|up_2|AP019400.1_5490346_5490532_+,NA|107aa|up_0|AP019400.1_5491773_5492094_-,NA	NA|142aa|up_9|AP019400.1_5481824_5482250_-	cd00732, CheW, CheW, a small regulator protein, unique to the chemotaxis signalling in prokaryotes and archea	NA|531aa|up_8|AP019400.1_5482262_5483855_-	COG0840, Tar, Methyl-accepting chemotaxis protein [Cell motility and secretion / Signal transduction mechanisms]	NA|197aa|up_7|AP019400.1_5483866_5484457_-	COG2206, COG2206, c-di-GMP phosphodiesterase class II (HD-GYP domain) [Signal transduction mechanisms]	NA|314aa|up_6|AP019400.1_5484760_5485702_+	PRK00236, xerC, site-specific tyrosine recombinase XerC; Reviewed	NA|451aa|up_5|AP019400.1_5486067_5487420_-	TIGR02966, Phosphate_regulon_sensor_protein_PhoR, phosphate regulon sensor kinase PhoR	NA|225aa|up_4|AP019400.1_5487416_5488091_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|654aa|up_3|AP019400.1_5488146_5490108_-	COG1680, AmpC, Beta-lactamase class C and other penicillin binding proteins [Defense mechanisms]	NA|62aa|up_2|AP019400.1_5490346_5490532_+	NA	NA|280aa|up_1|AP019400.1_5490826_5491666_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|107aa|up_0|AP019400.1_5491773_5492094_-	NA	NA|386aa|down_0|AP019400.1_5492507_5493665_-	cd01163, DszC, Dibenzothiophene (DBT) desulfurization enzyme C	NA|394aa|down_1|AP019400.1_5493716_5494898_-	TIGR03565, alk_sulf_monoox, alkanesulfonate monooxygenase, FMNH(2)-dependent	NA|322aa|down_2|AP019400.1_5495054_5496020_-	COG1725, COG1725, Predicted transcriptional regulators [Transcription]	NA|217aa|down_3|AP019400.1_5496181_5496832_+	COG1725, COG1725, Predicted transcriptional regulators [Transcription]	NA|254aa|down_4|AP019400.1_5496843_5497605_+	cd05233, SDR_c, classical (c) SDRs	NA|178aa|down_5|AP019400.1_5498174_5498708_-	COG0461, PyrE, Orotate phosphoribosyltransferase [Nucleotide transport and metabolism]	NA|247aa|down_6|AP019400.1_5498888_5499629_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|359aa|down_7|AP019400.1_5499709_5500786_-	COG0614, FepB, ABC-type Fe3+-hydroxamate transport system, periplasmic component [Inorganic ion transport and metabolism]	NA|439aa|down_8|AP019400.1_5501813_5503130_-	smart00812, Alpha_L_fucos, Alpha-L-fucosidase	NA|275aa|down_9|AP019400.1_5503201_5504026_-	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]
GCA_004295585.1_ASM429558v1	AP019400	Cohnella sp. HS21 DNA, complete genome	21	5847374-5847830	5	CRT	no	csa3	cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	Type I-A	GTGCCTCTNCCAAGNCCCGTTGCTCGAAA	29	5	7	5847455-5847477|5847455-5847477|5847455-5847477|5847507-5847541|5847675-5847697|5847727-5847749|5847779-5847801	AP019400.1_1351527-1351505|AP019400.1_5025641-5025663|AP019400.1_5247215-5247237|AP019400.1_5847339-5847373|AP019400.1_1351858-1351836|AP019400.1_5247215-5247237|AP019400.1_2334839-2334817	NA	8	8	Orphan	cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	NA|59aa|up_8|AP019400.1_5837041_5837218_-,NA|42aa|up_7|AP019400.1_5837238_5837364_-,NA|123aa|down_0|AP019400.1_5848131_5848500_-	NA|145aa|up_9|AP019400.1_5836388_5836823_-	pfam06491, Disulph_isomer, Disulphide isomerase	NA|59aa|up_8|AP019400.1_5837041_5837218_-	NA	NA|42aa|up_7|AP019400.1_5837238_5837364_-	NA	NA|553aa|up_6|AP019400.1_5837440_5839099_-	PRK11360, PRK11360, two-component system sensor histidine kinase AtoS	NA|419aa|up_5|AP019400.1_5839337_5840594_+	pfam03486, HI0933_like, HI0933-like protein	NA|275aa|up_4|AP019400.1_5840660_5841485_-	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|299aa|up_3|AP019400.1_5841497_5842394_-	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|434aa|up_2|AP019400.1_5842571_5843873_-	cd14748, PBP2_UgpB, The periplasmic-binding component of ABC transport system specific for sn-glycerol-3-phosphate; possesses type 2 periplasmic binding fold	NA|709aa|up_1|AP019400.1_5843928_5846055_-	COG3408, GDB1, Glycogen debranching enzyme [Carbohydrate transport and metabolism]	NA|339aa|up_0|AP019400.1_5846203_5847220_-	COG1609, PurR, Transcriptional regulators [Transcription]	NA|123aa|down_0|AP019400.1_5848131_5848500_-	NA	NA|432aa|down_1|AP019400.1_5848569_5849865_-	pfam05975, EcsB, Bacterial ABC transporter protein EcsB	NA|256aa|down_2|AP019400.1_5849864_5850632_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|499aa|down_3|AP019400.1_5850779_5852276_-	cd07808, FGGY_D-XK_EcXK-like, Escherichia coli xylulokinase-like D-xylulose kinases; a subgroup of the FGGY family of carbohydrate kinases	NA|439aa|down_4|AP019400.1_5852378_5853695_-	PRK05474, PRK05474, xylose isomerase; Provisional	NA|387aa|down_5|AP019400.1_5853847_5855008_+	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]	csa3|107aa|down_6|AP019400.1_5855113_5855434_+	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|1113aa|down_7|AP019400.1_5855436_5858775_+	COG0553, HepA, Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair]	NA|330aa|down_8|AP019400.1_5858944_5859934_+	cd01138, FeuA, Periplasmic binding protein FeuA	NA|553aa|down_9|AP019400.1_5860061_5861720_+	cd01138, FeuA, Periplasmic binding protein FeuA
GCA_004295585.1_ASM429558v1	AP019400	Cohnella sp. HS21 DNA, complete genome	23	6841497-6841834	6	CRT	no		cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	Orphan	AAAAGTAACATTAGTGAACNCACAGTNCTCTA	32	3	9	6841580-6841598|6841580-6841598|6841682-6841700|6841682-6841700|6841682-6841700|6841682-6841700|6841682-6841700|6841784-6841802|6841784-6841802	AP019400.1_544782-544800|AP019400.1_544935-544953|AP019400.1_1317956-1317974|AP019400.1_1318007-1318025|AP019400.1_1318058-1318076|AP019400.1_1318109-1318127|AP019400.1_1318160-1318178|AP019400.1_544782-544800|AP019400.1_544935-544953	NA	6	6	Orphan	cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	NA|55aa|up_3|AP019400.1_6836717_6836882_-,NA|63aa|down_6|AP019400.1_6854502_6854691_-,NA|43aa|down_7|AP019400.1_6854964_6855093_+	NA|333aa|up_9|AP019400.1_6828878_6829877_+	cd13557, PBP2_SsuA, Substrate binding domain of sulfonate binding protein, a member of the type 2 periplasmic binding fold superfamily	NA|396aa|up_8|AP019400.1_6829915_6831103_+	PRK00719, PRK00719, alkanesulfonate monooxygenase; Provisional	NA|260aa|up_7|AP019400.1_6831074_6831854_+	PRK11365, ssuC, aliphatic sulfonate ABC transporter permease SsuC	NA|257aa|up_6|AP019400.1_6831872_6832643_+	PRK11247, ssuB, aliphatic sulfonates transport ATP-binding subunit; Provisional	NA|1023aa|up_5|AP019400.1_6832716_6835785_-	COG2274, SunT, ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain [Defense mechanisms]	NA|259aa|up_4|AP019400.1_6835813_6836590_-	PRK00059, prsA, peptidylprolyl isomerase; Provisional	NA|55aa|up_3|AP019400.1_6836717_6836882_-	NA	NA|684aa|up_2|AP019400.1_6837007_6839059_-	smart00421, HTH_LUXR, helix_turn_helix, Lux Regulon	NA|212aa|up_1|AP019400.1_6839358_6839994_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|467aa|up_0|AP019400.1_6839990_6841391_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|194aa|down_0|AP019400.1_6842080_6842662_+	pfam11579, DUF3238, Protein of unknown function (DUF3238)	NA|1683aa|down_1|AP019400.1_6842837_6847886_-	pfam00395, SLH, S-layer homology domain	NA|213aa|down_2|AP019400.1_6848089_6848728_-	cd02137, MhqN-like, nitroreductase family protein similar to the NAD(P)H nitroreductase MhqN	NA|120aa|down_3|AP019400.1_6848838_6849198_+	cd01109, HTH_YyaN, Helix-Turn-Helix DNA binding domain of the MerR-like transcription regulators YyaN and YraB	NA|786aa|down_4|AP019400.1_6849214_6851572_-	COG3973, COG3973, Superfamily I DNA and RNA helicases [General function prediction only]	NA|852aa|down_5|AP019400.1_6851952_6854508_+	COG2909, MalT, ATP-dependent transcriptional regulator [Transcription]	NA|63aa|down_6|AP019400.1_6854502_6854691_-	NA	NA|43aa|down_7|AP019400.1_6854964_6855093_+	NA	NA|413aa|down_8|AP019400.1_6855096_6856335_+	COG1748, LYS9, Saccharopine dehydrogenase and related proteins [Amino acid transport and metabolism]	NA|376aa|down_9|AP019400.1_6856336_6857464_+	cd06829, PLPDE_III_CANSDC, Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme Carboxynorspermidine Decarboxylase
GCA_004295585.1_ASM429558v1	AP019400	Cohnella sp. HS21 DNA, complete genome	24	7007333-7007652	7	CRT	no	cas3	cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	Unclear	CTGTAACCATACTATTTAT	19	1	1	7007503-7007534	AP019400.1_7007301-7007332	NA	6	6	Unclear	cas3,DEDDh,csa3,WYL,DinG,RT,PD-DExK	NA|189aa|up_8|AP019400.1_6998074_6998641_-,NA|67aa|down_5|AP019400.1_7015835_7016036_+	NA|69aa|up_9|AP019400.1_6997925_6998132_+	pfam04542, Sigma70_r2, Sigma-70 region 2	NA|189aa|up_8|AP019400.1_6998074_6998641_-	NA	NA|160aa|up_7|AP019400.1_6998796_6999276_-	pfam02590, SPOUT_MTase, Predicted SPOUT methyltransferase	NA|71aa|up_6|AP019400.1_6999275_6999488_-	TIGR03833, TIGR03833, conserved hypothetical protein	cas3|667aa|up_5|AP019400.1_6999695_7001696_-	COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA replication, recombination, and repair / Transcription / Translation, ribosomal structure and biogenesis]	NA|241aa|up_4|AP019400.1_7002023_7002746_-	pfam07332, Phage_holin_3_6, Putative Actinobacterial Holin-X, holin superfamily III	NA|252aa|up_3|AP019400.1_7002769_7003525_-	pfam09843, DUF2070, Predicted membrane protein (DUF2070)	NA|437aa|up_2|AP019400.1_7003535_7004846_-	cd03408, SPFH_like_u1, Uncharacterized family; SPFH (stomatin, prohibitin, flotillin, and HflK/C) superfamily	NA|52aa|up_1|AP019400.1_7005211_7005367_+	pfam07561, DUF1540, Domain of Unknown Function (DUF1540)	NA|504aa|up_0|AP019400.1_7005536_7007048_-	cd07770, FGGY_GntK, Gluconate kinases; a subfamily of the FGGY family of carbohydrate kinases	NA|678aa|down_0|AP019400.1_7008126_7010160_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|571aa|down_1|AP019400.1_7010180_7011893_-	PRK10789, PRK10789, SmdA family multidrug ABC transporter permease/ATP-binding protein	NA|481aa|down_2|AP019400.1_7012422_7013865_+	cd12823, Mrs2_Mfm1p-like, Saccharomyces cerevisiae inner mitochondrial membrane Mg2+ transporters Mfm1p and Mrs2p-like family	NA|56aa|down_3|AP019400.1_7013999_7014167_-	TIGR04129, conserved_hypothetical_protein, CxxH/CxxC protein, BA_5709 family	NA|444aa|down_4|AP019400.1_7014194_7015526_-	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|67aa|down_5|AP019400.1_7015835_7016036_+	NA	NA|267aa|down_6|AP019400.1_7016080_7016881_-	cd07733, YycJ-like_MBL-fold, uncharacterized subgroup which includes Bacillus subtilis YycJ and related proteins; MBL-fold metallo hydrolase domain	NA|257aa|down_7|AP019400.1_7016958_7017729_-	pfam09648, YycI, YycH protein	NA|439aa|down_8|AP019400.1_7017775_7019092_-	pfam07435, YycH, YycH protein	NA|612aa|down_9|AP019400.1_7019088_7020924_-	NF033092, HK_WalK, cell wall metabolism sensor histidine kinase WalK
