Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP016501 | Streptococcus agalactiae strain WC1535 chromosome, complete genome | 2 crisprs | RT,DEDDh,csm6,csn2,cas2,cas1,cas9,DinG,cas3,WYL | 1 | 3 | 6 | 1 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP016501_1 | 846807-847108 | TypeII |
II-A
Consensus repeat of NZ_CP016501_1
|
4 spacers
spacers of NZ_CP016501_1
>1.1|846844|29|NZ_CP016501|CRT ACATATCCTTTTGTTAGGTCAAAAGAAGA >1.2|846910|29|NZ_CP016501|CRT,PILER-CR,CRISPRCasFinder GGTTATATGGTCGATATGATTCTACAACA >1.3|846976|29|NZ_CP016501|CRT,PILER-CR,CRISPRCasFinder TTGATAGATTTCTTACAAAATGAACGCAT >1.4|847042|30|NZ_CP016501|CRT,PILER-CR,CRISPRCasFinder CGTAAGTCGTGCGATGAAACCATTTACTCG |
csn2,cas2,cas1,cas9 |
CRISPR arrays and Neighbor proteins around NZ_CP016501_1
The CRISPR arrays of NZ_CP016501_1 >merge|NZ_CP016501|1|846807-847108|CRT,PILER-CR,CRISPRCasFinder ATGATGTCCTAGCATAATAACAGCACAGCTCTAAAACACATATCCTTTTGTTAGGTCAAAAGAAGATGTTTTGGAACCATTCGAAACAGCACAGCTCTAAAACGGTTATATGGTCGATATGATTCTACAACATGTTTTGGAACCATTCGAAACAGCACAGCTCTAAAACTTGATAGATTTCTTACAAAATGAACGCATTGTTTTGGAACCATTCGAAACAGCACAGCTCTAAAACCGTAAGTCGTGCGATGAAACCATTTACTCGTGTTTTGGAACCATTCGAAACAGCACAGCTCTAAAAC >NZ_CP016501|1|1|846807-847108|CRT ATGATGTCCTAGCATAATAACAGCACAGCTCTAAAAC ACATATCCTTTTGTTAGGTCAAAAGAAGA TGTTTTGGAACCATTCGAAACAGCACAGCTCTAAAAC GGTTATATGGTCGATATGATTCTACAACA TGTTTTGGAACCATTCGAAACAGCACAGCTCTAAAAC TTGATAGATTTCTTACAAAATGAACGCAT TGTTTTGGAACCATTCGAAACAGCACAGCTCTAAAAC CGTAAGTCGTGCGATGAAACCATTTACTCG TGTTTTGGAACCATTCGAAACAGCACAGCTCTAAAAC >NZ_CP016501|1|1|846873-847108|PILER-CR TGTTTTGGAACCATTCGAAACAGCACAGCTCTAAAAC GGTTATATGGTCGATATGATTCTACAACA TGTTTTGGAACCATTCGAAACAGCACAGCTCTAAAAC TTGATAGATTTCTTACAAAATGAACGCAT TGTTTTGGAACCATTCGAAACAGCACAGCTCTAAAAC CGTAAGTCGTGCGATGAAACCATTTACTCG TGTTTTGGAACCATTCGAAACAGCACAGCTCTAAAAC >NZ_CP016501|1|1|846873-847108|CRISPRCasFinder TGTTTTGGAACCATTCGAAACAGCACAGCTCTAAAAC GGTTATATGGTCGATATGATTCTACAACA TGTTTTGGAACCATTCGAAACAGCACAGCTCTAAAAC TTGATAGATTTCTTACAAAATGAACGCAT TGTTTTGGAACCATTCGAAACAGCACAGCTCTAAAAC CGTAAGTCGTGCGATGAAACCATTTACTCG TGTTTTGGAACCATTCGAAACAGCACAGCTCTAAAAC
>NZ_CP016501.1|WP_011058334.1|846382_846553_-|hypothetical-protein MKGYNSNLMVKGLGCSTLYLIISLTALVLLVIAGVFFVINTCKLTRKAVENLSIIN >NZ_CP016501.1|WP_000438314.1|845833_846250_-|nucleoside-diphosphate-kinase MEQTFFMIKPDGVKRGFIGEVISRIERRGFSIDRLEVRYADADILKRHYAELTDRPFFPTLVDYMTSGPVIIGVISGEEVISTWRTMMGSTNPKDALPGTIRGDFAQAPSPNQATCNIVHGSDSPESATREIAIWFNN >NZ_CP016501.1|WP_001019133.1|843865_845698_-|elongation-factor-4 MNIEDLKKRQEKIRNFSIIAHIDHGKSTLADRILEKTETVSSREMQAQLLDSMDLERERGITIKLNAIELNYTAKDGETYIFHLIDTPGHVDFTYEVSRSLAACEGAILVVDAAQGIEAQTLANVYLALDNDLEILPVINKIDLPAADPERVRAEVEDVIGLDASEAVLASAKAGIGIEEILEQIVEKVPAPTGEVDAPLQALIFDSVYDAYRGVILQVRIVNGMVKPGDKIQMMSNGKTFDVTEVGIFTPKAVGRDFLATGDVGYIAASIKTVADTRVGDTITLANNPAIEPLHGYKQMNPMVFAGLYPIESNKYNDLREALEKLQLNDASLQFEPETSQALGFGFRCGFLGLLHMDVIQERLEREFNIDLIMTAPSVVYHVNTTDGEMLEVSNPSEFPDPTRVDSIEEPYVKAQIMVPQEFVGAVMELAQRKRGDFVTMDYIDDNRVNVIYQIPLAEIVFDFFDKLKSSTRGYASFDYEISEYRRSQLVKMDILLNGDKVDALSFIVHKEFAYERGKLIVDKLKKIIPRQQFEVPIQAAIGQKIVARSDIKALRKNVLAKCYGGDVSRKRKLLEKQKAGKKRMKAIGSVEVPQEAFLSVLSMDDDDKK >NZ_CP016501.1|WP_070001305.1|840222_840834_-|HD-domain-containing-protein MVHHKLKNDPSGHDWFHIVRVRNLAVELAHKEGANTFICQMAALLHDIIDDKICQDSKQASYELTQWLYSQDLAIAEVEHILDILENISFKAGTGLTMKTLEGQIVQDADRLDAMGAIGIARTMAYSGSKGRLIHDPNLKPRENLTLEEYRNGQDTAIMHFYEKLLKLKDLMNTKQGKMLAQKRHDFLELYLAEFYAEWNGKR >NZ_CP016501.1|WP_000621142.1|839749_840214_-|N-acetyltransferase MIRRAKEKDLPDIAELLKQILMLHHEVRPDIFHTRGSKFSKEQLKEMLIDESKPIFVYESDEGKVVAHLFLQLQEKRDLPRKSFKTLYIDDLCIDEEVRGQQIGQKLMDFARQYAKKHGCYNITLNVWNDNQRAVSFYEKLGFKPQQTQMEEIL >NZ_CP016501.1|WP_000665410.1|839315_839750_-|peptide-methionine-(R)-S-oxide-reductase MKETQEELRQRIGHTAYQVTQNSATEHAFTGKYDDFFEEGIYVDIVSGEVLFSSLDKFQSGCGWPAFSKPIENRMVTNHQDHSHGMHRIEVRSRQADSHLGHVFNDGPVDAGGLRYCINSAALDFIPYDQMAKRGYGDYLSLFD >NZ_CP016501.1|WP_000065321.1|835268_836372_-|DUF2974-domain-containing-protein 
MSNIITYLKNNSNLTFDELALNDVDILCLNEFGYISFEKLINTTEMKSVLVCELYHEYLQTMAKSYSFMFTSQRHDLCQLMMTSKRFKNLTLSYYRAEISLEFEKQFAAMVFTIPNINYHQVVFRGTDANLIGWKEDFKLTYMREISAHRSAIKYLNTILPYFDKVVLSGHSKGGNLALYAAMFTKPDLKAKIDLIWLIDSPGLQKTLLPTTEYKTTKQKCIRLLPEESIVGMMLYSDIEPLIISSNARGILQHDVTTWEIQEPAILKTGTGLSLKSICFEKTFQQWMAELKSQERKLFFDLLFDSFLSSGVSSLDDFNLASRAKMMKAFHSFRELDDDKKRLFNKSLKLLVTIFWGAYHDNSRETK >NZ_CP016501.1|WP_000146566.1|834509_835148_+|antibiotic-acetyltransferase MASILKKHLEKRLELKNDPMRTDMSTGASYPKYGFSVGKYTYGYQQFFYEGVNLKEIGAFCSIAQNVTITGLNHPTDHITTNPFIYYKSRGFINEDRADLIDEKKNGKVIIGNDVWIGTNVTILPSVTIGNGAIIGAGSVITKDIPDYGVVAGTPAKIIKYRFSEEEITLLNASQWWNWSDEAIKEHISEFSDKKEFFNTLKSISENKNHKL >NZ_CP016501.1|WP_001056394.1|833187_833799_-|TVP38/TMEM64-family-protein MNMKLSKRYRFWQKVIKALGVLALIATLVLVVYLYKLGILNDSNELKDLVHKYEFWGPMIFIVAQIVQIVFPVIPGGVTTVAGFLIFGPTLGFIYNYIGIIIGSVILFWLVKFYGRKFVLLFMDQKTFDKYESKLETSGYEKFFIFCMASPISPADIMVMITGLSNMSIKRFVTIIMITKPISIIGYSYLWIYGGDILKNFLN >NZ_CP016501.1|WP_000467138.1|830933_831716_+|hypothetical-protein MFGKLLKYELKSVGKWYLTLNAAVLLVSIILGLVLKALGGNFSTDTNSTSAQIFTIILVLLLAMVISGSLLSTLAIIIKRFYSNIFGRQGYLTLTLPVTTNQIICSKLLASLLWSIFNIFIVIIGIILVILPLVGIGQFVVAFPEIYKIISSSNAPLFIAYFFLSYVAGTLLIYLSIAVGQLFTNKRVLMGIVSYFGISLLITFLTLIIDSIFHIDLFNSHANATFSQPVLLYNILVSIVEIAIFYMLTHSIIKYKLNIQ >NZ_CP016501.1|WP_000590706.1|847208_847874_-|type-II-A-CRISPR-associated-protein-Csn2 MIKINFPILDEPLVLSNATILTIEDVSVYSSLVKHFYQYDVDEHLKLFDDKQKSLKATELMLVTDILGYDVNSAPILKLIHGDLENQFNEKPEVKSMVEKLAATITELIAFECLENELDLEYDEITILELIKALGVKIETQSDTIFEKCFEIIQVYHYLTKKNLLVFVNSGAYLTKDEVIKLCEYINLMQKSVLFLEPRRLYDLPQYVIDKDYFLIGENMV >NZ_CP016501.1|WP_001242359.1|847860_848187_-|CRISPR-associated-endonuclease-Cas2 MRMILMFDMPTETAEERKAYRKFRKFLLSEGFIMHQFSVYSKLLLNNTANNAMIGRLKVNNPKKGNITLLTVTEKQFARMVYLHGERNTSVANSDSRLVFLGDSYDQD >NZ_CP016501.1|WP_000929489.1|848198_849068_-|type-II-CRISPR-associated-endonuclease-Cas1 
MAGWRTVVVNTHSKLSYKNNHLIFKDSYQTEMIHLSEIDILIMETTDIVLSTMLIKRLVDENILVIFCDDKRLPTAMLMPYYARHDSSLQLSRQMSWIEDVKADVWTSIIAQKILNQSFYLGECSFFEKSQSIMNLYHDLEPFDPSNREGHAARIYFNTLFGNDFSREQDNPINAGLDYGYSLLLSMFAREVVKCGCMTQFGLKHANQFNQFNLASDIMEPFRPIVDRIIYENRQSDFVKMKRELFSMFSETYSYNGKEMYLSNIVSDYTKKVIKSLNSDGNGIPEFRI >NZ_CP016501.1|WP_001040087.1|849069_853182_-|type-II-CRISPR-RNA-guided-endonuclease-Cas9 MNKPYSIGLDIGTNSVGWSIITDDYKVPAKKMRVLGNTDKEYIKKNLIGALLFDGGNTAADRRLKRTARRRYTRRRNRILYLQEIFAEEMSKVDDSFFHRLEDSFLVEEDKRGSKYPIFATLQEEKDYHEKFSTIYHLRKELADKKEKADLRLIYIALAHIIKFRGHFLIEDDSFDVRNTDISKQYQDFLEIFNTTFENNDLLSQNVDVEAILTDKISKSAKKDRILAQYPNQKSTGIFAEFLKLIVGNQADFKKYFNLEDKTPLQFAKDSYDEDLENLLGQIGDEFADLFSAAKKLYDSVLLSGILTVIDLSTKAPLSASMIQRYDEHREDLKQLKQFVKASLPEKYQEIFADSSKDGYAGYIEGKTNQEAFYKYLSKLLTKQEDSENFLEKIKNEDFLRKQRTFDNGSIPHQVHLTELKAIIRRQSEYYPFLKENQDRIEKILTFRIPYYIGPLAREKSDFAWMTRKTDDSIRPWNFEDLVDKEKSAEAFIHRMTNNDFYLPEEKVLPKHSLIYEKFTVYNELTKVRYKNEQGETYFFDSNIKQEIFDGVFKEHRKVSKKKLLDFLAKEYEEFRIVDVIGLDKENKAFNASLGTYHDLEKILDKDFLDNPDNESILEDIVQTLTLFEDREMIKKRLENYKDLFTESQLKKLYRRHYTGWGRLSAKLINGIRDKESQKTILDYLIDDGRSNRNFMQLINDDGLSFKSIISKAQAGSHSDNLKEVVGELAGSPAIKKGILQSLKIVDELVKVMGYEPEQIVVEMARENQTTNQGRRNSRQRYKLLDDGVKNLASDLNGNILKEYPTDNQALQNERLFLYYLQNGRDMYTGEALDIDNLSQYDIDHIIPQAFIKDDSIDNRVLVSSAKNRGKSDDVPSLEIVKDCKVFWKKLLDAKLMSQRKYDNLTKAERGGLTSDDKARFIQRQLVETRQITKHVARILDERFNNELDSKGRRIRKVKIVTLKSNLVSNFRKEFGFYKIREVNNYHHAHDAYLNAVVAKAILTKYPQLEPEFVYGDYPKYNSYKTRKSATEKLFFYSNIMNFFKTKVTLADGTVVVKDDIEVNNDTGEIVWDKKKHFATVRKVLSYPQNNIVKKTEIQTGGFSKESILAHGNSDKLIPRKTKDIYLDPKKYGGFDSPIVAYSVLVVADIKKGKAQKLKTVTELLGITIMERSRFEKNPSAFLESKGYLNIRADKLIILPKYSLFELENGRRRLLASAGELQKGNELALPTQFMKFLYLASRYNESKGKPEEIEKKQEFVNQHVSYFDDILQLINDFSKRVILADANLEKINKLYQDNKENISVDELANNIINLFTFTSLGAPAAFKFFDKIVDRKRYTSTKEVLNSTLIHQSITGLYETRIDLGKLGED >NZ_CP016501.1|WP_000653360.1|853569_854226_-|TIGR01906-family-membrane-protein 
MKDKLLVVLTWIWIISLATLATIYIAWLIYPIEIQFLKLEKVVYLKAETIYYNFNKLMIYLTHPFISDLNMPSFPSSEDGLKHFADVKYLFTLAHGLFVILTFPVIYFLRSGWKQKSIFLYEGFFKIAIMLPIFIVVCAFLLGFDQFFTLFHEVLFPGDSTWQFNPLTDPVIWILPETFFLHCFIIFLLIYETITIILLIIGRKHLKLRKYKNKMQAL >NZ_CP016501.1|WP_000323544.1|854215_854986_-|TIGR01457-family-HAD-type-hydrolase MAYKGYLIDLDGTIYKGKSRIPAGERFIERLQEKGIPYMLVTNNTTRTPESVQEMLRGFNVETPLETIYTATMATVDYMNDMNRGKTAYVIGEEGLKKAIADAGYVEDTKNPAYVVVGLDWNVTYDKLATATLAIQNGALFIGTNPDLNIPTERGLLPGAGSLNALLEAATRIKPVFIGKPNAIIMNKALEILNIPRNQAVMVGDNYLTDIMAGINNDIDTLLVTTGFTTVEEVPDLPIQPSYVLASLDEWTFNEG >NZ_CP016501.1|WP_000523795.1|854986_855724_-|acyl-ACP-thioesterase MGLLYRETYEVPFYESDTNHYMKLPQLLALALQISAKQSLKLGIGDDIVFKRYGLVWVVTDYIIDIERLPKHAEKIVIETEAKAHNKLLCYRYFYIYGEDGQKIITISSAFVLMDFKTRKIHPVLDDITSIYQSQRIKKVIRGPKYHPIGDSKVKQYHVRYFDLDMNGHVNNSKYLEWMYDVLDLDFLSSHIPKKIDLKYIKEIQYGTDIKSHWYQDGLVTRHDIIGGDAIHAQARIEWQEKKED >NZ_CP016501.1|WP_000914664.1|855727_856858_-|coproporphyrinogen-III-oxidase MLKKPTSAYVHIPFCTQICYYCDFSKVFIKNQPVDAYLQALIREFRSYDITELRTLYIGGGTPTSISAVQLDYLLTELSRDLNLNTLEEFTIEANPGDLTVDKIEVLQKSAVNRVSLGVQTFNDKHLKRIGRSHNEAQIYSTIDALKTAGFQNISIDLIYALPGQTMDDVRSNVAKALSLNIPHLSLYSLILEHHTVFMNKMRRGKLHLPTEDLEAEMFEYIISEMERNGFEHYEISNFTKPGFESRHNLMYWDNVEYYGVGAGASGYLDGIRYRNRGPIQHYLKGVSEGNARLSEEVLSKNEMMEEELFLGLRKKEGVSIGKFEQKFGTSFEKRYGQIVQELQSDGLLKENNGFIQMTKKGLFLGDTVAEKFIVE >NZ_CP016501.1|WP_001241816.1|856949_857330_-|hypothetical-protein MRLWHQDIIELLPRQQLLGQHRECCALRGNGWGRKHETVDYVFRYSPYRLFAYHQLVMEEMMERGYRVSKEWLIAEYRGMKCPRYDTLNPVDLETPIYPEHNQDYLQECLWNLKAKGIDLPINKIK >NZ_CP016501.1|WP_000159234.1|857354_857726_-|DUF1722-domain-containing-protein MTKEAELLWAKHKYLVLSKSQKIYLDIRQTLKSPNCTVLDVQSLIDQAVLLEESPSQVTNAYMHIWGYFKNKAERQEKEEFLTLLEKYRKTGYQRRKLLAFLKQLLAKYPNSYLQNSSIFEEE |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP016501_2 | 1898343-1898429 | Orphan |
NA
Consensus repeat of NZ_CP016501_2
|
1 spacer
spacers of NZ_CP016501_2
>2.1|1898367|39|NZ_CP016501|CRISPRCasFinder GTTAAAGCAGACCAAGATGCTAAAGTTAAGTTGGCTAAG |
RT |
CRISPR arrays and Neighbor proteins around NZ_CP016501_2
The CRISPR arrays of NZ_CP016501_2 >merge|NZ_CP016501|2|1898343-1898429|CRISPRCasFinder GCTCAAACTAAGTACGATACTGCTGTTAAAGCAGACCAAGATGCTAAAGTTAAGTTGGCTAAGGCTCAAACTGAGTACGATACTGCT >NZ_CP016501|2|2|1898343-1898429|CRISPRCasFinder GCTCAAACTAAGTACGATACTGCT GTTAAAGCAGACCAAGATGCTAAAGTTAAGTTGGCTAAG GCTCAAACTGAGTACGATACTGCT
>NZ_CP016501.1|WP_000661406.1|1895977_1896313_-|XRE-family-transcriptional-regulator MKELNLRAYIGKRIRILRTQKHLTQQQLEEDADLPLKYTYKLENLEPNITVETLEKIMTALDTNIESFFDISLKEDNPLTNQLLSKIAEFPASKQERLLKLLINIIDEVEK >NZ_CP016501.1|WP_001180636.1|1894984_1895941_+|hypothetical-protein MQNNISCFEVLENLYKENTEFKNEFKQIIFDIAILKKYITYWQTYDFLTKFRDKILPKQFRGGSKGGFIEDYDSDNKMKEWKSFSAKGTTKFVISLVFQEIKEESEEIKDMLRGGIESYLQDKDPELNELLLSQEDKDLIRSCSNVVKKYENITKIYNVRYSIFELSSEQLNNMKSYLLQFIMDSQDKNNFDLEYTALEEYLNQKNFLNLLEELTANLINSQNKSEAQKNIMGIMFLLAKVRIGITESSALQTREVTNDKGRKPLLNIPTKMNLNELSEENQKLQKELEVLLEPLLNSRFILNNLYEFYFECRGNLQI >NZ_CP016501.1|WP_000458739.1|1894456_1894750_-|hypothetical-protein MFCQEILVTVSVLGVSPEIDFETKQATGNIKIDAGFRNSSGKHITRTIKIANSTVSEYSTYLDEKVNLRLERVTFSPYLTNGRAALSIKAESATIEE >NZ_CP016501.1|WP_000190889.1|1893934_1894375_-|hypothetical-protein MATIFKKTIKQTPTITFDKCVTECEKILGVLGIDQLVNLSVKDRMKLSKKFEYNSYDSRGILSLQALVPKYAIKKDSLLLKQRENSFVTTAQRQHAQKLYDKISRAVNSYPYGVITCYEFEQWIDEHLRLYIMLKYDKDFEGDNIL >NZ_CP016501.1|WP_000614740.1|1893407_1893938_-|hypothetical-protein MIQILQLVFYFIIIFFIPYLGLILVNTVILIYTLPAEQLLPQAIQNTNLQIQKWLKSFNVFPKERDCYSDFYIELTHQAIKLRPCLNTGLYPNGVQREGMDFYLWLSKVPDPKEFEPWKMQLRQTVRTYDDNLDVLFNMIEEEPFLKLKMNLVPKIELNQLEHGQPKALYNEEGEV >NZ_CP016501.1|WP_000027783.1|1892661_1893408_-|hypothetical-protein MSIRLKILRQEGSNEPVSWDTIRYPHMLISAPTNSGKSYFIKYLLSILAIHSTDASIMILDYKQGTDYWQWIDSDNVFLGDSVYDGFRQAYKIFENRRLDPKKEYPPFFLIFEEYQSTIESISKKADKDEFLRLVGNLLRLSRQVNYHLICICQRVDASVFPAGGRENFAAKISIGRLSPQAKQMLFPDDGVNCNKGQGEVNFQFDGQPVIEAKTYTIRDMEKAKHLITDLLNRGLPQSDETVAKGEP >NZ_CP016501.1|WP_000185201.1|1891226_1892498_-|replication-protein-RepA 
MTNSNISSDFRANSFCCVLNNVDKLFDKESVFFDSDKYKEKTKKRIQEFRKLLPDTNEPLNPEEIVDFLMERWMARNENVVCAINYEIGDNGVHHCHMVLEDKKAFRFSALQKLYPTIHAEITRGTKEQVIAYLEKSGEHEEKAHTIVVPMKVHGELRARNQGHRSDFDYIQRQIENGATPEEIMMGNLEYRKYSKMIREHFFQHRFAQTPDIKDMKVYWHVGESGSGKSHTQVHLKQEFGRDNVYIWTDFDNGGLDLYCAEPILFMDEFKGMSYKEFLKVTDVYPVQLHARYTNTIALWNEIHITSIFTPKEAYGLMVPESQQEVDSYKQLQRRLTNVIYHFKIREDDGNFKYKTITFTPEDFERHNQEQIEKFAYLFDKFNPNVLNYDFTKDAFHLSMNPINNKKKKATTLKLTKAESNSK >NZ_CP016501.1|WP_000097743.1|1890880_1891144_-|hypothetical-protein MSSNNFDLKQIELMLLTVIDLLKHLNTEKNKTGIIYTQKELLDLLSISPNTLKSWENRGLKRLEPPIEGTRTVYYHIDTVLEFLSHN >NZ_CP016501.1|WP_001019353.1|1889518_1890784_-|site-specific-integrase MNIEKIIHKGKTVLKKVKDNGEISYSAKSIYLGLDIKTGKPVKTTVTAKTLRSLDRKIIQAKIDFEESGSTRKETFSIATLSDLAELWFSNYETWVSSDNTLNRVRNYLDTYILPQFGQYQPDKVTSSDIQNWVNELATKSKESVDSGIKRAEKGCAKDFGAIAHKLSDIFDFGITHFELKHNPAQSIKIPPKPKSNQKRIMVLHDEDLTIWLNFVDTLPNTRANRRFKVICDSLLASGMRINELLALTIYDLDFESSEILVTKTLVWKNAKPKLGLKGKVVCKNTPKSDSGNRKIAVPYQIIEQLQNFHDEMNLYFKKNGLSKSKLIFPTIYGNYMCDRNERATLKRRLQEVGLPDYGFHLFRHTHASMMLNAGMNWKELQVRMGHKSIKTTMDIYAELAPKNQTQAVDIYLNKIAELTS >NZ_CP016501.1|WP_050886285.1|1888044_1889322_+|group-II-intron-reverse-transcriptase/maturase MSELLDKILSRNNMLEAYKQVKSNKGSAGIDGVTIEQMDDYLHQNWRETKQLIKERSYKPQPVLRVEIPKPNGGVRNLGIPTAMDRMIQQAIVQVLSPLCEKHFSEYSYGFRSNRSCETAIVQLLEYLNDGYEWIVDIDLEKFFDTVPQDRLMSLVHNIIQDGDTESLIRKYLHSGVVINGQRHKTLVGTPQGGNLSPLLSNIMLNELDKELEKRGLRFVRYADDCVITVGSEAAAKRVMHSVSSYIEKRLGLKVNMTKTKIVRPNKLKYLGFGFWKSPKGWKCRPHQDSVQSFKRKLKQLTMRKWSIDLITRIERLNWVIRGWINYFSLGNMKSIMTQIDERLRTRIRVIIWKQWKKKAKRLWGLLKLGVARWIADKVSGWGDHYQLVAQKSVLKRAISKPALAKRGLVSCLDYYLERHALKVS >NZ_CP016501.1|WP_000852726.1|1899297_1899759_+|hypothetical-protein MAEVTTIVYRLGSQYDEKMSAISVFDAHLFHAKCHGDKVLFTVPTQNNTKVGKKVEKIDNIILTLKDGSKSLFAEVDAHGVFPEKVKDGYIYTLPSQWNVFEEIEQGYTWFALKGVKEISEEELNSYKSTNERECPLLQSISGASCRIYVTEE >NZ_CP016501.1|WP_001292556.1|1900319_1901597_+|group-II-intron-reverse-transcriptase/maturase 
MSELLDKILSRNNMLEAYKQVKSNKGSAGIDGVTIEQMDDYLHQNWRETKQLIKERSYKPQPVLRVEIPKPNGGVRNLGIPTAMDRMIQQAIVQVLSPLCEKHFSEYSYGFRPNRSCETAIVQLLEYLNDGYEWIVDIDLEKFFDTVPQDRLMSLVHNIIQDGDTESLIRKYLHSGVVINGQRHKTLVGTPQGGNLSPLLSNIMLNELDKELEKRGLRFVRYADDCVITVGSEAAAKRVMHSVSSYIEKRLGLKVNMTKTKIVRPNKLKYLGFGFWKSPKGWKCRPHQDSVQSFKRKLKQLTMRKWSIDLITRIERLNWVIRGWINYFSLGNMKSIMTQIDERLRTRIRVIIWKQWKKKAKRLWGLLKLGVARWIADKVSGWGDHYQLVAQKSVLKRAISKPALAKRGLVSCLDYYLERHALKVS >NZ_CP016501.1|WP_001265622.1|1901896_1902046_-|50S-ribosomal-protein-L33 MRVNITLEHKESGERLYLTSKNKRNTPDRLQLKKYSPKLRKHVVFTEVK >NZ_CP016501.1|WP_000290414.1|1902061_1902244_-|50S-ribosomal-protein-L32 MAVPARHTSKAKKNKRRTHYKLTAPSVQFDETTGDYSRSHRVSLKGYYKGRKIAKANEAK >NZ_CP016501.1|WP_000775906.1|1902463_1903744_+|histidine--tRNA-ligase MKLQKPKGTQDILPGESAKWQYVENVIRNLFKQYHYDEIRTPMFEHYEVISRSVGDTTDIVTKEMYDFHDKGDRHITLRPEGTAPVVRSYVENKLFAPEVQKPTKMYYIGSMFRYERPQAGRLREFHQVGVECFGSNNPATDVETIAMGHHLFEDLGIKNVKLHLNSLGSPESRQAYRQALIDYLTPIREQLSKDSQRRLNENPLRVLDSKEPEDKLAVENAPSILDYLDESSQAHFDAVCHMLDALNIPYIIDTNMVRGLDYYNHTIFEFITEIEDNELTICAGGRYDGLVSYFGGPETPAFGFGLGLERLLLILGKQGIPLPIENTIDLYIAVLGSEANLAALDLAQSIRHQGFKVERDYLGRKIKAQFKSADTFNAKVIMTLGSSEVDSREVSLKNNQTRQEVKVSFENIKTGFSSVLKQLGL >NZ_CP016501.1|WP_000830936.1|1903836_1905588_+|aspartate--tRNA-ligase MKRSMYAGRVRSEHIGTSITLKGWVGRRRDLGGLIFIDLRDREGIMQLVINPEEVSASVMATAESLRSEFVIEVSGVVTAREQANDNLPTGEVELKVQELSVLNTSKTTPFEIKDGIEANDDTRMRYRYLDLRRPEMLENFKLRAKVTHSIRNYLDNLEFIDVETPMLTKSTPEGARDYLVPSRVNQGHFYALPQSPQITKQLLMNAGFDRYYQIVKCFRDEDLRGDRQPEFTQVDLETSFLSDQEIQDIVEGMIAKVMKDTKGLEVSLPFPRMAYDDAMNNYGSDKPDTRFDMLLQDLTEVVKEVDFKVFSEASVVKAIVVKNKADKYSRKNIDKLTEIAKQYGAKGLAWLKYADNTISGPVAKFLTAIEDRLTEALQLENNDLILFVADSLEVANETLGALRTRIAKELELIDYSKFNFLWIVDWPMFEWSEEEGRYMSAHHPFTLPTAETAHELEGDLAKVRAVAYDIVLNGYELGGGSLRINQKDTQERMFKALGFSAESAQEQFGFLLEAMDYGFPPHGGLAIGLDRFVMLLAGKDNIREVIAFPKNNKASDPMTQAPSLVSEQQLEELSLTVESYEN >NZ_CP016501.1|WP_000857770.1|1905577_1906522_+|YitT-family-protein 
MKTRRTPLEKKVKYIISVWAKKFGLLHTLKSISREKYAEKISASLLYGILSSVAVNFFFQPGHVYSSGATGLAQVISAVSKHWFSFEIPVALAFYAINIPLLILSWRKIGHKFTIFTFITVTVSSIFIQLMPQITLTTDPLINAIFGGLIMGAGVGFSFKSRISSGGTDIISLTIRKKTGRDVGSISFIINGIILLFAGLLFGWKYALYSMVTIFVSSRVTDAIFTKQKKMQAMIVTSKPDCVIKRIHRDLHRGVTCINDAEGTYNHEKKAVLITILTREEFSDFKYLMLKADPKAFVSVAENVHIIGRFVDDD >NZ_CP016501.1|WP_000915478.1|1906629_1907502_+|YitT-family-protein MLKLDLKTKIKEAILIALGVALYTFGFVKFNMANHLAEGGISGVTLIIHALFGVNPALSSLLLNIPLFILGARILGKKSLLLTIYGTVLMSFFMWFWQQIPVTVPLKNDMMLVAVAAGILAGTGSGLVFRYGATTGGADIIGRIVEEKSGIKLGQTLLFIDAIVLTSSLVYINLQQMLYTLVASFVFSQVLTNVENGGYTVRGMIIITKESESAAATILHEINRGVTFLRGQGAYSGREHDVLYVALNPSEVRDVKEIMADLDPDAFISVINVDEVISSDFKIRRRNYDK >NZ_CP016501.1|WP_001138859.1|1907528_1907837_-|bacteriocin-immunity-protein MPSEKEILDALSKVYSEEVIQADDYFRQAIFELASQLEKEGMNSLLATKIDSLINQYVLTHQFDAPKSIFDLSRLVKTKASHYKGTAISAIMLGSFLSGGPK >NZ_CP016501.1|WP_000379866.1|1907924_1909616_-|arginine--tRNA-ligase MDTKHLIASEIQKVVPDMEQSTILSLLETPKNSSMGDLAFPAFSLAKTLRKAPQIIASDIAEQIKSDQFEKVEAVGPYVNFFLDKAAISSQVLKQVLSDGSAYATQNIGEGRNVAIDMSSPNIAKPFSIGHLRSTVIGDSLANIFDKIGYHPVKINHLGDWGKQFGMLIVAYKKWGNEEAVRAHPIDELLKLYVRINAEAETDPSVDEEAREWFRKLEANDPEATELWQWFRDESLLEFNRLYDKMNVTFDSYNGEAFYNDKMEEVLELLESKNLLVESKGAQVVNLEKYGIEHPALIKKSDGATLYITRDLAAALYRRRTYDFAKSIYVVGNEQSAHFKQLKAVLKEMDYDWSDDMTHVPFGLVTKGGAKLSTRKGNVILLEPTVAEAINRAASQIEAKNPNLADKDKVAQAVGVGAIKFYDLKTDRTNGYDFDLEAMVSFEGETGPYVQYAHARIQSILRKANFNPSNSDNYSLNDVESWEIIKLIQDFPRIIVRAADNFEPSIIAKFAINLAQCFNKYYAHTRILDEDAEISSRLALCYATATVLKESLRLLGVDAPNEM |
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
NZ_CP016501_1 | 1.3|846976|29|NZ_CP016501|CRT,PILER-CR,CRISPRCasFinder | 846976-847004 | 29 | NZ_CP016501.1 | 2094020-2094048 | 0 | 1.0 |
1. spacer 1.3|846976|29|NZ_CP016501|CRT,PILER-CR,CRISPRCasFinder matches to position: 2094020-2094048, mismatch: 0, identity: 1.0
ttgatagatttcttacaaaatgaacgcat CRISPR spacer ttgatagatttcttacaaaatgaacgcat Protospacer *****************************
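The Spacer_Info strings in these tables appear to pack five pipe-delimited fields into one identifier: spacer ID, start coordinate, spacer length, contig accession, and the detection tools that reported the spacer. The following is a minimal Python sketch, assuming that field order and 1-based inclusive coordinates (both inferred from the rows above, not stated by the page). It recovers the Spacer_region column from the identifier alone. The names `SpacerHeader` and `parse_spacer_header` are hypothetical and are not part of any tool named here.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class SpacerHeader:
    spacer_id: str    # e.g. "1.3" (array number, then spacer index; assumed)
    start: int        # start coordinate on the contig (assumed 1-based)
    length: int       # spacer length in nucleotides
    contig: str       # contig accession
    tools: List[str]  # detection tools that reported this spacer

    @property
    def region(self) -> str:
        # Inclusive end coordinate: start + length - 1 (assumption).
        return f"{self.start}-{self.start + self.length - 1}"


def parse_spacer_header(header: str) -> SpacerHeader:
    """Split a Spacer_Info string into its five assumed fields."""
    fields = header.strip().lstrip(">").split("|")
    if len(fields) != 5:
        raise ValueError(f"expected 5 pipe-delimited fields, got {len(fields)}")
    spacer_id, start, length, contig, tools = fields
    return SpacerHeader(spacer_id, int(start), int(length), contig, tools.split(","))


hdr = parse_spacer_header("1.3|846976|29|NZ_CP016501|CRT,PILER-CR,CRISPRCasFinder")
print(hdr.region)
```

Under these assumptions the computed region for spacer 1.3 agrees with the Spacer_region value in the self-targeting table (846976-847004).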
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP016501_1 | 1.3|846976|29|NZ_CP016501|CRT,PILER-CR,CRISPRCasFinder | 846976-847004 | 29 | MK308676 | Vibrio phage vB_VmeM-Yong MS31, complete genome | 172843-172871 | 7 | 0.759 |
NZ_CP016501_1 | 1.3|846976|29|NZ_CP016501|CRT,PILER-CR,CRISPRCasFinder | 846976-847004 | 29 | MK308675 | Vibrio phage vB_VmeM-Yong XC32, complete genome | 172842-172870 | 7 | 0.759 |
NZ_CP016501_1 | 1.3|846976|29|NZ_CP016501|CRT,PILER-CR,CRISPRCasFinder | 846976-847004 | 29 | MK308677 | Vibrio phage vB_VmeM-Yong MS32, complete genome | 172843-172871 | 7 | 0.759 |
NZ_CP016501_1 | 1.3|846976|29|NZ_CP016501|CRT,PILER-CR,CRISPRCasFinder | 846976-847004 | 29 | MK308674 | Vibrio phage vB_VmeM-Yong XC31, complete genome | 172842-172870 | 7 | 0.759 |
NZ_CP016501_1 | 1.3|846976|29|NZ_CP016501|CRT,PILER-CR,CRISPRCasFinder | 846976-847004 | 29 | NZ_CP053657 | Bacillus cereus strain CTMA_1571 plasmid p.1, complete sequence | 429648-429676 | 7 | 0.759 |
NZ_CP016501_1 | 1.3|846976|29|NZ_CP016501|CRT,PILER-CR,CRISPRCasFinder | 846976-847004 | 29 | MN855763 | Myoviridae sp. isolate 210, complete genome | 13489-13517 | 7 | 0.759 |
NZ_CP016501_1 | 1.2|846910|29|NZ_CP016501|CRT,PILER-CR,CRISPRCasFinder | 846910-846938 | 29 | NZ_CP020580 | Acinetobacter baumannii strain SSMA17 plasmid pSSMA17_1, complete sequence | 55464-55492 | 8 | 0.724 |
NZ_CP016501_1 | 1.2|846910|29|NZ_CP016501|CRT,PILER-CR,CRISPRCasFinder | 846910-846938 | 29 | NZ_CP020582 | Acinetobacter baumannii strain JBA13 plasmid pJBA13_1, complete sequence | 16314-16342 | 8 | 0.724 |
NZ_CP016501_1 | 1.2|846910|29|NZ_CP016501|CRT,PILER-CR,CRISPRCasFinder | 846910-846938 | 29 | NZ_CP017657 | Acinetobacter baumannii strain KAB08 plasmid unnamed, complete sequence | 56273-56301 | 8 | 0.724 |
NZ_CP016501_1 | 1.2|846910|29|NZ_CP016501|CRT,PILER-CR,CRISPRCasFinder | 846910-846938 | 29 | NZ_CP017649 | Acinetobacter baumannii strain KAB04 plasmid unnamed, complete sequence | 58336-58364 | 8 | 0.724 |
NZ_CP016501_1 | 1.2|846910|29|NZ_CP016501|CRT,PILER-CR,CRISPRCasFinder | 846910-846938 | 29 | NZ_CP050915 | Acinetobacter baumannii strain DT-Ab007 plasmid unnamed1, complete sequence | 22391-22419 | 8 | 0.724 |
NZ_CP016501_2 | 2.1|1898367|39|NZ_CP016501|CRISPRCasFinder | 1898367-1898405 | 39 | DQ535032 | Lactococcus lactis phage KSY1, complete genome | 76516-76554 | 11 | 0.718 |
NZ_CP016501_2 | 2.1|1898367|39|NZ_CP016501|CRISPRCasFinder | 1898367-1898405 | 39 | NC_009817 | Lactococcus phage KSY1, complete genome | 76516-76554 | 11 | 0.718 |
1. spacer 1.3|846976|29|NZ_CP016501|CRT,PILER-CR,CRISPRCasFinder matches to MK308676 (Vibrio phage vB_VmeM-Yong MS31, complete genome) position: , mismatch: 7, identity: 0.759
ttgatagatttcttacaaaatgaacgcat CRISPR spacer cgtccaaatttcttagaaaatgaacgcat Protospacer . .*.******** *************
2. spacer 1.3|846976|29|NZ_CP016501|CRT,PILER-CR,CRISPRCasFinder matches to MK308675 (Vibrio phage vB_VmeM-Yong XC32, complete genome) position: , mismatch: 7, identity: 0.759
ttgatagatttcttacaaaatgaacgcat CRISPR spacer cgtccaaatttcttagaaaatgaacgcat Protospacer . .*.******** *************
3. spacer 1.3|846976|29|NZ_CP016501|CRT,PILER-CR,CRISPRCasFinder matches to MK308677 (Vibrio phage vB_VmeM-Yong MS32, complete genome) position: , mismatch: 7, identity: 0.759
ttgatagatttcttacaaaatgaacgcat CRISPR spacer cgtccaaatttcttagaaaatgaacgcat Protospacer . .*.******** *************
4. spacer 1.3|846976|29|NZ_CP016501|CRT,PILER-CR,CRISPRCasFinder matches to MK308674 (Vibrio phage vB_VmeM-Yong XC31, complete genome) position: , mismatch: 7, identity: 0.759
ttgatagatttcttacaaaatgaacgcat CRISPR spacer cgtccaaatttcttagaaaatgaacgcat Protospacer . .*.******** *************
5. spacer 1.3|846976|29|NZ_CP016501|CRT,PILER-CR,CRISPRCasFinder matches to NZ_CP053657 (Bacillus cereus strain CTMA_1571 plasmid p.1, complete sequence) position: , mismatch: 7, identity: 0.759
ttgatagatttcttacaaaatgaacgcat CRISPR spacer ttgatagatttcttgcaaaatacgtactt Protospacer **************.******. ...* *
6. spacer 1.3|846976|29|NZ_CP016501|CRT,PILER-CR,CRISPRCasFinder matches to MN855763 (Myoviridae sp. isolate 210, complete genome) position: , mismatch: 7, identity: 0.759
ttgatagatttcttacaaaatgaacgcat CRISPR spacer attttgtttttctttcaaaatgaacgcat Protospacer * *. ****** **************
7. spacer 1.2|846910|29|NZ_CP016501|CRT,PILER-CR,CRISPRCasFinder matches to NZ_CP020580 (Acinetobacter baumannii strain SSMA17 plasmid pSSMA17_1, complete sequence) position: , mismatch: 8, identity: 0.724
ggttatatggtcgatatgattctacaaca CRISPR spacer aaatgcttggtcgatttgattctacaacc Protospacer .. *.. ******** ************
8. spacer 1.2|846910|29|NZ_CP016501|CRT,PILER-CR,CRISPRCasFinder matches to NZ_CP020582 (Acinetobacter baumannii strain JBA13 plasmid pJBA13_1, complete sequence) position: , mismatch: 8, identity: 0.724
ggttatatggtcgatatgattctacaaca CRISPR spacer aaatgcttggtcgatttgattctacaacc Protospacer .. *.. ******** ************
9. spacer 1.2|846910|29|NZ_CP016501|CRT,PILER-CR,CRISPRCasFinder matches to NZ_CP017657 (Acinetobacter baumannii strain KAB08 plasmid unnamed, complete sequence) position: , mismatch: 8, identity: 0.724
ggttatatggtcgatatgattctacaaca CRISPR spacer aaatgcttggtcgatttgattctacaacc Protospacer .. *.. ******** ************
10. spacer 1.2|846910|29|NZ_CP016501|CRT,PILER-CR,CRISPRCasFinder matches to NZ_CP017649 (Acinetobacter baumannii strain KAB04 plasmid unnamed, complete sequence) position: , mismatch: 8, identity: 0.724
ggttatatggtcgatatgattctacaaca CRISPR spacer aaatgcttggtcgatttgattctacaacc Protospacer .. *.. ******** ************
11. spacer 1.2|846910|29|NZ_CP016501|CRT,PILER-CR,CRISPRCasFinder matches to NZ_CP050915 (Acinetobacter baumannii strain DT-Ab007 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.724
ggttatatggtcgatatgattctacaaca CRISPR spacer aaatgcttggtcgatttgattctacaacc Protospacer .. *.. ******** ************
12. spacer 2.1|1898367|39|NZ_CP016501|CRISPRCasFinder matches to DQ535032 (Lactococcus lactis phage KSY1, complete genome) position: , mismatch: 11, identity: 0.718
gttaaagcagaccaagatgctaaagttaagttggctaag CRISPR spacer gaacaagcagaacaagatgctaaagttgagttattgtat Protospacer * ******* ***************.****. . *
13. spacer 2.1|1898367|39|NZ_CP016501|CRISPRCasFinder matches to NC_009817 (Lactococcus phage KSY1, complete genome) position: , mismatch: 11, identity: 0.718
gttaaagcagaccaagatgctaaagttaagttggctaag CRISPR spacer gaacaagcagaacaagatgctaaagttgagttattgtat Protospacer * ******* ***************.****. . *
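The Mismatch and Identity values listed above can be reproduced from each spacer/protospacer pair. The sketch below is a minimal Python illustration, assuming identity is defined as (length - mismatches) / length rounded to three decimals. That definition is consistent with the reported values (for example 7 mismatches over 29 nt giving 0.759) but is not stated by the source. The function name `spacer_identity` is hypothetical.

```python
from typing import Tuple


def spacer_identity(spacer: str, protospacer: str) -> Tuple[int, float]:
    """Return (mismatch count, identity) for two equal-length, ungapped sequences."""
    if len(spacer) != len(protospacer):
        raise ValueError("sequences must have the same length (ungapped comparison)")
    if not spacer:
        raise ValueError("sequences must not be empty")
    # Count positions where the bases differ, ignoring case.
    mismatches = sum(a != b for a, b in zip(spacer.lower(), protospacer.lower()))
    # Assumed definition: fraction of matching positions, rounded to 3 decimals.
    identity = round((len(spacer) - mismatches) / len(spacer), 3)
    return mismatches, identity


# Spacer 1.3 against the Vibrio phage protospacer listed above.
result = spacer_identity(
    "ttgatagatttcttacaaaatgaacgcat",
    "cgtccaaatttcttagaaaatgaacgcat",
)
print(result)
```

A position-by-position comparison like this only applies to ungapped hits of equal length, which is how every alignment in this section is presented.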
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
170766 : 182275
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP016501|170766:182275|DBSCAN-SWA ATTGAAAGTAGAAAATCTATCTATTCACTATGGTGTTATTCAAGCGGTGAATGATGTCTCGTTTGAAGTTAACCAAGGTGAGGTTGTAACCTTAATTGGAGCCAATGGTGCAGGAAAGACATCAATTTTGAGGACAATTTCTGGTTTAGTAAGACCAAGTCAGGGTTCTATTTCTTTTATGGGGAAACCTATACATAAGTTAGCAGCTAGAAAAATTGTTGGCAATGGTTTAGCTCAGGTTCCAGAAGGGCGCCACGTTTTCTCAAGTTTGTCTGTTATGGAAAATTTAGAAATGGGAGCCTTTCTTCAAAAGGATCGTGAACAAAATCAAAAAATGTTAAAAAAAGTGTTTGATCGTTTTCCTCGCTTAGAAGAACGTAAAAACCAAGATGCAGCAACCTTATCAGGTGGTGAGCAGCAAATGCTCGCTATGGGGCGTGCATTGATGAGTCGTCCTAAATTATTACTTTTAGATGAACCGTCAATGGGCCTAGCTCCAATTTTTATTCAAGAAATTTTTAATATCATTGAAGACATCAAAAAACAAGGGACTACAGTTCTTTTAGTTGAACAAAATGCAAATAAAGCTCTGACGATTGCTGATAAAGCATATGTTCTAGAGACTGGGAAGGTTGTTTTATCAGGGACAGGTAAGGAATTACTAGTTTCAGATCAAGTTCGTAAAGCTTACTTAGGTGGATAATATTTAGAGGAGTCAGGTATGCCAGTGAAAGATTTTATGACAAAAAAATTAGTTTATGTCTCTCCAGATACTACAGTTGCAGAAGCTGCTGATTTATTGAGAGAACACCACTTAAGGCGTTTGCCAGTTGTTGAAAATGATCAGCTAGTGGGGTTAGTGACAGAAGGGACAATGGCAGAAGCACAACCTTCTAAAGCAACCAGTCTTTCTATATATGAGATGAACTATCTTCTTAATAAGACAAAAATTCGAGATATTATGATTAAAGATATTGTGACTGTTTCACAGTATGCTAGCCTTGAGGACGCTATTTATTTGATGATGTCTCGAAAGATCGGTGTTTTACCAGTAGTTGATAATGGACAACTTTATGGGATTGTCACGGATAGGGATGTTTTTAAAGCCTTCCTTGAAATTGCGGGGTATGGTCAAGAAAGTTATCGCTTGGTTATATTAGCTGATGAGGGGATAGGGGTACTATCAAAAGTCCTAAATCGTCTTTCTAGTGCAAATTTAAGTGTTAAACGTTTAGTCATTATTGAACGAAAAGCAGGTAAGAAAGCGGTCGAGATACAACTAGAAGGATACGCTGACAAGGATGTTTTAAAACAAGAACTAGTATTTGATGATGTTATTGTGGAAACTTTAGAAAAGACCATGAAAAAACCACTTTCTTAAAAATATCAGCTAAATTCTCAAAGTAAGTTCCACCTCTAGTAGAAATGGAATTTAGTCTGATAAAAACCAATAAAAACCAACGAAAATCCGTCTCAGACTAAGTTCCGTTGGTTTTTATAATACTTGATTTCATAGGTTTTCAGAGTTAAAAATCGCAAAAAAAGGGTACAAGAAAACATCCTCATCTGAAAGCGTTTCAGATTAAGTTCCGCTATGGTTGAAGTTAGTGTGAGACGTTTAGGTTGTATCAGTTAGTTGATTAGGTTAGATTTTGTCTTTTAATAGTTTGGGAGATTGGCAGACGTCTTCCGCAGTAATTTCCTCAATACCGAGTATACTTGCGATGTCTTTTCCGTAAGTAAAAGAGAATAGCTCATAAGCCGTTTTACCATTTAGAGCCTGTCTCTTAACGCTGTTGATATGAGATAGTGCCAGATTAATGTCCTCTTGAGTCAAGTTGTCGAAGGAAGTCCCCTTAGGCAGTATGTCTCTCACGAGGGTATGATTCTTCTCGATTCTTGCTTTCTGGTCAGACCGATTAGGG
TCACAAAAGAAGAGTTGAGACTGTCCACAAACATCTATCTCAATATCATCCACTCTAGCGAATTCCCCACCATTATCCGTTAGGATAACAGGAAAGAGTTCGAAGAAGTCTCTCTTGTTATCATAGAGTGTCCTCTTAATGACCTGTATGTGTTTAGCAGTTTCGATAGCTGTCTTAGAATCCATCAGTTTGGCAAAGATGAAGTTACAGAAGGCGACATTGAAGGTGAGAAGGACCTTTCCACCAATCCGTCCAATAACCGTGTCCATTTCAAGCCAGGAGTTCAGCTCTGATTGGTTCATGTGTTCTATAAAATCCTCGTACCGTCGCCCTTCTTTAATCGCTTTTGGAATAGGAGGGAGTGTGCTCTTTCGCCTTTTTTTGAACTTCACAGCTCTGGGTAGGTCGATTGGTGTGATGGATAGGTAGCCTTTCTTGATGTGTCGATAAACGGTTGAAGAACTCACTTCTAGATCGTTGGTTTTGAGGATATGATAGATACGTTGGCCCTTCTTGACCCCATTAGAAAGCACTCTATCTATTTTCCAGAAGGTCTCTTTGTTCAAAGGGATTCCTTCTCTAGATTCAACTAAAAGTTTTTCATAGTTTCTTTGAGCTTGCTTAGCAAGATAGAAGGTCTTCTTGTAACCACAGTTGATCCTCCTCTTTGGACACCCATTACAAACATAGGTAGCTTTTCTTAGTAGCGGACAGTCTAGGCAGTCTTTGGTTGATTCTCTTAGTTGCTTATTTCGTTTGACTTCTTTTGCGATAGTAGTAGGGTGTTTCAGTAGATTAAGCCCGATGGCTTTAAAGGTTTCTCCTCTATCCAAACCAGATTGGATGTCATTACGGTCTAAGAGAGTTAGGTGTTTGTGTTTTGTCATGATGGATAAACTCATTTCTAACCCAAACGTCTCAGACTTAATTCCACCATAAAAATCCTTTTCACACTAACTTCCGGTTTTGGAGGGCAGACGGAACTTACTTTGAGAAATTACTCTTAAAAATATCAGACATGCACAATCAAGTTGTGGGTGTCTTTTTTTGATAAATTTTGATAGAATAGTAGTAATTATAAATAAAGGATGTACCATGAAAAAAGGACTAATGATTTCTTTTGAGGGCCCAGATGGTGCGGGTAAAACAACCGTTTTAGAAGCTGTTTTACCCTTGCTCAGAGAAAAATTATCACAAGATATTCTCACCACCAGAGAGCCTGGTGGCGTAACTATTTCAGAAGAAATTCGTCATATTATTTTAGATGTTAAACACACGCAAATGGATAAAAAAACAGAGCTATTGCTTTACATGGCTGCTCGACGCCAACATCTAGTGGAAAAGGTACTCCCAGCTCTAGAAGAAGGGAAAATAGTCCTAATGGATCGTTTTATTGATAGTTCTGTTGCCTATCAGGGATCAGGAAGAGGATTAGATAAGTCACATATAAAATGGCTTAATGACTATGCGACTGATAGTCATAAACCTGATTTAACTTTATATTTTGATGTTCCTTCTGAAGTAGGTTTAGAGCGTATTCAAAAAAGTGTTCAAAGGGAGGTAAATCGTCTAGATCTCGAGCAATTAGATATGCATCAACGCGTCCGTCAAGGTTACCTTGAATTGGCGGATTCGGAGCCAAACCGCATTGTGACGATTGATGCCTCCCAACAGCTAGATGAAGTAATAGCTGAGACATTCTCTATTATTTTAGATCGTATCAACCAATGAATCAGGAAGTACAAAGGTCATGGATTTAAAGCGAACACAACCCAAATTGCTGGAAAAGTTTAATACTATTTTACAGTCAGATAGGATGAGTCATGCCTATCTTTTTTCGGGAAATTTTGCAAGTCTAGATATGGCACTTTATTTAGCACAAAGTCAGTTCTGTGAAAAGCGCCAGAGTGGTTTACCATGTCAAGAGTGTAGAGCCTGTCGTTTAATCGCTAATGGGGAATTTTCAGATGTTAAAATCATAGAACCACAGGGGCAACTTA
TTAAGACAGAGACTATAAAAGAGCTGACTAAAGATTTTTCAAGATCAGGTTTCGAAGGGAAATCACAGGTCTTTATCATCAAAGATTGCGAAAAAATGCATGTCAATGCAGCCAACAGTCTTTTGAAATTTATTGAAGAACCTCAAAGTTCATCTTATGTCATCTTATTGACTAATGATGAAAACAATGTTTTGCCAACAATTAAGAGTAGAACGCAAATTTTTCGTTTTCCAAAACAATTGGACATGTTGGTTCATCAAGCTGAACAAGCAGGATTGCTAAAATCACAAGCTAGCTTATTAGCTCAAGTAGCAGACGATCCTAAACATTTAGAAATATTGTTGACTAACAAAAAGTTATTAGATTACTTGAATTTAAGTCAACAGTTTGTGACGACTTTGGCAAAAGATAGGCAAACTGCTTATTTAGAAGTATCTCGCTTGACATCGCAAGTTGTAGATAAAAATGATCAGGCATTTGTTTTTCAATGGCTGACAATAATGCTGGCTAAAGAAGGCCAACTTTATGATTTAGAAAATACTTATAGAGCTCAGCAAATGTGGAAGAGCAATGTTAGTTTTCAAAATAGTTTAGAGTATATGGTGCTTTCTTAGAAAATAATACGCGCAGAGAGGAAGAGTGAATGGATAAAAAAGACTTGTTTGATGCCTTTGATGATTTTTCACAAAATTTATTAGTTGGGTTGTCTGAAATTGAGACTATGAAAAAACAAATCCAAAAATTATTGGAGGAGAATACTGTTCTACGCATTGAAAATGGTAAACTTCGTGAGCGTCTGAGCGTTATTGAAGCAGAAACAGAAACTGCAGTTAAAAATTCTAAACAAGGAAGAGAATTACTCGAAGGCATTTACAATGATGGCTTTCACATTTGTAATACTTTCTATGGTCAACGTCGTGAAAATGACGAAGAATGTGCTTTTTGTATTGAATTATTATATAGAGATTGATGGAAATGCAAGTTCAAAAAAGTTTTAAATCAAATATACATTACGGAACACTCTATCTAGTCCCAACTCCAATTGGTAATCTAGATGATATGACTTTTCGTGCCATTAGGATTTTAAGAGAAGTTGATTTTATTTGTGCAGAGGATACACGAAATACGGGACTTTTACTCAAGCACTTTGATATTACTACTAAACAAATTAGTTTTCACGAACACAATGCTTACGATAAAATCTCTGGGTTAATTGATTTGTTAAAAGAAGGGAAATCTTTAGCCCAAGTATCTGATGCAGGAATGCCCTCTATTTCTGACCCAGGACATGACCTTGTCAAGGCTGCTATTGAAGGGGATATCCCAGTTGTATCTATACCAGGAGCTAGCGCTGGTATTACTGCTCTCATCGCTTCAGGTTTAGCTCCACAACCTCATATTTTTTATGGCTTCTTACCACGTAAGAAAGGTCAACAAATAACTTTCTTTGAAACAAAGCAAGATTACCCTGAAACACAAATCTTTTATGAGTCACCGTTTCGAGTCTCTGATACGCTAAAACACATGAAAGAGATTTACGGAGATCGCCAAGTTGTTTTAGTACGCGAATTGACGAAACTCTATGAAGAGTATCAAAGAGGAACCATTAGTCAACTTTTAGAGCATATTGAAAAGGTCCCTCTCAAAGGTGAATGCTTAATTATTGTTGATGGTAAGAGAGATACCGAGCGAGTGAAAGACAGTAGCCAACAAGATCCACTAGTATTAGTAAAAGAATATATCGCTAATGGTGATAAAACTAATCAAGCGATAAAAAAAGTAGCAAAAGAATTTAATCTCAATAGACAAGAACTCTATGCTAGTTTCCATGATTTATAAGTAATAATATAAAAAGGTTGCCAAAGAAAAATTTGGCAACCTTTTATAATCTCCGATTTAGCTATTGGAATAAAGAAGTAAGATACTTCCAATTTCCTTAAAATTTTTAATTTCAGCTCTTTTGAATACCATTGCAAATTGGATTTTATTTCTGAGAGTAG
GTTATAAAAGATGGTATGCTCTTATTCCGGTTCTTTCAGGTTTTAGAAAACATGAGAGAACTTAGTAGTAAATATACTAAAGAAACTTATAGCCCTGTTTCGAATTTTTTATTGAGTTATTCTAGTACCATGGGCTTGTTTAACCCCAGTTTCTTGGCAAATATATTGATAGTTTTCAGCAGTCACACCGCCGCCAACCATAATTTCAATACGATTATTAGCATATTCAACTAATGCTTTGATATGTTTGATATTTTCTATAATTGGTTCGCCATTAGATGAACCATGTAGGAGAATCCTTGTAAAGCCTAAAGCAACTAATTGATCTATTGATTTTTTTTGGTCAGATTTCGGGATAACATCAAATGCCATATGAAAAACAAGTGGCAAACCTTGGGTAGCAGGTAGAAGTTGTTCAATGGCTTCAGTATCAATATGATTATTTGAAGTTAATATTCCCAAAACAAGAGCATCTGATTCTAACTCAACAGCACGTAGGATATCCTCTTCCATAATACGTAATTCTAAATCATTGTATACAAAATTTCCTCCACGGGGACGAATCATAACAGCGACACTAATACCTTTTTCATGTAAATATTGATTAGCTTCTTTGATAACACCGTAGCTAGGAGTAGTGCCTCCGACAGCAAGGTTATCACAAAGCTCGACACGTGAAATAATAGCTTTATCTAATCTGGTTAAGTCTGTTAAATTCTCTGCACAAAATTCCCTCAAAATCATAAATTACTCCTTTAAACTTGATTGTTCGCCTTTACCATTATATCACGAATAAAAAGCAAAATTAATCAAAAAAATCTTTACAATATTCAAAATACTGTGCTATAATCTTACACAATTATATGGAGGTGTTATATGACAATTTATAATTTTTCTGCAGGTCCTGCAGTCTTACCAAAACCAGTGCTTGTAAAAGCCCAATCTGAACTCCTTAATTATCAAGGGTCTAGTATGAGTGTTTTGGAGGTATCGCACCGCTCCAAAGAGTTTGATGATATTATCAAAGGTGCTGAGCGTTACTTAAGAGATTTGATGGGAATTCCGGATAATTATAAAGTAATCTTTTTACAAGGTGGTGCATCCTTACAGTTTAGTATGATTCCCTTGAATATTGCCAGAGGGCGTAAAGCTTACTATCATGTTGCTGGTTCGTGGGGGGAAAAAAGCTTATACAGAGGCTGTAAAACTATCAAAAACAATTCCTTTTGAACCAATTCTATTAGCTTCATCAGAAGAATCAGTTTATGATTATATTCCAGAGTTTGATGAAAAAGAGATTGATCCTAAAGCTGCCTATGTTCATGTGACAACAAATAATACCATTGAAGGAACATCGCTTTATGATATTCCTAAAACTAATGAAGTGCCTGTTATTGCGGATATGTCTTCTAATATATTAGCTGTAAAATATAAAGTAGAAGATTTTGCAATGATTTATGCAGGAGCTCAAAAAAATATTGGACCTGCTGGTGTAACGGTAGTTATTATCCGTGAAGATATGATTAATGAGGAACCAACTTTATCATCAATGTTGGATTATAAGATTCAGTCAGATGCTGGTTCTCTTTATAATACGCCGCCTGCTTACAGTATTTACATTGCTAAATTAGTCTTTGAATGGGTGAAAAGTCTAGGTGGTGTTGATGCTATGGAAAAAGCTAATCGTGAAAAATCAGGACTTCTCTATGATTATATTGATTCCTCAGAATTTTATAGTAATCCTGTCAGAGATAAAAAGAGTCGTTCTCTATGTAACATTCCTTTCATAACTATTAATAAAGATTTAGATGAAAAATTTGTAAAGGAAGCGACAGAACGTGGATTTAAAAATATTAAAGGACATCGTTCGGTAGGTGGTATGAGAGCTAGCCTTTATAATGCCTTTCCTAAACAAGGTGTCATCGAATTAATTGATTTTATGAAAACATTTGAAGCAGAAAATACATAGTAAATGTTATCACGAAAGAAAATTAAACCGT
TATTATAGTCTATAATATATAAAAGTGGGTAATATAATGCAAATCAGATTAGCTTTTCCAAATGAAATTGACCAAATTATGCTTCTTATTGAGGAAGCGCGTGCCGAAATTGCTAAAACAGGAAGCGATCAATGGCAAAAAGAAGATGGATACCCTAATCGTAACGATATTATTGATGATATCTTGAACGGTTATGCTTGGGTTGGTATTGAAGATGGGATGCTAGCGACTTATGCGGCTGTCATTGATGGTCATGAAGAGGTTTATGATGCTATTTATGAGGGAAAATGGCTCCATGATAACCATCGCTATCTTACATTTCATCGTATTGCGATCTCAAATCAATTTCGTGGTAGAGGACTAGCACAGACATTCCTTCAAGGGTTAATTGAAGGTCATAAAGGACCTGATTTTCGTTGCGATACTCATGAAAAAAATGTCACTATGCAGCATATTCTAAATAAATTAGGCTATCAATATTGTGGTAAAGTACCTCTAGATGGGGTACGATTGGCTTATCAGAAGATAAAAGAAAAGGGAGAAACGAGCATCTATCGTGAGATTGATGAAAGAAATCCCATGTAAGAAAAAAAGAAAGGGGTTATTTTGGTCAATGATGACTAAAGAGATAACCTTAGTGAAAAAGAATGGTTTTTAGTGTCAAAACATTTAATAATATTAATCAAATTGGTTTACAAGAGTTAGGGAATCGTTTCCAGATTGATGGGGATATGTCAGAAAATCCTGATGCCTATATTATTCGTAGTCAAAATTTGCATAATCAGGATTTCCCAAGTAACCTCAAAGCTATTGCTAGGGCGGGTGCAGGAACAAATAATATTCCTATTGAAGAGGCAAGTGCACAGGGAATAGTCGTGTTTAATACCCCAGGTGCAAATGCTAATGCTGTAAAAGAAGCGGTCATTGCTGCCTTATTACTTTCAGCTCGTGATTATTTAGGAGCTAACCGATGGGTTAATACTCTAACTGGAACAGATATTCCCAAACAAATTGAAGCAGGAAAGAAAGCTTTTGCTGGTAATGAAATTGCAGGAAAAAAATTGGGAGTTATCGGCCTTGGTGCCATTGGAGCTAGAATTGCGAATGATGCTAGACGCTTAGGAATGACAGTTCTTGGTTATGATCCCTATGTTTCAATTGAAACAGCTTGGAATATTTCAAGCCATGTTCAAAGGGTTAAAGAGATTAAGGATATTTTTGAAACTTGTGACTATATCACAATTCATGTTCCCTTAACAAATGAAACTAAGCATACTTTTGATGCGAAAGCTTTTTCAATCATGAAAAAAGGAACTACGATTATCAACTTTGCTCGTGCAGAATTAGTCAATAACCAAGAGCTATTTGAAGCGATAGAAACTGGTGTTGTCAAGCGCTATATTACTGATTTTGGAGACAAAGAATTATTAAACCAAAAAGGAATTACAGTCTTTCCTCATGTAGGCGGATCAACAGATGAGGCAGAGCTAAATTGTGCTATTATGGCAAGTCAAACCATTCGTTGTTTTATGGAGACAGGAGAAATCACTAATTCAGTTAATTTCCCCAATGTACATCAAATTCAAACCGCACCATTTCGTATTACTTTAATTAACAAGAATGTTCCTAACATTGTGGCTAAAATTTCAACTGCTGTATCAGAGTTAGGTATAAATATTGATAACATTATTAACCGTTCAAAGGGTGACTACGCCTACACACTAATAGATTTAGATGAAACGGATAATAACAAGATTTCTACTTTAATTGAAGAATTTGAAGGTGATGAGAATATCGTCCGTGTACGTTTAATTGCAAAACAACAATAAAGAACTAAAAAGTCATGAATTGCTTTTCATGGCTTTTTAAGTTATGCTTATCATATGTTATATCAAGAATTCTATCAGTCCCCTCTTGGAGAAATTCGCCTTCTAGCAGACAATCTAGGTTTATCAGGGCTTTATTTTGTTGGGCAAAAATATGATATGCTAGCAGTCAAC
CAAGAGGAAATTGTTAATATGTCAAATTCCTATACTTTACTTGGTAAGAAGTGGTTAGATGCTTATTTTTCACAGCAAAATTTACCTAGTATTCCATTATCATTGAGAGGGACAGCATTTCAGACGAGAGTATGGCAGGAATTACAGAAAATCCCTTTTGGAGATACCAAAACTTATGGAGAATTAGCGAAAGAGCTAAATTGTCAATCTGCACAGGCTGTTGGAGGAGCAATAGGTAAAAATCCTATTTCTCTAATTATACCTTGTCACCGTGTTTTAGGAAGATATGGTCAATTAACTGGTTATGCTGGTGGCTTAGAAAGAAAATCTTGGTTATTAGAATACGAAAAGGAGAAATAATATGTATACGTTTTATGAATACCCCAAATGTACCACTTGTCGTTCAGCAAAAAAAGAGTTAACTGAACTCGGCTTGACTTTTGAAGCGATTGATATAAAAAGCAATCCTCCCAAAGTTTCGTTATTAAAAGAATTGTTAGAGAATTCGCCTTATGACCTTAAAAAGTTTTTCAATACTTCAGGAAATTCTTACAGAGAATTAGGCTTAAAAGATAAATTTGATGATTTAACTCTTGACCAAGCTTTGGACTTATTGGCATCGGATGGAATGCTTATTAAGCGTCCATTGCTAGTTAAAGATAATAAAATTCTTCAGATTGGCTACCGTACTAAGTATAAAGACCTCAACTTAGTATAAAAAGATGATTGAGTTATAACTCAATCATCTTTTTTTGGTTTATAGTTCAATCTCCAAAATAATTGGTGTATGATCTTGACGATCACCAGAATGAATCATTTCTGACTTGGTAATTTTATCTGCGACACGGTTACTTGTTAGCCAATAATCAATTCTCCACCCTGTATTGTTGATTTTGGATGTACGACTTCGTTGTGCCCACCAGCTATAAACATTTGGAACATCGCCATGAAGGTAACGAAAAGTATCTGTAAAACCTTTGGCTAGTAGATTTGTAAAGCCTTGGCGCTCTTCAGCTGTGAATCCAGCAGAACGACGATTACTGCTAGGATTAGCTAGGTCTATTTCCTTGTGGGCTACGTTATAGTCACCTGTTGCTAGAACAGGTTTTTGACTATCCAATGTTGCTAAATACTCAGCATATTTAATATCCCATATTTGACGGTCAGCTAAGCGCTTTAGGCCATCACCTGCATTTGGAGTATAGACCTGTGTGATATAGCAGTTTTCCAATTCGAGAGTAATAATGCGACCTTCATTATCCATTGTAGTAGGAGCATCGATTTCAGGGAAGCTAACAATAGGATTTAACCCTTTACGATATAAGAACATTGTACCAGCATAGCCCTTACGAGCAGGTTCGACTGATGATCGCCAAACCAAATCATATTCTGGGAAGTAAGTTTCAAGTACTTCCAAATGTTTTTTAGTAGGTCCTTTGGCAGACAGTTTAGTTTCTTGAATCGCAATGATGTCGGCATCTTCAGCAACTAAAGTATCAATAACTTGGCGTGACATTAAAGCACGTGTTGATTCACTTGTTAATGCCGCATTGAGGGAATCAATATTCCATGAAATGAGTTTCAT
Protein sequences of DBSCAN-SWA_1 >NZ_CP016501|170766:182275|175333_175660_+|WP_000358198.1|DBSCAN-SWA MDKKDLFDAFDDFSQNLLVGLSEIETMKKQIQKLLEENTVLRIENGKLRERLSVIEAETETAVKNSKQGRELLEGIYNDGFHICNTFYGQRRENDEECAFCIELLYRD >NZ_CP016501|170766:182275|175665_176529_+|WP_001866320.1|DBSCAN-SWA MQVQKSFKSNIHYGTLYLVPTPIGNLDDMTFRAIRILREVDFICAEDTRNTGLLLKHFDITTKQISFHEHNAYDKISGLIDLLKEGKSLAQVSDAGMPSISDPGHDLVKAAIEGDIPVVSIPGASAGITALIASGLAPQPHIFYGFLPRKKGQQITFFETKQDYPETQIFYESPFRVSDTLKHMKEIYGDRQVVLVRELTKLYEEYQRGTISQLLEHIEKVPLKGECLIIVDGKRDTERVKDSSQQDPLVLVKEYIANGDKTNQAIKKVAKEFNLNRQELYASFHDL >NZ_CP016501|170766:182275|172410_173577_-|WP_000160537.1|transposase|DBSCAN-SWA MTKHKHLTLLDRNDIQSGLDRGETFKAIGLNLLKHPTTIAKEVKRNKQLRESTKDCLDCPLLRKATYVCNGCPKRRINCGYKKTFYLAKQAQRNYEKLLVESREGIPLNKETFWKIDRVLSNGVKKGQRIYHILKTNDLEVSSSTVYRHIKKGYLSITPIDLPRAVKFKKRRKSTLPPIPKAIKEGRRYEDFIEHMNQSELNSWLEMDTVIGRIGGKVLLTFNVAFCNFIFAKLMDSKTAIETAKHIQVIKRTLYDNKRDFFELFPVILTDNGGEFARVDDIEIDVCGQSQLFFCDPNRSDQKARIEKNHTLVRDILPKGTSFDNLTQEDINLALSHINSVKRQALNGKTAYELFSFTYGKDIASILGIEEITAEDVCQSPKLLKDKI >NZ_CP016501|170766:182275|176798_177434_-|WP_000603277.1|DBSCAN-SWA MILREFCAENLTDLTRLDKAIISRVELCDNLAVGGTTPSYGVIKEANQYLHEKGISVAVMIRPRGGNFVYNDLELRIMEEDILRAVELESDALVLGILTSNNHIDTEAIEQLLPATQGLPLVFHMAFDVIPKSDQKKSIDQLVALGFTRILLHGSSNGEPIIENIKHIKALVEYANNRIEIMVGGGVTAENYQYICQETGVKQAHGTRITQ >NZ_CP016501|170766:182275|181447_182275_-|WP_000767484.1|DBSCAN-SWA MKLISWNIDSLNAALTSESTRALMSRQVIDTLVAEDADIIAIQETKLSAKGPTKKHLEVLETYFPEYDLVWRSSVEPARKGYAGTMFLYRKGLNPIVSFPEIDAPTTMDNEGRIITLELENCYITQVYTPNAGDGLKRLADRQIWDIKYAEYLATLDSQKPVLATGDYNVAHKEIDLANPSSNRRSAGFTAEERQGFTNLLAKGFTDTFRYLHGDVPNVYSWWAQRSRTSKINNTGWRIDYWLTSNRVADKITKSEMIHSGDRQDHTPIILEIEL >NZ_CP016501|170766:182275|181051_181408_+|WP_000287944.1|DBSCAN-SWA MYTFYEYPKCTTCRSAKKELTELGLTFEAIDIKSNPPKVSLLKELLENSPYDLKKFFNTSGNSYRELGLKDKFDDLTLDQALDLLASDGMLIKRPLLVKDNKILQIGYRTKYKDLNLV >NZ_CP016501|170766:182275|178726_179275_+|WP_001167085.1|DBSCAN-SWA 
MQIRLAFPNEIDQIMLLIEEARAEIAKTGSDQWQKEDGYPNRNDIIDDILNGYAWVGIEDGMLATYAAVIDGHEEVYDAIYEGKWLHDNHRYLTFHRIAISNQFRGRGLAQTFLQGLIEGHKGPDFRCDTHEKNVTMQHILNKLGYQYCGKVPLDGVRLAYQKIKEKGETSIYREIDERNPM >NZ_CP016501|170766:182275|176650_176755_+|WP_011058365.1|DBSCAN-SWA MNTIANWILFLRVGYKRWYALIPVLSGFRKHERT >NZ_CP016501|170766:182275|179337_180519_+|WP_000232156.1|DBSCAN-SWA MVFSVKTFNNINQIGLQELGNRFQIDGDMSENPDAYIIRSQNLHNQDFPSNLKAIARAGAGTNNIPIEEASAQGIVVFNTPGANANAVKEAVIAALLLSARDYLGANRWVNTLTGTDIPKQIEAGKKAFAGNEIAGKKLGVIGLGAIGARIANDARRLGMTVLGYDPYVSIETAWNISSHVQRVKEIKDIFETCDYITIHVPLTNETKHTFDAKAFSIMKKGTTIINFARAELVNNQELFEAIETGVVKRYITDFGDKELLNQKGITVFPHVGGSTDEAELNCAIMASQTIRCFMETGEITNSVNFPNVHQIQTAPFRITLINKNVPNIVAKISTAVSELGINIDNIINRSKGDYAYTLIDLDETDNNKISTLIEEFEGDENIVRVRLIAKQQ >NZ_CP016501|170766:182275|171486_172146_+|WP_001144248.1|DBSCAN-SWA MPVKDFMTKKLVYVSPDTTVAEAADLLREHHLRRLPVVENDQLVGLVTEGTMAEAQPSKATSLSIYEMNYLLNKTKIRDIMIKDIVTVSQYASLEDAIYLMMSRKIGVLPVVDNGQLYGIVTDRDVFKAFLEIAGYGQESYRLVILADEGIGVLSKVLNRLSSANLSVKRLVIIERKAGKKAVEIQLEGYADKDVLKQELVFDDVIVETLEKTMKKPLS >NZ_CP016501|170766:182275|173785_174421_+|WP_000715592.1|DBSCAN-SWA MKKGLMISFEGPDGAGKTTVLEAVLPLLREKLSQDILTTREPGGVTISEEIRHIILDVKHTQMDKKTELLLYMAARRQHLVEKVLPALEEGKIVLMDRFIDSSVAYQGSGRGLDKSHIKWLNDYATDSHKPDLTLYFDVPSEVGLERIQKSVQREVNRLDLEQLDMHQRVRQGYLELADSEPNRIVTIDASQQLDEVIAETFSIILDRINQ >NZ_CP016501|170766:182275|180573_181050_+|WP_000966772.1|DBSCAN-SWA MLYQEFYQSPLGEIRLLADNLGLSGLYFVGQKYDMLAVNQEEIVNMSNSYTLLGKKWLDAYFSQQNLPSIPLSLRGTAFQTRVWQELQKIPFGDTKTYGELAKELNCQSAQAVGGAIGKNPISLIIPCHRVLGRYGQLTGYAGGLERKSWLLEYEKEK >NZ_CP016501|170766:182275|176504_176666_-|WP_011058366.1|DBSCAN-SWA MQWYSKELKLKILRKLEVSYFFIPIAKSEIIKGCQIFLWQPFYIITYKSWKLA >NZ_CP016501|170766:182275|170766_171468_+|WP_069571495.1|DBSCAN-SWA MKVENLSIHYGVIQAVNDVSFEVNQGEVVTLIGANGAGKTSILRTISGLVRPSQGSISFMGKPIHKLAARKIVGNGLAQVPEGRHVFSSLSVMENLEMGAFLQKDREQNQKMLKKVFDRFPRLEERKNQDAATLSGGEQQMLAMGRALMSRPKLLLLDEPSMGLAPIFIQEIFNIIEDIKKQGTTVLLVEQNANKALTIADKAYVLETGKVVLSGTGKELLVSDQVRKAYLGG 
>NZ_CP016501|170766:182275|174440_175304_+|WP_000364565.1|DBSCAN-SWA MDLKRTQPKLLEKFNTILQSDRMSHAYLFSGNFASLDMALYLAQSQFCEKRQSGLPCQECRACRLIANGEFSDVKIIEPQGQLIKTETIKELTKDFSRSGFEGKSQVFIIKDCEKMHVNAANSLLKFIEEPQSSSYVILLTNDENNVLPTIKSRTQIFRFPKQLDMLVHQAEQAGLLKSQASLLAQVADDPKHLEILLTNKKLLDYLNLSQQFVTTLAKDRQTAYLEVSRLTSQVVDKNDQAFVFQWLTIMLAKEGQLYDLENTYRAQQMWKSNVSFQNSLEYMVLS |
15 | Streptococcus_phage(90.91%) | transposase | NA |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1140008 : 1182358
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP016501|1140008:1182358|DBSCAN-SWA GTTACTCTGGAGCACAATAAAAAAGTTGCCCAAAAGACGCCGTATCCATAGTTTGATGAGAAGTTAACGTATTCAGATTTTTCCCCAATCCAAAATCAGCTAATTTAATTATACCATTTATGAAGAATATATTGGTCGGACTCAAATCTCTATGTAGCACGCCTCTTTGATGAACTAGTGACATCGAATACAAAATTTGTCGAATAATATTAATTTGAGAATCCTCTGTCAAAAAACTTTCTTTAACAAAGTCATCCAATATATTATCTGCTTTCTCCATAGTATATGAGCAATTACCAATATCAAAATCATACACTTTAATGATACTTCCAATATCTGAACAAGATTTAGTTATTTCATATTCACGCTTAAAGCGGCTTCGGATTGAAGCGCTTCTTGCAGATTCCTCATTAAGTTTTTTCAATACTAGTCCTGTTGATTTTTGGAAATAAATATTTGCAAAACCACCACTACCAATTTCAATCAGATCTAAATCAATTTCTACTAAATGAAATGCACCATCTTTTTGAGATAGTTTTAGCGAGTAAACCGAGCATACTTTATTCAGTTCATCTAATATTCTTTGTTGATGCACAAGTGCATCAATTTCACTGATTTGTCTTTCTGTTAAAAGATACTGTTTAGAAAGAATTATATTGAAATACTCATCAATCTTTCCTACTGAAGAAAAATCTAATAATTTTTGATTTACATATCTCCACCTTGTTGGAAAGCCCTGTCCATAGCTATCAGATATTTTGAAATTATTATTAAAAAAGTGAACTAGGTCTGGGCCGGATTTGTACTTGAATAGTTCTAAATCATCTCCACAGAATATTTTAGCTATTTTTTCATAAAACTCTGGATTAACCAAAACATTTCCCCTTTTAGATAGAATAACTTTTTAATTTAACAAAAGTAGGTAATTCCGAACAACTGAGTTTTTCAATGAGTTTTAGCCTTTTGTTGGATATCTTAAAAATATTAAACTCATCACGTACAATAGTAAGTAAATAAGTTTCAAACGATTGTACCTTTATGTGCCTGTAGGCGAAAAGAGAAGAAAAATCATAGCTTATTGCATTTAAAATGATATCATCTAGTTCATTAGTGCTAATTTCTTCTTCCGTATAAAAAATGTAAGGGAAAAATAAAAGGAGATTTTTCTGCTTCTCAAGCATATTTAGGACTTTTATAATATCTTTTTCAAGGCATTGTTTCCTAACATGTTGACGCTTTACCTTGAGTCGTATTAAATCAGATACAGTTCTAAATCGAAACGCAACATGAATCTGAGTTGCTTTTATTTCCCCGTTCGGATTTCTACTTTCTCCAATTGCAGTAATCCCATTTCCAAGATTAGTGATACTAGGTGAAAATAAACTGCTTGCCTGCAGTCTACTACTTGAAGAAAGAAGCTTGAAGTCTATCTTGTAATCATCTGCTACAGCATCACATTCCCCGTGGCTCTCGTCTTCAGGTGCGTGAAATTTAGATTTACCTTTTTCCCGAAAATATACTGATTTATTAAGCAATTCCAATAGATAAATTTCATAATTACATTTGGTCTCACCAGCTACAAAACCTTTGATTATTGTAGCTGGAGGTAATTGTTTAAATATAATTCGAGGATCTATATTAGAATTCAAATTTTCACCCATGCTTTATACTCCTTAAGTAATTATACCACTAAAACCACAAAAAGACTGCATCCTCTCAGTCACAGTCTCTTATAATTCTACTTCAGTTCCTTCTAGGAAAATCACGATAATTTGTCCGACTTCTGAAATACTGATGTGATCTAAAACTTTGTTCATGATAGTTCCATCAAATTTCTCTAAATCTAGTAACTGATACATTTGTTTGGCATAGTGCTTATGCAGCAGATTAGTTCCCTTTTCTAACTTTTGCCACTTAACCACC
ACATCTGCTCTATGTTCCTGAAGCAATTCCACTGCCTTCATGAAGGCTTTCTCAAGCATTTCTTCATCAATATGGTTGTTCTGGCAACCAATCTGTCCCTTGACCTTGTAGCGGTTGTTACACTGCCAAACCTTGCGTTTACCTCGACTGGTCGTCCAGTTCTTTCGTCCAAAGGCAGAACCACACTCGGCACAGAAGACCTTAGTGGTAAAAGGATTGTCGTCACTTTGCATGATATAGGTTTTCAGCTTGTGCTTTTCTCTGAAGTCTTTTCGCCTTGCCAATTCTTCTTGAACAGTGTCCCACATTTCTGTATCAATAATGGCTTCATGATTATTGGTCACATAGTATTGATTGACCTGCCCATCATTCTCTGTTCGCTTTTTTGTAAGGAAATCCACCGTGTAGGTTTTCTGAAGTAAGGCATCACCTTTGTACTTCTCATTCTGAAGCATTTTGAGAATACTGCTAGGATACCAGTTGGCTTTTCCTGACCATCCCTTGATACCGTTAGTGTTAAGTTCCTTGGCAATGCTTTCTGGACTGTAGCCTTTTAGGAACTGATGATAGATATGTTTAACTACCTCTGCTTGCTCTGGATTAATGACTAGCTTACCATTCTCATCTTTATCATACCCCATGAACTTGGTGGTATTGACACGAACCTCTCCACGTTCAAACTTCTTACGAATGCCCCATGTCGCATTCTCTGAAATGGAGCGTGATTCATCTTGGGCAAGAGAAGAAAGAATCGTGAGTAATACCTCACCTTTAGAGTCAAGGCTATCAATATTCTCCTTTTCAAAAGTGACACCAATCCCCAATTCTTTCAGCTCACGGACATACTTGATACAATCCAAGGTGTTTCTGGCAAAACGACTAATGGACTTAACTAAAATCCTATCCACCTTACCTGCTCTACAATCCTGTATCAGTCGGTTAAAGGCTTCTCGTTTTTTGGTATTGGTTGCAGAAATGCCTTCGTCCGCATAGATGTCCACAAGCTCATAATCTTCATGTTTTAAGATGTAATCACGATAGTAAGATACTTGGTTCTCATAGCTTGATAGCTGTTCGTCTTGGTCGGTGGACACCCGACAATAGGCTGCCACTCTTATTTTGGTTGCTTGCTGATGGGTGACCGTCAGCTGCACTTTCTTGGCTGGAATAACTGTAATGTTTTTGCCCATATTCGTTCCTTTCTATCACTGTTACAGGTGTCCTAAGTTTCCACTCTTCAATAGCTTTCTCAGGCACGCGCATGCCTCGGCAGGAAGCCTTACCTTCTTTAATATACTTGGAACAACACCAAACTATTTGTTTCTTGTAGGTAACCTGGCGTTTGAGGGTTGACCCACAATACTCGCATTTAAGCATACTCGTAAATCGATAGTGCTTATTGACTCCCTGTTTCCAAGTTTTGCTTGCAAGCTTATCTTGGACAGCCTGCCAGACTTCTTTTGAAACAATTGGCTCATGGTTATCTTCAATCAAATATTGCTCTACCTGACCTTGGTTTAATCTCTTAGGCCCATTGATACCAACATGAAAATATTTTTGAAGAAGGACAGTTCCTTTGTACTTTTCATTCTTTAGGATATTTCGAACAGTTGTATCATGCCATTTAGCATCTGTCACCGTAGGCACATTCTCTTGATTTAAAAGTTTAGCTATTCGATGCATCCCCATACCATCAAGATAAAGTTGGAAAATCCGTCTCACAATCTGTGCTTCTGCTTCATTGATGACCAACTCTCCTTTATCATCCACATCATAACCCATGAATCGCTTGGTGTTGATGACCATCTCCCCACGTTGAAACTTTCTCTGAAAGGCCCACCGCTGGTTGTCACTCATGCTTTGCAACTCTTCCTCTGCCAAACTGGCAAGAACTGAAAGCATGCCCTCTCCTTCATTGGATAAGGTATTGATGTTCTGCTCTTCAAAATAGATACCAACACCTAGGGCTTTCAACTCACGACTGACTTCAAGTACCGTCA
TGGTATTCCTCGCAAACCGTGAAATGGACTTGGTATGTATCAAATCAATCTGACCTTGCCTACAAGCTTCAAGCATAGCTTGAAAACTCGGTCGTTTCTCCTTAGACCCAGAAATCCCCTTATCGTAATAGACTCCGAGAAAATCGACGTCCTCCCGATTGACATAGAATGTTTCAAAATAAGCCTTCTGGTTTTCAAGGGAATCAAGCTGACTACTATTTGTTGTTGACACCCTAGTATAAGCACAGACACGTTGTTTTTTACGATTACTTTTTGTCTTAATTAACCTTACTGCCATTCAGGCTCCTTCCTATAACTTTTGTACACCATATATCACTCTAAAGGGAAGATAAGTCAAGCTTTTCGCAAACCTACTCCCTCCATATTACTAAGTAAATTTGGAAACAAATTTTCCGCAGTTTTTGAAAAAAGCAAAAAGAAAAAGCGTGGCTATTTCAACACGCTGAGATACAAGTATGTTATAATGAACTTATAAAAAGGAGCTGGACGGCAATCCAACCCCCAATGTAGGACCGTTAAAAAGACGGTGACTCGTTTAATTAGACTATAACTCGTCCAACAACTCGGTCAAAAGTGGGGACGGGTTATTTCTTTTTGTCGTCATCTTTGAAGATTTTATAGCACAAGCTAATTTTCAAGAAGTCATCTGCCAACTCTTTATTATGTTCTGACTTGAAAAGTACTGCCAGTGTGTTATAATGAACGTAGAAAAAGGAGTTGAGCTACCAACTCAACTCCAATGTAGAACCGTTAAAAAGACGGTGACCCAATATTTCGATATAACTCGTCCTACTCGGTCAAAAGTGGGGACGGGTTATTTCTTTTTGTCGTCATCTTTGAAGATTTTATAGCACAAGCCAATCAAAGCAACGGTAAAACTACCAAAACCAATGATGATTTGCACGACTTCAAAAGCGGTCAAAAGAAAGCTCCCTCCTTTCCATCAGATTTTGATGAATTGCCCATAGGCATCACCTCTCTCTTTCAGGATTTAGGAACCACCGTCTTCACTTTTCTACACTTAATAGTATACCATTAAAGCTGCCTATTGTGTAGCTTATTTTAGTTTCAACTTCTGTCCAACTGAGATCAGACTTGGATTGATGATGCTGTTAAGTCGCACCAATTCACTGACACTGGTCTTAAACTTCCGTGCAATACCATAAAGGGTATCACCCTTTTGAACCGTATAAGTCTGTTCACTATGTCCGTTCGTTGTGCCTTTGACATCTTGTTCCAAGACCCATGAGTTAATTCCCTCAAGCAGGTAGGCTTTCTTACTATTGGATTGATTAACATCCTTCACACGAATAACCTTGAAAGTTTTGCCTTTAACCCATGACGAAATCGCTTGACCTGTCTGGTAATGCGTCGCATGTTGTTGAACAGTAACAGGGTCACCAACATGATAGGAAGTAACCCTTGATTGAGGAGTATTGGGACTGGGTGAAATTACTTGAGCTTGCCCAGTGATGGCTTGTACAAATTCACGAGCCATACTGTCTTTATTGGCTTCATAGATGCCCACATCACTATTATTATCGATGAAGGCAATCTCCACCAATCGATAATTGTATCCTCGACTTGCTGATACATTGGCATTGTAGAGCCAGTTCACTTGTTTAAAACCACGATTCGTAAAACGCTTGGCTAGAATAGCTAGCAGTTTTTGGTCTAAACTATCAGCTGTGTAGCCTGAATAAATCAGAATTTCACTTCCTCTCGCTTGTCCATTAAAAGCGTTGAAATGAAGTTCCGTAATCGACTCATAGCCATTTCCCAAACTACCAATACTGCGGTAGTCATAGACATTATGGTCAGTGATGTAATCAATCCGATTCCCAGAATACTTCTTCATCAGATTCGCAAACTCACGAACCTTTCCTGCTTCAGTAATACCTAAACTAGGATTCGTTGCTCCAGGATCATAACCTGTTTGTCCTTGTCCATGTCCGCAGATAACTAAATGTTTACTCATA
GTCGTTTCTCCTTTTTTGTTTAAAATACGGTCATCCCAAGTGTTGAGATGATGTGCTTCAATAAGTGTAATCAATTTTGACCCATAGTTAGGATCTGTCGCATAACCTGACGCTTGTAAAGCGAGACAGGTTTTTTTATAGTCTTCTTCACCGAGCACGTTTTGGTAATGGCTACGTCGCCATGCTGTTTCTGAAAAGAACAACGCATGATCTTTAATGGATTCCTCCCATGAGTCATACTTTCTAAAGGTAGCCTCAACCGTGACGAATTTCCCATCAATGTATTCTTGAGTTGGAAGGTCAACACGTTTTCCCTTCCAATCAGAACTAGCTTTGATTCCAAAGAGATTATGATTAGGGTATTGTGCTAACAAAGACTCTCCCCAACCACTCTCTAAGATAGCTTGTGCTGCTGACACGGAAGGAAGAATACCGTGCTCCCAAGATGCTAAACAACCGTCTTTAATCTTTGATAAAAAAGTCATCCTAATCTCCTTGTTTCAATTGTTTCAGCACATCTTTCAACTTCTCAGGAACTGGTAGTCCAATCCTAGCCGCATTCTCAATGATGGACAAGCCTTCATTTGAGAGATAGTAAAAGATGATGGCAGTACGAACCATGCCCCCTTGTTTCAAGACATGCTCATCAATGATTTGTCCTACTGCTACCAAAAATAAAAGGACAATTTTTTTGAAGATGCCCTTAAACCCAATATTGCTAGCTAATTTCTTCTCAACGGCTGCTGCCATGAGACCTGTGATGTAATCGATGGAAATAAAAACCATCAATGCAAAGAGAAAGCCATCGACTTCTCCAAAGATTGATCCGATAAGTCCTCCAATCGCTGAGAAGAGGACTTTATTTGTTGCCAGTAATTCTTTCATAATGACACTTTCCTTTCTTATGCTGCCCTACGCCAACGGTAAACCGTGACGTAAGGTTGTAAGTTATTGTGTGGCTTACCGCCACCAGTATTTCCTGTGTTATTTCCTTGCGGATAAATCGTAGAGCTGCCATCTGAAGAGTAGTCACGACGAACAGCTCCTGAACCATTGTTAGCCGTCACATACTGAGCGTGTGAGTGAGACGGCATCTCATCAATGGTCAAGGTATGTGTCTTACTACCACCTGACTGATTGACGCTATTAAACTCACTTTCATTCTCAGACACACCAACCAAGACCCGACCGTTGCCAAACCGCTCCCAAGTGCCTCCCATAAATGTTGAGGGATTAGAGCTTGAAGTTGACTCGTAAATAACACCCACTGGATAGAAAATATCAAGAAGTTTCTTATTCTTCATATAAATGTCCCCATCAAAATAAGCAGGTAAGCTCCCATCCACATCGAGCACGCCTCTTGTCCAGGCTTTCCCTATCCCCATACCAGATGGACTTAAGCCGTAGACAACTTTTTCTGGACCAATGGTAAATTCAAAGATTGTGGCATAAAAGAGATCTTCTAGCCTTCCAATAATGGTGTAGGATTTGGTGGTATCATAGGTGCCACCTAAAATAGCTTGGAAATCGGTCTTTGTGTGTTCCGTAGTTGACGTCCAGTTGGCGGCACCACCAGCGTTTGTGACCTTCTGACCGCTTGCTAAATCGACTACTTCCCATGTCAAGGCCGCCTTGTTCTTTTGGACACTATTGATGGTTAAAGGGGCAATCTTAAGCTTTCGTGTGACCGTCACCTGATTCATACTAGAACCAGCTCGAACCGCTGAGAATGAAAAGATAGGCTTGAAATACTCTAATAAGGTTATCTCTACTTCTTTTCGAGCACTTTGTCGCCCTCTCGAATCGGTCACATAGGCAGATACCTTAGCTCGACCAATCCAGTTTATTCCACCCAATAGTCCGTTATTACTACTAGTGACATTTTGAAGTTGAACCCACTGATTGTTCTCAAACTTAAAGACCTCTGCGCGAAAACCCGTCGAAGGAATCGTCGATCCGTAAATACCCGCTCCTTGATTGAAGGTCACTTTAGGATTGGA
CACCAACTGGGCAAAACTGGTGCCAGTTAAAATCGTTTTTGCAGTGGCATGGGATTCTGAAACAGAAATACTGCCCAAGGTTGGAACAACAGACGTCGGTAAGTTGAGGGTAATGGGAATTGTCATAGAACCTATTGTCTTACCACCGTAGATAGTGGCTAGCGTTAAATGCCCATTACCTGAGGTGCTATTTGGAATTTGAGTTGCTAATTGAGAAATCGAAGGCGTCCATGTTGCAGAGGTCGCAATGCCTGTTCCAATCGTGCCACTTAGACTCCCAAAATGCCAGGTCATGTTATGGGTAAAATCACTACTGGCACGTTTAATCGTAATGGTTACGTCTTGTCCCATCAGATTCCCTGACACCGTGGCACTGGATGACCGTGGGATATCACTTAAGCGTAGCGTTTGTGACCCCGTATTTAAGGTGCCAGGCGACCAACCCCCAGAGCCTGAAAAGGTCGCTGAAAAACTGATGGTTTTTGACCCATTGGAATCATGCGGCACCGTGATGGTCTTATCAATCAAGTGGAGAGAACTATGAGCCGTATACATATCTGGTCGCCCTGACCAAGAAAGCGTCTGGCCATTGATGGACACACTCGCCCTACAGTCATACATCCCAAAGGTGGTGTAACCATTCTTTAGCCAGAGTTGAACTCGGACAGTAGATGTATTGTCAGCGGTCGAAGTGCCTGTTTCTTCCACTCGCAAAAGTAGGGTATAACCCCTGTCATTATTTGAACCGTAATCTGCCATAAGGTCTCCTTTCTACTTGGCATCAATAAAACGACAAACCAGATGCTTGGCATTATGTCTTGCGGCTTCTAGTCGATAATAACCAACCTGTAAGGTCTCCACAAAGACCCCGTGATGGATTTTAATGACACCAACTGTCACTGTCATGACGGCATTACCAGCTGACTTAATCATCATCCCTTGTGGGGTTAATTCGATATACTCAGAGTTATCCTTTTTACCGATAATCACCCCATTGTCACCAGCTCTCAGATAGGTATTGACAAAGTTAAGCAAGAGACTGCTGGCTTTAAGATCGGCTTCAATCGCTGCGATACGAGCTGTATTATCAATAAAGTCTTGATTAAATTGCGCAAGGACGGCTTCATTATTTTTCTCAAATTCTTTATAAGACTTGAGCCAATCGGCCACTTCTTTTGCCAACGCTCTTGCTTCAAGATCCACTCGGAGAGATTCTGTTTTTTCAGTTAGAAGGTCTAATTGCTGCTTCATAAAGCCATTATCTGCCTTGGCATCAATCTGGGCGATAAGATCATTCAAAGATGGTCCTGGAGCTGAAGCAACGTTTCCATCCTCTAGTTGAACATTTCGAAGGTAAACCACATCTCCCACCGCCCATGTTCCTGACTTCAGGTAAAAGACATAGGAATGATTCTGGGCACTAGTAACCTTCCAAGACACCGAATACCGCTGCCAGCTAGAACTAACTTGAACAACCTTAGTCCCACCCAGTTCATTTCCGATGGTGAGACTAACCGTCTTACTCGCCTTAAGGTCAATACTAAACGTCATGTTAGCACCAATTCGACTTCGTAAATCATAGAAGTTACGATGAAAACCACCTGTACCTGCTTTGGTACAGGTCATCTTAACAGTCACGCCACTAACAGAACTTGTATCCTCAACCACCTCCTTCTTCCACTCAGAGGTTATTGATGAAAAAGTCGCAGCCTTCATGGCATAATCGTCAATGTAATTGCGCCCACCGAGTTCAGTTCCTTCAAAAAAGGATGACCAGGTATAGTCACTAGGATTGGTTGAAGGTGTTGCGCTCTCCTTATTGACGGCTAGTCCGAGGTAGCGTTTGCCAGTTGAGACGGCAGAAATCCCATCTCCCTTGTCATTATCTGCGTACATCCGCCAAGTGTAGAGGGTCTTTCCGTCCTTACCAGCCTTTCCATCCACACCTTTGTCCCCATAAACTCCGATAATAACAGGCGTTGTGACTGTCGTTGAGCCAT
TGGTGAAAGTCGTTTTTTCATAATTCCACAAATACTTAAGGGTCGAGGTTAAAGTTGGAATCGTTTTAGTCCAACTAGCGCTTGTTGCCGTTAGACCTGTCTTTTGTGCAGAGACTAAGTAATACTGTTCCCTTGACTGAATGCCCACCCCATCTGCTCCAGCATCACCCTTTGGTCCAGGAGTTAAAGAGATAGTTTTAAGATCAGCTTGTGTCGCTAGACTTTCCCCTCTCACCTTTAAGAGTGGTGTATCAATGGAAAGATTCCCATCCCTATCCAAACTAAAGACTGGGGTATGTTGACCTGGGATAACCACCTTATCAGCTTCCACCTCTAAACTCTTCACATACTCTGATAGAATTTGTTGAGAGACAAGCCTTGTCATCCATGCTGAACTAGAGACGGTTAACTCATCAATCGTCGCTTTTTGAAAGGCTGCGACCCGTGCTTCAAGGCCATTCGTTGCCAGATAATCAAGCGCAGCTTTCTTCCCAGAACGCTCGCCTTCTGCTTTGGCAATAGCGATACCATCTTCAACTTCGGTCTGAAGACGCTCAAATTGCCTGTCAAAGACCTTGTTAAAGTTAGCTCGCTCCTTAGACGCACGATGTTCTGTAACCGATTGATTAACATCAAGAATAGTCTTAGCCGCACTAGTTAGCGAATTGACCCCAGATGACCCAGAGTTTGTAAAGGTAACCTCATCATCAAAGGTTACTGAAAGATACACTTCTTCAAGAGCGTCAAAGGTATAACCGACTGCCTTTTTCTTAACATCTACCTTGTGCTTTTGACTTTTAAGAGTAACCGTATCTCCCAGATGAACTTCCTGACCGTCTAACTGATAAGCTTCAACGGTTATTTGTCTGGAGATGTTATCGATGTGCTCATGCGTGAACTTAGCCATCGCCCATTGGCGCAATTCTTCCTCCGTCTGAAGCGTGTTATTTTCATACCGTGCTTCATGGATATAAGGGTATTGGTTAATGAGGGGGCTTTCCACAACGACTGAAAGAACCGTATCTTCATCACTACCTTCGGCTTGAAAGGTTGAGGTCGCATAGATGCGCGTAATGACCTTATCAGAATCACCCTTATCCTCAAAAGCTTTCAGATTGTGATGACTCGTTAAAATAACACCCTTATCGTTGCCACGGTGCTTCTTAACCATCAGCTGAAAGTTATCACGAACGAACTCACCTTCCCACATCCCAAGGAGGGAATGTTTCCCATCCATCAGAGCTTGGTAAAGTGTCAGGTCTTCATCAGACACATAGGTATGACGCTCCGTCACATCACTATCAAAGCTGAAAAGTCCCAAATCAGATGGACAAGCCTCAACCATTCTCATAAGAGCAGATTGACAAGTCGTATTAATCGCAGAAAAAGGCTTAACTTGGCGTTTCATCACATCATCAGAAATGTGATAACACTCAAGCTCTACCGTATCGTCCTGAATCTTTACCTGCTTAATACGAAAGAGCTGCTTTCCCAAATCTGGAGTTGGACACAAAATCAACTCATCTGCTCTAAGGGTTTCATGAACACCTGAATCTGTAATCGGATAAGTTAAGCGTAGCTGAAAGGTGCCATTCAGCACCTCTTCTACCGATGCACTCACCGTTTCAAATAATGGCTGACCATTCCATTTCGGTGTCTTTGTTTGACCATCAAGAAGGTATAACATCACACCCACCCCCAATTCGTCTCAAAGGTAAGCGATAAGATGCCACTACCTAAAATAACTCCGACAGACTGAGTAGGATGACTGGCATCAATCGTGATGAAATCGCCAGACCACTTAACAGCTTTCCCTGATAGGGTCTTAAAACTAGGGCGTTCTGGGTGATTAACCATTACGAGAGGCTCTGTAAATTTCTCAATGCGAATCACTTGGTCACCAACTGTAAAGGAAGTTTCTGACACCGATTGACCAGTGATAGTAATGGTAGGAAAAGCAAGGGCAGAACCTTGCGTTTTCAGTACGCCATTAGTGGTCA
AAACTTGCCTATCCACGTTCTTAAAAAAGCGAGTTGGGTGACAGATAAAGGTCACATCAATCACAAACACATCATGGTCATCCTGCTTGATGTCAAAACTATCCGTACGGTAACACCAGAGTCTGGTGAGCTTAAGACGTTCACTTTCTAGCCAAAACCCTTCCTGCATCAAAAAGGCCGAAAACTCATTTACCTCTTTCTCACTCGCACCAATCAGATACAAGCGGTAAGACTTCTCAATCACTTCACGGTGCTTATTGGTTTGAACAATTGCCCCACTCAATCCACGATGGTCTAAGAGTTGCGTTTTAGACCGTGGTACTTGAATGCTCGGTCTATCTTCCACAAGAACTTTAAAAGGAAAAGACGAGGTGCCTTTTCTATTCAATACCAATTCATTATGCTTAATCACACCCTTCCTCCTCTCAGTAAGGCTTGTCGTGCCATTTCAGCAGCTAATCGACCAGCCACGTAATCGGCTAGTTTTTTCATATCCGCTTCTTCACGAATCACAACATCTGTGATATTGACTGTTATGGTTGTTCCCTTGTTAGGCATGGTAGCTGCGATGCTGCTACCAATACTGCCTAGAGTCTGGTTGTTAAGGGGCAAGACAGCTTCACGTCCTGCTTCTCCTCCAACCATCAGGCTATTGCCTGTCATACCAAATGCCGTTGGCTTGGTGAGAATCCCACCCTTGGCGTACCACTGAATAGAAATCTTAGGCAAGCCACCCTTCAACCAATCAAGGGGATTCGCAGAACCTGACACACTAAAGTGCGGAAGAGGAATATGCGGCCACTTAATCTTGAAGTTAAAGAGATTCTTAATGGCATTGATGGCTGAAGAGACCGCATTTTTTGCCCCATTGATAGCGTTTGTAATGGTAGATTTGACTCCATTCCAAACAAAAGACACTGTGCTTGAAATACCACTTAGGACACCTGAGATGGTTGCTTTCATCCCATTCCAGATAGAAGATACCGTTGAACCAATACTCGATAGAATGGAACTAATCGTTGATTTGATGCCATTCCAGACATTACTAACGACACCCTTGATACTATTGAGTAAGTTTGTCATGGTGCCTTTGATGCCATTCCAGGAATTGGAAATGAACTGCGCAATGGCACTTAGAACAATCGAAATCAGAGACTTGATGACTTCCCAAACCGTAGAGACGACCTGCTTGATAGTTTCCCAAGCACCTGACCAATCACCTGTGATAATCTGCATAACTGCCTTGATAATACCTAAGACAACATTGATGGCTGTTTCAACGACCACTTTGATGATATCCCAAGCTGTCGTGATAATCAGTTTGATATTCTCCCAACTGGCTTGAAGGTAAGGTGCAAGAATGGTCATGATGGTTTGAATGACTGTTGAAATAGCTGTCCAGACAGTATTTGTGGCATTAAGAATTAGCTGTTGGTTCTCTGTCCACCAAGTAATCAAGGTTCCCCATATCGACATGACAAAGCTTGATATCTGTTGGATAATGACAGTTAAAAAGGCATAGATGGCGTTCCAGATTTCTGTCACAGCTGTTCTAAACCCTTCGTGATGTTGCCAGAGTTGTTGAATCCCAACAACCAGTAAGGCAACAACGGCAATGACACCAAGAATAATCCCTACGATTGGAGCTGCTGCAGTTATCATCCCCATGATGGTGGTTCCCATAGCCATCGCGGCTGCTTGCAGGGCAATGAAAATCGGGAGGACTAAGCCAAGGGCTGCGACCAAACTTCCGACAATGACAATAAACTGCTTGACTGGCTCTGATAGCCCAGAAAACCAAGTGGCAACAGCTTGAAGCAAACTGGCGATCACCTCTAAGATAGGTGCTAGGGTTGCGGCAATAGCGTCTCCCATCTCAGCCATCGCTAACTTCGCTGTGTTTTGAGCAGTTGTAAACTTATCAACGGGATCAAGCGTCCCCTCATAGGTCTGAGTGACAATCCCAGCTGCCTTATCGGCTGTTCCTGCTAAATCTTCAA
AAGATAAGGCCCCACGCTTGATGGCATCAACCATACGAGGAGCTGCTTTACTACCAAAGATTTCTGAGGCTAGAGAAAGAGCCTCTGTTTCACTCGTTGAGGTTTTAATTTGTTCAATGGTTCCAGCAAGTCCTTCTTGAAGCGTTAACCCATCACCCGCATACTTAACAGATGCCTTTGAGAGTGACGAAAGTGCTGCAGAAGAATCAACCCCTGCTTTTTCAAATTGTCCCATCAAGGTGATTCCCTCATCAAAGGAAAGGCCAAGGGCTTTAATTTGTGGTGATCCAGCTACTGCCTTGTCCATCAACTCTTGGACACCGACACCAGTTGACTGACTGGTATAAGTAACCGTGTCTAAGACACTTGATAAATCAGTCGCTTCAAGTCCATAGGCCTCAATCGCTTGTTTGGCTGAAATGGCAGAGCTGGTCACATCACTCCCATTGATTTCAGAAAACTGAATGAGTTGTGTTGCTGCAGATTTAAGAGCATCTCCTGTTAATCCAAATTGCGTATTCAACTCCCCTACGGCACTTCCTGCCGTGTTAAAATCTGTTGGCAGTTCAGTGGCTAGGGTTTTGGCGATGTCTGTCATCTCTTCAAGGGCAGAACCAGTTGCCCCAGTCTTGGTAACGATGATATCCATGCCCTCATCAACTTCAAGAAATGCGTCAAGCGATTGTTGACCGAAGTCAATCAACTTCTGTGATAACTCTCCCAGTTGGTCGCCAAACTCCATGAGAAGGTCAGCCTTTAAGAGGCTATTTGTCTCTTCTAAAGAAGCCTTGGAACTCGCAGAGCTAGATGCCAACTCCTCCATCTCATTTTGGAGATTGTTGTAAGCCGTCTTAGTCTCATTAAGAGTTTTCTCAAGCTTGTTAGCTTCAACCGAATTCTCACCATATTCGCTCTTTGTCAAAGAGAGCTGCTGTTCTAGATTATGTATCTGTTTCTCAAGGATTTCTGAATGAGAAGCAACCTTTTGTTGGGCAAGTGCTAACTTATCAGCCTCACTTGCAGTAGTTGCTAATGCTGATTCTTGTAGCTTAAAGGAACTTTGGAGTTTTTCACTTTCTGACACCAACTGTGCCTGCTCATTTTGGAGACGATTAAGTTTGGACTGATTGGTTTCAACCTGCGCTCCGTTTTCTGAGAGAGCTCTGTTAACACTTTCTAGCTTTGATTCATAGCCCTTTAAGACTGTTTGAGTACTCTCCACCTCACGTTGGAAGGCACGGTATTGGTCTGCACCGATGTTACCGGCCTTGAACTGAGCCTCGACTTGGGCTTGAGCTTGACGAAGCGTGGCGAGTTTCTCTTTAGTCGTCTCAACTTGTTTGGCTAAGACTTCCTGCTTCTGGGTCAAAAGAGTGACATTGCCAGTGTCAAACTTGAGTGCCTTGTCAATCTGACGCAGTTCTTTGGTGGCTTCAGAAGCCTGTTTATTCACACCCTTTAAGGCAGTTTGTAAGGGCTGGGTATCGCCACCAATCTCAATAGTAATCCCCTTAATGTTTCCTGCCATCGTCACTCCTCCCTCCATCAAGATACTAAGGGCGTCAAAAGCAAAGTAAAAATAGGAGATGGAAGAATGGTGCTTGCACTAAAGTCCATCTCTCTTTTTTACACAGCTTTTAGCCCGTGTTCAGTTCCTATCAAAAATTGTCAAAGTCTGCTTGGGTTGCTCTGCGAACTCCAGTCTCATCTCGACTTCTTAGCTCCACATAATCCGTCTGATAGTCAAGTGCCATGCCAATTGAGATATGCTTTAAATCGTCAATGGAAAGGCCAGTCTCCTTACAACAAGAGAGGTAGCTTTCTACCGTGAAGACTTCCTCGCTCGCTGTTTCGGAAGTTTCTGCTTTTTTCTGGTTGTCATCCCTTGGTTAAGCATGGACATTAAGACGGGGCCAACTTCCTGAAGAGGGAATTCCTCCATCGACATAAAGAAATCCTCGAAGGGCTTGATTCGAGGATTGGCTGACTTGGCAAAGACC
CAAAAGAGACGGTGGAAAAAGGTCATGTCGAAGTCAGATAAAATAGACAAGTCAATCTGACTAGCCTTTAACTCTTCCCCTTCTTCCAATTGCTCAAGCTGAGCCATGATGGATTCCGCACTCAACATGTTAAAGAGGTCGTGGAAATAATCTTTTCCGAATTGCTCTTTATAAGCAATCGGTGTATAGGCGTTAGTGGCCAGGGGATATGTTTTTCCACTAATCGTGATATTTTGTCGCATGTTCTCCTCCTTTAAGCAGCTGGTTCAAAGACCGACTTAAACCAGTTCTCACGAATCTCATCACTGGTTTCTTCCGTTGTTCTGCGTCTCACCACCTTATCAAGTGGTCGTGGGCTTGCCGTAAAGGTCAACTCTACCTCATTGATATCAGAACCTGATTTGGTTTTAGACCCAACGGTAGGCCGTGACGCATAACAGTAATAAAGCACATGAAGCGTCTCTTTCTTGTCCCCTTCAAATCGGAACATGAGGGCGAAGTTTTTCTTCTCGCTATTCGCAATTTCTGAAATGGTATTGGTCGTCGCATCAAGTTGCTCGCCCAAAACACGAGTCAGAAACTCCTGATATAAGAGAGCTACCTTAAGCGTTCCCTCGTAACCGTCATTAGATTCAGTCGTGTAAAAGTTGATGTTATCTGCCTTATAAGAACCCTTGTCTCCGGTGGGTTCAAGGGTTAGTTCAGCTGCACCACGAAGACGTTCGACCTTGCCATAGGTTAAAGCTCCGTCAGCTCCCTCATTAGTGACTTCTGCCCAGTGGACGTCTTGTAGGCCAAAGGTGACCTTGTTTTTTTCTGCCATAGTTATCCTCCTAATAGTGTGATGGAATAAATGGTTTGGTAGAGTTTCTCATTAGTGATGTAAGTCTCTACCTTGTCAAAATAAAGACGGTGGGCATCAAGGACTGATTCCACCGTTTTTTCTGTTGCTAAATCTTTCTTAGTCGTGTAAAGCTCAATCTGTACGTTGATAGCTTTGTGATAAGCCCAGTTGTCTGCCCCAAGATTGTCTGAATCCGTAACTAGATAAACCATAAAAGGCGGACTTGGACTGTGCCCTTCCTCAAAATGGTGATAGGCTACTGGGAGTTTGGTCTTTTTTAGAACAGGAAAAAGCTCTTCAAATCTCATAAGCCACCTCACAGTTTCTGTCGCAATTTGTCTTCAAAAGACTGAATTGCCTTTTTTTCGACAGGTGAGATGTGCTTTCGTCCTTCAACCCGGCCACCATTTTGTTTAGCATGCCCATCTTCAAGGAGGTGCGTCAGTCCTGGCGTTCGGTTGTGAATGGCTTTGGTCAGAGCCGTATTGGTGTCAGTCGTTGCCTTACTCGTCCACCCTTTAGCATATTTCCCACGTCGCTTTGGGGAAGTAACCTTTAAGGTGTCAACGGCATCGTCTGTGACTTCCTCAACCACCTCACGCATGGTGTCCGTAGTGTCTTTGGCATAAGTCGTCAGATCCTTTTCGATGACAGAAGCTAAATCATCAAGTCCAATCTTAGTCATAAAGCTCCTCCTTGGTCGCAACGATATAAATCAAGCTTCGAGCCACAGTATCGCCATCAATGGACTCGATAGCGTAATACTGGTCACGAAAGTAAATCCGAGTCGTTAAAGAATTAAGAGCAAGAACAGCCTTATCGTAGCGTAGGGTAAACTGCACCTTGTTGTGAATCAGTTTTGTCGCACTCCCATCACTTTCAGTTAAAGCCAGAGGACGACAAGAACACCAACGCATAAAAAGGTCATCCCAAATGGCTGACTCGTTCCCGATATCGTCCTGCTTGAGTCGCTTTTCTTGAAAGACCAGCTGTTCTCTTAGAGGCGCAATCTTCATCAGAACACATCCTTCCTGTCAGCTAAAAGCAAATGGTAGAGAGTTTCCTTTAACTCTTTGTGATTGGCTTCTTCACGGTGTTCATAAAGATAGGCAACCCCATAGAGGATTGCCGTCTTTAGAACTTCTGAAGTGGAGCT
CTCACGAAGAATATCTTCACAGATTTGGCGACTTGTTGCCATCAACTGCTCGATTAGATAGTCCTCCTCGCCATTTTCCACTTTCAGATAGAGCTTAACTTCTTCTAACGTCATCATGCCGTTTTACCTTTGATGGTCAAAGTCTTCACCGCTTCTGGTAGAACGAGTTTCCCATCCACACGCTGGCTGGCAAGAAAACCAATCTGACCATTGTTAGCGTAAAGCTCATTGAGACGTTTGAAGGTACGCCCTTGACGGTCCGCAATCCAGTAGTAAGAGAAATCACCAAAAGCAATAGCTTTGTTTCCTGCTTCTGGAAGTGGCGCAAAAGTTGACGTGTAGTAAGGACGGTTAAGAATCAGATCAGGTTGACCAGCTTGTGTAGACGGTTGCCAGATGTAATTGCCATTATTGTCCTTAAGCTTGCGGATAGCTTTTACTGTAGTGTCATGGAGAATCCAGACCGCATTCTTACGATAAGGAGCAGGAAGTGAGTGGTAAAGCTCAATCATGTCATCAAAGGTGACATCTTTGGTTGCGGTCGTTGGTCCTTCAACGTCTGCTTGCGTAAAGATACCTGTTGGTTTTTTAGAACCATCACCCACCAAGAATGATTTTTCTTCTTCTGTACCGATGCGGCGTGCAAACTCAGAAGTCATGTAAGATTCAAGGTCAAAGACAGAATCATTGAGCAATTCTTCAGAGATACGAATTGCTGTCCCAATCTTATGCGAATCAAGAGTCACTTGACCAAAAGTCTCATCCGTCTCTGGATAGAGCCCATTCTCGTCCATCCAAGAGGCAGACCCGTGACCCGTAACAACTGGAATCTTACGCTCACCACTAGAGGTTTTGATAACAGTTGCCAGGCTACGGAAAAAGTTTTCTTCCTGAAGCTCTTGTACCAATTTCTTCTCGTATTCATCAGGAACAAGGTGTCCGCCTTCTGTGTCTTCACCGACACGAAGAACATCCTTCACGTCATAGAAGTTACGCTTACGGACACTGGTCCAGAAAGTCTGGGTGTAGATGTCTGATGCCACACCTTTCTTTTCATCTTCTTTTTGGTTATCGACAATGACTGTTGGCTGCGTCGTTAGCGCTTGTGAGGTAGGTTGCGCCAGTTCAAGGTCAATCTTTTCTTGGCGCTCCAAGCGAGCAATTTCTTTATTGTAGAGCTCGATTTTAGCTTCCATTTCCTCATAGCGTTTGGAATCTTCATCCGATACCAAGCCGTCTTCAGAGCGAACAGTATCCAGAAAGGCTTTCGCTTGAGCCCAAGCAGCGTTACGTTTTTCTTTCAATTCAAGTAGTTTAGACATAGGTATTCTCCTTTTATTTCAATAGGTTCAATCGTTTTTCCAACTGATTGAAAGGGATCGTTTTCTGTGGTTTAGGCGGTTGAAGGCTCGCTTGCAGTTTCACCACCAAATCATGAGCAGCAGTCACTCTACTAAAGGTATAGCTATTTTGATAATCCTGCTCAGGTGTCTCCTCTTTCTCAAAGAGCACCTTATCCGCAAAACCAAGCTCCACGGCTTTCTTGGCATTGAACCAAGATTCCGAATCCATAAGATGAGAAATCTTAGTTCTGGAAAGTCCAGTTCTAAGCTCATAGGCATTGATAATGGACTCCTTGATTTCACCAAGCATCTCAATGACCTTGGCCATATCTTTAGCTTCACCTTGTGCAAATGTCCATGGGTTATGAATCATCATCATGGCAACTGGACTCATGGAAACTGTTGTCCCTGCCATGGCGATGACACTGGCAGCACTCGCAGCTAGACCATCAATGATGACATGGACATCGCCCTGATAATCCATAAGCATGTTATAGATTTGAGCAGCCGCAAACACATCACCCCCTGGACTATTGATCCAGAGGGTGATGTCGCCTGTTTCTGAAGTCAAGTCATTCTTAAAGAGCTGCGGGGTGACTTCATCCCCAAACCAAGTCTCGTCCGCAATCTGTCCTTCAATCCGAAGGGTGCGAACTT
CTCCTTCGTCAGTAAAATTCCAAAATTTACGCATCTTCTTCCTCCTCTGGTGGGTCTTCAGCTGGTTCCGTTTCTGTCGGTTGCTTCATGAAACCACCAGCATCTTTTAATTTGGTCATGTTGCCGTTAATCAAGTAAAGGTTTCCGCCTTCTTCATCAGATAACAAGTTCAAGTCTTCCAGCTCACGGATGTCGTTAGTCGAAAGCCATCCATTTTGCCGTGCGATGGCATAGCCATTCATACGGCTTTGGTAATCGCCACGAAGCAAACCGTCTACATTAAACTTGATAAGGTAACGTTTCTTTTCTTCGGGTAAAAAAAGAGACCTCTTAAAAGCCTGTTCTAAGCGAACCACCCAAGGGTCTAAGGTATATTTCACAAATTCAAGAGATTGTTGTTCGATATTTGAAAATGACGACTTCTCCAAATCACCAACCATATGTGGCGGAATGCGGTATAGCCGTGCAATCTCATTAATCTGAAACTTCCGTGTCTGAAGGAACTGAGCTTCCTCTGGTGGAATACCGACTTGGGTATACTTCATCCCTTCTTCAAGAACTGCCACTTTGTGAGCGTTTGTGACCCCATTATAAACGGCATTCCATGAATCACGGACTCTTTTAGGATCTTTAAGAATCCCTGGGTGCTCTAAGACACCACCTGGGTTAGCCCCATTTTTGAAGAAGGCTGCCCCATAGTTCTCAGTCGCAAGGGTCATCCCAATCACATTTTTAGCCATGGCAATAGGCGAATAACCAATCAAACCATCAAAGCCAAGACCAGGCACATGAAGGATATCCTCCTGCTTCAATAAAACTGTTCCTTTGTCTTTGAAGTTAGGATTCTCTTCAGTCTGTCGTTGGTATTTGTAGTAGAGTTTTCCTGAATCATCACGATGGACAGACATCTTGTCAGGTAAGAGCGGATAGAGGCTAATTACTCGTCCAGCCTTATCCCTGATGATCTGCACATAAGCATTTCCCCATATTAACAAGTGACTCATAATCGTCTCTCGAAAGATAAAAGAGGACATCTCTGGATTGGGTTCATCATGAAGCAGAAAGTATAAAGGATGATCGATTTTTTTCTTCTTCCCACTACTCGTCATCTCATAGACATGGATTGGTAGAGAGGCAACTGCTTCAGCTAAAATACGCACACAAGCGTAAACTGCCGTTGTCTGCATGGCCTTAAACTCATCCACATTCTCTCCACTCGTTGTCCGACCAAAAAGGTAGTAGAAATCCTGACCTTCATAACTATTTTGAGGCTTGTCTCTCGCCCTTTTTCTTCCAAGTAAATCAAGTAGTCCCATAAGCCCTCCTTATTTTTGGGTACGAAAAAAGCACCTCATTTTGAAGTGCTTTCGATATATTCTTACAAAACAGAGTATTCTCATTCAGCTAATTCTTTAAAGGCATCAAGATGGCGTGATAATACTGAATTCGCAACCTTATCCAATGTTGCCTGCTCAACAACTGTTGTCTCCTGCTCATCACTACTTAAACTTTGGTAATCCGCTGATAGTTTGCTATCTAGTGGACTTAATACCAAAGTTCCATCACTTGATTGATAAAGATTAAACTCCTGACCCTCTTGAATGCCAAATTGGTCAGGTATCGGAAGATAGATATCATCACCTATTTGTATCGTCTTAACGAGTTCCATAAATCACAACCTCACTTTACCAATTATCCATATGACGAGACCCATAGACTACAGCAACTATAATCACTTCATCTTCCAGAACATGATATATAATACGATACTTTTTGACAATCAGTTGTCTGAAGGTGTAACCTTTTCCGACTAAATCCTCAATGATGGAACAACGCTCAGGAAAAATGGATAGGGATAACATTGCCTGAGATATCTTCTCAAGGAGATTATCCGCTGCCTGTGGTGCACAGAGTTCGTCACGAACATAGTGATAAATGCTCAGCAAATCTGCTTTAGCATCATCCGAAATGGTAACCTGATACTCTTTCATTAAGCATTT
TCCTTAAATTGAGATAGAAATTCTCTGACATCCTGACGTTTACCACCTTTAGCATCTTCAAAGCTAGTGATCAGTTTATCGTAAAACTCCTCCTGACTCATATAATCGACATTAACTCGTTGAGGTGCTTCAGCCAGAGAAACATCAAATGGGATGCCACCAGTCAAAATAATCTGATTCAAAAACATATCAATCGCAGTTGACATGGGAATACCCAAGCGTTTCAATATCTCGTCTGCTGCACTTTTTACTGAATCATCAACTCGTAAATTTAAAGTTCCTGTTTTAGCCATGGCAATCCTCCTTATTATGTAACGACATTGTATCACATTTTGTGTATTGCCTCAACTAAAAGCTCAATAGCCCCCGCTCATCATAGACACTTGTTCCATTACCTTGATGGCGAATACAACGGTCAAGTCCCATGATAAGAGCCACAATACCGTCAATCTTCTCGACTGACTTTTCCTTATCTGGCTTGATGTTACCAGCAGGGTCTTGTCGCATGACCACGTTTTGTCCCATCCATTTGAGAACTGGATGACCACCGTGTTGGATTTTCCCTTCCATCATGAGCTTGTAAAATTCCTTGGACGGTGGGCTCATATCCTTATAGCCCTGCCCAAAAGGCACCATGGTTAAGCCCATCCCCTCAAGGTTCTGCACCATTTGTGTCGCATTCCATCGGTCATAGGCAATCTCCTTGATGTGGTAGGTTTCAGAGAGTTGTTCAATAAAGGCTTCGATGAAACCATAGTGAACAACATTCCCTTCGGTCGTCTTGATGTAGCCCTGCCTTTCCCAAACGTCATAAAGGACGTGGTCACGACGACATCTAAGTTCCAAGGTGTCCTCTGGTAACCAAAAAAAGGGCAAGATAATGTAGTTCTCCTCGCTATGTCGTGGTGGGAAGACCAAGACAAATGCGGTGATGTCAGAAGTGCTTGATAAGTCAAGCCCTGCGTAACAGTCACGACCTTTTAGAGCTTCATAGTCGATTGGGCTATTTCCTTTGGCATAAACATGTTCAGGTATCCAAGCCACGCTGGAACTCGTCCACATGTTGAGTCGGAGTTGCTTAAAGACATTCTCCTCTGCTGGGTTATCAAGAGCCTGTTGGTAGGCTTCACGAACACGGTCAATCCCAATGGTATGACCAAGTGATGGATTAGCTTTCAGCCAGTTGGCTTCGTCATTCCAATCATCTGCATCAGAAAGACCATAAACTACTGGATAAAAGGACGTGTCCTTCTTTCGACCTTTAAGAATATCAAGTGCCTTGGTGTGGAGTTCATAACAGATGGAGTTTTTATCAGTTCCAGCTGTTGTGATGATAAAAAAGAGAGGTTGTTCCCTAGCATCACCAGAACCCTTAGTCAAGACATCATAGAGATGGCGGTTGGGTTGGGCATGGATTTCGTCAAAGACAAGTCCTGACACGTTGAGTCCGTGTTTGGTTCCTGTCTCAGCGGAGAGGACTTGGTAAAATCCAGCGTTTGAATAGTTGACTATCCGCTTGGTTGCTCCCATGATCTTTGAGCGTTTCTCAAGCGGTCGGCTCATCAAAACCATTTGTTTGGCAACGTCAAAGACGATAGACGCTTGGTTACGGTCACAAGCCGCCCCATAAACTTCTGCACTGGCTTCATTGTCAGCGTAGAGAAGATAGAGAGCAATAGCTGCGGCGAGTTCAGACTTGCCATTTTTCTTTGGAATCTCGATGTAGGCTGTCAGAAACTGACGGTTACCATCTTCCTTAACAATTCCAAAGAGGTCACGAACTATCTGTTCCTGCCACGGCAACAAATCAAATTTCTTCCCAGCCCACTTCCCCTTGGTATGAGATAGGTTATTAATAAAGGTCACTGCTCTATCTGCCTTTGCCTTGTCGTAACGAGAAGTCGGAAGCATAAAGGGACTCGGTTCGTAGTGATAGGTCATAAAACACCTCCTAATAAATCCTCCATCTCATCACCAGTGCCAACCTCTGCGTCCATGGT
CGCCAAGCGATTACGAGCTGATGGTGTCAGACCAAACTGCTCACAGAACTTAAGCATGATTTTCAGATTGGTCTGGCTGATAGATACCTGTGGCACTTGTTGGAGATAACCATTTGGAGTCTTGATGATAGACCCATGTTTAGAGAGAAACTCTTCGGCTTCCTTCCAACGGGCATAGGCTTGGCAATAGCCTGCGAAGGCTGTCATATCCATCTCCGTTAATATCCCTATCTGTTCGAGGATTTTGCCCATCCGCTTCCATTCCTTCTTGGCATCGTCTTCGAGCCACTGTGGGCAACGTGGGGCTTTCTGTTTTAGTAGGTAGAGGTCGTTTCCCAGGGTTTCCTTCAAGTATTTTCAAATTTGTAGGCTTTGGTTTTCGCCCTCTAACTGCCACGGTCTCACCCCCTTTTTGGCACAAGAAAAGGCTTCTTAGCGAAACCTTGTCTATTCTTGTATTGCCTTTTCTATTTCTTCTTTCGTCAAGACAGTTGTCGGCTTAAGATTAAAATGATTACATTCATCAATGTAGATATCAATTTCAGTTTGCGGATGCGTACTAGTCATAAGAGTGATGACTTCTTCGTGGGTTAACTGTACAGGAATATCACCACTATCAAAAACAAATTCATTGTGCCCAGCTCTTAAATAGCAATAAATACTTCTTATAAAATCATGCCTATTTGGTTCTACAATAACCACCCTATCAATACCATGCAACACGTTTAAGCTCTGCCCATTGTCCCAAGAAACGATTAGCGATCCAATATCATCCACATCTTCTACTGTTCCTAGCATACCTATTGGAACACTGTAAGGGTCGTCCATATGCACTAAACGAACTCGAGTACCAGTTGGGTATGTTTTTCTTAACCGTTCAAGGGTTTTATCATCCATTTGCTTCTCCTTATCTCACAAAGGTTTGAGATACCTCGGCCCATAACACTTCGGTGTCTTCGTAAATGTAGCGTGCTTCCAGTTCGTTAAACCCACCAAGTTCGATACCATTTGTAATATCATTCAATAAATCATTAATTTGGTCAAGATTTCCTTGTTCAATCACATCTAGTGAATAATTTCTCGGCTTAGTGTAATGTTGTTTCATTACAGTTAGGTGAATAAAGCAGCTATCAATAATTTCGTTGAACTCATTTTTTGTCATCGTTTTGACTTCCTTTGTCTTTTGGTAACAGCATATTACCGTAAGGTTCACCTTATATCCAGTGATTAGCTACTAGTCTAAGCAGATAAAATGGCCTTCCCAATAGTATAAACCACCGTTACCGTCACGCCATTACCAGCTTGTTTGTAGAGCTGGGCATCAGAGTTGACAGCCTGAGCTTTGTCAAATAAGTCATCTGAAAAGCCTTGGAGCCTGAAGCACTCTCGTGGAGTCAAGCGTCTGATTTTTACGACTCGACCATTCCAAACCACAGCTCCCATCCGTCCACCACAAGATAGATTGTGGGCGATGCCTTTCCCAACCCTAGCTCGTCTCGTTTGTGAGCCTGGATAGGAAAGATCAACTGAATCTCCAAGCTCTGCCACTTGGTAGCCCTGCTTTGTGCCATTTCTGACCTTGATACCCTCAAGAACACCATGTCGGTCTTGAGAGGTAAGAGTAAACATCGGCTCATCCTGTTCCTTAAGCCGTCTGCCATTTTGCCGTTTTATTACCCTGTCTGGTGTTAGAATGGGTTGGACTTCAAGGACACCAGAGTTCATGGCAGTTCTCTTTGTAGCACCTGCCGTGTAGCGAGCGGTGATACACCGTGCTTCATCAGTTAGCTTTGGTTCAGTCAAAGACTGGTCAATCAGATAAAGACCTGTCTTAGCTCCCAGTCCTCCACCCTCACCAACAAGGGTTGTGGCAATGCCACTAGGGTCGTAGACTCGGTAGCTCTGCATACCACCTATAAGTTGCTTAAGATGGCTACCGCTTTCTCCGCTGAGAGGTAATACTTGTCGTCGACCCCGGCTTCTAAGATGTCCGAGAGTAT
AGATGCGTTCTCGGTTTTGGGGAACTCCGTAATCTTTTGAGTTGAACACTTGCCATTCGAGGTCATACCCTGCTTGATCCAGGATAGAGAGATAGTCGAGATAATCTCGTCCTCCGCCACTTGATAGAAGTCCCTTAACATTTTCAAGGAGTACCCACTCGGGTTTATCTTCTTCTTTTTGGCTCTGGAGGAGGTCAACAAATGTAAAAAAGAGTCCACTTCGCTCACCGTATAAGCCTGCTCGCTTTCCTGCGATAGACAAATTTTGACAAGGGCTTCCCGCAGTCCATAAATCTGCTTTTGGAAGTCGTGTGGGGTCAATGCTTGTGATGTCGTCATGAAACCATTCTCCTTCTGTATCATACATTGCTTCGTAGGACTTTCGTGCAAACTTGTCCTTTTCACAGTAGCCAAGACAGGTCATCCCTGCCAACTCCAACCCACGACGAAAGCCACCCACTCCTGCAAAGAAATCAAGAAAGGTTAGGGTCATGAGTGTTCCTCCATTACTTCTAAGGCTTGGTCATAGCTCAAAGTCTCACCGTTTCGGAGTACCGACACATCTCTATTACTAGTTGACTCCATGTAGCGTTTGACAATGACATCCACAAACTTTTCATCAAGCTCAATCCCATAACAAACTCGACCAGTCTGGTCTGCTGCCATGAGGGTTGACCCTGAACCAAGAAATGGATCAAAAACAAGCGTTCCTCGCATGGATGAGTTTTGAATAGGATAAGCCATAAGCTGAATAGGCTTCATGGTTGGGTGGTCTTTACTAGACTTAGGACGGTCGTATTCCCAGATAGTTGTCTGCTTACGGTCGCTGAACCACTGGTGTTTTCCCTTTTGTTTCCAACCAAAGAGACAGGGTTCGTGTTGCCATTGGTAAGGACTACGTCCGAGAACCAGTGAGTTCTTCTTCCAAATGCAACTCCCACTTAAATAGAAACCAGCGTCCTTAAAAGCCTTCCGGAAATTCAGACCTTCCGTGTCTGCATGGAAAACATAGATAGACGCATCGGCTTCCATATGACTTTCCACTTGAGTGAACATATCAAAAAGGAACTGGTAGAAGTCCCTGTCCGACATATTGTCATTGAGGATTTTTCCAGCGGTCTCTTCAACGTCCACGTTATAAGGCGGATCCGTCACAATAAGATTGGCTTTCTTATCCCCCAGCAATTGCTCGTAGGTTTCTGCTTCGGTGGAGTCACCACAAATCACTCGGTGTTTTCCGAGTTGCCAGATGTCACCACGTCTTGCGACCGTTGGTTTCTTCAACTCCTCATCCACATCGAAATCGTCTTCAGACAAGTCCTTGTCATGGATGTTGGACAGAATGTCATCAATCTCTGGTGGCTCAAACCCAGTCAAATCAAGGTTGAAATCAGACTCTTGCAAGTCCAAAAGCAAGTCAGCGAGGGGCTGGTCATCCCATTGACCAGTGATTTTATTAAGGGCAATATTCAGTGCCTTTTCATCTTCCTTGGAAAGAGAGACAATGACGCATTTTGCGGTTTCATACTTGAGGTCTTTAAGAACCGTTAGGCGTTGGTGACCACCGATGACCGTCAAATCGTCATTGACAATGATGGGGTCAACGTAGCCAAACTTGAGTAGACTCTGTTTAATCTTTTCGTACTCCTTATCGCCCTTCTTTAATTTCTTTCGAGGGTTATAAGAGGCCGGCTTTAAGTCACTTAGGGGGAGTTCCTTAATTTCCATATTGGGTTGAGTTGTCATGCACTACTCCTTTTCTAAATCGATGTTCTACGTAACAAGGGTGTCCGCAGAACTTCCTTGTTGGATTGGCATAGGATAAAAAAGACCTGCCACAGTTTTGGCAAGTCAATTCATCGTATGCAGTTTTGGTTTTATCATGTTGGCTTTGATGGTTTCGCCACCAGATAGCTCGACACCTATTTGAGCAGAACTTTTTAGGTCTGCCTTGGACAGCATGGTGTAACTTTTTCATGCAGTTCTTACAATATGAACGT
TCTCCTTCATCAAGCTGATACTTGACAAACTGACCCATTCCTTTTAACTTTGGGTGTCTACGGCAATATTGCTTCACCGAACCAAGAGACATGTTTAACATTTGAGCAATGGCACCATAGCCAAATCCATCTCGCCTAAGTTTCCAAATGCCTCGCCGCTGATTATCGTTCATTTGTTTTCCTCCAAGAACTAAAAATGGTTAATTTCTCTATTTTTCATGTATTTATCCCTTCAAAAACAGGCCAAAACAAACACTGGCAAACGATACTTACCCTGCTAGAAAGATTGACACCATAATGGTTCAATGCACTTTTTAACTGTTTTGGGTATGCCCCTAACGAATTTTGCGAAAATGCACGTTTGAGGGGGCGTCGGTCTTTGTGGGACAAGGGATTAGAGATTTAATCCCCCTACCCCCCAAGCGTAAAAAAGAGATACTTTTAGAACGAAACGCCAATATTAAAACCGATAGGTATACTCCACATATCGGTCAGTCGTCTTGGTCTTTCTATCGTGACAAGTTTTACAGAGAGCTTGCCAATTAGATTGATTCCAAAAGAGGTCTTGATCACCTCGGTGGGGAGTGATATGGTCAACTACCGTTGCCTTGGTCAGTCGTCCTTCTCTTTGACAGTAAACACACAAAGGGTTGAGCTTCAGGTATCTAAGCCGTTCCTTGTTCCACCGTGCGTTGTATCCTTTAGCTTTGGTTGACTTAGCATCAAGCGAGTGGTTAGCTTTGTGAATTTCACAGTACTTATTCCCATAGCTCACAAGGTTAGGACAACCGCTTTGCTTACAAGGTGTGCTTGGTCTACGTGGCATCTTACTGCTCCCAAGGAAGGTAGGACTTGGTGAAATGCCCAAGGCAAGTGGTCTTAGTGTAATCCACATCTAAGAGGTTCAACTCCTTGATGAGCCCTTGTGGGGTCAGGTCGTAGCGTTCACGAACTACTCCCTCAAGCTGTTCAGCTGGGTAATCACTGGTTCCAAAGGTGCTCACGTAAACACCAACAGGTTCAGCAACTCCAATGGCATAAGCTAACTGGACTTCACAACGTTTAGCATAGCCTTCACGGACAAAGTCCTTGGCAATCTTCCGTGCCATGTAAGCAGCTGAGCGGTCTACCTTACTAGGGTCTTTACCAGAGAAGGCACCACCTCCATGGTGGGCAAAGCCACCGTAGGTATCTGCCACAATCTTACGACCGGTTACTCCAGCGTCTGCGTAAGACCCACCCAGAACAAAACGACCGGTTGGGTTAACCAATACCTTGAAGTCAAGATTCTGACGGTAGCGTTGAGCAACGGACATCATAGCTTGGGTGACAATACGTTTGACTGAAGCAAGGTCAACCTCCTCATCGTGTTGGATAGAAACGAGGAAGGTCTCGATACGTTTGTTTTCATAATCGTAAGTGACTTGAGCCTTGGCATCTTTACCCAAGGCAGGATGACCAAGGTTGGTTAGCTTTTCAAGAACACGAGTCGCCAACACGTAAGGGAGTGGTAAGAACTCTGGTGTTTCATCGGTCGCATAACCGAACATAATCCCTTGGTCACCTGCACCACCACTATCCACACCTTGAGCGATGTCTGGACTTTGAACACCAAGGAGGTTAGTCACCATGATATCCTCCATGCCGTAAAGTTCAAGAACCTTTTTGACAATGCCTTCAAGATTGAAGAAGTGCCTGGTTGAAACTTCTCCTGCCACAACCACTTGGTTATCTTTGATAAGTGTTTCAACGGCTACACGGCTGTTCTTATCATACTTGAGACATTCCGTCACAATAGCATCTGAGATTTGGTCACAGAGCTTGTCTGGATGTCCGCTAGAAACTTGTTCACTGGTATAAATCATGTTTTCCTCCACGCAAAAAGCCCAACCCTTTTGGGCTAGGCTTGGGTTTATTTTACTGATTGTTGGCCTGCTTCGTAGGCTCTCTCGAGTGCCCTTTTGATTCCCCAAACCGAAACATCGTAGAAGTCAAGATTG
TCACTTCTTCGTGTCTCTAAGGTTTCAACCCTAAGTTCTTATTTGGCAATTTCTGTTAAAAGGGCATTGAGTTTTTCTTGTTGGCGTTTTGTCATTTTGATGACCTCCTCTTGTTTTTGTAGGTGTATATTACCGTACAAGTGGAAGGTTATCCAGTCATTACTGGGAGATTTTTTATCTTTTTTGACACTTACAATTCTACCACAAATTTTTACAAAAGGAGTTCAGAGGTAGTTCAATATAAGTTCAGCCCTAGTTCAAGGTGAGTTCATATAAAGTTCAATCGTCAAAGAACGACCCAGCTTAACGAAAATCTGTCGAATATGTTCCAGTAACTTTCTACGCCAATTATGAACTGTTCCACGACTGATATGAAATTCTCGCATTAAATAGTCCCAGTTACAGTCAGGTTTGAGAAGTTCCACCGCAAATTCCGAAAGATCTCCTTTGAGAAATCGTATCGCCATATCAAACGTTTCAAGGTCACTTGCCAAACGAATATAGCGTTGACTAAGGTCTGATAAGAGTTCCTCGTTTTCTTGAACCATCTTCTCACGAAAACTTAAGGCAATCATTTCTGACCGTTGATTTGTTGGTGTACTGGTAACTTTAGGTTCATCAGACCGCTCAAAGACTAAAGAACTAATTACCTCATTTTCCGTTACTGGTTTGAAGTTATTTAAACGGTACTTTAACATTTCCAACTCCCATTTGAGTTCATTGTAATGTGTCAGTATATATTCTGCCTTATCCATCTGTTCCTCCTACTTGTGCTTTGACGGCTTCAAGCAGCCGTGCTTGTTGGGCATCTTTATTTTCCAGTACTTTGAGAATCTCCTCGTCAATGGTGCATTCGGTTACGATGTGTTGGATAACCACAGTTTCAGCCTGTTGCCCTTGTCGCCAAAGTCGTGCGTTGGTTTGTTGGTAGAGTTCCAGTGACCACGTCAAACCGAACCAAACCAAGTGGTGTCCACCCTTTTGTAAATTTAGTCCATGACCGCTACTAGCTGGATGAAGCAGACCAACTGAGACATTCCCTTTATTCCACTCACGGATATCTTCCTCAGTTTTAAGGACTGTTCCCTTAACTTTAAGTTTCGCCAAACGTTCCTCAATGCGCTTAAGGTCGTGTTTGAACCAATAGGCGACCAGGACTGGCTCACCGTTGGCGGCTTCGATAATATCTTCAAGGGCGTCCAGTTTCTGGTCATGAAGACTAACTACTTGCTGGTCATCTGAGTAGACGGCACCATTTGCCATCTGCACCAGCTTGTTTGAAAGACTTGCCGCATTGGCAGCTGTCACTTCAGTGTCTTCAAGGTCTGCCATGATGTAGTCTTTCTTGAACTGGCTGTAGCTTGCCTTTTCTTTATCCGTCAGTCGCACCACTTTCTTGGTCGAAATCAACTCAGGCATGTCCAGATAATCCATAGCTTTCATGGAAATGGTAATGTCATCAATCTTGTCATAAATCTGACACTCCGCATAGTCCATGGGGAGGTATTCATAGACGACATTGCCATTCCTACGACCCTCACGGAAATAACGACCACGGTATTCGCCAATGAACCGACCCAACCGTTCACCACCGTCAATGACCTTGAACTCCGCAAACAAGTCCATGAGTCCGTTTGAGGAAGGAGTTCCTGTCAGCCCAACTACTCGTTTCATGTAAGGACGCATAGCCATGAAGGCCTTGAAACGTTTAGACTGCCATGATTTGAATGAACTTAATTCATCGATGACAACCATGTCCCACTTGAAGTAGGGGCTGCACTGTTCCACCAGCCAAGGGAGATTTTCACGATTGACAATGTAGATGTCTGCGTCTTTCTCAAGGGCTGCTTGTCTTTGTTTGGGAGTGCCCACAATCTTGGAATAGCGGAGATGGTTCAGCTCCTCCCACTGGTCAATTTCATCACTCCAGACTGTGGTGGCAACACGAAGGGGTGCAATCACCAAAACTTTATAAACTTCGTAGCGGTCAAACATCAGCT
CGTTAATAGCTGATAGGGTGGTGGCTGTTTTCCCCATCCCCATGTCTAGGATGACCGCTGCATAGGGAGTTCTTATGATGAAGTCCTTGGTGACTTCTTGATAGTCATGTAGTTTCAATTTCATCTAGCACTTCTCCAATCTTATCAACTCTGTCTAGCACATGGACCTTGAAACCTAATCGCTCAAATAGTCTGTGCCTAGAGACTTGTAACAAACGTGGCTTTTCGCCAGGGGCTTTCACTTCCACTAGGCCAAACTTGCCACTAGGTAAAAACACCAACCTATCTGGCACACCTGCAAAAGATGGTGACACCCACTTAGGACAAATCCCACCACGGTTTCTGACTTCACTCACTAACTTTCTCTCAACAACTTTTTCTCGCATGATAAATCCTTTCGTCAGATAAAAGAGTGGAGGTCTAATGAGGTCATTTCCTAAACTTTCCCTATGTGCTTTTTATTAGTATTTTTATTTGCATAGAGATAGTTATAGAAAAGACCATCACTGACTTACACTAAATCGAGAAATAGATGTCGTGGAACTCAATCCATAAACTTTTTAGTATTCATTTCTTGATAGATTTTTTCTTTTTGGTTTCCACGACTGTCACTCCTAAAATCTCCACTTTCTGATGGTGTGGAACTCAAAGCCGTTTGGTGGAGGTCACTTAGTCAAGGAAATCATCACCATCATCAGCTAGTTTTAGACCCATGATGAAGTTTCCTTTATTGGTCCGCTTACGTTCAAATCCAGCTTGAGCTAAGGCTGCATAGAAATCTGTTGTGTTGCGCGTGTACTCCATGTTTTGGACGCAATAAGCACGGTAACGGCTATAAAGCTCCCCAGATTTCTCGCTCAATTTATCTCCCACTTCACAACACTCGCTTAAGAAGTGTCCTAGCCAATCGTTGGCCTCTCGGTAAACTTTGACTGAGTTTGCGACTGCAGCAGGAACTTTTGTTTTGAAGTTTGCCTTGATAGCTTTCTCAGCTCCTTCAATAATCCAAGACATAATGGCTGGTGCGGCATGGTCATACAAATAGTCCGCAAAGTTTTTGATATCAGAGCGACCAGTGATTTTGGCGTTAAATGGGATAACAACCAAACGACGCCAAGTCCCATCATCGTTCGCTCCTACTTTAGGCAGATGATTTGTGTAAAGAACCAGCGTATGAGAAGGTACAAAGTGAAAAGGATCCTTGTACTTCTTCTCTGCTTGGATTTCGTCCGTAGAGGTAATCTGCTTAACAACAGCAGTATTGAGTCGCATTCCCTCAGCCATCTCTGAAGCAATCACAAGACGCTTTCCTTTAAGCTCCGCAAGTTCAGGGCTCACGTTTCTCTTGTTAGACATGGTTAAGGCATCAGCCGATAATTTCCCAGAATAGCTTCCTAGCACACGAGCAATCGTGTTCCAAAAGGTCGACTTACCGTTCGCACCGCCACCATAGGCAATAATCATATGTTCCTGATAAACCTTACCAATAGCTGCCATCCCAATGATTTCTTGAACATAGTCAATCAACTCTTGGTCGTTACAGAAAAAGGTAGCCAAGGTTTCCTGCCACAATCCCATGCCTTGGTCGCCTGGAGATACTGTGGTCATTTTCGTAATGTAGTCTTTGGGATCATGCTCATGTGAACCCGATAAACCAATACGCAAGTCGTAAGTAGCCTCAGGAGTATTAAGCACCATATCATCCTTATCAAGCTCGGATAAATCAATGGCAAGCATAGGCTTAGCGGTATTATGGGTTGCTGTGATGTAACGATAATCACGGCGCTTCATCACAAATTGATAGTAGGTTCTGGCAGCTAGATAAGTCGTATAGAGCTTTTGTTGGGTTGGTGTTTCAATCACTTTAGCTAGTGCTTTTCCTCCCTCTCGAACTAGGTTTTCTGAAACACCAGTGTTAACTAAGTCTTTGATAACTTTCTCGTATTTGTCGCTAGCATCTTCCAGCTGCAAATCCATAAATTCAAGGACTGCACCAATAG
CTAATTGCTTATCTTCTTTCCAGTACTGACCTGTAAAGGTAAGGTAATCCGTCGCATTCGTATAAGCTAACTTTTCTCCGTATTCACGAGCAAGAACTCCTGCTTCTCCAATATCGGAATAATCATCAGGTTTTAACTCCCCACGATTAAAGGCTTCTGGCGACACATACCCTTCTGAGCCTTTAATGGTCCGATTGTAGAAACGCACCGCACTTCCCCAGATAGTATCTAGTTCAGTCTTATCAAGTGGTGGATCACATTTCAAGGCTTGTTCATCAAATCCGTCACGAGCTTCCTGTGTTATCCCAAGGCGTTTGAGGATTTTAGCCGCAAACTGCGACATGGTTGAGTTACGGCTTCCCTCAGTAATTGGTCCAGTTGGTGGAGTATAGAAATCCGCATCGAAATCTTCTTCATCTGAATCAAACGAAGAGTCCAATAAATCAGCATCTATAGTCAGCCATGAATCATTCCAAAAAACTTGTGCATTTGGATTACCGAAGAAGAAACGTGCTGCATCCTTAGCGTTCGTATCAAAGAAGTTATACTGATTCGTCAACTCTTCTTTTAGAAAGGCATAGGTATCTTTATCAGTAACCTCATTGATTTGGAAGTAAATATGGAATTTAGGTCTTGCTACCTTACTGCCTTTTTGAACCATGTGGTTTCGACTTGTAACCAAAGCAAAATGGTAATCCGAAAACAGTATTTTTAGGTATTCCTCTGTAATCCAATCATCATGATTTTCTGTATGGTCATTATCAATATCCATGACTAATACATCTGACTTAAGGAAATTAGCGTTCGATCGAGTGTTATTGGAAAACAATCCTGCCACATGGTCGTACTGAGCAATACGTTTTAGACTAGTTTCATCCGTGATTGTCACTTGGTGCGGGTAGACCGTTGTTGTTTGAACACCAGATTGCCCTGAATGAGATAAGGTAAATTGCATGCATCATGCCTCCATCTTTCGTTGTTGAATTAGGAATTACTCTTCCTAACTTACTAAGTAAGAATCCGACAGGATTTTCCGCACTAACAGAAAATTTTTTCTAAAAAAATAAAAGTTTCCTATTAAATGCACAGGAAACTTTTTTTGATGAGCAAATTTTTTCTTATCAGGCGGAAAAATTTATCTCAACCCTACTTAGTATGGTGTAAGGGATAAAAAATAAAAAAATCTCTTCCAAAGTGGAAAATCCACTCAAAACCTTACTTAGTAAGATAGGAGGACCCAATATGGCAAACGAACCATACATCGAACCTGATGATGATGTGGCTGATACCCTCATAGCCATCAGCGTTATCTCAAAACTACTCGCTCGGAAAATTACGGAGGAAAGACAACATGAGCAAAATGAAACAACTGAATGAACTGATTAATGAAATGGAAGGCACAGCCAAATACTATCTTCGCTTAGTAGATGAGTTCAAGAAGATCCTCTCTACTGAGGAAGAAACTACAACAACTTCAAAAGAACCAAAACATAAACCACAAAAAGAACTCAAACTCGAAGACGTTCGTTCCGTCCTTGCGACAAAAGCCAAAGATGGCTACAAGAACGAAGTTCGTGCTCTTCTCAATAAATATGGTGCAGAATCCCTATCAGCCTTAGCAACTGAGCACTACGCAGCGGTTCTTGAAGAAGCTGGAGGAATTGGCCATGACTAACCATGCTGTCCTATCTGCTTCCGCATCCCATCGCTGGCTCAACTGTCCGCCTTCTGTTCGCTTAACCGAGGACATGCCAGATGTTACTTCTGAATTTGCCCTTGAGGGAACTGACGCTCACGAGCTCTGTGCTTACCTTGTTGAGAAGGCACTAGGCAGAAAGGCGCGTGATCCAACTGAGGATCTGTCCTTCTACAATGAAGAGATGCAAAATTGTGCCGAGGAATATCGCAACTACGTCATGGAACAGGTCGAGAAAGCTAAAGACTACTCTCATGACCCAACAGTACTTATCGAGCAACGTTTGGACTTCTCCAAATGGGTA
CCTGAAGGGTTCGGTACTGGCGACTGTCTGATTGTGGCAGATGGACTTCTTCAAGTGATCGACTACAAGCACGGTTTGGGCATTCTGGTCGATGCCGACCACAACCCACAAATGATGTGCTATGCCCTAGGTGCTCTTGAGATGTTCGATGGAATCTATGATTTTGATAATGTCACCATGACTATCTTTCAACCACGGAAGAACAATATCTCTACCTTTGAAATGGATAAGGCTGAACTGCTTGAATGGGCGGAAGACCAGCTCTCACCTAAAGCTGAACTTGCCTTTAAAGGCGAGGGAGAACTGAAATCTGGTAAACACTGCCAGTTCTGCAAGATTAAGAATGTCTGTCGCAAACGTGCGGAGGAGAATTTAGCACTTGCCAAGATGGAGTTTGCGGATCCTGCTACTCTAGACTATGAGGATATTGCAGAGATTTTGACTAAACTGGACTTACTGGTTTCATGGGCAAACGATGTCAAAGCCTATGCTTTGAAAGAAGCTACTGAGGGACACTCCATTCCAGGCTACAAATTAGTAGAGGGACGCTCAGTTCGTAAGTTTTCAGACGAAGCTGCCGTCAGTCAAGCTGTGATGGATGCTGGCTTTGATCCTTACGAAAAGAAACTCCTAACTATCACTGCCATGACTAAACTCCTTGGCAAGAAAACCTTTAACGACCTGCTTGGTGGTCTGATTGTAAAACAAAGCGGTAAACCAACACTCGTTCCTCTTGACGACAGTCGTCAAGAATTGAACCTAGCTACTAATGAATTTAAAGAGGATTAAACGTATGACAACTAAAGTACAAACTACAAAAGTAATCACTGGTAAAAACACACGCTTCAGCTACTTGAATGCCAATGAACCTAAGTCCATCAACGGAAGCACGCCAAAGTACAGCGTCTCTCTTATCATTCCAAAGGATGATATTGAAACTGTCGATAAAATCAAAGCAGCCATTGAGCTTGCCTACAAGGAAGGCGAGTCCAAACTCAAAGGTAATGGAAAATCTGTCCCAGCACTTTCTATCCTTAAAACTCCACTTCGTGATGGAGACTTAGAGCGTCCTGATGATGAAGCTTATCGCAATGCCTACTTCGTCAATGCCAACTCACCACATAAGCCTGGGGTTGTGGACGCTAATCGACAAGAGATTATAGATACTTCTGAACTCTACTCAGGTATCTATGGCCGTGCGTCTATTTCCTTCTATGCCTTCAACTCTAACGGTAACAAGGGTATCGCCTGTGGTTTGAATAACTTGCAAAAACTCCGTGATGGAGAGCCACTTGGTGGACGTACTCGTGCTGAAGACGACTTTGCGACTGATGACGATGATGATTTCTTGAACTAAGAACGGAGGACTAAATATGTTTGAAACAATTTTCTTTTACTCACTTATTGGCATTTACCTATTCTTTGGTCTGTACCTCAACTACATGACCATCCGTGATGATATTCGTCGTGAAAAAGAACGTAAGGCTGAAAAGAAACATCATAGCAACAACACAACACCGCTACATCGTAGCCGATAACACCTCCGGTGGCAGCCACTCCTGCCACCTTTTTATGAAAGGACAAGCTATGCTAATAAAAGAACTATCCATCGACTTAGAGACCTACTGTGAGGTAGATTTGAGAAAGTCTGGTGTTTATAGCTACGCAGAAGATGATTCTTTTGAAATCCTTCTCTTAGCAGTTTCTGTTGACAATGGTCCAGTAACAGTTTATGACCTAACTAAAGAAAATCTTCCTGATCAAATCCTACAAGCATTGGTGAATGACTCCATTATTAAGTGGGCTTTTAATGCCTCATTTGAACGCATCTGTCTGTCTAACTGGCTAAAGAAACATCATCCAAAATTATTGTCCGAGGGCTTTCTGTCTCCAAACTCATGGCGTTGTAGCATGGTTTGGTCAGCATATCTTGGACTTCCACTCTCCCTTGAAGGAGTCGGAACAGTTCTAAAACTCAAAGAACAGAAGTTAAAAGAAGG
CGGGGATGTGATTCGTTACTTCTGCCTGCCCTGCAAACCTACCAAAATTAATGGTGGACGAAAACGAAACTTCCCTCATCACGCACCTGATAAGTGGGCAGCCTTTATCAACTACAACAAGCGTGACGTTGAGGTTGAGTTAGCCATCAAAAATAAACTCCGTAACCACCCTGTTCCTGACTTTCTTTGGGAAGAGTATCATCAAGACCAAAATATCAATGATCGTGGGATTGGTATTGATGTAGACTTTGTCAAAGCAGCTATTACCATTGACGAGGAAAGCAAATCTAAAATTCAAGAGGAACTTAAAGAACTTACTGGGCTTGAAAATCCCAACTCTGTTCTTCAAATGATTGGCTGGCTACGAGAACACGGAGTAACGACTAATTCTCTTGATAAGAAAGCTGTCAAAGAGCTATTAAAGGTAGTCGATGCAAAGACAACTAAAGTCCTAAAGTTAAGACAACAGGCGGCTAAATCTAGCGTTTCTAAATACCAAGCTATGGTGAACTGTGTTTGTTTGGATGGTCGAGCTAGAGGGATGTTCCAATTCTATGGAGCAAATCGTACGGGTCGTTGGGCTGGGCGATTGGTACAACTTCAGAACCTCCCACAGAACCATCTTCCTGACCTTAAAGAGGCTAGAGACCTCTTCAAAACTGGTGACTTAGAGGCAACTGATCTCCTCTATGGTACGCAAGATACCCTATCTCAACTCATCCGTACTGCTTTTGTACCTAGTGATGGGAAAAAGTTTATCGTCTGTGACTTTTCTGCCATAGAAGCGCGAGTATTATCTCACCTAGCTGGCGAAAAATGGCGAAGCATGGTCTTTGAACAAGGCAAGGACATCTACTGTATGTCAGCTAGCCAGATGTTTGGAGTGCCTGTTGAGAAACATGGACGTAACGCAGACTTGCGTCAGAAAGGGAAAATTGCAGAGTTGGCCTGTGGCTATGGCGGAGCAGTCGGGGCACTTAAAGCCATGGGGGCTATTGACATGGGGCTTGATGAACAGGAGCTGCAGCCTCTTGTGGACTCGTGGAGACAAGCCAACCCAAACATCGTACTCTTTTGGTGGGATGTTGATAAAGCTGTAAAGACTGCAATAAAGTACCAAAAGCAAACTGAAACCCATGGTATTCAATTCAAAGTAAGAAAAGGGATGTTATTTATTACTCTTCCTTCTGGACGCAAACTCGCCTATGTCAAACCTAAAATGGGAGAAAACCAATTTGGTGGAGAGTCCGTCACCTACGAAGGTACAGGAACTGCTAAACGTTGGGAGAGACTTGAAAGCTATGGTCCAAAATTTGTCGAGAATATTATTCAAGCTATAAGTCGAGACATACTCGCTTACTCTATGAAACAACTGAAAGACTTCAGAATTGTAGGACATGTGCATGATGAAATCATCATTGAGTGTGACCAGAGCCAAAATCTTGAGCAAATCGCAACTTTGATGGGAAAAGCACCATACTGGATGCCTGATATTAACCTCAGAGCTGATGGATACGAGTGTCTCTTCTATCAAAAAGACTGACAAAAAATCGCCACCTCAGTTGAGATGGCGATTTGGTTTTATTTGTTGAGTTCTTTGTAAAGTTTCAGCCCTTCTTTCTGGGCATTATTGACTTTCTTATAGCCTGCAGATTTCTTAACACCTAGGTACTCGAAGATTTCTTTATTCTCATAACCTTCAAAAAGCATATCTAAAATTTCTGGTGCTTGGAAATTGGTTGCTGCGAGTTTAGCTTTCAAGAAGTCCAGTTGATCCATTACTAGATATAGTTCAATACCTTCATCAGCAATTGCTAAATCCTGCTTTTCAGTGAAGACTTCCCAGGAGGAGACCTCTAAAGTATTCTTTTTAGGTTTACGAAAATCCTTGAGATAATCATTGACTGAGTTGTTATACCACCAAACCATTTGTTCATACTCCTCTTCTGCCACAGGGACAAAGGCTGTTAGAATTGGAATACTCATAATACGGCATTGTC
GAAATGTACCACGAATCAGTCCTGAATATTCTGAGGACATATAATAGTCCTGAACGAACATTGGCGCTAGTACTTCACCCTTTGATGGCTCCACACCAGTTGATGAAGATTGAGTTTGACAGTAGTTGAAAAAGTTGACATTGATTGGCATGCTTTTGACTCCGTTTCTTTGGCAAGCAAAGAAAAAGAGTATGACAAACCAATATTCAGTTGTGTATTTGACCACATCAGCAATTCCTTTGCTTAATCATGGTCAACTGGCTTTATAAGTTGAGCTGCTTTCCCTAAAAACACAGTTAAAGCCAATAATGTAGGAATATTCCCCATTACAAAGTATTTTTAAAGAGACGATAACTTGTAGGATTAATCTTTACTCTCTCAGTATATAGTTTTACTTCTATCAAAAACAGGTAGTCCAAACTTCCGCCTTAAAACGCCCTTGGCAACAGATTTTTGGAATTTTTATGTAGTAAAATCATACTAATTCAATCTCAAAACCGGAAGCTGAGTTTCCGCTTTTTGAAACCAAAAAAATAAACCATCACAAATTTGTGATGGTTTATAAGGTTTTCATTTCTGCAGCTTGTAATATTGAATTTATTTCACCCAAGGGTTTATAGTAACTTGTTGATAGCAACATTTTATATACTTGGTGTGGTGGAGAAATTGTAAGTTGATGACCTGCTTTCTTTATCATATCGTCACTAAGTTCAGGTCTTAATTTTAAAGCAAAACAAAATGACAATGTTAACTCAAGCCTTGGTAAATTATCTTCTAGTGTTTCATAATCTCTCAACGTTCGTTCTGTAATTCCGACTAGCTTAGCCAAAAATGGTTGTGTGCAATTTTTTCGCTTTCTATGACTACGTAATGTCCCAGAAAATTCAAAAGGTAGTTCTTTCAATAATTCAGAGATTTTTTTCCCGAGTTTCATCATATCCAGTGGAGGGAGCTGATCCATCAAACTTGGATTCTGGAGAATATCTACAAAGTCAGCCTTAATCTCACTTTCCTTTGTTACGCCTCTATTCAGTACATAATCATAGTATGTCTCATTAGAAATCGAGGTGAAATTTTTAGACTTTACTTTAAAGATTAGACAACATTCATCCATGTGCTCGTAAGCATAGTCAGTCATAATTGGCCCATCTTTTGTCATATAAATAAATTTCTTATCCTTTAAACAAAGATGGTTATCAACGTAAATAAACTTATTTCTATCAATAATCTGTCTAAAACTCTCGTTAAAACAATACTCAAAACACAAGTCATTTGAAGTAATAGTATAACTACTTCCTTTATCAAATGCTTCAAGTTCAAAAGCAAAGTTATGCATATATCTATCATCAAGATAATTATATACACCATTTGCTTCCTTAAATCCTAAATCAATCATTCTAATTTTAGCAGCCTGCCTAGATACCTCAAAGAAAGTAGCTAAGTTGTCTACCACTTCTTGCACTAATTCAGAACGACTTATATCAGGATTAACCAATGTCAAAGTTTGAAATAGCTCCCTAATCTTTATTCTAGTTTGGACTTTGGGCATAAGAATTCGGGGAGCAATTCCATTAGCTTGCCACTCCATCCAGTCAAGCGACGTCCACATACTAGAATCAGCTAAATTTTCTTCTGTCCAGCTACTAACTTGTGAGTGGTCCTTATCAAGTACCATCTTGACTTCGTGAAATACTTTATGGAGTTCCCAGTGAACACATTCATGGATGACCGTATTGTTATACGAACCTACATTCCGTTTGAAAACAACATCCTTATCTACAAGGATACTCCCTTTGTTAAAATGCTTAGAAACTAACTGCTCATCCTCTATGACCTCAACATCAGTATCTTTAAAAACCATTTTCCCGAAAACTGAATTATCTATAGTTAGTTTCTCTTGATGGATAGACAGTCCCATTTCAGAAACTATTGTCTCAACTGGAACAGGGGTTGGTTGAGTCAGGGCTTGAGGATAGTACTTCCTTAAGAATTTTTCAGCA
ATATCATCAAAGTCCTTCTTCCTTATATATGGAACCCAGTCTTTACTCAATTTTAAGTTTCGGGACTTTTTGTACTGATCTGATTTAAATTCTGTATTGTAAATTCGAAAGTTTTTGATATCAGCATCAAGTTCGACTTCAGCATATACAGACACAAACTTAGTTTTAGTATCAACCTCCATCTCACCTTTGATATACTGCCGAACAATTACATTAGCAATCACAATGATTTCAAGATTTAATTCGCCATTATTAATAATTTCATAGTTTATCTTATATAACTCAAAATTATCAAACTCAATAAAACCATTCGGTTCTGGCACCATGTATGTAGACAAATCAGTATTATCTTTATTATTAAAAATAAATCCTTTAACAGTTTTGACTATTTGCTCATAGTAGGTATCAAAAATATATTTATCGAACAT
Protein sequences of DBSCAN-SWA_2 >NZ_CP016501|1140008:1182358|1159956_1161246_-|WP_000523688.1|portal|DBSCAN-SWA MGLLDLLGRKRARDKPQNSYEGQDFYYLFGRTTSGENVDEFKAMQTTAVYACVRILAEAVASLPIHVYEMTSSGKKKKIDHPLYFLLHDEPNPEMSSFIFRETIMSHLLIWGNAYVQIIRDKAGRVISLYPLLPDKMSVHRDDSGKLYYKYQRQTEENPNFKDKGTVLLKQEDILHVPGLGFDGLIGYSPIAMAKNVIGMTLATENYGAAFFKNGANPGGVLEHPGILKDPKRVRDSWNAVYNGVTNAHKVAVLEEGMKYTQVGIPPEEAQFLQTRKFQINEIARLYRIPPHMVGDLEKSSFSNIEQQSLEFVKYTLDPWVVRLEQAFKRSLFLPEEKKRYLIKFNVDGLLRGDYQSRMNGYAIARQNGWLSTNDIRELEDLNLLSDEEGGNLYLINGNMTKLKDAGGFMKQPTETEPAEDPPEEEEDA >NZ_CP016501|1140008:1182358|1141737_1143042_-|WP_079254335.1|DBSCAN-SWA MAAYCRVSTDQDEQLSSYENQVSYYRDYILKHEDYELVDIYADEGISATNTKKREAFNRLIQDCRAGKVDRILVKSISRFARNTLDCIKYVRELKELGIGVTFEKENIDSLDSKGEVLLTILSSLAQDESRSISENATWGIRKKFERGEVRVNTTKFMGYDKDENGKLVINPEQAEVVKHIYHQFLKGYSPESIAKELNTNGIKGWSGKANWYPSSILKMLQNEKYKGDALLQKTYTVDFLTKKRTENDGQVNQYYVTNNHEAIIDTEMWDTVQEELARRKDFREKHKLKTYIMQSDDNPFTTKVFCAECGSAFGRKNWTTSRGKRKVWQCNNRYKVKGQIGCQNNHIDEEMLEKAFMKAVELLQEHRADVVVKWQKLEKGTNLLHKHYAKQMYQLLDLEKFDGTIMNKVLDHISISEVGQIIVIFLEGTEVEL >NZ_CP016501|1140008:1182358|1156755_1157082_-|WP_001209971.1|DBSCAN-SWA MRFEELFPVLKKTKLPVAYHHFEEGHSPSPPFMVYLVTDSDNLGADNWAYHKAINVQIELYTTKKDLATEKTVESVLDAHRLYFDKVETYITNEKLYQTIYSITLLGG >NZ_CP016501|1140008:1182358|1168757_1169795_-|WP_000640707.1|DBSCAN-SWA MIYTSEQVSSGHPDKLCDQISDAIVTECLKYDKNSRVAVETLIKDNQVVVAGEVSTRHFFNLEGIVKKVLELYGMEDIMVTNLLGVQSPDIAQGVDSGGAGDQGIMFGYATDETPEFLPLPYVLATRVLEKLTNLGHPALGKDAKAQVTYDYENKRIETFLVSIQHDEEVDLASVKRIVTQAMMSVAQRYRQNLDFKVLVNPTGRFVLGGSYADAGVTGRKIVADTYGGFAHHGGGAFSGKDPSKVDRSAAYMARKIAKDFVREGYAKRCEVQLAYAIGVAEPVGVYVSTFGTSDYPAEQLEGVVRERYDLTPQGLIKELNLLDVDYTKTTCLGHFTKSYLPWEQ >NZ_CP016501|1140008:1182358|1140894_1141668_-|WP_000504421.1|DBSCAN-SWA MGENLNSNIDPRIIFKQLPPATIIKGFVAGETKCNYEIYLLELLNKSVYFREKGKSKFHAPEDESHGECDAVADDYKIDFKLLSSSSRLQASSLFSPSITNLGNGITAIGESRNPNGEIKATQIHVAFRFRTVSDLIRLKVKRQHVRKQCLEKDIIKVLNMLEKQKNLLLFFPYIFYTEEEISTNELDDIILNAISYDFSSLFAYRHIKVQSFETYLLTIVRDEFNIFKISNKRLKLIEKLSCSELPTFVKLKSYSI 
>NZ_CP016501|1140008:1182358|1168390_1168756_-|WP_001138031.1|DBSCAN-SWA MPRRPSTPCKQSGCPNLVSYGNKYCEIHKANHSLDAKSTKAKGYNARWNKERLRYLKLNPLCVYCQREGRLTKATVVDHITPHRGDQDLFWNQSNWQALCKTCHDRKTKTTDRYVEYTYRF >NZ_CP016501|1140008:1182358|1170677_1172054_-|WP_000768926.1|DBSCAN-SWA MKLKLHDYQEVTKDFIIRTPYAAVILDMGMGKTATTLSAINELMFDRYEVYKVLVIAPLRVATTVWSDEIDQWEELNHLRYSKIVGTPKQRQAALEKDADIYIVNRENLPWLVEQCSPYFKWDMVVIDELSSFKSWQSKRFKAFMAMRPYMKRVVGLTGTPSSNGLMDLFAEFKVIDGGERLGRFIGEYRGRYFREGRRNGNVVYEYLPMDYAECQIYDKIDDITISMKAMDYLDMPELISTKKVVRLTDKEKASYSQFKKDYIMADLEDTEVTAANAASLSNKLVQMANGAVYSDDQQVVSLHDQKLDALEDIIEAANGEPVLVAYWFKHDLKRIEERLAKLKVKGTVLKTEEDIREWNKGNVSVGLLHPASSGHGLNLQKGGHHLVWFGLTWSLELYQQTNARLWRQGQQAETVVIQHIVTECTIDEEILKVLENKDAQQARLLEAVKAQVGGTDG >NZ_CP016501|1140008:1182358|1157451_1157790_-|WP_000684962.1|head,tail|DBSCAN-SWA MKIAPLREQLVFQEKRLKQDDIGNESAIWDDLFMRWCSCRPLALTESDGSATKLIHNKVQFTLRYDKAVLALNSLTTRIYFRDQYYAIESIDGDTVARSLIYIVATKEELYD >NZ_CP016501|1140008:1182358|1144765_1144873_-|WP_000123549.1|holin|DBSCAN-SWA MTAFEVVQIIIGFGSFTVALIGLCYKIFKDDDKKK >NZ_CP016501|1140008:1182358|1177517_1179473_+|WP_000905646.1|DBSCAN-SWA MLIKELSIDLETYCEVDLRKSGVYSYAEDDSFEILLLAVSVDNGPVTVYDLTKENLPDQILQALVNDSIIKWAFNASFERICLSNWLKKHHPKLLSEGFLSPNSWRCSMVWSAYLGLPLSLEGVGTVLKLKEQKLKEGGDVIRYFCLPCKPTKINGGRKRNFPHHAPDKWAAFINYNKRDVEVELAIKNKLRNHPVPDFLWEEYHQDQNINDRGIGIDVDFVKAAITIDEESKSKIQEELKELTGLENPNSVLQMIGWLREHGVTTNSLDKKAVKELLKVVDAKTTKVLKLRQQAAKSSVSKYQAMVNCVCLDGRARGMFQFYGANRTGRWAGRLVQLQNLPQNHLPDLKEARDLFKTGDLEATDLLYGTQDTLSQLIRTAFVPSDGKKFIVCDFSAIEARVLSHLAGEKWRSMVFEQGKDIYCMSASQMFGVPVEKHGRNADLRQKGKIAELACGYGGAVGALKAMGAIDMGLDEQELQPLVDSWRQANPNIVLFWWDVDKAVKTAIKYQKQTETHGIQFKVRKGMLFITLPSGRKLAYVKPKMGENQFGGESVTYEGTGTAKRWERLESYGPKFVENIIQAISRDILAYSMKQLKDFRIVGHVHDEIIIECDQSQNLEQIATLMGKAPYWMPDINLRADGYECLFYQKD >NZ_CP016501|1140008:1182358|1164837_1165092_-|WP_000164539.1|DBSCAN-SWA MTKNEFNEIIDSCFIHLTVMKQHYTKPRNYSLDVIEQGNLDQINDLLNDITNGIELGGFNELEARYIYEDTEVLWAEVSQTFVR >NZ_CP016501|1140008:1182358|1155752_1156172_-|WP_001249616.1|DBSCAN-SWA 
MRQNITISGKTYPLATNAYTPIAYKEQFGKDYFHDLFNMLSAESIMAQLEQLEEGEELKASQIDLSILSDFDMTFFHRLFWVFAKSANPRIKPFEDFFMSMEEFPLQEVGPVLMSMLNQGMTTRKKQKLPKQRARKSSR >NZ_CP016501|1140008:1182358|1145008_1146415_-|WP_000143479.1|DBSCAN-SWA MTFLSKIKDGCLASWEHGILPSVSAAQAILESGWGESLLAQYPNHNLFGIKASSDWKGKRVDLPTQEYIDGKFVTVEATFRKYDSWEESIKDHALFFSETAWRRSHYQNVLGEEDYKKTCLALQASGYATDPNYGSKLITLIEAHHLNTWDDRILNKKGETTMSKHLVICGHGQGQTGYDPGATNPSLGITEAGKVREFANLMKKYSGNRIDYITDHNVYDYRSIGSLGNGYESITELHFNAFNGQARGSEILIYSGYTADSLDQKLLAILAKRFTNRGFKQVNWLYNANVSASRGYNYRLVEIAFIDNNSDVGIYEANKDSMAREFVQAITGQAQVISPSPNTPQSRVTSYHVGDPVTVQQHATHYQTGQAISSWVKGKTFKVIRVKDVNQSNSKKAYLLEGINSWVLEQDVKGTTNGHSEQTYTVQKGDTLYGIARKFKTSVSELVRLNSIINPSLISVGQKLKLK >NZ_CP016501|1140008:1182358|1175281_1175605_+|WP_000041407.1|DBSCAN-SWA MSKMKQLNELINEMEGTAKYYLRLVDEFKKILSTEEETTTTSKEPKHKPQKELKLEDVRSVLATKAKDGYKNEVRALLNKYGAESLSALATEHYAAVLEEAGGIGHD >NZ_CP016501|1140008:1182358|1157090_1157459_-|WP_000161241.1|DBSCAN-SWA MTKIGLDDLASVIEKDLTTYAKDTTDTMREVVEEVTDDAVDTLKVTSPKRRGKYAKGWTSKATTDTNTALTKAIHNRTPGLTHLLEDGHAKQNGGRVEGRKHISPVEKKAIQSFEDKLRQKL >NZ_CP016501|1140008:1182358|1151616_1152342_-|WP_000589863.1|DBSCAN-SWA MIKHNELVLNRKGTSSFPFKVLVEDRPSIQVPRSKTQLLDHRGLSGAIVQTNKHREVIEKSYRLYLIGASEKEVNEFSAFLMQEGFWLESERLKLTRLWCYRTDSFDIKQDDHDVFVIDVTFICHPTRFFKNVDRQVLTTNGVLKTQGSALAFPTITITGQSVSETSFTVGDQVIRIEKFTEPLVMVNHPERPSFKTLSGKAVKWSGDFITIDASHPTQSVGVILGSGILSLTFETNWGWV >NZ_CP016501|1140008:1182358|1156183_1156753_-|WP_000818565.1|DBSCAN-SWA MAEKNKVTFGLQDVHWAEVTNEGADGALTYGKVERLRGAAELTLEPTGDKGSYKADNINFYTTESNDGYEGTLKVALLYQEFLTRVLGEQLDATTNTISEIANSEKKNFALMFRFEGDKKETLHVLYYCYASRPTVGSKTKSGSDINEVELTFTASPRPLDKVVRRRTTEETSDEIRENWFKSVFEPAA >NZ_CP016501|1140008:1182358|1162278_1163871_-|WP_000220985.1|terminase|DBSCAN-SWA 
MTYHYEPSPFMLPTSRYDKAKADRAVTFINNLSHTKGKWAGKKFDLLPWQEQIVRDLFGIVKEDGNRQFLTAYIEIPKKNGKSELAAAIALYLLYADNEASAEVYGAACDRNQASIVFDVAKQMVLMSRPLEKRSKIMGATKRIVNYSNAGFYQVLSAETGTKHGLNVSGLVFDEIHAQPNRHLYDVLTKGSGDAREQPLFFIITTAGTDKNSICYELHTKALDILKGRKKDTSFYPVVYGLSDADDWNDEANWLKANPSLGHTIGIDRVREAYQQALDNPAEENVFKQLRLNMWTSSSVAWIPEHVYAKGNSPIDYEALKGRDCYAGLDLSSTSDITAFVLVFPPRHSEENYIILPFFWLPEDTLELRCRRDHVLYDVWERQGYIKTTEGNVVHYGFIEAFIEQLSETYHIKEIAYDRWNATQMVQNLEGMGLTMVPFGQGYKDMSPPSKEFYKLMMEGKIQHGGHPVLKWMGQNVVMRQDPAGNIKPDKEKSVEKIDGIVALIMGLDRCIRHQGNGTSVYDERGLLSF >NZ_CP016501|1140008:1182358|1167647_1168103_-|WP_000999418.1|DBSCAN-SWA MNDNQRRGIWKLRRDGFGYGAIAQMLNMSLGSVKQYCRRHPKLKGMGQFVKYQLDEGERSYCKNCMKKLHHAVQGRPKKFCSNRCRAIWWRNHQSQHDKTKTAYDELTCQNCGRSFLSYANPTRKFCGHPCYVEHRFRKGVVHDNSTQYGN >NZ_CP016501|1140008:1182358|1172034_1172316_-|WP_001208492.1|DBSCAN-SWA MREKVVERKLVSEVRNRGGICPKWVSPSFAGVPDRLVFLPSGKFGLVEVKAPGEKPRLLQVSRHRLFERLGFKVHVLDRVDKIGEVLDEIETT >NZ_CP016501|1140008:1182358|1140008_1140881_-|WP_000247956.1|DBSCAN-SWA MVNPEFYEKIAKIFCGDDLELFKYKSGPDLVHFFNNNFKISDSYGQGFPTRWRYVNQKLLDFSSVGKIDEYFNIILSKQYLLTERQISEIDALVHQQRILDELNKVCSVYSLKLSQKDGAFHLVEIDLDLIEIGSGGFANIYFQKSTGLVLKKLNEESARSASIRSRFKREYEITKSCSDIGSIIKVYDFDIGNCSYTMEKADNILDDFVKESFLTEDSQINIIRQILYSMSLVHQRGVLHRDLSPTNIFFINGIIKLADFGLGKNLNTLTSHQTMDTASFGQLFYCAPE >NZ_CP016501|1140008:1182358|1158043_1159252_-|WP_000040500.1|capsid|DBSCAN-SWA MSKLLELKEKRNAAWAQAKAFLDTVRSEDGLVSDEDSKRYEEMEAKIELYNKEIARLERQEKIDLELAQPTSQALTTQPTVIVDNQKEDEKKGVASDIYTQTFWTSVRKRNFYDVKDVLRVGEDTEGGHLVPDEYEKKLVQELQEENFFRSLATVIKTSSGERKIPVVTGHGSASWMDENGLYPETDETFGQVTLDSHKIGTAIRISEELLNDSVFDLESYMTSEFARRIGTEEEKSFLVGDGSKKPTGIFTQADVEGPTTATKDVTFDDMIELYHSLPAPYRKNAVWILHDTTVKAIRKLKDNNGNYIWQPSTQAGQPDLILNRPYYTSTFAPLPEAGNKAIAFGDFSYYWIADRQGRTFKRLNELYANNGQIGFLASQRVDGKLVLPEAVKTLTIKGKTA >NZ_CP016501|1140008:1182358|1176723_1177287_+|WP_000208946.1|DBSCAN-SWA 
MTTKVQTTKVITGKNTRFSYLNANEPKSINGSTPKYSVSLIIPKDDIETVDKIKAAIELAYKEGESKLKGNGKSVPALSILKTPLRDGDLERPDDEAYRNAYFVNANSPHKPGVVDANRQEIIDTSELYSGIYGRASISFYAFNSNGNKGIACGLNNLQKLRDGEPLGGRTRAEDDFATDDDDDFLN >NZ_CP016501|1140008:1182358|1165169_1166423_-|WP_000176350.1|DBSCAN-SWA MTLTFLDFFAGVGGFRRGLELAGMTCLGYCEKDKFARKSYEAMYDTEGEWFHDDITSIDPTRLPKADLWTAGSPCQNLSIAGKRAGLYGERSGLFFTFVDLLQSQKEEDKPEWVLLENVKGLLSSGGGRDYLDYLSILDQAGYDLEWQVFNSKDYGVPQNRERIYTLGHLRSRGRRQVLPLSGESGSHLKQLIGGMQSYRVYDPSGIATTLVGEGGGLGAKTGLYLIDQSLTEPKLTDEARCITARYTAGATKRTAMNSGVLEVQPILTPDRVIKRQNGRRLKEQDEPMFTLTSQDRHGVLEGIKVRNGTKQGYQVAELGDSVDLSYPGSQTRRARVGKGIAHNLSCGGRMGAVVWNGRVVKIRRLTPRECFRLQGFSDDLFDKAQAVNSDAQLYKQAGNGVTVTVVYTIGKAILSA >NZ_CP016501|1140008:1182358|1146832_1148689_-|WP_000796404.1|DBSCAN-SWA MADYGSNNDRGYTLLLRVEETGTSTADNTSTVRVQLWLKNGYTTFGMYDCRASVSINGQTLSWSGRPDMYTAHSSLHLIDKTITVPHDSNGSKTISFSATFSGSGGWSPGTLNTGSQTLRLSDIPRSSSATVSGNLMGQDVTITIKRASSDFTHNMTWHFGSLSGTIGTGIATSATWTPSISQLATQIPNSTSGNGHLTLATIYGGKTIGSMTIPITLNLPTSVVPTLGSISVSESHATAKTILTGTSFAQLVSNPKVTFNQGAGIYGSTIPSTGFRAEVFKFENNQWVQLQNVTSSNNGLLGGINWIGRAKVSAYVTDSRGRQSARKEVEITLLEYFKPIFSFSAVRAGSSMNQVTVTRKLKIAPLTINSVQKNKAALTWEVVDLASGQKVTNAGGAANWTSTTEHTKTDFQAILGGTYDTTKSYTIIGRLEDLFYATIFEFTIGPEKVVYGLSPSGMGIGKAWTRGVLDVDGSLPAYFDGDIYMKNKKLLDIFYPVGVIYESTSSSNPSTFMGGTWERFGNGRVLVGVSENESEFNSVNQSGGSKTHTLTIDEMPSHSHAQYVTANNGSGAVRRDYSSDGSSTIYPQGNNTGNTGGGKPHNNLQPYVTVYRWRRAA >NZ_CP016501|1140008:1182358|1161615_1161921_-|WP_000666489.1|DBSCAN-SWA MKEYQVTISDDAKADLLSIYHYVRDELCAPQAADNLLEKISQAMLSLSIFPERCSIIEDLVGKGYTFRQLIVKKYRIIYHVLEDEVIIVAVVYGSRHMDNW >NZ_CP016501|1140008:1182358|1161326_1161599_-|WP_000424367.1|DBSCAN-SWA MELVKTIQIGDDIYLPIPDQFGIQEGQEFNLYQSSDGTLVLSPLDSKLSADYQSLSSDEQETTVVEQATLDKVANSVLSRHLDAFKELAE >NZ_CP016501|1140008:1182358|1152338_1155458_-|WP_000918343.1|DBSCAN-SWA 
MAGNIKGITIEIGGDTQPLQTALKGVNKQASEATKELRQIDKALKFDTGNVTLLTQKQEVLAKQVETTKEKLATLRQAQAQVEAQFKAGNIGADQYRAFQREVESTQTVLKGYESKLESVNRALSENGAQVETNQSKLNRLQNEQAQLVSESEKLQSSFKLQESALATTASEADKLALAQQKVASHSEILEKQIHNLEQQLSLTKSEYGENSVEANKLEKTLNETKTAYNNLQNEMEELASSSASSKASLEETNSLLKADLLMEFGDQLGELSQKLIDFGQQSLDAFLEVDEGMDIIVTKTGATGSALEEMTDIAKTLATELPTDFNTAGSAVGELNTQFGLTGDALKSAATQLIQFSEINGSDVTSSAISAKQAIEAYGLEATDLSSVLDTVTYTSQSTGVGVQELMDKAVAGSPQIKALGLSFDEGITLMGQFEKAGVDSSAALSSLSKASVKYAGDGLTLQEGLAGTIEQIKTSTSETEALSLASEIFGSKAAPRMVDAIKRGALSFEDLAGTADKAAGIVTQTYEGTLDPVDKFTTAQNTAKLAMAEMGDAIAATLAPILEVIASLLQAVATWFSGLSEPVKQFIVIVGSLVAALGLVLPIFIALQAAAMAMGTTIMGMITAAAPIVGIILGVIAVVALLVVGIQQLWQHHEGFRTAVTEIWNAIYAFLTVIIQQISSFVMSIWGTLITWWTENQQLILNATNTVWTAISTVIQTIMTILAPYLQASWENIKLIITTAWDIIKVVVETAINVVLGIIKAVMQIITGDWSGAWETIKQVVSTVWEVIKSLISIVLSAIAQFISNSWNGIKGTMTNLLNSIKGVVSNVWNGIKSTISSILSSIGSTVSSIWNGMKATISGVLSGISSTVSFVWNGVKSTITNAINGAKNAVSSAINAIKNLFNFKIKWPHIPLPHFSVSGSANPLDWLKGGLPKISIQWYAKGGILTKPTAFGMTGNSLMVGGEAGREAVLPLNNQTLGSIGSSIAATMPNKGTTITVNITDVVIREEADMKKLADYVAGRLAAEMARQALLRGGRV >NZ_CP016501|1140008:1182358|1142986_1144228_-|WP_000290968.1|DBSCAN-SWA MAVRLIKTKSNRKKQRVCAYTRVSTTNSSQLDSLENQKAYFETFYVNREDVDFLGVYYDKGISGSKEKRPSFQAMLEACRQGQIDLIHTKSISRFARNTMTVLEVSRELKALGVGIYFEEQNINTLSNEGEGMLSVLASLAEEELQSMSDNQRWAFQRKFQRGEMVINTKRFMGYDVDDKGELVINEAEAQIVRRIFQLYLDGMGMHRIAKLLNQENVPTVTDAKWHDTTVRNILKNEKYKGTVLLQKYFHVGINGPKRLNQGQVEQYLIEDNHEPIVSKEVWQAVQDKLASKTWKQGVNKHYRFTSMLKCEYCGSTLKRQVTYKKQIVWCCSKYIKEGKASCRGMRVPEKAIEEWKLRTPVTVIERNEYGQKHYSYSSQESAADGHPSASNQNKSGSLLSGVHRPRRTAIKL >NZ_CP016501|1140008:1182358|1175597_1176719_+|WP_001873559.1|DBSCAN-SWA MTNHAVLSASASHRWLNCPPSVRLTEDMPDVTSEFALEGTDAHELCAYLVEKALGRKARDPTEDLSFYNEEMQNCAEEYRNYVMEQVEKAKDYSHDPTVLIEQRLDFSKWVPEGFGTGDCLIVADGLLQVIDYKHGLGILVDADHNPQMMCYALGALEMFDGIYDFDNVTMTIFQPRKNNISTFEMDKAELLEWAEDQLSPKAELAFKGEGELKSGKHCQFCKIKNVCRKRAEENLALAKMEFADPATLDYEDIAEILTKLDLLVSWANDVKAYALKEATEGHSIPGYKLVEGRSVRKFSDEAAVSQAVMDAGFDPYEKKLLTITAMTKLLGKKTFNDLLGGLIVKQSGKPTLVPLDDSRQELNLATNEFKED 
>NZ_CP016501|1140008:1182358|1159265_1159964_-|WP_001225236.1|protease|DBSCAN-SWA MRKFWNFTDEGEVRTLRIEGQIADETWFGDEVTPQLFKNDLTSETGDITLWINSPGGDVFAAAQIYNMLMDYQGDVHVIIDGLAASAASVIAMAGTTVSMSPVAMMMIHNPWTFAQGEAKDMAKVIEMLGEIKESIINAYELRTGLSRTKISHLMDSESWFNAKKAVELGFADKVLFEKEETPEQDYQNSYTFSRVTAAHDLVVKLQASLQPPKPQKTIPFNQLEKRLNLLK >NZ_CP016501|1140008:1182358|1179511_1180081_-|WP_001122250.1|DBSCAN-SWA MPINVNFFNYCQTQSSSTGVEPSKGEVLAPMFVQDYYMSSEYSGLIRGTFRQCRIMSIPILTAFVPVAEEEYEQMVWWYNNSVNDYLKDFRKPKKNTLEVSSWEVFTEKQDLAIADEGIELYLVMDQLDFLKAKLAATNFQAPEILDMLFEGYENKEIFEYLGVKKSAGYKKVNNAQKEGLKLYKELNK >NZ_CP016501|1140008:1182358|1166419_1167676_-|WP_000211620.1|DBSCAN-SWA MTTQPNMEIKELPLSDLKPASYNPRKKLKKGDKEYEKIKQSLLKFGYVDPIIVNDDLTVIGGHQRLTVLKDLKYETAKCVIVSLSKEDEKALNIALNKITGQWDDQPLADLLLDLQESDFNLDLTGFEPPEIDDILSNIHDKDLSEDDFDVDEELKKPTVARRGDIWQLGKHRVICGDSTEAETYEQLLGDKKANLIVTDPPYNVDVEETAGKILNDNMSDRDFYQFLFDMFTQVESHMEADASIYVFHADTEGLNFRKAFKDAGFYLSGSCIWKKNSLVLGRSPYQWQHEPCLFGWKQKGKHQWFSDRKQTTIWEYDRPKSSKDHPTMKPIQLMAYPIQNSSMRGTLVFDPFLGSGSTLMAADQTGRVCYGIELDEKFVDVIVKRYMESTSNRDVSVLRNGETLSYDQALEVMEEHS >NZ_CP016501|1140008:1182358|1157789_1158047_-|WP_000988605.1|head,tail|DBSCAN-SWA MMTLEEVKLYLKVENGEEDYLIEQLMATSRQICEDILRESSTSEVLKTAILYGVAYLYEHREEANHKELKETLYHLLLADRKDVF >NZ_CP016501|1140008:1182358|1172599_1174885_-|WP_001160920.1|DBSCAN-SWA 
MQFTLSHSGQSGVQTTTVYPHQVTITDETSLKRIAQYDHVAGLFSNNTRSNANFLKSDVLVMDIDNDHTENHDDWITEEYLKILFSDYHFALVTSRNHMVQKGSKVARPKFHIYFQINEVTDKDTYAFLKEELTNQYNFFDTNAKDAARFFFGNPNAQVFWNDSWLTIDADLLDSSFDSDEEDFDADFYTPPTGPITEGSRNSTMSQFAAKILKRLGITQEARDGFDEQALKCDPPLDKTELDTIWGSAVRFYNRTIKGSEGYVSPEAFNRGELKPDDYSDIGEAGVLAREYGEKLAYTNATDYLTFTGQYWKEDKQLAIGAVLEFMDLQLEDASDKYEKVIKDLVNTGVSENLVREGGKALAKVIETPTQQKLYTTYLAARTYYQFVMKRRDYRYITATHNTAKPMLAIDLSELDKDDMVLNTPEATYDLRIGLSGSHEHDPKDYITKMTTVSPGDQGMGLWQETLATFFCNDQELIDYVQEIIGMAAIGKVYQEHMIIAYGGGANGKSTFWNTIARVLGSYSGKLSADALTMSNKRNVSPELAELKGKRLVIASEMAEGMRLNTAVVKQITSTDEIQAEKKYKDPFHFVPSHTLVLYTNHLPKVGANDDGTWRRLVVIPFNAKITGRSDIKNFADYLYDHAAPAIMSWIIEGAEKAIKANFKTKVPAAVANSVKVYREANDWLGHFLSECCEVGDKLSEKSGELYSRYRAYCVQNMEYTRNTTDFYAALAQAGFERKRTNKGNFIMGLKLADDGDDFLD >NZ_CP016501|1140008:1182358|1146416_1146815_-|WP_000661349.1|holin|DBSCAN-SWA MKELLATNKVLFSAIGGLIGSIFGEVDGFLFALMVFISIDYITGLMAAAVEKKLASNIGFKGIFKKIVLLFLVAVGQIIDEHVLKQGGMVRTAIIFYYLSNEGLSIIENAARIGLPVPEKLKDVLKQLKQGD >NZ_CP016501|1140008:1182358|1161920_1162223_-|WP_001140392.1|DBSCAN-SWA MAKTGTLNLRVDDSVKSAADEILKRLGIPMSTAIDMFLNQIILTGGIPFDVSLAEAPQRVNVDYMSQEEFYDKLITSFEDAKGGKRQDVREFLSQFKENA >NZ_CP016501|1140008:1182358|1164377_1164827_-|WP_000342209.1|DBSCAN-SWA MDDKTLERLRKTYPTGTRVRLVHMDDPYSVPIGMLGTVEDVDDIGSLIVSWDNGQSLNVLHGIDRVVIVEPNRHDFIRSIYCYLRAGHNEFVFDSGDIPVQLTHEEVITLMTSTHPQTEIDIYIDECNHFNLKPTTVLTKEEIEKAIQE >NZ_CP016501|1140008:1182358|1180489_1182358_-|WP_000459854.1|DBSCAN-SWA 
MFDKYIFDTYYEQIVKTVKGFIFNNKDNTDLSTYMVPEPNGFIEFDNFELYKINYEIINNGELNLEIIVIANVIVRQYIKGEMEVDTKTKFVSVYAEVELDADIKNFRIYNTEFKSDQYKKSRNLKLSKDWVPYIRKKDFDDIAEKFLRKYYPQALTQPTPVPVETIVSEMGLSIHQEKLTIDNSVFGKMVFKDTDVEVIEDEQLVSKHFNKGSILVDKDVVFKRNVGSYNNTVIHECVHWELHKVFHEVKMVLDKDHSQVSSWTEENLADSSMWTSLDWMEWQANGIAPRILMPKVQTRIKIRELFQTLTLVNPDISRSELVQEVVDNLATFFEVSRQAAKIRMIDLGFKEANGVYNYLDDRYMHNFAFELEAFDKGSSYTITSNDLCFEYCFNESFRQIIDRNKFIYVDNHLCLKDKKFIYMTKDGPIMTDYAYEHMDECCLIFKVKSKNFTSISNETYYDYVLNRGVTKESEIKADFVDILQNPSLMDQLPPLDMMKLGKKISELLKELPFEFSGTLRSHRKRKNCTQPFLAKLVGITERTLRDYETLEDNLPRLELTLSFCFALKLRPELSDDMIKKAGHQLTISPPHQVYKMLLSTSYYKPLGEINSILQAAEMKTL >NZ_CP016501|1140008:1182358|1170187_1170685_-|WP_000356300.1|DBSCAN-SWA MDKAEYILTHYNELKWELEMLKYRLNNFKPVTENEVISSLVFERSDEPKVTSTPTNQRSEMIALSFREKMVQENEELLSDLSQRYIRLASDLETFDMAIRFLKGDLSEFAVELLKPDCNWDYLMREFHISRGTVHNWRRKLLEHIRQIFVKLGRSLTIELYMNSP >NZ_CP016501|1140008:1182358|1148701_1151617_-|WP_069571476.1|DBSCAN-SWA MLYLLDGQTKTPKWNGQPLFETVSASVEEVLNGTFQLRLTYPITDSGVHETLRADELILCPTPDLGKQLFRIKQVKIQDDTVELECYHISDDVMKRQVKPFSAINTTCQSALMRMVEACPSDLGLFSFDSDVTERHTYVSDEDLTLYQALMDGKHSLLGMWEGEFVRDNFQLMVKKHRGNDKGVILTSHHNLKAFEDKGDSDKVITRIYATSTFQAEGSDEDTVLSVVVESPLINQYPYIHEARYENNTLQTEEELRQWAMAKFTHEHIDNISRQITVEAYQLDGQEVHLGDTVTLKSQKHKVDVKKKAVGYTFDALEEVYLSVTFDDEVTFTNSGSSGVNSLTSAAKTILDVNQSVTEHRASKERANFNKVFDRQFERLQTEVEDGIAIAKAEGERSGKKAALDYLATNGLEARVAAFQKATIDELTVSSSAWMTRLVSQQILSEYVKSLEVEADKVVIPGQHTPVFSLDRDGNLSIDTPLLKVRGESLATQADLKTISLTPGPKGDAGADGVGIQSREQYYLVSAQKTGLTATSASWTKTIPTLTSTLKYLWNYEKTTFTNGSTTVTTPVIIGVYGDKGVDGKAGKDGKTLYTWRMYADNDKGDGISAVSTGKRYLGLAVNKESATPSTNPSDYTWSSFFEGTELGGRNYIDDYAMKAATFSSITSEWKKEVVEDTSSVSGVTVKMTCTKAGTGGFHRNFYDLRSRIGANMTFSIDLKASKTVSLTIGNELGGTKVVQVSSSWQRYSVSWKVTSAQNHSYVFYLKSGTWAVGDVVYLRNVQLEDGNVASAPGPSLNDLIAQIDAKADNGFMKQQLDLLTEKTESLRVDLEARALAKEVADWLKSYKEFEKNNEAVLAQFNQDFIDNTARIAAIEADLKASSLLLNFVNTYLRAGDNGVIIGKKDNSEYIELTPQGMMIKSAGNAVMTVTVGVIKIHHGVFVETLQVGYYRLEAARHNAKHLVCRFIDAK |
41 | Streptococcus_phage(97.22%) | tail,portal,terminase,head,holin,protease,capsid | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1492981 : 1500501
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP016501|1492981:1500501|DBSCAN-SWA ACTAATTAGCTAATGCTCCCATCGGATCCCACGGCGCTAAAGTAATTGGCTCTGCGTTGTAAGCCTCTAATTCTTCTTTAGTAAGCAATTCTTTACGAACGACTATTTGGTAAGTATATTCATCCATCCAAGCATCTGAAGCTACAAAATAACCATCCTTACCAACTTTCTCTCCCCAAGAGTTCTCAACTTTCCATTTTAAAGGTTGTCCAGATTCATCAAGGTCTACTCCCGTTAAGACCATAGCATGTGTCATTAAACTTTCACTATAATCTAGGCGGCCCGCTTTGTCTTGGCTAAGTTTAATATCCATAGATGAATTAAAATCATAAGTCGTAGTTGCTAGGATACCCTTTTGACGATTTGATACTTGACCTACGTCTGAACCAAACCACACAGTTTCTCCTGATTTCATTTGCGCAATCGCTAACTCTTTAAACCGTTTCATATCAAGGTTTAAATATTTAACAGCTGGCCCTCCGACAACATTTCCAAGCATCTCAACTGTATAGGATTGTCCATAAGGCTTATCTACAGTTGGAGCATTGATAATAGAAACATAATCTGATAGGTCAAGATTAACATATTTTTGATAAAATGCTTTCGGTGTAATATTTTTATCAGACTGATAATGATTGTCTTTATCACGATAAGCAAAATCAAAAGATTGCGGTGGCAAGCCAAGGTTCATTGCTAAAAAGTTAAAAATTTCTTGAAGCAATTCTTCCTTTTTATTTTGAACAGTTGCCCCATCTGCTCCCTGTGCTATTAATTCTCGTAGAATCTGTGCATCTTGGCGTAAGAGTTTATTCAAATATTGATTTAGTTCACGACTAGCACTTGATGAAACAGATTCAGGATAAACTGTTTTAGGAACGACACCATATTTCTCAAACAGAGCAACCACCATGTCCCATTGTCCTCCATCTTGTTGAGGTACATCTAATAGGAATTTCACCTTACGACTACTCAATTCCTGATTTGCAGTCGCAATAATTTGTTCCATAAACCAATTTGATTTTTCATACTTATCCCAGAAAAATGTATGTGCCTGAGATAATTCAAAATTCTCAAGTTTAAAATCTGAAATCAGTTTATGACGAAATGTATTCAGTGCAGCAAACATCCAGCAACGTCCTGATTGTTTCTGATTAGAAACTTCATCTTTGGTCAAATCAATTGAAAAGACATAGTCGTTCTCAATCTCACTTTGACGAGTCTCTAGAGACTTCAATAAACCATTGTGAGTTACTGCATTTTCAATAGCGCTAAATTTTGTATTTGCTTGGTAATCAGCAAATAACTTATCTGTAAATGTTTGAGTTAATTTTGACATAAATTCCTCCTTTTATACTTAGCTACTATTATAAACATTCTGAATATTTCTGCAAGTCTGAAAGTAATTTATTAGAGAAACAAAAATAGCTAGAATATTCTAGCTATTTCAGATTGAAGACAAAGTCCTTAGACATTCACTTTTCTAAGGACTTTTCTGTTTGTCTAGAGTTATAATAAAGGTAAGGTGTTCTGGGAGGTATTGCTATGTTCCACAAAGAAAATCCTGACTATAATCGAAATCAAGTTGGTTTCTATAGTTTAGATGAGTTGGTTCCCAAAGACCATCTCTTGCGTCAAATTGATGAAGCGATAGATTTTTCATTTATTTATGACTTAGTGAAAGACAGTTATTGTGCAGATAACGGTCGTCCTAGTTTGGATCCGGTCATGTTAGTAAAAATCCCCATGATTCAATGTCTCTTTGGCATTCGTTCCATGCGTCAAACCATCAAGGACATTGAGGTTAATGTGGCTTACCGCTGGTTTCTTGGGTTGACTTTAGAAGATAAGGTGCCTCATTTCACAACTTATGGTAAGAACTATAATCGTCGTTTCCAGGATAAACAAGTCATTGAAGCAATTTTT
GCTTATATCTTGGGGCTCTGTCTCAATGCTGGTTTGATTGACCCCACTGATATCTTTGTGGATGGGACTCATATCAAGGCAGCCGCTAACAATCACAAATACATCAATCAAGAAGTCGATGCACAAGCTAAATTTATGAGTGATCAGTTGGAGAAGGAAATTGCTAAAGATAGGGAAAAACATGGAAAAAAGTCGCTAGGAATCGCCAAAGAAAAAGAGCCCATAAGTAAGAAAATCTCTACAACCGACCCTGATAGTGGTTGGTTTCATAAGGGAGAACATAAGCAAGTTTTTGCTTATAATGCCCAGGTAGCTTGTGATAAGTATGGTTGGGCACTAGGCTATAGTATCCACGCAGGGAATGTCCATGATAGTCAGGCATTTCCTGAACTCTTTGACAAAATCAAAGCATTGACTCCTCACTATCTCATCGCTGATTCTGGTTATAAAACTCCAAGCATCGCGCATTACCTATTAACTATAGGTATCATACCAGTTTTTTCTTATACTAGACCTCGTAGTAAAAAGGGTATGCTTCGGCTAAAAGACTTTATTTATGATGAGTATTATGATGTTTACCTCTGTCCAGAAAATCACCCCTTATCCTATTCAACGACTACTCGAGATGGCTATCGTGAGTATAAAAGTAATCCAGCTGTATGCCAGTCTTGTCCCTTATTATCGGTTTGTACCCAATCAAAAAACCATCAAAAAGTAATCACTCGTCATCTCTGGAAAGATGATTTAGAGGTCTGTGAAGACATCCGACACCAAAGAGGGATGAAGGAACGCTATCAACAGCGCAAAGAAACCATTGAGCGTTTGTTTGGAACAGCTAAAGAATACCACAACTTACGTTACACGAGATTAAGAGGAAAGTCCAAAATGGAGGCAACCCTTGGGCTGACTTTAGCGTGCTTAAATATGAAAAAATATAGTAAGATAATGGCTGGAATAGTCTTTTTAGTTTGCCTAAAAGTGATCATATCAAGACCAATTGTGATAACAATAGTAAAAGAAAAGACAAGCTGGATTAACATTCCAGTTTGTCTTCAATCTGAAATAGCTAGAATATTCTAGCTATTTTTCTTATTTCCAGAAATCATCAAAAATTGTGATTGGAAGATGACGTTTATGTTGACCCTTATACCACCAATTTTCAATGATACCTCGAGACTTGTCCGAAACCACCTTCCCTTCAAGATAAGCATCAATTTCTTGGTAGGTTACTCCAAGGGCTATTTCATCAGCGATTCCTGGTTTATTTTCTTCTAAATCTGCGGTTGGAATCTTTTCGTACAAGGCTTTATCAGCACCAAGTTCAGCTAATAACTGTTTCCCCTGACTTTTATTAAGTCTAAAGAGAGGTAATAAGTCAGCACCTCCATCACCAAATTTGGTGAAAAAACCTGTAATATTTTCTGCAGCATGATCAGTACCAATAACCGCTCCAGCATATTGACCTGCAACTGCATATTGACTAATCATTCTTTGACGAGCTTTAATATTTCCTTTATTAAAGTCTGTAATCTCTACTCCTGCTGCATTAAGAGCTCTAACTTGGCCATCAACAGCTTCTTTAATATTAATGGTCAAAGCAATATCTGGCTTGATAAAATCTAATGCTTTTTGTGCATCTTCTTCATCGGCCTGAATACCATATGGTAAACGAATAGCTATAAATTGATAGTTTTCCCCTGTATCAGCACGCAACTCTTCAACTGCTAGTTGAGCTAGACGTCCTGCTAAAGTTGAATCTTGCCCCCCTGAAATACCCAGTACATAGGTCTTTAGAAAACTATGTTTTAATAGATAGTCTTTTAAAAATTCTACGGAACGACGGATTTCTTGACTTGGATTAATAACAGGTTTTACACCGAGTTCTTTGATAATTTGATCTTGCAAAGTCATCTTACTTCTCCTTTTGCCAAAGCTTCTTTACGGATACGATCAATCAGGTCCATTTTATTTTGCCAAACATCGCGTGCCAAATCAACTGG
GTAATCCTGTGGATTCAGAACACGTTTATACTCATCCCAAAGCTGATCAAATTCTTTGCGTCCGTACTCTTGAATTTCTTGAAGACTTGGTAATTGATAAACTAATTTCCCTTTGTCAAAGATATCTACCAGTAAGGGTACTGCGTCAAAATCACGGACTGTTTTATTGATATAAGTATAAGTTGGATGAAACATTTCAATTTCATCTAATTGTGTCACATCAGTGTCTGCAAATGTGATGTAATCACCCTCAGATTTCCCTTTAGCTCGACTGGTGATACGCCAAACTTGCTTTTTCCCAGGTGTTGATACTTTTTCAGCATTGTTTGATAGTTTAATTGTATCTCGCATACTACCGGCGTCAGTCTCAATAGAAACAATTTTATAAACAGCACCTAGCGCAGGCTGATCATAGGCTGTAATTAATTTTGTTCCGACACCCCAAACATCAATTTTAGCTTTTTGCATTTTTAGGTTGAGGATTGTATTTTCATCTAGATCATTCGAAGCATAAATCTTTGCATTAGGAAAACCGGCGTCATCTAACTGCTGACGAACTTTCTTAGACAAATAAGCTAAATCACCTGAATCAAGGCGAACTCCCAAGAAATTAATCTTTTCGCCCATCTCTTTTGCAACTCGAATAGCATTAGGCACACCTACACGAAGGGTATCATAAGTATCCACTAAAAAGACGCAATCTTTATGAGTTTCTGCGTAAGCTTTAAATGCCTGGTAATCATCGCCATAAGTTTGAACTAAAGCATGTGCATGCGTTCCTGATACTGGAATATTAAAGATTTTACCAGCACGAACATTGCTCGTTGCGTTAGCACCACCAATAATCGCTGCCCTAGTACCCCAAATGGCTGCATCCATTTCTTGAGCACGTCGTGTCCCAAACTCTAATAGCGGTTCATCTTCAATAACAGAACGAATACGAGCTGCTTTTGTAGCAACAAGTGTCTGGTAGTTGATGATATTTAAAATCGCTGTCTCTACTAACTGACATTGTGCTAAAGGACCTTCAATTTGAACTAATGGTTCATTAGCAAATACAAGGTCACCTTCCTTGGCTGATTTGACAGTCAGTTCCATTTTTAAATTTTTTAAGTAATCTAAAAATTCTTCAGGATAACCTAACTCTTCTAAATAGGATAAATCACTATCAGAAAAGCTCAGATTTTCTAAATAACGAACAATACGCTCTAAACCTGCAAAAACAGCATAACCATTTTCAAATGGAACCTTGCGAAAGTAAGCTTCAAACACCGCTCTTTTATTGTGAATACCTTTATTAAAATAAACTTGCATCATATTGATTTGATATAAATCAGTATGTAGTGTTAAACTATCGTCTTTATACATCAGACAATCTCCTTTTAATCATGAGCCTAGGATAAGGCTATATGTCTAACATTATAGCATATTTCTATCTCGGTAGTTCAGTTTATCACCTTACTTTCATGAAAATAAAAAGAAAGACTAATCGTTTTTCGATAGTCTTTTCTACTTTTAGTATTTCTTAGAAATGCTCAGTTATATAGTTATAGACACCCTGTCCAGCGATAGCACCTTCACCAACAGCAGTCGCAATCTGACGTAGATCTTTTTGACGAACATCCCCAATAGCATATAGTCCTGGAATTGATGTTTTCATATTTGTATCAGTTAAGACCCAGCCAGTCTCGTCCGTAATACCTAATTCTGAGACCATGCTAGAATGTGGTTTCAAACCAACGTAAATGAAAACACCTCCGAAAGTCATCTCTGAGATTTCTCCAGTCTTTAGATTTTCAACAGTTACACCAGATACTTTAATCTCATTTCCTTTAATCTCTTTGACCACTGAATCCCATACAAATTTTATTTTTTCGTTAGCAAAAGCTCGGTCTTGAAGTACTTTTTGAGCTCTCAATTGATCACGTCGGTGAATAATAGTAACACTTTTAGCAAATTGAGTTAAGAAGACAGCTTCTTCAACAGCAGAATCACCGCCA
CCCACAACTAAAAGATCCTGATCACGGAAAAATGCACCATCACAAACCGCACAGTAAGAAACGCCACGACTAGTATATTCCTCTTCACCCGGAACTCCTAAAAGGCTGTTTTTAGCTCCGGTAGCAAGGATTACTGTTTTGGCTTCATACGATTCATCTTCGGTGATAACACGCTTTACATCACCATCGTTCTCAACTCTTTGAACTATACCATAAATATGCTCTACTTCAAATTTTTCCAACGGCTCATACATCTTCATTGATAATTCTGGACCTGAGATATGATCGTAGCCTGGATAGTTCTCAATTTCAGCAGTATTATTCATCTGTCCACCTGGAGCACCTTGTTCAATTAACCCAACTTTAAGGTTTGAACGTGCAGCGTAAAGCGCTGCTGTCATACCACCAGGACCTGAACCTATTATTAAAGTATCATACATTACTATTTCTCCTTTGAATCTTATGGTTTTATTCTATAGCAAAGCGTCTATATTTCCAACTAATCTGACTAAGCTTTTATCATTAAAATGATAGCTACAAATGCAAAGGCAAGTATTGGAGCTGTCATAAGAGCTATCAGAAGGACATCATAAAGAACAGATTGGCGTTCTCTAGCTGTTTTATCTATGTTTTTCTTGGCACGCCAAAAGACCCAAATTATACCAACTAAAATAACAGTTATGCTGGCAAGCCCTAAACCTTGTAAATAAACTTTTGCTATTTCTAAAAACATCTCTATCCAACAAGCGATTGTCAAAACATATCGCAGCTTTCTTATATACTTAGCCTAACTCAAGAATAATTAAAAATGAAGATTAGACATTCTAACCTTCATTATATCAAAATATTTTAGCTTATAGGTACGATTTGTTATAACTAGCGAAGAATTCTTTTGTACGTTCTTCTAAAGGATGGTTAAATAATTGTTCAGGCGTCCCAGACTCTAAAATACGACCTTTTTCCAAAAATAAAACTTTATCTGCTACTTGGTAAACAAAATTCATATCATGACTAACAAGTACCATGGTTTGCCCCTGCTTAGCAGCATCTGCAATAGATTTTTCAACTTCTCCAACTAATTCTGGATCAAGTGCTGATGTCGGTTCGTCTAAAAGAAGAACATCAGGTTTCATGGCTAATGCGCGTGCCAAAGCAACACGTTGTTTCTGTCCACCTGATAAATGTCTTGGATAATATTTTTCACGATCTGCAAGTCCGACCTTTGCCAGTTCGTCTCTTGCAATCCGTGTTGCCTCTTGATCTGACATCTTTTTGACAATTTTCAAACCTTCTTTTACATTATCCAAAGCTGTACGACGCTCAAATAGATTGAATTGTTGAAAAACCATGGCTAACTTACGGCGTAATGTTAGTATGTCATCTTTACTGATTGATTTAAAATCAACTTTAAAATCATCAATTTCAATTGTTCCATAATCAGGCTCTTCTAAATAATTCATACTACGAAGAAAGGTTGATTTTCCTGCACCTGAAGCACCTACTAGAGCAACAACTTGACCTTTTTCAATATCTAAATCAAGTTTGTCTAACACTTTTTGACCAGAAAATGATTTAGTCAATTGTCTCAATTTTATCAT
Protein sequences of DBSCAN-SWA_3 >NZ_CP016501|1492981:1500501|1495992_1496814_-|WP_000174854.1|DBSCAN-SWA MTLQDQIIKELGVKPVINPSQEIRRSVEFLKDYLLKHSFLKTYVLGISGGQDSTLAGRLAQLAVEELRADTGENYQFIAIRLPYGIQADEEDAQKALDFIKPDIALTINIKEAVDGQVRALNAAGVEITDFNKGNIKARQRMISQYAVAGQYAGAVIGTDHAAENITGFFTKFGDGGADLLPLFRLNKSQGKQLLAELGADKALYEKIPTADLEENKPGIADEIALGVTYQEIDAYLEGKVVSDKSRGIIENWWYKGQHKRHLPITIFDDFWK >NZ_CP016501|1492981:1500501|1492981_1494316_-|WP_000041114.1|DBSCAN-SWA MSKLTQTFTDKLFADYQANTKFSAIENAVTHNGLLKSLETRQSEIENDYVFSIDLTKDEVSNQKQSGRCWMFAALNTFRHKLISDFKLENFELSQAHTFFWDKYEKSNWFMEQIIATANQELSSRKVKFLLDVPQQDGGQWDMVVALFEKYGVVPKTVYPESVSSSASRELNQYLNKLLRQDAQILRELIAQGADGATVQNKKEELLQEIFNFLAMNLGLPPQSFDFAYRDKDNHYQSDKNITPKAFYQKYVNLDLSDYVSIINAPTVDKPYGQSYTVEMLGNVVGGPAVKYLNLDMKRFKELAIAQMKSGETVWFGSDVGQVSNRQKGILATTTYDFNSSMDIKLSQDKAGRLDYSESLMTHAMVLTGVDLDESGQPLKWKVENSWGEKVGKDGYFVASDAWMDEYTYQIVVRKELLTKEELEAYNAEPITLAPWDPMGALAN >NZ_CP016501|1492981:1500501|1499411_1499636_-|WP_000478229.1|DBSCAN-SWA MFLEIAKVYLQGLGLASITVILVGIIWVFWRAKKNIDKTARERQSVLYDVLLIALMTAPILAFAFVAIILMIKA >NZ_CP016501|1492981:1500501|1499757_1500501_-|WP_000593862.1|DBSCAN-SWA MIKLRQLTKSFSGQKVLDKLDLDIEKGQVVALVGASGAGKSTFLRSMNYLEEPDYGTIEIDDFKVDFKSISKDDILTLRRKLAMVFQQFNLFERRTALDNVKEGLKIVKKMSDQEATRIARDELAKVGLADREKYYPRHLSGGQKQRVALARALAMKPDVLLLDEPTSALDPELVGEVEKSIADAAKQGQTMVLVSHDMNFVYQVADKVLFLEKGRILESGTPEQLFNHPLEERTKEFFASYNKSYL >NZ_CP016501|1492981:1500501|1496810_1498271_-|WP_000276183.1|DBSCAN-SWA MYKDDSLTLHTDLYQINMMQVYFNKGIHNKRAVFEAYFRKVPFENGYAVFAGLERIVRYLENLSFSDSDLSYLEELGYPEEFLDYLKNLKMELTVKSAKEGDLVFANEPLVQIEGPLAQCQLVETAILNIINYQTLVATKAARIRSVIEDEPLLEFGTRRAQEMDAAIWGTRAAIIGGANATSNVRAGKIFNIPVSGTHAHALVQTYGDDYQAFKAYAETHKDCVFLVDTYDTLRVGVPNAIRVAKEMGEKINFLGVRLDSGDLAYLSKKVRQQLDDAGFPNAKIYASNDLDENTILNLKMQKAKIDVWGVGTKLITAYDQPALGAVYKIVSIETDAGSMRDTIKLSNNAEKVSTPGKKQVWRITSRAKGKSEGDYITFADTDVTQLDEIEMFHPTYTYINKTVRDFDAVPLLVDIFDKGKLVYQLPSLQEIQEYGRKEFDQLWDEYKRVLNPQDYPVDLARDVWQNKMDLIDRIRKEALAKGEVR 
>NZ_CP016501|1492981:1500501|1498428_1499343_-|WP_000272353.1|DBSCAN-SWA MYDTLIIGSGPGGMTAALYAARSNLKVGLIEQGAPGGQMNNTAEIENYPGYDHISGPELSMKMYEPLEKFEVEHIYGIVQRVENDGDVKRVITEDESYEAKTVILATGAKNSLLGVPGEEEYTSRGVSYCAVCDGAFFRDQDLLVVGGGDSAVEEAVFLTQFAKSVTIIHRRDQLRAQKVLQDRAFANEKIKFVWDSVVKEIKGNEIKVSGVTVENLKTGEISEMTFGGVFIYVGLKPHSSMVSELGITDETGWVLTDTNMKTSIPGLYAIGDVRQKDLRQIATAVGEGAIAGQGVYNYITEHF >NZ_CP016501|1492981:1500501|1494522_1495983_+|WP_000468675.1|transposase|DBSCAN-SWA MFHKENPDYNRNQVGFYSLDELVPKDHLLRQIDEAIDFSFIYDLVKDSYCADNGRPSLDPVMLVKIPMIQCLFGIRSMRQTIKDIEVNVAYRWFLGLTLEDKVPHFTTYGKNYNRRFQDKQVIEAIFAYILGLCLNAGLIDPTDIFVDGTHIKAAANNHKYINQEVDAQAKFMSDQLEKEIAKDREKHGKKSLGIAKEKEPISKKISTTDPDSGWFHKGEHKQVFAYNAQVACDKYGWALGYSIHAGNVHDSQAFPELFDKIKALTPHYLIADSGYKTPSIAHYLLTIGIIPVFSYTRPRSKKGMLRLKDFIYDEYYDVYLCPENHPLSYSTTTRDGYREYKSNPAVCQSCPLLSVCTQSKNHQKVITRHLWKDDLEVCEDIRHQRGMKERYQQRKETIERLFGTAKEYHNLRYTRLRGKSKMEATLGLTLACLNMKKYSKIMAGIVFLVCLKVIISRPIVITIVKEKTSWINIPVCLQSEIARIF |
7 | Bacillus_virus(50.0%) | transposase | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
1764624 : 1776689
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_CP016501|1764624:1776689|DBSCAN-SWA ATTATTTGTTAAATGATACGTGAACGTGGTCATAGTGGTTGGCAGTAACGCCACCACGATCTGGCATTGCATTCCAAGTATTAGCAGGTCCATAAATACTATTTGTATTTGAGTAAAACTTTTGTTGCCAGATAACATATGAAATGTTATTTGCTGCCATATTTTGTGTAGAGTACTGTGCAACTTCATTACCAAGTGCTTGGTTTTTACCTACAATAAAGTCAACTGCTAAACCTTTACCATGATCACCTGGATCTCCCGCACGGTATGTACTGAATTCATTAACTCCATAAGTTGACGCTACTTTTTCTTTATAAGCTGCAACATGAGGTTGGAGCCTTGCATTTTCAGGATGTGCAGCTACTGCATTTGTTGTTGAAGCTGGTTGTGCTACCGGTGTTGCTGTTGGAGCTTTTTGTGCTACCGGAACGCTCTTAACTTCAGTCGCTTGTAACTTACTGTCTGTAGCTGTTGAAGTCGTAGTCACAGGAACTGCTGGAGCTGATACATGCTCTGGTGATGCACCAGTTTCTACTTTAGGAGTGACTACTTTAACACTTGCCACTCTAGGGGCTGCTACAGTTCTTACCGGTGCTACTTTAGCTACTGGAGCTGGTGTTTCAGCGGCAACAGAAGCTGGTGATACTGTTGTTGACTGACTGACTGACGTCTGAGTTGGTTTAACTTCCTCTTTAGCTGCTGGAACTTCTGAAGTAATCGACTTCACAGGAGCTGGTGATACCTGTTCATTAGCTGCTGCTTGACTAACAGCTTGCCCTTGTGCTAATACTTCTTTTGATTTCAAAGCTGGCGCAGAAGAATATGTCTTCATTGGCGAAACAATCGTTGTTGCTGCTTCTGGTGTCATACCTTCCGAAATTGTATTGAGAGAAACTTTTTGGTCTGCAACAGAAACTTGATTGGTTTTCAAATCGACAGTAGCTGTTGTTTGACCAGCAGCATTTGTTGCTGGTGTTTCTATTTTCATTGAAGTAGCAGTATGACTCTTCTGATCGTAAGTTACTGTCAGTGTTGTCTCAGGATAAATAAGATTGATATCTGCAATGTTATTAATTTTTGCTAAGACATTCATATCAATTGACATTGCTTCTGAAATAACGCTTAGTGTATCACCATATTTCACAGTATATGATGATTTATTGTCTTGCTTTACCAAATCAGCCTTTACCTCTGAAACAGTACGTGCTGTCCACGTCGTATCTGTTTCTTGTGCTTGAACACTTGCGACTGATAATAGCGAAGCTGCCATTGTCGATGTCAATAGTACCTTTTTATTCATTTTCATTGCTTCAATTCCTATTTCCTTTTTATTAGTCCGAGACTATAATATCACACTTTACTGGACTATAAAATTATTTTCAGACGGTTTAAAGAAATTGTACTTAGTTAGTAACAAAACTGTCATTTAAAACAATGAATCAATACCTCAAACTATTTGTAAAGCAGTTCTTGACTGTTGTCAACGCGAAGCCAAACTGAACTACCATTAGCAAAGCGTGTCTTAGCCCAAGTTAAACCAGCTGTTCCTTGGATGGGTTTTTCTACAGTCTTAAGAGTTTTAGGATTAAAGATAAAGTAACCACCTTTTTGAAGAACCTGGTCAGCTGTCAAATTACCATTAGCATCAACCTCATCAATTTCTGATGCTGGTATACCATTATCGTTCCAATCAAACCCAGTCGGCGTTAGGGTGTTATTTTTGACTAACCAAACACCATTAACCTTTTGTAATTCATCTACTCGATACACTTTTAATTTACTAGATTGATTCTGTACAGGAGCTGATTGTAATGGCTTAATGCTTGGAGCTGATGCTTGCGTTTTTCCACTAAAGGTCGCAACATTAGCAATTAGTGACGTTGGATTGATACGTCCATGGAAACCATTTTGAAAATTAGGGTT
AGCTGGTAAAAATTCAAAATGAAGGTGAGGTCCCGTCGCCATACCAGTTGCTCCTACGTAACCGATGATATCTCCTTGTTTGACTTTTTCCCCAGTCCTAGCCACCACACGTGACATATGAGCGTAACCACTATGCATTCCATCCGCATGTTGAATCATGACACAATTTCCTGCTAAGTCTGTCATCCAAGAAAAGTTGGCTCCAGCTCCTGCAAATTTCACAGTACCATCTGCCACTGCCCTAATAATCGTTCCAGTCGGAACAGCATAATCCACCCCACAATGTCCAGGATAACCATTGAAACCTGTTGTAATTCTACCATTATCAATTGGACGGACATAAGTATCCGCTAAAACTCGGGAACCCGCAGATAAAACCATACCACCTAAAACAACTAAGGAACTTGCCTTAACTAACCATTTATTCAAGAAAAACCTCCAAATAGAATTAATATAATGAAGTGCTCAAACACTTGCAATTAACTTATTCGAAAATTTAAATCAAAATATCAAAAAAAGTACTCACCTTGATGAGTACTTAGGAGGTTTAATAAAAAGAAAAGTTTTTAGGATATACGTAGTATACCTGAGTTAACTTAAGAAAACCTAAACTACTATTTTTTAATGTCTAAAATGTCTCACGCCTGTGAAGATCATGGTCAAGCCATGTTTGTTTGCGGCGTCAATAGATTCTTGGTCACGAACTGAACCACCTGGTTGGATGATTGCTTTGATCCCTGCGGCAGCGATTTCTTCAATGTTGTCCGCAAATGGGAAGAAGGCATCTGATGCTAGAACGGCACCGTCAAGGTGGTCCTTAGCCTGCTCGATAGCAATCTTAACTGAGCCGACACGGTTGGTTTGACCTGCACCGAGTCCAAGCGTCATGTGGTCGTTAGTGATGATAATCCCGTTAGATTTAACATACTTGATAGCCTTCCAGGCAAACTCAAGGGCAGTCGCCTCTTGTTCTGTTGGCTGGCGGTCTGTCACCACTTGCCAGTCAGATGGATTTTCAGCCACAACGTCTTGGTTTTGCACCAAAAGTCCACCAACTACGCCAGTGTACTCAGCTTCCACTTCGCTAGCAGCTTGGGCATCAAACGGCAACTCAAGGATACGCAAGTTTTTCTTTTTATTTGTGAGAATAGCTAGCGCTTCTTCTGAGTATGATGGTGCGATGATGATTTCCAAGAAGATAGGGTGCATCTTCCCAGCTGTCGCTGCGTCAACTTCACGGTTAAGGACAACAATTCCACCAAAGATTGAAACTGGGTCAGCTTCATAAGCGTAATCCCAAGCTGTCTCAATATCATCAGCCTGTCCGATACCACATGGGTTCATGTGTTTGAGGGCAACAACCGTTGGACTGGCTTTGAAATCGCGGATAATACGGATCGCCGCATCAGCATCACGGATATTATTGAAGGACAATTCTTTACCGTTGAGCTGTTTAGCTGAAGCGATTGAGTAGTCTGTTGGCAAGGCTTTTTGGTAGAAATCAGCGTCTTGCTGTGGATTTTCTCCGTAACGCATAGCCTGTTTAAGGTCATAAGTGATGGTCAATTTTTCAGGCTTAGCCTCTCCCACTTGAGCTGTGAAGTACTCAGCAATCAAAGCGTCGTAGGCTGCCGTATGACGGAAGGCCTTAGCTGCCAAGCGTTGACGAGTTTTAAATGTCGTCTGACTAGCGTCAGCCAATTCTCCCAAAACAGTGGCATAGTCAGCTGGATCAACCACAACGGTTACGCTAGCGTGGTTTTTAGCAGCTGAGCGAAGCATTGATGGACCGCCGATGTCGATATTTTCCACCGCCAAATCGTAGGTCACGTCTGGGCGAAGGATGGTCTCCTTGAAGGGATAGAGGTTGACAACCACGAGGTCAATCAGCTCGATATTATTGTCCTTAGCAGCCTGAAGGTGGCTGTCAGCGTCGCGACGAGCCAGAAGCCCACCGTGAATGTTTGGGTGGAGGGTCTTAACACGACCGTCCATCATTT
CTGGGAATCCAGTCACATCGTCGATGGCAATGGTCTCAACACCAGCATCATCTAGGGAAACCTTAGTCCCACCAGTTGAGATAATATCCCAACCCAAGTTTTTTAATTCTTTTGCAAAGTCAACAATTCCTGACTTATCAGAAACTGAGATTAAAGCACGTTTAGTCATGAGTTCTCCTTTTCTTACTTCAATCTATTTCATAGGCTATGTATTCATGAGAAAATTCATAACCGAGTTTTTCAGCTAAATTTAATGAAGTCCTTGTATGAGCATCCCAGCTAGGATAAATTCCCTTGTCTAAACAAGTTAGTATCAACTGAGCTGCAACTATTGTTGCTAAACCACGCCGACGAAAATCTGGATGCGTATCTACTTCTATCTCAATCCCATTTTTATAGGTTGAATAAGATGAAGCTCCTGCAATGATATTCCCCTGATAATATACAACATAACCTATACCTTGTTTTTTATAGTACTGATAAGTAGCGTAATTTGCTACTAAATCCTGTGACCATTCCTTTTCTAAGCAAGAGTTGTAAACTTTCTCATCAATAGCACGTAATTCAAAACCATTTGGCAATTGAGTAACAAACTTCTCTAACCTACTTCGCTCAAACAAAGTATCTTTTTTCGTCGCATAACGCTTAAAAGAATGAGCATTCTGACCATAAGTTGATTCAATCAAATCTGACCATCCTTTATGCTGAGGAACAAGGATAATATCCTCCCCAGAGCAAACTTCTAGTAGAAATAAAGTAGGCTGACCTGCTAGAAAACCAAAAGATGATTTCCTTCCCAATTTTGCCAAAGAAGATTTTGGTTGGTCTAAACTATCTACAAAAACTTCTCCCATAATACCTTGGACACACGACCAGATTATGGTCTCGTCCCAATCACCAAAAATAGACTCTACTATCTCTAAATTTTTCACTAGTTTCATTAGCTTATTATAACAAATTTTACTGTTTTCTCTTTATGCCCAGGCTATCTAAGACAGCTGGGTAGAGTTGGTATTCGGTTTCGTGGATGCGGGTTTCAAAGCTTTCTAGACTATCATCTGCTAGGCGTGGCACGCGCACTTGTTGGATGACCTGACCAGTATCCACACCAGAGTCCACCCAGTGGATGGTCACACCAGACTGGTCAACACCTGCCTCCCAAGCATCCTCGATACCGTGGGCACCTGGAAATTCAGGCAAGTAGGCTGGGTGAATATTGATGATACGCCCTTCATAGGCTGAGAGCAAGGCTTCTCCGACAATCTTCATGTAGCCCGCCAGACAGACCAAGTCAATTTCGTGTTTGTCCAACAAATTAACGATGGCTTGCTCGTAAGCCGCCTTGTTCTCAAACTCCTTGAGTTCAAAAGCGAAACTTGGAATGGTTAAGTTCTGAGCACGCTCTAAAACATAGGCATCACGATGATCTGAAAAGACAAAACTAACTGGAAACTGCTCTGCTATGATCTGAAAGTTGGAACCATTACCAGAAGCAAAAACAGCGATTTTCATATTCGATACAAAGGGATGATTCGGCTTGCCGAAAAATTTCGTAGAAAAATAGGAAAACGAACAACGACACTTGCGTCTAAGTCGTTTTATCTTTTTTCCTAGAAATTTAATCCCGTGTTCAGTTAGGCAAACTAACTAAACACTCCTTTCTTATAAAATTTAGTGACTTATTTAATCACCACACTATCGTCTGCTTTCTTGATGATACGACCGATTTCATAAACTGGTTCGTCCAAGAGCTCTTTGACGCGGTTAACATTTTCAGGACTAACAGCTAGCATAAGACCGACACCCATATTGAAGATTTCAAACATTTCTTCGTGCTTGATATCACCATATTTTTCAATCGCATTGAAAATCGGAAGTACTGGCACCTTATCCTCATCGATTTCCGCAGCCAAATCATCCGCAAACATACGAGGAACATTCTCGATAAAACCACCACCCGTAATGTGGGCGATACCGTTGACTAGCTCTTCCTTGATCAA
TGGCAGAACTGCTTTAACATAGATACGAGTTGGCTCAAGAAGAACATCCTTGAGTTGTTTGCCTTCAAGCTCTGGAAGCACCTCATTACCAGTGTAGTCAGCAAAGACACGACGTACCAATGAGTAACCATTCGAGTGGATACCACTTGAAGCAAGTCCAAGAAGAATATCTCCCTCTTTTACTTTTGAACCGTCGATGATTTGAGATTTTTCAGCCACACCAACAGCAAAGCCTGCAAGGTCATAATCATCTTCGCCATACATACCAGGCATCTCAGCCGTTTCACCACCAATAAGAGCTGCGCCAGCTTGAACACAACCTTCAGCAACGCCAGCGACAACCTGTTCCAATTTGGCAGGTTCGTTTTTACCAGTCGCTACATAGTCAAGGAAGTAAAGGGGTTCAGCACCTGCTGCAATAATATCGTTGACACACATGGCAACACAGTCTTGGCCAATGGTGTCATGCTGTTGATATTTAATCGCAAGCATGAGTTTTGTGCCAACACCGTCTGTCCCTGAGACCAAAACAGGCTCTCTAACTCCTGTTTTACTCAAGTCAAACATCCCACCAAAGCCACCTAGAGCTCCCATGACTCCCGCACGTTCTGTACGAGCAACGTGTTTTTTTATTCTTTCAACCACTTCATAACCAGCTTCAACATCAACACCAGATTTTGCATAAGCATTTTTTTCAGACATGTTTTCTCCTTATATAATTAGCAGAGCTTATTTCACTTTTTGAATATAAAAACTTGTTTTTTCTTCTAAACTTCTGAGGTATTCTTCTTCATAATCATAAAGCGGTGTTGGATAATGTCCATCAAAGTAAGCTACGCATAAACCACCATTTGGTGCTTTTGTTTCGAGTCCAATTGATTCAATCAACCCATCTAGTGAGAGATAGGTCAGACTATCTGCGCCAATAATATCACACACTTCGTCAACAGCGTGATTTGCTGAAATTAATTCTCGTCTTGTTTGAATATCAATACCATAAAAACAAGGATACTTTAATTCTGGACTAGCTATAGCAACATGTACTTCACTAGCTCCTGCTTCTCTTAATAATCCGACAATCCTTCTAGAAGTCGTTCCTCTTACAATTGAATCATCAATCATAACAACGCGCTTTCCTTTGACAACACCAGATACCGCTGATAGTTTCATTCGAACACCTTGTTCCCTTAATTCTTGTGTCGGTTGAATAAAGGTTCGCTGCGTATACTGATTTTTTACAAGACCCATCTCATTTGGTAGTCCGGATTCTTCAGCAAAGCCCATAGCAGCCGATAAGGACGAATTTGGGACACCAATTATAATATCAGCATCCTGTTTAAATTCTTGTGCAAGACGCTTTCCCATATTTTTTCGAGCCGTATGAACGTTAACACCATGTATAGTTGAATCCGGCCTTGCAAAATAGACATATTCCATCGAACAAATTGCCAGTTGCGTTTCATCAGTATAACGATCACATTGAATACCACTATCGTCTATAAGAATAACCTCGCCAGGTTCAACATCTCTAACCCATTTTGCGCCTACCACCTCAAAAGCACAGGTCTCACTGGAAATAACCCAGGCACCATTTTGCATTTGTCCAATTGACAAAGGACGAAAGGCATTAGGGTCAAGAGCAGCAATTAATTTATCTTCTGTCATCAGTAGATAGGCGAAGCCTCCCTTTACAGTGCTTAAAGCTTCTTTTACCTTCCCCATAAAACTTGGGTTATGGCTTCGACGAATCAAATGCATCAAAATTTCAGTATCTGAGGAGGCATTGAAAATTGCACCTTGCTTTTCTAATTCTTTCCTTAAGGAAATAGCATTTGTCAAATTACCATTATGACATAAAGCAAATTGCCCGTCATGAAATTTATAAAGAAAAGGCTGAATATTGCGAATATCTGCAGAACCTGCAGTAGCATACCGAACATGTCCAATAGCCGCATTCCCAGTTAAATTATCTAATTCAGATTGATTCTTAAAAACTTCAG
AAAGGAGCCCAACATTTCGATAACCATAGAGTTTCCCATTATCATTCGAAACAATACCAGCACCTTCTTGACCGCGATGTTGAAGGCTATGAAGCCCAAAGTAAGTGACTTGAGCTGCCTGAGGATGCCCCCAGATACCAAAGACTCCACATTCTTCATTTAGAGATTTTACTTCGTATGTCATTCTTTTTTCCTAATTCTTTCTGTATCTCTTCTTAAGTAGCTTTGATGATTTTCTCAATTTTGGATCTATCGTCACCAATGCCAACATCAAAATAGAAAATCTATCAAGCTATTTTCAAGCTAATCATTGAGTCTTAGCCTCAACGGCTCCCAAATACCACTACGACAGAAAGATACAGTTAAGCTCGCTTCGCTCTTGTTTCCTTATCTGTTTTATGAAAAAAATTGAAAAAATTATTTTCCTGTAAAATATTTTACTGCACTAGCAAACAAGGCTTGGTCTTTGTTACCAGGGATATTTTGGAAGAGGCCGTCTTCCCAGCGTTCTGAGTGCCCCATCTTACCGATGATTTGACCATTCTTGCTGGTAATCCCTTCGATGGCATTGACAGAGCCGTTTGGATTGTATTTAGAATCCATAGATGGTTGTCCGTCAAAGTCCACATATTGGCTCCAGATTTGACCATTGTCTCTTAGCTCTGCAAATTCAGAAGCGCTGACAACAAATTTACCTTCACCGTGTGAAACTGGAATGGCATGAATATCGCCGACCTCAACTCCTGCCAACCATGGTGAGTTGGTATTTGCGATACGAGTCTCAACCATCTTTGCAACGTGCTGGTTAGCATCGTTATAGAAGAGAGTTGGACTTGTCTCACCAGCTTCCTCGAAGTTTCCGTATGGAAGAAGACCTGATTTAACAAGAGCTTGGAATCCATTACAGATACCGATGATAAAGCCACCTTTTTCGATGAAGCTGTCAATAGCTGCGCGGACCTTCTCGTTAAGCAAGATATTGACGATAAACTTAGCAGACCCATCTGGTTCATCCGCTGCTGAGAAACCTCCAGCAAAGAAGATGATATTTGCCTTAGCAATATTAGCGACCATTGTGTCAACTGATTCAGCAATAGCAGCCTCATTCAAGGTTACAAATGCTACCAAGTTGACACTAGCTCCAACCTGTTCAAAAGCCTTAGCTGAATCATATTCTGAGTTGGTACCAGGGAAGACTGGAATGTAAACCACTGGTTTTTCAATTGTTTCCTTAGCCTTGATGACGGTGTCTGATACCACAGCAGGAACTTCTTCAAGAGCGTCTGCCTGTTCGAATTCTGTTGGATAAACCTCTTCCAATTTACCTTCGAAGGCTGCTAGAAGGCTAGCGCCAGCAAGGTCATTTCCATTGACAGTGACTGTAAAGTCTGCCTGAGTTTGACCGATTTTCACAGCGCCAGCGATTTCCTCAGCTGATGTAAAAACAAAGCCTCCGAGTTGAGCTGTCAAGCTGCTGTCAAGCTCTGCAATTTCAACAGAGGCCCCGATACGGTTACCAAAAGTCATGAGAGCAAGACTTTCTAGGACACCACCGTATTTAACAGCTGAAGCAGCAGTAATCTTATGTTGAGCTTGAATAGTCTCGAACTGGCTAAAGTTAGCCTTGATAAGGTCAAAATCAATATCTTCTGAAATAGCTTGACCTGGAATGTAGTAGATGTTCTCACCAGCCGCTTTAAACTCAGGAGAGAGAACCCTGCGGCTATCCGCGGTCGTCACACCGAAAGCTACCAAGGTTGGTGGTACTGTCAAGTCTTCGAAAGTACCAGACATAGAGTCCTTACCACCGATTGATGGCAAGCCAAGTTGAATCTGAGCCTCAATAGAACCAAGAAGAGCTGATACTGGCTGACCAAAACGCTCTGCCTGTTTATCCATACGCTTGAAATACTCTTGGTAAGAGAAGCGTGCACGAGACCAGTCAGCACCCGTTGCTACCAAGCGAGCTGTCGCTTCAATAACAGCATAGGCAGCACCGT
GATAAGGTGACCACTCTGCAATATAAGGATTATAACCTTGAGCAATAACAGAGGCTGTTGTCGTCACACCATGTTGAACTGGCAATTTTTGAACAGAACTTTCTGTCGGTGTGATTTGGTAGCGACCACCGATTGGGTGGTTAACGGTTGAGCGACCAACAGATGAGTCAAAGATAGTTTGAAGACCTTTTTGGCTAGCATGGTTGAGGTCAGACAAGACCTTAAGCGTATCTGCTTCAAGTGTCTCTGCAGATGTTGTGCGTGCTTCTGGAACTGTCAAGTCCTTGTCAACGACTTTAGCATCAACAACGACACGGACACCGTTGGTATCAAGGAAACGGCGTTCCAAGTCAACGATGATTTCGCCATTCCAAGTCATGACAAGATTTGGTTTTTCAGTCACAGTCGCAACTACGACTGCATCGATATTTTCCTTGTTACAGGCTGCGATGAAGGCATCCACATCACTTGGACGAACAACGACTGACATACGCTCTTGTGATTCTGAGATTGCAATTTCAGTACCATTAAGACCTTGGTATTTAAGTGGCACCTTGTCCAAATCGATTTCAAGACCATCAGCCAATTCACCGATGGCAACACAGACACCACCTGCACCGAAGTCATTTGATTTCTTGATAAGACGAGTGACATTGCCATCACGGAAAAGACGTTGAATCTTACGCTCTTCAATGGCATTCCCTTTTTGTACCTCTGCGCCAGCTGTTTCCACAGATTCAACCGTTTGAACCTTAGATGAACCTGTCGCACCACCGACACCATCACGACCTGTTTTACCACCGAGCAAGATGACCACATCGCCTGTTTCTGGTTTTTCACGAACCACATTTTCCTTAGGTGCAGCACCAACCACAGCTCCAAGCTCCATACGTTTGGCTACGAAGCCAGGGTGGAAGTACTCACGCACATACGTTGTCGCAAGCCCAATTTGGTTACCATATGAAGAATAGCCGTGCGCCGCAGTCTTAGAAATAACCTGTTGTGGCAATTTTCCAGCACGTGTTTCCGCAATCGGAGTCGTGATATCGCCTGCGCCTGAAATACGCATAGCCTGATAAACGTATGAACGTCCTGACAACGGGTCACGAATGGCACCACCGATACAAGTTGCTGCGCCACCAAATGGCTCAATTTCTGTTGGGTGATTGTGAGTCTCATTCTTAAACATGAGGAGCCAAGGCTCTTTTACACCATCAACATCTACTTCAATTTCAACTGAGCAGGCATTGATTTCATCTGAGACTTCCATATCGTCCAGACGACCGTTGGCACGCTCATAACGACCAAAAATAGTCGCCATATCCATAAGTGTCTGCGGCTTTTCAGAACGATCAAGTTCATCACGCATGGCGATATATTTGTCATAAGTCGCCTGCAATTGTTTTTGGAATTTAGAAGCTGAAAAATCAATGTTCTTCAATTCAGTTTCAAAGGTTGTGTGACGGCAGTGGTCTGACCAGTAAGTGTCCAAAACTTTCAGCTCAGTCTCAGTTGGCACACGCCCGATTGATTTGAAATAATCTTGGATGAAGAGAAGGTCATCGACCTCCATAGCCAAGCCCTGCTCTGCCTTATAGGCCACAAAATCGTCAGCTGTATAAGTCTCGAAGAAATCAAGATTAGGGATCGTCTTATCAGACACAGAGAAAGCCTGCTCTTCAAGCGGCAAAGTAATGTCCTTGAAACGTGAATCAACTGGATTCAAAAGATAGTTTTTAACCGCTTCAAGTTCTGCTTCTGCAATATCCTTATTGACCAAGTAAAGCTGGGCGGTGTTGACCTTGACCTGACTGTCACCTCCAAGCAATAGCAAAGCTTCTTGCGAACTAGCTGCACGTTGGTCAAATTGACCAGGAAGGGCCTCAATAGCAAAGAAGGCAACCTTATCAAGCTCCGCAGTGATTTCAGCTTCTGTCAAAAGACGGTCTGTCACCTGCTCAGAGAAAATATGCTTCTCCGCACGCGCCAGCAAATCCTCA
GCCAAATGAAAGACATCATAGACCTGCACAATACGCAACTCCTTCAAAGAAGTCAATTGCAAATTATGCGTCAACTCTTTCACAAGACTAGCCGATTTAATGCCAAAGTCAGCCTTTTTCTCAACAAAAATACGTTTATTCAT
Protein sequences of DBSCAN-SWA_4 >NZ_CP016501|1764624:1776689|1764624_1765929_-|WP_000783424.1|DBSCAN-SWA MKMNKKVLLTSTMAASLLSVASVQAQETDTTWTARTVSEVKADLVKQDNKSSYTVKYGDTLSVISEAMSIDMNVLAKINNIADINLIYPETTLTVTYDQKSHTATSMKIETPATNAAGQTTATVDLKTNQVSVADQKVSLNTISEGMTPEAATTIVSPMKTYSSAPALKSKEVLAQGQAVSQAAANEQVSPAPVKSITSEVPAAKEEVKPTQTSVSQSTTVSPASVAAETPAPVAKVAPVRTVAAPRVASVKVVTPKVETGASPEHVSAPAVPVTTTSTATDSKLQATEVKSVPVAQKAPTATPVAQPASTTNAVAAHPENARLQPHVAAYKEKVASTYGVNEFSTYRAGDPGDHGKGLAVDFIVGKNQALGNEVAQYSTQNMAANNISYVIWQQKFYSNTNSIYGPANTWNAMPDRGGVTANHYDHVHVSFNK >NZ_CP016501|1764624:1776689|1769506_1770058_-|WP_000685111.1|DBSCAN-SWA MKIAVFASGNGSNFQIIAEQFPVSFVFSDHRDAYVLERAQNLTIPSFAFELKEFENKAAYEQAIVNLLDKHEIDLVCLAGYMKIVGEALLSAYEGRIINIHPAYLPEFPGAHGIEDAWEAGVDQSGVTIHWVDSGVDTGQVIQQVRVPRLADDSLESFETRIHETEYQLYPAVLDSLGIKRKQ >NZ_CP016501|1764624:1776689|1770225_1771248_-|WP_001291325.1|DBSCAN-SWA MSEKNAYAKSGVDVEAGYEVVERIKKHVARTERAGVMGALGGFGGMFDLSKTGVREPVLVSGTDGVGTKLMLAIKYQQHDTIGQDCVAMCVNDIIAAGAEPLYFLDYVATGKNEPAKLEQVVAGVAEGCVQAGAALIGGETAEMPGMYGEDDYDLAGFAVGVAEKSQIIDGSKVKEGDILLGLASSGIHSNGYSLVRRVFADYTGNEVLPELEGKQLKDVLLEPTRIYVKAVLPLIKEELVNGIAHITGGGFIENVPRMFADDLAAEIDEDKVPVLPIFNAIEKYGDIKHEEMFEIFNMGVGLMLAVSPENVNRVKELLDEPVYEIGRIIKKADDSVVIK >NZ_CP016501|1764624:1776689|1772963_1776689_-|WP_001042263.1|DBSCAN-SWA 
MNKRIFVEKKADFGIKSASLVKELTHNLQLTSLKELRIVQVYDVFHLAEDLLARAEKHIFSEQVTDRLLTEAEITAELDKVAFFAIEALPGQFDQRAASSQEALLLLGGDSQVKVNTAQLYLVNKDIAEAELEAVKNYLLNPVDSRFKDITLPLEEQAFSVSDKTIPNLDFFETYTADDFVAYKAEQGLAMEVDDLLFIQDYFKSIGRVPTETELKVLDTYWSDHCRHTTFETELKNIDFSASKFQKQLQATYDKYIAMRDELDRSEKPQTLMDMATIFGRYERANGRLDDMEVSDEINACSVEIEVDVDGVKEPWLLMFKNETHNHPTEIEPFGGAATCIGGAIRDPLSGRSYVYQAMRISGAGDITTPIAETRAGKLPQQVISKTAAHGYSSYGNQIGLATTYVREYFHPGFVAKRMELGAVVGAAPKENVVREKPETGDVVILLGGKTGRDGVGGATGSSKVQTVESVETAGAEVQKGNAIEERKIQRLFRDGNVTRLIKKSNDFGAGGVCVAIGELADGLEIDLDKVPLKYQGLNGTEIAISESQERMSVVVRPSDVDAFIAACNKENIDAVVVATVTEKPNLVMTWNGEIIVDLERRFLDTNGVRVVVDAKVVDKDLTVPEARTTSAETLEADTLKVLSDLNHASQKGLQTIFDSSVGRSTVNHPIGGRYQITPTESSVQKLPVQHGVTTTASVIAQGYNPYIAEWSPYHGAAYAVIEATARLVATGADWSRARFSYQEYFKRMDKQAERFGQPVSALLGSIEAQIQLGLPSIGGKDSMSGTFEDLTVPPTLVAFGVTTADSRRVLSPEFKAAGENIYYIPGQAISEDIDFDLIKANFSQFETIQAQHKITAASAVKYGGVLESLALMTFGNRIGASVEIAELDSSLTAQLGGFVFTSAEEIAGAVKIGQTQADFTVTVNGNDLAGASLLAAFEGKLEEVYPTEFEQADALEEVPAVVSDTVIKAKETIEKPVVYIPVFPGTNSEYDSAKAFEQVGASVNLVAFVTLNEAAIAESVDTMVANIAKANIIFFAGGFSAADEPDGSAKFIVNILLNEKVRAAIDSFIEKGGFIIGICNGFQALVKSGLLPYGNFEEAGETSPTLFYNDANQHVAKMVETRIANTNSPWLAGVEVGDIHAIPVSHGEGKFVVSASEFAELRDNGQIWSQYVDFDGQPSMDSKYNPNGSVNAIEGITSKNGQIIGKMGHSERWEDGLFQNIPGNKDQALFASAVKYFTGK >NZ_CP016501|1764624:1776689|1771275_1772730_-|WP_000220672.1|DBSCAN-SWA MTYEVKSLNEECGVFGIWGHPQAAQVTYFGLHSLQHRGQEGAGIVSNDNGKLYGYRNVGLLSEVFKNQSELDNLTGNAAIGHVRYATAGSADIRNIQPFLYKFHDGQFALCHNGNLTNAISLRKELEKQGAIFNASSDTEILMHLIRRSHNPSFMGKVKEALSTVKGGFAYLLMTEDKLIAALDPNAFRPLSIGQMQNGAWVISSETCAFEVVGAKWVRDVEPGEVILIDDSGIQCDRYTDETQLAICSMEYVYFARPDSTIHGVNVHTARKNMGKRLAQEFKQDADIIIGVPNSSLSAAMGFAEESGLPNEMGLVKNQYTQRTFIQPTQELREQGVRMKLSAVSGVVKGKRVVMIDDSIVRGTTSRRIVGLLREAGASEVHVAIASPELKYPCFYGIDIQTRRELISANHAVDEVCDIIGADSLTYLSLDGLIESIGLETKAPNGGLCVAYFDGHYPTPLYDYEEEYLRSLEEKTSFYIQKVK >NZ_CP016501|1764624:1776689|1768734_1769487_-|WP_000780020.1|DBSCAN-SWA 
MKLVKNLEIVESIFGDWDETIIWSCVQGIMGEVFVDSLDQPKSSLAKLGRKSSFGFLAGQPTLFLLEVCSGEDIILVPQHKGWSDLIESTYGQNAHSFKRYATKKDTLFERSRLEKFVTQLPNGFELRAIDEKVYNSCLEKEWSQDLVANYATYQYYKKQGIGYVVYYQGNIIAGASSYSTYKNGIEIEVDTHPDFRRRGLATIVAAQLILTCLDKGIYPSWDAHTRTSLNLAEKLGYEFSHEYIAYEID >NZ_CP016501|1764624:1776689|1767167_1768715_-|WP_000166558.1|DBSCAN-SWA MTKRALISVSDKSGIVDFAKELKNLGWDIISTGGTKVSLDDAGVETIAIDDVTGFPEMMDGRVKTLHPNIHGGLLARRDADSHLQAAKDNNIELIDLVVVNLYPFKETILRPDVTYDLAVENIDIGGPSMLRSAAKNHASVTVVVDPADYATVLGELADASQTTFKTRQRLAAKAFRHTAAYDALIAEYFTAQVGEAKPEKLTITYDLKQAMRYGENPQQDADFYQKALPTDYSIASAKQLNGKELSFNNIRDADAAIRIIRDFKASPTVVALKHMNPCGIGQADDIETAWDYAYEADPVSIFGGIVVLNREVDAATAGKMHPIFLEIIIAPSYSEEALAILTNKKKNLRILELPFDAQAASEVEAEYTGVVGGLLVQNQDVVAENPSDWQVVTDRQPTEQEATALEFAWKAIKYVKSNGIIITNDHMTLGLGAGQTNRVGSVKIAIEQAKDHLDGAVLASDAFFPFADNIEEIAAAGIKAIIQPGGSVRDQESIDAANKHGLTMIFTGVRHFRH >NZ_CP016501|1764624:1776689|1766075_1766975_-|WP_001045908.1|DBSCAN-SWA MNKWLVKASSLVVLGGMVLSAGSRVLADTYVRPIDNGRITTGFNGYPGHCGVDYAVPTGTIIRAVADGTVKFAGAGANFSWMTDLAGNCVMIQHADGMHSGYAHMSRVVARTGEKVKQGDIIGYVGATGMATGPHLHFEFLPANPNFQNGFHGRINPTSLIANVATFSGKTQASAPSIKPLQSAPVQNQSSKLKVYRVDELQKVNGVWLVKNNTLTPTGFDWNDNGIPASEIDEVDANGNLTADQVLQKGGYFIFNPKTLKTVEKPIQGTAGLTWAKTRFANGSSVWLRVDNSQELLYK |
8 | Streptomyces_phage(14.29%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_5 |
1836703 : 1843907
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NZ_CP016501|1836703:1843907|DBSCAN-SWA AATGAAAATAGTTGAAGGCGTTTCTCTACATTTAATTAAAAACCAACAATTTAAGACTAATCATCTTACTTTTCGATTTTCAGGTGATTTTAATAATAAAACAGTAGCACGTAGATCGCTCGTTGCTCAAATGTTAGTTACTGCAAATGCTAAGTACCCTAAGGTACAAGAATTTCGAGAAAAGTTAGCTTCCTTATATGGTGCTAGTTTATCAACTAAGATTTCTACTAAGGGTCTTGTTCACATTGTTGATATTGATATTGTATTTGTTAAAAATACCTTTACTTTAGAACAAGAAAATATCGTTGAACAAATTATTACATTCTTGGAGGATATGCTGTTTTCTCCTTTAATTTCTTTAGAACAGTATCAAACATCTATTTTTGATACTGAGAAGAAAAACCTTATTCAATATTTAGAGGCCGATATTGAAGATAATTTTTATAGCAGTGACTTAGCACTAAAATCTTTATTCTACAATAATAAGACACTTCGTTTACCTAAGTATGGTACAGCATCTTTAGTGGAATCAGAAAATTCATTTACAGCTTATCAAGAATTTCAGAAGATGCTTAAGGAAGATCAATTAGATATTTTTGTCGTAGGAGATTTTGATGACTATCGAATGATTCAGGCATTTAATCGTATGGCATTTGAGCCACGTCACAAAGTATTAGCTTTTGATTATACGCAGACTTATGAAAATATAACGAGAAGTCAAGTAGAAGATAAGGACGTTAATCAATCTATTATGCAATTGGCTTATCATCTTCCAATAACTTATAAAGATGAAGATTATTTTGCTTTAATAGTATTTAACGGTTTATTTGGTGCTTTTGCCCATTCCTTGTTGTTTACAGAAATTCGAGAAAAGCAAGGTTTAGCCTATACTATAGGTAGTCAATTTGACAGCTTTACAGGTCTCTTTACGATATATGCAGGTATTGATAGAGAGAATCGTGAGCGTTTTTTGAAGCTTATTAATAAGCAATTCAATAATATTAAGATGGGCAGGTTCTCTTCTACATTATTGAAACAAACAAAAGATATATTGAAAATGAACTATGTTTTAGCTTCTGATAATCCTAAGGTCATAGTTGACCACATTTATCATGAACATTATTTAGATCAGTTTCATACTTCAGCACTATTCATTGACAAAGTGGACGATGTAACAAAATCAGATATTGTCAGCGTAGCAACAAAATTAAAATTACAAGCTTTTTATTTTTTGGAAGGAAACTAGGATGACAGTTAATAAAATAACTTATCAGAACCTTCAGGAGGAAGTATATAAGTTAACATTAGAGAGCGGGCTAAATGTATACCTTATTCCAAAACCATCATTTAAGGAAACTGTTGGAGTATTGACAGCAAATTTTGGATCACTCCATACAAGATATACGAGGAATGGTTGTGTAGAACATTATCCGGCAGGAATTGCTCACTTCTTAGAACATAAATTATTTGAACTGGATAAAGGACAAGACGCTGCAACTCAATTTACTAAATATGGCGCCGAGAGTAATGCCTTTACAACTTTTGATAAAACTAGTTTTTATTTTTCAACTATAAGTCACATTACAAACTGCTTAGATATACTACTTGACTTTGTATTGACAACAAACTTTACAGAAGAGTCAATTACTAAAGAAAAAGACATTATCAAACAAGAAATTGAAATGTATCAAGATGACCCTGAATATAGGCTTTATCAAGGAGTGTTATCGAACCTTTATCCCAACTCTCCATTAGCATTTGATATCGCTGGAGATTACCAATCAATCTCTCAAATAACCTTAACCGATTTACAAGAAAATCACAAAGACTTTTATCAGTTGTCTAATATGAATTTAGTTCTTGTTGGACAATTTAGTCCACAAGAAATTATAACTTACCTTCAAAAAAATTCTCATTTAACAA
GTTACTCACAGAATATTGACCGAGATTCTATTAGTCTTGAACCTGTTATAAAAAACAACTCTTGCCATATGACAGTGACAAAACCTAAATTAGCTATTGGTTATCGAAAATCAAGTCATATGAGACATGGTTCTTACCTAAAAGAGAAAATCGGATTACAGCTATTTTTTGCAATGCTTTTAGGATGGACTTCTACAATTAATCAAGACTGGTACGAATCTGGTCAAATTGATGACTCTTTTGATATAGAAATTGAAGTTCATCCAGATTTTGAGTGCGTTATTATTTCTTTAGATACGACAGAGCCAATTGCTTTTTCAACTCAGTTGAGGTTATTGCTAAAAAATGCTTTGCAATCATCTGATCTTACCGAAAGCCACTTACAAAACGTTAAGAGAGAGCTCTATGGCGATTTTTTAAGGAGCTTAGATTCTATTGAAAATCTCGCAATGCAATTCGTCACCTATTTATATGATGGAAAAACGATGTATTTAGATTTGCCATCTATCGTTGAAGAATTAGACTTGGAAGATGTCATTACGATTGGTAAGGATTTCTTAGATAATGCTGACACATCTGACTTCGTTATTTTTCCAAAATCTTCGTAAATAATCTGAAATTTTGATATAATAAGGAAGCTTAGACATCTATAAAGTTGGTATGTCAAAAGTCTTTTACTAGTTTTGAAATAACATAATATAGGAATACTATGTTTCCAAAAGGAGAATTTATGGTGAAGAAAGAAAATATTCCGAATCTATTAACTGTAGTCAGAATTTTAATGATTCCGTTATTTATAGTATTAACTTCAGTAACAACAAGTACAACTTGGCATATTGTTGCAGCTATTGTATTTGCTATTGCAAGCTTGACCGATTACCTAGACGGTTATCTAGCACGTAAATGGCAAGTAGTAACAAATTTTGGGAAATTTGCAGATCCATTAGCTGATAAAATGCTAGTTATGAGTGCTTTTATCATGTTAGTTGGACTAGACTTAGCACCGGCTTGGGTTTCAGCTATTATCATTTGTAGAGAGCTAGCAGTTACAGGTTTGCGTTTACTTTTAGTTGAAACAGGTGGAACAGTACTTGCTGCAGCTATGCCAGGGAAAATAAAGACGGCTACTCAAATGTTTGCAGTTATTTTTTTGTTAGTACATTGGATGACACTTGGTAACATCATGTTATACATTGCTTTATTCTTTACCTTGTATTCAGGCTACGACTATTTTAAAGGTGCAGGCTTCTTATTTAAGGATACTTTTAAATAATGACAAATATTATTACCGTTAATAATCTCTTTTTTAAATATGATAGTAATCAAACACATTATCAATTGGAGAATGTATCGTTTCATGTGAAACAAGGAGAATGGCTATCTATTATTGGTCATAATGGTTCTGGGAAATCCACTACTGTTCGTTTAATAGATGGGCTTTTAGAGGCGGAGTCAGGCCAGATTATCATTGATGGACAAGAGTTAACCGAAGATAATGTCTGGGAGTTACGTCACAAGATTGGCATGGTATTTCAAAATCCTGATAACCAATTTGTAGGGGCAACTGTTGAAGATGACGTCGCCTTTGGTTTGGAAAATAAAGGTATTCCTTTGAAAGACATGAAAGAAAGAGTGGACCAAGCACTTGATTTGGTAGGCATGTCTGAATTTAAAATGAGGGAGCCTGCACGTTTATCTGGGGGACAAAAGCAGCGTGTAGCCATTGCAGGAGCTGTAGCTATGCGTCCACAAGTTATCATCTTGGATGAGGCGACTAGTATGCTTGATCCTGAGGGACGATTGGAACTGATAAGAACGATACGAGCTATTCGTCAAAAATATAATCTTACTGTTATTTCAATTACACACGACTTAGATGAAGTTGCATTGAGTGATCGTGTCATCGTTATGAAAAATGGTAAAGTTGAGTCGACATCCACTCCAAAAGCGTTATTTGGTCGTGGAAATCGTTTGATTAGTTTAGGTTTAGATGTTCCCTT
TACAAGTAGGTTAATGGCAGAACTGGCTGCTAATGGTCTTGATATAGGGACAGAGTATCTTACAGAGAAGGAATTAGAAGAACAATTATGGGAATTGAATTTAAAAATGTAAGTTATACCTATCAAGCCGGCACTCCTTTTGAAGGGCGTGCCCTTTTTGACGTCAATCTGAAAATTGAAGATGCTTCCTATACCGCGTTCATTGGGCACACAGGTTCTGGAAAATCAACTATTATGCAACTTTTGAATGGTTTACATATTCCTACAAAAGGTGAGGTAATTGTCGATGATTTTTCTATTAAAGCAGGGGACAAGAACAAAGAAATCAAATTTATAAGGCAAAAAGTTGGTTTAGTTTTTCAATTTCCAGAAAGTCAGCTTTTTGAAGAGACAGTTTTAAAAGATGTTGCTTTTGGACCACAAAATTTTGGTATTTCTCAGATTGAAGCTGAAAGGCTGGCTGAAGAAAAATTAAGGTTAGTTGGTATCAGTGAGGATTTATTCGATAAAAATCCATTTGAACTTTCTGGAGGGCAGATGAGGCGGGTTGCTATAGCTGGTATTTTAGCGATGGAACCCAAAGTACTAGTACTAGATGAGCCAACAGCTGGACTTGATCCTAAGGGAAGAAAAGAATTAATGACTCTTTTTAAAAATCTTCATAAAAAAGGAATGACTATCGTCTTAGTGACTCACTTAATGGACGATGTAGCGGATTATGCTGACTATGTGTATGTTTTAGAAGCAGGGAAAGTAACCTTATCAGGACAACCAAAGCAGATTTTTCAAGAAGTAGAACTTTTAGAAAGTAAACAATTAGGAGTTCCCAAAATCACCAAGTTTGCTCAAAGGCTATCTCATAAGGGATTAAATTTACCTAGTTTACCAATTACTATTAACGAATTTGTGGAGGCTATTAAGCATGGATAAATTGATTTTGGGACGTTACATACCAGGTAATTCTCTCATTCATAAATTAGATCCTCGAAGTAAGTTACTAGCCATGTTGCTTTTTATCATTATTGTTTTTTGGGCAAATAATGTAGTGACAAATGTGATCGTATTCATTTTTACGTTGGTTATTGTGGGTTTATCTCAGATAAAATTTTCCTATTTTTTCAATGGTATTAAACCGATGGTTGGTATTATTTTATTTACAACCCTGTTCCAGATGCTGTTTGCACAAGGCGGGCAGGTTATCTTCTCGTTTTGGATTTTTAGCATCACTAGCCTTGGGTTACAACAAGCAGCACTTATTTTTATGAGATTTGTTTTAATTATCTTCTTTTCGACATTACTTACCTTAACTACTACTCCGTTAAGTTTAGCAGATGCTGTTGAATCTTTGTTAAAACCATTGGAAGTACTGAGAGTACCAGCACACGAGATTGGTTTGATGTTATCCCTTAGCTTACGTTTCGTTCCTACTTTAATGGATGATACAACACGTATCATGAATGCTCAAAGAGCTCGTGGAGTTGATTTTGGAGAGGGAAACCTAATTCATAAGGTTAAGTCCATTATCCCTATTTTGATTCCATTATTTGCTTCTAGTTTTAAAAGAGCAGATGCGTTGGCAATTGCTATGGAAGCAAGAGGTTATCAAGGCGGAGCCAATCGTAGTAAGTACCGTTTATTGAAGTGGACTGTTCGGGATACTTTTAGTATATTACTGATGTTATTATTGGGTTTGAGTTTGTTTCTATTAAAAAATTAAAAGGTTAAGGTCAGTTTATGGTCTTAGCCTTTTCATTTATATAATGTGAAAAGCTCAACAACTTTACTTAAAGAAAACTAAAAATTGTTTTTTGGAGGCCTTTTCTTTATTAATTTACAATTTATTTGTAATAAATTTCATCTAAGGCAAAACAATTAGTAATATTTACAAGGTATGATAGTACGCGTATCAATTAATGAAAGAGAGCATATATTATGTCAATAACCTCGGTTAAAAAATCAAAACCATTTAAATTAGGAGTGGCAGGTCTTTTAGTGGGTGC
TTCATTAGCTTTACCACTTTCAGTAAGCGCAGCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACTACGGTACAAGAGTTAGTGTCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGTGATGTTTTAAAATTGGATAATTCTACAGCTAGTCAAGCAGAAGCAAAATCTCAACCAACAATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCCGCAAAAGAAGAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGAAGATATCAACTGTCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAAGTAGCGGACAATTATGTGGCTTCTCGTTACGGATCTTGGTCGGCAGCGCTATCATTTTGGAATAGTAACGGTTGGTATTAATAAAAGTTGATAAAACAAAGAGATTTACTTAAAATTAAATCTCTTTGTAATATTTTTAACATCTATTCAATTGCAAGGTAACAAATAAAGTATAATATAGGTAACATGAATAAAAGAAGAAAATTATCAAAATTGAATGTAAAAAAACAACATTTAGCTTATGGAGCTATCACTTTAGTAGCCCTTTTTTCATGTATTTTGGCTGTAACGGTCATTTTTAAAAGTTCACAAGTTACCACTGAATCTTTGTCAAAAGCAGATAAAGTTCGCGTAGCCAAAAAATCAAAAATGACTAAGGCGACATCTAAATCAAAAGTAGAAGATGTAAAACAGGCTCCAAAACCTTCTCAGGCATCTAATGAAGCCCCAAAATCAAGTTCTCAATCTACAGAAGCTAATTCTCAGCAACAAGTTACTGCGAGTGAAGAGGCAGCTGTAGAACAAGCAGTTGTAACAGAAAACACCCCTGCTACCAGTCAGGCACAACAAGCTTATGCTGTTACTGAGACAACTTATAGACCTGCTCAACACCAGACAAGTGGCCAAGTATTGAGTAATGGAAATACTGCAGGGGCTATTGGCTCAGCAGCTGCAGCACAAATGGCTGCTGCAACAGGAGTCCCTCAGTCTACTTGGGAACATATTATTGCCCGTGAATCAAATGGTAATCCTAATGTTGCTAATGCCTCAGGAGCTTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAACAGCTACAGTTCAGAATCAAGTTAATTCAGCTATTAAAGCTTATCGTGCTCAAGGTTTATCAGCTTGGGGTTACTAG
Protein sequences of DBSCAN-SWA_5 >NZ_CP016501|1836703:1843907|1841547_1842342_+|WP_000359358.1|DBSCAN-SWA MDKLILGRYIPGNSLIHKLDPRSKLLAMLLFIIIVFWANNVVTNVIVFIFTLVIVGLSQIKFSYFFNGIKPMVGIILFTTLFQMLFAQGGQVIFSFWIFSITSLGLQQAALIFMRFVLIIFFSTLLTLTTTPLSLADAVESLLKPLEVLRVPAHEIGLMLSLSLRFVPTLMDDTTRIMNAQRARGVDFGEGNLIHKVKSIIPILIPLFASSFKRADALAIAMEARGYQGGANRSKYRLLKWTVRDTFSILLMLLLGLSLFLLKN >NZ_CP016501|1836703:1843907|1839897_1840737_+|WP_000181757.1|DBSCAN-SWA MTNIITVNNLFFKYDSNQTHYQLENVSFHVKQGEWLSIIGHNGSGKSTTVRLIDGLLEAESGQIIIDGQELTEDNVWELRHKIGMVFQNPDNQFVGATVEDDVAFGLENKGIPLKDMKERVDQALDLVGMSEFKMREPARLSGGQKQRVAIAGAVAMRPQVIILDEATSMLDPEGRLELIRTIRAIRQKYNLTVISITHDLDEVALSDRVIVMKNGKVESTSTPKALFGRGNRLISLGLDVPFTSRLMAELAANGLDIGTEYLTEKELEEQLWELNLKM >NZ_CP016501|1836703:1843907|1840712_1841555_+|WP_000510606.1|DBSCAN-SWA MGIEFKNVSYTYQAGTPFEGRALFDVNLKIEDASYTAFIGHTGSGKSTIMQLLNGLHIPTKGEVIVDDFSIKAGDKNKEIKFIRQKVGLVFQFPESQLFEETVLKDVAFGPQNFGISQIEAERLAEEKLRLVGISEDLFDKNPFELSGGQMRRVAIAGILAMEPKVLVLDEPTAGLDPKGRKELMTLFKNLHKKGMTIVLVTHLMDDVADYADYVYVLEAGKVTLSGQPKQIFQEVELLESKQLGVPKITKFAQRLSHKGLNLPSLPITINEFVEAIKHG >NZ_CP016501|1836703:1843907|1837949_1839233_+|WP_000217765.1|DBSCAN-SWA MTVNKITYQNLQEEVYKLTLESGLNVYLIPKPSFKETVGVLTANFGSLHTRYTRNGCVEHYPAGIAHFLEHKLFELDKGQDAATQFTKYGAESNAFTTFDKTSFYFSTISHITNCLDILLDFVLTTNFTEESITKEKDIIKQEIEMYQDDPEYRLYQGVLSNLYPNSPLAFDIAGDYQSISQITLTDLQENHKDFYQLSNMNLVLVGQFSPQEIITYLQKNSHLTSYSQNIDRDSISLEPVIKNNSCHMTVTKPKLAIGYRKSSHMRHGSYLKEKIGLQLFFAMLLGWTSTINQDWYESGQIDDSFDIEIEVHPDFECVIISLDTTEPIAFSTQLRLLLKNALQSSDLTESHLQNVKRELYGDFLRSLDSIENLAMQFVTYLYDGKTMYLDLPSIVEELDLEDVITIGKDFLDNADTSDFVIFPKSS >NZ_CP016501|1836703:1843907|1843202_1843907_+|WP_001042661.1|DBSCAN-SWA MNKRRKLSKLNVKKQHLAYGAITLVALFSCILAVTVIFKSSQVTTESLSKADKVRVAKKSKMTKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEANSQQQVTASEEAAVEQAVVTENTPATSQAQQAYAVTETTYRPAQHQTSGQVLSNGNTAGAIGSAAAAQMAAATGVPQSTWEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQNQVNSAIKAYRAQGLSAWGY >NZ_CP016501|1836703:1843907|1836703_1837948_+|WP_000706190.1|DBSCAN-SWA 
MKIVEGVSLHLIKNQQFKTNHLTFRFSGDFNNKTVARRSLVAQMLVTANAKYPKVQEFREKLASLYGASLSTKISTKGLVHIVDIDIVFVKNTFTLEQENIVEQIITFLEDMLFSPLISLEQYQTSIFDTEKKNLIQYLEADIEDNFYSSDLALKSLFYNNKTLRLPKYGTASLVESENSFTAYQEFQKMLKEDQLDIFVVGDFDDYRMIQAFNRMAFEPRHKVLAFDYTQTYENITRSQVEDKDVNQSIMQLAYHLPITYKDEDYFALIVFNGLFGAFAHSLLFTEIREKQGLAYTIGSQFDSFTGLFTIYAGIDRENRERFLKLINKQFNNIKMGRFSSTLLKQTKDILKMNYVLASDNPKVIVDHIYHEHYLDQFHTSALFIDKVDDVTKSDIVSVATKLKLQAFYFLEGN >NZ_CP016501|1836703:1843907|1839355_1839898_+|WP_000239224.1|DBSCAN-SWA MVKKENIPNLLTVVRILMIPLFIVLTSVTTSTTWHIVAAIVFAIASLTDYLDGYLARKWQVVTNFGKFADPLADKMLVMSAFIMLVGLDLAPAWVSAIIICRELAVTGLRLLLVETGGTVLAAAMPGKIKTATQMFAVIFLLVHWMTLGNIMLYIALFFTLYSGYDYFKGAGFLFKDTFK >NZ_CP016501|1836703:1843907|1842557_1843097_+|WP_000029067.1|DBSCAN-SWA MSITSVKKSKPFKLGVAGLLVGASLALPLSVSAASYTVKSGDTLSAIAKNHKTTVQELVSLNSISNADVISIGDVLKLDNSTASQAEAKSQPTIENSMNSSSNLSSSDSAAKEEIARRESNGSYTAQNGQYYGRYQLSQSYLNGDLSPENQEKVADNYVASRYGSWSAALSFWNSNGWY |
8 | Streptococcus_phage(16.67%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_6 |
1852209 : 1867825
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NZ_CP016501|1852209:1867825|DBSCAN-SWA TTTGGCAGAAGTATCTGAGCTAAGAGTACAACCTCAAGACCTGTTGGCAGAACAGGCCGTTTTAGGATCTATCTTTATCTCACCAGAGAAACTGATTATGGTGAGAGAATTTATAAGTCCAGATGATTTTTATAAATATTCACACAAAGTTATCTTCAGAGCGATGATTACCTTAGCTGATAGAAATGATGCAATTGATGCTGCTACTGTTAGAAATATTTTAGATGACCAAGGTGATCTTCAAAACATTGGTGGGTTGGGCTATATTGTTGAATTAGTTAATAGTGTTCCAACAAGTGCTAATGCGGAGTTTTATGCCAAAATAGTTTCTGAGAAAGCCATGTTGCGAGATATCATTTCTAAGTTGACAGATACTGTCAATATGGCCTATGAAGGAAATGATTCTGATGAAATTATTGCTACCGCTGAGAAGGCTTTAGTTGATATTAACGAACACAGTAATCGTAGTGGGTTTAGAAAGATATCTGATGTTTTAAAAGTTAATTACGAAAACTTAGAATTACGTTCACAGCAAACCTCAGATGTCACAGGTTTACCAACTGGATTTAGGGATTTAGACAGGATTACTACAGGTTTACATCCAGATCAATTAATCATCTTAGCGGCTCGTCCTGCAGTAGGTAAAACAGCCTTTGTTTTAAATATTGCTCAAAATGTTGGAACTAAACAAAATAGACCTGTAGCTATTTTTTCACTTGAAATGGGGGCTGAGAGTTTAGTTGACCGTATGTTGGCAGCAGAGGGAATGGTTGATTCCCATAGTTTACGAACTGGTCAGTTGACGGATCAAGATTGGAATAATGTAACGATTGCTCAAGGAGCTTTGGCGGATGCGCCAATTTATATTGATGATACGCCCGGGATTAAAATTACAGAAATTAGAGCACGCTCTAGGAAGTTATCTCAAGAAGTTGATGATGGTTTGGGTTTAATCGTCATTGACTACCTCCAATTAATTTCAGGAACTAGACCAGAAAATCGTCAGCAAGAAGTTTCTGAAATTTCAAGGCAGTTGAAGATTTTAGCTAAAGAATTAAAGGTTCCTGTTATCGCATTAAGCCAGCTCTCTCGTGGGGTTGAGCAAAGGCAGGATAAACGACCGGTCTTATCTGATATTCGTGAATCTGGATCGATTGAACAAGATGCTGACATTGTTGCTTTCTTATATCGAGATGATTATTATCGTAGAGAGGGCGAAGAAGCAGAGGAAATTGTAGAAGATAATACCGTTGAGGTTATTTTAGAAAAGAACCGTGCTGGTGCGCGTGGGACAGTAAAATTAATGTTTCAAAAGGAATATAATAAATTTTCTAGTATAGCTCAATTTGAAGAATAGGAGATGTTAATATGAGTGATGCATTTGCAGATGTTGCAAAAATGAAAAAAATAAAAGAAGATATCAAGTCACATGAGGGACAGATGGTCGAACTTACTCTCGAAAATGGTCGTAAGAGGGAAAAAAATAAAATTGGGCGCCTAATTGAAGTTTACCCATCTTTGTTTATCGTAGAGTATAAGGATACTGCAGCTGTACCAGGAGCAATTGATAATACTTATGTAGAGTCTTACACTTATTCAGATATTTTAACTGAGAAAACTTTAATTCGTTATTTTGATGACGAGTCAGCAGAATAATATGAAAACCTCGACTGAGGTTTTTTGTTTCTATAAAAAAATTTACTAATTCTCTATTGGTGGTTAGAATTAAGAATTTTTCTTATTATAATCGGAAATACTTATTAGATATCTATTTGTATTTAAGGTTTAAGAGATATAAAAACTTGATTCAATATTATTTTCATGATATAATATAGCTTGTGTGAAATAACAGCAGAAAAATATGAAGCTCGTCAACAGGTGGTTAACTAATCTAGTACCCTTTGCTGTTTAGGCGAAAATCCC
CTGCACGAATCAGGATTTTTTAAATCGTTATTTCCTCAATTATTTTAGAGGAGGACATTTAGATGTCACGTTATACTGGTCCATCATGGAAACAATCACGTCGCCTTGGTTTATCACTTACAGGCACAGGTAAAGAATTGGCACGTCGTAACTACGTACCTGGTCAACACGGTCCTAATAACCGTAGCAAGCTTTCAGAATATGGCTTGCAGTTAGCTGAGAAACAAAAACTTCGTTTTTCATACGGACTTGGTGAAAAACAATTCCGTAACTTGTTCGTACAAGCTACTAAAGCAAAAGAAGGAACTCTTGGTTTTAACTTTATGGTTCTTTTAGAGCGTCGTCTTGATAACGTTGTTTATCGTTTAGGACTTGCAACAACTCGTCGCCAAGCTCGTCAATTTGTTAACCATGGTCATATCCTTGTTGATGGTAAACGTGTTGATATCCCATCATACCGCGTTACTCCAGGACAAGTTATTTCAGTTCGTGAAAAATCAATGAAAGTACCTGCAATCCTTGAAGCTGTTGAAGCTACTTTAGGACGTCCAGCTTTCGTATCATTTGATGCTGAAAAACTTGAAGGTTCATTGACTCGCCTTCCAGAACGCGATGAAATTAACCCAGAAATCAATGAAGCACTTGTCGTTGAATTCTACAACAAAATGCTTTAATTTTAAGATATATCTTACAGAAAGCCTACAACAGTGGGCTTTTTGTCTTATCTTAAAACATTGATTTCTGGGTTTGCTTACTTTTTTGCTAACTTTTGATTTTTTACTATATTGAGTTAATTGCTGTTTCAAAGAATGAGACAGCTTTTTTTGCATTCTCTTTTGATAAATGGCTATAAATATCCATAGTCATAGATAGAGTAGAGTGCCCTAATCTATATTGGAGTTCTTTGTAGGGGATGCCTGTGTTCAGCAATAAACTAGCGTGAGTGTGACGAAAACCATGGAAGCCTATATTCGATACATTTGCTCTTTTAAAATGTGTTCTTAATCGAGTTTGTAAGGTTCTATTGTTTGGGTACTTATGGATAAAATCAGAAAATACCACTGTTTCTGAACGCCCTAACTTCCATGCTTCTTGAATCTGTCGTCGTCTATACTGCTTAAGCATCGTAACTGTTCGACTATCTATGTCAATATCACGATAGCTTGACTTTGACTTTGGACTATTAATTTCTTGTTTGTAATTTAGCGTTTTTGTGATATGAACAACGGCATTATCCAAGTCGATATCTGACCAGTTTAGAGCTAATGCTTCATTAATGCGACAACCTGTAGCTAATAAAAACTTATAAAGTGTGACTTCATAGAAGTATCTGTATTTATTTTTATCTAGATTGTTTAAGTAGTTGAAAAATGTTCTTAGTTCATCATTTTCGAAATGCTTTACTCTTTTAGTGTTAGCTTTCTTAGTGTTGCGAGGGAGAATGACCTCACGCGCAGGGTTGAATGGTATAGCTTGCATGATAACGCCATACTGTAGTATACGTTTATTCAGCGCGTGTATTCTGTCATAATGGAGGTAAGCTTTTCTTTCTCCTTTATTAGTTTTATCAGCTAGTTTATTGATGATAGACTGTATAAGTGGTGTCGTTAACTTATCAAGCTTATATGCTCCAAAAATTGGTATGACATGAACGGTTAACAGCTTTTCAGTAGCTAGCTGAGTATTGTATTTTACGGTATGTTTGTAACTATCCCACCATAAGGTTGCAAGTTCCTGATAACTTGTTACGGAAGTAGCTTTGAAGCGAGTAGAACCATTTTTTATAAACTCGACAGCTTCTTGCTTAGCTTTTTCTCTAACTTCTTTCTTAGTTCGTCCTGTTATTTTAGTAGTTACTTTTTTACCTGTTACTTTATCAGTACCAAGATAAATACTGGCACGATAAATTACCGTACCATCTTTCTTTTTTACTTCAGTTATTTTCATGATCATAAACCTTTCCATCAGCAGGCAAGCTATTATTAAAGGGATTTTAGGTT
TATATCATGCCAGAGCTTACGAGAAGATTCTTATTTCGTTTGTTTTGGAGAAGTCGGTAAAATTACCTGGTTAAAGGTTAAAATGCGATTATGGATGATTTAAAGCTGTTTTTCAGGGGTTTTTGACAAAGTTAGCTTACTGGATAGCTAACAAAATCCTTTACATTTAGAACGTGCGTGATATAATATAGTTAACAAAGATAACTTGTGAAGGATTAACGCTGGGTCCCAGAATGGGGTAAGCCTTCGGGCTGAGCATTCCTATGTGCCAGGGGTTATCTTTTTTATTTAGATTTTTTCAGACTTTCAACAAAATGTTGAGGGTCTTTTTCTATTTCAGAGATAATAAAATCAACGAATTTTTGAGAATAAGTGTAGTGTTCAGATTTACCTATTTTATGGCAATAAGAATATTTTTCATCAGACTTTATTGAATAGAAATCAATCACTAATGTTAGGACATATTGATTAAAACCAGATTTATAATTAAGTTTAATATTCTTTTTATTTAGTCTATCATTTACAACAGATATAATATTTGCATATGAATACTTGTGCGTTTCTGATGGATCTTTTAAATCTTTTAAAATAGCAACTTGAGAAGGTGATTGATTTGCAATTGATACTATAAAATCGGCTTCAGATTTTTTCTTAGTAATATAAAGATTTTGTTTAATACCAATAGCGAATTTATCAGAATTATACTCTGTCACTAAGACATCTATGGCATTTGCCTGTTGGATGAATTTTTCAGCAATTTCCGCAGGATATTTTAATCTAATTTGTTCGTTAGATAATGGTTCGTAAGTTGCAGTGATAGTTAAAAAATTTTGGGAAATGGCCTTTGTAACATCTCTTGAATGAAATCGTTGAAGTTCATTGACATAGTTAAGTACACAAGCTTGGAAAAGTGGAGCATACTTTAATTCATAATCTTCTGTTATGTAGTGAGTACTAATATTCCTTAGTTCAATAATGCGCTCAAGATTAAGTCGAATTCTAGTGCTGTTGTCCGAGTATATTTTTTTTATAACACCTTCTAAGCTTAGTGTTCTATCAGGATTATCCTTAAAATAGATAGATTGATTACGATTTAACATTTCAGCTTTAAGCATCAATTCCCAAGCATTACAAATAAAGAAACTAAAGCCCTCAATTCGGTACTTAATCGTTGGTTTATTGTATATTTCAAGTCCCATAATAAAAGCTTCAATACTTTTATCAACTAACCTTGTACTTAAATTCTCCATATATTTCCTTTCTAGTTTTTTTCACGCGCTTCGAAATCGATCTGTACAATTTTATTAACATCAACAAAATACTTAACCACAAGTTTTCAGAATTTTTTCTAATTTAGTATCTTTAAGGTTACCCCCATCATAAAAAAATGTTGTTCAATCCAAGTTTTGAAACCGGTGGGAAGGGTCAACTGTCCAAATATTTGCTTTTTTTCTCACACGACCCCCCACCCCGAAGATAAGAATACTAAATTATCACATCATTAAAGAGACAAACGTTTCTAACTGCCATTGCTTTTATTGGAGGAGTAGTTGCATAAGTTGGAAATCGATTATACTTTTGCAAAGTGAATGATGAAAATAGCTCATGAATTACCAAATTTCCCGACGTTGGGAAATATCGGAACAGAAGCTCTCTACCTTATTGCCACCCTACCAGATAACCAAAAGCAGTAACAGCTTGAATAACCTATTGCTTCCTATGGCGGAAGGCTCTTGAACCTAAATGCGACCATGTGGTCGTATTTGACATGTTAATGCTAAGGAATGCTATGGTTGCCAATATAAATAAGTGTTAATTATTGTTTGGTGGGGTAGTTGAGGGTTGGTCGGTATCAGGGAGATTTGGGAGAGAAAATTTTGCTTTCTTTTTGCAACTCCGTTGACATGTTTTGCTGTAGCCCCTAAATATTTCAACGTTGAAACATTTGGAATTTGTTTAGCTACCGTCATCATCCTGTTAGCCTCCCTGTAATGGATACC
AATTTTATTTTCTATCCATTCAGCAAACTCTCCATGTGCTAAAACATTTTTTACATGATTTACTCGTCTGAGCATTTATTATTTATTTAATCAAAATCTATCTTATCTCCCTCAGTATAGATTTTATATCCGTAGGACTCCAAATTTTTTTTAAAATAATCAAAACTAGCGGGGGTATCAGTCACAAAATAATCTAATTGTTTATAAAATTTGAAAATAGCTTCTTGTTGACTTTTGGAAAGTTCACCACTTGGGAAGTGGTCTAACATCATCAATAGCTCAATAAGGTCAGTATGTATTTCAAAAAGTCTGTCCGTAAGTCTTCTGCTGACTTTTTCAAAATCATAATATTTTGAAGCATTTTTTATATATTCTTCTTGTTTAACAGATTCATTGATGGTATTTTTTATTTTTTTTATGCTTTTCGAGGTGTTTTCTAAAGCCAAAAACTGTTCTTTGGTTAGATGGAGGGTTGTACCATCTAAATTTAGCTGAAAATCAATCTTTTTATTAGGAGCATAACCTAGTAAATATCCAACTGATACATTAAAATATTTTGCTAATTCTTTTGCTTTGTCAGGCTTTATTTGTCTTTCGCCGTTTTCCCAACGGAGAATAGTGATTTTTGATACACCAATCTCCCCAGCTAATTCTTCTTGGGTTAGCTTTTTTTCTTTGCGTAATTCTTTCAACCTATTCATTTATTTCACTACCTTTCAAAGTTGATTATAACCGAACTTATAAAAAGTATCAAGAAATGATATAAAAATTTCAAAATAATTCTTGACAAGTAACCGAAACAGATATATAATCGTTTTGAAGTTATCCGAAAAGGATACTTTTATCTCCCGCAACCTTTCTCACTTTCAACCTACGGGAGGAATTTTTTCAAAAGGAGGATACATCATGGACAAACTACGAGGCTATCGTGTCATGCTAGGGTTAACCCAGAAAGACATGTCGGACAAGCTGAATATTTCTTTACAGTCTTACAACAATAAAGAGACGGGTAAGAGCGCTTTTAATGACAAAGAACGACTAGCAATTAAGTCAATGGTTTCAGAAATCAAACCAGATATAACCATTGATGAATTATTTTATAGCTAGAAGATTAAAGAAAGGACAACTTATTGAGAACAGAAACATGGAACGGATATACTATCCGATTTGTAGAGCACCAAGGTGAATGGTGGGCGGTGCTAGCTGATATTTGTCACGCGCTAGACTTAAAACCAAAACGTGTGAAAGAGCGTTTAGTTGATGAGGTCGTTTCAACCGACCACGTCGCAGACAGTTTAGGACGTCAACAAGAAATGCTAATTGTTAACGAGTTTGGAATTTATGACACTATTTTTTCAAGTCGTAAACCAGAAGCAAAATCATTTAAGTTTTGGGTATTTGAAACCATTAAACAGTTAAGACAAGCAACTGGTCTAGAAGGCTTTGAGGTCTTTAGAATGCTGGATAAGGAACATCAAAAAGAAGCTATGGCAAGATTGACTAATAGCTTAGATAGGGTATCTAAGAAAGACTTGATTAAAGCTAACACAATCACAAATAAAGCCGTTTCCAACAAGTTTGGATATTCAAAAATGGTTAAAAAATCAGAAATGACACAAGATATGTTAGTTGCTAGAGAAATGATTTTAGATGATACTGTGGAACTCATGGGAATTAAAGAAAAGTTCGGATTGAATATCAGTGTGTCTGAGTCAATCTACAATAAAAATTAAAAAAAGCGCAAAAATGCGCGTGAAAACAACAAAAAAAGGCTTACCGAGACCAATCAGCAGAGCCTTTAACTAGTATAACTAAACTCAATTAATAAAGCAGGCAAGCTATTATTAAAGGGGTTTTAGTAAAAGATTTGATACTTCCATTGTATCATACTACCGCAGAGCAGGCAACGACTTAACAGTTGCAGTTCTCCCCGACAAAACAACAATAATCAAATAACGAGGTAAAACAGTGAATATCATAAAACAAGTAAAAAGTTCTTT
TGGAGAACTTGAAATTGATTTTTATCTGGACAGGAATAGAAATATTTTTGTGACGATTGAACAATTAGCGCAGGGATTTGGATATAAGAGCCGAAATGCTATTGAAAAGATGATAGAGCGCCAACCCTACCTCAAAGAAAAGCGATTTTCAGTTACTGACAAATTGTCAGCTACTGATGGCAAACAATACGAGACCCGACTATTCAATAAGCGAGGAATTTTTGAGATCGGTATGCTGTCCAAAACGGAGAAAGGTAAAATCTTTCGTCAATGGATTTATGACCATATCGAAGAACTAGAAAGAGAAAACGCTAACTTTAAACTGATACGAGAGCTTGAAAAGTCTAATCATAAAGAATTAACACAAGCTATCAAGGATTGGGAACACTTTAATCAATGGAGCTACAAGGCTATTAGCGACCTTTTGCTAAAATCTGTCACAGGACAGACTGCTAAACAACTGAAACAGTCACGGGTAGGTTATGAAATTGCATTAGATTGCCTAAGTGCTGATGAGTTGACGCGATATAGAAAACTTGAACAAAAAGTAATTGTCCTATTGGAGCTCAATGCAGAATATAACGATATTAAAAAATTAGTCCTTTAACAACTGAATAGAGTACGCAAAAAAGTTTGATTTAGGGGTTGACTTATGTACGTGCTTATCATATGATATAAGCACGGGCTTAAATGAAAGGAGGAACTCCTATGAGTCCAAAAATGGGCAGACCTGTAAAAGGTACAGCTAAACGTGATAAACGGCTGGAAGTTCGTTTGACCGCTGACGAGTATAATACAATACAAGAAACAGCGGATAAAAATGGATTATCTAAAGCGGATTTAATCGTTAAAGCTGTAAACTCTTATGAGTCTGAAAAATAAAAAAGTTCCTAACGTGGTATAGTTTGGCGACCGTACACGTTAGGAACTCCCAGCACCCACAAAGTAGGTACGTAAATAGTATATCATGCGTACTCTATTTTGTGAACCTAAAATTCATAGAAAGGGAGTGCGCTTTTTGTGTGCTCAAAAATCAAGACAATATGATTAAGAAAAAAAGAAATCCGAGAGTGTTATTTAGAAACGTCGCTTATAAACTATCTGAAATTGAAGGGAAAACATTAACAGAAATCGCTTCTTTTCTTGGTTTTGGGAATTCAGAAGTTTGCAGAAGCGCCCTATACAAATGGAAGCGTAAAAAATGGCTAAAATTCGACCTCAAAAACGGTCATTATCGTAATGTTGAAGTATTACATGAAGTTACGCTTGAAAAAATGGCTAATAAGGAATTAAAAGAGCAAGGACTCATTTATAAAGCTAACATTTACTATGAACAAGTGGTCTCGACATCCGAAATTATAGAAGACATTAAAACCAAAACACAAGATAGAATTAAAGCCATCCACTTACAACAAAAAGCGCTAGAGCGTATCCCTAGCGAGTTATTCGCAGAATTATACACTAACATGAACTAATCAGGGAGCTACCCCTTAAACCTAGCATGAATCTAGTATAGGAAACCGGCAATTACACAGAAAATAAAATTAAGCGAGAAAAGACATAATGAAATATAGAGTAGAAACAAATCCTTTTTCAAAAGATAGATACACTCCTGAACAGCTAGAAATGTTCAAAAATCGCCAACTCAGCAAAAATAAAGCAGAAGCCTATTTCACTCGACTATATAACCAACATATCGCTTGGGTAATTATTGCTAACGTTATGACAGAGTACGTCATTAAATTCAGAAAAAGTGCCACCAGCTTTGAAGAAGCATGGGACGCTTTAGACTATCAACGAACCACAGAGATTGTCTTTAGAGCCGTTAACGGTTTACCTTGTTCAGAGAAAGACACAGGGGAATTAGAAACTTATTTAAGTGAGGTATCGGCATGATTCAAGAACTTAATCTCACCCCAACACAGACACTTATTTTATTCATTGTTCTAGGTCTCATAGGGCTTCTTCTTAGCCGTTCTAAGCCATTAATAGAGATTGAC
TTACCAGAAGATATCCAATCACCTAAACCACGTCAGAACGCAAACTATGGGGCTTATATTCAATCACAGAACCATTATTACAATTAGGGAGGAACTGAATGACACTACCAGAAAATTATAGACGTGTCCTTAAACTGATTAAGGTGGGGGCAGACAACCCCATTACAGGAGCAGAGATTGGCTTAATACTGAAACTTGAAGAACGCTCCGTCCAAAGTATCATCAGTAGCTTAATCACGCGCTATAACGTCCCTATTATCGGCATTAGACACGGATTCAATCGTGGTTACTTTATTCCAGCTAACAAAGAAGAATTGCTAGACGGTGCTAAAGCCTTTTACAACCAAGTACAAAAGGAACAAGAACGCCTAAGTGTGTTATTGAATGCCGATTTAACCAGCTATAAGAAATTACTCAAAGGAGGTTAGGCATGAACGTATTTAGTCAAGATTATGAAGCCAAACTCTTAGAACAAAACCTGACCGCGTTTAATCGCTTTTTGGAAGCCTACCAGAAACCTAAACCAAGAGTTTTAGGGTTGATAACGGCTGAACAGGTCAAAGAGGAATTAAATATCAAAGGCAAAACTCTAAAACGGTGGGAAAAAGCTGGTCTAAGACGATACCAACCACCACTAGAAGATACGAGGAAACATTATTACAAGGTCAGTGATATTCTTATTTTTTTGGGGGTAAATGTGTAGATGGCTATTTATGAAGCAAGAGGCTTTAGCTCTTATTTGTACCCCTACAAAGGACCTTTAGAACCATTTGACTATATTGCGCAGTTTAAACCTCTGAAACCGCCTGAGGATATTGATATTGAAGAATACAAGCGAACACAAGCTCCCTACTGCCTGAGTGGCAAAGTCACAGCAGAGAAAAACGGTAGCTATAAGCGTAATAATGCCAGCCTGGTATATCGTGATTTAATTTTCCTTGATTATGATGAGATTGAGGGACCAGCCCAGGGTTTTATAGAGGCTGTTTCTAGAGCCTTGTTTGGCTTTTCCTATATCTTGTACTCAACAATCAAACACACCCCAGAAAGCCCCCGTTTTAGGCTTGTGGTAAAGCCTGGGGATGTGATGAATGAGGAAACCTATAAGCAGGTAGTCAAGGAGATAGCAGACAAGATAGGGCTACCTTTTGACATGGCTAGCTTAACCTGGTCCCAACTTCAGGGTTTACCAGTCACAACAGGCGACCCAGCAGACTATCAAAAAATCGTAGAGCATGGCCTAGATTATCCAGTACCAAAATCAAACAAAAATCGAACAAGTGGCCAGGGTGTAAAACCGCAGACGTACACGCCACGACCAAGCGGTCAGCGGTCTATCACTATGAGGGTAATAGATACCTTGTTTAATGGTTTTGGAGACGAAGGCGGGCGCAACGTGGCCTTAACTAAGTTTGTTGGCTTGCTATTTAATAAATGGGTGGATTGTGATATAGAAACAGCTTACGAATTAACAAAGATCGCTAACAGTGTGACAGCTAACCCCCTACCAGAGAGGGAGCTAGATAGGACTTTTGAAAGTATAGCAAGAGCAGAATTTAGAAAGAGAGGATAGAATCATAGAAAAGGAAGAATTGAAAAGCCTGGAAAGTGAAATCTTAGAGGCGCGTGAGAATGAGCAACCGCCCAAGACCATGAGAGAGCTAGAAAACCGTATCTTTCAAGCTGGTGAACAATGGCGGGAAGAACACACGGAAACCAAAATAAATGAAAGTACAGGGGACGTTACCGAAAAGGTGGCCATGCCCCAGGTTTTCACAGTTGCCAAAATGCTAAGCGAAATTATCACCTTTACTTTTATCAGTAAAAGCAACGTACCTGATTATAGCCTACTCTATATCTATGATTTAGATGAGGGCATATATACGGCTAGTAATGACCTATTTAACCGATTTTGTAAGACTTTTGACGTGAGGATTAAGCCTAGGGAATGGCCCCAGATTAAGCTAATGGTTAGGACATTGACAAGGATAAAGAAACCG
CTGGAGAGCGCCTACTTTATCCCTGTACAGAATGGCATTATTGACTTAAGGACTAAGGAGCTACTTCCTTTCAGCCCTAAATATGTGATTACAAGCAAAATCAGCACAGCTTATCACGCGCCTAAACGGGTCCCAACCGATAGGGAGGGGAAGACATTTGACGATTGGTTAAACTCTATCGCTTGCAATGATAGTGAGCTGGTAACATTGTTTTGGCAGATTATCCTGGAGGCTATCAACCCCAACCATACACGGAATAAATTTGCTATCTTTTACGGGGACGGTAACAACGGAAAAGGGACATTTCAGCGGTTCCTTATCAATCTGATAGGAGAAAGTAACGTATCAGCATTGAAGCCCGCACAGTTTGCTGAAAAGCATAACCTGGAAACGCTAGTAGGTAAAGTTTGCAATATTGGAGACGAGGCACCTAATGAATACTTAAAAAATCCGTCTGACCTAATGAGTATTTCCAGCGGGGACACCGTGCTAGTCAATCCAAAGGGGCGCCCAGCTTTTGAAGCGACCTTCAAACTCTTTAATATCTTTTCAGGAAACTATATTCCCAACGGTGGAAATAAGACAAAGGGCTGGTATAGGCGTATTATGATTGTCCCATTTAATGCTGACTTTAACGGTGAGAAAGAAAAGCCCTGGATAAAAAATGATTTTTTAGGCAAAAAAAAGGTGCTAGAGTATGCCCTTTATAAAGCTATCAACCAGGAGTCGTTTACTCATTTTATCGAACCGCAGGCAGTCAAAGGCCTGTTAGAGGAATACCAGGAAGATAACGATTACTTGCTATCATGGGTTAAAAATGAGTACATGGAAAAAGGTTGGCATGAGCTGGAAGTTGTCCCCGTTTTCATCGTGACAAGGTCATTAAAACATTATGCTGAAGATATGGGGATAGCTAAGCCGAACGTTTACGGGGCTGGTAAGGAAACAATCAGACACTTACAGCAACTAACACCGAATAAATATCAACTCAAAAGGGCAAGGGTAAAGCTAGAAGATTATGACAAGTTAGACCCTTTAGAGTTTGAACGGAAGAAACTAGGAAATGTTAACCACTCAATCACTAAAAAAGAATAATGTTGACCTTCTTTTGAAGTTGAAAACCCTTGATATTACTGAGTTTTCAAGAAATTTGTTGACCTTCTTGTTGACCTTATTTTTGAGAAGGTAAACATCTTAACCCCTTGATATGACTAGATTTGTAGCTATTATGTTGACCTTGTTACCTTCTTTTTAACTCTCTATATAGGAAAAAAGTAAGTATTTTATATATAAGAGCAATGACTTGAAAAGAAGGTCAACAAGGTAACAAAACGCCGTAAACCTTGATATATAAAGATTTAAGCGTGTTACCTAGAAGGTCAACAAAAGTAACAACAAGGTCAACAAAACACAAAACTTTACATAAATAATCTTAAAATAGAAATAGGAGACAAACACCATGACACTAAAAACATTTTCAGACACAGCACAAACATTTACTTTTACTTATGACTTTGAAGACATTGACACCGCTAAAGTAGCAAGTCATGCAGTACTGGGCTATATGACAGGGACATATCACGCGCCCGTGATTGAGGTAACGATTAAAGGCAAGGGTCAGTTAGTGCTGGAGTATGTGGAAGATAAGAAACTAAGCAAGGTCTTCAAGCGTATCTGTGGCAGTTTCAAGGACTATTACAACCAACCTGAGGATATGACGGATGAAGAACTTGATGACATGGCTCAAGAAAACGAATTAATCAAGGAGGTCGAAGAACTTGAACATCAGCGCGTGGTTCCTTTATTTGAGATTACTCAAGAGGAAGCTAATAAACAAGACACACTCATGGCTTTCATCTCAGACCATGACCAACTGGCTGAACATCTCTCTATGAATTATCAGGAGATGAACCAAGACGATTTAGGAGCTATCCTTGAAACTATCAGTCAAGCCTTTAACCATTTGTATGATATGGTTGTTGAAGGTCAGTTAGTCGTTA
AATAAACAATCAGAGGGTTTTCCCTCTTTTTGTCGTTTTATCAATAGTTTTGGGTTGTTTGAATGTTAAGGAGAAAGGATGTTAGAACTAGCTATTGAAAGTATCATTAAACCAATGAAGGCACAGAGGAAGACAAGGGTTACAGGAACAATAAATGACCAATCTGTCACCATAGACCTAAATAATCTGGTTATTCATTATAACCATCAAAACTTATTACTTGAAACGATACCAGGAACTTATGGTGGTAAACGCTACTTCTTCTTGTGTCCTAAATGTGAGAGACGTTGTCGGAAACTATTTAAAATTTACAATATCTTTGCCTGTGGTTCTTGTCAGAAAGTTCATCAAGCCACACTCAACCGAAGTAAGACAGATTGTCAATACTATTGGCGATTAGCGTTTAGAGAGTGTCTGAAAGTAGATCCAAAAGCAAGATACAAACATGGTTATTATAGTCATGATGACTTTCCTAAGCGTCCAAAATACATGAGAATAGCTAAATACTTGTATCATTGGAAGAGATTCCATTACTATATGGATAAGGGAGACAGGCACTGGCTATAACATTCGGGAAATACCCCACTTGTTTTTGAACGGGGCTATATTGTTCGGAAACTAAAGAACGCGCCCTTTTCCGTGCAAAAAATTCCCTTTTTGAAATTTTTGATAAAAGTTAAAAGCTTGATTCTAAAGGATTTTATATCTAATTTAAGCTAAGTACCCTACCACAAAGAATGACTTTATCAGACTAAAATAAAGCATGGTTTCAGATCTTCTATCTGACAAGTTGATATTTTATATTACCAACTTTAAAAAGCGCTTAGAAACGATTTTAGAAGCGAAAAGCGAAGTACTACAAAAAATATCTAGTTTACAAAACGAACAAAACAAAAAGACGTCCACACGGAACGCCCCCTTGGTTAAATTTAAGCTTAAATAAATTATACCATAACCAGGAGAAAAGACCATGAGCGCTAAAGAACAACTTAAAGAATTGAAACCACTTTTCGCTTTAATAACCTTATTTGAGGAACAACGAGATAAAGACATCAAGCTGATGAATGCTTTTCGTAATCCTGAGTTACTAAATGGCATTGAAAAAGGTACTGCACAGCAACTCTTATATTTAGCTAAGGAACGTGATAAAAGACTAGCCATGATCGCTACATTGCAAGATGAGAGACAGATTGCTGTTATTAAGGCTAGATATGTGGATGACTTATCATGGGATGAGATACCCGACAAGCTAGGTTATTCAAGGAATACCGTTTTTAAATTACATAGAGAAGCTTTAGAGGTGTTAGATGAGCAAGAAGAACGCTATTCGTAAACTAAAAGAGTTTCATAGATGGCAACGTATCGCCAATAGCCTTAATTTAACCTATAACGAGCGTTACCAGTTTGATATAGATTACCATCCCACGCGCAGAAAACACCTTGAAATAAGCCGAGAATGCGCTCTAGAGGAGCTAGACGCGATTAAGCATGCCATTAATCAACTATCTAAGATAGAGTATAGAAAGATACTGATTGAGTGTTACTTGATCGGTGAGAAAAACCTCAAAAAACCTCAACAAGACATCATAGCAGAACTTAACAGAAGTCAAAGTTGGTATTATGAGATTAAGAAAAGAGCCTTGCTTGAGTTTGTGGAGCTTTATAGGGATGGAATTCTAAATAAAATAAATTAA
Protein sequences of DBSCAN-SWA_6 >NZ_CP016501|1852209:1867825|1856415_1857381_-|WP_000429451.1|DBSCAN-SWA MENLSTRLVDKSIEAFIMGLEIYNKPTIKYRIEGFSFFICNAWELMLKAEMLNRNQSIYFKDNPDRTLSLEGVIKKIYSDNSTRIRLNLERIIELRNISTHYITEDYELKYAPLFQACVLNYVNELQRFHSRDVTKAISQNFLTITATYEPLSNEQIRLKYPAEIAEKFIQQANAIDVLVTEYNSDKFAIGIKQNLYITKKKSEADFIVSIANQSPSQVAILKDLKDPSETHKYSYANIISVVNDRLNKKNIKLNYKSGFNQYVLTLVIDFYSIKSDEKYSYCHKIGKSEHYTYSQKFVDFIISEIEKDPQHFVESLKKSK >NZ_CP016501|1852209:1867825|1867438_1867825_+|WP_000038827.1|DBSCAN-SWA MSKKNAIRKLKEFHRWQRIANSLNLTYNERYQFDIDYHPTRRKHLEISRECALEELDAIKHAINQLSKIEYRKILIECYLIGEKNLKKPQQDIIAELNRSQSWYYEIKKRALLEFVELYRDGILNKIN >NZ_CP016501|1852209:1867825|1852209_1853565_+|WP_000852645.1|DBSCAN-SWA MAEVSELRVQPQDLLAEQAVLGSIFISPEKLIMVREFISPDDFYKYSHKVIFRAMITLADRNDAIDAATVRNILDDQGDLQNIGGLGYIVELVNSVPTSANAEFYAKIVSEKAMLRDIISKLTDTVNMAYEGNDSDEIIATAEKALVDINEHSNRSGFRKISDVLKVNYENLELRSQQTSDVTGLPTGFRDLDRITTGLHPDQLIILAARPAVGKTAFVLNIAQNVGTKQNRPVAIFSLEMGAESLVDRMLAAEGMVDSHSLRTGQLTDQDWNNVTIAQGALADAPIYIDDTPGIKITEIRARSRKLSQEVDDGLGLIVIDYLQLISGTRPENRQQEVSEISRQLKILAKELKVPVIALSQLSRGVEQRQDKRPVLSDIRESGSIEQDADIVAFLYRDDYYRREGEEAEEIVEDNTVEVILEKNRAGARGTVKLMFQKEYNKFSSIAQFEE >NZ_CP016501|1852209:1867825|1862834_1863704_+|WP_001029288.1|DBSCAN-SWA MAIYEARGFSSYLYPYKGPLEPFDYIAQFKPLKPPEDIDIEEYKRTQAPYCLSGKVTAEKNGSYKRNNASLVYRDLIFLDYDEIEGPAQGFIEAVSRALFGFSYILYSTIKHTPESPRFRLVVKPGDVMNEETYKQVVKEIADKIGLPFDMASLTWSQLQGLPVTTGDPADYQKIVEHGLDYPVPKSNKNRTSGQGVKPQTYTPRPSGQRSITMRVIDTLFNGFGDEGGRNVALTKFVGLLFNKWVDCDIETAYELTKIANSVTANPLPERELDRTFESIARAEFRKRG >NZ_CP016501|1852209:1867825|1854193_1854805_+|WP_000092759.1|DBSCAN-SWA MSRYTGPSWKQSRRLGLSLTGTGKELARRNYVPGQHGPNNRSKLSEYGLQLAEKQKLRFSYGLGEKQFRNLFVQATKAKEGTLGFNFMVLLERRLDNVVYRLGLATTRRQARQFVNHGHILVDGKRVDIPSYRVTPGQVISVREKSMKVPAILEAVEATLGRPAFVSFDAEKLEGSLTRLPERDEINPEINEALVVEFYNKML >NZ_CP016501|1852209:1867825|1853576_1853864_+|WP_001278152.1|DBSCAN-SWA MSDAFADVAKMKKIKEDIKSHEGQMVELTLENGRKREKNKIGRLIEVYPSLFIVEYKDTAAVPGAIDNTYVESYTYSDILTEKTLIRYFDDESAE 
>NZ_CP016501|1852209:1867825|1866209_1866698_+|WP_000891150.1|DBSCAN-SWA MLELAIESIIKPMKAQRKTRVTGTINDQSVTIDLNNLVIHYNHQNLLLETIPGTYGGKRYFFLCPKCERRCRKLFKIYNIFACGSCQKVHQATLNRSKTDCQYYWRLAFRECLKVDPKARYKHGYYSHDDFPKRPKYMRIAKYLYHWKRFHYYMDKGDRHWL >NZ_CP016501|1852209:1867825|1860821_1861013_+|WP_011285274.1|DBSCAN-SWA MKGGTPMSPKMGRPVKGTAKRDKRLEVRLTADEYNTIQETADKNGLSKADLIVKAVNSYESEK >NZ_CP016501|1852209:1867825|1862561_1862834_+|WP_001100458.1|DBSCAN-SWA MNVFSQDYEAKLLEQNLTAFNRFLEAYQKPKPRVLGLITAEQVKEELNIKGKTLKRWEKAGLRRYQPPLEDTRKHYYKVSDILIFLGVNV >NZ_CP016501|1852209:1867825|1861694_1862027_+|WP_000877495.1|DBSCAN-SWA MKYRVETNPFSKDRYTPEQLEMFKNRQLSKNKAEAYFTRLYNQHIAWVIIANVMTEYVIKFRKSATSFEEAWDALDYQRTTEIVFRAVNGLPCSEKDTGELETYLSEVSA >NZ_CP016501|1852209:1867825|1859261_1859864_+|WP_001258618.1|DBSCAN-SWA MRTETWNGYTIRFVEHQGEWWAVLADICHALDLKPKRVKERLVDEVVSTDHVADSLGRQQEMLIVNEFGIYDTIFSSRKPEAKSFKFWVFETIKQLRQATGLEGFEVFRMLDKEHQKEAMARLTNSLDRVSKKDLIKANTITNKAVSNKFGYSKMVKKSEMTQDMLVAREMILDDTVELMGIKEKFGLNISVSESIYNKN >NZ_CP016501|1852209:1867825|1862229_1862559_+|WP_000174496.1|DBSCAN-SWA MTLPENYRRVLKLIKVGADNPITGAEIGLILKLEERSVQSIISSLITRYNVPIIGIRHGFNRGYFIPANKEELLDGAKAFYNQVQKEQERLSVLLNADLTSYKKLLKGG >NZ_CP016501|1852209:1867825|1863723_1865226_+|WP_000838179.1|DBSCAN-SWA MKSLESEILEARENEQPPKTMRELENRIFQAGEQWREEHTETKINESTGDVTEKVAMPQVFTVAKMLSEIITFTFISKSNVPDYSLLYIYDLDEGIYTASNDLFNRFCKTFDVRIKPREWPQIKLMVRTLTRIKKPLESAYFIPVQNGIIDLRTKELLPFSPKYVITSKISTAYHAPKRVPTDREGKTFDDWLNSIACNDSELVTLFWQIILEAINPNHTRNKFAIFYGDGNNGKGTFQRFLINLIGESNVSALKPAQFAEKHNLETLVGKVCNIGDEAPNEYLKNPSDLMSISSGDTVLVNPKGRPAFEATFKLFNIFSGNYIPNGGNKTKGWYRRIMIVPFNADFNGEKEKPWIKNDFLGKKKVLEYALYKAINQESFTHFIEPQAVKGLLEEYQEDNDYLLSWVKNEYMEKGWHELEVVPVFIVTRSLKHYAEDMGIAKPNVYGAGKETIRHLQQLTPNKYQLKRARVKLEDYDKLDPLEFERKKLGNVNHSITKKE >NZ_CP016501|1852209:1867825|1865590_1866136_+|WP_000173127.1|DBSCAN-SWA 
MTLKTFSDTAQTFTFTYDFEDIDTAKVASHAVLGYMTGTYHAPVIEVTIKGKGQLVLEYVEDKKLSKVFKRICGSFKDYYNQPEDMTDEELDDMAQENELIKEVEELEHQRVVPLFEITQEEANKQDTLMAFISDHDQLAEHLSMNYQEMNQDDLGAILETISQAFNHLYDMVVEGQLVVK >NZ_CP016501|1852209:1867825|1859037_1859238_+|WP_000359663.1|DBSCAN-SWA MDKLRGYRVMLGLTQKDMSDKLNISLQSYNNKETGKSAFNDKERLAIKSMVSEIKPDITIDELFYS >NZ_CP016501|1852209:1867825|1860099_1860738_+|WP_001021393.1|DBSCAN-SWA MNIIKQVKSSFGELEIDFYLDRNRNIFVTIEQLAQGFGYKSRNAIEKMIERQPYLKEKRFSVTDKLSATDGKQYETRLFNKRGIFEIGMLSKTEKGKIFRQWIYDHIEELERENANFKLIRELEKSNHKELTQAIKDWEHFNQWSYKAISDLLLKSVTGQTAKQLKQSRVGYEIALDCLSADELTRYRKLEQKVIVLLELNAEYNDIKKLVL >NZ_CP016501|1852209:1867825|1861153_1861606_+|WP_000916925.1|DBSCAN-SWA MLKNQDNMIKKKRNPRVLFRNVAYKLSEIEGKTLTEIASFLGFGNSEVCRSALYKWKRKKWLKFDLKNGHYRNVEVLHEVTLEKMANKELKEQGLIYKANIYYEQVVSTSEIIEDIKTKTQDRIKAIHLQQKALERIPSELFAELYTNMN >NZ_CP016501|1852209:1867825|1858217_1858832_-|WP_001080841.1|DBSCAN-SWA MNRLKELRKEKKLTQEELAGEIGVSKITILRWENGERQIKPDKAKELAKYFNVSVGYLLGYAPNKKIDFQLNLDGTTLHLTKEQFLALENTSKSIKKIKNTINESVKQEEYIKNASKYYDFEKVSRRLTDRLFEIHTDLIELLMMLDHFPSGELSKSQQEAIFKFYKQLDYFVTDTPASFDYFKKNLESYGYKIYTEGDKIDFD >NZ_CP016501|1852209:1867825|1854911_1856084_-|WP_000605384.1|integrase|DBSCAN-SWA MIMKITEVKKKDGTVIYRASIYLGTDKVTGKKVTTKITGRTKKEVREKAKQEAVEFIKNGSTRFKATSVTSYQELATLWWDSYKHTVKYNTQLATEKLLTVHVIPIFGAYKLDKLTTPLIQSIINKLADKTNKGERKAYLHYDRIHALNKRILQYGVIMQAIPFNPAREVILPRNTKKANTKRVKHFENDELRTFFNYLNNLDKNKYRYFYEVTLYKFLLATGCRINEALALNWSDIDLDNAVVHITKTLNYKQEINSPKSKSSYRDIDIDSRTVTMLKQYRRRQIQEAWKLGRSETVVFSDFIHKYPNNRTLQTRLRTHFKRANVSNIGFHGFRHTHASLLLNTGIPYKELQYRLGHSTLSMTMDIYSHLSKENAKKAVSFFETAINSI >NZ_CP016501|1852209:1867825|1862023_1862218_+|WP_000613926.1|DBSCAN-SWA MIQELNLTPTQTLILFIVLGLIGLLLSRSKPLIEIDLPEDIQSPKPRQNANYGAYIQSQNHYYN >NZ_CP016501|1852209:1867825|1866894_1867074_+|WP_079260833.1|DBSCAN-SWA MVSDLLSDKLIFYITNFKKRLETILEAKSEVLQKISSLQNEQNKKTSTRNAPLVKFKLK >NZ_CP016501|1852209:1867825|1857918_1858206_-|WP_000946250.1|DBSCAN-SWA 
MLRRVNHVKNVLAHGEFAEWIENKIGIHYREANRMMTVAKQIPNVSTLKYLGATAKHVNGVAKRKQNFLSQISLIPTNPQLPHQTIINTYLYWQP >NZ_CP016501|1852209:1867825|1867101_1867464_+|WP_001274639.1|DBSCAN-SWA MSAKEQLKELKPLFALITLFEEQRDKDIKLMNAFRNPELLNGIEKGTAQQLLYLAKERDKRLAMIATLQDERQIAVIKARYVDDLSWDEIPDKLGYSRNTVFKLHREALEVLDEQEERYS |
23 | Streptococcus_phage(76.92%) | integrase | attL 1854788:1854807|attR 1868663:1868682 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
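The keyword rule in the legend above can be sketched in a few lines. This is a minimal illustration only: it assumes a plain case-insensitive substring match on the annotation text, and the names `PHAGE_KEYWORDS` and `matched_keywords` are hypothetical rather than taken from the database's own code.

```python
# Keywords copied from the legend; the matching strategy itself is an assumption.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def matched_keywords(annotation: str) -> list[str]:
    """Return the phage-related keywords found in an annotation string.

    Matching is a case-insensitive substring test, so 'baseplate' would
    match 'plate'. A real pipeline may use a stricter rule.
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in text]


if __name__ == "__main__":
    # The only keyword-annotated protein in DBSCAN-SWA_6 is the integrase.
    print(matched_keywords("integrase"))
    print(matched_keywords("hypothetical protein"))
```

Under this assumption, a protein is colored when `matched_keywords` returns a non-empty list.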
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|
NZ_CP016501.1|WP_000384271.1|1568021_1568348_-|hypothetical-protein |
1568021_1568348_-
Protein sequences of NZ_CP016501.1|WP_000384271.1|1568021_1568348_-|hypothetical-protein >NZ_CP016501.1|WP_000384271.1|1568021_1568348_-|hypothetical-protein MDYDNENYLIPKILLQDDFYSSLSAKDILVYAVLKDRQIEALEKGWIDTDGSIYLNFKLIELAKMFSCSRTTMIDVMQRLEEVNLIERERVDVFYGYSLPYKTYINEV |
108 aa |
63
gnl|BL_ORD_ID|63 information
|
NA | NA | No | NA |
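The reported Acr size can be cross-checked against the position field with simple arithmetic. The sketch below assumes that `end - start` in a `start_end_strand` field gives the gene length in nucleotides including the stop codon. That convention is inferred from this one record, not documented, and the helper names are hypothetical.

```python
def parse_position(field: str) -> tuple[int, int, str]:
    """Split a 'start_end_strand' field such as '1568021_1568348_-'."""
    start, end, strand = field.split("_")
    return int(start), int(end), strand


def expected_protein_length(field: str) -> int:
    """Protein length implied by the coordinates (assumption: the span
    end - start covers the coding sequence plus one stop codon)."""
    start, end, _ = parse_position(field)
    nucleotides = end - start
    if nucleotides % 3 != 0:
        raise ValueError(f"span of {nucleotides} nt is not a whole number of codons")
    return nucleotides // 3 - 1  # drop the stop codon


# Sequence of WP_000384271.1 as listed in the table above.
ACR_SEQ = (
    "MDYDNENYLIPKILLQDDFYSSLSAKDILVYAVLKDRQIEALEKGWIDTDGSIYLNFKLI"
    "ELAKMFSCSRTTMIDVMQRLEEVNLIERERVDVFYGYSLPYKTYINEV"
)

if __name__ == "__main__":
    print(expected_protein_length("1568021_1568348_-"), len(ACR_SEQ))
```

For this record the span is 327 nt, that is 109 codons, which agrees with the 108 aa listed once the stop codon is excluded.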