Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
LR134321 | Shewanella putrefaciens strain NCTC10737 genome assembly, chromosome: 1 | 1 crisprs | cas3,RT,csa3,c2c9_V-U4,DEDDh,DinG,WYL | 0 | 0 | 5 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
LR134321_2 | 2033382-2033647 | Orphan |
NA
Consensus repeat of LR134321_2
|
2 spacers
spacers of LR134321_2
>2.1|2033443|58|LR134321|PILER-CR
CCTGTTGTTTATAATGAGTCGAGCCCGATGAGCTGTTTCTCGTTATAAACACATATAG
>2.2|2033562|61|LR134321|PILER-CR
GCTACCATGCTGCTCCATCCCGCGTCCGATATCTATCTGTTTTACTTTATTATTTTGCTTT |
CRISPR arrays and Neighbor proteins around LR134321_2
The CRISPR arrays of LR134321_2
>merge|LR134321|2|2033382-2033647|PILER-CR
TGCCGAAGCCGGATTTGAACCGACGACCTTCGCCTGTTGTTTATAATGAGTCGAGCCCGATGAGCTGTTTCTCGTTATAAACACATATAGAAAGTGGTTGCGGGAGCCGGATTTGAACCGACGACCTTCGGGTTATGAGCCCGACGAGCTACCATGCTGCTCCATCCCGCGTCCGATATCTATCTGTTTTACTTTATTATTTTGCTTTAAATTGGTTGCGGGAGCAGGATTTGAACCTACGACCTTCGGGTTATGAGCCCGACGAG
>LR134321|2|1|2033382-2033647|PILER-CR
TGCCGAAGCCGGATTTGAACCGACGACCTTCGCCTGTTGTTTATAATGAGTCGAGCCCGAT
GAGCTGTTTCTCGTTATAAACACATATAGAAAGTGGTTGCGGGAGCCGGATTTGAACC
GACGACCTTCGGGTTATGAGCCCGACGAGCTACCATGCTGCTCCATCCCGCGTCCGATATC
TATCTGTTTTACTTTATTATTTTGCTTTAAATTGGTTGCGGGAGCAGGATTTGAACCTACG
ACCTTCGGGTTATGAGCCCGACGAG
>LR134321.1|VEF25716.1|2032187_2032601_+|Uncharacterised-protein MGLILLPLLILWLGVGIYAIRIGYQVLAGTSQLSYTLSVCAIAMVALLLYLYFGFAQFKENKALWAFEIPMFFAANKLAFGVMILGLLLHWFGQGVLTSAYLKPLPFIMIFTVSFGAMAGVILSDTFMAKFEIQKTH >LR134321.1|VEF25715.1|2031431_2031923_+|Spermidine-N(1)-acetyltransferase MEKITIRHAEKRDAAAIQAVYACPNAYTGTLQLPWPSTDKWESRLAATSNHISRYVAEIEGEIVGELGFEVYEQPRRRHVASFGMGVKDSYQGRGVGSALINAMLELTDKWMNIKRIELTVYTDNHAAIGLYKKFGFVIEGESKDFAFRNGEFVSVYHMARIK >LR134321.1|VEF25714.1|2030403_2031003_+|Recombination-protein-RecR MKFSPLLDELIQSLRCLPGVGPKSAQRMAFQLLERDRKAGLKLASALSSAMSDVGHCQSCRTYTEETLCPICASHKRGTSSTICVVETPADVLAIEAGGHFTGRYFVLLGHLSPLDGVGPEELGLALLERHLASGDVAELILATNPTVEGEATAHFIADMARRHKVMISRIAHGVPVGGELEYVDSTTLALSFNGRLPL >LR134321.1|VEF25713.1|2030061_2030391_+|DNA-binding-protein,-YbaB/EbfC-family MFGKGGMGNLMKQAQMMQEKMAKVQEEIARMEMVGESGAGLVKVTMTGAHTVRKVEIDPSLMEDDKEMLEDLIAAACNDAARRIEENQKTKMAEVTGGMQLPPGMKMPF >LR134321.1|VEF25712.1|2026505_2029838_+|DNA-polymerase-III-subunit-tau MSYQVLARKWRPATFDQMVGQSHVLHALTNALSQQRLHHAYLFTGTRGVGKTSLARLFAKGLNCEQGVTATPCGACGSCVEIAQGRFVDLIEVDAASRTKVDDTRELLDNVQYRPTRGRFKVYLIDEVHMLSRSSFNALLKTLEEPPEHVKFLLATTDPQKLPVTVLSRCLQFNLKSLTQGEISTQLNHILTQEQLPFEAEALKLLAKAANGSMRDALSLTDQAIAFGGGNVMLTQVQTMLGSIDEQHVVALLKALTDADIGVLMHTCAQVLAYGADAQEVLRSLLELLHQITLTQFAPAAAQQSLYSAQIRAFAEQLTPEQVQLYYQILLTGRKDLPHAPDPKSGLEMALLRAVAFVPEKPVKRWQPDAVAEIRLPEGQTPVAATAAAQAPHVNEQQFAETHTAEPHTTQTVPAEKKTALIEQTSANQQVETAQADAVDMADVSIEPADTSEQAFDAELEADAALIAEQAVILSQAQSQGFNADSLNTDIVNAEVEPQTQTVSTAQAPSIHISEPVADIEASVASLDAHNSEVPELIGAAAESSIVNPAMAETLVDSTSAVNNSAAENTLDNNPTANNTLEQKGLDEHSPYGTEAEHSYPAMGAYAQESAPLDSYQDAYVEFSSGSYNDDDFHHSFDSHAVPDDVQHSANVQHSDSVQHSADIQSAASIQSAVGMQGSVDSQSAAMAQVSPQTPTPISIPTSASASLADDDILSAVLAARDSLLSDLDALSVKDGDEKKSSLDSKLKTANSKANGVALSKPVSKSDASQTSASFSVDPAADLDPDLTIDFDDDFDLDLEPMALHQSVPSAASASGSVPAEPVKPAAYDRPPWEAAPEVASTAELIETTQVDSGNTDIADAVMAQGSDINDDSVSSGQGSLETSNTANDVQNNAADGNDSQSAEQGSGSKSQALQTQARHQDKVQATALKPASTTALTQTASAVERTQEREVALSTLPISGHPLDLHWYKLMASLEVGGRVRQLAVNSVCQTQSDPLPLLLKPNQKHLAADVAIVQLEQALSAALGNPR
RVQVVIGIDAQRETPLELRKRFHQELLQQAHQSLIHDDNVQWLIQRMGAELDADSLVYPPELLNLRSQQIQALPELTEAAS >LR134321.1|VEF25711.1|2025832_2026387_+|Adenine-phosphoribosyltransferase MMAMNTETLSLIKQSIKTIPNYPKEGILFRDVTSLLENAAAYKATIDLLVEHYRGQGFTKIVGTEARGFLFGAPLALELGVGFVPVRKPGKLPRATISQSYELEYGHDSLEIHTDAINPNDKVLVVDDLLATGGTIEATVKLIRQLGGEVKHAAFVISLPDLGGEARLTALGLELVKLCEFEGE >LR134321.1|VEF25710.1|2025199_2025568_+|Inner-membrane-protein-ybaN MVLKRGLFLLLGLTALALGLLGIVLPLLPTVPFILLAAFCFARSSERLHHWLMTHPWFADALTQWQEQRAMRKGLKRKAMVVSALSFSVSIIVVPIVWVKGLLLVMALVLLWYLKGIPEIEG >LR134321.1|VEF25709.1|2024995_2025118_+|Uncharacterised-protein MSSQPFKDPFNFLYFIGFILVLLLPTLPASLSWLKHLGLI >LR134321.1|VEF25708.1|2024098_2024845_+|Uncharacterized-protein-conserved-in-bacteria MDFKQVQQSFIDYIRDPSRPLPADTDVRRMQVYRELFFNNVLGFVSNGFPVLKSLYSEEEWLALVQSFFSQHDCQSPIFIDIAGEFLDFLQQEYQPTANDPVFMLELAHYEWLELAVAVAQASGDESQLSPAQIPMQALCLSKTARVAQYHFEVQHIRHDYRPQQQLDTPVFFCLYQDADCEVCFLQLNPLSAQVLAFLQAQGQASFKEILDWLTITYPKMAPEIIAQGCTQLLEQLVAKGIVRGRQS >LR134321.1|VEF25707.1|2023229_2024099_+|Protein-of-uncharacterised-function-(DUF692) MNDQKNAAQVGLGLRREMLSEFCESVPEAINFFEVAPENWMTLGGKFGRQFRELTEQHTFYCHGLSLSIGSPEPLDLAFVKNIKTFMDLHQIQVYSEHLSYCSGQGHLYDLMPIPFTDAAVKHVAARVKQVEDILERPLILENVSFYAAPGAHMSELEFVNAVLQEADCKLLLDVNNIYVNSINHQYDADTFLQAMPTERIAYLHIAGHYKQAEDLLIDTHGAAINDPVWALLQRCYALHGVKPTLLERDFNVPTTAELLLELNQIHAYQAAAPSFHIQNRSHVVKRIA >LR134321.1|VEF25717.1|2033927_2036057_+|Ribosomal-RNA-large-subunit-methyltransferase-L 
MLNFFAAAPKGFEYSLAQELTEFGATEVKESVAGVYFTASLALAYRITLWTRLASRIVLVIYKGSCESAEQLYNAAYCVDWPAHFSNKSTFSIDFHGTGGFLNNTQFGALKIKDAIVDRFRDDDIERPNVSRVDAEFKVDAHFRNGVITIAMNFSGPSLHQRGYRSTTGEAPLKENLAANMLVRSGWQAAPSTLLDPFCGSGTVLIEAALMAADIAPGLQRSRFGFEHWRRHDKAVWQEIVEEAKARASLGVKRCEIKFYGSDIDSRLVALAKRNAENAGVLELIEFQVADALTIAPPAESGYLITNPPYGERLGNVSELLQLYYQLGDKFKKEFGGWKVAMLCSDIELVSSLKLKADKQMKMFNGALECAFNIYTLHANSTRRDTPVLPDGVDIADIAPAFANRIKKNAKLLEKWAKKEGIDSYRIYDADIPEYNVAVDKYLDYVIIQEYMAPATIPEAVTKRRLSDVLLALPSAIGINPNKMIMKTRERQKGTSQYQKLDERKLELITTEYGAKFKLNLTGYLDTGLFLDHRLTRRLVGQKSKGRRVLNLFSYTGSASVHAALGGAKSVTTVDMSNTYIAWAKDNFALNGLQGKQYEFVQSDCMQWIRDCNEQYDLIFIDPPTFSNSKRMEDSFDVQRDHVNLLSSLVKLLSPTGELVFSNNKRKFKMDIETLTKMNINVTNIDDMTLPMDYKRNPHIHNTWLITHA >LR134321.1|VEF25718.1|2036049_2036304_+|Glutaredoxin-like-domain-(DUF836) MPELTQAERDNRTYLLYHTDGCHLCELAAALLDAADIGYRAIDICDDEYLAQRYGVSIPVLKAWDDRELHWPFNATQLQEFTGA >LR134321.1|VEF25719.1|2036305_2038231_+|Uncharacterized-ABC-transporter-ATP-binding-protein-Rv2477c/MT2552 MSLVRINSGSLAYGYTPLLQKADFTIQRGERVCIVGRNGAGKSSLLKVLSGDVLLDEGEFNIAGNVSVSRLQQDPPKAEQGTVYAYIAAGLKEVGEALERYHQLSHDVAHADPEQMDRMLNEMQGLQETLDHYNGWQLDSRIQQNCELLGLDPDKSLSELSGGWQRKVALARALVSEPDLLLLDEPTNHLDIDTIEWLEKFLLDYQGAIVFISHDRGFIARMATRIVDLDRGVVTSWPGNYQMYLDGKQEWLRVEAEKNALFDKRLADEEVWIRQGVKARRTRNEGRVRALKALRDERSERLNRQGNAKMAVSDTERSGKLVFDVQDLNFNLPDKNLVKNFNTTVIRGDRIALIGPNGCGKSTLIKLLIEKLQPQSGEIKVGTKLEIAYFDQYREALDPEQTVEDNVGEGKKTITINGQDRHILSYLQDFLFSPMRARTPVKALSGGEKNRLLLAKLLIRPANLIILDEPTNDLDIETLELLESLLTEYQGTLLLVSHDRAFIDNTVTSSWWYAGNGHWSEYVGGYQDAVNQGAKFYSEEPSSQKAVEAPAVETKTVEVKAAEPAKAVKKLSYKLQRELESLPTVMEQLEADILALQTTIGHSDFYSQAQDKVNQVLSQLADKEKQLEVCFERWEELESLK >LR134321.1|VEF25720.1|2038259_2040092_+|probable-extracellular-repeat,-HAF-family 
MKLQLDKALSLVALGVLGVLNSAHAAPVYEIVNIDSYDLQGTLEGTRSGYALGVNANDELVGISKGKKKLSSSDVEGGVIDIADGIAPEETITYSIEKAIIANNFTFVAAQNGAAGAWLPTFDSINGTTPSSDTAVINSVDTFYYGINDAGIKVGSMTAPEKKTENTATANVADDYWYYRDYEFRGVAKSGSTEIPLVPPYTQFVNADKTKTVELGGWSAATAINNNNLVAGYASTAISKYGSDRVNYCLGTDNTLPLDICVQREQYPNSTGTRNIQYQTRAYVWQIDNDVATGIELPLGLTPATDNTLTFTAQALGLNNNGVVVGRSHVYRNNNTDKLRQDAAYWSKDTEGKYQYHWIPMGDSISSSIAYDINDSGILVGSYRSYIQGYLRDKFFVFDTNTPEVAYVTPNDFASTTTDLSSKPKDINNKGQVVGYIETTYDKEKPRPKAGFLYEKSTGEFNNLNKLLTCESKGYEKASDGSWARHQVEVQDGSGKILQYNADILVVEGSSINEDGTIVGTAFIRKPSYQFDKDGNIVIGENGLPLFELSGSGDPVTAYIPRMVVLKPASSGEACTVEDNTDTGNFERSGAATLAWLFALPLVWFRRRIR >LR134321.1|VEF25721.1|2040279_2040456_+|Ribosome-modulation-factor MKRQKRDRLDRAFSKGFQAGVGGRSKELCPYANLDSRSQWLGGWREGVDGRVNGLFNK >LR134321.1|VEF25722.1|2040579_2041095_-|3-hydroxydecanoyl-[acyl-carrier-protein]-dehydratase MNKANSFNKEELIACGHGKLFGPNSPRLPVDNMLMMDRIVTINDNGGEFGKGEIVAELDINPDLWFFDCHFITDPVMPGCLGLDAMWQLVGFYLGWEGAEGKGRALGVGEVKFTGQVLPGAKKVTYKLNIKRTIHRKLVMGIADAILEVDGRQIYSATDLKVGVFSDTSTF >LR134321.1|VEF25723.1|2041169_2042903_-|ATP-dependent-protease-Lon MNSLLIPTSSLAPEFSLPTLSETAANLSALLLGQERTVDAFKLHQAIVDQHLYLADFPSIDRQLMIQACIDSLAPLSPAYLVATRPIDKVVTFQWQSDKPQENQGSIAEKTTHYRYLSGNIRRADLIGRMAQTGTTSQYQAGALAQCHYVFICAESLWKREGLWDLVMQILTHKEYQISNNLSPIPLNCKIVLVGAGMIYSQVRTEDWQFQRHFTLLAELASEIDLVRYKESQYVAWLQAVAQSVDVTLEQSSLAPLFRYSARLTEHQRRLSLSMLEFAQLMMQAKAYRGKPSINASSIEHALTQANYRHNSSEEYSGQSFDDNFINLPTSGAMVGQINGLTVIDAIDYSYGEPARITASVHYGDGEVADIERKSELGGNIHAKGMMILSACLYRIFGRDAPLHLNANIVFEQSYQEIDGDSASLAEYCCLMSAIAEQPIIQSLAITGALDQFGNVQAIGGINEKIEGFFNLCERRGLTGEQGVIMPKSNVLQLNLNPKVITAVGKGLFHIYAIEHMDQAVELLMQMPAGVADEDNDFPHDSLYGLVQERLDKLAGNGEEEIGLITRLLAKLGFFRR >LR134321.1|VEF25724.1|2043748_2045746_+|Methyl-accepting-chemotaxis-protein-mcpC 
MLLPTRISQRLTMGSIVLLLLTGLAVFGVMSLRGQPRVVAASQELIEQTGSSIVRQLALKLASIEGITLSLAHLAEVLPHEQALYLNSLPNLIDNNGDLTIAGGGIWPEPDKFVAGSTRHSFFWARGSDNLLAYSDDYNAQSGPRYHNDPWYTGARTSSVSQCLWSEAYQDTVTKVAMVTCSVPYRLTGTFAGVATIDLKLDDLAKFLTEHGNVTDGYAFALDQAGNILHFPEANKSDTMLRFTDLVGQLPWLAPVEAALKQTSINNVISVSLDKDGRLNQASEVSLFVMPDTGWVIGLVTPKTRVTGLARELTFDILLFLLPLLAILLYLAWLSGKKLLAQLEETTDQIASLGSGNLGANVELQIQRDDEIGALRRAVNTYAGTLRAMLDLITQEAKKVQFEATQLSGLSHRLAQRAEQQRLESAQLAAAITQMSSSAMEVANNTNDCAATAQSSLIVVREGQTRVASSNASIEALAGEIANATKVISQLAQDSQKVGAVLDVIKAISEQTNLLALNAAIEAARAGDQGRGFAVVADEVRTLAGRTQDSANEISTMINALQSASRLAVQAMQTGESRTVHAVAEAEGAASSLSSTVQSFDDISQRAQQIALAAQQQSQVTQEINELAVRINSISEDNSLDATALDALSLEMQKLSDRLININRA >LR134321.1|VEF25725.1|2046130_2046283_+|Uncharacterised-protein MMAFSVYAVITLGFSSQHFIFNAIQRLINYLPIYLFTYLPIYLFTYLHLS >LR134321.1|VEF25726.1|2047092_2047737_+|Response-regulator-uvrY MISIYLVDDHELVRTGNRRILEDERGIKVVGEAPDGETAVQWARQNEADVILMDMNMPGMGGLEATRKILRYQPHARIIVLTVHTEDPFPSKVMQAGASGYLTKGATPPEVLQAIRQVSRGQRYLSPEIAQQMALSQFNPADENPFKSLSERELQIMLMITNGEKVNDISEQLNLSPKTVNSYRYRLFAKLGISGDVELTRLAIRYKMLDAGHF |
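The spacer records listed for LR134321_2 use pipe-delimited FASTA headers such as `>2.1|2033443|58|LR134321|PILER-CR`. The following is a minimal Python sketch of how such records could be parsed and sanity-checked. The field meanings (spacer ID, reported position, length, contig, detection tool) are inferred from the values on this page and are an assumption, not a documented format; the names `SpacerRecord` and `parse_spacer_fasta` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class SpacerRecord:
    spacer_id: str        # e.g. "2.1" (assumed: array index, then spacer index)
    position: int         # assumed: coordinate reported by the detection tool
    declared_length: int  # third header field, assumed to be the spacer length
    contig: str           # e.g. "LR134321"
    tool: str             # e.g. "PILER-CR"
    sequence: str


def parse_spacer_fasta(text: str) -> list[SpacerRecord]:
    """Parse spacer records; works on line-broken or flattened FASTA text."""
    records = []
    # Everything before the first ">" is ignored; each later chunk is one record.
    for block in text.split(">")[1:]:
        tokens = block.split()  # header first, then sequence pieces
        if not tokens:
            continue
        spacer_id, position, length, contig, tool = tokens[0].split("|")
        records.append(
            SpacerRecord(
                spacer_id=spacer_id,
                position=int(position),
                declared_length=int(length),
                contig=contig,
                tool=tool,
                sequence="".join(tokens[1:]),
            )
        )
    return records


# The two spacers reported for LR134321_2, copied from the record above.
SPACER_FASTA = """\
>2.1|2033443|58|LR134321|PILER-CR
CCTGTTGTTTATAATGAGTCGAGCCCGATGAGCTGTTTCTCGTTATAAACACATATAG
>2.2|2033562|61|LR134321|PILER-CR
GCTACCATGCTGCTCCATCCCGCGTCCGATATCTATCTGTTTTACTTTATTATTTTGCTTT
"""

spacers = parse_spacer_fasta(SPACER_FASTA)
for s in spacers:
    # Compare the length declared in the header with the actual sequence length.
    print(s.spacer_id, s.declared_length, len(s.sequence))
```

Splitting on `>` and then on whitespace keeps the parser tolerant of the flattened single-line layout that often results from copying these pages. Only the length field is checked here, because the relationship between the reported position and the array coordinates is not stated on the page.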
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
2272084 : 2282384
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >LR134321|2272084:2282384|DBSCAN-SWA TATGAAGCTTCCTATCTATTTAGATTATGCCGCCACCACGCCTGTTGATCCAAGAGTCGCGGAAAAGATGTTCCAATGCATGACAATGGACGGCATCTTTGGTAATCCAGCCTCACGTTCACATCGCTACGGCTGGCAAGCTGAAGAAGCGGTGGATATTGCTCGCAATCAAATTGCCGAGTTAATTAACGCCGATCACCGCGAAATTGTGTTTACCTCTGGCGCGACGGAATCGAACAACTTAGCGATTAAGGGTGTTGCTCATTTCTATCACAAGAAAGGCAAGCACATCATCACCAGTAAGACTGAACATAAAGCCGTTCTTGATACCTGTCGCCAATTAGAGCGCGAAGGTTTCGAAGTGACTTATTTAGAGCCTGCGGCTAACGGTATCATTCCGATGGAACGTTTAGAAGCGGCAATGCGTGACGACACTATCCTTGTCAGCATCATGCACGTAAACAACGAAATCGGTGTGATCCATGATATCGATGCAATCGGTGAGCTATGCCGTTCAAAGGGCATTATTTTCCACATGGATGCAGCGCAAAGTGCAGGCAAAGTGCCTATCGATGTGCAAGCCACTAAAGTGGATTTGATCTCGATTTCTGGTCACAAAATGTATGGTCCTAAAGGTATCGGCGCGCTATATGTTCGTCGTAAGCCACGCATTCGCTTAGAAGCGCAAATGCACGGTGGTGGCCATGAGCGTGGTATGCGTAGCGGTACGTTACCAACGCATCAAATCGTAGGTTTAGGTGAAGCTGCTGCAATCGCAAAAGCGGAAATGGCCTCTGATGACGCTCGTATCGGTGCATTACGTGACAAATTATGGAATGGCATCAAGCACATTGAAGAAACCTATATCAATGGTGATGCGATTGAGCGCGTTAGCGGTAGCCTCAACGTCAGTTTCAACTATGTTGAAGGCGAGTCGTTAATGATGGCACTGAAAGATTTAGCGGTTTCATCGGGTTCGGCTTGTACCTCAGCCAGCCTAGAGCCTAGCTACGTATTACGTGCGCTTGGCTTAAACGATGAAATGGCACATAGCTCGATTCGTTTCTCTATCGGCCGTTTCACAACCGAAGAAGAAATTGATCACGCTATCGAAGTAATTACTCAATCTATTGATAAATTAAGAGAAATGTCTCCTTTGTGGGAAATGTTTAAAGATGGAATCGACCTGAACCAAGTTCAATGGGCACATCATTAATTCTGACCTTTACAATAGATAATTTTGGAGCAGTACCATGGCTTACAGTGAAAAAGTGATAGATCATTATGAGAACCCACGTAACGTTGGTTCTTTTGATAAAAACGATCCTTCGGTCGTGACCGGTATGGTAGGCGCGCCTGCTTGCGGTGACGTGATGAAACTTCAGTTGAAAATTGGTGCTGATGGCATCATTCAAGACGCTAAGTTCAAAACTTACGGTTGTGGTAGCGCGATTGCGTCTAGCTCACTGGTAACTGAGTGGGTTAAAGGCAAGACTATCGAACAAGCTGCGGCGATTAAGAACACAGATATCGCTGAAGAATTGGCATTGCCACCGGTGAAGATCCACTGTTCAATTTTGGCTGAAGATGCCATTAAAGCTGCTATCGACGAGTACAAATCGAAACAAGCTAAGTAACATCTGGAGTTAAGATGGCGATTACAATGACCCCAGCGGCAGCCGATCGTGTCAGATCTTTCTTAGTTAACCGAGGCAAAGGTGTAGGCCTGCGTCTCGGCTTAAGAACATCAGGCTGTTCTGGTATGGCTTATGTACTTGAGTTCGTTGATTCTTTAAATGATGACGATGAAGTGTTTGATATCGAAGATGTGAAAATCATCATCGATGCTAAGAGCCTCATCTATCTTCAAGGGATCGAGCTGGATTTTGTTAAAGAAGGGCTGAATGAAGGCTTCCAATTTAACAA
TCCTAACGCAAAAGGTGAGTGTGGTTGCGGTGAGAGTTTCACTGTTTAACCGCTAAAGTTGACGCATGAATTATTTCGAGCTGTTTAAATTTTCCCCTGCCTTCGATATTGATACCGCCTTACTTGCAGAACGCTATCGCGAACTGCAACGGGCGGTTCATCCCGATAAATTTGCCAATGATACTGAGCAGCAAAAATTGCTGTCGGTGCAGCGCACGGCGCAAGTCAATGATGGTTTTCAAACCTTAAAAGATCCCATTCGCCGCGCCGAGCATATGCTGTCGCTGCGTGGTATTGAACTAAGCCATGAAACCACGACAGTGAAAGATACTGGCTTCTTGATGCAGCAAATGGAATGGCGTGAGGCGTTGGAAGACATACGCGATAGTGCCGATCCTCAAGCGAGTATTGACGAGCTATATCAATCGTTTGCGCAGTACCGCGCGCAACTTACTCAGCAATTGACTCAACTGTTAACCAGTGAGCAGGCTGAAGATGCATTGTTAGCGGCGGATCAAGTTCGCAAACTCAAATTTATGGCAAAATTACACGACGAGTTGACCAGAGTCGAAGACGCTCTGTTAGATTGATTTCCGTGTTTATGTTACTTTGCTGACTCAAGCACGCTTGAGTTGGCATTACTAAGTTGGATATATATGGCCCTTTTGCAGATAGCTGAGCCCGGTCAAAGTGCCGCGCCGCACCAACATAGACTTGCCGTTGGCATCGATTTAGGTACGACCAATTCTTTGGTAGCGGCTGTTCGAAGTGGAGAGACGGCAACCCTGCCGGACGAACTTGGACAGCATTCATTACCTTCTATCGTGCGTTATACCCAAGATTCTGTCGAAGTCGGCGCCCTTGCGGCGTTGAGTTCGGCGCAGGATCCGCAAAACACGATCGTTTCGGTTAAGCGTTTTATGGGCCGCAGCCTGGCTGATATCAAAGCCGGCGAGCAATCATTCCCCTACGAGTTTGCCGAAAGCGAAAACGGTTTACCTTTATTCGTGACTCCCCAAGGCCAAGTAAATCCCGTGCAAGTGTCTGCGGAGATTTTACGTCCGCTGATTGCCCGTGCTGAAAAAACCTTAGGTGGCGAGCTGCAAGGTGTGGTGATAACCGTACCGGCCTATTTTGATGATGCCCAGCGCCAAGGCACGAAAGATGCCGCAGCTTTGCTGGGTGTTAAAGTGCTACGTCTGTTGAATGAACCGACGGCTGCTGCGATTGCCTACGGCTTAGACTCTAAGCAAGAGGGCGTGATTGCCATCTATGACTTAGGCGGCGGTACCTTCGATATTTCTATTTTGCGTTTGAATCGAGGTGTATTCGAAGTGTTAGCCACCGGTGGTGATTCAGCGCTTGGAGGCGATGATTTCGATCATTTACTGCAAGCACATATGCAACAAGTGTGGCAGCTTAGCGACATCGACTCACAATTAAGCCGTCAACTGTTGATTGAATCGCGTCGAGTCAAAGAAGCCTTAACGGATGCAGCAGAAACTGAAGCAAAAGTGATCCTTGCCGATGGGACTGAGCTCACGCAAATCGTTACTAAAGCTGAATTTGATGCCATGATTGCGGCGTTGGTTAAGAAGACCATTGCTAGCTGTCGCCGTACCCTGCGTGATGCGGGCGTAACGACGGATGAAGTGCTTGAAACTGTGATGGTTGGCGGTTCAACGCGCGTGCCATTAGTGCGTGAACAGGTTGAAGCTTTCTTCGGTAAATCACCACTGACGTCTATTGATCCCGATCGTGTTGTCGCCATTGGTGCGGCCATTCAAGCCGACATTTTAGTGGGTAATAAACCTGAATCTGATTTGCTGCTCCTCGATGTTATCCCTTTGTCATTAGGCATAGAAACCATGGGCGGCTTAGTGGAAAAAGTGGTGTCGCGTAATACGACGATTCCGGTTGCGCGAGCACAGGAATTTACTACCTTCAAAGATGGTCAAACGGCCATGGCATTCCATGTGGTGCAGGGCGAGCGTGAGCTC
GTTGCCGATTGTCGCTCACTGGCGCGTTTTACCTTAAAAGGTATTCCGCCGTTAGCCGCAGGCGCTGCGCACATTCGTGTGACTTTCCAAGTGGATGCCGATGGTTTACTCAGCGTGACCGCGATGGAGAAATCCACCGGCGTGCAATCTAGCATTCAAGTTAAGCCATCTTTTGGTTTATCGGATACTGAAATTGCCACTATGCTGAAAGATTCGATGAAGTATGCCAAAGACGATATCGGTCGTCGTATGCTTGCCGAACAGCAAGTGGAGGCGGCGAGGGTACTCGAGTCTTTACATGCCGCGTTAGCGAAAGATGGCGACTTGCTGAATGCCGATGAGCGTGGACAAATCGATGCCACTATGGCCAATGTGGCGCAAGTTGCCGCGGGCGATGATGCCGATGCGATTAAGCTCGCAATTGAAAAACTGGATGAGCAAACCCAAGATTTTGCTGCCAGACGTATGGACAATTCTATTCGAGTGGCATTCAAAGGCCAGTCGATCGACAACATATAGGTGATGTAATGCCCCAATTAGTCTTTCTTCCCCATGCCGAGTTGTGCCCAGATGGCGCAGTGCTTGAGGCGAATGTTGGTGAGACCATTCTTGATGTTGCGCTGCGTAATGGCATCAATATCGAGCATGCATGTGAAAAGTCCTGTGCTTGCACGACGTGTCACTGTATCGTGCGTGAAGGCTTTAATGATCTTGAGCCAAGTGATGAGCTGGAAGATGACATGTTAGACAAAGCTTGGGGACTTGAGCCCGAAAGCCGTCTATCTTGTCAGGCGAAAGTGGTCGACACTGACATGGTGATTGAGATCCCTAAATACACGGTTAACATGGTCAGCGAAGGCAATTAATGCTGCCGCGATAATTGTTAAAACCAGTCCACGTGACTGGTTTTTTGCTTTTGGCGTTGAGCATTTGTGCAACTAATCTTGTCTATGCCTCTAGTTGTTTGATTTATCACTGCGCAGGCGTATTATGCGCTCAAAATTAATGGTTGCCGGAGACCAGAATGAGAATAGCGATCTTATCGCAAGGGCCTGAGTTATATTCAACGAAGCGACTCGTTGAAGCGGCTCAATTACGTGGTCATGAAGTACACGTAATCAATCCCCTTGAATGTTATATGAATATCAACATGCGGCAATCGAGTATCCACATTGGTGGGCGAGAGTTACCCGCATTTGATGCTGTGATCCCCCGCATTGGGGCGTCGATTACCTTTTATGGCTCTGCAGTTTTGCGTCAGTTTGAGATGATGGGCGTATATGCTTTAAATGATTCTGTTGGGATCTCGCGCTCACGTGACAAATTGCGTTCTATGCAGTTGATGTCACGCCGCGGTATTGGCTTACCCATTACTGGTTTTGCCAATAAACCGAGTGATATCCCCGACTTGATTGATATGGTGGGTGGCGCGCCTCTGGTGATTAAATTACTCGAAGGCACCCAAGGTATTGGCGTGGTATTAGCCGAGACCCGTAAAGCGGCTGAGAGTGTGATTGAAGCCTTTATGGGCCTTAAAGCCAACATCATGGTGCAGGAATATATTAAAGAAGCCAATGGCGCAGACATTCGCTGCTTTGTGCTCGGCGATAAAGTGATTGCCGCCATGAAGCGTCAAGCTATGCCCGGTGAGTTCCGCTCTAACTTGCACCGTGGTGGCACGGCAAGTTTAGTCAAATTAACCCCAGAGGAACGCTCAGTCGCCATTCGTGCGGCTAAGACCATGGGGCTCAATGTCGCAGGTGTTGACTTGCTACGCTCAAACCACGGTCCTGTGATCATGGAAGTGAACTCTTCACCCGGTCTTGAAGGAATTGAAGGCGCCACCACGAAAGATGTGGCGGGTGCTATTATTGATTTTGTTGAAAAAAATGCGATTAAAGTGAAAAAGGTGACGCAAGCACAGGGTTAGTGCCCTTGCTGTCGAGTATTGACTACGGAATATCAACACTTGATTCACCTCCTTATGGGTAACGTCTTTT
AGCAATCGCTTACCCATTAAAGCGAACCGAATAGGGGAAAACGATGACATTAACCTGGATAGATTCGTTAGATATTGCGTTGGAATTACTCGAAGCACACCCAGAGGTGAATCCTACTCAGCTGCATTTCACCCAATTGTATGAGTGGGTGTTAGCCCTGGATGATTTTGCCGATGATCCCAAACATTGTAATGAAAAAATACTTGAAGCGATTCAGCAGTGTTGGATTGATGAAAAGTAGTGAGCACAGCGGTGATACGGCTTTTTATGTGAGGCATCTTGTCCTTTTGACGTTTAACTAAGGTGATTTACCTCACGTATTTGCAGTGGCGCAGAGAATATTTGGCGAGAGTTTGATCACAGTCAAGTTCATTTTTAGCGGATATCGTACCTTGTTGTATATTGGTTAAGGCATTTAAATGGCTTAACTTTGTTATCGCAACTGCAAGGATGTTCACATGAAACTTTTGCTTGTATTGTTATGTATGCTTGTCTCTTCAATGGCTTGGGCTCATCCAGGGCATGGCGGTGTCGGTTTATTCCACCACTTGTTAGATCTTGCTCCCGCAGTCATTTTAGTCGCTTTGATTGCTTGGGCAGGTATTTGGGCCAAAAATAGAAAATAATCCCATTTTTTAATGAGTTAATTAACCCTGCTAAGGATTTATCCTTCTAGCGGGGTCTTTTGCGTGATCAAGCTGATTACGAATAAAGCGAATGTGCGCAATATCTCAGACTAATTTGCTATAAAGATAGGATTTAGCTTTTAAATTTCGTATAATCCGCGCGAATTTTTAAACTTGCTGACAAAGGAAGCTAGTTATGGCGATCGAACGCACATTTTCTATTATCAAACCTGATGCTGTTGCAAAAAACCACATCGGTGCTATCTACAATCGTTTTGAAACTGCTGGTCTTAAGATCGTTGCATCAAAAATGTTACACCTGACTAAAGAACAAGCTGAAGGTTTCTATGCTGAGCATAGCGAGCGCGGCTTCTTCGGTGCTCTGGTTGCATTCATGACTTCTGGTCCTATCATGGTCCAAGTATTAGAAGGCGAAAACGCTGTTTTAGCTCACCGTGAAATTTTAGGTGCAACTAACCCAGCTCAAGCGGCTCCTGGTACTATCCGTGCTGATTTCGCTGAAAGCATTGACGAAAACGCGGCTCACGGCTCTGATGCTGTTGAATCTGCTGCTCGTGAAATTGCTTACTTCTTCAGCGCTGAAGAACTGTGTCCACGCACTCGTTAATTCGATGCTAAGACTGAAAAAGGAGCCCTAGGCTCCTTTTTTATTGGCTGCAATTTAGCGAATACAGCTTGCCGTGCGCAATGAATCTAATGCCGCGATACCAATCACGTCTCGCTTCAGCGTTCTAAAGTTGAATTAGCGTTTGTTTTTATTGGTCTTTTAAAGTTCTGCACATTTTTTTAATTTTCTTGCTTGAAATTTTCTCTCCCGTCCTTATATCTATTTCAGGATCGCTCGATGAGGGTCCTAGTGTGAATTCTGTGCCGTAAGGACAGAAGATTTAATTCGGCAGACTCTACAACCGTTTTCATCCTGAAAACAGTCCGCCAATTCTCCATATTGCTTAAGACAAGGATATGAATATGCGTACTTATGATTTAACACCGCTTTACCGCAGTGCCATTGGTTTTGACCGTTTAGCGCAATTGGCAGAACATGCCGCCGCCAATAATGGCAACTCAGGTTATCCTCCATATAACATCGAATTACTCGGCGAAAATCGTTATCGCATAACGATGGCCGTAGCCGGATTCTCTATGGATGAACTTGAGATCAGCAGTGAAGGTGAAAAATTACTGGTGAAAGGCAATAAAGCCGAAAGCCAAACCGAGCGTAAATACCTGTATCAAGGCATAGCTGAACGCGGTTTTGAGCGTACTTTCCAACTGGCAGACTATGTCACAGTGCTTGGTGCAAGTTTAGAAAATGGCTTACTCAATGTTGATTTAGTGCGTGAAATTCCTGAA
GCATTGAAACCACGCAAAATTGAGATCACCTCATCGCGCTTGTTAGACAGCCAGTCATAATTGACGCTGTGATAACCGCCTAAAACCTAAGGGGAGCATTATGCTCCCCTTGATGTTTTTGGTGTTTACGAGAAAAATGTCCGCGAGATTTCCGTGGGAAAGCTAGGCTTTCATGGCCTTTTCGCCGCGAGCCAAACCCACCACGCCGGAGCGAGAGACTTCGACCAGTTTGGTGACTTCTCCCAGCGCATGAATAAACGCATCGAGTTTATCTGACGTGCCCACCATTTGGATGGTGTATAAATTCGCCGTGACATCGACAATCTGACCACGGAAAATATCCGCCATACGTTTCACTTCTTCCCTAAATTCACCTTGGGCTCGTACCTTAACAAGGGCAAGCTCGCGTTCAATATAAGCAGACTCAGTAATATTCGATACTTTTAACACATCGATTAACTTATGCAGCTGCTTCTCTATCTGCTCTAACACCATTTCATCGGCAACGACTGTGATGTTAAGCCGCGACAGGGTGATATCGTCCGTTGGCGCCACTGTTAAACTTTCAATGTTATAACCGCGCTGTGAAAACAAACCGACCACACGAGACAGGGCGCCGGATTGGTTTTCTAATAATACAGATATAATTCGACGCATTAGCATTTCTCCGTCTTGGTCAGCCACATTTCATTCATCGCACCACCGCGGATCAACATGGGGTACACGTGCTCAGTTTCATCCACCATGATATCGATAAAGACCAGCCTGTCTTTCATCGCCAGTGCTTCAGCCATTTTTGACTCAAGTTCGGCGGGATCGCTGATCGTCATGCCAACATGACCATAGGCCTCGGCAATTTTGGCGAAGTTAGGCACTGAATCCATATAGGAATGTGAGTGACGGCCAGAGTAAATCATGTCCTGCCATTGTTTTACCATGCCAAGGAAACGATTGTTTAAGTTGATGATTTTTACCGGCGTATCATATTGCAGCGCGGTCGAAAGCTCTTGAATGTTCATCTGGATGGAACCATCACCCGTGACACACACAACGGTGGCATCAGGCATGGCCATTTTTACGCCCATCGCGGCGGGCAGGCCGAATCCCATCGTCCCTAGGCCACCAGAGTTGATCCAGCGACGCGGCTTATCGAACGGATAGTACAAGGCGGCAAACATTTGGTGCTGGCCAACATCTGATGCGACATAGGCATCGCCATTGGTTAGCTTATACAAGGTTTCAATCACTTGCTGTGGTTTGATGCGGCCACTGTTTTTGTCATAGGCCAGACAATCGCGGCTGCGCCATTGTTCAATGTCATTCCACCAAGTTGCAATGGCATCGTTGTCGTTATGTTCGTTCGATTCATCTAACAGCGCCAACATGCTGTCTAAAATACTGTCGGCCGAACCCACGATAGGAATATCCACGTGGATAGTTTTCGAAATAGAAGAGGGATCGATATCTATGTGCAAAATGGTCGCGTTCGGACAGTATTTTTCCACATTATTGGTGGTTCTATCGTCGAAGCGTACACCAATACCGAAGATCAAATCGCAGTTATGCATTGTCATGTTGGCTTCGTAGCGACCGTGCATCCCAAGCATGCCTAAACTGTTTTTGTGGGTGCTTGGAAAGGCGCCAAGCCCCATCAGCGTGCTGACGACGGGAATGTTTAATCTCTCAGCTAATTGCAGAATTTGCTTATCACAACCTGAGATAATTGCGCCGCCACCGACATAAAGTACCGGTTTTTTCGCGGCTAATAGAGCTTGAAGACCACGGCGGATCTGACCTTTATGGCCTGATGTCGTTGGATTATACGAGCGCATTTTGACGCTTTCTGGGTAGCAATATTCGTGCAGCAGAGCGGGGTTTAAACAATCTTTTGGCAGATCGACAACCACAGGGCCGGGGCGGCCAGTAGAGGCAATGTAAAAAGCCTTTTTAATAATTTCAGGAATTTCAGTCGGATCTTTAACTAAAAAGCTGTGTTTCAC
TACAGGGCGAGAGATACCTATCATGTCGCATTCTTGGAAGGCATCGTTGCCGATAAGATTGCTCGGGACTTGACCCGATAACACCACAAGGGGAATAGAATCCATGTAAGCGGTGGCAATACCGGTAATCGCGTTGGTCGCGCCTGGGCCTGAGGTGACTAATACCACGCCCACTTTACCCGTGGCACGGGCGTAGCCATCGGCCATGTGTACGGCGGCTTGTTCGTGGCGAACGAGTATGTGTTCAATACCGGGGATAACGTGCAGGGCGTCGTAGATATCTAAAACTGAACCGCCAGGATAGCCAAAAATGTGCTTTACGCCCTCATCGATCAAAGAACGCACTATCATGCTGGCGCCGGATAACATCTCCAT
Protein sequences of DBSCAN-SWA_1 >LR134321|2272084:2282384|2278467_2278635_+|VEF25911.1|DBSCAN-SWA MKLLLVLLCMLVSSMAWAHPGHGGVGLFHHLLDLAPAVILVALIAWAGIWAKNRK >LR134321|2272084:2282384|2274074_2274599_+|VEF25906.1|DBSCAN-SWA MNYFELFKFSPAFDIDTALLAERYRELQRAVHPDKFANDTEQQKLLSVQRTAQVNDGFQTLKDPIRRAEHMLSLRGIELSHETTTVKDTGFLMQQMEWREALEDIRDSADPQASIDELYQSFAQYRAQLTQQLTQLLTSEQAEDALLAADQVRKLKFMAKLHDELTRVEDALLD >LR134321|2272084:2282384|2273734_2274058_+|VEF25905.1|DBSCAN-SWA MAITMTPAAADRVRSFLVNRGKGVGLRLGLRTSGCSGMAYVLEFVDSLNDDDEVFDIEDVKIIIDAKSLIYLQGIELDFVKEGLNEGFQFNNPNAKGECGCGESFTV >LR134321|2272084:2282384|2279625_2280069_+|VEF25913.1|DBSCAN-SWA MRTYDLTPLYRSAIGFDRLAQLAEHAAANNGNSGYPPYNIELLGENRYRITMAVAGFSMDELEISSEGEKLLVKGNKAESQTERKYLYQGIAERGFERTFQLADYVTVLGASLENGLLNVDLVREIPEALKPRKIEITSSRLLDSQS >LR134321|2272084:2282384|2277033_2277939_+|VEF25909.1|DBSCAN-SWA MRIAILSQGPELYSTKRLVEAAQLRGHEVHVINPLECYMNINMRQSSIHIGGRELPAFDAVIPRIGASITFYGSAVLRQFEMMGVYALNDSVGISRSRDKLRSMQLMSRRGIGLPITGFANKPSDIPDLIDMVGGAPLVIKLLEGTQGIGVVLAETRKAAESVIEAFMGLKANIMVQEYIKEANGADIRCFVLGDKVIAAMKRQAMPGEFRSNLHRGGTASLVKLTPEERSVAIRAAKTMGLNVAGVDLLRSNHGPVIMEVNSSPGLEGIEGATTKDVAGAIIDFVEKNAIKVKKVTQAQG >LR134321|2272084:2282384|2276536_2276875_+|VEF25908.1|DBSCAN-SWA MPQLVFLPHAELCPDGAVLEANVGETILDVALRNGINIEHACEKSCACTTCHCIVREGFNDLEPSDELEDDMLDKAWGLEPESRLSCQAKVVDTDMVIEIPKYTVNMVSEGN >LR134321|2272084:2282384|2280171_2280666_-|VEF25914.1|DBSCAN-SWA MRRIISVLLENQSGALSRVVGLFSQRGYNIESLTVAPTDDITLSRLNITVVADEMVLEQIEKQLHKLIDVLKVSNITESAYIERELALVKVRAQGEFREEVKRMADIFRGQIVDVTANLYTIQMVGTSDKLDAFIHALGEVTKLVEVSRSGVVGLARGEKAMKA >LR134321|2272084:2282384|2280665_2282384_-|VEF25915.1|DBSCAN-SWA 
MEMLSGASMIVRSLIDEGVKHIFGYPGGSVLDIYDALHVIPGIEHILVRHEQAAVHMADGYARATGKVGVVLVTSGPGATNAITGIATAYMDSIPLVVLSGQVPSNLIGNDAFQECDMIGISRPVVKHSFLVKDPTEIPEIIKKAFYIASTGRPGPVVVDLPKDCLNPALLHEYCYPESVKMRSYNPTTSGHKGQIRRGLQALLAAKKPVLYVGGGAIISGCDKQILQLAERLNIPVVSTLMGLGAFPSTHKNSLGMLGMHGRYEANMTMHNCDLIFGIGVRFDDRTTNNVEKYCPNATILHIDIDPSSISKTIHVDIPIVGSADSILDSMLALLDESNEHNDNDAIATWWNDIEQWRSRDCLAYDKNSGRIKPQQVIETLYKLTNGDAYVASDVGQHQMFAALYYPFDKPRRWINSGGLGTMGFGLPAAMGVKMAMPDATVVCVTGDGSIQMNIQELSTALQYDTPVKIINLNNRFLGMVKQWQDMIYSGRHSHSYMDSVPNFAKIAEAYGHVGMTISDPAELESKMAEALAMKDRLVFIDIMVDETEHVYPMLIRGGAMNEMWLTKTEKC >LR134321|2272084:2282384|2278052_2278250_+|VEF25910.1|DBSCAN-SWA MTLTWIDSLDIALELLEAHPEVNPTQLHFTQLYEWVLALDDFADDPKHCNEKILEAIQQCWIDEK >LR134321|2272084:2282384|2273336_2273720_+|VEF25904.1|DBSCAN-SWA MAYSEKVIDHYENPRNVGSFDKNDPSVVTGMVGAPACGDVMKLQLKIGADGIIQDAKFKTYGCGSAIASSSLVTEWVKGKTIEQAAAIKNTDIAEELALPPVKIHCSILAEDAIKAAIDEYKSKQAK >LR134321|2272084:2282384|2278831_2279263_+|VEF25912.1|DBSCAN-SWA MAIERTFSIIKPDAVAKNHIGAIYNRFETAGLKIVASKMLHLTKEQAEGFYAEHSERGFFGALVAFMTSGPIMVQVLEGENAVLAHREILGATNPAQAAPGTIRADFAESIDENAAHGSDAVESAAREIAYFFSAEELCPRTR >LR134321|2272084:2282384|2274665_2276528_+|VEF25907.1|DBSCAN-SWA MALLQIAEPGQSAAPHQHRLAVGIDLGTTNSLVAAVRSGETATLPDELGQHSLPSIVRYTQDSVEVGALAALSSAQDPQNTIVSVKRFMGRSLADIKAGEQSFPYEFAESENGLPLFVTPQGQVNPVQVSAEILRPLIARAEKTLGGELQGVVITVPAYFDDAQRQGTKDAAALLGVKVLRLLNEPTAAAIAYGLDSKQEGVIAIYDLGGGTFDISILRLNRGVFEVLATGGDSALGGDDFDHLLQAHMQQVWQLSDIDSQLSRQLLIESRRVKEALTDAAETEAKVILADGTELTQIVTKAEFDAMIAALVKKTIASCRRTLRDAGVTTDEVLETVMVGGSTRVPLVREQVEAFFGKSPLTSIDPDRVVAIGAAIQADILVGNKPESDLLLLDVIPLSLGIETMGGLVEKVVSRNTTIPVARAQEFTTFKDGQTAMAFHVVQGERELVADCRSLARFTLKGIPPLAAGAAHIRVTFQVDADGLLSVTAMEKSTGVQSSIQVKPSFGLSDTEIATMLKDSMKYAKDDIGRRMLAEQQVEAARVLESLHAALAKDGDLLNADERGQIDATMANVAQVAAGDDADAIKLAIEKLDEQTQDFAARRMDNSIRVAFKGQSIDNI >LR134321|2272084:2282384|2272084_2273299_+|VEF25903.1|DBSCAN-SWA 
MKLPIYLDYAATTPVDPRVAEKMFQCMTMDGIFGNPASRSHRYGWQAEEAVDIARNQIAELINADHREIVFTSGATESNNLAIKGVAHFYHKKGKHIITSKTEHKAVLDTCRQLEREGFEVTYLEPAANGIIPMERLEAAMRDDTILVSIMHVNNEIGVIHDIDAIGELCRSKGIIFHMDAAQSAGKVPIDVQATKVDLISISGHKMYGPKGIGALYVRRKPRIRLEAQMHGGGHERGMRSGTLPTHQIVGLGEAAAIAKAEMASDDARIGALRDKLWNGIKHIEETYINGDAIERVSGSLNVSFNYVEGESLMMALKDLAVSSGSACTSASLEPSYVLRALGLNDEMAHSSIRFSIGRFTTEEEIDHAIEVITQSIDKLREMSPLWEMFKDGIDLNQVQWAHH |
13 | Faustovirus(12.5%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotations contain specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
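The keyword-based highlighting described for the prophage region can be approximated with a simple annotation filter. The sketch below assumes case-insensitive substring matching against the listed keywords; the page does not say how the database actually performs the match, and the names `PHAGE_KEYWORDS` and `matched_keywords` are hypothetical.

```python
# Keywords quoted in the prophage-region legend on this page.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def matched_keywords(annotation: str) -> list[str]:
    """Return the phage-related keywords found in a protein annotation.

    Assumption: a keyword matches if it occurs anywhere in the annotation,
    ignoring case. This can over-match (e.g. 'head' inside a longer word).
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in text]


# Annotations taken from the neighbor-protein headers in this record.
examples = [
    "ATP-dependent-protease-Lon",
    "Recombination-protein-RecR",
    "Response-regulator-uvrY",
]
for name in examples:
    hits = matched_keywords(name)
    print(name, "->", hits if hits else "no phage keyword")
```

A stricter variant could match whole hyphen- or space-delimited tokens instead of substrings, which would reduce false positives at the cost of missing compound names.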
DBSCAN-SWA_2 |
2477542 : 2489092
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >LR134321|2477542:2489092|DBSCAN-SWA TTTGTCTCAAGGAAATAGTGTACGCACGCTCACTGGCCTGCAACGTCTGCTGGAAGGCGGACTCATCATTTGTTGTGTCCTCGCCACTTATATTTTGCTCGCATTGACGAGTTTTAGTCCGTCTGATCCTGGTTGGAGTCAGTCGCATTTTCAAGGCGATATTAAAAACTGGACCGGAGCAGTAGGGGCTTGGATAGCCGATATTCTGTTATATTTCTTCGGCGTCACGGCTTACATCATGCCAATCATTGTCGCGTCGACCGGCTGGCTGCTGTTCAAACGCGCCCATCATTTGCTCGAAGTAGATTACTTTTCAGTGGCCCTGAGATTGATTGGATTTTTGCTGATCATCCTTGGGTTTTCTGCGTTGGCGAGTATGAACGCCAACAATATCTACGAGTTTTCGGCAGGCGGAGTGGCGGGCGATGTGATCGGCCAAGCCATGCTGCCGTATTTTAATAAACTCGGCACCACCTTATTGCTGCTGTGCTTCTTAGGTTCTGGCTTTACCCTGTTGACGGGGATCAGCTGGCTGACCGTGGTCGAGAAAATAGGCTTAATCTCTATTTGGATTTATAAGAAACTTAAATTATTACCGCAAGCGATGAAACCTGAGCGTGAAACCGAAGATACCCGTGGTTTTATGTCTGTGGTCGACAAATTTAAACAGCGCCGCGAATCACAATACGTGCTTGAGAAGCCGCCAGTCGTTGCCACGCCAAAGGTGCGTGAACGTCATATTGGCCGCAGAGCCGAAATAACTCCAACCCTGTCTACTGCCGCAGATGAAGGGTTTATTACCGAAAGCATCAATACTGAAGAAGTTGCGCCGCAAAAAACTAAGCTATCGGCCCTTGCAAAAATTTTGAGTTTGAATGGCAGTAAAAGCAAAAATGCACAAAGGGTTGAGCCTCAAATCGATCAAGAGGATTTTGCCGCCCATGGCAATTTTGAAGCGCCACCTTGGTTAGCCGAACCTCAACATGCAAGAAATGATGAGCAAGAAAGAGCCGATTTCAATTCTCATTCATTCGATCACGACGACAATGAGCCTGTGTTTAACAGCCAAACCTTGGCGGAAGATGACGATGAAAGCTTAGGCTTTACCGATGATGACGTCATTGATTTTGATACAAAAGCCTCAACGGGCGCCGTGAATCAAGCCCAGCGAAAGAAGCAAGATCAAAAAGCGAAGATTGTGGATGGTATTGTGGTGTTACCGGGACAAGAAGATAAACCCGCGCCAAAAAAACCAATGGACCCGTTACCCAGCATCAATTTGCTTGATGTGCCTGATCGTAAGAAAAACCCGATCAGCCCAGAAGAGTTAGATCAAGTCGCTCGTCTAGTTGAAGTGAAATTGGCCGATTTCAACATCATCGCCAATGTGGTGGGTGTGTATCCAGGTCCTGTGATCACGCGCTTCGAATTAGAATTAGCGCCTGGCGTTAAAGCGTCAAAAATCTCTAATCTGTCGAAGGATTTAGCCCGTTCACTGCTGTCTGAAAGTGTGCGCGTGGTTGAAGTTATCCCGGGCAAGGCTTATGTCGGACTCGAGCTGCCGAATAAATTCCGTGAAACCGTGTTTATGCGCGATGTGTTAGATTGCGCCGCCTTTACCGACAGTAAATCCAACTTAACTATGGTGTTAGGTCAAGATATTGCCGGCGACCCAGTCGTGGTTGACCTGGGTAAAATGCCGCATTTATTAGTTGCGGGAACCACAGGTTCTGGTAAGTCTGTGGGCGTAAATGTGATGATCACCAGCTTACTGTATAAGTCTGGCCCAGAAGATGTGCGCTTTATCATGATCGATCCCAAAATGTTGGAACTCTCTGTCTATGAAGGTATTCCGCATTTGCTCTGTGAAGTGGTTACCGATATGAAAGAAGCCGCCAATGCACTGCGTTGGTGTGTGGGCGAGAT
GGAACGCCGCTACAAGCTGATGTCAATGATGGGCGTGCGTAACATCAAGGGCTATAACGCCAAAATTGCTGAGGCGAAAGCCAATGGCGAAGTGATTTTAGATCCTATGTGGAAGTCATCCGACAGCATGGAGCCAGAAGCGCCTGCGTTAGATAAGCTGCCATCGATTGTTGTCGTGGTCGATGAATTTGCTGACATGATGATGATTGTCGGTAAGAAAGTTGAAGAGTTGATTGCCCGTATCGCGCAAAAGGCCCGCGCTGCAGGCATACATTTGATTCTGGCAACCCAAAGACCGTCAGTGGATGTGATTACTGGCTTGATTAAAGCTAACATTCCGACACGTATGGCGTTCCAAGTGTCATCGCGTATCGATTCGCGCACCATTTTAGACCAACAGGGTGCTGAAACCTTGCTCGGTATGGGGGATATGTTATTCCTACCACCGGGTACCGCGGTTCCAAACCGTGTCCATGGCGCCTTTGTTGACGATCATGAAGTTCACCGCGTCGTTGCCGATTGGTGCGCCCGTGGTAAACCGCAATATATCGATGAAATTCTCAATGGTGCCAGCGATGGTGAGCAAGTCTTACTACCGGGTGAAACCGCAGAAACCGATGAAGAATACGATCCACTCTACGATGAAGCCGTCGCCTTTGTGACCGAAACCCGTCGCGGCTCGATTTCAAGCGTACAGCGTAAATTTAAGATTGGTTATAACCGTGCAGCGCGCATTATTGAACAGATGGAAGCCCAAGGTATCGTTTCGGCTCAAGGCCACAACGGTAACCGTGAAGTCTTAGCGCCGCCGGCACCAAAACATTATTAAGCCGCGCTAAGGCGTGGCTTTGATACACCCAAGGGTTTTACAGGGTAAGGAATAAAATGAAAAAACGATTATGCGCTGTGTTGTTAGCTTCACCTTTGCTATTTAGCGCGGCTGTGTTTGCCGATGATGCGCAGCAGTTAAGAGACAAGTTGATTGGTACTGCATCACTAAAAGCCGATTTCAAACAAACCGTCACTGACGTCAATAAAAAGGTCATTCAGACGGGTTCTGGTATTTTTGCCTTGGCGTATCCTAATCAGTTTTATTGGCACTTAACTCAGCCAGATGAATCGCAAATTGTCGCTGATGGTAAAGATTTATGGATCTACAATCCCTTCGCCGAAGAAGTGGTCATCATGGATTTTGCCGAGGCTATCAATGCTTCGCCTATTGCTTTGTTAGTCCACCGCGACGATGCCACTTGGTCACAGTATTCAGTGACCAAGCAACAAGACTGCTATGAGATCAAACCTAAGGCGATTGATTCGGGCATTTTGTCTGTCAAGGTGTGTTTCAAAAATGCCCAGTTAGCCAACTTTAATGTTGCCGATGACAAAGGTAACTTGAGCCAATTTGATTTGAGCAATCAGCAAGCGATTACCGATAAAGACAAAGCGCTGTTCAGCTTTGTGTTGCCTGACAATGTCGATGTTGATGATCAACGTCGTAAAACAGCACACTAGGCGATAGCGTGAGCAGTTTATCCTTTAATTTCGCCCCCGACTTTCGTCCCTTGGCCGCACGCATGCGGCCAAGAACGATCGCCGAGTACATAGGCCAAGCCCATTTACTGGGGGAAGGCCAGCCGCTACGCAAAGCATTGGAAGCGGGACGCGCCCATTCCATGATGTTGTGGGGGCCGCCGGGTACAGGAAAAACGACCTTAGCCGAATTAATCGCACATTATTCAAATGCGCACGTTGAACGCATCTCTGCGGTCACCTCTGGCGTCAAAGACATTCGCGCGGCGATTGAGCAAGCGAAAGCCGTGGCTGAATCCCGTGGTCAACGCACGTTATTGTTTGTCGATGAAGTCCACCGATTCAATAAAAGCCAGCAGGATGCCTTTCTGCCATTTATTGAAGATGGCACTGTGATTTTTATCGGTGCGACCACTGAAAACCCGTCGTTTGAAATCAACAATGCGCTGCTCTCGCGGGCACGGGTTTAT
CTTATCAAGCGCTTAAGCCATGATGAGATCGCCCACATAGTGACTCAAGCCTTAAGCGATACCGAGCGCGGCTTAGGCCAACGCCAATTTGTGATGCCAACCGATGTGCTCACCACACTGGCGCAACTTTGTGATGGTGATGCCCGTAAAGCATTAAATCTTATTGAGTTGATGAGCGATATGCTCGCCGATGGCGGCACCTTTACCACTGAGATGTTGATCCAAGTGGCGGGGCACCAAGTTGCCGGATTCGATAAGAACGGCGATCAGTTTTACGATTTAATCTCAGCCGTCCATAAATCAATCCGCGGCTCAGCACCCGATGCGGCGCTGTATTGGTTTTGTCGAATATTAGAGGGCGGCGGCGATCCGCTGTATGTCGCAAGGCGCTTACTGGCGATTGCCTCTGAAGATGTCGGCAATGCCGATCCTGCGGCGATGACCATAGCGCTTAATGCTTGGGATTGTTTTCACCGTGTCGGCCCAGCCGAAGGTGAGCGTGCGATAGCACAAGCCATTGTTTATCTTGCCAGCGCGCCTAAGAGTAACGCTGTCTATACCGCATTTAAGGCGGCGCGTGAATTAGCTCGCGATACTGGGCAAGTCGAAGTGCCGCACCATTTACGCAATGCACCGACTCAGTTAATGAAAGACATTGGCATGGGAGCAGGGTATCGATATGCCCACGATGAAGCCAATGCCTATGCCAGTGGTGAGAATTATTTCCCTGAATCCCTGCAAACAGCGCAGTTTTATTTTCCGACTGAGCGAGGGTTCGAGAAGCGAATCAAAGATAAGTTGGCGCAATTAGCCCAGTTAGATCAAGCCAGCGAGCATAAAAGATATGAATAATCTCTTACTTGTGGCGCTAGGTGGTTCAATTGGGGCTGTTTTTCGCTATCTTATTTCAATATTCATGATCCAGGTATTTGGCAGCAGTTTTCCTTTTGGTACACTGTTGGTTAATGTCCTCGGTTCATTTTTAATGGGCGTAATTTACGCACTGGGGCAAATGAGTCATATCAGCCCAGAACTCAAAGCGCTGATCGGTATCGGCCTGTTGGGCGCTTTGACAACGTTTTCGACTTTCTCTAATGAAACCTTATTGCTGCTGCAAGAAGGGGATTGGCTGAAGGCGACTTTGAATGTGGTGTTGAATCTAAGTCTATGTTTATTCATGGTGTACTTAGGTCAGCAACTGGTTTTTTCTCGCATTTAACTATTAAGAATATATCACATGTTAGATCCTAAATTTTTGCGCAACGAATTAGAAGTTACCGCTGAGCGACTGGCCACCCGTGGCTTTATTTTAGACATAGCTCACCTCACTCAATTAGAAGAAAAGCGTAAGTCACTGCAAGTGGCTACCGAAGAATTACAAGCGTCGCGTAATGCCATCTCCAAATCCATTGGCCAAGCTAAAGCCCGTGGTGAAGATGTTGACGCTATCATGGCGCAAGTGGGCGATTTAGGTTCACAGCTGGATGCCAGAAAGATTGAACTGGCCGCCGTACTTGAAGAAGTCAATGCGATTGCCATGTCGATGCCCAACCTGCCAGACGAGTCCGCACCAATCGGTGCCGATGAAACAGAGAACGTTGAAGTTCGCCGCTGGGGCACGCCACGCACGTTTGATTTCCCAATTAAAGATCATATCGATTTAGGCGAAAGCCTAAACGGTTTAGATTTTAAAAATGCAGTGAAAATCACTGGCTCACGTTTCATAGTGATGAAAGGCCAAATTGCCCGCTTAAACCGCGCCATTGGACAGTTCATGCTGGACTTGCACACCACAGAGCACGGTTATACCGAAGCCTATGTGCCATTACTGGTTAACGAAGCAAGCTTACTCGGCACAGGCCAGCTACCTAAGTTTGGTGAAGATTTATTCCACACTAAACCTGCCACCGAAGAAGGTCAGGGCTTAAGCTTGATCCCAACGGCCGAAGTGCCATTAACCAACTTAGTGCGCGATAGCATTGTTGATGAAGACGAGCTGC
CGATTAAATTAACTGCGCATACCGCATGTTTCCGCAGCGAAGCGGGCTCATACGGCAAAGATACCCGTGGTCTGATCCGTCAGCATCAGTTCGATAAAGTTGAAATGGTGCAAATCGTTAAGCCGGAAGATTCAATGGCGGCGCTCGAAGCGCTAACTGGTCATGCTGAAACTGTATTGCAACGTTTAGGCCTGCCATACCGTACTGTGATTTTATGTACTGGTGATATGGGCTTTGGCTCAAGTAAAACCTACGACATCGAAGTCTGGTTACCAGCACAAAACACTTATCGTGAAATTTCTTCTTGCTCAAACATGAAGGATTTCCAAGCCCGTCGTATGCAAGCGCGTTACCGCGTGAAAGCAGACAATAAACCCGCGTTACTGCATACCTTAAACGGTTCAGGTTTAGCTGTGGGTCGTACCTTAGTGGCTATTTTAGAAAACTATCAAAATGCCGATGGTAGCATCACAATCCCAGAAGTGCTGCGTCCATACATGGGCGGCTTGACTCAAATAGGCTAATGGAACGCAAAATAATACTATGGATAAAGGCCAGCGCTATCTCCGTTGGCTTTCCTATACCGCTGTCATTGCAGTATTTTTTGCGGTAATGTTGGTGACCATAGGCAAAGCCGTCTGGGTTGTGTTAGAAAACGTCAAATAACCCTATTAACTTTGTTCAAGTAAAAAGAAAGGCATGTTTCCTAAGGGAACATGCCTTTTTTGTTTGTGCCATGTTTATGTTTAATACTCACATTTGACGCTTACGTTCGATATTGAGCTAAGTATGATGACTTAACCTATACCATTAAATGCATTTCGCTGGCTTTGGCAGGCCCGCAATTTTGGTGGCTTGTTTTGCAGGGCCAATAGGGAACAAGGTATATAAGTATTTTGAATTGCCTTTGTCTGGCCCTAAAGCTTGGCCAATAGCTTTGACTAATACGCGAATAGCAGGGCTCGTTTTGTATTCAAGATAAAAATCGCGAACAAAGTTAATCACTTCCCAATGGGCGGGCGTAAGCACGATTTGCTCTTCTGCCGCCAATAGTAAAGCCATATCCGGTTGCCAATCGTTGAGATCTTTTAAATAACCTTGATGGTCACGGGCGATTTCTGCGCCATTAAATTGCAATGGATTTACCACGTTATCACCTTGTCATGGAGTAATGATTGAGCAACAAACTCATTGTAATCGATAAGCGTCACGTTCTTTAAACGCTCAGATAGGCCGCGTGCGACGACATCGTCTTTTAAGACCATGAGCTTGAAGGGCGATAATGCCATTGCCCATTGACGCAGCAGTAACGCGTTCACGCCATCGCTCGATAGTAGAATCGCATCTTCCTTACAGGCGTAACGTAAGCAGAGTTTCAATGCACTATCTCGGCTGGGAGATGTTTGAATATGATGTAAAATCATCAGAATACTAAGACCTCATCCACTTGTTTTAAATGGGCAGCGATGGCTTCATCGTTGACCACAGTAACAGGAATAGACAATAAACTATGGCTTAAGCCGTAATCGGCTAAGGATTGCTTACAGACAAATACCGACTCGATATCGTACAGCGGCAGTGCTTTAAAGGTGGAGATGTAATCTTTAGCGCCAATAAGCTCAGGTTGTTGATCTTTAATCAAATGCAGCACACCTTCATCGACAAACACTAAACTCACTTCTTGCTCAAAGCTGGCGCTCAGCAGGGCGAGATCTAAACCTTCGCGGCCACGAACAGTACCGTGTGGCGAACAGCGAAATAGGATACACAATTTTTTCATAGTCACCCTGAGAGTTAAAAACTGATCAAACGATCGGCCGATTCAATGCCGGTGACAAGCTCGCCTAAACCGCCCATGATAAATGACTCGCCCACATTCCAGTGGCTCAGGCCATTCTCGCTGGCATCTTGCTTGCTTACAATCCCACGGCGCAGCGCTGCTGAAACGCAATTGACTAACGGGAGCTGATGGGATTGCGCGAGCTGTTTCCAGTCACTGA
TCACATCGTTTTCATCGGAGGCGGGTAAGCTTAAATCTGTGGAGTTATACACACCGTCTTGATAAAAAAACACACAGATAATCTGATGCCCAGCCAAAAGTGCAGCTTGGGCAAAGCGTAAGGCGTTAACACTCGCCGACGTGCCATAAGCGGGCCCGTTCACTTGGATAATAAATTTGCTCATTTAAAATAAAAATGGCCCTAAAAGTAGGGCCATTCTAACGCAAGTTAAGCTAAAAATCGCTAACTAGGATTAGTCATCGTTGCCTATGCCCATAAGGTGCAACAGCGCGATGAAGAGGTTTAAGAAATCTAAGTATAATGAGATAGTTGCGCGAATGTAGTTCGTCTCGCCACCGTTCACAATACGGCTAGTATCAAACAGGATAAAACCTGTCATCAATAATGCAATACCGGCGTTAATCGCCATAAAGGCAATGCTGTTACCCATAAAGATGTTAATTACGGCAGCGGCAATCACCACAATCAAGCCAGCGAACAAGAACCCACGTAGGAAAGAGAAATCTTTCTTCGTGGTCACCGCATAGGCTGATAAGGCAACGAAAATCACTGACGTTAACCCCAGTGCTTGCATGATCAGCTGTGGACCGTTTGCCATGCCTGCATAGTGATTTAACATGTAGCCCAGTGACGCACCTTCCATACCAGTAAAGGCAAATACCCAGAAGATACCCGCTGCAGATTCGGCTTTACGTAAGGTTACAAACAATAGCACTAGGCCACCAATGGATAAGCCAATGGAGAGTAATGGACTGATATTTAATGCCATTGCCAAGCCAGCGCACAGGGCAGAGAAGGCCAATGTCATAGACAACAGCATATAAGTGTTTTTAAGAAGTTTGTTCACTTCCAGTGTTGACGCACTTGTCGAATATAAGGTTTCTTGGGTCATGTTGATCTCCGTTGACACATTTCCGAACTGATCATTTTGATTTTTTACAGCACACTAAGTTCCCGTAAAATCCTAAAACGAACTTAACTGAATTCATGATACCGTACCTGAGATACGGGTGAAAGTCATAGAACCATGAATAACAAATTGTTAATCTTAAAAATTTTGCCTCATCTCACATATCAATCTTAAGTGCTAAGCGCTGAGGATTGAGATGGCTGCATCCGTATCTTTATCCACTAAGCACATGAATGCTTGGTTAAAACTTAATTGGCTTGATAAATAGGCCTGCTGAACTTAGGAGCACTTACAGCAAATTTCAAGTTCAATGTTAATTTCAATGCCGATTTCAAAGACAAGTCCAATGCTAATTTCAATGCCAAGTATAATGACTAGATCAGCATATTGAGTTTAAACCAAGCTTAAGCCAGCCACTCTTTATGCAATTTTTCGGTGTCACCCAGATATTCCAGCACCCAAGCGAGCGCGGGCGACATCTTGTCGGCATTCCACGCGAGGCAACAAGGGCTGGTCTGCTTAGGATTTTCCAGTTGTTTCTCAACCAGCGCACCCGCTTTAATAAACACACTTGCTAAGTGCACTGGCATATAACCAACGCCTAAACCTTCGCGGAAACAGTTGATGGCGCGGATCCAATCGGGGACAACGAGGCGACGTTGATTTTCGAGCAGCCAAGTCATACGTTTTGGAATTTCCCGAGAGGTGTCTTCTAGGCAAATCGAGGGGAAAGGGCGCAGCTCATCGTCGCTTAGCGGACGGTCAATATTGGCCAATGGATGATTTTTGCTGACTAAAAAAGCCCACTCAATATCGCCCATGTCTTTATATTGGTACACGCCGCCGACCGGAATTGCGGTTGTTGCACCAATTGCAATGTCACTGCGGCCCGTGGCAAGTGCTTCCCAAACGCCGTTAAAGACCTCAATGCGAATAATCAATTCAATATCATGGAAATGACGATAGAAGTCAGCAATCAATACGCTGATCCTATCGGCACGGACGATATTATCCAGCGCGATGGACAAGGTCGGTTGCCAACCGTTTGCTACGCGCTGCGTGCCACGCT
TCATCTCATCCATCTGCGTAAGCAAAGTTCTGGCTTGCTTAACAAAATGTTCACCCGCAGGGGTTAAGGTTACGCTGCGATGATGACGTTCGAAAAGAATGACGCCGAGTTCTTCTTCTATTTGCTTGACCGCGTAGCTCACGGCAGAGGGCACTTTATGCAGCCGATTCGCCGCCGCGGTGAAACTTCCTACACGAGCCACAATATCTATGAGTTCTAGTGCTTGTTCTGAGAGCATAGCTGACGTCCGTTCCAATGAAAATAATTGATAGCACAAGTCAAAAATAAACGTTTCCAATAGATTTAACAAGTCATTAGACTGCACGCTTTAGTCAATCCGACCCGATTATTTTATGAAAACGTCTAGTAACACCTTTGTAAACATAAAGTTCTTTATGTTTCTATTCTATTTAGCCATGTTGAGCATGCTCGGCTTTATTGCCACTGACATGTATTTGCCTGCTTTCAAAGCAATTGAAGGCTCACTTAACAGTACCCCATCACAGGTGGCGATGTCGCTGACCTGTTTCCTGGCGGGTCTTGCCTTAGGTCAGCTACTTTATGGCCCCTTAGTCAATAAAATCGGCAAACGTTGGGCATTGATTTTTGGCCTAGTGGTATTTGCCGTTGCCAGCGTGTTTATTGCCAACAGTGATTCAATCCTAATGCTCAATACCGCGCGCTTCTTCCAAGCGATTGGCGCCTGTAGTGCAGGGGTTATTTGGCAAGCGATTGTGGTTGAACAATATGATGCTGATAAAGCCCAAGGTATTTTCAGTAACATTATGCCACTGGTCGCATTATCGCCTGCATTAGCACCGATCCTTGGCGCTTACATTCTGAACGAATTAGGCTGGCGTGCAATCTTTATCTCTTTGTGTGTTATCGCTTTTCTACTGATATTGATGACATTGTACTTTGTGCCAGGCAAGCGTCAGCATACACACAGTGAACATGCTGCTGTATCTTATGGACAAATTTTGAAAAACACCCGCTATTTAGGCAACGTCGTGATCTTCGGCGCCTGTTCCGGTGCTTTCTTTGCTTATCTAACCGTATGGCCGATTGTGATGGAAATGCACGGTTATCTGCCAACTGAAATCGGCCTTAGCTTTATCCCACAAACCATCATGTTTATTGTCGGTGGTTATGCGAGTAAGTTACTGATAAAACGCATTGGCGGGCATCAAACACTGAATATCTTGTTGTCGATTTTCGCAGCCTGCGTTATTTCGATCATCATGTTCACGCTGATTTTCCCAGCAAGCACGATTTTCCCACTGCTTATTTCGTTCTCGATTTTGGCCGCAGCTAATGGTGCCATTTACCCGATAGTCGTGAACAGCGCGCTACAGCAGTTTAGTCAGAATGCCGCTAAGGCCGCAGGATTACAGAACTTCTTGCAGATCAGCATTGCCTTTGGTGCATCTAGCTTAGTCGCTATGTGGGCAAGCCTAGGTGAAGTAGCGATCGGTTGGGGCATCCTATGTTGCGCAATCTTAGTTGTGGTCGGCTATGTGCTGAAGAGCCAACAAGGGTGGTGTGATTTTGCCAAACATTTCACTAAGCCAGATCCCGCACGTGTTGGCATAGTGGCCGATGTGAAGCAAAATTCAGCGGATTGA
Protein sequences of DBSCAN-SWA_2 >LR134321|2477542:2489092|2480987_2482319_+|VEF26079.1|DBSCAN-SWA MSSLSFNFAPDFRPLAARMRPRTIAEYIGQAHLLGEGQPLRKALEAGRAHSMMLWGPPGTGKTTLAELIAHYSNAHVERISAVTSGVKDIRAAIEQAKAVAESRGQRTLLFVDEVHRFNKSQQDAFLPFIEDGTVIFIGATTENPSFEINNALLSRARVYLIKRLSHDEIAHIVTQALSDTERGLGQRQFVMPTDVLTTLAQLCDGDARKALNLIELMSDMLADGGTFTTEMLIQVAGHQVAGFDKNGDQFYDLISAVHKSIRGSAPDAALYWFCRILEGGGDPLYVARRLLAIASEDVGNADPAAMTIALNAWDCFHRVGPAEGERAIAQAIVYLASAPKSNAVYTAFKAARELARDTGQVEVPHHLRNAPTQLMKDIGMGAGYRYAHDEANAYASGENYFPESLQTAQFYFPTERGFEKRIKDKLAQLAQLDQASEHKRYE >LR134321|2477542:2489092|2485261_2485651_-|VEF26085.1|DBSCAN-SWA MSKFIIQVNGPAYGTSASVNALRFAQAALLAGHQIICVFFYQDGVYNSTDLSLPASDENDVISDWKQLAQSHQLPLVNCVSAALRRGIVSKQDASENGLSHWNVGESFIMGGLGELVTGIESADRLISF >LR134321|2477542:2489092|2484609_2484891_-|VEF26083.1|tRNA|DBSCAN-SWA MILHHIQTSPSRDSALKLCLRYACKEDAILLSSDGVNALLLRQWAMALSPFKLMVLKDDVVARGLSERLKNVTLIDYNEFVAQSLLHDKVITW >LR134321|2477542:2489092|2482704_2483991_+|VEF26081.1|tRNA|DBSCAN-SWA MLDPKFLRNELEVTAERLATRGFILDIAHLTQLEEKRKSLQVATEELQASRNAISKSIGQAKARGEDVDAIMAQVGDLGSQLDARKIELAAVLEEVNAIAMSMPNLPDESAPIGADETENVEVRRWGTPRTFDFPIKDHIDLGESLNGLDFKNAVKITGSRFIVMKGQIARLNRAIGQFMLDLHTTEHGYTEAYVPLLVNEASLLGTGQLPKFGEDLFHTKPATEEGQGLSLIPTAEVPLTNLVRDSIVDEDELPIKLTAHTACFRSEAGSYGKDTRGLIRQHQFDKVEMVQIVKPEDSMAALEALTGHAETVLQRLGLPYRTVILCTGDMGFGSSKTYDIEVWLPAQNTYREISSCSNMKDFQARRMQARYRVKADNKPALLHTLNGSGLAVGRTLVAILENYQNADGSITIPEVLRPYMGGLTQIG >LR134321|2477542:2489092|2480352_2480979_+|VEF26078.1|DBSCAN-SWA MKKRLCAVLLASPLLFSAAVFADDAQQLRDKLIGTASLKADFKQTVTDVNKKVIQTGSGIFALAYPNQFYWHLTQPDESQIVADGKDLWIYNPFAEEVVIMDFAEAINASPIALLVHRDDATWSQYSVTKQQDCYEIKPKAIDSGILSVKVCFKNAQLANFNVADDKGNLSQFDLSNQQAITDKDKALFSFVLPDNVDVDDQRRKTAH >LR134321|2477542:2489092|2477542_2480296_+|VEF26077.1|DBSCAN-SWA 
MSQGNSVRTLTGLQRLLEGGLIICCVLATYILLALTSFSPSDPGWSQSHFQGDIKNWTGAVGAWIADILLYFFGVTAYIMPIIVASTGWLLFKRAHHLLEVDYFSVALRLIGFLLIILGFSALASMNANNIYEFSAGGVAGDVIGQAMLPYFNKLGTTLLLLCFLGSGFTLLTGISWLTVVEKIGLISIWIYKKLKLLPQAMKPERETEDTRGFMSVVDKFKQRRESQYVLEKPPVVATPKVRERHIGRRAEITPTLSTAADEGFITESINTEEVAPQKTKLSALAKILSLNGSKSKNAQRVEPQIDQEDFAAHGNFEAPPWLAEPQHARNDEQERADFNSHSFDHDDNEPVFNSQTLAEDDDESLGFTDDDVIDFDTKASTGAVNQAQRKKQDQKAKIVDGIVVLPGQEDKPAPKKPMDPLPSINLLDVPDRKKNPISPEELDQVARLVEVKLADFNIIANVVGVYPGPVITRFELELAPGVKASKISNLSKDLARSLLSESVRVVEVIPGKAYVGLELPNKFRETVFMRDVLDCAAFTDSKSNLTMVLGQDIAGDPVVVDLGKMPHLLVAGTTGSGKSVGVNVMITSLLYKSGPEDVRFIMIDPKMLELSVYEGIPHLLCEVVTDMKEAANALRWCVGEMERRYKLMSMMGVRNIKGYNAKIAEAKANGEVILDPMWKSSDSMEPEAPALDKLPSIVVVVDEFADMMMIVGKKVEELIARIAQKARAAGIHLILATQRPSVDVITGLIKANIPTRMAFQVSSRIDSRTILDQQGAETLLGMGDMLFLPPGTAVPNRVHGAFVDDHEVHRVVADWCARGKPQYIDEILNGASDGEQVLLPGETAETDEEYDPLYDEAVAFVTETRRGSISSVQRKFKIGYNRAARIIEQMEAQGIVSAQGHNGNREVLAPPAPKHY >LR134321|2477542:2489092|2486802_2487705_-|VEF26087.1|DBSCAN-SWA MLSEQALELIDIVARVGSFTAAANRLHKVPSAVSYAVKQIEEELGVILFERHHRSVTLTPAGEHFVKQARTLLTQMDEMKRGTQRVANGWQPTLSIALDNIVRADRISVLIADFYRHFHDIELIIRIEVFNGVWEALATGRSDIAIGATTAIPVGGVYQYKDMGDIEWAFLVSKNHPLANIDRPLSDDELRPFPSICLEDTSREIPKRMTWLLENQRRLVVPDWIRAINCFREGLGVGYMPVHLASVFIKAGALVEKQLENPKQTSPCCLAWNADKMSPALAWVLEYLGDTEKLHKEWLA >LR134321|2477542:2489092|2484277_2484616_-|VEF26082.1|DBSCAN-SWA MVNPLQFNGAEIARDHQGYLKDLNDWQPDMALLLAAEEQIVLTPAHWEVINFVRDFYLEYKTSPAIRVLVKAIGQALGPDKGNSKYLYTLFPIGPAKQATKIAGLPKPAKCI >LR134321|2477542:2489092|2485720_2486380_-|VEF26086.1|protease|DBSCAN-SWA MTQETLYSTSASTLEVNKLLKNTYMLLSMTLAFSALCAGLAMALNISPLLSIGLSIGGLVLLFVTLRKAESAAGIFWVFAFTGMEGASLGYMLNHYAGMANGPQLIMQALGLTSVIFVALSAYAVTTKKDFSFLRGFLFAGLIVVIAAAVINIFMGNSIAFMAINAGIALLMTGFILFDTSRIVNGGETNYIRATISLYLDFLNLFIALLHLMGIGNDD >LR134321|2477542:2489092|2487862_2489092_+|VEF26088.1|DBSCAN-SWA 
MFLFYLAMLSMLGFIATDMYLPAFKAIEGSLNSTPSQVAMSLTCFLAGLALGQLLYGPLVNKIGKRWALIFGLVVFAVASVFIANSDSILMLNTARFFQAIGACSAGVIWQAIVVEQYDADKAQGIFSNIMPLVALSPALAPILGAYILNELGWRAIFISLCVIAFLLILMTLYFVPGKRQHTHSEHAAVSYGQILKNTRYLGNVVIFGACSGAFFAYLTVWPIVMEMHGYLPTEIGLSFIPQTIMFIVGGYASKLLIKRIGGHQTLNILLSIFAACVISIIMFTLIFPASTIFPLLISFSILAAANGAIYPIVVNSALQQFSQNAAKAAGLQNFLQISIAFGASSLVAMWASLGEVAIGWGILCCAILVVVGYVLKSQQGWCDFAKHFTKPDPARVGIVADVKQNSAD >LR134321|2477542:2489092|2484890_2485247_-|VEF26084.1|tRNA|DBSCAN-SWA MKKLCILFRCSPHGTVRGREGLDLALLSASFEQEVSLVFVDEGVLHLIKDQQPELIGAKDYISTFKALPLYDIESVFVCKQSLADYGLSHSLLSIPVTVVNDEAIAAHLKQVDEVLVF >LR134321|2477542:2489092|2482311_2482686_+|VEF26080.1|DBSCAN-SWA MNNLLLVALGGSIGAVFRYLISIFMIQVFGSSFPFGTLLVNVLGSFLMGVIYALGQMSHISPELKALIGIGLLGALTTFSTFSNETLLLLQEGDWLKATLNVVLNLSLCLFMVYLGQQLVFSRI |
12 | uncultured_Caudovirales_phage(28.57%) | tRNA,protease | NA |
Homologous phage analysis in the prophage region. Bacterial proteins shown in color are those whose annotation matches specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
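The keyword legend above can be read as a simple annotation filter. The following is a minimal Python sketch, assuming plain case-insensitive substring matching against a protein's annotation text. The function name `phage_keywords_in` and the matching rule are illustrative assumptions, not the database's actual implementation.

```python
# Keyword list copied from the legend; the matching rule below is an assumption.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def phage_keywords_in(annotation):
    """Return the legend keywords found in an annotation string.

    Matching is a case-insensitive substring test, so 'lysin' also matches
    'endolysin'. Keywords are returned in the order of PHAGE_KEYWORDS.
    """
    lowered = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in lowered]


# Hypothetical annotation strings, used only to exercise the function.
for text in ("Phage tail fiber protein", "tRNA,protease", "NA"):
    print(text, "->", phage_keywords_in(text))
```

Substring matching is deliberately loose: a stricter variant could match whole words only, at the cost of missing compound names.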
DBSCAN-SWA_3 |
3590721 : 3601854
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >LR134321|3590721:3601854|DBSCAN-SWA CTTAAGCTAGACGCTCAGTAATCCAAGGTAATACTTGAGCCAATGCCGCATCCAGATTTTCTGGCTGGCTACCGCCCGCTTGAGCCATATCAGGACGACCGCCGCCTTTACCGCCCACTTGCGCAGCGACCATAGCGACTAGCTCACCCGCTTTGACTTTGCCGATCAGATCGTTACTCACACCAGCGATCAGGTTAACTTTCCCTTCCTGCGCGGTGCCAAGAACGATAATGGCCGATTTCAATTTTTGTTTTAATTCATCTTGTAGACCGCGTAATGAGCTTGCCTCAACGCCGTCTAACTTTTTGATCAGTACGTTAACGCCATTAACCACAACGGCATCGCCCACTAAATCAGCACTGGCTGCAGCCGCGAGTTTATCTTTCAACTGCGCCATTTCTTTTTCGAGTTGCTTCATCTTATCGAGCTGGGCTTTTAACTTAGCGACAACTGAGTTGGCATCGCCTTTCAATAAGGCTGCAGCTTCTTCCAGTTCAGCTTGTTGCTGCGCCACATAGGCCATAGCAGCAGCGCCAGTAACGGCTTCGATACGACGTACACCTGCAGCAATACCCGCTTCAGAGGTGATTTTGAATAAGCCGATATCGCCCGTGCGGCCCACATGGGTACCACCACACAATTCGATTGAGAAATCGCCCATAGTCACAACACGCACTTGTGAGTCATATTTCTCACCAAAGAGCGCCATTGCGCCTTTCTCTTTGGCTTCGTCTATACCCATTTCAGCGGTTTTCAGCTCATGGTTACGACGAATTTGAGTGTTAACTAATTCTTCAACTTCTTTTAATTCAGCGGCTTTTACACCTTCGAAATGGGAGAAGTCAAAACGCAAACGCTCAGGATCAACCAAAGAACCTTTTTGCGAAACGTGAGTACCGAGCACTTGACGCAGTGCGGCATGCAGTAAATGCGTAACAGAATGGTTTAACTGAGTACGATGACGCAGTTTTTTGTCGACTTTTGCTTCAACAACTTGGCCAACACTCAAATTACCGGTAGCTAACACACCTTGATGGCCCGTTGCTTGACCGTACTTTTGCGTGTCGTTAACGGTAAATTCAACACCGTTCGCAACCAGTTGGCCTTTATCGCCCACTTGACCGCCAGATTCAGCGTAGAAAGGTGTTACATCGAGCACGACAACGGCTTCATCACCCGCTTTGATCGCCGTCACTGACTCACCGTTTTGATAAATCGCAGTGATCTTAGCTTGGCCCGCTAATTCGGTGTAACCACTGAATGCGGTTTCAGCATCGATTTTCAGCGCCGCGTTATAATCTGCACCAAAGTTACCGGCAGCTTGTGCGCGGCTACGTTGTTCAGCCATGGCGACTTCGAAACCTGCTTCATCGACAATGATGTTGCGCTCACGACAGACGTCAGCGGTCAAATCCATTGGGAAGCCATAGGTGTCATACAGCTTAAATACGGTTTCGCCGTCTAAAGTGTCACCTTTCAACTCGTTTAAGGCCGTATCTAAAATACCTAAACCACGTTCAAGCGTCCGGGCAAATTGCTCTTCTTCCGCTTTCAGTGCTTTTTCAACAATCGCTTGAGTCTCAGCTAACCCTTTGGCCGCATCGCCCATCACCGCAATCAAAGACGGAACCAGCTTATAGAAGAAAGCTTCGGTCGCGCCTAACTTGTTACCATGGCGAACTGCGCGGCGGATAATGCGGCGCAGCACATAACCGCGGCCTTCATTCGACGGCATAACGCCATCGGCAATAAGGAATGCACATGAACGGATATGGTCGGCAATGACGCGCAGTGACTTCTCGGTTAAATCGCTTACGCCGATGATTTCAGCCGCTTTGGCGATCAGCGTACGGAAGATATCGATTTCGTAGTTAGAATGTACGCCTTGCATAATCGCGGCAATACGCTCAATCCCCATGCCAGTAT
CAACCGATGGCTTAGGCAAAGGTAGCATTTCACCCGATGCTTGACGGTTGTACTGCATGAACACTATGTTCCAGATTTCGATGAAACGGTCGCCATCTTCTTCGGGACTGCCAGGACGGCCGCCCCAAATATGGTCGCCATGGTCATAGAAAATCTCAGAACATGGGCCGCAAGGACCGGTATCACCCATTTGCCAGAAGTTATCTGACGCATAAGCAGCACCTTTGTTGTCGCCGATACGGATAATGTTCTCGGCAGCAACACCAATCTTTTTATTCCAGATTTCAAAGGCTTCATCATCTGTTTGATAAATAGTGACACACAGACGTTCTTTCGGCAGCTTTAAAGTTTCAGTTAAGAAAGTCCAGCCAAAGCGAATCGCTTCTTCTTTGAAATAATCACCAAAACTGAAGTTACCCAGCATTTCGAAAAAGGTGTGATGACGTGCTGTATAACCGACGTTATCTAAATCGTTATGCTTACCACCGGCGCGTACACAACGTTGAGCAGTGGTCGCACGCGTATAATTGCGTTTGTCCATGCCAAGAAACACGTCCTTGAACTGGTTCATGCCTGCATTGGTAAATAAAAGTGTGGGATCGTTGCCTGGCACTAATGAACTACTGTCCACAACTTGATGACCATTGCTGCGGAAAAATTCGAGGAAAGCGCTTCTAAGCTCTGCTGTAGTTTGATACATGAAATCATCCTGAATTGGGTAAGCATTACCAGCTACGCCTTAGGCTAGCCGTATTTCTATTTGCCCAGCATTATAAGCAGACTGAGTAAGGACGACCAGCCTTGATTGGTGATATGCACATCAGGAATTTAATATTTTAGCGTGAATAGCCCATACCAGAGTAGCTAGAGGCAAAACAGCAATAATAAAGGACAATGACAGACTCTGCGGGTTAGGTGCAAAAAGTACAGTATTTGAGCAAGTGGGTAAAGTGTCGGGCACGAAAGCAACAAAGCCCATTTGATGACATTTTAGGGGAGATTTTCTGAGGACTTGTTAAAGAAGCTTTTGCAGTCTTAAGGTTACATCGCAATCGATGAGTTGAGGCGATTAATAATCGTTGTCGAGATCATTGGGATCGGAATCTAACGCATAAGTGATTTGATCGTAGCTAAAGCCTTGGCCCATTAAAAACCGCACTCGTTTGGCCTTTTCCTTCGCGATGAGATCCCGGGCTTTAGAGCCCTTCACTTCTGTTACGCGGGGAACGCCATACTTTTTGGTGGCCTTATCTTTTGCAAGCTCAAACCAGTCACAATCACTATTGGTTATCGCCGCTTCAATGCAATCTTTAGACAGGCCTTTTTGCGCTATCGCTTGGCGAATGCGGATGGCACCGTGACCTCGACTAATGTAGCTACGGACCAACAATTCGGCATAACGATTGTCATTGATAAAACCGGAGGATTCACATGCATCGAGCACAGGTTCGATATCGTTAAGTTCGAAGCCTTTTTCGAGCAATTTATCGCGGATTTGTAAGCGGGAATAATCGCGGCGCGCTAACAGCGCGACCGCAATTTGACGGGCTGACTGACTCAGAATACTTCGCCTGTTTCTAAATCAACATTATCTTCCGTGCTAGCATCATCTTTAGCAGCCAGTGCACTAGGATTGCTCAAAAGTAACTCACGTAATGTCTTGTCAATTTCAGTAGCAATGGCTGGGTTTTCAGTCAAATATTTACCCGCATTAGCACGACCTTGACCGATTTTATCACCTTTATAACTGTACCAAGCACCCGCTTTTTCAATCAACTTATGGGCTACGCCTAAGTCAACTAACTCACCAGTACGGTTAATACCTTGGCCGTAAAGGATTTGGAATTCGGCTTGCTTGAACGGTGCTGCAACCTTGTTTTTCACCACTTTAACGCGAGTTTCGTTACCGACAACTTCATCGCCGTCTTTAATGGCACCAGTACGACGAATGTCTAAACGAACAGAAGCATAGAACTTCAGCGCGTTACCACCGGTTGTCG
TTTCTGGGTTACCGAACATCACACCAATCTTCATCCGGATTTGGTTGATGAAGATAAGTAAGGTGTTAGATTGCTTTAAGTTACCAGCAAGTTTACGCATAGCTTGGCTCATCATACGTGCCGCTAGGCCCATGTGAGAATCACCAATTTCGCCTTCGATTTCAGCTTTAGGCGTTAATGCCGCCACTGAGTCGACGACTATAACGTCAACAGCGCCTGAGCGAGTTAATGCATCACAAATCTCAAGCGCTTGCTCGCCGGTATCTGGTTGTGAACACAGAAGGTTATCAATATCTACGCCCAATTTTTTAGCATAGATAGGGTCGAGTGCATGCTCTGCATCGATAAAGGCACAGGTTTTACCTTCACGTTGCGCTGCGGCAATCACTTCTAAAGTCAGTGTCGTTTTACCTGATGATTCAGGACCATAGATCTCAACGATACGACCCATTGGCAAACCGCCAGCGCCTAAAGCAACGTCCAGAGACAGAGAACCGGTAGAAATGGTCTCAACATCCATAGAGCGATCTTCGCCCAGCTTCATGATGGAGCCTTTACCAAATTGTTTCTCAATTTGGATCAATACCGCAGCAAGTGCTTTCTCTTTATTTGGATCGACCTTCATTCCAGTTCCCTCATCAAGACAAGCCTTAAGCTGTCAAAAGTAATATTCAATTTACAAACGGGTAATTCAAAAAGCCGCGCTTTTTGTCTTAATTCTGCTGACTAGTATACTGTACAATCATACAGTATCAAGACCTGTGACTGAATTATTTTTAGCTAATGGGATTTTTTAATATGGCCAAGGTTTTAGCGAATATCAAATCGGATATTGCATCCCTTTGACGAATATCCGTCATCAAGGGCTCAATTTTCGGTAAATTATTGCTAAGATTGGCGCAGCAAATTTTGCGTGGTCTTTTTAGCTGAAAATTGTACAGCAGTCTTCTGCTTCCATAGCGCTTAATCCCACATAACCACGGCAGTCACAACGGACGATAACGATTTAATGAATGTAATTGATACCGATGATTTAGAAAAACATACTCCTATGATGCGTCAGTACTTGACCATGAAAGCAGAACATCACGACATGTTGCTGTTTTATCGCATGGGTGACTTCTACGAGTTGTTTTATGACGATGCTAAACGTGCTTCTGAGCTATTAGGCATTTCACTCACAGCACGCGGTAAAAGTGGTGGCGATCCTATCCCCATGGCGGGACTGCCTTACCATGCTGTGGAAGGCTATCTTGCGAAATTAGTTCAAATTGGTCAATCGGTTGCGATATGTGAACAAGTTGGCGACCCTGCAACCTCTAAAGGGCCAGTTGAGCGTAAAGTGGTGCGTATCGTCACGCCTGGCACCTTAACCGACGAAGCCCTGCTGCAGGAACGCCAAGACAATCTATTGGCAGCCGTTTATCAAGGTAAAGTCGGTTTTGGTTATGCCACTCTGGATGTTTCATCCGGACGTTTTGTGATTGCTGAGCTCGACACACGAGAGTCATTAGAAGCTGAATTGCAACGCACTAATCCCGTTGAAATTCTCTACAGTGAAGACTTTGGCGAACTAGGTCTACTAAACGGTTTTAAAGGTAAACGTCGTCGCCCTGAATGGGAATTTGATTACGACACTAGCATCAAGTTACTACTGGCTCAGTTTGGCACGAAAGACTTGCATGGTTTTGGTATTGCAGATGCACGTTTATCACTGCAGGCTGCAGGTTGCTTGATGCAATACGTCAAAGACACACAGCGCACAGCCCTGCCCCATATCAATGCCATTACCCGCTTTAATCAAACCGATAGCATAGTGCTTGATGCGGCAACGCGCCGAAATCTAGAACTCACTCAAAACCTTGCCGGTGGCCGCGATAATACCTTGGCTGCAGTATTGGATAACACGGCGACGCCTATGGGCAGCCGCATGTTGCAACGCTGGATCCATCAGCCACTGCGAGATCCTAAGCACATCAAAGCACGCCAACAGGCGGTGG
CTGAATTACTCGATACCGACGCCCACGAAGGTCTGCATGAGCAATTAAAGGCACTTGGCGATATCGAACGTATCATGGCAAGACTCGCGCTGCGTACCGCTCGCCCAAGGGACTTTGCCCGTTTACGTCAAGCACTGGGCTTACTACCTGAATTACAACAGAGTTTAAGCACACTGAGCGCGCCGCACACAACTCAATTACGGCAACACTTAGGTGAGTTCCCCGCTGAACAAGCCTTGCTTGAGCGCGCGATAGTCGATAATCCTCCCATGCTTATCCGCGATGGCGGTGTGATCCGTGAAGGCTACAACAGCGAATTAGATGAATGGCGCGGTCTTAGCGAAGGTGCGAGTGATTACTTAGTACAACTCGAAGCCAGAGAAAAAGAACGTACAGGTATCAACACACTTAAAGTCGGCTATAACCGTGTACACGGCTATTACATTGAAGTGAGCCGCTTGCAATCCTCGCAGGTGCCGCTCAATTATCAACGACGTCAAACCCTTAAGAATATGGAGCGTTATATCACGCCCGAACTTAAGGAGTACGAAGAAAAAGTGCTCTCGAGCCAAGGTAAAGCGCTGGCACTCGAAAAGCAATTATGGGAACAGTTATTCGATCTTATTCTGCCAAAATTGCATGAATTACAAGCTTTTGCTCGAGCGGCGGCTGAGCTTGATGTGTTGAGTAACTTTGCCGAACGCGCCGAAACCTTAAGCTACACTTGCCCAGAGCTGAGCCAAGATATCGGCGTACAGATAGAGGCAGGTCGCCATCCCGTGGTGGAGCGTGTGAGTCAAACACCGTTTATCGCTAACCCAGTGACCTTGCACAATCAAAGACGTATGTTGATTGTCACCGGACCCAACATGGGCGGTAAATCGACCTATATGCGTCAAGTCGCCTTGATTACGCTGATGGCCCATATTGGTTGTTTTGTGCCAGCAGATCGTGCCCTGATTGGCCCGATTGATCGTATATTTACCCGTATTGGCGCATCAGACGATCTGGCCTCTGGCCGCTCAACCTTTATGGTCGAAATGACTGAAACTGCCAACATTCTGCACAATGCCACTGCCAGTAGTTTAGTGTTAATGGATGAAATCGGCCGTGGAACATCCACCTATGATGGTTTATCGCTAGCTTGGTCGGCGGCGGAATATTTAGCCCAGCAAATCGGGGCTATGACCCTATTCGCCACCCATTATTTTGAGCTAACTCAATTACCTGACTTAATGGCAGGTGTTTACAATGTGCACCTTGATGCCATTGAGCATGAAGATACCATCGCCTTTATGCATGCGGTACAGGAAGGTGCAGCCAGTAAAAGCTACGGCTTACAAGTTGCGGCGCTTGCCGGCGTGCCAAACAAAGTGATTAAAGCAGCGAAACATAAATTGCAGCAATTGGAGAGTCGCGATCATCAACCAGAAGGAACGAGGACGCCCATTCAAAGCTTACTCGCTTTACCTGAACCGGTTGAAAATCCAGCTTTGACTAAATTAAGTAGCATTAATCCCGATAACTTAACGCCAAAGCAGGCACTTGATTTGCTCTATGAGCTGAAACGTCTGAGCTAAAGCTCTGTGAACGGGTGATATTCGCCAAGCGATATGATCAGCAATAAAGGATTAGAAACAACAACGCCCACTCTTTAGTGGGCGTTTTACTTTTATCGACTAGACCTAAGCACTAGACTAAATCACTAGACCTTAGCTAGGAGCTAGGCATAGAGATCAACGAACAACTGATCTTAGTTTCTCATTTATTAGTTTCGAAATAATGCTTCAACAGATAAACCTTGAGATCCTAATAAGTCGCGTAAACGTTTCAATGCTTCCACTTGGATTTGACGTACACGTTCACGAGTCAAACCAATCTCAGCACCCACATCTTCGAGTGTAGAAGGTTCATAACCTAACAGACCAAAACGGCGGGCTAACACTTCTCTTTGTTTAGTGTTTAACTCATTGAGCCATTTGACCACTGAATTAGAAATA
TCTTCATCTTGCACTTTATAGTCAGGGCCAACGTTATCGTCATCGGCTAAAATATCCAGCAGCGCCTTGTCATTATCGCCCCCCAAAGGCGTATCGACAGAGGTGATTTTCTCATTCAGCTTAAGCATACGGCTGACATCAGCACTCGACACTTGTAGTTTTTCGGCAATTTCTTCGGCCGTAGGTTCGTGATCGAGTTTTTGTGCTAACTCGCGAGCGGTGCGCAAGTAAACATTGAGTTCTTTAACAACATGAATAGGCAAACGAATTGTACGAGTCTGATTCATAATGGCGCGCTCAATGGTCTGTCTGATCCACCAAGTCGCGTAGGTTGAAAAACGGAAACCTCTTTCTGGGTCAAACTTTTCAACGGCCCGAATTAAGCCTAAGTTGCCTTCTTCGATAAGATCAAGCAGTGCTAAGCCGCGATTGTTGTAACGACGGGCAATCTTTACGACCAGACGTAAATTACTTTCGATCATGCGATTACGGGATTTTGCACAGCCTTTTAAGGCTTTACGGGAAAAGTAAACTTCTTCTTCTGCGCTAAGCAGTGGGGAAAAACCAATCTCACCTAGGTAAAGTTGAGTGGCATCAAGATTTTTTTGCAGGTCATCTTGAACCTGTTGTTCTATTCCTAGCTCTTGAACTAAATCGGATGCAACTTCTTCTTTTTCAAGGTCGAAGTCAACGGTGTCTACGGATAAATCTACTAGTGCTTCTGCGGCAGTGCTTTTTATGCGGCTCATGATTAAATCTCCCAAACCTGATTAAACTGCTGAATTAGTGCAATTTCCTATCCTCCATTTGCCAACAAGGCCCCAAGCAATCATTGTTTGGGTAAATAAGTAAGTGGGTTTACAGACTGTCCATGGTAACGAATTTCAAAATGTAACATTACCCGATTGGCACCTGTACTGCCCATTTTTGCAACTGTCTGCCCTGCGAGGACATGTTGCTTTTCTTTGACTAAGATCTTGTCCGTATGAGCATAAGCACTAAGGTAATCGTCACTATGTTTAATAATCACTAAATTACCGTAACCCCTAAGAGCACTTCCTGCGTATACCACCCGCCCATCTGCAGCAGCTTTGATGATATCTCCCCGATTCCCCGCGATCTTAATGCCTTTATTTCCCTGCTCATTGGCAGAGTAAGTCCCGATTAACTTCCCTCTTACTGGCCACTCCCACTGACTCACACTGGTTGGTAGTGTCGAAGTCGGTGGAACCATGATTGAGTTAACATTTTGTTGGGAACTTGTTACAAGGTACGCGGGCTTAGCTTTCTGATCAAGTGTTTTTTTCAGATCATTACTTGACTTAGCGCCGGTTTGAGAAACTTGATTATCTGTTTGTTTTTTATAATTTTTACTAGAATTATTACTCGAGGAGCTCGTTTTCATTCCATCTAAAGTTGTAGATTTGGACTGACTACTCGACTTAGCATTCGTTAAATATAAAATCTGTCCGGGGTAAATTGTATAAGGGTTATCTAATTGATTAAGTTTAGCTATTTCAGAAAAATCCTTTCCTGCTCCCCAAGAAATAGAGTAAAGCGTGTCGCCTTTCTTAACTTTATATGAATTCGATTTAATGTGGCCTTTGTTGTTTTTAGAGTAGCTATGGGAAATATTTTCGACGGGCGCGGGTCGGCTGGCCTGAAAGCTACAGCCTGCAAGCAGAAATATAAAGCAGAGGTTTAAGACTAAACCCGCATTCAACAATAAAAACTCCTTGGTGAGGCTGTATTACTGTTTATGATGAGAAAATAATGCCCTCGTTGCAAGACAAGCTGTTCAGTGGAGACAGTAGCGCAACACAAGGTTAAGCTAACTCTCCATTAATCAAAGGGACAAATTTAACCGTTTCAATTGTTTCCGAACTGAAGCGGTCCGCAGTACGAGTGATTCGCATCAATTGCTGGGTATCTTCGCCGACTGGCAGAACTAACACTCCGCCCTCGGCTAATTGAGACAGTAAGGCCTCAGGTATAGTCGA
AGCGGCAGCGGTCACCATAATGGCATCGAACGGGCTACGATTCGCCCAACCTAACCAGCCATCACCGTATTTAAACGACACATTATGCAAATCTAATCGTTTTAAGCGCTGTCTCGCTTGAATTTGTAAGCCTTTTATTCGCTCGATAGTACAAAGCTGCGGCACTAATTGCGCCAAAATTGCCGCCTGATAACCCGATCCCGTGCCCACTTCTAATACCCGCTGGGGCATCTTATGCAGTAAGAGTTCGGTCATACGGGCGACGATATAAGGCTGCGAGATGGTTTGCCCTTGGCCAATCGGTAAAGCGGTATTTTCATAGGCTTTATGGGCTAGCGCATTATCGAGAAACATTTCCCGCGGCGTATGCGATATCGCCTTTAACACGGCTTGATTGCGGATCCCCGCATCATGAAGCTTTTTAGCTAAGTTCACCGCCGATGTTAAGGCAACTCGAGTCATATTTTATCTACCCAATTTTGCAATGCCGTTAGTTGTCCGTAGGCAGTTAAATCAACCGTCAAAGGCGTAATTGACACATAGCCATTGGCGATAGCATGAAAATCAGTGCCCTCTGTGGCGTCCTGCTCTTGTCCTGGTGGGCCGAGCCAGAAGATCTCACGTCCAGCGGGATCTTGCGTTCGTACTATGCCTTCAGCCTTATGACGAGCACCAAGGCGAGTCACTTTAATGCCTTTGATTTGATCAAGCGGCAAGTCTGGCACATTTACATTTAAAATCTGATCCTTTGCCAAAGGCTGGGCCAACAAGCCTTGGACTATTCGCCGAGCGTAAACAGCCGCCGATTGATAATGTTCAAATTTCCGGCCATTGAGCGAAATCGCGACCGCAGGAAAACCTAAAAAGCGCCCTTCCATCGCGGCAGCAACAGTCCCAGAATACAAGGTATCATCGCCCATGTTCGCGCCCGCATTGATACCAGACACCACCATATCGGGTTCGCCATCGTACAATTCTCGGATAGCCAAATGCACACAATCGGTCGGTGTACCGCGAACAGAAATGTAACCGTTATCTAACCTATTAATTCTTAATGGATTAGTCAAGGTTAATGAATTGCTCGCACCAGAACAATTACGATCTGGCCCCACTGTCAGCACAGTCGCAATTTCGGTCAACGCCTCAGTTAAGGCTTTGATCCCCGGAGCATTAACCCCATCATCGTTGCTCACTAAGATGCGGATCAT
Protein sequences of DBSCAN-SWA_3 >LR134321|3590721:3601854|3593715_3594129_-|VEF27010.1|DBSCAN-SWA MLEKGFELNDIEPVLDACESSGFINDNRYAELLVRSYISRGHGAIRIRQAIAQKGLSKDCIEAAITNSDCDWFELAKDKATKKYGVPRVTEVKGSKARDLIAKEKAKRVRFLMGQGFSYDQITYALDSDPNDLDNDY >LR134321|3590721:3601854|3598414_3599395_-|VEF27013.1|DBSCAN-SWA MSRIKSTAAEALVDLSVDTVDFDLEKEEVASDLVQELGIEQQVQDDLQKNLDATQLYLGEIGFSPLLSAEEEVYFSRKALKGCAKSRNRMIESNLRLVVKIARRYNNRGLALLDLIEEGNLGLIRAVEKFDPERGFRFSTYATWWIRQTIERAIMNQTRTIRLPIHVVKELNVYLRTARELAQKLDHEPTAEEIAEKLQVSSADVSRMLKLNEKITSVDTPLGGDNDKALLDILADDDNVGPDYKVQDEDISNSVVKWLNELNTKQREVLARRFGLLGYEPSTLEDVGAEIGLTRERVRQIQVEALKRLRDLLGSQGLSVEALFRN >LR134321|3590721:3601854|3599475_3600372_-|VEF27014.1|DBSCAN-SWA MLNAGLVLNLCFIFLLAGCSFQASRPAPVENISHSYSKNNKGHIKSNSYKVKKGDTLYSISWGAGKDFSEIAKLNQLDNPYTIYPGQILYLTNAKSSSQSKSTTLDGMKTSSSSNNSSKNYKKQTDNQVSQTGAKSSNDLKKTLDQKAKPAYLVTSSQQNVNSIMVPPTSTLPTSVSQWEWPVRGKLIGTYSANEQGNKGIKIAGNRGDIIKAAADGRVVYAGSALRGYGNLVIIKHSDDYLSAYAHTDKILVKEKQHVLAGQTVAKMGSTGANRVMLHFEIRYHGQSVNPLTYLPKQ >LR134321|3590721:3601854|3594203_3595271_-|VEF27011.1|DBSCAN-SWA MKVDPNKEKALAAVLIQIEKQFGKGSIMKLGEDRSMDVETISTGSLSLDVALGAGGLPMGRIVEIYGPESSGKTTLTLEVIAAAQREGKTCAFIDAEHALDPIYAKKLGVDIDNLLCSQPDTGEQALEICDALTRSGAVDVIVVDSVAALTPKAEIEGEIGDSHMGLAARMMSQAMRKLAGNLKQSNTLLIFINQIRMKIGVMFGNPETTTGGNALKFYASVRLDIRRTGAIKDGDEVVGNETRVKVVKNKVAAPFKQAEFQILYGQGINRTGELVDLGVAHKLIEKAGAWYSYKGDKIGQGRANAGKYLTENPAIATEIDKTLRELLLSNPSALAAKDDASTEDNVDLETGEVF >LR134321|3590721:3601854|3601104_3601854_-|VEF27016.1|DBSCAN-SWA MIRILVSNDDGVNAPGIKALTEALTEIATVLTVGPDRNCSGASNSLTLTNPLRINRLDNGYISVRGTPTDCVHLAIRELYDGEPDMVVSGINAGANMGDDTLYSGTVAAAMEGRFLGFPAVAISLNGRKFEHYQSAAVYARRIVQGLLAQPLAKDQILNVNVPDLPLDQIKGIKVTRLGARHKAEGIVRTQDPAGREIFWLGPPGQEQDATEGTDFHAIANGYVSITPLTVDLTAYGQLTALQNWVDKI >LR134321|3590721:3601854|3595655_3598226_+|VEF27012.1|DBSCAN-SWA 
MNVIDTDDLEKHTPMMRQYLTMKAEHHDMLLFYRMGDFYELFYDDAKRASELLGISLTARGKSGGDPIPMAGLPYHAVEGYLAKLVQIGQSVAICEQVGDPATSKGPVERKVVRIVTPGTLTDEALLQERQDNLLAAVYQGKVGFGYATLDVSSGRFVIAELDTRESLEAELQRTNPVEILYSEDFGELGLLNGFKGKRRRPEWEFDYDTSIKLLLAQFGTKDLHGFGIADARLSLQAAGCLMQYVKDTQRTALPHINAITRFNQTDSIVLDAATRRNLELTQNLAGGRDNTLAAVLDNTATPMGSRMLQRWIHQPLRDPKHIKARQQAVAELLDTDAHEGLHEQLKALGDIERIMARLALRTARPRDFARLRQALGLLPELQQSLSTLSAPHTTQLRQHLGEFPAEQALLERAIVDNPPMLIRDGGVIREGYNSELDEWRGLSEGASDYLVQLEAREKERTGINTLKVGYNRVHGYYIEVSRLQSSQVPLNYQRRQTLKNMERYITPELKEYEEKVLSSQGKALALEKQLWEQLFDLILPKLHELQAFARAAAELDVLSNFAERAETLSYTCPELSQDIGVQIEAGRHPVVERVSQTPFIANPVTLHNQRRMLIVTGPNMGGKSTYMRQVALITLMAHIGCFVPADRALIGPIDRIFTRIGASDDLASGRSTFMVEMTETANILHNATASSLVLMDEIGRGTSTYDGLSLAWSAAEYLAQQIGAMTLFATHYFELTQLPDLMAGVYNVHLDAIEHEDTIAFMHAVQEGAASKSYGLQVAALAGVPNKVIKAAKHKLQQLESRDHQPEGTRTPIQSLLALPEPVENPALTKLSSINPDNLTPKQALDLLYELKRLS >LR134321|3590721:3601854|3590721_3593346_-|VEF27009.1|tRNA|DBSCAN-SWA MYQTTAELRSAFLEFFRSNGHQVVDSSSLVPGNDPTLLFTNAGMNQFKDVFLGMDKRNYTRATTAQRCVRAGGKHNDLDNVGYTARHHTFFEMLGNFSFGDYFKEEAIRFGWTFLTETLKLPKERLCVTIYQTDDEAFEIWNKKIGVAAENIIRIGDNKGAAYASDNFWQMGDTGPCGPCSEIFYDHGDHIWGGRPGSPEEDGDRFIEIWNIVFMQYNRQASGEMLPLPKPSVDTGMGIERIAAIMQGVHSNYEIDIFRTLIAKAAEIIGVSDLTEKSLRVIADHIRSCAFLIADGVMPSNEGRGYVLRRIIRRAVRHGNKLGATEAFFYKLVPSLIAVMGDAAKGLAETQAIVEKALKAEEEQFARTLERGLGILDTALNELKGDTLDGETVFKLYDTYGFPMDLTADVCRERNIIVDEAGFEVAMAEQRSRAQAAGNFGADYNAALKIDAETAFSGYTELAGQAKITAIYQNGESVTAIKAGDEAVVVLDVTPFYAESGGQVGDKGQLVANGVEFTVNDTQKYGQATGHQGVLATGNLSVGQVVEAKVDKKLRHRTQLNHSVTHLLHAALRQVLGTHVSQKGSLVDPERLRFDFSHFEGVKAAELKEVEELVNTQIRRNHELKTAEMGIDEAKEKGAMALFGEKYDSQVRVVTMGDFSIELCGGTHVGRTGDIGLFKITSEAGIAAGVRRIEAVTGAAAMAYVAQQQAELEEAAALLKGDANSVVAKLKAQLDKMKQLEKEMAQLKDKLAAAASADLVGDAVVVNGVNVLIKKLDGVEASSLRGLQDELKQKLKSAIIVLGTAQEGKVNLIAGVSNDLIGKVKAGELVAMVAAQVGGKGGGRPDMAQAGGSQPENLDAALAQVLPWITERLA >LR134321|3590721:3601854|3600472_3601108_-|VEF27015.1|DBSCAN-SWA 
MTRVALTSAVNLAKKLHDAGIRNQAVLKAISHTPREMFLDNALAHKAYENTALPIGQGQTISQPYIVARMTELLLHKMPQRVLEVGTGSGYQAAILAQLVPQLCTIERIKGLQIQARQRLKRLDLHNVSFKYGDGWLGWANRSPFDAIMVTAAASTIPEALLSQLAEGGVLVLPVGEDTQQLMRITRTADRFSSETIETVKFVPLINGELA |
8 | uncultured_Mediterranean_phage(33.33%) | tRNA | NA |
Homologous phage analysis in the prophage region. Bacterial proteins shown in color are those whose annotation matches specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
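The protein records listed for each region share a pipe-delimited header, for example `>LR134321|3590721:3601854|3590721_3593346_-|VEF27009.1|tRNA|DBSCAN-SWA`. The sketch below shows one way to split such a flattened dump back into records. The field meanings (contig, region span, gene start/end/strand, protein ID, optional keyword tags, tool tag) are inferred from the headers on this page and should be treated as assumptions; `ProteinRecord` and `parse_protein_records` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass
class ProteinRecord:
    contig: str
    region_start: int
    region_end: int
    gene_start: int
    gene_end: int
    strand: str
    protein_id: str
    keywords: list   # optional tags such as 'tRNA' or 'protease'
    sequence: str


def parse_protein_records(text):
    """Split a flattened FASTA-like dump into ProteinRecord objects.

    Headers with fewer than five '|' fields (e.g. the nucleotide header
    '>contig|start:end|DBSCAN-SWA') are skipped.
    """
    records = []
    for chunk in text.split(">")[1:]:
        parts = chunk.split()
        if not parts:
            continue
        fields = parts[0].split("|")
        if len(fields) < 5:
            continue
        contig, region, gene, protein_id = fields[:4]
        region_start, region_end = (int(x) for x in region.split(":"))
        gene_start, gene_end, strand = gene.split("_")
        # Re-join wrapped sequence lines and drop a trailing table pipe.
        sequence = "".join(parts[1:]).rstrip("|")
        records.append(ProteinRecord(
            contig, region_start, region_end,
            int(gene_start), int(gene_end), strand,
            protein_id, fields[4:-1], sequence,
        ))
    return records


# Headers taken from this page; sequences truncated to five residues for brevity.
sample = (
    ">LR134321|2477542:2489092|2484609_2484891_-|VEF26083.1|tRNA|DBSCAN-SWA MILHH "
    ">LR134321|2477542:2489092|2482311_2482686_+|VEF26080.1|DBSCAN-SWA MNNLL |"
)
for rec in parse_protein_records(sample):
    print(rec.protein_id, rec.strand, rec.keywords, rec.sequence)
```

Splitting on `>` rather than on line breaks keeps the parser tolerant of headers and sequences that were wrapped onto separate lines during extraction.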
DBSCAN-SWA_4 |
3637282 : 3644885
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >LR134321|3637282:3644885|DBSCAN-SWA ACTATAACAACTGTTGTTCAAGTTCTTGCAGAACATTGACCATTTCAAGCAGACTTAAAGCAGCTTCGCCGCCTTTGTTACCCGCTTTGGTACCTGAACGTTCAATAGCTTGTTCAATGGTATCTGTGGTCAAAACACCGAAAGCAACAGGCAGATCAAATTCTAAAGCAACTTGAGCTAGACCTTTATTACATTCGCCTGCAACAAAATCAAAATGAGGAGTACCACCACGGATCACCGCACCTAATGCGATAATACCGTCGAATTTACCACTGGCAGCTACGCGACGCGCAGCTAGCGGCAACTCAACCGCACCTGGTACACGGACAACAGTGATATTGTCGTCGCTGACTTGGCCAAAACGTTTCAGCGTGTCAAGTGCACCTTCAAGCAAGCTCTCAACTAAAAAGCTGTTGAAACGCGAAATTACAATCGCAACTTTGGCATTTTTCGCTTCGATATTACCTTGAACTACGTTCATTATCTTACCTAATTTAGCTAAAGCACCCAGCTCGGGGCGAAAAGTGGCGACATGATATCACAGCCAATTGCCCAGAGCCCTATGCAAATTATCAGAAATTGTTAATTTTACTTCAGTGTAACTTTGGGTTTTAGTCCGCCACATACTCAGTCACTTCAAGGCCAAAGCCCGAAAGTGAATGGTAACGTTTAGGCGAGCTGAGCAGGCGCATCTTAGTCACGCCAAGGCTGGCGAGAATTTGCGACCCCACACCCACACGGCGCGACGTCCCCTGCCATTTTGCAGAAGCAGGCGCTTGGCCTTGATCTTCAGCTTCAAAGGCTTTCACCTTAGAGAGGATTTCACAGGGATGTTCTTGATTCCCTAATAAAACCAATACACCACCTTCTGCAGAAATACGCTCCATCGCCTTTTCGAGTGGCCAGCTACGTTGCTGATCGCGCTCTGAATGGAGTAAATCGTTGAAGGTGTTTTGCAGATGCACGCGCACCAAACAATCGCTCTTCACCTCACCTTTGACTAAGGCAAAATGCAGTTGATTGTCGATAGTGTCTCTGAAAGTCACCATGTCGAACTCGCCGAAACGGGTCGGTAGTTTGCATTTAGCTTCACGCACAACCGTGGTTTCTTTGGTGTTGCGATACTCGATCAATGCCGCGATGGTGCCGATTTTGATACCGTGCAACTCGGAGAAAATCTCTAAATCTGGGCGGCGTGCCATAGTGCCGTCTTCGTTCAAAATCTCAACGATAACGCCCGATGGCTCAAGTCCGGCAAGACGGGCTAAATCACAACCGGCTTCAGTGTGGCCTGCGCGGGTTAATACGCCGCCGTCCTGTGCCATTAACGGGAAGATATGCCCAGGTTGCACTAAATCAGACGCTTTAGCCTCTTTAGCGACGGCCGCTTTTACCGTTACCGCGCGGTCGTGGGCCGAAATACCGGTAGTCACGCCTTCGGCTGCTTCAATAGAAACCGTAAAGTTAGTCGAGAACTGGGCGTTGTTATTCGTGACCATTAAGGGCAGATTTAACTGCTGGCAACGGGCTTTGGTCATGGTCTGGCAAATGAGTCCACGGCCATATTTCGCCATAAAGTTAATCGCTTCTGGCGTGACCATTTCGGCCGCCATGATCAAATCACCTTCGTTTTCTCTGTCTTCGTCATCCATCAAAATAACCATTTTGCCTTGACGAATATCTTCGATGATCTCTTCTATACTGTGCAGCGCCATTGTAGGACCTTATGATTTTGCTGTTATTATGAAATTCTATACCTAAATAACCTAAGCTGCCGATAAGCTTAAGTTAAATAATCTGCTCCTGTGCACTAACGCACAAAGCCGGCACGGGCTAACATTTCCATGGTGACACCACCGCTCTTGCTGTCTTTGCCATCATAATTCATCAAGCGTTCAAGATAACGGGCGATTAAATCCACTTCGA
TATTCACCTTATCGCCCGCTTGCAGGTCGATTAACGTGGTTTCCCCTGCGGTATGCGGCACTATGGTCAGTCGGAATCGACTGCCGTCCACTTCGTTAACCGTCAAACTCACGCCATCAATGGTGATGGAACCTTTATGGGCAATGTAACGCCCAAGCTCTGCAGGTGCCGCTAACCAAAACTCAATAGCTTGTCCACGCACTAGGCGTTGCTCAACCATAGCAATACCGTCTACATGGCCGCTGACCATGTGCCCGCCTAAGCGTGTGGTTGGGGTGACGGCTTTTTCAAGATTCACCTTAGTGCCCACTTTGTAGTGAGCAAAGCCGGTTAGACTCACGGTTTCAGCAGAGACATCAGCCACATAACCGTCGGCTAATTGCTGCACAACCGTTAAGCACACACCGTTAGTGGCAATACTGTCGCCTAAACGCACATCGTTTAAATCCAGTTTGCCGCTCGCCACGGTAAGACGAATATCATCGCCTTTACGCTCCAGTTTGCGCAGGGTGCCAACGGCTTCAATAATCCCAGTAAACATGGTTAACTCACTTAATTAAATAAAGAAGTATCAAGCTTAACGCTCAAGATTAAACGGGTATCTGGCCCTAATTTGCGCTCATCGACCAACGTTAATGCAGGAATTTGCGCCATGGTTTGATAATCGGGTAGCTGCAATAGATTACGCCCTTCCCCACCGAGGATTTTCATCGCTTGGTACAGCACTAATTCATCGGCCAGCCCCTCTGCTAAAAAAGCGCCCGCTAAGGTTGCACCCGCTTCCACCAGCACGTGATTACAACTTTGGCCTAAATAGCTTAATAGTTCAGGTAAGGATACGCGACCTGCAATCGCAGGTAACACGAGGCAACTCACATGGGAGGGCAATTGTGCTTGAAACTCTGCGGGATAAGCCACGGTCGAGACCAAGAGTATCGGCGATACGATAGCCAAACAGGCAGCCGATAACGGCAAACGGGCGCGACTATCGAGTATCACTCTTAATGGCTGCAGTAATTGCGCCTCGGTTACGCAGTCTTTTATGTCGCCAAGTTCTTGATATCGCACATTGAGCGAGGGATCGTCAGCCAGTATGGTTTCAATCCCAGTCACCAACGCGCAGGAACGTAAGCGTAAGCGCAGCACGTCGCGGCGAGATTCAGGCCCGGTAATCCATTTAGACACACCGTTAGACAGCGCGGTTTTACCATCGAGACTCGCAGCCAATTTGACTGTGACTCTGGGTAAACCCGATTCCATGCGCTTCATAAAGCCTAAATTCAAGGCATAGGCTTCATCGCGATGTAATCCCACATCCACTTCAATGCCGGCGTCGCGCAGCATTTGAATACCGCGACCAGCCACTTGCGGATTTGGATCTTCAACCGCCACCACCACACGCTTAACGCCAATATTGATCAGTGCCAAGGCGCAGGGCGGCGTGCGGCCATAATGGCTGCAGGGCTCTAAGGTGACATAGGCTGTTGCGCCGCGGGCGTGTTCGCCAGCAATGCGTAGTGCATGCACTTCGGCGTGGGGCTCACCCACTTTTTGATGATAACCTTCGCCGACAATCTGATTATCTTTAACGATAACGCAGCCCACACTGGGATTGGGTCTCGTGGTATAAAAACCTTTGCGCGCCAATTGAATCGCGCGGCTCATCATTTGGGTATCGAGTACGGACCAACTTATGTTCAAGCTCATGTTCAACCCTGTTGTTAAAGGTTTCGGCTTTCACCAGTCTTGTAAATCTGTGTTTAAAACCTGTCTTGCAGAAGCTATTAATCAAAAGCCTCTTTTAAAGCTAGCTCAACCAGAATACTGAACGACAGGCCACAAAAATCGAATAACTACTTTTGCAGTTTGGCGATGGCTTCACCAAATTCAGATACGTCTTCAAAGGCACGGTACACAGAAGCAAAACGTATATAGGCGACTTTATCTAGGCTCATCAATTGCTCCATCATTAAGTTACCGACCATTTCTGATGGCACTTC
TCGCTCGCCAGTGGCGCGTAGGGTCGATTTGATTTTACTGAGTGCTTGCTCGATTTCATCCATAGACACAGGGCGTTTTTCAACGGCGCGCAACATGCCACCTTGGAGTTTTTCTTCATCGAAGGGTTGGCGTGAACCATCGCGCTTGATCACCCTAGGCATGACTAATTCGGCGCCTTCGAAGGTGGTAAATCTTTCATGGCATTCGGTGCATTCTCGGCGACGGCGCACTTGATGGCCTTCCGCCACTAATCGAGAATCGATGACTTTAGTATCTGTAGCGCTGCAAAATGGACAATGCATTGAGCCTCCTGCCAGTGAATGACGTAAAGCCGTTGGCTTTACTATGTTATCTAAGGTTAGCCTGTTTGGTTTAAGTCTAAGATTTTAGCAATAAAAATGGCCGCTTAATGCGGCCATGGGATTTTATCCTATTTATGGGCTGAGGTTAACTACCGGATGTTAAGTACCTAATGTAAGTAACTGATGTAAGTACCTCATGTAAGTACCTAGTCAGCAAGCGATCACTCTCAGCCAGTCATCTACTTAAGCACATACGTACTTAAATTAGCCGTAAACAGGGAAACGTGCGCACAGTGCCAATACTTGGCCTTTAACACGCTCAATCACGGCAGGATTGTTAGCATCATCAAGGATGTCACAAATCCAGCCAGTCAGCTCTTTCGATTCCGCTTCTTTAAAACCGCGGCGCGTAATCGCTGGCGTACCGATACGCACACCTGAGGTCACAAAAGGTGAACGTGGATCGTTTGGCACTGAGTTTTTGTTCACGGTAATGTTAGCGCTACCTAAGGCGGCATCGGCTTCTTTACCCGTTAAGTCACGACCAATCAGGTCAACTAGCATTAAGTGGTTGCTGGTTCCGCCAGAGACGATTTTGTAACCGCGCTCAAGGAACACTTCAACCATTGCTTTAGCGTTGTTAACCACTTGCTGTTGGTACACTTTAAATTCTGGCTCTAGGGCTTCTTTGAACGCCACCGCTTTACCTGCGATAACGTGCATTAATGGGCCACCTTGACCGCCAGGGAATACTGCAGAGTTCAACTTTTTGTACAAGTCTTCATCATCAGCAGCAGACAGGATCACGCCACCACGGGGACCGGCTAAGGTTTTGTGGGTCGTAGATGTCACAACATGTGCGTGTGGTACTGGGTTTGGATAAACGCCCGCAGCAATAAGACCGGCAACGTGTGCCATATCGACAAATAAATAAGCACCGATTTTGTCTGCGATTTCACGCATTTTTGCCCAATCAACGATACCTGAGTAGGCAGAGAAACCGCCGATCATCATCTTAGGCTTGTGTTCTAAGGCAATACGTTCCATCTCGTCGTAGTCAATTTTGCCAGACTCGTCGATACCGTAAGGAATGATGTTGTACAGTTTGCCAGAGAAGTTAACCGGTGAACCGTGGGTCAAGTGACCACCATGAGCTAAGTTCATACCCAGTACGGTATCGCCAGGTTTTAACAATGCCATGTAAACAGCGCTGTTTGCCTGTGAACCTGAATGAGGTTGTACGTTAGCGTAAGTTGCGCCAAACAATTCTTTAGCACGTTCAATCGCTAAGGTTTCAACTACGTCCACATACTCACAACCACCGTAGTAACGCTTACCTGGATAGCCTTCAGCGTACTTATTGGTTAACTGTGAACCTTGCGCTTCCATCACGCGTGGACTGGTGTAGTTTTCAGAAGCAATCAGCTCAATATGCTCTTCTTGACGCAGAGTTTCGTTCTGAATTGCGTTAAACAGTTCCGGATCATAATCTGCGATATTCATATCTTTTTTCAGCATTGCTTACTCCAGCTGCCAATTCTTATAGGTAGGGTTGCGCGGTATTCTACTCGGAACGCGACCACAATCCCAGTATTTGAAACATGAAATTCCTAGCGTTTTAGCTTGAACTTGGCTCAATGTCGAGTTGAGTTCGGCGTGCAATCGAGTAGAATTAGCCGCATCATTAGTCACTTAAG
ATAGAGAAAAATGGCTCAGTTTGTTTACAGCATGCTGCGGGTGGGAAAAATTGTTCCGCCTAAGAAGCAGATCCTTAAAGATATTTCTTTAAGCTTTTTCCCCGGCGCCAAAATTGGTGTACTCGGCCTTAACGGTTCAGGTAAGTCGACACTACTGCGTATTATGGCGGGTATCGACACCGAAATTGAAGGTGAAGCGCGCCCAATGCCGGGATTGAAAATCGGTTACCTACCGCAGGAACCTAAACTCGATCCAACTCAAACCGTGCGTGAAGCGATTGAAGAAGCCGTCTCTGAGGCAAAAAATGCCCTAACTCGCCTCGATGCCGTTTATGCAGCCTACGCCGAGCCGGATGCAGATTTCGACGCCCTTGCCAAAGAGCAAGGTGAACTCGAAGCCATTATCCAAGCCCAGGATGCGCATAACTTAGATCATATCCTTGAGCGCGCAGCTAACGCCCTGCGTCTGCCGGATTGGGATGAAAAAATCGAAGTGTTATCCGGTGGTGAACGTCGCCGCGTAGCGATTTGTCGTTTGCTGCTTGAAAAGCCAGAAATGCTGTTACTGGACGAACCAACCAACCACTTGGATGCTGAATCTGTGGCGTGGCTTGAGCACTTTTTGCAGGAATATGCCGGTACTGTGGTCGCGATTACCCATGACAGATACTTCCTCGACAATGCCGCTGGCTGGATTTTAGAACTCGACCGCGGTGAAGGTATTCCGTGGGAAGGTAACTATTCTTCATGGCTTGAGCAGAAAGACGCCCGTCTGAAACAAGAATCTGCCACCGAAAGTGCTCGCCAAAAAACCATTGCCAAGGAATTGGAATGGGTACGCCAAGGTGCGAAAGGCCGTCAGTCTAAGGGCAAAGCCCGTATGGCACGCTTTGAAGAACTGAACACTAACGATTACCAAAAACGTAACGAAACCAACGAGCTGTTTATTCCGCCCGGACCACGTTTAGGTGACAAGGTTATTGAAGTAAATAACCTGACCAAGTCTTACGGTGACCGCGTACTCATCGACAACCTGTCGTTCTCTGTGCCTAAGGGCGCGATCGTCGGTATTATCGGTGCCAACGGCGCGGGTAAATCAACCCTGTTCCGCATGTTATCCGGCGCTGAAACGCCAGACAGCGGCACGATTGAACTGGGTGATACAGTGCAATTGGCATCAGTTGAGCAGTTCCGCGATTCGATGAATGATAAGAACACCATTTGGGAAGAAATCTCAGGTGGTCAAGACATCATGCGTATCAACAACATGGAAATCCCGAGCCGCGCCTATGTGGGCCGCTTTAACTTCCGTGGTGGCGATCAGCAAAAAGTCATCGGTACTTTATCTGGTGGTGAACGTAACCGAGTGCATTTAGCCAAACTGCTGCAAGCAGGCGGTAACGTCTTACTCCTCGACGAACCCACCAACGACTTAGACGTTGAAACCCTGCGCGCACTGGAAGAAGCGATTCTGGAATTCCCAGGTTGCGCTATGGTGATTTCGCATGACCGTTGGTTCCTCGACCGTATCGCGACCCACATTCTGGATTACCGTGACGAAGGCCAAGTGAACTTCTACGAAGGTAACTACACAGAATACTCGGCATGGTTGAAAAACACCTATGGCGCCGATGTGGTTGAGCCACACCGCTTGAAATACAAGCGCATGATTAAGTAG
Protein sequences of DBSCAN-SWA_4 >LR134321|3637282:3644885|3637282_3637762_-|VEF27042.1|DBSCAN-SWA MNVVQGNIEAKNAKVAIVISRFNSFLVESLLEGALDTLKRFGQVSDDNITVVRVPGAVELPLAARRVAASGKFDGIIALGAVIRGGTPHFDFVAGECNKGLAQVALEFDLPVAFGVLTTDTIEQAIERSGTKAGNKGGEAALSLLEMVNVLQELEQQLL >LR134321|3637282:3644885|3641774_3643028_-|VEF27047.1|DBSCAN-SWA MLKKDMNIADYDPELFNAIQNETLRQEEHIELIASENYTSPRVMEAQGSQLTNKYAEGYPGKRYYGGCEYVDVVETLAIERAKELFGATYANVQPHSGSQANSAVYMALLKPGDTVLGMNLAHGGHLTHGSPVNFSGKLYNIIPYGIDESGKIDYDEMERIALEHKPKMMIGGFSAYSGIVDWAKMREIADKIGAYLFVDMAHVAGLIAAGVYPNPVPHAHVVTSTTHKTLAGPRGGVILSAADDEDLYKKLNSAVFPGGQGGPLMHVIAGKAVAFKEALEPEFKVYQQQVVNNAKAMVEVFLERGYKIVSGGTSNHLMLVDLIGRDLTGKEADAALGSANITVNKNSVPNDPRSPFVTSGVRIGTPAITRRGFKEAESKELTGWICDILDDANNPAVIERVKGQVLALCARFPVYG >LR134321|3637282:3644885|3639091_3639748_-|VEF27044.1|DBSCAN-SWA MFTGIIEAVGTLRKLERKGDDIRLTVASGKLDLNDVRLGDSIATNGVCLTVVQQLADGYVADVSAETVSLTGFAHYKVGTKVNLEKAVTPTTRLGGHMVSGHVDGIAMVEQRLVRGQAIEFWLAAPAELGRYIAHKGSITIDGVSLTVNEVDGSRFRLTIVPHTAGETTLIDLQAGDKVNIEVDLIARYLERLMNYDGKDSKSGGVTMEMLARAGFVR >LR134321|3637282:3644885|3641060_3641510_-|VEF27046.1|DBSCAN-SWA MHCPFCSATDTKVIDSRLVAEGHQVRRRRECTECHERFTTFEGAELVMPRVIKRDGSRQPFDEEKLQGGMLRAVEKRPVSMDEIEQALSKIKSTLRATGEREVPSEMVGNLMMEQLMSLDKVAYIRFASVYRAFEDVSEFGEAIAKLQK >LR134321|3637282:3644885|3637892_3638996_-|VEF27043.1|DBSCAN-SWA MALHSIEEIIEDIRQGKMVILMDDEDRENEGDLIMAAEMVTPEAINFMAKYGRGLICQTMTKARCQQLNLPLMVTNNNAQFSTNFTVSIEAAEGVTTGISAHDRAVTVKAAVAKEAKASDLVQPGHIFPLMAQDGGVLTRAGHTEAGCDLARLAGLEPSGVIVEILNEDGTMARRPDLEIFSELHGIKIGTIAALIEYRNTKETTVVREAKCKLPTRFGEFDMVTFRDTIDNQLHFALVKGEVKSDCLVRVHLQNTFNDLLHSERDQQRSWPLEKAMERISAEGGVLVLLGNQEHPCEILSKVKAFEAEDQGQAPASAKWQGTSRRVGVGSQILASLGVTKMRLLSSPKRYHSLSGFGLEVTEYVAD >LR134321|3637282:3644885|3643217_3644885_+|VEF27048.1|DBSCAN-SWA 
MAQFVYSMLRVGKIVPPKKQILKDISLSFFPGAKIGVLGLNGSGKSTLLRIMAGIDTEIEGEARPMPGLKIGYLPQEPKLDPTQTVREAIEEAVSEAKNALTRLDAVYAAYAEPDADFDALAKEQGELEAIIQAQDAHNLDHILERAANALRLPDWDEKIEVLSGGERRRVAICRLLLEKPEMLLLDEPTNHLDAESVAWLEHFLQEYAGTVVAITHDRYFLDNAAGWILELDRGEGIPWEGNYSSWLEQKDARLKQESATESARQKTIAKELEWVRQGAKGRQSKGKARMARFEELNTNDYQKRNETNELFIPPGPRLGDKVIEVNNLTKSYGDRVLIDNLSFSVPKGAIVGIIGANGAGKSTLFRMLSGAETPDSGTIELGDTVQLASVEQFRDSMNDKNTIWEEISGGQDIMRINNMEIPSRAYVGRFNFRGGDQQKVIGTLSGGERNRVHLAKLLQAGGNVLLLDEPTNDLDVETLRALEEAILEFPGCAMVISHDRWFLDRIATHILDYRDEGQVNFYEGNYTEYSAWLKNTYGADVVEPHRLKYKRMIK >LR134321|3637282:3644885|3639759_3640914_-|VEF27045.1|DBSCAN-SWA MSLNISWSVLDTQMMSRAIQLARKGFYTTRPNPSVGCVIVKDNQIVGEGYHQKVGEPHAEVHALRIAGEHARGATAYVTLEPCSHYGRTPPCALALINIGVKRVVVAVEDPNPQVAGRGIQMLRDAGIEVDVGLHRDEAYALNLGFMKRMESGLPRVTVKLAASLDGKTALSNGVSKWITGPESRRDVLRLRLRSCALVTGIETILADDPSLNVRYQELGDIKDCVTEAQLLQPLRVILDSRARLPLSAACLAIVSPILLVSTVAYPAEFQAQLPSHVSCLVLPAIAGRVSLPELLSYLGQSCNHVLVEAGATLAGAFLAEGLADELVLYQAMKILGGEGRNLLQLPDYQTMAQIPALTLVDERKLGPDTRLILSVKLDTSLFN |
7 | Staphylococcus_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those that match specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_5 |
4788651 : 4803321
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >LR134321|4788651:4803321|DBSCAN-SWA GCTAAAGCCCTAGGCCTACTACTTGGTTACGTCCACTCTCTTTCGCCAAGTACAATTGCTGGTCCGCTGTTTCGATCAGTGTTTGCCAAGTCAAATCACTTTTGGGCAGCACGCTCGCAACACCCGCACTAATCGTCACCTCTTGGAAGCTCGACTCGAGATGTTCAAGATGCAAAGCTTGTACTGCATTCACTATTTTCTGTGCCGTATGAATGGCACCGGCAAGCTTGGTTTCTGGCAGAATACAAATAAACTCCTCCCCGCCATAACGTGCAACAAAGTCAGTCGCGCGCTGCAGAGAATCACTAATGGCTTTTGCAACTTGTCGCAAACATTGGTCGCCTTCCTGATGACCATAACGATCATTAAAACGTTTAAAAAAATCCACATCTAGCATCACCACAGATAAGGCTAACTCGTTGCGGCAGCAATGTTTCCATATTTCAGGAAGTCGCTGCTCAAACTGACGTCGATTGGCGACGCCCGTTAAACTGTCAAGCAGCGCGATAGAGTGCAACAGATCAGATTGTCGCTTGAGGGTAAACTGATTTTTTACCCGCGCAGTGGTGATAATTGGATTGATAGGTTTATGGATAAAATCGACTGCACCCAGCTGAAAACCTTTAACTTCTTGCACCTCATCAAAATGAGCCGTAACAAAAATAACCCCAATAGTCGCGGTTTCAGGATCGGCTTTAAGATGTTGGCAAACGTCAAAACCCGACATACTAGGCATTTCAATGTCTAACAGCACTAAATCTGGCTGTACCTTCTGGCATATGGCTATCGCCTGCTCACCATTGGTCGCCATAAACAACTCATACTCTTCATTGAAGAGTTGATGCAGTATTTTGATATTGAGCGGCTGATCATCAACGATGAGGATCTTACCTTTTTCACTTTGGGGCCGAGCAATTAATTCTGCAAGATGCATTACGCCTCCTCCAACATGATAATCCGTAGCGTATCTAATGCCTGGTTAAAATCGAGCGCTTGCACTTGCCCATGCAATATGGACCATTGTGGATGGCCTGACAGCAATTCAGCTAACTGGTCAGTTAAGGCCATCGCCTCTAAATTGTTCTCTTTTAATAGGGTCATTAGGGTATTAAGGTCCGTCTTAACACTTTGCACCGATACCGAAGAATACACGGCTTTTTCTACATTATCGCTAGTAACTGCATTAAATTCTTCCACCTCAAAGTAGGTTGAAAGCTGCTGACTACTCTGATTAATGAGTTCCTCTAACATATCCGTCCAGCGCTTAATATCAATCAGTTCAGGACATTCTTGCTTAAACTGTTTCTCTAAGTAAGCCGCATGCGCAGCTAAACGCCGAGCACCAAAATTACTGGCAATGCCTTTAATACCATGGCTAATCGCCGCCGCCGTTGAGTGATCAAATATCTTGGTCGACTGCTTAAAGAGCGCGAGTTGCTTCATCATTTCAGGCGCAAAACTTTTGGCTATTTTTTGGAAGAAAGACTGATTACCCCCAAAACGACGCAGGATCAGGCGAATATCATCGAGTAATTGCTCATCACTCTGCTCCTGCGTCGCTTCAGGCCAGTGCCCTTCAAATTCAGCAGAGGTCATATCCTCACGGCCAACTAAGCGCAAAATACTCGGTAGTAATAATGGCATATCGATGGGTTTGCCAACATGGTCATTCATACCCGCATTGAGACACTCTTGCCTATCGGACTGTGAAGCATTAGCTGTCATCGCAAGTATCGGTAATGCTGCAAAACGTCCATCAGCGCGGATACGTCGAGTCGCCTCCAATCCATCCATATCGGGCATTTGCATGTCCATGATGACAATATCGAATAAGTCACCACTCTCGAGCACCAAGGAAACACCTTCCATGCCGCCTTCTGCCAATACGACATTCGCGCCCTCATAACTCAATAGCTCATCGATAAC
TTGTCGATTGAGTTGATTATCTTCCACCACTAATATTGTTAGCCCTACTAAAAATCGCTGTGAACGCGGCTGAGGGTGCGCCTCCATTGTCTTACCTTCGATGGCATTTAACACAGCTTCAGCCAAGATCTGCGATGTCACAGGTTTAGTGAGGAAATTAACGAAAGGGACATGTCTAATTTGCTGACTCTCGGCAATCACTTCATGGCCGTAGGCCGTTAACATCACCACCAGCGGAGTCTGGTTATTCGCGTCTGAGTTACGCAGCATTTCGGCGGTTTGTAACCCATCAAGGTCTGGCATACGCCAATCCATCAACACCACATCAAACGGTACCGATTTGTCATGGGATTTTTTAACCTTCTCTAGAGCCGCATATCCACCTAAGGCAGTTTCAACCTCACACCCAAATCCGGTCAAAATTTTCTCTAAAATCTCCGTCGTCAATTCATTATCATCAACCACTAACACACGGTATCCGCTAAGGTCTGTGTGCCGAGTTTGTTCGACTTGTATAATAGGAAACGTCATATCAAACCAGAAACGGCTTCCAACACCCACCTTGCTGGTCACATGTAACTCACTGCCCAATAAGGCCGATAAGCCTTTTGCTTATTGCTAGCCCGAGCCCTGTGCCCCCAAAGCGGCGTGATGTTGAGGATTCTGCCTGTTCAAATCCGGTAAAAATCCGCTCAATCTGCTCATCACTGATACCAATACCCGTATCGGTAATTGAAATTTGTACTGTGACGGTATCAGCCTCATGGCGTAAACATTCAATGCCGACAATCACCTGACCGTGGTGAGTGAATTTAAGGGCATTACCTGCGAGGTTAACGAGGATCTGTTGCAATCTAAGTTGATCTGCTAATAACCAAGGAGGTAAAGCCGTGTCGAGATCAAACATCACCTCGACATCTTTATCACCATGGTTACCCGACAGTACAACGGCAAGATCCCGCATCAGCAATTCAATCGAACAGGGATGAAGATCGAGTACTAACTTGCCAGCATCAATTTTAGAAAAGTCGAGAATGTCGTTCAGCAGTCCTAATAAGGACTTAGCGGCCGTTTGAGTTTTATTAACATAATCTTGCTGCTGCCTAGACAAAGAGGTGTATTGAATTAATTGCAGCATGCCTAATACGGCATTCATTGGGGTGCGAATTTCATGGCTCATATTGGCTAAAAATGCAGACTTAGCCGCACTCGCCGCATCGGCCTCCTGCTTAGCTTCTCGCAAGCTTTCTTCCAATTGTTGCTGTTCAGTGATATCTAAATTAATCCCAATCACTCTAATCGCATGACCCTGTTTATCGCGCTCTACTTGGGCGGCTGCCTGAACGAAGCGAATCGAGCCATCTGTCCTTACCACCCTAAAAATAGGGTCATATTTCGCCGTTCCTTCAACAGCGCCTTTCAATTTAGCATCAGCAAAAGCAATATCGTCAGGATGGACCCGCATCGACCAGTGCTCATAGGTTAATCCTTGTTCACGAAGACTTTCAGGCTGATCGTAAATCGCAAACATACGTTCATTCCATTCAAGCGAATTACCGAGCAGATTCCAAGTCCAGATACCTAACTGAGCCACTTCGGCGGCCTTACTTAAATGGTTACTGGCCGTCATCAGTGCTTGTTGCTGCAGTAACATCAGTGAAATATCGACGCCAATGCCTAAATAACCAATAATCTCGCCAACACTGTCCCGCATCGCAGTGACAGAAAGTGACACTTGAAACTGGCTACCGTCTTTACGCACATAGGTCCAATTGCGGGTTTCCGAACCTTCGGCTCTTGCCTTATAGATGAAAACATCAAAGCCTTGAATGTCGTGACCATATTCAGCGGAGAGCTCCGCAGACCTGACCATTACCTCCTCTGGCAAATGCAGAGGCGCAGGGGTAGATTGACCTATCATTTCCTCGGCTGAATAACCGAGTAAGCGCTCTGCTCCACGGTTAAAAATGGTGATATTCCCTTGCACATCAGTGGCTATG
ATGGACATTTCAGAGGCGGCATCGAGCACGTTAGTCAGTAATGACCCTAAGTTATTTTTCTCAATCTGTGTCAACTTACGGTCGGTAATATCAATCCTTAATGCCACAAAACGTTCAATTTTACCGTATTCATCAAACACTGGGCCAATCACAGTATCGAACCATTTCAACGTTTGATGCTTATCGAGGTTGCAGATTTCACCGTGCCAAGACTGTCCTGAATTTATATGCTCCCACATAGATCGCCAAAAAGAAGCATCATGTTCACCCGACTTGAGTATTGCGTGGGTTTCCCCCACCAGTTGCTCACGCGCATAGCTATTAATACGGCAAAAATTATCATTCACCTCTAAGATGACACCATTGGGGTCAGTAACGGAATAGAGTAGCTGTTGATTAATAGTGTCGAGCAAGGTTTGGTTTTCGAGTAACGCCTGCTGCAACGCATGGGTACGCTTGCTGACTTGCCGTTCGAGACTCGCATTCAAGCTTAAAATATGTCGTTCAGCTTCAAGCTGCGCCGTAATATCGCGGATCGTTTGGCTCATGCCAGCTATCTTGCCGTGCTCGTCATAAATAGGCACGGCCGTAGTCGAGGTCGATAAATGACTACCATCTTGCCGTTGATGGCTAGTTATCCGATTTAAAGAGGTTTTACCCGCTAACACTTCGGCGAATAATGCCCTTTCTTCCTGCACTATGCTGGCAGGAACAATCAATTTACAACACGACTGGCCAAGAGCGTGCGCCTCGGTATAGCCAAATAGCTGCTCCGCACCTTGGTTCCAACTGGTAATTTGGCCTTTAGTGTCATAACTGATAATGCCATCGAGAGAATGCTCCAGCATGCTGGCGCGCCTAGCCTGTTCAAGCAGCACCTGCTGCTTACGCTGCAAACTGATGGACCACATGGCAATTAACGCAGCTAAAAGCAGGCTAAACAAACTGCCACTCAACAGCACTAAACTGGGTTGATTAAGATGCAATGACTTAATAAAGGATGGATAAGCGATAACTTCTATCTGCCATTTACGACCAAAAATGTCTTTCGTCATTTTATGGCGATATGCAGATAGTGGAGACGCCTCATCAGGGTGTGTTTCAAAAAAACTGATGGGACGCTCAGCTTGGGTGATATCGCTAAGCAACAGCTTAGTCATCTTCTGATTCAACGCTAACCCCATTAGAACTTCGTTCGTCACTAAGGGGGCATAACTCCAACCATACCCGGCTGACAGTCTATCCTGTTGTGTTTTAGGTACTAAACCCGTGCGATAAATAGGCTGCAGAATTAAGAATGATTGTAGCGGCCTACCCGTAGCCTGCACTAATGTGATGGGTGCAGATAAGCGCACTTCACCGGATAACATAGCCGCATCGGCAGCGGCTTTACGATGTGCCTCTGAGGCAATATCTAAGCCAACGGCATCTTTATTGCGGTCTACTGGCTCAATATATTCAATAACGTATCTTTCGCCATCATGAGCGTTAAGTTGGCGAATATTAAAATCAGGCCAATCATCTTGACGAGCTCTTGCTATAAATGCCGCTTCATCGGGACTGGGTACTCGACGAATAAAGCCAAATCCACGGGCGCCGGGAAACTCATGGTCCACATCCCGCGTTAAACTATAACGATGGAAGCTAGCACGACTAATATCATGCTCACCCGCAGTGACTATCGTGCCGCGTGCACCGCGTAAACCATATTGATATAAGGTGATACGGTTAATCACGTTCTCACTGATTTGCTCCGCGTCCTCGCGCAAGGCTTGCTCAATCTTTTGGGCGTTCATGCGCCCCGTCAACCCTGTTAATAACGCGCTTAGCACCAATCCAATCAACAGGACTAATAAGCCCCATTTCGAAGCATTTTTATAACTCATGGACAAGTGCATAAACGCCTAATAATTTTTAATCAGAGAGTTATGATTTTGGACTAATAAAAATTCTTTTGCCACTCAGCGAGTGGCAACAAACGATAAATGCGCCAAAAGTTG
CTGATGCCCATTTAGTTACAATTTCAGCTATAGCGTGAATAAACGCGAGTATTCCCATGCAATTTCATTTGCTTAGTATTAACATTGGGCAATCAATATCAGCATTTATCGCATTCAAGTTACTAAGGTGAGCTAGTCGCCTTGGTTGAATGCGGCTATATTAGGCAGATTCGTTTTCAATTTCAGCCAGTTGGTTAATCTTAACTGTTTCTCGTCAGTAGAATAAGCACCAATCTGAGCAATTAAAGGACATCCCATGGGACAAGAAACCTCGAAAATCCTCGTCGTCGATGATGATATGCGCCTGCGAGCGTTACTCGAGCGTTACCTGATGGAGCAAGGTTATCAGGTGCGCAGCGCGGCCAATGCCGAGCAGATGGACCGCCTATTAGAACGCGAAAACTTCCATTTACTCGTGCTCGACTTGATGTTACCCGGTGAAGATGGCCTATCTATCTGCCGCCGTTTGCGCCAGCAAGGTAATCCTATTCCGATCGTGATGCTGACGGCCAAGGGCGATGAAGTCGACCGTATTATTGGCCTAGAGTTAGGCGCCGACGATTATCTGCCAAAACCTTTCAATCCACGGGAATTACTCGCACGGATCAAAGCCGTGATGCGTCGCCAAACCCAAGATGTGCCCGGTGCGCCAGCCCAGCAGGAAGCGCAGATCAGCTTCGGTGAGTTTTCGCTGGATTTAGCCACCCGTGAGATGTACCACGGCGACGAATCGATTGCGCTCACCAGCGGTGAGTTTGCCGTGCTGAAAGTGCTCGTCACCCATCCACGCGAGCCCTTGTCTCGGGATAAATTGATGAACCTCGCCCGTGGTCGCGATTACTCGGCGCTAGAACGCTCCATCGACGTGCAAGTGTCACGTCTGCGGCGTTTAATCGAGAAAGATCCTGCGAATCCGCGTTACATTCAAACTGTGTGGGGATTAGGTTATGTGTTTGTGCCCGATGGCGCCGCGCGTCGATGAGCAGCATTGCCATCTTAATCTGCTGCGGTTATTGAGCCCCCGATGAAACCTAAGTTTTGGTGGCGGTTTATGCCTCGCAGCGCGTTTAGCCAAACCGTGATGCTGATTGGTTGTCTATTACTGATCAATCAGCTGGTGTCTTACGTCACGGTTGCCGTCTATGTGCTTAAGCCAAGCTACCAACAAATCAATCAGTTAATCGCTCGCCAAGTCAAATTGCTGTTTGTCGATGGCATAGATATTGGCCGCGAACACTTAACCATAGTCGATGCCCTCAATGCCAAAGTCCACGACGATGGCATGAAGATCTACAATCAGCAGCAGGCGCGCGAAGCCGGTATTGAGCAAGCGACCTATTATGGTTTCTGGTCGGCGCAAATGTCGGAATATCTCGGCGGCGACGCCGAAGTGCGCGTCACCCACGGCAGCGTATTGCAGATTTGGATCCGTCCACCACAGGCGCCCTCGATTTGGATAAAAGTGCCGCTAATTGGCCAAAATGTTTCAGACCTTTCGCCGCTCACCCTCTATTTAATGGTCATTGGTGCCCTCAGCGTCGCTGGGGGTTGGTGGTTTGCCCGCCAACAAAACCGCCCACTTAGACGACTACAAAAAGCCGCGATTGCCGTTTCCCGCGGTGAATTCCCCGAGCCCTTGCCGCTGAATGGTTCGAGTGAAATTGTCGAAGTGACTAACGCCTTCAACCAGATGGCGCACAGCATGAAACAACTGGAGCAGGACAGAGCGCTATTGATGGCGGGGATTTCCCACGACTTGCGCACGCCGCTCACCCGTATTCGGCTCGCTTCTGAGATGATGGTTGAAGAAGATCAATATCTTAAAGATGGCATAGTCAATGATATCGAAGATATGGACGCCATCATCAGCCAGTTTATTGCGTATATTCGCCAAGATCAGGAAACCAGCCGTGAGCTAGGCCAAATCAATAAACTCATTCAAGATGTCGCTCAAGCAGAGGCCAATCGCGCCGGCGAAATTGAAGTGGTGTTAACTGACT
GCCCAGAAGCTCAATTCCAAGCGATTGCCATAAAGCGGGTACTGAGTAACTTAGTCGAAAATGCTTTTCGCTATGGTTCTGGCTGGATCCGCATTAGCTCGCAGTTCGATGGCAAACGTATCGGTTTTAGTGTTGAAGATAATGGCCCAGGAATCGACGAGTCACAAATTACCAAACTATTTCAACCCTTTACCCAAGGTGATATCGCCCGTGGCAGTGTCGGTTCAGGCCTAGGCCTCGCCATCATCAAACGGATTATCGATAGACACCAAGGGCAAGTCACCCTCTCCAACCGCGCCGAAGGCGGTTTAAGGGCCCAAGTGTGGCTGCCGCTGGAATAACGCGAGATCACATATCGGCAAATCAAATGACAACGCTGACATTCATGTTGGTGCTTGTCATATTAGTGTCACGCTAAATTCATAAACTCAGAACAGTTAACTGGCTCTGACAATTATGAAGATTGCAAAATGAAAATTTCAACGCTGTCGCTCTCTGCTTCTGCCCTGTTGTTATTGCTCGCGGGATTATTGGCGGCCGTAGTGCTGTGGAGCAGTGATCAGCGACAAAATATAGAACAGCAAACACTCACACTGCAAAGCATACAAGAAGACTTCCTCGTCGGCGTACGCCGCGATCTCGACGGCTATCTCGCGAGCGGCAACGCTAGCCAACTTGAACAAGCGAAAACTAAACTCAGCGCCATTAAGAACCAGCTTACAGAGCTTAATCTCGCTACCATGGGCGCTACCGATAATGACTTGCAAGCAAGTCTCAGCAGCTTCATTCAAGATTTAGATACCAAGTACCGCGCCGCGGGTAAACTCGCTGGCAATCCTAGGCAACTGCTGGCCCACGCCGAATCTGAAATGCTCGACTATAACCGTCGTCTGGGCAGTTATGCCGATAAAGGCTTAACCACAAATGCGGCGGTCGCCGAACAATATCTGCAATTGAGCCGCGATTTACCTTCCATCGTCTACCAACTTTCCCAGCTAACCGATGGCTACCTCATCGGCAAAAATCAGCAACTCAAGGGCATTTTAGACAGCACCAGCAAAGAGCTCAATACGTGGCACGACGCACTCAGCGCACTGCCGTTAATCGGTGTATATGAGCAGCAAGAAGCCGATGAATTTGCCCTCGGTGCCAGCGAACCGGAACAGGTTGAAGTCGGCGAAAGTGACCGTAGTGAACTGCTGAGTCTCGCCAATCGATACAATAAAGAAGTCGCCAATACCCACCAATTGCTGCAAGCCAACCAAGAGATGCAAGACCAACTCATTCAAGCGATTAGCAATGTGGAACAACAACTTATTGGCTTAGGTGAGGCTCAGGCAGCGAAGAATCAGCAGCTGAAATATGAGCTACAGCTGATCCTCTACACTATGGTTTCAATCATGGCCTTGTTTGCTATAGGTTATTTAATCCTGCAGCAAAATCGCGTCGTCAAACCACTAAAACGCCTCAACCAAGCCTTTATGAAACTAAGCGAATCGAACAGCCGCGAGCGTTTAGACATAAATCGTCGCTGCGAGACGGGCCAAATCGCAGGTCATTTCAACCAGTTGCTGCAAAGGTTTGAACAGGAAGACGAAGCACAGCGCCAACAAATCACTAAGGTTTCTCAATCATTAAGCCAACTGGTTGCGCGCATTACTCAGCTATCTCAGCACACAGAGCACACCCAAACGATTGTGGCCGATACGCAAACACAGACCGAGCATATCCGCAGCCTCGCCAATGAGGTGAGCCACACTTCGGCACTCGTGGAAGACAGCGCCGCCGAAACCATGCGCCAAATGCAGTCGAGCCAAACCGAAGCAGAAGCAGTACTGAGCGCAACTGAGCAGACGCAAACCGCCGTTGGCCTTTGCCATGCTTCCCTTGAAAGCCTGAATAACTCAGTGACGGATGTGTCGAAAATCATTGATGTGATTGGCAATATCGCCGAACAAACTAACTTACTTGCCCTCAATGCCGCCATTGAAGCTGCGCGG
GCGGGAGAGCAAGGTCGCGGCTTTGCCGTGGTGGCCGATGAAGTGCGAAATTTAAGTCAGCGCACCCAAGTGTCCTTAAACGAAATCGTGAAAATTCTGCAGCAACTGACCCAATCGAATCATGCCCTCAGTGAGAGTGTCGATGGCATAGCACAGGCCACCAGCAGTCAAAAACTGCGCGCTCAAAGCCTATGGCAAGTGGCGCAAACGGTGCAAAACCAAGCGAGCGACATGGCCAATACCGCGAAACAAGGCTCGCTCAATGCCAAGGAGCAAGTCGATTATCTCGATCAATTTGTGCGCACTATGGATAGCTTAAAAGAGCAGGCACAAACCAGTTCACAACAGAGTGAAGTCATAGCCCAAGAAGTGCAGCAGAGCGTCGAGGATATTGAGACCAGTTTAGGCATTGCCGATGCTGGTAACTTAGCACCACAGTCGCGGGCCGCTTAACCCACTAAAATACCAATAAAAATGGGAGCCTAAGCTCCCATTTATTTTGATGTCCGATTACGTTCGGACCTTAATCATTACAGTGCAGCTAAAACCACTTCGGCTTTACTGGCTTCAAAAGCCTTTCGCTCTTCAACATTCAGCAGAGTCACTACGCCGTTATCTATGATCATGGCGTAACGTTGTGAACGCACGCCACCAAAACCAGCTGTGTCCATTTCTAAGCCTAACGCTTTAGTAAAGCTGGCATCGCCATCGGCTAGCATCATCAATTCAGAGGCATTTTGCGCTTCGCCCCACGCTTTCATCACGAAGGCATCGTTCACAGCAACACAGGCAATCAAATCCACGCCTTTCGCTTTAAATTGATCGGCTAATACGACATAGCCAGGTAAATGCGCTTCAGAACAGGTCGGAGTAAACGCACCGGGTACAGCAAACAATACCACTTTTTTACCAGCGAACAGTTCAGTAACTTGGTGATTCACCATGCCATCTTTCGTTAGTTGGCCTAATGTAGCTGCTGGTAGTGTTTGACCTTGAGCAATCATATTTATCTCCATTTAGTTAAATTAACCTGACCATACTAGCCCTATTTACCTGAGATAAACACAGAGAGAAAAAAGGGTGATTAGCCGCGAGTGCGATTCATCCGCAGCCCTGCACACAAACCGAGGAATAACAGACCCAATACCGATAAGCTACCGCCACTTTCTTCGACGACGATGACATCAGGCTCTCGGTCTCGATCGCTACTTTCCAGTGGCAACGCATACAAACCATCGGTTTCATCGGCACTAATGGTCGCGACAATGTCGCTGTAACCCACTTCGTACACATCGATTAGCACATCGTAGTGGTCCGTCGCATAACCCGTGTAGAGCGTGGTCAGCACTTCATAATCGTCCTGTGTTGAATCGCCATAAATAGTAAACACGTCCGTGGTGTAGTAATGCACCCAAGGGCCGCCATTACGGCTGAGATAAAGCTCGGCAAATAAGTCAGCTCTTTCATTAAGATACGCTCCGTTAACATCGACGTCGAAGGTCACGCTAAAGGTTTGATAAAACCCATCGTAGTCAAAGTCTTCGAACAAACGACTGCTGGCATCAAAAATCGAAAAGCTGTGATACACAGGGGCGCGGTAGGGATCTTCACTAGTCGCGCTCGAACTCGGCATGCCTTGGCTCGCATGTTTCGCAGTCACTTGCTCACGCGTCATCGGCGATGCGCCCATTAGTTGCACGCGGTTTGCCGTCGGCGCTTTAGAGGATAACGCGGGCGCCAAGGATTTAACGGCTGCAGCGGCAGCTTCAGTGACATTTTTTGCGGGCGCAGCTTGTTGTAACAGGGCTAAAGCTTGCTGTTCCTGCTCGGCTGCTTGCTCGCTGTTTTCAGCTTTTTTGGCAATGCCGACACTGGCCGCAGTAAATGGCATTAAGCTTTGCGATGATTCCAGTTCGCGGGGCTGAGCGCTGACGGCAGTGGACGCTAAAAACAGCGCGGCAATGGCCGTCGCTTTAACGAAATGGGTGTTAACCCGACAG
TTAACTGTGGCTGCCATACCTTGTTTATTGAGTGTGTTCATCTTGAATACCCCATGGATAACCCGTTCAAATTGGAGCCATTAAAGTGCAATGAAGGTGAACATAAGCTGAACATCAACATTTAGTGTTCTTAGCGCGACAATACGTTCAGCCTTAGTTCATCTGGATTAGCCGACACTAGGGCATATTGAATGCGAGCATAGGAATAGACTCGCTGACAGAGCTTCAGTTCTGTCACGAATAACAAAATGACTACCAAAGTTAATTTTGATGGAATTGTGCCAACCAATTGGCTGATCTCTATGTGTTTGTTAAGGTGATGAGGTATTCTTAGTCGACTCGATAAGTTATCCATCTCACCCATTAACTTAAGATCGCCGAACATTGCGGGTACCAGATATGAAACTTGGTTTGACATTGGCCCTAATCGCTTGCTTGTTTGCCTCCTTTGGCAGCATGGCAGGCAATGATAGACAGAATGATCGCAACCAAGGTGCGAAGAATGAGCAGCGCCGTCTGGCGGTCAATAGCCCAGATCAAGCTGTGGCTATGGTGCAGCGCCAATACCAAGGCAAGGTACTGAGCGTGCAATCCAGCGGCTCAGGGTATCGGGTTAAGCTCCTCAATAATGACGGCCAAGTGTTTTCTGTCTCAGTGGATGCCGCTACTGGCCGAGTGTCGAGGAACTAACTATGCGATTGTTATTGGTTGAAGATGATTTAGAACTGCAGGCGAACTTAAAACAACATCTGCTCGATGCCCATTACAGTATCGATGTGGCGAGCGATGGTGAAGAAGGCTTATTTCAAGCACTCGAATACAACTATGATGCGGCGATAATCGATGTCGGTTTGCCTAAACTTGATGGTATCGCGCTTATCCGCAGCGTACGCGAACAGGAACGCGACTTCCCAATCCTAATTTTAACCGCGAGGGACAGCTGGCAAGATAAGGTTGAAGGACTCGACGCTGGTGCCGACGACTACCTGACGAAACCTTTTCATCCCCAAGAACTGGTCGCGCGGCTAAAAGCCCTGATCCGTCGCTCTGCCGGCAAGGCCAGCCCTTTGATTTATAACGGGCCCTTTAGCTTGAATACCAGCAGCTTAGAAGTGCGTAAAGGTGAAGAGTTGGTTAACCTCAGCGGCTCTGAATACAAGTTATTCGAATTTTTTATGCTGCATCAGGGCGAAGTGAAATCTAAAACCGTATTGACGGAGCACATCTACGATCAGGATTTTGATTTAGATTCAAACGTCATCGAAGTCTTTATCCGCCGGTTACGCAAAAAACTCGACCCTGATAATCAATACAACCTGATTGAAACTCTGCGCGGCCAAGGTTATCGCCTAAAAGCGTTAACGCCCGAGCAAGCCATCGATGAGTAAACGTGCCATTGTGTGGCAAGTATTGAACTCCCTTAAAGCGCGCTTAGTGATTAGCGCGCTATTGTTTATCTTAGTGTTATTACCTTTAATCGGCGTCGCCTTAAACGATGCCTTTACCGAGCAAGTTAAAAGCGCGACCAAAAATGAACTCAGCGCCTATGTCTATTCGATACTCGCGGTGACGGAAGTCGAGAATAAACAAATATCTATCCCTGAATTAGTACTCGAAAATCGTTTTAACCTTATTCAATCTGGGCTGTATGCCATTGCGACCACTGAAGATGCCAGCGGCAAACAAACGATCGTGTGGCACTCGCAATCTTTTATGGGCATGGTGCCACCGCCACATTTTACCATCCCAGCAACAGGTCAAAGTGCCTTTGAGCAAATCGAGCTCGCCGAACAACCCCATTTGATTTATAGCTTTAGCGTCAGTTTTGCCAGCCAAAATCAAAACGTGCCTGTGACGATTCACATCATTAAGGACGAGCGCGAATTTCAGCAGCAAATCGATCAATTTAACCAGCAGCTTTGGACTTGGCTGCTGATCCTCATGTTCGTCATGCTCGTGTTCCAACTGAGCTGGCTAGTGTGGACGCTGCAACCATTGGC
GCGCTTTACCCAAGAATTACATTCCGTCGAGCAAGGTAAGTCGATGCAGCTAAGTAGTCAGTACCCCACTGAATTACAAGCCGTTGCGCGGCAGCTCAATATTTTGCTCAATACCGAGCAAACCCAACGCAAACGCTACCGTAATGCCTTGGCCGATCTTGCCCACAGCCTCAAAACACCGCTGGCGGTGATTAAGAGTCAGGCCGACTTAAGCGAAGCCTCCAGCGAGCAAGTGTCGGTGATCAGCCGCATTATTGGCCACCAGCTAAAACGTGCCCAAACCGCAGCGGCAGCCTCGTGGCACTTAGGTATTCGCGTCGATGACGTCGCCGCTAAGTTATTACGCACCTTAGCTAAAATTTATCGTGAGCCGCAAATCAACCTCAGCGGCGAGATGGCGGACGCAGCAGTATTCAAAGGTGATGAAGCGGATCTCACGGAAATTCTCGGTAACGTGCTCGACAACGCCTGCAAAGCGGCAAAGTCCACGGTTAAATTAACCGTGACCGGCGATGCCTATCAATTGCTGATCTGCATCGAAGATGATGGGCCAGGGATCAGTGAAGCGCTGCAAAATCAAATCTTTGAACGCGGTATTCGTGCCGATTCCTATCATCAAGGTAATGGTATTGGGCTTGCGATCGTGCGCGATTTAGTCGACAGCTATAACGGCAGAATTTCAGTCTCACGTTCAGAAACCTTGGGCGGCGCCAAGTTCAGCATCAGCTTTGTGCACTCAATTTAA
Protein sequences of DBSCAN-SWA_5 >LR134321|4788651:4803321|4794832_4795558_+|VEF28015.1|DBSCAN-SWA MGQETSKILVVDDDMRLRALLERYLMEQGYQVRSAANAEQMDRLLERENFHLLVLDLMLPGEDGLSICRRLRQQGNPIPIVMLTAKGDEVDRIIGLELGADDYLPKPFNPRELLARIKAVMRRQTQDVPGAPAQQEAQISFGEFSLDLATREMYHGDESIALTSGEFAVLKVLVTHPREPLSRDKLMNLARGRDYSALERSIDVQVSRLRRLIEKDPANPRYIQTVWGLGYVFVPDGAARR >LR134321|4788651:4803321|4799660_4800617_-|VEF28019.1|DBSCAN-SWA MNTLNKQGMAATVNCRVNTHFVKATAIAALFLASTAVSAQPRELESSQSLMPFTAASVGIAKKAENSEQAAEQEQQALALLQQAAPAKNVTEAAAAAVKSLAPALSSKAPTANRVQLMGASPMTREQVTAKHASQGMPSSSATSEDPYRAPVYHSFSIFDASSRLFEDFDYDGFYQTFSVTFDVDVNGAYLNERADLFAELYLSRNGGPWVHYYTTDVFTIYGDSTQDDYEVLTTLYTGYATDHYDVLIDVYEVGYSDIVATISADETDGLYALPLESSDRDREPDVIVVEESGGSLSVLGLLFLGLCAGLRMNRTRG >LR134321|4788651:4803321|4799106_4799580_-|VEF28018.1|DBSCAN-SWA MIAQGQTLPAATLGQLTKDGMVNHQVTELFAGKKVVLFAVPGAFTPTCSEAHLPGYVVLADQFKAKGVDLIACVAVNDAFVMKAWGEAQNASELMMLADGDASFTKALGLEMDTAGFGGVRSQRYAMIIDNGVVTLLNVEERKAFEASKAEVVLAAL >LR134321|4788651:4803321|4795600_4796917_+|VEF28016.1|DBSCAN-SWA MKPKFWWRFMPRSAFSQTVMLIGCLLLINQLVSYVTVAVYVLKPSYQQINQLIARQVKLLFVDGIDIGREHLTIVDALNAKVHDDGMKIYNQQQAREAGIEQATYYGFWSAQMSEYLGGDAEVRVTHGSVLQIWIRPPQAPSIWIKVPLIGQNVSDLSPLTLYLMVIGALSVAGGWWFARQQNRPLRRLQKAAIAVSRGEFPEPLPLNGSSEIVEVTNAFNQMAHSMKQLEQDRALLMAGISHDLRTPLTRIRLASEMMVEEDQYLKDGIVNDIEDMDAIISQFIAYIRQDQETSRELGQINKLIQDVAQAEANRAGEIEVVLTDCPEAQFQAIAIKRVLSNLVENAFRYGSGWIRISSQFDGKRIGFSVEDNGPGIDESQITKLFQPFTQGDIARGSVGSGLGLAIIKRIIDRHQGQVTLSNRAEGGLRAQVWLPLE >LR134321|4788651:4803321|4797046_4799029_+|VEF28017.1|DBSCAN-SWA 
MKISTLSLSASALLLLLAGLLAAVVLWSSDQRQNIEQQTLTLQSIQEDFLVGVRRDLDGYLASGNASQLEQAKTKLSAIKNQLTELNLATMGATDNDLQASLSSFIQDLDTKYRAAGKLAGNPRQLLAHAESEMLDYNRRLGSYADKGLTTNAAVAEQYLQLSRDLPSIVYQLSQLTDGYLIGKNQQLKGILDSTSKELNTWHDALSALPLIGVYEQQEADEFALGASEPEQVEVGESDRSELLSLANRYNKEVANTHQLLQANQEMQDQLIQAISNVEQQLIGLGEAQAAKNQQLKYELQLILYTMVSIMALFAIGYLILQQNRVVKPLKRLNQAFMKLSESNSRERLDINRRCETGQIAGHFNQLLQRFEQEDEAQRQQITKVSQSLSQLVARITQLSQHTEHTQTIVADTQTQTEHIRSLANEVSHTSALVEDSAAETMRQMQSSQTEAEAVLSATEQTQTAVGLCHASLESLNNSVTDVSKIIDVIGNIAEQTNLLALNAAIEAARAGEQGRGFAVVADEVRNLSQRTQVSLNEIVKILQQLTQSNHALSESVDGIAQATSSQKLRAQSLWQVAQTVQNQASDMANTAKQGSLNAKEQVDYLDQFVRTMDSLKEQAQTSSQQSEVIAQEVQQSVEDIETSLGIADAGNLAPQSRAA >LR134321|4788651:4803321|4800975_4801266_+|VEF28020.1|DBSCAN-SWA MKLGLTLALIACLFASFGSMAGNDRQNDRNQGAKNEQRRLAVNSPDQAVAMVQRQYQGKVLSVQSSGSGYRVKLLNNDGQVFSVSVDAATGRVSRN >LR134321|4788651:4803321|4789583_4791149_-|VEF28013.1|DBSCAN-SWA MTSKVGVGSRFWFDMTFPIIQVEQTRHTDLSGYRVLVVDDNELTTEILEKILTGFGCEVETALGGYAALEKVKKSHDKSVPFDVVLMDWRMPDLDGLQTAEMLRNSDANNQTPLVVMLTAYGHEVIAESQQIRHVPFVNFLTKPVTSQILAEAVLNAIEGKTMEAHPQPRSQRFLVGLTILVVEDNQLNRQVIDELLSYEGANVVLAEGGMEGVSLVLESGDLFDIVIMDMQMPDMDGLEATRRIRADGRFAALPILAMTANASQSDRQECLNAGMNDHVGKPIDMPLLLPSILRLVGREDMTSAEFEGHWPEATQEQSDEQLLDDIRLILRRFGGNQSFFQKIAKSFAPEMMKQLALFKQSTKIFDHSTAAAISHGIKGIASNFGARRLAAHAAYLEKQFKQECPELIDIKRWTDMLEELINQSSQQLSTYFEVEEFNAVTSDNVEKAVYSSVSVQSVKTDLNTLMTLLKENNLEAMALTDQLAELLSGHPQWSILHGQVQALDFNQALDTLRIIMLEEA >LR134321|4788651:4803321|4801956_4803321_+|VEF28022.1|DBSCAN-SWA MSKRAIVWQVLNSLKARLVISALLFILVLLPLIGVALNDAFTEQVKSATKNELSAYVYSILAVTEVENKQISIPELVLENRFNLIQSGLYAIATTEDASGKQTIVWHSQSFMGMVPPPHFTIPATGQSAFEQIELAEQPHLIYSFSVSFASQNQNVPVTIHIIKDEREFQQQIDQFNQQLWTWLLILMFVMLVFQLSWLVWTLQPLARFTQELHSVEQGKSMQLSSQYPTELQAVARQLNILLNTEQTQRKRYRNALADLAHSLKTPLAVIKSQADLSEASSEQVSVISRIIGHQLKRAQTAAAASWHLGIRVDDVAAKLLRTLAKIYREPQINLSGEMADAAVFKGDEADLTEILGNVLDNACKAAKSTVKLTVTGDAYQLLICIEDDGPGISEALQNQIFERGIRADSYHQGNGIGLAIVRDLVDSYNGRISVSRSETLGGAKFSISFVHSI >LR134321|4788651:4803321|4801268_4801964_+|VEF28021.1|DBSCAN-SWA 
MRLLLVEDDLELQANLKQHLLDAHYSIDVASDGEEGLFQALEYNYDAAIIDVGLPKLDGIALIRSVREQERDFPILILTARDSWQDKVEGLDAGADDYLTKPFHPQELVARLKALIRRSAGKASPLIYNGPFSLNTSSLEVRKGEELVNLSGSEYKLFEFFMLHQGEVKSKTVLTEHIYDQDFDLDSNVIEVFIRRLRKKLDPDNQYNLIETLRGQGYRLKALTPEQAIDE >LR134321|4788651:4803321|4788651_4789584_-|VEF28012.1|DBSCAN-SWA MHLAELIARPQSEKGKILIVDDQPLNIKILHQLFNEEYELFMATNGEQAIAICQKVQPDLVLLDIEMPSMSGFDVCQHLKADPETATIGVIFVTAHFDEVQEVKGFQLGAVDFIHKPINPIITTARVKNQFTLKRQSDLLHSIALLDSLTGVANRRQFEQRLPEIWKHCCRNELALSVVMLDVDFFKRFNDRYGHQEGDQCLRQVAKAISDSLQRATDFVARYGGEEFICILPETKLAGAIHTAQKIVNAVQALHLEHLESSFQEVTISAGVASVLPKSDLTWQTLIETADQQLYLAKESGRNQVVGLGL >LR134321|4788651:4803321|4791156_4794474_-|VEF28014.1|DBSCAN-SWA MHLSMSYKNASKWGLLVLLIGLVLSALLTGLTGRMNAQKIEQALREDAEQISENVINRITLYQYGLRGARGTIVTAGEHDISRASFHRYSLTRDVDHEFPGARGFGFIRRVPSPDEAAFIARARQDDWPDFNIRQLNAHDGERYVIEYIEPVDRNKDAVGLDIASEAHRKAAADAAMLSGEVRLSAPITLVQATGRPLQSFLILQPIYRTGLVPKTQQDRLSAGYGWSYAPLVTNEVLMGLALNQKMTKLLLSDITQAERPISFFETHPDEASPLSAYRHKMTKDIFGRKWQIEVIAYPSFIKSLHLNQPSLVLLSGSLFSLLLAALIAMWSISLQRKQQVLLEQARRASMLEHSLDGIISYDTKGQITSWNQGAEQLFGYTEAHALGQSCCKLIVPASIVQEERALFAEVLAGKTSLNRITSHQRQDGSHLSTSTTAVPIYDEHGKIAGMSQTIRDITAQLEAERHILSLNASLERQVSKRTHALQQALLENQTLLDTINQQLLYSVTDPNGVILEVNDNFCRINSYAREQLVGETHAILKSGEHDASFWRSMWEHINSGQSWHGEICNLDKHQTLKWFDTVIGPVFDEYGKIERFVALRIDITDRKLTQIEKNNLGSLLTNVLDAASEMSIIATDVQGNITIFNRGAERLLGYSAEEMIGQSTPAPLHLPEEVMVRSAELSAEYGHDIQGFDVFIYKARAEGSETRNWTYVRKDGSQFQVSLSVTAMRDSVGEIIGYLGIGVDISLMLLQQQALMTASNHLSKAAEVAQLGIWTWNLLGNSLEWNERMFAIYDQPESLREQGLTYEHWSMRVHPDDIAFADAKLKGAVEGTAKYDPIFRVVRTDGSIRFVQAAAQVERDKQGHAIRVIGINLDITEQQQLEESLREAKQEADAASAAKSAFLANMSHEIRTPMNAVLGMLQLIQYTSLSRQQQDYVNKTQTAAKSLLGLLNDILDFSKIDAGKLVLDLHPCSIELLMRDLAVVLSGNHGDKDVEVMFDLDTALPPWLLADQLRLQQILVNLAGNALKFTHHGQVIVGIECLRHEADTVTVQISITDTGIGISDEQIERIFTGFEQAESSTSRRFGGTGLGLAISKRLIGLIGQ |
11 | Bacillus_phage(44.44%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those that match specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
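The keyword rule described in the legend above can be sketched as a simple text match. This is a minimal illustration, not the database's actual implementation: the function names, the case-insensitive substring matching, and the example annotation strings are all assumptions; only the keyword list is taken from the legend.

```python
# Keywords copied from the legend; everything else here is a hypothetical sketch.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def phage_keyword_hits(annotation):
    """Return the legend keywords found in an annotation string.

    Matching is a case-insensitive substring test (an assumption), and
    hits are returned in the order the keywords appear in the legend.
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in text]


def is_phage_related(annotation):
    """True if the annotation contains at least one legend keyword."""
    return bool(phage_keyword_hits(annotation))


if __name__ == "__main__":
    # Hypothetical annotation strings, not taken from this record.
    for desc in ("phage tail fiber protein", "sensor histidine kinase"):
        print(desc, "->", phage_keyword_hits(desc))
```

Plain substring matching is deliberately loose: a short keyword such as 'coat' or 'head' would also match inside longer, unrelated words, so a stricter variant might match on word boundaries instead.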
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|