Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP016783 | Candidatus Planktophila lacus isolate MMS-IIB-60 chromosome, complete genome | 2 crisprs | DinG,WYL,cas4,DEDDh,cas3 | 0 | 1 | 5 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP016783_1 | 848969-849076 | Orphan |
NA
Consensus repeat of NZ_CP016783_1
|
1 spacers
spacers of NZ_CP016783_1
>1.1|848993|60|NZ_CP016783|CRISPRCasFinder GCTGGTGCTTCAACAACAGCCTCTGGTGCGGTTTCAACAACCGCTTCTACTGCTACCTCT |
CRISPR arrays and Neighbor proteins around NZ_CP016783_1
The CRISPR arrays of NZ_CP016783_1 >merge|NZ_CP016783|1|848969-849076|CRISPRCasFinder TCGGTCGCAGTTGGTTCGGACACAGCTGGTGCTTCAACAACAGCCTCTGGTGCGGTTTCAACAACCGCTTCTACTGCTACCTCTTCGGTCGCAGTTGGTTCGGACACA >NZ_CP016783|1|1|848969-849076|CRISPRCasFinder TCGGTCGCAGTTGGTTCGGACACA GCTGGTGCTTCAACAACAGCCTCTGGTGCGGTTTCAACAACCGCTTCTACTGCTACCTCT TCGGTCGCAGTTGGTTCGGACACA
>NZ_CP016783.1|WP_095527184.1|848671_848911_-|RNA-binding-protein MMDEALEHLVKGIVDNPEDVIVKEKTHRRGTTLEVRVNPEDIGKVIGRNGRTAKALRTVVSAIAGRTVRVDLIDTDEAR >NZ_CP016783.1|WP_095689257.1|848143_848653_-|ribosome-maturation-factor-RimM MRLNIGRIGRAHGILGEATIEVRTDDAAERFAIGAILETESHGNLTVASARVHNGILLLGFDGVEDRNAIEKYRDQLLYADVDIDAPGLDEDDYHVLQLIGCQAYLVDGDLFGEVTEVLNLPGQDVLSIKTEAGEVLIPFVHQLVPVVDVKEKRMTVIPPEDYEAGVSK >NZ_CP016783.1|WP_095689256.1|847463_848147_-|tRNA-(guanosine(37)-N1)-methyltransferase-TrmD MKIDAVTIFPEYFAPAQLSLLGKAQSKGLVDIQVHDLRAATQDNHNTVDDTPYGGGAGMVMLPEIWGRVLDPLMVADTDLIILTPAGKRFNQEMAQSFANSKHLIFACGRYEGIDDRVRQHYESSEYAAKNVRVHEVSIGDYVLGGGEVASLVMIEAITRLIPGVLGNPESLAEESHNSEGYLEYPNFTKPQEWRGIAVPEILLSGNHAEIAKWRELQAIERAKKNF >NZ_CP016783.1|WP_095689255.1|846457_847426_-|acyl-ACP-desaturase MSEESEKIQSRLIRDLEPVVAVELERHLSVQKNWYPHEYVPWSEGRDYAGPLNGDAWEAKDSRLSPVAQDSLILNLLTEDNLPSYHTEIAISMGRDGAWGNWIERWTAEEARHAIVIRDYLMATRGVDPYELEDLRMAHMSLGYQTPYENDMLHTIAYVSFQELATRISHRNTGKISDDPIAEGMMQRVALDENLHMLFYRNTLSAALDMEPNASMRAIADVVTNFDMPGANMPGFGRKAVQIALAGIYDLQQHLDDVVSPVLRAWNVFERNDLSGDGLVARDELVAFMEKSGKEAATFNDKREVHFERLIARGQNPIRIQK >NZ_CP016783.1|WP_095689254.1|845683_846424_-|class-II-glutamine-amidotransferase MCRLLGYVATEERTFADVVGEGFENFVELSKEHKHGWGISACSTDTRKTELVRDLTLAAESEKFAESSTNLKSDGALLHLRLASKGLTVDLSNNHPFIYGDYSFMHNGTIKSIDSVQKFVDPRYLGKFTSTTDSERYFFTILSCLDELGLIEGVRKAVKTISANCDFTSINSMLMTPDTFIAVCEFNEADSTEWTVDSHYELRYSVEDGVIKVASTGWGKDHWTRLTNHSILVVNRKDLSFEVLPL >NZ_CP016783.1|WP_095689253.1|844226_845660_-|tRNA-(N6-isopentenyl-adenosine(37)-C2)-methylthiotransferase-MiaB MARTYAVETYGCQMNVHDSERIAGLLDEAGYLPVIQGEQADIVVFNTCAVRENADNKLYGNLSFLAPIKKANPNMQIAVGGCLAQKDQSIILKKAPYVDVVFGTHNVGSLPVLLERARIEEESQIEIKEALEHFPSTLPARRFSAFTSWVSVSVGCNNTCTFCIVPTLRGIEKDRPEADILKEVRALVDQGVIEITLLGQNVNAYGVEFGDRGAFAALLRKCGEIYGLERVRFMSPHPRDFTDDVIEVMATTPNVMPHLHMPLQSGSDRILQEMRRSYRSSRYLGILDRVRSAMPHAMITTDIIVGFPGETEYDFQATLDLTTQAQFSAAYTYKYSIRPGTPAGVMPNQVPEAIVGERYTLLHEHQQKISLGVNKRSIGQTHRALVSEVEGRRDQAQARLTGKTEDFRLVHFDAKSQARPGDFVDLEITDASAHYLIAKELAHIKTRGGDVHEARTKQGPTATSLGIPTLRKPSAAE >NZ_CP016783.1|WP_095689252.1|843313_844219_-|tRNA-(adenosine(37)-N6)-dimethylallyltransferase-MiaA MKTIVICGATATGKSDLAVALAKEIGAEIINADSMQLYRGMDIGTAKLTLEERGGIEHHLLDVLDVTEDATVAWYQEQARKKITEIHARNNHAVIVGGTGLYIKAILDELNFPGTNPEVRERLNQEAEEFGVARLFERLQLLDPAAAAAIDIQNVRRVIRALEVIEITGQPFTANLPREDSSKYPDALQFGLVMDREKLTERIDARVDLMWRRGLVAEVDSLITEGITKATTARRALGYAQVISMRAGEISEDQAMEETKRATRQYARRQETWFSRDARIKWISPIQPRLETVLQKINSTT >NZ_CP016783.1|WP_095689638.1|842501_843314_-|diaminopimelate-epimerase MNTPIIATYGHGTENDFLVVFDPEEEISITTSQTAAMCNRETGIGADGLIKIGKRDGKWFMDYRNADGSLAEMCGNGIRVMARYLVERGHQPEGIFAINTRDGVKHLRVPAKDDISVNMGKVIDEGEAVTASVEGKIWNGYHISVGNPHAVVFVEKIEDVGSLKDAPVVRPRDIYPEGVNVEFVEITANKEIKMRVHERGSGETRSCGTGTCAVALAATLHTNTSLPARWVIFPPGGRLVVDIDPHANATLIGPAVLIKDIDITDYLQDA >NZ_CP016783.1|WP_095689251.1|841035_842505_-|GTPase-HflX MSNDQPRNNDLFEELLRESARVAADFDADESEYDPDPQQELSDRHALRRVKGFSTELQDISDAEYRQLQLERVVLVGVWTEGSAEMAENSLAELKALAETAGSEVLDGLIQRRDKPDPATYIGSGKVIELRQIVASSGADTVICDGELSPSQLRQLEDKLKVKVVDRTALILDIFAQHAKSKEGKAQVELAQIAYLLPRLRGWGDSLSRQVGGRAAGGAGIGGRGPGETKIETDRRRIRDKMAKLRREIAEMKVSRDTKRQERKRFNIPSVAIAGYTNAGKSSLLNRLTNAGVLVENALFATLDPTVRKTHTADGRTYTLSDTVGFVRHLPHQLVDAFKSTLEEVSGADLIVHVVDGSHPDPFEQIRAVREVITDIGGGDIPEIIAINKVDSANPEVVMEILRKEKNSYAFSARTGFGIEGLVHAIEKSLPQPNIEINSIIPYNRGDLVNAIHETGEILSEEYVESGTAIHARVGASLAKAIEAVEKPY >NZ_CP016783.1|WP_029168537.1|840590_840887_-|DUF4193-domain-containing-protein MATDYDTPRKTDEELHEESLEELKAKRVDAQSGQIDVDEAEAAENLELPGADLSGEELAVRVVPKQVDEFTCSRCFLVHHITQLAKGEGAKAVCKECN >NZ_CP016783.1|WP_095689259.1|849683_851123_-|signal-recognition-particle-protein MFDNLSSKFSNAFSSLRSRGKISPSDIENTCAEVRTALLESDVALPVVDRFIEKIRSKSLEALPNLQSGTNQAQAIFEIVNAELVEILGGSARRVRFAKTAPTVIMLAGLQGAGKTTLAGKLAKFYADQGNTPLLVASDLQRPNAVNQLQVVGESLGVPVFAPEPGNGVGNPVKVAEQGIAFAKSKLYNMVIVDTAGRLGVDEDLMREAIAIRDAISPDEILFVVDAMIGQDAVRTAQAFQDGVGFDGVVLTKLDGDARGGAALSITQLTGKPIMFSSNGEKLSDFDIFYPDRMASRILGLGDVATLAEQAKKAFDGESSARLEEKFARGDDFTLEDFLEQLEAMSKMGSMSKLLGMLPGAGGMKKQIENFDESEIVRTKSIVQSMTPIERRDPKVLNGSRRARIALGAGRKVQDVNSLVDKFAAAQKMMKQMRSGKGLPPGMGMPTGAPAPVQKVSQQPAKKKSKSGNPAKRALEEKG >NZ_CP016783.1|WP_095689260.1|851151_853413_-|[protein-PII]-uridylyltransferase MGTRERRERSNESDRFLTSLFSQVPDAVSGVALAAVGGYGRGELSPGSDLDILFLHNGRIEVKSLAEIVNKILYPLWDKSYKVDHSVRTRSETRESASSDLKVALGLLDIRLICGDPDLVAAVQHDAIDDWRGNSRERLPELRRSLADRHLRAGDLAYLLEPDLKEARGGLRDINALRAIARSGAVSISLEHISTAESVLNNAREALHTVTSRDKDRLLFQEQDKVAELLKYADADALMSDIAQAARSVDYLLESTWHRFDHRGKDGLGRFLRKTRTTHIAQGISIAHQEVFIDEGFDFHKDPVIGLRAAAIAAQAGLPVSPTSLQALAQASKSGISKLPNPWPRSARENLITLIGAGAPMVRIWEGLDQEEILFDWLPEWRAVRSLPQRNALHRHTVDRHMVETAVFAAALTRNVHRPDLLLFAALFHDIGKGTQEDHSDRGEKLIEPLARRIGFDENDIATLKLLVKHHLLLSATATRRDLDDPATIASVTSVIPDLQTLELLHALSIADGQATGRAAWSDWKESLLSELVSRVTSALTDNTIARQPEFTSEQRVLVDSGELQVKIEARDPDFAIEIIAPDKTGLLSIVAGVLNLARFDVRSARTQTIGKSAIMKWIVTPNQFAPSVDEGAIKAAIAEALADASDLTERIKRRVADYSNIPSIPVPLPIVETFMDSATDATIIEVRSHDRPALLFGIGDVITKSQVDIRSAIVTTLGAEAIDTLYVTEIGGGALSPERAQELANRLSSALA >NZ_CP016783.1|WP_095671244.1|853442_853781_-|P-II-family-nitrogen-regulator MKLITAILKPFKLDEVKDALQAAGVTGMTVSEASGFGRQRGHTEVYRGAEYTVDLVPKVRLEVLVDDADAAAIVDVIVKAASTGSIGDGKVWTTPVDSIVRVRTGERGTDAI >NZ_CP016783.1|WP_095677467.1|853777_855082_-|ammonium-transporter MEEIVLNSGDTAWVLASTALVLLMTPGLAFFYGGMVRTKSVLNMMMMSMVTIGIVSVIWVIYGFELAFGYKANSAWYGNISLSGLGGVVNDFTNNGGVYPIPVLVFAAFQLMFAIITPALISGAIADRTKFTAWAIFVAIWSTLVYFPVAHWVFAFGNKVGDTITGAGYLASRGVQDFAGGTAVHINAGAAGLALAIVIGKRVGWRKESMRPHSLPLVMLGAGLLWFGWFGFNAGSSLAANGTAALALINTQVAAAAAVLGWLLVEKIRNGHATSLGAASGAVAGLVAITPACAFVAPWAAVVIGLLAGAICALSVGIKYKLGFDDSLDVVAVHLIGGIWGSLAIGLFGSSAVNSLSLDGIFYGGGTALLGKQALGVGLVFAYSFIATLIIGYAIEKTIGFRVKKDAEVEGIDLNEHAETAYEMTSSSRGGSLL >NZ_CP016783.1|WP_095689261.1|855252_856092_-|signal-recognition-particle-docking-protein-FtsY MGIFSKFISKIKGVSTADPLDWQELEAELLAADLGPSLTSSIIKDAKSIKGDDALASLTQILNSKLSSKSRTLSKAASTSALIVVGVNGTGKTTSVAKLAASLKGSGSSVILAAADTFRAAAVEQLQTWGARIGVEVIAGKDGAEPASVVFDAAVRANEQQVSYLLIDTAGRLHNKNDLMAELGKVKRVIEKSLPVSEVLLVIDATTGQNGITQAKIFTEAVDVTGLIVTKLDGSARGGVALAIENALDIPIKFVGTGEAESDFAAFDSERYISGLIGE >NZ_CP016783.1|WP_095689262.1|856120_859642_-|chromosome-segregation-protein-SMC MTLKGFKSFASATTLRLEPGITCVVGPNGSGKSNVVDALTWVMGEQGAKSLRGGKMEDVIFAGTSGRAPLGRAEVSLTIDNTDGALPIDYTEVTISRILFRNGQSEYQINGEASRLLDIQELLSDSGIGREMHVIVGQGQLDAILMATPEERRGFIEEAAGVLKHRKRKEKALRKLDSMQANLARVQDLTVELRRQLRPLGKQAEVAKKAATIQADLRDAKLRLLADDFISLSKTLDAEVADETALRERRSLVEDELDKVRSREESLDAQAAFESPLLIAAQENFYALSALREKFRGTQSLAQERSRFLAEEAEEARSAGRDPEALDQEALALRQQEAQLRSEVQSAQAHLQATTSKLSSAEQSLKVEEDKIAAAMRAIADQREGTARQEGHIKSLAARLEAIAEEIARLVKARDEAQSRAESAQRQYSTLEMDIAGADAGELGLDSEFEVAKRSLDNAKAELSSLVDAERAADRERNAIESKLEAMLLTSQSRDGGAALVRDSRGLTILGSIASLVQIDSGWESATAAALGSLCDAIVVRDLNSAISALTTMRSENLGQADVLVYQPGSHSATSVPDGLTALTSHVRSTEISELLASLLSNTVVAESAREAEGIIHSHPSVTVVTRDGDVITAKRARGGSASSTSLIEINALVQELQAKLETITHNCDRIKFEISTAQGDVESKQNTFDIALSKLNESDARISAFTEQLAVAGQNMKSASAEVERLNSAIAEANAAKGRDENELSIASAQLQQRGEIGEPDHSAAENLRNEVSLARTAEVEARLAVRTSEERVDSIGARAKALEDSANAEREASERAVSRRGARARGALISSAIAEAAYEALIHIERSIAKAATERARLEASRSDREGETLTVRSRGRELASELEQLTSSVHKDEIARAEQRMRIEQLESKAVEELGVDTTTLVNEYGPQNDVPTFIETETGEIVATELIPYRRDQQEKRLAATERSLTLLGKINPLALEEYNALEERLKFLAEQLEDLKRTKKDLLDIIKEVDDRVQQIFMEAYEETAKHFEDIFARLFPGGDGRLILTNPDDLLNTGVDVEARPPGKRIKRLSLLSGGEKSLTAVAMLVAIFKARPSPFYVLDEVEAALDDVNLGRLLVVLEELRESSQLIIITHQKRTMEIADALYGVTMRGDGVTEVISQRLRESDTA >NZ_CP016783.1|WP_095671830.1|859711_860389_-|ribonuclease-III MFSTLTAQLGITLKPELLELAFTHRSFAYETGAKETNERLEFLGDSVLGLIVTEELYLRYPDLDESRLSPLRSGIVNMRALADIARTLELGKYIRLGKGEEVTGGRDKNSLLADALEALIGAIYLECGFATTTTVVRTLINETLESAMAKGAGLDGKTALQELVSSLGKGTLEYLVTEEGPDHDKSFTAVAMVAGEAVAQGIGKSKREAEQSAARSAYEILATLK >NZ_CP016783.1|WP_095671248.1|860390_860564_-|50S-ribosomal-protein-L32 MPVPKRKMSRSKTRSRRSMWKTTAASLAACPQCQQPKLTHTACPTCGTYNRRQVLEV >NZ_CP016783.1|WP_095689263.1|860636_861227_-|DUF177-domain-containing-protein MSKASSVFEFNTFELPRRAGEMKEYQLDLEILEPIGVPLVSVPAGDVIEVDLRLESVTEGVLLSADLYAIAKGECIRCLDPVEITVERKIQELYRYEPSKDSGKKGKKSKRASDDDVDLEEDDELWMDGNVMDLEPPIRDAVVLDLPINPLCSEECLGLCPDCGQKWADLPEGHQHEAIDARWAGLAGLDFKKSDE >NZ_CP016783.1|WP_095689264.1|861247_861730_-|ATP-synthase-subunit-B/B' MDSIEKLSTAITLIEEARGVPLSASCVVHRGEILEILDGARIALPQDLSAAEAILAQRDNLVEEGRSSAEQMIATAREEVARMIEQTAIVQAARDEAQRILDDARALAADERAEVESYIDGRLATLEVILNKTLDAVARGRERLDGANDKDVLSQLADDN |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP016783_2 | 900409-900501 | Orphan |
NA
Consensus repeat of NZ_CP016783_2
|
1 spacers
spacers of NZ_CP016783_2
>2.1|900439|33|NZ_CP016783|CRISPRCasFinder CGGACGTGGAACTGCAGGTGTTGTTCCACCAGA |
CRISPR arrays and Neighbor proteins around NZ_CP016783_2
The CRISPR arrays of NZ_CP016783_2 >merge|NZ_CP016783|2|900409-900501|CRISPRCasFinder AGTTGAGAATGGATTATTTCCGGGACGTGGCGGACGTGGAACTGCAGGTGTTGTTCCACCAGAAGTTGAGAATGGATTATTTCCAGGACGTGG >NZ_CP016783|2|2|900409-900501|CRISPRCasFinder AGTTGAGAATGGATTATTTCCGGGACGTGG CGGACGTGGAACTGCAGGTGTTGTTCCACCAGA AGTTGAGAATGGATTATTTCCAGGACGTGG
>NZ_CP016783.1|WP_095689287.1|897736_898168_-|30S-ribosome-binding-factor-RbfA MGQSHRTQKVADRIKVVVAQLLETKIKDPRLGFVTVTDARVTGDLQNASIFYTVLGDEEQRAATAAALESAKGVIRSAVGRDLQTRITPTIEFFEDGLPESAKALDSLLDKVHVLDAEVAQLRKNATFAGGADPYKVPKVIED >NZ_CP016783.1|WP_095689286.1|896853_897726_-|tRNA-pseudouridine(55)-synthase-TruB MDGFLVVDKAPAMTSHDVVAVARKALGTRKVGHAGTLDPMATGVLVLGFGNGTRLLQYITDGDKSYLATVVLGAATVTDDQEGEVISEADASKITDEEIRVDLSKMVGEIQQRPSSVSAVKVAGERAYDRVRAGEVFELAARTITISSLDVLDIRRTSSRIEIDIDVTCSAGTFIRAIARDLGSELAVGGHLSALRRTRVAGFAIDQAITFDALKAQDFKTLDLADVARATFSVRELALDEVAELSFGRPLPSNSGEDIFAALSPDNRLIALLQNESGKAKPIAVFAAAN >NZ_CP016783.1|WP_095689285.1|895944_896835_-|bifunctional-riboflavin-kinase/FAD-synthetase MSSGSVVVIGVFDGVHKGHQLLLNRAKEIADGRSIIALTFDPHPMQIFAPDRAPTMLTTLADRVELLKIHNADQVAVLKFNEKFAAMSPEDFVKDVLVSQLSVSTVIVGKNFTYGHKAAGNIESLIKDSLNFNFTVDVQELKSEESDVISSSRIRKLVVEGKVEEARTLLSRPHRLDGVVVHGEKRGREIGYPTANLGNIDGQTIPADGVYAGWLTVGINFWPAAISIGTNPTFEGVRGRQVEAYALDQEGLELYDKNASIEFGWRLRDTLKFDGLEPLLAQMKLDCDRARALTEG >NZ_CP016783.1|WP_095527135.1|895554_895824_-|30S-ribosomal-protein-S15 MALTPEVKKEIIAKYGSSATDTGSPEAQVALLSRRIEDITTHLKTNPHDHHNRRGLLLLVGQRRRILQYLSKTDINRYRAIIEKLGIRR >NZ_CP016783.1|WP_095689284.1|893170_895372_-|polyribonucleotide-nucleotidyltransferase MEGQEVQTSVAVIDNGKFGKREIRFETGRLARQAAGAAAVYLDDQTMIFSATTASKTPKDQFDFFPLTVDVEEKMYAVGRIPGSFFRREGRPSEDAILTCRLIDRPLRPSFVKGLRNEVQIVVTVMALDPDHMYDVIAINAASMSTQLAGLPFSGPIGGVRVALIDGQWVAFPNHSQVENAVFDMVVAGRISDGDVAIMMVEAEATAKTIELIKNGQPTPTEEVVAQGLDAAKPFIKILCEAQSKLAKVAAKPTAEFPVFLDYQDDVFAAVEKAAKDDLAKALTIVGKQERETKIDEISAATKESVSAAFEGREKEVPAAFRSLTKKLVRQRVLRDKIRIDGRGLRDIRALSAEVEVIPRVHGSAIFERGETQILGITTLNMLKMEQQLDTLNPENHKRYMHNYNFPPYSTGETGRVGTPKRREIGHGALAERALVPVLPTREEFPYAIRQVSEALGSNGSTSMGSVCASTLAMLNAGVPLKAPVAGIAMGLISDTVDGKVEYVALTDILGAEDAFGDMDFKVAGTKDFVTALQLDTKLDGIPASVLAGALHQAKEARLAILDVMNEAIDTPDEMSPFAPRIISVKIPVDQIGAVIGPKGKIINQIQDETGADISIEDDGTIYIGAVDGPSAEAARAQINAIANPQMPEVGERYLGTVVKLAAFGAFISLMPGKDGLLHVSQIRKMHGGKRIENLEEVMKVGDKIQVEIGEIDPKGKLSLVPVLEDGSTPSAE >NZ_CP016783.1|WP_095689283.1|891862_893152_-|insulinase-family-protein MSVRRPVVRTVLPSGLRIVTEEVPSVRSAAIGIWVNVGSRDETPAVAGASHFLEHLLFKGTTRRNALEISATIEAVGGEMNAFTSKEYTCFYARVIDTDLPMAIDVVSDLITSSIVSALDVDAERKVVLEEIAMRDDDPSDLVHDLYAETYYGDTQLGRPILGTIKSISDMTRSSVFNYYKKKYLPQDLVVAVAGNIKHKRVVAMVEEALSRDNFLDVKGSPQIRPNTPLKTKPMQSVGLLTRKTEQAHMFYGMEGVARSDERRFAMGVLASALGGGMSSRLFQEIREKRGLAYSVYAYAQQFAGSGQIGFYAGCNPTKAIEVVEIIREVLADVAENGMSHEEIERAKGAVRGSLVLSQEDSGSRMSRIGKNEIVYGQVMGFDEILKAISRVNPTDVREIANEFLTKSPTLALVGPFKSEAKFEKVLKS >NZ_CP016783.1|WP_095689282.1|891128_891866_-|4-hydroxy-tetrahydrodipicolinate-reductase MIKVGVLGARGRMGAEVVKAVTEAADLELVAALDLGDSLETLKSSGAQVVVDFTTPDSVMANLEFLANNGIHAVVGTTGFDAARIATLEKLIAANPSVGILIAPNFAIGAVLMMEFATKAAKYFESAEIIELHHPNKVDAPSGTASRTAELMSKARKDAGLGAMPDATTTSLDGARGATVGDIPVHSVRLRGLIAHQEVLLGGLGETLTIRHDSLDRAGFMPGVLLGVRSVISKPGLTFGLEKFM >NZ_CP016783.1|WP_095677497.1|890859_891117_-|NrdH-redoxin MAKADITMYGADWCGDCRRSKRLLEELDVQVNHIDVEADKSAAAKVIEINGGAQSIPVIVFSDGTHLTEPSDNDLKAKLQALKII >NZ_CP016783.1|WP_095677873.1|890544_890853_+|AzlD-domain-containing-protein MEPFWLAVIGTSLLAFLLKILGYSVPKKWLSNPRALKINSLIPIALLSALVAVQSFTQDSRIVADQRMAGVGVAVLALILRAPFPVVVISAAATSAALFHLN >NZ_CP016783.1|WP_095689281.1|889822_890539_+|AzlC-family-ABC-transporter-permease MPLPHRGYLGQNAPVSRVNRATAAQSFSVSLTVGAYGIAFGAASVAAGFSVLQSCLLSLLTFTGASQFAVVGVLGAGGSALSGIATATLLGVRNTLYGLRMAPVLNVRGLKRVFAAQLTIDESTGVALSQENLGVREARQGFWLTGIGVYVFWNLFTLAGALGAQAMGNPAAWGLDAAVPAAFLGLLWPRLTNKSERLLAVAACTLALAMTPYFVPGFPIIATAVLAVIFGLRAKIWK >NZ_CP016783.1|WP_095671288.1|901023_901302_-|YlxR-family-protein MAAMQRGIKPIRTCIVCRKSQEPSQLLRVNCVDGVITPTSGGKSHGRGAWLHLSCGYIAIDRKAFKSAFKLEQAPDLSKFKVFLDERLKSDN >NZ_CP016783.1|WP_095689289.1|901391_902378_-|transcription-termination/antitermination-protein-NusA MNKGQVDVDALIALATHKQMPLEDLITEIEAGVLTAYYETDAPKRQARASIDRETGEIVIWVPTFNELGERISEDRDEPEGFARTATSTVRQVIKLKMRANNDAEIVGEFSASVGDVISGIVQQGRDPKMINVNLGKVEGRIPPQEQVPGEVYNHGDRIKCFVVEVKQGMKGPEIMLSRSHPALVKQLFALEVPEINDRIVEIMEVAREAGHRTKLAVRSHRAGVSPKGSLIGPMGSRANAVMEELHGEKIDIVDWSEDPAQFVAHALSPAKVTKVEIVDLATRSAKVTVPDYQLSLAIGKDGQNARLAARLTGWRIDIHPDTPTVPR >NZ_CP016783.1|WP_095689290.1|902370_902874_-|ribosome-maturation-factor-RimP MSLTQSITQLIEPAVTEAGFFLEEVQLTSAGSHRVVTCIVDGQSPLNLDQVTVVSRLISELLDTATFMDETPFTLEVTSPGVDRPLTLPRHWTKNLTRLVKVTLQDGTVTTGRLTEFNEVTATLVENIKGRIKTHTVNFADIKRAVVEIEFNRKEEAQNLNDEAGDE >NZ_CP016783.1|WP_095689291.1|902944_903730_-|TSUP-family-transporter MFADLTLYTLAFLAAAAFCAGLIDAIAGGGGLIQLPAMLIGLAKTETVVVLGTNKVPSIFGTTASALTYRRNIKVSSKLLIVMAIPAFIGSMGGASLASLIPTEVLKPLVVALLIAVLIYTWKRPQLGQIESMRHSESKRLKISAVAALVIGFYDGIIGPGTGSFLILLLVAVMGFAFLSASAIAKVVNVATNGGAILVFGVNGEILWKIGLTLAIANVCGGLIGVRIALRGGSGLVRKVFMAITAALILKVAFDTFTALR >NZ_CP016783.1|WP_095689292.1|903731_905465_-|proline--tRNA-ligase MLRMSSLFLRTLRDDPADAEVPSHKLLVRAGYIRRIAAGIYSWLPLGVITLRNVENIIREEMDKAGFQEVHFPALLPKEAYEATNRWEEYGPSLFRLQDRKGGDYLLGPTHEEMFTLMVKGEYSSYKDLPLTIYQIQNKFRDEARPRSGIIRGREFVMKDSYSFDLTDEGLGISYDKHRDAYITTFDRLHLKYNIVKAVSGAMGGSKSEEFLAPCPTGEDTYVLCPKCGFAANVEAMVTKVTPADASGVPALEELDTPNTPTIDSLVDVLNLKFGGGFTGASTLKNVLLMADKKAISVLVPGDREVDLKRLQAGLPGVEELRVFEEADFAKHPGLVKGYIGPQDAGKFGITVYADPRVAPGTSWVTGANKKDRHARNVVNGRDFTPDAYVEAAEIREGDSCPECSTAVVIDRAIEIGHIFQLGRKYADALGLTVLDQNGKSQVVTMGSYGIGVSRAVAAIAEQSYDEIGLVWPLEVAPAKVHIVATGKEDLPFDTALDIATKLEASGITVMLDDRRDPSAGVKFKDAELIGIPVIIVVGKSLADGKIELRNRKSGEKSEVAVGDAISAVTTLIASLS >NZ_CP016783.1|WP_095671293.1|905486_906638_-|flavodoxin-dependent-(E)-4-hydroxy-3-methylbut-2-enyl-diphosphate-synthase MVDLGIPSAPPAPLHPRRKTKQLRVGSVGVGSDSPVSVQSMCTTLTADVNSTLQQIAELTAAGCQIVRVAVPSQDDADALAQIAKKSQIPVIADIHFQPKYIFAAIDAGCAAVRVNPGNIKQFDDKVKEVAKAAGDAGIPIRIGVNAGSLDPRLLAKYGKATPEALAESALWEASLFEEHGFSNLKISVKHHDPVTMVNAYRILAAKCDYPLHLGVTEAGPAFQATIKSATAFGILLAEGIGDTIRVSMSAPPVEEVKVGISILESLNLRQRKLEIVSCPSCGRAQVDVYTLAEKVQAGLEGMTVPLRVAVMGCVVNGPGEAREADLGVASGNGKGQIFVKGEVIKTVPEAMIVETLIEEAMRLAEEMEAAGVGSGEPSVTVK >NZ_CP016783.1|WP_095689293.1|906693_907842_-|site-2-protease-family-protein MQLVGILAFIFALLFSVMVHEFGHYLTARRYGMRVSEFFVGFGKRIWSRQRGETEFGIKAIPAGGYCRIDGMTPRDEMPEGEEGRAFYRASSGRKLIVLGAGSFLHFVLGYLLLFVLLAGVGVNQVLPVIDSVAANSAAAAAGFQKGDQIIAIDGDRSTDWQDQLLKIRNSEGRPLTFTIERGGVIQDISAAPRMTDIDDGTSRYVLGIINEFGTKRMSPVSSVTRAAELTWRFTSASATSLVQLPTKIPALWGQTFGGEERDQNGLVGVVGVARVSGQAAASGELDLAERLGTFILIVASLNIFVGLFNLLPILPLDGGHMAVAIADEIRALFARIRRKPRPAAIDVQVLTPITATVFVILAALTVLLLIADIFNPISLNF >NZ_CP016783.1|WP_095677874.1|907841_908987_-|1-deoxy-D-xylulose-5-phosphate-reductoisomerase MKDLVILGSTGSIGVQALEIVEANPSLFRVVAITAAGSNTELLISQAKKFNVPVIGVTKNADQIKAALPNATVIDGPLASTEIAAITCDVVLNGITGSIGLGPTLAALRVGNRLALANKESLVAGGELVMSLAKPDQLLPVDSEHSALWQSLMGGKKSEVSKLVLTASGGPFRDRTDLSAITVAEALNHPTWSMGPVVTINSATLVNKGLEIIEAHYLFAIPYSQIEAVIHPQSVVHSMVEYIDGSTIAQASPPNMKGPISFAINHPDRVKAATAPIDWTQSHTWTFAPIDNERFPAIDLARRCGALGGGLPAIFNAANEVCVASFISGKIGFTSIVETVEEVVQKLGGKSASAPRDLSDVSAIEEDARAIATELLQKAAR >NZ_CP016783.1|WP_095689294.1|909003_910002_-|agmatinase MANHGNMYGPSFTFLGIPQCDLDDPKSYADADVIIVGAPIDSGTSHRSGAKFGPQAIRGGDYLPHDAERPHLALRIDALKALKVYDAGDLMMPPGDLVSSLKVLEEATEKISRAGKIPVILGGDHSIASADVAGIAKHRGLGKISMIHFDAHADTGEDQFGALVGHGTPMRRLIESGHVRGDRFLQLGLRGYWPDDKTLNWMRDQGMRSYEMTEIHHRGLNKVLDESFATLTDGCDGVFLSVDIDVVDPGMAPGTGTPEPGGMTSRELLEAVRRICLELPVVGIDVVEVAPPFDSADITAILANRVVLEALSAIAKRRSGTPYSPTQNLLDR >NZ_CP016783.1|WP_095689295.1|910025_911438_-|FAD-dependent-oxidoreductase MANIDPSILNHLSQMQPELYWLDEDPLEPMPHPTLIGDVRSDLCIVGAGYTGLWTALLAKEANPEREVVVVEQRETGAGASGRNGGFCSYSLTHGFMNGYSRFKDEMAVIEKLGRENLDAIEATIKKYGIECNFEWNGELRVAVEDWQMQGLGEEAELRNSFGDNVELLSQEQVQARVKSPIYKGALWDPDGTALVDPARLVWGLERACISLGVKIFENSHVDRLERTKNGMIVHTPYGSVYATKVALATNVFKSLIKRAHKYVVPVYDFQLVTEPLTAEQLQSIGWKEREGLSDAGNQFHYYRMTKDNEILWGGYDAIYNFRGKVRQEYETDAETYAHLAEAFLETFPQLKGIKFTHGWGGAIDTCSRFSPFWGAAYGGRVAYVLGYTGLGVGSTRFGAQVMLDLLDGKDNERTRLKMVRKKPMPFPPEPLRFIFIRLTQWSINQADKHHGKRNLWLRLLDSLGLGFDS |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP016783_2 | 2.1|900439|33|NZ_CP016783|CRISPRCasFinder | 900439-900471 | 33 | MN010762 | Gordonia phage MintFen, complete genome | 38928-38960 | 7 | 0.788 |
NZ_CP016783_2 | 2.1|900439|33|NZ_CP016783|CRISPRCasFinder | 900439-900471 | 33 | MT553336 | Gordonia phage BlingBling, complete genome | 38532-38564 | 7 | 0.788 |
NZ_CP016783_2 | 2.1|900439|33|NZ_CP016783|CRISPRCasFinder | 900439-900471 | 33 | MN586031 | Gordonia phage Gambino, complete genome | 40801-40833 | 7 | 0.788 |
NZ_CP016783_2 | 2.1|900439|33|NZ_CP016783|CRISPRCasFinder | 900439-900471 | 33 | MN369740 | Gordonia phage Delian, complete genome | 38312-38344 | 7 | 0.788 |
NZ_CP016783_2 | 2.1|900439|33|NZ_CP016783|CRISPRCasFinder | 900439-900471 | 33 | KU998236 | Gordonia phage Blueberry, complete genome | 40801-40833 | 7 | 0.788 |
NZ_CP016783_2 | 2.1|900439|33|NZ_CP016783|CRISPRCasFinder | 900439-900471 | 33 | MN096365 | Gordonia phage Samba, complete genome | 41150-41182 | 7 | 0.788 |
NZ_CP016783_2 | 2.1|900439|33|NZ_CP016783|CRISPRCasFinder | 900439-900471 | 33 | MK801724 | Gordonia phage PhrostedPhlake, complete genome | 38359-38391 | 7 | 0.788 |
NZ_CP016783_2 | 2.1|900439|33|NZ_CP016783|CRISPRCasFinder | 900439-900471 | 33 | MK864263 | Gordonia phage Melba, complete genome | 38357-38389 | 7 | 0.788 |
NZ_CP016783_2 | 2.1|900439|33|NZ_CP016783|CRISPRCasFinder | 900439-900471 | 33 | MK878896 | Gordonia phage Begonia, complete genome | 40314-40346 | 7 | 0.788 |
NZ_CP016783_2 | 2.1|900439|33|NZ_CP016783|CRISPRCasFinder | 900439-900471 | 33 | MT723935 | Gordonia phage Azula, complete genome | 40801-40833 | 7 | 0.788 |
NZ_CP016783_2 | 2.1|900439|33|NZ_CP016783|CRISPRCasFinder | 900439-900471 | 33 | KX557274 | Gordonia phage CaptainKirk2, complete genome | 38314-38346 | 7 | 0.788 |
NZ_CP016783_2 | 2.1|900439|33|NZ_CP016783|CRISPRCasFinder | 900439-900471 | 33 | MH153808 | Gordonia phage Petra, complete genome | 37577-37609 | 7 | 0.788 |
NZ_CP016783_2 | 2.1|900439|33|NZ_CP016783|CRISPRCasFinder | 900439-900471 | 33 | MT723936 | Gordonia phage Hitter, complete genome | 38117-38149 | 7 | 0.788 |
NZ_CP016783_2 | 2.1|900439|33|NZ_CP016783|CRISPRCasFinder | 900439-900471 | 33 | MH020241 | Gordonia phage Fenry, complete genome | 39822-39854 | 8 | 0.758 |
NZ_CP016783_2 | 2.1|900439|33|NZ_CP016783|CRISPRCasFinder | 900439-900471 | 33 | MN234161 | Gordonia phage Toast, complete genome | 40072-40104 | 8 | 0.758 |
NZ_CP016783_2 | 2.1|900439|33|NZ_CP016783|CRISPRCasFinder | 900439-900471 | 33 | NC_031265 | Gordonia phage Guacamole, complete genome | 38541-38573 | 9 | 0.727 |
NZ_CP016783_2 | 2.1|900439|33|NZ_CP016783|CRISPRCasFinder | 900439-900471 | 33 | MK967389 | Gordonia phage JasperJr, complete genome | 38541-38573 | 9 | 0.727 |
1. spacer 2.1|900439|33|NZ_CP016783|CRISPRCasFinder matches to MN010762 (Gordonia phage MintFen, complete genome) position: , mismatch: 7, identity: 0.788
cggacgtggaactgcaggtgttgttccaccaga CRISPR spacer cgccgggggacctgcaggtgttcttccaccagg Protospacer ** * *** *********** *********.
2. spacer 2.1|900439|33|NZ_CP016783|CRISPRCasFinder matches to MT553336 (Gordonia phage BlingBling, complete genome) position: , mismatch: 7, identity: 0.788
cggacgtggaactgcaggtgttgttccaccaga CRISPR spacer cgccgggggacctgcaggtgttcttccaccagg Protospacer ** * *** *********** *********.
3. spacer 2.1|900439|33|NZ_CP016783|CRISPRCasFinder matches to MN586031 (Gordonia phage Gambino, complete genome) position: , mismatch: 7, identity: 0.788
cggacgtggaactgcaggtgttgttccaccaga CRISPR spacer cgccgggggacctgcaggtgttcttccaccagg Protospacer ** * *** *********** *********.
4. spacer 2.1|900439|33|NZ_CP016783|CRISPRCasFinder matches to MN369740 (Gordonia phage Delian, complete genome) position: , mismatch: 7, identity: 0.788
cggacgtggaactgcaggtgttgttccaccaga CRISPR spacer cgccgggggacctgcaggtgttcttccaccagg Protospacer ** * *** *********** *********.
5. spacer 2.1|900439|33|NZ_CP016783|CRISPRCasFinder matches to KU998236 (Gordonia phage Blueberry, complete genome) position: , mismatch: 7, identity: 0.788
cggacgtggaactgcaggtgttgttccaccaga CRISPR spacer cgccgggggacctgcaggtgttcttccaccagg Protospacer ** * *** *********** *********.
6. spacer 2.1|900439|33|NZ_CP016783|CRISPRCasFinder matches to MN096365 (Gordonia phage Samba, complete genome) position: , mismatch: 7, identity: 0.788
cggacgtggaactgcaggtgttgttccaccaga CRISPR spacer cgccgggggacctgcaggtgttcttccaccagg Protospacer ** * *** *********** *********.
7. spacer 2.1|900439|33|NZ_CP016783|CRISPRCasFinder matches to MK801724 (Gordonia phage PhrostedPhlake, complete genome) position: , mismatch: 7, identity: 0.788
cggacgtggaactgcaggtgttgttccaccaga CRISPR spacer cgccgggggacctgcaggtgttcttccaccagg Protospacer ** * *** *********** *********.
8. spacer 2.1|900439|33|NZ_CP016783|CRISPRCasFinder matches to MK864263 (Gordonia phage Melba, complete genome) position: , mismatch: 7, identity: 0.788
cggacgtggaactgcaggtgttgttccaccaga CRISPR spacer cgccgggggacctgcaggtgttcttccaccagg Protospacer ** * *** *********** *********.
9. spacer 2.1|900439|33|NZ_CP016783|CRISPRCasFinder matches to MK878896 (Gordonia phage Begonia, complete genome) position: , mismatch: 7, identity: 0.788
cggacgtggaactgcaggtgttgttccaccaga CRISPR spacer cgccgggggacctgcaggtgttcttccaccagg Protospacer ** * *** *********** *********.
10. spacer 2.1|900439|33|NZ_CP016783|CRISPRCasFinder matches to MT723935 (Gordonia phage Azula, complete genome) position: , mismatch: 7, identity: 0.788
cggacgtggaactgcaggtgttgttccaccaga CRISPR spacer cgccgggggacctgcaggtgttcttccaccagg Protospacer ** * *** *********** *********.
11. spacer 2.1|900439|33|NZ_CP016783|CRISPRCasFinder matches to KX557274 (Gordonia phage CaptainKirk2, complete genome) position: , mismatch: 7, identity: 0.788
cggacgtggaactgcaggtgttgttccaccaga CRISPR spacer cgccgggggacctgcaggtgttcttccaccagg Protospacer ** * *** *********** *********.
12. spacer 2.1|900439|33|NZ_CP016783|CRISPRCasFinder matches to MH153808 (Gordonia phage Petra, complete genome) position: , mismatch: 7, identity: 0.788
cggacgtggaactgcaggtgttgttccaccaga CRISPR spacer cgccgggggacctgcaggtgttcttccaccagg Protospacer ** * *** *********** *********.
13. spacer 2.1|900439|33|NZ_CP016783|CRISPRCasFinder matches to MT723936 (Gordonia phage Hitter, complete genome) position: , mismatch: 7, identity: 0.788
cggacgtggaactgcaggtgttgttccaccaga CRISPR spacer cgccgggggacctgcaggtgttcttccaccagg Protospacer ** * *** *********** *********.
14. spacer 2.1|900439|33|NZ_CP016783|CRISPRCasFinder matches to MH020241 (Gordonia phage Fenry, complete genome) position: , mismatch: 8, identity: 0.758
cggacgtggaactgcaggtgttgttccaccaga CRISPR spacer cgccgggagacctgcaggtgttcttccaccagg Protospacer ** * .** *********** *********.
15. spacer 2.1|900439|33|NZ_CP016783|CRISPRCasFinder matches to MN234161 (Gordonia phage Toast, complete genome) position: , mismatch: 8, identity: 0.758
cggacgtggaactgcaggtgttgttccaccaga CRISPR spacer cccccggcgatctgcaggtgttcttccaccagg Protospacer * ** ** *********** *********.
16. spacer 2.1|900439|33|NZ_CP016783|CRISPRCasFinder matches to NC_031265 (Gordonia phage Guacamole, complete genome) position: , mismatch: 9, identity: 0.727
cggacgtggaactgcaggtgttgttccaccaga CRISPR spacer caccgggagacctgcaggtgttcttccaccagg Protospacer *. * .** *********** *********.
17. spacer 2.1|900439|33|NZ_CP016783|CRISPRCasFinder matches to MK967389 (Gordonia phage JasperJr, complete genome) position: , mismatch: 9, identity: 0.727
cggacgtggaactgcaggtgttgttccaccaga CRISPR spacer caccgggagacctgcaggtgttcttccaccagg Protospacer *. * .** *********** *********.
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
631244 : 639415
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP016783|631244:639415|DBSCAN-SWA GATGAGCACTCTAGAAATCCGCGGACTTAAAGTCTCTGTTGAAACCGAAAGCGGTTCAGTAGAAATTCTCAAAGGCGTAGACCTTACTATTAAATCAGGTGAAACCCACGCGATCATGGGACCAAACGGTTCAGGTAAATCAACGCTTGCCTACTCAATTGCGGGGCATCCTAAGTACACAATCACCAGTGGCACAGTCACGTTAGATGGTGCAGATGTTCTTGAGATGACCGTCGATGAGCGGGCTAAGGCCGGTCTTTTCTTGGCGATGCAATATCCAGTAGAAGTTCCTGGCGTATCTGTCTCTAACTTCCTGCGCACTGCAGCAACTGCACTACGCGGTGAAGCGCCTAAGTTACGTACTTGGGTCGGCGAAGTTAAAGGCGCGATGGAAGCGTTAAGCATGGATCCATCATTTGCACAACGCAATGTCAACGAAGGTTTCTCTGGCGGAGAGAAGAAGCGCCACGAAATTATGCAGCTAGAACTACTCAAGCCAAAGATGGCGATTCTCGATGAGACAGATTCAGGCCTAGACGTTGATGCACTTCGCATCGTCTCTGAGGGCGTCAACCGCGCGAAGGCTGCCAATGATCTGGGCGTTCTATTGATCACGCACTACACCCGAATCCTGCGCTACATCAAGCCTGATTTCGTTCACGTATTCGCCAACGGCAAGATCGTTGAACAAGGTGGACCAGAACTTGCAGATAAGCTCGAAGCAAATGGCTACGCGGAGTATGTAACTGCGTAATTTCGCAATTACTTATATACATGTCATTTGATGCAGTCGCTATCCGCGCGGATTTTCCGATCTTTGATCGCAAGATCCGCGATGGAAAGCGTCTGGTCTATCTAGATAGCGGTGCAACAAGCCAGAAACCGAACGTTGTTATCGATGCGGAGAGCAATTTCTACCGCTTACATAACGCTGCCGCTCATCGCGGTGCTCACCAGTTAGCTGAAGAAGCTACTGAAGGTATTGAGTCTGCACGGCGGATTGTTGCTGACTTCCTTGGTGGGAAAGAAGATGAGATCGTCTTCACGAAAAGTTCAACGGAGGCGCTCAACCTCTTGGCCTATTCGATCGGCGCTGCACCTGTTGGAAATAAGTTCCATCTAAAAGCTGGTGATGAGATCGTGATTTCCGAGATGGAACATCACGCCAATTTGATTCCTTGGCAACAACTAGCAGCGCGCACAGGTGCGGTTCTTAAATGGTTTGAGGTAACTCCTGAGGGACGCTTAGATCTCTCAAATATCGATTCTGTGATTACGACGAAGACCAAGATTGTTGCGCTAACTCACCAATCAAATGTTCTCGGAACTATTGTTCCATTGGATGATCTGGTAAAGCGCACCCACGATGTAGGCGCCGTTTTCATCCTCGATGCTTGTCAATCAGTGCCGCATATGCCAGTTAACGTGGCAGATCTAGATATCGATTTCTTAGCCTTCTCTGGCCATAAAGCAGTTGGACCAACTGGAGTTGGAGTTCTCTGGGGAAGAGCGAAGTTGCTAGATGAACTTCCGCCATTCCTATTTGGCGGATCGATGATCGAGTCAGTGACTATGACTGAAGCAACTTGGGCACCAGCACCTCGTAAATTTGAACCAGGCGTTCCAAATATGGCGCAGATTGTTGGTTTAGGGGTCGCACTCAAGTACTTAACAAAGATTGGAATGGGTGCGATCCATAAACATGAAATCGAACTCACTAAATATGCGCTAGATGCAATGTCTGGTATTGATTCACTTCGTATTCTCGGCCCTCTAAATACGGATATGCGTGGGGGAGCGGTTTCCTTCACACTCGGTGAAATTCATCCGCATGACTTAGGACAATTTTTAGATGATGCGGGAATCGCAGTTCGCACGGGACATCACTGCGCCTGGCCGCTAACTCGCAAGATGAACGTTGCCGCTACAACCCGTGCAAGTTTCTATCTCTACAACACCACCGAAGATATCGATCAACTTTGTCAATCGATCCTTGCTGCGCAGAAATACTTTGGCTAATGCAACTCGATAACCTTTATCAAGAAGTGATCCTTGATCACTACAAGAACCCGCAGAACAAGAAGCTATCAGCGACTTACGACGCACAGGTTCACCACGTAAACCCTTCATGTGGCGATGAGATCGATCTAAATATCACCGTTAAAGATGGTGCAGTGGCAGCAATCACTTGGGATGGCGTTGGTTGTTCGATCTCTCAAGCCAGCGTCTCGGTTCTCACAGACCTGCTAATCGGTAAGAGCATTTCAGATGCTTACCGAATATTTGATGAGTTTGTCGCTCTAATGCAGAGTAAAGGTGCGGTGACTGGAGATGAAGATCTCCTGGAAGATGCAGTCGCCTTTGCTGGCGTCTCTAAATATCCCGCACGAATCAAATGTGCCCTGTTGGGCTGGATGGCATTTAAAGATGCCAGCGTTCAAGCCCAATCCAAGGGCTAGTAAGATCTAACCAGAGCAATCTACAAAATGGAGAAAATAATGGCTGCAACAGTTGATGATGTAACAGAGGCGATGAAGGATGTCGTCGATCCCGAGCTCGGAATCAATGTTGTGGATCTTGGCCTTATCTACGATGTAATGGTCGACGATAACAATATCGCCGTACTAAATATGACTCTGACATCTGCGGCTTGTCCATTGCAGGATGTAATCGAAGATCAAACTCGTCAAGCACTCGCACCTCATACTACGGATGTAAAGATCAACTGGGTATGGATGCCGCCTTGGGGTCCAGATAAGATTTCAGATGATGGGCGCGAACAACTACGTGCGCTTGGTTTTACCGTCTAACCTAAATTAGCGTTAGCGATCTAGCAAGTTATTTGATCGGAAGCTTTTTATGTCAATGCATGCTGGCTGGATGGCGCTTCGCACTTTAAGTTCTGATCAATCTGTAAAAAACGCCAAGTTAAAGCCCGGCACGTTAAAGCGAATATTTGCCTATGCGATCCCATACAAATCGGTCTTTGCGCTCTTTCTAATCTGTTTAATTGCCGACGCGGTTCTAACCGTGGCCACCCCGCTTCTGCTTCGCGAACTTATCGATAACGGAGTAATTCCCAAAGATCGCTCAGTCGTAACAACTATGGCGATAGCGGTTGCGCTCCTTGCTATCGCCAGCGCTCTGGTAAATATCGTCGTACGTTGGATTTCAGCGAAGATCGGCGAAGGTCTTATCTATGATCTTCGCTCTCAAGTCTTTCGCCATGTTCAAGAGCAGTCGATCGCATTCTTTACGCGCACTCAGACTGGTGCGTTAATTTCCCGAATTAACTCCGATGTTATTGGAGCCCAACGCGCATTTACATCCACATTTTCCGGAATCATCAGCAATGTTCTGACCCTGGTCTTGGTAGTTGGAACTATGTTGGCGCTTAGTTGGCAGATCACCGTAGCCTCGCTTCTCCTGCTGCCAATTTTCTTAGCTCCAACAAAGTGGATCGGTTCGCGCTTACAGGGCTACACCCGAGATTCATTCGAAGTAAATGCCCAAATGTCATCGACGATGACCGAACGTTTCAACGTATCCGGGGCGCTCTTGGTAAAGCTTTATGGTGATCTGAATCAGGAATCAAAAGAGTTTAAGACCAAAGCACGTAAAGTCGCCGATATCGGTATCTCAATGGCGATGCTCAACACCTTCTTCTTCATTGCGTTGATAAGTATTGCTGCTCTTGCCACTGCAATTGCTTATGGCATCGGCGGACATCTAGCGATTGATGGTTCGATAACCGTTGGAAGTTTGATTGCAATAACCACTTTGCTCGCTCGGCTTTACGGTCCACTGGTTGCTCTGGCCACAGTGCGCGTGGATGTTATGGGCGCACTGGTCTCTTTCGAGCGAGTCTTTGAAGTCTTAGATCTCGAGCCAATGGTTAAAGAGATTGCAGCACCGAAGAAGCTGGAAAGTAAGACTCCCGCAATTACTTTTAGCGAAGTGCGCTTTACCTATCCGAAAGCACAAGAGATCTCTCTTGCCTCCCTTGAAGCCGCTTCTCTACCTGAGGTAAAGGACTCTGAAGAGATTCTCAAAGGAATCTCATTTGAAGTAAGACCTGGAACGGTAACAGCGCTAGTTGGTCCGTCGGGTGCAGGTAAGACAACTATCAGCGCGCTACTTCCAAGACTTTACGATGTTACAGGTGGCTCTATTTCGATTGATGGTGAAGATATCCGTAACTTCACCCTTAATTCACTTCGTGGATCTATTGGTGTTGTTATGCAAGATTCACATCTTTTCCACGCAACAATTACCGAAAATCTAAGATATGCAAAGAGCGATGCCACACTCGATGAGATGAAGGTCGCTTGTGAAGCAGCGCAGATTTGGGATCTAGTCTCATCTCTTCCAAACGGATTAGAAACTATGGTTGGAGAGCGCGGCCACCGTTTATCAGGCGGCGAAAAACAACGTTTGGCAATTGCGCGGTTACTGTTGAAATCTCCAAGCATCGTAATTCTGGATGAAGCAACCGCTCACCTTGATTCTGAAAATGAAGAGTTAGTTCAAAAGGCGCTAGCGCAAGCGTTATATGGACGCACCAGTATTGTGATTGCGCATCGCTTGAGCACCGTCATGGGCGCAGATCAGATAATCGTTCTTGAAAATGGATCCATCGCCGAACACGGAAAACATGAGGAGCTAGTGCTAACTGGTGGGCTCTATTCCGAGCTCTTTGCTCGTCAAGATTTGACCACGCCTAATTAAGATTCTGCCGTAGATAATTAGCCCGGCGAAGAAGATCCATTGCAGCGAGTAAGCAATATGTGGCCCATCGCTTAGTTCAGGCACTTGCGCAGGCACAGCTGGAGTTAGTGATGGTTGCGAACCGCTCAGTAAATCTAAGTAGAAACCTTCGGATGTCGAATCTGATTGGGCATTTAACTTCGAGATCATTCCTGCGCCACTTGCAGGAAGTGCGAAGAAAGCGCCTTGTGGAAGTGATCTATCTAAGCGCAGTCGTGCAGTTAGAGAAACTGCACCGGTCGGGGCAGTAGAGACAACTGGTGCGGTAGCTGCATCCTTGCCAGCTTTTACCCATCCACGATCAACCCAGAAAGACCGGCCATCGCTTGCGGTAAATCTAGTTAGTACCTCATAGCCATAGACGCCTTCAAAGTATCGATTCTTAAGCAGAATTTGATTGGTTGAATCGAAAGAACCTTCTACCGTAACCGTGCGCCACTCGTACTTTACAAAATTATCTTTTATCTCATTCAATTCGACTGGAGTAAGAGTTAACTGCGCCTCAATACCTTGGTTACGATCTTGGCGATCTATACCGCGCTGAAACTGCCACTGACTGGCCCAGAGACATCCCAAGATTAGTAATAGCGCGAGAAGATGCTTTAGAAAAGTATTAGATTCCTTCTCTGAAGAGCGCTCACTAGACATGTATTGAGTCTAGCCAATCGATAATTCTAGAAAGATTATTTACTCTAGGGATTTCGCTTTATCTGAAACCAAGCCAGAGCCCGGCGCAAAAGACTCACGACCAGCATTTGCGATGACTACTGCAAAATATGGAAGCGTTACTGCGCCTAGGAGTGCAAACCAGCGATAAGGACTAGGCAGAATCACCGTCAGAATGAAGCAAAAGGTACGGATCATCATTGAATAGAAGTAGCGACGTTGACGTCCTGCTTGATCACGAGTCAGTGATGCAGGAGCGGCGGTGATGTCATAAACCGCAGGAGCCTTACTTGCTGACTTGCGTCGGATCGCCATAGCCCAAGAGTAAGGCGCATCTCCGTGGCGCGCCCGCCGAAAACGGCAAGTTACGCGTAAGTAGCAGGGTAAGTTTTGTCATATGTCCGAGGAATCAAAAGCAATCGCTCTGGTCACCGGCGGAAATCGCGGTATTGGATTAGCGATCGCGCAATCACTTCAAGAAGATGGCCATCACGTAGTAGTCACCTATCGCAGTGGCAGCGCACCAGCAGGATTTAGCGCAGTGCAGATGGATGTGACTTCCTCCGAAAGCGTTGATGCCGCCTTTGCTGAAATTGAGTCGCAATGGGGGTTTCCTGAAATTATCGTTGCCAACGCTGGCATAACTAAAGATGGTTTGGTTATGCGAATGTCAGATGAAGATTTTGAGAGCGTTATCAACGCCAATTTAACTGGCGCCTTCCGCGTAGCAAGACGAGCGACCAAGGGTCTGCTTAAGTTAAAGCGCGGACGACTTATCTTTATTGGCTCAGTAGTCGGGGGAGTCGGAGCCGCTGGCCAGGTTAACTATTCGGCAAGTAAATCCGGTCTGCTTGGCATGGCTCGTTCTTTTGCACGCGAACTTGGTTCCCGCGGTATTACCGCAAATGTGATCGCTCCAGGATTTGTCGAAACAGATATGACTGCAGCCCTTGATGAGAAACGACGTACTGAAATCGCCGCTTCTGTGCCGCTCGGTCGTTTCTGCTCCGCAGATGAGATCGCAAAAGTTGTTAGCTTTGTGGCATCACCTGCAAGTGGTTACATCACAGGCGCTTTGATTCCAGTAGATGGTGGATTAGGAATGGGACATTAAATGGGAATTCTTCAAGGTAAAAATATTCTGGTTACTGGCGTATTAACAGATGGCTCGATCGCTTTCCATATTGCCAAGATTGCGCAGGAACAAGGCGCCAACGTACTGCTCTCTTCTTACGGTCGCGTAATGAGTTTGACTACTCGTATTTCAGGTCGCTTGCCGCAACCAGCTCCAGTCATTGAGCTCGACGTTACAAATCAAGAGCATCTCGATGCTCTTGCCGGTCGTGTCAAAGAACATTTCCCACATCTTGATGGCGTAGTACATTCAATTGGTTTCGCACCTGAAGCAGCGCTTGGTGGAAATTTCTTAAATACCGCTTGGGAAGATGTCGCAACTGCCGTACATGTTTCTGCGTATTCACTTAAATCTCTAACAATGGCGGCACGTCCACTGTTCAACGGTGGCGGCTCTGTTGTTGGACTGGATTTCGATGCCACTGTTGCCTGGCCAAAGTACGACTGGATGGGCGTTGCAAAGGCAGCGCTTGAGTCGACATCTCGTTACTTAGCCCGCGATTTAGGCGCTGAAAATATTCGCGTTAACTTGGTTGCAGCAGGTCCGATTCGCACCATGGCTGCAAAGTCGATCCCGGGCTTTGATGAATTTGAAAAGGTCTGGAATGAACGCTCTCCACTGGAATGGGATGTAACTGACCCAGTTCCTGCCGCCCAGGCTGTCGTGGCTCTACTTTCTGACTGGTTCCCGAAAACAACGGGCGAAATCGTGCATGTAGACGGCGGTTTGCACGCGATGGGCGCTTAAAGGTAAAGTACGCCTCAGACCAGTGGCTTTGGCTACTGCAAGAGGAGCTCACCGCTCCCACCTTCTCGGCCCTAGAACTAGTACTAAATGTAACTAGAGGAGTTGGCTGGTTTGTCGAAAGTAAGCCGCCTTACGGGCGCGTTTTCCTTTCCTTAGCGGTGCGCGATTTCTGCGGTAAACCCTCAAGCTACTAAAGTCAAAATCGACAGAACAACGTTCAAAAGAGAACAGCGTTCAAGAAGTAGCAAACCAACTAAGGAGCACAAATCACAACTGAACCGCGCATCAATGACCGTATCCGCACACCTCAAATTCGTCTCATCGGCCACACCGGTGATCAAGTTGGAGTTGTAGATATTGAAGTAGCGCTACAGATGGCAGATGAAATTGGTTTAGATCTCGTAGAGATCGCACCAGAGGCAAATCCACCTGTATGCAAAATCATGGACTTCGGCAAGTACAAGTACGAAATTGCTCAGAAGGCTCGTGAAGCTCGTCAGAACCAGACCCACATTGTGGTGAAGGAAGTGCGATTGACACCAAAGATTGAAAATCACGATTACGAAACTAAACGTAACGCAATTGTGAAATTCCTAAAGGGTGGCGACAAGGTAAAGATCACGATGAAGTTCCGCGGACGCGAACAAACTCGCCCCGAGCTTGGTTTTAAATTGCTACAACGTCTTGCAGAAGATGTTGCAGAAGTTGCTTTTGTTGAGTTCGCCCCTAAGCAAGAGGGTCGAAATATGACGATGGTGCTGGGACCGACGAAGAAGAAGACTGAAGCGGTTGCAGAGCAGAAGGCAGCGCGCGCAGCCAAGGAAAAGGCAGCGCTCGAGGCGGCAGAAGAGAAGTAA
Protein sequences of DBSCAN-SWA_1 >NZ_CP016783|631244:639415|631244_631997_+|WP_095689094.1|DBSCAN-SWA MSTLEIRGLKVSVETESGSVEILKGVDLTIKSGETHAIMGPNGSGKSTLAYSIAGHPKYTITSGTVTLDGADVLEMTVDERAKAGLFLAMQYPVEVPGVSVSNFLRTAATALRGEAPKLRTWVGEVKGAMEALSMDPSFAQRNVNEGFSGGEKKRHEIMQLELLKPKMAILDETDSGLDVDALRIVSEGVNRAKAANDLGVLLITHYTRILRYIKPDFVHVFANGKIVEQGGPELADKLEANGYAEYVTA >NZ_CP016783|631244:639415|632017_633262_+|WP_095689095.1|DBSCAN-SWA MSFDAVAIRADFPIFDRKIRDGKRLVYLDSGATSQKPNVVIDAESNFYRLHNAAAHRGAHQLAEEATEGIESARRIVADFLGGKEDEIVFTKSSTEALNLLAYSIGAAPVGNKFHLKAGDEIVISEMEHHANLIPWQQLAARTGAVLKWFEVTPEGRLDLSNIDSVITTKTKIVALTHQSNVLGTIVPLDDLVKRTHDVGAVFILDACQSVPHMPVNVADLDIDFLAFSGHKAVGPTGVGVLWGRAKLLDELPPFLFGGSMIESVTMTEATWAPAPRKFEPGVPNMAQIVGLGVALKYLTKIGMGAIHKHEIELTKYALDAMSGIDSLRILGPLNTDMRGGAVSFTLGEIHPHDLGQFLDDAGIAVRTGHHCAWPLTRKMNVAATTRASFYLYNTTEDIDQLCQSILAAQKYFG >NZ_CP016783|631244:639415|637790_638558_+|WP_095689101.1|DBSCAN-SWA MGILQGKNILVTGVLTDGSIAFHIAKIAQEQGANVLLSSYGRVMSLTTRISGRLPQPAPVIELDVTNQEHLDALAGRVKEHFPHLDGVVHSIGFAPEAALGGNFLNTAWEDVATAVHVSAYSLKSLTMAARPLFNGGGSVVGLDFDATVAWPKYDWMGVAKAALESTSRYLARDLGAENIRVNLVAAGPIRTMAAKSIPGFDEFEKVWNERSPLEWDVTDPVPAAQAVVALLSDWFPKTTGEIVHVDGGLHAMGA >NZ_CP016783|631244:639415|635911_636658_-|WP_095689099.1|DBSCAN-SWA MSSERSSEKESNTFLKHLLALLLILGCLWASQWQFQRGIDRQDRNQGIEAQLTLTPVELNEIKDNFVKYEWRTVTVEGSFDSTNQILLKNRYFEGVYGYEVLTRFTASDGRSFWVDRGWVKAGKDAATAPVVSTAPTGAVSLTARLRLDRSLPQGAFFALPASGAGMISKLNAQSDSTSEGFYLDLLSGSQPSLTPAVPAQVPELSDGPHIAYSLQWIFFAGLIIYGRILIRRGQILTSKELGIEPTS >NZ_CP016783|631244:639415|637073_637790_+|WP_095689100.1|DBSCAN-SWA MSEESKAIALVTGGNRGIGLAIAQSLQEDGHHVVVTYRSGSAPAGFSAVQMDVTSSESVDAAFAEIESQWGFPEIIVANAGITKDGLVMRMSDEDFESVINANLTGAFRVARRATKGLLKLKRGRLIFIGSVVGGVGAAGQVNYSASKSGLLGMARSFARELGSRGITANVIAPGFVETDMTAALDEKRRTEIAASVPLGRFCSADEIAKVVSFVASPASGYITGALIPVDGGLGMGH >NZ_CP016783|631244:639415|633261_633702_+|WP_095689096.1|DBSCAN-SWA MQLDNLYQEVILDHYKNPQNKKLSATYDAQVHHVNPSCGDEIDLNITVKDGAVAAITWDGVGCSISQASVSVLTDLLIGKSISDAYRIFDEFVALMQSKGAVTGDEDLLEDAVAFAGVSKYPARIKCALLGWMAFKDASVQAQSKG >NZ_CP016783|631244:639415|634108_635971_+|WP_190279229.1|DBSCAN-SWA MHAGWMALRTLSSDQSVKNAKLKPGTLKRIFAYAIPYKSVFALFLICLIADAVLTVATPLLLRELIDNGVIPKDRSVVTTMAIAVALLAIASALVNIVVRWISAKIGEGLIYDLRSQVFRHVQEQSIAFFTRTQTGALISRINSDVIGAQRAFTSTFSGIISNVLTLVLVVGTMLALSWQITVASLLLLPIFLAPTKWIGSRLQGYTRDSFEVNAQMSSTMTERFNVSGALLVKLYGDLNQESKEFKTKARKVADIGISMAMLNTFFFIALISIAALATAIAYGIGGHLAIDGSITVGSLIAITTLLARLYGPLVALATVRVDVMGALVSFERVFEVLDLEPMVKEIAAPKKLESKTPAITFSEVRFTYPKAQEISLASLEAASLPEVKDSEEILKGISFEVRPGTVTALVGPSGAGKTTISALLPRLYDVTGGSISIDGEDIRNFTLNSLRGSIGVVMQDSHLFHATITENLRYAKSDATLDEMKVACEAAQIWDLVSSLPNGLETMVGERGHRLSGGEKQRLAIARLLLKSPSIVILDEATAHLDSENEELVQKALAQALYGRTSIVIAHRLSTVMGADQIIVLENGSIAEHGKHEELVLTGGLYSELFARQDLTTPN >NZ_CP016783|631244:639415|638824_639415_+|WP_095671062.1|DBSCAN-SWA MTTEPRINDRIRTPQIRLIGHTGDQVGVVDIEVALQMADEIGLDLVEIAPEANPPVCKIMDFGKYKYEIAQKAREARQNQTHIVVKEVRLTPKIENHDYETKRNAIVKFLKGGDKVKITMKFRGREQTRPELGFKLLQRLAEDVAEVAFVEFAPKQEGRNMTMVLGPTKKKTEAVAEQKAARAAKEKAALEAAEEK >NZ_CP016783|631244:639415|636697_636991_-|WP_095671059.1|DBSCAN-SWA MAIRRKSASKAPAVYDITAAPASLTRDQAGRQRRYFYSMMIRTFCFILTVILPSPYRWFALLGAVTLPYFAVVIANAGRESFAPGSGLVSDKAKSLE >NZ_CP016783|631244:639415|633741_634053_+|WP_095689097.1|DBSCAN-SWA MAATVDDVTEAMKDVVDPELGINVVDLGLIYDVMVDDNNIAVLNMTLTSAACPLQDVIEDQTRQALAPHTTDVKINWVWMPPWGPDKISDDGREQLRALGFTV |
10 | Cedratvirus(16.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
989782 : 999475
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP016783|989782:999475|DBSCAN-SWA TTCATAGAGCCACCTTAAGTATCGCGCTAACGCTGGCATCAAGATCTGCAGTGCCATCGATTGTAATCTCAGGTTTTGCTGGGCGCTCGTAATCTTGACCTACGCCAGTGAAGTTAGGGATCTCTCCCTTGGCAGCCTTCTTATACAGACCCTTTGGATCGCGAGAGATACATACATCTACTGGTGTATCTACGAATACTTCAATGAAATCACCTTCTGCAAATATCGAACGTGCGCGCTGGCGATCAACTTCAAATGGGCTAACTAGCGCTGTAATAACAATTAACCCGGCATCGACCATTAGTTTGGCAACTTCAGAGACTCTACGAACATTCTCTGCACGATCTTCGGGGGTAAAGCCCAAATCAACATTGAGACCCATACGTAGATTGTCACCATCTAATACATAAGCATGTGCACCTTGAGCATGAAGGCGTTTTTCAAGAAGGTTGGCGATCGTAGATTTTCCTGAACCCGATAGACCTGTTAACCAGATAACTTTTGCGCGCTGATTTTTAGCCGCTTCGCGAGCAGTTTTATCGACCTCGTAGGCCTGTGCGGTGATGTTTTCACCACGGCGTAATGAGTGGCGAATCATGCCGGCGCCAACTGTTTGGAAGTTGGTGCGATCTACCAGGATAAAGTTTCCGAATTCGCGAGATTGTTTATATGGATCTAGTGCGATCGGTTGGTTTGTTGCAATTTCAACCTCGCCAATTTCGTTCATGGCCAGGGTATTTACTGAGAGGTGCTCACCTGTATGAACGTCAATCTTGTGACGAATCTTGGTGATGATTACTGGAGTCTCGCTAGGGCCTGATACGAGAAGATATGAACGGCTATGAATCAAGGCATCTTCATTTAGCCAGACTAGGTTTGCGTTAAAGCGATCAGATGGAGTTACAGCGCTTTGATTGGCGTAAACCAGATCACCGCGAGTTATATCTACCTCTGGCTCGAGTACCAGCGTCACCGCATCCGACTGAGCGGCAACTTTTTGGCTCTGCATGTCAGAAATTATCTGCGCAATCGTTGCTACTTGATTGCTAGGAAAGATCTTTACCTGATCGCCTAATTTGAAGTCACCTAAGTGAACCGTCCCGGTAACACCTCTGAAATCTTCGGCACGAGCGATAGATTGAACGCGCAGGCGAGCAGTTGCTTCTGCGGATTTGCCTGGTTGCCAACCTTGAATCGCCTCAAGCAAAGTCTCGCCTGAATACCAAGGCGTTGAATTACTGCGATAAACCACGTTATCGCCAGCTAGCGCACTAAGCGGAATAAAGTGAACGTCGGCGATATCAAGGCGCTCTAGAGTCTTTTCAATCTCACTCTTTATCTCGCTATAGATACTCTCTTGAAACCCAACGGCATCTAATTTATTGATCGCAACTAATACTCGCTGGACTCCCATTAGAGAACAGATAGTTAGATGGCGCAAAGTTTGGGTACGCACTCCCTTTGTAGCATCGACTAAAACTATTGAGATCTCAGCGCGTGATGCGGCAACAGCCATATTGCGTGTGTATTGCTCGTGTCCTGGAGCATCAGAGATGATTAAACGTCGTCCATCAAGAAGTGACATCGAGCGATATGCCACATCAATTGTGATTCCTTGTTCGCGCTCTGCTTCAAGGCCATCTGTAAGCAAACTAAAGTCAATTTCACCCGCTGGAATAATCGACCCTGCACGACGTACTTTTCGCGTTGAATCAATCGTGTCATGAGGAACGCTATCGGTCTCAACCAATAGACGTCCGATCAAGGTGCTCTTTCCATCATCAACGCTTCCACAAGTTAAGAGACGAACAATTGGGGTCTGCACTAGAAGTACCCCTCTTTCTTCTTCTTCTCCATCGAATCTTCATTTGCGCCGTCAATTAAACGCGTTGCTCGTTCGCTCAGCTTCACCTTTAAAACCTCGTCAACAACTTCAGAGATCGTTGTTGCAGTTGATTCAACGGCTGCGGTAAGCGGGTAGCAACCGAGAGTTCTAAAGCGAACCATCTTTGGCTCTGGAACTTCTCCAGGGTTAAGTGGGTAGCGATCATCATCGACCATAATTAATTGGTCACCGCGTTTAACCATGGGGCGATTTTTTGCAAAGTAAAGAGGATTTACTTCAATTTTCTCTTGTTGGATATAGCGCCAGATATCGCCTTCGGTCCAATCTGAAAGTGGAAAGACGCGCATTGTCTGACCAGGCGCTAAACGAGTGTTGTAAGAACGCCAGAGTTCGGGGCGTTGATTACGTGGGTTCCAGCTATGGCCTTCATCGCGAACGCTAAAGATTCGTTCCTTTGCGCGAGATTTTTCTTCATCACGGCGTGATCCCCCGATAGCGGCATCTACTTCATAGTGATCGAGCGCTTGCCGTAGTGCAACAGTCTTCATGATGCGCGTGTATTCAGAAATACCGGTAGTAAATGGATTTGCTCCGGAAGTTGCGCCCTCTTGGTTTGTGTGAACAATCAAATTCAATTTGTGATCGGCGACAATCTTGTCGCGGAAGGAGATCATCTCTTTAAACTTCCATGTTGTATCGATATGTAACAACCGGAAAGGAACTGGGGCGGGATAGAAAGCTTTGATCGCTAAATGGAGAAGAACGGAAGAATCTTTTCCAATGGAATACATCATCACGGGATTTCGGAAGGATGCGGCGGTCTCGCGCAGAATCTCTATCGATTCAGCCTCAAGGGCCATTAAAATTCCTTCATTCACATGCACGCGCCTACGCTACTGGTTAATCCTGTGATCTACCTGTATTTCTTCTGGCAGCGCGGGCAAAAATGCGATGATCTCGCACCAAATGTGATCCGGCGAATAGGTGTTCCACAACGCGGGCATGGCTCATCGGTACACCCATAAGCCGCAAGGGCTTGTTCGAAGAATCCACTTTCGCCATTTACATTTATATAGAGATCGTCAAACGATGTGCCACCTTGTTCGATTGCTTCACTCATAACCTCTGTTGCATAACTAATTATCGAAGCAAGCTTCTTCTTGGAAAGATCGGCTGTAGATACTTCTGGATGAACCTTGGCGCGCCAAAGAGTTTCATCGGCATAGATATTTCCAACACCGCTCATGATTTCTTGGTTTAGTAGCGCGGTTTTAATTCGGATCTTTCGCTTTGCAAACTTCTCAATCGTCTCTTGCAGATCAAATTGCGAATCAAAGGGATCGCGCGCAATGTGCTGGGCTGAACTTGGCACGCCATCAATTGTCTCTTCAACTGATACCCAGCCGAAGGTTCGCTGATCATTAAAAACTAGTTCGCGCTTGCGCATCTGCTTGGTCAGGTCAAACTGTGCGCGTACATGCTTTGCTCTGGGATGGCCTTGCTGATGAATTAAGAATTGCCCACTCATTCCCAAATGAGCAACTAATACCTGTGGGCGATCCAAAACAAACCAGAGAAACTTGCCACGTCGATTTAAATCGATGATCTTGGCGCCCTGAATAGATTTTAGCGGTGCGATTGATTCAGGTTTAAGCGCGCGTGGATGTAAATCATGCGCCCCAACAATTTTGTAACCTTTAACCAGGCTGACTAAACCGCGGCGAACCGTTTCTACTTCTGGGAGTTCGGGCACAACTTAACCTTTTTTTATCACGCTAAGTGCGGCAAGTGCGCTGTCATAACCCTTATCTTCTTTGCTTCCAGGAAGGCCAGCGCGCGCAGTCGCTTGTTCTAGGTTGTCGCACATAAGTACTCCAAAGCCAATTGGCTTAGATCTGCTTAGCGAAACATCCATTACGCCCTGAGTGAGGCCCTGACAGACATAATCAAAGTGCGGAGTTTCACCACGAAGAACAAGACCGACAATGACAGCAACGTCAAAGCCTTCATCAAAGGCTAACTGTGCAGCAAGTGGAAGTTCAAAAGAGCCAGCAACTGTGCGAACTACAGGATTAGCTATACCCGACTCTTTAAAACCACGAACTGCACCGGCAACTAGCGCATCACATATATCTGGATGCCAAGCAGCGGTGATGACAATTGCCTTGAGATGTGGCATTTGTGGAATCTTTACTTCGGGGGCATGTCCAGCCATTAGAGCGCTCCTAACGCGTGGTTTAACTTATCTCGCTTGGTTTCTAGATACTTCTGGTTATGTTTGTTAGCAATTACTTCAAGTGGTGCGGTAGTTAGAGAAATTCCAGTTGCCTTTATCGCGGCAATCTTTTCAGGATTGTTGGTCAATAAAGTTAGTGATTTTACTTTTAAGCGGTTAAGGATCTCTGCCGCATCTTCATAGGTGCGATCATCTACAGGCTGACCGATTGAAATATTTGCTTCAACGGTATCTTGGCCAGAATCCTGCAACGCGTAAGCGCGAATTTTCTCAGCAAGACCTATGCCGCGGCCTTCGTGATCGCGAAGATAGATGACATATCCGTGGCCGTTTTCTTCAATCAGGCGAATTGCTTCTTGGAGTTGCGGACCACAATCACAGCGCTTGGATCCGAATACATCTCCAGTTAAGCATTCGGAATGAACGCGCACTACCGGCTGACTACTTGGAGTCTGGGCAAATTTAAGAACTGCGTGTTCTGCTCCAGTTGGTGAAATATATGTGGAGATCTGCCATTGCGCGCTGCCAAGTGGAAGCTCTGCCCATTCAAGGCTTGCTATATCAGCGCTCTTCTTTTCGGGCAGTACGCGAGATAGTTCATCGATAGAGATCACTTCTAAATCATGCGCTTTGGCAAATTCGGTTATCTGAGAACCGCGCATCATCGATCCATCATCATTTACTAATTCAGAGAGAACTGCACAAGGATTAGATCCAGTTAGTTCGCATAGAGCCATACTTGCCTCGGTATGGCCACGACGTTGGTGAAGTCCACCCGGTACTGATATCAGCGGAAACACATGTCCTGGTCGTGCAAAATCTGTCGGCAGTGAACTGGCATTAGCAAGTGCACGCAGTGTGTTTGCTCGCTCGGCTGCCGGAATACCTGTCGAGCGGCCAGCGATTGCATCGACCGTAACGGTGAAAGCTGTTGTGTGGTTATCTTCATTACGCTTAACCATAAGTGGAAGTTTTAGCTTCTTGGCAGTTTCCTCAGTAATGATTCCGCAGATGACACCAGATGTGTATCGAACCATAAATCCGATATTTTCAGGGGTGGCTTTATCAGCGAGGAGAATTAAATCTGCTTCGTTCTCGCGATTTTCATCATCGGTTACGACGATGAACTTGCCCGCCTTGAATTCAGAGAATACTTTTTCTAAATTACTCATTTGTCATCCTGTCCACGTGCAATAAGTCTTTCAACGTACTTTGCTAGAACATCTACCTCGACGTTGAGGGGATCTGAAATCTTTTTGGCAGCAAGATTGGTCTTAGCTAGAGTTTCTGGGATTAACCAAACAGATACTTGATCAAGCCCATCGTTTAGTTCGCCAACCGTCAGCGATACACCTTCAATACAAATAGAACCTTGTGCAACCACGTACTTCATAAGGTGGGCAGGCACTGCAATATCCATGCGCGTCCACTCTTTATCAGCCGAGATTCCAGCAACTTTCGCTACGCCATCAACATGGCCCTGCACGATATGGCCACCAAGGCGATCTTGTGTGCGGGTAGCAAGTTCCAGGTTTACAGGAGAACCAACACCAAGTGCGCCCGTTGAAGTTAGGTTGAGCGTTTGCATCATTACATCTACTTCAAAACTCTGGCTATCAAAAGATGTAACAGTTAAACAGACACCGTTAACGCTAACCGAATCTCCTTGGGCTATTTGAGCGCAAATTTCGGCGCTCTTAATCTGCAACTTGGCGCTGCTTTCCTGGCGCTCAATTGCAGAAACTTGGCCGACTGCTTGAATTAGCCCTGTAAACATTAGCGCCCACCTTCTTCTTGGTTTGCGCTCTTTATAAGTGTTATCTTCAAATCTGCACCGATTACTTCAACATCTGCGATCTCGAAATCTAAACGCTCTGAAATAGTGGCAACTCCCAAATCTGAAATTGATGGTGTGCCCGAGCCAAAGAAAGTTGGCGCCTGATACAGAATTACTTCATCGATTAAGCCTGACTTCAAGAGCGCAGTTCCAAAGGTTGGTCCAGATTCAATTAGCAATTGGTTAAAGCCACGTGCCTTAGCCAAGTTGATTAGCTCTTGCAGATCACGAGATTTGATGGCAATGGTTTCTGCATCGGCGTTAAGAATCTGCGAGCTTGAAGTAATTTCAGAGCTACCCATCACAATACGGACCGGGTTCTTGCCGGCACCTTTACTGGTGAGCAAAGGATTGTCAGCCTTTACCGTTGTTGTAGATGTAACGATGGCATCGGCTTGTGAGCGAAGTAGCGCAACATCTGCTCGCGCTACTTCGCTAGTAATCCATTTGGAAGTGCCATCGGCGGCGGCAACTTTGCCATCCATGGTCGATGCGATCTTCCAGGTGATTCGCGGGCGCTCTTTGGCAATCTTGGTTATCCATGCGCGGTTATCAAAGGCTGCCTCACTTTCCAGCAGCCCTTGCTCTACATCAATTCCAGATGCGCGAAGGTACTCGCCTCCCCCAGAGGCAATCGGATTTGGATCGCTAACTGCATAAACAACTTTTTTGATTCCGGCGGCAATAATCGCTTGTGTGCAGGGTGGAGTTTTGCCCTGGTGGTTACATGGTTCAAGAGTTACATAAATGATTGCGCCCGCTGGAATACTCTTTGCCGCATTTATCGCAACTACTTCAGCATGGTCTGCGCCCTGATGAAAACCTTCGCTTATGAATTCATCGGTAGAGCTGGTTATAACTGCGCCAACAATTGGGTTTGGGAAAGTCTTACCAAGGCCAGCGCGAGCACACTCGATGGCGCGCGCCATCGCGGCTTCAGCACTTAACACTTGGTTCATCTTCAACACCCTTTATCGAGATTTGGTGTTGCATCTCGGGGTGGAAAGGCAGTGGCAGCCGACGCATTAGCCCTTTAAACAAAGTTAATTGCACAAAGGGTGCGACAGCTACACGTTGGTTTTCTCTCATCCGGACTTTAACCGTCGGTTCGGGAATTACACCCAATCCACCGGTCATTGGCTATGACCGGGTCGCAGACTTTCACTGCCGGTTCGAAATTACATCGACCCCGAAAAACAACACTCTTATTCTAGCGCCCCGCTTTGAGGGCTTTCATCCAAATCTCATAAAGTGGTTTTGCCGCCGCCTGTGTAGCGGTCATTACGCCACCTTTTTCAGGAAATAGGTTCTGTTTTCCATCTTCATCCCAGACTGCGCCACCTGCTTCAGCAACGATTAAGAGCGATGCCAAATGATCAATCGGACTGAAATGTCCAACGACTGCGCCAACACCGCGCGCAAGGGCCGGACCAACGATCGTTAGCGTGCCAGAGCCCATGATGCGCATTGTGCAATATTGCTCCGCAAGTCCATCAAGTAACCCAAGCATGCCGGGCCATGGTTGATATGCCGCAAGTTCGGTTGAAACTATTCGTGAAGCTAGTGGATTTTCAACGCCACTTTGATCTTCAATAACTAGCGGTTTGCCATTTCGTTTCGCACCTTTGCCACGCTGTGCTTCATACAACTCATCACGCCATGGATCTATAACAACGCCAACTAAGACATCGCGATTATGTGCAAGTCCGAAACAGAATGAAGTCCAAGGTAAACGGTTTGCAAAGTTAGTTGTTCCATCGATTGGATCGAGATACCAAGATGGAGTATCGGCAAGGTAAGCACCGCCCATCTCTTCGCCAACGAAGTTATGTCCAGGCAAGTTTGCCGCAATAACTTCACGTACGTGGGCTTCGATCATTACATCAATCTCGGTAACGATATCGGCGGCATTTGTCTTTAACTCAATTTTTCCAGGATCCATATTGCTAGCGACTAGAATCGCCCACTTGCCTAGATCGCGGGCGATAACCATCGCGCGGTCTAAATCAATATCTTCTAACTTCATCTTCCGTGGTCCACCTGTATCCATGGTTGGATTTTCAGCGATCACCTTTTGTAATTGAGTTAATTGATTAGAGGTAACTGAATCAACGCCGATAGCTGCTGCCCAGCGCATGGTGGCCTCGTCATCAACCGTCCAGACTGCAACTTTAAAACCTGATTCGTGGAGTGCCTTAACAGACTTCTGAGTAACAAATGAGTAATGCAAATTTATAAATTCGGGCTGCAAAATATCGGTCTCAGCCTTTGTAGGAAGCGCCAACTTATCCCACGGCATCCAGATGCGAGCTTTTGGTGATAGCGAACGGATGGTGTGCATTCCATCGAAATTTCCGCACCAAAATATTTGTTCTTCAGGAAGTGGGCCACCGGCAACAACCTCATAGGTTGGCTTTGCTGGCTCTGGTTGTTCCATATCGATCATCAAAATTGATTTGCTACCGACAAATAACTTTAAGACATCAATTAACAGCGGAATGCGATCTTCACCATGGCCGAGTTGAGAAATCTGCGCCCAATCCATCTCGGTGCTCTCTTTGGATATTCCCCAAAGGCGTTCTAGGTTTGAGTCATGTAGAACAATTACCTTGCCATCGCGAGTTATACGGACATCAATTTCAACGACTTCAGCGCCGGCGTCAATTGCGCTTTGGATAGCAACAATTGTATTCTCACGAAAGCGGCTGGAATCTCCGCGATGTGCGCAGGCAAAAGTTTTGGATTTGGATTCAGACATGGGTTAAGCAGTTCTAACTAGCTTGCCATCCTTGTAAATCAGAGCCTTGTTTATCTTTACAAAGCCAGAATCACCAACTTTAGGAACATCGACTGAAATGAAGGCGCGAAGTGATGCATCAGCGCGCTTTAAGTAGACCTCATGGAAGTGGCCACGAGGAACAACTAATTCGACCGTTGCTGGATAACTACCAGGAGCCTGAGTCTTCGAGAATGCAACATCTTCGGGGCGAATTCCAACTACTTCGTTTCCTGAAACAACTTGTCCATCGCTGTAACCGCCTTCGACGCGATTTAAACTACCGATAAAGTTGGCAACGTATTCGGTTGCTGGACGATCGTAGAGTTCTTGCGGAGTGCTGAACTGTTCAATTCGACCATCGTTCATAACTGCGATGCGATCTGAGATAGAGAGGGCTTCATCTTGATCGTGGGTCACAATGACAGTTGTAATTCCGAGGCGACGCTGTAGATCGCGGATATCTTCACGAACCTTTACACGCAACTTAGCGTCAAGATTTGAAAGTGGCTCATCTAGAAGCAAGATATCTGGCTCTTGTACTAAAGCACGCGCTAGAGCAACACGTTGCTGTTCTCCACCAGAGATCTGTCCTGGGCGAGAATCTTTATGGTGCATCAAATTAACAACTTCAAGTACTTCGTGAACTCGCTTTTCGATCTCGGCTTTACCCATCTTGCGGATCTTTAGCGGGTAAGCAACATTTTTAAATACTGTCATATGTGGCCAGAGCGCATAGTTCTGGAAAACCATCGCACTAGGGCGCTTTTCCGGACCAAGGTTAGTTACATCTTTACCTGCCAGATTTACAGTGCCTTTTTCAGGGATTAAAAAGCCTGCGATCATACGAAGGGTTGTTGTCTTACCGCATCCTGATGGGCCTAGCAGTGAGACCATCTCGCCTTTTTCAACGGTTAAGTTGAGATTATCAATGATGACGCGTCCACCAAGAATCTTGGTTAAGCCAATTAGCTCTAGACCTTGCGTTGCGTTGCTCAT
Protein sequences of DBSCAN-SWA_2 >NZ_CP016783|989782:999475|993841_995035_-|WP_095689349.1|DBSCAN-SWA MSNLEKVFSEFKAGKFIVVTDDENRENEADLILLADKATPENIGFMVRYTSGVICGIITEETAKKLKLPLMVKRNEDNHTTAFTVTVDAIAGRSTGIPAAERANTLRALANASSLPTDFARPGHVFPLISVPGGLHQRRGHTEASMALCELTGSNPCAVLSELVNDDGSMMRGSQITEFAKAHDLEVISIDELSRVLPEKKSADIASLEWAELPLGSAQWQISTYISPTGAEHAVLKFAQTPSSQPVVRVHSECLTGDVFGSKRCDCGPQLQEAIRLIEENGHGYVIYLRDHEGRGIGLAEKIRAYALQDSGQDTVEANISIGQPVDDRTYEDAAEILNRLKVKSLTLLTNNPEKIAAIKATGISLTTAPLEVIANKHNQKYLETKRDKLNHALGAL >NZ_CP016783|989782:999475|993383_993842_-|WP_095689348.1|DBSCAN-SWA MAGHAPEVKIPQMPHLKAIVITAAWHPDICDALVAGAVRGFKESGIANPVVRTVAGSFELPLAAQLAFDEGFDVAVIVGLVLRGETPHFDYVCQGLTQGVMDVSLSRSKPIGFGVLMCDNLEQATARAGLPGSKEDKGYDSALAALSVIKKG >NZ_CP016783|989782:999475|998461_999475_-|WP_095689353.1|DBSCAN-SWA MSNATQGLELIGLTKILGGRVIIDNLNLTVEKGEMVSLLGPSGCGKTTTLRMIAGFLIPEKGTVNLAGKDVTNLGPEKRPSAMVFQNYALWPHMTVFKNVAYPLKIRKMGKAEIEKRVHEVLEVVNLMHHKDSRPGQISGGEQQRVALARALVQEPDILLLDEPLSNLDAKLRVKVREDIRDLQRRLGITTVIVTHDQDEALSISDRIAVMNDGRIEQFSTPQELYDRPATEYVANFIGSLNRVEGGYSDGQVVSGNEVVGIRPEDVAFSKTQAPGSYPATVELVVPRGHFHEVYLKRADASLRAFISVDVPKVGDSGFVKINKALIYKDGKLVRTA >NZ_CP016783|989782:999475|989782_991606_-|WP_095689347.1|DBSCAN-SWA MQTPIVRLLTCGSVDDGKSTLIGRLLVETDSVPHDTIDSTRKVRRAGSIIPAGEIDFSLLTDGLEAEREQGITIDVAYRSMSLLDGRRLIISDAPGHEQYTRNMAVAASRAEISIVLVDATKGVRTQTLRHLTICSLMGVQRVLVAINKLDAVGFQESIYSEIKSEIEKTLERLDIADVHFIPLSALAGDNVVYRSNSTPWYSGETLLEAIQGWQPGKSAEATARLRVQSIARAEDFRGVTGTVHLGDFKLGDQVKIFPSNQVATIAQIISDMQSQKVAAQSDAVTLVLEPEVDITRGDLVYANQSAVTPSDRFNANLVWLNEDALIHSRSYLLVSGPSETPVIITKIRHKIDVHTGEHLSVNTLAMNEIGEVEIATNQPIALDPYKQSREFGNFILVDRTNFQTVGAGMIRHSLRRGENITAQAYEVDKTAREAAKNQRAKVIWLTGLSGSGKSTIANLLEKRLHAQGAHAYVLDGDNLRMGLNVDLGFTPEDRAENVRRVSEVAKLMVDAGLIVITALVSPFEVDRQRARSIFAEGDFIEVFVDTPVDVCISRDPKGLYKKAAKGEIPNFTGVGQDYERPAKPEITIDGTADLDASVSAILKVAL >NZ_CP016783|989782:999475|995031_995640_-|WP_095689350.1|DBSCAN-SWA MFTGLIQAVGQVSAIERQESSAKLQIKSAEICAQIAQGDSVSVNGVCLTVTSFDSQSFEVDVMMQTLNLTSTGALGVGSPVNLELATRTQDRLGGHIVQGHVDGVAKVAGISADKEWTRMDIAVPAHLMKYVVAQGSICIEGVSLTVGELNDGLDQVSVWLIPETLAKTNLAAKKISDPLNVEVDVLAKYVERLIARGQDDK >NZ_CP016783|989782:999475|996910_998458_-|WP_095689352.1|DBSCAN-SWA MSESKSKTFACAHRGDSSRFRENTIVAIQSAIDAGAEVVEIDVRITRDGKVIVLHDSNLERLWGISKESTEMDWAQISQLGHGEDRIPLLIDVLKLFVGSKSILMIDMEQPEPAKPTYEVVAGGPLPEEQIFWCGNFDGMHTIRSLSPKARIWMPWDKLALPTKAETDILQPEFINLHYSFVTQKSVKALHESGFKVAVWTVDDEATMRWAAAIGVDSVTSNQLTQLQKVIAENPTMDTGGPRKMKLEDIDLDRAMVIARDLGKWAILVASNMDPGKIELKTNAADIVTEIDVMIEAHVREVIAANLPGHNFVGEEMGGAYLADTPSWYLDPIDGTTNFANRLPWTSFCFGLAHNRDVLVGVVIDPWRDELYEAQRGKGAKRNGKPLVIEDQSGVENPLASRIVSTELAAYQPWPGMLGLLDGLAEQYCTMRIMGSGTLTIVGPALARGVGAVVGHFSPIDHLASLLIVAEAGGAVWDEDGKQNLFPEKGGVMTATQAAAKPLYEIWMKALKAGR >NZ_CP016783|989782:999475|995639_996659_-|WP_095689351.1|DBSCAN-SWA MNQVLSAEAAMARAIECARAGLGKTFPNPIVGAVITSSTDEFISEGFHQGADHAEVVAINAAKSIPAGAIIYVTLEPCNHQGKTPPCTQAIIAAGIKKVVYAVSDPNPIASGGGEYLRASGIDVEQGLLESEAAFDNRAWITKIAKERPRITWKIASTMDGKVAAADGTSKWITSEVARADVALLRSQADAIVTSTTTVKADNPLLTSKGAGKNPVRIVMGSSEITSSSQILNADAETIAIKSRDLQELINLAKARGFNQLLIESGPTFGTALLKSGLIDEVILYQAPTFFGSGTPSISDLGVATISERLDFEIADVEVIGADLKITLIKSANQEEGGR >NZ_CP016783|989782:999475|992540_993380_-|WP_095671380.1|DBSCAN-SWA MPELPEVETVRRGLVSLVKGYKIVGAHDLHPRALKPESIAPLKSIQGAKIIDLNRRGKFLWFVLDRPQVLVAHLGMSGQFLIHQQGHPRAKHVRAQFDLTKQMRKRELVFNDQRTFGWVSVEETIDGVPSSAQHIARDPFDSQFDLQETIEKFAKRKIRIKTALLNQEIMSGVGNIYADETLWRAKVHPEVSTADLSKKKLASIISYATEVMSEAIEQGGTSFDDLYINVNGESGFFEQALAAYGCTDEPCPRCGTPIRRITFGARSSHFCPRCQKKYR >NZ_CP016783|989782:999475|991605_992487_-|WP_095689645.1|DBSCAN-SWA MALEAESIEILRETAASFRNPVMMYSIGKDSSVLLHLAIKAFYPAPVPFRLLHIDTTWKFKEMISFRDKIVADHKLNLIVHTNQEGATSGANPFTTGISEYTRIMKTVALRQALDHYEVDAAIGGSRRDEEKSRAKERIFSVRDEGHSWNPRNQRPELWRSYNTRLAPGQTMRVFPLSDWTEGDIWRYIQQEKIEVNPLYFAKNRPMVKRGDQLIMVDDDRYPLNPGEVPEPKMVRFRTLGCYPLTAAVESTATTISEVVDEVLKVKLSERATRLIDGANEDSMEKKKKEGYF |
9 | Bacillus_phage(28.57%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1101235 : 1110156
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP016783|1101235:1110156|DBSCAN-SWA ATTATCGAACTGTTACCGATTTAATTTCAATTGGCTGTACCGGAATTCCATCGCCCGCGTAGTAAGCGTTTCCGGTTGATTGATCAACTTGGTATGCCCCAACTTTTTCAATGCGCAGCAGGAGCGGCAAACCAGTTTTAATCTTTCCCCAGATTGTGTAGCTCGGTGGCAGCGTTGTGTCCTTATAGACCAAGAAGAACTGGCTGCCGTTGGTATTTGGGCCAGAGTTAGCCATGGCTACTGTTCCTGCGGGATAGGTAAGAATCTTCTTAGTTGGAAGGTTCTCGTCTTTGTAACCTTTCCAAGAACCAGGTGAACCATTTCCTTGCGCAGAAGGATCTCCGCATTGCAGAACGTAGATACCTTCAGTAGTTAGGCGGTGACAGAAAGAGCCATCAAAATATTTTGAGCGCGCAAGTGTTGCGAGGTTGGTAACGGTTTGAGGCGCTGCTGGATCTAGCTCGATAGTAATGACGCCACAATTTGTAGTAATCGTCATCGTTTTAGCAAGTTTCTTATCAACGCTAGTTGGCTGCTTAAGCGTTGCTGGCACATGCGCCTTCGCAGTTGGCTTAGCGCAACCCTTAACTGAAGTTGGGCGCTCAGCGGCTGTCGAAGTAACAGGAGCTAACGAAGCCACGGCGATTGCCAAGGCACAGAGCGCAGACATTCTTTTCATAGTCGAGTTCCTACTCCCATTCGATCGTTCCAGGCGGCTTACTAGTTACATCAAGTACAACGCGATTTACTTCGCGAACCTCATTGGTAATTCTCGTAGAAATCTTCTCTAGAGTTTCGTAAGGCACGCGCGACCAATCTGCAGTCATCGCATCTTCACTTGATACTGGACGCAAAACTATTGGGTGGCCATAAGTACGGCCATCACCTTGCACACCAACGCTGCGCACATCCGCCAAGAGAACTACCGGGCATTGCCAGATCTGACGATCAAGTCCTGCCGCCTTTAGCTCTTCGCGCGCAATTAAATCTGCATGGCGCAGAATTTCCAGGCGATCTTGCGTTACTTCACCGATGATGCGGATACCAAGACCCGGACCTGGGAATGGTTGCCGCCAAACGATTTCTTCAGGAAGACCGAGTTCTAAACCGACCTGACGAACTTCATCTTTAAACAAAGTACGAAGTGGCTCTACAAGAGTGAACTTCAAATCATCTGGAAGACCACCAACGTTGTGGTGTGACTTGATATTTGCAGTACCTGTACCGCCACCTGATTCAACAACATCCGGGTAAAGGGTGCCCTGAACTAGGAATTCAACATCTCCCCCAGCGGCAATGTCGCGCGCTGCCTTTTCAAAGGAGCGGATAAATTCACGGCCAATAATCTTGCGCTTAGTTTCCGGATCAGTAACACCAGCAAGGGCACTTAAAAATTGATCAACCGCATCTACAACAACTAGATCAATACCGGTGGCTGCTACAAAGTCTCGCTGTACTTGTTCGGATTCGCCACTGCGCAACAATCCGTGATCTACGAATACGCAGGTCAACTGTGATCCGACTGCGCGCTGCACGATTGCGGCAGCAACTGCTGAATCAACGCCACCTGAAAGTCCGCAGATAACGCGCTTGCTGCCAATGAGCGCCTTCGCTTTAGCAACTTCATTCTCGGCAATGTTTCCGGTTGTCCAGTTTGGTGTGCAACCGGCAATATTGATCAACCAGTTTTTCAGGATTGCCTGGCCATGTTCTGAGTGAAGAACTTCGGGGTGGAACTGCACGCCAGCGATTTTTCCGGTGGCATCTTCAAATGCTGCGATAGGAGTATCAGCTGTTACTGCACAGATTGTGAATCCAGCAGGTGCTTGTGTAACGGCATCGCCGTGCGACATCCAAACGCTCTGTGACGCAGGTAGCGTTGCAAAGATCTTCGAACTTACTTCTGCCTTAATTTCAGTGCGGCCGAATTCTGATTTACCTGTCTGGCTTACAACGCCGCCAAGTGCTGCCGCCATCGCTTGGAAGCCGTAACAAATTCCAAATACTGGAATACCAAGTTTGAGAATTTCAGCATCGAACTTTGGCGCGCGACCTGCATAAACGCTTGAAGGCCCGCCCGATAGAACAATTGCTGATGGCGCCTTGGCCGAAACTTCTGCAGCGGTGATATGTGATGGAACAATTTCCGAGAAAACATTTGCTTCGCGAACTCGGCGTGCGATCAACTGCGCATATTGCGCACCGAAATCAACGACCAGTACGCCGTTCTGCGAATTACTCATTTACGCTCCGTGAATGATTACTTCTACGCGCTGGAATGACTTAACATCTGAATAACCAGAAGTTGCCATTGCGCGACGAAGTGCGCCAAAGAGGTTCATTGAGCCATCAGATGTATGTGATGGTCCGTTAAGAATTTCTTCGAGTGTGCCAACGGTGCCAACGTTTACGCGTTCGCCACGTGGAAGTTCTTGGTGATGTGCTTCAGATCCCCAATGCCAGCCGAGTCCGGGCGCCTGTGTTGCCTTCGCAAGTGGGGAGCCCATCATTACTGCATCTGCGCCAACGGCAATTGCCTTTGCGATATCGCCAGAGCGACCAACTGAACCGTCTGCAATAACGTGAACATATCGTCCACCTGATTCATCTAAATATTCACGGCGCGCTGCAGCAACTTCAGCAACAGCAGATGCCATCGGAACTTCGATACCTAAAACTTTACGAGTCGTGTGTGCAGCACCGCCACCGAAACCAACTAGAACACCAGCTGCTCCTGCGCGCATCAAGTGCAGCGCACCTGTTGCAGTAGCAACTCCACCAACAATTACTGGCACATCGAGTTCGTAGATGAATTTCTTAAGGTTTAGCGCTGAGTCATCGTTACCAACGTGCTCTGCAGAAACTGTTGTTCCACGAATTACGAAGATATCTACACCAGCATCGATAACTGCCTTGTGAAGTTCAGCGGTGCGCTGTGGGGAAAGTGATGCCGCAACTGTGACACCTGAGTCGCGAATTGTTTTGATGCGCTCTTTGATTAACTCTGGGCGAATTGGCGCGCTGTAAATTTCTTGCATGCGACGAGTGGCATGCTTATCTGGCATTGATGCTATTTCACCAAGTGGAATACGTGGATCGTCATAACGAGTCCAGAGGCCTTCTAGGTTTAGAACGCCGAGTCCGCCGAGCTTTCCAATTTCAATTGCAGTTTCAGGAGAAACAACAGAGTCCATTGGCGCTGCCATCATTGGAATATCAAATTTATAAGCATCGATCTGCCAAGCAGTTGAAACATCTTCGGGGTTGCGAGTACGGCGGCTTGGAACGATGGCGATATCGTCAAATGAGTAAGCCTGGCGGGCGCGTTTTCCTGGTGCGATCTCGATATCCATGGCGGCCTTCCTTAGCTCTTCTTGCTGTAATTAGGGGCATCGGCAACGTGCAAGACATCGTGAGGATGGCTCTCTTGTAGCCCTGCTGCAGTAATTTGAATCAAGCGACCTTCACGGCGCAGCGTTTCGATATCGGGTGCGCCGGCATAACCCATTCCGCTGCGCAGACCGCCAACGAGTTGGTGCACAACTTCAGCAACTGGGCCGCGGTAGGCAACTCTTCCTTCGATACCTTCAGGGACAAGTTTGTCTTCAGATAAAACATCGTCTTGCATGTAGCGATCTTTGGAATATGACTTCTGTTCTCCGCGAGATTGCATCGCACCTAGTGAACCCATGCCGCGATATGCCTTGTACTTTCGGCCATCGATTTCAACGAGTTGTCCTGGTGATTCTTCGCAACCGGCAAGAAGTGAACCAAGCATTACTGAATTTGCGCCAGCAACAATTGCCTTAACAATGTCACCTGAATATTGAAGACCACCATCTGCGATTAGCGGAATGCCGGCCTTGTTACATGCCTTAGCTGCTTCCATGATTGCGGTGATTTGTGGAACACCGACACCTGCAACAACGCGCGTTGTGCAGATTGATCCGGGGCCAACGCCAACCTTTACCGCATCAGCACCTGCGTTAATTAGCGCTTGTGCGCCTGCACGCGTTGCGACGTTGCCACCGATAATTTCGATAGTGGACGAAAACTTTTTTATGCGTTCGATCGCATCTAATACAGCGCGGTGATGGCCATGTGCGGTATCAACAACAACTACATCTACGCCGGCTTCGATTAAAGCCTGTGCACGGGCAAAACCATCATCGCCAACGCCAACTGCTGCACCTGCAAGGCCAACGTTCTTAACTAACTTAACGTGGGTTACTTGTTCTTCGATCGGCAGATTGCGGTGAATGATTCCGATTCCGCCGGCCTTGGCCATAGCGATCGCCATCGTTGATTCGGTAACGGTATCCATCGCAGATGAAATGACCGGAACGGCAAGTGAAATATTTCGAGTTAACCAGGTTGCGGTATTTACTTCGCTGGGAACAACCTCAGAGGCATCTGGCAATAAGAGCACATCGTCGTAAGTCAGCCCGAGCATCGCGACTTTGTCGTTATCGCTGCTCATGGGCCCTCCGAACTATCGTTGAGCGCCCTAGTCTAACTTGAGATTTAGGCTCCGATCGCCTGCTTAATTTGCTCGCAAGCCAGGGTTAAATCATCGGCAAAGGCCACATCAGCGCATTGTTCGCGGTCCCAACCAGGTCCACCCAAGATATAGCGCGGTGCTGGGCGAATCGCAGGAGTGTCGCGGAAGTACATTGGGTCGCCATTTTTTGAAAGCTGTGCCCAAAGGAAAACAGCAGGTGGGGCGCTGCGGGTTACGACATTTGATAGCGCTTCAAGTGGAGTTCGCGCACCTAAGAAATGACACTCCACTTTTATATCAGAGAGGGCGGCATGTAAGGCATGAAGGGCCAAGGTGTGAAGTTCTTCGCCAACAGATGCGAGTAATACAGGGCGAGAATTGATTGGCGTTTTCATCGACTTGGTGCGATCTCGTAAAATTGATGTAACGATCTCAGAAAGTAAGTGCTCTACTTCAATACCAGTGCCGGTTTCAGCCCAGGCATTTCCAACTATGACAAGAACTGGAACAATGACTTCCTGCCAACTATCAATAACCCCGTATTTATCAATATCTTTTCGCAGTATGGATTCAACAAAATTTCGATCGAGCGAATTAGCCGCATTAACAATTGCGGCAACAACATCTTCGCGAACTTCAAAAGTCTTTAAAATCTTTTCTACTTTTACTTCGCCCTTATGCGCCAGCGCTTGTTCGGCTGCTTCAGCGGGTGCAACGCCTGCTGTGATTAACCGGCGCATCAGAGTTAACTTGGCGAGATCACTGGGACAATATCTGCGGTGTTCGCCGGCCTCGTGGCTGGAAGGTCCCAGGCCATAACGGCGAGCCCAGGTCCGCAGAGTGGCAGGGGCTACGCCTATTTGGCGGGCAACGGCTGCAACCGTCAGGAGCTCTTCCATACGCCTATGGTGGCGTGAACTGGCCCACCCCACCACTTGAAACACGCTCGCACAGTGATGAAACGAACAACTTTTGAACCGATAGGTACTTGAACAACCTATGGCGCGGTTGTATGGTTGGGCTACAACAAGGCAAGGAGAAAAACATGGCTGAACTATCACGTCTACCGCTTCCTATCGCAGATCATTGGGAATGGCAGTACGAAGGCGCCTGTCGCGAACTTCCTGCTGAGATGTTCTTTCATCCAGATGGGGAACGCGGCCCACGTCGTCGCAATCGCGAGAATGCAGCGAAGGCTGTCTGCGCAACTTGCCCAGTTATCCAGGCATGCCGTGCACACGCGCTAGCAGTGCAAGAGCCTTACGGAATCTGGGGCGGTTTATCTGAAGATGATCGCGCAGTATTGCTTGGCAAAGAAGGGCCAACGATTTACGAAGCGTCATAAAGATTTAATTAAAAACCCCTCTAACCAAATCGGTTAGCGGGGTTTTTTAATTCCTTACTTAGTGGCTGTGCCCACCGGGACCATGGCTGTGGCCATGTCCGCCAGCTGCATCTGCAACTTCATCTGCTGGACGCTCATATACAACTGCTTCAGTTGTAATGAACATCGCTGCAATTGATGCCGCGTTTGCAAGTGCTGAACGAGTCACCTTAACTGGATCGATTACGCCATCTTTTGCAAGATCGCCGTAGACATCAGTTGCTGCGTTAAAGCCTTCATGGGCCTTAAGCGCGCGAACCTTTGCAACTACAACGTAGCCTTCTAGGCCAGCGTTCTCTGCAATCCAACGAAGTGGTTCATCGCAAGCTTTGCGAACTAGGCGAACACCGACAGCCTTATCGCCAGTGAATCCGAGATCGCCTTCAAGAGCATCAGCTGCATGCACAAGTGCTGCGCCTCCGCCGATAACGATTCCTTCTTCAACTGCTGCGCGAGTTGCTGAGATCGCATCTTCAAGACGGTGCTTCTTCTCTTTCAACTCAACTTCAGTGTGTGCGCCAACTTTGATTACGCAAACGCCACCAGCGAGTTTTGCAACGCGCTCTTGTAGCTTTTCGCGATCCCAGTCAGAGTCGGTGTTAGCGATCTCGTTGCGGATTTCAGCGACGCGTGCAGCAACAACTGCCTTATCCCCTGCTCCATCAACGATCGTTGTTGCATCCTTAGTGATAACGATGCGACGAGCCTTACCGAGATCTTCAATGTTCACCTGATCTAGCTTCATACCAACTTCAGCAGAGATCACTGTGCCGCCGGTCAAGATTGCAATATCTTGCAACATCGCCTTGCGACGATCGCCAAAGCCAGGTGCCTTAACTGCCGCTGATGTAAATACTCCGCGCATGCGGTTTACAACGAGAGTTGAAAGCGCTTCGCCTTCAACATCTTCAGCGATGATCAGAAGTGGCTTATTTGCTTGCGCAACCTTTTCAAGTACTGGCAGCAACTCTGCGAGCGCGGAGATCTTGTTGCCAGAGATCAATACGAAAGCATCCTCGAGGATTGCTTCCATACGATCTTGATCAGTTACGAAATATGGAGAGATGTAACCCTTATCAAATTGCATACCTTCAGTGAACTCGAGTTCGAGCGCTGTGGTTGATGCTTCTTCTACGGTGATTACGCCATCTTTGCCGACCTTGTCCATCGCTTCAGCAATTAACTCGCCGATTGCGCGATCTTGGGCAGAGATAGTTGCAACGTCTGCGATCTGTGCCTTGTCTTTTACGACAGTTGCGTTCTCGGCTAAACGTGCAGATATAGCAACAACTGCTGCTTCGATACCAATCTTTAGATCCATTGGCTGTGCGCCAGCGGCCAGGTTGCGAAGGCCTTCTTTAACCATGGCTTGAGCAAGCACGGTTGCTGTTGTTGTTCCGTCACCAGCAACATCGTTGGTCTTTGTCGCAACTTCCTTTACGAGCTGTGCGCCCATATTTTCTGCTGGATCTGAAAGTTCGATCTCTTTGGCAATTGTCACGCCATCGTTTGTGATGGTTGGTGCGCCAAAAGACTTTGAGATAACAACGTTACGTCCCTTAGGACCTAGTGTTACCTTGACTGTGTCAGCGAGGATATTTACTCCACGCTCCATCGCGCGACGAGCTTTCTCGTCGAATTCAAGAATTTTTCCCATTACTACTTCTCGATTACAGCGAGAATGTCACGGGCTGAAAGTACGAGGTACTCCTCGTTGTTGTACTTAACTTCAGTTCCGCCGTACTTGCTGTAGAGAACAACGTCTCCGACCTTGACATCCATAGGAACGCGAACGCCGTCATCGAAACGACCAGGACCTACGGCAACAACGGTGCCTTCCTGTGGCTTCTCTTTTGCAGTATCAGGGATAACAAGACCTGAAGCTGTGGTTGTCTCAGCCTCTGAGGCCTTTACAACGATGCGATCTTCGAGTGGCTTAATGGCAACTGCCATGGTTTTTTCTCCTTCAATTAGCAGTGTGGGGCTTAGAGTGCTAACTCTAGTTCTAGAGGGCTACGCGCTCAACTTGGAGCGCAGAATCGGCATCGAAGGCCTGTGGGCCAGGTGCGCGGCCGGCAGCGACCATCAGCGAGCCCAGAGCAGCGACCATCGCCCCGTTATCGGTGCAAAGCGCCGGGCTTGGGATCCGCAGACGAACCCCGGCCTTAGCGCAGCGCTCTTCAGCCACTGCACGTAGACGTGAATTCGCAGCAACTCCTCCTGCGATAACAAGTGAATCGATTCCGGTGGCTTTGCATGCAGCGAGTGATTTCTGCAGAAGTACATCGACGATTGATTCTTGGAAAGAGGCAGCGACATCTGCTTTCGCATAATTCGGAGTGCTTTCCAAATAACGAGCAACTGCAGTTTTAAGCCCCGAGAACGAGAAATCATAAGGACGAGTTGCCCAGTCGTTGCTCGTTGTTAAGCCGCGTGGAAAATCGATTGCAGTTGGCGAACCAGAAAGAGCTGTGCGATCGATCGCCGGGCCGCCCGGAAAACCAAGTCCCATAACGCGGGCAATTTTATCGAAAGCTTCACCTGCAGCATCATCCATCGTTGCACCGAGTTTGGTAATCGATGAAGTGATGTCATCTACTTGCAGCAGGGATGAATGTCCACCGCTAACTAACAAAGCGATCGTTGGATCAGTTGGTTGATCGTGAGTTAAGAAATCAACGCTGACGTGCGCTGCTAAATGGTTTACGCCATATAAAGGTTTGCCCAGTCCAAGTGCTAAACCGCTGGCAGATGCAACGCCGACGAGAAGTGCACCAACTAAACCAGGGCCAGCAGTAACTGCCACAGCATCGATATCTTTTAGCGAAATCTTAGAGTCCTTAATTGCCTTTTCAATGCTGGGTAGCATCGCCTCTAAATGTGCTCGGCTCGCGATTTCAGGTACAACTCCCCCGAATCGCGCATGTTCATCAACGCTAGAAGCAATTACATTGGCAAGCAGCGTGCGTCCGCGGACAATGCCGATTGCGGTCTCATCACAGGAAGTCTCAATGCCCAGGACTATTGGCTCGCTCAT
Protein sequences of DBSCAN-SWA_3 >NZ_CP016783|1101235:1110156|1108778_1109072_-|WP_095526936.1|DBSCAN-SWA MAVAIKPLEDRIVVKASEAETTTASGLVIPDTAKEKPQEGTVVAVGPGRFDDGVRVPMDVKVGDVVLYSKYGGTEVKYNNEEYLVLSARDILAVIEK >NZ_CP016783|1101235:1110156|1101235_1101913_-|WP_095689436.1|DBSCAN-SWA MKRMSALCALAIAVASLAPVTSTAAERPTSVKGCAKPTAKAHVPATLKQPTSVDKKLAKTMTITTNCGVITIELDPAAPQTVTNLATLARSKYFDGSFCHRLTTEGIYVLQCGDPSAQGNGSPGSWKGYKDENLPTKKILTYPAGTVAMANSGPNTNGSQFFLVYKDTTLPPSYTIWGKIKTGLPLLLRIEKVGAYQVDQSTGNAYYAGDGIPVQPIEIKSVTVR >NZ_CP016783|1101235:1110156|1106780_1107080_+|WP_095689441.1|DBSCAN-SWA MAELSRLPLPIADHWEWQYEGACRELPAEMFFHPDGERGPRRRNRENAAKAVCATCPVIQACRAHALAVQEPYGIWGGLSEDDRAVLLGKEGPTIYEAS >NZ_CP016783|1101235:1110156|1105758_1106634_-|WP_095689440.1|DBSCAN-SWA MEELLTVAAVARQIGVAPATLRTWARRYGLGPSSHEAGEHRRYCPSDLAKLTLMRRLITAGVAPAEAAEQALAHKGEVKVEKILKTFEVREDVVAAIVNAANSLDRNFVESILRKDIDKYGVIDSWQEVIVPVLVIVGNAWAETGTGIEVEHLLSEIVTSILRDRTKSMKTPINSRPVLLASVGEELHTLALHALHAALSDIKVECHFLGARTPLEALSNVVTRSAPPAVFLWAQLSKNGDPMYFRDTPAIRPAPRYILGGPGWDREQCADVAFADDLTLACEQIKQAIGA >NZ_CP016783|1101235:1110156|1107138_1108776_-|WP_095689442.1|DBSCAN-SWA MGKILEFDEKARRAMERGVNILADTVKVTLGPKGRNVVISKSFGAPTITNDGVTIAKEIELSDPAENMGAQLVKEVATKTNDVAGDGTTTATVLAQAMVKEGLRNLAAGAQPMDLKIGIEAAVVAISARLAENATVVKDKAQIADVATISAQDRAIGELIAEAMDKVGKDGVITVEEASTTALELEFTEGMQFDKGYISPYFVTDQDRMEAILEDAFVLISGNKISALAELLPVLEKVAQANKPLLIIAEDVEGEALSTLVVNRMRGVFTSAAVKAPGFGDRRKAMLQDIAILTGGTVISAEVGMKLDQVNIEDLGKARRIVITKDATTIVDGAGDKAVVAARVAEIRNEIANTDSDWDREKLQERVAKLAGGVCVIKVGAHTEVELKEKKHRLEDAISATRAAVEEGIVIGGGAALVHAADALEGDLGFTGDKAVGVRLVRKACDEPLRWIAENAGLEGYVVVAKVRALKAHEGFNAATDVYGDLAKDGVIDPVKVTRSALANAASIAAMFITTEAVVYERPADEVADAAGGHGHSHGPGGHSH >NZ_CP016783|1101235:1110156|1103477_1104587_-|WP_095689438.1|DBSCAN-SWA MDIEIAPGKRARQAYSFDDIAIVPSRRTRNPEDVSTAWQIDAYKFDIPMMAAPMDSVVSPETAIEIGKLGGLGVLNLEGLWTRYDDPRIPLGEIASMPDKHATRRMQEIYSAPIRPELIKERIKTIRDSGVTVAASLSPQRTAELHKAVIDAGVDIFVIRGTTVSAEHVGNDDSALNLKKFIYELDVPVIVGGVATATGALHLMRAGAAGVLVGFGGGAAHTTRKVLGIEVPMASAVAEVAAARREYLDESGGRYVHVIADGSVGRSGDIAKAIAVGADAVMMGSPLAKATQAPGLGWHWGSEAHHQELPRGERVNVGTVGTLEEILNGPSHTSDGSMNLFGALRRAMATSGYSDVKSFQRVEVIIHGA >NZ_CP016783|1101235:1110156|1101923_1103477_-|WP_095689437.1|DBSCAN-SWA MSNSQNGVLVVDFGAQYAQLIARRVREANVFSEIVPSHITAAEVSAKAPSAIVLSGGPSSVYAGRAPKFDAEILKLGIPVFGICYGFQAMAAALGGVVSQTGKSEFGRTEIKAEVSSKIFATLPASQSVWMSHGDAVTQAPAGFTICAVTADTPIAAFEDATGKIAGVQFHPEVLHSEHGQAILKNWLINIAGCTPNWTTGNIAENEVAKAKALIGSKRVICGLSGGVDSAVAAAIVQRAVGSQLTCVFVDHGLLRSGESEQVQRDFVAATGIDLVVVDAVDQFLSALAGVTDPETKRKIIGREFIRSFEKAARDIAAGGDVEFLVQGTLYPDVVESGGGTGTANIKSHHNVGGLPDDLKFTLVEPLRTLFKDEVRQVGLELGLPEEIVWRQPFPGPGLGIRIIGEVTQDRLEILRHADLIAREELKAAGLDRQIWQCPVVLLADVRSVGVQGDGRTYGHPIVLRPVSSEDAMTADWSRVPYETLEKISTRITNEVREVNRVVLDVTSKPPGTIEWE >NZ_CP016783|1101235:1110156|1109124_1110156_-|WP_095689443.1|tRNA|DBSCAN-SWA MSEPIVLGIETSCDETAIGIVRGRTLLANVIASSVDEHARFGGVVPEIASRAHLEAMLPSIEKAIKDSKISLKDIDAVAVTAGPGLVGALLVGVASASGLALGLGKPLYGVNHLAAHVSVDFLTHDQPTDPTIALLVSGGHSSLLQVDDITSSITKLGATMDDAAGEAFDKIARVMGLGFPGGPAIDRTALSGSPTAIDFPRGLTTSNDWATRPYDFSFSGLKTAVARYLESTPNYAKADVAASFQESIVDVLLQKSLAACKATGIDSLVIAGGVAANSRLRAVAEERCAKAGVRLRIPSPALCTDNGAMVAALGSLMVAAGRAPGPQAFDADSALQVERVAL >NZ_CP016783|1101235:1110156|1104598_1105714_-|WP_095689439.1|DBSCAN-SWA MSSDNDKVAMLGLTYDDVLLLPDASEVVPSEVNTATWLTRNISLAVPVISSAMDTVTESTMAIAMAKAGGIGIIHRNLPIEEQVTHVKLVKNVGLAGAAVGVGDDGFARAQALIEAGVDVVVVDTAHGHHRAVLDAIERIKKFSSTIEIIGGNVATRAGAQALINAGADAVKVGVGPGSICTTRVVAGVGVPQITAIMEAAKACNKAGIPLIADGGLQYSGDIVKAIVAGANSVMLGSLLAGCEESPGQLVEIDGRKYKAYRGMGSLGAMQSRGEQKSYSKDRYMQDDVLSEDKLVPEGIEGRVAYRGPVAEVVHQLVGGLRSGMGYAGAPDIETLRREGRLIQITAAGLQESHPHDVLHVADAPNYSKKS |
9 | uncultured_virus(25.0%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
1116700 : 1126284
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_CP016783|1116700:1126284|DBSCAN-SWA ATCAGTACCCCAAATATGCGGCGCGCAAATTAGCATCCGCCCTTAACTCAGCTGCTGGTCGATCGGCAACGACTTTGCCAAGTGCCAAAAGAACTCCGTGATCGGCTACGCCAAGTGCCGTATTGGCATTTTGTTCAACTAGTAAAACTGTGAGACCAGTTGTGCGACAGAGAATATTTATTGAATCAATGATCTGTTCTACAACTAACGGTGCAAGGCCAAGAGATGGTTCATCTAAAAGCAGTAACTTGGGGCGCGAAACCAGCGCGCGACTAATGGCCAACATCTGACGTTCGCCACCTGAGAGAGTTCCAGCCAACTGTTTGCGGCGTTCTTTCAAACGAGGAAATAGTTCGAACATCTCATCCTGTGCAGCAATTACATCTGCTTTGAATTTACGGCGCCGAAATAGCGATCCCATTGAGATATTTTCTTCAACGGTTAACTCTGAGATAACGGCGTGGCCTTCAGGAACATGGGCGATGCCATCTCGCACAATATCTTCGGCGCGCTTGCCGATTAGAGATTTTCCGCTCCAGGTGATTGAGCCAGACGCTGGATGTTCTAGGCCAGAGATGGTGCGTAACAAAGTGCTCTTACCGGCGCCGTTTGCACCGATAACTGTTGTTATCTTCGATTCTTGTGCTGTAAACGAAACGCCGTCGAGTGCAATTACCGAACCAAAACTGGTTACTAAATCTTTAACTTCAAGCATTGTTGACTCCTAATTGATCGGTGCCACTTTGATTTGAAGTTCCAAGATATGCCGCAAGAACTGCTGGGTCGCGCTTAACGGTATCGAAGGATCCGCTGGCGATAACTTTGCCAAAATTCAATACGTAGACTTGATCTGAAATCGCACCAATGAAATCAACGTGGTGCTCAACGATGATAATTGCACAGCGTTGCTTTAACTGCCTTAGCAAGTCGGCGAACTTATCAATATCATCTTGGCCAAGTCCAGCAGCTGGTTCATCAAGAAGTAAAATCTTCGGTTCTGAAACTAGAGCTCTTGCAATGGATACGCGCTTTGTTTCGGGATACGTTAACTCTGATGGCGTCTTATCTGCTAAATCCAGCGCTCCTGCCCAGGCCAGCGCATCTTGTGCTTTAGCGCGGAGTTTGCGTTCTGATTTTGCAGATAGCCCAAAGAGATCTTTTAGGAAATTGCTCTCACGCTTCTTATCTGCACCCATCATGACATTTTCAATAACACTTAACCCGCTAAATAGACCAACGCCTTGCAAGGTACGGCTTATCTCAAAATTTGCTAGCTCGTAACTTTTTAGCCAATCAACTTTCTTGCCATGGATTGAGATCTCACCAGATGTCGGTGCGACTAAGCCTGAGATGCAATTAAACAAAGTGGTCTTGCCTGCGCCATTTGGACCAATTAGCCCAACGATCGAATTAGGCGCAACGTCGATGAAAACATCATCGAGGGCAATGAGTCCACCAAAGTTGACTCGTAAATTTGCAATCGCTAAAACGCTCATGGTTCGTAAATTCTAGGCACACGCGGGCCAATTCGGGTAACTATCTCGTAATTAATTGAGCCCGAAGCTGCACCCCAATCATCTGCCGTGTACTCGCCATCTGCACCGGATCCGAAAACTGTGACCCAATCGCCAGATTGCGCAGGGGAATCCGCTGCTAAATCAACAACGAATTGATCCATAGATACTCGTCCAATGATTGGCGCTTTTTCTCCCAGAAAACTGACACCCGCGCTTTGTGCAACACGAGGAATTCCGTCGGCATAACCCATCGCAACCAGTCCTAGCTTGGTCGCTGACTTTGTAACCGCCGTTGCTCCGTAACCAACTGGGCTTCCGGCGGGAACTTTCTTTACAAGATGTAACTTGGCGCGCAAAGTCATCGCAGGAATTAACCCAAGATCTTTAGATGTACCGAGTGTTTTTACATCAGGACTTAGGCCATACATAGCAATTCCAGTTCGCACCATGTCAAAGCCAGATGCCTGATCTTTGATCGTGGCAGCTGAATTTGAAAGGTGGCGAATCAAATTTGGGTATCCAAAAGATTCAAGGGATGCAACCATCTTTTTAAATCGATTGAGTTGACTTAAATTTTGTGGCTGTCCAGGTTCATCTGCGCGCGCGAAGTGGGAGAAGATTCCAACTATTTCAACCCCGTGTAGATGAAGTGCATCAATCTTCGGCCATTCATCCAAGAATCCGCCACGAGTCATTCCGGTATCAACTTCTAAATGAACGCGCGGGCGCTTGTCACTTTTTACTGCGCTAACTTCATCCAACGCCTTTATTGATGCAACTGCAATATCGATGTTGTTATCTACTGCGCTTTGAAAATCTGATCCCGGTGGAACTAACCACGCCAAGATCGGTGCAGTTATTCCAGCAGCGCGAAGAGCAATCGCTTCTTCAAGGAGTGCAACACCTAACCAAGTAGCTCCGCCATCAAGTGCCGCTTTTGCAACTGGCACGAGCCCGTGTCCATAAGCATCTGCTTTCACAACTGCCATCAAGTCAACGCCAGCTTTCGCCTTTAGCGTTGCCACGTTGGCTTTAATTGCAGAGAGATTTACGACTACTTCGGCGCGATTTGTCATCGTTCCACCACCACCATCGCCGATGCGATTCCGGCATCGTGAGATAAAGAAAGGTGCACGCGTGCACCATCTACTAACTCTGCTATCTCGCCGCGAAAGAGAAAATCTGGTTTACCGCTCTCGTGATTTATGACTTCGGCTTCGTGCCAAGAAAGTCCATGGCCGGCATTAAGCGCTTTCGCTAAAGCCTCTTTGGCGGCAAAGCGCGCTGCCAGTGAATGAATTGGTTTTACTTGTTCAGCAGGAGTGAACAGTTTTTCGCGAAGTGATGGCGTGCGCTCGAGCGATTGGGAGAAACGTTCGATATCTACGACATCTATTCCAATCCCATCGATCATGGGGTGGAGTCTAATTGGCGTTAGGAGAGTTGGCTATTAGATGGGTAATAGCGATCTACCGCGATCACCAAAACCAAGAGAGATGCGCCGATGAATAAACCGCCGAGTAGATCCGAGAACCAATGAGTGTCGCGGACTAAAGAGACTATGCAAACGGTCAAACTAATTGTGGCAACGCCTGCGCTAGCCAGACGGCCGCGATAACGATCGACGTGGGCATAGCGATAAATTAAATATGCAAGGACTCCCCAAGTAAGAAGTGCATTTGAGGCATGGCCACTTGGGTATGACATTCCGCCGAAGTGAAGTAGATCGATTCCAACCTTTGGCTTGGTGCGACCTAGGCCAAATTTAAATAGACCTACAACTAAGTTAAGAGAGATCAACGAAAGAATTGCCAAATTCAGTGGACGCCAGGTTTTAAATCTCCGCGCAATGTAGAAAGCGGCGATAAGAAGTGCGGTAGCAGTAACGCCACGCAAACCAAGGTCATCTAATCTGCGAATAATGAATCCACTTACTCCACGAAATTCTGGACGCTCTAGATCATGGATCCATTTGTCGATTTCAATTAGCGGCCCATTAATGATTACTTGATGAGTAATTAGGAGAAATCCAGCAAATAGCGCTGCAGACCAGCGCGCAGCGCGCACCATCTGGGCGCGACGCGAATCAAGGACTGCCTGCATAATTATTCGACCGTTACTGATTTAGCTAAATTGCGTGGCTGATCAACATCATTTCCGCGCGCAATCGCCATTTCGTAGGCAAAGACTTGTAGGGGAACTGTGGCCAAGATTGGTTGAAAGAGCGGTGAGCACGCTGGGATGCGAATGATGTGTTCTGCGCCAACAACTTCAGCGCCTTCGTGTGCGATGACAATTACGCGCGCACCACGAGCTTTCACTTCTTGGATATTGCTAAGCATCTTTTCATCTAGTGAGTGATCGCTGCCGGAAGGCAAGATCGCGATTACGGGCGTGCCTTTTTCGGCATCGATTAACGCGATCGAACCGTGCTTGAGCTCTCCCCCAGCAAATCCTTCCGCGTGGATATATGCGAGTTCCTTCAGCTTCAAAGCACCTTCTAGAACAACTGGGTAACCAATATTGCGACCGATAAAGAGAACGGTTTCAGATTTTGCAAAAGTGCGAGTAAGTGCGCGAAGTGGTTCGACGGTTTCGAGGATTTGTTCGATCTTGCCTGGTAGTTCGAGCATTTCGTTGTAAAGATCGCGAACCTGAGCATCGGTTAACTCACCACGAACCTGTGCCAAATGCAGACCAATTAAATCAACGGCAACGATCTGGGTTAGCAGCGCTTTAGTTGAAGCGACTGCAATTTCAGGACCGGCATGTGTGTAAAGAACGGCATCTGATTCGCGCGGAATCGTTGAGCTATTTGTATTGCAGATCGCAAGTACTCGCGCACCCGCAGCCTTAGCATGGCGGATAGCCATCAAGGTATCCATCGTTTCGCCAGATTGAGAAATGGCGATAACTAAAGTGTTGGAATCAATGATCGGATCGCGGTAGCGATATTCGCTGGCAATTTCTACTTCTACAGAAATCTTCGCCCATTTTTCAATTACATATTTAGCGATCATTCCTGCGTGATAAGCGGTTCCACAAGCGATTACAACAATCTTCTTGAGCGCTTTAATTTCAGCCTCTGACATATGTAATTCATCGAGTTCGATCTTGCCGTTATCTGTTAAGCGGCCAATCAAAGTGTCGGCAACTGCCTTGGGTTGCTCGAAGATCTCCTTGAGCATGAAGTGGGAGTATCCACCTTTTTGCGCAGATGCTGAATCCCAAGTAATTGCATATTCCTTGGGAACTACTGCTTTGCCATCTAAGCCAATGATGCTGATGCCAGCTGGCGTCATGGTGATGATCTCGTCTTGGCCAAGTTCTACCGCGCGCTTGGTGTAATCGATAAAGGCTGCAACATCGGAGGCCATAAAGTTTTCGCCATCGCCTAAACCAACAACGAGCGGTGAATTGCGGCGTACTCCAACAATTACTTCAGGGGCATCGGCGTGAACTGCAACGAGAGTAAATGAGCCGCGTAAGCGCTTTACCGCATCTAACATCGCAGCAGTTAAATCGCCGCCATGAGATTTACGAAGTTCAGAGAGTAAATGTGCAACTGATTCAGTGTCAGTATCTGATGAAAATTCGTGGCCGCGTGATTGGAGCTCGGCCTTTAGCTCTGAATAATTCTCGATGATGCCGTTATGGATAACGGCAAGCTTTCCTTCATTGTCAGTGTGCGGATGTGCGTTGCGATCTGTTGGTCCACCATGTGTTGCCCAACGAGTGTGGCCAATTCCAGAGTGAACAACCGGCATATTTGCAGGCAGTGAGCCTTCAAGATTTGTTAACTTACCGGCGCGCTTTTCAATAAATAGAGCGCCCGATGTACCAAGTGCGATTCCGGCTGAGTCGTAACCACGATATTCAAGGCGACGTAGACCTTCGATGAGCGGAGTAATTGCAGACTGTGGCCCTGTGTAACCCACAATTCCGCACACTGTTATCTACCCCAGCTCTGATTGAACGACCTTTGCCAAGTCGTTTGCTATTTCGCTGGCTAGGTTATCACTCTGGGCTTCAACCATCACGCGGACAAGCGGTTCGGTGCCAGATGCTCTTAATAAAACGCGACCATGTTCGCCAAGTTTTGCCTCTGCCGCTGAAACCGCTGCCTTAATCACTGCAGAATCTGCAAGACGCTCTTTGGCAACGTTCTTAACATTTACCAATACTTGTGGGAAACGCACCAATGTTGATGCAAGATCTTGCAAAGGTTGACCTGATTTAACAACTTCTTGCGCAAGTGCAAGAGCGGTCAATATTCCATCGCCCGTTGTTGCGAATTCGCGCATGATTAAGTGGCCAGATTGTTCTCCGCCAAGTGAATAGCCTTTTTCCAGCATCTTCTCTAATACATAACGATCGCCAACGGCAGTTGTTTCAACGTTGATATCTGCGTTCTTCATCGCGTGGAAGAAACCGAGGTTGCTCATCACAGTTGCCACAATCGTGTTTGATTTCAGCGAACCAGATGCTTTAAATCCGCGCGCCAAAATAGTCATGATTGCATCACCATCGACCTCATTGCCTTGCGCATCTATCGCTAGCGCACGGTCGGCATCGCCATCGTGGGCAATTCCAAAATCAGCGCCTTCGCGAAGCACTGCAGTGCGAAGCTGGTGTAAATGTGTAGAGCCGCAATCATCGTTGATATTCCAGCCATCTGGGGCATCGAAGATTGAGATTACTTCGGCACCTGCGCGGCGGTAAGTATCAGGTGCAACAGTTGAAGATGCGCCATTTGCACAATCGACGACGATCTTTAAACCCTTTAGCGGTGTTGAAACGGATGAGAGAAGGTGTGAGATATAACGTTCGCGAGCGCTTTCATCGTTGATCGCACGTCCCACACCGCTGCCGGTAGGGCGCTCCCATGGTTCGCCGATGCGCGCTTCGATCGCTGCTTCAATGGAATCGTCTAACTTTCCGCCACCGCGTGAAAATAACTTAATGCCGTTATCGGGCATTGGATTGTGTGATGCCGAAATCATTACGCCGAGGTCGGCACCGGATTCGGCAACTAAATGTGCAATTGCAGGAGTTGGTAGAACTCCTACGCGATAGACATCAACGCCGGCACTAGTTAGACCAGCAACAACTGCTGCTTCTAGGAATTCACCAGATGCGCGTGAGTCATGGCCGACGATTGCTATAGGTCGAGAATTGCTTGCAATCTCGCCAGAAGAACTAGGCCGGTGTGATTTATTAGGATTTGTTTCGACGAGAATATGTGCAGCGGCAACGGCGACATCAAGTGCTAATTCAGCAGTTAAATCACGATTGGCTAAGCCACGTATTCCATCGGTTCCAAAGAGTGCCACGTTGGGGCGCCTTTGAAATTAACGCTTGCTGTATTGCGAACGCTTACGGGCCTTCTTCAGACCGTACTTCTTGCGTTCGATAACACGTGCATCACGTGAAAGGAATCCAGCTTTCTTGAGCGCTGGACGGTGTGCTTCAACATCGATCTCGTTAAGTGAACGAGCAACGCCAAGACGAAGTGCACCGGCTTGACCAGATACTCCGCCACCGTTGATACGAGCAAATACGTCGTACTGGCTTTCAGCACCAACAGTGCGGAATGGCTCAGAGACTAGTTGCTGGTGAACTTTATTTGGGAAATAGTTTTCCAGGGTCTTACCGTTTACAACCCAACGTCCTGTTCCAGGTACGAGACGAACGCGAGCGACTGCTTCTTTACGACGACCAGTGCCTGCACCAGGTGCCTTGATTGCTGGGCGGTTAGTTGCAGCAGCTGGAGTTGCTGAAGAGTATGAAGTAGGAACTTCGTTCTCGTCTAGATCTGTAACTTCGTTATAAGTTGTATCTGACATTTGTTATGGGTCCCTTACTTTGCGATCTGTGAAACTTGAGTGAATACATATGGTGTTGGTGATTGAGCAGCGTGTGGGTGCTCAGGTCCTGCGTAAACCTTTAGCTTTGATGCAACCTGCTGGCCAAGACGGTTCTTTGGAAGCATTCCCTTGATCGCCTTTTCAACAGCGCGTGTTGGGTGCTTTTCGATTAGCTCGCCGTAAACAGTTGAGCGAAGACCACCTGGGAAACCTGAGTGGCGGTGTTCAAACTTTGAAAGCGCTTTGTTTCCTGAAAGAGCAATCTTTTCTGCGTTGATGATTACAACAAAGTCGCCCATATCCATGTGTGGTGCAAATGTTGCCTTGTGCTTACCGCGAAGAAGAACAGCAGCATGTGTTGCAAGGCGGCCAAGAACTACATCTTGTGCATCGATGACGTGCCATTTACGGACGACATCTCCGGCCTTAGGACTATATGTACGCATGCTTTATTACCTTCTTCTTCTTAGTTTGAACTTAGTTTGAGCTCTTGGTTCTGCGGTTTTGCCTTCGGCGCCTTATCTGGCGCAGGGGATGAACTTTACCTTTCATGCGTAGCAGGGTCAAAATGGCCTAAATCCACGCATACCAGCCGCCCCTTAATGGTAGAAGATCGGCTCTGGGCTCTCATAAACCTGGCCATTTGCCCCAAAGACCAAGAAGCGGGTAAATCCACGAGTAAACCAGCGATCGTGAGTTACAGATAGAACTGTTCCTTCGTAACGATCGAGCGCTTTCTCTAAACTCTCGGCGCTCGCTAGATCTAAGTTATCGGTCGGTTCGTCAAGCAACAACAAAGTAGAGCCAGATAGTTCTAGCAATAAGACTTGAAAGCGCGCTTGTTGTCCACCAGATAGGGAAGAGAATTTCTGTTCTGCTTGCTTATCAATTTCATATTTGCGAAGCACGCCTTTAGCGGGGCCAAGTTGAAGTGAATACGTCTCCCACAAGATTTCTAGCAGAGTCTTTGATTCAAATTCTGGGTGAGCATGGGTCTGTGCGAAGTAACCGATCTCAACGCGTGCACCAATTTTAAAAGTGCCACCGTATTTAACAGTTTCATCGCCAGCGATCATGCGCAGGAAGTGAGATTTGCCGGTGCCGTTCTTACCCAGCACTGCGATGCGATCGCCGAAGAATACTTCGAAGTTAAATGGTTCAGTTAAGTTAGCAAGAGATAAGTTTTCGAAGGTAAGTGCGCGCAGACCTGTGCGGCCGCCACGTAAACCGACTTTTACATCTTGTTCAATTGGTGGGGCTTCTGGCGCACCTGCTTCTTCAAATTTACGCAAGCGCGTCTGCATCGCATGATATTTACTTGCCATATCTGGAGAGATTGCGGCTTGTACCTGAAGCGTCTTAACTAGATCAATGAGGCGTTGGTGTTCTTGTTCCCAGCGGAGCAACATTTCAGCAAAGCGTTCGTTGCGCGCCTTGCGGGCATCATGCCAAGTCGAAAACTTTCCGCCATGCACCCAAGTGGTATTTCCATTAACGCCTTGTTCGAGAGTCACAATTCGGTTAGCGGTATTTTCAAGTAATTCACGATCGTGGCTAACGAAGAGAACGGTCTTCTTAGTTGCACCGATAACTTCTTCCAACCAACGCTTTGAAGGAACATCAAGATAGTTATCTGGCTCATCAAGTAACAAGACTTCCTCAGGTCCTTCTAGCAAGGCACGCAGAACTAATTTTTTCTGCTCTCCCCCGGATAAAGTGTTCACCTTTCGGGCTGCAACTTGATCAAAAGATTTATTGAGCATCTCGGTGCAGCAGACATCCCAAATTAGTTCGAAGTCATATCCCCCGGCATCGCCCCAATCGATAATCGCTTGGGCGTAACTCATCTGATCTTTCTCTGATCCACTAGCGTTCATCGCAGCTTCAGTTGTAACAACTTTTTCAGCGGCATCGCGGATTCGCTTTGGTGCAATTGAAAGCAGTAAGTCCCGCACTGTTGATTCATCGCGTACCGAACCAATGAACTGGCGCATCACACCTAAGCCACCAGTAATGGCAACCGCTCCTTCATCAGGTTTTAAATCTCCACAGATCATGCGGATTAAAGTTGTTTTGCCTGATCCGTTAGGACCAATTAGCGCAACCTTGGCGCCATCACCGACGCGGAATGAAACATCGTTTAGCAATACTCGGCCGTCAGAGAGCGAGTATTCAACAGCGCTTACATCAATGTGGCCCAT
Protein sequences of DBSCAN-SWA_4 >NZ_CP016783|1116700:1126284|1117406_1118195_-|WP_095671494.1|DBSCAN-SWA MSVLAIANLRVNFGGLIALDDVFIDVAPNSIVGLIGPNGAGKTTLFNCISGLVAPTSGEISIHGKKVDWLKSYELANFEISRTLQGVGLFSGLSVIENVMMGADKKRESNFLKDLFGLSAKSERKLRAKAQDALAWAGALDLADKTPSELTYPETKRVSIARALVSEPKILLLDEPAAGLGQDDIDKFADLLRQLKQRCAIIIVEHHVDFIGAISDQVYVLNFGKVIASGSFDTVKRDPAVLAAYLGTSNQSGTDQLGVNNA >NZ_CP016783|1116700:1126284|1122172_1123549_-|WP_095671498.1|DBSCAN-SWA MALFGTDGIRGLANRDLTAELALDVAVAAAHILVETNPNKSHRPSSSGEIASNSRPIAIVGHDSRASGEFLEAAVVAGLTSAGVDVYRVGVLPTPAIAHLVAESGADLGVMISASHNPMPDNGIKLFSRGGGKLDDSIEAAIEARIGEPWERPTGSGVGRAINDESARERYISHLLSSVSTPLKGLKIVVDCANGASSTVAPDTYRRAGAEVISIFDAPDGWNINDDCGSTHLHQLRTAVLREGADFGIAHDGDADRALAIDAQGNEVDGDAIMTILARGFKASGSLKSNTIVATVMSNLGFFHAMKNADINVETTAVGDRYVLEKMLEKGYSLGGEQSGHLIMREFATTGDGILTALALAQEVVKSGQPLQDLASTLVRFPQVLVNVKNVAKERLADSAVIKAAVSAAEAKLGEHGRVLLRASGTEPLVRVMVEAQSDNLASEIANDLAKVVQSELG >NZ_CP016783|1116700:1126284|1124076_1124529_-|WP_095671500.1|DBSCAN-SWA MRTYSPKAGDVVRKWHVIDAQDVVLGRLATHAAVLLRGKHKATFAPHMDMGDFVVIINAEKIALSGNKALSKFEHRHSGFPGGLRSTVYGELIEKHPTRAVEKAIKGMLPKNRLGQQVASKLKVYAGPEHPHAAQSPTPYVFTQVSQIAK >NZ_CP016783|1116700:1126284|1116700_1117414_-|WP_095671493.1|DBSCAN-SWA MLEVKDLVTSFGSVIALDGVSFTAQESKITTVIGANGAGKSTLLRTISGLEHPASGSITWSGKSLIGKRAEDIVRDGIAHVPEGHAVISELTVEENISMGSLFRRRKFKADVIAAQDEMFELFPRLKERRKQLAGTLSGGERQMLAISRALVSRPKLLLLDEPSLGLAPLVVEQIIDSINILCRTTGLTVLLVEQNANTALGVADHGVLLALGKVVADRPAAELRADANLRAAYLGY >NZ_CP016783|1116700:1126284|1120321_1122166_-|WP_095671497.1|DBSCAN-SWA MCGIVGYTGPQSAITPLIEGLRRLEYRGYDSAGIALGTSGALFIEKRAGKLTNLEGSLPANMPVVHSGIGHTRWATHGGPTDRNAHPHTDNEGKLAVIHNGIIENYSELKAELQSRGHEFSSDTDTESVAHLLSELRKSHGGDLTAAMLDAVKRLRGSFTLVAVHADAPEVIVGVRRNSPLVVGLGDGENFMASDVAAFIDYTKRAVELGQDEIITMTPAGISIIGLDGKAVVPKEYAITWDSASAQKGGYSHFMLKEIFEQPKAVADTLIGRLTDNGKIELDELHMSEAEIKALKKIVVIACGTAYHAGMIAKYVIEKWAKISVEVEIASEYRYRDPIIDSNTLVIAISQSGETMDTLMAIRHAKAAGARVLAICNTNSSTIPRESDAVLYTHAGPEIAVASTKALLTQIVAVDLIGLHLAQVRGELTDAQVRDLYNEMLELPGKIEQILETVEPLRALTRTFAKSETVLFIGRNIGYPVVLEGALKLKELAYIHAEGFAGGELKHGSIALIDAEKGTPVIAILPSGSDHSLDEKMLSNIQEVKARGARVIVIAHEGAEVVGAEHIIRIPACSPLFQPILATVPLQVFAYEMAIARGNDVDQPRNLAKSVTVE >NZ_CP016783|1116700:1126284|1124682_1126284_-|WP_095689447.1|DBSCAN-SWA MGHIDVSAVEYSLSDGRVLLNDVSFRVGDGAKVALIGPNGSGKTTLIRMICGDLKPDEGAVAITGGLGVMRQFIGSVRDESTVRDLLLSIAPKRIRDAAEKVVTTEAAMNASGSEKDQMSYAQAIIDWGDAGGYDFELIWDVCCTEMLNKSFDQVAARKVNTLSGGEQKKLVLRALLEGPEEVLLLDEPDNYLDVPSKRWLEEVIGATKKTVLFVSHDRELLENTANRIVTLEQGVNGNTTWVHGGKFSTWHDARKARNERFAEMLLRWEQEHQRLIDLVKTLQVQAAISPDMASKYHAMQTRLRKFEEAGAPEAPPIEQDVKVGLRGGRTGLRALTFENLSLANLTEPFNFEVFFGDRIAVLGKNGTGKSHFLRMIAGDETVKYGGTFKIGARVEIGYFAQTHAHPEFESKTLLEILWETYSLQLGPAKGVLRKYEIDKQAEQKFSSLSGGQQARFQVLLLELSGSTLLLLDEPTDNLDLASAESLEKALDRYEGTVLSVTHDRWFTRGFTRFLVFGANGQVYESPEPIFYH >NZ_CP016783|1116700:1126284|1119288_1119630_-|WP_095677636.1|DBSCAN-SWA MIDGIGIDVVDIERFSQSLERTPSLREKLFTPAEQVKPIHSLAARFAAKEALAKALNAGHGLSWHEAEVINHESGKPDFLFRGEIAELVDGARVHLSLSHDAGIASAMVVVER >NZ_CP016783|1116700:1126284|1118191_1119292_-|WP_095677635.1|DBSCAN-SWA MTNRAEVVVNLSAIKANVATLKAKAGVDLMAVVKADAYGHGLVPVAKAALDGGATWLGVALLEEAIALRAAGITAPILAWLVPPGSDFQSAVDNNIDIAVASIKALDEVSAVKSDKRPRVHLEVDTGMTRGGFLDEWPKIDALHLHGVEIVGIFSHFARADEPGQPQNLSQLNRFKKMVASLESFGYPNLIRHLSNSAATIKDQASGFDMVRTGIAMYGLSPDVKTLGTSKDLGLIPAMTLRAKLHLVKKVPAGSPVGYGATAVTKSATKLGLVAMGYADGIPRVAQSAGVSFLGEKAPIIGRVSMDQFVVDLAADSPAQSGDWVTVFGSGADGEYTADDWGAASGSINYEIVTRIGPRVPRIYEP >NZ_CP016783|1116700:1126284|1119650_1120286_-|WP_095671857.1|DBSCAN-SWA MVRAARWSAALFAGFLLITHQVIINGPLIEIDKWIHDLERPEFRGVSGFIIRRLDDLGLRGVTATALLIAAFYIARRFKTWRPLNLAILSLISLNLVVGLFKFGLGRTKPKVGIDLLHFGGMSYPSGHASNALLTWGVLAYLIYRYAHVDRYRGRLASAGVATISLTVCIVSLVRDTHWFSDLLGGLFIGASLLVLVIAVDRYYPSNSQLS >NZ_CP016783|1116700:1126284|1123567_1124062_-|WP_095671499.1|DBSCAN-SWA MSDTTYNEVTDLDENEVPTSYSSATPAAATNRPAIKAPGAGTGRRKEAVARVRLVPGTGRWVVNGKTLENYFPNKVHQQLVSEPFRTVGAESQYDVFARINGGGVSGQAGALRLGVARSLNEIDVEAHRPALKKAGFLSRDARVIERKKYGLKKARKRSQYSKR |
10 | Planktothrix_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
1373806 : 1384381
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NZ_CP016783|1373806:1384381|DBSCAN-SWA CTCACACAAGCTCAAATCTCTTGGTGCCGAGGAATGGTTGCAGTGCAGCAGGCACATTTACTGTGCCATCTGCATTTTGGTGGTTCTCCAGAATCGCAACGATCATGCGGGGAATAGCGACCAGCGTGCCATTTAACGTTGCGACAGCCTTTGTGCCATCAGCATCTTTATAGCGAATATTTAAGCGGCGGGCTTGGAATTCTGTGCAGTTAGAAGTACTTGTTACTTCACGATATGCCTTCTGTGTTGGGATCCAAGCTTCGATATCAAATTTACGCGTTGCGCTTGATCCGAGATCACCAGATGCCACATCGATTACGCGGAAAGGGATCTCCATCGCATTTAAGAAATCTTTCTCCCATTGCAGTAAACGCTTATGTTCTTCCTTGGCTTCTTCCGGCTTACAGAAAGAGAACATTTCAACTTTATCGAATTGGTGTACTCGGATAATTCCGCGGGTATCTTTTCCATAACTTCCGGCCTCGCGGCGGAAACAAGTTGAATACCCGGCATAACGCATCGGCAATTTATCCGCTGGTAGTACTTCATCCATATGGAATGCAGCCAGTGGAACTTCACTTGTGCCGACTAAGTAAGCGTCATCTTTTTCAAGGTGATAAACATTTTCGGCTGCTTGTCCAAGAAAGCCGGTACCTTCCATCGCAGCAGGCTTAACAAGTACTGGTGGAATAACAGGGGTAAATCCAGCCTTTAGCGCGCTTTGAATCGCGTAGTTAACAAGTGCGAATTCCAGCATCGCTCCAACGCCGGTTAAATAATATGAACGAGAACCAGAAACTTTTGCACCACGTTCAGTGTCAATTGCGCCAAGTAGCTTTCCAAGTTCAACGTGATCTTTTGCTTCAAAGCCATCTTTCGCAAAATCACGTGGTGTGCCAACGTGTTCAATGGTTACGAAATCTGCTTCGCCACCGATAGGCGCTTCTGGGTCCAGTAAGTTTGAAAGTTGCATGACAAGTGCGTTGGCTTTAGCTTCGATTTCAGCGCGCTTAGCATCGGCAGCTTTTACTTTATCTGCCAGCGCTTTTGCATTCTCTAATAACGCAGCCTTTTCATCGCCTTTTGCAGCTCCGACTGATTTTGAAAGGGTGTTCTGTTCTGCGCGTAAAGTTTCGAATTCGGTAATTGCGGCACGGCGCACATCATCGATAGCAAGTACTTGATCCACGATCGTGACATCTTCACCACGACCTGTTTGTGATGCGCGAACTAGATCTGGATTCTCGCGAAGGAGTTTGATATCGATCATGCTTTAACGCACCTCTCGTGGGTAAGAACCTAAGAATCTGATGTCATCGCAAATCTCACGAAGTTCGGCAACGCTGCCACCGACTGCTTCGTCGTTGATATGTCCTTCGACATCAATAATGAAATGGTAGTGACCAAGTTCGCGGCCAGTAGGGCGACTTTGAATAAAAGTTAAGTTGACGTTGCGCTTGGCAAAAACCTGCAGGATCTCAAGCAGCGCTCCAGCGTGGTCGATATCGATAAAGATCGCCATAGAGGTGCGGTCATCACCAGTTGGCTCAGGAATCCATCCAGGTTTCTGGGCAGCGATAAAGCGAGTGACTGCTCCATTGTTATCGCCAATATTTTCGTCACGGATTTCTAAGCCGAAGTGTTCTGCGGCAACGCTTGATGCGATTGCTGCATCGAGCTCGCCTTTACTTACTAACTCAGCGGCGGCAGAGGTTGAATTCGTTGGGACAACTTCTGCATGCGGATAATTTTTAGCGATGTAGCTGCGGCACTGAGACTCAGCATGTGGGTGGGTGCCAATTCTTTTAATTTCTGTTGCACCTGGCTTAGTCATAAGCGCAAAGGTGACCGGCAAGGTAACTTCGCCAGAGATAGTTAGAGGGTCCCCCGTTGCGAGTTCATCGAGAGTGCGTGCGACAACGCCTTCCACGGAATTTTCAATTGGGACTAGGGCGTAATCAACTTCGCCGGTGCGAACTGCGTTAAGTGCGGCAGTGACATTTGCATAAGGAATTTTTACATCGGTGTCGCAAGTGATCTTATTTAGCGCAGCCTCAGTGAAGGTACCTACTGGACCTAAATAGGCGAATCTATGGCTATCTGCGGCGCTCACTTGCTAAGGATACCTGCTCCATAGCCAAAGATTTCCAGTAATTAATTGGAGGCAAGAACCTCACGCGCAAAGGCGGGGCGATTGGTAATCAAATTATCGACGCCGAGATGGCGACACAAGATGATGTCGGATGGTTCATCAACGGTCCAAACATTTAATTTCTTGCCGGCCGCTTTTATCTTCTGCGCTAACGCAGGCTTCTTGCGAAGTAGCTCGATGCCAGGACCAATTGAGTGAGCAGATGAGTAGCGTGTAGAAAACCACGGGGTGTAATCGTGAAGTAAATAGGTAGTTGAAATCTTGCTATCTAAAGCTTTTATCCGCTCTATCGCAAACCAAGAAAATGACATTACCGTTACATCGATTCCAGCTTTTTCGATTCTTTTGGCTTCTTTATGTAATTTCTGAACAACTTGGTTTTCAATTTCAGTGCGTGAAGGAACTGGATGTTTAGTTTCCAGCAATAAAGATTTCTTCTCGGTAATTCCAAAATCTATGAATTGATCGAGGGTAACAATCTGCGGATAGATTGCCACTAGCTGCGCATATGTTGATTCGTGAACTACGGCGGCGTTATTTGCGCGACGCTTTAAATCTGCATCATGCCAAAGAACTACGATGCCATCTTTAGTTAGTCGTAGATCGCATTCGAAACCATCGGCGCCTTGCTTAACCGCGCCTTCATAAGCCGCCATCGTCATCTCAGGAAAGTCATATGAGGCTCCGCGGTGAGCGTAAATCTTCATAGTTCGAGATTAACATCAATACCCCAGTACCAATGGGCTTGGGTAGGCTTAGCGTCATGAAGGCCTTGTACTTCCAATGCGCTGCACGTTCAGCGCTAGGTTTGGTGCGCAACGGCAATGAGGACTCTGCACTTATAGGCGCGCAAGTTATCGCCGTCGCTGATGGTATGGGTGGCCACGCAGGTGGTGAAGTTGCATCGAGCGTTGCAATAAATACGCTCTCAGAAATCGCACCGACTTTTACGGCTGAAAATATTGATCTTGATTCGGCGAGCGATCTTTATTTAAATTCACTACACACTATCGATGCGCAGATTCGCGCCGTCGCTAGCGATGAACCAGCCCTTTCCGGGATGGGAACAACTCTTACCTCTCTATTCATAAGTGGTACTGAGGTTTCTTTGCTTCATATCGGGGATTCCCGCGCTTATCGTCTGCGTGGAAACACTCTCGAGCAGCTCAGCGCCGACCACACCGTTCTGCAAGAGTTATTAGATAAAGGCGTAATTTCTGAATCAGAGGCGCAAGTACATCCACAACGCTCAATGCTGACGCAGGCGCTGATGGGTGAGGGAAATCTTGAGCCATCACTTCATGTTTTCGAAGGAAAAGTTCAGGATCGCTACCTATTGTGCAGCGATGGCTTAAGTGGGGTTCTCACTGAGAAAGAGATAAAAACTCTTATCAAGGGTAAAGATGCCGCTGAAGCAGTTGATGCCTTGATTGATGCGACTTATATAAATGGAGCACCAGATAACGTCACGATCGTAATCGCTGATATTGTCGAAGAAAGTTCCACCCAAGTAGAAAAGTTAGGAGCGGCAAAACCATGATGAATAAAATTTTGGGCGGCCGCTACCAACTCGGCGCGATGATCGGCACCGGCGGAATGGCCGATGTTTATATTGCACAAGATCAGCGTTTATCTCGCGAAGTCGCAGTAAAGATTCTGCGTAGTGATCTCGCGAAAGATCCAACCTTCGTTTCACGCTTTCGTAAAGAAGCAAAAGCTGCCGCTGGTTTAAACCACCCAGGCGTGGTTGCCGTTTATGACTCAGGTGAAGAACCTGCGCCATACATTGTTATGGAGCTTGTCTCTGGCCACACCCTTCGCGAGTTAATTCATGGTGGCGAGCGTTTGCCACTTGATCGCGCACTTGAAATCGGCGGTGGAATTTTAGAAGCCGTTGATTATTCACATCAACGCGGCATCGTTCACCGCGACATCAAACCCGCAAACGTGATGATCACCGATGGTGGCGACGTGAAGGTGATGGATTTCGGAATTGCGCGCGCCATGGATGATCTCGGTGCAACGTTAACTAGCACCTGGAACGTGGTCGGTACCGCACAATACTTATCTCCGGAACAAGCTCTTGGTGAAGTTGCAGATACTCCGACAGATATTTACTCAACTGGTTGCTTGTTATACGAACTGTTAACTGGCCGTCCCCCATTTACCGGTGAGACTCCAGTATCAATCGCCTACCAACATGTTTCAGGTCATTTAACACCGGCTCGACAACTTCAACCTGATCTTCCCGAGAGCGTTGAAGTTATGTTGGAAGTTGCTCTATCTAAGAAACCAGAAGATCGTTATATTTCTGCGCGCGCGATGCTCGAAGATTTAAATAAAATTCGCGCCGGTGAAAGTATTTCAACGAAAGTTGCACGTGGGCCGCTTGTCTCTCGTCGCACCGCTGTTATCTCTGGAATTTCGCTCTTGTTAGTTGGCGTTCTTGCCGTTGTAGGTCTTTCGCTAAACGGCGGTTCAACTCCTTCGCTAGGCAATGAAATCCCTAACGTGGTTGGCCTTACTGAAGATCAAGCCCGGGCTCTGTTAGGTGATTACACAGTCACGGTGCAGCGTGCGCCTGATCCACGGATTCCTAAAGATCGCGTTGCCAGCCAAATTCCACTTGCAACTAGCCAAGCAACAAAGGGTTCTGCCGTAACGCTAACTATCTCTGATGGTCCAGGTGATGCGATCGTTCCCGATGATTTAGTTGGAATGTCTTTAATTGATGCCCGCACTTCACTTGCTGCGGCAGGTTTAGTTATCTCTCGCACTGAAGCTGCACCATCAGATGAACCGCAAGGAACTGTATTAAGCGTGCTTCCTGAACCTGGTTCAACAATTACCGCCGGCACTGGAGTTGTCTTAACTATCGCTAGCGGAGAAGTTGAAGTTCCTAATTTGATTGGCGTTGAAGCGATCCAAGCGAAAACGCTTTTGATTCAGGCTGGCTTCTTAGTTCGCGAATTCAATGATTATGATGCCGCGCAGGCTGTTGGCGTTGTTATTCGCCAAGCACCTGATGCGGGCACAACTCAAACTATTGGTAAGCCTGTAACAATTACAATTAATAAAGCGCCGTAATTTAAGCGCGCTTACCAACAACTGGAGATAAGCCCACAGACTTTGCGAGCGCATCGATGTCTCCGCATTGTGTTAACCAGTTAGCGAGCATTAAATGTCCGTGTTCAGTCAGTACTGATTCGGGGTGGAACTGCACACCTTCAATTGGCAAAGTTTTATGGCGCATCGACATAACAACGCCAGATTCTGTTGAACCAGTAATCTCTAATTCGCTGCTAACCGTGTCGCGTTCTACTGCGAGTGAGTGGTAACGAGTTGCCGTAAATGGTGATGGAAGGCTTGTTAAAACACCTTTTCCTTCGTGCAACACTTGTGAAGTTTTTCCATGCAGTAACTCGGGTGCGCGCGAAACAGTTGCGCCAAATGCAACGCCGATTGCTTGATGTCCTAAGCAAACGCCAAAGAGTGGAATGTTTTTATCTGCGCAATATTTGATCATCGCGATACTGATGCCGGCCTCTTCTGGGGTGCCGGGCCCGGGAGAGATAAGTACTCCGTCATATTCGCCGGCTTCCTCAACGCTTACTTCGTCATTTCGCTTAACGATGCATTCGGCGCCAAGTTGGCCGAGGTATTGCACCAAGTTAAATACGAACGAGTCGTAGTTATCGATGACGAGGATGCGGGACATGGGTCAATTCTTGCAGGGTTGGCGGGCGGGCAGTTTCCAGATTTGTTCTTTTCAGTGCTGCAGTTTCCAGGATTGGGTTTTCCATGTAGCCTTCTACCTATGCCAAAGTCGAAATTGCGTAAAAAGGTTCAAGAAAAGCACGCTCACGAAGAGCAGATCTTTGATGCTGAGCCACACGGCCCAATCGAGAGCCCAAGCTGGCTTGCTCCAGTTATGGTCGCCAACTTCCTGGTCGGCCTCTTCTGGATCGTCGTCTTCTACGTAAGTCAGACCACCTACCCAATTCCGGGTATCGGCGCATGGAACATGATTATCGGCTTTAGCTTTATTGGGGTTGGATTCAGCCTCGCAACCAAGTGGCGTTAGGCGAATAACCCTCTTTTTTACACCTTTTCTCAGCCTTCTCTCAGCCTATGAGAGCATTCTCAAACCATGCCTAAGAGCTCTGAAAGCACCAATTACCGAATTCTCGTCGTTGATGATGAATCCAGTATCAGCGATCTAATCTCAACAAGTCTGCGCTTCGTCGGTTTTGAAGTTCGCACTGCTGCGACCGGCTCTGAAGCTTTAACAGTTGCTGAAGAATTCAAACCGCATGCTGTCGTACTCGATGTGATGCTTCCAGATTCAGATGGTTTTGAAGTCTGTCGCCAACTGCGCAGCGAAGGTTTAAATATTGGCGTGCTCTTCTTAACCGCTAAAGATGGCATGGAAGATAAAGTTGCCGGGTTAACAATCGGTGGCGATGATTACATGACCAAACCATTTAGTTTGGAAGAACTTGTTGCGCGTTTGCGTGCCCTGCTTCGCCGTATTGGTGTTGCTGAGATAAATACCGATGATGAAAAGATTCGCTTTGCTGACCTTGAACTAAATGAAGCAACTCATGAGGTTCGTCGCGGCGGCGTTTTACTTGAGATGTCTCCAACTGAATTTCAACTTCTGCGCTACTTATTAATTAACGCCGATCGCGTTGTCTCTAAATCTCAAATTCTTGATCACGTCTGGCAATACGATTTCCGCGGTGATGCCGGAATTGTCGAAACATATATCTCCTACTTACGAAAGAAGATCGATATCTTCGATCCACCACTAATTCATACGGTTCGCGGCGTTGGTTACCGCTTGCGCTTGCCAGCAGCGAAGTAGCATGAGCAATGCCATCGTTCTCTAGTTGGAGTCTGCTCAACCGCCTGACTTTAGGTGTTGTTCTGCTTTCTACACTAGGTGTGGTGGCGTCAGATATCTCTGCTCAATCGCTACTTCGAACTTACTTAACTCAAGAGGTAGATAACGAACTATTAAGTGTCGCTGGTGGTTCTATCCCGCGACTAGAACGTGCGGGAATCGAATCTGATTACGATGATTTCAGCGACAACGAAGCTCGTTCAATGCAGATGCGGCCACTGCGCAGTATCCCAACTTCAACTTCAGTCACTTTGATTGGACCTGCAGGAATAGTTCTAGGCCAAATTGGTGGAGATTTAAATGCCACCGAGATAACTAGTTATTTAACGGCAATTACCCCTGAAAAAGCTTCTGAAAATGGAGATCGCCCGTTCACTATTGAAACTGACGAACATGATTTCCGCGTTCTTTTGCGAACACTTCCTTCTGGTTTAGGAAATGTAGTAATCGCACATTCATTTGAAGATATCGATCGTATTCTCGCTCGCCTGCAAGGTCTCTTTATTCTTATTGGTTTATTCATGATCTTCTTTATCGCACTCGCCTCTCGCAAGATGATTATCGTTGGACTTAAGCCACTGGCAAACGTTGAAGCCACAGCCGAAAGAATTGCTGAAGGAGATCTAACTGCGCGTCTTCCAGATGTAAAACCAAATACTGAAGTCGGGCGCTTGGTCGGTACGCTAAATACAATGTTGGGGCGCATTGAGGAATCTTTCGCTGCGCGTTTAGAGAGCGAAAGTAAGTTACGCCGCTTTGTGGCCGATGCTTCACACGAACTTCGTACACCGATTACCGCTATCCGCGGCTTTGCCGAACTCCATCGTCAGGGAGCGGTTACTGGTGAAGAGAAGACGAAAGAGCTAATCGGTCGTATCGAGAACGAATCTAAGCGGATGGGTTCACTTGTAGAGGATCTACTGCTCTTGGCTCGCCTTGATCAATCGCGGGAAATGAAATCAGATCCGATTAACCTCACACAAATTGTCTCCGATGCAGTTGCATCTGCACGCGCCGCCGGACAAAATCACACCGTCAACTTTAATGAGCAAGCTGAAGAAATTTACGCACTTGGAGATAAGGATCGAATTCATCAAGTTATCGCAAACTTGCTTGCCAATGCGCGTACTCACACACCTGCCGGAACGGTTATAGATGTCTCACTAAAGCAAGATATTGATGGAGTTCGTATTCGAATTGCGGATAACGGTCCTGGTTTAACGGATGCGGATCAAGCGCGAATCTTTGAGCGCTTCTATCGCGCTGATGCTTCGCGTGTTCGCACCGATGGTGAAGGAACAGGTTTAGGACTTTCCATTGTTGATGCGGTTATGCGCGCCCATGCTGGCCAAGTAAGTGTTCAATCCGAAGTCGGAAAAGGTGCTGCATTTACCCTCTTCTTCCCGTTAGGAAGTTAGTTTCCTAACGACTCCATCGCACGCAGAGTTGAAATGCGCTCTTCAATTGGCGGATGGGTAGAGAACATCTTGGAAACTGCTTTGCCATCCAGCGGTGACTCAATCCAGATATGAGCAACTGCATTTGATTTCTGTTGCACGACCGTTGAATCTGCCGCAAGTACTTCAAGTGCTTTGCGAAGACCGGCAGGGTTGCGAGTAAATGAAACAGCAGTGGCATCAGCGAGTGATTCACGACGGCGTGAAATTGCGCTCTTAAGTAGTGTTGCCGCAATTGGCGCCAGGATTAAAACGAATAGTGAAACTACCAACATGATTGGGTTGGCGTTGCTATCGCGATCGCGACGACCACCACCAAACCACATCATGCGCATTAAGAAATCGCTCAAGATCGCAATTGCACCTGCAGTAGTTGCGGCAACTGCAGAAACTAAAGTATCGCGGTTTGCTACGTGCGCCATCTCGTGCGCAATAACGCCTTGCAGTTCATCACGATCCATAACTTCCAAGATGCGAGTTGTAAATGCAATTAACGCATGATCGGGATCGCGCCCTGTTGCAAAAGCGTTTGGCGCAGTGTCGACCACGATTGCAACCTGTGGCATTGGTAGTCCACTTGCGATAACAACTTCTTCAATGACGTTAAAGAGTTCGGGGTTATCTTCGCGCTGAATTAACTTAGCTCCAGTCATCTTTAGAACTAATTTATCTGAGCCGTAATACGAACCCCATACGCCGATCAGTGAAATACCAACTGCAAGTGGCACCATGAAGGTTGATGTGCCGGCGCCGAAGTAAGTTAGCGCTGCATATGCAACTAGACCGGTGAGTAAACCCATTGAGGCAAGAAGGAAATAAGTCTTTCCTTTATTTGTGCTCTGTTGGGCGCGAAAATCGTTTGTTGCCATTGTTTCTTAGAAAGTTACCTTTGGAACGTTGCGATCTTGAGGATCGGTAACTTCATAAAATTCGCGCTCTGTAACCTTTGCAAAACCTGCGAAAAAGCTGGTCGGGATGGTTTTTACAGCGGTGTTGAGGTTGCGAACGTTGTCGTTATAGAACTGACGTGAGAAAGCAACTTTATTTTCAGTTGTTGAAAGTTCTTCTTGTAGAGATAAGAAGTTTGATGATGCTTTGAGATCTGGGTAAGCCTCAGCGACTGCGAGCAATCCGCGGAGCGCTTGCGTCAACATTCCATCAGCGGCTGCAACATCGGCAACCGTTGAAGCAGTGGTCGCCTTAGCGCGCGCTGTAACAACGGCATCGAAGGTGCCTTGTTCGTGCGCGGCATAACCTTTTACGGTTTCAACTAAATTTGGGATTAAGTCAGAGCGGCGCTTTAACTGCACTTCGATCTGAGCAAATGATTCATCGACGCTGATATTTAAGCGGATGAGTCGGTTGTATTGAGCAACGAAGAAAAGTACGAGAAGACCAACAACTGCGAGCGCGATAATGATTTCCATAATTAATTCAACCCCTTAAGTAAAGGTTGTATTCCCTTTACTAGTTTATTGTGGATTACTTGATCTTGCTTTGGTGGAAATTTAAGAAGGAGCGGCTTGCGGTTGGTCCGCGTTGACCCTGATATCTAGAGCCGTATTTCGCACTGCCGTATGGATGCTCAGCAGGAGATGAGAGTTTGATCAGACAGAGCTGACCGATCTTCATTCCTGGGAATAGCTTTACCGGCAGATTTGCTACGTTGGAAAGTTCAAGGGTGATGTGTCCAGAAAATCCCGGATCGATAAAGCCAGCAGTTGAATGTGTAAGCAAGCCAAGGCGGCCGAGTGAAGATTTTCCTTCAAGGCGACCGGCAATATCATCGGGCAAGGTGATGATTTCGTAGGTGCTGGCGAGAACGAATTCGCCTGGGTGCAAGATAAATGCTTCGCCATCTTCGGCGACGACTTCGCGAGTTAGCTCGGCTTGTTCGATCGATGGATCGATAACTGAGTACTTGTGGTTTTCAAAGACGCGGAAGAATTTATCGAGGCGGACATCAACCGATGAAGGTTGGATCATCGCCTCTTCAAATGGTTCGACGGCGACGCGGCCGGCGTTGATTTCAGAGCGGATATCGCGATCACTTAGAAGCAC
Protein sequences of DBSCAN-SWA_5 >NZ_CP016783|1373806:1384381|1380107_1380824_+|WP_095689580.1|DBSCAN-SWA MPKSSESTNYRILVVDDESSISDLISTSLRFVGFEVRTAATGSEALTVAEEFKPHAVVLDVMLPDSDGFEVCRQLRSEGLNIGVLFLTAKDGMEDKVAGLTIGGDDYMTKPFSLEELVARLRALLRRIGVAEINTDDEKIRFADLELNEATHEVRRGGVLLEMSPTEFQLLRYLLINADRVVSKSQILDHVWQYDFRGDAGIVETYISYLRKKIDIFDPPLIHTVRGVGYRLRLPAAK >NZ_CP016783|1373806:1384381|1383195_1383747_-|WP_095677797.1|DBSCAN-SWA MEIIIALAVVGLLVLFFVAQYNRLIRLNISVDESFAQIEVQLKRRSDLIPNLVETVKGYAAHEQGTFDAVVTARAKATTASTVADVAAADGMLTQALRGLLAVAEAYPDLKASSNFLSLQEELSTTENKVAFSRQFYNDNVRNLNTAVKTIPTSFFAGFAKVTEREFYEVTDPQDRNVPKVTF >NZ_CP016783|1373806:1384381|1382277_1383189_-|WP_095677796.1|protease|DBSCAN-SWA MATNDFRAQQSTNKGKTYFLLASMGLLTGLVAYAALTYFGAGTSTFMVPLAVGISLIGVWGSYYGSDKLVLKMTGAKLIQREDNPELFNVIEEVVIASGLPMPQVAIVVDTAPNAFATGRDPDHALIAFTTRILEVMDRDELQGVIAHEMAHVANRDTLVSAVAATTAGAIAILSDFLMRMMWFGGGRRDRDSNANPIMLVVSLFVLILAPIAATLLKSAISRRRESLADATAVSFTRNPAGLRKALEVLAADSTVVQQKSNAVAHIWIESPLDGKAVSKMFSTHPPIEERISTLRAMESLGN >NZ_CP016783|1373806:1384381|1380832_1382281_+|WP_095677795.1|DBSCAN-SWA MPSFSSWSLLNRLTLGVVLLSTLGVVASDISAQSLLRTYLTQEVDNELLSVAGGSIPRLERAGIESDYDDFSDNEARSMQMRPLRSIPTSTSVTLIGPAGIVLGQIGGDLNATEITSYLTAITPEKASENGDRPFTIETDEHDFRVLLRTLPSGLGNVVIAHSFEDIDRILARLQGLFILIGLFMIFFIALASRKMIIVGLKPLANVEATAERIAEGDLTARLPDVKPNTEVGRLVGTLNTMLGRIEESFAARLESESKLRRFVADASHELRTPITAIRGFAELHRQGAVTGEEKTKELIGRIENESKRMGSLVEDLLLLARLDQSREMKSDPINLTQIVSDAVASARAAGQNHTVNFNEQAEEIYALGDKDRIHQVIANLLANARTHTPAGTVIDVSLKQDIDGVRIRIADNGPGLTDADQARIFERFYRADASRVRTDGEGTGLGLSIVDAVMRAHAGQVSVQSEVGKGAAFTLFFPLGS >NZ_CP016783|1373806:1384381|1379045_1379675_-|WP_095689579.1|DBSCAN-SWA MSRILVIDNYDSFVFNLVQYLGQLGAECIVKRNDEVSVEEAGEYDGVLISPGPGTPEEAGISIAMIKYCADKNIPLFGVCLGHQAIGVAFGATVSRAPELLHGKTSQVLHEGKGVLTSLPSPFTATRYHSLAVERDTVSSELEITGSTESGVVMSMRHKTLPIEGVQFHPESVLTEHGHLMLANWLTQCGDIDALAKSVGLSPVVGKRA >NZ_CP016783|1373806:1384381|1375078_1375918_-|WP_095689575.1|DBSCAN-SWA MSAADSHRFAYLGPVGTFTEAALNKITCDTDVKIPYANVTAALNAVRTGEVDYALVPIENSVEGVVARTLDELATGDPLTISGEVTLPVTFALMTKPGATEIKRIGTHPHAESQCRSYIAKNYPHAEVVPTNSTSAAAELVSKGELDAAIASSVAAEHFGLEIRDENIGDNNGAVTRFIAAQKPGWIPEPTGDDRTSMAIFIDIDHAGALLEILQVFAKRNVNLTFIQSRPTGRELGHYHFIIDVEGHINDEAVGGSVAELREICDDIRFLGSYPREVR >NZ_CP016783|1373806:1384381|1373806_1375075_-|WP_095689574.1|tRNA|DBSCAN-SWA MIDIKLLRENPDLVRASQTGRGEDVTIVDQVLAIDDVRRAAITEFETLRAEQNTLSKSVGAAKGDEKAALLENAKALADKVKAADAKRAEIEAKANALVMQLSNLLDPEAPIGGEADFVTIEHVGTPRDFAKDGFEAKDHVELGKLLGAIDTERGAKVSGSRSYYLTGVGAMLEFALVNYAIQSALKAGFTPVIPPVLVKPAAMEGTGFLGQAAENVYHLEKDDAYLVGTSEVPLAAFHMDEVLPADKLPMRYAGYSTCFRREAGSYGKDTRGIIRVHQFDKVEMFSFCKPEEAKEEHKRLLQWEKDFLNAMEIPFRVIDVASGDLGSSATRKFDIEAWIPTQKAYREVTSTSNCTEFQARRLNIRYKDADGTKAVATLNGTLVAIPRMIVAILENHQNADGTVNVPAALQPFLGTKRFELV >NZ_CP016783|1373806:1384381|1379774_1380041_+|WP_095671740.1|DBSCAN-SWA MPKSKLRKKVQEKHAHEEQIFDAEPHGPIESPSWLAPVMVANFLVGLFWIVVFYVSQTTYPIPGIGAWNMIIGFSFIGVGFSLATKWR >NZ_CP016783|1373806:1384381|1383802_1384381_-|WP_095671745.1|DBSCAN-SWA MLLSDRDIRSEINAGRVAVEPFEEAMIQPSSVDVRLDKFFRVFENHKYSVIDPSIEQAELTREVVAEDGEAFILHPGEFVLASTYEIITLPDDIAGRLEGKSSLGRLGLLTHSTAGFIDPGFSGHITLELSNVANLPVKLFPGMKIGQLCLIKLSSPAEHPYGSAKYGSRYQGQRGPTASRSFLNFHQSKIK >NZ_CP016783|1373806:1384381|1377493_1379044_+|WP_095689578.1|DBSCAN-SWA MMNKILGGRYQLGAMIGTGGMADVYIAQDQRLSREVAVKILRSDLAKDPTFVSRFRKEAKAAAGLNHPGVVAVYDSGEEPAPYIVMELVSGHTLRELIHGGERLPLDRALEIGGGILEAVDYSHQRGIVHRDIKPANVMITDGGDVKVMDFGIARAMDDLGATLTSTWNVVGTAQYLSPEQALGEVADTPTDIYSTGCLLYELLTGRPPFTGETPVSIAYQHVSGHLTPARQLQPDLPESVEVMLEVALSKKPEDRYISARAMLEDLNKIRAGESISTKVARGPLVSRRTAVISGISLLLVGVLAVVGLSLNGGSTPSLGNEIPNVVGLTEDQARALLGDYTVTVQRAPDPRIPKDRVASQIPLATSQATKGSAVTLTISDGPGDAIVPDDLVGMSLIDARTSLAAAGLVISRTEAAPSDEPQGTVLSVLPEPGSTITAGTGVVLTIASGEVEVPNLIGVEAIQAKTLLIQAGFLVREFNDYDAAQAVGVVIRQAPDAGTTQTIGKPVTITINKAP >NZ_CP016783|1373806:1384381|1376720_1377497_+|WP_095689577.1|DBSCAN-SWA MKALYFQCAARSALGLVRNGNEDSALIGAQVIAVADGMGGHAGGEVASSVAINTLSEIAPTFTAENIDLDSASDLYLNSLHTIDAQIRAVASDEPALSGMGTTLTSLFISGTEVSLLHIGDSRAYRLRGNTLEQLSADHTVLQELLDKGVISESEAQVHPQRSMLTQALMGEGNLEPSLHVFEGKVQDRYLLCSDGLSGVLTEKEIKTLIKGKDAAEAVDALIDATYINGAPDNVTIVIADIVEESSTQVEKLGAAKP >NZ_CP016783|1373806:1384381|1375959_1376664_-|WP_095689576.1|DBSCAN-SWA MKIYAHRGASYDFPEMTMAAYEGAVKQGADGFECDLRLTKDGIVVLWHDADLKRRANNAAVVHESTYAQLVAIYPQIVTLDQFIDFGITEKKSLLLETKHPVPSRTEIENQVVQKLHKEAKRIEKAGIDVTVMSFSWFAIERIKALDSKISTTYLLHDYTPWFSTRYSSAHSIGPGIELLRKKPALAQKIKAAGKKLNVWTVDEPSDIILCRHLGVDNLITNRPAFAREVLASN |
12 | Bacillus_phage(25.0%) | tRNA,protease | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|