Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP047045 | Caulobacteraceae bacterium 0127_4 chromosome, complete genome | 1 crisprs | csa3,WYL,cas3,DEDDh | 0 | 2 | 3 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP047045_1 | 3277770-3278108 | Orphan |
NA
Consensus repeat of NZ_CP047045_1
|
5 spacers
spacers of NZ_CP047045_1
>1.1|3277796|29|NZ_CP047045|CRISPRCasFinder GGAAAACAGACCGAGATCGACACCACGAG >1.2|3277851|64|NZ_CP047045|CRISPRCasFinder CGAAGAATCCTTCAACGACGATCACTCGACCAGCAATCAAGTCGATAGCGAAGTGAAGACGGAC >1.3|3277941|28|NZ_CP047045|CRISPRCasFinder CGACAACTCGATCAACACGGAGAACGAG >1.4|3277995|34|NZ_CP047045|CRISPRCasFinder GGACGTCGAACTCGAAGAATCTTTCAACACCGAA >1.5|3278055|28|NZ_CP047045|CRISPRCasFinder GGACAACTCGATCGACACCGACAACTCC |
CRISPR arrays and Neighbor proteins around NZ_CP047045_1
The CRISPR arrays of NZ_CP047045_1 >merge|NZ_CP047045|1|3277770-3278108|CRISPRCasFinder TTCGAAGCCGAAGACAGCTTCAACACGGAAAACAGACCGAGATCGACACCACGAGATCGAAACGGAAGGATCTTTCAACACCGAAGAATCCTTCAACGACGATCACTCGACCAGCAATCAAGTCGATAGCGAAGTGAAGACGGACGTTGACGTCGAAGACTCTTTCAACACCGACAACTCGATCAACACGGAGAACGAGTCCGAGATCGAAGAATCCTACAACACGGACGTCGAACTCGAAGAATCTTTCAACACCGAAGTTGAGAACGAAGAGTCCTTCAACACGGACAACTCGATCGACACCGACAACTCCGTTGAGAACGAAGAGTCGTTCAACAC >NZ_CP047045|1|1|3277770-3278108|CRISPRCasFinder TTCGAAGCCGAAGACAGCTTCAACAC GGAAAACAGACCGAGATCGACACCACGAG ATCGAAACGGAAGGATCTTTCAACAC CGAAGAATCCTTCAACGACGATCACTCGACCAGCAATCAAGTCGATAGCGAAGTGAAGACGGAC GTTGACGTCGAAGACTCTTTCAACAC CGACAACTCGATCAACACGGAGAACGAG TCCGAGATCGAAGAATCCTACAACAC GGACGTCGAACTCGAAGAATCTTTCAACACCGAA GTTGAGAACGAAGAGTCCTTCAACAC GGACAACTCGATCGACACCGACAACTCC GTTGAGAACGAAGAGTCGTTCAACAC
>NZ_CP047045.1|WP_158767324.1|3277606_3277744_+|hypothetical-protein MQLRLFSSVCAVALGLAFALMAWADDGSSTDEGSASAVENSVAAR >NZ_CP047045.1|WP_158767323.1|3276118_3277192_-|SDR-family-NAD(P)-dependent-oxidoreductase MTGYCLGRPEVRFDLAALKAFYADKRVLITGAAGSVGSALSLELARLGCAHLAMLDQFDHGLINIVESVRRIAPKLQITEALCDVRDSGRLDAWVRRIEPDVVIHSAALKHVHLGERHPVECVLTNLLGVRNALSAAVNAGAGHFMLISSDKAAAPSCVMGATKRLAELHLTGFQMERPTATRLKAVRFGNVLGSQGSVLPRFEAQIAAGGPLEVTHEDMERFFMSVQEAVGLILSVTAYGDEGAGTYFMEMGAPISIIELGRDMIRASGKEIAVEITGLRPGEKLKEQLADECEAITPTTLPGVFRVTPIAEDAYVTAADVAHFEALARTMENAVVRQRVFACLDQRLQRPARVAG >NZ_CP047045.1|WP_158767322.1|3275128_3275986_-|hypothetical-protein MRLYIHIGIGKTGTSSIQHMLANSAQALADCGFYYPQQGRNGTAAHHSLAAFDVDDLGVGIEAYFKALLEELDAQSAPNAILSSEGFCFCRPRVVRRIGELLSSYHVRVIFYARRPVELIASSYLEKLKAGQLTNATIEQFYKVCLAERSFFMSDRLDSWAIEFGRQALSVRLYDRRFLKGDSVSDFLDVIGAGEMPADMGEVQENPTLSSAFVPCLEAFDRAAPSSPMRPHIVAALVNASEDVVGSDIFSAATLKQIARDHAVANAIFAQTYLSREEARAFIAP >NZ_CP047045.1|WP_158767321.1|3273817_3274297_-|hypothetical-protein MRILTAIRANLEVHHETLEDPDLMLMLAEGETSLLETLDFMLEADLFDEGLLHGLKTQKDTLAVRLHRIEERRQSRRAILEQALLLMERKSLERPVATLSLSERPPNLIVEEEAQIPSRFFDLKPTLNRRLTKEALTSGEEVPGARLSNGSISLTVRRR >NZ_CP047045.1|WP_158767320.1|3272802_3273816_-|phage-recombination-protein-Bet MSGWSAHKDATKREFTSAQLKLIRRTIARQCTEIEFDQFIAVSVQAGLDPLRRQMAPLILNASDPERRRMVPWATIDGLRVIAARQGDYRPMETAPLIERDESRLDADLNPLGITRAEVCAWKSSDGVWHPVAGEAWWDEYAPTREEWAADATGQHHPTGKRQLDPVWLRMGRVMIAKCAEAQALRRGWPDILSGLYGEEELHGLRLAEQTASEVLREGDEAAKRRLLKTRTLWFVFGSDGGFNPVLAHEAFDRLRGFYAEASVEEIERFDQVNSTSLHTLWEWAPSDAFALKQISEARRLFGKANTEVSAEAPPAGSHQGDPQSPKQAPPTASVGS >NZ_CP047045.1|WP_158767319.1|3272302_3272806_-|hypothetical-protein MKTRAGKPNEIHCRQRPSRRLKQCIWDAQGGRCLACDQRLVASEFDHVVPLGLGGSNAPDNWAALCVSCHRTKTVEDLRRIAKAKRQRRYHETGRSRAAKTWSPFNSAAKQGFSKTLRRHLNGFVTRRCPCAVCSGENDFSQSSPDGADDGGDAAPKCTGSSDDRSS >NZ_CP047045.1|WP_158767318.1|3272103_3272319_-|hypothetical-protein MIDQAEARAASDALYSVIMGLLEEGLDRGASAPRGDDPCIARGEIFRALAADLATLAEAAALLGRFADRSP >NZ_CP047045.1|WP_158767317.1|3270864_3271560_+|cephalosporin-hydroxylase MPNDQHRARPQSPDQVRGRSFRTALSAETLSSIQTGALQTLYRDRGFLKSPFDVALYLQLIGRLRPASIIEIGTKHGGSALWFADQMTAHGLRARVVSVDLSPPADLSDARIQFVAGDAHDLSAALTHDLLASLPHPWLVSEDSAHTFEACTAVLRFFDNHLVVGDYIVIEDGVLSDMAERHYETYEHGPNRAVERFLSEHVDTYEIDGALCDFFGQNVTWNPNAWLRRAR >NZ_CP047045.1|WP_158767316.1|3269821_3270733_-|glycosyltransferase MTVASERPKLSVIIIGYNMARELPRTIRSMSPAMQRGLHDSDYELILLDNGSTQPFDANLLLGLAPNLSIHRVQNPSASPVGAIRLGLELARGDLVGVCIDGARMASPGLFSTALAASKLHAKPMIGTLAFHLGPEVQMQSVLKGYNQTVEDQLLDSSSWETDGYRLFGVSAFAGSSNDGWFVTPAETNALFLTARHWRELGGYDQRFQTPGGGLANLDIWLRVCEDHSGALIMLLGEATFHQVHGGIATNNPVSPWEQFHEEYMRLRGKAFAKSKRGPLFYGALNRETYPSLRRSINALTPV >NZ_CP047045.1|WP_158767315.1|3268761_3269694_-|glycosyltransferase-family-2-protein MQRTPFLSVIVVLYDMVRESERSLFSLSTAYQSQVGAEDYEVIVVENGSRQPVSRERAESFGPNFRYLDISRDVALPSPCTAINAGVAMARAPFVGIMIDGARIASPGVLMLAIQALKGFNRAVVATIGLHLGPAMQTRAAETGYNQQVEDALLASVPWRENGYKLFEVSVLTGVNTAAWFGPMAESNLIFLSRAMYEEAGGFDPRFDIPGGGIANLDFYNCVGGLPGATLISLFGEATFHQIHGGVMSSRPAETIADEVQRYMAQYHGIHGKAFQFSQQIPLLLGQFRPEVAACIKAAEPRWNAESSPP >NZ_CP047045.1|WP_158767326.1|3278649_3279573_+|NAD-dependent-epimerase/dehydratase-family-protein MQVVVVTGAAGLVGQNLIPRLKSQFRIVAVDKHAHNVGVLRKLHPDIEVIEADMAEAGNWEDRVTQADAVIALQAQIGGLDPEPFHRNNVLSTERLIAAAKRGERPYLVQVSSSVVRSAACDLYTESKKAQETLALKSGLEHVILRPTLMFGWFDRKHLGWLRRFMERTPVFPIPGSGDYLRQPLYAGDFAAIIASSLERRTQGIYNISGLERVTYVQMIRMIKQTVGAKTAIVHIPYAAFWTLLKVYSWFDKNPPFTTLQLQALVTPDVFEEIDWPALFDVQRTPLERALEETFLDPTYSHVALEF >NZ_CP047045.1|WP_158767327.1|3279576_3280863_+|NAD(P)-binding-protein MAKVIVIGAGPMGLAAAYEALKRGHEVDLLEASDRPGGMAAHFDFDGLSIERFYHFCCLSDRDTIALLDELGLNGALQWVSTKMGYFVDGKLYRWGDPFALLTFPKLGLVDKIRYGVQVFLSTKRSDWQRLDKISAKKWFTEWLGEDLYNKLWRPLLELKFYELTDKISAAWVWQRIKRLGNSRKSLLEERLGYIEGGSETLVKALVGAIENKGGRIRLKTPAKTFLIENGAVRGVETASGETIAADFVVSTAPMPLVPGMLSQAPELRPAYERMDNVGVVCVLHKLKRSVSDNFWINISDPDFEIPGLVEFSNLRPLANTVVYVPYYMPATQPKWGWSDEQFVAESWGYLKRINPALADDDRLASHVGRLRYAQPVCEVGFAELIPPAQTPIKGLQIADTCFYYPEDRGVSESIRYARNMIAEMGAA >NZ_CP047045.1|WP_158767328.1|3280859_3281276_+|GtrA-family-protein MKLSGEILRFVGVGAFAALVNWVSRIALSVVLPLSAAIIVAYLIGMITAYALSRKYVFQPTERGVGSELTRFALVNVVALVQVWAVTIVMAEYVLPALHVDWRPLEVAHAVGVASPIVTSYLGHRYFSFAQARKSGRG >NZ_CP047045.1|WP_158767329.1|3281280_3282717_+|UbiA-family-prenyltransferase MNTEAAFDCPLVVDMDGALLRTDTTFEGLARALFAKPVTTMLACASILRGRAAFKRAIAEIVQIDVESLPLREDFVAHLKQERARGRHLHLVSGSDHQVVERVAARLGLFESAQGSSSGHNLKGANKARFLVERFGRFAYAGDSPADLKVWPHAQSAVLAGASPETARRARKLAVPIEREFLDPPRTVKHWMRTLRLHQWAKNILLFVPLLLSGHFTDADLVLRCGLGFLILGLTASGTYIVNDLADLAADRRHRSKKERPFAAGVLKVYQGLMVAPALIGGGLVAAFLLSPAFAAALLSYLVCTLAYSLRLKAIPFLDVMLLAWLYTLRLLMGVALAQSTSSVWLLTFSMMFFFSMSLAKRHVEVAAASPDQDEIAGRGYLPMDAPVTLAFGISSSVASLLIMTLYLMEEAFPSNVYGQPALLWLVLPIVGLWTMRIWLLAHRGELDDDPVAFAVKDKVSIVLGSAMALAFAIAVFG >NZ_CP047045.1|WP_158767330.1|3282713_3284048_+|FAD-binding-protein MSTAYVNDDTRLSWGRVVRSHHLIAKPRFVDEIAPALADASVMGLRALPVGLGRSYGDSNLNPGGALIDLSKLDRIVAFDTQNGVLRADSGISLSDILRFSVPRGWFLPTTPGTRFVTLGGSIANDVHGKNHHAAGSVGCSIRRVGLVRSDRGALELASDIEPELFAATIGGLGLTGVIAWAEIQMVPIVSAYIEQEVLPFDDLDSFFDIAEASQNTFEHTVAWIDCTASGRHLGRGLFTRGNWAPEGGLDAHSDKLKLTMPVDGTPLAFNALSLRVLNTMIRTAQSFKAAESRVHYEPHLYPLDAIGAWNRLYGRAGFYQYQCIVPPDGRAAIAELLCAIADEGAGSVLGVLKSFGPKRSPGLLSFPMEGFTLAMDFCNAGARTHALFARLDAIVRAANGRLYAAKDGRMPASMFQTSYPEWARFAKQIDPLLTSAFWDRVSQ >NZ_CP047045.1|WP_158767331.1|3284053_3284815_+|SDR-family-NAD(P)-dependent-oxidoreductase MSGSNRRVIVLGALSAMAEATCRMLAEEGAQLALLGRDAERLDTVARDLKTRGAAGVHVFARDLLDTSDTPSALQAAADSMGGANAVLIFYGVLGDQNRAETDLEEARRIIAVNFTSAAAWSLASADLLERSGGDGAVLVGVSSVAGDRGRRSNYVYGAAKGGLSILLQGIAHRFAAKPGGARAVTVKAGFVDTPMTAHLKKGPLWATPQQIAQVVRRAMDRGGPILYAPWIWRWIMLAIRLIPDAVFKRVNI >NZ_CP047045.1|WP_158767332.1|3284835_3285630_-|methyltransferase-domain-containing-protein MSTDGRPQQLARTAHFLELYRAYLVADVDMTRSSVESMENQWYVPVGHSAAQVIYSACVGSWLSEVRTVLDMPCGHGRILRHLTKLFPDAAIHACDIDEAGLQFCASQFGAHPILSKEIPEEVAFPVQYDCIWVGSLFTHLSKTMSERWLAHLARQLSPTGILITTWHGRWSAANGAEIQYIEPDKWRAILAEYESTGYGYASYLRGHQHQQYIEGDYGISLSTPVALMEMALAVPDVRVFSFTERGWAGHQDVLVLGKPQIMA >NZ_CP047045.1|WP_158767333.1|3285830_3287387_+|hypothetical-protein MSKALTTKTVVWDELRALPGHRLLIGFIIVAVCLRLIFWLYTGRTWEDAIISLTPARNLWDGFGLTHHASEPRVHSFTSGLGEIVLIIGEAVRAGLTTMRVVSIFAAAFALYYAFRVGVILSFHWSAQLLVLAYLAADHLQIFFGMGGMETQLATALVLANVYYYLNSNWTKLGIVGGLAVICRPELGLWGLILGAAIVLWHRQAFVKVAVPAILIAGVWFGFAALYYGSPIPHTITVKSGATMINNDIGQIATYLSSFWSHIAPFLQFWQVGEAPVPEILLQAVVALLLLLGSAGAVHAARFQPRMLAVLALLLAFLLYRAWGNVNPYFMWYMPPFVALLFLFAAAGISWLAQKYTSAAIGIACVLALAYSAPVFLAMPLDRAQQQVIEDNVRTRVGARLNELMSADDAVVLEPAGYVGWEIRPKTMYDFPGLTSPRAFEAWKKHHHMTGLIIELNPRFVVQRPPERLEFEEREPELAARYEAVETFRAEPGFHLSNAGLLYWPIDTEFTIYRRRDE >NZ_CP047045.1|WP_158767334.1|3287412_3289611_+|hypothetical-protein MRYIRYGSAVLVLLFLLCGLAIALWPEPAPRALVSSEGWLTTDQIGRVDVGSLPETIRQTHRSLQSTAFRTWTPESGARRGEVVSPGFQISPVMAVTIAGSTATADGGASVSIVCDSHTQSLPVFRGNVNTHFTEAIFEVDEGWCPGEARLQLRSREPGVNVGVGTVAEVSRITLWKRSAIGLFPFLVLAFVVLGALSIVGVLVARAARLTIPAAFAGLTTIGVAGLATFLAYTLGPSWDYGSALAILLVLLLGGIAVALPRALRQAVIDLSPAGAVWLLSATAFFFLSTLAYNGLGHWEPNYRFSPAHWSSDNELPWMFAETLRHNWNTEGVLGPWSFSDRPPLMTGALLLTADLFDLLQTGNDGNWLRGPAHNTSSILINTLWAPVFFTAAKHLFKLDTRVAALATLITAIIPFFVFNSIYGWPKLFGAAFAGVAIWSAIDRRSSIPISDRAVAFGLASALSILSHASNAIFLLPLALYFLPSLLRAPKALIGGVLAGLVMLAPWIAYQHFILPSNDPLLKYALTGDFGFAEPARTTLESARAFYAQLSLESWLQTKAAMAAQLFWPASTPLSQPPIHTLFGMHGVDALRQWDFYFLSAGNALLLAAAIVSAFKGRRDDNPIGALLCVTGSSYLLILLVFFHPLILHHAPYGALIALALAGFGGLAAYAPGWLRGIGLLAGIYGGVVWGLSPLRSALSIDLIAALGLAFCLGAALCSTLSDSRSTSSVEG >NZ_CP047045.1|WP_158767335.1|3289837_3290419_+|hypothetical-protein MTRQFAAFVMLAALSACGQPAEQAAPGETSAPVFRTDADLVAVPLDSIIGTADGLGTQYIEQISPGGGGLRSEPGPSGPTPTIVAPTPTTFTVTIPANSQEFVVMYGMSPESYTNGGTTKGACFAVAAVEIGGPRELAQRCLTPVETSADQGFQEFAVQVPPGVTQFQLQTTPAAPSGELTWGWSFWANPRAK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP047045_1 | 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder | 3278055-3278082 | 28 | NZ_CP047900 | Pseudarthrobacter sp. YJ56 plasmid unnamed2, complete sequence | 14627-14654 | 5 | 0.821 |
NZ_CP047045_1 | 1.1|3277796|29|NZ_CP047045|CRISPRCasFinder | 3277796-3277824 | 29 | NZ_CP043499 | Rhizobium grahamii strain BG7 plasmid unnamed, complete sequence | 1091779-1091807 | 6 | 0.793 |
NZ_CP047045_1 | 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder | 3278055-3278082 | 28 | MK864266 | Gordonia phage Arri, complete genome | 7789-7816 | 6 | 0.786 |
NZ_CP047045_1 | 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder | 3278055-3278082 | 28 | MN284907 | Gordonia phage Fireball, complete genome | 7721-7748 | 6 | 0.786 |
NZ_CP047045_1 | 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder | 3278055-3278082 | 28 | MK864264 | Gordonia phage VanDeWege, complete genome | 7909-7936 | 6 | 0.786 |
NZ_CP047045_1 | 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder | 3278055-3278082 | 28 | MK937603 | Gordonia phage Bakery, complete genome | 7490-7517 | 6 | 0.786 |
NZ_CP047045_1 | 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder | 3278055-3278082 | 28 | MH479910 | Gordonia phage Danyall, complete genome | 7617-7644 | 6 | 0.786 |
NZ_CP047045_1 | 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder | 3278055-3278082 | 28 | MT639651 | Gordonia phage Portcullis, complete genome | 7677-7704 | 6 | 0.786 |
NZ_CP047045_1 | 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder | 3278055-3278082 | 28 | MH479917 | Gordonia phage KimmyK, complete genome | 7745-7772 | 6 | 0.786 |
NZ_CP047045_1 | 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder | 3278055-3278082 | 28 | MH669015 | Gordonia phage TillyBobJoe, complete genome | 7432-7459 | 6 | 0.786 |
NZ_CP047045_1 | 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder | 3278055-3278082 | 28 | MK814761 | Gordonia phage SmokingBunny, complete genome | 7596-7623 | 6 | 0.786 |
NZ_CP047045_1 | 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder | 3278055-3278082 | 28 | KX557286 | Gordonia phage Twister6, complete genome | 7528-7555 | 6 | 0.786 |
NZ_CP047045_1 | 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder | 3278055-3278082 | 28 | MK864267 | Gordonia phage Valary, complete genome | 7981-8008 | 6 | 0.786 |
NZ_CP047045_1 | 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder | 3278055-3278082 | 28 | MK967381 | Gordonia phage RogerDodger, complete genome | 7981-8008 | 6 | 0.786 |
NZ_CP047045_1 | 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder | 3278055-3278082 | 28 | MT310872 | Gordonia phage Evamon, complete genome | 7598-7625 | 6 | 0.786 |
NZ_CP047045_1 | 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder | 3278055-3278082 | 28 | MT521998 | Gordonia phage Jambalaya, complete genome | 7429-7456 | 6 | 0.786 |
NZ_CP047045_1 | 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder | 3278055-3278082 | 28 | MK864265 | Gordonia phage Barb, complete genome | 7745-7772 | 6 | 0.786 |
NZ_CP047045_1 | 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder | 3278055-3278082 | 28 | NC_030913 | Gordonia phage Wizard, complete genome | 7721-7748 | 6 | 0.786 |
NZ_CP047045_1 | 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder | 3278055-3278082 | 28 | MK305889 | Gordonia phage Mutzi, complete genome | 7617-7644 | 6 | 0.786 |
NZ_CP047045_1 | 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder | 3278055-3278082 | 28 | MN010760 | Gordonia phage Nubi, complete genome | 7618-7645 | 6 | 0.786 |
NZ_CP047045_1 | 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder | 3278055-3278082 | 28 | MT723933 | Streptomyces phage Keanu, complete genome | 10862-10889 | 6 | 0.786 |
NZ_CP047045_1 | 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder | 3278055-3278082 | 28 | NZ_CP049749 | Rhodococcus fascians A21d2 plasmid pA21d2, complete sequence | 25445-25472 | 7 | 0.75 |
NZ_CP047045_1 | 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder | 3278055-3278082 | 28 | NZ_CP015236 | Rhodococcus fascians D188 plasmid pFiD188, complete sequence | 73079-73106 | 7 | 0.75 |
1. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to NZ_CP047900 (Pseudarthrobacter sp. YJ56 plasmid unnamed2, complete sequence) position: , mismatch: 5, identity: 0.821
ggacaactcgatcgacaccgacaactcc CRISPR spacer ctacgactcgatcgacaccgactactac Protospacer **.***************** *** *
2. spacer 1.1|3277796|29|NZ_CP047045|CRISPRCasFinder matches to NZ_CP043499 (Rhizobium grahamii strain BG7 plasmid unnamed, complete sequence) position: , mismatch: 6, identity: 0.793
ggaaaacagaccgagatcgacaccacgag CRISPR spacer agaacgttgaccgagatcgacgccacgag Protospacer .*** .. *************.*******
3. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MK864266 (Gordonia phage Arri, complete genome) position: , mismatch: 6, identity: 0.786
ggacaactcgatcgacaccgacaactcc CRISPR spacer ggagaactcgatcgacaccgacgggatc Protospacer *** ******************.. .*
4. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MN284907 (Gordonia phage Fireball, complete genome) position: , mismatch: 6, identity: 0.786
ggacaactcgatcgacaccgacaactcc CRISPR spacer ggagaactcgatcgacaccgacgggatc Protospacer *** ******************.. .*
5. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MK864264 (Gordonia phage VanDeWege, complete genome) position: , mismatch: 6, identity: 0.786
ggacaactcgatcgacaccgacaactcc CRISPR spacer ggagaactcgatcgacaccgacgggatc Protospacer *** ******************.. .*
6. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MK937603 (Gordonia phage Bakery, complete genome) position: , mismatch: 6, identity: 0.786
ggacaactcgatcgacaccgacaactcc CRISPR spacer ggagaactcgatcgacaccgacgggatc Protospacer *** ******************.. .*
7. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MH479910 (Gordonia phage Danyall, complete genome) position: , mismatch: 6, identity: 0.786
ggacaactcgatcgacaccgacaactcc CRISPR spacer ggagaactcgatcgacaccgacgggatc Protospacer *** ******************.. .*
8. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MT639651 (Gordonia phage Portcullis, complete genome) position: , mismatch: 6, identity: 0.786
ggacaactcgatcgacaccgacaactcc CRISPR spacer ggagaactcgatcgacaccgacgggatc Protospacer *** ******************.. .*
9. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MH479917 (Gordonia phage KimmyK, complete genome) position: , mismatch: 6, identity: 0.786
ggacaactcgatcgacaccgacaactcc CRISPR spacer ggagaactcgatcgacaccgacgggatc Protospacer *** ******************.. .*
10. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MH669015 (Gordonia phage TillyBobJoe, complete genome) position: , mismatch: 6, identity: 0.786
ggacaactcgatcgacaccgacaactcc CRISPR spacer ggagaactcgatcgacaccgacgggatc Protospacer *** ******************.. .*
11. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MK814761 (Gordonia phage SmokingBunny, complete genome) position: , mismatch: 6, identity: 0.786
ggacaactcgatcgacaccgacaactcc CRISPR spacer ggagaactcgatcgacaccgacgggatc Protospacer *** ******************.. .*
12. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to KX557286 (Gordonia phage Twister6, complete genome) position: , mismatch: 6, identity: 0.786
ggacaactcgatcgacaccgacaactcc CRISPR spacer ggagaactcgatcgacaccgacgggatc Protospacer *** ******************.. .*
13. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MK864267 (Gordonia phage Valary, complete genome) position: , mismatch: 6, identity: 0.786
ggacaactcgatcgacaccgacaactcc CRISPR spacer ggagaactcgatcgacaccgacgggatc Protospacer *** ******************.. .*
14. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MK967381 (Gordonia phage RogerDodger, complete genome) position: , mismatch: 6, identity: 0.786
ggacaactcgatcgacaccgacaactcc CRISPR spacer ggagaactcgatcgacaccgacgggatc Protospacer *** ******************.. .*
15. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MT310872 (Gordonia phage Evamon, complete genome) position: , mismatch: 6, identity: 0.786
ggacaactcgatcgacaccgacaactcc CRISPR spacer ggagaactcgatcgacaccgacgggatc Protospacer *** ******************.. .*
16. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MT521998 (Gordonia phage Jambalaya, complete genome) position: , mismatch: 6, identity: 0.786
ggacaactcgatcgacaccgacaactcc CRISPR spacer ggagaactcgatcgacaccgacgggatc Protospacer *** ******************.. .*
17. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MK864265 (Gordonia phage Barb, complete genome) position: , mismatch: 6, identity: 0.786
ggacaactcgatcgacaccgacaactcc CRISPR spacer ggagaactcgatcgacaccgacgggatc Protospacer *** ******************.. .*
18. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to NC_030913 (Gordonia phage Wizard, complete genome) position: , mismatch: 6, identity: 0.786
ggacaactcgatcgacaccgacaactcc CRISPR spacer ggagaactcgatcgacaccgacgggatc Protospacer *** ******************.. .*
19. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MK305889 (Gordonia phage Mutzi, complete genome) position: , mismatch: 6, identity: 0.786
ggacaactcgatcgacaccgacaactcc CRISPR spacer ggagaactcgatcgacaccgacgggatc Protospacer *** ******************.. .*
20. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MN010760 (Gordonia phage Nubi, complete genome) position: , mismatch: 6, identity: 0.786
ggacaactcgatcgacaccgacaactcc CRISPR spacer ggagaactcgatcgacaccgacgggatc Protospacer *** ******************.. .*
21. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MT723933 (Streptomyces phage Keanu, complete genome) position: , mismatch: 6, identity: 0.786
ggacaactcgatcgacaccgacaactcc CRISPR spacer cgacaacacgatcgacaccgccaagtgg Protospacer ****** ************ *** *
22. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to NZ_CP049749 (Rhodococcus fascians A21d2 plasmid pA21d2, complete sequence) position: , mismatch: 7, identity: 0.75
ggacaactcgatcgacaccgacaactcc CRISPR spacer ctcggtctcgatcgacaccgacacctcc Protospacer . ***************** ****
23. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to NZ_CP015236 (Rhodococcus fascians D188 plasmid pFiD188, complete sequence) position: , mismatch: 7, identity: 0.75
ggacaactcgatcgacaccgacaactcc CRISPR spacer ctcggtctcgatcgacaccgacacctcc Protospacer . ***************** ****
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
2067539 : 2072961
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP047045|2067539:2072961|DBSCAN-SWA TTCATCGCGGCAGCTCAGCGAAATGCCGCTTTAGCGCAATCATCGCCAACGCCGCACGCGCGCTCTCGCCGCCCTTGTCGCCACGCCCCGGGTCGGCGCGCTCTTCGGCCTGCGCCAGCGTGTTCACAGTGAGAATGCCGTAGCCGATCGCGAGCCCTTCGCGCACCGTTAGATCCATCAATCCGCGCGCGTTTTCACCGCAGACGTAATCGTAATGGCTGGTCTCACCACGCACGACGCAGCCGAGCGCGAGATAGCCGTCCCAGCTTTCCGTTTTGGCCGCCAACGCAATCGCCGCCGGAATCTCAAATGCACCCGGCACGAACACGCGATCTAGCGTCGCTTTCGCGTCTTCCGCCGCCGCTTCCGCGCCGCGCTGCATCATGTCGGCGACGCCATTGTAGTAGGGAGAAATCACCATCAGCAGCCGCGCGCCAGGCACGCTGGCGACCGGGAATTTTTCAACGTCGGTTTTCACGACTTGAATCCGCGCCAACCGTCGATCGACAGCCCATAGCCTTCCAGCCCCACCAGCTTCTGCGGCGCGCCCGCCAGCACCGTCATACGCCCGACCCCTAGTTCGCGCAGGATCTGCGACCCCAAGCCGATCACGCGGCGCATGTCATCTTCCTCGAACTCTAGCGGCTCACCCGACAAACGACGCGACAACGCATCCGGCACCAAGTCGCGCAACAGCACCACAACGCCTGCGCCTTCCGCTTCGATCTCTGCCACCGCGCGCTCGACCAACCCAGCGCGCGCGCCATAGCCGCCCATCACGTCAGCAGCGAAATCAATGCGATGCACGCGAACGAGCGACGACTTATCCGCCCCGATCTCGCCCTTCACCAGAGCAGCGTGCTCAACGCCATCCAGCTTGTTGCGAAACACGACGAGCTTGAAGTCCTTGCCGCCGCGCACCTCAAACGGCGCCTCGGCAACCTTCTCAACCAAGCGCTCTTGCACGCGGCGGTACGCGATCAGTTCATCGATCGCGCCAATCTTCAGATTATGGAACTGCGCGAACGCGACCAGCTCAGGCAGCCGCGCCATGGTCCCGTCGTCGTTCATAATCTCGCAGATCACCGCCGACGGGTTCAGCCCCGCCGCACGAGAAATATCGACGCTCGCTTCCGTATGCCCAGCGCGAACCAAAGTGCCGCCCTCACGCGCCGTCAGCGGAAACACGTGGCCAGGCGAAACGATATCGCGACGGTCCTTCGTTGGATCGATCGCGGTTGCGATGGTTAGCGCACGATCGTGCGCGGAGATGCCGGTGGACACGCCCTCACGCGCCTCGATCGATACCGTGAAGGCCGTACCCATCCGCGTCTGATTGCGCGGCGTCATCGGATCGAGCCCCAGCGTCTCCGCCCGCTCCGGCGTCAGCGCCAGGCAGATCAGGCCACGCCCGTGCTTGGCCATAAAGTTCACTTGCGCCGGCGTCGCGAACTGCGCGGGGATGACGAGATCGCCTTCGTTCTCGCGATCCTCTTCGTCGATCAGGATGAACATGCGCCCCTCGCGCGCTTCCTCGATAATCTCGGGGACCGACGAGATCGCGCGCTTGAACTCAGAAATGGCAGACGGCTTGGCCGTCATGTGTTTCTCCGCGCTTCGATGATACGCGCCGCATAGCGCGCCATCATGTCCGCTTCCAGGTTCACCTTATCGCCCGCTTTGAGCTTCGACAAAGTCGTAACGCTCCAGGTATGCGGAATGATCAGCACGCCGAAGCCCCGATCATCGACTTCGTTCACCGTCAGCGACACGCCAGCAATCGCAATCGAGCCCTTGGACGCGATGAGTGGCGAAATCTCATGAGGCGGCTTGATCTGAATCCGCCAGCCCTCGCCGTCTTGGCTGATCGAGAGCACTTCGCCTAAACCGTCGACGTGCCCCAGCACCATGTGCCCGCCCAATTCGTCGCCGACGCGAAGCGAGCGCTCCAGATTCACCTTGTCGCCTTCGGCAAGCGAACCCAGCGTGGTCAGCGCCAAACTCTCCGCCGCCACTTCCACGACGTGCCTCATGCCGCCTTGCTGCCCGCTCGTCTCCACCACCGTGAGACAGCAACCATCGTGCGAAATGCTAGCGCCGATCTCCACGCCCGCGGGATCGTACGAACTCGCAATTGTGAGGCGCACGAGACCAGGCAAGCGCTCGACCGCCTTCACCTCTCCAAGCGCGCTGACAATCCCAGTGAACATGAATCTCTCTTACGAGGCCCGCTCGTAGCTTTCCCACACATCGGGGCCAAGCTCGCGCAATGCAACGCGGGTGAGGCGTGGCGCATCCGCAAGCCTGTCCAGCGCCAATGCCGCGACGGCAGGGCGGCCTTCCTCGCCCAGCACCATCGGCGCCCGGAACCACTCAAGCCGATCCACCAGCCCGCCCTGGATCAAACTAGCGGCCAACTTGCCGCCACCCTCGACCAGCACCCGCTGCACGCCGTCAGCCGCGAGCTGCAACAGCGCCTCAGACGCATCCACGCCGCCAGCACCGCGCGGCACCACCGCGACACGCGCGCCAGCCGATTGCAGCCCAGCACGACGTTCGAACTCCGCATCCTCCGCACCGATCACCAGCAGCGGCGCTTGGGCGAGCGTCTCGAACAAGCGCCCCGCCGGCGGCAGATTGAGATTGCTGTCCAGCACCACGCGCAACGGTTGCTTCGCCGGCGGTGGATCGGTGCGCGCCAACAGCTCCGGATCGTCCGCCCAGGCCGTGCCGGCGCCGATCATCACCGCATCATGGTTCGCACGGAGCTTCTGCACTTCTGCGCGCGCCAGCTCGCCCGTGATCCAACGGCTCTCGCCCGACGCGGTCGCAATGCGCCCGTCGAGCGAAGTCGCAAGTTTCAAAGTGACGTAAGGGCGAACCATGCTCGCAATATGGACCGCCGGCGCCCTCGCCGGCCAGCGCCGAAGCATGCCGGCGAGGCGCCGGCGGTCCATATTACCCGCGCTTGGGCGACTTCTCGGCTTCTTCGATGAAGTCCGCGAAGTCCGACGCTTCGCTGAAATCACGATAGACCGACGCGTAGCGGACGTAGGCCACGGTATCGAGGTTGCGTAGCGCCTCCATCACCATGCCGCCGATCACCTCCGACGTGATCTCGCTCTCGCCCTGGCTCTCAAGCTTGCGAACAATACCGGATATCATCTGTTCGATCTGATCCTGCTCGATCGGGCGCTTCCGGAGCGCGATGGAGAGCGAGCGCGCCAGCTTGTCGCGATCGAACGGCACGCGCCGGCCCGAGCGCTTCAGCACCACCAGATCGCGCAATTGCACCCGCTCGAACGTCGTGAAACGCGAGCCGCACTTGTCGCATTGGCGCCGGCGCCGGATCGCGGCGCCGTCCTCAGTCGGACGGCTGTCCTTCACCTGGGTGTCTTCGTTCTGACAAAACGGGCAGCGCATTCGGCGGACTCCCGGCGGGCAGCCGAATCAGCGCCGCGCCGCCTCAAGTGTAGATGGGAAACCGCGCCGTGAGTTGTTCGACCTTGGCGGCAACTGCAGCTTCGACCGAGGAATTGTCCCCGTTCGTTCCCTGCAAGCCGTCCAAAACTTCACAGATCAGGTCCGCCACTTGGCGGAATTCCGCCGGGCCAAACCCGCGCGTCGTGCACGCGGGTGAGCCCAACCGGATGCCGCTCGTAACAAACGGCTTTTCCGGATCGAACGGGATGCCGTTCTTATTGGTGGTGATCGACGCGTTCTCCAGCGAATGCTCGGCCACCTTACCGGTGGCGCGCTTCGGCCGCAGATCGACCAGCATGACGTGGCTGTCGGTGCCGCCCGAGACGACCGCGAGGCCGTTCTCGATCAGCCGCGCCGCCAACGCGCGCGCGTTTTCGAGCGTGCGTTGGGCATAGAGCTTGAATTCCGGCTGCAGCGCTTCGCCGAAGCTCACGGCCTTCGCCGCAATGACATGCATCAGCGGCCCGCCCTGCAAGCCAGGGAAAACCGCTGAGTTGATGCGTTTGCCGATCTCTTCGTCGTTCGAGAGGATCATGCCGCCGCGCGGGCCCCGCAACGTTTTGTGGGTCGTCGTCGTGACCACGTGCGCATGCGGCAGAGGAGACGGATAGACGCCGCCCGCAATGAGCCCCGCATAGTGGGCCATATCCACCATCAGGATCGCGCCGATTTCATCGGCAATTTCGCGGAAGCGCTTGAAATCGATGTGCCGCGAATAGGCGCTCGCGCCCGCCAGGATCAGCTTCGGCTTCTCGCGATGCGCAATTTCGCGGACCTCATCCATGTCGATCAGATGATCGTCCGGCCGCACGCCGTACGCGACAGGATTGAACCACTTGCCCGAGATGTTCACCGGCGAGCCGTGCGTCAGGTGACCCCCGCACGCCAGATCCATGCCCAGGAACTTATCGCCCGGCTGCAGCAGCGCGAAGAACACAGCTTGGTTCGCGTTCGCGCCCGAGTGCGGCTGCACGTTGGCGAATTTGCAATCGAACAGACGCTTGGCCCGGTCGATGGCGAGGCGCTCGACTTCATCCACGAACTCGCAGCCACCATAATAGCGGCGGCCCGGATAGCCCTCCGCGTACTTGTTCGTAAGAACCGAGCCTTGCGCCTCCAACACCGCCTTCGAGACGATGTTCTCGGACGCAATCAGCTCGAGCTGGCGCTGTTGGCGCTGCAGCTCGGCGTCGATCGACGCCTTGACGTCATGATCCACGCGCTCGAGCGCCGCCTCGAAGAAACGGTCGCGTTCACCGGCGGAGCGTCCCGCGGCCTGCATCATGCCAGTTCTCACCATTCGTCTGGGTCCGCCGAGATTCGGCTCGCCCATCATAATTCTCCGCACCGATGTAAGCAAAGCGCGGGATTTCCCAAGGGTGAGGCGCGACTTCTGTAAATATCGCGAAGCTAAACGGGTTAGACCGATAGTAATACTTGCCAGAAATACCGCAAACCGCGTATAGCGGCGCTGAGGGCATTGTATTAGCTAGGAAATTCGCTTGGCTATACGAGATGAAGACGATCAGGATATCGCCGCAAGCGGCGATGCGCTGCGCATGACCGCAGATATCGTCGCCTCCTTTGTCAGCAACAACAAATGCTCTTCTGACGAACTGAGCGAGATCATCCGCTCAGTTCATAAGGCCGTTACAGGATTGTCGGTTTCGAACGGAGCGGCGCCCGCGGAAAGGCCCAAACCGGCCGCGCCAATCGGAAAATCCGTCCACAACGACTACATCATCTGTCTCGAAGACGGCAAAAGGCTGAAAATGCTGAAGCGCTATTTGCGTTCGACTTACGGCATGTCGCCGGATGACTACCGCAAGCGCTGGGGTCTGCCCGCCGATTATCCGATGGTCGCGCCCTCGTACGCCGCACGCCGCTCCGAGTTCGCCAAGAAGATCGGCCTCGGCAAGGGCGTGCGCCGCAAAGACTAG
Protein sequences of DBSCAN-SWA_1 >NZ_CP047045|2067539:2072961|2071004_2072303_-|WP_158768067.1|DBSCAN-SWA MQAAGRSAGERDRFFEAALERVDHDVKASIDAELQRQQRQLELIASENIVSKAVLEAQGSVLTNKYAEGYPGRRYYGGCEFVDEVERLAIDRAKRLFDCKFANVQPHSGANANQAVFFALLQPGDKFLGMDLACGGHLTHGSPVNISGKWFNPVAYGVRPDDHLIDMDEVREIAHREKPKLILAGASAYSRHIDFKRFREIADEIGAILMVDMAHYAGLIAGGVYPSPLPHAHVVTTTTHKTLRGPRGGMILSNDEEIGKRINSAVFPGLQGGPLMHVIAAKAVSFGEALQPEFKLYAQRTLENARALAARLIENGLAVVSGGTDSHVMLVDLRPKRATGKVAEHSLENASITTNKNGIPFDPEKPFVTSGIRLGSPACTTRGFGPAEFRQVADLICEVLDGLQGTNGDNSSVEAAVAAKVEQLTARFPIYT >NZ_CP047045|2067539:2072961|2072580_2072961_+|WP_158768068.1|DBSCAN-SWA MTADIVASFVSNNKCSSDELSEIIRSVHKAVTGLSVSNGAAPAERPKPAAPIGKSVHNDYIICLEDGKRLKMLKRYLRSTYGMSPDDYRKRWGLPADYPMVAPSYAARRSEFAKKIGLGKGVRRKD >NZ_CP047045|2067539:2072961|2067539_2067959_-|WP_158768066.1|DBSCAN-SWA MVISPYYNGVADMMQRGAEAAAEDAKATLDRVFVPGAFEIPAAIALAAKTESWDGYLALGCVVRGETSHYDYVCGENARGLMDLTVREGLAIGYGILTVNTLAQAEERADPGRGDKGGESARAALAMIALKRHFAELPR >NZ_CP047045|2067539:2072961|2070496_2070961_-|WP_158766166.1|DBSCAN-SWA MRCPFCQNEDTQVKDSRPTEDGAAIRRRRQCDKCGSRFTTFERVQLRDLVVLKRSGRRVPFDRDKLARSLSIALRKRPIEQDQIEQMISGIVRKLESQGESEITSEVIGGMVMEALRNLDTVAYVRYASVYRDFSEASDFADFIEEAEKSPKRG >NZ_CP047045|2067539:2072961|2069757_2070423_-|WP_158766165.1|DBSCAN-SWA MVRPYVTLKLATSLDGRIATASGESRWITGELARAEVQKLRANHDAVMIGAGTAWADDPELLARTDPPPAKQPLRVVLDSNLNLPPAGRLFETLAQAPLLVIGAEDAEFERRAGLQSAGARVAVVPRGAGGVDASEALLQLAADGVQRVLVEGGGKLAASLIQGGLVDRLEWFRAPMVLGEEGRPAVAALALDRLADAPRLTRVALRELGPDVWESYERAS >NZ_CP047045|2067539:2072961|2068012_2069140_-|WP_158766163.1|DBSCAN-SWA MTAKPSAISEFKRAISSVPEIIEEAREGRMFILIDEEDRENEGDLVIPAQFATPAQVNFMAKHGRGLICLALTPERAETLGLDPMTPRNQTRMGTAFTVSIEAREGVSTGISAHDRALTIATAIDPTKDRRDIVSPGHVFPLTAREGGTLVRAGHTEASVDISRAAGLNPSAVICEIMNDDGTMARLPELVAFAQFHNLKIGAIDELIAYRRVQERLVEKVAEAPFEVRGGKDFKLVVFRNKLDGVEHAALVKGEIGADKSSLVRVHRIDFAADVMGGYGARAGLVERAVAEIEAEGAGVVVLLRDLVPDALSRRLSGEPLEFEEDDMRRVIGLGSQILRELGVGRMTVLAGAPQKLVGLEGYGLSIDGWRGFKS >NZ_CP047045|2067539:2072961|2069136_2069748_-|WP_158766164.1|DBSCAN-SWA MFTGIVSALGEVKAVERLPGLVRLTIASSYDPAGVEIGASISHDGCCLTVVETSGQQGGMRHVVEVAAESLALTTLGSLAEGDKVNLERSLRVGDELGGHMVLGHVDGLGEVLSISQDGEGWRIQIKPPHEISPLIASKGSIAIAGVSLTVNEVDDRGFGVLIIPHTWSVTTLSKLKAGDKVNLEADMMARYAARIIEARRNT |
7 | Staphylococcus_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
2105361 : 2118402
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP047045|2105361:2118402|DBSCAN-SWA ATCAAGAGCCTCTTGGTTTGGCGCGACGCGTCGGCGGCGCGCTCAGGGGATCTTCGGGCCAGGGGTGCTTTGGGTAACGGCCGCGCATTTCGGCGCGCACCGCGGCGTAAGAGCCGCGCCAGAAGCCTGGCAAGTCCTTCGTCGTTTGCACCGGCCGATGCGCTGGCGAAAGCAGGCGCAACGTCAACGGCACGCGGCCGCCGGCGATTGTGGGATGCTTGTCCATACCAAACAGCTCTTGCAGGCGAACTTCGAGCGCCGGTCCGCCCTCGGCTTCGTAGTCAATTGCGAGGGAGGAGCCAGCAGGCGTCTCAAAGCGCGCCGGCGCCTCAGCATCGAGCCTGCGGCGCTCCTCGTAGTCCAGCGTGTTTTGAAGCGCCCGCGCAACATCTACATCACCGAGACGCGACGCTAGCGCCGGCGCAAGCCAATCATCGAGCTTGGCGCTCAGCGCGTCGTCAGACCAATCCGGCCACGCCTCACCCTCCAGCGACAGCATCAGCGCCACGCGCGCCCGCACCTGCTTCGCGGCGTCATCCCAATCCAGCAGCGCCAGGCCATCGTCGCGCACAGCATCGAGCAGCGCCTGCTTCATCTCGTCGGCGTTCAGTTTCTCAAGCGGCGCTTCGGAAAGAACCACGCGGCCCAACCGCCGCGTCCGCCGTCCACGCACGGCGCCGGTTGCCGCGTCAACGCTGACTGCTGCACGCGTTTCGATCTGAGCGGAGAAGGCCTCTTCAACCTCTTGCGCGGAGATCGCGGCAAACGACAGAATTCTGCTCTTGCCCGCTGTGCCTGTGGTTTCACCGATGACAATGAACGGCGCGGAGGCCATCGGCTCGGACATGTCCATGCTAGCCGCCCGCCCATTGACCATAAGGAACGCGCCGCCGCCGCGCGATTTCGCGACGCGGTCTGGAAAAGCGCGCGCGAGCACCCGACCAGACGCAAGTTCATCGTTGCGACCAGTTCCGCCGCCTGCGGACAGTGAGATGCGGCGGGCTAAACTTCGCGCGGCATCGGCGCGTTTTCCTTTGTCCCACTCGAGTTGATTCACGCGATCAGCCAAATCGGGGCTGCGTCCTCCTAGACCCTGCTCAGTCAGCAGCACCGCAAGAGTCGCGGCCAACCACGTCTCGCCGAATTGTTTGGACTCAATGACCATGTGCGCCAAGCGTGGCGGCAACGGCAGCGTCGCTATCGCCGCGCCATGCTCCGTGAGCCGCCCAGCCTCATCCAACGCCCCGAGCCGCGTCAGCAGCGCAATCGCTTCGTTCCACGCCGGCTTCGGCGGCGGATCGAGCCACGCCAAGGTCGTCGGGTCGCCCACACCCCACGCCGCCAAGTCGAGCGCCAAGCCCGAGAGATCAGCATCGAGAATTTCTGGTCGCTCAAACGCCGGCAGCGAACGCGTTTCACCCTCGCTCCACAAGCGCCAGCAAACCCCCGGCTCCAAGCGCCCCGCGCGCCCAGCCCGTTGCGTGATGGCGGCCTGGCTCGCTCGCACTGTTTCGAGACGCGAGAGGCCTAACGCCGGTTCGTAGCGCGGTCGCCGCGCCAATCCAGCGTCGACGACGATCCGCACGCCGTCGATGGTCAAGCTGGTCTCAGCAATCGACGTCGCCAGCACCACCTTGCGCCGTCCCGCTGGCGCAGGAGAGATGGCAGCGTCCTGATCCGCCGGACTCATGGCGCCATAGAGCGGCCGGATGTCCACGTTCGGATCGCGCAGATCGGTGCGGAGCCGCTCTGCCGTCCGTTCGATCTCCCGCACGCCCGGCAGAAACACAAGTGCGCTGCCCGACTCCGCCGCCAGCGCAGCGCGCACAGCGTTCGCGGTTTCCTGCTCCAGCAGCGCGCGCGGATCGCGTGGACGATAGACATGCGTGACCGGAAACCTCCGGCCTTCGCTCACAATAACCGGCGCATCACCCAACAGCGCCGCGACACGCGCGGCGTCTAACGTCGCTGACATCACCACGATCTTGAGATCAGGCCGTAGCGCCGATTGCGTGTCACGCGCGAGGGCTAAGCCAAGATCACCCTCCAAACTGCGCTCGTGAAATTCATCAAACAGCACGGCCGCAACACCGCGCAGCTCCGGATCGTCCGAGATCATCCGCGTGAACACGCCTTCGGTGACAACCTCGATCCGCGTCTTCGGCCCGACGCGCGAATCCAGCCGCACGCGATAACCGATCGTACCGCCCACGGCCTCGCCGCGTTCGCTCGCCATCCGCGCCGCCGCGGCACGTGCCGCGATCCGGCGCGGCGAGAGCAAGATGATTTTCCCTGTCCGCGCCCACGGCTCATCCAACAGAGCCAGAGGCACACGCGTCGTCTTGCCCGCGCCGGGCGGGGCGACCAGCACGGCCTCGCCGCGTTCGGCCAGCGCCGCGCGCAAAGTCGGAAGAACGTCCTCAACCGGAAGCATGGGGCGGGATTAACCCGCCGGGATGGCCATGTCTCGCCACATTGGCGTGGCTTGGTCGCCCGGCTATAGGGCTGACTGGAACGAGCTGGGAGCGGAACATGCGGGTGGCGATTTTTGGTCGGGTGGCGGCAGCGGCCATCCTCGCGATCGGTTTGGCGGCGTGCTCGTCTGGCCGCACGGCAGACACCCGCCGCGGCGTCGGCGACGCGGCTTACATCCCGCTGCGGGACGTCGGCTTGATGCGCCCGGAAATCCCGCTGCTGCTGCGCAATTTGCAATATCCCTATTCCACCGCGACCTTGGCCGATTGCCACGCCGTCACGCGCGAGATCGCCGCGCTCGATGGCGTGCTTGGCCCGGAGAGCTATCAGCCTGGCCCCAACCGCAACGTCTGGGACCGCAGCGGCGACTTCGTCGAAGAACAAGCCATCTCCGCCGCCGAAAGCACCGCGCAAGACCTGATCCCGTTCCGCTCATGGGTGCGGCGCATTTCGGGCGCCAGCCGCGCGGAACGCGACGCGCTGCGCGCTGTCGCCAACGGCCAGCAGCGGCGCACGTTCTTGCGCGGCTATGGCGCCTCGCTCGGTTGCCCCGGCATGATCCCGCCGCCGCCGCCCAACGTTGCACGCCAGCGTTAGTCTTCGACCTCGGGCGTAAGACGTGCGCGAAACGCGGCACGGAAGTCATCGAGATTCTGCGCGGCGATGGAGTCACGCAGGCGCTGCATCAAATGCTGGTAGTAGGCGACGTTGTGCCACGACAGCAGTACTTGGCCGAGTAGCTCACCGGCTTTGAACAAATGGTGCAAATAGGCTTTGGAATAGTCACGGCTGGCTGGGCAATCGATGCTCTCATCCACCGGCGTCAGGTCTTCCGCAAAGCGCGCGTTCTTGATGTTGAGCGGCCCGGCGTCGGTCCACGCTTGGCCATGCCGGCCGGAGCGCGTCGGCAGAACGCAATCGAACATATCAATCCCGCGCGCCACGCTTTCAACGATGTCTATCGGCTTGCCGACGCCCATCAGGTAACGCGGCCGTTCCTGCGGCAGACACGGCGTGGTGACGTCGAGCGTCGCGAGCATGGCGTCATGGCCCTCGCCGACCGCGAGGCCGCCAATCGCGTACCCATCAAAGCCCTCGCTCACCAACGACTCCGCGGACTGTCGGCGCAACACGGCGTCCGTGCCGCCCTGCACGATCGCAAACAAATTCTGCGCATCACGCGCACCGAACGCACGCTTGCAACGCTCGGCCCAACGCAGGGATAGCTCCATCGCTTTCTGCACATCCGCGCGCGGCGCCGGCAGCGCGGGACATTCGTCCAGCTGCATAACGATGTCGGCGCCGATCTGATCGGCTTGAATTTGAATCGCGACCTCCGGCGTCAGCCGATGCTGTGCGCCGTCGACATGGCTGCGGAATGTCACGCCCTCAGCGTCGAGCTTCCGCAGCTGAGAGAGCGACATCACCTGGTACCCGCCACTGTCGGTGAGGATCGGTTTGTCCCAGCGCATGAAGCGATGCAGCCCGCCAAGTCGCGCCATGCGCTCGCCGCCCGGACGCAGCATCAGGTGATAGGTGTTGCCGAGAATGATGTCTGAGCCGGTCGAAGCCACTTGCTCCACCGTCAGCGCCTTCACGGTACCCGCGGTCCCCACCGGCATGAACGCAGGCGTGCGGACTTCGCCGCGGGGCGTGCGCAGCACACCGATGCGCGCCGCGCCGTCGCGCGCGTGAATCTCAAACGGAAAGCCGCTCATTCCGCTCGCCATAACAGGCTGGCGTCGCCGTAGGAATAGAAGCGGTAGCCGTTAGCTATCGCATGCGCATACCCCGCCTGTATGCGTTCAAGCCCAGCGAACGCCGACACCAGCATGAACAGCGTCGAGCGCGGCAGGTGGAAGTTGGTCCAAAGTCCGTCGACCACGCGAAAGCGATAGCCCGGCGTGATGAAGATATCGGTGCCGCCGGAGACCGCGCTGACGCGTCCGCCCTCATCCGACGCGGATTCCAGCGAACGCAGTGCGGTGGTGCCGATCGCGATAACGCGCCGGCCCTCGGCTTTGGCGCGATTGACCGCGTCCGCCGTTGCTGCACTGATTTCGCGCCACTCCGAATGCATGGTGTGCGCCTCGGTATCTTCGGCCTTCACAGGCAGAAAAGTGCCGGCGCCAACGTGCAGCGTCACGCGCGCACTCTCTACGCCAGCATCCTCCAGAGCGAGGAGCAATTCGTCCGTGAAGTGCAAACCAGCGGTCGGCGCCGCGACCGCGCCGTCGTTTGCCGCGAACACGGTCTGATAGTCGCGCGCGTCTTCGGCGTCCGGCGCGCGCTTTCCCGCGATGTACGGCGGAAGCGGCGGCGCGCCGACCAGCGCGATCGTCGCGTCCAGCGCGGCGCCCGAGCGATCGAAGCAAAGATCGACGGCGCCTTCGTCGTGTTTCGCCTGGATTGTCGCTTTTAGGCCGTCCTCGAACAGCAATAGGTCCCCCGGCTTCAATCGCTTCCCCGGCTTGGCCAGGGCGCGCCAGCGATCTGGCGCGAGACGCTCAATCAAATTGAGCGAGACCTGGACATCCGGCGCCTGCGCATCCCGGCTCGGTCGCACGGCTCTCAACGCGGCCGCGATTACACGTGTGTCGTTGAACACCAGAAGATCGCCCGGCCTAAACAAGCCCAGCGCGTCGCGCACAGTTCGATCCTCAAACCCGCCCTCAGCCGGCACGAGCAACAGCCGCGCGCTATCGCGCGGCCGTGCGGGGCGGAGCGCAATCAGCTCTTCGGGTAGCTGGAAATCAAAGAGGTCAACGCGCATGTCATGGACCGCCGGCGCCCACGCCGGCTAATGCTGCGTTATGCCGGCGAGGGTGCCGGCGGTCCATGTAACTCACTCGGCCCGCGCGCTCACGATCTTGCCGGGCGTGCGCGGCGGCTCGCCCTTCGGCAGCGCGTCGATGGCCTCCATGCCTTCCTCGACCACGCCCCAGACGGTGTATTGCTTGTCGAGGAAACGCGCGTCGTCGAGGCAGATGAAGAACTGGCTGTTGGCCGAGTTCGGATCGTTGGTGCGCGCCATCGAGCAGACGCCGCGCACATGCGCCTCGGCGGAAAATTCCTGCTTCAGGTTCGGCTTCTTCGAACCGCCGGTGCCGTTCTTGTTTCCGCCGTCGCCGCCTTGCGCCATGAAGCCGGCGATGACGCGGTGAAACGGCACGCCGTTGTAGAAGCCTTCATTGGCGAGTTCGCCGATGCGCTGCACATGGCCCGGCGCCAGATCGGGCCGCAGCTTGATCTGCACCGTGCCGGTGTCGAGCTCGAGCGTCAGTCTATGAAAGGAATTATCGACCACCAGCGTCTCTCTCTGTTTTCAGTGCGCGGAACCCGCGCGGGAATTTGCACATCGCACACCGAGAAATCGGCTGACCGTTCGCGGCGCGTATCGTCAATCAAATCACGGAACGCCGGGCCATCGGTGCGCAGCACGTATATAGGCGCGCGCTCGGTCTCGGGCACGTCAGCCGCGATCCGCACCGCGACCATGCGGTCCGGCGTGGGCGGGGGGTTGCCAACGGCAAGCGCCATCACGGCGTTCTGGCCCCACACAACGCGGCCGAAGATCGAATAGCGCTTGTCGAGATCGGTGTTCGGCGCACGCATGATGAAAAATTGGCTGTTGGCCGAGTTCGGGTTATCCGCCCGCGCCATCGAGACGACGCCCGGGCAGTGCAGCGCATTGGCGGCAACACGGCCGTCAGCGGTCACCGCCATTTGCGCATCCGGCTGGCTCTCGATCGGGATCGCTTTGTAAAACCCGAGCCGCGCGCCGCCTTGGTTGGCGGCTTCCACAAACGGCATGTCAGGGCCACGGCGGAAGCGGAACTCCTCGCGTATATCCGGCAAGCTCGATGCACCTTCGCCGGTGCCGAGCGGATCGCCGGTCTGCGCCATGAAGCCATCGACGACGCGGTGGAACATGAGGCCGTCGTAGTAGCCCGCCCGCGTCAACGTCTCGAGGCGCTCGACGTGGCGCGGCGCGATCTCCGGATACATCTCGATCACGATGCGGCCATGGACCGTGTCGATGTAGAGCGTGTGCTCCGGATCGACCTGACGCCAATTGCGCGGATCGGGCGCCGATTGCGCCGCCGCGTCCATACTCAGAAGCGAGAGCGCCACAGCGCCCGCCGCCAAAATTCCACATACGCGCAGCATCAAGACACCCCGAACTTTGCCTTCAGTCGTCTTTCTACTTCTTGAGGCACGAAGTGCGCGACGCTTCCTTGCAGTCGCGCGATTTCTTTCACCAACCTGGACGCGATCGCTTGGTGCGTCGGGTCGGCCATGAGAAATACTGTTTCGATGGCTGGGTTTAGCTTCTGGTTCATCCCAACCATGGCGAATTCGTACTCGAAGTCGGCGACCGCTCTAAGTCCCCGGACGATGATTTGCGCCCCTACGCTTTCGGCGAATGACATCAGCAATGTGTCAAAAGGTTGGATGACGATCTCGGCTTCGCCTTTGATCTTGGCGCACTGCTCCTGAACCATCTCCACCCGCTCCTGGAGGGTGAAGAGCGGGCCTTTGTCCTCATTGATAGCCACGCCGATGACGAGGCGGTCCACCAAAGTGGCGGCGCGGGCAATGATATCCACATGGCCGTTGGTCGGGGGATCGAAGGTCCCGGGGTAAAGGCCAACGCTGACTCTTTTGCTCACTCGCCGCCCTCCTCAGACGCGCCCTCAACAGCTTCGCCCGCCTCGCCGCCCAGCGCCTCTAGGCGCTCGGCCGCGACGACGTGCTCGTCTTCGCTGACGTTGATCAGGATCACGCCTTGGGTCGCACGTCCAGCGATGCGGATCTGATCGATCGCGGTGCGGATGAGCTGGCCCTTGTCGGTGACCAGCAGCAGCTCCTCGTCCGTCTTAACCGGGAACGACGCAATCAACCGCGCCTCACCCGTGAGCTTGTGCGCCCGCAGTCCTTTGCCGCCGCGACCGGTGATCCGGTATTCGTACGACGAAGCACGCTTGCCGAGGCCGCCGGACGTCATCGTCAGGATGAACTCTTCTTGCGTCTTCAACCACGCCAGGCGCTCGAACGGCACATCCGCGACTTCACCGGTGTCCTCACCAGCTTCCGCGTCGGCCTCCACGTCAGCGTCGCCCGCGTCACGCGATTGCGCGTCCCACTTAAAGTAGGCGCGCGCTTCCGCTGGCGTCACCTCTATGTGGCGCAACACGCCCATCCCGATGACGCTATCGCCTTCGGCCAGATTGATGCCGCGGTTGCCAGTCGAGGTGCGCCCTGAGAACACGCGCACATCCGTTGCCGGGAAGCGGATGCACTGAGCATTTTGCGTCGTGAGCAGGATGTCGTCCGCCTCCGAGCACACCTGCACCGACACGATGCCGTCGCCCTCATCCGGCTTCATGGCGATCTTGCCGTTGCGGTTCACCTGCACGAAGTCGCTGAGCTTGTTACGCCGAACACCGCCGGAGCGGGTTGCGAACATCACGTCGAGCTTATCCCAATCGCGCTCATCCTCCGGCAGCGCCATCACCGATGTGATGGTCTCACCGGCGCCGAGCGGAAAAAGATTGACCAACGCCCTGCCCTTTGTGCGCGGATCACCCGCTGGCAAGCGCCACGTCTTCATCTTGTAGACCATGCCCGACGACGAGAAGAACAGGATCGGCGCGTGCGTGGAGGCCACGAAGATGCGCGTGACGAAATCCTCGTCCTTCATCGTCATGCCGGAGCGACCGCGCCCGCCGCGTCGCTGCGTGCGATAGGCTGCCAGCGGCGTGCGCTTCACGTAGCCGCCATGCGTAACGGTAACGACCATGTCTTCGCGCGGGATCAAAGCCTCGTCGTCGATGTCGCCTTCGGACTCAGAGAATTGCGTGCGGCGCACAAAGCCGAACGCTTCCTTCACGCCCGTTAGTTCGTCGCGGATTATCGTCAGGATACGTTCGCGGGAGCCGAGGATCTCAAGCAGTTCCTTGATCTCTTCCGCGATCTCGTTCGCCGCCTTCGCGATGTCATCGCGGCCAAGGCCGGTGAGGCGCGAGAGCTGTAATGCCAGAATGGCGCGCGCTTGTTCGTCGGACAGCCGGATCGCATTGCCATCTTCAACAATCGTGCGCGGGTCAGCGATCAGCGCGATCAGCGGCATCATATCGCCAGCCGGCCAAGCCCGCGCCACCAAGCGCTCGCGCGCCGTGCCCGGATCGGGGCTTTCGCGGATCAGCTTGATCACGTCGTCGATGTTGGCGACAGCAACGGCGAGACCCACCGTCTCGTGACCGCGCTTCCGCGCTTTGGCCAAGCGAAACTTAGTGCGGCGCGTGATGACCTGTTCGCGGAAGCTCAAGAAGTGGATGAGCATCTGCCGCAAGCCCATCTGCTCGGGCTTGCCGTTGTTCAGCGCCAGCATGTTGACGCCGAACGAGGTCTGCAGCTGTGAGAAGCGATAGAGCTGGTTCAGCACCACATCGGCCACCGCATCGCGCTTCAGCTCGACAACGAGGCGGATACCCAAGCGGTTCGATTCATCGCGCACTTCGGCGATGCCTTCGATGCGCTTCTCACGCACCTGTTCGCCGATTTTCTCGACCATCTCGGCCTTGTTCACCTGGTACGGAATCTCGGTGAACACCAAAGCTTCACGACCACGGATGGTTTCGGTGTGATGCTTGGAACGGATCGTCACTGAACCGCGGCCCGTCAGCAGCCCCATGCGCGCGCCGGCGCGGCCGAGGATTTCGCCGCCGGTCGGAAAGTCCGGCCCAGGCACAATGTCGAGCAATTCGAGATCGGTGATCGCCGGATTATCGACCATCGCGACAGCCGCATCGATCACTTCGGACAAATTGTGCGGTGGAATATTCGTCGCCATGCCGACGGCAATGCCGCCTGCGCCGTTCACAAGCAGGTTCGGATACTTCGCCGGCAGCGCGCTCGGCTCTTGGCGCGACCCATCATAGTTGGCGACGAAATCAACCGTGTCTTCGTCGATGTCGTCCAACAGCGCGCGCGCCGTCTTGGCCAAGCGGCTTTCGGTGTAGCGCATCGCCGCTGGCATATCGCCGTCGATCGAGCCGAAGTTTCCCTGCCCGTCGATCAGCGGCAGGCTCATGGCGAAGTCCTGCGCCATGCGGACGAGCGAGAGATAGATCGCCTGGTCGCCGTGCGGGTGATAGCGGCCCATCACTTCGCCGACCGTGTTGGCGCTCTTGCGGTAGCTCTTGTCGTGCGTGTTGCCGGCCTCATCCATGCCGAACAGGATGCGCCGGTGCACCGGCTTCAAGCCGTCGCGCACGTCTGGCAGCGCGCGCGAGACGATGACGCTCATCGCGTAATCGAGGTAAGAGCGCTTCAGCTCATCCTCGACGCTGATCAGCGCGATGCCTTTGGGAAGAGTTGGTCCGCCGTTCTCGGCGGGATCGGACGTGTCTGACACGAGATCGCCGGTTCTAAATCTGCTGTTCAGGAGTTGATGAAAACAGCGAGAAAAACTCGCCTTCGATTCCATCGGAAACTAACATTGTGACGCCGCCGTCGCAAACCAAGGAGCCCCTGATGCGCCCTCTCGCCGCCCTCCTTTTTGCAGCCCTGCTGACCGCCTGCGCCAGCGGCGCGGGCCGCAGCGACGACTATCCCCAGGCTACGCCGCGAACGGTCTGGATTGTCGGTCCGGATGGGCGGGCCCTAGGCCAGGCCAATTTCACCGGCGGACCACACGGGGTGTTGATTCAACTCGAATTTTCAGAGCGCGCTTTGCCCCCTGGCTGGCACGGCCTGCACCTGCACGATCGCGGCGACTGCAGCGACTTCGCCGCCGGCTTCCAAGCCTCCGGCGGCCACCTCGGCATGAACCGCCGTATCCAGCACGGCCTGATGAACCCGGAAGGTCCCGAAGCGGGCGATCTGCCCAACATCTTCGCGCCGCCATCGGGCGTGTTCGCGGCGGAAGTCTTCGCCCCTTACGTCACCCTGAGCGGCGAACGCATCCCTGGCAATGCCAACAGCCGCGAGCGGCTCCCATTGCTGGACGAAGACGGCACAGCGCTGCTGATCCACGCGGCACGCGACGATCAGATGGGCCAGCCTGTCGGCAACGCAGGCGCCCGCATCGCCTGCGCCGCGCTCACGCCACAGCCTTAAAGCGCCGCTCGGCTAGAGCCACCAATGCGACGCCAGCAAGCGTCGCTGCGGCGCCGATCAGAATTTGAACCGTGATGCGGTCATGCAGGATCGCGATAGCGAGCGCAAAAGAGATCACGGGCGTCAGCAGCAAGAACGGTGTCGTGCGCGAGACTTCGTATTTCTGCAGCAGCTTGAACATGAAGGCATTGGCAACGACGGACGACACCAACGCGCCAAACGCGATGAACACCCAAACGCTCCAATGCGCTGCCTGCGCTGCTTCCACATGGCCGCGTTCAAACGTGAGAGACGCAATCAATAGCGTCGGCGCCGCGGCCAACGCGATCCAAGCTTGCGTCGCCCACGGATCAAGTCCGCCGCCCAGCTTGCGCACCATCACGGTGCCCGCCCCGTAAATGGCTGACGCGATCGCCACGAGGAAAAGCGGCCAGCCTTGCGCCAACACGGTCGGATCGAAATTCATCGACGCCGCGCCGACAAACGAGATACCCACACCCGCCCAACGCAAGCGGCTCACGTGCTCGCCCAAAAACAGCGCCGCAAACACCACCGACGCGGGCGCCCATAACTGCATCGCCACGATCATCGGCGGCAAATCGGTCGCGAGCTTCAAGCCGATGGACTGAATGGCGAAGTGCAACGGCCCGATGAAAGCCAGCATCGCGAAGAACAGCGGCAATTGCCCCTTCGGCGGCGGCTTCAGCCAAACGGCCAACACCGCCAGCACAATGATGAACCTGAGCGCCGCCATCATCATAGCCGGCAGCGCGTCCACCGCGATTTTCGCCAGAATATTGTTCACGCCCCAAATCAGCGCGATCGACAGCAGCATGATCAGATCAAGCGGCGCGAAGGCGCGAGAGGGCGTTGTCACGCCCGCTGGCTACGCCCGCCCGCGACTCGCCGCAAGCGGGCCGCCGCTACATTTCAGCCCGAACCGCTGCGGCAATCTCGTAAGCCCGCACCCGCGCAGCGTGGTCGTAGATGTGCGACGCTAATATGAGTTCGTCGGCGCCAGTACGGGTCACGAACGCCGCGATGCTGGCCTTCACCGTCGCCGGCGAACCGATGGCCGAGCATTGCAGCACCTGATCGAGCTGCGCCCGCTCCATGTCATTCAACGTCGCTTCAAACGCAGGGTCTGGCGGCTGCAACCGACGCGGCTCACCCCGCCGCAAGCTTGCGAACGCCTGCATCAACGATGTCGCCAGCAAACGACCTTCCTCGTCCGTCGGCGCCGCGAAAATGTTGAAGCCCAGCATCACGTACGGCTTCTGCAATTGGTCGGACGGCCGGAATGTCGCGCGATAAATCTCGATCGCGCTCATCATCTGCGCAGGCGCAAAGTGCGACGCGAACGCGAACGGCAACCCAAGCGCCGCCGCGAGCTGCGCGCCGAACGTCGAAGACCCAAGCATCCACAGCGGCGTACTTGCGCCCTCACCCGGCACCGCGCGCAATCGTTGCCCAGGCTCGCTCGGCCCGAGCAACGCCTGCAATTCCATCACGTCCTGCGGAAACCGGTCCTCGCCGCCAATCAACGTCCGCCGCAGCGCACGCGCAGTCAGCCCATCGCCACCAGGTGCGCGCCCGAGCCCCAGATCAATGCGCCCCGGAAACAGCGCCTCAAGCGTCCCAAATTGTTCAGCAATGATGATCGGCGCGTGGTTCGGCAACATGATCCCGCCGGCGCCGACGCGGATCGTCTTGGTCGCCTGCGCCGCGTTCATGATGGTGAGCGCCGTCGCCGCACTCGCGACACCCGGCATGGCGTGATGCTCCGCCAGCCAAAAGCGCTTGTACCCCAACCGCTCCGCGGCTTGCGCCAAATCGCGGCTATTCGCCAACGCCTGCGCAACGCTCCCGCCTTCAACGATGGGCGCGAGATCGAGAACCGAGAATGGGATCATGGGCGCGATATGGGCGCTTCATGCGAAGCGCCCAAGTTCACACTGAGTGATTAGAACGGAATCTCGTCGTCCAGATCCTGGCTGAAGCTTTCGCGCGGGGCGTCGCCGCCGCTGCGCTTGCCGCCAGCCTTCGCGGTGAAGCCGCCGCCATCGTCCTCATAGGACTTGCCTTCGCCGCCGCCTTTGCCGCCGAGCATGGTGAGCTCGCCGCGGAACTTCTGCAGCACGATCTCGGTCGAGTATTTCTCGACGCCGTTCTGCTCGTACTTCCGGGTCTGCAGCTGGCCTTCGATGTAAACCGTCGAGCCCTTCTTCAGATATTGCTCGGCGACCTTCGCGATGTTCTCGTTGAAGATCACGACGCGGTGCCACTCGGTCTTTTCCTTGCGCTCGCCGCTCTGCTTGTCGCGCCAGCTCTCCGACGTCGCGACGCTCAGGTTCACCACCGGATCGCCATTGTTGAGCTTCCGCACTTCCGGGTCCTTGCCCAGATTGCCGACCAAAATCACCTTGTTGACGCTACCAGCCAT
Protein sequences of DBSCAN-SWA_2 >NZ_CP047045|2105361:2118402|2115942_2116836_-|WP_158766208.1|DBSCAN-SWA MTTPSRAFAPLDLIMLLSIALIWGVNNILAKIAVDALPAMMMAALRFIIVLAVLAVWLKPPPKGQLPLFFAMLAFIGPLHFAIQSIGLKLATDLPPMIVAMQLWAPASVVFAALFLGEHVSRLRWAGVGISFVGAASMNFDPTVLAQGWPLFLVAIASAIYGAGTVMVRKLGGGLDPWATQAWIALAAAPTLLIASLTFERGHVEAAQAAHWSVWVFIAFGALVSSVVANAFMFKLLQKYEVSRTTPFLLLTPVISFALAIAILHDRITVQILIGAAATLAGVALVALAERRFKAVA >NZ_CP047045|2105361:2118402|2107892_2108432_+|WP_158766201.1|DBSCAN-SWA MRVAIFGRVAAAAILAIGLAACSSGRTADTRRGVGDAAYIPLRDVGLMRPEIPLLLRNLQYPYSTATLADCHAVTREIAALDGVLGPESYQPGPNRNVWDRSGDFVEEQAISAAESTAQDLIPFRSWVRRISGASRAERDALRAVANGQQRRTFLRGYGASLGCPGMIPPPPPNVARQR >NZ_CP047045|2105361:2118402|2116882_2117872_-|WP_158766209.1|DBSCAN-SWA MIPFSVLDLAPIVEGGSVAQALANSRDLAQAAERLGYKRFWLAEHHAMPGVASAATALTIMNAAQATKTIRVGAGGIMLPNHAPIIIAEQFGTLEALFPGRIDLGLGRAPGGDGLTARALRRTLIGGEDRFPQDVMELQALLGPSEPGQRLRAVPGEGASTPLWMLGSSTFGAQLAAALGLPFAFASHFAPAQMMSAIEIYRATFRPSDQLQKPYVMLGFNIFAAPTDEEGRLLATSLMQAFASLRRGEPRRLQPPDPAFEATLNDMERAQLDQVLQCSAIGSPATVKASIAAFVTRTGADELILASHIYDHAARVRAYEIAAAVRAEM >NZ_CP047045|2105361:2118402|2110680_2111145_-|WP_158768071.1|DBSCAN-SWA MVVDNSFHRLTLELDTGTVQIKLRPDLAPGHVQRIGELANEGFYNGVPFHRVIAGFMAQGGDGGNKNGTGGSKKPNLKQEFSAEAHVRGVCSMARTNDPNSANSQFFICLDDARFLDKQYTVWGVVEEGMEAIDALPKGEPPRTPGKIVSARAE >NZ_CP047045|2105361:2118402|2112504_2115327_-|WP_158766206.1|DBSCAN-SWA MESKASFSRCFHQLLNSRFRTGDLVSDTSDPAENGGPTLPKGIALISVEDELKRSYLDYAMSVIVSRALPDVRDGLKPVHRRILFGMDEAGNTHDKSYRKSANTVGEVMGRYHPHGDQAIYLSLVRMAQDFAMSLPLIDGQGNFGSIDGDMPAAMRYTESRLAKTARALLDDIDEDTVDFVANYDGSRQEPSALPAKYPNLLVNGAGGIAVGMATNIPPHNLSEVIDAAVAMVDNPAITDLELLDIVPGPDFPTGGEILGRAGARMGLLTGRGSVTIRSKHHTETIRGREALVFTEIPYQVNKAEMVEKIGEQVREKRIEGIAEVRDESNRLGIRLVVELKRDAVADVVLNQLYRFSQLQTSFGVNMLALNNGKPEQMGLRQMLIHFLSFREQVITRRTKFRLAKARKRGHETVGLAVAVANIDDVIKLIRESPDPGTARERLVARAWPAGDMMPLIALIADPRTIVEDGNAIRLSDEQARAILALQLSRLTGLGRDDIAKAANEIAEEIKELLEILGSRERILTIIRDELTGVKEAFGFVRRTQFSESEGDIDDEALIPREDMVVTVTHGGYVKRTPLAAYRTQRRGGRGRSGMTMKDEDFVTRIFVASTHAPILFFSSSGMVYKMKTWRLPAGDPRTKGRALVNLFPLGAGETITSVMALPEDERDWDKLDVMFATRSGGVRRNKLSDFVQVNRNGKIAMKPDEGDGIVSVQVCSEADDILLTTQNAQCIRFPATDVRVFSGRTSTGNRGINLAEGDSVIGMGVLRHIEVTPAEARAYFKWDAQSRDAGDADVEADAEAGEDTGEVADVPFERLAWLKTQEEFILTMTSGGLGKRASSYEYRITGRGGKGLRAHKLTGEARLIASFPVKTDEELLLVTDKGQLIRTAIDQIRIAGRATQGVILINVSEDEHVVAAERLEALGGEAGEAVEGASEEGGE >NZ_CP047045|2105361:2118402|2112004_2112508_-|WP_158766205.1|DBSCAN-SWA MSKRVSVGLYPGTFDPPTNGHVDIIARAATLVDRLVIGVAINEDKGPLFTLQERVEMVQEQCAKIKGEAEIVIQPFDTLLMSFAESVGAQIIVRGLRAVADFEYEFAMVGMNQKLNPAIETVFLMADPTHQAIASRLVKEIARLQGSVAHFVPQEVERRLKAKFGVS >NZ_CP047045|2105361:2118402|2109549_2110608_-|WP_158766203.1|tRNA|DBSCAN-SWA MRVDLFDFQLPEELIALRPARPRDSARLLLVPAEGGFEDRTVRDALGLFRPGDLLVFNDTRVIAAALRAVRPSRDAQAPDVQVSLNLIERLAPDRWRALAKPGKRLKPGDLLLFEDGLKATIQAKHDEGAVDLCFDRSGAALDATIALVGAPPLPPYIAGKRAPDAEDARDYQTVFAANDGAVAAPTAGLHFTDELLLALEDAGVESARVTLHVGAGTFLPVKAEDTEAHTMHSEWREISAATADAVNRAKAEGRRVIAIGTTALRSLESASDEGGRVSAVSGGTDIFITPGYRFRVVDGLWTNFHLPRSTLFMLVSAFAGLERIQAGYAHAIANGYRFYSYGDASLLWRAE >NZ_CP047045|2105361:2118402|2108428_2109553_-|WP_158766202.1|tRNA|DBSCAN-SWA MSGFPFEIHARDGAARIGVLRTPRGEVRTPAFMPVGTAGTVKALTVEQVASTGSDIILGNTYHLMLRPGGERMARLGGLHRFMRWDKPILTDSGGYQVMSLSQLRKLDAEGVTFRSHVDGAQHRLTPEVAIQIQADQIGADIVMQLDECPALPAPRADVQKAMELSLRWAERCKRAFGARDAQNLFAIVQGGTDAVLRRQSAESLVSEGFDGYAIGGLAVGEGHDAMLATLDVTTPCLPQERPRYLMGVGKPIDIVESVARGIDMFDCVLPTRSGRHGQAWTDAGPLNIKNARFAEDLTPVDESIDCPASRDYSKAYLHHLFKAGELLGQVLLSWHNVAYYQHLMQRLRDSIAAQNLDDFRAAFRARLTPEVED >NZ_CP047045|2105361:2118402|2105361_2107794_-|WP_158766200.1|DBSCAN-SWA MLPVEDVLPTLRAALAERGEAVLVAPPGAGKTTRVPLALLDEPWARTGKIILLSPRRIAARAAAARMASERGEAVGGTIGYRVRLDSRVGPKTRIEVVTEGVFTRMISDDPELRGVAAVLFDEFHERSLEGDLGLALARDTQSALRPDLKIVVMSATLDAARVAALLGDAPVIVSEGRRFPVTHVYRPRDPRALLEQETANAVRAALAAESGSALVFLPGVREIERTAERLRTDLRDPNVDIRPLYGAMSPADQDAAISPAPAGRRKVVLATSIAETSLTIDGVRIVVDAGLARRPRYEPALGLSRLETVRASQAAITQRAGRAGRLEPGVCWRLWSEGETRSLPAFERPEILDADLSGLALDLAAWGVGDPTTLAWLDPPPKPAWNEAIALLTRLGALDEAGRLTEHGAAIATLPLPPRLAHMVIESKQFGETWLAATLAVLLTEQGLGGRSPDLADRVNQLEWDKGKRADAARSLARRISLSAGGGTGRNDELASGRVLARAFPDRVAKSRGGGAFLMVNGRAASMDMSEPMASAPFIVIGETTGTAGKSRILSFAAISAQEVEEAFSAQIETRAAVSVDAATGAVRGRRTRRLGRVVLSEAPLEKLNADEMKQALLDAVRDDGLALLDWDDAAKQVRARVALMLSLEGEAWPDWSDDALSAKLDDWLAPALASRLGDVDVARALQNTLDYEERRRLDAEAPARFETPAGSSLAIDYEAEGGPALEVRLQELFGMDKHPTIAGGRVPLTLRLLSPAHRPVQTTKDLPGFWRGSYAAVRAEMRGRYPKHPWPEDPLSAPPTRRAKPRGS >NZ_CP047045|2105361:2118402|2115374_2115959_+|WP_158766207.1|DBSCAN-SWA MRPLAALLFAALLTACASGAGRSDDYPQATPRTVWIVGPDGRALGQANFTGGPHGVLIQLEFSERALPPGWHGLHLHDRGDCSDFAAGFQASGGHLGMNRRIQHGLMNPEGPEAGDLPNIFAPPSGVFAAEVFAPYVTLSGERIPGNANSRERLPLLDEDGTALLIHAARDDQMGQPVGNAGARIACAALTPQP >NZ_CP047045|2105361:2118402|2117922_2118402_-|WP_158766210.1|DBSCAN-SWA MAGSVNKVILVGNLGKDPEVRKLNNGDPVVNLSVATSESWRDKQSGERKEKTEWHRVVIFNENIAKVAEQYLKKGSTVYIEGQLQTRKYEQNGVEKYSTEIVLQKFRGELTMLGGKGGGEGKSYEDDGGGFTAKAGGKRSGGDAPRESFSQDLDDEIPF >NZ_CP047045|2105361:2118402|2111114_2112005_-|WP_158766204.1|DBSCAN-SWA MLRVCGILAAGAVALSLLSMDAAAQSAPDPRNWRQVDPEHTLYIDTVHGRIVIEMYPEIAPRHVERLETLTRAGYYDGLMFHRVVDGFMAQTGDPLGTGEGASSLPDIREEFRFRRGPDMPFVEAANQGGARLGFYKAIPIESQPDAQMAVTADGRVAANALHCPGVVSMARADNPNSANSQFFIMRAPNTDLDKRYSIFGRVVWGQNAVMALAVGNPPPTPDRMVAVRIAADVPETERAPIYVLRTDGPAFRDLIDDTRRERSADFSVCDVQIPARVPRTENRERRWWSIIPFID |
12 | uncultured_Mediterranean_phage(57.14%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
2354477 : 2366947
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP047045|2354477:2366947|DBSCAN-SWA CTTAACCCTGTGGGCGGGGCGGGACATAATCCCTTTTGGCCCATTTCTTGACGAATTTCTTCAAGTCCTCGTCCGGGGTTTCCGGCAGCATGACCTGCAGTTTCACGAACTGATCACCCTGGCCCGCCACGCCTTTGCCTTTAAGTCGCAGCGTTTTTCCGGTGTTGGAACCGGCGGGGATGGTGAGGGTGACTGGGCCGGTCGGTGTTGGGGCCTGAACGCGGCCGCCCTCGACGGCCTCTGTCAGCGAGATGTTCAAGTCCATGTGAACGTCTTGGCCTTCGCGGCGGAAGAACGCGTGCGGGCGCACCGTCAGTTCCACCAGCGCATCGCCTGCCGGCCCGCCTTGCACGCCGGCGCCGCCCTGATTCTTCAAACGCAGCACTTGGCCGCTCTCGACGCCTGCCGGAATGGCGACATCGAGTGTGCGGCCTTCAGCCAGTGAAACGCGGCGCTTCGCGCCGTTGATAGAGTCGAGGAAATCCACTTCCAGCGTGAAGCGAATATCGCGGCCGCGCATGCGCGAATAGCTGCGGCCATTGCCGAAGCCTGGGCCGAACAAGTCTGAGAAGATGTCGCCGAGGTCGAACGCGTCCTGCGCACCGGCCCCACCAGGGCCGGGCCCGGCCCCTGCGCCGGCATGCGCGCGCGCGCTTTGCCGTGGCCGCGAGCTGAACGCCATGCGTTCGTTGCCGTCGGCATCGATCTCACCGCGGTCGTAACGGGACTTCATCACCGGATCGGAGAGCAAATTGAACGCAGCGGTCGCGCGCTTGAAGCGATCCTCGGCTTGCTTATCGTTGGGACGCACGTCGGGATGCAGCTCCTTGGCCATGGCGCGGTACGCACGACGAATTTCGTCGGCGGTCGCGGTGCGGCCGAGGGCTAGCGTGGCATATGGGTCCCAACTCAAACTGACTGCGCTCCGGCCTTCGGGCAGATTTTGGCAGCGATGAGGTCAGACCCGATGGAATCAGACCGCGCCGGCATTCTGGCCCGCCTGGCTATAATGTTATGGTTAACTCCTTGTTCAACCCTGGCGCACCGTTCTCTCCCATCTGGGCGACTCGGAGGCGCAACGAGACAGGCGGAGCACCGAAATCAGCGATCTGGTCGGCGGCGCCGTAGAGGTGAGCCGGGTTTGTGACGTTCACACTTCGCTTCAGAACGTCCGCGCCGTCGAGGATATCCAGGACGTAGCTTTCCGGCGCCGCGCCGAGCGGCGGCTCGCCCGGCCCCCAAGCGTCGCCGCCCACACGTGCGCAACGCACCCAGCTGATCGCGACGTCGCCGCTCGGTTCACGGCGTGCGCGAAGATGCGCCGGCGCCCAAGGACGCAAAGCGGCGTGCGGCAAAGTGAGCGTGAGATGTTCGGCGCGTGGATCGGTAGCGAGTGCGTTCGCGGGCGGCGCCGCGAACGCCAGCGGCTCATTCCATTCGTGCGCGCCAATCTCAGCGCGCGCCAGCCGCGCGTCCAGCTTCACGATGCGGGCCCCGACCGGATGCGGTTCGCGCATCGCGGGAGCGGAGCCGAGCTGGCCGCGTAGAAAGCCCGAGAGTTGATACTCGTTCGGCGCGACCAATTCGCACGAACGCGCTTGCACAACTTCCCAGTCTCCGTCAGCGCCTTCGATTGCGAACACGTTGTCGCCGTTCAACACGGCATCGGGCGTCGCGCTCGCGAGTGCGCCGCCGTAGAGCTTGATCCGGATGATGTTGCCATCGTCCCACCGATCCACTGGCCCCGGCCACAGCGCCCACAATAATTCACCCATCACCGCAGGCTGCGCGACGGTCGCGCGGCGGCTCAGCGCCGCTCCTGCGTAGAGCGCATGCGCACCGCGCCAGGGTGACGCGTAGATCGCGACCAGCGGCCGGCTCTCGCCTTCGGCCTTCGGCAAGGGAGGCAGATTGAGAACTGAGACGGCTGGAGTCGGCGCGGGCGTCGACACCGGCGCGGTCGGATCGCCGATCTGCGTTTGGGTCGGGGCCGCTGCGCGCGCGCGACGAACATCGAGCTGCCGTTCCGCGCCATCCTCGATTGCGGCGATCTCAAACAGGTCGGACCCGCCGGCGAGCGTGATGCGGTCGCCGGGCTCCAGTGCCAGATGCGCCGGACCGAGTGCTATATTCAGCGCTTCGGCCGCCGCGCGGCGATCCGCGAGCACGCTTTGCGCGATGCGTTCTGCCGCCTCCGCTTCGAGCACAAGCGGGGCGTCCAAGCTGATGACGCCGACTTCGGCGCGGTCCAATCGCCGCGCACTGGCGCCGCCGATCAGGTAATCTCGCGCCGCATCGATAAAGCGCACGCGCGCCTCAATTGGCGCGTCCGCCGCATCCGAGCGCCGCGCAAGCGGTGTGGCGACCGTATCGGCAGTGAGATCGCTTATAGCGACATCCACAGGCGCCGCGTCGTCAGCGTGAAAGAACACAATCGCGCCCTCACGCTCGGCCGCGCCGAAATCGTAAGCGGCCATCAACGGCTCCAGCGCCGCACGCGCCTCCATTGGCGCATCCACGACGTAGCCGCTCAACGCGCCGACCAAGGCCGACGCAGCAACGTCTTCCGCGTCGGCGCGCGCACAGATATCTGTCACCACTTCGCCGAGCATTGAGAGCCCAGCGCGACCATTGAGCCAATGACCGCGCCGCCACGCTGGTCCGTCACTCCAGACCTGGCTCAATGCCGGAAACGCCGGATGTGGCCGCGCATCCCATGCCCAGAGGAATGCGCGCTCGATCATCGGCAGGCCGGTGAGCGACGAGACGGGGTTGGCTTCGCCTTCCGCGTCCCAGTAGCGCAGATACGCTTCGAGCGTGCGGCGTTGAATGAGATCGTCGCGCGCGCCTGAGGAGAAGGGCGGCAACATGCTCTCGGCGCTCTTCGCGTCGATGAAAAGGTTGGGCGCGTTGGCGCCTTTGTCGATTGCGGGGCAACCGAGTTCGATCAGCCAAACCGGCTTCGACTCCGGCGTCCAGGCTGTCGGCGAAGCTGCGCGCACGCCGGCCGGGCGATCGAAGTGCGGGTGCGCCCAGAAATTGCGCACGTCCTTGGCGCGATAGATCCAAGGCTCGGCGTGAGCGCCGTCGGTGATCGACGTGCGCGTTTGCGCCGCGCGATCAGCGTCGGAGGCGTAGAACCAATCGTAGTTCTCACCGGCTTCGATCCGGCTTTCCAGATAGGCGCTATCGTGCGGATGCTTGGTAATCGTGGCGTCGAGGTGCCCGATCCCATCACGCCAATCGGTCAGCGGTGGATACCAATCGATGCCGACGACGTCGATCTCGTCATCGGCCCAAAGCGGATCGAGATGGGAGAGCACATCGCCCGACCCGTCTTGCGGGCGATAGCCTCCGTACTCACTCCAATCTGCCGCATAGGTGAGTGTCGTGCTCGGACCGAGCATGGCGAGAACATCGCTGGCGAGATCGCGAAGAGCGGCGACGACAGGAAAACTGGTCGCACTATCGCGCGCCGTTGTCAGGGCGCGCAGTTCCGATCCGATGATGAACGCGTCGACGCCGCCCGCGAGTGCGGCGAGCTTTGCGTGATGGAGGACGAAGCGGCGGTAGGACCATTCGTTTGGACCGGAATAGGTGGGCACGCCTTCACCGGCGCCAAAATCAGATGGCGCAGCAGCGCCGAAGAAGCTTTCAATCTGCGTTGCCGCTGCGCCCGTCTTGTCCACCGCACCCAGCTGACCGACCGCGGGATGCGGTGTAATGCGCCCGCGCCACGGATAGGCCGCCTGCTCCGCGGCGCCGTAAGGATCCGGCAATGCATCGCCCGCCGACACGTCCATCAGCACGAACGGGTACAAACTCACCGCATAGCCGCGCGCTTTCAGCGCCGCGATCGCTTGCAACACGCTCGCATCACTCGGCGTCCCGCCATAGGCCGGCGCGCCCTCAAACTGACTGACGACGTGCGCGCCGCCGCGCGTGACGCCGCCCGCGCGCCAACTCACGGGCGTCGTGCTTTTCTCATCCAACTCGACGCCAGGCCGGATTTGGCATTCGCCGCAGCGCAGGTCATCGCCGAACCAGGAGACCACTAGCAGCACGCTCTCACAATTGGGAAAATCCTGCTGAAGCTGATCAAGCGACACCAACAGGTTGGCGCGGTCCCGTTCGGCGTGAACGTTTTCCGCCGTCTCTTGGCCCGGCCCGATGCGGCGCATCACGGGCTCGGTGGCGTACACGAACTCCCCCGAGCCAGGGATCAGACAGATCGCTTTGACGCGGTCCTCGAAGCGCGGGCCGGGCGTGCGTTGTGGCGCCGGCCGCACGATCTCAAATGAAAGCTGCGGGATGGTGTTGCCGAAGCGGTCGAGCGGCAGATCTTGCCTCAGACGTAGCTGCCTTCCGCGACTGCGTCGTAGTCCAGGAAGCCGCGCGGCTTGTCATCGCCATCGCCATTGACGAACGCGGCGCGCTCTTGGCCCGCGAAGGCTTCTTCGACTTCGGTCGCGATCCATTCTTCGAGATTGACGAAGCTGTCGTCGAGCAACGTCTGTGTCGCGGCGAGGTTGGCGTAAAGCTCTGAGGTTGGGAATTCGAGCAGTGCCAGTGTCGGCGAAGTCGTCTGCGTGCGCGCCGCGGTTTCCGCGACCCAACCCGTGCCGGCGTTGGTGAGGCTGATCGGCTTCTTGAACACGTTGGCGCCGGTGGTGCGCACGGTCGCAATGCCGCGCATCGGCGAAACCTGGCGCAAACGTGTCTCAATCATGCGGTCGAGTTCAGGGGGCGCGACGTAGCCGCCGTCGGCGCCGCCCGGAGTGGGATCACCAACGACCGAAAGTGCTTTCGCCTCGAATTGCGCGAGCGCCGAAATATCGCCACGGCGAAGGTATGAACTCCATGCCGACTTGTGCTCGCTCAGCGCCGGATCAGCAGCAAGCCCCGGCCGGCGGCCGGCGAGTGCTGCGCGTTCGATCTGCGACTTCTGCTCACTCAGCGCCCGGTCGATGCGATCGACCTTCTCTTCCAAGAGAACGTCGGCGCGTTTACGCTCGATTGCGTCGAGGCGCGCGTCGTTTGCGGTCTTGAAGTCCTCGAACGCGCGCATGAGCGCGTCCTGCGCGCCAACAGGCGCGGCCTTGGTCTCTTTTCTCACGTTTTTCTCCTAAGCTAGCGCGGGTGAACTCTCGCGCAAGAGTCAACCGCGCTTGCGGCTGCATCGGATGGGTCACGATGGACACTTCCAGAAGCTCGATTTCTTCCAACACGCGCCCGCTGGCTTTGCGGTGCGCCACGATCGGCACGAACCCGATCGACAACCCATCGACGCCGCGGGCGATCAAACGCCTGGCGCGCGCGGCGCCGGGCAGCTCGTCGCGGATCTGGCCTCGCAGGTAGAGCCCGCGCCCATCCTCGCGCACAACATCCCAGTGCCCGGCATGCAAGCGCCGTTCGTGTTCAACCAGCATCGGCAGCGGCTCACCGCGCCGCGCCAAGCTCGACGCGAAAGCGCCGGCGCGCACCACGTCCGCCATCTGATCCGCGACGCCGAACAGAGCGGCGTAGCCTTCGATGATCACGGCTCGAGCCGCGCCTCGATGCGCGCGAGCGAACGGCGCGCGTCCGCAACTTGCTCTTCCAACCGCGCCAGGCGCTCGGCTACTTCAGGCTGCGTCGCGACGGCTCGTTCTACCTCTTCCAACCGCGCCGCAGCGCGGCCAGCCCAGATCAAACCACCAGCGGTTCCAGAAACAGCGCAAGAATGAGCGCCAGCGTGACGCGCGGATCAATCCTCATTGCGCACGCTCCGGCGTCTCGACTCCCGCCATCCGGCGCTTCTCGTCCTCGCTCAGGAAATTCGCGCTGCCGATGCGCGACCACAAGGCCTCGCGCTCAATCGTTAGCGCTGGCACATTCTCGACATCGACATTCACTGCAGCAGGCCCGGTCTCGGGCCAGCGATCTCCGATCCACGCTTCGATCCCCCGCGCACCTTTCATGGCGAGCGGCAAAGCTGTCTGGCGCCAGAACGCCAAGTTCGCTTCGCGGTAGTTGGAGTAGGTGTTGTCGCCAGGAATGCCGAGCAACATCGGCGGCACGCCGAACGCCAACGCAATGTCGCGCGCGGCCGAGTGCCGCGCTTCGCTGAAGTCCATCTCCGACGGCGAGAGCGACATCGGCCGCCACTCCAGTCCGCCTTCCAAAAGCATTGGCCGGCCAGCGTTTGCGGCGCCGACATGCATGTCTTCCAGCTCTTGCTTCAAACGCCGGAATTGCTCTTCGCTCAAGCGATCCGCGCCGCCGGCGCCTGTGAACACAAGCGCGCCTGACGGCCGCGCCGCGTTGTCGATCAGTGCTTTGTTCCAGGCGCCGCCGGCATTGTGGATGTCGATGGAGAACGCCGCTGCTTCCATCGGCGAGAGCCCGTACCAGTCGTCGGCCGGATGAAAGAGCTTGAGATGCAGGATCGGCGCATCGCTCGTGACCGGATCGCGTTCGAACTTGCGCACACTTGAACCAACCCGATGCTCCCATCCCACCGGCCAACCATCGGCGCCAGGCACAACGCTCATCCGGTCTGGCCGAAGCACATAAAGCTCGGACGGCGCATCGCTCTCGATGCTCGCCGCTTCGAGGTACGCGTTCCCCGAAACCTGCAGGTGGCCGTAGAACGCTTCCAGCAGCTCAATCCCGGTTTGCTCAGGGTTGGGTCGTGCCAGCAAACGCGCCAGCGGGTGATCCGCCGGGCCGACCTTCAGCGGCGCACTTGCCGCGGCTTCGGCGATCATGCGCACACAACGATACGCGATCGCGTTGCGCGCATAGCCTTCGCGCGCGAACACGCTGGGATCGCGCGCCGACCAGGCCGGGCGCCCAAGCGCGTGCCACGCCAACAGCTTTCGCTCCACGCGCGGCGCGCGGAGACGTCTCCACCAATCCAACATCTAGAGCTTTCTCGTTAAAGTATCCGCACGCGCGGCTGTGCGCCGCCCATGAACAAATCCGTCAGCGCCCACACCAGTGCGTCCACGCGATCAGGGCTGGCCTTCGCAACACCGGCGCCGAACGCGCACATCTCGTCTTCCAGCGCCGGGAACACGCCAGCATGTGTCACGCGCCCTTGCGCGTAGTATGCTGCGACGGGTTCGGCCCGCGCGCGCTTGCCTTTGCTGGCATGAACCAAGCGCACGAAGAACTCCGGCGCAGCTGCGCGCAGAACCTCTCGCACCATTTCGCCGCCGTTGTTCGCCTCGGCGACAATTGCATCAGCACCGACGGATCGCGCCAGCGCTGCCGCGCGCCCGGCCCACTCGTGTGGCGCCGCGCCTTGGATGGTGGCGTCCGCCAGCACTACAACCTGTCGCGCCGCGCCTTCGCCATATGCGCCCGCAGCGATGATGCCGCACGAGTCTGCCTCAGCGCCCACGCTCGCCGGCGGATCGACGGCGACAACGACGCGATCAAACTCCGCATCGACATTGCGCCGCAACGCTTCAACTTCCGCGCGCCGCCACAGCGCACCTTCGAACGTCTCGATCAGTTCACCCAACAATTCCTGCCGGTGATACGCAGTGCCCGCCCAGCGCTCGTTCAGCTTGGCGACGAAGTCCGGCGAGACGTTGTGCCGGTTCGCCCAAGTCGCAGATATTGACCGGACCGTGTCGCGCGCATCGAGCAGGCGCTTCAGCGCCGGCATCGGGCGCGGTGTCGTAGTCACCATCAGCCGCGGCCGTTCGCCCAAACGCAAACCATGCTCCAGGGTCGAGAGCGCTTCCTCTGGATACGCCCAGAAACACAATTCATCCGCCCACGCCGCATCGAACTGCGGCCCGCGCAGCGACTCTGGGTCCTCCGCCGAAAAACAGTACGCTTGCGCTCCGTTGGCCCAGCAGAGCCGCTTGCGGCTCGCCTCATAGACGGGCCGTTCATCGGCCAGGGCGCGTAAGCCCGATGAGCCTTCGATCATAACCTCGCGCACATCTTGAAACGTCGGCGCGATCAGCGCGATCCGCCCCGCTTTGCTCCATCGCGCCAATTCCGAAACCCATTCCGCGCCAGCGCGGGTCTTGCCGGCGCCACGTCCGCCCAAGAACACCCAAGTGCGCCAGTCGCCGTCCGGCGCGATCTGTTCCTCCCGCGCCCAGAACCGCCAATCATTGAGCAGATCCTGTAGCTGCGATGCGCTGAAGCACGTCGTCAGGTGCGCCAGCCCTACTGGCGTCAACGAACACAGCAATGCGGCGGCGCAGTTCGGCGCGTCTGGACTGCTCATCTTCCTCCACGGACGGCGCAACCGGCGCCGCCATTAGTTCACCTACTTCTCGCTCCGCCTTGACCAGCGCCGAGATGGCTTTCGCCCGTCGATCGGCGTCCGAGGCGTTGTCGCCGCGCGTCATCGCCACGAGCGCATCGTTCAGAAGGCGCTTCATCGTTTCAGCTGCTTCGCTCATCGTTGCCTCCTCGAAACCCAAAGCCTCGACAGTGCCTGGATGCTAAGGCTTGGCCGGGTCGGTCGGATAGATTTTGGAGAAGTCCAACCGAATCTGTGCTTTGAACGCGGAACTTGAGGGATACGCAGGGGCTCTATCCGCGTTTCTCCCGCCAGAACGCTGGCCAGAATTCGGCCATACCTCTATTGGACACCCGCTCCCGGAGAACTCCTTGTCCGATCTACCACCAGACCCGCGCCGCGGCATCGACTGGGCGAAGCTGCGCGCCGCCGCGCAGACGCACGCCGCTGCGATCGGCGACCGCTCGAACTTCACATGGCGCCGGATCGCGAGCTTCGCGGCCGGCGGCGTGTTGCTGTTCGTGTTGGCGCTCTGGATTTGGATTTACTGGGGCCTGCCGCGCGTTCCGGATGCTGAAGCGCTGTGGGCGCTCAATCGTCAACCGTCGATTATGTTTGTCGACACCGAAGGCGAGATCATCGGCGTGCGCGGGCCCTATTATGCGCGGCGCGCAACGCTCGCAGAGCTGCCCGAATACGTGCCGCAGGCCTTCCTCGCGATCGAGGACCGGCGCTTCTATCAGCACGAGGGCGTCGACCGCATGGCGATCTTCCGCGCCATCCTCGCCAACCTTCGCGCCGGCGAGACGGTGCAAGGCGCGTCAACCATTACGCAGCAGCTCTCGCGCAATCTTTTCCTCACGCCGAACCAAACCATCAACCGCAAGCTCCGCGAGATGGTGCTCGCATCCCGCATCGAGCGGCGGCTGACCAAAGATGAGATCCTCGAGCTCTACCTCAACCGCGTTTATCTCGGCGACCAGGCTTACGGCGTCGATGCAGCCGCGCGGCGTTTCTTCGGAAAAACCTCAAGTGAGTTGACACTCGCGGAAGCCGCCATGCTGGCTGGTTTGCCCAAAGCGCCATCGCGCTCGGCGCCGACGGAGAGCATGGAACGGGCGACAGCGCGTCAGCACGTCGTGCTCGATGCGATGGTCGAGGCAGGCTTCATCACCGCCGAGCAAGCCGCCGAAGCCAAGGAAGAGCGCATCCGCGTCATCGAGCGCCCGAGCACCGAACGCGCCATGGGCTACGCGTTCGATCTCGCCGTCGAACAAGCACGCGCCGCCGTCGGCCGCGACACACCCGACCTCGTGATCCAGATGACGATTGATCGTGACGTCCAAGAGGCAGCCGCCAACTCGATCCGGCGGCGCCTCGGCAACCGCGCCTTCGGCCGCCGCCCGCTGCAAGCCTCGATGATGGCCGTCAACCGCCAAGGCGCAATCGTCGCGCTCGTTGGCGGGACTGATTACAACACGTCGAAGTTCAATCGCGTCACCCAAGCCGAACGCCAACCCGGCTCAACATTCAAGACGTTCGTCTACACCGCCGCGCTTGAAGCTGGCCTCGACACCGAAGATGTGCGTTACGACGAACCCGTCGTCATCGATGGCTGGCGCCCGCGCAACTACGATGACGGCTATCGCGGCGCCGTCACGCTGCGCACGGCATTTGCGCTCTCGATCAACACTGTCGCTGCCTCTGTCGCCAACGAAGTCAGTCCACGCCGTGTCGCCGACGTCGCCACGCGCCTCGGCATTACCGACATGCCCGCTCGCGGCCAATTCGTGCCGCCTTCGATCGCGCTGGGCTCTATCGAGACGACGCTGTGGGACATGACATCGGCGTTCGCCGTTTTCATGAATGACGGCCGGCGTATCGACGCGCACATCATTCAGTCGGTCTCGAACTCCGCGGGTCAGCTGCTGTATACGCGCCCGCCCTATGAAGGCGCGCGCGTCCTCGACGAGCAAGTCGTCCAGCACATGACCAGCATGATGGGCGCTGTTGTCCTGCGCGGCACCGGTACAGGCGCCAGCCTCGGCGGGCGCGACGTCGCAGGCAAAACCGGCACCAGCTCCGATTGGCGCGACGCCTGGTTCGTCGGCTACACCGCCGACTACACTGCCGGCGTCTGGGTCGGCCACGACGACTTTACCTCGATGGGCCGCACCACCGGCGGCACCTTGCCGGCGCAGATCTGGAACGACACCATGCGCGTCGCGCACCAGGGCGTCGAAGATCACCCGCTCCCCGGCATCGAGCAGCCAGCGTATTCGCCAGCGGAAATAGAGACGGCGTCGTTCTTCGACGATCTCGCCAATGCTTTCGGCGATTCCGGCAACGATCTGGGTGACGCGCTCGAAGAGATTTTCAACTAGTCTCGCGCGATGGCAGGTCCAGCTCTCGCGCACGCAAAGGACTTGCGCCTAACGCTGGGCAGCGCGCCGCTGTTCGAAGGCGTCTCCTTCGTCCTGCACAAGGGCGAGTGCGCCGCGCTCATCGGCGCCAATGGCGCCGGTAAATCGACGCTGATCCGCATGCTCGCCGGAGAAGCTGAACCGGACTCAGGCATCATCACCTATGCCTCCGGCACGGTGGTGGCGCTGGCGCGCCAAGAGCCGGATATGGAGGGCTTCGCCACCCTCCGCGACTACGCACGGGCGCCTTCTGTCAGCATCGCCAGCAGCGACCGTCCGGCGCCCGCGCACGACGCTGACTCCGAACTCGAACTCTTCGGGCTTGATCCCTATCGCGCGCCTACCGGTCTCTCCGGCGGCGAGACGCGCCGCGCTTCTCTCGCGCGTGCATTCGCCGCTGGGCCAGACATTCTGCTCCTCGACGAACCAACGAACCATCTCGACATCGCGGCGATCGAACTGCTTGAACAACGTGTCGCTGCGTTCAACGGCGCCTGCCTGATCGTCAGTCACGACCGCCGCTTCCTCGAACGCGTCACCACCGCGACGCTTTGGTTGCGGCAACGCCGCGTCCTCACATCCGACGAAGGCTATGCGCAGTTCGAGAGCTGGGCAGAACGCATTGAGGTCGAAGAGGAACGCTACGCCGCGCGGCTCGAAACCCACCTCAAAGCCGAAGAGCACTGGCTCCGCCGCGGCGTCACAGCCCGCCGCTCGCGCAACGAAGGTCGTCGCCGCAAGCTGGAAGCCATGCGCACGGAAAAGCGCGACTTCAAGGCTCTCAGCGCAACGCCAAAGGCTGCGTTGCAAGCCGACAAGGGCGCGGAATCCTCGAAGCTGGTGATCGAGGCCAAACGCGTATCGATCGCGTACGGACGTCCCATCGTGACCGACTTCTCCACCCGCATCATGCGCGGTGATCGCGTCGGCGTCGTCGGCGCCAATGGCGCGGGTAAGACGACACTGCTCGAACTTCTGCTCCAGCGCCGCGATCCGGAGTCCGGCGAAGTTCGCTTGGGCGGCAACCTCGAAATCGCCTACGTCGATCAATCGCGCGCCATCTTGGGCAACGCCGGCACCATCTGGGACGCGCTTGCGCCACGCGGGGGCGATCAGATCATGGTGCGGGGCCGGCCAAAGCACGTCGCTGCGTACGCCGGCGAATTTCTGTTCTCGCCCGCGCAGCTCCGGCAACCCATCGACGCGCTTTCCGGCGGCGAACGCAATCGCCTCGCGCTAGCGGTGGCGCTGGCCAAGCCGGCGAACCTGCTCGTGCTCGATGAGCCGACGAACGATCTCGATATTGATACGCTCGACGCGCTCGAAGATATGCTCGCCGCTTACGACGGCACCGTGATCCTGGTCAGCCACGATCGGGCCTTTCTCGACGGCGTTGCAACGCAAATCATTGGCCCCCTCGGCGACGGCAAGTGGGTGGAGGCACCGGGCGGCTGGTCTGATTTCGAGCGCGAGTATGGCGGTGTGAAACCGAAACGCCGCGAGCAACCTGCAGCTCAACGCGCCGAACCCAAGCCGCAAGCGCCACGCAAGGCCACCAAGCTGAGCTACAAGGACGAACGTCGCGCCGCTGAACTCGATACGCTGCTGCCGAAGCTATCTGCGGAAATCGGCGTACTCGAAGCAAGTCTCGCCGCTTCCGGCGTCTTCGAGCGTGACCCGAAAGCGTTCCACACGACCGCAGCCCGCCTCGAAGCGGCGCGGGCCGAGCACATTGCTGGCGAAGGCGAATGGTTGGAGATTGAACTCAAGCGCGAGGCGCTCGCGTCGGAGGAATGA
Protein sequences of DBSCAN-SWA_3 >NZ_CP047045|2354477:2366947|2361395_2362709_-|WP_158766434.1|DBSCAN-SWA MSSPDAPNCAAALLCSLTPVGLAHLTTCFSASQLQDLLNDWRFWAREEQIAPDGDWRTWVFLGGRGAGKTRAGAEWVSELARWSKAGRIALIAPTFQDVREVMIEGSSGLRALADERPVYEASRKRLCWANGAQAYCFSAEDPESLRGPQFDAAWADELCFWAYPEEALSTLEHGLRLGERPRLMVTTTPRPMPALKRLLDARDTVRSISATWANRHNVSPDFVAKLNERWAGTAYHRQELLGELIETFEGALWRRAEVEALRRNVDAEFDRVVVAVDPPASVGAEADSCGIIAAGAYGEGAARQVVVLADATIQGAAPHEWAGRAAALARSVGADAIVAEANNGGEMVREVLRAAAPEFFVRLVHASKGKRARAEPVAAYYAQGRVTHAGVFPALEDEMCAFGAGVAKASPDRVDALVWALTDLFMGGAQPRVRIL >NZ_CP047045|2354477:2366947|2363098_2365111_+|WP_158766436.1|DBSCAN-SWA MSDLPPDPRRGIDWAKLRAAAQTHAAAIGDRSNFTWRRIASFAAGGVLLFVLALWIWIYWGLPRVPDAEALWALNRQPSIMFVDTEGEIIGVRGPYYARRATLAELPEYVPQAFLAIEDRRFYQHEGVDRMAIFRAILANLRAGETVQGASTITQQLSRNLFLTPNQTINRKLREMVLASRIERRLTKDEILELYLNRVYLGDQAYGVDAAARRFFGKTSSELTLAEAAMLAGLPKAPSRSAPTESMERATARQHVVLDAMVEAGFITAEQAAEAKEERIRVIERPSTERAMGYAFDLAVEQARAAVGRDTPDLVIQMTIDRDVQEAAANSIRRRLGNRAFGRRPLQASMMAVNRQGAIVALVGGTDYNTSKFNRVTQAERQPGSTFKTFVYTAALEAGLDTEDVRYDEPVVIDGWRPRNYDDGYRGAVTLRTAFALSINTVAASVANEVSPRRVADVATRLGITDMPARGQFVPPSIALGSIETTLWDMTSAFAVFMNDGRRIDAHIIQSVSNSAGQLLYTRPPYEGARVLDEQVVQHMTSMMGAVVLRGTGTGASLGGRDVAGKTGTSSDWRDAWFVGYTADYTAGVWVGHDDFTSMGRTTGGTLPAQIWNDTMRVAHQGVEDHPLPGIEQPAYSPAEIETASFFDDLANAFGDSGNDLGDALEEIFN >NZ_CP047045|2354477:2366947|2354477_2355389_-|WP_158766430.1|DBSCAN-SWA MSWDPYATLALGRTATADEIRRAYRAMAKELHPDVRPNDKQAEDRFKRATAAFNLLSDPVMKSRYDRGEIDADGNERMAFSSRPRQSARAHAGAGAGPGPGGAGAQDAFDLGDIFSDLFGPGFGNGRSYSRMRGRDIRFTLEVDFLDSINGAKRRVSLAEGRTLDVAIPAGVESGQVLRLKNQGGAGVQGGPAGDALVELTVRPHAFFRREGQDVHMDLNISLTEAVEGGRVQAPTPTGPVTLTIPAGSNTGKTLRLKGKGVAGQGDQFVKLQVMLPETPDEDLKKFVKKWAKRDYVPPRPQG >NZ_CP047045|2354477:2366947|2362590_2362887_-|WP_158766435.1|DBSCAN-SWA MSEAAETMKRLLNDALVAMTRGDNASDADRRAKAISALVKAEREVGELMAAPVAPSVEEDEQSRRAELRRRIAVFVDASRAGAPDDVLQRIAATGSAQ >NZ_CP047045|2354477:2366947|2359482_2360013_-|WP_158768085.1|head,protease|DBSCAN-SWA MIEGYAALFGVADQMADVVRAGAFASSLARRGEPLPMLVEHERRLHAGHWDVVREDGRGLYLRGQIRDELPGAARARRLIARGVDGLSIGFVPIVAHRKASGRVLEEIELLEVSIVTHPMQPQARLTLAREFTRASLGEKREKRDQGRACWRAGRAHARVRGLQDRKRRAPRRNRA >NZ_CP047045|2354477:2366947|2358824_2359592_-|WP_158766432.1|capsid|DBSCAN-SWA MRKETKAAPVGAQDALMRAFEDFKTANDARLDAIERKRADVLLEEKVDRIDRALSEQKSQIERAALAGRRPGLAADPALSEHKSAWSSYLRRGDISALAQFEAKALSVVGDPTPGGADGGYVAPPELDRMIETRLRQVSPMRGIATVRTTGANVFKKPISLTNAGTGWVAETAARTQTTSPTLALLEFPTSELYANLAATQTLLDDSFVNLEEWIATEVEEAFAGQERAAFVNGDGDDKPRGFLDYDAVAEGSYV >NZ_CP047045|2354477:2366947|2355480_2358765_-|WP_158766431.1|DBSCAN-SWA MRPAPQRTPGPRFEDRVKAICLIPGSGEFVYATEPVMRRIGPGQETAENVHAERDRANLLVSLDQLQQDFPNCESVLLVVSWFGDDLRCGECQIRPGVELDEKSTTPVSWRAGGVTRGGAHVVSQFEGAPAYGGTPSDASVLQAIAALKARGYAVSLYPFVLMDVSAGDALPDPYGAAEQAAYPWRGRITPHPAVGQLGAVDKTGAAATQIESFFGAAAPSDFGAGEGVPTYSGPNEWSYRRFVLHHAKLAALAGGVDAFIIGSELRALTTARDSATSFPVVAALRDLASDVLAMLGPSTTLTYAADWSEYGGYRPQDGSGDVLSHLDPLWADDEIDVVGIDWYPPLTDWRDGIGHLDATITKHPHDSAYLESRIEAGENYDWFYASDADRAAQTRTSITDGAHAEPWIYRAKDVRNFWAHPHFDRPAGVRAASPTAWTPESKPVWLIELGCPAIDKGANAPNLFIDAKSAESMLPPFSSGARDDLIQRRTLEAYLRYWDAEGEANPVSSLTGLPMIERAFLWAWDARPHPAFPALSQVWSDGPAWRRGHWLNGRAGLSMLGEVVTDICARADAEDVAASALVGALSGYVVDAPMEARAALEPLMAAYDFGAAEREGAIVFFHADDAAPVDVAISDLTADTVATPLARRSDAADAPIEARVRFIDAARDYLIGGASARRLDRAEVGVISLDAPLVLEAEAAERIAQSVLADRRAAAEALNIALGPAHLALEPGDRITLAGGSDLFEIAAIEDGAERQLDVRRARAAAPTQTQIGDPTAPVSTPAPTPAVSVLNLPPLPKAEGESRPLVAIYASPWRGAHALYAGAALSRRATVAQPAVMGELLWALWPGPVDRWDDGNIIRIKLYGGALASATPDAVLNGDNVFAIEGADGDWEVVQARSCELVAPNEYQLSGFLRGQLGSAPAMREPHPVGARIVKLDARLARAEIGAHEWNEPLAFAAPPANALATDPRAEHLTLTLPHAALRPWAPAHLRARREPSGDVAISWVRCARVGGDAWGPGEPPLGAAPESYVLDILDGADVLKRSVNVTNPAHLYGAADQIADFGAPPVSLRLRVAQMGENGAPGLNKELTITL >NZ_CP047045|2354477:2366947|2365120_2366947_+|WP_158766437.1|DBSCAN-SWA MAGPALAHAKDLRLTLGSAPLFEGVSFVLHKGECAALIGANGAGKSTLIRMLAGEAEPDSGIITYASGTVVALARQEPDMEGFATLRDYARAPSVSIASSDRPAPAHDADSELELFGLDPYRAPTGLSGGETRRASLARAFAAGPDILLLDEPTNHLDIAAIELLEQRVAAFNGACLIVSHDRRFLERVTTATLWLRQRRVLTSDEGYAQFESWAERIEVEEERYAARLETHLKAEEHWLRRGVTARRSRNEGRRRKLEAMRTEKRDFKALSATPKAALQADKGAESSKLVIEAKRVSIAYGRPIVTDFSTRIMRGDRVGVVGANGAGKTTLLELLLQRRDPESGEVRLGGNLEIAYVDQSRAILGNAGTIWDALAPRGGDQIMVRGRPKHVAAYAGEFLFSPAQLRQPIDALSGGERNRLALAVALAKPANLLVLDEPTNDLDIDTLDALEDMLAAYDGTVILVSHDRAFLDGVATQIIGPLGDGKWVEAPGGWSDFEREYGGVKPKRREQPAAQRAEPKPQAPRKATKLSYKDERRAAELDTLLPKLSAEIGVLEASLAASGVFERDPKAFHTTAARLEAARAEHIAGEGEWLEIELKREALASEE >NZ_CP047045|2354477:2366947|2360229_2361381_-|WP_158766433.1|portal|DBSCAN-SWA MLDWWRRLRAPRVERKLLAWHALGRPAWSARDPSVFAREGYARNAIAYRCVRMIAEAAASAPLKVGPADHPLARLLARPNPEQTGIELLEAFYGHLQVSGNAYLEAASIESDAPSELYVLRPDRMSVVPGADGWPVGWEHRVGSSVRKFERDPVTSDAPILHLKLFHPADDWYGLSPMEAAAFSIDIHNAGGAWNKALIDNAARPSGALVFTGAGGADRLSEEQFRRLKQELEDMHVGAANAGRPMLLEGGLEWRPMSLSPSEMDFSEARHSAARDIALAFGVPPMLLGIPGDNTYSNYREANLAFWRQTALPLAMKGARGIEAWIGDRWPETGPAAVNVDVENVPALTIEREALWSRIGSANFLSEDEKRRMAGVETPERAQ |
9 | Chrysochromulina_ericina_virus(14.29%) | protease,head,capsid,portal | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|