Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP028892 | Vibrio cholerae strain Sa5Y chromosome 1, complete sequence | 1 crisprs | RT,cas3,DEDDh,DinG,csx1,csa3 | 0 | 1 | 4 | 0 |
NZ_CP028893 | Vibrio cholerae strain Sa5Y chromosome 2, complete sequence | 0 crisprs | cas9,Cas9_archaeal,DEDDh,csa3,cas3 | 0 | 0 | 0 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP028892_1 | 2579760-2580004 | Orphan |
NA
Consensus repeat of NZ_CP028892_1
|
2 spacers
spacers of NZ_CP028892_1
>1.1|2579810|39|NZ_CP028892|PILER-CR AGTAGGTCGCCAGTTCGATTCCGGCAGCCGGCACCACTT >1.2|2579899|63|NZ_CP028892|PILER-CR GTGTCGGTGGTTCGATTCCGCCTCGAGGCACCACTATTAAAAGTTGTGCCATAGACACAATAA |
CRISPR arrays and Neighbor proteins around NZ_CP028892_1
The CRISPR arrays of NZ_CP028892_1 >merge|NZ_CP028892|1|2579760-2580004|PILER-CR TGTGCCGGCTTAGCTCAGTAGGTAGAGCAACTGACTTGTAATCAGTAGGTCGCCAGTTCGATTCCGGCAGCCGGCACCACTTAAAATTTGCCTCGATAGCTCAGTCGGTAGAGCAGAGGATTGAAAATCCTCGTGTCGGTGGTTCGATTCCGCCTCGAGGCACCACTATTAAAAGTTGTGCCATAGACACAATAAAAATTGTTCCTCCTTAGCTCAGTCGGTAGAGCGACGGACTGTTAATCCGC >NZ_CP028892|1|1|2579760-2580004|PILER-CR TGTGCCGGCTTAGCTCAGTAGGTAGAGCAACTGACTTGTAATCAGTAGGT CGCCAGTTCGATTCCGGCAGCCGGCACCACTTAAAATTT GCCTCGATAGCTCAGTCGGTAGAGCAGAGGATTGAAAATCCTCGTGTCGG TGGTTCGATTCCGCCTCGAGGCACCACTATTAAAAGTTGTGCCATAGACACAATAAAAATTGT TCCTCCTTAGCTCAGTCGGTAGAGCGACGGACTGTTAATCCGC
>NZ_CP028892.1|WP_108243901.1|2578401_2579529_+|membrane-bound-lytic-murein-transglycosylase-MltC MRKLVLCITALLLSGCSRGFIEKIYDVDYEPTNRFAKNLAELPGQFQKDTAALDALINSFSGNIEKRWGRRELKIAGKNNYVKYIDNYLSRSEVNFTEGRIIVETVSPIEPKAHLRNAIITTLLTPDDPAHVDLFSSKDIELKGQPFLYQQVLDQDGQPIQWSWRANRYADYLIANHLKVKQVDFKKAYYVEIPMVKDQIDIRGYKYASIVRKASRKYDIPEDLIYAIIKTESSFNPYAVSWANAYGLMQVVPKTAGRDVFKLVKNRSGEPSPEYLFNPENNIDTGTAYFYILKNRYLKEVRHPTSLEYSMISAYNGGTGGVLSTFSSDRQRAMRDLNALQPNQVYWALTKKHPNAEARRYLEKVTKFKKEFNAG >NZ_CP028892.1|WP_000124737.1|2578026_2578299_+|oxidative-damage-protection-protein MARTVFCTRLQKEADGLDFQLYPGELGKRIFDNICKEAWAQWQSKQTMLINEKKLNMMDPEHRKLLEQEMVNFLFEGKEVHIEGYTPPAK >NZ_CP028892.1|WP_108243900.1|2576951_2578013_+|A/G-specific-adenine-glycosylase MTPFAQAILTWYDAYGRKNLPWQQNKNAYRVWLSEIMLQQTQVATVIPYFERFLERFPTVQALAAAPQDEVLHFWTGLGYYARARNLHKAAQTVVNQYGGEFPTDLELMNALPGVGRSTAAAVLSSVYKKPHAILDGNVKRTLARCFAVEGWPGQKSVENQLWHYAEMHTPSVDVDKYNQAMMDMGAMICTRSKPKCSLCPVESFCLAKQQGNPQEYPGKKPKTDKPIKATWFVMLYHDNAVWLEQRPQTGIWGGLYCFPQSEVANIQTTIDQRAIGDSTIKSQKTLIAFRHTFSHYHLDITPILLQLSRKPDIVMEGSKGLWYNLSQPDEIGLAAPVKQLLHSLPFDIDSHI >NZ_CP028892.1|WP_000005574.1|2575992_2576712_-|tRNA-(guanosine(46)-N7)-methyltransferase-TrmB MSEVITQEFTEDGKVLRRIRSFVRREGRLTKGQEAAMKECWPTMGIDYQPELLDWQQVFGNDNPVVLEIGFGMGASLVEMAKNAPEKNFFGIEVHSPGVGACLASAREAGVTNLRVMCHDAVEVFAHMIPDNSLHTLQLFFPDPWHKKRHHKRRIVQLEFAEMVRQKLIIGSGVFHMATDWENYAEHMVEVMNQAPGFANLATDGDYIPRPDERPLTKFEQRGHRLGHGVWDIKYQRTA >NZ_CP028892.1|WP_055043572.1|2574961_2575882_-|glutaminase-B MKPTADILASILDEVRPLTSKGNVADYIPALAKVPSEKLGIAVFTNQGEVITAGDAQEGFSIQSISKVLSLTLAMGLYQPDELWSRVGKEPSGQAFNSLIQLEMEQGIPRNPFINAGAIVVCDMLQSRLSAPRQRLLEFVRQLSGEPQIAYDKVVAASEMMHSDRNAAIAYLMRSFGNFHNEVIPVLHNYFHACALKMSCVELAKTFSYLANKGVSVVTGETVITPTQSKQTNALLATCGLYDGAGEFAYRVGMPGKSGVGGGIIAVVPGEMTIAVWSPALDQSGNSLAGTRALELLAQRIGRSIF >NZ_CP028892.1|WP_094392754.1|2573721_2574897_+|radical-SAM-family-heme-chaperone-HemW MLTPPPLSLYIHIPWCVQKCPYCDFNSHAQKGEIPEQEYLDALLQDLDRDIERYQLKGDPRLLHSIFIGGGTPSLISAEGIARLLQGVAERIAFKPEIEITMEANPGTIEAQRFAGYRTAGVTRISIGVQSFEPEKLARLGRIHGQQEAVRAAQLAHQIGLNSFNLDLMHGLPDQTPEQALADLDQAIALNPPHLSWYQLTIEPNTLFYSKPPKLPDEDALWDIFELGHQKLSDAGYVQYEISGYSKLGYQCQHNLNYWRFGDYLGIGCGAHGKLSFADGRIVRTTKTKHPRGYLAALNNLAKAYLDSEQLVADQDKPFEFFMNRFRLIEPCPKADFTATTGLTIDVIRPTLDWALSEGYLSEDDQHWQITEKGKLFLNDLLEAFMADEEE >NZ_CP028892.1|WP_000725016.1|2573125_2573728_+|XTP/dITP-diphosphatase MKKIVLATGNQGKVREMADLLSDFGFDVVAQSEFNVPEAAETGTTFIENAIIKARHAAQITGLPAIADDSGLEVDYLNGAPGIYSARYAGEHASDGDNLNKLLVAMQDVPDDQRSARFHCVLVLMRHADDPTPIVCHGKWEGKILTAPHGSNGFGYDPIFWVPEENCASAELEPVRKKQLSHRGKALQKLFKAIEEQRTC >NZ_CP028892.1|WP_094392753.1|2572624_2573056_+|DUF4426-domain-containing-protein MREWMMSLLLTLLALPASAEQQKTIKDIEVHYSAFNSTFLTPKVASSYQLTRNGYTAILNISVLDRASLGKPATEAKLTGHAKNLIGNLRELSFKQVKEGNAIYYLAEVPISNEEMLTFDIDVDAGLKGAGKLTFSQKFYTEQ >NZ_CP028892.1|WP_001914289.1|2572289_2572580_+|YggU-family-protein MAAVWREGDDLLLRLYIQPKASRDSIIGLHGEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGSVVIEKGELGRHKQVRILQPSQIPAEIAALIE >NZ_CP028892.1|WP_001087261.1|2571732_2572290_+|YggT-family-protein MNSLSFLINTLFDLYIMVVILRIWLQAARADFYNPFSQFIVKATQPVVGPLRRVIPSIGSIDLATIVFAYVLCVLKFMALVLIASSGSVSFSADFLFLGLLSLIKAAGGLLFWVLLIRAILSWVSQGRSPIEYVFHQLTEPMLAPIRRIIPVMGGFDLSILVLFIVLQFANFLMGDVIGPIWYQL >NZ_CP028892.1|WP_000769919.1|2580292_2581918_+|methyl-accepting-chemotaxis-protein MKLKTQAYLLSAIILAALLALTATGLWTLRVASNLDNKARVTELFNSAYSILTEVEKLAQEGKMSEPEAKALATRLMRNNLYKDNEYVYVADENMTFVATPLDPQLHDTSFHDFKDGKGNSVGRLIQDVLRHQSGKLVEYTWTQKQADGSIEEKLSIARKTPHWGWVVGTGIGFNEVNARFWSTAQWQLSLCVVIAVAILSLLLVAIRKILLIIGGEPNEVRSAVQAVAQGRIRREFAIQAPKESIYGAVQQMSSSLADLVAKLEQSMVALRSELAGAGTRAKSIAELTDSQQQSTAMIATAMTEMASSANQVADSARDTAFNTDQADQQSQHTQKLIHNTVSNIQGLATQLQTASTAVAELDLDVKNIAKVLDVIGDIAEQTNLLALNAAIEAARAGEQGRGFAVVADEVRNLAGRTQTSTKEIQQMIHNLQEGSRNAIQTIQICGQTSQSSVQESENAASALALIVSALESVSSMSHQIATAAAEQTQVSDDIARRINMIEESGSKLSRVVMESHNSTQTLTKLARELEQWAAHFEVTR >NZ_CP028892.1|WP_000546924.1|2582964_2583819_-|co-chaperone-DjlA MHIFGKILGAFFGFLFGGPFGAIFGIFLGHQFDKARRLNQAGFQSGTFGAGPSQAERQEEFFKSAFSVMGHVAKAKGQVTKEEIQLATIMMDRMNLTLEQKRAAQDAFRDGKESDFPLEQVLERVKIATGGRFDLLQFFLELQVSSAFADGDVHPSERQVLHRIARGLGFSSEQLERRLRMQEAAFRFQQGGGFGGSQQQSHSGQQWQQPSSRHQLADAYEVLGVSESASAQEVKRAYRKLMNEHHPDKLMAKGLPPEMMNVAKEKSQQIQHAYELIRKEKGIK >NZ_CP028892.1|WP_071180988.1|2583977_2586329_+|LPS-assembly-protein-LptD MSCFSRTFLAASISAALFAPQIQAEASVDDNRAQLPNGEQCLVNQPEPTNPGQQPINVEADKLEAINGQKATYSGNVVVVQGKKRIAADNVTLHQQENVVVAEGNVQFSDGEIKTHSTKATNHLNTDEMTLENTRYQFLCEPGRGEAVYVSKTGKAVYEIEDGSITSCPDGDNAWRMRASSIDVDQNEEIATFYNPRLEVQNVPVFYLPYLTVPIGDTRKTGFLYPTASYGSRNGYSFEVPIYWNLAPQYDLETTFNYMQKRGTQLNSVFRYLTDFGAGQIKSEYLADDQLHTELGDRWAFQYEHNGIFQQAWKFEIDYSKVSDINYFSDLDSGVGNREDGQLIQEGRATYRSDSWDSALLVRDFQLLTKDTTSTNLPYRLMPQLSYNYYAPETMKYLDLDLVSHVSRFETDARGKPSATRVHIEPGLKIPFSNTWGNWTTEARVLGTYYQQDLDKTTDAKLEESVTRVIPEIRSVAGIVLERDTVLLDDYTQTLEPKIQYLYVPEKYQDNIGLYDSTLLQTDYYGLFRSRKYSGVDRIESANQVSYGASTRFFDSNYKERLNIAFGQIFYLDSKLNPSNKNPDSTSDKTSYSAWAVEMDFNFADYLFYHGGIQYDIDSQAVQLGNSTLEYRVASGYIQANYRYVAKDYIRNTVGDSITNIDDITRDGISQAGILAGYQLSRKWSASGQYYYDLTTDEALEWLANLTYTSDCWYVGFTYSNQLKSWNGNFVTDPYATPIYENNFSFNIGIIGFGTSIGAGSSMTGVDSAGNSLGYGRPFFLNN >NZ_CP028892.1|WP_000780521.1|2586381_2587677_+|peptidylprolyl-isomerase-SurA MKLWKPTLISVLSALTLFNAHAEPKQLDSVAVIVNSGVILQSDVDSALKTIKANAKQNKQPLPQETVLREQVLEKLIIDTLQQQEADRIGVKIDDNRLNEAIKEIAKNNQQTQEQLIASVAQEGLTYPEFREQVRKEMAASDARNALVRRRINILPAEVDTLAELLAQETDATVQYKISHIQLRVDDGQDKSAAETLANKLVNDLKNGADFAQMAYAYSKGPKALQGGDWGWMRKEEMPTIFADQIKMQNKGSIIGPFRSGVGFHILKIDDVKGLETVAVTEVNARHILIKPTIILSDEGAQKQLNEFVQRIKNGEVTFAELAQQYSQDPGSAAQKGELGYQTPDLYVPEFKHQIETLPVGQISEPFKTVHGWHIVEVLDRREVDRTDSALKNKAYRILFNRKFNEEASAWLQELRASAFVEVLKDEKDEQ >NZ_CP028892.1|WP_000095776.1|2587666_2588659_+|4-hydroxythreonine-4-phosphate-dehydrogenase-PdxA MSSKRIIVTAGEPAGIGPDLVLALSAQDWPHQLVVCADKALLAQRAVQLGIQVKLLDYQRDNPVQAQQAGTLLVEHIPLAEPVVAGQLNPANGHYVLKTLERAAKGCMNGEFDAIVTGPVHKGVINRAGVAFSGHTEFFAEQSKTPLVVMMLATEGLRTALVTTHLPLAEVPQAITCERLEQIVHILHKDLVEKFAIAEPKIYVCGLNPHAGEDGVLGMEEIETITPTLQRLREQYGMQLVGPLPADTIFSEKYLQQADAVLGMYHDQVLPVLKYKGFGRSVNITLGLPFIRTSVDHGTALDLAGTGQADAGSFWTALAYAIELVDKKAQ >NZ_CP028892.1|WP_001243262.1|2588672_2589488_+|16S-rRNA-(adenine(1518)-N(6)/adenine(1519)-N(6))--dimethyltransferase-RsmA MRNDVHLGHKARKRFGQNFLNDPYIIDGIVSAINPKPGQNLVEIGPGLGAITEPVGREVDKFTVIELDRDLAERLRNHPELASKLTIHEGDAMRFDFKQLVKPNNKLRVFGNLPYNISTPLMFHLFEFHRDIQDMHFMLQKEVVNRLAAGPGTKAYGRLTVMAQYYCKVVPVLEVPPSAFVPPPKVDSAVVRLVPYEDLPHPATSLEWLDRVVREGFNQRRKTVRNCYKGLAEPETLETLGINPGMRPENLTLAQFVALANWLDATHKTHA >NZ_CP028892.1|WP_000383338.1|2589559_2589940_+|Co2+/Mg2+-efflux-protein-ApaG MDVSLPCIKIQVQTRYIEEQSNPEYQRFVFAYLITIKNLSSQTVQLMSRRWLITDADGKQTVVEGDGVVGEQPRIKANDEYTYSSGTALDTPVGVMQGQYLMIDEQGESFTVEIEPFRLAVPHVLN >NZ_CP028892.1|WP_057550627.1|2589950_2590760_+|symmetrical-bis(5'-nucleosyl)-tetraphosphatase MANYIVGDIQGCFDELQQLLKQAEFNSQLDTLWFAGDLVARGPKSLETLRFAYQLGDAARVVLGNHDLHLLSVALGHHSAKRRDQTQAILDAPDAAPLLDWLRQQPLLAEHQEFVLCHAGISPQWDLATARQAAQEVESVLRSPEWSTLIEQMYSDQPDAWHPTLQGIDRLRYIVNVFTRMRFCFPDGRLDMQCKLPPKEVTDGSLLPWFQLPQRIALEKTVIFGHWAALEGYVSETVIGLDTGCVWGGTLTMLRWEDKHYFSQAALPA >NZ_CP028892.1|WP_053034672.1|2590909_2591407_-|type-3-dihydrofolate-reductase MMISMIAAMADQRIIGKDNQMPWHLPADFAWFKRCTLGKPVVMGRKTYQSIGRPLPGRHNIVISRDASLQIEGVDVVTSIEAALAKAGEVDEVMIIGGGSLYAACLPMAHKLYITEIHAKLEGDTQFPEWGSDWLERSREHYPADEKNAYGMDFVIFERQYPLPT >NZ_CP028892.1|WP_162298896.1|2591419_2591893_-|hypothetical-protein MLSWDLFFGLLNDMLFAAIPAVGFALVFNVPVPALKYCALGGALGHGSRYLMMHFGVPIEWASFFAATLVGMVGVYWSRRFLAHPKVFTVAALIPMVPGVFAFKAMIALVEINHVGFSPELMEALLENFLKAMFIISGLAVGLAVPGLLFYRRKPII |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP028892_1 | 1.1|2579810|39|NZ_CP028892|PILER-CR | 2579810-2579848 | 39 | NC_021742 | Serratia liquefaciens ATCC 27592 plasmid unnamed, complete sequence | 35428-35466 | 4 | 0.897 |
NZ_CP028892_1 | 1.1|2579810|39|NZ_CP028892|PILER-CR | 2579810-2579848 | 39 | NC_049942 | Escherichia phage JLK-2012, complete sequence | 23522-23560 | 5 | 0.872 |
1. spacer 1.1|2579810|39|NZ_CP028892|PILER-CR matches to NC_021742 (Serratia liquefaciens ATCC 27592 plasmid unnamed, complete sequence) position: , mismatch: 4, identity: 0.897
agtaggtcgccagttcgattccggcagccggcaccactt CRISPR spacer agtaggtcaccagttcgattccggtagccggcaccaatc Protospacer ********.***************.*********** *.
2. spacer 1.1|2579810|39|NZ_CP028892|PILER-CR matches to NC_049942 (Escherichia phage JLK-2012, complete sequence) position: , mismatch: 5, identity: 0.872
agtaggtcgccagttcgattccggcagccggcaccactt CRISPR spacer atcaggtcgccagttcgattccggtagccggcaccatat Protospacer * .*********************.***********. *
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
652419 : 658912
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP028892|652419:658912|DBSCAN-SWA CATGACCACAAATCAACGAAGCATCAAAGCTCCACAAACGGTTGTTGTGAAGCTGGGAACCAGTGTTCTGACCGGCGGTACTCTTGTGCTGGATCGTGCTCATATGGTTGAGCTCGCTCGCCAGTGTGCTGAACTCAAAAAGCAAGGTCACTCAGTAGTGATCGTGACCTCGGGCGCCATTGCGGCTGGCCGTGAGCATTTAGGTTACCCTGCGCTGCCCAACGCTATTTCCAGCAAGCAACTGCTGGCAGCGGTTGGGCAAAGTCGTTTGATTCAGGTCTGGGAATCGTTATTTGCGATTTATGGCATCAAAATTGGTCAGATGCTGCTCACCCGTGCTGATCTTGATGACCGTGAGCGTTTCCTCAATGCGCGTGACACCATCAATGCGTTAGTTGAAAACGGCATTGTGCCTGTGGTCAACGAAAACGATGCGGTAGCAACCAATGAAATCAAAGTCGGTGACAATGATAACTTATCGGCGCTGGTCGGTATTTTGTGTGGCGCGGATAAGCTGCTGCTGTTGACCGATCAAAAAGGCTTGTTCACCGCCGATCCGCGTAAAGATCCCAATGCCCAACTGATCAAAGAAGTGACCACCATTGATGACACACTGCGCAAAATTGCTGGTGGCAGTGGCACTACACTCGGTACGGGCGGCATGGCCACCAAGCTGCAAGCGGCGGACATTGCTCGTCGTGCTGGCATTGAAGTCATCATTGCGGCAGGTCGTGCGCCCAATGTGATTTTTGATTCCTTGAGTGATGAGCCACAAGGTACGCGCTTCTTAGCTTTGGAAGAAGCGTTAGAAAACCGCAAACGCTGGATTTTGGCAGGCCCAGCGGCATCGGGTGATATCGTGATTGACGATGGCGCAGTGAAAGCGGTGCAAACCAAGGGCAGCAGCTTGCTCGCCAAAGGTGTGGTGCGAGTGCAAGGTCAGTTCGCCCGTGGTGAAGTGGTGCGGGTACTGGACAAGCAAAGTCATCTGATTGCGCGCGGCATCGCGGCCTACTCCAATGACGAGTTGGCGCAAATTGCTGGTAAGCACAGTAAAGAGATCATCGATATTTTAGGCTACGATCACGGCTCAGAAGTGATCCACCGCGACGACATGGTCGTGATTCAAGAATAAAGGAGAGAATGATGGATTTAACCGTACTCGGTAAAGCGGCCAAAGCGGCCTCTTTCCAACTGGCTACCGCCAGTACCGCGCAAAAAAATCAGGCTCTCGCCATTATGGCCGATCAGTTGGAAGCGCAAAGCGCGAGCATTCTTGCTGCAAACGCAAAAGATATTGCGTTGGGGCGCGAAGCGGGTTTGTCTGATGCCATGCTGGATCGTTTGCTGCTCAATGAGTCTCGCTTGCAAGCGATCGCCAACGATGTGCGTAACGTGATCAAATTGAACGATCCCGTCGGCAGCGAAATTGATAGCCGAGTGTTGGAAAACGGCATGTCATTGGCACGCCGTCGCGTACCCCTTGGTGTGGTCGGCGTCATTTATGAAGCTCGCCCTAATGTCACCATTGATATTGCGGCGCTGTGTTTAAAAACCGGTAACGCCGCGATTTTGCGAGGAGGCAAAGAGACGTTTTTCTCTAACATGGAACTGGTCAAAGTGATCCAGTCGGCGTTGGATAAAGCAGGTCTACCAGCGGCTTCCGTGCAGTACATTGAAAAGCCGGATCGTGAACTGGTGACGCAACTCCTGAAAATGGATGATTACGTCGACATGATCATCCCACGTGGCGGTGCGGGCCTGCACAAAATGTGCAAAGAAAACAGCACTGTGCCTGTGATCATTGGCGGCTTTGGCATTAGCCATATTTTCGTGGATGAAAGTGCCGATTTGGATAAATCGGTTGCCGTGATTGAAAACGCTAAAGTGCAGCGCCCATCGGCGTGTAATGCGCTAGATACTTTGTTAGTCCATCAAGCGATTGCCAAACCGTTACTAGACAAACTCATCGCAAAACTGAATGGCAAAGTGGCGTTTGTGGCGGAGCCCAAAGCCAAAGCCTTGATGAATGCCGCAGCTGAGCTGCGTGATGCGCAAGCGGGCGATTTTGATACCGAATGGCTCAGCTACACCTTAGGGGTGAAAGTTGTGCAGGATGTACAAGAAGCCATTGAGCATATGCGTGAGCACAATGCGAGCCATTCCGATGCGATCATGACCAACGATCTCTATAACGCTGAGCTGTTTGTGAATACGGCTGGTTCAGCGGCGGTGTATGTCAACGCTTCTACGCGTTTTACTGACGGTGCACAGTTTGGTTTAGGCGCAGAAGTGGCAGTCTCAACACAAAAACTGCACGCGCGTGGCCCGATGGGGCTAGAAGAGCTCACCAGCTACAAGTGGGTGGGCAAAGCCAATTACCTGTCACGCAGCTAAAAATCAAGCAAAGATGTGATGTAAAGGGGCTTCGCGCCCCTTTTTCTTTGGCTCTCAACAGCACATCGGCTACACTGGTTGCCTTGTTATGGAGGTAATATGCATTGTCCTTTCTGTTCTGAAAACGATACCAAAGTGATCGATTCACGTCTGGTGGCTGACGGACATCAAGTGCGCCGTCGTCGTCAGTGTCTGGCTTGTAATGAGCGCTTTACCACCTTTGAAACCGCGGAGTTAGTGATGCCGCGCGTGATCAAATCGAATGGTAATCGTGAGCCTTTTGATGAAGAGAAAATGATTGGTGGCTTGCAGCGCGCATTGGAAAAGCGCCCTGTGAGTGCCGATGCGATTGAACTGGCGATCAGCACCATAAAATCTAAGCTACGTGCCACTGGCGAGCGTGAAGTTCCGAGTAAACTGATTGGTAACTTGGTGATGGAACAGCTGAAAGTGCTGGATAAAGTGGCCTATATTCGCTTTGCATCCGTATATCGTAGTTTTGAAGATGTCCGTGAATTTGGCGAGGAGATCGCCAAACTGCAGGACTAACCGATAGGATTAACTGAAAGCGATTATGCCTATGTTTACCTCTTTTGATCATCAAATGATGTCTCGCGCGATTGAACTTGCGTGGCGCGGGCGTTTTACCACTTCTCCTAATCCTAATGTCGGCTGCGTGATCACTCGCGGTGAGCAGATTGTAGGGGAAGGTTTCCATTTTCGCGCGGGCGAACCCCATGCTGAAGTGCATGCGATGCGCCAAGCCGGTGAGCTTACCCGCGGTGCGACCGCTTATGTCACTTTAGAACCTTGCTCTCATTATGGTCGCACACCGCCTTGTGCTGAAGGGCTTATTAAAGCCGGGGTTGCGAAAGTGATTTGCGCAATGCAAGACCCTAACCCACAAGTGGCGGGGAAGGGCGTACAAATGCTGCGTGATGCCGGGATTGAGGTTGAAGTGGGATTGCTGGAAGCAGATGCTCGCGCACTCAACCGCGGTTTTCTAAAGCGTATGGAAACTGGCATGCCTTATGTGCAGCTCAAAATGGCGGCGAGTCTTGATGGACAAACCGCGCTCGCTAACGGTAAAAGCCAGTGGATTACCTCTCCTGCAGCACGTAAAGATGTTCAGCGGTTTCGCGCGCAAGCCAGTGCGATTCTCTCCACCAGTCAAACGGTGCTGGCAGATAACGCTTCTCTAGCCGTGCGTTGGCACGATTTACCGTCATCGGTACAGGCGCAATACGCAGAAGCGGATTTGCGTCAGCCACTGCGGGTGATTTTGGATCGTCAACATCAATTGCATCCAGAGTTAGCGTTATACCAAACACCTTCACCCGTGCTGCGAGTTGCCAGCGAGAATGCCGAACTCTGTATCTCTGCTGAGCACGGAAAGCTTGATTTGCGTGAGCTTTTGGCACAACTCGCTCAGCAACACAATGTGAATCAACTCTGGGTAGAGGCGGGGAGCCAATTAGCGAAATCCTTGATTGAGCAAAAGTTGGTGGATGAAATCATTCTCTATTTAGCGCCGAAACTGATGGGCAGTGATGGCCGTGGTTTATTTGGCGCGCTCGGTTTGACGGAAATGGCTGAGGCCATTGAACTCAAAATAGAAGATTGTCGAATGGTGGGGGCAGATTTGCGGATCATCGCCACCCCAAAAACAAAAGATTAGCATTATGTTTACAGGTATTGTTGAAGCGGTTGGCAAGCTGACCGCTATTATCCCCAAAGGCAGCGACGTCACCATCAGTGTGGATGTGGGTAAGCTGGACATGGGCGATGTTAAACTCGGTGACAGTATTGCCACCAATGGCGTCTGTTTAACAGTTGTGGCGTTTGATCAGCGTAGTTTTAGCGCGGATCTGTCGATGGAAACCCTCAAAAAATCGGGCTTTGCACAGTATCAGGTTGGCGATCGCGTTAACCTTGAAAAAGCCATGCTGCCCACGACTCGTTTTGGTGGTCATATCGTCTCAGGACATGTGGATGGCGTAGGTGAAATCGTCGAGCGTATTCCTGTGGGGCGTGCGGTGGATCTGTGGGTCAATATGCCGGCAGAAATCAGCAAATACGTGGCAGAAAAAGGCTCGATTACCGTTGATGGCATCAGTTTAACGGTCAATGATCTGCGTAAAAACGCGTTTAAACTGACCATAGTGCCCCACACCAGCGCAGAAACCACTATCGATGAATTCCAAGTGGGACGCCGAGTCAATCTGGAAGTGGATGTATTGGCACGTTACATGGAGCGTTTACTGCAAGGTCAACAAGAGAGTGAACCTCAATCGCGTCTGACCATGGCATTTTTGCAACAAAACGGTTTTGCTTAACCTCTGCTCTCAGAGGCTTCATAACAATAGGACTGACATCATGCCAATTAGCACCCCACAAGAGATCATTGAAGACATTCGTCAAGGCAAAATGGTCATCCTGATGGATGATGAAGATCGTGAAAATGAAGGCGATCTGATCATGGCGGCCGAGCACATTACGCCAGCAGCGATCAATTTTATGGCAACCCATGGTCGCGGCCTCATCTGTTTAACGCTCACGAAAGAGCGTTGTCGCCGTCTGGGTTTAAATCCCATGGTGCAAGATAATAATGCCCAGTACACCACCAATTTCACCGTTTCGATTGAAGCGGCCGAAGGCGTGACCACCGGTATTTCAGCGGCGGATCGCGCGCGCACAGTACAAGCGGCAGTGGCCAAAGAAGCGAAAGCGGCTGATTTGGTTCAGCCGGGGCATATTTTCCCGCTGGCTGCACAAGATGGCGGGGTATTAACGCGCGCTGGGCACACAGAAGCTGGGTGTGATTTGGCACGCTTGGCAGGTTTAGAGCCCGCGTCTGTGATTGTGGAAATTCTTAATGATGACGGCACCATGGCGCGTCGCCCGGATTTAGAAGTGTTCGCTGAAAAACACGGTTTAAAACTGGGTACGATTGCAGACTTGATTGAGTACCGCAACCACACCGAAACCACTATTGAGCGTGTCGCACAGTGTAAACTGCCAACCGAATATGGTGAGTTTGAGCTGGTCACTTACCGTGACATCATCGATAAGCAAATCCATTTTGCTCTACGTAAAGGCGATATTACTAGCGCGCCAACTCTCGTGCGAGTGCATCTGCAAGACACTTTTACCGATTTGCTACACAGTGACCGTACCGCCGAGCGTAGTTGGACGCTGACTACCGCCATGCAGCGTATTGGTCAAGAAGGTGGGGTGTTGGTAATTCTTGGTAATGAAGAGTCAAGCGATCTACTTCTGCACCGCGTGAAAATGTTTGAGCTGCAAGATAAAGGTGAAGCACCGGCAATGGCGAAAAAGCAAGGCACTTCGCGCCGTGTGGGTGTGGGTTCACAAATCCTTGCCGATCTAGGGGTTAAGGAGATGCGTCTGCTCTCTTCGCCGAACAAAAAATACCACGCATTGGGCGGTTTTGGTCTCAATGTAGTCGAATATGTCTGTGAATAATCACAGCAAATTGCCGGTTTGCCGCGTGTTGCCTTGAAGGTGGGGCGCGGCTTTTAGCCCAAATTTCTGTTAGATATTGCTCACAAAATTGTGCTAGAATCCGGCGATTCTCACTTGATGAACAGAGTTAAAGGAAAGCTATGAAAGTGATCGAGGGTGGTTTCCCAGCACCAAATGCGAAAATTGCGATTGTGATTTCTCGTTTCAACAGTTTTATCAATGAAAGTTTGCTGTCTGGTGCCATCGATACTCTTAAGCGTCACGGTCAGATCAGTGATGACAACATTACCGTTGTGCGTTGTCCCGGTGCGGTTGAGCTGCCACTGGTAGCGCAACGTGTTGCAAAAACGGGCGACTACGATGCGATTGTCTCTCTGGGTTGTGTGATCCGTGGCGGTACACCGCACTTTGACTACGTTTGCAGTGAAATGAATAAAGGTCTGGCACAAGTGTCTCTGGAATTTAGCATTCCAGTCGCATTCGGTGTGTTGACTGTTGATACTATCGATCAAGCTATTGAACGCGCAGGAACCAAGGCTGGTAATAAGGGTGCTGAAGCAGCACTGAGCGCACTTGAGATGATTAATGTTCTTTCTGAAATCGATTCCTAA
Protein sequences of DBSCAN-SWA_1 >NZ_CP028892|652419:658912|655389_656493_+|WP_001131992.1|DBSCAN-SWA MPMFTSFDHQMMSRAIELAWRGRFTTSPNPNVGCVITRGEQIVGEGFHFRAGEPHAEVHAMRQAGELTRGATAYVTLEPCSHYGRTPPCAEGLIKAGVAKVICAMQDPNPQVAGKGVQMLRDAGIEVEVGLLEADARALNRGFLKRMETGMPYVQLKMAASLDGQTALANGKSQWITSPAARKDVQRFRAQASAILSTSQTVLADNASLAVRWHDLPSSVQAQYAEADLRQPLRVILDRQHQLHPELALYQTPSPVLRVASENAELCISAEHGKLDLRELLAQLAQQHNVNQLWVEAGSQLAKSLIEQKLVDEIILYLAPKLMGSDGRGLFGALGLTEMAEAIELKIEDCRMVGADLRIIATPKTKD >NZ_CP028892|652419:658912|658441_658912_+|WP_000864130.1|DBSCAN-SWA MKVIEGGFPAPNAKIAIVISRFNSFINESLLSGAIDTLKRHGQISDDNITVVRCPGAVELPLVAQRVAKTGDYDAIVSLGCVIRGGTPHFDYVCSEMNKGLAQVSLEFSIPVAFGVLTVDTIDQAIERAGTKAGNKGAEAALSALEMINVLSEIDS >NZ_CP028892|652419:658912|654914_655364_+|WP_000543544.1|DBSCAN-SWA MHCPFCSENDTKVIDSRLVADGHQVRRRRQCLACNERFTTFETAELVMPRVIKSNGNREPFDEEKMIGGLQRALEKRPVSADAIELAISTIKSKLRATGEREVPSKLIGNLVMEQLKVLDKVAYIRFASVYRSFEDVREFGEEIAKLQD >NZ_CP028892|652419:658912|652419_653553_+|WP_108243719.1|DBSCAN-SWA MTTNQRSIKAPQTVVVKLGTSVLTGGTLVLDRAHMVELARQCAELKKQGHSVVIVTSGAIAAGREHLGYPALPNAISSKQLLAAVGQSRLIQVWESLFAIYGIKIGQMLLTRADLDDRERFLNARDTINALVENGIVPVVNENDAVATNEIKVGDNDNLSALVGILCGADKLLLLTDQKGLFTADPRKDPNAQLIKEVTTIDDTLRKIAGGSGTTLGTGGMATKLQAADIARRAGIEVIIAAGRAPNVIFDSLSDEPQGTRFLALEEALENRKRWILAGPAASGDIVIDDGAVKAVQTKGSSLLAKGVVRVQGQFARGEVVRVLDKQSHLIARGIAAYSNDELAQIAGKHSKEIIDILGYDHGSEVIHRDDMVVIQE >NZ_CP028892|652419:658912|657191_658301_+|WP_094392061.1|DBSCAN-SWA MPISTPQEIIEDIRQGKMVILMDDEDRENEGDLIMAAEHITPAAINFMATHGRGLICLTLTKERCRRLGLNPMVQDNNAQYTTNFTVSIEAAEGVTTGISAADRARTVQAAVAKEAKAADLVQPGHIFPLAAQDGGVLTRAGHTEAGCDLARLAGLEPASVIVEILNDDGTMARRPDLEVFAEKHGLKLGTIADLIEYRNHTETTIERVAQCKLPTEYGEFELVTYRDIIDKQIHFALRKGDITSAPTLVRVHLQDTFTDLLHSDRTAERSWTLTTAMQRIGQEGGVLVILGNEESSDLLLHRVKMFELQDKGEAPAMAKKQGTSRRVGVGSQILADLGVKEMRLLSSPNKKYHALGGFGLNVVEYVCE >NZ_CP028892|652419:658912|653564_654815_+|WP_033932838.1|DBSCAN-SWA MDLTVLGKAAKAASFQLATASTAQKNQALAIMADQLEAQSASILAANAKDIALGREAGLSDAMLDRLLLNESRLQAIANDVRNVIKLNDPVGSEIDSRVLENGMSLARRRVPLGVVGVIYEARPNVTIDIAALCLKTGNAAILRGGKETFFSNMELVKVIQSALDKAGLPAASVQYIEKPDRELVTQLLKMDDYVDMIIPRGGAGLHKMCKENSTVPVIIGGFGISHIFVDESADLDKSVAVIENAKVQRPSACNALDTLLVHQAIAKPLLDKLIAKLNGKVAFVAEPKAKALMNAAAELRDAQAGDFDTEWLSYTLGVKVVQDVQEAIEHMREHNASHSDAIMTNDLYNAELFVNTAGSAAVYVNASTRFTDGAQFGLGAEVAVSTQKLHARGPMGLEELTSYKWVGKANYLSRS >NZ_CP028892|652419:658912|656497_657151_+|WP_000493874.1|DBSCAN-SWA MFTGIVEAVGKLTAIIPKGSDVTISVDVGKLDMGDVKLGDSIATNGVCLTVVAFDQRSFSADLSMETLKKSGFAQYQVGDRVNLEKAMLPTTRFGGHIVSGHVDGVGEIVERIPVGRAVDLWVNMPAEISKYVAEKGSITVDGISLTVNDLRKNAFKLTIVPHTSAETTIDEFQVGRRVNLEVDVLARYMERLLQGQQESEPQSRLTMAFLQQNGFA |
7 | Staphylococcus_phage(66.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1511262 : 1535307
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP028892|1511262:1535307|DBSCAN-SWA CCTAGTTATCTACAGATTCTGTGGATATCTTTTTCTGCAAATTTGATTTTAGTGCTCGAACACCAAGGGCAAGAAAAGACGATAACAAGTCATAAACAAAGAGCGCGGCAAGTACAGCCGTGAAGTTGTAGAAATAAGATGCTTCGATAAGCGTAGCCAGTTGATTGGCCGTGATAACGATTTGTTCCATGAGTTCACCTCTAAAATAGCGCGCCTAACTTGACTTGCTGCGGCTGTCCTTGCAGTTCAGCCACTGGCTCGCGAAGCATCGGCTTGCAGGTAACGTTAATAGTGATGTTTTCCTTGGTGAGCTTGAGTAAGCAGTCATCGTAATGCACATAAGCAATGTCGTTAGCCCTCAAAAATGAGTCATCAAGGTAATAAGTGCCTTCCGGTGTTTTGGCCTCAAGAGTGACAAAGAACTGAAAGCCTTTATCCGATTGACGCGTGGTGTGTCCGGTGTAATAGAGATTCTGCAAGTCATAAAGGCCGAGCATCTGCTTTATGGTGTCAATCCGATGAGATGGAGCAGGGGAAAGAGCGCTAGCTTGACCGCCGTTCCCACCAGAAGGTAAAGCAGGAGCATTTTGCCCACTCGTTTGATGGCCGACAGTGGACGGGTCAGAAGTGGGGGAAGTGTTAGACGTTTGCGCGGCCGTGTTCGCCACCGACGAAGAAGAACCAAAAACCAAACCGGATAGACCATAAAGAAAGTACCCCATGCCAAGTATGCCAATAAGAAGAACGACCATGATTTTAGGATTACTCAAAATCATGTTTACCCCTTTGCCGTTCTGCGCGTTCCCCGTCGATGTGGACTTGTAAAGCAAGAACGCATCAAGCGGAATTTTCTTCGTCGTGACGTTTGGGTCTTTACCCTTGGGAATAACGGGCGTGCTCGTATTCTTGGCGTGTTTATAGATGTAAGGTTTACGCACCGCCCAAAAGTAAGCGTCACGGCCTTTGTGGAAATAACACTCTTCCGCACAAGCGCGAATAGCCGAATCAATTTGCCCCCAATCAGGCGAGAGCAAGTGAATATCCCAGTTGTATTTACGATGACGCATAAAGCCCTCATTGAACGAGAGCGGGTAAATGATGCGACCCTCAGAATCGTATTCGGCCACGCCTCTATCATCTGACTCACAAGCTTGAAGCTGTGACATATCGGCCGGAACGTAACGGGAATTAAAGAAACTCTCATAGTCTGGCGGGAGCTTAGGAAGGAACTCAGCCAAAGGACGATAAAAGACTTTCTCAAATCGAAAGCCAATGTTCTTAGAAAAAATATCTTGGCACTCATCAATCACAATGAGCGCACCAATCGGACACCAGCAAAAGAAGTGTTGCCAAAGCTCTATCCCGTTCTTGTCTCGGCTGAAAATACGGATAAGGCGAGCCGTGCTAGGGAACTGCATATCAAAGCGGCGCTCGATTTCATCGAGAGGTTGCATACCTTCCAAATTGGTCACCACCACACGGCCAGCCTTGAGCGCTTCATAAATGACAAAGTAGGCCACATAAGCAGATTTATAGGAGCCGTTCGCGCCCGTTCTAATGAATATCGCCATGATTAAAACCTTGTGATTTTCCAAACGAAAGCCGTAGCCATACAGTTGAAGTAAATACCGATGGCTTGAGGGATTTTGAATAAGAAGGCGTAATAACGCAGTTCATCGGGCAAGGCATTAAAGAAGCTCGCGAGCATATCGTTAAAGCCAATATCATTAAGCAGGTATTCCGCGGTTTTGTAGGCCAGCTCTAAAGAGAGAATGAGCCAAGTAAGCTTTAATTTGACATACCAAGCATTGCACCAAATGACAAACTGCTCGAAGTAATCGGGGATGGACTTGAAGAACTCCACGACCGTATCACCGGCATTTCCAATCGCACCTAATAAATCGAGTAAAAATTGCATTATTTATCCCCTCCACCCATGACAATGCGAAGCCCAGCCAAGGCCGCAAGAAACAGAATGACCGACGAGATTAAACCAGCGTTCGCCACCAAAGCAGGGAATACACTCGATTTGATAGAAGTCTCTTGGCCGTTGGCGAACCGGAAGGAGAGAGAGTGCTCTTTGTATTCACCCGTGTTGAGTTTGGAGACATCAAAGGAAAAGAGCTTTTGAAAGTCCTTTATCTTCTCGGAATATTCTTTCTGTAACTCGGTGATTTCCGTGTTTAAGGCCGCGAGAGAATCAGAGCCATAGAGAGGCGTCTCACCGAAGTTAACGCCAGAGCTAATACCTGGTTTAGTTAACCCGTTACCACCAAGCAGGCCGTTTAAGTTATCAATGCCCGATTTGATGGAATCGATACCGGACTGAACGCCGGATAAATCACCGTTACCAGAGCCGCCACCGTTGGCGTTAATAGCAGAGACAATCTTGTTCGTGTTGTCTTCCATTGTAAAAAACAAGCCCTCACTTAAACCTGCGAGGTTTTCGTTAACCAGTTGAAGCTTTTCATTTACGGCATTAATTCCGAACTCAACAGGACGAACCGCTTCACGAACATCACGAACAGCGCTGACGATATTATTAGAGTTGGCGCTAATAACGCTACGAGTCGTAGACATGGAACCCATAAAGGAATTGAGTTGTGAATCAGGAAGGCCAGAGCCGGAACCGCCACCTGAACCCGAATCAGTTAATTTATTGAGTATGTTGCCAAGGGTATTGGCAGACATTTGAGAAAATGCGGCAGTTTGTGAAGTGTTCTTTTCGATATAGTCCGATAAGTTGCGGATAGAATAAAGCTGTCCTGACATGGTTGACATTAAACTTGAATAACCTTTATTTATATCAAGGAGCTGAGAATTTTTAATTAGAAGCTCATCAAGCGTCATAACGCCCATGCCCTCAATATGCGCAATATCTTTAAAAGCCTGAGCTTGTTGGATACCACTAGTGGTATCTCTAGGCATACCACGAATAACATTCATGACTTTTGCAGGCATAGAAGAAGGAGCATTCGGATCCCAAGGCTTGTCTGGGTCAGCGCTTGGGTCACGGATAACGGTATTACCACCAAAGCGCAAGCCATCGCTTGGATCAATTTGGCCGTTAGCCAACAAACAATGTTTGCCATTAGAAACAAATTCACCATGACAAGAACCATTATTCACAAAGCAGACAGCGACTGTGGCCAAGTTATAACGACAAGTGCGAACGCAGGTATAAGGCTTATCGCCAAGGATTTCACCAGACCAGCTCACCGAACCGGAAGAGATACCGATTTGGCATTCAATCTCAGCGCTGGCCTTGGACGGAAGAAGTAACAGAAAAAGGGTAAGGAACAGTAATAGTGCAAGATTGGTTAGTGTAAGAGTGCGCAGCATAGACCCCCCAAAGGAAAACGCCCCCGTTAGGAGGCGTTCACACCCGTATAAACTCCGTATATAAACGCTCCAGCCATGCCAAGGCCAAAGAGCACAGACAGGGTGGAGGTCAAGAGTTCAGCCATAGGGTAATTACTTCATTGCGCCAACAATCATGCGCAGACCAAAGCCTAGCGCGGCCATAGCAATCAGACCAACGACAACGAGAGTGTAGTTACCTTGACCAGACGACACACCCGCTTTGATTGCGTCCGAAATAGGATCTTCTGCAAAAGCGAAAGACGCAGGAAGTACGGTCGCTGCTACAACACCAAATTTTTTAGCCATGTTACGAAATTTCATAGGATTATCTCCAACTGATTTAAGGGTTTAGGGCTATGCGCGCCCCATTGTTTTTAAGATACGACCCAAGACATGACCAGACAGAAACGACAACAGGAGATAGCCGCTTACGTGGTAGTAAATGTCAGGGTCAATGGTTACCGAGCCAAGCGAGGTATTGCGTATTTCGTCCAGCTCGGAAGGAGTGAGAACCACGTAAGTGCAGTCAAAGCCTTGAGGCGCAAGCATCAAATAACCGTTGTATGCAATTACGCAATTACTCATGTTCTTGACCTACTTCTTCAATGTTTCAGTAAAGTATTTTTTCACGTCTTCATCCTTGGGGATGAGCTCAACCGCAACGACTTCCAATGGGTCGTCAGGGTTACTACCAAAGCGAATGTCATATTCACGGTTAGGAACAAAAGCGCGTGTTTCAATCAAACGCTTGGCGTAAGTAGGTTCGATACGCAGCGGCTGCTTGTTGAAAGGAATATCCGTGTTTAGCCCTAAACCGTATTGAGCAAACTTCTCAACGTTGACGGTTTCAACAGGGCGCAGAACGTTCAGCTCTGCAATGGTGGTTCCCGACTTGGGAAATGTTTTGATGACGATGCCAGTGATGTTAGCCATTACCTTAACTCCATAGTGTGTTTTTCAGTTGTGTGTATGAATCAGGAACGCCGAGCAATTCAAAGTCAGGGCGTCTATGTTTGTGAGGGATAAGCATCCCGAATGCTTCGCCTAAGTCGCCTTGCGTCATGGCGATAACTTCCGCTAACGCCACACCACATTGACGGCGAACCCATGCGATACGAGCCATAAATTCAAGACCTTGAGCCTTTTTGTTGCGAGAGAACTTAACCGGAGGCGTACACTCGATAGAGGCCGCGAAAGGGCAGATACCCGCAAAAGAGGCGGCAGGGTTGGCTAAAAGCTCTATATCGCACTTTTTCAGCTCAACCTCGTTTCGATACCAAATCAGGTCAGGGTCAGTAATTTTTTGCTCAAGCTTTTTGTTGTAGATACGCCAGTAAATCGCCGAGGAACGAGAGCCGACAATCGTAGCTTCTTCCATCAAAGCGCCGTTTTCTGTAATGCGTTTATGAGGAACCATTGAGGGACCTTGACCCCTTGGAGCAGTGCGAAATGCTCCCTCATAAAAACATTTCTCTGCATACTTGGCGTCGAAGTTTCCGGTGTAATCGTCCACGGCCAAGTCGAGACGAACTAAGCGAGTAATGCCAAGAACCTGAGCAAGCCACCAATGAAGCTTCTTAGAGTCGATACGGTCGAAAAGTTTGGTGCAACCCGTGCCGTTGATTTGGACAAAAACGGTATCGTTGTTTCCGCCAATTCCGACAAGGCCGCACTCAACTTGTCCGGTCATATCGAGAATGACCATAGAATCGTTGTAACCATGAAGGCCACGACCACGCATAGGCGATAAGCGAAAGCCCATGATTTTGGACATGAACAAATCGAAGCGATGAAAGAGCATCTTTGACACTTTGTTTTTGTGCGCTTCCATATGGCGCTCGATTTGTTCCAAGGTACAGCACACCGCGCCTTGTTCCTTGGTTTTAGTTTTTGGCTCGTGGTACACGGGCATTTGTAAATTGATAAAGTCTTGGTCGTTGCTTTTATCCAAGTGACGCAAGTCCGCATAGGCGAAAGTAAAAGCCAAGTGGTCAACTTTGACAGGGCGAACCGTGTCATGGTGAGGGTGCTTACATGGCATGAAAGACCCCCTTTAACAGCAATTCGTTGTAGTTTTCGTTAGTGATTTCAACTAGTTGATATGGGTCAGAGCCATAGTGAACGGCAAGGTATTGCTCAAACTCAGGCCAGTTTTTAAAGAAACGATGCCCCCAAACGAAATACACGTTAATTCCGATGTTGGGTTCGTTGTCGTAGTAGATGAAATCACCCATGATGACCGCCTAAGCCAATTTTCCGTTGACGGCATTGATTAGGCGGCGAGTCATTTCGCAGTCAGCCAAAGCGCGATGAGCCGTTAAATCAGAAGTATCAACGTCTTGCTGATAGCAAGCGTTGGTTAAACGCTGAAAACGGGGAGTAACATCATTGCCGCACAAAGAGCCGTAGAAATTAGCGTACCAACGCATAACACAAACGCCACCGCCGAAAAAGTGAACAAGGTTAGCAACTGGATAGCCAAATGGGCGTAGAGACTGCGCAAACATGCGATAGTCAAAATCAAGGTTGTAAACGTAGATAACTGAGTCAGATAAATAACCTTTGATATCGTTCCAGACCTTATCAAAAGTAGGCATATCAATAACCATATCGTTAGTAATGCCGTGGATTGCAGTAACCTCAGGAGAGATTGAGCATAAGGGGTTAACAAGACTGCTATAGAGCTTGATGCCTGTTTGGGCATCGATGATTGACACCTCGACAATACGAGCGTCAGAGTCAAGACCTGTAGTCTCAGTGTCCAAAATGATTGCGTTGTCTAGATTGAGTGCTTGCATGATAACCGCCTTAACTGGTTGGGCGACCACCAAGGGGAAGAGTTAAGGTCTAGCGCCCAAGGTGGTCAATACGATAAATCTCGTATCAGTTAATACGACAAATCTCGTAGTGTAAATACGATGATTTCAGTACTAATCAGCTAGAATGGAAGAAATGAGGGAAACAAGGAAACCGCAGATGTACACAAATAATCTGATAGATGCGTACAAAAGCCACATGAATTTTGTCCAATATAAGCAAGTGGCTCACCAATTAGGTTTAAGTCCTCAAATGTTGGCAGACATTAGAAACGGACGAGCACATTTAAAAGAAAATCTGGCACTTATCATTGCTGACGAGATTGGCGAAGATAAAGAAAAGGTACTAATTGGACTTGCAGCCGACAGAGCAAAATCACCAGAAGAGCAAGCCATTTGGCAACAGATAGCAAAAAAGTATAAAGGGCTAGGTTTACAAGGATTATCAATGGCTTATGTGGGGATTGCACTTTACCACGCCCCTATTTCTCAGTGCGTATTAGGTATATTATGTTAAATTTAAGCGTGAAAGCAGGTCGCAAACCCCAATCCGCTCCCTATTCTTAATCCAGTATTAAACATAATTGCATTTACCATAAAATACCTAATAATCATATTTATCATTTGACAATGTTAAGTATGATTATTAGTGACATTGTCAATGTACTTGCGTCAATTGTGTGTGCCAAAGTGTTTGTGTAATCCCTTGATTTGAAAATCTGTTACCAACACATTTCAAATCTGTTTAGGTTTGCTTCGCTAAAAAAGAACTCCACACTAAGGCGGAGTTCTTGGATATTTTGTCAGCTTGTTTTTGCTGCTTAAATTTGGGCTACGCGATTACACCATCAATCCTTTTTTGTGAACCACGTCTGACCACGCCATGCTGATCGCAGCTTGAGATTTACTCTCTAGGCTGTCGATAAAGCCATTGTCACCCGCTTGCGGTTCAAAACCACTCATCGCTTGAACCAGAGCGTCAATCGCACTATCCGATAACATGGTGTATTCACGTTCACCCGACAGGTCTTTCTCACTCATCCCGATAACCACTTGAGCACGGTTACCATTAAAGTAATCACTGAATGTCACCGAACCAATATGCTCAAACTTCGATTGGTCACTGTCGTTAGACGGGTCACGTAAAATCGACAATTTCAGGTCATACCCGCTGCGCTCGAACCACAGTTTCTGCCAATCGATACCATTGAAGACAATGTTATCTTCGCTGCGAATGTCTTCCACAAGGTTGTCGATAACATCACCGCTCACAACAAAGCTGTCCACATCCGTACCACCTTTGAAAGTATTTTGATAACCACCAAGTACCATCAGATCACGGCCTTCACCACCGTTGAACTGGCTGAACTTAGAGATGGCCGCTGCAATCAGGTGATCATCTCCATCACCACCGTTTAGCACGGCATGGTAACCCATCAGTTTCACCACATCGTTACCCGCACCGGCATTGATACGGTTATAGTTACCAAAGACATTGGCAAAGTCATTACCTGATCCGAGTTCCACTTGGTTATTGTTGCCAATCGTAACGGAGTAGTCTTGATCATCTCCCGTATCCACACGGTTGAAGTTGCCAGAAGTCACCACATAGTCACGGCCTGACCCTGTATCAATTTCACCACCTTCGCCAAAGACAAACGACTGGTCGTCACCGGCACCAAGGAAGACATGGTTAATGCGTCCAGCAACCACTGCCGTGTCGTTGCCATCACCACCAAACATCATGTTTTCACGTCCCATCAGAACGCCCATGTCGTTGCCTTCTCCACCCGTGAAGATGTTTGACGTGCCTGTTGCGTAGAACACATCATCGCCTCGTCCGCCCCAGAAGTTGTTGTTATCTCCGAGATACGCACCGAGATCTTTACCATCACCACCCCAGTTGAAGTTGTAGTTTCCTAGGGAGATGTTGATATCACCATCTCCTTGGAGATGGCCACTATTACGAAGGAAATCAAACGTTTTCTCCTTCATGTTCAACAAATCAGCAGTGAGATTCTGTTTGAGATTCTCCACCAGTGACTTCATTTCCTTCTGCTTATCTTGGCTGAAAATAGTCGCAAACAAGTTAGGCAGGTTAAGCGAGTTAAAGCCAAAGGCACGATCTTGTGTTTCTGCTGTTTCAGTATTGCCATCAGCCATAGTCGCACCTTGCGCACTGTTCACGTTCGCACCATTAACAGAAACCGAAGAATTGTCCTTTTCCTCTTCAGGCGCTTTCTCTTTCAGCCCATGAGTTTCAGCAAAAGACTTAATGCCATCCGCTCCCATGTCGATACCCGCCTCTAAGCTATTTAGTAGCTTGGTAGGATCAACAAAGGCTTGCAGTTGGTCACCACTGAACTCACCAATGACCTCCAACATCTCTTTGAGAATCGCCACGCCATCAACGGCTTCACCATTACGCGAAACAATGTGACCTGACGCGGTGTAATCCACACCAAAAATATCCGCCAGCGTGGTCTCTGCACCCACGCCAGCTAACTGGCCTAATAGCTTATTCTTAAGCTGACGAGGCAAATCTTCTGGCGTGTAGGTGAAGGTTGTTTTGGCCTGACCCGTGGCAGAGAACTGCTGTGTCATGAGGCCAAACAATGAACCTAGGTTGGTATCCAAAATAGACTGGATGTTGTCACCAAACATGAAGTTCCAGTTACCTGTCGTCACTTGGATATCCGCACCTTGACCACCCACGGCAAAGTTAAACGCAAGCTTTTCATTGGCTTGACGCAGCTTATCTGCACGGCTGAGCTGACTCGTGCCACTATTACCGTTACCGCTTAGCCACTGATTGTATTGCTTATTCAGTGTCGCTTCTGCATCGTGCTTCAGGCCACGGCTATCACGTTCATTTTGTGAATCCAATTCAACTAATGTGGTGTAATCCACACTGCTACTTTGATCCAAACCAGACATATCTTTGACGAACTTCTTCGCGCCAGACAGTGTCCATTGCTGCTCTTGAGCGGCAAGCCAATCTTCTCCGTCTCCAGACATAGCAATACCTTGCAGCACACCAGAAATACGTGCGGCACCATCGAATGGATTGACGAGTGGTGGAGTTGGGATCGACTTGTCCATCATCAAAATCAGATCATTACTGTGTCCGAAATTAAAGCTCACATTACGGTTACCGAGGAACATTTGCGCTCCTTCCAGTGCTTGATAGCCACCGATATCGACACTGTGTTTGCTTTCACCGTCACCGATATGAACCATAACGTTATTGTCACCAAAGGCCAGTGATTTAAAACCACCAGTACCGACTTTAATGCCCACATTCGATGTGCCCCAGTTCACCGCCGTAAACTCCCCTTCGCCCACGTTAACTTGAATGTTACCTGAGTAGCTGAGTTGATTACTGCTGCTAGAATTGGCAATCACTTCCGTTTCTGGTATGCGTTTTTCTGGCGCATCAAACACGTCACTATTGTCACCGATCGCGCCACGAGCAGGTTCATCGACATTGTTGACACCAATACGAGAGAGATCGATGTCGCCTTCCGCAATACCATTGCGAATACGTTCATCCTTGGCAACAACTTCACCTTGCGCGTCCCAGCTTAGCGAAACTTTGTTGTTTTCTGCCTTTTGAACCCAATCGCCATTCGCGTCCTTGGTATGCTTACGTCCCGCCTCGTCTACGGCCAGTTCAGAACTACGAACAGAGACATCGACACGAAGACCATTCGCATCCATCGCGTTAATAAACTGATGACCAAAGCCTTTTTGCTTGTCGTCACTCACCAAAGAACAACCAACAATACTGATGTGATACGGTTTGTTGTTGATGTTTTCGGCTTGATTAAACGACTGTTGGAACTTGGCCAATTTCACGGCCAACTCATCGGCACTGTAACCACTTAAGCGAGTATTGTTACTTTCTGAGTGGTCGCGACCATGCCCCACCAACTGCCAACGTAGCTTTCCATCCAGTTTTGACGGATCGCCATACACCACGCGATAGTTGCCGTCTGAATCGAGCTGCACCACCACACTGCTTTCTGGATGCTTACCCGCAAGGTTCGCAGCCGCTTTTGCAACAACATCATCGTTCTCCATTTGCACGATAATTTGACCGTCGAAGCGGGTTTCACCACCATCTGTCGTTGGTGTAACCGTAATCGGCCCCCAGCTATTAACATCTTGATTATGGAGTATTTTTCCATCCGCTAATGCTTCTTTACTATCCACCACTTGCAGATGATCATTCTTATTTGAGATGCCTTGATCCTTGGTTGATGTGGTATCCAACCCTTGTTGATCTAGGTCTTCATCTACACTTGCACTGCTGGACAAACCGGAGACAATTTGATCCGCATATTGACTCATCAAACGGTTACTTGCTTCGTGACCATAGAATGTCTGCTCACCAGTGACGTTGTAACCAGAAGCGGTGAGTTTGGTACGAAGTTTCTCACCTTCATTACCCAAACCTTCGTTATCGGTCAACAGCAGAATGGATGTCTCTTTTGGCAAACCTTCGAGATTTTTCTCTACAGAGAACTGGCCGTTAACCGCTTTCGCGATAGCCCCCACAATGCCCGCTGGATTCGCCACTTCGTGAGCGGTGATGGCTTTGGTCATGCTTGGCATAGGACGGTCAAGCAATAAGCCAGACACCGCTTGGCCGTTTTGCGCGGCATAACGTGCTAAATCTGCGGCAATTGGACCGCCCATTGAGTAGCCGTGAATGATGATGTTGCTTGGGTCAATACCCTTATCATTCACTAGGTAGTTGAACATGGTGCGAGCATCTTGATACAAGCCTTTTTCGCTTGGTCCACCGTCGCTTTCACCATAGCCACGCAGGTTGACTGCGAGCATATCGATACCTTGCTTCTGGTAGTGATTTCGAATCGCGCTCGCTTGCTCTTCAGCAGAAGAACCAGAACCGTGCAGGAACAATACGACTTTGCCGCTTGTCGTGCTCGTTTCACCTTCCCTTGGGGCAGTTCCCTGATGGTAATACCCCGTTAGACGGCCCGCTTCTCCTTGCAGAGTGATTTTCTGCGATTCACCTTTTTCCACCGCATGATCAAGTAGGGTTTGCGTGATTTCGCCAATCTTACGGCGAGCCTCTTTATCACCATACAACTCGTTATTGAGGAATCGAGTCAATGGACTCAGAGATTCTTTATCGCGCGGTGGCGTACCATCATTTTCAATCGCCACGTTTTCTTTGGGCTTCTCATTGCCTGACAAGGCATCAAGGACATCAGTGACTTGATGACTTTGCTTCTTCGCAGCAATTTCCAGCTGAGCTTCCTGCAGTGCCTGACCAAAGTTGAAAAGCTCGGTCGGCGTCCAAACACCAAACTTAGGCAGCCATGTATGGCCAATCAGTTTATCCGCCCCGCCGGCTTTCAACACTTTAGCCACCGTGCTGGAGCAGTTTTTGGTCAGTAATTGATAGCGAGCATCCGGATCGTTGCTTAGACGATGCCATTCTGCTTGCATTGCCGCCACATCGAGGCCTTCAAGGTTAATTCGGAATACTCGACCTTGATCGGCTTGGCTTGCTTTGAAGGTTTCGATCTCTTCCAGTGCTCTTTCTGCAAATTGGCGAATCACATTGCCGATGCGCTTTTCAAGCAGTTCGGGATCGTCACTTCTTTGAGCCTGTAGTCTGAGCTCTTGCGCAAAACGGTTTGCCACGTCCATCATGTCATAACTGGTGTCATTCCATTGTTCGACAAATGGTTGGAAAACATGTGCAGGAATACCTGTCGTTTCGAGCATATCTGGGTTACCCAGTAGTACGCTCGCGTAACCTTCAGAAGCCTCTTTAAATGAAGCGTCAATTCCTTTCGCAGCATTGAGCTTCTCGATAAAGCGTTTCAGCTTAATGTCGCCATCATGCAAACCAAAGCCATCGTTTTCTTCCGAAGCAACATCATGCTCAAGCGTGTCATTTTGATGTGCAGGTTGACTGAAGTCGCTCCAGCGCAGCTTAAGATCAGGTTGGTCTTTTGTGGCCACATTCAAGATATTGCTGATATTGGATGACTTGCTGCCTAGTGGCCACCAGCTTACGTAATTTTGCTGGTTAAAATCAGCTGCAGCTTGACCTTCAAGTTGCGTGCGACCTTGGCCTATTTGCAATGCGGCATGACCTAGCGCGCTATGATCACTCGGTTTCCAAATGTACAACGTGGCGCTGACAGGTGACAGATGCTGTTCAATCGCTTGGTGTACTTTATTCAAACCCGCGATATCAATCCCTTTACCGTCTTTGGCAAGATCTTTGGCTGATAATCCAGCAAAAATTGTATCCGCAACGGTTTGACCGTACTCGTTACTTACTGTTAACTTCACTTTTTCAAGCACAGCTTCGCGGCTACCTAGCAGATCGATATTGGCAAGATTACCTTTGCCTTTGGTAACAAAGTCACCATTTTGATCCAGATAAAGGTGCTTACTCTCTTTCAAGTTAGCGGCTTCTAGCTGGTCATATAGTCGGCTGAAACTGTCTTGCGCTTGGATAGTTTGCTCAGCAACTGACAGCAGTGAAACTTCATCTAAACGCGTATTCACCTGATTTAACAGTGAATTCATCGCTTCAACACGACCTGAGTCTGGGTGTCCGAGTACATAGCCTTCAATTTTTTTACGCAAGTTGAGCAGTTTTTCCACGGACTCCGGTTCATAATCTGCGTGCTCAGTGATGCTGGCGTGGTAATCAGTCAATGCGTCGATCACTTGCTGATAGCTTTCACCGCGGATCTTACCCGTTACATTCGCCGCATCCAGTAGTTCAGAGACTGACATCAGCTCTACTGTGGCGTTGTATTGCCATGCACCATCGACTTTTTGCGCCAACACATGTGTACGACCTTCATGGTGTGAAATCACTTTGTTATCCACCAAGAAGAGCTCGTCTGGCATTTGTTCGAAATTATCGAAGTTGGCCAGAATGAAGCGTTCGATGGCTTTGTGCTGAGCCGCGTCTTCAGGGTCCTCAAGAATAATGGTTTTACTCTTGTCTTCGGTTCCTGGCCAAGCACGCACTTCGAAGCCTTCATCTCCTTCTTGTACCAGTAGATTTCGTTCGGTACGCAGTGCATAGAATGGCAGCATGTCATTTAACATGGTGTTAATTATGGAAGCGACGTTTTCTTGCTGCTTGTTATGCTCTAGATTCGACGTTTCACCTAAAGCGTGTAATTCTTCTTTATCGTTAGAAGCAAGCTTGGTAGATGTGAGCGTCCCATGGTTATCAGTATAAGGAGCGGTAATTTTCGCCCATTGTTCAGCATGCTGCGTCATCAACCATTCAGCTTTGTGATTGGCGGATGCACTGGACTTAGGATTTTTGACTGGCGGCTGTCCTTTGCTTGCTCGATACTCCTGATACCAAGCATCGAAATCCTCACTACCGCTAACCAAAGCATTTACTTCAGTAATGATGGCTTTACGGCTTGCCGAGTCAAAATGATCGAGTGCCTTAACATCCACTTTCACCGTTGACGCTGTTTTGGGTACAAAGTCACTTAATGCACTCGGTACACTGCGAAACTCAAATAAAACCGCTGGTTCAGGCTGTTGCACACTTCCTGTCGTCGCATTATTGATTCCATGTCCAGCGACTTCTCCACCATATTGGAAGTATTCAAACACGTTCTTAGCAAGGCTTTCTGAATAGGTGCCTTTCAGCTTTTCCGCAATTTGTTGCCTTACGTAGCGTGAGTCTTCCGCACTCAGAAGTGCCAGCATTCTAACGGGTTTAGTTCTTGGCAATATGCTCCAAGCATTTTTAACTTTCGGATCCGTTAACCCTTGAGCATTCGGTGAAAAACCTTCAGAGGCAGGATCCCAATCATTGATGTAGATCCCATACTCTTTGGCCAAATCTGCTGTTTTTGAATAGACAGACGTGAGATAGGCATACACATTTTGTGCTGTAGCGGGATCATCCAACGTGGTGTTTTTCGCGTTTGCTAAAAACTCATCTCTCAACCCTGCTTGATACCAAGGCGCAGATTCAAGCAGTCTGACTTCCGACGACGAGCTTGTACCAAATTCTTTCGCACTGATCGCCATTGTTGCTTGTTGACCACTAGGGGTCATTCCTATGGTCTTGCCTTGTGCATCAATAGAGGTTCCGTTACCCGCCGCGATAAGATGCTTAGAGTTCGATATAACCAGTGTGAAACGACCGTCATCACTCACTAAATGAGGTAAGCTCTGGTGGTTAGACTGATTGATATGATCGGTAAACTCTTTCGCAAGCCATAGCATTGCCTCTTTACGACTTTCAACCGCTGCTGTATCACTGATTTCACTAGGATATGTAACCAGTTCAATCGTATGCGTCTGCCAGTTGTTCACCCCTTGAATATCATTGATACCCACTGGGTTGCTATAACCACCTTGATTCATATCCTTGGTTAGCATGAACAATGGGTTACCTTGTGAATCATGCACATAGCCAAAAGTCTGCGCTGAGTTTTTCGGTAAAACCACCACCAGACCGGATAGCTCATTTTCTATACCAATGGATGTCGATGCGAAATGGCTTTTAAATCCGGGGACACTTTTCGAAGCCAGTTGACCCGTTGGTTGACTTCCCTGTACCGCACTTAGACTTTCGAGGTTCACCCCTGAGATACGGACGTCTCTACGAGTCACTTCTTGGCGATCTGGTTCAGGCGAGACTTTGGTAGGAACAACAATGCTCTTGCTATTTGTTTCAGAGAACATAGAAGTCATACTGCTAACGCTGTTTTTCGCTCGAATACCTGCGTTAATTTGCAAACGGTTCACTGCGTTGGTCGCACCTTCTAGCGCTTCCTGCTCTTGTTCGGTCAAACCTTCACTGAATCGGCCATCGGCGTTGGTTTGGCTGTCGGTGTTGATATGACTGTCTGTTTCGCCAGCGCCTTCCACGCTATGAGCGTTACCCGAAAGGCCACTACCAGTCACACCTTGACGATCAGGACGATCACCTCCGTTTTGTTTAGCACCTTGAGCATCGGCTTGTGCTTGGTTGGCTTTGTTTTCCGCCAATTGCACATCACGATCGCCGCGCGATTGAGCATTATTTACCGCATGATGCGCATCAGATTCCGCTTGTTGCGCATCCTTACCTTTTGCTAAAGCATCGGCTTTACGTTTTTCAGCATCGGCTTGAGCATCAGCAATATCTTGCTCTGCTCCCGCTCGATTTTGCTCGCCTTTGGCGATTCCCGCTTCAGACTTCGCAACCGCATCTTTGACTTTATTTTGATTGTCCGCGTGCGTCTGCTTCGCCTCAGCAATCTTGTCATTCGCGAGCTGCTTCGCATCGTCGAGTTGAGTTTGAACGCCAGCCAGCAGACCGCTAGCAAATTCGTTACGCCATTGGTCGCCTGACTCGCCCGTATGCGTTGCTTGGCTATCAAGCACATCGAGACCTTGCGCCAGTTTTGTCAACTCAGCCGTGATAGCTTCTGATTCTTCTTGCACCGCATCACGTTGAGCTTGACCGTTATTTTCTAAGGCTTGTTGGTCGGTCGACTCCAACTGACTTTGTGAACCTGCGACAGCATCAAGCTGTTTCTGTTTTTCTTGTTCTAGACGTTGGCGATCCGCTTCTGCACGCTCTTTATCGGATAGAGCATTCTGCGCTGCTGGGTTTTGCGTTGCGTGCTCACGGATTGCATTGGCTTGTTGCGATGACTCTGATGTCGCCACCACATTATCTAGGATGTAACCCAGTCCATCATTGTGGCCTGTGCCTTTGAATTCGATACGGTTACTGCCAGCCTGCGCGGTCAGCTTTAAGGTTTTTTGCTGCCAAGCAGACTCGTCACCCGACGATGAGAATACCACTTCACCGTTCCAAAGAACTTCGATGCCTTCGTTATTAGAAAGGCCCGCGCGTTTTGCAAAATCAAAGCTCACCGCAATGACTTCACCTTGCGCTAGATTAGCCAGATCTTGATAGAGACTGGTGTTGGTATAAGTATCGAGCTCGGTCACACGTGCGCCGTGACCTTCACCCTCTACGCCATACACACTGCCTGCATAAGAGGCTTCAACCCCATGAGTAGATTGCCAGCCATGTTCGCCCAGCTCAAAATCCCCATTCACAATCAAATTCGAAGCTTGAACAGAAGAATCATCAACATTGAGCGCATGCTCCATCTTGTTGAGATCAGGAGTATCTACCTTAGTCACGCTGCCCGTGAGACTATCACCGAGGTCAGAACCGACTTCCTTAATGGCATCCATTTGGAAACCATCCAGTTTGGTGATTTCAGGTGTTGCAATCGCACCACGACCTTTATGCGTGCCGGATGAGGAGGCTTCATCGCCTTGCACTAAGTAGTTGATGGCTTGGCTGCCACCGACACCGAGTACGGTTTGTTTGATGTTATCAAACAAGGCAGAGAGCTTGTTGCTGCTGGTATTGCTCGATGCCACCGCTAAACTATAGAAATCGCCGTTACCGACTTTGATGGCGACGTTGTTTTGTGCGTAGGAGGCATTGATGTTGAGGCCGTCCCCAACGTGAATATTGGCGTTAGCTTTGCCCTTCATCACAGCGACATTGAGCCCATCACCCACTTTCGTGTTGATATTATATTTGCCCCAAGCGACGTTGACGGATACCCCATCACCGACCTTGGTGTTGATATTGGCATCGCCGTAGGCAGCAGTGACATTTAAACCGTTACCCACGGTGGTGGTGATATTGGCTTGGCCTTTTGCCGCCGTCACCTGCATACCATCGCCCACCTTGGTAACTATGTTGCCTTTGCTCCAAAGTGCGTTAAAGCTGTCACCATCACCCACTTGAGTAACAATGTTGGCTTCGCCTTTGGCTAGTACCACGTTTTGGCCATGACCGACTCTAGTAATGACGTTCGCTTTACCCCAAGCACCAGTGTAGTCGTCGCCATTGCCCACATGAGTAATGATGTTGGCTTCGCCTTGCACCACGGAGACTTCTTGGCCATCACCCACCTTGGTGATGAGGTTGGCTTCACCTTTGGCAAAGTTATAGCGATCACCGTCGCCCACTTGGGTAAACACGTTGGCTTGCCCCCAAGCGACATTAACACCTAAACCATCACCGACTTTGGTAATGAGGTTGGCCTTACCTTTGGCAATACCAATGGTGGTGCCGTCACCAACGTGGGTCATCACGTTGCCGACATCGGAAATCAACAACCCTACCGTGGTGCCTTCACCCACCTTGGTGAGGATATTGCCTTTACCCGCCAATACGGCTGCGGTGGTAGCATTACCAACATGAGTCACAACGTTGGCTTTGGCGGCGGCCACCACCCCCATGAAATCATCACCCACTTTGGTAACGATATTGGCTTCACCCAGTGCGAGTACACCGGTTAAACCATCGCCGACATGGGTCATGATGTTGGCTTTACCGAACATCGCCGCTAATGTGGTGCCGTTACCAACCTTAGTCATCACGTTCACTTCGCCAGCAAACAACCCTAGGCTGGTACCATCGCCAACATGGGTCATGATGTTCGCTTTGCCGACCATCAGCGCCGCCGTGAGATCATTGCCCACTTTGGTCATGATATTGGCTTGGCCGATCATCGCCGCAAACGTGCTGCCACTGCCTACGTGAGTGAAGATATTGGCGTTACCAACCATTGCAGCGAGTGTGGTTCCATTACCCACTTTTGTCGCAACGTTGCCTTTGGCTAACATCAGTGCCACCGACACGCCATCACCAATGTGAGTGAAGACGTTGGCTTCGGCTACCATTAACGCGAGAGCATCACCGTTGCCTACTTTGGTAAAGACGTTACCCAAGCCGCCCATCAGCGCCCAAGCATTCCCTTCTCCCACATGGGTAAAGATGTTGCCAGCACCAATCATGACCGCAATAGAGGTTCCATCACCCACTTTGGTGAAGATGTTACCCGCCGCGCCCATTACGCCAAGGGTTTGGCCATCACCCACATGCGTCAGCACGTTGCCAACGCCAAGCAGAATGCCGGTCGTGTCGCCATTCCCGATTTTCGTTAGGATGTTCGCGCCGCCGACCATGACACCCGTTGTGGTGCCATCACCAACATGGGTAAGCACGTTTGCGCCACCACCCATAACAGCAAGGGTATTCCCTTTGCCTTTTTTGGTGAGAATATTTGCACCGCCAAGTGCCACAGCCGTGGTATTCGATAATTGATCATCATTGCTGATGTGGGTTATTACGTTCGCGCCACCCAGCATACCCGACGTGAGTTGACCCTCACCCATTTTGGTCAGTACGTTTGCACCGCCAGCGAGTACCGCAGCAACATCACCTTGGCCAACTTGCGTCATCACGTTGAAACCGCCCGCCGCCAACCATAATCCGTTGCCAGAGCCGATTTGGGTATGGGTGTTGTAGCCACCGAGCATCACGACGCGACTATCACCGCTGCCTTTTTGGACTGAAATGTTACCGTAAGCCAGTAAATGCGCGAGATATTGCCCATCGCCAAGACGAACCAACACGTTTACCGCACCGCCCGCGTACACATCCATCTTGCCTTGTTGGCTTTGGTGTACCAAGACGTTCGCCAAACCTGCGCCGCGGAAGGTGAGATCCCCTTCTTCGCCACTTTTGACAATGACGTTCGCCGCACCACCACCGTTAAACTCGGTATTACCAAATTGGGAGCTATGCAAAATCACGTTGGCAATACCGCCACCGTTGAAATGCACATTACCACGCGTCACGTTTGATTTGATGACGTTACCGCCACCGGCACCGTTGAAACGAACATCACCTGAGCTTTCACCACTATCGTTTGCGGGTTTAAACAACGTTTGATCTGTGTATTCTTCGATATCCGCTACGGTTTCGCTGGTACTTCTAACGGCGTTTACGCTGTAGTGCAAGTCACTGAGCGAGTAGCCACCTTTGCCCATTGGGTTATACCCAGTTGCAGATGAAATATCTTGGTTGGCTATATTGGCCGTGTGATCACCTTCTTTATACCAAGAGCGAGCTTGATATTTCAGTGCCCCGGTTTCGGGATCGTTTGCCAGTTCCACCACCACAATTTTTGTGTAAGGCCCAAGCACTTTGGCATAAATGTAAGTGTTGGGTTGGCGGTTGGATTTTACCGCTTTGACATCCACCTTAGTGCCATCAGCATAAATCGCATGACCGCCCATTTTGGCATCCGACAATGACACCTCTTTCGCATCAATCACCGCACCATTGGCGTAAGTCACCCACTCATATTCCGTCAGGCTTTTTTCAACCGCATGAACGGTGACGGATTGCTGATGTTCAACTTTGAGGTCAGATAGCGTGTAGGCGCCATTGATATTAATGGCTGTGAAACCACCGTTGTCGGAAATGTCTTGATTCGCTAAGTTGCTTAGATGGTTACCCTCTTTATACCAAGCCGTAGAGTAATATTTCAGTTCTCCAGTTTGAGGATCATTGCGTAACTGTACTTTGTTAATTTTGGTATACGTAGAGTCTGCGAAAGCAAAAAGATATGTATTAGGCTCGCTAGCCGATTTCACCGCTGTGACATGATGATCTTGCCCTATCCATGAGCCACTCATCACGGCCTTAGTTAATACAATCTCATCAGCCTTAGCATTGGTCATGCCTTCTTTCGCAAAGTCATTTCCAGAACCTTTACGAATAATGGTGTTGTACGCACCGCCACCCGAGAAATAAATGTCACCATGAGCCACATCAGAGTAAAGACTGTTGTAACCACCCACACCTTCAAAACGAATATTGCCGCGGGTTTGCGTGTAGACATCTTCGGCCTGATGTGTCCGCTCAATGCGGTTTGAAGCCCCAGCACCTTGCAGAGTAATATCGCCCACTTTGCCTTTACGGACTAAATGGTTATCCGCACCTGCGCCACGGAAGGTGATGTTACCGGTTTCAACGCGTGAACTGATGCTGTTCGCGGCACCGGCACCATCAAAGGTCACATCGCCATGCGAGCCTTGATAACGGTTAAACCAAGTACGGTCTAATTTATTACCTGCGCCTGCTCCTGTAAAAGACAGGTTCCCTTGGTTGGTTTCATGCCAGAGGGCATTGTATCCCCCTGCGCCTGCAAAGGTGACATTGCCGCTCAACCCTTTACGGGTAATGCCATTATAAGCCGCGGCACCGCCGTAACTGACATCACCATGATTACCTAAATGGTCAATCGACACACCACCGGCGGCACCAGCGAATGACACATTGCCATCGCCACTTTTATTAATGTCTGCATACCCAGCCGCGCCTTTTACAGTTAGGTGGCCAGTGGAATCTTCCACTTTGAGATAGGCAGAGCCGCCCACGACCGTATCGTTGCCGCTACCGGTGTACACCGTTGCACCAATCGATCCAACAGTGACGTGGTCGTCACCGCCGTAGGCATGAATTTGTCCCCCAAAACCAATGGCTACAATGTTGTTGTTTCCATCGTCGGCGGAATAGTTTCCTGTGAAAAAGTATTCAACACTTCTCCAAAATGGTTTTCCCATAAGCCAAACTCTTCTTTTAGGGATTAAATAGAAAACCACTGCACCTTTCGGATACAGCGGTCATTTTGAGCATCCACCTTGCCGCGGACAGTACAGACTTTCTGCCCCTGCCACGGAAGAAATACACGACGACGAAGATCATTGACGACCTCGCGACCATGCCCGAATGGAGCAATCATCTCTGGAATGTAGATTTGCTGGCCCGAACGCCAGTCGGATGGAGATATTTCGCGCACACCAGAAAGCAGCTCATCTCGGATCTGCTCAGAGACAAACGCCCAATTACAAAAGGCAATTGGACGCCCATGCTCATCTTCGTAATAGCAAAACTGGTTTAGCTCAAACGCAGGCAGAATGCGTTGCAGCCATTCTGCAACCACATAACGACGATGAAGCGGCGAATGCTGGCTGAGCAGCATCACACCACCTATCATCTGTTGTATTTGCGCTAATGTGAGATTTGCAGGTTGATGTGTAATAGACATGTGCCCAGTATCAACCGATGCGCGGTTACTTTTTCACCTGAGCAGGACGTTTCTTCGCTGCGGTTTTCTCTTCCGCGGCAGCGTCTTCTGCTGAATTTGCGGTGGCATCACCTGCTGGCAGAAATCCAGCAGCAACAAGCTTATTAAATTCGGCTTCCATTCTAGGATCTTCTAACATGTTTTTGATCATAGAAACCATGTACATAAAACCGGATGCAGGCATATGAACAACTTGCTTTAATTCAAGATCATTATCGCTTTCAGCCATAACGCCGTGCGCCATATTTTTTTGATCCTGACCTACGAAATAAAGAGAAAACACACCTTTATTGTAGTTAACACTCGAGACTGTCTGAACGAACTTCTCCATCATAAATTTCCCCATTTATTTTACACAGATATGAAAAGATGGTTTCTATGAACCAATCTTCTCCAGCAAGCCCAAAGGGCAAAATATAACAAATCAAACCTAGGGTTATTATTGTATTATTTATTTGAAGGCGGGATATTACATCGTTAAACCATGGCTATCAACCGACTTAAACACATTAAATTAGAGTATATATTCTAATTTAATGGTGCGGTATTCCCTCAATTTCACACAGATGCAACCATCAGACAAATTGCACATTAACGTGACTCATATCATTAATTTATATCAAATTTGACTACAAAATAGACTATTTCAACATTTATCATACAACGCTGACGGTGATTGAGGATTGAACATGTTAACAAGCGGTGATAGGATGCCGCCCTTTCAAAAATGCTGTAATACATCGGCGAAATAATTTAATGATATCCAACAATGAAAGACTCACGCTGATATGCATTCATTTTTATTTAAGTATCATCAGCGGTAATCGAGAAAAAATAAAAGACGAAAGTCATTTAACTGACACTGAAAACTTCAAAGAAGAACTGGATAAAATACAGAAAGAACACAAAGTCAAAATACGCACAAAAATTTCACAATTTAAAAACATCGAATCACTAAAAACACCTGCAATTTTATTTGATCAGCACGACTTACCATTTATTCTAGCAAAGACTAATAAAGATAAATGTTTAATACAAAGACCTAATAAAGAAACTCCCGAAGTTATCTCTTCTCACGAACTTAATTCTACTTGGAATAAAAAATCGTTAGTAATACAGCAAGCACAATCGCGCTTTGATATCACTTGGTTCATTCCTGAATTCCTCCAGCATAAAAGAGTGCTGAGCGAAATTCTTTTATTTTCGTTCGTGCTGCAAATTCTGGCTCTGATCTCACCTCTGTTTTTTCAAGTGGTGATGGATAAAGTGCTGGTCCACCAAGCTTGGTCGACGTTGGATGTACTGGTATTTGGTCTAGTGATTACAGGGGTGATCGAAGTGGTGTTACGCGGATTGCGCGAATATCAGTATGCCCATACCGCCAACCGAATTGACATACAGCTTGGACTCAAACTGGTGCAGCATCTGTTTGGACTGCCGCTCATGTTTTTTAAATCACGCCAAGTCGGCGCGATTGTCACGCGTGTGCGCGAGCTCGACACCATTCGCGAGTTTTTGACTGGCTCTATGTTTACCTTAACAGTCGAGCTACTGTTTATGTTCGTCTTTCTTTACGTCATGAGCTTACTTTCTGCTCCGCTCACAGGGCTATTTATCGCTACTGTGCCCTGTTATGTCCTTCTCGCTTGGTGGCTCACACCAAGAATGCAGGCGGCGATTGAAAAGCAGTTTTCACATGCTGCGGCAAATACCTCATTTCTCACTGAAACCGTAGCCGGTAGTGAAACGCTGAAAAGTTTAGCGGTTGAGCCGCGATTTATCCGACGCTGGGATGAACAAACCGAAAAAATGGTCACAACGGGTTATGACGTCCAGCAGCTTAATAACCGCTCTAACCATTTAGTGCAGTTGCTGCAAAAAATCACCAGCGTCGCCATTTTATGGTTGGGGGCGACGGAAGTGCTCTCACTTGAAATGACCATTGGTCAACTGATCGCTTTCAACATGATGACCAACCACATTGCGCAGCCCTTAGCGCGGATGGTCGAGCTGTGGGGACAATTTATTCAAACTCGGGTCGCCATCGAGAAGCTCGGCGATATGCTTAACCTGCCGGTAGAACAACACACAGGGAGTGACAATGTCACTATAAGCGGCGCTATCAGTTTCAAAAATATCCTCTTTCGTTATCAACCGGATATTCCTCCTACCATTAATGATTTGTCGCTAGACATCCGTGCAGGTGAAACATTGGGGGTGGTCGGTACCTCGGGCTCAGGCAAAAGCACCCTCGCCCGTCTGTTATTGCGACTTTATAGCCCTGAGCAAGGCAGTATTACCATTGACGGCATTCCACTCAATCACATCAATGTTCAACAGCTCAGACAGCGAGTGGGTGTGGTATTGCAAGAGAACTTTTTATTTCACAAAAGCGTGAGTGAAAATATTGCCCAATCTAAGCCTGAGGCAAGTTTGGAGGAGATCATTGAGGCCGCAAAACTCTCGGGCGCACATGACTTCATCCTCAAACTCCCCATGGGTTACGACACAGTATTAGCGGAAGGTGGTCAATCGCTTTCAGGTGGTCAACGGCAGCGTCTCGCGATTGCACGCACCCTTTTATCAGATCCGAAAGTGCTGATTTTGGATGAAGCAACCAGTGCGCTGGATGACGAATCTCAAGCCGTCATTCAAGCCAATATGGCCAGCATTGCCAGAGGCCGTACGGTGATCACGATTGCTCATCGCTTGTCTACGGTGCGTGATTGCGATCGCATCATAGTGCTTCATCAAGGCACGATTGTTGAGCAAGGCTCACACCAACAATTACTCGCCTACGGCAAACAATACAAACAGTTATGGCAGCTTCAGCAAGAGCTCAAACAAGAGGAAGCCTCCGCATGA
Protein sequences of DBSCAN-SWA_2 >NZ_CP028892|1511262:1535307|1517600_1517957_+|WP_000288489.1|DBSCAN-SWA MYTNNLIDAYKSHMNFVQYKQVAHQLGLSPQMLADIRNGRAHLKENLALIIADEIGEDKEKVLIGLAADRAKSPEEQAIWQQIAKKYKGLGLQGLSMAYVGIALYHAPISQCVLGILC >NZ_CP028892|1511262:1535307|1532428_1532776_-|WP_001906284.1|DBSCAN-SWA MMEKFVQTVSSVNYNKGVFSLYFVGQDQKNMAHGVMAESDNDLELKQVVHMPASGFMYMVSMIKNMLEDPRMEAEFNKLVAAGFLPAGDATANSAEDAAAEEKTAAKKRPAQVKK >NZ_CP028892|1511262:1535307|1516654_1516858_-|WP_000502184.1|DBSCAN-SWA MGDFIYYDNEPNIGINVYFVWGHRFFKNWPEFEQYLAVHYGSDPYQLVEITNENYNELLLKGVFHAM >NZ_CP028892|1511262:1535307|1531941_1532403_-|WP_001881196.1|DBSCAN-SWA MSITHQPANLTLAQIQQMIGGVMLLSQHSPLHRRYVVAEWLQRILPAFELNQFCYYEDEHGRPIAFCNWAFVSEQIRDELLSGVREISPSDWRSGQQIYIPEMIAPFGHGREVVNDLRRRVFLPWQGQKVCTVRGKVDAQNDRCIRKVQWFSI >NZ_CP028892|1511262:1535307|1513178_1514603_-|WP_108243802.1|DBSCAN-SWA MLRTLTLTNLALLLFLTLFLLLLPSKASAEIECQIGISSGSVSWSGEILGDKPYTCVRTCRYNLATVAVCFVNNGSCHGEFVSNGKHCLLANGQIDPSDGLRFGGNTVIRDPSADPDKPWDPNAPSSMPAKVMNVIRGMPRDTTSGIQQAQAFKDIAHIEGMGVMTLDELLIKNSQLLDINKGYSSLMSTMSGQLYSIRNLSDYIEKNTSQTAAFSQMSANTLGNILNKLTDSGSGGGSGSGLPDSQLNSFMGSMSTTRSVISANSNNIVSAVRDVREAVRPVEFGINAVNEKLQLVNENLAGLSEGLFFTMEDNTNKIVSAINANGGGSGNGDLSGVQSGIDSIKSGIDNLNGLLGGNGLTKPGISSGVNFGETPLYGSDSLAALNTEITELQKEYSEKIKDFQKLFSFDVSKLNTGEYKEHSLSFRFANGQETSIKSSVFPALVANAGLISSVILFLAALAGLRIVMGGGDK >NZ_CP028892|1511262:1535307|1515561_1516665_-|WP_108243803.1|DBSCAN-SWA MPCKHPHHDTVRPVKVDHLAFTFAYADLRHLDKSNDQDFINLQMPVYHEPKTKTKEQGAVCCTLEQIERHMEAHKNKVSKMLFHRFDLFMSKIMGFRLSPMRGRGLHGYNDSMVILDMTGQVECGLVGIGGNNDTVFVQINGTGCTKLFDRIDSKKLHWWLAQVLGITRLVRLDLAVDDYTGNFDAKYAEKCFYEGAFRTAPRGQGPSMVPHKRITENGALMEEATIVGSRSSAIYWRIYNKKLEQKITDPDLIWYRNEVELKKCDIELLANPAASFAGICPFAASIECTPPVKFSRNKKAQGLEFMARIAWVRRQCGVALAEVIAMTQGDLGEAFGMLIPHKHRRPDFELLGVPDSYTQLKNTLWS >NZ_CP028892|1511262:1535307|1515218_1515557_-|WP_001284209.1|DBSCAN-SWA MANITGIVIKTFPKSGTTIAELNVLRPVETVNVEKFAQYGLGLNTDIPFNKQPLRIEPTYAKRLIETRAFVPNREYDIRFGSNPDDPLEVVAVELIPKDEDVKKYFTETLKK >NZ_CP028892|1511262:1535307|1511262_1511451_-|WP_071187661.1|DBSCAN-SWA MEQIVITANQLATLIEASYFYNFTAVLAALFVYDLLSSFLALGVRALKSNLQKKISTESVDN >NZ_CP028892|1511262:1535307|1518280_1531918_-|WP_162298852.1|DBSCAN-SWA MGKPFWRSVEYFFTGNYSADDGNNNIVAIGFGGQIHAYGGDDHVTVGSIGATVYTGSGNDTVVGGSAYLKVEDSTGHLTVKGAAGYADINKSGDGNVSFAGAAGGVSIDHLGNHGDVSYGGAAAYNGITRKGLSGNVTFAGAGGYNALWHETNQGNLSFTGAGAGNKLDRTWFNRYQGSHGDVTFDGAGAANSISSRVETGNITFRGAGADNHLVRKGKVGDITLQGAGASNRIERTHQAEDVYTQTRGNIRFEGVGGYNSLYSDVAHGDIYFSGGGAYNTIIRKGSGNDFAKEGMTNAKADEIVLTKAVMSGSWIGQDHHVTAVKSASEPNTYLFAFADSTYTKINKVQLRNDPQTGELKYYSTAWYKEGNHLSNLANQDISDNGGFTAININGAYTLSDLKVEHQQSVTVHAVEKSLTEYEWVTYANGAVIDAKEVSLSDAKMGGHAIYADGTKVDVKAVKSNRQPNTYIYAKVLGPYTKIVVVELANDPETGALKYQARSWYKEGDHTANIANQDISSATGYNPMGKGGYSLSDLHYSVNAVRSTSETVADIEEYTDQTLFKPANDSGESSGDVRFNGAGGGNVIKSNVTRGNVHFNGGGIANVILHSSQFGNTEFNGGGAANVIVKSGEEGDLTFRGAGLANVLVHQSQQGKMDVYAGGAVNVLVRLGDGQYLAHLLAYGNISVQKGSGDSRVVMLGGYNTHTQIGSGNGLWLAAGGFNVMTQVGQGDVAAVLAGGANVLTKMGEGQLTSGMLGGANVITHISNDDQLSNTTAVALGGANILTKKGKGNTLAVMGGGANVLTHVGDGTTTGVMVGGANILTKIGNGDTTGILLGVGNVLTHVGDGQTLGVMGAAGNIFTKVGDGTSIAVMIGAGNIFTHVGEGNAWALMGGLGNVFTKVGNGDALALMVAEANVFTHIGDGVSVALMLAKGNVATKVGNGTTLAAMVGNANIFTHVGSGSTFAAMIGQANIMTKVGNDLTAALMVGKANIMTHVGDGTSLGLFAGEVNVMTKVGNGTTLAAMFGKANIMTHVGDGLTGVLALGEANIVTKVGDDFMGVVAAAKANVVTHVGNATTAAVLAGKGNILTKVGEGTTVGLLISDVGNVMTHVGDGTTIGIAKGKANLITKVGDGLGVNVAWGQANVFTQVGDGDRYNFAKGEANLITKVGDGQEVSVVQGEANIITHVGNGDDYTGAWGKANVITRVGHGQNVVLAKGEANIVTQVGDGDSFNALWSKGNIVTKVGDGMQVTAAKGQANITTTVGNGLNVTAAYGDANINTKVGDGVSVNVAWGKYNINTKVGDGLNVAVMKGKANANIHVGDGLNINASYAQNNVAIKVGNGDFYSLAVASSNTSSNKLSALFDNIKQTVLGVGGSQAINYLVQGDEASSSGTHKGRGAIATPEITKLDGFQMDAIKEVGSDLGDSLTGSVTKVDTPDLNKMEHALNVDDSSVQASNLIVNGDFELGEHGWQSTHGVEASYAGSVYGVEGEGHGARVTELDTYTNTSLYQDLANLAQGEVIAVSFDFAKRAGLSNNEGIEVLWNGEVVFSSSGDESAWQQKTLKLTAQAGSNRIEFKGTGHNDGLGYILDNVVATSESSQQANAIREHATQNPAAQNALSDKERAEADRQRLEQEKQKQLDAVAGSQSQLESTDQQALENNGQAQRDAVQEESEAITAELTKLAQGLDVLDSQATHTGESGDQWRNEFASGLLAGVQTQLDDAKQLANDKIAEAKQTHADNQNKVKDAVAKSEAGIAKGEQNRAGAEQDIADAQADAEKRKADALAKGKDAQQAESDAHHAVNNAQSRGDRDVQLAENKANQAQADAQGAKQNGGDRPDRQGVTGSGLSGNAHSVEGAGETDSHINTDSQTNADGRFSEGLTEQEQEALEGATNAVNRLQINAGIRAKNSVSSMTSMFSETNSKSIVVPTKVSPEPDRQEVTRRDVRISGVNLESLSAVQGSQPTGQLASKSVPGFKSHFASTSIGIENELSGLVVVLPKNSAQTFGYVHDSQGNPLFMLTKDMNQGGYSNPVGINDIQGVNNWQTHTIELVTYPSEISDTAAVESRKEAMLWLAKEFTDHINQSNHQSLPHLVSDDGRFTLVISNSKHLIAAGNGTSIDAQGKTIGMTPSGQQATMAISAKEFGTSSSSEVRLLESAPWYQAGLRDEFLANAKNTTLDDPATAQNVYAYLTSVYSKTADLAKEYGIYINDWDPASEGFSPNAQGLTDPKVKNAWSILPRTKPVRMLALLSAEDSRYVRQQIAEKLKGTYSESLAKNVFEYFQYGGEVAGHGINNATTGSVQQPEPAVLFEFRSVPSALSDFVPKTASTVKVDVKALDHFDSASRKAIITEVNALVSGSEDFDAWYQEYRASKGQPPVKNPKSSASANHKAEWLMTQHAEQWAKITAPYTDNHGTLTSTKLASNDKEELHALGETSNLEHNKQQENVASIINTMLNDMLPFYALRTERNLLVQEGDEGFEVRAWPGTEDKSKTIILEDPEDAAQHKAIERFILANFDNFEQMPDELFLVDNKVISHHEGRTHVLAQKVDGAWQYNATVELMSVSELLDAANVTGKIRGESYQQVIDALTDYHASITEHADYEPESVEKLLNLRKKIEGYVLGHPDSGRVEAMNSLLNQVNTRLDEVSLLSVAEQTIQAQDSFSRLYDQLEAANLKESKHLYLDQNGDFVTKGKGNLANIDLLGSREAVLEKVKLTVSNEYGQTVADTIFAGLSAKDLAKDGKGIDIAGLNKVHQAIEQHLSPVSATLYIWKPSDHSALGHAALQIGQGRTQLEGQAAADFNQQNYVSWWPLGSKSSNISNILNVATKDQPDLKLRWSDFSQPAHQNDTLEHDVASEENDGFGLHDGDIKLKRFIEKLNAAKGIDASFKEASEGYASVLLGNPDMLETTGIPAHVFQPFVEQWNDTSYDMMDVANRFAQELRLQAQRSDDPELLEKRIGNVIRQFAERALEEIETFKASQADQGRVFRINLEGLDVAAMQAEWHRLSNDPDARYQLLTKNCSSTVAKVLKAGGADKLIGHTWLPKFGVWTPTELFNFGQALQEAQLEIAAKKQSHQVTDVLDALSGNEKPKENVAIENDGTPPRDKESLSPLTRFLNNELYGDKEARRKIGEITQTLLDHAVEKGESQKITLQGEAGRLTGYYHQGTAPREGETSTTSGKVVLFLHGSGSSAEEQASAIRNHYQKQGIDMLAVNLRGYGESDGGPSEKGLYQDARTMFNYLVNDKGIDPSNIIIHGYSMGGPIAADLARYAAQNGQAVSGLLLDRPMPSMTKAITAHEVANPAGIVGAIAKAVNGQFSVEKNLEGLPKETSILLLTDNEGLGNEGEKLRTKLTASGYNVTGEQTFYGHEASNRLMSQYADQIVSGLSSSASVDEDLDQQGLDTTSTKDQGISNKNDHLQVVDSKEALADGKILHNQDVNSWGPITVTPTTDGGETRFDGQIIVQMENDDVVAKAAANLAGKHPESSVVVQLDSDGNYRVVYGDPSKLDGKLRWQLVGHGRDHSESNNTRLSGYSADELAVKLAKFQQSFNQAENINNKPYHISIVGCSLVSDDKQKGFGHQFINAMDANGLRVDVSVRSSELAVDEAGRKHTKDANGDWVQKAENNKVSLSWDAQGEVVAKDERIRNGIAEGDIDLSRIGVNNVDEPARGAIGDNSDVFDAPEKRIPETEVIANSSSSNQLSYSGNIQVNVGEGEFTAVNWGTSNVGIKVGTGGFKSLAFGDNNVMVHIGDGESKHSVDIGGYQALEGAQMFLGNRNVSFNFGHSNDLILMMDKSIPTPPLVNPFDGAARISGVLQGIAMSGDGEDWLAAQEQQWTLSGAKKFVKDMSGLDQSSSVDYTTLVELDSQNERDSRGLKHDAEATLNKQYNQWLSGNGNSGTSQLSRADKLRQANEKLAFNFAVGGQGADIQVTTGNWNFMFGDNIQSILDTNLGSLFGLMTQQFSATGQAKTTFTYTPEDLPRQLKNKLLGQLAGVGAETTLADIFGVDYTASGHIVSRNGEAVDGVAILKEMLEVIGEFSGDQLQAFVDPTKLLNSLEAGIDMGADGIKSFAETHGLKEKAPEEEKDNSSVSVNGANVNSAQGATMADGNTETAETQDRAFGFNSLNLPNLFATIFSQDKQKEMKSLVENLKQNLTADLLNMKEKTFDFLRNSGHLQGDGDINISLGNYNFNWGGDGKDLGAYLGDNNNFWGGRGDDVFYATGTSNIFTGGEGNDMGVLMGRENMMFGGDGNDTAVVAGRINHVFLGAGDDQSFVFGEGGEIDTGSGRDYVVTSGNFNRVDTGDDQDYSVTIGNNNQVELGSGNDFANVFGNYNRINAGAGNDVVKLMGYHAVLNGGDGDDHLIAAAISKFSQFNGGEGRDLMVLGGYQNTFKGGTDVDSFVVSGDVIDNLVEDIRSEDNIVFNGIDWQKLWFERSGYDLKLSILRDPSNDSDQSKFEHIGSVTFSDYFNGNRAQVVIGMSEKDLSGEREYTMLSDSAIDALVQAMSGFEPQAGDNGFIDSLESKSQAAISMAWSDVVHKKGLMV >NZ_CP028892|1511262:1535307|1512834_1513179_-|WP_095477137.1|DBSCAN-SWA MQFLLDLLGAIGNAGDTVVEFFKSIPDYFEQFVIWCNAWYVKLKLTWLILSLELAYKTAEYLLNDIGFNDMLASFFNALPDELRYYAFLFKIPQAIGIYFNCMATAFVWKITRF >NZ_CP028892|1511262:1535307|1514735_1514945_-|WP_000672169.1|DBSCAN-SWA MKFRNMAKKFGVVAATVLPASFAFAEDPISDAIKAGVSSGQGNYTLVVVGLIAMAALGFGLRMIVGAMK >NZ_CP028892|1511262:1535307|1516867_1517422_-|WP_108243804.1|DBSCAN-SWA MQALNLDNAIILDTETTGLDSDARIVEVSIIDAQTGIKLYSSLVNPLCSISPEVTAIHGITNDMVIDMPTFDKVWNDIKGYLSDSVIYVYNLDFDYRMFAQSLRPFGYPVANLVHFFGGGVCVMRWYANFYGSLCGNDVTPRFQRLTNACYQQDVDTSDLTAHRALADCEMTRRLINAVNGKLA >NZ_CP028892|1511262:1535307|1511461_1512832_-|WP_000974159.1|DBSCAN-SWA MAIFIRTGANGSYKSAYVAYFVIYEALKAGRVVVTNLEGMQPLDEIERRFDMQFPSTARLIRIFSRDKNGIELWQHFFCWCPIGALIVIDECQDIFSKNIGFRFEKVFYRPLAEFLPKLPPDYESFFNSRYVPADMSQLQACESDDRGVAEYDSEGRIIYPLSFNEGFMRHRKYNWDIHLLSPDWGQIDSAIRACAEECYFHKGRDAYFWAVRKPYIYKHAKNTSTPVIPKGKDPNVTTKKIPLDAFLLYKSTSTGNAQNGKGVNMILSNPKIMVVLLIGILGMGYFLYGLSGLVFGSSSSVANTAAQTSNTSPTSDPSTVGHQTSGQNAPALPSGGNGGQASALSPAPSHRIDTIKQMLGLYDLQNLYYTGHTTRQSDKGFQFFVTLEAKTPEGTYYLDDSFLRANDIAYVHYDDCLLKLTKENITINVTCKPMLREPVAELQGQPQQVKLGALF >NZ_CP028892|1511262:1535307|1533201_1535307_+|WP_157951529.1|DBSCAN-SWA MSNNERLTLICIHFYLSIISGNREKIKDESHLTDTENFKEELDKIQKEHKVKIRTKISQFKNIESLKTPAILFDQHDLPFILAKTNKDKCLIQRPNKETPEVISSHELNSTWNKKSLVIQQAQSRFDITWFIPEFLQHKRVLSEILLFSFVLQILALISPLFFQVVMDKVLVHQAWSTLDVLVFGLVITGVIEVVLRGLREYQYAHTANRIDIQLGLKLVQHLFGLPLMFFKSRQVGAIVTRVRELDTIREFLTGSMFTLTVELLFMFVFLYVMSLLSAPLTGLFIATVPCYVLLAWWLTPRMQAAIEKQFSHAAANTSFLTETVAGSETLKSLAVEPRFIRRWDEQTEKMVTTGYDVQQLNNRSNHLVQLLQKITSVAILWLGATEVLSLEMTIGQLIAFNMMTNHIAQPLARMVELWGQFIQTRVAIEKLGDMLNLPVEQHTGSDNVTISGAISFKNILFRYQPDIPPTINDLSLDIRAGETLGVVGTSGSGKSTLARLLLRLYSPEQGSITIDGIPLNHINVQQLRQRVGVVLQENFLFHKSVSENIAQSKPEASLEEIIEAAKLSGAHDFILKLPMGYDTVLAEGGQSLSGGQRQRLAIARTLLSDPKVLILDEATSALDDESQAVIQANMASIARGRTVITIAHRLSTVRDCDRIIVLHQGTIVEQGSHQQLLAYGKQYKQLWQLQQELKQEEASA >NZ_CP028892|1511262:1535307|1514978_1515173_-|WP_032468570.1|DBSCAN-SWA MLAPQGFDCTYVVLTPSELDEIRNTSLGSVTIDPDIYYHVSGYLLLSFLSGHVLGRILKTMGRA |
15 | Vibrio_phage(84.62%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
2280483 : 2287676
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP028892|2280483:2287676|DBSCAN-SWA ACTATGGACGAGGGCAGATTTCTGATTCTGGAAAGAAGAATTCAATTTCACGAGCGGCTGATGCAGGACTGTCGCTACCGTGTACTGAGTTGTAACGCATGCTCAGTGCATAATCGGCACGCAGAGTACCGCAGGCGGCTTCTTCTGGGTTAGTTTTCCCCATTAATTCACGGTAACGTGCAATTGCGTTTTCACCCTCTAAGACCTGCACCATGATCGGGCCTGAGGTCATAAATTCTTTCAGAGGTTCAAAAAACGGTTTGCCTTCATGCTCAGCGTAAAAGCCACTGGCTTGCTCTTCCGACAGGTGAACCATTTTCGCGGCAATGATCTGCAAACCGGCTTTTTCAATGCGGTGGTAAATCTCACCAATCAAGTTACGCTTCACCGCATCCGGCTTAATGATCGAAAATGTTCTTTCTAGAGCCATAAAATTTCCTTTTGTTTATTGTGCTTATGTTATACCCAAGCGATTTGGCGTCACAGGTAGTCGGCAAGGGAGTTCACCTCCATCAGCATAGACACACTATGTGATTAGGATGAACGAACAGAGCCAACACCGCTGCAGCGTCAAATTGGAAGGGTATACTCTTATTAATAAAAACAAGGCAGCTATACAAGCTGCCTTTAAATTTATGCTTGCTGCATCAAAATACGTGCGAGCATTTTCACGCCCATGCCCGTGGCTCCTGCAGCCCATTTATCGCTGGCTGACTTGCGATACGTGGCTGAACAGTCGAAGTGTAGCCAACCTTTTTGGTAGCCTTCCACGAAATAGGAAAGGAAGGCCGCTGCGGTGCTAGCTCCCGGCGTGTAATCGCCGTTACTGATGTTTGACAGATCCGCAAAGTTAGAAGGCAGCATTTCACGGTGGAACTCAGCTAAAGGCAGAGCCCACAGCGCTTCATTCTCTTCTTTTGCCGCAGCTAATGCTTGTTGGCTCAGCGACTCATCATAAGAAAGCAGTGCGTGGTAATCATTACCCAGCGCGTTTTTCGCCGCTCCGGTTAAGGTTGCACAGTCGATAATCAATTGCGGTTTCTGCTCGCTGGCGTAGATAAGACCATCGGCCAGCACCAAACGACCTTCCGCATCGGTGTTCATGATTTCAACGGTTTTGCCATTTTTGTAGGTGATGATGTCACCAAGCTTCAACGCACGGCCGGAAACCATGTTTTCCGCGCAGCATAGAATGAGTTTCACGCGCTTGTTGAAGCCGCGCATGATAGCCAAACCAAGCGCACCAGTGATCATGCCTGAGCCGCCCATGTCCGCTTTCATGGCTGACATCATGTTGGAAGGTTTTAAGCTGTAACCGCCCGAGTCAAAAGTGATGCCTTTACCGACTAAACACGCGAATACCGGTGCATTTTCATCGCCCGTTGGGTTGTAGTCAAGTTGCAGCATCGCCGACGTACGCTCAGAGCCACGGCCTACAGCGTAAATGCCCTCCCACCCTTCAGTGAGCAGATCTTTGTCTTTGACGATACGGTAAGAGACATGATCAGGTGCAAGCGATTTGATGAACTCTGCTGCCATGGTCGCCAATTGGCGTGGCGCAACTTCTTCCGCGCTTTTGTTGATGATATCGCGTGTCCAATCCGTCGCTTTAATGCGCGCTTCTAGCTCTACTTGTTCAGCAGCCGCTAACACTTTCCACTCGACGGTATTGCGCTTTTTCGCATCACGGTAACCTTGATAGAAAGCCCAAATGCTCTCCAGATCCCAACCTTCTCCAGCCAGGAATGCCGTGCGGATACCTTGATTATCTAACTTACGAGCTGCGCGTTGGATCGCGCTGAAATCTTGTGTTTGTTGCAGGTGAATGGTTGCACCTTGCTCTGCAAAAGAAATGAGCGCTTTCTCTCCCCACTGAGGCTGGGCAGCTTGATTACTCAAAAATACAGACATCTGTGTAGACATGGTTTCTCCTTGTCTTAAATATTGGCTTTGCCTGTGCCATCTCATTAAATCGCCAAGATGTTAGCATTATGTCGAGGCAAAATGTCAACCGAATAAAAAAACGGACCAAGTGGTCCGCATTTTTTGCTAAGAAGTGTTAACTCTGTGGGGTTTCTCCCCCGGCTAAACAAAGCAAGCGCCATATTGATTGACCACTCACTTCATTTTGCATGAGATGAATTACTCGAATTCATCCATCCAGCACAGAATGACGGCTTCAAGGATTTTTTCGTTGGAATGATTCGGATCATCGTCAAAATCTTCCAACTCCATGATCCAACGATGCAGATCGGTAAAACGTACGGTTTTGGGATCGACATCGGGAAACTTGTCACACAGCTCAATCGCGATATCGCGTGAATCTGTCCATTTCATGGTTAGCCTTCCTATTGGTTTTACCTCGGCTGCGACCGAGGCTAAGAATTAATGATCTTCGCTTGCGTGGTTTAGCGTGTATTTCGGGATCTCGACGACGAGATCTTCATCCGCCACTCGAGCCTGACAGCTGAGACGAGATTCTGGCTCCAGCCCCCAAGCTTTATCCAGCATATCGTCTTCCAGCTCATCACTCTCTTCCAGTGAATCAAACCCTTCTCGAACGATACAGTGGCAAGTAGTACAAGCGCACGATTTCTCACACGCATGCTCAATAGCGATACCGTTTTTGAGTGCCACATCAAGAATCGTTTCCCCGGTTTGCGCTTCTAGCACTGCGCCTTCCGGGCAAAGGGTTTCATGAGGCAAAACAATAATCTTAGGCATGTTGGTAATCTCTTCAAATCTCGTCAACGGAGTGGCCAGCCAGTGCGCTACGAATCGATTTATCCATTCGTCTTGACGCAAAATCCTGACTCGCTTTGTCTGTGTCTTTTATTCCTTGCTCAATGGCATCCGCATTATCGCCATTGCGCAGTTCAATCAAGCGCTCGATCGCCTGTAATAAAGTTTGGCGCTCTTGCTCGCTCAGCAGCTCATCACCATCCGCTTGCAGTGCTGAAACTAAGCCTTCAATCACGCGATCGGCTTCTACGCGCTGCTCTGCCAACGCACGCGCCAGCATATCCTCTTTAGCATAGGCCATGGAGTCTTTGAGCATCTGGGTGACTTCGTCATCGCTTAAACCATAAGATGGCTTAACCTGAATCTCTGCTTGTACACCGGTGCTTTTTTCCAAAGCCGTCACGGAAAGTAAACCATCCGCATCCACTTGATAAGTGACGCGAATGTGCGCAGCACCTGCCGCCATCGGCGGTATACCTTTGAGTGAGAAGCGTGCCAGAGAGCGGCAGTCATCGACCATTTCACGCTCCCCTTGTACCACATGCACACTCATCGCCGTTTGCCCATCTTTGAAGGTGGTAAACTCTTGCGCGCGAGCGACAGGAATGGTGGTGTTGCGTGGGATGATTTTTTCGACCAAACCGCCCATGGTTTCAATCCCTAAAGAGAGTGGGATGACATCGAGCAGCAACATTTCTGCATCCGGCTTATTGCCGGCCAGAATGTCCGCCTGAATCGCCGCACCAATCGCGACCACTTCATCCGGGTTAATGCTGGTCAGTGGTGTGCGACCAAAAAACTCCCCCACTTGCTCGCGCACAAACGGTGTGCGGGTTGAACCGCCCACCATCACGACTTCGAGCACCTCATCCGCTTCAACACCGGCATCTTTCAATGCGCGGCGGCAAGAAAGTAGCGTCTTTTTCAACAATGGTGCAATCAGGTTTTCCAACTCTTCACGAGTAAACGTACCTTGCCAGCCAAGCACATTCAGCTCAGCCGTCATGTGTTCAGTCAGATCGATTTTGGCTTGTGTCGCGGCGTTAATCAGCGCGCGCTGCTGCTCAGCGGTTAAGCTGGTGAGTCCGATTTGTGCTTGCAGGTGATCGGCGATCAGATGGTCAAAATCATCACCGCCCAGTGCAGAGTCACCACCGGTTGCGAGCACTTCAAACACGCCGCGTGAAAGGCGTAAAATCGAAATATCGAAGGTGCCGCCACCTAGATCGTACACCGCGATCACCCCTTCTTGGCCTGAGTCCAAACCATAAGCAATCGCCGCTGCTGTCGGCTCATTAAGTAAACGCAGCACATGCAAGCCCGCTAACGCCGCAGCATCTTTGGTGGCAACCCGCTGCGCATCATCAAAATACGCAGGTACCGTGATGACCACTCCCGCCAATTCGCCACCTAAGGTGGCGGTGGCACGTTCTGCCAAGGCTTTCAAAATATCGGCAGAGATTTGAATAGGGTTTTTATCCCCTTGCGCGGTTTGCACAATCGGCAAGCCTTTCTCACTCGCTTTGAAGCGGTAAGGCAGATGAGGATAACGCTGATTGATGTCTTGCAGTGAACGTCCAAGCAGACGTTTCACCGAAATCACAGTGTTGTGCGGATCGGTTTCAGCTTGTTCACGAGCGGGATAGCCGACACGAGTCGCATCAGCACCATAGTTCACCACTGAGGGGAGAATACTACGCCCTTGGCTATCGACGAGTGTGCTTGCGGTTCCGCTGCGCACAGAAGCGACCAGTGAATTCGTGGTGCCTAAATCAATACCCGCAGCCAACTTATGCTGGTGTGGTGCCGAACTTTGCCCCGGCTCTGCAATCTGAAGTAATGCCATCAGTTGTTCCTTTGAATCTTGAATTAGCCAAGCAGTTGGTCTTCGACTCGTTCTACTTCATTTTTGAGTTTGGCAATAAATTTGAGCTTTCTGATCTGATCAGCCGCCGCTAACCACTCACTTTGAGATAATTGGCCTTGTAGCTGTGCGAGATAATGACGCTGCATCGCGGTTACCTTAGTGTCAAAAGCGACTAAGGCGGCCTCTGGATCTGCACAAGCCGTCACGGACTCTAGTTCTTCGCGTAACTCCATCTGTTCCATCAGAAACATCGGATCTTGCAGCGTCTGCTGCTCAGCGTTCATCTCTATGCCTTGTAACGACAACAGATATTCCGCGCGGCGGAGTGAATCTTTCAGTGTCTGATACGCATCGTTAATTTGCGCGGCTTGCTGCACTGCCATTAGGCGGTCACGCTCAGAAGCGGTAGCAAAATTATCCGGATGGAATCGTTTCTGCAACGCTCTGAACTGAGAAGAAAGAAGGCTACCATCCAGTTCAAACTGGATCGGCAGCCCAAATAATTCAAAATAATTCATCTTGATAGCGGTCCTGTGATCGAGAACAGTCTCGATGGTCAGTTACCGTTAAACGTTGAAGCTTTCACCACAACCACATTCGCTTTTGACATTTGGGTTGTTGAATTCAAAGCCTTCGTTTAACCCTTCTTTTTTGTAATCGAGTTGAGTGCCATCTAAATAGACCATGCTTTTGGTGTCAATGATCACCTTCACACCATGGCTTTCAAATACGTGGTCTTCTTCGTTGAGATCATCAACAAACTCGAGCACATACGCCATTCCCGAACACCCAGTTGTCTTGACACCAAGACGTAAACCGATGCCTTTTCCACGATTATCCAGAAAAGTTTTTACTCGGCTTGCTGCGGTTTCGCTAAGAGTGACGGCCATACTTCAACCTTATGATTTCAATAAATTAAAGGGTGGGAGCGCTCGACGCTCCCAAACGAATTAGTGTTGATGCTTTTTCTTGTAGTCCGCAACAGCGGCTTTGATAGCATCTTCGGCTAGGATTGAGCAGTGAATTTTCACTGGTGGCAATTCGAGCTCTTCGGCGATTTCGCTGTTCTTGATCGCTGCCGCTTCATCAATGCTCTTGCCTTTCACCCACTCAGTCACTAGCGAGCTAGAAGCAATTGCGCTACCACAGCCGTAAGTTTTAAATTTTGCATCTTCGATGATGCCTTCTGGTGATACTTTGATCTGCAAACGCATTACATCACCACACGCAGGTGCGCCAACCATGCCGCTACCTACAGAAGGATCTTCTTTATCAAATGAACCAACGTTACGTGGGTTCTCGTAATGGTCAATGACCTTTTCGCTGTATGCCATGTTCGTTACCTCGAATCCTTTATAACCTTGGAATGATTAGTGGTGTGCCCACTCAACCGTGTTCAGATCTACGCCGTCTTTGTACATGTCCCAAAGTGGAGACATCGCACGCAGTTTATCCACTGCTACGCGGATCAGTTCGATCGCATAATCAATCTCTGCCTCGGTCGTGAAACGACCAAAAGAGAAGCGAATTGAGCTGTGTGCCAGTTCATCGTTAAGACCCAGAGCGCGCAATACGTAAGAAGGCTCAAGGCTGGCTGAAGTACATGCACTGCCTGACGATACGGCCAGATCTTTCAGTGCCATCAGCAGAGATTCACCTTCTACAAACGCAAAACTGACGTTGAGATTGTGTGGTACACGCTGATCCAGATCACCGTTAATGGTCACGGCTTCCATATCTTTAATGCCATCAAGCAGACGATTACGCAGCTTCAGTGCGTGATCGTAATCTTGCTGCAGCTCTTCCTTCGCGATACGGAACGCTTCACCCATACCCACAATTTGGTGAGTCGGCAGAGTACCAGAACGGAAACCACGTTCATGGCCACCACCGTGCATTTGTGCTTCAAGACGAATACGTGGTTTACGGCGAACATACAAAGCACCAATACCTTTCGGGCCGTAGGCTTTGTGCGCAGACAGAGAAATCAGATCAACCTTCATCTCTTGGACATCAATCGCTACTTTACCCGCGGATTGCGCTGCATCAACATGGAACACCACTTTACGTGAACGGCACAGTTCGCCGATCGCAGCGATATCTTGCACCACGCCGATTTCGTTGTTCACGTGCATGATAGAAACCAAAATAGTGTCATCACGCATCGCGGCTTCAAGCTTAGCCAGATCAATCAAGCCATTGCTTTCTGGATCAAGATAAGTCACTTCAAAGCCTTCACGCTCCAATTGGCGCATGGTATCCAGTACCGCTTTGTGTTCGGTTTTGCTGGTGATGATGTGCTTACCTTGCTTGTTATAGAAGTGAGCAACACCTTTGATCGCGAGGTTGTCAGATTCAGTAGCACCCGAAGTGAACACAATTTCACGTGGGTCTGCATTCAGCAGGGCTGCGATTTGCTCACGCGCAGTGTCTACCGCTTCTTCTGCCTGCCAGCCATAACGGTGCGAGCGCGAAGCTGGGTTACCGAAGGTACCATCCATCGTCATGTACTGAACCATTTTTTCAGCTACGCGCGGATCAACCGGGCATGTCGCTGAATAATCTAGATAAATAGGCAGTTTCAT
Protein sequences of DBSCAN-SWA_3 >NZ_CP028892|2280483:2287676|2286041_2286425_-|WP_000331703.1|DBSCAN-SWA MAYSEKVIDHYENPRNVGSFDKEDPSVGSGMVGAPACGDVMRLQIKVSPEGIIEDAKFKTYGCGSAIASSSLVTEWVKGKSIDEAAAIKNSEIAEELELPPVKIHCSILAEDAIKAAVADYKKKHQH >NZ_CP028892|2280483:2287676|2285093_2285609_-|WP_002043422.1|DBSCAN-SWA MNYFELFGLPIQFELDGSLLSSQFRALQKRFHPDNFATASERDRLMAVQQAAQINDAYQTLKDSLRRAEYLLSLQGIEMNAEQQTLQDPMFLMEQMELREELESVTACADPEAALVAFDTKVTAMQRHYLAQLQGQLSQSEWLAAADQIRKLKFIAKLKNEVERVEDQLLG >NZ_CP028892|2280483:2287676|2282624_2282819_-|WP_000872174.1|DBSCAN-SWA MKWTDSRDIAIELCDKFPDVDPKTVRFTDLHRWIMELEDFDDDPNHSNEKILEAVILCWMDEFE >NZ_CP028892|2280483:2287676|2286461_2287676_-|WP_000775249.1|DBSCAN-SWA MKLPIYLDYSATCPVDPRVAEKMVQYMTMDGTFGNPASRSHRYGWQAEEAVDTAREQIAALLNADPREIVFTSGATESDNLAIKGVAHFYNKQGKHIITSKTEHKAVLDTMRQLEREGFEVTYLDPESNGLIDLAKLEAAMRDDTILVSIMHVNNEIGVVQDIAAIGELCRSRKVVFHVDAAQSAGKVAIDVQEMKVDLISLSAHKAYGPKGIGALYVRRKPRIRLEAQMHGGGHERGFRSGTLPTHQIVGMGEAFRIAKEELQQDYDHALKLRNRLLDGIKDMEAVTINGDLDQRVPHNLNVSFAFVEGESLLMALKDLAVSSGSACTSASLEPSYVLRALGLNDELAHSSIRFSFGRFTTEAEIDYAIELIRVAVDKLRAMSPLWDMYKDGVDLNTVEWAHH >NZ_CP028892|2280483:2287676|2281115_2282405_-|WP_108243885.1|DBSCAN-SWA MSTQMSVFLSNQAAQPQWGEKALISFAEQGATIHLQQTQDFSAIQRAARKLDNQGIRTAFLAGEGWDLESIWAFYQGYRDAKKRNTVEWKVLAAAEQVELEARIKATDWTRDIINKSAEEVAPRQLATMAAEFIKSLAPDHVSYRIVKDKDLLTEGWEGIYAVGRGSERTSAMLQLDYNPTGDENAPVFACLVGKGITFDSGGYSLKPSNMMSAMKADMGGSGMITGALGLAIMRGFNKRVKLILCCAENMVSGRALKLGDIITYKNGKTVEIMNTDAEGRLVLADGLIYASEQKPQLIIDCATLTGAAKNALGNDYHALLSYDESLSQQALAAAKEENEALWALPLAEFHREMLPSNFADLSNISNGDYTPGASTAAAFLSYFVEGYQKGWLHFDCSATYRKSASDKWAAGATGMGVKMLARILMQQA >NZ_CP028892|2280483:2287676|2285657_2285981_-|WP_000301571.1|DBSCAN-SWA MAVTLSETAASRVKTFLDNRGKGIGLRLGVKTTGCSGMAYVLEFVDDLNEEDHVFESHGVKVIIDTKSMVYLDGTQLDYKKEGLNEGFEFNNPNVKSECGCGESFNV >NZ_CP028892|2280483:2287676|2283219_2285070_-|WP_001196554.1|DBSCAN-SWA MALLQIAEPGQSSAPHQHKLAAGIDLGTTNSLVASVRSGTASTLVDSQGRSILPSVVNYGADATRVGYPAREQAETDPHNTVISVKRLLGRSLQDINQRYPHLPYRFKASEKGLPIVQTAQGDKNPIQISADILKALAERATATLGGELAGVVITVPAYFDDAQRVATKDAAALAGLHVLRLLNEPTAAAIAYGLDSGQEGVIAVYDLGGGTFDISILRLSRGVFEVLATGGDSALGGDDFDHLIADHLQAQIGLTSLTAEQQRALINAATQAKIDLTEHMTAELNVLGWQGTFTREELENLIAPLLKKTLLSCRRALKDAGVEADEVLEVVMVGGSTRTPFVREQVGEFFGRTPLTSINPDEVVAIGAAIQADILAGNKPDAEMLLLDVIPLSLGIETMGGLVEKIIPRNTTIPVARAQEFTTFKDGQTAMSVHVVQGEREMVDDCRSLARFSLKGIPPMAAGAAHIRVTYQVDADGLLSVTALEKSTGVQAEIQVKPSYGLSDDEVTQMLKDSMAYAKEDMLARALAEQRVEADRVIEGLVSALQADGDELLSEQERQTLLQAIERLIELRNGDNADAIEQGIKDTDKASQDFASRRMDKSIRSALAGHSVDEI >NZ_CP028892|2280483:2287676|2280483_2280912_-|WP_001162850.1|DBSCAN-SWA MALERTFSIIKPDAVKRNLIGEIYHRIEKAGLQIIAAKMVHLSEEQASGFYAEHEGKPFFEPLKEFMTSGPIMVQVLEGENAIARYRELMGKTNPEEAACGTLRADYALSMRYNSVHGSDSPASAAREIEFFFPESEICPRP >NZ_CP028892|2280483:2287676|2282867_2283206_-|WP_001124187.1|DBSCAN-SWA MPKIIVLPHETLCPEGAVLEAQTGETILDVALKNGIAIEHACEKSCACTTCHCIVREGFDSLEESDELEDDMLDKAWGLEPESRLSCQARVADEDLVVEIPKYTLNHASEDH |
9 | Anguillid_herpesvirus(16.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
2519142 : 2526367
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_CP028892|2519142:2526367|DBSCAN-SWA TGTGTCTGATTTTCCCACTCTTGAAGATTATGTAGGTCAAACGCCCTTGGTGCGTTTACAGCGCCTCAATGCAGGCTGCTCTACAGTATTGGTGAAGCTCGAAGGCAATAATCCAGCGGGTTCCGTGAAAGATAGACCAGCGCTTAATATGATTGTGCAAGCCGAGGCGCGCGGCAGTTTGCAGCCCGGCGATACCATCATCGAAGCCACCAGTGGTAATACTGGCATAGCGTTGGCTATGGCTGCCGCCATCAAAGGCTACAAGATGATTCTGATCATGCCCGATAACGCCACTCAAGAGCGCAAAGATTCGATGCGCGCGTATGGCGCTGAGCTGATTTTGGTTAGCAAAGAACAAGGGATGGAAGGCGCACGCGATTTAGCCTTACAGATGCAGCAAGAAGGCAAAGGCAAGGTATTGGATCAATTCAATAACTTGGATAACCCTGACGCACATTTTCGCTCCACTGGCCCAGAAATCTGGCAGCAAAGCCAAGGCAAAATCACTCATTTTGTTTCAAGCATGGGCACCACAGGCACCATAATGGGCGTCTCTCGCTACCTGAAACAGCAAAATCCGCAGATCCAGATCATCGGCTTACAACCCTCGGAAGGCAGCGCGATTCCCGGTATTCGCCGTTGGCCGCAAGCCTACCTCCCCGGCATTTTTGATGCTGCACGAGTCGATCAGGTGCTGGACGTGACTCAAACTGACGCCGAACAGACCGCACGCGCCCTTGCGCGTGAAGAAGGAATTTGCGCCGGCGTTAGCTCTGGCGGAGCCGTGTTTGCCGCTTTGCAGATTGCCCAGCAAAATCCGGGATCCGTAGTTGTAGCGATTGTGTGCGATCGTGGCGACCGTTATTTATCCTCAGGATTATTTTCCTAACTGAGCTTTCCGACACTCGGCTTAGGCTGAGTGTCGAAACCGGTTTGACTTGGTATCCATATCACGCTCCCTACGCGCTTTTCGGTAAACCTCTTGCGGCGATGGCCTGAAGAGCCGTAAGGTCATTACCATGCAAAGCAAGAGCTCAATTGCCCCGCCCTAGTCAGGTTTGCTATTCTCTGTAACCGTTTTTAAGTGACCCATGCCCGCCTTGTGATGAAATGCCGCAAGGCCTGCGCAGATGATGATAAGCAGCGATATAAGATTATGATGAAATCGAACGCCTCACCGAGCGAGTCTCTTTCGCACCACACCCCGATGATGCAACAGTATTTAAGACTCAAGGCGGAAAACCCAGACATTCTGCTGTTCTATCGCATGGGTGATTTTTACGAACTGTTTTATGACGATGCCAAACGCGCCTCTGAGCTGCTGGACATTTCCTTAACCAAACGCGGCGCATCGGCGGGTGAACCCATTCCTATGGCGGGTGTGCCTTTTCATGCCGTAGAAGGTTATTTAGCGAAACTCGTCCAAATGGGTGAATCGGTCGCGATCTGCGAACAGATTGGCGATCCAGCCACCAGCAAAGGCCCCGTTGAGCGCAAAGTGGTGCGGATTGTGACCCCGGGAACCGTGACCGATGAAGCCCTGCTCTCAGAGCGAGTCGATAACCTGATCGCGGCGATTTATCATCATAACGGCCGCTTTGGCTACGCGACCATGGATATCACCTCCGGCCGTTTTCAGCTCAGTGAACCGCAAACCGAAGAAGAGATGGCCGCCGAGCTGCAACGCACTTCGCCTCGTGAACTCTTGTTCCCTGAAGATTTCGCACCAGTGCATTTAATGGCCAATCGCCAAGGCAACCGCCGTCGTCCAATTTGGGAATTTGAATTAGCTACCGCGAAACAGCAGCTCAACCAGCAATTCGGCACGCGCGATCTGGTTGGCTTTGGGGTTGAGCAAGCAAAACTTGGCTTGTGCGCAGCAGGCTGCTTGATCCAATACGTAAAAGATACTCAGCGTACTGCTTTGCCACACATCCGTTCACTGACTTGGGATCGCCAAGATCAATCGGTGATTTTGGATGCCGCGACGCGGCGCAACCTTGAGCTCACTCATAACTTGGCGGGTGGAACAGATAACACGCTTGCAGAAGTACTCGACCATTGTGCGACCCCGATGGGCAGTCGTATGCTCAAACGTTGGATCCACCAACCGATGCGCGATAACGCCACCCTCAATCAGCGTTTAGATGCGATCACTGAGCTCAAAGAAACCGCTTTGTATGGGGAACTGCATCCTGTACTCAAGCAGATTGGCGATATTGAACGGATCCTCGCCCGTTTAGCGCTGCGCTCAGCGCGGCCGCGCGATTTAGCTCGCCTACGCCACGCCATGCAGCAGTTACCAGAATTGCACTCGGTCATGAGCGAGCTTAAGCAGCCTCACCTTACCGAGCTACGCACCCATGCCGAGCCGATGAATGAATTGTGCGATTTACTCGAACGTGCGATCAAAGAAAACCCACCCGTGGTGATTCGTGATGGTGGCGTGATCGCCGATGGTTACAGTGCCGAACTCGATGAATGGCGCGATTTAGCCAATGGCGCAACCGAATTTCTGGAACGCTTGGAAGCCGAAGAGCGCGATCGTCACGGCATTGATACCCTGAAAGTGGGTTATAACAATGTGCACGGTTTCTACATTCAAGTGAGCCGGGGTCAGAGCCATTTAGTGCCTCCCCACTATGTACGCCGTCAAACCTTAAAAAATGCTGAGCGTTACATCATCGAAGAACTTAAACAGCATGAAGATAAAGTGCTCAATTCTAAGTCTCGTGCTTTAGCGTTAGAAAAACAGCTGTGGGAAGAGTTGTTCGATTTGCTGATGCCGCATCTTGAGCAGCTGCAACAACTGGCGGCTTCCGTTGCTCAATTGGATGTGCTGCAAAACCTCGCAGAGCGCGCAGAAAACTTGGAATATTGTCGCCCAACGCTGGTTCAAGAAGCGGGCATTCACATCCAAGGTGGCCGCCACCCTGTGGTAGAGCGAGTAATGAATGAGCCGTTTATCGCCAACCCGATCGAACTTAATCCACAGCGACGCATGCTGATCATTACTGGTCCGAATATGGGCGGTAAATCGACTTACATGCGTCAAACCGCCTTGATTGCACTGATGGCGCATATCGGCAGTTATGTGCCTGCTGAAAGTGCGTCAATCGGCCCACTGGATCGTATTTTTACCCGTATCGGTGCTTCGGACGATCTCGCCTCTGGTCGCTCAACCTTTATGGTGGAGATGACGGAGACCGCCAATATTTTGCACAATGCGACTCGTAACAGTTTGGTGTTGATGGATGAGATTGGCCGCGGTACCAGTACTTATGACGGACTCTCGCTCGCTTGGGCGAGTGCCGAGTGGCTTGCCAAAGAGATTGGTGCCATGACGCTGTTTGCCACTCACTATTTTGAGTTAACCGAACTGCCGAATGTGTTACCTCATCTGGCGAATGTCCATTTAGACGCGGTTGAACATGGTGATGGCATCGCCTTTATGCACGCAGTGCAAGAGGGTGCCGCGAGTAAATCGTATGGTTTAGCCGTAGCAGGACTGGCTGGCGTACCCAAGCCCGTGATTAAAAATGCGCGCGCCAAGTTACAACAGCTTGAGCTATTAAGCTCACAACCTGCCGAGACTCGTAAGCCAAGCCGCGTCGATATTGCCAACCAGTTGAGCTTAATTCCAGAGCCGAGTGCCGTTGAGCAAGCCTTGGCAGGCGTAGATCCTGACCAGCTTACACCTCGCCAAGCTCTGGATATGCTCTATCAATTGAAAAAGCTGCTCTAGTTATTGCCCATATCTCAAGCATGGAATCTACAGATTCCAGAAACAACAAGGCACCCCACTGGGTGCCTTAGTTTTGGATGAGTCTGGAAAACATTAGTTGTCGTATTCGACGTTAAACAGCGCTTCCATATTCAAACCTTGTTTCACCAAAATCTCACGCAGACGACGTAGACCTTCCACTTGAATTTGGCGAACACGCTCACGAGTGAGATTGATCTCACGACCTACTTCTTCCAAGGTCGATGGTTCATAGCCAAGAAGCCCAAAGCGACGAGCAAGCACTTCTTTTTGCTTTGGATTAAGTTCATCCAACCAGTTGAGCAGCGATTCACGAATGTCATCATCTTGAGTTGAAAACTCAGGATCGGCATTGTGAGAGTCTGGCAAAATATCCAGCAGTGCCTTATCTCCATCACCACCAATTGGCGTATCCACTGAGCTGATCCGTTCGTTAAGACGCAGCATCTTAGTGACATCATCGACAGGTCGGTCTAACTCAAGAGCAATTTCTTCTGGCGTAGGTTCGTGGTCAAGGCGCTGTGATAATTCGCGAGCAGTACGCAAATAAATGTTCAGCTCTTTGACGACATGAATCGGTAGACGAATGGTGCGTGTTTGGTTCATCAGCGCACGTTCAATGGTTTGACGGATCCACCATGTTGCGTAGGTAGAGAAGCGGAAACCGCGTTCTGGATCGAATTTCTCAACCGCACGGATCAGACCAAGATTACCTTCTTCAATCAGATCGAGCAGTGCTAATCCTCGGTTGCTGTAACGGCGTGAAATTTTTACCACCAGACGCAAGTTACTTTCAATCATGCGTTTACGTGCGGCTTCATCACCACGTAAGGCACGACGAGCATAAAGCACTTCTTCTTCGGCAGTAAGGAGCGGTGAAAAACCAATTTCGCTGAGATACATCTGGGTCGCATCAAGACTTTTCGCAGAAGCATCAAACTCTTCACGAACGTCTTCACTTGCCCCTTCAACAGCAACTAATTCTTCATCACTGGCGAGCTCGGCATCAGTTTCTAGCACTTCCAGTGCTTCATCTTCAAAATCGAACTCTTCTACTTTGGTTACGGTATTGCTGACACTCATAGCGGCCTCCCCCTGGCAACTTTGCGAGTCATTGCGATTTACAACCTTTGCAGTATGACCTTAGCAATATGAGTTAAGGTAAGTAGCGTTTAGGATTCACTGACTTCCCTTGATAACGGATCTCAAAGTGCAAGCGTACGCTGTTGGTACCAGAACTTCCCATGGTAGCGATCTTCTGGCCTGCTTGCACTGTTTGTCCTTCCTTTGCTAGCAGCTGATCATTGTGGGCATAGGCACTTAAATAGTGCTCATTATGTTTTATGATGATTAGGTTGCCATAACCACGTAATGCGTTGCCCGAATACACTACGGTTCCATCTGCAGTAGCAACGACAGCCTGACCACGTTGGCCAGCGATGTCTATCCCTTTGTTGCCTTGATCGCCCGCAGAAAAATTCTTTATGACTCTACCTTTTGTCGGCCATAGCCACTTCGCTATCTTCTCATCCGAAGGTTTAGCTTTTGCTACATTAACATTAACATTTTGTTTACCGACAGGTTCAACATACTCCTTTGTTTTTGTTTGATCAACCGTCTTAACTGGATCTTTTTTAGTCAAATTTTGACTATTCGTTGACCCATTTTGTACATTTTTGGTATTAGAGCTTTTGCTCACTGTTTGAGCAACTGTTGCCGTCGTCGCCGCTTTGGCAGCACTTGCACTTGTGCTTGAAGCCACAGCCACGGTAGCTGCACCGCCCGTTCCACCATAAGCAGGAGGAGTATAATTAGGTAACCAGAGCTTAATTTTTTGCCCAGGATGAATGGTGTAAGGTGGAGCGAGATCGTTGTAACTGATCAGATCATTTACATCTTTATCTGTGAGGTAGGCAATAAAATAGAGCGTGTCGCCTTTTTTAACTTCATAGTAACTTCCGCGGTAACTACCACGCTCAACTTTGTTGTAATCTTTACCCAATCCAGAGACTGGTGCTGGCGTAGGTGCAGTACAGCCGAATAATAAGCTGCAAAACAATAATAAGCCCAGTCGGAAAGAGCAAAGCGTCATCAGGCAAGATCTCCCGCGACCAGTGGCACAAAACGCACAGCTTCAACCCGCTCAGAGATAAACTGCCCACCTTGACGCACAATTTTGTACAGATATTGCTCGTCTTCGCCCACAGGGATCACCATACGACCACCCTCAGCAAGCTGATCCAATAGACTTTGCGGCACTTTTGCCGCAGCTGCCGTCACAAGGATCGCATCAAAAGGTCCACGAGCGGGCCAACCTTGCCAGCCATCGCCATGTTTCGTCGACACGTTATAGATATCCAACTGCTTCAAGCGTCGTTTCGCATCCCATTGCAGGGTTTTGATCCGCTCAACAGTAAACACATGATTAACCAGTTTGGCGAGAACTGCGGTTTGATAACCAGAGCCCGTACCAATTTCTAGCACTTTGGTTTCTGGGGTTAACGCCAGTAGCTCTGTCATCTTCGCCACGATATAGGGTTGCGAGATGGTCTGGCCTTGACCTATGGGCAGGGCATTATTGTCATAAGCTTGATGCATCATGGCCGGTGCGACGAAAAACTCGCGTGGCAGTGCATGGATTGCCGCCAACACTTGTGGTGAGGTAATCCCCTGCTCAGTTAAAAATTGAATCAGTCTGTCGGCTTTTGGGTTAGCCATTCACCTTTTCCTTTAACCAATGATCCATGCTACGCAGTGACTCATGTGCCGTTAGGTCAACTTGTAGCGGCGTGAGTGACACCCAGCCACGTTCAATCGCATGAAAATCCGTTCCCGGACCGGCATCTTGCTCTTTACCCGGAGGACCAAGCCAATAAATATCATGCCCACGCGGATCTTTCTGCTTTATCATGCTTTCGGCGTGATGCCGCGCACCTAAGCGTGTGACTTCAATACCTTGAATGAGTTCTAAAGGTCGATCCGGAATATTCACGTTAAGTAAGCGATTGGTCGGAATCGGGTTAGCCAGATGTTGCTCGACCAATTGACGCACAAAGTGAGCAGCACTAGCAAAATGAGTAGTTCCCGCCAGAGAAAAGGCGATCGACTGCACGCCTAGAAAATGCCCTTCCATTGCCGCCGCCACCGTACCGGAATAGAGCACATCATCACCCAAGTTAGCTCCGTGATTGATTCCACTCAGTACCAGATCCGGTAGCGCGTCTTTCATCAGTTCATTGAGCGCAAAGTGTACGCAATCGGTCGGGGTACCTTGCACGGAATAGGTATTCTCGGCAATTTGCGACACTCGTAATGGATGCTCTAACGTCAGTGAATTCGAGGCACCACTGCGGTTACGATCCGGCGCGACAATCACAATCTCCGCTAAATCACGCAATGCATCCGCCAAAGCATGAATGCCTTGTGCGTAAACGCCATCATCGTTACTGAGTAGAATCTTCAT
Protein sequences of DBSCAN-SWA_4 >NZ_CP028892|2519142:2526367|2519142_2520030_+|WP_001279362.1|DBSCAN-SWA MSDFPTLEDYVGQTPLVRLQRLNAGCSTVLVKLEGNNPAGSVKDRPALNMIVQAEARGSLQPGDTIIEATSGNTGIALAMAAAIKGYKMILIMPDNATQERKDSMRAYGAELILVSKEQGMEGARDLALQMQQEGKGKVLDQFNNLDNPDAHFRSTGPEIWQQSQGKITHFVSSMGTTGTIMGVSRYLKQQNPQIQIIGLQPSEGSAIPGIRRWPQAYLPGIFDAARVDQVLDVTQTDAEQTARALAREEGICAGVSSGGAVFAALQIAQQNPGSVVVAIVCDRGDRYLSSGLFS >NZ_CP028892|2519142:2526367|2520297_2522886_+|WP_095487753.1|DBSCAN-SWA MMKSNASPSESLSHHTPMMQQYLRLKAENPDILLFYRMGDFYELFYDDAKRASELLDISLTKRGASAGEPIPMAGVPFHAVEGYLAKLVQMGESVAICEQIGDPATSKGPVERKVVRIVTPGTVTDEALLSERVDNLIAAIYHHNGRFGYATMDITSGRFQLSEPQTEEEMAAELQRTSPRELLFPEDFAPVHLMANRQGNRRRPIWEFELATAKQQLNQQFGTRDLVGFGVEQAKLGLCAAGCLIQYVKDTQRTALPHIRSLTWDRQDQSVILDAATRRNLELTHNLAGGTDNTLAEVLDHCATPMGSRMLKRWIHQPMRDNATLNQRLDAITELKETALYGELHPVLKQIGDIERILARLALRSARPRDLARLRHAMQQLPELHSVMSELKQPHLTELRTHAEPMNELCDLLERAIKENPPVVIRDGGVIADGYSAELDEWRDLANGATEFLERLEAEERDRHGIDTLKVGYNNVHGFYIQVSRGQSHLVPPHYVRRQTLKNAERYIIEELKQHEDKVLNSKSRALALEKQLWEELFDLLMPHLEQLQQLAASVAQLDVLQNLAERAENLEYCRPTLVQEAGIHIQGGRHPVVERVMNEPFIANPIELNPQRRMLIITGPNMGGKSTYMRQTALIALMAHIGSYVPAESASIGPLDRIFTRIGASDDLASGRSTFMVEMTETANILHNATRNSLVLMDEIGRGTSTYDGLSLAWASAEWLAKEIGAMTLFATHYFELTELPNVLPHLANVHLDAVEHGDGIAFMHAVQEGAASKSYGLAVAGLAGVPKPVIKNARAKLQQLELLSSQPAETRKPSRVDIANQLSLIPEPSAVEQALAGVDPDQLTPRQALDMLYQLKKLL >NZ_CP028892|2519142:2526367|2522979_2523987_-|WP_000116735.1|DBSCAN-SWA MSVSNTVTKVEEFDFEDEALEVLETDAELASDEELVAVEGASEDVREEFDASAKSLDATQMYLSEIGFSPLLTAEEEVLYARRALRGDEAARKRMIESNLRLVVKISRRYSNRGLALLDLIEEGNLGLIRAVEKFDPERGFRFSTYATWWIRQTIERALMNQTRTIRLPIHVVKELNIYLRTARELSQRLDHEPTPEEIALELDRPVDDVTKMLRLNERISSVDTPIGGDGDKALLDILPDSHNADPEFSTQDDDIRESLLNWLDELNPKQKEVLARRFGLLGYEPSTLEEVGREINLTRERVRQIQVEGLRRLREILVKQGLNMEALFNVEYDN >NZ_CP028892|2519142:2526367|2524995_2525622_-|WP_000002982.1|DBSCAN-SWA MANPKADRLIQFLTEQGITSPQVLAAIHALPREFFVAPAMMHQAYDNNALPIGQGQTISQPYIVAKMTELLALTPETKVLEIGTGSGYQTAVLAKLVNHVFTVERIKTLQWDAKRRLKQLDIYNVSTKHGDGWQGWPARGPFDAILVTAAAAKVPQSLLDQLAEGGRMVIPVGEDEQYLYKIVRQGGQFISERVEAVRFVPLVAGDLA >NZ_CP028892|2519142:2526367|2524060_2524996_-|WP_000171040.1|DBSCAN-SWA MTLCSFRLGLLLFCSLLFGCTAPTPAPVSGLGKDYNKVERGSYRGSYYEVKKGDTLYFIAYLTDKDVNDLISYNDLAPPYTIHPGQKIKLWLPNYTPPAYGGTGGAATVAVASSTSASAAKAATTATVAQTVSKSSNTKNVQNGSTNSQNLTKKDPVKTVDQTKTKEYVEPVGKQNVNVNVAKAKPSDEKIAKWLWPTKGRVIKNFSAGDQGNKGIDIAGQRGQAVVATADGTVVYSGNALRGYGNLIIIKHNEHYLSAYAHNDQLLAKEGQTVQAGQKIATMGSSGTNSVRLHFEIRYQGKSVNPKRYLP >NZ_CP028892|2519142:2526367|2525614_2526367_-|WP_000698379.1|DBSCAN-SWA MKILLSNDDGVYAQGIHALADALRDLAEIVIVAPDRNRSGASNSLTLEHPLRVSQIAENTYSVQGTPTDCVHFALNELMKDALPDLVLSGINHGANLGDDVLYSGTVAAAMEGHFLGVQSIAFSLAGTTHFASAAHFVRQLVEQHLANPIPTNRLLNVNIPDRPLELIQGIEVTRLGARHHAESMIKQKDPRGHDIYWLGPPGKEQDAGPGTDFHAIERGWVSLTPLQVDLTAHESLRSMDHWLKEKVNG |
6 | uncultured_Mediterranean_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|