Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NC_022536 | Rhizobium pusense plasmid IRBL74_p, complete sequence | 1 crisprs | csa3 | 0 | 1 | 52 | 0 |
NC_022545 | Rhizobium pusense, complete genome | 1 crisprs | DEDDh,csa3 | 0 | 0 | 0 | 0 |
NC_022535 | Rhizobium pusense, complete genome | 0 crisprs | cas3,WYL,csa3,DEDDh | 0 | 0 | 6 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_022536_1 | 270634-270709 | Orphan |
NA
Consensus repeat of NC_022536_1
|
1 spacers
spacers of NC_022536_1
>1.1|270659|26|NC_022536|CRISPRCasFinder TAACCCCGGAAAATTCCAGCTCCCGC |
CRISPR arrays and Neighbor proteins around NC_022536_1
The CRISPR arrays of NC_022536_1 >merge|NC_022536|1|270634-270709|CRISPRCasFinder TGGCCCAAAATCGGGGAGCACTTCATAACCCCGGAAAATTCCAGCTCCCGCTGGCCCAAAATCGGGGAGCACTTCA >NC_022536|1|1|270634-270709|CRISPRCasFinder TGGCCCAAAATCGGGGAGCACTTCA TAACCCCGGAAAATTCCAGCTCCCGC TGGCCCAAAATCGGGGAGCACTTCA
>NC_022536.1|WP_144115371.1|269503_270603_+|IS3-family-transposase MKASKFSEAQIAFVLKQAEDGTPIGEVCRKAGISDATFYNWRKKYAGLMPSEMKRLRQLEEENAKLKRIVADLSLDKAMLQDVLFKKALRPARKRKLVDTIKADWKVSIRRACSVLKVDRSLYVYKSRRGEQAELKLKIKDICQTRIRYGYRRVHILIKREGWSVNPKRIYRLYKEMDLQLRNKVPKRRVKAKLRADRTEPSHSNHVWAMDFVHDQLATGRKIRVLTVVDTFSRFSPVVDARFSYKGEDVVQTLERVCRQIGYPATIRVDNGSEFISRDLDLWAYHKGVVLDFSRPGKPTDNSYIESFNGKFRAECLNAHWFMSLDDARAKMEDWRRDYNEFRPHSAIGNKVPISLMSGSSASPPT >NC_022536.1|WP_022557361.1|268867_269149_-|acyl-carrier-protein MSDQLAKEVIATINNRALAERGEPTTATPSAEITLSTELSSLDLDSLALADILWDLEQANNIKIEMNTADAWSNLQTVGDVVMAVRSLLVKEA >NC_022536.1|WP_022557360.1|267658_268867_-|beta-ketoacyl-[acyl-carrier-protein]-synthase-family-protein MDGRVVITGIGGLCGLGTDVAPIWRGMCTGVSAIGPIANPELHELAGVIGCEIKTLPEHDITRRQLVSMDRFSLLAVLAAREAMQQAGLSSEEGNPYRFGAAVGVGVCGWDAIEENYRALLLNGAKRAEVLTAPRVMPCAAAGQVSMHFGLRGPVFGASSACASANHAIALAVDQIRLGRADVMLAGGSDAPLVWGVLKSWEALRILAPDTCRPFSADRKGVVLGEGAGIAVLESYEHASRRGASVLAEIAGIGLSADAFDIVAPAVEGPEAAMRACLEDARLNVEDVDYLNAHGTGTKANDQVETAAIKRVFSEHAYSMSISSTKSVHAHCLGAASALEMIACVMAIREGIIPPTANYNERDPNCDLDVTPNVPRERKIRVALSNAFAMGGLNAVLAFRQV >NC_022536.1|WP_022557359.1|266709_267300_-|NodA-family-N-acyltransferase MCSDVRWKICWETELQVDDHAELSAFFRNTYGPTGAFNAQPFEGGRSWAGARPEMRVIAYDSRGVAAHMGLLRRFIKVGEVDLLVGELGLWGVRADLEGLGLSHSMFTMYPELQRLGVPFAFGTVRHALYKHVERLCRGGIATILPGVRVRSTLPEVYLDLPATRVEDPLAVVFPIARSMDEWPSGTLIDRNGPEL >NC_022536.1|WP_022557358.1|266065_266713_-|chitooligosaccharide-deacetylase-NodB MKQLNYTCKVSSNSADRSVYLTFDDGPNPFCTPDILDVLAERRVPATFFVIGAYAADQPALIQRMVAEGHAVGNHTMTHPDLTTCRLEEIEYQITEASSAIKAASPQAAPKHMRAPYGLWNEDVLSISERAGLTPVHWSVDPRDWSRPGVNVIVDAVLNSVQPGAIVLLHDGCPPSELVGQSLSGLRDQTLMALSRVITGLHERGFFIRLLSQHN >NC_022536.1|WP_022557357.1|264798_266052_-|chitooligosaccharide-synthase-NodC MDLFGTAGTVAISLYALLSGVYKGMQVLYAPPASFFSASPGSSPFHALASVDVIVPCFNEDPDTLSACLESIANQDYAGRLQVFVVDDGSTNREDLAPVHNKYARDPRFNFILLSTNAGKRKAQIAAIRRSSGDLLLNVDSDTMLAPDVVTKLVHRMRDPSIGAVMGQLVARNRSDTWLTRLIDMEYWLACNEERAAQARFGAVMCCCGPCAMYRRSALLLLLDQYETQLFRGKPSDFGEDRHLTILMLKAGFRTEYVADAIAATVVPARLAPYLRQQLRWARSTFRDTWLALRLLPSLDRYLTLDVIGQNLGPLLLAVSVLMGLTQIAVAATVPWSTIILIAFMTIVRSSVAALRARQVRFLAFAAHTPINLFLILPLKAYALCTLSNSAWLSRTSILQSPPRGAETSSLKSLPMD >NC_022536.1|WP_048903028.1|264077_264761_-|methyltransferase-domain-containing-protein MLQLTARPRAGKSNIVCRISHKRCRKLSQKKHYQLLHDELAEDDPWRLDSNPFEQERHRQTLRLALAQQSITHALEVGCAAGAFTEKLAPHCEQLTVIDVVPHALARTRRRLMDPPNISWISCDILQFATQQFFDLIVVAEVLYYLESVAEMRTVVRNLAQMLAPSGHLIFGSAGDASCQRWGHVAGAETVITILDEELVQIDRVRCVGQTTNEDCLLTRYRHPVSQ >NC_022536.1|WP_022557355.1|262331_264047_-|Nodulation-protein-U MRICGIKLTHDGAVALIENGKLIFCIEQEKRNNNSRYQAIDNLDAIVEALNDHGLSVGDVDQFVIDGWDGELESEFQVFSEGAPITLSGAPYVERWPESPLKPHDSSGLALSGSILPYKSFPHVISHVVSAYCTSPFAKAGDPSFCLVWDGCIFPRLYYAEPKGVRLVKCLFPMIGHAYAVAGHHFGPFRNADPQSWDLGVAGKLMAYIALGAAQENILNVFRELYEEHFAGETLIAVNYRENIHNADALLACVNDYFDASASRLQGEKPQDVLASFHVFLERLLVGEMTNALQMQSQFESRNLCVVGGCGLNIKWNSALRASGLFESVWVPPFPNDSGSAIGAACCALAVDRGLAPLDWSVYSGPKLKSSAIPQGWKAAPCTLAELATILASNEPVVFLAGRAELGPRALGGRSILAAGTSPQMKDHLNKVKFREHFRPVAPICLEDRARDIFDPGTPDPYMLFDHTTRPEWRERIPAVVHLDGTARLQTISRDSEHEVAKLLVEYESLTGIPLLCNTSANYNGRGFFPDAAAACEWGQIGHVWCDGLLLTKASEVDGSPAGCADFSASA >NC_022536.1|WP_022557354.1|261420_262335_-|nodulation-factor-ABC-transporter-ATP-binding-protein-NodI MTTIAISFIEVTKTYIDRTVVDRFSFAVKKGECFGLLGPNGAGKSTIARMVLGMTPPDEGKITVLGAPVPAQARLARASIGVVPQFDDLDQEFTVRENLLVFGRYFNMSTRQIEAAIPSLLEFARLESKADARVVGLSGGMKRRLMLARALINDPQLLVLDEPTTGLDPHARHLIWERLRSLLARGKTILLTTHFMEEAERLCDRLCVLEEGQKIAEGRPQGLIDEQIGCQVIEINGGNPHELRALIKTCAQRIEVSGETLFCYSSTPEQVRIKLREHTNLRLLQRPPNLEDVFLRLTGREMKD >NC_022536.1|WP_022557353.1|260628_261417_-|ABC-transporter-permease MWENYAAVLPANGWNWTAVWRRNYLAWKKAALVSILGNLADPMIYLFGLGTGLGLMVGRVEGMSYIAFLATGMVAASAMTASTFETIYAAFARMRDQRSWEAILYTQITLGDIVLGEVIWAATKALLAGTAIAVVAAILGYSVWSSIPYVVPVIALTGIAFASLAMIVAALAPSYDYFIFYQTLILTPMLFLSGAVYPVTQLPGKVQQMATFLPLAHSIDLIRPAMFGRPAADVIFHLGMLFVFGVLPFFVSTALLRRRLMS >NC_022536.1|WP_022557364.1|270737_271943_-|hypothetical-protein MIRFVPTTKQIEYFIAVCFIIAGMFGLIWIFIAQEPFRITNGSEWDAVYYDRLLRLLAAEGGLQLRIPFPYCARVGTPWILVNIFHNRSSFYEFNLVVSGLFAATLLFATRSLWHGSIKGLTAVIGASSFLYFAPVKFTNFYPAYMDPPFLLVLSLCLIFIIKKNYLLASIICIAGIPFREASFYLLPLLIGFYIKNAQISIGVWVISISIIICGFLLKELMLFVSDCDSQSQLITAIFWFYRFLSEPAHVLGSIAAISLTLGPLYVVLDKQTLTGIKSDDTVIFSIIASVYSGFLSIVGGSDVTRIFYSFLPFYMPLLIKCFKVSSLTSFVLSCFGWLLTNHMLQKYEQPISEGPNKDILGFFAQFPDYGHPTIALVVLGIWFVLAMSRTLIEPLEGYLE >NC_022536.1|WP_022557365.1|271939_272674_-|SDR-family-oxidoreductase MKPSVLILGARSDIGNAVAHKFAAQGHPIQLAARQSETLDAEKTNLQLRYGVPVTLHEFDALLTETHAQFLAMLPELPEVAVSVVGLMESQERSERDHLLARCIMRSNYEGPANLLALLANRFEERGSGTLVGLSSVAGERGRATNYVYGSAKAGFTAFLSGLRSRFAKSDVHVVTVLPGYVATKMTEGMNLPAWLTAQPSEVAESIVVAVERKKNVIYVRPVWRMIMLIIRLIPERLFKRVRM >NC_022536.1|WP_022557366.1|272673_273999_-|FAD-binding-oxidoreductase MKLSGWGRSPLVDAQVYMPRDLEALQKLLASRPSMIARGWGRAYGDSAINSSATIDMRHLNRMLAFDPKTGQLIAEAGVVLYDIIAAFLPRGWFPMVTPGTKFVTLGGMIAADVHGKNHRKHGSFRGCVDWIDVMGPDGSIQRCSSNSHVELYEHTLGGMGLTGIIIRAAVRLRTVETGWIRRTTIPAPNLRSAMTALEGAQDSTYSVAWIDCLGTGKNLGRSLVFLGDHANTSDLPIYRSAHPFATPARRKLSVPFNFPCFALNQLSLRAFNALYYRIGLWNRGQQLIDWDSYFYPLDAVTDWNRIYGRKGFAQFQCVIPIKNSEEGLSALLKTVAKAGAGSFLAVLKRFGPQESCFSFPMEGYSLALDFPITTKTSRLLANLDRVTIEHGGRFYLAKDSRMSAETLRASDGRVASFVRVRAKNGWKSSFQSAQAERLVL >NC_022536.1|WP_022557367.1|273995_274463_-|GtrA-family-protein MPSNHSGLRYRRSAEVKRQGLALRYAAFALIAMVVNVTGQHVVLHFGNTSAIFALAMCAGTIAGLMIKYLLDKFWIFGDREIGLINDGWKFSLYTAVGALTTAIFWSAEAAAWWIWKTELMHDLGAAMGLTIGYLVKYQLDKRFVFAGHRRRISS >NC_022536.1|WP_022557368.1|274414_275578_-|UbiA-family-prenyltransferase MSGGSKDEKAYVHPSPGDTGTIGFEHLVTHHPFDSTVFKRTVRPRVFTVAPYGLENARANDAAATLDGKSDGCDGFADIRCGGGQTMSETPRIKARAFKHYVNAFRPHQWLKNILVFLPALAAHKLDWPTLLSSLEAFVCFSLVASSVYVMNDLLDVCADRAHPRKRYRPFASHSIPTAHGTWMVVGLVLPGVLIAIFIGWSFFLVVAVYFLVTTAYSLHLKRRIVIDLCILAGLYTIRIVAGGIATSTPLSVLLIAFSVFFFLSLAAVKRQSELVDGAERGSLQATGRGYHVNDLPIISMIAVGAGYVSVLVMTYYVNSPVVMELYPHPQMLWGVCAVLLYWITRTVMVSHRGNMHDDPVIYAAQDRTSQVCLAIILVFVTGGVLR >NC_022536.1|WP_048903029.1|276131_278117_-|acyltransferase MARTEPHFFRMDIEGLRALAVSGVIAFHFGMTSVPGGFVGVDIFFVISGYLITRHIQLEIERTGSLDLLRFYARRARRLLPASCFVILATLFFGYFILSPPEQQLYSKGSFYASAYMINIWLISWAADYFAPDAFNNPFIHFWSLSVEEQFYLVWPALLLLFARLRPGRYGLFLPVVLMGVISFAFCWYYTAISQPWAFYFSPFRAWEFACGGLALMISEEAAKRFRLTPVFGWTGIGLIMTAYLGMSEDVPFPGLTALVPVAGTVMVLLSGTRPGPAGPQVLLSLPPLQWLGRISYSLYLWHWPVIVYSGILKPELTTFERFLCLALILGLSVFSYSFIENPMRRNPWLLARTSRSLGFAALLTACGAAAAYGSARVANHNIDLQQNLILRSAERDSSARQFDDGCLLNAQQVQPKPCEFGAIPPGKTIVLFGDSHADHWSTPLISIAKSNGWQLVTYLKSSCPAADVTIWNSMLMRNYEECDRWRQLAMREIATRKPDMVIISEYSSAYVKNDINVVSIHQIDATTWAQGLRRTVDALESAVTKIAVLRDGPVHKTYLDKCVARALWQKRGAETCDTPRSGAMEETIPDAERKAVSDFGNASYVDITDVFCNATTCPAMIGGKLTFRDRHHIATPFAATLATPLQRALFEMMNAGTPTN >NC_022536.1|WP_048903080.1|278208_279132_-|LysR-family-transcriptional-regulator MRFKGLDLNLLVALDALMTERNLTVAARSINLSQPAMSAAVGRLREYFRDDLFTMNGRELCLTPRAEGLAPVVRDVLLKIRCSIISWEPFNPSQSERRFRILLSDFMTLVFFDRVIARVAREAPAVSFELLPLDNDPNELLRRGEVDFLLLPDLYMANSHPTAKLFDEKLVCVSCPTNNAVPRELTFEQYMSMGHVAVMFGRLLRPSIEEWYLLEHGIKRRLEVVVQGFSFIPQMLSGTNRIATMPLRLVEYFEKTIPLRIINLPLPLPAFTEAVQWPALHNGDPASIWMRQILIEEASRMAVSHNA >NC_022536.1|WP_022557371.1|279107_280037_-|dihydrodipicolinate-synthase-family-protein MAFLSGLSAFPITPSDLNGRVDTAALKRLVARLCIAGVDSIGLLGSTGTYMYLCREERRRALDAAIQEANGVPVVAGVGALRTDEAVRLAQDAKAAGAAAGLLAAVSYAPLTEDEVFEHFSTVTRKSGLLIVIYDNPGTTHFRFTAALVERMAHRAGVGLRQSNGWHRRITKALLYIVWALPRADRHRPACKRSCLHARCRHHRTYRGNSGDHGSLQDRPSSNSRGDVDDRTSVGVGHRLAVRAGITMPLEREPWSSRRLFWPVRSAGQQRQCCQWPLHELRSQSLSGMITENDPQHGWFQTCASKALI >NC_022536.1|WP_022557291.1|280092_281328_-|aminotransferase-class-I/II-fold-pyridoxal-phosphate-dependent-enzyme MTGGSEHDNRQRQSDHLHAASFGFDTRAIHHAFSPVDFKRAVQPPVFLTSTYGFESVEANDAAAALGGRLYAREYNPTTEILEQRLANLERAEAGLVVSTGMAAFGTLILSLLSQGDELVVHKTLYSNSVAMVEQGLPRFGIKVIPVDLSDPSNLDAAITERTKLVYFETPVNPLSSILDIAAISERARARGVKVAVDSTFASPALQRPIEHGADIVLHSLTKYINGHGDTLGGALLGDAETLHRLHETGLRYITGATLAPHSAFLIMRGLKTLSLRMDRHSASALAIARMLEAHPAVSWVSYPFLESHPDQAIARKQMTQGSGMLAFGLHAGFDGARNMLDRLQLMTRAVSLGDTDTLIYHPASITRARQSIRKDAHMVSGVGDDLIRLSVGLEDVTDLIGDLRQALATL >NC_022536.1|WP_022557372.1|281409_282897_+|PLP-dependent-aminotransferase-family-protein MVQSENQAAAVHAPRMGARRIYEALKSQILSRVYEAGSQLPSSRSLANELHVSRTTVTVAYEQLAAEGFVELRQGARPRVTALELRQRPRESDTTLEAFGPLSAYGERLRALSPWLDYLPTNLAVDFRYGDLAPSDFPALAWKRAINSVLTQRQGRLSYEDPRGSRRLRQALQGYLWRARTLQCDLEQIIVVNGSQQGLDLCARILLDANSAFVMENPGYRMARQIFSSTGASAVAVDADAGGLKTLDLSGIDARMAYVTPSHQFPLGGVMPISRRHQLLAWARDRDAYVVEDDYDSEYRYDISPVPPLQSLAEGRNVIYLGTVSKTLSPMMRIGYLVVPKQLQEVFATAKQLTDRHTPMTEQEALAFLIESGAYESHVRRVRRLNRERRETLLSALETAFGDRITIEGADAGLHVVVWFNELPGSAEIALMDAARQRGVGLYGISLLYDSAPWASEAPRERLGLVMGYSALTPRQIEKGIQLVAPAVDAVKGAG |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP020910 | Rhizobium etli strain NXC12 plasmid pRetNXC12d, complete sequence | 87739-87764 | 0 | 1.0 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP032928 | Agrobacterium tumefaciens strain 1D1460 plasmid pAt1D1460, complete sequence | 500785-500810 | 0 | 1.0 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP030762 | Rhizobium leguminosarum strain ATCC 14479 plasmid unnamed2, complete sequence | 404459-404484 | 0 | 1.0 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NC_011981 | Agrobacterium vitis S4 plasmid pAtS4e, complete sequence | 454408-454433 | 0 | 1.0 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NC_011981 | Agrobacterium vitis S4 plasmid pAtS4e, complete sequence | 628807-628832 | 0 | 1.0 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NC_011981 | Agrobacterium vitis S4 plasmid pAtS4e, complete sequence | 430718-430743 | 0 | 1.0 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | CP007645 | Rhizobium etli bv. phaseoli str. IE4803 plasmid pRetIE4803d, complete sequence | 226557-226582 | 0 | 1.0 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP039909 | Agrobacterium tumefaciens strain CFBP6624 plasmid pAtCFBP6624, complete sequence | 339965-339990 | 0 | 1.0 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NC_022536 | Rhizobium sp. IRBG74 plasmid IRBL74_p, complete sequence | 270659-270684 | 0 | 1.0 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP030763 | Rhizobium leguminosarum strain ATCC 14479 plasmid unnamed3, complete sequence | 301633-301658 | 0 | 1.0 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP049249 | Rhizobium rhizoryzae strain DSM 29514 plasmid unnamed3, complete sequence | 876349-876374 | 0 | 1.0 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NC_022536 | Rhizobium sp. IRBG74 plasmid IRBL74_p, complete sequence | 270608-270633 | 1 | 0.962 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP039694 | Agrobacterium larrymoorei strain CFBP5473 plasmid pTiCFBP5473, complete sequence | 121030-121055 | 1 | 0.962 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP053858 | Rhizobium pusense strain 76 plasmid pR76, complete sequence | 285789-285814 | 1 | 0.962 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP053209 | Rhizobium leguminosarum bv. trifolii TA1 plasmid pRltTA1A, complete sequence | 287296-287321 | 1 | 0.962 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP050084 | Rhizobium leguminosarum bv. trifolii strain 31B plasmid pRL31b2, complete sequence | 447643-447668 | 1 | 0.962 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP048425 | Rhizobium daejeonense strain KACC 13094 plasmid unnamed2, complete sequence | 109324-109349 | 1 | 0.962 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP032697 | Rhizobium jaguaris strain CCGE525 plasmid pRCCGE525a, complete sequence | 142583-142608 | 1 | 0.962 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP017148 | Bosea vaviloviae strain Vaf18 plasmid unnamed1, complete sequence | 96971-96996 | 1 | 0.962 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP017148 | Bosea vaviloviae strain Vaf18 plasmid unnamed1, complete sequence | 98932-98957 | 1 | 0.962 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP048284 | Rhizobium leguminosarum bv. viciae 248 plasmid pRle248b, complete sequence | 113370-113395 | 1 | 0.962 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP016287 | Rhizobium leguminosarum strain Vaf10 plasmid unnamed1, complete sequence | 966649-966674 | 1 | 0.962 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP017943 | Phyllobacterium zundukense strain Tri-48 plasmid unnamed2, complete sequence | 367864-367889 | 2 | 0.923 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | LR606149 | Rhizobium sp. Q54 genome assembly, plasmid: 6 | 209586-209611 | 2 | 0.923 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP025505 | Rhizobium leguminosarum bv. viciae strain UPM791 plasmid pRlvC, complete sequence | 234493-234518 | 2 | 0.923 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP017105 | Rhizobium gallicum strain IE4872 plasmid pRgalIE4872d, complete sequence | 1719677-1719702 | 2 | 0.923 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP053209 | Rhizobium leguminosarum bv. trifolii TA1 plasmid pRltTA1A, complete sequence | 424848-424873 | 3 | 0.885 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP050084 | Rhizobium leguminosarum bv. trifolii strain 31B plasmid pRL31b2, complete sequence | 308224-308249 | 3 | 0.885 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP032696 | Rhizobium jaguaris strain CCGE525 plasmid pRCCGE525b, complete sequence | 352346-352371 | 3 | 0.885 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | LR606149 | Rhizobium sp. Q54 genome assembly, plasmid: 6 | 14950-14975 | 4 | 0.846 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP007796 | Azospirillum brasilense strain Az39 plasmid AbAZ39_p3, complete sequence | 274362-274387 | 4 | 0.846 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP018781 | Ochrobactrum pituitosum strain AA2 plasmid pOAAA2, complete sequence | 291106-291131 | 4 | 0.846 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP020900 | Rhizobium phaseoli Brasil 5 strain Bra5 plasmid pRphaBra5d, complete sequence | 120244-120269 | 4 | 0.846 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP033321 | Azospirillum brasilense strain Cd plasmid p3, complete sequence | 244428-244453 | 5 | 0.808 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP033315 | Azospirillum brasilense strain Sp 7 plasmid p3, complete sequence | 214007-214032 | 5 | 0.808 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP032349 | Azospirillum brasilense strain MTCC4039 plasmid p3, complete sequence | 617405-617430 | 5 | 0.808 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP032342 | Azospirillum brasilense strain MTCC4038 plasmid p3, complete sequence | 520429-520454 | 5 | 0.808 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP033323 | Azospirillum brasilense strain Cd plasmid p5, complete sequence | 108219-108244 | 5 | 0.808 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP050083 | Rhizobium leguminosarum bv. trifolii strain 31B plasmid pRL31b3, complete sequence | 224557-224582 | 5 | 0.808 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP049733 | Rhizobium leguminosarum strain A1 plasmid pRL10, complete sequence | 215510-215535 | 5 | 0.808 |
NC_022536_1 | 1.1|270659|26|NC_022536|CRISPRCasFinder | 270659-270684 | 26 | NZ_CP053207 | Rhizobium leguminosarum bv. trifolii TA1 plasmid pRltTA1C, complete sequence | 246322-246347 | 5 | 0.808 |
1. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP020910 (Rhizobium etli strain NXC12 plasmid pRetNXC12d, complete sequence) position: , mismatch: 0, identity: 1.0
taaccccggaaaattccagctcccgc CRISPR spacer taaccccggaaaattccagctcccgc Protospacer **************************
2. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP032928 (Agrobacterium tumefaciens strain 1D1460 plasmid pAt1D1460, complete sequence) position: , mismatch: 0, identity: 1.0
taaccccggaaaattccagctcccgc CRISPR spacer taaccccggaaaattccagctcccgc Protospacer **************************
3. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP030762 (Rhizobium leguminosarum strain ATCC 14479 plasmid unnamed2, complete sequence) position: , mismatch: 0, identity: 1.0
taaccccggaaaattccagctcccgc CRISPR spacer taaccccggaaaattccagctcccgc Protospacer **************************
4. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NC_011981 (Agrobacterium vitis S4 plasmid pAtS4e, complete sequence) position: , mismatch: 0, identity: 1.0
taaccccggaaaattccagctcccgc CRISPR spacer taaccccggaaaattccagctcccgc Protospacer **************************
5. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NC_011981 (Agrobacterium vitis S4 plasmid pAtS4e, complete sequence) position: , mismatch: 0, identity: 1.0
taaccccggaaaattccagctcccgc CRISPR spacer taaccccggaaaattccagctcccgc Protospacer **************************
6. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NC_011981 (Agrobacterium vitis S4 plasmid pAtS4e, complete sequence) position: , mismatch: 0, identity: 1.0
taaccccggaaaattccagctcccgc CRISPR spacer taaccccggaaaattccagctcccgc Protospacer **************************
7. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to CP007645 (Rhizobium etli bv. phaseoli str. IE4803 plasmid pRetIE4803d, complete sequence) position: , mismatch: 0, identity: 1.0
taaccccggaaaattccagctcccgc CRISPR spacer taaccccggaaaattccagctcccgc Protospacer **************************
8. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP039909 (Agrobacterium tumefaciens strain CFBP6624 plasmid pAtCFBP6624, complete sequence) position: , mismatch: 0, identity: 1.0
taaccccggaaaattccagctcccgc CRISPR spacer taaccccggaaaattccagctcccgc Protospacer **************************
9. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NC_022536 (Rhizobium sp. IRBG74 plasmid IRBL74_p, complete sequence) position: , mismatch: 0, identity: 1.0
taaccccggaaaattccagctcccgc CRISPR spacer taaccccggaaaattccagctcccgc Protospacer **************************
10. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP030763 (Rhizobium leguminosarum strain ATCC 14479 plasmid unnamed3, complete sequence) position: , mismatch: 0, identity: 1.0
taaccccggaaaattccagctcccgc CRISPR spacer taaccccggaaaattccagctcccgc Protospacer **************************
11. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP049249 (Rhizobium rhizoryzae strain DSM 29514 plasmid unnamed3, complete sequence) position: , mismatch: 0, identity: 1.0
taaccccggaaaattccagctcccgc CRISPR spacer taaccccggaaaattccagctcccgc Protospacer **************************
12. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NC_022536 (Rhizobium sp. IRBG74 plasmid IRBL74_p, complete sequence) position: , mismatch: 1, identity: 0.962
taaccccggaaaattccagctcccgc CRISPR spacer taaccccggaaaattccagctcccgt Protospacer *************************.
13. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP039694 (Agrobacterium larrymoorei strain CFBP5473 plasmid pTiCFBP5473, complete sequence) position: , mismatch: 1, identity: 0.962
taaccccggaaaattccagctcccgc CRISPR spacer aaaccccggaaaattccagctcccgc Protospacer *************************
14. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP053858 (Rhizobium pusense strain 76 plasmid pR76, complete sequence) position: , mismatch: 1, identity: 0.962
taaccccggaaaattccagctcccgc CRISPR spacer taaccccggaaaattccagctcccgt Protospacer *************************.
15. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP053209 (Rhizobium leguminosarum bv. trifolii TA1 plasmid pRltTA1A, complete sequence) position: , mismatch: 1, identity: 0.962
taaccccggaaaattccagctcccgc CRISPR spacer caaccccggaaaattccagctcccgc Protospacer .*************************
16. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP050084 (Rhizobium leguminosarum bv. trifolii strain 31B plasmid pRL31b2, complete sequence) position: , mismatch: 1, identity: 0.962
taaccccggaaaattccagctcccgc CRISPR spacer caaccccggaaaattccagctcccgc Protospacer .*************************
17. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP048425 (Rhizobium daejeonense strain KACC 13094 plasmid unnamed2, complete sequence) position: , mismatch: 1, identity: 0.962
taaccccggaaaattccagctcccgc CRISPR spacer taaccccggaaaattccagctcacgc Protospacer ********************** ***
18. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP032697 (Rhizobium jaguaris strain CCGE525 plasmid pRCCGE525a, complete sequence) position: , mismatch: 1, identity: 0.962
taaccccggaaaattccagctcccgc CRISPR spacer taaccccggaaaattccagctcctgc Protospacer ***********************.**
19. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP017148 (Bosea vaviloviae strain Vaf18 plasmid unnamed1, complete sequence) position: , mismatch: 1, identity: 0.962
taaccccggaaaattccagctcccgc CRISPR spacer taaccccggaaaattccagctcgcgc Protospacer ********************** ***
20. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP017148 (Bosea vaviloviae strain Vaf18 plasmid unnamed1, complete sequence) position: , mismatch: 1, identity: 0.962
taaccccggaaaattccagctcccgc CRISPR spacer taaccccggaaaattccagctcgcgc Protospacer ********************** ***
21. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP048284 (Rhizobium leguminosarum bv. viciae 248 plasmid pRle248b, complete sequence) position: , mismatch: 1, identity: 0.962
-taaccccggaaaattccagctcccgc CRISPR spacer ataacccc-gaaaattccagctcccgc Protospacer ******* ******************
22. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP016287 (Rhizobium leguminosarum strain Vaf10 plasmid unnamed1, complete sequence) position: , mismatch: 1, identity: 0.962
-taaccccggaaaattccagctcccgc CRISPR spacer ataacccc-gaaaattccagctcccgc Protospacer ******* ******************
23. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP017943 (Phyllobacterium zundukense strain Tri-48 plasmid unnamed2, complete sequence) position: , mismatch: 2, identity: 0.923
taaccccggaaaattccagctcccgc CRISPR spacer taaccccggaaaattccagctccggg Protospacer *********************** *
24. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to LR606149 (Rhizobium sp. Q54 genome assembly, plasmid: 6) position: , mismatch: 2, identity: 0.923
taaccccggaaaattccagctcccgc CRISPR spacer taactccggaaaactccagctcccgc Protospacer ****.********.************
25. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP025505 (Rhizobium leguminosarum bv. viciae strain UPM791 plasmid pRlvC, complete sequence) position: , mismatch: 2, identity: 0.923
taaccccggaaaattccagctcccgc CRISPR spacer taaccccggaaaattccagcttctgc Protospacer *********************.*.**
26. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP017105 (Rhizobium gallicum strain IE4872 plasmid pRgalIE4872d, complete sequence) position: , mismatch: 2, identity: 0.923
taaccccggaaaattccagctcccgc CRISPR spacer taaacccggaaaattccagttcccgc Protospacer *** ***************.******
27. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP053209 (Rhizobium leguminosarum bv. trifolii TA1 plasmid pRltTA1A, complete sequence) position: , mismatch: 3, identity: 0.885
taaccccggaaaattccagctcccgc CRISPR spacer aaaaccccgaaaattccagctcccgc Protospacer ** *** ******************
28. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP050084 (Rhizobium leguminosarum bv. trifolii strain 31B plasmid pRL31b2, complete sequence) position: , mismatch: 3, identity: 0.885
taaccccggaaaattccagctcccgc CRISPR spacer aaaaccccgaaaattccagctcccgc Protospacer ** *** ******************
29. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP032696 (Rhizobium jaguaris strain CCGE525 plasmid pRCCGE525b, complete sequence) position: , mismatch: 3, identity: 0.885
taaccccggaaaattccagctcccgc CRISPR spacer caaaaccggaaaattccagctcccgc Protospacer .** *********************
30. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to LR606149 (Rhizobium sp. Q54 genome assembly, plasmid: 6) position: , mismatch: 4, identity: 0.846
taaccccggaaaattccagctcccgc CRISPR spacer atgccccggaaaattccagctctcgc Protospacer .*******************.***
31. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP007796 (Azospirillum brasilense strain Az39 plasmid AbAZ39_p3, complete sequence) position: , mismatch: 4, identity: 0.846
taaccccggaaaattccagctcccgc CRISPR spacer ccgccccggaaaattccagctcgcgc Protospacer . .******************* ***
32. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP018781 (Ochrobactrum pituitosum strain AA2 plasmid pOAAA2, complete sequence) position: , mismatch: 4, identity: 0.846
taaccccggaaaattccagctcccgc CRISPR spacer taaacccggaaaattccagctctccg Protospacer *** ******************.*
33. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP020900 (Rhizobium phaseoli Brasil 5 strain Bra5 plasmid pRphaBra5d, complete sequence) position: , mismatch: 4, identity: 0.846
taaccccggaaaattccagctcccgc CRISPR spacer taaccccggagaattccaggtccggg Protospacer **********.******** *** *
34. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP033321 (Azospirillum brasilense strain Cd plasmid p3, complete sequence) position: , mismatch: 5, identity: 0.808
taaccccggaaaattccagctcccgc CRISPR spacer gcggcccggaaaattccagctcgcgc Protospacer . ****************** ***
35. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP033315 (Azospirillum brasilense strain Sp 7 plasmid p3, complete sequence) position: , mismatch: 5, identity: 0.808
taaccccggaaaattccagctcccgc CRISPR spacer gcggcccggaaaattccagctcgcgc Protospacer . ****************** ***
36. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP032349 (Azospirillum brasilense strain MTCC4039 plasmid p3, complete sequence) position: , mismatch: 5, identity: 0.808
taaccccggaaaattccagctcccgc CRISPR spacer gcggcccggaaaattccagctcgcgc Protospacer . ****************** ***
37. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP032342 (Azospirillum brasilense strain MTCC4038 plasmid p3, complete sequence) position: , mismatch: 5, identity: 0.808
taaccccggaaaattccagctcccgc CRISPR spacer gcggcccggaaaattccagctcgcgc Protospacer . ****************** ***
38. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP033323 (Azospirillum brasilense strain Cd plasmid p5, complete sequence) position: , mismatch: 5, identity: 0.808
taaccccggaaaattccagctcccgc CRISPR spacer gcggcccggaaaattccagctcgcgc Protospacer . ****************** ***
39. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP050083 (Rhizobium leguminosarum bv. trifolii strain 31B plasmid pRL31b3, complete sequence) position: , mismatch: 5, identity: 0.808
taaccccggaaaattccagctcccgc CRISPR spacer taaccccggagaattccaggtccgtg Protospacer **********.******** ***
40. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP049733 (Rhizobium leguminosarum strain A1 plasmid pRL10, complete sequence) position: , mismatch: 5, identity: 0.808
taaccccggaaaattccagctcccgc CRISPR spacer taaccccggagaattccaggtccgtg Protospacer **********.******** ***
41. spacer 1.1|270659|26|NC_022536|CRISPRCasFinder matches to NZ_CP053207 (Rhizobium leguminosarum bv. trifolii TA1 plasmid pRltTA1C, complete sequence) position: , mismatch: 5, identity: 0.808
taaccccggaaaattccagctcccgc CRISPR spacer taaccccggagaattccaggtccgtg Protospacer **********.******** ***
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
0 : 2246
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NC_022536|0:2246|DBSCAN-SWA
Protein sequences of DBSCAN-SWA_1 >NC_022536|0:2246|1220_2246_+|WP_048902998.1|DBSCAN-SWA MTSKSRKSIVANFGLLSAELENRLSTENPTPAPQPVPTGRVGAGVIGAAHRAIDDIKSERDRLKALVEAGGGTIRELDPLSIDPSPFPDRLPDDDAADFEAFRNSIRSEGQKVPIQVRKSPSSPDRYQVIYGHRRLQAARDLGIAVKAIEVEISDIELAIAQGIENADRQDLTWIERALFARRMDDAGVKPRDIKAALSIDDPELARMRSVYRVVPTEIIEAIGRAGKVGRPRWADFAKNYSERPDLHDALRNVLSGSAEKRLGSDQKFLAAFNALKPEDPQKDTGKSIAGPAGEQLGRLVRTANEVRISAHTASAKEFLSFIESELPTLAERFSREKSKN |
1 | Ochrobactrum_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
7014 : 10159
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NC_022536|7014:10159|DBSCAN-SWA CTCATATGGATTTCAGGCTCACGCGCTGCTCCAGCTTTTCGGCTTCTTCTTTTCGCTCGCTGTAGCGATCGGTGAGGTAGGCAGAGACGTCACGGGTCAGGATCGTGAACTTCACCAGTTCCTCGCAGACGTCGACGACGCGATCATAATAGGAAGACGGCTTCATGCGGCCGTCGGTGTCGAACTCTTGGAAAGCCTTGGCCACCGACGACTGGTTCGGGATGGTGATCATCCGCATCCAGCGACCGAGGACCCGCAGCTGGTTGACCGCATTGAACGACTGCGAACCACCGGAAACCTGCATAACGGCCAGGGTCTTGCCCTGCGTCGGCCGCACCGAACCAACCGAAAGCGGGAGCCAGTCGATCTGCGCCTTCATGATACCGGTCATGGCGCCGTGGCGTTCCGGACTAATCCAGACTTGGCCTTCCGACCAGGCAGAAAGCTCACGCAGCTCCTGCACCTGTGGATGGCTCACCGGCGCCGCATCGGGAAGCGGCAGGCCCGCTGGATTGAAGATCCGAACCTCGCAGCCAAAATGTTCGAGCAGGCGCGCCGCCTCATTCGCCAGGAGCCGGCTATAAGATACGGCGCGCAGGGAACCATAGAGGATCAGAATGCGCGGCTTGTGCGATGAGAATGGAGGACGCAGGGCGCCCAAATCTGGCTGACAAAGATGCGAGAGGGATGCGGCAGGAAGATCAGACAATGCGTTTGCCCTCCGCATCGAGGACCTGCTCACCATCTTCTTTGGTGAAGGCGCCCTTGTGGGTGTCGGGCAGAATGTCCAGCACCACTTCGGATGGTCGTGCCAAGCGCGTGCCAAGCGGCGTCACGACCAGAGGCCGGTTGATTAGAATCGGGTTTTTAAGCATCGCGTTGAGCAGCTGATCGTCGCTCAGCTCCGGATTGTCGAGGTTCAACTCCGCATAGGGCGTGCTCTTTTTACGGATGGCGCCGCGCACCGTCAGGCCGGCATCAGCGATCATCTGCACAAGCTGTGCGCGCGTCGGCGGGGTTTTCAGGTATTCGATCACCGTGGGCTCAATGCCGGCGTTGCGGATCATCTCAAGGGTGTTGCGCGACGTTCCGCAGGCGGGGTTGTGATATATTGTGACGTCCATGGTCGTCAGGCCCTTTCGATGGTGGAAGTGGAGGAGCGGACGGCGGCGCCGCGCTCGTACCAGCCCTTGCTGCGGTTGACGATCCAGACCACTGACAGCATGACCGGCACTTCGATCAGCACGCCTACGACGGTCGCCAGGGCTGCTCCGGAATCGAAACCGAACAGGCTGATGGCCGCTGCCACCGCCAGTTCAAAGAAGTTGGATGCGCCAATGAGAGCGGAAGGGCCTGCAACGCAGTGCTGCTCCCCACTCGCGCGGTTGAGAAGGTAGGCCAGGCCGGAGTTGAAATAGACCTGGATGAGGATCGGCACGGCCAACAGGGCGATCACGCCCGGCTGCGCGAAGATCTGCTCGCCTTGAAAGCCGAACAATAATATCAGGGTCGCCAGTAGAGCGACCAGGGATACCGGCTGCAACCTGGCAGCGAGACGGTTTACGTCGGTCAGCCGACGCAGGATTTGCGCCGAGATTACCGGCAGGACGATATAGAGCACGACCGACAGGATAAGAGTATTCCACGGGACCGTAATGGCCGAAAGGCCGAGTAGCAGCCCGACGATCGGTGCGAAGGCCAACACCATGATCACATCGTTTAACGCGACCTGGCTCAGCGTGAAATGCGGCTCGCCTTTGGTCAGGTTCGACCAGACGAACACCATGGCGGTGCAGGGCGCGGCCGCAAGGATAATCAGGCCAGCGATATAGGAATCGATCTGGCCTGCCGGCAGGTAAGGCCGGAACAGATAGCCGATAAACAGCCAGCCAAGCAGCGCCATCGAAAACGGTTTTAATGCCCAGTTGATGAACAGGGTAACGCCGATCCCTCGCCAGTGGCGGCCGACCTGCCCGAGGGACGCGAAATCGATCTTGAGCAGCATTGGGATGATCATCAGCCAGATCAGCACTGCCACTGGCAGGTTGACCTTCGCGACTTCGGCCGCGCCGATCGTCTGGAAGGCCCCGGGCGTAAGGTGGCCGAGGGTCACCCCGACGACGATGCAGGCGATGACCCACACCGTCAGATAGCGTTCAAACGTGGACATGATTTTACCCTTGGATAGTCTTGGCAAACAGGCCGGCGGAGGCCGGGAAAAGGCTTGCCGCCTGGCCTGTTTGCAAGATTGCGGCCGGCGCGCTTGTGCGATCGATCTTCACGAAGCCCAAAATAATCAAAGAATGCAGTGGCCGTTTCTGTCAGCAGGTAAAGGGTGGGGGCGCCATCGCGCGCTGCCTGGTCAAGCATCTGGCGTGAAATGGCTTCGCCATAACCGCGTCTGCGTTGGTCGGGCAGGGCGACCACAGAGCGCAACAGCGCGTAGTCTCCATAGGGCTCCAGCCCAGCGAAGCCGATAATCTGGGCCCGGTCCGCAAAGCGAAAGAAGGTGCGCCCGCTTTGCTCCAGATCGTCGATCGGAAGTCCCGCCGCCTGCAGTGCTGCCTGGAGGTCCTGATCGCGCCCGCTGGCCGGCTGCTGATCCAAAACGTCGCTCACGCGACGTCGCCCTTACGGCTGGTGCTGCCTTCCATCGTGCCGATCGTGCGGAGCTGAGTTTCCAGCGCCAGCTTGTCGATCGACGCGAGGGGCAGATTTACAAACGCGGTGATCCGGTTCTTGAGGTAGCGGGCGGCTTGGGAGAAGGCCTGGCCCTTCTCAACCAGAGAACCTTCGACGGCTGCTGGATCTTCAACGCCCCAGTGGGCAGTCATCGGGTGGCCGATCCAGACCGGACAGGCTTCGCCAGCGGCATTGTCGCAGACCGTGAAGATGAAATCCATTTGCGGCGCGCCAGGCTCCGCGAAGACATCCCAGCTTTTTGAGCTGAACCCCGTTGCCTCGTATCCGAGGCTTCGCAGCGTATCCAAAGCAAGCGGATGGACCTCGCCCTTGGGCTGGCTGCCAGCGGAGAAAGCGCGGAATCGGCCCTTGCCTTCCCCATTCAAGATGGACTCGGCGAGGATCGAACGGGCCGAATTGCCGGTGCAGAGAAACAGGACGTTATAAACGCGATCAGCGGTCAT
Protein sequences of DBSCAN-SWA_2 >NC_022536|7014:10159|8142_9183_-|WP_022557055.1|DBSCAN-SWA MSTFERYLTVWVIACIVVGVTLGHLTPGAFQTIGAAEVAKVNLPVAVLIWLMIIPMLLKIDFASLGQVGRHWRGIGVTLFINWALKPFSMALLGWLFIGYLFRPYLPAGQIDSYIAGLIILAAAPCTAMVFVWSNLTKGEPHFTLSQVALNDVIMVLAFAPIVGLLLGLSAITVPWNTLILSVVLYIVLPVISAQILRRLTDVNRLAARLQPVSLVALLATLILLFGFQGEQIFAQPGVIALLAVPILIQVYFNSGLAYLLNRASGEQHCVAGPSALIGASNFFELAVAAAISLFGFDSGAALATVVGVLIEVPVMLSVVWIVNRSKGWYERGAAVRSSTSTIERA >NC_022536|7014:10159|7014_7740_-|WP_022557053.1|DBSCAN-SWA MRRANALSDLPAASLSHLCQPDLGALRPPFSSHKPRILILYGSLRAVSYSRLLANEAARLLEHFGCEVRIFNPAGLPLPDAAPVSHPQVQELRELSAWSEGQVWISPERHGAMTGIMKAQIDWLPLSVGSVRPTQGKTLAVMQVSGGSQSFNAVNQLRVLGRWMRMITIPNQSSVAKAFQEFDTDGRMKPSSYYDRVVDVCEELVKFTILTRDVSAYLTDRYSERKEEAEKLEQRVSLKSI >NC_022536|7014:10159|9628_10159_-|WP_022557057.1|DBSCAN-SWA MTADRVYNVLFLCTGNSARSILAESILNGEGKGRFRAFSAGSQPKGEVHPLALDTLRSLGYEATGFSSKSWDVFAEPGAPQMDFIFTVCDNAAGEACPVWIGHPMTAHWGVEDPAAVEGSLVEKGQAFSQAARYLKNRITAFVNLPLASIDKLALETQLRTIGTMEGSTSRKGDVA >NC_022536|7014:10159|7714_8137_-|WP_022557054.1|DBSCAN-SWA MDVTIYHNPACGTSRNTLEMIRNAGIEPTVIEYLKTPPTRAQLVQMIADAGLTVRGAIRKKSTPYAELNLDNPELSDDQLLNAMLKNPILINRPLVVTPLGTRLARPSEVVLDILPDTHKGAFTKEDGEQVLDAEGKRIV |
4 | uncultured_Caudovirales_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
15131 : 15425
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NC_022536|15131:15425|DBSCAN-SWA ATTACTGTTTTATGTTGTCGTTCGATTCGATCAGCCGCCGCTTGCGCCAGAGGCCTCCGTCATATCCCGTCAAAGATCCGTCTGCTCCGATGACCCTGTGACATGGAATCATGAGAGCTATTTGATTGGCACCAGTGGCACGTGCCACCGCGCGAATTGCGGTGGGGCGACCGATCTGACGTGCAATTTCGGAATAGGTTCGGGCTCGAGAGCTGGTATTTCGCGCAATACGTCCCACACGTGTTGAGTGAAGGGGTTGGCGGACATCGCAAGTGGTGTTGAAAACCTGTCGCAA
Protein sequences of DBSCAN-SWA_3 >NC_022536|15131:15425|15131_15425_-|WP_173402666.1|DBSCAN-SWA MRQVFNTTCDVRQPLHSTRVGRIARNTSSRARTYSEIARQIGRPTAIRAVARATGANQIALMIPCHRVIGADGSLTGYDGGLWRKRRLIESNDNIKQ |
1 | Moumouvirus(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
20804 : 28711
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NC_022536|20804:28711|DBSCAN-SWA AATGATCCGGGGGGCATTCATCGGCATCGATCGGCATGCGGATCCCTTTATCGGTGATCTGACGGGAGCCGCACGCGACGCCACCGCTTTGTGGGCGGTGCTGTCGGATAGTATCGCCGATCTAAACGCGCCCCTCATCACGAGTGAAGCGGCGACGCTGGTTGCGATGCGCGATGTGCTCGACGCGACGCTTGGCGCTGCGGATGAAGATGATGTAGTGATCCTCGGGTTCGCCGGACACGGCACGCAGGACCACCGTTTGGTTCTGCATGATACGAGCGTAGCCGATTTACCGAACACGACGCTCGGCATGGACGAGCTGGCTGAGCGTTTTCGAGAAACCAGAGCGCGTGCGGTCATTCTCTTGCTGGATTGCTGCTTCAGTGGCGGTGCTCCGGCGCGCGTGATCGACGCCGGCCTTGTCCCTCGCGATGCTGTTTCTTTCCGGCTCGTCGACGTGGCGGGGCAGGGACGTATCCTGTTTGCCGCATCTTCCGCCGATCAAGAGGCGCTCGAGGATCCCCAATCACGGCACGGGCTCTTCACCAAGGCCGTCATTGACGTTTTGCTGGAGGCAGGCGCCCCGCTAGGGGTGCTTGAGCTTGTCGGCCGCGTTACTCGCCTCGTCACAGCGAACGCTGGGCGATTTGGCTACATCCAATCACCGACCATGTTCGGCCAGACCGAAGGCGATCTCGTCCTACCTCCCGGCCAGATCGGTGCCCGCTATCGCGAAGCCTTTCCGGAACGCGTCAATTTTCAGACCGCCGGCGATATTCGCGAATTGGCGGCAGCGGGTATTCCGCAAGAAGCCGTCGGTGCATGGCACGAGCGTTTTCCTCAGGGGCTCAACGAACTTCAGATTGAAGCCGTCAACGAACAGGGGGTGCTGGACGGCAACTCGCTCATGGTCGTCGCGCCGACCAGCGCCGGAAAGACTTTCATCGGAGAGTTGGCGGCGCTCAAGGCCATCGCGGATGGCCGCAAAGCAGTGTTCCTGCTGCCCTACAAGGCGCTCGTGAACGAGAAGTTTGAAGACTTCTCCGCCCTCTACGGGGATCGCCTGGGACTCCGCATTGCTCGTTGCTCCGGCGATTGGCAGGATCAGGTCGGTACCGTCCTGCGCGGCAAATACGACATCGCGTTCTTCACCTACGAGAAGTTCCTCAGCATGTCGGTCGCCTCCGCGCACATGCTCAATCAGATCGGCCTGGTCGTGCTCGACGAGGCGCAGTTCATTACTGAGCCAGGTCGCGGCATGGTGGTCGAACTATTGCTCACGAACCTTGTGAGCGCACGCCAACGTGGCGTTGCTCCGCAGCTCATCACATTGAGTGCGGTTATTGGTCACGCCAACCATTTCGACCGCTGGCTCGGTTGTGGCCTTCTGCAGACTGATCATCGGCCGGTCCCACTCGTGGAAGGCGTGCTTGACCGAACTGGTGCATTGACTCGCTTGGGGCTGAACGGTCCCGAAACCGTTCAGCTTCTCGATCGCTTTACGGTCCGCCAGCGTGGCCAAAATCAATCGTCGCAAGACGTGATCGTACCGCTGGTGCGCCATCTCGTTGATGCCGGCGAGAAGGTCATCGTCTTCCGCAACGCGCGTGGTCCGGCGAGCGGCTGCGCAAACTATCTTTCTGCCGAACTTGGGCTACCGGCGGCGCAAGAAGTGATCGAGGCGCTGCCCGAAGGCGACCTTTCCCAAATGTCCGAGAGCCTTCGACGTGCGCTCGCGGGCGGCATCGCATTCCACAACGGTGATCTCACCCGAGATGAGCGTGTTGCTGTGGAGCGCGGCTTCCGCCGGCCGGACGGCCAGATCAGAGTGCTCGTCGCGACTAGCACGGTTGCTGCGGGTGTAAACACGCCTGCATCTACAGTCATCATCGCTGAGACGGATTTCCCCGGCCGGGAACGGCAGCCGTACACGGTGGCCCAATACAAGAACATGGCAGGACGTGCCGGTCGCCTGGGGTTTGAGGTTGAGGGAAAGGCTATCCTGCTCGCCGATACTCCCATGGAGCGCAATCAACTCTTCCGGCAATACGTGCAGGGTCAACCAGAGGTCATCCGCTCGTCGTTCAACCCGGAACAGCCGGGCACTTGGGTAATCCGTTTGTTAACGCAGGTGAGGGAAGTTCCGCGTGGAGCAGTCGTCGACCTGATCGCAAATACCTACGGTGGTTATCTCGCTGCCCTTCAAAATCCGGCATGGCGTGATCGTATGGTCCCCACGCTGGAGCGGCTTTTGGACCGCATGATCGCCGATGGCCTGATCGAGGTAGACGGCGACAATCTCCGCCTGACCATGCTTGGACGAGCCTGCGGCGAGTCGCCTCTAACGCTCGAGTCGGCGATGCGTCTGGTTGAGCTATTGCGTCGCGTGGATCCGGCCGACGCTACCTTGGAAGCCCTTCTTGTACTCATCGAATGCTTACCGGAGCGCGACGATGATTACACGCCGCAGACCCGCAATGGTGAGCCGCGCTGGCAGCAAGCAGCACTTGGCCGCTTCGGCCATGGCGTGGGCCGCGCATTGCAGCATCGCGCTGAAAGCGATGTTGCATTTTATGCCCGATGCAAACGCGCGCTTATTGTGAGCGACTGGATTGCTGGCGAACCGACCAACGATATCGAGGCTCGCTATTCGTCCAATGCCTTCGTCAGGGTAGGGCACGGCGACATTCGCGGCTACGCGGATGGGTCCCGGTTCCTATTGGAGTCTGCGCTGCGCATAGCCGCGATTTTACTGGAGCGGGCCGAGGATCAGGAGGCGGCCGCTCTATTGCTCACGCGGCTTGATCTCGGCCTCCCAGCCGAAGCGTTGCCGCTGAGCGCGTTAGGCTTTCCGCTGACACGAGGGGAAATTCTTGCACTATTTCGAGCCGGTCATTTGAACCGAGAGGCCGTCGCCGCCTTAACCGCCGACGCTGTCTCCGCAATCATTGGCCGCCGTGGACGTGAGCTGCATGAGGCGATGCAGCCCGTCCTAGTGGAAGGAACCTAAGTGGCTTGGAGTCTTATCAGCACATGTCTGAATCTGAAGAGTGCTAGCGAGAAAAGGGCGGCACGATGAAGCAGTTGGGAGGAGTCGTTGGTGCGACGGGAATTGCAGGACCGAGCGGCGCAGGTGAAATGATAGTACACTCGCTTCGAAAACGACGGCTCAATGGAGAGCTTTATCAGCGTGACCCGAAGGTAGAGTCGCTCATGGCCGAACTGGCGGCCTTGCCGCGTGACGAGTTGATTGCCAAGGCTGCAGTCACGAAGCGCTCCGACCCAGCCTATGTCCCAAGCGAATGTCTTGTCTATTTCATCCGTGCGAGCCGTCGCGACAACAATGAAGCGTGGTTCGAACGTCTTTACCGGATCCTGATCGAACGGGTACTGCGCAGCCTGCCTCGCGCAGAAAGCTCCGACGGCAAGACCGAGTCGCTGACCCGTGGGGCCGTCCGCGACAAGGTGTTCAGTCGGTTTGTCGAGCTTCTTTCGACAGATCGCACTACTTACGTCGACAAGCTCGATTATTTCGAAGTACGCTTCGACGGCGCGGTCGCCAGCCTGCGGCGTGATGCACAGGAACAGGCCTGGCGCGACGAGAACCGCTCTCGACCCCTCGAATATGATGAGGAGAGTGGAGAGCTGTCGCCCGAGGTGGAGGCTGCGGTAGGGGCTTACGATCCTTTCGCTGCTTCGGATTTTGACAATGCGGCTTACCGGTTGCGCCTTGATGCAGCGATTGAGGCTTTACCACCAGAACAAAGCAGGATCATTCACATGCTGAGACAAGGCTTCCCTATCGACTCGAAGGAGGCGGACGTCATGACCGTCGCCAAGGCCTTGGGTCGCTCCGAAAAGACCATCCGAACCTACCGTGATAAGGCCTTCGCCGCCCTGCGAACTGCCATGGCCGATGGAGACGAGCAATGAGTCCCGCCGGCGAAAGTCCATCCCGGGAGTCGGTCCTCGACGCGTTTGCGGTTGAGAGCGAGGCTGGCCGGCCGACATTGGAGCGCTATCTGCGGCTTTACCCCGAATACGCTCGAGAACTCATCGACCTCTCGCGCGAACTTAGCCGCCAATCTTACGAAGACGACGCCCCTCTTTCTGCTGCCGATCAGGCGTTGGTTGATGCCGCTTGGTCGCAGCATGCGAAGGCTGTGCCGGCGGCGGTGGATCCTTTCGCGGCGCTGACGGTTGATGACTGGCGCGCGATCGCACAGGTTTTGGACGTGCCCCGACAGGTTGTGACTGCGTTGCGCGAACGGCGGGTCTCGCTCGTCAGTATCCCGCGGCGCTTCCTGGCGATGTTGGCTGATGCGATGCGCAGCTCTGTCTCGGAGTTGGAGTCATTTTGGGCGCCCGCCCCGCTGCTCGTCGCGCGCAGCTACAAGTCCGATAATAAGCCGACGGTCGGCGAGCAGGTTACGTTCGAGCAGGTGCTGATCGACGCCGGCGTTCCAGCCGAGAAGCGCGCCAGCTTGATGTCGGAGGCAGATTGACATGGATGGCGTGGAGCTCGCCAGGCAAGTCGCCGCCGAGCTTCACGCTCGCCTCGTTGCGTCCGGCGCCGATCCTTGGTCGCCTTATGACTTTGCTGTCGCTGAAGCCAAGCGGCGCGGGATCGACGTCGAGCCGACGGCGGTGGGGGCGGCCGTCCTCAACGGTGGTCGAGCGACTTTTGTCGCCGCCGACGATCTGATCCTCCACGAAAATATCGGATCGCGGTTTGAGCAGGCCTTCCTCGTTGCTCACGAACTTGGCCATGTCGAGCTCGGCGATGATCCTGATGGTGAAGCGGCGCCGACAATTGATCCGGCGCGCACTGCGGAGCCATCGCCGGTCGGGATCGATCGGGTCATCGACTATGGGCGCAGGCAACGTCGCGAGGTCCAGATGGATCTTTTCGCACGCGAACTGCTGCTCCCACGCAACGTCGCGCGAACGCTTCATCTTGATGGAGGGCTTTCCGCTTCTGAGATTGCTGTAAAACTCGGCGCGCCATTTGAGGTCGTGGCGCAGCAGTTGTTCGATGCATTGCTGTTGCCGCAGGTCCCGCCAACATCGGTAGAAACGCATGTCGAACGCCCGCTGAATTCGCTGCAAGCTGCGGCGGCAGCACATCGAGGGGAAGCCTATCTGCTCGAAGCGGGTCCCGGCACCGGCAAGACCCAGACGCTGATTGCGCGTGTCGAAGGCCTTCTGGAGGAGAATATCGATCCAAGGCGTATCTTGCTGTTGACCTTCTCGAACAAAGCGGCGGGTGAGATGGCAGAGCGGATTGCACGGAAGCAGCCCGAGTCAGCGGCCGCGATGTGGATCGGCACGTTCCATGCCTTCGGCCTCGATGTCATCCGTCGCTTCCATGCGGAGCTTGGGCTGCCAAAAGACCCGCGGATGATGGATCGAACCGAAGCGGTTGAACTCCTCGAGGAGGATTTCCCGCGGCTGAGGCTCGCGTATTACCGCAACCTTTACGATCCGACCCAGATCATTGCCGACATGTTGGCGGCCGTCTCACGGGCGAAGGACGAAGTGGTCAACGCCGAGGCCTACGCCGAGCTCGCCGCCGCCATGTTGGCGAAGGCAGCTGATGCAGAGGCGCGTGAAGCCGCAGAACGTGCCGGCGAAGTGGCCCGTGTCTACTTGGCCTATGAACAGCGCAAGCGCAACGCGCATTGTATTGATTTCGGCGACCTTGTCGCTTTACCAGTGCAATTGCTCGAAAGAGACGCAGCCATCTGCGCGGCACTACAGGCGCAGTATGATCATATCCTTGTGGATGAGTATCAGGACGTGAACCGTAGCAGCGTGCGTTTGCTGAGAGCGCTGCGGCCAGATGGCCGAAATCTTTGGATGGTCGGGGACGCCAAGCAGTCGATTTACCGTTTTCGCGGCGCCTCATCCTTCAATATGGCCCGATTTGGCGAGGAAGACTTTTCCGGCGGTAAACGTGGCCGATTGAAGCGCAATTATCGCTCAGTGCCGGAGGTTGTCGCCAGCTTCTCCAGCTTTGCAATCAAGATGCGCGCGGGTGATGCGGAGAGTGGCCTTGAGGCGGAGCGCGACGCTAACGGTCACAGGCCGGAGTTACGGACGGTGCAACGCGCCGAACAGCAGCCGGTCGCTTTGGCCGATGCAATCGAGAAGCTTCGGTGCGAGGGCTACGCCTATCGCGACCAGGCAGTCCTATGCACCGGCAACGAGAAGCTGTCGACGATCGGGCAGGAACTAGAACGGCTCGGCGTGCCCGTCCTGTTTCTGGGCAGTCTGTTCGAGCGGAGTGAGGTCAAAGACCTCCTTGCTTTCCTCTCGATCCTTGTCGATCGACGAGCCATGGGTTTGGTGCGCATCGCCTGTTGGCCCGACTTCACCATGTCGTTTGCAGATGTGGCTGCTATCTTCGACCACCTCCGGGCGTCCGAGCAAGCGCCAGCCGAATGGCTTCAACATGGGAATGCGATCCCTGGTGTCAGCGATGCTGGCCGCCAAGTGCTTACCAAGCTCGCGGCCGTGCTCGGCGGGTTTGAACAAACGTCTTCGCCGTGGACTGTGCTTGCGACGCTTCTGCTCGATCGCACCCACATTGCAGCGCGCCTTTGTGCATCCGAGCAGCTTGCTGATAGAACGCGCGGCATCGCGATTTGGCAGTTCCTCAACTTCCTTCGAGTTCAGCCAGCGGGCCATGGAATGCCGATCACGCGCGTGCTCGATCGTGTCCGCAGGCTCGTGCGGCTCGGCGACGACCGTGACCTGCGTCAACTTCCGGCTGCCGCCCAGCATCTCGACGCTGTGCGGTTGATGACCATCCATGGCGCCAAGGGCTTGGAATTCGGTGGTGTCCACATTCCAGGGCTCAACAGCGACACTATCCCGCGCACGCCGCCGGCGCCGCCCTGTCCGGCGCCCGACGGAATGATCGCGGGGACAGACGGTAGTGCGCTGGAGGCATTTCGCGCGGGTCAGGTCGAGGAGCAAGAGTGCCTATTCTACGTGGCGCAGTCGCGCGCCCGCGACCGCCTGATTCTTTATGCTCCGACTGAGAAGAGCAACGGCCACAACCGACCGCTGTCGCCATTCCTCGACCGTCTCGGAGCCACCTTGATGCGCCGCTCGATCGTGCCGGCGCGGTCGCTTCCCATAGCGGCGGAAGCTCAGGATATCAATTTGGTCGTCGACGGCGGGTTGCGGTTCGGGGCAATGCAGATCGCTCTTTATGAATCCTGCCCGCGCCGCTTCTTCTATACGCACGTCCTGCAGGTTGGCGGTCGCCGGACCGCGACCGCGTTCATGCACCTGCACGAGGCTGTTCGTGTAGTAGTCGGAGCTGTGATCGCGTCTGGTACCCCTATGACTGAACGCGACCTAGAAGATCACACTGACGCGGCGCTTGCCGAGCGGGGTCTGGGCGACCACGGCTATCGCGCGGAATTTCGCGATCTGGCCCTGGCGATGTTGCGTTTCTTTTTAGCGAACCGCGTCGACGCGGCTGTCGAGACCCCTGTTGCGCTCAGCCTCAATTTCGGCGCTGAGGAGATCGTGGTGCGGCCGGATGAGGTGCTGGTGCGGCCGGGCGGAGTTCGAGCTGTCCGGCACATCCGTACGGGCCATATGCGATCGGCCGAGAGCGACGATGTCGGAGCGGCCGCGCTGATGCTGGCGGTCCAGCAGGCCATGCCCGGCGCTGTCGCGGAGCTTGTCCACCTTTCCGACGGTGAGGCGCATGCCATCGCGTTGTCGGATCGGAAGCTGAAGGGTCGCAAGGAAAAGCTCGTAGAGTTTCTTGGCGACATCCGGGCAGGACGGTTTCCGGCCGAGGTTTCGTCGCGCACCTGCCCGAACTGCCCGGCCTTTTTCATCTGCGGCCCGACACCCGATGGCCCCTTGAAAAAAAAGTTCGCCTGA
Protein sequences of DBSCAN-SWA_4 >NC_022536|20804:28711|20804_23813_+|WP_022557080.1|DBSCAN-SWA MIRGAFIGIDRHADPFIGDLTGAARDATALWAVLSDSIADLNAPLITSEAATLVAMRDVLDATLGAADEDDVVILGFAGHGTQDHRLVLHDTSVADLPNTTLGMDELAERFRETRARAVILLLDCCFSGGAPARVIDAGLVPRDAVSFRLVDVAGQGRILFAASSADQEALEDPQSRHGLFTKAVIDVLLEAGAPLGVLELVGRVTRLVTANAGRFGYIQSPTMFGQTEGDLVLPPGQIGARYREAFPERVNFQTAGDIRELAAAGIPQEAVGAWHERFPQGLNELQIEAVNEQGVLDGNSLMVVAPTSAGKTFIGELAALKAIADGRKAVFLLPYKALVNEKFEDFSALYGDRLGLRIARCSGDWQDQVGTVLRGKYDIAFFTYEKFLSMSVASAHMLNQIGLVVLDEAQFITEPGRGMVVELLLTNLVSARQRGVAPQLITLSAVIGHANHFDRWLGCGLLQTDHRPVPLVEGVLDRTGALTRLGLNGPETVQLLDRFTVRQRGQNQSSQDVIVPLVRHLVDAGEKVIVFRNARGPASGCANYLSAELGLPAAQEVIEALPEGDLSQMSESLRRALAGGIAFHNGDLTRDERVAVERGFRRPDGQIRVLVATSTVAAGVNTPASTVIIAETDFPGRERQPYTVAQYKNMAGRAGRLGFEVEGKAILLADTPMERNQLFRQYVQGQPEVIRSSFNPEQPGTWVIRLLTQVREVPRGAVVDLIANTYGGYLAALQNPAWRDRMVPTLERLLDRMIADGLIEVDGDNLRLTMLGRACGESPLTLESAMRLVELLRRVDPADATLEALLVLIECLPERDDDYTPQTRNGEPRWQQAALGRFGHGVGRALQHRAESDVAFYARCKRALIVSDWIAGEPTNDIEARYSSNAFVRVGHGDIRGYADGSRFLLESALRIAAILLERAEDQEAAALLLTRLDLGLPAEALPLSALGFPLTRGEILALFRAGHLNREAVAALTADAVSAIIGRRGRELHEAMQPVLVEGT >NC_022536|20804:28711|25309_28711_+|WP_022557083.1|DBSCAN-SWA MDGVELARQVAAELHARLVASGADPWSPYDFAVAEAKRRGIDVEPTAVGAAVLNGGRATFVAADDLILHENIGSRFEQAFLVAHELGHVELGDDPDGEAAPTIDPARTAEPSPVGIDRVIDYGRRQRREVQMDLFARELLLPRNVARTLHLDGGLSASEIAVKLGAPFEVVAQQLFDALLLPQVPPTSVETHVERPLNSLQAAAAAHRGEAYLLEAGPGTGKTQTLIARVEGLLEENIDPRRILLLTFSNKAAGEMAERIARKQPESAAAMWIGTFHAFGLDVIRRFHAELGLPKDPRMMDRTEAVELLEEDFPRLRLAYYRNLYDPTQIIADMLAAVSRAKDEVVNAEAYAELAAAMLAKAADAEAREAAERAGEVARVYLAYEQRKRNAHCIDFGDLVALPVQLLERDAAICAALQAQYDHILVDEYQDVNRSSVRLLRALRPDGRNLWMVGDAKQSIYRFRGASSFNMARFGEEDFSGGKRGRLKRNYRSVPEVVASFSSFAIKMRAGDAESGLEAERDANGHRPELRTVQRAEQQPVALADAIEKLRCEGYAYRDQAVLCTGNEKLSTIGQELERLGVPVLFLGSLFERSEVKDLLAFLSILVDRRAMGLVRIACWPDFTMSFADVAAIFDHLRASEQAPAEWLQHGNAIPGVSDAGRQVLTKLAAVLGGFEQTSSPWTVLATLLLDRTHIAARLCASEQLADRTRGIAIWQFLNFLRVQPAGHGMPITRVLDRVRRLVRLGDDRDLRQLPAAAQHLDAVRLMTIHGAKGLEFGGVHIPGLNSDTIPRTPPAPPCPAPDGMIAGTDGSALEAFRAGQVEEQECLFYVAQSRARDRLILYAPTEKSNGHNRPLSPFLDRLGATLMRRSIVPARSLPIAAEAQDINLVVDGGLRFGAMQIALYESCPRRFFYTHVLQVGGRRTATAFMHLHEAVRVVVGAVIASGTPMTERDLEDHTDAALAERGLGDHGYRAEFRDLALAMLRFFLANRVDAAVETPVALSLNFGAEEIVVRPDEVLVRPGGVRAVRHIRTGHMRSAESDDVGAAALMLAVQQAMPGAVAELVHLSDGEAHAIALSDRKLKGRKEKLVEFLGDIRAGRFPAEVSSRTCPNCPAFFICGPTPDGPLKKKFA >NC_022536|20804:28711|23878_24736_+|WP_022557081.1|DBSCAN-SWA MKQLGGVVGATGIAGPSGAGEMIVHSLRKRRLNGELYQRDPKVESLMAELAALPRDELIAKAAVTKRSDPAYVPSECLVYFIRASRRDNNEAWFERLYRILIERVLRSLPRAESSDGKTESLTRGAVRDKVFSRFVELLSTDRTTYVDKLDYFEVRFDGAVASLRRDAQEQAWRDENRSRPLEYDEESGELSPEVEAAVGAYDPFAASDFDNAAYRLRLDAAIEALPPEQSRIIHMLRQGFPIDSKEADVMTVAKALGRSEKTIRTYRDKAFAALRTAMADGDEQ >NC_022536|20804:28711|24732_25308_+|WP_022557082.1|DBSCAN-SWA MSPAGESPSRESVLDAFAVESEAGRPTLERYLRLYPEYARELIDLSRELSRQSYEDDAPLSAADQALVDAAWSQHAKAVPAAVDPFAALTVDDWRAIAQVLDVPRQVVTALRERRVSLVSIPRRFLAMLADAMRSSVSELESFWAPAPLLVARSYKSDNKPTVGEQVTFEQVLIDAGVPAEKRASLMSEAD |
4 | Aureococcus_anophage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
39414 : 42406
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NC_022536|39414:42406|DBSCAN-SWA GATGACGGATGCCGCGGAAGTTCTGACCGAAGCGACAGCGCTCGCATTCCGCGGCGTCCTCACAACCGGCGTCTACTGCAGGGCGACCTGCATCTCCCGTCCTCCCCGGCCAGAGAACATGCGCTGGTTCGGGTGCGTCTCCGACGCACAAAGAGCAGGCTTTCGCCCGTGCCTGCGATGCCGTCCCAATGATGAGGAATTCGCAAGGAGGAATGCCGATCTGGTCGCCGAGGCGTGCAAGCTGATTGATGCCGGAGACAGCTCGCCGACCGTCGCGGCGCTCGCGCACGCCCTGGGGATCAGTGAAGGGCACTTCCACCGAACGTTCCGTGCACACACGGGAATGACCCCGCGAGCATACATGCTCGAGAGGAGGGCTCAACTCGTCCGGGAAGGACTAAAACCGGGCAGGACCATTACATCGACGTTCTATGACGCAGGATATGGTTCCAGCGGAAGGTTCTACGCGGTATGCAGCCGCGCGCTCGGAATGGCTCCGGCAGACTACAGGGCCGGCGGGCGCAGGGAAACGCTGCACTTTGCAGTCGGCCAGACATCGCTTGGTTCGATCCTGGTTGCATCGAGCGCCAAGGGCGTTGCGGCAATTCTCCTTGGAGATGATCCTGCTGCCCTCCTGACTGATCTTCAGGATCGATTTCCGAACGCGAACTTTGTCGGAGGGGACGAGCGATACGAGGATACCGTCGCCAAGGTTGTTGGAATGGTCGAGCGCCCGGAGGTCGGCCTCGACTTACCGCTCGACCTGCGTGGGACTACCTTCCAGCGTCGCGTCTGGCTGGCGTTGCGCAATATTTCTCCGGGCGAGATCATCAGCTATGAAGATCTCGCAGCTAGGGTTGGATATCCGAAGGCCGGTAAGGCTGTGGCTAGCGCCTGCGCGAGCAATCCACTCGCCATCGCTATTCCCTGTCACCGTGTTGTGAGAAAAGATGGAGCGTCATTCCGCTATACCTGGGGAATTGAGCGGAAGGCTGAGCTGCTGCGAAGAGAGGGACGGCAGTAAATTGAAATTATGTTCCCGGACGACTGCGCCACGGTGGCTGGCCCGATTTTTGAGTGCTCTTGCCGAGCGAAACAGCTAATCGGTTACACCAGACCGTCTTGCTCCACCGGCTCGCCAGATTTCGATACGATTGACGATCCATAGGGCTCACGCGACAAAATGATCAGCGCATCGTCCGGAAGCGGGCGAGCAAGGTGCTTTGCCTCGTCCCAGGGCGCCCGCATCCAGATATCGGTTTCTTCCTTCGTCAGCAGCAGGACAGGCATGGCCTTTTCGTGGATCGGCTTAACGAGATCATTGGGATCCGTGGTCAGGAAACCATAGAGATCATCCGTGGTCATTCCATCCTTTACCTTCCGGACGCTCCTCCATTGCGGCAGGTGGATACCAGCGAAGAACATCAGCGACTTTTCCTCGTCGCGAGCGAACCAGGCGTTAGGCACATTGCCACCTGCCTGTTTGCTCGCTGGGTCCGGCTCGGCGAAGCTGGTGACCGGGGCGAGACACCTGTGCTCGACGCCGAACCAGCGCGTCCAGTGGGGCAGGTTGAGCTTGCGGACATTGGTTACGCCACGGTCCGGCTCCATACGAATGAGTTCGTTGATATCGGCGGCTTTGCCCTTGGCCTTGAGTTTATCTGCTCGAGCTTCGGCGGCCTTCTTCTGCACGAAAATCGGCGAGGGCAGGCCCCATCGCGCATGCACAAGCTGCTTCTTTCCGTCAGCCGTGTTGCGGACGATAGGCCCCATCTGGTCGGGGTTCATCTGATAGGCCGGCATCAGGTTGATGAGGCTCTCGGCGTCTTGGGCCCATTTCGAGACCCAGTCCTTGTCTTCCATGCGATAAAGATTGCACATGGCAGCAATTCCTCCGTTGCGACCCAATGCAGTTTGACGGATGAGTTCCGCCTGCGTCTCTAGTCGAGCTGGTAAACGTCGGCGTTGTCCTGCCGTTCCCGCAGGCCCTTATAGGAAGCATGCCGCAGTTTATCGTCGCCGGTCCAGGCCCGAAACTCGATCTCGGCGATCAAGGTCGGCGCGACCCAGACGATATCTGATTTTTCAGCAAAGGCGACAGGCGGCTGCTTTCGTTTCCAGCGGAGTTTGTCGAGTATCTTGCGCAGCCTGATCATCTCGGCTTCTTTGAAGCCCGTGCCGACGTTTCCGACATGAATGAGCTCGTCGCCACGATAGGCAGCCAATACCAGCGAGGCGAACCCGGTAGACGATACCGACGATGGCTCGTAACCAACGACCATGAAGGCTTCGCTTTCAACGCATTTGACCTTTACCCAGTCGCCGGTTCTGCCGGAGCGATAGGGCTGATTGAGACGCTTTCCGACGATCCCCTCAAGGCCAAGGCTGCGGACATGCTCCAACAGAACTTCCGGTTCGGCGTCGAGTGTTTCCGAGACACGGATGGCGCCGTCATTTGCCTTTATCGTATCCTCGAGAAGATGCCGGCGTGATCGGTATTCCATCTCCCGCAGATCGTGGCCGTCGAGGTATATAAGATCAAAAGCGTAGAGGACGGCTTCAGATGCTCGATTTCCGACCGCCTTGCCCGAAGCCCCCAACGACTTCTGGAGCAGCCCGAAATCCGGCCGCCCTTCATCGTCAAGCACCACCGCCTCGCCATCGATAATCATCGTTGCCGGGCCGAGCGCTCGGGCTGCCTTTTCTATCGCCGGAAACCGGTGCGTCCAGTCATGACCCCCGCGGGTAATGATGCGGACGCCTGTCGGCTCGATGTGGACCGCCAGCCGATAGCCATCCCACTTCAGCTCCCAGCTCCACTGGTCACCTTTTGGCGGATGGGCTTTCAGTTGCGCAAGCGCCGGCTCGACGCGGTCGGGCATCGGATCGAACAGAAGCTGTGGCTGTGCAGGGTTACGCTTCCGACGCGGCTTTGAGCGGATCGGGGCTTCGATATCACGAAGGAGCGGCTTTGATCTCGGTGGCTTCGTCAT
Protein sequences of DBSCAN-SWA_5 >NC_022536|39414:42406|40520_41294_-|WP_022557098.1|DBSCAN-SWA MCNLYRMEDKDWVSKWAQDAESLINLMPAYQMNPDQMGPIVRNTADGKKQLVHARWGLPSPIFVQKKAAEARADKLKAKGKAADINELIRMEPDRGVTNVRKLNLPHWTRWFGVEHRCLAPVTSFAEPDPASKQAGGNVPNAWFARDEEKSLMFFAGIHLPQWRSVRKVKDGMTTDDLYGFLTTDPNDLVKPIHEKAMPVLLLTKEETDIWMRAPWDEAKHLARPLPDDALIILSREPYGSSIVSKSGEPVEQDGLV >NC_022536|39414:42406|39414_40437_+|WP_022557097.1|DBSCAN-SWA MTDAAEVLTEATALAFRGVLTTGVYCRATCISRPPRPENMRWFGCVSDAQRAGFRPCLRCRPNDEEFARRNADLVAEACKLIDAGDSSPTVAALAHALGISEGHFHRTFRAHTGMTPRAYMLERRAQLVREGLKPGRTITSTFYDAGYGSSGRFYAVCSRALGMAPADYRAGGRRETLHFAVGQTSLGSILVASSAKGVAAILLGDDPAALLTDLQDRFPNANFVGGDERYEDTVAKVVGMVERPEVGLDLPLDLRGTTFQRRVWLALRNISPGEIISYEDLAARVGYPKAGKAVASACASNPLAIAIPCHRVVRKDGASFRYTWGIERKAELLRREGRQ >NC_022536|39414:42406|41353_42406_-|WP_022557099.1|DBSCAN-SWA MTKPPRSKPLLRDIEAPIRSKPRRKRNPAQPQLLFDPMPDRVEPALAQLKAHPPKGDQWSWELKWDGYRLAVHIEPTGVRIITRGGHDWTHRFPAIEKAARALGPATMIIDGEAVVLDDEGRPDFGLLQKSLGASGKAVGNRASEAVLYAFDLIYLDGHDLREMEYRSRRHLLEDTIKANDGAIRVSETLDAEPEVLLEHVRSLGLEGIVGKRLNQPYRSGRTGDWVKVKCVESEAFMVVGYEPSSVSSTGFASLVLAAYRGDELIHVGNVGTGFKEAEMIRLRKILDKLRWKRKQPPVAFAEKSDIVWVAPTLIAEIEFRAWTGDDKLRHASYKGLRERQDNADVYQLD |
3 | Acanthamoeba_polyphaga_mimivirus(33.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
57497 : 59654
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NC_022536|57497:59654|DBSCAN-SWA GATGACTAACCTCAATGGAAAGATTGCACTCGTAACTGGCGCCTCGAGCGGCATCGGCGCTGCCACCGCCATCAAGCTCGCGAAGGCCGGAGTCAAGGTCGGCATTGCCGCCCGCCGCACCGAAAAGCTCGAGGATATCAAGCAGCAGATCGGAGCCAACGGTGGTCAAGCTCTGGTTCTGCAGATGGACGTGGTCGATCCGGCCTCGGTCGAAGCAGGCGTCAAGACGCTGATCGATACCCACGGTGCAATCGACATCCTCGTCAACAATGCAGGCCTGATGCCGCTCTCCGATATCGATCAGTTCAAGGTCAATGAGTGGCACCGGATGGTGGATGTGAATGTGAAGGGCCTGTTCAACACGACGGCCGCCGTCCTGCCTCAGATGATCAAGCAGCGGTCCGGGCACGTCTTCAACATGTCCTCGATTGCGGGTCGCAAGGTGTTCAAGGGATTGTCTGTCTACTGCGCCACCAAGCATGCAGTAGCCGCCTTTTCGGATGGTCTGCGCATGGAGGTGGGGCCGAAGCACAATATCCGGGTTACCTGCATTCAGCCGGGTGCTGTCGCAACCGAGCTCTACGATCACATCACCGATCCCGGCTACCGCCAGCAGATGGATGACCTCGCCGGCCAGATGACCTTCCTCAATGGTGAGGACATCGGCGACACCATCGTCTTTGCTGCGCAGGCCCCGGCGCATGTCGATGTTGCCGAGCTATTCGTCCTGCCCGTGGAACAGGGTTGGTGATGCACCGCCCTTTTGGCCCCTGCCATTGCGCAGGGGTCCCGAAGGAAATCCAGATGCCCGAAGGAGATCGAGATGAAGGAACTACCCGCATCGCAGGTTTATCGCCTGCTTGAACCTGGCCCGATCGTGATGGTATCGACGCTCGACAACGGCAGGCCCAATGTCATGACCATGGGTTTCCACATGATGATTCAGCACGATCCGCCGCTCATCGGATGTGTGATCGGCCCGTGGGATCACAGCTATCAGGCGCTTCGTAACACCGGTGAATGCGTGATCGCCGTGCCCGGACTGGATCTGGCCGAAACCGTCGTCGATATCGGAAACTGTTCCGGCGCGGATGTCGAGAAGTTCGAGAGATTTGGCCTCGAGACCAAGCCCGCGGAGCAGGTCTCAGCTCCGCTTCTCGAAGATTGCCTGGCCAACATCGAATGCGTGGTAATCGATGACAGGCTGCTCGATCCCTACAACCTCTTCATCCTTGAGGCGAAGAGAATCTGGCTCAACGAAAGCCGGACGGAACGGCGAACCCTGCACCATCGCGGGGATGGGACCTTCGCCGTCGACAACGGCACGCTCGACCTGAACCACCGTATGGTCAAGTGGCGTCACCTGCCGTGAGCACCGTGCGCCAGGCCGTTGCCTCATTTGAAACCACCATGGAAGGAATATCCGATGACGAACATCGCCAACAAGATCGTCCTTATCACCGGAGCGAGCAGCGGGATAGGCGAAGCGACCGCGCGCACTCTCGCCACTTCAGGCGCTGCCGTTGTGCTGGGGGCAAGACGAACGGATCGTCTCGAAAAGCTTGCCGAGGACATTACCGCTGCCGGGGGTAGGGCAATCTACAGAAGCCTTGATGTGACTTCTCGTGAGAGCGTCCGGTCGTTCGCGGATGCGGCAGTGCAGGAGTTCGGCCGGATCGACGTGATCATTAATAATGCCGGCATCATGCCGCTGTCACCCATGGCATCTCTGAAGGTGGACGAGTGGGACCGGATGATCGACGTCAACATTAAGGGCGTCCTGCATGGCATTGCAGCGGTTCTGCCATTGATGAACAGGCAGGGATCCGGTCAGATCATCAATATCTCGTCGATCGGCGGCTTTGCCGTCTCGCCGACGGCCGCCGTTTACTGCGCCACCAAATATGCAGTTCGCGCGATCTCGGACGGGCTGCGCCAGGAGAACGACAAGCTCCGCGTCACCTGCATCCATCCAGGTGTGGTGGAATCTGAACTGGCCAACACGATCACCGATCCGGTGGCGGCGCAGGCCATGGAGAGTTATCGCCAGATCGCTTTGAAGCCCGAGGCGATAGCAGCGGCCATCATGCATGTCATAGACCAGCCTGACGAGGTGGACACGAGCGACATCGTCGTTCGACCGACGGCCAGTGCCTGA
Protein sequences of DBSCAN-SWA_6 >NC_022536|57497:59654|57497_58247_+|WP_022557116.1|DBSCAN-SWA MTNLNGKIALVTGASSGIGAATAIKLAKAGVKVGIAARRTEKLEDIKQQIGANGGQALVLQMDVVDPASVEAGVKTLIDTHGAIDILVNNAGLMPLSDIDQFKVNEWHRMVDVNVKGLFNTTAAVLPQMIKQRSGHVFNMSSIAGRKVFKGLSVYCATKHAVAAFSDGLRMEVGPKHNIRVTCIQPGAVATELYDHITDPGYRQQMDDLAGQMTFLNGEDIGDTIVFAAQAPAHVDVAELFVLPVEQGW >NC_022536|57497:59654|58922_59654_+|WP_022557118.1|DBSCAN-SWA MTNIANKIVLITGASSGIGEATARTLATSGAAVVLGARRTDRLEKLAEDITAAGGRAIYRSLDVTSRESVRSFADAAVQEFGRIDVIINNAGIMPLSPMASLKVDEWDRMIDVNIKGVLHGIAAVLPLMNRQGSGQIINISSIGGFAVSPTAAVYCATKYAVRAISDGLRQENDKLRVTCIHPGVVESELANTITDPVAAQAMESYRQIALKPEAIAAAIMHVIDQPDEVDTSDIVVRPTASA >NC_022536|57497:59654|58319_58868_+|WP_022557117.1|DBSCAN-SWA MKELPASQVYRLLEPGPIVMVSTLDNGRPNVMTMGFHMMIQHDPPLIGCVIGPWDHSYQALRNTGECVIAVPGLDLAETVVDIGNCSGADVEKFERFGLETKPAEQVSAPLLEDCLANIECVVIDDRLLDPYNLFILEAKRIWLNESRTERRTLHHRGDGTFAVDNGTLDLNHRMVKWRHLP |
3 | Bacillus_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_7 |
64390 : 72532
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >NC_022536|64390:72532|DBSCAN-SWA AATGAGCAATTCCTTACGCAGTGCAGCCGTCCCTTCACGGATAATTCAGGTCCCACAGTCCATCAGCGTCGAAGCACAAGCGGCACTGTCGCGCCTGGTCGACAAGGACGGCAGCCCTATCAATGCGCGGTTCGAAATGCCGTCCCCAGAGGATTTTTCAGGCTGGATGATGATGAAAGCCGCAGTAGATGCGCATTACGCCGCCGCGGCCAAAGATCTTGCCGGGAGTCTGCAGTCTACCGTCACCACAATCGTAGTCGAGCAAGCAACGATCCATGTCGCGACGCCGTACGGAGCATTTCATGAGCGTGGCGCACTCATCGACCTGCATGGCGGCGCATTGGTGTTCGGAGGCGGTGAGGCCTGTCTTGTCAGTGCGCGACGTCAAGCTCACCAGCATGCCGTGCGATGCTACGGCGTCGATTACCGGATGCCGCCTGAGCATCCCTATCCGGCCGCTCTTGATGACTGCTTAGCCACGTACCGTCATGTCTTGGCAGGTCACTCCCCCGACAAGATAATCATCCTGGGGAGATCGGCGGGCGGCAATCTCGCGACCGCCATGCTGCTGCGGGCGAGAGATGAAGGTATGCCAATGCCTGGCAGATTAGTCTTGCTCTCGCCACAGGTTGACCTCACCGAATCCGGTGACAGCTTCCAGACCAACCAGATGATCGATCTCGTTCTGCCCCGCCCGCTAAGACCAAACAACCTGCTCTACGCCGGTGGTGCCGATCTTTCCAATCCCTATCTATCGCCGCTCTTCGGCGATTTGGCGGGCTTTCCGCCGACATTCCTGCAGACCGGCACGCGTGACCTGTTCCTATCGAACACGGTGAGGATGCATCGAGCCTTGCGAAAAGCTGGCGTGGAAACCGAATTGCACGTCTTTGAAGCCATGCCCCATGGTGGCTTCATGGGTGGGACACCGGAAGAGCAGGAACTCGAAGCGGAGATCCACCGGTTCGTCATGGCAAACCGGAACTGAGGCCGGAAACCCACAAAGTTTTGGCGCTGCCCCTCTGTTGTCCTGCTACATCCCTGGCCACATTCCGCCATCAAGTCGGAGGTTTGCTCCTGTGATATAGCCAGCACGGGGGCTCGAGAGAAATGTAATCGCATCCGCAATCTCCTCGAGGGTTCCTACCCGTCCAAGGGGCACCTGAGCAAACAACGGCAGGATTTCTCTCTCTACGTCATTCCAAGGGGCGTCGATGGCCATGCCCCTCTCAATGGCTTTCTTACGGAACGCGGTATCCAAACTGATGCTATGCACTGTTCCAGGCGAAACGGTGTTTGCGGTAATGCCTTGTGCAGCAACGTCTTTTGCGAGCGACGCCGTCATCGCGATCATGGCTGCTTTGGCGGCAGAGTAGTCCGGTCGACTTGCAGGCGGCATCAGGGCCGCCAGGCTTGAGATGTTTATGATCCGACCCCACCGCGACGCTTTCATTGCGGGCAGAACGAGCGAGACGATCCTCACAGATGCGAGCACGTTTCTGTCATAGACAGCGGCCCACGTTTCAGAGTTTGTCGAGGTCCAATCCTCCGCCGGGGCAGATCCGCCAGCATTGTTGACCAGGATGTCGATCGACCCTGCTGCGGCCTTGGAATCCCTAACAAGACGATCGACCTGATCCGACACTGTCAGGTCACCGACTACCGCGAATGCCCGACCTCCAGAGGATATGATGTCATGCGCGACTTCCTCCGTTTTCATTCTGTCGCGACCGTGGACAAGAACGGTTGCCCCTTCCTTTGCGAGGCCTCTGGCCACACCCTCGCCAATTCCCTTGCTGCTTCCCGTGACCAGCGCAACCTTGTTCTGGAGTTGTAAGTCCATGATATTTGGTCCGTGTTATCGTCTCCGGCAGATATGAGGTTCGTCGATGAAACAGGCAATTACGCACTTTAAAGTGCCTGTCGAGGACAGGGCGACATGCCTCGGAGCCGACGGTTCTGTCTCTCATGTCAGCAGGGTGCTTCGGATGATCACAGGACGTTGGAAACTGCCGATCCTTTTCCGATTGTTCGCCGAACCATCATTGCGGGCATCGCAGTTCATGAGAGACATACCTGGCATATCCCAGAAGATGCTAACGCAACATCTCAGGGAACTGGAAAATGACGGCCTCATAAGCCGACACGACTTTCAAGAGCAGCCTCCTCGAGTCGAATACTCGCTGAGTTCAGCAGGCCACGGGCTTATGCCGATTTTGATGGCGGTCAGGGAATTCTCTCGGGATTATCCTGTTGATCGGCGTCGATAACTAACTTGGGTGCCGACGCTTGGCGCGATCCCGAAAATATCGCTTGCATTATGGACCAAATGGTTCCATATAGAACTTATGGTCAAGCGCACTCAAATATCCGCGTCCGTTGGTCGCCCTAGAGAATTTGAACTCGATGAAGCAGTTCGAAAAGCAATGCACGTATTTTGGGATCGTGGATATCACGAAGCATCCCTTCCCGACTTGCTTGAGGGTATGAAACTTTCCAGAGGCAGCTTTTATAAGGCTTTCGTCGATAAGAGAGGCGTCTATCTGCGCGCCCTCGACGCTTACATTGAGGACGCGGTTCGCTCGGTCGGTGAAGCGCTTCATTCCAATCCGTCGCCCAAAGCCGCGATTTTAGAAGCTTTTTCTCAACAGGTGGATCAAGCATCGGGACAGGACGGTTTGCGTGGATGCTTTGTCGTCTTTGCAGCTGTCGAGATGCTCCCAAAGGATAAAGAGGTCGCACCACGTATTTCCCGGCTGTTCAGGCGGCTGCAAGATCTTTATGCGGCGGCGATAATAAGAGCGCAGGCCTTGGGCGAAATCGAGCCCGAGCTGGATGAACGAACGCTTGCGAGGTTCCTCGTGTGTCAGATTCAGGGCATGCGGGTCCTTGCCAAAGCAGGAGCGGATCGCGCTGAGACGAGGGCCATGGTCGAACTGGCGCTGAAGGCGCTTGGCTAACTTCAATTTGGAACCAATAGGTTCCCAATTTTTATCATTTCAAACCGTCGATCGATTTTTTGGTCGATGTGGAGACACTTATGAGAGACACGAGCAATCCCGATGACAAACTCGATCCACGTCGTTGGATTGCACTGATCATCCTTTTGACCGGCGCCTTTTTGCCACCGCTTGACTTCTTCATCGTCAATGTGGCGCTGCCCTCGATCCGGGCAGATTTCAGGGCTTCTGCGTCAACGATGCAGCTGATTATCTCCGGTTACGCCACCACCTATGCTGTGATGCTTATCACTGGCGGCAGGCTTGGAGATCTCTATGGTCGACGAAATGTCTTTCTGGCTGGAATGGTCGGGTTTGCAGCCGCATCTGCCTTATGCGGGTTTGCCTGGTCTCCAGCCGCGTTGGTCGCCGGCCGTATCCTTCAGGGGTTTGCAGCGGCAATTATGGCACCGCAGGCCTTGGCTTCGATCAATGCCATTTTCCCAGACCAAGAAAAATCAAGAGCCCTCAGCTTTTATGCCCTGACATTCGGGATGGCATCGATGGTTGGTCTGTTTCTTGGTGGTGCGTTGATTGCGCTTGATGTTCTTGGTCTTGGATGGAGAGCCATCTTCCTCATCAACCTGCCGGTTATTGCAGTCGCTGCGCCGTCGGCCTTCATCATGTTGCGAGAAACCCGATCTGCGCACCCAAGCAAACTGGATCTTGGCGGAGCACTGTTGATCGCGATTGCGCTATTTGCACTGATTGCGCCGCTGATCGAGGGCCGGGAGCACGGGTGGCCGATCTGGCTTATCGTGATGCTTGCCACGTGTCTTCCGTTTTTCGTCTTGTTTTGGCGGCATGAGAAACATCTGGAATCGACGGGGAAAGATCCGATTCTTGCACCGAGCCTGCTGCAAAACCGTGGGCTACTGCGCGGGCTCCTGGCTGCCTTGTTCTTCTACGCCCTTGCGGCCTTCTGGCTGATATTCTCGGTCTATGAGCAGGGTGGCCTTGGTCGGACGCCTTTCGAGGCCGGGCTGGCGATCCTTCCTGCGGCTGTCGGCTTCGTCCTTGGTCCTTTTGCAAGCGAGCGCATCCTCAGCGTCTTCGGGAGATTTTCCGCTGGCGCCGGCATGGTGCTGCAGGCTGCAGGATTGTTTGGAACGGCGGCCTTGATTTCTACTGGCCTTCCGCAATTCCTTTTTGCTGCGCTCTTTCTTATCGGTGCGGGGCAGGGGATCGCCCTTCCAAACCTGGTGAAGAGCATCGTGCAAAGGGTAGACCGGACGCAGTCAGGATTGGCTTCTGGTCTGGTCAATTCGATGTTTCAGATTGGCGGTGCGTTGGCAGCCGCAATCGTTGGCGGCTTGTTCTTCTCGATCTTGGGGCCGGCCACAGACGTTCAATCCATTGGCCAAGCATACAGCGTTGCAGCGGTTGCAATTGCCATTTGTCTTCTCATCGCGGGATGGCTTTCCGTCAGCCTGACATCGACCCAACCGTTGCGTCGATAAGCGGCGCCATCGCAGCGCATCATCAGCCGATCGCGCGGAAACGGGGTAAAACTCCCCAGCAAAACAGAAACATTCAGAGGTAATCTCCATGAACAACATCTCCACACTACCTCTTGCCGGCAAGTGCGCGCTGGTCACCGGTGCGGCACGCGGTATCGGTGCGGCAATTGCGCTGAAACTGGCCGAGGACGGTGCCGACGTTGCGATCACCTATGAGAAGTCGGTCGAGAAAGCGGAAGCCCTCGTTGCTGAGATCCGCGCCATGGGTCGCAAAGCGATCGCCATCCATGCCGATGCCGCGCGTACAGAAGCGGCAAGGTCCACTGTCGAGCAGACCGTCGCCCGCCTCGGCAGCCTTGATATTCTGGTCAATAATGCCGGCGTCCTTTTCGCAAGTGATTTTTCCACGCAGCCACTTGAGGAGATTGATCTGCAGCTGAACGTCAATGTTCGCGGTGTCTTTCTGATCACCCAGGCAGCGCTGAAATATATTCCCAACGGAGGTCGCATCATCAGTACCGGCAGCAACGCGGGTCTGGCTGTGCCGTTCGCGGGGATCGCGGTCTATGCCGCGACCAAGTCTGCGCTGGAAAGCTTCACGCGCGGTCTCGCACGTGAGCTGGGCTCTCGGGAAATTACTGTCAATCTGGTTCGACCAGGTCCGATTGATACTGACATGAACCCTGCCGATGGCGCGCTCGCAGCCGCTATCCTGCCAAACCTCTCGATTGCTCGATACGGTAAAGCCAGTGAAGTCGCAGAGGCCGTCGCCTTTCTCGCAGGTCCCGGTGCAGCCTACATCACCGGCTCAGGCATTCTCGTCGATGGCGGCATCAGTGCCTGAGCATTGATACGAGGCGGTGCCTCGCCGCTCTGTCCTGTAGGTGATAATGCGGAATTTAAGCGCGAATGGGCTGCGGAGTTCCCTCCGCTGCACTGTGCGCGTAAATTTGGCGAAGCGGGCGGTGGCCATCATCCAGGAATTTCTCCACTCCGTTGGTGGTTTTCCTGCGTTGCGATCGTAGCAAAATAGCAAATATGGGTAGCCTGGTTTCTGCTCAAGCGCCCGTTGAGCGGTAACAGACCTAACGTATGAACATGGAGAAGAATTTGAAGACCATCGGCATACTTGGCGGAATGTCCGCGACGTCAACCCAGATTTATTACCGAGAACTCTGCAGGCTTACCCGCGAGAGCCTGGGAGGACTCCATTCTCCCGAGCTTCTCATACGGTCTGTCGACTTCGACGAAATCGAAAAGTTGCAGGCATCCTCCGATTGGGACGCGGCGGGCCGAATCCTCAATGAACATGCTATCGCGCTGCAGCGCGGAGGTGCCGACCTCCTCATTCTGGCGACGAATACCATGCACAAGGTGGCGGACAAGATCGTGGGCAGGGTGTCGATCCCGTTCGTGCACATCGCGGACGCGACTGCAACCGCTATCCTCGACCGCGGTTTCCGGAGACCGGGTTTGATGGCAACGGCCTTCACCATGGAGCAGACCTTCTACACCGACCGCCTCATCGCTCAGGGGCTGTCGCCAACGATCCCTGAAGCTGATGATCGAACAGAAACCCATCGTATTATCTATGAGGAGCTGTGCCGCGACATCGTTCGCGAGGAGAGCCGCTTGACCTATGAACGGATTGCGCAGCGTCTGGTCGACAAAGGCTGCGACTGTCTTATCCTCGGCTGTACCGAGGTGGGTATGCTGTTAAACCAGGATAACGTAGGCGTTCCCGTCTTCGACACGACCCTCATTCATTGCAAAGCCGCCCTTCAAACCGCTTTACAATGACATGTTGTGTTCTGACGGTCGATCGATCGTCATAACGGTGAGGGGCAATGGTCAAACCTGTTCGCCCAGGTGATGCCTCACCCACTTCGCTTCTTCCGATGCGGTACGCCCGAAATGCCGTCTGAAGTCGCGGCTGAATTGCGCCGGGCTGACATAGCCCACTGACGCCGCCACCTCTGCAATTGTCGCCGTCTGGCGGGCGATCATCAGTCTCGCTTCATGGAGCCGCATCGCCTTCACGTATTGCATCGGACTTGAGCCTGTCAGCTCCTTGAAATGGACATGGTATGAAGGAATGCTCATGCCCACCGCTCTGGCAAGGTCCGCGACGGTGATTTCAGAGCCGTAGTTTTTGCGCAGCCATGCCAGGCTCTCGATCAGTTTTCCCGACGTCCCTTTCCGCTGCAGGGCGGCGATCACTGCGCCGCCTTGCGGCCCTGACATGACCCGGTAATGCAACTCACGCAGGATGGAGCCGCCGAGGACGGCGACCTCAACCGGACTGCCGAGCACCGTCAGCAAGCGCAGCAGGACGTCTTCGATAGTGCTGTCCATCCTACTCGAGAAGAGCCCTTTTGGCTTGACGCTTGCAGGACCCGCGAGCTTTTCAAGCTGCGTGGCGATCTCGGCCGCCATCTGCATGTCGAACTCTAAATAGACCGCGAGCAGTGGCCGCCGTGGACTGGCTGTAGATGTCATCCTAAAGGGAACGGGCACCGAAACGGCAAGATAGTGGTGCTCGTCATAGAGGTAGATTTCCCCGTCCAGAATTCCGTGTTTGCTCCCCTGCAATACGAAGACGGCGCCCGGCTTATAAAGAACCGGAATGTCGTACAGCACTGCTTCCGTGCGTAGGATACGGACACAACTGAGCCCGGTCTGGTTGTAGCCAAGACGAGGAGCGAGTTGTCCGGCCAAGGCGGCAGACTGGTCATGTGTAGAGGGCGGCATTTTATAGTTCATAGGAATTGGCAAAAATTTAAGAGGATTCGCAATCGAACCCGCGTTGTCTCCAGATTATCTGTCACATCAACGGGTAATCAAGGAGTTTCAGAAATGTCGAGCAAAACGTTTTTCATAACAGGGGCGAACTCGGGTTTTGGGCTGGCGATCGCCACTGCTGCAATCGAAGCCGGTCATACCGTCATCGGGACCATCCGCTCCGAAGCCTCGCGCGAAGCGCTGGCAAAGACCCTCCCGGAACTGCGCCCGGTCCTCTGCGACGTCACCGATTTTGATCGTATTCCGGTCGCGGTGCAGCGAGCGGAGGAAGAGCACGGCCCAGTCGATGTGCTGATCAACAATGCCGGATATGGGCACGAGGGTGTGCTTGAGGAATCGCCGATCGAGGAGATGCGCCGCCAGTTCGACGTGAATGTGTTTGGCGCCGTTGCAGTCGCCAAGGCGTTCCTCCCGAGGTTCCGTGAGCGCCGAAGCGGCTTTATCGTCAACGTCACGTCGATGGGAGGCATGATCACCATGCCCGGCATCGCCTATTACTGCGGCAGCAAGTTCGCGTTGCAGGGTATTTCAGAGGTCATGCGGTCGGAAATGGCGCCGTTTGGTGTGCACGTAACAACCCTTTGCCCCGGCTCGTTCCGGACGGACTGGGCAGGCCGTTCTATGGTCCGCACAGAGCGTTCCATTGCTGACTATGATACCCTGTTCGATCCGATCCGCGAGGCGCGTCAGGCAGTGAGCGGCAAGCAGCTCGGAAATCCGAAAAAGCTCGCCGACGCGGTGCTGACCCTCATCGAATCTGAAAATCCCCCGCCGCAACTTCTCCTTGGCAGCGATGCGCTTAGACATGTAACGGCGCGGATCGAACGCCTGACCCAGGAAATCGAAGCTTGGAAGAGCGTGACTGTTTCCACAGACGGCTAG
Protein sequences of DBSCAN-SWA_7 >NC_022536|64390:72532|66278_66659_+|WP_034499614.1|DBSCAN-SWA MKQAITHFKVPVEDRATCLGADGSVSHVSRVLRMITGRWKLPILFRLFAEPSLRASQFMRDIPGISQKMLTQHLRELENDGLISRHDFQEQPPRVEYSLSSAGHGLMPILMAVREFSRDYPVDRRR >NC_022536|64390:72532|68936_69692_+|WP_013637163.1|DBSCAN-SWA MNNISTLPLAGKCALVTGAARGIGAAIALKLAEDGADVAITYEKSVEKAEALVAEIRAMGRKAIAIHADAARTEAARSTVEQTVARLGSLDILVNNAGVLFASDFSTQPLEEIDLQLNVNVRGVFLITQAALKYIPNGGRIISTGSNAGLAVPFAGIAVYAATKSALESFTRGLARELGSREITVNLVRPGPIDTDMNPADGALAAAILPNLSIARYGKASEVAEAVAFLAGPGAAYITGSGILVDGGISA >NC_022536|64390:72532|69958_70648_+|WP_022557128.1|DBSCAN-SWA MKTIGILGGMSATSTQIYYRELCRLTRESLGGLHSPELLIRSVDFDEIEKLQASSDWDAAGRILNEHAIALQRGGADLLILATNTMHKVADKIVGRVSIPFVHIADATATAILDRGFRRPGLMATAFTMEQTFYTDRLIAQGLSPTIPEADDRTETHRIIYEELCRDIVREESRLTYERIAQRLVDKGCDCLILGCTEVGMLLNQDNVGVPVFDTTLIHCKAALQTALQ >NC_022536|64390:72532|67429_68848_+|WP_022557127.1|DBSCAN-SWA MRDTSNPDDKLDPRRWIALIILLTGAFLPPLDFFIVNVALPSIRADFRASASTMQLIISGYATTYAVMLITGGRLGDLYGRRNVFLAGMVGFAAASALCGFAWSPAALVAGRILQGFAAAIMAPQALASINAIFPDQEKSRALSFYALTFGMASMVGLFLGGALIALDVLGLGWRAIFLINLPVIAVAAPSAFIMLRETRSAHPSKLDLGGALLIAIALFALIAPLIEGREHGWPIWLIVMLATCLPFFVLFWRHEKHLESTGKDPILAPSLLQNRGLLRGLLAALFFYALAAFWLIFSVYEQGGLGRTPFEAGLAILPAAVGFVLGPFASERILSVFGRFSAGAGMVLQAAGLFGTAALISTGLPQFLFAALFLIGAGQGIALPNLVKSIVQRVDRTQSGLASGLVNSMFQIGGALAAAIVGGLFFSILGPATDVQSIGQAYSVAAVAIAICLLIAGWLSVSLTSTQPLRR >NC_022536|64390:72532|66815_67349_+|WP_048903004.1|DBSCAN-SWA MHVFWDRGYHEASLPDLLEGMKLSRGSFYKAFVDKRGVYLRALDAYIEDAVRSVGEALHSNPSPKAAILEAFSQQVDQASGQDGLRGCFVVFAAVEMLPKDKEVAPRISRLFRRLQDLYAAAIIRAQALGEIEPELDERTLARFLVCQIQGMRVLAKAGADRAETRAMVELALKALG >NC_022536|64390:72532|71704_72532_+|WP_020808319.1|DBSCAN-SWA MSSKTFFITGANSGFGLAIATAAIEAGHTVIGTIRSEASREALAKTLPELRPVLCDVTDFDRIPVAVQRAEEEHGPVDVLINNAGYGHEGVLEESPIEEMRRQFDVNVFGAVAVAKAFLPRFRERRSGFIVNVTSMGGMITMPGIAYYCGSKFALQGISEVMRSEMAPFGVHVTTLCPGSFRTDWAGRSMVRTERSIADYDTLFDPIREARQAVSGKQLGNPKKLADAVLTLIESENPPPQLLLGSDALRHVTARIERLTQEIEAWKSVTVSTDG >NC_022536|64390:72532|65422_66232_-|WP_020808317.1|DBSCAN-SWA MDLQLQNKVALVTGSSKGIGEGVARGLAKEGATVLVHGRDRMKTEEVAHDIISSGGRAFAVVGDLTVSDQVDRLVRDSKAAAGSIDILVNNAGGSAPAEDWTSTNSETWAAVYDRNVLASVRIVSLVLPAMKASRWGRIINISSLAALMPPASRPDYSAAKAAMIAMTASLAKDVAAQGITANTVSPGTVHSISLDTAFRKKAIERGMAIDAPWNDVEREILPLFAQVPLGRVGTLEEIADAITFLSSPRAGYITGANLRLDGGMWPGM >NC_022536|64390:72532|64390_65377_+|WP_022557125.1|DBSCAN-SWA MSNSLRSAAVPSRIIQVPQSISVEAQAALSRLVDKDGSPINARFEMPSPEDFSGWMMMKAAVDAHYAAAAKDLAGSLQSTVTTIVVEQATIHVATPYGAFHERGALIDLHGGALVFGGGEACLVSARRQAHQHAVRCYGVDYRMPPEHPYPAALDDCLATYRHVLAGHSPDKIIILGRSAGGNLATAMLLRARDEGMPMPGRLVLLSPQVDLTESGDSFQTNQMIDLVLPRPLRPNNLLYAGGADLSNPYLSPLFGDLAGFPPTFLQTGTRDLFLSNTVRMHRALRKAGVETELHVFEAMPHGGFMGGTPEEQELEAEIHRFVMANRN >NC_022536|64390:72532|70699_71611_-|WP_022557129.1|DBSCAN-SWA MNYKMPPSTHDQSAALAGQLAPRLGYNQTGLSCVRILRTEAVLYDIPVLYKPGAVFVLQGSKHGILDGEIYLYDEHHYLAVSVPVPFRMTSTASPRRPLLAVYLEFDMQMAAEIATQLEKLAGPASVKPKGLFSSRMDSTIEDVLLRLLTVLGSPVEVAVLGGSILRELHYRVMSGPQGGAVIAALQRKGTSGKLIESLAWLRKNYGSEITVADLARAVGMSIPSYHVHFKELTGSSPMQYVKAMRLHEARLMIARQTATIAEVAASVGYVSPAQFSRDFRRHFGRTASEEAKWVRHHLGEQV |
9 | Trichoplusia_ni_ascovirus(40.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_8 |
82882 : 84894
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >NC_022536|82882:84894|DBSCAN-SWA TATGCGGACGAAGAAAGGATCATTTGTGAATCGCAAAATTATCGTGCCGCTGGCAATGAAGCCCATTGTGGAGCGGGCAGGCTATGCCCCTGCAGTACGCTTCGAAAACCTGGTTTTCTGCGCGGGGCAGGTTGGACGTGATACCAACATGAATGTTATCAACGACCCAGAGAAGCAATTCGAGGCGTGTTGGGACAACTTAGCTACCGTTTTGTCAGCGGCAGGTTGCAGCTTCGAAGATGTTGTCGAAATGACTACGTATCATGTCGGATTGCAGCAGCACATACATACCTTTCGCAAGGTAAAGGACCGTATTTTCCCCCGCGGAACATGCGCGTGGACTTGCATTGGCGTCTCTGAACTTGCCCATCCCGGTCTTCTTGTGGAAATCAAAGTTGTGGCAGCTATACCTTCGCACCCGTTGGAATAGAGCGGCAGTTGATCCCCGCCGAAGTCGAAAAGTGCTCAAATTCGACCGGACAGCTGGTGTAGAAGACATAACAAAACGACGCAGCTGGTGGACGACCAGCCGCGGTTTGTATCGCGAATGCCGCGGCTGCGACAGGTTTGTATTCTCTCCATAATATTCCTTGGTCGGCTGGAATTAGCGGTTCCATAAGCTCCGGTACGCCGCACCACCGATCCGCATACGCTTAGTAGTCGAAGTCTCAGTCACTTAGAGGTCTGATGAGCAGATCGCGAGTTGGTATCCGTTCAAGAGCGAAGACGCTCGCAAGGCATCTTGCTCGAGCTAACTGTGGTTCATTCGCAACGACGGCTTTGCGCCTTCGAAGCAGTCGTTGTTCGTTATCCACGGATAAAGCTGCTTTGCCCCTTCATTCCGGACGTAAGCATAGCAATTACACGGTCCCTAAAGCAGTCTTTTCGCTTATCAGGCCTTCAACGATGTTAAGGGATGGGAAATTGACGGGCAGGTTTCGGACGACGGCGGCGGGAAAGCTGCATCCCACCGAGCCAGCGAGCTTTGGGCGGCAGTGCACTTTACATCGCCACTCCTTCATTCCTGCCGCTCCTCAGGAGTGAGCATCGCGTGTATCGATGGAGGGTCAAATTAGCCCTCGTTCTTTGATATTAACGGTTTTACAGGACGGCAAATATCTGTGCGGGTTGGCTTCAAAGCTATCCCGCCCCCAAATGACCTTAATTGAAGGATTGCTAAACAGACGAGCGGGCCTCATCATCATGACCTCGGCAGCAATTACACCGGGCGCTGCCGCCGGGGGCTTTAATGCCGCGCCCGCAATCTGAAAGGAGGACACCATGTCTTCCAGCGTCCACGCCGTCGTTTTCGATCCCGGCAGCAATCGAATTCTCAACCAGATGCTTCATAAGGCCGGATATGTAACGGCGAGGATGAAACCGCTGTCAGGAGCGCTTTCAAGTCAATCGCGCACTGTGTTGCTCCGGTTCATGTCCGGAGTTATACGCCAGCGAGACGGTGGTTTTCCGTCGGGACCGGAGTCCATTTCTGCTTTCGGACAACCGTGATTCCATAGGTGAACCTAGTGTCGGGTTACGCTCCCAGCTCCGTTTCCGCAGTCTAGGCTGTGAAACGGCCTCGGAAATTACAACTAAATGCTGAGAATATTGGCGAGAAAAAAACTAAAAGTTGACAATCGCATATGAGCGACCTATATGTTGATTGTCGATATTTGCTTGCTTAGCTTTGGTTATCGCGCGTCTGGCACCGTGCCCTGAACGAGCCTTTTCTCTCAAAAATCGCTATCGCATCTGCAGCGCCGCAGGCGTTGTGGTCCTCTTATTGCTATGAAAGGGTTCATCATGACCACTGGCACAGTTAAATGGTTCAATTCCACGAAGGGCTTTGGCTTCATCCAGCCTGACAATGGCGGCACCGACGCGTTCGTGCATATCTCGGCCGTCGAGCGCGCCGGAATGCGCGAACTCATTGAAGGTCAGAAGATCGGCTACGACCTTGAGCGCGATAACAAGTCGGGCAAGATGTCGGCTTGCAACCTCCAGGCCGCTTAA
Protein sequences of DBSCAN-SWA_8 >NC_022536|82882:84894|82882_83311_+|WP_034499639.1|DBSCAN-SWA MRTKKGSFVNRKIIVPLAMKPIVERAGYAPAVRFENLVFCAGQVGRDTNMNVINDPEKQFEACWDNLATVLSAAGCSFEDVVEMTTYHVGLQQHIHTFRKVKDRIFPRGTCAWTCIGVSELAHPGLLVEIKVVAAIPSHPLE >NC_022536|82882:84894|84684_84894_+|WP_003519970.1|DBSCAN-SWA MTTGTVKWFNSTKGFGFIQPDNGGTDAFVHISAVERAGMRELIEGQKIGYDLERDNKSGKMSACNLQAA |
2 | Pandoravirus(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_9 |
93048 : 93810
Sequences of DBSCAN-SWA_9
Nucleotide sequences of DBSCAN-SWA_9 >NC_022536|93048:93810|DBSCAN-SWA TATGAATATCTCTTTCGAAAACAAGGTAGCCCTGGTCACTGGTGCAGCCTCCGGCATGGGCCTTGCCGCAGCAAAAGCCTTCGCCGAGGCCGGAGCGGCGGTTGCTCTCGCCGACGTCAATGAAGAGGCAGTGCGTGCTGCGGCTGAAGCTCTGACCTCTTCCGGTTACCGGGCGATCGCCATCCAGTGCGACGTCGCCGTCATGGAGCAGGTCGCGGCCATGGTGGATCAGACGGTCGCAGAGTTCGGGCGTCTAGACGCGGCTTTCAACAATGCCGGTGTACAGAGTCCCGTCGCCGAGACTGCTGACGCGGACCCCAAGGACTACGATTTCGTCATGGGGGTCAACCTGAGGGGTGTCTGGAATTGCATGAAGTATGAACTTCTGCAGATGCGCAAGCAGGGTTCCGGCGCGATCGTCAATAACTCCTCCCTCGGTGGTTTGGTCGGGATCGCTGAACGCGGCATCTACCATGCCTCGAAGCACGGCGTTGTCGGACTGACCAAGAGTGCCGGCCTCGAATACGCGCCTAAGGGAATCCGGATCAATGCGATCTGCCCAGGCATTATCGAAACCCCGATGGTGACCGGAATGCTGGAGACACAACCGGACGCTATGTATGCTCTGATGCAGGTTGTCCCCATGAGCAGGCTCGGCAAAGCTGAAGAAATCGCCGACGCGGTCCTGTGGCTGTGCAGCGACGCGTCCAGCTATGTCGTCGGACACGCTCTTCCCGTCGATGGCGGTTATACCGTCCAGTAG
Protein sequences of DBSCAN-SWA_9 >NC_022536|93048:93810|93048_93810_+|WP_022557160.1|DBSCAN-SWA MNISFENKVALVTGAASGMGLAAAKAFAEAGAAVALADVNEEAVRAAAEALTSSGYRAIAIQCDVAVMEQVAAMVDQTVAEFGRLDAAFNNAGVQSPVAETADADPKDYDFVMGVNLRGVWNCMKYELLQMRKQGSGAIVNNSSLGGLVGIAERGIYHASKHGVVGLTKSAGLEYAPKGIRINAICPGIIETPMVTGMLETQPDAMYALMQVVPMSRLGKAEEIADAVLWLCSDASSYVVGHALPVDGGYTVQ |
1 | Trichoplusia_ni_ascovirus(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_10 |
99508 : 103621
Sequences of DBSCAN-SWA_10
Nucleotide sequences of DBSCAN-SWA_10 >NC_022536|99508:103621|DBSCAN-SWA CATGAGCAGCATTTCCTTGCGCGACATCCGAAAAGCCTACGTCAACGGACCTCAAGTTCTTCACGGAGTGTCGCTCGACATCGAGCCGGGAGAGTTCGTGGTCGTTGTCGGTCCGTCCGGTTGCGGAAAGTCCACACTGCTGCGCCTGATTGCAGGTCTGGATAAATGTGAGGACGGTACGATCGAGATTGGCGGAAAGCGCGCCAACGATCTTCCCCCACAGGATCGCGACATCGCAATGATTTTCCAGAACTATGCGCTCTATCCGCACATGACGGTCCGCGACAACATCGCGTTCGGCCTGGAGCTGCGCGGGATGAGCAAAACAGAGCGTAATGAGAGAGCAGAGAGGGTCGCCGCAACCCTTCAGCTGCACGCTTATCTGGATCGTAAGCCCGCCGCCTTGTCGGGAGGTCAACGTCAGCGTGTCGCCATGGGCCGAGCGATGGCGCGTAATGCCGCAATCTTCCTGATGGACGAGCCGCTTTCCAATCTCGACAATTCCCTTCGTATCTCAATGCGCACGGAGATCAAGGAGCTTCATCGGCAATTGGGCGCAACGATCGTTTACGTCACCCATGATCAGACCGAGGCGCTCTCGCTTGCCGATCGTATCGCCGTCATGAAGGACGGGCATCTATTGCAATTCGATCGCCCCGAGGTGATCTACGACCGTCCGTCAAACCGTTTCGTAGCCAGTTTTCTCGGAAGTCCCCCGATGAACTTCCTCGCATCCAACAGCCTTCCAGGCTGGACGGGGGCAGGGGAAGTGACTGTGGGACTGCGGCCTGACTTGCTCACCGTCCACCATGAAAAGCCCAACCAACCAGCTCTTCCTGGACGCCTCCTGCTCTCCGAAATGACAGGATCAGACATGCTCCTCCATTGTGAGACTCCAGCGGGCCGTCTGACCGTTTCCGCCCCCCGCAAGACGGTGGCAAGGGAGACTGAGCAGCTTTGGATCGGCTTCGACCTTGACCGTGCTCTGTTCTTTGATCCTCAAAACGGTGACCGCGTCGATTTGCCTGCCAGTGGCCACATGGAAGGACGAGGGGCGGCGAGTTGATGAACAAACAAGATGCGCCATTGGCACTCAGCGATCTTATCAATCGCTATTCGACGAAGCTGACAGAGGCCGACACGCGCCTTCTTGACGTTCTGTTACAAGATCCGATCCGTGCGGCCATGGAAAACGGAAAGGATGTTTCTTCTCGCGCGGGCGTTCATCCAGCATCGGCCGTGAGGCTGGCCCGTCGTTTGGGGTTCAAAGGCTATCCCGAGTTCAGAAGTTTCCTGCAGTCCAGTTTAACGGAAGGGGAGGGAGACTTCGAAAGCCCTGCAGCGCGCATGGCTGCGCGGCTGGTGCGGGCCGAAGACAACGGTCTGCTCGCATCGGTCCTGGACAGCGAGATAACGGCTCTTCAACAGGTCCGCCACGCCGTATCCGACGCAGATATTCGCGCGTTTTCCGCAATCCTGCGCGACAGTCGCCGGATTTTTGTCTTCGGATGCGGACATTTTTCGGCGCTCTCATCGCTTGTTGCGCTTCGCCTCAATCGCTCCGGCTACGAAGCGATCGATCTGGCGAGCCGGATGCACCAGCTTGCGGAAGTGCTGGCAGCACTGACCGCGGAAGATGTCGTTTGGTTTCTCGCCTTCCGCCGGACACCCCCGCTTTTCCAGGAAGTACGCGAGATCGCAAAACGGCGAGGGGCAAAAACGCTGGCTGTGACGGACGTCGGGGGCACGAGGATCGATCCTGCTCCCTGTCATCAAATACTGGTGTCACGAGGTAACCCAGGTGAATCGCAGTCGCTTGTTGTGCCTATGACGATTGCCAATGCGATTATCCTCGATCTGGCGTCCATCGACAACGGGCGGTCGTTGCAGTCCCTGAGCGAGTTCAGATCGTTCCGCGCATCGTCACGTCTGACTGCCGGGTGAGAGTGCACCACGGCAGAGGAGATCATGCGATGCTGGGGGTTCCCTAGAATTCGTCCCAAGTGTTATTCGCAAGTGCAGCACTGCCGTAACCAGCAGAGACCACTGCGGTGTTCCTCGACCGCCTCGTCAGCGGTCCGGAGCCAGATGCTGCTGCCATGGCAGAGGCCATGCGGTGAAGTCCCGCCGTCTCATGCGACGCAGCCAAGCCACCTCGCCCCAACTGGAACTGACTGATCAACTCGCGCAACCGGACAGCTTCTGCGGCAAGAGTTGCGCCAGCAGCGTTGGCCTCCTCCACCATGGCTGCGTTTTGCTGGGTGACCTGGTCCATCTGATTGACAGCGGTATTGACTTCGGACAGGCCGACCGATTGTTCGCGGGAGGACGTGGCAATGGAGTCCATGTGCTGGTTGATCGTGACAATGTAAGCCTCGATCGTTTTGAGAACCTCGCCGGTTTCGCTGACCAGCCTGACGCCGTTGTCGACCTCATGCGTGGACTTGCTGATCAGGCTTTTGATTTCTTTCGCTGCATTGGCAGAACGCTGGGCAAGTTCCCGAACTTCCTGGGCGACCACTGCAAAGCCTTTTCCCGCTTCACCCGCGCGCGCGGCTTCAACGCCTGCGTTGAGCGCGAGCAGATTGGTCTGAAAGGCGATATCATCAATCACGCCAATGATGTTGGAAATCTGGCCTGAAGATTGTTCTATCCGCTCCATTGCTTCAACAGCCTTGGCCACCACGGTCCCCGACTGGCGGGCACTGTCGTTTGCCTGAACTGCGGTATTTCGTGCTTCTTCTGCGCGCTTATAGGAATTGGTGACATTCACCGTAATCTGGTCAAGGGCGGCCGCAGTCTCTTCGAGAGAAGCCGCTTGTTGTTCGGTCCGCCTCGACAAGTCTTCGGCACTCTGGCTGATCTCACGCGAGCCGCTGTCAATAGACGCTGCGGCTCGCGAAACCGATCCGAGTGTCCCTTGCAACTGTTCGACAGCTGCATTGAAGTCTGTTCGCAAGCTTTCGAAATCCGGTGCGAAGCTGTGCTCAAGAGTGACCGTCAAATCGCCTGAGGCGAGACGCTTAAGCCCTGCAGCAAGCCCATTTGTGGCCTCTGCCATCGCCTCTGCACGGACACGTTCCTGCTCGGCAATTTGGCGCCGTTGCTCCTCGGTCATTTGCCGCTGTTCATCCGCCTGATTTTCGAGTTCCCTTGTCTTCAAGGCGTTGTCCTTGAAGACCCGCACGGCGCGTGCCATCGCACCGGTCTCGTCAGAGCGATCCTCACCGTCTATATCAGTTGAAAGATCACCGGATGCGAGCCTGGTCATCGTGCCCGTCAGGCGGCCGATTGCATTGCCAAGAGATTTAGCGAACAGAAAAAAGGCAGTAAGAGCTAATAGTGAGGTCAGGAGAGTGACAGCGATCATGGTCCAAAGCGATGAACGGGCCTCCGCCAAAAGCGTAGTCGCATCAGTACCGATCTCGAAGACGCCTATTTTATCGCCAGCGAAGGACGTGAACGGAACCGCTTTGATCAGCATATCGCGGTTATCAAGCACAGTTTGTTCAAACACAGACGCGCCATCGAATGCTGACCGAAGTATATCGTCCGACAGGAAGGGCTTGCCTCCATCTGTCGAGGACTGATTAACAAGTTTTCCGTCCTGGACGACATTCACGGAGATCTCAGCGCCAATGCGCTCTGCGATAGGTGTAAAATAGGCGTTGGATAGTTCCGTGCCGATGTCTACCACCCCAACGACCTTTCCGTCGACCAAAACAGGCGCGACGGCATAAACGCCAATTCCGGTTCGGCCCGGTTCTATTCCTGCCGCCACTTTTCCCGTTGTGACCGCCGCGCCGACTGTCTTGCGGCGAGCCAGATAGTTGTCGCCGAATTTCTCAGGCGCGTGCACGCGGGCGATTACATTTCCGTTTGCATCCGCTACGGTAAAATTCTGCAACCCGCCCTGTTGTGCCACGGCCTTGATGTTAGACGAGAATTTATCGAGAAGTTTCTGGCGATCATTATTCTGGATGTAGCCGGCAATATCCGGTTCTCCGGCGAGCGTCAGCGCTAGAGCCGATGCGGAGCGCTGTATGGCTGCCATATCTGTCTGAATAACATGGAGGTCGCTTTCTGCCTTACTGTGAAGAGCTTGCGAATTCAT
Protein sequences of DBSCAN-SWA_10 >NC_022536|99508:103621|100572_101451_+|WP_048903007.1|DBSCAN-SWA MNKQDAPLALSDLINRYSTKLTEADTRLLDVLLQDPIRAAMENGKDVSSRAGVHPASAVRLARRLGFKGYPEFRSFLQSSLTEGEGDFESPAARMAARLVRAEDNGLLASVLDSEITALQQVRHAVSDADIRAFSAILRDSRRIFVFGCGHFSALSSLVALRLNRSGYEAIDLASRMHQLAEVLAALTAEDVVWFLAFRRTPPLFQEVREIAKRRGAKTLAVTDVGGTRIDPAPCHQILVSRGNPGESQSLVVPMTIANAIILDLASIDNGRSLQSLSEFRSFRASSRLTAG >NC_022536|99508:103621|99508_100573_+|WP_022557167.1|DBSCAN-SWA MSSISLRDIRKAYVNGPQVLHGVSLDIEPGEFVVVVGPSGCGKSTLLRLIAGLDKCEDGTIEIGGKRANDLPPQDRDIAMIFQNYALYPHMTVRDNIAFGLELRGMSKTERNERAERVAATLQLHAYLDRKPAALSGGQRQRVAMGRAMARNAAIFLMDEPLSNLDNSLRISMRTEIKELHRQLGATIVYVTHDQTEALSLADRIAVMKDGHLLQFDRPEVIYDRPSNRFVASFLGSPPMNFLASNSLPGWTGAGEVTVGLRPDLLTVHHEKPNQPALPGRLLLSEMTGSDMLLHCETPAGRLTVSAPRKTVARETEQLWIGFDLDRALFFDPQNGDRVDLPASGHMEGRGAAS >NC_022536|99508:103621|101494_103621_-|WP_084317464.1|DBSCAN-SWA MNSQALHSKAESDLHVIQTDMAAIQRSASALALTLAGEPDIAGYIQNNDRQKLLDKFSSNIKAVAQQGGLQNFTVADANGNVIARVHAPEKFGDNYLARRKTVGAAVTTGKVAAGIEPGRTGIGVYAVAPVLVDGKVVGVVDIGTELSNAYFTPIAERIGAEISVNVVQDGKLVNQSSTDGGKPFLSDDILRSAFDGASVFEQTVLDNRDMLIKAVPFTSFAGDKIGVFEIGTDATTLLAEARSSLWTMIAVTLLTSLLALTAFFLFAKSLGNAIGRLTGTMTRLASGDLSTDIDGEDRSDETGAMARAVRVFKDNALKTRELENQADEQRQMTEEQRRQIAEQERVRAEAMAEATNGLAAGLKRLASGDLTVTLEHSFAPDFESLRTDFNAAVEQLQGTLGSVSRAAASIDSGSREISQSAEDLSRRTEQQAASLEETAAALDQITVNVTNSYKRAEEARNTAVQANDSARQSGTVVAKAVEAMERIEQSSGQISNIIGVIDDIAFQTNLLALNAGVEAARAGEAGKGFAVVAQEVRELAQRSANAAKEIKSLISKSTHEVDNGVRLVSETGEVLKTIEAYIVTINQHMDSIATSSREQSVGLSEVNTAVNQMDQVTQQNAAMVEEANAAGATLAAEAVRLRELISQFQLGRGGLAASHETAGLHRMASAMAAASGSGPLTRRSRNTAVVSAGYGSAALANNTWDEF |
3 | Bacillus_virus(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_11 |
109468 : 116886
Sequences of DBSCAN-SWA_11
Nucleotide sequences of DBSCAN-SWA_11 >NC_022536|109468:116886|DBSCAN-SWA TATGGCACAAGCCACCCAGAAAATCACCCTCAGCGCAGCCCGGGATATTCCCTTCAACAAGCTGGTGCTCAGTCAGCAAAATGTCCGCAAGATCAAGGCGGGCATCTCGATCGAGGACCTTGCCGAAGACATCGCCCATCGCGGGCTGCTCACCAGCCTTAACGTCCGCCCCGAACTCGACGGCGATGGCAACGAAACCGGCATCTACCGGATCCCGGCTGGCGGGCGCCGGTACCGCGCCCTCGAGCGTCTTGTGGCGCAGAAGCGTCTGGCCAAAACTGCCGGTGTCCCTTGCATCGTCAGCAAAGGCGAGACCCTTGAAGTCGAGGACTCCCTCGCCGAAAACGTCCAGCGCGTCAGCCTTCATCCGCTCGATCAATTCCGCGCCTTCCAGACCTTGCGTGAGCAGGGGCTCGGTGAGGAAGAAATTGCGGCGCGCTTCTTCGTTTCGGTTGCCACCGTCAAGCAGCGCCTGCGGCTGGCCTCCGTCTCGTCGCGGCTTCTCGATCTTTATGCCGAGGACGAGATGACTCTCGAACAGATCATGGCCTTCTCGATCACCAACGACCATGTTCGCCAGGAGCAGGTCTGGGATACGGTCTCCCGCTCCCATAGCCGTGAACCTTATTATATCCGCCGACTACTGACGGAGACCGCGATCCGCGCCAGCGACCGCCGCGCAGTCTATGTCGGCATCGAATACTACGAGGCCGCGGGCGGCGTCACCATGCGCGATCTGTTCGACCAGGATCAGGGCGGCTGGCTGCAGGATCCGGCGCTGCTCGAGCAGCTCGTGATGGAAAAGCTGAAGGCCGACGCCGAGGCGATCCGCGTGAGCGAAGGCTGGAAATGGGTTGAAGCCGCCTTCGACTTTCCCTACGGCCACACGTCCGGCCTTCGCCGCTTCTACGGAGAGCAGGCCGAGATGACCGAGGCGGACTTGGCCCACTATGACGCCACCCGTGCCGAATACGATAAGCTCGACGCAGAATATTCGGAAGCAGATGAGTACTCGGAAGCGACCGAACAGAAGCTGGAGGAGCTCGGCGCGGAGCTCGACCGGCTCAACGACCGCCCGTATGTGTTCGATCCGGAGGAGGTCGCCCGCGGCGGTCTCTTCGTCTCGCTCGGCGTCGGTGGCGAGCTCAACATCGAGCGAGGCTTCGTGCGGCCCGAGGACGAACCGAAGGATGCAGCCGATCCCTCGGCTGATGTCGGCGATAGTGGCGATTATGCCAGCAGTGTTCCGGCAACCGGGGCCGCCGGTGGGGAGGAGACGCAACCGGACGACGAAGACCAAACGGTTAAACCGCTTCCTGATCGCCTGGTTCTCGATCTGACAGCGGCGCGCACGGTGGCCTTGCGCAATGCGCTGGCGAACGATCCTGTCATCGCCTTCATTGCGGTCCTGCATGCCTTCGTCCTGAAGACCTTCTATGTCTACGGCTTGGATTCATGCCTGGAGGTGACGCTGCAGAGCGCCCGCTTCAGCCAGACGCCCGGCCTCGGCGATACAGTCTGGGCGAAAGAGATCGAGCAGAGGCACGAAGGCTGGGGCCAGGATCTTCCCAAGGATCCGAATGATCTCTGGAATTTTCTGATCAGGCTCGACGAGGTCAGCCGGCAGGCATTGTTTGCCCATTGCGCTTCATTGTCGGTCAATGCGGTCATCGAACCCTGGAACAAGCGGACCCGGGCGATCGCCCATGCCGAGCAGCTGGCCAACTCGATCGGCTTCGATTTGGTGGAAGCCGGCTGGACACCGACCGCCGACAACTATCTCGGCCGGGTCACGAAGGCGCGGATCCTCCAGGCGGTCCGAGAGGCCAAGGGCGACCAGGCGGCGGAACTCATCGGCCATCTCAAGGAGACCGACATGGCCCGCGAGGCCGAGCGGCTGATGACAGGCTGCGGCTGGCTGCCGGAACCGCTTCGCATGACGGTCGTGGATGGCATCAGCGAAAACGACGCTGTATCTGATGATGCCTCCATCAGCTCCGACGAGGCCGAGGCTTCGGCCGAACCCTCTGACCTGCCGGCTTTCCTGCTCGACCAGCCGGAAACATCGGACGATGACACCGGCGATACGGAAAGTGGGATCGACGGCGAGCATCTCGCAGCCGCCGAATAGCGTCAAAAGCATCCCATCAGCCTGATCCGATCGCCGTTGCGGTCGGGTCTTCCTTACAAACAGCCCGGCCATCGTGCCGGGCTTTCCCGTTTCAAGAGGCACCCATGCCTGACTTCACCATCAAGACCACCTACCACCTGCCGGTCTTCCGGCATCGCACCTACGCGGCCGACACGCTGGAGGCGGCGTGTCGTGCTGCCATCGATGACGACAGCTGGGATGTCGCCGAGAAAGACTTCGATTCTTCCGGTCCAGTTCATATCACCGGCATCTGGGACGGCACACATGCGGCCTATAAAGGTCTACCGCTCCGGATTCCACAGCAGTTCGACGAACCTGTCCAGCGCCGGGCCCGTCATTTCGAGATCTTGCTCGGACTGTTGAAGATGTTGTTGGATGACATCAGTTCGGCCCGGCCGCCATTGCCCGATTGGCTCGCCCGATCGGCCTGGGCGATTGCCCGAGGAGAAGCCATTCTCGCCGGAGAACCCGATCCCGAAGAGCCGGTCGACCTACCAAAGCCTTCCCACGTTCTGGTCAGGCTGCAAGAGGCTGGTGTACGCGACGCAATCGCCGCCGTGCTCGAAATAGATACGAGTTTCAAAGCACTATCGCCCGAGGCGGTGACCGACGACGAGGTCCACGCCGCATGCCGTTCCATCGCCGCCACGATGGATTTTTCCGACGCGGTCGGAAGCGCCGAATTCCAGGCAGCGCTCTCGGCGATCCGTTCGGCGCATCGCCGGCTTACGTCCGATTAATTCATCTTCTTCCCCAGAATCCCTCAACCTCGCCCGGCCACCGCGCCGGGTTTCATCTTATGGAGACAATCCAAATGAATTCCCTCGCATCCACTAACCAGCCAGCTTCCTCCGGTTTCAAAGTCGATATCTCCCGCGGCGAACGGATTGGCCGCGTTTCGTCCGAATGGTTTTCGCGCCCTGATGACGAACGCTTCCTGTCGCTGAACGATCTCTACGACACGGTCCGGTCCCGCGCTGACCGGGCCCATGCCCGAACGATCGAAAGCGCCGCGATCCGCGTCGAGGCTACGCGCGACAATGCGGAGCGGCTTGAACTTCTCGTTCCAGGTCAGCGCCAGCCGATCGCACCGACCCACTGGAGCTACGGTCAGCTCTGCAGCCTGGTCGGTGCACCGTCGAGCTACATGCGGCAACTCCCCGCGCCTCTTGCCGCGATCAACCTGCAGCACGGCCTGCTCAATCACCGTGCCGAGCTGGTAAAAACGCTCGAGATGGACGATGGCCGGCTCGAACTGCGCGCGGTGACGGGACCCGAATACGGCCGCATCTGGGATCACGAACTGGTCTCGGCGGTGATGAAGATCGCCGGCAATGGGACCGGCGACACGATGTGGAAAGTGCCGGGCGTCCTGGACTGGGCGACGATGAGCCACAATCCCTTCGTCGACATCACCAAAGATACGACGACGCTTTATGCCAGCGACCGCGATGTCTTCCTGTTCCTCGTCGACGACACCCATCCGATCGAAGCCGGGCGATCGCCCAACGGTGAGCCCGACCTCTATTTTCGCGGCTTCTATGCCTGGAACTCGGAGGTAGGCTCGAAGACACTCGGCATCGCCTCCTTTTATCTGCGGGCGGTCTGCGCCAACAGGAACCTCTGGGGCACTGAGAATTTTGAGGAGATCACTATCCGCCATTCGAAGTTCGCCGCCCAGCGTTTTGCGCATGAAGCAGCACCCGCGCTGACCCGCTTTGCCAATTCGTCGCCCGCTCCGTTCATCGCCGGCATCAAGGCGGCACGCGAAAGGATCGTCGCGCGCAAGGATGACGACCGTGAGAGCTTCCTGCGTCGGCGCGGCATCTCGAAAGGCGAGACCGGCAAGGTGATTGAGATGGTCTTGTCGGAAGAGGGGAGGCCGCCGGAATCGATCTTCGATTTCGTACAGGGGATCACCGCACTGGCGCGCACCAAGACCAATCAGGACACGCGTCTCGAGCTCGAAGGAAAGGCCAAGAAGCTGCTGGAGAGCGCCTCCTGACACGTTCAAACGTACCGGGCTCAGAACCCGGCCGCGTTGGCAAATCCCGTCATCGTCATCACAGAGCCCCGTCAAGCGCCGGGCCTCATTTTGTCTGGTGCACCCAATGGCTATCCCCGATCATGCCCGCACAAACTTCGACACGCTGCTGCGCGCCGCGTCCGATGGTAATCTTGCTCTCATGGAATGCCTCGATGCCACTACACGCGAGCCGCGTTACGTGCTCTGCGGCGTCGGCCGCAGTAACGGCGAATTTTTCTTCACGCCGTTTGGTCATCTGGCCGACGGCAATCCTTACGACGCCTACCTGCCGCCGGATCGAGAGAATCCCGCTGGCTTCATCGTAAACCCGCCGTGCTAGGCGCGATCGCCGGCGGTTCAACCTGCCGATCCCCGACCCTTCCACGGCCGCCCAGATGTGGCCTCCGACCTAGTCGGCAGGACACCGGACCTTGCAGTTCTTTCCTCACTCCTGGAGCGTCCCCCTTATGAATATCATTTCGCATCCTCAAAACGTTTCCACCTCTCCGCGTCCTATGGCCAGAACCAGCGCCGGGGCACTTATTCATGGTGCCGAACAGATCAGGTCGCTGGAGCAAGGTAAAGGCATCGCCACCGCCGACCTGAGACAGGTCATGACGGAGGTTTTCGGCGGCAGCGACGCCGAAGGCCGCTGGCTCCGGAAGGATGCCTATAAACCAACAGAGGTCGCCCAGGTCTCGTTTCTATCCCGCTCAATCACCTCCCGGGTCCAGTCGCCTCAATCGGCGCGTGCGATGCTGATGAAGGCGGCCCGGCTCTCGCCGACCCACACGCGCCGATCCGAGGAGATGATCAGTTGGAAGCTCCGTTCCTTTGTGCCTGTCGCAGGTGAGGGCACGGCGATCCTGTCGAAGCTGAATGATCGTCATTACCTTGTCGATGTCGCACACCGGTCATGATGGAGTTCTGCCATGCACTCCGCGTCGGATCTGGCGGGCCGTCTCGCGCGGGACGCGGAGGCGGTCTGCCGGCACTATCTCTCCGCCGGCCGTCGTGCCGGCAACTACTGGCTCGTTGGCGATGTCAGCAACAGAAAGGGCCGGTCGCTCTATGTGCACCTCGTCGGCCCGCGCGCCGGCCGCTGGACCGACGCGGCGACAGGCCAGTTCGGTGATCTGCTCGATCTGATCCGGGAGACTTGCGGTCTCGTCGACTTCCGGGACGTTGCAGACGAGGCGCGTCATTTCCTCAGCCTGCCTCGACCAGAGCCGGTGTCCTCTCGCGGGACCGATGCTGATGATTTCGCCCCGGTGGAACGGCGCACGGGCGTGCAGGCACAGCGGCTGTTCCGGATGACGCAGCCCCTGGCGGGCACGCTTGCCGACACCTATCTGCGCGGGCGCGGCATCTTACGGGCATCGACGCATGCGGCGCTGCGCTTCCATCCGTCCTGCTACTATCGTGATCTCGTGAGTGGTCGCACGACCAGCTATCCGGCCCTGATTGCCGCCGTGACCGACTCTGCCGGCGCGATCACCGGTGTGCATCGCAGCTGGCTCGATCCTGACGGCGCCGGCAAGGCGAAGGTCGACGATCCGCGGCGCGCAGTTGGCGGGCTCCTCGGGAATGGCGTCCGTTTCTGCTTTCCGGTCAATGCACCTGTCCCGGTCATGGCTGCAGGCGAGGGCATCGAAACAATGCTGTCGCTCGCACACGTGATGCCCGGCATGCCGATAGTGGCGGCGCTCACGGCCAACCACCTTGCCGCCTTCCGTTTCCCGCCCGGATGCCGGCGCATCTATATCGCCGCAGACGCGGATGCCGCCGGCCGGCATGGGATCGAGGGCATGAGCCGCCGGGCGCAGGAGTGCGGGATCCTGCCACTGGTGCTGTCGCCGGAGCTCGGCGACTTCAATGAAGATCTTCGCTGGCTGGGCCCGGACCGGTTGACGGCAAACCTCAGGGCACAACTCGTCCCGGAAGACTCGATGGCTTTTCTTCCAGCCTGACGTCGGGGACGAGGTTGGGGGAGGGCCCGGCCGCGGCACTGGCGAGGCGCAATCCATTGGCCTTATTCGGGATGGGCGCGCGCCCGCGGGCCTGCCGAGAGGCGACCCTTACCACGCGGGCCTCCGGGCCTTCAGCGGGTCGGTCGGGTTTCTTCCCCTGCCAGACCGCAGGCGCGGTCCGTCATTCCTCGCGGATCAAGAAACCCTCCCTCCGCCGCCCGGCGCTTCGCTGGGCCGCGGCACTTCGCTTGCGGTTCCGGCCTCGGCCCGCGTGATCGGGTTCGCCGTCAGGCCGCGAGAGGCGCGGCCCCAAACCGATGGAGACCACCATGGACCTGATCCTTCACCCCGAAGACACCTTCGAGCCCCACCATACATCCTCCCCGACCGACCGCTTCATCTACGAGATGCAGGTCTTCGGCTATCGCCCCTTCCAGGACGAACCCGATCCGCGGCCATTGCCGGAGGAGCCCCAAGTCCGCTCATCGATCACCACGCTCTTCGACGCCCTCGCAGAAATGCTCGGCGACACCCGCCTGGAGCCCGATCTCGAAGACCTTTTCTGGTCGATCCCCAATCTCTTCCATCGCGCCGGCGAACGGATCCAGCGCGAACTCGATCGTAACGAAGAGGCGCAACGCACCGGCCAGCGCGAGCAGGACGGCTCGGAGGTGAAGAGCGTCGAACTCGAACGCCTCATCGCCGAAGGCATCACGCTCCTGGAACGCCGCAACAGCTTCGAGTTCATGCGCGACTTCAGTGCCGACCTGTACGAGGCGCAGACCGGATCGTCCTGGCGGCCGCGCAGCGGCTCCAAGGTCAACCACGCCAACATGACGGCGGCGATGATCGACAGCCGGGACTTCCTCTCCGCCCGACGCCGCGCCGAAACCGAGGTGCTGATCCCTGCCGGAACCAAGATCGCATTCGCCGGCGGTATGGACTACAACGATCATGAGCGCATCTGGGCCAAGCTCGATCAGGCGCATGCTAAGCATCCCGACATGGTGCTCTTGCACGGCGGCTCGCCGAAAGGCGCTGAACGCATCGCCGCCTGCTGGGCGGAGGCGCGCCAAGTGACGCAGATTACCTTCAAGCCCAACTGGACCAAACATGCCAAGGCTGCGCCCTTCCGGCGCAATGACGAGATGCTTTCAGTCATGCCCGCCGGATTGATCGTCTTTCCCGGAAACGGCATCACCGGCAATCTCGCCGACAAGGCACGCCAGCTCGGCATCCCGGTCTGGCAGGGTTCAGGAGACGGCGCCTGA
Protein sequences of DBSCAN-SWA_11 >NC_022536|109468:116886|112433_113624_+|WP_022557180.1|DBSCAN-SWA MNSLASTNQPASSGFKVDISRGERIGRVSSEWFSRPDDERFLSLNDLYDTVRSRADRAHARTIESAAIRVEATRDNAERLELLVPGQRQPIAPTHWSYGQLCSLVGAPSSYMRQLPAPLAAINLQHGLLNHRAELVKTLEMDDGRLELRAVTGPEYGRIWDHELVSAVMKIAGNGTGDTMWKVPGVLDWATMSHNPFVDITKDTTTLYASDRDVFLFLVDDTHPIEAGRSPNGEPDLYFRGFYAWNSEVGSKTLGIASFYLRAVCANRNLWGTENFEEITIRHSKFAAQRFAHEAAPALTRFANSSPAPFIAGIKAARERIVARKDDDRESFLRRRGISKGETGKVIEMVLSEEGRPPESIFDFVQGITALARTKTNQDTRLELEGKAKKLLESAS >NC_022536|109468:116886|111702_112359_+|WP_022557179.1|DBSCAN-SWA MPDFTIKTTYHLPVFRHRTYAADTLEAACRAAIDDDSWDVAEKDFDSSGPVHITGIWDGTHAAYKGLPLRIPQQFDEPVQRRARHFEILLGLLKMLLDDISSARPPLPDWLARSAWAIARGEAILAGEPDPEEPVDLPKPSHVLVRLQEAGVRDAIAAVLEIDTSFKALSPEAVTDDEVHAACRSIAATMDFSDAVGSAEFQAALSAIRSAHRRLTSD >NC_022536|109468:116886|115944_116886_+|WP_022557186.1|DBSCAN-SWA MDLILHPEDTFEPHHTSSPTDRFIYEMQVFGYRPFQDEPDPRPLPEEPQVRSSITTLFDALAEMLGDTRLEPDLEDLFWSIPNLFHRAGERIQRELDRNEEAQRTGQREQDGSEVKSVELERLIAEGITLLERRNSFEFMRDFSADLYEAQTGSSWRPRSGSKVNHANMTAAMIDSRDFLSARRRAETEVLIPAGTKIAFAGGMDYNDHERIWAKLDQAHAKHPDMVLLHGGSPKGAERIAACWAEARQVTQITFKPNWTKHAKAAPFRRNDEMLSVMPAGLIVFPGNGITGNLADKARQLGIPVWQGSGDGA >NC_022536|109468:116886|114112_114565_+|WP_022557183.1|DBSCAN-SWA MNIISHPQNVSTSPRPMARTSAGALIHGAEQIRSLEQGKGIATADLRQVMTEVFGGSDAEGRWLRKDAYKPTEVAQVSFLSRSITSRVQSPQSARAMLMKAARLSPTHTRRSEEMISWKLRSFVPVAGEGTAILSKLNDRHYLVDVAHRS >NC_022536|109468:116886|114577_115615_+|WP_022557185.1|DBSCAN-SWA MHSASDLAGRLARDAEAVCRHYLSAGRRAGNYWLVGDVSNRKGRSLYVHLVGPRAGRWTDAATGQFGDLLDLIRETCGLVDFRDVADEARHFLSLPRPEPVSSRGTDADDFAPVERRTGVQAQRLFRMTQPLAGTLADTYLRGRGILRASTHAALRFHPSCYYRDLVSGRTTSYPALIAAVTDSAGAITGVHRSWLDPDGAGKAKVDDPRRAVGGLLGNGVRFCFPVNAPVPVMAAGEGIETMLSLAHVMPGMPIVAALTANHLAAFRFPPGCRRIYIAADADAAGRHGIEGMSRRAQECGILPLVLSPELGDFNEDLRWLGPDRLTANLRAQLVPEDSMAFLPA >NC_022536|109468:116886|109468_111598_+|WP_022557178.1|DBSCAN-SWA MAQATQKITLSAARDIPFNKLVLSQQNVRKIKAGISIEDLAEDIAHRGLLTSLNVRPELDGDGNETGIYRIPAGGRRYRALERLVAQKRLAKTAGVPCIVSKGETLEVEDSLAENVQRVSLHPLDQFRAFQTLREQGLGEEEIAARFFVSVATVKQRLRLASVSSRLLDLYAEDEMTLEQIMAFSITNDHVRQEQVWDTVSRSHSREPYYIRRLLTETAIRASDRRAVYVGIEYYEAAGGVTMRDLFDQDQGGWLQDPALLEQLVMEKLKADAEAIRVSEGWKWVEAAFDFPYGHTSGLRRFYGEQAEMTEADLAHYDATRAEYDKLDAEYSEADEYSEATEQKLEELGAELDRLNDRPYVFDPEEVARGGLFVSLGVGGELNIERGFVRPEDEPKDAADPSADVGDSGDYASSVPATGAAGGEETQPDDEDQTVKPLPDRLVLDLTAARTVALRNALANDPVIAFIAVLHAFVLKTFYVYGLDSCLEVTLQSARFSQTPGLGDTVWAKEIEQRHEGWGQDLPKDPNDLWNFLIRLDEVSRQALFAHCASLSVNAVIEPWNKRTRAIAHAEQLANSIGFDLVEAGWTPTADNYLGRVTKARILQAVREAKGDQAAELIGHLKETDMAREAERLMTGCGWLPEPLRMTVVDGISENDAVSDDASISSDEAEASAEPSDLPAFLLDQPETSDDDTGDTESGIDGEHLAAAE >NC_022536|109468:116886|113730_113985_+|WP_034499700.1|DBSCAN-SWA MAIPDHARTNFDTLLRAASDGNLALMECLDATTREPRYVLCGVGRSNGEFFFTPFGHLADGNPYDAYLPPDRENPAGFIVNPPC |
7 | Emiliania_huxleyi_virus(25.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_12 |
120621 : 120900
Sequences of DBSCAN-SWA_12
Nucleotide sequences of DBSCAN-SWA_12 >NC_022536|120621:120900|DBSCAN-SWA ATTACTTGTTCAGCGCATCCTTGAGAGCCTTGGCCGGCGTGAAAGCCAGTTTTTTTGCGGCGGCAACTTTGATCGTCGCACCAGTCGCCGGATTGCGCGCCTCGCGCTCCGGCGTGGCTTTCACCTTGAACTTGCCGAAGCCGGGCAGCGACGTCTCTTTGCCGGCAACCGCGGCCTCGGTGATTGAGGCGATCACCGCTTCGACGATGGCTTTTCCCTGTACCTTGGTGAGACCATTCTCGCTTGCGATCTTGTCGGCAATCTCATTGGTTGTCGTCAT
Protein sequences of DBSCAN-SWA_12 >NC_022536|120621:120900|120621_120900_-|WP_022557192.1|DBSCAN-SWA MTTTNEIADKIASENGLTKVQGKAIVEAVIASITEAAVAGKETSLPGFGKFKVKATPEREARNPATGATIKVAAAKKLAFTPAKALKDALNK |
1 | Burkholderia_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_13 |
124187 : 125273
Sequences of DBSCAN-SWA_13
Nucleotide sequences of DBSCAN-SWA_13 >NC_022536|124187:125273|DBSCAN-SWA TATGACGTCACTGCTTCTGGCGGCCAGTTTTGCCTCGACTTCCGCCGAGAATGATCCACGCCGGAGAAGCGAAAGCCGCGAGCTTAGCGATCGGCCCATTGGTTTTGCCCTCAACCATTCGGAAGAGCGCGCCGCGCCGGCAACACCTGTGGCGGTATCGCTTCTTGGTGTGACGCCCGAACATCTATTGCCGCCAGCCATTGATCTCGCCAATTTCAACCAACGTTATTGGGGGCGGGCCGAAATCAACGCCGCCACCGAAAATCTTGCCTTCGTCGTAAAAGCAACGACCCGATCCCGGCTTGCCGGTCCTGAACTATCGGAACAATTGGCGCGCGACACGAGAACATTGCGGGACCACTCCGTGTTCTCACCGATCAGCGTCTTCTCGTTTGAGCCGGGTCGAGGGGAACGGGCCGGCGATCCAGCTGTCGACGCCGGCAACGCCGCGGATCAAACATCGACGCAAGCTCGTCCCGACCTGCTTTCCGGCGTGCCTGAAGAATACGCAACACTCGCGGTGCAGATTGCGGCGGAGGAGAAGGTCGATCCGAACTGGGTGCTGTCGATCATGCGCGCTGAAAACGCGCGTTTCGACCCGGACCTGGTCAGTTCGGCTGGTGCCGTCGGCCTGATGCAGGTGATGCCAAAGATCGGTGAGGCTTTCGGCGCCGATGACCTTACTGACCCCGAACAGAATATTCGTGCAGGAACGCGTTTCCTGCGTGTGCTCATCGACAAGTATCGCAACCCGGTGCTGATCGCCTCCGCGTATAATGCCGGAGAACCACGTGTCGATGCGCATCAATCCTTGCCGCTCATCCGCGAAACCGCGGACTATGTGACGCGCGTCGTCGGCTACTACACCGGCAAGCCGGCAACCAGCCTGCCTTACACACCCAGGCCGACGGCTAGCCCTGAATCCAGTCAGCACCGCCGCGCAGGGCTGGTCGACCGGGCAAGGTCCCCGATGCTTGTCTTCTCGGCCACAAAGCCGCTTGTCTCGGACGACAGCCCGCAACAGCCAATGAAAGCGATCCAACGCGGCGGCCCGGTCAAAATCGTCAAGGAAGAGGTAGTTCAATGA
Protein sequences of DBSCAN-SWA_13 >NC_022536|124187:125273|124187_125273_+|WP_052349969.1|DBSCAN-SWA MTSLLLAASFASTSAENDPRRRSESRELSDRPIGFALNHSEERAAPATPVAVSLLGVTPEHLLPPAIDLANFNQRYWGRAEINAATENLAFVVKATTRSRLAGPELSEQLARDTRTLRDHSVFSPISVFSFEPGRGERAGDPAVDAGNAADQTSTQARPDLLSGVPEEYATLAVQIAAEEKVDPNWVLSIMRAENARFDPDLVSSAGAVGLMQVMPKIGEAFGADDLTDPEQNIRAGTRFLRVLIDKYRNPVLIASAYNAGEPRVDAHQSLPLIRETADYVTRVVGYYTGKPATSLPYTPRPTASPESSQHRRAGLVDRARSPMLVFSATKPLVSDDSPQQPMKAIQRGGPVKIVKEEVVQ |
1 | Escherichia_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_14 |
128452 : 129625
Sequences of DBSCAN-SWA_14
Nucleotide sequences of DBSCAN-SWA_14 >NC_022536|128452:129625|DBSCAN-SWA GATGACCAACGCAAGGCTTCCTGAACTGGCAACCGTCAGTGCCGTGGTACTGGCCAGCGTGTTCGGACCAGTCGCGGTGCTAGCGCAGGTACCGGTGAAAGACGGCGCCCGCCGCGATATCCAAAAGGAGACCGAGAACTGGTACAAGACCTATCAGTCAGATACGCGAAGCGTGAAGGGTGCGTCGGGCGGAACGACAGACAGCGTTGCGCCCGGGCAGGGCTCTGGTGCGCCAACCGATTGCTCGGCCGAGGGTATCGGCGGTGAAGCCGCGCCGGCCGGCCCGTCGAAGCGCACCTATAGCCAGCAGGAAGTTGCCCGATTGGTCGCCGACGAGGCGATCCGCAAGGGCGTCGATCCGAACTTAGCGCTGGCGATTGCCGAACAGGAGTCACGTTTTCGCCAATCGGCGCGCTCACCGGTCGGTGCGACCGGCGTCATGCAACTGATGCCGGGGACCGCGGCCGGCCTTGGTGTCAATCCTTACGACCTCCGCGACAATATTCGTGGCGGTGTGAAATACATCAAGCAGCTGCAGGGCATGTTCGGCAATCGCTACGACCTGATCGCCGCCGGCTACAATGCCGGACCCTATCGCCAGTCGCTTCAGAACGGCCGAATTCCCGACATTCCCGAGACCCAGGATTACGTCAAAAAGGTTTCATCCTACTATAGCCGGAACAACGCAGAGAACGGCGACAGACGTCCCCCAGGCGACGGCGTCGAGACGGAAACCGTCAGTGACGTCAGTGGATGCGGCGAGCAGCTGAAGAAGGCGATCGATCGCAATACTGAGGCGCAGGCCGAAAGGGGTTCCGTCTGGAACGAGCTTCTGGGGAAATCGCTCACGGCAAACCAGCAATATCTACAGCGCCTGCAGGCGGGGCTTCGGTCTTCTTCTGCGGCCTTACGCGGCTCGGGAGGCGGCAACGCCGGGGAACACGATGGCTCGCTTGTGATGGCCGAGGTCCGGTGCCCTTCGACCACCATCGATACCGGCAACGTTAGGTGCTTTGCCGTGCCATCCACCGTTACCACCGACCAGATCAGGCGCTGGCTTGAAGACCTCCAGGAGGAGGCTCGCGCCAGTGGCGATGTCGCCACCTTCTCGGTGGTGGAGGATCCGGCGCTTGGGCTGGTGACGGTCGTCGATGCCCGGCCGGTGGCAAATTGA
Protein sequences of DBSCAN-SWA_14 >NC_022536|128452:129625|128452_129625_+|WP_048903064.1|DBSCAN-SWA MTNARLPELATVSAVVLASVFGPVAVLAQVPVKDGARRDIQKETENWYKTYQSDTRSVKGASGGTTDSVAPGQGSGAPTDCSAEGIGGEAAPAGPSKRTYSQQEVARLVADEAIRKGVDPNLALAIAEQESRFRQSARSPVGATGVMQLMPGTAAGLGVNPYDLRDNIRGGVKYIKQLQGMFGNRYDLIAAGYNAGPYRQSLQNGRIPDIPETQDYVKKVSSYYSRNNAENGDRRPPGDGVETETVSDVSGCGEQLKKAIDRNTEAQAERGSVWNELLGKSLTANQQYLQRLQAGLRSSSAALRGSGGGNAGEHDGSLVMAEVRCPSTTIDTGNVRCFAVPSTVTTDQIRRWLEDLQEEARASGDVATFSVVEDPALGLVTVVDARPVAN |
1 | Geobacillus_virus(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_15 |
143229 : 143976
Sequences of DBSCAN-SWA_15
Nucleotide sequences of DBSCAN-SWA_15 >NC_022536|143229:143976|DBSCAN-SWA CATGGCAGTAAAACCCCTCTCTCTCGCCATATCGACCTTCAAGGGCGGGGCCGGGAAGTCGACCCTGAACGTCAATCTCGCTGCGGAGTTTGCACAGCAAGGCCACAAGACGCTGCTGATCGATTGCGACGAACAGGAATCGTCGACGCGCTGGTACAATGTCTCCATGAAGCGGGGCCTATTGCCGGCAGACGGGTCATTGTTGCATCTGCGCGTGGCTCCTAAAGACAGGTTTCGTGAGCGGCTGGACAACGCGCCCGAAGCGGCAATCCGGATCTACGATGTCGGCGGGTACGTGCATCAGCGTGCGATCGAAGTCTACCACGAGTGTGATGCTATCCTTATTCCCGTGATTCCAGAGCCAACGGCAGCTACGTCAGCCATCAAGGTGGCTACCATATTGACGGAAATAGGTCGGAAGCGGGATCGCGGGCCGATCCCTTACGCGGCGATCTGGAACTACGTGGACATGATTGCCCTCAAGCACAATCGATCGCTTCCCGAGGTTCACGCGATCTTGGCGCGCGGCCGGATACCCGTGGTCGAAACGGTCGTCCGTAAGAGCAATCATTTCAGCGACGTGGGGGCCGGTTACGGTTCCCTTTACTCAAAACTCGCAGGCATCGTGGACAACCCGGCCTTGACGCCGTCAGCCAAAAGGCGGGCGTCGGAAAGCGTGCTCGACGCAATCGATCTCGTGAAAAGGATCAACAACGAGTTTATTGGTCTGCTGCAGGCGGAGGCGTAA
Protein sequences of DBSCAN-SWA_15 >NC_022536|143229:143976|143229_143976_+|WP_048903067.1|DBSCAN-SWA MAVKPLSLAISTFKGGAGKSTLNVNLAAEFAQQGHKTLLIDCDEQESSTRWYNVSMKRGLLPADGSLLHLRVAPKDRFRERLDNAPEAAIRIYDVGGYVHQRAIEVYHECDAILIPVIPEPTAATSAIKVATILTEIGRKRDRGPIPYAAIWNYVDMIALKHNRSLPEVHAILARGRIPVVETVVRKSNHFSDVGAGYGSLYSKLAGIVDNPALTPSAKRRASESVLDAIDLVKRINNEFIGLLQAEA |
1 | Vibrio_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_16 |
164119 : 165607
Sequences of DBSCAN-SWA_16
Nucleotide sequences of DBSCAN-SWA_16 >NC_022536|164119:165607|DBSCAN-SWA TATGCAAACTCTTTCAAACTACCGTCAGGAGTGGTTCTCCAACATCCGTGGCGACCTACTTTCGGGTATCGTCGTCGCTCTCGCGCTTATCCCGGAAGCGATAGGCTTCTCGGTGATTGCGGGTGTCGACCCCAAGGTCGGCCTCTACGCTTCCTTCGCAATCGCCTGCGTGACGGCCTTTGTAGGCGGCCGGCCGGGCATGATCTCTGCCGCCACGGCGGCGACTGCTGTCGTCATGATCTCGCTGGTCAAGGACCACGGTCTTCAGTATCTCTTTGCCGCCACCATCCTGATGGGCATCATCCAGATCGCTGCGGGATGGCTGAGGCTGGGCCGCGTGATGCGCTTCGTTTCCCGTTCTGTCATCACAGGCTTCGTCAATGCGCTTGCGATCCTGATCTTCATGGCCCAGTTACCCGAACTCGTTGGGGTCCCGACGCTCACCTACATGATGATTGCAGGCGGTCTTGCCATCATTTACCTCTTTCCATACGTGACCAAGGCGATCCCATCGCCGCTTGTGGCCATCGTGGTTCTCACGACCATGGCGTGGTGGTTCGGCATGGACCTGCGCACCGTCGGTGACCTCGGCGAACTGCCGTCTTCTGTCCCCTTCCTGATGCTTCCTCAGGTTCCGGTGACCTGGGAGACGCTGGAGATCATCTTCCCATACTCGGTGACACTTGCTGCTGTCGGCCTGCTGGAATCGCTGCTTACCGCGCAGATCGTCGACGATATGACGGACACGCCGAGCAACAAGAGCCAAGAATGCGTTGGACAGGGCGCGGGGAACATCGCCTCGGCACTGATCGGCGGCATGGGCGGATGCGCCATGATCGGACAATCGGTCATCAACGTAACATCCGGTGGTCGAGGACGTCTTTCAACGTTCGTGGCTGGTTCTTTCCTGCTGTTTCTGATCGTCGTCCTCAACGATCTCGTCCGGATCATCCCCATGGCCGCGCTTGTTGCGGTTATGATTATGGTTTCAATCGGCACTTTCTCCTGGAGGTCGATCGTCGATCTGAGACGTCACCCGCTTCCGTCGTCCTTCGTCATGCTGGCGACCGTCGTCACCGTCGTTGCAACACATGATCTGGCAAAAGGCGTAATTGTCGGCGTTCTGCTGTCGGGCATTTTTTTCGCTGGCAAGGTAGCTCGTCTTTTCAAGGTCACGCGTCAGGAGAATCCCGAACAGAATAGTGTCACCTATGAAGTAGTGGGGCAGGTCTTTTTTGCGTCGGCAGAAGCCTTTATCCACGCGTTCGACTTCACCGATAAGGGCAAACGGATCATCATCGATCTCGCCAAGGCTCATCTCTGGGACATCACCGCAATTGGCGCATTGGACAAGGTCGTGCTGAAGTTCCGTCAAGCGGGCGACGAGGTCGAGGTTCTCGGATTCAACGAAGCGAGTGCTGATATGGTTGATCGCTTCGCGCTGCACGACAAGGATGAGCGGCACGCGGCAAGCGCCGCACTACACTGA
Protein sequences of DBSCAN-SWA_16 >NC_022536|164119:165607|164119_165607_+|WP_048903014.1|DBSCAN-SWA MQTLSNYRQEWFSNIRGDLLSGIVVALALIPEAIGFSVIAGVDPKVGLYASFAIACVTAFVGGRPGMISAATAATAVVMISLVKDHGLQYLFAATILMGIIQIAAGWLRLGRVMRFVSRSVITGFVNALAILIFMAQLPELVGVPTLTYMMIAGGLAIIYLFPYVTKAIPSPLVAIVVLTTMAWWFGMDLRTVGDLGELPSSVPFLMLPQVPVTWETLEIIFPYSVTLAAVGLLESLLTAQIVDDMTDTPSNKSQECVGQGAGNIASALIGGMGGCAMIGQSVINVTSGGRGRLSTFVAGSFLLFLIVVLNDLVRIIPMAALVAVMIMVSIGTFSWRSIVDLRRHPLPSSFVMLATVVTVVATHDLAKGVIVGVLLSGIFFAGKVARLFKVTRQENPEQNSVTYEVVGQVFFASAEAFIHAFDFTDKGKRIIIDLAKAHLWDITAIGALDKVVLKFRQAGDEVEVLGFNEASADMVDRFALHDKDERHAASAALH |
1 | uncultured_Caudovirales_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_17 |
170012 : 172783
Sequences of DBSCAN-SWA_17
Nucleotide sequences of DBSCAN-SWA_17 >NC_022536|170012:172783|DBSCAN-SWA TATGTCGTTCCGACCGCTTCATGACCGCATTCTCGTCCGCCGGGTCGAATCCGAAGAAAAGACCAAAGGCGGTATTATCATTCCCGACACTGCCAAGGAAAAACCCCAGGAGGGCGAGGTCATCGCCGTTGGGCCCGGTGCGCGCAACGATGCCGGACAGATCCAGGTGCTCGACGTCAAGGTGGGCGACCGCATCTTGTTCGGCAAATGGTCAGGCACCGAGATCAAGATCAATGGCGAAGACCTGCTGATCATGAAGGAAAGCGACGTCATGGGCATAATCGGCGCCCAGGCTGAGCAGAAGAAAGCCGCCTGAGCGTCCCACCCTACACCCATCAGTCAGATAGCTGCCTATTGAGGAGTTAGAAAAATGGCTGCCAAAGAAGTCAAGTTCCACACCGATGCCCGTGAACGCATGCTGCGAGGGGTCGATGTGCTCGCTAATGCCGTGAAAGTCACCCTTGGCCCCAAAGGTCGTAACGTCGTTATCGACAAGTCCTTCGGCGCACCGCGTATTACGAAAGACGGTGTGTCGGTCGCGAAGGAAGTCGAGCTTGAAGACAAATTCGAGAATATGGGCGCGCAGATGCTGCGCGAAGTCGCTTCGAAGACCAACGATCTGGCCGGCGACGGCACCACGACCGCAACTGTCCTCGCCCAGGCAATCGTCAAGGAAGGTGCTAAGGCGGTAGCCTCCGGAATGAACCCGATGGACCTGAAGCGCGGCATCGATCTTGCGGTCGAAGCTGTCGTCAAGGAGCTGAAGACCAACGCCCGTAAGATAACCAGCAATTCCGAAATCGCCCAGGTTGGCACAATCTCTGCGAACGGGGATACGGAGATCGGACGCTATCTTGCCGAGGCGATGGAGAAGGTCGGCAACGAAGGCGTCATTACCGTCGAGGAAGCAAAGACCGCAGACACCGAACTCGAAGTCGTCGAAGGCATGCAGTTCGACCGCGGCTACCTGTCGCCTTACTTCGTCACCAATCAGGACAAGATGAGGGTCGAACTGGAAGATTCCTATATCCTTATCCACGAAAAGAAGCTCTCGAACCTTCAGGCGATGCTTCCGGTTCTCGAAGCTGTGGTCCAGTCGGGCAAACCTCTCCTGATCATCGCTGAAGACGTCGAAGGTGAAGCTCTTGCAACGCTCGTCGTCAACAAGCTGCGTGGCGGCCTCAAGATCGCTGCGGTCAAGGCTCCGGGCTTCGGCGACCGCCGCAAGGCCATGCTGGAAGATATCGCCATCCTGACCGGCGGCACTGTCATCTCCGAAGACCTCGGCATCAAGCTTGAAAACGTCACGCTCAACATGCTCGGCCGCGCCAGGAAGGTGGCGATCGAGAAGGAGAACACCACCATCATCGACGGTGTTGGCTCCAAGTCCGAGATCGACGGCCGTGTTGCGCAGATCCGCGCTCAAATCGATGATACCACTTCCGACTATGACCGCGAAAAGCTGCAGGAGCGTCTCGCCAAGCTGGCTGGCGGCGTTGCCGTCATCCGGGTTGGCGGCTCGACGGAAGTCGAAGTAAAGGAAAAGAAGGACCGCGTCGACGACGCGCTGCACGCCACTCGTGCGGCGGTAGAGGAAGGCATCCTGCCTGGCGGCGGCGTCGCGTTGCTACGCGCTGTCAAGGCGCTCGACAATCTCGGTACGGCCAATCAGGATCAGAGGGTCGGCGTTGATATTGTCCGCCGTGCAATCGAGGCACCCGTCCGTCAGATCGCCGAAAACGCCGGCGCGGAAGGCTCCGTTATTGTCGGCAAGCTGCGCGAGAAGACGGACTTCTCCTTTGGCTGGAACGCACAGACAGGCGAATACGGTGATCTCTACGCGCAGGGCGTTATTGACCCGGCCAAGGTCGTTCGTACTGCGCTTCAGGATGCCGCCTCTGTAGCAGGCCTTCTGGTGACGACCGAAGCGATGATTGCCGAAAAGCCGAAGAAGGATGCCGCGCCTGCACCCGTCGGAGCGGGAATGGACTTTTGATGGAAGGGAGCGCCCAGAGGGCGCCCCCTATTTGGCGCTCTCTGGGCAATGATGCGGAGCCTGACCGGCCAAATGCCAGTGCTAGTACTCTTCCGCAAATTCGCGGGCCGTCAGGATGAACGCAGTGACGGCGGCCGGTGGGCCGTTCCTCCCGAAATAATAAAGGGCCAAGGGCGTCAGGCGCGGCGTCCACTCCTCCAATATGCGCATCAGTCGTCCAGCCTCAAATCTTCGCGGTAAGTCGGTTTCTGCGAAAGCGAAATCGACCACACGAAACTTCAGCAGGAAATCAATCGGTCCTGCCGAGGTTTTTCCATTCTAATGTCATCCAGGGTGGAATGCTGCGCGGTTGACATGGCGACCTCGTCGCAGTCGGACTGCCCGAAGTTCGGAACGGAGGCGACATCGAATAAGGAGGGGTCGGTTTAATGTCTCTGACATTGCGCTTCCGTTCTGACATCCTCGAGCCTCTAGGGCCGGAGCTGGGCGGAATCGTCCTAAACGGTGTGCCTGGTCACTACCTGGGCATCGACCTCACCAGCCGGATTGGCCTGCCGGACCTATGGTGGCGAAGTGAATTGAGGGAGCTTAACAGCGACGATGCGCTTAAGGAAATCGTCTACCAGAACCTCGGCGCTGGGTTCGGCATCGTGCGGAACACGTATAACGGCGTTGACCTTGCGCTGCAGGGCAATGTCGAGCGGGGCGTTGAGAACGCTGCGCCAGAATTCATTCGAGATCTCCTGCGTTCCGTCGCTATTCCAGTGACGGTGTAG
Protein sequences of DBSCAN-SWA_17 >NC_022536|170012:172783|172435_172783_+|WP_022557256.1|DBSCAN-SWA MSLTLRFRSDILEPLGPELGGIVLNGVPGHYLGIDLTSRIGLPDLWWRSELRELNSDDALKEIVYQNLGAGFGIVRNTYNGVDLALQGNVERGVENAAPEFIRDLLRSVAIPVTV >NC_022536|170012:172783|170012_170327_+|WP_006699024.1|DBSCAN-SWA MSFRPLHDRILVRRVESEEKTKGGIIIPDTAKEKPQEGEVIAVGPGARNDAGQIQVLDVKVGDRILFGKWSGTEIKINGEDLLIMKESDVMGIIGAQAEQKKAA >NC_022536|170012:172783|170381_172007_+|WP_022557254.1|DBSCAN-SWA MAAKEVKFHTDARERMLRGVDVLANAVKVTLGPKGRNVVIDKSFGAPRITKDGVSVAKEVELEDKFENMGAQMLREVASKTNDLAGDGTTTATVLAQAIVKEGAKAVASGMNPMDLKRGIDLAVEAVVKELKTNARKITSNSEIAQVGTISANGDTEIGRYLAEAMEKVGNEGVITVEEAKTADTELEVVEGMQFDRGYLSPYFVTNQDKMRVELEDSYILIHEKKLSNLQAMLPVLEAVVQSGKPLLIIAEDVEGEALATLVVNKLRGGLKIAAVKAPGFGDRRKAMLEDIAILTGGTVISEDLGIKLENVTLNMLGRARKVAIEKENTTIIDGVGSKSEIDGRVAQIRAQIDDTTSDYDREKLQERLAKLAGGVAVIRVGGSTEVEVKEKKDRVDDALHATRAAVEEGILPGGGVALLRAVKALDNLGTANQDQRVGVDIVRRAIEAPVRQIAENAGAEGSVIVGKLREKTDFSFGWNAQTGEYGDLYAQGVIDPAKVVRTALQDAASVAGLLVTTEAMIAEKPKKDAAPAPVGAGMDF |
3 | uncultured_virus(66.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_18 |
178518 : 188855
Sequences of DBSCAN-SWA_18
Nucleotide sequences of DBSCAN-SWA_18 >NC_022536|178518:188855|DBSCAN-SWA CATGGGAAATATTTCAATGCGTAGCAAATTGGTCCTCTTCATATGTGGTCTCACATGTGCGCTGTCTATTACTGCTGCGGTTCTAGTCAAAAATGCAATGGAAAGTGTTTCTGTTGAGCGGATCGAGGCGTTGAAGAGCCAGGTGGATATCGCCTACTCCATCATGAATCACTTCTACGAAGCCGAGAAATCGGGAACGTTAAGTCATGAAACGGCCCTGTCCTCGGCTGCCGACGCAATCGAAGCGATCCGCTTCGAACCAACCGGCTACATATTCGGGTACGACTACAATGGTGTGCGCGTATTGATGCCGGACAAGAAGGACGTCGGGAAGAATTTTATTGGCACCAAGGACAAAAACGAGGTCCCCGTCGTCAAAACATTGATAGAGCTTGCTCGCAATGGTGGCGGATCATTGCAGTACTACTGGCCAAAGCCGGGACTTCCTGCCGAGGAACAAGTCGTGAAAACGGCTTATGCGCGGGATTTTGCGCCATGGCAGATCGTACTGGGGACTGGTACCTATATGGACGATATCGACGGCAAGAACTACCAGATATACAAGAACGCTGCGATTATCGGCTTATCAGTACTCGTTGTCAGCGTGGCTATCGCGGTCGCTTTGCTGCGAAGCATCACGGTGCCGTTTTCGGAGGTCCGCAAGGCGCTCGGTGCAGTAGCCGCACAGGACATCTCCTATCAAATTCCCTACACTGATCAGCCAAGTGAAATCGGCATGATGGCCACTTCGATCAAGACCCTTCAAGACAGAGTGAGAAGGCGGCTGGAGTTGGAGCGTCGCGATGCCGAGAACAAACTCGCCCTAGAAGCCGAACGGGAGCAGCGATTGCAGCTCGAAGAGGCAGAGCGATCGGTCCAATCGCATGTCGTATCCACAATCAGTGGGGCCCTGGAAGCATTATCCCGCGGCGATCTTACCGTAAGGTGCGGCGATCTAGGCGAGAAATATGCTGCCCTGCGCTTGAACTTCAATACCGCAATGGAGAAACTCGAACTGGCCTTGGCTGTCGTCAATCAAAAAGGCCAAGTCATTTCGATGAGCAAGGATGAAATTTATCGCGCGTCTGCCGATCTTGCGCGCCGCAGCGAGCTTCAAGCAGCAAACCTTGAAGAGGCTTCAGCTGCAATTGATGAACTGGCGGTTACTGTCCGGGAAACAGCGCAGGGAGCGCACAGCGCAGCAGCCAAGGTAAGCGCCATCAGCCAGGAAGCTACCCAGAGCGAAGCAATTGTCGTGAAAGCAATCAGCGCGATGGCCGGAATTGAACATTCTGCGAGAGAGATCTCCAATATCATCAGTGTCATTGATGAGATCGCATTCCAGACCAATCTCCTTGCGCTGAACGCCGGCGTCGAAGCCGCTCGAGCAGGCGAAAGCGGTAAGGGCTTCGCAGTTGTTGCCCAGGAAGTGCGCGAGCTGGCACAGCGTTCAGCCTCTGCCGCGAAAGAGATCAAGGATCAGATCTCCAAGTCGACGCTTCAGGTCACCGAAGGTGTGAGCCTTGTCGGCAATGCCGGTCAGGCGCTCCAGCGAATATCGAGACAGATAGAAGAAGCCGATGAATTCGTGGCAAGTATTGCCCGTAGCGCCAAAGAACAGGACACGACACTCTCCAGCGTGGCGAGTTCGATCAATGAGCTTGACGTGGCAACCCAGAAAAATGCCGCCATGGCGCAGGCATCGACTGAGATTGCGGCGGCTTTGGCAACTGATACCGAAGGCCTGCTGAGTTTGATTGCCAACTTCAAAACCGAATCCGGCAATCGTGACACCCTTCTTTATGAGGCTGCGTAAGGCGCTAGCCCGTTCGTTGAATGAACTGCTTTTATCGTAAGTGAGACTTTACTGCCTCCGGCGGCTTCGTGCCGCCGGAGGGCGCTGGACCACCAACGGAGCAAGACAGTGCTCAAACTGCGAGGCTGCGAAACTACGGGCGGTGCAATTGCGTCGTGGCATTTGTCGGATGGAAAGAGCCAGCGGTGACCGGCCCTCTCCCTATAGGTACCAGTGAAGTTGAGCTTACTGCGGCATCTTGTTCTGATGATTGGCGATCAGATCATCCACGACAGCCGGATCGGCGAGCGTCGAGGTGTCACCAAGCGCGCCGAAATCATCCTCGGCGATCTTGCGCAGGATACGGCGCATGATCTTGCCGGAACGGGTCTTCGGCAGACCGGGCGCGAACTGGATCTTGTCCGGTGTGGCGATCGGGCCGATTTCCGTACGGACATGCTTCACCAGGTCTGCTCGCAGCGCGTCATTGCCCTCGTGGCCCGACATCAGGGTGACGTAGCAATAGATGCCCTGGCCCTTGATGTTGTGCGGGTAACCCACAACGGCCGCTTCGGAGACGAGGTGGTGCGAGACCAGCGCCGATTCGACTTCGGCCGTGCCCAGGCGGTGACCGGAAACGTTCAGAACGTCGTCCACGCGACCCGTGATCCAGTAGTAGCCGTCTTCATCGCGCCGGCAGCCGTCACCCGTGAAATACTTGCCCTTGTAGGTGGAGAAGTAGGTCTGCACGAAGCGATCGTGATCGCCGTAGACGGAGCGGGCCTGGCCCGGCCAGCTGTCGGTGATGCAGAGGTTGCCGTCGGTCGCGCCTTCCAGCACTTGACCCTCATTGTCCACCAGCTCCGGCTTGACCCCGAAGAAGGGTCGCGTGGCCGAACCGGGTTTCAGCGCGGTCGCACCCGGCAGCGGCGTGATCATGATGCCGCCGGTTTCCGTCTGCCACCAGGTATCGACGACAGGCGACTTCTGCTCGCCGACGACATTGTAATACCACTCCCAGGCTTCCGGATTGATCGGCTCGCCGACCGAGCCGAGAAGGCGGAGCGACGAACGGTCCGAGCGCTTCACAAACTCCTCGCCGGCACCCATCAGCGAGCGGATCGCGGTCGGTGCGGTGTAGAAGATGTTGACCTTGTGCTTCTCGATCACCTCCCAGAAGCGGCCCTGATCCGGGAAGGTCGGGATACCTTCGAACATGAGCGTCGTGGCGCAGTTCGAAAGCGGCCCGTAGACGATGTAGGAGTGACCGGTGACCCAGCCGACATCGGCCGTGCACCAGTAGATGTCACCATCGTGATAATCGAAGACATATTCATGCGTCATCGACGCATAGACCAGATAGCCGCCGGTCGTATGCATGACGCCCTTCGGCTTGCCTGTCGAACCGGAGGTGTAGAGGATGAACAGCGGATCTTCTGCCTTCATCTTCACCGGCGGGCAGTCTGCCTTGACCGTGTGCACTTCCTGGTGATGCCAGATGTCGCGGCCCGGCGCCCAGCCGATCTTGCCGCCGGTGCGGCGCACGACGAGCACCTTGTTGACGATGACATACTGCTTGGCCGCGATGTCGATCGCGATATCGGTGTTTTCCTTAAGCGGCACGGGCTTGCCGCCGCGCACGCCTTCATCGCAGGTGATGACGAGAGTCGATTCGCAATCGACGATACGGCCGGCGAGCGCTTCTGGCGAGAAGCCACCGAAGACCACCGAATGCACCGCGCCGATGCGGGCGCAGGCGAGCATGGCATAAGCCGCTTCCGGGATCATCGGCATGTAGATGGTGACGCGGTCACCCTTCTTCACGCCGTGCTTCTTGAGGACGTTCGCCATGCGGCAGACGTGCTCGTAGAGCTCGTTATAGGTGATCTTCTTGTCGATATAGGGATTGTCGCCTTCCCAGATGATGGCAGTGCGTTCGCCATGCGTCTTCAGGTGACGGTCGATGCAGTTGTAGGAGACGTTGGTCAGGCCGTCCTCGAACCACTTGATCGGGACCTTGCCCTTGAAGGACGTGTTCTTGACCTTCGTATAGGGCTTGAACCAGTCGATCCGCTTGCCGTGCTTGCCCCAGAATTTTTCCGGGTCTTCGATGCTTTCCTGGTACCATTTCTGATACTTGTCATCGTCGATCAGAGCCCGGGTTTTTGCCGATTTCAGCACCGGATAGGTTTTGGCTGACATGAATTCCTCCTCATAAATTGGCGATTTTACTGAAGCATCAGGCACTTCATTGGAGGGTGAATCTCAGGAGCCCCCCTGCAGCTTGCTTTGTGGTCGCTGCACGAGACCGCTGCAGGACGCCCTCGCAGATATTCAAGCATTCGAGGGCCCGCGGCTGCACCGGCCAGAGCTCAACTCCTGGTCCGCCTAGTTGTGAGTTGATCAGACTCTTTTGACGGGGCGTTCCGCGGGCCCCTAAGAGAAGTCCGGGCGAGTTGCCCGGACTTCGTCTACGTCGATCTGTTGGTGGCTCAGGAAACCAGGCGACTTGACAGTCGACTTGCCGAGTGAACCACTTCGCCGCGTCAGCTGTTCAGATCGCGGCCCCGCGTTTCAGGCAGGAGAAGCGCACCGATCACGGCCGTCAGTCCGGCAAACGCGACGGTATACCAAAGCCCACCGAAGATGTTGCCTGTGGCGATCACCACCGCCGTCGCGACAAACGGAGCGAACCCCCCGATCCAACCGTTCCCGAGCTGCAAGGCAACGGAAACACCACTGTAGCGGATACGGACAGGGAAGAGCTCGACAAGAAGCGCTCCGAGAGGGCCGTACACCATCGTGGCCAGGAACACGAGGCAGAACAAGATGCCGATCACGGCGGCTCCGTTCGGTTGAACGGGTGCTTTCGGATCAGGCAAGCCAGTTGCAGTCAAAGCCGCGAGCAGTTCAGCTTCGTTATATGGCTTCGCCTCGCCGCCGTTCACTTGAACCCGCAGGGGGTCCGCGTTGGTCTTGACCGTGTAGGGAACCGCCCGGGCTGTCAGAAGCGCCTTCGCTTTCTGGCAATCGGTCGTGGCCTGGGGACTGAACAATTGACCGACGAAGGATTGGTCGTGGTCGCAGTCGTTTCCAGCGAGCGTGACCGAGGTCTTCGCGCGGAACTCGGCAAGGGTGGGGTTGCCGTAGGTCCCCAGCGCCTGAAAGAGTGGCATGATCAGGAACGCCGACAGTGCGCAACCTGTCACGACAATCCATTTGCGTCCAATCCGATCGGAAAGCCAGCCGAAGAACAGGAAAAAGGGCGTGCCGAGCGTCAGCGCGATCAGGAGGTAGAAGTTGGCCTGATCGTGCGTCATCTTCAGCGTCGTCGTCATGAAGTAGAGGCTGTAGAAATGGCCCGTGTACCAGATCACCGCCTGGCCGGCGACGACACCAAAGATCGCAACCATAATCAGCCGAAGATTCCGGCTATCACCGAGGGTCTCGGCGATCGGATTCTTCGAGCCCTTACCCTGTGCCTTCATCTCCTGGAACACAGGCGACTCATGCAGCTTCAGTCGGATGTAGAGCGAAAGAACCAGAAGCACGATCGAGCCGACGAAAGGCAAGCGCCAGCCCCAGCTTGCAAAGGCTTCCGGCGACATCGAGCTGCGGCACAGCACAATCACGAGCAACGACAGGAAAAGACCGAGCGTCGCCGTGATCTGGATCCAGCTGGTCGTTAAACCACGTTGATTGTGATCGGCGTGCTCGGCAACATAGGTCACGGCGCCGCCGAACTCGCCACCCACAGCCAATCCCTGGAGAAGACGCAAGGCCACCAGCAATGCAGGGGCAGCCCAGCCTATCGATTCGTAGGTTGGCAAGAGGCCAATGGCTGCCGTTGCAAGGCCCATGACAAGCATGGTGACGAGAAAGGTTTTCTTCCGGCCGACGAGGTCACCGAGTCGCCCAAAGACGACTGCACCGAACGGCCGAAGGACAAAGCCGGCGCCAAAGGTCGCAAGTGCCGCAAGAAGTGCCGCGGTCTCGTTTCCCTTGGGGAAGAACAACGTACTGAAGAATGCGGCGAGTGACCCATAAATGAAGAAGTCGTACCACTCGAAAACGGTTCCGAGCGCTGCAGCGAAGACCGTGCGTTTTGTGTTGCGGTCGAGGCCTTCGGCCCGCGCAACGATTCCAGACGAAATTTCATGAACAGACATTACGAGCCCCTCCCCTAGATTGCATGTTGCGAGAGGCCTTGGCCGCTCGCCTGAGAAACCGGATCTTCACAGGCCCTTAGAAATGCTGCCAGCATTTCATTGAACTTGTGAGGCTGCTCCAGCATCGGGAAGTGCCCCGCATTCTCGACCACCTCCAGTCTGCTGTCCGGTATGCCCTCAGCCAACACCACCGACTCCGCCGGCGGCGTGATGATATCCTCGTTTCCGACGACCACCAGGGTCGGGACTTTGATGTCACCAAGACGCGAACGACTGTCGGATGCATTCAGCGAGGCGATCGCTTCGCGCGCAACGAAATCAGGGGTCTGTGCGACCTCCTGCCTGGCAAAGTTCAGGAGTTCCTGACTGGCCGCGCTGCCGAATGACCGATCGATGACGTTTTGACTGGCCTGGACGACACCGAGGTCGTCGATTGCCTTCAGAACGTTATCGACATTGACGTCCTCTCCGAGTCCATGCGGCGTCGCTCCGACGAGCACCAACGCCTGTACGCGCTCCGGATGCGCCAGCGTAAAGCTTTGTGCGATGGTTCCACCCATTGAGAGGCCGACAAGGAAGGCCTGACGGATGTCGAGCTTACGGAAGACTTCGAGCACGTCGTTTGCGAATGCGTCGATGGTATAGCTGCGAACGGCAGGCCTTGGCGACACGCCGTGCCCCGGCAGGTTGATACGGATGACTTTGTAGCGGGATGAGAAGGCCTCGACCTGCTCGCGCCAGAACTCGGCGGTCGTCGTAAATCCGTGCACGAAGACGAGCGGTTGGCCCGCGCCCGACACCCGGACGATCGTGTCGCCTATCTCCATTGTTTCTGTATGCATGGTATCTGCTCCCCTTTCGGTTTTTACGCCTGGGAGCATCGCAAGCCATGGGCCAGTTCCATGCATGCCAATGTAACGCAATCTAACACTCTCAAGACAAACGGTTATTTGAAACTGAGGGCGGTTCGTTCGACTGTCTTGCCGGCGGCGCACTTAGCCCCCTTGTCTCGACTGCGTTACAAAAATTCTCGCAAATGAGACAAAAACCGATAGCATATGTTTGGCGATACAACACCAACAGTCAGGCAGTTGATATGACCCTTCCCGAATGGCTTGCAGGTTCTTCCGAGGAAGCCGGCTTTCTCCGGCTGGTCCTAGATCACGTTTCCGATTGCCTCGTCGCCGTCGACACGGAAGGAACGATCGTCCTGATTAACGACCCCTATTGCCGCCTCCTTGGCGGTACGGCCGAGGAATTTCTCGGACGGCACATCACCGATGTGGTGGGGCCGCAAACCAAACTTCATTTCGTCGCGCGGGGTATTGGCACTCATATTGGTTATCCGCTCGAAGTGCGGGGCCACAAGCTGGTGACGAAACAGGTCCCTGTGCGCAAGGATGGACGGATCATCGGTGCCGTGGGTCTGGCGCTGTTCTCGGACTACGATGCGCTGAAAAAGACCTACGGCCGGATTTCAAAAGCGGAACTCGCCATCCCTTCGAAGCCCAGGGCCTGGCAGGCCAAGTTCGGTCTCGACAATGTCATCGGGACTGGTCCGTTGATGGAAGCGCATCGCGACGCGCTTAAACAGGCGGCCGCCTATGATCTGCCTGTCCTCATCTGCGGTGAGACAGGCACCGGCAAGGAATTGGCCGCTCAGGCAATCTATTCCTTGTCGGTCCGCTCTGCCGGACCATTCGTATGGGTCAATTGCGCATCCATTCCCAGCGAACTTATCGAAGCAGAGCTCTTCGGCTATGAGGGCGGGGCTTTCACAGGCGCACGCAGCCAAGGTAAACCCGGCAAGTTCGAACTGGCGGCAGGCGGCGTGCTCTTCCTTGACGAGATTGGCGACATGCCGCTTGCCTTGCAGGGCAGTCTTCTGCGCGTGCTCCAGACCGGGGAGATCGTCCGCGTCGGCGGCACCAACCCGGTCGGCATCGATCTTCGCATCATCTGCGCTACCAACAAGCCACTCGTCGAACTGGTGCAGACGGGGCGTTTTCGCGAGGACCTATATCACCGGCTGAACGTTCTGCCGATCGAGGTTCCGGCCCTGAGAGAAAGGGGCGATCTGGCTCATCTTGCCGAACACCTCCTTGCACACATCGCCACCCGGCTGTCCGTACCGGCGCCAGTATTGACAGCTGAGGATCATGATCGGCTTGTCGCGCATACGTGGCCGGGTAACGTCCGGGAATTGGAGAATGCCCTCACGCGCATGATCGTGACGGGGCATATATCCACTGCATCGCTCGATTATCGCCATCGGTCTCCCTCTGTGGACTCGAAGAGCGACTTGAAGAGCCGCATGAAATCCGAAGCCCATGCAGCCCTTCGCGCCGCCCTCCTGCAGACCGGAGGCAACAAGCAACGCGCAGCCGAAGTGCTCGGCATCAGCCGCGCGCAACTCTATCGACTGTTGAAAGAGCAAAGCCCGCCTAGCAAGTAGCCCTTCAGTCGCGCAAGGTTTAAGCCGCCGAATGCGATCTAGCGCCTGAGGTCCGGTTGGTCGGAATTAGGCTATGCAAATTCGAGCGCGGTCAGAAGATCTTCAACGAGATCCTCGGCGTCCTCGAGGCCAACCGAGCATCGTATGACGTCCTCGCCAAGGTTGACGCAGAATTGGCTGCCGGCAAGAGACAGATGACGGGCATTGGTGAGCCGAGCTGAACCGCAGATGAAGCTACCGACCTCCGCTAAATCTCCGCCTGACCTGATCAGGCTCAGTCTCTTGATCATTGCCGTGGCGCCCTCCGGCCCCGCTTCAAGCCCAAAGGCAAGCATCCCCGAGCCGCCCGACATCTGACGGCGTGCAATTTCATGATCGGGATGTGACGTGAGGAAAGGATATCTCACCCAGGCAACCGAGGGATGGGCCTCCAGTGTCAGAGCAAGGACGTGCGCGGTGTTGCAATGTTGATTCATTCTGAGCGCCAGAGTACTGAGCCCGCGCAGGATCAGGAACGCAGCTTGCGGCGAGATGGTGGTGTCATGGCGCTCGCGCAGGCTTTCATGGCGCAATTTTCGAATATCCGCCGCCGTCCCAAGTAATGCGCCGCCGACCGTATCACCGTGCCCGTTCATGTATTTCGACAGCGAGTGCAGCACGATGTCCGCCCCATGTCCAAGCGGCCTTTGGAGGGCCGGCGTTGCAAGTGTGCTATCAACTGCCACCTTGAGGCCGAACGCGTGAGCTTGTTCAGAAATGGCCGAGACATCCAGAACGCTGTTCAGTGGATTGGTGGGCGTATCGAAATACACTAATCTTGTCCGTTCGGTTATCGAACGGCCGATCGCGCTCGGCTCTGAAAGATCCACCGCGACCAGCTTTGCGCCGACCGCCGAAACAATCTCCAGCATTAGGTACGCTGTGTCGGTGCAGACCGGCCTGTGAACAATCATCTCGTCTCCGGCAGCAAGAAATGACCGGGCAAGCGTGCGAAAGGCAACCAGACCTGAGGAGAGGACAAGACCGGCCTCTGCTCCTTCAAGACGGGCAAGCTTGTGCTCGAGCAGCCATGTCGCCGAACGGTCTTTGTCCGCGCCGGATGGATGGACAAGTGTGGTCCGACCGCAGACAGAAGCCGATGGTCCACTGCGAAGGGACGCAATCGCAGTAGACGGGAACGGCTCATGTATGGAGCGGGTCGAAAAGCCGAGATCATTGACGTCAGGCGAGGCCTTGCCCGAAACTGGCTCGTCTCTCGTTATCTTGCTGCTCATCTGTGCCCCGATCGTCGATCGCTAAAGACCCGTCCTCACCCGAGGGCGCTGCAGTTCCGCCACAGCCATTACGAACCGTAGTTCATCGTTTCCACGATTCATGTAGGCGTGCTTGTCTTCGGTTCGCGCCAGAACGGATGAACCCGCGTGAACCAAGTGCTCGCTGTCTGCAAACTTCAGGGTGAGCTCACCCGTTTCGACGTTAATGAGCTCGAGTGTGCCGGGCGAATGGCCGGGAGATTCAAATATCTCGCCCGGGAAAAGCGTCCAGCGCCACAGCTCGATCTCGTCCGGCCCGTTGGTTCCAACGAGAAGTGTTGCGCTTCCGCCCTTCGGACCGTGCCACAGGGTCGATGCGTCCTGTGCTGGGACGATGCGAACAGGAACACTTGACGCTACCCCGACGAAATCCGCAACCGACACGCCCAAGGCGGTGGCGGCCCGGCACAGCGTGGCGATGCTCGGATTGGCTGTTCCTTTCTCGATCTCGACAAGCATGCCCTTGCTCACGCCGGACTTCCGGGACAGTTCGTCCAGCGTCAGGCCGCTCTGGCGCCTGAAGGTTTTCAGATTCTGGGAAACGGTTGCGCTGACGCGCTCGACATCGGCGACCGTGTCGGTCGATATATTGACTTTCTTTTCCAT
Protein sequences of DBSCAN-SWA_18 >NC_022536|178518:188855|182858_184481_-|WP_022557268.1|DBSCAN-SWA MSVHEISSGIVARAEGLDRNTKRTVFAAALGTVFEWYDFFIYGSLAAFFSTLFFPKGNETAALLAALATFGAGFVLRPFGAVVFGRLGDLVGRKKTFLVTMLVMGLATAAIGLLPTYESIGWAAPALLVALRLLQGLAVGGEFGGAVTYVAEHADHNQRGLTTSWIQITATLGLFLSLLVIVLCRSSMSPEAFASWGWRLPFVGSIVLLVLSLYIRLKLHESPVFQEMKAQGKGSKNPIAETLGDSRNLRLIMVAIFGVVAGQAVIWYTGHFYSLYFMTTTLKMTHDQANFYLLIALTLGTPFFLFFGWLSDRIGRKWIVVTGCALSAFLIMPLFQALGTYGNPTLAEFRAKTSVTLAGNDCDHDQSFVGQLFSPQATTDCQKAKALLTARAVPYTVKTNADPLRVQVNGGEAKPYNEAELLAALTATGLPDPKAPVQPNGAAVIGILFCLVFLATMVYGPLGALLVELFPVRIRYSGVSVALQLGNGWIGGFAPFVATAVVIATGNIFGGLWYTVAFAGLTAVIGALLLPETRGRDLNS >NC_022536|178518:188855|188231_188855_-|WP_022557273.1|DBSCAN-SWA MEKKVNISTDTVADVERVSATVSQNLKTFRRQSGLTLDELSRKSGVSKGMLVEIEKGTANPSIATLCRAATALGVSVADFVGVASSVPVRIVPAQDASTLWHGPKGGSATLLVGTNGPDEIELWRWTLFPGEIFESPGHSPGTLELINVETGELTLKFADSEHLVHAGSSVLARTEDKHAYMNRGNDELRFVMAVAELQRPRVRTGL >NC_022536|178518:188855|184495_185323_-|WP_022557269.1|DBSCAN-SWA MHTETMEIGDTIVRVSGAGQPLVFVHGFTTTAEFWREQVEAFSSRYKVIRINLPGHGVSPRPAVRSYTIDAFANDVLEVFRKLDIRQAFLVGLSMGGTIAQSFTLAHPERVQALVLVGATPHGLGEDVNVDNVLKAIDDLGVVQASQNVIDRSFGSAASQELLNFARQEVAQTPDFVAREAIASLNASDSRSRLGDIKVPTLVVVGNEDIITPPAESVVLAEGIPDSRLEVVENAGHFPMLEQPHKFNEMLAAFLRACEDPVSQASGQGLSQHAI >NC_022536|178518:188855|178518_180333_+|WP_022557266.1|DBSCAN-SWA MGNISMRSKLVLFICGLTCALSITAAVLVKNAMESVSVERIEALKSQVDIAYSIMNHFYEAEKSGTLSHETALSSAADAIEAIRFEPTGYIFGYDYNGVRVLMPDKKDVGKNFIGTKDKNEVPVVKTLIELARNGGGSLQYYWPKPGLPAEEQVVKTAYARDFAPWQIVLGTGTYMDDIDGKNYQIYKNAAIIGLSVLVVSVAIAVALLRSITVPFSEVRKALGAVAAQDISYQIPYTDQPSEIGMMATSIKTLQDRVRRRLELERRDAENKLALEAEREQRLQLEEAERSVQSHVVSTISGALEALSRGDLTVRCGDLGEKYAALRLNFNTAMEKLELALAVVNQKGQVISMSKDEIYRASADLARRSELQAANLEEASAAIDELAVTVRETAQGAHSAAAKVSAISQEATQSEAIVVKAISAMAGIEHSAREISNIISVIDEIAFQTNLLALNAGVEAARAGESGKGFAVVAQEVRELAQRSASAAKEIKDQISKSTLQVTEGVSLVGNAGQALQRISRQIEEADEFVASIARSAKEQDTTLSSVASSINELDVATQKNAAMAQASTEIAAALATDTEGLLSLIANFKTESGNRDTLLYEAA >NC_022536|178518:188855|185577_186936_+|WP_022557271.1|DBSCAN-SWA MTLPEWLAGSSEEAGFLRLVLDHVSDCLVAVDTEGTIVLINDPYCRLLGGTAEEFLGRHITDVVGPQTKLHFVARGIGTHIGYPLEVRGHKLVTKQVPVRKDGRIIGAVGLALFSDYDALKKTYGRISKAELAIPSKPRAWQAKFGLDNVIGTGPLMEAHRDALKQAAAYDLPVLICGETGTGKELAAQAIYSLSVRSAGPFVWVNCASIPSELIEAELFGYEGGAFTGARSQGKPGKFELAAGGVLFLDEIGDMPLALQGSLLRVLQTGEIVRVGGTNPVGIDLRIICATNKPLVELVQTGRFREDLYHRLNVLPIEVPALRERGDLAHLAEHLLAHIATRLSVPAPVLTAEDHDRLVAHTWPGNVRELENALTRMIVTGHISTASLDYRHRSPSVDSKSDLKSRMKSEAHAALRAALLQTGGNKQRAAEVLGISRAQLYRLLKEQSPPSK >NC_022536|178518:188855|180558_182514_-|WP_022557267.1|DBSCAN-SWA MSAKTYPVLKSAKTRALIDDDKYQKWYQESIEDPEKFWGKHGKRIDWFKPYTKVKNTSFKGKVPIKWFEDGLTNVSYNCIDRHLKTHGERTAIIWEGDNPYIDKKITYNELYEHVCRMANVLKKHGVKKGDRVTIYMPMIPEAAYAMLACARIGAVHSVVFGGFSPEALAGRIVDCESTLVITCDEGVRGGKPVPLKENTDIAIDIAAKQYVIVNKVLVVRRTGGKIGWAPGRDIWHHQEVHTVKADCPPVKMKAEDPLFILYTSGSTGKPKGVMHTTGGYLVYASMTHEYVFDYHDGDIYWCTADVGWVTGHSYIVYGPLSNCATTLMFEGIPTFPDQGRFWEVIEKHKVNIFYTAPTAIRSLMGAGEEFVKRSDRSSLRLLGSVGEPINPEAWEWYYNVVGEQKSPVVDTWWQTETGGIMITPLPGATALKPGSATRPFFGVKPELVDNEGQVLEGATDGNLCITDSWPGQARSVYGDHDRFVQTYFSTYKGKYFTGDGCRRDEDGYYWITGRVDDVLNVSGHRLGTAEVESALVSHHLVSEAAVVGYPHNIKGQGIYCYVTLMSGHEGNDALRADLVKHVRTEIGPIATPDKIQFAPGLPKTRSGKIMRRILRKIAEDDFGALGDTSTLADPAVVDDLIANHQNKMPQ >NC_022536|178518:188855|187007_188210_-|WP_022557272.1|DBSCAN-SWA MSSKITRDEPVSGKASPDVNDLGFSTRSIHEPFPSTAIASLRSGPSASVCGRTTLVHPSGADKDRSATWLLEHKLARLEGAEAGLVLSSGLVAFRTLARSFLAAGDEMIVHRPVCTDTAYLMLEIVSAVGAKLVAVDLSEPSAIGRSITERTRLVYFDTPTNPLNSVLDVSAISEQAHAFGLKVAVDSTLATPALQRPLGHGADIVLHSLSKYMNGHGDTVGGALLGTAADIRKLRHESLRERHDTTISPQAAFLILRGLSTLALRMNQHCNTAHVLALTLEAHPSVAWVRYPFLTSHPDHEIARRQMSGGSGMLAFGLEAGPEGATAMIKRLSLIRSGGDLAEVGSFICGSARLTNARHLSLAGSQFCVNLGEDVIRCSVGLEDAEDLVEDLLTALEFA |
7 | uncultured_Caudovirales_phage(20.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_19 |
205084 : 206320
Sequences of DBSCAN-SWA_19
Nucleotide sequences of DBSCAN-SWA_19 >NC_022536|205084:206320|DBSCAN-SWA GTTAGAGGGTCGCGAGCGCCTGCCTCAGATCGCCAATCAGGTCAGTCACATCCTCAAGACCGACAGAAAGACGGATGAGGTCGTCACCAACCCCCGAGACCATATGGGCGTCTTTTCGGATCGATTGCCGCGCCCTCGTAATGCTGGCCGGATGATAGATCAGTGTGTCGGTGTCGCCGAGGCTCACCGCGCGCGTCATCAGTTGCAAACGATCGAGCATATTTCTTGCGCCGTCAAAACCGGCGTGAAGTCCGAAGGCCAGCATGCCCGATCCCTGTGTCATTTGCTTGCGCGCGATAGCCTGATCGGGATGCGATTCCAGGAAGGGATAGCTCACCCAGGACACGGCGGGATGCGCTTCGAGCATTCGAGCGATCGCCAGAGCCGAGGCGCTGTGCCGATCCATTCGAAGCGAGAGTGTTTTCAGACCGCGCATGATCAGGAAAGCGGAGTGTGGCGCCAAGGTGGCGCCGGTAATGTAACGCAGACCCGTTTCGTGCAGGCGGTGGAGCGTTTCGGCGTCGCCGAGTAGCGCCCCACCCAAGGTGTCGCCATGACCGTTGATGTATTTCGTAAGAGAGTGAAGCACGATATCGGCGCCATGCTCGATCGGCCGCTGGAGCGCGGGAGAAGCAAAGGTACTGTCCACTGCGACTTTCACCCCACGCGCGCGTGCGCGCTCTGAGATTGCCGCAATGTCGAGAATGCTGCTGAGCGGGTTCACGGGCGTCTCGAAATAGACGAGCTTGGTCCTTTCCGTGATCGCCGCGTCGAGGTTTGACGGATCAGAGAGATCGACGGGAATGACCTTGATGCCGAAACGCGGCAGCCCCTGCTCCACCATCGCCACAGAATTCGAATAAAGCGTTTTGTGAACGACGAGCTCGTCACCCTGCGACAGCAAGGAGAGGATGAGAGTGCCGAAGGCGGCCATTCCGGTCGATACGACAAGGCCCGCCTCCGCCCTTTCCAGATTTGCAAGCCTCTGCTCAAGGATCTCTGTGGTCGGGTTGTATTCCCGCGCGTAAAGTCGGCCGCCGAGCGCTGCTGCGGCGTCATTGGCCTCGACGCTCTCGAAGCCGTAGGTCGAGGTCAGGAAGACTGGCGGCTGGACTGCCCGTTTGAAGTCCACCGGGCTGAATGCATGATGAATGGCGCGCGTGTCGAAGCCGAAGCTTGCCGCGTGAAGGTGGTCAGATTGTCTCTGACGGTTGTCATGCTCGCTGCCGCCTGTCAT
Protein sequences of DBSCAN-SWA_19 >NC_022536|205084:206320|205084_206320_-|WP_022557291.1|DBSCAN-SWA MTGGSEHDNRQRQSDHLHAASFGFDTRAIHHAFSPVDFKRAVQPPVFLTSTYGFESVEANDAAAALGGRLYAREYNPTTEILEQRLANLERAEAGLVVSTGMAAFGTLILSLLSQGDELVVHKTLYSNSVAMVEQGLPRFGIKVIPVDLSDPSNLDAAITERTKLVYFETPVNPLSSILDIAAISERARARGVKVAVDSTFASPALQRPIEHGADIVLHSLTKYINGHGDTLGGALLGDAETLHRLHETGLRYITGATLAPHSAFLIMRGLKTLSLRMDRHSASALAIARMLEAHPAVSWVSYPFLESHPDQAIARKQMTQGSGMLAFGLHAGFDGARNMLDRLQLMTRAVSLGDTDTLIYHPASITRARQSIRKDAHMVSGVGDDLIRLSVGLEDVTDLIGDLRQALATL |
1 | Pandoravirus(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_20 |
209335 : 211341
Sequences of DBSCAN-SWA_20
Nucleotide sequences of DBSCAN-SWA_20 >NC_022536|209335:211341|DBSCAN-SWA TATGAAAGTAGCTTTGATATCGGGTGTAACCGGCCAGGATGGGGCTTATCTCGCCCAACTACTGTTGGACAAAGGCTATATAGTTCACGGCATAAAACGACGTTCATCATCATTCAATAGTCAAAGAATAGAGCGAATCTACCAGGATCCCCACGATCCTGAAGCAAGATTCTTTCTTCACTATGGAGATATGACAGATTCGGCCAATTTGCTACGTATTGTTCAGCAAACCCAGCCGGATGAAATTTACAACCTTGCGGCGCAAAGCCATGTCCAGGTGAGTTTCGAAACACCCGAATACACCGCGAACGCTGATGCAATTGGTACACTGCGCATGCTGGAAGCTATTCGAATTCTTGGCCTGACCAATCGGACGCGCTTTTATCAGGCATCAACGTCCGAGTTATACGGTCTAGCTCAAGAAGTCCCGCAAAATGAGAAAACACCGTTCTACCCGCGTTCACCTTACGCTGCCGCCAAGCTCTACGCGTACTGGACTGTCGTAAACTATAGAGAGGCCTACGGCATGCACGCTTCCAACGGTATTCTTTTCAATCACGAAAGCCCGCTTCGAGGTGAGACATTCGTGACCCGCAAGATTACCCGGGCTGCGGCAGCAATCAGTCTCGGTAAACAGAACACGCTTTATCTCGGAAATCTCAATGCTCAACGTGATTGGGGCCATGCCCGTGAGTATGTGCGCGGCATGTGGATGATGTGCCAGCAGGACAGGCCAGGTGACTATGTCCTTGCCACCGGCGTTACAACATCAGTTCGAACGTTTGTAGAATGGGCATTCGACGAAGTCGGCAGAAAGATTGAATGGGTCGGAGAAGGCAGCGAAGAACGCGGCATCGATGCCGCCACAGGAAAGTGCATCGTGGCGGTAGATCCGCGTTATTTCCGGCCTACGGAAGTACATTTACTTTTAGGCGATGCCAGCAAGGCGGAGCAAGTCTTGGGTTGGAGGCACGAAACGAGCGTGAGGGAGCTTGCCCGCGAAATGGTTAGGGAAGATCTGAAGGTAATGACGGGTTCGCCACAGGAAGAGGCATGAGGACAATGTATTCGCTAAGCGGAAAGCGAGTTTGGGTCGCAGGACATAGGGGTATGGTGGGCAGCGCTATTGCCCGGTCCCTTGCCTCGGAGGATTGCGAAATTATTGTAGCCGACAGGCAGAAGCTTGATCTGACGCGGCAAGAGGAAGTTCAGCAATTTCTGTCAAACGAAAAACCGCATGCGGTCGTAATGGCCGCAGCCAAGGTTGGCGGTATATTGGCAAACGACACAATGCCTGCCGACTTCATCTATCAGAACCTGGTGATGGAGGCCAACGTGATCGAAGCGTCATTTCGAAATTGCGTTGAAAAGCTTCTTTTTCTTGGATCGAGTTGCATCTATCCCAAATACGCTTCACAACCCATCAGAGAAGAAGCGCTATTAACTGGTCCGCTTGAATCGACCAACGAGTGGTACGCAATTGCAAAGATTGCCGGCGTTAAGTTGTGTCAGGCCTACCGCAAGCAATACCGCGCGGACTTTATTTCAGTCATGCCGACGAACCTCTATGGCCCTCGCGATAACTTCGATCTTATGAGCTGTCATGTCGTGCCCGCGTTGATACGCAAAGCACATGACGCAAAGATTAGAAATCTCGATCGACTGCGTATATGGGGAAGCGGAACGCCTCGTCGGGAGTTCTTGTACAGTGAAGACTGCGCCGATGCAGTGGTTTTTGTTCTTAAGCATTACTCCGAAGCTGAACACATTAACATTGGCTTCGGCAGCGACATAAGCATCATCGAGCTTGCTCGCCTCGTCTGCGATGTTGTTGGCTTTAAAGGGGACATAGAACTCGATACATCCAAGCCGGATGGAACACCACAAAAGCTGTTGTCCAGTGAAAAACTACTCTCGATGGGGTGGCGCCCGAAAACCTCCATCGAAGTTGGGCTGGCTAAATCCTACGAGTGGTTTATCAGCAATGTGGTCGATAACCCACGGTGA
Protein sequences of DBSCAN-SWA_20 >NC_022536|209335:211341|210396_211341_+|WP_048903072.1|DBSCAN-SWA MYSLSGKRVWVAGHRGMVGSAIARSLASEDCEIIVADRQKLDLTRQEEVQQFLSNEKPHAVVMAAAKVGGILANDTMPADFIYQNLVMEANVIEASFRNCVEKLLFLGSSCIYPKYASQPIREEALLTGPLESTNEWYAIAKIAGVKLCQAYRKQYRADFISVMPTNLYGPRDNFDLMSCHVVPALIRKAHDAKIRNLDRLRIWGSGTPRREFLYSEDCADAVVFVLKHYSEAEHINIGFGSDISIIELARLVCDVVGFKGDIELDTSKPDGTPQKLLSSEKLLSMGWRPKTSIEVGLAKSYEWFISNVVDNPR >NC_022536|209335:211341|209335_210391_+|WP_048903022.1|DBSCAN-SWA MKVALISGVTGQDGAYLAQLLLDKGYIVHGIKRRSSSFNSQRIERIYQDPHDPEARFFLHYGDMTDSANLLRIVQQTQPDEIYNLAAQSHVQVSFETPEYTANADAIGTLRMLEAIRILGLTNRTRFYQASTSELYGLAQEVPQNEKTPFYPRSPYAAAKLYAYWTVVNYREAYGMHASNGILFNHESPLRGETFVTRKITRAAAAISLGKQNTLYLGNLNAQRDWGHAREYVRGMWMMCQQDRPGDYVLATGVTTSVRTFVEWAFDEVGRKIEWVGEGSEERGIDAATGKCIVAVDPRYFRPTEVHLLLGDASKAEQVLGWRHETSVRELAREMVREDLKVMTGSPQEEA |
2 | Acanthocystis_turfacea_Chlorella_virus(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_21 |
217874 : 218953
Sequences of DBSCAN-SWA_21
Nucleotide sequences of DBSCAN-SWA_21 >NC_022536|217874:218953|DBSCAN-SWA GTTAGGCTAATCCCACCTTGTTTTCAAATTCCATTTGGCTGAGATAGCCCAGTGTCGAATGGCGACGCCTCGGATTGTGGAGTTGCCCCTTAAAAACCGGACAGGAGCTTTCCATCGCGGCATTGTCCCAGACGTTGCCGGATCTGCTCATCGAGCAGGTGATGCCGTGATCGGCCATGAGACGCTGGAACTGCTCGCTCGTGTATTGGCTACCCTGGTCGGAATGATGCAGTAGCGCATCTGGTTTTCCTCTGCGCCAGATGGCCATGATCAATGCATCTGTAACGAGCTGGGCCGTCATGGTGGCGTTCATCGACCAGCCAACAACACGTCGCGAGAACAGGTCGATGACGGCGGCCACATAGAGCCAGCCCTCAGCCGTCCATAGATAGGTAAAATCGGCCACCCACTTCTGGTTCGGTCTTGCTGCCGTGAACTGCCGATCTAGCACATTCGGCATGATGACCGGGCGCTCACCGTTGTCTTTAGGCAAGCCACGCCGTCTCGGCCTTGCTCTCAATGCACTTTCCCGCATGAGACGCTCGACACGATGCAGCCCACAGGAAAGGCCTTCGGCGAGGAGGTCGTGCCAGACACGCCGCGCACCATAGGTGCGGTCGCTATCCTTGAAGCTGTCCTTGATCTTGTCGAGAAGAGCCTCGTCATACCGAGCATGTTGGCTCGGAGACCGGTGCAACCAGGCATAAAAGCCGGAACGCGATACACCCAGCGCTTCGCAGACCCATGCCACCGGCCAGATGGAGCGGTGCTTTGCAATGAACGCGAACTTCATATCACATCCTTCGCAAAGTAGGCGGCAGCCCTTTTTAAGATGTCACGCTCCGCCTTCAGCTTGGCCACTTCTTTCCGAAGCCTCTCGATCTCAAGCTGCTCGGGCTTCATCTGTCCCTGACCGGGAAACGCCTGTGCCGGGTCCGAACCATATTCCCTGCACCACTTGCGCAAGACATTCTCATGAACATCAAGATCGCGGGCTGCCTGAGCAACAGTGACCCCGCGCTCTCGCACCAACTTCACCGCCTCAAGCTTGTACTCGCGGCTGAACTTCCTTCGTTGCAT
Protein sequences of DBSCAN-SWA_21 >NC_022536|217874:218953|217874_218953_-|WP_144115367.1|transposase|DBSCAN-SWA MQRRKFSREYKLEAVKLVRERGVTVAQAARDLDVHENVLRKWCREYGSDPAQAFPGQGQMKPEQLEIERLRKEVAKLKAERDNLKKGCRLLCEGCDMKFAFIAKHRSIWPVAWVCEALGVSRSGFYAWLHRSPSQHARYDEALLDKIKDSFKDSDRTYGARRVWHDLLAEGLSCGLHRVERLMRESALRARPRRRGLPKDNGERPVIMPNVLDRQFTAARPNQKWVADFTYLWTAEGWLYVAAVIDLFSRRVVGWSMNATMTAQLVTDALIMAIWRRGKPDALLHHSDQGSQYTSEQFQRLMADHGITCSMSRSGNVWDNAAMESSCPVFKGQLHNPRRRHSTLGYLSQMEFENKVGLA |
1 | Shigella_phage(100.0%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_22 |
228921 : 230583
Sequences of DBSCAN-SWA_22
Nucleotide sequences of DBSCAN-SWA_22 >NC_022536|228921:230583|DBSCAN-SWA GTTAAGAGCTCCACAGCCAATATGGGAATTCTCGGCTCCCCGCGTCTCCTGACGAGCTCCAGCCTCCGTCCACGTGCAGGATCGAGCCGTTGATGTAGGAGGCATCAGACGATGCCAGAAAATAGACGGCTTCGGCGACTTCCTCCGGCTCCCCAAACCTGCCCAAAGGGATCTCTTGTCGGAGCGATCTCGGGCCTGTGCCGGTGGCCGTTTCGCACTGGATGTCGCCAGACGCGCGGATGTGACCTGGAGCGACTGTGGCTATGCGGATCCCAAGCGGCCCGAATTCGGCCGCCATGCACCGGGTAAGCATATCTAGACCGGCGTTGTAAGCGGCGTGAGCATGCTTCATCGAGAGAGGTAAGAGCCTGCAACTCGATGTGAGATTCAGGATTACACTACCAGGTCGCATCGATATCGCGGCCTCGCGCACGCAGGTGAAGGCGCCAGTAAGATTTATGCCCAGAGTGTTTTTGAGCCCTGCCGTGGCCTGCTCAAGATCGGGGGCAAATGTGCCATTCACGCCAAGGCCGTTGACGAAGACGTCGATGCGGCCGAAGCGCTTCCGCAATTCTTCGAAGAGCGCGACGATTTCGCTCTCGACTGTCCTATCCACGCGTTTTGACAGGTGCTTGTCGCCGAGCAAGTCAGCGAGCTTTGCCGCTGCACCACCGTCTCCGTCAGCGATCACAACCGTATCGCCGCTCGCGGCAAAGCGGCGAGCGATAGCCGCGCCTATGCCTTTAGCACCGTTCATGACGACCACCGTTCGCGCATTGGTGTCTTCGACCGGGCGTAAGAGTTCTGCTCCGGGCGTCCTACCCTGAGTCGGGTGGGCTTCACCTGGCTGGTTGAATGACATCCAGCCACCATCGACTACAAGCGTCGATCCGGTGATGTAACCTGCCTGCTCACTGGCCAGGAAGTGCACGGCTCGCGCAATCTCGTCGGGACGAGCCAGTCGGCCTATCGGCACGCGGCGCCTTATTGCCCTGACGTCCAGCTTGCCAGCTGTTTCTAATTCCGCGACCATTGGCGTACGCACGTGACCGGGGGCTACCGCTGTCACGCGGATGCCGCGCGAAGCCCATGCGCACGCCAGCGTTTTCGTGATTGAGATCAGCGCAGCCTTCGAAGCGGCGTAGTCATTGCGCTTCGGGTTGGCGAGGAGGCCAGCAAGCGAAGCGACGTTGACGATGGCTGCGCCAGGTTTCATCAGCTTGGCGGTTTCGCACGCCACCGAGTACGGCCCGATGAGGTTTACTGCTAGAGCGCGTTGAAAATCTTGAATGGGAGTATCGACGGTCGCAGCCATCGTCGGCCCTATCCCCGCATTATTGACGAGCACTCCAATCTGCGAGAATCGTTTTTCCAGAAGGGTACAGAATGCGAGGACATCGTCCTGTCGCGACACGTCGAACTCAAGGCCGAGATGAGGCTGGCCGAGATGGCGGCCCAGTTCGATCACACCGCTGTCTGGAACGTCCACCGCGACTACAATGTCTCCACTCGTGGCAAAAATATCGACCAGCGCACGGCCGATACCCCCTGCCGCACCTGTCACGATGATGACCCGCCCTGGCTGAACCGGCCAATCGTCCAGGCGTGTCTTCACCGGGCAGGCGGTGGCGGGGGGAAAAGGAGGGTAGCGAAGTCGCATCAT
Protein sequences of DBSCAN-SWA_22 >NC_022536|228921:230583|228921_230583_-|WP_022557314.1|DBSCAN-SWA MMRLRYPPFPPATACPVKTRLDDWPVQPGRVIIVTGAAGGIGRALVDIFATSGDIVVAVDVPDSGVIELGRHLGQPHLGLEFDVSRQDDVLAFCTLLEKRFSQIGVLVNNAGIGPTMAATVDTPIQDFQRALAVNLIGPYSVACETAKLMKPGAAIVNVASLAGLLANPKRNDYAASKAALISITKTLACAWASRGIRVTAVAPGHVRTPMVAELETAGKLDVRAIRRRVPIGRLARPDEIARAVHFLASEQAGYITGSTLVVDGGWMSFNQPGEAHPTQGRTPGAELLRPVEDTNARTVVVMNGAKGIGAAIARRFAASGDTVVIADGDGGAAAKLADLLGDKHLSKRVDRTVESEIVALFEELRKRFGRIDVFVNGLGVNGTFAPDLEQATAGLKNTLGINLTGAFTCVREAAISMRPGSVILNLTSSCRLLPLSMKHAHAAYNAGLDMLTRCMAAEFGPLGIRIATVAPGHIRASGDIQCETATGTGPRSLRQEIPLGRFGEPEEVAEAVYFLASSDASYINGSILHVDGGWSSSGDAGSREFPYWLWSS |
1 | Trichoplusia_ni_ascovirus(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_23 |
235801 : 239095
Sequences of DBSCAN-SWA_23
Nucleotide sequences of DBSCAN-SWA_23 >NC_022536|235801:239095|DBSCAN-SWA GATGCCAATGAAAAAAAGAATTCCCTTCGTTGTCTTTCATACGCGCGTTCGCGACGAGACCGTGAAGGGGCCAAATCCCTACCGGTGGGAAGTCAGAACAACCGAGGATTACTTCAGCGGCAAGCGCGTCGTTCTGTTTTCGCTTCCCGGTGCGTTCACCCCGACTTGCTCAACTCGGCAATTGCCCGATTTCGAGAAGCTATATGACGAGTTCCGGGATGCGGGGATCGAGTCGGTCTACTGCCTATCAGTCAATGATGCCTTTGCCATGAATGCGTGGGGTAAGGCCTTGGGCGTAGAAAAGGTCAATCTCATTCCGGATGGTTCCGGCGAATTCACTCGCAAAATGGGCATGCTGGTCGCAAAGGACAATCTTGGCTTCGGGATGCGCTCTTGGCGCTACGCTGCCGTGGTCAATGACGGCGTGGTGGAGAAATGGTTCGAAGAGGAAGGTCTCTCTGACAACTGCGAGTCCGATCCCTATGAGGTCTCTTCTCCGCAGAACATTCTTGAAACCTTGAGGACCGCTCCCACCTCTTGAGCGGTCCCCAACAGGATTTTGATGCTTCGGTGTGACCTTTCCAGCCTGATGTCGTGGAGCCCGAGCGCTGACGTTGGGGCCCGCAGCTGCCGTCGGCACTAAACCATCGTACTCTCTCATGATCCCCGCCGCTGAGCAGTACCAAAGACGGAGTTCAACACAGTCAGCGCGTTTTCGTCCGACGCGCTGCCTGCTATAGCCTCTAAGGTGGATCGCCCGTCTGCGTTGCGGCGCCCATCTGATGGCGGGTGGCACAGCTGATGTTATGGTTTTCGAAAGTCGCGGTTCGAAATACAGCATCTTGCCAGGCTCGATGCTTCTCGGGTGTCGCCGGATTGTAGTCTAGAGCCGTCGGGTGCTTGTGGCGCTGAATGGCGATCTGGCCTCATCCAACAGATTGTCTTGCCCCATCGTTGCAGAGGCAAGAAGACACGATCCGAACGCGGCCGCTCGAAAGGCCCACCGCCACGCCGAGCGGAGCCGGTTGACGAGACCTTTTTTTCACAGAAGATATAAGAGACAAGGAAATTGCACCCGCTACTTTGATCGACTGGTTCGGCGAACCCTGAGCAATAAAAATGCAACGGGAGCAACATTCTCCCGTTGAAGTGTCCGGGTAATCCGTCCGCGAGCATCTCGCATGCTTTATCCATCGTTTGACGAAAGCGTTTCGCATGAACAGGAAGAGGCTGGCAGCACCACAGCGTGCCAGCGTCCGCTTTATGCCGTTGGGCTCACCGCCATCATGTGAATGACAACGATTTCGCTTGCTAAAAGAAAACCGCTAGCAACTGCCTACGGTGTGCGATTGCGTGCTTTGACTCTTTGACTTGCTGCCGGACAGAACACACCGGCGTGACCCGCCGGATCCGACCTGGGCGGTTGGGCTGACCGCATCATTTGTCGGCCCAAAGGTCGACTGTCGCTCGATATAGTCAGAGTCCTCAACCACACTCTCATGAACTGGCCCGGGAGATAAGTTCGAGAAACGCCGGTGTCGCTCCACTGAGGCAGGAGCAAGCCCGAGCCGACTGCTACCGTGTCGCCGATGAAGAGAGATTAACCTCCTTTGCCCAGCCTCGGCAAAACCTCACACGAACTCGGAGTGATCAACTCGGCAATGTCGGAACTTCGACGGTGTTATACATGCGACACCTCTGTGTTTGGATGGGCTGTTGTTTTCGAACGGTTTTTCTGTTGGCACGGCAGGTGCATCGCTTTTCGCAAGTGCAACTCGGAACCGGGGAAAAGGTCCATGATAACTCTCACTGACAGTGCCATTGCCGCAATAAAATTTGCGCTGTCAAAGGCTCCCGAGCCGGCGACTGGATTACGCATCAAAGTGCAAGCGGGCGGCTGCTCTGGTTTCAAATACCACCTGGGTTTGGAAAGCGAGTCATGCAACGGCGATGCTGTCATTGAGGCGGGAGGGGTTAAAGTATTCGTAGACTCAGACTCTCAGCCTCACGTCGGCGGCATGACAGTCGATTTCACAACGGATGTGAATTCACCCGGATTCATCTTCGACAATCCCAACGCGAGCGAGAACTGCGCCTGCGGGAAGTCCTTTAGCTGACCGGGCAGAACGAGAATACGAGATCATCATGAAAAAGGATCAAGAAAGACCACATCAGTGCTGGTGTCCCAGCGTGCTCCCTGCGTCATTTCAGTAGGCGGAGGCCGTCCTAGATGTCTATCTACCTCGATAACAACGCAACGACACGCGTCGATTCCGAAGTTCTCCAGGCGATGCTGCCGTTCTTTGCGGACCAGTTTGGCAACCCCTCGTTGATGCACGACGTTGGCGCCGCTGCCGGCGCAGCGATGAAGAGGGCGCGCCAGCAATTACGAGCGCTGATCGGCGCCAAGTTCGATCACGAGATCATTTTTACGACGGGCGGGACCGAAAGCAATAATACGGCGATTCTTTCGGCGCTGGACGTGATGTCTGAGCGAACAGAGATACTCACTTCAGCGGTTGAACACCCGGCGGTGCTGGCGCTTTGCGGCCACCTTGAAGACACAAGAGGAATCAAGGTGCACCGAATCCCGGTGGATCAACATGGACGACTCGACGTCGACGCCTATCGGGCGGCCCTCACACCTCGGGTGGCAATCCTCTCAATTAAGTGTGCGAACAACGAGACGGGAACGATTTTCCCCGTGGCCAGGCTTTCTGAGATGGCCAAGAAAGTCGGCGTACTCTACATACTCTACATAAAGTGCGGCGTGCCCTCCCACGCACTCTTGAAAGGCGGGCACCAGCAGCCGAACCGTCGCGCGGGCACAGAGAACACGCCCGGGATTGTGGGCTTGGGCAAAGAAGCCGAACTTGCATTGGAGCGCATGGACGAGGAAAACACATGGGTCAAGTCGCTCCGCGACCGTTTGGAGAAAGGACTTCTCGAACGCGTTCCCAAGACCTTCGTCACCGGCGATCCGCTCGCGCGGTTACCTAACACGATAAATGTGGCATTCGAGGGCATCGGCAGAGAAGCCATGCAATTTCTGCTCAATCGCCACGGCATCGCCTGCGCGTTTGGCTCCGCCTGCAGCTCTCGCTCCTCAAGGCAGAGCCATGTTCTCGAAGCGATTAACGTTCCCCACGCTGCAGGACTCGGCGCAGTCCGCCTGTCCTTTTCGTGCTTTAACGGCGAAGAGGATGTCGATCAGGTACTCCAGGTGATACTTGGAATTGTGAAGAAACTTCGCGGTTCGTTTCCATCTGTACGGCAGGTGGAATGCGGAGGATAG
Protein sequences of DBSCAN-SWA_23 >NC_022536|235801:239095|238030_239095_+|WP_022557323.1|DBSCAN-SWA MSIYLDNNATTRVDSEVLQAMLPFFADQFGNPSLMHDVGAAAGAAMKRARQQLRALIGAKFDHEIIFTTGGTESNNTAILSALDVMSERTEILTSAVEHPAVLALCGHLEDTRGIKVHRIPVDQHGRLDVDAYRAALTPRVAILSIKCANNETGTIFPVARLSEMAKKVGVLYILYIKCGVPSHALLKGGHQQPNRRAGTENTPGIVGLGKEAELALERMDEENTWVKSLRDRLEKGLLERVPKTFVTGDPLARLPNTINVAFEGIGREAMQFLLNRHGIACAFGSACSSRSSRQSHVLEAINVPHAAGLGAVRLSFSCFNGEEDVDQVLQVILGIVKKLRGSFPSVRQVECGG >NC_022536|235801:239095|237596_237917_+|WP_022557322.1|DBSCAN-SWA MITLTDSAIAAIKFALSKAPEPATGLRIKVQAGGCSGFKYHLGLESESCNGDAVIEAGGVKVFVDSDSQPHVGGMTVDFTTDVNSPGFIFDNPNASENCACGKSFS >NC_022536|235801:239095|235801_236341_+|WP_022557320.1|DBSCAN-SWA MPMKKRIPFVVFHTRVRDETVKGPNPYRWEVRTTEDYFSGKRVVLFSLPGAFTPTCSTRQLPDFEKLYDEFRDAGIESVYCLSVNDAFAMNAWGKALGVEKVNLIPDGSGEFTRKMGMLVAKDNLGFGMRSWRYAAVVNDGVVEKWFEEEGLSDNCESDPYEVSSPQNILETLRTAPTS |
3 | Synechococcus_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_24 |
249489 : 249951
Sequences of DBSCAN-SWA_24
Nucleotide sequences of DBSCAN-SWA_24 >NC_022536|249489:249951|DBSCAN-SWA GCTATGTAGGTTGAGATGGTAGTGGAAATGGGTGGGCCCGTGCGGACGGTGGGTGACCCGCTCGCCCTTGTCCTACCCACGACGCGCGGGCAACCTTGGAGCGCTGACGGGCGTAGCTCGCGGCGACTATCGGGTAATCGGCCGGGAGTCTCCATTTCCGGCGATGTTGTTCGGACGTTAGACCGTATTTCGCCATCAAGTGCCGCCGCCGCCGCTTGAACTTTTTGCCGTCTTCCAAGCAGATGATGAAGTCCTCGGTAGCCAATTTGCTGATCGGCACTGCTGGGCGCCGTTCGTCCTTGACTCGGAATATTCGAGAGGTAGCCCTCAGGCCCGCATGGGCTTGCTGGATCACATGCGGCAAATCGTTGATCGAGAAGATATTGCGGATCAGATAAGCCGCGACGATATTGCCGGTAAGCTTGAGTTGTCGCTCATCGGCGTTCGAACGCGGTCGGGTCAA
Protein sequences of DBSCAN-SWA_24 >NC_022536|249489:249951|249489_249951_-|WP_022557339.1|DBSCAN-SWA MTRPRSNADERQLKLTGNIVAAYLIRNIFSINDLPHVIQQAHAGLRATSRIFRVKDERRPAVPISKLATEDFIICLEDGKKFKRRRRHLMAKYGLTSEQHRRKWRLPADYPIVAASYARQRSKVARASWVGQGRAGHPPSARAHPFPLPSQPT |
1 | Erythrobacter_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_25 |
258680 : 266052
Sequences of DBSCAN-SWA_25
Nucleotide sequences of DBSCAN-SWA_25 >NC_022536|258680:266052|DBSCAN-SWA TATGACAAAGCATCTAATTGAGGTGATTACGTCCGCGGAGCGCCGTCGGCGCTGGTCACGCGAAGATAAAGAGCGCCTCGTCGCTGCCTGCTTTGAGCCAGACGCGGTAATCTCCGAGATTGCCCGCGCGGCCGGCATCCATGTCAGCCAGTTGTTCCGTTGGCGCAAAGAGCTTTGCCGGATCGAAGAGGCAAGGTCTGACACGGCGACTTTGGTGCCGGTGATCGTGTCCGAGGCTGCTTCGACAGCCTCTCCCGTTCAACCGGATTCGCCCACCACATCGCATCCTCGTCGGAAGCGTAGCGATGTGACGATCGAACTTGGGCGGGGTCGGCGCGTCCGCGTGGATAGCGACATCAAGGGTGATCGCGATCCAAATCCGCTAAGACAGGCGTTTTGTTGGGCACACGCGCGTCGCAAGTTCTTCGTGCTCGCAGACATTAATGCGAACGCCAAGCGTGGAAAGAACGCCGCGCCGATCTCGCCTATGGCGCTTGAGGCCGTCAAACGGATCGACGGCCTGTTCGATATCGAGCGCGAGATCAACGGGCTTACGGCCGATCAACGCCTGGAACGTCGCCGCAAGGAAAGCCTGCCGCTCGTCGACAGTCTGCAGGCCTGGCTTCAAACCGAGCGTGCAAAACTGTCGCGCAGTTCTCCGGTCGCCGAGGCGATCGATTACATGCTCAAGCGTTGGGATGGCTTCACGTCATTCCTGGAGGATGGCCGGCTTTGCCTCACGAACAACGCCGCGGAGCGAGCCCTGCGAGGCTTCGCACTCGGTAGGAAGTCCTGGCTCTTCGCCGGATCAGACCGCGGCGCGGATCGTGCAGCCTTCATGGCGACGCTGATCATGAGTGCCAAGCTCAACGATGTCGATCCGCAGGCATGGCTTGCCGATGTCCTCACCAACATTGCGGACACGCCGATCAGCAGGCTGGAGAAATTGCTGCCGTGGAATTGGACGCCACCGACCGTTAACGCTCAAGCTGCCTGACCTGTGGCCTTCACCGGAGGCTTACCTTTCCATCGATGGAAGCTCTGAACGATTGGCTTGAGCTTCGGTGCAAAGAATTCTGGGCAAAGACGCCACACGGTCAGATAGGCGGCACTATTGCTGACATCTGGGCGGAGGAAGCTCCGGCTCTCATGCGGGTGTCCAGGCCGTTCGACGCGTTCGTCGAATATACGAAGCGTGTCACGCCCACCTGCCTCGTGCATCTTGGGCGCAATCGCTACAGCGTCCCGGCATCCTTCGTCAATCGTCCGGTGAGCCTTCGGGTCTATCCGGATCGTGTCGTGGTCGCCGCAGAAGGCCAGATCGTCTGTGAGCACCGCCGTGTCATCGATCGTTCCCATGATCGGCCGGGCCAGACCATTTACGACTGGCGGCACTATCTGGCGGTCGTCCAGCGCAAGCCCGGCGCTCCAGCGCGCTCATAACGGTCCGACAAACCGCCCACCCAACCTGCCCATCAACGGACTCGGCGTTACCGAGGGCAACCTATTGAGGCTCCGCAGTTTGGCGTTCGTCGAAGCTGTCCTGCGCAGCAAGAGCCAGGTTGCCCTTGAAATCACCCTTGTGCGTCCTGAAGCGACTGAGTCTCGCCAACGCCTCGATCAAGCTCGACTGCGTCAGTTTGCCGAGCTCATGCCGGAGCATATCCGCACCGGCATGAGGGAGAATCTGTCGTAATTCCCATAGTCAGCCTTCTCTTTGTTGTTGCTGACCCAGGGGTATCTTCAGGGCCGGATACCGGCCGGTTTTTGATTGGCCGGCCTCAAATCGGAACGGTGGCGTGAATAGGCGCGTCGGTCTTGATAGCCGCTCCGCTGATGATTACAATCATGTGACGATGGAGAGCCGCTAAATCTCGGAACGCCGCCCGTGGTAGTACCGCCCCCCCACATAGACGATTTTGCCTCCCAACTTGGTGTTTGGCGCTATGTTAAGACATTAGGCGTCGCCGAAGTAAGGCAGTCGATACGAAGAAGGGTAATACGCCAAAAACGAAAAGCATTCCAAGATGGAAAATCACGTCGGCGGCTGGCCGGCCGAACATTGCCGGACGAATGAGGTCGATCGAATGTGCCAACGGCAAGAATGTCGCCATCTGCTGAACTTTACCGGGCAATTGGGTTACCGGGTAGACAGCACCCGACAAGAACAGCATCGGTGTGAGGATGAGGGTCTGATAGAAGATGAAATAGTCGTAGCTGGGCGCGAGCGCCGCTACGATCATAGCCAGACTCGCAAACGCGATTCCGGTGAGGGCGATCACCGGCACTACATAGGGAATGGATGACCAAACCGAATAACCCAGAATGGCTGCGACGACCGCAATCGCAGTTCCGGCCAAAAGCGCTTTGGTAGCTGCCCAGATGACTTCTCCCAGCACTATGTCGCCAAGGGTAATCTGAGTGTACAAGATCGCCTCCCAACTGCGTTGATCGCGCATGCGCGCGAAGGCCGCATAGATGGTTTCGAACGTAGACGCCGTCATCGCGCTTGCCGCGACCATGCCCGTTGCCAGGAAAGCAATGTATGACATGCCTTCGACTCGTCCGACCATCAAACCAAGACCGGTCCCCAGGCCGAAAAGATAGATCATGGGATCTGCGAGGTTACCGAGAATTGACACAAGCGCGGCTTTTTTCCACGCAAGGTAATTGCGGCGCCACACCGCAGTCCAGTTCCACCCATTGGCCGGCAGAACTGCCGCATAATTTTCCCACATCTTTCAATCCTTCATCTCTCGCCCAGTCAGTCGTAAGAAAACATCCTCTAGATTGGGGGGGCGCTGTAAAAGACGCAGATTCGTATGCTCCCGCAATTTGATTCGCACCTGCTCGGGAGTGGAGGAGTAACAAAAGAGGGTTTCTCCGCTTACCTCAATGCGCTGCGCGCATGTTTTGATAAGCGCGCGCAGTTCATGTGGATTGCCGCCGTTGATCTCAATGACTTGGCAGCCAATCTGCTCGTCGATCAGTCCTTGGGGGCGACCTTCGGCGATCTTCTGTCCTTCTTCAAGAACGCATAGCCGGTCGCATAACCGCTCTGCCTCCTCCATGAAATGAGTGGTAAGCAGGATCGTCTTGCCGCGTGCAAGCAGCGAGCGCAGCCGCTCCCAAATCAGGTGACGCGCGTGCGGGTCGAGACCGGTGGTGGGCTCATCCAATACGAGCAGTTGTGGGTCATTGATGAGCGCGCGTGCCAGCATCAGGCGCCGTTTCATACCACCCGAAAGGCCTACGACCCGGGCATCCGCCTTACTCTCAAGTCTGGCAAACTCCAAAAGTGATGGGATGGCGGCTTCGATCTGGCGGGTGCTCATATTGAAGTAGCGCCCAAAGACCAACAGGTTCTCGCGCACTGTGAATTCCTGATCGAGGTCGTCGAATTGCGGAACCACCCCGATGCTTGCCCGGGCCAGGCGAGCCTGCGCAGGCACCGGGGCGCCAAGCACCGTGATCTTACCTTCGTCGGGCGGCGTCATGCCAAGGACCATACGCGCAATCGTGCTTTTGCCGGCACCGTTGGGCCCGAGAAGCCCGAAGCACTCTCCTTTTTTGACGGCGAACGAAAACCGATCGACCACAGTCCGGTCGATATAGGTTTTTGTGACTTCAATGAAACTGATTGCTATCGTGGTCATGCGCTTGCCGAGAAATCCGCGCAGCCCGCCGGGGATCCATCAACTTCGCTTGCCTTTGTAAGCAACAAACCATCGCACCATACGTGGCCTATTTGTCCCCACTCACAAGCCGCTGCAGCGTCCGGAAAAAAACCCCGGCCGTTATAGTTGGCACTCGTGTTGCAGAGCAGCGGAATGCCGGTGAGACTTTCATATTCGACCAAAAGTTTCGCGACTTCGTGCTCGGAGTCTCGCGAGATCGTTTGAAGTCGCGCAGTTCCATCAAGGTGCACTACGGCAGGGATCCGTTCTCGCCATTCCGGGCGTGTCGTATGATCGAAGAGCATGTACGGATCCGGAGTGCCGGGGTCGAAGATGTCCCGCGCGCGGTCTTCCAAGCAAATTGGGGCTACTGGTCGGAAATGCTCGCGAAATTTCACTTTGTTGAGATGATCTTTCATTTGCGGGGAAGTCCCGGCGGCCAAGATGCTTCTGCCCCCCAACGCCCTTGGCCCAAGTTCTGCACGGCCGGCAAGAAAAACAACCGGTTCGTTAGATGCCAATATCGTGGCGAGCTCGGCCAAGGTGCATGGAGCAGCTTTCCACCCCTGCGGTATCGCGCTACTCTTCAGTTTGGGGCCGCTGTAAACTGACCAGTCAAGCGGAGCGAGACCACGGTCGACAGCAAGCGCGCAGCAGGCGGCGCCAATAGCCGATCCGCTGTCGTTGGGAAATGGGGGCACCCAGACTGATTCAAACAGACCACTGGCTCTGAGCGCGCTGTTCCATTTGATGTTAAGGCCGCATCCGCCGACCACGCACAAGTTTCTGGATTCGAATTGCGACTGCATCTGTAGAGCATTCGTCATCTCTCCAACGAGGAGGCGTTCAAGGAAAACATGAAATGAAGCAAGTACATCCTGGGGTTTTTCACCCTGTAACCGGGAGGCACTTGCGTCGAAGTAATCATTGACGCATGCAAGCAATGCGTCTGCGTTATGAATGTTTTCCCTATAATTGACCGCAATCAGCGTTTCTCCGGCGAAGTGTTCCTCGTACAGCTCCCGAAACACGTTTAGGATGTTCTCCTGAGCCGCGCCAAGTGCAATGTAGGCCATTAGCTTTCCGGCGACACCCAGATCCCAGCTTTGTGGATCCGCGTTCCTGAACGGCCCGAAGTGATGGCCGGCGACAGCATAGGCATGACCTATCATCGGGAACAAGCACTTGACCAATCGGACTCCTTTCGGCTCTGCATAGTAAAGCCGTGGAAATATGCAGCCATCCCATACCAAGCAGAACGAAGGATCTCCGGCTTTGGCAAACGGACTGGTGCAGTACGCGGAGACTACGTGGCTTATTACGTGCGGAAAACTTTTGTAGGGGAGGATCGAGCCGCTGAGCGCGAGACCCGAGCTGTCATGGGGCTTTAGAGGGCTCTCTGGCCACCGTTCGACGTAGGGAGCTCCACTGAGAGTTATGGGGGCGCCTTCGCTGAAGACCTGGAACTCTGATTCAAGTTCTCCGTCCCACCCGTCGATGACGAATTGGTCGACGTCCCCCACGCTCAAACCGTGGTCGTTCAGGGCTTCGACAATTGCGTCGAGGTTGTCGATAGCTTGATAGCGAGAATTGTTGTTGCGCTTCTCCTGCTCAATACAGAAAATAAGTTTGCCGTTTTCGATGAGGGCAACTGCTCCGTCGTGCGTGAGTTTAATTCCGCAAATGCGCATAAAATCTCCATATCTAACGCGGTAGCCGGATCATTGAGAAACGGGGTGGCGATATCGGGTAAGCAAACAGTCTTCGTTGGTGGTTTGGCCGACGCACCGTACACGATCAATTTGGACCAATTCCTCATCCAAGATCGTAATGACGGTTTCGGCACCAGCCACGTGACCCCAACGTTGGCAGCTCGCATCACCTGCGGATCCGAAGATTAGATGACCGGATGGCGCGAGCATTTGCGCCAAGTTGCGGACAACCGTCCGCATTTCAGCAACGCTCTCAAGGTAATAGAGCACCTCAGCGACCACGATCAGATCAAAGAACTGTTGTGTCGCGAACTGGAGGATATCGCAGCTTATCCAGGAGATATTCGGCGGATCCATTAGCCGTCGACGCGTTCGAGCGAGCGCATGGGGCACCACATCAATTACCGTGAGTTGCTCGCAATGCGGCGCAAGTTTTTCGGTGAACGCCCCGGCTGCGCATCCGACTTCGAGCGCGTGTGTGATGGATTGCTGCGCAAGCGCCAGCCGCAGCGTTTGCCTATGTCGCTCCTGCTCGAACGGGTTGCTGTCAAGCCGCCATGGATCGTCTTCAGCCAACTCATCATGCAGCAACTGGTAATGCTTTTTCTGGCTCAACTTCCTGCACCTCTTGTGGCTAATCCTGCAAACTATGTTAGATTTGCCCGCCCTTGGCCTCGCTGTCAGCTGCAACATCCAAAGCCGACCAAAATCTCGGATGCTCGCGAAAAGGTTAATCCATCGGCAGAGATTTAAGCGATGATGTTTCCGCACCGCGCGGGGGGCTTTGCAAGATAGAAGTTCGAGAGAGCCAAGCACTATTGCTTAACGTACACAGCGCATATGCTTTAAGGGGAAGAATAAGGAAAAGATTGATCGGCGTGTGGGCAGCGAAAGCTAAAAACCGAACCTGGCGAGCGCGAAGCGCAGCCACACTGCTGCGCACTATCGTCATGAATGCTATCAATATTATCGTCGACCACGGCACCGTGGCTGCCACCGCAATCTGTGTAAGCCCCATTAATACCGATACGGCAAGAAGCAGCGGTCCCAGATTCTGCCCGATCACGTCCAATGTTAGATAACGGTCAAGGCTGGGCAACAAGCGCAGCGCCAGCCACGTGTCTCGGAAGGTACTGCGCGCCCAGCGCAATTGTTGTCGCAAATATGGCGCGAGCCGAGCTGGCACGACTGTAGCTGCAATGGCATCCGCAACGTACTCGGTCCGAAAACCAGCTTTAAGCATCAGGATTGTAAGATGACGGTCCTCGCCAAAATCGCTTGGTTTGCCGCGAAACAGTTGCGTCTCGTATTGGTCGAGCAGTAAAAGCAGTGCAGATCGGCGGTACATGGCGCATGGACCACAACAGCACATCACCGCACCAAAGCGTGCCTGCGCCGCTCGCTCCTCGTTGCAGGCCAGCCAGTATTCCATGTCGATTAGTCGCGTTAGCCATGTGTCGCTGCGATTTCTAGCAACCAATTGTCCCATGACCGCGCCTATCGATGGATCGCGCATCCTGTGGACGAGCTTCGTGACTACGTCGGGCGCAAGCATGGTGTCTGAGTCGACATTCAGAAGCAAGTCACCGGATGAGCGGCGTATCGCGGCGATCTGCGCCTTGCGCTTTCCGGCATTTGTAGAGAGAAGAATGAAGTTGAACCGTGGATCTCGAGCATATTTGTTGTGTACGGGTGCGAGGTCCTCGCGGTTTGTAGAGCCATCATCCACCACGAAAACCTGCAGCCGCCCGGCATAATCTTGATTTGCGATAGACTCGAGGCACGCCGAGAGAGTGTCGGGGTCTTCATTGAAGCAGGGAACAATCACGTCGACGCTTGCCAACGCGTGGAAAGGGGAGCTTCCGGGAGATGCAGAAAAAAAACTTGCTGGTGGAGCATACAGAACCTGCATGCCTTTATAGACGCCCGAAAGAAGCGCATACAACGAAATGGCCACGGTGCCGGCTGTACCAAATAGATCCAT
Protein sequences of DBSCAN-SWA_25 >NC_022536|258680:266052|262331_264047_-|WP_022557355.1|DBSCAN-SWA MRICGIKLTHDGAVALIENGKLIFCIEQEKRNNNSRYQAIDNLDAIVEALNDHGLSVGDVDQFVIDGWDGELESEFQVFSEGAPITLSGAPYVERWPESPLKPHDSSGLALSGSILPYKSFPHVISHVVSAYCTSPFAKAGDPSFCLVWDGCIFPRLYYAEPKGVRLVKCLFPMIGHAYAVAGHHFGPFRNADPQSWDLGVAGKLMAYIALGAAQENILNVFRELYEEHFAGETLIAVNYRENIHNADALLACVNDYFDASASRLQGEKPQDVLASFHVFLERLLVGEMTNALQMQSQFESRNLCVVGGCGLNIKWNSALRASGLFESVWVPPFPNDSGSAIGAACCALAVDRGLAPLDWSVYSGPKLKSSAIPQGWKAAPCTLAELATILASNEPVVFLAGRAELGPRALGGRSILAAGTSPQMKDHLNKVKFREHFRPVAPICLEDRARDIFDPGTPDPYMLFDHTTRPEWRERIPAVVHLDGTARLQTISRDSEHEVAKLLVEYESLTGIPLLCNTSANYNGRGFFPDAAAACEWGQIGHVWCDGLLLTKASEVDGSPAGCADFSASA >NC_022536|258680:266052|264798_266052_-|WP_022557357.1|DBSCAN-SWA MDLFGTAGTVAISLYALLSGVYKGMQVLYAPPASFFSASPGSSPFHALASVDVIVPCFNEDPDTLSACLESIANQDYAGRLQVFVVDDGSTNREDLAPVHNKYARDPRFNFILLSTNAGKRKAQIAAIRRSSGDLLLNVDSDTMLAPDVVTKLVHRMRDPSIGAVMGQLVARNRSDTWLTRLIDMEYWLACNEERAAQARFGAVMCCCGPCAMYRRSALLLLLDQYETQLFRGKPSDFGEDRHLTILMLKAGFRTEYVADAIAATVVPARLAPYLRQQLRWARSTFRDTWLALRLLPSLDRYLTLDVIGQNLGPLLLAVSVLMGLTQIAVAATVPWSTIILIAFMTIVRSSVAALRARQVRFLAFAAHTPINLFLILPLKAYALCTLSNSAWLSRTSILQSPPRGAETSSLKSLPMD >NC_022536|258680:266052|264077_264761_-|WP_048903028.1|DBSCAN-SWA MLQLTARPRAGKSNIVCRISHKRCRKLSQKKHYQLLHDELAEDDPWRLDSNPFEQERHRQTLRLALAQQSITHALEVGCAAGAFTEKLAPHCEQLTVIDVVPHALARTRRRLMDPPNISWISCDILQFATQQFFDLIVVAEVLYYLESVAEMRTVVRNLAQMLAPSGHLIFGSAGDASCQRWGHVAGAETVITILDEELVQIDRVRCVGQTTNEDCLLTRYRHPVSQ >NC_022536|258680:266052|260628_261417_-|WP_022557353.1|DBSCAN-SWA MWENYAAVLPANGWNWTAVWRRNYLAWKKAALVSILGNLADPMIYLFGLGTGLGLMVGRVEGMSYIAFLATGMVAASAMTASTFETIYAAFARMRDQRSWEAILYTQITLGDIVLGEVIWAATKALLAGTAIAVVAAILGYSVWSSIPYVVPVIALTGIAFASLAMIVAALAPSYDYFIFYQTLILTPMLFLSGAVYPVTQLPGKVQQMATFLPLAHSIDLIRPAMFGRPAADVIFHLGMLFVFGVLPFFVSTALLRRRLMS >NC_022536|258680:266052|260201_260375_+|WP_158454204.1|DBSCAN-SWA MAFVEAVLRSKSQVALEITLVRPEATESRQRLDQARLRQFAELMPEHIRTGMRENLS >NC_022536|258680:266052|258680_259676_+|WP_022557351.1|transposase|DBSCAN-SWA MTKHLIEVITSAERRRRWSREDKERLVAACFEPDAVISEIARAAGIHVSQLFRWRKELCRIEEARSDTATLVPVIVSEAASTASPVQPDSPTTSHPRRKRSDVTIELGRGRRVRVDSDIKGDRDPNPLRQAFCWAHARRKFFVLADINANAKRGKNAAPISPMALEAVKRIDGLFDIEREINGLTADQRLERRRKESLPLVDSLQAWLQTERAKLSRSSPVAEAIDYMLKRWDGFTSFLEDGRLCLTNNAAERALRGFALGRKSWLFAGSDRGADRAAFMATLIMSAKLNDVDPQAWLADVLTNIADTPISRLEKLLPWNWTPPTVNAQAA >NC_022536|258680:266052|261420_262335_-|WP_022557354.1|DBSCAN-SWA MTTIAISFIEVTKTYIDRTVVDRFSFAVKKGECFGLLGPNGAGKSTIARMVLGMTPPDEGKITVLGAPVPAQARLARASIGVVPQFDDLDQEFTVRENLLVFGRYFNMSTRQIEAAIPSLLEFARLESKADARVVGLSGGMKRRLMLARALINDPQLLVLDEPTTGLDPHARHLIWERLRSLLARGKTILLTTHFMEEAERLCDRLCVLEEGQKIAEGRPQGLIDEQIGCQVIEINGGNPHELRALIKTCAQRIEVSGETLFCYSSTPEQVRIKLREHTNLRLLQRPPNLEDVFLRLTGREMKD |
7 | Stx2-converting_phage(25.0%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_26 |
269503 : 270602
Sequences of DBSCAN-SWA_26
Nucleotide sequences of DBSCAN-SWA_26 >NC_022536|269503:270602|DBSCAN-SWA GATGAAGGCCTCGAAGTTTTCGGAAGCGCAGATCGCGTTTGTTTTGAAGCAGGCAGAGGATGGAACGCCCATCGGCGAGGTCTGCCGCAAGGCGGGAATTTCGGACGCGACGTTTTACAATTGGCGCAAAAAATACGCGGGCTTGATGCCCTCGGAGATGAAGCGGCTACGACAGCTTGAGGAGGAGAATGCCAAGCTGAAGCGGATTGTTGCCGACCTGTCGCTGGACAAGGCCATGCTACAGGATGTGCTTTCAAAAAAGCTTTGAGGCCTGCCCGCAAGCGCAAGCTTGTCGACACGATCAAGGCGGACTGGAAGGTTTCGATCCGGCGTGCATGCTCGGTGTTGAAAGTCGACCGGTCGCTCTATGTCTACAAGTCCAGGCGTGGCGAGCAGGCCGAGTTAAAGCTGAAGATCAAGGACATCTGCCAGACGCGGATACGCTACGGCTACAGGCGTGTGCATATCCTTATCAAGCGTGAAGGCTGGTCTGTGAACCCGAAGCGAATCTATCGTCTTTACAAGGAGATGGACCTCCAGCTCCGCAACAAGGTTCCCAAGCGCCGGGTCAAGGCGAAGCTGCGGGCTGACCGCACGGAGCCTAGCCATTCCAACCATGTCTGGGCGATGGATTTCGTTCATGACCAACTGGCCACTGGTCGCAAGATCAGGGTTCTGACGGTTGTTGATACCTTCTCTCGCTTCTCGCCAGTGGTGGATGCTCGCTTTAGCTACAAAGGCGAGGATGTCGTTCAGACATTGGAGCGGGTCTGCCGACAGATCGGATATCCGGCGACCATACGTGTGGATAATGGCAGCGAATTCATCTCCCGAGACCTTGACCTCTGGGCCTATCACAAAGGTGTCGTGCTTGACTTCTCGCGGCCAGGCAAACCGACCGACAACAGCTACATCGAGAGTTTTAATGGCAAGTTCCGCGCCGAGTGCCTGAACGCTCACTGGTTCATGAGCCTTGACGACGCGCGGGCAAAGATGGAGGATTGGCGTAGAGACTATAATGAGTTCCGGCCACACAGCGCGATCGGCAACAAAGTGCCGATTTCGCTCATGAGCGGCTCATCGGCATCACCGCCGACATG
Protein sequences of DBSCAN-SWA_26 >NC_022536|269503:270602|269503_270602_+|WP_144115371.1|transposase|DBSCAN-SWA MKASKFSEAQIAFVLKQAEDGTPIGEVCRKAGISDATFYNWRKKYAGLMPSEMKRLRQLEEENAKLKRIVADLSLDKAMLQDVLFKKALRPARKRKLVDTIKADWKVSIRRACSVLKVDRSLYVYKSRRGEQAELKLKIKDICQTRIRYGYRRVHILIKREGWSVNPKRIYRLYKEMDLQLRNKVPKRRVKAKLRADRTEPSHSNHVWAMDFVHDQLATGRKIRVLTVVDTFSRFSPVVDARFSYKGEDVVQTLERVCRQIGYPATIRVDNGSEFISRDLDLWAYHKGVVLDFSRPGKPTDNSYIESFNGKFRAECLNAHWFMSLDDARAKMEDWRRDYNEFRPHSAIGNKVPISLMSGSSASPPT |
1 | Leptospira_phage(100.0%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_27 |
276131 : 281328
Sequences of DBSCAN-SWA_27
Nucleotide sequences of DBSCAN-SWA_27 >NC_022536|276131:281328|DBSCAN-SWA CTCAGTTGGTCGGTGTGCCGGCATTCATCATTTCAAAGAGCGCCCGCTGAAGCGGCGTGGCAAGCGTCGCCGCAAAAGGGGTCGCGATGTGGTGGCGGTCGCGGAAAGTCAGCTTGCCGCCAATCATGGCGGGGCAGGTCGTTGCGTTGCAAAATACGTCGGTGATGTCGACGTAAGAGGCATTCCCAAAATCCGAAACGGCCTTCCTTTCCGCGTCCGGAATCGTCTCTTCCATCGCCCCGCTGCGCGGCGTATCACAGGTCTCGGCCCCCCTTTTTTGCCAGAGCGCGCGCGCGACACATTTGTCGAGATAGGTCTTGTGCACGGGCCCATCTCGGAGGACGGCGATCTTGGTGACAGCGCTTTCCAACGCGTCCACGGTCCTGCGGAGCCCCTGAGCCCATGTGGTGGCGTCGATCTGGTGGATAGATACGACGTTGATGTCATTCTTGACGTAGGCCGAGGAATATTCCGAGATGATCACCATGTCGGGCTTGCGTGTGGCAATTTCGCGCATCGCGAGCTGCCGCCACCGATCGCATTCCTCGTAATTCCGCATGAGCATAGAATTCCAAATCGTAACGTCGGCTGCCGGGCACGAAGATTTCAGATAGGTGACGAGCTGCCATCCGTTGCTTTTGGCGATGCTGATCAACGGCGTCGACCAGTGGTCGGCATGTGAATCCCCGAAGAGAACAATCGTCTTACCGGGGGGGATGGCGCCGAATTCACAGGGTTTGGGCTGCACCTGTTGTGCGTTGAGAAGACAGCCATCGTCGAACTGCCGGGCGGACGAGTCGCGCTCAGCGCTTCGGAGGATAAGATTCTGCTGGAGATCGATGTTGTGATTGGCCACCCGCGCGCTTCCATAGGCTGCCGCTGCGCCACACGCGGTCAAAAGCGCGGCAAAACCGAGGGAACGGCTCGTCCTTGCCAACAGCCAGGGGTTCCTCCGCATCGGGTTTTCGATGAAGCTGTAGCTGAAGACTGAGAGGCCTAGGATCAGCGCGAGGCACAGGAACCGTTCGAATGTAGTGAGCTCGGGTTTCAAGATGCCGGAATAGACGATCACCGGCCAATGCCAGAGGTAAAGCGAATAGGATATCCGTCCGAGCCACTGCAGCGGCGGTAGCGAAAGAAGGACTTGCGGTCCCGCCGGGCCCGGCCGTGTGCCGCTGAGAAGCACCATCACTGTGCCGGCCACCGGAACCAGAGCAGTCAATCCTGGAAAGGGCACGTCCTCTGACATGCCGAGATAGGCCGTCATGATCAGGCCGATTCCGGTCCAGCCGAAGACGGGCGTCAGCCGAAAGCGCTTGGCCGCTTCCTCGGAGATCATCAAGGCGAGTCCCCCGCAGGCGAACTCCCACGCCCGGAACGGTGAAAAGTAAAATGCCCATGGTTGGGATATGGCGGTATAGTACCAGCAGAAGGCGAAGGAGATGACGCCCATTAGCACCACTGGGAGGAAAAGCCCGTATCTGCCGGGGCGGAGCCGCGCGAAAAGCAGCAGCAGGGCCGGCCAGACGAGATAGAATTGCTCCTCGACCGAGAGCGACCAGAAATGGATGAACGGATTGTTAAACGCGTCCGGGGCGAAGTAGTCAGCGGCCCAGCTGATAAGCCAAATATTAATCATATAGGCGGACGCGTAGAACGACCCCTTGGAATAGAGCTGCTGCTCGGGCGGCGAAAGAATGAAGTAGCCGAAGAAGAGCGTCGCCAGGATGACGAAACACGATGCAGGAAGAAGCCGGCGAGCTCGTCGCGCGTAAAACCGCAGGAGGTCGAGCGAGCCGGTTCGCTCGATTTCCAGCTGAATGTGCCTGGTGATGAGATAGCCCGATATGACGAAGAAGATATCGACACCGACAAAGCCGCCGGGCACGCTGGTCATGCCAAAGTGAAATGCGATTACTCCCGAAACGGCAAGGGCACGCAGCCCCTCGATATCCATTCGGAAAAAATGTGGTTCAGTTCGTGCCATATCGAAACTTGAATGCCCTATGTTGCGGTCTCACCGGATCAGTACCGACGGAAAGTACAGATCTCGATGGCCAGCATGTGAAGGCAGGCCTCTATGCATTATGAGAAACCGCCATCCGTGACGCCTCCTCGATCAATATCTGGCGCATCCAGATACTGGCCGGATCGCCGTTATGAAGCGCGGGCCATTGAACTGCTTCCGTAAAGGCAGGAAGCGGAAGCGGAAGATTAATGATCCGCAATGGGATCGTTTTCTCAAAGTACTCGACTAATCGTAAGGGCATTGTGGCGATGCGGTTTGTGCCGCTCAGCATCTGCGGTATGAAGCTAAAACCTTGCACTACGACTTCGAGGCGTCTTTTAATTCCATGTTCTAGCAAGTACCATTCTTCGATCGAAGGCCTCAACAAACGCCCGAACATAACTGCGACGTGTCCCATCGACATGTATTGCTCGAATGTGAGCTCCCGCGGTACCGCGTTGTTGGTTGGACAACTCACGCATACGAGTTTCTCGTCGAAGAGCTTCGCGGTAGGATGCGAGTTCGCCATGTATAGATCCGGAAGCAGGAGAAAATCAACTTCACCGCGCCGGAGGAGTTCATTCGGGTCATTATCGAGAGGCAGAAGTTCGAAACTGACAGCTGGTGCTTCCCGTGCCACACGGGCGATAACCCTATCAAAAAAGACAAGCGTCATGAAATCCGAAAGAAGGATCCTGAACCGACGTTCGGATTGAGATGGGTTGAATGGCTCCCAAGAAATAATTGAGCATCGAATTTTCAGGAGAACGTCGCGGACCACCGGGGCGAGTCCTTCGGCTCGCGGTGTGAGGCAAAGTTCGCGACCGTTCATCGTGAACAGGTCATCGCGAAAATATTCTCTTAACCGGCCGACGGCCGCGCTCATGGCGGGCTGGCTCAGATTGATGCTGCGTGCCGCCACCGTGAGATTCCGTTCGGTCATCAGTGCGTCGAGCGCGACGAGAAGATTTAGATCAAGGCCTTTGAAGCGCACGTTTGAAACCATCCATGCTGTGGATCATTTTCTGTTATCATTCCACTTAAGGACTGACTTCGAAGCTCATGAAGAGGCCATTGGCAGCATTGCCGCTGTTGGCCAGCGCTTCGAACCGGCCAAAACAAGCGACGTGACGACCACGGTTCCCTTTCCAAGGGCATGGTGATTCCGGCCCTAACTGCGAGGCGATGCCCGACCCCGACAGAGGTTCTGTCATCGACATCACCGCGAGAATTGGAAGACGGCCGGTCTTGCAGGCTGCCGTGGTCTCCGCTATTGCCACGGTAAGTCCGATGATGGCGGCAACGAGCGTGGAGGCACGACCGCTTGCAAGCAGGCCGATGACGATCAGCGCGGGGCAATGCCCAAACAATGTAAAGCAATGCCTTCGTGATCCTCCGGTGCCAGCCGTTGCTTTGCCTCAAGCCAACTCCTGCTCGGTGAGCCATTCGCTCCACCAGAGCAGCCGTGAAGCGAAAATGGGTGGTCCCCGGATTATCATAGATCACGATGAGGAGGCCGCTCTTTCGAGTGACTGTGGAGAAATGCTCGAAGACTTCATCCTCCGTCAGAGGCGCATACGAAACGGCAGCAAGAAGACCTGCGGCCGCGCCTGCGGCCTTGGCGTCCTGCGCCAGTCTGACCGCCTCGTCGGTCCGTAAGGCGCCGACGCCTGCCACGACCGGTACGCCGTTGGCTTCCTGGATGGCCGCATCCAAGGCGCGTCGCCGCTCCTCTCGGCACAGATACATGTAGGTTCCAGTGCTGCCGAGAAGGCCAATGGAGTCGACGCCTGCGATGCAAAGCCGAGCCACAAGTCTCTTGAGAGCGGCAGTGTCGACGCGACCGTTTAGGTCACTCGGCGTGATGGGAAATGCGGAAAGCCCGCTCAGGAAAGCCATTCGATGCTCCGGATCCGGTTCTGTACTTTTATATTCGCAGTCGTCAATGCGGGTGTTAGAGGGTCGCGAGCGCCTGCCTCAGATCGCCAATCAGGTCAGTCACATCCTCAAGACCGACAGAAAGACGGATGAGGTCGTCACCAACCCCCGAGACCATATGGGCGTCTTTTCGGATCGATTGCCGCGCCCTCGTAATGCTGGCCGGATGATAGATCAGTGTGTCGGTGTCGCCGAGGCTCACCGCGCGCGTCATCAGTTGCAAACGATCGAGCATATTTCTTGCGCCGTCAAAACCGGCGTGAAGTCCGAAGGCCAGCATGCCCGATCCCTGTGTCATTTGCTTGCGCGCGATAGCCTGATCGGGATGCGATTCCAGGAAGGGATAGCTCACCCAGGACACGGCGGGATGCGCTTCGAGCATTCGAGCGATCGCCAGAGCCGAGGCGCTGTGCCGATCCATTCGAAGCGAGAGTGTTTTCAGACCGCGCATGATCAGGAAAGCGGAGTGTGGCGCCAAGGTGGCGCCGGTAATGTAACGCAGACCCGTTTCGTGCAGGCGGTGGAGCGTTTCGGCGTCGCCGAGTAGCGCCCCACCCAAGGTGTCGCCATGACCGTTGATGTATTTCGTAAGAGAGTGAAGCACGATATCGGCGCCATGCTCGATCGGCCGCTGGAGCGCGGGAGAAGCAAAGGTACTGTCCACTGCGACTTTCACCCCACGCGCGCGTGCGCGCTCTGAGATTGCCGCAATGTCGAGAATGCTGCTGAGCGGGTTCACGGGCGTCTCGAAATAGACGAGCTTGGTCCTTTCCGTGATCGCCGCGTCGAGGTTTGACGGATCAGAGAGATCGACGGGAATGACCTTGATGCCGAAACGCGGCAGCCCCTGCTCCACCATCGCCACAGAATTCGAATAAAGCGTTTTGTGAACGACGAGCTCGTCACCCTGCGACAGCAAGGAGAGGATGAGAGTGCCGAAGGCGGCCATTCCGGTCGATACGACAAGGCCCGCCTCCGCCCTTTCCAGATTTGCAAGCCTCTGCTCAAGGATCTCTGTGGTCGGGTTGTATTCCCGCGCGTAAAGTCGGCCGCCGAGCGCTGCTGCGGCGTCATTGGCCTCGACGCTCTCGAAGCCGTAGGTCGAGGTCAGGAAGACTGGCGGCTGGACTGCCCGTTTGAAGTCCACCGGGCTGAATGCATGATGAATGGCGCGCGTGTCGAAGCCGAAGCTTGCCGCGTGAAGGTGGTCAGATTGTCTCTGACGGTTGTCATGCTCGCTGCCGCCTGTCAT
Protein sequences of DBSCAN-SWA_27 >NC_022536|276131:281328|276131_278117_-|WP_048903029.1|DBSCAN-SWA MARTEPHFFRMDIEGLRALAVSGVIAFHFGMTSVPGGFVGVDIFFVISGYLITRHIQLEIERTGSLDLLRFYARRARRLLPASCFVILATLFFGYFILSPPEQQLYSKGSFYASAYMINIWLISWAADYFAPDAFNNPFIHFWSLSVEEQFYLVWPALLLLFARLRPGRYGLFLPVVLMGVISFAFCWYYTAISQPWAFYFSPFRAWEFACGGLALMISEEAAKRFRLTPVFGWTGIGLIMTAYLGMSEDVPFPGLTALVPVAGTVMVLLSGTRPGPAGPQVLLSLPPLQWLGRISYSLYLWHWPVIVYSGILKPELTTFERFLCLALILGLSVFSYSFIENPMRRNPWLLARTSRSLGFAALLTACGAAAAYGSARVANHNIDLQQNLILRSAERDSSARQFDDGCLLNAQQVQPKPCEFGAIPPGKTIVLFGDSHADHWSTPLISIAKSNGWQLVTYLKSSCPAADVTIWNSMLMRNYEECDRWRQLAMREIATRKPDMVIISEYSSAYVKNDINVVSIHQIDATTWAQGLRRTVDALESAVTKIAVLRDGPVHKTYLDKCVARALWQKRGAETCDTPRSGAMEETIPDAERKAVSDFGNASYVDITDVFCNATTCPAMIGGKLTFRDRHHIATPFAATLATPLQRALFEMMNAGTPTN >NC_022536|276131:281328|280092_281328_-|WP_022557291.1|DBSCAN-SWA MTGGSEHDNRQRQSDHLHAASFGFDTRAIHHAFSPVDFKRAVQPPVFLTSTYGFESVEANDAAAALGGRLYAREYNPTTEILEQRLANLERAEAGLVVSTGMAAFGTLILSLLSQGDELVVHKTLYSNSVAMVEQGLPRFGIKVIPVDLSDPSNLDAAITERTKLVYFETPVNPLSSILDIAAISERARARGVKVAVDSTFASPALQRPIEHGADIVLHSLTKYINGHGDTLGGALLGDAETLHRLHETGLRYITGATLAPHSAFLIMRGLKTLSLRMDRHSASALAIARMLEAHPAVSWVSYPFLESHPDQAIARKQMTQGSGMLAFGLHAGFDGARNMLDRLQLMTRAVSLGDTDTLIYHPASITRARQSIRKDAHMVSGVGDDLIRLSVGLEDVTDLIGDLRQALATL >NC_022536|276131:281328|278208_279132_-|WP_048903080.1|DBSCAN-SWA MRFKGLDLNLLVALDALMTERNLTVAARSINLSQPAMSAAVGRLREYFRDDLFTMNGRELCLTPRAEGLAPVVRDVLLKIRCSIISWEPFNPSQSERRFRILLSDFMTLVFFDRVIARVAREAPAVSFELLPLDNDPNELLRRGEVDFLLLPDLYMANSHPTAKLFDEKLVCVSCPTNNAVPRELTFEQYMSMGHVAVMFGRLLRPSIEEWYLLEHGIKRRLEVVVQGFSFIPQMLSGTNRIATMPLRLVEYFEKTIPLRIINLPLPLPAFTEAVQWPALHNGDPASIWMRQILIEEASRMAVSHNA >NC_022536|276131:281328|279107_280037_-|WP_022557371.1|DBSCAN-SWA MAFLSGLSAFPITPSDLNGRVDTAALKRLVARLCIAGVDSIGLLGSTGTYMYLCREERRRALDAAIQEANGVPVVAGVGALRTDEAVRLAQDAKAAGAAAGLLAAVSYAPLTEDEVFEHFSTVTRKSGLLIVIYDNPGTTHFRFTAALVERMAHRAGVGLRQSNGWHRRITKALLYIVWALPRADRHRPACKRSCLHARCRHHRTYRGNSGDHGSLQDRPSSNSRGDVDDRTSVGVGHRLAVRAGITMPLEREPWSSRRLFWPVRSAGQQRQCCQWPLHELRSQSLSGMITENDPQHGWFQTCASKALI |
4 | Gordonia_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_28 |
287275 : 288771
Sequences of DBSCAN-SWA_28
Nucleotide sequences of DBSCAN-SWA_28 >NC_022536|287275:288771|DBSCAN-SWA TTCAAACTGCTTCCGCGTGGGCGCCAAGATAGATCTTTCCGACGACGGGATTGTTCATGATGCTCTCGGCGCTCCCATCGATCTGGTTTTTTCCCTCTGCAAGAATGTAGCAGCGATCGGAGATCTCCAGCGCCTTATGAGCATTCTGCTCAACCAGCAACACGCCGACGCCTTGTCGATGACTTATCTCGCGTATGTGCTGGAATACTTCGTCGGTCGCCTTGGGCGACAGGCCTGCGCTTGGTTCGTCAAGAAGCAAGACCGAGGGTCCAGCCATGAGCGCCCTGCCGAGCGCCACGACCTGCTGCTGGCCACCACTTAGCTTACCGGCGGTCGCGGTCCAGTAATCGATCAGGTTGGGCAGAATTTCACGCACCACCGCCTGCCGCTCCTTGACGGAACCCGGGACGGCAAAAAGACCCATACGCAGGTTTTCTTCGACGCTCATCGTGGCGAAGACATTGCGCACCTGCGGGACATATCCCAGACCGGAGCGCAAACGATCACGCGTCGGGGCGTTGGTGATATCGATCCCCTTCAGCGTGATAGTTCCACCTTTGGGCTTTAACAGGCCCGCAAGTGTCTTCAATAAAGTTGACTTGCCACAACCATTCGGACCGATAATGGTGACGATCTCACCTGCGGCCAACTCTAGGCTCACTCCATGTAAGATCTGCATCTCGCCGTAACCGGCGACAAGGGACTCGGCTGTTAAAATGATCTTCTTGTTCATGCGCTCACCCCTCCAAGGTAGGCGTCCAGCACTTCCTTGTTTGTTTGAATCTCAACAGGCGTGCCTTCGGCGATTACCTGCCCCCTGTCCATGACGATCACCCGCTCGCACAGTTCCATGATGACCGTCATGTTGTGCTCGATGATCAACAGCGTCACACCAGAGGCATGAATGCGCCTGAGTACCGCCATCAGATGTCGTATAAGGGTCGGATTGACTCCGGCGGTGGGCTCATCGAGCAACAACAGCTTTGGGTCAGCCATCAGGCAGCGGGCAAGTTCAAGAAGCTTTTTCTGACCACCCGAAAGTAGGGCTGCCGGGTGATGAGCCTTTTCACCCAAGCCGACCATTTCCAAGACACCCTGAGCCTTCAGCAGGTTGATGCTTTCCTCGTCGCGCACGCGCTTGCCGGGTAAGAGGACCGACCGCACTTGGTCGCCGAACTGCCCCTCGGGCACCAGCAGGAGGTTCTCAAGCACAGTCATTTGTTTGAGTTCGCGCGGAATCTGAAACGTGCGGCGAAGCCCGAGCCGGGCGATATCGTGCGCGCTCAGCTTTTCGATGTGGTTCCCATCGAACAGGACTTCGCCGCTGTCGGGGCGAACGAAACCACTCACCAAACTGAAAAGCGTGGACTTACCGGCTCCGTTTGGCCCGACGAGTCCGGTTATGCTGTCGGCGCGGACGGAGAACCGGCATTTTTTGACTGCCTGAAGGCCTCCAAAACGCCGGCTGACGCCCTTGATCTCGAGTATCTGTTTCAT
Protein sequences of DBSCAN-SWA_28 >NC_022536|287275:288771|287275_288007_-|WP_022557376.1|DBSCAN-SWA MNKKIILTAESLVAGYGEMQILHGVSLELAAGEIVTIIGPNGCGKSTLLKTLAGLLKPKGGTITLKGIDITNAPTRDRLRSGLGYVPQVRNVFATMSVEENLRMGLFAVPGSVKERQAVVREILPNLIDYWTATAGKLSGGQQQVVALGRALMAGPSVLLLDEPSAGLSPKATDEVFQHIREISHRQGVGVLLVEQNAHKALEISDRCYILAEGKNQIDGSAESIMNNPVVGKIYLGAHAEAV >NC_022536|287275:288771|288003_288771_-|WP_048903031.1|DBSCAN-SWA MKQILEIKGVSRRFGGLQAVKKCRFSVRADSITGLVGPNGAGKSTLFSLVSGFVRPDSGEVLFDGNHIEKLSAHDIARLGLRRTFQIPRELKQMTVLENLLLVPEGQFGDQVRSVLLPGKRVRDEESINLLKAQGVLEMVGLGEKAHHPAALLSGGQKKLLELARCLMADPKLLLLDEPTAGVNPTLIRHLMAVLRRIHASGVTLLIIEHNMTVIMELCERVIVMDRGQVIAEGTPVEIQTNKEVLDAYLGGVSA |
2 | Cedratvirus(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_29 |
304141 : 307475
Sequences of DBSCAN-SWA_29
Nucleotide sequences of DBSCAN-SWA_29 >NC_022536|304141:307475|DBSCAN-SWA ATCACGCGCATGCCTTCGGGCGAGCCGCGTTCAAGGTGTTGGCCATCAGCATTGCGATCGTCATGGGCCCCACACCACCTGGAACCGGTGTCACGGCGCCCGCGACAGCCGCGACTTCTTCGAAGTCGACATCGCCAACGAGCTTTGTTTTCCCCTCGCCTTTCTCCGGTGCTGCCACGCGGTTGATCCCAACATCGATGACCGTCGCGCCCGGCTTTACCCAGTCAGCCCTGACCATTTGCGGGCGTCCTACGGCTGCAACGAGAATGTCCACCTGACGCGCTAAAGACGGCAGGTTCCGACTTTTTGAATGAGCTATCATTACTGTCGCGTTGGCGTTCAATAAAAGTTGCGCCATTGGCCTTCCGAAAAGGTTGGAGCGGCCGATTACCAGCGCGTTGAGTCCGGAAAGATCGGATCCGTGAATTTGCTGGATCAACAGCATGGTGCCCGCCGGGGTGCAGGGCACAAGTCCCGTGGTGAGGTCTCCCGTTGCGACCTTTCCTGCATTGACAACGTGTAACCCGTCGACATCCTTCTCTGGTAGGACAGCTTGAATGACTTTCTCACCGTTAAGATGCTTTGGTAGCGGCAATTGGACGAGTATACCGTCAATCTTTGGATCGGAGTTCAGCCTTGCGATCAGAGACAGCAACTCGCTTTGTGTTATGTTTGCAGGCAGCGTATGCTGAACTGAGTGGAAGCCGCATTTCTTCGCCATCCTGCTTTTCGCAGCGACATATGCATGGCTTGCCGGATCGTCTCCGACGATGACGACCGCTAGACCGGGTTTCCTTTGGTGGGAAGCTTCCAACTTCAAAACCGCGTTCGTCACGTTCGCAATCATATCGGCGCCTGCTGCCTTTCCATCGATGATTAGCGCCATGGCTTTTATCCCATCCGCTCAGATGCGTAGGAGCCCGGGCTTGGCGGGAAGACGACGGTACGGTTGCCGTTGATGAAGGTCCGGTGATGGATATGGGCGTGGATGGCGCGGGCGAGAACCTGGCTCTCCACGTCGCGGCCGATCGACACATAGTCGTCGGCGCTCTGAGCGTGAGTGATGCGCGCTGTATCCTGCTCGATGATCGGGCCTTCATCCAGATCGGCAGTCACGTAGTGGGCCGTCGCGCCGATCAGCTTCACGCCGCGCTGATAGGCCTGCTTGTAAGGGTTGGCGCCCTTGAAACTCGGCAAAAAGGAATGGTGAATGTTGATGATCTGGCCGGACATCTTGCGGCACATAGAATCGGAAAGCACCTGCATGTAGCGGGCGAGCACGATGAGTTCGGTGCCTGTCTGATCGACGAGATCGAGAAGCTGGGCTTCCGCTTGCGGCTTGTTTTCCTTCGTGACCTTGATGTGGTGGAAGGGGATATCGTGGTTGACGACCACCTTTTGGTAATCGAAATGGTTGGAGACGACGCCGACGATTTCGATCGGCAATGCGCCGATCTTCCAGCGATAAAGTAGATCGTTGAGGCAATGGCCGAAACGGGAGACCATCAGCAGCACCTTCATGCGCTTTTTGGCATCGTGTATTTCAGCCTGCATCGAAAACCTTTCGGCAACCGGCTGGAGACCTTCCAGAAGTGCCTCTCGACTAACGTCTTCTTCTGAAATGAAAGAAATGCGCATGAAAAAGCGACCTGTATCGAAATCATCGAACTGTGAGCTATCGACGATGTTGCAGCCCTTCTCTGCCAAGTAGGCCGAAAGGGCAGCCACAATGCCGCGTGTCGTATCGCAGGAAACCGTCAAGACGTAGTTCTTCATTCCAACCTCGTGTTTGTTGTAGCCTCTTAAGAAGGTCTCGCCCGTTCGCAGAGGGCATTCGGGCTTCTTGAACAGCTCAATGTGGCCATACGCCAACAATCCCTGTTTCGGGTTATCTGGCGGAGAGTCCAATATATCGGTCGAGTTCCGTGCGATTGAGGGCCAGTGTCGCAGCTCGTTCCTCCAACACGATCTTGCCCTGGTCCAGCGCAATGACGCGTTCTGCGAAGTTGAGCGCCAAGCCGATATGTTGCTCCACCAGCACGATCGTCTGGCCGTGCTCAGCCAGTTTGTGAAACACCTCCATCAGCATTTCACATATAACCGGTGCCAGGCCCTCAAGCGGCTCATCTAGCAAGATGATTTCGGGTTTCCCAAGCAGGGCGCGGGCTATCGACAGCATCTGCTGTTCGCCTCCGGACAACTGACGTCCGCCGTTCGATCGACGCTCCTTTAACCGGGGAAACAGATCGAAAGCCTCTTCGACGGTTGACCCATTCCGAAGTCCAGCAACCAGGTTTTCCTCAACTGTCAACGATCCGAAGATGTCCCGCGTTTGTGGCACGATGCCAATCCCCAGTCTTGCCCGCTGATGCGAGGTAAGTGTGTCGATAGCTGAGCCATTAAGTCTGATGCTTCCCGCGTGTTGCCGGGTCTGACCCATGATGGTGGAAAGCAGGGTGGTTTTACCAACGCCGTTTCGTCCGACAATAGCAAGCCGCTCGCCTCTCGGAATGCTGAGCGAAACATTGCGGACGATATGGGTGTCACCGTATCCAGCGACCACTCCATCCATTTCGAAAAGGTTATCAGACATGACCCGCTCCCAAATAGAGTTGCCGAACCCGATCGTCCGCGACCACCTCGCCTGGCTTGCCTTCCATCAGGATTGCTCCATTCACCAGAACGATGATGCGGTCTGCAACCTCGAAAACGAGTTGCATGTCATGTTCGATGATGAGAACGGAAAGGTCCGGCGGAAGCCGCTTGATGGCCTCCACGATCAAATGGCTCTCGGTGGAGGGGACGCCTGCTGCCGGCTCATCGAGTATGAGAACTTTTGGTTTCACTGCAAGCGTCAGTGCCAGTTCCACCAGCCGCTGCTGCCCATATGCCAGATCGCGTACCTCCTTGTCGGCCAGTGAGTGAATGTCGAGATCCTGCAGCAAACGGTCAATTTCAGCCTCGATATTCAGCCACCCGTCCGCCCGACGAAAGATATTATGTGTGCGCCCTTCCCGCTCTAGCATCGGCAGCCGCAGGTTCTCACGTACGCTCAGATCGCGGAAAAGGTTGGTTATCTGGAATGTTTTTGCGATACCTACCTTCACCCGAGCAGCTTCATTCAGTGAGGCAATGTCTTCACCGTTCAGCATGATCCTACCCGACGTAGGTTTCAGAACGCCGGTGAGCAGATTTGCAAATGTCGTCTTGCCCGCACCGTTCGGCCCGATCAAGGCGGCGCGCGAGCCTTGCGGCAAAGTGAGCGTCAGATCCCGCGCAACCTGAAGGCCTCCGAAGTTTTTGCAAAGCCCGGCAATAACCAGTCCAGCGCTCAT
Protein sequences of DBSCAN-SWA_29 >NC_022536|304141:307475|306725_307475_-|WP_048903034.1|DBSCAN-SWA MSAGLVIAGLCKNFGGLQVARDLTLTLPQGSRAALIGPNGAGKTTFANLLTGVLKPTSGRIMLNGEDIASLNEAARVKVGIAKTFQITNLFRDLSVRENLRLPMLEREGRTHNIFRRADGWLNIEAEIDRLLQDLDIHSLADKEVRDLAYGQQRLVELALTLAVKPKVLILDEPAAGVPSTESHLIVEAIKRLPPDLSVLIIEHDMQLVFEVADRIIVLVNGAILMEGKPGEVVADDRVRQLYLGAGHV >NC_022536|304141:307475|306028_306733_-|WP_022557395.1|DBSCAN-SWA MSDNLFEMDGVVAGYGDTHIVRNVSLSIPRGERLAIVGRNGVGKTTLLSTIMGQTRQHAGSIRLNGSAIDTLTSHQRARLGIGIVPQTRDIFGSLTVEENLVAGLRNGSTVEEAFDLFPRLKERRSNGGRQLSGGEQQMLSIARALLGKPEIILLDEPLEGLAPVICEMLMEVFHKLAEHGQTIVLVEQHIGLALNFAERVIALDQGKIVLEERAATLALNRTELDRYIGLSAR >NC_022536|304141:307475|305031_305916_-|WP_048903033.1|DBSCAN-SWA MKNYVLTVSCDTTRGIVAALSAYLAEKGCNIVDSSQFDDFDTGRFFMRISFISEEDVSREALLEGLQPVAERFSMQAEIHDAKKRMKVLLMVSRFGHCLNDLLYRWKIGALPIEIVGVVSNHFDYQKVVVNHDIPFHHIKVTKENKPQAEAQLLDLVDQTGTELIVLARYMQVLSDSMCRKMSGQIINIHHSFLPSFKGANPYKQAYQRGVKLIGATAHYVTADLDEGPIIEQDTARITHAQSADDYVSIGRDVESQVLARAIHAHIHHRTFINGNRTVVFPPSPGSYASERMG >NC_022536|304141:307475|304141_305026_-|WP_022557393.1|DBSCAN-SWA MALIIDGKAAGADMIANVTNAVLKLEASHQRKPGLAVVIVGDDPASHAYVAAKSRMAKKCGFHSVQHTLPANITQSELLSLIARLNSDPKIDGILVQLPLPKHLNGEKVIQAVLPEKDVDGLHVVNAGKVATGDLTTGLVPCTPAGTMLLIQQIHGSDLSGLNALVIGRSNLFGRPMAQLLLNANATVMIAHSKSRNLPSLARQVDILVAAVGRPQMVRADWVKPGATVIDVGINRVAAPEKGEGKTKLVGDVDFEEVAAVAGAVTPVPGGVGPMTIAMLMANTLNAARPKACA |
4 | Staphylococcus_phage(66.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_30 |
331335 : 332106
Sequences of DBSCAN-SWA_30
Nucleotide sequences of DBSCAN-SWA_30 >NC_022536|331335:332106|DBSCAN-SWA TATGACTTCAAGCGCTCCGGCAGACAACAAAGACATTCGGGAGGTTGCGATCTCTTTCGACCAGGTGTCCAAATGGTACGGCAGCTTTCAGGTGCTACAGAACATCAACCTGAACGTCCGGGTCGGCGAGCGGATCGTGATCGCGCGTCCTTCCGGCTCTGGTAAGTCGACGATGATCCGCTGCATCAACTGGCTCGAGAAACACCAAAGCGGCCGTATCGTCGTTGACGGCATCGAATTGACCAGCAACCACAAGAAGGTTGAGGCGGTGCGGAAGGAGGTCGGCATGGTCTTCCAGCATTTCAACCTGTTCCCACACCTCACCATTCTCGAGAACTGTAGCCTCGCCCCTATCTGGGTGAAAAAGCAGTCGAAGAAGGATGCGGACGCGGTCGCCTTCAAGTTCCTGGAAAAAGTCCGTATCGCGGATCAGGCCCATAAGTACCCCGGTCAGCTTTCCGGCGGCCAGCAACAACGTGTCGCGATCGCGAGGGCTTTGTGCATGCGACCCAAGGTCCTTCTGTTCGACGAACCCACATCGGCCCTCGACCCGGAGATGGTTAGGGAGGTGTTGGACACGATGGTGCGGCTGGCCGAGGACGGCATGACAATGCTCTGTGTCACCCACGAAATGGGTTTCGCGAGGCAGGTCGCCGACCGTGTTATTTTCATGGATCGTGGCGAAATCCTTGAGGAGAACGGGCCGGACGAGTTCTTCTCCAATCCCCGCCACGAGCGCGCGCGGGCCTTCCTGAGCCAGATCATCTATTGA
Protein sequences of DBSCAN-SWA_30 >NC_022536|331335:332106|331335_332106_+|WP_022557429.1|DBSCAN-SWA MTSSAPADNKDIREVAISFDQVSKWYGSFQVLQNINLNVRVGERIVIARPSGSGKSTMIRCINWLEKHQSGRIVVDGIELTSNHKKVEAVRKEVGMVFQHFNLFPHLTILENCSLAPIWVKKQSKKDADAVAFKFLEKVRIADQAHKYPGQLSGGQQQRVAIARALCMRPKVLLFDEPTSALDPEMVREVLDTMVRLAEDGMTMLCVTHEMGFARQVADRVIFMDRGEILEENGPDEFFSNPRHERARAFLSQIIY |
1 | Planktothrix_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_31 |
340609 : 348328
Sequences of DBSCAN-SWA_31
Nucleotide sequences of DBSCAN-SWA_31 >NC_022536|340609:348328|DBSCAN-SWA TTTAGAGAGTTCCGAGCGAGGCGAAATAGCCACCGCTTTCATGACCCCGCGAAAGGATGTAGCTCACCGCAAAGTTCTCATTGGAGCTCGGTGCATCCTCTCGGAAGTGACTTCCCCGGCTCTCCGTCCGCAACAATGCGGCCCGGGTCATCATGCGGCCTGTCTCAATCAAGTTGTCGGTTTCGATCGCGTGGACAAAGTTGTCACGGGACGGAGCCGATACGGCCTGCAGCCGTGCCTGCAGCTCTGTCAGCTCCGACAGGAATTGACTAAGACCTGGCTCGTTTCGCAGGATCAACAAGCTGCGGTTAGCCGCGGCCTGCAGTTCGAGGCGTATGGCTGCTCCGTCCTCTCCACCGGTCCGGCTGAACTTCGCGAGGTACGCCTGGATCGCCTCGATACCATCTCCAACAGACGTCACGCCGACTTCGCGGCAGCGCTCGGCCGCAGCCTTTCCGGCGCGCGCACCAAAAACCTGGCAGGTCATAGCCATGTTGCCGCCGAGACGGTCCGCTCCGTGCGGCCCGGCCGCTGTCTCACCTGCTGCAAACAGCCCCTTCAGTGTCGTCTCGCCATTCTCGTTGATACGAAGACCACCGTTGATCGCATGCGCCGAACACGTGATCTGAACCGCGTCCTTATAGAGATCGGTTCCGCGTTCCTTGTACCACTCGTAGGTCAGCGGCCACATCTGGCCGAATGAGCGCGTCTTGTCGGCGAGCAGGCGATCGAAATCGCAGTTCTTGAAGTCCATGTAGACGCCGCCCTTGTCGGTGCCACGGCCTTCGTTGATCGCTCGCTGGATCGATATTTCGATATAGCGGGAAATGTCGGATGAGCTAAACGGGAAGTGACGTTCCTTCTCCTGCATGACCTCAGCCACCGTCAGGCCAGCCGGAAGGTAGCTCTCGACGAAGGGCACACCATCTCGGTCCGTGAGGTTGGGATGAGCGTCCCAGAGGTAGTTGCCGAACAGGTTGATGAATGGGTCGATAATGGAAACGCCCGCCTGCATGAACTCCATGTTCGCAAGCTCGGCCCCTGCCCGCCAAGCCATGGCATAGCCGTCCCCGGTGATATCGGCCGGGTACAGGTTTTTGGAGAACAGCTGGCTCGCGCCGCCGGTCGTCAGGATCACAGACTTGGCGCGGACGAAGACAAGGCCGCCCTCGTCATCGATAGCCATGGCTCCGTGGCACGAGGTACCGTCGCTGACCAGGCCGACAACCATCATCCGGTCCAGCGCACGGACGCCGCGCCGGGTCGCCTCTTCACCCAACGCGCGCATAATTGGCTTGAAGTGGTCCTTCAGCACGTGCGATCGCGGCTTAGAGGAGAAGCAGGCCCGGAAGGCGAGATACCGATCCTCGTTCCTCTCGAAATGGACTCCGAACTTCTCCAGATAGCGCAGGCTGGGCTCGGACTCGTTGACCAGGATATCCACGAGGCGAGGATCCGCCATACCCTGAGCCGCCTTGAGAATGTCTTCTCTGAAGACGTCTGGATTATCGGCGGGATCGCCCGCACCGTCAGGGACGTTATAAGCGCCCACTTCGGCAACGCTGTAGAAGGTCGCTCCGCTCTTTCGGAAGTTGTTCTTGACGAGGACCGTAACCTCGGCCCCCTCGTCGCTGGCCGCGAGAACCGCTCTCAAAGCAGCGCCGCCGCTGCCAATAACGAGAACGTCTGTCTCAAACGTCTTGTAATCCGACATATGAATCATCCTTAGCTGAAGTGACCGACTTCCATGGTCTTGAGGTCTTCGGCGATTTTGACCGAGAACGGGATCAGCCGGAGAATGTCACGGTCGACGTCGACGCCGGGTGCGACCTCAACCAGTTCGAGCCCGTCCGGACGTCCTTCAAACACTGCGCGCTCGGTGACGTAGGTGACCCGCTGGCCACGTTCGACCGCAAGCTTCGCGTTGAACGTGACCTGCTCCACGGCGGGCACGAACTTGCTGAACTTGCCTTCGTTCTTAATGATAAGGCGACCGTCTTCTGTCCCGACGTCGAGGCCGCCGGCGCTGAACGCTCCGCAGAACACAATTCGAGGCGTTCTGCTTACAATGTCGATGAAGCCGCCGCACCCGGGAACAATCCCATTGAACTTGCTGACATTGACGTTGCCGTTTGCGTCGATTTGCCCGCAGCCAAGGAATGTGATGTCCAGGCCGCCACCCTGGAAGCAATCGAAGACCGACGTGGAGTCGATGATGGCATCGGGGTTGATGTTGGTCCCGAAGATCGCCCGCTCGCCCGGGATACCGCCCAGCGAACCATGCTCAGGAAAATACGTCGCGTTCGGCGCCGACCGCCGCTCGTGAAGAATGGTGGGCACGGCGATCGGCATTCCGACGCCGAGGTTGACGGTCGCTCCGTCGAACACTTCAAAGCTCGCTCTGCTCGCAACGATACGTTCAATACCGGGAGGCACGATACCGACATCGCCAACCGGCAGACGAACCGCACCCGTCAGTGCTGGATTGAGGCGTCCGCCTGCAAGCTCCTGGGCGGGCACCACGACGACGGCGTCAACGAAGATGCCTGGAACTTCGACCATCCTTGGATGGATCGACCCTTTCTGCGTCATCCGGCGAACCTGGACAACGACCTTGCCGCCGGAAGCCTTGGTTGCTTGTGCGATGGCTTTGATGCCGAGCGATACCGGCTCATCCTCAAGGCTGATGTTGCCGCATTCGTCCGCTGTGGTGCCGCGCAGGAACGCCACATCGATTGGCCGCACATCGTAGGCGAGATGGACGTCACCTTCGATCTCCACGTGTCTGGCAAGCGACTCCCCGGCCGCCCGGTTCATGCGTCCGCCTTCGACGTCTGGATCGACGAAGGTCCCGAGGCCCACCTTCGTGTAGGTCCGCTTTCGTCCAGCGGAGCAATCTCGCAGCATCTGGAAGATGGTTCCCATCGGGGCGACGTGCGCGGCGAACGCCCCTTGCTTGAGAAGTTCGGTATATCGTGAGGTCTTGAGGTAGCTGTATCCGCTGCCGATTGCCGTGGCGACCAGACCGGGGTGGGCCAGGTTCTCGAGGCCCCGGCCCTCCCCCATGCCGACGATAATGGGATGGATTTCCGTGATTGCCTTGGGCGACCCGCTTTGAACATAGCGCGCTCCGAGGGCGCCGAGCAGCGCATCGGGAGTGAGGACGTTTTCGCATCCGCAAATCGCGATGCAGTCGTTGTCCCTTACCCAGGATGCTGCCTCTTCGGCCGATACGATGTTGACGTTTTTCATTGGAGTTTCCTCTAAAGGATCTGTTTTTCTTGTTTGGCCAGCTGGCCCCGGGTGCGACTACGCAAAGGTGCGAACTATTCCGCCGTCGGCCCGTATCGGGGCCCCGTTGGTGGCGGCAGCCGCGGGGCTGCAGAGATAACAAATGACGTTCGCCACTTCTTCGGGCGTGGTGTACCGCTGGATGAGCGATGTTGGACGGCGCGTGCTGAAGGTCCGGGCTATCAATTCGCCCTCACCGACGTTTTGCTCCAGCGCGCGTTGCTTGATCTGATCCCTGGTGGTCTCGACCCAGGTGGGACCGGGCAGAACGGAGTTGACGGTGACGTTGGTACCAGCCGTCAATTCCGAGACCCCTCGGCTGAACGCCAGTTGAGCAGCCTTGGAGACACCGTAGTGGATCATCTCCTTGGGGATGAACACGCCGGATTCGCTCGAGACGTTCACGATCCGGCCCCAGCCCTTGTCGAGCATGCCCTGGAGGTAGTGACGCGTGAGACGGACACCGGACATGAGATTAAGCCGGAACATCTCATCCCATTCGTCGTCGGTTGTCTGGAAAAAATCCTTCTGGTTGTAGAAGCCAGCGTTGTTTACGAGGACGTCTACATGCGGAAGCTCTTCGCACAGTCGGGATATGTTGGAAGGGATTGTTAGATCGGCGGCTACTCCTCGGACATGGACGTCTCGGTCTAGGCGCAAGGCTTCAACTGCCACATCCACGGACGTCTGTGAGCGTCCCTGAACAACAACCTGCGCGCCCATGCCAGCCAATCCAGCGGCGACGGCGAAACCGATCCCTTTCGTCGATCCCGTTACCAAAGCGACTTGGCCATTCAGATTTAAGTTCATTGACATCCTCCTTGTTCGCCGACCGACCCTCTGCGGTTCGCCAGAAAACAATGGCATATGGCCGCGGGTCGTCAGGCCCACCAGCGCTCGCTGCGGGCTGATCGAAATTCGTTTTTGTCTTCGAGTTTGAGATGAGCCGTTAACGGGCTACTGTCTCGCTATGGCGGTTCTCGTGCGCTCTCATCACATCGTCGTCACCTTCCGCCATTCCAGAGGTGCATTCTGCGGCGGGAGAACGCGTTCGAAGGATGCAGCGGCTCGCAGCACAGTTGCGTCATCAAGATGCCTGCCGACAATCTGCAGACCGACAGGGAAGCTGTCGACTATACCGCACTGAATGCTCGCAGCCGGCTGGCCAGTAAGATTGAACGGGAACGTAAATCGAGCGTAATCGTCGTAACTCACATCCTTGCCTGCGATTTGGCCTTTGCCAGTACCGTCTGCCGCAATGGGCAGAAGAGCCACGGTTGGCGTGAGGATGAGATCGAACGATCCCATGACGACCGCCATGGCGTTCACAAACGCTTTTCGAGTCGTGATCGCGTCGGTAGCCGCTTCGAATGGAAGGTTTCGCGCAAGCAATTCGTGAAGTGCTTCGCCAACAGGCTCATGAGACTCCTCGACAAGCCGACGAACGCCCGTGATGTCCGTCTCCAGTGCCATGATCGTCTGGAACGCCTCTACGATCTCCAAGGCCGGAGGCGAGCTTTCGACAACGTCAAACCCGTAGGCTTCCGACAGTTTACCGACGGCTTCGGTCACCTTCCTCTTGATGCGAGGATCGGTTGGCTGGTCGTTCCAATCCGGCCAGTAGGCGACGCGCAATCCCCTTGGGAGGTTAGAGCTGGCGGCCCCGAGCCAATCAACGTCGCTGGTGTTGTTCGACAGACGGTCTCGACCATCCGCCCCCGCGATTACGGACAGCATGAGGGCGGCGTCCGCGACGTTCCTGGTGATCGGACCAACGTGTTCGAGAGACTCCCAGCCGGACGCCCCCGGTAGGCGCTCGTCGCGGCATCCCGGCCAAAGCGGAACTCGTCCCATCGACGCCTTGATACCGACATTGCCCGTCAGGGCCGAGGGTATGCGTATCGAGCCGCCGCCATCGCTGCCGATCGACAACGGGCAAACTTTTGCTGCAACCGCAGCCGCAGAACCTGCACTCGATCCTCCGGGCGTCTTCGTCAGGTCCCACGGGTTGCAGGTGGGTTCGAAAAGGAGGTTGCATCCGTGCGCCGAGTAACCAAACTCCGATGCATTGGTTTTTCCAATCACAATCGCGCCGGCCGCCTTGAGGCGTTCCACGACGATATCGTCATCTTCCGGAATGAAGTCTTTGTACAGTCGGGAGCCGAATGTCGTCCGCAAACCCTTTGTGAGGACGAGGTCTTTGATGGCGACCGGCACCCCAGCGAGCGGACCCACGCTTTGACCTTGCGCGAGGTCGGCATCCAGCTTTCTGGCCGCACGTATCGCACCCTCCTTGTCCAGCGTAGCAAACGCGTGGATGTGAGGTTCTGCGTCCTCAAGGACGGCAAATGCGGCTTCGATAACCTCCAGGGCGCTGACCTGCCGCGATGAGACCAGGTCTCTGATCTCAACCGCGGTCAGCTCGGTGTATTCATCAGGCGTCATGGACGTAACCCCGAGTTTGGAAATGCCGACTTGCGTGGAGTAAGATGTGTAATCTGAGCCCCGCGCGGTTCGGCGCGAAGACACGCCGCCTGGTGTCCCTCTTCCACCTCGATCAACGGAGGCATCACCGTACGACAGTTGCTCATTTCGACGGGACAACGGCCGGCAAATCGGCAGGTGTTCGGGTCCGGATCGATCGGGCTGCTGGCAGACCCTTCCAACCTTTGCGTTTTGATGCCGCGCCGGGCCGGATCAGGGCTCGCCGCCAAGAGAGCCTGTGTATAGGGATGAGCCGGGTTCTCGAACACCTGGTTTGCCGGCCCGCTTTCGATGATCTTGCCGAGATACATCACCAGAATGCGATCACTGATGATCCGAACCACCTCCAGGTCGTGCGTGACGAACAGGTAGCTCATGCCGCGCTTGGCGCGTAGTTCGCCGAGGAGACGAAGGATGGTAGACTGGACGGAAACGTCCAAAGCCGATGTCGGCTCGTCGAGGATCAGCAGGCGCGGCTCAACCGCGATCGCCCGAGCGATCCCGATGCGTGCCTTTTGACCGCCGGAGAGTTGATGCGGATAGCGTGAGCCGAGCGCTCTTGGCAATCCGACGTCGTCGAACGTGCGGTTCACCTTTTCCTCGACTTCAGCAGCGCTGAGATTGGACATCAACCGGCGGACTGGGTCTGCCACCGCCTGAAACGCCGTGAAATGCGGGTTCAGCGACTCGGTGGGGTCTTGGAAAACGATCTGAATCTGAGCGCGATGAAGGGTTTCCGCGAACTTCTTCGTTGGCATCTGCGTCAGGTCGACCTGGTCGAACAGGATCGATCCGGACGTGGGATCTATCATTCGGGAGAGCAAGCGCGACAGAATGGTCTTGCCACAGCCGGACTCGCCGACGAGGCCGACACATTCGCCCTCCCCGATCGAAAAATTAAGGTCGTCGACCGCATGAACCCAGTGGGTCTTCAGACCGCGACGTACCGGGAACCGCCTTGTCAAACCGTGTGCAGTGAGGAGATTTTTCATTTCGGATACCTGCAGCGAACGAGATGATCGGAATCGATTTGAGCCCACGGCAGAGGTCCGAGCGAACAGTCCTCTTTCGCGCGTTCGCACCGGGCGATGTAACGACAAGGCGGGAGCGTGCCCCTGAGGTCGGGCAAGCTGCCTGGAATGACTGCCAGCTCACTCAGATTGGCAGCCGCCGACGGCAGAGCCGCAAACAGCCGCTCTGTGTACGGATGCTTGAGCTCGTCGGGGAAGCTGCGCGATTGTGCCACCTCGACGATGTGTCCGGCATGCATGACTGCAATTCGATCGCAGTACTCACCTGCAAGAGCGAGATCGTGGGTGATCAGGATCGCGGCCATTTTGGTTTCGGCAACCAGATCGCGGATCAGGTCCATGATGACGGCCTGGGTTGTCACGTCCAGACCGGTTGTCGGCTCGTCGCAGATCAGCAAGGCCGGCTTACATGCCAATGCCATGGCGATCATCACGCGTTGGCACATGCCGCCCGACATTTCGAACGGATAGGCCTCGTACCGCCGCTCGGGATCCGGGATTCTCACGCGGGTCAACGCCTCGATCGCCGCCGACCTGAGCGCCTTGCGACGAAGGGTCGTATGACGCCGCAATACGTCCTCGATCTGATGCCCAACCTTGCGGATGGGGTTTAAGGCCGTTCGCGGGCTCTGAAAGACCATGGAAATCTCGCGGCCACGCAAATCCGCCATCTTCTCAGGCTTCAGGGGAGCACCCCTGAAATTTATGCTTCCCGATCGAATGTTGGCGGCTTTGTCGGACAGGCCCATAATACTGTAGGAAAGTACCGATTTGCCGGACCCGCTCTCGCCGACAATGCCGAGGATTTCGCCTTCGCTGACGTTGAAACTGACATCTTCGAGAGCCTGCACCCGTCCGCGACGAGTAAGGAAGTCGATCGAGAGGTCCTGTACGCTGAGGAGCTGTTTTGTCAT
Protein sequences of DBSCAN-SWA_31 >NC_022536|340609:348328|342333_343860_-|WP_022557440.1|DBSCAN-SWA MKNVNIVSAEEAASWVRDNDCIAICGCENVLTPDALLGALGARYVQSGSPKAITEIHPIIVGMGEGRGLENLAHPGLVATAIGSGYSYLKTSRYTELLKQGAFAAHVAPMGTIFQMLRDCSAGRKRTYTKVGLGTFVDPDVEGGRMNRAAGESLARHVEIEGDVHLAYDVRPIDVAFLRGTTADECGNISLEDEPVSLGIKAIAQATKASGGKVVVQVRRMTQKGSIHPRMVEVPGIFVDAVVVVPAQELAGGRLNPALTGAVRLPVGDVGIVPPGIERIVASRASFEVFDGATVNLGVGMPIAVPTILHERRSAPNATYFPEHGSLGGIPGERAIFGTNINPDAIIDSTSVFDCFQGGGLDITFLGCGQIDANGNVNVSKFNGIVPGCGGFIDIVSRTPRIVFCGAFSAGGLDVGTEDGRLIIKNEGKFSKFVPAVEQVTFNAKLAVERGQRVTYVTERAVFEGRPDGLELVEVAPGVDVDRDILRLIPFSVKIAEDLKTMEVGHFS >NC_022536|340609:348328|343917_344709_-|WP_022557441.1|DBSCAN-SWA MNLNLNGQVALVTGSTKGIGFAVAAGLAGMGAQVVVQGRSQTSVDVAVEALRLDRDVHVRGVAADLTIPSNISRLCEELPHVDVLVNNAGFYNQKDFFQTTDDEWDEMFRLNLMSGVRLTRHYLQGMLDKGWGRIVNVSSESGVFIPKEMIHYGVSKAAQLAFSRGVSELTAGTNVTVNSVLPGPTWVETTRDQIKQRALEQNVGEGELIARTFSTRRPTSLIQRYTTPEEVANVICYLCSPAAAATNGAPIRADGGIVRTFA >NC_022536|340609:348328|344892_346344_-|WP_048903038.1|DBSCAN-SWA MTPDEYTELTAVEIRDLVSSRQVSALEVIEAAFAVLEDAEPHIHAFATLDKEGAIRAARKLDADLAQGQSVGPLAGVPVAIKDLVLTKGLRTTFGSRLYKDFIPEDDDIVVERLKAAGAIVIGKTNASEFGYSAHGCNLLFEPTCNPWDLTKTPGGSSAGSAAAVAAKVCPLSIGSDGGGSIRIPSALTGNVGIKASMGRVPLWPGCRDERLPGASGWESLEHVGPITRNVADAALMLSVIAGADGRDRLSNNTSDVDWLGAASSNLPRGLRVAYWPDWNDQPTDPRIKRKVTEAVGKLSEAYGFDVVESSPPALEIVEAFQTIMALETDITGVRRLVEESHEPVGEALHELLARNLPFEAATDAITTRKAFVNAMAVVMGSFDLILTPTVALLPIAADGTGKGQIAGKDVSYDDYARFTFPFNLTGQPAASIQCGIVDSFPVGLQIVGRHLDDATVLRAAASFERVLPPQNAPLEWRKVTTM >NC_022536|340609:348328|340609_342322_-|WP_048903087.1|DBSCAN-SWA MSDYKTFETDVLVIGSGGAALRAVLAASDEGAEVTVLVKNNFRKSGATFYSVAEVGAYNVPDGAGDPADNPDVFREDILKAAQGMADPRLVDILVNESEPSLRYLEKFGVHFERNEDRYLAFRACFSSKPRSHVLKDHFKPIMRALGEEATRRGVRALDRMMVVGLVSDGTSCHGAMAIDDEGGLVFVRAKSVILTTGGASQLFSKNLYPADITGDGYAMAWRAGAELANMEFMQAGVSIIDPFINLFGNYLWDAHPNLTDRDGVPFVESYLPAGLTVAEVMQEKERHFPFSSSDISRYIEISIQRAINEGRGTDKGGVYMDFKNCDFDRLLADKTRSFGQMWPLTYEWYKERGTDLYKDAVQITCSAHAINGGLRINENGETTLKGLFAAGETAAGPHGADRLGGNMAMTCQVFGARAGKAAAERCREVGVTSVGDGIEAIQAYLAKFSRTGGEDGAAIRLELQAAANRSLLILRNEPGLSQFLSELTELQARLQAVSAPSRDNFVHAIETDNLIETGRMMTRAALLRTESRGSHFREDAPSSNENFAVSYILSRGHESGGYFASLGTL >NC_022536|340609:348328|346340_347375_-|WP_022557443.1|DBSCAN-SWA MKNLLTAHGLTRRFPVRRGLKTHWVHAVDDLNFSIGEGECVGLVGESGCGKTILSRLLSRMIDPTSGSILFDQVDLTQMPTKKFAETLHRAQIQIVFQDPTESLNPHFTAFQAVADPVRRLMSNLSAAEVEEKVNRTFDDVGLPRALGSRYPHQLSGGQKARIGIARAIAVEPRLLILDEPTSALDVSVQSTILRLLGELRAKRGMSYLFVTHDLEVVRIISDRILVMYLGKIIESGPANQVFENPAHPYTQALLAASPDPARRGIKTQRLEGSASSPIDPDPNTCRFAGRCPVEMSNCRTVMPPLIEVEEGHQAACLRAEPRGAQITHLTPRKSAFPNSGLRP >NC_022536|340609:348328|347371_348328_-|WP_022557444.1|DBSCAN-SWA MTKQLLSVQDLSIDFLTRRGRVQALEDVSFNVSEGEILGIVGESGSGKSVLSYSIMGLSDKAANIRSGSINFRGAPLKPEKMADLRGREISMVFQSPRTALNPIRKVGHQIEDVLRRHTTLRRKALRSAAIEALTRVRIPDPERRYEAYPFEMSGGMCQRVMIAMALACKPALLICDEPTTGLDVTTQAVIMDLIRDLVAETKMAAILITHDLALAGEYCDRIAVMHAGHIVEVAQSRSFPDELKHPYTERLFAALPSAAANLSELAVIPGSLPDLRGTLPPCRYIARCERAKEDCSLGPLPWAQIDSDHLVRCRYPK |
6 | Acanthocystis_turfacea_Chlorella_virus(25.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_32 |
353093 : 357989
Sequences of DBSCAN-SWA_32
Nucleotide sequences of DBSCAN-SWA_32 >NC_022536|353093:357989|DBSCAN-SWA CATGACCATGAGCCACTTTCCCAAACCAACCGCGACGAAGAAAGCCAAGCTCTCCGACGTCGCAAGCTTGGCGAACGTGTCGGCATCGACCGTATCACGCGTCCTCAATCAACCGGACATAGTGCGTCCGGAGATAAGGGAACGCGTGCAGAAGGCCATCGACATGCTGGACTACACGCCGGATCGGATGGCGAAAGCCCTCAGCTCGGGACGTAGTCACACGATCGGTACGGTGGTGCCCACGATCGGCGGTAGCGCAATCTTTTCTGATGGCGTGGAAGCGCTGCAAGATGAGCTCGAAACACTGGGTTACTCTTTGATCGTGTCCAGTTCGAGATACGATATCGAGAGGGAGTTTCAGCAGATCAAGACACTGCTTGACTACGGAGCGGCGGGACTCGTCCTCGTCGGCGACACGTTTTCGCCAGATGCTTTGAGACTCATAAAGAAGCAGGGCATCCCAACGGTTACGACCTACACGAATAGATCCCGCCACGGCATACCGGCGATCGGGATCGACAACAGGGCGGCTGGTCGAGAACTGACCCGCTTGCTGCTCAAGCTGGGGCACAAAAAGTTCGGTATCATTGCCAACACCGTGCTTCAGAACGATCGATCACAGGCCCGTTTGGACGGCGTTCGCGAGGCACTTGCCGAAGCAGGCGTTCCCCTGCCCCTCAATAATGTCGTTGAGGTGAAACTGCCCACGATTGAAAACGGCAGGCGCGGCCTCGCGGGTCTGCTCGAACTCGCGACCGTGCCGACGGCAATTATATGCACCACGGATGCACTGGCGCTTGGGGCGATGTCGGAGGCTCGTAGAAAAGGCCTTCACATCCCTGATGATCTCTCCGTTGTAGGATTTGACGACATTGACGTCGCAGCAGAATCGGACCCAGCGCTCACGACGGTCAAAAACCCGGCGGCCGAGATCGGCCGCATCGCCGCCGATTACCTCAATCGCGCGATGAACGGTCAGCCTATTCCCCTGGAAACCGAACTACCTGCGGGTTTGAGCATAAGGTCTTCGACGGCAAGCGCGCCTGCTGACTAGCGCGTCTCTACGAGAAACGTCATCGGGACGTCGGAGTCCCTAATTGACAGGTAGGTCAGGCGGCCCTGTTACTGGCCGAAGTTGGGCCATTCCGTCCGTCTTTGATGACTAGCAATTGGGACTGCTCGTAGGTCCGCCACGTTATAGCGAACCGAGAGGCATTTTTCCGCCACGAACTGGCTCAACAGCGACTGTAGGCAGCGTTGTTTTCAAAGCGCATTGTAGAAGCCGAGCGCTTCTTCAAGCTGAGTTTCCTCATAGCCGAAGTGCGAGCGGATGACGGCATCGAGCGTATCGAACGTCTTGCGCAGGGGCGCCAAGGTCTCCGGGCCGGGATTAGCGTGGATCGCCTGGGCTTTCTCCCAAAGATTTTCGAGCAGTTCGTGGACGATCTTGTGCTCAGCAGCGAGTTGATCGAGAACGGCACGCAGGCCATAGGTGCCGTTCGCGCGCAAAATGGGGAAGATGTCGTTGTCCTCTGATGTGTGGTGCCAAGTCAGGGCACGACATTCATGGCCGCAAAGTGTGCCGAACGCCTTATAGTTCTGGGTCATCTGTAACTGGTCCACGGCCGCGCCTAGTGCCGTAGGCGCAGCGTTTCCCGTCTCGATCTGGATGATCAGCGAACGCACCCTATCCAACTCGCGCATGTGAATGCGGTGCATCGCGGCCAGATGCCGGCCGCGCTGGCGATGCTCTTCTGTAGCGCCTGGAATGGGCGGCGCCTTCGGTCGTTTGGTTTCATCGATCAGAGTGACGGTCATCCCGATAGGCCCGCCACCGTTCTTGCTGCCACGCTCTTATCGTCCAACCTCAGTTGCTGCCCTTTCACCGAAGAAGCAGCCACATGGGCTGCCACACGGCCAACATCGTCCGCGGTGATCCAATCGGGCTTCCCTTGCGGACGGCTGCGTGAGACGACAATGCCGAGAACGAGCGTGTTGACCCGGAATCTGTCGTCGGCATCGGCCTGAAGGTCGGTCGCAAGCATGAGCTGTCCCCCGCCGAGGATACCCATCAAGGACATGCCCGGGAGAGGCAGAATCGCGGACATGCCAGCGATAAAGGAAAAGCTGCCACCTCGGCGCAAATGCGGTCCGAGGGCACGGGAAGCGGCATAATGAGCCTTGATATTGGGAAGGACGTATTTATCGAACTCAGCCTCGTTGACGTCCCATACCGCCTTGCTCTGCGACCATCCCGTGTTGACGCTGGATACAACATGGTCGATCGCGCCATGCGTTTCGGCGAGTTCATTTGCCAGCGCGCGATAGCCATCGAAGGTCGTTGTGTCACCCGCCACGGCGACAAGCTGCTCAAGGTTCTCATCCTTGAGATAGCTCCGCAGCACGTCGAGTTTCTCACTGTTGCGACTGAGGGCGATGACCCGTGCCCCAGCTCGAAGGAAGGAGCGCACGATGCCTTCTCCGACATTGCCGGTGGCTCCGTTTACCACCACGAGCTGGCCGGCAAGCGAAGGTACTTGCTCATAAGTGATTGCCATTTTATACCTCCGTATATTAAAGAAACTGGTAGATACCTATCTGCTGGTTGTATCTTTGCAAGACGGCACTTCTACGCAACCGGAGTATCCTCGATGAAACCACATCTCGATCACACGAATCCCGGCTGTGCGCCGGTGCGCCACCTTTTGCAGACCATTGGCAGCAAATGGGCCGTCTTGCTGGTCACGCTGCTCTCGGAGCGTCCGAAGCGGTTCTCCGAGCTAATGCGGGAGGTCAGCGGCATCACGCAGAAATCATTGACCTCGATACTGCGGGAACTGGAGAAGAATGGAATTGTCGAGCGTATGGTGACGCCCACAATCCCTCCCCGCGTCGACTATGCGCTGACGCCGCTCGGTACGACGCTGCTTGGGCCGCTCGATGCAGTGCAGAACTGGGCCATAGAGAACTCGGCGGCCGTGGAAGAATCACGCTCACGTTATGAATCTCGCAACAACAGTTAGAGATTTGACCATAGGGCGCGATAAAGCTTAACCGCAGGCGGTGTTCCCGTCGCACCTGGCGATAGTTCATTGTGCCGCTTCGAAAGCTTCTGCCGCGAAGCCTGCGGTAGAAGCGCAGCAAAAGTAGAGGTCACACCGGCCAGATCGACCAAAATAGGGTGCCGAGACAAACGATTATCGCCAGTATTGCGCCCATAATTCGCGCAAGACCTTGATTCACATTGCCGCTTCCAACGACAAGATAATACCAAAAGTCAGAATTGGATTTCTTTTCCTGCATTCTTAGGTTTTCCCTTAACTCGACCAAAAGATATTGTCGTAGTCTGATAGTTGTAGCGGTGAGCGCACTATTCAGCCGCCAGCTTTCCACGCGGCATCACTAGCACACCGCGCGGGAAGAAGAATCAGGCCCGTTCCTTGAGATACGCAGCGGCATTGGTGGCAGCATTTCGCCCTGAGTAAAACGAATAGCTCACCGCCAGTCCGCCATTGGTGTAGTCGTAAGAATCGCCAACTAGCATTCCGGACGATTCGTTGCCTGTCGCATACAGTCCGCAGATCGCCCCATTTTCTTCGCTGATGACCTCGTTCCTGGTGTTCACCTCGACGCCTCCCATCGAGCAGAAGCCGCAGGTGCTCAGCACAAATCCGTAGAACGGGCCACCTTCAACCTTCAGCAGATACTCGGCGGGCTTGCCGAACTGCTCATCCGTTTGGGTTTCGGCGTAGTGATTGTAGTCGGCGATCGTCTTTTGCAATGTATCCGCAGGAACACCGAGCTTTTCAGCCAGCTCAGGCAGGCTGTTAGCCTTCTGGATATAGGTCTTGTGTCCGTTCACGGCCCCGTCGATTAGGTGCTGCAGTTGATCCAGCTTGTCGCCGTTGCGGACCGCCAGACCCATGCCGACGAAATTCCCGACTTCGATCAGGCGATTGACATTGTCCTGACTGAGAATGCTGAACACCTTATTCTGGCTCAGATTTGCATTTCCGGCCATGACCATGTTGAAGACGACGTCTTCGTTGACAAATCGCAGGCCTTTCTCGTTGACCCAGAGCAACGGCTGCATCGCGGCAGCGTTGGTCAGATGGTTCATCATGGCGAAGGTGGGCTGCTCGGAGTCTTTCAGAAATGCGCCATAGACCATGAGCGTTCCAACCCGATGCTTTCTGGCGCCGACGGACCACATTGACCGGGAGCCGTCACCTGTAGCGTGGCCGTGATTTAGAGGGACGAAGCGGTCTATGTCGTAGTTCGTGTATTCACGGAGCATTTCTTCGTTGTTAATGTATCCCCCGGTTCCCACAACGACTGCGCGGGTGGCGATGAATTCGGTTTCGCCGGTGCTTTCGTTTCGAAGCGTAACGCCCTCGATGTCGCCGGCTTCGTCGGCGTGAAGTTCCACAAGCGCGGTGGAAGTCAAAATCTCTGCGCCCAGCTTCTTCGCATTAGGGATCAGAGCGTCGTTGATGACGGCCCAGCCGCCACCTTTGTAAGGCTGCATCACCGGTTTTCCGTGACCCATGGCTGTGATCGGATCGAAGCTGACGCCAAGGTTTCTCAGCCATTCCACGGTGTCGTTGCTCGCATTGATGAGGTTTCGCAGGATAGGGAGATTTGCCTTATACTTTGAAAACTCCGCCTCCTCCTTGAAGACCTCTTCCGGGGTGATCTCAATACCCTGAGCTTTTGCTTCGGCGGTTTGAAGGCCCAGCAGCCCTTCAGTAAAAACACCACAACCGCCCACCGCGTCGGCCTTCTCGACCAGAAGCACCGACAGTCCTTCCTGTCCGGCCTGGACGGCGGCGGCACAGCCAGACGCTCCGCTGCCCGCAATGACGATATCGTAAGAGCTATACTTGCAGATTGACAT
Protein sequences of DBSCAN-SWA_32 >NC_022536|353093:357989|354906_355650_-|WP_022557452.1|DBSCAN-SWA MAITYEQVPSLAGQLVVVNGATGNVGEGIVRSFLRAGARVIALSRNSEKLDVLRSYLKDENLEQLVAVAGDTTTFDGYRALANELAETHGAIDHVVSSVNTGWSQSKAVWDVNEAEFDKYVLPNIKAHYAASRALGPHLRRGGSFSFIAGMSAILPLPGMSLMGILGGGQLMLATDLQADADDRFRVNTLVLGIVVSRSRPQGKPDWITADDVGRVAAHVAASSVKGQQLRLDDKSVAARTVAGLSG >NC_022536|353093:357989|354355_354910_-|WP_022557451.1|DBSCAN-SWA MTVTLIDETKRPKAPPIPGATEEHRQRGRHLAAMHRIHMRELDRVRSLIIQIETGNAAPTALGAAVDQLQMTQNYKAFGTLCGHECRALTWHHTSEDNDIFPILRANGTYGLRAVLDQLAAEHKIVHELLENLWEKAQAIHANPGPETLAPLRKTFDTLDAVIRSHFGYEETQLEEALGFYNAL >NC_022536|353093:357989|353093_354146_+|WP_022557450.1|DBSCAN-SWA MTMSHFPKPTATKKAKLSDVASLANVSASTVSRVLNQPDIVRPEIRERVQKAIDMLDYTPDRMAKALSSGRSHTIGTVVPTIGGSAIFSDGVEALQDELETLGYSLIVSSSRYDIEREFQQIKTLLDYGAAGLVLVGDTFSPDALRLIKKQGIPTVTTYTNRSRHGIPAIGIDNRAAGRELTRLLLKLGHKKFGIIANTVLQNDRSQARLDGVREALAEAGVPLPLNNVVEVKLPTIENGRRGLAGLLELATVPTAIICTTDALALGAMSEARRKGLHIPDDLSVVGFDDIDVAAESDPALTTVKNPAAEIGRIAADYLNRAMNGQPIPLETELPAGLSIRSSTASAPAD >NC_022536|353093:357989|356519_357989_-|WP_022557455.1|DBSCAN-SWA MSICKYSSYDIVIAGSGASGCAAAVQAGQEGLSVLLVEKADAVGGCGVFTEGLLGLQTAEAKAQGIEITPEEVFKEEAEFSKYKANLPILRNLINASNDTVEWLRNLGVSFDPITAMGHGKPVMQPYKGGGWAVINDALIPNAKKLGAEILTSTALVELHADEAGDIEGVTLRNESTGETEFIATRAVVVGTGGYINNEEMLREYTNYDIDRFVPLNHGHATGDGSRSMWSVGARKHRVGTLMVYGAFLKDSEQPTFAMMNHLTNAAAMQPLLWVNEKGLRFVNEDVVFNMVMAGNANLSQNKVFSILSQDNVNRLIEVGNFVGMGLAVRNGDKLDQLQHLIDGAVNGHKTYIQKANSLPELAEKLGVPADTLQKTIADYNHYAETQTDEQFGKPAEYLLKVEGGPFYGFVLSTCGFCSMGGVEVNTRNEVISEENGAICGLYATGNESSGMLVGDSYDYTNGGLAVSYSFYSGRNAATNAAAYLKERA >NC_022536|353093:357989|355743_356115_+|WP_022557453.1|DBSCAN-SWA MKPHLDHTNPGCAPVRHLLQTIGSKWAVLLVTLLSERPKRFSELMREVSGITQKSLTSILRELEKNGIVERMVTPTIPPRVDYALTPLGTTLLGPLDAVQNWAIENSAAVEESRSRYESRNNS >NC_022536|353093:357989|356245_356395_-|WP_022557454.1|DBSCAN-SWA MQEKKSNSDFWYYLVVGSGNVNQGLARIMGAILAIIVCLGTLFWSIWPV |
6 | Enterobacteria_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_33 |
362672 : 364175
Sequences of DBSCAN-SWA_33
Nucleotide sequences of DBSCAN-SWA_33 >NC_022536|362672:364175|DBSCAN-SWA TATGGCCTTCGATTGGTCCTACACTATCGGGCTCCTGTGGAGCCGCGACTTTTGGAACGCTTCGCTCACAGTTGTCGAACTCAGCGTAGCCACTTGGGTGATCGCTGTTGTCCTCGGCTTCTTCGTAGCGCTCGCGGATCGCTCGAAATACATGGTCGTGAGACGGGCTGCCGCCCTCTACGTCTGGTTTTTCAGGAGCCTACCGCTGCTCGTTCTCCTGATTTTCGTCTACAATCTTCCGCAGGCCTTTCCCTCGACCTCGGGCCTCCTTTCCGTGCCGTTCATCGCCGGCCTGATGGCTCTCGTTCTGAGCGAGACAGCTTACATAGCCGAAATCCACCGGGGCGCGTTGCTTGCGGTTGGTAAAGGCCAGTATGAGGCGGGTCGCGTCCTCGGCCTTTCGACAACAGGAATTCAGCGAAAGATCGTCATTCCGCAATCCTTGCGCATCGCCCTTCCCGCCCTGGCGAACGAGTTCATTTCAATCGTCAAACTGACGTCGCTCGTATCCGTCATCTCGCTGGCAGAGATTCTGCTGGTCGGCCAGCGGTACTATACCCAAAACTTCCTTGTGATCGAGACGATGGTCGCTGTGGCCTTCTACTACGTTCTGATCGTATCCGTGTTCGACTTTGGCTTGAAGACGCTTGAGCGTCGACTGGACTTCTCCCGCCGGACCGCTCACCTGACAAGCGAAGGTGTGGATTTGGATGGCGTGTCTTCGTTCAAATCAGTGGATGCAGATCGCGCACTCAACGACGCGCCGGCGGTACGACTTGAGCGTGGTGGAAAGGCTTACGGCCACCATTCGGTGTTCGCAGAACTAGACCTCACCGTGAAACAGGGCGAAGTGGTTTCGATTATCGGACCTTCCGGGTCGGGCAAAACGACGCTCATCCGCTCACTAAACGGGCTTGAGACACTCGATACAGGCGTCGTGTACGTCGACGGTGTTCCGTTTCTTGCAGGATCGCAGGAAGGTCTCAACGTCCGGCAGAACCGCGGCGACGCGCTTCGCATCGGCATGGTCTTCCAGAACTTCAATCTCTTCCCCCACAAGACGGCCTTGGAAAACGTAATGATGGGGCCGCTGTACCACAAACGCGGCTCACACGAGGAGGTTCGGAAGGAAGCCCGTTATAAGTTGCAGCAAGTCGGAATGCTCGTACATGAGAACAAGTACCCCCACCAACTGTCGGGGGGGCAGCAACAGCGCGTGGCAATCGCGCGGGCCTTGGCGATGCACCCCTCGATCATGTTGTTCGACGAGCCGACCTCGGCGCTCGACCCCGAGACTGTAGGCGAGGTGCTCCGCATCATGGCATCATTAGCCAATTCGGGTCGAACTATGGTGATCGTCACCCATGAAATGAAGTTCGCCATCGAAGTATCGGACCGCATCATCTTCATGGAAGGCGGCAAGGTCGTTTTCGACGGCTCGCCCGAGCTGCTGATTGAGAAGCGCAAACAGGATGCCAGACTGTCCGACTTCGTTCGAATTTAG
Protein sequences of DBSCAN-SWA_33 >NC_022536|362672:364175|362672_364175_+|WP_022557462.1|DBSCAN-SWA MAFDWSYTIGLLWSRDFWNASLTVVELSVATWVIAVVLGFFVALADRSKYMVVRRAAALYVWFFRSLPLLVLLIFVYNLPQAFPSTSGLLSVPFIAGLMALVLSETAYIAEIHRGALLAVGKGQYEAGRVLGLSTTGIQRKIVIPQSLRIALPALANEFISIVKLTSLVSVISLAEILLVGQRYYTQNFLVIETMVAVAFYYVLIVSVFDFGLKTLERRLDFSRRTAHLTSEGVDLDGVSSFKSVDADRALNDAPAVRLERGGKAYGHHSVFAELDLTVKQGEVVSIIGPSGSGKTTLIRSLNGLETLDTGVVYVDGVPFLAGSQEGLNVRQNRGDALRIGMVFQNFNLFPHKTALENVMMGPLYHKRGSHEEVRKEARYKLQQVGMLVHENKYPHQLSGGQQQRVAIARALAMHPSIMLFDEPTSALDPETVGEVLRIMASLANSGRTMVIVTHEMKFAIEVSDRIIFMEGGKVVFDGSPELLIEKRKQDARLSDFVRI |
1 | Planktothrix_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_34 |
375174 : 376677
Sequences of DBSCAN-SWA_34
Nucleotide sequences of DBSCAN-SWA_34 >NC_022536|375174:376677|DBSCAN-SWA GATGCTCGAAAATACCGAGCGCGATGGTTTGCTGGCAATCGAAGGCATCCGAAAAGAGTTTCCCGGTGTCGTCGCTCTCGATGATGTCAGGCTGCGGGTGCGGCCCGGAACTGTTCATGCGCTGATGGGTGAAAATGGCGCTGGAAAGTCCACGCTGATGAAAATCATCGCAGGAATTTACCAACCAGACGCCGGTGAAATCAGGTTGCGCGGATTACCTGTTACGTTGAAATCTCCGCTTGACGCCCTCGAGCAGGGCGTTGCCATGATCCATCAGGAACTGGCGCTGATGAACTCGATGACTGTGGCGGAAAATATCTGGATACGCCGCGAACCGAAAAATCGCTTTGGTCTTATCGATCATTCAAATATGGCCAAGATGACTGAGGATCTGTTTTCGCGTCTCAACATTAAGCTCGATCCGAAGGCTCAGGTTTCCGAACTCACCGTTGCGCAGAAACAAATGGTCGAGATCGCCCGTGCGGTCAGCTATGAATCCTCGATCCTGATCATGGACGAGCCAACCTCGGCGTTGACCGATCGCGAAGTGGAGCACCTTTTCGCAATCATTCGCGATCTTCGCTCTCGCGGCATTGGCATCGTTTACATCACTCACAAGATGAATGAACTCTTTGAGATTGCCGACGAATTCACAGTCTTTCGAGACGGAAAATACGTTGGCACCCATTCTTCCAAGGATGTGACGCGCGACGACATTATTCGCATGATGGTCGGCCGCGAGATCACCGACATGTTCCCGAAAGTCGATTGCCCCATCGGCGACGTGATCCTGGAAGTAAAGAATCTGACAATACCGGGGGCCTTTTACGACATCTCCTTTTCAGTCCGCCGCGGCGAGATCCTCGGGCTCGCGGGATTGGTGGGTTCAAAACGTTCGAACGTCGCGGAGGCTCTTTTTGGTGTCACACCGGCGAGCTCAGGTGAAATTTTGATCGACGGCAAGATCGCGGCGATATCAAACCCAAGCGACGCGATGGCTTGTGGACTGGCGTTCCTGACCGAGGACCGCAAGGAGACGGGCTGTTTCCTGACGTTGAACTGCCTCGAGAACATCCAGTCTGCCCTTATCACACGCTCACACGTCAAGGCTGGGTTCGTCGATCAATCCACCGTCACAAAACTGGCCGAAGAGATGGCCGCCAAACTTCGCGTCAAGACCCCGAATTTGCATGAGACGGTCGAGAATCTATCCGGCGGGAACCAGCAGAAACTGCTGATTGCGCGCTGGCTTCTAACGAATCCCCGGATCCTTATCCTCGACGAGCCAACCCGCGGCATTGATGTGGGCGCCAAATCCGAAATCCACAGACTGATCACCGGCTTGGCTGCGGAAGGGGTGGCCGTCCTGATGATTTCGTCGGAACTGCCGGAGGTGTTGGGCATGAGCGATCGGATCATGGTCATGCACGAAGGTCACGTGGCCGGATTCCTTGATCGCGCCGATGCAACTCAAGTGAAAATTATGGACCTTGCTGCCCGATGA
Protein sequences of DBSCAN-SWA_34 >NC_022536|375174:376677|375174_376677_+|WP_048903094.1|DBSCAN-SWA MLENTERDGLLAIEGIRKEFPGVVALDDVRLRVRPGTVHALMGENGAGKSTLMKIIAGIYQPDAGEIRLRGLPVTLKSPLDALEQGVAMIHQELALMNSMTVAENIWIRREPKNRFGLIDHSNMAKMTEDLFSRLNIKLDPKAQVSELTVAQKQMVEIARAVSYESSILIMDEPTSALTDREVEHLFAIIRDLRSRGIGIVYITHKMNELFEIADEFTVFRDGKYVGTHSSKDVTRDDIIRMMVGREITDMFPKVDCPIGDVILEVKNLTIPGAFYDISFSVRRGEILGLAGLVGSKRSNVAEALFGVTPASSGEILIDGKIAAISNPSDAMACGLAFLTEDRKETGCFLTLNCLENIQSALITRSHVKAGFVDQSTVTKLAEEMAAKLRVKTPNLHETVENLSGGNQQKLLIARWLLTNPRILILDEPTRGIDVGAKSEIHRLITGLAAEGVAVLMISSELPEVLGMSDRIMVMHEGHVAGFLDRADATQVKIMDLAAR |
1 | Staphylococcus_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_35 |
382767 : 384600
Sequences of DBSCAN-SWA_35
Nucleotide sequences of DBSCAN-SWA_35 >NC_022536|382767:384600|DBSCAN-SWA TATGACAACGATCCGTTTGACCGCAGCACAGGCGATGTTCCGCTGGCTGTCGGTGCAGATGAACGAGGACGGGGAGCGCTTCATCGACGGCGTCTGGGCGATTTTCGGGCATGGAAATGTCGCTGGCATCGGCGAGGCGTTACACGGTATCCAGGAAAGACTCCCTACATGGCGCGGCCAGAACGAGCAGACCATGGCGCACGCAGCCATCGCCTATGCCAAGACGAAACGCCGCCGCAAGGCAATGGCGATCACGTCGTCGATCGGTCCGGGGGCCACCAACATGGTGACGGCGGCAGCCCTTGCCCATGTTAACCGGCTTCCCGTGCTGTTCATCCCCGGCGACGTCTTTGCCAATCGCCGACCCGATCCGGTGCTGCAGCAGATCGAAGATTTCGAGGACGGAACGATGAGCGTCACCGATTGCTTCAGGCCGGTCAGCCGCTATTTTGACCGTATCACGCGGCCGGAACACCTGTTGTCGGCCTTGCCACGCGCCATGCAGGTTATGACCGATCCTGCCAACTGCGGTCCTGTTACGCTCGCTTTCTGCCAGGACGTGCAGGCGGAGGCCTTTGACTGGCCGGAGAGCTTCTTCGAGGAAAGGACCTGGCGCATCCGGCGGCCGGAGCCGGATCCGCGTGAAGTCGCCGATGTCGTCGCCGCTCTCAAGTCGGCGCGCAATCCCGTGATCGTCGCGGGCGGCGGTGTCCTTTATTCTGGCGCCGAGAGCGATCTTCTAAAATTCGCCAGGAAGCATAACATTCCTGTCGTCGAAACCCAGGCCGGCAAGGGCGCGATCGACTGGAAGGAAAAATTGAACTTCGGATCGCCAGGTGTCACTGGGACCGATTGCGGCAACGAGATTGCGGCTAATGCCGACCTCATCCTCGGTGTCGGTACCCGCTTCCAGGATTTCACCACGGGCAGTTGGACGGCTTTCGCCAACCCGGCGCGCAAGCTTGTCTCGATCAATCTTGCCGGTTATGACGCTGCAAAGCACAGCGCTATCCCGCTTGTCTCAGATGCCAAGGTTGCGCTCTCACGATTATCCGCGGGACTGGATAGCCACCGGTTTGAAGATCCTGACTATGCCGCGCGCGATGCATGGTTTCGGTTGTCCGACGGCGCCCTCGCCGCCCCGAACCCAGAGAGTGCCAACTTCCTGCCATCGGATGCCCATGTCATCGGCGCCGTGTTGCGCCAAGCCAAGGAGAACACTGTCGCAATGTGTGCCGCCGGCACCATGCCAGGGGCGCTGCAGGTGCTGTGGCGGGCAGCCACAAATGGCTACCATATGGAATACGGCTACTCCTGCATGGGTTACGAGGTAGCAGGTGCCTTCGGCATCAAGCTTGCGGATAAGGCCAAGGATGTCGTCTGCTTCGTCGGCGACGGCTCCTACATGATGGCCAACTCCGAGCTGGCGACGGCCGTGATGATGCGTGTCCCGTTCACGATCGTGCTGACCGACAACCGCGGTTACGGCTGCATTAACCGCCTGCAGCAGGAATGCGGCGGGGCCGAATTCAATAATATGTACAAGGACAGCAATATCGAAGTGCAGCCGGAGATCGACTTCGTGGCACATGCGGCCTCGATGGGCGCCCATGCCATGAAGGTGTCGGGCATCGCCGCGCTCGAGACCGAGTTGGTCGCCGCTCGCGACCGCAACATCCCGACCGTCATCGTCATCGACACCGAAGCCGAAACCGGCGCTGGCATCGGCGGCGGCTGGTGGGACGTCGCGGTTCCCGAAGTCGGCAACACCGAAAAACTGAAGGACGCGCGCGCACACTATGAAGCCAACACCGCTCGCCAGCGGATCAATTGA
Protein sequences of DBSCAN-SWA_35 >NC_022536|382767:384600|382767_384600_+|WP_048903040.1|DBSCAN-SWA MTTIRLTAAQAMFRWLSVQMNEDGERFIDGVWAIFGHGNVAGIGEALHGIQERLPTWRGQNEQTMAHAAIAYAKTKRRRKAMAITSSIGPGATNMVTAAALAHVNRLPVLFIPGDVFANRRPDPVLQQIEDFEDGTMSVTDCFRPVSRYFDRITRPEHLLSALPRAMQVMTDPANCGPVTLAFCQDVQAEAFDWPESFFEERTWRIRRPEPDPREVADVVAALKSARNPVIVAGGGVLYSGAESDLLKFARKHNIPVVETQAGKGAIDWKEKLNFGSPGVTGTDCGNEIAANADLILGVGTRFQDFTTGSWTAFANPARKLVSINLAGYDAAKHSAIPLVSDAKVALSRLSAGLDSHRFEDPDYAARDAWFRLSDGALAAPNPESANFLPSDAHVIGAVLRQAKENTVAMCAAGTMPGALQVLWRAATNGYHMEYGYSCMGYEVAGAFGIKLADKAKDVVCFVGDGSYMMANSELATAVMMRVPFTIVLTDNRGYGCINRLQQECGGAEFNNMYKDSNIEVQPEIDFVAHAASMGAHAMKVSGIAALETELVAARDRNIPTVIVIDTEAETGAGIGGGWWDVAVPEVGNTEKLKDARAHYEANTARQRIN |
1 | Ostreococcus_lucimarinus_virus(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_36 |
394107 : 397828
Sequences of DBSCAN-SWA_36
Nucleotide sequences of DBSCAN-SWA_36 >NC_022536|394107:397828|DBSCAN-SWA GTCAGCGGCCTCTAAAAATGCTGGCTCGATTGTAGCGCGTCTGGACGTAGACGTGCTCGTTGAACGTTCCCGATGCGTAATCACCGGAGTCAGGATTTGTCTGAATGGCGAGATTGCCAAGCGTAGGATAGCGAGCAACGAATTCTTCATCGATATCGACACCCAAGCCGGCGCCATTGGGAACTGCAATAAAGCCGTCGACGACCTTTGGTTGGGGAAAGATCGTGTTCTGCCGTCCTTCCCAATCGAATTCCAAGCGCTCAAGCATTAGCGCATTGGGGATAGCGGCCATCAAATGGAGGGCAGCATATTCACCGACTGGTCCGAGGGAACCGGAATGAGGCGCCATCATGATATGGTGTGCCTCCGCCATAGCGGCAATTTTTTTCATTTGTGTGAGCCCCCCTGCGCGACCGGTATCTGGTTGCACTACATCTACCAGGTTGCGCTCGATCAAATCTCTCTGGCCGAAAATTGTCGCCATGCGTTCGCCAGCAGCTAGCGGAATGTTGGCCGCATCTCTGATCCTTTGATAGCCGTCCAGATTTTCTGGAGCGATTGGATCTTCCAGGAAAAGTAGATTGTATGGTTCGAGGTCTCGGGCAAAACGCGCCGCATCGGCTGGGGTCAGCCACGGCGGACCGTGGAGATCAATCATTATATCCATGCTGTCCTCGACGGCCTCACGTAGCTTTTCCACGTGGCGAATCGGATTGCGGAGACCACTTGCCTTGAATGCTGAGTAGCCACGTTCTCTCAGCGCGAAGGCGGCCTCGGGGGTATTGGCGTGACCATATACGGGAATTCGATCCCTGACTTGACCACCCAGAAGATTCCAGACTGGGGTGTTCAGAGCCTTGCCTTTGATGTCCCAGAGAGCCATGTCGATCCCTGTCATCGCACCGCCACCGACAGTCCCCGTCATGCCATGATTCATAATAGCGGTCTGCATTTTTTGCCACAATCGCTCGATGTTGGCAGGGTTTTCACCAATCAACAGGGGCGTGATGTCGGCCACCGCCGTTTGCACAACACGCGGCCAACCAGAGCATTCACCGAGCCCGGTTATTCCCTCGTCTGTCTCTACTTTTACAAACAGCCAGTTTCTCGTACCGCGAAAATTCGATCGTTCAGGCGCGTCCTCTGGAGGTGCTCCCACATGCATAAGAAAGGTCTTGATAGCTGTAATTTTCATGTGTCATTTCCTCGACAAATCGTTGTTCATCTCGCTGCTCAGCAGTTGGCAAAAAGGGGGCTCAACCTTCCCAAGGCCGCGACAAGTGCGGCAGTGTCACAACGCGAAACGACAAGTTGGGACACCTTTGATCCCCACGTCGAAAGCCAACAATGTTCCAGTTTCCGCGACTCCGTTACGAACGGTCGACAGAGTTGTGACATAGATCGTTGAGAGTTCGGGCCCACCAAAGCAACACATGCTTGGTCCCTTTGCTGGCAATAGGTACGCCTCAACCACCTCCCCGCTTGGGGTCAGGCGATTAAGTCTGCTCCCTTGCTGTCCGGCAATCCAGTAGTAACCATCCGTATCGACAGTTGCGCCGTCGGGCCGGCCCTCTTCGGCGGTAAAGTCGTGGAGACGGCGAGCATTGCCTTTGATCCCGATATTGACATCGAAATCAAAAACCTGCGCGTAAAGCCCGCTGCTATCCGAGTGATACATCTGTTGCCCGTCCGGACTCCACGCGAGTCCGTTTGAGGTCAACAAGCCGCCATCTATCACCGTGTCATAAGAGCCATCCGGCCGAACACGATAAAGCCGTGCATTTCCCGTTTGTGGCACAGCTTCATCGCGAGTACCAACCCAAAAGCAACCGTCTGGCCCGACTTTTCCATCGTTCAACCGGCTGAAGCCACGCCCGCCATCCGGATTGCAAAGTAATCGTAGGCAACCCGTCACAGGGTCCAATATATGGACACCGCTCTCGAGCGCGACTGCGATTTCACCGTTCTCGCATAGCGCCAAACAACCGATTGTCGCCGGCATTTCGTAACGGTCTATCGTTCCATTGTTGCGGAGGCAGAGCGCTGCTGGTGCGAGCATATCGACGAACCAGAGGTCTCCCCGTTGTTCATCCCAAGTGGGCGATTCTCCAGCAATAATAGGAAAGTCGATCACCGTCTTCACGTCTGGCTTCACAACTCTAATCATCTTATTCAATTCCGATTTATGCCCTCTGTTCGAATGTAAGATTGCGTTTGCGCACATCATCCCGCTTGGCTTCCCTGGCGGCCAAGACAACGAATACGGTGAGCTAAGGGAGATTGAGATTGTTTCCCCGTGGCTATCTCACGAGGCAGGTGCTTTTATTGTCAAAGCGCCAGTCCTTTGCCACGACCTCCACTATCACGATCGGCAAATGTCTTTCGCACGCGTTCTAGCAGGGAATTGTAGCTACCAATCGGCATCTCTACGATCAATGAAGCAGCGTCTTCCAGAGTTCTCCAGATTGCTTCCCCGCCGGGCACCTCCCCTACTCCGATATGGCCCGAATCGTCCTCCACGATGACTAGGTTGCGCGTAAAAAAGGGGCCATGTGGCCGCTGAGATTCAGCAGCATACTGTCACGCCCCGCAACTGGTATTGCTCGAACGGAGGTAACGCGAGGCGTACCCTGGGACAGGTACACCTCGTTTCCTCGTGTTATGTTATTCTCGAAAGTCATGAGCTTCCTATTGAGTCTGGTGACGAATTGAAAGCGACGCGTTAGTTTGCTGCTTTTCCGGGCTCAGAATCTGTATTCGCGTCCTGGCAAAGCAGATGCAAAGACTGTTGTTCCCGCTGGAGGCGCATCCTCGCCGGCAATGCGGGCAGTCAATGGTCCGAGCGCGACCGTATCTAGATGGAGAATCGTTTCGGCACCGAGGCGCTCAACGTGCCGCACTGTACCTTCCCACTGCCCTGCGGACATGGAAACAGAGAGGTGCTCCGGCCTAATACCGAACACCTTGCATCCAAGGGCTGCCGCCGCATCACCCTCTACGAGGTTCATTTTTGGGCTTCCGATGAATCCTGCCACAAATGGAGAGACGGGACAGTTGTAAAGTTCCATCGGAGTTCCAACCTGTTCAATCCGACCCGCGTTCAAAACAACTATCTTATCTGCCATGGTCATCGCCTCGACCTGATCATGTGTGACATAGATCATCGTAGCGCCGAGCGTTCTGTGAAGGTCGGTCAGTTCCATCCGCATCTGGGTCCTTAATGAAGCATCCAAATTCGAAAGAGGCTCATCAAAAAGAAAGACCGAAGGATTGCGAACGATCGCTCGTCCAATCGCAACGCGTTGCCGCTGACCACCGGAAAGAGCCTTTGGCTTACGGTCGAGGAGGTGTGTCAATTGTAGGGACGCGGCCGCTCCTTCAACCTTTGCCTTAATCTCGGCCTTTGGTCGTTTAGCAAGACTGAGACCGAACCCTATGTTTTCCCGGACGGTCAAATGAGGGTAAAGCGCGTAGCTCTGGAACACCATTGCGATACCACGTGAAGACGGTTCAGCATCGGTTACATCACGACCGCCGATCTCAATTTGCCCGGATGTAACGTCTTCGAGACCGGCAATGAGGCGTAGAAGGGTTGATTTTCCACAGCCTGATGGCCCGACAAAGACACAGAATTCACCCTTTTTGATTTCGAGGTCGATGTTGTTCATGACATCCACTGCCCCGAAGCTTTTCTTGATTTGTTTAAGCGTGACGTTGGCCAT
Protein sequences of DBSCAN-SWA_36 >NC_022536|394107:397828|396850_397828_-|WP_022557493.1|DBSCAN-SWA MANVTLKQIKKSFGAVDVMNNIDLEIKKGEFCVFVGPSGCGKSTLLRLIAGLEDVTSGQIEIGGRDVTDAEPSSRGIAMVFQSYALYPHLTVRENIGFGLSLAKRPKAEIKAKVEGAAASLQLTHLLDRKPKALSGGQRQRVAIGRAIVRNPSVFLFDEPLSNLDASLRTQMRMELTDLHRTLGATMIYVTHDQVEAMTMADKIVVLNAGRIEQVGTPMELYNCPVSPFVAGFIGSPKMNLVEGDAAAALGCKVFGIRPEHLSVSMSAGQWEGTVRHVERLGAETILHLDTVALGPLTARIAGEDAPPAGTTVFASALPGREYRF >NC_022536|394107:397828|394107_395301_-|WP_022557490.1|DBSCAN-SWA MKITAIKTFLMHVGAPPEDAPERSNFRGTRNWLFVKVETDEGITGLGECSGWPRVVQTAVADITPLLIGENPANIERLWQKMQTAIMNHGMTGTVGGGAMTGIDMALWDIKGKALNTPVWNLLGGQVRDRIPVYGHANTPEAAFALRERGYSAFKASGLRNPIRHVEKLREAVEDSMDIMIDLHGPPWLTPADAARFARDLEPYNLLFLEDPIAPENLDGYQRIRDAANIPLAAGERMATIFGQRDLIERNLVDVVQPDTGRAGGLTQMKKIAAMAEAHHIMMAPHSGSLGPVGEYAALHLMAAIPNALMLERLEFDWEGRQNTIFPQPKVVDGFIAVPNGAGLGVDIDEEFVARYPTLGNLAIQTNPDSGDYASGTFNEHVYVQTRYNRASIFRGR >NC_022536|394107:397828|395397_396273_-|WP_022557491.1|DBSCAN-SWA MIRVVKPDVKTVIDFPIIAGESPTWDEQRGDLWFVDMLAPAALCLRNNGTIDRYEMPATIGCLALCENGEIAVALESGVHILDPVTGCLRLLCNPDGGRGFSRLNDGKVGPDGCFWVGTRDEAVPQTGNARLYRVRPDGSYDTVIDGGLLTSNGLAWSPDGQQMYHSDSSGLYAQVFDFDVNIGIKGNARRLHDFTAEEGRPDGATVDTDGYYWIAGQQGSRLNRLTPSGEVVEAYLLPAKGPSMCCFGGPELSTIYVTTLSTVRNGVAETGTLLAFDVGIKGVPTCRFAL |
3 | Oenococcus_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_37 |
403361 : 404378
Sequences of DBSCAN-SWA_37
Nucleotide sequences of DBSCAN-SWA_37 >NC_022536|403361:404378|DBSCAN-SWA CATGAAAAAGAACAGTTCCAAGCAGCGGGCGACAATCCTTGCTGTGGCTGAAGACGCGGGAGTATCGAGGGCAGCTGTCTCGAAAGTTCTCAACAACGCCTACGGCGTCTCTGATGCTCTGCGAGAGAAGGTAGAGGCATCGATTGCGCGTTTAGGCTACCGGCCCTCTTTTGCCGCCAGGGGCATGCGCGGCAAAACATTTACATTTGGCGTCCTTCTCGGGGGAATCGAAAATCCACTGGTTTCGGAAATTGTCCGCGGCATTTCATCTGTCGCCGATAGTTCAGGCTATAAGGTTGTCATGGCGATGGGACGCTACAAACAACCCCTTGAGTCGGGCCTCATCGAGTCAATGATTGACCTTCGCCTTGATGGCCTCATCCTAATTGCGCCGCGGCTTCCGGATGCTGTGTTGAATCATTTTGCCCAACAGATCCCGATGGTCGTGGTCGGCCATCACCAACCTTCGACTATTTTGTTTGACACAGTCAATTGCGACGACGCTAAGGGCGCAGGCATCGCGGTATACGCTCTGGTGGAAAGAGGTTATCAAGATATCTGTATGCTCAGCATGCCGCAACCCGAAATGAACGAAGCCAACATCGTGACCATGCGAGAAGCAGGATACACTGCGGCGATGAACCAGTTAGGACTTCAGAGCTTCTCCAGAATAGATTACATGCCTATCGATCCTTCAGAGCGCGCAAGGTTTCTCCAAAACCTGATTTCTAACAGCGGCCGACCCCGCGCCCTATTTTGTTGGTCTGATCTCGATGCGATCCCCGCACTTGATGCGCTGCGAGACGCCAATATCTGGGTTCCAAAACAAGTCGCGATTATAGGGTATGACAACTCGCCGCCATCGTCACTCGCCCTATTCGATCTCACCAGTGTTGACCAACGTGCAGAAGAACAAGGGCGTATTGCCGCGCAAACTGTCCTCAGTCGCATTTCAGGTCGCGGGGAGCCTAACCACGTTATACTCCAGCCCAAGCTAGTTGCCCGTAGCAGCCACTAA
Protein sequences of DBSCAN-SWA_37 >NC_022536|403361:404378|403361_404378_+|WP_022557498.1|DBSCAN-SWA MKKNSSKQRATILAVAEDAGVSRAAVSKVLNNAYGVSDALREKVEASIARLGYRPSFAARGMRGKTFTFGVLLGGIENPLVSEIVRGISSVADSSGYKVVMAMGRYKQPLESGLIESMIDLRLDGLILIAPRLPDAVLNHFAQQIPMVVVGHHQPSTILFDTVNCDDAKGAGIAVYALVERGYQDICMLSMPQPEMNEANIVTMREAGYTAAMNQLGLQSFSRIDYMPIDPSERARFLQNLISNSGRPRALFCWSDLDAIPALDALRDANIWVPKQVAIIGYDNSPPSSLALFDLTSVDQRAEEQGRIAAQTVLSRISGRGEPNHVILQPKLVARSSH |
1 | Enterobacteria_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_38 |
408020 : 410454
Sequences of DBSCAN-SWA_38
Nucleotide sequences of DBSCAN-SWA_38 >NC_022536|408020:410454|DBSCAN-SWA TATGCACGAACGTCAGCGCCATCAAATAATCATCTCTGCTTTAAGGGACCGGTCGGTCCTCACGGTCCAGCATATTATCGACTTGACCGACGCCTCTGAAGCTACCGCCCGGCGGGATATCGCAGCGCTAGAGCGTCGAGGTAGCCTGAAGCGAGTGCGCGGTGGCGCTGTAGGCCAGCCGCATACGCAGCTGGAACACCTTTCTGCGAACAAATTCGAGGTCTCGAGCAAAATCAACGACGACAAGAAGCGCGCTATCGCTCGCCGAGCCGTTGACATGTGCGATGAAGGTGATGCCGTCATCATAAATGGGGGCACGACAACTTACCAGATGGTGCATTACATGTCAGCCCGCCGTCTGCAGGTCATGACCAATTCGTTTGCCGTCGCAGAGCACCTGGTCAAGCAATCTAAATGCACGGTCACGCTCCCTGGTGGCCATATTTACCGGGAACAAAGCCTCATTCTTTCTCCATTTGAAAATGACGCATCGCGGAATTTCTACGCGCGACGAATGTTTGTTGGAGCTGAGGGTATCGGACCACTTGGGGTGATGGAGTCTGATTATCTGATCGTGCAAAGCCAACACAGGCTCATGCAACAGGCAGAAGAGGTTGTCATACTGGCAGATTCCTCCAAGTTCCAAAGTCGATCGAGCCTCATACTGTGCTCCCTTGAAGCAATTACGACCATCATTACTGACGATGATCTGGCTGATACGGCACGTGAAATGATCACCATCGCTGGTGTTCAGCTAATCACCGTCAAGAGCTCCGGGCAGACGTGAACGATACCTACGATTTTCGACAAATGCGGCTGGGGAGGAACGTGTGAGCAGCATTAAACACGATACGTGGTCACCAGCCGTCCGGCTGAAAGCAGGCAAGAACGAGTTAATCCCAAACGGTTTCGGCGACCTCGGCAATGGCGTCGTTTCTTTTCACTTCATTTCTGCGATTAGAAGAAGGTCCCATTATAGGACTCTGTGCGGTGGCACCATGATTGAAGATCGGCATGTTTAATGCACGTCTCTAAACTCGTCGCAGTGCTCGATGTCGGAAAGACAAATGCGAAAGTTGTGATCGTCAATGCCGCGGGCGTTGAGGTTGCGAAGCGGTCAATACGTAACAGTGTCTGTAATTGGAGTGCACCCCATTTCACCGGGCAAGTCGGACAGTCGGCATAAGCAGAAAGGCTCAAGCACAGGCTTGGGTATAGGCTTATGTCTAACGAGTATCGACACGTTGAATTGCTGACGGGTGATGTTCGCCGCAGGCGGTGGACAACCGAGCAAAAGCTGACGATCATTGAGCAGAGTTTCGAACCAGGCGAGACGGTATCGTCCACCGCTCGCCGTCATGGCGTCGCGCCCAATCTGCTTTATCGGTGGCGCAGGCTCTTGAGCGAGGGAGGTGCTGCAGCTGTGGATTCTGACGAGCCGGTGGTCGGCAATTCGGAAGTAAAGAAGCTGGAAGATCGCGTCCGCGAGCTGGAGCGCATGCTCGGCCGCAAGACGATGGAGGTCGAAATCCTTCGGGAAGCCTTGTCCAAAGCAGACTCAAAAAAACGGATATCGCGGCCGATCTTGTTGCCGAAGGACGGTTCGCGATGAAGGCCGTCGCAGATACGCTGGGCGTATCCCGTTCCAACCTCATCGAGCGGCTGAAAGGCAGATCGAAGCCGCGCGGCTCCTATCACAAGGCCGAGGATGCAGAGCTTTTGCCCATCATTCGCAGGCTGGTAGATCAAAGGCCAACCTATGGCTATCGGCGGATCGCCGCGCTCCTCAATCGCGAAAGGCGAGCCGCCGATAAGCCTGTCGTCAACGCAAAACGGGTTCACCGCATCATGGGCAACCACGCCATGTTGCTGGAGAAGCATACGGCCGTTCGCAAGGGCCGCATCCATGACGGCAAGGTCATGGTCATGCGCTCGAACTTGCGCTGGTGCTCGGATGGGCTGGAGTTCACTTGCTGGAACGGCGAGGTCATCCGTCTCGCCTTCATCATCGACGCTTTCGACCGGGAGATCATCGCCTGGACGGCGGTCGCCAACGCGGGCATCTCCGGCTCTGACGTGCGCGACATGATGCTGGAGGCGGTCGAGAAACGCTTCCGCGCAACCCGAGCCCCGCACGCAATCGAGCATCTCTCGGACAACGGCTCTGCTTACACCGCGAGGGACACGAGGCTGTTTGCGCAAGCACTCAATCTGACACCCTGCTTCACGCCGGTAGCCAGCCCGCAGTCGAACGGCATGTCGGAAGCCTTCGTCAAAACGTTGAAGCGGGACTATATTCGGATATCAGCTCTACCGGACGCCCAAACAGCGCTCCGGCTCATTGACGGATGGATCGAGGACTACAACGAAATCCATCCCCATTCCGCGCTCAAGATGGCTTCCCCTCGGCAGTTCATCAGGGCTAAATCAATCTA
Protein sequences of DBSCAN-SWA_38 >NC_022536|408020:410454|409241_410454_+|WP_111818031.1|transposase|DBSCAN-SWA MSNEYRHVELLTGDVRRRRWTTEQKLTIIEQSFEPGETVSSTARRHGVAPNLLYRWRRLLSEGGAAAVDSDEPVVGNSEVKKLEDRVRELERMLGRKTMEVEILREALSKADFKKTDIAADLVAEGRFAMKAVADTLGVSRSNLIERLKGRSKPRGSYHKAEDAELLPIIRRLVDQRPTYGYRRIAALLNRERRAADKPVVNAKRVHRIMGNHAMLLEKHTAVRKGRIHDGKVMVMRSNLRWCSDGLEFTCWNGEVIRLAFIIDAFDREIIAWTAVANAGISGSDVRDMMLEAVEKRFRATRAPHAIEHLSDNGSAYTARDTRLFAQALNLTPCFTPVASPQSNGMSEAFVKTLKRDYIRISALPDAQTALRLIDGWIEDYNEIHPHSALKMASPRQFIRAKSI >NC_022536|408020:410454|408020_408806_+|WP_022557501.1|DBSCAN-SWA MHERQRHQIIISALRDRSVLTVQHIIDLTDASEATARRDIAALERRGSLKRVRGGAVGQPHTQLEHLSANKFEVSSKINDDKKRAIARRAVDMCDEGDAVIINGGTTTYQMVHYMSARRLQVMTNSFAVAEHLVKQSKCTVTLPGGHIYREQSLILSPFENDASRNFYARRMFVGAEGIGPLGVMESDYLIVQSQHRLMQQAEEVVILADSSKFQSRSSLILCSLEAITTIITDDDLADTAREMITIAGVQLITVKSSGQT |
2 | Escherichia_phage(50.0%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_39 |
417316 : 421575
Sequences of DBSCAN-SWA_39
Nucleotide sequences of DBSCAN-SWA_39 >NC_022536|417316:421575|DBSCAN-SWA TATGTCAGGACGTTTCATCCGAGCCGCAACGGGAATGTTTGCTGCGGTGATCTTGTTTACGGGGGCATCGGCCACTGGGGCTGCCGAAAAGCAATATACCGTCACATCCTCCGATGGCGTTACAATCGCCGTCGAAGAAACCGGAAATCCCCAAGGTCAGCCAATTGTATTCGTTCACGGTCTGCTTGGCAGCCGCATCAATTGGGACAGCCAGACGGCCGATCCAGACCTGCGGAAGTTTAGGCTGATCACCTTTGATCTGAGAGGACATGGACTGTCGGCAAAGCCGGATGACGCTGACGCCTACAAGGATGGCGACCGCTGGGCAGACGATCTCGATGCAGTCCTTCGGGGAAGCGGAGCGACGAACCCGGTGCTCGTGGGCTGGTCTCTTGGGGGTGTGGTGCTGTCGAACTATCTTGCCAACCATGGCGACGCCGGGATTGGTGGCCTCCTTTACGTCGACGGTGTCATCGAACTCAAACCCGATCTCATCACGCCGCATCCGGAGGTCTATGCAGGACTGGCCTCAGAGGACCTGCGGACGCATCTCGACATGGTGCGGACCTTCCTGGCCCTCTGCTTTGCAACGCAGCCAGAAAACGCGACGTTTGAACGGCTGCTGTCCAATGCAGCGGTGGCATCATGGCTGATGACGCGAACAGTCCCCTCGATGACAGTCCAGGCAAAAGAAGGACTGGCGAAGGCAAAAAAGCCGGTTCTGTTGATCTATGGGGGCAAGGATAATCTGGTACGGCCGCAGCCGAGTATAGAACGCGCGAAGTCGTTCAACGGCTCGATCAAGTCAGAGATTTACGACAACTCGGGTCATGCGCCATTTCTTGAAGAAGCGTCGCGGTTCAATAAGGATTTGGCCAGGTTTGCGGCTTCTGTGGCGGACGGCAAGTAAGTCAGAAACTGGCTCCACCGGCTCCATTCTGCTTCGTCTTTCATGCTCCCATTCTGAAATCGAGACTGAATGGTCGTCTAATCTGGAGAAGCGTGCGTGGGGAGCCGTGGTCTTGATAGGTCCGCCCCGTCGCGCTGGGCCGAACACCGGTGCGTCACGACGACGGAACCGGAACCGGCTAAACGAGTTATGATGGTTCCGGGAGATGGAGTAGCTGTCAGGAGCGAACGCGTGAACAGATGCAATATAACCCACCGGCGGATGACCATTGGAACGCACTCCCTGTTCTTCCGGGAAGCGGGGCCGATTGAGGCGCCCGTTCTGCTCCTGCCCCACGGATACCCGTGTTCGTCCTATCAGTACCGCAGGCTGATGCCGGCGCTCGCCGATCAGTGGCACACGGTAGCTTTCGATTGGCCGGGTTTTGGCTACAGCGACACCCCCGATCCCGCCCAGTTCGGATATGACTTCGACGCTTATGCCGAGGTGCTCAACAACCGTCGCTGAGGCGCTCCGACTGGAGCGCTACGCGCTCTGGCTCCACGACTATGGATCGCAGATTGGTCTGCGGCATGCCATAGCCCACCCGGAGCGGATTGCCGCGCTGATCATCCAGAACGGTGACATCTATGAAGACGTGCTCGGCCCTAAATACGAGACGATCAAAGCGTGGTGGGCCGACAAGTCGCCCGAGAAGCACCGCCCTCTCGCGGAGGCCGTAAGTGAAGACGGATTTCGCGAGGAGTTCGTCGGTGAAGTCTCTGAAGAAGTGGCAAGCCTCGTCCCGCCCGACCTTTGGAAGCTCCATTGGCCGCTTATGGATACGCCGACACGCAAAATGGTGGCAGTCCGCCTGATGGAAAAGCTGGAGGAAAACCTCGACTGGTTCCCTCGCTATCAAGGCTATCTGCGTGAGCACCGCCCTCCGACCCTGGTGGTGTGGGGCCCGCAGGACGGCTATATGCCCGAGGCGTCGGCGCAAGCTTACCGGCGCGATCTCCCGGACGCGGAGTTGCACATTCTCGGCGAGGCCGGGCATTGGCTGCTGGAAACCCATCTCGAACAGGCGTTGCCGCTCGTCCGCGATTTCCTGGCCCGGACGTTCCGATGATCGCGCCTTTGAACGTAGCCCGCAACGAGAGGAGACATGCATGGTTACGAAGCCCACGATTGCACGCATCTGGCGCGGCCGTACGCGCCGCGAGGTCGCGGACAGCTACGAGCCCTATTTGCGAGCCGAGGGCATTCCTCCGCTTGAGGAAATTGCCCTCGGCGTCCAGCTTTTTCGGGAGGATCGGGACGAGGAAACGTGGTTCACGACTATCTCCTACTGGCCGGACATCGAGACGATGACCGCCTTTACAAAGGGCGATCCTGTCAAGGTTCATCATCTCGATCGCGACGCCGAGTTCCTGATCGAACTGCCCGCGCGCATCGAGCTGCATCGCATTCTGGTGAATGATCCGAGCCTTCGTTGAGCCGCGAGCAAGCCCTGTCTAGGGTGATGGAGCTCCGCAAGAAAATCGGGATCTCCCGGCCGCAGCAAGGCCGGCCGGTCTCCAGATGGTCCCACCTCTTTGCATCGGGAACGGCGCGTGTTCCGGCGCAATCGCGCTGACCAGGAAAAGAAAGCGATGATGTCACTGCCCCACCGCGTCTCCGGCAATCTCGCTCTTGATCTCGCCAACACCATCAGTTGGCGCAACACGAGCCGGGAAGTCGACCATTTGGCAAGCTTCGATGACGTTGTGGCATGGTCGAAGCAAGTCGGCCTGGTCGGCGGCGACTTCGTCGTATCGCCGCAGGAGCAAGAGATACTGCTCCAGCAGGTGCTCGCGCTTCGCAAGGCAATCGGTGCCGCGGGATCGGCGATAGCTAACGATCTCGATCCGTCTCGGCTGGATCTGGACGTCATACGCGACGTTGCTGCGCTGGCTCTGCGCCAGGCTTCGCTTTCCGGGACGCCGTACACATGGCATTTTGAGGGCATCTATCGCGTGACCGGCACCATCGCGTGGTCCGCGCTCGACCTGCTTCGGGGCGACGAAATCTCCCGGCTGAAGCAATGCCCACCCGACGATTGTCACTGGCTGTTTATCGACCGGACAAAGAACGCTTCAAGGCGCTGGTGCGACATGGGCACCTGCGGCAATCGCGCCAAAAAGATCGCCCACAGAGCCCGGCGCTGACTCAAAAACGGCGGGAAAGAGCAGCAGCGGGATTCCGGGCCCGTGATCAAAGCGGGATGAAAACCAATGATCACTTAGGTGTCGGGATACCGCATTGACTGGGCGCCGCAGGCGGACGTCGAAGTGGATGGAGCATGTGGCTGAACTCATTCCTGAGGTCTGAGAAAACCATCACGGCCACTTCCTGCCGAGCGGAGAACAAGTGCCCCTCCATCTCTTCTGCGGCATTTACCCTCGGGAATTGACAGCGAAATCAGACAGAGCTTCAAATCGGAACGACTGCAAACTTTCATTTACCACAAGGCGTATTGTCGAATTCCACGCCGCGCATATGTCGTCTTTGTCATAATGTATCCGGCCATAACAACAACCGATGAGGTGACATGATGTCAATCAACTCGGGTCATGCTCGATATTTCCCGCGACCCATACGAACTACAGCTAACCCGTTGGAACGGGCTATCCTGCAGCTAGAGAAGATGATCGATGCGCAAAGAACGAGTGCGCTACCGGCACCGTTTGCTCTTTATAAAGCCAAAAGCATTCTCGAACGATGCCACCAGGGGAGCCCGCTGCCGAATGCGCGTGACTGCATGCCCAACGTGGAATAAGCGATCGGACATCGGTAGCACGGGCCCGAAACAATAAACCTTAACGGTGGTCAAAGTCTGCCCGACTGGGCATGACGACCTTCGCATGCTCGTGCTACATCCAATATGCCGGCAGCGTCGGCGAGCGTTATTTCTACTACAGCAGCCGAAGCGTGCTCTCATCACGGACTCCTGCGTAAAACATGTCCAGTGCGGCTTGGAAAGGTAAATCCCGTTCAGCAACAGCTCCGAACAACGTCATGCTGGATCGAAATTTGAGATCATCCGGGGATCCGAGAATGTCATGCGCGCTCAAATCGGTGTGCTGCAGCATGGCGGTCGTACACTCGACGAGGCGTGGTCCAAGCATAGGATGCGTCAAATACGCCTCCGCTTCCGCACGACCCGAGATCGCGTAAAAGCGCGCAGTTTCCGAACGGCCAAGGCCACGCAGTTGCGGAAAGATAAACCACATCCAGTGTGAGCGCTTCAGTCCCGCCTGCAATTCGCTGAGGGCCACCTCGTAAATGTTTTGCTGCGCCGTCACAAAGCGCTCGAGATTGAATTTCAT
Protein sequences of DBSCAN-SWA_39 >NC_022536|417316:421575|421161_421575_-|WP_022557517.1|DBSCAN-SWA MKFNLERFVTAQQNIYEVALSELQAGLKRSHWMWFIFPQLRGLGRSETARFYAISGRAEAEAYLTHPMLGPRLVECTTAMLQHTDLSAHDILGSPDDLKFRSSMTLFGAVAERDLPFQAALDMFYAGVRDESTLRLL >NC_022536|417316:421575|419820_420414_+|WP_048903045.1|DBSCAN-SWA MFRRNRADQEKKAMMSLPHRVSGNLALDLANTISWRNTSREVDHLASFDDVVAWSKQVGLVGGDFVVSPQEQEILLQQVLALRKAIGAAGSAIANDLDPSRLDLDVIRDVAALALRQASLSGTPYTWHFEGIYRVTGTIAWSALDLLRGDEISRLKQCPPDDCHWLFIDRTKNASRRWCDMGTCGNRAKKIAHRARR >NC_022536|417316:421575|417316_418225_+|WP_022557511.1|DBSCAN-SWA MSGRFIRAATGMFAAVILFTGASATGAAEKQYTVTSSDGVTIAVEETGNPQGQPIVFVHGLLGSRINWDSQTADPDLRKFRLITFDLRGHGLSAKPDDADAYKDGDRWADDLDAVLRGSGATNPVLVGWSLGGVVLSNYLANHGDAGIGGLLYVDGVIELKPDLITPHPEVYAGLASEDLRTHLDMVRTFLALCFATQPENATFERLLSNAAVASWLMTRTVPSMTVQAKEGLAKAKKPVLLIYGGKDNLVRPQPSIERAKSFNGSIKSEIYDNSGHAPFLEEASRFNKDLARFAASVADGK >NC_022536|417316:421575|419376_419703_+|WP_022557515.1|DBSCAN-SWA MVTKPTIARIWRGRTRREVADSYEPYLRAEGIPPLEEIALGVQLFREDRDEETWFTTISYWPDIETMTAFTKGDPVKVHHLDRDAEFLIELPARIELHRILVNDPSLR |
4 | Cedratvirus(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_40 |
425826 : 429037
Sequences of DBSCAN-SWA_40
Nucleotide sequences of DBSCAN-SWA_40 >NC_022536|425826:429037|DBSCAN-SWA TATGACCGCCGCAGGCTCACGTGAACACCAGATGTTTCCTGCACTCGATCCGCAACAGATTGCGACGGCGAGACGTTTCGCTGATGACGAACCGCGGTACTTTCTGCCCGGCGAGATGATTTTCAATGTCGGCGAGAGGCACGCACCCGCCTGGCTGGTGCTCGAAGGATCGATTGAGGTTTTGCGCCAGGACGGCCTGTCATCCAGCGTTCCCGTGACCCGACATACGACGGGTCAATTTTCCGGAGAGGTCAGCCAGCTGTCGGGCAGGCCATCCCTGGCGAGTGGGCGTGCCGGTGCGGAGGGATGCGTGGCCGTTTCATTCGACGCCCCGCATCTTAGAGCTCTGATCATCGGCACCGCCGACATCGGAGAGATCGTGATGCGCGCGTTTATACTTCGGCGCGTCGCCCTGGTAGATCAGGGCGGTGCCGGTTCTGTACTGGTAGGTCAACCGGGCAGTGCTGACCTTATCCGCCTTCAAGGCTTTCTGGCGCGAAGCGGATATCCGTATGTCGCGCTTGACGCTGACGCCGACGGACAAGGCCGCGATCTTGTCCATCGACTGGGTATCTTGAGAGAAGAACTGCCACTGATGGTGTGCCCCGGCGGAGCCATTCTGAAGAATCCGACAGACGGCGAGGCCGCAGTTTGTCTGGGCGTCGCACAGGAGATCGTGTCCGGTGCCGTCTATGACGTTGCGATCATCGGCGCCGGCCCGGCCGGCCTAGCGGCCGCCGTCTACGCGGCCTCCGAAGGTTTGAGTGTCCTGGCGATTGACGAACGATCCGCAGGCGGCCAGGCGGGAGCGTCCGCACGGATCGAAAACTACCTTGGCTTCCCTGCGGGGATTTCCGGGCAAAATCTTGCGGCGCGTGCCTTCAACCAGGCGATAAAATTCGGGGTCGAGGTTGCTTTGCCCATCCCCGTTATTGACCTGCGGGTTGTTCAATCATCATGGGGAACCGGTTCGATATTGCGGCTGGGTCTCGATGGCGAACGGAGCGTCGATGCGCGAACCGTGGTGATCGCGTGCGGGGCGCGTTATCGCCGCCCGCAGTTGGCAAATCTCGCCGAATTCGAAGGGTCCGGTGTCTCCTATTGGGCGTCTCCTATCGAGGCAAAACTATGCGAAGGTGAAGAGGTCGTACTCGTCGGAGGGGGGAACTCGGCGGGGCAGGCGATTGTCTTCCTTGCTCCCAAGGTCCGGCAGCTCCATGTTGTTGTCCGAAGACCCCTCAAAGATACGATGTCCAGCTATCTTATTGACCGTATCGGGGCGCTGTCCAATGTCGAAATCCATGTCGACAGCGAGATTGTTGGCCTTGAGGGAGACCGCTCGACTGGTCTGAGCGGGGCCGTGATGCGTCACCGTCAATCCGGTGGTGAGCGAACCTTCAAGGTCCGGCACCTTTTCCTATTCATTGGCGCTGATCCGAATACGGCATGGTTACGCTCCGATGTGGACGTCGACGACAAAGGATTCATCCTGACGGGGAGGGCTTCTTCTGGACGCACATCCGTTCTTCCGCTCGAGACGAGTCTTCAAAACGTGTTTGCGATTGGTGACGTCAGGGCAGGGTCCACCAAGCGTGTCGCCGCCGCCGTGGGCGAAGGTGCCGCTGTAGTCTCTCAGATTCATGAGTGCCTTGGGCGGCAGCCCGGTTGAATTAACCCGGATCAATCCGTCCGGCACACTGAAGACTGATGACGGCAGACGGTTGCTTCGTTTCCTCCCTTGTCGTTGATCCGTCGGCCGCCGTGACAACACGCTGCATCGTGCGGTGTCCTCTCCGGTCATATTCAAGAGGTTCTCAATTCTGCGCGCGGACAAAACCTTCCTTTCGCTTCCCCCGATATGTCCGTGGACCAAGGCAAACTTCAGTCATTTCCGGTCGTGGTGGTGCAATCGGCCACTTTCTATGGAGTCTCCAAGCTTGATAAAGGATATATCGTGCTCCCGTAAAACGGCGACGAAGAGACTTCGCATCTGCCTGTTCCCGGGGCCATCGGCAGAGGTAAGCGCATTACCCAAATTCCGCAGCCCCGATATCATATTCCTCAATGCGAAAGAGACTGCGCATTCAGGGTTGCGAGCATTGTTGCAAATTGTGAAATAAACTCGACACTATTCAATCAATTGTTGATGGGCGGCTCTCACGTCAATATCGCGTCTGTCAGACGAACATAGCGTCGGCTGAAGTTTTCAGAACGAAATCGTAGGACGCGAGATGACCACGAATATCACATCTGTCACACGCCGGAACATGTTGCTGACCGCAGGTACGGCCGTTGCCGTTACAGCGATGTATCCCGTTTTCGGCTTCGCAGCCAAGAGTGACTCCGCCACTACCGCCACCGAAGGAACCAAGACCATGAGTACTGTAACAACCAGGGACGGCACTGAGATTTTCTACAAGGACTGGGGCCCGAAGGATGCCCAGCCGATCGTTTTCCACCATGGCTGGCCGCTCTCGTCCGATGACTGGGATGCGCAGATGCTGTTCTTCGTCTCCAAGGGTTTTCGTGTTGTCGCCCATGACCGCCGTGGTCATGGCCGTTCCGCCCAGGTTGCCGATGGTCACGACATGGACCACTACGCCGCCGACGCCTTTGCCGTCGTTGAAGCGCTCGATCTGAAGAATGCCGTGCATATCGGCCATTCCACCGGCGGCGGCGAAGTTGCCCGTTACGTCGCAAAGCATGGCGAACCCGCCGGCCGTGTCGCCAAGGCGGTTCTCGTTTCCGCCGTGCCGCCTCTGATGCTGAAGACCGAAGCCAATCCGGAAGGTCTCCCGATGGAGGTTTTTGACGGCTTCCGCTCCGCGCTTGCCGCAAACCGTGCCCAGTTCTTCCGCGATGTCCCGGCCGGCCCGTTCTATGGCTTCAACCGCGATGGCGCGACCGTGCATGAAGGCGTGATCCAGAACTGGTGGCGTCAGGGCATGATGGGTGGTGCAAAGGCCCATTACGATGGCATCAAGGCCTTCTCGGAAACCGACCAGACCGATGACCTGAAGAATATCAGCGTTCCGACGCTTGTTCTGCACGGTGAAGACGACCAGATCGTTCCGATCGCCGACTCCGCCCTGAAATCGGTGAAGCTGCTGAAAAACGGCACGCTGAAGACCTACCCCGGCTTCTCGCATGGCATGCTCACCGTCAATGCCGATGTGCTGAACGCCGACCTGCTGGCCTTCATCCGGTCCTGA
Protein sequences of DBSCAN-SWA_40 >NC_022536|425826:429037|428056_429037_+|WP_022557528.1|DBSCAN-SWA MTTNITSVTRRNMLLTAGTAVAVTAMYPVFGFAAKSDSATTATEGTKTMSTVTTRDGTEIFYKDWGPKDAQPIVFHHGWPLSSDDWDAQMLFFVSKGFRVVAHDRRGHGRSAQVADGHDMDHYAADAFAVVEALDLKNAVHIGHSTGGGEVARYVAKHGEPAGRVAKAVLVSAVPPLMLKTEANPEGLPMEVFDGFRSALAANRAQFFRDVPAGPFYGFNRDGATVHEGVIQNWWRQGMMGGAKAHYDGIKAFSETDQTDDLKNISVPTLVLHGEDDQIVPIADSALKSVKLLKNGTLKTYPGFSHGMLTVNADVLNADLLAFIRS >NC_022536|425826:429037|425826_427494_+|WP_022557526.1|DBSCAN-SWA MTAAGSREHQMFPALDPQQIATARRFADDEPRYFLPGEMIFNVGERHAPAWLVLEGSIEVLRQDGLSSSVPVTRHTTGQFSGEVSQLSGRPSLASGRAGAEGCVAVSFDAPHLRALIIGTADIGEIVMRAFILRRVALVDQGGAGSVLVGQPGSADLIRLQGFLARSGYPYVALDADADGQGRDLVHRLGILREELPLMVCPGGAILKNPTDGEAAVCLGVAQEIVSGAVYDVAIIGAGPAGLAAAVYAASEGLSVLAIDERSAGGQAGASARIENYLGFPAGISGQNLAARAFNQAIKFGVEVALPIPVIDLRVVQSSWGTGSILRLGLDGERSVDARTVVIACGARYRRPQLANLAEFEGSGVSYWASPIEAKLCEGEEVVLVGGGNSAGQAIVFLAPKVRQLHVVVRRPLKDTMSSYLIDRIGALSNVEIHVDSEIVGLEGDRSTGLSGAVMRHRQSGGERTFKVRHLFLFIGADPNTAWLRSDVDVDDKGFILTGRASSGRTSVLPLETSLQNVFAIGDVRAGSTKRVAAAVGEGAAVVSQIHECLGRQPG |
2 | Orpheovirus(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_41 |
442952 : 443765
Sequences of DBSCAN-SWA_41
Nucleotide sequences of DBSCAN-SWA_41 >NC_022536|442952:443765|DBSCAN-SWA CCTATCGAATGATCGCATCCGCGACTTCATGCAGCGCGGCCTTCACGTCAGGCTCGTCGGTCAGCGGCTCGATCATCGCCGCGCGCACGGCCTGACGGGCGGGAAGGCCGGTATCGATGAGGCTGGCGCAATAAACGAGCAGGCGGGTGGAGACGCCTTCTTCCAGATCGTGACCTTTCAGGCCGCGCAACCGGTGGGCGAGATCGACGAGCGGCTCGACGTCGCGCGCATCGAGCCCGCTTTCATGCGAAACGACGGCAATCTCCTGCTCCTTCGGCAAAAAATCGAATTCAATGGCGACGAAGCGCTGGCGGGTGCTGGGTTTCAGGCTTTTCAGCAGGTTCTGGTAGCCGGGATTGTAGGATACGACGAGCATGAAACCCGAGGGCGCTTCCAGCACCTCGCCGGTGCGCTCCAGCGGCAGGATACGACGGTCGTCGGTGAGCGGATGCAGCACAACGGCCACATCCTTGCGCGCTTCCACAATCTCGTCGAGGTAACATATGCCGCCCTGACGCACGGATCGTGTCAGCGGGCCGTCCATCCACACGGTTTCACCGCCTTTCAGCAGGTAACGACCGGTCAGATCGGCGGCGGCAAGATCGTCGTGACATGAAACGGTCGAAAGCGGCAGGCCGAGTTTCGCCGCCATATGGCTGACGAAACGGGTCTTGCCGCAACCTGTCGGGCCTTTCAGCAGCAGCGGCAATTGCCGCACCCAGGCGCTTTCGAACAGCGTGCATTCATTGCCAAGCGGTGTATAGAACGGCGTGTCCGGCAAAGGCTGCGGCGGGGGGCGGAAAATCGTATTCAT
Protein sequences of DBSCAN-SWA_41 >NC_022536|442952:443765|442952_443765_-|WP_022557546.1|DBSCAN-SWA MNTIFRPPPQPLPDTPFYTPLGNECTLFESAWVRQLPLLLKGPTGCGKTRFVSHMAAKLGLPLSTVSCHDDLAAADLTGRYLLKGGETVWMDGPLTRSVRQGGICYLDEIVEARKDVAVVLHPLTDDRRILPLERTGEVLEAPSGFMLVVSYNPGYQNLLKSLKPSTRQRFVAIEFDFLPKEQEIAVVSHESGLDARDVEPLVDLAHRLRGLKGHDLEEGVSTRLLVYCASLIDTGLPARQAVRAAMIEPLTDEPDVKAALHEVADAIIR |
1 | Halovirus(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_42 |
450813 : 451791
Sequences of DBSCAN-SWA_42
Nucleotide sequences of DBSCAN-SWA_42 >NC_022536|450813:451791|DBSCAN-SWA CATGGAACTGATCTGTCCTGCAGGAACGCCTGCCGCCTTCCGCGAGGCCGTGGATGCCGGGGCGGATGCGGTCTATTGCGGTTTTAGCGATGAAACCAATGCCCGCAATTTCCCCGGCCTTAATTTTTCCCGCGAGGAACTTGCCGAGGCCATCGTTTACGCAAAAAAGCGCGGTGTACAGACCTTCGTGGCGCTCAACACCTTCATGCGTGCTGGCAATGAGGATATCTGGTATCGCGGCGCGGCCGATGCCGTGAAGGCGGGGGCGGACGCACTTATCCTTGCCGATTTCGGCCTGATGGCGCATGTGGCGGAACATCATCCGCAGCAGCGTATCCACGTCTCCGTGCAGGCCTCCGCCTCCAATGCGGATGCGGTGAATTTCCTCGTCGATGCCTTCGGCGCGAAACGCGTGGTGCTACCGCGCACCCTGACTATCCCCGACATCGCCCGGTTGGCGCGGCAAATCCGCTGCGAGATCGAAATTTTCGTGTTCGGCGGTCTCTGCGTCATGGCGGAGGGGCGCTGCTCGCTGTCGTCTTACGCCACAGGCAAGTCACCCAACATGAACGGCGTCTGTTCGCCGGCAAGCCACGTCCGCTACCGGCAGGACGGGCAGGCGCTGGTGTCGGAACTTGGCGATTACACCATCAACCGTTTTCCGGCCGGCGAGGCGGCGGGTTACCCCACGCTGTGCAAGGGCCGTTTCGAGATCGCCGATGACAGGTCCTATGCCTTTGAGGATCCGGTGTCGCTTGATGTGATGGACCAGATCGATGCCTTGCGCGAGGCGGGCGTCAGCGCACTGAAGATCGAGGGCCGCCAGCGCGGCAAGGCCTATGTGGCGGAAGTGGTTTCCACCCTGCACCGGGCGCTGGCCGCTAGCGCCGAAGAACGGGGACGGCTGCTCTCGCGCCTGCGGCTCCTAAGCGAAGGCCAGCGCACCACCGTCGGCGCTTATGAGAAACGCTGGAGATGA
Protein sequences of DBSCAN-SWA_42 >NC_022536|450813:451791|450813_451791_+|WP_022557556.1|DBSCAN-SWA MELICPAGTPAAFREAVDAGADAVYCGFSDETNARNFPGLNFSREELAEAIVYAKKRGVQTFVALNTFMRAGNEDIWYRGAADAVKAGADALILADFGLMAHVAEHHPQQRIHVSVQASASNADAVNFLVDAFGAKRVVLPRTLTIPDIARLARQIRCEIEIFVFGGLCVMAEGRCSLSSYATGKSPNMNGVCSPASHVRYRQDGQALVSELGDYTINRFPAGEAAGYPTLCKGRFEIADDRSYAFEDPVSLDVMDQIDALREAGVSALKIEGRQRGKAYVAEVVSTLHRALAASAEERGRLLSRLRLLSEGQRTTVGAYEKRWR |
1 | Phage_TP(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_43 |
455058 : 460629
Sequences of DBSCAN-SWA_43
Nucleotide sequences of DBSCAN-SWA_43 >NC_022536|455058:460629|DBSCAN-SWA GATGAGCGACCGTTCCATCCTGCGTTTCGAACATGTCGGCCACGCCTTTCTCGGCCGCAGCCTGTTCGAGAATTTCGATCTCGGCATAGCGCCGGGGGAAACCGTGGCGCTGCTCGGCCCCTCCGGCAGCGGCAAGACGACGATTTTACAGATCGCGGCTGGTATCATCGATCCTGTTCGCGGCCGCGTGCACCGCCATTATCGCCGGCAGGGTCTGGTGTTTCAGGAGCCGAGGCTGCTGCCGTGGATGACGCTTATCGACAATATTGCCTACGGCCTTGCGGCGGCCGGCTTGCCGAGACGGGAGCGGCGCGAAAGGGCGGGCCTTTTTGCGCTGGAGGTCGGTCTGGAGGTGGCTGATTTCGGCAAATATCCGGTGGAACTCTCCGGCGGCATGCGCCAGCGCGCCGGCGTGGCGCGGGCGCTCGCCGTGGAGCCGGACATGCTGTTTCTGGATGAGCCCTTCAGCGCCGTCGATGTCGGCCTGCGCCGTCATTTGCAGGAGCTTCTGGTGGGCGCCGCCCGTCGGCGCGGTTTTTCCACCCTTCTCGTCACCCATGATCTTCATGAGGCGTTGCTGGTTGCCGACCGGCTGATCGTGCTCTCCGGCGTCGATGGCCGCGTCATTGCCGCGCATAAGCCCGCGGGTTCGCCGGGCCGGCGCATGGCGCGGGCGGTTTTCGACGAAGCGGAACGCCTGTCCGAGAGCCCGGCCTTCGCCGAATTGTTTTCAGCAAGGGAGCGGGTGCGATGAGGCAGCCCCCCATCCTTTCCGAAGCCCTGCGGCTGTTTTTCCCTCTGGCCGCACTGCATGGTGCGGGCTGGCCCCTCCTGTGGATCGTCATCGGTGGTTATGCCCTGCCTTTCGCCGATGCGGTGCCTGCATCGCAATGGCACGCCCATGAAATGATCTACGGCACCTATGGCATGGCGCTTGCCGGTTTCCTTGGCTCGGCGGTGCCGGAATGGACGGACACGACGACGGCGCAGGGACGAACGCTGCTTCATCTCGCCGGTCTCTGGTTGCCGGGCCGCCTTATCGGCTTTCTCGGGATGGAGGCAGGGAGCCTGTTTGCAGGTTTTTTCGATCTCGCCTTTCTGCTCGCGCTCTCTGTTCTCATCGCCAGGGCGATGCTGGCGCGGCGCACGATGAAACACCTGGCGTTCCTTATCTGGCTTCTTCTGTTTACGGCGGCTGAAGCCGGTGTGCGTTATGCCTGGTGGAGCGGCGATCTCGAACTTGCATACCGCATGCTGGAGGCGGCGCTGTGCATTTTCACCGTGCTGTTTTCGCTTTCCGCCGCACGTATCAATGTGGTCGTCATCAATCTGGCGCTCGATCCCGGCGGCGAGACCACGCCTTACCGGCCGCATCCCGGCCGCCAGCATATGGCAGGCGCCATGGTGACGCTTTACATGGCGGCGAAACTTTTCTTTCCGCAGAGCGATGTCTGCGCCTGGCTTTCGCTTGCGGCCGGCGCCGCCTTTTTCGACCGGCTGGCCGAATGGTTCATCGGGCGTGCCGTCTTTAAGACCGAGGTGCTGTTGCTCGGCCTCGGCAACGCCTTTGCGGGCGTGGGTTTTCTGGCGCTCGGCGCGACACGGCTCGGTTTTTCCGTTACGCCCGCCGCAGGGCTGCATCTGCTCTCGGTCGGGGCGCTTGGATGCGCCATCATGGCCGTCTTCATCATTGCCGGGCTGCGGCATACGGGCCGCAATCTGACGCATCTTCCATGGCAGGCGCATGTGGCCGCCCTTTTGATGGCGATGGCCGGGCTGGTCCGCATCCTCCCGGAATTCGACTTTGCCGCAACATTGTCTCCCTATCACCATGGCCTCAGCGCCGTTTTGTGGGCGGTGAGCTTCGGGGTCTGGTTGCAGAGTTTCCTGCCCTTCATGCGCGCGCCTGGCATGGACGATGCAGGAGCCTGCGGATGAGAGCGAGACACTGTTTGCCCGTCTAATGTTTATTTTTTGAGCAGAAGCACAATGCGAAATCAGTGATTTAGGCGAGCCGTCTTGATGAATGTTAATTTTGATAAATAATCAATAAAATCAAGTGGTTGAATATTTCTGTCTTTTGGGGGATGATGACTCGAAACGAATAGGAAACGCGAATATGCCTAAAATAGTCGCGCCTCAACACGCAGATGAAAAGCCAGGTCGGACGAGGGAACTTGTGACCTTCGCCGTTCTGGCCTTCGGAATCTGGCCCATTCTGGCGGTCGGATTTGTCGGAGCCTATGGCTTTATCGTCTGGATGTTCCAGATCATTTACGGCCCGCCGGGGCCACCCGGACATTGAGGGCGAGATGATGATGGAAGTCTCTCTCAGCCGGCGCGATTTCCTGCGCGGCGGGCAAAAAAATAGACCACGCATCTGTCCCCCCGGGGTCGCGTTGAGCGACCTCGCCGCGTGCAGCGGTTGCGCCAAATGCGTCGAGGCCTGTCCTACCGGCATCATCGCCATGGCGGACGGTTTGCCTTGCGTGGATTTCTCCGCCGGGGAATGCACCTTTTGCGGCAAATGTGCTGAGGCCTGTCCCGAGCCGGTCTTTGCTGCCCCCACGGCACAGCGCTTCGGTCACGTCACGGCGATCGGTGAGGGGTGTCTCGCCTTCGGCAATATCGATTGTCAGGCCTGCCGCGATGCCTGTCCGACCGAGGCCATCCGGTTTCGGCCCCGGCGCGGCGGCCCCTTCGTTCCGGAACTGCTGGAAGATGCCTGCACCGGCTGCGGGGCCTGCGTGTCCGTCTGCCCGGCCGGCGTGATCGAGATCAAAGATAGAGCCACGGAGATGCAATATGCCTGAAAATACAGGCCGGTATCATGTTTCAAGCGCCGTGGTGGCGGTGATGCCGCAGATGCGGGACGCCGTGCTGGCAACGCTTTCGACGCTCGACAATGTCGAGGTTCATGGCGAGGGCAACGGCAAGATCGTCATCGTCATAGACGGCACGAGCACCGGCATGCTGGGCGATACGCTCACTTATATTTCGACGCTCGACGGTGTGATTGCCGCCAACATGGTTTTCGAACACGTCGACACAGAGGAGACAAGCGGCGATGAGCAGCGAACTGACGCGGCGTGATCTATTGAAGGCCCATGCCGCCGGCATTGCGGCGGCAACGGCGGGCATTGCGCTGCCGGCCGCCGCCCAGCCGGTGCCAGGCGGGGTTTCCGCATTGCAGATCAAGTGGTCCAAGGCGCCCTGTCGTTTCTGCGGCACGGGCTGCGGCGTCATGGTCGGCGTCAAGGAAGGCAAGGTCGTCGCCACCCATGGCGACATGCAGGCGGAGGTCAATCGCGGCCTCAACTGCATCAAGGGCTATTTCCTGTCCAAGATCATGTATGGCAAGGACCGGCTGCAAACTCCGCTTCTGCGCAAGAGAAACGGCGTCTACGCCAAGGATGGCGAGTTCGAGCCTGTGAGCTGGGACGAGGCCTTCGACGTCATGGCCACGCAGTGCAAGCGGGTGTTGAAGGAGAAGGGACCGACGGCCGTCGGCATGTTCGGCTCCGGCCAATGGACGATCTTCGAAGGTTACGCCGCAACCAAGCTGATGCGCGCCGGCTTCCGCTCCAACAATCTCGATCCTAATGCCCGTCACTGCATGGCGTCTGCTGCCTATGCCTTCATGCGCACCTTCGGCATGGACGAGCCGATGGGCTGTTACGATGATTTCGAACATGCCGATGCCTTCGTGCTCTGGGGTTCGAACATGGCGGAGATGCATCCCATCCTGTGGACGCGCATTGCCGACCGGCGTCTGGGCTTCGACCATGTGAAGGTGGCAGTGCTTTCGACCTTCACCCATCGCAGCATGGACCTTGCCGATATTCCGATGGTCTTCAAACCGGGCACGGATCTCGTCATCCTCAATTACATCGCCAACCACATCATCAAGACCGGACGCGTCAACGAAGACTTCGTGAGGAACCACACGAAATTCGTGCGCGGTGTCACCGATATCGGTTATGGCCTGCGGCCCGACAATCCGGTTGAGGTAAATGCCGCCAATTCCGCCGATCCAACCAAGACCGAAGCGATCGATTTCGAGACCTTCAAGGAATTCGTCTCCGAATACACGCTGGAAAAGACCGCAGCCATGACCGGCGTTGAAGCCGGTTTTCTGGAGGAGCTGGCCGAGCTTTATGCCGACCCGAAACGCAAGGTCATGTCGCTGTGGACCATGGGTTTCAACCAGCATGTTCGCGGCGTCTGGGCCAACCAGATGGTCTATAACATCCATCTTTTGACGGGTAAGATTTCCGAGCCGGGTAATAGCCCGTTCTCGCTCACCGGCCAGCCCTCGGCCTGCGGCACGGCGCGTGAGGTGGGAACCTTCGCCCACCGCCTGCCTGCCGACATGACGGTGACCAACCCCGAGCACCGCAAACATGCCGAAGAAATCTGGCGCATCCCGCACGGCATCATCCCGGAAAAGCCGGGTTACCATGCCGTGCAGCAGGACCGCATGCTGCATGACGGCAAGCTGAATTTCTACTGGGTGCAGGTCAATAACAACGTACAGGCAGGTCCCAACACCAAGAACGAGACCTATCAGGGATATCGCAACCCGGAAAACTTCATCGTTGTTTCCGATGCCTATCCGACCATCACGGCTATGAGCGCCGACCTCATCCTGCCCGCCGCCATGTGGGTGGAAAAGGAGGGAGCCTATGGCAATGCCGAGCGGCGCACCCATGTCTGGCACCAGCTTGTCGAGGCCTCGGGTGAGGCGCGTTCCGATCTCTGGCAGCTGGTGGAATTCTCCAAGCGCTTCACCACGGATGAGGTGTGGCCGGCGGAGATACTGGACGCCAATCCCGCCTATCGCGGAAAGACGCTGTACGAGGTGCTCTTCAAGGACAGTGATGTCGGCAAGTTCCCGCTGAGCGAGATCAACGCTGAATACGAAAACCAGGAAGCAAAACACTTCGGATTCTATCTCCAGAAGGGTCTCTTTGAGGAATATGCCGCCTTCGGGCGGGGCCACGGCCATGATCTGGCGCCCTATGATGCCTATCACGAGGTGCGCGGCATGCGCTGGCCGGTGGTGGACGGCAAGGAAACGCTGTGGCGTTACCGAGAGGGTTACGACCCTTATGTAAAGCCGGGCGAGGGCGTGAAATTCTACGGCAACAAGGACGGCAAGGCGGTCATCATTGCCGTGCCTTACGAACCGCCGGCGGAATCCCCGGATGCGGAATTCGATACCTGGCTGGTGACGGGCCGCGTGCTGGAGCATTGGCATTCCGGTTCCATGACCATGCGTGTGCCGGAACTCTACAAGGCGTTCCCGGGCGCCCGCTGTTTCATGAATGCCGACGATGCGCGAAAGCGTGGCCTCAATCAGGGCGCGGAAATCCGCATCGTGTCGCGTCGCGGCGAAATACGTTCCCGGGTGGAGACGCGCGGCCGCAACCGCATGCCGCCAGGCGTCATCTTCGTTCCCTGGTTCGATGCCAGCCAGCTCATCAACAAGGTCACGCTCGACGCAACCGATCCCATCTCCAAGCAGACGGATTTCAAGAAATGCGCAGTCAAGATAGAGCCAGTCGCATGA
Protein sequences of DBSCAN-SWA_43 >NC_022536|455058:460629|455807_456995_+|WP_022557562.1|DBSCAN-SWA MRQPPILSEALRLFFPLAALHGAGWPLLWIVIGGYALPFADAVPASQWHAHEMIYGTYGMALAGFLGSAVPEWTDTTTAQGRTLLHLAGLWLPGRLIGFLGMEAGSLFAGFFDLAFLLALSVLIARAMLARRTMKHLAFLIWLLLFTAAEAGVRYAWWSGDLELAYRMLEAALCIFTVLFSLSAARINVVVINLALDPGGETTPYRPHPGRQHMAGAMVTLYMAAKLFFPQSDVCAWLSLAAGAAFFDRLAEWFIGRAVFKTEVLLLGLGNAFAGVGFLALGATRLGFSVTPAAGLHLLSVGALGCAIMAVFIIAGLRHTGRNLTHLPWQAHVAALLMAMAGLVRILPEFDFAATLSPYHHGLSAVLWAVSFGVWLQSFLPFMRAPGMDDAGACG >NC_022536|455058:460629|455058_455811_+|WP_048903049.1|DBSCAN-SWA MSDRSILRFEHVGHAFLGRSLFENFDLGIAPGETVALLGPSGSGKTTILQIAAGIIDPVRGRVHRHYRRQGLVFQEPRLLPWMTLIDNIAYGLAAAGLPRRERRERAGLFALEVGLEVADFGKYPVELSGGMRQRAGVARALAVEPDMLFLDEPFSAVDVGLRRHLQELLVGAARRRGFSTLLVTHDLHEALLVADRLIVLSGVDGRVIAAHKPAGSPGRRMARAVFDEAERLSESPAFAELFSARERVR >NC_022536|455058:460629|457176_457362_+|WP_022557563.1|DBSCAN-SWA MPKIVAPQHADEKPGRTRELVTFAVLAFGIWPILAVGFVGAYGFIVWMFQIIYGPPGPPGH >NC_022536|455058:460629|458124_460629_+|WP_022557565.1|DBSCAN-SWA MSSELTRRDLLKAHAAGIAAATAGIALPAAAQPVPGGVSALQIKWSKAPCRFCGTGCGVMVGVKEGKVVATHGDMQAEVNRGLNCIKGYFLSKIMYGKDRLQTPLLRKRNGVYAKDGEFEPVSWDEAFDVMATQCKRVLKEKGPTAVGMFGSGQWTIFEGYAATKLMRAGFRSNNLDPNARHCMASAAYAFMRTFGMDEPMGCYDDFEHADAFVLWGSNMAEMHPILWTRIADRRLGFDHVKVAVLSTFTHRSMDLADIPMVFKPGTDLVILNYIANHIIKTGRVNEDFVRNHTKFVRGVTDIGYGLRPDNPVEVNAANSADPTKTEAIDFETFKEFVSEYTLEKTAAMTGVEAGFLEELAELYADPKRKVMSLWTMGFNQHVRGVWANQMVYNIHLLTGKISEPGNSPFSLTGQPSACGTAREVGTFAHRLPADMTVTNPEHRKHAEEIWRIPHGIIPEKPGYHAVQQDRMLHDGKLNFYWVQVNNNVQAGPNTKNETYQGYRNPENFIVVSDAYPTITAMSADLILPAAMWVEKEGAYGNAERRTHVWHQLVEASGEARSDLWQLVEFSKRFTTDEVWPAEILDANPAYRGKTLYEVLFKDSDVGKFPLSEINAEYENQEAKHFGFYLQKGLFEEYAAFGRGHGHDLAPYDAYHEVRGMRWPVVDGKETLWRYREGYDPYVKPGEGVKFYGNKDGKAVIIAVPYEPPAESPDAEFDTWLVTGRVLEHWHSGSMTMRVPELYKAFPGARCFMNADDARKRGLNQGAEIRIVSRRGEIRSRVETRGRNRMPPGVIFVPWFDASQLINKVTLDATDPISKQTDFKKCAVKIEPVA >NC_022536|455058:460629|457369_457870_+|WP_034499579.1|DBSCAN-SWA MMMEVSLSRRDFLRGGQKNRPRICPPGVALSDLAACSGCAKCVEACPTGIIAMADGLPCVDFSAGECTFCGKCAEACPEPVFAAPTAQRFGHVTAIGEGCLAFGNIDCQACRDACPTEAIRFRPRRGGPFVPELLEDACTGCGACVSVCPAGVIEIKDRATEMQYA >NC_022536|455058:460629|457862_458150_+|WP_003520944.1|DBSCAN-SWA MPENTGRYHVSSAVVAVMPQMRDAVLATLSTLDNVEVHGEGNGKIVIVIDGTSTGMLGDTLTYISTLDGVIAANMVFEHVDTEETSGDEQRTDAA |
6 | Bacillus_virus(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_44 |
464576 : 465353
Sequences of DBSCAN-SWA_44
Nucleotide sequences of DBSCAN-SWA_44 >NC_022536|464576:465353|DBSCAN-SWA ATTATAATAGGCGTCCTTGCTCCTCCACCGGTTCTCCACTTTGCGAAATGATCGTTGATCCGTATGGTTCGCGTGACGAAACGATCAGCGCGCCGTTCGGGAGCGGGCGGGCGAGCTCCTTTGCCTCATTCCACGGAGCCCGCATCCAGACTTCGGTTTCTTCCTTCGTCAGCAGCAGGACGGGCATGGCCTTTTCATGGATTGGCTTGACGAGGTCATTCGGATCGGTCGTCAGGAAACCGTATAGGTCGTCGGTTGTGAGCCCGTCCCTGACCTTGCGGACGCTTTTCCATTGCGGCACGTGGATGCCAGCAAAGAACATGAGTGATTTCGCCTCGTCTCGAGCGAACCAGGCGTTAGGCACATTGCCACCCTCCTGTTTGCTCATCGGATCCGGTTCAGCAAAGCTTGTGACCGGGACAAGGCACCTGTGCTCGACGCCGAACCATCGCGTCCAGTGAGGGAGATTGAGTTTGCGCACATTCGTGACACCACGGTCCGGCTCCATGCGGATAAGCTCATCCATATCGACGGCCTGACGTTTGGCCTTCAGTTTTCCCGCCCTCGCATCCGCAGCTTTCTTCTGCACGAAAATCGGCGAGGGCAGGCCCCATCGTGCATGAACCAGCTGCTTCTTGCCGTCCGCCGTGTTTCTAACGATCGGCCCCATCTGGTCGGGGTTCATCTGATAGGCCGGCATGAGGTTGATTAGGCTTTCGGCGTCCTGGGCCCACTTCGAGACCCAGTCCTTGTCCTCCATCCGATAAAGGTTGCACAT
Protein sequences of DBSCAN-SWA_44 >NC_022536|464576:465353|464576_465353_-|WP_022557577.1|DBSCAN-SWA MCNLYRMEDKDWVSKWAQDAESLINLMPAYQMNPDQMGPIVRNTADGKKQLVHARWGLPSPIFVQKKAADARAGKLKAKRQAVDMDELIRMEPDRGVTNVRKLNLPHWTRWFGVEHRCLVPVTSFAEPDPMSKQEGGNVPNAWFARDEAKSLMFFAGIHVPQWKSVRKVRDGLTTDDLYGFLTTDPNDLVKPIHEKAMPVLLLTKEETEVWMRAPWNEAKELARPLPNGALIVSSREPYGSTIISQSGEPVEEQGRLL |
1 | Sinorhizobium_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_45 |
474887 : 475877
Sequences of DBSCAN-SWA_45
Nucleotide sequences of DBSCAN-SWA_45 >NC_022536|474887:475877|DBSCAN-SWA CATGCCTTTCAAACACAACGCCTGCCGCCGTCATTGTATCGGCAAGATGAAATTCAAGGTCACGAACTGGGCGGAATACGAGGCTGGTCTTCGTCAGCGTGGCAGCCTGACCCTTTGGGTGACGCCGGAGGCACTATTGTCCTGGCCGGCACCGAAGCGGACGACACGCGGTGGCCAGCCACGTTATTCCGATCTGGCGATCGAGACAGCTCTGACGCTGGGACTGGTGTTCGGCCTGAGGCTGCGCCAGACGGAAGGTTTTGTCGCATCGGTGCTGAAGCTGATTGGACTGGACCTCCCCGTTCCTGACCATTCGACCCTGAGCCGTCGGGCCGGTAAGCCGGGAGTGCCGAAGAAGCGGCACGATGATCGGATCCCTCGGAAAGGCCCCGTCCACGTCCTGATCGATAGCACAGGGCTAAAGGTCTACGGCGCTGGCCAATGGCTGGAGGAAAAGCATGGTGTCAAATCCCGCAGGGGTTGGCGCAAGCTGCACTTAGCACTGGATGCCGACAGCGGCGACGTCATCGCACATGTCGTGACCGGTCAGGATGCTGGCGATGCCTCGCAGGTGGAGCCGCTGCTCGAACAGATCGATCGCCCGATTGGCCAGTTCACGGCCGATGGCGCTTATGACGGCGAGCCAACTTACGACGCGGTCGCCAGACACAGTGGCGATGCGACGGTCGTTATTCCGCCACGCGCCAATGCGTTGGAGCGGTCGGACAGTCATCCGCCGGGCCAGCGAGACCGTCATATCGCGGCGATCAACGCAGATGGAAGAATGAAGTGGCAGATCGCCACCGGCTACGGCAAACGATCCCTGGTAGAGACAGCGATCGGTCGTTACAAATCGATCATCGGACGGCGGCTGCGGGCACGCTCACTCGCGGCACAGCGGACGGAGGTCGCCATCGGCTGCGCCGTCCTCAATCGAATGCTGGACTGCGCACGCCCGAAATCCGTCCGCGGCAAAAAGGCGGCCGTCTAG
Protein sequences of DBSCAN-SWA_45 >NC_022536|474887:475877|474887_475877_+|WP_022557583.1|transposase|DBSCAN-SWA MPFKHNACRRHCIGKMKFKVTNWAEYEAGLRQRGSLTLWVTPEALLSWPAPKRTTRGGQPRYSDLAIETALTLGLVFGLRLRQTEGFVASVLKLIGLDLPVPDHSTLSRRAGKPGVPKKRHDDRIPRKGPVHVLIDSTGLKVYGAGQWLEEKHGVKSRRGWRKLHLALDADSGDVIAHVVTGQDAGDASQVEPLLEQIDRPIGQFTADGAYDGEPTYDAVARHSGDATVVIPPRANALERSDSHPPGQRDRHIAAINADGRMKWQIATGYGKRSLVETAIGRYKSIIGRRLRARSLAAQRTEVAIGCAVLNRMLDCARPKSVRGKKAAV |
1 | Salmonella_phage(100.0%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_46 |
484764 : 500735
Sequences of DBSCAN-SWA_46
Nucleotide sequences of DBSCAN-SWA_46 >NC_022536|484764:500735|DBSCAN-SWA CTCAGAAGTAGCGAACTTCCTCAGCGTCGATGCTCAGAGGGTCTTTGCCGCGCTTGGTTAGCCACGAATAGACCGGCGGAACGCGATCAGCGATCGATTCGACGTGAATGTCGATCAAGCATGGGCCGGATGTATATGCAAACGCTTCGGCCAGCGCCCTGTCGAGTTCTTTCGAATTGGTGGCGGTCCAGGCCTTGACGTCAAATGCTTCCGCAATCGCCTGCCCACGCGGCGGCATGAAATCCACGCCGAAGCACTGGTTGTGTCCCTTCAGGCGGTGCAGCCCTTTGATCCAGCCGAAGGTGCCGTTGTTGAACAACAGGAGAATGGCGGGAACCTGAAGGCGTACCAAGGTTTCAAGCTCGCCTACCGTCATGCCGAACGAGCCGTCACCGAACATGCCGATCGGCCGTCGCTCCTTGTTCGCAAACCACGCGCCGACCGTGGCAGGCAGAGCAGAGCCCAGGCCGCCGAATGCCCGCGGTATGGCGAACCTGGTCCTGCGGTCCTTCAGTTTCAGGAAGCGCGTCATGTGCGGGGTTGGCGTGCCGGCGTCAGAGTAAATATGGGCCGGTTTCCCGTAGGCTTCCAATGCTTCGTTAAAGCAGCGCACTGCTCGTTCGGGGCGCAGCGGAGCGGCTTCGGAGCTCAAGAGCGCTTCGGCATTCTCCCAAAAATTGGCGCGCAACACGTTCAATTCGTCGACCCATTCCTGTGTCCGGGTCGCATCATTATCAACAGGGGCCATGTCAATGAGGCGTTGGAGCACGAGGCGTGCATCACCCTGCACCGATACGACATTTTCGTAATTATTGGCCATGATCTCCGGATCGATATCGATCTGGGCGACGCGCTTGTTGAGCGTGATCTTCGGGAAGGTCCAGCCGATGGTAACAACCGAACCCATTCTTGATCCGACGAAGAGGACAAAATCGGCGTGTTCCAGAGACCAGTTGGCATGCGGATGAAATCCGTTGTCGCCAATGACGCCCACAGCCAACCGGTGATCATCAGGCATGGTGCCCTGGCCGGTCATTGTGGTGCAGACTGGAATATTCAGGCGTTCGGCAAGCTCAGTGACTTCCGGTCCGGCGCAGGAGCGGTTGACGCCGCCGCCTGAAACGATCAACGGCTTCTTGCTGCTCGTCAGCATCGACATCACGGCATCCAGCTTGTCCTGCGGCGGAAGCGTGGGATAAGCGGGAAACTGCCTGCACTCCGTTTCGATGTGAAGCGATACCCGCGTCGGGTCCACCTCAGCGATAAGCATGTCCTCCGGAATCTGGAGGTGTACGGCACCGGGCTTGCCGGAGCAGGCAACGCGGAAGGCACGGCGAATGATTTCCGGCAATTTCTCCGCCGATTTCACCTGCACCGACATTTTGGTGATCGGGTCGAACAGGCGAGCGCAATCCAGTTCAGTCAAGACGCCGCGGCCTTCGCCGGGGAGGGGAATGTCGATCGTCAGCAGAATGACCGGAACCGAAGAGGAATTCGATTCAGCGACCGGAGGCAGTGAATACATGGCGCCTGCACCCGATGGGCACTCGAAGACACCGGGTTTGCTGGTGAAACGTCCATAGGCATCCGCCATGTAACCGGCCGAGCGTTCATCGCGTGCCATAACGTGACGAATTTTGTCTTCACGTTCCTGCAATGCCTCGTAAAAAGGAACGTTGGTGTCTCCCGGTACGCCGAATATTACTTCGACCCCATATCCGATCAGCATGTCCACCAGAATATCTGCGCCGCGCACGACGGTCTCCTTTCGTTACTTTGAAATTTAGTCACAAACTGACAGTGTTTTGAGGATGATCACTGTGACCCGCCCATGCTGCTGGATGCATGATGGGATGCTCTCCCAACTTTCACGAGCAGGGCTTACTTAAGTGAAACGTCTTGGCGAAAACGCCGCGATGCTGGAGGTGGGTGTCCTGCCGCTCACCATATCCGCGACGATCCGGCCAGTGATCGGACCGGATGCAAAGCCGACATGCCCGTGTCCAAAGGCGTAGAAAACGCTGTTCGAGCGGGAGGATGCGCCGATGACCGGCAAGCCGTCCGGTGTGGACGGGCGGTGGCCCATCCAGCGGTCCAGCTTCATCTCCGCGCGCGGAGTGAGGTGATCGTATGTGGCAAGGGCGTGCTCGATAAGAATATCGACACGCTTCCAGTTGGGGGGAGTCTCGACCGATGCCAGTTCGACCTGTCCGGAGAGGCGCAGACCTTTCGAGGTCGGCGTATTCGCCATCTTGCCGTCGCTTGGCATGACTGGATGCGAAGGCCCGTTCTCCTCGGAAACGAGAACGCCGTGATAGCCCCTTTCGGCTTCGAGCGGGATATGGTCGCCCGCCTTGCGTGCGAGTGCTTTGGACCAGACACCGGATGCGATCACGGCCCTGTCGCATTCGAGGCGGCCGTTATCCGTCTCGACCGCAATCAGGCTGTTATCCTTGAACACAAAACCTGTGGCTTGCGCTTTTACAAGAGTTGCGCCACGGGCGACTGCCATCGAGACGATCCCTGCAACGTAAGCGCCGGGATCGACGCAATGGGCGCCATCTTCCGCCAGAACCCCGAAGTTGTAGCGGTGGGAAAGCTGCGGCTCCCGTGCATGCAGTTGCCCGGCATCCAGTTCTTTCCAGCGCAAGCCTGTCATGGAGCGCAGGTGCCATGATAATTTCTCCGCCTCGAAATCCTGTCTGCCCGGATAAGCATACAGCAGGCCTTCCTGACGAATGAGATGGCTGAGGCCCAGTTCTTCAGCCAGTTTGAGGTGTCGTTGCGGGCTGTCTCCAAGCAGGGTTGCCAGCGCATGCGCGGTTTTTTCGACCCGCGCGACCGTCGAGCCTGCGATGAGAAATCTCAACAGCCATGGCAGGAGCTTGAGAACATGCCGCCAACGGATGACGAGTGGGCCTTTGGGATTCAACAAATAGCTCGGAATCTGCCTCCACAGGCCAGGCATCGACATGGGCACGACCGACGCGGGGCTGATCCAGCAGCCGTGTCCATAGCTGGCGCTCTGACGCCCGCCCGGCTCGCCGGCTTCCAGAATTGTAACGTGATGGCCGTCCCGGATTAATTCCAGCGCCGCACTTGCACCCACTATGCCTGCGCCAATTACGATTACGTGCATGTGTATCATTCCCGAGCATTACAGCGGTTAGACGATCGCTGATCGATGGGCGCAAAAATCCGTCTCCTGCTGTGACGCGGGAGTTGGGTTCATCGCTTTTCGATCACTTTAATGGACCCCGGGGCAGAGCCGGGATGACCTGTTTTTCAGTTTGCGTGCCAACCAGGACCAACGAGTGGGCCAATCGGCAAATCAGTTCGGACCAAGACCATTATGGAAGTGGCTCAGGAAGGATCGTGTCGCCGGTTGAAGCGGATTGTCGATAACCTGGCGCGCCTCGCCTTCCTCAACGACACATCCATCACGCATGAAGACGACACGGTCCGCCACGTCGCGTGCAAATGCCATTTCATGCGTGACCAGCAGCATCGTCATGCCTTCGGCGGCAAGCGAACGGATGACGGACAACACCTCTCCCACGAGTTCGGGATCGAGCGCCGACGTCGCTTCGTCGAACAGCATGACTTCAGGCTTCATCGCCAATGCCCGAGCGATGGCGACGCGTTGCTTCTGTCCACCGGACAAACTGTCGGGGGGAGCCGTTGACTTATCGGCGAGCCCTACCTTTTCGAGCAGTCCGCGGGCGATCTCCCGCGCCTCGGCCCTGGATTTTCGCAGCACCGTGATCGGCCCTTCCATCACATTTTGCTCGACGTTCATGTGCGGAAAAAGATTGAAGCTCTGGAAAACCATCCCTGTCCTTGCGCGGAAGGCGGCAAGCGCCTTGTCACCGGGCAGCTTCCGCGTGCTGGAAAAGTCAATTGCCTGATCGCCCACGCGAATGAAGCCCTGCTCCGGAACCGTCAGCAGATTGACGCTTCGCAGCAACGTCGATTTTCCTGAGCCGCTGGGGCCAATGAGAGCGACGACGTTTCCTTTCTTCACGTTCAGCGTGATGTCCTTCAGGACCTCGAAGTCGCCGAAGGATTTTTTTACGCCGGAAAGCTCGATCATATTGTGTTCTGACATGATGGTAAGCCTTTCGTGATCAGCGGCCTGCGCGCGCTTCAAGCCTGTGAGCGAAGCGTGTCAGCGGGAAAAGGAGGATGAAATAGATGACAGCGATCATCGTGTAGGTTTCCAGCGGCCGATAGCTCGCCGAAGTGATCAACTGGCCCTGATAAAGCAGATCCGGAACGGCCAGAACCGACAAAAGCGACGTGTTTTTCAACTGTAGAACCGTCTGGTTCATCAAGGGCGGGATCATGCGGCGAAATGCCTGGGGCAAAACCACACGCGCCATGAGCTGATGTCGCCTCATGCCGAGCGCGCGTCCCGCGTCCCATTGGCCAAGATCAACCGAAATGATGCCTCCGCGGATGATTTCGCCGAAAAATGATGCGCCGTAAAGCGTCAACGTGATGAAGGCAGCCGCACCCGGAGATAACTCTATTCCGGTCAGGATCGGCAGCGCATAATAGCACCATACAAGCTGGACCAGGACCGGCGTGCAACGGAATATCTCGATGAAGCCCGTGATAAGCGCGCCGAGGATTTTTACGTTTGAAAGGCGGGCCAGGGCGACCAGACAGCCGACGATCAAGCCTGAAACCGCAGTGCCAATCGTAAACCCGATAGTATAAAGGACACCCGTCCAGATCAGGCTCTGGTACTGTATGAGAAAATTGAAATCCCATTGATATTGCATATCGTTCCCACTTTTTGTTTTACGCATTTCCGGACGGAAAACCGGCGTCCACTTTTCCTGGAAATGCTCTAGCGCGCTTCACCAATCGAGGCTCCTGCATGAAGCCACCGGCCAAGTGTGAGGATATCGAACACCGGAAGGCCAAAACGCTTCGATATCTCCGCTGAAAAAGGCGGCAAGTTCGTGCACTCAAATACGAGCGCGCCGATGGAAGGATGCTTGTCGACGAGATCCTGAACGGTCTGCAATAACTCGTTGTTGAGGGCCTCGCGGTCATAGGGCTTGCCACCCTCGATCAGCTGGTGAAACGCGCCGCCTTTGGGCAGTCCGGCAACCGGAGGAATATCCTCGACGCCTGCTTCCCGGAAATGACGATCCGTTAGAGAGTTGAAATCATATGTAATCACTCCGACTTTCTTTCCAGCCGGAAGAAAGGCGGTGATCTGCGGAATTTGCAGCAGGCTGGACGTTGCAACAGGAAGCGTGAGGCGGCTTGCGAGCGCGCTTTGCCGCAGCACCAGAAAACCGCATGATGTCGTCAATGCACAGGCGCCTTCTCTCGCCAGCTCTTCACCGGCAGCCACAAAGGCATCGAGCAAAACCGCATCGTCTTCGCCATTGACGATCGTTCTGGCGCTTGCCCCTTTGACCGTTTTGTAGACGACGGGGAACGGCCAGCTTCCCGCGTTGCCCACATCACCCAAAGGACGCTCGAACGTCGTGTCGAGCATCATGATACCAAGCGAGTTGGCCATGATTTCACCGGAAAATACAATGGACGCCGTGCCTGACGCGACGGCAGCTCCTGTTGGAAACAGGGACAGGATTACTCGTGACCCTGTCCACACCCGATCATCAGAACTGAATATCGGCCGGAATATCTTCTTTCTGAACGCCGACCAGATCGAGACTTGAAGTGATCCAGCTCGTCACCACGCCGGAAGAGCGATTGAACTCAAGCCAGTTGTCTACAAATGTCCGGAATCGTGAACTGTCCTCTGCACGAACGCCGGCACAGGTCGGCTGACGCACGATTGGACCGGGGACAACGATTTTCCCCACATTCGCGTTCTTCTTGACAGTTACCAATGACAACATTGCGACCTGGATGACAGCGTCAGCGCGACCGGTCTGAACCGAGAGGGTCGCCTCGTCAGGGGTCTTCAGCGCGACGAGTGTTGCCTTTGGTAGAACGCGACGGGCGAAAAGATCGTGTGTCGATCCGATATCGACCGCAATGCGAACCTCCGGCTTGTTCAGCTCTTCCCAGGTCTTCGGTTCAAGGCCGGGCTTGGCGATGATGGTGAAGGTATTTTGCATGATCGGGCGGGTGAATTCGACCACCAGCGCACGTGACGGTGTGGGGCTAAGGCCAAACATGATGTCGATCTTGTTGGCCTGAAGATCGAGAACCGCGTTGCCCCAGGTGGTTTCGCTGATTTCGACTTCGGCCTCGAGCGATTTTGCCAGATCGCGGGCCATGCTCATACAAAAGCCAGACCACTCACCGGTCGCGATATCCTTATGATAATATGGTTCGGTTCCAACGATCCCGGCGACACGCAGCTTCTTGCTGCTGCGTATGCGTTCAAATGTCGAGCCAGTAGCGGGCGCCTGTGCGACTGCGGGTGTAGCAAGAGCGGTTGCAACCGCGCCCGCACCGGCAAGTCCAAGGAGGCGTCCAAAATCACGTCTGTTCATGTCATCCCCTTTTTTACATTATTTGATAGTACTCGTGCTTCGGCAGGTTCCCAGTAGTCCGCGTTGCAGTCTCGGCAGTTACCTTTTCCCTTCGGGCTAAAGGCACGTTCGCCGGATGAGTCAGTCTTGACGATCTTGCATTTAAAACAAAACATCGAATAATCCGCCTTGCATGAGTTTTAGTCATGATGAAGGGGTTGGGCAATGAGGCGCTATCTTCCATCACTTTCGGCGCTGCATGCCTTCGAAGCTGCGGCCCGATATATGAACTTCACGCGCGCCGCTGATGATCTCGGTCTCACGCAGAGCGGGATCAGTCGGCAGATCCGTAATCTGGAGGAGTTTCTGGGGGCCACGCTTTTTCACCGTTCAGGCCCACGTTTGGTGTTGACGGAAGTGGGGGCCAACTATTATCGCGACCTTGCGTTCACGCTGGACCGGCTTGAGGAAATCTCGATTGATGCGGTTCGGGGCCGTAGTGTGGATTCCTCCCTGATGGTGGGAACGCATCCAACTTTTGCGGCCCGCTGGTTGCCGCAAAGGCTGTACACGTTCATCTCGGCATTTCCCGACATACCTCTTGAATTCACACTCGCGACCCCTGCGACAGACTTTGAAACGACACGTCTAGACATAACGGTATTGAGGGGCGCTGGCACGTGGCTTCATGCCCGCGCGATTGAACTTTTCCCTGAGAAACTTGCTGTCGTTGCAGCACCCAGCCTTATTCCATTGGGCGAGAAACTCGATCCACTCGATTTTGCGAATTTCCCAATGATACAAAATGCCGGTAGGCCGAGCCTCTGGCTAAACTGGCTGCGTTTCGCCGGACTATCCTACAGCGGTCGCATTCAAGGAATACGATTTGGTCATTACGAGGCGATCATCAATGCTGCAGTAAGTGGACTTGGCATAGCAGTCGTACCGACGTTTTACATCGAACACGAAATCGCAAACGGCCAACTACATATGCCATTCGGAGAAGCTACCCATTCCGACGATTCCTATTATGTAGTTTACCCTGAAAGAAAGGCCAACAATCCGAATGTTGTGTTGTTCAGGGATTGGTTGATTAGGCAATCTCGTAAATACAAGGAACAATAGGACGATCTCACTAGACAGACTATGATGGCGTTGATCCCCACGCTGAGAGTCCGTCCGAGAATTCATGATCGTCTCGCGATCCGTCTGACGAGCGTCATTATGGATGCTGCGTAGAGGAAGGCCTCGGCGGACTTGATTGTCGCCTCCGGGTCTTTCCAGAGCCGTCGGTTTCTGCTGATCTAGGCGAAGAATCGCTCGACCACCCATCGCTTCGGGTGAACCGCAAAGCCGACCTGATCCTTGGGTTTGCGGACGATTTCGACATCGATCAACGTCGCACTCTGTGGCCTGTTGCCAGAGTAGCCCATGTCGGCAAACGCCTTCTCCACAAACGGGAAGGACTTTCGCGAGACCTGCAAGACCGGGATCGCACCGTCTCGGTCCTGAATATCGGCAGGCTGGGCATCGAGCACCAGTGCGCGACCATCCATATCCACCATCGCCTGACGCTTGCGACCTTTGACCTTCTTGCCTGCGTCATAGCCACGAGGACCGCCGCTCTCGGTGGTCTTGACGCTCTGGCTATCAAGGACAGCGGCACTCGGCGAGGCTTCTCGACCAACCCGCTCACGATCCTGCGCGACAAGGCAACGATTGATCCTCGTAAACAAGCCACTATCGCGCCATCGGCAAAAATAGCCGAACACTGTGCTTTTCGGCGGCAGGTCTTTCGGAATAAGCCGCCAAGGGATGCCGCCGCGCAGGACGTAGAATATCGCGTTGACGATTTCGCGCATGGGCCAGACAGGCTTGCGCCCGAGTCGCGCAGGTTTAGGCAGGAGCACCGAAAGCACCTCCCACTCCACATCCGTCAGGTCGGTTTCATATCGGGGCTGATAGCGACTATACTGCCGCCGGGTGGTCGGGTTCCACATGGCGCTTCCATTCAGGCTTCAGACACCCGATTGGAATCATCTGCGGCCCGATCACTCAACCCCAATTCTCGGACGGGCTCTCAGGGAACGGCGCTTCAAGCGCCTCCAAACCAACGAAATCTAACATCATATATGATGCATGATGCTTGACGCATGATATTCACTCGGCTATCTTCATATGGCTGCTGGTGGAGGCCACAGCGGAGGACTGCCAGTTAATCTTGGCCCGCCGGATTCCGGCGCCTTCTAACTTTAAAAATGTGGGATTTGTTCATGCAGAACACCGAACTTGGGGCGGAGCAGATTGCCTTGCTCGAGGAAAGGGCGAAGTTCGTCCGTCTGGAGACCATCAGGCTCATCAGTATTGCGAAAGTCGGACACTACTCGTCCGTTTTTTCCTGTGCCGAGATTTTCTCTTCCCTCTACTATGATGTGATGAACATCCGACGCGGCGAGCCGAAGTGGGAGGAGCGTGACCGCTTCCTTATGGGCAAAGGCCATGCGGCAGTTGGGCTGTTTCCGATACTCGCCGATCTCGATTACTTTCCGAAGGAATGGCTCGACGACTACACACGTCTTGGTAGCCCTCTCGGCGATCATCCTGATATGCGCAAGGTACCAGGCGTTGACTTCAGCTCTGGCTCGATCGGCCACGCGCTTTCAAACGGCGTGGGCATGGCGCACGGCGCACGTATCCAGAAGCAGCATTTTGACGTTTTCGTCCTTCTCGGGGACGGAGAAATGCAGGAGGGCCAAGTTTGGGAAGCCGCGCTGAACGCATCAAGCCACGGTCTTTCCAACGTTATCGCGATCGTCGACCGCAATGGTTACCAGTTGGATGGCAAGGTCGATGACGTGATCGCGATTGAACCGCTGGATGAGAAATGGCGCTCGTTTGGTTGGGAAGTTCATGTCGTCGATGGTCACGATATCGTCGAACTGACCCAGAAACTTCGGGAGGTGAAGGCAGACCGAAGCCGGACGAAACCGTGCTGCATCATAGCAAAGACGCTTAAAGGCAAAGGTATCAACTATATGGAAACCGAGCCCGGCTGGCATCTCGGCTGGCTTGCCCCAGACGATGAAGAACGTGCACGCCAGACAATTCTTGACGGAGTCCTATCATGAATGCCCCACAAAACCCGCAGTCCTGGCAGTACCGTGATCTGAACAAAAAGGCGCCATCACTGTCCGTGCTCTCGGACGCGCTGATCGAACTGGTCGAGGCTGGTCATCCCATTGCCGCAGGCACCGCAGATCTCCAGCACTCAAACGGTCTTGTTGCCTTTGCAGAGCGCTACCCCGACCGTTTCGTCCAGTTCGGCATCTCGGAGCAGAATATGGTCTCCGCGGCGGCCGGCATGGCCACAACCGGTCTTATCCCATATGTCGCGACCTTCGCTTCGTTCATTGGCCTGCTTGCGTGCGAGCAAATCCGCATGGACGTTGCGTATTGCGCTCAGCCGGTGAGGCTCATCGGTCACCATACGGGAATTTCAATGGGTTTCTACGGTACGTCGCATCATGCCACGGAGGACATCTCCACCATGCGTGCGATCGCCGACCTGACTGTCATTTCTCCGTCTGATGGCCAACAGTACAAGCAATTGCTATTGGAATCGGCTGTCTACAAGGAGCCCATCTATTTCCGTACCCATCGTGGCCACGATCCGGTGATCTACGACGACACGGAGAAGTTCCGCATCGGCAAAGCGAAGGTCCATGCCATCGGAAAAGATCTCACGATCATTGCCTGCGGCATGCCAGTCCATGGTGCAAAAGCGGCCATGCAGGCGCTCAATGCAAAGGGCCACTCGGTGGGCCTGATCGATATGCATACGATCAAGCCACTGGATGTCGATGCGGTTCTGGAAGCGGCGAAGGCTTCCCGGACTATACTGACTATTGAGGAGCACAACATTCTCGGCGGATTGGGCTCTGCAGTTGCCGAAGTGATGGCTGAAAACCCCGGCAATGCACGGCTTGTGCGGCACGGGATCATGGATGAATACAGCCTGATCGCACCGCCAACGCACCTTTACCGGCACTACAAACTCGATGCAGCTGGTATCGAGGAAGTAGCATCTCGCTTCGTCTGATCACGAAAAATCCAAGATTAATAACGGCAACAAAAAAGGCGGCTTCGAAAGAAACCGCCTCTTTATTTGATTGTTGGGAGCGATTAGCTCCCAATGCCCCCCAGGCAGATATATTTGAGTTCCATATAATCCTCGATGCCGTATTTCGAGCCCTCGCGGCCGGTACCAGACTGTTTTATGCCACCAAATGGTGCAACCTCGGTGGAAACCAGTCCTGTATTGACCCCGACGATACCGTATTCGAGCGCCTCTGCGACCTTGATGACGCGAGACAGATCACGGGCATAGAAATAGGCGGCCAATCCGAAAATCGTGTCATTGGCCATTTCTATGACCTCTTCGACGCTGCTGAAGCGAAAGACCGGGGCGAGCGGCCCGAAGGTTTCTTCACGTGCAACCGCCATGTCCTGACTGACATCACGCAACACCGTCGGCCGGAAGAAGCGTCCACCACGATCATCTTTTTGACCGCCAACGACAAGCGTCGCCCCTTTCGACAGGGCATCCGCGACGTGCTCTTCGACTTTTTCTACCGCAGCGTTGTCGATCAACGGTCCGATTTTCGTTTCTGCAGCCAGCCCATCGCCAATCTTGAGATTGGTCGCGGCATTCGCCAGTTTCTCGACAAAAGCTTCGTAGATGCTGTCATGGGCGTAAATGCGGTTGGTGCAAACGCACGTCTGACCTGCGTTGCGAAACTTTGAGAGGATTGCACCTTCGACGGCTGCATCAATGTCCGCATCTTCGAAGACGATGAAGGGTGCGTTTCCACCAAGCTCGAGACTGAGCTTTTTGATCTGGTCCGAAGCCTGGCGCATCAGGATACGGCCAACTTCCGTTGATCCGGTAAAACTTAGTTTTCTGACTTTCTCATTCTCGCAAAATTCGGCGCCGATCGATGAACTGGAGGCTGACGTTAGGACGGAAAACACGCCAGCGGGAACACCCGCGCGCTCTGCCAAGACCGCCAGCGCAAGTGCTGAAAGCGGTGTCTGTTTCGCGGGTTTCGAAATGAATGTGCAGCCGGCAGCAAGTGCCGGTGCTGCCTTTCTGGCGATCATGGCGATCGGGAAATTCCAGGGGGTTATGGAGCCCACAACACCAACAGGCTGCTTGATGACCATGATGCGTTTGTCTTCGGCATGACCCGGTATCATGTCACCGTAGGTGCGTTTGGCTTCCTCAGCGAACCACTCGACATATGCCGCACCGTACGCAACTTCGCCTCGCGACTCCGAGAGCGGCTTGCCCATTTCCGTGGTGAGGATGATGGCCAGATCCTCCTGATTTTCCATCATCAGCTCGAACCACTTGCGCAAGATCTTGGAGCGCGCGCCTGCCGTAAGCGACGCCCATTTTTTCTGGGCAATGTGGGCGCAGTCAATGGCACGCGCGGCTTCCGGCCGCCCCATGTCTGGTACGAGCGCTATTGTCTGACCGGTAGCGGGATTGTCGACAGCCAATGTAGCGCCACTGTCGGCGGTTGCCCATTGACCGGCAATGAAGGCTTTGTCTGTAAGCAGGCTCGCATCATTGAGCCGGGATCGAAGATCGTTTGTCATTATATACTTTACCTCTTTTATGCGAATTCGAGCATGACCTTGATGGCCTGCTGCCTGTGTCGGGCGAAGGCAAACGCATCATCGGCGCGCTCAAATGGAAATGTTCGCGAAATAATCGGCCGAACGTCGATCGTGCGGTCCGAGATTAGCTGGACAGCCAGATCGAATTCGCTGTCGAACCGGAACGTACCCCGAAACGTGATTTCCTTGGTGATCATTGGGTTGAATGCGGCCTTGATCTCGTCCGGTAGCAAACCCACCTGAACAATCGTTCCACTTGGCTTGGTGACACCAATGGCCAACCGCATTGCGGCGGGACTGCCAGAACACTCGAAGACCACGTCGAATTCTCCCCGATCGCGGAATTCTTCCTCCATGACATCTGCCGCGGAGACGTTCCAGACCGCTTCCGCGCCCAACTTTTTTGCAAGTTCGAGAGGCATGTCGGCGATATCGGTAGCCGAAATCGCGTCGGCACCAGCGTGTCTGGCTGCAAGAAACGTGAGCGCTCCAATTGGTCCAAAGCCCGTGACGAGCACACGCTTTCCTCGTAAATCCGGCGCCTGGGAGACGGCGTGAAGGCACACCGCCAGCGGTTCAGCGCAGGCCGCCTCCTCAAGTCCAGCCATTTCACCGACCGGGACTGCTTGGGTCGCCCGCACCACAACATACTCACGAAAACCACCATCAATATGGGGGGTCCGCATGGCGCTGCCCATAAAACGCATGTCCGAGCAATGTCTGGACATACCCTTTACGCAGAAAGAACAAGTACCGCACGCAAGCGATGGATTGACCGCAACCGCCGTGCCGACAGCGATACTGGAGACACCGGCCCCAAGGCGCTCGACGATGCCTGAGAGTTCATGTCCTGGCACCATCGGAGATTTGATGCGCACGGTTCCGAAGCCGCCGTGGTGGTAATAGTGCATGTCGGAGCCGCAGATGCCGCCTCGGTGAATTCGAACAAGCACGTCACCTTCACCCGGCATCGGTATCGGGCGATCTTCAATCACAAGATGATGGGGTGCGTGTAATACAACGGCCTTCATTGTTCCTCCCGTGGACTTTAGCGTTGACGGCAATCAAATATCATGCATGATACGTGATGACAATGCGAGGGAGGCGCAAATGGAGATTAACGGGAGAACGCAAGTTCTCGTTCACATAGCGTATCCGTCGGCGCATTTGCGAACGCCACAGCTCTTCAATGCGCGTTGCGCCGAGCGCAGGCTGGATGCAGTACTGGTGCCCTGGCAAGTCCATCCCGACAATCTTGCGGGTGTGATGAACGCGCTGCGTGTCAGCGAGAGCGTACCTGGCGCAATCATAACGATTCCCCATAAGGAAACGGTGGCTTCCTTGTGCGACAGGCTCGAAGGTCCAGCAGCATTGCTCGGCGTTGCAAATGTCGTCCGCAAGGAGGATGACGGGGCGCTGGTTGGGCGAATTCTCGACGGTGAAGGCTTTACCGGCGGGCTGAGCCGCACCGGGCGCAATCCCGCAGGAAAACGGGCGCTCCTGATAGGAGCGGGAGGCGTTTCAGTCGCTATAGCCGATGCGCTCCTGGCGTCAGGCGTTCGCGAGCTTGTTATCAGCAACCGCACTGAGTCTCGTGCCAATGCGCTGGTGGAGCGTTTAAGACCGCTCTATCCCGCCACTCCCATCAGTGTCGGGAAAGCAGATGGCACGGGCTTTGACCTGGTAGTCAACGGAACGGCACTTGGCATGCACGATAACGATCCGCTGCCTATAGATCCAGCTACGCTGGAGCGCGGCACAATAGTCGCCGAGGTCATCATGTCCCCCCCGGTCACGAGACTTTTGCAGGAAGCGCGGTTGCGTGGTGCGACGGTCCACGAAGGCGTTCATATGCTGACCGGCCAGATAGATCCTTTCATTGATTTCGTCGTCCGAAACGCTTCGGGCGAGCATTTTTCGAATATCCGGAGAGAAAAAGCATGATCATTTTTGGAGCTCCCCGCCGCTACATTCAGGGGCCAGGCGTTCTTGCAGCCATTGGTAAAGAACTTGCGGGTTTCGGCACTTCCGCGATCCTGCTTGCCGACGACAACGTCAACAAGATCGTGGGCAAAGCCATTACCGACAGCGCGAAGCAGGCCGGGGTCACAGTCACAGATCTTCGCTTTGGCGGTGAAATCACCTATGCCGAAATCGAACGGCTCGTGGGTGAGGCATCAGGCGCGAAATTCGATGTGGTGATCGCAGCCGGTGGCGGAAAGACCATCGATACAGGAAAACTGGTGTCGAAAGCTCTGAAATCAGCGTTCGTTTCGGTGCCAACAGTTGCCTCGAACGATTCTCCCACCAGTCATATCGCCGTAGTCTACGACCGCGATCACAAGCTGGTCGGCGTCGAGCAAAACCGTTCTAATCCTGATCTGGTGCTGGTCGATACAGCAATCATCGCAAAGGCACCCCTCAAACTCCTGTCCGCTGGCGTGGGTGATGCACTCGTCAAACGCTTCGAAGTCGAGCAATGTGTTGGCGCGAAGGGTAACAACGTCTTCGGTGCTGGCTCGCCGCGTTCTGCCTTGGCGCTTGCTCACGCCTGTTACGATACGGTGCGAGACCATACTGTGGCCGCCTACCAGTCTCTTGAGCGTGGTGAACCGGATGAACATCTGGAAGCACTGGTCGAGGCCTGCGTTCTAATGAGCGGCCTGGCATTCGAAAGCGGTGGGCTTTCGGTCTCGCATGGCATGACAAGAGGTCTGTCAGCCGTACCAGGCGTCGCCAACGCGCTTCACGGTCACCAGGTTGCCTATGGGTTGCTCGTCCAGTTGGTGCTCGAAGGGCGTGACGACGAGTTCATGGCGGACATGTATAATTTTTACCGGGAGGCCAGATTGCCACTGAAGCTGGCTGATCTCGGCCTTGAAGATCGATCGAATTCGGTTGTCGACACCATCGCGACCGTGTCGGCCGAGGCCGCGCACATGAAGAAGTTTGCTAAGCCGATTTCTGCTGAAGACATTGCCCGCGCCATCACAGTCATCGAGGCCGCCTGATCATGATCACAGGAACCGAACTTTTCTCGTTGAAGGGCAGGGTTGCGCTCGTAACTGGTGCCGGGCGTGGCATCGGGTATTCGTTCGCCAAAGGACTTGCTTCGGCCGGCGCGCATGTCGTTATCAACGACATCAATGCCGATAACGCCAATGCCGCAGTTGAAAGTCTGAGGGCCTCAGGTTGGAGCGCCGAAGCAGTTACCTTTGATGTGACTGACACACCAGCTGTTGCCGCAGCTGTCGATGGTATCGTCGAGCGCAATGGCAGGCTCGACATCTTGATGAACAATGCCGGTATTCTTATTCGAAAGCCGGTAGAGAGCCATGACATGGATGACTGGAACAAGGTCATCGCCATTAATCTCAGCTCGCTCTATGGCGTTGCTCGGGAGTGTATCCGCCATATGCGCAAGCAGAAATATGGCCGTATCATCAACACCGCTTCCGTCATGGCCATAAGCTCGCGGCCGGGCGTCATCTCATATGTTGCTGCCAAACATGGGGTGGTCGGCATTACACGTGGATTGTCGGCAGAGCTCGGGTCCTGTGGTATCACCGTCAATGCGATCGGTCCTGGCTATATCCTGACGGATATGAACAAGAAGGTATTGGGAACCGGGACATTCGAGCAGCAGGTCATCGACCGCACGCCGCTGGGACGTTGGGCCGAACCAGACGAACTCATGGGGTCGGCAATTTTCCTCGCGTCCGAAGCGGCCAGTTTCGTGACCGGTCATGTGCTGATGGTTGATGGTGGCATGACAGCCAACGCATTCCTGACAGAAAGTGCAGATCTGGGCGCAGCAAGCTGA
Protein sequences of DBSCAN-SWA_46 >NC_022536|484764:500735|489618_490305_-|WP_048903053.1|DBSCAN-SWA MANSLGIMMLDTTFERPLGDVGNAGSWPFPVVYKTVKGASARTIVNGEDDAVLLDAFVAAGEELAREGACALTTSCGFLVLRQSALASRLTLPVATSSLLQIPQITAFLPAGKKVGVITYDFNSLTDRHFREAGVEDIPPVAGLPKGGAFHQLIEGGKPYDREALNNELLQTVQDLVDKHPSIGALVFECTNLPPFSAEISKRFGLPVFDILTLGRWLHAGASIGEAR >NC_022536|484764:500735|498852_499923_+|WP_022557604.1|DBSCAN-SWA MIIFGAPRRYIQGPGVLAAIGKELAGFGTSAILLADDNVNKIVGKAITDSAKQAGVTVTDLRFGGEITYAEIERLVGEASGAKFDVVIAAGGGKTIDTGKLVSKALKSAFVSVPTVASNDSPTSHIAVVYDRDHKLVGVEQNRSNPDLVLVDTAIIAKAPLKLLSAGVGDALVKRFEVEQCVGAKGNNVFGAGSPRSALALAHACYDTVRDHTVAAYQSLERGEPDEHLEALVEACVLMSGLAFESGGLSVSHGMTRGLSAVPGVANALHGHQVAYGLLVQLVLEGRDDEFMADMYNFYREARLPLKLADLGLEDRSNSVVDTIATVSAEAAHMKKFAKPISAEDIARAITVIEAA >NC_022536|484764:500735|494354_495329_+|WP_022557600.1|DBSCAN-SWA MNAPQNPQSWQYRDLNKKAPSLSVLSDALIELVEAGHPIAAGTADLQHSNGLVAFAERYPDRFVQFGISEQNMVSAAAGMATTGLIPYVATFASFIGLLACEQIRMDVAYCAQPVRLIGHHTGISMGFYGTSHHATEDISTMRAIADLTVISPSDGQQYKQLLLESAVYKEPIYFRTHRGHDPVIYDDTEKFRIGKAKVHAIGKDLTIIACGMPVHGAKAAMQALNAKGHSVGLIDMHTIKPLDVDAVLEAAKASRTILTIEEHNILGGLGSAVAEVMAENPGNARLVRHGIMDEYSLIAPPTHLYRHYKLDAAGIEEVASRFV >NC_022536|484764:500735|493503_494358_+|WP_022557599.1|DBSCAN-SWA MQNTELGAEQIALLEERAKFVRLETIRLISIAKVGHYSSVFSCAEIFSSLYYDVMNIRRGEPKWEERDRFLMGKGHAAVGLFPILADLDYFPKEWLDDYTRLGSPLGDHPDMRKVPGVDFSSGSIGHALSNGVGMAHGARIQKQHFDVFVLLGDGEMQEGQVWEAALNASSHGLSNVIAIVDRNGYQLDGKVDDVIAIEPLDEKWRSFGWEVHVVDGHDIVELTQKLREVKADRSRTKPCCIIAKTLKGKGINYMETEPGWHLGWLAPDDEERARQTILDGVLS >NC_022536|484764:500735|491455_492355_+|WP_022557597.1|DBSCAN-SWA MRRYLPSLSALHAFEAAARYMNFTRAADDLGLTQSGISRQIRNLEEFLGATLFHRSGPRLVLTEVGANYYRDLAFTLDRLEEISIDAVRGRSVDSSLMVGTHPTFAARWLPQRLYTFISAFPDIPLEFTLATPATDFETTRLDITVLRGAGTWLHARAIELFPEKLAVVAAPSLIPLGEKLDPLDFANFPMIQNAGRPSLWLNWLRFAGLSYSGRIQGIRFGHYEAIINAAVSGLGIAVVPTFYIEHEIANGQLHMPFGEATHSDDSYYVVYPERKANNPNVVLFRDWLIRQSRKYKEQ >NC_022536|484764:500735|488890_489550_-|WP_022557594.1|DBSCAN-SWA MQYQWDFNFLIQYQSLIWTGVLYTIGFTIGTAVSGLIVGCLVALARLSNVKILGALITGFIEIFRCTPVLVQLVWCYYALPILTGIELSPGAAAFITLTLYGASFFGEIIRGGIISVDLGQWDAGRALGMRRHQLMARVVLPQAFRRMIPPLMNQTVLQLKNTSLLSVLAVPDLLYQGQLITSASYRPLETYTMIAVIYFILLFPLTRFAHRLEARAGR >NC_022536|484764:500735|498022_498856_+|WP_022557603.1|DBSCAN-SWA MEINGRTQVLVHIAYPSAHLRTPQLFNARCAERRLDAVLVPWQVHPDNLAGVMNALRVSESVPGAIITIPHKETVASLCDRLEGPAALLGVANVVRKEDDGALVGRILDGEGFTGGLSRTGRNPAGKRALLIGAGGVSVAIADALLASGVRELVISNRTESRANALVERLRPLYPATPISVGKADGTGFDLVVNGTALGMHDNDPLPIDPATLERGTIVAEVIMSPPVTRLLQEARLRGATVHEGVHMLTGQIDPFIDFVVRNASGEHFSNIRREKA >NC_022536|484764:500735|488094_488856_-|WP_173402668.1|DBSCAN-SWA MIELSGVKKSFGDFEVLKDITLNVKKGNVVALIGPSGSGKSTLLRSVNLLTVPEQGFIRVGDQAIDFSSTRKLPGDKALAAFRARTGMVFQSFNLFPHMNVEQNVMEGPITVLRKSRAEAREIARGLLEKVGLADKSTAPPDSLSGGQKQRVAIARALAMKPEVMLFDEATSALDPELVGEVLSVIRSLAAEGMTMLLVTHEMAFARDVADRVVFMRDGCVVEEGEARQVIDNPLQPATRSFLSHFHNGLGPN >NC_022536|484764:500735|490405_491251_-|WP_022557596.1|DBSCAN-SWA MNRRDFGRLLGLAGAGAVATALATPAVAQAPATGSTFERIRSSKKLRVAGIVGTEPYYHKDIATGEWSGFCMSMARDLAKSLEAEVEISETTWGNAVLDLQANKIDIMFGLSPTPSRALVVEFTRPIMQNTFTIIAKPGLEPKTWEELNKPEVRIAVDIGSTHDLFARRVLPKATLVALKTPDEATLSVQTGRADAVIQVAMLSLVTVKKNANVGKIVVPGPIVRQPTCAGVRAEDSSRFRTFVDNWLEFNRSSGVVTSWITSSLDLVGVQKEDIPADIQF >NC_022536|484764:500735|495412_496891_-|WP_022557601.1|DBSCAN-SWA MTNDLRSRLNDASLLTDKAFIAGQWATADSGATLAVDNPATGQTIALVPDMGRPEAARAIDCAHIAQKKWASLTAGARSKILRKWFELMMENQEDLAIILTTEMGKPLSESRGEVAYGAAYVEWFAEEAKRTYGDMIPGHAEDKRIMVIKQPVGVVGSITPWNFPIAMIARKAAPALAAGCTFISKPAKQTPLSALALAVLAERAGVPAGVFSVLTSASSSSIGAEFCENEKVRKLSFTGSTEVGRILMRQASDQIKKLSLELGGNAPFIVFEDADIDAAVEGAILSKFRNAGQTCVCTNRIYAHDSIYEAFVEKLANAATNLKIGDGLAAETKIGPLIDNAAVEKVEEHVADALSKGATLVVGGQKDDRGGRFFRPTVLRDVSQDMAVAREETFGPLAPVFRFSSVEEVIEMANDTIFGLAAYFYARDLSRVIKVAEALEYGIVGVNTGLVSTEVAPFGGIKQSGTGREGSKYGIEDYMELKYICLGGIGS >NC_022536|484764:500735|499925_500735_+|WP_022557605.1|DBSCAN-SWA MITGTELFSLKGRVALVTGAGRGIGYSFAKGLASAGAHVVINDINADNANAAVESLRASGWSAEAVTFDVTDTPAVAAAVDGIVERNGRLDILMNNAGILIRKPVESHDMDDWNKVIAINLSSLYGVARECIRHMRKQKYGRIINTASVMAISSRPGVISYVAAKHGVVGITRGLSAELGSCGITVNAIGPGYILTDMNKKVLGTGTFEQQVIDRTPLGRWAEPDELMGSAIFLASEAASFVTGHVLMVDGGMTANAFLTESADLGAAS >NC_022536|484764:500735|486648_487902_-|WP_022557592.1|DBSCAN-SWA MHVIVIGAGIVGASAALELIRDGHHVTILEAGEPGGRQSASYGHGCWISPASVVPMSMPGLWRQIPSYLLNPKGPLVIRWRHVLKLLPWLLRFLIAGSTVARVEKTAHALATLLGDSPQRHLKLAEELGLSHLIRQEGLLYAYPGRQDFEAEKLSWHLRSMTGLRWKELDAGQLHAREPQLSHRYNFGVLAEDGAHCVDPGAYVAGIVSMAVARGATLVKAQATGFVFKDNSLIAVETDNGRLECDRAVIASGVWSKALARKAGDHIPLEAERGYHGVLVSEENGPSHPVMPSDGKMANTPTSKGLRLSGQVELASVETPPNWKRVDILIEHALATYDHLTPRAEMKLDRWMGHRPSTPDGLPVIGASSRSNSVFYAFGHGHVGFASGPITGRIVADMVSGRTPTSSIAAFSPRRFT >NC_022536|484764:500735|496908_497943_-|WP_022557602.1|DBSCAN-SWA MKAVVLHAPHHLVIEDRPIPMPGEGDVLVRIHRGGICGSDMHYYHHGGFGTVRIKSPMVPGHELSGIVERLGAGVSSIAVGTAVAVNPSLACGTCSFCVKGMSRHCSDMRFMGSAMRTPHIDGGFREYVVVRATQAVPVGEMAGLEEAACAEPLAVCLHAVSQAPDLRGKRVLVTGFGPIGALTFLAARHAGADAISATDIADMPLELAKKLGAEAVWNVSAADVMEEEFRDRGEFDVVFECSGSPAAMRLAIGVTKPSGTIVQVGLLPDEIKAAFNPMITKEITFRGTFRFDSEFDLAVQLISDRTIDVRPIISRTFPFERADDAFAFARHRQQAIKVMLEFA >NC_022536|484764:500735|484764_486519_-|WP_022557591.1|DBSCAN-SWA MRGADILVDMLIGYGVEVIFGVPGDTNVPFYEALQEREDKIRHVMARDERSAGYMADAYGRFTSKPGVFECPSGAGAMYSLPPVAESNSSSVPVILLTIDIPLPGEGRGVLTELDCARLFDPITKMSVQVKSAEKLPEIIRRAFRVACSGKPGAVHLQIPEDMLIAEVDPTRVSLHIETECRQFPAYPTLPPQDKLDAVMSMLTSSKKPLIVSGGGVNRSCAGPEVTELAERLNIPVCTTMTGQGTMPDDHRLAVGVIGDNGFHPHANWSLEHADFVLFVGSRMGSVVTIGWTFPKITLNKRVAQIDIDPEIMANNYENVVSVQGDARLVLQRLIDMAPVDNDATRTQEWVDELNVLRANFWENAEALLSSEAAPLRPERAVRCFNEALEAYGKPAHIYSDAGTPTPHMTRFLKLKDRRTRFAIPRAFGGLGSALPATVGAWFANKERRPIGMFGDGSFGMTVGELETLVRLQVPAILLLFNNGTFGWIKGLHRLKGHNQCFGVDFMPPRGQAIAEAFDVKAWTATNSKELDRALAEAFAYTSGPCLIDIHVESIADRVPPVYSWLTKRGKDPLSIDAEEVRYF |
14 | Yellowstone_lake_phycodnavirus(16.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_47 |
510904 : 512047
Sequences of DBSCAN-SWA_47
Nucleotide sequences of DBSCAN-SWA_47 >NC_022536|510904:512047|DBSCAN-SWA TTTACTTTTGGGTCACAGGCTTGGGAGGTGTGAGCGTTTTGCCATCGGGGCCAAACAGGTAGCAGTCATCAAGGGCGAAGCCGACGCGGATGGATGAGCCGACATCGAAGGGCACAACCTTTCCAAGCTGGATAGTGACATTGTGCGTCCCCGTGTCGGAGCCGACAGCCATCAGTGGTGAGGCATCCGCATGGACAATGGTTTCCCGACCCAGATGCTCGACAAGGCGTACCTTCGCGTCGAACGTCATCAGCGAAGGATCGGCATCCTGCGTTTGCAGACGCTCTGGCCGGACACCCAGCGTTACCGTGTCGCCCGTCTTCACCGAGGCGTCGGCAGTCAGCGGAACGCGGAGGTCTTCAAGTCCAGCAAAGGAAACGACAGCAGTGCCTTGTTCTGTTCTGATGACCTTGCCTTCGAAGAAGTTCATGCGCGGCGCGCCGAGGAAGCCCGCGACGAAATGGTTCTTGGGATTGGCGTAGAGCTCCAGCGGAGAGCCAACCTGCTCGATCTCGCCCTTGTTGAGGACGACGATGCGGCTTGCCATGGTCATTGCTTCAACCTGATCGTGGGTGACGTAGATCATCGTTGAGCCGAGCTCGGCATGCAGCGTCGAAAGCTCGACGCGCATCTGGGTGCGAAGGGCAGCGTCGAGGTTCGACAACGGTTCATCGAACAGGAACACCTCCGGCGAACGGGTGATCGCCCGGCCGATGGCCACACGCTGGCGCTGGCCGCCGGACAATTGCTTTGGCAGCTTGTCCAGTTGTTCATCAATCCGCAGGATGGCGGCCGCGCGTTTGACCGCAGCATCGATCTCGGCTTTCGGGCGCTTAGCCATGCGCAGCCCAAAGCCCATGTTTTCGGCAACGGTCATGTGCGGGTAGAGGGCATAGGACTGGAACACCATGGCGATGCCACGGTCGCCCGGTTCTTCCTTCGTCACATCCCGGCCATTGATCATCAGTTTACCGGAGGAGATCGTTTCCAGACCGGCAATGGTCTTCAGCAGGGTAGATTTGCCACAGCCGGACGGGCCGACCATGACGATGAACTCGCCCTTTTCGATCGTCATATTGATGTTCTTCAAGGCGTGGAAGCTGGTGTAATGCTTCTGGACGTTGAGGATTTCGACGCTGCTCAT
Protein sequences of DBSCAN-SWA_47 >NC_022536|510904:512047|510904_512047_-|WP_048903099.1|DBSCAN-SWA MSSVEILNVQKHYTSFHALKNINMTIEKGEFIVMVGPSGCGKSTLLKTIAGLETISSGKLMINGRDVTKEEPGDRGIAMVFQSYALYPHMTVAENMGFGLRMAKRPKAEIDAAVKRAAAILRIDEQLDKLPKQLSGGQRQRVAIGRAITRSPEVFLFDEPLSNLDAALRTQMRVELSTLHAELGSTMIYVTHDQVEAMTMASRIVVLNKGEIEQVGSPLELYANPKNHFVAGFLGAPRMNFFEGKVIRTEQGTAVVSFAGLEDLRVPLTADASVKTGDTVTLGVRPERLQTQDADPSLMTFDAKVRLVEHLGRETIVHADASPLMAVGSDTGTHNVTIQLGKVVPFDVGSSIRVGFALDDCYLFGPDGKTLTPPKPVTQK |
1 | Planktothrix_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_48 |
525294 : 530349
Sequences of DBSCAN-SWA_48
Nucleotide sequences of DBSCAN-SWA_48 >NC_022536|525294:530349|DBSCAN-SWA TTTGATGAATATACCAATTAGTGCCCCTGCTCGTCTTGATGGGCGTGTCGCCCTCATAACGGGCGCAACCGGCGGGATTGGCAAGGCCACCGCCTACGCGCTTGCACAGGCGGGTGCTACCGTAGTTGCAACCGACATTGCCGAGCACGCAGAATTTGACAATTCTGCTATCGAATATGCGAAATACGATGTGACTTCGGCTTCGCAAACGAAGTCTGTGATCGCCGATATTATGGCGCGTTACGGTCAACTCGACATTCTGATCCTTTGTGCCGGCACAATCACCCATCGCCCTCTGACGGAATCGACCGATGACGAATGGCGCGCAATGCTTGACGTCAATCTTATGGGCGTCGTCAATCCTGTTCGTGAAATCTATCCCATAATGGCCAGCGCTGGTGGAGGTAAGATCGTTGCACTTGGGTCTATCGCGGCAAAGATAGGGGGCGTGGCATCGGGCCCTGCCTATGTTGCCGCCAAATCCGCTGTCCACGGACTGATGAAATGGGTGGCAAAAGCTGGAGCTTCAAAAGGTGTCTATGCGAGCGTCATCGCACCTGGGCCGGTCGAGACACCAATGTGGAACACGGTCACCCAGCGTGCCGCACCTTCTGCCAATGGCAATGTGCCGCTTGGACGATACGGCCAGCCCGAAGATATCGCCCAAGCCATACTCTTCCTTTGTTCGCCAGCATCGAACTGGATCACCGGAACGGTGCTCGATGTGAATGGCGGCATGCTGATGGATTGAACCATGACAGATAATTTCACGATAGGTTTCGTCGGGCTGGGCGTCATGGGTGGCCCTATGTGTCGCAACGTAGCACTCAAGCATCCCGGCCGCGTCATCGCATTCGATATGAACGACGAAGCGTTCTCCATATTGGATGGCACGAATGCCGAACGAGCAGCGTCGGTTGCTGCGGTCAGCGCTCACTCCGATTTTATACTTCTATCGCTGCCGGGCGGGCCGCAGGTCAAGGCCGTGGCCGACGAGGTTGGAAGACATGGGCGATCCGGTGCCACGATCGTCGATCTTTCGACCACGCCCGTCGCGCTTGCCAGACAAATCGCGGATGAGCTCAGCCAGTTGGGGTTGAACTTCGTTGACGCGCCTGTGGCGCGAACGCGCGAGGCAGCCCAGCGCGGCGAGCTCAGCATCATGGTTGGTGCAACAAAAGACGTGTTCGACAAGGTCAAGCCCATCCTCGAATATATGGCAACCGATATCACCCATGGTGGAGAGGTCGGTGCCGGGCAGGTGCTCAAGCTCGTCAACAACATGCTTGTCTTTGAACATGTCGTGACACTTGCAGAGATGATCGTGCTGGGCGAACGTGCGGGCGTGAAAGCTGAAACCCTTCTCGACGCTGTCTCGAAGGGGTCCGGCGACAGCTTCGTGCTGCGCAATCATGGCCGAAAGGCTATGTTGCCGCGGCAATTCCCTGAACGTTCTTTTCCGCCTGAATATGTCTTGAAGGACATCGATTACGTTTTCCAGCTCGCCGCCGAGACGGGCGTGCCGCTCAAGGCGGCGGAGACCGTGCGACGCTATTATGAAATGGCCATCGAGAAAGGGTTTGGTGGCCAATATTTCCCAGGCGTGATCCGGCTTGTTGAAGATGGAAGCATGTCCGAGGTGCCGGCATGAACGGCGCCTACAGCTTTATTCTCGATCTTTTTCCACTGGTGTGGTCGGGCATCGTCACTGGCTGTCTTTATGCACTAGGTGCGTTGGGGCTGGTGATGATCTTCAAAAGCTCACGGGTGGTAAATTTCTCCCACGGCAACATCGCGGGGCTTGCTGCCTTCCTGATCTACGGATTCTCCAGCGGCACTCTCTTGAGCCTTTCCTGGGGGAGTGCCGTGCTGCTCGCCGTCCTGATCGTTGTCGCCATTGCATTCCTTAGTTACGCGATCATTGCACCGATCATGGGGGCGTCCGACCTCACTGCGACGATTACCACGCTTGGGATTGGCCTTATCGCGCAAGGCGTGACGCTTCTCATGTTCGGCGCGGATATTGTGCCGCTCGAGCTGCCGCTTCCACGGTTCAGTGCCGCGATCCTCGGTCTTCGCATCACCGGATACGACCTTACCGTCCTGGCCGTTGCAGTCGTTACGATTGTCACCCTCTTTCTCGTAATTGACTACACCAAGCTGGGTGTTGCTTTTCGCGCGATATCGGCAAATCCCGTCGCTGCGGAGATTTGCGGTCTTAACCTCCGCTCAATTCACCTCTTTGCGTGGATCGTTTCTGCGGTGCTCGGTGTCATCGGCGCGCTTTTGATCGTACCGACAACGTTTCTGAGTTCGACCACGGTGGCAACGTTCATGCTGCAGGCATTTGCAGCTGCGGTCCTGGGGGGCTTCGGGAGCTTGCCTGGAGCGCTTGCTGGCGGGGTATTGATCGGCATCCTCATGAACCTTTTCACATTCTACGTGTCGCCCGAGTTCACCAACACGTTTCTCCTTGTGGTCATTCTGGGTGCCCTGAACCTCTTTCCAAATGGACTTTTTGCCAGAGCAGGAGGCAGCCGTGTCTGACACTACCGCGACACTCCACGCGCAGCGTGGACATTCAACGGCATTTCGGGCGATGGTGACAGAAGTACGGCTCGATGCACTGATCCTTTTCGTTTTGCTGGTTCTGCCTCTGATAATTTCCGGTTCATGGCGATATGCCCTTGGCCTTTGCTTTGCCAACAGCATCGGCGTGCTGGCGGTCAGCGTGCTGGTTCGCTATGGCGGAGAGGTTTCCATCGGTCACAGCTTCTTTGCTGCCATTGGTGCCTATTCCGTTGCGATTTTGGAAGCGCATTTGGGGATGCCGCTTGCCGTTTCTCTTCCCATCGCCATCGTTGCCGGCACGCTCTTTGGTGTCCTCTTCGCCTGGCCATCGCGTCGACTTTCGGGGATATACCTTGCGGTTTCCACCATGGCCCTGGCGCTTGCCTTGCCCGAGATCATCATAAGTGCTGACCGCTGGACGGGTGGTTTCGAAGGGCTGTACGTCTCGAAGCCGTTTCTTCCAGGCCTTCCGGTCGAACCGCAACGCTATTATGTTACGCTGATTGCGCTTACCTGCATCGTCTATGCGTTGGTGCAGCTACGACGGTCCAGACAGGGACGTGCTCTGTTACTTGCAAGGACACATCCCGCTGCGGCAGAAGCCTTCGGGACACGGCCAGCTTGGGCCAGGGTGAGTGTCCTGGGGATCAGTGGCGGTATTGCCGCGATCTCGGGTGCGATGCTCGCCTTTGCGAGCTCTGCCGTCTCGCCGACGGGCTACACATTCTGGTCGTCTATCTTCCTGCTCGTCGGATCCGTGGTCAGCTTCTATGGGCTCACCTTGACCCGGGCTTTGATTGGCGGCGCCTTCCTCACGCTTGTGCCGCAGGTTCTGGCAGGTTCGGGGGCCTGGATCCCTGTCTTTTACGGTGTTGCTCTTATTGGCGTCGTTCTTCTCGGGCATTTCTCACCGAAGATCGCACAACGGGTGGCAAGAAGCCGGGAGGGGACGTGATGGATGCACATAGCAAGTTGCTCAAGGGAGACTCGATATCGCTCTCTTTTGGTGGCATCAATGTTCTGAAGGAGTTGGATGTCGAGTTCAGGTCGGGGGAAATTACCGGTTTGATTGGACCGAATGGCGCCGGAAAGACGAGCCTTTTCAACTGCCTGACCGGTGCCTATCAGCCTCAGGGTGGTTCCATCACATATGACGGCCGGCCGCTTGATGGCTTGTCTCCTGCAGCCAGAGCCTCGCGAGGCGTCGTTCGCAGCTTCCAGACCGTTGCCCTGTGCCCCGACCTGACCGTTACCGAAAATGTGATGATCGGACTTGCGAGGAATTATCACGCCGGGTGGGCGAGCGTCTTTTTACCGCTCGCAGGTGGCCGACGAGAAACCGCCGAGATGAAGCAGGCGGCATTGGAAACCTTGTCGAGCCTCGGGCTTGCCGAAGCCGCTGGCCTCTTTCCTCAGCAACTGCCGCCAGGCATGCAGCGTCTTGTGGAAATTGCCCGTGCCATAGTTGGCAAGCCATCGGTCTTGCTTCTCGACGAACCTGCCGCCGGGCTCAACAATGCGGAGACACGGGATCTCACACGAACTTTGAAGAGCATCGCATCTCCCGATCTGGTCATGGTCGTTGTGGAGCATGACATGGATCTCGTCATGTCGCTTTGCGACAGGATCTACGTGATCAACTTCGGCGAGGTCGTCACATGCGGGGCCCCCGATGCCGTGCGCACCAATGAAAGAGTTGTCTCAGTTTATCTCGGGAGCGACGATGACTAGCCTGCTATCTGTCAATAACCTCTCCGTTTTCTATGGAGCAGGCGCTCAGGCCGTAGAGAACGTCAGTTTTGACGTTCAGCCCGCCAAGGTTACTGCACTTCTTGGGGCGAACGGCGCAGGCAAATCGTCGATCATGAAGGCGATTGCAGGACTGGTCCCCTTCCGCGGCAGCATCGTCTATGAAGGGGAACACATCGACAAGCTGACGAGACGGGAGCGGGTTCGACGCGGGATTGTCTACGTGCCGGAGCGACGCGAAATCGTCGGAGATCTCAGTGTACGTGAAAACCTGATTTTGGGCGGATATCAGATAGCCGCCTCGGAGAGGCGACGGCGGATGGACGCCATCCTTGAGCTCTTCCCCGAAATAGCGGGCAAACTCAGTGGCGGTGCCTGGCGCTTGAGTGGCGGCGAGCAGCAGATGCTGGCAATCGGCAGGGGATTGATGTCTGGGCCGAGACTGCTGTTGCTGGATGAACCCTCCCTAGGTCTGGCGCCACTCCTTGTGAGACGTGTCTTTGAGAGGCTTGGCGCGATCCGTGGTGACGGAGATCTGGCAATCGTTCTCGTCGAACAGAACCTTAGAATGACAATGCGTCTATGTGACGATCTGCACTTTCTCCGCAGCGGGCGTTTGGTCGGCTACCGACGTGCCGAAGAGCTTCAAGACAGTGCGGCGCGCCAGCAGGCCATCGACACATATCTTGGCGCGACATCGGTCTCAGAGGGTGAACATGACAAAGCATATGCCTGA
Protein sequences of DBSCAN-SWA_48 >NC_022536|525294:530349|529587_530349_+|WP_022557635.1|DBSCAN-SWA MTSLLSVNNLSVFYGAGAQAVENVSFDVQPAKVTALLGANGAGKSSIMKAIAGLVPFRGSIVYEGEHIDKLTRRERVRRGIVYVPERREIVGDLSVRENLILGGYQIAASERRRRMDAILELFPEIAGKLSGGAWRLSGGEQQMLAIGRGLMSGPRLLLLDEPSLGLAPLLVRRVFERLGAIRGDGDLAIVLVEQNLRMTMRLCDDLHFLRSGRLVGYRRAEELQDSAARQQAIDTYLGATSVSEGEHDKAYA >NC_022536|525294:530349|526047_526944_+|WP_022557631.1|DBSCAN-SWA MTDNFTIGFVGLGVMGGPMCRNVALKHPGRVIAFDMNDEAFSILDGTNAERAASVAAVSAHSDFILLSLPGGPQVKAVADEVGRHGRSGATIVDLSTTPVALARQIADELSQLGLNFVDAPVARTREAAQRGELSIMVGATKDVFDKVKPILEYMATDITHGGEVGAGQVLKLVNNMLVFEHVVTLAEMIVLGERAGVKAETLLDAVSKGSGDSFVLRNHGRKAMLPRQFPERSFPPEYVLKDIDYVFQLAAETGVPLKAAETVRRYYEMAIEKGFGGQYFPGVIRLVEDGSMSEVPA >NC_022536|525294:530349|528818_529595_+|WP_022557634.1|DBSCAN-SWA MDAHSKLLKGDSISLSFGGINVLKELDVEFRSGEITGLIGPNGAGKTSLFNCLTGAYQPQGGSITYDGRPLDGLSPAARASRGVVRSFQTVALCPDLTVTENVMIGLARNYHAGWASVFLPLAGGRRETAEMKQAALETLSSLGLAEAAGLFPQQLPPGMQRLVEIARAIVGKPSVLLLDEPAAGLNNAETRDLTRTLKSIASPDLVMVVVEHDMDLVMSLCDRIYVINFGEVVTCGAPDAVRTNERVVSVYLGSDDD >NC_022536|525294:530349|527892_528819_+|WP_144115389.1|DBSCAN-SWA MVTEVRLDALILFVLLVLPLIISGSWRYALGLCFANSIGVLAVSVLVRYGGEVSIGHSFFAAIGAYSVAILEAHLGMPLAVSLPIAIVAGTLFGVLFAWPSRRLSGIYLAVSTMALALALPEIIISADRWTGGFEGLYVSKPFLPGLPVEPQRYYVTLIALTCIVYALVQLRRSRQGRALLLARTHPAAAEAFGTRPAWARVSVLGISGGIAAISGAMLAFASSAVSPTGYTFWSSIFLLVGSVVSFYGLTLTRALIGGAFLTLVPQVLAGSGAWIPVFYGVALIGVVLLGHFSPKIAQRVARSREGT >NC_022536|525294:530349|526940_527840_+|WP_022557632.1|DBSCAN-SWA MNGAYSFILDLFPLVWSGIVTGCLYALGALGLVMIFKSSRVVNFSHGNIAGLAAFLIYGFSSGTLLSLSWGSAVLLAVLIVVAIAFLSYAIIAPIMGASDLTATITTLGIGLIAQGVTLLMFGADIVPLELPLPRFSAAILGLRITGYDLTVLAVAVVTIVTLFLVIDYTKLGVAFRAISANPVAAEICGLNLRSIHLFAWIVSAVLGVIGALLIVPTTFLSSTTVATFMLQAFAAAVLGGFGSLPGALAGGVLIGILMNLFTFYVSPEFTNTFLLVVILGALNLFPNGLFARAGGSRV >NC_022536|525294:530349|525294_526044_+|WP_077981974.1|DBSCAN-SWA MMNIPISAPARLDGRVALITGATGGIGKATAYALAQAGATVVATDIAEHAEFDNSAIEYAKYDVTSASQTKSVIADIMARYGQLDILILCAGTITHRPLTESTDDEWRAMLDVNLMGVVNPVREIYPIMASAGGGKIVALGSIAAKIGGVASGPAYVAAKSAVHGLMKWVAKAGASKGVYASVIAPGPVETPMWNTVTQRAAPSANGNVPLGRYGQPEDIAQAILFLCSPASNWITGTVLDVNGGMLMD |
6 | Trichoplusia_ni_ascovirus(33.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_49 |
540965 : 548398
Sequences of DBSCAN-SWA_49
Nucleotide sequences of DBSCAN-SWA_49 >NC_022536|540965:548398|DBSCAN-SWA TATGTCTAACGAGTATCGACACGTTGAATTGCTGACGGGTGATGTTCGCCGCAGGCGGTGGACAACCGAGCAAAAGCTGACGATCATTGAGCAGAGTTTCGAACCAGGCGAGACGGTATCGTCCACCGCTCGCCGTCATGGCGTCGCGCCCAATCTGCTTTATCGGTGGCGCAGGCTCTTGAGCGAGGGAGGTGCTGCAGCTGTGGATTCTGACGAGCCGGTGGTCGGCAATTCGGAAGTAAAGAAGCTGGAAGATCGCGTCCGCGAGCTGGAGCGCATGCTCGGCCGCAAGACGATGGAGGTCGAAATCCTTCGGGAAGCCTTGTCCAAAGCAGACTCAAAAAAACGGATATCGCGGCCGATCTTGTTGCCGAAGGACGGTTCGCGATGAAGGCCGTCGCAGATACGCTGGGCGTATCCCGTTCCAACCTCATCGAGCGGCTGAAAGGCAGATCGAAGCCGCGCGGCTCCTATCACAAGGCCGAGGATGCAGAGCTTTTGCCCATCATTCGCAGGCTGGTAGATCAAAGGCCAACCTATGGCTATCGGCGGATCGCCGCGCTCCTCAATCGCGAAAGGCGAGCCGCCGATAAGCCTGTCGTCAACGCAAAACGGGTTCACCGCATCATGGGCAACCACGCCATGTTGCTGGAGAAGCATACGGCCGTTCGCAAGGGCCGCATCCATGACGGCAAGGTCATGGTCATGCGCTCGAACTTGCGCTGGTGCTCGGATGGGCTGGAGTTCACTTGCTGGAACGGCGAGGTCATCCGTCTCGCCTTCATCATCGACGCTTTCGACCGGGAGATCATCGCCTGGACGGCGGTCGCCAACGCGGGCATCTCCGGCTCTGACGTGCGCGACATGATGCTGGAGGCGGTCGAGAAACGCTTCCGCGCAACCCGAGCCCCGCACGCAATCGAGCATCTCTCGGACAACGGCTCTGCTTACACCGCGAGGGACACGAGGCTGTTTGCGCAAGCACTCAATCTGACACCCTGCTTCACGCCGGTAGCCAGCCCGCAGTCGAACGGCATGTCGGAAGCCTTCGTCAAAACGTTGAAGCGGGACTATATTCGGATATCAGCTCTACCGGACGCCCAAACAGCGCTCCGGCTCATTGACGGATGGATCGAGGACTACAACGAAATCCATCCCCATTCCGCGCTCAAGATGGCTTCCCCTCGGCAGTTCATCAGGGCTAAATCAATCTAGCCGACTTGCCCGGTGAAATGGGGTGCACTCCACTCTGGTCAGTGGTTCGTGGTGGCACCAACCTCATGAACTGACAGGACGCATTATGGCGGATGTATTTTGGCTGGTTGTTGTCGGTGGAGGCCCATGCTGCTGGGTGCCGTGGTCGTTTATGTCCTGATGCGTCAGCGCAGGTTATCCGCCAAGGAGCAACAAGAACAAGCTCGCAAGGTTCATAAGCTTTACGACGGAAAGTAACCGGAAGCGGAGACTGATATGAACTATAGAACCATATGGCCCTTGGTAGGGGCTGCGGCTGTGGCTCTCGCGATGATCGTGGCCGTCTACTTCGTCAGGACGACTCCGGATACAGCACCGTCAAATTCACCAGACAGGGGGCAGACGGAAGACCAAGCACCTGCGGGGCCTGCTGACAGGAATGCGCTGCCGGGCAGCCAGAACACGACGCCCTGACCACAATAGTCATTGAGGGAGAGTTTAACATGAAGCCATTCCGTGCCTGCTGCGCCGCTCTGTGTCTTTCGATGTTAATGCCCGCTTTCTCGAACGCGCAATCACCCGCGGACTTCACTGCCGAGGCGGCTTCCTTCAATCAGTTTGAGGTGGAGGCGAGCAGGCTGGCGCTTAAAAGCGCGCAGGATGACGCGACGAAAATGTTCGCTAATGACATCATGCGCGACCATGAAAAAGCGCTCGTCGATCTCGCCGATGCTGCTAAAAAGGATGGTACCTCGGTCCCGGCTGACCTGAGCGCCGAATATGCCGAGAAGATGAGAGCTTTAGAAACCGCTTCACAAAGTGAGTTTGACCAGGCTTACCTCTCGACGCAGGTTTCGGTCCTTGAAGATGCGAAGACGATATTCGAAGCCTTCATCAAAACCGGAATTGCAGGCTCGCTACGTGCATACGCTGAAAATCAGCGTGGCACCCTGCGGACCTATAGTGTCCGTGTTCAGGGCCTGACAAACCCGTGAGTCGCCCAGGAGGGCCCAGTTGAGCATACTATGAAATGAGAAGCACTTAAAATGGAAGAAGAATGTCCGCTGGCAGCTGATGGAACATCGCGGGGACCAGTCGATATAATCGGAGATGATTATAAATGCGCGGCCAAACCAGCCGCAAAGGCGCTCATGAAAACCGTTGGTGCAAGCACCCGGACTGTGAGAACTGGGACGGTTTTTGGTTGCTAAATTAGGGGCGAGAAACCTGTGTGCCGCTGGGACCTCTACCAGCACAAGGCGCCTCCTACGTTTCCGCCGTTCGTTGGCCGGTGGCTTACCCGATGTCCGTTTCAACGAAGTCATTGCCTTTGTACAAGAGCTCAAGGCGCATTGACTTGGCAGCGGCATACGAGAAGCAGTCCCCGAGGTTTAACTGTGCCTTATGGCCAGCTAGCTTCCCGAACGTTGCCATCGCTTCCAGCGCTCCTTCCCCTATCCTCGCATCGATCATCACGTCGCGGGCTCCGATCAGATCAATGAATTCGGCAACCAGTGAGCGGGCAGTTGCAAACAGTTCTCGACGATTGACGGTTTTGCCCTTTTTTCCATCAATCTCTGTTTTTACCATCGCTGCGACGGCTTCAAACCGGACCATTGGCGAAACGTAGTATTTTCCCCCGGATCGATCGAGGCGCTTAACCAGCTCCTCGTAGCCGGGCTCCTCTTTCAATATGGCCACTATCACAGAAGCATCGACAAACGTCATTCACCATCCTCCCAAAGAGCATCCATAACGTTCTTCATATCGACACCGGCGATGGGCGTGGGAAACTCCGTTCTGGCCTTTGCCTGTAATGTAGAAAGGCGGATTCTGAGGGGGACTGCACTCCTTACTCGTTCTAATTCGTTAGCTAGAGCAATCTTCACTGCATCTGCCTTGGTTTTTGTTTTGAGGGCTCGGTGAAGTTCGGTGGCGAGACCATCAACGGCCTCATCACGGACAAACAATGCCATGACACGTTCCTCTCAGATATCTTAAATAGATTTCTCAGATATCTTGGTTGAATATCCTAATTTGTCAAGGTTCGATGGCTGCCACCCTTCTACGCTCAAACGGGGCGCAGCGAAGATATCGCAGGCTATTTGCTCGCGGGCCTTACGCTGATGGTGGTCATAGCCGGACTCGCTGTATCGGCGTTAATCAGCCGCTTTCCGGATCGGCGGGGACCGCTGGCCGCCGCTCTTTTCGTCATCCGCGGGGGTCTGACCTGCCTGATCACCAGGCCACTGGCTCTTCCCGCAATAATCCTACTTGGACTGGGGATCGGCGCACTGTTTCCCCCACGCCGATGATCCGAAGAGTGCCGGCCACCTTATGGCTTTCGTTCAGGGAGGCGGTCACATCATCGCAAGCATCATGCCGTTTGCGGCAGGCTGGCTGCGCGGACGTTCAGACGATTTGCCAATATCTGGGTCATGATGGCTGTCGGGATCGACGGGCAAGTATTTCAAGAGAGATCAGCTTCGCGAGTTGCAATGACCGACCGAGGTGGATCGAAATCGATATCCGACAATCCTGGCATTGAGAGCGCATCGACAAGATTGTGACTTTCGCCTGTCAGACGCTCAAACTCACGGTATGTCATCAACACATGCGAAGGTCTGCCTCGATCGGTGATGACGACAGGCCCTGCCTCAGCAGCTTTTTTCGCTCTGCCAACATCGTGATTAAGTTCGCGGCTGGTCAAGCTGGTCGTAGCCGTAGTCGCCTCCATCTAGGAACGTATCTACATATTGCGCTTTCATTTTAATCGCGCAATCCCTTTCTGGCGCCGAAGGAGGCCAGCAGCTTGCCGAGTTCGGGACTGACCAGAAGATCCTGCTTGGGAGCGTGACCGCCCGGCACATCCGAACTACAAGGCTTTGATCAAGGCCGAGTGGTCTTTGTCGCCCAGGCCTTTTTCCAGCGCGCCATTCATCAACTGATGAACCATCGCTGTGTTGGGCAGCATAAGCTCGAGAAGACGCGCCGACTCAAGCGCGAGCGATATGTCCTTGTGATGCAAACGTATACGAAAGCCGGGCTCGAATGTTTCGTTGATCATCCTTTCGCCGTGAACCTCCAGCACGCGGGACGCGGCAAAGCCACCCATCAAGGCCCTGCGCACGATCGAGGGGTCCGCTCCGGCTTTCTTCGAAAAGCGAAGGGCTTCCGAAACGGCCTGAATATTCAGGGCGACGACAATCTGATTGGCGACCTTTGCAACCTGACCGTTTCCAACGTCACCGATCAGGGTGATGTTCTTGCCCATCTTTTCAAACAGGGGTCTTGCGCGCTCGAAAATATCCGGTTTTCCGCCGACCATGATGGTCAGCAACGCCGCTTTGGCGCCAACTTCACCGCCCGACACCGGCGCGTCGAGATAATCGCATCCCAGCGCCTCGATACGGGCGGCAAAATCCTTCGTCGCAATCGGTGAAATCGAACTCATGTCGATGACGAGCTTTCCCGCATCGAGGCCGTGCGCGACGCCATCGTCACCGAACAAAACCGCTTCTACATCGGGCGTGTCCGGCAGCATCAAAATCACGATCTCGGCGCTGCGCGCCACTTCTGCACCTGAACCGCATGCCCTCGCACCACTTTTCAGGAGGGAGTTCGACACGGGTTTGACCCTGTGAAGATAAAGCTCATGTCCGGCATCGATCAGATGCTGCGCCATCGGGCGGCCCATGACCCCCAGTCCGATAAATCCTATTTTCATGGCTCAATCCTCCCGTTGATATGCTTTCAACCAGTCAAGACCTGCGCTGGTGGTCGACGCCGGTCGATATTCGCAGCCGACCCATCCGTCGTAACCAAGTTCATCCAGACGCCTGAAGATGAAATCGTGGTTGATCTCGCCCGTCCCCGGCTCATGACGTCCGGGATTATCAGCGATCTGAACGTGGCCGATTTTGTCCTGCAGGCGCTCGAAGGTCGCGACGAGATCGCCCTGCATGATTTGCATGTGGTAGAAATCATATTGGATCAGCAGGTTGGAATGCCCGACACGGTCCATGATCCGCTCGGCCTGATCGGTGTTGGTGAGGAAATATCCGGGTATGTCGCGAGTGTTGATGGGCTCGAAGACAAGGGCAATTCCGGAATCCGCCAGACGTTGCGCAGCGTGACCGAGATTGCCGACAAGCGTGTCTTCCAGAACTTGCGGATCGACACCGGGCGGTTGAATACCGGCCAGACAATTGATCTTTCGGCATCCCAGAATTTTTGCATAACGGATCGCCGTTTCAACGGACGTCTCAAATTCGGAGACCCTGTCCGGCAGGCTGGCGATGCCGCGCTCGCCTGCGGCCCAGTTGCCGGGCGGCAGGTTGAACAGAGCCTGTGTGAGATTGCACCTCCTGAGCTCCGCTGCAATCGTCTCTGCCGCCTCCTCGTAGGGGCTGACATATTCGACTGCCGCAAAACCATCGGCAGCGGCAGCGGCGAACCGTTCCATGAACGGATGTTCGGTATAAAGCATGGACAGATTGGCAGCGAATTTCGGCATCGGCAATGACCTCAAACTTGGTCTGCGCGCAGGCGGTGCAGCATGCGCTTGTTGCGGGGGTCGGGGTTGGGAACAGAAGAGATCAGTCGCTTGGTATAGGGGTGCGAGGGCGTTGTGCAGATTGTTTCCGCATCTCCGATCTCGACAATCTTGCCTTTGTGCATGACGGCAACGCGATCGCAGAAATAGCGCACGACCGAGATATCATGCGAAATGAAGATGAAGCTGAGATTGAGCTGCTTTTGAATATCGAGCAACAGGTCGAGGATCTGGCTTCGGATCGACACGTCGAGCGCCGAGGTCGCCTCGTCGGCGACGATGACCTTCGGGTTGACGGCGAGCGCCCGCGCGATGCCGATCCGTTGGCGTTGGCCACCGGAAAAGGCATGTGGGTATCTTTCCATCGCCAAGGGATCGAGCCCGACCTTCTGCAAAAGTTCGCCAACGCGGGTCTCCACCGCTTTGCCGGACATGCCACCGGCGATGACCAGAGGATCTCCGATGATCTGCTTCACCGTCATACGGGGGTTGAGCGACGCAAATGGGTCCTGAAAGATAAGACGCACATCCTGGTGGTAGCGGCGCAAATCGAATTTGCTCAGCGTGGTGACGTCCATCGGGGCCGCATCCGGGCATCCGCGATACGCGATTTTGCCGGAACTCGGCTCAACGATCCGCAGTATCATTCGTCCTAGTGTGGTTTTGCCTGATCCGCTTTCGCCGACAATGCCGAGATTTTCTCCGGCAAATAGGTCAAGACTAGCATCGTCCACCGCCACCAGCCCGCGCCCACCGCCATACGAAAACAGTCCGGACGGCGCGCCGTAGACTTTGGAAAGATTGCGCACTGACAGGATCGGAGCCACGTTGGCCGTCAGCGACGCCGGCAGCGTGCGCGTTGCTACGCCGCTCTCGAGCTTGACCGTCGCTTCCAGAAGTTGCCGTGTGTAGGGATGCTGACTGGCGTGGAAAATCTCGTCCACCGGCCCGTTTTCGACGATCTTGCCAAAACGCATGACGGCGACCTCGTCCGCTACTTCGGCAACGATGCCCATGTCGTGGGTGATGAGGAGCATCGCCATGCCGCGCTCCACCTGCAGCCGCTTGATGAGATCGAGGATTTCGGCCTGCGTCGTCACATCCAGCGCCGTCGTCGGCTCATCGGCGATCAGAACTTCCGGGTCGCAGGCCAGCGCCATGGCAATCATCGCTCGCTGGCGCATTCCGCCGGAAAACTCGAAGGTGTATCTGCCGGCCATCACTTCCGGCTGCGGTATCTCCACCTGGCGCAGCAGTTCGATACATCGCTCCCGAGCCTGTCTCTTCGACACGCGGCGATGCAAACGCACGGCTTCGATGATCTGTGAGCCGATGGTGTGGACCGGCGACAGCGAACTCATCGGTTCCTGAAAAATCAGGCCGATCCGTCCACCGCGAATGGCCAGAACCTCGCGACTGCTTGCGGCAAATTTCGCGATATCGACTGGCCCCTTGGGTCCATCGAGAACGATCTGGCCGCCTGTCATCTGTCCGGGCTTGTCGATGATCCGCATCAGGGCGCGGGCGGTCACTGATTTGCCTGAACCGCTTTCCCCTACCAGCGCAAGCGTCTTTCCGCGTTCAAGGTTGAAACTGACATTGCGAACGGCATGGAGCACATGGGTTCTGAGATGGAAATCGATGGACAGGTCTTTGACCGCCAGCAAGGTTTCGTGCTTCGTCAT
Protein sequences of DBSCAN-SWA_49 >NC_022536|540965:548398|545888_546674_-|WP_006699134.1|DBSCAN-SWA MPKFAANLSMLYTEHPFMERFAAAAADGFAAVEYVSPYEEAAETIAAELRRCNLTQALFNLPPGNWAAGERGIASLPDRVSEFETSVETAIRYAKILGCRKINCLAGIQPPGVDPQVLEDTLVGNLGHAAQRLADSGIALVFEPINTRDIPGYFLTNTDQAERIMDRVGHSNLLIQYDFYHMQIMQGDLVATFERLQDKIGHVQIADNPGRHEPGTGEINHDFIFRRLDELGYDGWVGCEYRPASTTSAGLDWLKAYQRED >NC_022536|540965:548398|545033_545885_-|WP_022557652.1|DBSCAN-SWA MKIGFIGLGVMGRPMAQHLIDAGHELYLHRVKPVSNSLLKSGARACGSGAEVARSAEIVILMLPDTPDVEAVLFGDDGVAHGLDAGKLVIDMSSISPIATKDFAARIEALGCDYLDAPVSGGEVGAKAALLTIMVGGKPDIFERARPLFEKMGKNITLIGDVGNGQVAKVANQIVVALNIQAVSEALRFSKKAGADPSIVRRALMGGFAASRVLEVHGERMINETFEPGFRIRLHHKDISLALESARLLELMLPNTAMVHQLMNGALEKGLGDKDHSALIKAL >NC_022536|540965:548398|543882_544134_-|WP_022557648.1|DBSCAN-SWA MALFVRDEAVDGLATELHRALKTKTKADAVKIALANELERVRSAVPLRIRLSTLQAKARTEFPTPIAGVDMKNVMDALWEDGE >NC_022536|540965:548398|542661_543153_+|WP_022557646.1|DBSCAN-SWA MKPFRACCAALCLSMLMPAFSNAQSPADFTAEAASFNQFEVEASRLALKSAQDDATKMFANDIMRDHEKALVDLADAAKKDGTSVPADLSAEYAEKMRALETASQSEFDQAYLSTQVSVLEDAKTIFEAFIKTGIAGSLRAYAENQRGTLRTYSVRVQGLTNP >NC_022536|540965:548398|540965_542178_+|WP_111818031.1|transposase|DBSCAN-SWA MSNEYRHVELLTGDVRRRRWTTEQKLTIIEQSFEPGETVSSTARRHGVAPNLLYRWRRLLSEGGAAAVDSDEPVVGNSEVKKLEDRVRELERMLGRKTMEVEILREALSKADFKKTDIAADLVAEGRFAMKAVADTLGVSRSNLIERLKGRSKPRGSYHKAEDAELLPIIRRLVDQRPTYGYRRIAALLNRERRAADKPVVNAKRVHRIMGNHAMLLEKHTAVRKGRIHDGKVMVMRSNLRWCSDGLEFTCWNGEVIRLAFIIDAFDREIIAWTAVANAGISGSDVRDMMLEAVEKRFRATRAPHAIEHLSDNGSAYTARDTRLFAQALNLTPCFTPVASPQSNGMSEAFVKTLKRDYIRISALPDAQTALRLIDGWIEDYNEIHPHSALKMASPRQFIRAKSI >NC_022536|540965:548398|543454_543886_-|WP_022557647.1|DBSCAN-SWA MTFVDASVIVAILKEEPGYEELVKRLDRSGGKYYVSPMVRFEAVAAMVKTEIDGKKGKTVNRRELFATARSLVAEFIDLIGARDVMIDARIGEGALEAMATFGKLAGHKAQLNLGDCFSYAAAKSMRLELLYKGNDFVETDIG >NC_022536|540965:548398|544628_544895_-|WP_022557651.1|DBSCAN-SWA MEATTATTSLTSRELNHDVGRAKKAAEAGPVVITDRGRPSHVLMTYREFERLTGESHNLVDALSMPGLSDIDFDPPRSVIATREADLS >NC_022536|540965:548398|546685_548398_-|WP_022557653.1|DBSCAN-SWA MTKHETLLAVKDLSIDFHLRTHVLHAVRNVSFNLERGKTLALVGESGSGKSVTARALMRIIDKPGQMTGGQIVLDGPKGPVDIAKFAASSREVLAIRGGRIGLIFQEPMSSLSPVHTIGSQIIEAVRLHRRVSKRQARERCIELLRQVEIPQPEVMAGRYTFEFSGGMRQRAMIAMALACDPEVLIADEPTTALDVTTQAEILDLIKRLQVERGMAMLLITHDMGIVAEVADEVAVMRFGKIVENGPVDEIFHASQHPYTRQLLEATVKLESGVATRTLPASLTANVAPILSVRNLSKVYGAPSGLFSYGGGRGLVAVDDASLDLFAGENLGIVGESGSGKTTLGRMILRIVEPSSGKIAYRGCPDAAPMDVTTLSKFDLRRYHQDVRLIFQDPFASLNPRMTVKQIIGDPLVIAGGMSGKAVETRVGELLQKVGLDPLAMERYPHAFSGGQRQRIGIARALAVNPKVIVADEATSALDVSIRSQILDLLLDIQKQLNLSFIFISHDISVVRYFCDRVAVMHKGKIVEIGDAETICTTPSHPYTKRLISSVPNPDPRNKRMLHRLRADQV |
8 | Pseudomonas_phage(33.33%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_50 |
554952 : 555966
Sequences of DBSCAN-SWA_50
Nucleotide sequences of DBSCAN-SWA_50 >NC_022536|554952:555966|DBSCAN-SWA TATGCGCAAAGTGACATTGCAGGACATCGCCAATCACACCGGCCTGTCAAAATTTGCGGTCTCATGTTCACTTTCAGGAAAAGCAGGAGTCAGTGAGGCGACGCGCAAGCGCGTCGAGGACGCTGCCGCGATGATGGGCTACCAGCGCCCGAAGCCCGCCGACGAGCAACGCGAAATCACTCTGATTTTTCACGATCAGGTCGATAGCGTGAGCTACGAGTTGCGGACGATGCTGCAGGACGGCATGCAGAAGGAGGCGCATCGACTAGGTCAAGCTGTACGGTTGCAGTGGACGCATGACGCCGACAGGGTCGAGGCGATGGTGAAAGATAGCGCAGGTATCATTCTTGTTGGGCCGCACGAACAAAAGACGCTGGACATTGTCAAAGCGTCGGGCTTGCCTTCGGTTCGACTGGGGTGGGTGCTTCCACTTGAACAGGCCGATCATGTCGGCGGCACGGACCACGAAGCAGGGATAGCGGTCGGCGATTATTTGTTTGGGCTCGGCCATCGCGACATCGCCTTTGTGCAAGGAAAGGAAGGCTATCGCGGTCGTATGGAACGATATCACGGTCTGCGCGAAAGTTTAGAGCAACATGCCGATGCGCGCCTGCATGTCCTGCGCTTTGACGAAGATGGAGGCTTTATCCCTGCGCTGCAAACTCTTCAAACTGCCGGAATCGCACCAACAGCCCTGTTCTGCGCTCACGATGGGCTAGCGCTCACCGCTGTTTCCGAACTTCTAGCACGCGGCTACCGCATACCAGACGATATGTCCGTTATCGGCTTCGGTGATTTTTCCGCCGCGACCCAAATATCTCCGCCACTGACGACCATCAGAGTACAAGGGGGGGAGATGGGCGCTACAGCGATGCGGCTATTGTTGGAACGCATTGAAACAAGAGGGCAGCCGGTCGTCGCAAAACGCATTCTTATCGCCTCGACCTTTATCGAGCGACGCTCATCGGGGCCTGCCCCTAGACATTCGAAATGGTTGACTCCCTTCGAGCGATGA
Protein sequences of DBSCAN-SWA_50 >NC_022536|554952:555966|554952_555966_+|WP_022557658.1|DBSCAN-SWA MRKVTLQDIANHTGLSKFAVSCSLSGKAGVSEATRKRVEDAAAMMGYQRPKPADEQREITLIFHDQVDSVSYELRTMLQDGMQKEAHRLGQAVRLQWTHDADRVEAMVKDSAGIILVGPHEQKTLDIVKASGLPSVRLGWVLPLEQADHVGGTDHEAGIAVGDYLFGLGHRDIAFVQGKEGYRGRMERYHGLRESLEQHADARLHVLRFDEDGGFIPALQTLQTAGIAPTALFCAHDGLALTAVSELLARGYRIPDDMSVIGFGDFSAATQISPPLTTIRVQGGEMGATAMRLLLERIETRGQPVVAKRILIASTFIERRSSGPAPRHSKWLTPFER |
1 | Enterobacteria_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_51 |
559242 : 561177
Sequences of DBSCAN-SWA_51
Nucleotide sequences of DBSCAN-SWA_51 >NC_022536|559242:561177|DBSCAN-SWA CTCAGAATTCCTCCCAGCTTTCAGCCGCGGCTGCGCGGCCTCCGATAGCTCCTGCTACTTTCGCAAGCATGCGGCGGGCAGGAGAAATGACAGGTTTGTGTTCGACTCTCACCGCCGTCGGCTTGCCGGTTTCAGCTCTTCCTTCCGCGCCGACGCGAAATTCAGATATGATCTCACGCAGGCGCTGCACCTCGCTGGCAAGAGACGCGCTGGCAGCCGTCGATTCTTCCACCATGGCCGCGTTTTGCTGCGTCACCTGATCCATCTGGTTCACGGCCGTGTTGACCTCGGCAAGACCAATCGACTGCTCCTTGGCCGACGTTGATATGGCGTCGAGCTGCGTGTTGATCGAAACAACCTGCTCCTCGATCACCTTGAGCGCTTGGCCCGTTTCCTGAACCAGCTTTACACCCGTGCTGACTTCATCCACGGAGTTTCGGATGAGATCCTTGATCTCCTTCGCTGCTTGAGCCGACCGCTGAGCAAGCTCGCGCACCTCCTGAGCGACCACCGCAAAGCCCTTGCCCGCTTCCCCGGCCCGCGCAGCCTCGACACCGGCATTCAGCGCCAACAGGTTGGTCTGGAAGGCAATCTCATCGATGACACCGATAATGCTGGAGATCTGGTTCGACGACTGTTCGATGCGCTGCATGGCGATTACAGCATTCGAAACTACTTCGCCCGAGCGGCGCGCTGACTTGTTGGCCTCGATTGCCACATGACGTGCTTCTTCGGTGCGTTTGGAGGAGTTTGCAACGTTGGTGGTAATCTGGTCGAGAGCGGCTGCCGTTTCCTCAAGCGACGCGGCCTGCTGCTCGGTCCGTTTTGAGAGGTCTTCGGCACTCTGGCTGATTTCACGTGCTCCACTATCGATTGATCCTGTGGCGTTCGCGACAGATCGCAAGCTCTCTGCCAGTTGACTAACAGCTGCGTTGAAGTCAGACCTCAATGCCTCGAAATCTGTGGCAAAGGGTTCGTTCAGTCGGAAAGTCAGGTTCCCGCTCGACAGGTGCTTCAGACCCTCCGCAAGGCCTGACGTCGCCTGTGCCATCTGCTGGGCGCGCTCACGCTCAAGTTCCGCGACCCGCAATTGCTCCGCCTCGCTAGCGGAGCGCGTCGCGGCCGCTTCCTGTTCAAGCTCCCTTGCCCGCAGGCCGTTTTCCTTGAATACCTGGACGGCTCTCGCCATGCCGCCAACTTCATCGCGGCGATCCGTTCCATCGACGCCGGCGTTCAAATCCCCATTGGCGAGCACGTTCATGGTGACCGACATCCGCTGGATCGGACGTACCAACCAGGAACGGATAGCGAGGAAGCCGACCGCGAGAACGATGATCAAGCCGACGATGACAGCGGCGAGGGTCTTGGTAGCGGTGCTGCTGGCGTGCGCGGACAGCTCCTCACCCGTCTTCGCAGCACTATCAACGATTTCATTCGTCGCACCTGTAAACTTCGGAGTGAGCGCGCTGAAAGCCGGCTGACACTGGCTCAGAAACAATTGTTGTGACGCCGCAATGTCTTGCTCCGTCGTCGCGTTTCGCGCCGCCACAATGGTCGGGCCACACACCTCTTTCATGACCTTGAGGCCTTCGGCCTTGAGAAGAGCAATCTCGGCGTGCTGCGGGACAGCGCCAGCAACGGTGTCCATGTATTTCACAAAACCGGCTTCAGCATCCTTGATACCCTTTTCCGCCCGCTCGTTCAGGTCTGCCGAGCGCGACATCAACAGTTCCCCGATGGACGCTCGTGCTGCCTGGAGGTTGCGATTGGAGCGAGCCAAATACAAGGCGGCGGTTGATTCGCCGTCCATCAGGCCAGCATAGTCTTCGTCAACTCTGGCAATCTGGATCCCGGAATAAAGGGCGACCCCAAGCGAGAACAAGCCGAAGCATGCCATAATAATAAGAAATTTTCCGACGATAGAAATGTGTTTCAT
Protein sequences of DBSCAN-SWA_51 >NC_022536|559242:561177|559242_561177_-|WP_022557666.1|DBSCAN-SWA MKHISIVGKFLIIMACFGLFSLGVALYSGIQIARVDEDYAGLMDGESTAALYLARSNRNLQAARASIGELLMSRSADLNERAEKGIKDAEAGFVKYMDTVAGAVPQHAEIALLKAEGLKVMKEVCGPTIVAARNATTEQDIAASQQLFLSQCQPAFSALTPKFTGATNEIVDSAAKTGEELSAHASSTATKTLAAVIVGLIIVLAVGFLAIRSWLVRPIQRMSVTMNVLANGDLNAGVDGTDRRDEVGGMARAVQVFKENGLRARELEQEAAATRSASEAEQLRVAELERERAQQMAQATSGLAEGLKHLSSGNLTFRLNEPFATDFEALRSDFNAAVSQLAESLRSVANATGSIDSGAREISQSAEDLSKRTEQQAASLEETAAALDQITTNVANSSKRTEEARHVAIEANKSARRSGEVVSNAVIAMQRIEQSSNQISSIIGVIDEIAFQTNLLALNAGVEAARAGEAGKGFAVVAQEVRELAQRSAQAAKEIKDLIRNSVDEVSTGVKLVQETGQALKVIEEQVVSINTQLDAISTSAKEQSIGLAEVNTAVNQMDQVTQQNAAMVEESTAASASLASEVQRLREIISEFRVGAEGRAETGKPTAVRVEHKPVISPARRMLAKVAGAIGGRAAAAESWEEF |
1 | uncultured_Caudovirales_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_52 |
566467 : 577024
Sequences of DBSCAN-SWA_52
Nucleotide sequences of DBSCAN-SWA_52 >NC_022536|566467:577024|DBSCAN-SWA CATGACTGAAGATGCGTCCGCCCAATACAAACGCCACCGCTTTCCAGCCGAGATCATCGCGCATGCGGTATGGCTCTATTATCGGTTCCCGCTGAGCCTGCGCGATGTCGAAGACCTGCTTGCCGAGCGCGGCATTGGCGTCTCGTTTCAGACCGTCGCAGAATGGGCGACCAAGTTTGGCCTGAAATTCGCCCATCAACTCAAGCGGCGGTCGATCGGCAACTTCGCAGACAAATGGCATCTCGATGAGATGGTCGTATCGATCAAGGGCAAGAAATACTGGCTGTGGCGCGCCGTCGATGGCAACGGCTACGTTCTCGATGCTCTGCTGCAAAGCTGCAGAAACAAGGGAGCTGCTCTTCGGCTGATGCGCAAGTTGTTGAAGAGCCAAGGTGTTGCGCCACGCGTGATGGTGACAGACAAGCTGCGTTCTTATGCCGCCGCAAAGCGAGAGATCATGCCGGGGATCAAGCACCGTTCGCACAAAGGGCTCAACAACCGCGCAGAGAATTCTCACCTTCCCGTTCGACGACGAGAGCGACGCATGATGCGGTTCAAGTCGGCACGACAATGCCAACGTTTCGTCTCAACCCACGGTCAAATTGCCAATCTTTTCCAACTTCATCGAAAACACCTGAACGCCGCTGATCATCGCCAACTCCGCGCCCTCGCCAGAGCCATCTGGCGCGAGATCGCGTTGCCGATCCAAGCCTGAAATTCAAGACGAAGAAATGATTGCGTCCTACGTCAGGTTAAGGCGACGCCACTCGCGGCGGACATTCCGAACACCCTGCTGACGGGGAAAACACGCCGGGAAAAGAGGACCTTCCCGATGGGAACGGCAAGACCGTGGTCTGCCTCGGAGGTAGAGGGCCACTTGACGACGCAGCGGCGGCGATGCTCGCTCAAGTCCTCAAGGTTCAGGGAGCCGAAGTCGTGGTAGCCCGCCACTCCGACATCGCAACGCGCCGAAGTCGCGATCTCATGCCAAAGGAAGCCGATGCCATCGTTGTATGCTTGCTAAACGAGGACTCTGTGAAACATGCTGGCCTGCTTGTGCGCCGCTTCAAGCGGATTTATACCGGCTCACGAGTAGGTGCGGTACTTTTGTCCGACGGTCCAGAGCCACGGGCACTGCCGCCGATGGGAGACGCGGACTTCATCGCCTCCACCCTGACGCGCGCGACTCGTGAGGCCCTTCGTGGAGATGCGCCAACGACGCCAGTAGTGCCGAGAAAAATCCACATTCGAAGGCGGTCTACAACAAAAGGTGCTACCGATCAGGTGTAAGGCGCAGAGCTAACCGAGATCCGGTCGGCATGGAATCAACAATCCTTGGCACGGTGTTGGCGAACGCGAACAATGCGGTGGCTACCGGGGGCAGGCCCCGCTTGCGGGAGATGTGGTATCTTTCATGGTTCCCGAATGAGCCGACCATCGTACACCGACGATCCGTCGCCACGAAACCTCATTTCGGCGTCATAGCGTTAGCTGGCTCTGCTACGAGACAGAACTGCACCTCGGCGGTATATACAAGGCGCGGTTGTCGATCCGGCAAGCGGTCACTGCCGTTTCGAGCGCGCAATAGTGAGCCATATCAAGGCCCGTAGCGGCGGTGGCACGGAAGACCTCGCAGGACGTTTTGAAACGGGTGGCCGTGCTCATCTCCATGCTCGCTCATGGGCCTGACGCCCGCCCGGCATTGACGCCGCGGATCGTTGACAAGATGTCGCCTCTGCTGGGATGCGGATTGTCCGAAGGTCGAATAATGCAGGCCCCGGCTTGCATTGCGGATGTCCGGCTGTGCCACTCCTGCTTCTGCTTGTAGCTCCTTTAATCGGTCCTCGCTCATTGCCCGCCTCCCCCGAAGATGCCGTCTCCATTGTGGCTCCCGTCTTGGCCCATGATTATAAAGGTATTTTATCCGGATCTCCAATCATGTGGAACCAGATGCGAAATTCGTTCCCATTCATGAGCGGTCGCCTCGCCTTGATCCAGAGGTTGGCTCAAGCCTTCCCACCTTCATGCGTGGTTGTTAAATCGAACGGCCTCTCACCTCGATGGTGGCTTGGGAGAGTTATTGGCCTCTGAAGACATTCGCCAATGCTGAACTATGTCCTCTTCGATGGCGGAACAGACCGTTTCATACTCCCGCACGCGGGTGCGGTCTTCCTCTTTCCGGAACCTTTCAAGCGCGCTGGCAGCCATTTCGTAGGCTTCGAAGAGTTCATTGAGATCGGTGGTCTGTGCTGCGATGATCTGAAGCTGGCCTCGTAATGCGGGCAACTTCAGCATCAGTCGCCACAGTCCTCTCCGCGCGGAATCTCTCGTTGTTCCAGCCAACCCGTTTTCCCTGCCCATCATCGCAAAGCCTCCGCGCGAGGTGATCGCGTAACGCTAGATATCATCGTGGCGCTTGGAAAGTACGTTACCGTAAACGCGGGAGGCGCTACGGCCGGAGGACGCGTTTGCGCAGCGATTGAGCCGGTCGATCGATGCACCGGGGACGGCCTCGCCTCCTCCCCTCGGAACAAAACCTGGAATTATTGATTTTGCTTGTTGAGATAGTGTGCAGTGCTCCCGGTTCTCAGCGGGCCGCAGTCGAGCGCCTCTCTCTGTCCCGCCCGCGCCTCACCCCACTCCATGGGGGACGGCAGGCTGATTAGTTACCGGGGATGCTCATGGCCAGCGATAAGCTCTCAACCTACAAGCAAAAACGCGATTTCCAGAAGACGCAGGAGCCGAGCGGCAAAGCGAAGCTGAAAGCTTCAAACCGCAGGCGCTTCGTAATCCAGAAACATGACGCCACGCGGCTCCATTATGACCTTCGTCTCGAACTCGATGGCGTCTTCAAGTCCTGGGCTGTCACCAAGGGGCCGTCTCTCGATCCGCAGGACAAGCGCCTGGCCGTGGAGGTGGAAGACCACCCGCTGGACTATGGTGACTTCGAAGGCACGATTCCCAAAGGCCAGTATGGCGGCGGCACGGTCATGCTCTGGGACCGCGGCTACTGGGAGCCGGAAGGCAACAGGACGCCCGAGCAGGCTCTCGCCAAGGGCGACTTCAAGTTTACCCTTGAAGGGGAGCGGCTGCATGGCAGCTTCGTGCTGGTGCGGATGCGCAACGACCGTGACGGGGGCAAACGGACGAACTGGCTGCTGATCAAGCATCGCGACGACTTTTCTGTCGAAGAAAACGGCGCCGCCATCCTTGACGAAAACAACACGTCGGTTGCCTCGGGCAGAACCATGGCCGCTATCGCCGCCGGTAAAGGAAAGAAACCAAAACCGTTCATGGTGCAGAGCGGCGATGTGCAGGCAGACGCCGTCTGGGACAGCAATCACGGACTGGCGGCAGACAAACGTGCGGCGGATACGAAGAAAAAGCGCCCGGCATCGCGGAAAGCAGCGAAGTCCGCGATGCCCGACTTCATACCTCCGCAGCTTTGCGAAACGCTCGAACGCCCACCCTCGGCCGATGGCTGGATCCACGAGATCAAATTCGACGGATACCGTATTCAGGCGCGCATTGAAAACGGCGAAGTCACGCTGAAGACGCGCAAGGGGCTGGACTGGACGGCCAAGTATCCGGCGATTGCGACGTCGGCCGCAAGCTTGCCTGATGCCATCATCGACGGTGAGATCTGTGCGCTCGATGAAAACGGTGCCCCGGACTTCGCCGCACTTCAGGCGGCGCTGTCAGAAGGTAAAACGGATGCGCTCGTCTATTTTGCTTTCGACCTGCTGTTCGAGGGGAGTGAGGATCTAAGGCAACTACCGCTCACGGAACGCAAGAAAAGGCTCGAAGCGCTGCTGAGTAAGGCGGGGGGAGATCCGCGCCTGCGGTTCGTCGAACACTTTGAGACCGGCGGCGATGCAGTATTGAAATCGGCGTGCAAGCTCTCGCTGGAGGGCATTGTCTCGAAGCAGGCGGATGCCCCCTATCAGTCGGGGCGTACCGATACCTGGGCGAAATCCAAATGCCGCGCCGGACATGAGGTTGTCATTGGCGCTTACGCCAAAACCGACGGCAAGTTTCGTTCACTTCTTGTCGGCGTCTTCAGGGACAATCACTTCGTCTATGTCGGGCGGGTCGGGACCGGCTATGGTGCCAAAACGGTGGATACAATCCTGCCGAAGCTTCGGGAACTGGAAACCTCAAAATCGCCTTTTACCGGAATAGGCGCTCCGAAAAAGGAACCCAACATCGTCTGGGTCAAGCCCGAACTGGTGGCCGAGATCCAGTTCGCTGGCTGGACCGCGGATGGACTGGTCCGTCAGGCAGCCTTTAAGGGTTTGCGCGAAGACAAGCCGGCGAAGGAGGTCGAGGCCGAAATGCCGGCATCGCCCGGGAAAACTGAAACGCCAACGCCGGCCCGGCCAAAGCCGTCGCGACCGGCACGCGGCAAGAACACCAAGGCAGAAGTTATGGGCGTGATGATCTCCAGCCCTGACAAGCCACTCTGGCCCGATGCAAATGACGGTGAGCCGGTCACCAAGGAAGATCTCGCGCACTATCATGAGGCCGTTGGTCCATGGCTGATCGACCATATCAAGGGACGGCCATGCTCTATCATCCGGACACCTGACGGCATCGGCGGTGAGCAGTTCTTCCAGAGACACGCGATGCCGGGAACGTCGAACCTGCTCGAGCTTGTCAAGGTGTTCGGCGACAAGAAACCCTATCTGCAGATTGATCGCGTTGAGGGCCTTGCCGCCATCGCCCAGATCGGCGGCGTCGAACTGCATCCCTGGAATTGCGAGCCCAACGAACCGGAGGTGCCCGGTCGTCTTGTCTTCGACCTTGATCCGGGTCCGGACGTACTCTTCTCCACCGTGGTCGAAGCCGCGCGGGAAATGCGCGACCGGCTGGAGGAGCTTGGCCTCGTCAGCTTCTGCAAGACCACCGGCGGCATGGGCCTTCACGTCGTCACCCCGCTTGCGGTACCGAAGGGAAAGAAGCTGAGCTGGCCGGAGGCGAAAGGCTTCGCACACGATGTCTGCATGCAGATGGCCCGTGACAACCCTGACCTTTACCTGATCAAGATGGCGAAGAACCAGCGCAACGGCCGGATCTTCCTCGACTACCTGCGCAATGACCGAATGGCGACTGCTGTTGCGCCGCTGTCACCGCGCGCCCGGCCAGGCGCACCAGTATCGATGCCACTGACCTGGAAACAGGTAAAAGCGGATCTTGATCCGAAACGCTTCACCATTCGCACCGTACCGGCGCTTTTGTCGAAGACAACCGCCTGGGAGGATTACTGCGACGGGCAGCGTTCGCTGGAGCAGGCCATCAAGCGGCTGACGAAGTCGATGAAACAGGCCGCCTGAGAGCGGCCTGGATCAAGACCCGTAGCGCCGGTCAAGGTCGCTCAGCGCAGCCTGAACCTCATCGATTGGACAGGTTCCGGCGTTCCAGGCATATGGCGGAACGCCGGCCTTGAGTCGATCGTGTAGGGGTCCGAAGCCCAGGCGGCCAAAATGGTCCTTTTTTCGTCGACTTGGAGCGTGGTTGCATCCACCACATCGACCGGCCGCGCAATTTTCATACCCGTGGACAGGATTGTCGTGCCCATCAGGGCGATATCGCTCGCGGTAAGCTGCGGATTCTCATGCATGTCGGCTTCCTCAGCGAACGTGCGGATCCTTGTCGTGCGGGTGACGGGGCGCGGAGATGACATCAACGTCCGGGGCCAGCAGATCGTGTTTGGCGGCAAAGGCTGCAAACAGACCCCGGGCGGTGTCCGCTTCGATCTCACCGCGCAATGCCGCGCTGCAGGCCTTCAGCGCCATTGAATGCGCCGTATCGCGCATCGATGCCGGCCATTCCGCCAGATGCCGGTAAGCGTCGATGACGCTGCGAACCTCGGTCGGAAATCCAAGGCCGATAAGGATGGTTACGGGCTCTTTGAACATATCGGGTTTCATCGCGATCTCCTTCCTACCCCCGATGTTGACGGGCGCCGCTGCAGCGGCGCCCGTTTTTTAAAAATCGATCAGGCAGCCTTTTGCGCTTCGATCTGCGCCGGGGCAACCTTATGCTGAAGGGCCGGGCTGGTCTGGATATCGATCTTGCGCGGTTTCAAGGCCTCAGGAATCTCACGAACGAGATTGATCGACAGAAGACCGTTGCGGAGATCGGCGCCATTCACCCTGACATGGTCGGCAAGTTCGAAACGATGTTCGAAGGGCCGGCCGGCTATACCGCGATGAAGGTATTCATCGGCGGAAGCGTCCTGCTTCTTGCCGGTAACGGTGAGAAGGTTGGATTGAAAGGTGATGTCGAGCTCGTCCTCGGCAAACCCTGCGACCGCGACCGAAATCCGGTAGCTGTCATCGCCAGTCTTGACGATGTCATAGGGCGGCCAGTCACTGATCGAGCGGGCGCGCTGGGCATTTTCAAGAAGATTGAAGATCCGGTCGAAGCCGACGGTCGAGCGGAACAGGGGTGCGTAGTCATAAGATGTTGCCATAGCCATATCCTCCTTGAAGCAACATGGGTACAGAAGCGCCGAAACGCGGCGCTTCCAGCAAGTCCAGCCCTGTCTCAGCAGCTGGCGGCAATCGATTTGGTTTTCGGGAATTTGCGATTCAAGGGCGCCGTCAGAAAAAAATGACATCCAGGCGCCGGCCCAGACCAACTCATTTGAGTTCATGGACCGCGGCATTGTCCTGTACCTCGCGCAGGCCTTTATAGGACGCATGACGCAGGTTGCCATCGTCGGTCCAGCCACGGAACTCGATTTCGGCGATGAGCGTCGGCTGCGCGAAGACGTAATTCCTGCCCTTGAGCGGAACGACCGGTGTCTTCGTCTTCAGCCTATCCAGGGTCTTCTTCAAATACGCCGCATCTTTCTCCTTGAACCCGGTTCCAACGGCGCCAACATAGACCCAGTCATGGCCACGTCTCGCCGCCAGCAGCAAACTGCCAATCCCGCCGCGCGCCACCGCCGAGTGCTCGTAGCCGACAATCATGAAGCTCTCGCTTTGAACACACTTGATCTTCAGCCAGTCTCCGGTCCGGCCCGAGCGATAAGGGCTGTCACGATGTTTTGCGATGATGCCTTCCATGCCGAGGGAGCACGCGCTCGCCAATAATTCAGCGCCGTCCGCTTCTATTTCTTCGGACAGCTGGATTGCGCCGGTCGCGTCTTCCAGCAGGTCCCCCAGCAAGTGCCGGCGGACCGACAGCTCGGTACGAGTGAGATCGTGACCATCGAAATACATAAGATCGAAGGCGAAGAGTACGGCTTCTGTCGACGCCCGTTTGCCGCCTCGCCCACCGAGCGAGCGTTGCAGAGCGCCAAAATCGGAGCGGCCCTCGCCGTTCAGCACCACCGCTTCGCCGTCAAGGATGGCTGTTGCAACTCCAAGGTCCCTGGCCGCTGCGGCGATAGCGGGAAACCGGTGCGTCCAGTCGTGGCCGCCCCGGGTAATGATCCGCACACCCTTCGGCTCGATATGGACCGCGAGCCGATAGCCGTCCCACTTCATTTCATAGACCCACTCGGGGCCGGACGGCGGGGCCGTCTTCAGGAGCGCCAGACATGGCTCGACGCGATCTGGCATCGGGTCGAACGGCAGATTGGGCTGAGCCGGATCGCGCCTGCGGCGCGGCTGCGATTTTGCCGTCAGGCTGTCATTCTGCAGGAGCGGCTTGGCGGGACGTTTCATTTGCCCGCCCGGTTCTCGGCCGCAACCGACTTCTTCAGCGCGTCCATGATATTGATGACATTGCTGGCTGCCGACTTTGCCGGTGTGGATGTCATAGGTTTTGCAGACTTCTTGAGCGCTTTCTTCTTGGCGGCAATGATGTCAAGCAACCGATCCTGAACCGGATCGATGACCATCTTGGGATCCCAGTGCTGCGTTTGCTTTTTGATCAGTTGCTGCACGAGACGCATCATCTCGCTATCGGCCGTTTCATCTTCTTCGATCCCGCCAAAGTAAGTATTCTCGTCACGGACTTCATCCCCATACCGCAGCGTCCAGAGAACGATACCCTTGCCGCGTGGCTCAAGCATGACGGCGCGCTCGCGGCGGGTAATCACTAGACGTGAAATCCCCACCATGTTCTGCGCTTCCATCGCATCCCTGATGACCGAGAATGCTTCCTGGCCGACGGCATCGTTGGGTGAAAGATAATAGGGCGTGTCGAGCCATATCCACTCAATGCTGTCACGGGGCGTAAACGTCGAGATGTCGATGGTCTTCGTGCTGTCGAGCGCAACGTTTTCCAGTTCTTCGTCTTCCAGGATGATGTATTCGTTCTCGCCACGCTCATAGCCCTTGACCTCATCCTCCTCCCTCACCTCTTTTCCGGTAACACTATCGACGTAGTGGCTGACGACCCGGTTTTCGGTATCGCGGTTGAGGGTGTGGAAGCGGACCTTCTCACTTTCGGACGTTGCCGGCATCATCTGCACGGGACAGGTGACGAGCGAGAGTTTCAGATAACCTTTCCAATAGGGACGGACAGCCATGATTTCCTCCGTCAGCCGGCGCGGCGCGTCTGTGACGAGACCGCATTTGATTTGGCAGCAGCACCTGGTTTTGAAGCCTTCTGACGGCCGGCACTCTTATTGGCGTTAGAAGCGGCGCGTTTCTGGCTGGATTTGGCGCCCATGCCAGCACTCTCACGCAGCGCCTGCAAAAGGTCGTTTTGCCTGGGGACCGCTGCGGCCTTCTTCTTCGGCAGGGCCCTGCCCTCGATCTTGGCCTTCACCAGCTCGGCAACGGCGGCCTCATAACGGTCGTCGAACGTGCTGGCGTCGAAGGTCCCCTTCTTTGTGCCGATAATGTGCGCGGCAAGCTCAAGCATCTCGCCTTCAATCTTCAGGTCCGGCAACTCCTCGAAGGCCTCCGCCGACGAGCGGACTTCATAGTCAAAGTTGAGCGTCGTGGCGATCAGGCCCTTCCCGTGTGGCCTGATCAGCACCGTACGCAGGCGTCGGAAGAGCACGGTGCGGGCGATCGCAGCAACTTTGGCTCGTTTCATACCATCGCGCAACAGAACGAAAGCTTCCGTCCCCATCCTGTCGGGCGCCAGATAATACGGCTTGTCGAAATAGACGCTGTCGATCTCGCTGCAGGGGATGAAGGCGTCGATCTTCAGCGTCTTGTCGCTGTGAGGCACGGCGGCAGCCACCTCATCCGGCTCGAGAACGATATACTGGCCGTTCTCGATCTCATAACCCTTGACCTGGGCGTCCCGTTCCACAGGATCGCCGGTCTCGCTGTCGATGAATTCGCGTTTCACCCGGTTGCCGGTTCGGCGATTGAGGGTGTTGAAAGCGATCCGTTCAGATGATGATGCCGCAGTATAGAGAGCCACAGGACACGCGACCTCTCCGAATTTGATAAAGCCCTTCCAGTTAGCTCGCGGGGCAACCATGACGCAAAACTCCTGCCACAACCAATGCGATTCAAGCGAATCAGCATGCATTTGTTCCGAGTCAAAACGAATCATTTTTCAAGGACTTAAGGCCACGATCACCACTCGTTATGCGAGTCTGCGCTAGACGCTTGATTCGGATGCATCTTTTTGCGGCGACGAACGTTTCCCGTCACGAATCCTAAAAGAGGAGGTCTGTCATGAGCAAGCGTGAACTGATTGATACCGGGACCGACAAGCGTTATGTGCGCCGCGATAAGGACGGCAAGTTCAAGGAAAGCGTCGACGTCTCGCGATCGCTCTCCGCCGACGCGCGCCACGATGCGAAGCATGATGCGAAACCGGGCCAGGGCGACCGGGGTGATCGCAAGCACTGACGAACCCGGTGAAGCTCGCCACGTACAACGTCAACGGCATCAACGGCCGGATCGAAGTCCTGCTCCGCTGGCTCGATCAGGCAAAACCCGATGTCGTCTGCCTGCAGGAGTTGAAGGCGCCGGACGAGAAGTTTCCGCGCCGCCAAATTGAGCGCGCAGGCTACGGCGCCATCTGCCACGGTCAGAAATCATGGAATGGCGTCGCAATCCTTGTCCGGGGTCAGGAACCTCTCGAGACGCGCCGGGGACTTCCCGGAGACCCTGACGATACCCACAGCCGGTATATCGAGGCAGCCATTGACGGCATGATCATCGGCTGCCTTTACCTGCCGAACGGTAACCCGGCACCCGGCCCGAAATTTGACTACAAGCTGCGCTGGTTTGAGCGGCTGCATTCCTACGCGGCTCAACTGCTCGAGCTGGACGTGCCATGCGCCCTTGTCGGAGACTTCAACGTCATGCCGACCGATCTCGATGTCTATAAGCCTGAGCGCTGGCGAGACGACGCCCTGTTTCGCCCCGAGGTTCGCGCAGCCTATGCCGACCTTATTGCCATGGGCTGGACGGACGCCATCAGACGGCTGCATCCGAATGAGAGAATCTATACCTTCTGGAAGTACTTCCGGAACGCGTTCGCTCGTGACGCGGGTCTGCGCATCGATCATTTTTTGCTGAGCTCGTCACTGATCGAACGACTGCAGGCGTGTGGCGTCGACAAGTTTGCGCGTGGGTGGGAGCACACCAGTGATCACGCCCCCGCCTGGATAAAGCTGGACGACCGATAG
Protein sequences of DBSCAN-SWA_52 >NC_022536|566467:577024|572096_572396_-|WP_022557679.1|DBSCAN-SWA MKPDMFKEPVTILIGLGFPTEVRSVIDAYRHLAEWPASMRDTAHSMALKACSAALRGEIEADTARGLFAAFAAKHDLLAPDVDVISAPRHPHDKDPHVR >NC_022536|566467:577024|572464_572941_-|WP_003585428.1|DBSCAN-SWA MATSYDYAPLFRSTVGFDRIFNLLENAQRARSISDWPPYDIVKTGDDSYRISVAVAGFAEDELDITFQSNLLTVTGKKQDASADEYLHRGIAGRPFEHRFELADHVRVNGADLRNGLLSINLVREIPEALKPRKIDIQTSPALQHKVAPAQIEAQKAA >NC_022536|566467:577024|568518_568827_-|WP_084317470.1|DBSCAN-SWA MGRENGLAGTTRDSARRGLWRLMLKLPALRGQLQIIAAQTTDLNELFEAYEMAASALERFRKEEDRTRVREYETVCSAIEEDIVQHWRMSSEANNSPKPPSR >NC_022536|566467:577024|566467_567181_+|WP_022557673.1|transposase|DBSCAN-SWA MTEDASAQYKRHRFPAEIIAHAVWLYYRFPLSLRDVEDLLAERGIGVSFQTVAEWATKFGLKFAHQLKRRSIGNFADKWHLDEMVVSIKGKKYWLWRAVDGNGYVLDALLQSCRNKGAALRLMRKLLKSQGVAPRVMVTDKLRSYAAAKREIMPGIKHRSHKGLNNRAENSHLPVRRRERRMMRFKSARQCQRFVSTHGQIANLFQLHRKHLNAADHRQLRALARAIWREIALPIQA >NC_022536|566467:577024|574138_574951_-|WP_022557681.1|DBSCAN-SWA MAVRPYWKGYLKLSLVTCPVQMMPATSESEKVRFHTLNRDTENRVVSHYVDSVTGKEVREEDEVKGYERGENEYIILEDEELENVALDSTKTIDISTFTPRDSIEWIWLDTPYYLSPNDAVGQEAFSVIRDAMEAQNMVGISRLVITRRERAVMLEPRGKGIVLWTLRYGDEVRDENTYFGGIEEDETADSEMMRLVQQLIKKQTQHWDPKMVIDPVQDRLLDIIAAKKKALKKSAKPMTSTPAKSAASNVINIMDALKKSVAAENRAGK >NC_022536|566467:577024|576062_576239_+|WP_080823558.1|DBSCAN-SWA MSKRELIDTGTDKRYVRRDKDGKFKESVDVSRSLSADARHDAKHDAKPGQGDRGDRKH >NC_022536|566467:577024|576247_577024_+|WP_022557684.1|DBSCAN-SWA MKLATYNVNGINGRIEVLLRWLDQAKPDVVCLQELKAPDEKFPRRQIERAGYGAICHGQKSWNGVAILVRGQEPLETRRGLPGDPDDTHSRYIEAAIDGMIIGCLYLPNGNPAPGPKFDYKLRWFERLHSYAAQLLELDVPCALVGDFNVMPTDLDVYKPERWRDDALFRPEVRAAYADLIAMGWTDAIRRLHPNERIYTFWKYFRNAFARDAGLRIDHFLLSSSLIERLQACGVDKFARGWEHTSDHAPAWIKLDDR >NC_022536|566467:577024|573110_574142_-|WP_006699351.1|DBSCAN-SWA MKRPAKPLLQNDSLTAKSQPRRRRDPAQPNLPFDPMPDRVEPCLALLKTAPPSGPEWVYEMKWDGYRLAVHIEPKGVRIITRGGHDWTHRFPAIAAAARDLGVATAILDGEAVVLNGEGRSDFGALQRSLGGRGGKRASTEAVLFAFDLMYFDGHDLTRTELSVRRHLLGDLLEDATGAIQLSEEIEADGAELLASACSLGMEGIIAKHRDSPYRSGRTGDWLKIKCVQSESFMIVGYEHSAVARGGIGSLLLAARRGHDWVYVGAVGTGFKEKDAAYLKKTLDRLKTKTPVVPLKGRNYVFAQPTLIAEIEFRGWTDDGNLRHASYKGLREVQDNAAVHELK >NC_022536|566467:577024|574962_575862_-|WP_022557682.1|DBSCAN-SWA MVAPRANWKGFIKFGEVACPVALYTAASSSERIAFNTLNRRTGNRVKREFIDSETGDPVERDAQVKGYEIENGQYIVLEPDEVAAAVPHSDKTLKIDAFIPCSEIDSVYFDKPYYLAPDRMGTEAFVLLRDGMKRAKVAAIARTVLFRRLRTVLIRPHGKGLIATTLNFDYEVRSSAEAFEELPDLKIEGEMLELAAHIIGTKKGTFDASTFDDRYEAAVAELVKAKIEGRALPKKKAAAVPRQNDLLQALRESAGMGAKSSQKRAASNANKSAGRQKASKPGAAAKSNAVSSQTRRAG >NC_022536|566467:577024|569147_571799_+|WP_022557677.1|DBSCAN-SWA MASDKLSTYKQKRDFQKTQEPSGKAKLKASNRRRFVIQKHDATRLHYDLRLELDGVFKSWAVTKGPSLDPQDKRLAVEVEDHPLDYGDFEGTIPKGQYGGGTVMLWDRGYWEPEGNRTPEQALAKGDFKFTLEGERLHGSFVLVRMRNDRDGGKRTNWLLIKHRDDFSVEENGAAILDENNTSVASGRTMAAIAAGKGKKPKPFMVQSGDVQADAVWDSNHGLAADKRAADTKKKRPASRKAAKSAMPDFIPPQLCETLERPPSADGWIHEIKFDGYRIQARIENGEVTLKTRKGLDWTAKYPAIATSAASLPDAIIDGEICALDENGAPDFAALQAALSEGKTDALVYFAFDLLFEGSEDLRQLPLTERKKRLEALLSKAGGDPRLRFVEHFETGGDAVLKSACKLSLEGIVSKQADAPYQSGRTDTWAKSKCRAGHEVVIGAYAKTDGKFRSLLVGVFRDNHFVYVGRVGTGYGAKTVDTILPKLRELETSKSPFTGIGAPKKEPNIVWVKPELVAEIQFAGWTADGLVRQAAFKGLREDKPAKEVEAEMPASPGKTETPTPARPKPSRPARGKNTKAEVMGVMISSPDKPLWPDANDGEPVTKEDLAHYHEAVGPWLIDHIKGRPCSIIRTPDGIGGEQFFQRHAMPGTSNLLELVKVFGDKKPYLQIDRVEGLAAIAQIGGVELHPWNCEPNEPEVPGRLVFDLDPGPDVLFSTVVEAAREMRDRLEELGLVSFCKTTGGMGLHVVTPLAVPKGKKLSWPEAKGFAHDVCMQMARDNPDLYLIKMAKNQRNGRIFLDYLRNDRMATAVAPLSPRARPGAPVSMPLTWKQVKADLDPKRFTIRTVPALLSKTTAWEDYCDGQRSLEQAIKRLTKSMKQAA |
10 | Escherichia_phage(16.67%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_022545_1 | 1596210-1596331 | Orphan |
NA
Consensus repeat of NC_022545_1
|
1 spacers
spacers of NC_022545_1
>1.1|1596242|58|NC_022545|CRISPRCasFinder GTAATCGGTCTTGATTTCGGTGCTGGTTTCGGTGCTCGTCTTGGTCGTCGTGTCGTTA |
CRISPR arrays and Neighbor proteins around NC_022545_1
The CRISPR arrays of NC_022545_1 >merge|NC_022545|1|1596210-1596331|CRISPRCasFinder TCGCTGTACGACTTGCTTTCATAGCTCCAGTCGTAATCGGTCTTGATTTCGGTGCTGGTTTCGGTGCTCGTCTTGGTCGTCGTGTCGTTATCGCTGTACGACTTGCTGTCATAGCTCCAGTC >NC_022545|1|1|1596210-1596331|CRISPRCasFinder TCGCTGTACGACTTGCTTTCATAGCTCCAGTC GTAATCGGTCTTGATTTCGGTGCTGGTTTCGGTGCTCGTCTTGGTCGTCGTGTCGTTA TCGCTGTACGACTTGCTGTCATAGCTCCAGTC
>NC_022545.1|WP_022563653.1|1593428_1595450_-|hypothetical-protein MHIDKISDMIAHFIGLFDTMAEEARLRNNYSEGPARSGPEQLPEEEAARLLDKNYDVALQDYDPGVKYHAGYYDFDYLPPHFARTVEHDMQQFANSIPVDISAANFRFPGRLSFEDERELVIHTGPGSVVAHIAQVNILQDDDYLNMTDGPNVARDTSFVTERTVEFYNEASVFTPFSGFQRTDSYDALQALAKTAHDYIEHARDNDVTSLGTGADQDFVLAGNDINGLYINGVAAAEKPALDDYMPDRGIAKPPEEPERSDVALHEESPAGNSLDVAAGANVVANVATLVNTAVMTSVTAVMGDYHQIDAITQAYIYSDRDEIASVFTRSEDQADTAAYNIANFERSIFPGAENTAAESPETGEMPIFPTAWRVSVLEGDVSFVHWIEQYQFVSDNDTMTITTSGASVSLLTGGNAALNIANFLGIGMQYDLIIVGGNVLDMNLISQIAVLYDNDWARANPDAPGGATIQSGNNLLWNDASIYNVGSNDRFETMPDYMAQTVSAINERDPNMPDALAHDSNFAGYQGLNVLYITGNIYDVSIIKQVSVLGDSDDVTQAAAKVLENNENADVHIDTGSNAVINLAQIIDYDSFGSTTYVAGGVYSDAILIQGGIIENDTSQPAQPGQLANEVIAFLHDDPATIENESDGVINAGHDLSWSNAHSSDVMQAVTA >NC_022545.1|WP_022563652.1|1591152_1593360_-|type-I-secretion-system-permease/ATPase MDHISRRTIADGRSLDLPLDQDAEDSKLTDACISAINEAVENLRQLSGAEQIARATPPSPEVAPPPSPTTPQSGTLRDMPAGNPAPQQEPHPASRPAPKAKIDNAELKSAPDLPFVKTIDNNDGPIRETERRPATGGGGDGKEPTGNGGGGGGGGSSQSGFHKRSEPINFAASLAKGIAAVRRNMIVVMLFTVAINVLLLAIPLYLFQISDRVLTSRSMDTLVMLTVAVLGAVLLQAFMDAIRRFILMRTAVELEVQLGAPILSAAARASLHGSGKDYQILQDLQQLRSFLTSGTLIAFLDAPLMPLFIVVVYLVHPHLGIIIMVCCAVLFTIAWLNQRFTARQFSEASGYLSRANFHLDSMSRNSQIINALAMIPEAVKMWGRETAGSLKSHVAAQDRNIMFSGVSKAARMITQVALLGWGAHLSLSGELTGGMVIAASIVSGRALAPIEGAIEGWHQFNKSAASYGRIKQLLISSPLNFPRLRLPNPEGRLDVERILFVPPPQKKVILNGISFSLKKGESLAIIGNSGSGKTTLGKMLVGSILPTSGNVRLDLMDLRNWDQRQFGESIGYLPQDVQLFPGTIKANICRMRDDIEDRQIYEAAVLADVHELIAGFPQGYETVVAADGAPLSGGQKQRIALARAFFGDPKFVVLDEPNSNLDTQGEQALAKALLHAKRQGITTVTITQRPALLQCVDKIMVLKDGSVAMFGERMDVLKALSGNGRQASQSPQIEG >NC_022545.1|WP_022563651.1|1589785_1591144_-|HlyD-family-type-I-secretion-periplasmic-adaptor-subunit MFRKKNAIAEVKPQGQLEWYSEVPRSIRLHSSIGLAVLLASFGGFGYWAGTAPLSSAIIAQGSFVATGNNKVVQHLEGGIIKEMMVSEGDTVKVGDVLLTLDKTTALANERMLQLRRLRLETIVTRLRAEAQGAKSFKVPDIVMKEAGDPDINAIIQSQNIVFHSKLIKLEEQLNLIEKNIRSLEFRYAGYDGQKQSFDRQLALLTQERDSKDRLAKDGVIRKTDMLALERAIADAMGDIARLSGEMNQSEAEIAKFKQEAIIAVNANKQAALDALETAESDLDSVREQVRGAAEVLERTVIRSPVNGTVVRAYYHTPGGVITTGKPIMEILPAHVPLILEAQVLRTSIDQLHEGQTAAIRLSALNRRTTPVLNGKVFYVSADSIEENAGLQVKDVYIVRVQVPDEEIAKVHNFHPVPGMPADVLIQTSERTFFEYLTKPIADSMSRAFKER >NC_022545.1|WP_004438593.1|1589205_1589700_-|hypothetical-protein MAAMSDVLLLVGRLNYVWTNTESLMIYLIVHLLKVDKEAAIVVFLTLNTTRARMDLIERLAKLPSTDAKDRKTVLSIMARLKREAKTRNKYNHCIYSFDEKGDIASTQLMRLVEDDSQVRYGKVERMDEKEIGQLEKSIADIVEVSKDMWSFIHASPHVSADYL >NC_022545.1|WP_022563650.1|1587940_1589209_+|O-antigen-ligase-family-protein MRIAKSALIDPERNEIYGMTAVALSFFVFAYSSRFGQVSVLAYYGMWLPLVAVDYRRVLGNYPRYLWIFAFSILTVLSSFWSEAVSVTMRASIQYLTHIVCALVAMRVISIRTLTRGALIGIGVVLLYSLLFGIYLFDALDGTYSFVGAFSSKNQLGFYASLGVIFAASSVLVLKQRGIWLPIAGVTGLLSAYSLIASQSATSAITTAAVVALIIGFIPIGMLSPANRKMMFFALGGLGALLAVASLQFGLLDAILGVFGKDSTLTGRTYLWQQGIEAAKQAPILGVGYQGFWVAGFADAERLWNDFFITGRSGFHFHNTYIETVVENGFVGMILLGMVLYGTLLGHLRSVLMLRSDPQGVILFAICALFVVRSFVEIDIIFPYQIGSFLLYFAAGKLCLPVKAARNGETHPAIGMRLQTRP >NC_022545.1|WP_022563649.1|1586676_1587933_+|polysaccharide-biosynthesis/export-family-protein MNGFSAAHRPFVALVLAATVAFSAPLSAMAADGAQYKLGTADKLRIRVAEWQPADGSIRNWDVINGDYSVGPSGTLSLPFIGQLDVAGKTPSEVSDQIGAQLQSKFALRNLPSASVEIAQFRPIFLSGDVQTPGEYPYAPNITVLKAVSLAGGLRRSDAGQRFARDFINARGDAAVYDNQRARLLARQARLIAEVKGDQTITKTPEMEKIAEIDTLLASESALMKSRTERYTLQLKALTDLHALLQSEVESLKKKSETQNRQLQLANEDRDRVNRLNEQGLALSQRRISAEERAAEVESTLLDIDTQSLRAKQDINKATQDEINLRNDWVAQRSKELQDTEAELDKLNLQLTTSRELMSEALAQSAEAIRFDPSGKSATISYVVVREENGKPKELKVDENALLQPGDVIKVSSEILMQ >NC_022545.1|WP_004438583.1|1585811_1586492_+|sugar-transferase MKSATQSAEQTLSSSEDFDVSFPIGGIAKRSFDMTSAALALLIFSPIFLLIAVLVKMSDPGPIFYGHRRVGHNGRYFHCLKFRTMAMNGDEILRQYLAANPEAAEEWRATRKLKNDPRVTAVGAVLRKLSLDELPQLINILRGEMSVVGPRPVVDEELSYYESAAAYYLSTRPGLTGLWQISGRNDVSYKTRVAFDTQYVQNWSMRQDVFIIVKTIPAVCLSRGSY >NC_022545.1|WP_004438580.1|1584646_1585195_+|cupin-domain-containing-protein MSVDIGGRLRHLRLRHNISQRELARRAGVTNSTISLIESNTSNPSVGALKRILDGIPIGLAEFFAFEPETSRKAFYRADELVEIGKGPISFRQVGENVFGRSLQILKECYQPGADTGKVPLVHDGEEGGIILSGRLEVTVDEERRVLGPGDAYYFESRRPHRFRCVGPVACEVISACTPPTF >NC_022545.1|WP_022563648.1|1583165_1584500_-|aspartate-aminotransferase-family-protein MDNPSRSNSTSLDSYWMPFTANRQFKANPRLLASAEGMYYTSNDGRQVLDGTAGLWCVNAGHGRQQIASAVKHQLSTMDYAPSFQMGHPVAFEFAERLAEIAPGPEGGKLDRVFYTGSGSESVDTALKIAIAYQRAIGQGTRSRLIGRERGYHGVGFGGISVGGLVNNRRVFPQIPADHLRHTHDLTKNSFVKGQPEHGAELADDLERLVALHGAETIAACIVEPVAGSTGVLVPPKGYLERLRTICDKHGILLIFDEVITGFGRMGSSFASNYFGVTPDIVTTAKGLTNGAIPMGAVFTSREVHDALMHGPESQIELFHGYTYSGHPVACAAGIATLDIYRDEGLFTRASELQDAWHDAIHSLKGSPNVIDIRTIGFIAGIELQPRDGAIGARAYDVFVDCFERGLLIRVTGDIIALSPPLIAEKSHFDDIVSILGDALKRAE >NC_022545.1|WP_022563647.1|1582029_1582956_+|homocysteine-S-methyltransferase-family-protein MSDIRILDGGMSRELQRLGAELKQPEWSALALINAPDIVRQVHAEFIEAGADVVTTNSYALVPFHIGEYRFDKEGASLIALSGRLAREAAEASKRNVTVAGSLPPIFGSYEPENFDPSRVQDYLKVLVENLQPHVDVWLGETLSLIAEGEAVRQAVAETGKPFWISFTLNDEPAQVNGAEPKLRSGETVRSAAEWAAGSGAAALLFNCSKPEVMRAAVETASAVFKEKGVALDIGVYANAFEGEQGDSAANEGLHGTRADLTDDVYSRFACSWADAGATLIGGCCGIGAAHIHTVADTLRRRGTSRTI >NC_022545.1|WP_022563657.1|1597478_1598249_+|response-regulator-transcription-factor MFMGTSDYAAKQKNAVTHINGTLLIVADPDLFSECLMEALGKKFPTFSVVSVTSSATIDDDYGADVRLVLPYRLAGERLNSVLSAIREKHPEAPIALVVETIDKIEEPLKRLVGMRIIDGVLPLNLRLDVFMAAVDLLMKGGEHFPAALLGKLTPYPTAVGGKSVRNSPVIANRADALAESRSDMATLTTREVQILDLLCKGTQNKIIADRLHLSENTVKVHVRNIYKKMNVRNRTEAASRFFSKDEGATFSGWKN >NC_022545.1|WP_006698574.1|1598291_1598639_-|hypothetical-protein MMNEMPYFRGGKTVRQCRLALVAGALLTFTFATASCSVVEDAVLTTASASPTTIKSRVTPAKAAYGYQKTGNAAVTLVADASDTPAVSRPSYSGSSPYICSPSGFGQKSRCFLRP >NC_022545.1|WP_006698573.1|1599031_1600054_+|SDR-family-oxidoreductase MRNFVPNEHDNGVTIYSDWKPGQRVLVNGGAGFLGSHLCERLLSSGHEVICLDDLSTGRTANVEHLRNNKRFLMVEHDVRKPYDIDVSLIFNFASPASPPDYQRDPVGTLLTNVLGAVNVLEVARRCGATVVQSSTSEVYGDPHVNPQPETYFGNVNTIGPRACYDEGKRSAETLFFDYHRTFGVDIKVGRIFNTYGPRMRPDDGRVVSNFIVQALKGDDITIYGDGSQTRSFCYVDDLIDGFLRFSAKPKDCTGPINLGNPTEIPVRQLADIVIRMTGSRSRIVHLPAAIDDPQQRRPDISRANELLKWQPRVPLEIGLERTIVYFDALLAGRKVAEAV >NC_022545.1|WP_022563658.1|1600055_1601039_+|UDP-glucose-4-epimerase-GalE MPRTILVTGGAGFIGSHICKALAQSGFKPIAYDNLSTGHADSVRWGPFIEGDILDRGLLKATLQEFSPAFVIHCAANAYVGESVEDPRKYYRNNVGGSLSLLDACLDQNIGGLVFSSSCATYGVPPQLPISEETAQTPVNPYGRTKLIFEMALDDYAAAYGLRFVALRYFNAAGADPDGELCERHEPETHLIPRALMAAAARLPQLDVFGADYDTSDGTCIRDYIHVSDLADAHVAAVNYLADGGETLRVNLGSGHGTSVGDIIRAIHRVTGQEVPVHFGARRAGDPPALFADIERARQTLGFAPRRSDIDTIIRTAGPGFGLEVLS >NC_022545.1|WP_022563659.1|1601035_1602967_+|glycosyltransferase MTMRSTPSVASPRAAAGVLRMLPIFTGWNRLAYLLGIGGWLVTLAYFWIWWLDRDRVIDWPYYSVVTLALAWITLLPSYFIFIFLNARVVDRRSPLPGGRVAMVVTKAPSEPFAVVEKTLLAMLEQKGLEFDVWLADEAPDAETLKWCGAHGVFVSTRQGIAEYHRKTWPRRTRCKEGNLAYFYDRYGYERYDFVAQFDADHVPEPDYLSEVIRPFADPRIGYVSAPSICDANANESWAARGRLYAEASLHGALQLGYNNGWAPLCIGSHYAVRTKALREVGGLGPELAEDHSTTLVMNAGGWRGVHAVNAIAHGDGPVTFADLIVQEFQWSRSLMTILLQYSRHYVPKLPARLKFQFLFSQLWYPLFSGFMALMFVLPAVALVRGHVLVNASYPAFLAHFLPVSLIMIVFAFFWRATGAFRPHDAKLFSWEAMVFLFLRWPWSLMGVLAAVRDTIRGDFVDFRITPKGTQAKPPLPLRVVAPYMVLAALSLLAMVLAPRQNGAEGFFIFAAINVAIYAGLSVFLLIRHAVENGLPKLPALRGGASAAACSLLLVAGSAAELSSHAIGGLEALSHGQPYISFTETRFTVAGAGAEGARSVRLKLRIALPGLRGPQEMQAEPIVAPPPAVATGEIMLADNRVGQ >NC_022545.1|WP_034498547.1|1602981_1603485_+|DUF995-domain-containing-protein MKTTISTCGFSAFVLCLAVVAPSVVHAAGGTKTPKPLRTSEVVEMYFDKTWKWDTGGGRFIADGRKFIAATEEKGKKSIGEGRWTVDANGTLCMRATWKAEAGSGKADTCFDHGRIGKVLYQRKQGGQWYVFRHNPPRPGDEFLKLVRKDDVTPQIAAYDKAMTATR >NC_022545.1|WP_022563661.1|1603498_1604461_+|glycoside-hydrolase-family-26-protein MQITRRTLLFASGAAAAFTAGMYPVLKLDAQGVAPMTSTGMKTLADKRPTLHADGIRFGAYDPHGDFTGQSGVATEHLFLPWEDVDLDSLALADAYALERKRNVMITVEPWSWDVNWRLSSDELRRKVLSGDYDKNMQAIAARMSAMKSPLILRWGQEMEDTTGRFSWSGWNPRDYITAYKRVVDMTRKAVPGVKVMWSPKGLDGLRAYYPGDNYADLVGLSVFGLEDYDKIEYGAPKTFTDLLRKGYGLVETFDKPVWVAELGYEGSDSYVRPWMNDVTLKQADFPKLEEVVYFNDKDVHPWPHNLGRPDWRVVRPAKV >NC_022545.1|WP_173402664.1|1604514_1605690_-|AGE-family-epimerase/isomerase MQFPSIAQTLAEEIGTLRKWLDEDALPLWWEAGSARPDGGFYERLGQDAKPVFSDDRRARVQPRQAYCYAAAGQHGWHGPWKDAVLHALSWFEKVYRLENGLYGNLADQTGKLIDPSFDLYNQAFALFAAAQTAAILPERRNEMRSRALEILAILERDYRHPIAGFEEANPPRTPLCSNPHMHLFEAMLAWEEQDRDGPWSALADEIAGLALSRFIDDGNGGLREFFAHDWTPYEGEKGRIMEPGHQFEWAWLLVRWGSLRGNEEAIRKAKRLFEIGEAYGICPRRKVAVMSLYDDFSMRDGLARLWPQTEWLKAAVRLASVTDGEERQRYLACGLSAIGALQPFLDTPVKGLWFDKWPADRPMLDEPAPASTFYHIVCAIYEAEAVLAAG >NC_022545.1|WP_162472002.1|1606033_1607023_+|cellulose-biosynthesis-protein-BcsN MKFSAYTSLLFSIAVLSGCNTPAGVRSFGGTQLLSPSEALIFPPPGGPEIVTVVSRTYSNAVAQQVILRSEAATPGQNYLKAEFFGPQQAGDTDFDSLAFTGFGASSLAREIRAEFPGETIAMSANYLQNSYGAFSYAAGKGRGEDTCLYGWQDIRSPESMRQDFRNLGRIKVRLRLCQSGASVERLLAVMYNYTITGTYASPSWNPYGTPQAVDKNLGRPGNPVYPIKSEEVPMRPGGEVTASVPVRPVRRAAATAPVQPEQQPLPPVAAVNIPSPVSAGASGQPAVTAPRAAGGGAVTGQQQSSGVPQVSIPSPSCLSGSGAGQGCR >NC_022545.1|WP_022563664.1|1607090_1609280_+|UDP-forming-cellulose-synthase-catalytic-subunit MNKAITIIVWLLVSLCVLAIITMPVSLQTHLVATAISLILLATIKGFNGQGVWRLVALGFGTAIVLRYVYWRTTSTLPPVNQLENFIPGFLLYLAEMYSVVMLALSLVIVSMPLPSRKTRPGSPTYRPTVDVFVPSYNEDAELLANTLAAAKNMDYPADRFTVWLLDDGGSVQKRNAANIVEAQAAQRRHEELKKLCEDLDVRYLTRERNVHAKAGNLNNGLAHSTGELVTVFDADHAPARDFLLETVGYFEEDPRLFLVQTPHFFVNPDPIERNLRTFETMPSENEMFYGIIQRGLDKWNGAFFCGSAAVLRREALQDTEGFSGVSITEDCETALALHSRGWNSIYVDKPLIAGLQPATFASFIGQRSRWAQGMMQILIFRQPLFKRGLTFTQRLCYMSSTLFWLFPFPRTIFLFAPLFYLFFDLQIFVASGGEFLAYTAAYMLVNLMMQNYLYGSFRWPWISELYEYVQTVHLLPAVVSVIFNPGKPTFKVTAKDESIAEARLSEISRPFFVIFGLLVVAMIFAVWRIYSEPYKADVTLVVGGWNLLNLIFAGCALGVVSERGDKSASRRITVKRRCEVKLEGSDTWVPASIDNVSVHGLLINLFDNATTVQKGETAIVRVKPHSEGVPETMPLNIVRTVRGEGFISIGCTFSPQRAVDHRLIADLIFANSEQWSEFQRVRRRNPGLIRGTATFLAISLFQTQRGLFYLARALRPGSKAVKPAGAVK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
925740 : 935557
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NC_022535|925740:935557|DBSCAN-SWA CATGCGATTTCCCTTTTCCCTGCCGCGCAAGCGCCCGGCGGATGGCAACGCCATGCCCGAAAACCGGAAAATGGCTGGCGGCTTCATGGCCGTGGCCATGCAGGGCGGACAGGCCTTCTGGTCCGGCCGGTCCTATGCCGCGCTTGCCCGTGAGGGTTTCATGACGAATCCCGTAGCCCACCGGGCTGCCCGCATGGTGGCGGAAGCGGCCGCGTCGGTGAACTGGCTGCTTTACGACGGCGATGACGAGATAGGCGATCATCCGCTGCTGGCGCTTCTGGCGAGGCCGGGCGCGCACATGGGCGGGCCGGATTTCTTCGAGGCGCTTTATGGCCACCTCATGCTGGCCGGAAACGCCTATGTCGAGCCGCTTGTGATCGGCGGGCGGCTGCGTGAACTGCATCTTCTCCGGCCTGACCGGCTCAGCATTGTCGAAGGGCCGGATGGCTGGCCGGCGGCTTATGATTACCGCGCCGAAGGCCGGGCCACGCGGCGCATCGCCGCCGAGCGGGACGGGCTGGGGCTGCTGCATCTCAAACTGTTCCATCCGCTGGATGACCGGGTGGGGTTCGCGCCGCTCGCCTCCGCCGGCGCGGCGCTTGATCTTCACAACGCGGCCAGCCGGTGGAACAAGCGCCTGCTCGATAATTCCGCCCGGCCTTCCGGCGCGCTGGTCTATCAGCCGCGGGAAGGCGGCAATCTTTCCGCCGAACAATATGAACGGCTGAAGCTCGAGCTGGAGGAGGGCTATCAGGGCGCCATGAATGCCGGCCGGCCGCTGCTTCTGGAGGGCGGGCTGGACTGGAAGGCGATGGGACTATCGCCGCGCGACATGGACTTTCTGGAAGCCCGCAACGGCGCGGCCCGCGATATCGCGCTCGCACTCGGAGTACCGCCAATGCTGATCGGCATTCCCGGCGACAATACCTATGCCAATTACCAGGAAGCAAACCGCGCCTTCTATCGCCTCACCGTGCTGCCGCTCGTCAACCGCACGGCGGCGAGGCTTTGCGGCTGGCTTGCGCCGATCTTCGGCGTGGGCCTGCGACTGGAAGCCGATCTCGACCGGATTGCCGGGCTCGCGGGCGAGCGGGACGCGCTCTGGACACGCATCGGCGCAGCCTCGTTCCTGAGCGACGAGGAAAAACGCGAGGCCGTCGGTTATTGAACGGACGGCGTTTGCTTTCGCTTTGACGGTTCAAGCATTGTCTGAAGAAGCGGATTCTCTTTGTGAGCTCATCCGTCCAAGCGATTCCGAAGATTCGTAAAATCACCCGGTGCAGGCGCGCAGGAAGCGGCTGCGTGCGACGCGGCGCGTCCGCCCGATGACAACATCCTAGAACGGAATGATTACCCATGTCTGAATTCGCCAATGAGGCCGGCATCTGGGCCGCCCGCATCACCGGCGCCGTGGCGGGCGCGGGTGTATCGCTCGTTTATCTCCTGCCGAAAAGCAAACGCGAGGCCGCGAGCCGTTTTATCACCGGGGTCTCCTGCGGCATGATCTTCGGCGGGCCTACCGGCCTGTGGATCGTGCAGCAGCTTGATATTGCCGGTGCGCTTTCGGGTCGCGAAATCATGGTGGCGGGTTCCGCCGCCGCCAGCATGGGGGCCTGGTGGGGGTTGGGCGTGCTGGTGCGCATCGCCGACCGTTACAGCGCCCGCCCGCGCGCCTGATACTCCCCTCACATCGCAGGAGTTACCCATGCACGCCTATCGCGGGCCGCGTCCCGCCACGCGTAAATTCGCCAATCTGGAACTGCGCGGCATTGCCGGCGACGGCACTTTTTCCGGTTACGCCAGCGTCTTTGGCGAGATCGATCTCGGCCGGGATGTGATCGAGCGCGGCGCCTTCCGCCGCTCCATCGAGGAGCGCGGTGCGTCCGGTAAAGACCATCAGCAGCGTCAGGGATCGACCGCTTTCCGCCCCGACGCGCTCCACCACCGCGGTGTGTGCCGCCGTTGTCAGCCCGCACGCGCCGGCAAAGCCCATCACCGCCCAGCCCAGCAGATAGCTTGTCACGCCGCCTGCAAAAGAAAGCAGCGCGAAGCCGCAGGCAAAAAGCAGGGTGCCGCTGACCAGCACCGGCGCTGCGCCGTGGCGCACAAGCATCCGCCCGAGCAGCGGCCCGCAAAGCGCACTGATGGTCATCATCACGGTCAGGCCCGCAAAAACGACTTCGTTGGCGATGGCGAGTTCGCTGCCCATTCGTGGACCGAGAATGGCCAGCATGTCGAAACCGGAACCCCAGCTGACGACCTGCCCAAGCGCCAGCACGCCGATAAGGCGTGTACGGAACGTCGGGGAAGCAGCATCGGACATGGTAAGATCGCACAATGCGAGGGCAAGGAACATTTTTGGTCGCAGGCCCGCAACACCCTGCAACCGCAATCGAAACGGAAGGAAACAGGATGGTGGCGCAGAAGGGCAAGGATTTTCTGCTGAAATTCAACAATGCCGGAACATATGTCACTGTTGCCGGGCTCAGAACGCGGCGGCTGGCCTTTAACGCCCAGGCGGTGGACGTCACCGATGGGGAAAGCGTCGGGCGCTGGCGCGAGTTGCTGGCGGGTGCCGCCGTGCAAAGGGCCGCGCTGACGGCGTCAGGCATATTCAAGGATGCGGCGAGCGATGCGCTCGTGCGGGGCGCGTTTTTCGCGGGCACCATTCCCGGTTGGCAGATCGTCATACCCGATTTCGGCACGATTATCGGGCCGTTCCAGATCGTGGCGCTGGAGTATTCCGGCCGTCACGATGGCGAAGTGCAATTTGAGATCGCGCTGGAATCGGCCGGTCTTTTGACCTTTGGAGCGCTGTGATGCCGCAAGGGTTGCGTTATGGGCGGGCGAACCGCCATCGCGGCGAGATCGAAGCGCTGTTCGACGGCGAGAGGCGCATATTGTGCCTGACGCTCGGCGCGCTTGCCGAGCTCGAAACCGCCTTCGAAGTCGATGACCTGACAGCGCTTGCCGAACGTTTTGCGAGCGGACGCATGAAGGCCACCGACATGATCCGGGTGATCGGCGCGGGGTTGCGCGGCGCCGGTAATGTTTTTTCGGATGAGGATGTTGCCGGCGCCACGGTGGAGGGTGGCATTGCCGGCCATGCCGCCATCGTCGCCGAACTTCTGACCGCCACCTTCGGCGGTTTGAAAGGGGAGACGCCGCGGGACCCCTGAATGCCGCAGCAGGCGATGCGACGCCGCGCCCCTTTCCCTGGGATGCGGTTCTCCATGCGGGTTTCTGCCTGCTGCGGCTTTCTTCCGAAGTCTTCTGGCGGCTGACGCCCAGAGAGTTTTTCGCCATGACGGGCGGCGTGCGCTCCGGTTCACACGGCCCCGATCGACAGGAGATGGAGGCGATGATGCGGCGTTTTCCCGACAGGGACGCATCCGATCATCGTTTCGCGAAGAAGCTCCAGTTTGAATCGGAAAGGCAGGTACGATGGCGGGTGAAGGATCGATCGCGGAAAATCGGGAGGAGGCGGAAGCCCTGTCCGAAGTGATGGGCGATCTCGAACGCCGCTCCGAACGGTTCGGAGCGGCGCTGACGTCTGCACTGCAGGCGGCAACGACAGGCGGCAAAGGGCTGGACGATGTGCTGCGCGGCCTTGGGCAGCGGCTGTCGGGCATCGCGCTCTCCGCCGGGCTCAAACCCTTTGAGGCCATGATTGGCAATGCCGTCGGCGGGTTGTTGAACGGCGGCGGCTCGCTCTTCGCCTTCGCCGATGGCGGGATGCCGGGGCGCAGCGTCACGCCTTTCGCCGAGGGTGGTGTCGTCTCCAGCCCGGCGTTTTTTCCCATGGGCGGCGGGCTTGGCCTGATGGGAGAAGCAGGGGCGGAAGCCATATTGCCTCTGAAGCGCGGCCCGGATGGTGCACTCGGTGTTGCGGCGCCTTCCGCCGGCGGCGGGTCCCAGATCATTTTCAACGTCACGGCGACGGATGCGGCAAGCTTCCGCAAAAGCGAAGGCCAGATTGCCGCAATGCTGGCGCGCAGCGTCGGGCGCGGCCAGCGTGGGTTGTAATATGCAGACCTTGTCCCTGTCCCGCATGGACAACCGCGAAGGATGAGGGCCTGCGGCCGGCACACCCCGAACAGCGAAATGAATCTAGGAACAACGACATGGTGGCATTTCATGAAGTGCGGTTTCCGCTGCGGCTGGCGCTCGGTGTGAGCGGCGGCCCGGTGAGACGGACCGACATCGTCAATCTTTCCAATGGGCGGGAAAGCCGCAATCAGCGCTGGCGAAATGCCAGGCGTGCCTATGACGCCGGCTCGGGCATCCGCTCGCTCACCGATCTCTATGAGGTGCTGGCCTTTTTCGAAGCGCGGCGCGGCGAGCTTTACGGCTTTCGTTTTCGCGATCCCGTGGACTTCAAATCCTGCCCGCCGGGAGACACGCCGGCATCGACCGATCAGAGGATCGGCGTGGGTGATGGCGTCACGGTGCGGTTTCAACTGGTCAAGACCTATGCCGATGCGGGTGGCTTCTTCACCCGCCGGGTGGAAAAGCCTGCCGAGGGCTCCGTCCTCGTATCGGTCGAGGGTGTCCCCGTATCCGCTTCGGATGTGTCCTGCGATTACGCGACCGGTATGGTGACATTCGGTTCCGGAAAGGTTCCGCCCGCCGGTGCGGTGGTCCGTGCCGGCTATGAATTCGATGTACCGGTACGGTTCGCGACCGACCGCATCGACATCAACCTCACCGCCTTCGAAGCCGGGCGAATTCCCGCCATTCCGCTGATGGAAATCCAGCCATGAAAACGATCGCTCCCGCCCTTGCCGGGCATCTTGAAAGCGATGCCACGACCACTTGCCATTGCTGGAAAGTGACGTTGAAGGATGGCGCCGAAATCGGCTTCACCGATCATGACGAGGCGCTTTTCGTCGCCGGCACGGCCTTTCTGGCCGCCAGCGGTTTTTCGGCCAGCGATAATGACAGCGAAACCGGGCTCGGCGCCGGCGCGGGTGAAGTGGCAGGCGGTTTTTCCAGCGAGGCGATTTCAGAAGGGGATCTTGCCGCCGGCCGTTTCGACGGAGCGAGAGTGGAGCTTTACCTCGTCAACTGGCAGGCTCCGGAAGAGCATGTGCTTCTCTCCCTGCGCGAGATCGGCGACGTAACCCGGGCGGGCGGGTCCTTTCGCGCCGAGCTGCGCAGTTTCGCCCATCGCCTCAACCAGCCGCAGGGGCGGGTTTATGGACGGCGCTGCGATGCCGCGCTGGGAGACGCACGCTGCGGCGTTGCTCTGGCGCGGTTCACCGGCAATGCCCGGGTGGTGGCGGTGGACGCGGCCGGCAATCTCGTCGCAACCGGGCTCGATGCTTTCCCGGATGGTTTTTTCAACGGGGGAAAGGTGCGTTTTCTCACCGGTCCGCTTGCAGGGCGCGGGTTCGATCTCGATGGCCATGAGCAGCGCAATGGCGGCGTGCTTCTTTCTTTCTGGCTTCCACCGGATGAGGCGCCGTTGCCGGGAGACAGCTTTCCCCATCTTCCAGGGGCGGATTTCGCCTATTCTTATGCCAGCGGCGGTCACACCCACGACGGCGGGGCCTTGTTCCCATGAGCGAGACGGGAGACAAGGTGCTGGCGCTGGCCGAGCCGTGGATCGGCACACCCTATCGGCATCAGGCGTCGCTAAGGGGCGTTGGCTGCGATTGTCTCGGCCTGATCCGAGGCGTCTGGCGCGAACTCTACGGCAGCGAACCGGAACTTCCGCCGCCCTATGCGCCTGACTGGGCCGAGCGCTGTGGTGAGGACCGGCTGATGGCGGCCGTGCAACGGCATTTTCCTAGCGTTTCGGGCCTGGACGAGGCAAAGCCCGGAGACCTGCTGCTGTTCCGCTGGCGGGCCGATGCGGCGGCCAAACATCTCGGCATCCTCGCGGAATCCCGGCATTTCATCCACGCTTATGAGCAGGCGGCGGTGGTGCGTTCCGCCTTGGTGCCATGCTGGAAACGGCGCATCGCCGGCGTTTTCCGATTTCCCGATCCCTGAGATTTTTTGAGGCAGGCATGGCAACCATACTATTTCAGGCAGCAGGTGCGGCGCTCGGCGGTATTTTTGGGCCCGTGGGAGCGATCATCGGGCGTGCGGCCGGCGCGCTGGCTGGCAACGCCGTGGATCGCGCCTTGCTTTCAGACGGTCGTACCGTGTCCGGGGCACGGCTCTCGGCCGCGCGTATTCCCGGTGCGGATGAAGGCGCGGCGATAAACCGGCTTTACGGTACGGCGAGGATCGGCGGCACCCTGATCTGGGCGACACGTTTCGAAGAAAGCCGTGTCGTGCAGAGACGTGGTGGCAAAGGCAACCGCGGCCCAAGGGCGGAGACCTATCGATATTTCGCCAATCTCGCCATCGGTCTCTGCGAGGGGGAGGCGGCCCTCGTGCGGCGCGTCTGGGCCGATGGGCGCGAAATTGACCTGACCGGCATCGAGATGCGCTTTTATCCCGGCAGCGCAACGCAATTGCCCGATCCGCTGATCGAGGCGAAACAGGGGAGCGGCAATGCTCCCGCCTTTCGCGGCCTCGCTTATGTCGTCATCGAGCGCCTGCCGCTTGAAGCCTATGGCAACCGCATTCCGCTCATGCAGTTCGAGGTGGCGCGGCCGGTCGGCAGGCTGGAGAAGTCGATCCGCGCCGTGACCGTCATTCCGGGCTCAACCGAACATGGTTACGCAACGGTGCCGGTTTCCGAGCATACGGGCATGGGTGAAAGCCGCGTGATGAACCGTAACGGCCTGACGGCCGAGACGGACTGGCGCGCTGCCATCGATGAATTGCAGGCGCTTTGCCCCGCTCTTCAAAGCGTGGCGCTGGTGGTGAGCTGGTTCGGCACGGATATGCGGGCGGGCGAATGCCGCATCCTGCCGGGTGTTGAAGTGGGGGACCGGGAGGGCGAGACGGCGCCGTGGTCCGTTGCCGGCCTGACACGCGCCGGCGCGCACCCGGTCAGCCATCATCATGGCGGTCCCGCCTATGGCGGCACGCCGAATGATGAAGGCGTGCTGCAGGCGATAGCCGACCTCAAGGCGCGGGGCCTGAAGGTCTGCCTTTATCCCTTTGTGATGATGGACGTGCCGGCGGGCAACGGGCTGCCGGATCCCTATGGCAAGGCCGAACAGGACGCCTATGCCTGGCGGGGGCGCATCACCTGCTTTCCCGCACCCGGGAGGCCCGGATCGGCCGATCGTACCGCCGGTGCGCGGCTGGAGATCGATCGTTTCTGTTCCCGCGCCGAGGGCTATCGACGCATGGTGCTGCATTACGCCGACCTTTCGGTAAAAGCGGGAGGCGTCGACGCCTTCCTGATCGGTTCCGAACTGCGGGGGCTGACGGGGTTGCGCGACGACAACGGCGCCTTTCCCTTCGTGGAGGCGCTGGTGCGGCTCGCGCGCGATGTACGGGCTGTCGTCGGAAGGGCGACGAAACTAACCTATGCTGCGGACTGGAGCGAATATTTCGGTCACCAGCCGGACGACGGCTCGAGGGACGTGTTCTTCCATCTCGATCCGCTCTGGACGAGTTCAGACATCGACGCCATTGGCGTCGACAATTACATGCCGCTTTCCGACTGGCGCGACGAAGATGCCGCAAACGGCAATCCTGACGGCATGACCGGCCCGGATGACGCTGTGGCCTTCCGCCGCGCCGTTACGGCTGGCGAGGGTTTCGACTGGTTTTATGCAAGCGAAGCCGACCGCACGGCCCGGCGGCGAACCCCCATTGCCGATGGCACAACGGCAAAGCCATGGGTATTCCGCTACAAGGACCTTCGAAACTGGTGGGAAAATCTCCATTATGATCGGCCCGGCGGGAAGGAGAGCACCACGCCAACCGCATGGAGGCCGGGCTCGAAGCCCATCTGGTTTACGGAACTCGGCTGCCCGGCGGTGGACAAGAGCGCGACACGCCCCAATGTCTTTCCCGATCCGAAATCGGCGGAAAACGCCTTTCCCCATTTTTCACGCAAAAGCAGGGCGGACAGCCAGCAGCGGCGTTTTCTGGAGGCGCATCTGGGGCACTGGGAAAATGGCGATGCCGCGATGGTGGATACGAGCCGGATCTATCTGTGGACCTGGGATGCGCGGCCTTTCCCGGCTTTTCCGCAAAACGGTGCAACGTGGAGCGATGGCGGTAACTGGCGCACGGGCCATTGGCTGAACGGTCGGCTGGGAACGGCGACGCTTGCCGACACCATCGCCGCCATTTTGACGGATCACGGCTTTTCCGCTTTCGACGTCTCCGCCATCAGCGGCGATCTCATCGGTTATGTGCAAGGCGAGGTAACTTCGGCCCGCAGCCTGCTGGAGCCGCTGATCGCGGCATTTCAGCTGGATGTGGCGGAGGATGCCGGCACCCTGCGTTTCCGCTCCCGCAACGGGGCTGTCACGCCCGTTCGGGATCTTTCCATTCTCGCCGATCTGGAGGGTGAACCGCTGTGGTCGGAAAATCGCGGCCACGACAGCGATTTCCCCGCGGAAGCCGTTCTGACCTCGTTCAACCCGGCGCTGGACTACGAGCAGGCGAGCGTGCGTTCGCGGCGCGTGGAGAATGCCGGCAGCCGTGTCATGCGGCTCGATCTCAACGCCGCTCTATCTTCGGAAACGGCGCAAGCGGCGATCGAGGCGGCCTTGCGTGACAGCCGCCAGGCACGGCGCAGCCTGCGTTTTGCCCTGCCGCCGGGGCAGATTGCGCTGGAACCGGGTGACTGCATCCGTCTCTCCGCGGACGCCTTTCCGCAGGCGCCCGCAGGTCGGTTCATGGTCAGCCGTATCGAGGATGGCGCGGTGCGGCAGGTGGAAGCGCGGGCCTTTTCCCCGGCCTTGTCAACTTTTGCCGGTGTGGCGGACGAGCGACGTGCCCATGGCGCAAGCGGCAGCGAGGGTTTCGCGCCGGAAGTGCTGTTTCTCGATCTGCCGTATCAGGACGGTGCGGCGGCCGAGCAGTCGGCGCGAGTTGCCGCCTTCGCGAAACCCTGGCGGCACATCGTCGTCTCGGCCTCATCAGGCACGGAAGGATACGGCCAGCGCGTCGTGCTCGACCGTCCGGCCAGGATCGCAACGCTGGCAATGCCGCTGAAGCCTGGACCCTCCGGTCGCTTCGACCGTGCAAATACCATCGTGCTCGATCTGCCTTTCGGCGAAGTTGCATCGGCCGAACCGCTTTCGGTGCTGAACGGCGAGAACCGCATCGCCGTGAGGGCGGCGAATGGTGTCTGGGAAATCGTCGCTTTCGCGAAGGCAGAGGAAACCGCGCGGTCGCGCTGGCGGCTTTCTGTTCTCCTGCGCGGCCTTGCCGGCACGGAAGACGCGTTGGCCACCGGTGCCCCGGCAGGGGCGCCCGTGGTTGTTCTGGATGCCGCGGTACAGCCTCTCGGTCTGCTCGCCGGAGAGCGTGGCCGGCGAATGAACTGGATTGTCGAGGCAGCGGAACAGGCCGGTGCGCCGGTCGGGCCTTTCTCCGTCGAGGCCGGATTTCGGGCGCTGACGCCGCTTGCGCCGGTGCATCTTGCCGGCCTGCGAAAAACGGATGGCGTTCTCATCAGCTGGAAACGCCGGGGGCGGATGGACGCCGATGGCTGGGACGCAAGCGAAATTCCGCTGGATGAAGCTTTCGAACGTTATCGGGTGGAGGTGATGGACGGCGACGAGGTGCGGCGTACCGCCGACGTCTCGGAACCCTTCTGGTTTTATCCAGCAACAGCCGAACTCACAGATTTTCCGGCGCTGCGGGATCACATTTCCGTGCGAGTCCGCCAGCTCGGCCGCGCGGTGCCCCTGGGAGTGGCGGCACAGGCCGTTCTCCCGCTTTCATAA
Protein sequences of DBSCAN-SWA_1 >NC_022535|925740:935557|931753_935557_+|WP_022555866.1|tail|DBSCAN-SWA MATILFQAAGAALGGIFGPVGAIIGRAAGALAGNAVDRALLSDGRTVSGARLSAARIPGADEGAAINRLYGTARIGGTLIWATRFEESRVVQRRGGKGNRGPRAETYRYFANLAIGLCEGEAALVRRVWADGREIDLTGIEMRFYPGSATQLPDPLIEAKQGSGNAPAFRGLAYVVIERLPLEAYGNRIPLMQFEVARPVGRLEKSIRAVTVIPGSTEHGYATVPVSEHTGMGESRVMNRNGLTAETDWRAAIDELQALCPALQSVALVVSWFGTDMRAGECRILPGVEVGDREGETAPWSVAGLTRAGAHPVSHHHGGPAYGGTPNDEGVLQAIADLKARGLKVCLYPFVMMDVPAGNGLPDPYGKAEQDAYAWRGRITCFPAPGRPGSADRTAGARLEIDRFCSRAEGYRRMVLHYADLSVKAGGVDAFLIGSELRGLTGLRDDNGAFPFVEALVRLARDVRAVVGRATKLTYAADWSEYFGHQPDDGSRDVFFHLDPLWTSSDIDAIGVDNYMPLSDWRDEDAANGNPDGMTGPDDAVAFRRAVTAGEGFDWFYASEADRTARRRTPIADGTTAKPWVFRYKDLRNWWENLHYDRPGGKESTTPTAWRPGSKPIWFTELGCPAVDKSATRPNVFPDPKSAENAFPHFSRKSRADSQQRRFLEAHLGHWENGDAAMVDTSRIYLWTWDARPFPAFPQNGATWSDGGNWRTGHWLNGRLGTATLADTIAAILTDHGFSAFDVSAISGDLIGYVQGEVTSARSLLEPLIAAFQLDVAEDAGTLRFRSRNGAVTPVRDLSILADLEGEPLWSENRGHDSDFPAEAVLTSFNPALDYEQASVRSRRVENAGSRVMRLDLNAALSSETAQAAIEAALRDSRQARRSLRFALPPGQIALEPGDCIRLSADAFPQAPAGRFMVSRIEDGAVRQVEARAFSPALSTFAGVADERRAHGASGSEGFAPEVLFLDLPYQDGAAAEQSARVAAFAKPWRHIVVSASSGTEGYGQRVVLDRPARIATLAMPLKPGPSGRFDRANTIVLDLPFGEVASAEPLSVLNGENRIAVRAANGVWEIVAFAKAEETARSRWRLSVLLRGLAGTEDALATGAPAGAPVVVLDAAVQPLGLLAGERGRRMNWIVEAAEQAGAPVGPFSVEAGFRALTPLAPVHLAGLRKTDGVLISWKRRGRMDADGWDASEIPLDEAFERYRVEVMDGDEVRRTADVSEPFWFYPATAELTDFPALRDHISVRVRQLGRAVPLGVAAQAVLPLS >NC_022535|925740:935557|925740_926907_+|WP_022555858.1|portal|DBSCAN-SWA MRFPFSLPRKRPADGNAMPENRKMAGGFMAVAMQGGQAFWSGRSYAALAREGFMTNPVAHRAARMVAEAAASVNWLLYDGDDEIGDHPLLALLARPGAHMGGPDFFEALYGHLMLAGNAYVEPLVIGGRLRELHLLRPDRLSIVEGPDGWPAAYDYRAEGRATRRIAAERDGLGLLHLKLFHPLDDRVGFAPLASAGAALDLHNAASRWNKRLLDNSARPSGALVYQPREGGNLSAEQYERLKLELEEGYQGAMNAGRPLLLEGGLDWKAMGLSPRDMDFLEARNGAARDIALALGVPPMLIGIPGDNTYANYQEANRAFYRLTVLPLVNRTAARLCGWLAPIFGVGLRLEADLDRIAGLAGERDALWTRIGAASFLSDEEKREAVGY >NC_022535|925740:935557|929183_929765_+|WP_004440761.1|tail|DBSCAN-SWA MAGEGSIAENREEAEALSEVMGDLERRSERFGAALTSALQAATTGGKGLDDVLRGLGQRLSGIALSAGLKPFEAMIGNAVGGLLNGGGSLFAFADGGMPGRSVTPFAEGGVVSSPAFFPMGGGLGLMGEAGAEAILPLKRGPDGALGVAAPSAGGGSQIIFNVTATDAASFRKSEGQIAAMLARSVGRGQRGL >NC_022535|925740:935557|930498_931305_+|WP_022555864.1|DBSCAN-SWA MKTIAPALAGHLESDATTTCHCWKVTLKDGAEIGFTDHDEALFVAGTAFLAASGFSASDNDSETGLGAGAGEVAGGFSSEAISEGDLAAGRFDGARVELYLVNWQAPEEHVLLSLREIGDVTRAGGSFRAELRSFAHRLNQPQGRVYGRRCDAALGDARCGVALARFTGNARVVAVDAAGNLVATGLDAFPDGFFNGGKVRFLTGPLAGRGFDLDGHEQRNGGVLLSFWLPPDEAPLPGDSFPHLPGADFAYSYASGGHTHDGGALFP >NC_022535|925740:935557|931301_931736_+|WP_022555865.1|DBSCAN-SWA MSETGDKVLALAEPWIGTPYRHQASLRGVGCDCLGLIRGVWRELYGSEPELPPPYAPDWAERCGEDRLMAAVQRHFPSVSGLDEAKPGDLLLFRWRADAAAKHLGILAESRHFIHAYEQAAVVRSALVPCWKRRIAGVFRFPDP >NC_022535|925740:935557|928558_928918_+|WP_006700098.1|DBSCAN-SWA MPQGLRYGRANRHRGEIEALFDGERRILCLTLGALAELETAFEVDDLTALAERFASGRMKATDMIRVIGAGLRGAGNVFSDEDVAGATVEGGIAGHAAIVAELLTATFGGLKGETPRDP >NC_022535|925740:935557|927095_927416_+|WP_022555859.1|DBSCAN-SWA MSEFANEAGIWAARITGAVAGAGVSLVYLLPKSKREAASRFITGVSCGMIFGGPTGLWIVQQLDIAGALSGREIMVAGSAAASMGAWWGLGVLVRIADRYSARPRA >NC_022535|925740:935557|929863_930502_+|WP_022555863.1|DBSCAN-SWA MVAFHEVRFPLRLALGVSGGPVRRTDIVNLSNGRESRNQRWRNARRAYDAGSGIRSLTDLYEVLAFFEARRGELYGFRFRDPVDFKSCPPGDTPASTDQRIGVGDGVTVRFQLVKTYADAGGFFTRRVEKPAEGSVLVSVEGVPVSASDVSCDYATGMVTFGSGKVPPAGAVVRAGYEFDVPVRFATDRIDINLTAFEAGRIPAIPLMEIQP >NC_022535|925740:935557|928914_929244_+|WP_084317300.1|tail|DBSCAN-SWA MNAAAGDATPRPFPWDAVLHAGFCLLRLSSEVFWRLTPREFFAMTGGVRSGSHGPDRQEMEAMMRRFPDRDASDHRFAKKLQFESERQVRWRVKDRSRKIGRRRKPCPK >NC_022535|925740:935557|928151_928559_+|WP_022555861.1|tail|DBSCAN-SWA MVAQKGKDFLLKFNNAGTYVTVAGLRTRRLAFNAQAVDVTDGESVGRWRELLAGAAVQRAALTASGIFKDAASDALVRGAFFAGTIPGWQIVIPDFGTIIGPFQIVALEYSGRHDGEVQFEIALESAGLLTFGAL |
10 | Paracoccus_phage(33.33%) | portal,tail | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1418490 : 1430543
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NC_022535|1418490:1430543|DBSCAN-SWA ATCAGCCAAGGCCGTTCTCCTTCATAAAGTCCTCGAGGCCGACCGGCTGCTCGGAGGCGGTGAAATAATGGGTGATGGAATTGATTAGGGTGCGCCAAGCGCTGCCGTACCGGTTTGAAATGCTCCACCATTGGGTATCGCGAGTTGGATCGGCGTATTGCGTCTTGCGCCTCAGCCGGGTGAACTTTCGATCCCGTGCCTTTAGTTGCTTCCAGCGCGCGCGATAGCGTGCAGGTATCGCCCGCCAGCATTTGCCGCAGATGATCGAGGTGGACCCGGGGAACCGGTCCTGCGCCGCGGTGCAGCGGCAGTTGGGGTTGATGCAGGGGATGCGGTCAGTCATTTGGAGCCGCCTCCCATACGCGCTCGACCTTCTCATTTGCCGCATCGGTGGCGGGCTGATTGACCACAGATTTACCATCGGTGGTGCGGTCATGGATGGTAATGGTCTGGCCGTTAAGCGCGGCACTCAACACGGTGTCCACGCCTGCCCTGTCGGTTCCCCACTGCACCTGCGCCGCCGGCGTGACCTGCGGAACGGTGATGGCCCATTTCGCATTGGCAGCGGAATAGAACGCCCGTGGCTTTACCGCGCCCTGCCCGATGTGGCTGACAAAGTCCTCGACGTGGTTTGCTGGCAGCCATGGCGCACCTGGCTTCACATCGATGTCGATCGCTTCGATATCCGCGGGGATGACGTCGCGCAGCGCGTTGACGTTGCGCCGATATTCCGGATCCTGTTCTGCGGCACGCTCTGCCTCGGCGAGCTTCTGCTTCACGTTTCCGGAGAGATACTGGTCGGCAGTCTCATATCCCCCGGTCGGCGTCTTGAACACCAACGGGCCCAGCTCATCGACGATCGCATCCACCGGCTTGCCGTAGAGCTGCGACATGGCCTCGAGGTTGATGCGGCCATAATCGTTCAGAACCGTGGCGAGCGCGTCCTTGGCACTGCTGGCGCTGGTGGGGCGGCGATACGGCTGCTGGGTGCGACGGGTGAAGATGGCTGCCTTCTCAGCCGTCGCCGCGCGGGCCTTCTCGCCCGTTTTCTTCGCCATCGCTGCACTGAGGCCCTTATCGAACGACTGCTCGAGCGCGCTGATCTGTGGCCACGTCGGATCGTCGCGGAAAAGGCGCTTGTTTGCGTCGGAATTGATAGGACCGTGAGACTTTACGAAGGCATCGTAGAGATTGTTCAACCGATTTCGCAGGTTTTCAATCTGCTGATCGGTTGCCGTTTCACTGATCTGAGCGCGCCGCAGCTTGGCAAAGGCGTCTCGGACACGGATCATGCCGGAAACGCGCTCCTTGGCCGTTTCGTTCGGGAAGGCCACCGCCTGCGAAGTGGGCTGGCCGATATGATCGGCGGTGCGCAGATGGATCGTGCCATCGGGTGCGGCAAACATGGTGCCAACCGGTCGAGGAACTGGTTAAATTCGGTGAGGAGGGCAGTAGCCATGGCAGGATTCGGCCCGGCGTCGTCGTTCGGCAGCATCAGCGCCAACCCGGCGGCACCCTTGCCTATTTCCGTCGCATACTTCTCGAGCATGAAATCGGGGAGCGTCATGGCGCGCAGCGACGGTTTCAGGACGAGGCGAACGGAAAGGGTGCCGGTGGCGCGGGGCGCGACCATGATTTTGCCAGGCGTGATCTGCGTTATAAACCGCGCCGGCGCTTCGTTCTGATTATCGCCGTCCCAGCCCGGGTGGTTTTCGTCGAGCCATGCCGCTGAGTACGGGGTGAGGGGGACGCCGTTGAGCTTGGCCGCCTGAATCTTCTTAATCTCGGCGTCGCCGAACGTGCTCAGGCACTCGCCATCGATATCGGTGATTTCGATCGTGTCTTTCTCGCGCCAGATATCGGCCTTCTCGCACACGTCCCGCGCTGCCTCGCGGATGCAGCGGTACGCGGTCAGATCGGCGCAGTTCGGAACGTAGGGCAGGAGGTACGGCAGCATATCGTCTATGTCACGCATCACCGCCTCCTGTTCGGGTTGGACGCGGCCTCGCCCTGCACCTTGATGCCAAGGGCGGTGGCGAAGGTCTGGTAATGCGTCATGGCTTTGGTCGGGTCGCCAGCGATGTCATCCTTGGAGAAGGCCTTGAACAGCGCATAGTCGATCAGCGGGACCGTGTAGGGCTCCGGAAGCCCAATATCGACGTCCCACGCTTCGAGTTTTTCGACGTCCTTGTTCGGGAGGGGCGTGACCTTGGCTGGAAGATAGGAAATTGCTATCTGCACAACGCCGCTGCCGTTGTTGCCCGGATAGCACTCAAACTCGAGCGGCACGTTTTCGTCGAACGCGACCTGGCGCACTTCCTTGGCGAAGGGCGCATAGGCAGGGTTGCGCCAGTTTGGCTCGTGTGAATCGAGCATCGAGCGCGATGCGGTGCGGATCGCGCGACCGCCGATGTTTTTCACCGTATCGATGATGTTGTGATTGACGCCGAGGAGCTGGAGCGGGGTGACGTTGTCGAGCGTTTCGGGGATCTTCTGATATGTGCCCTGCTCGAGCGGGAACTGAGCAGTTTTTGCCGACGCCGAGGGCTTGGCAAGGATGATGGCCTTCACCGCATCGTTGATGCAGTCGGCAAGCTCGCTAAGCGGCCAGCGAACATTGTCTTCGTCCAACAAGAGGACGCTGGCGCGCTTCATCACTTCACTTGCCTTCGGCATGGCTTACTTGCTCTTGCTGGTGGCCTTTTGGGCGGCGGTGGCTACCGGTGCCTGCTCGGGTTGCTGGTTCGTCTCAGCATTTCCTTGATCTGCATCTGCCGGAGTGGATGGGTTTTCTTCGCCGTTGGGCTGCTCGCCGCCATTGACGGCATTGGCTTCGCCTCCAGCGTTGGAACCCTCACCTTCACCTTCGCCGGTGCCTTCACCTTCGCCGCTGCCGCCCTGTCCGGAGAGGAACGCGGGAATTTCCTGCTCGTCCTCAGGCGGGGGCGGATCAAGAGGGACCTCGCGGTAGTGCTGGACGTTGAGGAAGATCGAGCGATGCAGAACGCTGTTCACCTCGTTGACGAAACGGCCATGCGCGTCGCGGTCGAAGCTATAGGTGATGCCACCCACCGTCTGTTCTGTAGCGCCCAGCATGCATTCGATGACAGTCTTCATATTCCACTCCTTGAAAGAAAAAGGGGCTTGGCAGCCCCTTGTCACATCCAGCCTTGAGGGGCTTACTCGCTGGTGAGGGTGACGGTCAGACCGATCGCACCGGCTTGGAAGGTGGCCGCCGCCGTAGTGAATTTCACGCCGATACCGCGATCCACATTCGATGCGGTGGTGCGGTATGCGGTCTTGAGTGTCGGTCGGGCCACGCCGCCCGCCTGCGCCAGATTGGAGCCGGAGAAGAATTCAGCGCCGCAGGTGCGGGCGTTATCTTCCTTGCCGAAATCGCCGCTCATGATACCGACGTCAACAGCGACAGTCGGCGCACCGTTGGTATCGAGATCGTCAATGTCGAGGACGATGTCGGCGACGCGGCAATTCGACGGAATGCAGCCCAGCTCGAGGATATCGCCAGCGGCGGGCGCGGAGGCGAGCTGATGCGAGAAGCGGATGGCAACAGCTTCGCCAGCCGTGGACGGGTAGGAAAGGGGTTCGGTCCCTTTGGCATATTTGCTCAGAATGAGCGACATGAGCGTGTTCCTTTGAAGTTCGGGCCGCAGTGGCGAAGCTGGAAAAGCCCGGGGCGAGCCCCGGGCAGGTCATCAGGCGTTGGGGTCTTTCGACGCGGTGTCGATCGAGATCACGCCATAGTCGCGGTTGTTGAAGCGCGTCTTCTTGACGCCGGCGATCACGCCAGAGGCAACCACGGGCTCATTGCCATGGTCCTTGGTTTCTTCGGTCCAAGTGTAGCGGAAGCCGCCCGCAGAACCGAAAGCGATCACGCCACCTTGACGTCCCATGAACAGGGCACGACCCGCGGCAACATCGGCCCCGGCGCCATAGTCGGCAAAACGTATCGCCCATTCGTGGCTGTGCAGGACCGTGTTGTTGATCATGCCGAGGCCACCCTTGAAGATCGGGTTATTGCGGCCTTCGGCGGTCGCTGCCGCCTTCTGGATTTCGAGCCAGCCGCCCTGATCCTTGTTGCGCAGGTCGTGCTCCTGGAACGGGTTCATGACGCAGACGTAGTGCGCTTCGCCATTGATCATGATCGGCATCATGTTGGCGTTCTTGGGGTCCTTGGCCGACATCATGCGGGCCTTGGTCTGCGCGCGCTCGATTACCGAACGGGACATGATGTCCGCCGTGTCGATAGTCGCTTTCGACGTCGCATCGCCACCGTAGAGGATGTGATCCGCGTCGGGCGCTTCGATCGGGTTCTCGGCGTGGCCAGCCCATGCCGTCGTTTCGATGAAGTCTTCATTGATGCCACGTGCGCCGGACATGTAGATGAAGATCATCTGGTCGTTGAACTTCGCCCAATAGTCGGAGAGGCGGTTCTTGCCGACCTGGCGCATGTTGTGCGCGGTGCGCTTGCGGCTCATCTTGCCGCCGGCGGAAACGCCGTGACGCATCTGGTCGATTTTGATCTGGTCAGAGAAGAAGCGAAGGCTTTCCTCCTTGCCCTCGAGGCGCTGGTCGCCATAGGTCGGGCGGTTGCGGAGCTGCACCGACAAATCAAAGGTGATCGTGTCACCTGCTTCCGACTCGAGATCGGTCAGGCGCTGGATCGCATATTCGTCAGAAGTGCCGATGAACTTCTTGTCGAAATAGCTTTTTTTCGTGATGTCGATGAAGAGCGCGCCAGACCACTTTTTCTGCGCTTTCGGATCGCCAAAGGCAACCACAGTCTTGGTCATGAGTGATGTCTCCGGTTCAGGATCAAACAGCACTCATGCGCATCTGCTCTCTAAATACTGGATTTGGCGGATTGTTGCAATAACTCCCGCATGATATTGAATCCCATCGCTAGCGCATGAGTGCGGCCCCTGATTTTGGAGCGCACCCATGACTGACCGATTCGAGATTTGCCACGCCATCACGGCTAAGTGGGAAGGTGGATGGAGTGACCATCCAGCCGATCCGGGCGGCAAGACGATGTACGGCATCACCGAAACCCGCTGGCACGAGTATCAAGATAAGCTAAAGGTCAAGCGGACGCCGGTGCGCAACGTCACCAAGGCGCAGGCCCTCGCGTTCTACCGCAGCGAATTCTGGCTTGCCTGCGGAGCTGACAAGCTATTCCCCGGTGTTGACCTTGCTGTACACGACGGGTCGGTAAACTCTGGCGTCTCCCGCGGCCGGAAATGGCTGCTGGCTTCCGCCGGAAGCAACGATCACAGCGAGACGGTCAAGAAAATCTGCCGCGCTCGCCTTTCCTTCATGCAGTCACTCGCGATCTGGAAAACGTTCGGCAATGGCTGGGGGCGTCGTGTCGCTGATATCGAAGCGCGTGGCGTTGCCATGGCGCTCGCAGCGATGGGGCTTTCCCCTTCGCAGGTCAGCGGGAAGATCAAGACAGAAGCGGCCAAATCGGCCCAGCAGGCCAGCTCGGCAAAGAAAGCGGCCACCACAAGCGCCACCGCGGCGTCAGCGCCAGCCGCCGCGCCCGTTGTCGAGCCTTCCACCGTGACGGACGCAACAACCGTCTGGATCCTCGTCGCCATTGTGGCGGCCGGTGCCGTTGCCACCGTTATCTTCATCGCCAAGAAGCGTGCCGCTGATGCCCGCGTTGAGGCCTATAACGAGGTGGCAGCATGAGCGCTCTCGCGTCTGTCCTTATCGGAGCTGCGCTGCGCGTCGGCGCGACCACGGTCAAAACCATCCTCGAGAAGCAGGTAGGCGGCGTCGCGGGCGAAATCGGCGGCACAGTCATCGACGCGATTGCAAAACAGGCTGGCGTAACGGTGGACGAGCTACCGACGGTGCCGCAGTCCACGCTGGACGAGGCGGTAAGCCAGGTCGAACCCATCGCCCCGGCCTTGATCCTTGCCGAAGTTGAACAGCAGAAAGAGGCGAACCGCCTCATGCTGGCCGAGATGAACAAAGATACGTCGTTCGGCTGGCTTTGGCGGCCCGCAGGCATGTGGCTCATGCTGGTCTGCATCGCATGGTTCGTCATCGTCCGGCCGTTGCTCAACGCGCTGCTTTGGGCAACCGGTACCGGCATCCAGATTGAAGTCGGGCTCGACCTCGCCACCTTCCTTGGCATTTTCACGATCTATACTGGCCTCTACATGGGCGGCAACACGGTCATTCGTGCTGTGAAGAAAGAAGGCTGATGTCTTTTTGGGACTGGTGGGGTTCGGCAGAAGGTAAGATCGCGCTCGCAGGTATTGCGGGCTCGGCCGTTTCTGTCGCGATGGAATGGACCGGCTGGGCTCCCAGCGCTCGAAAGTTCTTGGTCGGTGCCGCTGCCGCCTACTTTCTCAGCCCGGTTGGCATGAAGTTCTTTCACTTCATCTTCGGGGCGATGAGCATTGCGGAAGAACAGTCGGCAAGTGTGGGCGGTTTCATCACCGGTATTGGCGGTGTTATCATCGTCGAGATTATCCTGAAGGCATTTCGCCTCCGTCACGCGGAGATCGGGAGACGCCGGCATGACGAGACCTAGAGCAAGGCACATAAGGGAGGCGGCGAAGCCGCAGGGCAGGGTAGTGGCCGTCGCGACGGTGATACTCGTCGTGTGGCTCATTTTTCTCCAAGCTTAGTTCCCATGGGTTGACATAGCGCCCCATATGTTCCCATATTGTTCCTGCGGATAGTTCCAACGCGGCGAGGCAGCTCGCCGTCACGTTTTGTATGTTTGGAGCCCGCTATGTCTGTCTCACCCAATGAAAATCTTGTCTCTCTCGACGCCTACCTGTCCCTCAAAGAAGTCCTGGCAATGGTGAAGGTGGGTTCATCAACGCTGTACCGCTGGATGGACGATGGCGACTTCCCGAGACCACGCCAGTTGGGCGAAAGATGCGTTCGGTGGACGGTAGCGGACATAAAGCAGTGGCAAGACACCCGCCAAACAGTTGGTCGGCTGAAAAAGGCATCATAGAAACTGAATTTTATTCCCGGGTATAATACCGGGTACATTTCAAAATCACCGAAAAATTTCTAAGCCAATTTCAATGCGGTAGGGGTCTAAACGTGACAGACACCCCATCTGCCATCAAGCGCAACATCATTCCTACAATGTCGCGACGAGGCGTGTTCCCGCAGCGTTTGGGCATCGCGTACATTCCGTACGAATAAGAAACCCGCCTTTCGGCGGGTTCCAGGCAGCGGTAGTGGATCAGATTTCCGGACAGCCTCGTTCATTTGCGAAGGTGATCCGGTCCGGTCCCCGGCGCGTGAAGCCTTCGACAACAACGCGGCCCGGCGTGACGCGCACGACTTCGGGATCACGAAGGCCTGCTTCACGGGCCGCGGCGCGGGCTTCGCGTTCGCTGCAGCCACGCATGCGTGGAGGACCCCGGCGGTCCATGTCCCGATCACGGTCGCGGATGACGGGACGGACGCCATCCGGGCCGATTCTGAGCTCCATATCCTGGGCACCGGCAGGCATAGGTGCGGCGAAGGATACCAGGGCGATGCCAATCGCCAGACCTGTGGTTTTCATAAAATTTATCATGTTTCCTCCCTCAGCCGAAACCGGCGGAAGCTCGTTTGTTTGCGCGGAGAAAACGCGGTTACCGCTGCTTTGTTCCTTGCGAGGCTGCGAGGGGAGGACTGACGGCGGGCGGATTCCGCCGTGGAGGGGCATGCCGAAAAAATAGAAATCATTAACCACGTTCGTTACAATTTTATCGAATTTGCAAAAGTCGTTCGCATTTTCGCCACGTTATGCACAAGATTAAGCGACTGTTAGGTGCTTCGATGGCAGGGGCAATATCGAGGCAAGTGAGGGTGCGGTTCTCCGAGCCAATAGCAGCGTGGGGCAGACGATTTGAATTCTACGGTCAAGAAAACAGGCATCAGCGCTAGGTCGGGCGTGAAGTGGCTTGCAATTTCCGTGCTTTGCGCTGCAACGGCGTCGTGTTCCACCACCTCCGAGACAAAGCCTAAGCCCAAGCGCAGCAAGGAATATTTCTCCGAATCCGAATATGGCGTGAAGGCCAGCCCGCGCGTTGCCGATGGAAAGAACATCCCGAAGGGTGGCGGACGCGAGCTGCTGGGTAACGCCTACACCGTCAAGGGCCGCCGTTACTTCCCGAAAGAAGAGCCGGGCTACAACAAGGTTGGCCTTGCTTCCTGGTATGGCTCCGCCTTCCATGGACGCCTCACGGCCAATGGCGAGGTTTACGACAAGGAACATCTTTCTGCCGCACATCCGACATTTCCGCTGCCGAGCTACGCGCGCATCACCAATATGGATAATGGCGCGTCGGTGCTCGTCCGTGTCAACGACCGCGGTCCGTTCCACGAGGGCCGTCTGATCGACGTTTCCAGCAAGACGGCAGATCTTCTGGACATGAAGGCGACGGGGACAGCAAATGTACGGGTTCAATATGCGGGCCGTGCGCCGCTCGACGGCCACGACATGCCTTACCTGATGGCTTCCTATATTCCGAAGGGATCGCGCTCGCCGGGCGTGGCGCCTGAAGGGCAGATCGCGACCGGCGTGATGGTTGCTTCGGCCTCGCCGAATTTCGTGCCGGCGCCGACATCGAACCCCAATTATGCCGGCTCGGCTCAGACCGCGCTCGTCGGTTCCAAGAAGAATGCGGCGCTTCAGGCCATGCCACTCGTCAACGGTGTACAGCCGGCATTCGAACAATTCGCGATCCTGCCGGAAATCGGCCCGATCCTGGCGGAGCGCCCCGAAGGTAACTTCCTGAGGGAACCAGCGGCTCCAGGCGGCAATTATCTGCGTGTGCCCACGCCGTCTTCCAAATATGCAGCGGCCTATTCCGAAGAATCCGCCGCCGTATCCAAAACCGCAAAGGTGTTCGACACCGTGCTCGTCAATCGTGACGGCTTGAGCGAAGAGTCGATCCTCGCCCATGTGAAGCGTCAGCAGGCGAAGTCACGCTAACGGATTAAGGACACCGCCGCACCGTTCTCATGGCGGGTGGCCGGTTGTGGCAGACAGCACTGGGCGGTTTCATGCGCAAGACCTTTCATCGCTCCTATCTGTCGGCCGTCGCGGCGGCCGCTTTTGCATTTTTGAGCGCCGGCGCGGCGCAATCGCAGACAGCGCCCTTCATCGCTAAGGCCGAGCAGGCCTATATGATCGATGCCGAAACCGGTACGGTGCTTCTTGCCCAAAATGAGGATCAGTCCTTTCCGCCCGCATCACTTGCCAAGCTGATGACGGTGGAGGTGGTGCTGGATGCGCTGTCGAGAGGACAGCTGGCGTCGGAAACGGTCTATCCCGTTTCGGAATATGCCTGGCGCACGGGCGGGGCCCCATCGCGAACGTCAACCATGTTCGCCGCCCTCAAATCCTCCGTTGGCATCAATGACCTTTTGACCGGAATCATCGTGCAGAACGCCAATGACGGCTGCATCATCATCGCCGAAGGCATGGCGGGATCGGACGCTGCCTTTGCCAAGCGCATGACGGCACGGGCCGGCGAACTTGGCATGAACCGCAGCACGTTTGCCAATTCCACCGGACTTCCCGATTCCGGAAACCAGACGACCGCCAGGGACATGGTGCGCCTTGCCCAGCATCTGCATGACACCTATCCCGACCGCTACTCGTTGTTTACGAAGCCGGACTTCGAATGGAACCGGATTTTCCAGCGCAACAAGAACACGTTGCTGATGCCGGGAAGCGGTATAGACGGTCTTGGCCTCGGTTTTGCAGAAGGCAGCGGTTTTGCCGCCGTGGTTTCGGCGGAGCGGGAAGGGCGCCGCATCTATCTGGCGCTTGCCGGCATCGCAGACGACAAGACGCGGCAGGAGGAAGCGCGCAGAGTGGTGGACTGGGGCCTGACCGCCTTCGAAAAACGCCGCCTGTTCAGCAAGGATGAAGTGGTTGGCTCCGTCAGCGTGTATGGCGGCGATGCCTCCTATGTCGACCTTTCTCCGAAGGACGATGTCAGCGTCCTGGTGCCCGTCAACAATCCGGAACGCCTCTCCGGACGTATCGTCTATCGCTGGCCGCTGAACGCGCCGCTCGATGTCGGGGCGAATGCCGGAACGCTGAAGATATTCTCGGGCGAGCGTCTATTGCGCGAAGTGCCGCTCTATACGAAGAGCCGGGTGAACAAGGGTTCGCTCACCCAGAATGCCGCAGGAGCATTGAAGGAACTTCTGCTGTTCTGGCTGTGAAGTGCGGTTCAGCACGCGCCTTTTCTCCTATGCTTTTGTTCAGGCACCGGTTTTTCTGAAAAGCGCTTCACACATTTCGGTCCGATGCTGTAAACAGGCATTTCAACATTGACGTGAATTCCTTGCAGGACGTGACATTGGCCGAAAAGACCGGATTGTTCATCAGCTTCGAAGGCGGCGAAGGCGCCGGCAAATCGACGCAGATTCGCACGCTTGCCGAGGCGCTTCGCGATCGTGGCTTCAAGGTGGTCGTAACGCGTGAGCCCGGCGGTTCGCCGGGTGCGGAGGCCGTAAGACATGTTCTCCTTTCCGGTGCGGCGGAGCCGTTTGGCGTCCGGATGGAAGCCATCCTGTTCGCCGCCGCGCGCAACGATCACGTCGAAGAGGTCATTCGCCCTGCGCTGGAGCGCGGCGAGATCGTGCTCTGCGACCGGTTTCTCGATTCATCCCGCGTCTATCAGGGCACGACCGGCAATCTGGAGCCGGATTTCATCGAGACGCTGCAGCGCATCGCGATTGATGGCGTCGTTCCGGAGCTGACACTGATTTTCGATATCGCTGCCGAAAAAGGCCTTGCCCGCGCCAGAAAGCGTGCGGATGAAGGCGCGAAGCCCGATCGCTTCGAGAAAGAGGAAATCGAAACCCACGAGAAGCGGCGGGAAGCCTATCTCGATATCGCGCTTGCCGAGCCGCGTCGTTGCCGGATCGTCAATGCCGATCAGCCGGAAGAAAAAGTTGCGGAGGATGTCATGTCCTTCGTCGAGCCCTTGCTCGAACAGCTGGGGACCGAGGCGGGCAGGGCCCATGAGTGAGGTCCAGGGCGTTCTCGACGGTGCAATCGCGCCGCAGCAGAACACGAGGCTTTTTGGCCATGAGGAGGCGGAGGCGTTTCTCGCGCAGTCCTATCGCTCTGGCAAAGGCCATCATGCCATTCTCATCGAAGGGCCGGAGGGCATCGGCAAGGCGACGCTCGCTTTCCGTTTCGCCAATCATGTGCTGAACCATCCCGATCCTTTGTCGTCGCCGGAGTTTCTGCCAGACCCCGATCCGCAGTCGCTGGTCAGCCGGCAGATCACCGCCGGCGCCTCGCATAACCTGCTGCATCTCACCCGGCCCGTGGATGAAAAGACCGGCCGCGTTAAATCCGCCATCACGGTGGACGAGGTGCGACGGGCAGGGCATTTTTTCTCGCAGACCTCGGGCACCGGCAACTGGCGCATCGTCATCATCGATCCCGCCGATGATCTCAACCGCAATGCCGCGAACGCGATCCTGAAAATACTGGAAGAGCCGCCCAAAAGGGCGATGTTTCTCGTGCTCTCCCACGCACCGGGAAAATTGCTGCCGACAATCCGGTCACGCTGCATGCCGCTCAGGCTGCTACCGCTTTCGGATTCGGCCATGGTGCAGGCGCTCGATCATCTCGGCATAAATCTCTCGGAGGAAAAGCGCGATGCCCTGCTTGCCGCCTCCAAGGGTAGCGTTGCGCAGGCGCTGAAGCTGATGAACTACGGCGGCTCCGATATTGTCGAGGCCCTTGCCGCAGTCATGAATGCCGACGGGCCGGGCGCCCGCAAACAGATGCACAAGCTCGCCGAGATCCTGGCCCAGAAGGATGGCGACATCGTCCTCGGTTTTTTCATGGAACATGTAACGGAAGATCTGATGGCGCGCGCGCGCGCGGCGGCCATGACGGGCGATATCACGGCTGCGGAAAGGCTGGCGCGGCTTTCTTCGGCGCTCTCCGAACGCATCACCGTGGCCCAGGCCTATAATCTTGACAAGAAACAGATGGTAATTTCCATTCTTGAGGATATCCGGGGCGTCTAA
Protein sequences of DBSCAN-SWA_2 >NC_022535|1418490:1430543|1418490_1418832_-|WP_048902683.1|DBSCAN-SWA MTDRIPCINPNCRCTAAQDRFPGSTSIICGKCWRAIPARYRARWKQLKARDRKFTRLRRKTQYADPTRDTQWWSISNRYGSAWRTLINSITHYFTASEQPVGLEDFMKENGLG >NC_022535|1418490:1430543|1429517_1430543_+|WP_022556152.1|DBSCAN-SWA MSEVQGVLDGAIAPQQNTRLFGHEEAEAFLAQSYRSGKGHHAILIEGPEGIGKATLAFRFANHVLNHPDPLSSPEFLPDPDPQSLVSRQITAGASHNLLHLTRPVDEKTGRVKSAITVDEVRRAGHFFSQTSGTGNWRIVIIDPADDLNRNAANAILKILEEPPKRAMFLVLSHAPGKLLPTIRSRCMPLRLLPLSDSAMVQALDHLGINLSEEKRDALLAASKGSVAQALKLMNYGGSDIVEALAAVMNADGPGARKQMHKLAEILAQKDGDIVLGFFMEHVTEDLMARARAAAMTGDITAAERLARLSSALSERITVAQAYNLDKKQMVISILEDIRGV >NC_022535|1418490:1430543|1419777_1420464_-|WP_022556141.1|DBSCAN-SWA MRDIDDMLPYLLPYVPNCADLTAYRCIREAARDVCEKADIWREKDTIEITDIDGECLSTFGDAEIKKIQAAKLNGVPLTPYSAAWLDENHPGWDGDNQNEAPARFITQITPGKIMVAPRATGTLSVRLVLKPSLRAMTLPDFMLEKYATEIGKGAAGLALMLPNDDAGPNPAMATALLTEFNQFLDRLAPCLPHPMARSICAPPIISASPLRRRWPSRTKRPRSAFPA >NC_022535|1418490:1430543|1420463_1421165_-|WP_022556142.1|DBSCAN-SWA MPKASEVMKRASVLLLDEDNVRWPLSELADCINDAVKAIILAKPSASAKTAQFPLEQGTYQKIPETLDNVTPLQLLGVNHNIIDTVKNIGGRAIRTASRSMLDSHEPNWRNPAYAPFAKEVRQVAFDENVPLEFECYPGNNGSGVVQIAISYLPAKVTPLPNKDVEKLEAWDVDIGLPEPYTVPLIDYALFKAFSKDDIAGDPTKAMTHYQTFATALGIKVQGEAASNPNRRR >NC_022535|1418490:1430543|1428850_1429525_+|WP_004441759.1|DBSCAN-SWA MAEKTGLFISFEGGEGAGKSTQIRTLAEALRDRGFKVVVTREPGGSPGAEAVRHVLLSGAAEPFGVRMEAILFAAARNDHVEEVIRPALERGEIVLCDRFLDSSRVYQGTTGNLEPDFIETLQRIAIDGVVPELTLIFDIAAEKGLARARKRADEGAKPDRFEKEEIETHEKRREAYLDIALAEPRRCRIVNADQPEEKVAEDVMSFVEPLLEQLGTEAGRAHE >NC_022535|1418490:1430543|1424721_1425054_+|WP_022556148.1|DBSCAN-SWA MSFWDWWGSAEGKIALAGIAGSAVSVAMEWTGWAPSARKFLVGAAAAYFLSPVGMKFFHFIFGAMSIAEEQSASVGGFITGIGGVIIVEIILKAFRLRHAEIGRRRHDET >NC_022535|1418490:1430543|1427540_1428713_+|WP_035208907.1|DBSCAN-SWA MRKTFHRSYLSAVAAAAFAFLSAGAAQSQTAPFIAKAEQAYMIDAETGTVLLAQNEDQSFPPASLAKLMTVEVVLDALSRGQLASETVYPVSEYAWRTGGAPSRTSTMFAALKSSVGINDLLTGIIVQNANDGCIIIAEGMAGSDAAFAKRMTARAGELGMNRSTFANSTGLPDSGNQTTARDMVRLAQHLHDTYPDRYSLFTKPDFEWNRIFQRNKNTLLMPGSGIDGLGLGFAEGSGFAAVVSAEREGRRIYLALAGIADDKTRQEEARRVVDWGLTAFEKRRLFSKDEVVGSVSVYGGDASYVDLSPKDDVSVLVPVNNPERLSGRIVYRWPLNAPLDVGANAGTLKIFSGERLLREVPLYTKSRVNKGSLTQNAAGALKELLLFWL >NC_022535|1418490:1430543|1423448_1424201_+|WP_022556146.1|DBSCAN-SWA MTDRFEICHAITAKWEGGWSDHPADPGGKTMYGITETRWHEYQDKLKVKRTPVRNVTKAQALAFYRSEFWLACGADKLFPGVDLAVHDGSVNSGVSRGRKWLLASAGSNDHSETVKKICRARLSFMQSLAIWKTFGNGWGRRVADIEARGVAMALAAMGLSPSQVSGKIKTEAAKSAQQASSAKKAATTSATAASAPAAAPVVEPSTVTDATTVWILVAIVAAGAVATVIFIAKKRAADARVEAYNEVAA >NC_022535|1418490:1430543|1425258_1425489_+|WP_022556149.1|DBSCAN-SWA MSVSPNENLVSLDAYLSLKEVLAMVKVGSSTLYRWMDDGDFPRPRQLGERCVRWTVADIKQWQDTRQTVGRLKKAS >NC_022535|1418490:1430543|1426380_1427469_+|WP_006697463.1|DBSCAN-SWA MNSTVKKTGISARSGVKWLAISVLCAATASCSTTSETKPKPKRSKEYFSESEYGVKASPRVADGKNIPKGGGRELLGNAYTVKGRRYFPKEEPGYNKVGLASWYGSAFHGRLTANGEVYDKEHLSAAHPTFPLPSYARITNMDNGASVLVRVNDRGPFHEGRLIDVSSKTADLLDMKATGTANVRVQYAGRAPLDGHDMPYLMASYIPKGSRSPGVAPEGQIATGVMVASASPNFVPAPTSNPNYAGSAQTALVGSKKNAALQAMPLVNGVQPAFEQFAILPEIGPILAERPEGNFLREPAAPGGNYLRVPTPSSKYAAAYSEESAAVSKTAKVFDTVLVNRDGLSEESILAHVKRQQAKSR >NC_022535|1418490:1430543|1418824_1419895_-|WP_022556140.1|DBSCAN-SWA MFAAPDGTIHLRTADHIGQPTSQAVAFPNETAKERVSGMIRVRDAFAKLRRAQISETATDQQIENLRNRLNNLYDAFVKSHGPINSDANKRLFRDDPTWPQISALEQSFDKGLSAAMAKKTGEKARAATAEKAAIFTRRTQQPYRRPTSASSAKDALATVLNDYGRINLEAMSQLYGKPVDAIVDELGPLVFKTPTGGYETADQYLSGNVKQKLAEAERAAEQDPEYRRNVNALRDVIPADIEAIDIDVKPGAPWLPANHVEDFVSHIGQGAVKPRAFYSAANAKWAITVPQVTPAAQVQWGTDRAGVDTVLSAALNGQTITIHDRTTDGKSVVNQPATDAANEKVERVWEAAPND >NC_022535|1418490:1430543|1422202_1423300_-|WP_022556145.1|capsid|DBSCAN-SWA MTKTVVAFGDPKAQKKWSGALFIDITKKSYFDKKFIGTSDEYAIQRLTDLESEAGDTITFDLSVQLRNRPTYGDQRLEGKEESLRFFSDQIKIDQMRHGVSAGGKMSRKRTAHNMRQVGKNRLSDYWAKFNDQMIFIYMSGARGINEDFIETTAWAGHAENPIEAPDADHILYGGDATSKATIDTADIMSRSVIERAQTKARMMSAKDPKNANMMPIMINGEAHYVCVMNPFQEHDLRNKDQGGWLEIQKAAATAEGRNNPIFKGGLGMINNTVLHSHEWAIRFADYGAGADVAAGRALFMGRQGGVIAFGSAGGFRYTWTEETKDHGNEPVVASGVIAGVKKTRFNNRDYGVISIDTASKDPNA >NC_022535|1418490:1430543|1421668_1422130_-|WP_022556144.1|DBSCAN-SWA MSLILSKYAKGTEPLSYPSTAGEAVAIRFSHQLASAPAAGDILELGCIPSNCRVADIVLDIDDLDTNGAPTVAVDVGIMSGDFGKEDNARTCGAEFFSGSNLAQAGGVARPTLKTAYRTTASNVDRGIGVKFTTAAATFQAGAIGLTVTLTSE >NC_022535|1418490:1430543|1421168_1421606_-|WP_022556143.1|DBSCAN-SWA MKTVIECMLGATEQTVGGITYSFDRDAHGRFVNEVNSVLHRSIFLNVQHYREVPLDPPPPEDEQEIPAFLSGQGGSGEGEGTGEGEGEGSNAGGEANAVNGGEQPNGEENPSTPADADQGNAETNQQPEQAPVATAAQKATSKSK >NC_022535|1418490:1430543|1425726_1426065_-|WP_004441749.1|DBSCAN-SWA MINFMKTTGLAIGIALVSFAAPMPAGAQDMELRIGPDGVRPVIRDRDRDMDRRGPPRMRGCSEREARAAAREAGLRDPEVVRVTPGRVVVEGFTRRGPDRITFANERGCPEI >NC_022535|1418490:1430543|1424197_1424722_+|WP_022556147.1|DBSCAN-SWA MSALASVLIGAALRVGATTVKTILEKQVGGVAGEIGGTVIDAIAKQAGVTVDELPTVPQSTLDEAVSQVEPIAPALILAEVEQQKEANRLMLAEMNKDTSFGWLWRPAGMWLMLVCIAWFVIVRPLLNALLWATGTGIQIEVGLDLATFLGIFTIYTGLYMGGNTVIRAVKKEG |
16 | Agrobacterium_phage(78.57%) | capsid | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1591458 : 1599715
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NC_022535|1591458:1599715|DBSCAN-SWA GATGACACCTCCTTACCTGCTGCACCACCTGCTGGGCGCGCGCGCGGCAAGCGACGGGCAGGCGATCGTTTATAAAGACACATCCCTGAGCTACCGGCAATTTGCCGAAGCGGCGGAACGATGCGCGGCTGCCTTGCAGCAGGCTGGGGCGCAACGCGGCGACCGGGTCGTCATTTTTCTGCCGCGAGGCACGGAAGAATGCTGGGCGATCTTCGGCGTCAGCATGGCCGGATGCGTCTTCGTGCCGGTCAATGCGCTGCTGAAGGCGCAGCAGATACGCCATATCATCGTGGATTGCGGCGCCGAACTTGTCATCAGCAATGCGACCATGCGCGACGAGCTGAGCGCGGCGCTGGAAGGTCTGGCCGGTGTCCGCGTGCTTCTGGCGAACGATATCGCCGAAGGCAGCAAGACGTCAGTCAAAAGCCCGGCGGCAATCGGCGAAGACCTCGCGGCGATCCTCTATACATCCGGTTCTACCGGCTCACCCAAGGGCGTGATGCTGTCGCATCGCAATCTCCTGGCCGGCGCGCGCATCGTGCGCACCTATCTGGAGATTACCGGCAAGGATCGCATTCTTTCCCTACTGCCCTTCAGCTTTGATTATGGCCTCAACCAGCTGCTGACGGCAGTGGAACAGGGCGCGACAACAATCATCTCCACCTTTAGGCTCGGTGACGACATCGTCCGCGACCTGCGCGACCATGCCGTGACCGGGCTTGCCGGGGTGCCGACAGTCTGGGCGATCCTGACGAGGGCGGCGCCGTCGCTTGCCAGGACGCCGCTTCCGCATCTGCGTTACATCACCAATTCTGGCGGGCGCGTGCCGCAGGAAACTGTGAAGGCGCTGCGCGAAAAGCTGCCGGACACGAAAATCTACCTGATGTACGGCCTGACCGAAGCCTTCCGCTCCACCTTCCTGCCGCCGGATGAAATCGATCGCCGCCCGACCTCCATCGGCAAGGCCATTCCCGAATGTGAGATATTCATCGTCACCGCCGAGGGACAGCGGGCAAAGCCCGGCGAACCCGGCATTCTCGTGCATCGAGGCCCGACCGTTTCGCTGGGCTACTGGAACCGGCCGGAAGACACGGCGAAAGTCCTGCGCCCGCATCCCTTCATTCCGGCGGCGCTGGGCGGAGAAACCGTATGTTACTCCGGCGATCTGGCGGTGGAAGACGAAGACGGGTTCTTCAGCTTCGTTGCCCGTAACGATGCGATGATCAAGTCTTCGGGTTATCGCATCAGCCCGACCGAAGTCGAAGAAAGCCTGATGTCGACAGGCTTGTTCCAGCAGGTCGCCGTCATCGGCCTTCCGGACCCCTTTGCGGGGGAAAAGGTGCATGCCGTGGCGACTGCCGCCAATGAAAATATCGATGTTTCCGCGGCACTGAAGAAGGCCGCTGAAATGCTCGCCCCCTTCATGATCCCGCGCGCCATCGAGCTGGTCGACCGGTTGCCGGTCACCGCCAATGGCAAGGTGGACTACCGCGCGCTTGTGCGCGAACGGACGGACGATGGCGCCCACGGATAAACCCCAGAGCCACGGTCCCGCCCTTGCGGCCGCGTCATTCGATATCAGTGAAAACGATCTTGTGATCGGCGGGCTTGCCGTGCGCGATATCGTCGCCGAAACCGGAACTCCATGCTTTCTCTATGATGCTGGCGCGATGCGCCGCGCCTACCGCGATCTCCAGGCGACGCTCTCCGGTTTTGCCGATATCTATTATTCGGTGAAGGCCAATCCCCTGCCCGCCATCATCGCACTCTTCCGGCAGGAAGGCGCCGGCGCGGAAATCGCTTCCGCTGGCGAATATCGCGCCGCTATAAAGGCTGGTGTCGCGCCGGAAAACATCATCTTTGCCGGGCCCGGCAAAAGCAGAGCCGAATTGCACGAAGTGATCACGGGCGGCATCGGCGAAATCCATATCGAAAGCGCCGAGGAAATTGCCCGCATCGAGGCAATCGGCAAACCCGTCAAGGCCTCTGTCCGCATCAATCCGGTGCCGGATGCGCAAGCCGGCGCCATGCGTATGGGCGGCAAGGCCACCGCTTTTGGTTTCGATGAGGAAGAACTGGAAAATATCCTGCCGCTTTTCCGGCGCGCGGGCGTTATCGATCTCGTCGGCATCCATATCTATGGCGGCACTCAGATCCTCGATGCGGACATGCTTGTCTCGCAATGGCGACATGCCATCTCAATTGCCGCCCGCATGGCGGAAATGCTCGGCAAGCCGCTTGAAACCATCGATCTCGGCGGCGGCCTCGGCATCCCTTACTTCGCCGGCGAAGCACCGCTCGATCTCGCGAAGGTAAAAGCCGCCGTCCCCGATCTGAAGGCACTGCTGAAAGCGCATCCGCTGATCGCCGACGCCCATGTCATCGTCGAGCCCGGCCGCTTCCTCGCCGGTCCGGGCGGCATCTATACGGCCGAAGTCAATTCCGTGAAGACCTCGCGCGGAACCACCTTCGTTGTGACGGATGGCGGCATGCACCATCACCTGGCCGCTTCGGGCAATCTCGGTCAGATCGTCAAGCGCAACTACCCCATCGTTGCGCCCGCCATGATGCAGGCCGACCACGCGGAGACTGCAACCATCGTCGGCCCGCTCTGCACGCCACTCGATACGCTGGCTCGCAATGCGGCATTGCCGAGGCTCCAGGCCGGTGACCTCATCGCCATCCTGCAGTCAGGCGCCTATGGCGCAAGCGCCAGTCCCGGGGGTTTTCTAAGCCACGCGGTGGCGAAGGAAGTGCTGGTGGAAAATGGGGCGTTTGAGATGATCGGGCGATAATCACCTTCGCAGCAGCCTGTCCCCCACAGCGAATTTCGATTTCAGCAGGCACTCGTCATATTCCGCCTCAGCCATGGAATCGAAGACGATGCCGCCGCCGACATTAAAGACGGCGCGACCGTCATCGAACAGTGTCATCGTACGGATCGCCACCGAAAAGCGCATTTCACCACCAGGAGACATGAGGCCGATGGCGCCGCAATAGGCGTCGCGGGGCGTCGCTTCCAGTTCGCGCAGGATTTTCATCGCCCACATTTTGGGCGCACCGGTAACCGATCCGCAGGGAAACAACGCGGCGAAAATGTCTTCCACCGTCATGCCAGGCAACAGTTTCGCCCTGATATGGCTGACCATTTGGTGCACGGTCGGATAGGTCTCGATATCGAAAAGCCGTGGCACATCGAGGCTGCCGACCTCGGTGATGCGGGAAATATCGTTGCGCAAGAGATCAACGATCATCCGGTTTTCGGCGAGCGTCTTTTCATCCGCAAGCATTGCCGCGATGATCGCCCTGTCCTCCTCCGCATCCGCTCCGCGCGGCGTCGTTCCCTTCATGGGATGGGTTTCGATGAACCCCTGCCCGTCAACCGAGAAAAACAGTTCCGGCGAGCGTGACAGGATGACCGGGCCGCCGAGATCGACCAGCGCGCCATATTTCACCGGCTGGCGCTCGATCAATGACCAGAATGCCGTCAGCGGATCGCCGCTCCAGCGGGCATGGACGGGCATGGTCAGATTGCCCTGATAACAGTCGCCCCGGCGTATATGGTCGTGCAACAGCTCGAAACGCCGCCGATACTCGGGAAGTGTCCATGCCGGGACCGGATCGGCAAGAAACGCATCCGCGCCCGGAACTTCGTCAGGCCGCGCAAAGCGGCCGTCATCCGGCTGCGGGCCGGAAAAGACCCCGAAATTCAGGAACGGCACGTTGCGCGGTTCGGCCGCAAAAGACGCAAGCTTCGGCTCGAACAGGAAACCGGCCTCATAAGACACATAGCCAGCTAAATATTTTCCGGCACGGCGCAGTTCTTCCATGCGCTTCAGCGCTGCAAAAAATGCTTCTCGCTCATCCGCGACGATGATTTCCTCCGGCTCGGTGAAGGCTGTCACCGTGCCGGTCGTGTCATCCCGGAAAAGAACGTAAGGCGCATGTGCCAAGGAAAAATCCGTAGAAGGTAAATCTAGAGACTGCAGGGCGTCGGAAAGATACCTTTCCATCGTCATCCCCGGTTCCCGGATCAAATCCGAGGATGTCCCGAGGATCTACCACCCGTTGACTTTAGCGATCTCGGCAGATCCTCGGCACGAGACCGAGGATGACGTCGAGTGTGGAGACAGGTCACGAGGCATTAAACCGGGTCTATATCGCCCCGCGCCCACATTTCGATGGTTTCCGCATAAAAATCGGCGAAACGGCCTTCCTCGATCGACTTGCGGATACCCTGCATCAGCTCCTGATAATAGGCAAGATTGTGCCAGGAGAGCAGCATGCCGCCCAGCGCCTCGTTGGCGCGTACGAGGTGATGCAGATAGGCGCGGGAGTAATCGCGCGAGGCCGGGCAGTTGGATTGTTCGTCGAGCGGGCGCATATCCTCGGCATGGCGGGCATTGCGGATATTGACCTTGCCGCGGCGCGTAAAGGCCAGACCATGACGGCCGGAACGCGTCGGCATCACGCAGTCGAACATGTCGATGCCGCGCGCCACCGATTTCAGGATATCGTCAGGCGTGCCGACGCCCATCAGGTAACGCGGCTTTTCAGTGGGCAGCACCGGCAGGGTAATATCGAGCATGCCCAGCATCACATCCTGCGGCTCACCGACGGCAAGGCCGCCGACCGCATAACCCTTGAGATCGAGCTGCTTCAGCCCTTCGGCGGAGCGAATGCGCAGGTCCGGCTGGTCGCCGCCCTGCACGATGCCGAACATGGCCTTGCCGGGCTGGTCGCCAAAGGCGACGCGGCAGCGCTCGGCCCAGCGCAGCGACATTTCCATGGCGCGCTCGATTTCCTTGCGCTCGGCCGGCAGCGCGATGCACTCATCGAGCTGCATCTGGATGTCCGAATCGAGCAGTCCCTGAATTTCGATCGAGCGTTCCGGTGACATGTGGTGCAGCGAACCGTCCACATGGCTCTTGAAGGTCACGCCCTTCTCGTCCAGCTTGCGCAGGCCGGACAGCGACATGACCTGAAAACCGCCGCTATCGGTGAGGATCGGGTGCGGCCAGCGGATCAGCTCATGCAGGCCGCCAAGGCGGGCGACACGCTCGGGACCGGGCCGCAGCATCAGGTGATAGGTATTGCCGAGAATGATATCCGCCCCCAGCTCGCGCACCTGATCAAGATACATGGCCTTGACGGTGCCGACAGTGCCGACGGGCATGAAGGCGGGCGTGCGGATGACGCCACGCGGCATGGCGACTTCGCCGAGGCGCGCGCCGCCGCTCGTGGCTTTCAGGGTGAAGGTGAATTTGTCGTGCATCAGTTTTTCCGGAACAACAGGCTGGAATCGCCATAGGAATAGAAACGGTATCCTGTTTCGATGGCGTGCTTATATGCGTCACGCATCGTTTCGAGACCGCAGAAAGCCGAAACCAGCATGAACAGCGTCGATTTCGGCAGGTGGAAATTGGTCATCAGCATATCGACCGCGCGGAAACGATAACCGGGCGTGATGAAAATGCCGGTCGCATCGGACCACGGGTGAATGGTGCCATCCTCCGCAGCCGCGCTTTCGATCAGACGCAGCGACGTCGTGCCCACGCAGACAATGCGCCCGCCCCGCGCCTTGACCGCATTCAGCCTGTCGGCTGTTTCCTGCGAGACGTGGCCGATTTCGAAATGCATCTTGTGATCGTCGGTATCATCAGCCTTGACCGGCAGGAAGGTGCCTGCCCCAACGTGAAGCGTCACGAAATGCCGCTCGATACCGGCCTTGTCGAGCGCCGCAAATAGCTCAGGCGTGAAATGCAGCCCGGCGGTAGGCGCGGCAACGGCGCCCTTCTCGCGGGCGTAGATGGTCTGGTAGTCGGTCTGGTCCTGCGCATCTTCCGGCCGTTTGGCCGCGATATAGGGCGGCAGGGGAATATGGCCGACGGAGGCAATGGCCTCGTCCAGAACGGGACCGGAGACGTCGAACAGCAGCGTGATCTCGCCCTCCTCGCCCTTCTGCTCGACGGTGGCCTCCAGATGGGCAAGGCCGCAGGCGTTGTCGCGCTCATAACCGAAACGGATGCGGTCGCCCTGCTTGATGCGCTTGCCGGGACGCGCGAAAGCCTTCCAGCGGGACTGATCGGCGCGCATGTGCAGCGTGGCCGAAACGGCCGTCTCCGGCGCGCCTTCGCGCAGCCTGACACCTTCAAGCTGGGCGGGAATGACGCGGGTGTCGTTGAAAACAAGCGCATCGCCGGCCCGCAGGAAGGACGGCAGGTCGAAAACGCGATGGTCTTCCATGCGATTTTCATTCGGATCGACCACCAGCAGGCGCGCGCTGTCGCGCGGGTTGGCCGGGCGAAGGGCGATATTCTCCTCGGGCAGATCGAAATCGAATAGGTCTACGCGCATTGCAAGGACTCTCGAAAATATCAAAACCCGCCCTCAATGGGGCGGGCGGTGGGTAGTGACAAGGCTTCGTCTAACGAGCCGTCATCCTCGGGCTTGTCCCGAGGATCTAACCACGTTGCCGTTGGGGATCGTTAGATCCTCGGGACATCCTCGGGCTTGATCCGGGGACCGAGGATGACGGCTGAGAGGTTGGCACCCTTGTCATCAATGTGAAACGCCCCGCAGTTTCCTGCGGAGCGGGTAAACACGTTATACGCTACCTTCAAGCCGCCGAAGCGAGCTTCATGGACACGATCGAATCGGGGTCCTTCACCGGCTCGCCGCGCTTGACCTTGTCGATGGCTTCCATGCCCTCGATGACCTGGCCCCAGACGGTGTACTGCTTGTTCAGCCACGGGGAATCCGTGAAGCAGATGAAGAACTGCGAGTTGGCGGAGTTCGGGTTCTGCGAACGGGCCATGGAGCAGGTGCCGCGCACATGCGGGATGGCGGAGAATTCGGCCTTCAGGTCCGGCTTTTCGGAGCCGCCCATGCCTGCCCGGGCCGGGTTGAAGCTTTCGGAACCCTTCTTGCCGAATTTCACGTCGCCGGTCTGGGCCATGAAGTCTTCGATGACGCGGTGGAAAACGACGCCGTCATAAGCACCTTCCGCTACCAGTTCCTTGATGCGGGCAACGTGGCCCGGAGCAACTTCCGGAAGCAGCTGGATCACGACCTTGCCGGTCGTCGTTTCCATGATGATGGTGTTTTCCGGATCCTTGATCTCGGCCATGATTATTCTCCTCTGTTCGGGCTCTATTGCCCTGTCTTACTTCTTGCCGACAGTGACCTTGATCATCCGGTCGGGGTTGGAAACTTCGCCGTTGCTGCCCTGCCCGCGCTTGATCTTGTCCACGGCTTCCATGCCGGACACGACCTTGCCGACCACGGTGTACTGGCCGTTCAGGAACGGACCGTCGGCAAACATGATGAAGAACTGCGAATTGGCGGAATTCGGATCCTGCGAACGGGCCATGCCGACCACGCCGCGCGTGAACGGAACCTTGGAGAATTCGGCCGGAATATCCGGCAGGTCGGAGCCGCCAGTACCTGCGCGCTGGGCGCTGAAGCCCTTCTCCATGTTGCCATACTGCACGTCGCCGGTCTGCGCCATGAAGCCGTCGATGACGCGGTGGAAGGCGACGTTGTCGTAAGCGCCCTTCTTGGCCAGCGCCTCGATCTGCGCCACGTGCTTCGGCGCGACGTCGGGCATAAGCTCGATGACAACGGGGCCGTCCTTCAGCTGGACGGTGAGAAGTTCGGCGGCAGACGCAAAGGTGCTGGCGGCAAGCGCGCCCGCAAACATGGCGCCGGCAAAGGCAAATCGAACGAGTTTCATCGGATTGGCTCCAAATGTAAGGCTGAAGTCAGCGCTTCAGCTTGGCGTTAAGGGCTTCAAGCACGGCTTTCGGCACAAAGGCTTCGACATTGCCGCCCATGGCGGCGATCTGCCGAACCAATGTGGCTGTAATGGGTCGCGACGAGGTGCCGGCGGGCAGGAACACGGTCTGGATATCGGGCGCCATCTGGCGGTTCATGCCCGCCATCTGCATTTCATAATCGAGATCGGTGCCGTCACGCAGGCCGCGCAGCAGAAGTCGCGCGCCATGCTGACGGGCCGCATCGACAACCAGATTATCGAAAGAGACGACGTCCATGCGCGCCGCTTCGCCCGGCAGCTGCTCCGCAAGCGCCTGCTTTATCAGCGCTGCCCGTTCCTCGAAACTGAACATCGGCGCTTTTCCGGGATGAATGCCAACCGCGACAATGACCTTTGACGCCACGTTCAGCGCCTGGAGAAGCACATCCAGATGTCCGTTGGTCATCGGATCGAAGGATCCTGGATAAAAGGCAATCGTCAT
Protein sequences of DBSCAN-SWA_3 >NC_022535|1591458:1599715|1591458_1592991_+|WP_022556252.1|DBSCAN-SWA MTPPYLLHHLLGARAASDGQAIVYKDTSLSYRQFAEAAERCAAALQQAGAQRGDRVVIFLPRGTEECWAIFGVSMAGCVFVPVNALLKAQQIRHIIVDCGAELVISNATMRDELSAALEGLAGVRVLLANDIAEGSKTSVKSPAAIGEDLAAILYTSGSTGSPKGVMLSHRNLLAGARIVRTYLEITGKDRILSLLPFSFDYGLNQLLTAVEQGATTIISTFRLGDDIVRDLRDHAVTGLAGVPTVWAILTRAAPSLARTPLPHLRYITNSGGRVPQETVKALREKLPDTKIYLMYGLTEAFRSTFLPPDEIDRRPTSIGKAIPECEIFIVTAEGQRAKPGEPGILVHRGPTVSLGYWNRPEDTAKVLRPHPFIPAALGGETVCYSGDLAVEDEDGFFSFVARNDAMIKSSGYRISPTEVEESLMSTGLFQQVAVIGLPDPFAGEKVHAVATAANENIDVSAALKKAAEMLAPFMIPRAIELVDRLPVTANGKVDYRALVRERTDDGAHG >NC_022535|1591458:1599715|1594252_1595410_-|WP_048902825.1|DBSCAN-SWA MAHAPYVLFRDDTTGTVTAFTEPEEIIVADEREAFFAALKRMEELRRAGKYLAGYVSYEAGFLFEPKLASFAAEPRNVPFLNFGVFSGPQPDDGRFARPDEVPGADAFLADPVPAWTLPEYRRRFELLHDHIRRGDCYQGNLTMPVHARWSGDPLTAFWSLIERQPVKYGALVDLGGPVILSRSPELFFSVDGQGFIETHPMKGTTPRGADAEEDRAIIAAMLADEKTLAENRMIVDLLRNDISRITEVGSLDVPRLFDIETYPTVHQMVSHIRAKLLPGMTVEDIFAALFPCGSVTGAPKMWAMKILRELEATPRDAYCGAIGLMSPGGEMRFSVAIRTMTLFDDGRAVFNVGGGIVFDSMAEAEYDECLLKSKFAVGDRLLRR >NC_022535|1591458:1599715|1598622_1599192_-|WP_004442095.1|DBSCAN-SWA MKLVRFAFAGAMFAGALAASTFASAAELLTVQLKDGPVVIELMPDVAPKHVAQIEALAKKGAYDNVAFHRVIDGFMAQTGDVQYGNMEKGFSAQRAGTGGSDLPDIPAEFSKVPFTRGVVGMARSQDPNSANSQFFIMFADGPFLNGQYTVVGKVVSGMEAVDKIKRGQGSNGEVSNPDRMIKVTVGKK >NC_022535|1591458:1599715|1599220_1599715_-|WP_006698870.1|DBSCAN-SWA MTIAFYPGSFDPMTNGHLDVLLQALNVASKVIVAVGIHPGKAPMFSFEERAALIKQALAEQLPGEAARMDVVSFDNLVVDAARQHGARLLLRGLRDGTDLDYEMQMAGMNRQMAPDIQTVFLPAGTSSRPITATLVRQIAAMGGNVEAFVPKAVLEALNAKLKR >NC_022535|1591458:1599715|1596731_1597814_-|WP_022556255.1|tRNA|DBSCAN-SWA MRVDLFDFDLPEENIALRPANPRDSARLLVVDPNENRMEDHRVFDLPSFLRAGDALVFNDTRVIPAQLEGVRLREGAPETAVSATLHMRADQSRWKAFARPGKRIKQGDRIRFGYERDNACGLAHLEATVEQKGEEGEITLLFDVSGPVLDEAIASVGHIPLPPYIAAKRPEDAQDQTDYQTIYAREKGAVAAPTAGLHFTPELFAALDKAGIERHFVTLHVGAGTFLPVKADDTDDHKMHFEIGHVSQETADRLNAVKARGGRIVCVGTTSLRLIESAAAEDGTIHPWSDATGIFITPGYRFRAVDMLMTNFHLPKSTLFMLVSAFCGLETMRDAYKHAIETGYRFYSYGDSSLLFRKN >NC_022535|1591458:1599715|1592974_1594252_+|WP_022556253.1|DBSCAN-SWA MAPTDKPQSHGPALAAASFDISENDLVIGGLAVRDIVAETGTPCFLYDAGAMRRAYRDLQATLSGFADIYYSVKANPLPAIIALFRQEGAGAEIASAGEYRAAIKAGVAPENIIFAGPGKSRAELHEVITGGIGEIHIESAEEIARIEAIGKPVKASVRINPVPDAQAGAMRMGGKATAFGFDEEELENILPLFRRAGVIDLVGIHIYGGTQILDADMLVSQWRHAISIAARMAEMLGKPLETIDLGGGLGIPYFAGEAPLDLAKVKAAVPDLKALLKAHPLIADAHVIVEPGRFLAGPGGIYTAEVNSVKTSRGTTFVVTDGGMHHHLAASGNLGQIVKRNYPIVAPAMMQADHAETATIVGPLCTPLDTLARNAALPRLQAGDLIAILQSGAYGASASPGGFLSHAVAKEVLVENGAFEMIGR >NC_022535|1591458:1599715|1598076_1598586_-|WP_022556256.1|DBSCAN-SWA MAEIKDPENTIIMETTTGKVVIQLLPEVAPGHVARIKELVAEGAYDGVVFHRVIEDFMAQTGDVKFGKKGSESFNPARAGMGGSEKPDLKAEFSAIPHVRGTCSMARSQNPNSANSQFFICFTDSPWLNKQYTVWGQVIEGMEAIDKVKRGEPVKDPDSIVSMKLASAA >NC_022535|1591458:1599715|1595601_1596732_-|WP_004442089.1|tRNA|DBSCAN-SWA MHDKFTFTLKATSGGARLGEVAMPRGVIRTPAFMPVGTVGTVKAMYLDQVRELGADIILGNTYHLMLRPGPERVARLGGLHELIRWPHPILTDSGGFQVMSLSGLRKLDEKGVTFKSHVDGSLHHMSPERSIEIQGLLDSDIQMQLDECIALPAERKEIERAMEMSLRWAERCRVAFGDQPGKAMFGIVQGGDQPDLRIRSAEGLKQLDLKGYAVGGLAVGEPQDVMLGMLDITLPVLPTEKPRYLMGVGTPDDILKSVARGIDMFDCVMPTRSGRHGLAFTRRGKVNIRNARHAEDMRPLDEQSNCPASRDYSRAYLHHLVRANEALGGMLLSWHNLAYYQELMQGIRKSIEEGRFADFYAETIEMWARGDIDPV |
8 | uncultured_Mediterranean_phage(66.67%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
1616093 : 1624495
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NC_022535|1616093:1624495|DBSCAN-SWA CCTATTCAAGGAAAGTGGATGGGTTCACCGGCGTCGCATCCTTGCGAACTTCGAAGTGAACCTGCGGACGTTTGGCGCTGCCGGTCATGCCCGAGGTGGCGATGGTCTGACCGCGCTGAACCTTCTGGCCACGCTGCACGTCGAGATTGGCAGCGTTGCCATATACCGTGACCTTGCCGTCGTCATGGCGGACAAGAACCGTGTTGCCGAGCTGCTTCAGGCCATTGCCGGCATAGATGACGACACCGTTTTCAGCGGCCTTGATCGGCGTGCCCTCGGGAACCGAAATGTTGATGCCGTCGTTGCGGCTGCCTTCGACATTGTCACCAAAATTATTGATGACCGCGCCGCGCACCGGCCAGCGATATTTGCCGATGCCGGTGGATTCCGGTGCGACGGACGCCATGTCGGCCTTTTTCTCGATATCGCTGACACTGGCGGTGGCCGGAGTTGCGGGCGCAGCCGGAGCGGCGGCTGCTGCCGGCGCCTTATAAGGCTCCGGCTTGACGGATGCGGTTTCGACGGGCTTTGCGGCTTCCTTGGCGGGAACGGAAGCGGTCTTGATGGCATCCGTCGAAGCACCCGGCATGTTGAGCGTCTGGCCGACGCGGATGCTTTCGTTGCCGAGACCGTTTGCCGCCTTGAGAGCAGCCACCGACACGCCGTTTGCACGGGCGATCTTGGCGAGGCTGTCGCCCGGCTGAACCTTGTAGCCACCAGAAGGCGGCAGCGGCTTGCCGCCCGGAGGCGTCAGTTTGCCAGCTTCAGCAGAAACCTTGTCACGGGCGGCGGCCTGCGACGGCAGCACGGCCACGTTTCCATCCGGCGCACGCAGCGGTGTCGGCTGGTCGCCATTGCGGTTGAGCGCAATATCTCCGGCGGCATCCTTGGCGGCGTTGCGGACCTGGCCGAACTTCGGAATGAGGATCGACTGGCCGGCCTGTGCGGCGGAAGCGGTCTTCAGGCCATTGACCCGCAGCAGTTCCTTTTCCGGAACGCCGTAGCGGTTGGAGAGGGTCGCGATGCTTTCGCCGGGACGCAGGGTCACGGAGGCGGCGCCATCGGTGTGCCAGCCGTTTTTGTCGGAACGAACGGTCGCGGTGGTCAGGTTGTCAGCGGCGGGTGCGGCAGCCTGCCGTGCGGGCGCTGCCAGAGACGGCGCGGCCTTGGGCTGGGCGGACGGGAAAGGCTGGGCCATGGCTTCGTTGCGGGTCGACGGATCGCGGCTTGCCACGGCTGTCGGCGCGGACAGCTCGGAACGCTGCACCGGCGAGGCCGAGGCGCGGGCCGTGGACGGCTGAACGCCACCGTAACCGCCCGATTGCTGCGGATAGGAATTGCCGGCCATGTTCTGGTTGGCATAACCTCCCTGTGGCGCGGCAGAGGCGTAACCGCCATTCATGTCGCCCTGCGGAACCGGGGCCTGCCCGTAAGCACCGCCGCCCTGACGGGCAGGAATTGACGCCGTGGTCATCTGATCCGGTCCACTGGAAAAGAGGCCGCTGAACCGCGTTGCATCCGAGCTACAGCCCGTTGCGACGCTTGCCAGCAATGCCGCTGCGAACATTTTTGCGACTGATTTTCCGATTTTAGACGAATGTCTCATACGCATGACTCGATCTACACTGACGCCAACCCGCCGCATCGAGATCGATCCGGCACAAATTTCAATCTTGCAAGGCCCATTTCCCGAGGTGGGGAAAACCATGCGTCGATGTTTATGATTAAAGCGCGTTAGTGTTACCACGCCGTTAAAACGTTAAGCCGTTTGGCGCTTTTTTGACACTTCGATAACCATGACAAAGAGAGGATGCGCCATGATGAAACAATAAAATCTCTTAAAATCAAATGCTTGCGAAAATTCCGAACGTTAGGAGACCAAAATACAATGTCTCCAAAATTTTCATTTTGTCACAAATGACGCGCAATATGGGTGCTGAGCGGCAGGTAGGGCGCCTCGAACAGTTCTTCCTTCTCGAAGCGGCTGCCGGTCTTGGAAAAGCGCGTCATCACGCAGCGATCACCTTCAACGATGATCGGCGCGATCATCATGCCGCCGGAAACGATCTGCTCGGCAAAGAAACGGGGCATGGTGGTGAAGGCCGCCGTGGAAACGATTCGGTCAAAAGTGCCCTCACCCACCAGGCCGTTGCTGCCATCCGCCTGGCGGATGACGACGTTGCGGATTGCAAGATCATCGAGACAGTTCTGCGCCTGCTGCACCAGCGTCTTGTACCGCTCCAGCGAAAAGACGCGCTCTACCCGGCGGGCAATGATCGCCGTCATGAAACCGCTGCCGGTACCGATTTCCAGCACGCGCTGGCCGGGTTTCAGCTGCAGGCGGGCAAGGATTTTCACCGCCATGTCGGCCCCTTCCATGAAGGAACCGCAGTCGATCGGGATGGTGCGGCTGGAATAGGCGTCGGCGGCAAATTGCGGCGGCACGAATTTGGATCGTGGCGTCTGCTCAACCGCCGTCAGCAGGTCGAGATCGGAAATCCCCTCGCCCCGCAAGCGCAGAACGAGAGCTGCAAAACCTTCCTTTTCGACCATGGCAGATTTCAATCGGCGACTCCGAATCCAAGCGCTTGCGCAACCCGATCCTTCACCGTGTAATCCGTCAGGTCAAGCTTCAGCGGCGTGACGGAAATCTTGCCATGCTTTAAGGCGTGGATATCGGTGCCTTCGCGGAAGGTGCCCATGCGTTCGCCGAAACGCAGCCAGTAATAGGGAAAACCGCGGCCATCCTGGCGTTCTTCCACCGTCAGGCCGAAATCGAGCTTGCCCTGCCCCGTGACGGACACGCCCTGCACATCCTTTGGCGCGCAATTGGGAAAATTGAGATTGAGGAAGGTGCCGTCAGGCAGATCGACATCGATGAGCTTGCGAATGAGATCGGGCGCATAGGTCTCGGCCACCTCCCACGGCACGACGCGGCCTTCGGCATGGCTGAAGGCCTGGCTGAGCGCAAAAGACCGTACGCCCTGCAGCGTGCCTTCAATGGCCCCGGCAATCGTGCCGGAATAGGTCACGTCATCGGCCATGTTGGCGCCGGCATTGACACCTGACAGTACGAGATCTGGCTTTTCCGGCAGAACCTCGCGAATGCCCATGATAACGCAATCAGTCGGCGTGCCGCGCAGAGCGAAATGCTTGTCGGAAACCTTGCGAAGACGCAACGGCTCTGACAGCGTCAGGGAGTGAGCAAGCCCGCTCTGGTCTGTCTCCGGCGCAACGATCCAGACGTCGTCGGAGAGCGTGCGAGCGATACGCTCCAGCACTGCCAGCCCTTCGGCGTGGATACCGTCGTCATTTGTCAGCAAAATCCGCATGTTTTCCTCCCGCTCTTCATTTTGCTCAGGCAGAACACTCCCCGAACCGTCATCCCGGCCTTGAGCCGGGATCCAGTCGACGCGCGACTGCGCGACAGGAAGAGTCCTTTTCAGCCCAACGACTTGGGCCGGCTGGACGCCGGATCAAGTCCGGCATGACGGCATCCTTGAAATCAAGCCGCCTTTTCGATCCGCGTCAGACCGCCCATATAGGGCAGCAGCACATCAGGAATTGTGACAGAGCCATCGTCGTTCAGGTAATTTTCGAGAACGGCGATCAGGCAGCGGCCGACAGCCGTGCCGGAACCGTTCAGCGTATGCACGAACTTCGTCGCCTTGTCGTCCTTGCCGCGATAGCGCGCATTCATGCGGCGCGCCTGGAAATCGCCGCAGACCGAGCAGGACGAGATTTCGCGATAGGTATTCTGACCGGGCAGCCACACTTCCAGATCGTAGGTCTTGCGCGCGCCAAAGCCCATGTCACCGGTGCAGAGCGTCATGGTGCGGAAATGCAGGCCCAGCCGCTTCAGCACCTCCTCGGCGCATGCCGTCATGCGCTCATGCTCGGCGACAGCGCTTTCCGCATCCGTGATGGAGACGAGCTCGCACTTCCAGAACTGATGCTGGCGCAACATGCCGCGCGTGTCACGGCCTGCCGAACCCGCTTCCGAGCGGAAGGACGGTGTGAGTGCGGTGAAACGCAGGGGCAGCTTTTCCTGCTCGAGGATTTCGCCGGCCACGAGGTTGGTCAGGGTCACCTCCGCCGTCGGGATCAGCCAGCGGCCATCGGTGGTCTTGAACAGGTCCTCGGAGAATTTCGGCAATTGCCCGGTGCCGAACATCGCCTCGTCGCGCACCATCAGCGGCGAGGAAACTTCGGTATAGCCATGTTCCGAAGTGTGCAGGTCGATCATGAACTGGCCGAGCGCCCGCTCCAGCCGCGCGAGCTGGCTGGTGAGAACGGTGAAACGCGAACCCGAAAGTTTGGCGGCCCGCTCGAAATCCATATAACCAAGCGCTTCGCCGATTTCGAAATGTTCCCTGGCCTCATGGTTCCAGCCGGGCTTCTGGCCGACGACGCGGGTCACCACATTGTCATGCTCGTCCTTGCCATCGGGCACATCGTCGAAGGGCATGTTCGGCAGGCGCGACAGGGCGTCGTTCAACTCGGCGGTGACCTGACGGTCCTCCTCCTCGGCGCGCGGCATCTTGTCCTTGAGGTCGGCGACCTCGGCTTTAAGTTTTTCCGCAAGCTCCATGTTCTTCTGCGCCATGGCGGCGCCGATTTCCTTGGAGGCGGCATTGCGGCGGGACTGCATGTCCTGCAGGGACTGGATGACGGAACGGCGCTTTTCATCGAGCGCGATCAGACCTGCGGCCGCAGGCTCCGCGCCACGGCGTGCGAGCGCCGCGTCAAAAGCTTCGGGGTTTTCACGTATCCATTTAATGTCGTGCATCGTCGTTCCAGACCGTTGTTGCATCACCGATGTTTTACAGCAAAACGCCGGGCAGAAGCCCGGCGCGAGGAGATGTCTTGAGCGGCGAGACCTTCGAAAACTCAGGTCTCCTCCAGCTCCGTGCTTGCGGATTCCGCAGCGCGCTTCCTCTCCACGAGTCGAGCCATGTAGATGGAAATCTCGTAGAGAATGATCGCAGGCAGTGCAAGACCGATCTGGGACATCGGATCCGGCGGGGTCAGCACGGCGGCGACCACAAAGGCCATGACAATCGCGAACTTGCGCTTCTCACGCAGCCAGTCGCTGGTCAGAAGGCCCACGCGCGCCAGAAGCGTCGTGACCACGGGCAGCTGGAACACCAGACCGAAGGACAGCACCAGCGTCATGATGAGGCTCAGATATTCCGACACCTTCGGCATCAGCGAAATCGCAACCTCGCCATCTTCTGGCAATTGCTGCATGGCGAGGAAGAACCACATGACCATGGGCGTGAAGAAGAAATAGACGAGCGCCGCTCCGATGAGGAACAGGATCGGCGATGCGACGAGGAACGGCAGGAAAGCGGCGCGCTCGTTCTTGTAGAGGCCGGGCGCCACGAATTTATAAAGCTGCGAGGCGATCACCGGAAAGGAGATGACCATCGCGCCGAACATGGCGACCTTGATCTGCGTGAAGAAGAATTCCTGCGGCGCGGTATAGATCAGCGACGACTTCGTCACATCAAGGCCAGCCCAGAGAACCGCCCACTTATAGGGAATGACAAGCAGGTTGAAGAGATGCTTGGCAACGGCAAAACAGGCGATGAAGGCAACGAAGAACGCACCCAGCGACCAAATCAGCCGTGTGCGCAGTTCCATCAGATGTTCGATAAGCGGCTGCGGCTTGTCCTCGATGTCCCCGCTCATGCTTCATCCTTTTTGGGCTTTGCAGCCTTGGTTCTCGCCGGCTTCACCGGCTTGGCATCGGCGGCCACGACAGGCTTTTCAGCCGCGACCTTCCTGGCGGCCGCTTTTCTCGCGACGGCTTTTACGGTCGGCTTTGCTGCTGAAGTCTCCGTGGAATTGACAGCAACGGGTTCGGACGCCGCGACCGCTTTGGCGCGGCTGGCGCGTTTCGGCTTGGCGGCAACCGCTTCCGCCTCGACGGTCGCAATCGATTTAGCCCGGGCACGCTTCGGCTTTTCCGCCGGTGCGACCGGTGCCGCAACGGGGGCGCTGGCAGTCACGGGCGGGGTATCCGGCAGCTTCATCTCAGGCTCGGGAATGCTGACGAGCGGTGCAACCGGCTCACTGGTCGCCGGTGCTGCGGTCGACGTAGCAGGCGAGGATGACAGGCCCTCAGGCGGCGTGGTCGCCTTCTGAAGATCGGACTTGATCTCGTTGCCGAGCTGGCGAAGCGGGTTCATCGCATCACGCAGCGAGTTGGTCGGGTTGAGATTGCGGACATCCGATATGGTCTGGCGCACATCGTCCATGTCGGCCTCTTTGAGAGCCTCATCGAACTGGGTACGGAAATCCCCCGCCATCTTGCGAAGGCCAGCCATGGTCTTGCCAAAGGCGCGGATCATGGGCGGCAAGTCTTTCGGCCCGACAACCACGATCAGTACGACCGCAATCACCAGCAGCTCGCTCCAGCCGATATCAAACATCAATATGCTCCCGAGACCCTAACGCGCGTCAGTCTCAAACCAAACGGCAGGGCCGCTTACTTGATCTCGTCAGCCTTGTGGTCGACGGTCTTCGTGTTGGCATCGGCCGGCGGCGGCGTCTGGTCTTCGTCAGCCATGCCCTTCTTGAAGCTCTTGATGCCCTTGGCGACATCGCCCATCAATTCCGGGATCTTGCCGCGTCCGAAAAGGACGAGAACAATCACCAGCACGATGAGCCAATGCCACACACTAAAAGAACCCATAACTGAAACTCCTGAATTTCGCTTTCAGACGATGTAAGACGTTTGAAAGGCTTTTTCAAACAACAAATTGCCGCTTCAACAGGGTGTGAGCATAAAGTTTTTGCGTCGTGTCGCCCGCGCGATAACGATCTCAATCGTCGCCGTCGCCACCACCAGGCGCAAGCAGGCCCAGCTCTTCCAGATCAAGTTGCGTGATCGGGTCTTCATCCTCACGCAATTCGTCGCTCATCATCGGCAGGGGCACGCCGAAATTGGAAGGTATGCGGCCGGAAAGCAGCCCTGCTCCCTTTAGCTCCTCGAGCCCCGGCAGGTCGCGCAACTCCTCGAGGCCGAAATGGTCGAGAAACTCAACCGTGGTACCGATCGTCACCGGCCTGCCGGGTGTGCGCCTGCGGCCACGGAACCGGACCCAGCCCGCCTCCATCAATACGTCAAGCGTGCCGCGTGAGGTCTGAACACCGCGGATTTCCTCGATTTCAGCACGTGTCACCGGCTGATGATACGCGATAATCGCCAGGACTTCCAGCGCCGCGCGAGAGAGCTTCTTCGGTTCCTTTTCCTCGGCGCGGATGACGAAGGAAAGATCGCCGGCAGTGCGGAAAGCCCATTGCCCGCCCACCTGCACGAGATTGACGCCCCGGCCCGCATAAGCGGCTTTCAGACGCTGAAGAACGGCGTCGACATCCATGCCGCGCGGCAGGCGCTCCGCAATGAAACCCGGAGAGACCGGCTCGGCCGAGGCGAAAACCAAAGCCTCGGCGATCCGCTCGGCCTCTTTCAACTGGCGCTCGGAAAATACCGTGGGTTCTAGTCCCACCCCATCCGTCACGATCGTCGCCTCGGCTGTCTCATCATTATCAGTCATTGTCAGTCCTCTCGGCGACCGCCGCACGATCATCCCTCGTGCCACGGCGCATATAGATGGGCTGGAAAGCGCCCTCCTGCCGGATTTGCAGCGTGCCCTCGCGCACAAGTTCCAGTGACGCGGCAAAGGCGCTGGCGATCGCCGTCACCCGCATCGCGGGATCGGGGACATATTGCAGCAGATATTGATCGAGCGCGGTCCATTCTCCGACATCGCCGAGGAGACCGGTTAAAAGCTCCCGCGCCTCGACCAGCGACCAGACCTGCCGTTTTTCAATGGTGACCTGGGTGATCGCCTGTCTCTGCCGCAGATTGGCGTAGGCGCTGAGCAGGTCATAAAGGCTCGCCTCGTAGGCAGAGCGGTTGATGTGCGGAATATGCTCCGGCGCACCACGTGCAAACACATCGCGGCCGAGCTGGGCGCGATTGATGAGCCGTTCCGCCGCCTCGCGCATCGCCTCCAGCCGCTTCAGCCGGAAGGCAAGGGTTGCGGCCATTTCCTCACCCGAGGGGCCGTCATCCTTGGATTGCTGCGGAATGAGAAGCTTGGACTTCAGGAAGGCGAGCCACGCCGCCATGACCAGATAATCGGCCGCCAGCTCGATACGCACGCGCCGCGCGCTTTCCACGAATTGCAGATATTGCTCGGCAAGCGCCAGCACTGAAATGCGCGACAGGTCCACCTTCTGCGTGCGGGCAAGATGCAGCAGCAGATCGAGCGGTCCTTCGAAACCCGCGACATCGATGACCAGTCCGGCCTCGCCCGTCAGCCGCTCCGGCGTCACATCCTGCCAGAGCTTGTCCATCGGCGTCGAATTGCGAGATTTGTCTGCGGCCAT
Protein sequences of DBSCAN-SWA_4 >NC_022535|1616093:1624495|1618003_1618657_-|WP_022556269.1|DBSCAN-SWA MKSAMVEKEGFAALVLRLRGEGISDLDLLTAVEQTPRSKFVPPQFAADAYSSRTIPIDCGSFMEGADMAVKILARLQLKPGQRVLEIGTGSGFMTAIIARRVERVFSLERYKTLVQQAQNCLDDLAIRNVVIRQADGSNGLVGEGTFDRIVSTAAFTTMPRFFAEQIVSGGMMIAPIIVEGDRCVMTRFSKTGSRFEKEELFEAPYLPLSTHIARHL >NC_022535|1616093:1624495|1621782_1622529_-|WP_022556271.1|DBSCAN-SWA MFDIGWSELLVIAVVLIVVVGPKDLPPMIRAFGKTMAGLRKMAGDFRTQFDEALKEADMDDVRQTISDVRNLNPTNSLRDAMNPLRQLGNEIKSDLQKATTPPEGLSSSPATSTAAPATSEPVAPLVSIPEPEMKLPDTPPVTASAPVAAPVAPAEKPKRARAKSIATVEAEAVAAKPKRASRAKAVAASEPVAVNSTETSAAKPTVKAVARKAAARKVAAEKPVVAADAKPVKPARTKAAKPKKDEA >NC_022535|1616093:1624495|1618653_1619424_-|WP_004442133.1|DBSCAN-SWA MRILLTNDDGIHAEGLAVLERIARTLSDDVWIVAPETDQSGLAHSLTLSEPLRLRKVSDKHFALRGTPTDCVIMGIREVLPEKPDLVLSGVNAGANMADDVTYSGTIAGAIEGTLQGVRSFALSQAFSHAEGRVVPWEVAETYAPDLIRKLIDVDLPDGTFLNLNFPNCAPKDVQGVSVTGQGKLDFGLTVEERQDGRGFPYYWLRFGERMGTFREGTDIHALKHGKISVTPLKLDLTDYTVKDRVAQALGFGVAD >NC_022535|1616093:1624495|1616093_1617659_-|WP_037093448.1|DBSCAN-SWA MFAAALLASVATGCSSDATRFSGLFSSGPDQMTTASIPARQGGGAYGQAPVPQGDMNGGYASAAPQGGYANQNMAGNSYPQQSGGYGGVQPSTARASASPVQRSELSAPTAVASRDPSTRNEAMAQPFPSAQPKAAPSLAAPARQAAAPAADNLTTATVRSDKNGWHTDGAASVTLRPGESIATLSNRYGVPEKELLRVNGLKTASAAQAGQSILIPKFGQVRNAAKDAAGDIALNRNGDQPTPLRAPDGNVAVLPSQAAARDKVSAEAGKLTPPGGKPLPPSGGYKVQPGDSLAKIARANGVSVAALKAANGLGNESIRVGQTLNMPGASTDAIKTASVPAKEAAKPVETASVKPEPYKAPAAAAAPAAPATPATASVSDIEKKADMASVAPESTGIGKYRWPVRGAVINNFGDNVEGSRNDGINISVPEGTPIKAAENGVVIYAGNGLKQLGNTVLVRHDDGKVTVYGNAANLDVQRGQKVQRGQTIATSGMTGSAKRPQVHFEVRKDATPVNPSTFLE >NC_022535|1616093:1624495|1620982_1621786_-|WP_022556270.1|DBSCAN-SWA MSGDIEDKPQPLIEHLMELRTRLIWSLGAFFVAFIACFAVAKHLFNLLVIPYKWAVLWAGLDVTKSSLIYTAPQEFFFTQIKVAMFGAMVISFPVIASQLYKFVAPGLYKNERAAFLPFLVASPILFLIGAALVYFFFTPMVMWFFLAMQQLPEDGEVAISLMPKVSEYLSLIMTLVLSFGLVFQLPVVTTLLARVGLLTSDWLREKRKFAIVMAFVVAAVLTPPDPMSQIGLALPAIILYEISIYMARLVERKRAAESASTELEET >NC_022535|1616093:1624495|1622585_1622792_-|WP_003509722.1|DBSCAN-SWA MGSFSVWHWLIVLVIVLVLFGRGKIPELMGDVAKGIKSFKKGMADEDQTPPPADANTKTVDHKADEIK >NC_022535|1616093:1624495|1619597_1620881_-|WP_004442134.1|tRNA|DBSCAN-SWA MHDIKWIRENPEAFDAALARRGAEPAAAGLIALDEKRRSVIQSLQDMQSRRNAASKEIGAAMAQKNMELAEKLKAEVADLKDKMPRAEEEDRQVTAELNDALSRLPNMPFDDVPDGKDEHDNVVTRVVGQKPGWNHEAREHFEIGEALGYMDFERAAKLSGSRFTVLTSQLARLERALGQFMIDLHTSEHGYTEVSSPLMVRDEAMFGTGQLPKFSEDLFKTTDGRWLIPTAEVTLTNLVAGEILEQEKLPLRFTALTPSFRSEAGSAGRDTRGMLRQHQFWKCELVSITDAESAVAEHERMTACAEEVLKRLGLHFRTMTLCTGDMGFGARKTYDLEVWLPGQNTYREISSCSVCGDFQARRMNARYRGKDDKATKFVHTLNGSGTAVGRCLIAVLENYLNDDGSVTIPDVLLPYMGGLTRIEKAA >NC_022535|1616093:1624495|1622922_1623657_-|WP_048902690.1|DBSCAN-SWA MTDNDETAEATIVTDGVGLEPTVFSERQLKEAERIAEALVFASAEPVSPGFIAERLPRGMDVDAVLQRLKAAYAGRGVNLVQVGGQWAFRTAGDLSFVIRAEEKEPKKLSRAALEVLAIIAYHQPVTRAEIEEIRGVQTSRGTLDVLMEAGWVRFRGRRRTPGRPVTIGTTVEFLDHFGLEELRDLPGLEELKGAGLLSGRIPSNFGVPLPMMSDELREDEDPITQLDLEELGLLAPGGGDGDD >NC_022535|1616093:1624495|1623649_1624495_-|WP_022556273.1|DBSCAN-SWA MAADKSRNSTPMDKLWQDVTPERLTGEAGLVIDVAGFEGPLDLLLHLARTQKVDLSRISVLALAEQYLQFVESARRVRIELAADYLVMAAWLAFLKSKLLIPQQSKDDGPSGEEMAATLAFRLKRLEAMREAAERLINRAQLGRDVFARGAPEHIPHINRSAYEASLYDLLSAYANLRQRQAITQVTIEKRQVWSLVEARELLTGLLGDVGEWTALDQYLLQYVPDPAMRVTAIASAFAASLELVREGTLQIRQEGAFQPIYMRRGTRDDRAAVAERTDND |
9 | uncultured_Mediterranean_phage(75.0%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
1761573 : 1768497
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NC_022535|1761573:1768497|DBSCAN-SWA GTCACGCCGCGTCCTCCAAGTGCCAGTGCAGGAAAGCATCAAGCTGGCGATGCAAATCCACGATGTCACCGATGTTGTGGATCTCGACGTCGGCACCACCACACCCCGCCTCGCTCTCATGCGATCCTGCAATACCGCCACGACCGACAAGCTGCCAGACCACGCCACCGAGCTTGCGAACCTCGTCGGCTTCGTTCGGGAAGCGGCAGTCATCGACGACTACGCAGCCGCCAAACGCAAGCACGCCGTCTACGCGTGAGCGCCAGAGGTTTGTCCAGAAGTCCTCCCCCATGCACTTGCGGCCCCATTCAGTGCCAAGCGTTTGCATAGCGTAGCGAGGCGTGCGTCCGGACAGCATTGCAAGGTCGGTTTCTTTGTGCGCTCCTTCGATGTGGCCTTCACCAAGACCTATGGCCCGCAACATGTCTTTCAGCGGCCCTGCGAACTTCACCAGTTGGTATCCGTGCCTCTCAACCAGGTATTTGCTGGCCGTGCTCTTACCGCTGCCAGCAAGGCCGGTGAGCGCGATGACGGTTGGCAGACTGGTGCTAACCTTCCCAATCGCCGCGCCGAGCGCGCGCAGTTCTGCTGGCACATTATCGTTCGCCGGTACGGCTGTGAGGCAGCCGGTGTCTTTGGATGTGATTGCGTCTACCATAAATCCTCCGAATAGAGCTGGGGGCCATTGGTGGCCGATTTGATTTGGCCTGCCGCATGCGCGAGATAGAATGCCTCAACGCGGTTCACGAAACGACCTCTATCTGTGATGAATCCTTGCGCCTGAGGCGTCGCCACCGCGCCATCTATGCTCATTTGGGTGTCCATCGTTTGGATGATGGTGTGATGTCTGGCAGGCGGCGGGAGCGAAATCGTTGCGCCGAACTGGATTGCCGCGGCGGCTATACGCTCTAGCATTTGTCTGCCTCAGCGAGCGCTTCAAATATTACTTCTTCGTTCAAGTAATTCTCTCGCGCCAGACTTGCGAAATCCACTTCAGCAGGGCAGTGAACGATCGCCGCTGCAATGGCCTCCCTCGACCGCTGGAGGTATCGAAGTCGCTCTGCTCTTGGCATCCCCATCCACGACCTCTTATCAAAAGAGGAGTCCGATTTTGCCATCGCTGCAATGACCTTAGCTAACATACCGTCTCCTTAATCCTCTTCTTGCCTTCGACATTCGGCTTCACCACCGGCTTAAACCGGCGCAGCGCGAACGGTGGATCCTCATCGCCGAACTGCGGACACACGCCACGGTTCACGCCCGCCAGCCTGATGCCGGCGTAATCTCCACCGAGATACGTGCGGCACATGCCCACCCACGTCGCCGTGTAGATCTCCCCGGCACGAATGCCGGGGTATTGCTCAGGAAGGGTGGTGTCGTCGACGCAGACGACTTCGTCGCCGGGTTTGATCGCATTCAAGAGACGGCCTCCTCCCGCGTCATTTCACCAAGGTAGCTCACTGCGATTGGGCACGCGTCGGCGCCTCCCAAAACAGCATTCCGCGCTTCTCGAATCCGCCCCTCGTTCACGTATTTGTCGGGCCACCCCATCGTGACGCGCGCAAACGTTCCATTCGCTGCCGCGTAGCTGAACATGAAATAATGGAGCTTCTTCAAGCCGCCCTCCCCGCCATCAACGCATCAAACGTCGCCAGGAACTCAAGCTCCGCCTCCTCCGGCGACCAGTACCGCAGCTTCACGCCCAGCTCTTTCCCGGCGTACCGCGCATGCCACGGCATCTCCCGCAGGTTGACCAGCTCGTCGGCGATGATGCTGTTGTCGGCGTCGTGAACCTCGCGCGGCAACTCCTTCGGCAAACCAAACCGCGCCGCGATAGCCAGCCAGTTGTCTTGCTCGATCGCACGGTAGTTCGTCAGGTACGGCTTGAGTGGGCGCGGAATGTCCACGCAGTATGCTTCAGGGGCATCGTGCAGAAGGCCAGCCAAAGCCACCTCCGGCGCGTGCTTGGCTGCAAGGTGACGGGCAATCAGAACTGAATGTTCGGCGACGCTGTAGAACTTGATGCAGTGCCCTGCGTAGCGGCATTGCAGGCCGAGCGAATGCGCGATGTCTTCGATGTAGACTTCGTGCGGGCGCGGCGACATCGGCCAATATTTGCGACCGGTGAAGGTTTGCATGAAATCGCCGGCGCGGGGTGGTTCGGCTGGTGTGTTGCGGGTTAGGCCGATGTATTGTCCGTTGTCCTGGGACTGAGGACGGTTGTCGTTGGCGGCAACTGGGGTACACCTTGTGCGCTCTTCGCGCAGCGCCATTGCAGCCTTAAAGGTGGCGATCTTTTTATCTGCGATCTCTCTCGCGTCTATCACTGCAGCGTATGGCATCAAAACGCCTTCCCACCAGCCATAAGCCGGTTTTCGATTTTGTGGTCAGGGCGGTTGGCGTTGAATGCCATCTTCTCGACGGCGGCGCCGCCGAGGTCATAGCCAAGGAACGCAGCCAGATCGCCGATCCGAATCATTGCGTCAGCAAGCTCCACCTCAGCCATAGGTCGGTGCGGCAGCTTGTCGTCCATCAGCACTTTACCGGGCTTGGACTTGCGATAGCCCTCCATCGCTTCGCTGATCTCGCTGTGAATCAGGCAAAGCATCTCAGGCACGTTGCGGTCCAGCGCCCTGCCTGTGGCGAGGTCAGTGTACCAGCCTGCTCTACGGCTTGCTGCGTGGCAGTCGGCTGCAAACAGGTTGATCGCTGCGGCGTGGTGGAATTCCACCTCCGCAGCAATTTTTAAAGTCACGACGTTGTCACTCATTCTTGTCTCCTCAGTGGCGGTGGTTGACCGCCAGTTGGTAGCTGGCGGCTGGTTGGGGATTGCTGGGGAAGTCGCTGTATAAATGCTAAAAAAACCCGAATTTGCCGGTTGTTTCGGTGGTTATCCCCCATTGGCAATTGACCTTCGGCCCGTAGGGAGTCCTTATTGCCATTGTGGTGCCAGCGAATCTTTCACTGGAGCACCAAATGAAGACCTTCCTTAAGTTCAGGCACGGTAAAACCGAAGGTGAAGCGAACAACACTCTCGGAATGTTGCTGCTTACGATCATTGCCGTAACCTACAGGCTTGCGCCGTTCATCGCGCTGGGCGCTGGCGGATACTGGTATGTTCGCTAGGGCGTCACGCCACCGCCCTCGCCGGCTCATTGTCATTGGCCGCCGCATATCGCCCCGCGACCATCTCCGGACGCAACGTGGCGCGGGCAACTTCGCCGAAGCGGCGAGAATACGTGATAACTTTGGCAGACCGGCCCGACAGCCATCCGCCGCCAGCAGCGTATGCGTCAGGCGCGGCAAGCGTTTCGTGGCGCTCGACATACATCAGCGCCGACTTGCGGCCCTCATCGCTGTGCAGGTGCCCGACATGCGCGAAGGCGTACTTCGATGCGCCGAACATTTCGCGGAACATGCCTGCCAACGTCGCATCGACCTGGGCCACGCCACGCTTATGGCCGTGGTGATAGAACAGCGCTGTGTCGCCCCACTTGTAGGCGTAATAAAGCATAGGGCTGGTATCGACCGACACGCGCGGCTCGTTCTCGTAGATCGTCGCCAACAGCTCGCGAACCCATACGGACGAGGCCGGGTCATGGTTGCCCGAAGCCATGACGACGTGAACGTGCTTGTGCTTTTGCAACAGCATATCAACGACGCGACGCAGAGTGCGAATAACGACGCGGACCACCTTATGCAGGCGGGAATCGGCGTCGAGAACGTGCTTATGCGCAGGCGTCACGCTCTCAAGAGCGTCATGGTGCAAGAGGTCGCCGAGCTGCGCGAGAACAGCTGTGTGCGCCTGCGGAGCGCCAGCCACAGCGGCGGCAAACCAATCCAGCAGCAACTGTTCGGCCAGCCTTAGATCGTAGTCGGCGCCCGTTTCCTCGCGATGGGCCAGCATGCCAAAATGGCTGTCGGTCACAACGAACTGGTTGAGGAGGTCTTCTTCGATGTGCTGCGGCGCAGGCATTATCGTGATGCGAGGCAGATTTTCCTTGAGGCCGTCGACAACGGCGCGGAACGCGGCCAACTGCCCTTCGGCATCCACCGCCGTCTTCTGCCACTGCTGGATAACGCGCCCCTCAGCATCAACGAGGGCAGACACACCTTTGACGACGTGGCCAGTCGGCACCGCGAACTTCTCGCCCCGTTCCGGCGTCTGCTGGATGAACGTGCCGCTCGGAGTGTTGCTGATACGGCTTATGCGAAAGCCAGGAAGGACAGGCTCCGTGCCAAGAAGGCCAGCTTCGGCGGCGCGTTTGATGCTCTCGGCAAGAGCGGACTTGCCGATGCCAAGCACGGCGGCAGCCTTCACGAGCGTGCCGTGCTCTTTGTAGGCATCGGCTCTCCGGCGTAGTTCTTCTATGGGTAGAGGCATGCAGTCTCCTCGATGTGGTGGTGAAGGCCGCTTGGTGGGCGGCCTGATTATTTTACTTACGCGCCAGCAACTCACAAACCGCAGTCGAAGGCACGACATAGCCATAGCCAACCAGCGAGCCTGAGAAGCCAATAGGCGCAGCCATGACGCCAACAGTGATGCCGATCAGGTCGCCGTTGTCTGCGTAGACGGGTCCGCCTGACTGGCCCATCACAGTTGTGATGTCTGTCACATAGACCGATTTCCAAGGACCTGTTTCGCGGGACTCGCCAGCGATCTTTCCGTATGCGGCAACGAATTCGATTTTCAGAGGATTGCCGTAAGCAACGATAGGATCGCCAACATTCACAGCGTGGCAGGCCAGTTTTGCGACGCCGAGGCCGTCCGACGACGTACGCAGCAGTGCAATGTCATTGGCTTTGTTGACCCAGAGGACATCAGCCTTGCGAAGAGCTCCGCCCTTCGCTTTCAACTGAACTTCCTTCGCGTCGCCGACGACGTGCGCAGCAGTGACTATGAAGCCATCGCCGATATGAACGCCGGAGCCGTGGCCGTTTTCAAGCTGGATTTTAACGGTGGCTGTTTCTGTTGCTGCGGGTGTTGGTGGAGGAAGGAGCGCATAGGCGGCCGCAGATGTCGCGGCGATGACGAGGAATAAAGCAACGAGGAATGCTGTTGGCGTACGCCGCGACGCGGCATGCAGGAATCGTCTGAGCATGATTATCCCCTCTTTCTGGCGACCGTGCGCCAATGTGGTGGTCTCGGTTGGTTGCCGAGCGGCTGGTGAGGCCGACAATCAAGCTATACTGTTTTTACAAATTTGTCAAGATTTGCGGCCGCCTCAGAATCCCGATTGACTCTTTAGAGCATGAGAACATAATAAGAACATCATGTCTCACAAGATGCGAACACAAGCAATGAAACAGTACAGATTTCGGGCTGGGCGCTGGATCGTAACAGTGCGCGCGCCGACCTTTGCCCACGCTAAAATAGCCGCCGCAGCAAAGCTGGATCAACGGGCAGCGAAGCTTTTCGCTAGTCCGCCTGCATGCGGCTGGAAGCTTGAACGACTAGCAGACACCACCAGAGGATAGTCCTGTGCGCCCAAATTCAAGAGAACCGCAAGCTGATCCGGTTGACCACATCATTGCATGGCATGACGGCGATAGCCGCGCCGCGATTGAGACGCTGATGGAAGACATCCAGCACTTGCGGCTGCAGCTTGCGCTGGCGACCGCAGCGATGGGAACCGGCTTCACGAGAGGATGGAAGCCGGACGCCGACAGAAATGCCAGGTGAAGGAGCTTATCTGCTTTCGGACTACGGCGGCGAGCCTGTCGGCGTTATATGTGAGGAATGCGAGCTCCTGAAATTCATTAGTTCGGCCGAGCTAATGGTCGAATTTGGCGACCTGTCGATGCCGACGATGCTGCGGCGCATTTCCCAAGAAGTGATCAAATGCGCTAGGCCGATGGAGGGCTTTTCAGGCCGCTGCATGCTTCATTACCACGCGCGCGCCGGCAGTCAGATCGATGCATTGAAACAAGCGACGCCGCCGGCCGTTAGAGTGAAGGAAATTCGAAACTGGGAGATTGTCGTTGCCAAGTGCAACTACTGCGGCCACGTTTCAAATATCCCGCACTGGCAACTGAACCGCGCGGCGAAGACGGACACTACGGTGGACGAAATTGCCAAGCGACTGAAATGCAAACGATGCAGCGTGAAGGGTGATGTGAAAATCACCATAGCCAAGATGCCGAGGTGAGCATGTGTAACTTGTATCGCATGGAGGACAAGGACTGGGTCTCGAAATGGGCTCAGGACGCCGAAAGCTTTATCAACCTGATGCCGGCCTATCAAATGAACCCCGATCAGATGGGCCCCATCGTCCGCAACACTGCGGATGGCAAGAAGCAGCTTGTGCATGCGCGCTGGGGCCTGCCCTCACCGATTTTCGTGCAGAAGAAGGCTGCGGAAGCTCGAGCGGATAAGCTGAGGGCCAAAGGCAAGGCCGTCGATATGGATGAACTCATCCGGATGGAGCCGGACCGCGGTGTGACGAATGTGCGCAAGCTCAATCTGCCTCACTGGACGCGCTGGTTCGGCGTCGAGCACAGGTGTCTCGTCCCGGTCACCAGCTTTGCCGAGCCGGATCCGGCAAGCAAACAGGAAGGTGGCAATGTGCCTAATGCCTGGTTTGCCCGAGACGAGGCAAAATCACTCATGTTCTTCGCGGGCATCCACGTGCCGCAATGGAAAAGCGTTCGCAAGGTGAAGGACGGAATGACCACGGACGATCTCTATGGCTTCCTGACCACGGATCCCAATGATCTCGTTAGGCCCATCCACGAGAAGGCTATGCCCGTTCTTCTGTTGACGAAGGAAGAAACCGACGTCTGGATGCGGGCGCCTTGGGATGAGGCAAAGCACCTCGCCCGCCCACTGCCAAACGATGCCCTGATTGTGCTGTCGCGTGAACCTTATGGCTCAACGATTGTTTCGCGAAGCGGAGAACCGGTGGAGCAAGGAAGCCATTTATAA
Protein sequences of DBSCAN-SWA_5 >NC_022535|1761573:1768497|1766008_1766674_-|WP_022556384.1|DBSCAN-SWA MLRRFLHAASRRTPTAFLVALFLVIAATSAAAYALLPPPTPAATETATVKIQLENGHGSGVHIGDGFIVTAAHVVGDAKEVQLKAKGGALRKADVLWVNKANDIALLRTSSDGLGVAKLACHAVNVGDPIVAYGNPLKIEFVAAYGKIAGESRETGPWKSVYVTDITTVMGQSGGPVYADNGDLIGITVGVMAAPIGFSGSLVGYGYVVPSTAVCELLARK >NC_022535|1761573:1768497|1762707_1762983_-|WP_052349964.1|DBSCAN-SWA MKPGDEVVCVDDTTLPEQYPGIRAGEIYTATWVGMCRTYLGGDYAGIRLAGVNRGVCPQFGDEDPPFALRRFKPVVKPNVEGKKRIKETVC >NC_022535|1761573:1768497|1767244_1767721_+|WP_006697486.1|DBSCAN-SWA MPGEGAYLLSDYGGEPVGVICEECELLKFISSAELMVEFGDLSMPTMLRRISQEVIKCARPMEGFSGRCMLHYHARAGSQIDALKQATPPAVRVKEIRNWEIVVAKCNYCGHVSNIPHWQLNRAAKTDTTVDEIAKRLKCKRCSVKGDVKITIAKMPR >NC_022535|1761573:1768497|1761573_1762230_-|WP_022556376.1|DBSCAN-SWA MVDAITSKDTGCLTAVPANDNVPAELRALGAAIGKVSTSLPTVIALTGLAGSGKSTASKYLVERHGYQLVKFAGPLKDMLRAIGLGEGHIEGAHKETDLAMLSGRTPRYAMQTLGTEWGRKCMGEDFWTNLWRSRVDGVLAFGGCVVVDDCRFPNEADEVRKLGGVVWQLVGRGGIAGSHESEAGCGGADVEIHNIGDIVDLHRQLDAFLHWHLEDAA >NC_022535|1761573:1768497|1763185_1763845_-|WP_048902829.1|DBSCAN-SWA MALREERTRCTPVAANDNRPQSQDNGQYIGLTRNTPAEPPRAGDFMQTFTGRKYWPMSPRPHEVYIEDIAHSLGLQCRYAGHCIKFYSVAEHSVLIARHLAAKHAPEVALAGLLHDAPEAYCVDIPRPLKPYLTNYRAIEQDNWLAIAARFGLPKELPREVHDADNSIIADELVNLREMPWHARYAGKELGVKLRYWSPEEAELEFLATFDALMAGRAA >NC_022535|1761573:1768497|1767723_1768497_+|WP_022556387.1|DBSCAN-SWA MCNLYRMEDKDWVSKWAQDAESFINLMPAYQMNPDQMGPIVRNTADGKKQLVHARWGLPSPIFVQKKAAEARADKLRAKGKAVDMDELIRMEPDRGVTNVRKLNLPHWTRWFGVEHRCLVPVTSFAEPDPASKQEGGNVPNAWFARDEAKSLMFFAGIHVPQWKSVRKVKDGMTTDDLYGFLTTDPNDLVRPIHEKAMPVLLLTKEETDVWMRAPWDEAKHLARPLPNDALIVLSREPYGSTIVSRSGEPVEQGSHL >NC_022535|1761573:1768497|1764702_1765956_-|WP_022556383.1|DBSCAN-SWA MPLPIEELRRRADAYKEHGTLVKAAAVLGIGKSALAESIKRAAEAGLLGTEPVLPGFRISRISNTPSGTFIQQTPERGEKFAVPTGHVVKGVSALVDAEGRVIQQWQKTAVDAEGQLAAFRAVVDGLKENLPRITIMPAPQHIEEDLLNQFVVTDSHFGMLAHREETGADYDLRLAEQLLLDWFAAAVAGAPQAHTAVLAQLGDLLHHDALESVTPAHKHVLDADSRLHKVVRVVIRTLRRVVDMLLQKHKHVHVVMASGNHDPASSVWVRELLATIYENEPRVSVDTSPMLYYAYKWGDTALFYHHGHKRGVAQVDATLAGMFREMFGASKYAFAHVGHLHSDEGRKSALMYVERHETLAAPDAYAAGGGWLSGRSAKVITYSRRFGEVARATLRPEMVAGRYAAANDNEPARAVA >NC_022535|1761573:1768497|1767054_1767255_+|WP_022556386.1|DBSCAN-SWA MRPNSREPQADPVDHIIAWHDGDSRAAIETLMEDIQHLRLQLALATAAMGTGFTRGWKPDADRNAR >NC_022535|1761573:1768497|1762988_1763189_-|WP_022556379.1|DBSCAN-SWA MKKLHYFMFSYAAANGTFARVTMGWPDKYVNEGRIREARNAVLGGADACPIAVSYLGEMTREEAVS >NC_022535|1761573:1768497|1764548_1764698_+|WP_158454197.1|DBSCAN-SWA MKTFLKFRHGKTEGEANNTLGMLLLTIIAVTYRLAPFIALGAGGYWYVR >NC_022535|1761573:1768497|1763913_1764279_-|WP_052349965.1|DBSCAN-SWA MNLFAADCHAASRRAGWYTDLATGRALDRNVPEMLCLIHSEISEAMEGYRKSKPGKVLMDDKLPHRPMAEVELADAMIRIGDLAAFLGYDLGGAAVEKMAFNANRPDHKIENRLMAGGKAF |
11 | Rhizobium_phage(66.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
1783178 : 1822766
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NC_022535|1783178:1822766|DBSCAN-SWA ATCACCCCAACTTCACGTTTCGTTTCTGAGCAGACCGAACGGCCGCTTCAACACGGCTTTGGAGCTCGCCCTGCATCTTGGCCAGCGCCTTTTCCTGTCTCGCGACGGCTTCAACAGACGCGCCGCGATTGTCGACGACTGGATTGAAGTTGACGACGACACCGGCAGATTGCGCGGACATGGATCTCAAACTCGGTACACTCGGGACCGAAATCCCGACCGGGCCGCCGTTGGCGTAGCCTTTACGGAGCGCTTCAACCGCCCCTACTCCACCGGCCCGCGCCACGTCGGCCTGGGACCACACGACTTCGCCTTTGTGCACGATGCCGGCAGGCTGATATTTGCCGCCCGGACCAGTGAAGCCGCCCTCTGAGAAGAGGCCGCCTGACAGCCCGGCGTAGCGAGATGCCCCACCGCCAAAAAGGCTGAACAGGCCACTCAATAAACCGCCACTGCCGCCAGCCGCACTGTTGACCTTGAAGATGCTATTGAGCACATCGTCCAGGAGCGCATCTGCGATGCGGCTGAGCGCGCCAGCAAATGCATCCGCGGCACTCTCACCATGGATCAAATCGTCGATGAAGCCGCGCGTTGCGTCTCTCGCGACGTTGTTCCACTCGTTCAGCTTCTGCTGTTCCTCTTGCGCTTTTCTAAGCGCCTCAGATTGCCGAGCATATGCAGCAGACTCCTGCTCTATTGCAGCTATCTTGTCGGGCGAAAGCTTGATGCTTTCGAGGTCTTTTTCGCCCTTCTTGCGAGCCTCTTCGCGAAGGTCGGCCAGCGCCTTCTGCTCCAGGTCTAGCGCTGTACGGCGCTTAACTTGAGCCTCATTGGACAAGCCAATAAGGTTCATTTCCTGGCGCAACGCCTCAGTCCTATCTCGGACAGCCTGAAGGTCTTCCGCGAAGCGGTCAGATGCGGTCTTTTTGGGAGCCCTTGGCCGCTTCGGCGTACCAAAGCCTCTGTTTTTGTCTGTGTCCAGATCGGATGGGCGGCGTTCTGGTGTGGGGCCGTTGTCCGGTAGGGGAAACTGCGTTCCTTGGATTGTGGCGTCCGCGCCGAACTGGGAATCCTGTTGGCCTGCTCCGCGCCACGTCCTCGGATCATTGATCCTGGAGGTCGCAACAGAGGTGACCTCATTGACCTTTTGGACGCTATCCGCGGCAGTCAACGCTGCCGCAGACAGTGTGGAGAAGTATTTAGCGAACTCAGCGAGTGCAGGAATGCCAGTGCTGTTTATCGCCGCACTTAGAGCGGCCTGCACACGGTCGACATCTTCAGTCTGCGCCTTACCTTCTTCGGCAGCCTTCGCGAAGTCATTGAATGCTGACTGGAGGTTCTTGATAACGTCAGCTTCTTCACCTGCAGACTGGAGTTGCGAAACTAGATCGGCAATGGTGGCACGCGTACTTTCAAGCTCTTTGCGAACGTCAGCGAGCGTATTGGTATTTACAATGTCGGCGCCCTTGGTGAGGTCGGCATTGTCCTGCGCTCGTTTCAGCTGGTCGGCGTAGTCGCGCAAAGCGGGAACGGCATCGCCCCAGCGTTCGGCGACCGCAGCAATCAGCGCAGCCTGCTCTTTGAGCACTTCGGCGGACTTATCGCCTTCGCTCATAATCGTCGAAAAGTACTGAAATGCCGCTGTGCCTGCGGCGATGACGCCAATCGTTACCAGCGACAACGGAGAAATCACCGACGCGAAAGCAGCCGCAAGACCTTGGCCGACACCCTGCCCGCTATCCTTTATCTGCTGCAGGACGGCCGAAAGCTGGGTGCCCTGCTGCAGGGCGATCTGAATGGGAGACATCCCCATCGCTGCGGTCACGCCGATGTCCTGGAACTGAGCGGCAATGTTTGCGGTATTGAAGCTGTTGCCGCTACTGGTAGTGACCGTAGCCTTTAGCGCTGCGTTGCGACCCTTAATGGCCGCGGTCGACGCAAGTGCCGCCTGTCGCTCCCTCTGAATCGCAGACGCCATCTCGTTGGCAGAAATTGCGCCGGCGGCATGGGCCTGCCGAATTTCGGCGACCGCATTCTTATAATTCGAAATTGTTGCGAACAACGGGCTGTATTTGGCGCGGAGGCGATCAAGCTCTTTCTGCTGATCAGCAAGAACCCCGTTCCATTCTTTTGCTGCCGTCGTACCAATGCCCACCATGCTGTTGATGCGATCCTGCATCGAGGTGGTGAGGGAGTTGTTGATAGACTTACCGGTAGCGGCAAAACGTTTCTCAATGCCGTTGGATGCTGCGCCTACGTCCGACACCAGCCGATTTAGCGCACGCTTTACGGTTGCAAGATCGGTGCTGATTGAGATAATCAGATCATCACTGTTGTTGCCGGCCAAGTCGGTGTCCTAACGTGAAAAAGCCCGCCAATGGCGAGCTGGGAGGCGATCCAATGGATAACTGGTTGAAGGCGCTCGTTGCGTGCGCGTGCGTCGTCATCATTTCTGGCGGCGGCTATTATGCTTGGGGAGAGTACAGTGCCCACCAACGCGAGAACGAACGGCGCGAGAAGTCTAACCGCGAAGCCTCACTTAAGAATCAGGCGACGCTATTAAAATCCAAATTTTCGGCAGCTGAGTGCATCAGGATGGCGAAAGAAACCCTGCCAGACAAAAAAGGCGAGCCGGTTAAGACGACAAAGTATAACGGCGACTTATCTATATGCGATGACCTCCAAATGTTCGACCCGACATGGAGGCAAGCACTCGATATGGCTGGCGTGTTCTGATGGACAAGACGGCAAACTGTTGCCCGCCGTCAACTTGACCCATCCTGGGGGGACGCAACCCTATCTTCACTCCATGTTTTGCCATGGTGATCTTCGTCATTCGGGGCTCAACACAGAATCTGGCTAGACAACCCATCATTGAATTTTGCATCCTCGACAAATGAATGGGGGCGCAACATGCATAAGCAATACCACCTTGAAAACTCCACCTATCCCGACACTCACCGCATCTACGAAGAGCGGCTATCCATCGCTGGTATTCACCATTATCGGAAGGATGCCATTTCGTTTTGCAGATCCAGAGAGAAAGCCATCTATTTCGATCTGGATGCGGCCAATCCTTACGATAGGAACGCAATCAGAATCATGGGCCGATGGAAAGGTCTGTGGGGTACGAAGGTTAAAATACTGGGATACGTAGACGCAGACACAGCTAGTAAGATAGCTGCATTGGGTATTCAAAACGATATTCTTCCACGACTACTCAAGACATATGTCGGCGAAGACGACTACGTCGAGATTATGTACCAAATCGTCGGCCCCAAAGATGGGTACGCAGAATACTCCCCGCCACGGATAACACCAGTATCGACGGCCAAGAAACTCATGGAGGCTGGAAACGATGTTGAAGCTGTAAAAGCGCTGCTCGCCGACATTGACAAGGAAGAGATTGAGGCGAAAAAATCGGGCGGTGGTGTAGCGGCCAGATCATACAAAGCGCTTGCGGACTTTTACAAAAAGCAAAAAAGCTACGATGAAGAATACGCTATCCTTGAGCGGTTTGTGTCGCAGAGGCGCGCAAGGGGTGTAAACCAAGATAAGTTGGCCGAGCGATTTTTGAAGGCCCGCGAATCGCGAGATAAACGAAACGCTTCGAAAACTCCGTAACTTGCTTTTGAAAGCAAGTTTCCAGACGCCTTCGAAGAGAGTTCGTCGCCGCCACGGGTACAGCGGTCAAGCTGAAACCAGCCTAACCACCATACTTCTTGATCAACTCATCCATCTCTTCGTCAGACGGTGGCGCCACGGACTTTTTGGCTCCGTTCGCTTCGGCCTTCCCCTTCACCGCAAGGGTGAACTCGGTCAAGCTTGACGACCAGAATATTACTGGGGGCCAGCCGAGGCCGCCGAATGCGATCTTTTGCCAATCGCGCCAAGGGAAAGGTTCTTCTATGCCGCCTTTAGAGCGGCTTCCCCGTTTCCCTCGTCGTCCTCATCAAAATGATGAGACAGCGCCTCGGAAATCGCCTTGGCAACGGCGCCGAAGTGCTTCAGCTTCAGCGCACCAATTGCCGCGACTTTGTCGCCGCGCACGGTAAGCAGGTCGAGTGCGGCCACAGTAGCGGCCGGCTCGACGCCTGAAAGGCGAAGAAACAGATCAGACATGCTCTTGCAGGAAAGGCGCGAAGACACAGCGGCAAGACCGCCCATCTCGGCAACGATAACCAGCGGCACCTTGCCAACGAACAGGCCGACTTCGCCCCGCGCACCATTCACTTCAAGCGGGAACGGCTTTTCAGCATCAGCCAAATTACACCTCCGCAACGAACGTCAGAACGCCGGCAGCAACGAACGTGGCCGTGAATTCCATGTTGCCCTCCATCTCGCCGCTGAACTCGAATTCAGAAACGAACCAAGGGCCGGTGTAAGTACCCAGGCCAGGAACGATCACCTTGGCATTGAATTTGGTAGCATCATTGACGTGCGTCATGAACGCGGTGTTCGAAGCGCTCTTAACGAACTTGCCGGAGCCAGAGAACGTGCGGTTCTTGATGCCCGGCTCTGCTGTTTTCTGTGGCGTATTCTCTGGATTGACGCAGTCCGTGATGGTCGTGTCGACCTCATTGGCGGACATATTGAAACTGCGGGTCGTGAGACCGCACAGGTTCGAAAAGACTTCAGGAGTTTCGCCGTCACCGATCTGGATGAGCAGCGTACGACCAATCTGTTGACCGTCGGCCATTTGTAAACCTCAAAAAGTAGGGGTTGGTGGCCAATCAGGCCGGTGTCTCGACGCGCGCGACAAACTCGACGACACCGTGGGTCGTGACTTCGTCCGGGTCTTTGAAATGACGGGTGTCTTGCCGCGTGATCGATATCAATCGATGTGAGGGCAGCACTAATGGCGCTTCATCCAGAGCCTCGACGACCTCGTGAATGAGTTCCTTCAGCTCCTTGAAGCCTCCGGAGTATTGCGACCAAACGTGAATCGTCACGTAGATGAGGTTCGACTTTAAACAGCCGACATCATCCCTGATGACTTGGCTTTCGCCGTACTCGACGTACGGAAACGGTGCGTTGGTCGGCGGTCTGTCATAAATTCTTTGGGCCACCTTCGCCGTCAGACCTGCTCGCGCCTTTAGCCTCGCAACGATGGCACCCTGGAGTTCGAGATCGGGGTTAACCATTATTTTTTCATGGCCTCCCTCACGCCTCGCCAGACGGCGTCGTTGATGCGTTTCTTCGCTTTGGCCCTAAATGCTCGCCACGTTGGAAAAATATGCGGTTGCGCCCGCGTGCCAGGGTGCATCTTTGCGCCGGCCGCCTGCTTCTTGCCGGCAACCGTACCGCCACCCTTCGCAACGTTATGCGGCCGCGTGCCGAACTCCAAAAAATGCCAAATCCACGCAGCGAAAACGCCAGTCGCATCAGGATCCTTGCTGGCAGATGCACCGACAAGCGCTTTCGCAGACGGTCTGTCAGAGATCTTGGCGCCCTGTATGGAGGCAGCGTAGTCGCCAGCCGTTGCGCTGTTGCTTATCGGCGCTCGGTCGGAGATTTTATCGGCGGCTTCGGTAGCGATCTGTAGCTTCGCTTCGGCGGCATACTTGTTGGCGAGAGGAGCGACCTGATTCAGCTTCTTCGTCAGCGCTTCGCGGCCTAAGACTTTTGCCTTGATCACGACGCCTCCCCGTGTACCACAAACAGCTCGATCCACTGGTTGCGTTCGTCGATGTTGACCGCAGCCTTGATCGCATAGAGCACACCGGTGCGCTTGTTCCGCGCCCGCCAAGCTGGCGTGATGATGCGCGTGCGCTCGTTGCTGCGGACAGTCATGGTGTAGGGCTGCAAACCTTGAAGGCGGCTGGCAATTACCGTCTCGCTGCCGACGCGCGGTTCAAGGCGGGCTGGCTCCACGAATTGCTCAGCAAAGCCGACCACTACACCGCCATACCCATCATCGCCCTCACCCTCGGCCTCAAAGCCGATGCGTTCACCCAGCGAGCCTGCGCCCGCCCTCCTGCGTTTTGGCATTCGGTCGATCCTTGGTGGGTTCGGCAGCCTCGGCCGCTATTGCAGCCGCGGCGCACTTGCGCGTGACGTTGTAAAGACCGATCGAATAGGCAATCGTGAAACCCGGCTGGCGCCAGTCGAACGGTTCATGGAAGCGGAGCCACATAGTCATCCGCCACTGTTCGCCAGACACGCCACGGAGCCAGAAGCATCCGTACCGCTCGCGGAAGAACTGCGTCGCCCGTTGCCTTTGGGTCAGGCTCTCGCACCTCGTAGAGATCGCCGGTCACGAGAAGGATCGCCGACACGATCGCGGGCGTTGCTGCAATGCCGTCGGTGACCGTGGGCGTTTCGCCTACGGCTACAACTTCGCGATCAAGATGCTGCGTAACGATGCTTTCTGCAGCATCACGGTAAAGGCCGATCTCCGCGTCTTCATCATCATGAAAGACACGTAGATGCTTCTTTACGGTTTCGAGATCGACGATTGGCATATCAGGCCGCCACTACTGCGGCAGTAGGCGCGCTTGTGACGGGCACGCTGCCCTTGTCATTGGTGGCTGTGACGCGAACCGTGATGGCCTTACCAACGTCGCCGGCGACCGGCACATATGTAGCCGCAGTGGCGCCAGAAATCGCGACGCCAGCAGCAAACCACTGCCGCGCGTAAGTCGGCGAGCCGGACCATGTGCCGGTTGTCGACGTCAGCGTCTGGCCAACCTGTGCGGTGCCTGTGATGGCAGGTGCAACAGAATTCACCGGTGAGCCGATACCGTTGACGATACCAGCGCCGATATAGGATGCTACCCTGCGCTTACGAACCTTCGTTGACAGCATCGGGTTTGTCCTTCTTCTTCGTCGAGCGAGCAGAGGAGATCACAGGCTTGTCAGCGGCTTCATCCGTGGCGTCGTCTTCAACGGCGGCCTGCGCCTCGCCCACGATGTCGACAAGACCCTGCGCCTCGAGCTGCTTCGCCTCGCCTGCCTCGACCTTGAACGGATCGCTCTTTTTGGTTTTCAGTTCTTTGCCCACCGCAAACGTGCGCTTGGCCTTCACTTCCAGAAAATCGGTCATGTTCACTCCCTTTCAAGAAAGGGGAGCCGAAGCTCCCCATAGGTCAATCAGGCGCCTTCAACGTCGCCGGTCACGAAGGACTCAGGGCGATAGACTGCGAACGCCAGGCGCTCTTCGGCGCGGATGGTGAACATGTTCTTCTCGAAGTCGTCGACGTTCTCGCTGGAAAGCAGAACCTCGATTTCCATGCGGTCGAAGATCTGGGCTGCGAAGCTGAACGCACCGGTGAGGAACTCACCTGCGGCCATGGCCTGCGTCGAAACGACCGGCAGGTTCCAGAGCGTCGGAGTGAGCGAGCCCTGCGGATTGCCGATGATGTAGTTGCCGCCGGCATCCTTGGTCAGTTCGATCTTCGCCCAGTCGATGGGGTTAAGCACGAACGCAGTTGCCGGATACTCGGCGAGAACGACCTGCAGAACAGCCAGACGGAGACGGTCAATGCCCGTTTCGTCAGCGGCAGCGAACGCCGGGTTGAACGCGGTAGCCTGCGGAACCAGGCCGTGGATGTTCTGGCCGGTGCCAGAGCCATTCAGCAGCTGATTTTCTTCGGCGAAGCGCAGACCGTAACGAGCGCGGCCGTCGATGTAGGAGCGAAGCGCGGGCGCGTCATCCAGGATCTGGCGAGAGGCCTTGAACAGGTGCGCGATCGTGCGAACCGGCGCGGACGTCATGTCGAACGTCAGATCCGAATAAGGCTTTGCAGTCGTTTCAGCGACCGGCGCAGCATTGTTCGTGTAGCCCGTTTCCTTGACGTACTCGATCGAGCTCGAAGCAGTCTGACCCGGCAGAACGAGATCGCGGATCGTCAGCTGGCGCTCAGGCAGGCCGAAGATGCCGGGAACACGCGCGCCGGGGACCAGAGACGTACCCTGAGAGCGGCCGGCGCCGACAGTGGTATTGGCAGAGGTGATTGCAGCACGATCAGCCTTCACGCGGATAGAGCCGCGAGATGCACCGGTGAGCATACCGGCCTTGAACTCGGCAGAATCGATGACGAGGTCACCGAGCGACTTCTGCTCGTTTTCGCCGATTTCGTTTTCTCGAGCGGCGCGCTTCTCAAGGTCACCGAGGCGGGTCGTAACGTCGCCGAGTTCGGAAAGTGCCTTGTCGGTCTTTTCCTTCAGCTCAGCGGAAACTTCGCCGTTGGCGGCAAGCTTCGTGGTAAAGTCGGTTGCGAGATTGCCCACCTGCTCCTTGATGGACGCAAGCGAAGTACCGAGCTCGCCGATCTTATCGGCAAGTTGATTATCAGCCATGAGTGGCTCCTTTATCGAATGAGTGGTGATTTTGCTTCGGCGATTAGCCGGTCGATGGCTGCCAAAGCAGCAGCATCCGCATCGACGTCAGGAGCCCCCTGACCATCCTTGAGGTAGAGCCGAGCGGCCCGCTCTGCCTCAGAGCCCGACAACCCCATCAGTCCCCTGATGCCGTTTTCGAACTCGCGTTTGGTGATTTGCTCGCCGGCCGTCATCTTCGCGACCAGCGTTTGTGCGGCATCCGCCTTTGCGGCGTTCGCGGCCTTGATGCGCTTTACCGGCGCAGGCTCGACGTCGGCGCCGTAGCGGGCCAAGGTCTCGTCAAGCGTCGCAATGCGGTCGACCATACCGCGGTCCATCAACGCTTCCGCGTAGAAAACTCGGCCTTGGCCATAATTATCTTCGACCTTGCTGACAGTCACGCCTCGCCCTTCGGCGACGGCTGCGACGAAGCGATTGTATGACCGGTTAATACCGTCCTGCACATGCGCCAGCGCTTCCTTGCCGAGCGGTTCGGTCTCGTTGCCTTCGACCTTGTGCTTGCCAGCAGAAATGTACGTGCGTTTGATTCCGCGCTGCTCAAGGGCAGCAGAAAGATCGTCGTGTGCAGTGTAAACGCCGATCGAGCCGGCGCGGCCGGAAGGCGTGACGACAATTTCGTCGGCCGACGCCGCAATCCAGAACGCCGCACTTGCAGCCAGGCTATTGACCTGCGCGATGATCGGCTTATCGCCACCGCGAAGCTTGCGAATTTCGGTTGCGAGCTCGTCGGTGCCTGGGACCGTCCCGCCGGGGCTGTCGATGTCGAGCACGACAGCCTTGATATCCTCGTTCGAAAGCGCTTTGTGCAGCGCTCTCTTGATGCCGGCATAGGAAGTGCCGCCGCTCATCGCGGAAAACAGGTCCATTTTATCGGCCAGAACCCCGTAAACCGGGATTACGGCGACGCTGCCGCTGGTTTCAGCAATTTCCTTGGCGCGCGCGTCGTCGATTGATGCCGCGAACTCGGACGAAAACAGCTTCTCACCTTCGGCCCGCGCCACCAAAACATCAGCCAAAACGCCCAGTTTTTCGCGCTGAATAGCCCAAGGTTCGGCCAGAAAGGCCGAAATCAGGTGTTCAAACTTCATGATTTTCCCTTATGCAGCGCGCGCTGCTGGCGTTGGCGCCGGAGTTTCTGTCTTGCCGAGCGTATCGAGGCGCGTCATCGTGCCATTCACGATGGCTTTGTTGCCGCCGTCGACTGGCGCCTTGTCTTCGTAAGATCGAGCCTCATCGACGAGGTAGATGCCGTTCGTGACCATCTTCGACAGGAATTCTGCTCGCGCCGTGCTGTCGCCGCGCAAGAGCTCTTCCATGTTGAATTTCACCTTCGTGGTCTTCCTGGTCTTTGCGTCCAGCAGGTCACGATAGATTGCCGCCTCGATGCGCTTGAGCATCGGTCGCATGCAGGTCTTGGTGAATTGGAGGATCAGTTGCTCGATGCCGCTGCCCCAGGTGGTCGTGCCGTTGGCTGCGTGCCCGATCATAACTGGCGGCACGCCGAAGATGCGGCAGATCTGCTCGACGCTGTACTGCCTCGCCTCCAGGAACTGAGCGTCCTTGGGGTTGATCGACATCGGATAGGGCTTGAAGCCAGCCTCCAATACCGTCACGCCGCCAGCCTTCTCGGCGCCCGCGAACTGCGTCAGTGTGTCGGCGATCTGTTTGCGTTGCTCAGGCTTTAGGATCTGATCCGAGCTGACAATGAGAGACGAAAGCAGGCCGTTCTTGAACATCCGGCCAGCGACCTTTTCGCCTGCCAATGCACTCCCGACCGTATTGCGCACAACCGCGATCGGCGACATGCCACGATCACAGCCCGGCAGTCGGACGCCGCGGACATGAAACATCTTGCCTTCGGGCACACGGCGCTTTTTGCCGTCTTCCGTCACCTCGTAGTGGCGCGTATTGCGACCGTCTTTCGACCGGCACACATCAACGCTAAGAGGGTGAAGTGGGTTGAGCGCAACGAGGCGCTCGCCGTTCATCTTCTTTTCCGCGAAGAAGTTGCCATCTAGCAGCAAGCACATCGCTGCCATCGACCAGAACTCTGGCGCCGTGTCGTCCATGTTCGGCATGTCGTGCAGAAGCTCGTAAAGCGGAGCGTTCTTATCGACCGTCACGCCGTCCTCGCCGTAAACGATGCAGGGAAGCGTGCCGGCCGCGTTCTGCACGAGGTTGACGCACGCCCAAACCGCATCAAGCGACAGGGAACTCTCAATTGTGACTGTCTCCCCGGACGTGGTGCCGAGGCCAAAGAAGCCTCGCCAGAACTCGCCGTCGGTGAGCTTGATAGGCCTTCCGACCCATCTCTCAAAAAAGCCCATCAGGCCTCACCACTAGGTTGCCCGTCACCAGGTGACAGAGATGATATTGTTGACGAAGTCGTCGAGGTTGCCGCTATCGCCGCCCTCGTAGGTCCCGGCCATCGCCGTTGCCATTGCCAAGGCGACTGCGCCGTCGATGCGGCGTTCGCGATTGTGTTTGACGAGCTTTCGATTTCCAGCAGGATCCGCCTTGACGGTGGCATTCATCATGCACATCGTAAGCACCGGATGGTCCCCGTGGGCCAGATTGCCGTTCAGGATGATGCTCTCAAGCTCGCGAAGTGCCGGAGACATCGACTGGAAGCCCTGCCCGAAAGGCTGGAAAACAGCATCATCGCCCTCAAGCTGATCGTCCGTGAAGCCGACCTTCTGCAGCCATGGCTTCAGGTGCCTGAAATTCCATCGGTCGAAAGCGATCTTGCGGATATCCATCTCTTCGAACCGGTCGCGCAGGTAATGCGCAACAAACTCGTAGTCGACGGTTCTGCCTGGAGCGGCTTCGAGATGTCCATCCGTATGCCAGACATCGTAAGGCACGCGGTCGGCTTTTGCCTTGGCGCGTATCCCGTCGCCGGGCAGCCAGAACGTCGGCTTCACGTGCCAGATGGTTTTGCCTTCCTGCTCCTTCGGCGCCATGAGCACCAGAGCAGTCAGGTCGCTCACCTCAGAAAGGTCGAGCCCACCAAAGACAGGGAGACCATCAAAGTCCACAACTCGAGCGTTACACGCTCGCCAAATAGCCGGCGACACAAACGGAGCATTGGCATCGATCCTTTGGTTGAGATGGAGCCAGCGGAAGCTGGCCTCCTCAGTCGGCATTCGTGCCGCGCGCTCCGCGTCGTCTCGCACCGATGAAACGGACTTGAACTTACCAAGCGCAGGGTTCGCGGCCTTCCACGCTTCCTCGTCGAGGACGTCGCAATCAGCCGGAGCCGTGTAGAGGTGCGAAACCGTTCTTGGTGCTTTCGACGTCTCGGCATCGTCCAGCCATCGCGAGAACAAGTCGCCGTCGGTTGCCGCCTGCGTCGAGATGGCGAAGATCATCGCCTTATCGCCGTAGGCGCCTTGCGACGTCACGATCGCTTCAACGAAGTCATCGTGCGGACCTTTGATCTGGCCGACCTCATCGAGGATGGCGACCAGTGGCGAACCGCCGTGCGCGCTCTTGGCTTCCGCTGAGCTGGCGCGGTAAACGACGTTCTTGCGCAGACCGACAATCATCTTGCCGGACGGGACGATGCGATACAGCCCTTTCAGGCGCGGCGACATCATCAACATCTTGCTGGCGTAGTTGAAAACTTCCGCAGCCTGGTCGCGAGAACGCGCGCCCGACATGATGCGGCTGTTTGGAAACGCCTCAGGGCCAATAACGTGGCCGAGCAAGAGGCAGGCGATAGTGGCCGTCTTGGAGTTCTTTCGCGCGATCGACAAATACGCTCGCGACGTGCCGTTCGGGTTGTCATAGACCGACAGGATGAAGGCCACCTGGAAGTCCAGCAGCCTGATCGGCTGCCCAACCAGCGCACCCTCTGGTACGACCAAATACTCTTCGATAAAGCGGCACATCTTCTCGCCGCGGGTGAGCTCCGACGTCGGCAGTCCGCGCCAGTCGCGCAGAACCGGGATCGGGCCGCACTTGATGGCGCCGACCACGGCCTCAGAAAGCATTCACAACCTCGATTAGGCTAGGAGTTCGTCATCCACGCTCGCGCCCGCCTCAATCTCCTTGGCATGGTCGCGCCGCTTGGCTGCGTCCCTCGCCTCGCCCTGTACGGCGCGCGCATGCAGCGCCAGCGATCGGCGGAACGAAAGAATGGAAGAAGCATGCATCTGGACCACGGACTTGCGGGGGTTCGCGACCGGAGTGCCTTTTTCGGTGACGGCGACCGAACCCTCTGTGCGTAGCAGATCCTGCTCCCTCACAAGGTCGGCCATCGTGCGAGCAAGCATCGCGGCAATCTCAAGCTGGTGCGCCGACCAATCGGCGCGGGCGTATTCAGCGATGACGTTCTTGAAAAACGGGACGTCACCGTCGTCGAGCGGCACGTTTTCGGGAAACTGGATCTCCTCAGATGCCGCAGAGGCAATCCTTATGGCCTCATCAACGCTGTCGACGCGGCTTTTCTTCTCAGACATGCGGAATCCCCTCGCGCACGCGCGCTTGCGCACGCGCTAGGCAAAAATCTGTGTTTGCATTTGCGTTGTGCTGCCCTCACGGTCCTGCGGTGGTCGGTCGGCCACTTTCGAGGCACCCCTACCCACCAACCTCGACCGGATATCCGTCAACGCCGATGACCACGGCCTGCTGGCCTCGCTCGAGGCGAGCCTTCAGCTTGTCGTGGCATGGAGCGCACAACGATTGCAGATTGCTTGGGCCGTAGAACAAAGCCTCATCACCCTTATGCGGCTCGATATGGTCACACACCGTAGCTTCAGTGACGTCCTCAATGGCTAGGCAGAACCGACACAAGGGCTCGGCAGCAAGCTGAGCCTCGCGCAGTCGCGCCCATCGTGCCGTCTTGTAGAGGCGACGGTAGAGAGCGGCTTCGGTCGAGCGGCCGTAGGGTTTGGGCATTGCACTACACCCTTATGACGTGCATTCTTCCTTACATTCTATCGCTGACTGGGGAACAGAATGGACATTGTTGGATCAATTAGCGCCGTTACATCGGCTATTTCTCTCGTGAAGGAACTTAGGGAAATTGACGCCCAGTTCGACAAGGCAGAGATGAAACTCAAAGTTGCCGAACTTACTGAAGCTCTGTCGGACGCGAAGTTAGGCCTCGTGGATGTTGCTCAATCGCTGAAAGAGAAAGATGAGCAAATCAACAAGCTGAAGGCCTCGATCCAACGTCGAGAAGAAACGGTCGAGAGAAATGGGTATCGGTACCGCAAAGGCGAAAACGGAGAGCCAGTCGGTAAGCCTTTTTGTCCGGTTTGTCTGGAGGAAGGCAACTTCATATTGACGGTCAACAGCTTCGATGACGGCAGGCCGACAAAATGTCCGCGATGCAAGGCGAACTTTGGAAATGTGACTGGCTACCGAGAACAATCCTGAACAGCGTCATCGAACGCTAAAACAAAAGCGCCGGGCTCAACCACGATGGGGAACCCGGCGCACGATCACCATGCAAGCGGAGGAGAACGCGCATGGGATTGGTTGCGGAGGTGAGATTTGAACTCACGGCCTTCAGGTTATGAGCCTGACGAGCTACCGGACTGCTCTACTCCACGTGATAGTTACCCGCTGCGTTCAGCGGCTGCACCGAATGCAGCATGACGGCGGGGCGGTCGTAACCGCAAAGGGGCGAAGGTATCTTTCGATACCCTCTCACCCCTCGGGGATTTGAGTGCGCGTGGCGGTCCCTATGCTGCGCGGTCACGGAAACGGCTGAGTGCCAACTCTGCTAACCTTGCCTCGCGTTTATCGATCTCCTGCCACCTAATGCTCACAGTGCCTATTGCCATGCCAACCAATGCGCGACCAGCAACAGCTGGCTTCAAGACCTCACCCCTGCGCTTTCCTATTTCGGAGTAGCTCTGGCCGCCCAACACCGCATCCTCGAAAGGTTCGACCAGCGGTCCTAGTGACGACCGAAGCTCCGCCAGGATCGGTTTGGCGTCGATATATTCATTCAGGATTTCATCGGTGATCTTCACGTGTAAGCTTTCCGTCTTGATCACACTGCCTGAAGCCACGTTGTCGTTGGCTGCGACCACAGTCCTCCGTGGCGGTATCGAATGCGATCCCTTGCTCTTCTTGACCTTCGTAGAGATTTTGATCTCGCCACTCGGCATTTCCTTCCAGTCGGACGCGGCGGCTCGGTCAATGTCCGATTCTGGCGTAAGGCGTTTGGTTTCCCTGACCACCTCCCCTCCGTCAGCCTTGCTGTAGTCCAACCCTTTCAGCGGCTCAGCCTCACAAAGCGCCACAAGCCGGCGATATCGAAGGACGACAGCTACAAGATCCTCACGCTCATCACGGCGAAGCGCCTCAAGCAACGGAAAGTCCTCGCCGCGGCTCTGAACACAAGAGGGATCACCAATCGACTGCCGCTTCACTATCGCCCTCCGAACCTTCGCCATTTTCCTCGCCTCGTCAGCCGCTTTGGCTTTTGCTCGTTCTCTGCCTGCCGCCCTCTCCTCTGCAGTCTGCGGGATTTTGAGAACGTCTCTTGTTTTGGTTTCCTCGATTGGCTCCCACTTTCCATCCACCTTACGGAAGCGAGTGTTGGTCCTGTATGGATGCGGCTGGTCGCCGGTGAAGGCTACCCTGCCAGTGGCGTCGATGATCTGGTCTTCCTCGTTCATTGCTGTCTCCCCTGTGGTGCTCTTACTGCTGCTGCGCTTACGTTGGATGATTGTCGTTATCGGCCAGCCATCCTTTGACGAGTGCCACTGCCTTGTCGGCCGCTTCTGTCGTGGATGTGAACCGCACCACTTCCACTGGGTGCCCTAGCCGCGCCAGCGACGCATGCCGCTCAACCTGAGCCGGAGACAGGCGGCCTTTGCCGACTTTGTTCTCGATCATCCGCAGGGTGCCGCCCTTGAGGTAGATCCGCAGGTCAGCTTCTCCCGGCGTCATGCCAGTTGCGATTGCGTCAGCCTGGGCCCGTGGCCCGCGCTTCGCGCTGTTCATGTCGCCTGCCAGCAGGAACTGGCGGCCGTACTCCGGCAGCGACCGCAGGGCGCGAACCTGGGCCGCCTGCCCTTCACTTTCCTTGATGGGTGCGTCTGCGACAGTGACCTTGCCTTTTGGCGAAGTACGGATGACGACGCGCTTGCCATTGATGCGGGTGGTTTGGCTTGTGGCTCTGCCCATGGGCGTCTCCTCGTGGTGCGGTGTCGTGGTGGCGACACACTTACTTTCCGAGAGACGAGCGAAAACGGGTAGTCGAATTTGAAAATATTTTTGGGTGAGCATCAGGTTGCACCCGGACCGCCCCTCTGGCCGGTGCACGTTGCCCGAATAAATCATCTTGAGAACCGTGCGCGCGGCACACCGCCCTGCACGGTTGCACGTGTAGGGGTATATATAAATATATACCCTACTAAACGTGCACTAACCGGGCAGCGCTGTGCAGGTGTCTCCGCCCGGACGTGCACAGTTACTGCACGGATAAAATTTTGTTATCCGGGCAAACTTTAACGCGAATATGATGATGCGTTTTGGTTGTATTGCCGGTCGCTTACGACAGCTCTTTTTCCGATTTCGACAAGAACAAGATTGTGGACGCTGTCATATGCATTATGTACTCGACGACACTATCGTCAACGTCAACCGGAGTTGCACCCTGGCCATGCCCAGCGTTCTTGTTGCGGCCGGTGGGTACCCCACTCTCTAGGAGCGCTTTTAAATTCGATAGATTGTTTTGCCAATACGAAGGCACGAGACCCTTATCAAAGCATATTGTAATGAGGTCTTTTGCCGTGGCCCTTCCCTTATCATAAGTCCACCCGCGCTTGTCGCAGATCGCCTTCATCGTGCTCTCGAATGCCTTAAGGCAGTCTACAATCGCTTGCTTGTGTTCGCCAGCACGGTAATGTTTGTGCGCACTGAGGAATTCCTGTCTTGGTCCAGCGTACATTGCGCCATTCAGGAAGGATAGCGCAGGCTTTACAATCTCCGCATGCACAAGCTCGGAGTCTATGCGAACGATTTGATCGTTTAGCCACTCAAACCCTACGCCGTTTTCTTTGATTCTATAGTTTATTTCTTTGATAGCTTCGTCGCAACGATGCGCCGCATCTTCCAGCGCGCGATAAGATTGTGATCTGCAAAAAACATTAATGGCAGTTATGCCAACTTCAACCGCATCTAAATAGAAATCAGCCGATTTAGATGATAGAATATAGTTCAAGTATTCCAGCCGGAAATCATCATCGTATGACTTCGAGCCCGAGAGGCGCATTACCCCATGCTCTCTGCGGAGAATTCTAACTACTTCCTCATAGCATCGATGCACATTCACGGCGCGGTCGTCGCTGCGATAATAGTCATCTGGATTGCCAAAAATCTCATCGAATATCTGTACCGTCTGCACCTTGAGCTTTTGCGGAATATTTTCGTATTCGTAAACGTCAACTACGTTCCCAGATTTCTTGGCCGCGCGCTTTGAATACAGTTCAAATATCATGGAAAACTACCCCAGCAAACCAAAGACAATTCACCTATCCTCAATACACCTCAAAGTTTATCTTTCAGAGTAAAAAGCCGCTCAGTCCACAACAAAAAAGGCCGGTGCATCACTGCCCCAGCCTTCCTGTCGCTCGCCGCTGCTGCCAGTACATGCGTGGTCAGTGCGCAAGCTACTTGGCGTTATGAATGCGTGGAGGCTGGTTTGGTTCCTCCTTCTCGCTTAGAATTCAGAAAGTTCTCGACAACACTTTCTGAACGACTCCCTCAATAACTTTCTCCGAAAGGCTGTGCCCTCCGAAGCTTGAAAGCATGGGATCAGTTGCGCTTTTGATGAGAGGAGCAAGGGAGTTTATCGGGGTGTCACACCTCATCTGAACATAACCAGTTTCGCTATCGACGTATGACGGATCCCATGCTGAAATGTAGCGGCTCCAGTCACTCCAAGCTTCCAACTCCCTGCCTGCGATACCCGACCATTTAAATAAAACATGAATCTCTGCATCATCGCTAGCGCCCAATTCGCGCGCAAACGCAAGAAGCACAGCCAATGTTTCTGCAACGCGCCAGAGAACCAATATTGGATCAAGTTTTGTTCTCGGCGGTACCTTGTCTGTTAAATCATCTTGGAGAATACGCCACAAATAGAACTCTCCCGTCTTCCGAATTCGATAGAAATCTACATGACCGGATAAGATCGTAGTCTCCCAGTGGCTGTTGACTTTTCTTGGCTTCCATCTTTGGTCATAAAAATTTCGGGAATCTAGCCAAACAGGCCATCCGGTATACCTTGGGTTACTCGAAGCGACCTTGTTCAAGAAATCAATGTCGTTTCCCTCGCTTATCGGCGGGTCGATTATTGCCATTGCGCTCCATGAACCCATATCAGCAAACGGACTGTAATCTGTTGCAAAGTCGGATGAACTGAGGACTTGATCTCGTCGATCCTCGTTTTCTGCCATAAGGACAGTCAATCTTTCCAACATAGTTGGTGGAGCGGGTGCGGCTCCGAATGACTGCAAGAGAGCGGTAAGCGATTCCCCGCCAAGGTGCCTGCGAAGAAATCTACCGATGTCGGCCTCACGGTTATCGAAACAGATATCTACAATATCTTTCCAATCGGAAGCCAGAGCTGGCGCCGAACTTACTCGACCGTTTGAGTTGAGCGTCCTGAAATAAACCCTATCTTTCAGCACACTGTTTCCACCTGGCACCGTCAATTCACGCTTCGTGGCTACAGGCACTGTAACGCCCGCAGGGACTTTTATTACTGGGTATGGTCGGTCTTTAAACCTGGCGAAACCTACAAGAACTTCGAATGGTATCGACGCGTATTTTGAGACAATTCCCTGCACCACGTCCGCATGAAATGCTTCAAATACGTCTGTTAGTTCATTTTCCAGATCAGGCAGTAGAGTGGTGTCATCGAAGCCGACAACAAGATAACCCCCGTTTCTATTTCGAAGCGCGATGGTAGCTTGTATTATTTTTGCTGCCCCTGAGGGTAAACGTGGATCAATCCATCTCTTAACCTCAACACTTAGCGACTCAGAAAGGCTGGAAACAAGTCCATCTATCTCGTCTTGCGTGATATCCATTAGTCCCTCCAGCATTGTTCGCCTACGAATAAGAACTTTATTTGCCTGGGGTAGCAAGATGCTGTAGCGACACCTCCGCTTCACGCCGCCCTCACAAACACCGCCAACTCTCGCCGCACAGGATCCCGCTCCTCCCGCTCGACAAGCGCGCCTTCACGCATAAGGGCGCCCACAAGGCTGGCCGCCCGCTTGCGCTGCACGTTGTCGTCCAAATCCAGCCCCACAGCGTACGCTACAGCGCCTCCGACCCAGTTCTTGGCCTTCGGTGACTTCTTGTAGTCGGACGCGCTTACGGCCGCCAGGATGGACGCGCGCTGGTCTTCCGTGAGTTCGCCCGCTACTTCCGCAGCGCTTGGCCACTGCCACTCCGTGACGACCGGTGCGAAGTCCTGCGGCTGAGCAAGGCCGGTCCCGTTTCCAAGCGGCGTCGAGACGAGATGGCGCCACTCGGCCTTGTGCGAAAGCGGCGTCAGGTTCGACTTGCCATAGGTGGTGTAGAAATAGCCGAAGCGGTCAGCCTTATCGATACCAGCCTCGCCTGCCTGCTCCTCTGACATGCGGTTCAGGACGCGCACCGAACGCGCCGCGCCTATCAGTGACACCGCACCACGCGCATCTTCAACCGTCGCCTCTCGGTCAGCGACCTTGCGAAGGTGGTGCACGATGTCGATTGAGCAGTTGGTGTAGTCAGCAATTTGGGCCCAAAGCTTCGCCACCTTGTCGATCGCACCGTTGTCGTTCTCATTGACGCCGTGCGTGGACACGAACGGATCGACAATCATGACGTCGATGCCATTGCGCTCGATCTGCTCGACCACCGCCTCGACGATCGGTTGCTGAATCCGAACGCCAGCCTTCTTGTCCTCGATGGCGACGCACAGTTCCTGCTCACGACCGCTGTCCAGGAAGAGATGGCCTTCGAGGTCGGCAGGCTTCAGTTTGTAGTGGATGCAGGCCGCCATGATGCGCCGCTCCATTTCGTCGCGCGGATCCTCGGCGTTGAATATCCACGTGCGCAGACGCTTCGGCGGCTTGGTGCCAAGCAGCGCCCTGCCCGACACCATGGCAAGCGCCTCAGCGATGCTGCTGGACGTCTTGCCTAGGCCGCCAGGCGACACCGTGACGGACACGTATTTGCGGATGAAGTGCGAGCCGTATGCGAACTCACGGCGCGGCAGTGTCTTGGGATCGATCCACTTGAAGGCAGTTGCGGTGATGGGTGACGGAGGCGTGTTGTCGTTGGCGGGCTCGATATCACCAGCGGGTGATACGACTTCCTGCTCGGGAATATCCGTGCGGTTCTGCGGCACGGCTTCCACCACGACATCCGCTTCCACCGCAGCCGCCTGCTCGCGCAGCCTGCCTTTTTCAAGGCCTCGCTGGATCATCCGCGTAATGTCCACCAGACGCGTGTTGTCGTGCGCCGGGAAGTCGGGCTCAGGAATGTGGCGCGGGTTCTGGATGCCAGCCTTAAGACCGTTCTCGATGGTCTTGCAGCAGCGCGACCAGTCCCTGCCCCAGCCGCGAGCAACGTCCTGTAGCAGCGCACGCGCCTCCGCCTCGCTAAGTGCACCAGCGCCGACGATAGTGCCGATTGAGAACGCGGCATCGTTGAGCGCATTGTTACGCGTCCCCATGGGAGCGCCAGCAAGGTCCGCCAGCTCACGGTCGACGGCGGCATCCACATAAGCGTTGTTGGTAGCTGCCGACAGGCTGTACTGCGTGTGAGCGGGCGCCGACTTCGGCAGCAGCAGGTCAAGCAGCCAGGCCGGCGCGTCCGCGATTTCGCGCGTGTCTGTTTCCCACTTGTATGATCGCCCATTCGCCATCGTGCTGCCAGCGGCCAGCACGTAGCCGCCCTCGGACCGGATATCGACGCCAGCACCAAGCGCGCCGCGGTTGCGCGTGCCCACGACGTACTTGAAGTAGATATGCAGCCCGCCGTTTGGGCTCGTAACGCGTGCCGTGTCTGGCAGCGGACCGTGCTCCGCTTCCATCTCGGACAACCAATCGAAGCCATTCGCGCCGCCGGGCTTATTGTCGATGTCCAACGCGAAGAAGCCCGTCTTTTCGCCCGTCGGAAGGCCAACGGCGGCATCCGGCCAATCCGACCACCATCTCTCGATGATGCGCGGAAAACGCGTCGCGCCCTTGAAGCCATTGGGCGTCAAAGGCGTCTTTTCGCCAAGCGTAATGATCTCGCCGGTTGCCTGGTCGACGTGTTCCTCGGCGTGTGAGCGGCATGGAAATACGGGCCAGCCTTGGGCGACGTAGTATTGCGCTAATTCAAGCGGGGTTTGCAATGGGGGTCTCCTTGGCTTGATAAGGCGAGACGGCGGCTGCTACACAACGCAGACACGCATTTTGGCTAGGGGTGGCTATGTCCTGGGGATTCGATAAAGGTCGCATTTACAACAGACGCGCGAGCATTCACGCGCCGTTCGGCGGTCAGCAGCAGGGCGGCATCATCACCCCTAAGAAACATAATCTGGTGATCATCGTCACAGGCGAAGAAGGTCACGAGCATGGCTACAATGACACGTGGTCCCCAGATGGTAGGTTCGATTACTTTGGCGAGGGCCAAGTCGGCGACATGAAGATGGAGAAGGGCAATCTCGGCATTGCCAGCCATTCGGCCCGCGGCAAAAGCCTCTTGCTCTTCACTAAACTAAAGGCAGGCCTAAGGTTCGAAAGCGAGATGGTTTATGAGGGCCATAAGATCGTCAGAGCTCCCGACCGTCTGGGAAATATGCGTGACGCCATAGTGTTCGCCCTTCGCCCGATCGACACGGTGTTCGAGCACGTCGAAAGTGTGCCTGTCGCCTCGCCACAGACGGCCAAATCCTTGGATGATCTCAAGAAGCGTGCCTTTGCCGCATCTGTCCAGAAACCGGCAAAAAATCAGGTGACTACGAGCGTCTTCGACAGAAGTCGTGACGTGCGAGATTACGTGGTTGCGCGATCTCTCGGCAAATGTGAGGGCTGTGGGAACGACGCTCCCTTTACCCGGCCGAATGGTGTGCCCTACCTCGAGCCCCATCACATCCGCCGCCTCACTGACGGAGGGCCAGACGACCCGAGATTTGTGATTGCGCTCTGCCCCAACTGTCATCGACGTGTTCATTCCGGGAGTGATGGCGCCGATTACAACGGGACGCTTCTTGAGAAGATGAAGACTATCGAGCTGGGCGTGCCCAGTTAGCCGGCTCAAAATGGCGCCTCGCTCAACGCCGCCCTTAGCCCCCGAGCGCAGCCCTCCCACGCTGCCTTGACCATCATGCGCTGCATGAGCTCGTCAAAGTGCGCCAGGTCAGTAACACCATGCTCGGCGATGTATTCGCCGACGGCATCAACGCCTGCATCCAGGGCGCGCAGCTCGTAGTCATCGAGGCGATCGATTTTCTTGTAGTTGTCGATGCCCACGGCGCACCTCCGGCAGATGTAGTTAGGGTCCTTGTCTCGGCCGTTGGCGTTCACGCCGATACCGAAGGCGTGCATGCCGCAGACAAAGCATGTGGTCGGATTATGGTCGGCGTCGACCGTTGGCGTGTGTTGTCTCGGGGTGGTTGGGAGTTTGGTCATGCCTGCGCCTCCGCCCATTCGACCCACGCCAGCAACTTCTCTGCAGGAAAGCTGCTAAGACGGCTGCCATCATCCTTGATCTGCAACAGGTATCCGCGCGGGAACCTGGGCGGTGGCTTGAAGCCTTCTGGCAAAACCAGCGAAACGCTGGCCTTGCTCTCATCCAGACCTAGTTCGCGCGCACCGGTAAGGCTTGCGTTGCAAGCTTCTTTCATCTCGGCCGAGGTCATGCTGCCACCTTGGCAACAGCATGATTATCGTTGGCGGCAGAGAACAGATCGGCCACCGGCTGCTCTTTCGTGCCAAGCGACGCGATGTTCTTCACTGCCTGACGGAAGTATGAAGGCTTGAGCTCGAAGCCGACGCCCTTGCGGCCCATCTCGACGGCTGAATAAACCTCGCTGCCAATCCCGAGGAACGGCGTAAGCACTGTCTCACCGGGCAGGCTCCACAGATCAATGCAACGTTCGATCACGTCCAGCTGCAAAGGCGAGATATGCTGTTCGTCTTGCTCATCTCGGGCGGATCGGTATTGCAGTGTGCGTGTTTGGCGAATGTCGCTCCAAACCGGCGATGCGTATCGCTGCCAGACGAATACGGAGCGCCACTGCTCGAAGCTCCAGGGCGTGCGACCATCGGCAATTGTTTCTGCGGCGTGACGGTCATAGGCCTCGCGGCTGATGTCGAGACTTTCGTCGCCCACCCACATATCGAACATGCCATCGACCGGTTCAGAGTTTTCGCCCGGCTTTCGGAACGAGACGATGTAGTCGGCAAGCCCCTGCCCGCTGATGCAGCTGTCCTTCGTGATCTGCTTGTGCAGCAGGCGAATGGATTTGGTGCGCTGCTGGGCAACTACCGGGTCTTTCCAGATGCAGACTTCGGAATGAAAGATCCAGCCGGCGTCCTCATAAGCACGGATGATCTCACCGCGGAAATCGCGCATGCCGATAAAGCCGTTGCGTCTCTTGCTCGTCGGCAGCTGCATGCAGTGAACGCTATGAATGCGTCCCGGTTTTGTGACGCGCAACAGTTCTTGGATAAGAAACGCGTAATGCTCCCAGAATGCGCCGCCCTCATTGTTGCTGATGTCGCGGTCAAAGCTCGAAAACTTGTAGAGGCCTTCGAACGGTGGAGAGTGGATGCCGAAATGCACGCTGTCACCAGGGATAGCGCGAATGAGCTCACACGAGTCACCTTCGTAAATGGCGTAGTTGTCGGTGATGACCTGGTTCACGGCGTTGATGTCGGCGGTCAAGGCCGCAGTATTCTGTGCGTTCGTCATTATGCGGCACCTCCTAGCCATGATGGGATTTGCATCGGGATTTTCGGGTCGTAGCTAGCCTTCTCACGCGCCTGCGCATTGATCGCCTGCTTGGTGATATTGGCTGTGTGGAGCACCATCGCTGCGGCCATGCGGTCGGCGTCGGCCTCCTTGCGCTTAAGGTTGGCAACCACAGCGCCTTCGGTCTCGGCGGCGATAAAGTGAGCTGTGACCTCTTTCTGCTGCCCAAACCGGTAGAAGCGGCGAACCGCCTGATGGATCTGCTCGAAGCTATCGTTGAGACCAACAAATCCAGTGTCTGCACAGTGCTGCCAGTTCATGCCGAAGCCAGCGATCGAGGGCTTGGTTACGAGCACGCGAATCCGGCCTTCGCTGAAGTCGATGAGCTTGCGCTCCTTCACATCCTCTTTATCCGAACCGCGCACTTCGACCGCGCCGGGGATGGCCTTGGTCAGAGCCTCGCTCTCGGCGTTGAGATTGCACCACCACACGAAAGGCCTGTCCTTCGGCGTCATCGCGGCAGCGAAGGCTACGCGATCATCCACACTGTCACGACGCGCCTTGATGCGCTCTTGGAGCGTCGATGCACGGCTGCCCACGAGATCACCGACAGCAACGGGAGCTGTCACCGTGTGGTGAATTTGATGCAGCGCCGGCAGCAAATAGGCGCCATCGTCGTATCCAAGATCGGAAGGCTTGCGAAGCATGACTGCCCACGACGCCATCCAGCGCCAGAAGTCGTTTTCGGCGTGACCTTTGAGCCGCCACTTCTGCGTTTCCCCGCCGTCGTGAGTAAAGAAGGTCGCTAGCATGTCGGAGTAGGACATGATGCCGAGAAATTCAGCGTGGTTGCCAAGCTCCATAAAATCGTTCGGTGCAGGCGTAGCAGTAGCAGCCAGCCGGAACGGTATGCCGTGGCACGCCTCGACTAGCTCGTTGCGATAATGGCCAGTCTCGGATTTCAGAATGCTGCTCTCGTCGAGAATGACGCCAGAGAACTGCGACAGGTCGAAAGCATCAATCTTCTGATAGTTCGTGACGTTAATGCCTGAGCCGATGTCTGACTGTGATCGCACGTGTCTGGCGGGGATGCCGAACTTCTCAGCCTCGCGCACCATCTGCGCCGCGACTGCCAGAGGCGCGAAATGTAGGACATCGCCCTGTGTCGACGCACTGACCGCCTGCCCCCACGCAAGCTCCATGAAGCTCTTGCCGAGGCCAGTGCCGGCGAAGAGCGCAGCGCGGCCACGCTTGAGAGCCCAGGTCACGATGTCGCGCTGGAACGGGAATAGCACCGATGGAAGATCTGGAATCTCGGTTAGACCCGTCGGCGGGTCTAGAATGGCTTTTCTAGCCAAGAACGCAGCATAGGCGTTCGCTTGCGCACTGAGCGCAACATTCATGTTCATATTGTCTCCTCAGACTGTGGTGTTCCCGCCGAAGCGGAAGCCGCGTTGGTGGCGCGGCGGGTGATGTGGTAGTGTTGCCGATGGGGAATCGGGGTCGGCAAGTGGTTGAAAAAATCAAACGTTATTGGTGGCTTGCTGCATGCCTGTGGGCACTTCTCGCCTTTGTGTCCTGGCACTTTGAGAGCACATGCGGGGTGTTTTTCTTCTCCGCGTGCCTCTCCGAGTATTGGGCAGGAATTCGGTGGATCGCCTTGCTCAAATGGGTTTCTCCCTATCAGGCACTACTAGCTGGCATTGCGGCTGTTGTCGGCGGATACTTTGTCCTCCTTTCCCAGCGGCTTCAGATCGATGAGGCGCGCCGCGTTGGCGTCGCATCGAAAGATGCCTCATTTCGCGCAGCTTTAGCGACTGTGCGTTCTGAATGCCTGCACGTCGCCGATCAGCTGGGCTCTGACGAGCTTCATACATCGACCAAGACCCTAGATTTCACACGCGCGTCATTTCCTATCTTCGCCGATCGAGATCCAAGGCTTCTTCATATAACTATGAGCGTGACCCACCGTCTTGAGAAAGCTCTAGAAGAGAAGAACAAGGGACGGGAATCTGCTTTCTCACAAACAAAACTTTTGTGGTACTCAAGCATTGGAATGGCTTTTGCAGAACTCCTGATACAAGTCAGCGAGAAGTTGAACGCAAACGAGGACGCCAGCGTCAGCTACGCGTCTTTCGATGGCCACCGACTCTACAGTTTCCTAGAAAGGAGAGGCCAAACCCCGAGCGTTTTGTTTGAACTCGGGGCATATTTTAGTTGGCCGGTGGAGCAACATATGGAGGATGAGTGGCGCCCATAGCCATCACGCCACCCTCACCGACAGCTGCTCACCCGCCTCGCCCATCTTTGCGCCGCGCACCTTCTTGCCAGCCTGCAACGCCTCCTTGATGGCGGTCTTGTCGGGCGATGTCGAAACGCGCACGTAAGCCTTTGGCAACGCGGCTTCGTCGATGATCTCAACCGATGCAGCCTTCTTGCCGATGGATATCGTCGCCTCGGCCAGCGGCACGCGCGGCACGCCGGCAGCCTTCAGCAATTTTAACATCAGGCTTCGCATGGCCTCCTTGCGTCGCTCTGCGCGCGACTTGCGCGCTTGCAGATCGGAAATGCGGAGCGCAACTGACTTTGCCAGGCTGTCGGCGTCGCGCTCGCCGTTGACGAGGCGCGTCAGGACGGCGTGGAAGTTGGTCTCTCCTTCCAGCATGTCTGCGCGCAGTTCTTCGTCTGTCTCAAGCTCCGGATAAGCGGCCAGCATGTCGGCGAAAAGCGCTTCGAGGTTGGCGACGTCGGCGGCCAAATAGTTGTCGTTGGCGGATCTTCTCTCGCTCATTTGTGTCTCCTCAATGTGGTGACGCCATTGGTGTGGCGTAGAGTTTGAAGCTATACTGTTTTTACAAATTTGTCAAGATATAACATTGAATGCAGTTTTTGTGATTGCGCGAAGTTGAAACCACCCACAACTTTTCCTACGTATTTTGTGTAATCAACTACAGGAGATGAAATTGGGGCTTCGGGACAATCTGAAAAACATTATCGACGATCAACAGAGAGCGGCGGATGAGGCGGCAGAGCGAGCAAAGCGCGAAGCCGCAATCGCTCGGGCTAAATCTCTCGACACGGCAATAAAGTCTTTTTGCCGTCTCCTGGATGAAGGCACGGTGCAGGAGATCATTGAACATCACATTAAGAGCATCCGTGGGATGCGGGTAGTAGTACTCGGGTGTTCCCGTGACCGAGCTGTGATCTACTTAGATCCGGAAAGCAATGTTCCTCGTCAGTTCCCGCTAGACATCGAATTCAATGCTGACCCATTTGTCATGAATGAAGCTATCGAATCCGTGGAGGTCCAAGCCGCGATTGAAGCTCTGAAGAAGGAGGGTGTGATTGTGACAGCCGGACGAGAGCAAGGAAATGGCGCAATCCGCGTGACGCTTGACTACAGGAATATCTAAAATCGTCGGCGGGGCAGAGCCCCGCCTTTCCCCTAAAATGGCACGTCATCGTCCATCAACACCCGCCAATCCTCTTCTTCCGGCGCGTTGTCATTCGCCGGCGACACGCGATTATCATTGGCCGCTCCAACCACGTGACCCACGACATCCCAGTATTTCTGGCGCGGCTTCACCGTGATCTCCACCGTGTCGGCCAGCTCTGACTGACGCTCAATCCACTCCAGAACCGTCTTGGGAAACGGCATCTTGCCGCCATGCGCGCGCCAGTAACGGTCGGCCTTTGACTTCGGGAATCCGCTATGTTGCGGGCAAACCCACTCGTTGATCGCCGTCATGCCGACCATGTAGCTGACCTTCACCGACGGCGGCTTATCGCCCTTCCCTTCGTGGTAGTAGAAGCTACGCGAAGTGACGGTGCGGGGTTCTGGTTCTGCCGTCGACATGATCGGCGTATCAGCAGCCGTCGCCGTAATCTTAGGGCTGTCATCGATATCGAACTCATAGCCGCAGCAAGAGCACGTGCGCGCTGAGGCATGCACCTTCTCGCCACAGCCGAAGCGACCATTCTTGTCCTCGACGTCGAACGGGCAGACCTTCACTGGCGCTTCCCCGTCGCCCTTGTTGGGCGTCTTTGGCTGCACCATGTCGACCGGTCCGTGCTTGTCGACGAGACCGGCGAAGTCCAGCACAAGGCACGAGGGTTTCGGACCAGCCTTAATCGCGGCAATGCGCCCCTGAGGCGTGTCGAGAGGCATGCCAGGCGCGTATATCACGCGAGTGCCGCGGCCCATCATCTGCACGTACAGCGAAACTGACAACGTTGGCCGAAGGGCAGCAATCAGGTCGACACCCTTGTGATTAAATCCGGTGGTCAGAACCGAGTTGTTTGTCAGCGCGCGGATCTTGTAGGACTTGAAGTCCTCGATAATGCGACGGCGCTCATCCTTCGGTGTTTCGCCGCTAATCATCTCGCAGGAGATGCCGCGTGAACGAATCTCGTCACGCACATGCTCGGCGTGCTCGACGCCGGAGCAGAAGCAAAGCCATGACTTGCGGTCGGCGCCCTTTGCGACGATCTCATCGACGGCGGAACGCGTGACGTCCATCTTGTCCACGGCAGCCTGCAGCGCGGACTGCTTGTAGTCGCCGCCCTGCCTTCCGACGCCCTTCATGTCGAAGGTCGTGGCAGTGGCCTTGGACGAAAGCGGCGCGAGATAGCCGTCAGCAACGCCGTCGGCGATGCCATAGGTGTAGACGATCTGGTCGAACAGCCGATCGTCGCCCTCATCCAATCGGCCTGTATCCAGCCGGTAAGGCGTGGCGGTGAGGCCTAGGATTTTCATGTCGGGATTGATGGCACGCAGGGCCGCGATGAAGCGACCGTACATCGTGTTGCTGTTGGCGGGGATCAGATGGCACTCGTCCACCATCAGCACGTCGATGTGCCCGATGAGCGCAGCTTTGCTGTGCACGGTCTGAATGCCGGCGAAGATGATCTGGCTGCGCGCGTCACGACGGCCAAGGCCGGCCGAGAAGATCCCCGCCGGAGCAAATGGCCAAATGCCGAGCAGCTCAAGGTAATTCTGCTCGATTAGCTCAGCGACATGGGTGGCGACAAGGATGCGCATGTCCGGCCAGCCTTCAACCAGCCTCTTGATGAGCGAAGCCATGAGCAGCGACTTGCCGCAGCCGGTGGCAAGGTCGACGAGCGGATTGCCGGCCGTCGTTGACCAATAGTCGAAAACGGCGTTTTCTGCTTCTTCTTGGTAGTGGCGCAGTTGGAGCATACGAAATTCCAGACCTTGAGATAAACGATTGGATTGCAGGGTTATCTGCGCTCGCCGCTATGCTCGCGGCTGCTTTTGCTTTCTCGTCGGCCCGCCAAGCCAAAAGACAAGCGGACGCGGTTCTCGGAGACGTAGAGCCTGTATTCAGCGCCTATCAGCTACCAGACGATGGGAAAAACTATCGCCACAGGGTCGCCATCGAGATAGTCAACCACAATCGGCTACCGCTCTATGTCCAAAGCATCAGGTTGGAGTACCCTGACTGGGTGCTTATTCACCGTGGAGCGGAAGAGGTTCGCGATCTTGTTGCCGCACTTTACGACGTAGTCATCGAGAAAAAGCGTGAGCACGTATTTGACGTTCCTTTCCGTCTCGCTGGGCGCTTCTCGAGTGATATGCCTGCAGTATCCACAAGCGTTTTCAATGTGAGATTTCTCGACCAGAACCAGCAGGGCGGTTTTGTACTAGGCTTCATCGTCCATTATCGCATGCATGGATCGAAGCGGGTTCGAGTTGCGTTTGCCTCGACAGGATTCGACGTTTTGGACTAACCAGCCTCCCGCAGCAGCCCAAGCACCCTCGCCCGCTCCTTGATGACGATCTGGCGAACGCGCTCTTTGGTGAGGCCGTGGTCATTGCCGATGGCCTCCAGCGTCTCGCCCATCGCCCTGCGCATCAGCATCGTGCCATTCCGGCCTTCCAGCAGGGAGACAACGCGCGATAAGTCGGTGCCCTCCTCCTGGTGCGGATCAGCAGAACCCGGCAGCTCTTCGAACGCGGAGAGACTGCAGACCTCAGCGGACCGCGACTTGGTGGAATTTGTGCGAACGAATTCCTGCGCCGTACCCCGAACACAGAGGACGGCCCATGTCCAAAAGGTCTCAAGCCGGCATTCCCGATGACGGCGCAACATGACGACCATCGCCGACTGAAAAAGTTCGTCGGCCGCGTCTTCGTTCTTGGTGATCTTTCGCGCCAGTCTTCGTAAAGCCGGCTCGTAGGCCAGAAGCTTGCGGTCGAACTCGGGACTGCGCAGATTGTTATCATTGGCAGCGACAAGCGGCGTGCGTGCAACAAGCGGCATGGTGGTCTCCTCATAAGGTGTTAGGCGTTGGTGGCGCCATCTATCCATGTCGTGCCGTCTCGCAGCACATAAGTGATTGTCTCAGCTTCCTCATCGCAATCCGTTTGCGTGCCGGGGACGAGGGCCGGGATCGTCAGATGCGTGGGGCAGCCTTCCTTTTGCTCGTCGAATGAAATCGGCTTTGCCCAGCGTGCGCAGGACCAATGACCGCCGCCGCCCATTTCCGGCGACGAGTGAATGCAGGATCGACAGGTGACGCGTGGCCATGCACTCTCCTTGCATACAGGCTTGTGCTTGCAGAAGGTGCATTCGAACCAGTCCGGCGCATCGTTGATGCGCGACGGGGGCTCTGGCGAATTGATGATCCGTTCTAGCCGCGCCAGCAGACGCAGACAGAACTCCGGATCGTATTCGATGCGCTCGGCATAAAGCGTGTCATCATCCTTGCAGCTGACGAGGTAGAGGCAGCGCGATAGACCGAACGCGTGCATTCCAAGCTGGCACTGTCCGTAGTGGAGCGGTTTCGCTTCCTTGCAGCCGTCCTTCACAATCAGTTTCATGCCCTTGGCGTTGCTCGACTTGAATTCGAGCAGGTGCTCTGTCTTCGGAGCCTCTACGACACCCATTGCCTTGCCGTCGCACTTTCCGCGCACGTGCCCCTGCACCAGCCTGATCTTGTCTTGCTGGCCGTAGACATCGACGCCGATGCGCTCAAGGTCTTCAACCAAGCGCGATTCCTCAAGATTGCCGGTTTCGAACAGACGCAGCTGGCGGCCGTGATGCTTTTCCAGTGGTGAGGCCCATCGGAAGGCATACCAGAGAGCGCGATCGCATGGATTGTTGGCCTCACCAACGGATATGCCGAGGCTATCCCATGACGAGGCTGCGGCTTCGTAGGCGGCGTAGATGGCACGGACTGTGCTGGATTCTGGCTTGGGTAGTGGGGCCATTATGACCCCACCCCAGAAGAGAACATAAAAACGGCTACCATCAGCCGACCCTCATCGGCATGCAAACGAGCGTCAGCCCCTCGAAGCCGTCGGACGTGATCAGCCCCGGCGTGCCGCCATCCTGCAAGGCCAGCTTGACCGGACCAGACGGCAAAACGTTCAGAACGTCGCGGACGTAGGCAGCGTTGAATCCGATATCCATCGGCTCGCCGCTGTATTCTGCCTCGACTTCGTCATTTGCCGACGCCTCGCCAGCCGCGACAGCAAGCGCGATGCTGCCTGGCGCGATGCTGAACTTCACGGCACGGCCGCGCTCAGACGACACGGTCGAAACGCGGTCGGAAGCCTTCATCAGCGCGTCACGATCAACGGTGATGACACGTTCGTTGCTCGTAGGAATGACGCGCTCGTAGTCAGGAAATGTGCCGTCGATCAGCTTCGACGTGATGCGCACGTCGTCCGACACGATGCGGATTTTCTGCTGGCTGACGGCCACCTGCACCTTGCCCTTCGGAAGCAGGCCGACGGTCTTACGTGGCACGATGACGCCATCGAAGGTTGGCAACTCAGGGCCGATGTTACGTCCGAGACGATGGCCGTCGGTGGCAACTGCTTCCGACTTGCTGCCGCCCTTGAAGAACACGCCGTTCAGATAATAGCGCGTTTCCTCTGTCGAGATCGCGAACGACACCGGTGCAAACAGTGCGGCCAGATCGATCCCGAATTCAGCGTCGAACTTGTCTTCGCCAAGCGTCGGGAAGTCTTCTGCCGACAGCGTCGAAAGCGAAAAACGCGAGCGGCCGGACTTCACCGAAAGTTTGTCACCCTCCAGAGACATAGTAATGTCGCCGGTCGCCTTGCGGGCGATGTCGTTGAGCAGCTTGGCGCTAACGCAGATGTTGCCGGGCTTGGCGACCTCCGCCGGCACGCCTGCAGTGGCGCTGATGTCGAGGTCGGTTGCAGTGATCGCGAGGCCATCGCCTGCGGCTGCAAGCTGGACACTCGACAGGATGGGGATGGTGGATCTTGCTTCGACGACCTTAGTCGTGGCGGCAAGCGCGCGCGTCAGGTCTTCCTTGTGAATGACAAGGTGCATGGGTGTCTCCTCGGTGATGTGTGGTGTGCCTGCCGTGGTGAGCGGCAGGCAGGTGAGTTAGGCTGCCCAAGGGGCCGGTTGACGCTTCGGCGGTTCGTAGCCGATTGGATAGCCGAAACCATCCTTGCCTTCATCGTACCGGCTCTCGTCGAACTGCTCGCCCTTCAACCCAGCGATCTTCTTGAACAAACCGATCGCGCTGCCTGGCTGCTCTTCGTTGTCGTTTGCCACCCACGATTCCTTGCCTTCATCATCGACGGTCAGGGCAGATATTGCCGCATCGAGAGAGAAAGGCGTGCCTTCCGTGCCGAGAACAACCTTCTCGCCATAAGCGCCGCCGATCTGGTCTTTCAGCTCCTCCCACGTGTCGATCTTCACCTTACGTCCTGCCAACACGATCTTGAGGATATCGCCCTTGCCGATCGTGTAGCCACGGTCCTGGTACTTGAGGACGCGAGTCGCCGACGCCAACGGATAGCGAGTGCCTGGATGGAATCGCAGGAATCGCTGTGAATTGTGCTTCAGGAAGTCGTCGTGAAATGAGAATTCGTCACTGTCGAAATCCAGCGCACCCATGACGGCCGTGAAGTCGAACGCGTCGAAGATGTCCTGCGCCGTCGGGAAGAAATCGAAGTGCATGAACTGGATCGGTGTGCCGCCGCTTTCTGCAAACGTCACGGCACGCTTTGTCGTTGAGACGCACCAGAAACCATTTTCGTAAGCGTCTGCGACTGCATATTCGAAGGCCTCGCGGCTCTTGAAGTAGACATCAACGTCGTTGATATCTCGATTGGTGAACACGCTTGTCACCGCTCCGCCGGCCGCGAATGCGCCGGGGATCGGGTAGCATTTCTCCGTGATCTTTCTGGCTTCCGCCTTGTAACTGCTCATGCAAATCTCCTCAAACGTGGTGAAATAGCGGGCCGCTGGTGAGGCAGCCCGCGTTGACGGTTACTTGCTGCCCCAAGGCCGACGCGTCGTGCCGGCGGCCGCAGCGGCAGGCTTGTTGTCATTGCTGGCGGCTGTGCGGCGGTTGTCGTTGGCTGCGGCTGGCGCAGCATCGACCTTCGGCTCAGGCAGGTTGCCCTCATCGGGGTAATAGTACTTCTTCACCTCGTTGCGGGCCGCGTACTTCGGCGTGCCGTCGGCGTTCTTCTCCTTGCTATCCTTCCCCATGCCGATGCGCGCCATGAAGGAGATGAAGTGGAGCTCGTCGGAGTCTTCTGGCGCCTCGTTCAAGCCGAGTGAGCGCAGGAGGCAGGCAAATTGGCGCTGACCGATTTCCTGGGTCTGAGGGTTTGGATGCTGCAGATTGTAGTTGTTGAAGACCTTGCGGCCTTTGAGTTCCTCTGGAGCAAGGACGTCGATGGTCACGCTCAGATTGATGGCGTGGTCGCGGGTGTCCTTGTTCTTTTCCTTGATCTCGGAGGCGCTGATTTCCAGCTGGTAGTCGCCGTTCGGCAGGTTGGTGAAATCGCGCTGTTGGGTGTTCTCTTCGGTCGCTTCAACTCTGACGCCAATCTTGGCCATGCGTAGTCTCCTTCAGTGGTGGTGTGGTGGTTAGCGGAGGTAGTAGCGGAAAGGTCCGCTACCGAATTGCGTCGATGCCTGGTCACGCCAGACATGAGTGAGCCAAACCCAGATCAGCTGGCCGGAGCGAGTGCGCGCCTTGACGGGGTGCCACGCGAACCACGATTCGCCGCGGAACATGGTCACGCCGCCACTCCCGTGGGCGCCGGGAAGTGCTTCGCCAACTCCGCATAGCCCTGCCCTTTTCGATAAGGCACGGCGTCCGGCATCGAGTAGCGATTCTTGGCGTTGAAGCCTGCGCCCTCGACCAGATGGATCTGGCGTTCCTTGCCGCCCTCAGCGTGAGCCACTTTCGTCTGCCGCGCGACTTCCTTTTCCTTGATGGAGATGCGGTAGTTCATGAAGGCGACGATGTCGGACTTCTCGCGGACCAGCGCATTGGCGCGCTTGTGCAGCTTCGGCTGATAGCGGCTATAGGGGTCTGTGACAGGGCTGTCGAAGCGGACGATCTCGGGGTGCGCAAGCATCACTACGCAGATGCCTCGTTGAGCCAGAGCAGCAACCGCTGCCATTAGCTCATTCCACTCTGTGTCTGTTTCGATGTAGCCGCGACCGTAGCCAGGCTCCTCGATCGATGCCACGCCGATGCGGCGGCAGGTAGCTGCCCAGACCAAAGGCTCCAGGCCGTCGAGGCTGTCGATAATAACCGTCTTACGGTCGTGCTCCTCGGTGAGCAGTTCGCCGAAAACGTCCAGCAGTTCGTCGAAGCTTTCAATGGTGCCAGGCGTGACGAGTTCAACGTCGGACGGTGTGCGCTCGCCTTCTGTTGGCAAATAGAGCGCGTCAGGAAATTCTGCGGCAAGGCTGGTCTTGCCTATACCGTCGACGCCGTAGAGCAGAATGACAGGCGGGTCCGCTCTCTTCGTCGACTTCAGGGACGAAAGTGAAATAGCCATAAGGCCTCCTCAATGTGGTGCGGTTAAAATGATGGCGACGATGTAGGCAGCGGCTGTGATAACGATCAGCGCGCGAAAAGACGGCACAAGAGCCACCAGAATGCGCTGAGGACGGCGATCGCGACGAGTGCGAACGGAAAGAGCATGATGAGGCCGACGGCCGCTGCTGCGGTCAGCGCCGCGATGCTGCCGCGCTTCAGGCCACGCGTTACGTATTTGCGCGGCGTCGGTGTGACTGGCACGTGGTCGAGCGGAGGTGCGGTGATGGATTCGGTGTACCAGGGGGATGTCATGCCAGAGCTCCCAAGAAATAGAGAACGCCAACAGAAGGCAGGACAGCCAGCCATAGTACAAGTAGCGCAAAAGCAGTTAAGGTTAGCGCAGCCTCAGAGAAGCCAGACCAGAAGGCTGGCCAGTTCGTTGCGGGGCGCGTCATGCCGCAATCCTCACCTCGTCAGCCAAAGCCCTCCGACGAGCCGCGCTGTGCATGTAGATCGGCAGGCGGTGTTCGCTCACCTTTGCATTGGCTGCATCGCGTGCCGCCCGCTTGCGATCGCCCCTCGTTGTTCCATCAGCAAACTTGCGCACCTCTTTTTGGAAGTTGCTGAAGCCATCGCCGCCGGTGAGGACGGACATCAGGTGCATGCGACGGTCGAAATGAGCGGCGTCGGTGGAAGGCATGCCGAGCTTGGTGTCGTTCTCATGCTCACGCTGCGCAAGCTTATCGAATACGTTCTTAGACATATTAGTCTCCTCAATGATGTGGTGGAGCGCGGCTGGTTGGTGGCCAGCCGCTAGCGCTGGTTAGGCTGCTTGCTCGTATTCGATCCACGCCTCGACGGCGTCTTTGGCGCCCTTCAGTGTGAGACCCGCAACCGCCCGCAGCTCCTTGATCGCATCGATCTTGAGGCCCAGCGCAGCCATGTTCTGCCACTTGTGGTCATAGACCGGCGCGGCTTCTTCGTGCGTCGTGGTCAGGGTGAAGACGCCGAACTGCTTGCCTTTGTACTTTGCCGCGAGGCGCTTTGCTTCCTTCTCGGCTGCGACGGTGGACGCGTGTACGTGCGGTGTCGACGATGGTTTGGGCTGGCCGTTTTCGATGAGGGCGACGATGGTGGTGGTGGTGGTGGTGGTGATTAGTTCGATGCTATCTATGGGGTCGGTCGCACCCCAACCCTCTTTCGTCCAGTGCGTGCGAACTGTGCGCTGGTCGACAACCTCTTTCACGACGCCCTCAAACTTGAGGGTCTTGTGGCGAATAAGGTCACCAACCTTGAACTTCGGCTGCGCGTTGTCGTTGCTTGGCTTGGCCACGGGATCGCCGATCCATTCGGCGATGATGTCGTTCTGCTCATGACCTCCGACGCCATTCCAGTAGTATCCATCGCGCCGCCACTGCTTGCCGTCAGCCGCGTAGATGTTGCCCCCTGCGTAAGTCACCGGCCCAACCTTCCGGCCATCGCGCGTTTTGTAGAACTTACCTTCCTCGATCTTGAGGGTGGAGGCAGAAGGTTCGAGGTTCGAGGCCTTAGCCCACCACCGTCCGGATTCAGAGAAATCCACATAGTACGTGTGGTCAACGGCGTCGTACTGGATGACCGTGCCGAGCTCTCCGGTCTTTCCGTATCGACCGTGCCCGTTGTAATCTCCAGAATTGATCAGTCTAACTTCGTCACCCTTCTTAAACTTCCCCATCACGCTGCTCCTTCCGTTGTTGTCTCGGCGGTCATCGCGCGCCGTTGTGTGAAGTCGACCTTGACGACGTTGGTGTCGTCATCCTCTTTTGCTGGCGGCTCGCCCTCCGGATCGTGTTCGATCTCGAAGCCGTGCCACCAGATCGTCGATGCGCCGTCAGCAAGTCGCACCTGGTACTCGCTGCCCCAGTTGCGATCGCCGATGACGACGCCAGTGAGATGAGGGTTTTGTCTGTTGCGGACCGGATCGCCGAAATTGAAAAATTCGCAATCGCAGCTCATGCCGCCACCCGCTCACTACGCAGTGCGTAGTCGTTGACCGCCTGCCCCGCCGCCAGATCTTCAGCATTGTCATTTGCCGCGGCAAGAACGCGCGGGAGCGAAACCGGCATAAGGCCGGAAAGCGTGGAGCAACCGCCGTTGTGCGGCGCCAGGTGCGTCGTGCGGTCGGGATTGTTGTCGTTGGCGGCAATGATGGCGCGCTGCTTGTGCAGGCCGTATGTGCGCCGAAGCCGCTGGTATGCTGCCATTGGCTTGACGTTGTATCTGGCGGCGATATCTGCGACGCTCTCACCGTTTTCACGGCGCGCGTGCATATCCGCCAGCATTGTGCTGGTGATAAGCGTCATGTCGTCTCCTCTGGGTGGTGGCCGCTTGTGGTTGACAAAGCGACAGGGTTATGAAACTAAACTATTTTTACAAATTTGTCAAGATTTCACGAAGGTGGATGCATGCCTGAAGTTAAGTCGCGCCTCCTGCAGTCGATACTGAAAGAACAGAAGCGGAGAGCCGTAAAGGATCGCGCTGTCTACGAAGAGTTAGGCGTGCCTCAGCAGACCTTCAGCACCTGGAAAGCTGGCGTCATACCGAGACCAAGACAATTCCCGGCGATTGCCGCCTTCCTCGGTGTCTCCGAAGAAGATGTGGCGGAGATGGCTCGCGAGGCGGCCGAAACAGCCCCGTCCATCACGCCGATTACGGTTGCCCGTACCTACGGCAAGATTTCCGACCGCAAGGCTGGCAAATTCAAATTCGAACCAATCAACGATGGCCGCAAACGTATTCCTGAAGGCAGGTACGCGATCATCATAGACACGAAAGTGATGGAGCCTGTCTTCCACGTTGGCGTGAAAGCTTGGCTTGACCCTTCCCGCTGGCCCGCCGCCGGTGACGACGTTCTGGCCCATTCTGGCGGATTCGCCTGGATTGGGCGCTTCGAAGGAATGAGCAACGGCGCCGTTCAGCTTGGCCACTACGATGGATCGCAGCTTGAAGTGAAGAACGTGGAAGCTGTCCATGTCATCGTCCTTTCGGAGCGGGTGGTTACAGCATAGCGAGGTGGTGGCTCGGAATGCTGCTTGACAATTCCTACAAATTTGTGTAGGAGATTGATCGTCCGCTGTGGTGGCGGATATGGAATACGCGCTTTGATCCCGCCTTACGGCGAAGGTCTCCTCGGCGTGTTAGTACGGAGAAGGGGCGGAGTTACGGGTGGTGCCGGCTCATTACGCCCCTTTTCGTTTTCTGCTGCATGCACGACAACGCCGGCAGAGCTTCATGCTCCACCGGCGCTAAGTTTTCGAAGTCATTGTTTGCGGTCTTTCTCATAGTCGTCTCCCCTTCTGGCATTATTGGTTTTGTTATCGCCAGTTGGTTGCTGGCGTTTACGGCGGCCCGGTCACTGGCCGCCATCTTCTTCTGCGATCGGGAGCTTCATGCTCCCATCTTCTTCTCGCCGTCATGGCGGTCATGCCGATCAGCGCTTGGGTCGTCGATCGTCTTCTTTCGCAATCCTTAAACATGCGCATTCCCTTTCATGCTGCAGGTTCGTCTTGGGGTCTGTTGCCCTGTCGGCTTCTTGTTTGGCCGCTGTTGATTTTGGTTTTACCCGTATTTGGGTTGGTCGTCAACCCGAATATGCATTTTACTTGCATTTATTTATGGGTTATCTGACATCATGGATACTGTGGCCCTCACTCGAATGCTGATCTCACACTACGGCAATGAAGAAGCGTTGGCCGCCCTAGCCGGCGTGGGGCAGTCATCGGTAAACAGGTGGGCGAACGGCGGAAACATTCGGGGTAACCACCTTCTTCGCTTGATGGAACTAGCGCGTAGCGTTCCGTCTGCCGCGGACATCATTGCTGACGTTACTGACGTCGCCTCACATAATGAAGGCTTCACGCCGACGGTCGTACCTGGTCGAGAACTAGTAGGCGAGAAAAATCTCCCTGTTTTCGCAGCCGCGATGGGCGGAGAGGGCCATCAGATCGTGACATTTGATGCGATCGATTATGTGAAACGACCATCCATCCTAGAGAACGTCAAGGATGCCTACGGCGTATACATCGTCGGAGAATCGATGACTCCCGCCTTCGAACCGGGAGACATGGCGCTCGTGCATCCACACCTACCTCCGGCGCGCGACAAGAACGTTGTGCTTTACCATGTCCCTCCATTTGGGCACGGAGAAGCAGAAGCGATCGTCAAGCGCCTGGTCAGGTGGAGCGACCGAGACTGGCACCTAAAACAGTACAATCCGAGCATCGAATTCACCGAATCTCGCGCGGACTGGAAGTGGTGCCATCGTATCGTAGGCAAGTATGAGGCTCGATAAGCGGCATCCGTCACTCATAAGCAGGCAAATCCGCAATATGCTCTGGCACCTCACCGGCGCCAGCAACTAAAAAAAGCTCACTGGACGATGCGGCTATAGCGACTACGCAGACTTTCTCTGGGGAAATCCTGTCAACGATTTTCCTTGCGTGCGCGACTGAACCAACCTGCATTGGAGCATCCGGAGATACCCCTCCTCGCGTTGAACGCGTGAAACTCTGCACAATAAACACTTCGCTCAAATCCATAACCCTCCATTGCCAGAACACTGACTCCAAAACTAGAACAAAACAAGAACTTTATTCCTGATTCGCGCCGCTGCATTTTTTGTTATGCACATTAGGTTGACAAATAACCCGTATACGCATATCCATTCTCTCGCCAGCACGAAGAAGCCCACCAGCAGATCTGCCGGCCCACCACCACCACAAGAGGAGATACACAATGTACAGACCGCGTCCCGAAGAGTTTGACGATATCCATGTAGCAGCGTCCAGCGTCGAGCCAATGCTCCGCCCGGACCACAAAGCGAAGAAGCACGGCAAGCCCAAGACGAAATATGAATACCTGCGCCGCCTCCCCAAGAAGCCGCGCAATGGCGAGGAAGTCGGCGGTGGCCACTTCGTATTCCGCCGCGGCGACAGCACTGGACGCATCCGTCCCTGCATGTGGCCCTTCGAGCACCCCTCCTACGATTCAGCGCTGGTTGAAGCCGCACGCCTTCACAAAGAGCATGGCGGCACCTTTGAAGTGTTCGTTCGCGTCGGCCGCGTCGAGACGCTGGAGGCCGGCGAATGAGCGCGGCTGCCGCCGAGAAGCCGACCGTCCAGCAAATCATCACCGAAGAGCACGCCGACCTGATGAAGGACATAGCCAGTTTTCTGGCTTTATCGCTCTTCATCACGGCAGTGCTGATCTGGGGTTATTGAGGGAGGCCGGCATGGCATATGGCGACTACAGCGGTCCGGATAAGGCCGACAAAGGCAAAGAGGGAGGCAGCTGCAACCGGCGTAGATGCCAATGCGCCCCAGCCAACTGGTATAACCACGGCTCCCTTTCTTGGTACTGCGAGGACTGCAAGGAGCAAATCTACGACCCGATCGGCCAGCGGTATTGGAAGCAGGACTTTCCGAACGCCACTCACCCGATGTTCGAAACACGCAAGATGATGGACGACCGCCAAGCGTCCTAA
Protein sequences of DBSCAN-SWA_6 >NC_022535|1783178:1822766|1806447_1807851_-|WP_048902834.1|DBSCAN-SWA MNVALSAQANAYAAFLARKAILDPPTGLTEIPDLPSVLFPFQRDIVTWALKRGRAALFAGTGLGKSFMELAWGQAVSASTQGDVLHFAPLAVAAQMVREAEKFGIPARHVRSQSDIGSGINVTNYQKIDAFDLSQFSGVILDESSILKSETGHYRNELVEACHGIPFRLAATATPAPNDFMELGNHAEFLGIMSYSDMLATFFTHDGGETQKWRLKGHAENDFWRWMASWAVMLRKPSDLGYDDGAYLLPALHQIHHTVTAPVAVGDLVGSRASTLQERIKARRDSVDDRVAFAAAMTPKDRPFVWWCNLNAESEALTKAIPGAVEVRGSDKEDVKERKLIDFSEGRIRVLVTKPSIAGFGMNWQHCADTGFVGLNDSFEQIHQAVRRFYRFGQQKEVTAHFIAAETEGAVVANLKRKEADADRMAAAMVLHTANITKQAINAQAREKASYDPKIPMQIPSWLGGAA >NC_022535|1783178:1822766|1793871_1795515_-|WP_022556421.1|terminase|DBSCAN-SWA MLSEAVVGAIKCGPIPVLRDWRGLPTSELTRGEKMCRFIEEYLVVPEGALVGQPIRLLDFQVAFILSVYDNPNGTSRAYLSIARKNSKTATIACLLLGHVIGPEAFPNSRIMSGARSRDQAAEVFNYASKMLMMSPRLKGLYRIVPSGKMIVGLRKNVVYRASSAEAKSAHGGSPLVAILDEVGQIKGPHDDFVEAIVTSQGAYGDKAMIFAISTQAATDGDLFSRWLDDAETSKAPRTVSHLYTAPADCDVLDEEAWKAANPALGKFKSVSSVRDDAERAARMPTEEASFRWLHLNQRIDANAPFVSPAIWRACNARVVDFDGLPVFGGLDLSEVSDLTALVLMAPKEQEGKTIWHVKPTFWLPGDGIRAKAKADRVPYDVWHTDGHLEAAPGRTVDYEFVAHYLRDRFEEMDIRKIAFDRWNFRHLKPWLQKVGFTDDQLEGDDAVFQPFGQGFQSMSPALRELESIILNGNLAHGDHPVLTMCMMNATVKADPAGNRKLVKHNRERRIDGAVALAMATAMAGTYEGGDSGNLDDFVNNIISVTW >NC_022535|1783178:1822766|1805389_1806448_-|WP_022556434.1|DBSCAN-SWA MTNAQNTAALTADINAVNQVITDNYAIYEGDSCELIRAIPGDSVHFGIHSPPFEGLYKFSSFDRDISNNEGGAFWEHYAFLIQELLRVTKPGRIHSVHCMQLPTSKRRNGFIGMRDFRGEIIRAYEDAGWIFHSEVCIWKDPVVAQQRTKSIRLLHKQITKDSCISGQGLADYIVSFRKPGENSEPVDGMFDMWVGDESLDISREAYDRHAAETIADGRTPWSFEQWRSVFVWQRYASPVWSDIRQTRTLQYRSARDEQDEQHISPLQLDVIERCIDLWSLPGETVLTPFLGIGSEVYSAVEMGRKGVGFELKPSYFRQAVKNIASLGTKEQPVADLFSAANDNHAVAKVAA >NC_022535|1783178:1822766|1790269_1791478_-|WP_022556418.1|capsid|DBSCAN-SWA MADNQLADKIGELGTSLASIKEQVGNLATDFTTKLAANGEVSAELKEKTDKALSELGDVTTRLGDLEKRAARENEIGENEQKSLGDLVIDSAEFKAGMLTGASRGSIRVKADRAAITSANTTVGAGRSQGTSLVPGARVPGIFGLPERQLTIRDLVLPGQTASSSIEYVKETGYTNNAAPVAETTAKPYSDLTFDMTSAPVRTIAHLFKASRQILDDAPALRSYIDGRARYGLRFAEENQLLNGSGTGQNIHGLVPQATAFNPAFAAADETGIDRLRLAVLQVVLAEYPATAFVLNPIDWAKIELTKDAGGNYIIGNPQGSLTPTLWNLPVVSTQAMAAGEFLTGAFSFAAQIFDRMEIEVLLSSENVDDFEKNMFTIRAEERLAFAVYRPESFVTGDVEGA >NC_022535|1783178:1822766|1798195_1798669_-|WP_022556426.1|DBSCAN-SWA MGRATSQTTRINGKRVVIRTSPKGKVTVADAPIKESEGQAAQVRALRSLPEYGRQFLLAGDMNSAKRGPRAQADAIATGMTPGEADLRIYLKGGTLRMIENKVGKGRLSPAQVERHASLARLGHPVEVVRFTSTTEAADKAVALVKGWLADNDNHPT >NC_022535|1783178:1822766|1815729_1816308_-|WP_022556445.1|DBSCAN-SWA MAKIGVRVEATEENTQQRDFTNLPNGDYQLEISASEIKEKNKDTRDHAINLSVTIDVLAPEELKGRKVFNNYNLQHPNPQTQEIGQRQFACLLRSLGLNEAPEDSDELHFISFMARIGMGKDSKEKNADGTPKYAARNEVKKYYYPDEGNLPEPKVDAAPAAANDNRRTAASNDNKPAAAAAGTTRRPWGSK >NC_022535|1783178:1822766|1804788_1805163_-|WP_022556432.1|DBSCAN-SWA MTKLPTTPRQHTPTVDADHNPTTCFVCGMHAFGIGVNANGRDKDPNYICRRCAVGIDNYKKIDRLDDYELRALDAGVDAVGEYIAEHGVTDLAHFDELMQRMMVKAAWEGCARGLRAALSEAPF >NC_022535|1783178:1822766|1787083_1787443_-|WP_022556411.1|DBSCAN-SWA MADAEKPFPLEVNGARGEVGLFVGKVPLVIVAEMGGLAAVSSRLSCKSMSDLFLRLSGVEPAATVAALDLLTVRGDKVAAIGALKLKHFGAVAKAISEALSHHFDEDDEGNGEAALKAA >NC_022535|1783178:1822766|1783178_1785524_-|WP_022556408.1|tail|DBSCAN-SWA MAGNNSDDLIISISTDLATVKRALNRLVSDVGAASNGIEKRFAATGKSINNSLTTSMQDRINSMVGIGTTAAKEWNGVLADQQKELDRLRAKYSPLFATISNYKNAVAEIRQAHAAGAISANEMASAIQRERQAALASTAAIKGRNAALKATVTTSSGNSFNTANIAAQFQDIGVTAAMGMSPIQIALQQGTQLSAVLQQIKDSGQGVGQGLAAAFASVISPLSLVTIGVIAAGTAAFQYFSTIMSEGDKSAEVLKEQAALIAAVAERWGDAVPALRDYADQLKRAQDNADLTKGADIVNTNTLADVRKELESTRATIADLVSQLQSAGEEADVIKNLQSAFNDFAKAAEEGKAQTEDVDRVQAALSAAINSTGIPALAEFAKYFSTLSAAALTAADSVQKVNEVTSVATSRINDPRTWRGAGQQDSQFGADATIQGTQFPLPDNGPTPERRPSDLDTDKNRGFGTPKRPRAPKKTASDRFAEDLQAVRDRTEALRQEMNLIGLSNEAQVKRRTALDLEQKALADLREEARKKGEKDLESIKLSPDKIAAIEQESAAYARQSEALRKAQEEQQKLNEWNNVARDATRGFIDDLIHGESAADAFAGALSRIADALLDDVLNSIFKVNSAAGGSGGLLSGLFSLFGGGASRYAGLSGGLFSEGGFTGPGGKYQPAGIVHKGEVVWSQADVARAGGVGAVEALRKGYANGGPVGISVPSVPSLRSMSAQSAGVVVNFNPVVDNRGASVEAVARQEKALAKMQGELQSRVEAAVRSAQKRNVKLG >NC_022535|1783178:1822766|1789293_1789641_-|WP_022556416.1|head,tail|DBSCAN-SWA MPIVDLETVKKHLRVFHDDEDAEIGLYRDAAESIVTQHLDREVVAVGETPTVTDGIAATPAIVSAILLVTGDLYEVREPDPKATGDAVLPRAVRMLLAPWRVWRTVADDYVAPLP >NC_022535|1783178:1822766|1803961_1804783_+|WP_022556431.1|DBSCAN-SWA MSWGFDKGRIYNRRASIHAPFGGQQQGGIITPKKHNLVIIVTGEEGHEHGYNDTWSPDGRFDYFGEGQVGDMKMEKGNLGIASHSARGKSLLLFTKLKAGLRFESEMVYEGHKIVRAPDRLGNMRDAIVFALRPIDTVFEHVESVPVASPQTAKSLDDLKKRAFAASVQKPAKNQVTTSVFDRSRDVRDYVVARSLGKCEGCGNDAPFTRPNGVPYLEPHHIRRLTDGGPDDPRFVIALCPNCHRRVHSGSDGADYNGTLLEKMKTIELGVPS >NC_022535|1783178:1822766|1800213_1801581_-|WP_022556429.1|DBSCAN-SWA MDITQDEIDGLVSSLSESLSVEVKRWIDPRLPSGAAKIIQATIALRNRNGGYLVVGFDDTTLLPDLENELTDVFEAFHADVVQGIVSKYASIPFEVLVGFARFKDRPYPVIKVPAGVTVPVATKRELTVPGGNSVLKDRVYFRTLNSNGRVSSAPALASDWKDIVDICFDNREADIGRFLRRHLGGESLTALLQSFGAAPAPPTMLERLTVLMAENEDRRDQVLSSSDFATDYSPFADMGSWSAMAIIDPPISEGNDIDFLNKVASSNPRYTGWPVWLDSRNFYDQRWKPRKVNSHWETTILSGHVDFYRIRKTGEFYLWRILQDDLTDKVPPRTKLDPILVLWRVAETLAVLLAFARELGASDDAEIHVLFKWSGIAGRELEAWSDWSRYISAWDPSYVDSETGYVQMRCDTPINSLAPLIKSATDPMLSSFGGHSLSEKVIEGVVQKVLSRTF >NC_022535|1783178:1822766|1792617_1793847_-|WP_022556420.1|portal|DBSCAN-SWA MGFFERWVGRPIKLTDGEFWRGFFGLGTTSGETVTIESSLSLDAVWACVNLVQNAAGTLPCIVYGEDGVTVDKNAPLYELLHDMPNMDDTAPEFWSMAAMCLLLDGNFFAEKKMNGERLVALNPLHPLSVDVCRSKDGRNTRHYEVTEDGKKRRVPEGKMFHVRGVRLPGCDRGMSPIAVVRNTVGSALAGEKVAGRMFKNGLLSSLIVSSDQILKPEQRKQIADTLTQFAGAEKAGGVTVLEAGFKPYPMSINPKDAQFLEARQYSVEQICRIFGVPPVMIGHAANGTTTWGSGIEQLILQFTKTCMRPMLKRIEAAIYRDLLDAKTRKTTKVKFNMEELLRGDSTARAEFLSKMVTNGIYLVDEARSYEDKAPVDGGNKAIVNGTMTRLDTLGKTETPAPTPAARAA >NC_022535|1783178:1822766|1807958_1808708_+|WP_144115322.1|DBSCAN-SWA MVEKIKRYWWLAACLWALLAFVSWHFESTCGVFFFSACLSEYWAGIRWIALLKWVSPYQALLAGIAAVVGGYFVLLSQRLQIDEARRVGVASKDASFRAALATVRSECLHVADQLGSDELHTSTKTLDFTRASFPIFADRDPRLLHITMSVTHRLEKALEEKNKGRESAFSQTKLLWYSSIGMAFAELLIQVSEKLNANEDASVSYASFDGHRLYSFLERRGQTPSVLFELGAYFSWPVEQHMEDEWRP >NC_022535|1783178:1822766|1788811_1789168_-|WP_022556414.1|head,tail|DBSCAN-SWA MPKRRRAGAGSLGERIGFEAEGEGDDGYGGVVVGFAEQFVEPARLEPRVGSETVIASRLQGLQPYTMTVRSNERTRIITPAWRARNKRTGVLYAIKAAVNIDERNQWIELFVVHGEAS >NC_022535|1783178:1822766|1805159_1805393_-|WP_022556433.1|DBSCAN-SWA MTSAEMKEACNASLTGARELGLDESKASVSLVLPEGFKPPPRFPRGYLLQIKDDGSRLSSFPAEKLLAWVEWAEAQA >NC_022535|1783178:1822766|1795527_1795983_-|WP_022556422.1|DBSCAN-SWA MSEKKSRVDSVDEAIRIASAASEEIQFPENVPLDDGDVPFFKNVIAEYARADWSAHQLEIAAMLARTMADLVREQDLLRTEGSVAVTEKGTPVANPRKSVVQMHASSILSFRRSLALHARAVQGEARDAAKRRDHAKEIEAGASVDDELLA >NC_022535|1783178:1822766|1818068_1818959_-|WP_022556450.1|DBSCAN-SWA MGKFKKGDEVRLINSGDYNGHGRYGKTGELGTVIQYDAVDHTYYVDFSESGRWWAKASNLEPSASTLKIEEGKFYKTRDGRKVGPVTYAGGNIYAADGKQWRRDGYYWNGVGGHEQNDIIAEWIGDPVAKPSNDNAQPKFKVGDLIRHKTLKFEGVVKEVVDQRTVRTHWTKEGWGATDPIDSIELITTTTTTTIVALIENGQPKPSSTPHVHASTVAAEKEAKRLAAKYKGKQFGVFTLTTTHEEAAPVYDHKWQNMAALGLKIDAIKELRAVAGLTLKGAKDAVEAWIEYEQAA >NC_022535|1783178:1822766|1819782_1820292_+|WP_048902835.1|DBSCAN-SWA MPQQTFSTWKAGVIPRPRQFPAIAAFLGVSEEDVAEMAREAAETAPSITPITVARTYGKISDRKAGKFKFEPINDGRKRIPEGRYAIIIDTKVMEPVFHVGVKAWLDPSRWPAAGDDVLAHSGGFAWIGRFEGMSNGAVQLGHYDGSQLEVKNVEAVHVIVLSERVVTA >NC_022535|1783178:1822766|1817329_1817557_-|WP_022556448.1|DBSCAN-SWA MTSPWYTESITAPPLDHVPVTPTPRKYVTRGLKRGSIAALTAAAAVGLIMLFPFALVAIAVLSAFWWLLCRLFAR >NC_022535|1783178:1822766|1813722_1814778_-|WP_022556443.1|DBSCAN-SWA MHLVIHKEDLTRALAATTKVVEARSTIPILSSVQLAAAGDGLAITATDLDISATAGVPAEVAKPGNICVSAKLLNDIARKATGDITMSLEGDKLSVKSGRSRFSLSTLSAEDFPTLGEDKFDAEFGIDLAALFAPVSFAISTEETRYYLNGVFFKGGSKSEAVATDGHRLGRNIGPELPTFDGVIVPRKTVGLLPKGKVQVAVSQQKIRIVSDDVRITSKLIDGTFPDYERVIPTSNERVITVDRDALMKASDRVSTVSSERGRAVKFSIAPGSIALAVAAGEASANDEVEAEYSGEPMDIGFNAAYVRDVLNVLPSGPVKLALQDGGTPGLITSDGFEGLTLVCMPMRVG >NC_022535|1783178:1822766|1789961_1790222_-|WP_022556417.1|DBSCAN-SWA MTDFLEVKAKRTFAVGKELKTKKSDPFKVEAGEAKQLEAQGLVDIVGEAQAAVEDDATDEAADKPVISSARSTKKKDKPDAVNEGS >NC_022535|1783178:1822766|1817696_1818008_-|WP_022556449.1|DBSCAN-SWA MSKNVFDKLAQREHENDTKLGMPSTDAAHFDRRMHLMSVLTGGDGFSNFQKEVRKFADGTTRGDRKRAARDAANAKVSEHRLPIYMHSAARRRALADEVRIAA >NC_022535|1783178:1822766|1822514_1822766_+|WP_048902716.1|DBSCAN-SWA MAYGDYSGPDKADKGKEGGSCNRRRCQCAPANWYNHGSLSWYCEDCKEQIYDPIGQRYWKQDFPNATHPMFETRKMMDDRQAS >NC_022535|1783178:1822766|1809893_1811648_-|WP_022556439.1|DBSCAN-SWA MLQLRHYQEEAENAVFDYWSTTAGNPLVDLATGCGKSLLMASLIKRLVEGWPDMRILVATHVAELIEQNYLELLGIWPFAPAGIFSAGLGRRDARSQIIFAGIQTVHSKAALIGHIDVLMVDECHLIPANSNTMYGRFIAALRAINPDMKILGLTATPYRLDTGRLDEGDDRLFDQIVYTYGIADGVADGYLAPLSSKATATTFDMKGVGRQGGDYKQSALQAAVDKMDVTRSAVDEIVAKGADRKSWLCFCSGVEHAEHVRDEIRSRGISCEMISGETPKDERRRIIEDFKSYKIRALTNNSVLTTGFNHKGVDLIAALRPTLSVSLYVQMMGRGTRVIYAPGMPLDTPQGRIAAIKAGPKPSCLVLDFAGLVDKHGPVDMVQPKTPNKGDGEAPVKVCPFDVEDKNGRFGCGEKVHASARTCSCCGYEFDIDDSPKITATAADTPIMSTAEPEPRTVTSRSFYYHEGKGDKPPSVKVSYMVGMTAINEWVCPQHSGFPKSKADRYWRAHGGKMPFPKTVLEWIERQSELADTVEITVKPRQKYWDVVGHVVGAANDNRVSPANDNAPEEEDWRVLMDDDVPF >NC_022535|1783178:1822766|1789642_1789984_-|WP_006697512.1|DBSCAN-SWA MLSTKVRKRRVASYIGAGIVNGIGSPVNSVAPAITGTAQVGQTLTSTTGTWSGSPTYARQWFAAGVAISGATAATYVPVAGDVGKAITVRVTATNDKGSVPVTSAPTAAVVAA >NC_022535|1783178:1822766|1812195_1812732_-|WP_022556441.1|DBSCAN-SWA MPLVARTPLVAANDNNLRSPEFDRKLLAYEPALRRLARKITKNEDAADELFQSAMVVMLRRHRECRLETFWTWAVLCVRGTAQEFVRTNSTKSRSAEVCSLSAFEELPGSADPHQEEGTDLSRVVSLLEGRNGTMLMRRAMGETLEAIGNDHGLTKERVRQIVIKERARVLGLLREAG >NC_022535|1783178:1822766|1785538_1785913_+|WP_144115320.1|DBSCAN-SWA MKKPANGELGGDPMDNWLKALVACACVVIISGGGYYAWGEYSAHQRENERREKSNREASLKNQATLLKSKFSAAECIRMAKETLPDKKGEPVKTTKYNGDLSICDDLQMFDPTWRQALDMAGVF >NC_022535|1783178:1822766|1808711_1809239_-|WP_022556437.1|DBSCAN-SWA MSERRSANDNYLAADVANLEALFADMLAAYPELETDEELRADMLEGETNFHAVLTRLVNGERDADSLAKSVALRISDLQARKSRAERRKEAMRSLMLKLLKAAGVPRVPLAEATISIGKKAASVEIIDEAALPKAYVRVSTSPDKTAIKEALQAGKKVRGAKMGEAGEQLSVRVA >NC_022535|1783178:1822766|1788320_1788815_-|WP_022556413.1|DBSCAN-SWA MIKAKVLGREALTKKLNQVAPLANKYAAEAKLQIATEAADKISDRAPISNSATAGDYAASIQGAKISDRPSAKALVGASASKDPDATGVFAAWIWHFLEFGTRPHNVAKGGGTVAGKKQAAGAKMHPGTRAQPHIFPTWRAFRAKAKKRINDAVWRGVREAMKK >NC_022535|1783178:1822766|1820915_1821575_+|WP_022556454.1|DBSCAN-SWA MDTVALTRMLISHYGNEEALAALAGVGQSSVNRWANGGNIRGNHLLRLMELARSVPSAADIIADVTDVASHNEGFTPTVVPGRELVGEKNLPVFAAAMGGEGHQIVTFDAIDYVKRPSILENVKDAYGVYIVGESMTPAFEPGDMALVHPHLPPARDKNVVLYHVPPFGHGEAEAIVKRLVRWSDRDWHLKQYNPSIEFTESRADWKWCHRIVGKYEAR >NC_022535|1783178:1822766|1799036_1799984_-|WP_022556428.1|DBSCAN-SWA MIFELYSKRAAKKSGNVVDVYEYENIPQKLKVQTVQIFDEIFGNPDDYYRSDDRAVNVHRCYEEVVRILRREHGVMRLSGSKSYDDDFRLEYLNYILSSKSADFYLDAVEVGITAINVFCRSQSYRALEDAAHRCDEAIKEINYRIKENGVGFEWLNDQIVRIDSELVHAEIVKPALSFLNGAMYAGPRQEFLSAHKHYRAGEHKQAIVDCLKAFESTMKAICDKRGWTYDKGRATAKDLITICFDKGLVPSYWQNNLSNLKALLESGVPTGRNKNAGHGQGATPVDVDDSVVEYIMHMTASTILFLSKSEKELS >NC_022535|1783178:1822766|1801661_1803884_-|WP_022556430.1|DBSCAN-SWA MQTPLELAQYYVAQGWPVFPCRSHAEEHVDQATGEIITLGEKTPLTPNGFKGATRFPRIIERWWSDWPDAAVGLPTGEKTGFFALDIDNKPGGANGFDWLSEMEAEHGPLPDTARVTSPNGGLHIYFKYVVGTRNRGALGAGVDIRSEGGYVLAAGSTMANGRSYKWETDTREIADAPAWLLDLLLPKSAPAHTQYSLSAATNNAYVDAAVDRELADLAGAPMGTRNNALNDAAFSIGTIVGAGALSEAEARALLQDVARGWGRDWSRCCKTIENGLKAGIQNPRHIPEPDFPAHDNTRLVDITRMIQRGLEKGRLREQAAAVEADVVVEAVPQNRTDIPEQEVVSPAGDIEPANDNTPPSPITATAFKWIDPKTLPRREFAYGSHFIRKYVSVTVSPGGLGKTSSSIAEALAMVSGRALLGTKPPKRLRTWIFNAEDPRDEMERRIMAACIHYKLKPADLEGHLFLDSGREQELCVAIEDKKAGVRIQQPIVEAVVEQIERNGIDVMIVDPFVSTHGVNENDNGAIDKVAKLWAQIADYTNCSIDIVHHLRKVADREATVEDARGAVSLIGAARSVRVLNRMSEEQAGEAGIDKADRFGYFYTTYGKSNLTPLSHKAEWRHLVSTPLGNGTGLAQPQDFAPVVTEWQWPSAAEVAGELTEDQRASILAAVSASDYKKSPKAKNWVGGAVAYAVGLDLDDNVQRKRAASLVGALMREGALVEREERDPVRRELAVFVRAA >NC_022535|1783178:1822766|1812752_1813682_-|WP_022556442.1|DBSCAN-SWA MAPLPKPESSTVRAIYAAYEAAASSWDSLGISVGEANNPCDRALWYAFRWASPLEKHHGRQLRLFETGNLEESRLVEDLERIGVDVYGQQDKIRLVQGHVRGKCDGKAMGVVEAPKTEHLLEFKSSNAKGMKLIVKDGCKEAKPLHYGQCQLGMHAFGLSRCLYLVSCKDDDTLYAERIEYDPEFCLRLLARLERIINSPEPPSRINDAPDWFECTFCKHKPVCKESAWPRVTCRSCIHSSPEMGGGGHWSCARWAKPISFDEQKEGCPTHLTIPALVPGTQTDCDEEAETITYVLRDGTTWIDGATNA >NC_022535|1783178:1822766|1818958_1819240_-|WP_022556451.1|DBSCAN-SWA MSCDCEFFNFGDPVRNRQNPHLTGVVIGDRNWGSEYQVRLADGASTIWWHGFEIEHDPEGEPPAKEDDDTNVVKVDFTQRRAMTAETTTEGAA >NC_022535|1783178:1822766|1791489_1792608_-|WP_022556419.1|DBSCAN-SWA MKFEHLISAFLAEPWAIQREKLGVLADVLVARAEGEKLFSSEFAASIDDARAKEIAETSGSVAVIPVYGVLADKMDLFSAMSGGTSYAGIKRALHKALSNEDIKAVVLDIDSPGGTVPGTDELATEIRKLRGGDKPIIAQVNSLAASAAFWIAASADEIVVTPSGRAGSIGVYTAHDDLSAALEQRGIKRTYISAGKHKVEGNETEPLGKEALAHVQDGINRSYNRFVAAVAEGRGVTVSKVEDNYGQGRVFYAEALMDRGMVDRIATLDETLARYGADVEPAPVKRIKAANAAKADAAQTLVAKMTAGEQITKREFENGIRGLMGLSGSEAERAARLYLKDGQGAPDVDADAAALAAIDRLIAEAKSPLIR >NC_022535|1783178:1822766|1822018_1822372_+|WP_022556456.1|DBSCAN-SWA MYRPRPEEFDDIHVAASSVEPMLRPDHKAKKHGKPKTKYEYLRRLPKKPRNGEEVGGGHFVFRRGDSTGRIRPCMWPFEHPSYDSALVEAARLHKEHGGTFEVFVRVGRVETLEAGE >NC_022535|1783178:1822766|1787444_1787876_-|WP_013636135.1|tail|DBSCAN-SWA MADGQQIGRTLLIQIGDGETPEVFSNLCGLTTRSFNMSANEVDTTITDCVNPENTPQKTAEPGIKNRTFSGSGKFVKSASNTAFMTHVNDATKFNAKVIVPGLGTYTGPWFVSEFEFSGEMEGNMEFTATFVAAGVLTFVAEV >NC_022535|1783178:1822766|1796482_1796905_+|WP_022556424.1|DBSCAN-SWA MDIVGSISAVTSAISLVKELREIDAQFDKAEMKLKVAELTEALSDAKLGLVDVAQSLKEKDEQINKLKASIQRREETVERNGYRYRKGENGEPVGKPFCPVCLEEGNFILTVNSFDDGRPTKCPRCKANFGNVTGYREQS >NC_022535|1783178:1822766|1809411_1809861_+|WP_022556438.1|DBSCAN-SWA MGLRDNLKNIIDDQQRAADEAAERAKREAAIARAKSLDTAIKSFCRLLDEGTVQEIIEHHIKSIRGMRVVVLGCSRDRAVIYLDPESNVPRQFPLDIEFNADPFVMNEAIESVEVQAAIEALKKEGVIVTAGREQGNGAIRVTLDYRNI >NC_022535|1783178:1822766|1786883_1787087_-|WP_084317319.1|tail|DBSCAN-SWA MEEPFPWRDWQKIAFGGLGWPPVIFWSSSLTEFTLAVKGKAEANGAKKSVAPPSDEEMDELIKKYGG >NC_022535|1783178:1822766|1819236_1819587_-|WP_022556452.1|DBSCAN-SWA MTLITSTMLADMHARRENGESVADIAARYNVKPMAAYQRLRRTYGLHKQRAIIAANDNNPDRTTHLAPHNGGCSTLSGLMPVSLPRVLAAANDNAEDLAAGQAVNDYALRSERVAA >NC_022535|1783178:1822766|1797213_1798158_-|WP_022556425.1|DBSCAN-SWA MNEEDQIIDATGRVAFTGDQPHPYRTNTRFRKVDGKWEPIEETKTRDVLKIPQTAEERAAGRERAKAKAADEARKMAKVRRAIVKRQSIGDPSCVQSRGEDFPLLEALRRDEREDLVAVVLRYRRLVALCEAEPLKGLDYSKADGGEVVRETKRLTPESDIDRAAASDWKEMPSGEIKISTKVKKSKGSHSIPPRRTVVAANDNVASGSVIKTESLHVKITDEILNEYIDAKPILAELRSSLGPLVEPFEDAVLGGQSYSEIGKRRGEVLKPAVAGRALVGMAIGTVSIRWQEIDKREARLAELALSRFRDRAA >NC_022535|1783178:1822766|1816490_1817264_-|WP_022556447.1|DBSCAN-SWA MAISLSSLKSTKRADPPVILLYGVDGIGKTSLAAEFPDALYLPTEGERTPSDVELVTPGTIESFDELLDVFGELLTEEHDRKTVIIDSLDGLEPLVWAATCRRIGVASIEEPGYGRGYIETDTEWNELMAAVAALAQRGICVVMLAHPEIVRFDSPVTDPYSRYQPKLHKRANALVREKSDIVAFMNYRISIKEKEVARQTKVAHAEGGKERQIHLVEGAGFNAKNRYSMPDAVPYRKGQGYAELAKHFPAPTGVAA >NC_022535|1783178:1822766|1786090_1786801_+|WP_022556410.1|DBSCAN-SWA MHKQYHLENSTYPDTHRIYEERLSIAGIHHYRKDAISFCRSREKAIYFDLDAANPYDRNAIRIMGRWKGLWGTKVKILGYVDADTASKIAALGIQNDILPRLLKTYVGEDDYVEIMYQIVGPKDGYAEYSPPRITPVSTAKKLMEAGNDVEAVKALLADIDKEEIEAKKSGGGVAARSYKALADFYKKQKSYDEEYAILERFVSQRRARGVNQDKLAERFLKARESRDKRNASKTP >NC_022535|1783178:1822766|1787910_1788321_-|WP_022556412.1|DBSCAN-SWA MVNPDLELQGAIVARLKARAGLTAKVAQRIYDRPPTNAPFPYVEYGESQVIRDDVGCLKSNLIYVTIHVWSQYSGGFKELKELIHEVVEALDEAPLVLPSHRLISITRQDTRHFKDPDEVTTHGVVEFVARVETPA >NC_022535|1783178:1822766|1796101_1796422_-|WP_048902713.1|DBSCAN-SWA MPKPYGRSTEAALYRRLYKTARWARLREAQLAAEPLCRFCLAIEDVTEATVCDHIEPHKGDEALFYGPSNLQSLCAPCHDKLKARLERGQQAVVIGVDGYPVEVGG >NC_022535|1783178:1822766|1811707_1812199_+|WP_022556440.1|DBSCAN-SWA MLAAAFAFSSARQAKRQADAVLGDVEPVFSAYQLPDDGKNYRHRVAIEIVNHNRLPLYVQSIRLEYPDWVLIHRGAEEVRDLVAALYDVVIEKKREHVFDVPFRLAGRFSSDMPAVSTSVFNVRFLDQNQQGGFVLGFIVHYRMHGSKRVRVAFASTGFDVLD >NC_022535|1783178:1822766|1814835_1815669_-|WP_022556444.1|DBSCAN-SWA MSSYKAEARKITEKCYPIPGAFAAGGAVTSVFTNRDINDVDVYFKSREAFEYAVADAYENGFWCVSTTKRAVTFAESGGTPIQFMHFDFFPTAQDIFDAFDFTAVMGALDFDSDEFSFHDDFLKHNSQRFLRFHPGTRYPLASATRVLKYQDRGYTIGKGDILKIVLAGRKVKIDTWEELKDQIGGAYGEKVVLGTEGTPFSLDAAISALTVDDEGKESWVANDNEEQPGSAIGLFKKIAGLKGEQFDESRYDEGKDGFGYPIGYEPPKRQPAPWAA |
49 | Rhizobium_phage(53.85%) | portal,tail,terminase,capsid,head | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|