Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP051471 | Rhodobacter sphaeroides strain CH10 plasmid pRspCH10B, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
NZ_CP051473 | Rhodobacter sphaeroides strain CH10 plasmid pRspCH10DE, complete sequence | 0 crisprs | csa3 | 0 | 0 | 0 | 0 |
NZ_CP051469 | Rhodobacter sphaeroides strain CH10 chromosome 2, complete sequence | 0 crisprs | csa3 | 0 | 0 | 5 | 0 |
NZ_CP051472 | Rhodobacter sphaeroides strain CH10 plasmid pRspCH10C, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
NZ_CP051470 | Rhodobacter sphaeroides strain CH10 plasmid pRspCH10A, complete sequence | 1 crisprs | NA | 0 | 1 | 1 | 0 |
NZ_CP051468 | Rhodobacter sphaeroides strain CH10 chromosome 1, complete sequence | 2 crisprs | cas3,DEDDh,csa3 | 0 | 1 | 8 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP051470_1 | 19453-19531 | Orphan |
NA
Consensus repeat of NZ_CP051470_1
|
1 spacers
spacers of NZ_CP051470_1
>1.1|19476|33|NZ_CP051470|CRISPRCasFinder TTTGCCCCCGCCGTGCCGCCCTGCCGACCCCCC |
CRISPR arrays and Neighbor proteins around NZ_CP051470_1
The CRISPR arrays of NZ_CP051470_1 >merge|NZ_CP051470|1|19453-19531|CRISPRCasFinder GATTTGTTTCGCGCGCGAAACAATTTGCCCCCGCCGTGCCGCCCTGCCGACCCCCCGATTTGTTTCGCGCGCGAAACAA >NZ_CP051470|1|1|19453-19531|CRISPRCasFinder GATTTGTTTCGCGCGCGAAACAA TTTGCCCCCGCCGTGCCGCCCTGCCGACCCCCC GATTTGTTTCGCGCGCGAAACAA
>NZ_CP051470.1|WP_011836179.1|18306_19422_-|ParB-N-terminal-domain-containing-protein MAKRKRLTLPPLGGFAPGPASASEASPAPASPPALGPGLAPAPISRVTGEAAATAALREVAEELGAARSEGRLVLRLELDRIEEGWLVRDRIGLEAEELASLMTSLSAHGQRSPIEVTDMGEGRFGLISGWRRVTALRRLHAETGEERFATVLALLRRPDTAAAAYVAMVEENEIRAGLSYFERARIAAKAVEAGVFATDKLALQQLYGSASRAKRSKIGSFLGIYRALEGALRYPSALPERLGLALARRLEEEPGVAAKITAALAAAAPDSAEAELALIADLLAPAAEAAPEQEPARPSAPDPAPDAPAPAPAPDPAPAQPRSTRPPAPAAELRPGLFLKAEGAGSLHLYGPAVDETFRARLRDWLASEG >NZ_CP051470.1|WP_002724800.1|17186_18008_+|ABC-transporter-ATP-binding-protein MIELRNVSKSYVLNGTRKVVARDLNFTFPSGESVGLLGRNGAGKSSLLRMLAGTMLPDTGEILSSGTISWPVGFSGSFHPELTGAQNVRFVARLYGVDTEAMIAFVRDFAEIGQHFHLPVRSYSSGMRSRLAFGISMAVPFDTYLIDEVTAVGDAAFSAKSNAILRARLEESGAVIVSHSMPLLKKLCRSGAVLHDGQLFYYERIEQAIRHHESMMRGALPRWMRGDLDEGEGEEAGAAAPGPRARARAGAARAGGGRAARPGPASGTSEPRG >NZ_CP051470.1|WP_017140428.1|16030_17170_+|sugar-transporter MPAAAPRAAARLRHHGLLASFLGLVLAPILASGLYLFAIAEDQYTSTVGFSVRTEEMGSALDLLGGLSSFGLTGGGSASDSDILYQFIQSQELVQRINERIDLRAIYSKPGFDPVFSFDPDGGIEDLVDYWKDMVRISYDSTTGLIELRVHAFTPEDAQAVAQGILDESNRMINDLSAIARADATRYAREELDNAVERLRVQRVAMTEFRSRTQIVDPSADIQAQMGLLNTLQQQLASASIDLNLLRQTTQPSDPRIAQNERRIGVIEELIQREREKFGLGGGTGTGASTYSTMIAEFERLTVDLDFAEKAYIAALTNHDAAIAEAQRMSRYLATYVRPTLAQQSLYPQRGLLTLMIGGFALMLWAIGMLIYYSVRDRR >NZ_CP051470.1|WP_002724798.1|14501_15302_-|ABC-transporter-permease MTGLSGLLAGRRFLWGRSVLSLMLREMATSYGRSPGGYLWAVAEPVAALTLLTVVFSFLMRSPPLGNNFPLFYATGYLPFMYYTALTAKIGQAIRFSRPLLAYPAVSFFDAILSRFLLNTLTNLVVFVIVMVGIIAFYDLNVSLDLPAILEAFSMAAALAMGVGTLNCYLFSVAPVWERVWAILNRPMFLLSGIFFLMETVPEPFRTLLSYNPLTHVIAKMRQGFFATYDARLVDPLYAYSVSLVCLFFGMLLLNRHHRMLLNEGA >NZ_CP051470.1|WP_011836245.1|12182_14438_-|hypothetical-protein MLKFHRLRRFARILVLTGEERRYLRAIRKSGLFDRTYYRGAYPGLNPIYLKYPEKHYIAYGERLGYRPNPDFSPQAYLRYHPDVAEAGVPPFLHYVRVGHAEQRLTKELPEVVALPARGMPQVRFEHGRQTAPYAVAVHVYYPDLWPEFAARLRRLRIPFDLYVTLTYRGEETDALAEEIRADFPGAFVTPMPNRGRDILPFVTLLNAGAFDGYRAVCKFHTKKSPHRQDGDLWRKHLIEGILPETGLEEKLEAFVEAPEAGFWVADGQHYTGTQWWGSNVEATRHLLQRIEIPLDREALSFPAGSIYWVKPLVLGLLRSLQLRLEDFDLEEGQVDGTLAHAIERVLGYLTARAGQKVLQTSELRPAAAAAPAKPAFVSAFYLPQFHPVPENDAWWGKGFTEWRSVVKAPSMFEGHLQPMLPADLGFYDLRATEVMGEQAAMAREAGIDAFCVYHYWFDGRRILEAPIDRLMARPEIDFPFYLCWANESWRRNWDGLSGTVLLEQTYGAGFEEKLAADTAPYLRDPRYARPDGRRPRFVIYRPEDMPDPQASVARLREGWRRAGIGEVELGAVRFHVEGAHPVPEGLFDFWVEMPPHGLVKGPDYLFGGPDGNRMPAAMNPAFSGLIYDYAAVARRALSETYVRTLPKATIAGVMPGWDNTARRGAAGHVAYGANPATFNVWLAGALERRVPASYRRELFVNAWNEWAEKAVLEPSLTFGDLNLQVMRQHLGAAEPATHLAEPPAHGMRSH >NZ_CP051470.1|WP_009565051.1|11189_12182_-|hypothetical-protein MARVVLHIGTHKTATTTIQDMFAHNADLLRQHGVIYPRLSRVAGHHGLVMEWNKLPDMYALPQGSIATLKQLTRDYAHVPGTLVLSSEEFSRGKPGAQVDFRAVRELLSDFESVSVVCVLREQWQFMQSIYLQASKERQPPKPSTMVDSVLKRDMTDGLWIDYNLLYDHLLSAFAPEEITFLDFDACRRHRDGVVGAMLDTLGCGLSASSLQVVHDGLSNVSPLALPTFAACVITEPDQAAPWVIDCATGAFRIEYGEEAKSCLWTYPEFQQLFAYAKERNGRLSERRKAVQPEFRISESDPEENRVYRDGLNTSFWIRCSRWIYRARRG >NZ_CP051470.1|WP_011836244.1|9673_11032_+|hypothetical-protein MTSDQTPARPALLVLGMHRSGTSALAGVLGRAGFALPQELMPPTEHNPRGYFESTRIFRLNDALLAAAGSSWDDWRAFDADWHLSPAAEPFHAEAQEALAAEFPGTAPIVLKDPRICRLLPFWTRALTEAGFRPLAVCTHRPAREVGASLARRNGWPEARGLLLWLRHVLDAEAQTRGRPRVFVSYDGLLADWRGTLGRIAEAFDLALPRPLDEAAPEIEAFLSADLRHAPETPAAAAGLSDWIARPEEILDRKAAGEDRPGDRETLDRIAAEVAAAAPLLADLSGAVEEQGARLEREAALRHEAQTILQQERQRLDDLTAELQLQLHHRTLHVAELERHAGELAQQLRQKTQHAAELERHAEELAQQLRQQRSHAAELERHAGELSALTHELRQQVHHKGRHVQELEAHSGELEARLLALEAEHAALLGSTSWKVTHPLRRMSLALRRPKT >NZ_CP051470.1|WP_011836243.1|7769_9071_-|DUF2793-domain-containing-protein MSDASPILSLPYILPSQAQKHVTHNEALQRLDVLVQPAVLDRDRSAPPAAPAAGARHLVGPGAEGAWAGREEAFAVWDAEAAVWRFLAPQPGWQTFVLAEGAGLVFTAQGWRTLIGLLPEFPSLGIATSADATNRLAVAGPATLFTHAGASHRIKVNKAAEAETASLLFQSDWSGRAEIGLAGSDDFALKVSPDGTSFRTALSADRASGRVALPQGAVVTGSLTGSAVQASAADATPGRLLTVGAFGLGAPAPLVGNAGAVDGALAPGFYGYDSAQGSSGGPAGVQAGLLLHQSRGAGEVQLFLVEAGGGGLMPGILFSRARGEGAWSPWVAGGIVESAGNANGRYIRHQDGTQSCWQKVTTSASADVVAPFPAAFSTATGLVTVSSVVSNGAQALSPRLTGRTTTSVGVSVFSATNTRLAAQVELISMGRWY >NZ_CP051470.1|WP_011836242.1|6646_7588_-|calcium-binding-protein MPEVSPFPSLSPLSAPLAQSALAQTPAGTPGNDLLNGGPGADRLVGGDGNDTINGRDGVDRLYGGDGDDLLDGGFYHDACYGQAGNDTFRIRGVDLADDVYGGAGTDTLDLSGYTNLRLGFRVDLAAGRYDFRPEPFGPYVVKSIEIVIGSARADVLLGSRLAETLSGGLGNDRLEGRGGADVLRGMEGADALFGGTGNDLLLGGAGNDRLDGQWERDVLVGGAGNDILIGGGGADRFVFAGNFGRDRIRDFSPDMEGEWIDLRGVGAITGFGDLMANHLSEVNGHVVIEVGANRITLEGISSEMLTRDDFLF >NZ_CP051470.1|WP_011836241.1|5208_6531_-|hypothetical-protein MTLLRTTLLTALLSLALGAGAEAAPEAIYGPGPRALDNREIFAAANLPLWQELDRAGVEGIVLNASFLMTTDFRGIGAKGEKRRPDDTYLTDADLGGLAAIVARTGLQVTYEAGVGLSGARCDASLSPEALGRAAAAFEFERNVRRLTDAGIPVSAINVDGPMLRLLPDSDKPAGCRETAGAGFDVRSAGQIVHAYMVELRDRIEAAQEKQTVQVRWLVNLPLWQVGRVPRNPVYSEPTPDAKPTTDLTDAVRSLAKAQAEQELPGRALEIAEVVIGYPYSMGKQNPNRYAIRVRNLWLSSRALNPRSGHPPPLGFIVNTHSYLNPCLKREGRPDVAFLTFRRGKGSVSEACQRAQIGEDTVANAQDGAMDNDADYLRDSFTYADELSPGGELARQLVLRDGTRIADHVAHIYYQSWGVNPLSNAWYMERLIERLEHGDR >NZ_CP051470.1|WP_050988687.1|19546_20953_-|AAA-family-ATPase MRKSPPAREPQVPLPPYFNISPDAALAELEAPLTTAGFAEIARSCAQGRDDLAARGLDAEGRRSLRLFSTWEITRYLIPVATAHFRRVLKANPDLPQGISETEGGAKWFTLDEVLRLRAHFAAEGSKAKEYRPYRPAGLPAKLVAVANFKGGVGKTSTAAHLAMSAALDGYRVLVIDLDSQGSMTSIFGGRVADEWGTVFPLLARHYAAHLQAENRARVARGDPPVPMDETLTEAQKIRAGDLIAKTHWPNIDLIGAQLNLYWAEFQIPVWRMQGRSWKLWDALTDVLAEDGVLDRYDVIFLDTPPALGYLTINGLAAADILLVPLGASFLEFDSTGRFFDMLHSTFRSIEEGENIAARALGREELAFEWDAVRAVLTRYDGAQQAEMAALMQAYMGRTMAPERQDFTALIGQAGEQVNGIYEADYRDFNRDTYIRGRETFDATYAAFKRLLVGIWRREELAAADAAE >NZ_CP051470.1|WP_002724803.1|21691_22654_+|replication-initiator-protein-A MADFFICDILDALPKDDMASMEHPVFSLATRPDLRVLDYAHNGVRITVTPSVRGLATLFDKDILIYCVSQLMAALNAGRAISRTLHLTAHDLMTATHRETSGDGYQRLREAFERLAGTRITTNMEVGGREITTGFGLIESWQIVRRSRGGRMVQVMVTLSEWLFQAVLTKSVLTLSRDYFRLRKPLERRIYELARKHCGQQPEWRVSIATLAKKSGSASPLRVFRKMIRDMIAADSLPGYSLAEEPGDLLCVTRRAAVLAPGLAPALRDTTLERVRARMPGWDVHALVAEWHAFWHSSGQPRLRSADAAFLGWIERRVPG >NZ_CP051470.1|WP_002724804.1|22704_23643_+|glycosyltransferase MTDAPPPRIALLLATYNGAANLEAQLESFAAQTLRPTWLVVSDDGSTDATRALLAAFAARHPWLALRLVEGPCRGSAQNFLHLLGQVPPEADMAALSDQDDVWLPEKLARGAAAMADLPADLPVLYGGSSWICDAELGNRRPYPLPVRPPGFRHALVQNIAGGNTMMLNRGAIALLAAASREPERIVVHDWWIYQIVSGAGGRVIFDPVPLLLYRQHGGNLIGANDGFRAKYRRLRMLLSGGFRQWNAINIRALSASAHRFTPENRRLLAEFEALRRAGPWGRLRQLKRIGLYRQGLPGRLSLWLAAVLGRI >NZ_CP051470.1|WP_009565041.1|23706_24270_+|dTDP-4-dehydrorhamnose-3,5-epimerase MQIEETELPGVLILTPRVFGDARGSFCEAWNRATLQGLGIDLDFVQDNQSISAPVGTVRGLHYQAPPHAQDKLVRVGHGAILDVAVDVRVGSPTYGRWVGVELTAGNARQLLVPKGFLHGFVTREPDTVVLYKTTDVYAPDCDGAVHFADPDLGIDWGIDPASAVLSDKDARAPRFADWTSPFSIEV >NZ_CP051470.1|WP_011836181.1|24273_25314_+|dTDP-glucose-4,6-dehydratase MKLIVTGGAGFIGSAVVRKAVADGHHVVNLDCLTYAACLDNLASVAGAPNYVFEKADIRDAEAMARIFATHRPDAVMHLAAESHVDRSIDGPGAFIDTNVRGTYVLLEAARAYWVGQGKPQGFRFHHISTDEVFGTLGETGQFTEETPYAPNSPYSASKAASDHLVRAWGETYGLPYVLTNCSNNYGPFHFPEKLIPVVILKALAGAPIPVYGKGENVRDWLYVEDHADALLTVLARGENHRSYNIGGENEAKNIDIVRKICAILDARRPKATPYADQIAFVTDRPGHDLRYAIDPTRIRTELGWRPSVTLDEGLERTVDWYLANEPWWRALQDRAGVGERLGVKA >NZ_CP051470.1|WP_011836182.1|25310_26162_+|dTDP-4-dehydrorhamnose-reductase MILVFGRTGQVARELARQAPDARFLGRDEADLADPEACARAIREAKPDAVINAAAWTAVDRAEEEEAPATVVNGEAPGAMARACAELGIPFVQISTDYVFDGSGTRPWQPGDPVGPLGAYGRSKLAGEEAVRAAGGPHAILRTSWVFSAHGANFVKTMLRLGATRDRLTVVCDQVGGPTPAADIAAACLAMARGLAARPDLSGTYHLSGGPDVSWADFAREIFRQAELDCLVADIASADYPQKAHRPANSRMDCSDLARFGLSRPDWRQGLARVLADLQEVSE >NZ_CP051470.1|WP_002724812.1|26158_27049_+|glucose-1-phosphate-thymidylyltransferase-RfbA MSVRKGIILAGGSGTRLYPLTIGVSKQLMPVYDKPMIYYPLSVLMLAGIREIAIITTPQDQEQFRRALGTGAQWGISLTYLVQPRPEGLAQAYTIAEEFLAGSPSCMVLGDNIFFGHGLPDLLALADAKTEGGTVFGYHVADPERYGVVAMDERGRVTQIVEKPKVAPSNYAVTGIYFLDARAPDLVHGIRPSERGELEIVSLLEIYLEEGLLDVQRMGRGFAWLDTGTHASLLDAGNFVRTLQLRQGMQTGCPEEIAFARGWVDAEALTGMAAQLSKNDYGRYLQGLLTDRMMEG >NZ_CP051470.1|WP_011836183.1|27023_27806_-|deacetylase-sulfotransferase MRAASPRVRFLVAGAQKCGTTALHRFLSAHPGLFLPAGKELHFFDRPLPDDWSGPEGPLYETAFAQARPDQLCGEATPVYLFHTPSLQRIRAYNPAMRLILLLRDPVLRAYSHWRMERTRGAETLPFPEAIRAGRARVAEDWRTFSYVERGFYGAQLTELERLFPREQRLVLWTDEMQRDHAGTLARIWRFLGCAAPPVPPPPAEIRPLDPAPGLAPLAEEDARYLRSLYAPDIALTEALTGRDLSLWREGALSPPSSGR >NZ_CP051470.1|WP_011836184.1|28172_28898_-|sulfotransferase MDRDLPETPALVLGASGGSGTRVAGQLLRAAGGYLGGARNEAGDSLALVAAIEALAARGARLLEGGRPEEADLALWRAALTEHLALHAGEPFWGWKNPRSMFLLPFSVALVPGLRFIHMVRDGRDMALSGNRRQYERHVPEGPGTPADRARFWAESNLRVKRHAEAVLGDRYAILRFEDLCRDPEGCMARLGRQFGLRLEPAGTGIEVRPPDGVGRHRALPAADRAAVEAAAAPAMEAFGY >NZ_CP051470.1|WP_002724637.1|29099_29936_+|3-deoxy-8-phosphooctulonate-synthase MTETPTIVTIGDIAIGGGHPIALITGPCQLESLDHARMMAERIAEACAPTGTKFIFKASYDKANRSSLSTARGLGMEKGLEILGRIREEFGVPVLTDVHEPGHCATAAEVCDVLQIPAFLCRQTDLLLAAGETGRAVNVKKGQFLAPWDMKNVADKVASTGNRRILLCERGTSFGYNTLVTDFRGLPTMAATGWPVVFDATHSVQQPGGLGGSSGGQREFAPVLARAACAVGVSALFIETHEDPDRAPSDGPNMIPVDRMGRLIADLCAFDALAKSLA |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP051470_1 | 1.1|19476|33|NZ_CP051470|CRISPRCasFinder | 19476-19508 | 33 | NZ_CP030273 | Rhodobacter sphaeroides 2.4.1 plasmid pA, complete sequence | 67220-67252 | 0 | 1.0 |
NZ_CP051470_1 | 1.1|19476|33|NZ_CP051470|CRISPRCasFinder | 19476-19508 | 33 | CP047033 | Rhodobacter sphaeroides strain DSM 158 plasmid pEA, complete sequence | 60898-60930 | 0 | 1.0 |
NZ_CP051470_1 | 1.1|19476|33|NZ_CP051470|CRISPRCasFinder | 19476-19508 | 33 | NC_009040 | Rhodobacter sphaeroides ATCC 17029 plasmid pRSPH01, complete sequence | 21550-21582 | 0 | 1.0 |
NZ_CP051470_1 | 1.1|19476|33|NZ_CP051470|CRISPRCasFinder | 19476-19508 | 33 | NZ_CP047039 | Rhodobacter sphaeroides strain 2.4.1 substr. H2 plasmid pEA, complete sequence | 60874-60906 | 0 | 1.0 |
NZ_CP051470_1 | 1.1|19476|33|NZ_CP051470|CRISPRCasFinder | 19476-19508 | 33 | NZ_CP015213 | Rhodobacter sphaeroides strain MBTLJ-13 plasmid b, complete sequence | 104690-104722 | 0 | 1.0 |
NZ_CP051470_1 | 1.1|19476|33|NZ_CP051470|CRISPRCasFinder | 19476-19508 | 33 | NZ_CP051470 | Rhodobacter sphaeroides strain CH10 plasmid pRspCH10A, complete sequence | 19476-19508 | 0 | 1.0 |
NZ_CP051470_1 | 1.1|19476|33|NZ_CP051470|CRISPRCasFinder | 19476-19508 | 33 | NZ_CP015290 | Rhodobacter sphaeroides strain MBTLJ-20 plasmid b, complete sequence | 3847-3879 | 0 | 1.0 |
NZ_CP051470_1 | 1.1|19476|33|NZ_CP051470|CRISPRCasFinder | 19476-19508 | 33 | NC_011960 | Rhodobacter sphaeroides KD131 plasmid pRSKD131B, complete sequence | 72598-72630 | 2 | 0.939 |
NZ_CP051470_1 | 1.1|19476|33|NZ_CP051470|CRISPRCasFinder | 19476-19508 | 33 | CP036421 | Rhodobacter sphaeroides strain HJ plasmid unnamed1, complete sequence | 28105-28137 | 2 | 0.939 |
NZ_CP051470_1 | 1.1|19476|33|NZ_CP051470|CRISPRCasFinder | 19476-19508 | 33 | NZ_CP033321 | Azospirillum brasilense strain Cd plasmid p3, complete sequence | 291403-291435 | 7 | 0.788 |
NZ_CP051470_1 | 1.1|19476|33|NZ_CP051470|CRISPRCasFinder | 19476-19508 | 33 | NZ_CP012915 | Azospirillum brasilense strain Sp 7 plasmid ABSP7_p1, complete sequence | 1573927-1573959 | 7 | 0.788 |
NZ_CP051470_1 | 1.1|19476|33|NZ_CP051470|CRISPRCasFinder | 19476-19508 | 33 | NZ_CP032340 | Azospirillum brasilense strain MTCC4038 plasmid p1, complete sequence | 558458-558490 | 7 | 0.788 |
NZ_CP051470_1 | 1.1|19476|33|NZ_CP051470|CRISPRCasFinder | 19476-19508 | 33 | MK864266 | Gordonia phage Arri, complete genome | 8692-8724 | 9 | 0.727 |
NZ_CP051470_1 | 1.1|19476|33|NZ_CP051470|CRISPRCasFinder | 19476-19508 | 33 | MN284907 | Gordonia phage Fireball, complete genome | 8624-8656 | 10 | 0.697 |
NZ_CP051470_1 | 1.1|19476|33|NZ_CP051470|CRISPRCasFinder | 19476-19508 | 33 | MK864264 | Gordonia phage VanDeWege, complete genome | 8812-8844 | 10 | 0.697 |
NZ_CP051470_1 | 1.1|19476|33|NZ_CP051470|CRISPRCasFinder | 19476-19508 | 33 | MH479910 | Gordonia phage Danyall, complete genome | 8520-8552 | 10 | 0.697 |
NZ_CP051470_1 | 1.1|19476|33|NZ_CP051470|CRISPRCasFinder | 19476-19508 | 33 | MT639651 | Gordonia phage Portcullis, complete genome | 8580-8612 | 10 | 0.697 |
NZ_CP051470_1 | 1.1|19476|33|NZ_CP051470|CRISPRCasFinder | 19476-19508 | 33 | MH479917 | Gordonia phage KimmyK, complete genome | 8648-8680 | 10 | 0.697 |
NZ_CP051470_1 | 1.1|19476|33|NZ_CP051470|CRISPRCasFinder | 19476-19508 | 33 | MH669015 | Gordonia phage TillyBobJoe, complete genome | 8335-8367 | 10 | 0.697 |
NZ_CP051470_1 | 1.1|19476|33|NZ_CP051470|CRISPRCasFinder | 19476-19508 | 33 | MK814761 | Gordonia phage SmokingBunny, complete genome | 8499-8531 | 10 | 0.697 |
NZ_CP051470_1 | 1.1|19476|33|NZ_CP051470|CRISPRCasFinder | 19476-19508 | 33 | KX557286 | Gordonia phage Twister6, complete genome | 8431-8463 | 10 | 0.697 |
NZ_CP051470_1 | 1.1|19476|33|NZ_CP051470|CRISPRCasFinder | 19476-19508 | 33 | MK864267 | Gordonia phage Valary, complete genome | 8884-8916 | 10 | 0.697 |
NZ_CP051470_1 | 1.1|19476|33|NZ_CP051470|CRISPRCasFinder | 19476-19508 | 33 | MK967381 | Gordonia phage RogerDodger, complete genome | 8884-8916 | 10 | 0.697 |
NZ_CP051470_1 | 1.1|19476|33|NZ_CP051470|CRISPRCasFinder | 19476-19508 | 33 | MT310872 | Gordonia phage Evamon, complete genome | 8501-8533 | 10 | 0.697 |
NZ_CP051470_1 | 1.1|19476|33|NZ_CP051470|CRISPRCasFinder | 19476-19508 | 33 | MT521998 | Gordonia phage Jambalaya, complete genome | 8332-8364 | 10 | 0.697 |
NZ_CP051470_1 | 1.1|19476|33|NZ_CP051470|CRISPRCasFinder | 19476-19508 | 33 | MK864265 | Gordonia phage Barb, complete genome | 8648-8680 | 10 | 0.697 |
NZ_CP051470_1 | 1.1|19476|33|NZ_CP051470|CRISPRCasFinder | 19476-19508 | 33 | NC_030913 | Gordonia phage Wizard, complete genome | 8624-8656 | 10 | 0.697 |
NZ_CP051470_1 | 1.1|19476|33|NZ_CP051470|CRISPRCasFinder | 19476-19508 | 33 | MK305889 | Gordonia phage Mutzi, complete genome | 8520-8552 | 10 | 0.697 |
NZ_CP051470_1 | 1.1|19476|33|NZ_CP051470|CRISPRCasFinder | 19476-19508 | 33 | MN010760 | Gordonia phage Nubi, complete genome | 8521-8553 | 10 | 0.697 |
1. spacer 1.1|19476|33|NZ_CP051470|CRISPRCasFinder matches to NZ_CP030273 (Rhodobacter sphaeroides 2.4.1 plasmid pA, complete sequence) position: , mismatch: 0, identity: 1.0
tttgcccccgccgtgccgccctgccgacccccc CRISPR spacer tttgcccccgccgtgccgccctgccgacccccc Protospacer *********************************
2. spacer 1.1|19476|33|NZ_CP051470|CRISPRCasFinder matches to CP047033 (Rhodobacter sphaeroides strain DSM 158 plasmid pEA, complete sequence) position: , mismatch: 0, identity: 1.0
tttgcccccgccgtgccgccctgccgacccccc CRISPR spacer tttgcccccgccgtgccgccctgccgacccccc Protospacer *********************************
3. spacer 1.1|19476|33|NZ_CP051470|CRISPRCasFinder matches to NC_009040 (Rhodobacter sphaeroides ATCC 17029 plasmid pRSPH01, complete sequence) position: , mismatch: 0, identity: 1.0
tttgcccccgccgtgccgccctgccgacccccc CRISPR spacer tttgcccccgccgtgccgccctgccgacccccc Protospacer *********************************
4. spacer 1.1|19476|33|NZ_CP051470|CRISPRCasFinder matches to NZ_CP047039 (Rhodobacter sphaeroides strain 2.4.1 substr. H2 plasmid pEA, complete sequence) position: , mismatch: 0, identity: 1.0
tttgcccccgccgtgccgccctgccgacccccc CRISPR spacer tttgcccccgccgtgccgccctgccgacccccc Protospacer *********************************
5. spacer 1.1|19476|33|NZ_CP051470|CRISPRCasFinder matches to NZ_CP015213 (Rhodobacter sphaeroides strain MBTLJ-13 plasmid b, complete sequence) position: , mismatch: 0, identity: 1.0
tttgcccccgccgtgccgccctgccgacccccc CRISPR spacer tttgcccccgccgtgccgccctgccgacccccc Protospacer *********************************
6. spacer 1.1|19476|33|NZ_CP051470|CRISPRCasFinder matches to NZ_CP051470 (Rhodobacter sphaeroides strain CH10 plasmid pRspCH10A, complete sequence) position: , mismatch: 0, identity: 1.0
tttgcccccgccgtgccgccctgccgacccccc CRISPR spacer tttgcccccgccgtgccgccctgccgacccccc Protospacer *********************************
7. spacer 1.1|19476|33|NZ_CP051470|CRISPRCasFinder matches to NZ_CP015290 (Rhodobacter sphaeroides strain MBTLJ-20 plasmid b, complete sequence) position: , mismatch: 0, identity: 1.0
tttgcccccgccgtgccgccctgccgacccccc CRISPR spacer tttgcccccgccgtgccgccctgccgacccccc Protospacer *********************************
8. spacer 1.1|19476|33|NZ_CP051470|CRISPRCasFinder matches to NC_011960 (Rhodobacter sphaeroides KD131 plasmid pRSKD131B, complete sequence) position: , mismatch: 2, identity: 0.939
tttgcccccgccgtgccgccctgccgacccccc CRISPR spacer tttgcccccgccgtgccgccctgccggcccccg Protospacer **************************.*****
9. spacer 1.1|19476|33|NZ_CP051470|CRISPRCasFinder matches to CP036421 (Rhodobacter sphaeroides strain HJ plasmid unnamed1, complete sequence) position: , mismatch: 2, identity: 0.939
tttgcccccgccgtgccgccctgccgacccccc CRISPR spacer tttgcccccgccgtgccgccctgccggcccccg Protospacer **************************.*****
10. spacer 1.1|19476|33|NZ_CP051470|CRISPRCasFinder matches to NZ_CP033321 (Azospirillum brasilense strain Cd plasmid p3, complete sequence) position: , mismatch: 7, identity: 0.788
tttgcccccgccgtgccgccctgccgacccccc CRISPR spacer tgttcgcccgccttgccgccctggcgacccggc Protospacer * * * ****** ********** ****** *
11. spacer 1.1|19476|33|NZ_CP051470|CRISPRCasFinder matches to NZ_CP012915 (Azospirillum brasilense strain Sp 7 plasmid ABSP7_p1, complete sequence) position: , mismatch: 7, identity: 0.788
tttgcccccgccgtgccgccctgccgacccccc CRISPR spacer tgttcgcccgccttgccgccctggcgacccggc Protospacer * * * ****** ********** ****** *
12. spacer 1.1|19476|33|NZ_CP051470|CRISPRCasFinder matches to NZ_CP032340 (Azospirillum brasilense strain MTCC4038 plasmid p1, complete sequence) position: , mismatch: 7, identity: 0.788
tttgcccccgccgtgccgccctgccgacccccc CRISPR spacer tgttcgcccgccttgccgccctggcgacccggc Protospacer * * * ****** ********** ****** *
13. spacer 1.1|19476|33|NZ_CP051470|CRISPRCasFinder matches to MK864266 (Gordonia phage Arri, complete genome) position: , mismatch: 9, identity: 0.727
tttgcccccgccgtgccgccctgccgacccccc CRISPR spacer gttgcccccgccggaccgccctgccaaggatct Protospacer ************ .**********.* .*.
14. spacer 1.1|19476|33|NZ_CP051470|CRISPRCasFinder matches to MN284907 (Gordonia phage Fireball, complete genome) position: , mismatch: 10, identity: 0.697
tttgcccccgccgtgccgccctgccgacccccc CRISPR spacer gttgcccccgccggaccgccctgccaggagtct Protospacer ************ .**********.. .*.
15. spacer 1.1|19476|33|NZ_CP051470|CRISPRCasFinder matches to MK864264 (Gordonia phage VanDeWege, complete genome) position: , mismatch: 10, identity: 0.697
tttgcccccgccgtgccgccctgccgacccccc CRISPR spacer gttgcccccgccggaccgccctgccaggagtct Protospacer ************ .**********.. .*.
16. spacer 1.1|19476|33|NZ_CP051470|CRISPRCasFinder matches to MH479910 (Gordonia phage Danyall, complete genome) position: , mismatch: 10, identity: 0.697
tttgcccccgccgtgccgccctgccgacccccc CRISPR spacer gttgcccccgccggaccgccctgccaggagtct Protospacer ************ .**********.. .*.
17. spacer 1.1|19476|33|NZ_CP051470|CRISPRCasFinder matches to MT639651 (Gordonia phage Portcullis, complete genome) position: , mismatch: 10, identity: 0.697
tttgcccccgccgtgccgccctgccgacccccc CRISPR spacer gttgcccccgccggaccgccctgccaggagtct Protospacer ************ .**********.. .*.
18. spacer 1.1|19476|33|NZ_CP051470|CRISPRCasFinder matches to MH479917 (Gordonia phage KimmyK, complete genome) position: , mismatch: 10, identity: 0.697
tttgcccccgccgtgccgccctgccgacccccc CRISPR spacer gttgcccccgccggaccgccctgccaggagtct Protospacer ************ .**********.. .*.
19. spacer 1.1|19476|33|NZ_CP051470|CRISPRCasFinder matches to MH669015 (Gordonia phage TillyBobJoe, complete genome) position: , mismatch: 10, identity: 0.697
tttgcccccgccgtgccgccctgccgacccccc CRISPR spacer gttgcccccgccggaccgccctgccaggagtct Protospacer ************ .**********.. .*.
20. spacer 1.1|19476|33|NZ_CP051470|CRISPRCasFinder matches to MK814761 (Gordonia phage SmokingBunny, complete genome) position: , mismatch: 10, identity: 0.697
tttgcccccgccgtgccgccctgccgacccccc CRISPR spacer gttgcccccgccggaccgccctgccaggagtct Protospacer ************ .**********.. .*.
21. spacer 1.1|19476|33|NZ_CP051470|CRISPRCasFinder matches to KX557286 (Gordonia phage Twister6, complete genome) position: , mismatch: 10, identity: 0.697
tttgcccccgccgtgccgccctgccgacccccc CRISPR spacer gttgcccccgccggaccgccctgccaggagtct Protospacer ************ .**********.. .*.
22. spacer 1.1|19476|33|NZ_CP051470|CRISPRCasFinder matches to MK864267 (Gordonia phage Valary, complete genome) position: , mismatch: 10, identity: 0.697
tttgcccccgccgtgccgccctgccgacccccc CRISPR spacer gttgcccccgccggaccgccctgccaggagtct Protospacer ************ .**********.. .*.
23. spacer 1.1|19476|33|NZ_CP051470|CRISPRCasFinder matches to MK967381 (Gordonia phage RogerDodger, complete genome) position: , mismatch: 10, identity: 0.697
tttgcccccgccgtgccgccctgccgacccccc CRISPR spacer gttgcccccgccggaccgccctgccaggagtct Protospacer ************ .**********.. .*.
24. spacer 1.1|19476|33|NZ_CP051470|CRISPRCasFinder matches to MT310872 (Gordonia phage Evamon, complete genome) position: , mismatch: 10, identity: 0.697
tttgcccccgccgtgccgccctgccgacccccc CRISPR spacer gttgcccccgccggaccgccctgccaggagtct Protospacer ************ .**********.. .*.
25. spacer 1.1|19476|33|NZ_CP051470|CRISPRCasFinder matches to MT521998 (Gordonia phage Jambalaya, complete genome) position: , mismatch: 10, identity: 0.697
tttgcccccgccgtgccgccctgccgacccccc CRISPR spacer gttgcccccgccggaccgccctgccaggagtct Protospacer ************ .**********.. .*.
26. spacer 1.1|19476|33|NZ_CP051470|CRISPRCasFinder matches to MK864265 (Gordonia phage Barb, complete genome) position: , mismatch: 10, identity: 0.697
tttgcccccgccgtgccgccctgccgacccccc CRISPR spacer gttgcccccgccggaccgccctgccaggagtct Protospacer ************ .**********.. .*.
27. spacer 1.1|19476|33|NZ_CP051470|CRISPRCasFinder matches to NC_030913 (Gordonia phage Wizard, complete genome) position: , mismatch: 10, identity: 0.697
tttgcccccgccgtgccgccctgccgacccccc CRISPR spacer gttgcccccgccggaccgccctgccaggagtct Protospacer ************ .**********.. .*.
28. spacer 1.1|19476|33|NZ_CP051470|CRISPRCasFinder matches to MK305889 (Gordonia phage Mutzi, complete genome) position: , mismatch: 10, identity: 0.697
tttgcccccgccgtgccgccctgccgacccccc CRISPR spacer gttgcccccgccggaccgccctgccaggagtct Protospacer ************ .**********.. .*.
29. spacer 1.1|19476|33|NZ_CP051470|CRISPRCasFinder matches to MN010760 (Gordonia phage Nubi, complete genome) position: , mismatch: 10, identity: 0.697
tttgcccccgccgtgccgccctgccgacccccc CRISPR spacer gttgcccccgccggaccgccctgccaggagtct Protospacer ************ .**********.. .*.
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
19546 : 29936
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP051470|19546:29936|DBSCAN-SWA CTCACTCCGCGGCGTCGGCCGCCGCCAGCTCCTCGCGCCGCCAGATGCCCACCAGAAGCCGCTTGAAGGCGGCATAGGTCGCATCGAAAGTCTCGCGGCCGCGGATGTAGGTGTCGCGATTGAAGTCGCGGTAATCGGCCTCATAGATCCCGTTCACCTGCTCGCCGGCCTGCCCGATCAGCGCGGTGAAATCCTGCCGCTCGGGCGCCATGGTGCGGCCCATGTAGGCCTGCATCAGGGCCGCCATCTCGGCCTGCTGCGCCCCGTCGTAGCGGGTGAGCACGGCGCGCACCGCATCCCATTCGAAGGCCAGCTCCTCGCGCCCGAGCGCGCGGGCGGCGATGTTCTCGCCCTCCTCGATCGAGCGGAAGGTCGAATGCAGCATGTCGAAGAAGCGGCCCGTCGAATCGAACTCGAGGAAGGAGGCGCCGAGCGGCACCAGCAGGATGTCGGCCGCCGCGAGCCCGTTGATCGTGAGATAGCCGAGCGCCGGCGGCGTATCGAGGAAGATCACGTCGTAGCGGTCGAGCACCCCGTCCTCGGCCAGCACGTCGGTCAGCGCATCCCAGAGCTTCCAGCTGCGCCCCTGCATCCGCCAGACCGGGATCTGGAACTCGGCCCAGTAGAGGTTCAGCTGCGCGCCGATCAGGTCGATGTTGGGCCAGTGGGTCTTCGCGATCAGGTCGCCCGCGCGGATCTTCTGCGCCTCGGTCAGCGTCTCGTCCATCGGCACCGGCGGGTCGCCGCGGGCCACGCGGGCCCGGTTCTCGGCCTGCAGATGCGCCGCATAATGGCGGGCGAGGAGCGGGAAGACCGTGCCCCATTCGTCGGCCACCCGGCCGCCGAAGATCGAGGTCATCGAGCCCTGACTGTCCAGATCGATCACCAGCACCCGGTAGCCGTCGAGCGCCGCCGACATGGCCAGATGGGCGGCGGTCGAGGTCTTGCCCACCCCGCCCTTGAAGTTCGCCACCGCCACCAGCTTGGCAGGCAGCCCCGCGGGGCGGTAGGGGCGGTATTCCTTGGCCTTCGACCCTTCGGCCGCGAAATGGGCGCGCAGCCGCAGCACCTCGTCGAGCGTGAACCATTTGGCCCCGCCCTCGGTCTCGGAGATGCCCTGCGGCAGGTCGGGGTTGGCCTTCAGCACCCGGCGGAAATGGGCGGTCGCGACCGGGATCAGATAGCGGGTGATCTCCCAGGTCGAGAACAGCCGGAGGCTGCGACGGCCCTCGGCATCGAGCCCGCGCGCCGCCAGATCGTCCCGGCCCTGCGCGCAGGAGCGGGCGATCTCGGCGAAGCCCGCGGTGGTCAGCGGCGCCTCGAGCTCGGCCAGCGCCGCGTCGGGAGAGATGTTGAAATAGGGCGGCAGGGGCACCTGCGGCTCGCGCGCGGGCGGGCTCTTCCGCATCTCCACGCGGCCGCGCCCGGCCGTCTTCCGTCCCCTGGTCTCGGCCATGCCTGTTCCCTTTTCCGGGTTCGTCTGCTCGCTTTCGGCAGAAGTATCACCTTTTTGCGGATATGGCAATTTTCTGCGCACATCGCCCGGCAACTTTCTGGTTTTTCTAAGCGATTCCCGCGCTTGGCATCGCACGGGGGGCGGGGGCCGCTCAAGCGCCGTGGCCTGCCTGCATCAGTTAGTTTTGGGTTATATATAGTTACAGGCTCCGGCGGATTCGGCCAAGTGCTTGAGGAGACGCGGGAAAGGGGAGGTTCCGGGAAGCCTCTCCGACTCCGGAACCCCGAAGATCCGACTCCGAAGCCACGCTTCTCCGACTCCGGAACCCCGATCTTTCGCCCCGGCGCGCGCGCCCCGTGCTGGACGGACGGGGCCGCTTGCGCATAGGCTCGGTGTCATCGAAACCACGAAGGCGCGCGCACATCCCGGGGACACATCGGGGGGCCGCACCGTCGAGCAGGCACAGGCAGCGCGATGCAGGAAACAGGCGGACGGCGGACAGCCGCCCCGCGAACGGGTGCGCGGAAGCGCGCCGCGGGACCGGACCCTTCGGTGAACCCGGCCGGGCAGGATACGGCAGCAGAGACCCTTGCCGACGGGGCAGGCCCCGTGGCCCGCGGGCGGAGCCGGTCGCGCGGGGCAGGGGCCCCCGAGCCGTCCCTTCCCGAGACGGCGGACCGCACGCCCGAAATGGCGGACTTCTTCATCTGCGACATCCTCGATGCCCTGCCCAAGGACGACATGGCCTCGATGGAGCATCCGGTCTTCTCGCTCGCGACCCGCCCCGACCTGCGCGTGCTCGACTATGCCCACAACGGGGTCAGGATCACCGTGACCCCTTCCGTGCGGGGGCTCGCCACCCTCTTCGACAAGGACATCCTGATCTACTGCGTGAGCCAGCTCATGGCCGCGCTGAACGCCGGGCGCGCCATCAGCCGGACGCTGCATCTGACCGCGCACGACCTGATGACGGCCACCCACCGCGAGACCAGCGGCGACGGCTACCAGCGGCTGCGCGAGGCCTTCGAGCGGCTGGCGGGCACGCGCATCACCACCAACATGGAGGTGGGCGGGCGCGAGATCACCACGGGCTTCGGCCTGATCGAGAGCTGGCAGATCGTGCGCCGCAGCCGGGGCGGGCGCATGGTGCAGGTGATGGTGACGCTGTCGGAATGGCTGTTCCAGGCGGTGCTCACGAAATCGGTGCTCACGCTCAGCCGCGACTATTTCCGGCTCAGAAAGCCGCTCGAGCGGCGGATCTACGAACTCGCCCGCAAGCACTGCGGCCAGCAGCCCGAATGGCGGGTCTCGATCGCCACGCTCGCCAAGAAGTCGGGCTCGGCCTCGCCGCTGCGGGTCTTCCGCAAGATGATCCGCGACATGATCGCGGCCGACAGCCTGCCGGGCTACAGCCTCGCCGAGGAGCCGGGCGACCTGCTCTGCGTCACCCGCCGCGCGGCGGTGCTGGCGCCGGGCCTCGCGCCCGCGCTGCGCGACACCACGCTCGAGCGGGTGCGGGCGCGGATGCCGGGCTGGGATGTCCATGCGCTGGTGGCCGAATGGCACGCCTTCTGGCACTCGAGCGGCCAGCCCCGGCTGAGATCCGCCGATGCGGCCTTCCTCGGCTGGATCGAGCGTCGGGTGCCGGGCTGACCCCGCCATTTCCCCCTGCCCATGCCGCGCCGCGGCGTTATAGGTACGTCATGACCGACGCCCCGCCTCCCCGGATCGCGCTCCTTCTGGCCACCTACAACGGAGCGGCCAATCTCGAGGCGCAGCTCGAGAGCTTCGCGGCCCAGACCCTGCGCCCGACCTGGCTCGTGGTGAGCGACGACGGCTCGACCGATGCCACCCGGGCGCTCCTCGCAGCCTTCGCCGCCCGTCACCCCTGGCTCGCGCTGCGGCTCGTCGAGGGGCCCTGCCGGGGGTCGGCGCAGAACTTCCTCCATCTCCTGGGCCAGGTGCCGCCCGAGGCCGACATGGCCGCCCTCTCGGATCAGGACGATGTCTGGCTGCCCGAGAAGCTCGCCCGCGGCGCGGCGGCGATGGCCGATCTGCCCGCGGACCTGCCCGTCCTCTACGGCGGCTCGAGCTGGATCTGCGACGCGGAGCTCGGCAACCGCCGGCCCTATCCGCTGCCCGTCCGCCCGCCGGGCTTCCGCCATGCGCTGGTCCAGAACATCGCCGGCGGCAACACGATGATGCTGAACCGCGGTGCGATCGCGCTGCTCGCCGCCGCCAGCCGCGAGCCCGAGCGGATCGTCGTCCATGACTGGTGGATCTACCAGATCGTCTCGGGCGCGGGCGGGCGGGTGATCTTCGATCCGGTGCCGCTCCTGCTCTATCGCCAGCACGGCGGCAACCTGATCGGGGCGAACGACGGGTTCCGCGCCAAATACCGGCGGCTGAGGATGCTGCTGAGCGGCGGCTTCCGGCAGTGGAACGCGATCAACATCCGCGCGCTCTCCGCCTCGGCGCACCGTTTCACCCCCGAGAACCGCCGGCTTCTGGCCGAGTTCGAGGCGCTGCGCCGCGCCGGCCCCTGGGGGCGGCTCCGGCAGCTGAAGCGGATCGGCCTCTACCGGCAGGGGCTGCCGGGGCGGCTCTCGCTCTGGCTGGCCGCGGTGCTGGGGCGGATCTGAGGTGCGTGGACAACCTGCGGATGAGACCTTATTCCAGACAGCATAAAGACAGGGGAATTTCGGGTGCAGATTGAAGAAACGGAACTGCCGGGTGTCCTGATCCTGACGCCGCGGGTCTTCGGCGATGCGCGGGGCAGCTTCTGCGAGGCCTGGAACCGGGCCACGCTCCAGGGGCTTGGGATCGATCTCGATTTCGTGCAGGACAACCAGTCGATCAGCGCTCCCGTGGGCACGGTGCGCGGGCTGCATTATCAGGCGCCGCCCCACGCTCAGGACAAGCTCGTCCGCGTGGGCCATGGCGCGATCCTCGATGTGGCGGTCGATGTGCGGGTGGGCTCGCCCACCTACGGCCGGTGGGTGGGGGTGGAGTTGACAGCCGGCAACGCCCGCCAGCTTCTGGTGCCGAAGGGCTTCCTGCACGGTTTCGTCACGAGAGAGCCCGACACGGTCGTCCTCTACAAGACCACCGATGTCTATGCGCCCGACTGCGACGGGGCGGTGCATTTCGCCGATCCCGACCTCGGCATCGACTGGGGCATCGATCCGGCCTCGGCCGTGCTCTCCGACAAGGATGCGCGCGCGCCGCGCTTCGCCGACTGGACCAGCCCCTTCAGCATCGAGGTCTGACCCATGAAACTGATCGTGACCGGAGGAGCGGGCTTCATCGGCTCGGCCGTGGTGCGCAAGGCGGTGGCCGACGGCCACCATGTCGTCAATCTCGACTGCCTGACCTATGCCGCCTGCCTCGACAATCTCGCAAGCGTCGCGGGCGCGCCGAACTATGTCTTCGAGAAGGCCGACATCCGCGATGCGGAGGCCATGGCGCGGATCTTCGCCACCCACCGGCCCGATGCGGTGATGCATCTGGCGGCGGAAAGCCATGTCGACCGTTCGATCGACGGGCCGGGCGCCTTCATCGACACCAATGTCCGCGGCACCTATGTGCTCCTCGAGGCCGCCCGCGCCTACTGGGTGGGGCAGGGCAAGCCGCAGGGCTTCCGCTTCCACCATATCTCGACCGACGAGGTCTTCGGCACGCTGGGCGAGACCGGGCAGTTCACCGAAGAGACGCCTTACGCGCCGAACTCGCCCTATTCGGCCTCGAAGGCCGCCTCCGACCATCTGGTGCGCGCCTGGGGCGAGACCTATGGGCTGCCCTATGTGCTGACCAACTGCTCGAACAATTACGGGCCGTTCCATTTCCCGGAAAAGCTCATTCCGGTGGTGATCCTGAAGGCGCTCGCGGGCGCCCCGATCCCGGTCTACGGCAAGGGCGAGAATGTCCGCGACTGGCTCTATGTCGAGGACCATGCCGACGCGCTGCTGACGGTGCTGGCCAGAGGTGAGAACCACCGCAGCTACAATATCGGCGGCGAGAACGAGGCGAAGAACATCGACATCGTCCGCAAGATCTGCGCGATCCTCGATGCGCGGCGCCCCAAAGCCACGCCCTATGCCGATCAGATCGCCTTCGTGACCGACCGTCCGGGCCACGACCTGCGCTATGCGATCGACCCCACGCGCATTCGCACCGAACTGGGCTGGCGCCCCTCGGTCACGCTCGACGAGGGGCTCGAGCGCACCGTCGACTGGTATCTGGCCAACGAGCCCTGGTGGCGCGCGCTGCAGGACCGCGCCGGGGTGGGCGAGCGGCTGGGAGTGAAGGCATGATCCTCGTCTTCGGCCGCACCGGGCAGGTGGCGCGCGAACTGGCGCGGCAGGCTCCGGACGCCCGCTTCCTCGGCCGCGACGAGGCGGATCTCGCCGATCCCGAGGCCTGTGCCCGCGCGATCCGCGAGGCAAAGCCTGACGCGGTGATCAATGCCGCCGCCTGGACCGCCGTCGACCGGGCCGAGGAGGAGGAGGCCCCGGCCACAGTGGTGAACGGGGAGGCCCCCGGCGCCATGGCGCGGGCCTGCGCCGAGCTCGGCATCCCCTTCGTGCAGATCTCGACCGACTATGTCTTCGACGGCTCGGGCACCCGCCCCTGGCAGCCCGGCGATCCGGTGGGGCCGCTCGGCGCCTACGGCCGCTCGAAGCTCGCGGGCGAAGAGGCGGTGCGCGCGGCCGGCGGACCCCATGCGATCCTGCGCACCTCCTGGGTCTTCTCGGCCCATGGCGCGAATTTCGTGAAGACCATGCTGCGCCTCGGGGCCACGCGCGACCGGCTCACCGTCGTCTGCGATCAGGTGGGCGGGCCCACGCCCGCGGCCGACATCGCCGCGGCCTGCCTCGCGATGGCGCGCGGCCTTGCGGCCCGGCCCGATCTGTCGGGCACCTACCATCTCTCGGGCGGCCCGGACGTGAGCTGGGCCGACTTCGCCCGCGAGATCTTCCGTCAGGCGGAGCTCGACTGCCTCGTGGCCGACATCGCGTCCGCGGACTATCCGCAGAAGGCCCACCGGCCGGCCAATTCCCGGATGGACTGTTCCGACCTCGCCCGCTTCGGCCTCTCCCGCCCCGACTGGCGGCAGGGTCTCGCCCGTGTTCTCGCCGATCTCCAGGAGGTTTCCGAATGAGTGTTCGTAAGGGTATCATTCTCGCCGGCGGCTCGGGCACGCGGCTCTATCCGCTGACCATCGGCGTCTCCAAGCAGCTGATGCCGGTCTATGACAAGCCGATGATCTACTATCCGCTCTCGGTGCTGATGCTGGCCGGGATCCGCGAGATCGCCATCATCACCACGCCGCAGGATCAGGAGCAGTTCCGCCGGGCGCTCGGCACGGGCGCGCAATGGGGGATCTCGCTCACCTATCTGGTCCAGCCCCGGCCCGAGGGGCTCGCCCAGGCCTATACGATCGCCGAGGAGTTCCTTGCGGGCAGCCCCTCCTGCATGGTGCTGGGCGACAACATCTTCTTCGGCCACGGCCTGCCCGACCTTCTGGCTCTGGCCGATGCCAAGACGGAGGGCGGCACGGTCTTCGGCTATCATGTGGCCGATCCCGAACGCTACGGCGTGGTGGCGATGGACGAGCGCGGCCGCGTGACCCAGATCGTCGAGAAGCCGAAGGTCGCGCCCTCGAACTATGCGGTGACGGGGATCTATTTCCTCGATGCCCGCGCCCCCGATCTGGTGCATGGCATCCGGCCCTCCGAGCGGGGCGAGCTCGAGATCGTCTCGCTTCTCGAGATCTATCTCGAGGAGGGGCTGCTCGACGTCCAGCGCATGGGCCGCGGCTTCGCCTGGCTCGACACGGGCACCCATGCGAGCCTGCTCGATGCGGGCAATTTCGTGCGCACGCTGCAGCTGCGTCAGGGGATGCAGACGGGCTGCCCTGAGGAGATCGCCTTCGCCAGGGGCTGGGTCGATGCCGAGGCCCTGACCGGCATGGCGGCGCAGCTTTCCAAGAACGATTACGGCCGCTACCTGCAGGGGCTGCTCACCGACCGGATGATGGAGGGCTGAGCGCGCCCTCCCGCCAGAGGCTCAGGTCGCGCCCGGTCAGCGCCTCGGTGAGGGCGATGTCCGGGGCATAGAGGCTGCGCAGATAGCGCGCATCCTCCTCCGCGAGCGGCGCGAGCCCGGGGGCGGGGTCGAGCGGCCGGATCTCGGCAGGCGGCGGGGGCACAGGCGGGGCCGCGCAGCCGAGGAAGCGCCAGATCCGCGCGAGCGTGCCTGCATGGTCGCGCTGCATCTCGTCGGTCCAGAGCACGAGCCGCTGCTCGCGCGGGAAGAGCCTCTCGAGCTCCGTGAGCTGGGCGCCGTAGAAGCCGCGCTCCACATAGCTGAAGGTGCGCCAGTCCTCGGCCACCCGGGCGCGCCCCGCCCGGATGGCTTCCGGGAAGGGCAGCGTCTCGGCCCCGCGCGTCCTCTCCATCCGCCAGTGCGAATAGGCCCGCAGGACCGGATCGCGCAGGAGCAGGATCAGCCGCATCGCCGGATTGTAGGCCCGGATCCGCTGCAGCGAGGGGGTGTGGAAGAGATAGACCGGCGTGGCCTCGCCGCAGAGCTGGTCGGGGCGGGCCTGGGCGAAGGCCGTCTCGTAGAGCGGCCCCTCGGGGCCCGACCAGTCGTCCGGCAGGGGGCGGTCGAAGAAATGCAGCTCCTTTCCGGCCGGCAGGAACAGGCCCGGATGGGCCGACAGGAAGCGGTGGAGCGCGGTGGTGCCGCATTTCTGCGCCCCGGCCACGAGGAATCGCACCCGGGGAGAGGCGGCCCTCATGCGGCTGCTCCGATCCTGTGAGATCCGCATCCCGAGAAAGGCGCGCGCGACCGTGCGCGCAGAAGGCTCCGGACGCCGCGGGCAGGCCCCTTGGACCGGTTGAGCTTTCCCGCGGCGGCGCCCGGCAGGACCGGGGCCCGGCTGCGCCTTCCGTCTCTCGTTTCGATCCTTTGCAGAGGACAGGCCATGCTCTTCCCTCACCGGCGAGACCGCCGCGCCCGCCCCGACTGTCCCGAAATCTGCGAACGGAATCAACAGGCTCGATCTGCGAGCTTCCGGGCGCGCCGGAAAGCCCGGGCGCCGGGAGCCCCGGATCCGCGGGATGGGCCCGCCGCCCGGTCCTCGATGCGGCTCACCCCGGGGTCCTCAGTAGCCGAAGGCCTCCATCGCGGGGGCGGCGGCGGCCTCGACCGCCGCGCGGTCGGCGGCGGGCAGGGCGCGGTGCCGACCGACCCCGTCCGGCGGGCGCACCTCGATCCCGGTCCCGGCCGGCTCGAGCCGCAGCCCGAACTGCCGCCCGAGCCGGGCCATGCAGCCCTCGGGATCCCGGCAGAGATCCTCGAACCGCAGGATCGCATAGCGGTCGCCGAGCACCGCCTCGGCATGGCGCTTCACCCGGAGATTGCTCTCGGCCCAGAAGCGCGCCCGGTCGGCGGGCGTGCCCGGGCCCTCGGGCACATGGCGCTCATACTGCCGGCGGTTGCCCGAGAGCGCCATGTCGCGCCCGTCGCGCACCATATGGATGAAGCGCAGCCCGGGCACGAGCGCCACCGAGAAGGGCAGCAGGAACATGCTGCGGGGGTTCTTCCAGCCCCAGAAGGGCTCGCCCGCATGGAGGGCCAGATGCTCCGTCAACGCCGCGCGCCAGAGGGCCAGATCGGCCTCCTCGGGCCGGCCGCCCTCGAGAAGCCGCGCGCCGCGCGCCGCCAGCGCCTCGATCGCCGCGACCAGCGCGAGCGAATCCCCGGCCTCGTTCCGCGCGCCCCCGAGATAGCCGCCCGCCGCCCGCAGGAGCTGGCCCGCCACCCGCGTGCCCGAGCCGCCGCTCGCGCCGAGAACGAGGGCCGGGGTCTCGGGAAGGTCCCTGTCCATCCGATCGTCCTTCTGCCGAAAGGCCGCAGCTTTCCCGCAGGCCCCGGGGAACTCAAGGCTCATTCGGGCCTTGCCGCCGCAGGCGACCCTCCGGCGCGCGCGCGGGGCCGGCGGGAGGCGGGACCGATGGCGGCCCCTCCGGCGGATTTGAACCCCGCGCGCGCTTGGCATAAGGGTGGGACAGAAGCGCGGGAGACGGCCATGACAGAGACGCCCACCATCGTGACCATCGGCGACATCGCCATCGGGGGCGGCCATCCGATCGCGCTCATCACCGGGCCCTGCCAGCTCGAGAGCCTCGATCATGCCCGCATGATGGCCGAGCGCATCGCCGAGGCCTGCGCGCCCACCGGAACGAAGTTCATCTTCAAGGCGAGCTACGACAAGGCCAACCGCTCGTCGCTCTCGACGGCGCGGGGCCTCGGGATGGAGAAGGGGCTCGAGATCCTCGGCCGGATCCGCGAGGAGTTCGGCGTGCCGGTCCTGACCGACGTCCATGAGCCCGGCCATTGCGCGACGGCCGCCGAGGTCTGCGACGTGCTGCAGATCCCGGCCTTCCTCTGCCGGCAGACCGACCTTCTGCTCGCGGCGGGCGAGACCGGCCGCGCCGTCAACGTCAAGAAGGGCCAGTTCCTCGCGCCCTGGGACATGAAGAACGTGGCCGACAAGGTGGCCTCGACCGGCAACCGGCGGATCCTGCTCTGCGAGCGGGGCACCTCCTTCGGCTACAACACCCTCGTGACCGATTTCCGCGGCCTGCCGACCATGGCCGCGACCGGCTGGCCCGTGGTGTTCGACGCCACCCATTCGGTGCAGCAGCCGGGGGGCCTCGGCGGCTCCTCGGGCGGGCAGCGCGAATTCGCCCCGGTGCTCGCGCGGGCGGCCTGCGCGGTGGGGGTCTCGGCGCTCTTCATCGAGACGCACGAGGATCCCGACCGCGCGCCCTCGGACGGGCCGAACATGATCCCGGTGGACCGGATGGGCCGGCTCATCGCCGATCTCTGCGCCTTCGACGCGCTGGCCAAGTCGCTCGCCTGA
Protein sequences of DBSCAN-SWA_1 >NZ_CP051470|19546:29936|29099_29936_+|WP_002724637.1|DBSCAN-SWA MTETPTIVTIGDIAIGGGHPIALITGPCQLESLDHARMMAERIAEACAPTGTKFIFKASYDKANRSSLSTARGLGMEKGLEILGRIREEFGVPVLTDVHEPGHCATAAEVCDVLQIPAFLCRQTDLLLAAGETGRAVNVKKGQFLAPWDMKNVADKVASTGNRRILLCERGTSFGYNTLVTDFRGLPTMAATGWPVVFDATHSVQQPGGLGGSSGGQREFAPVLARAACAVGVSALFIETHEDPDRAPSDGPNMIPVDRMGRLIADLCAFDALAKSLA >NZ_CP051470|19546:29936|25310_26162_+|WP_011836182.1|DBSCAN-SWA MILVFGRTGQVARELARQAPDARFLGRDEADLADPEACARAIREAKPDAVINAAAWTAVDRAEEEEAPATVVNGEAPGAMARACAELGIPFVQISTDYVFDGSGTRPWQPGDPVGPLGAYGRSKLAGEEAVRAAGGPHAILRTSWVFSAHGANFVKTMLRLGATRDRLTVVCDQVGGPTPAADIAAACLAMARGLAARPDLSGTYHLSGGPDVSWADFAREIFRQAELDCLVADIASADYPQKAHRPANSRMDCSDLARFGLSRPDWRQGLARVLADLQEVSE >NZ_CP051470|19546:29936|22704_23643_+|WP_002724804.1|DBSCAN-SWA MTDAPPPRIALLLATYNGAANLEAQLESFAAQTLRPTWLVVSDDGSTDATRALLAAFAARHPWLALRLVEGPCRGSAQNFLHLLGQVPPEADMAALSDQDDVWLPEKLARGAAAMADLPADLPVLYGGSSWICDAELGNRRPYPLPVRPPGFRHALVQNIAGGNTMMLNRGAIALLAAASREPERIVVHDWWIYQIVSGAGGRVIFDPVPLLLYRQHGGNLIGANDGFRAKYRRLRMLLSGGFRQWNAINIRALSASAHRFTPENRRLLAEFEALRRAGPWGRLRQLKRIGLYRQGLPGRLSLWLAAVLGRI >NZ_CP051470|19546:29936|19546_20953_-|WP_050988687.1|DBSCAN-SWA MRKSPPAREPQVPLPPYFNISPDAALAELEAPLTTAGFAEIARSCAQGRDDLAARGLDAEGRRSLRLFSTWEITRYLIPVATAHFRRVLKANPDLPQGISETEGGAKWFTLDEVLRLRAHFAAEGSKAKEYRPYRPAGLPAKLVAVANFKGGVGKTSTAAHLAMSAALDGYRVLVIDLDSQGSMTSIFGGRVADEWGTVFPLLARHYAAHLQAENRARVARGDPPVPMDETLTEAQKIRAGDLIAKTHWPNIDLIGAQLNLYWAEFQIPVWRMQGRSWKLWDALTDVLAEDGVLDRYDVIFLDTPPALGYLTINGLAAADILLVPLGASFLEFDSTGRFFDMLHSTFRSIEEGENIAARALGREELAFEWDAVRAVLTRYDGAQQAEMAALMQAYMGRTMAPERQDFTALIGQAGEQVNGIYEADYRDFNRDTYIRGRETFDATYAAFKRLLVGIWRREELAAADAAE >NZ_CP051470|19546:29936|24273_25314_+|WP_011836181.1|DBSCAN-SWA MKLIVTGGAGFIGSAVVRKAVADGHHVVNLDCLTYAACLDNLASVAGAPNYVFEKADIRDAEAMARIFATHRPDAVMHLAAESHVDRSIDGPGAFIDTNVRGTYVLLEAARAYWVGQGKPQGFRFHHISTDEVFGTLGETGQFTEETPYAPNSPYSASKAASDHLVRAWGETYGLPYVLTNCSNNYGPFHFPEKLIPVVILKALAGAPIPVYGKGENVRDWLYVEDHADALLTVLARGENHRSYNIGGENEAKNIDIVRKICAILDARRPKATPYADQIAFVTDRPGHDLRYAIDPTRIRTELGWRPSVTLDEGLERTVDWYLANEPWWRALQDRAGVGERLGVKA >NZ_CP051470|19546:29936|23706_24270_+|WP_009565041.1|DBSCAN-SWA MQIEETELPGVLILTPRVFGDARGSFCEAWNRATLQGLGIDLDFVQDNQSISAPVGTVRGLHYQAPPHAQDKLVRVGHGAILDVAVDVRVGSPTYGRWVGVELTAGNARQLLVPKGFLHGFVTREPDTVVLYKTTDVYAPDCDGAVHFADPDLGIDWGIDPASAVLSDKDARAPRFADWTSPFSIEV >NZ_CP051470|19546:29936|28172_28898_-|WP_011836184.1|DBSCAN-SWA MDRDLPETPALVLGASGGSGTRVAGQLLRAAGGYLGGARNEAGDSLALVAAIEALAARGARLLEGGRPEEADLALWRAALTEHLALHAGEPFWGWKNPRSMFLLPFSVALVPGLRFIHMVRDGRDMALSGNRRQYERHVPEGPGTPADRARFWAESNLRVKRHAEAVLGDRYAILRFEDLCRDPEGCMARLGRQFGLRLEPAGTGIEVRPPDGVGRHRALPAADRAAVEAAAAPAMEAFGY >NZ_CP051470|19546:29936|26158_27049_+|WP_002724812.1|DBSCAN-SWA MSVRKGIILAGGSGTRLYPLTIGVSKQLMPVYDKPMIYYPLSVLMLAGIREIAIITTPQDQEQFRRALGTGAQWGISLTYLVQPRPEGLAQAYTIAEEFLAGSPSCMVLGDNIFFGHGLPDLLALADAKTEGGTVFGYHVADPERYGVVAMDERGRVTQIVEKPKVAPSNYAVTGIYFLDARAPDLVHGIRPSERGELEIVSLLEIYLEEGLLDVQRMGRGFAWLDTGTHASLLDAGNFVRTLQLRQGMQTGCPEEIAFARGWVDAEALTGMAAQLSKNDYGRYLQGLLTDRMMEG >NZ_CP051470|19546:29936|21691_22654_+|WP_002724803.1|DBSCAN-SWA MADFFICDILDALPKDDMASMEHPVFSLATRPDLRVLDYAHNGVRITVTPSVRGLATLFDKDILIYCVSQLMAALNAGRAISRTLHLTAHDLMTATHRETSGDGYQRLREAFERLAGTRITTNMEVGGREITTGFGLIESWQIVRRSRGGRMVQVMVTLSEWLFQAVLTKSVLTLSRDYFRLRKPLERRIYELARKHCGQQPEWRVSIATLAKKSGSASPLRVFRKMIRDMIAADSLPGYSLAEEPGDLLCVTRRAAVLAPGLAPALRDTTLERVRARMPGWDVHALVAEWHAFWHSSGQPRLRSADAAFLGWIERRVPG >NZ_CP051470|19546:29936|27023_27806_-|WP_011836183.1|DBSCAN-SWA MRAASPRVRFLVAGAQKCGTTALHRFLSAHPGLFLPAGKELHFFDRPLPDDWSGPEGPLYETAFAQARPDQLCGEATPVYLFHTPSLQRIRAYNPAMRLILLLRDPVLRAYSHWRMERTRGAETLPFPEAIRAGRARVAEDWRTFSYVERGFYGAQLTELERLFPREQRLVLWTDEMQRDHAGTLARIWRFLGCAAPPVPPPPAEIRPLDPAPGLAPLAEEDARYLRSLYAPDIALTEALTGRDLSLWREGALSPPSSGR |
10 | Enterobacteria_phage(42.86%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
52605 : 62291
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP051469|52605:62291|DBSCAN-SWA GTCATTTCACGACCCGCAGAGCCCCGAATTCCAGCGCATCCGCGGCCTTCCGTAGATGCTGCGGAGAGAATCGCGCATAGACGCTGGAGGTGATCTGCACGTTCGAGTGGCCGAGATATTGGCTGATCTCGTCCATGGGCACGCCGGCCTCGGCCATGTGGACCGCGGCCGTGTGGCGCAAGGTGTGCAGCGTCACGTCCAACAGGCCGGCATTGGAAACCGCCCGCAGGAAGCCCTTGCGGATGCACTTGACCGGCCCGCCCGCCCACTCGATCACATGATCAGACAGCGCGGCCGCGCGGGCGGATGTCAGCGCGGCGCGGAGTGTGTTGTTGATCGGCACCGTCGCCCGGCCTTTCCGCGGCCCCTCGGCTTCCACCCGCAGCCGGATCTGCCCATGCTCGAGGTCAACCCGATCCCATGTCAGCTCCAGCACCGCACCGACGCGAGCCGCCGTCGTCAGCAGGACGGTGATCGCCAGCTTGATGTGGGGCTCGCACTCGGCCTCCATCAGCCTCGAAACCTCGGCGTGCGTCAGATAGCGGTCGCGCGGCGCCGGCTTGGCTGGCCTCTCGATGTGCGGCGCCACCTGGATCAGGCCGCGCTTGGCCGCCCAGAGCAGGCATGTCCTCAGGTGGCCGAGCTCCGTCCAGACGGTGCCGACGGAGACCGTCTTGCGCCGTGCCTTGGTGTAGCTCCGGCAGGCTTCCGGCGTGATCTGATCGGGCCGCAGCGCGCCGAAATGCGGCAGGACCGCGTTGCCGCTGGACCGCATGTTGCGCTCGACCGGGCGGCCCTTGCGATCGGCAAGATAGGCCGCCCACAGGTCTCGGACGGTCGTCGCCCCGGAGGGCAGGGTCTCCCTGCGGATCCGGTCTAGCGCCTCGACCTCGGCTTCCGCTCGCGTGCGTGCGTCAAGACGATAGCGCCTTCGGGCTCCATCCTCCCACCATGAGACGACGAACCGCCCGTTGAGGCGGCCGATCCGATACTCGCGCATTCGTAGTCCTCCACTGATCGTGCTGGGATTCGGATCATGCGTCCCACCCTGAACCCGGACAAGCGGCCGGACTTGACCAGCTGGCGCACGGTCTCGGCCGAGCAGCCCCAGCGGTCCGCCAGCATGTCAGGGGTGAAAGGCCTCGTGTCGCTCATGCCTGCCTCCTTTCCCGGCGCATCCGCGCCATGTCCCTCATCGTGATCTCGACAGCTCCAGCGGCCACGGACCGATCCCACGGTAGGATCACGCCACGGCCTCCTCTGCGGCCCATCCGCTCCACTGATCGGCGCAGGCCTCGGCCACGCCCTCGAAGGTGCGGCTGCGGAATTTCCAGCGGCCGGGCCCCGGCGGCGCGCGGTGGACGGCCGACCAGCGCTTGTGCTCGGGCGTGCCGGGCCGGGGCGGCGTCAGCCGGTCGGTGGCGGCGAGCCGCGGCAGGCCGCGCAGATAAAAGCTCGTCGCCTTGAAGAAGGGCTCGCCGAACCACCAGGGCTGCACCGTCTGCGGCCGCGGCAGATCTGCCGGCAGACGAGCTCGCGCGTGCGGGTTCATTACCGGGTTTTCGACCGCCACGCGCGGCACCGGCGCCTGCCAGCAGGCCGCAAACAGCGCCGCGCCGCGGTCGAGGTCGGCCCACAGGAAGGCCAGCCGCTCCTTGCGACCCATCCGGGCGAAAGCCTCGCGCTCCGCCGCCGCATAGGTCTCGGGCAGGCGCTTCGAGGGCTCGTGCAGCCACCGGACGCCGCTGTTGCAGAGCCGCGTGCAGGGCGGGTGCGCCACGATCAGCAGGTCCCAGCCGTCTGACAGATGGTCCCGAACGTCGCCCACGATATGGCGGTTCGACCGATCCTCGGCCGGCAGGAGGTCGCAGGACCACACGTCATGCCCGCGGGCGACAAAGGCCCGGCGCATCACGCCCGAGGTCTCGCAGCCGATCAGGATGCGCAGGGGCTTAGCCATGCTGCACCTCCGGCAGGTACCGCAAGGCGGCAGGCGACACGTTGCCCGCAGATGTGGGGAGGCTGCTATGCCTGGGGGCCGGACGACCTACAGGTCTTGCATGTCCGGCGCGGATGACCCCTCGCGCCCTTGGGCGCAAGCACAGGCTAGTAGCGCAGGCGGGTGCTGCCGATCCTGCGGGGGCGAGTGGGATGTTCAGTAAGTTGATGAGTTTATTTGCTCGATAGTGCATGTATGCACCAATATTGGTTAAGTAAACGCTACCACTTTCGATTGAATCAAAAGCCGGACAAATTACCCAAGGATCAGCCATTGGCGATCTCCAGCAGCACGTCGGCATGACAGGGCGCACCGGGCTGGCACCAGCAGGCGAGGTCTTTGCCGCGCAGTTCGCGGATCACATCCTCAACGGTGGTCCATTGCGCCGGGATCTTCCCCTTCGGCCACGGCAAGGGAACTGGTGACCAGCAGCCTTCCTTCTCGAGCAGCATCCGAAACGACTGGACGGCAAGCGCCGGGTCCATCGGCTTTTCCAGCGCGCCGCCCGGGTGGTTGACCACAAACGGGTTGCCCCACTTGGTCGAGCGATCAACCTTCACGGCGCCGGGCGGCATCCGCCATCCTTTCGCGCGGGAAAGCTGGATCCGCCGTGGCTTGCCGCCTGCAACCGCGCCTGTATCCTGCGCGAACCCGGTCGCAGGGGGAGCCGTCCCTGCGCTCGCAGCGGGCACGTCCGTCCTCCGTCCCTGCCGGGTGCTGTTCTCAGCCATTGGCGTGCCCCTCGATCACTTGGGGCTCCTGCCGGCCCCGTGCCGGAGCCTCGAGCGTCTTCACCGCCGTCAGATAGAGGCGGACCCCGTTGAAGATCTGCTTCGCCACGGCGGCGCGGGCGAGCCCATCGGCCGGGCTGATCTTGCCCGCACGCATGTCGCGCAGATCCTCGGCAAGGCCGGAGATCATGCCATGCAACCCGAGGCTGTCGGCTACGGGGGTCGAGGCGTAATCACGCGGTTCCATAGTGCCTCCTGTAGTGCTGGTGGGCCGCCTCGGTGTGCAGGGCCTCCGCGAGCGCGGAGAACACAGAGAGAAGCCGCCGGTCCCGGAAAGTCCCGGGCGCGGGAACGACGGACCCGAAGGTCTGCTCGCACCAGCGGCAGACCGGTTCCACCGGCCACGTCTGCACCGGTCCCGGCCAGCGGTAGTCGCGGTGCTGTCCGCAGATCGCGCAGGTGAAGGGGCGCGGCCGGTCAGGACAGATGGGCATGCGCCAGCTCCGGCATCTCGTCCCAAGTGCGGCCGTCGAGGCGGCGGCCGGCGGCCCGCTTGCCAACACGCCGAAGCGACCAGCCCGGGCCGCCATTGTCGTTCGCGTTTCCGGCGCCGATCGAGCCGTCCGGCCAATAGGTCACGTCGCCCGGATGTCCGGGGAGCAGCTTGTCTGCCCGCGCCCACTCCCCCCACTGCTTGAACAGGAACGGCACGCCCGCGGCCTGCGCCTGATCCCGTAGGGAACGTGCCCAAGCCGGATGCATCGGCCGCGCGTGGCGCCCGCTCTCCCCCCCGACGATGATCCAGTCGAGGCCGCGCAGCTTTTCTGCGCCCGGGCTGAAATGACCGCGCTGCAAGAACCCGGCAGCGATGCCGAAGGGGTGTCCCAGCGCCCATGCCACATCCATCGGCCCGAGCAGCGGTTCGAAACTCCCGAACCGCACCGCTGCGGACGTAGCGAGCAGGTCCGGGATCCGCTCATCCGCCCGCTGCTGATCCTCGGCCGAGACGCCAAGCCAGACGTTTGGAAGGGGCTTTCCTGCCTCGAGGCAGGTGCGCGCAGCGGAAAGAGCGTCCTCGAGCGCTGCGGTCTGCGCATGCCATGCCGGGCTGCCGTTCGGATCGTCGTCACACGCCACGCGATGGTCAGCGTCCCGCCACTCGTCGGCCTCCTCCAGAGCGTAGATGTCGAGCTTCGATACATACCCCCGCATCCGCGCCGCCCGCTTGGTCAGCACCTGAAACGTGTGCTGCGGGGCCAGCGCCATGACCGCGAACACCCGGTCGATCCACTCGTCCGGCACCGCTTCGTGGAACAGGTCGGCATGGGCGCAGACGAAGATCCTGCGCGGGCGCTTCCAGCGCAGCGGCTGCTCGAGCCATTGCTCATTGAGGCGGACCTCGCCCGTCCAGGCGGCGACTCCGGCCGCGTTGCGCCGGGCCAGCCCCGCACGGCTCGGGTGCTGGGACAGGCGCGTTGCCGCGAGATCGGCCGCATAGCAGAACTGGCAGCCGGGCGAAGCGAGCGTGCAGCCCGTGATCGGGTTCCATGTCGCGTCGGTCCATTCGATGGTGCTGGTCTCAGCCATGCTCTGCCCCCACCACCTCGAACACGCGCCCGGTCGTACAGCCGAGACGCTGTGCGATCCGCCGGGCGATGCCGATCCGGTCAAGGGGCATGGGCTGGCCGCCCGGCAGATGCGGGGACACGGCGGCGATGGCGTCGGACCACGCGGAGAGGATCTCGTCGCGCGCGAAGGTGTCGATCTGGTCATGATCGAGCATGCTCAGTCCTCGCCCCAATACGACATGTCGGCGTCGGCACAGGCTTCCGGGCCATCCTCGCGCTGGTGGTCATCGGCCCAGTAGCTGGGCGCCACCTCGCGGGCGTAATCGGCGATGGATGAGCCATCGTCGAAGGTCGGGCCGGCGACCGAGACACACCGCGCCACGAAGGCCTCGACGAATTGCTCCTTCGTCATCTCACCCGCGCTCATGGCCGGCCCCCGGTCTGGCGCCATGGCAGGCGCAGTCATCGTCGGCGCCGAGGCAGCGGCAGTGCGGGCGGCGGGCTTGGCGGACGATCTCCTGGTGGAGCTGGTGCAGGAGGAGATCGGTGGCGGGGATGAGGGTCATGGGGGCTCCTGTATCCTCGGAGGAAGAGGCCCCGGCCCGCAGGCCGGGGAGTTGCAACGGGAGGGTGCGCGGATTGCCCGCCGCGCCGGGGTCGCGTCAGGCCCTCTCGGGCTCGCCGCGGAACAGCGGCAGGCCGGTGGCCTCCGTCGCCTCGTGGAGCGCCTCCTCGATGGCGTCTTCGAGGGCGATCTCCGCGTTGTGGAGCGAGAGGATGAACTTCACCTCGGACCCCGCCTTGCGGTAGCGGAAGCGGACTGGGATGCGATAGGCCGCGCCGCGGTCGAAGATCGGGATCGCGATCATGAACAGGTTCGGGATCTTCAGCGGCTGGCCGTCCGGCTCGCGATGCTCGTTGATGAACTGGATCGAGGTCTCGCCCGTGTCGCGGTTCAGGGTGGCGGTCAGGTTGCTGACCTCGTGCACCTGGAAGCTGCGCGAGAGCTGCACCAGCGTCTGATACTGGCCGAAGCGCCCGTTCAGCTGGCGTGCGACCTCGATCATGCGGACCTCCCACGGCTCGACATTTGCCGCGCCGATATGGCCGTTCAGGAGGTTCGGCGTCGGGTCGAGCAGATCCTTCGCGTTGGCCTCGATGAACTCGCCGAACTCGGCCTTGTCGAGCGCCTTGTTGTTGACGCCGGTCCAGAGCTTCCATTCCTTCGACATCGGGAAGGTATAGAGCGCCCGGTGCCGGCAATGGCTGGCCTTCGGATCGCGGGTCTCGTGATCCATGACCGGCGCGCCCGCACCCATGTAATCGGCGATGCAGGTGAGGCTCGGCGCCGCGCCGATGTCGGCGAAGAGGGCCGAGGTCTCGCCCTTGTGCCGGTTCGCCCAGGCGATGAGGCTGGCCAGATCCTGCAGCTTCGCCGTGCCCGCGCGGCGCCACGGCTGGAGCTTCGTCGCCAGCCGGTCGACCGCCTCCGTCAGATCGACGTGGCGCATGTCGTCGGGGATGGCGTAGTGCGAGGCGGCGGGCGCAGCACCCGGCAGGTGGTTCAGGATCTCGACCGGCTGCGCCATCTTCGGCAGCACCTCGTCCAGCGCCGTCTCGAGCACGTTCTTCGCGGTGTTCTCGTCCATTTTCTGTCCTTCTCGGGGTTACTCGTCCGCGCCGGGCGTGCGCAGCTGGCGCCGGCCGGCAACCTCACGGATCTCCATTCGCGACTGGTTGGGGTTCGCGATGGTCAGCCCGCCGCCGTCGGCGGTCCATGCGGTGGCTTTCGCCTTGGGCGTCCTGGGCTTCGTGACCTTGTGCTCGATCGCCAGATCGATCTGGCCGAAGCGGTCGGTCGTGTAGCTGATGGTGATCTGCAGCTTGCCGCCCGCCTTCGTGCCGTAGGCCTGGCTGAAGTTGACGATCTCGGAGATCAGGTCGTCATTCTCCTGCAGCAGCAGCGGCTGATACTGGCCGTTGTCGGCCAGCGAGATGATCTGATCGAGGGTGCGCAGTTCGCTCATGGGTTTCCTTTCGGGCGGCGGCGCGGGTTGCGGGTTCGGTAAGGCGTTCATCTGCTCCTCTCGGCTGCACATGGAGCCGCCCCGGGCGGGACGGCCGAGCTGCAGCCGCGTCATGAGGTGCGCAGGATCTCGGGAACCTCGGAGGCGAGCACGCGCCAGGCGCGGCCCGTCTTCTTCTCGTAGGCCGCCCAGCCGGTGCCGCCCTCGGTGCGGTAGAAACCGAGTTCCATGCTCACATGCCCCGCATGGCCGCGGCCTGCGCGCGCATGGTCGCGGCAACGTCGGTGGCGTGGGCCCAGAGCACGGTGGCGACGAAGAGGAAGCCCAGGAGCGCCAGCGCCCCCAAGAGCAAGCCGGTCAGGTTCGGCCCGAGCGGGCTGGGCCTGCTCGTCCGGAAGGTCGGCCGCGGCACCGCACGGTGCAGCGGGCGGCCCATGGCGATGGCCAGCTCGTGATCCGTCAGAAGGCGGCTGGCGGCCTCGCGGATCTCGGGCGTAGGCGCGTGCTCGGCGTGATGCCGAGCCATGTTGAGGGCGCGGGGCGAGAGGGGAAGATCCTGTTTCATTGTTCGAGCGCCTCCCGCGCCTGGGCGAGGATGGTGTCGACCTCGGCCCCGGGCTCCTCATCGGCAGTGTCGAGTTGCGGGAGCCCTGCCTCGGCCGCGGCGAGCAGCCGGCGCAGCTGCGGCTCGGTCAGGTCGAGGAGAGCGAAGCGGCCAACCGGCGTCGAGGCCGGGTAGCGGATGGCGGTGGCGGTCATGGCCGGCCCTCCGCAATGTGGTAGCGGTTGCCAGAGGGGCACAGGCGCGCCCTGGCGTTGGCGAGCAGGCTGGCCCAAGCCAGCGGCTGGAGGATCATGGGCAGTCTCCCGTCAAATGCACGGTCAAGGATCGGCCGTCTGACGAGAAGCTATTGGGGGAGTTTCCCCATGTCAACGCTCTAATGGGGATTTACCCCCACTTGCATGCAGAATCGTGCCGGCCTCCCCATCGGCTCCCACTGCCGGCGGGTAAAGGACGGCTCTAAAGCCGGGCACCCCGAAACCCGCGGAGGCGGGCGGAAAGTGAATTGTGCTTTATGACGGAGCGAAGCAGACGGCCTGACGACCAAGCGCCATCAGACGTAGACCCGCTCCACATAATCGGGCGAAAGCGTCAGCAGCACAGGCGCTGCCCACTTCAACCGGACGCCGTGCATGTTGTCTGCATCCGGGTTCAGCGACAGGAGGCTGAAAGTGCCCTCCTGGCTACCCACCTTGACAACCTTGAGCCATGCCCTTCCGTCAGCGTCCTCGCAGACACAAGGGGTGTTGAGCGCCTCGACCGGCACGCCCTCTGCGGCGGCGCGCGTGTAGAAGAGCACCGATCCAGGGCGATAGAGCGGCATCATGCTCTCTCCCTTGACCTCTACCGCAACTATGCCGTGTGGCTTCAGCTGCGGGGGGCGGGCGACGTGGTACATTCCATCGCCCTTCTCGTAGGCGTCTAGGAGGTCAACCCGCGCACCCGCACCCACGCAGCCCGCCACCGCAATGGGCGCGTGAGCCGGACGCGCATTCAGATCTCCCGTCATGCCGATCTGAATGATCTCATCGACCGTTCGCCCAAGCTCTCTGGCCAGGGCGTAGGCGTTGGCGACCTTAGGCGATGACTCGTTGCGGATCAGATCTCGCACGCCAGACTCGCCCATGCCTGCAGCTACCGAGAGCGGCTTCATCTTGATGCCCTCAGCCTCCATGACGATCTGCAGGCCACGCACGAAGGCGTCCAGGGTCTTTTTCTGCATGTGGGGAAAGTGCCCCATACCCCAGCTCCGTTCCATGGGGAAGTTCCCCTTGCCAATATGGGGGAGTTCCCCCATATTGCCCGCCATGGAACAGCTCATCTCAGACATCGAGGCCCATTGCGCGGCGTGCGGGATCAGTCCGCAGAAGCTGTTGCGCGAGGCCATCAACGCGAAGTGGGGACAATGGCAGGACTGGAAGGACGGCAAGTCCAGCCCGACCATGAAGGTTGTGGATCGCCTGCGCGCCCACATGGCGGGCGCCGAGATGACCGCTACCGGTCATGGGCTGGTCTCTCAGGAAGAAGCAAAGGGGGCAGCGTGATGACCTACGCCGCCCCCTTCGCCCGTCATGAAATCGCTTCGTCATCAAAGCCTCTCGCCAAGGCACAAGATGGAGCGAACATGCGGAAAAATCCTGCCAATTCGGACGAGCACGCCAGAACGAGCCGAAAATGGTTTTCGAACCTGCTGCGCCGAGCCTTTCCGGCCAACTCTGAGGCTGAACTGGCCGAGCGCGCAGCGCCCGTGCTGGGGGTGAGCACCCGGCAGGTGCGCAACTGGCTGCGCGAGGACCACGATGCGTCCCTCCGCTACGTCACCGCCGTGATGATGATCGCGGGCGCGGAGGTGGTCTTCTCCCGGATGGAAAGCCGCCAGCCATGATCCGGGTCTGCTGGCACATCACTAGGCGCTTCTACGAGGTGCGGGCCAGCCGCGCCTCATCCAAGGGCCTGACGGATAAGCATCGGCATCTGCAAGGAAAAGCCATGAAGTATTCTTCCCTGATCGAGCACTACGAGGCGCATCACAAGAGCCCAGAGCCGCAGGAGCGTCGCCTGCCGTCCGGCTGGTGGCTGATCGTGGCCGCGCTTGCCTATGCGCTCGTCTACGTCCTCGCGACATGGGCCGTCCTCGCATGATTACCGCCGCCGCGGCCGTGCCTGCCTTGGCCGCGGCCACTCAGCCGGGGGCGCTCGCTCCTTCTGCCCCCGGCCTTTTCCCTTCGGCGCGACCCTCCTCTGCGGCGGGGCAGGGACCTGCGGGAGCGGTGCGTCAATGACCGCCTCCCTGCACCTCTCTTTGCCGCCGCCCGGCAATCCAGACTGGACCGCGCTCTATGCTCATCCCGCCCGGGACGCGGGGCTCCTGCATTTCGGCGCCGGCCCGGCGCTGGTGGCGCGCCGCGCGAAGCTCGGCCGGCCGGTCTATCTCGCCACGCCCTACAGCTTGCGCGCCGTGGACCGGGAAGGCCGGTGGTCGGCAGACATGTCGGCTGCGGCCATGGGCGATGCCGGGCGCGAGATCGTGCGGCTGCAGCAGGTGGGCGTGACGGCGATTTCGCCCGTCGCCCTCTCGGGCGTGGCGGTGCATGCCACGCTTTATCCTCGGCCCATGCTCGATCCGCTCGATGCGGTGCTCTGGGCCGAATGGTGCCGCCCGATCCTCGATACATGCTCCGCCGTCGTGGTGCCCGACATCCGCGGCTGGTCGCGCTCTCTCGGGATCTGGCACGAGGTGCGGGCCGCGCTCGCGCGCCAGACCTCGGTCTTCGTCTATGCGGAGGGGCCGGAGCGATGA
Protein sequences of DBSCAN-SWA_1 >NZ_CP051469|52605:62291|59161_59494_-|WP_011339625.1|DBSCAN-SWA MKQDLPLSPRALNMARHHAEHAPTPEIREAASRLLTDHELAIAMGRPLHRAVPRPTFRTSRPSPLGPNLTGLLLGALALLGFLFVATVLWAHATDVAATMRAQAAAMRGM >NZ_CP051469|52605:62291|59490_59688_-|WP_011339624.1|DBSCAN-SWA MTATAIRYPASTPVGRFALLDLTEPQLRRLLAAAEAGLPQLDTADEEPGAEVDTILAQAREALEQ >NZ_CP051469|52605:62291|55326_55581_-|WP_011339630.1|DBSCAN-SWA MEPRDYASTPVADSLGLHGMISGLAEDLRDMRAGKISPADGLARAAVAKQIFNGVRLYLTAVKTLEAPARGRQEPQVIEGHANG >NZ_CP051469|52605:62291|57564_58551_-|WP_011339628.1|DBSCAN-SWA MDENTAKNVLETALDEVLPKMAQPVEILNHLPGAAPAASHYAIPDDMRHVDLTEAVDRLATKLQPWRRAGTAKLQDLASLIAWANRHKGETSALFADIGAAPSLTCIADYMGAGAPVMDHETRDPKASHCRHRALYTFPMSKEWKLWTGVNNKALDKAEFGEFIEANAKDLLDPTPNLLNGHIGAANVEPWEVRMIEVARQLNGRFGQYQTLVQLSRSFQVHEVSNLTATLNRDTGETSIQFINEHREPDGQPLKIPNLFMIAIPIFDRGAAYRIPVRFRYRKAGSEVKFILSLHNAEIALEDAIEEALHEATEATGLPLFRGEPERA >NZ_CP051469|52605:62291|56913_57117_-|WP_023004254.1|DBSCAN-SWA MLDHDQIDTFARDEILSAWSDAIAAVSPHLPGGQPMPLDRIGIARRIAQRLGCTTGRVFEVVGAEHG >NZ_CP051469|52605:62291|52605_53604_-|WP_114071702.1|integrase|DBSCAN-SWA MREYRIGRLNGRFVVSWWEDGARRRYRLDARTRAEAEVEALDRIRRETLPSGATTVRDLWAAYLADRKGRPVERNMRSSGNAVLPHFGALRPDQITPEACRSYTKARRKTVSVGTVWTELGHLRTCLLWAAKRGLIQVAPHIERPAKPAPRDRYLTHAEVSRLMEAECEPHIKLAITVLLTTAARVGAVLELTWDRVDLEHGQIRLRVEAEGPRKGRATVPINNTLRAALTSARAAALSDHVIEWAGGPVKCIRKGFLRAVSNAGLLDVTLHTLRHTAAVHMAEAGVPMDEISQYLGHSNVQITSSVYARFSPQHLRKAADALEFGALRVVK >NZ_CP051469|52605:62291|61372_61633_+|WP_011339621.1|DBSCAN-SWA MIRVCWHITRRFYEVRASRASSKGLTDKHRHLQGKAMKYSSLIEHYEAHHKSPEPQERRLPSGWWLIVAALAYALVYVLATWAVLA >NZ_CP051469|52605:62291|61115_61376_+|WP_002723285.1|DBSCAN-SWA MRKNPANSDEHARTSRKWFSNLLRRAFPANSEAELAERAAPVLGVSTRQVRNWLREDHDASLRYVTAVMMIAGAEVVFSRMESRQP >NZ_CP051469|52605:62291|60798_61035_+|WP_017140293.1|DBSCAN-SWA MEQLISDIEAHCAACGISPQKLLREAINAKWGQWQDWKDGKSSPTMKVVDRLRAHMAGAEMTATGHGLVSQEEAKGAA >NZ_CP051469|52605:62291|54869_55208_-|WP_030002922.1|DBSCAN-SWA MQLSRAKGWRMPPGAVKVDRSTKWGNPFVVNHPGGALEKPMDPALAVQSFRMLLEKEGCWSPVPLPWPKGKIPAQWTTVEDVIRELRGKDLACWCQPGAPCHADVLLEIANG >NZ_CP051469|52605:62291|60044_60800_-|WP_017140294.1|DBSCAN-SWA MAGNMGELPHIGKGNFPMERSWGMGHFPHMQKKTLDAFVRGLQIVMEAEGIKMKPLSVAAGMGESGVRDLIRNESSPKVANAYALARELGRTVDEIIQIGMTGDLNARPAHAPIAVAGCVGAGARVDLLDAYEKGDGMYHVARPPQLKPHGIVAVEVKGESMMPLYRPGSVLFYTRAAAEGVPVEALNTPCVCEDADGRAWLKVVKVGSQEGTFSLLSLNPDADNMHGVRLKWAAPVLLTLSPDYVERVYV >NZ_CP051469|52605:62291|53847_54564_-|WP_011339632.1|DBSCAN-SWA MAKPLRILIGCETSGVMRRAFVARGHDVWSCDLLPAEDRSNRHIVGDVRDHLSDGWDLLIVAHPPCTRLCNSGVRWLHEPSKRLPETYAAAEREAFARMGRKERLAFLWADLDRGAALFAACWQAPVPRVAVENPVMNPHARARLPADLPRPQTVQPWWFGEPFFKATSFYLRGLPRLAATDRLTPPRPGTPEHKRWSAVHRAPPGPGRWKFRSRTFEGVAEACADQWSGWAAEEAVA >NZ_CP051469|52605:62291|57119_57329_-|WP_017140297.1|DBSCAN-SWA MSAGEMTKEQFVEAFVARCVSVAGPTFDDGSSIADYAREVAPSYWADDHQREDGPEACADADMSYWGED >NZ_CP051469|52605:62291|61769_62291_+|WP_011339620.1|DBSCAN-SWA MTASLHLSLPPPGNPDWTALYAHPARDAGLLHFGAGPALVARRAKLGRPVYLATPYSLRAVDREGRWSADMSAAAMGDAGREIVRLQQVGVTAISPVALSGVAVHATLYPRPMLDPLDAVLWAEWCRPILDTCSAVVVPDIRGWSRSLGIWHEVRAALARQTSVFVYAEGPER >NZ_CP051469|52605:62291|55811_56921_-|WP_011339629.1|DBSCAN-SWA MAETSTIEWTDATWNPITGCTLASPGCQFCYAADLAATRLSQHPSRAGLARRNAAGVAAWTGEVRLNEQWLEQPLRWKRPRRIFVCAHADLFHEAVPDEWIDRVFAVMALAPQHTFQVLTKRAARMRGYVSKLDIYALEEADEWRDADHRVACDDDPNGSPAWHAQTAALEDALSAARTCLEAGKPLPNVWLGVSAEDQQRADERIPDLLATSAAVRFGSFEPLLGPMDVAWALGHPFGIAAGFLQRGHFSPGAEKLRGLDWIIVGGESGRHARPMHPAWARSLRDQAQAAGVPFLFKQWGEWARADKLLPGHPGDVTYWPDGSIGAGNANDNGGPGWSLRRVGKRAAGRRLDGRTWDEMPELAHAHLS >NZ_CP051469|52605:62291|57315_57468_-|WP_017140296.1|DBSCAN-SWA MTLIPATDLLLHQLHQEIVRQARRPHCRCLGADDDCACHGARPGAGHERG >NZ_CP051469|52605:62291|58569_58929_-|WP_017140295.1|DBSCAN-SWA MSELRTLDQIISLADNGQYQPLLLQENDDLISEIVNFSQAYGTKAGGKLQITISYTTDRFGQIDLAIEHKVTKPRTPKAKATAWTADGGGLTIANPNQSRMEIREVAGRRQLRTPGADE |
17 | Rhodobacter_phage(44.44%) | integrase | attL 51504:51519|attR 67354:67369 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
66939 : 76537
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP051469|66939:76537|DBSCAN-SWA GATGGTCGAGATGCTCGACCGCGGCATCGGGCGGCTCACCCGCATTCCGCCCCTGCCGCCCTTCACCGCCCCCGAGGAGATCCTGGCAGACGCGCTGCCGCTCCTCGATCCGCCGAGCCGGGTCACGGTGACCGAGGCGGCCGAGCGGCACATGCGCGTGCCGGTGCAGGGCAACTGGGTGCCGTTCGACCGGGCGGTGACGCCCTATACCGTCGAGCCCGCCGACATGACCCAGTCGCGCCGCTTCAAGGCCGTGGTCTTCCTCGGGCCGTCGCAGAGCGGCAAGAGCCAGATGATGCAGTCGGTCTCGGCCCATGCCGTCACCTGCGCGCCGGGCCCGGTGCAGGTCATCCACATGACCAAGACCGATGCCGACGCCTGGGTCGAGGAGAAGCTCGACCCCACGATCCTGAACAGCCCGGCGCTGCGCGAGCGCCTGGGCACCGGGCGCGACGACAGCACCTTCAGCCGCAAGCGCTTCAAGGGCATGCGGCTCACCATCGGCTATCCGGTGCCGAACCAGCTCTCGAGCCGGTCTCAGCGCCTCGTGATGCTCACCGATTACGATCACATGCCCCAGAAGCTCGGGCCGAAGGACAGCCCGGAGGGCTCGCCCTTCGGCATGGCGCTGCAGCGGATCCGCACCTTCATGAGCCGGGGCTGCGTCCTGGCCGAGACCTCGCCCGCCTTCCCGGTGGACCCGAATGCGGACTGGGCGCCGCATGCCGGCCATCCGCACATGCTGCCGCCGGCCACGGCCGGGCTCGTGCCGATCTACAACGAGGGCACGCGCGGGCGCTGGTACTGGGAATGCCCGGACTGCGGCGATCTCTTCGAGCCGCGCTTCGACCGGCTGCATTACGATGCGGATCTCGATCCGGGCGCGGCGGGCGAGCAGGCGATGATGGAATGCCCGCACTGCGGAACGCTCATCGCCCACCGTCACAAGGTCGGCCTCAACCGCGCCGCGCTCGAGGGTCGCGGTGGCTGGCTGCACGAGGGCCGCCACATCGAGGCGAACGGGCGCCGGGCGCTGGTCCGGATCGACGATCCCGACATCCGACGCACGCCCATCGCGAGCTACAGTCTGAACGGGGCCGCCGCGGCCTTCGCCTCGTGGGAGGAGCTGGTCCAGCGCTACGAGACCGAGCGGCGGCGGTTCGAAGCCTTGGGCGACGACACCGACTTCGCCCGGGTGCATTACACCGACATCGGCGTGCCTTACCGGCGCCCCGAGGCCGAAGAGGAGGGCGCCCTCACGGCGGCGCAGATCCGCGAGCACATGCGCAGTCAGGAACGGCGCGTGGCCCCGGCCTGGACGCGCTTCGTCACGGTCTCGATCGACGTGCAGGGCAACCGCTTCGAGGTGCTGGTCATGGCCTGGGGCGCGCAGGGCGAGCGGATGCCGATCGACCGGTTCGCGGTGGCGCAGCCTCCCGACCATGCCCCGCGCGCGAAGGGTGACGACGAGCGATACCGGGCGCTCGACCCCGGCCGCTATGTCGAGGATGCCGATGCGCTCCTCGATCTGCCCGAGCGTCTCTATCCGGTGGAGGGGGCGAGCTGGAGCCTGAAGCCCTGCGCGCTGGTGATCGACTTCAACGGCCCGGCCGGCTGGTCGGACAATGCCGAGAAGTTCTGGCGCGCGCGCCGGCGCAACGGTCAGGGCGGGCTCTGGTGGCTCTCGATCGGCCGCGGGGGCTTTCAGCAGCGCGACCGGGTCTGGCACGAGGCGCCCGAGCGGGGCTCGAAGGGCAGGCGCGCGCGCGGCATCAAGCTGCTGAACATGGCGACCGACCGGATGAAGGAGAGCGTCCTCGCGGCCGTCGGCCGGTTCGAGGGCGGTCAGGGCGCCCAGCATGTGCCCTCCTGGCTCGAGGCGGAGCATCTCGACGAGCTCCTCGCCGAGCGCCGGGGCGCCAAGGGCTACGAGAAGCGCCAGGGCGCTGTCCGCAACGAGACGCTCGATCTCTCGGTGCAGGCGCTGGCCGTAGCGGAGTTCAAGGGGCTGAACCGGATCGACTGGCAGGCGCCGCCCGCCTGGGCCGAGGCGGGGCCCGCCAACCCGTTCGCCGTGGCCGTGTCCGCAGCTGCGGCAGAGGCCGCACCGGCCCCGCGCCGGCGCGCGCGGACCTCGCGCTCGCGATACATGGAGGGATCATGACGCTCGACGATATGGAGCGGCGGCTCACGGGGCTGCTCGACATCCGCCACCGGGGCGTGCGGTCGGGCTCGGTCGGGTCCGAACGGGTGGAGTATCAGAGCGATGCCGATCTCGCGCGGGCCATCGCCGATCTCGAACGGCGCATCGCGAAGGCGCGGAAGACGGCGCGCCGGGTGATCCGGCCCTATGCGGTGAAGGACCTGTGATGGCGGGCCCCTTCCTGCGCCGGCTCGGCGCCTGGGTCGGCGGGTTCGACGCGGGTCTCGCCAACCGGCGCCTGCGCGGCTTCCGTCCCGCGCGGGCGCATGTGAACGCGCTTCTCGCCGCGGCCGGGCCCGACATGAACGCCCGCGCGCGCTACCTCGTGCGCAACAACGGCTATGCCCAGGGCGCGCTCGACAGCTGGGCCGCGAACACGGTCGGCACCGGGGTGAAACCCTCCTCGCTCATCGCGGCGCCGGCGCGGAAGGCCGCCCTCCAGCGGCTCTGGCAGGACTGGACCGACGAGGCCGACGCCGAGGGCGTGACCGATTTCTACGGCCTGCAGCGCCGCATCGCGCGCGAGTTCTTCCTCACCGGCGAATGCTTCGTGCGCCTGCGCGCGCGGCGGCCCGGCGACGGGCTCACGGTGCCGCTCCAGCTTCAGTGCCTGCCCTCCGAGATGCTGCCGATCGGCAGGACCGAGGTGCTGGGCGGCGGGCGCGCGATCCGGCAGGGGATCGAGTTCGATGCGGTGGGCCGGCGGGTGGCCTATCACTTCCATCGCCGCCATCCGGGCGATCCGACCGAGCCGGGGCTTGCGGGCGAGACGGTGCGCGTGCCGGCCGAGGATGTGCTCCACATCGTCGATCCGGTCGAGAGCGGGCAGCTCCGCGGCGTCTCGCGCTTCGCACCCGCCATCGTGAAGCTCTTCCTGCTCGATCAGTACGACGATGCCGAACTCGACCGGAAGAAGGTCGCGGCCATGTATGCGATGTTCATCACCTCGAACGATCCGGATGCGGCGCCGCTCGAGGGCGAGCTGGGCGATCAGGTGGCGCCGGGGCAGATCGTGCGTCTCGACCCGGGCGAGGACATGAAGGTGGCCGATCCCGCGGATTCGGGCGCGACCTACGAGCCCTTCCAGTACCGCACGCTCCTGCAGGTCTCGGCCGCCCTCGGCATCCCCTATGCTCACCTGTCGCAGGACATGGTGAAGGCGAACTACTCGAATGCCCGCACCGCGCTCATGGAGTTCCGCCGCCGGGTGGAGGCCTTCCAGCATTCGGTCCTCGTCTATCAGCTCTGCCGCCCGGTCTGGGCGCGCTTCACCGATCTCGCGGTGCTGACCGGAGCGGTGGGGCTGCCGGGCTATGAGCGGCGGCGGCGGGACTATCTCGCCTGCGAGTGGCTGCCGCCGAAGTGGCAATGGGTCGATCCGCTGAAGGACATCCGCGCCGAGATCGAGGAGATCGGCGCGGGCCTCAAGAGCCGGTCGCAGGCGATCGGGGAGCGCGGCTACGACGCCGAGGAGGTCGATCGCCAGATCGCCGCCGACCGGAAGCGCGAGGGGCGGCTCGGGCTCGACTTCCGCCGCAGCGCGCAGGGGCCTTCCGCGCCTGCGACGCAAGACGGGGCGCGCGCCGACGAGGAGGACGATGAGGACGACGACGGACGCGCGGCGGACCGCGACGCCGGCAGGAGGGCAGAGCCATGAACTATCCGATGATCGCGGGCCGGGTGTTCGGCACGCCGCTGCTGGTCGATCCCGTGAAAGGCGCGGCCTTCCTCGCGGGCCTCGGCCCCCGGCTCGTGAACGGGGCGCTCGAGCTGCGCGGGCTCGAGGAGCTCGCGCCCGACCGCGTGGCCGAGGCCGGGCGGATCGCGCCGCGCGCCTCGGTGCTCCTCGACGATGCGGGCGACGCCCGGCGGGAGGCGGGCCGGCCGCTCTACCTTGTGGAGGGCGGCGTCGCGGTGATCGAAGTCACCGGCACGCTCGTTCACCGTGGCGGCTGGATCGGCCAGTCCTCGGGGACGACCTCCTACGAGGGGCTGATGGCGCAGATCTCCGCGGCCGTGGCCGATCCGTCCGTGCGCGGCATCGCGCTCGAGATCGACAGTTATGGCGGCGAGGTGGCGGGGCTCTTCGATCTGGCCGACGCGATCCGGGCCGCGCGGGCGGTGAAGCCGGTGCGCGCCTTCGTGGCCGAGGCGGCCCTGTCGGCGGCCTATGCGATTGCAAGCCAGGCCGAGCGGATCGTGCTGCCGCGCACCGGCGCCGTGGGCAGCATTGGCGTGCTCCTCGTGCATGCCGACTTCTCGCAGGCCATGGCCGACCGCGGTGTCGCGGTCACGCTGATCCATGCCGGGCGGCACAAGGTCGACGGCAATCCCTACGAGGCCCTGCCCGAGGGGGTGCGCGCCGACCTGCAGGCCCGCGTCGAAGCCTCCCGCGCGCTCTTCGCCGAGACGGTCGCGGCGGGCCGCGGCGCGCGGCTCAGCCGACAGCAGGCGCTCGCCACCGAGGCGCAGGTCCTCGACGGTGCCGCCGCGGTGGCCGCGGGTCTCGCCGACGAGGTCTCCGATCTCCGGAGGGCCTTCGCCGCCTTCCGCGCCGAACTGTTCGATCCGCACCTTACATCCCCCCGGGCCGGCACGCCGGCCGCAGCCAAGGAGACCCCGACCATGACCGATGAGACCACGACCGGCGCCGCGCGAGGTACGGCCGCAGAGGGCAGCGCGCCCCCAATGGAGACCGCGGAAGGATCCGGCGGCGGCGCGCCGGCCGCGAATGTCGCCGTGGCCGAGGCCGCCGAACTGATCGAGATCGGCCAGCAGGCGGCCCGGCTCGGCCTGGCCGTCGATGTGGCCGACGCGATGCGCCGCGGCCTCTCGGCCGCTGCCCTCCGCCGCACCGTGCTCGACGGGCTGGCAGCCCGGGGCGACGGGGCCGACCTCGTGGCCCACGCCCCGACTGCTGCCGCCGGGCCGAAGGAAAGCCCGCTCCTCGCCGCCGCGCGCCGCACCGCCGAAGCGCAGGCCGCCAGCCGCCGGGCCTGACCCGGCGGCTGTGAGACGGGGGGCTAGACCGCTCCCCGCACGCACGCTCATCCGCCCGCATCGTTCCGAAGCCCCCGCCTCCCCATGGACCGCGGGCGCTTCCGCACGCCCTCACCCCCGAAAGGACCCCCGACCATGGCACCCCTGATCAAGCCGCCGAGCCTCGGCGATCTCGTGAAGTACGAGCTCGAGCCGAACTTCACCCGCGAGACGGTCACGCTGCGGGCCGGCACCGCCTATCCGCTGGGCGCCGTCCTGGGCCTCGTCGCCACCGGCCCCGACGCCGGCCGGTTCGCCTTCGCCGCGGACGAGGCGGAGACCGGCGAGACCGCGGCCGCGGCCGTCCTCCTTGAGCCGGTCGATGCGACGGAGGGAGAACGCCGCGGCACCGTCCTCCGCCGTGGCCCCGCGATCCTCTCCCGCGCGGAGCTGGTCTTCGACCCGAGCCTCGGGGACGAGAGCCAGCGGGCCGGCCGGATCGCCGAGCTCACCGATCTCGGCCTCGTCGTGCGCGAAACGGCCTGAGCCCCCGCCATCCGTCCCGTCCCATCCATTCTCCCCGGCGACCCGCCCGGCGCCGATCCCCGTTCCTGAAGGAGCCCGATCATGACGATCACCCGCAATCCGTTCGACGCCGGCGGCTATTCGCTGGCCGAGATGACCCAGGCCATCAACATCCTGCCGAACCTCTACACCCGCCTCGGCCAGATGGGCCTCTTCCAGTTCGAGGGGGTGACCCAGCGCAGCGTGATCATCGAGCAGGCCGAGGGCGTGCTCTCGCTGCTGCCGTCCCAGCCCTGGGGCGGGCCCGCGACGGTGGGCGGCCGCGAGCGCCGCTCGATGCGCTCCTTCGCGCTGCCCCACATTCCGCATGACGACGTGATCACCGCGGCCGATGTGCAGGGCCAGCCCGCGCTGGGGTCGACCGGGCAGGCCGATCCTCTGGCCGAGGTCATGACGCGCAAGCTCGCGCTCATGCGCCGCAAGCACGCGGCGACCCGGGAATATATGGAGATGAACGCGCTCCGCGGGGTGGTGAAGGACGGGGCCGGTCTCACGCTCTACGACTACTTCGCCGAGTTCGGGCTGGCGCGGATCTCGGTGGACTTCCTCCTCGGCACCGCGGGGACGAACGTCCAGGCCAAGTGCCGCGAGGTGCTGCGGGCGGTCGAGGAGGAGCTCAAGGGCGAGTCCATGACCGGCGTCACCGCCCTCGTGAGCCCCGAGTTCTTCGACAAGCTGATCGGCCATCCGAAGGTCGAGGAGGCCTACAAATACTACGCCTCGAGCGGGGCGCAGCCGCTGCGGCAGGACGTGCGGCGGAGCTTTCCCTTCGCGGGCCTCCTCTTCGAGGAATATGTGGGCTCGGTCACCCTCGCAGGCGGGGCCTCCGAGCGGCTGGTGCCGGCACAGGAGGGCACGGCCTTCCCGCTCGGCACGATGGACACGTTCCGCACCTACGGCGCCCCGGCCGATCTTCTGGAAGCCGTCAACACGATCGGCCAGCCGATCTATGCCCGCCAGCTCCTCGATCCGAAGGGCCGCTGGATCGATCTCATGACCGAGGCCAACATCCTGCCGGTCAACAAGCGCCCGCGTCTCGCGGTGCGGATCCTGACCTCGAACTGAGGCGGCCGCCATGTCCGTCCCCCTCGAGGGCATGGGCGCGACCCTGACCGCCCTCTTCGGCGCGCCCGTGAGCTACCTGCCGCAGGGCGGGCCCGCGCGCGACGTGCCCTCGATCTTCCGCGAGGAGCAGGTCGAGGCCGAGGATCCCGAGGGCCGGATCGTGCTCGTCATGGCGCCCACCTGGCGGGTGCGCCGCGATCTGGTCCCCGAGCTCGCGCGGCGGGACCGGATCCGGCTGGCCGACGGCCGGGTCTATGAGGTGGACGAGATCTGGCCGCCCGCGACCCCGGCGGCGGACGCGCTCACGCGCTGCACCCTGCGGAAGGCCGCGCCATGACCGGGCGCATCCGCTTCCGTCAGATCGCCCGCGCGGCGCTCGCCGCCGATCCGCGCATGGGCAGCTTCTCGCAGATCTCCGCCTGGGAAGCCCGGCCGAATGCCGACCGCCTGCCGCTCCTGATGGTGGTGACGCCGGTCGAGCGGGCGACCCAGGCCACGCTCTCCGCCTTCGAGCGCGCGACCGTCCTGCAGGTGGGCGTGAAGCGGCTCGGCCGAGACGATCTCGAGGATCTCCTCGACGCGGATGCCGATGCCGTCGAGGGCGCCATCTGCCGGGCGTTCCAGCAGGCGGCCATCGTCTGCCTGCCCGAGGAGGTGACGGTCACGCTCAACACGGAAGGCGAGCAGGCCGTAGGCACGCTGATCTCGAGCTTCCGCATCGTCTGGCGCAGGCCGATCCCTCGGCCCGCGCCCTGACCTTGCCCCGGTCCGCGGGCTGGCCTCGGCCGAAGAGCGGGCGATCCATTGCAACCGGGCCTCAGGCACCCCGCACCGCGGGCCGCGGCCCCCTGCCGAAAGGGCAACACCATGAACGATACCGTCACGCCGGGCATCGGCACGCTGATCTATGCCTCGACCGCGCTGCCCGCCGCCGCCACCGAAACGGCCTACGGGGGCCTCACCTGGACCGCGGTGGGCGAGGTCACCGAAGTGCCCGAATACGGCGGCTCCGCCGAGGTGGTGAACCACACGCCGCTCGCGACCGGCATCACGCAGAAATACCACGGCGCGGTGAACTACGGCTCGATGCAGATCCCGCTCGCCTTCAACAGCACCGACGCGGGCCAGGCCATCCTCGAGGCCGCGCGGAAGAACCGCAACCGCATCGCCTTCAAGATCGCCTTCCCGAAGATCGACCCGCTCTCGACCGAGGGGGCGGCCGATTACTTCCAGGGCAAGGTCTTCGGCTTCACCAAGAGCGCGCCCGCCAACGGCGTCGTCTCGGGATCGGTGACCGTCGAGATCGAGACCGAGCTTACCTCGGTCGAGGAGGCCTGAGCCTCCCCCCGGATCCCGCCCGGCGACGGGCGGGACCGGCCGCGGCCGGCGTGGGTCATCGGCGGCGGCCACCCCCTGAACCCGAACCCCAAGGATCATGGACATGGACTTCACCCAGTTCGACAGCCGCACCGCGGCCGAAACCGCCCGCCCGCTGCATCTCCGCCACCCGGCCACGGGCCGGCTCCTCTTCGCCGACGAGGCCGAAGCGAAGCCCTGCGAGGTGCTGGTGCTCGGCTCCGAGAGCCGCGCCGCCCAAGCCGCGATCCGCGCCGCGCAGAAGGCACGGCTGAAGACCGACCGCGACGACGAGCGCCAGACCATGGAGGAGGTCCATGCGAACCTCGTCGCCGCCGCAAAGCCGCTGGTTGCGGGCTTCCGGAACGTGAACCGCGGCGAGGCGCCCGCCGGCCCGGCCGATGCCGAGTGGTTCCTGAACCTCAACCTGATCACCGGCCGCGAGGGCGAGAAGAGCTTCGTCGAGCAGGTCATGGGCTTTGCCACCAGCCGCGCGAACTATCTGGGAAACGGCTCGCCCGACTGACGCTCTATGCGCGTCAGACGGGCTTCCTGCAGGCCACGCCGGAGACGGCGAAGCGGACCCGGATGGAAGATCTGCGGGCGGCCCGGCGGCCGCTCGGCCTGCCCGAGATCGAGGCCGGCAAATATCTGATCGCCTGCCTCACCGCCGAGGACGGTCTCGGCTGGTGCGCGACCGACCCGATGGGCGGGCTCGCCCCGCATTCCTGGGCCGAGATCGAGGCCTACAGCCGCGGGGCAGGCCTCGACCTCGAGCCCTGGGAGGCGCGCCAGCTCCGCGCCATGTCGGCCGCCTACGTCGAAGGCCGGATCGAGGGTCGGAAGAAGAACGGCGTGGCGCCGACCTTCTCCGGCGGAGAGGCCGCCCGGAAGCGCGAGCTGGCTCAGGCCATCAGGGCGCAGATGCGGCTGGCGCAGGCGCCAGCGTGACCGCGGTTCCGGTCGCGACGACCATCACCATCGAACTGTGAATTGCGCCAGCCGCGATTTGAGAATGCGTGAATGAGATCGCTATAACCGCGTCCGCCCCAAGCTCTGAGGCTCTGGTTTTGATATCTTCGAGTGCATCTCTGCGGGCGTCTTGAAAGGCCTTCTCCAAGGTGACGCTCTTTCCGCCGGCGAGGTCGCGGACAGCGACAAACATGTCCTTGAAGGCGTTGAGCCCGAATACCCGTTCTGACGAGACGATACCGAGCCTCTGGATTACCGGCAAATTTGGCGCCGTCTCGGTCGTCAAGATAATCGCATCCCTTTGCTTTTTGCGGGTCGCGGCTTCCTGGGCCGAGGCCCTCTGATCGTCCTCAAGGGATTCCTGTCGGCACCTCCGACATTGGCCGTCAATGACGTCAAGAAATCCGGTCTTGTGGCCGCATGTTTTGCAGACGGGCAT
Protein sequences of DBSCAN-SWA_2 >NZ_CP051469|66939:76537|70794_72141_+|WP_011339611.1|DBSCAN-SWA MNYPMIAGRVFGTPLLVDPVKGAAFLAGLGPRLVNGALELRGLEELAPDRVAEAGRIAPRASVLLDDAGDARREAGRPLYLVEGGVAVIEVTGTLVHRGGWIGQSSGTTSYEGLMAQISAAVADPSVRGIALEIDSYGGEVAGLFDLADAIRAARAVKPVRAFVAEAALSAAYAIASQAERIVLPRTGAVGSIGVLLVHADFSQAMADRGVAVTLIHAGRHKVDGNPYEALPEGVRADLQARVEASRALFAETVAAGRGARLSRQQALATEAQVLDGAAAVAAGLADEVSDLRRAFAAFRAELFDPHLTSPRAGTPAAAKETPTMTDETTTGAARGTAAEGSAPPMETAEGSGGGAPAANVAVAEAAELIEIGQQAARLGLAVDVADAMRRGLSAAALRRTVLDGLAARGDGADLVAHAPTAAAGPKESPLLAAARRTAEAQAASRRA >NZ_CP051469|66939:76537|76039_76537_-|WP_017140287.1|DBSCAN-SWA MPVCKTCGHKTGFLDVIDGQCRRCRQESLEDDQRASAQEAATRKKQRDAIILTTETAPNLPVIQRLGIVSSERVFGLNAFKDMFVAVRDLAGGKSVTLEKAFQDARRDALEDIKTRASELGADAVIAISFTHSQIAAGAIHSSMVMVVATGTAVTLAPAPAASAP >NZ_CP051469|66939:76537|75714_76077_+|WP_017140288.1|DBSCAN-SWA MEDLRAARRPLGLPEIEAGKYLIACLTAEDGLGWCATDPMGGLAPHSWAEIEAYSRGAGLDLEPWEARQLRAMSAAYVEGRIEGRKKNGVAPTFSGGEAARKRELAQAIRAQMRLAQAPA >NZ_CP051469|66939:76537|72747_73770_+|WP_011339609.1|capsid|DBSCAN-SWA MTITRNPFDAGGYSLAEMTQAINILPNLYTRLGQMGLFQFEGVTQRSVIIEQAEGVLSLLPSQPWGGPATVGGRERRSMRSFALPHIPHDDVITAADVQGQPALGSTGQADPLAEVMTRKLALMRRKHAATREYMEMNALRGVVKDGAGLTLYDYFAEFGLARISVDFLLGTAGTNVQAKCREVLRAVEEELKGESMTGVTALVSPEFFDKLIGHPKVEEAYKYYASSGAQPLRQDVRRSFPFAGLLFEEYVGSVTLAGGASERLVPAQEGTAFPLGTMDTFRTYGAPADLLEAVNTIGQPIYARQLLDPKGRWIDLMTEANILPVNKRPRLAVRILTSN >NZ_CP051469|66939:76537|69307_70798_+|WP_011339612.1|portal|DBSCAN-SWA MAGPFLRRLGAWVGGFDAGLANRRLRGFRPARAHVNALLAAAGPDMNARARYLVRNNGYAQGALDSWAANTVGTGVKPSSLIAAPARKAALQRLWQDWTDEADAEGVTDFYGLQRRIAREFFLTGECFVRLRARRPGDGLTVPLQLQCLPSEMLPIGRTEVLGGGRAIRQGIEFDAVGRRVAYHFHRRHPGDPTEPGLAGETVRVPAEDVLHIVDPVESGQLRGVSRFAPAIVKLFLLDQYDDAELDRKKVAAMYAMFITSNDPDAAPLEGELGDQVAPGQIVRLDPGEDMKVADPADSGATYEPFQYRTLLQVSAALGIPYAHLSQDMVKANYSNARTALMEFRRRVEAFQHSVLVYQLCRPVWARFTDLAVLTGAVGLPGYERRRRDYLACEWLPPKWQWVDPLKDIRAEIEEIGAGLKSRSQAIGERGYDAEEVDRQIAADRKREGRLGLDFRRSAQGPSAPATQDGARADEEDDEDDDGRAADRDAGRRAEP >NZ_CP051469|66939:76537|73780_74107_+|WP_011339608.1|DBSCAN-SWA MSVPLEGMGATLTALFGAPVSYLPQGGPARDVPSIFREEQVEAEDPEGRIVLVMAPTWRVRRDLVPELARRDRIRLADGRVYEVDEIWPPATPAADALTRCTLRKAAP >NZ_CP051469|66939:76537|66939_69102_+|WP_011339613.1|terminase|DBSCAN-SWA MVEMLDRGIGRLTRIPPLPPFTAPEEILADALPLLDPPSRVTVTEAAERHMRVPVQGNWVPFDRAVTPYTVEPADMTQSRRFKAVVFLGPSQSGKSQMMQSVSAHAVTCAPGPVQVIHMTKTDADAWVEEKLDPTILNSPALRERLGTGRDDSTFSRKRFKGMRLTIGYPVPNQLSSRSQRLVMLTDYDHMPQKLGPKDSPEGSPFGMALQRIRTFMSRGCVLAETSPAFPVDPNADWAPHAGHPHMLPPATAGLVPIYNEGTRGRWYWECPDCGDLFEPRFDRLHYDADLDPGAAGEQAMMECPHCGTLIAHRHKVGLNRAALEGRGGWLHEGRHIEANGRRALVRIDDPDIRRTPIASYSLNGAAAAFASWEELVQRYETERRRFEALGDDTDFARVHYTDIGVPYRRPEAEEEGALTAAQIREHMRSQERRVAPAWTRFVTVSIDVQGNRFEVLVMAWGAQGERMPIDRFAVAQPPDHAPRAKGDDERYRALDPGRYVEDADALLDLPERLYPVEGASWSLKPCALVIDFNGPAGWSDNAEKFWRARRRNGQGGLWWLSIGRGGFQQRDRVWHEAPERGSKGRRARGIKLLNMATDRMKESVLAAVGRFEGGQGAQHVPSWLEAEHLDELLAERRGAKGYEKRQGAVRNETLDLSVQALAVAEFKGLNRIDWQAPPAWAEAGPANPFAVAVSAAAAEAAPAPRRRARTSRSRYMEGS >NZ_CP051469|66939:76537|69098_69308_+|WP_002723266.1|DBSCAN-SWA MTLDDMERRLTGLLDIRHRGVRSGSVGSERVEYQSDADLARAIADLERRIAKARKTARRVIRPYAVKDL >NZ_CP051469|66939:76537|72276_72666_+|WP_011339610.1|head|DBSCAN-SWA MAPLIKPPSLGDLVKYELEPNFTRETVTLRAGTAYPLGAVLGLVATGPDAGRFAFAADEAETGETAAAAVLLEPVDATEGERRGTVLRRGPAILSRAELVFDPSLGDESQRAGRIAELTDLGLVVRETA >NZ_CP051469|66939:76537|75211_75652_+|WP_017140289.1|DBSCAN-SWA MDFTQFDSRTAAETARPLHLRHPATGRLLFADEAEAKPCEVLVLGSESRAAQAAIRAAQKARLKTDRDDERQTMEEVHANLVAAAKPLVAGFRNVNRGEAPAGPADAEWFLNLNLITGREGEKSFVEQVMGFATSRANYLGNGSPD >NZ_CP051469|66939:76537|74103_74526_+|WP_011339607.1|DBSCAN-SWA MTGRIRFRQIARAALAADPRMGSFSQISAWEARPNADRLPLLMVVTPVERATQATLSAFERATVLQVGVKRLGRDDLEDLLDADADAVEGAICRAFQQAAIVCLPEEVTVTLNTEGEQAVGTLISSFRIVWRRPIPRPAP >NZ_CP051469|66939:76537|74637_75108_+|WP_011339606.1|DBSCAN-SWA MNDTVTPGIGTLIYASTALPAAATETAYGGLTWTAVGEVTEVPEYGGSAEVVNHTPLATGITQKYHGAVNYGSMQIPLAFNSTDAGQAILEAARKNRNRIAFKIAFPKIDPLSTEGAADYFQGKVFGFTKSAPANGVVSGSVTVEIETELTSVEEA |
12 | Acidithiobacillus_phage(42.86%) | capsid,terminase,portal,head | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
85051 : 90581
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP051469|85051:90581|DBSCAN-SWA GATGCCGGAAAAGGGACTGATCGACACCATCACGGCGCTCTGGGGCGGGGCCATCGCCACGCTGATCGCGGCCGCCATGGGGCGCCTGATGTATCACACCGGCGAGGTCCGCGCCCGCCGCCGCGCCTTCTTCGGCCGCGAGCTCCTCTGGGAGATCCCCGCCCTCGTCGCCATGGCCTTCGTGGGCGAGGCGCTCAGTTCGTATCTCAGCCTCGATGGTCGGGCGGCGATGGGCCTCGTCGCGATGCTGGCCTATCTCGGGCCGAGGGGCACCACGGCGATGCTGGAGCGGCTCTGGCGGGGACGGAGTGCGGGGTGAAGCGCCAGCGACAAACCGGCACATCTGCCTCCGCGACTGCGGCCCTGCCGCGCCGAAGCGTTCGCCGGATACTGTCAGCACAGGCTGAATTCCTTCGACCGAGCGGGACGTTAGCGCCAGCGCTCGCCGGGCCGCCGCAAACTCCGCCCGTCCATACCGGAACGAGAGAAAAGGCCGGCCACGGTTTCGCCACGGAGGTAACGAACTTTTGACCCGCGGATGAGAAAAGCCAATCTCTCGAAGTGGAGAACATGATGAAAAGCAAATACGTGGCTGCCGTCAGCCTGATGATCCTGCTTGCGGCATGCGCGAAGCAACCTGACCGCATCGCTGCCGTGGAAGTCGGAGGCGACAACTACTCTCGGCATAACTGCCGCCAATTGGCGAGCGAGCGGATGGCGATCTCTCAGGATCTTGCCAATCTGAGCGCAAAGCAGAAGTCCGCGGCCAATGGGGATGCCTGGGGGGTCTTTCTGCTCGGGCTTCCGCTCTCCAGCATGTCCGGAGGCGATCAGGAGGCCATGATTGCCATCGCGAAAGGCAAGATTCAGGCAATCGACCGGCAAGTGGTGGCAAAGGGCTGCCCCGTGAGCCGCCCGTTGACCCAGGACTGATCTCTCGGCTTTCGTGCCACCGCTGGCACCAGCAGCGAAGGTGCGGGAACCTCACGCCTGAGGCTCCCGCACAGAACCTCCATCAGCGTCGCCGCCTGCGATCACGATGTGGCCGCCGTCTCAGGTGGCTCGAGCGCTTGCGGGGTTTGATCCCCGCTATTCCCCAACGCCCCAAACTTCGACAATCCGAGCTCGGCGCGCCCTTTCTCAGGGCGCCCGCAGCGCCCCTCAGGCAGGGGCCGGGCTGCGTCAACAGCCCGAACCACGCGGCCAATGTGCAACCAGCGACCGCGCCAGCCTGTCCGAAGCCTTTCAGACTGCCTGCCACCCCTCGCGAGGGCAGGCGCCTTGTGAGGCAGAATCACCTTATGAAACAAGTACCTCCCGCCGCCCCGGTTGCCCCCTGGCTCGGCGGCAAGAAACGTCTCCACCCGCTCATCCTCGAGCGGATCGAGGCCATCCCGCACCGCGCCTATGTCGAGCCCTTCGTCGGCATGGGCGGGATCTTCCTCCGCCGCCGCTTCCGGCCCCGCCTCGAGGTGATGAACGACCGCAACGGCGAGATCATCAATCTCTTCCGGATCCTGCAGCGGCATTACCCCCAACTCCTCGAGATCATGCGGTTCCAGATCTGCAGCCGGCGCGAGTTCGATCGGCTGCGGCTCACCGATCCCGCCACGCTCACCGACCTCGAGCGGGCGGCGCGGTTCCTCTACCTCCAGCGCCTCAGCTTCGGCGGCAAGCCGAACGGCAGCTTCGGCATATCGCCCGGCAATGGGCCGCTCTTCTCGCTGGCGCGCCTCGAGCCGCTCCTCGACGCCGCCCGCACGCGGCTCGATGGCGTGGTCTTCGAGTGCCTCGACTGGGCCGACCTCATCCCGCGCTACGACACAGCCGAGACGCTTTTCTATCTCGATCCGCCCTACTTCGGCGGCGAGAACGACTATGGGAAAGGTCTCTTCGACCGGGCGCAGTTCGCGCGGATCGCCGAGATCCTCGGCAGCCTCAAGGGCGCCTTCCTCCTGTCGATCAATGACACGCCCGAGATCCGGGCGCTCTTCGGCCGGTTCCATCTCGAGCCGGTGCGGCTGAATTACTCGGTCTCCGCTGCAGGCAGCACCGAGGCGCAGGAGCTCCTCGTCTCGAACCGCGAGCGGATCGCGACCCTCCTCTGAAAAACCCCACCAGTCCGACCACCCACGCCCCGCCCTCGCGCGGGGCTTTTGCATATGGAGAAAGACGTGACGACATCCGACATCCAGCGGCTGCTCGCGGCCGCGGGGCTCTACCGAGGCGCCATCGACGGCGATGCGGGGCCGCTGACCCAGGCGGCCGCAAGGGCGGCGCTCGAGGGAGAGCCGGTACCTTGGCGCGCCTGGCCCTCCCGCCGGCAGCGGATCGCGGCCGGCCAAGCGGTGCTGGCGCGGCTCGGCCATGCGCCGGGAAGGATCGACGGGCTCCTCGGGCCGAACACCCGCGAGGCGCTGACCGCATGGGCCTCCGGGCCGGTGCGCGCCGCCGTCGACCGGGTGCCGCTGCCGGGCCATGCCGTGGCCGATGCCCAGGGCGCCTATCCCCGGCAGGAGTCGGTCGCGACCTTCTACGGCGTGGCGGGCGGCCCCGACTGCACGGCCGGGATCGTCGAGCTGCCGATGCCGTTCCGCCTGGCTTGGGATCTCAATACGAGCATCACGAGCTTCCGCTGCCACAGGCTGGTGGCGGCGCCCATGGCGCGGATCTTCAGCGAGGCCGTGGCCCACTACGGCGCGGCCGAGTTCGAGCGGCTGCGGCTGAACCTCTTCGGCGGCTGCTTCAACCACCGGCCCATGCGCGGCGGCTCGGCCCTCTCGATGCATGCCTGGGGCATCGCCGTCGACCTCGACCCGGAGCGCAATCCGCTCCGCTGGGGCCGCGACCGGGCGAGCTTCGCCGCGCCCGCCTACGAGCCCTTCTGGACCATCGTCGAAGCCGCCGGCGCCACAGGCCTCGGCCGCGCCTGCAACCGCGACTGGATGCACTTCCAGTTCGCCCGCCTCTGAAGGAGAGAACCCATGTCCCTGATCTTCATCCTTGCCCTCGCCCTCTGCGCGCTCGCCTTCATATTGCTCATCGGCTGGCCGCTGGTCGCTGCTCTGGCGCGGTTGGCCGCTTCGACGATCTCGGCCTCGATCCTCGCCACGCTCGCCGGGCCCGCCGCCGCCTCGACCGGTAGCGACCTGCTGACCGCGCTGACGCCGAGCCTCCTCGATCTCGCTGGCGTAGCGCTGACCGCGCTGATCGGGCTCGCGACCGTCCGCTTCCAGCGCTGGACCGGGATCCAGATCGAGGTCCGACATCGCGAGGCGCTGCACTCGGCGATCATGACCGCTGCACGGGTGGCGGTAGCGCAGGGGTTGACGCGGGAGGTCGCGACAGAGTTTGTCGCTGCCTATGCCCGCTCCTCGGTGCCCGACGCGCTGAAGCGCCTGTCGCCCTCGTCCGAGACGATGGATGCGCTTGTCCGCTCGAAGCTGCTCGAGGTCCGCGGACGCTGATGGCCACGCTGGCAGGCGACCTGATGCGCGCGCCAGCCGTGCCCCTTCCACGGCCGCACCGCTGCGACCGGTCATTGCGTCGCGAAATATGTACAGAGATTTCGGCGGGCCTCGGACGGATAACGCACCAGGGTCGTGGCTTGAAGCACAGGTTGCAGGCCGGTAGCAACCGTCACCTTCACCCGCTCATCGTTGAGCAGATCTTCCGCCGGGCGTACATAGCGCTCAACACAGCGAGCAATGGTGAGCCGAATGGCAGACACGAGATCTCCGGAGCAACGCCGGCAGATCATGCGGGCGGTGGGGACCAAGAACACGGGCCCCGAGCTCATCGTGCGCTCGTTGCTGCACGCAAAAGGCTATCGCTATCGACTCCACGCGAGGTCTCTGCCCGGACAACCGGACATAGTCTTCCCAGCCCGGCACAAGGCCATCTTCGTGAACGGGTGCTTCTGGCACGGACACGACTGCGCAAAGGGGCGGTTGCCGACCTCTCGACTGGACTACTGGGGCCCGAAAATCGAGGCGAACGTCGCACGGGATCGCAGGAAGCTCTCCGAGCTCGAGGCGCTCGGCTGGCAGACGCTGGTCGTCTGGCAGTGCGAACTGAAAGACATCGAGCGCGTGGGATCGCGGTTGTGCGCATTCCTCGGGCCGACGAAAACCGATCGACAAGCATGATGCGGTAGGCTAATCTGCAGTCCGATCCGCGCCAGCCGAGGAAGCAAACCTTTGAGACCTATTGGAATCGACCTGTTCGCCGGCGCAGGAGGCCTGTCGCTCGGCTTCGAGCAGGCGGGCTTCGACGTTGCCGCTGCCGTCGAGATCGATCCCGTGCACTGCGCCGTGCACAAGTTCAACTTCCCCGACACCGCCGTAATCCCGCGGTCGGTCGTGGGGCTCACGGCCGAGGAGATCCGCGAATCCGCCGGCATCGGGAACCGCCCCATCGACTGTGTCTTCGGTGGGCCGCCCTGCCAAGGCTTCTCCTTGATCGGGCACCGCGCCCTGGAGGATCCACGCAACAGCCTGGTGCTCGAGTTCGTACGCCTGGTGCGGGAGCTCGATGCACGGACCTTCGTCTTCGAGAACGTCAAGGGGCTGACGGTTGGCTCCCATCGGACCTTCCTCAGCGAGCTCGTCGCCGCATTCGGCATGGCGGGCTACGATGTTCGCTTGCCTTGGAAGGTTCTCGATGCGGCGGACTACGGAACGCCGCAGCATCGGCAACGCCTTTTCCTCATCGGAGCGAAGCGAGGCGAGACGCTTCCCGAGATCCCCCCGCCCCAGACGAATGCAGCCGACGCGCGAAAACCGCTCGCTCACCTTCCTGGCGGCCCGACCGTTCGGGACGCGATCGAGGACCTTCCCGATGCAGATCGGTTCGCCTCGCTCGTGGAGAGCGACGCTGTTCGGACCTCTGCCATGGGCGAGCCGTCTACCTACGCGGCCGAATTGCGCTGCCTGAACAACGATGCCTGGCATTACGGCTATCCCCGGAACTGGACGCCGACATGGATGACATCCAGCGCGCGCACGGCCCATTCGGAAATTTCTCGCCGGCGCTTCCAAGAGACCCCGCAAGGGGCGGTTGAGCCGATCAGCCGCTTCTACAAGCTGGCACCGGGCGGCCTTTGCAACACTCTGAGGGCAGGCACTGACGGTGCCCGGGGGGCCTTCACCAGTCCTCGGCCGATCCACTACGAGTACAACCGCTGCATCACCGTGCGCGAGATGGCGCGGCTTCACGGGTTCCCCGATTGGTTCAGGCTCCACGCCACTAAGTGGCATGGCGGAAGGCAAGTCGGCAACTCCGTGCCGCCGCCCCTCGCCCGCGCGGTCGCCTCCGAGATAGTCCGAGCACTGGGCGTCGCCCCGGAGCGGCCTGCGCGTGCAATCGATTTGGGAGAGCCCTCGTTGCTCTACATGGAGATGTCGGAAGCGGCGGAGCACTTCGGAGTGGCAGCGCCGTCCTCGCGGCGCGACCGAAAGAGCGGCGCAAAGAAGCGCAAGCAGCACGAGATCGAAGCGGCGCGTGTCCAATTACGTGTGGTAAATGGCTGA
Protein sequences of DBSCAN-SWA_3 >NZ_CP051469|85051:90581|88769_89198_+|WP_011339592.1|DBSCAN-SWA MADTRSPEQRRQIMRAVGTKNTGPELIVRSLLHAKGYRYRLHARSLPGQPDIVFPARHKAIFVNGCFWHGHDCAKGRLPTSRLDYWGPKIEANVARDRRKLSELEALGWQTLVVWQCELKDIERVGSRLCAFLGPTKTDRQA >NZ_CP051469|85051:90581|89249_90581_+|WP_011339591.1|DBSCAN-SWA MRPIGIDLFAGAGGLSLGFEQAGFDVAAAVEIDPVHCAVHKFNFPDTAVIPRSVVGLTAEEIRESAGIGNRPIDCVFGGPPCQGFSLIGHRALEDPRNSLVLEFVRLVRELDARTFVFENVKGLTVGSHRTFLSELVAAFGMAGYDVRLPWKVLDAADYGTPQHRQRLFLIGAKRGETLPEIPPPQTNAADARKPLAHLPGGPTVRDAIEDLPDADRFASLVESDAVRTSAMGEPSTYAAELRCLNNDAWHYGYPRNWTPTWMTSSARTAHSEISRRRFQETPQGAVEPISRFYKLAPGGLCNTLRAGTDGARGAFTSPRPIHYEYNRCITVREMARLHGFPDWFRLHATKWHGGRQVGNSVPPPLARAVASEIVRALGVAPERPARAIDLGEPSLLYMEMSEAAEHFGVAAPSSRRDRKSGAKKRKQHEIEAARVQLRVVNG >NZ_CP051469|85051:90581|86351_87158_+|WP_011339595.1|DBSCAN-SWA MKQVPPAAPVAPWLGGKKRLHPLILERIEAIPHRAYVEPFVGMGGIFLRRRFRPRLEVMNDRNGEIINLFRILQRHYPQLLEIMRFQICSRREFDRLRLTDPATLTDLERAARFLYLQRLSFGGKPNGSFGISPGNGPLFSLARLEPLLDAARTRLDGVVFECLDWADLIPRYDTAETLFYLDPPYFGGENDYGKGLFDRAQFARIAEILGSLKGAFLLSINDTPEIRALFGRFHLEPVRLNYSVSAAGSTEAQELLVSNRERIATLL >NZ_CP051469|85051:90581|87212_88022_+|WP_011339594.1|DBSCAN-SWA MEKDVTTSDIQRLLAAAGLYRGAIDGDAGPLTQAAARAALEGEPVPWRAWPSRRQRIAAGQAVLARLGHAPGRIDGLLGPNTREALTAWASGPVRAAVDRVPLPGHAVADAQGAYPRQESVATFYGVAGGPDCTAGIVELPMPFRLAWDLNTSITSFRCHRLVAAPMARIFSEAVAHYGAAEFERLRLNLFGGCFNHRPMRGGSALSMHAWGIAVDLDPERNPLRWGRDRASFAAPAYEPFWTIVEAAGATGLGRACNRDWMHFQFARL >NZ_CP051469|85051:90581|85051_85369_+|WP_011339597.1|DBSCAN-SWA MPEKGLIDTITALWGGAIATLIAAAMGRLMYHTGEVRARRRAFFGRELLWEIPALVAMAFVGEALSSYLSLDGRAAMGLVAMLAYLGPRGTTAMLERLWRGRSAG >NZ_CP051469|85051:90581|85620_85983_+|WP_011339596.1|DBSCAN-SWA MMKSKYVAAVSLMILLAACAKQPDRIAAVEVGGDNYSRHNCRQLASERMAISQDLANLSAKQKSAANGDAWGVFLLGLPLSSMSGGDQEAMIAIAKGKIQAIDRQVVAKGCPVSRPLTQD >NZ_CP051469|85051:90581|88034_88517_+|WP_011339593.1|DBSCAN-SWA MSLIFILALALCALAFILLIGWPLVAALARLAASTISASILATLAGPAAASTGSDLLTALTPSLLDLAGVALTALIGLATVRFQRWTGIQIEVRHREALHSAIMTAARVAVAQGLTREVATEFVAAYARSSVPDALKRLSPSSETMDALVRSKLLEVRGR |
7 | Rhodobacter_phage(16.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
229781 : 236952
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_CP051469|229781:236952|DBSCAN-SWA GTCAGTTCCCGGCCACCTCAAGCAGCTTAGAGCGGACAAGCGCGTCGAGCGTATCAACCGAGGGCGACAGGCGCTTCATTGCCTCAGGCACGGAGGCGCGGACGTAAGACGAGACGAACTCGGCCGCGATCTCGCGTGTCAGTCCCCGCGCCACCGCCACGCGGGCAGCGGTCATGATTGCCGAATGCAGCGCCTCACGGTGCCGGGCCTCGATCTGAATCCCAGTCCAGCGCTGGAAGCGCACCGTCGCGAAGCCGATCAACGCGGTCAGTGCCACGCCCGCGAGATCGAGGAGGCTTGGCGTCAGCGCGGTCAGCAGGTCGCTGCCGGTCGAGGCTGCGGCGGGCACCGCGAGCGTGGCGAGGATCGCGGCGGCGATCAGCGAAGCGCGCATCCGGGCGATGGCCGCGACCAGCGGCCAGCCGACGAGCAGGATGAAGGCCAGCGCGCAGAGGGCGAGCATGAGGACGAGGGGCGCCTGGGGCGCGAGAGTGAAGATCGGGGACATATGGTCTCTCCTTCAGAGGCGGGCGAACTGGAAGTGCATCCAGTCGCGGTTGCAGGCTCGGCCGAGGCTCGTGGCTCCGGCGGCTTCGACGATGGTCCAGAAGGGCTCGTAGGCGGGCGCGGCGAAGCTCGCCCGGTCGCGGCCCCAGCGGAGCGGGTTGCGCTCGGGGTCGAGGTCGACGGCGATGCCCCAGGCGTGCATCGAGAGGGCCGAGCCGCCACGCATGGGCCGGTGGTTGAAGCAGCCGCCGAAGAGGTTCAGCCGCAGGCTCTCGAACTGAACGGCGCCGTAGTGCGCCACCGCCTCGCGGAAGATCCGCGTCAGGGGCATGGCCACCAGCTTGTGGCACCGGAAGCTCGTGATGCCCGTGTTGAGATCCCAGGCGAGGCGGAACGGGAACGGCAGCTCGACGATGCCGGCGGTGCAGTCGGGACCGCCCGCCACGCCGTAGAAGGTCGCAACCGACTCCTGCCGCGGATAGGCGCCCTGGGCATCGGTCACCGCGTGGCCCGGCAGCGGCACCCGGTCGACGGCGGCCCGCGCCGGCCCGGAGGCCCAGGCGGTCAGCGCCTCGCGGGTGTAGGGCCCGAGGAGCCCGTCGATCCGACCCGGCGCGTGGCCGAGCCGCGCCAGCACCGCCTGGCCGGCCGCGATGCGCTGCCGGCGGGAGGGCCAGGCGCGCCAGGGCACCGGCTCTCCCTCGAGCGCCGCCAGTGCGGCCGCCTGGGTCAGCGGCCCCGCATCGCCTTCGATGGCGCCGCGGTAGAGCCCGGCGGCCGACAAGAGCCGCTGGATGTCGGATGTCGTCACGTCTTTCTCCATATGCAAAAGCCCCGCGCGAGGGCGGGGCGTGGGTGGTTGGACTGGTGGGATTGCTCAGAGCAGGGTCGCGATCCGCTCGCGGTTCGAGACGAGGAGCTCCTGCGCCTCGGTGCCGCCCGAGGCGGAGACCGAGTAGCTCAGCCGCACCGGCTCGAGGTGGAACCGGCCGAAGAGCGCCCGGATCTCCGGCGTGTCATTGATCGACAGGAGGAAGGCGCCCTTCAGGCCGCCCAGGATCTCGGCGATCCGCGCGAACTGCGCCCGGTCGAAGAGGCCTTTTCCGTAGTCGTTCTCGCCGCCGAAGTAGGGCGGATCGAGATAGAAAAGCGTCTCGGCTGTGTCGTAGCGCGGGATGAGGTCGGCCCAGTCGAGGCACTCGAAGACCACGCCATCGAGCCGCTCGTGGGCGGCGTCGAGCACCGGCTCGAGCCGCGCGAGCGAGAAGCGGGGCCCGTGCCCGGCCGAGACGCCGAAGACCCCGTCGAGCTTGCCGCCGAAGCTGAGGCGCTGGAGGTAGAGGAACCGCGCGGCCCGCTCGAGGTCGGTGAGCGTGGCGGGATCGGTGAGCCGCAGCCGGTCGAACTCGCGCCGGCTGCAGATCTGGAACCGCATGATCTCGAGGAGCTGGGGATAGTGGCGCTGCAGGATCCGGAAGAGGTTGATGATCTCGCCGTTGCGGTCGTTCATGACCTCGAGGCGGGGCCGGAAGCGGCGACGGAGGAAGATCCCGCCCATGCCGACGAAGGGCTCGACATAGGCGCGGTGCGGGATGGCCTCGATCCGCTCGAGGATGAGCGGGTGGAGACGTTTCTTGCCGCCGAGCCACGGGGCCACCGGGGCGGCGGGAGGTACTTGTTTCATAAGGTGATTCTGCCTCACAAGGCGCCTGCCCTCGCGAGGGGTGGCAGGCAGTCTGAAAGGCTTCGGACAGGCTGGCGCGGTCTTGGGTGAGACTTTGGCCGCGTGGTTCGGGCTGTTAACGCAGCCCGGCCCCTGCCTGAGAGGCGCTACGGCCGCGCTGTCGCGCGACCGCTTTTGACTACACACCTATATTGATTGCTGCCTTTTGCGTGTGTATATTTAATCCCGTGAGCGACGGCATCGAACTCGAATGGAACGAAGATAAGCGGCAGGCCACGCTGGCAGACCGCGGCCTCGATTTCGCCGACGTTGCTCTGATCGACTGGGATGCCGCGCTGACCCTCGAGGACACGCGGCGGCCCTACCCGGAGACCCGCTACATCACCGTCGCGCCCATCCGCGACCGTCTTTGCGTTGTGGCCTGGTGCTGGCGGGGTGACGTGCTCCGCGTGATCAGCCTGCGCAAGGCGAACGCCCGAGAGGAGAGGAAGTATGGCTAAGGCAAAACCCTACATCGGCGACGACGACGAGGTCCGCGAGCTGGACGATCCCTTCTTTGCAAACGCGCGCCGCGGCCGGCCGCCGAAGCCGAGCGAGCAGAAGAAGGTGCGCATGAACCTGATGATCGATCCGGAGCTCGCCTCCAGGCTGGACGGGATGCCGAACAAGAGCGCCTTCGTGAACGAGGCGCTGCGGAAGGCGCTCGCATCCTGATCGTGCCGAAGGGCGGCCTTGGAGCGCTCACCGGTTCCGTGGTCATGCGCCGTCACCCGGCGCTCCTCCCCCGCCAGAGCCGTTCGATCATCGCCGTGGTGCCCCTTGGCCCCAGATAGGCGAGCATCGCCACGAGCCCCATGGCCGCCCGACCGTCGAGATTGAGGTACGAGCTGAGAGCCTCGCCGATGAAGGCCATGGCGACGAGGGCGGGGATCTCCCAGAGGAGCTCGCGGCCGAAGAAGGCGCGGCGGCGGGCGCGGACCTCGCCGGTGTGATACATGAGCCGGCCCATGGCGGCCGCGATCAGCGTGGCGATGGCCCCGCCCCAGAGCGCCGTGATGGTGTCGATCAGTCCCTTTTCCGGCATCGCGCCGCTCCTGTTGTGTGCGCCATGCGCGTATGTCGGCCCGGCCGGGAGCGGCCGGGTCCTGTCATGATGATCGGGAAGGCGGGGGCGCCTCAGGCGGCCGACTGCTTGACGCGCGCCTGCTCGATGAGACTCACAGCGTCACCTGCGCGGCCTCAATGAACAGCCGGTCGGTCTCCTCGGGCGAGAAGCCGAGGAGGAAGGCCAGCGTGTCCATCGTCTCCGATCCCCGGACGAGCGTCGTCGCGCTGCGGATCGCCGCCCGCATTGCCCAGGGGTAGGAGGCGTTGTCTGCGATGGTCATGGCCTGGCCCCACTTCTCCTCGCCGATCACGATCATGGCCTGCAGCAGGCTGATCTCGGCCGGGATCAGGTCGCGCTGCGCCTTGATGTCCCGGGCCTTGATGAGCCTGCTGTAATCGATGGTCGCGTTCGTCATTGCTGCTCTCTTCCTGCTCATGGATCAGCCTTCCGCGGACGGCGGGAGGTAGGGCGGGATCGGGATGTCCTGATCCTCGACCTCGAGGATGCCGGGGTTCGGCAGCGGATTGTCCCCGTAATAGCCGTGGGGGAAGACCACCTCGAACTCGAGGACGCCGTTCACGCGGGTGACGTTGCTGGTGACCCAGGTATTGTCCATGGCGTCCCAGGGCAGCACGTCTCCCTCCTCGACGCCGCTGAAGTCGTAGACCATGTCATTGCAGGTCAGGGTGTCGCCCTGCACGCGGAAGACGGTCAGCTCATACATCCGCCGAAGCGGAACCATGCGAATGCGCATCATGGCGGGTTACCTCCAGCGGCCGATGACGAGAAGATCGACGGTCTGCGTGGCCCCGTAGGACGTCGGCGACAGCAGCATGTAGGCGGCGACGTTCGAGGCCCCCGGCCGGCCGAGGAGCGAGATCACCGAGCCACCCCGGCAGCCGCCGGTCACGCAATGGCCGTTCAGGGCCGCGCTCACGAACTCGACGGGCAGCGTCACCGTCTGCCATTCGGTGCGATAGAGCGAGCCCTGCGCAATGGTCGGACCCGGGACCCCTGTCTGGACCAGGCGGCAAAGCTGCGTCCCATCCGCAAAGCGGACATACTCGGCGCCAGCCGTCTCTCCCTTTTCGATGATGCCGCCGCGCGGGAAGCCGGAGGACCAGCTGACGGCGCCCACGATGTTGCGCTGGTCGTAGGTGAGGAACCAGTCGCCCCAGCTGGTATCCTTGTAACGGCACCAGCGCCCGGTATCGCTGGCCGTCTGGGGATAGGCGATCTGCACCGCCCGCGCGGCCCCGTGCTGGATGTGCTGCAGGGTGCCGACGCCCATGCCGGCGGGACGGTTCAGCGTGGCGCTTGTGACCTGATAGAAGCCCGTCGTCCCGATCTGGTCCGCGTCGTTGCCCGGGATCGGGCGCGCCGTGCCGCCGAGGCCATAGTCTCCGACCTTCAGGAGGCGCCCGGGCGTGGTGTCGAGATCGCCCTGGGTCACGGCCGTGCCGGAGAGGAGCCCCTGCAGCTGCATGCCCGACGGCGTGAACCGCGCCCGTTCCGTCCCCTGGCAGGAGACGCCGAGCTCGTTCTCGGCCGCGAGGAAGAAGCCCGTGTTCGAGCCCACCTCGCCGTTGAAGGTGAGGCCGGGGAAGGTCGGCGTAGCGGCGGCCTTCCGGGTAATCTTTCGCCTCGACCAGTCCGAGGGTGAGCAACCTCGGCCGCAGATCGGACACCTCAGGCGGGTGCGTGGCGCCGACCGGAAGGGCCTTGATAGTCAGCAGGGCCGCGCGCAACGCGGTCGCCTCATCGCGTGTCATCTTATTGCCTCCATGGGATCAAGCGGGCCGGGGAACCGGCCTGCGCCAGACGATGCGGAAGCTCGAGATCAGCGTGCCCACGGCCTGCTCGCCTTCGGTGTTGAGCGTGACCGTCACCTCCTCGGGCAGACAGACGATGGCCGCCTGCTGGAACGCCCGGCAGATGGCGCCCTCGACGGCATCCGCATCCGCGTCGAGGAGATCCTCGAGATCGTCGCGGCCGAGCCGCTTCACGCCCACCTGCAGGACGGTCGCGCGCTCGAAGTTGGAGAGCGTGGCCTGCGTCGCCCGCTCGACCGGCGTCACCACCATGAGGAGCGGCAGGCGGCCTGCGTCGGGGCGCGCCTCCCAGGCGGAGATCTGCGAGAAGCCGCCCATGCGCGGATCGGCGGCGAGCGCCGCGCGCGCGATCTGGCGGAAGCGGATGCGCCCGGTCATGGCGCGACCTTCCGCAGGGTGCAGCGCGTGAGCGCGTCGGCCGCCGGGGTCGCAGGCGGCCAGATCTCGTCCACCTCATAGATCCGGCCGTCCGCCAGCCGGATCCGGTCCCGCCGCGCGAGCTCGGGAACCAGATCGCGGCGCACGCGCCAGGTGGGCGCCCTGACGAGCACGATCCGGCCCTCGGGATCCTCGGCCTCGACCTGCTCCTCGCGGAAGATCGAGGGCACGTCGCGCGCGGGCCCGCCCTGCGGCGCGGGAGGAAGAGGATGCGAGCACGGGGAGCGGGTCTGCCCCCCGTCTTGCATGCGCCGGGTCAGGCCCGGCGGCTGGCGGCCTGCGCTTCGGCGGTGCGGCGCGCGGCGGCAAGGAGCGGGCTTTCCTTCGGCCCGGCGGCAGCAGTCGGGGCATGGGCCACGAGGTCGGCCCCGTCGCCCCGGGCCGCCAGCCCGTCGAGCACGGTGCGGCGGAGGGCGGCAGCCGAGAGACCGCGGCGCATCGCGTCGGCCACGTCGACGGTCAGGCCGAGCCGGACCGCCTGCTGGCCGATCTCGATCAGTTCGGCGGCCTCGGCCACGGCGACATTCGCCACCGCTACGGCGCCGCCGGATCCTTCCGCGGCCTCCAGCGGGGGCGCGTTGCCCTCGGCGTCCGTACCTCGCGCGGCGCCGGTCGTGGTCTCGTCGGTCATGGTCGGGGTCTCCTTGGCTGCGGCCGGCGCGCCGGCCCGGGGGGATGGAAGGTGCGGATGGGACAGTTCGGCGCGGAAGGCGGCGAAGGCGCTGCGGAGATCGGAGACCTCGTCGGCGAGCCCCGCGGCCACCGCGGCGGCGCCGTCGAGGACCTGCGCCTCGGTGGCGAGCGCCTGCTGTCGGGTCAGCCGCGCGCCGCGGCCCGCCGCGACCGTCTCGGCGAAGAGCCCGCGCGAGGCCTCGACGCGGGCCTGCAGGTCGGCGCGCACCCCCTCGGGCAGGGCCTCGTAGGGATTGCCGTCGACCTTGTGCCGACCGGCATGGATCAGCGTGACCGCGACGCCGCGGTCGGCCATGGCCTGCGAGAAGTCGGCATGCACGAGGAGCACGCCGATGCTGCCCACGGCGCCGGTGCGCGGCAGCACGATCCGCTCGGCCTGGCTCGCAATCGCATAGGCCGCCGATAGGGCCGCCTCGGCCACGAAGGCGCGCACCGGCTTCACCGCCCGCGCGGCCCGGATCGTGTCGGCCAGATCGAAGAGGCCCGCCACCTCGCCGCCATAACTGTCGATCTCGAGCGCGATGCCGCGCACGGACGGATCGGCCACGGCCGCGGTGATCTGCGCCATCAGCCCCTCGTAGGAAGTCGTCCCCGAGGACTGGCCGATCCAGCCGCCGCGGTGAACGAGCGTGCCGGTGACTTCGATCACCGCGACGCCGCCCTCCACACGGTAGAGCGGCCGGCCCGCCTCCCGCCGGGCGTCGCCCGCATCGTCGAGGAGCACCGAGGCCCGCGGGGCGATCCGCCCGGCCTCGGCCACGCGGTCGGGCGCGAGCTCCTCGAGCCCGCGCAGCTCGAGCGCCCCGTTCACGAGCCGGGGCCCGAGGCCTGCGAGGAAGGCCGCGCCCTTCACGGGATCGACCAGAAGCGGCGTGCCGAACACCCGGCCCGCGATCATCGGATAGTTCAT
Protein sequences of DBSCAN-SWA_4 >NZ_CP051469|229781:236952|232198_232471_+|WP_017140411.1|DBSCAN-SWA MSDGIELEWNEDKRQATLADRGLDFADVALIDWDAALTLEDTRRPYPETRYITVAPIRDRLCVVAWCWRGDVLRVISLRKANAREERKYG >NZ_CP051469|229781:236952|231164_231971_-|WP_114071701.1|DBSCAN-SWA MKQVPPAAPVAPWLGGKKRLHPLILERIEAIPHRAYVEPFVGMGGIFLRRRFRPRLEVMNDRNGEIINLFRILQRHYPQLLEIMRFQICSRREFDRLRLTDPATLTDLERAARFLYLQRLSFGGKLDGVFGVSAGHGPRFSLARLEPVLDAAHERLDGVVFECLDWADLIPRYDTAETLFYLDPPYFGGENDYGKGLFDRAQFARIAEILGGLKGAFLLSINDTPEIRALFGRFHLEPVRLSYSVSASGGTEAQELLVSNRERIATLL >NZ_CP051469|229781:236952|235285_235597_-|WP_011339480.1|DBSCAN-SWA MQDGGQTRSPCSHPLPPAPQGGPARDVPSIFREEQVEAEDPEGRIVLVRAPTWRVRRDLVPELARRDRIRLADGRIYEVDEIWPPATPAADALTRCTLRKVAP >NZ_CP051469|229781:236952|234869_235289_-|WP_011339481.1|DBSCAN-SWA MTGRIRFRQIARAALAADPRMGGFSQISAWEARPDAGRLPLLMVVTPVERATQATLSNFERATVLQVGVKRLGRDDLEDLLDADADAVEGAICRAFQQAAIVCLPEEVTVTLNTEGEQAVGTLISSFRIVWRRPVPRPA >NZ_CP051469|229781:236952|235605_236952_-|WP_011339479.1|DBSCAN-SWA MNYPMIAGRVFGTPLLVDPVKGAAFLAGLGPRLVNGALELRGLEELAPDRVAEAGRIAPRASVLLDDAGDARREAGRPLYRVEGGVAVIEVTGTLVHRGGWIGQSSGTTSYEGLMAQITAAVADPSVRGIALEIDSYGGEVAGLFDLADTIRAARAVKPVRAFVAEAALSAAYAIASQAERIVLPRTGAVGSIGVLLVHADFSQAMADRGVAVTLIHAGRHKVDGNPYEALPEGVRADLQARVEASRGLFAETVAAGRGARLTRQQALATEAQVLDGAAAVAAGLADEVSDLRSAFAAFRAELSHPHLPSPRAGAPAAAKETPTMTDETTTGAARGTDAEGNAPPLEAAEGSGGAVAVANVAVAEAAELIEIGQQAVRLGLTVDVADAMRRGLSAAALRRTVLDGLAARGDGADLVAHAPTAAAGPKESPLLAAARRTAEAQAASRRA >NZ_CP051469|229781:236952|230300_231110_-|WP_011339489.1|DBSCAN-SWA MEKDVTTSDIQRLLSAAGLYRGAIEGDAGPLTQAAALAALEGEPVPWRAWPSRRQRIAAGQAVLARLGHAPGRIDGLLGPYTREALTAWASGPARAAVDRVPLPGHAVTDAQGAYPRQESVATFYGVAGGPDCTAGIVELPFPFRLAWDLNTGITSFRCHKLVAMPLTRIFREAVAHYGAVQFESLRLNLFGGCFNHRPMRGGSALSMHAWGIAVDLDPERNPLRWGRDRASFAAPAYEPFWTIVEAAGATSLGRACNRDWMHFQFARL >NZ_CP051469|229781:236952|232463_232685_+|WP_011339486.1|DBSCAN-SWA MAKAKPYIGDDDEVRELDDPFFANARRGRPPKPSEQKKVRMNLMIDPELASRLDGMPNKSAFVNEALRKALAS >NZ_CP051469|229781:236952|233518_233836_-|WP_017140410.1|DBSCAN-SWA MMRIRMVPLRRMYELTVFRVQGDTLTCNDMVYDFSGVEEGDVLPWDAMDNTWVTSNVTRVNGVLEFEVVFPHGYYGDNPLPNPGILEVEDQDIPIPPYLPPSAEG >NZ_CP051469|229781:236952|233188_233515_-|WP_011339484.1|DBSCAN-SWA MSRKRAAMTNATIDYSRLIKARDIKAQRDLIPAEISLLQAMIVIGEEKWGQAMTIADNASYPWAMRAAIRSATTLVRGSETMDTLAFLLGFSPEETDRLFIEAAQVTL >NZ_CP051469|229781:236952|229781_230288_-|WP_011339490.1|DBSCAN-SWA MSPIFTLAPQAPLVLMLALCALAFILLVGWPLVAAIARMRASLIAAAILATLAVPAAASTGSDLLTALTPSLLDLAGVALTALIGFATVRFQRWTGIQIEARHREALHSAIMTAARVAVARGLTREIAAEFVSSYVRASVPEAMKRLSPSVDTLDALVRSKLLEVAGN >NZ_CP051469|229781:236952|232737_233055_-|WP_011339485.1|DBSCAN-SWA MPEKGLIDTITALWGGAIATLIAAAMGRLMYHTGEVRARRRAFFGRELLWEIPALVAMAFIGEALSSYLNLDGRAAMGLVAMLAYLGPRGTTAMIERLWRGRSAG |
11 | EBPR_siphovirus(16.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
547158 : 558161
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NZ_CP051469|547158:558161|DBSCAN-SWA CTCAAAAGAGCTTCCCTCCGAGTGGCACGTCCTGGCCGGGTCCGATCAGCACCACCTCGCCCTCGGCATCCGGCACGCCCAGCACCAGCACCTCCGACAGGACCGGGCCGATCTGGCGTGGCGGGAAGTTGACCACCGCCAGCACCTGCCGGCCGACGAGGCTTTCGGGTTCGTAATGCCGGGTGATCTGGGCCGAGGAGCGTTTCTCGCCGATCCCGTCCCCGAAATCGACCCAGAGCTTCAGGGCGGGCTTGCGTGCCTCGGGGAAGGGTTCGGCGCGGGTGATGCGGCCCACGCGGATGTCGACCTTGGCGAAATCCTCGTAGGCGATCATCGGCCGAGCTCCCGTCCGCGGTCGGCCGCAGCCTTCACCGCGCGGCGGAGGAGCGGCGGCAGGCCGCTCTCGGGATCCATCAGCACGCCGAGCGCCGCCGCCGTCGTGCCCCCCGGCGAGGTCACGTTCACGCGCAGCTGGCTCGCGCTTTCCGGCGCGCGATGGGCAAGCTCGCCCGCGCCGGTGACGGTGGCGCGGGCGAGGGTCATGGCGAGATCGGCCGGCAGCCCCTCGGCCTCGCCCGCTGCGGCGAGCGCCTCGATCAGGTGGAAGACATAGGCCGGCCCCGAGCCCGAGACCGCGGTCACCGCATCCATCTGATGCTCGCCCTCGAGCCGGACCACCTGGCCCACCGCCGAGAGGAGATCCTCGGCCAGCGCCATCTGCGCCTCCGAGGCGGCAGCATTGCCCACCAGCGCCGTGATGCCGCGGCCGACCGCCGCGGGTGTGTTCGGCATGGCGCGGATCACCGGCGTGGCCGCGCCCAGCGTGCGTTCGTAATAGGAGATGGGCGTGCCCGCCGCGACCGTCAGGAAAAGCACCCGCCCTTCGCCGAGCGCCGCGAGCCCCGGCAGGGCCTCGGACATCATCTGCGGCTTGACCGCGACCAGCACCACCGCGGGCTCGTCGGGCAGCGCCTCGTTCAGATGCAGCCCGCGCGCCGCGAGCGATTTCAGCCAGTCCGACGGCCTGGGGTCCGTCACCCAGACGGCGGAGAGCGCAAGGCCCCGTACGAGCCAGCCCTCGAGCATCGCCGAGCCCATCTTCCCGCAGCCGAGCAGCACCAGCCCCCGGCGGCCAAGATCCGCAAGATCCATTGTTTCCCCCGTTTGTGCGGGGAAAGACTAGGCGCGGCCGTAGGCCTCCGCAATGGCGACCTTGATCGCCTCGGCCGGCGGGCTCTCGGCCCAGGCCACCATCTGGAAGGCGGGATAGAAGCGCTCGCTCGCGGTGACGGCCTCGGCGATGAGCTGGCCGATCTGCTCGGCGTTGACGCTCTGTCCGCCCGACAGCAGCAGGCCGTAGCGCCAGGACATCAGCTTCTGCTCGCCCCAGTAGGTGAAGGCCCCGGTCCAGACCAGATCGTTGCAGCGGTTGAGCAGGTCGTGCAGCTCAGGCATCCGGTGGGTGGGCGGCTCCATCTCGAAGGTGCAGAGGAGCCGCAGCATCTCGTCTGCGGGTGACCAGGCGAGGGTCAGCGAGTAGGTGCGCCACTGGCCCTCGACCGCCATCGCGATCTGGTCCTCGGTGATGCGGTCGAACTCCCACGCGTGATGTTCGGCCAGCGTCTCGACGATGTCGATCGGATGGATTTCCTCCGAAGAGAGATAGGTCTCGCTGAGCGACATGTGCCCGGGCCTCATCATTGAACCGCCTGCAGGGGATCACCCACAAGAAGCTGGGTGCGCGGCAAGGGACGGCGCTGCCTGCCTCCGTCCTCACTAGATATGGTGTTCGCAAAGCGGAAGACTGTAAAGGGGCCTGTGCAAACTCTTCTGTGGACAAAACGAATTGTAGCGGTTGTAGCGGGACACGAAGACCTGCAGTGCGGGACACGAGAACCTGCAGCAGAGCAGCTTCGTTCTGCCCCCCGACACTCTGATCTCAACGCGACCTGACGGCGCTGGCATCTTCGCCCATAGCCGCGCTGGCAACCTGCAGTCGTGCCGTAGCTGCAAGTCGTCGTCGCCATATGGTTGACGACCGAAGAATCGGAGTTCCAGCGGGCTCTGAAGTGGCGGTATAGGCTTGGCGGGTGAATGTCGTCTCCGGAGGCAAGAGTGACGCTCAGTGTGACCAATGATGACGAGCAAGATAGGATTGAGCAGATATTCTTTCATCATGCGCGATCTCGCTCCGCCGATCTGCGAGCGAGAGGAGCGCGCTTTGCTCATTATACAAGCGCTTACGCGGCTCTTCAGATAATCGAAAAAGAAGAAGTTTGGATGCGAAACGCGGTCGTGATGAACGACTTCTCCGAGATCCAGCATGGCCAAGATTGTGTAAATCATGCTTTGCAGGATCCGGTCATTGGCGGCGTGGCAGGGGGCAGGCTTCACACCATTCTGGAAGATGTAAAAGCGGGTCTTTCTGCTACCTTGCTTGAAGATTTAATGAATATTGCGGTACATCAGACGATAGGAAGTTATCTTGTTTCGATCAGTGAACACGGCAACGGTTCGACAGAGGAAGACCAGTACGGCCGCCTCTCTATGTGGCGAGCCTACGGCGGGAACACGAACGTTGCCTTCGTGTTTCGCACTACTCCTTTCTTTGCAAGAAGCACGGCGTACACGGCGTTCACCAGTCCCGTCCATTATTGCTGCCCAACTTCGTTCAAAGGAGAATTCGAGAGTTTTGTCTCAAGGCTGGAGCGGGAGCGCGCCTTCTTGTGCTCTTTGGGCGGAGAAGAGGTGCGCATGTACGTCACGAATGCATTGCACTTCGCCATCCTATCTCTAAAGCATCCAGGATTCAGGGAGGAGCGGGAGTGGCGTGTTATCCATTCGCCAAGTCTGATGCCTTCTCCACGAATAAGATACGATATCGAAACGATCAATGGCGTGCCGCAAAGAGTTTACAAGCTTCCCCTGATGAACTTTCCAGAAGAAGGTTTGATAGGGGCGACGCTGCCAGAATTGATCGAGGAAATCATTATCGGCCCAACGGCGACGCCTTGGCCAATATATGAGGCCTTGGCTTCAAAGCTGGAAGCGAAGGGGATGCAAGACGCATGGCGGCGAGTACGGATCTCCAACATCCCCCTGAGGCGCTAGAATTTCAGGCACCGGGTCCGTTGTGGGGACTATAACCGGCCGTTCCATGCCGTCTGTGGCACGTCCGGGAGTTGAAAGACCTTGTCGAGCTGCTGCCGAATGATCTATCGGCAGCAGCCTTGATTGGCCGCAGATCGTGACCCGAGACAAAAAATCTAAACTGAGGCCTAGGTGCTGGACATTTCGACGTATCGAGCGGCAGTCACTCGGAGGCGCTCAGCGTAGGCATAGATGTCATCCGTGGACGTAAGCGGAACGCGTTCCTCTTGCTCACCGTCGAATAGCCCGATGTACTTGACCGAACGATTGAAGTGGAGACGTGCCAGCGGACGGCGGTTGTTGTTGTCAACGAGGATCCCGCAGTAGCTCTTCTGATCACGCATATGTACGCGGCCTGGCTTGATCGTACCTGCGACGATTGCCTTGACCATCATGAAGCCCTCCATCTCCTCCTGCGTCGTGACGACGTCTGGTTCTGAAGGTTCCAATTCAGACGGTTCGGCTACCGGCTCGTTACTGGCGAGAGCCGACGAAAGCCGTTCATGTACTGCCTCGCGGATCATCTCTTTGAACGCAGATTTCAAGAGTCCGCGGAACTGATCAATCACGGCTGCCGTATATCGTCCTTCATGGAGCGGAGCGGTTACCAAGCGAATCAGTTCCTCGGGCGGATCCTCCATGATCCGATGGATGTGAGCCTTAATAGCTGACGTATATTTAAGCCGCTCTGCATTCGCGAGAATCGTGTCGACGTCAAAGCCGATTTTTTCGAACTTCTTCAACTCCGCCAGAACCTGAGCGTGCGGCTCGGCCAGATCGAACGTAAAGAAGGGTCGGCTATCCAGCTTGTTTGGTTCGTCGAGATCCGTGTGGAAGTGGAAGTAGCGGCCGTTCGTCAGGATGGCGAATTTCGCGCTGGTGACCGTGAAGTAGCGGAAGAGCTGTGCGAGGTGCACCTTGTCGAGGTTCGTGCTGATCGGTTTGCACTCGACAAGGATGCGGATTTCGTTCTCGAGCTTGATCGCGTAATCGACCTTCTCGCCTTTCTTTCCTACCGCGTCCGCCGTGAACTCCGGAACAACCTCGAAAGGGTTGAAGACGTCGTATCCGAGGCTCTGAAGAAACGGCAGGACGACTGCTGTCTTGACTGCCTCTTCCGTCATCATGTTCGCGCTGTGCTGCCTGACCCGCTCAGAGAGAACCTTGATCGTTTGCTCGATAGACATTGAACTCATTCCTCCACAGGTATCGGGCAGCGTGTTCGGCGGGATCTATGCCGGGTTGGCGCTTCGTTGCCCTTGGCAGTTTTGCGCCACGCCATCAAAGAAGACAGCATCACCTGGCTTGCTCAAGGAGCAAGGCAGGTGATGCGTCATGGCGCCGCAACCTGCGCGCAGAACACTTGAGGGCTGCCCTCCTCTCGATTGTGACAACCCGCCGCGTACGGTAATCGGTACCTTGCCCCAAGAGCACGAGATCTGGTTTGGGTCGCGGCGTCGGACGCGGGGCCTCATACCCTGCCAAGGTCGCCGCGGGCGGATCGCTCCGCCCACGGCACCTTCCTCAGCCCTCTTCGCCGATGTCCCAGTTCGAGACCAGCAGCTCCGGCCGCGCCGTGCCATTGCCCTTCGCGCTGCCGCTGATCGAATAGGTGGTCGTCACCTCAGTGATCCGGAACCTGCCGAAGATCGCGCGGATCTGGGGCACATCGTTGATCGAGAGGAGGAAGCGGCCCTGCAACTTGGCCAGCCGCTCGGCCATCAGCCCGAAGTCCTCGCGCTGGAACAGGCCGCGGCCGTAGTCGGTCTCGGATCCCCAGTACGGCGGGTCGAGATAGAAGAGCGTCTCGGGCCGGTCGTAGCGGGCGAGGAAGTCGGCCCAGTCCAGGCATTCGACGATCACGCCGGCGAGGCGGCTGTGCAGGTCCTCGAGCATGGGCTCCAGCGTGGTGAGGTTGAACCGCCCAGGCCGATGCGAGGAGACGCCGAAGTTCCGCCCGCTCACCTTGCCGCCGAAGGCGGTGCGCTGGAGGTAGAGGAACCGCGCCGCCCGTTCGAGGTCGGTCAGCGTGTCGGGATCGGTGGCGCCCAGACGCTCGAATTCGGCCCGCGTGGTCAGCTGGAAACGCAGCGTGTCGAGGAACTGCGGATAATGCCGTTGCAGAATGCGGAACAGGTTCGCCACGTCGCGGCCGCGGTCGTTGATGACCTCGGCCCGGGGGCGCGAGCTGCGGCGCAGGAAGATCCCGGCCATGCCGACGAAGGGCTCGGCATAGGTGAGGCAGGGGCTGCGGTCGAGGATGGCGCAGATCCGGCGGGCGAGGTTGCGCTTGCCGCCGATCCAGGGCGCCACAGGCTGCAGCGGGGTGACGGGCGAGAGTGGACTCTTCATCTGGTTCGTGTCTCTTCGGCGTGCCCGCAAGGGTGCGGGAGCGGCCATGATTCTTGTCTGGTCGGTGGGGGATCGATCGCCATCATGCCCCATGTCGGAGGGTGGTTGCACACCCTCTGGCCTCCCGGTTGACGCTCAGCCGGTCATGCCGTGCCCTTCCTGCCGCGGAACGAGGTCACGAGGCCGCCCGGGTCGAGCTGGTGGCTGACGCTTTCCAGATGCCAGTCGCCGTTCAGCTCGGGGCGCAGACCGGTCACGGTGACCGAGGCGCCGCCGAGGAGGCCGGGCTCGAAGCCCGCGAGCCGGGCCTCGAGCGTCATGGCCGCGCGGGCGGCGCCCGACAGCACCGCCTCGGCCGCCCGCAGCGCCTCGGCCTCGGTCGGATAGGCGTGGCGCAGCTTCTTCAGCGGCGTGCCGGAGCCCACCCGGACCCGGTGGGTGATGCCGCTGCCGGTCTCGGTCCATTCCGCCTCGGCCGCGCGATAGACGCTGCGGCCCTCGAAGGACCAGCTCCAGTCCGAGAGGCGCCAGGGCGGCAGGACCGGCGGGGTGAGCGGGTCGCCGGCGGCGGTCCGGCCCTCGCCGCGGCGCTGCACCACCAGCGCGCCGCCGGCGGGCTTGGCGGTGGCGTCGAGCGTTGCCGCGATCCGGGTCAGGAAGTGCAGGTTCGACTCGGCCGTCTGGGCGAGATAGTCCCAGGCCGTGCCCGCCACGCTCTCGCCCACCACCGGCCTGAGGCCGGCCTCGGCCGCGACGGTCGCCACGATGTCGCGCAGGCTCCGGTTGCGCCAGGCCCGGGTGCGGGGGCTGCGGATCTCGCCCTTCAGGTCGGCCGCGGTGGCGGTGATCCGCAGGGACTGCCGCGGCCCGCTGCCCGCCACGCCATCGACGGCGAAGACCCCCATCGCGGCCAGCGGCTGGCCGGCGAACCCCAGTGCCACCTCGATCCGCGCCTCGGTGTCGGGAAAGGCCAGCCGGCCGTCGCGGTCGTCGAGCTCGATCTCCAGCCGGTCGGCCTTCGTGCCGTCCTCGTCGGTGATCACGAGGCTCAGGAGCCGGTCGGCCACGGCACCGGTCGCGTCCTCGCCATCGACGGTCAGCCGGAAGGCCGGGATCATGTCCGGCCCCAGAGCCTGATCTGGCCGGTGGCCACCGGCTCGGCCACCTCGGGCAGGGTGATCAGCACGCCGGCCGGATAGACCGAGCCGAGCGCCGCCAGGTGCGGATTGGCCGCCAGCACCGCCGGTACATGCCGCTCGGACCCGAGCCGGGCCTTGCAGATCGCATCCAGCATGTCGCCGGCGAGGGTGCGGTAGATCCGGCTCATGCGCGGTCGCTGCCGTAGGCGCGGAGGCCGAGGGTGAACTCGATCTTCCGCGGCGCGCCGTCAGTGAGGAACAGGCTCTTCCGCTCCTCGACCGTGGCGATCACCCAGCGCTCCCAGACCCAGCCGAGCCCGTCGACGAGGACCATCGGCTGGCCGGTGCCGGCCACCAGCCGCATCAGCTCGACCTGGCGCAGGCCGCCGCGGAAGTGCGGATAGATCACGCCCTCGAGGGTGATCTCGTCGCTGCCCGGCCCGAGATACTGCAGCGCCGGCGCCCGGCCGAGCCGGTCCTGCGCCTCCCAGCGCCACGATGCCGATCGGCTGAAGCTCTGGTAGCCCGCCCGGTTCACGCCGAAGCGGAAGCTGCCGAGCGCCATCATCACGAGGCTGGCGCCGAGATCAATCATGCTCGCCCCCGTCGTGGAGCGCGAGGCCGCGCTCGCGGGCAACCGCCTGCAGCTCGCGCCGAACCGCACGGGCCACATCGGCCGGGCTCATGCCGGGGGCCGCGTGGATCGTGATGCCGCCGATCTCGAGCCGGGGTGCCCGAGCCGCCGCCGGGGCCGACCCGCCGGCCCGGCCACCGCCCGCCACCACCCGGATCTGCGCCGGCGCCGCAGCCATGGCCTCCAGAGCGCGGAGCTGGCGGTTCGAGATCACCTCGCCATCGGTGCGGGGCACGAAGAGTTCCTGCCCCTGTTCCTGCCAGCGGTAGATCTGGCCCGCCCGCACCGGCCCGCCGAGCGCGCGCTGCCCGGCCGTCATTGCGGCCCAGGGCTTCGGCGCCACCGCCGCCGGACCGGCCACCGCACCCCGCGCCGGGCGGTCGGAACCTTCGCCGCCACCACCTTGGCCACCCTCGCCGCCGCCGATGCCCTTCAGCCAGGCCGGCATCTCGGGGATCAGGCCCGAGAGCTGTTCCTGCACATAGGCGGTCAGCTGGCCCATGACGGACTTGATCCCCTCCCAGAGCGACAGGATCATGTCGACGCCCAACTGGAACAGGTCGATGTCGAAGGCGGCGGTCAGCGCGGCCGTGACGTCGGCGAAGTCCCAGCCGGTGAGATAGCGGAGCAGCCCCTCCGCCGCGTCCATCATCAGCACGAACGGGTTGAACGCGGCCAGCGAGGCCAGCACGCCGTTCAAGAGCCCCTGGTCGAAGGCCGCCCGCACCCGGTCGATCTTGCCCTGGAAGTAGGCGACGACCCCGTCCCAGTTCCGGTAGATCACGTAAGCCGCGCCGGCGATCGCGGCGATCACCAGCAGGATCGGGTTGGCCAGCGCCGCCCGGCCCACCACGAGCAGGGTCCGGCCGACGAGCTGCGAGGCCGTCACGAGGCTCCGGAGCGCGAGCGAGGCCACGCCGATCGCCGCCCGGGCGATGCCGTGCAGCGAGACCAGGAACAGCCGGGCCGCCGCCGTCGCCGTCCAGATCATCGCCCGCCCGGCAAGAAGCAGCGCCCGGCCGAGCCACTGGATGGCGGTGCCGGCCATGATCGCGCCGGCGCGCAGCAGCCGGAAGCCGGCCGCCAGCACTCCCCGCATCAGCAGCCCGAGCCGGGCGAGGATCGCAGCCGCCGCCACCGCATGGCCGGCCATCCCGGCGAGGCCGGATCCCACCAGCAGCAGGCCGCGGCCGAAGGTCAGGAAGCCGCCGATCAGGCTCAGGACGATCCGCCCCATCAGCAGCGAGGCCGCGAACCAGGCCAGCCGGTCCCAGCCGCCGAGCAGGTCTGCCGCCTGGGCGACGACCGGCACCAGAAGGTTCCAGGCCGCGACCGCCTGACCGGCGAAGGCCCACATGGCCTCGAGCGTCCTGAGGATCACCCGGGCGACCTCGTCGGCCCAGAGCTGCAACCGGCCGTCCTCGGCCATGGCGTTCAGCGTGCCGAGGATCTCCTGCAGCCGGCCCTTCAGGAAGTCGAACACCCCGGAGGCCATGACCATCCGCTGGAAGCGGCCCCAATGGTCGAGCACGTTCGACAGGATCCCGTCCCAGGTCTTGGCCATCCCCTCCGAGGCGCCCTTGTTCTTCTCGGCCATGGCCTCGATCAGCAGTTCGATCTCCTTCCGGCCGAGCTGACCCTTCGAGGCCATCTCCTGCAGCTCGGCCGCAGTCTTGCCCATCTTCGCGCCGAGGAGATCCCAGACCGGCACGCCGCGCTCCAGCATCTGCAGCGCCTCCTCGCCCTGCAGTTTGCCCTTGGTCCAGGCCTGCCCCAGCGCCAGCACCAGCCCGTCGAGCTGCTCGGCCCCGCCGCCGGTCGCGGCCATCGTGTCGACGAGCGCCTGCATCGCGCCCTTCGTCGGGTCGAGCCCGAAGGCCTTCAGCCGGGCATAGGCCGCAACCGTGTCATTCAGCTGCAGGGGCGTCCGGGTGGCGAAGTCCTCGATCCAGCGCAGGGCGCGGTCGGCGCCCTCGGCCGAGCCTTCGAGATTGGTCAGCTGCACCCGGAAGCGCTCGAACTCGGCCGCCGGGCGCAGGAAGGCCCCGGCGATCAGCGAGACGGTCCCGGCATAGCCCGCGAGCACCGCGCCCCCTCGGAGCGCCGCCCCGGTGACCGAGCCGAGGCCCTCGCCCATCAGAGCCGCGCCGCGCCCGACCGTCGCCGCCTGGCGCATCATGCCCTCGCCGCCCAGCCGCTCGACGGCCCGCATCGCCGCGCGGGCCGGGGCCGTTGCGCGGTCGACGAGGCGCAGGATGAGCTGGATGTTCAGGTCGGCCATCGCGATCGGTCAGCCTTCGCTCATGTCGGGGGGTGCCGACCGAGCCCGGGCCTTCGCCCACCAGCGGGCGAGCTCCTCCGGGCTCATCGGGTCCATGCTCTGGGGCGGCCAGTGGAAGACGGCGGCAATGTCGGCCATCGCTTCCTCGGGATCGTCCGGCAGCTCTAGTGGCGGAGAAGATCCGCCTCCGCCGTGGCCACCATCTCCGCCGTCGCAAAAAAACCCACGACCGCCGCGGAGAGGCCGAGGAAGTCGGCCGGATCGAGCGCTGCCACCTCGTCGGGCAGAAGCGCGGGCTCGGTGATGCGCGGCAGCAGCCGGCTGAGCGCGGTCACGTCCATCTGCAGGATGTCGGTGAGCTTCAGGCCGCGCAGACTGCCGACGTCAGGCTTGCGCACCGTGACGCTCGCGATGTCGCCCGAGGCTCGCTTCAGCGGCTGGAGAAGAGTGGTTTTGGACATGGTTTTAGGCCCCCTTAAATACCCATGGCGCGGCGCATCTCTGCCAGCTGGTCCTGGCCGCCGATCACCCGCTTCGCGTTCGGCAGGTCGATGTCGAGCAGCAACTCGCCGTCGATCTCCAGCCGGTAGCTCCGCACATCCATCGAGATCTTCAGCTTCGAGGGCGTGCCGGGCTTCAGATCGCCCGGCTCGCTCACCGTGATCAGCCCCGAAAGGGTGGCCACGATCGGCACGGCGCCGAAGTCCGACGGGTTGGCCATCGCCGGGCGGAACACGAAGCGCTCCTGACGGCCGAGCTTCTTGACGATGGCGGGCACCCATTCCGAGAAGGACATCTTGGAAGCGAGCGCCTCCATGCCGACGTCGATGCCGATCGGGCCGTCCATGCCGGCGCCGCGATGCGCCTCGGTCTGGATCTTGACCGCTGGCAGCTTGCCCTCGTCGACGATGCCGAAGTAGCTCACCCCGTCGACGAAGGCGTTGAAGTTCCTGATCGTGCGCGGAAGGGCCATGCCTTGTCTCCTTACCGGTTGCCGGTGACCGAGAGCACGAGCTCTTCGTAATAGTCGCCGTTGCGATGCGCCTGGAGCGTCAGGTGCTCCAGCGGCGCGGGCGGCTCGATGTCGAAGTCGAGAAAGAGCTTCCCGGCCTTCATCGTGGCCTCGGTGTTCAGCTCGGGATCGATCCAGGCCGGGCGGCGGCATCTAGATGAGCCTGCACCGCCCGGGCCACGCCGGCGAGGATCTCCTCGGGGCCGGGATCGGGGTCGGGCGCGGGGGCCTCCGCCCCGTCCTACACGCCGACGGGCAGCGGCTCATAGACCCATCCGGCCCCGGTCCAGCGGGCGCGCGAGCCTGCCGGCGGATCGGGATCCGGCGCCTCGATGCAGCCTGCGGGGATCAGGAAGACGCCGCGCTCGAGCGGGCTCTCGTCGGCCTCCTCGATCCCGACGAAGTGGCCGGTCTCGTTCAGTTGCATGACCTTCAT
Protein sequences of DBSCAN-SWA_5 >NZ_CP051469|547158:558161|553982_554393_-|WP_017140365.1|tail|DBSCAN-SWA MIDLGASLVMMALGSFRFGVNRAGYQSFSRSASWRWEAQDRLGRAPALQYLGPGSDEITLEGVIYPHFRGGLRQVELMRLVAGTGQPMVLVDGLGWVWERWVIATVEERKSLFLTDGAPRKIEFTLGLRAYGSDRA >NZ_CP051469|547158:558161|550435_551494_-|WP_017140367.1|DBSCAN-SWA MSIEQTIKVLSERVRQHSANMMTEEAVKTAVVLPFLQSLGYDVFNPFEVVPEFTADAVGKKGEKVDYAIKLENEIRILVECKPISTNLDKVHLAQLFRYFTVTSAKFAILTNGRYFHFHTDLDEPNKLDSRPFFTFDLAEPHAQVLAELKKFEKIGFDVDTILANAERLKYTSAIKAHIHRIMEDPPEELIRLVTAPLHEGRYTAAVIDQFRGLLKSAFKEMIREAVHERLSSALASNEPVAEPSELEPSEPDVVTTQEEMEGFMMVKAIVAGTIKPGRVHMRDQKSYCGILVDNNNRRPLARLHFNRSVKYIGLFDGEQEERVPLTSTDDIYAYAERLRVTAARYVEMSST >NZ_CP051469|547158:558161|547487_548309_-|WP_114071698.1|DBSCAN-SWA MDLADLGRRGLVLLGCGKMGSAMLEGWLVRGLALSAVWVTDPRPSDWLKSLAARGLHLNEALPDEPAVVLVAVKPQMMSEALPGLAALGEGRVLFLTVAAGTPISYYERTLGAATPVIRAMPNTPAAVGRGITALVGNAAASEAQMALAEDLLSAVGQVVRLEGEHQMDAVTAVSGSGPAYVFHLIEALAAAGEAEGLPADLAMTLARATVTGAGELAHRAPESASQLRVNVTSPGGTTAAALGVLMDPESGLPPLLRRAVKAAADRGRELGR >NZ_CP051469|547158:558161|554385_556713_-|WP_011339237.1|DBSCAN-SWA MADLNIQLILRLVDRATAPARAAMRAVERLGGEGMMRQAATVGRGAALMGEGLGSVTGAALRGGAVLAGYAGTVSLIAGAFLRPAAEFERFRVQLTNLEGSAEGADRALRWIEDFATRTPLQLNDTVAAYARLKAFGLDPTKGAMQALVDTMAATGGGAEQLDGLVLALGQAWTKGKLQGEEALQMLERGVPVWDLLGAKMGKTAAELQEMASKGQLGRKEIELLIEAMAEKNKGASEGMAKTWDGILSNVLDHWGRFQRMVMASGVFDFLKGRLQEILGTLNAMAEDGRLQLWADEVARVILRTLEAMWAFAGQAVAAWNLLVPVVAQAADLLGGWDRLAWFAASLLMGRIVLSLIGGFLTFGRGLLLVGSGLAGMAGHAVAAAAILARLGLLMRGVLAAGFRLLRAGAIMAGTAIQWLGRALLLAGRAMIWTATAAARLFLVSLHGIARAAIGVASLALRSLVTASQLVGRTLLVVGRAALANPILLVIAAIAGAAYVIYRNWDGVVAYFQGKIDRVRAAFDQGLLNGVLASLAAFNPFVLMMDAAEGLLRYLTGWDFADVTAALTAAFDIDLFQLGVDMILSLWEGIKSVMGQLTAYVQEQLSGLIPEMPAWLKGIGGGEGGQGGGGEGSDRPARGAVAGPAAVAPKPWAAMTAGQRALGGPVRAGQIYRWQEQGQELFVPRTDGEVISNRQLRALEAMAAAPAQIRVVAGGGRAGGSAPAAARAPRLEIGGITIHAAPGMSPADVARAVRRELQAVARERGLALHDGGEHD >NZ_CP051469|547158:558161|556877_557174_-|WP_011339236.1|tail|DBSCAN-SWA MSKTTLLQPLKRASGDIASVTVRKPDVGSLRGLKLTDILQMDVTALSRLLPRITEPALLPDEVAALDPADFLGLSAAVVGFFATAEMVATAEADLLRH >NZ_CP051469|547158:558161|549272_550268_+|WP_160384142.1|DBSCAN-SWA MTLSVTNDDEQDRIEQIFFHHARSRSADLRARGARFAHYTSAYAALQIIEKEEVWMRNAVVMNDFSEIQHGQDCVNHALQDPVIGGVAGGRLHTILEDVKAGLSATLLEDLMNIAVHQTIGSYLVSISEHGNGSTEEDQYGRLSMWRAYGGNTNVAFVFRTTPFFARSTAYTAFTSPVHYCCPTSFKGEFESFVSRLERERAFLCSLGGEEVRMYVTNALHFAILSLKHPGFREEREWRVIHSPSLMPSPRIRYDIETINGVPQRVYKLPLMNFPEEGLIGATLPELIEEIIIGPTATPWPIYEALASKLEAKGMQDAWRRVRISNIPLRR >NZ_CP051469|547158:558161|557188_557686_-|WP_011339235.1|tail|DBSCAN-SWA MALPRTIRNFNAFVDGVSYFGIVDEGKLPAVKIQTEAHRGAGMDGPIGIDVGMEALASKMSFSEWVPAIVKKLGRQERFVFRPAMANPSDFGAVPIVATLSGLITVSEPGDLKPGTPSKLKISMDVRSYRLEIDGELLLDIDLPNAKRVIGGQDQLAEMRRAMGI >NZ_CP051469|547158:558161|548336_548840_-|WP_017140369.1|DBSCAN-SWA MSLSETYLSSEEIHPIDIVETLAEHHAWEFDRITEDQIAMAVEGQWRTYSLTLAWSPADEMLRLLCTFEMEPPTHRMPELHDLLNRCNDLVWTGAFTYWGEQKLMSWRYGLLLSGGQSVNAEQIGQLIAEAVTASERFYPAFQMVAWAESPPAEAIKVAIAEAYGRA >NZ_CP051469|547158:558161|557966_558161_-|WP_011339233.1|DBSCAN-SWA MKVMQLNETGHFVGIEEADESPLERGVFLIPAGCIEAPDPDPPAGSRARWTGAGWVYEPLPVGV >NZ_CP051469|547158:558161|556722_556851_-|WP_017140364.1|tail|DBSCAN-SWA MADIAAVFHWPPQSMDPMSPEELARWWAKARARSAPPDMSEG >NZ_CP051469|547158:558161|551831_552659_-|WP_017140366.1|DBSCAN-SWA MKSPLSPVTPLQPVAPWIGGKRNLARRICAILDRSPCLTYAEPFVGMAGIFLRRSSRPRAEVINDRGRDVANLFRILQRHYPQFLDTLRFQLTTRAEFERLGATDPDTLTDLERAARFLYLQRTAFGGKVSGRNFGVSSHRPGRFNLTTLEPMLEDLHSRLAGVIVECLDWADFLARYDRPETLFYLDPPYWGSETDYGRGLFQREDFGLMAERLAKLQGRFLLSINDVPQIRAIFGRFRITEVTTTYSISGSAKGNGTARPELLVSNWDIGEEG >NZ_CP051469|547158:558161|547158_547491_-|WP_002724123.1|tRNA|DBSCAN-SWA MIAYEDFAKVDIRVGRITRAEPFPEARKPALKLWVDFGDGIGEKRSSAQITRHYEPESLVGRQVLAVVNFPPRQIGPVLSEVLVLGVPDAEGEVVLIGPGQDVPLGGKLF >NZ_CP051469|547158:558161|552802_553777_-|WP_011339240.1|tail|DBSCAN-SWA MIPAFRLTVDGEDATGAVADRLLSLVITDEDGTKADRLEIELDDRDGRLAFPDTEARIEVALGFAGQPLAAMGVFAVDGVAGSGPRQSLRITATAADLKGEIRSPRTRAWRNRSLRDIVATVAAEAGLRPVVGESVAGTAWDYLAQTAESNLHFLTRIAATLDATAKPAGGALVVQRRGEGRTAAGDPLTPPVLPPWRLSDWSWSFEGRSVYRAAEAEWTETGSGITHRVRVGSGTPLKKLRHAYPTEAEALRAAEAVLSGAARAAMTLEARLAGFEPGLLGGASVTVTGLRPELNGDWHLESVSHQLDPGGLVTSFRGRKGTA >NZ_CP051469|547158:558161|553773_553986_-|WP_011339239.1|tail|DBSCAN-SWA MSRIYRTLAGDMLDAICKARLGSERHVPAVLAANPHLAALGSVYPAGVLITLPEVAEPVATGQIRLWGRT |
14 | Vibrio_phage(30.0%) | tRNA,tail | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP051468_1 | 216860-216962 | Orphan |
NA
Consensus repeat of NZ_CP051468_1
|
1 spacers
spacers of NZ_CP051468_1
>1.1|216887|49|NZ_CP051468|CRISPRCasFinder GGTGGCCGCCGTGTCCCCGGGCTGCCCCGTCCGACTGTGCGGCGTGGTG |
CRISPR arrays and Neighbor proteins around NZ_CP051468_1
The CRISPR arrays of NZ_CP051468_1 >merge|NZ_CP051468|1|216860-216962|CRISPRCasFinder AAAGCTCCGCGCTCCGATCAGCGCGCGGGTGGCCGCCGTGTCCCCGGGCTGCCCCGTCCGACTGTGCGGCGTGGTGAAAGCTCCGCGCTCCGATCGGCGCGCG >NZ_CP051468|1|1|216860-216962|CRISPRCasFinder AAAGCTCCGCGCTCCGATCAGCGCGCG GGTGGCCGCCGTGTCCCCGGGCTGCCCCGTCCGACTGTGCGGCGTGGTG AAAGCTCCGCGCTCCGATCGGCGCGCG
>NZ_CP051468.1|WP_011338545.1|215487_216738_-|BtaA-family-protein MTQFALTHLPAPPVARQIGAAVHRTSLLSAEGLMERMFSRLFHGLVYPQIWEDPAVDMAALAIRPGDRLVAIASGGCNVLSYLTQGPGSILAVDLSPAHVALGRLKLAAARTLPDHAAFFDLFGRADLPGNAALYDRHIAPALDGRSRRYWEARSPFGRRIQLFERGFYRHGALGRFIGAAHTLARAAGTDLRGFLDCPDIEAQRSFFYAHIGPLFEAPVVQALARRPAALFGLGIPPAQYALLAGDGDGDVLPVLRQRLHRLLCDFPLRENYFAFQAIARRYPRPGEGALPPYLEPTAFETLRENAGRVQIENRSLTEALAAEPEESIHGFTLLDAQDWMTDAQLTALWRQVTRTAAPGARVIFRTGGAADLLPGRVPEEILGHWRADRAAGQAGHAADRSAIYGGFHLYRRRDA >NZ_CP051468.1|WP_011338546.1|214858_215491_-|methyltransferase-domain-containing-protein MTDATHAALMDATYRHQRRIYDVTRRHFLLGRDRLIAELDPPPGARVLEIACGTGRNLDLIGRRWPGCRLSGLDISQEMLASARARLGRRATLALGDATRFEALPLFGTDRFERIVLSYALSMIPDWREALREAALHLVPGGRLHVVDFGDQAGLPGWARAGLRGWIGRFHVTPRDDLGTALGETALGIGGYAEYRSLGGGYAILGTLTR >NZ_CP051468.1|WP_002721051.1|213499_214453_+|DMT-family-transporter MAKSNTRGALLALLAFAIYATHDVVVKVLGRDYTAVQILFFATLLGFPLASMMLMRDRADGNLRPRHPWWVLLRTCCIVTTGLCAFYAFSVLPLAQTYAILFTMPLLVTLLAIPLLGETIGLHRGAAIATGLLGVLIVLRPGAEPLSLGHAAALLAAVTSSLGSVIVRKIGQDERSVVLLLYPMVANFFVLGAALPFVYRPMPVEHLGMMGIIAAFSFVAMLLTITAYRMAEAVIVAPMQYSQMVWAAIFGWLIFGERSDVWTWVGAGVIIASGLYIVGRESRRNVSENRPASQTRLRPEVGLMPRLLRRRETRDTP >NZ_CP051468.1|WP_017139959.1|212025_213135_+|3-isopropylmalate-dehydrogenase MANPSLLILAGDGIGPEVMAEVKRIIGWFGEKRGVSFDVSEDLVGGAAYDAHGTPLADETMARAQEVDAVLLGAVGGPKYDVLDFSVKPERGLLRLRKEMDLYANLRPAQCFDALADFSSLKRDIVAGLDIMIVRELTSGVYFGEPRGIFPDNEGGRFGVNTQRYTTEEIRRVARSAFELARRRNNRVCSMEKANVMESGILWREEVQWVHDNEYPDVELSHMYADNGAMQLVRWPRQFDVIVTDNLFGDILSDCAAMLTGSLGMLPSASLGAPMANGRPKALYEPVHGSAPDIAGQGKANPIACILSFAMALRYSFDMGEEATRLEKAVETVLADGVRTADLMGPEGGTPVSTSGMGDAVLAALDASL >NZ_CP051468.1|WP_011338548.1|210999_211929_+|endonuclease/exonuclease/phosphatase-family-protein MDLERAGPGLLLRDIRAGKDPQILAVRRMIEALRADVLFLTGVDYDADLLAARALAGDLYPHLLALRPNVGRPTGIDLDGDGRLGGPGDAQGWGRFGGQGGMVLLSRLPLGLPRDFSDLLWADLPEANLPPMPPEARAVQRLASVGAWEVPLPRTGGPALKLLLWSATPPVFDGPEDRNGRRAGDEALFWARLLDGRLPFPAPEGPVLLMGKANIDPLAGDGPRAAIAALLAHPALQDPAPRGPDGSRATADFRADDGPGLLRTDYILPARGLPVTASGVLWPPPGDPLAAVAAQASRHRPVWVELDLP >NZ_CP051468.1|WP_002721054.1|210076_210919_+|hypothetical-protein MNVFSRLSGALLRASLVVLLLALPSVLLPGVGADGQQMVALAALFGGVLTLVEYNATYPGLVEFRDAPPFNRLRFALLLVMVLTLTLMQREASDPSALTALSARAAGAVGTAMDFAYSPVRLATLMLSDRATPEQTAVLRNAAGISYLIATAAVAMFVAVIRLNRWPSRNLAFNVWVNLPTFEATAGGDVVQRLLRDARFNLALGFLLPFLIPIAVKIGTSGLDPLTLASPQALVWITVAWAFLPASLLMRGVGMARIAEMIREKRRTGEVFGTRGLAPV >NZ_CP051468.1|WP_002721055.1|209238_209844_+|3-isopropylmalate-dehydratase-small-subunit MQEFTKVTGVAAPMPLVNIDTDMIIPKQFLKTIQRSGLGKNLFDEMRYNPDGSEIPEFVLNQPAYRDAQIIVAGDNFGCGSSREHAPWALLDFGIRCVISTSFADIFYNNCFKNGILPIVMPPEVVEVLMEDARRGANARMTVDLEAQTVTTSDGQSFPFQVDSFRRHCLMNGLDDIGLTLEKAASIDGFERDLATLRPWV >NZ_CP051468.1|WP_002721056.1|208710_209235_+|hypothetical-protein MKGTSSLIVSGILALLGGIAALAFPLPVSLAVTVLAGCVFVASGAFGLWAAFSDRGMPSRGAAAFFSLVSLVAGVWMLANPLAGMVSLTLMLGALFLVSGVVRLGLSLATWRGTVMFWLMALSGLISAGLGLFILLRLPEASLVLLGTLVAVELIVMGATLVAMGFALRKSGNP >NZ_CP051468.1|WP_011338550.1|207219_208647_+|3-isopropylmalate-dehydratase-large-subunit MTAPRTLYDKIWDDHVVHQSEDGTCLLYIDRHLVHEVTSPQAFEGLRMTGRKVRAPEKTIAVPDHNVPTTEGRDTKIDNEESRIQVEALDKNARDFGINYYPVSDIRQGIVHIVGPEQGWTLPGMTVVCGDSHTATHGAFGALAHGIGTSEVEHVLATQTLIQKKSKNMKVEITGSLRPGVTAKDITLSVIGLTGTAGGTGYVIEYCGQAIRELSMEGRMTVCNMAIEGGARAGLIAPDEKTFAYVMGRPHAPKGAAWEAALAYWKTLFTDEGAQFDKVVTIRGEDIAPVVTWGTSPEDVLPITATVPAPEDFTGGKVEAARRSLEYMGLTPGQKLTDIKIDTVFIGSCTNGRIEDLRAAAEILKGKKVAPGMRAMVVPGSGLVRAQAEEEGLAQIFIDAGFEWRLAGCSMCLAMNPDQLSPGERCASTSNRNFEGRQGRNGRTHLVSPGMAAAAAITGHLTDVRDLMMAPAEPA >NZ_CP051468.1|WP_017139961.1|206433_206973_-|heme-binding-protein MNGAVDHKGYEQPTYDLEFAETATEIRRYGPYLVAEVTMAGDRSTAITRGFRVLARYIFGGNAESRRIEMTVPVSQLPAGEDLWTVRFTMPAVRSASLLPAPKDSRIRFVTVPPSRQAVRRFSGWPTDHALRRQAEGLAHWIAERGLPKREGPYFYFYDSPMTLPWQRRNEVAFGLGEG >NZ_CP051468.1|WP_011338544.1|217112_217757_+|hypothetical-protein MSGTQSETGRQLAPGLLSLGVGLAALAGLWGGPLPDLARVSFVWHMMLHLGVILGAAPLIALGLARLAPLRTAWPVLFAGAASLLELAVVWGWHAPRLHEAAALDPTLFRIQQATFLGAGVLVWLPGLAPGRAAAGAGLLAMLFSLMHMTMLGVLLTLSPRLLYAPEICGTAFGLAPLDTQRLGGAMMAVAGLGPYLVGAAVFTMRLTREGPED >NZ_CP051468.1|WP_011338543.1|217761_218673_-|ornithine-cyclodeaminase MSIEIIGPEAEAHLSWDGLTTALTQGHKLPRAEVADIFLYRGKDTVLSRAAWIDGLGQLVKTATIFPGNGAAGKPTVNGAVTLYSDRDGTLEALVDFHLVTKWKTAGDSLLAAKRLARRDSREILLVGAGNVARSMIEAYGSHFPDARFTVWNRSADKARAMEGPQVRATEDLEAAVRAADIICTATMSQEPVVKGEWLKPGTHLDLIGAYRPDMREVDDEALRRATVFVDSRKTTIGHIGEIQDPLDRGILTEADIRADYYDLASGLYARRSDEEITLAKNGGGAHLDLMTASYIAAMWRQR >NZ_CP051468.1|WP_011338542.1|218669_219158_-|YaiI/YqxD-family-protein MTDLYIDADACPVKAEAERVAVRHGVRMFLVSNGGIRPPAHPLVESIFVPEGPDVADMWIADRARTGDVVVTSDIPLAAKVVAAGALVVKPNGETLTQANIGNALATRDLMADLRSADPFRQGGGRPFSKADRSRFLDALERAMRKAQEAGRSASGGNEAGS >NZ_CP051468.1|WP_002721045.1|219150_219981_-|S-formylglutathione-hydrolase MKTLSESRCFGGTQGVYSHTSQVTGCDMTFGLFLPPEAENEAVPLVWYLSGLTCTHENAMVKAGAQKWAAQEGLALVFPDTSPRGEGVPDDENYALGQGAGFYVNATEAPWSTNFRMWDYITEDLPRVLFSAFPLDESRQSIMGHSMGGHGALTIAMSFPGRFRSVSAFAPITHPTASDWGRKQLTAYLGTDESKWAPHDSVLLMRKRGFDGPILIDQGASDQFLDALKPEALAEAMMARRQQGIIRMQPGYDHSYFFVSTFMEDHIQFHAEALYD >NZ_CP051468.1|WP_002721044.1|219996_220926_-|AEC-family-transporter MLPVLLETLPFFALIGTGYMAGRMGMFTPEATAWLTKFVFYFALSAMLFRFAANLSLAEIWSLPFVAAYLAGSGAVYLLATLVARMRGVGTEVAAMEAQCAVIGNTGFLGVPMLVLLLGAGAAGPVLMMLSIDLIFFSSLITLIITGKREGHMSLRVVKVLALGLLRNPMIVSMVAGLGWSATGAAVPEPVNEFLALLGGAATPGALFAIGASLAGKSAERLEIAAWLSFCKLVLHPLSVAVAALAIFPVERQAAGVMIAAAALPVAGNVYILAQHYGVAPQRVSASILISTAVSILTITGVIAWVTTF >NZ_CP051468.1|WP_011338541.1|221284_222262_-|membrane-protein MFSSFSSRRRFRDLSDAEILALAISSEEDDARIYRSYADRLRAPYPGTAALFEAMAEEEDSHRRRLIGTFRDRFGETIPLIRREHVEGYPSRKPVWLLETLPLERVRREAWEMENAARSFYETAARHVTDASTRKLLGDLAAAEAAHEHRAERLAEEHLTDEATAAEDETARRQFILTWVQPGLAGLMDGSVSTLAPIFATAFATQDPATTFQVALAASVGAGISMGFTEAAHDDGVLSGRGAPWKRGLASGVMTTAGGLGHALPYLIPDFAWATGIAIAVVFFELWAIAWIQNRYMETPFLRAALQVVLGGGLVLAAGILIGFG >NZ_CP051468.1|WP_011338540.1|222607_223231_+|TetR/AcrR-family-transcriptional-regulator MPAETPPARKGRKMPQVLEGARTIFLRDGFEGASVDDIARAAGVSKATLYSYFPDKRLLFLEVAKAECLRQSEEAVALITADLAPRAVLTLAATRIVAFVLSDFGIRTYRICVAEADRFPELGHEFYESGPALVRQRIVDYLAQAVGRGELAIDDLELAADQFAELCKADLFNRIVFGVGNSVSEAERQKVAQGAVEMFLARYAPRP >NZ_CP051468.1|WP_002721037.1|223261_224050_-|exodeoxyribonuclease-III MTFTLATWNINSVRLREALVSRLLTEEMPDILCLQECKSPVEKMPLEAFRALGYHWCVARGQKSYNGVAILSKLPLVDAGDHDFADLGHARHVSAALENGVTIHNCYVPAGGDIPDREQNVKFGQKLDYLTRMRDWCRADTPKKAILVGDLNIAPREDDVWNHKSLLKIVSHTPIEVDHLNALMEAGAWIDVTRKDLPEGRLYSWWSYRSPDWEAADKGRRLDHIWATPDIVNAAHGSRILKAARGWLQPSDHAPVFATFDL >NZ_CP051468.1|WP_002721035.1|224398_225085_-|response-regulator-transcription-factor MASLKKILLVDDDDDLREALSEQLVMTEDFDVFEAASGAEGMEKAKAGLYDLVILDVGLPDTDGRELCRRMRKAGVKCPVLMLTGHDSDSDTILGLDAGANDYVTKPFKFPVLLARIRAQLRAHEQSEDAVFQLGPYSFRPAQKMLVDEKERKIRLTEKETNILKFLYRASQGVVAREVLLHEVWGYNAGVTTHTLETHIYRLRQKIEPDPSNARLLVTESGGYRLVA >NZ_CP051468.1|WP_011338539.1|225237_226326_+|GTP-cyclohydrolase-II MSLSLGIVERLNRARGDLRMGVPVVLTEEGAGAVAVAVEALEPGRLADLRLLGEPVLAITARRAETLKARVYDDDLARIELPDAAGIEWLRAVADPADDLRLPMKGPLRALRGGGAGLARAALALAKSAHLLPAAVLVPVADPLRLAAQHALTVLPLEEARAELSRGSPLHPVVSARVPMVASQAGRVHVFRPEDGGEEHYAIEIGRPDRGLPVLARLHSACFTGDVLGSLKCDCGPQLRSALARMGEEGAGVLLYLNQEGRGIGLANKMRAYSLQDQGFDTVEANHRLGFEDDERDFRIGAGILRQMGFAAVRLLTNNPAKIRMMEANGIRVTERVPLHVGRNEFNAAYLATKAAKSGHLA |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP051468_2 | 3139425-3139621 | Orphan |
NA
Consensus repeat of NZ_CP051468_2
|
2 spacers
spacers of NZ_CP051468_2
>2.1|3139465|68|NZ_CP051468|PILER-CR CGCCACCGTCCAGCGCGTCGTTGCCGGCCCCGCCCAGAAGGGCATCGTTCCCGCTGCCCCCCTCCATC >2.2|3139573|33|NZ_CP051468|PILER-CR GCAAGAAGGTCGTCTCCGGCGCCGCCGAACAGA |
CRISPR arrays and Neighbor proteins around NZ_CP051468_2
The CRISPR arrays of NZ_CP051468_2 >merge|NZ_CP051468|2|3139425-3139621|PILER-CR AGATCGTCGCCCGCGTCGCCCGAGAGCGTGTCGTCTCCCTCGCCACCGTCCAGCGCGTCGTTGCCGGCCCCGCCCAGAAGGGCATCGTTCCCGCTGCCCCCCTCCATCGTGTCGTCGCCCGCGCCGCCTGCAAGAAGGTCGTCTCCGGCGCCGCCGAACAGAAGATCCTGCCCCGCCCCGCCCGAGAGCGTGTCGTT >NZ_CP051468|2|1|3139425-3139621|PILER-CR AGATCGTCGCCCGCGTCGCCCGAGAGCGTGTCGTCTCCCT CGCCACCGTCCAGCGCGTCGTTGCCGGCCCCGCCCAGAAGGGCATCGTTCCCGCTGCCCCCCTCCATC GTGTCGTCGCCCGCGCCGCCTGCAAGAAGGTCGTCTCCGG CGCCGCCGAACAGAAGATCCTGCCCCGCCCCGC CCGAGAGCGTGTCGTT
>NZ_CP051468.1|WP_017139984.1|3136915_3138961_-|hypothetical-protein MNMRPDAFGEARAPAIACPPRPLDELRIVPLSTGLAAIYLDAPPAGGRRPRLAVTLDGRPVPSPSLSAALPLERGGLRHLVLFTQDMPELLARDLLLTLDGVPRAEARSEWLQPPLRPLPALVEGLAASGRTRFLKMLLTTAASLFSGHGAELAALAQRMLDLCAVPVARPLGAAAFGAGHAIRSHAWAEPPAPGAPVVALGRQIGRLAGCDARSDGGLLHLHLPPAARAEGDLVIFTPDPVRLPAPNKAAAPMTAWLNGRSPAIRAWALGRVRSAAAGDPAAQALLAEMQRPDLPVRLAVRSLCVAPGGLLYAFRLEDPERLVRTLRLEIAGRHLDLDLHRGPAGTGLCAGFAPLEAEAAPCRIQMVLGSGRITTMATVQPAAFDGSEPDGFAEVWQAWHRISGPPEVNPARVLAAIRSARAPRRIPVLVQRFGSPRTGRIALIAPAGAHPDLISARAAAVGAGGWQAEILLTLPEGPAAARMRDAAAAAAALHDLPHRLVLHDARAHPAEVLRAAAALIEGPLLLLGATVLPATPGWLEDWLRHLEETPALVTAGAVLDHEGAVWEGPLDLGAEGLLRPLDGLPARRLAGTSTSFPSPELAGLSRAGVMRLLASATGHADLAPALAELAAADRAAGTASPFRVSHPFRRFGGTAHPTGFAEALRGEELRRQALPAAEGL >NZ_CP051468.1|WP_011338728.1|3135674_3136916_-|glycosyltransferase-family-4-protein MALRILQIAHDHPDWTSGGTEIVAHDLHRALSRRGLDSRFLAAATSLQRPEAMAGSLGHLGEDMVLRTGGYDRFLMKRHDGLRWLESVRRLLTDAAPDVVHLHGLDRIGAEILPALRRFAPAARIVMTLHDYQIICPNEGLLLTRPDGSRCRGARPDSCRACFPELGADRHALRKTYLATLLQGVDCFLAPSRFLKDRFLDWGLPATKLQLMPNAVAPLALMQDKPRVRRDRFAFFGTLALHKGVLTLLEAAAQLKAEGAELRLAFHGGLRHPEPAFRTAFETALAAARPLAQHPGPYDRAALASRMADVDWVVVPSLWWENAPLVILEAQAAGRPVIASGIGGMAEMVKDSETGLLVPPGDAAALAEALRTAAGDADLWARLAAAQPRTSHERFVDAHLDLYRTLLEGRHAA >NZ_CP051468.1|WP_011338729.1|3133497_3135678_-|hypothetical-protein MTPADGSFLLPGSATVAEVRLSLVLPGGRSLLGLTSTEDLPRAGHAHWSGGQAAWQAEVWAHHAGFAGLALVEAALCERATLRGADGQAARIETPRRIEIAPEALAEFVREAGLMPREVLAFLLRALLADPALPPAEARAHRAFAAGFLQAAALPDGFIEILATPETDGLFAQGWAMSLGTGRSRIAKLDGDLSFCEAEIAAFARNDILPPGQGIALYCRDWRGQDHEAVQALIFDTEGEFRRLELVRGSVQHLSGSVASAHVARMLPRLSGPESSRRALRRICRPRYPGSDTLSATTLPIAAAFEAIYQAPDGGLLAMGWLLDPLSRVERVILKSDANLYAPLQDRWHAMPRPDLHEGFARDPRFANLLDPRDTMHGFVAYAPASRAKVEGAEVYLELVLDDNSCLFRPIAITPLAGRERLPALLAPISPEDPALGPLVERVLAPFLAGLPATPPRAIARRRATVPLGAEAPPRDIAAVMPFRTLAQLQPVFALLSGTPEAAALDLTLVSTSRAAQGLAERLDEAFRFYGLSGRLLLVPESESLLARMESGIAVSRGSHLLLWEPSALPSSSNWLARLLREAQWAQPPGLVSPRLVYEDGSICFGGGEEHGAGAQTICPQLGYPAAWLAQGRPRRTGFGAAEIALVGRAALEAAGGLTGRLFGDRMAHRDLADRLHATGSGTWCSGSVTFWALQEPSDRNDSFTTLMAKVDGALIAARARERLGP >NZ_CP051468.1|WP_011338730.1|3132223_3133501_-|glycosyltransferase-family-4-protein MKALVLSHAHPAFSIGGAQVASHNLFTGLKSLPGWQAHYMAGVGPPVARHDATPLMSLGQAPDETLFWTGDYDWFHLGTTDLDGLMRHFERFLSDLRPDVVNFHHVMGFGIQAIASVRRALGPVPIVMTLHEYLPICAHHGQMIKARSHSLCLRASASDCGLCFPELGAAAMKRRELFIKSFFDRVDAFVSPSRFLLTRFEDWGLPRQKLVMIENGLEGGPVAPPRPLAEGRPRNRFAYFGQLNPFKGIKVLVEAVTRIPDRIWGDAILYIFGGNLEHQPEEFQTQVRNLFRMAGRRLRFMGSYKSADLPHLMREVDWTIVPSTWWENAPVVIQEAFHHGRPIIASDIGGMAEKVRDGIDGLHFRVSNPESLAETMMRALRDPALWDRLRAGIEPPTDAASAAREHARLFERLLTRQGTAGANHG >NZ_CP051468.1|WP_011338731.1|3130509_3132231_-|type-I-secretion-system-permease/ATPase MADPAEPAGLYRLILREAALGAGAAAFVGFFVNLLHLALPLYNAQVYDRVIGSANLDTLVALTGLVAILLVFGAVLDVLRARIFAILAARLAGQLGQPVFVAAVETALRGGPTAAAEGMRAVADLRSFLAGGAIALPIDLTFTPVLLLVLFALDPAYGLIGLGGAAALAGTGIVTEILARRPVASALAAGGSVQAETATALRSAEVIAAMGMLPDIARRWRRAQARSLAAADRGQARAKALSSLARFLRTAIQIAVICTGTTLVVDQAATIGTIVAAMAIMSRLLLPFEHLIDGWRQWMDAAEAHGHLRRLLREGGSPRSSLPAPVGRALLVADRVTYIPAGQDLPLLRNVSFRIEPGELLGVIGPSGAGKSTLARLTVGLWPPSAGGLFLDGQSTFLHERTSFGRAVGYLPQEPLLLDGSVRDNIARFRDAPMDEVIAAARQAGVHELIGRLPRGYGTRLADAGARLSGGQRQRIALARAIFGDPQLLVLDEPNASLDAEGEAALVSAVEQARARGAAVLVVAQRMSVLAKADRLLVLREGAVTHYGPRAEVMAAIGPQRRPAPVPVAQACS >NZ_CP051468.1|WP_011338732.1|3129211_3130513_-|HlyD-family-type-I-secretion-periplasmic-adaptor-subunit MSAAGSWDDPAPPSYRPIFQAALVLILSLTVSLGSWAVYARLDGAVITQGVVLAESRRKTVENLEGGLLERLLVAPGDRVAAGQPVAQLATVQDRERLIQLEAEREGLLFDIWRLRTEAAGARVLDPATAPGPDAERTAAQLRLFQSRLHAHRDRIGSLDRQIELLIAQGAANDAQARAADLQIESWRAERADLVTLVDRGASPRQKLVELDRNITTLTGTREQYLALARAARADVERARMDRSTAGQLRLAEVAEQLAAAGRQLPGLEAQIRATRDVLERRTLRAPQAGLVVEVPTVTPGAVIGSGTAVMEIVPDLDHLVIQMRVPPEAIDNVRKGGLARVRLTAYRRALAPVVRGQVSFVSPDLIEDPRDGTTYFEARVTLDPESLAEQPDWVRLSAGMPVEVSVSTGERRAGDYLLEPILRHLRGALHDP >NZ_CP051468.1|WP_002721525.1|3128335_3129142_-|CatB-related-O-acetyltransferase MAQIPFDLKHADFARLAEVGLRLPGLREVKTLTQTSWIEAPTTIIGTVVSGAELQVGAFCSLSGGTLNNVHIGRYSSIAAGTIIGVHEHPTSWLTTSRTSYWPQVYGWDELIAPDRAAEIRAGKRPFTRSCPITEIGHDVWIGQGCFIKSGVRIGHGSVIGARATVTKDVPPYSIVLGTPGRVVRTRVPEPLIERLLAVEWWRYSIYDLFAAPFDDVARALDVIEDRIAEGAIKPYEAPLVTPADLAEPLGLAARLAARFPRPLARAS >NZ_CP051468.1|WP_011338733.1|3127744_3128332_-|hypothetical-protein MTLADPVPDKAKEAVSPLYGVVDVLRRDRIAGWVIDRRDPGASVTVEIRREGQRIASVPANRRRRDLEANRFGSGLYGFGAAIDPPLMEGMEFTVSVRAMAADGTEAILQPLSRVLEPSGEQRALARLLEEVAACRALLEAQQSLAERVERAQLRMETLWPEPDPVPPTGWGLRLLSLAALALALLSLTLSVMSL >NZ_CP051468.1|WP_023003735.1|3124946_3127733_+|error-prone-DNA-polymerase MPFVELSATSNFTFLTGASHPEELMRRAASLGLPALAIADENSVAGIVRAYGEAKEIARETGSAPRLIPAARIVVREGMSVTVLPRDRAAWGRLCRLLTLGRRAAPKGSCDLGFADLLAHAEGQEMLLHPPARSSGGWQTLAARLAAACPGRVALILAPHYAGQDRVRFERQALLAARLGIPTVASGLPILHHGARRRLADVLTAIRLGCRVDELGRRALVNGEQRLRSEAEMLRLYPGHEEAVFRSAEIAGRLDFCLSELRYEYPSEISKGESPADRLARLTREGLVWRFPAGVPEKIAAQARHELELIGKLGYEPYFLTVNDIVRFARDRGILCQGRGSAANSIVCYALGVTSVDPEVGTMVFERFVSEARNEPPDIDVDFEHERREEVIQHIYSRYGRDRAGLCATVIHYRGKRAVREVGRAMGLSEDTLTAMASQIWGWGGPGALTERRLREIGLDPNDRRLWLALQLIDEIQGFPRHLSQHVGGFVITEGRLDELVPIENAAMEGRTVICWDKDDIDMLGILKVDVLALGMLTCIRKAFDLIESHHHTSYTLATLPRDDSQTYDMLCRADSLGVFQVESRAQMNFLPRMRPRCFYDLVVQVAIIRPGPIQGDMVHPYIRRRNGQEDPHLPSDALGKVLGKTLGVPLFQEQAMQIAIIGANFSPEEADRLRRALATFKKHGNVTELKERFMNGMLANGYDKDFATRCFSQIEGFGSYGFPESHAASFALLVYASAWIKCHHPGIFACALLNAQPMGFYAPAQIVRDARAHGVIVLPPCINASLWDNVMERDREGRLALRLGFRQIKGVAEEEAQWIVAARGNGYRAVEDIWRRAGVPPATVTRLAEADCFAALGLTRREAQWQARALSGDRPLPLFAGDMDGEGIVEPEVVLREMTLGEEVVEDYVSFRLSLRSHPMALLRELV >NZ_CP051468.1|WP_011338735.1|3123612_3124866_+|lytic-murein-transglycosylase MRTFLLGLGLALAAAPGMADPVTVSQRPAARAEAAVTEAERAAQAGLEAWVIQFRPRALAAGIRPATFAAAFRDLRYDADAVARDRNQAEFNRTLWDYLDSAVSETRIAGGRAALVQHGALLGRIEARYGVPKEIVVAVWGMETSYGTRRGDHPLVGALATLAHDGRRARFFEEQLIAALKILDNGDVRPEAMTGSWAGAMGHTQFMPTSYLDFAVDFTGDGRRDIWSDDPSDALASTAAYLAKSGWRRGEPWGVEVRLPAGFDFAQSGKSLRQPAARWEALGVRRVDGSPLPAGEAALLLPAGAQGAAFLIYPNFRAIARYNPADAYVIGVGHLADRLKGAPAIAGGWPRGDRALASAERTELQRLLTAAGFDTGGADGMIGPNTLSALRAWQAARGLVADGYANEAVLNRLREGA >NZ_CP051468.1|WP_011338725.1|3140083_3141397_+|calcium-binding-protein MELAVTTTTLPWAEPPGWLGSFFATGLRFESALIAPPWGALAMGSCGRDLLIASPFGSSLPGGPEGDVLVGLWGNDQLYGNGGNDLVRGGGGADILTGGTGNDLLAGGAGNDSLHGGTGANLLIGGAGDDLLDSRWSTDPDRGGLSFETWRAAKIGTGDEMHGGDGNDRIEASFGNDTLTGDAGDDVIRGQDGDDQLEGGEGQDTLMGDAGADVLSGGEGDDSLDGGLGDDRLDGGEGSDRLIGGNGNDVMDGGAARDVLFGGYGDDQITAGEGDDWVDGGGGDDLIDLGAGDDFVAAIEDLGRGDDVIFGGEGNDTMLGGRGCDQIDGGEGADVLNGGAGDDFLTGGAGADAFQMRSAHGIDTIADFSSEDVLTITRNINGIGGADGLVSVEELTGRIEDLGSDCFLDLGGGNGAMFLGITADELGGLLPAHLEYL >NZ_CP051468.1|WP_002721512.1|3141719_3143870_-|polyribonucleotide-nucleotidyltransferase MFNVTKKSIEWGGETLTLETGKVARQADGSVIATLGETSVMANVTFAKAAKPGQDFFPLTVHYQERYYAAGKVPGGFFKREARPSEKETLTSRLIDRPIRPLFVDGFKNEVLLIVTVLSHDLVNEPDIVAMIAASAALTISGVPFMGPIGAARVGFAGGEYVLNPDVDDMQKLRENPEQRLDLVIAGTKDAVMMVESEAYELSEAEMLGAVKFGHEAMQPVIDMIIDFAEEAAHEPFDFSPPDYAALYAKVKSLGETQMRAAFAIREKQDRVNAIDAARAAIKAQLSEAELADENLGTAFKKLESSILRGDIINGGARIDGRDTKTVRPIISETSVLPRTHGSALFTRGETQALVVTTLGTGEDEQIIDALHGNSRSNFLLHYNFPPYSVGEVGRFGPPGRREIGHGKLAWRALQAVLPAATDFPYTIRVVSEITESNGSSSMASVCGGSLSMMDAGVPLKAPVAGVAMGLILEDDGKWAVLTDILGDEDHLGDMDFKVAGTENGITSLQMDIKVAGITPEIMEQALAQAKDGRMHILGEMSKALSSANSFSAYAPKIETLTIPTDKIREVIGSGGKVIREIVETSGAKVDINDDGVIKIASNDQAAIKKAYDMIWSIVAEPEEGQIYTGKVVKLVDFGAFVNFFGKRDGLVHVSQIANKRLTHPNEVLKEGQEVKVKLLGFDERGKVRLGMKMVDQETGQEIQPEKKEKEEAGEA >NZ_CP051468.1|WP_002721510.1|3144050_3144320_-|30S-ribosomal-protein-S15 MSITVEEKARLIKEYATKEGDTGSPEVQVAVLSSRIATLTEHFKAHKKDNHSRRGLLMMVAQRRKLLDYLKKKDEGRYTALIARLGLRR >NZ_CP051468.1|WP_002721508.1|3144560_3145073_-|DUF1643-domain-containing-protein MITRSFVKGDAPSTAVYSDCERYRYLLTRVWDPEGTKALFVMLNPSTATEFQNDPTVERCERRARTLGFGAFRVCNIFAWRDTDPKKMRAAPDPVGNPENDEAIAHSAPWADRIVCAWGAHGAFLDRGRQVEALLRATGLPLHHLGLNRDGQPKHPLYIGYDQQPILWDA >NZ_CP051468.1|WP_011338724.1|3145471_3146548_-|pyridoxal-phosphate-dependent-enzyme MDAQKIRTTEGRGRLYDSVLDTVGNTPVIRINNLSPEGVTIYVKAEFFNPAASVKDRLALNIIEAAERSGKLKPGMTVVEATSGNTGIGLAMVCAQKGYPLVITMSEAFSVERRRLMRLLGAKVVLTPRGGKGFGMYRKAQELAEANGWFLASQFETDANADIHEATTAREIVADFAGERLDWFVTGYGTGGTVTGVARVLRRERPEVKIVLSEPANAQLVASGVPQDRNADGTAASGHPAFEAHPIQGWTPDFIPKVLQEGLDAGAYDELIPVAGEDGMKWARELAAKEGILTGVSGGSTFAVARQVAERAPKGSVILAMLPDTGERYMSTPLFQAIGEDMNEEEKALSASTPSFQL >NZ_CP051468.1|WP_011338723.1|3146768_3147674_-|tRNA-pseudouridine(55)-synthase-TruB MGRTRKGRAISGWLVVDKPAGMTSTAVVNKVRWALEAQKAGHAGTLDPDATGVLAVALGEATKTVPYITDALKCYRFMVRLGLSTRTDDASGEVIATSEARPTDAEIEAALAAFRGEIQQVPPQFSAVKVEGERAYDLARDGERLDLAARPLWVESLEILSRPDADHVELEMVCGKGGYVRSIARDLGEALGCHGHVAWLRRTWSGPFEAEDGISVATIDELARSEALLSHVLPLAKGLADLPELPATPEGAARLRCGNPGMVIASDVEFGEEAWASFQGQPVAVGIYKSGELHPSRVFNL >NZ_CP051468.1|WP_011338722.1|3147741_3148479_-|hypothetical-protein MRTRLLAILLAVWPAACATAEPACRDLTFEDTRYSLCEAQAGDDIRIFQTAPDGRPYGSFERINSALEGEGRQLAFAMNAGMYHADRRPVGLLIEEEVERAPLVTSAGPGNFGLLPNGVFCVGDGFRVIESRAFAAERPTCRHASQSGPMLVIGGELHPRFLVDSDSRYIRNGVGVSADGRRAVFAISNRPVTFHEFGRLFRDELGLPEALYFDGSISRLYDRGARRSDWGTPMGPIVGLVVPKP >NZ_CP051468.1|WP_011338721.1|3148468_3148891_-|30S-ribosome-binding-factor-RbfA MGPMAHRSHTGTGPSQRQLRVGELIRRTLADVLNRGEIHDPELNRLSITVGEVRCSPDLKVATVHVMPLGGKDVEEAISLLSKHRGELRHHITRQMTLKYAPDLRFRPDETFDRLDETRRLFSDKTVMRDIRGGGEADED >NZ_CP051468.1|WP_011338720.1|3149128_3149938_+|4-hydroxy-tetrahydrodipicolinate-reductase MSDLPGIVVTGASGRMGQMLMKTVLASGKARLVGAVERPGSDWVGRDAGAAMGGAAVGVTVTDDPLAAFAQAQAVIDFTAPEATVQFAELAAQARAVHVIGTTGLEPAHLERLAWAAHHAVIVRAGNMSLGVNLLTRLTQKVAEALDEDWDIEVVEAHHRMKVDAPSGTALMLGEAAARGRGVDLAQARVSGRDGITGPRAPGSIGFSAIRGGDIVGEHDVIFAAAGERITLRHVATDRAIFARGALKAALWGQDRRPGQYDMMDVLGL >NZ_CP051468.1|WP_043764254.1|3150150_3150330_-|DUF1674-domain-containing-protein MTEETRKDLPPEALRALAEAEERRRRAKALDLPKEIGGRNGPEPVRFGDWEKKGIAIDF |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NZ_CP012917 | Azospirillum brasilense strain Sp 7 plasmid ABSP7_p3, complete sequence | 95476-95508 | 5 | 0.848 |
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NZ_CP032343 | Azospirillum brasilense strain MTCC4038 plasmid p4, complete sequence | 632715-632747 | 5 | 0.848 |
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NC_013860 | Azospirillum sp. B510 plasmid pAB510f, complete sequence | 12280-12312 | 6 | 0.818 |
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NZ_CP051473 | Rhodobacter sphaeroides strain CH10 plasmid pRspCH10DE, complete sequence | 56659-56691 | 7 | 0.788 |
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NZ_CP030276 | Rhodobacter sphaeroides 2.4.1 plasmid pDE, complete sequence | 56666-56698 | 7 | 0.788 |
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NZ_CP015289 | Rhodobacter sphaeroides strain MBTLJ-20 plasmid a, complete sequence | 143791-143823 | 7 | 0.788 |
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NZ_CP015212 | Rhodobacter sphaeroides strain MBTLJ-13 plasmid a, complete sequence | 55970-56002 | 7 | 0.788 |
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NC_013855 | Azospirillum sp. B510 plasmid pAB510a, complete sequence | 295562-295594 | 7 | 0.788 |
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NZ_CP022424 | Vitreoscilla filiformis strain ATCC 15551 plasmid pVF1, complete sequence | 129769-129801 | 7 | 0.788 |
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NZ_CP054617 | Azospirillum oryzae strain KACC 14407 plasmid unnamed3, complete sequence | 259353-259385 | 7 | 0.788 |
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NZ_CP007794 | Azospirillum brasilense strain Az39 plasmid AbAZ39_p1, complete sequence | 74495-74527 | 8 | 0.758 |
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NZ_CP007795 | Azospirillum brasilense strain Az39 plasmid AbAZ39_p2, complete sequence | 574872-574904 | 8 | 0.758 |
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NZ_CP030864 | Streptomyces globosus strain LZH-48 plasmid unnamed2, complete sequence | 386509-386541 | 8 | 0.758 |
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NZ_CP032686 | Rhizobium sp. CCGE531 plasmid pRCCGE531c, complete sequence | 650861-650893 | 8 | 0.758 |
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NZ_CP032323 | Azospirillum brasilense strain MTCC4035 plasmid p2, complete sequence | 530049-530081 | 8 | 0.758 |
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NZ_CP032326 | Azospirillum brasilense strain MTCC4035 plasmid p5, complete sequence | 187783-187815 | 8 | 0.758 |
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NZ_CP032691 | Rhizobium sp. CCGE532 plasmid pRCCGE532c, complete sequence | 642177-642209 | 8 | 0.758 |
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NZ_CP032350 | Azospirillum brasilense strain MTCC4039 plasmid p5, complete sequence | 28970-29002 | 8 | 0.758 |
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NZ_CP012917 | Azospirillum brasilense strain Sp 7 plasmid ABSP7_p3, complete sequence | 85846-85878 | 9 | 0.727 |
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NZ_CP032343 | Azospirillum brasilense strain MTCC4038 plasmid p4, complete sequence | 7565-7597 | 9 | 0.727 |
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NZ_CP032343 | Azospirillum brasilense strain MTCC4038 plasmid p4, complete sequence | 642345-642377 | 9 | 0.727 |
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NZ_CP029357 | Azospirillum sp. CFH 70021 plasmid unnamed2 | 226471-226503 | 9 | 0.727 |
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NZ_CP029357 | Azospirillum sp. CFH 70021 plasmid unnamed2 | 226777-226809 | 9 | 0.727 |
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NC_013855 | Azospirillum sp. B510 plasmid pAB510a, complete sequence | 283055-283087 | 9 | 0.727 |
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NZ_CP032341 | Azospirillum brasilense strain MTCC4038 plasmid p2, complete sequence | 71875-71907 | 9 | 0.727 |
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NZ_CP032326 | Azospirillum brasilense strain MTCC4035 plasmid p5, complete sequence | 179960-179992 | 9 | 0.727 |
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NZ_CP015094 | Pelagibaca abyssi strain JLT2014 plasmid pPABY6, complete sequence | 49820-49852 | 9 | 0.727 |
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NZ_CP022366 | Azospirillum sp. TSH58 plasmid TSH58_p04, complete sequence | 376760-376792 | 9 | 0.727 |
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NZ_CP012916 | Azospirillum brasilense strain Sp 7 plasmid ABSP7_p2, complete sequence | 168166-168198 | 9 | 0.727 |
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NZ_CP029357 | Azospirillum sp. CFH 70021 plasmid unnamed2 | 227056-227088 | 10 | 0.697 |
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NZ_CP030074 | Streptomyces sp. ZFG47 plasmid unnamed1, complete sequence | 274972-275004 | 10 | 0.697 |
NZ_CP051468_2 | 2.2|3139573|33|NZ_CP051468|PILER-CR | 3139573-3139605 | 33 | NZ_CP022366 | Azospirillum sp. TSH58 plasmid TSH58_p04, complete sequence | 376253-376285 | 10 | 0.697 |
1. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NZ_CP012917 (Azospirillum brasilense strain Sp 7 plasmid ABSP7_p3, complete sequence) position: , mismatch: 5, identity: 0.848
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer tccagaaggtcgtcgccggcgccgccggacagg Protospacer * *********** ************.****.
2. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NZ_CP032343 (Azospirillum brasilense strain MTCC4038 plasmid p4, complete sequence) position: , mismatch: 5, identity: 0.848
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer tccagaaggtcgtcgccggcgccgccggacagg Protospacer * *********** ************.****.
3. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NC_013860 (Azospirillum sp. B510 plasmid pAB510f, complete sequence) position: , mismatch: 6, identity: 0.818
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer cgcagcaggtcgtccccggcgccgccgaacagc Protospacer ** ********.*****************
4. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NZ_CP051473 (Rhodobacter sphaeroides strain CH10 plasmid pRspCH10DE, complete sequence) position: , mismatch: 7, identity: 0.788
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer tccagaaggtcgtctccgccgcctccgaaaatg Protospacer * *************** **** ***** * .
5. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NZ_CP030276 (Rhodobacter sphaeroides 2.4.1 plasmid pDE, complete sequence) position: , mismatch: 7, identity: 0.788
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer tccagaaggtcgtctccgccgcctccgaaaatg Protospacer * *************** **** ***** * .
6. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NZ_CP015289 (Rhodobacter sphaeroides strain MBTLJ-20 plasmid a, complete sequence) position: , mismatch: 7, identity: 0.788
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer tccagaaggtcgtctccgccgcctccgaaaatg Protospacer * *************** **** ***** * .
7. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NZ_CP015212 (Rhodobacter sphaeroides strain MBTLJ-13 plasmid a, complete sequence) position: , mismatch: 7, identity: 0.788
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer tccagaaggtcgtctccgccgcctccgaaaatg Protospacer * *************** **** ***** * .
8. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NC_013855 (Azospirillum sp. B510 plasmid pAB510a, complete sequence) position: , mismatch: 7, identity: 0.788
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer tccagctggtcgtcgccggcgccgccgatcagg Protospacer * ** ******* ************* ***.
9. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NZ_CP022424 (Vitreoscilla filiformis strain ATCC 15551 plasmid pVF1, complete sequence) position: , mismatch: 7, identity: 0.788
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer tccagcacatcatcgccggcgccgccgaacaga Protospacer * ** * .**.** ******************
10. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NZ_CP054617 (Azospirillum oryzae strain KACC 14407 plasmid unnamed3, complete sequence) position: , mismatch: 7, identity: 0.788
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer gcaagatggtcgtcaccggcgccgacccgcagc Protospacer ****** ******* ********* * .***
11. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NZ_CP007794 (Azospirillum brasilense strain Az39 plasmid AbAZ39_p1, complete sequence) position: , mismatch: 8, identity: 0.758
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer tacagcaggtcgtcgccggcgccgccgacgagc Protospacer ** ******** ************* **
12. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NZ_CP007795 (Azospirillum brasilense strain Az39 plasmid AbAZ39_p2, complete sequence) position: , mismatch: 8, identity: 0.758
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer accatgacgtcgtcgccggctccgccgaacagc Protospacer .* * .* ****** ***** ***********
13. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NZ_CP030864 (Streptomyces globosus strain LZH-48 plasmid unnamed2, complete sequence) position: , mismatch: 8, identity: 0.758
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer accacgaggtcggctccggcgcggccgaactgc Protospacer .* * .****** ********* ******* *
14. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NZ_CP032686 (Rhizobium sp. CCGE531 plasmid pRCCGE531c, complete sequence) position: , mismatch: 8, identity: 0.758
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer atcaggacatcgtttccggcgccgccgaagaga Protospacer .. **.* .****.*************** ***
15. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NZ_CP032323 (Azospirillum brasilense strain MTCC4035 plasmid p2, complete sequence) position: , mismatch: 8, identity: 0.758
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer accatgacgtcgtcgccggctccgccgaacagc Protospacer .* * .* ****** ***** ***********
16. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NZ_CP032326 (Azospirillum brasilense strain MTCC4035 plasmid p5, complete sequence) position: , mismatch: 8, identity: 0.758
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer tccagcacgtcgtctccggcgccgccgtcgagg Protospacer * ** * ******************* **.
17. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NZ_CP032691 (Rhizobium sp. CCGE532 plasmid pRCCGE532c, complete sequence) position: , mismatch: 8, identity: 0.758
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer atcaggacatcgtttccggcgccgccgaagaga Protospacer .. **.* .****.*************** ***
18. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NZ_CP032350 (Azospirillum brasilense strain MTCC4039 plasmid p5, complete sequence) position: , mismatch: 8, identity: 0.758
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer acgaagatgtcgttgccggcgccgccgaacagc Protospacer .*.*..* *****. *****************
19. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NZ_CP012917 (Azospirillum brasilense strain Sp 7 plasmid ABSP7_p3, complete sequence) position: , mismatch: 9, identity: 0.727
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer cacaacacgtcgtcgccggcaccgccgaacagc Protospacer *. * ****** *****.***********
20. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NZ_CP032343 (Azospirillum brasilense strain MTCC4038 plasmid p4, complete sequence) position: , mismatch: 9, identity: 0.727
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer cacaacacgtcgtcgccggcaccgccgaacagc Protospacer *. * ****** *****.***********
21. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NZ_CP032343 (Azospirillum brasilense strain MTCC4038 plasmid p4, complete sequence) position: , mismatch: 9, identity: 0.727
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer cacaacacgtcgtcgccggcaccgccgaacagc Protospacer *. * ****** *****.***********
22. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NZ_CP029357 (Azospirillum sp. CFH 70021 plasmid unnamed2) position: , mismatch: 9, identity: 0.727
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer atgtagacgtcgtcgccggcgccgccgatcaga Protospacer ... ..* ****** ************* ****
23. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NZ_CP029357 (Azospirillum sp. CFH 70021 plasmid unnamed2) position: , mismatch: 9, identity: 0.727
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer atgtagacgtcgtcgccggcgccgccgatcaga Protospacer ... ..* ****** ************* ****
24. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NC_013855 (Azospirillum sp. B510 plasmid pAB510a, complete sequence) position: , mismatch: 9, identity: 0.727
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer atcagcgtgtcgttgccggcgccgccgaacagg Protospacer .. ** . *****. *****************.
25. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NZ_CP032341 (Azospirillum brasilense strain MTCC4038 plasmid p2, complete sequence) position: , mismatch: 9, identity: 0.727
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer aacagcgtgtcgtctccgtcgccgccgaagagg Protospacer . ** . ********** ********** **.
26. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NZ_CP032326 (Azospirillum brasilense strain MTCC4035 plasmid p5, complete sequence) position: , mismatch: 9, identity: 0.727
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer accaggaggtcgtcgccggcgccgccgttgatc Protospacer .* **.******** ************ *
27. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NZ_CP015094 (Pelagibaca abyssi strain JLT2014 plasmid pPABY6, complete sequence) position: , mismatch: 9, identity: 0.727
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer aacagcgtgtcgtcgccggcgccgccgatcagc Protospacer . ** . ****** ************* ***
28. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NZ_CP022366 (Azospirillum sp. TSH58 plasmid TSH58_p04, complete sequence) position: , mismatch: 9, identity: 0.727
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer cgcagcgtgtcgtttccggcgccgccggacagc Protospacer ** . *****.*************.****
29. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NZ_CP012916 (Azospirillum brasilense strain Sp 7 plasmid ABSP7_p2, complete sequence) position: , mismatch: 9, identity: 0.727
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer aacagcgtgtcgtctccgtcgccgccgaagagg Protospacer . ** . ********** ********** **.
30. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NZ_CP029357 (Azospirillum sp. CFH 70021 plasmid unnamed2) position: , mismatch: 10, identity: 0.697
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer atgtagacgtcgtcaccggcgccgccgaccagc Protospacer ... ..* ****** ************* ***
31. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NZ_CP030074 (Streptomyces sp. ZFG47 plasmid unnamed1, complete sequence) position: , mismatch: 10, identity: 0.697
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer aagggcaggtcgcgtccggcgccgccgaacccg Protospacer . ..* ******. **************** .
32. spacer 2.2|3139573|33|NZ_CP051468|PILER-CR matches to NZ_CP022366 (Azospirillum sp. TSH58 plasmid TSH58_p04, complete sequence) position: , mismatch: 10, identity: 0.697
gcaagaaggtcgtctccggcgccgccgaacaga CRISPR spacer agcagaaggtcgttgccggcgccgccggcgatg Protospacer . **********. ************. * .
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
891106 : 910009
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP051468|891106:910009|DBSCAN-SWA CATGAAGTTGAGTGCGGCGGGCGTCGAGAAGATCAAGCCGACAGACAAGCGACAGGAGATCCCGGACAGCCTCTGCGTCGGGCTCTATCTCATCGTCCAGCCGACCGGAAAGAAGGGCTGGCAGGTCCGTTATCGGCATGGCGGGGTGCATCGGCGCATGTCGCTCGGCGCCTTCCCGGTGCTGTCCCTCGCGGGCGCCCGCGAGCGGGCGCGGCAGACCCTGGCCGCGGCGAGCGCCGGCACCGATCCGGCGGCCGAGGTCAAGGCCGCGAAGGCTCCGAAGCTCGACGGCGATCGCGATAAGATCAAGACCCTGATCGGCCAGTTCGACAAGCGGCACCTGTCCAAGCTCAAGAGCGGGGCGGTCGTGCGCCGTGAGCTGGAGCGGCACGTCGTTTCGGCGTGGGGCGAGCGCGACGTTCACCAGATCGGAAAGCGCGACGTGATCGACCTTCTCGACGGGATCGCGGACAGCGGGCGCGTCGTCACAGCGAACCGCGTCCGGGCCTATCTCGCCAAGTTCTTCGGATGGTGCGTCGAGCGGGACATTCTGGCGACGAGCCCGGCCGCCGGCGTGAAGCCTGCCGGTAAGGAGACGAGCCGTGCCCGCGTGCTGACCGACGACGAGATCCGCTGGTTCTGGACGGCGTGCGAGGCTGAGGGCTTCCCGTGGGGGCCGTTCGGCAAGGTGCTGCTCCTCACCGGCCAGCGGCTGAACGAGGTGGCGCAGATGACGGACAGCGAGATCCGCGGCGACCTGTGGCACCTGACAGCCGACCGGACAAAGAACGGCCGGGCACACGATGTGCCGCTGACAGGGGCCGTGCGCGCCGCGCTCGACGAAGTGGAGCGCGTGACGGACGAGGAGGGCAGGGTGCGCTTCATCTTCACCACGACCGGCCGGACACCGGTCAGCGGCTTTTTCAAGGCCCGCGCTGTGCTGGCCGAGGCGATGGTGGGCATCGCGCAGAAGGAGCGGGGCGAGCCCGTCGAGATCCCGCGCTGGACCTTCCATGATCTGCGCAGGACGGCGGCGACCGGCATGGCCCGGCTCGGCATCCCCGTGCGGGTGACAGAGGCCGTGCTGAACCATGTGAGCGGGACAGGGGGCGGCATCGTCGCGGTCTACCAACGGCACGACTATGCCGACGAGAAGCGGCAGGCGCTGGAGGCTTGGGCGCGGTTCGTTCTGTCGCTGGTGGAGGGGGAGGCGGACAACGTGGTGAGGCTCCAGCATGGCTAAGGAGGTCTGGTCTGATGCTATGCACCGCACGGAGAAAGAGGTCGTCTCTGCTGAGAAATGGGAGACGCTGCTTGCAGAGGCTGCAGCAGATTGGCCTACATCGAGCAAATACCTGCCGGAGCTGAAACAGAGGCTCGCCGAGATTATTGTTCAGGATGGGTTTCCAGACCCGGGTGCCGCGATCTTCCACAAGGGGCGCTTTTGGTGCAAACGCCTAGAAGGCGAGAGGCCCGTGGGTAAGGCAATCTCTCCCGGCTGGGCTTTCACACGAGCGCAAGCCGAGCCGCTAACGCTGCCACATGAAGCTGCTGACCTCTATTTGAAGGTGATCGAGGTGGAGCGAGCGCTCGCCGCCGGGAATCTGGAGAGTGTCTTTTCCCATAGTCTGCTTCTGGGGCTGGCGCACAGTCGGTTCAACGCGCGCAGACTCCACCTTAAGCACGTGACGCGTGGAAAGAAGGTTGCGGGAGGACAGCTAAACTCCGCGCACAAGACCAATCAGCGGCATATTCCGATGAGGACCCTTCGGTTCGCGCGGATGGCGGAACTGGTCCCCAAGCTTGGCGTCGAGAATGCGGCACGGCAATGCGAAGCGGAGGGAATGGGAGGTTGGCAGGGCATCAAGAAACAGTGGGACCGCTTCAAGTAAATTGGGACACCTAGGCCGCTGTCCCGCGGAATATCTGAACATTGGGAATCCTAACGTCACCGAACGTGAGGCACCCCAATGGAACGCAAGCTTATCTCCGCGGCCGCCGTGCGCGCCCTCTGTGGCGGCATCAGCGACATGTCCCTGTGGCGCTGGTTGAACAACCCCGCCATGGCATTCCCGAAACCCGTCACCATTCAGCGCCGCCGCTACTGGCGCGAGGCCGAGGTGCTCGCGTGGCTCGACGCGCGGGCCGAAGCGCGCGAGGTGGCCTGATGTTCGCCGTTTACAGCATAGACGAGCTACTGGCCCGCAAGGCCAAAGGTCACTTCCGCGTCGAAACCGTGGCCGGCCGCTGCGTCATTTCCGTGCATCGCCCTGGCGAACCGGATGAGACGGTGTTTTGCCTCTCGGCCGGGCACGCAAATCAGGTGCGGCAGTCGCTGACCGACGAAGGGCTGACCGGCTATTTTGAGGGGGCGCGGTGATGCAGGGTCCGCACTTCTGCCCCCTCTGCGACTTCTACGGCAAGCCGTGGCAGGAGGTTCCGTCTAGCGCGTGGTGCGGGACGATGCTTGAGGCGCGCCGCTGGGCCAAAGGCGCTCACGAGGAATACCGCAAGGCCGTCTGCGTGGCCGCGTTCGAGGCGATGAAGCCTGAGCACCAGTCGGCTTTCCTCGCCTATCTCCGGCAGCGGAGGGCAGCGGCATGACGAGGCACTACGACCCGGACGCCTGGCGCAGGGAGGTGGAGGCCGAATGCGGAGCCGAGTTCGGGTCCGACGCCGCCGGAACCGGCCACGAAGAACCGCCGCGATCCGGCCTGCTGTCGCCGACGACCTGGCGCCGCAACGTTCCGCCCCCGCGTCAGTGGCTCTACGGCTACCACCTGCAACGTCGGATGCTGTCGGGCACCATCAGCCCCGGCGGCCTCGGCAAATCATCGCTGGCGCTGGTGGATGCGCTCGCCATGGCGACCGGCCGGAAGCGGTTGCATAGCTGGGTGCATTCCCGCGAGGGCCTGCGCGTCTGGTATTGGTGCGGCGAAGATCCGGTTGAGGAAATCAGCCGGCGCCTTGAAGCGGCCTGCATCCATTACGGCATCAGCGACGACGAGATCGGCGGGCGCCTGCATGTGGATAGCGGGCGCGATCAGCCGATCAAGATCGCGGCGGCAGACCGTGAGGGCGTGAAGGTGGCAAAACCCGTGGTGGCTGCGCTTGTGGCGCAGATGCGAGCCCGCCGGATCGACGCCTTGATTATCGACCCGTTCATCACCTGCCACACGGTTCCCGAGAACGACAACACCGCAATGAATGCCGTGATCGACGCATGGCGGGACGTGGCGAACCAGACCAACGCCGCAATTGAGTTGATCCATCACGCGAACAAGGCCGGGACGGGAGACAGCATCAACGATGGCCGGGGCGGCTCTGCCTTTCGCGACGGTATCAGGGCGGGGCGCGTGCTGGCACCCATGACCTTCGAGGAAGGCGAGAAGATGGGGCTTGCGCCCGCAGAAGCCGCGCGGATCTTCCGGGCGATGGACGGCGCCAAGACCAACATGTCCGCCCGCGGGGGCGCGTTCACCTGGTATCGGATGCAGTCTGTCCCTCTGGGCAACGCTACACCCGAGTATCCGCGGGGCGACGAGGTGGGTGTTTGCACGGCATGGACGCCGCCGGATGCGTTCGAGGGCGTCAAGCTGGACGACCTGCGGCGGGTGCAGGAAGCCATTGCTGCGCGGGCGACCCCGCCGGCCAAGGCGCCCACCAGCAAGGACTGGGCCGGCTATGTCGTAGCCGACGCGCTGGGCTTGGACGTTGGCGCCCCCGGCACGACGAAGGCGCAGCAAGGCCCTGACCAGATTGCCGCCCGCAGTCGGGTGTCGAGCCTCATCAACACATGGCTCAGGAACGGCGCGCTTCGGGTGGAGACCGTCCGTTCGCTGCGGGACGGGCGCGACGTTCCGTCGATCACGGTGGGAATGCCGGCTGAGGACCATAACGCGGCCTATCCGCAGTCTCCGCACTCGACTGCGGAATGAGTGCGGTGAGAGCGGAATATGCATGTGGGGCGCGTGGTATGGGGGCGTAAGCCCCACCACCGCACCGCAGTCATATACTTAAGGGCCTGACTGCGGTGCGGATAGACAGCGTTATAAAGTGTTAGGATTCTCAGTTAGTTGCGGCTGATCTAGTATTGGCGCATGAGCAATCCCGACCTTCTCTCTGCAACTAAGGCGCTTGCGCGGATCGACCTCAGCGACGAGGACGCAACGCTGGAGCTGATGATTGCGGCTGCACAGGCTGACGTGCTGGCGGCTGCAGGCTACACCCTGGCCGAGGGTGCATCCCTGCCGAGCGACCTGGCCTATGCGATCTGCGATCAGGCGGCCATGCTGTTCGACGCACGCGGTGGCACAGAGGATAGGCCCATGGGCCTGTCCCTTGCCGCGTCACGCATCGTCTCGCGATACCGCGGGGTGCGGGTATGCCTGCCGACGAGCGAGTGAAGCGCCCGCGCGGCCGACCGCGCAAGGATGGCCTGCCCCCCGGATCTGGTCCCAATGCCCCGGCTCGGGGAGGGGCGGATACCGTGGGTCGGGGTGTTCCTCTCTCGCTTCTGGAAAAAATCCGGGGGAAAGACCCGGCCTCGAGGGCCATTGCCTTCCTGAAACAGCTCCGAGTGCCCGAGGGGAAGAAGGCCGGGAAGCCTCTCAAGCTGGCCGAATTTCAGTGCCGGTTCGTCCGTGGCTCCATGGCGCCGGGCGTGATGGTGGGCGTGCTTTCAATCGGACGCGGCAACGCCAAGACCGCGCTCTCTGCCGGCCTCGCCCTGGCCGAGCTGGTGGGCGCGCTGCATCCCAAGCCCCAACCCAAGCGCGAGATCCTGTTTGCCGCCCGGAACCGGGACCAGGCAAAGACCGCATTTCAGTTCCTCGTGGGCTATATCGACGGCCTCCCGGAGGAGGATCGGGCAGGCTTCACCATCCGGCGCGGCTCAAAGCTCGAGGTGGAGTTCGAGGGCAACGGCGGCGGGCTGGCGCGCGTCATCGCGGCCGATGGCAAGTCGATCCTGGGCGGCGCTCCGACGCTGGCCCTCATGGACGAGCGCGCGGCGTGGGAACGGGAGAAGGGCGACAACCTCGAGAACGCCATTCTCTCGGGCCTTGGCAAACGCGACGGCCGCGCCCTCATCATTTCCACCAGCGCGCCCGACGACGCCAACACCTTCTCCCGGTGGCTCGATGAGCCCCCGCCGGGGACGTATGTCCAAGAGCATCGGCCGGCCTTCGGGCTTCCTGCAGACGATCTGCCGTCCCTTCTGGAGGCCAATCCGGGCGCGGCCGAGGGCATCGGTTCCACGCCAGAATGGCTGGTGGCGCAGGCCCGGCGCGCGATTGCGCGGGGTGGTTCGGCCCTCTCCAGCTTCCGCAACCTGAACCGGAACGAGCGTGTATCGACCGAGGATCGGTCCGTTCTGGTGACGGTGGACGAGTGGCTTTCGGCCGAGGTGGCGCCCGACGAGCTGCCCCCGCGCTCCGGTCCCTGCGTTCTGGGCGTCGATCTGGGCGGCTCCCGCTCCATGTCTGCGGCGGCCTTCTACTGGCCCGAGACGGGGCGTCTGGAGGCCCTGGGCACCTTCCCCGCGTCTCCCTCCCTGGCCGACCGTGGCGCCGCTGATGGCGTCTCCGACCGCTATGTGCAGATGGAGGCCCGCGGCGAGCTGACCGTGATGGGCGATGCCACGGTGCCCCCTGGCCCATGGCTGGCGGCCATCGTGCGGCACCTGGACGGGGCCGAGGTGGCCTGTGTCGTGGGCGACCGCTTCCGCCATGCCGAGTTCACCGAGGCGATGCAGGCCGCGGGCCTTGCCCGTGTGCCCTTTGTCTGGCGCGGCTTCGGGTGGAAGGACGGCAGTGAGGATATCGAGCGGTTCCGGCGCGCGCTGTTTGACGGAGAGGTGCAGGTGGCGCCCTCCATGCTCCTGCGCTCCGCCTTCTCGGATGCAATCACGCTGGTGGATCCGGCCGGGAACCACAAGCTCGCCAAGGCCCGCTCCCTCGGGCGGATCGACGCTGCCGCGGCCTCTGTCCTGGCGGTGGCGCAGGGCGCCCGCATGAAGGCCGCTCCGATGCGGAAAGCGAGGGCGGCATGGCTCTAGACCGCTTCTCCCGCTTCTCTCGGCCGGTGCTGCGGACGCGGCGCTGGAAGGTGCTGCGGCATCAGATCCTCGAGCGCGACGGCTGGGCCTGCACCGAGTGCGGCAAGCGCGGCCGACTGGAGGTTCACCACAAGAAACCCGTTCGCACGGCTCCCGAGCTGTCGTTCGACCCGTCCAACCTGACTTCCCTTTGTCCTGCCTGCCACACGCGCCACACCCGGCGCGAGTGCGGCATTCCTGACCCCGATCCCGCCCGCGTGGCGTGGCGACAAGCCGTCTCCGACCTGGAGGCGCCAACCGAGCAAAAGGAGATCCCATGCTCACTTCCGTGACCATCGCGCGGCGGCAATCCGAGATCCGCCAAGCCCTCTCGACCCTCGCCGGCAAGCCGACCCCGACCGAGGATGAAACCCGGCAGATGGAACAGCTCGACGGCGAATATCGCCAGAACGAGACCCGCTATCGTGCCGCCCTCATCGCCGAGGACACCGAGCGCCGCGAGGCCGGCGCCGATCTGGAAACCCGTTCCGGCCGGGAGTGGTCCGAAATGGTGGCGGGCTATGAGCTGCGGCAGGTGGTGCGGGCGCTGGAGGAGGGCCGCGCGCTCTCGGGCCGCACGGCCGAGGTGGTGGCCGAGCTGCGCAACGCTGGCGGCTATCGCGGCGTCCCGGTGCCCCTGATGGCGCTGGAACAGCGTGCCGGCGAAACCGTGGCCTCGGGCACGCCCGACCCGCTCCAGACGCGCCCGATCATCGACCGGCTGTTCCCGGCTTCGGTGGCGGCGCAAATGGGCGTCCAGCTCATCACCATCGGCTCGGGCGCCGTCGAATGGCCGGTGACGACCTCGGCCGTCTCGGCCGGCTGGGCAGACGGCGAGCTGGCAAACGTGGCCGGGCCGACCGCCTATGCCACGACCGACAAGGCCCTGAAGCCCGAGCAAACCCTGGGCATTCATATGCGGATCAGCCGCAAGGCCATGCTGCAATCGGGCGATGCGCTCGAGGCGGCGATCCGGCGCGACATGTCGGGCACCATGTCGGCCGAGCTGGACAAGGCGATTTTCCGCGGGACCGGCGCGAACGGGCAACCGCTGGGCCTGATCCCAGGCGTCTCCACCTATGGCATCACCTCGACGGCAGTCGGTGCCGCGGCCTCCTGGGCGGCGATCCGCGCGGCGGTGGTGCGCTTCATGACCGCCAACGCGGCGGCCGGGCCGGGCGAAGTCAAGGCGCTGATCCGTCCCGAGCTGTGGAGCTATCTGGACGGCATCCTTGTCAGTGACTCGGGCTTCAAGTTCGAGTTCGACCGGCTGAAGGAGAACCTTGGCGGCATCGTCATGTCGTCCAATGCGCTGGCCGCTCCGAGCGGCTCGCCGCTCGCGACCTCGGCCCTTCTGGCGACCTCGGCCGGCGGCGTGGCGCCCGCTTTCGTGGGCCTCTGGGGTGCCTTCGACCTGATCCGCGATCCCTACTCGGACGCGCAATCGGGCGGCGTGCGGCTGACTGCTCTCACCACGGCAGACGTTACCGTGGCGCGCGGCGCGCAGCTGGAGCTTGTCACCGGACTGCAGGTGGCCTGATGCTCTGGGGGGCGTCTCTCGGCGCGCTGGAGCTGCGCAGCGAGGGCGGGGCAACCCGCCTTCGGGCGACGTTCCCCTATGGCGCGGAAACCGAGCTGGCACCGGGACGGCGAGAAGTCATCGCCGCTCGGGCCTTCTCCGACCGGATCGAGGCAGGCGAGGATATCCACCTTCTCGCCGGCCACGACTACAACCGCCCGTTGGCCTCTCGGGCCGCGGGCTCCCTCACCATTTCCGACACTGACGCGGCGCTGGTGCTCGAGGCGCGGATCGAGGGCGGCACCTCCTGGGCGCAGGACTTCCTTGCGGCTCACCGGGCGGGCCTCGTTCGGGGCCTGTCGCCGGGCTTCCGCGTCCAGTCCGGGGGCGAGCGGATCGAGCAACGCGGCGCAGGGCTGCTGCGCACGATCACCCGCGCGGCTCTGTTCGAGCTGTCGGCCGTGACAGTGCCAGCCTACCCCGAGGCGCAGATCGAGGCCCGATCCTGGGAAACGCATCAGGACCGGCAACCCTTCCGCGGCGCGGCTTTCCACCTCAACCGCTGGAGGCTTTGACATGGGCCTGATGAGCCTTTTCCGCCGCAAGCCCATGGAAACCCGAGCGACCGGCTCGGGCTACACCGCCCTTGTCATGGCAAGCCGCGCAAGCTTTATCGCCGGCACCGCGGGCATGGGCGAGCTGACGGCGACGGTGCAAACCGCTGTCAGCCTCTGGGAAGCCGGCCTGTCCCTTGCGGACGTGACCGGCACCGACATGCTGGACAGGCAGACCATGGCCCTTTGCGCCCGCTCCCTGGCGCTCCGGGGCGAGGCCCTGTTCCTGATCCGCGACCGGCTGGTGCCTGCGATCGATTGGGACTTGAGCACGCGCGACGGGCAACCGCGGGCCTATCGGTGCCAGATCCCGGAAGTGGGCGGCGGCCGAGGCGAAACCGTGCTGGCGCCCGAAGTGCTGCATTTCCGCATCGGCTGCGATCCTGTCACGCCCTGGGCGGGCTCTGCCCCGCTGAAACGGGCGCAGCTCTCGGCCGAGCTTCTGGGCGAGCTGGAGACTGCCTTGCGCGACGTGTTCCGGGACGCGCCTCTGGGCTCCCAGATCGTGCCGGTTCCCGAGGGCTCGGCCGACGACATGGAGGCGCTTCGGCAGGGCTTCCGCGGCCGGCGCGGCGCGGCTCTCGTGATCGAGGGTGTTGCGCAGGCAACCGCGGCCGGCATGAACCCGAACCTCGGCCAGAAGCCGGACCAGCTCTCGCCGGATCTGGCGCGGACGCTGGCCGACAAGATGCTGACCGAGGCCAAGGGGGCGATCCTGGGCGCCTATGGCGTCCTGCCCGGCCTCATGAACGAGGCGACCACGGGGCCGATGGTTCGCGAGGCGCAGCGGCATCTGGCGCAACTGGTGCTCCAGCCGATTGCCGGCCTGATGGCCGAGGAAGCGACCCGCAAGCTCGGGGGCGCGGTGGCTATCGACGTTGTGCGGCCGATGCAGGCCTTCGACCACGGCGGCAAGGCTCGAGCCCTGGCGACCATGGTTCAAGCCATGGCGCAGGCGAAAGAGGCCGGGATCGACGGCGCGGCCATGAAGGATGCCCTGCAATTCATCGACTGGAGCGACTGACATGCTGATGCTCACGCCCGCGGCTCTGGAGATGGCTCGCGCCCTCAAGCGTCGCGAGCGTGACGCGAGGCGGGGAAAGCTCGCGACCGAGCGCGCCGAACGGCTCACGGATGCCATGGCCGCCCGGATGCGCGCGGCTTTCGCCGAGGGCCGCGCCGCGTCCCTGTTCGCGCTCGAGGGTCCGTTCCGTCACGCGATCCGCTCGGGGCTCTGCCTGCAAGGATGGAAATGGGATGCGGCCGACGAGATGGCCGCGGCGATGGTGGCCGAGGCCCTGCGCAAGGCGGGTGTCAAGCGGCCGAGCTGGAACGAGGGGCAGCCGGAATGGACTATCGAGAGCGGAGCCCTGATCGAGCACACCAGGTGCCAGCGGTGCCACAAGCCGCTTCCCGAGGGGCATCAGAAGTTCTGTTCCAGCATATGCAAGCGCAGCCATCACAGCCGGCTTTCGGCTCTGCGCCATGCGGATGCGGAGACGGCCATTCGCATGGCAATCCGTCTGGACTGAGCTGCAACTGGTGCGGAAAGCCCCTCCCGGCCGGCGTGACGATCCGGCGCGAGTTCTGCGATAGCAAGTGTCGGCAACGGTTCTACACGGCCGAGAAACAGGCGGCACGGGCCGAGGCGCGGAAGTCGCAGAAATGCCTGTTCTGCGGCGGGCAGATGGGCGCCTATGGCCGGATCTACTGTTCCGAAGCCTGCAAGGTCCGCTCTGGTGACGACATGCGCGCCAAGCGGAACAAGCGAAACTGCAAGGTCTGCGGCAAGCGGTTCTTCCCTACAAACGAGGGGCAGGTGTATTGCGGCCTCAAGTGCCGGGACGACGACAAGCGCATCCGGTGGCCAAAGGTGTGCCTCGTCTGCGGCATCACATTCCGGCCTCATCAGGTGGAACAGGTGACGTGTTCAAGGGCGTGCAAGGGGCAGATGCAGCGCACGGTGCCGGATCGGACCTGCATCGGGTGCGGGCAGGTATTCCGACCGAAGCGGGCCGCGCAAACGGCCTGCTCCAAATCATGTGCAATGAAGGCGCGGATGCGGAAGGGCTGACCCGTTCCGGGATTATGCTCGAGGGTAGACTCTCGTCTTCCGGCCATTGATCCACTCGACCGCAAGCCATCCCTTATCGGACAGCTTGAAGGTTCTGCCATGCAGGCACGGCTGCCCTTCCTCGGCCAGACCCTTTGCAACCAAGGCATCCATGCTCAATTGACCGACGCCAGGGGCCTTTGCAAGCGGTTCGTAGGACATGGAGACTCGCGAGAACGTCTTCAGCGCCTTCTTATCGGCGGCAGTCAGATCTTCCCAGTCCGCTTCGTCGAACGTGGCCAACTCGTCCATCAGGATCGATCCCTTGCTGATGATTCGAGAGCAGCTTAGGCGCCCGTCAGATGGATAACAACGGCAGCGGGTTCCAGTCCTACCCTGTTCCCCCGGTGTTCCCCAGAAGCAAAACAAGAAAGCCCGCCGGTGGCGGGCTTCTCTCATGTAGCTGATCCCTCAGGGATTTTTGGCTCCGGCGGTAGGGATCGAACCTACGACCAATTGATTAACAGTCAACTGCTCTACCGCTGAGCTACGCCGGAACACGAGGGCGATATAGCAACCGGATTCGGGGGCGTCCAGAGGGAATGTGGAAGGTTTTTCAACTTTCTTCCCGAGGCTCCGACAGGGTGAAAATCGCGTCAGGGGACGGCCCCGGATCGGCCGACACCTGCCTCCGGTTCCAGAGGTCGAGGAGGTGGGCCAGGAGGTTCCGCGCCGCCGCAGGCAGAAGCTCGGGGGGCGTCTCGCGGTAGATGCGAGGCGTCAGGTCGCGCACGGTTGCGGGGCCCCGGGCGAGTTCCGCGAGGATCGAGCTCTCGCGCTCGCGGCGGTGGGCCGCGAGCCAGGCCAGCCGCGATCCGGGGTCGGTCACCGGCGCGCCGTGGCCCGGATAGAGCAAGCGCCAGCTCCGGGCAGAGAGCTGGGCGAGGGAGGCGACATAGGCGCCCATGTCCCCGTCCGGAGGAGAGACGAGCGAGGTGGCCCAGCCCATCGCGTGGTCGCCCGAAAAGCAGCGGTCGCCCCAGGCGAAGCAGAGATGGCTGCCGAGATGGCCCGGAGTATGGATCGCCTCGAGCACCCAGCCGGGGCCCTGGACGCGCTCGCCGTGGGCGAGGCAGCGGTCGGGGCGGAAGGCGGTATCCACTCCTTCGCCGCCCTCCAGCCCCCCCGCCGCCAGCTCCGCCATCAGCGGACTGCGTCCGGCCTCGGCCGAGCCGAAGGCCAGCACTTCGGCGCCAGTGGCTGCTGCCAGCGGCGCGGCCAGCGGCGAGTGATCCCGGTGCGCATGGGTCACGAGAATCGTCGAGATACGCTCGCCCGGTGCCAGCGCCGCGAGCAGCGCCTCGCGATGCGCAGGCAGGTCGGGGCCGGGATCGATCAGCGCCACCTCTCCCTCTCCCACAAGGTAGCTGTTCGTGCCCCAGAGGGTCATGGGCGAGGGGTTCGGCGCCAGCACCCGTCGGAGCCCGGGCTCCAGCCTCTCGCAGCGTCCGGCTTCGGGTCTGTCCATGGCTCTTTCCTCGTCCGGAGCGCAGGGCTAGGGTCGGCCCATGTTCGCGCCGCTCAAATCCTTCGTCCCGCGCAGTCTCTATGGCCGCGCCGCCCTCATCCTGATCGTGCCCATCGTCACGATCCAGCTCGTGATTTCCATATCCTTCATCCAGCGCCATTACGAGGGCGTGACGCGGCAGATGGCGCGGGGGCTGATGATCGAGCTCCGGCACCTGATCGATGAGGTGAACGAGGCGGGCTCCCTGCCCGAGGCAGAGGCGCGGGCCGCCCGGGTGGCGGAGGCGCTCGAGTTTCGCGTGGCGCTGCCCGCCGACTGGAGCACGCCAGAGGACCGGCGAGACTTCTGGGACCTGTCCGGGCGTCAGCTCGTGCTCACGCTGCACGAGGGGCTGGCTCCGCTGTCGGCGGTCGATCTCGTGACCAACGACAACGAGGTTCGCCTCCTCATCGCGACCGACAAGGGGCCGATGGAGGTCGCGGTCGAGCGGAGGCGCGTCTCGGCCTCGAACCCGCACCAGCTTTTGGTGCTGATGATCGTGACCTCGATCCTGATGACGATCATCGCCTATCTCTTCCTCTCGAACCAGCTCCGGCCGATCAAGCGGCTGGCGGATGCGGCGGAGGCCTTCGGCAAGGGCCAGCACATCGCATACCGTCCGCGCGGCGCCGTCGAGGTCCGGGCTGCGGGGCAGGCCTTCCTCGAGATGCGTTCCCGGATCGAGCGGCAGATCGAGCAGCGCACGCTGATGCTGTCGGGGGTGAGCCACGATCTGCGCACGCCCCTCACCCGGCTGCGGCTCGGCCTGTCGATGCTGTCGGATCCCGAGGAGGGCCAGGAGCTTCTGGGCGACGTGGCGGACATGGAGCGGCTGGTGGACGAGTTCCTGTCCTTCGTCCGGGGCGACGCGCTCGACGAGGCGGAGGAGGTGGACCCGCTCGCGCTGGTCGGCCGTGTCGTGGAGAATGCGCAGCGCGCGGGACAGGATGTGACGCTGGTGCGGGCCGAAGGGACGGGCACCATGGTGCTTCGGGAGGCCGCCGTGGGCCGGGCGCTGGAGAACCTTATCGGAAACGCGGTGCGCTACGGGACGAAGGCGGAGGTCTCGGTGACGCTCGGAGAGCGGGCGCTGCGCATTGCGGTCGAGGATGACGGGCCGGGCATTCCGCGTGAGCGGCGCGAGGAGGCGCTGCGGCCCTTCACCCGGCTCGACAAGGCCCGCAACCCGAACCGCGGCGGGGGCGTGGGGCTGGGTCTCACGATCGCCATGGATATCGCCCGCAATCACGGCGGCGCGCTCCGGCTGGGCGAAAGCGAGACGCTGGGCGGACTTCGCGCGGAACTGACGCTCGCCCGCTGATCCGGCGGCAGGTCCTCTCGCGACCGGAGCTACCTTCGGCGCGCGCTCCGGGGCTGCGGTTCCTCCCTGCGCGGGTTTACCGGCGCGCGCCCAGAACCAGCGACCATTGCGCCCGTCCGTCCTTGCCGGGGGCGATGCCCACGCCGGCCTCGCGCACGCCCCGCATCAGGATGTTCTTGCGGTGGCCCGAGGAATTCATCCAGAGATCGACCACCCGCTCCGCGGTGGCCGGGCCGCGGGCCACGTTCTCGGCCGCGGTGCCGAAGCGGTAGCCCTCGGCGCGCAGGCGCTCGGTGAGGCCCGCGCCGTTGCCGAGTGTGTGGCCGAGGACCGAGCGGCGGGCATTGTCGCAGGCCTGGACCTGCGCCGCGCGCTGAAGCTTCGGGGCGAAGGCGAGGGGCGGCAGGCCGGCCTGGCGTCGCTCGGCGCTGACCAGGCGGATCACCTCGCTGCGCAGGGCGTCGGCGCCCTGCGGGGCGGGGCAGAGGGCGGCGGCGGGCAGGGGAGCTGCCGCAACCAGAAGAAGGCTGAGAAGGATCGGGCGCATGGGTCTCCTCCAGTGGGGTGCGCGGCGATGACGGGGCGGGCGTGCTGCCGGAGGCCCTGCACGCCCAGGTTGAAGCTGTCGGGCGGCCTCAGGTTCCCTGCGAAATGTTGACAAGGGATGTGCGTCCGCCCCTCAGGTGCCCGCCGAGGGCCGGGGCGCTACGATCCGCCCGCCTGCGGCCGCGGCATCCTCCGGGTCGTGCGAGACGAGGATCACTGGCAGGCCCTCGCGCCGGGCGCGGTCCAGAACGAAGCTGCGGATCTGCGCACGGAGGTCCACGTCGAGCCGCCCGAAGGGCTCGTCGAGCAGGAGCGCCGCGGGCTCGGCCAGAAGCGTGCGCATGAGTGCCACGCGCGCCCGCTGGCCGCCCGATAGGGTGGCCGGATCGCGGTCGGCAAAGCCTGCCAGATCGGCTTGGGCCAGCGCCTCCTCGATCCGGGCGCGGCGCGCGGGCCCGCGGACGGAGGAGGGCAGGCCGAAGCCGAGGTTGCCGCCGACCGAGAGGTGGGGAAAGAGCACATCGTCCTGAAACAGGATCCCGATCCGTCGCGCGCGGGTGGGAAGGTCCGTCACGTCGCGCCCGTCGAGGAGGATGCGGCCACTCGCCCGAAAGCCCTGCCCGAGCCTGCCGATGATGGCGGCAAGCGCCGTCGATTTGCCCGAACCGGAGGGACCCATGATCGTCACCACCTCGCCGGGGGCCACGGTGAGATCGAGCGCGACGAGCGGCGTATCGCCCCGCGTCACCCGCAGCCGGTCGAGCGTCAGGCCCTTCATGCAAGCCTCAGGCCGCGGCGTCGGCGGAAGAGGAGGGCGGGCAGGACGAGGGCTCCGGCGAAGGCGGCCGCGGGCAGGATCATCTGCAGGAGGGCATAGGCGCCCACAATCCGCCGGTTGCCGCCCGAGGACAAAGCGACCGCCTCCGTCGTGAGGGTGGAGACGCGACCGCCGCCGATGAGGAGGGTGGGCAGATACTGGCCCACCGACACCGCCATGCCCACGGCGAAGGCGGTCAGCACGGGGCGCAGCAGCATCGGCAGGCGCAGGCGCCAGAAGATGCGGCCCGGCGAGGCGCCAAGGGCCGCCCCCACCGTGCCGATCCGCGGATCCCACGCCCGCCAGGGCGCCGAGAGCGAGAGAAAGACATAGGGCAGAACGAAGACGAGATGCGCGGCCGCCACAGCCATGGGCGTGCCGTCGAGCCCGAGGCTCAGGGCCGCCACCTGAAGCCCCGGCAGGAAGGCCACCTGCGGCGCAATGAGAGGCAGGTAGAGCGCGGCCATGGCCCGGGGAGAGGGGGAGAGCCCATGGCGCGCCTCGGCCTCGAGGCAGGCGAGCGTCAGGGCGAGCGCGGCCATGGCCACGGCCGCGGCAATCCCGAGCGTGAGAGCTGCATGTCGCGCAAGGCTCGGCGCGGCCTCCTGCCAGACGCGCAGGCTCACGCTCTGCGGCAGCGCGTCCGGGAAGGTCCAGAGGCCCGCCACCGACCAGAGCCCGAGGCCCGCGAGCCCCAGCATCACCGCGGCAAGCGCCAGCGCCGTGCCGCCGAGGGTGAGCGGGCGCAGTACCCGGTCCAGCGCGGGCGCGCGCCGGCCCCCCGCGGCGGCGCGCAGCAGGAGCCTCCGGCCGAGGATCTCGCCAGAGCGCCAGAGGCCGAGCGCGCCGAGGACCAGCGCGAGTTGCAGAAGCGCCGCCGCCGCTCCGCGTCCGCGTTCGGCCAGCGCGGGATCGGCCATCCAGAGCGCGATCTGCACGGAAAGCGTCGGCGGCCGGGTCGGGCCGAGGATCATCGCCATCTCGACGGCCGTCATGGAATAGGCCAGCACCGCATAGCAGGGCAGCCGGATCTGGGCATAGAGCGCGGGCAGCACGCCGAAGACCCAGCCTGCGGCGCGCCCGTAGCCCAGGGACTCCGCCAGCATCAGCCGTCGCTGGGCATCGGTCTGGGGCAGGGCCGCGAGCGACATGAGGAGGAGGAAGGGCACCTCCTTGGCGACAAGGCCCCCGATGAGCGCAAGCCCTGCCGGATCGTTCAGCGTGAGCAGATCCGGCGGCGCCTGCCAGCCGGTGGCCCAGGGCGAAAGGGCGCGGGCGATCCAGCCCGACGGCGCGATGAGAAAGGCGAGCCCCAGCGCCACGGCGGCATGGGGCACCGCGAGGAGCGGCGAGAGAAGGCGCAGGAGCGCCTCGAACAGCCGTCCGCCGGGGAGGAGCGCGAGAATGAGGAGCGTCAGCGCAAGCGCGAGCGCTGTCGCGACGAAGCCCGTCACGAGCGAGAGCCGCACCGAAGCGCCGAGCCCGGGCCAGGCGGCGAGCGCGCGGAAGGCCTCGAGATCGGGGCCGCCCTCAAGGCCGAAGGCGGGCGCAATCGTGCCGGAAAGCCCTGCCGCCACCGGCGCCAGCATCAGCGCGAGCGTGATCCGCGGCGCCTGCCGGAGAAGCCCGCCCGCGCGCTGGCTCACTGCGCGGTCCCGTAGCGGCGGACCCAATCCTCGGCGATGCGCTCCATCCAGCTCGCGTGCGGCTCGAGGAGGGCCGGGCCCAGTTCGTCAGGCGCAAGCGTGGCAGGGCCGAGATCGAGGGCTGCAAAGCGGGCGCGGTCCGGCTCGGGCAGGGCCGCCACATCGAGCACGGTCTGAAAACCCAGCACTGCCGGATCCTGCGCGCGCGCCTGCACTTCCGGGTCGAGCAGCAGGTTGGCGATCACCTTCGCCCCCGCGGCATGGGCCGCGTTGAAGGGGATCGCCACGAAGCTCGCATTGCCGATGGTGCCGCCCTCCAGCGTGAAGGTGCGCACCGTCTCGGGCAGTTCGCCCGCCGCGATGGCGGCAGAGGCCCGACCGGGGTTGAAGGCGAAGGAGATGGCGATCTCGCCATCGGCAAAGAGCTGGCCGAGCGCGGGCTCATTGGCCGGATAGGCGCGCCCCTCACGCCAGAGGTAGGGCTCGGCGCGGTCGAGCCAGGTCCAGAGCGGCGCGGTGACGGTCTCGTAGGTGGCGGGATCGACCGGCTTCTGGAGGAGCCCGGGATCCTCGATCACGCCATAGAGCACCTGCTTGAGGAAGGTGGTTCCCAGATAGTCCGGCGGCTGCGGGAAGGTGAAGCGACCCGGATTGACCTCGATCCAGGCGAGGAGCTGGGCGAGCGTCCGGGGCGGCGCGGGCAGCCGCGCGCTGTCATGCTCGAACACGAGCTGCGCCATGGCCCAGGGGCTTTCTAGCCCGCCCGTGGGTTCGGTGAAGTCGGTGACGACGGCGGGCTTGCCCGCCACATCGACATGGGTCCAGTTCGGCAGGCTCTCGGCGAAGGGCTCGGACAGCAGGCCCTGCGCCTTCATCGCCGCGAAATTCTCGCCGTTGATCCAGATGAGATCGACCGCCCCGCCCTCGGTGCGGCCGGCCTGCTTCTCGGCCAGCACGCGCGCCACCGCCTCGGTCGTGTCGGCGAGCTTCACCTGCTCGAGCATGAAACCGTGGCGGGCGCGGGCCTGCTCGCCTACCCAGGCGATGAAGGCGTTGATCTTCGGATCGCCACCCCAGGCGTGCCAGTAGACGGTCTGGCCCTCCGCCTCGGCCAGCACCGCGTCCCACCCCGTCTCGGCCTCTGCCGCGGGCGTCAGCGCCGCGAGGCAGACCAGTGCCGCCCCCATGATCCGTCCGAACATCCCGCTCTCCCTTCTGTGGCTCGCCCCCGGCTTAGCGTGAGGCGCGGGGCCCCCGCAATCGCCCCCGTTGCACGAGCCCGCCGCTTCGGCCACAGAGGGCACACAGGAGGTGCCATGCTCGACGGTCTGATGCGACGCCTGATCGACCCGCCGCTCGACCGTGCGGGGCCCATGCTCGCCCGCCGCGGCTGGAGCGCGGATGCCGTGACGATGGTGGGCCTTGCGCTCGGGCTCCTCGCGGCGGGGCTGGTGGCGGCGGGTGCCTCCACCCTCTGGGCGATCCTGCCGCTTCTCGCGGGCCGGATCGCCGACGGGCTCGACGGCGCGATCGCGCGTGCGGGACGCAGGACCGACTTCGGCGGCTATCTCGACATCACATGCGACTTCCTGTTCTACGCGGCCTTACCGCTCGCCTTCGTGCTGCGCGCGCCTGAAAACGGGGCGGCCGGCGCCTTTCTGCTCGCCTCTTTCTATGTGAACGGCGCGAGCTTTCTGGGCTTTGCCGTGCTGGCCGCAAAGCGCGGGATGGAGACGACGGCCCGCGGCGAGAAGTCGCTCTACTTCACGGCAGGGCTTCTGGAAGGCAGCGAGACGATCCTGTTCTTCCTCTTCCTCTGCCTCTTTCCGGGGCTCTTCGCGCCGGCCGCCTGGATTTTCGGCGCCCTCTGCTTCGTGACCGCGGCGAGCCGCGTCCTGCTCGCCCGCAGGCTCTTTCGCGACTGACCTTCTGTGAGGGCGGCCCATGGCCCGGGCTTCCTCCGCTTCAAGGGAGAGGAACGGGAGCCGGGCCATTTTTTCCACCGCCCTTGCGGGGGCGGGGGCTCGCCACTAAGACTCGGGTAACAGCTGGACGATAATCCTTGCAAACGTCACGGGACAGCAGGCGCACATGAAAGACCCTATCGATCTCTACATGAACACGCTCGTGCCGATGGTGGTCGAACAGACCAGCCGGGGCGAACGGGCCTATGATATCTACAGCCGGATGCTGAAGGAGCGGATCATCTTCCTCTCCGGGCCGGTGCACGACGGCATGTCGTCGCTCATCTGCGCGCAGCTGCTGTTCCTCGAGGCCGAGAACCCCTCGAAGGAAATCGCGATGTACATCAACTCGCCCGGGGGCGTGGTGACCTCGGGCCTGTCGATCTACGACACGATGCAGTACATCCGGCCGAAGGTCTCGACGCTGGTGATCGGGCAGGCGGCCTCGATGGGCTCGCTGCTGCTCACCGCGGGCGAAAAGGGGATGCGCTTCTCGCTCCCGAACAGCCGCGTCATGGTTCACCAGCCCTCGGGCGGCTACCAGGGGCAGGCCACCGACATCATGATTCACGCGCGCGAGACGGAAAAGCTCAAGCGCCGGCTGAACGAGATCTACGTCCGGCACACCGGGCAGGACCTCGAAACCGTGGAAGCCGCTCTCGAGCGGGACAATTTCATGTCGGCCGAGGATGCCAAGGCCTGGGGCCTGATCGACGAAATCCTCGAGAGCCGCAACCGGCCCGACGACACGGCGAAGTGA
Protein sequences of DBSCAN-SWA_1 >NZ_CP051468|891106:910009|893272_893485_+|WP_011338073.1|DBSCAN-SWA MFAVYSIDELLARKAKGHFRVETVAGRCVISVHRPGEPDETVFCLSAGHANQVRQSLTDEGLTGYFEGAR >NZ_CP051468|891106:910009|893705_895043_+|WP_011338072.1|DBSCAN-SWA MTRHYDPDAWRREVEAECGAEFGSDAAGTGHEEPPRSGLLSPTTWRRNVPPPRQWLYGYHLQRRMLSGTISPGGLGKSSLALVDALAMATGRKRLHSWVHSREGLRVWYWCGEDPVEEISRRLEAACIHYGISDDEIGGRLHVDSGRDQPIKIAAADREGVKVAKPVVAALVAQMRARRIDALIIDPFITCHTVPENDNTAMNAVIDAWRDVANQTNAAIELIHHANKAGTGDSINDGRGGSAFRDGIRAGRVLAPMTFEEGEKMGLAPAEAARIFRAMDGAKTNMSARGGAFTWYRMQSVPLGNATPEYPRGDEVGVCTAWTPPDAFEGVKLDDLRRVQEAIAARATPPAKAPTSKDWAGYVVADALGLDVGAPGTTKAQQGPDQIAARSRVSSLINTWLRNGALRVETVRSLRDGRDVPSITVGMPAEDHNAAYPQSPHSTAE >NZ_CP051468|891106:910009|897153_897495_+|WP_011338069.1|DBSCAN-SWA MALDRFSRFSRPVLRTRRWKVLRHQILERDGWACTECGKRGRLEVHHKKPVRTAPELSFDPSNLTSLCPACHTRHTRRECGIPDPDPARVAWRQAVSDLEAPTEQKEIPCSLP >NZ_CP051468|891106:910009|900360_900867_+|WP_168746003.1|DBSCAN-SWA MLMLTPAALEMARALKRRERDARRGKLATERAERLTDAMAARMRAAFAEGRAASLFALEGPFRHAIRSGLCLQGWKWDAADEMAAAMVAEALRKAGVKRPSWNEGQPEWTIESGALIEHTRCQRCHKPLPEGHQKFCSSICKRSHHSRLSALRHADAETAIRMAIRLD >NZ_CP051468|891106:910009|892367_892997_+|WP_011338074.1|DBSCAN-SWA MHRTEKEVVSAEKWETLLAEAAADWPTSSKYLPELKQRLAEIIVQDGFPDPGAAIFHKGRFWCKRLEGERPVGKAISPGWAFTRAQAEPLTLPHEAADLYLKVIEVERALAAGNLESVFSHSLLLGLAHSRFNARRLHLKHVTRGKKVAGGQLNSAHKTNQRHIPMRTLRFARMAELVPKLGVENAARQCEAEGMGGWQGIKKQWDRFK >NZ_CP051468|891106:910009|902004_902916_-|WP_011338062.1|DBSCAN-SWA MDRPEAGRCERLEPGLRRVLAPNPSPMTLWGTNSYLVGEGEVALIDPGPDLPAHREALLAALAPGERISTILVTHAHRDHSPLAAPLAAATGAEVLAFGSAEAGRSPLMAELAAGGLEGGEGVDTAFRPDRCLAHGERVQGPGWVLEAIHTPGHLGSHLCFAWGDRCFSGDHAMGWATSLVSPPDGDMGAYVASLAQLSARSWRLLYPGHGAPVTDPGSRLAWLAAHRRERESSILAELARGPATVRDLTPRIYRETPPELLPAAARNLLAHLLDLWNRRQVSADPGPSPDAIFTLSEPREES >NZ_CP051468|891106:910009|904955_905600_-|WP_011338060.1|DBSCAN-SWA MKGLTLDRLRVTRGDTPLVALDLTVAPGEVVTIMGPSGSGKSTALAAIIGRLGQGFRASGRILLDGRDVTDLPTRARRIGILFQDDVLFPHLSVGGNLGFGLPSSVRGPARRARIEEALAQADLAGFADRDPATLSGGQRARVALMRTLLAEPAALLLDEPFGRLDVDLRAQIRSFVLDRARREGLPVILVSHDPEDAAAAGGRIVAPRPSAGT >NZ_CP051468|891106:910009|898741_899296_+|WP_011338067.1|head,protease|DBSCAN-SWA MLWGASLGALELRSEGGATRLRATFPYGAETELAPGRREVIAARAFSDRIEAGEDIHLLAGHDYNRPLASRAAGSLTISDTDAALVLEARIEGGTSWAQDFLAAHRAGLVRGLSPGFRVQSGGERIEQRGAGLLRTITRAALFELSAVTVPAYPEAQIEARSWETHQDRQPFRGAAFHLNRWRL >NZ_CP051468|891106:910009|902956_904276_+|WP_011338061.1|DBSCAN-SWA MFAPLKSFVPRSLYGRAALILIVPIVTIQLVISISFIQRHYEGVTRQMARGLMIELRHLIDEVNEAGSLPEAEARAARVAEALEFRVALPADWSTPEDRRDFWDLSGRQLVLTLHEGLAPLSAVDLVTNDNEVRLLIATDKGPMEVAVERRRVSASNPHQLLVLMIVTSILMTIIAYLFLSNQLRPIKRLADAAEAFGKGQHIAYRPRGAVEVRAAGQAFLEMRSRIERQIEQRTLMLSGVSHDLRTPLTRLRLGLSMLSDPEEGQELLGDVADMERLVDEFLSFVRGDALDEAEEVDPLALVGRVVENAQRAGQDVTLVRAEGTGTMVLREAAVGRALENLIGNAVRYGTKAEVSVTLGERALRIAVEDDGPGIPRERREEALRPFTRLDKARNPNRGGGVGLGLTIAMDIARNHGGALRLGESETLGGLRAELTLAR >NZ_CP051468|891106:910009|899297_900359_+|WP_011338066.1|portal|DBSCAN-SWA MGLMSLFRRKPMETRATGSGYTALVMASRASFIAGTAGMGELTATVQTAVSLWEAGLSLADVTGTDMLDRQTMALCARSLALRGEALFLIRDRLVPAIDWDLSTRDGQPRAYRCQIPEVGGGRGETVLAPEVLHFRIGCDPVTPWAGSAPLKRAQLSAELLGELETALRDVFRDAPLGSQIVPVPEGSADDMEALRQGFRGRRGAALVIEGVAQATAAGMNPNLGQKPDQLSPDLARTLADKMLTEAKGAILGAYGVLPGLMNEATTGPMVREAQRHLAQLVLQPIAGLMAEEATRKLGGAVAIDVVRPMQAFDHGGKARALATMVQAMAQAKEAGIDGAAMKDALQFIDWSD >NZ_CP051468|891106:910009|905596_907285_-|WP_011338059.1|DBSCAN-SWA MSQRAGGLLRQAPRITLALMLAPVAAGLSGTIAPAFGLEGGPDLEAFRALAAWPGLGASVRLSLVTGFVATALALALTLLILALLPGGRLFEALLRLLSPLLAVPHAAVALGLAFLIAPSGWIARALSPWATGWQAPPDLLTLNDPAGLALIGGLVAKEVPFLLLMSLAALPQTDAQRRLMLAESLGYGRAAGWVFGVLPALYAQIRLPCYAVLAYSMTAVEMAMILGPTRPPTLSVQIALWMADPALAERGRGAAAALLQLALVLGALGLWRSGEILGRRLLLRAAAGGRRAPALDRVLRPLTLGGTALALAAVMLGLAGLGLWSVAGLWTFPDALPQSVSLRVWQEAAPSLARHAALTLGIAAAVAMAALALTLACLEAEARHGLSPSPRAMAALYLPLIAPQVAFLPGLQVAALSLGLDGTPMAVAAAHLVFVLPYVFLSLSAPWRAWDPRIGTVGAALGASPGRIFWRLRLPMLLRPVLTAFAVGMAVSVGQYLPTLLIGGGRVSTLTTEAVALSSGGNRRIVGAYALLQMILPAAAFAGALVLPALLFRRRRGLRLA >NZ_CP051468|891106:910009|900902_901409_+|WP_011338064.1|DBSCAN-SWA MTIRREFCDSKCRQRFYTAEKQAARAEARKSQKCLFCGGQMGAYGRIYCSEACKVRSGDDMRAKRNKRNCKVCGKRFFPTNEGQVYCGLKCRDDDKRIRWPKVCLVCGITFRPHQVEQVTCSRACKGQMQRTVPDRTCIGCGQVFRPKRAAQTACSKSCAMKARMRKG >NZ_CP051468|891106:910009|891106_892348_+|WP_011338075.1|integrase|DBSCAN-SWA MKLSAAGVEKIKPTDKRQEIPDSLCVGLYLIVQPTGKKGWQVRYRHGGVHRRMSLGAFPVLSLAGARERARQTLAAASAGTDPAAEVKAAKAPKLDGDRDKIKTLIGQFDKRHLSKLKSGAVVRRELERHVVSAWGERDVHQIGKRDVIDLLDGIADSGRVVTANRVRAYLAKFFGWCVERDILATSPAAGVKPAGKETSRARVLTDDEIRWFWTACEAEGFPWGPFGKVLLLTGQRLNEVAQMTDSEIRGDLWHLTADRTKNGRAHDVPLTGAVRAALDEVERVTDEEGRVRFIFTTTGRTPVSGFFKARAVLAEAMVGIAQKERGEPVEIPRWTFHDLRRTAATGMARLGIPVRVTEAVLNHVSGTGGGIVAVYQRHDYADEKRQALEAWARFVLSLVEGEADNVVRLQHG >NZ_CP051468|891106:910009|908601_909210_+|WP_011338057.1|DBSCAN-SWA MLDGLMRRLIDPPLDRAGPMLARRGWSADAVTMVGLALGLLAAGLVAAGASTLWAILPLLAGRIADGLDGAIARAGRRTDFGGYLDITCDFLFYAALPLAFVLRAPENGAAGAFLLASFYVNGASFLGFAVLAAKRGMETTARGEKSLYFTAGLLEGSETILFFLFLCLFPGLFAPAAWIFGALCFVTAASRVLLARRLFRD >NZ_CP051468|891106:910009|897479_898742_+|WP_017140247.1|capsid|DBSCAN-SWA MLTSVTIARRQSEIRQALSTLAGKPTPTEDETRQMEQLDGEYRQNETRYRAALIAEDTERREAGADLETRSGREWSEMVAGYELRQVVRALEEGRALSGRTAEVVAELRNAGGYRGVPVPLMALEQRAGETVASGTPDPLQTRPIIDRLFPASVAAQMGVQLITIGSGAVEWPVTTSAVSAGWADGELANVAGPTAYATTDKALKPEQTLGIHMRISRKAMLQSGDALEAAIRRDMSGTMSAELDKAIFRGTGANGQPLGLIPGVSTYGITSTAVGAAASWAAIRAAVVRFMTANAAAGPGEVKALIRPELWSYLDGILVSDSGFKFEFDRLKENLGGIVMSSNALAAPSGSPLATSALLATSAGGVAPAFVGLWGAFDLIRDPYSDAQSGGVRLTALTTADVTVARGAQLELVTGLQVA >NZ_CP051468|891106:910009|895205_895511_+|WP_011338071.1|DBSCAN-SWA MSNPDLLSATKALARIDLSDEDATLELMIAAAQADVLAAAGYTLAEGASLPSDLAYAICDQAAMLFDARGGTEDRPMGLSLAASRIVSRYRGVRVCLPTSE >NZ_CP051468|891106:910009|904352_904823_-|WP_002720349.1|DBSCAN-SWA MRPILLSLLLVAAAPLPAAALCPAPQGADALRSEVIRLVSAERRQAGLPPLAFAPKLQRAAQVQACDNARRSVLGHTLGNGAGLTERLRAEGYRFGTAAENVARGPATAERVVDLWMNSSGHRKNILMRGVREAGVGIAPGKDGRAQWSLVLGARR >NZ_CP051468|891106:910009|901421_901700_-|WP_017140245.1|DBSCAN-SWA MDELATFDEADWEDLTAADKKALKTFSRVSMSYEPLAKAPGVGQLSMDALVAKGLAEEGQPCLHGRTFKLSDKGWLAVEWINGRKTRVYPRA >NZ_CP051468|891106:910009|907281_908487_-|WP_011338058.1|DBSCAN-SWA MFGRIMGAALVCLAALTPAAEAETGWDAVLAEAEGQTVYWHAWGGDPKINAFIAWVGEQARARHGFMLEQVKLADTTEAVARVLAEKQAGRTEGGAVDLIWINGENFAAMKAQGLLSEPFAESLPNWTHVDVAGKPAVVTDFTEPTGGLESPWAMAQLVFEHDSARLPAPPRTLAQLLAWIEVNPGRFTFPQPPDYLGTTFLKQVLYGVIEDPGLLQKPVDPATYETVTAPLWTWLDRAEPYLWREGRAYPANEPALGQLFADGEIAISFAFNPGRASAAIAAGELPETVRTFTLEGGTIGNASFVAIPFNAAHAAGAKVIANLLLDPEVQARAQDPAVLGFQTVLDVAALPEPDRARFAALDLGPATLAPDELGPALLEPHASWMERIAEDWVRRYGTAQ >NZ_CP051468|891106:910009|895684_897163_+|WP_017140248.1|terminase|DBSCAN-SWA MPEGKKAGKPLKLAEFQCRFVRGSMAPGVMVGVLSIGRGNAKTALSAGLALAELVGALHPKPQPKREILFAARNRDQAKTAFQFLVGYIDGLPEEDRAGFTIRRGSKLEVEFEGNGGGLARVIAADGKSILGGAPTLALMDERAAWEREKGDNLENAILSGLGKRDGRALIISTSAPDDANTFSRWLDEPPPGTYVQEHRPAFGLPADDLPSLLEANPGAAEGIGSTPEWLVAQARRAIARGGSALSSFRNLNRNERVSTEDRSVLVTVDEWLSAEVAPDELPPRSGPCVLGVDLGGSRSMSAAAFYWPETGRLEALGTFPASPSLADRGAADGVSDRYVQMEARGELTVMGDATVPPGPWLAAIVRHLDGAEVACVVGDRFRHAEFTEAMQAAGLARVPFVWRGFGWKDGSEDIERFRRALFDGEVQVAPSMLLRSAFSDAITLVDPAGNHKLAKARSLGRIDAAAASVLAVAQGARMKAAPMRKARAAWL >NZ_CP051468|891106:910009|893075_893273_+|WP_017140250.1|DBSCAN-SWA MERKLISAAAVRALCGGISDMSLWRWLNNPAMAFPKPVTIQRRRYWREAEVLAWLDARAEAREVA >NZ_CP051468|891106:910009|909376_910009_+|WP_002720342.1|protease|DBSCAN-SWA MKDPIDLYMNTLVPMVVEQTSRGERAYDIYSRMLKERIIFLSGPVHDGMSSLICAQLLFLEAENPSKEIAMYINSPGGVVTSGLSIYDTMQYIRPKVSTLVIGQAASMGSLLLTAGEKGMRFSLPNSRVMVHQPSGGYQGQATDIMIHARETEKLKRRLNEIYVRHTGQDLETVEAALERDNFMSAEDAKAWGLIDEILESRNRPDDTAK |
22 | Bacillus_phage(40.0%) | protease,terminase,integrase,portal,capsid,head | attL 876990:877005|attR 893604:893619 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1126283 : 1142098
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP051468|1126283:1142098|DBSCAN-SWA GATGAGTGATCAATCTTTACCCGTTCTCTCGTCCGCTCAACCCGTCATTCCGCCTAAGCCGAAACTGTCTAGGGCAGAAAAGAATATCAAGTGGATTGAGTCGAACCTGTTTATTCCAGAAGGTAAGGACGTTGGTAAGCCATTCAGGCTTGTGGACTTTCAGAAGGATATCATTCGTTCGATCTATGACAATCCGGCTGGAACTCGTCGGGCTATTATTTCGATGCCCCGTAAGGCTGCGAAGACAACGCTCTGTGCCGCTCTGATGCTGTTGCATCTGGTAGGTCGAGAAGCCTTGCCCAACTCGCAACTATACAGTGCAGCCCGAAGCAGGGATCAAGCTGCGGAGTTGTTCAAACTGGCAGTCAAGATGATCAGGATGAGTCCGCGTATATCGCGTTTTGTTCGGATTGTGGAGACTAGTAAGCGGCTGAAGGTGCCCGAGCTAGGGACAGAGTATAGGGCTCTCTCCAAGGATGCTGGAACGGCTCAAGGGCTGAGTCCGTGCCTCGTGATCCACGATGAGCTAGGGCAGGTGAGAGGGCCAGTAGATCCGCTCTACGAAGCCTTGGAGCTTGCCACTGCGGCTCAGGCAAACCCGCTTACCCTCGTGATCTCTACGCAGGCTCCGACTGACAATGACCTTCTTAGTCAGTTGATCGATGACGCCGCAACCGGGGCAGATCCGACCAAGGTTCTTAAGCTCTATTCGTGTCCGATGAATATCGATCCGTTCTCAGAAGAAGCCCTAGCTGTCTCGCATCCTGCATGGAATTCCTTTGTGAACCGCAAGGAACTCAAGCAAATGCAGGCCGAGGCCGCACGGATGCCTGCCCGTGCTGCGGATTTCCGCAACTACACCCTCAACCAGAGGATCGAAGTCAACGCTCCATTCATTTCAAAAGATGTTTGGGATGAAGGCAAGGATAATCCCGAAGAATGGCATGGAAAGGATGTTTGGCTTGGCCTTGATCTATCTGAAACCCGAGATCTCACTTCTCTTACTTTAGCACATAAAGACGAGAATGGTTTGCTTCACGTTCATCCATTCTTTTGGCTTCCCGATGAGGGAATAGAGGATAAATCGAGAAGTGATAAGGTTCCTTATGACATTTGGGCCAAGGGTGGACTAATCCATTTAAGCCAAGGAAGAACCATCCAATATAAGGATGTCGCTGCCAAGCTTAAGGAGATTGCGGATAACGCCAATGTCCAGAAGGTAGCCTTTGACCGTTACAAAATAAAATACTTCAAGCGCGACATGATTGATTGTGGTTTTGATGAGCGATGGATTGACGAGCACATGGTTTCTTATGGGCAGGGCTTCGTTTCTATGGGCATCGGAATTAACGAGTTGGAGCGTTTAATTCTGGATGGCAAAATTCGCCATGGGAACAACCCCGTCATGAATATGTGCATGGCAAACGTGAAAGTTGTTTCGGACACTTCAAACAACCGCAAATTCATCAAGCATACTTCGACAAGACGAATTGACGGCGCTGTTACGTTAGCGATGCTCGCCGGAATGCTTGCTGATCCAGATAACAAGCCAAAGCCCAAGCGAAAAGCTCTATTTGCTTAATCCTAAATATCGCAAAGTATTATACTTGCGAGGATAAGCATGTTCGGATTTGGCAAACGAGAGTCAGGCAGCAATCAGCCGACCGTTATTAGTAGGATCTCTGAGGCGTTCGGCTGGTGGGGCGGATCGTCTTCTATTGCCCCGGCACTAAGCAATACAACTGCGATGCAGAACCCGGCTGTTATGTGCGCAGTCCGAACAATCGCGGAAGGTGTAGCTTCCATGCCTATCAATATTATCGAGACAAAAGAAGTAGACGGGCTGTCAAAGCGAACAATTCGGAAAGATCATTGGGCGTCAAAGCTGATTAATAAGCCAAATGCCTATCAGACCCGATTTGAATTTGTTGAAATGATGATTTCAAATGCCGTGCTCGGAAAAGGCGCATTGGCACTCAAAACCGTTGTCGGTGGAGAAGTCCGCGAACTCTTGCCTATCCCTAGCGGTATTTGGGAAATGGAAATCCTCACTAATGGATCATACAATTTCCGGGTAAGGTTTACCGATGGTTCCAGCCGCGTATTCGCAGCTAAGGATTGTCTATTCTTCCGTGGTTTGTCGCTTGACGGGTATTCGTCTATCTCCGCTATTGAGACCGCCAGAAAGGCTGTCGGTATCGCGAACGCCCTTGAAGGCCAGACTCTTCAGACGGCTTCGAATGGTGGAAGACCTTCAGGTGTCTTGAGCATCGGTGATCCAGAAGACGGCGTTGCTCTGGATGAAGATACCCGTGCCAAAATCATCGCACTTTGGAAGGACCGATTCTCATCGAATGGGGAAGGCGGTATCCTGATTTCATCTGGATATTCGACCGACTTCAAACCGATCCAACAGAACGCGGTTGATAGCCAACTTATCGAAAGCCGCAAGTATCAGGTCGAAGAGATAGCTCGCATCTTCCGGGTGCATCCGGCTTATCTGATGGCGTCCGGGACTATCACTCCCGAGATCCAACGGGCGCATGTCCGCAATACCCTCATGCCTTGGGTAGCTCGTTTTGAACAAGCATTAGCAGCGTCACTGCTCCAAGCCGAACCAAATCTGTTGTTTGATTTTGATGAGCACGAATTACTTCGCGGGGACCATTCTGCCCTAAAAGATTTCTTCGCATCAGTGACGGGCGTTGGTGGAAGTCCTGCAATCATGTCGGTCAACGAATGCCGTTATGAATTGGGCCTTGATCCTATTGCGGATGAATGGGCCAGAACTCCGCTCAAAGGCGGGTATGAAAACTCCGCTATTCAGAAAGAGGAAAGCAGCAAATGAGCGATATGGAATTCAAACACTTTAAATTCGAAATCAAAGCTGAAGATGATGGGGATATGACCGTATCGGGTTACGCTTCCATCTTCGGCAATCGCGATAGGGTCGGGGATGTTGTCGTTGCGGGGGCTTTCACTAAATCGCTCGCTTCTGGCCGCAGAGTCAAAATGCTTTGGCAGCATGACCCCCGCGAAGTTATTGGCGTTTGGGATGAAATGATTGAAGATGATAAAGGTCTATTCATCCAAGGCCGCTTTGCTAATACCGAAAAAGGAAAGGAGATTCGCGAATTAGCAATGATGGGAGCAATCGATAGCTTCTCTATTGGTTATCGGACCCTTGATTACGAGTATTCGAATAGCGGCGACCGACTGCTTAAAGAAGTTGATCTTTTTGAAGTCTCGCTTGTCACGTTTCCTGCAAACGAACTCGCCACTATCACTGCGGTCAAAACAGAATTCGATGAGGAACGCGAATTGCTTTCGCTAATGCTTAAGCGGATTGAACTGCTTACTGCGAAATAAGCCTGATTTTAACGTCCTACATAATGCATCAAAACGAATGAAACCATTCCATTTTGCAAAAAGGAAAGAAACATGACTATTGAACTTAAAAATGCCATTGAAGCTTCGAACCAGTTGATTCAAGCTATCCGTTCGGAAGTCGAAGGCGTTAAATCTGCCGACGCTCTGTTTGAAGGAAAAATGGCTCGCATGGAAGCCGAACTTGCGGCGTCTCTGTCGGCCAAGTCGGCTCTTGAAGCTCGCCTGAACGCTCTGGAAACCGCCGCCGCGCGTCCTGTGTCGGGTAAAGCTGCCGAAGCTGCCGATGAATGCAAGTCCGCGTTCCGCAACTGGCTTGCTAACCCGGAAAGCTTCGAAGCAAAACAAGCGTTCGAACAGAAGGCTCTTGCCACCACTGGCCTAGCGAACGTCATTCCGCGCACTGTTTCGGATGAAGTTATCGCTGCTGCTCGGGGTTATTCGGCTCTTGCTGGTCTTGCTAAGTATGTCGTTACCGGCACTTCGGAATTTGGCATCATGGTTTCGGGTGGTTCGGCTGTTACTCGCGGTGGGGAAACCACGGTTCGCGGCGAAAACACTACTTCGCTCGTTTCTAAAAAGCCGATCTGGACCGATGTGGGTTCGAACGTCGCCGTTACCAAGCATTCCGCTATGGACCTTCAAACGGACGTGATCTCGTTCATCGCTGATCAGTTTGCGGAAGACTTCGCTGCCGACATGGCCGATGGGTTTATCAACGGCACTGGCCTAAATGACGACCCGCAAGGCATCCTTACCGCTGGCATCGAACTGAGCGGCGCTTCGGTTACGCCGGAACTGATCATCGATCTTGCTTACAAAGTGAAGACGGTTGATCGCAACAACGGCGCTTACCTGATGGCTGGCACGACTGCTGCGGCTCTTTCGAAAGCCAAGGCTAACTCGCAATTCGTTCTCGAAATCAAAGAGGGCGTTACGATGATCAACGGTCGTCCTGTGCATGTTGACGATTACCTCCCGGAAGAGACGCCGGTTGTGTTCGGTAACTACAAGCGCGCCTTCCTGACTGCGGTTCGCGCCGAAGGTGTAACCGTCCAGATCAACCCGTATAAGCAATCGAACGTGATCTTCGTTGAAGGCAATCTTCGTTACGGTTCTGTTGTGCTTAACGGTGACGCTTACGCGAAGCTCGTTATCGCCTAATCTCGGCGACTAACATAAGGAAAATGGCCCGCTTCGGCGGGCTTTTTCTTTTTCTAAATACCGGAAAGATTATAAGGAATTGCGCAATGGTCGATTTAGAAAAATTCAAGTTTCATGCGCGGATAGACGATGACTTCGAGGACTCGTATATCCAGCTTCTTCTAGATGCCGCTATCAATTACGTCTCTAAAATTACTGGCGTTCCAAATGATGAGAATGCCCCGCCTGAATACGATTTGGCGATTATGATCCTTGCACTCCATTGGTATACAAACAGGGAAGTTTTATCGGATCGTGGAACCAATCAAGTCCCCTATGGCCTTGGAATGCTAATCGCCAATCTCAGAACGAATTGGACAATCTGAAATGCATACGGGACTATTGGATAAGCGGATTACGATCTTGTCGAGGCAAGTGATTGACAATGGATTATCGAAGAAAGAGACGTATGCGCCTTTGAAATCGGTATGGGCAAAGGTCTCTTATATCTCTGATGGGGAGCGTATTCAGGCTGCAATCGCTCAACGGCAAATCTCCATGCGGTTTGTAATTCGGCACTCAAAATCTCTGTCTCTGAATAACGAATATCGCATCCGCTACGATGGTCGAGATCTTGATATTGAGAACGTCAAAATCTCCGATGACCGGCAATGGATGGAAATTACTTGTGGGGCGCGGAAGTAAATGGCGAAGAACAAGTCCGGTATGGAGATCGATAAGAAGTCGTTTCGGGAACTCGAAAAGAACCTCATGTCCCTCGAAAAGTTGGCTACTCAAAAGCGGCTTATCAAGACTTCCATGAAGAAAGCAATGCAGCCGGTAGCTGATGCTGCACAGTCTGCCGCCCCGGTTGATGAAGGGGATCTTCGCGATTCTATTATCGTAACGGATAAACTCAACAAGACGCAGAAGAGGCTTGAGCGAAAAGAGGGCAAGCATTCGTTTGTGATGTATGCCGGGGCAGGTTCCCCCAAGGGGCATTTGCTAGAGTTTGGAACTGAGGAGACTTCCCCTCAACCCTATCTTCGCCCCGCGTGGCAAACTGAAAAAGAGAATGTGCTTAACATCCTGAAAGACGAAATGGCCGCAAGAATTCAAAAGGCCATTAAACGACAAGATCGGGCCAGAGCCAAGGCCGCGAAAGGATAAGACCAATGGAAAGACAACTTCGACAACTCATTACAGACGCATCCGGCATCGAAGTCCATTGGATTCGGGCTCCCGCAAATACACCGCGTCCATACATCAGGCTATCCCTTATCACCGAATTAGGGGATCATACGATGGAAGGTAGAACGCCATTGCGGCAATCGATGATTCAAGTTGATATTTGGGCCGAAGATATGGCCGATCTCTTGACTATTAAGAATGCGGTCATGGTTCTGGACGGTTACAACGATCAATCCGGCGATGATCCTATCAAGGTAATTCTTCTGGATAAGGTTAGATCGGGCGAAGACACTTCCAATCCGGCTGATATCGTGCTTCGTTATTCAATCGACTTTCGAGTTTTCCATTCTTAATGGGTCCGAAATCCCATCCTCTACATACCGCATAAAGGTATCAACTTAGAAGGATGGATTTAAATGACTATTGCAATTGGAATTGATGCTAAGGTCGAAATCGGTCGCGGCGCTACTCCGACTTGGACCGAATTGGCTTACGTTACGGATATCACGTTGCCTGAATTTACGCGCGATAAAATCGAAGTAACTCATCAGAAAAGTCCGAGTGGAAAGAAAGAATACATTCCCGGTCTAGGCGAATTTTCAGACATGACCGCTGCAATGCACTATGTTCCGGGCAGCGCAACTGATGACCTTTTGCTTGAACTTCAGGACTCCGGCGAGACTGTCAAAGTTCGCGTCACGCTCAAGGGCGAAGATGGTCCAGTTGTTTCGACTTATTCAGGCTTCATGCAGGGATATTCCAGAAACATCCCTGTGGGCGATGTGATGGAAGCCGAAGCTACCTTCAGCATTAACGCCTTGGTGGTTGGCCCGTAATCTCGACAACATCAATAACGAAGAGAACCCGGCTTAGGTCGGGTTTTTTATTTCACGAAATTGCATTTTATTTCTCAAATGGCCTGATTTGTTGCTTAAATATCACATTCGGAAATCTCGGTTTTTCTAAATATTTTATAACAATAAGATAACGGAGAAACCAAAATGCTTAAATCAAAACCATTGCCGAGCCAAGAACGCCTGAATGAAGTTTTCAAATACGATTCCGAAACCGGATATCTGATCCGCAAAGGGCAATCAGGTCATGCAGGCTGCATCAATAAACGAGGCTACTGGCAGGTCTATTGTGATGGGGTGCTCTATTACGCTCATCGTCTAATTTGGAAGATGCATTACGGCGACATCCCCAAAGGTATGACCATCGATCACATTTCGGGAGATCCGAGTGACAACCGCCTATCGAATATCCGACTGGCGAGAGTTGAAGAGAACAATCTGAATAGAAAAATTTATGCGAGTAACAAAAGCGGTTACGCTGGCATCTTTAAGTCAGGAAACAAATTCGTAGCCTCTATCGGTCGTGGAGGCGTCAATCACTATTTGGGGCGTTTTGATGATATCGAGACGGCGATTGCAGTCCGGGAAGCTGCAGCGATGAGACTTCATGGCGAATTTAGGAGGGTGCATTGAACATAAATAACGATGAGCTTGAATGTCAGGTCGAAAAGCATCCTGACATTGAGAGACTAAAGATTGAATCCGCAAGCGTGGATCGTCTGCTAGAGAATAACGAAAAACACATCTTGGAGAATAAGAATGAAAACGAATAGCAAAGGTATTTACACATTCCAGAACGGCATGAAGCTTGATTTGAACTTCAATGCTCTTGCGGAATACGAGTCCATGGTTGAGGGATCGAACGGACTCGAAGTTATGGCCCGCATGGCATCGGGCCAGCTTAATGTAACTGAATGGCGAACGCTGCTTTATTGCGGTCTCAAGGTGAATCAACCCGAAGTGACCCTAGAAGAAGCAGGGGAAATGTTGGCCATCTATTTCGAAGATCTGGTTGCTTCGATCAACAAGGATATCGATGAGGCTGCGGCAGAGGCGGGAAAGCCGAAGGCGGGGAGAAAGAAGGCGAAGTAACCCCGCCAACAATCGCTGATCTGTATGAGACCTATCTCATTTCCTATCCAAATACTCCCGATGCATTCTGGAATTTGAATCAGGTAACATTTTCGATCTTCATGAAGGCAGCAAGGAAGAAAGCAGAAACGCAATACGATAACGATATTCGAACTGCTTATTACACCGCCGTCCTAAGCCGTGTTGATAAGATGCCCAAACTCGAAACACTGCTCATCCGCCATAGGGAATTGAAGCCAAAGCTTTCACCTAAAGAGCGCAATAAGAAGTTGATCGCAGATATGAAGCGATATAACGATCATCTAATTAAGGCTGGAAACGTAATTGCAGAAGCCTCGGAGTAGATCCGGGGCTTTTTGTTTGGTCATACATAATGGAGTAATTTTTTCGCGAGGATTTCAATAATGTCTTTAGCATCAGTAGTCGGGGCTTTACGGGCGAACCTTTCACTCGATACAGCGGCTTTCGAGGCCGGTGGCCGCAAGGCTGTAAAAGGTGCGAAGAACCTTGAATCCTCGTTTACCAAAACTGCTGCGAAGATCGGCAAGGCCGCTGCGGCGATTGGAACGGCGATGGGTGCTGCCGTAACTACTGGCACGGTTTTGATGGCGAACAACGCCAAGGAAGTTGAACGTCAGGCCAATGCATTAGGTCTTTCTACTGAGGGATATCAAGCCCTTACTAGCGCTGCGAAACGATATGGTATCGAGGCAGACAAGGTGTCCGACATTCTGAAGGATGGCGCAGATCGACTCGGTGAATATCTCGCTACCGGTGGTGGTGAGATGAAGGACTTCATGGAAATGGTTGCGAAGCCGCAGGGAGTTTCCGCAAAGGATATACTCAGGCTGCCGCCCGAGCAACGTCTGATCGAGATCCAAAGCTTAATGGAGGGAATGAACCTTCCTCTAGAAACTCAGGTTTTCTTGTGGGAGAGTCTATCCGACGAGGCTTCGAAGCTCGCTCCTCTTCTGGCAAATAACGGTGCCGAATTCAAACGGATCAAAGCAGAACTGCTTTCGACCGGTCAAATTCTATCCGATCAAGTTATCAAAAACCTATCGACCTTCTCGGCAAACCTGTCGGGTATCGGAGGGATTGTCTCGGGATGGGGCAACATCTTCGTGTCCGGGTTCGCTGGCCCGCTGGCCGAAGTCTCCACTCGTCTTCAGGTCTTCCTAGCTCAGGCAGACGGTATCCGCACGGTCTTTGCCACGATGGGGCAGGCAGTTGGATCCAGCCTCATGGTTCTGGCAGACAACATCGGTCGGATTGCCACTTACGCCGCTACAGGAGCCGTTGGGCTGGCCGCCTACGGGGCTGCGACACTTGCCGCCAGTGTCTACACCCTTGGCCTTGCCGGGGCTCTGACGCTCATGCGGGCAGCATTGGTGCGGACTGGATGGGGAGCCCTAGCCGTCGTCATCGGCGAAATCGTTTATCAGACGCTCAACCTTGTGGACTCCGTTGGCGGGGTATCTAATGCCATGCAACTTGCCGGAACAATTTGGACCGGAGTCGAAAACACGATCAAGACAGGTCTGAGCGCCATTGGTTCGATGTTCGAGGGCGTCGGCAAAGTGATCTATGGCGCGTTTGCCTATGCCTTCGGTCAAGCGGAAGCTGCATTCGCCAAGCTGATGATGAAAATCGGGCAGGGCATCAACATAGCAATCGATGGCATAAACGCTATCAAACCCGGTGATAAAATTCCGAAAATCAACATTGGCGTCGAAATGTCCGGTGATGCTAAGGCATTAATTGCCGGGGCAAAGGCCACAATGTCAGAAGGAATGTCCGCTGTTACGAATGGCGCATCTTCGTTGGTCGATTCCGGTCAAGGATTTGCCGATGCAATGGCTAAAGCCCGGTCGGACTACGACGCTATGCAGGCCAAGACTAAAGCCGAACGTGATGCAAACGGTTCAAATCCGAACGTTCCGAAACTGCCTGATCTGAATACGAAAGGAATTGGCGATATCTCGGGCGGGGATGCTGCCGGTGGTGGTTCTGCGAAGCTTTCGAAATACCAAGAGACCATCAGGACTCTTAATGGCGATATCGAGCAACTGAAAAATACTATGGGCCAGACCGCTCTAGCCGAGCAAATTTGGAATGCCCAACGTCAGGCCGGGGTAACTGCATCGTCTCTACAGGGGCAGAAGATTGCCGGTTTGGTGACTGAGCTAGAACGGTTGAAAACTCAAGATGCTGCAATCGATAGGTTCAAGGGACAATTGGGAGACATGTTCTCCAGCATCGTCACTGGATCGGAGAGCGCAAGATCGGCCCTAAGCAACTTCTTCGCTTCTCTGGCAAGTAACCTCGCTAACGGGGCATTCACGTCATTCCTCGGCAGTCTTGGCGGTGGCGGTGGCTTCTGGGGAAGTGTTGCCGGGATCCTTGGAGGAGGAGCAATAGGAGCCAATGCGAATGGCACGAGTGGCTGGAGAGGCGGACTCTCATGGGTTGGAGAAAGAGGCCCTGAGCTTGTGTCCATGCCGCGTGGGGCACAGGTTCTCAGCAACCGTGAGTCCATGAACATGATGGGCGGGGCTGCTCAACGTGTTCACCTCGACCTGAGCATGAGTCCAGATCTTGAAGCTCGCATTCTAGATAAGTCTGCTCAAACCTCAGTCAAAATCACACAACAGGGCTTGAATCAGTATTCGAAAACGCAGCCCCAAAGAAACGCAATTATCAGCAAAGACCCGCGTAAGAGGTAAAAAATGACTGTTTATAACCTTCAAAACTTTCAATACAAACTCAAGATTCGAGACTACGCATTTAGCTTGTCCGAAAACTTAATTTCCAGCCAGTCCTATTCGGGTGCGATGACTGTCAATCAACTCGGCCCTCGTATCTGGATGGGATCGATTACTTGCTCTCCGACGAGTATCGAAGTTGCCCGCGAAATAGAAACTGCAATTCGTTCTATTCAGACTGCCGGAAACTATTTCGCATTCTGCCCTCCGAAATACGAATATCCGCAAGCTGATCCTACTGGTTCTAAATTAGGAAGTGCGACCGTAACGATTACTTCGCCAGTTGCCGGATACACCGTGAACCTTGCTGGTTTGCCTGCGGACTACAAGATCACGACAGGCGATTATTTCAGTGTGATCCAAAACGGCGTTTCTGTGATCTATCAATTTGCGGAAACTAAAACGGCAGCATTCAACGGAACGATATCAATCAAGATGTTGAACCCATTTCGTGCCAATTTCGTGCCAGCGAATGGTGCATCGGCATACCTCACTTATCCATGGGCGACATGCCGATACATTCCCGGATCGATGAAGACAGGCACGATTGGGCTAGATAGTGCGAGTGGCTATCAATTCGATTTCCAACAAGTATTTTCAACACAGTAAGGTGATCCATGCAATTGAATTCGACGTTACAGGCCGCATTATCCCGGACGACTATCCAGCCATATTGGTTTTTGTGGATTTCCGCGAAGAATAGGGAAACGGGAGCAATCGAGGAATCGGGGATTTGGAATGGTATTGGTCCGACGACCATTTCTGTTGCTGGTCAAACTCGAACTTATGCAGGAGCGGGCGGGCTGTTGAGTATCGATGATCTGGTTTACTCAAGCGGAACCAATATCCAGACGCAGAACGTTGCTCTGAGCATTCTAGATCCGAAAGTGGTGGAGGCTATTCGGGTGTATGACTGCACCTTCGCTCCTATTCAAATTCACCTTGGGCTAATTGACCCCGAGACCGAAGGCTTTCTAGGTGTCACCCCGGCATTTGAAGGCTTCATTGACTCAATCGATATCTCCACAGACACGAGTGAATCGACTGCAAGCCTTACTCTTGCATCTTCATTGAGAAATGGATCTAGACCGCTGTATTCTCGTCAGTCGGATGCCGATCAACGCAAACGTGATCCGAACGACAAAGGACGTCTGTATTCATCGGTTGCCGGAAACCAAACTGTATTCTGGGGTCGCAAAAAGAAAGGCGATAAGGAGGTTAATTCTTTTGCTGGCATCCTTATGGGGACCGTCTTGGATCACTTTAAGAAAAAGGTTCAGCCGCAGCCATAAGAAATAGACAATCTAAATACCTCGAATCTTCTTTGTTCGAGGTATTTTCATGTCAAGACTATCCGATTATAAAAGCCGACTGTTTGCGGTAATTCGAGATTACGAATTCGCAGAATTCCAATACGGCATTACAGACTGCGCCATGTTCGGTGGCGAATGTGTCGAAGCCATTACCGGGATAAATCCCGTTATTGAATGGAAGGGGCGTTATACAACGTCACTCGGCGGTCTTCGTGTTGCGAAGAAGAACGGTTACGAAAATCAAGCTGACTGGTTTGTAAAGAACGGATCGGAAATCCCGGTTGCCATGGCGCAATTTGGGGATTTGGCCTTGCTTGAAGGCGATGCTGCCGAGACCGGATGGACTGTAGGTTTAGTCGGCGGAAGCTTCATCCTCGCCATGAGTGAGAAAGGGCTAATTAGAGTCCCCTTGTTCAATGCGACTAAAGTATTTCGTTTCGGGGAGGATCAATAATGTCCTTCTCAATTATCAAAGCCGCTGCTCTATGCACTACTTCAATCACTCCGATTGACTCATTCACTTCAGATGTCCGCGAAGCTCATGCAGATCCGGTTACGGGTGCCATTGTCGCTATTACGGGATGGAGTGCCGCTACAGCCGCATTCATCGTGCAGGTCGGACTCTCTGTTGCGTTTTCGACCCTGAGCCGATTGCTTGCCCCAAAACCGACATTCGAGAGCGGGGGCATTCAAACAGACGTCACGCTTTCTGGTTCAAAGACGCCCCAAAAGATCATTCTTGGCTATTACGCTACTAACGGTTGTTTGGCCGTCGCTCCGCTTTCGCGGCATGGCTCCAATCGTAAGAAGCTGAATTACTGGTTGAACTACGTCACGGTGATTTCGGATCTCCCGATCAATGAACTTCTAGGCGTGATCGTTGATGGCAAAGAGCAAACGATCATCGATGACGGGAATAACGAGTTTGGACGTAAGGTCTCGGGCGATCTGGACGGGACAGCTAGATTCCGTTGGTATCTCGGAAACCAGACTGCCGCAGACGATTATCTGTTAGAGTCCTATGGCGAGCATCCTCAATTTCCTTGGACAACTCAACATATCCTTAAGGGCTGTGCTTATGTAATTTCTAGCTTCTATTACGATGATGAGCGTTACAGTCAGCTTCCTTCAACTCGTTTCATTGTAAAAGGTGCAAGTCTCTATGATCCGCGTAAAGACTCCTCAGTAGGGGGATCAGGTCCGCAGCGGTGGAATGACCGCTCGACTTGGACGTTTACGGCAAACCCAATCGTCATGGTCTACAACCTCATTCGAGGTATCACGATGCCGGACGGCCTTGTTTATGGGGTTCGGGCTCCCGCCGAAAAACTCCCATTGTCGTCTTGGTTCGCAGCGATGAACGTTTGCGACGAGAACGGGACGCAGCCGAATTCCACCGACTGGAAAGCAGACCGTAAACGTTATCAAGCTGGTATCGAGATCAGCCTCGATACAGAACCATTGGAAGTTATCGAAGAGTTGTTGAAAGCAGCCGGGGCCGAAATGGCGGAAGCTGGTGGGTATTTCTATGTCCGCGCCGGGGCTCCGGCTGCACCAGTTGCTCATATCACCGATGACGTGATCATTGTCTCCGATCCGATCTCGGAACGGCCATTCCAAGGGCTGAATGAGAGCTACAACACAGTTCGGGCAACCTATCCCGATCCGAAAGCCGTATGGGAATCGACTGAAGCCGAACCATTTAGTAAGCCGGAATGGGTGACTGCGGACCAAGGGCTTGAACTTGTGGCAGATATCAAACTGTCGGCAGTCCCGTTCCCGAGACAGGTTCGCAGATTGATGCGGGAAGCTGCGGCAGACCATCGTCGCAGACGTGTTCATACTCTTACACTGCCTCCAAGCTACCTCGGATTGAAGCCGTTGGACTCGATTTCATGGACTTCGCCTTCCAGATTCTATGATGCGAAGGTATTTGAAATCCAGCAAATTCGCATAAACCCGATGACGCTGAATGTTCAAATCGATATCAAAGAGCGGAATCCTGCGGATTACGACATTGACGCCGCTAACGACTCCATTGCGCCTTATTACCCGCCACCGCCCCCGGTTCCCCAAGAGGAAATCGGAATGGATGGATTGGCAGTTGCGGGAGTTGTCGTCAAAGACGCTGCCGGTAATGACCGGATGCCGGGGATTAAAGTCTCGTGGGATCTCGCAGACGACGGGACGTCTTATGACGCTGTGAATTACCAAGTTCGGCTCAAATCATCGGGGGCAGTTGTCGTCACTGGCACGACTCAAGCCACTGATGTTGGATCGGTTATTATTACTGAGTCACTTCTTCCGAATGAAATCTATCAGGTTCGAGCAAAGGTGCATGAGCCGGAATCGGGTGTCCGTTGGAGATATTCAAATTGGATTGATGTGACTACCCCGAATGTGCGGATTTCGCCAATTGATCTTGACGACACTATCTTCGATCAAATGAAGGAAACGGCAATTCGTCATGGTGTTAAACCCGTGACGACTCTTCCGGCTACCGGTCAACTCGATCAACTAGTTATGCAGGTTCCGACTGGCAAGCTTTACCGCTGGAACGGTTCTGCATGGGTATCTGAGCTTGTTGCTGCTCCGACTCCGGGCAGTGTCGATATTGCTTCTTTCGCATTGGGGATTGAGCCAATCACTCCATGGGCTGGTGCTGCACTTCCAACAACTAAGAAGACATCGCTGATTTTCTGGAAAGGTGAAACCTATAGATGGTCTAATGGTGCATACGTCAAGAACGTCAATGCTACGGATGTGATTGGCGAATTGGTTGCAGCACAAATCGCTGCCGGGGCAATCGGAACTAGGGAACTTGCCAGCCAATCGGTTGTTGCGTCGAAAGTTGCCATTAGCGACTTCTCGAACCTTGTTCCTGATGCAGACTTCTCGGAGTTTTTCAGCGGAGGGGCAACTTGGGTTGGAGGTGGCGGGCTTGGCCCGTTCCAAGCTTTCAGCGTGGCCGGTAATGCTGTCTGGGCAAGCCCTTATATTCTGCGGTTGAGCGTTAACGGGACAGGAGACACGAACTATTCTGGTGTTCAATCTTCTACCTTCGCCGTTACAGGCGGCACCGATTATCACGTTTCGGCTCTGCTCCAAACGCAAGGCTCAGGCGGTGCCAACGTTATCCTTCGTTTGGCGTTCTATAATAACGCTGGTGCTTTGCTCGGGCAAAGAAACGTTTATGCCGATACTGCTAACCTTGGTGTTGATCGAAAAACGGCTAACGTAACTGCTCATGCCAATGCGGCTTTCGCTCGGGTTCAGATGTATATCCGCAATGACTCCGTGGCTTCCTATGTCCAAATGGGTTCATTGGCCGTCCGTAAAGCAGCAGGAGGCGAATTGATCGTTGATGGAAGTGTTAAGGGGAATCACATTTCTTCTAACACGCTGACATCGAACCATCACACTACTGGTTCACTTCTGGCCGAGCATATCAATACCTCTAGCTTCAATGCCGCTGGATTGGCTGTCTTTAATAACGTGCTGCAATCTAATAACTTCGTTGCAAACACTTCGGGATGGCGCATTCAGCAAAATGGAAATGCTGAATTCAACAGTCTGAAAGTCCGTTCGGACATGATCGTGTCTGGTGCCGTATCGAAGACTTATTATCAGCAATTCCAGACATTCACGAAAGACGACACTTCGGTTTGGGAAGGTGTCATTCCTCTTGGATCTGGATTGAATTTCGCGGCGGCTGAAGATGACGGAGGAGCATTGCTTAACCCGATCAGAACTGAATTCTATTGTTCTCTTGGGTGTCGTTCATCGGTTTCGAGTTGCAGAATTGCAGTCACCTTGTGGGCTTCAACTGCGGCAAACCCGAATGCTTGGATTAACTTGTATGGGACGAACTCCGAATTCAATAGCATTTACTTGGCTTCGAAAGAATTGGAGGGTGGTGAGGCGCTGGCGGTGAATGGCCTTCTCATTGGAGGTGTATACTCAAGCATCTTATTCGAGCGCATTGCTAACCTGAAAGTTCGAGTTGTGATGGATACCGGTAAGGGTGTTGTTTACCGTCCGAAGGTGATTGTCAGTCAGATTAGCAGATAA
Protein sequences of DBSCAN-SWA_2 >NZ_CP051468|1126283:1142098|1129126_1129651_+|WP_011337911.1|head,protease|DBSCAN-SWA MSDMEFKHFKFEIKAEDDGDMTVSGYASIFGNRDRVGDVVVAGAFTKSLASGRRVKMLWQHDPREVIGVWDEMIEDDKGLFIQGRFANTEKGKEIRELAMMGAIDSFSIGYRTLDYEYSNSGDRLLKEVDLFEVSLVTFPANELATITAVKTEFDEERELLSLMLKRIELLTAK >NZ_CP051468|1126283:1142098|1127903_1129130_+|WP_011337912.1|portal|DBSCAN-SWA MFGFGKRESGSNQPTVISRISEAFGWWGGSSSIAPALSNTTAMQNPAVMCAVRTIAEGVASMPINIIETKEVDGLSKRTIRKDHWASKLINKPNAYQTRFEFVEMMISNAVLGKGALALKTVVGGEVRELLPIPSGIWEMEILTNGSYNFRVRFTDGSSRVFAAKDCLFFRGLSLDGYSSISAIETARKAVGIANALEGQTLQTASNGGRPSGVLSIGDPEDGVALDEDTRAKIIALWKDRFSSNGEGGILISSGYSTDFKPIQQNAVDSQLIESRKYQVEEIARIFRVHPAYLMASGTITPEIQRAHVRNTLMPWVARFEQALAASLLQAEPNLLFDFDEHELLRGDHSALKDFFASVTGVGGSPAIMSVNECRYELGLDPIADEWARTPLKGGYENSAIQKEESSK >NZ_CP051468|1126283:1142098|1130856_1131198_+|WP_011337909.1|head,tail|DBSCAN-SWA MARFGGLFLFLNTGKIIRNCAMVDLEKFKFHARIDDDFEDSYIQLLLDAAINYVSKITGVPNDENAPPEYDLAIMILALHWYTNREVLSDRGTNQVPYGLGMLIANLRTNWTI >NZ_CP051468|1126283:1142098|1133616_1133949_+|WP_023003646.1|DBSCAN-SWA MKTNSKGIYTFQNGMKLDLNFNALAEYESMVEGSNGLEVMARMASGQLNVTEWRTLLYCGLKVNQPEVTLEEAGEMLAIYFEDLVASINKDIDEAAAEAGKPKAGRKKAK >NZ_CP051468|1126283:1142098|1133004_1133490_+|WP_011337905.1|DBSCAN-SWA MLKSKPLPSQERLNEVFKYDSETGYLIRKGQSGHAGCINKRGYWQVYCDGVLYYAHRLIWKMHYGDIPKGMTIDHISGDPSDNRLSNIRLARVEENNLNRKIYASNKSGYAGIFKSGNKFVASIGRGGVNHYLGRFDDIETAIAVREAAAMRLHGEFRRVH >NZ_CP051468|1126283:1142098|1137331_1138006_+|WP_023003640.1|DBSCAN-SWA MQLNSTLQAALSRTTIQPYWFLWISAKNRETGAIEESGIWNGIGPTTISVAGQTRTYAGAGGLLSIDDLVYSSGTNIQTQNVALSILDPKVVEAIRVYDCTFAPIQIHLGLIDPETEGFLGVTPAFEGFIDSIDISTDTSESTASLTLASSLRNGSRPLYSRQSDADQRKRDPNDKGRLYSSVAGNQTVFWGRKKKGDKEVNSFAGILMGTVLDHFKKKVQPQP >NZ_CP051468|1126283:1142098|1136678_1137323_+|WP_011337903.1|DBSCAN-SWA MTVYNLQNFQYKLKIRDYAFSLSENLISSQSYSGAMTVNQLGPRIWMGSITCSPTSIEVAREIETAIRSIQTAGNYFAFCPPKYEYPQADPTGSKLGSATVTITSPVAGYTVNLAGLPADYKITTGDYFSVIQNGVSVIYQFAETKTAAFNGTISIKMLNPFRANFVPANGASAYLTYPWATCRYIPGSMKTGTIGLDSASGYQFDFQQVFSTQ >NZ_CP051468|1126283:1142098|1129723_1130833_+|WP_011337910.1|capsid|DBSCAN-SWA MTIELKNAIEASNQLIQAIRSEVEGVKSADALFEGKMARMEAELAASLSAKSALEARLNALETAAARPVSGKAAEAADECKSAFRNWLANPESFEAKQAFEQKALATTGLANVIPRTVSDEVIAAARGYSALAGLAKYVVTGTSEFGIMVSGGSAVTRGGETTVRGENTTSLVSKKPIWTDVGSNVAVTKHSAMDLQTDVISFIADQFAEDFAADMADGFINGTGLNDDPQGILTAGIELSGASVTPELIIDLAYKVKTVDRNNGAYLMAGTTAAALSKAKANSQFVLEIKEGVTMINGRPVHVDDYLPEETPVVFGNYKRAFLTAVRAEGVTVQINPYKQSNVIFVEGNLRYGSVVLNGDAYAKLVIA >NZ_CP051468|1126283:1142098|1132419_1132839_+|WP_023003648.1|tail|DBSCAN-SWA MTIAIGIDAKVEIGRGATPTWTELAYVTDITLPEFTRDKIEVTHQKSPSGKKEYIPGLGEFSDMTAAMHYVPGSATDDLLLELQDSGETVKVRVTLKGEDGPVVSTYSGFMQGYSRNIPVGDVMEAEATFSINALVVGP >NZ_CP051468|1126283:1142098|1126283_1127864_+|WP_017140232.1|terminase|DBSCAN-SWA MSDQSLPVLSSAQPVIPPKPKLSRAEKNIKWIESNLFIPEGKDVGKPFRLVDFQKDIIRSIYDNPAGTRRAIISMPRKAAKTTLCAALMLLHLVGREALPNSQLYSAARSRDQAAELFKLAVKMIRMSPRISRFVRIVETSKRLKVPELGTEYRALSKDAGTAQGLSPCLVIHDELGQVRGPVDPLYEALELATAAQANPLTLVISTQAPTDNDLLSQLIDDAATGADPTKVLKLYSCPMNIDPFSEEALAVSHPAWNSFVNRKELKQMQAEAARMPARAADFRNYTLNQRIEVNAPFISKDVWDEGKDNPEEWHGKDVWLGLDLSETRDLTSLTLAHKDENGLLHVHPFFWLPDEGIEDKSRSDKVPYDIWAKGGLIHLSQGRTIQYKDVAAKLKEIADNANVQKVAFDRYKIKYFKRDMIDCGFDERWIDEHMVSYGQGFVSMGIGINELERLILDGKIRHGNNPVMNMCMANVKVVSDTSNNRKFIKHTSTRRIDGAVTLAMLAGMLADPDNKPKPKRKALFA >NZ_CP051468|1126283:1142098|1138055_1138481_+|WP_023003639.1|DBSCAN-SWA MSRLSDYKSRLFAVIRDYEFAEFQYGITDCAMFGGECVEAITGINPVIEWKGRYTTSLGGLRVAKKNGYENQADWFVKNGSEIPVAMAQFGDLALLEGDAAETGWTVGLVGGSFILAMSEKGLIRVPLFNATKVFRFGEDQ >NZ_CP051468|1126283:1142098|1131235_1131517_+|WP_162129770.1|head|DBSCAN-SWA MSRQVIDNGLSKKETYAPLKSVWAKVSYISDGERIQAAIAQRQISMRFVIRHSKSLSLNNEYRIRYDGRDLDIENVKISDDRQWMEITCGARK >NZ_CP051468|1126283:1142098|1134353_1136675_+|WP_023003642.1|DBSCAN-SWA MSLASVVGALRANLSLDTAAFEAGGRKAVKGAKNLESSFTKTAAKIGKAAAAIGTAMGAAVTTGTVLMANNAKEVERQANALGLSTEGYQALTSAAKRYGIEADKVSDILKDGADRLGEYLATGGGEMKDFMEMVAKPQGVSAKDILRLPPEQRLIEIQSLMEGMNLPLETQVFLWESLSDEASKLAPLLANNGAEFKRIKAELLSTGQILSDQVIKNLSTFSANLSGIGGIVSGWGNIFVSGFAGPLAEVSTRLQVFLAQADGIRTVFATMGQAVGSSLMVLADNIGRIATYAATGAVGLAAYGAATLAASVYTLGLAGALTLMRAALVRTGWGALAVVIGEIVYQTLNLVDSVGGVSNAMQLAGTIWTGVENTIKTGLSAIGSMFEGVGKVIYGAFAYAFGQAEAAFAKLMMKIGQGINIAIDGINAIKPGDKIPKINIGVEMSGDAKALIAGAKATMSEGMSAVTNGASSLVDSGQGFADAMAKARSDYDAMQAKTKAERDANGSNPNVPKLPDLNTKGIGDISGGDAAGGGSAKLSKYQETIRTLNGDIEQLKNTMGQTALAEQIWNAQRQAGVTASSLQGQKIAGLVTELERLKTQDAAIDRFKGQLGDMFSSIVTGSESARSALSNFFASLASNLANGAFTSFLGSLGGGGGFWGSVAGILGGGAIGANANGTSGWRGGLSWVGERGPELVSMPRGAQVLSNRESMNMMGGAAQRVHLDLSMSPDLEARILDKSAQTSVKITQQGLNQYSKTQPQRNAIISKDPRKR >NZ_CP051468|1126283:1142098|1131987_1132356_+|WP_023003649.1|DBSCAN-SWA MERQLRQLITDASGIEVHWIRAPANTPRPYIRLSLITELGDHTMEGRTPLRQSMIQVDIWAEDMADLLTIKNAVMVLDGYNDQSGDDPIKVILLDKVRSGEDTSNPADIVLRYSIDFRVFHS >NZ_CP051468|1126283:1142098|1138636_1142098_+|WP_162791925.1|DBSCAN-SWA MQVGLSVAFSTLSRLLAPKPTFESGGIQTDVTLSGSKTPQKIILGYYATNGCLAVAPLSRHGSNRKKLNYWLNYVTVISDLPINELLGVIVDGKEQTIIDDGNNEFGRKVSGDLDGTARFRWYLGNQTAADDYLLESYGEHPQFPWTTQHILKGCAYVISSFYYDDERYSQLPSTRFIVKGASLYDPRKDSSVGGSGPQRWNDRSTWTFTANPIVMVYNLIRGITMPDGLVYGVRAPAEKLPLSSWFAAMNVCDENGTQPNSTDWKADRKRYQAGIEISLDTEPLEVIEELLKAAGAEMAEAGGYFYVRAGAPAAPVAHITDDVIIVSDPISERPFQGLNESYNTVRATYPDPKAVWESTEAEPFSKPEWVTADQGLELVADIKLSAVPFPRQVRRLMREAAADHRRRRVHTLTLPPSYLGLKPLDSISWTSPSRFYDAKVFEIQQIRINPMTLNVQIDIKERNPADYDIDAANDSIAPYYPPPPPVPQEEIGMDGLAVAGVVVKDAAGNDRMPGIKVSWDLADDGTSYDAVNYQVRLKSSGAVVVTGTTQATDVGSVIITESLLPNEIYQVRAKVHEPESGVRWRYSNWIDVTTPNVRISPIDLDDTIFDQMKETAIRHGVKPVTTLPATGQLDQLVMQVPTGKLYRWNGSAWVSELVAAPTPGSVDIASFALGIEPITPWAGAALPTTKKTSLIFWKGETYRWSNGAYVKNVNATDVIGELVAAQIAAGAIGTRELASQSVVASKVAISDFSNLVPDADFSEFFSGGATWVGGGGLGPFQAFSVAGNAVWASPYILRLSVNGTGDTNYSGVQSSTFAVTGGTDYHVSALLQTQGSGGANVILRLAFYNNAGALLGQRNVYADTANLGVDRKTANVTAHANAAFARVQMYIRNDSVASYVQMGSLAVRKAAGGELIVDGSVKGNHISSNTLTSNHHTTGSLLAEHINTSSFNAAGLAVFNNVLQSNNFVANTSGWRIQQNGNAEFNSLKVRSDMIVSGAVSKTYYQQFQTFTKDDTSVWEGVIPLGSGLNFAAAEDDGGALLNPIRTEFYCSLGCRSSVSSCRIAVTLWASTAANPNAWINLYGTNSEFNSIYLASKELEGGEALAVNGLLIGGVYSSILFERIANLKVRVVMDTGKGVVYRPKVIVSQISR >NZ_CP051468|1126283:1142098|1131517_1131982_+|WP_023003651.1|DBSCAN-SWA MAKNKSGMEIDKKSFRELEKNLMSLEKLATQKRLIKTSMKKAMQPVADAAQSAAPVDEGDLRDSIIVTDKLNKTQKRLERKEGKHSFVMYAGAGSPKGHLLEFGTEETSPQPYLRPAWQTEKENVLNILKDEMAARIQKAIKRQDRARAKAAKG |
16 | Paracoccus_phage(44.44%) | tail,protease,terminase,portal,capsid,head | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1693033 : 1714552
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP051468|1693033:1714552|DBSCAN-SWA GATGACCTACCATCCCAAATCCGATTTCCTGCGCGTGATGCAGGAGCGCGGCTACCTCGCCGACTGCACCGACATGCAGGCACTGGACGAGGCCCTGTCGAAAGGGGTCGTTCCTGCCTACATCGGCTATGATGCGACGGCGGCCTCGCTGCATGTGGGGCACCTTCTCAACATCATGATGCTGCGCTGGCTGCAGAAGACCGGGCACAAGCCGATCACCCTGATGGGCGGCGGCACGACGAAGGTGGGCGATCCCTCCTTCCGCTCCGAAGAGCGTCCGCTGCTCACGCCCGACAGGATCGACGAGAATATCGAAGGCATGCGCAAGGTCTTCGCGCGCTACCTCTCCTACGGCGAGGGCGCGACCGACGCGCTGATGCTCAACAATGCCGAATGGCTGGACCAGCTGAACTACCTCGATTTCCTGCGCGACATCGGGCGGCATTTCTCGGTCAACCGGATGCTCTCGTTCGAAAGCGTGAAGAGCCGGCTCGACCGCGAACAGTCGCTGTCGTTCCTCGAATTCAACTACATGATCCTGCAGGCCTACGACTTCCTCGAGCTGTTCCGCCGCACCGGCTGCCGGCTGCAGATGGGCGGGTCGGACCAGTGGGGCAACATCGTCAACGGGATCGACCTGACGCGCCGCGTGCTCGAAGGCGAGATCTTCGGGCTGACCTCGCCGCTGCTCACAACCTCGGACGGGCGCAAGATGGGCAAGTCCGCGGGCGGCGCGGTCTGGCTCAACGGCGAGATGCTGGCGCCCTACGACTTCTGGCAGTTCTGGCGCAACACGACCGATGCGGATGTGGGCCGGTTCCTCAAGCTCTACACCGAGCTGCCCGTCGAGGAGTGCGACCGGCTGGGCGCGCTGCAGGGCTCGGAGATCAACGCGGCCAAGATCCTGCTCGCGAACGAGGTGACGACGCTCCTCCACGGCCGGGACGCCGCCGAAGCGGCCGAGGCCACCGCGCGCGCGGTCTTCGAGGAGGGCGGCGTGGGCGGCGCCCTCGAAGTGGTGGAGCTTCCGGCCGCGACGCTGGGCGAGGGCCTCTCGGTCGCACATTTCCTCGTGGCCGCAGGCCTCGTCGCCTCCGGCAAGGAGGCCAAGCGCCTCGTGGCCGAGAACGGGCTGCGCTTCAACAACGAGCCGGTGGGCGATGCCAACACCCCGGTCACGGCCGCGACCGTGGGCGAGGAGCTGAAGGTCTCGATCGGCAGGAAGAAGCACAAGCTCGTCCGTCTCTCCTGACGGCCCGCGCGCGGCGGCCGTCCGCCGCTCTGCCCGGTCTCTCCGCCCTGCCGGGGCGCGGTGCTCGGGCCGCTGCGCTGCGCCCCGCGCCGCTCGACCGGCCCCCGGCGAGCCGCGCGGCCTTGCAGCCAACCATCCTGCCGGGCCGGACGCAAAAAGGACGCCTTCCGGCGCCCTTTCACTCTGCCCAAATATCCTCGGGGGGTGCGGGGGGCAGACAGCCCCCCGTCCGACCGCGGCCTCACTCGGCCACGGTCACCTTCGCCATCGCATCCGGCGTCCGCGGAGGCTCGCCGCGGGCGATCCTGTCGACCACATCCATGCCCTGCACCACGCGGCCCACCACCGTATACTGGCCGTTCAGGAACTCGCCCGGCGCGAACATGATGAAGAACTGCGAATTGGCCGAGTTCGGATCGTCCGAGCGCGCCATGCCCACCACGCCGCGCGTGAAGGGCACGTCCGAGAATTCGGCCGGAAGGTCGGGATAGCTCGACCCGCCCGTGCCGGCGCGGCGCAGGTTGTTCCCCTTGCCGAATTCCACGTCGCCGGTCTGGGCCATGAAGCCCTCGATCACGCGGTGGAAGACCACGCCGTCGTAGGCGCCCTCGCGGGCCAGCGTGACCATGCGCTCGACATGGGCGGGCGCCACGTCGGGCAGGAGGTCGATCACCACCTCGCCCTCGGCCTGTCCCGTCAACTCGATCACGAGGTTCGGCCCCGGCCCGTCCTCGGCCGCCTGCGCGAAGACCGCCGATTGCGAGAGGCCGTAGCCGCCGAGCACCGCGAGGCCCAGCGACATCAGCCCGGCGAAGAGAAATTTCTCAGACGACATCGGCCGCCACCCGCACCGTCAGCATCTTGTCCGGGTTCGCCGGCGGCTCGCCGCGGGCGATCTTGTCCACATGCTCCATCCCCGAGATCACGCGGCCGTAAACGGTGTACTGGCCGTTCAGGAAGTCGTTGTCGCGGAAGTTGATGAAGAACTGCGAATTGGCCGAGTTCGGGTTCTGCGACCGGGCCGCACCGAGCGTGCCGCGCGCGTGCGGCAGCTTCGAGAATTCCGCCGGCAGGTCCGGCATGTCCGACCCGCCCGTCCCGGCGCGGCGGATGTTGAAGCCGTCTTCCGTGTTTCCGTTCGCGACATCGCCGGTCTGGGCCATGAAGCCCTCGATCACGCGGTGGAAGACCACGTTGTCATAGGCCTTGGCGCGGGCCAGGGCCTTCATCCGTTCGCTGTGCTTCGGGGCGACGTCCGACAGAAGCTCGATCACCACCTCGCCGTCCTTCAGCGTCATGATGATGGTGTTCTCGGGATCCTTGATCTCGGCCATGGGGCCCTCCTCTTGCAATCGGGGTCAACCTAGTGGCCGCGCGCGGTCAGGAAAAGGTCTGAATTCGCGCCCCTGCGGCCGAACGCCGGTTGACGGGCGCACGTCGATCGGCTTGGAAGCGGGAAACGTCGCTTCAGGAGGAACCGGAATGGGCTGGAAGACACTCGACGACATGGATCTTGCCGGCAAGGTCGTGCTGGTGCGCGTGGATGTGAACGTGCCGATGGAAAATGGCGAAGTCACCGACGCCACCCGGATCGAGAAGATCGTCCCCACCGTCGAGGATATCCTGAAGAAGGGCGGCAAGCCCGTCCTGCTCGCCCATTTCGGCCGTCCGAAGGGCAAGGTCGTGGACGAGATGAGCCTCCGCCTCGTGCTGCCCGCGCTGCAGAACGCGCTGCCTGGCACCAAGGTGAGCTTTGCCGCCGACTGCGTGGGCCCCGAGCCCGAGCAGGCGGTGGCCGCCATGCTCGAGGGCGAGGTGCTCCTCCTCGAGAACACCCGCTTCCATGCCGGCGAGGAGAAGAACGACCCCGAGCTGGCCGCCGCGATGGCGAAGCTGGGGCAGGTCTATGTCAACGATGCCTTCTCGGCCGCGCACCGCGCCCATGCCTCGACCGAGGGCCTCGCCCGTCTTCTGCCCTCGGCCGCCGGCCGGCTGATGGAGGCCGAGCTGAAGGCGCTCGAAGCCGCTCTCGGCCATCCCGAGCGCCCCGTTGTGGCCGTGGTGGGCGGGGCCAAGGTCTCGACCAAGCTCGACCTTCTGGGCAATCTCGTGGGCCGGGTCGATCATCTGGTGATCGGCGGCGGCATGGCCAACACCTTCCTCGTGGCGCAGGGGATCGAGGTCGGCAAGTCGCTGGCCGAGCGCGACATGGCCGATACGGCGCGCGAGATCCTCTCCAAGGCGAAGGCCGCGGGCTGCACGATCCATCTTCCGCTCGATGTGGTGGTGGCGCGCGAGTTCAAGGCGGGGGCCGCGAACGAGACGGTCGAGACGGCGGCCTGCCCGGCCGACGCGATGATCCTCGATGCCGGTCCGAAGACCGTGGCCGCCCTCTCCGAAGTGTTCGCCTCGGCTAAGACGCTGATCTGGAACGGCCCGCTCGGCGCCTTCGAGATCGAGCCCTTCGACGCCGCGACGAATGCGGCGGCGCTTCAGGTGGCGCAGCTCACCAAGGCGGGCCAGCTCATTTCGGTCGCGGGCGGCGGCGATACGGTGGCCGCCCTCAACAAGGCGGGCGCGGCCGAAGGCTTCTCCTACATCTCGACGGCGGGCGGTGCCTTCCTCGAATGGATGGAGGGCAAGGAGCTGCCCGGAGTGGCCGCGCTCACGGTCTGAGGCGAATTTCCGCGCCGCAGCGCCAAGTTAGCGCGAACAACATTGATATGCGCATTTCGCCGGTCTAAGTCCCGTGGGACCCAATTGAAAGGACCCGCGGAATGCGCGCTAGCAGAGCAGTTCAGAAGATCCTCGCCAACTATGAGGGCGAAACGCCCGGCGTGAAGGCCAACCTGTGCCGGATGCTGATGGAGGGGAAACTGGGCGGCACGGGCAAGATGATCATCCTGCCGGTCGACCAGGGCTTCGAGCACGGCCCGGCCCGCACCTTCGCGCCGAACCCGGCGGGGTACGATCCCCACTACCACTACCAGCTCGCCATCGATGCGGGACTCAGCGCCTATGCGGCCCCGCTCGGCATGCTGGAAGCCGGCGCCGACACGTTCGCGGGCCAGATCCCGACCATCCTGAAGGTGAACTCGGCCAACTCGCTGATGAGCGACACCGCGGGCAAGAACCAGGCGGTCACGGCGTCGGTCGACGATGCGCTGCGGCTCGGCTGCGCGGCCATCGGCTTCACGATCTACCCGGGCTCGGATGCGCAGCTCGACATGTATGAAGGGATCGTGGCCATGCGGAAGGAAGCCGCGGCCAAGGGGATCGCCACCGTGATCTGGTCCTATCCGCGCGGCGAGGCGATCAGCAAGGACGGCGAGACGGCGATCGACGTGGCGGCCTATGCCGCGCAGATCGCGGCCCTGATCGGCGCCCATATCATCAAGATCAAGCTTTCGACCGATCATCTGATGCTGGGCGAAGCGAAGAAGGTCTACGAGGCGCAGCAGATCGACGTCTCCACCCAGGCCGCGCGCGTGAAGCACTGCATGGATTCGGCCTTCGCCGGCCGCCGCATCGTGGTCTTCTCGGGCGGCGCCAAGAAGGGCGAGGATTCGGTCTATGACGACGCCCGTGCGATCCGCGACGGCGGCGGGAACGGCTCGATCATCGGCCGCAACAGCTTCCAGCGCAGCCGCGAGGACGCGCTCGCGATGCTCGGCAAGCTCGTCGACATCTACAAGGGCCGCGCCTGAGGCCTGCCCTGCCGGCCCTTCCCCTCGGGGGAGGGCCAACCCCGCCCTTCGAAGGGCCCCTTCCGGGCTTTTCGCTCCTCCGCACGCCAGAATGCCCCGGCCCGACCGATAAGCCCCGCCATCGATCTTCCCGCGGTCGGCAGATGGCCGACCTACGCCGGGATCGCCTCTCATGCCGCCACGGCCAGCGCGCGTTTTCCGTTTCCCAAGTACGGCCCGCTCTGGCATGATGCGCGCCATCGCCCCCGCCGAACGAGGGGCGTGACCAGAGGCAGAGCATGAACATCCCGACCTCCCGCCCCGCCATCGGCGCCAGCCTCTATTTCGCGCTGGCCTTCTCGCTCTCCGTCTATTTCACCTTCGCCGCGGTTCAGGGCGACTATGGTGTCTTCCGGCAAGTGCAGATCCAGGCCGAGGCCGAGGCGCTGCGCAGCGAGCGCGACCGGATCGCCGCCGAGCTGGCCGATATGGAGAACAGGACGCGCCGGCTGTCGGACAGCTATCTCGATCTCGACCTTCTGGACGAGCAGGCCCGCTCGGTGCTGGGCTATGTCCGCGCCGACGAGATCGTCATCCGCTGACACGCGCGGAAACCGCCGGCCCTTTCCTGTCGTTTGTCTTCCTGCATCCTCGCTTCGGGGCGCGATCCGTGCCGCGTCCGCGGCAGCGGCGCGCGATGATGGCCCCGGAGGGGCCTGAAGAGCGCGCCCCCGCCGGGCCGCAGATTCGCTCTCGCCCCCGCTTCTGGGATGCGCTAAAGGTAGTTTAGCCTTGAACTACTTGTCTCGACGGGAGGCGCCGCCATGGCCACCAGAAAATCGCCGGAGCAATCCAACGCATCGAAGGAGGAGCTTGTCCGTTACTACCGCGAGATGCTCCTGATCCGCCGCTTCGAAGAGAAGGCGGGCCAACTCTACGGCATGGGCCTCATCGGCGGCTTCTGCCATCTCTACATCGGCCAGGAAGCCGTGGTGGTCGGCCTCGAAGCCGCCGCCAAGGAGGGGGACAAGCGCATCACCTCCTACCGCGACCACGGCCACATGCTGGCCTGCGGCATGGATGCCAAGGGCGTGATGGCCGAGCTCACGGGCCGCGAGGGCGGCTATTCGAAGGGCAAGGGCGGCTCGATGCACATGTTCTCGAAAGAGAAGCATTTCTACGGCGGCCACGGCATCGTGGGCGCCCAGGTGCCGCTCGGCGCGGGGCTGGCCTTCGCCGACCGCTATCTCGGCAACGACAATGTCACCTTCACCTATTTCGGCGACGGTGCCGCGAACCAGGGCCAGGTCTACGAGGCCTACAACATGGCCCGGCTCTGGAGCCTGCCGGTGATCTTCGTGATCGAGAACAACCAGTATGCGATGGGCACCAGCGTGAAGCGCTCGACGAAATCGCCCTCGCTCTGGGAGCGCGGCGCGGCCTACGGCATCAAGGGCGAGTCGGTGGACGGCATGGATGTGCTGGCCGTGAAGGCCGCGGGCGAGAAGGCGGTCGCCGCCTGCCGCGCGGGCCAGGGGCCCTACATTCTCGAGATGATGACCTACCGCTACCGGGGCCACTCCATGTCCGACCCGGCCAAATATCGCACCCGCGAGGAAGTCCAGCGGATGCGCGACGAGAAGGACGCGATCGAACATGTCCGCGACCTGCTGATCCAGGGCAATCTCGCGACCGACGACGACCTCAAGGCGATCGACAAGGAGATCAAGGCCGTGGTGAACGAGGCCGCCGACTTCGCCAAGGAGAGCCCCGAGCCCGCGCTCGAGGAACTCTGGACCGACATCTACGCCTGAGGAGTAAAGACGCATGGCAACCCAGGTTCTGATGCCCGCCCTGTCGCCGACGATGGAGGAAGGCACGCTCGCCAAATGGCTGGTGAAGGAAGGCGATGCGGTCAAGTCCGGCCAGATCATCGCCGAGATCGAGACCGACAAGGCCACGATGGAATTCGAGGCCGTCGACGAGGGCACGGTGGGCAAGCTGCTCGTGGCCGAGGGCACGTCCGGCGTGAAGGTGAACACGCCCATCGCCGTTCTGGTCGAGGAGGGGGAGAGCGCCGACGAGGTGCAGGCTCCGGTCCCGACCCAGAAGGAAAAGCAGCCCGAGCCCGCCGAAGCCTCGGAAGGCAAGGCCGTGGACGAGCCGCTCGTCTCCTCGCCGGGCGCCCCGGTGCCGGGCAAGCGCGACCGCTCGCCCGACTGGCCGGACGGCACGCAGATGAAGACCATGACGGTGCGCGAGGCGCTCCGCGAGGCGATGGCGGAAGAGATGCGCGGCGACGAGCATGTCTTCCTGATGGGCGAGGAAGTGGGCGAGTATCAGGGCGCCTACAAGATCAGCCAGGGCCTTCTGGACGAGTTCGGCGACCGGCGCGTGGTCGACACGCCGATCACCGAGCATGGCTTCGCCGGCATCGCGGTCGGCGCGGCCTTCGGCGGGCTCCGCCCCATCGTCGAGTTCATGACCTTCAACTTCGCCATGCAGGCGATCGACCAGATCATCAACTCGGCCGCCAAGACGCTCTACATGTCGGGCGGTCAGATGGGCTGCCCCATCGTGTTCCGCGGCCCGAACGGCGCCGCCGCGCGCGTGGGCGCCCAGCACAGCCAGGATTATGCGGCGTGGTATGCGCAGATCCCCGGCCTCAGGGTGGTGATGCCCTATTCGGCGGCGGATGCGAAAGGTCTTCTGAAGACCGCGATCCGCGACCCGAACCCGGTGATCTTCCTCGAGAACGAGATCCTCTACGGCCGGTCCTTCGAGGTGCCGGTGATGGACGATTTCACGATCCCCTTCGGCAAGGCCCGGATCTGGCGCGAGGGGACGGATGTCACCATCGTCTCCTTCGGCATCGGCATGACCTATGCGCTGGAAGCGGCCGACAAGCTCGCCGCCGAGGGCATCTCGGCCGAGGTGATCGACCTGCGCACGCTGCGCCCCATCGATTACGAGACGGTGATCGAGTCGGTGAAGAAGACCAACCGCTGCATCACCGTCGAGGAGGGCTGGCCGGTGGGCTCCATCGGCAACCATCTCGCCGCGACGATCATGCAGCAGGCCTTCGACTGGCTCGATGCGCCGGTGCTGAACCTGACGGGCAAGGACGTGCCGATGCCCTATGCCGCCAATCTCGAGAAGCACGCGCTCGTGACCACGGCCGAGGTGGTCGAGGCCGCGAAATCCGTCTGCTACCGCTGAGGAGAAGCCCATGGCAACCGAGATCCTGATGCCCGCGCTGTCTCCGACGATGGAGGAGGGGACGCTCGCGAAATGGCTGAAGAAGGAAGGGGATGAGGTCCGCTCGGGCGACATCATCGCCGAGATCGAGACCGACAAGGCCACCATGGAGTTCGAGGCGGTCGACGAGGGCATCCTCGGCAAGATCCTGATCGCCGAGGGCACGGCAGGCGTGAAGGTCAACACGCCCATCGCCGTGCTGGTGGAAGAGGGCGAGAGCGTGGACGCCGTGTCCTCCGCCAAGGTGCCGGAGCCGCAGGAACCGGCCGACGAGGCCGCACCCGCGCAGGGGGCTCCGAAGGAGGCCCCTGCCCCGGCCGCCAAGGCGCCCGCGGCGCAGGCGGCCCGATCCGAGGGAGAGCGCGTCTTCGCCTCGCCGCTCGCCCGCCGGATCGCCAAGGAGAAGGGGATCGACCTTGCCGCGGTGCAGGGCTCGGGCCCGCGCGGCCGGATCGTGAAGGCCGATGTCGAGGGGGCGCAACCCTCGGCCGCTCCCGCCGCCAAGGCGGACGCCGCGGCACCGAAGGCAGAAGCGCCCGCCGCTGCGGCCGCGCCCGTCGCCGCGCCGGCCGCCTCCGCGGCTTCGGTGGCGAAGCTCTTCGCGGATCGCGACTATGAGGAAGTGACCCTCGACGGGATGCGCAAGACCATTGCCGCGCGTCTGTCCGAGGCCAAGCAGACCATCCCGCACTTCTACCTCCGGCGCGAGGTGGCTCTGGATGCGCTGATGGCTTTCCGCGCCGATCTCAATGCGAAGCTCGAGAGCCGGGGCGTAAAGCTCTCGGTCAACGACTTCATCATCAAGGCCTGTGCGGTGGCGCTCCAGCAGGTGCCGAACGCGAATGCCGTCTGGGCCGGAGACCGGATCCTGCGGCTGAAGCCCTCGGACGTGGCGGTGGCCGTGGCGATCGAGGGCGGGCTCTTCACGCCGGTCCTGCGCGATGCGCACCAGAAGAGCCTGTCGGCGCTGTCGGCCGAGATGAAGGATCTCGCCGCCCGCGCCCGCACGAAGAAGCTCGCACCGCACGAATATCAGGGCGGCAGCTTCGCGATCTCGAACCTCGGCATGTTCGGGGTCGAGAATTTCGATGCGGTCATCAACCCGCCGCACGGCTCGATCCTCGCCGTCGGCGCAGGCATCCGCAAGCCGGTGGTGGGCAAGGACGGCGCGATCACGACGGCCACCATGATGTCGATGACGCTCTCGGTGGACCACCGGGTGATCGACGGCGCGCTGGGGGCCGAGTTCCTGAAGGCGATCGTCGAGAATCTCGAGAACCCGATCGCCATGCTGGCCTGACGCCGGCGGCGCTCCTCGCGAGGGAGCGCCCGCTGCACGCTCTGGACATGAAGAAGGGGCCGATCCTTGCGGATCGGCCCCTTCTTCATGTCGGGCAGCTCACCCGTCGCTGAGCAGATGGTCCATCGTGACGGCGGGCTGTGCGCAGCCCGCTTCCCCCACCACGCGGGCTGGCACGCCCGCCACCGTCGTGCAGGGCGGCACGTCCTGCAGCACGACCGAGCCGGCCGCGATCCGGCTGCAATGGCCCACATGGATGTTGCCGAGCACCTTGGCCCCTGCCCCAATCAGCACCCCGTTGCCGATCTTCGGATGGCGGTCGCCATCCTCCTTGCCGGTGCCGCCGAGCGTGACCGAATGGAGCATCGAGACATTGTCGCCCACGACGGCCGTCTCGCCGATGACGATGGAATGCGCATGGTCGATCATGATGCCGCGCCCGATCCGGGCGGCTGGGTGGACATCGACGCCGAAGGCTTCCGACACGCGCATCTGCACGAAATAGGCCAGGTCCTGCCGTCCGTTCGTCCAGAGCCAGTGGCCCACGCGATAGGCCTGAACCGCCTGATAGCCCTTGAAGAAGAGGATCGGCTGCAGGAAGCGATGGCAGGCGGGGTCGCGCTCGTAGACGGCGACCATGTCGGCCCGCGCCATCTGCCCGAGCTCCGGATCCGACGCATAGGCCTCGTCCGCGATCTCGCGCAGGATCTGCTCGCTCATCTCGCCCGAAGCGAGCTTCAGCGAAAAGCGGTAGGCGAGCGCCCGTTCCATGGACGGATGGTGCAGGAGGCTCGAATGAACGAGTCCGCCGAGAAGAGGCTCGGCAAGGATGGCGGCATGGCCCTCTTCCACGATCCGTTGCCAGACCGGGTCCACTGTCGCAACACGGGGTCTTGCCTCAGCCATTGCAGCGTCCTTTCCTCGTCACCGCCGAAAATACCACATAGTCACTCCGGCCGACAGGGGAAGGGACGCATTCACTTTCCGAAGGTGCCTTCTTTCGGCCTCGCCCCTCGCGGAGGGCCTGCGCCCGCCGTTGCCGTCCGGCGACGGCAGCGGCCACCAGGGCCAGGGACTCGAGCTGGTCGGGATCCGGGTCGTGCCCCTCGCTGCGGTGCGCGAGCCCGGCCACGGCCGAGGCGAGGCACCCGTTTCCCCATAAGGGATGCGCGTGGCCGAGCCGTCTGGAGACCTTGTGAGCGAGATGCGCCCGCTCGATGAGGGTTGCCAGCGCCTCCGGTCTCTGCCCCGGCGGCATCGGCGCGAGCCAGTTTGCTACGGCAAGGAGGTCGGCGAGGAGAACCCGCCGCATGGGTCAGAGCGCGATGCGCCGGAACGGCCCGGGCCCGAAGCTCTCCGATACCTGGGCGACTTCGAGGACGAAGGCGCTTCCCACACCATCCGCCGCCTGCGCGGCCCCGGTGTATGTCCAGCGGGGCTCCCTCACCTCCTCCTGCCGGCGTACCTCTCCCCACGCCACGACGCGGACCAGATAAAGCTCGCTCTCCTCTCCCAGCGGCACCTCGCGCCCCTCCCAGCTGTCGCCGTCGATGCGGGTGCGACGGATCCAGCGCAGCAGGATGTGCTCCCCGTCCCGCTCCACCGCCAGATGGACCGGCGAATAGGGCCGGAGCCCGACCCCTGCGAAAGCCTGAACCCGGTGGACTGTGTCCGGGTCGTCGTAGCCGCGGGCGGCGGCGCCGATCCGATAATGCCGGGCGAGCCCCCGGTCGGCGAGCGGCATCTGAAGCTGCGTCAGGGCGCCGTTGACGAGAACGACCGTGCTGCCCTCGGGCCAGATCTCGGGCATCTGCCCATCCGTTCCCGCCTGCCCCCGGAGCCTGCGCGAAAGCTCCCAGACCCGCGGCGAAACCAGCCGCGCGTCCGCGAACTGGAAGAGCTCCCACCCGTCGGGTCGGCCGTCGCCGATGGCCATGAGGTTGGCTCCGTTCAGCACCTCGAGATCCGAAGCCGCGGCGAGCGGCGGCCCCAGCAGCCGCACCCGGAGCGGCGCGCCCATGTCCCATCGCCCCATCGGTGCAGCCCGCAGGATCGTCTCGGTGACGCCCGCCGTCGCAGCCACGCCCAGAAGCCGGTTCAGGCGATAGCCCGCATCCTGCGTCGCGCTCCAGACCGCAACCTGCCCCGGCCAGGGCGTTGCTGCCACACCGAAGTGGGGGGAATGGGGCAGCTCCGTTCCCGTGAGCAGGGGCAGATCGAGAAAGATCGGCCGGACCGGACCGGAAGCCGCAATGCGACGCGCGCGCACCTGCTCTTCGGCCATATCGGCGGGGCGGTAGATGCCGGGTTCCACACGAACCGCCTCCGCCAGCTGGCATTCGGCCTGATCGAGGCGGTCGATGCGATAGCGCCGGCCTTCAATCTCCAGAACGTCGCCGGCACCGAAGGCCAGAGCCGACCGCGGCAGGGCCAGACGCAGGCTGTCGCGCGCCACGCGGGCCTCGGCCAGCCACCGCTCGACGGCGCCCCGCGCCTCACCCGAGGTGAGCACGAGCGGCACTTCGGATTGCGAGACGGCAAAGGTCGCTTCGTCCGGGAAGGCCGCCTCGACCTGCCGTGCCTCATAGTCGCCTTCCGCCTCGAGGAAATGCAGCCGAACCCGGCCTGCGGTCTCGACCTCCGCCGCGCGCACGCGCTCGAGCGCGCCCTCGAGATCGTCCGATGCCACCAGGTCCTCGCGCGAGACCTCGGCCTTCGCGCGCCCGTCGCGCATGCGGAAGATCAGCTTGCCATCGCGTTCCAGCGCCTCGAAGCCATAGGCCAGCATCAGCGGCTGCAAGGCCGCCCGGGCACTGACGACCTCGGAAATCCCATAGCCGCGCACCACGCCCTGCAATTGCGTGACGTCGATGGCGGTTACCCCGGACCGTTCGCAGATCTCGGCCACCACGGCGGACAGCGGTTGCGAGGCCGCGCGACCGTTCAGCCAATGACCGCGCGCATAGTTCGAACCGTCGGACCAGACATCGCCGAGCGCGGGAAAGGCCGGAAATGGACGGGCGTCCCAGGCCCAGGCATGGGCGCGGTCCATGTCGACCATGGAGCCACCATAGAGGTCCGACACCGGATTGTTGGCCTCGTCCTGCCAGTATTCGGCCATCGCGCGCAGGTAATTCATCGCGATCCCGTCGTCGCGTCGTCCGGACGAGCCCCGCGGCAGCGCGGATTCCGAGCTTTTGGGATCGATGAACAGGTTCGGCTGATTGCTCCCGAGGTCGAGCGCCGGACACCCATATTCGGTGAACCAGATCGGCTTCGAGCGCGGGACCCAGCCGGTCGGCTGCGCCAACCGCACGCCGTCGATGCGCTCGTGATGGGGCAGCGACCACCAGGATCGGAGGTCCTTGGGGCGATGCACCCAAGGCTCTCCGAAATCCGCATCCGTGATCGGCCGCCGGTCCTGAGCCGCCGCCGCGGCCTCGTCGGGATAATACCACGCGAAACCCTCGCCTCCGGCGATGTTCGACTTGAGGTAGCCCAGATCCTGCACCGAGGGCCACCGCGCGTCCGCATGAGTCTCGGCCTCGCGCCAGTCCGAGAGCGGCATGTAATTGTCGATGCCGATGAAATCGATGTTCGGATCCGACCAGAGCGGATCGAGATGGAAGTAGAGATTGCCATCGGCATGGAGGCTGGACCATTCGCTCCAGTCGGCGGCGTAACCGATCTTCACCTCGGGCCCGAGGATGCCGCGGACGTCGGCCGCGAGCCTCCGCAGTTCGGCCACCGCCGGGAAGCTGTCGTCCGCCCCCCGGATCGCCGTCAGCCCGCGCAACTCGGAGCCCACGCAGAAGGCCTCCACCCCGCCGGCCAGCCGGCAGAGCCAGGCCGCGTGGAGAATGAAGCGCCGGTAGCGCCATTCGTCGGCGCCGTCATAGAGCACGCGCCCGTCCTCGATGCGGAACTGGCGAGGCTCCGCATCCCCGAAGAAGGCGCGCACCTCATCCTCGGCGGCTCCCGTTCTGTCAGGCGTCCCCGGCTGACCCGGCGCTGTCGAGAGCGTGATCCTCCCGCGCCAGGGCAGCACCGGCTGGCCCGCAGCACCTGACCAGGGGTCCGGCAGATCATTACCGGGAAGCTGTTCCATGAGGATGAACGGATAGTAGAGCACCGAGAGGTTCTGCGCCTTCAGGGCATGGATCGCCTCGACCACCGCCGCATCCGCAGGCGTTCCCCCATAGACCGGACGGTCGTCGATCCGCGCCACCCGCTCGGCCTGCGCGCGGTCGATCCCACCCGACCGCCAGGCCATGGTCGCTGCGTCGAGATGGCCCTCGACCTTGGGGCGGATCTCGCAGGCGCCGCAGCGCAGGTCGCTCCCGAACCAGGACACGACCAGCGACACCGACCGGCAGGCCGGGAGCTCGCCCCTCAGTTGTGCAAGCGAGCTGCGCATGTCGCTTTCCCCCGACGCCGTGTGCACGTTGGCGACCTCGGTCTCTCCCGGATCGGGCCTGTAGGCGAGACGGCTCGTGGCCAGAGCGTATTCTCCCGTGCCCGGGATCAGTGCCACGCCCTTCACCGCGCGCTGAAGCGTCATCACCCGATCCGCCAGAGCCCCCTGCGCTGCCCGCATCACCTCGAACGAGAATTGCGGCACCCGGTTGCCGAACCGGGCGAGGTCGAGATCCTCGATCACCACATAAGCGATCCCGCGGTAGGCCGGCGCCATGCCCGCCCCCTCCACTGCCTCGATCTTGGGATCCGGCAGCTGACCCTCCGTCCCGCGATAGAGCCGAAGGTTGAGCGACCCCGTCGCGATCTCGGCACCATCGGCCCAGATGCGGCCCACGCGCAGAATCTCTCCCTCGCACAGCGCGACGGCGAGGTTGACGGAATAGGAATAGCTCACGACGGTCGATCTCGGGCCGCCGCCCTTTCCGCCGCCCGTGCTCTCGGACTGGCTCTCGAGGAAGCGCGAGGCCCAGATCACCTGTCCTGCCACCCGATTGCGACCGAACAGCTGGGCCACGGGGGCGCCCTCGCCGGCGCCCATCAGACGGAAACGTTCGATCCGCCCGGTGCGGACCGTCTGCGATCCGGCGCCCATCAGCCGCTGGTCGATCAGCTGGCCGACCGTCGCCCCGATCGCGCGCCCGATGACCGCCCCTGTCAGCCCGAGCACCGTGCCGCTGAAGCCTGCTCCGATCGCCGATCCGGCCGCGGCCAGCAGAAGCGTCGCCATCAGTCCGTCCTTTCCGGAAATGCATAACGGGCCGCGATCCGCTCGGCCCAGGGCTCGGAGAGCGGGCTTTCGACGGTGCCATGGCCGCCATAGGCGTGGATGAACGAGGCATTCGGGCCCGGACGAACCAGAATGCCGAGATGTTTGGCGACCGCGCCCGGCCGCATGCGGAAGAGCAGGACGTCTCCCTCCGCCGCAGCCTCAAGCGGCTTGCGGACAAGGCACCGGTCGCAGGCCGCCATCAGCTCTTCATGGCCAGACGGCTCCGACCAGTCCGGCGTATAGGCAGGCACTTTCAGCGGCTCCGGGCCGATCAACGACCGCCAGATGCCCCTGAGAAGGCCGAGACAGTCCGCGCCGGCCCCCCTGGCCGAAGCCTGGTGGACGTAGGGCGTGCCCAGCCAGGACCGTGCCTCGGCCACCACATCGGCCCCGTTCATCGCCACAGGCTCCCACCGTCATGCGTCCCCGTCGCCGCGGGATAGGCACTGAGCCAGTCTTCTCCCGAAAGATGAGGAAAGCCCCGGAAGTTTCTGAGATTGTTGAACTTCAGGCGACAGGTCTCGGCCATCCGGTCGCAGCCCGCCTCAAGACGGACACGGTCGCCCACGGTGATCTGCGGACCGAGCGCCTCCCACAGCTCCACTGTTCGCACCTCTCCCCGGCGCCGGTCGGACTTGATCACGCCCATCGCTCCTGCCGCCTCACCGGTCATCACCACAAGCCGCCCCTGCTCGAACCAGCGATCCGGGAAAGAGTGCAGACCCTCGAAACTGAAGCTCTGCCGCCCGACGACGGTTTCGACAGCCCGTTCGGTGGCAAATTCTTCGGCGCTCAGGTCGAAACCGCAGGCCCGGTCGCCCAGAACCGCCTGACAAGACCGCTGATAGACCCGCCCCTGCGGCTGGTTCAGCGCCTCGGTCAGGCCACGCAACTCTGCCCGGAACCGTCCCCCGGAATGGGTGATCTCGCCCAGCGTTCCGCGGAACTGCAGCAGCCTCTCACTGACATCGGCCCAGTTCACGAGCCATGCCAGCACCTCCGCGCCGTCGAACCGGCCTGCGAGCAGATCTGCCTCCGTCACCGACACATGCGAGAGCGCCCCGACGGTCTCGGCATTGTCCACCGACAGGCCCGTCGTCTGCTGCAACGCCGACGAGGTGAGCCCCGTCTCTGCCCGGAACAGCCGCCCCTCGAACGTCAGATCCCGGTCGTGATCGGTGAAGCCGAACACCGTCCCGTCCCGCCGTGTCACGGCCCAGCACCGGCAGACGGTGGCAGCGCCGCCCTTCAGATGCGCATGAAGCTCTGCCGCGCCCATCAGACCCGCACCTCCATCACCGGAACGCTCGGAACCTCGCCTGCCTGGAAGGAGGCCATCGAGATCTGGATCCGGTCGGTATCGAACCGCACCGGCACGTCGAACTCGAACCCCGCCGTGACCGCGGCGCCTTCGGCAGGGGCCGCGGCGAGCGTGACGAGCCCCCTGGTCTCGTCGACCGAGAAGCCCGCGCCTTCTACCAGCACCGCGCCTGCCACCGCCACCTTCACCGAGCCTGCAACGGGTTTCGCCACCGGCCGCACATACTGCTCGCTTCCCGAGCGATAGGTCTTGACCAGCGCGAACTCCCGGCTGATCCCGTCGCCGATGCCGAGAGGCTGATCCTTGGGCCCCGGAGAGGCGGTGGCCGGGCACGAGCGATAGTCGGCCCAGTCCTTCCAGCGAAAACCGTAAAGCTGGCCACGGCGCGCCTCGAAGAAGGCCAGGAGCGCCTGGATATCGTCGAGCGAGCGCAGCGCCACGCCCGCGTCGTAGCGGCGGCGCGAATGGGCCCAGAGGCTGTTGCGTTCCTCGAAGCCATTGACCAGCGTGACGATCTCGGTCCGACGCTCCGGCCCTCCGATCGAGCCGAAGCTGAGGTTAGTCGGAAACCGGACCTCGTGAAAGCCCATGGCGCCCCCCTACCCGTTGCGCTGGCCGCGCGCGAGCATCCGGCTCACCTGCGCGGCGATCTGGCCCCGGCTGCGCTCGAAGCCCTGCACATCGGGCGTGGCCACGTTCATAACCACCTGCACCGCGCGGCCGCCGCCCGCCTGCACGCCGAGCCGACCGTCTGCCCCGCGGGCGAGCGGCAGGATCGCTTCGGGGCCCGCCTCGCCCATGAGCCCGGTGGCCCCGCGCATGGGAAAGGGGGTCGCCCCTGCCACCACACCGCCGCGCGCGAAGGGCATGACCCTCCCCTGCGAGAACGCGCCCCCCTTCGCGAAGGGCATCAGCCCGCCGAGGATCGAGTTCAGCCCATCGGCCAGAAAGCCCGACACCGCATCCTGCACCGGCTTCATCGCGACCGAATAGACGGTGCGCGAAATCGACTGCGCGAGACCCTGCATCGCGTCCGAGAGCTTCATGCCGTCGAACAGCACCCCGTCGAAGGCGCGCCTGAGGCCCGTCCCCATGCCGCTCGACAAACGATCCACCTCGCGGCTGGTGAAGACCATTGCGTCCCGCATCCGCGCCAGTTCGCTCTCGAAGCTCGCCGTCATCCCGGCCGCACTGCCGAGCGTGCTCTCGAGCGCCGCCACCTGCTCTTCCAGAGACTCGATGTCAGCCATGACCCTCTTCCTTCTCGACATCCGGAAAGGCGCGCAGCAACTCGTCGAGGCGCGCGCGCGACAGGGGCGCCACGGCGGACGCGCCCAGCATGATCCTCAGTTCGGCAGGCGTGAGCCGCCAGAACGCCTCTGGCGACAGCCGCAACCCCTCGAGCCCCAGCCGCAGGAGGGCGGGCCAGTCGAGCCGGCCCATCTCACCCCTCCGGCAGTGCGAAGGCACGCACCAGAAGTTCCGCCGCAGCGCGCGCCGCTCCCGCCGCCCCCCGCCCACCTCCGCCCGGCCGAGCTCCTCGGCCGTGCCCTGCCAGCCGCCGCCCCGGAGCCCCGCCACGAGAACGGCGATCACGTCGCGTGTCGAGAAGCGCCGCTCCTCGAACCGCTCCACCAGCGCGATCAGACTGTCCGCGCCGAGGTCCGCCTCCAGCTCCGCCAGCGCGCCGAGCGTGAGCTTGCCCGTGTGGCACCGGCCGTCGAGCCAGATCGCCACTTCGCCCGTCCAGGGATTGGCCATCAGTGCGCCGTGAAGGTGAGCTTGCCCGCCGAGGCGAGCGTCATTTCGTAGGTGGCCTCGCCATCGTGGGTCCCCGCATATTCGATCGCCGAGATCACGAAGGGGCCCTCGACCGTGCCGAAACTCGGGATCACCACCTGAAAGCGGGGCGCCTCTCCGGCGAAGAAGAGCGCCCGCGCCCGCTCGTCCGTCGCCGCGTCGCGAAACACGCCCGAGCCCGAGATGGAAGCCGATTTCACCCCGGCCCCCGCCAGCAGCTCGCGCCAGCCGCCCTGACTCTCCAGCGACGTCACATCGACCGTCTCGGCATTGAAGCCGATGCGCGTGGCCCGGAGGCCCGCCATCGTCTCGAACCCGCCGCTGCCCGTCAGATCCACCTTCACCAGAAGATCCCTGCCATTCTGCACGCTCATGCGATCACTCCTCGCCAATCCCGGCCTCGGGCCGGCCTCATTCCTCGATCCGGGCCCGGAAGGTCAGGTCGATGCGCCGCGCGTCTCCCGAGCCGAGCCGTCGGGCCACCGCCTTCAGGAAATCGAGCCGCACCAGCCGACCCCGTTCGAGCGCGGGCGGCGCGGTCACCAGCGCGTCGGACAGGGCCAGGGCCACGCGCTTCGCTGCGAGAAACCCTGCCGCATCGGACATCACGCTGACGGTAAAGTCATGCCGGGCGCCGCGGCCGCTGCCGTCCGAGGCATCGCGGACCTCCTCGGGACCGATCAGGATCCAGGTTCCGCGCCCGCCGCCGCGGGGAACCGCGTCGTGGATCGGCACGCCCGCCAGCGCGGGCTGGCCCGAAAGATGCTCATACAGGGCCGCCTGAAGCGCGACCGCTCCCGCATAGCTCATGCCGGAACCTCCTCGCGCGCAAAGCACATGAGGTATCGCCCGTCCGCATCCTTCTCGGTCACGGCCAGCACCGGATAGAGACGCGCGCCCTCGCGGAAGCGCTGACCCGGCTGCGGCCGGGCGGCAGCGCCCGCGGGGGCTGCGCGCACCGTGATCCTCAGCGGCACCGTTCCGAGAGGGATTTCGATCCCTCGCGTCTCACGTCCTGCGCCGGGCTCGATCCGCCCCCAGACGATCCCGAGCGCCGTCCAGGACAGGATCGATCCCCCGGCGCCGTCCGCCCCTGCCACGGGCGCCTCGAGGACCAGCTGGCGGTTCAGCCTCTCGCTCATCAGACCGCCCCTCCACCCAGCACACGGACGGTGCGCCAGCGTTCAATAAGGCCCGTGACAGCCCGAGGCAGGCCGGCGGCCCCGAGGCCGGGTTCATAGCGGTTCTCGTAATAGTCGGCCGCCAGCAGCATCACGGCCTGCGCCAGATCGGCCGGCACGACGGACCAGTCGGCTCCGAAGCCCGCCTCGAACAGGATCTCGATCCGCCCGTCCTCCGGCGCCGCCGGCAGGACCGACCCGCGCGGCACCAGTTTCGGGCGATGCTGATCCCGCACCAGCCGCCAGAGCGTCGGCACAAGGGGAAAAGTCCGCCCCGCCGCATCGACGAGCGTCACCGACAGGATCTCGCCCACCGGCGCGATCGGCAGCGCTTGCGCCTCGTCTGTCCGCCATTCCTCCAGCATCAGCAGATAGCGCCGCGACAGCAGCACCTTGCCCGTCCGGCCCTCGATGGTGGTGATGGCCGCCCGGAGGAACGCCGCGAGCAGCGCCTCCTGCAGCCCCTCTTCGGTAAAGCCCGACCCGAGCCGCAGATGCTCCCTGAACCGGGCCATCGGCAGCGCGCTGTCTGGCACGGCCGTCTGCTCGATCAACATCATGAATGCCTCCGAAAGCTCCCGAGGGATGCGCGCGCCGGACGAAGCCGGCGCGCGCCCGCTGCGCCCCGTGCGAGGGCGGCCGTCCGCGGGGCCCTCAGGACACCGCGATCTTCAGGAGCTTGATCGCCGCATAGTCGCTGACATCGCCTCCCACGCGCTTCGTCGCATAGAAGAGGACATGGGGCTTGGCCGAGAACGGATCGCGCAGGACCCGCACCTCGGGCCGCTCGGCGATCGTGTAGCCGGCGGCGAAATCCCCGAAAGCGATGGCAAAGGCGCCCGCGGCAATGTCCGGCATGTCCTCGCACAGCAGAACCGGATATCCCATCAGCCGCGCAGGCTCCGCCGCCGCCAGCCCGTCCGACCACAGGAAGCGGCCGTCCGAGTCCTTCATCTTCCGCACCGCGCCCGCGGTCTTCGAATTCATCACGAAGGTCGCATTCGCGCGGTAATCGGCGCCGAGCGAATAGATCAGGGTGATGATACAATCGGCCGGGTTCGTGGCGAGGAAATCGCTCGCCGCACCCGAGGGGACATAGCCGATCGAGCCCCAGCTCCAGCTCGCGTTCGCAACCTTCGCCGGCGCCAGAAAGCCCCGCGGCTTGTCGATCCCGTCGCCGCTGACGAAGGCCGCGCTCTCGGCCCGCATGAAGCGCGTCGCGATCTTGCCCGCCAGCCAGCTCTCGACGTCGAAGGCCGAGTCGTCCAGAAGCCGCTGGCTCGCCTTCGGCATCGCCGACAGTTCGTGCAGCTTGATCGAGATCCGCTCGATGGTGGGCGAGGCGCTTTCGCTGATCGTGGCGGCCTCCGTGGCCCAGCCCGACCCCACCTCAGTGCGGTCGATCAGCACGTCGAAGCTCGTGGCTTCCACATGGACCACACCGGCGATCTGACGGATCGAGGCCGTGGACAGCAACATCGAGCGGATGGCGTCCGAGGTCTGCGGATCGACCAGATAGCCGCCGTCCGAGGCGACGCTCGCCGTCATCGCCTTGCCCTCGAGGACGAGGCCGCGCAGACCGTCGTCGTCGCCCGAGCGGAGATAGGCCCCGAACGCCTTGCGATGCGGCGCCTCCTGGTCGGCCGCGGCCGCCAGCGCCGGGCGCCCGTAGATCATGGTTTTGCGGTCCAGCATGGTCAAACGCTCTTCCTGTTGTTGCAGCACATTCTTCACCTCCTCCTGAAAGCGATTGATCTCCTTCAGGAAACCGGCCATTGCGGCTTTCGCCTCCACGGCCGGATCGGGGCCTGCGGACATGCCTGTCCCGGCCCGAGCCCAGGTCTCGGTCATCCTCGTCTTCCTCTGACTTGCGGTGGTTCGGGTCAGGCGCGGAGCTTCCCGGTCCGGCCTCCTGTTCAGCCCCCGGAGAGGCTTCGCCGGGCGTCCTCGAAGAGGTCGGCGATGTCCTGCCAGTCCAGCCCGTCACGGTCCTCGCCCTTGGCGGCGACCCGCGCCTCGGGCAGCATCGGGAAGGTGACGAGCGAGACCTCCCAGAGCTCGATCTCGGTCAGCAGCCGCCGACCCTTGGCATCCCGCTCGGCGCGCACCGTCCGGTAGCCGATCGACAGCCCGTCGAGCGCCTTCGCCGCAATCAGCGCGGCGGCTTCCCGGCCCCGTCCGATCTCGGGCAGGATATGGCCTCTGACCCACAGGCCGGTCTCGTCCTCGTGCACCTCGTCCCAGACGCCGATCACCTCGGCCGGGTCGTGCTGCCAGAGCATCTTCGCCCGCCGCCCCTGCGCCCGCATGGCCTCGAGCGAGGCGGCATAGGCGCCGCGCGCCACCACGTCGCCGCCGTTGTCCGCCCGCCCGAAGATCGACGCATAGCCCTCGATCACCGAGCCCTCGGACAAGACGAGCCCGGTCTCGGGCCGGTGAAACTTCCGTTCGGGCGCACCGAATTCGTCCCTCAT
Protein sequences of DBSCAN-SWA_3 >NZ_CP051468|1693033:1714552|1698269_1698572_+|WP_002719640.1|DBSCAN-SWA MNIPTSRPAIGASLYFALAFSLSVYFTFAAVQGDYGVFRQVQIQAEAEALRSERDRIAAELADMENRTRRLSDSYLDLDLLDEQARSVLGYVRADEIVIR >NZ_CP051468|1693033:1714552|1710857_1711271_-|WP_002719628.1|tail|DBSCAN-SWA MSVQNGRDLLVKVDLTGSGGFETMAGLRATRIGFNAETVDVTSLESQGGWRELLAGAGVKSASISGSGVFRDAATDERARALFFAGEAPRFQVVIPSFGTVEGPFVISAIEYAGTHDGEATYEMTLASAGKLTFTAH >NZ_CP051468|1693033:1714552|1702627_1703434_-|WP_002719636.1|DBSCAN-SWA MAEARPRVATVDPVWQRIVEEGHAAILAEPLLGGLVHSSLLHHPSMERALAYRFSLKLASGEMSEQILREIADEAYASDPELGQMARADMVAVYERDPACHRFLQPILFFKGYQAVQAYRVGHWLWTNGRQDLAYFVQMRVSEAFGVDVHPAARIGRGIMIDHAHSIVIGETAVVGDNVSMLHSVTLGGTGKEDGDRHPKIGNGVLIGAGAKVLGNIHVGHCSRIAAGSVVLQDVPPCTTVAGVPARVVGEAGCAQPAVTMDHLLSDG >NZ_CP051468|1693033:1714552|1693033_1694284_+|WP_011337526.1|tRNA|DBSCAN-SWA MTYHPKSDFLRVMQERGYLADCTDMQALDEALSKGVVPAYIGYDATAASLHVGHLLNIMMLRWLQKTGHKPITLMGGGTTKVGDPSFRSEERPLLTPDRIDENIEGMRKVFARYLSYGEGATDALMLNNAEWLDQLNYLDFLRDIGRHFSVNRMLSFESVKSRLDREQSLSFLEFNYMILQAYDFLELFRRTGCRLQMGGSDQWGNIVNGIDLTRRVLEGEIFGLTSPLLTTSDGRKMGKSAGGAVWLNGEMLAPYDFWQFWRNTTDADVGRFLKLYTELPVEECDRLGALQGSEINAAKILLANEVTTLLHGRDAAEAAEATARAVFEEGGVGGALEVVELPAATLGEGLSVAHFLVAAGLVASGKEAKRLVAENGLRFNNEPVGDANTPVTAATVGEELKVSIGRKKHKLVRLS >NZ_CP051468|1693033:1714552|1712038_1712638_-|WP_011337512.1|DBSCAN-SWA MMLIEQTAVPDSALPMARFREHLRLGSGFTEEGLQEALLAAFLRAAITTIEGRTGKVLLSRRYLLMLEEWRTDEAQALPIAPVGEILSVTLVDAAGRTFPLVPTLWRLVRDQHRPKLVPRGSVLPAAPEDGRIEILFEAGFGADWSVVPADLAQAVMLLAADYYENRYEPGLGAAGLPRAVTGLIERWRTVRVLGGGAV >NZ_CP051468|1693033:1714552|1694525_1695119_-|WP_011337525.1|DBSCAN-SWA MSSEKFLFAGLMSLGLAVLGGYGLSQSAVFAQAAEDGPGPNLVIELTGQAEGEVVIDLLPDVAPAHVERMVTLAREGAYDGVVFHRVIEGFMAQTGDVEFGKGNNLRRAGTGGSSYPDLPAEFSDVPFTRGVVGMARSDDPNSANSQFFIMFAPGEFLNGQYTVVGRVVQGMDVVDRIARGEPPRTPDAMAKVTVAE >NZ_CP051468|1693033:1714552|1710339_1710540_-|WP_017140161.1|tail|DBSCAN-SWA MGRLDWPALLRLGLEGLRLSPEAFWRLTPAELRIMLGASAVAPLSRARLDELLRAFPDVEKEEGHG >NZ_CP051468|1693033:1714552|1712732_1713929_-|WP_011337511.1|capsid|DBSCAN-SWA MTETWARAGTGMSAGPDPAVEAKAAMAGFLKEINRFQEEVKNVLQQQEERLTMLDRKTMIYGRPALAAAADQEAPHRKAFGAYLRSGDDDGLRGLVLEGKAMTASVASDGGYLVDPQTSDAIRSMLLSTASIRQIAGVVHVEATSFDVLIDRTEVGSGWATEAATISESASPTIERISIKLHELSAMPKASQRLLDDSAFDVESWLAGKIATRFMRAESAAFVSGDGIDKPRGFLAPAKVANASWSWGSIGYVPSGAASDFLATNPADCIITLIYSLGADYRANATFVMNSKTAGAVRKMKDSDGRFLWSDGLAAAEPARLMGYPVLLCEDMPDIAAGAFAIAFGDFAAGYTIAERPEVRVLRDPFSAKPHVLFYATKRVGGDVSDYAAIKLLKIAVS >NZ_CP051468|1693033:1714552|1707733_1708174_-|WP_009564474.1|DBSCAN-SWA MNGADVVAEARSWLGTPYVHQASARGAGADCLGLLRGIWRSLIGPEPLKVPAYTPDWSEPSGHEELMAACDRCLVRKPLEAAAEGDVLLFRMRPGAVAKHLGILVRPGPNASFIHAYGGHGTVESPLSEPWAERIAARYAFPERTD >NZ_CP051468|1693033:1714552|1695766_1696960_+|WP_011337524.1|DBSCAN-SWA MGWKTLDDMDLAGKVVLVRVDVNVPMENGEVTDATRIEKIVPTVEDILKKGGKPVLLAHFGRPKGKVVDEMSLRLVLPALQNALPGTKVSFAADCVGPEPEQAVAAMLEGEVLLLENTRFHAGEEKNDPELAAAMAKLGQVYVNDAFSAAHRAHASTEGLARLLPSAAGRLMEAELKALEAALGHPERPVVAVVGGAKVSTKLDLLGNLVGRVDHLVIGGGMANTFLVAQGIEVGKSLAERDMADTAREILSKAKAAGCTIHLPLDVVVAREFKAGAANETVETAACPADAMILDAGPKTVAALSEVFASAKTLIWNGPLGAFEIEPFDAATNAAALQVAQLTKAGQLISVAGGGDTVAALNKAGAAEGFSYISTAGGAFLEWMEGKELPGVAALTV >NZ_CP051468|1693033:1714552|1708170_1709055_-|WP_011337518.1|DBSCAN-SWA MGAAELHAHLKGGAATVCRCWAVTRRDGTVFGFTDHDRDLTFEGRLFRAETGLTSSALQQTTGLSVDNAETVGALSHVSVTEADLLAGRFDGAEVLAWLVNWADVSERLLQFRGTLGEITHSGGRFRAELRGLTEALNQPQGRVYQRSCQAVLGDRACGFDLSAEEFATERAVETVVGRQSFSFEGLHSFPDRWFEQGRLVVMTGEAAGAMGVIKSDRRRGEVRTVELWEALGPQITVGDRVRLEAGCDRMAETCRLKFNNLRNFRGFPHLSGEDWLSAYPAATGTHDGGSLWR >NZ_CP051468|1693033:1714552|1713994_1714552_-|WP_043764141.1|head,protease|DBSCAN-SWA MRDEFGAPERKFHRPETGLVLSEGSVIEGYASIFGRADNGGDVVARGAYAASLEAMRAQGRRAKMLWQHDPAEVIGVWDEVHEDETGLWVRGHILPEIGRGREAAALIAAKALDGLSIGYRTVRAERDAKGRRLLTEIELWEVSLVTFPMLPEARVAAKGEDRDGLDWQDIADLFEDARRSLSGG >NZ_CP051468|1693033:1714552|1709696_1710347_-|WP_017140162.1|tail|DBSCAN-SWA MADIESLEEQVAALESTLGSAAGMTASFESELARMRDAMVFTSREVDRLSSGMGTGLRRAFDGVLFDGMKLSDAMQGLAQSISRTVYSVAMKPVQDAVSGFLADGLNSILGGLMPFAKGGAFSQGRVMPFARGGVVAGATPFPMRGATGLMGEAGPEAILPLARGADGRLGVQAGGGRAVQVVMNVATPDVQGFERSRGQIAAQVSRMLARGQRNG >NZ_CP051468|1693033:1714552|1711703_1712039_-|WP_011337513.1|head,tail|DBSCAN-SWA MSERLNRQLVLEAPVAGADGAGGSILSWTALGIVWGRIEPGAGRETRGIEIPLGTVPLRITVRAAPAGAAARPQPGQRFREGARLYPVLAVTEKDADGRYLMCFAREEVPA >NZ_CP051468|1693033:1714552|1709054_1709687_-|WP_011337517.1|DBSCAN-SWA MGFHEVRFPTNLSFGSIGGPERRTEIVTLVNGFEERNSLWAHSRRRYDAGVALRSLDDIQALLAFFEARRGQLYGFRWKDWADYRSCPATASPGPKDQPLGIGDGISREFALVKTYRSGSEQYVRPVAKPVAGSVKVAVAGAVLVEGAGFSVDETRGLVTLAAAPAEGAAVTAGFEFDVPVRFDTDRIQISMASFQAGEVPSVPVMEVRV >NZ_CP051468|1693033:1714552|1699797_1701189_+|WP_011337522.1|DBSCAN-SWA MATQVLMPALSPTMEEGTLAKWLVKEGDAVKSGQIIAEIETDKATMEFEAVDEGTVGKLLVAEGTSGVKVNTPIAVLVEEGESADEVQAPVPTQKEKQPEPAEASEGKAVDEPLVSSPGAPVPGKRDRSPDWPDGTQMKTMTVREALREAMAEEMRGDEHVFLMGEEVGEYQGAYKISQGLLDEFGDRRVVDTPITEHGFAGIAVGAAFGGLRPIVEFMTFNFAMQAIDQIINSAAKTLYMSGGQMGCPIVFRGPNGAAARVGAQHSQDYAAWYAQIPGLRVVMPYSAADAKGLLKTAIRDPNPVIFLENEILYGRSFEVPVMDDFTIPFGKARIWREGTDVTIVSFGIGMTYALEAADKLAAEGISAEVIDLRTLRPIDYETVIESVKKTNRCITVEEGWPVGSIGNHLAATIMQQAFDWLDAPVLNLTGKDVPMPYAANLEKHALVTTAEVVEAAKSVCYR >NZ_CP051468|1693033:1714552|1695108_1695618_-|WP_002719643.1|DBSCAN-SWA MAEIKDPENTIIMTLKDGEVVIELLSDVAPKHSERMKALARAKAYDNVVFHRVIEGFMAQTGDVANGNTEDGFNIRRAGTGGSDMPDLPAEFSKLPHARGTLGAARSQNPNSANSQFFINFRDNDFLNGQYTVYGRVISGMEHVDKIARGEPPANPDKMLTVRVAADVV >NZ_CP051468|1693033:1714552|1697061_1697991_+|WP_011337523.1|DBSCAN-SWA MRASRAVQKILANYEGETPGVKANLCRMLMEGKLGGTGKMIILPVDQGFEHGPARTFAPNPAGYDPHYHYQLAIDAGLSAYAAPLGMLEAGADTFAGQIPTILKVNSANSLMSDTAGKNQAVTASVDDALRLGCAAIGFTIYPGSDAQLDMYEGIVAMRKEAAAKGIATVIWSYPRGEAISKDGETAIDVAAYAAQIAALIGAHIIKIKLSTDHLMLGEAKKVYEAQQIDVSTQAARVKHCMDSAFAGRRIVVFSGGAKKGEDSVYDDARAIRDGGGNGSIIGRNSFQRSREDALAMLGKLVDIYKGRA >NZ_CP051468|1693033:1714552|1698794_1699784_+|WP_002719639.1|DBSCAN-SWA MATRKSPEQSNASKEELVRYYREMLLIRRFEEKAGQLYGMGLIGGFCHLYIGQEAVVVGLEAAAKEGDKRITSYRDHGHMLACGMDAKGVMAELTGREGGYSKGKGGSMHMFSKEKHFYGGHGIVGAQVPLGAGLAFADRYLGNDNVTFTYFGDGAANQGQVYEAYNMARLWSLPVIFVIENNQYAMGTSVKRSTKSPSLWERGAAYGIKGESVDGMDVLAVKAAGEKAVAACRAGQGPYILEMMTYRYRGHSMSDPAKYRTREEVQRMRDEKDAIEHVRDLLIQGNLATDDDLKAIDKEIKAVVNEAADFAKESPEPALEELWTDIYA >NZ_CP051468|1693033:1714552|1701199_1702528_+|WP_011337521.1|DBSCAN-SWA MATEILMPALSPTMEEGTLAKWLKKEGDEVRSGDIIAEIETDKATMEFEAVDEGILGKILIAEGTAGVKVNTPIAVLVEEGESVDAVSSAKVPEPQEPADEAAPAQGAPKEAPAPAAKAPAAQAARSEGERVFASPLARRIAKEKGIDLAAVQGSGPRGRIVKADVEGAQPSAAPAAKADAAAPKAEAPAAAAAPVAAPAASAASVAKLFADRDYEEVTLDGMRKTIAARLSEAKQTIPHFYLRREVALDALMAFRADLNAKLESRGVKLSVNDFIIKACAVALQQVPNANAVWAGDRILRLKPSDVAVAVAIEGGLFTPVLRDAHQKSLSALSAEMKDLAARARTKKLAPHEYQGGSFAISNLGMFGVENFDAVINPPHGSILAVGAGIRKPVVGKDGAITTATMMSMTLSVDHRVIDGALGAEFLKAIVENLENPIAMLA >NZ_CP051468|1693033:1714552|1711308_1711707_-|WP_011337514.1|DBSCAN-SWA MSYAGAVALQAALYEHLSGQPALAGVPIHDAVPRGGGRGTWILIGPEEVRDASDGSGRGARHDFTVSVMSDAAGFLAAKRVALALSDALVTAPPALERGRLVRLDFLKAVARRLGSGDARRIDLTFRARIEE >NZ_CP051468|1693033:1714552|1703843_1707734_-|WP_011337519.1|DBSCAN-SWA MATLLLAAAGSAIGAGFSGTVLGLTGAVIGRAIGATVGQLIDQRLMGAGSQTVRTGRIERFRLMGAGEGAPVAQLFGRNRVAGQVIWASRFLESQSESTGGGKGGGPRSTVVSYSYSVNLAVALCEGEILRVGRIWADGAEIATGSLNLRLYRGTEGQLPDPKIEAVEGAGMAPAYRGIAYVVIEDLDLARFGNRVPQFSFEVMRAAQGALADRVMTLQRAVKGVALIPGTGEYALATSRLAYRPDPGETEVANVHTASGESDMRSSLAQLRGELPACRSVSLVVSWFGSDLRCGACEIRPKVEGHLDAATMAWRSGGIDRAQAERVARIDDRPVYGGTPADAAVVEAIHALKAQNLSVLYYPFILMEQLPGNDLPDPWSGAAGQPVLPWRGRITLSTAPGQPGTPDRTGAAEDEVRAFFGDAEPRQFRIEDGRVLYDGADEWRYRRFILHAAWLCRLAGGVEAFCVGSELRGLTAIRGADDSFPAVAELRRLAADVRGILGPEVKIGYAADWSEWSSLHADGNLYFHLDPLWSDPNIDFIGIDNYMPLSDWREAETHADARWPSVQDLGYLKSNIAGGEGFAWYYPDEAAAAAQDRRPITDADFGEPWVHRPKDLRSWWSLPHHERIDGVRLAQPTGWVPRSKPIWFTEYGCPALDLGSNQPNLFIDPKSSESALPRGSSGRRDDGIAMNYLRAMAEYWQDEANNPVSDLYGGSMVDMDRAHAWAWDARPFPAFPALGDVWSDGSNYARGHWLNGRAASQPLSAVVAEICERSGVTAIDVTQLQGVVRGYGISEVVSARAALQPLMLAYGFEALERDGKLIFRMRDGRAKAEVSREDLVASDDLEGALERVRAAEVETAGRVRLHFLEAEGDYEARQVEAAFPDEATFAVSQSEVPLVLTSGEARGAVERWLAEARVARDSLRLALPRSALAFGAGDVLEIEGRRYRIDRLDQAECQLAEAVRVEPGIYRPADMAEEQVRARRIAASGPVRPIFLDLPLLTGTELPHSPHFGVAATPWPGQVAVWSATQDAGYRLNRLLGVAATAGVTETILRAAPMGRWDMGAPLRVRLLGPPLAAASDLEVLNGANLMAIGDGRPDGWELFQFADARLVSPRVWELSRRLRGQAGTDGQMPEIWPEGSTVVLVNGALTQLQMPLADRGLARHYRIGAAARGYDDPDTVHRVQAFAGVGLRPYSPVHLAVERDGEHILLRWIRRTRIDGDSWEGREVPLGEESELYLVRVVAWGEVRRQEEVREPRWTYTGAAQAADGVGSAFVLEVAQVSESFGPGPFRRIAL |
22 | Paracoccus_phage(40.0%) | tail,protease,tRNA,capsid,head | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
1811285 : 1844002
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_CP051468|1811285:1844002|DBSCAN-SWA TATGCCGGAGGATCGCTCGCTGGGCGATCTTCTGTCCGACCTCGTGCGGGATGTCACCGGGCTCGTGCGCAGCGAGAGCCGCCTGATCCGCGCGGAGGTTGCCGAGGCCGGACGCAGCATGGCCGTCGGCGCCGAGATGATCGCTGCGGGGGGGATCCTCCTCCTCGTGGCGCTTCTGGTGCTGGTGCAGGCGCTGGTCGTCCTCCTCGCCCATTGGGTCGGTCCTGCCTGGGCCGCCCTCATCGTTGGCGCCGTCCTGGCCGTGATCGGCGGGCTGCTGATCGCCCGAGGCCGCAAGGACATGAGCGCCGCCAATCTGGTGCCGGAGCGCACCATCGAACAGACCAGCCGCGACGTGCGGCTTGCACGGGAGCAGATCTGATGAATGCCGACAAGATCGAACGTGAAGTGGAAGAGAACCGCGCGCGAGTCGAAAGCACTCTCGACGCGCTGAAGGAGCGGATGTCCGTCAATCAGGTCGTCGACGATCTCGCGAATTTCGTGGGCGTCGAGGATCTGCGGGGCGTGATGCATTCCGCCGGCCGGCAGGTGCGCGACAATCCGGTGGCTCTGGGCCTCATCGGGGTGGGTCTGGCGTGGCTCGCCTTCGGCGGCTCCTCGAGCCGCTCGCGCCATGTCAGCGCCTATGACCGCGAAGAATATTACCGGAGCGACTACGGCCCCGCCCGCCGGTCCTACGAGCCCTACGGCGGTGGCGCCTCCTATCGTTCGGACCGGGGCGAGGGCGTGGTCTCGCGCGTCAAGCATGCGGTGAGCGACGCTGCCGACAGCGTGAGCCGCGCGGCCCATTCCGCGACGGACAAGGTGGCCGAAACTTTCGGCGACGCGCGTGACCGCGCGGGCAGCCTGCGCGACGATGTCTACGACCGCGCCGGTCGGATGCGCGAGGATGCCTATGATCGTGCCGGCCACTGGCGCGACGATCTCGGCGAGCGATCGGCGCATCTGCGCGACCGTGCCGGGCATCTGCGCGACCGCGCGTCCCACGGCGCGCATCAGATGCGCGACAGCATGAGCCACGGCATGGAGCAGCAGCCCCTGCTCGTGGGTGCCGCGGCCGTGGCGCTCGGCGCCGTGATCGGGGCCGCGCTTCCCCGGACGCGCACCGAGGACGAGTGGATGGGCCGCACCAGCGACGAGCTCTGGGACGAGGCGAAGGCTTCGTCGTGGGAGCTGCGCGAGCGGGCGATGAAGGCCGCGCGCGAGACCTACGACGCCACCATCGCCGCGGCGCGCGACGAAGGTCTGGTGCCCGAGAAGGGCGAGACGCTGGCCTCGAAGGTGGGACGCGTCGCCGATGCGGCGGCCAGCGAGGCGAAGGCTCAGGTGGAGCCCGTGCTTCACGGCAAGGAGGCGGACAAATCCTCGACCGGCATGTCCTCGGCTGGCGCGGGCTCGACGGGCTCGACCGACTCCACGACCAAGAGTCCCGGCACGTCCGGCCCCAAGGTTGCAGGCTCGGGCTTCTGAGGCTGTCCTGAAAAGCAGAAGGCTCCGGTCACAGACCGGAGCCTTTTTCATGTCGCAACCCTCCGGCATGCCTCAGCTGTAGATCGGGGGCGTCTCCTCGCCCGGGTTCGGCATGTCCGGATCGCCGGGCGGGATAGGATCCGGGTCGGGCATCGGATCGGGATAGGGATTCGGCTCGGGCATCGGGTCCGGAGCCGGCGGATCGTTCGGGCCGAGTTGCAGGTCGTTCGTCATCGCATCCTCCTCTCCTGCGGGCGCAGCGCGCGCCCCTTCCTTCCAAGAGCCGCCGGGCGGGATTGTTCCGCCTCAGAACCTGCGGCTGAGGCAGGGAGAGCCGCGCAGGCCGAAGCGCCAGGGCATGTCCGCCGCCCGGCTGATGCCGATGCGCGGGCCGCACAGGATCTCGCCTGCCGCCACCGGCGCACCCCACCGCACCGCAAAGCCCCGCGTGCCGAAGGCGGTGCCATCGTCGGCCAGTGTCAGACCCAGCGCCTGACCGATCCGGCCCGGCCCGGAACAGAGGAGCCGCGCGACATCGGTGCCGCGCCGGGCCGCCATCTGCGCCAGACCCTCCGTCGGTTCCAGAGCGCGGATCAGCACGGCATGGCCCGGCGCGCAGACCACGTTGAGGCAGAGATGGATGCCGTAGGACAGATAGACATAGGCGCAGCCCGGCGGCCCGAACATCGCCGCATTGCGCGGGGTGGGTCCGCGGAAACTGTGCGACGCCGGATCGTCCGGCGTATAGGCCTCCGTCTCGACGATCCGTCCGCCCACGCCGCGCACCTGCAGATGCGCCCCCAGAAGATCGACCGCCACCGCGGGCGCCTCGCGCGCGAAACAGGCTGCCTGCCCCTCCATCTGCCCCCCTTCCTGCGCGGTCTGACGCCGGTGTGACATCCGCCTCCCGGAACATTCGTCCGCCGCTGCCGTTCCTTCCGCAACCCCGCACACGCTGCGGGACCGGTGATGTTCACGAATGGAGAGACTGTCGATGAGAAAGATTGCTTTCACCGCGAGCCTCCTCGCCCTGGCCGCGGCCCCGGGCTTTGCACAGAGCGGCGCGGACGGGATGGATCCCACCCGGCCGGTAGCTCCCGAGACCGGCACGGGCAGCATCAAGGATATCAGCCAGCAGACCTGGTCCGACGATGTGCGCGACGTGTTCTTCGAGGATGCGGCGATGACTTCGCTCCGAGCGACCGACGAGATGGAAAAGCGGTGGGCCGCGCTCGACCCCGAGGCCAGGGAGCGCGCCGTGCGCGATTGTCAGGCCTATCTGATGGACGCGGGTTCCATGGGCGGCGCCTCGTCCGGCGGCATCGGCGGCTCGCTGACGCCGGATGCCGATGAGGCCGGGACGACCGGCGAGGATCTCGGCGGCACGGATACCCCGGCCCCGGTTCCTGAGGACACGACGAACGCGTCCGACGCGCCCGCCTCTCCGGGCGATGCCTCGGTCTGGACGCAGGCCTGCTCGTTCGTGACCTCGTTCAAGCCGCAGTAACGCCGCCCGACGCCCATCGCGGCCCGGTCGATCCTCGCCCGGGCCGCAATCCTGCCGGGGCGCGCCGGATGAGACGCGCCCCGCACAGACTCCAGCCGTTCCCCCTGTCAAGGAGAGCCAGATGGCCGAAGACAAACCTGCCGCCCGTTCCGGCAGTTTCGAAAACCTCGTGTCGGACTACGGCGCGGGCGGCGAGTTCCACCAGCACGCCGGCGAAGGGGACGAGCGTCTGACGACGGCCCAGGGCGTCACCATCGCCGACAACCAGAATTCGCTCAGGAGCGGCGCGCCGGGACCGACCGCGCTCGAGGATTTCATCCTCCGCGAAAAGATTTTCCACTTCGACCATGAGCGGATCCCGGAGCGCGTGGTCCATGCCCGCGGCTTCGGCGCGCACGGGCTCTTCGAATGCACCAAGGCCATTCCCGAAATCACCCGCGCCGCCCCCTTCCAGAAGGAAGGCAAGATCACCGAGACCTTCGTCCGCTTCTCGACGGTGGCGGGCTCCAAGGGGTCGGTGGACCTTGCCCGCGACGTGCGCGGCTTCGCGGTGAAATTCTATACCGATGAAGGCAACTGGGACCTCGTCGGCAACAACATCCCGGTCTTCTTCATTCAGGATGCGATCAAGTTCCCGGACCTCATCCATTCCGTGAAGCCCGAGCCCGACCGCGGCTTCCCTCAGGCCCAGTCGGCGCACGACAATTTCTGGGATTTCATCTCGCTCACGCCCGAGTCGATGCACATGGTCATGTGGCAGATGTCGGATCGTGCCATCCCCCGCAGCTTCCGCTTCATGGAGGGCTTCGGCGTCCACACCTTCCGCTTCATCAATGCCGAAGGCCAGTCGAGCTTCGTCAAGTTCCACTGGAAGCCCAAGCTCGGCATGCAGTCGGTGCTGTGGGACGAGGCGCTGAAGATCAACGGCGCCGACCCCGACCGGCACCGCCGCGACCTCTGGGAAGCGATCACCGCCGGCGATTTCCCAGAATGGGAGCTGGGCCTGCAGATCTTCGACGAAGAGTTCGCGAACCGCTTCGACTTCGATGTGCTCGATCCCACCAAGATCATCCCGGAAGAGCTCGTACCCCTCACCATCGTCGGCACGCTGACGCTGGACCGCGTCGTGGATAACTTTTTCGCCGAGACTGAGCAGGTGGCCTTCTGCACCCAGAACGTGGTGCCCGGCATCGACTTCACCGACGATCCGCTGCTGCAGGGACGGAACTTCTCCTACCTCGACACCCAGCTCAAGCGTCTGGGCAGCCCCAACTTCACCAAGCTGCCGATCAACGCCCCGCGTGGCTGCCCCGTCCACAATTTCCAGCAGGATGGGCACATGCAGACCTCGAACCGCAAGGGTCGGGTGAACTACGAGCCGAACGGCTGGGGCAAGGGCCCGCGGGCCCATCCGCTCGAGGGCTATGTCAGCTACGCCGTGCGCAACGAGGGCGAGAAGCGGAGGCTCCGGCCCGAGAGCTTCGCCGACCATTACAGCCAGGCGCGGCAGTTCTTCCTGAGCCAGACCCCGGTCGAGCAGGGTCACATCCGCGATGCGCTGGTCTTCGAACTGTCGAAGTGTCAGGAGATCGAGATCCGCGCCCGGATCGTCGCGCATCTCCTCAACATCGACGGCGCTCTGGCGCAGGCGGTGGCCGACGGGCTGGGCCTGACCGAGATGCCCGAGCCCGCTCCCGCGGCACGGCCGACGCGCACCGACCTGCCGCCGTCCGATGCGCTCAGCATGCTGAAGAACCCGCCGCCGAACTTTGCGGGCCGCACCGTCGGGGTTCTGGCGGGCAACGGCGTCGAGGCCGAGCTGCTGACCGAGCTGCGCTCGATCCTGCAGGCCGAGGGGGCGATGATGAAGCTCGTGGCCCCGCGCGTAGGCGGCGTCACCGACAGCGCGGGCACGCTTCATCCGGCGGACGAGAAGCTGGGCGGCGGCCCGTCGGTGATCTTCGACGCGGTGGCGGTGCTGCCGGGCTCGGATGCGGCGATGCTAGCCGCCAACCCGGCGGCGCAGGACTTCCTCACCGATGCGCATGCGCATTGCAAGTTCATCGCCCACCATGGGGCGGGGCCGCTGATCGAGGCCTGTGCGCTCGGGGCCAAGATGGACGAGGGTTGGATCGAGATCGGCTCGAGCAGCGACATCGACGGTTTCGTGGCGGCCTGCCGCGGGATGCGCCACTGGCCGCGCGAGGCGACGCTGGGCGGGAACCCTCCCCTCGCCCGGACGTTTTCCCCGTGACAGCCCTTTGGGCGCATGAAGAGAGAGGACCCACAATGGCTTATGCGAAAACTGGGACCCATGAGCGGCAGGGCCGCGGCTCGGCCCTGATGCTCACCTCGGCGGTGGTGCTGGGCGCCTTCGGGGCGCTCGCGGTGCGCCATCTGATGAGCGGCCGCCGCAGCACCCACTATCCCGAGGTGCGGCAGGCGGGACGCGACGCGATGCGCGATCCGCCCCGCGATTGGGACCGGCTGGACGAGACGCTGGACGAGTCCTTCCCGGCAAGCGATCCGCCCGCGACCTATTGAGCGGCTCCCGGACGACCTGCGCAGCATTGGAAGCTGGCCCCCGTCCGGGGGCCAGCTTCTGAGATTCTGGCCCTGCCCCTCGGGGCCGCCGGAGACGCGAGGATACACGCATGACAACAGATGAGGGCCGCAGGCCAGAGGAGCCCGGCACGCCCGCCAGCCTGCGGGAGATGCCGCGGGATCCGCGGATCGATGCCTCGATGGCCTTGATGAGCGAGGGCTACCGCTTCGTCTCGAACCTGTGCGACCGGATGGACAGCGACGCGGTGGCGACCCGCCTGCGTCTGCGCGAGGTCGTCTGCCTGCGCGGCAGCGCGGCCGCGCGGCTGCTCTATGGCGCGGAGGGGCTTACCCGCGTGGGGGCGATGCCCTCGACCGTGCTGCACCTCCTGCAGGACAAGGGCTCGGTGCAGCAGCTCGAGGGTCCGGCCCATCGGCATCGGAAGGCGCTGTTCCTCTCGATCTGCATGGATCCCGCCCGCGTCGAGGCGCTGGTGTCCGAGATGCGCCTTGCATGGCGCGAACGGCTGCCCGCCTGGGAAGCGGAGGGGAGGATCGTCCTCCAGCAGGAGGCGGCGCGCCTTCTGACCCGGGCGGGCTGCCGCTGGGCCGGGGTCGCCCACCAGCCCGAAGCGCAGCTCGCCGACGAGATTTTCGACATGATCGACAAGGCGGGCAGCGTCGGCCCGCGCAACTGGCTGGCGCAGATGCGCCGCGCCGGCACCGAGAAGCGCCTGCGCACCCTCGTGGAGGAGGTTCGGGCCGGCGAAGTGGTTCCGGAGGCCGCGACCGCCCTCCATGCGATTGCCTTCCACCGCGAGGAGGACGGCACGCTGCTGGATCCGTCCGTCGCGGCGGTGGAGCTTCTGAACCTGCTCCGTCCCATCGTGGCCGTGGGGCGCTACATCACCTTCGCGGCGCTGGCGCTGCATCGCGAGACGACATGGCGCGAGCTCTTCCGGTCGGGCAATCTGGAGCTGGCGGGCGACTTCGCCGAGGAGGTCCGCCGCGCCTCACCCTTCTTTCCCTTCACCGCCGCCGTGACCACCCGGCCCATCACTTGGGAGGGCTACGACTTCCCGGAAGGTCAATGGCTTCTGCTCGATCTCTACGGCACGACGCACGACCCGCGGCACTTCCCCGAGCCCACCCGCTTCCGCGCCGAGCGAATGCTGAGCTGGACGGGACAGGACGAGGCCTTCATCCCCCAGGGCGCGGGGGACGTGGCCCGCACCCACCGCTGCCCGGGAGAGATGATCACCGTCGAGCTGATGAAGGAGGCCATCCGCCTCCTCTGCTGCGAGATGGACTACGAGGTTCCCGCGCAGGATCTGGGCGTACGGCTGAACCGGATGCCGGCCCAGCCCCGCAGCGGCATGATCCTGTCGGCGATCTCGCGACGCGCGGGAACAGAGGCTTCCCGCAACGGTTGAGTCGCAAACGGAAAGGAGACTCCCATGCTCAAGTTCATCGGCGGCACCGTCGGCATCATCTTTCTCGTCGGCCTGCTCGTCATCATCGGCATTCTCGCGCTGATCTTCTGAGCGACGAGGCGACCACCGACACATGACTGCGGCGGCCGGATGGGCCGCCGCTCGACATCAGGGGCGGACGTCCGCTCCGATCAGCCCCAGTCCAGCACCACCTTGCCCGACGCGCCGCTGCGCATGGCGGCAAAACCCTCGGCGAAATCCGCCACAGGAAAGCGGTGGGTGATGACGCGCCGGATATCGAGGCCGTTCTCCAGCATCGCGATCATCTTGTACCATGTCTCGAAGATCTCGCGGCCGTAGACGCCCTTGATGGTCAGCGCCTTGAAGACGATCCGGCTCCAGTCCACGGGGCTGCGGCCGGGCGGGATCCCCAGCATCGCGATGCGACCGCCCATCACCATCGCTTCGACCATCTGGTCGAAGCCCGCGGGTGCGCCCGACATCTCCATCCCTACGTCGAAGCCCTGCACGATCTTCAGTCGGCCCATCACCGAGCGCAGATCCTCGGTGGCCACATTGACCGGCACCACATCGGCCACCTCGGTTGACAGCCGCAACCGGTCGGCATTGACGTCGGTGATGACGACATGGCGCGCGCCCACATGCCGCGCCACAGCCGCGGCCATGATCCCGATGGGGCCTGCGCCGGTCACGAGCACATCCTCTCCCACCAGATCGAAGCTGAGCGCCGTGTGCACGGCATTGCCCAGGGGATCGAGGATCGCCCCCACCTCGTCGTCGATGGCATCGGGCAGCGGCACCACGTTGAAGGCGGGAAGCCGCAGATAGTCGGCGAAGGCGCCCGGAACATTGACGCCGATGCCGCGCGTCTCGGGATCGAGGTGGAAGCGCCCGGCCCGCACCTGCCGCGAATGGTGGCCGATCAGATGGCCCTCGCCCGAGCAGCGCTGGCCCGGGCTGAGGTCGCGCACGTCGCGGCCCACCTCGACGATCTCGCCCGCAAACTCGTGCCCGGTGACGAGCGGCACCGGCACCGTCTTCGCCGCCCAGTCGTCCCAGTTCCAGATATGGACATCGGTGCCGCAGATCCCCGTCTTGCGGACCCGGATGAGCACCTCGTCGGGGCCGATCTCGGGCACGGGCCGCTCTTCCATCCAGAGGCCCGGTTCCGCCTTGGCCTTCACCAGTGCCCGCATCAGATCGCCCCCACCGCCCTGCCTGCCTCGCGGAACGCCGCGAGGGCCAGATCGAGATCGTCCCGCGTCAGAGCCGCATTCATCTGGGTGCGGATGCGCGCCTGCCCGCGCGGGACCACCGGGAAGAAGAAGCCCGAGACATAGACCCCGCGGTCATAGAGCGCCGCCGCCATCTCCTGTGCGAGCCGCGCCTCGCCGAGCATCACCGGCACGATCGGATGCGCGCCGGGCAGCAGGCGGAAGCCCGCCTCGGTCAGGCCCGCGCGCCAGTAGGCGGTGTTCTCGAAGAGCGCGGCGCGCAGATCGTCGGCGGCCTCGACCAGATCGAGCGCGGCGAGACCGGCCGCCACCACCGCCGGCGGCAGCGCGTTCGAGAAGAGATAGGGCCGCGCCCTCTGCCGCAGGAGATCGATCACCGGCTGCGGGCCCGCCACATAGCCGCCGAGTGCGCCGCCCAGAGCCTTGCCGAGCGTGCCGGTCAGAATGTCCACCGTCACGCCGTGATGGGCGGGCGTCCCCTGCCCCTTCGGCCCCATGAAGCCCGTCGCGTGGCAATCGTCCACCATGACAAGCGCGCCGAATTCCTCGGCCAGCGCCACGATCTCGGGCAGCCGGGCGAGATAGCCATCCATCGAGAAGACGCCGTCGGTCGCGATCAGGATATGGCGCGCGCCCTCGGCCCGGGCCTCTGCCAGCTGCCGGCGCAGATCCTCCATGTCGGAATTGGCGTAGCGGTAGCGGCGCGCCTTGCAGAGGCGGATGCCGTCGATGATGCTCGCATGGTTGAGCGCATCCGAGATCACCGCATCCTCGGGGCCGAGCAGCGGCTCGAAGAGACCGCCGTTGGCATCGAAGCAGGCCGCAAAGAGGATCGCATCCTCCATCCCGAGGAAGCGGGCGATCCGGCCTTCGAGCGCACGGTGAATGTCCTGCGTGCCGCAGATGAAGCGCACCGAGGCCATGCCGTAGCCGTGCTCGTCCATCCCGCGGCGGGCGGCCTCGATCAGCGCCGGATGGTTGGCAAGGCCCAGATAATTGTTCGCGCAGAGGTTCAGGAGGTCGCGATCCCCCACCCGCACATGCGTCCCCTGCGCCGAGAGGATGGGCCGTTCGCGCTTCGTGAGACCCTCGGCCTCGATCCCCTCCAGCAGCTCGCGCATCCCCGCGAGAAAGGCGTCCCGTCCGCTCATCGCCCTCTCCCTTCCTGTCGGCCGATCCTACGACACCCGGCGGCTCACGTCCCGCAGAAGGTGCGGTAGCTGTCGTCGAACCCCTCCGGGGCCGGTAGGGCGAACAGCTCGCGGTCGGCCCGTTCGGTGAAGGGGTCGGCCAGCGCCTCGAGCAGCCGGTCGAAGGGCGCGAGATCGCCGGCATGGGCCGCGGCCAGCGCCTCCTCGACCCGGTGGTTCCGCGCGATGTAAATCGGGTTTGTCGCCCGCATCCGCTCGGCCGCGTCGGGCGCCAGCCGGGCCCGCCAGCGCGGCAGCCAGCCCGCGAGCGCGGCCGGATCCCGGAACAGGGGGCGGAGCGCGCCTTCGTCGGTCACCGCCTCCGCGAGACGGCGGAAGGTCAGCGTCCAGTCGGCGCGCTGGCTACGCATGGCCTCCAGCAGATCCTCGGCAAGCCGCGCATCGCCCTCTTCGGCCCCGGCCAGCCCGAGCTTGGCGCGCATGCCCGCAAGCCAGTGGCCCTGATAGCGCGCGCCCACCGTTTCCAGCACGGAAGTGGCCTTGTCCGCCGCCCGCACCGGATCGCCATCGAGAAGCGGCAGAAGCGCCTCGCCCAGCCGCGCAAGGTTCCAGGCGAGGATGTAGGGCTGGTTGCCATAGGCATAGCGCCCCTGCAGGTCGATGGAGGAGAAGACCGTGCCGGGATCGTAGCCCTCCATGAAGGCGCAGGGACCGTAGTCGATGGTCTCGCCCGAGATCGTCATGTTGTCGGTGTTCATCACGCCATGGATGAATCCCACCAGCATCCAGCGCGCCACGAGCTGCGCCTGCGCCTCGGCCACCGCTTCATAGAAGGCGAGATAGGGCTCGGGCGCGGAGGCGAGTTCGGGGTAATGCCGCGCGATGGCGTAGTCGGCCAGCCGTCGCACCCGGTCGATGTCGCTGCGGGCGGCGAAAAACTGGAAGGTGCCCACGCGGATGTGGCTCGCCGCGACGCGCGTCAGGATCGCGCCCGGACGCTCGCCCTCCTGCCGCAGGAGCGGCTCGCCGGTGGCGACGGCCGCCAGCGCGCGGGTGGTGGGGATGCCGAGCCCGTGCATCGCCTCGCCCACCAGATATTCGCGCAGCACCGGCCCGAGGGCCGCCTTGCCGTCCGCGCCGCGCGAAAAGGGCGTGCGCCCCGAGCCCTTGAGCTGAAGGTCGCGCCTCCGGCCCGCGCGGTCGGTGATTTCGCCGATGAGCAGCGCCCGCCCGTCGCCGAGCTGGGGCGAGAAGCCGCCGAACTGATGGCCCGCATAGGCCTGCGCCAGCGGGTGCGCCCCCTCGGGCAGGCGCCTCCCCGAGAAGATCTCCGCCCCCTCGCGCTCGAGAAGGTCGGGATCGAGCCCCAGCTCTTCGGCCAGCGGGCGGTTCAGCCGCAGAAGCCGCGGCGCGGGGACCGGCGCGGCGGGCCAGTCCACATAGAAGCCCTCGAGGTCGCGGGCATAGCTGTTGTCGAAGCGGAAGGTCATAAGGGTCCTGCTCGGGGTCTCTTTCACATATGCGCCCCCGGGGCGCTCGGAACAAGCGGGGGGCGGGAAAGAGGCAGGACCGCGGGTCAGGCGCCCGCGCGAGGCACCAGCCGGATCACCTGAGCGCCGGGCCGGTTCGCGCGGAAGACCGGCAGCTCGTCGAGGAGCCGCTTCTGGATGTCGGCCACCGCGCATTCGCGCCGCAGGATCTCGGGCAGGCAGCTACCGCAGAGCCCGATGAGACCGCCCACCACCGCCGCGACCAGCTGCTCCTGCCGCTGCGACAGCAGGCCGTCGAGCACGCCGCCCTTGCCCACGGCGAACATGGCCGCGAGCGCGAGGAAGCTGCCCGAGAGGATGAAGACCGCCCGGCGATAGGCGAGCGTCCGTTCCACGAGCCGCCGGGTGAAGCGGTGACGGCGCACCGGGTCGATCCGCTCCCATCCCGCGCTGGGCATCGGCGTGCCCGCCCGCAGCACCAGCACCGAGGCCACGACGGGCAGCACGGCGATCAGTGGCAGCATGAGCTCATCCAGCGCGTGCGGAGGCACCAGCACGCCGAACAGCGATGCGAGCCCCGCAATCGTGCCGAAATGAAGCAGCCGAGTCATCCGTGATCCCTCTATCGGTGGAATGTGGAACCTGCCTCCGGAGAGGGCCCGAGTCAAATGATGGCGGTGTGTCCCGGCGCGATCTCGCCGCAGGGCAGCAACCGGATCGGCGGCAAGAAGCCCTGCGTGCGGCAGCCCGTAGCCTGCTGCCTGATTTTGCGCCGCTTCGCCTGCAGGGCAAGCAGGGGTGGCCGCCCCGGTCACGCGCCTTCACCCGTGCAGGCGGCGCGGGTCATGCCCCGTCGATGGCGTCCGGCCGACGCTCGAGAACACCCAGCAGCGCGCCTGCGAGCACGACGAGGCTCGCAAACATCAGCGGCGCATCGAGACGCCCGGTGAGATCGGCCGTCCAGCCGAGGACGACCGGGCCTCCGAACTGCCCCAGCGCGAAGGCGATGGTGAAGGCGCGGATCGCGCGGCCCCGATCGAAGGCCTGCGGCAGGGCGCTGGCGAAGGCGCTGGTGGCCGCCACCGTGCTGAAGAAGGTGCTGCCGAACAGGAAGGCCGAGAGCCAGAGGCCGAGCGCGCCCGGCATCAGGAACGGCAGCACCGAGCCTAGCGCATTGGTGGCCACGAGGAGCGCGAAGCTGCGCTCGGGGCTCGCGCCGCCGATCAGCCGCCGCCAGATCGAGGGCGCGGCCACCGCCGCGAGACCCAGAGCGCACCAGAAGAGCATCGCCTGCCGCCAGCCGCCCGCGCTCTCGGCGAGGTGGCCGTAGATGAAGGTCATGTAGCCGATGGAGCCCAGGCCGAAGAGGAGATAGCCGGCGAGGATCCTCCAGAAGCGGAGAGGCGTGGGGCCGCGGCTGCCGGCCTGCCGCACGGAGGCGCCGAGCCCGTCCCTGAGCGGCAGGAGCGCCAGCGCGGCGCAGAGGGCGGAGAGGCCCGCGAGGATCAGCCAGCCCTGCGGCCAGTGGGTGGCGCCCGCGATCCCGAGGAGGGGCGCCACCGCCAGCGCCGACAGGATCATCCCGAGCCCGGTGCCTGCGTAGAAGGTGCCGAGCACGAGCCCGCCCGAGCCCGCCCGCAGGCTGAGGCCCACGGCAAGGAGGCCGCCGCAGACGAAGACGACCCCGCCGGAAGCGCCCGCGAGCAGGCGGAGCGCGGCCAGTGCCGCCACGTCCCGCGTCAGGGCCACCGCGGCCAGCGCCGGCAGCAGCATCGCCAGCCCGGCGGCGAAGGCGCGCGCCGCGCCCACCCGCTGCGCCAGCGCCGGCGCGAGCATGGCGCCCCCGAGATAGCCCGCGGCATTGGCCGCGTTGATCCAGCCCGCCGCGGCATAGCTCCAGCCGAGGTCGGCCTGCATCGCCGGCAGAAGCAGCGCATAGGCGAAGCGGCCGAGCCCGATCCCGGCGGCGGGCCCGAGGCTCAGGAGAAGGGCATTGCGGTAGGCGCTGGTCACAATCCGGCCGTGGCTTTCGTCTCGAGGTAGTCGGCGAGGCCCCAGGCGCCGTGTTCGCGCCCGTTGCCCGATTGCTTGTAGCCACCGAAGGGCGCGCGCGCGTCCCAGCCCGCGCCGTTGATATAGACCATGCCCACGCGCAGCTTCCGCGCGATGCGCCGCGCCGTCTCGGGGTCGCCGGTCTGGATATAGCCCGCAAGCCCGTAGATCGAGTCATTGGCGATGCGGACCGCCTCTTCCTCGTCCCGGTAGGGAATGAGGGCCACGACGGGGCCGAAGATCTCCTCGGTGGCGATGGTCGAGCCATGGGCCACGTCGGCAAAGACGGTCGGGCGCACATACCAGCCACGCGGCAGATGGTCGGGCCGGCCCGGTCCCCCGGTAACGAGGCGCGCGCCCTCGGCGATGCCCGCCTCGATCAGCGACTGGATGCGGCGCCACTGGCTTTCGTTCACGACGGGGCCGAGATCGACCTCGCCCTCGGGCGCCCCCACCGTCAGCGCCTCGGCCGCCGCGCCTGCGAGGGCCGCGGCCTCCTCCATCCGCTCGGCCGGCACCAGCATCCGCGCGGGCGCCTTGCAGGCCTGTCCCGCATTGCCGAAACAGTCGAGCACCCCCTGCCGGACGGCGGTGGCCAGATCCGCATCGGGCAGGATGATGTTGGCCGACTTGCCGCCGAGCTCCTGCGTCACGCGCTTGACCGTGTCGGCGGCCGCGCGCGCCACCGCGATCCCTGCGCGGGTCGAGCCGGTGATCGAGATCATGTCCACGTCCGGGTGCCGCGCCAGCGCCTCGCCCGCGACGGGGCCTGCGCCGGTGATGTGGTTGTAGACGCCCGGCGGAAAGCCCGCGGCCTCGACCAGTTCGGCGAAGCGGATCGAGGAGAGCGGCGAGAATTCCGAGGGTTTCGCCACCACCGTGCAGCCCGCGGCGAGGGCGGGGGCCACCTTCACGACCAGCTGGTTCATCGGCCAGTTCCACGGCGTGATGAGGGCGCAGACGCCGATGGGCTCCTTCGAGATCAGGGTCGAGCCGCGCATCTCCTCGAAGCTCTCGGCCTCGAGAGCCTCGATGGCGGCCTCCAAATGGGCGCGCCCGACCCAGGCCTGCGCCTCGCGACTGAAGCGGGCGACGGTGCCCATCTCGCGGGTCATGAGCTCGGCCAGCTCGTCGTAGGCCTCGTTGTAGAGATCGAGGAGACGGCGGAGGAGGAGGAGACGCTCCTCCTTCGAGCTGGCCTGCCAGCCCTCGAAGGCGGCGCGCGCCGCGGTGACGGCGCGGTCGACATCCTCGGTCGAGGCCATCGGGATCGCGGCGATCTCCTCCTCCGTCGCCGGGTTCACCAGCCGGTGCCGGTCGGTGCCGAGGGGGGCCGTCCAGGCGCCGCCAATGAAGAAACGGTCGATGTTGCGCATGAGGGGCCTCCTGTGGCGGCGAAGAATGGCTGTCGCGGCCCGTAAGATGCCGCGAGAGGATGGTTGGGGAGCATGTCCGGTGCGGGGGGCCGGCCACGGGCGGGCCACAGGTCGCAAGCGCCCTGTGCGGATCCTCGGACCCGCCAACGGTGCGTCCGCTTATCGGGGGCACCGCGTGGTCCCGCGTCGGAGGAGGAGGGTTGGGGTCCGCGGCGGAGATGCGCCGCGGACGAAGGTGGAACCGCTCTGCGGTCTAATGGAAGTAGATCCCGCCGCAGACGTTCAGCGCCTGCCCGGTGACGAAGGCCGACTGCTCGCCCGCGAAGAACAGGACCGGCCCCACGATATCTTCGGGCGCGCCGAGACGCTTCAGCGCGGCCACCTCTTCCCAGTGGCGGATGGCCTCGTCCGAGCCGAGGTTGTTCTTGCCCATCTCGGTCAGGATGATGCCCGGGCAGATGGCATTGACGGTGATCCCGTCCATGCCGGCCTCCTGCGCCAGCACCCGCGTCAGGGTAATGACGGCGGCCTTGGTCGCCGCATAATGGCCTTGGGTCGGCACGCCCTGACGGCCCGCGATGGAGGCGATGTTGACGATCCGCCCCGCCTTGCGCGCGCGCATTCCCGGCAGCACCGCCTGACAGCAGGAGAGGACGCCCTTCACGTTCACGTCCATATGCGCGTCCCAGTCGGCCTCCGAGAGATCCTCGAGCCGCGCGGGCTTCAGGATGCCCGCATTGTTCACCAGCACGTCGATCCCGCCCGCGGCGTCCTCGATCCGCGCCATCGCGGCGACGACCGCGGCCCGGTCCGAGACATCCACCGTCTCGACGAAGGCGCGCCGGCCCGCGGCCTCGATCAGCGCGCGCGTCTCGGCCAGCCCGTCGGCCTGCGCGGCCAGATCCGTCACCGCCACATCGTAGCCCGCCTCTGCGAGCCCCACCGCGAGCGCGCGCCCGATCCCGCGCGCCGCCCCCGTCACAACCGCCGTCTTCGTCATCCGTCCGTCCTCAGTTCGCCCGCAGGGGCAAGGTGTTCATGGCCTCGATCGTGTCCTCGAACGAGACGAGCTTTCCGTCGGAGCCGAGCCCCATCGCGACCCGCGCCGCCACCGCCGAGGCGAAGCGGCAGCAGTCGCGCAGGTCCCAGCCGCGGGCGAGGCCCACGATCACGCCGGCGGTGAAGCTGTCGCCGCAGCCCGTGGTATCGACCACCTCGATGGCATGGGCGGGCAAGCGGAAATCCTCGCCCGCCTCGGGCGAGACGTAGACCCCCTCCCCGCCCATGGTCAGGATCGCGTTCTTCACGCCGAGGCCCTTGAAGAAGCGCGCCACCTCGGCCGGATCGGTCGTGCCCGCCATCTCGGAGGCCTCCTCGATCGAAGGCACGAAATAGTCGATGTAGGGCAGGCAGGGGCGGACCAGCTCAAAGGTCTCGGGCGTGGCCTGGATCAGATCGAAAGTGGTGATGCGCCCGAGCTCCTTCGCGCGGCGCAGCGCCTCGACCGTAGGGGCGCCGTCGAAGCTCTTCAGAAGGCCGGTGCCACCGACATGGACGATGCGCGCATCGAGTGCGGCGTCGAGCCGGCCCTCGGGGATGCGGAAGGTGGCGGCGGTGCCCGGCACATGGAGCGCGGGCCGCTCGCCGTTCGGGCGCACCGGCAGGATGGTGGCCGACGTCTGGACCGAGCCGTCGCGCGCCACCATGGCGCAATCGACGCCGAAGCGGGTGAGCTTGGCCACCATGAAGTCGCCCATCTCGTCCTCGCCCACCGTGGTGACGGCGAGGGTCTTCAGGCCGAGGATGGCGCAGTCGGCGGCGGTGGCTCCGGCGGTGCCCGCGACCGTCATCCGGATCTCCTGGATGTAGTCGGCGCGGCCGCCGTCGGGGATGCGGGTCACGGGGCGACCGAGGATGTCGCAGACATAGAACCCGATGGAACTCACGTCATAGGTCGGCATGATCTCTGTTTTCCGTTAGTGGCGGCGCTGGAGGCCGAAGCTGAGGGCGAGCACGGCGAAGACGAGGACCCCGATCCCCACCTGCTGCCAGTAGAAGTTCCAGCCGATGAGCAGGAGGCCGTTCTTGACCATGGCGAGGAGCGCCACGCCCAGAAGCGTGCCGAGGATCGAGGGCCGGCGCTCGCGGCTGATGGTGGTGCCGATGAAGGTGGCGCCGATGGCATCGAGGAGGAAGGCATTGCCCGACATCGGCACATAGGAGCGCACCGTGGCCGAGAGCAGGATCCCTGTGATCCCGGCGAGGACGGCCGCGAGGATATAGACCTGCGTCAGCCGCGAGGGCACGCGGATGCCGGAATACCAGGCCACGCCCGGGCGCGCGCCGATGGCCTGCACCTGCCGGCCGAAGCCCATGCGGTGCAGCGCAAGCCAGGTTGCGAGGACGCACAGGATCAGCACCCAGACCGGCGTCGGCACGCCCAGAAGGCTCGAGCGCGAGATGGCGTTGAAGATCGCGGCATACTCCTTGGTCACGAGATAGATCGGCTGGCCGCCGCCGGTGGCCAGACGCTGGACCGACTGGCCGATGAAGAGCACGCCCAGCGTGGCGAGGAAGGGGCTGATCTTCAGCCGCGCGATGAGGATGGCGTTCAGCAGCCCCACGAGGAGCGCCGCGCCGAGACCGGCGGCGAGCCCCAGCGGCGCCGCCACGCCCGCGGCCAGCGCCGAGACGAAGACGAGGCTCGCCATGTCCACCGACACGCCCACCGAGAGGTCGATGCCGCCCGAGGAGACGACGAAGGTCACGCCGAGCGCCACGATGGCGAGGAGGACCACGTTGTTCACCAGCACGTTGCGCAGGTTGCCGAGCGTCAGGAACGCATCGGCCTGCGTGGCGAAGACGAGGAACAGGCCCACGAAGGCCACCAGCACGGCCAGCGTCGAGAAGCTCTCGCGGTCCTTCGGCAGATGGCGGGAGAGGGAGAGGCTCATGGCTCAGTGCTCCTGCGGACGGGCATAGGAGGTGACGGCGACGACCAGCAGGATCAGCGCGCCCTGCACGCCCGAGACCCAGTAGCTCGACAGATGCAGAAGCTGGAAGCCGTTGGCGAGAAAGCCGATGAAGATCACCGACAGGACGGTGCCGCCCACGGTCGGCACGAAGCGGCGCGAGAAGACCGTGCCGAGAAGCGCCGCGGCGAGGATCGAGAGCAGAAGCTCGCCCGTCCCCGGCGTCGAGGCCGAGAGGCGCGAGACGATGAGGATGGCCGCCAATCCCGCGCAGAGGCCCGCGAAGACATAGGTGCCCCAGACATGGAGACCCACGTTGAGGCCCGCCGCGCGCGCGGCCTCGGGATGGCCGCCCACCGCATAGAGCCGGAGGCCCACCGGCGTGCGGTGGATCACGAGCCCCACGATCACCGAGGCCGCGATCAGCACCCAGGCCAGCGCCGAGATCCCGAGGAAGCGGCCCGAGGCGAGGACGCTCAGCAGCGGCGAGCTGGTCGAAAGGACCGTGTTCTCGGTCAGCGTGAGTTCGATGCCGGCCACGATGTTCATCGAGGCGAGGGTCGCCAGAAGCGGCAGGATGCCGAGACCCACCACGGCAAAGCCGTTGACTGCGCCCACGAGCATCCCCGTGCCGAGCGTGGCCGCAATCGCCACCGGCGCGGCCATGCCGGCGTTGGTCGCGGTGGCGAAGACCGCGGCGCAGAGGCCCGCATTGGCGGCCAGCGACAGGTCGATGCCGCCGGTCTGCACCTCCGAGCCGCCGCCGATGATGACGACCGACATGCCGAAGGCGAGAATGCCGAGGATCGCCGATTGCTCGATCACGTTGACGATGTTGAAGGGCGTGGCGAAGCCGCGCGCGGTCAGGGCGAAATAGGCCATGATGGCCGCGAAGGTGGCGAGCGCGCCGAGCCGGATCGCGGCTGCGGCGAGCGCCTTGCCGCGGCTGGCGCCCGCGGGGCTCTCGCTGGAGGACGGGGACAGGGCCATGGTCATTCAGGCCTCCTTTCGTTGGGCCCGCGCGCCCGAGGCGGCGGCCAGGAGCAGGTCGCTGTCGAGCGTCTCGCCCGCGAACTCGCCGTTCAGCGTGCCGCGGTAGACGACGAGGGCGCGGTCGCAGAAGCCCGCGATCTCGAGCAGGTCGGAGGAGAGGAAGAGGATCGCCGCCCCTTCCGCCGCCAGCCGGTTGAGCAGCGTGTAGATCTCGACCTTGGCGCCCACATCGACGGCGACGGTCGGCTCGTCGAGCACATAGACCCGGCTCTGGCACGAGAGCCATTTGGCGAGAGCCACCTTCTGCTGGTTGCCGCCCGAGAGGTTGCGCACCAGCTGGTCGCGGTGCGGCGTCTTGATCGAGAGCTCGCGGATGAAGCCGTCCACCGCCTCGTTCTCCCGCGCGCGGGAGACGAAGCCGCGGGTCATGTAACGCTCGAGGCTCGCGATGGTGATGTTGTCGCGCACCGACATGTCGGTGGCCACGCCATGCGCCCGCCGGTCCTCGGGGACGAGGGCCACCCGGCCCTGCACCGCCCGGCCCGGATTGGCGAAGCGGCGCACCTCGCCGTCGATGGTGACGCTGCCCGCATCGGGCTGCTCAAGGCCGAAGAGGCACTCGACCAGCTCCTTCACGCCCGAGCCGAGGAGGCCCGTGATGCCCAGCACCTCGCCCGCGCGGACCTCGAAGCTCACGTTGCGGAAATGGCCCGCCTGAGACAGGCCCTCGACCCGCAGCACCGGCGCGCCGAGCGCGTGGCTGCGGCAGGGGAACATCTCGCCCACGTCGCGGGCGATCATCATCGAGACGATCTCGTCGATCGAGGTCTCGCCGGGGCGCACGACGCCCACGTCGGTGCCGTTGCGCATCACGGTCACCTCGTCGCAGAGATCCTCGATCTCCTGCATGTAGTGCGAGATGAAGATCACGGCGATGCCCTGCGCCCGCAGATTGCGCAGCACCGCAAAGAGGCTGTCCACCTCGCGCTTCACGAGCGCCGCGGTCGGCTCGTCCAGCACGAGCACCTGCGCCTCCTGCGCCAGCGCCCGGGTGATCTGCACGATCTTCTGCTGCGCGGTCGTCAGGTCGCGCACCAGCGTGTCGCCCGGCAGCTCGAGCCCGAAATGGGTGCGGATCAGCTCTTCGGCGCGCCGCTTCATCGCGCCGGGGCGCAGGAAGGGGCCGAAGCGCAGCTCGTAGTTCAGGAACACCGCCTCGGCCACGGTGGCCGTCGGCACGAGAAGCCGCTCCTGATGGATGAAATGGACGCCCAGCCGCTCGACCGAGGCGGGCGTCAGGCTCTCCACCCGGGTGCCGTTGATGGTGATACGGCCGCTGTCGGGTTTCAGGATCCCGGCCAGCACCTTGATGATCGTCGATTTGCCGGCGCCGTTCTGGCCCACGAGCCCGTGGATCGTGCCGCGCGCCACGCGGAGGTCGGCGTCGACCAGCGCCTTCACCGGCCCGAAGGCCTTGGAGATGCCGGTCATGTCGATGGCGAGACCGCTCATGGACGTGCTCCCGAGGAATGATGGCAGGAACGGGCGGGGCCGGAGAAGGGAGGAGAGACGGCACCCGCCCGAGGAAGCCCCTCGCGGGGCTCCGCAGCCGGTGAGGCGGCCCTGTCGGGCCATGGCGCGCGGGCCTAGGCCCGGTCCCGTTCGGGGTCCGGGCTCTGCCCGCGCGCGACCGGCTTACTTCGGCAGGAAGTCCGCGGCGGTCTCGGCCGCATTCTCCTTGGTGATGAGCACCGCCGGCGCGAAGGTGAAGGGCTTCACCTCCTGCCCCGCGAGGTGACGCGCCACGTTCTGCACCGCGAGCTTGCCGATCTCGGAGGGCTGCTGCGCCGCCACCGCGCCCGCGGGGCTCTCGGGATCGGCCACCATCTCGACGAACTCGGGGCTGCCGTCCACGCCATAGGTGCGGATGTCGGTGCGCCCGGCCGCCTGCAGCGCCTGCGTGGCGCCGATCATCGGCACGTCCCAGCAGGCCCAGATGGCGCCGACATCGCCCTCGTTGGGGTATTTGGTCAGCATGTCGGTGACGTTGGAATAGGCCGACTGAATCGTGTTCGGGATCACGTCGCGCAGCTCGGGCTCGATGATCTTCACGTCCGGGAAGGCCTCGAGAACGTATTTCATCTGGTCGTAGCGGATCTTGCAGACCGGCACCGAATAGAAGCCGTTGAAAACCAGCACATTGCCCTTGCCGCCCAGATCCGCCACCATCTGCAGCGCCAGCTCCGCCCCGATGGAATAGTTGTTCGAGGTGGTGTTGTTGATCGCGTGCGGGGTCGCCGTATCGACGGTGAAGAGCGGGATGCCCGCGTCGTTGATCTTCTGCAGCCACGGGTTCAGCACGTCGAGGTTGCCCAGCTGCTCGATGATCGCATCGGGCTTCTGCGCGATCAGCGTCTGGATCTGGCTCACCTGGGTCTGGTCGTTGCGGCCCGCATCGAGCGCGATGGCCGTTCCGCCCAGCCGCTCGATCTCGGCGATCTGCGCCTGATAGGCCTTGAGGTCCCAGTCGTGATCGGTGCCGATCGCGGTGATGCCGATCGTCTTGCCTTCGAGCGTCAGCTCCTCGGCCAGCGCGGGCGACAGGGTCGCGAGCAGCAGAAGCGCAATCGTCAGGCCCTTCGCGGTCTTGGTCGTCATGTCTCTTCCTCCCTATGCAACCGGCTCCTCCGCCGGCCTTGTCCCTTGCGGGGTCCTTCTGCCGCTCAGCCCTGGTGGCCGTAGCCGGCGATCTTGTCGATCACGTCCTGGATCTGCCGCTCGCTCAGCAGCCGCGGCTCGCCGATCTGAAGGCAGCCGTGATACTGCCGGGCCAGCGTCTCGACCTCGACGGCGAGCCACATGGCCTGCGCGAGGCTCTTGCCCACGGCGATCATGCCGTGATGTTCGAGAAGGCACGCCTTGCGGTCGTGCAGCGCCTCGACCGCGTTCCGCGACAGCTCCTCCGAACCGTAGATCGCGTAGGGCGCGCAGCGGATGTTCGGGCCGCCCACCACCGCCAGCATGTAGTGGATCGAGGGGATCTCGCGGTTCATGATGGCGAGCGTGGTGCAGTAGGTCGGATGCGCATGCACCACCGCATTCACGTCCGCCCGCTCGCGCAGGATGTCGCGGTGAAAGCGCCATTCGCTCGACGGCTTGTGCCGGCCCCGCACCTCGCCCTCCATCGAGACGAAGACGATGTCCTCGGGCACCAGCGTGTCGTAGGGCAGCGACGTGGGGGTGATCAGCATCCCCTCGCCATGGCGCACAGAGATGTTGCCCGAGGTGCCCTGATTGAGGCCCGAGCGGTTCATCTCGAGACACTGGCGGATGATCTCTTCGCGCTTGGCGAGGTCGGACAGATCGCTCATCTCGTTCATCGCCTCACCAGACCGTGTAACCGCCATCGACCGCGAGGATCGCGCCCGTGACATAGCTCGCCGCCGGCGAGGCAAGGAACAGCGCCGCCGCCGCAATCTCCGACGGCTCGCCGCAGCGGCCCATCGGCGTCATGTCGAGCCATGTTCCGAAGAGCTCGGGCCGCTCGCGCATCTTCAGCGTCATCTCGGTCGCCACATAGCCCGGCGCCAGCGCATTCACCCGCACGCCCCGGCCCGCCCATTCGGCGGCGAGCGCCCGCGTCAGCTGATGGACCGCGCCCTTCGAGGCCATGTAGCTCGAGGCGAACTGCGGGCGGTTGACGATGGTGCCCGACATGGAGCCGAGGTTCACGATGGCCCCTGCCCCGCGCGCGACCATGGCGCGCCCGAAGGCGCGGCTGGCCCAGAACATGCCGTCCACATTGACCGCCATCACCTGCCGCCAGGTGGCGTCGTCGGTCTCGAGCGCATCGTGGAGGCGGGCAATGCCCGCGGAATTGACGAGGATCGACACCGGCGCCACGGCTTCCGCCGCGGCGGCCGCGGCGGTCATCGCCTCCGCATCCGTCACATCGGCCACGATCCGCGAAGCCACCGCCGCGCCGAGCTCTTCAGCTGCCCGGTCGAGGGCCGCGCCCTCGCGGTCGATCAGGATGAGCCGCGCGCCCGAGGCTGCGAAGGCCCGGCAGATTTCGAGGCCGATGCCGCTGCCCGCTCCGGTGACGGCCGCGCAGGCGCCATCGAGACGAAAAACCGTCCTGTAGTCCATTCTTCCTCCCTGACGCGGAGCCCGGACCCGAAGGTTCCTCCCGAGGCTCGGCCGCGTCCTTCCCAGCGCCCGACGTGCGGTTTCTGGTCGTTACCAGAAAAAGGTAGGAGGCAAATCTGCTTCTGTCAATGTGGTAATGATCAGAAATCGAAGATCGGGGCCGCTGGAGGTTTTTCGCTGTGACGACAGGCGCTTAATGGGCGTCGGTTGCGACGTGGATCGGGGCGACTTCGCTGTTGTAGTAGGCGCGGTAATATTCGAGCGCATTGCCGTCGTGGTCGTAGGCGACCGACTCGATGGCGATCATGGCGCCGCCGGGCGGCACGCCCATCCGCGCCGCCACTTCCGGCGGAACGATGGCGCCCTTCAGCCATCGATCCGCCCGCGCGACCGTCAGCCCGTAGAGCTCGCGGATCGTGCCGAAGATCGAGCGCCCTTCCATATCGAGCTTCTCGAAGCCCGGCAGCCGGTGTGCGGGCAGCGAGATCAGCGTATGGGTCAGCGGCGTCTCGTTCGCGTGATAGACGCGCAGCGCCCGGATCACCTGAAATCCCTCGGGCAGCTTGAAGACGCGCCGTTCGTCCTCGTTGGCCGGATGCAGGCCGAATTCGTAGGTCTTGACCGTGACCTTGTAGCCCTTGGCCGAGAGATCGTCGAAGACGCCGAGCGCCGAGGTCATGAAGGCAAGCTCGGGCGCGCGCGGAGCCACGAACATGCCCTTTCTCGGCTGCTTGATGACCCGCCCCTCGCCCGCGAGCGCCTGCAGCGCATTGCGGATCACCGGGCGCGACAGGTCGAACATGCTGCACATGGCCTGTTCGGAGGGCAGGCGCGCGTTCTCGGGGAGGGTTCCGTCGAGGATGGCGTGCTCGATCCGGTTCTTCAGCTGCACCCAGAGCGGGCTTGGATCCTCGCGCTGAAGGGCGACGCCCGCAGCGAAGGCGCGGAAGGCCTCGGCGCTGTCGCCGTCCTCCATGTCATAGCTCGGACGCCCGCGGCGCTGAACGGTGCTGTGCTGTCCCTGAGCCAATCGCCGGATTTCGCCCCCTCGCCTCTGGCTGCGGATCCATCCCGCAGCCGATCAGGATGATCCGGCACGATTATTGGTTATGACCAGTATGTCAACCCTCGGAGGTGTCAGAGAAACCTCTGCCGGGCGCGGGTCAGTGCCGGCCTGAAGGATCGGGGATGGTCGGAGTGGAGGGAACATAACCCGGTTTTTCTCCAACCCCATGAAATTCACCAAGTAACTGATTTTAATTGCCTATCAGATCCAAAACTGTAACACATGACTGTAACACACGGCTAGTCCACATGGGGCAATCATGGCACGTCTTAGTCATCTTTTGCGGCGGGGTGCCGCCTACTACGCCCGGACCCGCGTTCCCCTCGACCTGATCGCCATCGTGGGCAAGAAGGAGCTGGTGAAGGCCCTCGGAACGAAGGATGAAACCGAGGCGAAGCGGCGCCTCTACCCCCAGCTTGACGCATGGCAGCGCGAGTTTGACGACCTGCGCGCGAGACGGTCGCTTGTCGCTGCTGACCGCGAACATGCAGTCTGGGACCACTACACGGCCGCCCTCACCCGCGACGAGGCCGAGCGGGCAACGCTGCCGGGCGAGGCGGAGATTGCAGCCGCCGAGGCTGACGTGATCCGGCGCGCCGACCGTGGAGAGATTACCAGCGCCGAGCCGCTGGCGATCCTCGACGCGACTCTTGAGCTACAAGTGAAGCAACAGGCCCCTGCCCTCGCCTCGGATACTCGCAAGGCGAAGCTGGCCGACCTGCGCGAGCATTTGGTGAAGGGCGAGACGGCATTGATCTCTCACGAGGTTGACGACTACCTGAACCGCAACCGGCTTCTGGTGGAACGTGGAACTCCTGACTGGATCAGCTTAGCGAGGCGCATGATGCGGGCCGAGGCCGAAGCCCTCGAACGCACGCTTGAGCGGGACCGGGGCGACTACACCGGCCAGCCACGCGACCCGCTGGTGAAGCCCCCTACCGCAGAGAAGCGCGCCGAGATGGAAGTTGCGCCGCCGGGCGAGAGCATCGCCGAGGCCGTCGAAGCCTTCCGAACCGAGAACCCCCGGAATGTGTCCAAGGGCAGGATCGAGGAAGCATGTCGCGACATCGGCGTTTTCATGGAGACGGTCGGTCCCTCGTTTCCCGTGTCCAAGCTCACGAAGAAGCACGTCCGCGAGTGGAAGGCGCTTCTGATCAAGTATCCCCTGCGCGCAACCGAGGTCGCCGAGTTTCGCGGAATGACAATCCGGCAGATTGTTGAAGCGAATGAGCAGGCGAAGCGGCCGGTCCTGTCAGACCGCACCGTCAACCGCTACCTGTCCAGTCTCTCGGCGTTCTGCTCGTGGGCCGAGGCAAACGGCTACATCGCGAGCAATCCCTGTTCGGGCATGGCGCTGCCAAAGGAACGCGGCTCGAAGACCCTGCCCTTCACAAGCGACCAGATGAACACCCTCTTCCGCTCACCTCTGTTCGCAGGGTGCGAAAGCGAGGCAGCATGGCGCTACATTTCGAAGCCCGGCCCGGTCCTGATCCGCGATCATCGCTACTGGGTGCCGCTGATCATGCTCTATTCCGGCGCCCGCCCCGGTGAGATTGCGCAACTTGCGACCAGCGACCTTCGACAGGAACACGGCCGCTGGATCTTTCACATTACGACCGAGGGCGAGACACACGAGGAAGGAAAACAGGTCAAGACGGCTGGATCAATGCGCGTGATCCCGGTTCACTCCGAGCTGATCCGCCTGGGCTTTCTCGCTTATCATTCGCAGCGCGTTGAAGCTGGCGACAAGCGACTTTTCCCCGGCGCCAAGAGGAACGAGCGCGGGCAGATGATGAGCGAGTTTTCCCGCGAGTTCGGGAAGTATCTCGCGCGCATCGGTTTGAAGTCAGGTCGCGGATTGTCCCTCTACTCGTTCCGTCACGGTGCTGCCGACGCGCTGCGCCGGGCTGGCTACCTAGATAATGAGTTCGGCTTCATCCTCGGCCACACCGAAGGCAGCATGACCGGGCGCTACGGGATCATGCCTCAGGGGATGCTTGAGCAACGGGTGAAGCTGATCGAAGCTATTCAGTATCCCGGCCTGAACTTGGACCACCTCGCTAATGTTATAACATAACAATTTGCTTGATCCCCGACTCGCGGTGTGAGATTCTTCTGTCAAGATAAATTTGATGGAATGCACCGCGTCTCATGTTTTTCAGTCGGAAGCCGAAGGCCGAGCAGAAGGCGGCGGTCACGCTGACCTCAGCCGAGGCGTTCGAGATCTTCGGCGCCCGGCCGACCGTGACCGGCGCCTTCATCACCACGGGCGCGGCCTTGCGCGTTCCGGCCGTGGCCTGCGCCGTGGGCCTGATCGCCGAGACGGTCGGCGCGCTGCCGGTGAAGATCTTCGACGCCGGGAAGCTGACGCGGCGCGATCATCCGGCCTATCGCCTGATCCATGACGAGGCGAACCCGTGGACCTCGGCCGCCGAGCTGCGCGCCCAGCTCACGGCCGACGCCCTTCTGAACGATCATGGCGGCTTCGCCCTCGTCGTGCGCGCGAGCGACGGCCGCCCGCTGGAGCTGCACCGGCTGGACCCGGCCGCCGTGCGCCTCGACGCCGCGCCGGACGGCTCGCCCCTCTACATCGTCGCGACCAACACTGGCCCCGCGAGCTACGGCTTCGCGGACGTGCTGCATGTCCGCGCCTTCGGCGGCTCGCCGATCAGCCTGGGCCGCGAGGCCATCGCGCTCGCCATTGCCTTCGAGGACCATATCGGCAGCCTCTTCCGCAACGGCGGGCGCCCGTCCGGCATCATCAAAAGCCCGAAGATCCTCGACGTGGACGCGAAGCGGAAGCTCGCGGCCTCGTGGTTCACCACGCACTCCGGGCGCAACTCCGGCGGCACGGCGATCCTTGACGAAGGCATGGACTATCAGGCCCTGTCCGCGACCCTCGCCGACAGCCAGTTTGCCGAGAATCGGCTTGAACAGATCCGCGAGATTGCCCGCGCATTCAGGATTCCGGTCACGATGCTAGGCGAGCTGTCGCGCGGAACGTGGAGCAACCTCGAAGAGCAGAACCGGCAATTCCTGCAATTCACGCTGCGCCCGTGGCTGCGCGCATGGGAATGGGCCTATGCCCGCTGTCTGCTGACCCCCGACGAGCGCCGCGAGCTGACGGCCGAGTTTGTCACCGACGACCTGCTGACCACGAGCCACGCCGCCCGCGCAACCGCCTATGGGCAGTATCGCGCCATGAGCGCGATGACCGCGAACGAGGTCCGCGCGGGCCTGAACCTTCCGGCGCACCCGGATGGCGACACGCTCGAAAACCCCCACATCACGACCGGCGGCAAAAGCGAAACCGTTTCGCCTTTGCCCGACGAGGAATCCGCATGATCGACGCCATCAAGCACGTCGCCTTCTTCGGCGACCGCGAACGCACTTTCGCCCTGACCGACGCCATGTGCGCCGAGCTGGAAGCCCTGACCGAAACCGGGATCGGCGCCATCTACTTGCGCACCGTCGCGGGCGCCTTCCGGGGGCAGGACGTGCCCGAGGTGATCCGCCTCGGCCTGATCGGCGGCGGTGCGACCCCGCAAGAGGCCGCCCGGCTGGTCGATACCTACGCCCGCAACCGCCCGATGGGCGAGGTCTATCCGCTCGCCCTCGACATTCTCGAAACGCGTTGGAACGGCGCGGAGACGGCGGCATGAGCGACCGTATCGAGATCAAGGCGGCGCTGACCGTCGAAGAGACCGGCGAGATCACCGGCCTTGCCTGGCCCTTCGGAACCCCCGACCGCGTGGGCGACGTGATCGAGAAGGGCGCCTTCACCGGCCCGGCCGAGTTGCCGATGCTGTTCGCGCATGATCAGACGCAGGTGATCGGCGTCTGGGACCAGATCGCCGAGACGCCCGAGGGCCTGACCGTGAAGGGCCGCCTTCTCGTGCAGGACGTGGAACGCGCCCGCGAGGTCCGCGCCATGATCCGCGCAGGCGCCGTCTCGGGCCTCTCCATCGGCTTCGAGACCGAGGCCGCCAAGCCCCGCGCCCGTGGCCGCTCCATCTCGAAGCTGAGGCTTCTCGAAGTCTCCGTTGTCGCCGTCCCGTGTCATCCGGGCGCGCAGATCCATTCCATCAAGGCCGCAGATGACACGGCAGAACCATGCACCGAAGGAAAGACCCCCGTGGAGAACGAAGACCAGACCACCCCGGCCAACGCGCCGGAGATCGACACCAAGGCGTTCGACGCGCTGAAGCAGCGCCTCGACCAGCTCGAAGCAAAGGCCAACCGCCCCGGCGTCACGACGACCGGCCCGGCCCCGAGCGCCGAAGCGAAGGCCTTCGGCGGCTATGTCCGGCGCGGCGTGGAGCGGATGGACCCCGCTGACACCAAGTCGCTGACCGTCTCGACCGCCGCGAACGGCGGCTACCTCGCGCCGAAGGAGTTCGGCGACGAGCTGTTCAAGAACCTGATCGAGTTCAGCCCGATCCGCAAGTATGCCCGCGTCGTCCAGATCAGCGCGCCCGAGATCACCTATCCCAAGCGCGTCACCGGCACCTCGGCGACCTGGGTCTCGGAAGTCGGCGACCGCACCGGATCGGAACCGAGCTTCGATCAGGTCACGCTGACCCCGCACGAGCTGGCGACCTTCACCGACATCTCGAACGCACTTCTGGAAGACAACGCCTACAATCTCGAAGGCGAGCTGATGGCCGACTTCGCCGAGAGCTTCGGGCGCGCCGAGAGCGCGGCCTTCGTCAACGGCGACGGTGTGGGCAAGCCGAAGGGTATCATGGCGGCGGCGGGCATCGCGACCCTGAGCGGCGGTGCGGGCACGATCACCGTTGCATCGCTGATCGAAGCCTATCACGCGATCCCTACCGTCTATGCACAGAATGCTGTCTGGGTGATGAACCGCACCACGCTGGCCAAGCTGCGCACCTACTTCAACGGCATGGGCGAGCCGCTTCTCCTGGACAGCATCTCGGAGAAGGCCCCGACCACGCTTCTCGGCCGCCCCGTGGTCGAAGCGCCGGATATGCCGAACATGACGGCGGGCGCCACCCCGATCCTGTTCGGCGATCTGTCCGGCTACCGCATCGTGGATCGCGTGGGCCTCGCGATCATGCGCGACCCGTTCAGCCTCGCGACCAAGGGGCAGGTCCGCTTCCACGCCCGCAAGCGTGTGGGTGCCGACCTGACGCACCCCGACCGCTTCGTGAAGCTGAAGGTCGCGGCCTGATGCCTCTCGCGCCCGAAGAGATCACGCTGACGCATAGGGAACACACCCTGCGCCTGCGCCCGTCTCTGCGGGCGGCGCTCATCCTTGAGCGCCTGCATGACGGCTTTGCCCGGCTCTTCGAGAAGCTGGACGAGGCCGACACCCAGACGCTGCACGCGATCATCCGCACCGCCGCGACCGATCCCCACCGGGCCGAGAGCTTCCTTGCCAGCGCCCGGAACGTCCCACTGGCGCCCTTTCTGCGCGCGGTGCAGGCACCCGTTGCCGCCCTCTGCGAAGGGCTGCTGCTGCCGCCTGACGAGTCCAGCAAGCCCACCCCGAAGGCGAAGCCGCTGCCGTGGGTCGACGCGCTGACCGAGCTTTACAAGATCGGCACCGGCTGGCTGGGCTGGACCCCGGCCGAGACTCTGAATGCGACTCCGGCGGAAATTATCCTGGCCTTCGACGGTCGGATTGCCCAGCTCAAGGCGCTGCACGGCACCGCCGACGAAGCCGACCCCGGCGACACCGCCCGGCGCGAGAGGAACCTAGCGGAAGGTCTCGACCCCGACTTTGACCGCGAGGGCCTGCGCGCCCTTGCCGCTCTGGCGGGCTGACCATGCCCCGGCCCCCGCACCTCTGCACCTGCGGCCAGCTCGTGCCGCACGGCGCCCGCTGTTCCTGCCAGATCCAGCGCGACCGCGAGCGCAAGGCCCGCTTCGACCAGAAGCGCCCCTCGTCGCGCGAGCGCGGCTATACCCACGAGTGGCGCAAGGCCCGCGCCGAATTCCTGCACCAGCACCCCACCTGCGCCTTCTGCGGTGCGCCCGCTGGCGTGGTCGATCACGTCATCCCGCACAAGGGCGACATGACGCTGTTCTGGGATCGCACCAATTGGCAAGCCCTATGCAAACCCTGCCACGACCGCCAGAAGCAGATGCAGGAGCGATACAATCACATAATTAGTTGATTGAACTTTCTTCTAAGCGTTGGGAACTCATACGTCTGAAGCAGGCCCTCCTTGGACTCTTCAATCAATTGACCGAGATCCGAGTCTGACAGGACCACAACGAACCCGTGTCCATCTTGTGCCGCCGCCTTACACCGTCGATCTAGAAGATCCGCATTCTCGACATTCCGGCAGACCACAATCCCAAACTGGCCTCTCTCCTTTGAGAACCGCATTGCGATTTGGTCAATCTCTGGATTACCCAGCTCCTCCCCAAAGTTCTTGCACTCGACAAATATGTATGAGCACGAATAATGCCGAGAAAGCCACTCAAAGAACCCAGACCGCGCATAATTGGTAAAGCGCAGATCGACCCTTTTTATTCCACCATGCAGAACTGATTGCTTCTCCGGGTCAACGAGAACCGGATGAAAAAGAGCGGAAAGTAAATCCGTTATGGCGTCCTCGTAGAGATATGCTTGCTTCCTTCCCGGCTCCAAACTGGTAACAGCTTGAAGCAGGGCCTTCCAATCAGGAAGCTGCAAACCTTGCGCCTCAGCGATTTGCCTATGTGTGAGAGCTGGCGTAGGAGCCGAATGATCCCTCTTGTATTTCTCCAAAAGATCTGGCCGCTTATTCGTGTAATCGATTGAGATAGATTTTTGTTGACTTCCGTATTTTTTCTCCAAGCTGGTCTTGGTGACAGTCCTCTCACCCTTTCTTGCGCCTGATTTAATCGTATACACTAGCTCACTATTGCGAGCGATCTCATCGATCTTCATGGACTCCAATACGTAATGACGATAATACTGGCCGACCTTATAATCCGCATCAACCCGCACGATTGACTTCGGAACCAGTATTAACTTCTCCCCTTCAGGTAAAGGCAAGCTCACAAAATCCTCTTCCCAACATCTCGCGCCCGGGTTCCAAACCGGACCAGACGCCACACCTTTTTCCATGGGAATGCCGTATTCCTCGCACGTTTCTTGAGTAAAGGTTATGAGTGGGCCTCTGATTATGTTTGTAACAACATCCGACAGAATATCCACCGAAACCCCGTCCACCAGAAGAACCGTGTCCTCTAGATCACTCAATAGGCCGGTTTCCACTGCCTTACTTTGCTTCAATGATTTCCAGATTTGATACGCCTTTTGGGGACCCAGACCTCTTCCGTCAGACTCGCCAGCAGACAATCCCAAGTGAGTTTCATTTGGCTCGCGCAGTGCTTGCAGGGCTGTTAGCGCAGCTGTCCCGTCGCCGACCCTAATTGAGTCAATAATCGTTTGGAAAAAATTGGACAGGAGATACTCGCAATGCTCCCCCCACTCCGACTCCAGCAATCTGATTGCACGTGCATTCACAAATAGGCGCGCATCGTTTTTAATATCAACATCTATGAACTTTAATGTAGCTTGGTTGCGCCCGAGCTTGTAATATTCTGAAACTCTCAAAATTCGCTCCTCTCCATCGAGCATCCCACCTTAAACGACACAGTTCCCTATGTCCTCACCTTCTACAAGCCGTTTCCCACGCGGGGGCGGTGGCCTGAGATTCGCACTAATCTAGGGACCGGCGGGGGGAGGTCCGCGCAAGATAGGCCCGAAATAACTTTTCCGGGCCGAATCAGTGTGCTACATCTATGTTATGTTATAACGTATGAGGTAGCGCCGTGGCTGCGCAGTCTCCCCTCGCGCTGCTGACGGCGCAACTGAACCTTGATCCTGACACGGCAATAGCCGAGGCGGATCTTATGTCGCACAAGCTGGCCGTGGCCGAGGCATGGATTGCCGGGCATGTCGGCGATGCTTTCAACTCGGCATGTCCTGCCCATGTCGAAGCCGCGCTGATGCTGGCGGCGCATCTTTATGAACAGCGTGAGGCTGTTGCTTTCGGCGTCTCTGCCGAGGCCGTGCCGTTCGGCGTTCGCGATCTTCTGGCGCCTTTCCGGGAACAGGTGACGGGCCATGTCGGCAACCTCTGACAGCTCGCGCCGCTTGGCCGCGCGCCTCGACCAGATCCCCGGCGACGTGCTGGCCGAGCTGCGCCCGGCTCTGGTGAAGGCCGCCGAGGATGTGGCCGCGAAGATGCGCGCCCTCGCCCCGGTGGACACGGGCGCCTTGCGCGACAGCATCGCCGTCACCGGACCCGGCCAGACGACACCCGCCTATGCCAGCGACGGCGGGCGGCGCACCATCCCGGACAATCAGGCCGTGGTCACGGTCGGCTCGCCCGCCATGCGCCACGGGCACCTTGTCGAGTTCGGCACCGTGACGATGGAGGCGCAACCCTTCACGCGTCCGGCGTGGCGGATCGCCCGGCCGAGGATACTTAGCCGCCTGTCGCGCGCCATCGGCAAGGCGATCCGAAAGGCGGGCAGCCATGCTTGATCCCGCCCTGGCCCTTCAGACGGCGGTGCGCGCGGCGCTGCTGGACACGCCCGAAGTTCTGGCGCTGGTGCCTGCGGATCACGTCCGCACCGGCCCGGTGCGCCCCGACCGGATGCCCTGCGTCATCCTGAAGGCCGATCAGGTGCAATATCTCGGCCGGGCTTCGGGCAACCAGCACGTTGCGCGGATCTACATGACCTTGGACGTGTGGGCGCTGGAAGACGCGGGCACCGCCCGGCAGATCGGCACCGCCTGCATGGGCGCGCTGATCGACGCGCCTGTGATCCCCGGCGCCTTCGTGGTGGAGTGGCAGCGCCCTTCGATCCTCTGGCTTCGCGACCCGCAGCCCGAGCGCGCCTATACGCATGGCGTGGTGCAGCTCGAAGCCGTCATTCAGTGGAGGGACTGACAATGCGCAGCGCAGCCCTGCGCCATCTGATCCAGATCCAACGCAAGACCGAGCTGGTGGACCCGTCCGGCGAGGTTGTGACCCTCTGGGGTGCCGTGGCCGTGGCCCACGCCGAGCTGGTGCAGCGCACCGACCGCGAGAGCGCGACCGGCTTCGGCGAGTCCGAGGCCGCGCATGTGACGTTCCGCATCCGCTGGCGGGCGGGCATCACCACGGGCGACCGGATCGTGACGGGCGACGGCCGGACCTTCGACATTCGCGAGGTGGCCGAGATCGGCCGCCGGCGAGGTCTTGAGCTGAAGGCGGTGGCGGCATGA
Protein sequences of DBSCAN-SWA_4 >NZ_CP051468|1811285:1844002|1813081_1813636_-|WP_011337446.1|DBSCAN-SWA MEGQAACFAREAPAVAVDLLGAHLQVRGVGGRIVETEAYTPDDPASHSFRGPTPRNAAMFGPPGCAYVYLSYGIHLCLNVVCAPGHAVLIRALEPTEGLAQMAARRGTDVARLLCSGPGRIGQALGLTLADDGTAFGTRGFAVRWGAPVAAGEILCGPRIGISRAADMPWRFGLRGSPCLSRRF >NZ_CP051468|1811285:1844002|1816571_1816826_+|WP_002719530.1|DBSCAN-SWA MAYAKTGTHERQGRGSALMLTSAVVLGAFGALAVRHLMSGRRSTHYPEVRQAGRDAMRDPPRDWDRLDETLDESFPASDPPATY >NZ_CP051468|1811285:1844002|1816936_1818259_+|WP_011337443.1|DBSCAN-SWA MTTDEGRRPEEPGTPASLREMPRDPRIDASMALMSEGYRFVSNLCDRMDSDAVATRLRLREVVCLRGSAAARLLYGAEGLTRVGAMPSTVLHLLQDKGSVQQLEGPAHRHRKALFLSICMDPARVEALVSEMRLAWRERLPAWEAEGRIVLQQEAARLLTRAGCRWAGVAHQPEAQLADEIFDMIDKAGSVGPRNWLAQMRRAGTEKRLRTLVEEVRAGEVVPEAATALHAIAFHREEDGTLLDPSVAAVELLNLLRPIVAVGRYITFAALALHRETTWRELFRSGNLELAGDFAEEVRRASPFFPFTAAVTTRPITWEGYDFPEGQWLLLDLYGTTHDPRHFPEPTRFRAERMLSWTGQDEAFIPQGAGDVARTHRCPGEMITVELMKEAIRLLCCEMDYEVPAQDLGVRLNRMPAQPRSGMILSAISRRAGTEASRNG >NZ_CP051468|1811285:1844002|1842558_1842870_+|WP_023003570.1|head,tail|DBSCAN-SWA MAAQSPLALLTAQLNLDPDTAIAEADLMSHKLAVAEAWIAGHVGDAFNSACPAHVEAALMLAAHLYEQREAVAFGVSAEAVPFGVRDLLAPFREQVTGHVGNL >NZ_CP051468|1811285:1844002|1840556_1840907_+|WP_011337425.1|DBSCAN-SWA MPRPPHLCTCGQLVPHGARCSCQIQRDRERKARFDQKRPSSRERGYTHEWRKARAEFLHQHPTCAFCGAPAGVVDHVIPHKGDMTLFWDRTNWQALCKPCHDRQKQMQERYNHIIS >NZ_CP051468|1811285:1844002|1812847_1813009_-|WP_002719534.1|DBSCAN-SWA MTNDLQLGPNDPPAPDPMPEPNPYPDPMPDPDPIPPGDPDMPNPGEETPPIYS >NZ_CP051468|1811285:1844002|1836915_1838109_+|WP_011337429.1|portal|DBSCAN-SWA MFFSRKPKAEQKAAVTLTSAEAFEIFGARPTVTGAFITTGAALRVPAVACAVGLIAETVGALPVKIFDAGKLTRRDHPAYRLIHDEANPWTSAAELRAQLTADALLNDHGGFALVVRASDGRPLELHRLDPAAVRLDAAPDGSPLYIVATNTGPASYGFADVLHVRAFGGSPISLGREAIALAIAFEDHIGSLFRNGGRPSGIIKSPKILDVDAKRKLAASWFTTHSGRNSGGTAILDEGMDYQALSATLADSQFAENRLEQIREIARAFRIPVTMLGELSRGTWSNLEEQNRQFLQFTLRPWLRAWEWAYARCLLTPDERRELTAEFVTDDLLTTSHAARATAYGQYRAMSAMTANEVRAGLNLPAHPDGDTLENPHITTGGKSETVSPLPDEESA >NZ_CP051468|1811285:1844002|1822242_1822767_-|WP_002719525.1|DBSCAN-SWA MTRLLHFGTIAGLASLFGVLVPPHALDELMLPLIAVLPVVASVLVLRAGTPMPSAGWERIDPVRRHRFTRRLVERTLAYRRAVFILSGSFLALAAMFAVGKGGVLDGLLSQRQEQLVAAVVGGLIGLCGSCLPEILRRECAVADIQKRLLDELPVFRANRPGAQVIRLVPRAGA >NZ_CP051468|1811285:1844002|1819475_1820666_-|WP_011337442.1|DBSCAN-SWA MSGRDAFLAGMRELLEGIEAEGLTKRERPILSAQGTHVRVGDRDLLNLCANNYLGLANHPALIEAARRGMDEHGYGMASVRFICGTQDIHRALEGRIARFLGMEDAILFAACFDANGGLFEPLLGPEDAVISDALNHASIIDGIRLCKARRYRYANSDMEDLRRQLAEARAEGARHILIATDGVFSMDGYLARLPEIVALAEEFGALVMVDDCHATGFMGPKGQGTPAHHGVTVDILTGTLGKALGGALGGYVAGPQPVIDLLRQRARPYLFSNALPPAVVAAGLAALDLVEAADDLRAALFENTAYWRAGLTEAGFRLLPGAHPIVPVMLGEARLAQEMAAALYDRGVYVSGFFFPVVPRGQARIRTQMNAALTRDDLDLALAAFREAGRAVGAI >NZ_CP051468|1811285:1844002|1843268_1843685_+|WP_011337422.1|DBSCAN-SWA MLDPALALQTAVRAALLDTPEVLALVPADHVRTGPVRPDRMPCVILKADQVQYLGRASGNQHVARIYMTLDVWALEDAGTARQIGTACMGALIDAPVIPGAFVVEWQRPSILWLRDPQPERAYTHGVVQLEAVIQWRD >NZ_CP051468|1811285:1844002|1811285_1811666_+|WP_002719536.1|holin|DBSCAN-SWA MPEDRSLGDLLSDLVRDVTGLVRSESRLIRAEVAEAGRSMAVGAEMIAAGGILLLVALLVLVQALVVLLAHWVGPAWAALIVGAVLAVIGGLLIARGRKDMSAANLVPERTIEQTSRDVRLAREQI >NZ_CP051468|1811285:1844002|1820710_1822156_-|WP_011337441.1|DBSCAN-SWA MTFRFDNSYARDLEGFYVDWPAAPVPAPRLLRLNRPLAEELGLDPDLLEREGAEIFSGRRLPEGAHPLAQAYAGHQFGGFSPQLGDGRALLIGEITDRAGRRRDLQLKGSGRTPFSRGADGKAALGPVLREYLVGEAMHGLGIPTTRALAAVATGEPLLRQEGERPGAILTRVAASHIRVGTFQFFAARSDIDRVRRLADYAIARHYPELASAPEPYLAFYEAVAEAQAQLVARWMLVGFIHGVMNTDNMTISGETIDYGPCAFMEGYDPGTVFSSIDLQGRYAYGNQPYILAWNLARLGEALLPLLDGDPVRAADKATSVLETVGARYQGHWLAGMRAKLGLAGAEEGDARLAEDLLEAMRSQRADWTLTFRRLAEAVTDEGALRPLFRDPAALAGWLPRWRARLAPDAAERMRATNPIYIARNHRVEEALAAAHAGDLAPFDRLLEALADPFTERADRELFALPAPEGFDDSYRTFCGT >NZ_CP051468|1811285:1844002|1814403_1816536_+|WP_011337444.1|DBSCAN-SWA MAEDKPAARSGSFENLVSDYGAGGEFHQHAGEGDERLTTAQGVTIADNQNSLRSGAPGPTALEDFILREKIFHFDHERIPERVVHARGFGAHGLFECTKAIPEITRAAPFQKEGKITETFVRFSTVAGSKGSVDLARDVRGFAVKFYTDEGNWDLVGNNIPVFFIQDAIKFPDLIHSVKPEPDRGFPQAQSAHDNFWDFISLTPESMHMVMWQMSDRAIPRSFRFMEGFGVHTFRFINAEGQSSFVKFHWKPKLGMQSVLWDEALKINGADPDRHRRDLWEAITAGDFPEWELGLQIFDEEFANRFDFDVLDPTKIIPEELVPLTIVGTLTLDRVVDNFFAETEQVAFCTQNVVPGIDFTDDPLLQGRNFSYLDTQLKRLGSPNFTKLPINAPRGCPVHNFQQDGHMQTSNRKGRVNYEPNGWGKGPRAHPLEGYVSYAVRNEGEKRRLRPESFADHYSQARQFFLSQTPVEQGHIRDALVFELSKCQEIEIRARIVAHLLNIDGALAQAVADGLGLTEMPEPAPAARPTRTDLPPSDALSMLKNPPPNFAGRTVGVLAGNGVEAELLTELRSILQAEGAMMKLVAPRVGGVTDSAGTLHPADEKLGGGPSVIFDAVAVLPGSDAAMLAANPAAQDFLTDAHAHCKFIAHHGAGPLIEACALGAKMDEGWIEIGSSSDIDGFVAACRGMRHWPREATLGGNPPLARTFSP >NZ_CP051468|1811285:1844002|1824165_1825584_-|WP_011337439.1|DBSCAN-SWA MRNIDRFFIGGAWTAPLGTDRHRLVNPATEEEIAAIPMASTEDVDRAVTAARAAFEGWQASSKEERLLLLRRLLDLYNEAYDELAELMTREMGTVARFSREAQAWVGRAHLEAAIEALEAESFEEMRGSTLISKEPIGVCALITPWNWPMNQLVVKVAPALAAGCTVVAKPSEFSPLSSIRFAELVEAAGFPPGVYNHITGAGPVAGEALARHPDVDMISITGSTRAGIAVARAAADTVKRVTQELGGKSANIILPDADLATAVRQGVLDCFGNAGQACKAPARMLVPAERMEEAAALAGAAAEALTVGAPEGEVDLGPVVNESQWRRIQSLIEAGIAEGARLVTGGPGRPDHLPRGWYVRPTVFADVAHGSTIATEEIFGPVVALIPYRDEEEAVRIANDSIYGLAGYIQTGDPETARRIARKLRVGMVYINGAGWDARAPFGGYKQSGNGREHGAWGLADYLETKATAGL >NZ_CP051468|1811285:1844002|1832933_1833698_-|WP_011337431.1|DBSCAN-SWA MDYRTVFRLDGACAAVTGAGSGIGLEICRAFAASGARLILIDREGAALDRAAEELGAAVASRIVADVTDAEAMTAAAAAAEAVAPVSILVNSAGIARLHDALETDDATWRQVMAVNVDGMFWASRAFGRAMVARGAGAIVNLGSMSGTIVNRPQFASSYMASKGAVHQLTRALAAEWAGRGVRVNALAPGYVATEMTLKMRERPELFGTWLDMTPMGRCGEPSEIAAAALFLASPAASYVTGAILAVDGGYTVW >NZ_CP051468|1811285:1844002|1842883_1843276_+|WP_017140145.1|DBSCAN-SWA MAARLDQIPGDVLAELRPALVKAAEDVAAKMRALAPVDTGALRDSIAVTGPGQTTPAYASDGGRRTIPDNQAVVTVGSPAMRHGHLVEFGTVTMEAQPFTRPAWRIARPRILSRLSRAIGKAIRKAGSHA >NZ_CP051468|1811285:1844002|1831244_1832207_-|WP_011337433.1|DBSCAN-SWA MTTKTAKGLTIALLLLATLSPALAEELTLEGKTIGITAIGTDHDWDLKAYQAQIAEIERLGGTAIALDAGRNDQTQVSQIQTLIAQKPDAIIEQLGNLDVLNPWLQKINDAGIPLFTVDTATPHAINNTTSNNYSIGAELALQMVADLGGKGNVLVFNGFYSVPVCKIRYDQMKYVLEAFPDVKIIEPELRDVIPNTIQSAYSNVTDMLTKYPNEGDVGAIWACWDVPMIGATQALQAAGRTDIRTYGVDGSPEFVEMVADPESPAGAVAAQQPSEIGKLAVQNVARHLAGQEVKPFTFAPAVLITKENAAETAADFLPK >NZ_CP051468|1811285:1844002|1833891_1834674_-|WP_002719513.1|DBSCAN-SWA MEDGDSAEAFRAFAAGVALQREDPSPLWVQLKNRIEHAILDGTLPENARLPSEQAMCSMFDLSRPVIRNALQALAGEGRVIKQPRKGMFVAPRAPELAFMTSALGVFDDLSAKGYKVTVKTYEFGLHPANEDERRVFKLPEGFQVIRALRVYHANETPLTHTLISLPAHRLPGFEKLDMEGRSIFGTIRELYGLTVARADRWLKGAIVPPEVAARMGVPPGGAMIAIESVAYDHDGNALEYYRAYYNSEVAPIHVATDAH >NZ_CP051468|1811285:1844002|1813754_1814282_+|WP_011337445.1|DBSCAN-SWA MERLSMRKIAFTASLLALAAAPGFAQSGADGMDPTRPVAPETGTGSIKDISQQTWSDDVRDVFFEDAAMTSLRATDEMEKRWAALDPEARERAVRDCQAYLMDAGSMGGASSGGIGGSLTPDADEAGTTGEDLGGTDTPAPVPEDTTNASDAPASPGDASVWTQACSFVTSFKPQ >NZ_CP051468|1811285:1844002|1832272_1832929_-|WP_011337432.1|DBSCAN-SWA MNEMSDLSDLAKREEIIRQCLEMNRSGLNQGTSGNISVRHGEGMLITPTSLPYDTLVPEDIVFVSMEGEVRGRHKPSSEWRFHRDILRERADVNAVVHAHPTYCTTLAIMNREIPSIHYMLAVVGGPNIRCAPYAIYGSEELSRNAVEALHDRKACLLEHHGMIAVGKSLAQAMWLAVEVETLARQYHGCLQIGEPRLLSERQIQDVIDKIAGYGHQG >NZ_CP051468|1811285:1844002|1839957_1840554_+|WP_011337426.1|DBSCAN-SWA MPLAPEEITLTHREHTLRLRPSLRAALILERLHDGFARLFEKLDEADTQTLHAIIRTAATDPHRAESFLASARNVPLAPFLRAVQAPVAALCEGLLLPPDESSKPTPKAKPLPWVDALTELYKIGTGWLGWTPAETLNATPAEIILAFDGRIAQLKALHGTADEADPGDTARRERNLAEGLDPDFDREGLRALAALAG >NZ_CP051468|1811285:1844002|1811665_1812775_+|WP_011337447.1|DBSCAN-SWA MNADKIEREVEENRARVESTLDALKERMSVNQVVDDLANFVGVEDLRGVMHSAGRQVRDNPVALGLIGVGLAWLAFGGSSSRSRHVSAYDREEYYRSDYGPARRSYEPYGGGASYRSDRGEGVVSRVKHAVSDAADSVSRAAHSATDKVAETFGDARDRAGSLRDDVYDRAGRMREDAYDRAGHWRDDLGERSAHLRDRAGHLRDRASHGAHQMRDSMSHGMEQQPLLVGAAAVALGAVIGAALPRTRTEDEWMGRTSDELWDEAKASSWELRERAMKAARETYDATIAAARDEGLVPEKGETLASKVGRVADAAASEAKAQVEPVLHGKEADKSSTGMSSAGAGSTGSTDSTTKSPGTSGPKVAGSGF >NZ_CP051468|1811285:1844002|1838422_1839958_+|WP_017140147.1|capsid|DBSCAN-SWA MSDRIEIKAALTVEETGEITGLAWPFGTPDRVGDVIEKGAFTGPAELPMLFAHDQTQVIGVWDQIAETPEGLTVKGRLLVQDVERAREVRAMIRAGAVSGLSIGFETEAAKPRARGRSISKLRLLEVSVVAVPCHPGAQIHSIKAADDTAEPCTEGKTPVENEDQTTPANAPEIDTKAFDALKQRLDQLEAKANRPGVTTTGPAPSAEAKAFGGYVRRGVERMDPADTKSLTVSTAANGGYLAPKEFGDELFKNLIEFSPIRKYARVVQISAPEITYPKRVTGTSATWVSEVGDRTGSEPSFDQVTLTPHELATFTDISNALLEDNAYNLEGELMADFAESFGRAESAAFVNGDGVGKPKGIMAAAGIATLSGGAGTITVASLIEAYHAIPTVYAQNAVWVMNRTTLAKLRTYFNGMGEPLLLDSISEKAPTTLLGRPVVEAPDMPNMTAGATPILFGDLSGYRIVDRVGLAIMRDPFSLATKGQVRFHARKRVGADLTHPDRFVKLKVAA >NZ_CP051468|1811285:1844002|1826594_1827545_-|WP_011337437.1|DBSCAN-SWA MPTYDVSSIGFYVCDILGRPVTRIPDGGRADYIQEIRMTVAGTAGATAADCAILGLKTLAVTTVGEDEMGDFMVAKLTRFGVDCAMVARDGSVQTSATILPVRPNGERPALHVPGTAATFRIPEGRLDAALDARIVHVGGTGLLKSFDGAPTVEALRRAKELGRITTFDLIQATPETFELVRPCLPYIDYFVPSIEEASEMAGTTDPAEVARFFKGLGVKNAILTMGGEGVYVSPEAGEDFRLPAHAIEVVDTTGCGDSFTAGVIVGLARGWDLRDCCRFASAVAARVAMGLGSDGKLVSFEDTIEAMNTLPLRAN >NZ_CP051468|1811285:1844002|1843687_1844002_+|WP_011337421.1|head|DBSCAN-SWA MRSAALRHLIQIQRKTELVDPSGEVVTLWGAVAVAHAELVQRTDRESATGFGESEAAHVTFRIRWRAGITTGDRIVTGDGRTFDIREVAEIGRRRGLELKAVAA >NZ_CP051468|1811285:1844002|1825837_1826584_-|WP_011337438.1|DBSCAN-SWA MTKTAVVTGAARGIGRALAVGLAEAGYDVAVTDLAAQADGLAETRALIEAAGRRAFVETVDVSDRAAVVAAMARIEDAAGGIDVLVNNAGILKPARLEDLSEADWDAHMDVNVKGVLSCCQAVLPGMRARKAGRIVNIASIAGRQGVPTQGHYAATKAAVITLTRVLAQEAGMDGITVNAICPGIILTEMGKNNLGSDEAIRHWEEVAALKRLGAPEDIVGPVLFFAGEQSAFVTGQALNVCGGIYFH >NZ_CP051468|1811285:1844002|1838105_1838426_+|WP_011337428.1|DBSCAN-SWA MIDAIKHVAFFGDRERTFALTDAMCAELEALTETGIGAIYLRTVAGAFRGQDVPEVIRLGLIGGGATPQEAARLVDTYARNRPMGEVYPLALDILETRWNGAETAA >NZ_CP051468|1811285:1844002|1835023_1836841_+|WP_011337430.1|integrase|DBSCAN-SWA MARLSHLLRRGAAYYARTRVPLDLIAIVGKKELVKALGTKDETEAKRRLYPQLDAWQREFDDLRARRSLVAADREHAVWDHYTAALTRDEAERATLPGEAEIAAAEADVIRRADRGEITSAEPLAILDATLELQVKQQAPALASDTRKAKLADLREHLVKGETALISHEVDDYLNRNRLLVERGTPDWISLARRMMRAEAEALERTLERDRGDYTGQPRDPLVKPPTAEKRAEMEVAPPGESIAEAVEAFRTENPRNVSKGRIEEACRDIGVFMETVGPSFPVSKLTKKHVREWKALLIKYPLRATEVAEFRGMTIRQIVEANEQAKRPVLSDRTVNRYLSSLSAFCSWAEANGYIASNPCSGMALPKERGSKTLPFTSDQMNTLFRSPLFAGCESEAAWRYISKPGPVLIRDHRYWVPLIMLYSGARPGEIAQLATSDLRQEHGRWIFHITTEGETHEEGKQVKTAGSMRVIPVHSELIRLGFLAYHSQRVEAGDKRLFPGAKRNERGQMMSEFSREFGKYLARIGLKSGRGLSLYSFRHGAADALRRAGYLDNEFGFILGHTEGSMTGRYGIMPQGMLEQRVKLIEAIQYPGLNLDHLANVIT >NZ_CP051468|1811285:1844002|1827560_1828538_-|WP_011337436.1|DBSCAN-SWA MSLSLSRHLPKDRESFSTLAVLVAFVGLFLVFATQADAFLTLGNLRNVLVNNVVLLAIVALGVTFVVSSGGIDLSVGVSVDMASLVFVSALAAGVAAPLGLAAGLGAALLVGLLNAILIARLKISPFLATLGVLFIGQSVQRLATGGGQPIYLVTKEYAAIFNAISRSSLLGVPTPVWVLILCVLATWLALHRMGFGRQVQAIGARPGVAWYSGIRVPSRLTQVYILAAVLAGITGILLSATVRSYVPMSGNAFLLDAIGATFIGTTISRERRPSILGTLLGVALLAMVKNGLLLIGWNFYWQQVGIGVLVFAVLALSFGLQRRH >NZ_CP051468|1811285:1844002|1822999_1824169_-|WP_011337440.1|DBSCAN-SWA MTSAYRNALLLSLGPAAGIGLGRFAYALLLPAMQADLGWSYAAAGWINAANAAGYLGGAMLAPALAQRVGAARAFAAGLAMLLPALAAVALTRDVAALAALRLLAGASGGVVFVCGGLLAVGLSLRAGSGGLVLGTFYAGTGLGMILSALAVAPLLGIAGATHWPQGWLILAGLSALCAALALLPLRDGLGASVRQAGSRGPTPLRFWRILAGYLLFGLGSIGYMTFIYGHLAESAGGWRQAMLFWCALGLAAVAAPSIWRRLIGGASPERSFALLVATNALGSVLPFLMPGALGLWLSAFLFGSTFFSTVAATSAFASALPQAFDRGRAIRAFTIAFALGQFGGPVVLGWTADLTGRLDAPLMFASLVVLAGALLGVLERRPDAIDGA >NZ_CP051468|1811285:1844002|1840891_1842364_-|WP_011337424.1|DBSCAN-SWA MLDGEERILRVSEYYKLGRNQATLKFIDVDIKNDARLFVNARAIRLLESEWGEHCEYLLSNFFQTIIDSIRVGDGTAALTALQALREPNETHLGLSAGESDGRGLGPQKAYQIWKSLKQSKAVETGLLSDLEDTVLLVDGVSVDILSDVVTNIIRGPLITFTQETCEEYGIPMEKGVASGPVWNPGARCWEEDFVSLPLPEGEKLILVPKSIVRVDADYKVGQYYRHYVLESMKIDEIARNSELVYTIKSGARKGERTVTKTSLEKKYGSQQKSISIDYTNKRPDLLEKYKRDHSAPTPALTHRQIAEAQGLQLPDWKALLQAVTSLEPGRKQAYLYEDAITDLLSALFHPVLVDPEKQSVLHGGIKRVDLRFTNYARSGFFEWLSRHYSCSYIFVECKNFGEELGNPEIDQIAMRFSKERGQFGIVVCRNVENADLLDRRCKAAAQDGHGFVVVLSDSDLGQLIEESKEGLLQTYEFPTLRRKFNQLIM >NZ_CP051468|1811285:1844002|1829552_1831061_-|WP_011337434.1|DBSCAN-SWA MSGLAIDMTGISKAFGPVKALVDADLRVARGTIHGLVGQNGAGKSTIIKVLAGILKPDSGRITINGTRVESLTPASVERLGVHFIHQERLLVPTATVAEAVFLNYELRFGPFLRPGAMKRRAEELIRTHFGLELPGDTLVRDLTTAQQKIVQITRALAQEAQVLVLDEPTAALVKREVDSLFAVLRNLRAQGIAVIFISHYMQEIEDLCDEVTVMRNGTDVGVVRPGETSIDEIVSMMIARDVGEMFPCRSHALGAPVLRVEGLSQAGHFRNVSFEVRAGEVLGITGLLGSGVKELVECLFGLEQPDAGSVTIDGEVRRFANPGRAVQGRVALVPEDRRAHGVATDMSVRDNITIASLERYMTRGFVSRARENEAVDGFIRELSIKTPHRDQLVRNLSGGNQQKVALAKWLSCQSRVYVLDEPTVAVDVGAKVEIYTLLNRLAAEGAAILFLSSDLLEIAGFCDRALVVYRGTLNGEFAGETLDSDLLLAAASGARAQRKEA >NZ_CP051468|1811285:1844002|1818450_1819476_-|WP_009564818.1|DBSCAN-SWA MRALVKAKAEPGLWMEERPVPEIGPDEVLIRVRKTGICGTDVHIWNWDDWAAKTVPVPLVTGHEFAGEIVEVGRDVRDLSPGQRCSGEGHLIGHHSRQVRAGRFHLDPETRGIGVNVPGAFADYLRLPAFNVVPLPDAIDDEVGAILDPLGNAVHTALSFDLVGEDVLVTGAGPIGIMAAAVARHVGARHVVITDVNADRLRLSTEVADVVPVNVATEDLRSVMGRLKIVQGFDVGMEMSGAPAGFDQMVEAMVMGGRIAMLGIPPGRSPVDWSRIVFKALTIKGVYGREIFETWYKMIAMLENGLDIRRVITHRFPVADFAEGFAAMRSGASGKVVLDWG >NZ_CP051468|1811285:1844002|1828541_1829552_-|WP_011840754.1|DBSCAN-SWA MTMALSPSSSESPAGASRGKALAAAAIRLGALATFAAIMAYFALTARGFATPFNIVNVIEQSAILGILAFGMSVVIIGGGSEVQTGGIDLSLAANAGLCAAVFATATNAGMAAPVAIAATLGTGMLVGAVNGFAVVGLGILPLLATLASMNIVAGIELTLTENTVLSTSSPLLSVLASGRFLGISALAWVLIAASVIVGLVIHRTPVGLRLYAVGGHPEAARAAGLNVGLHVWGTYVFAGLCAGLAAILIVSRLSASTPGTGELLLSILAAALLGTVFSRRFVPTVGGTVLSVIFIGFLANGFQLLHLSSYWVSGVQGALILLVVAVTSYARPQEH |
34 | Tupanvirus(10.0%) | tail,integrase,portal,capsid,head,holin | attL 1815230:1815248|attR 1844950:1844968 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
2148778 : 2172343
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NZ_CP051468|2148778:2172343|DBSCAN-SWA TTTACGCTCCCGCAGGGTCGCGTTTTACAGGGTGCCCTCCGGAGATCACCCGGCGGCGGTCGGCCCGGCGGGTGTATTCCTGCACCTCGAGCAGAGTCGTGTGCCCCACCCATGACTGCATGACGAGCACGGAAGCGCCATGTTCGGCCAAGGCGTTCATGCGGTATTTGCGCAATCCGTGCGCGGTCCGGCTTTCGAGCCCCGCCTCCCGCGCGGCGGCCGAGAACCACTGGCTCACGCCCTTCTGCGTCCGGGGCTTGCCGTATTCCGTCAGCATGTAGACCGGCGATCCGACCGGCAGGCAGCGCATCAGATCAGCGCGCTGCTCCTCGAGTCCGAAGGCGGGACAGGTCCAGGGCACATGGGCCGGATTGCCGGTCTTCTGCTGCACATAGGTCAGGACCCCGTCCCGCCCCACCATCTGCGGCCCGAGGCGGATCACGTCGACGCACCGGGCACCCGTCCACTGATAGAGCTCGAGCGCAAGCCGCTGCGCCGTACCGATGGGCCAGCGCGCGCGGAACGCCTCGAGATCCTCGGCGGTCCACTCCGTGAAGCCGGCCACCTTTCCCATCTTCTTGCGCTTCACGCCCTCGCTCGGATCGGCGTCCAGCATCCCCTTCAGGAGCCAGAACTCCGCGAGCTTCCTCCAGGCTTTCAGCCGCGAGGAGGCCACGGGCGGCGACAGCGGTTCGAGATCGGCGCGGATGTGGCGCGGCAGGAGATCCTTCAGCAGCGCCTTCTCCCCCTGCTCCTTGATGCGCTCGACGTGGCGCCGGATCACCGGCCGGTATCCGTCCGAGAGCTCGCGATAGGAGCGGCTGCCAAGATAGGCCTCGCAGGCGGCGGCAATCGTTCCCGCCTGGCTCCGCGAGCGCACGGTCGGCGCGCGCTGCTCTTCCGCAAGCCATGCCGCCACGAAGCGCGGATCGTCCTCGGGCACGTCCCGCGGCAGCTCCTTCCGGGTGCGGCGGTGGTACTTGTAGACCGTCCCGCCGCGCCTGACCCTGTGAACGCGGGGCAGCATCAGGTGATCCCGAAGGCGCCGTCGCATGAGTTTGCCTTCACCTCGCCCTCGGTGGAGAGGGAATCGGCATAGGCCTCGAGCTCCAGCCGGTCATAGAGCCGCTTCCCGCCCAGCACCTTGCGCGGGATGGGCAGCCCCCGCAACGTGGTCGCACTGACACCGAGATAAGCCGCGGCCTGCGGCGCCGGCAGCAACCGCGGTGCGAAAGTGAGAGACGATTTAGCGCTCATGGCCCCTGCTCCCTCTCAGCGCCCGACACAGCACGCCCACCGGCGGCGCCGCGCGCACCTGTTCCTTCTCCGCGCTCATTTGTCACCCTCCGGCCAGCCGAGCTCCGCCGGCACCCAGGCGTCGATCTTCGCGCAATCCTCGCGGCTGAGGCCAAGGGCCTCGCGCGTGGCGGGATCGGCGAAGAGCGCCTCGAGGCGCTTCGCCTTCTCGCCCTTCTTCAGCTTCTGGAAGCTCTGATCCGCCTCGGCCGCCGGCACGAGTTCGCCCCAGAGCCGGTCGAGATAGCTCGCGCTGCAGCGGCCGAGGAAGCCCTGTGCGGTCGGCGTCCAGATCCGGCGCACCTCCACCCCGAGCTGGCGCGCGAGCGTGGCCGAGAGGCCGGAGCTGCCGGTGCAGAACGTTCGAGCGAGGGCTTCGGTCAGGATCTGGTTGCGGTGCTTCTTGCCCTGCGCCCGGAAGGCCTCGAACTCGGCCGGGGTGCCGTCCGGCCCGAGGCTCGTGTTCGGCTCCATCCGTGCCGCCAGCCGCGGTGGGTAGCTCGTGCCCTCCCCCTTCTCCGGCACAATGGGCTGATCGGTGGGCGAGATCGCGAGCGGGCGCGCCCAGGGGCGGAGGCCGCCGCCGAGCGACCATGCCAGAAGGTCGAGCATGAGCTCGGACTGATCCATCAGCCGCGCCTGCAGCGCCCCGAGCCGGATCCGGTGCAGGTCCTCGATCAGCGATTGCGGCAGGTCGGGCGCCGCCTCGGCCGCGCCACGCGGAACCGCCCGCGGCTGCTGATAGGCGCGCTCGACCGAGAGCCCGTCCCGGCTGCTGCAATAGACGAAGATCCCGGCCGAGGCGCGCTGCGCGTCGGTGAAGCCGCCCTCCCGCCGGGCCTCGAGCTCAGCCAGCCGCGCGCGGCCAGCCTCGTCGAGGGTGTCGTCGGCCTCGCGCTCCTCGAACGCCGCGAGCTCCGCTTCCTCTCCGTCCGAGAGCTTCCCCGGCCGGGCGTAGAGCCGCACCAGCTTCTGCGTGACGGTCCAGGAGACATATTCCTCCGGCACCCACGTCGCCCATTCCCAGCCCTCTTCCGCGCGGATCCGCTCGGCCTCGGCCGCGCCCTTCTCGGCGAAGAGCCGGTCGAGGAGCGCCTCGTCCTCCAGCAGCGTCCGGTCGGCGAAGAGATCGCGCTGAGAGGTTCCGCCGGCAGCAAGGTAAGCCTCCAGCCCCACGAAGAGGACGCGGCGGTCGGTCGAGGGCACGGTGCCGGGCGTGAGCTCGCGCCGGACCTGCTCGGGCCGCATGTCGCGGCCGCGCACCGATCCCAGCACCTCGAGGCAGCGCTCGCCGCTCGGCGCGAGCGTCAGCGCCTTCGCCATCTCGAGCGAGATCTCGTTCGCCCGCAGCGCCGCCAGCGCCTCGGCCGGCAGGTCCGCGAGCTTCAGGCGCCGCTCGACATGGGCCTCGGAGCGCGCAAAGCTGCGCGCGATCCGCGACAGGCTCGCGCCGCGGTCGCGCATCGCGGCATAGGCGCGGACCTCGTCGGCCGGATGCAGCGCCGAGCGCGCGCTGTTCTCGGTCCCCGCCCAGGCCACCGCCTCCTGCAGGTCGGCCGTCACCTTCACCGGTACCGGATCTATCGGAATGAGATCCGGGTGCCGGCTCCAGCCCTCGGCCGCGAGGCGCTGCAGCGCGCGGAGCCGGGTGCCGCCGCCCACGATCTCGACGCCCTGGGGCGTCAGATGGCCGATGAGGTTCTGCAGCAGCCCCGCCACGGCGAGGCTCTCGGCCATGGCCTCGACCTCCTCGGCCGCGATCTCCTGCCGGCTGTTGAGGGGCGAGAGCCGGAGCTCGGCGAGCGGGATGAGGCGCAGATCGCCTCGGAAGATGAAGTCCTTAGCCATGCGGGATGCCCTTCATGCAATTCTTCTCCGGCGCGCCGGAGCGCGGGAATGAACCCGGGCGCCGAGGCGCCGGGCAGGTTGAACGGAAGCTGCGCGGACGGACCCGCCGCGCCGGGGGTGGGTCAGGCCATGCCGAGCGCGGCCTTGTAGAGCTCGAGGAGCGTCTCCTCCTCGGCGATGTCGTCGGGGGTGCGCTTGCGGAGCGCCACGATCATCCTGAGGATCTTCGGCGCGTAGCCCCTGCCCTTCGCTTCGGCCATCAGCTCCTTCTGCTGGCCGCCGATCTCCTTCTTCTCGGCCTCGAGCTGCTCGTAGCGTTCGATGAACTGCCGCAGCTCGTCGGCCGCGACCTGATAGGCGGCCTCGGCCACGGCGCGGTCGGCCGGCGTTTCCTTCATCGGCGGCCTGCGGATCCGGCCGCTGTCGATGGCCGCGGCCATGCGGTCGAGGTCGGCCGCGTTGAGGCTTTCGACCGGAACGGGCCCGGACCGCGGCGCGGGATGCGGGGCGGCCTCCATCACATGCCCCGCATGGCGCTGGCCTGCTGGCGCATGGTCTCGGCCACGTCGGTAGCGTGCGCCCAGAGGACCGTGGCGACAAACAGGAAGCCCAGGAGCGCGAGCGCCCCCACGAGGAAGCCGGTGAGCCTGGACCCGAGCGGGCTGGGCCTGCTCGTCCGGAAGGCGGGCCGCGGCCGGGCGCGGTGCAGCGGGCGGCCCGTGGCGACGGCCAGCTCGTGTTCCGTCAGCAGCCGGCTCGCGGCCTCGCGGACCTCGGGCGTGGGCGCGTGCTCGGCACGATGGCGCGCGGCATTCAGATTGCGGGGCGTGAGGGGCAGGATCCGGCTCATGCCCCGAGCGCCTCCCGCGCCTGGCCGAGGATCGCGGCGAGCTCGGCCTCGGTCTCGTCGTCTATGGGCTCGAGCTGCGGGAGCCCCGCCTCGGCCGCCGCGAGCAGCCGGCGCAGCTGGGGCTCGGTCAGGTCAAGAAGGGCGAAGCGGGCAACCGGCGCGGAGGCCGGGTAACGGATAGCGGTGGCGGGCATGAGGATCTCCATGCAAGTAAGCACGTCTCAACATGCTGGACATTTCTTCCATCGTCAACTTATTGATGGACGAAACGTCCAGTTACTTGAGAAGCGCCGCTTACGTAGGCTCCCGAGAGGTCATTGATGAGCTGAAGTGGCCGCAGCGGCAGGTGGGGAAGCCTTGGCCGTGGAGTGGCAGAACTCAGAGGCGTCAGAAGTGGCCGAGGGTTGCGCTTGCCGTCATCGTATGATGCGAGCAAAAAAGAGAATTTTCACTAGGACGCGGATTTTCGCCGTTGCGACATACGACGATCCCCGACCTGGCACTGACCGAAATGGCCTTGTGATGGCCATAGCTTGCCTAGAACCGCATGTGAACGAGAGGGGTAGGGATGAGACTCGATGCAGGAGACCGGTTGCTGTTCGCGCTTGCCGATGCTTCCGGAATGTCGATTGATCGCCTCTATCGCATGGGTGCGGTGAGGTTCTGCATGACCTTAGTTCGTAAAGTCTCGTCGATCCGGGACATGTCGCCCCGATAGATAAAGTCCATGCTGACGCCCCAGCCTTCGGCTATGGCATGGCCGTACTCGGACTTAAGCGGCTTCTTGCCCTGCAACACCTTGGAGTAGCTCGAAGGATCGAGGCCGAACGATTGTGCGAATTCGCCCTTATCGAGACCAAGTGCGTCTCTTAGGGCCTCCAGCCGACGCGCCACGCCAGCCTGATCAATCTTGGGGAGCCTGCTAGGCATTGATTTCATGGGCACATTTTGACGGCAAATGGAAAGTTTGCCCATTGAAGTTTCTTCCATCTTGTCTTTCTGGATGTTTTGTCCAGAATATGCGGCATGCAGACCGAGAACCTGATCGAAAAGCTGATCGCTTGCTGGCCGACGCGCCGGGCCTTCGCCGCTGAAGTCGGCGCGAATGTGGAGGCAGTCCACAAGTGGGCGAGGGCAGGACGTGTTCCTGCTGACTGGCAAGCCGCAGTCGTGAAAGCTGCCGCCGCTCGAGGCCTGACGTTTGCGACCGCAGAATGGATGCTGGCCGTCCACGACCGGTCCAGCGAAGTTAGAGGGGCGGCGTAATGCTCTACGCCGCCCCCCTCTCCGGTCGTGAAATCGCTTCGTCATCGAAAACCCTCTCCAAGGGAGAAGATGGAGTGAACATGCGGAAAAATCTTGCCCAAAGTGACGTTCATGCCCGAGCGAGCCGAAAAAGGTTCGCGGGGCTTCTGTGGCGGGCCTTCCCGGCCTCCTCGGAGCGGGATCTGGCGGCGAAGGCCGCGCCGGTGCTCGGCGTGAGCGAGCGGCAGGTCCGGAACTGGCTGCAATGCGAGAACGACGCGGCGCTGCGCCATGTCTTCGCGGTCATGACCATCGCCTGCGGCGAGGAAATCTTCAGCATCATCGAGGGACGGGGATGAGGCGGGTCTATTGGTATCTCATGCAGCGGTTCTACGAGGCGCGGTCCTTCCGCGCCTTCTGCAACCATCTGCACTTCAAGAAACAGTCGCGGAAATATTCTTCCCGGCTGGCCGAGACATGGCTGCACCGCGATGACGAAAGCGAGGAGCATTCGCCTTGCCGCCTGCCCTCCTGCTGGTGGCTCGTCCCCTCCGTGCTCGCCGGATCGGCGATCCTCGCCACGGCCGTCGGGAGCCTTCTGTGACCGCCGCCGCGCCCTTGCCTGTACAGGACGCGGCCACTTCGCCGGGGGCTGCGGCCTCCGGCGCTTTTCGATCGAGCGAATGGGCCGCGCTGCGGCGCCATCCTGCAGGGCGCGCGGATCTGTTGCGCTGGGGCGCGACCCCGGCGCTCGTCGCGCGGCATGCGCGCTGGGGGCGGCCCGTCTATCTGGCGAGCCCTTACACGCTCCGCGCGGTCGGCCCGGACGGCCGCTGGTCGCGGGATCTGTCCGAGGCCGCGATGGCCGACGCCGCGCGCGAGGTGGCGCGGCTACTGGAGGTCGGCGTCACGGCGATCTCGCCCGTGGTGCTCTCGGCCGCCGCGCTCCATGCCACCATGTTCCCCCGGCTCCGGATCGACCCGTTCGCTCTGGCCCTCTGGGAGGACTGGTGTCGTCCCCTTCTCACTGTCTGCGCCGCCGTCGTGGTCCCCGAGATCCGCGGTTGGTCAGACTCCACCGGCATCCGGCACGAGGTCGCATCCGCCCTCGCCGCCCAGGTGCCCGTCTTCATCTATGGAGGCCTGCCATGACGAAGCGTGACCGCTTCCGACGCGCCGCGGGAGGTGCCCGTGCCGCATGACCGACTGACCGAGCCCGCGACCGAGATCGTCGGCGAATTCTGGGAATATCCGCTGGCCTTCGGCGACACGCTCTCGAGCCACGAATGGGTGCCGCTCCACATCAACCGCCTCCTCACCTCCCGCTTCGTCGCCCGAGCGCTCGCGGAGAACCGGCGCGCCGACATCGGCACCGCGCTCCTTCTCTGGTGCGAGGCGTTCCGGCAGGACCCGGCTGGGACGCTGCCCGACGACGATCTTGAGCTCGCCCGGCTTGCCGGCTATGGCGCCGATCTCGAGAGCTGGCGCGCGGCGCGCGAGGGCGCGCTCTACGGCTGGCGCGAGACCCATATCTCGAACCAGGAGGATGCGCCCGCCAACCGGCGGCTCGGCCATGTGATGATCGCGGGCATCGCCCGCGACATGCACCGCCGAAAGCGCGGGCGCGATCAGGCGCGGACGGAAGGCTCGAAGGCCGTGGCCCGGACGCGGGTGAAGAAGAAGCTCATCGAGATCAAATGCTCGCGCGCGGCCGAGAGCCCGGACGTGGTGATCGCCATCGCGGACTGGCTCGGCGAGCGCGATCTCTACATCACGACGGACAATGTGCGCGCGGCCTTCGAGGCCACGCGCGGCGGGCCCAAGCTCGTGCAGGTCCAGTGATTTACAGCTGTAATCGACTGTAATCGACTGCAATCCTTACAGTCATTTACAGCAATTTTCAGGGCCGGAACTGTAATTGCCCCACAGGACAGGACAGGACCGGACCCGACAAAACAGAACCTGACAAAACATCCCTTCTGCAGGGGTGAGAAGATCGGGCGGTGGCGAGACGGCAGGCGTGGCTGGCTGAGAAAGGGATGGCCATGACGAAGGCAGAGGAGCGAGCGCGGGTGAAGGCGCTGGTGGTGGACCGGCTGGAGCAGGCCGGGATGGTGAGGCGGCGCGGCACGGCCGCCGCGGCGCACGAGGCGGCGATGGCCAGGCTCTGCGAGCAGCTGGGCTACATGGGCGCGGAGAACCTGATGACCTTGGCCGAGGTGCTGATCGACAGCGCGGCCGATGGGGTCTGGCCGTCCGAGGTGCTGATCCGGCAGTTCGCGCGGGCCATCGAGGAGCCGCCGCCGGCGGAACGCCGGCTGGTCTCGAGCTGGCTCGCCTCGGTCGAGGGGCCGAAGGCCGAGGCCGGCGGGCATCTGGTGGAGCTCTACCGCTGGCTCCTCAAACACCCCCGCCCGCCGCTCGCGATGGACCTGCGCGAGATCCGGGATGAGGCGCAGGACAACGCCCGGCGCTGCGAGCTGATCCGTGACCGGATCGACCGCGAGACGGCCAGCCGCGAGGATCGCGACTGGCTCGAGCTCTATCTCCGCGACCGGGCACAGGCCCGCGCGCTGGTGGATGCCGGCCGCGCCCGCAAGGAAGGGGCGGCGGCATGATCGGGCGCGCGGTGGACCTGCGGCGGGCGGAGCCGGTGGCGCCGGCGCGGCGAGAGATGTCGATCGAACGCGCGCTGGTCTGGGCGTTCCAGACCGAGTGCGCCAGCGTCGACTTCGCGGAAGAAGCCGCACCGGACAGCTATCGCCGGACCGTCTCGTCGGCCTGGCTGGTGGCGCAGCGGGGCGCCATCGGGTGCCGCATCGACGGCGGCGGGCATTCCCTTCCGGCTGACGATGCCGAGATGATCGCCTCGGCCGTGGCGGCGCTTCCACCCGAGCATGGCGGGCGCGGCATGGCGGTGAAGATCGCCACCCTCGCGCGCGCCGGCCTGCGGCCCGACTGGATGCCCGACGCGCGCCCGCGTTGTGTGCCGCGCGACTGGCGCCGATCCAAGCACGGGATGTTCGCGCGGATCGAGGTGGTGGATGAGATCGTGACGGTGCACCGCGGGCGTCGGGTCGTCCGGCCGGTCGAGGCCTGTCCTGTTACCTATGCCCCGAGCCACGCCCAGATTGCCGCCGCGCGGCGCGAGTGGCTCACATGGTGGGGGGCTTTGCTGCATCTCGGCCACGAGCTGCGCACGCTCGACATTCTGTCGACGGTCCAACTGACCGCAGACATGCCGCCCATGTCGCCATGGCGCGAACAAGGCCGTTGACAGAACCTCACTGCATTGACATTTTGCAGGCGGACCGAATGGCGCCCGGAGAGCAGATGCTCCCCGGGCGCTTCTCGTTCCGGCACCCTCATCATGGCAGGCAGACATGGCGCGGCTCAGGAAGATCCCGCAGCGGCTTGCCTCTGCGCCCGCACGCCTGGCATCGCTCTCGGGCGGTGAAGGCCGCAGCAGGACCAAGGCTCGCCTGACGTTCTCGCCGTGGCGCGCGTGGTATAACACCGCCCGCTGGCGCGCGCTTCGGTGGGAGATCCTGACCGAGGCGGCCTTCACCTGCCGCATGTGCGGCCGGGTCGAGGGGAACACATCGCAGCTCGTGGCTGACCACAAGGTACCGCACCGGGGCGACGAGGCGCTGTTCTGGGATCGCGCCAACCTGCAGTGTCTGTGCAAGTCCTGCCATGACAGCGTGAAGCAGAGTGAGGAGCGCGCGCAACCGCAGGCGGCATCGCCTGCTCCGATGGCACGGCCCGATTGGTTCCGGCCCGTGCATGTGCCGCTCACCATCGTCGTGGGCCCGCCGGGCGCGGGCAAGTCGACGTGGGTGAGAGCAGCGGCCGGGCCCGACGATCTGGTCATCTGCTTCGAGCAGATCACGCGCCGGCTGCTCGGGATCGACCGCCTCAATGCGGAGGAGGGCGACCGGCGCATCGGTGATGTGCTGCGGGAGCGGAACGCCATGCTGGGCGACCTGATGCGGACCAGCGCCCGGGGCCGGTGGCCGCGCGCGTGGCTGATCCTGACCGAGCCCCGGGCCGATCACCGGCAATGGTGGGCGGATCGCCTGCGCCCCGAGCGGATCGTGGTGCTGGCCACGCCGGAGGCAGAGTGCGTGCGCCGGTGCCAGGCCGATGCAGCTGCCGGAGACCGGCGGAAGGCAGGCATTGCCGCCGTCGTCGCGAGGTGGTGGGCCGACTATGGCCCCCGGCCCGGCGAGGTTGTCATCGCGCCCCCTCACCCGGGGGTGGGTGCAAAGTCCAGGGGGTGACGGAAGGAAGACCCGCGCTCCCCTCACGCGGAGATTTTTTTCTTGGGCAGCGGAAAAACGACAATCGACCTTTTCGGCGACTGCGTCACGCTGCCCTCGGGGCGTCGTGGACGGCCGTCCCACCAGTGGTCCAAATCGAACGCCGACAAGGTGATCATGGGCTTGGCGCTCGGCTACAAGGCCGAGGAGATCGCGCAGGGGCTGCACATCTCCCTGCCCACCTTGCGGAAGTATTATTTTTCCGAGCTGCGTGCCGCGTCGATGCAGCGGACCCGGTTCGAGCTGTGGCGGGCGAAGGTCCTCGCGGATGAGGCCAACAAGGGCAATGTCGGCGCGCTCAAGGAACTCGGGAAGATCATGGAGAAGCGGGACCGCCTGGCGGCCGAGCAGGCCCTGAAGGATCAGCCGGCCGATGCGGCCGCTGAGCCGATCGGCAAGAAGGAGGCCGCGCGCCGCGCCGCGGCAGAGGCGGCGATCTCGGACCCGGACCTGACGCCGGGCGTCTACCGGACGCACTGACATGATCCACGCCAATGACTGGTCGACCGCCTGCCCGGACTGGGCGGAGCGGCTCGCGTCAGGCCGTTCGCTGATCCCCGACCTGCCGCTCTTCCGGCCGGTCGCGGACAAGGCGCTGCGGATCTTCAAGAGCCTGCGCGTCCCGGACATGATCGGGACGCCGACGCTGGGCGAGGTCTGCGAGGAATGGATCTTCGATCTCGTGAGGGCCATCTTCGGCGCCTACGACCCGGAGACCCGCCGCCGGATGATCCGGCAGTTCTTCGTGATGATCCCGAAGAAGAACGGGAAGTCATCGATCTCGGCCGCGATCATCGTGACGGCGGTGATCCTCAACGAGCGCCCGCTGGCGGAGGCGATCCTGATCGCCGAGACGCAGAAGATCTCTGACATCGCCTTCCGCCAGGCGGCCGGGATCATCCGGCTCGACCCGCGGCTCGACAAGGAAAAGGGCGGCATTTTCGACGTCAAGGATCACTCCAAGACGATCGTGCACATGAACACGGGCGCGGTGATCCGCATCCTCTCGGCCGATGGCGATGTCATTACCGGGTCGAAGGCGGCCTACATCCTCGTGGATGAGACCCACGTTCTCGGCCACAAGTCGAAGGCGGATGCGATCTATCTAGAGCTGGAGGGCGGGCTCGCCTCCCGGCCCGAGGGCTTCCTGCTGGAGATCACGACCCAGTCGAAGGTCCAGCCGCATGGCGAGTTCAAGCGGCGGCTGAAGCTGGCCCGCGACGTGCGCGATGGCAAGGTGAGCCTGCCGATCCTGCCGGTGCTCTACGAGCTGCCGGCAAAGATGCAGGCCGCCAAGGCATGGATGGACGACAGCACCTGGGGGCTGGTGAACCCGAACCTCGAGCGGTCGGTCTCGATCGACTTCCTGCGCGAGAAGTTCGTGGAGGCGCAGGAGGGCGGGGACGACAAGCTCGCGCTCTTCGCCTCGCAGCATCTCAACGTGGAGATGGGCATCGGCCTGCATTCCGACCGGTGGGTCGGGGCGGACTACTGGCTGAAGAATGCGGAGCCGGGGCTGACCTATGCGCAGCTGCTCGACCAGTGCGAGGTTGTCATCTTCGGCGGCGATGTCGGCGGCGCGGACGATCTCTTCGGCCTCACGGCCATCGGCCGGCACCGCGAGACCAAGATCTGGCTGACCCGCAGCTGGGCGTGGTGCGTCAGAGACGTGCTGAAGAACCGCAAGGAGATCGCGCCCCGGCTCGAGGAGTTGGAGAGGGCCGGAGATCTTCGGATCACCGACGGCGCGGCCGAGCATGTCGAGGAGGCGGTCGCGATCATCTGCGAGGCGCGGGACGCGGGCAGGCTTCCGGACGGCGTCTGCATCGGCCTCGACCCCTATGGCGTGGCGGCTCTCGTCGATGCGCTGGAGGCCGAGGGCTTCGATCCGTCCACGCGGATCGCACCCATCGGGCAGGGCTACAAGCTGAACGGCGCGGTGAAGGGACTGGAGCGGCGGCTGCTCGACGGCCGCATCCGGCATGCCGGCCAGCCGATGATGACATGGTGCGTCGGCAACGCGAAGGCCGAGCAGCGCGGGAACAATGTCTACATAACGAAGGAGGCTGCAGGTGTGGCTAAGATCGATCCCCTGATCGCCCTCTTCACGGGGGCGGTGCTGATGGACACCAATCCTCAGGCTCCGGCAAGCCTCGACGACTTCCTGTCCGACCCGGTGCTGGTGATCTGATGTCCCTCATCACCCGCCTCGCCGCGCGCCTGCCGGCGCAGGTCCGCAGCGCCGCCTACGACATCGAGAAGGAACGGCGTCTGTCGCTGTCGGACGGCCCGGGATGGTCGCGGCTCTTCGGCAGGACATCCGCGGCCGGCAAGCCGGTCACCCTCGACAAGGCCATGCAGCTCTCGGCCGTCTGGGCCTGCGTCCGTCAGACCGCCATGGCCATCTCGGCCCTGCCGCTCGCCGTCTACCGCAAGGAAGGCGACGGCTCCCGCAGCTCGGTGGATGACCGGCTGGCCGAGGTCCTCTCGGTCTCGCCGAACCTCGATCAGACCGCGCTCGAGCACTGGGAGGGGCAGGTGGCGTGGCTGATGGTCAACGGCAACTGCTATTCCGAGCGGACCGACATCGGCGGGCGGCTGTCGTCGCTGCAGCCGCTGCCGGCCAACATGACCCGCCCGATCCGCAACAGCGACGGCGAGCTCTTCTACCAGATCCTTGATCGGGGGAAGAGCGAGGTGCTGCCCCGCGACAAGGTCTTCCATGTGAAGGGGTTCGGCTTCGGCGGGGACATGGGGCTGTCGGCCATCAACTTCGGCGTCCAGACCATGGGCACGGCGCTGGCGGCCGACGAGAGCGCGGGCAAGCTCTTCTCGAACGGGATGCAGATCTCGGGGGTGCTGAAGGCAGGGCAAACGCTGACCGCCGAGCAGCGTCAGCAGATGCGGACGATGCTGGAGGCCTACCGCAGCTCGGACAACGCCTGGAAGGTGATGGTGCTCGAAGCCGGAATGAGCTTCGAGGCGCTGACGCTGAACCCCGAAGATGCCCAGATGCTGGAGACCCGGCGTTTCCAGGTCGAGGACATCTGCCGCTGGTTCGGGGTGCCGCCGATCGTGATCGGCCACGCGGGCGAGGGCCAGACGATGTGGGGCTCGGGCGTCGAGCAGATCCTGATCGCCTGGATGGAGCTCGGGCTGAACCCGGTGCTGCGGCGCATCGAGAAGCGGATCCAAAAGGATCTGATGCCCCGGGGTGAGCGGCTCTCGCGCTACGCCGAGTTCAACCGCGAGGGCATCCTCCAGATGGACAGCAAGGCCAAGTCCGAGTTTCTGACCAAGCTCGTCTCCAACGGGATCATGTCCCGCAACGAGGCCCGCGAGAAACTGAACCTTTCCCGGCGCGACGGCGGCGACGAGCTGACGGCTCAGACCGCGATGGCGCCGCTATCCGATCTCGGCCAGAAGGAGAATCAGGCATGAGCATGCGCGACCTGCCGAAGGCCGAAGTCTCGGCCAAGCCCGGCATCCGGAGCGATGTGAACGTGAAGGCGCTGCAGCGCTGGAACCCCGATGTTCGATCGGCCGCCGAAGAGGGCGATGCCAGCATCTCGATCCTCGAGGTGATCGGGCAGGACTTCTGGGGCGACGGCGTGACGGCAAAGCGGATCAGCGGGGCGCTCCGCGCAATCGGCGACCGGGATGTGGTGGTCAACATCAACAGCCCGGGCGGGGACTTCTTCGAGGGCCTCGCGATCTACAACGCGCTGCGCGAGCATCCCGCGAAGGTGACGGTCCGGGTGCTCGGTGTCGCCGCCTCGGCCGCCAGCGTCATCGCCATGGCGGGCGACGAGATCCGCATCGCCCGGGCGGGCTTCCTGATGATCCACAACACCTGGGTGCTCGCGGCCGGCGACCGCCACGCGCTGACCGAGGTGGCGCAGTGGCTCGAGCCCTTCGATGCCGTCTCGGCCGACATCTACGCGGCACGCAGCGGGATCGATGCAAAGAAGATCTCGGCCATGCTCGACCGGGAGACGTGGATCTCGGGCGGCCAGGCGGTGGAGCAGGGCTTCGCGGACGGGCTGCTGTCGGCGGACGAGCTGGACCTGTCCGATGCCGACGAGGGGCGGGCCTCCGCCCGCGCCGAAAGGAAATTCGACGTCCTCGCCAGCAAGGCGGGGGTCTCACGCTCTGAGGCGCGCGAGCTGCTCGCCGCCCTGAAGGGGAGCAAGCCGGGCGCTGCTCCTGCCAGCATGCATGACGCTGCGGTCGCTGCGGAGGTGCGGAACCTCCTGAACTTCGCGAAAACCATCTGATCGGAGATCACCATGAAACATCTGAACATGCCGCTGGTGGCGTCCGCGCTCCTCGCGGCGACCCAGCCCCATGCCGTGCTCTGTGCCCCCCGGGCAGAGGGTGGCGCGGGCAACCTCGAAGCCCTGCTGAAGGAGGTCAAGCAGGAGCTCGACCGCATCGGCAATGACGTCCGCAAGACGGCCGACACCGCCTTCCAGGAGGCGAAGAACGCGGGCAAGCTCTCGGACGAGACCAAGGTCAAGGCCGACAGTCTGCTGACCGCGCAGAACGCCCTGCAGGATTCGGTCGCCAAGCTGCAGCAGCGGCTGGAGGACATGGATGCGCGCAACCTCGACATCGAGCAGCGCATGTCCGGTCGCCGGGGCGGGGGCACTGCGCGCCAGACCCTCGGGCAGGCAATCTCGATGGACGCCCAGGTCAAGGCCTTCAACGGCAAGGGCACCATCACTCTCATCGTGCAGAACGCGATTACCTCGGGTTCGGCCTCGGCCGGCCCGCTGATCGCGCCCCAGCGCGAGACCGAGATCGTGGGTCTCCCCCGCCGGCAGGTGTTCGTCCGCGATCTTCTGAGCCGGTCCACCACCAACTCGAACCTCGTCCAGTATGCCCGGATGAAGGCCCGCACCAATGCCGCCGGCGTCGTGGCGGAAGGCGCGCTGAAGCCCGAGAGCGGGCTGGAATACGAGGCCGCTGACGCTCCGGTGCGAACCATCGCGCACTGGATCCCGGTTTCGCGGCAGGCTCTGGAAGATGCCGACCAGCTGCAGGGCGAGATCGACGGCGAGCTTCGCTACGGTCTCGACCTGACCGAGGAGGCGGAGATCCTCTCGGGCGACGGCGAGGGTCAGCACCTGTCGGGCCTGATCACCAACGCCAGCGCCTATTCCGGCGTCTACGAGCCCGCGGGCGCCACGGCGATCGACAAGCTGCGCTTCGCGCTGCTGGAGGCGAGCCTCGCTCTCTATCCGGCGGACGGGATGGTGCTCAACGAGATCGACTGGGCGCTGATCGAGACGGCCAAGGATTCCGAGAACCGCTACATCTTTGCGAACCCGCTGCAGCTGGCCGGTCCCGTGCTCTGGGGCCGCCCCGTGGTGCCGACGACCGAGATCGACGAGGACAAGTTCCTCGTGGGGGCCTTCCGCGCGGCCGCCACGATCTACGACCGCATGGACACCGAGGTGCTGATCTCGTCCGAGGACCGGGACAACTTCGTGAAGAACATGCTGACCGTGCGGGCCGAGAAGCGGCTGGCGCTGGCCATCAAGCGTGCGGCCGCGCTGATCTACGGCGACTTCGGCCGCGTCGCCTGATCGCGGCGCAGGTGACCGGGGCGGGCCTTCGGGCCCGCCTCTCAGTTCCGATCCCCTTTGGAGAGAGACCATGTCGAAGATGACGGTCACGCCGCTGCGCATGCAGGTCGGCGATTACGGGCGTGCGCGTGCGCATGTGCCGATCAAGGTGGATGCCGACCTCGGCGCCCGCCTCGTGAAGGGCGGCAACTTCGTGGAGGGCTTGTCCGATGTCGCGAAGAAACGCGCCGCCGGCATCAAGGCCCGGCTCGATGCCGAGACCGCAGCGGCGCGTGAGGCCGAGGCCGCGCGCCAGAAGGCCGAGAAGGAGGCCGAGCGCGAGGCGGAAGCCGCCCGCAAGAAGGCCGAGGTCGCGCGGAAGGAGGCCGAGAAGGAGGCGAAAGCCGCGGCCGAGAAGGACGCGGCAGCTGCTCGTGAGCGGGCGGAGCAGGAGGCGAAGGCCGAGCAGCAGCGCCAAGCCGATGCCGCCAAGGCGGCCGGCGGTGAGGGGGGCGGCTCCGAATGATCACCGATCTGGCCATTCTGAAGGAGCATCTTCGGGCCAGTGCCGAGATCCAGAATGGCCTGATCGTTCTCTATGCCGAGGCGGTGGAGGAGCGGCTGACGGCATTCCTGGATCGCCCCGTCTACGCCGACGCCGCGCTGATCCCGGCGCCGGGTGACCCCGACCATGATCCGCTTGCCATTGTTGCGCCGCGCGCCTTCCACGTCGCGGTGATGCTGCTGGTCGGCATGATCTATGACGGCGAGTGGACGCAAGCCGCCCCCGACCTGCCCGGGCCGGTGCGGAGCCTGATGGAGCCCTATCGCGCGTGGCGGGATCTGCCGGAGGAGCGGCCATGAAGGCCGAGAAGCTCGACCGCAGGATCCAGTTCCGCCGGGCTGCCCTGGTGGACGATGGCTTCGCCGAGGTTGAGACGTGGTCCGATCACGGATCGCCGGTCTGGGCCGCCCGCGCGGATCTCAGCGACGGCGAGCGCTGGCGCGCGGCCGAGGTCGCGGCCGGCGTCACGACCCGCTTCACGGTGCGGTGGTCGGCCTTCGCGGCCGCCATCAATCCGAAGGATCGTCTCGTCTGCGAAGGTCGGGAGTTCGACATCACCGGTATCAAGGAGCCGCCGGAGACCCGGCGGCAGTGGATCGAGATCACGGCGGCGGCGAGGATCGACTGATGCGGGTCAAGGTCGAGGGGCTGAAGGAGCTCGAGGCGCAGCTGGCGCGCCTGAGCAAGGGGGCCGCTCGGGGCGCCCTGCGCCGCGCCGGTGTGAAGTCCCTGCAGCCGATGGCGGAGATCGCCCGCGGTCTCGCCCCGAAAGACACAGAGGAGTTGTCGAAGAGCATCACCGTCGCGGCCAAGGCCGTGGGCGGCGGCGCCGAGATCGGGAAGGCGGAGTTCGGAGCCGTGATGCGCGCTGGCGGCTCTGTGGGAGAGGCCCGAGCCGCCCTGCGCGATGCGCGCCGGGCGGCAACCGACGCGGGCAACCTCAGCGCGATCGAGCTCTACATGGGCCCCATCAAGGCGTCGAAGCGAGCGGCCATCAAGGCGGTGGTGCAGGAGTTCGGCTCCATCAAGCAGGCACCGCAGTCCTACATGCGGCCCGCATGGGATCAGGATCGAGAGGCCCTGCTCGGGCGGCTCAAGGTCGAGATCTGGGCCGAAATCCGGAAGGCCATCGTCCGCGCCGAGAAGCGCGCGGCCCGGGCCGCAGCGAAAGCCGCAGGGGGCTGACCCATGGAAGAAGCTCTCCGCGCGCTCCTGCTGGTCTCCAGCGGGGTGACGGCGCTTGCCGGCAGGCGGGTCAACTTCGGCCGCCACCCCCAGGGCGACCCGCTCCCCGCGCTCGTGCTCAACACCATCAGCGACCGCGAGGGGCTGACTGTCAGCGGGCCTGACGGAGTGCAGCAGGCCCGCGTCCAGATCGACTGCTACGCCGAAAGCTACGGCGCAGCGAAGCAGCTTTCCCGCGCCGTGCGCGCCGTGCTCCACGGCCACAGCGGCGGCGGGTTCCGGGGTGTCTTCCTCGACGGCGCGCGCGACCTGCGCGAGCCCGGCGACGACACCGGGCGGCCCTTCCGGGTCTCGCTCGACTTCCTCACCATCTACTCAGCATAGGAGGGCCACATGGCCTCGAAACAGATCATCGCCTACGGGGCCAAGGTGGAGCGCTCCACCGATGGGACCAGCGGCTGGACCGTGATCCCGGAAGCCAAGGGCATCGCCGTACCTGTCGTCGAGCAGGACTATCAGGATGTGACCTCGCTCGACAGCGAGGGCGGCTACCGCGACTACATCAAGGGGCTGAAGGACATCGGCCAGATCACCATCCCGATGGGCTACACCTCGGCGGGTTACGCCGCCATGATCGCCGATCAGGAGGCCCCGAACCCCATCCACTACCGCGTGACGATGAAGCCCGCCCCGGACCAGAGCACCGGCGACGTGTTCGAGTTCCGCGGCTTCCCGGTGCCGCAGATCGAAGCGGGCGACCTCGGCGCCCCGGTCGGCATCAACCTCAACATCCGCGGGACCGGCGCGCCCACCTGGACGCGGGGGACGGAGGCATGAACATGATCCGTGGCGCGATCCCGTTCGAGGCCGAGGGGCGTGAGCGCTTCATCCGGCTCACCACCAATGCCCAGGTCCGCTATCAGGAGCGCGCGGGGGAGACCCTCGTCGATGCCATCGTGGCCATGCAGGGCGAAGGCTCGCAGGGCGACATGCTGCGGCTCCGGCGGCTGATCTGGGCCGGCATGGGCCACGAGGGGCTGAGCGAGGATGCGGCGGGCGACCTGATAGACGAGATCGGGCTGGCCGAGGCCTCGCGGCTTCTGGGCGATGCGATCCGCGCCGCCTTCCCCGAGTCGGCCAAGGCCGAGGCAGAGGTCGAGAACGCCGGGGGAAACGCCCCGGCGCCGGCGAAGCCCAAGGCCAAGCCGGCCGCGGCCTGATCGAGGACCTCCTCGCCCGGTGGCTCGCTGCCGGGCAGGAATACGAGCTGTTCTGGCGGCTCACTCCGCGCGAGCTGATCTCGATCCTCGAAGGCGACTACAAGCGGCGCCGGCGCGAGATCGAGGACCGGCGCGTGCTGCAGCACGAGCTCGCGACCCTCGTGGCCTTCGCCTTCCACCAGCCGGGCAGGATGCCGGACTACAAGCCGCCGGCAGAGGCAGGCGCGCCGCCCGCGCAGAAGGCGGACGCCGGCTGGGATACCGACCACGAGCGGGTGCGCGGCCTGCTCATCGGCATGGCGCTCAGAGGGCGCGGGTAATCCCCGTGAGGGCGCGGCCCGCCGCCGGACCAAGACCGCCCCTTGCAACATGAACCGGCCGCCGTCCCCGGACCAGCTCCGCACAAACGCAACTGACCAGGCCGCCTCCGGGCGGCCTTCTCCATGAGGAGGCTTCCCATGTCGGCAGTCATCGGCGCACTCCGGGTCAACCTCGGCCTCGACAGCGCAGAGTTCCAGAAGGGCCTGAAGAGGGCGCAGTCCTCGCTCGGGGCGGCGGCGAAGGCCTTCGGCGCGCTCTCGGCCATCGGCGCCACGGTCGGCGCCGCCATGACGGGCATCGTCGTGCCGACCGCCCGCGCGGCGAACGAGATCTCGCGGCTGGCGCAGGTCGCGAACACCACGCCCGGGACGCTGCAGCGCTGGTCCGCCACCTCGAAGAGCGTGGGCATCGAGCAGGAGAAGCTCGCCGACATCCTGAAGGATGTGAACGACAAGGTGGGCGACTTCCTCTCGACCGGCGGCGGCGAGATGAAGGACTTCTTCGAGAAGATCGCCCCGAAGGTCGGGGTCACGGCGAAGGAGTTCCGCAATCTGTCGGGGCCGCAGGCGCTGCAGCTCTATGTCTCGAGCCTCGAGAAGGCCGGCGTCTCGCAGGCCGAGATGACCTTCTACATGGAGGCCATCGCCAATGACGCGACCCTGCTGCTCCCCCTTCTGCGAAGCAACGGCGCCGAGATGGAGCGGCTCGGGGACCAGGCGGCCGACCTCGGCGCGATCCTCGGCGACGATGCGGTGGCGGCCCTCCGCGACGCCCACCTCGCGCTCGGGCAGATGGCCACGGCGGTCTCGGCCGCCCGCGACCGGATCGCGGCCGAGCTCGCCCCCGCCGTCCAGGCCATGGCCGTGGCCTTCACGACCTCGATGCGCGAGGGCGGGCTGCTGCGCGGCGTGATCGACGGGATCGGTTCGGTCGCGGGCGCGGTGGCGGACAACATCGACCGGCTCGCGGTCTACACGGCGACGGCGGCCGCGGCGCTCGCCGTCTCCATGACCCCGGCCCTGATCGTCGCCACCCGCGCGGCGTGGGCCTTCGTGGCCGGATTGGTCGCGACGCGGGCGGCCCTGATCCGCACCGGCCTCGGCATCGCCGTGGTCGCGGTGGGCGAGCTGGCTTATCAAGTGACGCGCACGGTCAAGGCGGTGGGCGGTCTCGGCAATGCCATGGAGATCATGGGCCGGGTCGCGAGGGGCGTCTGGGAGGGGATGAAGACCAGCGCCACCGCTCTCGGCCCGGCGCTGAACGCGGTCTGGAAGACGGTCGAGGCGGGCTTTCTGACCATGATCGCCGCCGTCGCCCGCAAATGGACCGACTTCCTGCGCGACCTGTCGCAGGGCATGGCGGCCGTCCCCGGCATGGGCGAGGCGGCGCTCGAGGTGGGCAACATGGCGATCCTGGCCGGGTCCGGGGTCCATGCCCTGACCTCGGCCGCCCGGGGCGCGCGCGACGAGGCGCAGGCGCTGAAGGAGGAGGCCCGGGCGCTGGCCTCCGAGGGCTTCGACGCCGCCGCTGCCGCCGCGGCCGAGCTGCGCTCCGCGGTGACGGGCACCGGGGAGGCCGCCGACGGGGCGGCGGCGCCCGTCTCCGACCTCGGCGACAACGTGTCGGATCTGGGCGCCGCCGCGGGCGGGGCGAAGCAGAAGCTCTCCGATCTGGTCACGGCCGCGAAGGCGTGGAAGGAGCGGCTGAAGACGCCGGTTCAGAAGTACCGCGAGGAGATCGCGAAGCTCGGCGAGCTCTCGAAGAAGGGCCTGCTCTCGGCCGACGAGCATCGGCGCGCCATCGGAGAGCTGAACCGGGAGCTCGGCGAGGGCATTCCGATCATCGGCGATGTGGCCACGGCCTGGGGCGAGTTCGTCACCGGCGGGTTCAAGGACTTCAAGGGCTTCGTGAGCAACGTCCTCGGGAGCTTCAAGAGCATGCTGGCCGAGATGATCGCCACGGCGGCGCGTAACCGGATCGTCATCGGGATGGGGCTCGGCGGCGGTGGCGTCGCCGGCACCGCGGCCGCGGCGGGCGTGCCGGGCATGGGCGGCGGCGGCCTCGGGATGCTGGGGAGCCTCTTCGGTGGTGGCGGCGGTGGCGGGGTCCTTGGCATCGGCAATGCCTTCAGCGCCTTCGGCAGCGGGGCTCTGGGCTCGCTCGGGAACTTCTTCTCCGGCGGCCTCTCCGGGGGCTTTGCCTATATCGGCCAGTCGCTCAGCATGGCGACGAGCGGTCTTGTCGGCTTCGCGCAGGCGGCGGGCGCCATTCTGGGCCCCATCGCCGCCGTGGCGGCGGCCGTCTCCTTCTTCGGCTCGAAGACGAAGCTCCTCGATGCAGGGCTGCGCGTCACCGTGCGCGAGCTGAATGCCATGGTCGAAACCTACAAGAAGGTGGAGAAGTCCCGGTTCGGCGGGCTCTCGAAGTCGCGGCGCACCAGCTATGGCCTCGCAGACGGGGCGGTGGCGAGCCCCATCGTCAAGGCCGTGAGCCAGATGCAGGCCTCGGTCATGGATGTTGCGGACACGCTCGGCATCGGCGCCGAGGCCTTCAAGGGCTTTGCGGCGTCTGTGAAGTTCTCGACCAAGGGGCTCTCCGACGAGGAGATCGGCGCGAAGCTGCAGGAGAAGCTGACCGAGCTCGGCGACAGCTTCGCCGCGCGCGCCTTCGGCTATGTCGGAAAGAACGACCAGGCGATCAAGGATCTCGAGAAGCGGATCGCCGAGGGGACGTCCGATGCGGTGGTGAGCGGGCTCAAGGGGTCCATCGGCGACCGGCTTCTCTCCGCCTTCTTCGGCCGGAAGCGGCAGGGCGATCTGGCAGACCTGATCGCGGGCAACAGCCTCGTCTCGACCCGCCCCGAGCTCGCCGCTCTGGTGAAGGAGGGCGAGAGCTTCGTCGAGGCCCTGCAGCGGCTGAGCGCGGCGATGACCGGGGTCAACGGCGTCATGGACACGCTGGGCATGAGCTTCCGGGCGGTGGACATGGTGACCGCCGGCATGGCCTCGGATCTGGCCGCGCTCTTCGGCGGGCTTGAAGGAATGGTCTCCGCCACCTCCTCCTATTACCAGGCCTTCTATAGCGAGGCCGAGCGGATGGAGACCGCGACGCGGCAGGCGACCGAGGCGCTGGCCAAGCTGGGCGTGGCGCTGCCCGCGACCCGCGCCGAATATCGCCGGCTGGTCGAGGCGCAGGATCTCACCACCGAGCGGGGGCGCGAGCTTTACGCGGCCCTCGTGGGCATGGCGGGCGTCATGGATCAGATCCTGCCGAGCGTGGCGAGCCTCTCGGCCGAGCTGGCGGGCCTCGTGGGCACGATCAGCACGGATCTCGACGGGATGATCTCCGGCGCGGCCGAGGCGCAGCGGGGGGCGGCCGCGGCGGCGAAGGGCTGGTATCAGGTCACCGTGGCGCTGCGCGACTATATCGGCGATCTGCGCAGCGCGGCCTCCGAGCTGATCTCGCCCGCGGTGGCCGCGGCCCAGTCGCAGGCGCGCTATCAGACGATGCTGGCAAGCGCGATGGCGGGCGATCAGGAGGCGGCCAAAGCCGTCTCCGGCGCGGCCTCGGCCTATATCGAGGCGGTGCGCGGGCAGGCCCGGTCGGCGGTGGATGTGGCCCGCGCGCAGGCGCAGGTGCTCTCCGACCTGCAGCTCCTGCAGGGCGTGACCGGGCTTGAGGGGGCGAAGGAGGATGTGCTGGCCAGCCTCTATCGGGAGCAGGTCGATCTCCTGACCGAGGTGCGCGATTACCTCGCCGGCGGCGAGGCGTTGAAGCCCGAGCAGATCGCGGCGCTGAACGCGCAGCTGGGCTCGCTCGAGGGCGCCATCGCGGCGGCGAAGGAGATCTCCTACGCCGCCCTCCGCGAGCGGATCGACGTGACCGTGGGGCTGATGGCGACGGCGGACATCCCGGCCGACCTGCGCCGCATCCTGAAGAATGCCACGAGCGGCGTCGAGGTCTCGCTCGACATGGTGCTGCGGCGGATGGATCTCACGCCGGATCTGGTCTGGATCGCGGCGAAAGCCTCCTCCGACCACCTCGCGCGCATCAGCTATCTAGCGAAGACCGACGCGCTGCCCGACGATCTGCGCGCGCTCGCCGCCGTTCGCGTGGCGCAGTCGGTGCGCCGGCTCGCGCTGGTGATGGACAAGCCCGCCTCCGATCTCGGCATGGCGGAGCTCCTCAAGGCCCTCGGCGCCCAGGGCGGCCGGATCACCCTCGGCGGCAGCTTCGCCTTCGACCCCTCGACCGGCTTCTCGAGCTGGTTCGAGACCACGACGAGGGGGGCGATCACGGCGCCGATGACGGCGCTGCGCACGGCACTCGACGATCTGAGGGACGCGATCCTCGCGGAAGGGCGCGCGGCCGGGCAGCGCGAGCGCGGAGCGGCGCTGTCGGCCTTCGCCGGGGGCCTCGCGACGAACGCGGCCGGCGACATCCTTGCCACCGACAAGCAGATCCTGGCGATGGCCGCCAAGGCCGGGATCTCGACCGACGGCAAGACCATCGGGCAGGTCATGCGGGCCATCGAGGGCTTCTCCCCGCTCGACGGGATCGAGACGATCCGCCGGCTGCCGGGGAGCCTGAAGGACTACCTCTGGGGCCTCTTCCAGCAGCGGCAGGGCCGGATCCCGCTCGATACAGCCGATTATCTGCGGCTCTACCCGGACGTGGCCGCGGACGAGTACGGCTACGACCCGACCATCCACTACCGCAACCACGGCCGTGAGGCGATCCTCGCGGGGCTGCGGCCGTTCAAGCCGGAGGTGTTCGACTGGTCGGCCATCGGCCTCGACGTCCCGGGCTTCGCCGCGGGCGGCCTGCATGCGGGCGGCCTGCGCCTCGTGGGCGAGCTCGGGCCCGAGCTCGAGGCCACCGGCCCGAGCCGGATCCACAGTGCGGGGCGGACCGCGGACATTCTCGGCGGCGCCGCCATGGGCGCCTCCGAGGTGGCCGGCGCCGTGCGCGATCTGCAGGCCGAGCTCGTGGCTCTGCGGGCCGAGAATGCCCAGATCGCGCGCGAGCTCGCCGAGATGAAGGTCTGGGCCCGCAAGGGGGCCGAGGCCTCCACCGCCACGGCGAAGGACCTGCGCCGGATCGGAACGGTGGGCGTCCGGATCGACCCGACGGAGGCCGTCTGA
Protein sequences of DBSCAN-SWA_5 >NZ_CP051468|2148778:2172343|2157399_2158299_+|WP_011337212.1|DBSCAN-SWA MARLRKIPQRLASAPARLASLSGGEGRSRTKARLTFSPWRAWYNTARWRALRWEILTEAAFTCRMCGRVEGNTSQLVADHKVPHRGDEALFWDRANLQCLCKSCHDSVKQSEERAQPQAASPAPMARPDWFRPVHVPLTIVVGPPGAGKSTWVRAAAGPDDLVICFEQITRRLLGIDRLNAEEGDRRIGDVLRERNAMLGDLMRTSARGRWPRAWLILTEPRADHRQWWADRLRPERIVVLATPEAECVRRCQADAAAGDRRKAGIAAVVARWWADYGPRPGEVVIAPPHPGVGAKSRG >NZ_CP051468|2148778:2172343|2156055_2156634_+|WP_017140108.1|DBSCAN-SWA MAMTKAEERARVKALVVDRLEQAGMVRRRGTAAAAHEAAMARLCEQLGYMGAENLMTLAEVLIDSAADGVWPSEVLIRQFARAIEEPPPAERRLVSSWLASVEGPKAEAGGHLVELYRWLLKHPRPPLAMDLREIRDEAQDNARRCELIRDRIDRETASREDRDWLELYLRDRAQARALVDAGRARKEGAAA >NZ_CP051468|2148778:2172343|2152767_2152965_-|WP_023003555.1|DBSCAN-SWA MPATAIRYPASAPVARFALLDLTEPQLRRLLAAAEAGLPQLEPIDDETEAELAAILGQAREALGA >NZ_CP051468|2148778:2172343|2149803_2150034_-|WP_011337223.1|DBSCAN-SWA MSAKSSLTFAPRLLPAPQAAAYLGVSATTLRGLPIPRKVLGGKRLYDRLELEAYADSLSTEGEVKANSCDGAFGIT >NZ_CP051468|2148778:2172343|2164430_2164772_+|WP_011337205.1|head,tail|DBSCAN-SWA MITDLAILKEHLRASAEIQNGLIVLYAEAVEERLTAFLDRPVYADAALIPAPGDPDHDPLAIVAPRAFHVAVMLLVGMIYDGEWTQAAPDLPGPVRSLMEPYRAWRDLPEERP >NZ_CP051468|2148778:2172343|2163999_2164434_+|WP_011337206.1|DBSCAN-SWA MSKMTVTPLRMQVGDYGRARAHVPIKVDADLGARLVKGGNFVEGLSDVAKKRAAGIKARLDAETAAAREAEAARQKAEKEAEREAEAARKKAEVARKEAEKEAKAAAEKDAAAARERAEQEAKAEQQRQADAAKAAGGEGGGSE >NZ_CP051468|2148778:2172343|2166051_2166495_+|WP_011337201.1|DBSCAN-SWA MASKQIIAYGAKVERSTDGTSGWTVIPEAKGIAVPVVEQDYQDVTSLDSEGGYRDYIKGLKDIGQITIPMGYTSAGYAAMIADQEAPNPIHYRVTMKPAPDQSTGDVFEFRGFPVPQIEAGDLGAPVGINLNIRGTGAPTWTRGTEA >NZ_CP051468|2148778:2172343|2162627_2163929_+|WP_011337207.1|capsid|DBSCAN-SWA MKHLNMPLVASALLAATQPHAVLCAPRAEGGAGNLEALLKEVKQELDRIGNDVRKTADTAFQEAKNAGKLSDETKVKADSLLTAQNALQDSVAKLQQRLEDMDARNLDIEQRMSGRRGGGTARQTLGQAISMDAQVKAFNGKGTITLIVQNAITSGSASAGPLIAPQRETEIVGLPRRQVFVRDLLSRSTTNSNLVQYARMKARTNAAGVVAEGALKPESGLEYEAADAPVRTIAHWIPVSRQALEDADQLQGEIDGELRYGLDLTEEAEILSGDGEGQHLSGLITNASAYSGVYEPAGATAIDKLRFALLEASLALYPADGMVLNEIDWALIETAKDSENRYIFANPLQLAGPVLWGRPVVPTTEIDEDKFLVGAFRAAATIYDRMDTEVLISSEDRDNFVKNMLTVRAEKRLALAIKRAAALIYGDFGRVA >NZ_CP051468|2148778:2172343|2153410_2153701_-|WP_017140110.1|DBSCAN-SWA MPSRLPKIDQAGVARRLEALRDALGLDKGEFAQSFGLDPSSYSKVLQGKKPLKSEYGHAIAEGWGVSMDFIYRGDMSRIDETLRTKVMQNLTAPMR >NZ_CP051468|2148778:2172343|2167336_2172343_+|WP_017140106.1|DBSCAN-SWA MSAVIGALRVNLGLDSAEFQKGLKRAQSSLGAAAKAFGALSAIGATVGAAMTGIVVPTARAANEISRLAQVANTTPGTLQRWSATSKSVGIEQEKLADILKDVNDKVGDFLSTGGGEMKDFFEKIAPKVGVTAKEFRNLSGPQALQLYVSSLEKAGVSQAEMTFYMEAIANDATLLLPLLRSNGAEMERLGDQAADLGAILGDDAVAALRDAHLALGQMATAVSAARDRIAAELAPAVQAMAVAFTTSMREGGLLRGVIDGIGSVAGAVADNIDRLAVYTATAAAALAVSMTPALIVATRAAWAFVAGLVATRAALIRTGLGIAVVAVGELAYQVTRTVKAVGGLGNAMEIMGRVARGVWEGMKTSATALGPALNAVWKTVEAGFLTMIAAVARKWTDFLRDLSQGMAAVPGMGEAALEVGNMAILAGSGVHALTSAARGARDEAQALKEEARALASEGFDAAAAAAAELRSAVTGTGEAADGAAAPVSDLGDNVSDLGAAAGGAKQKLSDLVTAAKAWKERLKTPVQKYREEIAKLGELSKKGLLSADEHRRAIGELNRELGEGIPIIGDVATAWGEFVTGGFKDFKGFVSNVLGSFKSMLAEMIATAARNRIVIGMGLGGGGVAGTAAAAGVPGMGGGGLGMLGSLFGGGGGGGVLGIGNAFSAFGSGALGSLGNFFSGGLSGGFAYIGQSLSMATSGLVGFAQAAGAILGPIAAVAAAVSFFGSKTKLLDAGLRVTVRELNAMVETYKKVEKSRFGGLSKSRRTSYGLADGAVASPIVKAVSQMQASVMDVADTLGIGAEAFKGFAASVKFSTKGLSDEEIGAKLQEKLTELGDSFAARAFGYVGKNDQAIKDLEKRIAEGTSDAVVSGLKGSIGDRLLSAFFGRKRQGDLADLIAGNSLVSTRPELAALVKEGESFVEALQRLSAAMTGVNGVMDTLGMSFRAVDMVTAGMASDLAALFGGLEGMVSATSSYYQAFYSEAERMETATRQATEALAKLGVALPATRAEYRRLVEAQDLTTERGRELYAALVGMAGVMDQILPSVASLSAELAGLVGTISTDLDGMISGAAEAQRGAAAAAKGWYQVTVALRDYIGDLRSAASELISPAVAAAQSQARYQTMLASAMAGDQEAAKAVSGAASAYIEAVRGQARSAVDVARAQAQVLSDLQLLQGVTGLEGAKEDVLASLYREQVDLLTEVRDYLAGGEALKPEQIAALNAQLGSLEGAIAAAKEISYAALRERIDVTVGLMATADIPADLRRILKNATSGVEVSLDMVLRRMDLTPDLVWIAAKASSDHLARISYLAKTDALPDDLRALAAVRVAQSVRRLALVMDKPASDLGMAELLKALGAQGGRITLGGSFAFDPSTGFSSWFETTTRGAITAPMTALRTALDDLRDAILAEGRAAGQRERGAALSAFAGGLATNAAGDILATDKQILAMAAKAGISTDGKTIGQVMRAIEGFSPLDGIETIRRLPGSLKDYLWGLFQQRQGRIPLDTADYLRLYPDVAADEYGYDPTIHYRNHGREAILAGLRPFKPEVFDWSAIGLDVPGFAAGGLHAGGLRLVGELGPELEATGPSRIHSAGRTADILGGAAMGASEVAGAVRDLQAELVALRAENAQIARELAEMKVWARKGAEASTATAKDLRRIGTVGVRIDPTEAV >NZ_CP051468|2148778:2172343|2150109_2151921_-|WP_011337222.1|DBSCAN-SWA MAKDFIFRGDLRLIPLAELRLSPLNSRQEIAAEEVEAMAESLAVAGLLQNLIGHLTPQGVEIVGGGTRLRALQRLAAEGWSRHPDLIPIDPVPVKVTADLQEAVAWAGTENSARSALHPADEVRAYAAMRDRGASLSRIARSFARSEAHVERRLKLADLPAEALAALRANEISLEMAKALTLAPSGERCLEVLGSVRGRDMRPEQVRRELTPGTVPSTDRRVLFVGLEAYLAAGGTSQRDLFADRTLLEDEALLDRLFAEKGAAEAERIRAEEGWEWATWVPEEYVSWTVTQKLVRLYARPGKLSDGEEAELAAFEEREADDTLDEAGRARLAELEARREGGFTDAQRASAGIFVYCSSRDGLSVERAYQQPRAVPRGAAEAAPDLPQSLIEDLHRIRLGALQARLMDQSELMLDLLAWSLGGGLRPWARPLAISPTDQPIVPEKGEGTSYPPRLAARMEPNTSLGPDGTPAEFEAFRAQGKKHRNQILTEALARTFCTGSSGLSATLARQLGVEVRRIWTPTAQGFLGRCSASYLDRLWGELVPAAEADQSFQKLKKGEKAKRLEALFADPATREALGLSREDCAKIDAWVPAELGWPEGDK >NZ_CP051468|2148778:2172343|2158455_2158818_+|WP_017140107.1|DBSCAN-SWA MGLALGYKAEEIAQGLHISLPTLRKYYFSELRAASMQRTRFELWRAKVLADEANKGNVGALKELGKIMEKRDRLAAEQALKDQPADAAAEPIGKKEAARRAAAEAAISDPDLTPGVYRTH >NZ_CP051468|2148778:2172343|2152438_2152771_-|WP_011337220.1|DBSCAN-SWA MSRILPLTPRNLNAARHRAEHAPTPEVREAASRLLTEHELAVATGRPLHRARPRPAFRTSRPSPLGSRLTGFLVGALALLGFLFVATVLWAHATDVAETMRQQASAMRGM >NZ_CP051468|2148778:2172343|2154616_2155168_+|WP_023003553.1|DBSCAN-SWA MTAAAPLPVQDAATSPGAAASGAFRSSEWAALRRHPAGRADLLRWGATPALVARHARWGRPVYLASPYTLRAVGPDGRWSRDLSEAAMADAAREVARLLEVGVTAISPVVLSAAALHATMFPRLRIDPFALALWEDWCRPLLTVCAAVVVPEIRGWSDSTGIRHEVASALAAQVPVFIYGGLP >NZ_CP051468|2148778:2172343|2152043_2152439_-|WP_017140113.1|DBSCAN-SWA MEAAPHPAPRSGPVPVESLNAADLDRMAAAIDSGRIRRPPMKETPADRAVAEAAYQVAADELRQFIERYEQLEAEKKEIGGQQKELMAEAKGRGYAPKILRMIVALRKRTPDDIAEEETLLELYKAALGMA >NZ_CP051468|2148778:2172343|2166491_2166878_+|WP_011337200.1|DBSCAN-SWA MNMIRGAIPFEAEGRERFIRLTTNAQVRYQERAGETLVDAIVAMQGEGSQGDMLRLRRLIWAGMGHEGLSEDAAGDLIDEIGLAEASRLLGDAIRAAFPESAKAEAEVENAGGNAPAPAKPKAKPAAA >NZ_CP051468|2148778:2172343|2155207_2155858_+|WP_023003551.1|DBSCAN-SWA MPHDRLTEPATEIVGEFWEYPLAFGDTLSSHEWVPLHINRLLTSRFVARALAENRRADIGTALLLWCEAFRQDPAGTLPDDDLELARLAGYGADLESWRAAREGALYGWRETHISNQEDAPANRRLGHVMIAGIARDMHRRKRGRDQARTEGSKAVARTRVKKKLIEIKCSRAAESPDVVIAIADWLGERDLYITTDNVRAAFEATRGGPKLVQVQ >NZ_CP051468|2148778:2172343|2165100_2165658_+|WP_011337203.1|DBSCAN-SWA MRVKVEGLKELEAQLARLSKGAARGALRRAGVKSLQPMAEIARGLAPKDTEELSKSITVAAKAVGGGAEIGKAEFGAVMRAGGSVGEARAALRDARRAATDAGNLSAIELYMGPIKASKRAAIKAVVQEFGSIKQAPQSYMRPAWDQDREALLGRLKVEIWAEIRKAIVRAEKRAARAAAKAAGG >NZ_CP051468|2148778:2172343|2154371_2154620_+|WP_017140109.1|DBSCAN-SWA MRRVYWYLMQRFYEARSFRAFCNHLHFKKQSRKYSSRLAETWLHRDDESEEHSPCRLPSCWWLVPSVLAGSAILATAVGSLL >NZ_CP051468|2148778:2172343|2148778_2149720_-|WP_029534501.1|integrase|DBSCAN-SWA MPEDDPRFVAAWLAEEQRAPTVRSRSQAGTIAAACEAYLGSRSYRELSDGYRPVIRRHVERIKEQGEKALLKDLLPRHIRADLEPLSPPVASSRLKAWRKLAEFWLLKGMLDADPSEGVKRKKMGKVAGFTEWTAEDLEAFRARWPIGTAQRLALELYQWTGARCVDVIRLGPQMVGRDGVLTYVQQKTGNPAHVPWTCPAFGLEEQRADLMRCLPVGSPVYMLTEYGKPRTQKGVSQWFSAAAREAGLESRTAHGLRKYRMNALAEHGASVLVMQSWVGHTTLLEVQEYTRRADRRRVISGGHPVKRDPAGA >NZ_CP051468|2148778:2172343|2160528_2161779_+|WP_011337209.1|portal|DBSCAN-SWA MSLITRLAARLPAQVRSAAYDIEKERRLSLSDGPGWSRLFGRTSAAGKPVTLDKAMQLSAVWACVRQTAMAISALPLAVYRKEGDGSRSSVDDRLAEVLSVSPNLDQTALEHWEGQVAWLMVNGNCYSERTDIGGRLSSLQPLPANMTRPIRNSDGELFYQILDRGKSEVLPRDKVFHVKGFGFGGDMGLSAINFGVQTMGTALAADESAGKLFSNGMQISGVLKAGQTLTAEQRQQMRTMLEAYRSSDNAWKVMVLEAGMSFEALTLNPEDAQMLETRRFQVEDICRWFGVPPIVIGHAGEGQTMWGSGVEQILIAWMELGLNPVLRRIEKRIQKDLMPRGERLSRYAEFNREGILQMDSKAKSEFLTKLVSNGIMSRNEAREKLNLSRRDGGDELTAQTAMAPLSDLGQKENQA >NZ_CP051468|2148778:2172343|2156630_2157293_+|WP_011337213.1|DBSCAN-SWA MIGRAVDLRRAEPVAPARREMSIERALVWAFQTECASVDFAEEAAPDSYRRTVSSAWLVAQRGAIGCRIDGGGHSLPADDAEMIASAVAALPPEHGGRGMAVKIATLARAGLRPDWMPDARPRCVPRDWRRSKHGMFARIEVVDEIVTVHRGRRVVRPVEACPVTYAPSHAQIAAARREWLTWWGALLHLGHELRTLDILSTVQLTADMPPMSPWREQGR >NZ_CP051468|2148778:2172343|2154117_2154375_+|WP_002719227.1|DBSCAN-SWA MRKNLAQSDVHARASRKRFAGLLWRAFPASSERDLAAKAAPVLGVSERQVRNWLQCENDAALRHVFAVMTIACGEEIFSIIEGRG >NZ_CP051468|2148778:2172343|2166952_2167198_+|WP_023003549.1|DBSCAN-SWA MISILEGDYKRRRREIEDRRVLQHELATLVAFAFHQPGRMPDYKPPAEAGAPPAQKADAGWDTDHERVRGLLIGMALRGRG >NZ_CP051468|2148778:2172343|2165661_2166042_+|WP_011337202.1|DBSCAN-SWA MEEALRALLLVSSGVTALAGRRVNFGRHPQGDPLPALVLNTISDREGLTVSGPDGVQQARVQIDCYAESYGAAKQLSRAVRAVLHGHSGGGFRGVFLDGARDLREPGDDTGRPFRVSLDFLTIYSA >NZ_CP051468|2148778:2172343|2161775_2162615_+|WP_011337208.1|protease|DBSCAN-SWA MSMRDLPKAEVSAKPGIRSDVNVKALQRWNPDVRSAAEEGDASISILEVIGQDFWGDGVTAKRISGALRAIGDRDVVVNINSPGGDFFEGLAIYNALREHPAKVTVRVLGVAASAASVIAMAGDEIRIARAGFLMIHNTWVLAAGDRHALTEVAQWLEPFDAVSADIYAARSGIDAKKISAMLDRETWISGGQAVEQGFADGLLSADELDLSDADEGRASARAERKFDVLASKAGVSRSEARELLAALKGSKPGAAPASMHDAAVAAEVRNLLNFAKTI >NZ_CP051468|2148778:2172343|2158819_2160529_+|WP_011337210.1|terminase|DBSCAN-SWA MIHANDWSTACPDWAERLASGRSLIPDLPLFRPVADKALRIFKSLRVPDMIGTPTLGEVCEEWIFDLVRAIFGAYDPETRRRMIRQFFVMIPKKNGKSSISAAIIVTAVILNERPLAEAILIAETQKISDIAFRQAAGIIRLDPRLDKEKGGIFDVKDHSKTIVHMNTGAVIRILSADGDVITGSKAAYILVDETHVLGHKSKADAIYLELEGGLASRPEGFLLEITTQSKVQPHGEFKRRLKLARDVRDGKVSLPILPVLYELPAKMQAAKAWMDDSTWGLVNPNLERSVSIDFLREKFVEAQEGGDDKLALFASQHLNVEMGIGLHSDRWVGADYWLKNAEPGLTYAQLLDQCEVVIFGGDVGGADDLFGLTAIGRHRETKIWLTRSWAWCVRDVLKNRKEIAPRLEELERAGDLRITDGAAEHVEEAVAIICEARDAGRLPDGVCIGLDPYGVAALVDALEAEGFDPSTRIAPIGQGYKLNGAVKGLERRLLDGRIRHAGQPMMTWCVGNAKAEQRGNNVYITKEAAGVAKIDPLIALFTGAVLMDTNPQAPASLDDFLSDPVLVI >NZ_CP051468|2148778:2172343|2164768_2165101_+|WP_011337204.1|head|DBSCAN-SWA MKAEKLDRRIQFRRAALVDDGFAEVETWSDHGSPVWAARADLSDGERWRAAEVAAGVTTRFTVRWSAFAAAINPKDRLVCEGREFDITGIKEPPETRRQWIEITAAARID |
28 | Rhodobacter_phage(23.08%) | tail,protease,terminase,integrase,portal,capsid,head | attL 2138502:2138519|attR 2168673:2168690 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
2438126 : 2447173
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NZ_CP051468|2438126:2447173|DBSCAN-SWA GTCAGGCGTCGATATTGGCGAACTGAGTGCCCGCCTTGCTGTCCTTCTTCGGCCCACCCCGGTCCACGCCGAGATAGAGCACGATGTTCTTGGCCATGTAGACCGAGGAATAGGTCCCGATCACCACGCCGAAGGTGATGGCGAAGACGAAGCCGCGGATCACGTCGCCGCCGAAGACGAGAAGCGACACGAGCGCGATGAGTGTGGTCATCAGCGTCATGATCGTCCGGCTCAGCGTCTCGTTGACCGAGAGGTTCATCACGTCGCGGAGCGGCATGGTCTTGTACTTGACGAGGTTCTCGCGCAGCCGGTCGAACACCACCACCGTGTCGTTGATCGAATAGCCGAGAACGGTGAGGAGGGCCGCGACCGTCGTCAGGTCGAACTTGATCTGGAACAGCGCGAAGACGCCGATCGTCACCAGCACGTCGTGGATCAATGCGGCCACCGAGCCCAGGGCGAACTGCCATTCGAACCTGAGCCAGATATAGACGGCGATGCCCGCGCAGGCGGCGGCGACCGCCAGGATCGCCGACCGGATCAGCTCGCCCGAGACCTTCGGCCCCACCGACTCGACGGACGGGAAGGTGATGGACGGATCGACGGTCTTCAGCGCCTCCTCGACCTGTCCGATCTGCTCGGGCGTGATCGACTGGGTCGCGTCCTGTGCGCCGATGCGGACCATGGCGACGTGCTGATCCGCGCGGAAGCCGGGATCGAAGACCTCGGTGATCGAGATGTCGCCAAGATCCTGCCCCTCGAGGGCGGCGCGATAGGCGGCCACATCCACGGCCTGTGTCGATTCCGTCCGGATCGTCGTGCCGCCGCGGAAGTCGATCCCGAAATTCAGCCCCAGCGTCAGCCAGGCCACGAGCGAGGCCGCCATCAGGAAGACCGAGAAGCCGAAGGTGACCGGCGCGGCCCAGAAGAAGTCGATGTTCGTCTTTTCGGGGCAGAGTTTGAGGCGGAAAGCCATGGGAGCTCCTCAGACGACGATGGTGCGCGGCCGCCGCCAGGCGAACCAGGTGGCGATCAGGATCCGGGTGACGTAGACGGCCGTGAAGACCGACGTGAAGATGCCGATGGTCAGCGTCACGGCAAAGCCGCGCACGGGCCCGGCCCCCACGAGAAACATGATGAGTGCCACGAGGAAGGTGGTGACGTTGGCATCGACGATGGCGGACAGGGCCTTCTCGAAGCCGAGTTCGATGGCGCGGCCCGGCTGCTTGCCCTGCCTCAGCTCCTCGCGGATCCGTTCGTAGATCAGCACGTTCGCATCCACCGCCACGCCGATGGTCAGCACGATGCCCGCGATGCCCGGCAGCGTCAGGGTCGCCCCCACCACCGACAGGAGCGCGATGATCGCCAGCATGTTGATGGCGAGCGCGATGTTGGCCAGCACGCCGAACCAGCCGTAGCTCGCGATCATGAAGGCCACGACGGCGATCATGCCCACGATGGCGGCCGTCCGGCCGGCGTCGATCGAATCCTGCCCGAGTTCCGGGCCGATGGTCCGTTCCTCGAGGAAGGTCATCTCGGCCGGCAGCGCGCCTGCGCGCAGGAGGACGGCGAGCTCGGTGGATTCCTCGACCGAGAAGTTGCCGGTGATGATGCCGGAGCCGCCCGCGATATGCGCCTGGATCACCGGGGCCGAGATCACCTCATTGTCGAGGACGATCGCGAAGGGCTTGCCGATGTTCTGCGCCGTATAGTCGCCGAAGGCCCGCGCCCCCGAGGGATTGAAGCGGAAGCTGACCGCCGGGCGGTTGTTCTGGTCGAACGAGGGCTGGGCGTCGACGAGCTCCTCGCCGGTCACGACGGGCGAGCGGTCGAGGACGTAGAAGATCCCCTGCTCGTTGGCCGCCGGCAGCACGAGATCGCCGCTCGCGCTGCTCTGCGCGTCGGCGGTGCGGCCGAGCACCGGATGGAAGGTCAGCTTCGCCGTCGTTCCGATGAGCGACTTCAGCTCGGCAGCCGAGCCGATGCCCGGCACCTGGATCAGGATCCGGTCCTCGCCCTGGCGCTGGATGGTGGGCTCGCGCGTGCCGACCTCGTCGACACGGCGGCGGATGATCTCGAGGCTCTGCTGCACCGTCCGGCTGTCCATCGCCGTCTTCTCGGCGTCGGTCAGGCGGATCACCAGCACGTCGCCCTGGCTCGAGATCTCGATGTCGTTCTGCCCCATGCCCGTGAAGGACACCACGGGCTGGGCCAGCCTGCGCGCGGCCTCGAGCGCCGCCTGCATCCCCTCGGGGCGCGAGATCGCCACCCGAAGCTCGCCCGCCGGCGCGGTCTGCCGCCGGATCGAGCCCACCTGATCGCGCAGGCCGCGGAGCGCGTCGCGCACCTCCGGCCAGAGCCCGTCCATGCGCGCGGCATAGACGTCCTCGACCCGGACCTCGGCCAGAAGATGCGCGCCGCCGCGCAGGTCGAGGCCGAGGTTCACGATGTTCGAAGGCAGCCATTCGGGCCAGAGCCCCACCGCCGCCGTCTGCTCGGGAGTGGCGGTTCCCCCGGCCTTCTCGATGGCGGCGACCGCATCATTGTGACTTTCGACGCGGCTGTAGAAGAAGTTGGGCGCGGCATAGAGGATCGCGAGGGCACAGATGCCCCAGATCAGAACGCGCTTCCAAACGGGGAAATGGAGCATCTGGACCCGATCCGATCAGGCGGCAGCAGGTTCGGTCTTGGACATGACCTGAACGATGGTGGATTTCATCACGCGGACCTTCACGCCATCGGCGATCTCGACATCGACGATGCCGTCCTCGTGGACCTTCACCACCTTGCCGACGATGCCGCCGCCGGTGAGAACCTGATCGCCACGGCGGAGCGCTTCCACCATAGCCTTGTGTTCCTTCAGTTTTTTCTGCTGAGGACGGATCAGAAGGAAATACATGATCACGAAGATCAGGATCAGCGGCACGAAACTGGTGAAGGCGCCGGCAGCACCGCCAGAAGCGGCCTGGGCGAAGGCGGGGGTCACGAACATTGTCGTCCTCTTGGATTCACCCGGGGATCAGGCCCGGACTTTGGCGGGGCGCACCCTAGCCGCGGGGTGCCGCGATGGCAAGCAAGCCATGGGCGCCCATATATGCACGCGACGCGCGGTTCCAAGCCCGGGATGAACCGGCCCGGGCGGGCGCAGGGTCCGATCTCCCGACATGCCGGCCGCCGGCCGGAGACGGGGCCAACGCAGGGAACAGTCGCGCGCCCCTCCCGTTTCCTCGGCTGAACCCGGTGGCCGGAAGACCCGAGCCGCCCCATCCGACGGAGGAGACACATGTCGAGGACCATCGCCATCGCCGCCGCGGCCCTTCTGGCCGCCCCCGCCCTTGCCCAGGACGTTCCGGCGCGAGCCGAGAATGCCACCGCCGCGCCCGGACAGCCGGGCTTCCCGGAACGGACGGCCACCTTCATGTCGGTCGAGGGCACGCCTCTCGGGAAGGCCACCATCGCCCCCCTGCCCCACGGTGTTCTGTTCACGCTCGACCTGCAGGGGTTGCCGCCGGAGAGCTGGCTCGGGTTTCACATTCACGAGACCGGCGAATGCGATACGGGCACGGGCTTCAAATCCGCCGGCGGTCACTTCTCGATCGCGGAAACGGAGCACGGGCTGATGGTCGAGGCCGGCCCCCATTCGGGCGACATGCCGAACCAGTATGTCGGCGCCGACGGCATCCTACGCGCGCAGGTCTTCAACACCTTCGTCCAGCTCGGCGGCGAGAATGCCTCGGATCTGACGGGGCGCGCGCTCATGCTCCATTCCGGGCCCGACGACTATGTGAGCCAGCCCGCAGGCGATGCCGGCGACCGGCTCGCCTGCGCTGTCATCGAATAGGCCGCAGGCAAGATTTCGGTTCACCTGACCGGAGGTATCGGGCATTAGAGGCCCGATCCTCCGGCACCGAAGGGCCGGGGGCTCGCCCTTCAGCCAAGGTGACCCCATGCACGACATCCGCATCATCCGCGAGTTTCCCGACCGCTTCGACACCGCGCTCTCGCGGCGCGGGCTTTCGGCGCTCTCGGGCGAGATCCTCGCCATCGACGAGGCGCGGCGGTCGAAGATCCTCGCCGCCGAGACCGCGCAGGCCGAGCAGAACCGGGCCTCCAAGGAGGTCGGTGCGGCCAAGGCCCGCGGCGACGAGGCCGAGTTCGAGCGGCTGCGCGCCCTCGTGGCCGAGAAGAAGGCCGAGGTGGCGCGGCTGAACGAGGAGGCCGCGGCCGAGGATGCGCGCCTGCGCGACATGCTGGCGGCGATCCCGAACCTGCCGCTCGACGAGGTGCCGGAGGGCCCCGACGAGAGCGCCAACGTCGAGATCCGCCGCTGGGGCACGCCGCGCGGCTTCAACTTCGCGCCGAAGGAGCATTTCGAGATCGCGGGCGTGAAGCCGGGGATGGATTTCGCCACCGCCGCCAAGCTCTCGGGCAGCCGCTTCGTGATGCTGAAGGGCGCGGTGATCCGCGTCCACCGCGCGCTCGCGCAGTTCATGCTCGACACCCATGTGACCGAGCACGGGCTGACCGAGACCTGGACGCCGGTTCTGGTCAAGGACGAGGCGATGTTCGGCACCGGCCAGCTGCCGAAATTCTCCGAAGACAGCTACCAGACCACGAACGGCTGGTGGCTGATCCCCACCTCCGAGGTGACGCTGACCAACTCGGTCGCGGGCGAGATCCTCGACGAAGCGCAGCTGCCGATCCGCATGACCGCCCATACCCAGTGCTTCCGCTCGGAGGCGGGCAGCGCGGGCAAGGACACGTCCGGGATGCTGCGCCAGCACCAGTTCGAGAAGGTCGAGATGGTCTCGATCACCACGCCCGAGACCTCGCTGGCCGAACATGAGCGCATGACCGGCTGTGCCGAAGCGATCCTGCAGAAGCTGGGCCTGCCCTACCGCACCATCGTCCTCTGCACCGGCGACATGGGCTTCGGGGCGACCAAGACCCACGATCTCGAGGTCTGGCTGCCCGGGCAGAACACCTACCGCGAGATCAGCTCGGTCTCGGTCTGCGGCGATTTCCAGGCGCGGCGCATGAACGCCCGCTTCCGCCCGGCAGGCGGCGGCAAGCCCGAGTTCGTCCATACGCTGAACGGCTCGGGCCTCGCCGTGGGCCGCTGCCTGATCGCGGTGCTCGAGAACGGACAGCAGGAGGATGGCTCCGTGGAGCTGCCCGAGGCGCTGCATCCCTATCTCGGCGGCAAGACCCGCATCTCGGCAGACGGCACGCTGGAATGAATGAGGATCGGGGGCGGCCCGCTGCGGCCGCTTCCGAGGCATGAGGGCAGAGAGCCCTCGGCCGCTTCCGACGCGTGAACGCAGAAAGCTCTCGAGACGGCCGTCGGCGGCGTCAGCCGTTGCCCCAGATGACCTTCACATAGTTCTGCGTCTCGCGGTAGGGCGGGATGCCCGAATGTTTCTCGACCGCCTGCGGCCCGGCATTGTAGGCGGCAAGCGCGAGCTGCCAGGTGCCGAACTTGTTGTACATCATCTTGAGGTACCGCGCCCCGCCCTCGAGATTCTGATGCGGGTCGTTGATGTTCACCCGCAAAAGCTGCGCCGTGCCCGGCATCAGCTGGGCCAGCCCCGTGGCGCCCTTCACCGAGACCGCGCCGGGGTTCCAGCCCGATTCCTGCTGCACGAGCCGGAGATAGAGATCCTCCGGCACGCCATGCTTGCGCGCCGCGGACTTGGCCGATTCGAGGAACTCGCCCTTGTAGCGGCCGTTGTAGCGCGCCGTCGGCAGAGCCGCCGCCAGCTTCTCGACCTTCGGCTTCAGCCGGGTGGAGCTTTCGTACTGCTGCGACAGGCGCCCGTCGAGAAGTCGCGTCTGCGACTTGAAGATTCCCGCCCGGTTCATCGTGGATCGCTGGATCTTCAGCCCATCGGCCAGACCCGCGGTCGGCCCCGAGGCAAGACCCAAGCCCACGACCGCCGCGCCCAATGCACCGATCAATCGCCGCATGAAAACAACCTTCATCTGTCCCGCCCGCACATTACACGGATTGCCGGCAAGTTCGCCACCCTTTTGAACACAACCGGGGCGGGTGAGGCCGGGCGCCGCGCGGGTCCCGGCCCGGTGCGCCCGTCCGGCGTTGCGGGTGCGGCCCGGTCAGTGTAGCCATTCCGTTAACCACATCCGGGGCATCCTTCCCCGGCCACAGGATTCGGAGGAAGCATGGCGGGCTCGGTCAACAAGGTCATCATCATCGGCAATCTCGGCCGCGATCCCGAGGTGCGCAGCTTCCAGAACGGCGGCAAGGTGGTGAACCTCCGCATCGCCACGTCCGAGCAGTGGCGCGACCGCGCCTCGGGCGAGCGGAAGGAGCGCACCGAATGGCACTCGGTCGCCATCTTCGATGAAAACCTGGCGCGGGTGGCCGAGCAGTATCTGCGCAAGGGCTCGACCGTCTATATCGAGGGCCAGCTCGAGACGCGGAAGTGGCAGGACCAGTCGGGTCAGGACCGCTACTCGACCGAGGTCGTGCTGCGCCCGTTCCGCAGCTCGCTGACGATGCTCGGAGGGCGCGGCGAAGGCGCGGGCGCCGGCGGCGGCATGGGCGGTGGCGGCTACGAGGATCGCGGCGGGCCGGACAACTACGACAATTACGGCAGCGGCCCCCGCGGCGGCGCCTCCTCGGGCGGCGCCCCGAGTGGCGGCGGCCGCCGGAACGACCTCGACGACGAGATCCCATTCTAAGCCCCCCGACCCGACAGGCCCGGACGGATCCGGCGCCGGCCGCGCCACGCTGGGGCCGCCTGGCCTTCGCTGTCCGAGCCCGAAGCCCGGAGCGGCCAGCCGCCGCTCCGAGCACGCCGCGCCTCAGCCCCGCGCCTCGGGATCGAAGCGCAGAAGACGCAGCGCGTTCAGCGTCACCAGCACGGTGGCGCCCGTGTCGGCGAGGATCGCGATCCAGAGCCCGGTGATCCCGAGGACCGAGGTCACGAGGAACACTCCCTTCAACCCCAGCGCCACCGCGATGTTCTGGCGGATGTTGCGCATGGCGGCGCGGGCGAGGCGGATCGTGGCCGGCACATCCGTCACCCGGTCGCGCAGGATCGCCGCATCCGCCGTCTCGAGCGCCACGTCGGTGCCCGAGCCCATGGCCACGCCCACGCTCGCGGCCTTCAGCGCGGGCGCATCGTTGATGCCGTCGCCGATCATCATGACGCCCCCCTGCCCCGCCATCTCGCGGATGGCGGCGAGCTTGTCGTCGGGGCGCATCTGCGCGCGGAAGCCTGTGCCGAGCCGGGCCGCGATGGCCTCGGCGGTGCGGGCATTGTCGCCGGTGAGGATCGTGGCGCTCACGCCCATCCGGCCCAGCTGCCGCACCGCTTCGGCCGCATCCCCACGCGGCTCGTCGCGCAGCGCCAGAAGCCCCAGCGTCCGACCCTCGCGGAAGAGGACCGTGACGGTCTTGCCCTCCGCCTCCAGCGCCTCGGCGCGGGCGCGGGTTCCGGCATCGAGCACGCCCCTTTCGGCCGCGAAGGAGGGACTCGCGACCCAGAGGTCTTCCCCCTCGACACGGGCCTCGGCCCCGCGCCCGGCGAGGCTGCGCGCGGCCTCGGCCGGCAGGTCCGGCGCGCCCGCGGCCTCGGCCCGTGCAAGGATCGCCTGCGCCAGCGGATGGGCCGAGCCACGCTCGACCGCGGCGGCCAGCGCCAGCACCCGGACCTCGGACTCGACGCTCAAGGTCACGACATCCGTCACCTCAGGCCGCCCGCGGGTGAGCGTGCCGGTCTTGTCGAACGCCACCTGCCGCACGGCGGCCGCGGCCTCGATCACCGCGCCGCCCTTCATCAGGAGACCGCGCCGCGCGCCCGAGGCCAAAGCCGAGGCGATGGCCGCCGGCACCGAGATCACCAGAGCGCAGGGGCAGCCGATCAGGAGGAGCGCGAGACCGCGATAGATCCAGGTGGACCAGTCGGCCCCGGCGGCGAGCGGCGGCAGCACGGCCACGGCCACGGCCAGCCCCACGATCAGCGGCATGTACCAGCGGCTGAACCGGTCGATGAACCGCTCGGTCGGGGCACGCGCTTCCTCGGCCTCCTCGACCAGGCGGACGATGCGCGCGATCGTGTTGTCCTTCGCCGCGCGCGTGACGCGGATGCGCAGGGCCGCATCGGTCGCGACCGCACCCGCGAAGACCGCCTCGCCGGGGCCCCGGGTCACGGGGATGCTCTCGCCCGTCACGGGGCTCTCGTCGAGCCCGCCCTGCCCCTCGAGGATCTCGCCGTCGGCCGGGATGCGCTCGCCCGGCCGGACGCGGACGATCTGGCCGGGTTGCAGGCCGGCCGCGGGCACCTCCAGCGTGCGTCCGTCCTGCTCGAGCCGCGCGGTCTCGGGCACGAGGCGCGAGAGGGCGCGGATCCCGTCGCGGGCGCGGCTCGCCGCCACCCCTTCCAGCACCTCGCCCACGGCGAAGAGGAAGATGACCAGCGCTGCCTCCTCCGCCGCGCCAATGAAGAGCGCGCCTGCGGCGGCGACCGTCATCAGCCCCTCGATGGTGAAGGGCTGGCCGAGCCGCAGGGCGGCAAAGGCGCGCCGGGCGATGGGCACGAGCCCGATGAGGCAGGCCAGCGCGAAGGCCACCGGCCCGACCGGCCCGGGCACCAGCGCCTCGACGGTCCAGGCGCAGGCCAGAAGCACCCCGGTCAGCAGGACGAGACGACCCTTCGCGGTTCGGTACCAGGCGCCCGGCTCGGCTGCCTCATGGTCGTGGGAATGGCTGTGCGCGTGCGGCGCGTCTGTCCCGCAGGCGCCGCAGCCGCAGGCCGCTTCGGCCGCCTCGGGCAGGACGAAGGGCTGCGCCTCGGCCCGCGGCGCGATGCCGAAGCCGAGCTTGCGCACCGTGGCCTCGACCGCCTCGGGGCCGCCCTGCCCTTCGTCGAGCGAGAGCTTCAGCCGCTCGGACATGATCGCCACCTCGACGCCGCTCACGCCCGGCAGGCGCCCCACCGCATCGCGGATCCTGCCCGCGCAGGAGGCGCAATCCATCCCCGTCACGCGCCATTCGTGCACCGCTCCACTCGTGTCGTTCGTCAT
Protein sequences of DBSCAN-SWA_6 >NZ_CP051468|2438126:2447173|2438126_2439101_-|WP_002722650.1|DBSCAN-SWA MAFRLKLCPEKTNIDFFWAAPVTFGFSVFLMAASLVAWLTLGLNFGIDFRGGTTIRTESTQAVDVAAYRAALEGQDLGDISITEVFDPGFRADQHVAMVRIGAQDATQSITPEQIGQVEEALKTVDPSITFPSVESVGPKVSGELIRSAILAVAAACAGIAVYIWLRFEWQFALGSVAALIHDVLVTIGVFALFQIKFDLTTVAALLTVLGYSINDTVVVFDRLRENLVKYKTMPLRDVMNLSVNETLSRTIMTLMTTLIALVSLLVFGGDVIRGFVFAITFGVVIGTYSSVYMAKNIVLYLGVDRGGPKKDSKAGTQFANIDA >NZ_CP051468|2438126:2447173|2441408_2441966_+|WP_011336992.1|DBSCAN-SWA MSRTIAIAAAALLAAPALAQDVPARAENATAAPGQPGFPERTATFMSVEGTPLGKATIAPLPHGVLFTLDLQGLPPESWLGFHIHETGECDTGTGFKSAGGHFSIAETEHGLMVEAGPHSGDMPNQYVGADGILRAQVFNTFVQLGGENASDLTGRALMLHSGPDDYVSQPAGDAGDRLACAVIE >NZ_CP051468|2438126:2447173|2443477_2444092_-|WP_011336991.1|DBSCAN-SWA MRRLIGALGAAVVGLGLASGPTAGLADGLKIQRSTMNRAGIFKSQTRLLDGRLSQQYESSTRLKPKVEKLAAALPTARYNGRYKGEFLESAKSAARKHGVPEDLYLRLVQQESGWNPGAVSVKGATGLAQLMPGTAQLLRVNINDPHQNLEGGARYLKMMYNKFGTWQLALAAYNAGPQAVEKHSGIPPYRETQNYVKVIWGNG >NZ_CP051468|2438126:2447173|2440790_2441117_-|WP_002722646.1|DBSCAN-SWA MFVTPAFAQAASGGAAGAFTSFVPLILIFVIMYFLLIRPQQKKLKEHKAMVEALRRGDQVLTGGGIVGKVVKVHEDGIVDVEIADGVKVRVMKSTIVQVMSKTEPAAA >NZ_CP051468|2438126:2447173|2444305_2444827_+|WP_011336990.1|DBSCAN-SWA MAGSVNKVIIIGNLGRDPEVRSFQNGGKVVNLRIATSEQWRDRASGERKERTEWHSVAIFDENLARVAEQYLRKGSTVYIEGQLETRKWQDQSGQDRYSTEVVLRPFRSSLTMLGGRGEGAGAGGGMGGGGYEDRGGPDNYDNYGSGPRGGASSGGAPSGGGRRNDLDDEIPF >NZ_CP051468|2438126:2447173|2442072_2443365_+|WP_009563716.1|tRNA|DBSCAN-SWA MHDIRIIREFPDRFDTALSRRGLSALSGEILAIDEARRSKILAAETAQAEQNRASKEVGAAKARGDEAEFERLRALVAEKKAEVARLNEEAAAEDARLRDMLAAIPNLPLDEVPEGPDESANVEIRRWGTPRGFNFAPKEHFEIAGVKPGMDFATAAKLSGSRFVMLKGAVIRVHRALAQFMLDTHVTEHGLTETWTPVLVKDEAMFGTGQLPKFSEDSYQTTNGWWLIPTSEVTLTNSVAGEILDEAQLPIRMTAHTQCFRSEAGSAGKDTSGMLRQHQFEKVEMVSITTPETSLAEHERMTGCAEAILQKLGLPYRTIVLCTGDMGFGATKTHDLEVWLPGQNTYREISSVSVCGDFQARRMNARFRPAGGGKPEFVHTLNGSGLAVGRCLIAVLENGQQEDGSVELPEALHPYLGGKTRISADGTLE >NZ_CP051468|2438126:2447173|2439110_2440775_-|WP_002722648.1|DBSCAN-SWA MLHFPVWKRVLIWGICALAILYAAPNFFYSRVESHNDAVAAIEKAGGTATPEQTAAVGLWPEWLPSNIVNLGLDLRGGAHLLAEVRVEDVYAARMDGLWPEVRDALRGLRDQVGSIRRQTAPAGELRVAISRPEGMQAALEAARRLAQPVVSFTGMGQNDIEISSQGDVLVIRLTDAEKTAMDSRTVQQSLEIIRRRVDEVGTREPTIQRQGEDRILIQVPGIGSAAELKSLIGTTAKLTFHPVLGRTADAQSSASGDLVLPAANEQGIFYVLDRSPVVTGEELVDAQPSFDQNNRPAVSFRFNPSGARAFGDYTAQNIGKPFAIVLDNEVISAPVIQAHIAGGSGIITGNFSVEESTELAVLLRAGALPAEMTFLEERTIGPELGQDSIDAGRTAAIVGMIAVVAFMIASYGWFGVLANIALAINMLAIIALLSVVGATLTLPGIAGIVLTIGVAVDANVLIYERIREELRQGKQPGRAIELGFEKALSAIVDANVTTFLVALIMFLVGAGPVRGFAVTLTIGIFTSVFTAVYVTRILIATWFAWRRPRTIVV >NZ_CP051468|2438126:2447173|2444950_2447173_-|WP_011336989.1|DBSCAN-SWA MTNDTSGAVHEWRVTGMDCASCAGRIRDAVGRLPGVSGVEVAIMSERLKLSLDEGQGGPEAVEATVRKLGFGIAPRAEAQPFVLPEAAEAACGCGACGTDAPHAHSHSHDHEAAEPGAWYRTAKGRLVLLTGVLLACAWTVEALVPGPVGPVAFALACLIGLVPIARRAFAALRLGQPFTIEGLMTVAAAGALFIGAAEEAALVIFLFAVGEVLEGVAASRARDGIRALSRLVPETARLEQDGRTLEVPAAGLQPGQIVRVRPGERIPADGEILEGQGGLDESPVTGESIPVTRGPGEAVFAGAVATDAALRIRVTRAAKDNTIARIVRLVEEAEEARAPTERFIDRFSRWYMPLIVGLAVAVAVLPPLAAGADWSTWIYRGLALLLIGCPCALVISVPAAIASALASGARRGLLMKGGAVIEAAAAVRQVAFDKTGTLTRGRPEVTDVVTLSVESEVRVLALAAAVERGSAHPLAQAILARAEAAGAPDLPAEAARSLAGRGAEARVEGEDLWVASPSFAAERGVLDAGTRARAEALEAEGKTVTVLFREGRTLGLLALRDEPRGDAAEAVRQLGRMGVSATILTGDNARTAEAIAARLGTGFRAQMRPDDKLAAIREMAGQGGVMMIGDGINDAPALKAASVGVAMGSGTDVALETADAAILRDRVTDVPATIRLARAAMRNIRQNIAVALGLKGVFLVTSVLGITGLWIAILADTGATVLVTLNALRLLRFDPEARG |
8 | uncultured_Mediterranean_phage(50.0%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_7 |
2569000 : 2588747
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >NZ_CP051468|2569000:2588747|DBSCAN-SWA CCTATTTGGCGCGTTTACAGGAATTTCCGGGGTCGTTTACATCGTTCCGTTCTTGTTCCGGCCCCATCACGAGGCCGCGCCGGCTGGCGGCGCGGGTGTAAGCCTCGGCTTCTGAGAGGCTCGCATGCCCGCCCCAGGCCATGATCGCATGGGCAGGACCGCCCGCTTCCGCGATGGCCGTGAGCCGGAATTTGCGCAAGCCGTGCGGCGTGCGCCCCTCGATCCCCGCCTCGGCCGCGGCAGCGGTGATGAAGTTCGACAGACCCTTCACCGACCGCGCCTTCCCGTAGCTCGTCTCAAGCAGGGTGAAGCCGGCCGTGGCCATCACCGCTGCGCGCACCGTCTCGCGCTCCTCTTCCCACCGGAGCGCCCACGAGGGGAGCGCGCAGGTCCAGGGCACATAGGCGGGCCCTCCGGTCTTCTGCTGGCGGAAGGTCATGAGCCCGTCGCTGCCGAGATGGGTGCGCGTCATGCGGCAAGCGTCCGAGACGCGGCAGCCGGTCCAGGCGAGCAGCTCGAAGGCTGCGCGCTGGCGCGTCCCCACCGCCCACCGCGCCCGGAAGGCCGCCACCTCGGCCGTCGTCCACGCGACGTGGCCGGGCGATTTCGTCACGATCTTTCCGATCTCGTGGCTCGGGTCGCTCTCGATGATCCCGTCCGCCTTGGCCTGCGCGAAGATGAGCCGCCAGCATTTCAGCGACACATTGCGCGGGTTCGGATCGAGCTTGCCGAGATCCATCTCGACATGCTTCTGCCGCAGGCCCCGCAGCGGCAGGGCGCCGTGCTCCGTCTCGATTCGCTCGAGATGGAAGCGCAAGTTCAGCCGATAGCTCGCGGCAAACCCCTTCCACCTCTTCGACGCGCGGAGCGCTCGCACGACCTTCGCGACCGACTTGTCCGCAACGGGCGGCGCCAGATCTGGCTTCGAGGCCTCGGCCTTGGCCCAAGCCGCGATGAAGTCGGGATGGGTCTCGGGCAGATCCGGCAACCGGGTTCCGGTCGGCCGATGATAGCAGAGCACCCGGTCGCCACGCCGGATGCGCTGCACCCGGGGCAGCTTCACGCCTGCATCCCGAAATGGCGGTCGCACAGCTCCACCTCCGATTCGAAAGTCTCCCCCTCAACCGGCAGGTCCGAGGCGAAGGCGTCAAGATCGAGCCTGTCATACAGCCGCCGCCCGCCCAGCACCCGCCGCGGCAGGTCAAGGCCACGGAGCGTTGTCGCGCTCACGCCCAGATAGGCCGCCGCCTCGGCCGCGCCGAGAAGCCGCGGCGCGAAAGTTAGGGATCGCCTAGCGCTCATCGTCCTTTCCCCTTCCTCCGAGCCAGCACGGCCCGCCCACCGGCGGCGTCGCGCGCAATTCTTCCTGTGCCCTGCTCATTCCTCGCCCTCCGGCCAGCCGAGCTCGGCCGGCACCCACGCGTCGATCTTCGCGCAGTCCTCGCGGCTGAGGCCAAGGGCCTCGCGGGTGTCGGGATCGGCGAAGAGCGCCTCGAGGCGCTTCGCCTTCTCCCCCTTCTTCAGCTTGTTGAAGCTCTGATGAGCTGGCTCCGCCTCGGCCGCCGGCACGAGCTCATTCCAGAGCCGGTCGAGATAGCCCGCGCTGCAGCGGCCGAGAAATCCCTGGGCGGTGGGCGTCCAGATCCGGCGCACCTCGACCCCGAGCTGGCGCGCGAGCGCGGCCGAGAGGCCGGAGCTGCCGGTGCAGAAAGTCCGCGCGAGGGCCTCGGTCAGGATCTGGTTGCGGTGTTTCTTCCCCTGCGCCCGGAAGGCCTCGAACTCGGCCGGGGTGCCGTCGGGGCCGAGGCTCGCATTCGGCTCGAGCCGTGCCGCCAGCCGCGGCGGGTAGCTGGTGCCTTCGCCCTTCTCCGGCACGATGGGCTGGTCGGTGGGTGAGATCGCGAGCGGGCGCGCCCAGGGCCTGAGGCCGCCGCCGAGGGACCAGGCGAGAAGGTCGAGCATGAGCTCGGACTGGTCCATCAGCCGCGCCTGCAGCGCCCCGAGCCGGATCCGATGCAGGTCCTCGATCAGCGACTGCGGCAGGTCGGGTGCCAGCTCGGCCGCGCCGCGCGGAACCGCCCGCGGCTGCTGATAGGCCCGCTCGACCGAGAGCCCGTCCCGGCTGCTGCAATAGACGAAGATCCCGGCCGAGGCGCGCTGCGCGTCGGTGAAGCCCCCCTCGCGCCGGGCCTCGAGCTCGGCCAGCCGCGCGCGGCCGGGCTCGTCGAGGGTGTCGTCGGCCTCGCGCTCCTCGAGCGCCGCGAGCTCCGCTTCCTCTCCTTCCGAGAGCTTCCCCGGCCGGGCGTAGAGACGCACCAGCTTCTGCGTGACGGTCCAGGAGACATATTCCTCCGGCACCCATGTCGCCCATTCCCAGCCCTCTTCCGCGCGGATCCGCTCGGCCTCGGCCGCGCCCTTCTCGGCGAAGAGCCGGTCGAGGAGCGCCTCGTCCTCCAGCAGCGTCCGGTCGGCGAAGAGATCGCGCTGAGAGGTTCCACCGGCGGCGAGATAGGCCTCGAGCCCCACGAAGACCGCCCGCCGGTCGGTCGAGGGCACGGTGCCGGGTGTGAGCTCGCGCCGCACCTGCTCGGGCCGCACGTCGCGGCCCCGCACCGACGCGAGAACTTCGAGGCAGCGCTCGCCGCTCGGGGCGAACGTCAGCGCCTTCGCCATCTCGAGCGAGATCTCGTTCGCGCGGAGCGCCGCCAGCGCCTCGGCCGGCAGGTCCGCGAGCTTCAGCCGCCGCTCGACATGAGCCTCGGAGCGGGCAAAGCTGCGCGCGATCCGCGACAGGCTAGCGCCCCGCTCGCGCATCGCGGCATAGGCGCGGACCTCGTCGGCCGGGTGCAGCGCCGAGCGCGCGCTGTTCTCGGTGCCGGCCCAGGCCACCGCTTCCTGCAGGTCGGCCGTCACCTTCACCGGCACCGGATCTATCGGAATGAGATCCGCATGCCGGCTCCAGCCCTCGGCCGCGAGGCGCTGCAGCGCGCGGAGCCGGGTGCCGCCGCCCACGATCTCGATGCCTTGGGGCGTCAGATGGCCGATGAGGTTCTGGAGAAGCCCCGCCACGGCGAGGCTCTCGGCCATGGCCTCGACCTCCTCGGCCGCGATCTCCTGCCGGCTGTTGAGCGGCGAGAGCCGGAGCTCGGCGAGCGGGATGAGGCGCAGATCGCCTCGGGAGATGAAGTCCTTAGCCATGCGGGATGCCCTTCATGCAAATCTTCTCCGGCGCGCCGGAGCGCGGGACAGGGCCCTGCGCGAAAGAGGCGCCGGGCGGATTGAACGGAAGCTGCGCGGATCGCCCGCCGCGCCGGGGTGCGTCAGGCCATGCCGAGCGCGGCCTTGTAGAGCGCGAGGAGGGTCTCCTCCTCGGCGATGTCGTCGGGGGTGCGCTTGCGGAGCGCCACGATCATCCTGAGGATCTTCGGCGCGTAGCCCCGGCCCTTCGCCTCGGCCATCAGCTCCTTCTGCTGGCCGCCGATCTCCTTCTTCTCGGCCTCGAGCTGCTCGTAGCGCTCGATGAACTGCCGCAGCTCGTCGGCCGCGACCTGATAGGCGGCCTCGGCCACGGCGCGGTCCGCCGGGGTCTCCTTCATGGGCGGCTTGCGGATCCGGCCGCTGTCGATGGCCGCGGCCATGCGGTCGAGATCGGCCGCATCGAGGCTCTCGACCGGAACGGGCCCGGGGCGCGGCGCGGGATGCGGGGCGGCCTCCATCACATGCCCCGCATGGCGCTGGCCTGCTGGCGCATGGTCTCGGCCACGTCGGTGACATGGGCCCAGAGCACGGTGGCGACGAACAGGAAGCCGAGGAGCGCGAGCGCGCCCACCAGAAAGCCGGTGAGGTTCGGCCCGAGCGGGCTCGGTCTGCTCGTCCGGAAGGCGGGCCGCGGCCGGGCGCGGTGCAGCGGGCGGCCGGTGGCGACGGCCAGCTCGTGTTCCGTCAGGAGCCGGCTGGCGGCCTCGCGGATCTCGGGCGTCGGCGCGTGCTCGGCACGATGGCGCGCAGCATTCAGATCGCGGGGCGTGAGGGGCAGGATCCGGCTCATGCTCCGAGCGCCTCCCGCGCCTGGCCGAGGATCGCGGCGAGCTCGGCATCGGTCTCGTCGTCGGCGGCGTCGAGCTGCGGGAGCCCCGCCTCGGCCGCCGCGAGCAACCGGCGCAGCTGGGGCTCGGTCAGGTCGAGGAGGGCAAAGCGGGCAACCGGCGCTGCGGCCGGGTAGCGGATGGCGGTGGCGGTCACGACAGCATCCTCCGCCGGTCCGTCGGCTCGTCCTCAATGTCCGAGAGGTACTTGCCGGCGGCGATAGCGCTTGCGAGCGACACATGCGCCGAGCGCAGCCGCAGCTGCAGATGCAGCTTCTGGCCGGGCGTCAGGACGCCGATGCCGTAGTGCTGGTGAGCAATGAAGTCAGGGGATTGCACGAATCTCTCCGTCGTAGCCAAGGGCGCGGGACTGTCGCGCCTCACTAGCCATGCGTCACATATTGCGACACTTGCGTCAAGTCCGTATGTCGCAATATGTGACGCTCATGAAGCCTTACCCTGAGGTCAACGTGGCGTTGACAAGGACGGACCTCGCGGCGCAGATAGAACGCAACAGGAACCGCAAGGGGTTGATTTGTTGTCGGAAGAGTGGAAGCAGCGCATGATCCTAGCCATCCGCCGCATGGGGGAGGATGACCTGATCATGTTCGAGAAGCTGGTCCTTAGGCTTGCGACTTCGCCGCAAGGCCTTCAGCAACGCGTATCAACGCTTCGACATCCTCGCTCGGAAGCAGGCGAATAAGCTCAAGTAAAGCACGCTCTCGCGGGTTGCTTTCGAACAAGTCTTCCGTAGAGACGCCCAGTGCCCCGGCAATCGCATTGAGGCGGATTGTGTTGGCGGGCCGCGTCTCGCGCTCAATCATCGCGAGTTGTGACCGCGACATCCCTGAACGGGCAGCCAGGACTTCCTGAGTCCAGCCGCGAGCTTCCCGTAACTGTTTGATCCGCAGGGCCATAGGTTTGGTATAGAACGCCCATACCCCGAAAGCTCGCCCGGAATGGCGACAACCAGCCCTGATACTCATTGACAGCCCTGTCGCAATGTGTGACGACCAGCGACATGAGCATCCTGTCCGATTACATCAAGGGCCGACCCGAGAAATCCCGGTCCCAATGGGCCGATGAGTTCGGCATTTCAAGGCCGCACCTGCACTGTCTGCTCGAAGGTTCCAGATTGCCAAGCCTCGATGTGGCAACGCGGATCGCCGTGGCGACAGGCGGAGCAGTACCGATCTCTTCCTGGCCGAATATCGCGGCAGTGCTGGCTGCCGCTCAGCATGAGCTACCGGCGAAGTTCGGCTGAAAGCATGGAGAAACACATGACATTCAGGCTTCGCGATGCCGGGCAACCTGAGGAGCGCCACCGTGAAGAACCGCAAGCCGGCTGTTGCGGTTTTGCGTGCCTCCCTCTCCCCTCCTGCTGGTGGCTCGTGCCGTCCGTCATCATCGGATCCTCCCTCATCGCTTGGGGCGTCTGGGCACTGCTCACCGCTCTCGGGAGCCTTCTGTGACCGCCGCCGCGCCCTTGCCTGTACAGGACGCGGCCACTTCGCCGGGGGCTGCGGCCTCCGGCGCTTTTCGATCGAACGGGTGGACCGCGCTGCGACGCCATCCTGCAGGCCGGGCGGATCTGCTGCGGTGGCATGCCGATCCGGCGCTCGTCGCGCGGCACGCGCGCTGGGGGCGGCCGGTCTATCTGGCGAGCCCTTACACGCTCCGCGCGGTCGGCCCGGACGGCCGCTGGTCGCGGGATCAGTCGGAAGCGGCCATGGCCGAGGCGGCGCGCGAGGTCGCGCGGCTCCTCGAGGTGGGCGTCACGGCGATCTCGCCGGTGGAGCTCTCGGCCGCCGCGCTCCATGCCACCATGTTCCCCAGGCTTTGGATTGACCCGTTCAACCCCGTCCTCTGGGAGGACTGGTGCCGCCCCCTTCTCACCGTCTGCGCCGCCGTCGTGGTCCCCGAGATCCGCGGCTGGTCAGACTCCACCGGCATCCGGCACGAGGTCGCATCCGCCCTCACAGCCCAGGTGCCCGTCTTCATCTATGGAGGCCTGCCATGACGAAGCGTGACCGCTTCCGACGCGCCGCGGGAGGTGCCCGTGCCGCATGACCGACTGACCGAGACCGATCCTCCGACCGAGATCGTCGGCGATTTCTGGGAATATCCGCTGGCCTTCGGCGACACGCTCTCGAGCCACGAATGGGTGCCCTTGCACATCAACCGCCTCCTCACCTCCCGCTTCGTCGCCCGGGCGCTGGCCGAGAACCGGCGGGCGGACATCGGCACCGCCCTCCTCCTCTGGTGCGAGGCCTTCCGGCAGGACCCGGCCGGGACGCTGCCCGACGACGATCTCGAGCTCGCCCGCCTCGCGGGCTACGGCGCCGATCTCGAGAGCTGGCGGGCGGCGCGCGAGGGCGCGCTCTACGGCTGGCGCGAGACCCATATCTCGAACCAGGAGGAGGCGCCGGGCAACCGCCGGCTCGGCCATGTGATGATCGCGGGGATTGCCCGCGACATGCACCGCAGGAAGCGCGGGCGCGATCAGGCGCGGACGGAAGGCTCCAAGGCGATGGCGCGGACCCGCGTCCGCAGGAAGCTCGTCGAGATCAAATGCGGCCGCGCGGCCGAGAGCCCGGACGTGGTGAACGAGATTGCCGAATGGCTGGGCCAGCGCGATCTCTTCATCACCTCCGACAACGTGCGGGCCGCGTTCGAGGCGACGCGCGGCGGGCCGAAGGTCGTTCAGCTGACATGACTTAACACTGTTAATCACAGTTATCACTGCAATCGTAACAGTTAATCACTGTGAGATTGCAGTGATCTCATAGGCCCGGAACTGTGATTGCCCTACAGGACAGGACAGGACAGGACCCCACAAAACAGAACCTGACAAAACATCCCTTCTGCAGGGGTGAGAAGATCGGGCGGTGGCGAGACGGCAGGCGTGGCTGGCTGAGAAAGGGATGGCCATGACGAAGGCAGAGGAGCGGGCGCGGGTGAAGGCGCTGGTGGTGGACCGGCTGGAGCAGGCCGGGATGGTGAGGCGGCGCGGCACGGCCGCCGCGGCGCACGAGGCGGCGATGGCGCGGCTCTGCGAGCAGCTGGGCTACATGGGGGCGGAGAACCTGATGACCTTGGCCGAGGTGCTGATCGACAGCGCGGCGGATGGGGTCTGGCCGTCCGAGGTGCTGATCCGGCAGTTCGCGCGCGCCATCGAGGAGCCGCCGCCGGCGGAGCGGCGGCTGGTGTCGAGCTGGCTCGCCTCGGTCGAGGGACCGAAGGCGGAAGCGGGCGGGCATCTGGTCGAACTCTACCGCTGGCTGCTCAAACACCCGCGCCCGCCGCTCGCGATGGACCTGCGCGAGATCCGCGACCAGGCGCAGGACAATGCCCGGCGCTGCGAGCTGATCCGCGACCGGATCGACCGGGAGACCGCGAGCCGGGAGGATCGCGACTGGCTCGAGCTCTATCTCCGCGACCGGGCCCAGGCCCGCGCGCTGGTGGACGCCGGCCGCGGCCGCAAGGAAGGGGCGGCAGCATGATCGGGCTCGGGAAGAAGACGGCGGTCCGGCCGGGCAGGAGCGCGGTGCAGAGGGAGACGTCGGCATGATCGGCAGAGCGGTGGACATGCGGCGGGCAGAGCCGGTGCGGACGAGCGGACAGCAAAAACAACAAGAACAAGGACAGGGGTGCCGCATGAGGGGCGCGAAGGGCCGCAAGGCCATGAAGGCGAAGCTGGTGAGCCTGAAGCTCGCGCCGTGGGATCTGGGGCCGCTGACGCCCGCACAGATCGCGGGCAAGCGGATCGAGGAGGCGGCCGAGGTCGATCCGAAGACGGGCAAGAAGAGCAACCCGAACAGGGTGATCCGCACGAGGCGCGAGACCTGGGTCGGGCGCTATCACCGGCAGGGCAAGCTCTCGGTCGAGCAGGCCAACATCGCGGCCGAGCTCTTCGAGGCGGCCTCGGGCATGCCCGCCCGCGATCCGCTGGCCGCCATCGTCCGCGTCGATATGAGCGGCGATCAGGATCCGCAGGTGGCTCAGGTGGATCGGCGGCGGAAGTTCTTCCGCATGTGGGAGTTGGTCCCGACGTTTGCTCGGCCCGTCATTCAACATGTCGTTCTGGACGATCAGTCGCTCCGTGGCATGCCGGGACGTGTGGACAGCCGATCCGAGGCACGGCAGCTCGACCGCCTTCAGCGAGGCCTGGACGCGCTCTCCGAGGCGTGGCGCTGAGAGCGGCTTGACCTGCCCCAAACAATTCGGCAGATTGCCATCATCGAAGACAAGCGCCCGGCGAGCCCCAAGCTCTCCGGGCGCTTCGCTTTGGAGCCTCCCCTTGATCCGCAAACTCTGCTGCGCGCCCGGCTGCGAGGAGCTCGCGCCGGCCGGCCAGGCACACTGCGCCGACCATGCCGCCGAGGCGGAGGCGCGGGCTCGCAGCCGGAAGGCTCGCGCCAAGGCGGGCGCCGCGGCGCAGGCGGGCGCGGCCTTCTATGCCACCGGCCGCTGGCGGCGGGCACGGGCGCGGTTCCTCGCGGCCCATCCGCTCTGCGCCGACTGCGCGGGGCTCGGGCTCGTGGTGGCGGCGGCCGAGGTCGACCACATCCGGCCGCACCGCGGCGATCCCGGCCTGATGTGGGACCGGGCCAACTGGCAGCCGCTCTGCCGGCCCTGCCACAGCCGCAAGACCGCGCGCGAGGTGTTCCACCCACCGGGGGGCATCGCAAAATCGGAGGGGTCGGCCAGGTAACCGGCGCTCGAACCTTCCTTTTCACGCACGGCGAATTGGCAAAAAAAGCCCACCGTGCAGAGGGATGAGAGGAAGCGGGAGACCTGCCCGCCGCCCCTGCCCCCGCCCCAGCGGATCCACGACGAGCCGATGGCCGCGAGCCCCCGCTCCGCGCGGCGCACAACAGACATGATGGGCCGCCGATCCCGGCGCCCCCGATCCGGCCGCCGATGCGGCGAGGATCGGGCCGGCACGGGCGCAGCCCACAGAACAAGGACAGTCACGATGCGCGGGCAGAAGCCGAAGGTCTCGAACGTCATCCCGATGAAGGGCGATCTCGCGGCGCCCGTGCCGGAGGCTCCGGGCTGGATGAGCGCGGAAGGCCGCGAGGTCTGGGACCGGCTCGCCCCGGTGCTGGTGGCGAAGAAGCGGCTCGAGCCCGCCTTCGAGGATCCGTTCGCGGTCTATTGCGAGGCCGTGGCCGATGTCATCCGCTTCACCGGCGACATCGCGGCCTTCGGCAGCTGGTACGAGGTCGAGACCCGCAACGGCCGCCAGCAGAAGAAGCGCGCCGTCTGGGGCCAGCGGCAGGATGCCATCGCCGTCATGAACCAGCTCGCCGCCCGCTTCGGCCTCACGCCGGTCGACGAGGCCCGCGTCCGCGTCACCGGACAGGGCGACCTCTTCGACGAGATCCTGAAGACCCTCGATGGAGCCGATTGACCATCCGGTCTCGCGCTATGCGCTCGATGTGGTCGAGGGCCGCGAGACCGCCGGAGACCTGGTGCGGCTCGCCTGCCTGCGCCACCTGACCGACCTCGAGACCGGGCGCGACCGCGGCCTCTGGTTCGACTGCAAGGCCGCGAGCCGGGTCCTGAACTTCGCCGAGCTGATCCAGCACACGACCGGGCCGCTCGCCGGGCGGCCGCTCACGCTCCGCCCCTGGCAGGCCTTCCGCCATGGTTCGGTCTTCGGCTGGAAGAAGGAAGGCGGCCTCCGCCGCTTCCGCACCACCTACCATCAGGTGGCCAAGAAGAACGGCAAGACCACCGACACGGCGGTGCCCGCCCTCTTCACCGCGCTCTTCGACCGGGAGGCGGCGCCGCAGGGCTATTGCGCCGCCACCACGCGCGACCAGGCCGGGCTCCTCTTCCGCGAGCTCAAGCGCATGATCCGCGCCTCGCCGCATCTCTCGGCCCTGATGCAGGTCTGGCGCACCTCCATCGAGGTGCCGGCGACCGAGGGGCTGATCGCCTGCCTCTCGCGCGACGGCAACAGCTCGGACGGGATCAACCCGCACTTCGCCGCCCGCGACGAGGTCCACCGCTGGACCGACCGCGAGCTTGCCGAGGTGCTCACGAACTCGATGATCGCCCGCGCCCAGCCCATCGACTGGGCGATCACCACCGCCGGCGCCGACCGCGCGAGCCTCTGCGGCGAGATGCGGGACTATGCCGAGGAGGTGGTGCGGGGCACGGTCAGCGACGACAGCTTCTTCGCCTATGTGGCCGAGCCCCCGCCCGACTGCGACGTGGCCGATCCGCGCTTCTGGAAGATGGCGAACCCGAACCTCGGCGTGGCCTTCTCGGAAGAACGGTTCGGCGAGATGTACCGCGAGGCCACGGTCATCTCGGGCAAGATGCCGAACTTCCGCCGGCTGCACATGAACCTCTGGACGGAAGGCGCCCAGACCTGGATCGCCCGCGACGTCTGGGACCGGGGCGCCGAGCCGTTCGACCCGCGGCCGCTCTACGGGCTGCCCGCTTGGGTCGGCCTCGATCTCTCGAAGACCACCGACCTCACGGCGATCTCGGTCGCCGTGCCGAAGGACGGCCAGATCTATCTCCTGGCCTATTCCTTCCTGCCCGAGGGGCCGAAGGGCTTCATCGCGCGGGCGCAGAAGGAGAAGCGCGAATATGTCGCCTGGCGCGATGCCGGCTGGCTCGAGGTCCATTCCGGCGGGGTGATCGACGAGGATCAGGTGATCGAGCGGCTGGAGACGATCCGGGCGCGGTTCGACCTGCGCGAGCTCGCCTACGACCGCTGGGGCATGAAATACATGGCGAAGGAGCTCCTGAAGCGCCGCTTCCCGCTGGTCGAGCACGGGCAGGGCTACGGCTCGATGTCCTCGCCGATGAAGCGGTTCGAGGAGGCGGTGGCGAAGGGCCGGATCCGCCACGCCGGCAATCCGGTGCTGGCCTGGGCGGTGGGCAACGTCCACCGCGACGAGGATGCGGCCGAGAACATCAAGCCGAACAAGGCGCGCTCGAAGGGCCGGATCGACCCGGCGGTGGCCGCGATCATGGCCCTGGGGCGCGCCGAAGCCGCCGAAGGCCGCCGCCGCGCAAGGGAGGTGGAGACGGCGTGAGCGGGGGCGCCGCTCGCGCTTCGGGAATGGCGGCCCTCGGGACGGGCGGAACGGCCAAGGCTCCTGGCGAGATCTCCGGATCACTCTGCCCGGAGGCGGCATTGCGGACAAAGCTGCCAGTCGCATATTGATGCCGCTGGTCGATAGTGGTTAGCATAACTGCATCGACTGAGTGGCAACTAGGGTTCCGGGCCTTTGGCCGCAGGACCGAGCGTTGCAGAGACCGGGGGAGTTCTCCGGGCCAGCACCCAAGGGACAAAAGCCCAGGGGAGGAGACGTCGAGATCAACCGCGCCGATGGGCGCATAAGGGGGTCTCAATGACGCTTGCCTCGCTGCTTCTTGTCGTTCTCGCATCCTTCATACATGCCAGCTGGAACCTGCTCGCCAAGACGGCCGCATCGGTCGGGCCGGTCTTCGTCTTCGCCTACAACCTGATCGCCTGCGTGGCCTACGCGCCATGGGTGCTGTATCTGCTGATACACGACGGCATCATCTGGACCTGGACGGGCATCGGGTTCGTGCTGCTCAGCGGACTGATCCATCTGGCCTACAGCCTGTGCCTGCAGCGCGGTTATCAGGTGGCCGATCTGTCGGTGGTCTATCCCGTGGCGCGAGGGACGGGGCCGATGCTCTCGACCCTCGGGGCGTTCCTGATCCTCGGCGAGACACCCTCCGGCACCGGTCTGATCGGTCTGGCCCTGGTCGTCGCGGGGATCCTGCTGATCGCCACCCGAGGCAGCCTGGCCGCCTTCACGCGCCCCGGCGGGCAGGCCGGGGTCCGCTGGGGCACCGCAACGGGCGGCCTGATCGCGAGCTACACGGTGGTCGACGCCTTCGCGGTGACAGCGCTCGGCATCGCGCCGGTGGTGCTCGACTGGTTCTCGAACCTGCTCCGGTTTTTCCTGCTGCTGCCGCTGGTGATCGCTGACCCTCGCCGCGCCCTGAGCGCGATGCGCGGGCACTGGGGGGCCGCGATCGGAGTGGGCCTTCTGTCTCCGCTCTCCTACATCCTCGTGCTGGCCGCTCTGACGGGTGGCGCCCCGCTCAGCCTTGTCGCCCCGATGCGCGAGATGTCGATGATGGTGGGGGCGCTTCTGGGCATGTTGATCCTGCGCGAAGCAGTTGGCCGGTGGCGCCTGGTGGGATGTGCCGTGCTGATAGCGGGAGTCATCCTGCTCTCGGCAGGCTGACACCTGCGACCGTCCGCAAAGGGCTCGCAGGAACATAGAGTCTGCCCACCGGCACCACGGAACACAATGAGCAGGCGCTCCCGCCGGATCGCTTGGCGAAAGGAAGGACCGGCATGAGCAGATGGCCGCGATTTGGCGCTTCGCGAATGGCGGGCGCCTCCGTCCGCACCGAGCCGCCCGTGACGGCGCCGCAGGCCGCGGCCGAGGCGAGCGGCACGGCCGTGCCGAGGCCCTGGCTGCAGGAGGTCGGCTGGAGCTCGGGCGGCGCGAGCCGGATCCGCACCCTGCCCCGCGTCTCGGCCGAGGTGGCGCAGCGCCATGCCACGGTCTATGCCTGCTGCGCCGTCATCGCGGGCGATCTCGCGAAGGTGCCGCTGAAGCTCTTCCAGCGCAGCGGCGACGGCCGCGAGGTCCGGGTGCGGGACCATGCCGCGCCCTATCTTCTCAATGTGGAAGCCGCGCCCGGCGTGGCGGCCTCGGTGGTCCGCTTCGCGCTGGGGTACGCCTTCACCCTCCGCGGCAATGCCTTCGCCTGGGCGCCGCGCGACGGGGCGGGCGAGCTCGAGCTGATCGATCTGGTGCGCCAGTCCGGCTGCAGCGTGCTCCGCGCCGGCCGAGAGCGGTTCTACGACTTCGAGGACGGCGCGGGCCTCCGCCGCCGCGCCCCCGCCCGCGCCATGATCCATCTGCGCTACATGGCCGAGGACGGCTGGACGGGCCGCAGCCCGCTCGAGGTCGCGGCCGAGAGCGTGGGCCTCGCGCTCGCGGGCCAGGAGTCCGCGGCCCGTGCGGCCTCGGGCGTCACCGCCCGCGCCGTGATCCGGCTCCGCGACGATTATGAGGATGACGAGGCCCGCGTCAGGAGCGCCCGCCGGGTGGCCGCGGCCCTCCGCGCGCCGGAGGTCGAGGGCTTCCCGATCCTCGGCGAGGGCGAGGATGTGCAGACGCTCGACATGAAGGCTGCCGATCAGGAGCTGCTCGGGAGCCGCAAGTTCGACCGCGAGCAGATCGCGGCGATCTACCGGGTGCCGCCGGCGAAGCTTCAGATGATGGAATACGGGGTGAAGGCCAATGGCGAGCAGCAGGCCATCGACTATCTGACCGACTGCCTCCTCCATTGGGCGAAACAGGTCGAGGACCAGCTCGCGCTCGGCGTGCTGACCGAGGCCGAGCGGCGGGCGGGCCTCTTCTTCCGCCATGACTTCGGGGCCCTGCTCCGGCCGACAACCAAGGAGAGATTCGAGGCGCTCGCCAAGGCGGTGGGCGGCCCGATCCTGACCCCGAACGAGGCCAGGCGCATCGACGGCTACGATCCGATCGAGGGCGGCGACCGGCTGAACCCGGCGCCGAACATGACCCGCAGCGAGGAGACAGACCCATGACCCGCACGCTCGCCAGCCTCTTCGGCCCCCTGCAGCCCATGGCGCTGGCCGAGGATCTCGCGGCGCCCCTCCTCGCCCTCGCCATTCCGGAAGGCGGTGTCGGCGGATCGGCCCCCTCCGAGAGCATCCTGTCGGCCGCAACCGAGCCCGCCGCCCTCGCGCACGGGCCAACGGCCGGCCCGACCGTCCCCGACCGCTTCACCGTCGCGCGCGGCCTCGCGGTGGTGCCGGTGCGCGGGATCCTCACGCCGAACATGGCGCAGTACGAGCGCTGGTTCGGCTGGGCCACCTACCATGGTCTGGCCGAGACGCTGGCCCACCTCGCCGCCAGCGAGGATGCCGCCGCCATCGTGCTCGAGATCGACAGCCCCGGCGGCCTCGTCTGCGGGATCGAGGCCGCGGCCGAGGCCATCGCCACCGCGGCCGCCGTGAAGCCGGTCCATGCCCTCGTCTCGCCACTGGCCGCGTCCGCCGCCTATTGGCTCGCCTCGCAGGCCTCCGAGATCGTGATGACGCCGGGCGCGGTGGCGGGCTCCATCGGCATCGCGCTGACCGCCGCGGCCCACGTCCAGCCGGGTGCCAACGGCGCGCAGATCTTCGAGATGAGCTCCCGCCACGCCCGCGCCAAGCGCCCGGACGCCTCGACCGAAGCCGGCCGCGCCGAGCTCCAGCGCAGCCTCGACGAGGCCGAGGCCGCCTTCCACGCTGCCGTCTCCACCGGCCGCGCCATCCCGGCGGCCGAGCTCGCCGCCCGCCTCAGCGTGACAGACGATCCGCAGGACGGCGGCGCGACCTTCCGCGCCCCCGCGGCCATCCGCCGCGGCCTCGCCGACCGCAGCGAGACCCGAGCCGCCTTCTACGCCCGTCTTACCGCCCGCACCGCGCCGAAGCCCCGCAGCCCCAGCCGCGCCTTCGCCGCCCGCGCCGCAGCCGCGGCAGCGCTCGCCCGGAGCTGAGGGCACCTCCTCACCAGATCAGCTTGATGCCCGACATCAGGAGCAGGACCGACAGCGTCCCGCGGAGCCAGCGGTCAGACAGATAGCGGCTCCCGATATAGGCGCCGATGGTCCCACCGACAGCGACGGCGACCAGCCAAACGGGAAGCGCCGAGGGAGCCTGGTCCCAGGCGGCGTAAGTCCCAAGCAGGGCCGCAGAGGAGTTGATGAGGTTGTAAACTGCTGTCGTGGCTGCCGTTTGCCGCGCCGATCCCCACTTCATCGTCAGGATGATCGGGGCAAGGAAGACCCCGCCACCCGTGCCGGTGGTCCCCGAGACGAACCCTATGACCGCCCCCGTGGCCAGGGCTGCGGCGAAGGGCGGGCGGCCGACCACTGCCGGACCGACCCCCGGGCTCAGGATCGCGGCGCGGGCCATCTGAAGGGCGGACAGGATGAGAACCGCGCCGACAACCGGAAAGTAGATCCCCTCCGGCAGATGGATCGCCCCTCCGAGGAGCGAGAAGGGAAAGCCCAGGACGGCGAAGGGCCAGACATTCTGCCATGAGAGCCGTCCTGCCCGCAGGAAGAGGGCGGTGCCGATGGCGGCCACCAGCAGGTTCAGGGACAGGGCCGTCGTCTTCATCGCGAGCGGGCCGAAGCCCGCCAGACCCATGATGGCGATGTAACCCGATGCTCCGGCCTGCCCGACCGCCGCATACAGCAGGGCAGTCACCAGAAATGCCCCCGAGAGCAGAAGCGCCTCGCCGTTCACAACGACCTCGTAGCCTGCGACGTCTCACCTATGCCGCATCCGGAGACCGGGTCCAAGAGAAGCCGCTCGCTCCGAACCTGCGGTGCGAGCCGGACCGCCGCCTCTCCGCCTCGCCGGCGCCCCGCAGCAGGCCTGCACCTGATCCACCTCTGAGCTGAGCCACGCCAGGCGGTCAGCCTTCACCCCTTCCCATCCCCCGCCCGTGGCGGGGGCTTTCGGCTGCGCGGATGCAGCCCTGCCAAGAGAGGATCCCATGGCACGACAGAATCTCGACGACCTGCGCCGCGCCCGGAAGGCCGCGGCCGACACGATGGCCGCGGTGGCCGCCCGCATCGGCGCGCTCGAGGCGGCAGAGACACCGGACGCCGCCGCGCTCGAGGCCGAGACCGCGGCCTTTGCCGCCGCCGAAGCCGCCTTCGCCAGGGCCGATGCCGCCGTGACGCGCGCGGCCGCTGTGGAGGCCGCGCAGGCGGCTGCAGCTCAGGGCGACGGTGCGGGCGGCGGGAGTGGAACGGGTGCCGCCGGCACTGACGCCGTGCCGGCGGTGGCCACCGATCCGGCGCATCGCGGGGTGGCAGCGGGCTTCATGGTCCAGGCGCTCGCGCGCACGAAGGGCGACCGGGACAAGGCCGCCCGTCTCCTCGAAGCCGAGGGCCATGGCGCGATCTCGGCCGCGCTCTCGGGCGCGAGCGAAGGCGCGGGCGGCGTCACCATCCCCCGTCCCCAGGCGGCCGAGCTGATCGAGATGCTGCGCGCCCGGGTCGTCGTGCGCGCCTCGGGCGCCCGCACCCTGCCGATGCCCGCGGGCGAGATGCGGCACGCCAAGCAGGTGGGCTCGGCGGTCGCCGCCTATGCCGCCGAGAATGCCGCCATCGCGCCGAGCCAGCCCAGCTTCGACAAGATCGACCAGAGCTTCAAGAAGCTCGTCGGCATGGTGCCCATCGGCAACTCGCTCCTGCGGCACTCGGGCGTGGCGATGGCGCAGCTCGTGCGCGACGATCTCCTGAAGGTCATGGCGCTCCGCGAGGATCTGGCCTTCCTGCGCGGCGACGGCAGCGCCGACACGCCGAAGGGTCTGCGTCACTGGATGCTGCCCGCGAACTGGTCCGCCGCACCGGTCGCGGCCACGCCGGCGGCGGCCGAGGCGGCGATCCGGCGGGCGGTCTCGCTCGTGGAGGATGCCGACGTGGGCATGGTCTCGCCCGGCTGGATCATGCGGGCCTCGACGAAGAACTGGCTCGCGAGCCTGAAGGACGCGAACGGCAACCCGCTCTTTCCCTCCATCGGCGCGTCGGCCCAGCTCATGGGCTTCCCGATCCGCACGAGCTCGCAGATCCCCGACAACTTGGGCGCGGGCGGCGACGAGACCGAGATCTACTTCGGCGACTTCGACGAGGCGATGATCGGCGACAGCATGGCGCTGGTGGTGGGCTCCTCCACCGACGCCTCCTTCGTCGACGGCAACGGGGCGACCGTCTCGGCCTTCCAGAACGACCTCACGCTGATGCGGGCGATCTCCGAGCACGACTTCGCGCCGGCGCATGACGAGGCCTTTGCCGGCTTCAACGCCTCGGGCTGGACGCTCTGACGCCCGGATCGCGGCGCGCCACCCTCCCCTGTCCACGTCCCTCTTCGGCCCCGGCGCTCCCCGGGGCCGAACTCGTTCGGCGTCCCGTCCCCTCCACCCCGGCGCCTCCCCCCCATTCTCCCGGAGAGATCCATGAAGACCATCGTGACCTTCATCCGCCCTTGGAACCGCTACAACCGCGGCGAGACCGCGGGCTTTGATCCCGCCACGGCCGCAGGCCTGATCGGCGTCCATGCCGTGCCGTACCGGCCCGCGGAAGGAGCGCCCGCCGCGATCCCGGCCGCCGCCCCTGCCCCCGCGCCAGCGGAACCCGCCCCGAGCTTCGAGACCGCCATGGCCGCGCTCGAGACGCCGGCCGAGACCGCGCCGGCGGCCGCCGCACCCGGCACGGCCGATCTTCCGGTGCAGGGCCGGCGGAAGTGAGCCCATGCGCGTGATCGAGCCCCCGGCGCTCGCGGTGTCGGTCGAGGCCTTCAAGCGGGCGGTCCATCTCGACGGGCCGGACGACGATCTCCTGATCGCCGAGCTCCTCGCCGCCGCCACCGAGGTGGTCGAGACCGCCGCCCGCCGCGCCCTCATGCCCCGGCTGGTCGCCTTCGAGACCCCGGCCGGGCGCTGGTCGCGCTGGTACCTGCCCATCGCCCCGGTGATCGAGCTCGTGGAGATCTCCGCCCCCGCCGCCCGCCTCGTCCGCGGCTTCACCGAGCCCGCGCTCGAGCGGACGGCGGCGGAGGGCGCGGTCAGCCTCACCGCGCTCTGCGGCCACGAGGATCCGGCCCGGATCCCGCGCGGCCTCTGCCAGGCGGTGATCCTGCTCGCCAAGGAATGGCATGACGCCGGGATCGGCCCCGTTGAGAGCGCGCCGCCCCTCTCCTTCGGCATCCAGCGCCTGATCCGTCAGGCCCGCTACGCCCGGCCGATGGTGTCGGAATGAGAGCCCCGCGCTTCGACCGCCGGGTGCAGATCCGGCGGGCCACGCTCGCCGACGACGGCTTCGCCTCGGTCGAGGTCTGGGCCGATCATGGCGCTCCGATCTGGGCTGCGAAGGCAGACCTGAGCGACGGCGAGCGCTGGAGCGCCGGCGAGATCGCGGCCAGCGTCACCACCCGTTTCACGCTCCACCGCACCGCCTTCGCGAGGGGCCTCACCCCGAAGGACCGGCTCCTCTGCGAGGGGCGCAGCTTCGAGATCTCCGGCATCAAGGAAAGCGGCGCCGGCGGCCGCTTTCTCGAACTGACCTGTTCCGCGAGGACCGACCGATGAGCGTCACCGTCTCCGTCACAGGCCTCCGCGAGATCGAGGCGCAGCTTGCGAAGCTCTCGAAGGCCGCGGGCAAGGCAGCCCTGCGGCGGGCGCTGAAGACGGCGGCGCAGCCGCTGGCCGATCTGGCCCAGAGCAAGGCCCCGGTCGGCGACACCCGAACGCTCGCGCCCTCGATCACGGTCGGGACGCGCCTCAGCAAGCGGCAGGCGAAGCGGCACCGCCGCATGTTCCGCGACGACCGGGCCAGCGTCGAGATGTTCGTGGGCGCGGGGCCGCTGCCGAGCGCGCACAATCAGGAGTTCGGCAACATCCACATGGCGGCCCAGCCCTTCCTCCGCCCGGCCTGGGATCAGGACCGCGAGGCCCTCCTCGAGCGCCTCCGCGCCGATCTCTGGCAACAGATCTCGAAGGCGATCACCCGCGCCGAGAAACGCGCGGCGCGGGCCGCAGCGAAAGGAGCAGCGAAGTGACCGATGAGAAGAAGCACCCCGACGCGGTGGTGCGCCAGCTCTCCGATCTGGCGGCGCAGCATGGCTATGGCCTCATGGCGGTCGAGCTCGTCAGCCGTACGCCGGTCGGCATCACAGTGCCGTGCCACATCTATCGCGCGCCCCGCCCCGCCTCGGACGGCTGACCCATGGAAGAGGCCCTCCGCGCGCTCCTCCTCGGCGCGCCGGCGGTGACGACCCTCGTCGGCCGCCGCGTGAACTTCGGCCGCCATCCACAGGGCGAGCCGCTGCCGGCGCTGGTGCTCACCACGGTGAGCGACCGCGAGGGGCTGACCCTCGCCGGGCCGGACGGGCTGCAGCGGGCCCGCGTCCAGATCGACTGCTACGCCGAAAGCTACGGCGCAGCCAAACAGCTTTCCCGCGCCGTGCGCGCCGTGCTCCACGGCCACAGCGGCGGCGGGTTCCGAGGTGTCTTCCTCGACGGCGCGCGCGACCTCCGCGAGCCGGGCGACGACACCGGGCGGCCCTTCCGGGTCTCGCTCGACTTTCTCACCATCTACTCAGCATAGGAGGGCCGGATGGCCTCGAAACAGATCATCGCCTACGGGGCCGAGGTGGAGCGCTCCACCGATGGCGCCGCCTGGAGCGCGATCCCGGAAGCCAAGGGGATCGCCGTGCCTGCCGTCGAGCAGGATTATCAGGACGTGACCTCGCTCGACAGCGAGGGCGGCTATCGCGAATACATCAAGGGGCTGAAGGACATCGGCCAGATCACCATCCCGATGGGCTACACCTCGGCGGGCTACGCCGCCATGATCGCCGATCAGGAGGCCGCGCACCCGATCTACTACCGCGTGACGATGAAGCCCGCGCCGGATCAGAGCACGGGCGACGTGTTCGAGTTCCGCGGCTTCCCGGTGCCGCAGCTCGAGGCGGGCGACCTCGGCGCCCCGGTCGGCATCAACCTCAACATCCGCGGGACCGGCGCGCCCACCTGGACGCGGGGGACGGAGGCATGA
Protein sequences of DBSCAN-SWA_7 >NZ_CP051468|2569000:2588747|2572319_2572715_-|WP_011336927.1|DBSCAN-SWA MEAAPHPAPRPGPVPVESLDAADLDRMAAAIDSGRIRKPPMKETPADRAVAEAAYQVAADELRQFIERYEQLEAEKKEIGGQQKELMAEAKGRGYAPKILRMIVALRKRTPDDIAEEETLLALYKAALGMA >NZ_CP051468|2569000:2588747|2577384_2577798_+|WP_017140048.1|DBSCAN-SWA MIRKLCCAPGCEELAPAGQAHCADHAAEAEARARSRKARAKAGAAAQAGAAFYATGRWRRARARFLAAHPLCADCAGLGLVVAAAEVDHIRPHRGDPGLMWDRANWQPLCRPCHSRKTAREVFHPPGGIAKSEGSAR >NZ_CP051468|2569000:2588747|2570058_2570301_-|WP_011336930.1|DBSCAN-SWA MSARRSLTFAPRLLGAAEAAAYLGVSATTLRGLDLPRRVLGGRRLYDRLDLDAFASDLPVEGETFESEVELCDRHFGMQA >NZ_CP051468|2569000:2588747|2586148_2586439_+|WP_011336912.1|DBSCAN-SWA MKTIVTFIRPWNRYNRGETAGFDPATAAGLIGVHAVPYRPAEGAPAAIPAAAPAPAPAEPAPSFETAMAALETPAETAPAAAAPGTADLPVQGRRK >NZ_CP051468|2569000:2588747|2578062_2578500_+|WP_011336919.1|terminase|DBSCAN-SWA MRGQKPKVSNVIPMKGDLAAPVPEAPGWMSAEGREVWDRLAPVLVAKKRLEPAFEDPFAVYCEAVADVIRFTGDIAAFGSWYEVETRNGRQQKKRAVWGQRQDAIAVMNQLAARFGLTPVDEARVRVTGQGDLFDEILKTLDGAD >NZ_CP051468|2569000:2588747|2588306_2588747_+|WP_011336906.1|DBSCAN-SWA MASKQIIAYGAEVERSTDGAAWSAIPEAKGIAVPAVEQDYQDVTSLDSEGGYREYIKGLKDIGQITIPMGYTSAGYAAMIADQEAAHPIYYRVTMKPAPDQSTGDVFEFRGFPVPQLEAGDLGAPVGINLNIRGTGAPTWTRGTEA >NZ_CP051468|2569000:2588747|2587275_2587749_+|WP_011336909.1|DBSCAN-SWA MSVTVSVTGLREIEAQLAKLSKAAGKAALRRALKTAAQPLADLAQSKAPVGDTRTLAPSITVGTRLSKRQAKRHRRMFRDDRASVEMFVGAGPLPSAHNQEFGNIHMAAQPFLRPAWDQDREALLERLRADLWQQISKAITRAEKRAARAAAKGAAK >NZ_CP051468|2569000:2588747|2587916_2588297_+|WP_011336907.1|DBSCAN-SWA MEEALRALLLGAPAVTTLVGRRVNFGRHPQGEPLPALVLTTVSDREGLTLAGPDGLQRARVQIDCYAESYGAAKQLSRAVRAVLHGHSGGGFRGVFLDGARDLREPGDDTGRPFRVSLDFLTIYSA >NZ_CP051468|2569000:2588747|2570376_2572197_-|WP_017140052.1|DBSCAN-SWA MAKDFISRGDLRLIPLAELRLSPLNSRQEIAAEEVEAMAESLAVAGLLQNLIGHLTPQGIEIVGGGTRLRALQRLAAEGWSRHADLIPIDPVPVKVTADLQEAVAWAGTENSARSALHPADEVRAYAAMRERGASLSRIARSFARSEAHVERRLKLADLPAEALAALRANEISLEMAKALTFAPSGERCLEVLASVRGRDVRPEQVRRELTPGTVPSTDRRAVFVGLEAYLAAGGTSQRDLFADRTLLEDEALLDRLFAEKGAAEAERIRAEEGWEWATWVPEEYVSWTVTQKLVRLYARPGKLSEGEEAELAALEEREADDTLDEPGRARLAELEARREGGFTDAQRASAGIFVYCSSRDGLSVERAYQQPRAVPRGAAELAPDLPQSLIEDLHRIRLGALQARLMDQSELMLDLLAWSLGGGLRPWARPLAISPTDQPIVPEKGEGTSYPPRLAARLEPNASLGPDGTPAEFEAFRAQGKKHRNQILTEALARTFCTGSSGLSAALARQLGVEVRRIWTPTAQGFLGRCSAGYLDRLWNELVPAAEAEPAHQSFNKLKKGEKAKRLEALFADPDTREALGLSREDCAKIDAWVPAELGWPEGEE >NZ_CP051468|2569000:2588747|2576008_2576587_+|WP_002720775.1|DBSCAN-SWA MAMTKAEERARVKALVVDRLEQAGMVRRRGTAAAAHEAAMARLCEQLGYMGAENLMTLAEVLIDSAADGVWPSEVLIRQFARAIEEPPPAERRLVSSWLASVEGPKAEAGGHLVELYRWLLKHPRPPLAMDLREIRDQAQDNARRCELIRDRIDRETASREDRDWLELYLRDRAQARALVDAGRGRKEGAAA >NZ_CP051468|2569000:2588747|2580463_2581336_+|WP_011336917.1|DBSCAN-SWA MTLASLLLVVLASFIHASWNLLAKTAASVGPVFVFAYNLIACVAYAPWVLYLLIHDGIIWTWTGIGFVLLSGLIHLAYSLCLQRGYQVADLSVVYPVARGTGPMLSTLGAFLILGETPSGTGLIGLALVVAGILLIATRGSLAAFTRPGGQAGVRWGTATGGLIASYTVVDAFAVTALGIAPVVLDWFSNLLRFFLLLPLVIADPRRALSAMRGHWGAAIGVGLLSPLSYILVLAALTGGAPLSLVAPMREMSMMVGALLGMLILREAVGRWRLVGCAVLIAGVILLSAG >NZ_CP051468|2569000:2588747|2578486_2580145_+|WP_011336918.1|terminase|DBSCAN-SWA MEPIDHPVSRYALDVVEGRETAGDLVRLACLRHLTDLETGRDRGLWFDCKAASRVLNFAELIQHTTGPLAGRPLTLRPWQAFRHGSVFGWKKEGGLRRFRTTYHQVAKKNGKTTDTAVPALFTALFDREAAPQGYCAATTRDQAGLLFRELKRMIRASPHLSALMQVWRTSIEVPATEGLIACLSRDGNSSDGINPHFAARDEVHRWTDRELAEVLTNSMIARAQPIDWAITTAGADRASLCGEMRDYAEEVVRGTVSDDSFFAYVAEPPPDCDVADPRFWKMANPNLGVAFSEERFGEMYREATVISGKMPNFRRLHMNLWTEGAQTWIARDVWDRGAEPFDPRPLYGLPAWVGLDLSKTTDLTAISVAVPKDGQIYLLAYSFLPEGPKGFIARAQKEKREYVAWRDAGWLEVHSGGVIDEDQVIERLETIRARFDLRELAYDRWGMKYMAKELLKRRFPLVEHGQGYGSMSSPMKRFEEAVAKGRIRHAGNPVLAWAVGNVHRDEDAAENIKPNKARSKGRIDPAVAAIMALGRAEAAEGRRRAREVETA >NZ_CP051468|2569000:2588747|2569000_2570062_-|WP_017140053.1|integrase|DBSCAN-SWA MKLPRVQRIRRGDRVLCYHRPTGTRLPDLPETHPDFIAAWAKAEASKPDLAPPVADKSVAKVVRALRASKRWKGFAASYRLNLRFHLERIETEHGALPLRGLRQKHVEMDLGKLDPNPRNVSLKCWRLIFAQAKADGIIESDPSHEIGKIVTKSPGHVAWTTAEVAAFRARWAVGTRQRAAFELLAWTGCRVSDACRMTRTHLGSDGLMTFRQQKTGGPAYVPWTCALPSWALRWEEERETVRAAVMATAGFTLLETSYGKARSVKGLSNFITAAAAEAGIEGRTPHGLRKFRLTAIAEAGGPAHAIMAWGGHASLSEAEAYTRAASRRGLVMGPEQERNDVNDPGNSCKRAK >NZ_CP051468|2569000:2588747|2587745_2587913_+|WP_011336908.1|DBSCAN-SWA MTDEKKHPDAVVRQLSDLAAQHGYGLMAVELVSRTPVGITVPCHIYRAPRPASDG >NZ_CP051468|2569000:2588747|2573043_2573241_-|WP_011336925.1|DBSCAN-SWA MTATAIRYPAAAPVARFALLDLTEPQLRRLLAAAEAGLPQLDAADDETDAELAAILGQAREALGA >NZ_CP051468|2569000:2588747|2575143_2575800_+|WP_023003515.1|DBSCAN-SWA MPHDRLTETDPPTEIVGDFWEYPLAFGDTLSSHEWVPLHINRLLTSRFVARALAENRRADIGTALLLWCEAFRQDPAGTLPDDDLELARLAGYGADLESWRAAREGALYGWRETHISNQEEAPGNRRLGHVMIAGIARDMHRRKRGRDQARTEGSKAMARTRVRRKLVEIKCGRAAESPDVVNEIAEWLGQRDLFITSDNVRAAFEATRGGPKVVQLT >NZ_CP051468|2569000:2588747|2573707_2574001_-|WP_043762677.1|DBSCAN-SWA MALRIKQLREARGWTQEVLAARSGMSRSQLAMIERETRPANTIRLNAIAGALGVSTEDLFESNPRERALLELIRLLPSEDVEALIRVAEGLAAKSQA >NZ_CP051468|2569000:2588747|2573237_2573423_-|WP_009564675.1|DBSCAN-SWA MQSPDFIAHQHYGIGVLTPGQKLHLQLRLRSAHVSLASAIAAGKYLSDIEDEPTDRRRMLS >NZ_CP051468|2569000:2588747|2574552_2575104_+|WP_011336923.1|DBSCAN-SWA MTAAAPLPVQDAATSPGAAASGAFRSNGWTALRRHPAGRADLLRWHADPALVARHARWGRPVYLASPYTLRAVGPDGRWSRDQSEAAMAEAAREVARLLEVGVTAISPVELSAAALHATMFPRLWIDPFNPVLWEDWCRPLLTVCAAVVVPEIRGWSDSTGIRHEVASALTAQVPVFIYGGLP >NZ_CP051468|2569000:2588747|2583684_2584428_-|WP_011336914.1|DBSCAN-SWA MNGEALLLSGAFLVTALLYAAVGQAGASGYIAIMGLAGFGPLAMKTTALSLNLLVAAIGTALFLRAGRLSWQNVWPFAVLGFPFSLLGGAIHLPEGIYFPVVGAVLILSALQMARAAILSPGVGPAVVGRPPFAAALATGAVIGFVSGTTGTGGGVFLAPIILTMKWGSARQTAATTAVYNLINSSAALLGTYAAWDQAPSALPVWLVAVAVGGTIGAYIGSRYLSDRWLRGTLSVLLLMSGIKLIW >NZ_CP051468|2569000:2588747|2576741_2577281_+|WP_017140049.1|DBSCAN-SWA MRGAKGRKAMKAKLVSLKLAPWDLGPLTPAQIAGKRIEEAAEVDPKTGKKSNPNRVIRTRRETWVGRYHRQGKLSVEQANIAAELFEAASGMPARDPLAAIVRVDMSGDQDPQVAQVDRRRKFFRMWELVPTFARPVIQHVVLDDQSLRGMPGRVDSRSEARQLDRLQRGLDALSEAWR >NZ_CP051468|2569000:2588747|2586443_2586950_+|WP_011336911.1|head,tail|DBSCAN-SWA MRVIEPPALAVSVEAFKRAVHLDGPDDDLLIAELLAAATEVVETAARRALMPRLVAFETPAGRWSRWYLPIAPVIELVEISAPAARLVRGFTEPALERTAAEGAVSLTALCGHEDPARIPRGLCQAVILLAKEWHDAGIGPVESAPPLSFGIQRLIRQARYARPMVSE >NZ_CP051468|2569000:2588747|2574105_2574348_+|WP_043764066.1|DBSCAN-SWA MSILSDYIKGRPEKSRSQWADEFGISRPHLHCLLEGSRLPSLDVATRIAVATGGAVPISSWPNIAAVLAAAQHELPAKFG >NZ_CP051468|2569000:2588747|2572714_2573047_-|WP_011336926.1|DBSCAN-SWA MSRILPLTPRDLNAARHRAEHAPTPEIREAASRLLTEHELAVATGRPLHRARPRPAFRTSRPSPLGPNLTGFLVGALALLGFLFVATVLWAHVTDVAETMRQQASAMRGM >NZ_CP051468|2569000:2588747|2584681_2586016_+|WP_011336913.1|capsid|DBSCAN-SWA MARQNLDDLRRARKAAADTMAAVAARIGALEAAETPDAAALEAETAAFAAAEAAFARADAAVTRAAAVEAAQAAAAQGDGAGGGSGTGAAGTDAVPAVATDPAHRGVAAGFMVQALARTKGDRDKAARLLEAEGHGAISAALSGASEGAGGVTIPRPQAAELIEMLRARVVVRASGARTLPMPAGEMRHAKQVGSAVAAYAAENAAIAPSQPSFDKIDQSFKKLVGMVPIGNSLLRHSGVAMAQLVRDDLLKVMALREDLAFLRGDGSADTPKGLRHWMLPANWSAAPVAATPAAAEAAIRRAVSLVEDADVGMVSPGWIMRASTKNWLASLKDANGNPLFPSIGASAQLMGFPIRTSSQIPDNLGAGGDETEIYFGDFDEAMIGDSMALVVGSSTDASFVDGNGATVSAFQNDLTLMRAISEHDFAPAHDEAFAGFNASGWTL >NZ_CP051468|2569000:2588747|2581482_2582718_+|WP_017140047.1|portal|DBSCAN-SWA MAGASVRTEPPVTAPQAAAEASGTAVPRPWLQEVGWSSGGASRIRTLPRVSAEVAQRHATVYACCAVIAGDLAKVPLKLFQRSGDGREVRVRDHAAPYLLNVEAAPGVAASVVRFALGYAFTLRGNAFAWAPRDGAGELELIDLVRQSGCSVLRAGRERFYDFEDGAGLRRRAPARAMIHLRYMAEDGWTGRSPLEVAAESVGLALAGQESAARAASGVTARAVIRLRDDYEDDEARVRSARRVAAALRAPEVEGFPILGEGEDVQTLDMKAADQELLGSRKFDREQIAAIYRVPPAKLQMMEYGVKANGEQQAIDYLTDCLLHWAKQVEDQLALGVLTEAERRAGLFFRHDFGALLRPTTKERFEALAKAVGGPILTPNEARRIDGYDPIEGGDRLNPAPNMTRSEETDP >NZ_CP051468|2569000:2588747|2586946_2587279_+|WP_011336910.1|head,tail|DBSCAN-SWA MRAPRFDRRVQIRRATLADDGFASVEVWADHGAPIWAAKADLSDGERWSAGEIAASVTTRFTLHRTAFARGLTPKDRLLCEGRSFEISGIKESGAGGRFLELTCSARTDR >NZ_CP051468|2569000:2588747|2582714_2583674_+|WP_011336915.1|DBSCAN-SWA MTRTLASLFGPLQPMALAEDLAAPLLALAIPEGGVGGSAPSESILSAATEPAALAHGPTAGPTVPDRFTVARGLAVVPVRGILTPNMAQYERWFGWATYHGLAETLAHLAASEDAAAIVLEIDSPGGLVCGIEAAAEAIATAAAVKPVHALVSPLAASAAYWLASQASEIVMTPGAVAGSIGIALTAAAHVQPGANGAQIFEMSSRHARAKRPDASTEAGRAELQRSLDEAEAAFHAAVSTGRAIPAAELAARLSVTDDPQDGGATFRAPAAIRRGLADRSETRAAFYARLTARTAPKPRSPSRAFAARAAAAAALARS |
28 | Paracoccus_phage(26.67%) | tail,terminase,integrase,portal,capsid,head | attL 2567395:2567409|attR 2572098:2572112 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_8 |
2595418 : 2603995
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >NZ_CP051468|2595418:2603995|DBSCAN-SWA CATGGACTTCTTCTCTCCCCCGCCGACGCCGCCGAACAGCGGCAACCCCGGCACCTTCAACGACGATGCCGACGCCTTCCTCGGCTGGTTCCCGGCCTTCGTGGCCGAGCTGAACGCGCTGCTGCCCTATCTCACCGGCGCGGGCTTCTCCGACGGCACCGCGGCCGCGCCCGGCCTCGTCTGGAAGGGCGATCCCGACACCGGGCTCTTCCGGCCGGGCAGCAACGCGGTGGGGGTGACGGCCGGCGGCGTCCTCCGGCTCACGGTCTCGGCCCTCGCGCTCACCTCGACCGTGCCGCTCCGGGCGCCGCTCGGCACGGCCGCGGCGCCGGGGATCTCGTTCGAGGCCGATCCCAACACCGGGATCCGCAGCGATGGGGCGGACGTCCTCCACTTCGTCACCGGCGGCGTGACGCGTGGCTTCTTCTCCACCACCCACTTCCAGTCCACGCTGCCGGCCGCGCTTCCCGGCGGGGCGGCGGGAGCCCCCGGCCTCACCTTCGCGGGCGATCTCGACACCGGGATCTTCCGGGCCGCGGCCGACCTGCTCGGGATCGCGGCCGGGGGAGAGGAACGGTTCCGCGTCGGCTCCGGCCGCGCGGCGGCGCTCGTGCCCTTCAGCGTGCCGGACGGGACCCAGACCTTCCCCGGCCTCACCTTCAACGGCGAGGTCGGCTCGAACACCGGCTTCTTCCTCGCGGCCGAGAACGAGATCGGCGTCACCTGTCAGGGGACGGAGCGGGCACGGTTCACGCCCTCCGGCATGCAGCTGCAGGGGCTCCTGTCCGGCACGGCCGTGACCCAGAGCGATCTCGACACCACGCCCGGGCGCCTCCTGAAGGTCGGGGACTACGGGCTCGGCGGCACGGCGCGCCCGATCCCGGGCAACGATGCCGACCAGATCGGGACGACCGGCTTCTATCAGGTGACGGGCGCCACGCTGAACCGTCCCGCCGGCATGAGCGTCGGTACCCTGCAGCACATCCAGCACGGGGCGGCCCGCGCGGTGCAGATCGCCTATCCTCAGACGGCATCCGATACGGGGCGCTGGTGCCGGCACAAGGATACCAGCTGGGGCGACTGGTTCCTGACCTACGACCAGCGCAACATCGTCGGCGCCGTCAGCTGGGCCTCCGGCTTTCCGCGCGGCGGCATCATCGAAAAGGGCGAGACCGCTGGCGCCGAGTATGTCCGCTTTGCGGATGGGACGCAGCTCTGCCGCCTTGTCCAGACCGGGGTTCCGGGTCCGACCACGCCGCAGGGGTCGCTCTATCGCACCGAATGGCAGACGGTGACGCTGCCCGTCGAGTTCGTGAGCGGGGCCCTGAACGGCCATTGCGTGACCGGCGGCTGCCGGGGCGGTTCGGTGATCTCGCTCCTCGGCCGGCCGGGGGCCTCGAACGTGGCCGCCTACATGCTGCTGGCTCCGACGTCCTACGGGGCCACGCAGACCGTCGATCTTCTCGTCACCGGCCGCTGGAGGTAATCCGCCATGATGCGCATTCGCATGGTTCCGCTTCGGCGGATGTATGAGCTGACCGTCTTCCGGGTGCAGGGCGACACCCTGACCTGCAACGACATGGTCTACGACTTCAGCGGCGTCGAGGAGGGCGATGTGCTGCCCTGGGACGCCATGGACAACACGTGGGTCACGAGCAACGTCACCCGGGTGAACGGCGTCCTCGAGTTCGAGGTGGTCTTCCCGCACGGCTATTACGGGGACCTGCCCTTGCCGACCCCCGGCATCCTCGAGGTCGAGGATCAGGATATCCCGATCCCGCCCTATCTCCCGCCGTTCGCGGAGGGCTGATCCATGGCAAGAAAGAGAGCAGCAGCGATGACAAGCGCGACCATCGATTACAGCAGGCTCGTGAAGGCCCGGGACATCAGGGCGCAGGCCGAGGCCCGCGCGCGGGGGCCGGCCGAGATCAGCGTCCTCCAGGCCATGATCGTGGTCGGCGAGGAGAAGTGGGGCCAGGCAATGGCCATCGCGGAGGACGCGGCCTACCCCTGGGCGATGCGGGCCGCGCTGCGCGGCGCGACGGTGCTCGTCCGGGATTCGGAGACGACGGACACGCTGGCCTTCCTCCTCGGCCTCTCCCCGGAGGAGACCGACCGGCTGTTCATCGAGGCCGCAGAGGTGAGGCTGTGAGTCTCATCGAGCAGGCGCGCGTCAAGCAGTCGGCCGCCTGAGGCGCCCCCGCCTTCCCGATCATCATGACAGGACCCGGCCGCCCCCGGCCGGGACGAGACCCGCGCGACGCGCGCACAACAGGAGCGGCGCGATGCCGGAAAAGGGACTGATCGACACCATCACGGCGCTCTGGGGCGGGGCCATCGCCACGCTGATCGCGGCCGCCATGGGCCGGCTCATGTATCACACCGGCGAGGTCCGCGCCCGCCGCCGCGCCTTCTTCGGCCGCGAACTCCTCTGGGAGATCCCCGCCCTCGTCGCCATGGCCTTCATCGGCGAGGCCCTAAGTTCGTATCTCGACCTCGACGGCCGCGCGGCGATGGGCCTCGTGGCTATGCTGGCCTATCTCGGCCCGCGCGGGACGACGGCGATGCTGGAGCGGCTCTGGCGAGGACGGAGCGCGGGGTGAGCGACGATATTGAAGCGAGCAGGGTGCCGCTCGGGATCGGACAAAGGCCTCCCTGCAGCGGCTCCGTCGATCATGATCCGCACGCCTGGCTCAGGGCCTCGATCAGCATGCGCGGGTGGACGGGCTTCCCGATCGCTGTGACGCCAGTCAGAATCTCGCCGCCGGGAAAGTCGAAGGTCCTGTAGGCTGAAGTGACGATAAACGGAACCTCGAGCTCGCGGAGTTTCTCGGCCACCGGAATGGAGGATTGCCCTCGCAAGTTTAGATCGAGCGACACCACGTCCGGGCGCGTCTCGTCCAGGATCCTGAGCGCGTCATCAACCGTGCCAACCGGCCCGAGTACGGCGTAGCCCACATCCTCCAGCATCGACTGCAGGTCGATGGCGGTCAGGAATTCGTCCTCGACAACAAGGACGAGCAGGCCCTGCTTCGGACGAGGCTCGGCCGGCTCGTTCGGCATGGTATACTCGGGTCAGGGGTTCTGGCTCGGAAAGCTCAACTTCGCCACGAAACCATCAGGCTCATAGGAAAGTTCCGCGGAACCTCGCAGTTCGAATTCGGTGGTAAATTTGATGAGACGGGTGCCGAAGCCGGTCTTCGAGGGTGCCGCCACCTTCGGCCCCCCTCGTTCCTGCCATCGCAAATGCACTTTTTTCCGGCCTTCCCCGTCCGAGACGACCTTCCAGTCCAGTGTCACGGTTCCACTCGACGTGGACAGGGCTCCGTACTTTGTGGCATTCGTCCCCAGCTCGTGAAGAACGAGGCCAAGCGGGGTGGCCTGCGAAGGGGAGAGCCGGACGGCCGGCCCTTCCGCGATGACAACGGCGGATGCCGCCTCCAGATATGGCTCCATCACGGCGACGGCCAGGTCCCGCAGCTGCGACGACTGCTTGTGCGACGCCACTTCGAGTGATCGCGCCAGAGCATCGAACCGTCCGAGGAGCGCGGTGCGATATTCGCCGGCACTTCTGCCTTCGACGGCCGTCTGTCGGGTCAGAGCCTGTGTCACCGACATCAGGTTCTTGATGCGGTGATCGAGCTCGTTGATCAGGATGTCCTTCGCGGCCTCCATCTGACGCCGGGCCGTCGCATCCACCATCGAGAGGAGGAGGACACGCTGCCCATTGTCCGGGTGGATCAGGCGTTGCGCGCTGATCAGCATGGTCCGCAGGCCCACCTCCGGAAATCCTGCCTGCACCTCGAAATCCATGATAGAAGCGCTTTTCGGGATCACGTTCTCGAGCAGCATGCGCAGTTCGGGAATGTTCCACTGGCCGTTACCGAGGTCGTAGAGCGGGCGGCCGATGGTGCTGTCCCGGTCGGTCGCAAAGGTCCGGTAGAAGGCCGGGTTCGCGCTGATGACCGTCAGATCGCCTGCGAGGACGAGCAGCGGGTCCCGGATGGTATCGACGATGCCCTGAGCCTGCACGTGCGATACACGCAGAAGGCGGTATAGATCATCGAGGTCCATTTGAGTTCCTTCTCAACTGATCTGCTGTAATTATCGGAGCCTTCACCGCACGCGCATCTTGCCCTTGTGGCACTCCCTGATGCGCTCATCAAGCAGCACCCATGCGGACCACGAGGCTTTGGCCTTTGGTCGCATGGGAATGTGACGCTGGAAGTCGAGCTGCGGACGCCAAGGCTCTGCCGTAGCATTTCCCACATGGGGTCTACGGCCTCAACAATCGAGATCCGGAACGCCCCTTCCTCGGGGCGCCCGTAGCGCCCCTCAGGCAGGGGCCGGGCTGGTGTAACAGCCCGAACCACGCGGCCAAAGTACCACCTCCGACCGCGCCAGCCTGTCCGTCGCCTTTCAGACTGCCTGCCACCCCTCGCGAGGGCAGGCGCCTTGTGAGGCAGAATCACCTTATGAAACAAGTACCTCCCGCCGCCCCGGTCGCCCCGTGGCTCGGCGGCAAGAAACGTCTCCACCCGCTCATCCTCGAGCGGATCGAGGCCATCCCGCACCGCGCCTATGTCGAGCCCTTCGTCGGCATGGGCGGGATCTTCCTCCGCCGCCGCTTCCGGCCCCGCCTCGAGGTCATGAACGACCGCAACGGCGAGATCATCAACCTCTTCCGGATCCTGCAGCGGCATTACCCCCAGCTCCTGGAGATCATGCGCTTCCAGATTTGCAGCCGGCGCGAGTTCGATCGGCTGCGCGTGACGGATCCCGCCACGCTGACCGACCTCGAGCGGGCCGCGCGGTTCCTCTACCTCCAGCGGCTGAGCTTCGGCGGCAAGCCAAACGGCAGCTTCGGCATTTCGCCTGGGAACGGGCCGCTCTTCTCGCTCGCCCGCCTCGAGCCGCTCCTCGACGCCGCCCGCACGAGGCTCGATGGCGTCGTCTTCGAGTGCCTCGACTGGGCCGAGCTGATCCCGCGCTACGATACGGCCGAGACGCTGTTCTATCTCGACCCACCCTACTTCGGCGGCGAGAACGACTACGGCCGCGGGATCTTCGACCGGGCGCAGTTCGCGCGGATCGCCGAGATCCTCGGCGGCCTGAAGGGCGCCTTCCTCCTGTCGATCAATGACACGCCGGAGATCCGGGCGCTCTTCGGTCGGTTCCACCTCGAGCCGGTGCGGCTGAGCTACTCGGTCGCCGCCTCGGGCAGCACCGAGGCGCAGGAGCTCCTCGTCTCGAACCGCGAGCGGATCGCGACCCTGCTCTGAAAACCACACCGGTCCGACCACCCACGCCCCGCCCTCGTGCGGGGCTTTTGCATATGGAGAAAGACGTGACGACATCCGACATCCAGCGGCTGCTCGCAGCCGCGGAGCTCTACCGCGGCGCCATCGACGGCGACGCGGGGCCGCTGACCCAGGCGGCCGCACAGGCGGCGCTCGAGGGAGAGGCGGTGCCCTGGCGCACCTGGCCTTTCCGCCGGCAGCGGATCGCGGCCGGCCAGGCGGTGCTGGCGCGGCTCGGCCATGCGCCGGGCCGGATCGACGGGCTCCTCGGGCCCAACACCCGCGAGGCGCTGACCGCCTGGGCCTCCGGGCCGGTGCGAGCGACCGTCGAGCGGCTGCCGCTGCCGGGCCATGGCGTGGCCGATGCCCAGGGCGCCTATCCGCGGCAGGAGTCTGTCGCGACCTTCTACGGCGTCGCGGGCGGCCCCGACTGCACCGCGGGGATCGTCGAGCTGCCGATCCCGTTCCGCCTCGCCTGGGATCCCGGCACGAGCATCACGAGCTTCCGCTGCCACAAGCTCGTGGTCGCACCGATGGCGCGGATCTTCCGCGAGGCGGTGGCGCACTACGGCGCCGTCCAGTTCGAGCGGCTGCGGCTGAACCTCTTCGGCGGCTGCTTCAACCACCGGCCCATGCGCGGCGGCTCCGCCCTCTCGATGCATGCCTGGGGCATCGCCGTGGACCTCGACCCCGAGCGCAACCCGCTCCGCTGGGGCCGCGACCGGGCGAGCTTCGCCGCACCCGCCTATGAGCCCTTCTGGACCATCGTCGAAGCCGCCGGCGCCACGAGCCTCGGCCGCGCCTGCAACCGCGACTGGATGCACTTCCAGTTCGCCCGCCTCTGAAGGAGAGAACCCATGTCCCTGATCTTCATCCTTGCCCTCGCCCTCTGCGCGTTCGCCTTCATATTGCTCATCGGCTGGCCGCTGGTCGCTGCTCTGGCGCGGCTGGCCGCTTCGACGATCTCGGCCTCGATCCTCGCCGCGCTCGCCGTGCCCGCAGCTGCCTCGACCGGCAGCGACCTTCTGACCGCGCTGACGCCCAGCCTCCTAGATCTGGCGGGCGTGATCCTGACGGCGCTGATCGGGCTCGCGACCGTCCGCTTCCAGCGTTGGACCGGGATCCAGATCGAGGCACGGCATCGCGAGGCGCTGCATTCGGCGATCATGACCGCCGCGCGGGTGGCGGTAGCCCGGAAGCTTGCCCCTGATGCGGCGACCCAGTTTGTTTCGAGCTATGTCCGGAACTCTGTACCGGATGCGTTGCAGCAGCTGGCCCCGCCGGCGGACACGCTCGATGTGCTGGTCCGGTCCAGGCTTGCCCAGGCGGCTGAGTTCTGAGCATGGCGCGTGCCCGTGGTGCTGTCACGGGCCACGCCCCTAGGACCGCCATGCACATGTTGCGGACGTGACGCTCGGTACAGCAGCACGTCTCATCTGGCTTGCGTAGCTCCCAAGCCGGACGCGTCAGAATACGAACTGAATAATTTATCTTGAATTTGCAAAACGCGGCACCTATGATGACCTAGATCCATCAGTAATAGGAAGAAGCACGTGCAGTACGGTGAAGATGAGGTTAGGGAGCACTTGAACCCGTACCATGATCGTATCCTGCGCGTCATAGAGCAAGGTTACGGCGAGTGGGCATCTATTAAGCGCAGCATGGCATCGAGTGGTACTGGGCCAGTTCTTTATCCGCGAACGACCGCTAACTACGTCTTTGACGCTATCGCTCGACATGCCATTACCGAATTCAGCGCCGATCCAGAGGTGCGGGTTTATCCCAATGCGCAGACAGTAAAGTTTTGCTTCAAGGACGTGGTGCTAGCGCGGTTTAAAAAGGGTGATGAAGACAATCTGGGCCGCAATCACCCCACGCAAGCAGTCTTGGATTTTGTGAGCATTCAAAGCGAGTTGCCGGGCATGCCTCCCAGTGCCACCAAGGTGGAAGTGCTCTATACCGCTAATGAGATCGAGGACCGCATAGAGCGGGTTGTTGTTGCAGCACGTGACGGTGACGAGCTACTTTGGCACTATGAGATTGTCCCGAGCGCTGCTAGCACGTCTAGTGGCACTGGCATTACTCTCCTCCCGATGTCGACCGAGCCGCTGGAAGATGCGGCTGAAGCAAGCGAAGAACTAGTTGTTCTCCGCAAACGTCAGCGCGACGACGACACCTCTGACAGCAACGGCAACTAAGTAATGGCTACTCGTGGAAACATGCTGCGATTGGCGCGGCATCTCAGAGGACTTACGCAGAAAAAGACGGCGGAGCAACTAGGCGTTGCTCAGGCCGTATATTCTCGTATGGAAAATGATCTTGTCGAAGTTGACGATGAGTGCATACGGGTGGCATCAAGAGCCTTTAATCTTCCTCCTGGATTCTTCGACCTTCCCGATACAGTTTACGGGCCGCCCGTCAGCATTCATCCAATGCTGCGGGGGCATTCTGACGTAACTGCGCGCGAATTGGATATGATCACGGCGGAGCTTAATGTTCGCATGTTTAATCTGCGCCGCTTCCTAGAGGAAATGGACTTAAAGCCATCTCTAGATCTCCGACGTTTCGACCTGGAGCAATACGGCTCTCCAGCGGATATTGCAGATCTACTGCGGCGACATTGGAAAATTCCATCGGGACCAATCAAGAACTTGACCAGACTTGTCGAGCGAGCAGGTGTAGTCGTCGGCTACTCTGACTTCGGGGGCGCCAATGTCAGTGGTGTTACTTTTGCCGTACCTGGACGACCACCCCTCGTCCTACTTAACCCTTCGCATCCGGCGGATCGTGTACGTTTCACCCTTGCTCACGAGCTAGGGCACTTGGTGATGCACAGGTTCCCGACACCAAACATGGAAGAGGAAGCAAACCTCTTTGCGTCGAACTTTCTTTTGCCGAGAAAAGAACTTAACGACGCGCTTCGCGGAAGGAAGGTTACATTGGCGCTCCTAGCAGCACTGAAACCCGAATGGCGAGTTTCAATGCAAGGCATTCTTTACGCCATCCAGCGCGAGAAGATTATTACGCCAAATCAGGCGCGATATCTCTGGCAGCAAATCGCCACGCGCGGATGGAAGACAAGAGAGCCGGCAAACCTTGACTTCGAGCATGATCGACCAACGGTTCTACCAACCATCATTAAAGCCATGCGGCAAGAGCTCGGATTAAGCGGCGAAGATATAAAGTCGATAACGCAAATCTATAGTGAAGAGTTTGACCGCTTTTATCCTTATGCCTCAGAGGCGGGCACTCGCCCGATGCTCCGAATTGTTAGCTAA
Protein sequences of DBSCAN-SWA_8 >NZ_CP051468|2595418:2603995|2600752_2601562_+|WP_011336894.1|DBSCAN-SWA MEKDVTTSDIQRLLAAAELYRGAIDGDAGPLTQAAAQAALEGEAVPWRTWPFRRQRIAAGQAVLARLGHAPGRIDGLLGPNTREALTAWASGPVRATVERLPLPGHGVADAQGAYPRQESVATFYGVAGGPDCTAGIVELPIPFRLAWDPGTSITSFRCHKLVVAPMARIFREAVAHYGAVQFERLRLNLFGGCFNHRPMRGGSALSMHAWGIAVDLDPERNPLRWGRDRASFAAPAYEPFWTIVEAAGATSLGRACNRDWMHFQFARL >NZ_CP051468|2595418:2603995|2602270_2602915_+|WP_011336892.1|DBSCAN-SWA MQYGEDEVREHLNPYHDRILRVIEQGYGEWASIKRSMASSGTGPVLYPRTTANYVFDAIARHAITEFSADPEVRVYPNAQTVKFCFKDVVLARFKKGDEDNLGRNHPTQAVLDFVSIQSELPGMPPSATKVEVLYTANEIEDRIERVVVAARDGDELLWHYEIVPSAASTSSGTGITLLPMSTEPLEDAAEASEELVVLRKRQRDDDTSDSNGN >NZ_CP051468|2595418:2603995|2596909_2597227_+|WP_011336900.1|DBSCAN-SWA MMRIRMVPLRRMYELTVFRVQGDTLTCNDMVYDFSGVEEGDVLPWDAMDNTWVTSNVTRVNGVLEFEVVFPHGYYGDLPLPTPGILEVEDQDIPIPPYLPPFAEG >NZ_CP051468|2595418:2603995|2598492_2599491_-|WP_011336896.1|DBSCAN-SWA MDLDDLYRLLRVSHVQAQGIVDTIRDPLLVLAGDLTVISANPAFYRTFATDRDSTIGRPLYDLGNGQWNIPELRMLLENVIPKSASIMDFEVQAGFPEVGLRTMLISAQRLIHPDNGQRVLLLSMVDATARRQMEAAKDILINELDHRIKNLMSVTQALTRQTAVEGRSAGEYRTALLGRFDALARSLEVASHKQSSQLRDLAVAVMEPYLEAASAVVIAEGPAVRLSPSQATPLGLVLHELGTNATKYGALSTSSGTVTLDWKVVSDGEGRKKVHLRWQERGGPKVAAPSKTGFGTRLIKFTTEFELRGSAELSYEPDGFVAKLSFPSQNP >NZ_CP051468|2595418:2603995|2595418_2596903_+|WP_168746008.1|DBSCAN-SWA MDFFSPPPTPPNSGNPGTFNDDADAFLGWFPAFVAELNALLPYLTGAGFSDGTAAAPGLVWKGDPDTGLFRPGSNAVGVTAGGVLRLTVSALALTSTVPLRAPLGTAAAPGISFEADPNTGIRSDGADVLHFVTGGVTRGFFSTTHFQSTLPAALPGGAAGAPGLTFAGDLDTGIFRAAADLLGIAAGGEERFRVGSGRAAALVPFSVPDGTQTFPGLTFNGEVGSNTGFFLAAENEIGVTCQGTERARFTPSGMQLQGLLSGTAVTQSDLDTTPGRLLKVGDYGLGGTARPIPGNDADQIGTTGFYQVTGATLNRPAGMSVGTLQHIQHGAARAVQIAYPQTASDTGRWCRHKDTSWGDWFLTYDQRNIVGAVSWASGFPRGGIIEKGETAGAEYVRFADGTQLCRLVQTGVPGPTTPQGSLYRTEWQTVTLPVEFVSGALNGHCVTGGCRGGSVISLLGRPGASNVAAYMLLAPTSYGATQTVDLLVTGRWR >NZ_CP051468|2595418:2603995|2602918_2603995_+|WP_160384111.1|DBSCAN-SWA MATRGNMLRLARHLRGLTQKKTAEQLGVAQAVYSRMENDLVEVDDECIRVASRAFNLPPGFFDLPDTVYGPPVSIHPMLRGHSDVTARELDMITAELNVRMFNLRRFLEEMDLKPSLDLRRFDLEQYGSPADIADLLRRHWKIPSGPIKNLTRLVERAGVVVGYSDFGGANVSGVTFAVPGRPPLVLLNPSHPADRVRFTLAHELGHLVMHRFPTPNMEEEANLFASNFLLPRKELNDALRGRKVTLALLAALKPEWRVSMQGILYAIQREKIITPNQARYLWQQIATRGWKTREPANLDFEHDRPTVLPTIIKAMRQELGLSGEDIKSITQIYSEEFDRFYPYASEAGTRPMLRIVS >NZ_CP051468|2595418:2603995|2597702_2598020_+|WP_011336898.1|DBSCAN-SWA MPEKGLIDTITALWGGAIATLIAAAMGRLMYHTGEVRARRRAFFGRELLWEIPALVAMAFIGEALSSYLDLDGRAAMGLVAMLAYLGPRGTTAMLERLWRGRSAG >NZ_CP051468|2595418:2603995|2597230_2597569_+|WP_011336899.1|DBSCAN-SWA MARKRAAAMTSATIDYSRLVKARDIRAQAEARARGPAEISVLQAMIVVGEEKWGQAMAIAEDAAYPWAMRAALRGATVLVRDSETTDTLAFLLGLSPEETDRLFIEAAEVRL >NZ_CP051468|2595418:2603995|2601574_2602057_+|WP_011336893.1|DBSCAN-SWA MSLIFILALALCAFAFILLIGWPLVAALARLAASTISASILAALAVPAAASTGSDLLTALTPSLLDLAGVILTALIGLATVRFQRWTGIQIEARHREALHSAIMTAARVAVARKLAPDAATQFVSSYVRNSVPDALQQLAPPADTLDVLVRSRLAQAAEF >NZ_CP051468|2595418:2603995|2598090_2598480_-|WP_011336897.1|DBSCAN-SWA MPNEPAEPRPKQGLLVLVVEDEFLTAIDLQSMLEDVGYAVLGPVGTVDDALRILDETRPDVVSLDLNLRGQSSIPVAEKLRELEVPFIVTSAYRTFDFPGGEILTGVTAIGKPVHPRMLIEALSQACGS >NZ_CP051468|2595418:2603995|2599892_2600699_+|WP_011336895.1|DBSCAN-SWA MKQVPPAAPVAPWLGGKKRLHPLILERIEAIPHRAYVEPFVGMGGIFLRRRFRPRLEVMNDRNGEIINLFRILQRHYPQLLEIMRFQICSRREFDRLRVTDPATLTDLERAARFLYLQRLSFGGKPNGSFGISPGNGPLFSLARLEPLLDAARTRLDGVVFECLDWAELIPRYDTAETLFYLDPPYFGGENDYGRGIFDRAQFARIAEILGGLKGAFLLSINDTPEIRALFGRFHLEPVRLSYSVAASGSTEAQELLVSNRERIATLL |
11 | Pseudomonas_phage(16.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|