Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
CP023313 | Caulobacter vibrioides strain CB2 chromosome, complete genome | 1 crisprs | csa3,WYL,DEDDh,DinG,cas3 | 1 | 0 | 4 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP023313_3 | 3061875-3061954 | Orphan |
NA
Consensus repeat of CP023313_3
|
1 spacers
spacers of CP023313_3
>3.1|3061898|34|CP023313|CRISPRCasFinder TGGGACAGGCGTCCGCCGCTTCACCCAGAACAAT |
CRISPR arrays and Neighbor proteins around CP023313_3
The CRISPR arrays of CP023313_3 >merge|CP023313|3|3061875-3061954|CRISPRCasFinder CCTCCCCCTCTGGGGGGAGGATTTGGGACAGGCGTCCGCCGCTTCACCCAGAACAATCCTCCCCCTCTGGGGGGAGGATT >CP023313|3|3|3061875-3061954|CRISPRCasFinder CCTCCCCCTCTGGGGGGAGGATT TGGGACAGGCGTCCGCCGCTTCACCCAGAACAAT CCTCCCCCTCTGGGGGGAGGATT
>CP023313.2|ATC26794.1|3057460_3059992_-|host-specificity-protein MVLHQAALAAQAGGVDGFIIGSELRALTTTRGPGGTYPAVTKLKTLAADVRAVVGPATKLGYAADWSEYFGHQPRDGSGHAVFHLDPLWADPNLDFVGVDWYPPVTDWREGEDHLDAMAGYDGPHDPAYLRAGLTGGADFDWFYAGGADRDAQVRTPITDGAHGEAWMFRPKDLLSWWSQPHHDRPGGVRSATPTAWVPRSKPIRLTEFGCPAVDKGSNSPNLFIDPKSSESFLPPYSSGERDDFGQRRYLEAVLAWLDEPSANPVSPLYGGPMIEAASAWCWDARPFPDFPARADVWSDGGAWLLGHWLTGRAGIAPLPELIQALGARAGVALDPGEAGGAVGGYVVDRPMRLRDALSPLTEAFALDPVERGDQVRMMSRTGRAVAAIEPDDLVLPEDGPAERETRTLDPAAEALRLRFLDAARDYQVGALIVRREAGEGTRDMDAPIVLSAAEAAAVARRMLDADAAARRTRIVRLSPSAALRFEAGDRVTLDAETWRIQRLDLDERPRANLAPVLRSQGVDAVIDWTPAPPREPASPPVLHVLDLPSDGALADDARPLVAAAAEPWRPLDVHAGAGVETLTVRARLAAPATLGLTLTDLAPASPHRLDRSARLDVRIEGASLSSAPLAAVLAGGNALAIRAPSGDWEVIAFQTAQLIAPDVWRLSGLLRGQRDGAASEGVIPAGAAVVLLDEAVVPISVAAFERGTTLMVRAAPAGGPPSGAGMTQISAVWTGRALRPLAPAHLRKRSIGGDLSVSWIRRARVGGDVWDGEVPLAEGVERYRVRVLDGAAVLREAEVETPGFTYTAAMRAADAPSSGARLEVVQGASLYGWGAPASTSLW >CP023313.2|ATC25681.1|3056424_3057372_-|J-domain-containing-protein MARDPYQELGVTRTASADEIRKAFRKLAKQYHPDANPGDKKAEERFKQVSAAFDIVGDAEKRKKFDLGQIDADGRETMRGFGGQPGNGPFNAGGFGQGGFHRSNEGPEIDLSDLFGGMFGGGGGGAGAGRGPFSGGAGGGFSAKGADVKARLDIDLEDAIKGGKKRVAFSDGRTIDVTIPTGAQEGQTLRLKGQGAPGRGGQGDALIELAIKPHPIYRREGEALVMDLPVSIPDAVLGGKVEAPTPDGNVMLAVPKGSNSGQTLRLKGRGMPDGKGKRGDLLARLVVTLPETVDQDLEKFAEAWRAQKPYTPKRK >CP023313.2|ATC25680.1|3055902_3056262_-|transcriptional-regulator MKFESLLSDEAVLAEMGQRLVAARLERRLTQAQLAQAAGVSKRTVERLEDGASAQLTNLVRCLRALDRLEGLERLLPETPANPLDLLKQAKTGRSRVRNARASGVAETGGAPWVWGDEK >CP023313.2|ATC25679.1|3054598_3055906_-|type-II-toxin-antitoxin-system-HipA-family-toxin MTTVAEVRLWGSRIGAVSLEDGAQTAVFAYEPGFIASGIQPAPLMMPLKAGVFSFPDLPPRSFHGLPGMLADALPDKYGHVLIDAWLATQGRSPESFNAVERLCYTGRRGMGALEFSPMAGPRRRVSSKIDIDALVTLASEVLTHRHDLRASFADADKADALRDILSVGTSAGGARAKAVIAWNPATNEVRSGQVEAGAGFGYWLLKFDGVSGNRDKELADPKGYGAVEHAYGQMAAAAGIDVAESRLLEEGGRRHFMSKRFDRLDGGGKLHMQSLAAIAHLDFNDPVANSYEQALFTMRRMGLPMAQLEEQFRRMVFNVLARNQDDHVKNIAFLMDRAGRWSLSPAFDITWSYNPDGEWTSRHQMSINGKRDGFDFADLEACAKTASISRGHVGRIFEEVRAAVMRWPTFADAAGVDERWRDQIGATLRLELRR >CP023313.2|ATC25678.1|3054384_3054567_-|hypothetical-protein MTPLNAKGRGAACGARGGAGQAQGAAGCLIGGLSTALIRRTAAPSEDRGAIGRRELSCSS >CP023313.2|ATC25677.1|3053887_3054397_-|hypothetical-protein MFVLMLCAALLVIVGCPETALGKGLRRWLVDWPAKVLAGLTPARLVLLLALLAVSTLVVVLFEVEGAILLGMALPEVAVWFMAFDVAAFIDLFAAIALAANGARLKGLGDRIKALPGQFGRGLVARVRRSGQGRQGGRRFRRPGSGATKSEDGDAGWSGAAQPGALAWA >CP023313.2|ATC25676.1|3053441_3053759_-|hypothetical-protein MKPIRALIVAAALAAVLPDAAFAQRRPDSLGADWRQQQDQARGGVQSGRLVPLSRVIEMIGRRVPGRVLDAGLEGDNYRVRWAAADGRRIDFIVDAQTGQILSGG >CP023313.2|ATC25675.1|3052748_3053426_-|DNA-binding-response-regulator MRILLVEDDPDLTRQLKLALADAGYAVDHAPDGEEAQYLGENEPYDAVILDLGLPKVDGVSVLERWRRGNVTTPVLILTARGAWSDKVAGFDAGADDYLAKPFHTEELLARLRALLRRSAGHAAPSLSCGALRLDPRAARASVNGEPLRLTSLEYRLLHYMIMHQGRVIGRTELVEHLYDQDFDRDSNTIEVFIGRLRKKLGADRIETVRGLGYRLAALPGEDAA >CP023313.2|ATC26793.1|3051315_3052707_-|sensor-histidine-kinase MIVPSHRSLVFRLVVAAGIWTLLVVAGAGVFVNTQFRDAQVRRFDQGLSILIDDMYANTSVEDGLVKAPFLTDIRATRAYSGRYWTIFDSTGDGSLRVIDRSRSLFDSDLMLPSAQLDRLIAAPGKTIYFDLRGPQDARLRAGGLLARLPGHATPVIFLVAEDRSPIDADTGRFVRITAFALLILSGGLILAVVVQVRFGLQPLFQLRREVAHVRRGKAERVDGRYPEELEPLAAELNALLAHNQEVVERQRTHVGNLAHALKTPLSVMLTEASQQPGQLAEVVTRQAQTMREQVDHHLRRARAAARSQTSGERTPVEPILDELAVTLERIFQDKADGRGVEIDWRCPEDLCFQGEKQDLMELAGNVMENAGKWCRGKIRVDAVRTGEARMTLTVDDDGPGLTPDERAQALKRGQRLDENAPGSGLGLSIVDELARAYGGSVQLGESPLGGLRVSLDLPCAES >CP023313.2|ATC25674.1|3049784_3051155_-|hypothetical-protein MTLVGSLDPARGIMAAIAAHKLQIIQTLVETAPDAALRSLELALAGAGSQGALASVRGLVEDETANRFVRNNVLAPIVPLCARRASSQVSFPAPVLSRLWRALKSVAAARVEEAAARCNPWDLEQGSPEVFDELCKLAAAGLRDPENAAFDSVRSLCDPEQLALCLQLSTLTRGCLPKLAEWVSRMSDERAAAARLAYRDACRISEDAGPLLLDILSAHLPDDWRIMRVISAVMDRPSDRYLASSEVSQFGERILTEIEETIALIESFSFADGEKAGRLAAQAAQKVQLQMVEIQQSVDIAKDGPWGKRLARQKQAMAKACELRMDQAEKELDKALPTRPISMLAKKGARGVAKLIEAPDEAMIRRAQSALAFVAELRACADKAGYGSSRTKVLEKLNARLDPYIEDVLHVARTGEGGDSAVAVKYLDIAASFIAYTRDDKTAEIVRRRAAAAIAA >CP023313.2|ATC25682.1|3061969_3062596_-|DUF2163-domain-containing-protein MRALPEGLDEARLCHVWILTRADGTRLGFTDHDQDLVVDGVTCRAGGGWSPGATENSVGYAPGQGAVLGVLDEAGIAEADLAAGLYDGARVEALRVDWSAPSRRVSLWTATIASVTREGEAFTATLAGPLAALERVAGRTFTRLCDARLGDVRCGITPAPGATCDKRWATCVGVFGNSVNFRGFPTSPGEDFLTLYPVEGERNDGGRR >CP023313.2|ATC25683.1|3062595_3063231_-|TIGR02217-family-protein MMSEFHEARLPARLAFGCTGGIERRTEVVSLASGHERRTSPWSQSRRRYLIATAPRPLDEIAELVAFFEARRGRLHGFRFRDPADFKSCAPSVQPAAGDQAIGTGDGVRKAFQLRKTYGAGGEAVARTITKPVAGTVTVAVAGVVLAPGAFAVDVTTGLITLNTAPPAGAAVTAGFAFDTPVRFDLDRLDVTLEGFAAARVTACSLVEVLV >CP023313.2|ATC25684.1|3063378_3063843_+|DUF805-domain-containing-protein MDWKTLFLSPEGRIGRQSFWVGWLILLGVNVVASWIPFVGTLIILASIYASVCIHSKRLHDMGQTGWWQVLPWVLGPVLVFGAAISVGVMPAIAALTNGEPEVSALTALVGLFVSIFIAFAIWLAFTLWVGCSLGQPRENKYGPPPINPTAVTV >CP023313.2|ATC25685.1|3063993_3064407_-|DUF4437-domain-containing-protein MRRAMAWTPMFLAMAVQAAPVTLAIDDARFAPLDPGNPNGPQMAVLRGDPATGPSDMLMRFSRGQGVPHVHSSDYRLVVLEGVMTHAQAGEGAGAKPLGPGSYWFQPGEQPHLDGCLSERCTMFISWSGKRDARRAP >CP023313.2|ATC25686.1|3064486_3064993_-|phage-tail-tape-measure-protein MSFDQEGLSAVPARAAEAAAALESLKAPAERAARSIDEAFARAGASLVRSLARAASDGEVSLAELARAMLGAAGAALKGGGLGEALSKTFAGARADGGPVLPGGAYLVGERGPELFRPASAGNIEPVGNSGVSVTVNVQGGEAQGLIRSDAQIAQALARAVSLGARGL >CP023313.2|ATC25687.1|3064989_3065169_-|phage-tail-assembly-chaperone MSWAAPLRLALSLGLPPEAFWRLSLKEWRALTQAPDAPCLNRAGLQDLLARYPDEETAP >CP023313.2|ATC25688.1|3065165_3065450_-|hypothetical-protein MLPPNPARGEVVVTLAGAPRRLCLTLGALARIEAALNLADWSQLPDRIATLSAAELSAILAALLEGGGEAPEIAARATVPEAAGALAAALAACA >CP023313.2|ATC25689.1|3065583_3066000_-|phage-major-tail-protein,-TP901-1-family MAAQAGKDMLLKISDGAPTPTFHTVAGLRARTISLNARTLDITDSDSTGRWRELLAGAGVKSVAVSGSGVFRDAVSDAQVRTSFFDQSARVWRLIIPDFGQLEGAFIVAALEYAGEHDGEAVFALSLASAGAVSFTAL >CP023313.2|ATC25690.1|3066015_3066414_-|DUF3168-domain-containing-protein MSDKPLIDALVATLKAAPAVTAIAGQRVYGAGPRLPTYPCVVVTRAEGRAVGGVGGEGIEHLLTLTCASRFGGPEEARALVAAVRAALHDARPSLVGRRLVNLRVPYADVFAGADRETTLGIVRVRAVTETL >CP023313.2|ATC25691.1|3066410_3066692_-|phage-gp6-like-head-tail-connector-protein MPQALTLAEARAFLRVSDASEDALLTLLIDAAEARVGAVAGVALTATSPAPLRLSVLILAAHAYEHRSSLGDTGEPSLALVEPWLTPYRKARL |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CP023313_1 | 1.1|1475194|19|CP023313|CRISPRCasFinder | 1475194-1475212 | 19 | CP023313.2 | 1093008-1093026 | 2 | 0.895 |
CP023313_1 | 1.1|1475194|19|CP023313|CRISPRCasFinder | 1475194-1475212 | 19 | CP023313.2 | 2669859-2669877 | 2 | 0.895 |
CP023313_1 | 1.1|1475194|19|CP023313|CRISPRCasFinder | 1475194-1475212 | 19 | CP023313.2 | 2820809-2820827 | 2 | 0.895 |
1. spacer 1.1|1475194|19|CP023313|CRISPRCasFinder matches to position: 1093008-1093026, mismatch: 2, identity: 0.895
ttcagcaggggcggcggcg CRISPR spacer ttcagcagggtcggcgccg Protospacer ********** ***** **
2. spacer 1.1|1475194|19|CP023313|CRISPRCasFinder matches to position: 2669859-2669877, mismatch: 2, identity: 0.895
ttcagcaggggcggcggcg CRISPR spacer ttcagcagtagcggcggcg Protospacer ******** .*********
3. spacer 1.1|1475194|19|CP023313|CRISPRCasFinder matches to position: 2820809-2820827, mismatch: 2, identity: 0.895
ttcagcaggggcggcggcg CRISPR spacer ttcggctggggcggcggcg Protospacer ***.** ************
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1850635 : 1859661
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP023313|1850635:1859661|DBSCAN-SWA CATGTCCGATCCTCGCTGGCTAACCGAAGGCGCACCGCACGTCTGGCGCCCGTACTGCCAGATGAAGACCGCCAGGCCTCCACTGCCGGTCGTCGCCACGCGCGGCGCCCGCCTGATCCTTGAGGACGGGCGTGAACTGGTCGACGGTCTGGCGTCCTGGTGGACGGCCTGTCACGGCTACAACCACCCCCATATTGCTGGCGCGCTGCGCAAGCAGATCGAGACCATGCCGCATGTCATGTTCGGTGGTCTGGCGCATGAACCCGCCTATCGCCTCGCCAAGCGGCTGGCCCGCCTGCTGCCGGGTGATCTGGACCATGTCTTCTTCGCCGAAAGCGGCTCGGTCGCGGTCGAGATCGCGATGAAGATGGCGCTGCAGCACCAGATCAATCGGGGCGTGGGCGGTCGTACACGGTTCCTGGCTTTCCGGGGCGGCTACCACGGCGACACCCTGGCGACGATGACGGTCTGCGATCCCGAGGAAGGGATGCACAGCCTGTTCGCCGGCGTGATGCCCGCTCAGGTGATCGCCGACCTGCCCCGGGACCCTGCGTCCGAAGCGGCGCTCGATGCTCTGCTCGCCGCGCGCGGACACGAGATCGCCGCCATGCTGGTGGAGCCTCTGATCCAGGGCGCAGGCGGGATGCTTCCGCACCCGCCTGAGGTGCTGCGGACCCTTCGCCGCCTGGCCGACAAGCACGGCGTGCTGCTGATCTTCGATGAGATCTTCACAGGCTTTGGCCGGACGGGATCCTTGTTCGCCATGCAGGCGGCGGGCGTCGAGCCGGATATCGTCACCCTGTCCAAGGCGCTGACCGGCGGAACCTTGCCGCTGTCGGCGGCGGTCGCCCGCCGGCATGTGTTCGAGGCCTTCTGGTCGGATGATCCGGGCGCGGCCCTGATGCACGGCCCCACCTACATGGCCAATCCGCTGGCCTGCGCCGCCGCCAACGCCTCGCTGGATCTGTTCGAGGATGGCGCCTGGGCGCGCAACGTCGCGCGCGTGTCGGCGGCGTTGGCCGAAGGGCTTGAGCCCTGTCGCGCCGGCGAGGGCGTGGTCGATGTCCGCACCTTGGGCGCGATCGGCGTCGTCGAGTTCGAGGCGCCCGTGCCTGTATCGGATCTCTGTGCGCGGTTCGCCGCCCTGGGCGTCTGGATCCGGCCGATGGGCAAGGTGGTCTATCTGACGCCGGCCTTCACGACGCCAGACGAGGATCTTTCGCGGCTCACCTCGGCGGTGCGACAGGTTGTCGGCGTCGATTGACCGGGCTCGCCGGGTCTGGTCCAAGCGTGGCCATGTCCGGCAAGTCATCCCCGCTCTCGGTCCTGGAGTATCTGGCGATCTTCGCGATCATCATGACCTGGGGGATCAACAACGCCGCCGCCAAGGTCGCGACCGCCTATTTGCCGCCCATGACCGTCGGCGGTCTGCGTTTCCTGGCCGCGCTGGTCTTTCTGTTTCCGTTCATTCGTCCGCCGTTCCCCGAGCCCAAGAAACTGGCGGCGATCGTGCTGCTGACCGGCCCGATCCACTTCGGTCTCGTCTATGTCGGGTTCGGCATGGGCCAGTCGCTGAGCCCCCTGGTGGTGGCCAGCCAACTCTGGATCCCGTTCACGGCCCTGGTCGCCTGGAAACTGCTCGGCGAGACCATGCGGCTGCCGGCGGTTCTGGGCCTGATCGTCGCCTTCGTCGGGGTCGCCTGGATGACCCTGGATCCCCACACCTCCGGTGATCTGCCTGCGATTGTCCTGATCGTGCTGGCCAGCGCCTGCTGGGCGGTGGCGACGATTCTGGTCCGTATGACGCCGGGCGCGAAACCCCTGAAGGTGCAGGCGGTCACGGCGCTGTTCGCCGCGCCGAGCCTTCTGGCGATGTCGTTCGCGTTCGAGACCCAGGTCGTAGAACGGATCATGACCGCGCCGCCGATCGCCTGGGCGTGCGTGATCTTCGCGGGGGTGGTCTCAACGATCGGCGCCAGCGCGCTGCTCTTCTGGCTGGTCCAGCGGCGCGAGGCGGGCCGGGTGACGCCGTACTTTCTGCTGACGCCTCTGGTGTCCTGCACGCTCGGCGTCGCTTTCCTGGGCGACAAGCTTACACCGCAACTGCTGATCGGCGGGGCGGCGACGATGGTCGGCGTGGCGCTGGTGGCCATGACGGAAAAGCGCGCGCGGGCCGAAGAGGCGCTGGCGGAAACGGCCTAGAGCCCTTTCCCATGGCGGGCGGACCGCCATGGGAGGAGGCTCATGATGGCTACTTGATCACCCCGCAAGCCACCCGCGCGCCGGCGCCGCCGATGGGCTGAGTCTTGTGATCGTCGGGGTTGGCGTGAACGACGATCGACGAGCCGTCCGCATCCAGCAGGGCGGGGCGGCCGCCGGCGCCCTTCAGCGAAACGAGCGGCGAGTAGATTTCAGCCGTCGCCGCGCCATCTGCTGCGGCGAAGATGTTCGGCAAGTCGCCGCTATCGTTGGCGTCGGGGTTCAGCAGGCCGTGGACGGTGGTCGCGGCGGTGTGGACGTGCGCGCCCGCCGACTTGAAGTCCGGGGTCCCGCAGTCGCCCTTCTCGTGGAAATGCACCGCGTGCCAGCCGGGCGTCAGGCCCTTGAGTTCGAGCTTGAGGAGCACGCCGCGCGGGGCTTCGGTGACGGTCACCGCGCCGGCGTCCTTGCCGTCGCCCGCCTTGACGACGGCGGTCGCGCTGGTCTGCGCCAGGGCCGGGGAGGCGGCGAGGGCGGCGGCGAGGCCGAGCGCGGCGGCGGCGGAGAGACGGATCATGCGTTTTTCCTTCGGATTCAAGCGAAACGGTGGGGAGAAGCCCCCCGAAAGGTGGCGTCTTCCCTGTCCGAATGCTAACAGTTTTAGACAGTTCCGTGTTCGATTCTGACGGCGAAAAATTCCCTTGAGCGACGATCACAACACCATCCCCGCTGACGGTTCGCGCGGGGATATCGCCCCGATCAATATCGAGGACGAACTCCGCCGCTCGTATCTGGATTACGCGATGAGCGTGATCGTCAGCCGCGCCCTGCCGGACGCGCGCGACGGTCTTAAGCCCGTGCACCGCCGGGTGCTGTTCTCGATGCATGAGCAAGGGCAGACGCCCGAGCGGCCTTACGTCAAGTCGGCCCGCGTGGTCGGTGACGTGATGGGTAAGTATCACCCGCACGGCGACGCCTCGATCTACTTCACCCTGGTGCGGATGACCCAGTCGTTCTCGATGGGTCTGGTTTTGATCGACGGCCAGGGCAATTTCGGCTCGGTCGACGGCGATATGCCCGCGGCCATGCGCTACACCGAGTGCCGGATGGCCCCGCCGGCCATGGCCCTGCTGGCTGATCTCGACAAGGACACGGTCGATTTCGCCGACAACTACGATGGCAAGGAGCAGGAACCGACCGTCCTGCCGTCGCGGATTCCCAATCTACTCGTCAACGGCGCGGGCGGCATCGCCGTCGGCATGGCCACCAATATCCCGCCCCATAACCTGGGCGAGGTCATCGACGCCTGTCTGCTGCTGATCGATCAGCCGGACGTGACGACCGACCAACTTCTTGATCTCGTGCCCGGCCCCGATTTCCCGACCGGTGGCGAGATCATCGGTCGCGCGGGTCCGCGCCAGGCGCTGCTGACCGGCCGTGGCTCGGTGATCATGCGCGGCGTCGCTAGCGTTGAAGAGCTGCGCGCGGGGCGCGAAGCGATCATCGTCACCGAGATCCCGTATCAGGTGAACAAGGCCAATCTGGTCGAGCACATCGCCGAACTGGTGCGCGACAAGAAGATCGAGGGTGTCGCCGACATCCGCGACGAGTCCAACCGCGACGGCATGCGCATCGTAGTCGAGCTGAAGCGCGACGCCTCGGGCGAGGTGATCCTGAACCAGCTCTATCGCTTCACCGCGCTGCAGAGCTCGTTCGGCGTCAACATGCTGGCCCTGAACCGGGGGCGCCCGGAGCAGATGGGCCTGCACCAGCTGATCCAGCTGTTCGTCGACTTCCGCGAGGAAGTCGTCGTCCGCCGGACCAAGTTCGAGCTGGGCAAGGCCCGTGATCGCGGCCACGTGCTGGTCGGTCTGACGATCGCCGTCGCCAATATCGACGAGTTCATCCACATCATCCGCTCGTCCAAGGATCCGACCGAGGCGCGCGAGCGCCTGGTGGCCAAGTCCTGGCCGGCCGGTGACATGCTGCCCCTGGTCGAGCTGATCGCCGACCCGCGCACCGTGCAGGAAGACGGCGGCCTCATCCGCCTGACCGACGAGCAGGCCCGCGCCATCCTGGCCCTGACCCTCTCGCGCCTGACCGGCCTTGGCCGTGACGAGATCGGTAACGAGGCGGCGGCCCTGGCCGACGCCATCCGCGGCTATCTGGAGCTGCTGTCCGACCGCGCCAACATCATGGCCGTGGTCCGCGAGGAGCTGGTCGAGGTCCGCGAGAAGTTCGCCATTCCGCGCCGCTGCCAGATCGTCGACGGCGACGCCGACATGGAAGACGAAGACCTGATCGTCCGCGAGGACATGGTCATCACCGTCACCATGGGCGGCTACGTCAAGCGCACGCCGCTGGCCAACTACCGCACCCAGCACCGGGGCGGGAAGGGCAAGTCGGGCATGGCGACCAAGAACGAGGACGCCGTCACCCGCGTGTTCTCGGCCTCGACCCATGCGCCGCTGCTGTTCTTCACCTCGGGCGGCAAGGTCTACAAGATGAAGGTGTGGCGTCTGCCGCTGGGCGTCGCCAATTCGCGCGGCAAGGCGTTCGTCAATCTGCTGCCGATCGAGCCGGGCGAGACCATCACCTCGATCCTGGCCCTGCCGGAGGACGAGGCCACCTGGGGCGGCCTGGATGTGATGTTCGCCACCCGCTCGGGCAGCGTCCGCCGCAACAAGCTCAGCGACTTCGTGGACGTGCGCCGCAACGGCAAGATCGCGATGAAACTGGACGAAGGGGATGGCATTGTCGGCGTGGCGGTGTGCAACGCCGACCAGGACGTGCTGCTGACCACGGCGGCCGGCCGCTGCATCCGCTTCTCGGTCGATGAAGTGCGTGTGTTCGCCAGCCGCGACTCGACCGGCGTGCGTGGCGTGAAGCTGCTGGACGGCGACCAGGTCATCTCGATGGCCGTGCTGCGCAGCGTGGACGCCACACCTGCTGAGCGCGCCGCCTACCTCAAGCATCAGCGCGCCATGCTGCGCGCCGCCGACGGCGAGGAGGGCGAAGACACCGCGACCGCCGCGGATGACGGCGATGAGGAGGTCGGCGAAGCGTCCCTGACGCCCGAGCGCATCGCCGAGCTGGGCGCTGCCGAGGAAATCCTTCTGACCGTCTCGTCGGAAGGCTTCGGCAAGCGGACCAGCGCCTATGATTTCCGTCGCACGGGTCGTGGCGGTCAGGGCCTTGCGGCGCAGGATCTGAGCAAGCGAGGCGGCCGTCTGGTGGGCTCGTTCCCCATCGATGAGAGCGACCAGATCCTTCTCGTCACCGACCAGGGACAGCTGATCCGGGTGCCAGTCTCGCAAATTCGTGTTGCGGCGCGGAATACTCAGGGCGTAACCATCTTCCGCACCGCGCAGGACGAGCACGTCGTCAGCGTCGAGCGCCTCGCCGATTCCGGCGGGGACGACAACCAGGGCGAGGACAGCGGGGCGGACGAGACCCCGTAACCCTCGGAGCCAGAATAGGGGGTTGCATGCGGGTCGGGCTTTATCCGGGGACCTTCGATCCGGTCACCAACGGTCACCTCGACATCATCGGGCGGGCCGTGAAGCTGGTCGACAAGCTGGTGATCGGCGTCGCGATCAATATTGGCAAGGGCCCGCTGTTCTCTCTGGAGGAGCGGGTTGAGATCCTCGAGCGCGAGACCGCGCATCTCAAGAAGATCGCCGAGATCGAGGTGCGGCCCTTCGACAGCCTGCTGATGCACTTCGCCCGTGACGTGAACGCCCAGATGATCGTGCGCGGCCTGCGGGCCGTGGCCGACTTTGAGTACGAATTCCAGATGACGGCGATGAACCAGCAGCTGGATCGTGAGATCGAGACAGTCTTCCTGATGGCCGATCCGCGCCACCAGGCGATTGCTTCACGGCTGGTGAAAGAAATCGCGACCCTCGGCGGCGACATCGGCAAGTTCGTGCCGCCCGGCGTGGCGGAGCAGCTGCTCGCCAAGGTCCGTAGGGGCTAGTTTGTCCGGCCTGGTGGTTCGTCGCGCGCGTCAGACCGACGGGAGCGCGCTTTGCGCCATCCTTAACGATACCTACGAGAGCACTTGGCTTCCTCAGCTCAAGCCCGACGCCGCGCGGGCTTTTCCGGGTGAGAACCGGCCAGCGGCCTATGTCGCCGAGCGCGGGACCCTTTTCTGGGTGGCCGAGACGGACGACGAGGTTGTGGGCTTCGTCGATTGGGACGCCGATTTCGTCAATGCGCTTCACGTGCGCGCAAGTCATGCCCGGAGGGGTATTGGCGCTCGCCTGATGGATATGGCTGAAGATGAGATTGCCCGCGCTGGGTACACCGCCGCGCGCCTTGAGACCGATACCTTCAATACACGTTCTCGCGCGTTTTACGCCAAGCGTGGCTATCGCGAGGTCGATCAGTATCCGGATGAGGAATGGAGAAGCGGTCTCACGACTTTGCTCCTAGTGAAGACATTTGATGAAGCGTGACCCTTGGCGACAAAACAACGAGGCGCTCGCCCCGAAGCCGCCGAATTCCTGCGCTTGAACGGTCGCCGCACTTGTCGGAAAGGCCGCGCCGTTCTACGCCCGGCTCCGACTGTTCTGTTCGGGGTACTCCATGAAGCTCGCCATGACCGCCGCCGCGCTGGCGCTGGCCGCCACCACGGTTTCCTTTGGCGCGGCCTCGGCCCAGGCCGTGTCTGACTGGCGCACGCCCGATCCCAACAATGTCCTGGTGGTCGAGACCAACAAGGGCCGCATCATCGCCGAGCTCTATCCCGAGGTCGCGCCGAACCACGTCGAGCGCGTGCGCGGCTTGGCCAAGAGCGGCTTCTATGACGGCCTGACCTTCTTCCGCGTGATCTCGGATTTCATGGCCCAGACGGGCGACCCGAAAAACACCGGGGAGGGCGGATCGGACCTCCCGGACCTGACTGCGGAGTTCAACTTCCGTCGCGGCGCCGACATGCCGCTGGGCGCCGGTTTCAAGGTGGGCAGCAACGAGTCGGGCTGGGTCTACGCGCTGCCCGTCACCAGCCAGCCCTCCGCCCTGGCCGTCATGACCGCCGACCGCAAGGTGTCGACCTGGGGCAACTTCTGCCCCGGCGTGGTGGGCATGGCGCGGGCGGGTGACCCGAACAGCGCCAACAGCCAGTTCTACTTCATGCGCGCTGCGAACGCGGGCCTCGACAAGACCTACACGGCCTTCGGCCGCGTGCTGCAGGGCGTGGATGTGGTCAAGGCCATCAAGACCGGCGAGCCCGTGCCGGACCCGCAGGACAAGATGCTGAGCGTGAAGGTTCTGGCCGACATCCCGGCTGACAAGCGCCCGACGATCCAGGTCATGGACACGCGCGGCCCCGCGTTCAAGGCTCTGGTGACCAAGCGCCAGGGCGAGCTGGGCTCCAGCTTCACCAATTGTGACGTCGACGTCCCCGCCCAGGTGAAATGATGATGCTCTCGCGCGCGCTTCTGACCGGTCTTTGTCTTGCCGCCTGCGCCAGTGTCGCAACGGCGGCGCCGAAGACGGCGAAGCCGGGCGAGGCTGATTGGCGCACGCCTGCGCCTGAATCGATCATGGTGATCGACACCAACAAAGGCCGCGTGCTGGTCGAACTGGTTCCAGAGGTCGCCCCCAACCACGTGGCGCGCCTGCAGGACCTCTCCCGCGCGGGTGTCTATGACGGCCGTACGTTCTTCCGCGTCATCGACCGTTTCATGGCTCAGACCGGTGACCCGACGAACACGGGCGAGGGCGGTTCGGATCGCCCGAACCTGAAGGCCGAGTTCACGTTCCGGCGCGCCGCCGACACCGGCTTTGTACCGATGGCCGCGCCGGCCGGGCTTGAGGTCGGCTACATCAAGTCGCTGCCCGTCGTCAGCCAGAACTGGAGCTGGAGCGACGTCACCAGCGACAAGAAGGTCGCCGCCTGGGCCACCTATTGCCCCGGCGTGATCGGCATGGCGCGAAGTGAGGATAACAACTCCGCCAACAGCCAGTTCTTTCTGATGCGCCAGCCTTATCCGTCGCTCGACAAGCGCTATACGGCGTTTGGTCGGGTGATCAGCGGCCTGGATGCGGTCCGCGCGATCAAGACGGGCGAACCGGTTCCCGCGCCGCAGGACATGATGCAGAAGGTCCGTCTGCTCTCGGACATCCTCGAGAGCGAGCGGCCCAAGGTGCGCGTGATCGACCCCAAGGGTCCCTGGTTCGCCGCCGAGACCAAGCGCCTGCGCGCGGAGAAGGGCGCGGACTTCTCGGTCTGCGATATCGCGCTCCCGGTCGAGGTCCGCTGAGGCTAAACCGATTTCCGGCCGCAACTTGGTTCACAGGCGGGCGTGAACCGCGCTAGAAGCGCGGATAACACTCGCCATCCAAGGGAAGACCATGTCGGCCGACCTCGAAAACACCCTGATCCTGACGCTCGAGAGCGGTCCGGTCACCATCAAGCTGCGTCCCGACCTGGCGCCGGGGCACGTGGCCCGCATCAAGGAACTGGTGCGCGAAGGCTTCTACGACGGCGTCGTCTTCCACCGCGTGATCCCGGGCTTCATGGCCCAGGGCGGCGACCCGAGCGGCACCGGCCGTGGCGGTTCGGACAAGCCGGATCTGAAGGCCGAGTTCAACGACGAATCGCACGTGCGCGGCGTCTGCTCGATGGCCCGCACGCCGAACCCGGACTCGGCCAACAGCCAGTTCTTCATCGTCTTCGACGACGCCACCTTCCTCGACAAGCAATACACTGTCTGGGGCCAGGTGACCGAAGGCATGGAACATGTTGACGCCCTGCCGAAGGGCGAGCCGCCGCGCGCGCCGGGCAAGATCGTCAGCGCCAAGATCGCCGCTGACGCATAG
Protein sequences of DBSCAN-SWA_1 >CP023313|1850635:1859661|1853534_1856294_+|ATC24592.1|DBSCAN-SWA MSDDHNTIPADGSRGDIAPINIEDELRRSYLDYAMSVIVSRALPDARDGLKPVHRRVLFSMHEQGQTPERPYVKSARVVGDVMGKYHPHGDASIYFTLVRMTQSFSMGLVLIDGQGNFGSVDGDMPAAMRYTECRMAPPAMALLADLDKDTVDFADNYDGKEQEPTVLPSRIPNLLVNGAGGIAVGMATNIPPHNLGEVIDACLLLIDQPDVTTDQLLDLVPGPDFPTGGEIIGRAGPRQALLTGRGSVIMRGVASVEELRAGREAIIVTEIPYQVNKANLVEHIAELVRDKKIEGVADIRDESNRDGMRIVVELKRDASGEVILNQLYRFTALQSSFGVNMLALNRGRPEQMGLHQLIQLFVDFREEVVVRRTKFELGKARDRGHVLVGLTIAVANIDEFIHIIRSSKDPTEARERLVAKSWPAGDMLPLVELIADPRTVQEDGGLIRLTDEQARAILALTLSRLTGLGRDEIGNEAAALADAIRGYLELLSDRANIMAVVREELVEVREKFAIPRRCQIVDGDADMEDEDLIVREDMVITVTMGGYVKRTPLANYRTQHRGGKGKSGMATKNEDAVTRVFSASTHAPLLFFTSGGKVYKMKVWRLPLGVANSRGKAFVNLLPIEPGETITSILALPEDEATWGGLDVMFATRSGSVRRNKLSDFVDVRRNGKIAMKLDEGDGIVGVAVCNADQDVLLTTAAGRCIRFSVDEVRVFASRDSTGVRGVKLLDGDQVISMAVLRSVDATPAERAAYLKHQRAMLRAADGEEGEDTATAADDGDEEVGEASLTPERIAELGAAEEILLTVSSEGFGKRTSAYDFRRTGRGGQGLAAQDLSKRGGRLVGSFPIDESDQILLVTDQGQLIRVPVSQIRVAARNTQGVTIFRTAQDEHVVSVERLADSGGDDNQGEDSGADETP >CP023313|1850635:1859661|1850635_1851898_+|ATC24589.1|DBSCAN-SWA MSDPRWLTEGAPHVWRPYCQMKTARPPLPVVATRGARLILEDGRELVDGLASWWTACHGYNHPHIAGALRKQIETMPHVMFGGLAHEPAYRLAKRLARLLPGDLDHVFFAESGSVAVEIAMKMALQHQINRGVGGRTRFLAFRGGYHGDTLATMTVCDPEEGMHSLFAGVMPAQVIADLPRDPASEAALDALLAARGHEIAAMLVEPLIQGAGGMLPHPPEVLRTLRRLADKHGVLLIFDEIFTGFGRTGSLFAMQAAGVEPDIVTLSKALTGGTLPLSAAVARRHVFEAFWSDDPGAALMHGPTYMANPLACAAANASLDLFEDGAWARNVARVSAALAEGLEPCRAGEGVVDVRTLGAIGVVEFEAPVPVSDLCARFAALGVWIRPMGKVVYLTPAFTTPDEDLSRLTSAVRQVVGVD >CP023313|1850635:1859661|1856813_1857293_+|ATC24594.1|DBSCAN-SWA MSGLVVRRARQTDGSALCAILNDTYESTWLPQLKPDAARAFPGENRPAAYVAERGTLFWVAETDDEVVGFVDWDADFVNALHVRASHARRGIGARLMDMAEDEIARAGYTAARLETDTFNTRSRAFYAKRGYREVDQYPDEEWRSGLTTLLLVKTFDEA >CP023313|1850635:1859661|1859193_1859661_+|ATC24597.1|DBSCAN-SWA MSADLENTLILTLESGPVTIKLRPDLAPGHVARIKELVREGFYDGVVFHRVIPGFMAQGGDPSGTGRGGSDKPDLKAEFNDESHVRGVCSMARTPNPDSANSQFFIVFDDATFLDKQYTVWGQVTEGMEHVDALPKGEPPRAPGKIVSAKIAADA >CP023313|1850635:1859661|1852885_1853410_-|ATC24591.1|DBSCAN-SWA MIRLSAAAALGLAAALAASPALAQTSATAVVKAGDGKDAGAVTVTEAPRGVLLKLELKGLTPGWHAVHFHEKGDCGTPDFKSAGAHVHTAATTVHGLLNPDANDSGDLPNIFAAADGAATAEIYSPLVSLKGAGGRPALLDADGSSIVVHANPDDHKTQPIGGAGARVACGVIK >CP023313|1850635:1859661|1856320_1856812_+|ATC24593.1|DBSCAN-SWA MRVGLYPGTFDPVTNGHLDIIGRAVKLVDKLVIGVAINIGKGPLFSLEERVEILERETAHLKKIAEIEVRPFDSLLMHFARDVNAQMIVRGLRAVADFEYEFQMTAMNQQLDREIETVFLMADPRHQAIASRLVKEIATLGGDIGKFVPPGVAEQLLAKVRRG >CP023313|1850635:1859661|1858253_1859102_+|ATC24596.1|DBSCAN-SWA MMMLSRALLTGLCLAACASVATAAPKTAKPGEADWRTPAPESIMVIDTNKGRVLVELVPEVAPNHVARLQDLSRAGVYDGRTFFRVIDRFMAQTGDPTNTGEGGSDRPNLKAEFTFRRAADTGFVPMAAPAGLEVGYIKSLPVVSQNWSWSDVTSDKKVAAWATYCPGVIGMARSEDNNSANSQFFLMRQPYPSLDKRYTAFGRVISGLDAVRAIKTGEPVPAPQDMMQKVRLLSDILESERPKVRVIDPKGPWFAAETKRLRAEKGADFSVCDIALPVEVR >CP023313|1850635:1859661|1851930_1852836_+|ATC24590.1|DBSCAN-SWA MSGKSSPLSVLEYLAIFAIIMTWGINNAAAKVATAYLPPMTVGGLRFLAALVFLFPFIRPPFPEPKKLAAIVLLTGPIHFGLVYVGFGMGQSLSPLVVASQLWIPFTALVAWKLLGETMRLPAVLGLIVAFVGVAWMTLDPHTSGDLPAIVLIVLASACWAVATILVRMTPGAKPLKVQAVTALFAAPSLLAMSFAFETQVVERIMTAPPIAWACVIFAGVVSTIGASALLFWLVQRREAGRVTPYFLLTPLVSCTLGVAFLGDKLTPQLLIGGAATMVGVALVAMTEKRARAEEALAETA >CP023313|1850635:1859661|1857423_1858257_+|ATC24595.1|DBSCAN-SWA MKLAMTAAALALAATTVSFGAASAQAVSDWRTPDPNNVLVVETNKGRIIAELYPEVAPNHVERVRGLAKSGFYDGLTFFRVISDFMAQTGDPKNTGEGGSDLPDLTAEFNFRRGADMPLGAGFKVGSNESGWVYALPVTSQPSALAVMTADRKVSTWGNFCPGVVGMARAGDPNSANSQFYFMRAANAGLDKTYTAFGRVLQGVDVVKAIKTGEPVPDPQDKMLSVKVLADIPADKRPTIQVMDTRGPAFKALVTKRQGELGSSFTNCDVDVPAQVK |
9 | uncultured_Mediterranean_phage(57.14%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
3043036 : 3068053
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >CP023313|3043036:3068053|DBSCAN-SWA CTTACGGCGCGATCTTCAGCGGGATGAAGACCGGACGGTTCTGGCGGACGATCTTGACCAGCACGCTGGCGCGGCCGGCCTTCTTGGCCGTTTCCACCGCAGAGGTCACGTCGGCGACGCTGGTGACCGGCGCGCCGTTGATGTTGGACAGGACATCACCCTTGGCCAGGCCCTTCTCGCCCGCGTCGCTGTCGCCCTTCACGCCGATGATCAGCAGGCCCTTGATGTCCGGCTCGATCTTGTAGGTCTGGCGCGAAGCCGCATCGATCGGCCCCAGGGTCAGGCCCAGGGCGTCGACCTTCTGGCTGGCCGGCTTGTCCGGCGTCGGCGCGGCGCCGTCCTGGCCCTGCTCGTCATCCGTCACCGCCAGGCTGCTCTCCGAGGGACGCGTGCCCGACTTGACGTCGACGATGCGCGGCTTGCCGTCACGGATGATCGAGACCTTGATCGTCTCGCCCGGACGGGCCTTGGAGACCTCGCGGGTCAGTTCACTGCTGTCGCTGATCTTGACGCCGTTGACGGCCACCAGGATGTCGTCCGGCAGCAGGCCGGCCTTGGCGGCGGGACCGCCCGGAACGACAGAGGCGACGATCGCCCCCTTCACATCGCTCATGCCCAGGGCTTCAGCCATCTCGGCGTTAAAGGCCATGATGCTCACGCCGATATAGCCGCGCACGACCTTGCCGTTTTCGATCAGCTGCTTGGCGACGCCCTCGGCCACCTCGGCGGGAATGGCGAAGCCGATGCCGACCGAACCGCCCGACGGCGAGTAGATGGCGCTGTTGACGCCGATCACCCGGCCATAGATGTCGAAGCTGGGACCGCCCGAATTGCCGCGGTTGATCGGCGCGTCGATCTGGATGTACGGCACGAACGACGAGGTGGTGTCGTTCAGATTGCGATCATAGGCCGAGATGATGCCGGCCGTGGCCGTGCCGCCCAGACCGAACGGGTTGCCGATGGTGATCACCCAGTCGCCGACGCGCGGCTTGGCCTGGTTCTCGAAGTTGACGAAGGTGAAGTCCTTACCCTTGGCCTTGGGGTCGACCACCTTGATCACGGCCAGGTCGGTGCTCTCGTCGCGGCCGACCAGCGTGGCCTTCAGCTCGCGGCCGTCCTTAAGCACCACCTGGATGTCGTCGGCGTCGGCGACGACGTGGTTGTTGGTGACGATGTAGCCGTCGGCCGAGATGAAGAAGCCCGAGCCCGCCGATTGCTGCTTGGGCGTCGCCGGCGTGTCGCCGTCGTCTTCCTGTTGCTGGCCCTGCTGCCCCGGCTGACCGGGGCGGCGCTGACCGCGCGGCACGATGTCGAAGCCTTCCAGGCCAGGGATGCGCAGCGACGGCGCCTGGGCCTTGGAGGTCACATTGATCTGGACGACGGCGGGCGAGACCTTCTCGAAGATGTCGGCGAACGACATCGGCGCGCCCGGCGGCGGCGCGAAGGTCGGCGCGCTGGCGGTGGACACGCGGACGACCGGGGCCTGGGCCGCGTCGGCCGTCCCCATCCGCATGCCCATTCCGGCCAGGGCCGCGCAGGCCACGCTCGCGCCGGCGACGGCGCCCACAATGAAACCCGACTTCCTAGCGGTCATTCTCGTCTTTCCTCACCCACGCCCTGCGGACGTGGTCCCTTAACGTCATCCGCGCGCCCGGCGCGCGGCCCCGATCTCTTTGAGAACCTAGTCGCGGTATTTAAAGGTGCAATGGTTCTCGAATTATCTTTCCATCATCCGGAAGGCGCTTCTTGCGGCGCTATTTCGGCGCAATCGTGTCAATCGCCTTCGAGAAGGCGCCGAACGCGCGCCTCCTCGTCCTGCGACAGCGCCGGTTCAGGCGTCTCGCGGCGACGCAACAGGCCCGCCATCAGAACACCGCCAACCAGCAACACCACGACCGGCGCCAGCCAAAGCACGGCGTTTCCCATGGAAAAACGAGGCTTTAGCAGGACGAACTCGCCATAGCGCTCGACCAGGAAGTCGCGGATCTGCGCGTCGCTGCTCCCCGCCTTGACCTGTTCGCGGACAATTTGCCGCAGATCGCCGGCCAGCTGGGCCTCGGAGTCGTCGATCGACTCGTTCTGACAGACCAGACAGCGCACTTCCTTGAACAGCGTCCGGGCGCGAGCCTCCTGCGCCTGGTCGGGCAGACGTTCGGACGGATCGGACGCGCCGGCCGTCAGGGCGACCGCCGCCGCAGCGATCAACAGAGCCCGCAACCGCTTCACGACACCACCTCCCCGGTCCGTCGTCCCACGCCCAGGCGCACGCGGCGGTCCGACAGCGAGACGATCCCGCCGATGGCCATCAGCAGCGGACCCAGGAAGATCAACCGCACCCACGGGTTCACATAGGCCCGCACCAGCCAGGCCGGCTTGCCGCCCTCGCCCGCCCGGCGCTCGCCCATCACCACATAGATGTCGTCCAGGCCCTTGGCGCAGATGGCGACCTCAGACGTGGTCTGGGCCCCCGTCGGATAGAAGCGGCGCTCAGGCTGGGCGCGGCACACCTCGGCGCCGGCCTTGTTGGTGACGGTGATGATGCCGCGCTCGGCCAGATAGTTCGGCCCCTCGATCGTCACCACATCCGTCAGGGTTACGGTATAGGCGCCCAGCGGCTGGCTGCCGTTCAGGCTCAGGGCCTGAGCCGCCTCGACGCGCCAGGCGGTCTCGAACGAGGCCCCCAGCACGAACACGCCCAGACCCGCATGGGCCAGGGTCGTGCCCCAGGCGCCGCGCGGCAGGCCGCGTGCCCGGCGAAGGCTCTCGGCGAAGGGCGCGCGCGCCAGCTTCAGGCGCTCGGCGATCTCGAGCAGCGCGCCGCCGATCAGCCAGAAGCCGACCACCAAGCCGCCGCTGGCCAGGGCCTTGCGCGGCTGCACCACGGCATAGGCGATCAAGCCCAGCAGGGCCGCCAGGGCCAGCACGACCCACAGCTTGCGCGCCACGCCCTTGGCGTCGCCGCGCTTCCAGGCCAGCAGCGGCCCGGCGGGCAGCACGGCGAAGGCCAGGATCATCAGCGGCGTGAAGGTCAGGTTGAAGAACGGCGCGCCCACCGACACGGCCTCGCCGTCCATGGCCTCGCGGATCAGCGGATAGAGGGTGCCCAAGAGCACCACCGCCGTCGCCGTCGACAGCAGGATGTTGTTGAGCACGATGGCGCTCTCGCGGCTGATGGGGCGGAACTGGCCGCCGCGGTTCAGGCTGGGCGCGCGCAGGCCAAACAGCAGGAACCCTGCGCCGGCGGCCACGCCCATCATGATCAGCAGCAGGACGCCGCGCGTCGGATCCACGGCGAAGGCATGGACGGAGGTCAGCACGCCCGAGCGCACCAGGAAGGCGCCCAGCATCGAGAAGGTGTAGGCCGCCAGGGCCAGGAACGCCGTCCAGCCCGGCAGCGCCCCGCGCCTTTCTGTGACGATGGCCGAGTGCAGCAGAGCAGCCCCGATCAGCCAGGGCATGAAGCTGGCGTTCTCGACCGGGTCCCAGAACCACCAGCCGCCCCAGCCCAGCTCGTAATAGGCCCAGAAGGCGCCCAGCGTGATGCCGACGGTGAGCATGCTCCAGGCCGCCAGGGTCCAGGGGCGAATCCAGCGCGCCCAGGCCGCGTCGATCCGGCCTTCGATCAGGGCCGCCATCGACAGCGAATAGACTACCGAGAACCCGACATAGCCGATGTAGAGAAACGGCGGGTGGAAGGCCAAGGCCCAGTCCTGCAGCAGCGGATTGAGCGACTTGCCCTCAATGGGGGCCTCAAGCAGCCGCGCCAGCGGGTTCGACGCCAACACAGTGTAGGCCAGGAACATGACGCCCAGCGCGCCCTGCACGGCGATGGCGTAGGCGCGCAGGCGCGGCGGCAGGCTGTCGCCGAACACGGCCATGGCCGCGCCAAAGCCGGTCAGCACCACGCACCACAGCAGCATCGAGCCCTCGTGGCTGCCCCAGGCGCCGGCCACCTTGTAGAGCATCGGCTTGTCGGTGTGGGAGTTGGTGGCGACGTTGGTCACCGAGAAGTCCGACGTCACGAAGGCGTAGATCAGCGCCGCGAACGAGACCAGCAGGGCCACGAACGTCGCAATCGCCGCGCCCTGGCCCGCGCCCGCCAGCACCGGCGAGCGCCGCGCGCCGCCCACGGCCGACAGGCCGGTCTGGGCGACCGACAGCATCAGAGAGAGAATGAGCGCGAAAGCGCCGAGTTCGACGATCATCGAAATACTCCCCTTCTCCCCTTGCGGGAGAAGGTGGCGCGCAGCGCCGGATGAGGGGTCGCGCGAGACTCGCCGGACAGGGCTTTCAACCCCTCATCCGACCCGCTGCGCGGGCCACCTTCTCCCGCAAGGGGAGAAGGAATACGACGAGCGATCATGGCTTCTGGCTCCCATAAGCCGGCGCATCCGCGCCCTCGCCGCGCCACTCGCCCTGCTCCTTCAGCGCCTTGGACACCTCGCGCGGCATGTAGCGCTCGTCGTGCTTGGCCAACACCAGCTTGGCCTCGAAGACGCCCGCTGGATTGAAGCTGCCCTCGGCGACGATGCCCTGGCCCTCGCGGAACAGGTCCGGAAGGTCGCCGTGATAGACAACCTTGGCCGTCGCCTTCTGGTCGGCGATCACGAACTCGACATTGCCGTCGGGATACTTGACCACGCTGCCGTGCTGGACGAGGCCGCCCAGCTGCACCTTGCGCCCGGCCGACACCTTGGCTTCCTGGGCCTGGGCCGGGGTGTAGAACAGCGAGATGCTGTCGCGCAGGCCATACAGCGCCAGGCCCACGGCCAGGGCCAGCACCGGCGCGATGGCCAGCAGGATGGTCAGGCGACGGCGCGCCTTGCGGGATTGGGGCCAGAAACTCATGGGTTCGTCGTCTTGGCTTGAGCGGGCGTATCGGCGGCCTGACGCAGAGCCGACATCACCTTGGGCTGATCCTTGAAGAGCGCCGAGGCCGTCGCCAGGGCCGCGTCGCGCTTGGCCGTCTCGCCCAGCACGGCGTAGGCGCGCACAAGGCGCACCCAGCCGTCGGGGTCGTCAGGCGACTGCTTCAGACGCTGGGCCAGGCCGTCGACCATGCCGGCGATCATGCCCTGGACCTCGGGATTGGCGCTCTCGGCCGCCGGCTGGGGCCCCGGCAGCTTGCCGGACGCCTCGACCTCGGCGATCTCGGCGGCCAGCGACTCGCGGCCCTGCGCGTTCGGCGCCAGGCTGGCCAGCAGGCCGCGCCAGTCCGACAGACCCGCGGCGACCTCGCCCTCGGCGATGCGGGCGCGGCCCAGATAGTAGCGGGCGCGGACGTCGGCGGCGTCGACCTTCAGCGCCTGGCCAAAGGCGAGCTTGGCGTCGGGACCCACCTGCCCCTCGGCCTGGATGACGAAGGTCTCGCCCAAGAGGCTCCAGATGTCGGCCCGCTTGGGGGCGATACGGACGGCCTTGCGCAGCGCCTGTTCCGCGCCGGCCATGTCGCCGGCGGCCGCGCGGGCCTTAGCCATGAACACCAGCGGCTCGGGATCGTTGGGCCGTTCGGCGGCCACGCCTTCGAGCACGGCCGCGATCTTGGCCGGTTCCAGCGTCGCAGGATCAGCCGCGCGCCAGGCGGCCACGCGCTTGGCGAAGGGCTGATCCGGCAGTCCGGGATAGCCGAGCACCAGATAGAGCCCGACGGCCGACGCACAGGTCAGGCCCACAGCGGCGACGGCGGCCTTGCGCGGACCGGCGCCGTCACGGCTCCAGCTTTCGGCCTGGTCGGCGGCGGCCAGCAGACCGCGGCCGGCTTCGGCGCGGGCGGCCTTCAGCTCGTCTTCGGCCAAGAGGCCGTCCAGGGCCAGGCGCTCGACCTCGGCCAGACGGCGACGGTGCGGCTCCAGCCGGGTCTCGGCGTCGTCAACGGCTCGGGCCTGCGCGGCCCCGCGCAGCACCAGCCCCGCCGTCACCACCGACAGCCCCGCCGCAGCGATCCAGAAAGCGATCATGGCCGGGTGTTAGCTTGTTCCGGCCGTCGCCGGGAGAGGAAAAAGCGCTTAAGGAGGCCCGCGCCCCGTCATTCTGGGGCGCCTGTCGAGGAAGGCTCCATGACGGCGATCCTGTTGGTCCCCGGGCTTCTCTGCTCGGAGGAGATCTTCGCCCCGCAACTGCCCGTGCTTTGGCCGCGCGGCCCTGTGACGATCGCCAACACCCTGGCCGGTGACAGCCTGGCCGAAATCGCCACCCGTATTCTCGCCGACGCCCCGCCGACCTTCGCCCTCGCCGGCCTTTCGATGGGCGGGTACCTGGCCTTCGAGATCGTTCGCCAGGCGCCGGAGCGCGTGTCGCGGCTGGCGCTGATCTGCACCTCGGCCAGGCCCGACACGCCCGAACAGGCCGCAGGTCGGCGCAAGATGGTCGCCCAGGCGCACAAGGTCGGCTTCGAGCGGTTCTGCGCGCTGGGGGCCGACGCCCTCACCCACCCCAGCCGCAAGGGTGATCCCGCGCTCAACGCGCTGAGCGCGCGCATGGGCCTTGCGGTGGGCCTGGAGGGCTTCGCGCGCCAGACCGAGGCCGTGATCGGGCGGCCCGACAGCCGCCCCCTGCTGGCGTCCATCGCCGTTCCAACCGCGATCATCGTCGGCGACGCCGATCCGCTGACGCCTAAAGCGCTCTCCGAAGAGATGGCCGCGATGATCCCAGACGCGCGCCTCGTCATCGCTGAGGACTGCGGCCACGTGATCACCCACGAACGCCCGGACGTGGTGAACCCGGCGATGGCGGCCTGGCTGGGCGTATAGCTAAGCCGCGATGGCGGCCGCCGCGCGGCGGCGGACGATCTCGGCGGTCTTGTCGTCGCGGGTGTAGGCGATGAAGCTGGCGGCGATGTCGAGATACTTCACGGCCACGGCGGAATCGCCGCCCTCGCCGGTCCGCGCCACGTGCAGCACGTCCTCGATATAGGGGTCCAGACGGGCGTTCAACTTTTCCAGGACCTTGGTCCGTGAACTGCCGTAGCCGGCCTTGTCGGCGCAGGCCCGCAGCTCGGCCACGAAGGCCAGCGCGGACTGGGCGCGACGGATCATCGCCTCATCCGGCGCCTCGATCAGCTTGGCGACGCCCCGCGCGCCCTTCTTGGCCAGCATCGAGATCGGCCGGGTGGGCAGGGCCTTGTCCAGTTCCTTCTCGGCCTGATCCATGCGCAGCTCGCAGGCCTTGGCCATCGCCTGCTTCTGGCGCGCCAGGCGCTTGCCCCACGGGCCGTCCTTGGCGATGTCGACCGACTGCTGGATCTCGACCATCTGCAGCTGGACCTTCTGGGCGGCCTGGGCGGCCAGGCGGCCGGCCTTCTCGCCGTCGGCGAAGCTGAAGGACTCGATCAGGGCGATCGTCTCCTCGATCTCGGTGAGGATGCGCTCGCCGAACTGCGACACCTCGGACGAGGCCAGATACCGATCGGACGGTCGGTCCATGACCGCCGAGATCACGCGCATGATGCGCCAGTCGTCGGGCAGGTGCGCCGACAGAATATCGAGCAGCAGCGGGCCGGCGTCCTCGCTGATCCGACAGGCGTCGCGATAGGCCAGGCGCGCGGCGGCGGCGCGCTCGTCCGACATGCGGCTGACCCATTCCGCAAGCTTGGGCAGGCATCCGCGCGTCAGGGTGGACAGCTGCAGGCACAGCGCCAGTTGCTCGGGGTCGCACAGCGAGCGGACGCTGTCGAACGCGGCGTTCTCCGGGTCGCGCAGACCCGCCGCGGCCAGCTTGCACAGCTCGTCGAACACCTCGGGCGAGCCCTGCTCCAGGTCCCAGGGGTTGCAGCGCGCGGCGGCTTCCTCGACCCGGGCGGCGGCGACGGATTTCAGCGCGCGCCACAGGCGCGACAGGACCGGCGCGGGGAAGGAGACCTGGCTGGAGGCGCGCCTGGCGCAAAGCGGCACGATCGGGGCCAGGACGTTGTTACGCACGAAGCGATTGGCGGTCTCGTCCTCGACAAGACCGCGCACGGACGCCAACGCTCCCTGGCTCCCGGCGCCGGCAAGGGCCAACTCAAGGCTGCGCAGAGCCGCATCCGGCGCGGTTTCCACCAGCGTCTGGATGATCTGCAGTTTATGCGCAGCGATGGCGGCCATGATCCCCCGTGCCGGATCGAGACTTCCGACAAGCGTCAACCATGTTCGAAACGAGGTTAAGGATTTCCGAAACGACGCAAAAAACCTGCGCGTCCGCGCCTGGGGTCTGCGCCCGAAAAGTGTTGTCCACCGCAGGTTTAGGTAACCGATAACGTGTTACGCCAGGACCCTGCGCACCTCATACCGCCACAACCAGGAGTCAGGATTCGGCGCAGGGCAGATCAAGGCTCACCCGCAAGCCGCCCAGCGGGGATTCACCCAGTTGCACCGAGCCGCCATAGGCGCGGGCCAGCTCGTCGACGATCGACAGGCCCAGCCCCGATCCCGGCGCGTTCTCGTCGAGCCGTTGCCCACGCTTGAGCGCCTGGGCCCGCTCGTCGGGCGTCAAGCCCGGACCATCGTCGTCGACGGTCAGGGTCATGCGCGCCTCGCCCGTTCGGACGGCGTCGACGCGGATCTTGCCACGACACCACTTGCCGGCGTTCTCCATCACATTGCCGGCCAGCTCCATCAGGTCCTGCTTTTCGCCCTGGAAGCACAGGTCCTCAGGGCAGCGCCAATCGATCTCCACGCCTCGCCCGTCAGCCTTGTCCTGGAAGATCCGCTCCAGGGTCACGGCCAGCTCGTCCAGGATCGGCTCCACCGGCGTGCGCTCGCCGCTGGTCTGCGAGCGGGCGGCGGCGCGGGCGCGGCGCAGGTGGTGATCGACCTGCTCGCGCATGGTCTGGGCCTGGCGGGTCACCACCTCGGCCAGTTGCCCCGGCTGCTGGCTGGCCTCGGTCAGCATCACCGACAGCGGCGTCTTCAGCGCATGGGCCAGGTTGCCGACGTGGGTGCGCTGGCGCTCGACGACCTCCTGGTTGTGAGCCAGCAGCGCGTTCAGCTCGGCGGCTAGGGGCTCCAGCTCCTCGGGATAGCGCCCGTCCACGCGCTCGGCCTTGCCGCGGCGGACATGGGCCACCTCGCGGCGCAACTGAAACAGCGGCTGCAGGCCAAAGCGCACCTGCACCACCACCGCCAGGATCAGGCCGCCGCTCAGGATCAGCAGGGCGAAGGCGGTGATGCGCACAAAGCGGCCAGTGTCCGCGTCGATCGGCGAGCGGTCCTCGGCCACAAGGAAAATCACGGGCGTGGCATGGCCGGGCAAGCGGGCTAAGAGCCCCCCGGCACGCAGCCGCGCATCCTGCGGGCCCCGCAAATCGAAATAGATAGTCTTACCCGGGGCGGCGATCAGCCGGTCCAACTGAGCAGACGGGAGCATCAGGTCGCTGTCAAACAGCGAGCGCGAGCGGTCGATTACTCTTAGCGAGCCGTCGCCGGTCGAGTCGAAGATTGTCCAGTAGCGGCCCGAGTAGGCTCGAGTTGCGCGAATGTCGGTCAGGAACGGCGCCTTAACCAGCCCATCCTCGACGCTGGTGTTAGCGTACATGTCATCGATCAGGATCGATAACCCCTGGTCGAAGCGACGCACCTGAGCGTCGCGGAACTGGGTGTTCACGAAAACGCCGGCTCCGGCGACGACCAGCAGAGTCCAGATGCCAGCCGCCACCACCAGTCGAAAGACGAGCGAGCGATGGCTCGGGACGATCATGCTCCAGAGACGCGCCAGGACAGGGACGCGCGTGGCCTTGCTCACGCCGCGTCTTCGCCCGGCAAGGCCGCGAGGCGATAGCCAAGGCCGCGCACGGTCTCGATCCGGTCCGCGCCCAGCTTCTTGCGCAGGCGGCCGATGAACACCTCGATGGTGTTGCTGTCGCGATCGAAGTCCTGGTCGTACAGGTGCTCGACCAGCTCGGTGCGGCCGATCACCCGGCCCTGGTGCATGATCATGTAGTGCAGCAGGCGGTATTCCAGCGAGGTCAGGCGCAGCGGCTCGCCATTGACGCTGGCGCGGGCCGCGCGCGGGTCCAGGCGCAGGGCCCCGCACGACAGCGACGGCGCGGCGTGACCGGCCGAGCGGCGCAGCAGGGCGCGCAGGCGGGCCAGAAGCTCCTCGGTATGGAACGGCTTGGCCAGATAGTCGTCGGCGCCCGCGTCGAAGCCGGCGACCTTGTCGCTCCAGGCGCCGCGGGCGGTCAGGATCAGCACCGGCGTCGTGACATTGCCGCGCCGCCAGCGCTCCAGCACCGACACGCCGTCCACCTTGGGCAGGCCAAGGTCCAGGATCACCGCGTCATAGGGCTCGTTCTCGCCCAGATACTGGGCTTCCTCGCCGTCGGGCGCGTGGTCGACGGCATAGCCGGCGTCGGCCAGGGCCAGCTTCAGCTGACGCGTCAGGTCGGGATCGTCCTCGACGAGCAGGATGCGCATGACGTGAAGTCCTTCCTAGCCGCCGCTCAGGATCTGGCCGGTCTGGGCGTCGACGATGAAGTCGATGCGTCGGCCGTCGGCCGCGGCCCAGCGAACCCGGTAGTTGTCGCCTTCGAGGCCGGCGTCCAGCACCCGGCCCGGCACCCGGCGACCGATCATCTCGATCACCCGCGACAGCGGCACCAGCCGTCCCGACTGCACGCCGCCCCGCGCCTGGTCCTGCTGCTGACGCCAGTCGGCGCCCAGCGAGTCCGGACGGCGTTGCGCGAAGGCCGCATCCGGAAGGACGGCGGCCAGCGCGGCGGCGACGATGAGGGCGCGGATCGGTTTCATGCTGACCGGTCTAAAGGACCCCGGCTGAATGGAGGCTGAACGTTCCGGTTATGAACGGCGCGCGGCGTTCTGTCACGGGGCGGAACGTCCGGCGGCGAAAGCGCCCGCCGTCCGCCCGCTTTCGCGCCTCAGGCCCAGGCCAGCGCGCCCGGCTGCGCGGCGCCGCTCCAGCCCGCATCGCCGTCCTCGCTTTTGGTCGCCCCGCTCCCCGGACGCCGGAAGCGGCGTCCGCCCTGGCGGCCCTGTCCCGAGCGACGGACGCGCGCGACGAGCCCGCGACCGAACTGGCCCGGCAGCGCCTTGATCCGGTCACCCAGTCCTTTCAGACGGGCGCCGTTGGCGGCCAGTGCGATCGCCGCGAACAGGTCGATGAAGGCCGCAACATCGAAGGCCATGAACCAGACCGCGACTTCAGGCAGGGCCATGCCGAGCAGGATCGCGCCCTCGACTTCGAATAGGACGACCACAAGGGTGGAGACCGCCAGCAAGGCCAGCAGCAACACCAGGCGCGCCGGGGTCAGTCCCGCCAACACCTTCGCCGGCCAGTCGACCAGCCACCGCCTCAGCCCCTTGCCCAGCGCCGTCTCCGGGCAACCGACAATGACCAGCAAGGCCGCGCACAGCATCAAGACGAACATGAAAGCTCCCGTCGGCCGATCGCGCCACGGTCTTCAGATGGGGCGGCGGTCCGGCGGATCAACGCTGTTGAGAGGCCGCCGATCAGACATCCAGCAGCGCCTTGAGCTTGGCCCGCACCTCCGCGAGCACCGCAGGCGGCGCCACGGCCTTTCGCGTTGAGCGGCGTCACCAGACCTCGTCGATGCTTTTCAATCCCCCGCCTACCGCCTGAGCTCCAGCCGCAGCGTCGCGCCGATCTGATCCCGCCAGCGCTCGTCCACGCCGGCCGCGTCGGCGAAGGTCGGCCAGCGCATCACGGCCGCCCGCACCTCTTCAAAGATCCGGCCGACATGACCGCGACTGATCGAGGCGGTTTTCGCGCAGGCCTCCAGATCGGCGAAGTCGAACCCGTCCCGCTTGCCGTTGATCGACATCTGGTGGCGCGAGGTCCATTCGCCGTCCGGATTGTAGCTCCAGGTGATGTCGAAGGCCGGTGACAGGCTCCAGCGTCCTGCGCGGTCCATCAGGAAGGCGATGTTCTTGACGTGGTCGTCCTGGTTTCGCGCCAACACGTTGAAGACCATGCGCCGGAACTGCTCTTCCAACTGCGCCATCGGCAGGCCCATCCGCCGCATCGTGAACAGGGCCTGTTCATAGGAATTGGCGACGGGATCGTTGAAATCCAGATGGGCGATGGCGGCCAGCGACTGCATGTGCAGCTTGCCGCCGCCGTCCAGCCGGTCGAAGCGCTTGCTCATGAAGTGGCGACGACCGCCCTCCTCCAGCAAGCGACTCTCGGCGACGTCGATTCCCGCCGCCGCCGCCATCTGTCCATAGGCGTGCTCGACCGCGCCATAGCCCTTGGGGTCGGCCAGTTCCTTGTCCCGGTTGCCGGAGACGCCGTCGAACTTCAGCAGCCAGTAGCCGAAACCCGCGCCCGCCTCGACCTGACCGGAGCGCACTTCGTTGGTTGCCGGGTTCCAGGCGATCACCGCCTTGGCCCGCGCCCCGCCCGCCGAGGTGCCGACGCTCAAGATGTCGCGCAGGGCGTCGGCCTTGTCCGCATCGGCGAACGAGGCGCGCAGGTCGTGGCGATGGGTCAGCACCTCCGAAGCCAGGGTCACCAGGGCGTCGATGTCGATCTTGCTGGAGACCCGCCGACGCGGCCCCGCCATCGGCGAGAACTCCAGCGCCCCCATGCCCCGGCGGCCGGTGTAGCAAAGGCGCTCGACGGCGTTGAAGCTCTCGGGGCTGCGACCTTGCGTGGCCAGCCAGGCGTCGATCAGCACATGGCCGTACTTGTCCGGCAGGGCGTCGGCCAGCATGCCGGGCAGGCCATGGAAGCTGCGCGGCGGCAGATCGGGAAAGCTGAACACGCCGGCCTTCAGCGGCATCATCAGCGGGGCGGGCTGGATTCCGCTGGCGATAAAGCCCGGCTCATAGGCGAAGACCGCCGTCTGGGCGCCGTCTTCCAGCGACACCGCGCCGATCCGGCTGCCCCACAGGCGAACCTCGGCGACGGTGGTCATTTTTCATCGCCCCACACCCAAGGCGCGCCGCCGGTCTCGGCGACGCCGGACGCGCGGGCGTTGCGCACTCTTGAGCGGCCGGTCTTGGCCTGTTTCAGCAGGTCCAGCGGATTGGCGGGCGTCTCGGGCAGCAGGCGCTCCAGCCCCTCCAGCCGGTCCAGGGCCCGCAGGCAGCGCACCAGATTGGTCAGCTGGGCCGAGGCGCCGTCTTCCAGACGCTCGACGGTGCGCTTGGACACGCCGGCGGCCTGCGCCAGCTGAGCCTGGGTCAGCCGCCGCTCCAGCCGCGCCGCCACCAGACGCTGACCCATCTCGGCCAACACCGCCTCATCGGACAGGAGAGATTCAAATTTCATAAATCGCCTCTTGTGGCGATATATATCAACTTAAGGCCCCATATCGTCAAGTCTGTCGATATAGGAGAGCTCTATTTTTAGCTCTAAAATTCGCCACTTCTGGCGATTTAAGGCGCAAAATAGCTAAAAGTCGCCAGAGATGACGACTTTTAAAACCCGCCCCTACTTCCGCTTCGGCGTGTAGGGCTTCTGGGCGCGCCAAGCTTCGGCGAACTTTTCGAGGTCCTGGTCGACGGTCTCGGGCAAGGTGACCACCAGGCGAGCCAGCAGGTCGCCGCGCTTGCCCTTGCCGTCGGGCATGCCCCGGCCCTTCAGGCGCAGGGTCTGGCCGCTGTTGCTGCCCTTGGGCACGGCCAGCATGACATTGCCGTCGGGCGTCGGGGCTTCGACCTTGCCGCCCAGCACCGCGTCGGGGATCGAGACCGGCAGGTCCATGACCAGCGCCTCGCCCTCGCGGCGATAGATCGGGTGGGGCTTGATGGCCAGCTCGATCAGGGCGTCACCCTGGCCGCCGCGGCCCGGCGCGCCTTGGCCTTTCAGGCGCAGGGTCTGACCCTCCTGGGCGCCGGTGGGGATGGTGACGTCGATCGTGCGGCCGTCCGAGAAGGCCACGCGCTTCTTGCCGCCCTTGATGGCGTCTTCCAGGTCGATGTCCAGCCGCGCCTTGACGTCGGCGCCCTTGGCTGAGAACCCGCCGCCCGCGCCGCCGCTGAACGGCCCCCGCCCCGCGCCGGCGCCGCCGCCGCCGCCGCCGAACATGCCGCCGAACAGATCCGACAGGTCGATCTCGGGTCCTTCGTTCGAGCGATGGAAGCCGCCCTGGCCAAAACCGCCGGCGTTGAACGGGCCGTTGCCCGGCTGGCCACCGAAGCCGCGCATGGTCTCACGGCCGTCGGCGTCGATCTGGCCCAGGTCGAATTTCTTGCGCTTTTCGGCGTCGCCGACGATGTCGAACGCGGCGCTGACCTGCTTGAAGCGCTCTTCGGCCTTCTTGTCGCCCGGGTTGGCGTCCGGGTGATACTGCTTGGCGAGCTTGCGGAACGCCTTGCGGATCTCGTCCGCGCTGGCGGTGCGGGTAACGCCGAGTTCCTGATACGGGTCGCGCGCCAAGGAAGGTCCTCTGAGTTAAGCGGGGGGCTTGGGTAACCCCCTCCATCTAGGACGTCCAGCGTGGCGCGCAAGCCCTTATGTGGCATTTTTACCACAGACTTGTGCTGGCGGGCGCGCCCCAGCCGTACAGGGACGCGCCCTGGACTACCTCCAGCCGGGCGCCGGACGAGGGCGCGTCGGCCGCGCGCATGGCGGCGGTGTAGGTGAAACCGGGCGTTTCCACCTCGGCCTCACGCAGTACCGCGGCGCCGTCCAGCACCCGCACGCGATAGCGCTCGACGCCCTCGGCCAGCGGGACCTCGCCGTCCCAGACATCGCCGCCGACGCGGGCGCGGCGGATCCACGAGACCGACAGATCGCCGCCGATCGAGCGTTTGCGCAGATGCGCCGGGGCCAGCGGACGCAGCGCCCGACCCGTCCAGACGGCGCTGATCTGCGTCATCCCCGCCCCCGACGGCGGACCGCCCGCCGGGGCGGCGCGGACCATCAGGGTCGTTCCCCGCTCGAAGGCCGCGACGCTGATCGGGACCACCGCCTCGTCCAGCAGCACCACCGCCGCCCCGGCCGGGATCACGCCTTCGCTCGCCGCTCCGTCGCGCTGGCCGCGCAGCAGGCCCGACAGCCGCCAGACGTCCGGCGCGATCAGTTGGGCGGTCTGGAAGGCGATCACCTCCCAGTCGCCGGACGGCGCGCGGATCGCCAGGGCGTTCCCCCCCGCCAGCACGGCGGCCAGCGGCGCGGAGGACAGGCTCGCCCCCTCGATCCGCACGTCCAGGCGTGCGGACCGGTCCAGCCGATGCGGCGAGGCGGGCGCAAGGTCGGTCAGGGTGAGCCCCAGCGTGGCGGGCGCGGCCAACCGGGCGCGGACCGTCAGGGTCTCGACGCCGGCGCCTGCATGCACGTCCAGCGGCCGCCAGGGCTCGGCGGCGGCGGCGACCAGCGGCCGCGCGTCGTCGGCCAGCGCGCCGTCCGACGGCAGGTCCAACACGTGCAGCACCGGCGGCGAGGCCGGCTCGCGCGGCGGCGCGGGCGTCCAGTCGATCACCGCGTCCACACCCTGGCTGCGCAGAACCGGCGCGAGGTTGGCGCGCGGGCGCTCGTCGAGATCCAGGCGCTGGATTCGCCAGGTCTCCGCATCCAGCGTCACCCGGTCGCCGGCCTCGAACCGCAGCGCCGCCGAGGGCGACAGCCGCACGATGCGCGTGCGCCGCGCGGCCGCGTCGGCGTCCAGCATGCGGCGCGCCACGGCGGCGGCCTCGGCCGCCGACAGCACGATCGGCGCATCCATGTCACGCGTCCCCTCCCCCGCCTCGCGGCGGACGATCAAGGCGCCGACCTGATAGTCGCGGGCGGCGTCCAGGAAGCGCAGGCGCAGGGCCTCGGCGGCGGGGTCGAGGGTCCGGGTCTCGCGCTCGGCCGGGCCGTCCTCGGGCAAGACCAGATCGTCGGGTTCGATCGCCGCCACCGCCCGCCCCGTTCGCGACATCATCCGCACCTGGTCGCCGCGCTCGACCGGATCCAGGGCGAAGGCCTCGGTCAGGGGCGACAGCGCATCGCGCAGGCGCATGGGCCGATCAACGACATAGCCGCCCACGGCCCCGCCCGCCTCGCCGGGGTCGAGCGCCACACCGGCCCGCGCGCCCAGGGCCTGGATCAGCTCTGGCAGCGGCGCGATGCCGGCCCGACCGGTCAGCCAGTGACCCAAGAGCCAGGCACCGCCGTCGCTCCAGACATCGGCGCGGGCGGGAAAGTCCGGAAACGGCCGCGCGTCCCAGCACCAGGCGCTGGCCGCCTCGATCATCGGCCCGCCATACAGCGGCGAGACCGGATTGGCGCTCGGCTCATCCAGCCAGGCCAGCACCGCCTCCAGATAGCGCCGCTGGCCAAAGTCGTCGCGCTCGCCGCTCGAATAGGGCGGCAGGAAGCTTTCCGAGCTCTTGGGGTCGATGAAGAGGTTCGGCGAGTTGCTCCCCTTGTCCACGGCCGGGCAGCCGAACTCGGTCAGGCGGATCGGCTTGGACCTTGGAACCCAGGCCGTGGGCGTCGCCGACCGCACGCCGCCCGGCCGGTCGTGGTGCGGCTGGCTCCACCACGACAGCAGGTCCTTGGGCCGGAACATCCAGGCCTCGCCATGGGCGCCGTCGGTGATCGGGGTGCGGACCTGGGCGTCGCGGTCGGCGCCGCCGGCGTAGAACCAGTCGAAGTCGGCCCCGCCTGTCAGGCCGGCGCGCAGATAGGCCGGATCGTGCGGACCGTCATAGCCGGCCATGGCGTCCAGATGGTCCTCGCCTTCGCGCCAGTCGGTGACCGGCGGGTACCAGTCGACGCCGACGAAATCGAGATTGGGATCGGCCCACAGCGGGTCGAGGTGAAACACCGCATGGCCCGAACCGTCGCGGGGCTGGTGGCCGAAATACTCGCTCCAGTCGGCGGCGTAGCCGAGCTTGGTGGCGGGCCCCACCACCGCGCGGACATCGGCGGCCAGGGTCTTCAGCTTGGTCACGGCCGGATAGGTCCCGCCCGGTCCCCGCGTGGTGGTCAGGGCGCGCAGCTCCGAGCCGATGATAAAGCCGTCGACCCCGCCCGCCTGGGCCGCCAAGGCCGCCTGGTGCAGCACCATCCGGCGCAGGCCCCATTCGCCGTCGAAGAAGGCGGCCACCTGTGCGCTCGCCGCCGCCGTCCCCTCGGCCGAGGGCGGATCGCAGGCGATGCGGCCGCGCCACGGATAGGCCGGCTGGCCCGGCGGGGTGTCCATCAGGATGAACGGATACAGCGTCACCTTCAGGCCTCGCCGCTTCAGCTCGGCGATGGCCTGCACCACCACCGCGTCGGCGGGCGTGCCGCCATAGGCCGGCGCGCCGTCCTTCAGCGAAATCAGATGCGCGCCGCTGCGATCCACGCCCGCGACGCTCCAGGTCAGGGGCAAGGTGTCCTTGGCCGCGCCCTCGACGCCCGGCTTGATCTGGCACTGGCCGCAGCGCAGATCCGTGCCGAACCAGGCGACGATCAGGGTCACGTGATCGACGTTGGGCAACTGGGCCTGCAGTTGGTCCAGCGCCACCATCAGGTCGGGACGGCCTTCGGAATTGTGGACGCTCTCGCTGGCGGTGCGGGTCAGGCCCTCGCGGCGCAGCACCGCTTGGGTCGCATAGACAAACTCGCCCGCGCCGGGGATCAGGCAAACGCCCTTCAGCCGTTCCTCCAGCCCCACCGTCCCCGGCGGGCGCGGCCGGCGGAAGACCTCGAACGACAGCTGCGGCGGGCGATTGCCGAACGCCTCCAGCGGCAGATCCTCGAACACCACATAGGCCACGCCGCGATAGGCCGGCGCCTGGCCCTCGATCGCGGCGATCAGCGGGTCGGGGGCTTGGTCCTCGGTTCCCAGATGGACGCGCATCGTCACGGCCGACAGGTCCATGACCCGCCCGTCGGCCCAGACCCGGCCGATCCCGTCGATCGGCCCCTCGGCCACGGCGACGGCGAACGACAGGCTGTAGGCGTAGGCGCTCGTCTTCTGCCCCTTGGAGCCGCCCACCCGGCCCTCGACCCGTTTCTCGCGAAACCGCGCCGCCCAGATCACCTGACCGCCGACCCGCGCGCGACCGAACACGCACGGCAGCGCCGCGCCCTCGGCCGCGCCGGTCAGGCGCAGCTCAGGAATGCGCGGTCCGACCTGGCGCGCCGGCGACAGGGCGCTGATCGCCGCCTGGTCGATCAGACCGCCGACGACGGCGCCGGCCACGGCCCCGATCGGACCACCGATCGCCGAGCCGACGCTGGAGAGGATCACCTGGGCCATATGAGCATTTCTCCTCCGATTTCCCCGGCGAAGGCCGAGGAGGTCGAGCGTTTGCCTCGGTGCGTCCTTCGAGGCTCGCCTTCGCCTCGCACCTCAGGATGAGGAGGCTTTTTTGCTGAACACCTCAGCCTGAGGTGCGCGCTCCTGAGCGCGCCTCGAAGGACGCAGGGCGGCAAGACGCAGTCGGGTGTTTGTTGGGAAAGGGTGGAAACCGAAACGCCGCCACACAGCGTTCGCGCCACCAGCGGCCCAGCGCGCTCTCGACGCAGGCCCGGCCCCAATAGGCGTGGATGATGCGGTCGGGCGCGGCCTTTATCGCGCAGTGCTTGGCGGCCGCGCCGGGGGCCATGCGGAAGACGAGCACGTCGCCGGGCAGGGCCTGCGACACCGGGATCGCCACCAACCAGCGCCTGAAGGCTTCCAGCAGCGGCTCGCCCACGCCGATCTCGGCCCAGTCGGGACGATAGGGCGGCGGCGGCTCGGGCTCCTGCCCGTACAGCCCCCGCCAGACGCCGCGGATGAGGCCCAGGCAGTCACATCCCGCGCCGAGGGTCGAGGCTTGGTGGCGGTAGGGTGTGCCGAGCCAGAGGCGGGCTTGGACGACAACCAAATCCTCCCCCCAGCGGGGGAGGTGTCGGCCCGCAGGGACGACGGAGGGGGAAGAGGCCGGGCCAGGAGACCTTCCCCCTCCGGCCTTCGGCCACCTCCCCCTCTGGGGGGAGGATTTGGGACAGGCGTCCGCCGCTTCACCCAGAACAATCCTCCCCCTCTGGGGGGAGGATTGCGCGCCCCCGCCGCTCATCGCCGCCCGCCGTCGTTGCGCTCGCCCTCGACGGGGTACAGCGTCAAAAAGTCCTCGCCGGGCGAGGTCGGAAAACCTCGGAAGTTCACGCTGTTGCCGAACACCCCGACACAGGTCGCCCAGCGCTTGTCGCAGGTCGCGCCCGGCGCGGGCGTGATCCCGCAGCGGACATCGCCCAGGCGCGCGTCGCACAGCCTTGTGAAGGTGCGGCCCGCCACCCGCTCCAGCGCCGCCAGCGGCCCCGCGAGCGTGGCGGTGAAGGCCTCGCCCTCGCGGGTGACCGAGGCGATGGTGGCGGTCCACAGCGAGACGCGGCGCGAGGGTGCGCTCCAGTCCACGCGCAGCGCCTCGACCCGCGCGCCGTCATAGAGGCCGGCGGCCAGATCGGCCTCCGCGATCCCCGCCTCGTCCAGCACGCCCAGCACCGCGCCCTGGCCCGGCGCATAGCCGACGCTGTTTTCGGTCGCGCCAGGACTCCAGCCGCCGCCGGCCCGGCAGGTCACGCCGTCGACGACGAGATCTTGGTCATGATCGGTGAAGCCCAGCCGGGTTCCATCGGCGCGGGTCAGGATCCAGACGTGACACAGGCGGGCCTCGTCCAGGCCTTCGGGCAAGGCGCGCATCAGACCAGCACCTCGACCAGCGAGCAGGCGGTGACGCGGGCGGCGGCGAAGCCCTCCAGGGTCACGTCCAGCCGGTCGAGATCAAAACGCACCGGCGTGTCGAACGCAAAGCCCGCCGTCACCGCCGCCCCGGCCGGCGGCGCGGTGTTCAGGGTGATGAGGCCCGTCGTGACATCCACGGCGAAGGCGCCCGGCGCCAGGACGACGCCCGCCACCGCCACCGTCACCGTCCCGGCGACCGGCTTGGTTATGGTGCGGGCCACCGCCTCGCCGCCCGCGCCATAGGTCTTGCGCAGTTGAAAGGCCTTGCGGACGCCGTCGCCGGTCCCGATCGCCTGGTCGCCCGCCGCCGGCTGAACCGACGGCGCGCACGACTTGAAATCGGCCGGATCGCGGAAGCGGAAGCCGTGGAGGCGCCCGCGCCGCGCCTCGAAAAAGGCGACCAGTTCGGCGATCTCGTCCAAGGGGCGCGGGGCGGTGGCGATCAGATAGCGGCGGCGGCTCTGGCTCCAGGGACTGGTGCGCCGCTCGTGGCCCGAGGCCAGGCTCACCACCTCGGTGCGCCGCTCGATCCCGCCCGTGCAGCCGAACGCCAGGCGCGCGGGCAGGCGCGCCTCATGGAACTCGCTCATCAAGCGATCCTTCTGGAAAGTCGAGAAAAGGTTTGAGGAAAAGCAAGACGGCGCTTAGTCGACGCCATTAAGACGACGTCATGACCCGCTTTTCACTTCTTGCGCGGGACATCGGTCTCTAGGTTAGGGATCGAGATTGGAGACCACGAAATGGATTGGAAGACCCTGTTCCTGTCGCCTGAGGGCCGCATCGGCCGGCAGTCGTTCTGGGTCGGCTGGCTGATCCTGCTGGGCGTCAACGTCGTGGCCAGCTGGATCCCGTTCGTCGGAACCCTGATCATCCTCGCCAGCATCTACGCCAGCGTCTGCATCCACTCCAAGCGCCTGCACGACATGGGTCAAACGGGCTGGTGGCAGGTCCTGCCGTGGGTCCTGGGTCCGGTGCTGGTGTTCGGCGCCGCCATCTCGGTGGGCGTGATGCCGGCCATCGCCGCCCTGACCAACGGCGAGCCGGAGGTGTCGGCCCTGACCGCCCTGGTGGGCCTGTTCGTCTCGATCTTCATCGCCTTCGCCATCTGGCTGGCCTTCACCCTGTGGGTCGGCTGCAGCCTGGGCCAGCCGCGCGAGAACAAGTACGGCCCGCCGCCGATCAACCCGACGGCGGTGACGGTGTAGCGGATCGAAGAGCCCGCGCGCCCACGCTTGTCCGCCGAAGAGCTAGAGCGCGTTAGGTTTAAGTGTGAGCGGTTAAACCTAACGCGATCTAAGTCTATGAATTAGAGCCTGTTTATCTGGTTTAGATGGGCCATCTAAACCAGACAGGCTCTAGGGGGCCCGACGGGCGTCGCGCTTGCCTGACCAGCTGATGAACATGGTGCAGCGCTCGCTCAAGCAACCGTCGAGATGCGGCTGCTCCCCGGGCTGGAACCAGTAGCTGCCAGGTCCGAGCGGCTTGGCGCCGGCCCCCTCGCCGGCCTGGGCGTGGGTCATCACGCCCTCGAGCACGACGAGCCGATAGTCCGACGAGTGGACGTGGGGTACGCCTTGTCCACGGGAAAAGCGCATCAGCATGTCGGACGGACCGGTGGCTGGATCACCGCGCAGAACGGCCATCTGAGGCCCATTCGGATTGCCTGGATCTAAGGGCGCGAAGCGGGCGTCGTCGATCGCGAGGGTTACGGGCGCCGCCTGCACGGCCATGGCCAAAAACATCGGCGTCCAAGCCATAGCCCGCCTCATCGCAAAAAAGACGCCACGACCATACAACGCGTCGCGCTGCTGTCTATGGTTGCCCTGAGCCCAGGCGCTTCGGGCGCCCCTACAGCCCCCGCGCGCCCAAACTGACCGCCCGCGCCAAGGCCTGGGCGATCTGGGCGTCGGAGCGGATCAGGCCCTGAGCCTCGCCGCCCTGGACATTGACCGTCACCGACACCCCGCTGTTTCCGACCGGCTCGATGTTTCCGGCCGAGGCCGGGCGAAACAGCTCCGGCCCGCGCTCGCCGACCAGATAGGCGCCGCCGGGCAGCACCGGGCCGCCGTCGGCCCGCGCTCCGGCGAAGGTCTTGCTCAGGGCCTCGCCCAACCCGCCGCCCTTCAGCGCCGCTCCGGCCGCGCCCAGCATGGCGCGGGCCAGCTCGGCCAGCGACACCTCACCGTCCGAGGCCGCACGGGCCAGGGAGCGGACAAGCGACGCGCCCGCGCGAGCGAAGGCCTCGTCGATCGAGCGGGCCGCGCGCTCGGCGGGGGCTTTCAGGCTCTCCAGAGCAGCGGCCGCCTCGGCGGCGCGGGCGGGCACGGCGGACAAGCCTTCCTGGTCGAAACTCATGGTGCGGTCTCCTCGTCGGGATAGCGGGCGAGCAGATCCTGCAGGCCCGCGCGGTTCAGGCACGGCGCGTCCGGCGCCTGCGTCAGCGCCCGCCACTCCTTCAGCGACAGACGCCAGAAGGCCTCGGGCGGCAGGCCCAGCGACAGGGCCAGGCGCAGCGGCGCGGCCCAGCTCATGCGCAGGCGGCAAGCGCGGCGGCGAGGGCGCCGGCGGCCTCAGGCACGGTGGCGCGCGCGGCGATCTCAGGCGCCTCGCCGCCGCCCTCCAGCAGCGCGGCCAGGATGGCGGACAGCTCGGCGGCGCTGAGCGTGGCGATGCGGTCGGGCAGCTGCGACCAGTCGGCGAGGTTCAGCGCCGCCTCGATCCGGGCCAAGGCGCCGAGTGTCAGACACAGCCGACGCGGCGCGCCGGCCAGGGTCACGACGACCTCGCCGCGGGCGGGGTTGGGGGGGAGCATGGGGGTCACTTTCGCAAATCCTCCCCCAGCGGGGGAGGTGTCGGCCCGCAGGGACGACGGAGGGGGAAGAGGCCGGGCCAGGAGAACTTCCCCCTCCGGCCCTTCGGGCCACCTCCCCCTCTGGGGGGAGGATCTAAAGCGCCGTAAAGCTCACCGCCCCGGCGCTGGCCAGGCTGAGCGCGAACACCGCCTCGCCGTCGTGTTCGCCTGCGTATTCCAGGGCCGCGACGATGAACGCCCCTTCCAGCTGGCCAAAGTCGGGGATGATCAGCCGCCAGACGCGGGCCGACTGGTCGAAGAAGCTGGTGCGCACCTGGGCGTCGGACACAGCGTCGCGGAACACCCCCGAGCCCGACACGGCCACCGACTTGACCCCCGCCCCGGCCAGCAGCTCGCGCCAGCGGCCGGTGCTGTCGCTGTCGGTGATGTCCAGGGTCCTGGCGTTGAGGCTGATCGTGCGGGCGCGCAGGCCCGCCACGGTGTGGAAGGTGGGCGTCGGCGCCCCGTCGCTGATCTTCAGCAGCATGTCCTTGCCGGCTTGGGCGGCCATGGGAGGCTCCTTGTTCTAAAGGGTCTCGGTCACCGCCCGCACGCGGACGATCCCGAGGGTGGTTTCGCGATCGGCGCCGGCGAAGACGTCGGCATAGGGCACGCGCAGGTTCACCAGGCGCCGGCCGACAAGGCTGGGCCGGGCGTCGTGCAGAGCGGCGCGGACGGCGGCGACCAGGGCGCGGGCCTCTTCGGGGCCGCCGAAACGGCTGGCGCAGGTCAGGGTCAGCAGGTGCTCGATCCCCTCCCCGCCGACGCCGCCGACCGCCCGGCCCTCGGCGCGGGTGACGACGACACAAGGATAGGTCGGCAGGCGGGGGCCGGCGCCATACACCCGCTGGCCGGCGATGGCGGTGACGGCGGGCGCGGCCTTCAGCGTCGCGACCAGGGCGTCGATCAGGGGCTTGTCGCTCACAGCCGCGCCTTTCGGTAGGGGGTCAGCCACGGCTCGACGAGGGCCAGCGACGGCTCGCCGGTGTCTCCCAAAGACGAGCGATGCTCGTAGGCGTGGGCGGCCAGGATCAGCACCGAGAGGCGCAGCGGCGCGGGGCTCGTCGCGGTCAGGGCGACGCCGGCGACGGCCCCGACACGCGCCTCGGCCGCGTCGATCAGCAGGGTCAGCAGCGCGTCTTCCGAAGCGTCGGACACGCGCAGAAACGCCCTGGCCTCGGCCAGGGTGAGGGCTTGGGGCATGGGGGGATGTCCTTGAAAATCCTCCCCCAAGCGGGGGAGGTGTCGGCCCGCAGGGACGACGGAGGGGGAAGAGACAGGCGCGCCAGCCTTCCCCCTCCGTCCGCTTCGCGGACACCTCCCCCGTCAGGGGGAGGATTGGATTACGACGCCGCGAACTTCAGCAGCTTCACCGCATCGAAGTTCTGCACGCCGCCGCCGACGCGCTTGGTGGTGTAGAACAGCAAGTGCGGCTTGGCCGAATAGGGGTCGCGCAGCACCCGCACCCCGGCGCGGTCGACGATCAGATAGCCCTTTTCGAAGTCGCCGAACGCCACCGACAGGCTGTTGGCGGCCACATCGGGCATGGCCTCGATCTCGGTGACCGGGAAGCCCAGCAAGCTGGCCGACTGCCCCGGCTGCAGCGCCGCGTTCCAGATGTAGTTGCCTTGGGCGTCCTTGAACTTGCGCACGGCGCTGACCGTGCGGCGGTTCATCACGAAGCGGCCGTTCTGGCGGTACTGGGTCCTGGTCGCGTAGATCAGGTCGATCAGCCTGTCGGTCGGATTGCTGGCGGTCCAGCCGCCGGCGACGCCCGTGGCCAGATAGCCGACCTGGCCCCAGGTGTACGACGCGTCCGGCGCGGCGGTGTAGGCCAAGAGGCCCTTGGGCTTGTTGACCCCGTCGCCGGTGACAAAGGCCGTGGTCTCCTGGGCGGCGAAGGCGTCCTGCACCTCTTCGGCCAGCCACTCGTCGATGCTGACATAGGCGTCGTCCAGCAGGGCCTGGGTGGCGGCCGGGCTGGCGTAGAGCTCGCCGGCCGGGAAGTCGATGACGTCGAGCGTGGGCGCCGTGGTCTCGGGCCGCGCGGCGGTCTCGGCCACCCAGGCGGCGGCCAGACCCGTGGGCGACACCGGCTTGCGGAAGGTCCCCGCGCCGATGGTGCGCACCTGGCAAATTTCTCGCATGGGGCTGGTGGCCGCAAGGCGACGCAGGATCAGCCGCTCCAGCTCCGGCGGGGCGACATAGCCGCCGGCCGTGGCCACGCCCGCCGACAGACCCTTGGCTTCCAGCAGAGCGGCGGCCGTCTCGCCGGTCTTCACATAGCGGTCGAAGGCGGCCTTGCGCTCGTCCACATGCGCCACCGGCGCCTCGCCGCCCAGGGAAGGCCTGCGGGCGTCGCTCAGCATGCGGTCCAGGCGGTCCTGAGCCCGCGAAACGGCCTCGTCGATGCGGCTGACCTTCTCCTCCAGCAGCACGTCGGCCCGCTTGGTTTCGATGGCGGCCAGGCGCTGATCGTTGGCGGCCTTGAAGGCCTCGAACGCGGCCAGCACCTCATGCATCATCGCGCCCGGCGAGGCCGGGGCGTGTTTGGTCTCTTTCAT
Protein sequences of DBSCAN-SWA_2 >CP023313|3043036:3068053|3064989_3065169_-|ATC25687.1|tail|DBSCAN-SWA MSWAAPLRLALSLGLPPEAFWRLSLKEWRALTQAPDAPCLNRAGLQDLLARYPDEETAP >CP023313|3043036:3068053|3053887_3054397_-|ATC25677.1|DBSCAN-SWA MFVLMLCAALLVIVGCPETALGKGLRRWLVDWPAKVLAGLTPARLVLLLALLAVSTLVVVLFEVEGAILLGMALPEVAVWFMAFDVAAFIDLFAAIALAANGARLKGLGDRIKALPGQFGRGLVARVRRSGQGRQGGRRFRRPGSGATKSEDGDAGWSGAAQPGALAWA >CP023313|3043036:3068053|3047879_3048992_-|ATC25672.1|DBSCAN-SWA MIAFWIAAAGLSVVTAGLVLRGAAQARAVDDAETRLEPHRRRLAEVERLALDGLLAEDELKAARAEAGRGLLAAADQAESWSRDGAGPRKAAVAAVGLTCASAVGLYLVLGYPGLPDQPFAKRVAAWRAADPATLEPAKIAAVLEGVAAERPNDPEPLVFMAKARAAAGDMAGAEQALRKAVRIAPKRADIWSLLGETFVIQAEGQVGPDAKLAFGQALKVDAADVRARYYLGRARIAEGEVAAGLSDWRGLLASLAPNAQGRESLAAEIAEVEASGKLPGPQPAAESANPEVQGMIAGMVDGLAQRLKQSPDDPDGWVRLVRAYAVLGETAKRDAALATASALFKDQPKVMSALRQAADTPAQAKTTNP >CP023313|3043036:3068053|3065165_3065450_-|ATC25688.1|DBSCAN-SWA MLPPNPARGEVVVTLAGAPRRLCLTLGALARIEAALNLADWSQLPDRIATLSAAELSAILAALLEGGGEAPEIAARATVPEAAGALAAALAACA >CP023313|3043036:3068053|3047394_3047883_-|ATC25671.1|DBSCAN-SWA MSFWPQSRKARRRLTILLAIAPVLALAVGLALYGLRDSISLFYTPAQAQEAKVSAGRKVQLGGLVQHGSVVKYPDGNVEFVIADQKATAKVVYHGDLPDLFREGQGIVAEGSFNPAGVFEAKLVLAKHDERYMPREVSKALKEQGEWRGEGADAPAYGSQKP >CP023313|3043036:3068053|3051315_3052707_-|ATC26793.1|DBSCAN-SWA MIVPSHRSLVFRLVVAAGIWTLLVVAGAGVFVNTQFRDAQVRRFDQGLSILIDDMYANTSVEDGLVKAPFLTDIRATRAYSGRYWTIFDSTGDGSLRVIDRSRSLFDSDLMLPSAQLDRLIAAPGKTIYFDLRGPQDARLRAGGLLARLPGHATPVIFLVAEDRSPIDADTGRFVRITAFALLILSGGLILAVVVQVRFGLQPLFQLRREVAHVRRGKAERVDGRYPEELEPLAAELNALLAHNQEVVERQRTHVGNLAHALKTPLSVMLTEASQQPGQLAEVVTRQAQTMREQVDHHLRRARAAARSQTSGERTPVEPILDELAVTLERIFQDKADGRGVEIDWRCPEDLCFQGEKQDLMELAGNVMENAGKWCRGKIRVDAVRTGEARMTLTVDDDGPGLTPDERAQALKRGQRLDENAPGSGLGLSIVDELARAYGGSVQLGESPLGGLRVSLDLPCAES >CP023313|3043036:3068053|3056424_3057372_-|ATC25681.1|DBSCAN-SWA MARDPYQELGVTRTASADEIRKAFRKLAKQYHPDANPGDKKAEERFKQVSAAFDIVGDAEKRKKFDLGQIDADGRETMRGFGGQPGNGPFNAGGFGQGGFHRSNEGPEIDLSDLFGGMFGGGGGGAGAGRGPFSGGAGGGFSAKGADVKARLDIDLEDAIKGGKKRVAFSDGRTIDVTIPTGAQEGQTLRLKGQGAPGRGGQGDALIELAIKPHPIYRREGEALVMDLPVSIPDAVLGGKVEAPTPDGNVMLAVPKGSNSGQTLRLKGRGMPDGKGKRGDLLARLVVTLPETVDQDLEKFAEAWRAQKPYTPKRK >CP023313|3043036:3068053|3054598_3055906_-|ATC25679.1|DBSCAN-SWA MTTVAEVRLWGSRIGAVSLEDGAQTAVFAYEPGFIASGIQPAPLMMPLKAGVFSFPDLPPRSFHGLPGMLADALPDKYGHVLIDAWLATQGRSPESFNAVERLCYTGRRGMGALEFSPMAGPRRRVSSKIDIDALVTLASEVLTHRHDLRASFADADKADALRDILSVGTSAGGARAKAVIAWNPATNEVRSGQVEAGAGFGYWLLKFDGVSGNRDKELADPKGYGAVEHAYGQMAAAAGIDVAESRLLEEGGRRHFMSKRFDRLDGGGKLHMQSLAAIAHLDFNDPVANSYEQALFTMRRMGLPMAQLEEQFRRMVFNVLARNQDDHVKNIAFLMDRAGRWSLSPAFDITWSYNPDGEWTSRHQMSINGKRDGFDFADLEACAKTASISRGHVGRIFEEVRAAVMRWPTFADAAGVDERWRDQIGATLRLELRR >CP023313|3043036:3068053|3057460_3059992_-|ATC26794.1|DBSCAN-SWA MVLHQAALAAQAGGVDGFIIGSELRALTTTRGPGGTYPAVTKLKTLAADVRAVVGPATKLGYAADWSEYFGHQPRDGSGHAVFHLDPLWADPNLDFVGVDWYPPVTDWREGEDHLDAMAGYDGPHDPAYLRAGLTGGADFDWFYAGGADRDAQVRTPITDGAHGEAWMFRPKDLLSWWSQPHHDRPGGVRSATPTAWVPRSKPIRLTEFGCPAVDKGSNSPNLFIDPKSSESFLPPYSSGERDDFGQRRYLEAVLAWLDEPSANPVSPLYGGPMIEAASAWCWDARPFPDFPARADVWSDGGAWLLGHWLTGRAGIAPLPELIQALGARAGVALDPGEAGGAVGGYVVDRPMRLRDALSPLTEAFALDPVERGDQVRMMSRTGRAVAAIEPDDLVLPEDGPAERETRTLDPAAEALRLRFLDAARDYQVGALIVRREAGEGTRDMDAPIVLSAAEAAAVARRMLDADAAARRTRIVRLSPSAALRFEAGDRVTLDAETWRIQRLDLDERPRANLAPVLRSQGVDAVIDWTPAPPREPASPPVLHVLDLPSDGALADDARPLVAAAAEPWRPLDVHAGAGVETLTVRARLAAPATLGLTLTDLAPASPHRLDRSARLDVRIEGASLSSAPLAAVLAGGNALAIRAPSGDWEVIAFQTAQLIAPDVWRLSGLLRGQRDGAASEGVIPAGAAVVLLDEAVVPISVAAFERGTTLMVRAAPAGGPPSGAGMTQISAVWTGRALRPLAPAHLRKRSIGGDLSVSWIRRARVGGDVWDGEVPLAEGVERYRVRVLDGAAVLREAEVETPGFTYTAAMRAADAPSSGARLEVVQGASLYGWGAPASTSLW >CP023313|3043036:3068053|3048998_3049784_+|ATC25673.1|DBSCAN-SWA MLACSGRRRERKKRLRRPAPRHSGAPVEEGSMTAILLVPGLLCSEEIFAPQLPVLWPRGPVTIANTLAGDSLAEIATRILADAPPTFALAGLSMGGYLAFEIVRQAPERVSRLALICTSARPDTPEQAAGRRKMVAQAHKVGFERFCALGADALTHPSRKGDPALNALSARMGLAVGLEGFARQTEAVIGRPDSRPLLASIAVPTAIIVGDADPLTPKALSEEMAAMIPDARLVIAEDCGHVITHERPDVVNPAMAAWLGV >CP023313|3043036:3068053|3044808_3045261_-|ATC25669.1|DBSCAN-SWA MKRLRALLIAAAAVALTAGASDPSERLPDQAQEARARTLFKEVRCLVCQNESIDDSEAQLAGDLRQIVREQVKAGSSDAQIRDFLVERYGEFVLLKPRFSMGNAVLWLAPVVVLLVGGVLMAGLLRRRETPEPALSQDEEARVRRLLEGD >CP023313|3043036:3068053|3066410_3066692_-|ATC25691.1|head,tail|DBSCAN-SWA MPQALTLAEARAFLRVSDASEDALLTLLIDAAEARVGAVAGVALTATSPAPLRLSVLILAAHAYEHRSSLGDTGEPSLALVEPWLTPYRKARL >CP023313|3043036:3068053|3063378_3063843_+|ATC25684.1|DBSCAN-SWA MDWKTLFLSPEGRIGRQSFWVGWLILLGVNVVASWIPFVGTLIILASIYASVCIHSKRLHDMGQTGWWQVLPWVLGPVLVFGAAISVGVMPAIAALTNGEPEVSALTALVGLFVSIFIAFAIWLAFTLWVGCSLGQPRENKYGPPPINPTAVTV >CP023313|3043036:3068053|3052748_3053426_-|ATC25675.1|DBSCAN-SWA MRILLVEDDPDLTRQLKLALADAGYAVDHAPDGEEAQYLGENEPYDAVILDLGLPKVDGVSVLERWRRGNVTTPVLILTARGAWSDKVAGFDAGADDYLAKPFHTEELLARLRALLRRSAGHAAPSLSCGALRLDPRAARASVNGEPLRLTSLEYRLLHYMIMHQGRVIGRTELVEHLYDQDFDRDSNTIEVFIGRLRKKLGADRIETVRGLGYRLAALPGEDAA >CP023313|3043036:3068053|3066832_3068053_-|ATC25692.1|capsid|DBSCAN-SWA MKETKHAPASPGAMMHEVLAAFEAFKAANDQRLAAIETKRADVLLEEKVSRIDEAVSRAQDRLDRMLSDARRPSLGGEAPVAHVDERKAAFDRYVKTGETAAALLEAKGLSAGVATAGGYVAPPELERLILRRLAATSPMREICQVRTIGAGTFRKPVSPTGLAAAWVAETAARPETTAPTLDVIDFPAGELYASPAATQALLDDAYVSIDEWLAEEVQDAFAAQETTAFVTGDGVNKPKGLLAYTAAPDASYTWGQVGYLATGVAGGWTASNPTDRLIDLIYATRTQYRQNGRFVMNRRTVSAVRKFKDAQGNYIWNAALQPGQSASLLGFPVTEIEAMPDVAANSLSVAFGDFEKGYLIVDRAGVRVLRDPYSAKPHLLFYTTKRVGGGVQNFDAVKLLKFAAS >CP023313|3043036:3068053|3053441_3053759_-|ATC25676.1|DBSCAN-SWA MKPIRALIVAAALAAVLPDAAFAQRRPDSLGADWRQQQDQARGGVQSGRLVPLSRVIEMIGRRVPGRVLDAGLEGDNYRVRWAAADGRRIDFIVDAQTGQILSGG >CP023313|3043036:3068053|3065583_3066000_-|ATC25689.1|tail|DBSCAN-SWA MAAQAGKDMLLKISDGAPTPTFHTVAGLRARTISLNARTLDITDSDSTGRWRELLAGAGVKSVAVSGSGVFRDAVSDAQVRTSFFDQSARVWRLIIPDFGQLEGAFIVAALEYAGEHDGEAVFALSLASAGAVSFTAL >CP023313|3043036:3068053|3043036_3044629_-|ATC25668.1|protease|DBSCAN-SWA MTARKSGFIVGAVAGASVACAALAGMGMRMGTADAAQAPVVRVSTASAPTFAPPPGAPMSFADIFEKVSPAVVQINVTSKAQAPSLRIPGLEGFDIVPRGQRRPGQPGQQGQQQEDDGDTPATPKQQSAGSGFFISADGYIVTNNHVVADADDIQVVLKDGRELKATLVGRDESTDLAVIKVVDPKAKGKDFTFVNFENQAKPRVGDWVITIGNPFGLGGTATAGIISAYDRNLNDTTSSFVPYIQIDAPINRGNSGGPSFDIYGRVIGVNSAIYSPSGGSVGIGFAIPAEVAEGVAKQLIENGKVVRGYIGVSIMAFNAEMAEALGMSDVKGAIVASVVPGGPAAKAGLLPDDILVAVNGVKISDSSELTREVSKARPGETIKVSIIRDGKPRIVDVKSGTRPSESSLAVTDDEQGQDGAAPTPDKPASQKVDALGLTLGPIDAASRQTYKIEPDIKGLLIIGVKGDSDAGEKGLAKGDVLSNINGAPVTSVADVTSAVETAKKAGRASVLVKIVRQNRPVFIPLKIAP >CP023313|3043036:3068053|3062595_3063231_-|ATC25683.1|DBSCAN-SWA MMSEFHEARLPARLAFGCTGGIERRTEVVSLASGHERRTSPWSQSRRRYLIATAPRPLDEIAELVAFFEARRGRLHGFRFRDPADFKSCAPSVQPAAGDQAIGTGDGVRKAFQLRKTYGAGGEAVARTITKPVAGTVTVAVAGVVLAPGAFAVDVTTGLITLNTAPPAGAAVTAGFAFDTPVRFDLDRLDVTLEGFAAARVTACSLVEVLV >CP023313|3043036:3068053|3061295_3061925_-|ATC26795.1|DBSCAN-SWA MGEAADACPKSSPQRGRWPKAGGGRSPGPASSPSVVPAGRHLPRWGEDLVVVQARLWLGTPYRHQASTLGAGCDCLGLIRGVWRGLYGQEPEPPPPYRPDWAEIGVGEPLLEAFRRWLVAIPVSQALPGDVLVFRMAPGAAAKHCAIKAAPDRIIHAYWGRACVESALGRWWRERCVAAFRFPPFPNKHPTASCRPASFEARSGARTSG >CP023313|3043036:3068053|3055902_3056262_-|ATC25680.1|DBSCAN-SWA MKFESLLSDEAVLAEMGQRLVAARLERRLTQAQLAQAAGVSKRTVERLEDGASAQLTNLVRCLRALDRLEGLERLLPETPANPLDLLKQAKTGRSRVRNARASGVAETGGAPWVWGDEK >CP023313|3043036:3068053|3066015_3066414_-|ATC25690.1|DBSCAN-SWA MSDKPLIDALVATLKAAPAVTAIAGQRVYGAGPRLPTYPCVVVTRAEGRAVGGVGGEGIEHLLTLTCASRFGGPEEARALVAAVRAALHDARPSLVGRRLVNLRVPYADVFAGADRETTLGIVRVRAVTETL >CP023313|3043036:3068053|3061969_3062596_-|ATC25682.1|DBSCAN-SWA MRALPEGLDEARLCHVWILTRADGTRLGFTDHDQDLVVDGVTCRAGGGWSPGATENSVGYAPGQGAVLGVLDEAGIAEADLAAGLYDGARVEALRVDWSAPSRRVSLWTATIASVTREGEAFTATLAGPLAALERVAGRTFTRLCDARLGDVRCGITPAPGATCDKRWATCVGVFGNSVNFRGFPTSPGEDFLTLYPVEGERNDGGRR >CP023313|3043036:3068053|3049784_3051155_-|ATC25674.1|DBSCAN-SWA MTLVGSLDPARGIMAAIAAHKLQIIQTLVETAPDAALRSLELALAGAGSQGALASVRGLVEDETANRFVRNNVLAPIVPLCARRASSQVSFPAPVLSRLWRALKSVAAARVEEAAARCNPWDLEQGSPEVFDELCKLAAAGLRDPENAAFDSVRSLCDPEQLALCLQLSTLTRGCLPKLAEWVSRMSDERAAAARLAYRDACRISEDAGPLLLDILSAHLPDDWRIMRVISAVMDRPSDRYLASSEVSQFGERILTEIEETIALIESFSFADGEKAGRLAAQAAQKVQLQMVEIQQSVDIAKDGPWGKRLARQKQAMAKACELRMDQAEKELDKALPTRPISMLAKKGARGVAKLIEAPDEAMIRRAQSALAFVAELRACADKAGYGSSRTKVLEKLNARLDPYIEDVLHVARTGEGGDSAVAVKYLDIAASFIAYTRDDKTAEIVRRRAAAAIAA >CP023313|3043036:3068053|3045257_3047240_-|ATC25670.1|DBSCAN-SWA MIVELGAFALILSLMLSVAQTGLSAVGGARRSPVLAGAGQGAAIATFVALLVSFAALIYAFVTSDFSVTNVATNSHTDKPMLYKVAGAWGSHEGSMLLWCVVLTGFGAAMAVFGDSLPPRLRAYAIAVQGALGVMFLAYTVLASNPLARLLEAPIEGKSLNPLLQDWALAFHPPFLYIGYVGFSVVYSLSMAALIEGRIDAAWARWIRPWTLAAWSMLTVGITLGAFWAYYELGWGGWWFWDPVENASFMPWLIGAALLHSAIVTERRGALPGWTAFLALAAYTFSMLGAFLVRSGVLTSVHAFAVDPTRGVLLLIMMGVAAGAGFLLFGLRAPSLNRGGQFRPISRESAIVLNNILLSTATAVVLLGTLYPLIREAMDGEAVSVGAPFFNLTFTPLMILAFAVLPAGPLLAWKRGDAKGVARKLWVVLALAALLGLIAYAVVQPRKALASGGLVVGFWLIGGALLEIAERLKLARAPFAESLRRARGLPRGAWGTTLAHAGLGVFVLGASFETAWRVEAAQALSLNGSQPLGAYTVTLTDVVTIEGPNYLAERGIITVTNKAGAEVCRAQPERRFYPTGAQTTSEVAICAKGLDDIYVVMGERRAGEGGKPAWLVRAYVNPWVRLIFLGPLLMAIGGIVSLSDRRVRLGVGRRTGEVVS >CP023313|3043036:3068053|3054384_3054567_-|ATC25678.1|DBSCAN-SWA MTPLNAKGRGAACGARGGAGQAQGAAGCLIGGLSTALIRRTAAPSEDRGAIGRRELSCSS >CP023313|3043036:3068053|3064486_3064993_-|ATC25686.1|tail|DBSCAN-SWA MSFDQEGLSAVPARAAEAAAALESLKAPAERAARSIDEAFARAGASLVRSLARAASDGEVSLAELARAMLGAAGAALKGGGLGEALSKTFAGARADGGPVLPGGAYLVGERGPELFRPASAGNIEPVGNSGVSVTVNVQGGEAQGLIRSDAQIAQALARAVSLGARGL >CP023313|3043036:3068053|3063993_3064407_-|ATC25685.1|DBSCAN-SWA MRRAMAWTPMFLAMAVQAAPVTLAIDDARFAPLDPGNPNGPQMAVLRGDPATGPSDMLMRFSRGQGVPHVHSSDYRLVVLEGVMTHAQAGEGAGAKPLGPGSYWFQPGEQPHLDGCLSERCTMFISWSGKRDARRAP |
28 | Rhodobacter_phage(28.57%) | protease,tail,capsid,head | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
3945076 : 3967085
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >CP023313|3945076:3967085|DBSCAN-SWA TATGGCCCGTCTCGTCAATCGCCTCACTGATCGCACCGTCCGCGCCCTCAAGGAGCCGGGGCTCTACCCCGACGGGGCTGGTCTGTACCTGGAAGTGACGAAGGGCGGCTCAAAGCAGTGGGCCTACATCTTCCAGTGGAGGAAGAAGCGGACCCAGATGGGCCTGGGTGGCCTTCTCGCCGTCAGCCTGGCCCAGGCCCGCGAGGCGGCGGCTGAGGCCCGCAAACTCGTTAAGCAGGGCGTCAATCCGATCGAGGCGCGCAAGGCCGAGGCGATGGCGGCCAAGACCTTCGGCGACATGGCCGACGCAGTCCTGGAGACGAAGAAGGACGGCTGGAAGAACGAGAAGCACCGGGAGCAGTGGGAGACGTCGTTGAACGTCCACGCAGCCCCGATCCGACCGAAGCCGCTGGCCGATATCACAACAGACGACGTCCTGGACATCCTGAAGCCGATATGGACGCGGATCCCCGAGACTGCATCGCGAACGCGTGGCCGGATCGAAACCGTGCTGGACGCCGCTAAGGCCAAGGGCCTGCGGGCCGGTGAGAACCCGGCGCGATGGAAGGGGCACTTGGAGCATCTGCTGCCGAAGCGGAAGAAGCTTTATCGCGGCCACCACGCTGCTCTCGGCCTAGCCGAGCTGGGCGCCTTCATGGCCGCCCTCAGAGAGCGCCAGGCGATGGCCGCGCGGGCGCTGGAGTTCACCATCCTGACGGCGGCGCGCACCGGCGAAGTACTGGGCGCCACCTGGGCCGAGTTCGATCTCTATCGCGCGGTTTGGACGGTGCCGGCGGAGCGGATGAAGCTGCGCGTAGAGCACCGCGTTCCGTTGAGCACCCCGGCTATGGCTCTGCTAACTGAGCTGGCGCGCGCCAGCAACGCCAAGCCGGACGAGCTGGTATTCCCAAGCGTGATCACCGGCGGGAAGATGTCGAACATGGCCATGCTGATGCTGCTCCGCCGGATGACAAGGCCGGAACTGACAGTCCACGGCTTCCGGTCGACGTTCCGCGATTGGGCCGGCGAACTGACGAACTTCCCGCGCGAGGTCGCCGAGGCGGCCCTGGCGCACACCGTGGGTACTGACGTCGAGAAGGCCTATCGGCGCGGCGATGCGCTACTGAAGCGGCGGAAGATGATGGAAGCCTGGGCGACCTTCTGCGCTCGTACCGGCGTGCTGGTCGAGTTTCGCCGGGCCTAAAGGGCGTCACTCGTCGTCGAGATCGATGACCTGATGCAGCTCTGTGACGCGGTAAGCGATCATCTTGCCGCCCCGCACCTCGGCGCTCACGTCCACCACGAAGCCCTTCTTGAAGACGTTGTCCTCGGCTTCCGTGATCTCGTGCTTGATCTGCTGCTCGGCAAGTTCCGAGGCGTAAATCAGGGGCAGGCTTTTTGGCGACAGGGCTTCGATCTTTACCAATTCCCCTGACCTCTTCCCGATCGCCTGGCTGCGCACATCCGACCGCGTAAAGACCATCAGGACACGAGGATGGTTGGCGCCGCTCTTGTGGTCCACCAGGACTTTCTGTTCTTCGACCCTCTCCCGGATCTCGCGGGCCTGGGAGGTGTCAAACTTGAATGCCGCCTTTGCCTTGTATGCACCGTTCTCGATCTCGAAGGCCGCGATCTCAAGGGAAGATTGTGGGGTTGCCGCCACGGCGGCGACTTGATCGTAAAAGTCCTTCAGCTCGCCCTTGCCCGCGCCCTCCACGCGACCGCCGGGTTTAAGGAACGCCGAAAGGCGACCACCATAGTGCTCGACAAACTCCGCAACCGTGTTGGCGCCGTCCAACACCTTGACGATGGTTTCGGCGCCGCCAGCGGTAGCGAGGAATGGCACCAGATCAGCGATGATCGAACCGGCCCGAACTTCCTTGACGAAAAGCGTTGCTTCGCCAGGCCGATCAGGGCGCTCCGTCTTGATGAAGCGCTCGTACTCCGCCGCTACGGAGGTGAAGGTCGATACGAAATCAGTCAGCTCGATGGGCTCTTTCACGTCGAGCCGCAGGCGAATGAATGGTTGACCATGCTCCATAGCGGCACCGTAGGAGCGCAATTTCGAGTTGTCACTAGACATGGCCACGCTCCAGGCCGGGCATTCTAAGAGCTGGTCACGTCACCTTTGGACGTGGCGGCGTGCTGGCCACTTCAAGGCGCTGGAGAAGGCCTTCGAGCACTTCTCGCCTGATCAGTGTGCGCGTTCCGACCTTGAGCGTGGTCAGCTCGCCGCGTTCGATGAGGCGGTAAAGGGTGGCCTTGCTGAGACCGGTAGCCTCGGCGGCCTCGTCCATGCGGTAGGACAGCTTCTCAGACGCCGCGATCATCGCTTGGCTCCTTGGGTAGGTCGGGCAGGTCCGCCGCGTACATCCAGTGCGTCGCGGGGAACTGCAGGTGGGTCGGGGCTCCCGGTGTCATGGCCACCTTCAGAGCGCTGGGCGCCCAGAACATCCGGCGGCCGTCGGCGGTGGCGACGAGGATCGGCTCGTCCCGCTCGGGCCAGTCGGCGATCGGCTTCCAGGCGCTGGGCGTGCCGCGTCCCTCGGCCGCGAGGCGGACCACCACCCAAGCTTCCGAGAACTTGAACGGCGGGCGCGGGAGGTCGCGGTGGGGGTTGAGAGCGAGGAGATCCTCGCAGTCTTCAAGCCAGCGCCAGACCTGGCCGAGAGCCTCGGGCAAGGCCAGGGACATCAGACCGTGCTTACTGGTCATGATCGGCCTCCTCGTCTCCGGCTTCGGGGAAGTCATCGAGCCAGCCGTCAGCGACAAGCTGCGCGTCCAAGCGCGTCCACTCGCCGTCTTCACTACCAAACTCACCGACATCTTCGACCACACGGGCGTAGGCCTTGTGGGCCAGCTCCTGGGCGCGACCCGCGTCTAGGCCATGCATCTCCGTGAACCGACCGATGACCATGTCGATGAACTCGACCTCGTTGAGAGGATCAGGCCCATCGTCGGCCGCCTCATCCTCCTGGTGAGGCCTGATCTCCGAGCCGCTCCAGATGTCTGCGCTGCCCATGGCGTAGTTCAGCGCGATCGCCTGTAGCCGCCGCGTCACCTCGAACTGCGTACCGGCGCCCTGCAGCATGTTGTCCTGCCCATCGACCAGTACAAAGGTCTGGGGCAGCGGTTGCCCCCACTGGTCTTTCCCAGGGCTGCGAAGCGCTATGCGAAACGGTCCGACATAGCCTCGCTCGTTGAGCAGGACGGCGAACGCGGCCGCGAAGGCTTCGCGCGACATCGCCGGCGCCTCGGCCTCGAATGCGCGCAGCTGGACAAGCAACTGCTGAGCCTGAGCCTCGGCTTGGGCGGCACGTTCCCTGGCGGCGTTCTTCTCCTCGGCGGCGAGACGGGTCTGTTCCGCCGAGGCCAGGTCGCGCGCGTTCTTCTCGTCGATCAGTTTCTGGACAACCACCGAGTTGACCAGAGGCTTGCCGAGCCATTGGGACAGCGCCATCGCTTCGGTGCTGACGACCTTGTCGAAGGCGTATGCGGCGTTGTCGATCGACATCATCGCGCCCCTGCGCGCTTCCTCGGTCTCCGCCCTGGTCAGCTCACTGCAGTGCTGCTCCAGCGCCAGGAACCCCTTGTAGGTGGTCCCGAGCTTGTGCTTCGGCTCCCAACTCGTCGGCTCTTCGGTCTCGGCCAGGCCGCGTTCAATGAGGCTGGCGATCAAGCGTTGGTCTTGGGCTTCGCCGTGGCATTCACCCTTGCGCCAATAGCTGAGGTGGGCGGCGGGCTCCGGGTAGGCGTTCGTCCAGATCAGCGCGAGAGCGAGCAGCTCGATCGGCGTGGCCTCGACGGGTTCAGGCCGGTTCGCCAGCGCCTTCAGCGCTTCCTTGACCGTCATGTCACCGCGGTTCAGGCGGGCCTTCTGATCATCTGTCAGCTTGAGCAGCGATAACCGCAGCTGGACGAACCGCTGCGTCTTGCTGACCTTCTCGGCGATCTTGGCCGTGGACCAGCCATTCATGCTGAACAGCTGGTCGAAGGCGGCGGCCTCTTCGAGGGGTGTCAGGTCGACGCGATGCAGGTTCTCCAGCAGGGCGGCGACGACGTGGTCGTTGTCGGTCAGGTCCTCGATCTTGACCAGGATGGGGAAATCTCGATCGAGACGGCCCTGTTCGATGGCGCGTCCGATCGCGCGCCACCGCCGCTCGCCCGCGATCAGCTGGTTCGTCGGAAGCCCCTCGGCGTCGGCGACAGGATCGTCGGCGCGGACTTCTAGGTTGGTCTTCAGGCCCCGCTCGGCGATGTCGGCCGCGAGCTCGTCCAACGCCTCGTCGTCGAACTGCTTGCGAGGATTGAACTCGCCGATGCGGATCTGGCGGTGGAAGCGGAACGCATAGCCTTCCAGCGCGCGAACCGGGTTCGCCTCGCTCTGGGGCACCGCGCCGGTCTCCGCGATCTCGCGGGCGGCGATGACCGCGTGGGCGCCGGCGGCCAGGCGCAGTCCGCCCTCTTGTTTCGTGAGGACGCCGGCGTCGACGAGACGACCAAGCGTCTTATGCAGGTTCGACTTGTCCCGCCCCGTGATGCGGGCCAGCTCAGCGGCCTGGATTTGCGGGCCGTGCTCGTCGATCGAGCGCAGCACGTCGGTGTTCTGGAAAGCAAGATTGAGATCCATGGTCAGCCCCAATGGATGTAGGCGGCTACGGCCGCGATGATGGAGAGGAAGATGACGGAGGCCACGAACGCGCGGCCGTCGTCGGTGAGGAGACGCCCGCGCGGCCTACGCATCGCGGCGAGCCTCGAACCAGCTCAGCACCAGGACGGTGAGCGGCGTCAGAACCGCCCATCCGATCAAGACACCGGCCCAGAAATCCGACCGACACATCAGATTTCGCCGATCGCGGAGAGGTAGAGGTCGAGGATCGCGTCCTCTTCCTGGCGCTTGGCGCGGTCCTGCTTGCGAACCCTCACGACCTTCTTCAGCACCTTCACGTCGAAGCCGTTGCCCTTGGCCTCGGCGTAGACCTCCTTGATCTGCTCCATGATCTCGGCCTTCTCGACCTCGAGCCGCTCGACCCGGTCGATGATCGACTTGAGCTGGTTCTGGGCGGTGCTGTTCAGCACGTCCGAGTGCGGGATGGTGTCGTCAGCCATCAGTGCAGCTCCCGCTTTCCGCCCCAGCGGAGCGCCTTCGCGGAAGTGCCCGGCAGGAACTCCGCTGTTTGAGGTTCGAGAAGGGCGTAGGCCGCCTGGGCGCGGGCGAGCCGCTCATCGCAGCCGGTGCAGGCGCCAGGGACGTAACCGTCGCTCGTGATGCTCGCGATGATGTCGCCGAGCAGGGCCTTGGCCGCGAACTCGGTCTCGCCATAGGTCGCGCGATGGACCGTGTGCGCCGCTAGGCCGCTGCGCTCCTCGCTGATAGCGATCGCGATCATAGGTTTGGCCGTCTTGTCGGCGAGGCGACGGACAGCGGCCTCCCACCGGGCGAGCCACGTGAGGGGTTGGAACCTCGACATCACGCGGCCCTCCGAAGCGGCTTGCCGGCGGCGCGGATGGCGGCGGCCAGCTGGGGACCTTCGGCATGGGGCAGGACGACGGTGGCGATCCAGTCGGTCGGCCCGCCGGCCCGGTCGCCGACCTTGATGTTGAAGCCCTGGAAGGTCTCGGACGTGTCGTAGGCGCCGACGCACGGCGCGACGGACAGGGCCTGCTTGCCGCGCAGCGCCTGGATGCGAAGCGCGACGGCTTCCAGCGACGGATGTTCCTCGACGCCGACGCCCAGGCCTCGGATCAAGGTGAAGGGGGGGATAGCTTCGCGGGACATCAGGCGCGCCCCGCTTCCAGCTGGGCGGCCTGGGCGAACGCGGCGGTGGCGGCGCGCGCCATGCGCCGGGCCGGATCCCGCAGCGGCTTCTGGCGCAGCGCGGTGGCCGCGTCGTTCAGCTGGCGGGCGGCCGTCCCGGCGGTCGCGGGGGCCAGCGTCCAGTGGACGCCGCCGACGGCCACGTCGACGACAACCTGAGCCTTACGACCGGCTTTGACGTCAACGCGGTGTGTGATGGTCGCCGCCGCAGGGTGCGGCGACAACATGAGGCAGTCGACGAGCGTATCGACGACGGATTGGGAGGATCTGACCATGGACGCCCCGTGGTTCTTGTGACCAAGCGGCACGCGAACCCGGTTCGTCGGCCACTACCCGTCGTGGACGGGTATGGACGATATGGATGGAGATTTCTTCCACATGTCAAGCGCACGTCCATGTGGACGTAACGTCCACAATGGGCGCTTTGACGCTTGTGATTCTAGCGAAGCATCCGGAAGGGCAGATCAACGGCATGGATCGCGCGGACGCTCGAAGCGTCATAGCGAACCTCGTTCCCCTCGCCGCTCGGCGGGTTGTACTGCCATGCGAAGACCTGGCCGTCGCGCTGGCCGCGATAGCTCTTGATCAGCCCAGATCCGTCATTGAACTCGATCAGGCAGTCCTGGCCGCGCGCAGGCGGCAAGCCGAGGCGGACGGTAACAGCCTCGCCTGCGAAGTAGCGAGGCTCCATGCTGTCTCCGATGATCCGGACGACCGCTAGATCGCCCACGCCATTCCAGAGCGGCGGCGCATCTACCCAGTCGATTACCTGACCCGGATTGATGGCGATGCGCTCTGAGCCGCCCGCAGCCGCATAGCCATAGACAGGGATGCGCCTTGGCGCAGTGCGTCGGCGCTCCGACAGCTGTTCGACTGCCGCCTCGCGGTCGATCGGCTCGCCGAAGAACTGCTCGATCTTCAGCAGCTCGTCAGCGCGAAGCCGGCGCTCGCCCTTCAGCGTCTTAGTCAGGCTGGACGGATCCAGGTTCAGGTGACGCGCCAAGTCGGCCTGCGAGCGTTTGAGCTGGCGCAGGCGCGAGGGGATTTCGGAGATTTCCATAACCGCAAGCATGGAAAATATTTCCGTGGAACGTCTCTTGAGGTTTTTTCCAAATCGGTCCAGGCCTGGACGTGGAAATAATCTCTAGATGGATTTTCCATGATGTCCATGGGTGACGCTTCGGAAACGCCGGCCAATCGTGCCGTGCGCCTATTGGGCCTGATCGATCTGGCCCATGCATGCCGGCTGACGACCGACGCGGTTCGCAAGTGGCAGAAGAGCAAAGGCGGCCTGATCCCAGCCGCTCACCAAGCTGGGGTGCTTCGATTAGCGCGCGAAAAAGGTGTGTCGCTGACTGCCGCCGATATCATCGGCGGAGAGGCAGGGGGGGCAGATCAATGACCCGCACACCTCACGCCACCTCGGCCGATATCGCCAAGTGGAAGCGCCTGGCCAAGGAGGCGTCGCGCGCAAGCGGTGACGGCTCCGTCCCGCTCGGCGACCTGGAGAAGGCGACCGCCGCCGCGCGCCGCGCGATCGTTCCCGTCGGGGTCATCGACAAGAACAACCCGTTCGTTCGCCTGGTGCGGACCTCGCAGCGGTATCTCGCCAGCGACCGGGAACGCATCGAGCTGGCCTCCGACATGGCGATGCTGGCCCAGACCTGCGAGGCGCAACTTGCCCCGGCCCCTGACCCTCGGGCCTTGGCGCGCGAACCGCGTTCGCGCCTGCCCTACGCAGACGAGTAGGCGCCATGTCGGGGGGCGGTCACTTCAGCTGGACGGTCGGAGCGCTCGAAGATCTGCGCGCCTTCAACCGGATCGACGGCGACCTGAAGAAGCTCGCCCACATCATGGGCTGCACCTCGCAGGACGTGGACCAGGCGCTCTGGTTCCTCCTCGGTCGCAGCCCCGAACAGGCCCTTGAGGCCATGCACCAATATCGAATGGGGGCCTGCGCGTGAGCTGGGGCAACGATCAGAACCTGCGCATCGAAGATCTACCCGACTTCGCGACGCGGGCGCGTGGCGTCTTACGCCACTCGGGCGTGACCACTCTGGCCGACGCCGCCGCCAATCGCGGCGCCTGGCGAGACCATCCGCTGGCGACCAAGACCGTGATCGCCCATGTCGAGGACGTGCTCGCCGAGTACGGCGCGCCGGCATGACGGACGCGGGCCGTTTCTTGCGCGCGGCCATGGACTACGCCCGCCGTGGGATCGCGGTCTTTCCGCTCCAGCCGCGCGACAAAGCCCCATATGGTCGGACCATCGGCTTCAAGCAGGCCGCGCACATGCCGGGTCTGGTCGAGGACTGGTGGACCGGCCGGCGCCGTCTCGAGTTGAAGGCGGACGCCGACAACAAGTCGCCGGTGCGCGCACGGCTCAACAGCAACATCGGCATCGCGACGGGCGCGATCTCGGGCTTCTGGGTCCTGGATCTAGACGGCCCAGAGGCCGAGGCGGCTATCGCGCGGCTCGAAGCGCTGCATGGTCCGCTGCCGAAGACCGTCCAGCAGGCGACGGGGCGAGGTCGCCACCTATGCTTTGCGTGGAACCCGGCCCTCCCGGTGCGCAACATGAGCAAGCGCAGCCAAGAGCGGATCGGCGCCAAGATCGACGTGCGTGGCGACGGCGGCTACATCGTCGCGCCGCCGTCGGTTCACCCCGGCAAGCCCGAGGAAGGGATCCCGCCCGGCCGGATCTACGCTTGGGCGCCAGGCTGCTCGCCCCAGGATCTGCCGTTCGCGGACGCCCCGGCATGGCTGATGGAGCTGGTTTGCCCGCCGCCGGAACCAGAGCCTGTTCGCGCACCCATCAAAACTCGGGCGCCGGCGGCTGGTCGCGCGAGCGCCTATGGCGAGGCTGCGCTGGATGGCGCCGTCCGCACGATCCACGGCGCACGGGTCGGAAGCCGTGACACGACGCTCTATCGCGCCAGCTGCTCGATCGGCTGTCTGATCGCGGGCGGAGAGATCGACCACGACTATGGCCGGTCGGTACTGATCGAAGCGGGCCGGGTTCACGTGCCCGACGCCATGACGGTCGCCCAGCTGGAGCGCCAGGTGGATCGCGCGCTGGCCTGGGGCGAAAGCCGCCCGCGATCCGCGGGCGAGCGCCCGCGCCAGCGCAGTGTCCAAACCGAACGGCGGGCGTCGGCGACCAAAGGGGTAGAAGCCATCCCTGGGGATGAGGCGAACGCGGCGGCCCAGCTGTGGGACACGGCGCGGTCGGCCTGGTGCAAGGCGACAGTTCAGTGGTTCGAGGCCCGAGGCCTTGCCGGCACGCCGTGCGGCGTCACCGAGCTGCTCAATCGCTTCCGGGTCCACCCGAACGCCCCGATCGGCGGCGGCCGGACAGGTCCCGCCCTGATCGCGCCGCTGGTGCGCCTCGACGGCGATCCCATCGAGGCGCTGGCGGTGCTGCCGTTCGAAGCCGACCGCATCACGCACCTGGTCGGCGACAGCGACGGTCGCGTCGTCATGCTGACGCCGTTGCGCCCAGGGCATGAGCCGGAAAGCCTGATCGTCGCGCTCGACTTGCAGGACGCTTGGTACCTGCTGACCCAAGCCTGGCGCGAAGAGATATCGGCCGGCGCGGTGATCGCGCCTCGCCTTTCCACCTTTGCCGGCGGCGCGCTGGGCGACCGCTGGGGGCGCATTGATCCTGACGCCCCCGCGCACGATCCGACACGTCCGCCGTGGCGCGCCAGCGATCAGCGCAGCGTCTGGCTTGCTGTCCGGCGTGACATTCGTGGTCCCGAGATGCGGGCGCGGGCGTTCGGCGGCGGCTCGCGGCCGGTGCGCCTGGAAGGCGACGAGGCCAGCCGGTTCTACGGCGGGCTCGCGACCCAGGCCTGGCAACGGCCGGCCGAGAAATTCAATCCCGCGAACCGGGTTCGCGTGATCGGCCCGCTTGGGACGGGCGGCTTCAACGTGGGAGGACAAATCTAGTGGCCGACGGTGAGTTTCCGGGCGCGGCGCCGATGAGCGCAGGCCCCTCGGCAGCTGAATTGAGCGGGTATCCGCTGAACGACTTCGGCAACGCGATGCGCTTCATTCGGCTCGTCGGCGGTGAGGTCGACAAGGACGGCGACGTGCGCGAGCTGTCGGCGGCGACCGTGCTGTACGTGCGAAACCATGGCTGGGTCGGCTTCAACGGCCAGCACTGGGACCTGAAGGCTGGCGAGGGGTTGGCGCGCAAGTGGGCGGCCAAGGTCGCGCGCGGCATGCACGCCCAGGCCGAGATCCTGTCGCAGCAAATCTCGGCGACGGGCACCGCCTCCAAGAAGGACATCGAGGCCCCTTACGACTTCGCCGAAAGCTGCGGCAACAGCGGGCGTATGGACGCCATGTTGAAGGTGGCCAAGACGTACTTGGAGGTCGAGCTGGACGCGTTCGACCGCGATCCGCTGGCGCTGAACGTCCGCAACGGCACGCTGTTCTTCAAGCGCAAGAGGGACGCGAGGGACCGGGTTGTCGGCGCGGAGTTCGAGTTCCGGGCGCGGCACGACCCGTCCGACCGCATCACCCGCATGGCCGAGGTCAGTTACGACCCGAAAGCCGAAGCCCCAACCTTCCAGGCCGTGCTGTCGACTTGGCAGCCGCAAGAGGCGCTGCGCCGCTATCTCCAGGTGCTGACCGGGTACGGCTTCACCGGCGACACGTCCGAGCAGATCTTCATCATCTTCCAGGGCCTGGGCCGAGACGGCAAATCCACCTTCATGAACATGCTGCGGAAGCTGGCGGGCAGCTATGCCGCGACGGCCGACGTGAAGACGTTCCTGGAGCAGTCAGCCAAGGGCGGCGGGGACGCCAGTCCCGACCTGGCGCGCTTGGCCGGGGACACCCGCATGATCTCGACGGCGGAGCCGCCCAAGAACGCCAAGCTCTCGGACGATCGGATCAAGAGCTTCACCGGCGGCGGCAACATCACGGCCCGCCATCTGCGCGAGGGCATCTTCGAGTTCGAGCCGGTGGGCAAGGTCTTCATGGAGTGTAACGGCAGGCCTCAGCCACAGGGCTCCGATGAGGGGATCTGGCGACGGCTGAAGCTGCTGCTCTGGGAAAATCAGATCCCCAAGGGGACCGAGGACAAGGAGCTGCCGGGCAAGCTGGCCAAGGAGTGGCCCGGCATCCTGAACTGGATCATCGAGGGCATCGTCCGGTGGCTGACCGAAGGGGTCAAGGATCCGCCGCGCGTGCTGGAAGCGATCGAGGACTATCGCAAGGGCTCGTCCAGCTTCGCCGAGTGGGTCAGCGACAGCCTGGTTCTGGACAAGCAGGCGATCACGCCCGCCAAGGAGCTGTACGACAGCTACAAGACCTTCATCACCGATCGCGACGAGAAGCCGATGAGCCAGACGGCCTTCGGACGCGCGCTCGCCGATCTCCAGGTCATCCGAGGCAAGCGCGACAGCGTGGGCCGCGTCATGCGCACCGGCGGTCGCCTGAAGACTGATGCCGAGCGCGCGGAAGAGACGGCGGCATCGTCAGAGGATGGCGCCGGCGGCTCGTCGTTTGGCGACCTCGGCGGCGCATCCGGCTTCGACATCCCGCCAGACGAGGAGGAGGACTGAGGGCGCGCTCGCCCATGGCTGAGCGGCGCGGAGCCGAACAGTCCTGAACAGTTTGAACAGTTGAGCGCTGGGGCAAGGGCCTCGCCGAACAGTTCGAACCCTTCGGCGAGGCGGGACTATTCGGGGACTAAGTGACTGAACATGCTGGGGTTCTGAACAGTCCGAACAGTCCGAGGAGTTTTTAGGCGACGGGCGTTCGTGAGGCGAGTGCGGGCAGGCGTGGCGTTTGTGGGTGTGTGAGGCGGGTATCTGTTCGGCGTGTTCGGTGGGTGTCGAGGCGTCGTTTTCTGGGTCGGGAGAAGTTGAGATGATCAAATCCATCAAAAAAGAAACCAAGGGGAAGGCTGGCCGGAAGGCCAGCGGGCTGCGTGACCTGTCGCCGGAAGCCGCCCGTCTGGCGATCAGCCGCGCGGCGCTGAAGGCGATCGAGACGCCGCTTCCCGCCGACATGTCGGCGGCGGCGCAGCAGGCCCGCCGGGCGGAGATCGCTCGGGTCGGCCTGAAGCGGGAAGCGAACCTGGTTCGCGGCGCGGTCGCCCTGCAACAGACCGCCATCAAGGCCACGGGCATGTCGGGCGGCGTCCCGACCCTGGAGCGCCTGCTGCGCGAGGATGTCTCGATCGCCAGCTTGCAGACGGTCGAGGACGGCGAGTTCGCTATCCGCCCGGTCATGCGCAGCAAGACGATGCAGGAAGTGCTTGTCGGCTGTGGCGTTGAACGGGCGGTCGCCTGGGCTGGCGAGGAGTTCATTGCCGATGTTGAGCGTGCTACGATCGGGCGTCTGACGGCGAGCTACGGCGAGGGCCTTGGCGGTGCGGCCACGGCCGAGCCGCTGCGTGTTCTTCAAGCCCTCGACCGCCTTGGCCGGGCGCAAGAACGGCTGACCCGCAAAGAGCGTGTCGCCGTGTGGGGGTTCCTAGTGCTGGGCATGTCGGCCACCGACGTAGGCTGGGCCTTGGTGGGCAACGCGCTGGCCAAGTCCAAGGGCGGCGAGCGCGACATGCGGACTGCGACGGCCCTGGTCGTCGAGGCGGCCCTCGAACGAATGGCGGTCTTCTACAAGTCGGTGGCTTGACAAAAACGGCACTTTAATCGCATTGATTTCCTCACGCACAGAGTTGCGCCTGAAGCCCGCCTGGCCCCGAGCCCGGCGGGCTTTTTCGTGGCCCGACGCTGGCGCGTGCGTCCCCTCCCAAGGTCCCTCGGCGTTCGCCCTGTCGCGGGGCGGACGCCCTTCTTGGAACGTCATGCCGTCTCAGCCCCGCATCTTCCGGCCGGGCGGCTCACGCACGCGCGCCGCCGCCCGCCGCGCCTACGACGCCACGCGCCGGGAGCGCCACGCCTGGCGCGCCTGGTACGGCTTGGCACGCTGGCATCGCATCCGCGCCCGCCAGCTGCGCGACCATCCGCTTTGCGCTGAGTGCGACCGCCAAGGGCGCATCACCCCGGCCACCGTCTGCGACCACGTCGAGCGCCATGGTGGTGACGAGGAGAAGTTCTGGGCGGGTCCGTTTCAGTCGCTCTGCAAGCCCTGCCACGACCAGACCAAGCAGGCCGAGGAGGCCGCCGCCCGCCGCGCCGCCCCCCGGCCGCGCGAGGGTGGGGGGGCATCAAAAGTCTGAGGTCGCACGGCCCCCACACCGGCATCCCAAGGACGCGCATTTCTCCGCGATATTTTCCGAGAAGTTTTTTTATAGGGAGGGGCGCATGGGCCGACGCCCTGATCCCGCGAGCGTCCAGCACGCGAAAGGCAATCCTGGCAAGCGGCTCTCGAAAGCCGAGCGCCAACGGTTGGAAACGGAACGCCTGGCGGGCCTCATGGCCGCCGCACCGGCGGCTGGTGCCGACCCGCTGTCGCCGCCCGCGTTCCTCGACGATCGTTTCGGCCCGGCCCTTGCGATCTGGCGCGAGTACGCGCCGAAGCTGGCGGCCACCAACCGCCTGGGCGAGCTGCACCGCCACACCTTCGCGCTGTTCTGCGTCTACATGGGCGAGTGGGTGGCCGCGAACGAAGAGATCGCGACCAAGGGCGCGACCCAGCGGATCAAGAACGTGTCGGGCTCGTACCGGGAAGTCGACCGCCCGGCCGTCTCGCGCCGCGCCACGGCCTTCGCCGCCGTCATGGAGCTGTCGGAACACTTCGGCTTCACGCCGCACGACGAGTACGCGCTGATGAAGGACCAGGTGGCGATGGGCCAGCTGGGCCTCTTCGGTGGCCACAAACCTGCCGCCGGCGCAGCACCGCAACCCGAACAGCCTGCGGCGGCCACCCAGGCAGATGACCCCATCGGCGCCCTCGGGCGCATGGACTCGGCGCCTCCGCGCCTCAACTGACCATGACGCGCGACGAGGCGTCGGCCGGCGCCTCGCCGGACGCGATCCAGGCGGGCGCGGGTGAGGGTCTGTGGCCGCTGCCCGACTGGCTGAAGGCGGTCGAGAACGACCCGACCTACGCCTGGGTGGTCAGCCAGTGGAAGCGCGCGGCGTCGGTGCCCGGCGCGTGGTTCGACTACGCCAAGGCCCAAGGCGCGGTCGATCTGTGGCCCACGATCTTCACGCTGACCGAGGACCGTTTCGCCGGGAAGCCTTTCCGGCTGGTGCTCTGGCAGGAGTTCATCGTCAGGCTTCTGGTCGGCTGGAAGGTCCCGGTGGACCTGCTGGACGAAGAGACCGGCGAACCGAAGGTCGAGCAGGTCCGGCTGTTCCGCCGGCTGATGCTCTGGATCCCGCGCAAGAACGGCAAGAGCGAGTTCCTGTCGGCGCTGGCCCTGCTGTTCTTCGTGCTGGACGGCACCGTCGCCGGGCAGGGCTTCGCCTTCGCCCGCGACGAGAAACAGGCCAAAATCGTCTTCGACAAGATGAAGGCCATGGTCTTCGGCGAGCGGTCGGACGGCAAGCCGCCGCCGCTGGCCAAGGGCATCGTCGGCTTTAAGAAGTCGCTCTGGATCCCCAAGATCCGCGCGCTGTTCGAGCTGCTCACCGGCAAGGCCGAGGGCAAGCACGGCCGCTCCCCCACCGTCATCGTCGGCGACGAGATGCACGAATGGGAGAGCCTGGATCTCTCGACCACGCTTCGCCAGGGCACCGGCGCTCGCCTGCAGCCGATCGAGCTGTACGCCTCGACCGCCGGCTTGAAAGACAAGGTGGTCGGCTTCGGGCTCTGGGAAGAGAGCCGAGCGATCCTGGAAGGCCGCATCGACGATCCGACCACCCTGGTGGTGATCTTCGCGGCGGATCCCGACGCCGACCCGTTCGACGAGGCCAACTGGCCGGGCGCGAACCCATCTCTGGGCCTGTCGCCCACCATGGCCTTCCTTCGCCGCGAGGCGGCGCTGGCCAAGGACAATCCGCGCGCCGAGGCCCACTTCCGGCGTTATCACCTGAACCAGTGGGTCGACAGCCTCGTCCGGTGGCTGAACATCAAGCGCTGGGATGCTTGCGCCAAGGACAAGAAGGCCTGGAAGCGGTTCCCCAAGGATCTGCTGGGCCGCAAGTGCTTCCTGACCATCGACGTGTCGTCGACCCAGGACGTCACGGCGCTGGTGCTGCTCTTTCCGCCCGTCGAACCGGGCGAGCCCTGGAAACTGGTCTGCCGCTTCTGGGTGCCCGAGGAGACGCTGGCGAACCGGGTTCGCAACGATCGCGTCAGCTACGACAAGTGGCTGTCGGTCGGCGCGCTGGAAACGACCGACGGCGACTACGTCGACCAGAACGCCGTCTATGAGGCGGTGCTGGAAGCGTTCAACGACTACGAGATCGAGCTGCTCGGCTACGATCCGTGGAACGCCCGCAAGCTGATCGGCGACCTGCAAAAGGCCGGCGTCGATCCTGAGAAGATGGTCGAGATCCGGCAGGGCATCCCCTCGCTGGGCGAAGGCACCAAGCACTTCGAGCGCCTGGTCTATGCCGGGCAGATGGACCACGGCGGCAACCCGATGCTGCGGTGGATGGCCGGCAACACCGTCGTCCGCTTCGACGAAAACATGAACTTCGCCCCGGCGAAAAAGAAGTCGCGCGAGAAGATCGACGGCATCGTGGCTGCGGTCATGGGCTGTGTCCTGGCCTTCCATGAGGAGCCCGAGGACGAGACGATCGGCGGCGGCGTGGTGGAGGTCTGATGCTCGGATCCAAACCCAAGGCCTCAGCCGCCTACCGACCGCTGGTTCTCGACGAGCCCACGGCGGATCGGCCCCAGGCGGCCACCACTTTTCTGTCGTCGGATCTGGAAGCCTGGCAGGGTCTGTTCCCGGACCTGGGCGCCGGCGTTTCGCCCGACACGGCGATGCGGCACTCGACGGTCTATCGCTGCGTCTTCCTGATCGCCTCGGCCATCGCCAAAGCGCCGCTGCTGTCCTTCCGTCGTGGCGAGGACGGCTTCGACGTCGAGCTGGTCGATCATCCGACGGCCCGGTTGCTGAAGGATCGCCCGAACCCGCGCATGACGCCGACCATCTTCTGGCGGCTCGTCGTCTCGCAGATGCTGCTGCGCGGGAACGGCATCGTCTGGATCGAGCGCAAGCGCTCGGGCGAGCCGGTGGCGTTGTGGCCGATCCCCATGGCGCGAGTGACGATCAGCCTGCGGAATGGGCGTCTGCGCTACCAGCTCACGCTCGATGACGGCACGATCGTCATTGCCGACCAGGATGACGTGCTGCACTTCCCCGGCTCGACGGAATGGGACGGCCTCAAGTGCAAGACCCCGATCCAGGCGATGGGGGCCTCGGTCGGCATCGGCCTGGAAGCCGACCGCTATGCGCGCGCCTTCTTCGAGAACGACGCCACGCCGCCGAGCTACATCACCTATCCGAACCAGTTCAAGAACGCCTCGGGCCAGGCCGACGAGATCCGGCGCGTCTGGAAAGACCGCTTCGGCGGCGCCAATCGCCACTCTGGCCCGGCCGTCCTCGACCAGGGCGGCGAGGTCAAGCAGCTGGCCATCACGGCCGAGGACGCCCAGCTGCTGGAAACCCGCAAGTTCCAGGTCGAGGATATCGCCCGCCTGTTCGGCGTGCCCCGCCCTCTGCTGGCCATGGACGACACCACCTGGGGTTCGGGCGTCGACAGCCTGGGCCTGCTCTACCTCGTCCACACCCTCGACCCGCACTTCGTGGCCATCGCCCAGGAGTGCGGCTGGAAGCTCTACACGCCGTGGGCCATCTACTGCGCCCACGATCCCGAGGCCCTGACGCAGTCCGACACGAAGGGCCGATCGGAAGCCGATCGCGTGGCCCTCGGGGGTTCGGCCGGCCCTGGCTACATGACCCCCAACGAAGTCCGCCGCCGTCGCCGTCTGCCTCGCAGCAGCGACCCCAACGCCGACAAGCTGACCGGCTGGACCCCAAAGCAACAAAAGGACACCGGCGATGCAAAAGCTGATCCAGCTGCTGGCCAGCAATAGGGACCGCGGTTCACGGCCGAAGGCGGAAGCCTCGGGCGACGAGGCCACGGTCTATCTCTACGACGCCATTGGCTATTGGGGCGTCGAGGCCTCGGACTTCGTGAAGGATCTGAAGGCGATCGACGCCAAGACGATCAACCTCCGCATCAACTCGCCGGGCGGTGATGTCTTTGACGCGCGCGCCATGAAGGTCGCGCTGGAGCAGCACCCCGCCAAGATCGTCGCCCATATCGACGGCCTGGCCGCCTCGGCCGCCTCTTTCATCATGCTGGCGGCTGACGAGATCCGCATCGCCGACGGCGCCTTCGTGATGATCCACAAAGCCTGGGGGCTGGCTATCGGTAACGCCGATGAGATGCGATCGACGGCCGATCTTCTGGACAAGGTCGACGGCACGATCGTCAACGACTACGTGGCCAAGACCGGCAAGACGGTCGACGAGATCAAGGCCTGGATGGTCGCCGAGACGTGGTTCACGTCGGCCGAGGCCGTCGAGCATGGCTTCGCCGATAGCGTCGCCGAGAAGCAGAAGGCCGACGCCTCCGCCGCCAACTTCGATCTGTCCGCCTACCGCAACGCGCCCAAGGCGCTGCGCGAGCCGGCGGCGAAAACCTTCGACGCGATGGCCAATGACCGCCAGCGCTACGAAGCCCGGCTCGGTCTTTACGAACGCGCCGCCTAAGCCCGCCCCCGCAAGGGCGTCACACCCAAGCCGCCAGGGACCTCCCGGCGGCTTTTTTTATGGAGACTACTGATGTCGAAAGGCATTCAGGCCCTGCGGGAACAGCGCACGGCCCACGCGAAGGAAGCCCGCAACATCCTGGACACCAAGACCGGGAAGGACTGGACGCCGGAAGCCGCCGCCCAGGTCGACGAGCTGTACGCCAAGATCGACGATCTGGACGGCCAGATCGAACGCTTCGAGCGCGCCCTGCAGCTGGAAGACAGCCTGGACGAACGCGGCCAGCAGCGCGCCGAACGCACGGGCCGCTCGGCCGACGAAGAGACCCAGAACGTCGCCACCGAGAAGGCGATCTTCGACGCCTGGTGCCGGGGCGGCACCGAGCAGCTGAACGACGAGCAGCGCGAGTACGTCAACAACCGCCGCAACGAAGCCCGCCGTCTGTACGGCGCGCAGTCGGTCGGCACCGGCTCGCAGGGCGGCTATCTGGTGCCGCGCGACTTCTCGGCCACCCTGCTGGAGAAGATGGCGGCCTACGGCGGCGTGCGCTCGGTCGCCGACGTCATCCAGACGGACGGCGGCAACTCGATCGACTATCCGACGGTCGACGAGACCGGTCAGGAAGGCGAGCTGGTCGGCGAGAACACGGCGGCCACCGGCCAGGACGTCACGTTCGGCACGACCGACATCGGCGCCTACAAGTACAGCTCGAAGGTCGTGGTGATCCCGATCGAGCTGATCCAGGACTCGCGCATCGACGTGGAAGCCTACGTCAATCGCGCCCTGGCCGAGCGCATCGCCCGCATCACGAACCGCCACTTCACCGTCGGCGACGGCACCAACAAGCCGCGCGGCGTCGGCGTCGCTGCCGCCTTGGGCAAGACCGGCGCGGCCGGCCAGACCACCAGCGTCACCTACGACGATCTGGTGGACCTGGAGCACAGCGTGGATCCGGCTTACCGCGCCTCGGGCGCGCGCTGGATGTTCCACGACCAGACGCTGAAGGTGCTGAAAAAGCTGAAGGACAGCACGGGCCGCCCGCTGTGGCGTCCGGGCGTCACCGGCGGCGATCCGAACGACATCCTGGGCTACGGCTACACCATCAACCAGCACATGCCCCAGATGGCCGCCGGGGCGAAGTCGATCCTGTTCGGCGACTTCAAGAAGTACCTGATCCGCGACGTGATGGCGGTGACGCTCTTCCGCTTCGCGGACAGCAAGTACATGGAAAAGGGCCAGGTCGGCTTCCTGGCCTGGTCGCGCCATGACGGCGACCTGATCGACGCCTCGAACGAGGCGATCCGTCACTACGCCAACGCCGCCTCGTAAGGCGGTCACGCCGGCCGGGGCCAGCCTCGGCCGGCTTTCTCTTCCCCTCTCGCGACAGGATCGGACGCCATGTTCGTCAAGATGCTCACGGCCATGGCCGGAGACTCGTTCTCGTATGACCACGGCGCTGTGGTCGAGGTTTCGGCCAAGTACGGCAAGGCGTGGATCGCCGCCGGCCTCGCCGAGGAGACCCGCCCCACCGATGTGCTGGAGGCCGAGGTCGACAAGCAAGCGGGCGTCGCGAAAGAGGCGGTCGCCAAGTTCAAGGTCGCCGAGCGCGATCTGGCCATCCACAAGGCCGATCTCGCCACCGCTCGACAGCAGATCGAAGCTCTGACTGGACAGCTCTCTGAAGCGCAGGCGGCGACCCAGGCTCTCGCGACCGAAGTCGAGGAGCTGAAGGCCGCTCTGGGCGACGAGAAGGAAGCGAAGCTCACCGCCCTGGAGGAGCTGGACACCGAACGGGCGGCCAAGGAGATCCTCGAAAAGGAACTCGAGGCGCTGAAGACTGCCACGGCCCAAGAGCCGCCGCCGGCTGAAGGCGCCGCGTGATGGCGGACCCGGTCGAAGTCCCGGCCGGGCCGCTGGTCACGCTCGACCTGGCCAAGCAGCACCTGGGCGTCTGGTCCGATGAGACCGACGCCCTGATCACCCTCTATATCAACGCCGCCAGCGACCGGATCCGCACCCGCCACGTGTTCGGCGATCCGGTGCCCGCCAATGTCCAGGCCGCCGCCCTGCTGATGGTCGAAGACCTCTACGACCCACCGGAAGCAGCGGCAGGCGAGCAGCGGCTCAAGACGATCGACAACCTGCTGCGTCCCTTCGAGACGCCCGAGGTCTAGGAGGCCGCGCCATGGCTTGGGTCGAGTTCACCGACAAGTTCCGCTTCGTGCCGCCGGCCGATCGCCGCGTGACCGTGCGCTACAGCGCCGGCCAGCGTCTGTCGGTCACCGCCGAGTGCGCGCGCCAGGCCGTCGAGGCCGGCGTGGCCAAACGGATCAAGGCCCCGGCGCGCGGCGAGGTCCCGGATGGGACGCCCGCCGAGGAGGCCGAGGCGAACCCGGTTCGCGAGGACTGACATGCGTTCCGGCGAGTTTCGCGAGCGCGTCCAGTTCCAGCAGCGCGCGGAGGACGCCAACGGCGACCGCCTGGGGGACTGGGAGACCGAGGACAGGTTCAAGACCGCCGCGCGCTACACCTTCCTGCGCGGCGGCGAGACCGTCATGCAGGCCCGCCTGACGGGCGTCCAACCCGTGGTCATCCGCGTGCGCGCCTCTGGCGCCATGCGCGAGGTGACCGCCGACTTCCGTGTCCTGGATCTGCGCACCGGCGCGGCCTTCAACATTCGATCGGTGCTGCCGGATCTGCGCCGGAAGGTCATCGACTTCACCTGCGATACGGGCGGCGCCGATGGCTAGGACCAAGATGGAAGGTCGCCTTGAGCTGAAACGGAAGCTGGCGCGGGTGTCCGCCGCCGGCAAGGCCGTGCTCGAAGCGGAGGTCGAGTTCGAGGCGAACGACTTGGTCAAGCGCATGAAGCGCATCGTGCCGCGTCGCAGCGGCAACCTGGCCAAGTCGATCCGCAAGGAGCCCGGCCCGCACGAGCTATCGTGGGACGTCAGGGCTGGCGGGCCGCTCACCACGAAGAAGGTCGGCAACAGAACTTACGATGGCGATGTCATTCTAGGGTCTGGGGACACCCAGGGCCGCAAATCCAAGGCCGGCGGCAAGCACGTAACCTACGACTACGCGAACGCCTTTGAGTTCGGCACCCAAAACCAAGCGGCTGACCCATTCTTCTTCACCACAGTCCGCGCTCGGCGGAAGGCCTACAAGCGCCGCCGGACCACGGCGCTGAACAAGGCCGTGAAGGCGGCGGTGACGTGATGAAGGATCCTACCGGACCCATCTGCGCCTCGGTCGAGTTGCGGCTTCGCGACAATGCCGGCGTCAAGGCCAGCATGGGCGGCAAGACCCGGTTCTACGACCGCGTGCCGCCCAAGGCCGTCTTCCCGTATGTCGCGCTCGGCCCCGTCGAGGTCGATTTCGAGGACGAGACCGACTGCAACAGCGGCGCGGAAACCGTCGTCCAGCTGGACGTCTATTCGCGCGCCGTATCGTCGGACGAGATCCGCGCCGTCGCCGGCGCCGTGGTCGAGGCCTTCCGGGCCGACCTGGCAGTCCCCGGCCACGACGTGATCGATCAGGCGGTCAGCGCCGCCCGATACCTGGACGATCCGGACGGCCTTAGCCGTCACGCCGTCCTGACCCTGCGGTTCGATACCGAACCTTCCTAACCATCGCCGCCCTGGCGCGGCCTTTCCCGGAGACAACCATGGCCTCGGTCAAGGGCATCAAGCTCGTGCTGAAGGTGGGCAATGGCGCCACACCCGAAGTTTTCACCGCCCGCTGCAGCCTCAACGCTCAGCGCGGCATCAAATTCACCGCCGACCTGCAAGACACCGCCGAGGTGGACTGCACGGACCCGGAGAAGGTCGCCTGGCTCGTCCGCGACAAGGTTTCGGTCTCGGGCGAGGTGAACGCCTCGGGCACTCTGGACAAGGCCGACCTGGCCTTCTTCTTCGACTGGGTGAAGGACAAGGACGCCAAGAATTGCGAAGTCATCGTCGACATCGCGGGCGGCTACCTCTGGGAGGGCGCCTGGCACTGCTCGGACTTCGAGGTGACGGGTGATCGCGGCAAGCGCTGCGAGATCTCGATCAACCTCAAGTCCGAGGGCGAGATCGAAGGCTCGGCCGTCACCTGATGCGCTACGACGGCAGCATCCAGCTGTCGTTCGGAGGCGGAAGGCATACCTTCCGCCTCGCGCTCGGCGAGCTGCAGGAGCTGGAAGAGGTTTGCGGCGACCGCAAGCCCGACGGCTCGATCCGCCGCGTGGGGCCGGGCCTGGTGCTCGATCGCCTTCGCACCAACCAGTGGACCACGGCCGACGTGCTCCACACGATCCGCCTCGGCCTGATCGGCGGCGGCATGAACCAGTACGAGGCCCAGCGCCTGGCCGACCGCTACGTCGCCGAGCGCCCCGCCTGGTACGAGAACGCCCTGGTGGCCCTGGCCGCGCTGGACGCCGCCCTGGCGGAGCCCGACGAAAGCCTGGGGGAGCTGGGGGCGGAGGGGACGGGGACAGGCTCCCAGACGGCCGGATCTCCTTCGCCAGCATCTACCGAAACGCCGGCGCCCTAG
Protein sequences of DBSCAN-SWA_3 >CP023313|3945076:3967085|3964909_3965299_+|ATC26494.1|head,tail|DBSCAN-SWA MGRPPRRPRRTRFARTDMRSGEFRERVQFQQRAEDANGDRLGDWETEDRFKTAARYTFLRGGETVMQARLTGVQPVVIRVRASGAMREVTADFRVLDLRTGAAFNIRSVLPDLRRKVIDFTCDTGGADG >CP023313|3945076:3967085|3949914_3950184_-|ATC26476.1|DBSCAN-SWA MADDTIPHSDVLNSTAQNQLKSIIDRVERLEVEKAEIMEQIKEVYAEAKGNGFDVKVLKKVVRVRKQDRAKRQEEDAILDLYLSAIGEI >CP023313|3945076:3967085|3945076_3946279_+|ATC26471.1|integrase|DBSCAN-SWA MARLVNRLTDRTVRALKEPGLYPDGAGLYLEVTKGGSKQWAYIFQWRKKRTQMGLGGLLAVSLAQAREAAAEARKLVKQGVNPIEARKAEAMAAKTFGDMADAVLETKKDGWKNEKHREQWETSLNVHAAPIRPKPLADITTDDVLDILKPIWTRIPETASRTRGRIETVLDAAKAKGLRAGENPARWKGHLEHLLPKRKKLYRGHHAALGLAELGAFMAALRERQAMAARALEFTILTAARTGEVLGATWAEFDLYRAVWTVPAERMKLRVEHRVPLSTPAMALLTELARASNAKPDELVFPSVITGGKMSNMAMLMLLRRMTRPELTVHGFRSTFRDWAGELTNFPREVAEAALAHTVGTDVEKAYRRGDALLKRRKMMEAWATFCARTGVLVEFRRA >CP023313|3945076:3967085|3966216_3966648_+|ATC26497.1|DBSCAN-SWA MASVKGIKLVLKVGNGATPEVFTARCSLNAQRGIKFTADLQDTAEVDCTDPEKVAWLVRDKVSVSGEVNASGTLDKADLAFFFDWVKDKDAKNCEVIVDIAGGYLWEGAWHCSDFEVTGDRGKRCEISINLKSEGEIEGSAVT >CP023313|3945076:3967085|3964734_3964959_+|ATC26493.1|DBSCAN-SWA MAWVEFTDKFRFVPPADRRVTVRYSAGQRLSVTAECARQAVEAGVAKRIKAPARGEVPDGTPAEEAEANPVRED >CP023313|3945076:3967085|3952647_3952857_+|ATC26482.1|DBSCAN-SWA MSGGGHFSWTVGALEDLRAFNRIDGDLKKLAHIMGCTSQDVDQALWFLLGRSPEQALEAMHQYRMGACA >CP023313|3945076:3967085|3947388_3947790_-|ATC26474.1|DBSCAN-SWA MTSKHGLMSLALPEALGQVWRWLEDCEDLLALNPHRDLPRPPFKFSEAWVVVRLAAEGRGTPSAWKPIADWPERDEPILVATADGRRMFWAPSALKVAMTPGAPTHLQFPATHWMYAADLPDLPKEPSDDRGV >CP023313|3945076:3967085|3950183_3950546_-|ATC26477.1|DBSCAN-SWA MSRFQPLTWLARWEAAVRRLADKTAKPMIAIAISEERSGLAAHTVHRATYGETEFAAKALLGDIIASITSDGYVPGACTGCDERLARAQAAYALLEPQTAEFLPGTSAKALRWGGKRELH >CP023313|3945076:3967085|3951332_3951965_-|ATC26480.1|DBSCAN-SWA MLAVMEISEIPSRLRQLKRSQADLARHLNLDPSSLTKTLKGERRLRADELLKIEQFFGEPIDREAAVEQLSERRRTAPRRIPVYGYAAAGGSERIAINPGQVIDWVDAPPLWNGVGDLAVVRIIGDSMEPRYFAGEAVTVRLGLPPARGQDCLIEFNDGSGLIKSYRGQRDGQVFAWQYNPPSGEGNEVRYDASSVRAIHAVDLPFRMLR >CP023313|3945076:3967085|3961832_3962549_+|ATC26833.1|protease|DBSCAN-SWA MQKLIQLLASNRDRGSRPKAEASGDEATVYLYDAIGYWGVEASDFVKDLKAIDAKTINLRINSPGGDVFDARAMKVALEQHPAKIVAHIDGLAASAASFIMLAADEIRIADGAFVMIHKAWGLAIGNADEMRSTADLLDKVDGTIVNDYVAKTGKTVDEIKAWMVAETWFTSAEAVEHGFADSVAEKQKADASAANFDLSAYRNAPKALREPAAKTFDAMANDRQRYEARLGLYERAA >CP023313|3945076:3967085|3956776_3957544_+|ATC26486.1|DBSCAN-SWA MIKSIKKETKGKAGRKASGLRDLSPEAARLAISRAALKAIETPLPADMSAAAQQARRAEIARVGLKREANLVRGAVALQQTAIKATGMSGGVPTLERLLREDVSIASLQTVEDGEFAIRPVMRSKTMQEVLVGCGVERAVAWAGEEFIADVERATIGRLTASYGEGLGGAATAEPLRVLQALDRLGRAQERLTRKERVAVWGFLVLGMSATDVGWALVGNALAKSKGGERDMRTATALVVEAALERMAVFYKSVA >CP023313|3945076:3967085|3964429_3964723_+|ATC26492.1|head,tail|DBSCAN-SWA MADPVEVPAGPLVTLDLAKQHLGVWSDETDALITLYINAASDRIRTRHVFGDPVPANVQAAALLMVEDLYDPPEAAAGEQRLKTIDNLLRPFETPEV >CP023313|3945076:3967085|3946285_3947116_-|ATC26472.1|DBSCAN-SWA MEHGQPFIRLRLDVKEPIELTDFVSTFTSVAAEYERFIKTERPDRPGEATLFVKEVRAGSIIADLVPFLATAGGAETIVKVLDGANTVAEFVEHYGGRLSAFLKPGGRVEGAGKGELKDFYDQVAAVAATPQSSLEIAAFEIENGAYKAKAAFKFDTSQAREIRERVEEQKVLVDHKSGANHPRVLMVFTRSDVRSQAIGKRSGELVKIEALSPKSLPLIYASELAEQQIKHEITEAEDNVFKKGFVVDVSAEVRGGKMIAYRVTELHQVIDLDDE >CP023313|3945076:3967085|3947192_3947405_-|ATC26473.1|DBSCAN-SWA MIAASEKLSYRMDEAAEATGLSKATLYRLIERGELTTLKVGTRTLIRREVLEGLLQRLEVASTPPRPKVT >CP023313|3945076:3967085|3952291_3952642_+|ATC26481.1|DBSCAN-SWA MTRTPHATSADIAKWKRLAKEASRASGDGSVPLGDLEKATAAARRAIVPVGVIDKNNPFVRLVRTSQRYLASDRERIELASDMAMLAQTCEAQLAPAPDPRALAREPRSRLPYADE >CP023313|3945076:3967085|3957945_3958803_+|ATC26487.1|DBSCAN-SWA MVVTRRSSGRVRFSRSASPATTRPSRPRRPPPAAPPPGRARVGGHQKSEVARPPHRHPKDAHFSAIFSEKFFYREGRMGRRPDPASVQHAKGNPGKRLSKAERQRLETERLAGLMAAAPAAGADPLSPPAFLDDRFGPALAIWREYAPKLAATNRLGELHRHTFALFCVYMGEWVAANEEIATKGATQRIKNVSGSYREVDRPAVSRRATAFAAVMELSEHFGFTPHDEYALMKDQVAMGQLGLFGGHKPAAGAAPQPEQPAAATQADDPIGALGRMDSAPPRLN >CP023313|3945076:3967085|3950545_3950854_-|ATC26478.2|DBSCAN-SWA MSREAIPPFTLIRGLGVGVEEHPSLEAVALRIQALRGKQALSVAPCVGAYDTSETFQGFNIKVGDRAGGPTDWIATVVLPHAEGPQLAAAIRAAGKPLRRAA >CP023313|3945076:3967085|3966647_3967085_+|ATC26498.1|DBSCAN-SWA MRYDGSIQLSFGGGRHTFRLALGELQELEEVCGDRKPDGSIRRVGPGLVLDRLRTNQWTTADVLHTIRLGLIGGGMNQYEAQRLADRYVAERPAWYENALVALAALDAALAEPDESLGELGAEGTGTGSQTAGSPSPASTETPAP >CP023313|3945076:3967085|3960586_3961867_+|ATC26489.1|portal|DBSCAN-SWA MLGSKPKASAAYRPLVLDEPTADRPQAATTFLSSDLEAWQGLFPDLGAGVSPDTAMRHSTVYRCVFLIASAIAKAPLLSFRRGEDGFDVELVDHPTARLLKDRPNPRMTPTIFWRLVVSQMLLRGNGIVWIERKRSGEPVALWPIPMARVTISLRNGRLRYQLTLDDGTIVIADQDDVLHFPGSTEWDGLKCKTPIQAMGASVGIGLEADRYARAFFENDATPPSYITYPNQFKNASGQADEIRRVWKDRFGGANRHSGPAVLDQGGEVKQLAITAEDAQLLETRKFQVEDIARLFGVPRPLLAMDDTTWGSGVDSLGLLYLVHTLDPHFVAIAQECGWKLYTPWAIYCAHDPEALTQSDTKGRSEADRVALGGSAGPGYMTPNEVRRRRRLPRSSDPNADKLTGWTPKQQKDTGDAKADPAAGQQ >CP023313|3945076:3967085|3954843_3956469_+|ATC26485.1|DBSCAN-SWA MADGEFPGAAPMSAGPSAAELSGYPLNDFGNAMRFIRLVGGEVDKDGDVRELSAATVLYVRNHGWVGFNGQHWDLKAGEGLARKWAAKVARGMHAQAEILSQQISATGTASKKDIEAPYDFAESCGNSGRMDAMLKVAKTYLEVELDAFDRDPLALNVRNGTLFFKRKRDARDRVVGAEFEFRARHDPSDRITRMAEVSYDPKAEAPTFQAVLSTWQPQEALRRYLQVLTGYGFTGDTSEQIFIIFQGLGRDGKSTFMNMLRKLAGSYAATADVKTFLEQSAKGGGDASPDLARLAGDTRMISTAEPPKNAKLSDDRIKSFTGGGNITARHLREGIFEFEPVGKVFMECNGRPQPQGSDEGIWRRLKLLLWENQIPKGTEDKELPGKLAKEWPGILNWIIEGIVRWLTEGVKDPPRVLEAIEDYRKGSSSFAEWVSDSLVLDKQAITPAKELYDSYKTFITDRDEKPMSQTAFGRALADLQVIRGKRDSVGRVMRTGGRLKTDAERAEETAASSEDGAGGSSFGDLGGASGFDIPPDEEED >CP023313|3945076:3967085|3965291_3965768_+|ATC26495.2|DBSCAN-SWA MARTKMEGRLELKRKLARVSAAGKAVLEAEVEFEANDLVKRMKRIVPRRSGNLAKSIRKEPGPHELSWDVRAGGPLTTKKVGNRTYDGDVILGSGDTQGRKSKAGGKHVTYDYANAFEFGTQNQAADPFFFTTVRARRKAYKRRRTTALNKAVKAAVT >CP023313|3945076:3967085|3952853_3953060_+|ATC26483.1|DBSCAN-SWA MSWGNDQNLRIEDLPDFATRARGVLRHSGVTTLADAAANRGAWRDHPLATKTVIAHVEDVLAEYGAPA >CP023313|3945076:3967085|3958805_3960587_+|ATC26488.1|terminase|DBSCAN-SWA MTRDEASAGASPDAIQAGAGEGLWPLPDWLKAVENDPTYAWVVSQWKRAASVPGAWFDYAKAQGAVDLWPTIFTLTEDRFAGKPFRLVLWQEFIVRLLVGWKVPVDLLDEETGEPKVEQVRLFRRLMLWIPRKNGKSEFLSALALLFFVLDGTVAGQGFAFARDEKQAKIVFDKMKAMVFGERSDGKPPPLAKGIVGFKKSLWIPKIRALFELLTGKAEGKHGRSPTVIVGDEMHEWESLDLSTTLRQGTGARLQPIELYASTAGLKDKVVGFGLWEESRAILEGRIDDPTTLVVIFAADPDADPFDEANWPGANPSLGLSPTMAFLRREAALAKDNPRAEAHFRRYHLNQWVDSLVRWLNIKRWDACAKDKKAWKRFPKDLLGRKCFLTIDVSSTQDVTALVLLFPPVEPGEPWKLVCRFWVPEETLANRVRNDRVSYDKWLSVGALETTDGDYVDQNAVYEAVLEAFNDYEIELLGYDPWNARKLIGDLQKAGVDPEKMVEIRQGIPSLGEGTKHFERLVYAGQMDHGGNPMLRWMAGNTVVRFDENMNFAPAKKKSREKIDGIVAAVMGCVLAFHEEPEDETIGGGVVEV >CP023313|3945076:3967085|3953056_3954844_+|ATC26484.1|DBSCAN-SWA MTDAGRFLRAAMDYARRGIAVFPLQPRDKAPYGRTIGFKQAAHMPGLVEDWWTGRRRLELKADADNKSPVRARLNSNIGIATGAISGFWVLDLDGPEAEAAIARLEALHGPLPKTVQQATGRGRHLCFAWNPALPVRNMSKRSQERIGAKIDVRGDGGYIVAPPSVHPGKPEEGIPPGRIYAWAPGCSPQDLPFADAPAWLMELVCPPPEPEPVRAPIKTRAPAAGRASAYGEAALDGAVRTIHGARVGSRDTTLYRASCSIGCLIAGGEIDHDYGRSVLIEAGRVHVPDAMTVAQLERQVDRALAWGESRPRSAGERPRQRSVQTERRASATKGVEAIPGDEANAAAQLWDTARSAWCKATVQWFEARGLAGTPCGVTELLNRFRVHPNAPIGGGRTGPALIAPLVRLDGDPIEALAVLPFEADRITHLVGDSDGRVVMLTPLRPGHEPESLIVALDLQDAWYLLTQAWREEISAGAVIAPRLSTFAGGALGDRWGRIDPDAPAHDPTRPPWRASDQRSVWLAVRRDIRGPEMRARAFGGGSRPVRLEGDEASRFYGGLATQAWQRPAEKFNPANRVRVIGPLGTGGFNVGGQI >CP023313|3945076:3967085|3965767_3966178_+|ATC26496.1|DBSCAN-SWA MKDPTGPICASVELRLRDNAGVKASMGGKTRFYDRVPPKAVFPYVALGPVEVDFEDETDCNSGAETVVQLDVYSRAVSSDEIRAVAGAVVEAFRADLAVPGHDVIDQAVSAARYLDDPDGLSRHAVLTLRFDTEPS >CP023313|3945076:3967085|3957830_3958091_+|ATC26832.1|DBSCAN-SWA MARWHRIRARQLRDHPLCAECDRQGRITPATVCDHVERHGGDEEKFWAGPFQSLCKPCHDQTKQAEEAAARRAAPRPREGGGASKV >CP023313|3945076:3967085|3947779_3949705_-|ATC26475.1|DBSCAN-SWA MDLNLAFQNTDVLRSIDEHGPQIQAAELARITGRDKSNLHKTLGRLVDAGVLTKQEGGLRLAAGAHAVIAAREIAETGAVPQSEANPVRALEGYAFRFHRQIRIGEFNPRKQFDDEALDELAADIAERGLKTNLEVRADDPVADAEGLPTNQLIAGERRWRAIGRAIEQGRLDRDFPILVKIEDLTDNDHVVAALLENLHRVDLTPLEEAAAFDQLFSMNGWSTAKIAEKVSKTQRFVQLRLSLLKLTDDQKARLNRGDMTVKEALKALANRPEPVEATPIELLALALIWTNAYPEPAAHLSYWRKGECHGEAQDQRLIASLIERGLAETEEPTSWEPKHKLGTTYKGFLALEQHCSELTRAETEEARRGAMMSIDNAAYAFDKVVSTEAMALSQWLGKPLVNSVVVQKLIDEKNARDLASAEQTRLAAEEKNAARERAAQAEAQAQQLLVQLRAFEAEAPAMSREAFAAAFAVLLNERGYVGPFRIALRSPGKDQWGQPLPQTFVLVDGQDNMLQGAGTQFEVTRRLQAIALNYAMGSADIWSGSEIRPHQEDEAADDGPDPLNEVEFIDMVIGRFTEMHGLDAGRAQELAHKAYARVVEDVGEFGSEDGEWTRLDAQLVADGWLDDFPEAGDEEADHDQ >CP023313|3945076:3967085|3962621_3963878_+|ATC26490.1|capsid|DBSCAN-SWA MSKGIQALREQRTAHAKEARNILDTKTGKDWTPEAAAQVDELYAKIDDLDGQIERFERALQLEDSLDERGQQRAERTGRSADEETQNVATEKAIFDAWCRGGTEQLNDEQREYVNNRRNEARRLYGAQSVGTGSQGGYLVPRDFSATLLEKMAAYGGVRSVADVIQTDGGNSIDYPTVDETGQEGELVGENTAATGQDVTFGTTDIGAYKYSSKVVVIPIELIQDSRIDVEAYVNRALAERIARITNRHFTVGDGTNKPRGVGVAAALGKTGAAGQTTSVTYDDLVDLEHSVDPAYRASGARWMFHDQTLKVLKKLKDSTGRPLWRPGVTGGDPNDILGYGYTINQHMPQMAAGAKSILFGDFKKYLIRDVMAVTLFRFADSKYMEKGQVGFLAWSRHDGDLIDASNEAIRHYANAAS >CP023313|3945076:3967085|3952055_3952295_+|ATC26831.1|DBSCAN-SWA MSMGDASETPANRAVRLLGLIDLAHACRLTTDAVRKWQKSKGGLIPAAHQAGVLRLAREKGVSLTAADIIGGEAGGADQ >CP023313|3945076:3967085|3950853_3951036_-|ATC26479.2|DBSCAN-SWA MAVGGVHWTLAPATAGTAARQLNDAATALRQKPLRDPARRMARAATAAFAQAAQLEAGRA >CP023313|3945076:3967085|3963947_3964430_+|ATC26491.1|DBSCAN-SWA MFVKMLTAMAGDSFSYDHGAVVEVSAKYGKAWIAAGLAEETRPTDVLEAEVDKQAGVAKEAVAKFKVAERDLAIHKADLATARQQIEALTGQLSEAQAATQALATEVEELKAALGDEKEAKLTALEELDTERAAKEILEKELEALKTATAQEPPPAEGAA |
31 | Rhizobium_phage(25.0%) | integrase,protease,head,portal,capsid,tail,terminase | attL 3943481:3943496|attR 3955648:3955663 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
3983257 : 3990358
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >CP023313|3983257:3990358|DBSCAN-SWA CATGAAGCCGAGCCCGCGATGCGTCGCGTTCATCAAGAGCTTCGAGAAGTGCAAGCTCACCGCCTACATGCCCACCAAGAACGATCGGCCGACCCTGGGTTGGGGCTCGACCGGGCGCGACATCAAGATGGGCATGACGTGGACCCAGGAGCAGGCCGACGCCCGCTTCGACCGTGACCTGGCCATGTTCGCCGCTGGCGTCGATCACCTGATCGGCGGCGCGCCGACGACCCAGGCGCAGTTCGACGCCATGGTCTCGTTCGCCTACAACGTGGGCCTCGATGACGACGGCGACGGCGTGGCCGAGGGCTTCGGCGACAGCACGCTGCTGAAGAAGCACAAGGCTGGCGACTACAAGGGCGCGGCCGACCAGTTCAAGTTCTGGAACAAGCAGAAGGGCCTCGTCCTGGCCGGCTTGACCCGCCGCCGTGGCGCCGAGGCGGCGATGTACAGGGGCGAGATATGAGCGCCCTCCTCGCGTGGGCCATCCGAGCGGGTGGATGGCTGAAAGCCTTCGCCCAGACCCGCTTCGGCCGCCTGGTGCTCGCGGTCCTCTTCATCGCCCTGCTGGTGGGCCTGGCCTGTGGCCTGAGCTACGGCGCCGGCGTCGACCATGAGAAGGCCGCCGAGGCCAAGCGGCGCGCGGCGGCCGTCAAGGTCGTGCAGCGCGTGGCGACCAAGGGCCGGGAGATCTCGGCCGACGTCTCCTACCAGCTCGACGCCCGCAAGGTCGAGATCCGCACCGTCACTCAAACCCTGATCAAGGAGGTACCGGTCTATGTCACCGCTGAAAGCGACCGCGCTTGCGTTGTGCCTGTTGGCTTTGTCGGCCTGCACGACGCCGCAGCTCGCGGGTCCGCCGTTCCCGCCGCCCCCGGTGGACCTGTCGAAGCCCCTTCCGGGGTTCCGCTCTCTGCCGTCGCCGAAACCGTCGCCGCCAACTACGGCGTCGCCTTCGAATGGCGGGCTGAGGCCCTAGCCTGGCGCGACTGGTACGCGCGTCAGGCCGATCTCTGGCGAAAGAATATCAGGCGGCCGGATCCAGGGCCTGACCGCTCGCCGACAGGTTAGAGCGAACGAAAAGGCCCCGCCGGTTAGGCGGGGCCTCAAGCCCGTTACCGGCGCCAGCTAATCTGCGGCTTCGGGCGCGGCGCGCGCACACCCGGCATCAACTTCGGCAGATGGCGGATGGCCTCGGGGTCCCGCGAGAGGAAATAGTCGCGGGAGCGATCCGGGATCCAAACGGTTTCGAGAAATTCAATGAACAGCGGCAGCATGCGCATCGGGTACTGGCGGGCCTCGAACTCGCGCTTCGGCGTCCAGTGCATGTACATGCTGTAGCAGTCAGCGACACTGTCGTGGTTTTCCTTGAGCCACTCCGAGAACAGCCGGCCGGCGCTCACGTCCGGGCGCAGCTCCTTACCGTCCGGAGCGACGTCTGCCATCAGGTGGCCGACCATCTCCAGGCGGCCGTGAAGGCGCACGGCCATTTCGCTGATCACAGAGAAGTAGCCGGGCGAGACCCGATCCCAGTTCTCGTTGAACCGTCGAATGAACTTCGGCGTACCGGCGCGGCCATAGCTGCCCGTCTTCCGAATGGACGGCAGCACCTCGGAGGTCACCCACTTCTTGAACCGCTTCGAGCCTTCCTTCCGGCTCGTGAGGATCAAGCTGTAGAGCCCGGACTCGTTGATGATCGTCTGCTGCTGCGTCCGTCCCATGGCGTCGGCAATACCGACGGCATCCTTCTCATCTGCATCGAGGCGTCCAGCCGCATCACGCGGATTGAGAATTTCCAGCGCCTTGCACACGTCCGTGAGGACGAACCAGGGTTCGCCATCTCGATTGATAGTCCGGAATTGGTCTTTGTCTTCAGTCTCGAAGACTTGCAGTTCAAATTGGAGCGCCATGGCGCTGTAGGCCTTTCCCCGGCTGTGCCGGATTGTAAGAAGGCCGTTGACAGCGAGGTCATCCCGCCGGTACCACCGGCGTACTCCCAAGGTAACTCCACTGCCGAGGCGGGGCCTATACCGCATCGGACGTTGCCCTGGACTGGCGGGTCGTTGGCGCCCAACCGACGACCCGTCCACAGCAACGCTCCCCTACCCAATCAGCCTCTTCGCGGCGCGCTTCAAGCGGCTGCAGGCGTGTTTTTGCTTAACAACCTGTGAACGGCGGTGATTGCGCGAGCAATTTCAAGGGGTTGCGCGACCGCAAAAAAACCGCTTTTCAAACGGGAAAAGAGGCAAAGTTTTAAGAACTCAGCCTCGAACAGCGATTCTGACCTTGAGGCTTGGCCGGCGATTCTGTCTCGCGCGAGTAGATCTGTCGGCGGGACCTGGTGCCAGAGAACTTCTTGAGTAGTCGGAGAGCGACTGTCAGCGACCGTCTGGGCCGTAGCCTCGGTAGGCATTGTAGGCGCGGTTGGCGCGCCGGCTCTTCCAGACCTCGATATCGCGCGCGCTCGGCGCCTTCAGCTGGGTCCATGACCCACCCTTGGCCGTCCTGGCCGAGAACGTCACGAAGCCCTCGCAGGGGAACCGCCCATCGCTGACCACGGTGCACCGCGGATGTCGGCCCCAGCCGATAGTGTCCGGGCCAAGCACCTGCACCATCACCTTCAACGAGACCCGCTGGGTGACGCCGCAGCGCCCACAATGCGACCGTAGCTGCCAGCCTAGAACGTGCATCTCGCCCAGGGTCAGCGCCGTGAGCGGCATAGCCGGCGGACAGACGCGATCGGGGCTTGGTCCACCCATCTCCAGTTTCCAGCCTTCTTGCCCGGACCTCGGGCGGGACCGGCGCGCGCCAACGCGCCGATCGACGAGCGCTCCCTCGTCACGCACCGCCGTCGATCGCCGACGGCCCGTCCGCCCTGCCCAGGGCGGCGGACTCATAAGCGCCAAGAGGAGCTTCCCGCAAATGAAATGTTCCGTAAATGTTCACGTTCCGCCATGCGACCCGGTGACGCCTGCCGAACCGGCAGCGCCGTGGATCGGAGGTAAGCGCCACCTAGCCAAGCGCATTTGCCAGATCCTGGCCGCGACGCCTCATGACGCCTACTGCGAGCCGTTCATCGGCATGGGTGGCGTGTTCCTTCGCCGGGCAGTGCGCCCCGGCGTCGAGGTCATCAACGATGTGTCCGGCGACGTCGTGACGCTGTTTCGCGTCCTGCGGGCTCACCCCGAGGCGCTGCTTCGCGAGCTGCGCTGGCGGCCGGCGATGCGCGCCGAGTTCGACCGTCAAAAGGCGCTCGCGGCCCACGACCTCACGGACGTCGAGCGGGCCGCACGGTTCCTCTATCTCCAGGTGCTAGCTTTCGGCGGCAAGGTGCGCGGCCGAAACTTCGGCGTGGACCCGTCGGCGCCGCACAACTTCGATATCCGGCGCCTGGAGCCCAGGCTGCGCCGGATCCACGACCGACTGGCTGGTGTGACGATCGAGAACCTGGACTGGTCGGAGTTCATCCCGCGCTATGACCGCGCCGGCACGCTGTTCTACCTCGACCCGCCTTACTGGGGCAGCGAAGACGACTACGGCCGGGAGCTGTTCGCCCGCGCCGACTTCGAGCGCTTGGCCGACATGCTGTCGTCGATCGAGGGCCGGTTCCTGCTGTCGATCAATGATGTGCCGGCGATGCGAACCGCGTTCGCCTGGGCTCAGATCGAGGCGGTGAACACCGTCTACTCGGTTGGCAATGCGGACCCGTCGGCGCCGGCAAAGGAGCTGCTGATCGGACGCGGCGTGAACCTCGCAAGCGTCGCACCGCCCCCGACACTCTTCTGATGCGGCGACGCGACGCACCGAGCTAGCCTCGCGGAAGAATATGCCCTGAAGGCGCTCTAAGGCACGCCAAAGCTCGGCCGAGCCGCGCAGTCGCCTCAGAAAGTTTTTAGGGTAAGTTTGAGGGTACGAGATCGCGTGTTGGCGCGAAAGCCAAGCGCTGCGGGTGTTCTCGGAAAGTCCTTGGTGGAGCCGAGGGGAATCGAACCCCTGACCTCCTCATTGCGAACGAGGCGCTCTACCATCTGAGCTACGGCCCCAAGGAGCGGCGGAGATACGGCCCCAAAGCGCGCTCGTCAAGCGGCTCCTTGCTTGCCAGCGCGCGCGTGGCGCGGCAAGAAGCGGAAAGTCCGGACGGGAAACCCATGACCGCCATCATTCAATTCGTTTTCTTCATCCTCGGCGGCCTGCTCAGCCTCCTTTGGTGGGCCATCGTCATCTCGGCGATCCTCAGCTGGCTGGTCGCGTTCGACGTGATCAACCGCCGCAACACCGCTGTGTACCAGGTGCTGGATTTCCTGGATCGCGTGACCGGACCGGTGCTTCGCCCGTTCCAACGCGTGATCCCGTCGCTCGGCGGCGTCGACATCAGCCCGATCGTTGTTCTGCTGATCATTTCGGGCGTGCAGAACTATCTGCTGCCTGCGCTTCAAGGCACGCTGATCGCTCTTCTTGGCTGACGGCGTGGCGGTGACGCTCGTGGTGCGCCTGACCCCGAGGGGCGGGCGAGACGCCGCCGAGGGTTGGGCGCTTGACGCCGACGGCCGCCTCTATCTGAAGGTGAGGGTCGCCAGTCCGCCCGTCGAGGGCGCGGCGAATGCGGCCCTCATCGCCTTTCTGGCAAAGACCCTGAAGATTCCTCGCTCGGCCGTACGACTGGCGGCTGGCGAAACCGCGCGGCTAAAGCGTCTCGAACTGGAGGGCGTAGACCCAGCCGATGTCGCACGCGCGTTCGGGCCGCCGAACTAGCGCGGGTCGGCATTCTTGGGATTTTATGCCTCAAAATCTGCCCCAAAGGCAGCAACACGCAAAAATCTTCCGGTGCGCGCCGGAGACCCAAGCCGTTGACGAAACGCTGGTCTTTCATTTCCTCTTAGCCCCTCGTGACTAGGGTTATTGGGTAAGGGATTGCTCTGCGATCTCCTTGGGGGGGCGGAGCGCCCATGAGCGCCGAGACGAAGGATATTGACCTGGAACGCGCTGTCGCCTCACGACGCGTCAACACCTATCTGCGCCTCAGCTCGACCTTTGCGATCTGCGCGACCCTGCACCTCGTCGTTGGCTTGCGCTGGGTTTGGATCTGGGGAGCGCTCTACACCGCGATCCAGTTCCTGGAGACCTGGCTGGCGCTTCAGCTCATCAAGCGACCGCAGAGCAGCCCTGATCGCTGGCGCCGCTTCTCCGTCATCGCCCTGCCCTTTCTGACCTCCTCCGTCTTCGGATTCCTGGCGATTCCCCTGTTCGCGTCCGACGCCCGGTTCGCGCCGACCTTGGGCGGCATGCTGCTGGCCGGAGCGCTGATGAACGTCGTCATCGTCCACGGCGGCCTTCGCTCAGCGACCATCGCCGCAGCGACGCCGTACGTCGGCTATCTTCTGGTCATCCCCATCGTGGCCCGGCAGGCCAATCCCGATGCACCCCTGGCCAACGCCCTGTGGTTTGGCGCGCTCCTGCTCATCGCAGCGATGGCGGTCGCCTCGCGCAAGCTTCACGCCGCGCTGAAAGCGGAGGCGGACGCCAAGACCGAGGCGGAGCGTCGCCGCCACGAGGCCGAGGAAGCCGTCGCGGCGAAATCGGCCTTCGTCGCCATGATCAGTCACGAACTTCGCACGCCGATCAGCGCCATTCTCGCGGGGGCCAGCCGCCTGCACAGCGAAGCGCCCGAAGCCAGCTCCAAGGTGCACGCCCAGCTGATCGCCGACGCGGGCGGGCTGATGCGCACGCTGCTCAATGACCTTCTGGACTTTTCGCGGCTCGAGGCGGGCCGAATGTCGGTGGAGAAATCGCCGTTCAACCTCCGCCAGAGCCTGTCGGATACGCTGCGCTTCTGGCGGCCGGAGCTTGCCCGCAAGGGACTGAAGCTGCGCGTCGTCGGCGCCTCGACGATCCCGCAGTGGACGCTCGGCGACGCCATGCGCCTGAAGCAGGTGCTCAACAACCTATTGTCCAACGCCGCCAAGTTCACCCGAAGCGGCGGCGTGTCGGTGACGCTGGGGGCCGAGATCGTCGGCGATCAGGTGCGGCTGGTTGTCGACGTGGTCGATACCGGTCCAGGCATTCCCGAAGCCGCGCTTCCGCGCCTGTTCACGCCTTTCGATCAACTCAATGAGTCGGTGGCTCGCCTGCACGGCGGGTCCGGGCTTGGTCTGGCCATCAGCCGCGAGCTCGCGCGCCTGATGGGCGGTGATCTGACGGCGTCCAATGCGCTAGGCCAGGGCGCGCACTTTCGGTTTTCGGCATTGCTGGAGGCCGCCGAAGCGCCTGTGGTTCCGACGCTGGGCCCATCGATCAGCGGCCTGCGCGTGCTGGTCGTCGACGATCACGTCGTGAACCGGCGGGCCGTCGAACTGGTCCTGCAGCCCTTCGGCGTGGAAGCCACCCTCGCCGAATCCGGGGAGGAGGCCCTCGAACTTCTTCGCTCCGAGGTCTTCGACGCCATTCTGATGGACGTCTACATGCCGGGTATGGACGGGCGCGAGACGACCCGCACGCTCCGCGCCGGCCAGGGCCCCAATCGCGATGCGCCGGTCGTTGCGGTCACCGCCTCGGCCACGATCAAGGATTGGGAGGCCTGCGCGGCGGCCGGCATGAACGCCCATGTCGCCAAGCCCATCGATCCGGCCGAACTGTTCTCGGCGCTCGCTCAAGTCATGGCGGCGCGCGGCGCTGTCGAACCTTTCAAAACCGAAGTCGCCGCGAGCAGCGCCTGAAGCCGGGCGCTCAGGGCCGCTTCGTCCTTGAAGTCACACTCCCAGACCACCTGGGTCCGCCATCCGGCCTGCGCCAGAGCCGCCAGGGCGCGCTGATCCCGCGCGATGTTTCGGGCGACCTTGGCCAGCCAGTAGTCGCGGTTCGCCTTGGGCACGCGCGCGCCCCGGGCGCAGTCGTGACCGTGCCAGAAGCAGCCGTGGACGAAGAAGGCCAGCCTGCGCCCCGGCATGACCACGTCCGGCGACCCCGGCAGATCCTTGCGATGCAGGCGATAGCGGGCGCCCAGCCCCGTAAGCAGCCGGCGCAGGCGAAGCTCAGGACCGGTGTCCTGGCTTCGCACGCGCGCCATGACGGCCGAGCGCTTGGCCTTGTCATAGACGTCGGACAA
Protein sequences of DBSCAN-SWA_4 >CP023313|3983257:3990358|3985634_3985976_-|ATC26516.1|DBSCAN-SWA MPLTALTLGEMHVLGWQLRSHCGRCGVTQRVSLKVMVQVLGPDTIGWGRHPRCTVVSDGRFPCEGFVTFSARTAKGGSWTQLKAPSARDIEVWKSRRANRAYNAYRGYGPDGR >CP023313|3983257:3990358|3983718_3984327_+|ATC26514.1|DBSCAN-SWA MSALLAWAIRAGGWLKAFAQTRFGRLVLAVLFIALLVGLACGLSYGAGVDHEKAAEAKRRAAAVKVVQRVATKGREISADVSYQLDARKVEIRTVTQTLIKEVPVYVTAESDRACVVPVGFVGLHDAAARGSAVPAAPGGPVEAPSGVPLSAVAETVAANYGVAFEWRAEALAWRDWYARQADLWRKNIRRPDPGPDRSPTG >CP023313|3983257:3990358|3983257_3983722_+|ATC26513.1|DBSCAN-SWA MKPSPRCVAFIKSFEKCKLTAYMPTKNDRPTLGWGSTGRDIKMGMTWTQEQADARFDRDLAMFAAGVDHLIGGAPTTQAQFDAMVSFAYNVGLDDDGDGVAEGFGDSTLLKKHKAGDYKGAADQFKFWNKQKGLVLAGLTRRRGAEAAMYRGEI >CP023313|3983257:3990358|3988202_3989969_+|ATC26520.1|DBSCAN-SWA MSAETKDIDLERAVASRRVNTYLRLSSTFAICATLHLVVGLRWVWIWGALYTAIQFLETWLALQLIKRPQSSPDRWRRFSVIALPFLTSSVFGFLAIPLFASDARFAPTLGGMLLAGALMNVVIVHGGLRSATIAAATPYVGYLLVIPIVARQANPDAPLANALWFGALLLIAAMAVASRKLHAALKAEADAKTEAERRRHEAEEAVAAKSAFVAMISHELRTPISAILAGASRLHSEAPEASSKVHAQLIADAGGLMRTLLNDLLDFSRLEAGRMSVEKSPFNLRQSLSDTLRFWRPELARKGLKLRVVGASTIPQWTLGDAMRLKQVLNNLLSNAAKFTRSGGVSVTLGAEIVGDQVRLVVDVVDTGPGIPEAALPRLFTPFDQLNESVARLHGGSGLGLAISRELARLMGGDLTASNALGQGAHFRFSALLEAAEAPVVPTLGPSISGLRVLVVDDHVVNRRAVELVLQPFGVEATLAESGEEALELLRSEVFDAILMDVYMPGMDGRETTRTLRAGQGPNRDAPVVAVTASATIKDWEACAAAGMNAHVAKPIDPAELFSALAQVMAARGAVEPFKTEVAASSA >CP023313|3983257:3990358|3987711_3988008_+|ATC26519.1|DBSCAN-SWA MADGVAVTLVVRLTPRGGRDAAEGWALDADGRLYLKVRVASPPVEGAANAALIAFLAKTLKIPRSAVRLAAGETARLKRLELEGVDPADVARAFGPPN >CP023313|3983257:3990358|3987404_3987719_+|ATC26518.1|DBSCAN-SWA MTAIIQFVFFILGGLLSLLWWAIVISAILSWLVAFDVINRRNTAVYQVLDFLDRVTGPVLRPFQRVIPSLGGVDISPIVVLLIISGVQNYLLPALQGTLIALLG >CP023313|3983257:3990358|3984371_3985166_-|ATC26515.1|DBSCAN-SWA MALQFELQVFETEDKDQFRTINRDGEPWFVLTDVCKALEILNPRDAAGRLDADEKDAVGIADAMGRTQQQTIINESGLYSLILTSRKEGSKRFKKWVTSEVLPSIRKTGSYGRAGTPKFIRRFNENWDRVSPGYFSVISEMAVRLHGRLEMVGHLMADVAPDGKELRPDVSAGRLFSEWLKENHDSVADCYSMYMHWTPKREFEARQYPMRMLPLFIEFLETVWIPDRSRDYFLSRDPEAIRHLPKLMPGVRAPRPKPQISWRR >CP023313|3983257:3990358|3989902_3990358_-|ATC26521.1|DBSCAN-SWA MSDVYDKAKRSAVMARVRSQDTGPELRLRRLLTGLGARYRLHRKDLPGSPDVVMPGRRLAFFVHGCFWHGHDCARGARVPKANRDYWLAKVARNIARDQRALAALAQAGWRTQVVWECDFKDEAALSARLQALLAATSVLKGSTAPRAAMT >CP023313|3983257:3990358|3986178_3987042_+|ATC26517.1|DBSCAN-SWA MKCSVNVHVPPCDPVTPAEPAAPWIGGKRHLAKRICQILAATPHDAYCEPFIGMGGVFLRRAVRPGVEVINDVSGDVVTLFRVLRAHPEALLRELRWRPAMRAEFDRQKALAAHDLTDVERAARFLYLQVLAFGGKVRGRNFGVDPSAPHNFDIRRLEPRLRRIHDRLAGVTIENLDWSEFIPRYDRAGTLFYLDPPYWGSEDDYGRELFARADFERLADMLSSIEGRFLLSINDVPAMRTAFAWAQIEAVNTVYSVGNADPSAPAKELLIGRGVNLASVAPPPTLF |
9 | Burkholderia_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|