Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
LR133964 | Klebsiella pneumoniae strain NCTC11359 genome assembly, chromosome: 1 | 3 crisprs | cas3,csa3,DEDDh,DinG,WYL | 0 | 0 | 5 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
LR133964_1 | 1660494-1660736 | Orphan |
NA
Consensus repeat of LR133964_1
|
2 spacers
spacers of LR133964_1
>1.1|1660543|66|LR133964|PILER-CR GATGTGCGCAATATCATCCGTTTGCGTCGCCACTTAATGTAGGCCTGATAAGCGCAGCGCCATCAG >1.2|1660658|52|LR133964|PILER-CR ATACCCCGATTAGTTAGCCATCCCGTAGGCCTGATAAGCGAAGCGCCATCAG |
CRISPR arrays and Neighbor proteins around LR133964_1
The CRISPR arrays of LR133964_1 >merge|LR133964|1|1660494-1660736|PILER-CR GTGGCGGCTGCGCCTGACCCGGCCTGGGATGTGCGCAATATCATCCGTTTGCGTCGCCACTTAATGTAGGCCTGATAAGCGCAGCGCCATCAGGCAATGTGGCGGCTACGCCTGACCCGGCCTGGATTATACGGTGGGTGTCATACCCCGATTAGTTAGCCATCCCGTAGGCCTGATAAGCGAAGCGCCATCAGGCAGTGTGGCAGCTAAGCCTGACCCGGCCTGGATTATACGGTGGGTGTC >LR133964|1|1|1660494-1660736|PILER-CR GTGGCGGCTGCGCCTGACCCGGCCTGGGATGTGCGCAATATCATCCGTT TGCGTCGCCACTTAATGTAGGCCTGATAAGCGCAGCGCCATCAGGCAATGTGGCGGCTACGCCTGA CCCGGCCTGGATTATACGGTGGGTGTCATACCCCGATTAGTTAGCCATC CCGTAGGCCTGATAAGCGAAGCGCCATCAGGCAGTGTGGCAGCTAAGCCTGA CCCGGCCTGGATTATACGGTGGGTGTC
>LR133964.1|VDY59442.1|1658574_1660422_+|AsmA-protein MRRILTTLMILLAVIVAGLTSLVLLVNPNDFRAYMVNEVAERSGYQLDLDGPLRWHVWPQLSILSGRMTLTARGAEEPVIRADNMRLDVALLPLLSHQLQVKQVMLKGAVIQLTPKTEAVRDSSAPVVPHDNTLPLAPEDRGWSYDVRQLQVADSVLFFQHENGEQVTVRDIRLQMEQDENHRATVDFSGRVNRDQRDLALSFSATVQGGDYPHSLKADFTQLSWQLRGAELPPDGINGQGSLQASWQEDDKTLRFDNLNLMANRSTVTGSGSVVLGDRPDWSLDLHATTLDLDSLLAQRSPATDSSASQQGQSQTRPLRPVIADSDEREDYQSLRGFNGRMALSADQLQWRGLNFTQVQSEISNQQGLLTVSKMQGNLDGGQLSLPGTLDARGDTPLATFQPALQNVEIGSLIKAFNYSLNLTGKLSLSGEFSGTRIDADDFRRHWQGQAQLQMADTRTEGLNFQQLVQQAVERSTNVRAQENYDNATRLDSVSSRLTLDNGLVTLNRLQGQSDVMAMTGEGQLDLQKENCDMRFNVRVLGGWKGEGKLIDRLKQTAIPLRIYGEWQSLSYSLQVDQILRKQLQDEAKQRLNDWVERNKGSKDGNDAKKLLDKL >LR133964.1|VDY59441.1|1657962_1658544_+|Deoxycytidine-triphosphate-deaminase MRLCDRDIEAWLDEGRLAINPRPPVERINGATVDVRLGNKFRTFRGHTAPFIDLSGPKAEVSAALDRVMSEEIVLPEGEAFFLHPGELALAVTYESVTLPADLVGWLDGRSSLARLGLMVHVTAHRIDPGWSGCIVLEFYNSGKLPLALRPGMPIGALSFEPLSGPAARPYNRREDAKYRDQQGAVASRIDKD >LR133964.1|VDY59440.1|1657230_1657872_+|uridine-kinase MTDMSHQCVIVGIAGASASGKSLIASTLYRELREQVGDEHIGVIPEDSYYKDQSHLSMEERVKTNYDHPSSMDHSLLFQHLQMLKSGQPIELPVYSYVEHTRTPNTIHVEPKKVIILEGILLLTDARLRNELNFSIFVDTPLDICLMRRIKRDVNERGRSMDSVMAQYQKTVRPMFLQFIEPSKQYADIIVPRGGKNRIAIDILKAKISQFFE >LR133964.1|VDY59439.1|1656267_1657116_+|DNA-3-methyladenine-glycosylase-II MVLLPWTPPYDWAWMVGFLQARAVAGVERFDEGGYSRSFGVEGHRGLIHLAPDEEAQGLRVTLSPGLQPVAEICYARIGQLFDLACDPRQVARALGSLAQARPGLRLPGALDAFEQAVRAVLGQLVSVAMAARLTAKVAAGWGEPLAEAPGSVLFPTPEALSRADPQALKALGMPLRRAEALIHLARAALSGELPLTAPADIDAGLRQLQTLPGIGRWTANYFALRGWQAKDIFLPDDYLIKQRFPGMTPAAIARYARRWQPMRSYALLHIWYTDDWAPAAE >LR133964.1|VDY59438.1|1654777_1656130_-|heat-shock-protein-YegD 
MFIGFDYGTANCSVAVMRENTPQLLTLENGSALLPSMLCAPTREAVSEWLYRHHDVPTHSDENQALLRRAIAANRDEDIEVLRNSVQFGLASLHQYVEDPEEVYFVKSPKSFLGASGLKPQQVALFEDLVCAMMLHIKLQAESQLPEQIDQAVIGRPINFQGLGGDEANAQAQGILERAAHRAGFRDVVFQFEPVAAGLDFEATLSEEKRVLVVDIGGGTTDCSLLLMGPQWRERADRQQSLLGHSGCRIGGNDLDIALAFKCLMPLLGMGGETEKGTALPILPWWNAVAINDVPAQSDFYSTANGRLLNDLLRSARDADKVALLLKVWRQRLSYRLVRSAEESKIALSSAASVETALPFIQDDLATAIAQQGLEAALDQPLTRIMEQVRLALDSSQTTPDVIYLTGGSARSPLIKKALAAQLPGIPLAGGDDFGSVTAGLARWAQVVFR >LR133964.1|VDY59437.1|1652632_1654489_-|histidine-kinase MLLVAPQAQAATRQVGIDIPVQWYADDSGQMTLDRFAALPADQLATTRQIPSFGYSRKTWWLRSELPGTWFAGEPRWLQLGPTFVDHLTIYYRPLGSDGPWTQRIFGDRDGARGSDLHYRERVLILPPPPTAAGYELVFRLQSTSTLILLASLSSAQAFVQRATADTAFWSFYFGLAAVASGVALWLALALRRRLLWGICLFSLNYPLVAALHGFPEWFFGHAALPFQDFMISSLSLFSYATALWLHSEIFDLKKNMPRLHQLLIAAVILNLVLQVSIPLGFYGFAMQIEGVVFIIITPVLLFTSWWLWRKKAIDRTTLLLGLLPPFYVVAAVLVQLSIHGIIPFHMAIYSLWQYALIVHIITVLIIAILRVRAENRQLEQKQRLARELQIEREASFHQRQFMGMVAHEFRTPLAVIQAALENLRLSAASTSQEARFDRIGRAATRLVQLTDNCLADARLASHDLHVERQQTALLTVINMAASVVAISHDHYLNIRQHGAVESPQLQADAGLLCIAIANLLDNAVKYSPPGEIAIDIHSDAGQTELRIRDHGPGLPAGQAELIFERYRRGEHTSPVPGGTGLGLYVARQIVQAHDGKLWLAEHGPDGCTFILTLPTVA >LR133964.1|VDY59436.1|1651916_1652618_-|putative-response-regulator-receiver MALRLAIIEDNADLLDELLAWLGYRGFEVWGTRSAEAFWRQLHSHPVDIVLVDIGLPGEDGFSVLNYLHELGHYGLVVVSARGQQQDKLQALSLGADAYLIKPVNFAHLAETLTALGARLRQDRPAAPPAEAIGTPPAVSPGSWRLQEDKLISPDARTLELTQQEYRLVQLLMRNRNEVCSKLDLHACLFSHESEPDLHRIDVVVSRLRHKARQQGIHLPVRAIFGKGLAFIS >LR133964.1|VDY59435.1|1651538_1651679_-|Uncharacterised-protein MAMQTWLVLLLCIFFFSISVYSFISYLKDRRRQKFTFNDKRSMRRK >LR133964.1|VDY59434.1|1650811_1651423_-|positive-transcription-regulator MRNINVTINTRNAFVRESLVAMVNDLTRGDLRARFSWRNTDLSAEDIIICEVIPGEIYLCNTLIKNRKRGSSLIILHSYDQLPEDEFMINCLKGVIFVSLKTASIPQLLAIIKSELQHCMTPTATDAAGRELSCASCPHRVLSRSQTAVVHGILEGLDMSKIAALQRVSPRTAAYHKNKIMEKYSLNNNHEFFQFMNLLRERW >LR133964.1|VDY59433.1|1649264_1650539_+|C4-dicarboxylate-transport-protein 
MANANKLTLFIVVFMLMGILSGAAIHAYATPTTVSAWADNITLLTDLFLRLIKMVIAPLVFSTLTVGIMRLGETATIGRVGGKAMVWFITSSVLSILVGLVIVTFQHPGAGLNLAVPKEAVDTGLAVSGMSLKGFLSHTIPTSITEAMANNEILQIVVFSMFFGIAGASLGEKFNAPLVAALNVVSHIMLKVTGYVMYVAPLAIFAAISSVIASQGLGILLNYASFIGGYYLAVLLTSAVLIAVGYMVLKKEVFRLLNMLKDPVLVAFTTSSSEAAYPKTLERLVKFGCSRNIVSFVLPIGYSFNLVGSMVYCSFAAMFIAQAYNVPLSFSEITVMMLTLMLASKGIAGVPRSALVVLAATIPSFNIPVAGILLLMGIDHFLDMGRSAINVLGNGIATAMLSKNEGLLTDEEAQPDWEVEKAEA >LR133964.1|VDY59443.1|1660857_1662444_-|putative-transmembrane-protein MMEWIADPSIWAGLVTLVVIELVLGIDNLVFIAILAEKLPPAQRDRARVIGLLLAMLMRLLLLASISWLVTLTKPLLTYHDLSFSARDLIMLFGGLFLLFKATVELNERLEGKDSDNPVQRKGARFWGVVTQIVVLDAIFSLDSVITAVGMVDHLAVMMAAVVIAISLMLMASKALTRFVNSHPTIVILCLSFLLMIGFSLIAEGFSFIIPKGYLYAAIGFSVMIEALNQLAQFNRRRFLSANMTLRQRTTEAVMNLLSGQKEKAELDADTASLVADQDQHPLFNPQERLMIERVLNLNQRSVSSIMTSRHDIERINLSAPEEEIRSLVEKNQHTRLVVTGGKDNEDLLGVVHVIDLLQQSLRQEPLDLQALVRQPLVFPEGLPLLSALEQFRQARTHFAFVVDEFGSVEGIVTLSDVMETIAGNLPNEVEEIDARHDIQHHQDGSWTVNGHMPLEDLVQYVPLPLDDKREYHTVAGLLMEYLQHVPQVGETIEIDGYTLRTLQVDSHRVQKVQIVPPVKQDELDYEV >LR133964.1|VDY59444.1|1663315_1664206_+|UDP-glucose-pyrophosphorylase MASLKAVIPVAGLGMHMLPATKAIPKEMLPIVDKPMVQYIVDEIVAAGIKEIVLVTHSSKNAVENHFDTSYELEALLEQRVKRQLLAEVQAICPPGVTTMNVRQGQQLGLGHSILCARPIVGDNPFVVVLPDIVLDDASADPLRYNLAAMVARFNETGRSQVLAKRMPGDLSEYSVIQTREPLETEGQIGRIVEFIEKPDEPQTLDSDLMAVGRYVLSADVWGELERAAPGAWGRIQLTDAIAELAKKQSVDAMLMTGESYDCGKKMGYMQAFVNYGLRNHKEGNKFRESIKKLLA >LR133964.1|VDY59445.1|1664668_1665319_+|acid-phosphatase MSWQFVSFLGDSTVLLPSAAVLLIVLFLRAPSRQVACNWALLFGITGAIVSASKLAFMGWGIGIREIDFTGFSGHTALSAAFWPIFLWLLFSRATTGVRRAAIVVGYALAALVGYSRLMIHAHSTSEVVAGLLLGASGSALLLLLQSRTSRAGNVQLSWSGVFSVMMIPVLLLNTGVKAPTQSLLGEIAVKIGPLEKPFTRADLHKHFIDCAKEMR >LR133964.1|VDY59446.1|1665631_1665763_-|Uncharacterised-protein MKIIHSLIPIYLDAKTMHTLSIKLIKQRRKICRLNIHRCKQNH >LR133964.1|VDY59447.1|1666322_1667756_+|Surface-assembly-of-capsule 
MMKIARIALALGLLTSVSSPVFAAGLVTNDNELRNDLSWLSDRGVIQLSLSTWPLSQEEITRALKKAKPSYSSEQVVLARINQRLSSLKADFRVSGYTSTDQPGTPQGFGQSQPADNSLSLAFNNSGEWWDVHLQGNVEGGERISNGSRFNANGAYGAVKFWNQWLSFGQVQQWWGPGYEGSLIRSDAARPMTGFLMQRAEQAAPETWWLRWIGPWQYQISASQMNQYIAVPHTKIIGGRLTFTPFQSLELGASRIMQWGGEGRPQSFSSFWDGFTGQDNTGTDNEPGNQLAGFDFKFKLEPTLGWPVSFYGQMIGEDEAGYLPSANMFLGGVEGHYSWGKDAVNWYIEAHDTRTNMERTGYSYTHHIYKDGYYQQGYPLGDAVGGDGQLFAGKAELVTEDNQRWSTRLVYAKVNPKNQSINKAFHNSDTLKGVQLGWSGDVYKTVRLNTSLWYTDADNSDNDDIGASAGIEIPFSL >LR133964.1|VDY59448.1|1667900_1669037_+|polysaccharide-export-protein MKKKIVRLSALALAISFLSGCSIIPGQGLNSLRKNVVELPDSDYDLDKLVNVYPMTPGLVDKLRPESLMARPNPQLDNLLKSYEYRIGVGDVLMVTVWDHPELTTPAGQYRSASDTGNWVNSDGTIFYPYIGKVQVAGKTLSQVRQEIANRLTTYIESPQVDVSIAAFRSQKVYVTGEVTNSGKQAITNIPLTVMDAINAAGGLAPDADWRNVVLTHNGQDTKVSLYALMQKGDLTQNHLLYPGDILFIPRNDDLKVFVMGEVGKQSTMKMDRSGMTLAEAIGNAEGMSQAFSDATGVFVIRQVKGDKQGKIANIYQLNAQDASAMVLGTEFQLQPYDIVYVTTAPLVRWNRVISQLVPTITGVHDMTETARYIRTWP >LR133964.1|VDY59449.1|1669491_1671648_+|tyrosine-autokinase MSSVKNKSVQKETDELDLGRLIGEFIDHRKLIISVTSLFIMLGLVYAIFATPIYQADALVQVEQKQANAILSNLSQMLPDSQPQSAPEIALLKSRMVLGKTVDDLNLQAEVKQKYFPIFGAGWARLMGEKPGDISVTRLYLKPYQDEVPEITLTVNDSKNYTISSNDLNLNGKVGQLLEGKGISLKINKIDADPGTQFNINYLTKLKAITDLQENLNIADQGKDTGMLSLSLTGSNAELIKNIVDSISNNYLAQNISRQAAQDEKSLDFLNQQLPTVRAELDLAEDKLNLYRQQKDSVDLSLEAKSVLDQIVNVDNQLNELTIRESEVSQLYTKEHPTYKALMEKRKTLQDEKTKLNKRVSAMPETQQEVLRLSRDVESGRAVYMQLLNRQQELSISKSSAIGNVRIIDKAVTLPKPIKPKKIIVVLGSVILGLVISIGVVLLRIFLRRGIESPEQLEDIGINVYASIPVSETFAQKLVKKAGWKKNKIEEHQGFLAIENPADLAIESIRGLRTSLHFAMMEARNNVLMISGASPNAGKTFVSSNLAAIIAQTGKRVLFVDADLRKGYTHKLFNVSNDNGLSDFLSGKVSIEQTVKTLPNVDFDFISRGAVPPNPAELLMHSRFGEIVSWANKHYDLIIMDTPPILAVTDAAIIGNYAGTTLLVARFEQNTAKEIDVSVKRFEQSGVNVKGCILNGVIKKASSYYGYGYNHYGYKYND >LR133964.1|VDY59450.1|1672048_1673446_+|UDP-glucose-lipid-carrier-transferase 
MNIIPDRYHITGNASIISMLQRFSDIAVVSFSLYMIFLAHGLKSDLAFGMLILLALVVFQMIGGITDFYRSWRGSKLSLELFMIIKNWTLSILIVFFVLSIFPILHINNFIFYQWYMLVALGFVICRSFIRFTIGLLRKLGYNRRTVALVGSMASGVELLHSFQEQIWLGFVVKGIYDDSLTSEIRGIPYAGDLAKLVEDARLGKIDRVYIALGIENESKIKQVVKELTDTTCSVLLIPDLFTFNILQSRTEEINGVPVVPLFDTPLNGINMVFKRLEDIVVSCLILVLISPVLLLISCAVKMSSSGPVFFRQLRYGIDGKPIRVWKFRSMTVMENDTNVKQATKNDVRITKVGKFLRRTSLDELPQFFNVLFGHMSVVGPRPHAVSHNEQYRTLIQGYMLRHKVKPGITGLAQVNGWRGETDTLEKMEKRIEYDLLYIRNWSIGLDFKIIFLTIFKGFINKSAY >LR133964.1|VDY59451.1|1673592_1674390_+|family-2-glycosyl-transferase MNKYAILIPSYNSTVSEINSTFMQLPINVPIYVIDDGSEIPFELQAKHVLTKFPLLNIIRFEKNKGIEHALNAGLDLIIENFTYVARIDIGDSCSPERFIEQIKFMEENKDYSIVGCWASFYNENKDFLFVKKMPLQDKNIRRYMYINNAFVHPSVMMRCSSVKAVGGYGYKYAACEDYDLFFRLMHVGKVKNFDKAWVNYEVSSKSISSLKRKIQVKNRIKIIIANFTFIENGIYPYYGLSRNIIMYFIGRNITTWLQKIMNRS >LR133964.1|VDY59452.1|1674445_1675486_+|polysaccharide-polymerase MLPYVLVLILVAGWAYLEKNALNWNAFWIPAIILILFATVRDYTIGTDTPTYTRNFRLHINPDGIAFNPYIEQGYQFLEHLALRISFDYSVYFFICASIIIPLTLLSIKKLSPDYVMSLFCHITYGFYTFFFSGVRQGIAMAICLYAHSFIINKKIIPAIVIIFIATLFHVSAYILFIFLISTSVKIRLEYKVLIYFIMSLLLSNVAINYMADSNDRYATYTGTVNNSGGYIILAFYSVIAFVFYIFCSAYRRKDKVYSYFEELMLCGIAFLIPIALLGTDPSGPQRFLYYFSWSVVILFPFLLRRFKGIGVRILFIVISIMYYCIATERFFSLSPYLVNRQFSFF |
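
The neighbor-protein records above use headers of the form `>LR133964.1|VDY59442.1|1658574_1660422_+|AsmA-protein`. The following is a minimal Python sketch for splitting such a header into fields. The field meanings (contig accession, protein accession, start, end, strand, description) are inferred from the records on this page rather than from a documented specification, and `NeighborProtein` and `parse_neighbor_header` are hypothetical names introduced only for illustration.

```python
from typing import NamedTuple


class NeighborProtein(NamedTuple):
    """Fields inferred from the header layout; names are illustrative."""
    contig: str
    protein_id: str
    start: int
    end: int
    strand: str
    description: str


def parse_neighbor_header(header: str) -> NeighborProtein:
    """Split a header such as
    '>LR133964.1|VDY59442.1|1658574_1660422_+|AsmA-protein'."""
    # Limit the split to 3 so a '|' inside the description would be preserved.
    contig, protein_id, location, description = (
        header.strip().lstrip(">").split("|", 3)
    )
    # The location field is assumed to be 'start_end_strand'.
    start, end, strand = location.split("_")
    return NeighborProtein(
        contig, protein_id, int(start), int(end), strand, description
    )


asma = parse_neighbor_header(
    ">LR133964.1|VDY59442.1|1658574_1660422_+|AsmA-protein"
)
# Distance between the end of this gene and the start of array LR133964_1
# (1660494), using the coordinates exactly as printed on the page.
gap_to_array = 1660494 - asma.end
```

Keeping the coordinates as integers makes it straightforward to sort the neighbors by position or to measure their distance from the array.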
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
LR133964_2 | 4269327-4269421 | Orphan |
NA
Consensus repeat of LR133964_2
|
1 spacer
spacers of LR133964_2
>2.1|4269351|47|LR133964|CRISPRCasFinder GTTTTTATTCGCTGAACTGCCCGGGAGTGATATATCTCTGATTTATG |
CRISPR arrays and Neighbor proteins around LR133964_2
The CRISPR arrays of LR133964_2 >merge|LR133964|2|4269327-4269421|CRISPRCasFinder TTAATGGTCACTATGGTTTTTATGGTTTTTATTCGCTGAACTGCCCGGGAGTGATATATCTCTGATTTATGACTGTGGTTTTTATGGTTTTTATT >LR133964|2|1|4269327-4269421|CRISPRCasFinder TTAATGGTCACTATGGTTTTTATG GTTTTTATTCGCTGAACTGCCCGGGAGTGATATATCTCTGATTTATG ACTGTGGTTTTTATGGTTTTTATT
>LR133964.1|VDY61868.1|4267873_4269238_-|malate-Na(+)-symporter MSTTDNAFSATLEPIDTPKTTLKQRWWHIMDNWKVGIVPLPLFLLAGGLIALDCLGGKLPSDIVVMVATLAFFGFACGEFGKRLPVLGKLGAAAICATFIPSALVHYGLLPDVVIESTTKFYKSTNILYLYICCIIVGSIMSMNRTTLIQGFLKIFFPMLCGEVVGMLVGIGVGTLLGMEPFQVFFFIVLPIMAGGVGEGAIPLSMGYAALMHMEQGVALGRVLPMVMLGSLTAIVISGCLNQLGKRFPHLTGEGQLMPNRSHETRSLSESEGVSGKTDVGTLASGALLAVLLYMMGMLGHKLIGLPAPVGMLFLAVLLKLANVVSPRLQEGSQMVYKFFRTAVTYPILFAVGVAITPWQELVNAFTLTNLLVIVSTVSALVATGFLVGKKIGMHPIDVAIVSCCQSGQGGTGDVAILTAGNRMSLMPFAQIATRIGGAINVSLGLLFLSHYLA >LR133964.1|VDY61867.1|4267154_4267694_-|phosphoribosyl-dephospho-CoA-transferase MSVDTPAQAGVSIDALLAAKEQRAARQADWLAHYQQPVISLTLVTPGAVKDSIRYRNMMGVALQACDQLLWKHRWQTLDRQVLWLPTGPEALWCVAHPASEIKAMCSTLEQSHPLGRLWDIDVICPQNGLVGRQSLGESQRRCLLCDEPAHACARSRRHDTDLVVARVEQMIDAWFARD >LR133964.1|VDY61866.1|4266420_4267080_+|ribosomal-large-subunit-pseudouridine-synthase-A MAMENYNPPQDPWLVILYQDEHIMVVNKPSGLLSVPGRLEEHKDSVMTRIQRDFPQAESVHRLDMATSGVIVVALNKAAERELKRQFREREPKKQYVARVWGHPQPAEGLVDLPLICDWPNRPMQKVCYETGKAAQTEYEVLEYAEDNTARVRLKPITGRSHQLRVHMLALGHPILGDRFYATPEALAMAPRLLLHAETLTITHPAYGNAMTFRAPIDF >LR133964.1|VDY61865.1|4263501_4266408_+|RNA-polymerase-associated-protein-RapA 
MPFTLGQRWISDTESELGLGTVVALDARMVTLLFPAIGENRLYSRNDSPITRVMFNPGDTITSHEGWQLHVDKVNEENGLLSYTGTRLDTQEANVTLREVLLDSKLVFSKPQDRLFAGQIDRMDRFALRYRARKFQSEQYRMPWSGLRGQRTSLIPHQLHIAHDVGRRHAPRVLLADEVGLGKTIEAGMILHQQLLSGAAERVLIVVPETLQHQWLVEMLRRFNLRFSLFDDERYAEAQHDAYNPFETEQLVICSLDFVRRSKQRLEHLCDAEWDLMVVDEAHHLVWSEEAPSREYQAIEQLAERVPGILLLTATPEQLGMESHFARLRLLDPNRFHDFEQFVEEQQNYRPVADAVALLLAGNKLSDSELNTLGDLIGEQDIEPLLQAANSDREDAQAARQELISMLMDRHGTSRVLFRNTRNGVKGFPKRELHTIRLPLPTQYQTAIKVSGIMGARKTAEERARDMLYPEQIYQEFEGDTGTWWNFDPRVEWLMGYLTSHRSQKVLVICAKAATALQLEQVLREREGIRAAVFHEGMSIIERDRAAAWFAEEDTGAQVLLCSEIGSEGRNFQFASNLVMFDLPFNPDLLEQRIGRLDRIGQAHDIQIHVPYLEKTAQSVLVRWYHEGLDAFEHTCPTGRTVYDSVHDELINYLAAPESTDGFDDLIKSCRQQHDALKAQLEQGRDRLLEIHSNGGEKAQALAESIEEQDDDTSLIAFSMNLFDIVGINQDDRGENLIVLTPSDHMLVPDFPGLPEDGCTITFERDVALSREDAQFITWEHPLIRNGLDLILSGDTGSSTISLLKNKALPVGTLLLELIYVVEAQAPKQLQLNRFLPATPVRMLLDKNGNNLAAQVEFESFNRQLSAVNRHTGSKLVNAVQQDVHAILQQGEAQIAKAAQGLIDAARNEADEKLTAELSRLEALKAVNPNIRDDELAAIESNRQQVMDALAQAGWRLDALRLIVVTHQ >LR133964.1|VDY61864.1|4260956_4263314_+|DNA-polymerase-II MTQPRAGFLLTRHWRDTPQGTELSFWLATDDGPLQVTLPPQESVAFIPEAQRAQAERLLQGEKGLRFAPLALKDFHRQSIVGLYCRAHRQLMRLEKMLRDSGVTVYEGDIRPPERYLMERFITAPVWVEGETRGSQLVNARMKPNPDYRPPLKWVSLDIETSRHGELYCIGLEGCGQRVVYMLGPEPETPPDVDFELVFIASRPLLLEKLNAWFAEHDPDVLIGWNVVQFDLRVLQKHAERYRIPLRLGRGNSELEWREHGFKNGVFFAQANGRLIIDGIDALKSAFWNFSSFSLEAVARELLGEGKAIDNPWDRMDEIDRRFHEDKPALAIYNLQDCELVTRIFHKTEIMPFLLERATVNGLPADRHGGSVAAFSHLYFPRMHRLGYVAPNLGDVPPQASPGGYVMDSRPGLYDSVLVLDYKSLYPSIIRTFLIDPVGLVEGLAQPDDQHSIEGFLGARFSRDKHCLPGIVSQIWHGRDEAKRQHNKPLSQALKIIMNAFYGVLGTSACRFFDPRLASSITMRGHAIMRQTKALIEAKGYDVIYGDTDSTFVWLKRPHSEALAAEIGRELVSDVNAWWAQELSKSQLTSALELEYETHFCRFLMPTIRGADTGSKKRYAGMIQEGDAQRMVFKGLETVRTDWTPLAQQFQQELYLRIFRNQPYQDYVRETIARLMNGELDEQLVYRKRLRRPLAEYQRNVPPHVRAARLADEHNLKLGRAQQYQQRGTIKYVWTTSGPEPVDYQQSPLDYDHYLTKQLQPVAEGILPFVNDDFATIVTGQLGLF >LR133964.1|VDY61863.1|4260085_4260781_+|L-arabinose-isomerase 
MLEDLKRQVLEANLALPKHNLVTLTWGNVSAVDREKGVFVIKPSGVDYRVMTADDMVVVSLESGEVVEGNKKPSSDTPTHRLLYQAFPTLGGIVHTHSRHATIWAQAGQSIPATGTTHADYFYGPVPCTRLMTDAEINGDYEWETGNVIVETFRQQGIDPAQMPGVLVHSHGPFAWGKNAEDAVHNAIVLEEIAYMGIFCRQLAPQLPAMQQTLLDKHYLRKHGAKAYYGQ >LR133964.1|VDY61862.1|4258445_4259948_+|L-arabinose-isomerase MTIFDNYEVWFVIGSQHLYGPEALRQVTKHAEHVVNSLNAEAKLPCKLVLKPLGTTPDEITHICRDANYDDKCAGLVVWLHTFSPAKMWINGLTILNKPLLQFHTQYNAALPWDSIDMDFMNLNQTAHGGREFGFIGARMRQQHSVVTGHWQDKEAHQRIGGWMRQAVSKQDTRHLKVCRFGDNMREVAVTDGDKVAAQIKFGFSVNTWAVGDLVQVVNSISDGDISALVDEYESSYRLTPAAQVHGDKRQNVLDAARIELGMKRFLEQGGFHAFTTTFEDLHGLKQLPGLAVQRLMQQGYGFAGEGDWKTAALLRIMKVMSTGLQGGTSFMEDYTYHFDNGNDLVLGSHMLEVCPTIATAEKPILDVQPLGIGGKADPARLIFNTQTGPAIVASLIDLGDRFRLLVNTIETVPTPHDLPKLPVANALWKAQPDLRTASEAWIIAGGAHHTVFSHALNLDDMRQFAELHDIELTVIDNDTRLPSFKDALRWNEVYYGSKR >LR133964.1|VDY61861.1|4256725_4258435_+|ribulokinase MAIAIGLDFGSDSVRALAVECASGAELATSVEWYPRWREGQYCDGANNRFRHHPRDYIESMEAALKSVLASLSAEQRADVVGIGVDSTGSTPAPVDAEGNVLALREEFADNPNAMFVLWKDHTAVEEAEAITRLCHQPGKEDYSRYIGGIYSSEWFWAKILHVTREDSAVAQAAASWVELCDWVPALLSGTTRPQDLRRGRCSAGHKSLWHESWGGLPPASFFDELDPIINQHLAWPLFTDTWTADVPVGTLSAEWAQRLGLSQSVAISGGAFDCHMGAVGAGAQPNALVKVIGTSTCDILIADKESVGERTVKGICGQVDGSVVPHFIGMEAGQSAFGDIYAWFGRILGWPLEQLAQQQPALREQIKASQKQLLPALTEAWANNPSLEHLPVVLDWFNGRRTPNANQRLKGVITDLNLATDAPALFGGLIAATAFGARAIMECFTEQGIPVNNVMALGGIARKNQVIMQACCDVLNRPLQIVASDQCCALGAAIFAAVAAGVYEDIPAAQQRMASQVETTLQPRPAQAQRFEQLYQRYQQWSVSAEQHYLPSAAKAEKAPQSQAALTH >LR133964.1|VDY61860.1|4255542_4256388_-|Arabinose-operon-regulatory-protein MAETQNDPLLPGYSFNAHLVTGLTPIEAQGYLDFFIDRPLGMKGYILNLTIRGEGVINNHGEQFVCRPGDMLLFPPGEIHHYGRHPDASEWYHQWVYFRPRAYWHEWLNWPTIFAQTGFFRPDEQWQARFGELFGQIVDAGQGAGRYSELLAINLLEQLLLRRMEAINESLHPPLDNRVRDACQYISDHLADSHFDIASVAQHVCLSPSRLSHLFRQQLGVSVLGWREDQRISQAKLLLSTTRMPIATVGRNVGFEDQLYFSRVFKKCTGASPSEFRAGCE >LR133964.1|VDY61859.1|4254646_4255414_-|DedA-family-inner-membrane-protein-YabI 
MQALLEHFITQSTVYSLLAVMLVAFLESLALVGLILPGTVMMAGLGALIGGGEVNFWQAWLAGIVGCLLGDWISFWLGWRFKKPLHRWSFMKKNRALLEKTEHALHQHSMITILIGRFVGPTRPLVPMVAGMLDLPVAKFVLPNIIGCLLWPPLYFLPGILAGAAIDIPADENSASFKWLLLGAALLAWLAGWLCWRLWRSAKTSGDRLTRWLPRGRLLWLSPLMVALAATALTFVFRHPLMPVYLAILHKVIAR >LR133964.1|VDY61869.1|4269562_4271188_+|two-component-sensor-kinase-for-citrate MKLSFQYKLFISLVAFFSVLFIALGIYYYFDASRQLYQEMSARAKIQAEEIALMPNLRQQVSRHDPQAIQAFMQQIAAHSDASFIVIGDRQGVHLFHSVHPEWVGTRLVGGDNQAVLEGKSITTIRKGGLGVSLRSKTPIVDDAGRVIGIVSVGYLTSYLDSITLTKVINIFIAAVLLLIALFIFSWYFTRSIKKQIFSLEPREIGLLVRQQKAMMESIFEGVIVIDRQRRIEVINHAARSLLGLSQPARQLRGQSIDSVISPQPFFASGDMLERDTHDELCRFNQLTVLASRVRIMLENTLQGWVITFRDRNEINALTAQLSQVKRYVDNLRIMRHEQLNRMTTLSGLLHMGHYDEAIRYIQAQSEHAQELLDFISSHFHSPTLCGLLLGKATRAREKGVALSFDPACRIDRPLPSLMESELISIIGNLLDNAIEATQRAELPHEPVEVLIQLNARELIIEVADRGVGIRPDIRERIFERGVTTKTRGDHGIGLYLIEHYVTQAGGTIEVADNAPRGTIFTLFIPADAHACPQPEAHDAS >LR133964.1|VDY61870.1|4271177_4271876_+|two-component-response-regulator-for-citrate MHHDLVDVLIIEDESELARLHAELVQKHPRLRLAGMAASLAQARQLLHATPPQLVLLDNYLPDGKGVTLMTDPALATSQCSVIFITAASDMETCSQAIRNGAFDYILKPVSWKRLSQSLERFIQFYDQQREWKIVDQQNVDSLYQLQAKNYRVDSGSKGIEEKTLALVQGLFSDREAHCFSVDEVVSAAGLSKTTARRYLEHGVETGFLEVEMLYGKIGHPRRLYRRAKVKS >LR133964.1|VDY61871.1|4271908_4272736_-|DnaJ-like-protein-DjlA MQYLGKVIGVAVALLMGGGFWGVVLGFLVGHMFDRARSRRLNLFANQQERQSLFFSTTFEVMGHLTKSKGRVTEADIHVANALMDRMNLHGASRTAAQQAFRDGKADNYPLREKMRQLRSVCFGRFDLIRMFLEIQLQAAFADGELHPNEREVLFVIADELGISRAQFDQFLRMMQGGAQFGGGSQQRSYGQHGGNAGWQQAQRGPTLEDACNVLGVKPTDDAATVKRAYRKLMNEHHPDKLVAKGLPPEMMEMAKQKAQEIQKAWELIKEQRGF >LR133964.1|VDY61872.1|4272979_4275328_+|outer-membrane-protein-Imp 
MKKRIPSLLATMIASALYSQQGLAADLATQCMLGVPSYDRPLVEGRPGDLPVTINADHAKGNYPDNAVFTGNVDINQGNSRLRADEVQLHQQQAAGQAQPVRTVDALGNVHYDDNQVILKGPKAWSNLNTKDTNVWQGDYQMVGRQGRGTADLMKQRGENRYTILENGSFTSCLPGSDTWSVVGSEVIHDREEQVAEIWNARFKLGSVPIFYSPYLQLPVGDKRRSGFLIPNAKYSTKNGVEFSLPYYWNIAPNFDATITPHYMNKRGGVMWENEFRYLTQLGSGLTEFDYLPSDKVYEDDHSSDSNSRRWLFYWNHSGVIDQVWRLNADYTKVSDPDYFNDFSSKYGSSTDGYATQKFSAGYVNQNFDATVSTKQFQVFDRESSNSYSAEPQLDVNYYQNDVGPFDTHLYGQVAHFVNSNNNMPEATRVHFEPTINLPLSNGWGSLNTEAKLLATHYQQSNLDKYNAANGTDYKESVSRVMPQFKVDGKMVFERDLQEGFTQTLEPRVQYLYVPYRDQSEIGNYDSTLLQSDYTGLFRDRTYSGLDRIASANQVTTGLTSRVYDAAAVERFNISVGQIYYFTESRTGDDNINWENNDTTGSLVWAGDTYWRIADEWGLRGGIQYDTRLDNVATGNGTIEYRRDENRLVQLNYRYASPEYIQATLPSYSTAAQYKQGISQVGMTASWPIVDRWSVVGAYYYDTNTRKAANQMLGVQYNSCCYAIRLGYERKVNGWNSNDNGGESKYDNTFGINIELRGLSSNYGLGTQQMLRSNILPYQSSL >LR133964.1|VDY61873.1|4275382_4276669_+|survival-protein-SurA-precursor-(Peptidyl-prolyl-cis-trans-isomerase-SurA) MKNWKTLLLGIAMIANTSFAAPQVVDKVAAVVNNGVVLESDVDGLMQSVKLNAGQAGQQLPDDATLRHQILERLIMDQIVLQMGQKMGVKISDDQLDQAIANIAKQNNMTLDQMRSRLAYEGINYNTYRNQIRKEMLISEVRNNEVRRRITVLPQEVEALAKQIGDQNDASTELNLSHILIPLPENPTSDEVAAAQEQANSIVEQARNGANFGKLAITYSADQQALKGGQMGWGRIQELPGIFAQALSTAKKGDIVGPIRSGVGFHILKVNDLRGGTQNISVTEVHARHILLKPSPIMNDAQAQAKLEQIAAEIKSGKITFAQAAKTYSEDPGSANQGGDLGWATPDIFDPAFRDALMRLNKGQTSGPVHSSFGWHLIELLDSRQVDRTDAAQKDRAYRMLMNRKFSEEAATWMQEQRASAYVKILSN >LR133964.1|VDY61874.1|4276674_4277664_+|4-hydroxythreonine-4-phosphate-dehydrogenase MPETRKVVITPGEPAGIGPDLVVQLAQRDWPVELVICADGALLTDRAKRLGLPLSLLPYDPAQPPVPQRAGTLTLLNVPLNVPAEPGVLNVQNGAYVVETLARACDGCLSGEFAALVTGPVHKGNINDAGIAFTGHTEFFEERAKASKVVMMLATEELRVALVTTHLPLKAISEAITPELLREIITILDHDLRTKFGIAQPHVLVCGLNPHAGEGGHMGTEEIDTIIPVLEEMRAKGMNLSGPLPADTLFQPKYLDHADAVLAMYHDQGLPVLKYQGFGRGVNITLGLPFIRTSVDHGTALELAGQGKADVGSFITALNLAIKMIVNTQ >LR133964.1|VDY61875.1|4277660_4278482_+|dimethyladenosine-transferase 
MNNRVHQGHLARKRFGQNFLNDQFVIDSIVSAINPQKGQAMVEIGPGLAALTEPVGERLDQLTVIELDRDLAARLQTHPFLGPKLTIYQQDAMTMNFGELAEKMGQPLRVFGNLPYNISTPLMFHLFSYTDAIADMHFMLQKEVVNRLVAGPNSKAYGRLSVMAQYYCQVIPVLEVPPSAFTPPPKVDSAVVRLVPHSTMPYPVKEIRVLSRITTEAFNQRRKTIRNSLGNLFSVEVLTELGIDPAMRAENISVAQYCLMANWLSDNLPTKES >LR133964.1|VDY61876.1|4278484_4278862_+|ApaG-protein MIHSPRVCVQVQSVYIESQSSPEEERYVFAYTVTIRNLGRSQVQLLGRYWLITNGHGRETEVQGEGVVGEQPHIPAGGEYQYTSGAVIETPLGTMQGHYEMIDIDGAPFRIEIPVFRLAVPTLIH >LR133964.1|VDY61877.1|4278866_4279715_+|diadenosine-tetraphosphatase MSTYLIGDVHGCYDELIALLAQVEFDPRRDTLWLTGDLVARGPGSLEVLRYVKSLGDSVRLVLGNHDLHLLAVFAGISRNKPKDRLKSLLEAPDADELLNWLRRQPLLQVDEEKKLVMAHAGITPQWDLETAQQCARDVEAVLSSDSYPFFLDAMYGDMPNHWSNELSGLARLRFISNAFTRMRYCFPNGQLDMYSKEAPEDAPAPLKPWFAIPGPVSNAYSIAFGHWASLEGRGTPEGIYALDTGCCWGGELTCLRWEDKQYFTQPSNRQKSLDEGEAVAS >LR133964.1|VDY61878.1|4279966_4280446_-|dihydrofolate-reductase MISLIAALAVDRVIGMENAMPWNLPADLAWFKRNTLNKPVVMGRLTWESIGRPLPGRKNIVISSKPGSDDRVQWVSSVEEAIAACGDVEEIMVIGGGRVYEQFLPKAQKLYLTHIDAEVEGDTHFPDYDPDEWESVFSEFHDADAQNSHSYCFEILERR |
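
The spacer record for LR133964_2 (`>2.1|4269351|47|LR133964|CRISPRCasFinder`) and its array listing can be cross-checked against each other. The sketch below assumes the header fields are array.spacer index, spacer start, spacer length, contig, and detection tool, and that the array is listed as alternating repeat and spacer segments. Both are readings of the data on this page, not a documented format, and `parse_spacer_record` is a hypothetical helper.

```python
def parse_spacer_record(header: str, sequence: str) -> dict:
    """Parse a spacer header such as '>2.1|4269351|47|LR133964|CRISPRCasFinder'."""
    array_spacer, start, length, contig, tool = (
        header.strip().lstrip(">").split("|")
    )
    array_no, spacer_no = array_spacer.split(".")
    record = {
        "array": int(array_no),
        "spacer": int(spacer_no),
        "start": int(start),
        "length": int(length),
        "contig": contig,
        "tool": tool,
        "sequence": sequence,
    }
    # The third header field is assumed to be the spacer length in nucleotides.
    if len(sequence) != record["length"]:
        raise ValueError("sequence length does not match the header")
    return record


spacer = parse_spacer_record(
    ">2.1|4269351|47|LR133964|CRISPRCasFinder",
    "GTTTTTATTCGCTGAACTGCCCGGGAGTGATATATCTCTGATTTATG",
)

# Segments of LR133964_2 as listed on the page: repeat, spacer, repeat.
segments = [
    "TTAATGGTCACTATGGTTTTTATG",
    "GTTTTTATTCGCTGAACTGCCCGGGAGTGATATATCTCTGATTTATG",
    "ACTGTGGTTTTTATGGTTTTTATT",
]
merged = (
    "TTAATGGTCACTATGGTTTTTATGGTTTTTATTCGCTGAACTGCCCGGGAGTGATATATCTCTGATTTATG"
    "ACTGTGGTTTTTATGGTTTTTATT"
)
reassembled = "".join(segments)
```

For this array the reassembled segments reproduce the merged sequence, and its length equals the span of the reported location 4269327-4269421 when both ends are counted.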
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
LR133964_3 | 4460905-4461070 | Orphan |
NA
Consensus repeat of LR133964_3
|
1 spacer
spacers of LR133964_3
>3.1|4460959|58|LR133964|CRISPRCasFinder CTATCGCAGGTATGGAGCCTGACCGCCCGGCGGCGCTGCGCTTGCCGGTCCTGGGGGG |
CRISPR arrays and Neighbor proteins around LR133964_3
The CRISPR arrays of LR133964_3 >merge|LR133964|3|4460905-4461070|CRISPRCasFinder GAGCCCGCAGGAAGGAACGTAGGCCGGATAAGGCATGTATGCCGCCATCCGGCACTATCGCAGGTATGGAGCCTGACCGCCCGGCGGCGCTGCGCTTGCCGGTCCTGGGGGGGAGCCCGCAGGTAGGAACGTAGGCCGGATAAGGCATGTATGCCGCCATCCGGCA >LR133964|3|2|4460905-4461070|CRISPRCasFinder GAGCCCGCAGGAAGGAACGTAGGCCGGATAAGGCATGTATGCCGCCATCCGGCA CTATCGCAGGTATGGAGCCTGACCGCCCGGCGGCGCTGCGCTTGCCGGTCCTGGGGGG GAGCCCGCAGGTAGGAACGTAGGCCGGATAAGGCATGTATGCCGCCATCCGGCA
>LR133964.1|VDY62046.1|4459887_4460847_+|membrane-protein MNTQPVIGISGCLTGSAVRFDGGHKRMGFVMDELAQWVAFKPVCPEMAIGLPVPRPALRLVQTPVGEIRMRFSHAPHEDVTEKMTDFASAHLATLGELSGFIVCAKSPSCGMERVRLYDEKGNRGRKEGVGLFTGALMARYPWLPVEEDGRLHDPLLRENFVERVFALHELNALRARGLTRHGLLAFHSRYKLQLLAHHQAGYREIGPFVASLHEWEDLEAFFEVYREKLMAILKQPASRKNHTNVLMHIQGYFRDQLNSRQRGELREVILNYRAGLLPILAPLTLLKHYLAEYPDRYLQAQNYFDPYPQDLALRLTVN >LR133964.1|VDY62045.1|4459339_4459891_+|lipoprotein MKRILTLGLALLMLILAGCSTEVTEYRQQQPALDIFHYFQGRTEAWGMVQDRSGKQLRRFHVAIDGDVVGDTLTLHERFVYDDGEKQQRVWRIRRTGDNRYQGTAGDIEGVASGQAAGNAFHWRYSMNVEASGSRWLLHFDDWMFLQDGSHLFNKTEMKKFGITVATVTLFFTRTTAEERTAP >LR133964.1|VDY62044.1|4458818_4459343_+|Uncharacterised-protein MRFALLLLWLTALAPAAHAADWLTWRRVGEALLTWGPFTIYHSQLRTPNGRYDGPQQDRALIITYRRDIDREALLEATRDQWQAQGILQQEPRSEAWLRMLQGIWPDVAPGSQLAFVVSGGEGQFWYRASAAQTAFTPLGPRQSAAFSTRFLAIWLDPRTTYPELRQQLIGGTP >LR133964.1|VDY62043.1|4458336_4458822_+|Protein-of-uncharacterised-function-(DUF2878) MTRPAQHLLMALAFDVYWTLVVMLRERGLLIWLTLAIFAWLRLPAASRPPALLLAAAGCGLDACWALAGLIDFRGDSLLPLWMVALWLMFAVVWTRLTRTTTLPGWVLATAATVGGPVAYLIGARLGAMSLLVPMALAVAAMACGWLVLMLLFHLGMGRRK >LR133964.1|VDY62042.1|4457119_4458340_+|S-adenosyl-L-methionine-dependent-methyltransferase MTDPVFALEPDVPRNVRLARWLLFRLLSGLREGSLTVREGAQTFHFGDPAAALRAEARVCTPEVYWRLLTGGSLAAAEAWMDGDWESHQLTALLQILARNGEVLGRLERGFRLLGKPVARLRHWTRRNTRAQARENIAAHYDLGNEFYAHFLDDDLLYSSALFTDDQQDLTQAQRAKMARLCDQLALNPGDHLLEIGTGWGALAEYAARHYGCRVTTTTLSREQHRWATERMARAGLQDRVEVLLCDYRDLRGEYDKLVSVEMIEAVGQRYLPAFFRTCQARLRPGGRMALQAITIQDQRYRDYSKSVDFIQRYIFPGGFLPSITAMSELMTRHTDFVVRNLFDMGPDYARTLAHWRQRFTHAWQDIEKLGFDERFRRMWLYYFGYCEAGFNARTISVVQLTAERV >LR133964.1|VDY62041.1|4456400_4457123_+|plasmid-partition-ParA-protein MNSCLYQGVLRHRRLQPKAHHFVYRLFMAWLDLDELDRLPEAGIRRNRLAAAAWYDADYPLGAPLKAQALNRLESLTGCRPAGRVMLLTQLRYFGFHFNPVNFYYCYDEADTLRWVLAEVRNTPWNERHYYAVDGQQARPLEKAFHVSPFNPMDMVYHWRFNAPGKTLHMHIENHQASKVFDATLALSRVPLTRANLRGLLLRLPMMTLKTVLAIYWQALRLWLKRVPLYNHPVSRSERS >LR133964.1|VDY62040.1|4455139_4456399_+|amine-oxidase,-flavin-containing 
MNIAIIGSGIAGLTCAWRLAGHHQVTLFEAGATPGGHTATVDVATPQGTWAIDTGFIVYNDRTYPRFMGLLSELGIAGQKTQMSFSVHNPASGLEYNGHSLTSLFAQRRNLLKPAFWGLLSEIVRFNRLAKLALTEALDPGATLESFLTRHRFSPFFARHYILPMGAAIWSSSLQEMRRFPLPLFLRFFENHGLLDIRDRPQWYVVPGGSREYVRALLARLGDRLDLRLNAPVQQVDRHPAGITLRLASGEAHFDQVIFACHSAQALAMLAAPTDSEREVLGDIGWQRNEVVLHSDPRWLPERQRAWASWNYRLSDGDRARACVTYNMNILQGLPAGAPLFCVTLNPDAPVDDRYVWQRFVYEHPLFNPQSWSAQLRREEINGQQRSWYCGAYWYNGFHEDGVRSALDVVQGIAAAEDN >LR133964.1|VDY62039.1|4454423_4455143_+|oxidoreductase MMTVLITGASSGIGAGLAKSFAADGHLVIACGRDASRLAALQQLSPNISVRLFDMTDRDACRQALTGCFADLIILCAGTCEYLDHGQVDAALVERVMATNFLGPVNCLAALQTQLEAGDRVVLVSSMAHWLPFPRAEAYGASKAALTWFANSLRLDWEPKGVAVTVVSPGFVDTPLTRKNDFAMPGRVSVDRAVAAIRHGLAKGKNHIAFPTGFSLALRLLASLPSGIQRLLLRRMVRS >LR133964.1|VDY62038.1|4453992_4454427_+|transcriptional-regulator MSTSPSVIRRFVEYYAGLDAQPPAALAALYHPDATLSDPFGQHQGLFAIQRYFTHLLANVEQCRFTIDTPLCDGQRFAVTWTMHWSHPRIAGGETLALPGCSVVDIAGEQVLHQRDYYDAGEMIYEHLPLLGWAVRGVKRRVRS >LR133964.1|VDY62037.1|4453032_4453761_+|MerR-family-regulatory-protein MPYSIGEFARLCGINATTLRAWQRRYGLLKPLRTDGGHRLYSDDDVQQALKILDWVKKGVPVSQVKPLLSRPGARRTNNWLTLQETMLQRLKEGKIESLRQLIYDAGREYPRQELVTEVLRPLRSQVSANVPAIMTLREILDGIIIAYTSFCLEGDKKAPGDNFLITGWHLTDACEIWLEALKRTGQGHRIDVLPVPPAALAPEIFPQRNWLLVTSGKLSAARQRQVELWQQQVVFLEVIPL >LR133964.1|VDY62047.1|4461145_4462528_-|maltoporin MTMIKKLPLTMAVIAAFFPLTSVMAQEFTQEQIDAIVAKAVDKALADRQAKIDAAADKKVDVITNPQTTAASPDMAIPFGLKFSGYARYGAHFQTGDQKYVGVDGSYNGASAIGRLGNESNGGEFQISKAFKSAQGAIWDLNVMFDHWSDEVNLKKAYVGVTNVLESNPNAYIWAGRDFHQRPQQGINDYFWMNHDGQGAGVKNFDIGGVQFDVAAVSQVKSCSPEVMADETNPSRITCTGSSDTGDNGHYALTTKTHNIKAGPIDVEVYANYGFDSKAVDSDARLEAWQGGLVLSHTNDSGVNKVILRYSDNSDNSVYNKTDDLTTVYASFEGSHKFTQQAQVEYLLAFHDYDNGKDNTDNRKNYGAIVRPMYFWNDVHSTWLEAGYQRVDYDQGGDNHGWKLTLSQNIAIGMGPEFRPMLRFYVTGGQVDNEHTAKVNGTQDQQLDSLNVGGMFEAWF >LR133964.1|VDY62048.1|4462799_4464119_-|Xylose-isomerase 
MQTYFDQVDRVRFAGPKTDNPLAFRHYNPDEIVLGKRMADHLRFAACYWHNFCWNGADMFGAGSFERPWQAAGDALEMAKRKADVAFEFFYKLNVPYYCFHDVDVSPEGASLKEYLHNFAIMTEVLAEKQQQTGVKLLWGTANCFTHPRYGAGAATNPDPEVFAWAATQVVTAMNATHQLGGENYVLWGGREGYESLLNTDLRQEREQIGRFLQMVVEHKHKIGFGGALLIEPKPQEPTKHQYDYDVATVYGFLKQFGLENEIKVNIEANHATVAGHSFHHEIASAIALGIFGSVDANRGDAQLGWDTDQFPNSVEENTLVMYEILKAGGFTTGGLNFDAKVRRQSTDKYDMFYGHIGAMDVMALSLKLAARMIEDGKLDQGLAKRYAGWQGEQGQKIMSGQMSLDNIARYAEQHNLNPQPQSGRQELLENLVNTYIFG >LR133964.1|VDY62049.1|4464530_4465985_+|xyloside-transporter MSKNIMHSRHDEYHKLTRGERIGYGMGDFAQNLVFGTIGGFLALHMLTVNTISTATAGFIFLFVRIINVFWDPMVGTYVDKRTSKAGKYRPWLLRAGVPLVILSALLFAPIPGVKGSVAFAFIIYLALDLVYSLVNIPYGSLNASLTRDPESIDKLTSTRMMLANSANLLVYTLFPMFVQMAAPKDRSLKDTGFFGLELNLGNYTDPSANYAWFGVYAIYMIIGAVALFICYKCTKERVVATAEQTANVKTTDLFHELRHNRPLVILGMFFMLAFTFMFFMNTVNGFFNQFVVGHSEWMGAVGLVASIPGIAFPVFWPKLKKIFGKKGFFHLFLAMFIVGELLTYVWSREGMHDALWLAYIATFIKQWGLTSATGFMWALVPEVIAYGELKSGKRNAAIINAIMGLFFKIGFTIGGAIPLWLLAVYGFNESGAVQSASAIDGIIMTAVWIPIALAAISMVIMQVYPISDKHVTDINRQLDEIRV >LR133964.1|VDY62050.1|4466043_4467723_+|beta-xylosidase MSLIQNPILRGFNADPSIIRVEDTYYIANSTFEWFPGVRLHESKDLKNWNLLPSPLSTTTLLDMKGNPSSGGIWAPALSWADGQFWLVYTDVKVTEGAFKDMTNYLTTAKDIRGPWSDPIKLNGVGFDASLFHDDDGRKYIVQQTWDHREYHHPFDGITLTELDTKTLKLMPETARTIYRGTAVALVEGPHLYKLNGYYYLFAAQGGTVFTHQEVVARSKTLEADSFETEPGDVFLTNVDTPDSYIQKQGHGALVSTPEGEWYYASLCARPWNRPGESIYDPRGWSTLGRETAIQKVYWDDEGWPRIVGGHGGKTFVEGPKDAIFTESASDNSQQDDFTSPALDPNWNTLRVPFTAKMGTTGNGKLTLIGQGSLANTHDLSLIARRWQAFYFDAAVKVKFEPFSYQQMAGLTNYYNDRHWSFVFLTWNEINGKVIEVGENNRGKYTSYLKDNAIKVPDGVEYVWFRTKVRKQTYSYEYSFDGVSFTEIPVQLDAAVLSDDYVLQSYGGFFTGAFVGLAAVDYAGYGTQAEFYQFEYQELGDALAADGSYSWEAGETRDK >LR133964.1|VDY62051.1|4467805_4468444_-|HNH-endonuclease-domain-protein MKKLPDHLSRECIIAAIRAYDAGVAHQFKSARLYEIEFEGRRYPSKAIVGLAATLATGTEFTPTDFSGGIKSKCVRLLLDQGFTIASQGAGDDEVALFPDEGQTTMEFVEGAAMQVVVNRYERDRQARQAALRLHGCRCQVCGLDMASRYGEIGQGFIHIHHLIPLAGIKQDYRLNPETDLIPVCPNCHAMLHRRDPPFTPEELKARLRPAD >LR133964.1|VDY62052.1|4468623_4469493_+|restriction-endonuclease 
MPSSTTLQYAIENITIWRKGEQRAPHKPLLLLYVLSQYQRGHARMFDYASEIRDELHSLLERFGPQRRQYRPDMPFWRLKGDGFWELHNSEQCSTQGSRQPPGKELELCHVAGGFDEPHFALLNRNKKLINTLAHQILEAHFPESIQEELAEEMGFDLLQIRKERDPHFRQQVLRAYNYECAICGFNMRHDNTSVALEAAHIKWKQHGGPCEIPNGLALCAIHHKAFDKGSIGLDEDMRIQVSPAVNGGGIVGRLFWDFDGKPITLPQGKECYPQERFVAWHRREVFRG >LR133964.1|VDY62053.1|4469625_4470117_-|Hcp1-family-type-VI-secretion-system-effector MPIPPYMWLKDDGGADIKGSVDVQDREGSIEIIGLSHGVNLPVDSANGKITGTRQHSSMMIEKEVDSSTPYLYKAAATGQALQSAEIKFYHINHAGQEVAYYSVLMEQVRITGVNCGVPNCKLSSNDKINHVESVSLQYEKITWRIHDGNIQFSDAWNERPSA >LR133964.1|VDY62054.1|4471188_4471863_+|Uncharacterised-protein MADIEGIARKIYDIIISPDTVTGLINGGLSVPLDYGYLIYGIFDTDTRYERETERIRMMTAIRHDILNYENIVNAVKLILQLFNKHLSESEQDRIYRSVVTSIVGRISTNIIASNIAKAVIEKTSFTYVVFKGKGNPIAFLSTILLFGGMAERSIRTSDKLRNEAPEVYSLLRPRDYDLLYFLFADAVQPFVDAIHAEYTEGNSVFDKIMKRVEEHLNTNAKAG >LR133964.1|VDY62055.1|4471864_4472212_+|Uncharacterised-protein MKKIITPLVDIVLCTYIITTCFIFLTEFNNWQNKIIAVTIIIGSAYIIAIALNRLLGTKIGVLSGIFNLLSGPTLIIGAIVCIALLDPWPVKVIGLFLWVIIMFCLPSFINRDKE >LR133964.1|VDY62056.1|4472214_4472694_+|type-VI-secretion-system-effector MLNPAYLWLTDANGSPITGSSLVTGRVCAIELKAVTHHLSIPVDGNTGRLTGTRVHTPITVQKDFDKTTPLLYKALSENQTSKSATIKMYQILDAGVESEYFNIILENVKVTGITPNLFPDSGTGTHSETIQLRYEAITWKHCDGNIIYKDSWNHRATA |
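
The spacer start reported for LR133964_3 (4460959) can be recovered from the array start (4460905) by walking the listed segments. This Python sketch assumes the segments alternate repeat, spacer, repeat and that coordinates are 1-based and inclusive. That reading is consistent with the values on this page but is not stated by the source, and `spacer_coordinates` is a hypothetical helper.

```python
from typing import List, Tuple


def spacer_coordinates(
    array_start: int, segments: List[str]
) -> Tuple[List[Tuple[int, int]], int]:
    """Return ([(spacer_start, spacer_length), ...], array_end).

    Segments are assumed to alternate repeat, spacer, repeat, ...
    so every odd-indexed segment is treated as a spacer.
    """
    coords = []
    position = array_start
    for index, segment in enumerate(segments):
        if index % 2 == 1:
            coords.append((position, len(segment)))
        position += len(segment)
    # 'position' now points one past the last base of the array.
    return coords, position - 1


# Segments of LR133964_3 as listed on the page.
array3_segments = [
    "GAGCCCGCAGGAAGGAACGTAGGCCGGATAAGGCATGTATGCCGCCATCCGGCA",
    "CTATCGCAGGTATGGAGCCTGACCGCCCGGCGGCGCTGCGCTTGCCGGTCCTGGGGGG",
    "GAGCCCGCAGGTAGGAACGTAGGCCGGATAAGGCATGTATGCCGCCATCCGGCA",
]
array3_spacers, array3_end = spacer_coordinates(4460905, array3_segments)
```

The same walk can be applied to the other arrays to flag records whose listed spacer starts or lengths disagree with the segment layout.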
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1633209 : 1640114
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >LR133964|1633209:1640114|DBSCAN-SWA CGTGACATCCTCGATCCCCGTAGTGAATAATGCACCGCTGCTCGACGTGCGCGATTTGTGCGTCGACTTTGTTAACGGTAGCGCGGTGACCCACGCCGTGCGCGGCGTTTCCTTCCAGCTGGGGCATGAGAAGCTGGCCATTGTCGGCGAGTCCGGTTCGGGTAAGTCCACCGTTGGGCGCGCGCTGCTGCAGCTGCATCCGAAAAAGGCCCGCGTCAGCGCCAGCCGGATGCAGTTTGCCGACCTCGATCTGCAGCATCTTAGCGAAGCGCAGATGCGCGGCGTACGCGGCAAACGTATCTCGATGATTATGCAGGATCCGAAATATTCGCTGAATCCGGTAGTCTGCGTGGGTAAGCAAATCGCCGAGGCCTGGCTTACTCATCATCCCGGGCGCAAAGATGAGGCGAAAGCCAGGGCGCTGGAGATGCTGGAGGTGGTGCGTATTCGCCAGCCGGAGCGCGTTTATCAGCTTTATCCGCATGAAATATCCGGCGGACAGGGGCAGCGCATCATGATCGCCATGATGCTGATTACCGACCCGGAGCTGATCATTGCCGATGAGCCCACCTCCGCGCTGGACGTGTCGGTGCGCCTGCAGGTGCTGGGCCTGCTGGATGACCTGGTTCAGTCGCGCGGGTTGGGGCTGATTTTTATCAGCCACGATATCAACCTCGTGCGCAGCTTCTGCGACCGGGTGCTGGTGATGTACGCCGGACGGGTAGTGGAGTCCATCGCCGCTAAAGATCTCGATAACGCCCGTCACCCCTACACTCAGGGATTGATTAACTCGCTGCCGGATATGCAGCATCGTCGCCCGATACTGCCGGTGCTGCAGCGTCAGGCCAGCTGGTTAACGGATTAAGGAGCGGATAATGATTGAAGTTCAACATCTCAACCTCGCGTTTGGCGAGGGGGAGAAGCGTAACCAGGTGCTGTATGACGTCAGTTTCCACGTTAAGCCCGGCGAGATCTACGGTCTGGTAGGAGAGTCCGGTTCCGGTAAAACGACGGTGCTCAAGTGCCTGGCGGGGCTGTTTACCCACTGGCAGGGGGAACTGACCATCGATGCTCAGCCCTTAGGGCATGAAATTAGCCGCGAGCGCTGCCGTCAGGTGCAGATGGTATTTCAGGACCCCTACGGATCGCTGCATCCCCGTCACACCATTGGCGATATCCTCGAAGAGCCGCTGCAGATCCATCGTATCAACGACCGCGACCGGCGGATCAATGTGCTGCTGGATAAAGTTGGCCTCAACCGCGCCTTTCGCGATCGCTATCCGCATCAGCTCTCCGGCGGGCAGCGGCAGCGCGTGGCGATAGCCAGAGCGCTGATTCTTGAGCCTCAGGTGCTGCTGCTGGATGAGCCGACGTCGGCGCTGGATGTGTCGGTGCAGGCTGAGATCCTTAACCTGCTGGCAGAATTGCAGCGGGAGGCTAAGCTCACTTACCTGATGGTCACCCACGATCTTGGGGTTATTGCCCACTTATGCCAGAAAGTGGCGGTGATGCAGTACGGGAAGATCCTCGAAACCCTGACGGTCGATGCGCTGGTCAGCGGCCAGGCGCAGACGGCCTATACGCAGATGCTGGTTAACGCCAGTCGGCAATATACGCGCGAAATGGCGCGCGAGGTGGCGGAATACTAAGCTTTATGGCGCATTGTCTCACGGCAGATGCGCCCATTTTAATAGGTAACCATTTGATGTTATAGGTTTTTTGTGGTTGTGGGTATCGTCTGTTAATGTAAGCGCGGTGCTTGGATCGATATCCTGCCCAGGTTTTTCCTGCAGCCGCAGGTGCCGTTTTACCTTAACATTCAGAGTCTCAATCAACGAACCCGGCTGCTGTATACCAGCCGGGCGCCGTCTTTGCTTTGCGAAGCCTGATCAGCGTAGCAGCGGGCAGTCGGGCGGCAGCCGGCAG
CGCAGGGCCGCCGGTAACAGCTCCATGCGGAAGGTTTTGCCGCTGAGCGGCTCGCCGTCGAGGTTAAACGTCATCTCATGGGGCGCGGTTATTTCAAACCACGAGGAGACACCGTCAACAATATTCGGCGAGTTTTCCGGATTCGCCAGGGTGCTGAACAGGGCAGGGATCAGCTCTTCGCCGGTGAAAATGCGCAGATGCAGCAGACCATCGTTAATCAACGCCTCCGGACACAGCTGCTGGCCGCCGCCGGCCTGGCGGCCATTGCCGATGCCGATGACCAGCGCATCGCCCTGCCAGTGAAAGTTCTCCCCGCGGATCTCGCAGCGGTCCGGCTTCAGGGTATCCATGCGCATCAGACCGTGGATTAAATAAGAGACACCTCCGAGCGCGGCCTTGAGTTTTTCCGGCGTTTCGGTGGTGATCCGGGTGCCGAACCCGCCGGTCGCCATATTAATAAAGCCGGTTTTATCATTCACCCGGGCGATATCTATCGGCACATCGCGGCCGACGATCGCCAGCTTGAGGGCGCTGGCCAGGTCCTGCGGAATACCGACGCTGGTGGCAAAGTCGTTAGCCGTGCCCAGCGGTAAAATCCCCAGCGCCATCTTGCCGCCGCGCTCCACCAGCGCGGTCGCCACTTCATTAATGGTGCCATCGCCACCGCCGGCGATCACCGTTTCAACGTTCAGCTGCAGGGCTTCATCGATAAACCGAGCGGCATCGCCCTTTTCCCAGGTCACGCGGACGTGAATATCAATCCCTTCATCGCGTAACAGGTTAACCGCTTCGCGTAGCTGCGGTTCATTGGCACCTTTGCCGTTAAGGATCAGTAAGCTGGCAGGAAACGTAGACATCATCAAGGGAACCTTTATTGCTCAATGTTGCATAAAGTGTACCGCATGACGAAAGCGAAGACCGGAGGAGAGGCAAAAAAATAAGCCCGCGTAAGGGAGATTACGCAGGCTAAGGAGGTGGTTCCTGGTACAACTAGCATTTATGGGTTATGTTTTTCAGCGAAATGGATGATACCTGTTTCGATCGAAACGGTATGTGATCCGATTCTAATATCCTTACATCGTGAAAAAATAACGCAAATTAACTAGTTACCGTGCGGATTTCTGGTTGTTTGACCAGTAAAGTTGCGCATCAGCAGGGCATACTGCAGGGGCAGGTCATCCGGCACAGGCAACCAGACGGTATAGCCATCGCCCGGGGCCACCGGCATGGTTTCGCCTTTGCCATTCTCCATATGCTCAAGGGTGAAGTTGACGTTACCTTGCGGGGTCATCAGCTCAAGGCTGTCGCCGACGCTGAACTTATTTTTGACCGCCACGGCCGCCAGCGGGCCGTTACGCTCTCCGGTGAATTCGCCGACAAACTGCTGGCGCTCGGAGACGGAGTAGCCGTACTCGTAGTTCTGATAATCGTCATGGGTATGGCGGCGCAGGAAGCCTTCGGTGTACCCGCGGTGGGCCAGCCCCTCCAGGGTCTCCAGCAGGCTGGTATCGAACGGTTTGCCGGCGGCGGCATCGTCGATCGCCTTGCGGTAAACCTGAGCGGTGCGCGCGCAGTAGTAGAAGGATTTGGTGCGGCCTTCGATCTTCAGCGAGTGAACGCCCATCTGGGTCAGACGCTCAACGTGGGCGATAGCCCGCAGATCTTTCGAGTTCATGATGTAGGTGCCGTGTTCGTCCTCAAAGGCGGTCATGTATTCGCCCGGACGTTTGGCTTCCTCAATCATAAAGACTTTATCGGTCGGCGCGCCGATGCCGAGGGTAGGCTCCACGGTCTGCACCGGGATCGGCTCATATTTGTGGACGATGTTGCCGACGTCGTCCTCTTTGCCTTCCGCCACGTTGTATTCCCAGCGGCAGGCGTTGGTGCAGGTCCCCTGGTTCGGGTCGCGCTTGTTGATATACCCGGAGAGCAGGCAGCGGCCGGAGTAGGCCATGCACAGCGCGCCGTGGACGAAAATCTCAATCTCCATCTCCGGCACC
TGGCGGCGAATCTCTTCGATTTCATCCAGCGACAGTTCACGCGACAGGATCACGCGGGTGAGCCCCATCTGCTGCCAGAATTTGACCGTCGCCCAGTTGACGGCGTTGGCCTGCACTGAAAGATGGATCGGCATCTCCGGGAAGTGTTCCCGCACCAGCATGATTAAACCCGGATCGGACATGATCAGCGCGTCCGGACCCATCTCCACCACCGGCTTCAGATCGCGAATAAACGTTTTCAGCTTGGCGTTATGCGGGGCGATGTTGACCACCACATAGAATTTTTTACCCAGCGCGTGGGCTTCATTGATGCCGAGCTGCAGATTTTCGTGATTGAATTCGTTGTTACGCACGCGCAGCGAGTAGCGCGGCTGGCCGGCATAAACGGCATCGGCGCCATAGGCGAAAGCGTAACGCATATTCTTCAGCGTTCCCGCCGGGGAGAGGAGTTCTGGTTTAAACATGGTGTTCTCGTTCTGATGACAGGTCAGATTCGCCGCACCTGGTGCGGCGAGTTGGGGAGTGTCCCCACATTAAGGGCGGGCATTGTAGCGCCGCGGCGAAGGGGAGTAAATATCAGATAGGTATCTTTCAGTATGGCTAATTTGTTTGCCATTTTGAACCTGGGCAGTACTCACACTGACAGCGTTAATAACGCCTGGTGGGATAGGCTCTAGGGAGTTCCCCGGGTTGCGGCGTAAACGCCTTACCCGGGCTACGGATTGCAGATGGCGGTGAACGTGTAGCCCGGCGAAGCGCAGCGCGAGCCGGGAAAAGGCTAACTACGCCAGCCGGCAGGCGTCCGCTTCCCAGCGGTAGCCAACGCCATACACCGCGCGGATAAACGACTGCTCGGCGTCCAGCGCCTCCAGCTTGCGCCGCAGGTTCTTGATGTGGCTGTCGATGGTGCGATCGGTGACTACCCGATAATCGTCATACAGATGGTTGAGTAGCTGTTCGCGGGAGAACACTTTGCCCGGCTCCTGGGAGAGGGTTTTCAGCAGCCGGAACTCGGCGGGAGTGAGGTCCAGCAGCTTGTCGCGCCAGGAGGCCTGAAAACGACCTTCGTCGACAATCAGCGGGCTCTGGGCGTCCAGCGCCTGCAGATCGCGCTGCGGCTTGCAGCGGCGGAGGATGGTTTTTACCCGGGCGACCACTTCCCGCGGGCTGTAGGGCTTGCAGATATAGTCGTCGGCGCCTATTTCCAGCCCCAGCAAACGGTCGATCTCTTCGATTTTCGCCGTCACCATCACCACCGGAACATCGGAGAAGCGGCGAATTTCCCGGCACAGCGTCAGACCGTCGGTACCGGGCAGCATCAGATCCAGCAGGATCAGGTGCGGCGGCGTCTGGCGGACGTAGGGCAGGACCTTGTCGCCGTGGTTAATCAAGGCAGGCGCATAGCCGGCGGCCTGCAGATAATCAATCAGCAGCTGGCCGAGCTTCGGTTCATCTTCCACGATAAGGATGCGTGGGGTGTTTTCATCTACAGGCAATTCAGTCATACGTCTCTCGGTAAATCATGTTCCAGAGGTAACTCAACTTTAATGCTAACCCCGCCAAAAGGCGAATGGTCGGCACGCAGGGTGCCGCCGTGGGCCGCTACGATATTGAGGCAAATCGCCAGCCCGAGGCCGGAGCCGCCGCTGGCGCGGTTGCGCGATCCTTCCGCCCGGTAAAAGCGTTCGCACAGGCGGGCCAGCTGCTCATCGCTGACGCCGGGGGCGCTGTCGGCGAAGTCAATCACCAGCATCCGGCCGCTGCGGCGGGCGGTAATGTGCAGGCTGCCGCGGCTGTCGGTATAGCGCAGGCTGTTCTCCAGCAGATTGTTGAAGAGCTGCATCAGGCGGTCGCGGTCGCCGAAGATCATCGCCTGCTCCGGGAGCGACACCTGAATGCTTAGCTGCCGGCTGGCGAAGCGCTCGCGAAAAGCGCCCGCCGCCACTTCCAGCAGGGTGATGATATCCAGTGAGGTTTTCTGGTAGGCCAGCGCCCCCTCG
TCGGACATCGACAGCTGATGGAGGTCGTCCACCAGCTTGGTGAGGGTAGCCACCTCAGCCTGCAGAGAAGGAATCGACTCGGGGGTAAAGCGCCGGACCCCGTCCTGAATCGCCTCCAGCTCGCCGCGCAGCACCGCCAGCGGGGTACGCAGCTCATGGGAGATATCGGCCATCAGATCGCGACGCATCTGCTGGTTGCGCTCCAGGGTGCTGGCGAGCTGGTTAAAATCCTGCGCCAGGCGGCCGAGCTCATCGCCGCCGGTCACCGTCACGCGGGTGGAAAAGTCGCCTGCCGCCAGCTTATGGGTGCCTTCCACCAGCCGCTTCACCGGCGCCAGCAGACCGCGGGCCAACGGGAAAGTGGCCAGCGCGGCAAGCAGGGTGGCAAGGGCGACAATCAGCCAGCTGGTGCGCTTTTGCTGCCGGTCGAAGTTAATGTCGGTATTGCGCGTCAGCCGCTCTACCGGCGAGGCGATCACTTTGCCCACCTCCGCGCCGTTGACCACGATGCTGCGCTGGGTGCCGTCCTCCGGGACGCGCTCACGCGGCCCCACCAGCACGCGCCCCGACTGATCCACCACCCAGAACATGGTGCGCCAGCCGTGGGGCGGCATTTCCGGCCTCGGGCGCGGGCCGTCAGGGGGGCCATCCGGCGGCGGCCCGTCCGGGGCGGCGTCCGGCTTCATGGCGTGCCCCGGCGGCGAGCGGTCGTCGTTATCCCGTTCGAAGGTGCGCAGCAGCTGGAAGATAAAGCGGTCATTATTGCGCAGAAACGCCCAGCTGCCATGCTGGGCATACTGCTCGCTCAGCGCGTCGCTGAGCATGGTTAAGCGTTGCTCATTGCCGCGCTTGATATAGTCGATAAACCCGCGTTCAAAACTAATGCGCACCGCCCAGTGCATGCTGATCAGCAGCACGATGCAGGTGGCGAATATAGCGAGGAACAGCTTGCCGGTAATACCCGGGCGCCAAAATTTCAC
Protein sequences of DBSCAN-SWA_1 >LR133964|1633209:1640114|1634083_1634857_+|VDY59422.1|DBSCAN-SWA MIEVQHLNLAFGEGEKRNQVLYDVSFHVKPGEIYGLVGESGSGKTTVLKCLAGLFTHWQGELTIDAQPLGHEISRERCRQVQMVFQDPYGSLHPRHTIGDILEEPLQIHRINDRDRRINVLLDKVGLNRAFRDRYPHQLSGGQRQRVAIARALILEPQVLLLDEPTSALDVSVQAEILNLLAELQREAKLTYLMVTHDLGVIAHLCQKVAVMQYGKILETLTVDALVSGQAQTAYTQMLVNASRQYTREMAREVAEY >LR133964|1633209:1640114|1637916_1638639_-|VDY59425.1|DBSCAN-SWA MTELPVDENTPRILIVEDEPKLGQLLIDYLQAAGYAPALINHGDKVLPYVRQTPPHLILLDLMLPGTDGLTLCREIRRFSDVPVVMVTAKIEEIDRLLGLEIGADDYICKPYSPREVVARVKTILRRCKPQRDLQALDAQSPLIVDEGRFQASWRDKLLDLTPAEFRLLKTLSQEPGKVFSREQLLNHLYDDYRVVTDRTIDSHIKNLRRKLEALDAEQSFIRAVYGVGYRWEADACRLA >LR133964|1633209:1640114|1636236_1637598_-|VDY59424.1|protease|DBSCAN-SWA MFKPELLSPAGTLKNMRYAFAYGADAVYAGQPRYSLRVRNNEFNHENLQLGINEAHALGKKFYVVVNIAPHNAKLKTFIRDLKPVVEMGPDALIMSDPGLIMLVREHFPEMPIHLSVQANAVNWATVKFWQQMGLTRVILSRELSLDEIEEIRRQVPEMEIEIFVHGALCMAYSGRCLLSGYINKRDPNQGTCTNACRWEYNVAEGKEDDVGNIVHKYEPIPVQTVEPTLGIGAPTDKVFMIEEAKRPGEYMTAFEDEHGTYIMNSKDLRAIAHVERLTQMGVHSLKIEGRTKSFYYCARTAQVYRKAIDDAAAGKPFDTSLLETLEGLAHRGYTEGFLRRHTHDDYQNYEYGYSVSERQQFVGEFTGERNGPLAAVAVKNKFSVGDSLELMTPQGNVNFTLEHMENGKGETMPVAPGDGYTVWLPVPDDLPLQYALLMRNFTGQTTRNPHGN >LR133964|1633209:1640114|1635097_1635994_-|VDY59423.1|DBSCAN-SWA MMSTFPASLLILNGKGANEPQLREAVNLLRDEGIDIHVRVTWEKGDAARFIDEALQLNVETVIAGGGDGTINEVATALVERGGKMALGILPLGTANDFATSVGIPQDLASALKLAIVGRDVPIDIARVNDKTGFINMATGGFGTRITTETPEKLKAALGGVSYLIHGLMRMDTLKPDRCEIRGENFHWQGDALVIGIGNGRQAGGGQQLCPEALINDGLLHLRIFTGEELIPALFSTLANPENSPNIVDGVSSWFEITAPHEMTFNLDGEPLSGKTFRMELLPAALRCRLPPDCPLLR >LR133964|1633209:1640114|1638635_1640114_-|VDY59426.1|DBSCAN-SWA 
MKFWRPGITGKLFLAIFATCIVLLISMHWAVRISFERGFIDYIKRGNEQRLTMLSDALSEQYAQHGSWAFLRNNDRFIFQLLRTFERDNDDRSPPGHAMKPDAAPDGPPPDGPPDGPRPRPEMPPHGWRTMFWVVDQSGRVLVGPRERVPEDGTQRSIVVNGAEVGKVIASPVERLTRNTDINFDRQQKRTSWLIVALATLLAALATFPLARGLLAPVKRLVEGTHKLAAGDFSTRVTVTGGDELGRLAQDFNQLASTLERNQQMRRDLMADISHELRTPLAVLRGELEAIQDGVRRFTPESIPSLQAEVATLTKLVDDLHQLSMSDEGALAYQKTSLDIITLLEVAAGAFRERFASRQLSIQVSLPEQAMIFGDRDRLMQLFNNLLENSLRYTDSRGSLHITARRSGRMLVIDFADSAPGVSDEQLARLCERFYRAEGSRNRASGGSGLGLAICLNIVAAHGGTLRADHSPFGGVSIKVELPLEHDLPRDV >LR133964|1633209:1640114|1633209_1634073_+|VDY59421.1|DBSCAN-SWA MTSSIPVVNNAPLLDVRDLCVDFVNGSAVTHAVRGVSFQLGHEKLAIVGESGSGKSTVGRALLQLHPKKARVSASRMQFADLDLQHLSEAQMRGVRGKRISMIMQDPKYSLNPVVCVGKQIAEAWLTHHPGRKDEAKARALEMLEVVRIRQPERVYQLYPHEISGGQGQRIMIAMMLITDPELIIADEPTSALDVSVRLQVLGLLDDLVQSRGLGLIFISHDINLVRSFCDRVLVMYAGRVVESIAAKDLDNARHPYTQGLINSLPDMQHRRPILPVLQRQASWLTD |
6 | Planktothrix_phage(33.33%) | protease | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
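The keyword rule described in the annotation note can be illustrated with a minimal Python sketch. It assumes two things that are inferred from this page rather than documented: that a keyword counts as present when it appears as a case-insensitive substring of the annotation text, and that protein headers use the pipe-separated layout seen in the records here (contig, region, `start_end_strand`, protein ID, an optional keyword label, then `DBSCAN-SWA`). The names `PHAGE_KEYWORDS`, `phage_keywords_in` and `parse_protein_header` are hypothetical helpers, not part of DBSCAN-SWA.

```python
from typing import Dict, List, Optional

# Keywords quoted from the annotation note in the prophage table.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def phage_keywords_in(annotation: str) -> List[str]:
    """Return the listed keywords found in an annotation string.

    Assumption: plain case-insensitive substring matching; the matching
    rule actually used by the database is not stated on the page.
    """
    lowered = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in lowered]


def parse_protein_header(header: str) -> Dict[str, Optional[object]]:
    """Split a protein FASTA header of the form seen on this page.

    Assumed layout (inferred from the records, not from documentation):
    >contig|region|start_end_strand|protein_id[|keyword]|DBSCAN-SWA
    """
    fields = header.lstrip(">").strip().split("|")
    contig, region, location, protein_id = fields[:4]
    start, end, strand = location.split("_")
    # A sixth field is present only when a keyword label was attached.
    keyword = fields[4] if len(fields) == 6 else None
    return {
        "contig": contig,
        "region": region,
        "start": int(start),
        "end": int(end),
        "strand": strand,
        "protein_id": protein_id,
        "keyword": keyword,
    }


if __name__ == "__main__":
    labelled = ">LR133964|1633209:1640114|1636236_1637598_-|VDY59424.1|protease|DBSCAN-SWA"
    print(parse_protein_header(labelled))
    print(phage_keywords_in("putative tail fiber protein"))
```

Substring matching is deliberately naive; it would also flag unrelated words that happen to contain a keyword, so treat any hits as candidates for inspection rather than as confirmed phage proteins.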
DBSCAN-SWA_2 |
1682603 : 1703207
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >LR133964|1682603:1703207|DBSCAN-SWA AATGTCCAAGCAACAGATCGGGGTTGTCGGTATGGCAGTGATGGGGCGCAACCTTGCGCTCAACATCGAAAGTCGTGGTTATACCGTCTCTGTTTTCAACCGTTCCCGTGAAAAGACTGAGGAAGTGATTGCCGAGAACCCAGGCAAGAAACTGGTCCCATATTATACCGTCAAAGAGTTCATTGAATCCCTCGAAACTCCACGCCGTATCCTGATAATGGTGAAAGCGGGTGCTGGTACGGATAGTGCGATCGACTCTCTCAAGCCGTATTTGGATAAAGGCGACATCATTATTGATGGTGGCAATACCTTCTTCCAGGACACCATCCGCCGCAATCGTGATCTATCTGCTGATGGTTTTAACTTCATCGGTACCGGCGTTTCTGGTGGAGAAGAGGGCGCACTTAAGGGCCCATCTATTATGCCAGGCGGGCAGAAAGAGGCCTATGAGCTGGTGGCGCCAATCCTTGAGCAGATTGCCGCACGAGCAGAAGACGGGGAACCTTGTGTTACCTATATTGGTGCTGATGGTGCGGGCCACTATGTAAAAATGGTTCATAACGGCATCGAATATGGTGACATGCAACTCATTGCAGAAGCGTACGCTTTGCTGAAAGGCGGCCTTGCACTGTCTAATGAGGAACTGGCTACTACCTTCACTGATTGGAATCATGGTGAGTTAAGCAGCTATCTGATCGATATCACTAAAGACATCTTCACCAAAAAAGATGACGAAGGTAAATTCCTGGTTGACATGGTTCTTGATGAAGCGGCCAACAAAGGTACAGGTAAATGGACTAGCCAGAGTTCTCTGGATCTGGGTGAACCTCTGTCCCTGATCACCGAGTCTGTATTCGCTCGCTACATCTCTTCTCTGAAAGATCAGCGTGTTGCCGCTTCTAAAGTGTTGACTGGTCCGCAGGCGCAACCGGCTAGTGATAAAACAGAATTTATCGAGAAAGTGCGTCGTGCTCTTTATTTGGGTAAAATTGTTTCCTATGCGCAGGGCTTCTCTCAACTGCGTGCTGCATCTAATGAATACAGCTGGGACCTGAACTACGGCGAGATTGCCAAAATCTTCCGTGCGGGTTGCATTATTCGTGCCCAGTTCCTGCAGAAAATTACTGATGCTTACGAAGAGAATGCCGGGATTGCTAACCTGTTACTGGCACCGTACTTCAAACAGATTGCGGATGAATATCAGCAGGCGTTGCGTGATGTCGTTGCTTATGCTGTGCAGAATGGTATTCCGGTACCAACTTTCTCTGCTGCTATCGCATACTACGACAGTTATCGTTCTGCAGTTCTGCCGGCTAACCTGATTCAGGCTCAACGTGATTATTTTGGTGCGCATACTTATAAGCGTACTGATAAAGAAGGTGTATTCCATACTGAGTGGATGGAATAATCATAGTTTGTTAAACTAATTAAAACCCGGTTAATCCCTAACCGGGTTTTTTCTATAAAGTTCTTTCAATTCAAACATTGTTGTTGTGATTGACCGTCCCCCTTTAAGCTATTCAGTTTGCTCTCATACAAAAAGGTGTGTCTTGACATACAGTCAGTCGGAGAGTAAAACCGCCACAAGTGTTGATTACCAGGGTGGCAAATTGCCATTGTCCAGTTAACTGATTAAAGTGTGATCGAATGAAAATTACTATTTCCGGTACCGGTTATGTTGGTTTATCGAACGGCATCCTAATTGCGCAAAACCACGAAGTGGTCGCGCTTGATATCGTGCAGTCAAAAGTAGATATGCTTAACCAGAAGATTTCTCCGATTGTCGACAAAGAGATCCAGGAATATCTGGCCGAAAAGCCTTTGAATTTTCGTGCGACAACTGACAAGCAGGACGCTTATCGTAATGCTGACTATGTCATTATCGCTACCCCGACTGACTATGATCCAAAAACGAACTATTTCAAT
ACATCTACCGTTGAAGCCGTCATTCGTGATGTCGCAGAGATTAACCCTGCGGCAGTAATGATTGTTAAATCAACTGTTCCCGTCGGTTTTATCCGTGATATTAAAGAGCGTCTAGGGATTGATAACGTTATTTTCTCCCCAGAGTTCTTGCGCGAAGGCCGTGCGTTGTATGACAACCTGCATCCGTCTCGTATCGTTATTGGTGAACGTTCCGAACGTGCTGAACGATTTGCCAATCTCCTAAAAGAAGGGGCGATAAAGCAGGATATCCCAACGCTGTTTACTGATTCAACTGAAGCCGAAGCGATTAAACTGTTTGCTAACACCTATCTTGCGCTTCGCGTTGCCTATTTTAATGAGCTGGATAGCTACGCTGAGAGTCAGGGCCTTAACAGCAAGCAGATCATTGAAGGTGTGTGCCTGGATCCGCGCATTGGGAACCACTATAACAATCCGTCGTTTGGCTATGGTGGCTACTGCTTACCAAAAGATACCAAGCAACTGCTGGCAAACTATAAATCGGTACCGAACAACATCATTGGCGCTATCGTTGATGCTAACCGTACCCGTAAAGACTTTATTGCCGATTCTATTCTTGCGCGTAAGCCGAAAGTGGTTGGCGTTTATCGTCTGATAATGAAGAGTGGTTCAGACAACTTCCGTGCGTCCTCTATCCAGGGCATCATGAAGCGCATGAAAGCCAAAGGTATTCCGGTTATTATTTACGAACCTGTCATGCAGGAAGATGAGTTCTTTAACTCTCGAGTAGTTCGCGATCTGGAAGCTTTCAAGCAGGAAGCTGATGTCATCGTGTCTAACCGTATGGCAGAAGAACTTGCTGATGTCGCAGATAAGGTTTATACCCGCGATTTGTTCGGCAACGACTAGCATTCCGCTGACCATGACTGGCGGTACCTGCCGCCGGTCTGTTTTAATCATCTTCTTTTATAAACCCTCCTTATCTCTTCCTCTACTGTTACTTGGCACCACTAAAATTGTGATGTATCCCGTAATCTAGCACTTCATCCATTGCACCGCTCGAACAGGTCGGATATGTTGAATCTACCTTGTGAATAGCTACCGTCACTGTGATGTAGATGCTGGAGATAAGCGCCGGAAGGTAAAACGAAGCCTGCAATACAGGCTTTTGTTATTTCTGTAGTCTGATAAACCTGGTATATTACTAACTGAATTGGGTAATGGTGCCAACTTACTGATTTAGTGTATGATGGTGTTTTTGAGGTGCTCCAGTGGCTTCTGTTTCTATTAGCTGTCCCTCCTGTTCAGCTACTGAAGGCGTGGTGCGTAACGGTAAAAGTACTGCCGGACATCAGCGCTATCTCTGCTCTCAATGCCATAAAACATGGCAACTACAGTTCACTTACACCGCCTCTCAGCCCGGTACGCACCAGAAAATCATTGATATGGCCATGAATGGCGTTGGATGCCGGGCAACCGCCCGCATTATGGGCGTTGGCCTCAACACGATTTTACGTCACTTAAAAAACTCAGACCGCAGTCGGTAACCTCACGCATACAACCGGGCAGTGACGTCATTGTTTGCGCGGAAATGGACGAACAGTGGGGTTATGTCGGCGCTAAATCACGCCAGCGCTGGTTGTTTTACGCGTATGACAGGATACGGAGGACGGTTGTGGCGCACGTATTCGGTGAACGCACTATGGCGACGCTGGGGCGTCTTATGAGCCTGCTGTCACCCTTTGACGTGGTGATATGGATGACGGATGGCTGGCCGCTTTATGAATCACGCCTGAAGGGAGAACTGCACGTTATCAGCAAGTGGATTGGCCCCTATATTTCCAGACACTTTTTATCGCTTAACCCATTACTGGTTCGCCGCCGCAGATATTCCCGTGGCGAACGATACCCCAGTGCACTATGCGGATGCCATTCGTTGTAATGTTCGAAGGCCTCTGCAAGGTTCTTTACCGCTGTTAACCCGTCGGGTTTCGGCATGATGCTGATGTAATCGCGCTTCA
TCGTTTTCACGAAGCTCTCTGCCATCCCGTTGCTTTCCGGGCTACGTACCGCCGTATGTTTAGGCTCCAGTCCTACCATTCTGGCGAACTGACGCGTCTGATAAGAACGGTAGGCTGAACCGTTGTCTGTCAGCCACTCAACTGGGGATGTCGGCAGGCTGTTACCGAAGCGACGCTCCACGGCACCCAGCATGACGTCCTGCACGGTTTCACTGTCATATCCACCGTTACTGGCCGCCCAGTAAAGTGCCTCGCGATCGCAACAGTCCAGAGCGAACGTGACCCGCAGTTTTTCACCGTTATCACAGCTGAACTCGAAGCCGTCAGAGCACCACCGCTGGTTACTTTCTCCAACGGCCACTTTCCCTGTATGCGCCCGCTTCGATGGCGGTATTTCCGGTTTACGCTCAAGCAGCAGCGCATTCTGACGCATGATGCGGTATACGCGTTTGGCATTGATCACCGCCATGTCGTCAGTTTCTGATTGTCTGCGCAGCAGTGCCCATACCCGACGATAACCATAGGTGGGCAGATCGCCGATAACGGTATGGATACGGGCCAGCGCGTCAGTATCATCAGGCTTGCGCTTGCACCGACGATCCTGCCAGTCCTTCGACCGACGGGCCATGGCATGCAGTTGCGCACGTGAGACCCGGAGGCAACGACTGACAAGGCTTATTCGCCATCCTCCGGCAACAAGGGCACGTGCGCTATCCACTTTTTTGTCGGCCATATTCAACGGCTTCTTTTAGCAGCTCGTTTTCCATGGTTTTCTTGCCCAACAGGCGCTGCAGCTCTTTAATTTGCTTCATCGCAGATGCCAGCTCCGACGCGGGCACAACCTGTTCTCCTGCGGCAACGGCTGTAAGGCTGCCTTCCTGATACTGCTTACGCCACAGGAACAGCTGACTGGCAGCAACGCCATGCTGACGGGCGACCAGCGACACGGTCATTCCGGGCTCAAAACTCTGCTGAACAAGGGCGATTTTTTCCTGAACACTTCGCCGTCTGCGCTTCTCTGTGACCTGCCCCCAGAGTTAGATACAAGCTTCAGTTAGTAATGTCGGTTGGTTTTTCTTCATATTTCCCGTTTCGCCAGCCCGCTGCAAATTCAGCCGGCGTCAGGTAATTCAGTGATGAATGTGGTCGACACTCGTTATAATCCAGTCGCCAGTCATTAATGATCTTCCTGGCGTGAACAATATCGCTGAACCAGTGCTCATTCAGGCATTCATCACGAAAGCGTCCGTTAAAACTCTCAATAAATCCGTTCTGCGTTGGCTTGCCGGGCTGGATAAGTCGCAGCTCCACACCATGCTCAAAAGCCCACTGATCGAGTGCGCGGCAGGTAAACTCCGGGCCCTGATCGGTTCTTATCGTAGCCGGATAGCCGCGAAACAGCGCAATGCTGTCCAGAATACGCGTGACCTGCACGCCTGAAATCCCGAAGGCAACAGTGACCGTCAGGCATTCCTTTGTGAAATCATCCACGCAGGTAAGACACTTGATCCTGCGACCTGTGGCCAGTGCGTCCATGACGAAATCCATTGACCAGGTCAGATTGGGCGCCATCGGGCGGAGCAGCGGCAGACGTTCTGTTGCCAGCCCTTTACGACGTCGTCTGCGTTTTACACTCAGGCCATTAAGTTGATAGATGCGGTAAACCCGCTTGTGGTTAACGCAAAGACCTTCACGTCGCAGAAGCTGCCAGATACGCCGGTAACCAAAACGGCGGCGTTCAAGTGCCAGCTCTGTGATGCGTAGAGACAGCTGCGCGTCAGCAGCCGGACGCTGAGCCGAATATCGGCAGGTTGACAGGGACAGACCTGCCAGCCTGCAGGCACGACGTTGCGACAGACCCTTAGCCTCGCACATGACTTCCACGGCTTCCCGCTTCTGGTCTGTCGTCAGTACTTTCGCCCAAGAGCCACCTGAAGCGCCTCCTTATCCAGCATGGCTTCAGCAAGCAGCTTCTTGAGTCTGGCGTTCTCTTCCTCAAG
CGACTTCAGGCGCTTAACCTCAGGCACCTCCATACCGCCATACTTCTTACGCCATGTGTAAAAGGTGGCGTCGGAAATGGCGTGCTTACGGCAGAGCTCACGGGCAGAAACGCCGGCTTCGGCCTCGCGGAGAATACATATGATCTGTTCGTCGGAAAAACGCTTCTTCATGGGGATGTCCTCATGTGGCTTATGAAGACATTACTAACATCACGGTGTATTAATCAACGGGGAGCAGGTCACGATATACGCAGCGCATTGAGCGGCATAACCTGAATCTGAGGTAACATCTGGCAAGGCTGGGCAGGAAGTCACTGTCGTTCTCAAAATCGGTGGATCTGCATGACAAGGTCATCGGGCATTATCTGAACATAAAACACTATCAGTAAGTTGGAGTCTTTACCCTGAATTGCATATTAATTATTAAAGGAGTCCGTACAAAAAACACGGCTGCTCTTCTTAAATTTTAATTCAGGTGCCAGTATACTGACACTGAGACTACCATTTACGTTTATTGAATGATAATTTAGTTCTTTTTCTATCCTTTAAATAAGAAATTAAAAAATAAATAGAAACAACGAGCAGAAAGGCCGCAACAACCAAGAAAAATAAGTTTTCCATATGATCTACCAGAGCGAAGTTACGGAAGTAAATGATTAGCTTGCAATTAAAAGAACGATACAAATTATCTTTATCGCACTAACATATCTTCTTTCTAAGAAGATTCCTATACGGTAATTGTGTTTTTTTTATGGAGGATGAAAAACTATACTTTCAAAAGAAAACTGGCTCGAGCTGCGGCTCCTTGGTGAAAGTTACAGTTTTAGTCTGCCACAACGAGGTAAGCGAGCTTCTGTTGAAAGAAGGGCCAGGTACTGTGAGTCGAAGATATAAAACTTTCCGCACCAGTGAACAAGGTTACGCGTAAGAGGGGAGGAGCGATGTCAGAAGAGAGAACCGCAGCTATGCCGCTGCGGTTACGAAACTGATTGCGCGATTAAGCCTTATAGTAGGCCTTATACCAGTCGACAAAATTCTGCACGCCTTCTTTGACGGTGGTCTGCGGTTTGAAGCCAACAAGATCGTACAGCGGTTTGGTATCCGCGCTGGTTTCAAGCACGTCGCCCGGCTGGATAGGCATCATGTTTTTCTCAGCTACCATTCCCAGCGCTTCTTCCAGGGCGGTGATGTAATCCATCAGTTCAACCGGAGAGCTGTTGCCGATATTGTAAACCCGATATGGAGCCGAGCTGTCTGCCGGATAGCCATTCTCGACGGTCCACTCCGCGTTCGGCTGCGGGATAACATCCTGAACGCGTACAATGGCTTCCACGATATCGTCAACGTAGGTGAAATCACGCTTCATCTTGCCGTAGTTGTACACGTCAATGCTTTTACCTTCCAGCATGGCTTTCGTGAATTTGAACAATGCCATATCCGGGCGGCCCCAAGGGCCGTACACGGTGAAGAACCGCAGGCCGGTAGTCGGTATGCCGTAAAGATGGGAGTAGGTATGCGCCATCAGCTCATTCGCTTTCTTGGTGGCGGCATAGAGTGAAACCGGATGGGCGACCGAGTCGTCGGTTGAGAACGGCATTTTACGGTTCAAACCGTAGACGGAACTGGACGATGCGTACAGCAGGTGTTCGACCTTGTGGTGTCGGCAACCTTCCAGAATATTCAAATAACCGATCAGGTTGGAGTCGGCATAGGCATGAGGATTTTCCAGCGAGTACCGAACGCCAGCCTGAGCGGCCAGATGGATCACGCGGTTAAACTTCTCTTCTGCGAACAGCTCTGCTATGCCCTGGCGATCGGCAAGATCGACTTTATGGAAACTGAAAAGGGAGGATTGCAGTAAATCCAGACGCGCCTGCTTAAGATTGACGTCGTAGTAATCATTCATGTTGTCGAGACCAACAACCTGATGACCCGCCTCAAGAAGGCGCTTGCAGGTATGGAAACCGATAAATCCTGCGGCTCCAGTGACTAAAAACTTC
ATGTTCATCCTCCGGGTTCGTTGTTTCCCGCTATATATAATTGTCGATTTTACACCGGGTTGCCTTAGGGGTCATCTGCTTTCCCGTTACAGGATCTTCATAAAGCGGTCAGAACGGGGCAGCGCTTAAGGAGCAGCGAGCGTTTTTGCGTCATATTTCCGGTATTATTCTCACAAATAGAAATAATTATCATTTGAATGTAGCTTAGCACCATAATAATACCTATAGTTTCTTATACCAATAATTGATGTTCCGCCGATAAAGTACTCAATATGAGACAGAGCGCGCCCTGCTGACTATATTGTTACTGATTATATGATTTCCGAGTATTCCGCCCTTGAATGTTCGTGATTCCTATGCGGATTGAGCCTTTTTTGATTTTTTTTAATTACATATCAATTTAGACTGGCTGACCTGCTCCCCGTTGATTAGTACACCCCGATGTTAGTAATGTCTTCATAAGCCACATGAGGACATCCCCATGAAGAAGCGTTTTTCCGACGAACAGATCATCTGTATTCTCCGCGAGGCCGAAGCCGGCGTTTCTGCCCGTGAGCTCTGCCGTAAGCACGCCATTTCAGATGCCACGTTTTACACATGGCGTAAGAAGTATGGCGGTATGGAGGTGCCTGAGGTTAAGCGCCTGAAGTCGCTTGAGGAAGAGAACGCCAGACTCAAGAAGCTGCTTGCTGAAGCCATGCTGGATAAGGAGGCGCTTCAGGTGGCTCTTGGGCGAAAGTACTGACGACAGACCAGAAGCGGGAAGCCGTGGAAGTCATGTGCGAGGCTAAGGGTCTGTCGCAACGTCGTGCCTGCAGGCTTACAGGTTTATCCCTGTCGACCTGCCGCTATGAGGCTCACCGTCCGGCTGCTGATGCGCATTTATCAGGGCGCATCACTGAGCTGGCACTGGAGCGCAGGCGTTTTGGCTACCGTCGTATTTGGCAATTGCTGCGCCGTGAAGGGCTTCATGTTAATCATAAGCGCGTGTACCGGCTTTATCACCTCAGTGGCCTGGGCGTAAAACGCAGACGACGTCGTAAAGGGCTGGCAACAGAACGTCTGCCGCTGCTCCGTCCGGCGGCGCCCAATCTGACCTGGTCGATGGATTTCGTCATGGACGCACTTTCCACCGGTCGCAGGATCAAGTGTCTTACCTGCGTCGATGATTTCACAAAGGAATGCCTGACGGTCACTGTTGCCTTTGGGATTTCAGGCGTTCAGGTCTCGCGTATTCTGGACAGCATTGCACTGTTTCGAGGCTATCCGGCGACGATAAGAACTGACCAGGGGCCGGAGTTCACTTGCCGTGCACTGGATCAATGGGCCTTTGAGCATGGTGTTGAGTTGCGCTTAATCCAGCCGGGCAAGCCAACGCAGAACGGATTTATTGAGAGCTTTAACGGACGATTTCGCGATGAATGCCTGAATGAGCACTGGTTCAGCGATATCGTTCACGCAAGGAAAATGATTAATGACTGGCGGCAGGATTATAACGAGTGTCGTCCACATTCAGTAGACTGGCCCCCTGAATCTCCAGACAGTTGGTATCACTTAAGTTATTGATAGTCTTAATACTAGTTTTTAGTAAGCGTCAAGTCACTGCCGCCTGTGATTCCAGTCCCGGGATCGCTAGCTTAGAGCTCCGTCTAATTTAGAAGGCGTTCTGTTCATGGAAAATGCTGCCAACTGGCGAACTGAATCGCGTACCGTCTATTCCAGTGACTTCAAACTTCGGATGGTCGAACTGGCTTCACGACCAGATGCCAACGTCGCACAACTGGCGCGGGAACATGGCGTTGATAATAATCTCATTTTTAAGTGGCTACGCCTCTGGCAGAGAGAGGGGCGAATCTCTCGTCGAATGCCTGCAACTATCGTGGGGCCGGTGGTACCTCAATAGACTCAATCTCTTCCGGCTTCTCCGACTCTGTTGTCCGTCGACGTTATCAACGACCCGTTGCCCGCAGCAGAGAATGACACTCTGTGTACGTTCTCCT
CCGCTCACGCCAGCGCCACTTCCTGTCATGTTGAGTTCCGCCACGGCAAAATGACGCTGGAGAACCCGTCGTCAGAGTTGCTGACTGTGCTGATCCGCGAACTGACCGGGAGGACACAATGATATCCCTCCCGTCAGGCACTCGCATCTGGCTGGTCGCCGGGGCCACGGATATGCGTAAATCCTTCAACGGGCTGGGCGAACAGATACACCACGTTCTGGATGAGGATCCCTTCTCCGGCCATCTGTTTATCTTCCGGGGACGCCGTGGCGATACCGTGAAAATACTTTGGGCTGATGCTGACGGTCTGTGCCTGTTTATCAAACGCCTGGAAGAGGGACAGTTCGTCTGGCCTGCTGTACGCGACGGCAAAATCGCGATCACCCGCTCACAACTTGCCATGCTCCTTGATAAGCTGGACTGGCGCCAGCCAAAAATATCCCGCCTTAACTCACTGACAATGTTGTAAAAAAATCATAACCATATTATAAAAGCGGTTATGAATCACGACTATCTCGCCCGTATCGCTGCGCTGGAAGACACGCTTCGCCAGAAAGACAGTCAGCTCAGTCTCGTTGCAGAGACTGAGTCGTTCCTGCGTTCTGCACTGGCCCGCGCAGAAGAGAAAATAGAGAACGAAGAGCGCGAGATAGAGCATCTGCGGGCACAGATAGAAAAACTGCGTCGAATGTTGTTCGGTACCCGCTCCGAAAAACTTCGCCGACAGGTTGAAGAAGCCGAAGCCCTGCTGAAACAGCAGGAGCAGCAAAGCGATCGCTACAACGGACGGGAAGACGATCCGCAGGTACCTCGCCAGTTGCGTCAGTCCCGCCATCGTCGCCCGTTACCGGCACACCTTCCCCGCGAGATACATCGGCTGGATCCTGCGGAAACCAGCTGTCCGGAATGCGGCAGCGGTATGGCGTACCTCAGCGAAGTCAGCGTGGAGCAGCTGGAACTGGTCTCCAGCGCCCTGAAAGTGATCCGCACGGTCAGGGTGAAAAAGGCCTGCACCCGATGTGACTGTATCGTTGAAGCACCTGCGCCATCACGTCCCATCGATCGGGGTATCGCCGGGCCGGGTCTGCTGGCCCGCGTGTTAACGGCCAAATACTGCGAACACCTGCCGCTGTATCGCCAGTGCGAAATCTTTGCCCGTCAGGGCGTGGATCTGAGCCGGGCTCTGCTCTCCAACTGGGTGGATGCATGTTGCCGGTTAATGGCCCCGCTGGATGAGGCCCTTTACCACTACGTGATGGACTGCCACAAACTGCATACGGATGACACTCCGGTTCCGGTGCTTGCACCGGGCAGAAAGAAGACGAAAACCGGGCGTATCTGGACGTATGTACGTGACGACAGAAGCGCAGGTTCATCAGATCCGCCAGCGGCATGGTTCGCCTTCTCGCCAGACCGGCAGGGAAAACATCCTCAGCAACACCTTCGCCACTATCATGGTGTGCTGCAGGCGGATGCCTTCGCAGGCTACGATCGGTTGTTCAGTCCGGAACGTGAAGGAGGGCCGCTGACAGAAGCCGCATGTTGGGCCCATGCCCGCCGAAAAATCCATGATGTCTATATAAGCACTCACACAGCGACAGCAGAAGAAGCCCTGAAACGTATCGGTGAGCTGTACGCGATAGAAGAAGAAATACGCGGCCTCACGACAGAAGAGCGTCTGGCAGCCAGACAATCGCAAAGCAAACCACTGCTGGCATCGCTGCATGAATGGCTGGTAGAAAAAAATGAGACGCTGTCGAAAAAGTCCCGTCTGGGCGAAGCGTTCGCTTATGTCCTGAACCAGTGGGATGCGCTGTGCTACTACTGTGAAGATGGCCTGGCAGAGCCTGATAACAACGCTGCCGAACGAGCCCTTCGTGCCGTCTGTCTTGGGAAAAAAAATTTTATCTTCTTCGGCAGCGACCACGGTGGTGAGCGCGGAGCCCTGCTGTACGGACTGATCGGGACGTGCAGGCTGAATGGTATCGATCCGGAAGCCT
ACCTTCGCCATATCCTGAGCGTACTGCCGGAGTGGCCCAGCAACAAAGTGGCCGAACTGCTGCCATGGAACGTGGTTCTTACCGATAAATAACCATCAATACGGCTCTCACTTAACGCTTACAGTTTTTAGACTAGTCATTGGAGAACAGATGATTGATGTCTTAGGGCCGGAGAAACGCAGACGGCGTACTACACAGGAAAAGATCGCTATCGTTCAGCAGAGTTTTGAACCGGGAATGACGGTCTCCCTTGTTGCCCGGCAACATGGTGTGGCAGCCAGCTATTTTTCTGGCGTAAGCAATACCAGGAGGGAAGTCTTACTGCTGTGGCTGCCGGAGAACAGGTCGTTCCTGCCTCTGAACTTGCTGCCGCCATGAAGCAGATTAAAGAACTCCAGCGCCTGCTCGGCAAAAAAACGATGGAAAATGAACTCCTTAAAGAAGCCGTTGAATATGGGCGAGCAAAAAAGTGGATAGCGCACGCGCCCTTATTGCCCGGGGATGGGGAGTAAGCTTCGTCAGCCGTTGTCTCCGGGTGTCGCGTGCGCAGTTGCACGTCATTCTCAGACGAACCGATGACTGGAAGGACGGCCGCCGCAGCCGTCACACGGATGATACGGATGTGCTTCGCCGTATACACCATGTTATCGGAGAGCTGCCCACATATGGTTATCGTCGGGTATGGGCGCAGCTTCGCAGACAAACAGAATCTGATGGTATGCCTGCGATCAATGCCAAACGTGTTTACCGGATCATGCGCCAGAATGCGCTGTTGCTTGAGCGAAAACCCGCTGTACCGCCATCGAAACGGGCACATACCGGCAGAGTGGCTGTGAAAGAAAGTAATCAGCGATGGTGCTCTGACGGGTTTGAGTTCCGCTGTGATAACGGAGAAAAACTGCGGGTCACGTTCGCGCTGGACTGCTGTGACCGTGAGGCACTGCACTGGGCGGTCACAACGGGTGGCTTCGACAGTGAAACAGTACAGGACGTCATGCCGGGAGCAGTGGAACGCCGCTTTGGCAGCGAGCTTCCGGCGTCTCCAGTGGAGTGGCTGACGGATAATGGTTCATGCTACCGGGCGAATGAAACACGTCAGTTCGCCAGGATGTTGGGACTTGAACCGAAGAACACGGCAGTGCGGAGTCCGGAGAGTAACGGAATAGCAGAGAGCTTCGTGAAAACGATAAAGCGTGACTACATAAGTATCATGCCCAAACCAGACGGGTTATCGGCAGCAAAGAACCTTGCAGAGGCGTTCGAGCATTATAACGAATGGCATCCGCATAGTGCGCTGGGTTATCGCTCGCCACGGGAATATCTGCGGCAGCGGGCCAGTAATGGGTTAAGTGATAACAGGTGTCTGGAAATATAGGGGCAAATCCACAATTGGTTATGATAGCGCTTTTGAATGGTATAAATAATGTATCCTTATTGTTTACTTTTCACATGAATTGATTTGCCATGCTTTTTGATTTTTATTCTTAGTTACCAGCGTTATAATGCGCGGATAATCTATGCGTCGCAGGAGTGTTGTTTTTCAGGGTAACGCTTCCTCAGAACATCCGATGAGTCGTGTGACAAACCCGGAAATGGGTACTCTTTATCTGGGGGGAAGAATATGAGCAATGTGCTAGAACATTGCGTCAGTTTGGCTACCCTTGATGCTTTAGTGATGCATACAGCAGCTTTACTGGCGCGTGCAGTCGGCTTTCAACGACCGTACTGGTGATGTTGGTGAACCCTTGAGCTTGCTAAATGAGAAAGTCCTGATAGATAGCCGGCAGCAAAACGGGGAGAGTAGCGCTCGCGCATCTACTGCCCGGTTGTGCGGCATAGGGCCTGAAAACAGAGATATGGTTTCCGAGATATATCTCTGTTTCAGTTGTTTATCGCGGATCTGTCAGTCTGTTTCGTTTGTGGCAGCAGAACGCAGCGCAAAGGCCAGCCGTCAGGTAGCAGCAAGAGCATGCTGTGACCGACTGGCCAGAGCCATAATC
ATGAAAGTAGTACAGCAGTGTTACGCGCTTTGGTTGCGATAGCCAATGGCGGTAGCGTGCCTGAAATTTATGATATATGTTGATTAACATATAACCATTTGAAAAATGAGATAAAGTATGAAGATCCTTGTTACGGGTGGCGCGGGTTTTATCGGTTCCGCCGTTGTTCGTCATATTATTGAAAATACCCAGGATGACGTTCGCGTCGTGGACTGCCTGACGTATGCGGGTAATCTCGAGTCTCTGGCATCCGTTGCCCAAAATGAACGCTATTCGTTTTCACGCACGGATATCACCGACGCGCAGGGGATCGCCGGGCAGTTCAGTGAGTTTCAGCCTGATATCGTGATGCACCTGGCGGCTGAAAGCCACGTTGACCGCTCTATTGACGGTCCTGCAGCCTTTATCCAGACTAACCTGATCGGCACGTTTACGCTGCTTGAAGCAGCGCGTCACTACTATCAGTCGCTGAACGAAGCGCAGAAACAACGTTTACGCTTCCACCATATCTCCACCGATGAAGTCTACGGCGACCTGCATGGCACCGATGATCTGTTTACGGAAGAAACCTCCTACGCACCGAGCAGCCCGTACTCGGCCTCGAAAGCCGGTAGCGATCATTTAGTGCGCGCCTGGAATCGCACCTATGGACTGCCGGTGGTGGTGACTAACTGCTCGAACAATTACGGTCCGTATCACTTCCCGGAAAAACTGATCCCGCTGACTATTCTCAACGCGCTGGCCGGCAAACCGCTGCCGGTCTATGGCAACGGTGAACAGGTCCGCGACTGGCTGTACGTCGAAGATCACGCGAGAGCGCTGTATAAGGTCGCTACCGAAGGGCATAGCGGCGAAACCTACAATATCGGCGGCCATAACGAGCGTAAAAATATCGATGTGGTCAGAACCATCTGCGCCATCCTTGATAAAGTGGTGGAGCAGAAACCGGGCAATATCAGCCACTTTGCCGATCTGATTACCTTTGTCGAAGACCGCCCGGGCCATGACCTGCGCTACGCTATCGATGCGGCCAAAATTCAGCGCGATCTCGGCTGGGTGCCGGAAGAAACGTTCGAAAGCGGCATTGAAAAAACGGTGCACTGGTATCTGAACAATCAGACCTGGTGGCAGCGCGTGCTGGATGGCTCCTATGCCGGCGAGCGTCTTGGTCTGAATAACTAATTAAGGTGGATTGCAGAATGAAAGGTATTGTTCTGGCCGGCGGGTCCGGCACCCGCTTGTACCCGATTACTCAGGGTGTGTCGAAACAGCTGCTGCCGATTTACGACAAACCGATGATTTACTATCCGGTTTCGGTGCTGATGCTTGCCGGTATTAGGGATATCCTGATTATCACCACGCCGGAAGATATGCCGGCGTTCCAGCGACTGCTGGGCGATGGTGCGCAGTTTGGCGTCAATTTCTCCTATGCAATCCAGCCGTCCCCGGACGGTCTGGCGCAGGCCTTTATCATCGGTGAAGAGTTTATCGGAGATGATTCCTGCGCGCTGGTGCTGGGCGATAATATTTACTTCGGGCAAAGCTTCGGTAAAAAGCTGGAAGCAGCGGCCAGCAAAACGTCGGGCGCTACCGTCTTCGGCTACCAGGTTCTCGACCCTGAACGCTTTGGCGTGGTGGAGTTTGATGAAAACTTCAGGGCGCTGTCGATCGAAGAGAAACCGCTGAAGCCGAAATCAAACTGGGCAGTAACCGGCCTCTATTTTTATGACAACGACGTCGTAGAGATGGCGAAGGAAGTGAAGCCTTCCGCGCGCGGCGAGCTGGAAATTACCACCCTTAACCAGATGTATCTGGAGCGTGGCGACCTGCAGGTTGAACTGCTGGGACGCGGTTTTGCCTGGCTTGATACCGGTACCCACGACAGCCTGATGGAAGCCTCTCAGTTCATTCATACCATTGAGAAGCGCCAGGGCATGAAAGTGGCCTGTCTGGAAGAGATAGCCTTACGGAATAAATGGCTGTCTGCCGAAGGGGTTGCG
GCTCAGGCCGAGCGGCTTAAGAAAACGGAATATGGCGCTTATTTGAAACGTCTGCTCAATAAGCGTCAACCGTCTGCCGACCAAATACAGGGTATGAATTAATGAAAATTTTACTTATTGGCAAAAATGGTCAGGTTGGCTGGGAGCTGCAGCGCTCGCTGTCCACGCTGGGTGAGGTGGTCGCCGTTGACTATTTCGATAAGGAACTATGCGGCGATCTGACGGATCTTGCCGGGATTGCGCAAACCGTCCGCCAGGTTAAACCGGATGTGGTTGTCAATGCGGCCGCGCATACGGCGGTGGATAAAGCCGAAAGCGAGCGCGAGCTATCCGATCTGCTGAATGAGCGGGGCGTGGCGGTTCTGGCGGAGGAGTCGGCGAAGCTGGGCGCCCTGATGGTGCACTACTCCACCGACTATGTTTTTGACGGAGAAGGCGAGCATTATCGCCCTGAAGACGAGGCAACCGGGCCGCTGAACGTTTATGGCGAAACCAAGCGCGCCGGTGAACTGACGCTGGCGCAGGCGAATCCTCGTCATCTGATTTTCCGTACAAGCTGGGTCTACGCGACGCGTGGGGCCAATTTTGCCAAAACCATGCTGCGCCTGGCGGGCGAAAAAGAGACGCTGTCGATTATTAACGACCAACACGGCGCGCCGACCGGTGCCGAGCTGCTGGCTGACTGTACCGCCATCGCCATTCGTGAAGAGTTGCGTAACAGGGCGGTTGCCGGAACCTATCATTTGGTGGCCAGCGGTGAAACCAGCTGGTATGACTATGCCCGCTTTGTCTTTGATATCGCGCGCGCCAACGGAGCTGAGCTGGCGATCAAAGAGGTCAACGGGATCCCGACGACGGACTACCCGACACCGGCTAAACGTCCGCTGAATTCCCGCCTGTTGAATGAGAAATTCCAGAGGGTATTTGGCGTAACGCTCCCGGATTGGCGCCAAGGCGTTGAGCGCGTTGTTGTTGAAGTTTTAGGTAAGTAGGTAAATACAAAAGCATGAATATTATTAAAACAGACATTCCTGAGGTCTTAATCTGCGAGCCGAGAGTCTTTGGCGACGCGCGCGGGTTTTTCTTTGAAAGCTTCAGCAGCAAGATCTTTGATGAGGCGGTAGGTCGCAAAGTTGAGTTTGTGCAGGATAACCACTCGCAGTCGCAGAAAGGCGTGCTGCGCGGGCTGCACTATCAGCTGGCCCCGCATGCGCAGGGCAAACTGGTGCGCTGCGTTGAGGGCGAGGTATTTGATGTTGCCGTCGACGTACGTCGCTCGTCGCCAACGTTTGGTCAGTGGGTTGGAGTGGAGTTGAGCGCAGAAAACAAGCGTCAGCTGTGGATCCCGGAAGGGTTTGCGCACGGATTTATGGCGCTGAGCGAGACGGTGCAGTTTGTCTATAAAGCGACCAACTACTACGCGCCGCAGTCGGAGCGCAGCATTCTCTGGAACGATCCGGAAATTGGCATTGAGTGGCCTGAGCTAAGCGGGTGCGCGCTTTCATTGTCAGAGAAAGATATGCAGGCTCATACTCTGGCGACAGCTGAAGTATATGCCTGACCGTAAGTCATTTGTAGATTTGATATGCTTAACGCATCCTCTTTTATTCATTAAGAATTATAATAATAATTAAATAGCAATGTTTACTCTAAAGGTAATAGATTCATGACTTATGAAGCAATGAAGCCGAAGATAATCGCTTCTATTGTATTATTTAATCATTCCTATGATGATATTAAAGATACGTTGATCTCGTTATGTCATGAGAGTGGTGTTGAAAAAATTGTCTTAGTTGATAATGGAGGATGTCAGTGGGTTACAGAGCTGAGCGAGCCGAAGATTAGTTATATCAAATCTCCTTATAACTGCGGGTTCGGTGCTGGTCACAATCTGGCAATCAAAGCTAATACTGATTTTGATGGTTATTTTCTGATTTGTAACCCGGATGTTAGTTTTGACAGAGGCGAGCTGGATAAATTATTGTCCTTTGCCTGGGATA
ACCAGTTCCCCTTTGTCTCGCCTAAAATTCTCTATAAAAATGGAGATCGGCAATACAGCTGTCGCTTACTGCCCACGCCGGTTAATCTGTTTTTAAGGCGTTTTTTACCCACCACTGCGGTGAAGTTTGACGTTGAGTATGAGCTGATGGATGCCGAGTACGATAAGGTCTTCTCTCCGCCTTCTGTCTCCGGCTGCTTCATGCTGCTAACGAATAGTTTGCTCCAGAAGCTGAATGGGTTTGATGAGCGTTACTTCATGTATTTAGAAGACGTCGATCTATGTCGCCGCGCGCTGCCGTTCACTAAGCTATATTATTTTCCGGATGCGACAATCATTCACCTTTTCAATAAAGGGTCATATAAAAGCAAGCTGTTGCTTTGGTATCATGTTCGCTCTGCGATTTTCTACTTTAATAAGTGGGGGTGGTTTTTTGACCGTAAGAGGCATGCGTACAATAAAAAAGCGCTGCTTGAAATCCCCCGGAAGTCAGGTTGATTGTTCAGATAATTTTATTACGAGTTTAAAATGAAAAACCCGCATCAAAAACTTTCATCATCACCCGTAGAGATTGCACGTAGTATTTTTGGCCATCGTGGGATCATCCTTCAGATGGCTAAACGGGATGTCATCGGAAGATACAAAGGATCGGTGATGGGCCTGCTTTGGTCCTTCCTGAATCCGCTATTCATGCTAACCGTATATACCTTCGTGTTCTCCGTCGTGTTCAAAGCTCGCTGGACCGCCGGTGGCGATGAGAGCAGAACGCAGTTTGCGATAATCTTGTTTGTCGGTATGATCGTGCACGGTTTCTTCAGTGAGGTCATCAATAAGGCGCCACTGATCATTCTTAGCAACACTAACTATGTTAAAAAGGTCATATTTCCGCTTGAAACCTTGCCGGTGATTTCCCTGCTGGCGGCTCTTTTCCATACCTGCATCAGCCTGTTCGTTCTGCTGGTGGCTTTTGTGCTTTTTAACGGCTTTTTGCACTGGACGATTGTGTTCCTGCCGTTGGTATTCTTCCCGCTAGTCATTTTCTGTCTTGGCTTATCGTGGATCCTCGCCTCATTGTGCGTTTTCCTGCGCGACGTCAGCCAGACTACGGTGATTATCACCAACGTCATGATGTTCTTATCTCCGGTATTTTTCCCGATTAGTGCTTTGCCTGCGAAATATCATATTTGGATTATGCTCAACCCGCTGACCTTTATTATTGAACAAGCCCGAAGCGTTCTGATTTGGGGCGGTATGCCTAATTTCCTGGGTCTTTTACTCTATTCCATGGGCGCCGCAGTTGTCGCCTGGCTGGGTTTTGTCTTCTTCCAGAAAACAAGGAAGGGATTTGCTGATGTCCTCTAATGAAATTGTTATCCAGGTCACTAATCTGAGCAAATGCTATCAGATCTACGCCAGTCCGACCGATCGTTTAAAACAGTTTTTTGTGCCAAAAATTCAGCAGGTTGCCAGAAGAGAGCGCAACTGCTACTTCCGCGAGTTCTGGGCATTAGACGATGTCTCCTTCAGCATCAAGAAGGGCGAAACCGTAGGTATTATCGGCCGTAACGGGGCGGGTAAATCGACGCTTCTGCAAACGATCTGCGGAACCTTAACGCCTTCTGCGGGAGAAGTCCGGGTCAATGGGCGCATTGCCGCCCTGTTGGAGCTCGGTGCTGGCTTTAACCCCGAGTTTACCGGGCGTGAAAACGTCTATATGAACGGTTCGGTGCTGGGGCTAACGAAAGAGCAAATTGCGGCGAAGTTTGCTGAGATTGAAGAGTTTGCCGACATTGGCCAGTTTATCGATCAGCCCGTGAAGACCTACTCCAGCGGGATGTACGTGCGCCTGGCTTTTGCCGTTCAGGCCTGTGTAGAACCTGAGATCTTAATTGTCGACGAGGCGTTGGCGGTTGGCGACATTGGCTTCCAGTACAAGTGCTATAAGCGGATGGAAGCGCTGCGTGCGAAGGGGGTAACGATCATTATGGTGACCCATTCCAC
CGGCAGCATTCTTGAATATGCAGACCGCTGTCTGGTGATGGAGCACGGTAAGCTGATTGGCGATACCACGGATGTGCTGGCCGCGGTTCTGGCCTATGAAAAAGGGATGATCCTTGCTAACGGGAGCGAGAAGCCAGCCGCAAGCCGCACTGCGGACCAGGGGTCAACTGAAGAAGTTAGCGTCGAAGAGCTGAAAGCCATCCAGCTGCGCACCACCAATGAGGCTACTGGCGAGAAGCGCTTCGGCAGTGCGCGTGCAATCATTGAAGATCTGACGATTTATAAATCGGATGGCACCACTCTGGCCGAAAAGCCGCTGATTAAATCAGGCGAAGAAGTGACATTTGATTTTACCATTCTGGCTAGCGAAGAAATTAAAGATATCGCGTTAGGGATCTCCATGTCGAAAGCGCAGGGGGGCGATATTTGGGGAGACAGTAATATCGGCGCCGGTTCAGCAATTACGCTGCGACCCGGCCGCCAGCGCATCGTGTATAAAGCGACGCTGCCTGTTAACTCGGGCGATTATCTCATTCACTGTGGCCTGGCCAAGGTTGGCAACGGCGATCGGGAAGAACTCGATCAGCGTCGTCCGATGATGAAAGTTAAATTCTGGTCTGCAAGGGAGTTGGGTGGTGTGATTCACGCTCCGTTGAAAATTGTTTCGAATGGAGAGTAA
Protein sequences of DBSCAN-SWA_2 >LR133964|1682603:1703207|1684249_1685416_+|VDY59460.1|DBSCAN-SWA MKITISGTGYVGLSNGILIAQNHEVVALDIVQSKVDMLNQKISPIVDKEIQEYLAEKPLNFRATTDKQDAYRNADYVIIATPTDYDPKTNYFNTSTVEAVIRDVAEINPAAVMIVKSTVPVGFIRDIKERLGIDNVIFSPEFLREGRALYDNLHPSRIVIGERSERAERFANLLKEGAIKQDIPTLFTDSTEAEAIKLFANTYLALRVAYFNELDSYAESQGLNSKQIIEGVCLDPRIGNHYNNPSFGYGGYCLPKDTKQLLANYKSVPNNIIGAIVDANRTRKDFIADSILARKPKVVGVYRLIMKSGSDNFRASSIQGIMKRMKAKGIPVIIYEPVMQEDEFFNSRVVRDLEAFKQEADVIVSNRMAEELADVADKVYTRDLFGND >LR133964|1682603:1703207|1682603_1684010_+|VDY59459.1|DBSCAN-SWA MSKQQIGVVGMAVMGRNLALNIESRGYTVSVFNRSREKTEEVIAENPGKKLVPYYTVKEFIESLETPRRILIMVKAGAGTDSAIDSLKPYLDKGDIIIDGGNTFFQDTIRRNRDLSADGFNFIGTGVSGGEEGALKGPSIMPGGQKEAYELVAPILEQIAARAEDGEPCVTYIGADGAGHYVKMVHNGIEYGDMQLIAEAYALLKGGLALSNEELATTFTDWNHGELSSYLIDITKDIFTKKDDEGKFLVDMVLDEAANKGTGKWTSQSSLDLGEPLSLITESVFARYISSLKDQRVAASKVLTGPQAQPASDKTEFIEKVRRALYLGKIVSYAQGFSQLRAASNEYSWDLNYGEIAKIFRAGCIIRAQFLQKITDAYEENAGIANLLLAPYFKQIADEYQQALRDVVAYAVQNGIPVPTFSAAIAYYDSYRSAVLPANLIQAQRDYFGAHTYKRTDKEGVFHTEWME >LR133964|1682603:1703207|1699534_1700089_+|VDY59473.1|DBSCAN-SWA MNIIKTDIPEVLICEPRVFGDARGFFFESFSSKIFDEAVGRKVEFVQDNHSQSQKGVLRGLHYQLAPHAQGKLVRCVEGEVFDVAVDVRRSSPTFGQWVGVELSAENKRQLWIPEGFAHGFMALSETVQFVYKATNYYAPQSERSILWNDPEIGIEWPELSGCALSLSEKDMQAHTLATAEVYA >LR133964|1682603:1703207|1694678_1694906_+|VDY59468.1|DBSCAN-SWA MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASYFSGVSNTRREVLLLWLPENRSFLPLNLLPP >LR133964|1682603:1703207|1694997_1695903_+|VDY59469.1|transposase|DBSCAN-SWA MDSARALIARGWGVSFVSRCLRVSRAQLHVILRRTDDWKDGRRSRHTDDTDVLRRIHHVIGELPTYGYRRVWAQLRRQTESDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFRCDNGEKLRVTFALDCCDREALHWAVTTGGFDSETVQDVMPGAVERRFGSELPASPVEWLTDNGSCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLSAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQRASNGLSDNRCLEI >LR133964|1682603:1703207|1697727_1698630_+|VDY59471.1|DBSCAN-SWA 
MKGIVLAGGSGTRLYPITQGVSKQLLPIYDKPMIYYPVSVLMLAGIRDILIITTPEDMPAFQRLLGDGAQFGVNFSYAIQPSPDGLAQAFIIGEEFIGDDSCALVLGDNIYFGQSFGKKLEAAASKTSGATVFGYQVLDPERFGVVEFDENFRALSIEEKPLKPKSNWAVTGLYFYDNDVVEMAKEVKPSARGELEITTLNQMYLERGDLQVELLGRGFAWLDTGTHDSLMEASQFIHTIEKRQGMKVACLEEIALRNKWLSAEGVAAQAERLKKTEYGAYLKRLLNKRQPSADQIQGMN >LR133964|1682603:1703207|1687578_1688064_-|VDY59463.1|transposase|DBSCAN-SWA MDALATGRRIKCLTCVDDFTKECLTVTVAFGISGVQVTRILDSIALFRGYPATIRTDQGPEFTCRALDQWAFEHGVELRLIQPGKPTQNGFIESFNGRFRDECLNEHWFSDIVHARKIINDWRLDYNECRPHSSLNYLTPAEFAAGWRNGKYEEKPTDITN >LR133964|1682603:1703207|1687238_1687481_-|VDY59462.1|DBSCAN-SWA MTVSLVARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELASAMKQIKELQRLLGKKTMENELLKEAVEYGRQKSG >LR133964|1682603:1703207|1698629_1699520_+|VDY59472.1|DBSCAN-SWA MKILLIGKNGQVGWELQRSLSTLGEVVAVDYFDKELCGDLTDLAGIAQTVRQVKPDVVVNAAAHTAVDKAESERELSDLLNERGVAVLAEESAKLGALMVHYSTDYVFDGEGEHYRPEDEATGPLNVYGETKRAGELTLAQANPRHLIFRTSWVYATRGANFAKTMLRLAGEKETLSIINDQHGAPTGAELLADCTAIAIREELRNRAVAGTYHLVASGETSWYDYARFVFDIARANGAELAIKEVNGIPTTDYPTPAKRPLNSRLLNEKFQRVFGVTLPDWRQGVERVVVEVLGK >LR133964|1682603:1703207|1696645_1697710_+|VDY59470.1|DBSCAN-SWA MKILVTGGAGFIGSAVVRHIIENTQDDVRVVDCLTYAGNLESLASVAQNERYSFSRTDITDAQGIAGQFSEFQPDIVMHLAAESHVDRSIDGPAAFIQTNLIGTFTLLEAARHYYQSLNEAQKQRLRFHHISTDEVYGDLHGTDDLFTEETSYAPSSPYSASKAGSDHLVRAWNRTYGLPVVVTNCSNNYGPYHFPEKLIPLTILNALAGKPLPVYGNGEQVRDWLYVEDHARALYKVATEGHSGETYNIGGHNERKNIDVVRTICAILDKVVEQKPGNISHFADLITFVEDRPGHDLRYAIDAAKIQRDLGWVPEETFESGIEKTVHWYLNNQTWWQRVLDGSYAGERLGLNN >LR133964|1682603:1703207|1701181_1701889_+|VDY59475.1|DBSCAN-SWA MGLLWSFLNPLFMLTVYTFVFSVVFKARWTAGGDESRTQFAIILFVGMIVHGFFSEVINKAPLIILSNTNYVKKVIFPLETLPVISLLAALFHTCISLFVLLVAFVLFNGFLHWTIVFLPLVFFPLVIFCLGLSWILASLCVFLRDVSQTTVIITNVMMFLSPVFFPISALPAKYHIWIMLNPLTFIIEQARSVLIWGGMPNFLGLLLYSMGAAVVAWLGFVFFQKTRKGFADVL >LR133964|1682603:1703207|1690625_1690742_-|VDY59465.1|DBSCAN-SWA MVLSYIQMIIISICENNTGNMTQKRSLLLKRCPVLTAL >LR133964|1682603:1703207|1700194_1701025_+|VDY59474.1|DBSCAN-SWA 
MTYEAMKPKIIASIVLFNHSYDDIKDTLISLCHESGVEKIVLVDNGGCQWVTELSEPKISYIKSPYNCGFGAGHNLAIKANTDFDGYFLICNPDVSFDRGELDKLLSFAWDNQFPFVSPKILYKNGDRQYSCRLLPTPVNLFLRRFLPTTAVKFDVEYELMDAEYDKVFSPPSVSGCFMLLTNSLLQKLNGFDERYFMYLEDVDLCRRALPFTKLYYFPDATIIHLFNKGSYKSKLLLWYHVRSAIFYFNKWGWFFDRKRHAYNKKALLEIPRKSG >LR133964|1682603:1703207|1701878_1703207_+|VDY59476.1|DBSCAN-SWA MSSNEIVIQVTNLSKCYQIYASPTDRLKQFFVPKIQQVARRERNCYFREFWALDDVSFSIKKGETVGIIGRNGAGKSTLLQTICGTLTPSAGEVRVNGRIAALLELGAGFNPEFTGRENVYMNGSVLGLTKEQIAAKFAEIEEFADIGQFIDQPVKTYSSGMYVRLAFAVQACVEPEILIVDEALAVGDIGFQYKCYKRMEALRAKGVTIIMVTHSTGSILEYADRCLVMEHGKLIGDTTDVLAAVLAYEKGMILANGSEKPAASRTADQGSTEEVSVEELKAIQLRTTNEATGEKRFGSARAIIEDLTIYKSDGTTLAEKPLIKSGEEVTFDFTILASEEIKDIALGISMSKAQGGDIWGDSNIGAGSAITLRPGRQRIVYKATLPVNSGDYLIHCGLAKVGNGDREELDQRRPMMKVKFWSARELGGVIHAPLKIVSNGE >LR133964|1682603:1703207|1686340_1687261_-|VDY59461.1|DBSCAN-SWA MADKKVDSARALVAGGWRISLVSRCLRVSRAQLHAMARRSKDWQDRRCKRKPDDTDALARIHTVIGDLPTYGYRRVWALLRRQSETDDMAVINAKRVYRIMRQNALLLERKPEIPPSKRAHTGKVAVGESNQRWCSDGFEFSCDNGEKLRVTFALDCCDREALYWAASNGGYDSETVQDVMLGAVERRFGNSLPTSPVEWLTDNGSAYRSYQTRQFARMVGLEPKHTAVRSPESNGMAESFVKTMKRDYISIMPKPDGLTAVKNLAEAFEHYNEWHPHSALGYRSPREYLRRRTSNGLSDKKCLEI >LR133964|1682603:1703207|1689525_1690530_-|VDY59464.1|DBSCAN-SWA MKFLVTGAAGFIGFHTCKRLLEAGHQVVGLDNMNDYYDVNLKQARLDLLQSSLFSFHKVDLADRQGIAELFAEEKFNRVIHLAAQAGVRYSLENPHAYADSNLIGYLNILEGCRHHKVEHLLYASSSSVYGLNRKMPFSTDDSVAHPVSLYAATKKANELMAHTYSHLYGIPTTGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYVDDIVEAIVRVQDVIPQPNAEWTVENGYPADSSAPYRVYNIGNSSPVELMDYITALEEALGMVAEKNMMPIQPGDVLETSADTKPLYDLVGFKPQTTVKEGVQNFVDWYKAYYKA >LR133964|1682603:1703207|1692646_1692997_+|VDY59466.1|transposase|DBSCAN-SWA MISLPSGTRIWLVAGATDMRKSFNGLGEQIHHVLDEDPFSGHLFIFRGRRGDTVKILWADADGLCLFIKRLEEGQFVWPAVRDGKIAITRSQLAMLLDKLDWRQPKISRLNSLTML >LR133964|1682603:1703207|1693027_1694620_+|VDY59467.1|transposase|DBSCAN-SWA 
MNHDYLARIAALEDTLRQKDSQLSLVAETESFLRSALARAEEKIENEEREIEHLRAQIEKLRRMLFGTRSEKLRRQVEEAEALLKQQEQQSDRYNGREDDPQVPRQLRQSRHRRPLPAHLPREIHRLDPAETSCPECGSGMAYLSEVSVEQLELVSSALKVIRTVRVKKACTRCDCIVEAPAPSRPIDRGIAGPGLLARVLTAKYCEHLPLYRQCEIFARQGVDLSRALLSNWVDACCRLMAPLDEALYHYVMDCHKLHTDDTPVPVLAPGRKKTKTGRIWTYVRDDRSAGSSDPPAAWFAFSPDRQGKHPQQHLRHYHGVLQADAFAGYDRLFSPEREGGPLTEAACWAHARRKIHDVYISTHTATAEEALKRIGELYAIEEEIRGLTTEERLAARQSQSKPLLASLHEWLVEKNETLSKKSRLGEAFAYVLNQWDALCYYCEDGLAEPDNNAAERALRAVCLGKKNFIFFGSDHGGERGALLYGLIGTCRLNGIDPEAYLRHILSVLPEWPSNKVAELLPWNVVLTDK |
18 | Shigella_phage(26.67%) | transposase | NA |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
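The keyword rule in the legend can be illustrated with a minimal sketch. The keyword list is copied from the legend; the function names and the case-insensitive substring matching are assumptions made for illustration, not the database's actual implementation.

```python
# Keywords quoted from the legend; the matching rule below is an assumption.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def matched_keywords(annotation: str) -> list[str]:
    """Return the legend keywords found in a protein annotation.

    Matching is a case-insensitive substring test, so 'lysin' also
    matches 'endolysin'. This is a hypothetical rule for illustration.
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in text]


def is_highlighted(annotation: str) -> bool:
    """True if the annotation contains at least one phage-related keyword."""
    return bool(matched_keywords(annotation))


if __name__ == "__main__":
    for label in ("transposase", "hypothetical protein", "Phage tail fiber protein"):
        print(label, "->", matched_keywords(label))
```

Plain substring matching is the simplest choice but can over-match (for example 'head' inside an unrelated word); a stricter variant would match whole words only.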
|
DBSCAN-SWA_3 |
2628946 : 2639833
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >LR133964|2628946:2639833|DBSCAN-SWA TATGCAAATTAGCGATACCGGCCGCAGCCACACTCCTGACTTTCACGCCGTCCTCGCCCGTGAAGACTGGCAGAACCAGACCATTACCCACCTTAACCGCCTGCCAGCGCATCCCGTTTTCGCCAGCTGGCGCGATGAGCTTGCCGCCCGCGATAACCTACCTTCATCCCGCCGCCGTCAACTGGACGGCGAGTGGCAGTTCTCTTACGCCCGTAGCCCGTTTGCCGTCGATGCGCAGTGGTTGACGCAGGATCTACCGGACTGCCGCGGCACGCCTGTGCCTTCCAACTGGCAGATGGAGGGCTATGACGCGCCGATCTACACCAACGTCCGCTATCCCATCGACACCACCCCACCGCGGGTGCCGGAGGATAACCCGACCGGCTGCTACTCCCTGCACTTTACGGTTGAGGACACATGGCGCGAGAACGGGCAAACGCAGATTATTTTCGATGGCGTCAACTCGGCGTTTCATCTGTGGTGCAATGGCGTGTGGGTCGGCTATTCGCAGGACAGTCGCCTGCCGGCGGCGTTCGATCTCAGCCCCTTTCTGCGTCCGGGCGACAACCGCCTGTGCGTGATGGTCATGCGCTGGAGCGCCGGCAGCTGGCTGGAAGACCAGGATATGTGGCGGATGAGCGGTATTTTCCGCTCGGTATGGCTGCTGAATAAGCCGCAGCAACGGCTATGCGACGTGCAGTTGACGCCAGCCCTTGACGCCCTCTATCGCGACGGCACTCTGCAGGTCCAGGCGACCGTCGAAGCGACTGAGGCGGCGCTTGCCGGGCTCAGCGTCGGGGTTAGCCTGTGGCGCGGCGAGGAGCAGATCGCCGCCGGGCGGCAGCCGTTAGGTACCCCGACGGTGGATGAGCGCGGCCACTACGCGGAACGGGTCGATTTCTCCCTGGCGGTGGCGACGCCGGCGCACTGGAGCGCGGAAACCCCGAACTGTTATCGCGCCGTGGTCACCCTGTGGCGCGGCGACGAACTGCTGGAGGCCGAAGCGTGGGACATCGGTTTTCGCCGCATCGAGATTGCCGATGGCCTGCTGCGTCTCAACGGTAAACCGCTGCTGATCCGCGGCGTTAACCGCCATGAGCATCATCATTTGCGCGGGCAGGTGGTCACCGAGGCGGATATGGTGCAGGACATTCTGTTGATGAAGCAGAACAACTTTAACGCCGTGCGCTGCTCGCACTATCCCAACGCGCCGCGCTGGTATGAACTCTGCAACCGCTACGGTCTGTACGTGGTCGATGAAGCCAATATTGAAACCCACGGGATGGTGCCGATGAATCGGCTGTCCGACGATCCGGCGTGGCTGCCAGCCTTCAGCGCCCGCGTCACCCGGATGGTACAGAGCAACCGCAACCATCCGTGCATTATCATCTGGTCGCTGGGCAACGAGTCCGGCGGTGGCGGCAACCACGAAGCGCTGTACCACTGGCTGAAACGCAACGATCCGAGCCGTCCGGTGCAGTACGAGGGCGGCGGCGCGGATACCACCGCCACCGATATTATCTGTCCGATGTACGCCCGCGTCGAACGCGACCAGCCGATCCCGGCGGTCCCCAAATGGGGGATCAAAAAATGGATCAGCCTGCCCGGTGAGCAGAGGCCGCTGATCCTTTGCGAGTACGCCCACGCGATGGGCAACAGCCTCGGCAACTTCGCCGATTACTGGCAGGCCTTTCGCGAGTACCCGCGGCTGCAGGGCGGGTTTATCTGGGACTGGGCCGACCAGGCGATCCGCAAAACCTTTGCCGACGGCAGCGTCGGCTGGGCCTATGGCGGCGACTTTGGTGATAAGCCTAACGATCGCCAGTTCTGTATGAACGGTCTGGTGTTTCCCGATCGCACGCCGCATCCGTCGCTGGTGGAGGCGAAGCACGCCCAGCAGTATTTTCAGTTCACGCTGCTGTCGAC
CTCGCCGCTGCGGGTGCGCATCATCAGCGAATACCTGTTCCGCCCAACCGATAACGAAGTGCTGCGCTGGCAGGTGCAGGCGGCCGGTGAACCCCTGTATCACGGCGACCTGACCCTGGCGCTGCCCCCTGAGGGCAGCGACGAGATAACGCTGCTCGATAGCCTGATCCTGCCTGAAGGCGCCCGCGCGGTGTGGCTGACGCTGGAGGTGACCCAGCCCCAGGCGACCGCCTGGTCAGAAGCGGAGCACCGCGTCGCCTGGCAACAGTTTCCCCTGCCCGCCCCGCTGGCGCTGCCGGCGCCCACCGTGTCTGCCGGCGCTCCGGATCTTATCGTCAGCGATGAGGTCTGGCAGATCCGCGCCGGTTCGCAATGCTGGACCATCGATCGCCGGACGGGTCTGCTGAGCCGCTGGTCGGTTGGCGGTCAGGAGCAGCTGTTGACTCCCCTGCGTGACCAGTTTATTCGCGCGCCGCTCGACAACGACATCGGGGTCAGCGAAGTAGAGCGTATCGACCCCAACGCCTGGGTGGAGCGCTGGAGAAGCGCCGGCCTGTACGATCTTGAGGCGCACTGCGTCCAGTGCGATGCGCAGCGCCTGGCAAATGAAACCCTCGTCGACTGCCGCTGGCACTACCTGCGCGGCGAAGAGGTAGTGATTGTCAGCCACTGGCGCATGCACTTCACTGCTGACGGAACCCTGCGGTTGGCAGTGGACGGCGAACGGGCGGAAACCCTGCCGCCGCTGCCGCGGGTCGGGCTGCACTTCCAGGTGGCGGATCAGCAGGCGCCGGTGAGCTGGCTGGGTCTGGGGCCGCATGAGAACTACCCCGACCGGCGGAGCAGCGCCTGCTTCGCCCGCTGGGAGCAGCCGCTGGCGGCGATGACCACCCCCTACATCTTCCCGACGGAAAACGGCCTGCGCTGTGATACCCAGGCGCTGGACTGGGGGCGCTGGCACATCAGCGGTCATTTCCACTTCTCCGTTCAGCCATGGAGCACCCGTCAGCTGATGGAGACCGACCACTGGCACAAGATGCAGGCCGAAGACGGCGTGTGGATCACCCTCGACGGCCTGCATATGGGGGTGGGAGGCGATGACTCCTGGACCCCCAGCGTGCTGCCGCAGTGGCTCCTGAGCCAAACGCGCTGGCATTACGAGGTCTCATTGCGTTGCCTTTAATCCGTGGGGGCGACAGCCCCCACCCCACAAACAGAATAACAAGGGATCATGGTGATGAAATTCTCAGAACTGGCGCCACGAGAACGGCATAATTTTGTCTATTTCCTGCTGTTCTTTTTCTTTTACTATTTCATTATGTCGGCCTACTTCCCGTTTTTTCCGGTGTGGCTGGCGGACGTTAACCATTTAACTAAAACGGAAACCGGGATCGTTTTCTCGTCTATCTCGTTATTCGCCATTATTTTTCAGCCGGTGTTCGGCCTGATGTCGGATAAGCTCGGCCTGCGCAAACATCTGCTGTGGACCATTACGGTATTGTTAATTCTGTTCGCGCCATTCTTTATTTTCGTTTTCTCTCCGCTGCTGCAGATGAATATTATCGCTGGTTCGCTGGTAGGCGGGATCTACCTGGGGATTGTTTTCTCCAGCGGCTCCGGGGCGGTGGAAGCTTATATCGAACGCGTCAGCCGGGCCAACCGCTTTGAATATGGCAAAGTGCGGGTCGCAGGCTGCGTCGGCTGGGCGCTATGCGCCTCGATAACCGGCGTGCTGTTCGGCATCGATCCCAATATCACCTTCTGGATCGCCTCCGGCTTTGCGCTGGTGCTCGGTCTGCTGCTCTGGCTGTCGCGGCCGGAAAGCAGCAACAGCGCTCAGGTTATCGAGGCGCTGGGCGCCAATCGCCAGGCCTTTTCGCTGCGTACGGCGGCAGAACTGCTGCGTATGCCGCGCTTCTGGGGCTTTATCGTCTATGTGGTGGGTGTCGCCAGCGTCTACGACGTGTTTGACCAGCAGTTCGCCAACTTTTTCAAAAGCT
TTTTCGCCAGTCCGCAGCGCGGTACCGAAGTGTTTGGCTTCGTCACCACCGGCGGCGAGTTGCTCAACGCGCTGATCATGTTCTGCGCGCCGGCCATCGTTAACCGCATCGGCGCCAAAAACGCCCTGCTGACCGCGGGGATGATCATGTCGGTGCGTATTCTCGGGTCGTCCTTCGCCTCATCGGCAGTCGAGGTGGTGATCCTCAAGATGCTGCATATGTTTGAGATCCCCTTCCTGCTGGTGGGAACTTTTAAATATATCTCTTCCGCCTTTAACCCGCGCCTCTCGGCCACCCTGTTCCTGATCGGTTTTAATCTGTCAAAACAGCTGTCCGGGGTGGTGCTTTCCGCCTGGGTGGGCCGGATGTACGACACCGTCGGCTTCCATCAGGCCTATCTGATCCTCGGCTGCATAACCCTGAGCTTCACCCTGCTCTCGTTCGTCACCCTACGCGGCGGCAACCGTCTGCTGCCGACCGCAGAGACGCAGAGCCCCGCCTGACTCCAGCGCCCCCGTCAGGGATGACGGGCTTCACTCGACAATCCGCGTCTCCCCGCCGCGATTCTCCAGCGCATAGCGCTGACAAGGCGCCCCGGCGGCGATAAGCTCCGCCAGCTCCGCCGAGTGGCTGGTCAGCCAGATCTGGCTGTAGCGCGAGGCCTCGATAATCAGCCGGGCCAGCGCGGGCAACATGTCGCGATGCAGGCTGTTCTCCGGTTCATTGATCGCCAGAAACGCCGGCGGGCGTGGACTGAGCAGCGCGACGGCGAGGCACAAAAAGCGCAGCGTACCGTCGGACATCTCCGCCGCCAGCAGCGGGCGGCGGATCCCTTCGCGACGCATTTTCAGAGCGAAACGCGAGTGCTCGTTTTCACAATAAAACTGGCAGCCGGGAAAGGCGTCGGCAAGGATTTCATGCAAAATCTCCTCCGCGCCGATCTCGACAATGGTCTGAAAGGCGGCGGCGAGATTCTGGCCATCGCTGTCGAGGACTGGCGAACGGTAACCCACGGCCGGCTGACGCAGCGGCGAATGACGGCCAATCGCGAATTCATGATAAAACCGCCAACGGCGCAAGGTCTCCCGCACTCGCGATACTTCGGGGAAACGGTGCGGCTCGCCAAGCTGACCGAACACCGACTCATTTTCATAAATACTCTCGGTGAAAGTACTTTTCTCCCCGGTGACATCGACGAGAAACGCCGCCTGATTCTTGCGCTGCAGTACGCGGGAGGAAGGCCGGCGGGAGTAGCCGGCCAGCCAGATATTCTCTTCTTTAACGATAGGATCGAGCATGAATTGCGTCGGATAGGGCAATTTTTCAGGGAAGCCTATCTGCAGCTCATAATCGAAGCTGTCGGTCCGGCAGGCGATTTGCAGCCGGCGGGGGTGACGATCGAGGGGCGAACGCTCCCCGGACCACATCATATTCTCCAGACCGCCCTCTTCGCTGATAAACCCCGAAAGCCTGCCCTCAGCGGCAGCGGTCAGCAGGTGGATGGCGTTATAAATATTGGATTTACCGCAGCCGTTAGGGCCGAAGACAATGTTCAGCGGCCCGAGCTCCAGCGCGATATCCTTCACTGAGCGAAAATTTTGAATACGAATGTACTGAATCATTATGCGTCCGGCCCTGGAAATAGAAGGCTGCCACAGTAGCACAGCGGCGCCCGGCATACCCTCTTACGCCCCCTTTCGGGAAGGGTTTAACAGAACATTTTTTTCATTCCACGGGTCAGGGCGACCCGCGTCAGCTTATCGCTATCCTGATGGGCGTCGCTGAAAACGCCAAAAAAATAGCGTTCATCGTCAATGGTTTGGCTAACAATTCGGTAGCTCAACGGTTGCCCGGATAGCGCACCGTACTGCGCAGGCGTCACCGAGACCTCGGTCGCGTCATCGGCGACGCTGACCAACAGCCCGTCCGCCTTACCGCCCACCAGCATGACCTTAATTATCGGGTGCACCAGTTCCTCCCGGGATGGCATATAGCGTTGG
CGAACCGCAGGGACCGGCGGGCGTGGTGATCGGCAAATAGCGACGCCCGACAGAGTGCGGTACTTAGTACTTTCTTATAGTTCATCACGGCCTTGAGTCAAAAAATAGCGTGCTTAGGCAGGGCTAGATATTGATTATTCGAAATAAAAGATGACAAATGATGAAGGAAAAAAGAGGAATTGTGAATCAGCAAAACGCCGGGTTATTCTTATTTGTCGCTTCTTTACTCGCCTTTATCGGCCCTCACTCAAGGATGTATTGTGGTTATGCGTTATATTCGCCTGTGTATTATCTCCCTGTTAGCCACCCTGCCGCTGGCGGTACACGCCAGCCCGCAGCCGCTTGAGCAAATTAAACAAAGCGAAAGCCAGCTGTCGGGCCGCGTAGGCATGATAGAAATGGATCTGGCCAGCGGCCGCACGCTGACCGCCTGGCGCGCCGATGAACGCTTTCCCATGATGAGCACCTTTAAAGTAGTGCTCTGCGGCGCAGTGCTGGCGCGGGTGGATGCCGGTGACGAACAGCTGGAGCGAAAGATCCACTATCGCCAGCAGGATCTGGTGGACTACTCGCCGGTCAGCGAAAAACACCTTGCCGACGGCATGACGGTCGGCGAACTCTGTGCCGCCGCCATTACCATGAGCGATAACAGCGCCGCCAATCTGCTGCTGGCCACCGTCGGCGGCCCCGCAGGATTGACTGCCTTTTTGCGCCAGATCGGCGACAACGTCACCCGCCTTGACCGCTGGGAAACGGAACTGAATGAGGCGCTTCCCGGCGACGCCCGCGACACCACTACCCCGGCCAGCATGGCCGCGACCCTGCGCAAGCTGCTGACCAGCCAGCGTCTGAGCGCCCGTTCGCAACGGCAGCTGCTGCAGTGGATGGTGGACGATCGGGTCGCCGGACCGTTGATCCGCTCCGTGCTGCCGGCGGGCTGGTTTATCGCCGATAAGACCGGAGCTGGCGAACGGGGTGCGCGCGGGATTGTCGCCCTGCTTGGCCCGAATAACAAAGCAGAGCGCATCGTGGTGATTTATCTGCGGGATACGCCGGCGAGCATGGCCGAGCGAAATCAGCAAATCGCCGGGATCGGCGCGGCGCTGATCGAGCACTGGCAACGCTAACCCGGCGGTGGCCGCGCGCGTTATCCGGCCCGCAGCACCTCGCAGGCGTGCCGGGCGATATGACTGGCGGCGGCATCGGAGAGATGCCGGTCGGTAATGATGGTGGTGAACCGGGTCAGAGGTAACGCCATGAACGTGGCCACCTGATTGTATTTCGAACTGTCGCACAGCAGGATGCTTCTCGCGCTGACCTGGCTGACGGTCTCCTTGACGGTAACCTTGTTCTCATCAGGGGTGAAAATCCCGCGACTGTCCCAGCCGCTGGCGGAGATAAAGGCCGTATCGATAGCCAGGTGGCGTAACGTACGCGCCGCCGATTCGCCCACGCAGGAGCGGTTCTCCCGGCACAGAGTGCCGCCGGTGTGGATCACGCCGCACTGGCTGGCATCGATCAGCAGCTGGGTTATCTCAAAATCATTGGTGACCACCTGGAGATCGTTCCGGTCGAGGATCGCCCGCGCCAGCGCCAGGGTGGTAGTCCCGGCATCCAGATAGATGCAACTGTTTTTAGCGATATGACTCGCCGCCAGCGCGCCGATCGCCTGTTTCTCCTCACTCTGCAGCGTGCTTTTCACCAGATGACTGGGTTCCGCAGCCAGCCGGCTGACGGCGCGCACGCCGCCCGAGACGCTGACCAGCAGCCCCTGCTCCTCCAGTTTACTAACGTCCCGACGGATGGTCATATGGGACACGCCTAGGATCTCCGTCAGCTCGTTAATGCTTACCGCCCCACGCTGCTCCACCAGGGCTAAAATACGCTGATGTCGTTCTATTGGAATCACCGCTCTCCCCTTACCATTTTTTCACACCAGGCGTCACCACCCAGGCTACGGCCCTGGCGACACCCGGTGTTGTTATCGCGCTTTGCCGCTG
TTTTTACGCTATTTTAGGGCAAGAATCGCCGTTGTGCAGCCTCTTTCCGCCTGTGAATTTTTTATATTCATGTGGGTTATTCGTGATAACTCTCACATATTTTCACATGATAACGCTTTATTCTATCATTAAATCACATTAATTAACTAATATTCACAAGGAGACCAGCATGGCTGCACACACTAATGTCTGCGTGATTGGACTGGGTTCAATGGGCATGGGCGCCGCCCGCGCCTGCCTGCAGGCGGGCCTGAACACCTGGGGCGTTGACATCAATCCCGACAACTGTCGCGCACTGCTGGCGGCGGGCGCCAAAGGCGCGGGCCCCAGCGCGGTGCCGTTCGCCGCGGAACTGGATGCAGTTGTGCTGCTGGTGGTCAATGCCGCCCAGGTGCGGGGGATCCTGTTCGGCGAGAGCGGCCTCGCCGCCCATCTGAAGCCGGGCACCGTCGTGATGGTGTCGTCCACCATCGCCTCCGCCGATGCTCAGGCCATTGCCGAGGCGCTGGCGGAGTACCAGCTATTGATGCTCGACGCGCCGGTATCGGGCGGCGCCGTGAAAGCGGCCGCCGGCGACATGACGGTGATGGCCTCCGGGAGCGATGCCGCCTTTGCCCGCCTCGCGCCGGTGCTGGACGCCGTGGCCGGCAAAGTCTACCGCATAGGGAGCGACATTGGTCTTGGCTCAACGGTAAAAATTATCCATCAGCTGCTGGCCGGGGTGCACATCGCCGTTGCCGCCGAAGCGATGGCGCTTGCCGCCCGCGCCGGGATCCCACTGGAGACGATGTATGACGTGGTCACCCACGCGGCGGGTAATTCCTGGATGTTTGAGAATCGCATGCAGCATGTCCTGGATGGCGATTACTCGCCAAAATCCGCTGTCGATATTTTTGTCAAAGATCTCGGGCTGGTGAATGACACTGCCCGGGCGCTGACCTTCCCGCTGCCGCTCGCTACCACCGCGCTGAATATGTTCACCTCCGCCAGTAATGCCGGATTCGGTCGGGAAGATGACAGCGCGGTGATCAAGATTTTCAACGGCATTACCCTGCCGGGCCATAAACAGTGAGGAGAGACAACATGCAGCTTGGTGTCATTGCCGATGACTTCACCGGCGCCACGGATATTGCCAGCTTCCTCGTGCGCAACGGCATGCCGACGGTGCAACTGAATGGCGTGCCGACCCGCGATATTCCGCTGACCAGCGAGGCGGTGGTCATCAGCCTGAAAACCCGCTCCTGCCCGGCGGAAATGGCCGTCAGCCAGTCGCTGGCGGCCCTGCGCTGGCTGCAGGCCCAGGGCTGTCAGCAGTTTTATTTCAAGTACTGTTCCACTTTCGACAGCACCGCGCAGGGCAACATTGGCCCGGTGCTGGATGCCCTGCTGGCCGAGCTGGGTGAGACGCGGACGGTGATTTCCCCGGCGCTGCCGGTTAACGGCCGCACGGTCTATCAGGGATATCTGTTCGTCGGCGAGCAACTGCTGAATGAGTCCGGGATGCGCCACCATCCGGTGACGCCGATGGAGGATGCGCACCTGGGCCGCTTAATTGAGCGCCAGGGGCGCGGAAAAGCCGCGCTGATTGCCTGGCCGATTGTCGCCCGGGGGCCGGAGGCGGTCGCCGCCGCGCTGGCGGCAGTCAACGATCCGGCGGTGCGCTATGTGGTGCTCGACGCCCTCAGCGAACAGGATCTGCTCACCCAGGGCGTGGCGCTGCGGGAGATGAAGCTGGTCTCCGGCGGTTCCGGCCTCGCCATCGGCCTCGCCCGCGACTGGGCGCAGCGCCATGGCGCCCGGGGTGAAAGCGCTCAGGCCGGCATGCCGCTGGCCGGCCCGGCGGTGGTGCTCTCGGGCTCCTGCTCGGTGATGACCAACAGCCAGGTGGCGGCCTATCGTCAACATGCCCCCGCCCGCGCCGTCGACTTAAGCGCCTGCTTTACCGATCTGGAGAGCTACGTCAGGACGCTGACTGACTGGGTGGACGCGCAGCGCGATGC
GCCGCTGGCGCCGATGATCTATGCCACCACCGAGCCGCAAACGCTGCAGCGGATCCAGGCGCAGTATGGCGACAAGGCCAGCAGCGAACGGGTGGAACAGTTGTTTGCCGCTCTTGCCGCCGCCCTGAAGACGAAAGGATTTACCCGCTTTATTGTGGCCGGAGGGGAAACGTCGAGCATTGTGGCGCAGACCCTGGGGGTTGAGGCGTTCCATATTGGGCCGACCATCTCCCCTGGCGTGCCCTGGGTGCGTGACACCCGCCAGCCGCTCTCCCTGGCGCTGAAGTCGGGTAACTTCGGCGATATCCAGTTCTTTGCCCGTGCCCAGCAGGAGTTTCGTCATGACTGAGCAACAACTGCGAGAGGAAATGGTACAGATTGGCGCCTCGTTGTTTAGCCGCGGCTATGCCACCGGCTCCGCTGGCAATCTGTCGTTGCTGCTGCCGGACGGCAACCTGCTGGCGACGCCGACCGGCGCCTGCCTCGGCGAACTGCAGGCTCAGCGGTTGTCGGTGGTGACGCTGCAAGGGGAATGGATCTCCGGCGATAAACCATCGAAAGAGGTCACTTTTCACCGGGCGGTCTATTTGCACAACCCGGCCTGCAAGGCGATCGTCCACTTGCACAGCCACTATCTGACCGCGCTCTCCTGCCTGCAGGGGCTCGACCCGCACAACTGTATCCGCCCCTTTACCCCCTATGTGGTGATGCGCGTCGGCGACGTCCCGGTGGTTCCCTACTACCGGCCGGGCGATGACCGTATTGCCCAGGCGCTGGCCGGGCTGGCGCCGCGCTATAACGCCTTTTTACTGGCCAACCACGGACCGGTGGTCACTGGCTCATCGCTGCGCGAAGCCACCAACAATACCGAGGAACTGGAAGAGACCGCACGGCTGATATTTACCCTCGGCAACCGCGAGATCCGCTACCTGACCGCTGACGAAGTAAAAGAACTGAGATAA
Protein sequences of DBSCAN-SWA_3 >LR133964|2628946:2639833|2632108_2633374_+|VDY60363.1|DBSCAN-SWA MKFSELAPRERHNFVYFLLFFFFYYFIMSAYFPFFPVWLADVNHLTKTETGIVFSSISLFAIIFQPVFGLMSDKLGLRKHLLWTITVLLILFAPFFIFVFSPLLQMNIIAGSLVGGIYLGIVFSSGSGAVEAYIERVSRANRFEYGKVRVAGCVGWALCASITGVLFGIDPNITFWIASGFALVLGLLLWLSRPESSNSAQVIEALGANRQAFSLRTAAELLRMPRFWGFIVYVVGVASVYDVFDQQFANFFKSFFASPQRGTEVFGFVTTGGELLNALIMFCAPAIVNRIGAKNALLTAGMIMSVRILGSSFASSAVEVVILKMLHMFEIPFLLVGTFKYISSAFNPRLSATLFLIGFNLSKQLSGVVLSAWVGRMYDTVGFHQAYLILGCITLSFTLLSFVTLRGGNRLLPTAETQSPA >LR133964|2628946:2639833|2636018_2636780_-|VDY60367.1|DBSCAN-SWA MIPIERHQRILALVEQRGAVSINELTEILGVSHMTIRRDVSKLEEQGLLVSVSGGVRAVSRLAAEPSHLVKSTLQSEEKQAIGALAASHIAKNSCIYLDAGTTTLALARAILDRNDLQVVTNDFEITQLLIDASQCGVIHTGGTLCRENRSCVGESAARTLRHLAIDTAFISASGWDSRGIFTPDENKVTVKETVSQVSARSILLCDSSKYNQVATFMALPLTRFTTIITDRHLSDAAASHIARHACEVLRAG >LR133964|2628946:2639833|2628946_2632054_+|VDY60362.1|DBSCAN-SWA MQISDTGRSHTPDFHAVLAREDWQNQTITHLNRLPAHPVFASWRDELAARDNLPSSRRRQLDGEWQFSYARSPFAVDAQWLTQDLPDCRGTPVPSNWQMEGYDAPIYTNVRYPIDTTPPRVPEDNPTGCYSLHFTVEDTWRENGQTQIIFDGVNSAFHLWCNGVWVGYSQDSRLPAAFDLSPFLRPGDNRLCVMVMRWSAGSWLEDQDMWRMSGIFRSVWLLNKPQQRLCDVQLTPALDALYRDGTLQVQATVEATEAALAGLSVGVSLWRGEEQIAAGRQPLGTPTVDERGHYAERVDFSLAVATPAHWSAETPNCYRAVVTLWRGDELLEAEAWDIGFRRIEIADGLLRLNGKPLLIRGVNRHEHHHLRGQVVTEADMVQDILLMKQNNFNAVRCSHYPNAPRWYELCNRYGLYVVDEANIETHGMVPMNRLSDDPAWLPAFSARVTRMVQSNRNHPCIIIWSLGNESGGGGNHEALYHWLKRNDPSRPVQYEGGGADTTATDIICPMYARVERDQPIPAVPKWGIKKWISLPGEQRPLILCEYAHAMGNSLGNFADYWQAFREYPRLQGGFIWDWADQAIRKTFADGSVGWAYGGDFGDKPNDRQFCMNGLVFPDRTPHPSLVEAKHAQQYFQFTLLSTSPLRVRIISEYLFRPTDNEVLRWQVQAAGEPLYHGDLTLALPPEGSDEITLLDSLILPEGARAVWLTLEVTQPQATAWSEAEHRVAWQQFPLPAPLALPAPTVSAGAPDLIVSDEVWQIRAGSQCWTIDRRTGLLSRWSVGGQEQLLTPLRDQFIRAPLDNDIGVSEVERIDPNAWVERWRSAGLYDLEAHCVQCDAQRLANETLVDCRWHYLRGEEVVIVSHWRMHFTADGTLRLAVDGERAETLPPLPRVGLHFQVADQQAPVSWLGLGPHENYPDRRSSACFARWEQPLAAMTTPYIFPTENGLRCDTQALDWGRWHISGHFHFSVQPWSTRQLMETDHWHKMQAEDGVWITLDGLHMGVGGDDSWTPSVLPQWLLSQTRWHYEVSLRCL 
>LR133964|2628946:2639833|2637954_2639220_+|VDY60369.1|DBSCAN-SWA MQLGVIADDFTGATDIASFLVRNGMPTVQLNGVPTRDIPLTSEAVVISLKTRSCPAEMAVSQSLAALRWLQAQGCQQFYFKYCSTFDSTAQGNIGPVLDALLAELGETRTVISPALPVNGRTVYQGYLFVGEQLLNESGMRHHPVTPMEDAHLGRLIERQGRGKAALIAWPIVARGPEAVAAALAAVNDPAVRYVVLDALSEQDLLTQGVALREMKLVSGGSGLAIGLARDWAQRHGARGESAQAGMPLAGPAVVLSGSCSVMTNSQVAAYRQHAPARAVDLSACFTDLESYVRTLTDWVDAQRDAPLAPMIYATTEPQTLQRIQAQYGDKASSERVEQLFAALAAALKTKGFTRFIVAGGETSSIVAQTLGVEAFHIGPTISPGVPWVRDTRQPLSLALKSGNFGDIQFFARAQQEFRHD >LR133964|2628946:2639833|2633404_2634493_-|VDY60364.1|DBSCAN-SWA MIQYIRIQNFRSVKDIALELGPLNIVFGPNGCGKSNIYNAIHLLTAAAEGRLSGFISEEGGLENMMWSGERSPLDRHPRRLQIACRTDSFDYELQIGFPEKLPYPTQFMLDPIVKEENIWLAGYSRRPSSRVLQRKNQAAFLVDVTGEKSTFTESIYENESVFGQLGEPHRFPEVSRVRETLRRWRFYHEFAIGRHSPLRQPAVGYRSPVLDSDGQNLAAAFQTIVEIGAEEILHEILADAFPGCQFYCENEHSRFALKMRREGIRRPLLAAEMSDGTLRFLCLAVALLSPRPPAFLAINEPENSLHRDMLPALARLIIEASRYSQIWLTSHSAELAELIAAGAPCQRYALENRGGETRIVE >LR133964|2628946:2639833|2639212_2639833_+|VDY60370.1|DBSCAN-SWA MTEQQLREEMVQIGASLFSRGYATGSAGNLSLLLPDGNLLATPTGACLGELQAQRLSVVTLQGEWISGDKPSKEVTFHRAVYLHNPACKAIVHLHSHYLTALSCLQGLDPHNCIRPFTPYVVMRVGDVPVVPYYRPGDDRIAQALAGLAPRYNAFLLANHGPVVTGSSLREATNNTEELEETARLIFTLGNREIRYLTADEVKELR >LR133964|2628946:2639833|2637040_2637943_+|VDY60368.1|DBSCAN-SWA MAAHTNVCVIGLGSMGMGAARACLQAGLNTWGVDINPDNCRALLAAGAKGAGPSAVPFAAELDAVVLLVVNAAQVRGILFGESGLAAHLKPGTVVMVSSTIASADAQAIAEALAEYQLLMLDAPVSGGAVKAAAGDMTVMASGSDAAFARLAPVLDAVAGKVYRIGSDIGLGSTVKIIHQLLAGVHIAVAAEAMALAARAGIPLETMYDVVTHAAGNSWMFENRMQHVLDGDYSPKSAVDIFVKDLGLVNDTARALTFPLPLATTALNMFTSASNAGFGREDDSAVIKIFNGITLPGHKQ >LR133964|2628946:2639833|2635137_2635998_+|VDY60366.1|DBSCAN-SWA MRYIRLCIISLLATLPLAVHASPQPLEQIKQSESQLSGRVGMIEMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGELCAAAITMSDNSAANLLLATVGGPAGLTAFLRQIGDNVTRLDRWETELNEALPGDARDTTTPASMAATLRKLLTSQRLSARSQRQLLQWMVDDRVAGPLIRSVLPAGWFIADKTGAGERGARGIVALLGPNNKAERIVVIYLRDTPASMAERNQQIAGIGAALIEHWQR >LR133964|2628946:2639833|2634579_2634840_-|VDY60365.1|DBSCAN-SWA 
MHPIIKVMLVGGKADGLLVSVADDATEVSVTPAQYGALSGQPLSYRIVSQTIDDERYFFGVFSDAHQDSDKLTRVALTRGMKKMFC |
9 | Escherichia_phage(87.5%) | NA | NA |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
DBSCAN-SWA_4 |
2884031 : 2925584
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >LR133964|2884031:2925584|DBSCAN-SWA CTTAAGACTGAATCTCCTTTTTAAGTGCCGTTTCTGCATCCTCAAAAATAGTTACCAGGTCAGTAAACTCAAAGGAATAGTAGAGCTGACCTGTAGCATCCAGCCCCTCAACCTGCGTATCAAACAGCACCGTCGCGGTAGTCCCGTTGATGCTGTCGACCCCTTTGGCAGTATATGTCACGGCTACAGGAGCAGCACTGAGCGGCTCAACGAGATTAATGTATGGGATGGTACGAAACGACTTCAAATTTTGCGTGAGGGTAAATGACATTTTAGACTCCTGCTACCCGTGAAACTTCGACCCAGCCCTGACGGAACGATGTGTTCATCGCCACCAGGGTTATCGTACATGCTGCCGTTACCGCACCACCCGGAAGGAAGATATTGCCGCCAGTCATCAGGCAATTTGATGTGTCGGCACACCTGACAATCATTCGTTTGCCTGTGTGGCAGTATCCGATATTGGCTACATTTCCTGAGGCGTTCCATGTCCAGGACTCATCATAACCAAGAGGGTCTGTACCCTCAGAGAGCGTCAGTGCTGAAGACTGAATTGATTTACCCCTCACACTGATTATTGACTGCCGGCGCGATGCGGTGGTCATTGCTGGATTGACATAAACAGTTCCAACAACGCTGCCGTTATATTTATCAACGTTTACGTCTGCCTTTACTGATGACGCGGCAACATTAAACAGTGAGGTCTGATAATCGCTGGTAACAGCACCTAACAGCGTCAGATTTTCAATATCAATTTTGCAGTTCCAGTTGATGTACATGAAGGTATTTGGTCGCGCCAGGCGAATAGCCTTTGCCCGGAAAATAACCTCGCCCTCAAAATCAAACGGTGAGAGCTGATTCGCCCCGTTATATCCGTACCCGTCCACATGCATCATGTCAATATCAGCGACCAGCTGAACGCACGCAGTCGCACTCGCCCAGTTTGTACGGTGGATGCGGAACCACCTCAGACCGGTATATCCACCTAAATCCGACGGGCTCATAAATGAGCCTCCGCGAATTGATACGTGGCGGCCATTGCTGTCCAGCCAGTAAACTGTGATATGGGCCGGGTGGCTTACGAAGATATTTTCAAATTTGACACTCTCTGAGGTGGACCCGTAAATCTGGTTAACAGGAACAAGGTTGTTTCTGCTGTTATTTTCAAATTTACACATTCCGCCAAAGAGAATGTTGTTTGACCCACCACGAATATAAGAGACGCCTGAATAGTTGTTTTCAAAATGGCAGCTCATGTATCTTATTGCATTACAGTTATCATACTGATTCAGCATGTTAATCTGATGCGTTACTTCTGAGCTTGCAGTGAGGAACCGGCCTCCGGAGTTGATCCTCACCTGATTAAATACCCCGTCCATGACATCCGTCATATCGAATGACGAAAGTGGAGGCGACCAGACCTGCACGTTCTCTACAGCAAAATCCCAGCCGACGCCGTAGAATTTAAAGATTCTTTTACGAATCCAGCTCGCAGTTTCCCCAAGCACTGAGAAGTTACTCAGATAAAGTTCGGATATTCTGACCCTTCCTGTTGAATCCCAGCCGGCTGAGCTGAAGTCAAACAGATAATCATCATCTGAAGCTGCGCTGGGATGTATAGCAAAGATGGTCTGGAATATCCCCTCGCCGCACATTGCGAAGGGGCCAAAAGAAAGTTCGACTTTCGTTTTGATGTGCAGCAGACCTGCGGGGAGATTGATTCTGCGTCTGGGAATATGCTCTACGCTGGTTGAACCTAAAGAACTCATAGCGGCAATGGCGCGGTTAAGCCCAAGACCATAATCAACAAAACCATCTACAACATCCGTGGCGTATACAAAATCCAGAAGGTTAATATTGTCCCAGTTTTTGTCGTGCTGAGTCCTGGCTACAGCCCCTGTGTATGGTTGTTTGACAGCAAGTAAC
GCATCCCCCACTCCCTCTTCGCCTGAACCCAGGTTTGAGCGAAGAGCCGCGTCACCGATGTTCGACCATTTCCCTGTGGGGTTTGCAGCCGACCACACACCGCCATCGTTCTCAGGAGAATCTCCGGCAATGACGTGTTCAAGCTCACCAAGGTATTTGTACCAAGAGCCATTGTAGTAGACGATCTGCTGGCGATTATCTACAGCCAGACCAACAGCCCAGTTGCCAAGCTCCTGCCAGCCGATAGCTGCAACTGCCTGCTCGCCGCGACCAGTGATGTAGCCTATAAAGCGGCTGAAGATCATCTCCATGCCGTGCCAGGTTTTGCGCAGTACACCAAAGCGATCAGGTAATGATTCCGATTCCCGGCCATTGACTAACTGATCGAGGTTAGTTGCATTCTTAAGCAATACCTCTGGTGATGACGAGCCAAGTTTTTCGATGTTGGCCATACATTGTGCTCCAAAAATGAAAAAACCCGCCGAAGCGGGTGAATTGATTTTTTTGAAATTGCTAAGCTACGTCGCCAGGGTATGTAGCGTCGTCGTAGGCATAGAACGATTCGAGGTATTCTTTAGTGGTAACCTGGCATGTTCCGTCAGACTGCGGGGCGATCTCCTCTACAATGGCGTCGTAGACGTGGCGCGTTGAGCCGCAGAACACCAGGCGGATCGACTCGATGGTTGTCGACGACAGGTCAACCTTCATCGGGTCATCAAACTCGCTCAGGTGCGGGACTGACAGCTGAAAATCACCCACCCTGCTCGCCACCATCAGCCCGGATGCAGAGCCATCCTGATAGCGGATCAGCGCTCGGGGGTTTTCGAAAGACCAGTCCAGCGGCTCCGTAACGGTGAACGTTGTCACGCCACCAGCCGTTGTCATCGCCTCCACCAGGCAGGAAATCGTGTTGTTACCCGGAATATCATCCGTGAGCACAATGCGATCGCCCGTGTTGTAGCACAGCGCGTCCAGCTCGGTGGTGGTCTGGAACGTCACCCGCTGCAGCAGGTATTTCATCAGGCGACGCATGCCGATCTGATAGGCGTGGTCCTGATTGAGTACCCCATCGAGTTTATAATTCTCGATTTTCACTGGCGTCGGATTGTCCGGCGTCCGGCATTTAACGGTCTCCTCTGCCCAGGTAGCCCCGTTGATGTACGTCACGTCGACGCCATCAAAATCATCGTCGGACGGTACGGTAAATCCGCTCTGCAGCTCCTCCACCATCTCATGCGGAGTGATCATGCCAGTCCAGGGCTTAATCCCCTCGCGGTTGACCGTCGCCAGGCCATCGCTTAACAGGAAGCGGGACTTGCCAGCATTGGCTATCTTCTGCAGCATTTCCAGCGCTGAGATACTGTCGCCCGTGGCGAAATCGAAATACTCGCCCCGTGGCGTCCAGTACGCGTACTCCAGCGCGTTGATGGTGTCGGTATCCATCTCCAGGCCCAGTGAGTTCCCTACATGCAGCAGCGCTCCCGAAATGGTTCTGGCCGTTCCTGAGTCGTAGGCCCGCGTGGCCACAACGTTTACGCGGCGGTCCGACTGCGCCGCCAGCTTCCCGCCCGTCTCAACGGTCACCGCCATCAGCGACACGCCGGGATAGGATGAAGGGCGCGTCAGCAGTCGCCCGCGCAGTGCCTGCCAGTACATTGAATCCCTGGCGTTGTTTGAGCCCTGCTCATTGCGCCGGCGACAGCGAACCTCTACCAGTCCCGGAGAACTGAGGGTGATCCGCTCAGTGAAACCTAACCCGTTGATGTTTTTAAGCGCGTACTCTCCCTGGTGACTCACCCACCCCGATCCGGAACCATAGACGCGATACTGAATCTCCCACTCAACGTGGCGAATCCGTTTTTTGCCCTTACTGTCAAAGCCACAGATGCCGTTCGGGAAGGAGAAATTCACCTCGAATGCATCCACCACTTCATTCTCAGGGCAAACCAGGAACGGCCCCAGCCAGCTCAGCGTGTCGTTAAGACCAGTGGCCTCATAGT
CGATCATCGTCCGGGCGGAGAATCCCGGCCATGACTCATCAACGGCACCGGAAACCAGGCGCGCCACCGTCGCCGTCGTGCCGTCGGCAGAGACAATGCGGTACTCATTCCCGCGGTGAGCAAGTGAAAGCCGTTGCACCCCCTCCGGCATGCCGGAAAAGGCCGTTCCCGTGGCAGAGTTATAGGCGAGTGTCACATTCGCCGTTACCGCCGGGCTGCCGCCGGTTGATGCCGTGCCGGAGGTGTAAACCGGGGCATCACCGAAAACAGCTGCAGGCAGCGAAGAGGACGTGATCGCCCCACCCGCGAACGGACTGGCCGACTCGGTTATCAGTACAGTTCCGCCGTTGTCCTGCGCAACCAGGCCGGAGCCGGTGAGTCCCTCGGTGATGGCCGCCAGCAGTCCCGACATCGAGACGTAGTTAGCCACCAGCGACACCGGGTAGGAAACCCCCTGCCAGGTGATCGTGAACGTGCTGGAGCTGGTCGAAAAATCGTAGGTGGTCGGGGCCGCACTGGCCTGGACTTTTGCCGCACTCCCCCCGGTGCCGGGCACTGCAGCCTGACCGGGGGTATATGACGCGATAAACAGATCGTAATCGACAGAGTTAAACCCCAGCGTCACCGGCATACCAACCATCGGCGCGATCTCCGTCAGCAGCGGGCTGGCGATAACGCTGTATCCACCCGCCGAAGTGATCTGGTAGTTAGCCGGGGCTTTCAGTTCGACCACGGCGCCGGCGACCCAGCTGGGCGGCAGCGCGTTATCGTTCTCGTCATTATCGTCATCATCATCCGTATCCAGCCCGGTAAACGTCACGCTCGATCCGGAGACGGTCATGCTGTCTGCGATAATGTCGTCTGCATCCGGCGACGTCTGGGCCATATCCAGTCCGGTACCGGATGATGTCCCGCCGACCTCCGTACTGTTGACCCAGTTTTCGCTGCGATCATCACCGGAAACGTCCGCGCCTGGCGGGTAATGGGTGCTGCTGAATCCCGGTAGCGTTGAAGCTGGCGTACTGCCAACCCGGATATCGCCATTGGTATAAATCAGATCACCGACACCGAGACACAGCAGCATCTGGACGCGCATTTTCGTAGGATCGGCGGCATCAAACCGGGTAACGGGCTGCACGACATAATCCGGATAAATACGCACGCGCCCAAAAACTTCACGAATGGCATCACCGAGTTTTGCGTTATTCGCCTTTGCCGGGTTCACGTCGAGACTCCGCCCTGTGGATGAGGTATAGCCGCCCGTATCGATGGTGCTCATCATAAACAGCGAATAGGCTGCAGCGGCAACGGAGATACCGACACCTATCCACGCAATGGTCGCGGCCTCCAGCCCGAAGGGGACCGGATAAAGCCGGACATCACTATCAGGGCGGATCACACAAGTAGCCCAGTCGCCTGGCGGAATTGACTGCCCCTCAACCTCAACGGTCAGCGGAGGGACATCCCGGTCGGTATAGTTTTCGACATTGGCAACCAGCCAGCTGCGAATGCTCGTAACACCATGCTCATGCGTTTCAAGTGGTTCTCCGGGAAGCCGGGAAGGATAAAAACGAATGGTCATTGCCAGAACTCCACGCGAACAAAGCGCCGCTTAAATCGCGGCAACGGCAGAAAGGTGACGTTCGTACCCGGGTTGCATTCCGCCACATGCAACAGACCACCGATACTGACCACGATCCCTACGTGGGTGACAGTCGACCCGGAATAACAGGCCACCCCGGCCCCTTCGCAGGGTTCGCAGCGCTCAAGGGTAAGCATCATCCGGCGCGCTTCCCGGTCGAGGCCGCCGTCGTCTTTGGTAACCCCTGCAAAATCGGGCCATTCGGGTAAATTCAGGTCGCGGCGTATCTCGTTCACAATGCCGAAACAGTCGAGTTGCGGGTATACGCGCCCGCCCTTCAGCCAGGTGACTGAACGATATTTATCAGGGTTGAACATTGGGATTCCTTAGCTGATATAACGCAGTCCGGGGAATACAG
GTAGCGTGTAGCGGTAACGTGGCCAGGCGGTATCGAGGATATTCATGTAGCCCGCAGTGATCTGCACCTCTGTCGCCGTCCAGGAGCCCGACTTGATTTTCAGCGTATACGGCACTTCCGCAGGGGCCGCTAAATCCGTGGAGATATAACGCCGGTACGTCAGAAATGCAGACAGACGGTTAGCCAGCGCATTGCGGATCGCCGTGGACACAACACCATCGATATTGCACAAGGCAAATTTGAGGTCCTGCGTGCCGTCCGCATTGCGCGCCGGCAGCGCAATGTCTATCGCACAGGCTGAAAACGTTACGGTATCGCCGTTCTCCGTCGTCGCCGTGATGTTGTCGTAACCCTGGCAAAGGTAGTGAACATCAGAGCCAATGGTGATCTGCAGCGTTTCAATGATCACCTCCGGTCCGCTGCTGGCGTAGAGGCGGTTGAGTCTTGTCATGATTTTTACCCAATAAAAAAGGCCACCCTAAGGTGACCTTAAAAATTGGTGTCGAATGTGGGTGTACCCTCACCGGCAGGATCGCTATTCCGCGCTTTATTTCACGCTCCGGCTACGGAGCGGCATGAAGGACTTTCCCACAAATCGACACAAGTGATTATGAAGGTGAAACGGTTTTAATCAAGCCTTGGGCCACTCCTTATTCAGCGCAATATCCAGCAGTGAGCTGCCGACGATCCATTCCGGGTAATTACCCCATGGGGCAGGAGCAAGGGGGCGTTCCCATAACTCAAGCGTCGCCGTGTACTTCCAGTAAATCGGGGCCACCAGCACCGGTCCCTGATATATATCTGTGAAGCGGCATTTGTAAAACTTAATGCCTGCCGGCGTCTGCAGCTTCATCATGAACCATGCAGCCCCGTCAGATAACGCATCACGGAACCAGGACTCAAACGCCAGTCCCTTCGCATCGGTTTCCATAAACCAGGTGATGCTGGCCTGCGTCGGTGTGGACGTATAAGCTCGCCTTTGCCGCGCGCGGCCGGTGGTTAACTGGGTTCGTTTTAACGGGCTTACAGGCTGGAATCCGTATCCTTCCTGTAATGGCATTGGAAGACTGTCATGCGGGTAGTAGATATCAGTCATCACTCTAACCCTCTGCCTGGATATTTACTGCGCATTGCCTTACCAACTTTCCCATCTCCTCTCAACACTTGCGCAGCAACCTGATCAAGGGCTTCCGTTGTCGCCCGCTTCTGCGTTTGAGCCATGGAGAGAGCCATCTGATCAGGTGTCACACCGGGCGGCGTATGGAAATGCTGCTCAATGGGAGCATGGATGGTGGTCTTGCTGCTGTTGTCGCTGTTAACGTTCTGAACACCAGTACCAAACCCTGTACGCCCCAGAGTTGCATCAAGCGGTTGGCCATTTCGAAGTGCCTCAAGCTGAGACACGCCGATCCGGTTCGTTGATGCCTGGTCGAAGACGTACTCTCCTTTGTGAACAATACCCGCTGGCTGATACTTACCACCGGGGCCGGTGTAACCGCCGGAGGCGAAGCCAACGCCTGAAACAGCCTGAATATTTGAGACGATACTGGCGGTCTGCGCAGCGATTGAGGCCATAGCGATGATGTTGGCCGGATAAGGCGCGCTTACTGCACCGCTTGCTATAGCCTGCTGGATTTTCACCATCGAGTCCGCGATAGCGAATGCCTTGCTCGCAGCAAAAGCGACCTTGTAGATTGCCGATTGCTCACCAAACCCCGTTCGCATGATGTCGGCGGTACTGTCAAACAAGGACTGCGTGGCCGCAGATATGATGGTGTTTTTCTGAGCCTCTATGACCTGATTTGCATCCGCCGCACGCTGACGAATAGAGGTCATTCTGGCCTCACCCTCGGCAGTTATTTCACCGGCCTTCGCATAAGCTTCCTCCTGAGCTGCCAGCCAGCGCTGGAGCTCTTGCTGAGCCTGGTCATATTCGTTGATTTGCCCCTGCATCCCCTCAAAAGTTCCAGAGAGTCGCCCTCCTGTGGGTGTCAGGTTTC
CTACAACATTACGAACCGTCGCGGGCAGTTGCATATCGGTGTTTTGATAAATATCTGCCCGTGTTTTTTCATATTCACCGGGTTTTAGTTGCCCGGTTGCTTTGGCTTTCTCCAGCAGTTCAAGACGGGTTTTAAGCAGATCGTTGGTCCGCTCATCCTTCGTCTTTACCTGTTCCTGCATTTTCCGGTAATCATCCAGGGTTTTTACGGAGTTTTGCAGTGCCTCCTGCTGCTTATACGCCTGGAGGATTTCATCTGAACGGGAAAGGATCGATTTCTGGTCAGCGGTGAGCTGCGTTTTAGACTTGAGGTCAGTAATTTGCTGTTCGAACTTAACACGTGCCTGGGTTGCGCTGTTAAGCTTGTCACTGGCATCCAGTTGTGACTGCAAGGCAGCTGTCTGCTGGTTTATTTGATCAAGCAACCTGGTTGCTGCGTCCTCGGTATATGCTTTACCCTTTGGCGTCTTGGGTGGTTTCGGATCTTTGTACATCTCGTTAATGCGAGAAACATTTTTTGCATATTGCTCTGCAGTAATTGCACCAGCCTTCAGGAATTCGCTTTGCTGCTTAATAGCTTTATTGCGCTTATCCGCATTGCTCAGATATTGCTGGTTAACGCGATCTGCTTCCTGCTGCGTTTTAATTCTTTGCTGTTCGGCTTCCTTAGCCTTCGCCTGTCCTTTGGTTACATCCCCCTGAAGATTGGCAACTGATTCGAGCAAATCTCTCTGTTTTATCATCTCCGGGAGGTTGGTAAACCTCGCGCTAAAACTGTTCCAGAACCCACCATCTTTTTGCCCTTTTTGGGCTTCAGCAATATTTTCGTTTAAGGTGGCAAGTTTATCCGTTAGTGTTTGTTCACGCCCAATATTGAGCATCGCATCCCAGGCGCCTTTGGCCGTTTTACCCAGCGAGTCCCATGCACTTTCAAGAAGACCAAGATTCTGATGAATATCATTCGCACGCTGCTGCATGGCATTGGCGTAAGCATCAGTAGCCACCCGTGCAGCATCCTGCTGATTACCTTCATCCTGTAGCGCTTTAATCTGGTTGTAGGTTGCCAGTGTCAGAAAGTGGTACTGGTCGTTAAGTTTGGTAATGGCCGCAACCGGGTCAGCAGTAATGTCGTTGAAATCACCAACCAGCTTATCGGTAGCAATGCCCGTCGCCTCGCTGGTCTTAACAATGGCGGTTGTCACGCGCTCCAATGAGTCGCCAGCTACTTTACCGGATAACACCAACTGATTCAGCGTTGAAGCTGCTGCACCGGTTGTGGAGTTAGCTGCGACCGATACACGGGCCGCCATATCTGCCAGTTGACCGGAAGTTTTGCCTACCAGATTACCAGTGAGAACGAGAGACTTATAAAATTCGTCCTGCTCCTGAGTGCCTTTGTAATAGGCCAGACCAAGAAATCCGACCGCCGCAGCTGCAAGAGTTAAAGGGTTAACCAACCCCATAACATAGGTGCCCACACCCTTAATTGCCGGACCAATACCACCAAACATATCTTTTAACTGCCCGCCCTGCTGCATCAGCACCATAAACGGAGACTGACCGGTGGATAAGCCGACAACAATATCTGTCATCTGAGCCGGGATCATGCGCATGGCATAGGCGGTCTGGGCGGCGGATTGGCCGGTTTTACCAAGGTCGTCGCGAAATCCTGTTAGCCTGTTTCGTGTTTCCTCGATTTTCTTTGAATAAAGATCGAATGTATCGGTATCTACCATCCCCTTGGATTTGAATTTCGCAAGATCCTGCTGTTGTTTATCCAGTTTGTTCAGGGCGGCGTTTACCGGGTCGATACGATCTAAAAGTTCAGAAAGGGACTGTTTTTCTTCATCAGTGGCCTTTGTCACTTTCCCTGCACTGGTGGCAGCACGTTCACCTGCCTGCGTCATTTTTACAAGTGCAGTTGCGAGATTGTCAGCCTGCTTTTCTGCCCCAGAGCTGTCAATAATAATGGCCAGGCGGGAGGTTTGTTCTGTCACGTGC
TTTTCTCCGGGCAATAAAAAACCCCGCCAAAGCGAGGTTGGAACTTTTTGAAACTGTCGGGTCTTTACTTCATTGGCGGTAAAACATTATTGCTACGATAATCACCGCAAAGACAGTAATTGCAATTCTAGCGATTAACTTTACATTGACATCAGCCAGCCTATCACTAGCTCCAGTATTGTCAGTGTTAGCTATTATCTTCGAAGGAGTTACATCACTCCCGCAATGCTTGCACTTCACCGCTTCGGAATTTATTAATTCTGCGCAGTAAGGGCATTTGACTGAAGTTCCGGACGCTTTTAGCTTATCTCCCACCAGAGCAATAATGATACCTGCGATGGCTACGAAACCTCCAAATATCATATAATTTTGGCGCGATGATATTAATCCAAGATTGTTAACCCTATAGCCACCGCTTGTCGCTACTGTCACATCCATAAATAGCGCCGATACAGCAAAGATCACCCCTATTACAATCGCTAAGTATCCAATAATCTTCACTTGTCTACCCCATAAATTAAAAAGCCACCAGATGGTGGCTTTATCATTCAGCTTGCGTTCTCACAACCCGGCAGGCTGCGGTCAATCACAAGATTACCCTCAACACGCAGACCAATCTTACCGAACAGGAAGGAGTGGTTAAGTTGAGTGACAACTACGTCAGACAGACCAACTGCACAGCGATCTTTTTCAATCGCTCGATCAGCGGCTGTTTTAACGTTCGGGATGCCAAGAGGGAAGATGATAACCGGATAGCTATCTTCTGCTGTTACACGTTTCCCTTTATAGAACTTACCCCCATTGAGGTTGTAATTTTTAGTACTCGCCACAGTCAAATCTGCAACACGTACTGTACAACCAGAAAGTAACAGCGCTCCAAGCGCCAAAGCGATGACTTTTTTCATTATATGTTTCCTTTGATTGCAATCGGAAACATCCTATCATCGACTTTCAGGAGCATGGACCACCATTAATGGTAGGTCAGTTGCTTCCTTTCTTATCCGCTGCACGTTTCTGTGCCTCTGCCCACTCAGCCCTCCAGGCATCATCGAGAGCCAGTATGGCTGCGTCAAACTCAATGCGGTCGATCAGGATGGTGCGCGATGCCAGGTAAAGCTCAATATCGTTCAGGGATAGAGGGAGCGGCACTCCGGCCATGCCGGCATACTTCCTGCCGCGCGATATCATGGCGTAAGCGTTGAGGATCTCCCCAGTAACTGCATCGATTTCAGGCTCTGGAATGGGCGGGAGATTTAGTTTCTCCCTGCGCCACTTTGCTTTCTCGCCCTGTTCGCCAGCGAATTCCTTTAGCCACTTTTGGGCCTCTATGGCTTTTTTACGGTTTCCTGAGTCTGCTGCTCCTTACCCTGAGCAATGCTTGCGGCCTCGGCCAGTATCCGCCAGTACAAATCCGGGTACTGTTTCAGCATGGCGATCCCGAGCTCTGGGGTATAGTCGAGAGCAACCTCTGAGCCATCCACCAATTGGCCCACCCCCTCCCAACCTTTCAGCAGGAACCGAGCGGCGTTATCGATCAGCAGGTCATCAACAGAGTCGATATCGCCCACACTGGCGAGATCGAAAGCATCCGTACCGACCTGGTAGCTCGCGTCCATTTTGTCGATATGGCGCCGCACCAGCGCATTGCGTGAGCGGTATTGTGGATTCTCGCTACTGGCCACCAGCAGACGGAGTTTAAATAGCGCCTCGTCTTCCGGCGTGAATTTCTTTTTACTTTCTGCTGGCTTTTTGTAAGGGTAAAACCAGCGTTCTCCGTTCAAATCAATTTGAGAAGAAATAATCAGCATAAAGACTCCCAAAAAAGCCCGTTCCGCGATGACTGCAGAACGGGCCAGGTAAATTAAGGCGCGGTAACGGTGATTTCAGACGTTGCGGTAAAGGTGCGGGCCTTACCGGTGATGGTTGCAGTACCGGCTGCGTTACGTGTGACTTTCGCTGTTTTCTGCCCGGTAGAAACCACGCTGGCGATAGTCGGATCC
GATGACGTCCACTGGACGGTATCAGTTGAATCAGCTGGCGTAAGCGTGGCGGTTAACGTCACCGTGGATCCCACGGCCCCAGTTGAAGTGGCTGGCGCAACACTGATTGCCGTCGCCGGCACTTTAGGCACGCGCGTAATCGTCGGCGGAGTATTGGCCGCGGTGATATCCAGCTGAACCTGAACAATGTCAGTGCTCCCCGCATCCGGCCAGTCGCCGGAGATCTGCACTTCCGGGAAATCGAAGGTATAGGCGCCTTCAGCATTCTCCAGCGTGAAGCTAAACGGCACCGTTTCGCCGGTGAACGTTTTTTTGTAAACCTCCCAGGCAGCCTTTGACCATGACAGCGTGATTTGACCTGACGGGGTAAAGGTTGTCGGAATGTTTGCGCCGGCGAATGCCGAACCGGTACCGATGCAGCGCTGAGTCTGCATATTGTTGTTGAACTGGATGTTGAAGGTGTCGACGCAGAAACCTGTCCCGCCATCAACACCATTTAGCCGGATGTTCGTGACCTCTTTGAAGGAGTAACGCAGCGCCCCCGCTAAATCCACCGGCGCGGTGAAATAGCCGGTATCGTCCCCCTTCGTCTCCCAGTCCAGCCCTGCAAACGTAATGGTTGCAGTGATATCACCATCGGCCGGGATTTCCATCTGGAAGGTGCCAACCTGGCAACCGCGGGCAATCTGGGCGATCCCCACATCACTGGCAAAAGTCGCCACGGAGAACGTAATGCGACCATTACCCATCGTTAGCACGTTATTTAGCCATTCGGAACCGAAGCAGCTGGCAAGAAAATCATCATGCTGGTTCCAGCGAAACCGCGTGCCGACATCGCCGCCGACATCCACTGTGCCGCGTGAAACACCTTGCGCCATGCGGTCACCAGCGATTTCGTCATTGTCGTTGGTGTTCTGCGTTGGTTTCAGACCAAATGAAGAACGACGCAGCAGGTTCCACGCCCCTGCTGTAGGCGTGATTCCTGGCGTTGTCTCGCGAATAAACGCGGCTACTACTTTTGCACCTGAGCTCACAGGAGCCTCCTGTTTTTTGTGCGCTACAGAGCGCGATAAGGAATTTGAAGATTGAGCTGTAACCAGCCATCGGTCTCACCCGCCGGCACAGCAGAAACAGCGAAATAACTCAGCTTTCCGTCATCCTTAAACTCGAATAGCTCCGTTAGCTGATCGGCCGTTCGGGAGATAAGCAACGTCCCGGAACCGACCGGAACAAAAAGCTGAATGATGAGTAAGCCCGTCCTGTGGACTACCGGCCCGTCCCCGATCTCGGTTGCGCCAGCCTGCCCAGCAATGTTGGTTAGTCGGGCCCAGATATCGCGGTTACTGGGGTCAAATACCGGGCCATTGGGATAATCCACCGCATCAGAGGCAATAGCGGTCTGTGCCGCCATTCGGGAAATGACAGCGTTTCTGATTTCTGTAAGGGTCATTTGTAGGCCTGAATAACACCATTAAACGAGACGGCATAGACGCCTGTCGGCGCCTGTGTTGAGTGGCCATTCTCCAGAGGCACGGAGTAAGGCAGGTTCGACTGGATGTAAATCACCGAGTAGGCTGGCGCCTGGTCAATGATATTTTTGCCATTAAGAAACGTCATTGTCCCGCGCGGATCCGGTTCGGTCGGGACGGAATGATTAGGTTCGCCGATGCTGACGAAATGCGATGCCCTGAAGGTTCCTGCGCGATACTCAGCCGGCCGCCTGATATCCATGCTGTCATTAACACGGACTTTCTTTCTGAGACGGCCTGTCTTTGTCAGGTTGGCAGGATCGGCATAAAGAGATTCGTTCCATTCCCCAACAGCTTTGTTGTACTGAACCGCGGTCGCGTTAATGGCCCACAGCTCCGGGTTTCCTACCGGCGACCGCTGAACGATTTCATTCAGCAGTTGAATGGCGATTGTCCGCTGGCGTAGTTTGACATCTTCTGCCACCAGCCCGGCGAATGCCGCCGGGTCAATGTTCCAGCCCTTAGCCATAT
CACGCCCTCCGCAGTTGAATGGAGTACGCAGCGCCAGCAGAGTCGGCAGAAGCGGTGATGACCTCGTAGCGCTGAAGCTCACCCGTAACCGGATCCGGTGCGGTGATGATATGCCCGACGGCCGGCTTATCAGTCACCTCGTTAACCAGTGCGGTTAGCTTCACATCACCATGCAGAATGTTAACGCCATCGATACGGCGCAGCTTATAGCGCGCCAGTACTCCACGCCCCGAGTAAGTCACCTGCGTTTCAGTGCCGGTTTCCGTCACCGGGTCCCAGGCACCCCGAACGGTATATGACCCAGTGAAATCCTTAACGGCATCCTGCAGGTCGGTATCGAATGCCGCGGCGACTTCGGTTTGCAGCTCGTCACGAATGCCCATTGCACCCACCAATACGCTGCTGAGGTTTAACGATCACTGTACCGTGGAGTTTGCGGGTATAAATTTCGCCATTGCGCTTAACCCGCAGCGGGAGCGGAGCAAACTCTACAACACCCTTTGCCTGATTTGCGTAAACGACATGTCTGATCGGGTTTCCATTCACAAACACATCGCGGGGACCGAGCCCGTCGCCGGCATAATGCACATATGGATTTTGCATGTTACCCCCTTACCGCCGCTCAATATGAGCATGGATAAAGTCGGTTTTAAGCGACTCCATAGCGCCAACCATCACATAGGGACGTCCACCGTTATGCCAGCAATCAATCGCGTTACCCTCATCATCAAGCAGTATCACTGCGACACTGTGGCAGCCGCCGTTTTCGGCTCTCTCCAGAGCCTGTTTCAGCAGGCGAATAACCTGGTCGTTATCGAGGTTGTGATGGCTGGGCTTTTGAAATGGGACCACCTTCAAATCGGACATATCACGCCCTCACAAAGAACGTCTGGAAAGGGTTAATCATCCACGGTTTGAGCATATCCAGCGCCAGCTGCAAATCAGGATCGAGTAATTCAGTGCTGGTGGTTGAAAGCTCGGCAAAAGTGCGGGAAACCTTCACATCGTCGGCCTCAACGCTTTTGCTCGTCACCACGCCGGAATCTGTTTTTTGCTGATACAGATTGCCTGCAGCGGCTACGGAAGCGATAAACGCTCCGGCTTGCTTAACTTCTTCAGGAATATGCTCCGGGTCGATATCCTGAAGGTTAAGCGCCGTCATCCAGGTGTTTGCCTGGAGCACGGCTTTACCCTTTTTGTCGGCGGCAGCCCAGGTATCCCCCAGCAACTCGTCAACGTCCTGGATTGTTATATAAACGGTCATCGGATCCTCACCAAAAGAAACGGGGCTTTCGCCCCGTCGGTTAACCACCCGCAGGAGCAGTGAACGCAATCGCTTCAGTTGTTTTCACCACACCGTCAACGGTAGCCGTCACCGTGAAAGAGCCGGCCGTAGGAGAGGTGAGTTTCACCGTCGAGCCACCAGCAGACCCTGTCTGTGACGTCGAAGCACTTAGCGTGCCGCCAGTAGACGTCCACGCCACAGATACCCCGGAAACTCCGGCACCATTTCTGGTGTACTTGAGCGAAACGGTCACCGCGTCGGTACTGTCAGCAGTTGCGGAAGTTTTATCCACTGACAGGGTTACTCCCCCGCAGGGGCTTCCAGCTTAATCAGTACGCCTGCAGTGGATTTGTTACTGGTGAAATGTTTCTTCCAGTTCGCGCCGGTGCCGATTTTGGTCAGGTCAGGGTTAGCGCCCTTCGTCTCATCCCAGCTGTAACCCAGCAGTTCAACGTTAACCGTACCCTCTGCGCGATAGCCAATGGCAAGGTTTTCCTGGTCGTTGATATCGTAGGAACGGAAGCCCGGAGCCTGTGATTCCGTTACGGATACCGCGCCGGCCACCAGCCCCAGAATCGCATCAACTGGCATGGTGTCAGTTACCAGCACCGGTTTACCCAACGTGCCTGGCTGTCCGCCATAAACCACCACGCCAGCTTCTTCGTAAATTTTGTTGTCGATAGCCTGATCAACAATGTCGAAATAGGTCGTGGAA
TGCATAACGAACAGCGCAACACGGTTAAATTTATCGCCGTATTTACGCAGGCCACGGGTCAGCGTTTTCTTACCATCAGTGGCAATATCCGCGGATACCGTCATGTCAGCATTTGCGCCAATGGCTGCTACAAGACCCTGTAGGGCATACTTGATATAACCTTCAAGCGTTGCATCAGCGACGTCGACGCCGATCACCTCGGAGAATTCGCTAACGTCGCGACCCCGACGTTTAAACGCCTCCTCCGTGGTTTCATACGGGCCGTATTTCCACGGCGCCTTAACGCTGACAGATTCACCGGCACCGATTTTTTTACCCGTTACCGGGTCGGTGGAGTTAACGTTGCGCGATTCGATAGAACCACCAACTTTATAGAAGGTGCGCTTGCGAAAATCACCCTCGATCAGTTCGTTGTCGAGAATGATTGCGCCGTTTGAAGCGGCGTTGAAGACTTCCAGATTATCCTGGCGACGCTCAAGAAACGCAGTCTGCGCGAGGTCGTCATAGATAATCAGGTCACTGTTTACGGTCGTAGGCATTGATTAGTCCTTACTTAGGCAATTTGAGATAGGCCTGCTGGCCATGTTTGCGGATGTAGTCCGCTTTGTCGCTTGAGCTCATTTCTGAACGTTTCAGACTACCGCCCCCGCCACCGGGTTTATGACCACCAGCCCCGGAGCCTTCGGCGCGCGGGAACAGGTGCGGGGCCGTCTCTTTCAGAGATTCAGCCCACTCAACCGGGGTGAGCGGAGTTTTGCCGTCTTTACCGAACAGAACATCGCCATTTGCATCAACTGCTACGGCCTCGCCTTCGTCGTTGAGCTGGAATGTGCCTTTAGCACGAAGAATCAGATCGTCGGATGCTTCTGGCAGCGCGCCTGCCTTAAGCGCTGCGCTGCGGATAGCATCACCCAGGACACGATCACGGAATTTGTTGGAGAACGCTTCCGCCTTTTCAGCGCGTTCATTAGCGGCTTTGATTTGCTTATCAACATCAGCACGTAGCCGCTCAGTGCGTTTATCCAGAACCTCATCAACTTTCCCGGCGGCAATCAGCTGCGCTTCCTCATCGTCGGAAAAGCGCTGGAGAATGGTTTTCACCGCGTCGGGGTCGATACCATCAAAACGTTTAAGCGACTCGGTGGACTCTTTGAGCTTACCAAGCAGCTCGCTATTTTTATTTTTCAGGCCAGAAACCTGAGCGCTGACTTGCTCATCGATCAACTTCTGGATTTCCGGCGTAATCTCAGGCGCTCCGCCGCCGGAACCGCCACCTTCACCACCTTCGCTGCCAGCTGCCGAATAATATTTAATGAGCATGTTACGAATAAGCATGTTGTCCCCTTGGGATAGTAACTGTGGGCCTGGCCCAATAAAAAAGGCCGCCCTTAGGCAGCCTGTTGTAAATTTCAGATAATAAAAAAGCCGCGCTAAGGCGACCTCTTCATTTAGCTATTTTCTAGCATGTATTCTTTTGCATCTTTAATGGCTTTATCCATTCTCTGCAAAGAAGACTTTGGCTCTGCAATGCTTCGCACTGTGCAAATCTCTTTAATGAGCCCTCTTGCGATTACCAGCTCTTCATACAGGCTTGCAATAAGGTCTCTTTGTTTTTGTGAATCCATAACAACCTCGTCTCGTTGCTTGTCGGGTTATTGGTTGCAGGTGGTGACGATTCCGCTTTTCGGGAGCGACCCTAGCCACTGACAATACAATTAGGTGTGGTGGCCGGTGCTGCCACGGCATTCTGATACTTCAGAACGGCGGGGACTCACCGAAGTGAGTCTGGTTTCCGGCTTGCCCGTTTCTCACGGGACGCTTTGGCGCGCAGGTCAGCATCCTGCATTCACCACGAATTTACTCTATCACACTCTGGCATCCTTAAACGCCTGCGCGTCAAGGTTGCGCAATTGGTCAAGCGTCAGCCACTCGCCCCTGTCGTTGTAGAGCTCATCGGGAGACATGCCGCCATCACGAATCAGCCTGGCGCGCGTTTCTCC
GACAATCTCAGCTTGTCGCGTGAACGACTGCCGGGAGAACCAGTCCTGGTAAGTCGTATCAGCCGGAACCTGTCCATCCATACTGGCGCGCGAGCTGTCCTTGATTTCGCCGACTTTGATACCCAATTCCTCGGACGATTTCAGTATGTATGTTTCGGTGCTACGACAGCAAAAGTGGATTTTCCCCGGTCCCTGCAAATAAGGCACCTTGTGCCCTATCGGTTTGTTATCCAGCGTGTACTTGAGTCGGTCGCGGATCCGACAATCCTTTGATGTCCGGTTATCCAAAGTAGATAACCACTGCTTACCCTTCAGAATGTCGTCGTTCGCCGACGCAAAGCTTTGTCTTGCTGTTGATGCAAGATGCCCTACTGCCGTTTTCGCTATGCTGGCCGCATTGGCCCGGCTCATCTGAAGCGCACCATCCTGGTAGCCGCGGTTAGCATGTCCGCGAACCTTTTTTGCGATCTGCTCATGCGTATCGCCCAGGAGAAAACCCTGCCGCACCGTATTGGATATGCGCGCCATACGATCAGCTTCGAGGTTGCTGGCCCATTCGCTTAGCAACCGCCCCTGAAATGGACGCGCCATCGCCGCGGCATAAACTGCATCCGGGGAGATGCCAACCAGTGGATGAAGAGCCAGAACATCGTCGGGAATGGCAAACTGGAAGAGGCTCATCTGAAAACTGGCCTCATGCTTCGCCAGCTCCTGCAGCTCGGTAGAGAGGGCTGCATACATGGACTGTATGGCATCCTTGTTTATGACCCTGACACTGACCAGTAACGCTTCCAGACGCGAAACGGTAAAGCTCTCGGGATCCAGCGTATCGATAGCCACCAGCAGCCTGGCGGTAAGTTCGGCGTCGCTGTCATTCAGAACTTTTATCATCCTGTTGGCAACGCCGGTGCTGTAGCGACTAACCCATATAGCGTGGGCTATGGATTCATCCTGCAGTTTGTCATTCGCCGTTGCCATTATTGCCACCAATCAGGTTAGGCGCGCCGTTACGAATAGCGTCAATGACAGTTTCAGGGTCATCAGCGGGATCTATCAGGTCAAGCCTCTGCAGAGCTCTGACCATATCCGTGTCGCGAATCGCACCGTACTGCCAGGCATTGACGATTGCCGTTACCATGCCGGATTCTGCGACTTTGGCGATAAACTCCTGATTGATGCTGTAACGGTATTCCTCGCCTTTTATGCCGAGATATCTGGCGCACCAGCCTAGCGCCAGCGTATAGGCCTCCGAGACATTGGAAACGCAAATGCCGAGCACCGATGTGGATGCGGTTTGCTCGCCGCTGGATTGCGTGGCGGTTTTAACCGCGCCGTTCTGCTCGATAAGCCGGGCGCCAAGCTGAACAGAATAATCACGCTTACTGTCCATCGCCTCTTTAGCCAGGGTGTTTGGTTGCGCCTGAGCATAGGTAAAACTCCCCTCCTTCGGCAGCAGGAATGGAGAACGAGAACCGACACGAATTCCCTTATCCTGCAGCCAGTCACGCCAGGCGGTATCAAGACCGGAAATCACCGGCTGAACCTGACCGCAGAAAAATACGCTGTCTTCGTAATCTGCCGAATTACGATAATGGCCAAGGTTAATTTCAACGAGGGCGGCTAAAGGCGACTCGTCGATGGTGGGATCATTATTCTGCGCACCAACGAAGGTAAAGGGGATCTCATCCCAGAAATCCTCACCTTTTGGCTTAGGATGATACTCGGAAGTGACGGAAAAAGAGCCTGCGTCAGCTGACTTTCGCCATACCCGGCAGACAAACTTTCCGTTCTCCAGAGCCAGTTCGCGATACTGGATTTCATCCTCGTACGCAAAACCATCTTCCTTTTCCATGCATTCGCGTAAAACCACCAGCACCAGTTGATCACGCCCATTGATGCGTTTGGTGCGCCAGTTAATGATGCTTTCCGCCTGATAACGAAGGATGATCGCCTCGTCGGTCTCAGCTGCATAATCCGTATAAAGCCCCTCGCGCGCGGCC
TCCAGAATATTTTCTGTAACCTGCTGGGACTGCTGATAAATGCTGGCACCAGCACCATCGGCGTTGTCACGAAGATAATTCAGTTTATCCGGCGCGGTCATGGTCGGGTCTTTTCTGAATGCCAGCCCCAGTAGACCCACTTTTGTATTGCCCGTTATCGCGTAGAAAACGGCGCGCTGAATGTAATCAGCATTGCGCTTTTTATTGCGTGCAGACTTATCGGACGGATCCAGAAAAGGGAGGTATTCATTCCCGGCGGCCTTTACAGCATCAGCCCCTTTGCACACGTCACGAATTTTTTTCCACACGGGCATTGCCGCCCTGACCTCAGGGCGAACGTAAGTAATATCGTTATTGGCCATCAGAATGTCGTGTCCAGTGAAATAGAGAATGCAGGTCGAACGATTGGGAATTGCTTCACAATGAAGTAACCGGCAGCATCGTTGGGGTGATCGTTATCGCTCTTTTTATCCGGCTCGCCATTTTTATCCCACACCTGTTGTTCCAGGCAGTCGGCATAGACCGGGCAACGGGCCACATTCACCTTGTACCGGCGATCGCCATTACCATTGCAGAACATGGCGTTCATGGAGTTAATGCGGTCCTTTACCGGCGGGTTAGCATCATCAACGATGACGTTAAATCCGGCCTGCCGGAGCTGCTCAATATCTGTTTTGCTGGCGTTGTTTGACTTCCTGGAATCACCAGAGGCATCCGGGTAAATATAAATCTCGCGGACCTTGCGGTAGTCACCGTCGGCATACAGCCAGAAACGTTCCTTGATGATGCGTATCATGTCGGGCGTATCGTAAGCGTTGATAATCTCTGTTACCGCGTGTGGTAAGCCGAGCCGCAATACATGGACGATCCCTGCCATCTTCCCGACGTTGAAATCCATCCCGATATACAGCGCTTCACCTGGCTGCTCTTCCTCACTGGAATTATTCAGCACCCTGTCGAACTGATGATAAATGGTGCCGCTGGTCAGGTTAGTAAACTGGCCATTCAGATATGCCTTGATCAATTCCGGCGGGTAACTCGCCAGGAGCGAAGGAATATAGTCAACCGGCAGGTTCTTTTCGTTGTCGAATGTCGAAGCCTGTACCAGACCATACATCGACCTCAGTTCAGGCTTTTCCCTCACAGCCTTAACAAACTGGTTATAGACGAACTTAAATCCTTCAGGTGTGGTGGTCACGTCAATGCCATTACGCAGACCATCAACCTTATAACGCATACGCGCGATTATTTTTCGCCACGCCTGACGCGCCTTATCCGCTTTCAGAACGTCGAGTTCATCCACCAGCGCATTGCCGATTTTAAAGCCTACTATCGTGTCGGGCTTTTCCATCGACCGACAAATTGTCGTGCCGCGGTACTGGCGCCCACTGTAGAAATGGACCTCTTTGTTGCTTTCAACGATTTTGACTTTCAGTCCCCAGTCGTGAGCAACTTCTTCCACCGTGGGGTAGAAGATATCGCGGATCTGAGGATAGGTCGGGGCAAAGTAGCCCTGGTTTATTTTGGGGAACTCCCAAAACCCTTTGCATATTCCACCGCAGCCAACCCATGTCTTTCCGGATCCAAAACCAGCTACATAGGCTTTGAACTTCTGCTGCATAGCCAGAAAACGAGCCTGGGGAACGTTAAGCGTCGGAGCTATCGCCATCCTCTTCCCTCACTCGCGCATCGACTACGTTGATATTGATCGCAACTGGCGTTGGTTCGTCATCTTCCGGGTCAGCGGCCAGCTCTTTACGGAGCTTGTCGATCTCCAGCTGCCGGCGCTCGATTTCAATCTGCTGTAGACGCTGGGTGAACTCACTGTCAGCCAGGCCGAGACGTTTCATCACCGCCTCGTACATTCGCTCGCGGCTGATAGCGGTTATCTCCACACCGTTCTTTCCGAGCTTAACGCCGGAATAGGCAAGTGCAGCATCAGGCGCCAGCTTGCGCGTATCGGCGAAGAAAGGCTGGCCGATGCCATCACCATTACAGCG
AGGACATTTCGGGTTAGGCGAGCTGGTATGGTCGTAACCGTAGCCGCCTCTGTCGTTTGGCTCTTTCCCTTTCTTCGCTAAAGCCTCAGCCAGCTTCTCTTCGAACTCAACCGCATCGCGCCATTGATACTGGTGACCGAAGCCCCAGCAGTAACGGCAGCTCCCGCGGCGATACTGAGAAAGTTGGTTGGCGTCGAATGTTGCCAGCCGCCACATCTGCTCAAGCACTTCATCAGCGCTGCCAAGCGTGCGCACAATGGATGCTTTCTGCTGCTGCGCAATGGCCTGCGCAACTGAAGTTTTCTGAAGCAGCTGATAGCCAATTTGTTCAGCAGTCTTCTTGCTGTACCCGGCACGGATAGCGGCCTGCGTGGCGTTGTGGTCCTTCAGGTATTCTGCGACAAATAAACGTTGCTGATCGGTGAGGCCATCATCATCCACCAGCTCTTCTGCGCACTTTTCCTTTTGCGCAGTGCGCAGTTTCTTCTGCGCAGGTTTTTGCGCAGTTTGCGCAGTGGGTTTCTTGATGTATCGGCGGGCAGTAGCGTAATTCAGTCCCTGCGCTTCACACCAATCCTTCGGTGATACGCCGGTTGCGGCATGATCGGACAGGAACCGTCGCTGAAGCTCGCCCCAGTCCGGTTTTGCCATGGATTATTCCTATTTAACGTGAGGGAGAAAAAGGAATTACTGATTCTCAATAAAATATTCACTTTTATGTTTTGGAATTAAGGCTCTTTAGTTCAGGAGTTATTATGAAAAGAATTATGCTTGCTGTTTTTGTGATCTGTGGTGCGCTGTCTCTTTCAGGATGTATCCTTCCCCCTGGGCCTCATAGCGGCGGACATGGTGGAGATCACTTCCATGGTCCAGAGCATCGTTAACCGCCTGAGGACTTCCATTTTACAGAAATGAAAAAGGCCGCAAAATTATGCGGCCTTTGGTCACTACCAACCAGCGTATAAAGAATCTCTCAGGAGCCAACAGAGAGAGGCTTTTCTGCTTTTTAACTGACCACTGCCGTTTTGGTGTTGGCTGGCAGTGATAACGTGTTGATAGCTTCATTTAAGTTATCGAAAGCATTTAAATATCGAAAGAGCTCATTGAACCAATCATTTTCAACTTGCCGGAACATTCAACCAGAGCACCAGGCATCTCTGCTGGTCTTTTGATGGCAATTCTCAGCTCTCCCGAACGAGGCCGGTAACTAACAATTTATTCGACAGTTTCTTCGGCATTAACCCAAAGATCTAGATGCTTGATGTAGCGTTGGATGGGCACATAAATAACCACCCCATCTACAAGGTTAATGGACTTGATAACACATCCCTGCGGAGCTAAATAATCCCCATCACAATGAGGGTGAATAGAGTGCTCGTCACCGTATCGATAACCATGCGGAAGTTGAGGGAGTGAATTTCTTGTCATGGGCAGCTTCTTAGATAGAAGGAATTGAAAATCCATAGTGCCTTAATGCACCTGACTTAGATACCAACTTTTCATTTTTCAGCGCTCTGTTGTCTCGTATTCTGATTTTTTGTTCATGTGGCCATGTAAATTTCAATACCTAAAGTATTCTGCGTTTGTAGCTGAATTACCTGGAACCCTTCTCTGTGAGCTGCGAGCAATTGGCCTGCACTGCTTTGTTGTGCGCCAGGATGTCACGCTTGGTCTGCTTATCCAACACATCGATATCGTGGTCAGTCAGGTAGATGATCCGCACCCAGCTGCAGGCCGTGTCAACGACTACCGGGGCGGGTAAACTTTTCGCGCAACTCCCGATCAACATCGTCATCGCCCATACGCTTAACGTCTTCCTGTACATCGCTGGCCCCTTTCGTGACTTCAGCACGGCGTTCTGCCGCGGCGACAGTAGCAGCGGCGTTCTCTTCGGTACGTTGCTGATCAGCTTTGGCTTTAGCCTTACTGGCCCCGCGAGCATGACCAATGCCGAACGCGCCAGCAATAGCACCCAGGATGACGACCACCAGTCCCGCGAT
AATTTCAAAGCTCATTGCTGCTCCTTCAGTTCGTCGGCCTTTTCTTTCAATGCTGGCTGGCGTACGTATTGCGATAGTACGGCCAGCACCACCAGCGTAGGGCTAATCAACGCAACGATATTTGGCGGCAGGATGTTTTTGATATCCGGCGGCAGCACCGCCCAGGCGTGCAGCGCAGCATCCGGGAACGACTGCGCCCATACACCAACCAGCGCGCCGATAGCTCCCAGCTTTACAGACCACGTTTTCAGCAGCAAGCTGGCATGCCCTACGAACTCCAGCCGGGTATATTTGCGCAGAAGTAACAGAACGAGCACAGCCACCAGCACAAGCAAAGCGAAAATGATCATCTTCACAGGACACGCTCCTTAACCCAGCCGTAGAGAAAATCCTCGTTGGCTTCGCGGCCCTCCGCCAGTTCGAGGTATCTGGCACCCTGGCTGCAGTTCAGCGCACGCAACAGAACCTGTTCACCCTCTTTCCCGCGGGCGGAAAGGTATCCCTTAAGCGCGGTGATGGTTCGGGGACCAATGGCGCCATCCGGAATCAGATCGGGATACAGCTTTCCGCGCATATTCATTGCGGTCAGCCAGCGCTGGAAAAACTTACTGGCTACAGATGGCCCTATGTTCACGCCAGTGTCGCAAAGCTCATCTGCCAGTAACGTAGATAGAGCTGCCACCTGGTCAAACCGGGGGCCGGTCCAGTAATCGCTCAGCAGGATTTGCTTTGCTGTTTCCCTGGGCAGGTTCCGCATATCACCGGTGTAGCCATGTGCACGGGCGGTGGTCTGCGTGATGCCCCAGCGGGTCGGCCCGCCTTTATCCGACGGATGATCGACATAACCACCCTCCTTGCCGAGGATCCCCTCGATAATCTGGTCTGCTGTCATTGTGCTTTCACTCCGGTGATTCGTTCCCAGAAATACGTGAGCGCTACGGAACCCATAGCACCACTGATACCGGCAGTGGCCAGTATCATGTAAATACTCAGCCCACCTTCAATGCTGATGAGCCCACCAATGACCCCGGTAAAAGCCGAAACCACAATTTGCGCAAAAGCATTTATCCAGCTCCATTTTGCTTTGCCCTGCTTCACATCCATCAGGAATCGGACAAGGCCGCCCCAACCAGCAATGATCAGCAGAGCCAGCCAGGTGATTCCGGCCATGCTCTCTTTGTCTTGCATATGCTTTGCCATAGTTTCACCTCCGGGTTAACGGGGTGCTGTGTGAATAAAGGGGACAGGCCCATCGGGCTGATTTAGCGACAAGCCTTAAAAGGAGTCATCCGTGAGCCAGAAATGAAAAAGGCCACGTATTAGCGCAGCCCTTAAATGTTTTTGGTTAGTTGAAGTGCCTTAATCAGACGAAAAAAAGCCCGCTCAGAGGAACGGGCAGAAATGTAGGCAATACTGATTCTGTACCGGATCGAGACGTACCTAATAGTCCGAGCTACCGATTTACCAGGAGAGCGCTCGCTTTTTCCGTTACTGCCTTTTAAACATAGCTGGAGAAGCCGAAACAGCAACCCCACTACCAAATAGCTTAGTAGCATTGCGTGGTGCCGGGTGCCTCCCGGTGAGCATGTCCCAGCCGACATGGCCCGCGCTGCATTTACAGATCACTGTAAGTGACTGGTCGCCCCACCGCACAGGGGGATTCACCACACGAATAGATTAACAAGATGTTATTTTTCTGGTCAATAAGATGTAAGCAAATGATGACATGCAGTTTTCTTATTGCTGAGTAACTTCAATCTGGTTCAGGGCTCTCGCGCATGGGCGTTAATGTGTCGTGCAGCACATCTCAACCCAAGAGCCCTGACCGGATTGCAGATACGAAAAAGCCCCGGCATTTGCCGAGGCTTTAAATTTTTTCTTCAACGGTGAACATACAATGCCCATCGTTAGAACAAATTAACACGAATTCGGGAAAAGTAAATATCTCACCGCGTTATTTGTTTGAGTTGCGCCTCTGCCCACGCCTCCTCTATATCGAA
TTTAGTGATCAATACATCGAAGAACGGTTTAACCGATTTCTTCCAGGTATCCAAAGTGATGGCGTCCGTTATCTGGCAAATGGCCCTATGCACAGCAGTGGAGAGGATTCGCTCATACCCGCGACCGCCACAGCGTTTACAGTTACCCATCACAGGCACTCCCTGCTTCTCCGTCTCATCCTGGTTCACTACCTTCCCCCGACCGTGGCAGTCGTTACAGGCGGCGCTAACAGTCCCTTTTCCCTTGCACTTTTGGCAAAGCACCCGGACCTGCTCCCGGACCGACTTCACCTCCTCCCAGTATGATGGATAGATCCCCTTTGTAACTTTGACCCACTTCGGCGGTTTGCCGTCCGGATACGTTACTTTGTTGGTGAACGCCACTGTGTCGATGAATCCAGACCCGTTGCAGCAGTCGCATGTTTTTTTACTGGAAGCACTGCGGGAGTAATCCTCAAAGGCGTACTCTGCGAGGATCCGTATAACCCGGGGTTTTACGCTTGGCGAGAGCTTTCGCAACGCAGCAACCTTATCGCATTTTGTCAGCGCGTACTCAGCCAATAGTCCTATAGCCCGATCCCGGTCATTGTTGCTTATGCCCATCTTGCCCAGGAAAGCGCTATACCCCATAGCGGCACGTTCCTGGGTCATGCCCATTGCTGCCATGATGTCGGTACCGGTCAGTGAATCTGAGGCGGTAGCACGCGGAGAATCGCTAATCAGCGTGGATTTTGCGAAGTGGTATTTCACTGTGTTTTCAAGATTCACGCTGCGGCCCTCTTTGGCTGTTTTGGTTTGGTCTGGTTCAGGTTGTGCTTTGCTACTGGCGGCATACTGGCGCGCTTAACGCTCTCAGTTTGGTACTGCATGAAGTGATCGAGGGTCATAGAGATTCCCCAATGATGATCTGCCCTTTCTCGCCCCATATTTTGGTGATGCGGCAATCCCAGACGTGTGAATCATCCTCATAGAGAGCGTCCATTAGGGCTTTCAGCATATTGTCGCAGTCGGGCTTTGACTGATGTGGACGTCCTGCGTATTGCGCTCTCTTTTTCTGACTCCAGCTTTGCGGCATAGGCATGACGAACGTGACGTGAGCGCCGGAATCTGGCAGGTGAATTTTGCGCAGACGAGCTTCATCACAGAACGCCCGGTAACGTATTACTTCCGGACGCTGCTTCCATTTATCAGCTCTGGTCATCCTGGGTTTGCCGATGGGCGTGATATCGTAGATTTTCATGATTTGATGAGTCCCTCTTTCCGCCAGATTTCCAGGGTGCGCATTACCCCCTCTGCGTGCATCAGGCGCAATTCGTCGTAGGTGAAATCGGTGGTTTTGGTTCTGCCGTCAATTACGTCATGGCACCCGTTGCAGGCGATCGCCGCCTGAGTATCGTCAGGCTTGTATCCTGTGCCGCACGTACCCGCCAGGCGGTAATGCGCCAACACGCTGGTTTCCGGGTTGCCGTTGCAGTAACCAGGGATCCGCACTGTACATTCGCGACCTCGGGCCGCTTTGCGAAGGTTCGCCATACTCACCCCCACATCCTGTTGCGCCAGCGAGAGTCTGGCCGCGGCGGATTTTTGTCCTCCACCAGCTGCGCGCTGACGGTCCATGTCATAAAGTCAGGGTTTAAGCTTCGTTCGACCTTTACGCCCCGCTGACGATATCTCGCTACCAATTCGTCGGCCTGCTGCGTTGTGCATTCGAGATGGTGAAACCATGAGTGTTTCATCGGCATCACCCCGCGAAGCTTAAAAGCTGGTTGGCGGCGTTCTCAGCTTCCTGCAGGCTGTTGAATGAACGAGAGAGGATCCACCGCCAGAGAACATCCAGCGATGCTTTGTACAGTTCCTGGAACTCGCATTCGTCCATGCTTGCGAAAGAAATGCTGCGAGGGTGTTTTTTCAGCGTGCCGTCCGGCAGCTGTATGGCGTCATAGTGGCCAGCTTCAACGATGACCCACGCCCGGTAAGCATCGAAGGATTTGCAAATACTG
ATATAGCCGGATCGCTTCTCAGCTATCCGGTCGAGATATTGCCCGGCGGCATCAAGCAACGCCGATTCACTCCCGCCATATGCAGCAAGGTATTTGGCGTAACCTGTGATAAGCCTGCGCTCGTTAGACGAAATCGCCCCGCCGGTAGGTTCCCAATATTCAAAGCCGAGATTGAGTAAAGCAAAGTAACGGCGGTGAAACGCCGGATTGCGGACAAGCTTAAAATCGGCCTCCAGAACGGCGCCGAGCTTGCATTTTGATTGCAAGAAATCGCTGGTCTCCTGCGTGGCAGGGATCAGTATTCCTTGGGTCTGTTTTATCAGGTGTAATTGTTGCGCCATGGGTTTCACTCCGTGGCGCTGAGATGCTCCGTTGCCGTTGTTCAGGCGGCAGGTAAATTATTGCAGCTTACTCTCGGTTTCGTCAATGCAGCCAGCTTCTTTAGCTAGCTCTTTAAACTCTTCAATCGTCAGCAAAAACTGATTTTTTCTTACCTTTTCGAGCCCAGTTATTTTCCCCCCATCGCTCGAAATTAAAAACTTCCCGCCCTGCCTGATAATGTCCACCACTTCGGCGATATCGAGATCCACTTCATCCCCCTGAGCGACATACAGACGCAAAAATATAGTCTGGCGACAGCATCAAAGGGACACGCGGTAATGACTCCAACTTACTGATAGTGTTTTATGTTCAGATAATGCCCGATGACCTTGTCATGCAGCTCCACCGATTTTGAGAACGACAGTGACTTCCGTCCCAGCCTTGCCAGATGTTGTCTCAGATTCAGGTTATGTCGCTCAATGCGCTGAGTGTAACGCTTGCTGATAACGTGCAGCTTTCCCTTCAGGCGTGATTCATACAGCGGCCAGCCATCCGTCATCCATACCACGACCTCAAAGGCCGACAGCAGGCTCTGAAGACGCTCCAGTGTGGCCAGAGTGCGTTCACCGAAGACGTGCGCCACAACCGTCCTCCGTATCCTGTCATACGCGTAAAACAGCCAGCGCTGACGTGATTTAGCACCGACGTAGCCCCACTGTTCGTCCATTTCAGCGCAGACAATCACATCACTGCCCGGTTGTATGCGCGAGGTTACCGACTGCGGCCTGAGTTTTTTAAGTGACGTAAAATCGTGTTGAGGCCAACGCCCATAATGCGGGCGGTTGCCCGGCATCCAACACCATTCATGGCCATATCAATGATTTTCTGGTGCGTACCGGGTTGAGAAGCGGTGTAAGTGAACTGTAGCTGCCATGTTTTACGGCAGTGAGAGCAGAGATAGCGCTGATGTCCGGCAGTACTTTTACCGTTACGCACCACGCCTTCAGTAGCTGAGCAGGAGGGACAACTGATGGAGATGGAAGCCACTGGAGCACCTCAAAAACACCATCATACACTAAATCAGTAAGTTGGCACCATCACCCCCCCTCAATACCGCCGACCTGATAAACGATCGAGCCATCTTCCCGATATTCCATTGGTGCAGCGCTCCAGCCTGCGCCACTGGGATCGTCATCGTCGCCTACCTGGACAAATCCACCAGCAACTACACGGGCCGGATACATTTCACCCTCAGTCCAGTACCCCTCTGTATCTTTGATACATAGAATTTGCAGTGAGTTGCTCATTTGTCTGCCCCTTCTAACGCCGCTGCTATCTCTTCGAAAAAGCCATCTCGGGTATGGCTGGTCATTGCTGGTAAAAATACGGACATCAGCCTGTTTGTGTTGCAGTTCTCATCGTCTGCGAACAGAGCGATTTTTTTATCCAAGCGCACCTTCGCTTCCTGCAACTGCTCGTTTTTCTTGTTAGTGCGCTGGATATAGTCGGCAATGATTTCTATAGCCTTGTTTGTGTATTTTTCGACGTGTTCAGTCATGTGAACCACCTATCGCCTCAATCGTTTCCAACAACAACCGGCGGCGCGTATTTTCTGCAAAGTGACGGCGCCCGGTTTCTTTGTGGTAAAACTCGTTTTTGCCGACGACCCACATCCGCTC
TGTCTGGTGCAGTTTTTTTACCTTCGGACCGTCTTTGGTGATCACGGTGCCGGTATGGGTTTTGATAATTGTCATACGGCCTCCCCAAGCACCCAACGGAGTGCGCTCGCATACTCACCATCGGCAGATTCCAGGGCTTTTGTGATTTCTTTGCGGGTTTTCAGGCGAGGCTTTGCATCACCGAGGATCTGACGCTGACGCCGGGCTCTTTCATGGCCGGTTGTGCCAGCAGTTGCTACTTCGATTTCAGAGACCTTCTCCCGTTGCTCTTCGGGTTTAAGCGATGCCAACTGACGCGCCTGGGTAACGGTAATTGTGCCAGCCTCTACCGCTTCCCGGACGGCCTGGGTAGCATCGAGGAGGGAGAGCGTTGCTCGAACGGTCTGAACGCTGCAGCCAAACAACACCGCAATGTCGTCCTCATCGAGCCCGCGGTCGAGCGCGTCTGACATTTTTTTAGCCCGGCCAAGCGGTGTATCAGGTCGGCGAATTTCGTTTTCACTGACCATGTATTTAGCCATCTGATTTGCTGATCCGCGCTTAACGACTCCAGGTACAAGCAGTGGGTCTTTGCCTTCTTTCAGACGGAGTTTATTTGCCTCCAGGGTATGTTTAACGCGCTGACGGCCAACAACTACGCAGGTGAGCCCCGTTTCGGGGTCTTTCCAGACGATGATCGGCTCCAGTACACCCAGCTCCGCAATGTTCAGTACCATCCCTTCCTCGATAGGCAGGTGTACACGCTCATCGTAAAGTGGGTGGGTCTTATCGGTGACCAGGTGCAGGTTTTCAGGCTCGAAACTGAGCACGTTTGTTTTGCCGCTGGCACCGTATACATCGATTGAATTCTTAGCCATGAATAGCCTCCTGAACATCTAAAACTCGCTGAAAAACAGGACTGCCAAGCAGGCTGTAATTCATCCCAACAGCAACTTTCGGCACCAGGCCAAAACGCTTCATGTCAAAGTCGATGACGGCCCGCTGATCGCGGAACAGCCCCAAACGACCATGCCGGACAACCTCGCCGGTCGCTTCTGCTTCGGAAAAATACCGCTGGACAGTAGCGCGGCTCAGCCCCAGTTTTTTCATTGCCTCGGTGGTCGTGAGGCGCCCCTGATGCCTGGTGATCCGAATCACTGCGCGGACGTACTCTCTGCGCTCAACTGCTGACAATGCTCTAGCCATGATTCCGCCCTCTGCCTAAACCGAATTTCGCGCGGATTTCCGCAATTTTGTTTAAGCCCTGCTCGTTACTCAGCGGACGTCCGCCAAGCTTTGGGATCTGTTTAACCGGATCGGGAATCACTTCCCCGGCATTCAAGCGACGAACCATACGCAGCAGCTCATCCTGCGCCTTACGTCGCAACTCAGTGTCGCTGAGGCCGTTTGAGCGCATGTCTGCGTACAAGCCAGTAACCATCCAGTAGCAGGCTTTGTGTTTGAGCGTTAACGGTTCGATTTTGTGCTCAGGCCATGGGTACGACTCAGCGTCTGGATACTGGCCGCGAGTCCGGCAATACTGGTAAACCATTTCAACCAGCTCACTCGCATCTGGCAGGCCTACAGTTACCGCCTCCTCAGAACGACACCAGGCGACGAACTGTCCCGGTGATGGCATGAATGGTTTTTCCTGTTTGCGCGCAACCCGCATTCCTGCGTTAATCTGCTCAACCGTGGTGATCCCGTTCTCTTTGAACGCCAACAACCACTGGCGACGCATCTCGTTGAGGTCTTCCACTGATTTGTTGGCCAGCACCGGGAAGACGGCAAGCAGTTGGCGGAACAGCTCGTTGAAGATCTCTGCAGTCTTGGCCGCCTGGCGCTTTACTGCCTGCTCGTCCTGCATTTCCGGAAGCCCGGCAGCCACTCGCTGGAAGTTTTCACGGTCGAAGTTGTGCATGCTTTCTGCGATAGATTTCATTCGAGCACCCCGTCGATCCAGTCTGTGTTGTCCAGCGCACTGGCGCCTGAGGAGTTTCTTGATGGACCATGGCTGCGCA
GGCGTTTAGTTGTGAGCTGATCCCATTTCTTACGTAGTTTTGAGGGGCAAAGGATGTTTTCCTGCCAGAAACCGTCCTCATTTGCCCACTTGAACAGTTCGCAGATCTCATAGTGAGTGCGCTTGTCCTGCAGACGCATCAGGCGGATGGTGTTCGCCCATTCAACCCAGTTCGGCTCAGAGAGGGAGGCATTCACGGTCAGGGCTTTATCGAAAATCCATCGCGCGGCTTTGAGGTCGTCGGCTGTTCCCCAGGATTTACCCGCAGGGGTATAAATCCCATCGGCCGCTTCTGGATGACGAGAGAGAAACTTCAAGGTTTCCTCGTTTCGGGATTCTTTAGAATTCCGAGACGAAGAAGATCTTTTACTATTGTTCTTGTTCTTGTATTGGGTGTCTCCCGTTTCCGGGAAAGGTTTTCCCGTTTTCGGTAACACTTTTCCCGATTCCGGGAAGAGTTTTCCCGTTTTCGGTTTGTCTAAAATCCACTCAGATAGCTCAGTATTTATACCGACAATTTTCATCACTCCCTGCTTATGAGCGAAGATAATTTTCCGCTCCGCGAGAGATTTGATTGTGTCGGAAATATGCGACTCTCCGAGGTCTGTCAGCTCAGCAATCACCGTGTTTGTTACTCGGTCCTGCTTCTTGTTCCATCCATAGGTAAGCCAGATAACAGCCTCAAGACACTGCCACTCACGACCTGACATCCGCAGACGCGGCTTGAGCTTCTGTATCTCGTTTGCGATCTTGGTATACCCGTTAGCCAGGTCGGCCATTTGACCTCCCGAACGCTCGGTTTTAATCGGAAAATTGATAACTTCAGCGGTATTTGACATACTCACTCCGTGAACTAAGAGCCCTTTTTTCACACCCCGAAGACTGGCTGTGTTGGCGCACAACAGTCTTCACCCTTTCAGAACAACCCAGCCTGGTCGCCGCCCTTTCGCACTTTCCGCTTTGCTTCCCGGCGTTCAGCTGCGCTGGTCTGCTTCTCAGCCCATAACTTTGCGTATCGCATAACATCGTCAAACATTCCCCCTTTGCGGCTTGCCTGTGACATCCGCTTGTACATATCGACCGCCTGGTATGCCCCCCCCCTGAGCCACTGCTTGCGTGAAGCCCTGGCGAAGAAGTTCCTCGCGGACATTCTTTTCAATAAATTCGATGTGATTCATGGAACCTCGCTTACATCACGCCGAGCATTGAGCTCACGATCGTCATCAGCGCTCCTGTCTGCTCAGGCATTAGCCTGAAAAGCGACGCAATCCCCTCGCTCACTTCCTTCAGTTTCTGGTGCTCTGGCGCGTTCAGCATCACCGCCTGCTTTGCTTCAGCGCATTCCTTCATGGCCGAAGACAGGCGCGACAAAATATCGTCCTGAGGCATCAGACGATGTCGGAACTCCAGCGGAAGAACGGCCATGATTGCCGGGGTAAGAAGGCGAACGTACTCGCGATAGCGCTCAGACTCGGCCGGGTTGTCCAGGTAGCGAAAAAGCTTCTGTCGGGCACGGCTGATATCATCAGGGAACGCGACCTCCTCGCCGCCCTGCTGGCGCCACTCATCGATGATATGTGCCGACACAACATCCTGGCCCTCAGCTGCAGCCCAGGCGCGAACAGCAGAGCGAATAGCGTCGTGATCTGACTCTCTCAGCTGATTTCGCTTTATCAGAGCGCCGGTGTTGAATCCGGTATTTTGTTGAAAGGGAAGTGTTTGCATGGTTAATCCCCATTAAGCTCGGAGCTGTTAGTTAGGGGAGTTGTAGGAGGTATAAACTCAGGCCAGATTGTTTCCCAATCATCTGGGAAACAATCTGCTCTGGTAACAGCGCCTTCCGTAAGCTGTTCTATTTGTATGGCGCGAGATGGGGATATGGCAGCAATCCCAGATGCCATTTGTGAAAGATAAGAAGTCGATACCTCGAGCTTTGTAGCCAAGGCCTTGGAGCTGCCTCGTTTCATGTTTAGATAATCTTTAAGTTGCATAGTGGCTCCC
TCATGTGATTACGGTGAGTTTATAAAACACTAAACCAAAACGTCAAGTATTTGCTTGTTTATAAATTACTAATCAAAATGCTTTCTATGACGACACAGGAAATTAGACGCAGGCGACTTAAGGAATGGTTCTCAGAAAAGTCGCTTCCAGAGAAAGAGAAAAGCTATTTATCTCAATTGATAAATGGCCGCAGTTCCTTTGGCGAAAGAGCGGCAAGAAGGTTAGAAAGAGATTACGGAATGCCCTCAGGCTTCCTTGACTCAGACACCTCTGACTCCCAAAGCACACCGCCAAGTCTTGTGTTGAGTGAAGAAGAACTTAAGCTCATTACTTTTTTTCGTGGATTCCCTGACTCCGCAAAGAAAGAAGCGCTAATTGAATTTGAATCTAAGTTCAATAAATTCAACGAACTTTTCAAAGAGTTACTGGCTTCACGCAGTTAACGCTCACGCCTCCCAAACCAAATCTCGTCTAAGGCGGGCTTTGGTTTTTTCACAACCCCTTCCTTCATTTGGGCCTCAGGATCTGAAGGCTTCAATTTTTTACGCATAAAAGTTTACTTTTTACTTTACAGAATAGTTTAGCAATGATTAAACTCATTACATCAACAACGCGCTGCGTTGCTCCGATAAACGTTCCGCTGGCCGGCGACAAGGCAATGAGGGTGAGATGAGTAAGGTAAAGGTGGCGCCTATTGAACTCGAAATAGACGCCACGGAAGTAATCAATCAGGTCGAGGAACTACTGGGGTTACTTGAGCTTCCAGCCCGTTCCCTTGAAGGCATCCCTGAGGATGTCGTCAACCTGCTTTTTGACAACATCCGTCCCTTGCTTAACAACATCGTCCTTAGTGATTTCTCGACCACAGTTGGCACAACTGACGCCAACAAAATTTGTATCAAAGTCGAAATCATCGGGACGCTTGAGCATCTCGCTTCCGCAATCAGGGCAAGCAACTTTCATAGTTGTCAGTTTTGACATTTTTTATTTCCTTGCTGGCTGTGTGAGAACTACCAGCATACCACCGAGCCTGAAGTGGTTAAAAGACAGGCAAACATGAGGAGTTGGAATGAGCAAGCAAGGCATCAGAGCCCTGATCATTTCAGCAGTTATTGGGCTCTTCATCTGGATCGCGCTCTTCAGCGCACTGAGGGGATTGTTTCTATGAATGATTTCGCACGCAAACCCGCTCGTCAGCAGGCTGTTCGTTTAAATCCGCTGTCGGCTTTCATCCGCCGGGTGTGCTACATGCTCGCGCAAAAAGGAGGCCCTTCATGAGCACGATGTTTGCCCTGGTTCTCACCGTCAGCATGCTGACGGGCGGTAATCAGGATGTCCTGCTCGGCGTTTACGACACTGAGAATGACTGCAAGGCAGCTGCAGAAGAGCAACACGTGAAAGCTGAATGTTATCCACTGAAAGGTTTACTGGACGAGCATCCGGCCGGGTTCACGGTGCAAATGTAGGGGGAAGAATGCAGAAGAAATGCGGTTACTGCAGTAAAGCAATCGAGGGAAAGCCAGTGGTAAGCACCCTGTTGTACCTCCAGGGGAACCAGCTAGCACGGAAAGAAAAAGAGTATTGCTCTGAACGTTGCGCCTCTCACGACCAGATGGCTCACGAGGGCTAACGTAAACCCGCCGAAGCGGGCTGTACGTCCGGTGCCACCGACCAAAGTTTCACCGGAAATTACCAAAACCAATGACCACCCTGAATGGGCGCTACCAATGGCCCGGGGGATTCTACATCCAAAATAGAGGCTATCACATGGAATATTTTTATCTGATAAAAGCGACTCAAAAATCGCGTAAAGCTGATGCCGTAATCTGGCGCACTAATAAATCAGAAGCTCGCGCCCTTCTGCAGCTGGACGTCGATCTGGAAGACGCTGGGATCGAAACAGGCCGCGGCAAAGACTATCAAAAACCAATTCGCACCGATTTTCCGGTATTCAATGACCTGCCGGCGGAAGGTGTTCTCGATTACTCATGGTGCGAACG
CTACCAGCTCGGCGACGATGGTCGCACCTGGGCTCTGAAGCCAGGTCAGGTGCCTGCGGATCATCACATCGATGATGCAGGAGTAACCTCTGAGACCGTGGAAACTTTCGGTAGTGATGAATACCAGGACGATTCCAGCGCGCTTTTTAACGTGGCCGAACTCCCCTTTCGCGCGCAGCTGCTGGCGCAGTACATGGCCGAAGAACGTCACGTTTATCATATCAGCATGCCTCACCGGCAGGAGCTGTCAGCTCTTGAAATGGACACTGATAACGCAGCCGTCCAGGATCTGATTCTGGCCGCCGAGAATATCCCTGAAATCAAAAAATACGATATGCCGACGCTCTGGAAATTCACCAGCGCCAATAAAAAAGTCTTCCCGGAAGGGAAACGGCATGAGCTCGGCAAACGTATTCAGTTTGCAAAGCTGTGGTTCGCCACGAACGCGATCGACCGCGGCATTCTCACCAGGGAATGGGCTGCCGGTAACTGCATTTCTTCGGTTTTGAAAACTGATGCAGGAACTAATGCTGGCGGCGGTAATAAAACCGATCGCAACCCTGACTACACCCATACCCTTGATACGCTCGATGTAGAAATAGCCCTGGCCACAATGCCAATGGATTTCGATATCTACAATTTCCCGGCATCAATTCACCGCCGGGCCAAAGAGATCGTTCAGAAGAAAGAAAGTCCGTTCAAGGAATGGTCGGCAGCGCTGCGCAAGGTTGCAGGCATCCTGGATTATTCCCGCGCCGCGATTTTTGCCCTTATTCGTGGCGCCACCAGCGACATTCATCATTTCCCGGTAAGTCTGCAGACCTATATCAATGCGAACCTGACCGAGCATAAGCATGACGCCCCTTCTGCTGAGACTCTTGAAAAAGCTGGTCATGTTTAATCTGCCGCCGTCACTCTGGACGCTGTGAAAAAGGCTATCGATGGAGATGAAGGTGTGCCTGACCTGGAAACTCTCCCAACTGACTTTCAGGTAATTGGCACCGAACTGGTGAAAGAAGCTCAAAAGAAACGCCCTGACGCTAATCAGGTTCTGGCCGCCGAACGTGGCGAATATGTCGAAGGCATCAGTGACCCCACGGATCCGAAGTGGATAACCGAAGACCTGACCAAACCCAAACAGCCTGAAGTTTCAAACATGGGCAATGGTGTTTTTTCGATTGATGGTCTGATGGATAGCCAGCCATCACCAGCACCAGCACCAGCACTTTCTATCGTGGACCAGGCGCGCCAGCGCGCTGCAGAGGAAAAATTACATCCAGCTAATTCCGGGGAAACCACCAGCGATGTGCAGATGGAAACGGCTCAGCCGGTCGAAGACGAAAATGATAATGCGGTATCAGCAGGCGAAGGCGCTGATGAGCCTCCTGCGCAAACAATTGCCGTGAACATGAGCAAAATACTGGCTGAACGCTGCCCGGATCTTACCGCCGAAGTGCTGAAAAGCCAGGTTTCCGAGAGTGCTCATAGAGATGAAGAGAAAGAGGCTGAACAAGCAGCACCAGCATGGCCGGAGTATTTCGAGCCTGGTCGATATGAAGGCGTGCCAAATGAGGTCTACCACGCCGCTAACGGCATCAGCTCCACGATGGTTAAAGATGCCCGGGTATCGCTGATGTATTTCGAGGCGCGCCACGTATCCAAAACCATCCAGAAGGTGCGCTCTCCTGTTTTGGATATGGGCAATCTGGTGCATGCACTGGCGCTGCAGCCTGATCAGCTGGAAAAAGAATTCAGTATCGAGCCGGAAATCCCGGAAGGTGCCTTCACCACGACGGCGACGATCCGCGCATTTATCGACGAATACAACAACGGGCTTCCGGTTTTGCTCAGCGCAGAGGACATCAAGAGATTCCTGGAGGAATACAACGCGAACCTGCCCGCCCAGGTTCCCTTGGGTACATCATTTGAAGAAACCGGCCAGGGTTATATGTCTTTACCTGCTGAGTTCCAGCGCATTGAAGACGGTCAGAAGCAAACCG
CCAGCGCAATGAAGGCCTGCATCAAAGAATACAACGCCACCCTGCCCGCCCAGGTGAAAACCAGCGGTGGCCGCGATGTCTTACTGGAACAGCTGGCGCTTATTAATCCTGACATGGTTGCTCAGGAAGCACAGAAGGCGCAGCCCCTGAAAGTCTCTGGTACAAAGGCCGAACTGATTCAAGCCGTGAAATCGGTAAAACCGGATGCCGTGTTTGCCGACGAGCTGCTGGATGCATGGCGCGAGAACCCGGAAGGAAAAGTGCTGGTTACCCGCCAGCAGCTGGCTACGGCACTGGCCATTCAGAAAGCACTGTTGAATCACCCGACCGCTGGCAAGTTGTTGACGCACCCGAGCCGTGCCGTCGAGGTGAGCTATTTCGGCATTGATGAGGAAACCGGGCTGGAAGTTCGCGTGCGCCCTGACCTTGAGATAGACATGGGCGGCCTGCGCATTGGTGCGGACCTGAAAACCATCAGTATGTGGAACATTAAGCAGGAAGGCCTGCGCGCGAAGCTGCACCGGGAAATCATCGAGCGCGATTACCACCTGAGCGCGGCTATGTACTGCGAAACCGCAGCCCTTGACCAGTTCTTCTGGATATTCGTCAACAAAGACGAGAACTACCACTGGATCGCCATCATCGAGGCATCCGAAGAACTGCTGGAACTCGGCATGCTGGAATATCGCAAAGCAATGCGTGCCATCGCGAACGGTTTCGACACTGGCGAATGGCCGGCGCCGATTACCGAAGACTACACCGAAGAACTTAACGATTTTGATATGCGCCGTCTCGAAGCGCTGCGCGTACAGGCATAAGGGGGAACAGTCATGGAAAACACTAACATTGTTACAGCCGAACAGCAGGCACCAAACACCATTTCAGCTAGCAACGCGATCTTTAACGTTCAGGCTCTCGGTCAGTTAACTGCTTTCGCAAACCTTATGGCTGATTCACAAGTGACAGTGCCAGCTCACCTTGCAGGTAAGCCAGCCGATTGCATGGCCATCGTTATGCAGGCTATGCAGTGGGGCATGAATCCCTATGCAGTCGCGCAAAAAACGCATCTGGTAAACGGCGTGCTCGGATATGAAGCCCAGCTCGTTAACGCGGTAATCGCCAGTTCCAGCGCTATTAACGGTCGATTTCATTATCGCTACGGCGGCGACTGGGAACGTTGCACAAGGACGCAGGAAATTACCAGGGAAAAACACGGTAAAAATGGGAAATACAGCGTTACAGAACGGGTGCGCGGCTGGACTGATGAAGACGAAATCGGGTTATTCGTCCAGGTCGGCGCGATTCTGCGCGGTGAATCAGAAATCACCTGGGGGGAGCCACTTTATCTCTCTGGAGTCGTCACACGTAATTCTCCTTTGTGGGTTTCTAACCCGAAACAGCAGATCGCTTATCTGGGCGTCAAATACTGGGCACGGCTGTATTGCCCGGAAGTCATCCTGGGTGTTTACAGCCCGGATGAAGTTGAACAAAGGACCGAGCGAGAAATAAACCCGGCGCCGGCGCAAAGAATGTCTGTCGCAGAGATCACCAGCGGAACAGACATCACCACCAGCGCGCAGGATTCAGCTCTCAATATTGATTCCCTGGCAGATGATTTCCGTGACCGCATTGAGCGCGCCGAATCGGTCGATGCAGCAAAAGCCATCAGGGCGGATCTGGATAAAGAGAAAGCTGTGTTGGGCACTGTTCTTTTCACCGAACTGAAAGGTAAAGCCGTGCAGCGTTATTTCATGGTAGACGCCCGAAACAAAGTTGAGGCCGCGATCAACTCTCTACCTAATCCCGGAGAACCGGAAGCCGTCGAACTGTTCGCTAAAGCTGAAGGCATTCTCAACGGCGCGAAACGCCACCTCGGTGATGAACTGTATGACCAGTTCCGCATCGCCCTGGACGACATGAAACCGGAATACGTGGGTTAACCAGATTGGGAGGGGAAACTCTCCCGATAAAGGAATGTATATGCGATTGATTAACCGAAG
CAGACACTCCCCTCTGGGCCGCCAAGCGTGCGATGCGGCACTGGCAAAACACGTTGAGCTTTATGGAGCCTACGGGCGACAGAAAACAAAGAGAACTTATACGGTGGTGGTTCAAGGCTCAAAGATCACTGTAGAAGTTGTTAACAGAAAAAGTAGCTATGTGGCCACAGCCATGAGCTGCGCGCGCCGGCTACACCATCTGCCTGGACAATGTAACTAAGGGGTTTTTATGACTAATACATCTCATAAATCAGATGAAATTTTGATAACCGATGACGTTCTGTCCAGATACAAAATATCGCGCAGCACACTTTATTTCTGGAGCACCCCATCCCGGATGCCCTCTTACTTTGCTCAGCCATTCCCGCAGCCTAAAATAAATGGCAGCCCTAAAAGGTGGAGACTTTCAGACTTGTTGGCCTGGGAAGATAACGTGGGGATCAAACCAGAGGCTGACCAACCAGCTTCTCAAGGTGATCCTGCCAAACAGCAAGCCAGTGACGCTGATCATCCAGATAATCATGCAGGTTATAACGTGCCATGACACCTGCCATATGATGGCCAAGCAGTTTTTCCACAACATGTGGCGGCGCACCTAATTCAGAAAGGCGTGTCGCCACTGTTCGTCTGAGGTCATGGAGAGACCAGGGCTTCATGCCTGTTTTAGCTATAATCTGAGCAGAAAACAGAGCGACGTTTGGTTGTAGTGGCGGTCTGTCATCTTCTGGCCCCCTGTAACGTGACAGTGTCACAACATGTTTTGAAACTGACGTTTCCTTCTCTGCTAACATCATTCTTACTACTGCCTCGGGAAGTGCCCTTCTGACCGATTTCCCGGTTTTATAATCGCTTGCCGGAATGGTCCACGTTTGCTCATGGAAATCAAACCACTCCCATCTTGCTGTTCTGATCTCTGTACTCCGGCAGCCAGTCATGATGAGAAACTTCATTATCAGTTGTTGTCGGTACTTCATTTCAGGAAGGATGTTCCAAACTGTTTTGATTTCCTCATCACTCAACCTGCGATCTTTTACGGATGCTGTGAGACCTACGTCAGAGCGCCTAAGGCTCTCAATCGGGTTCACATTAATTACCCCTCGATTGGAGCAGAAACGGAACGTACGCTGCATCAGCCCCAGCATCTGACCAGTGACAACTCTTCGCCCCATGCCATCAAAAAGGTTAAGCCAGTGTGCTTTAGTGGTCTGATCAACAATCATGTTCCCCAGCACAGGCGCTATATGGTTATTGAAGTCCCGCCGGTTAACCTTGATTTTCACAAGACCTTCAGGGATGCAGTAATACTTTTCCCAGTAATCGAAAGCCTCTTTAACGGTGAGTGCTTCGACTTTTTTCTGTTTCTCCAGAACTGTTTGCCGTCTCGGATCGAGTCCTTCTGTCAACCAGGCCCTGAACTGCTGCCTACGTTCGCGAGCTTGAGATAAGGAGGTGGTGGGATAATCGCCAATCGTTAGCTGAGCGGCTTTCCCGTTCCATCTGTAGCGGTAAAAGAATGTTATACTGCCGGAAGTAGACAACCGGACATTCAGACCATGAGCGTCCGATATGACCTCGATCTGGTCTCTCTTTTTGCCAAGAGCTTTTCTTAATTTTGTGTCGGTAAGCAA
Protein sequences of DBSCAN-SWA_4 >LR133964|2884031:2925584|2898325_2898559_-|VDY60613.1|DBSCAN-SWA MQNPYVHYAGDGLGPRDVFVNGNPIRHVVYANQAKGVVEFAPLPLRVKRNGEIYTRKLHGTVIVKPQQRIGGCNGHS >LR133964|2884031:2925584|2914805_2915141_-|VDY60638.1|DBSCAN-SWA MARALSAVERREYVRAVIRITRHQGRLTTTEAMKKLGLSRATVQRYFSEAEATGEVVRHGRLGLFRDQRAVIDFDMKRFGLVPKVAVGMNYSLLGSPVFQRVLDVQEAIHG >LR133964|2884031:2925584|2910839_2911196_-|VDY60628.1|DBSCAN-SWA MKIYDITPIGKPRMTRADKWKQRPEVIRYRAFCDEARLRKIHLPDSGAHVTFVMPMPQSWSQKKRAQYAGRPHQSKPDCDNMLKALMDALYEDDSHVWDCRITKIWGEKGQIIIGESL >LR133964|2884031:2925584|2919723_2920857_+|VDY60645.1|DBSCAN-SWA MEYFYLIKATQKSRKADAVIWRTNKSEARALLQLDVDLEDAGIETGRGKDYQKPIRTDFPVFNDLPAEGVLDYSWCERYQLGDDGRTWALKPGQVPADHHIDDAGVTSETVETFGSDEYQDDSSALFNVAELPFRAQLLAQYMAEERHVYHISMPHRQELSALEMDTDNAAVQDLILAAENIPEIKKYDMPTLWKFTSANKKVFPEGKRHELGKRIQFAKLWFATNAIDRGILTREWAAGNCISSVLKTDAGTNAGGGNKTDRNPDYTHTLDTLDVEIALATMPMDFDIYNFPASIHRRAKEIVQKKESPFKEWSAALRKVAGILDYSRAAIFALIRGATSDIHHFPVSLQTYINANLTEHKHDAPSAETLEKAGHV >LR133964|2884031:2925584|2924396_2925584_-|VDY60648.1|integrase|DBSCAN-SWA MLTDTKLRKALGKKRDQIEVISDAHGLNVRLSTSGSITFFYRYRWNGKAAQLTIGDYPTTSLSQARERRQQFRAWLTEGLDPRRQTVLEKQKKVEALTVKEAFDYWEKYYCIPEGLVKIKVNRRDFNNHIAPVLGNMIVDQTTKAHWLNLFDGMGRRVVTGQMLGLMQRTFRFCSNRGVINVNPIESLRRSDVGLTASVKDRRLSDEEIKTVWNILPEMKYRQQLIMKFLIMTGCRSTEIRTARWEWFDFHEQTWTIPASDYKTGKSVRRALPEAVVRMMLAEKETSVSKHVVTLSRYRGPEDDRPPLQPNVALFSAQIIAKTGMKPWSLHDLRRTVATRLSELGAPPHVVEKLLGHHMAGVMARYNLHDYLDDQRHWLAVWQDHLEKLVGQPLV >LR133964|2884031:2925584|2884302_2886408_-|VDY60600.1|DBSCAN-SWA 
MANIEKLGSSSPEVLLKNATNLDQLVNGRESESLPDRFGVLRKTWHGMEMIFSRFIGYITGRGEQAVAAIGWQELGNWAVGLAVDNRQQIVYYNGSWYKYLGELEHVIAGDSPENDGGVWSAANPTGKWSNIGDAALRSNLGSGEEGVGDALLAVKQPYTGAVARTQHDKNWDNINLLDFVYATDVVDGFVDYGLGLNRAIAAMSSLGSTSVEHIPRRRINLPAGLLHIKTKVELSFGPFAMCGEGIFQTIFAIHPSAASDDDYLFDFSSAGWDSTGRVRISELYLSNFSVLGETASWIRKRIFKFYGVGWDFAVENVQVWSPPLSSFDMTDVMDGVFNQVRINSGGRFLTASSEVTHQINMLNQYDNCNAIRYMSCHFENNYSGVSYIRGGSNNILFGGMCKFENNSRNNLVPVNQIYGSTSESVKFENIFVSHPAHITVYWLDSNGRHVSIRGGSFMSPSDLGGYTGLRWFRIHRTNWASATACVQLVADIDMMHVDGYGYNGANQLSPFDFEGEVIFRAKAIRLARPNTFMYINWNCKIDIENLTLLGAVTSDYQTSLFNVAASSVKADVNVDKYNGSVVGTVYVNPAMTTASRRQSIISVRGKSIQSSALTLSEGTDPLGYDESWTWNASGNVANIGYCHTGKRMIVRCADTSNCLMTGGNIFLPGGAVTAACTITLVAMNTSFRQGWVEVSRVAGV >LR133964|2884031:2925584|2907695_2907971_-|VDY60623.1|DBSCAN-SWA MSFEIIAGLVVVILGAIAGAFGIGHARGASKAKAKADQQRTEENAAATVAAAERRAEVTKGASDVQEDVKRMGDDDVDRELREKFTRPGSR >LR133964|2884031:2925584|2922786_2923896_+|VDY60647.1|DBSCAN-SWA MENTNIVTAEQQAPNTISASNAIFNVQALGQLTAFANLMADSQVTVPAHLAGKPADCMAIVMQAMQWGMNPYAVAQKTHLVNGVLGYEAQLVNAVIASSSAINGRFHYRYGGDWERCTRTQEITREKHGKNGKYSVTERVRGWTDEDEIGLFVQVGAILRGESEITWGEPLYLSGVVTRNSPLWVSNPKQQIAYLGVKYWARLYCPEVILGVYSPDEVEQRTEREINPAPAQRMSVAEITSGTDITTSAQDSALNIDSLADDFRDRIERAESVDAAKAIRADLDKEKAVLGTVLFTELKGKAVQRYFMVDARNKVEAAINSLPNPGEPEAVELFAKAEGILNGAKRHLGDELYDQFRIALDDMKPEYVG >LR133964|2884031:2925584|2912354_2912546_-|VDY60631.1|DBSCAN-SWA MDLDIAEVVDIIRQGGKFLISSDGGKITGLEKVRKNQFLLTIEEFKELAKEAGCIDETESKLQ >LR133964|2884031:2925584|2897013_2897406_-|VDY60610.1|DBSCAN-SWA MTLTEIRNAVISRMAAQTAIASDAVDYPNGPVFDPSNRDIWARLTNIAGQAGATEIGDGPVVHRTGLLIIQLFVPVGSGTLLISRTADQLTELFEFKDDGKLSYFAVSAVPAGETDGWLQLNLQIPYRAL >LR133964|2884031:2925584|2898824_2899220_-|VDY60615.1|DBSCAN-SWA MTVYITIQDVDELLGDTWAAADKKGKAVLQANTWMTALNLQDIDPEHIPEEVKQAGAFIASVAAAGNLYQQKTDSGVVTSKSVEADDVKVSRTFAELSTTSTELLDPDLQLALDMLKPWMINPFQTFFVRA >LR133964|2884031:2925584|2907967_2908312_-|VDY60624.1|DBSCAN-SWA 
MKMIIFALLVLVAVLVLLLLRKYTRLEFVGHASLLLKTWSVKLGAIGALVGVWAQSFPDAALHAWAVLPPDIKNILPPNIVALISPTLVVLAVLSQYVRQPALKEKADELKEQQ >LR133964|2884031:2925584|2908844_2909156_-|VDY60626.1|DBSCAN-SWA MAKHMQDKESMAGITWLALLIIAGWGGLVRFLMDVKQGKAKWSWINAFAQIVVSAFTGVIGGLISIEGGLSIYMILATAGISGAMGSVALTYFWERITGVKAQ >LR133964|2884031:2925584|2890587_2891052_-|VDY60604.1|DBSCAN-SWA MTDIYYPHDSLPMPLQEGYGFQPVSPLKRTQLTTGRARQRRAYTSTPTQASITWFMETDAKGLAFESWFRDALSDGAAWFMMKLQTPAGIKFYKCRFTDIYQGPVLVAPIYWKYTATLELWERPLAPAPWGNYPEWIVGSSLLDIALNKEWPKA >LR133964|2884031:2925584|2915133_2915877_-|VDY60639.1|DBSCAN-SWA MKSIAESMHNFDRENFQRVAAGLPEMQDEQAVKRQAAKTAEIFNELFRQLLAVFPVLANKSVEDLNEMRRQWLLAFKENGITTVEQINAGMRVARKQEKPFMPSPGQFVAWCRSEEAVTVGLPDASELVEMVYQYCRTRGQYPDAESYPWPEHKIEPLTLKHKACYWMVTGLYADMRSNGLSDTELRRKAQDELLRMVRRLNAGEVIPDPVKQIPKLGGRPLSNEQGLNKIAEIRAKFGLGRGRNHG >LR133964|2884031:2925584|2899541_2900495_-|VDY60616.1|DBSCAN-SWA MPTTVNSDLIIYDDLAQTAFLERRQDNLEVFNAASNGAIILDNELIEGDFRKRTFYKVGGSIESRNVNSTDPVTGKKIGAGESVSVKAPWKYGPYETTEEAFKRRGRDVSEFSEVIGVDVADATLEGYIKYALQGLVAAIGANADMTVSADIATDGKKTLTRGLRKYGDKFNRVALFVMHSTTYFDIVDQAIDNKIYEEAGVVVYGGQPGTLGKPVLVTDTMPVDAILGLVAGAVSVTESQAPGFRSYDINDQENLAIGYRAEGTVNVELLGYSWDETKGANPDLTKIGTGANWKKHFTSNKSTAGVLIKLEAPAGE >LR133964|2884031:2925584|2911697_2912297_-|VDY60630.1|DBSCAN-SWA MAQQLHLIKQTQGILIPATQETSDFLQSKCKLGAVLEADFKLVRNPAFHRRYFALLNLGFEYWEPTGGAISSNERRLITGYAKYLAAYGGSESALLDAAGQYLDRIAEKRSGYISICKSFDAYRAWVIVEAGHYDAIQLPDGTLKKHPRSISFASMDECEFQELYKASLDVLWRWILSRSFNSLQEAENAANQLLSFAG >LR133964|2884031:2925584|2894504_2894861_-|VDY60606.1|DBSCAN-SWA MKKVIALALGALLLSGCTVRVADLTVASTKNYNLNGGKFYKGKRVTAEDSYPVIIFPLGIPNVKTAADRAIEKDRCAVGLSDVVVTQLNHSFLFGKIGLRVEGNLVIDRSLPGCENAS >LR133964|2884031:2925584|2901404_2901581_-|VDY60618.1|DBSCAN-SWA MDSQKQRDLIASLYEELVIARGLIKEICTVRSIAEPKSSLQRMDKAIKDAKEYMLENS >LR133964|2884031:2925584|2904317_2905625_-|VDY60621.1|DBSCAN-SWA 
MAIAPTLNVPQARFLAMQQKFKAYVAGFGSGKTWVGCGGICKGFWEFPKINQGYFAPTYPQIRDIFYPTVEEVAHDWGLKVKIVESNKEVHFYSGRQYRGTTICRSMEKPDTIVGFKIGNALVDELDVLKADKARQAWRKIIARMRYKVDGLRNGIDVTTTPEGFKFVYNQFVKAVREKPELRSMYGLVQASTFDNEKNLPVDYIPSLLASYPPELIKAYLNGQFTNLTSGTIYHQFDRVLNNSSEEEQPGEALYIGMDFNVGKMAGIVHVLRLGLPHAVTEIINAYDTPDMIRIIKERFWLYADGDYRKVREIYIYPDASGDSRKSNNASKTDIEQLRQAGFNVIVDDANPPVKDRINSMNAMFCNGNGDRRYKVNVARCPVYADCLEQQVWDKNGEPDKKSDNDHPNDAAGYFIVKQFPIVRPAFSISLDTTF >LR133964|2884031:2925584|2912626_2912920_-|VDY60632.1|DBSCAN-SWA MAHVFGERTLATLERLQSLLSAFEVVVWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ >LR133964|2884031:2925584|2917143_2917680_-|VDY60641.1|DBSCAN-SWA MQTLPFQQNTGFNTGALIKRNQLRESDHDAIRSAVRAWAAAEGQDVVSAHIIDEWRQQGGEEVAFPDDISRARQKLFRYLDNPAESERYREYVRLLTPAIMAVLPLEFRHRLMPQDDILSRLSSAMKECAEAKQAVMLNAPEHQKLKEVSEGIASLFRLMPEQTGALMTIVSSMLGVM >LR133964|2884031:2925584|2895817_2896990_-|VDY60609.1|DBSCAN-SWA MSSGAKVVAAFIRETTPGITPTAGAWNLLRRSSFGLKPTQNTNDNDEIAGDRMAQGVSRGTVDVGGDVGTRFRWNQHDDFLASCFGSEWLNNVLTMGNGRITFSVATFASDVGIAQIARGCQVGTFQMEIPADGDITATITFAGLDWETKGDDTGYFTAPVDLAGALRYSFKEVTNIRLNGVDGGTGFCVDTFNIQFNNNMQTQRCIGTGSAFAGANIPTTFTPSGQITLSWSKAAWEVYKKTFTGETVPFSFTLENAEGAYTFDFPEVQISGDWPDAGSTDIVQVQLDITAANTPPTITRVPKVPATAISVAPATSTGAVGSTVTLTATLTPADSTDTVQWTSSDPTIASVVSTGQKTAKVTRNAAGTATITGKARTFTATSEITVTAP >LR133964|2884031:2925584|2894937_2895114_-|VDY60607.1|DBSCAN-SWA MAGVPLPLSLNDIELYLASRTILIDRIEFDAAILALDDAWRAEWAEAQKRAADKKGSN >LR133964|2884031:2925584|2905602_2906607_-|VDY60622.1|terminase|DBSCAN-SWA MAKPDWGELQRRFLSDHAATGVSPKDWCEAQGLNYATARRYIKKPTAQTAQKPAQKKLRTAQKEKCAEELVDDDGLTDQQRLFVAEYLKDHNATQAAIRAGYSKKTAEQIGYQLLQKTSVAQAIAQQQKASIVRTLGSADEVLEQMWRLATFDANQLSQYRRGSCRYCWGFGHQYQWRDAVEFEEKLAEALAKKGKEPNDRGGYGYDHTSSPNPKCPRCNGDGIGQPFFADTRKLAPDAALAYSGVKLGKNGVEITAISRERMYEAVMKRLGLADSEFTQRLQQIEIERRQLEIDKLRKELAADPEDDEPTPVAININVVDARVREEDGDSSDA >LR133964|2884031:2925584|2900505_2901291_-|VDY60617.1|DBSCAN-SWA 
MLIRNMLIKYYSAAGSEGGEGGGSGGGAPEITPEIQKLIDEQVSAQVSGLKNKNSELLGKLKESTESLKRFDGIDPDAVKTILQRFSDDEEAQLIAAGKVDEVLDKRTERLRADVDKQIKAANERAEKAEAFSNKFRDRVLGDAIRSAALKAGALPEASDDLILRAKGTFQLNDEGEAVAVDANGDVLFGKDGKTPLTPVEWAESLKETAPHLFPRAEGSGAGGHKPGGGGGSLKRSEMSSSDKADYIRKHGQQAYLKLPK >LR133964|2884031:2925584|2889924_2890407_-|VDY60603.1|DBSCAN-SWA MTRLNRLYASSGPEVIIETLQITIGSDVHYLCQGYDNITATTENGDTVTFSACAIDIALPARNADGTQDLKFALCNIDGVVSTAIRNALANRLSAFLTYRRYISTDLAAPAEVPYTLKIKSGSWTATEVQITAGYMNILDTAWPRYRYTLPVFPGLRYIS >LR133964|2884031:2925584|2914027_2914813_-|VDY60637.1|DBSCAN-SWA MAKNSIDVYGASGKTNVLSFEPENLHLVTDKTHPLYDERVHLPIEEGMVLNIAELGVLEPIIVWKDPETGLTCVVVGRQRVKHTLEANKLRLKEGKDPLLVPGVVKRGSANQMAKYMVSENEIRRPDTPLGRAKKMSDALDRGLDEDDIAVLFGCSVQTVRATLSLLDATQAVREAVEAGTITVTQARQLASLKPEEQREKVSEIEVATAGTTGHERARRQRQILGDAKPRLKTRKEITKALESADGEYASALRWVLGEAV >LR133964|2884031:2925584|2886469_2889538_-|VDY60601.1|DBSCAN-SWA MTIRFYPSRLPGEPLETHEHGVTSIRSWLVANVENYTDRDVPPLTVEVEGQSIPPGDWATCVIRPDSDVRLYPVPFGLEAATIAWIGVGISVAAAAYSLFMMSTIDTGGYTSSTGRSLDVNPAKANNAKLGDAIREVFGRVRIYPDYVVQPVTRFDAADPTKMRVQMLLCLGVGDLIYTNGDIRVGSTPASTLPGFSSTHYPPGADVSGDDRSENWVNSTEVGGTSSGTGLDMAQTSPDADDIIADSMTVSGSSVTFTGLDTDDDDDNDENDNALPPSWVAGAVVELKAPANYQITSAGGYSVIASPLLTEIAPMVGMPVTLGFNSVDYDLFIASYTPGQAAVPGTGGSAAKVQASAAPTTYDFSTSSSTFTITWQGVSYPVSLVANYVSMSGLLAAITEGLTGSGLVAQDNGGTVLITESASPFAGGAITSSSLPAAVFGDAPVYTSGTASTGGSPAVTANVTLAYNSATGTAFSGMPEGVQRLSLAHRGNEYRIVSADGTTATVARLVSGAVDESWPGFSARTMIDYEATGLNDTLSWLGPFLVCPENEVVDAFEVNFSFPNGICGFDSKGKKRIRHVEWEIQYRVYGSGSGWVSHQGEYALKNINGLGFTERITLSSPGLVEVRCRRRNEQGSNNARDSMYWQALRGRLLTRPSSYPGVSLMAVTVETGGKLAAQSDRRVNVVATRAYDSGTARTISGALLHVGNSLGLEMDTDTINALEYAYWTPRGEYFDFATGDSISALEMLQKIANAGKSRFLLSDGLATVNREGIKPWTGMITPHEMVEELQSGFTVPSDDDFDGVDVTYINGATWAEETVKCRTPDNPTPVKIENYKLDGVLNQDHAYQIGMRRLMKYLLQRVTFQTTTELDALCYNTGDRIVLTDDIPGNNTISCLVEAMTTAGGVTTFTVTEPLDWSFENPRALIRYQDGSASGLMVASRVGDFQLSVPHLSEFDDPMKVDLSSTTIESIRLVFCGSTRHVYDAIVEEIAPQSDGTCQVTTKEYLESFYAYDDATYPGDVA >LR133964|2884031:2925584|2902917_2904318_-|VDY60620.1|DBSCAN-SWA 
MANNDITYVRPEVRAAMPVWKKIRDVCKGADAVKAAGNEYLPFLDPSDKSARNKKRNADYIQRAVFYAITGNTKVGLLGLAFRKDPTMTAPDKLNYLRDNADGAGASIYQQSQQVTENILEAAREGLYTDYAAETDEAIILRYQAESIINWRTKRINGRDQLVLVVLRECMEKEDGFAYEDEIQYRELALENGKFVCRVWRKSADAGSFSVTSEYHPKPKGEDFWDEIPFTFVGAQNNDPTIDESPLAALVEINLGHYRNSADYEDSVFFCGQVQPVISGLDTAWRDWLQDKGIRVGSRSPFLLPKEGSFTYAQAQPNTLAKEAMDSKRDYSVQLGARLIEQNGAVKTATQSSGEQTASTSVLGICVSNVSEAYTLALGWCARYLGIKGEEYRYSINQEFIAKVAESGMVTAIVNAWQYGAIRDTDMVRALQRLDLIDPADDPETVIDAIRNGAPNLIGGNNGNGE >LR133964|2884031:2925584|2901821_2902934_-|VDY60619.1|head|DBSCAN-SWA MATANDKLQDESIAHAIWVSRYSTGVANRMIKVLNDSDAELTARLLVAIDTLDPESFTVSRLEALLVSVRVINKDAIQSMYAALSTELQELAKHEASFQMSLFQFAIPDDVLALHPLVGISPDAVYAAAMARPFQGRLLSEWASNLEADRMARISNTVRQGFLLGDTHEQIAKKVRGHANRGYQDGALQMSRANAASIAKTAVGHLASTARQSFASANDDILKGKQWLSTLDNRTSKDCRIRDRLKYTLDNKPIGHKVPYLQGPGKIHFCCRSTETYILKSSEELGIKVGEIKDSSRASMDGQVPADTTYQDWFSRQSFTRQAEIVGETRARLIRDGGMSPDELYNDRGEWLTLDQLRNLDAQAFKDARV >LR133964|2884031:2925584|2884031_2884301_-|VDY60599.1|DBSCAN-SWA MSFTLTQNLKSFRTIPYINLVEPLSAAPVAVTYTAKGVDSINGTTATVLFDTQVEGLDATGQLYYSFEFTDLVTIFEDAETALKKEIQS >LR133964|2884031:2925584|2897402_2897954_-|VDY60611.1|DBSCAN-SWA MAKGWNIDPAAFAGLVAEDVKLRQRTIAIQLLNEIVQRSPVGNPELWAINATAVQYNKAVGEWNESLYADPANLTKTGRLRKKVRVNDSMDIRRPAEYRAGTFRASHFVSIGEPNHSVPTEPDPRGTMTFLNGKNIIDQAPAYSVIYIQSNLPYSVPLENGHSTQAPTGVYAVSFNGVIQAYK >LR133964|2884031:2925584|2918625_2918934_+|VDY60642.1|DBSCAN-SWA MSKVKVAPIELEIDATEVINQVEELLGLLELPARSLEGIPEDVVNLLFDNIRPLLNNIVLSDFSTTVGTTDANKICIKVEIIGTLEHLASAIRASNFHSCQF >LR133964|2884031:2925584|2919230_2919422_+|VDY60643.1|DBSCAN-SWA MSTMFALVLTVSMLTGGNQDVLLGVYDTENDCKAAAEEQHVKAECYPLKGLLDEHPAGFTVQM >LR133964|2884031:2925584|2909902_2910724_-|VDY60627.1|DBSCAN-SWA MNLENTVKYHFAKSTLISDSPRATASDSLTGTDIMAAMGMTQERAAMGYSAFLGKMGISNNDRDRAIGLLAEYALTKCDKVAALRKLSPSVKPRVIRILAEYAFEDYSRSASSKKTCDCCNGSGFIDTVAFTNKVTYPDGKPPKWVKVTKGIYPSYWEEVKSVREQVRVLCQKCKGKGTVSAACNDCHGRGKVVNQDETEKQGVPVMGNCKRCGGRGYERILSTAVHRAICQITDAITLDTWKKSVKPFFDVLITKFDIEEAWAEAQLKQITR 
>LR133964|2884031:2925584|2898568_2898823_-|VDY60614.1|DBSCAN-SWA MSDLKVVPFQKPSHHNLDNDQVIRLLKQALERAENGGCHSVAVILLDDEGNAIDCWHNGGRPYVMVGAMESLKTDFIHAHIERR >LR133964|2884031:2925584|2911192_2911495_-|VDY60629.1|DBSCAN-SWA MSMANLRKAARGRECTVRIPGYCNGNPETSVLAHYRLAGTCGTGYKPDDTQAAIACNGCHDVIDGRTKTTDFTYDELRLMHAEGVMRTLEIWRKEGLIKS >LR133964|2884031:2925584|2908308_2908848_-|VDY60625.1|DBSCAN-SWA MTADQIIEGILGKEGGYVDHPSDKGGPTRWGITQTTARAHGYTGDMRNLPRETAKQILLSDYWTGPRFDQVAALSTLLADELCDTGVNIGPSVASKFFQRWLTAMNMRGKLYPDLIPDGAIGPRTITALKGYLSARGKEGEQVLLRALNCSQGARYLELAEGREANEDFLYGWVKERVL >LR133964|2884031:2925584|2920911_2922774_+|VDY60646.1|DBSCAN-SWA MPDLETLPTDFQVIGTELVKEAQKKRPDANQVLAAERGEYVEGISDPTDPKWITEDLTKPKQPEVSNMGNGVFSIDGLMDSQPSPAPAPALSIVDQARQRAAEEKLHPANSGETTSDVQMETAQPVEDENDNAVSAGEGADEPPAQTIAVNMSKILAERCPDLTAEVLKSQVSESAHRDEEKEAEQAAPAWPEYFEPGRYEGVPNEVYHAANGISSTMVKDARVSLMYFEARHVSKTIQKVRSPVLDMGNLVHALALQPDQLEKEFSIEPEIPEGAFTTTATIRAFIDEYNNGLPVLLSAEDIKRFLEEYNANLPAQVPLGTSFEETGQGYMSLPAEFQRIEDGQKQTASAMKACIKEYNATLPAQVKTSGGRDVLLEQLALINPDMVAQEAQKAQPLKVSGTKAELIQAVKSVKPDAVFADELLDAWRENPEGKVLVTRQQLATALAIQKALLNHPTAGKLLTHPSRAVEVSYFGIDEETGLEVRVRPDLEIDMGGLRIGADLKTISMWNIKQEGLRAKLHREIIERDYHLSAAMYCETAALDQFFWIFVNKDENYHWIAIIEASEELLELGMLEYRKAMRAIANGFDTGEWPAPITEDYTEELNDFDMRRLEALRVQA >LR133964|2884031:2925584|2913048_2913324_-|VDY60633.1|DBSCAN-SWA MASISISCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRSR >LR133964|2884031:2925584|2919430_2919586_+|VDY60644.1|DBSCAN-SWA MQKKCGYCSKAIEGKPVVSTLLYLQGNQLARKEKEYCSERCASHDQMAHEG >LR133964|2884031:2925584|2891051_2893952_-|VDY60605.1|tail|DBSCAN-SWA 
MTEQTSRLAIIIDSSGAEKQADNLATALVKMTQAGERAATSAGKVTKATDEEKQSLSELLDRIDPVNAALNKLDKQQQDLAKFKSKGMVDTDTFDLYSKKIEETRNRLTGFRDDLGKTGQSAAQTAYAMRMIPAQMTDIVVGLSTGQSPFMVLMQQGGQLKDMFGGIGPAIKGVGTYVMGLVNPLTLAAAAVGFLGLAYYKGTQEQDEFYKSLVLTGNLVGKTSGQLADMAARVSVAANSTTGAAASTLNQLVLSGKVAGDSLERVTTAIVKTSEATGIATDKLVGDFNDITADPVAAITKLNDQYHFLTLATYNQIKALQDEGNQQDAARVATDAYANAMQQRANDIHQNLGLLESAWDSLGKTAKGAWDAMLNIGREQTLTDKLATLNENIAEAQKGQKDGGFWNSFSARFTNLPEMIKQRDLLESVANLQGDVTKGQAKAKEAEQQRIKTQQEADRVNQQYLSNADKRNKAIKQQSEFLKAGAITAEQYAKNVSRINEMYKDPKPPKTPKGKAYTEDAATRLLDQINQQTAALQSQLDASDKLNSATQARVKFEQQITDLKSKTQLTADQKSILSRSDEILQAYKQQEALQNSVKTLDDYRKMQEQVKTKDERTNDLLKTRLELLEKAKATGQLKPGEYEKTRADIYQNTDMQLPATVRNVVGNLTPTGGRLSGTFEGMQGQINEYDQAQQELQRWLAAQEEAYAKAGEITAEGEARMTSIRQRAADANQVIEAQKNTIISAATQSLFDSTADIMRTGFGEQSAIYKVAFAASKAFAIADSMVKIQQAIASGAVSAPYPANIIAMASIAAQTASIVSNIQAVSGVGFASGGYTGPGGKYQPAGIVHKGEYVFDQASTNRIGVSQLEALRNGQPLDATLGRTGFGTGVQNVNSDNSSKTTIHAPIEQHFHTPPGVTPDQMALSMAQTQKRATTEALDQVAAQVLRGDGKVGKAMRSKYPGRGLE >LR133964|2884031:2925584|2913827_2914031_-|VDY60636.1|DBSCAN-SWA MTIIKTHTGTVITKDGPKVKKLHQTERMWVVGKNEFYHKETGRRHFAENTRRRLLLETIEAIGGSHD >LR133964|2884031:2925584|2913374_2913584_-|VDY60634.1|DBSCAN-SWA MSNSLQILCIKDTEGYWTEGEMYPARVVAGGFVQVGDDDDPSGAGWSAAPMEYREDGSIVYQVGGIEGG >LR133964|2884031:2925584|2915873_2916734_-|VDY60640.1|DBSCAN-SWA MADLANGYTKIANEIQKLKPRLRMSGREWQCLEAVIWLTYGWNKKQDRVTNTVIAELTDLGESHISDTIKSLAERKIIFAHKQGVMKIVGINTELSEWILDKPKTGKLFPESGKVLPKTGKPFPETGDTQYKNKNNSKRSSSSRNSKESRNEETLKFLSRHPEAADGIYTPAGKSWGTADDLKAARWIFDKALTVNASLSEPNWVEWANTIRLMRLQDKRTHYEICELFKWANEDGFWQENILCPSKLRKKWDQLTTKRLRSHGPSRNSSGASALDNTDWIDGVLE >LR133964|2884031:2925584|2913580_2913835_-|VDY60635.1|DBSCAN-SWA MTEHVEKYTNKAIEIIADYIQRTNKKNEQLQEAKVRLDKKIALFADDENCNTNRLMSVFLPAMTSHTRDGFFEEIAAALEGADK >LR133964|2884031:2925584|2897955_2898339_-|VDY60612.1|DBSCAN-SWA MGIRDELQTEVAAAFDTDLQDAVKDFTGSYTVRGAWDPVTETGTETQVTYSGRGVLARYKLRRIDGVNILHGDVKLTALVNEVTDKPAVGHIITAPDPVTGELQRYEVITASADSAGAAYSIQLRRA 
>LR133964|2884031:2925584|2889534_2889915_-|VDY60602.1|DBSCAN-SWA MFNPDKYRSVTWLKGGRVYPQLDCFGIVNEIRRDLNLPEWPDFAGVTKDDGGLDREARRMMLTLERCEPCEGAGVACYSGSTVTHVGIVVSIGGLLHVAECNPGTNVTFLPLPRFKRRFVRVEFWQ >LR133964|2884031:2925584|2895281_2895764_-|VDY60608.1|DBSCAN-SWA MLIISSQIDLNGERWFYPYKKPAESKKKFTPEDEALFKLRLLVASSENPQYRSRNALVRRHIDKMDASYQVGTDAFDLASVGDIDSVDDLLIDNAARFLLKGWEGVGQLVDGSEVALDYTPELGIAMLKQYPDLYWRILAEAASIAQGKEQQTQETVKKP |
50 | Klebsiella_phage(22.73%) | head,integrase,tail,terminase | attL 2878138:2878153|attR 2925760:2925775 |
Homologous phage analysis in the prophage region. Bacterial proteins shown in color are those annotated with a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_5 |
3308136 : 3317600
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >LR133964|3308136:3317600|DBSCAN-SWA AATGCGCGCACTTTTACCCTATCTGGCGCTCTATAAACGCCATAAATGGATGCTGCTGCTTGGCGTCGTGCTGGCCATTGTGACCCTGCTGGCCAGCATCGGCCTGCTGACGCTCTCCGGCTGGTTCCTCTCCGCCTCAGCGGTGGTGGGCGTCGCCGGGATCTACAGTTTTAACTATATGCTGCCTGCCGCGGGCGTTCGCGGCGCGGCAATTATCCGTACCGCCGGGCGCTACTTTGAGCGCCTGGTCAGCCACGACGCTACCTTTCGCGTGCTGCAGCACCTGCGCGTTTTCACCTTCAGCAAACTGCTCCCCCTCTCTCCGGCGGGGCTGGCGCGCTTTCGCCAGGGTGAGCTGCTCAACCGGGTGGTAGCCGACGTTGACACCCTGGATCATCTCTACCTGCGGGTCATTTCGCCGCTGGTCGGCGCGCTGGTGGTGATTGTGGTCGTCACCTGCGGATTAAGTCTGCTCGACGTCACCCTCGCGCTGACGCTCGGCGGTATCATGCTGGCGACCCTGCTGGTTATGCCGCCGCTGTTCTATCGCGCAGGTAAGCCAGCCGGCGAAAGTATGACGCAGCTGCGCGGCCAGTATCGCCAACAGCTCACCGCCTGGCTGCAGGGTCAGGCGGAGCTGATGGTATTTAACGCCAGCGACCGCTACCGCGCACAAATGGAAAAGACCGAGCTGAGCTGGCAGGATGCACAGCGGCGCCAGGCCGAGCTGACCGCGCTTTCGCAGGCGGTAATGCTGTTAATCGGTGGCATCGCCGTGGTGGCCATGCTCTGGCTGGCCTCAGACGGCGTCGGCGGCAATAGTCAGCCCGGGGCGCTGATCGCGCTATTCGTATTCTGCGCGCTGGCGGCTTTTGAAGCCCTGGCGCCAGTCACTGGCGCTTTCCAGCATTTAGGCCAGGTGATCGCCTCCGCGCGGCGGATCTCGCAGATTACCGACCAGCAGCCGGAGGTCACCTTTGTCGAAGATGAAGCCAGTCCGCCAGCGCAGGTGGCCCTAACGCTTCAGGAGGTCACCTTCCGCTATCCTCAGCAGCCCTCCCCTGCCCTGGAGAATATTTCCCTGCAGATTGCCGCCGGAGAGCACATCGCCATTCTTGGCCGGACCGGCTGCGGAAAATCGACGCTGTTGCAGTTGCTTACCCGCGCCTGGGACCCGTCACAGGGAGAGATTCTGCTCAACAATCAGCCGCTCTCCGGCCTCAGCGAAGCCACTCTTCGGCAGGCAATGAGCGTAGTGCCGCAGCGCGTGCACCTGTTCAGCGCCACCCTGCGCGACAACCTGCTGCTGGCGGCGCCTGAAGCGGATGACGCTCATCTCAGCGCTACCCTTGAGAAGGTGGGCCTCGAAAAACTGCTGCAAGATGGTGGTCTTAACGGCTGGCTGGGCGAAGGCGGGCGTCAGCTCTCCGGCGGCGAACTGCGCCGACTGGCCATTGCCCGCGCGCTGCTCCACGATGCGCCGCTGATGCTGCTCGATGAACCGACAGAAGGTCTGGATGCGGCCACCGAAAGCCAGATCCTGCATCTACTGGCAGATGTCATGCGCGACAAAACCGTGCTGATGGTGACCCATCGCCTGCGGGGCCTGGCGGGTTTTAATCAGATAATAGTCATGGACAACGGCCAGATAATTGAGCAAGGTAGTCACGCGGAGCTGCTCGCTAAACAGGGCCGGTACTTCCAGTTTAAACAGCGTCTGTAGGCTATTCTATTATCTAATTTTTGCTGATGCGTATTGAGGTTAAAATGCGTCTGGTTCAGCTCTCCCGGCACTCTATCGCCTTCCCCTCTCCGGAAGGCGCCTTGCGAGAACCTAATGGCCTGCTGGCGCTCGGCGGCGATCTGAGCCCCGCGCGGCTGCTGATGGCCTATCAGCGTGGTATTTTTCCCTGGTTTTCCCCTGGC
GACCCGATCCTCTGGTGGTCGCCCGACCCCCGGGCGGTACTTTGGCCAGAGCAATTCCATCTCAGCCGCAGCATGAAGCGCTTTCATCAGCGTTCGCCCTACCGGGTCACGCTCAACCACGCCTTTGGCGAAGTGATTGAGGGCTGCGCCAGCGATCGTGACGAAGGCACCTGGATAACCAGCAGTATCGTTCGCGCCTATCACCAGCTGCATGAGTTGGGTCATGCCCACTCCATTGAAGTCTGGCAGGAGAATACCCTGGTGGGAGGAATGTACGGCGTAGCGCAGGGAGCGCTGTTTTGCGGCGAATCGATGTTTAGCCGGGCGGAAAATGCCTCCAAAACCGCGCTGCTGGTATTTTGCCAGGACTTTGCGCACAGCGGCGGTAAATTAATTGACTGTCAGGTACTGAACAACCATACCGCGTCATTAGGCGCAGTGGATATTCCCCGCCGCGATTATCTGGACTATCTGTCGGTATTACGCGGCTACCGACTGCCCGAGCGATTTTGGGTGCCGCGGGTGCTATTTCCTGGCGGATAATAAAATGTTTTCAGCACATTTTGCGTCAGGATGGTATAATTGCGCGGTAGAGTAGCTTCTGCCTGTTGCCCCGCCGCCGTTTGGGAATAATAAGAGTCAGATAACGCCCATCGTTCTGTCACTCCCCTTTACGTTTGCGCCTGTCACGCGGCATGGCCGTGTGTGTCCCGGGCAATGCGCCAAAACTACTTCGGCGATTCATACCCGTCATATTTCCGTAGCCGCTGCGCTGACTATGCGCCTTGCGGCAAGGTGAATGATTCAGGGTAGATACGTAAAACTTTACTTACATGATGAACTTCGGCATTATCTTGCCGGTTCAAAACTACGGTAGTGATACCCCAGAGGACTAGATGGCCAAAGAAGACAATATTGAAATGCAGGGCACCGTACTTGAAACGTTGCCTAACACTATGTTCCGCGTAGAACTGGAAAACGGTCACGTGGTAACCGCGCATATCTCCGGTAAAATGCGTAAAAACTACATCCGCATTCTGACGGGCGACAAAGTGACTGTTGAGCTGACCCCGTACGACCTGAGCAAAGGCCGCATTGTCTTCCGTAGTCGCTGATTGTTTTCGCCCTTAAGGCGTCAGATGCTGAAGGCCGGGATAATTCCCGGTCTTTTCATTTCTGCCGTCAGCAAAAAAAAACCGGCCCCTGGCCGGTTTTTTTACATTGAAGATATCGCCTTAGTGGGCCGCTTCCGGTTTGTGCTTCGGAGCGCTCTGGAAGTCGTAAGTCAGCGTATTTTTGTCGCTGTCCAGAGAGACAGTGACCTGACCACCGTCGACCAGCGAGCCAAACAGCAGTTCGTTCGCCAGCGGTTTTTTCAGGTTGTCCTGAATGACGCGTCCCATTGGACGTGCCCCCATCGCCCGATCGTAGCCTTTCGTCGCCAGCCAGTCGCGCGCTTCCTGACTCACTTCCAGGGAGACGCCTTTCTGGTCCAGCTGGACCTGCAGCTCGACGATAAACTTGTCGACAACCTGATGGATCACCGTGGTAGACAAATGATCGAACCAGATGATGTTGTCGAGGCGGTTACGGAACTCCGGCGTAAAGATCTTCTTGATCTCATCCATCGCATCCGGACTGTTGTCCTGCTGAATAAGGCCAATGGATTTACGCTCGGTTTCCCGCACCCCGGCGTTGGTGGTCATCACCAGCACCACGTTGCGGAAGTCCGCTTTACGGCCGTTATTATCGGTCAGCGTCCCGTTATCCATCACCTGCAGCAGCAGGTTGAACACGTCCGGATGCGCTTTTTCGATCTCATCAAGCAGCAATACCGCGTGCGGATGTTTAATCACCGCATCGGTCAGCAGCCCGCCCTGATCGAAGCCGACGTAGCCCGGAGGCGCCCCGATCAAACGGCTGACGGTATGACGTTCCATATATTCGGACATGTCAAAGCGCAGCAGCTCAATACCCAACGCCTTCGCCAGCTGGACGGT
CACTTCGGTTTTCCCGACCCCGGTCGGCCCGGCGAACAGGAACGATCCCACCGGCTTGTGCTCATGGCCCAGCCCGGCGCGCGCCATTTTAATAGCCTCGGTCAGCGCCTCGATGGCTTTATCCTGTCCAAAGACCAGCATTTTCAGACGGTCGCCCAGGTTTTTCAGCGTGTCGCGATCGCTGCGGGAGACGCTCTTCTCAGGGATCCGCGCAATGCGCGCCACCACTGATTCGATATCCGCCACGTTAACCGTTTTCTTACGCTTGCTGACCGGCATTAAGCGCGCCCGCGCGCCCGCTTCGTCAATGACGTCAATCGCCTTATCCGGCAGATGACGATCGTTAATGTATTTCACCGCCAGTTCTACCGCCGCACGCACCGCTTTGGCGGTATAACGCACATCATGGTGCGCTTCGTACTTCGGCTTCAGGCCGTTGATAATCTGGACCGTCTCTTCCACCGACGGTTCGGTAATATCGATCTTCTGGAAACGGCGGGCCAGCGCACGATCTTTTTCAAAGATGTTGCTGAACTCCTGGTAAGTGGTGGAGCCGATCACCCGGATCTTGCCGCTGGAAAGCAGCGGTTTGATCAGGTTAGCCGCATCGACCTGGCCGCCAGACGCCGCCCCCGCACCGATAATGGTGTGGATTTCATCAATAAACAGGATGCTGTTGGTATCCTGCTCCAGCTGTTTGAGCAACGCTTTGAAGCGTTTTTCAAAATCGCCACGATATTTGGTCCCTGCCAGCAAAGAGCCAATGTCGAGCGAGTAGATAGTGCAGTCGGCCATCACTTCCGGCACATCGCCCTGCACAATACGCCAGGCAAGGCCTTCGGCGATCGCGGTTTTACCGACGCCGGATTCCCCCACCAGCAGCGGGTTATTTTTCCGCCGACGGCACAGGACCTGAATCGCGCGCTCCAGCTCTTTTTCACGGCCGATCAGGGGATCAATGCCGCCGACGCGAGCAAGCTGATTGAGGTTGGTGGTGAAGTTTTCCATACGTTCCTCCCCGCCGGCTTGCTCTTCATTGCCAGGCTGGCTTCCGCTGTTATCGGAAGACTGGCTCGGCTCGTCTTTGCGTGTACCATGAGAAATGAAGTTCACCACATCAAGACGGCTCACTTCATGTTTACGCAGCAGGTAGGCGGCCTGCGACTCCTGCTCGCTGAAAATCGCCACCAGCACGTTGGCGCCGGTCACTTCGCTGCGGCCAGAAGATTGTACGTGGAAGACGGCGCGCTGCAGCACACGCTGGAAGCTTAGCGTCGGCTGGGTGTCGCGCTCTTCTTCCGTGGCTGGCAGAACGGGTGTGGTTTGTTCGATGAAGGCTTCAAGTTCCTGACGCAATGCCACCAGGTCCACTGAACACGCTTCCAGGGCCTCACGGGCCGAGGGGTTGCTTAGCAGAGCCAGCAGCAAGTGCTCGACGGTCATAAACTCATGACGGTGCTCGCGCGCTCTGGCGAAAGCCATATTTAAACTGAGTTCCAGTTCTTGATTGAGCATAGGCACCTCCCCCAATTTTCATGCCTGTATTCAGGCTTTTTCCAGCGTACACAGCAATGGATGCTCGTTCTCCCTCGCGTACTGATTCACCATAGCGACTTTGGTCTCCGCCACTTCCGCGGTAAACACGCCACAAATCGCCTTACCTTCATAGTGAACCGTGAGCATCAGTTGCGTTGCACGTTCAACATCATAAGAAAAGAATTTTTGTAACACGTCAATAACAAACTCCATCGGAGTGTAGTCATCATTGACCAATATCACTTTATACATAGATGGCGGTTTAATCGCATCACGAACTTCGTCGTCAACCAGGTGCTCGAAGTCCAGCCAATCTCTCTTACTCATCGTCAGTGTTCATCATCGGTTGCTGTTGCCAACAGGCGGAGGCCTGTCGATGACCAGAGGTTATGCTCATCACAAATCTACAATAGATCATAGATAACTATCATCTATCGCTTCTATCCGCGACGGCTGTCACATTC
CCCGGCAATAGCGTTAACTGCTTCAAATTTTGACGCATTTTTCGCCGTTCCCCCTCCTCAATCGCTTGACGCGTTTTCGTATTTCTCTAAATTGTAGTGGCGAGAGTTGGCGAGCAATTGAACAACTCGTCACTCCACCACCGGTTCATTCCATCTTACTAATAAAGATTTACGAAGGATGTCGAAGTATGGAAATGGGTACTGTTAAGTGGTTCAACAATGCCAAAGGGTTCGGTTTTATCTGCCCTGAGGGCGGCGGCGAAGACATTTTCGCCCATTACTCCACCATCCAGATGGATGGTTACAGAACGCTAAAAGCCGGACAAGCCGTTCGGTTTGATGTTCACCAGGGGCCAAAAGGCAATCACGCCAGCGTGATCGTCCCTGTGGAAGCGGAAACGGCTGCATAACTCTTTTGCTTCATTGTGTACATCCTGCTAATAAAATGCCAGTCCTTCTGACTGGCATTTTTATTTCTCGCCCCGCCTATTCTCGCGCCAGGGCATCCACCGGATCGAGCCGCGCCGCGTTGCGCGCAGGCAGCCAACCAAACAGCACCCCGGTCAGCGTCGAACAGAGAAACGCCGTCAGCAGCGCCAGCGGAGAAAAACCAATCTCCCAGCCGGGGAGAAACAGCTGTAAAATAAAGGCGATCATCAGCGACAATGCTACGCCCAGCGCGCCACCGACCAGGCAAACCAGCACCGCTTCAATGAGAAACTGCTGCAGCACATCGCTGGCCCGGGCGCCAACCGCCATGCGGATGCCGATCTCCCGGGTTCGTTCGGTCACCGACACCAGCATAATATTCATTACGCCAATCCCGCCGACCACCAAGGCAATCACCGCCACCAATGTCAGGAACAGCTGTAAGGTATGTGTGGTCCTTTCGGCGGTTTTCAAGACGCTGTCCATATTCCAGGTGAATACATCTTTTTTACCATGCCGCAGCTCCAGCAGACGCAGCAGCTGTTTCTCTGCGGTTTCACTGTCGTAGCCTTCATGAACGCGCACAGTGATGGAGTTCAGCCACGACTGGCCCATTACTCTTCCCGCCATGGTGGTGTAGGGCAACCAGACGCGGAGAATTTTGCTGCTGCCGAACATCGACTGCTTTTCATCCGCAACGCCAATCACCGTGGCCGGCATATTGCCCACCAGAATCACCTCTCCCACCACTTTCGCTTTATTGGGAAACAGCTGCCGACGGGTATTGCTGTCGAGCACTACCACCTGTGCCCGACTGTTGAGCTGCAGTTCATTGAAGGTATTACCTTCGCTAAAGGTCATGCCGTAGACATTAAAATACTGCGGACCGACCCCCTCCGCGCTGGCGGCAACGTCAATATTGTTGGCGCGCAGGCGCAGGCTTTTTGAAACCGCCGGCGTGGCGGAGCGCACCCACGGCTGCTTCTGGATCGCCAGCAAATCGTCATACTTCAGCGCCTGCTGATAGCGCGGGTCGTCGTCGCCAAAATCCTTACCAGGATACACATCAATGGTATTGGTGCCGATGGCGCGAATATCCGCCAGCACCATCTGCTTGGCGGCATCCCCCACCACCACGATCGACACCACCGAGGCAATACCGATAATAATCCCGAGCATGGTCAGCAGGGTACGCATTTTATTCGCCGCCATCGCTCGCCAGGCCATCACCAGCGCTTCGCGAAAGCCGCTGGTAAACTGACGCCACGCCGAAGGCTCCGCCTGCGGACGGGCGCGCAGTCCGCCGCCCTGGCGGCTGGCGGGCGGGTTGCGGACGATCTCGCCATCGCGGATCTCGACGATCCGCTCCGCCTGAGCGGCGACCTGCGGATCGTGGGTCACAATGATGACCGTGTGCCCCTGCGCTTTTAGCTGGTGCAGGATCGCCATTACCTCTTCGCCGGAATGGCTGTCCAGCGCGCCGGTCGGTTCATCAGCGAGGATCACCTCCCCGCCATTCATCAGGGCGCGGGCGATACTGACCCGCTGCTGCTGACCGCCGGACAACTGTG
ACGGTTGATAGTCGGCGCGCTCGCCCAGCCCCAGCCGCACCAGCAGCTCATGAGCGCGCGCCAGGCGCGCGCGCCGTTCACTGCCGGCATACACCGCCGGCACTTCGACATTCTGCGCGGCGGTCAGATGGGACAGCAAATGGTAGCGTTGAAAGATAAAGCCAAAGTGTTCCCGCCGCAGACGGGCCAGCGCATCGCCATCCAGCTGCGCGATATCGGTTCCCGCCACCCGATAGGTGCCGCTGGTGGGTTTGTCGAGGCAGCCAAGGATATTCATCAGCGTCGATTTTCCCGAGCCGGAGGCCCCGACGATCGCCACCATCTCACCCGCATGGATGCTGAGGGTGATCCCCTTCAGGACCTCAACGCTGCCGTCGCCGGAAGGATAGCTGCGACGTATATCTCGCAGCTCAAGCAGCGCCGTCATTTTGCCGCCCCGCTCGCGCTCTCGCCGACGATCACTTCGTCGCCCTCTTCCAGACCTTGCACCACTGCGACGTCGGTATCATTGCGGGCGCCAATGATAACCTCACGCTCTTTGACCTCACCGGTGCGCAACAGTCGAACGTGGTAGCGGTTATCGCCTACCGCGTCGCCCAGGGCGCTCAGCGGGATAGTAATCACGTTCTTCACCTCAGCCAGCTGAATGTGCACCTGCGCGGTCATGTCGAGGCGCAGGATCCCCTGCGGGTTCGGTACCTCGAAGCGGGCATAATAGAAGATGGCGTCATTCACTTTTTCCGGCGTCGGCAGAATGTCTTTCAGTTTACCTTCATAACGGGTAAGCGGGTCGCCCAGCACGGTAAACCACGCTTTTTGCCCCGGTCTGAGGTGGATGACATCCGCTTCAGAGACCTGCGCTTTAACCAGCATGGTGCTGAGATCCGCCAGCGTCAGAATGTTAGGCGCCTGCTGGGCGGCGATCACTGTCTGTCCCTGCAGGGTGGTAATTTGCGTCACTTCGCCGGCCATCGGCGCCAGGATCCGCGTGTAATCGAGGTTGGTCTTCGCCGTATCGAGCGTCGCCTGGTTACGCTTTATTTGCGCTTCAATAGTGCCGATCTGCGCCTGTTTCACCGCCAAATCGGTGGCCGCGGTATCAAGGTCCTGCCGCGAGACCAGCTGGCGCTGCGCCAGCTGCTGCTGACGGGCCAGAGTCACTTGCGCCAGTTTGCTCTCCGCGCGCGCCTGATTGAGCTGGGCGCGCAGCTCCATGAGCGTCGCTTCCACCTCTTTAATCTGGTTTTGCGCCTGCTCCGGATCGATAACCCCCAGCAACTGATCCTTTTTAACCTTATCGCCAATATTGACGTGCAGCGTTTTCAGCTGTCCGCTGACCTGGGCGCCGACGTCAACTTTTCGCAGGGCATCCAGTTTGCCGGTGGCCAGCACGCTCTGCTGAAGATCCCCTTTGCGAACTACCAGCGTCTGGTAGTTCGGCAGCGGCGCATTGAGGATCCGCCATCCCCAGATAGCCAGCCCCAGCACAACGATCGCCAACAGCCACCAGACGGTCCTGCGTTTTCCCTTCACTTTCAT
Protein sequences of DBSCAN-SWA_5 >LR133964|3308136:3317600|3309902_3310604_+|VDY61006.1|DBSCAN-SWA MRLVQLSRHSIAFPSPEGALREPNGLLALGGDLSPARLLMAYQRGIFPWFSPGDPILWWSPDPRAVLWPEQFHLSRSMKRFHQRSPYRVTLNHAFGEVIEGCASDRDEGTWITSSIVRAYHQLHELGHAHSIEVWQENTLVGGMYGVAQGALFCGESMFSRAENASKTALLVFCQDFAHSGGKLIDCQVLNNHTASLGAVDIPRRDYLDYLSVLRGYRLPERFWVPRVLFPGG >LR133964|3308136:3317600|3313606_3313924_-|VDY61009.1|protease|DBSCAN-SWA MSKRDWLDFEHLVDDEVRDAIKPPSMYKVILVNDDYTPMEFVIDVLQKFFSYDVERATQLMLTVHYEGKAICGVFTAEVAETKVAMVNQYARENEHPLLCTLEKA >LR133964|3308136:3317600|3314547_3316488_-|VDY61011.1|DBSCAN-SWA MTALLELRDIRRSYPSGDGSVEVLKGITLSIHAGEMVAIVGASGSGKSTLMNILGCLDKPTSGTYRVAGTDIAQLDGDALARLRREHFGFIFQRYHLLSHLTAAQNVEVPAVYAGSERRARLARAHELLVRLGLGERADYQPSQLSGGQQQRVSIARALMNGGEVILADEPTGALDSHSGEEVMAILHQLKAQGHTVIIVTHDPQVAAQAERIVEIRDGEIVRNPPASRQGGGLRARPQAEPSAWRQFTSGFREALVMAWRAMAANKMRTLLTMLGIIIGIASVVSIVVVGDAAKQMVLADIRAIGTNTIDVYPGKDFGDDDPRYQQALKYDDLLAIQKQPWVRSATPAVSKSLRLRANNIDVAASAEGVGPQYFNVYGMTFSEGNTFNELQLNSRAQVVVLDSNTRRQLFPNKAKVVGEVILVGNMPATVIGVADEKQSMFGSSKILRVWLPYTTMAGRVMGQSWLNSITVRVHEGYDSETAEKQLLRLLELRHGKKDVFTWNMDSVLKTAERTTHTLQLFLTLVAVIALVVGGIGVMNIMLVSVTERTREIGIRMAVGARASDVLQQFLIEAVLVCLVGGALGVALSLMIAFILQLFLPGWEIGFSPLALLTAFLCSTLTGVLFGWLPARNAARLDPVDALARE >LR133964|3308136:3317600|3308136_3309858_+|VDY61005.1|DBSCAN-SWA MRALLPYLALYKRHKWMLLLGVVLAIVTLLASIGLLTLSGWFLSASAVVGVAGIYSFNYMLPAAGVRGAAIIRTAGRYFERLVSHDATFRVLQHLRVFTFSKLLPLSPAGLARFRQGELLNRVVADVDTLDHLYLRVISPLVGALVVIVVVTCGLSLLDVTLALTLGGIMLATLLVMPPLFYRAGKPAGESMTQLRGQYRQQLTAWLQGQAELMVFNASDRYRAQMEKTELSWQDAQRRQAELTALSQAVMLLIGGIAVVAMLWLASDGVGGNSQPGALIALFVFCALAAFEALAPVTGAFQHLGQVIASARRISQITDQQPEVTFVEDEASPPAQVALTLQEVTFRYPQQPSPALENISLQIAAGEHIAILGRTGCGKSTLLQLLTRAWDPSQGEILLNNQPLSGLSEATLRQAMSVVPQRVHLFSATLRDNLLLAAPEADDAHLSATLEKVGLEKLLQDGGLNGWLGEGGRQLSGGELRRLAIARALLHDAPLMLLDEPTEGLDAATESQILHLLADVMRDKTVLMVTHRLRGLAGFNQIIVMDNGQIIEQGSHAELLAKQGRYFQFKQRL >LR133964|3308136:3317600|3311296_3313576_-|VDY61008.1|protease|DBSCAN-SWA 
MLNQELELSLNMAFARAREHRHEFMTVEHLLLALLSNPSAREALEACSVDLVALRQELEAFIEQTTPVLPATEEERDTQPTLSFQRVLQRAVFHVQSSGRSEVTGANVLVAIFSEQESQAAYLLRKHEVSRLDVVNFISHGTRKDEPSQSSDNSGSQPGNEEQAGGEERMENFTTNLNQLARVGGIDPLIGREKELERAIQVLCRRRKNNPLLVGESGVGKTAIAEGLAWRIVQGDVPEVMADCTIYSLDIGSLLAGTKYRGDFEKRFKALLKQLEQDTNSILFIDEIHTIIGAGAASGGQVDAANLIKPLLSSGKIRVIGSTTYQEFSNIFEKDRALARRFQKIDITEPSVEETVQIINGLKPKYEAHHDVRYTAKAVRAAVELAVKYINDRHLPDKAIDVIDEAGARARLMPVSKRKKTVNVADIESVVARIARIPEKSVSRSDRDTLKNLGDRLKMLVFGQDKAIEALTEAIKMARAGLGHEHKPVGSFLFAGPTGVGKTEVTVQLAKALGIELLRFDMSEYMERHTVSRLIGAPPGYVGFDQGGLLTDAVIKHPHAVLLLDEIEKAHPDVFNLLLQVMDNGTLTDNNGRKADFRNVVLVMTTNAGVRETERKSIGLIQQDNSPDAMDEIKKIFTPEFRNRLDNIIWFDHLSTTVIHQVVDKFIVELQVQLDQKGVSLEVSQEARDWLATKGYDRAMGARPMGRVIQDNLKKPLANELLFGSLVDGGQVTVSLDSDKNTLTYDFQSAPKHKPEAAH >LR133964|3308136:3317600|3310957_3311176_+|VDY61007.1|DBSCAN-SWA MAKEDNIEMQGTVLETLPNTMFRVELENGHVVTAHISGKMRKNYIRILTGDKVTVELTPYDLSKGRIVFRSR >LR133964|3308136:3317600|3314249_3314471_+|VDY61010.1|DBSCAN-SWA MEMGTVKWFNNAKGFGFICPEGGGEDIFAHYSTIQMDGYRTLKAGQAVRFDVHQGPKGNHASVIVPVEAETAA >LR133964|3308136:3317600|3316484_3317600_-|VDY61012.1|DBSCAN-SWA MKVKGKRRTVWWLLAIVVLGLAIWGWRILNAPLPNYQTLVVRKGDLQQSVLATGKLDALRKVDVGAQVSGQLKTLHVNIGDKVKKDQLLGVIDPEQAQNQIKEVEATLMELRAQLNQARAESKLAQVTLARQQQLAQRQLVSRQDLDTAATDLAVKQAQIGTIEAQIKRNQATLDTAKTNLDYTRILAPMAGEVTQITTLQGQTVIAAQQAPNILTLADLSTMLVKAQVSEADVIHLRPGQKAWFTVLGDPLTRYEGKLKDILPTPEKVNDAIFYYARFEVPNPQGILRLDMTAQVHIQLAEVKNVITIPLSALGDAVGDNRYHVRLLRTGEVKEREVIIGARNDTDVAVVQGLEEGDEVIVGESASGAAK |
8 | Brazilian_cedratvirus(16.67%) | protease | NA |
Homologous phage analysis in the prophage region. Bacterial proteins shown in color are those annotated with a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
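The keyword annotation in the protein records above can be read back out of the FASTA-style headers. The following is a minimal sketch, not DBSCAN-SWA's own code. It assumes the header layout inferred from the records shown here (`contig|region_start:region_end|start_end_strand|protein_id|[keyword|]DBSCAN-SWA`), and the names `parse_header`, `parse_records` and `keyword_summary` are hypothetical helpers introduced only for illustration.

```python
import re

# Keyword list as given in the prophage table legend.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def parse_header(header):
    """Split one header into its fields (layout is an assumption, see above)."""
    fields = header.lstrip(">").strip().split("|")
    contig, region, location, protein_id = fields[:4]
    # An optional keyword field sits between the protein ID and the tool name.
    keyword = fields[4] if len(fields) == 6 else None
    region_start, region_end = (int(x) for x in region.split(":"))
    start, end, strand = location.split("_")
    return {
        "contig": contig,
        "region": (region_start, region_end),
        "start": int(start),
        "end": int(end),
        "strand": strand,
        "protein_id": protein_id,
        "keyword": keyword,
    }


def parse_records(text):
    """Yield (fields, sequence) pairs from whitespace-separated FASTA records."""
    for header, sequence in re.findall(r">(\S+)\s+([A-Z]+)", text):
        yield parse_header(header), sequence


def keyword_summary(text):
    """Return the sorted, comma-joined keywords found among the records."""
    found = {
        fields["keyword"]
        for fields, _ in parse_records(text)
        if fields["keyword"] in PHAGE_KEYWORDS
    }
    return ",".join(sorted(found))


example = (
    ">LR133964|3308136:3317600|3313606_3313924_-|VDY61009.1|protease|DBSCAN-SWA MSKRDW "
    ">LR133964|3308136:3317600|3310957_3311176_+|VDY61007.1|DBSCAN-SWA MAKEDN"
)
records = list(parse_records(example))
flagged = [f["protein_id"] for f, _ in records if f["keyword"] in PHAGE_KEYWORDS]
summary = keyword_summary(example)
```

The sequences in `example` are truncated to keep the sketch short. On the full DBSCAN-SWA_5 protein records, the same summary should correspond to the keyword column of that region's table row.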
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|