Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
LR134233 | Salmonella enterica subsp. enterica strain NCTC9684 genome assembly, chromosome: 1 | 4 crisprs | cas3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,csa3,DEDDh,DinG,WYL | 0 | 13 | 8 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LR134233_1 | 935872-936388 | TypeI-E |
I-E
Consensus repeat of LR134233_1
|
8 spacers
spacers of LR134233_1
>1.1|935901|32|LR134233|PILER-CR,CRISPRCasFinder,CRT CCAGAACGATGACGTTGAAGTACCGAAAAATA >1.2|935962|32|LR134233|PILER-CR,CRISPRCasFinder,CRT TTAATTTTCCCCTGTGGTCGAAACCGTCACAC >1.3|936023|32|LR134233|PILER-CR,CRISPRCasFinder,CRT CGCCTGTATAGGTGGCGCGGAATGATGGACGA >1.4|936084|32|LR134233|PILER-CR,CRISPRCasFinder,CRT ACGATTATTTGACGCCAGAAGCAGCAGATGCA >1.5|936145|32|LR134233|PILER-CR,CRISPRCasFinder,CRT ACAATGTTGCGTCTAATTCTCATTAATTAAAA >1.6|936206|32|LR134233|PILER-CR,CRISPRCasFinder,CRT CGATTCAGATTCGCCATTGTTAATTTTCCCCT >1.7|936267|32|LR134233|PILER-CR,CRISPRCasFinder,CRT CCGTTTCGGCGCAGCCGCTTATTCTTCACTGT >1.8|936328|32|LR134233|CRISPRCasFinder,CRT GTCACGAGGTCTGACGCGGATGTGATGAGTTA |
cas3,cas8e,cse2gr11 |
CRISPR arrays and Neighbor proteins around LR134233_1
The CRISPR arrays of LR134233_1 >merge|LR134233|1|935872-936388|PILER-CR,CRISPRCasFinder,CRT GTGTTCCCCGCGCCAGCGGGGATAAACCGCCAGAACGATGACGTTGAAGTACCGAAAAATAGTGTTCCCCGCGCCAGCGGGGATAAACCGTTAATTTTCCCCTGTGGTCGAAACCGTCACACGTGTTCCCCGCGCCAGCGGGGATAAACCGCGCCTGTATAGGTGGCGCGGAATGATGGACGAGTGTTCCCCGCGCCAGCGGGGATAAACCGACGATTATTTGACGCCAGAAGCAGCAGATGCAGTGTTCCCCGCGCCAGCGGGGATAAACCGACAATGTTGCGTCTAATTCTCATTAATTAAAAGTGTTCCCCGCGCCAGCGGGGATAAACCGCGATTCAGATTCGCCATTGTTAATTTTCCCCTGTGTTTCCCGCGCCAGCGGGGATAAACCGCCGTTTCGGCGCAGCCGCTTATTCTTCACTGTGTGTTCCCCGCGCCAGCGGGGATAAACCGGTCACGAGGTCTGACGCGGATGTGATGAGTTAGTGTTCCCCGCGCCAGCGGGGATAGCCGT >LR134233|1|1|935872-936327|PILER-CR GTGTTCCCCGCGCCAGCGGGGATAAACCG CCAGAACGATGACGTTGAAGTACCGAAAAATA GTGTTCCCCGCGCCAGCGGGGATAAACCG TTAATTTTCCCCTGTGGTCGAAACCGTCACAC GTGTTCCCCGCGCCAGCGGGGATAAACCG CGCCTGTATAGGTGGCGCGGAATGATGGACGA GTGTTCCCCGCGCCAGCGGGGATAAACCG ACGATTATTTGACGCCAGAAGCAGCAGATGCA GTGTTCCCCGCGCCAGCGGGGATAAACCG ACAATGTTGCGTCTAATTCTCATTAATTAAAA GTGTTCCCCGCGCCAGCGGGGATAAACCG CGATTCAGATTCGCCATTGTTAATTTTCCCCT GTGTTTCCCGCGCCAGCGGGGATAAACCG CCGTTTCGGCGCAGCCGCTTATTCTTCACTGT GTGTTCCCCGCGCCAGCGGGGATAAACCG >LR134233|1|1|935872-936388|CRISPRCasFinder GTGTTCCCCGCGCCAGCGGGGATAAACCG CCAGAACGATGACGTTGAAGTACCGAAAAATA GTGTTCCCCGCGCCAGCGGGGATAAACCG TTAATTTTCCCCTGTGGTCGAAACCGTCACAC GTGTTCCCCGCGCCAGCGGGGATAAACCG CGCCTGTATAGGTGGCGCGGAATGATGGACGA GTGTTCCCCGCGCCAGCGGGGATAAACCG ACGATTATTTGACGCCAGAAGCAGCAGATGCA GTGTTCCCCGCGCCAGCGGGGATAAACCG ACAATGTTGCGTCTAATTCTCATTAATTAAAA GTGTTCCCCGCGCCAGCGGGGATAAACCG CGATTCAGATTCGCCATTGTTAATTTTCCCCT GTGTTTCCCGCGCCAGCGGGGATAAACCG CCGTTTCGGCGCAGCCGCTTATTCTTCACTGT GTGTTCCCCGCGCCAGCGGGGATAAACCG GTCACGAGGTCTGACGCGGATGTGATGAGTTA GTGTTCCCCGCGCCAGCGGGGATAGCCGT >LR134233|1|1|935872-936388|CRT GTGTTCCCCGCGCCAGCGGGGATAAACCG CCAGAACGATGACGTTGAAGTACCGAAAAATA GTGTTCCCCGCGCCAGCGGGGATAAACCG TTAATTTTCCCCTGTGGTCGAAACCGTCACAC GTGTTCCCCGCGCCAGCGGGGATAAACCG CGCCTGTATAGGTGGCGCGGAATGATGGACGA GTGTTCCCCGCGCCAGCGGGGATAAACCG ACGATTATTTGACGCCAGAAGCAGCAGATGCA GTGTTCCCCGCGCCAGCGGGGATAAACCG ACAATGTTGCGTCTAATTCTCATTAATTAAAA GTGTTCCCCGCGCCAGCGGGGATAAACCG CGATTCAGATTCGCCATTGTTAATTTTCCCCT GTGTTTCCCGCGCCAGCGGGGATAAACCG CCGTTTCGGCGCAGCCGCTTATTCTTCACTGT GTGTTCCCCGCGCCAGCGGGGATAAACCG GTCACGAGGTCTGACGCGGATGTGATGAGTTA GTGTTCCCCGCGCCAGCGGGGATAGCCGT
>LR134233.1|VEC90964.1|935605_935752_+|Uncharacterised-protein MPSVTYPLPCIVYFLYVNFDECIKARINFSIAMHGSRYFGKFKEKIIL >LR134233.1|VEC90963.1|934903_935575_+|Queuosine-Biosynthesis-QueE-Radical-SAM MQYPINEMFQTLQGEGYFTGVPAIFIRLQGCPVGCAWCDTKHTWDKLSDREVSLFSILAKTKESDKWGAASSEDLLAVINRQGYTARHVVITGGEPCIHDLMPLTDLLEKSGFSCQIETSGTHEVRCTPNTWVTVSPKVNMRGGYDVLSQALERANEIKHPVGRVRDIEALDELLATLSDDKPRVIALQPISQKEDATRLCIETCIARNWRLSMQTHKYLNIA >LR134233.1|VEC90962.1|933469_934768_+|enolase MSKIVKVIGREIIDSRGNPTVEAEVHLEGGFVGMAAAPSGASTGSREALELRDGDKSRFLGKGVTKAVGAVNGPIAQAILGKDAKDQAGIDKIMIDLDGTENKSNFGANAILAVSLANAKAAAAAKGMPLYEHIAELNGTPGKYSMPVPMMNIINGGEHADNNVDIQEFMIQPVGAKTVKEAIRMGSEVFHHLAKVLKGKGMNTAVGDEGGYAPNLGSNAEALAVIAEAVKAAGYELGKDITLAMDCAASEFYKDGKYVLAGEGNKAFTSEEFTHFLEELTKQYPIVSIEDGLDESDWDGFAYQTKVLGDKIQLVGDDLFVTNTKILKEGIEKGIANSILIKFNQIGSLTETLAAIKMAKDAGYTAVISHRSGETEDATIADLAVGTAAGQIKTGSMSRSDRVAKYNQLIRIEEALGEKAPYNGRKEIKGQA >LR134233.1|VEC90961.1|931749_933387_+|CTP-synthetase MTTNYIFVTGGVVSSLGKGIAAASLAAILEARGLNVTIMKLDPYINVDPGTMSPIQHGEVFVTEDGAETDLDLGHYERFIRTKMSRRNNFTTGRIYSDVLRKERRGDYLGATVQVIPHITNAIKERVLEGGEGHDVVLVEIGGTVGDIESLPFLEAIRQLAVDIGREHALFMHLTLVPYLAAAGEVKTKPTQHSVKELLSIGIQPDILICRSDRAVPANERAKIALFCNVPEKAVISMKDVDSIYKIPGLLKSQGLDDYICKRFSLNCPEANLSEWEQVIYEEANPAGEVTIGMVGKYIELPDAYKSVIEALKHGGLKNRVTVNIKLIDSQDVETRGVEILKDLDAILIPGGFGYRGVEGKIATARYARENNIPYLGICLGMQVALIEFARNVAGMDNANSTEFVPDCKYPVVALITEWRDEDGNVEVRSEKSDLGGTMRLGAQQCQLSDDSLVRQLYGAPTIVERHRHRYEVNNMLLKQIEAAGLRVAGRSGDDQLVEIIEVPNHPWFVACQFHPEFTSTPRDGHPLFAGFVKAANEHQKRQAK >LR134233.1|VEC90960.1|930721_931522_+|Nucleoside-triphosphate-pyrophospho-hydrolase-MazG MTTNHQIDRLLTLMQRLRDPENGCPWDKEQTFASIAPYTLEETYEVLDAIAREDFDDLRGELGDLLFQVVFYAQMAQEEGRFDFNDICAAISDKLERRHPHVFGELSADNSEEALVRWEQIKTEERAQKAQHSALDDIPRSLPALMRAQKIQKRCSNVGFDWTTLGPVVDKVYEEIDEVMFEARQAVVDQAKLEEEMGDLLFATVNMARHLGTKAELALQKANDKFERRFREVERIVAARGLEMTGVDLETMEEVWQEVKRQEIDL >LR134233.1|VEC90959.1|929528_930116_-|fimbrial-subunit MKSSHFCKLAVTASLVMGIVSGAQAAGSNTAKVTFLGNIVDSPCSVTLDTEDQTVNMGSSIGNGTLSNGKTTINNARTFHIDLEGCTWATEKNMNVVFTTGSGTTAATGATDNLALMKTDGTGAISNVSLAIGDAGKNNIKLGDTYTQAIADLDGDTILDEKQSLNFTAWLVGAATGTVGTGEFSSAANVTISYL >LR134233.1|VEC90958.1|926749_929395_-|outer-membrane-usher-protein MLLSVSPYSASGKDIEFNTDFLDVKNRDNVNIAQFSRKGFILPGVYLLQIKINGQTLPQEFPVNWVIPEHDPQGSEVCAEPELVTQLGIKPELAEKLVWIMHGERQCLAPDSLKGMDFQADLGHSTLLVNLPQAYMEYSDVDWDPPARWDNGIPGIILDYNINNQLRHDQESGSEEQSISGNGTLGANLGAWRLRADWQASYDHRDDDENTSTLHDQSWSRYYAYRALPTLGAKLTLGESYLQSDVFDSFNYIGASVISDDQMLPPKLRGYAPEIVGIARSNAKVKVSWQGRVLYETQVPAGPFRIQDLNQSVSGTLHVTVEEQNGQTQEFDVNTASVPFLTRPGMVRYKLALGRPQDWDHHPITGTFASAEASWGVTNGWSLYGGAIGESNYQAVALGSGKDLGVVGAVAVDITHSIAHMPQDDGFDGETLQGNSYRISYSRDFDEIDSRLTFAGYRFSEKNFMSMSDYLDAKTYHHLNAGHEKERYTVTYNQNFREQGMSAYFSYSRSTFWDSPDQSNYNLSLSWYFDLGSIKNLSASLNGYRSEYNGDKDDGVYISLSVPWGNDSISYNGTFNGSQHRNQLGYSGHSQNGDNWQLHVGQDEQGAQADGYYSHQGALTDIDLSADYEEGSYRSLGMSLRGGMTLTTQGGALHRGSLAGSTRLLVDTDGIADVPVSGNGSPTSTNIFGKAVIADVGSYSRSLARIDLNKLPEKAEATKSVVQITLTEGAIGYRHFDVVSGEKMMAVFRLADGDFPPFGAEVKNERQQQLGLVADDGNAWLAGVKAGETLKVFWDGAAQCEASLPPTFTPELLANTLLLPCKMLEGQPPTAPQKSSPLPAQPLIQEHTQTDGQPAAPVATTTQTPPIPLADNHAVNRKDME >LR134233.1|VEC90957.1|925963_926737_-|chaperone-protein-PapD MNKTNHFKRQALIASVLLAAPLVSHSAIVPDRTRVIFNGNENSITVTLKNGNATLPYLAQAWLEDDKFAKDTRYFTALPPLQRIEPKSDGQVKVQPLPAAASLPQDRESLFYFNVREIPPKSDKPNTLQLALQTRIKFFYRPVAVARQVDKTHPWQTKLTLTYQGDGVIFDNPTPFYLVISNAGSKENETASGFKNLLIAPREKVTSPIKGASLGSSPVVGYVDDYGGHRLLVFTCSGNTCKVNEEKTRDAEKKANK >LR134233.1|VEC90956.1|925572_925938_-|Protein-sfmH-Flags:-Precursor MLTRWKILVLLCGGFVTGTEAAGTKTVQLELHLVVTQPPPCTIGGASVEFGDVLTTKVGDASQTKPVGYSLNCDGRASDYLKLQIQGTTTTISGEQVLQTSVQGLGIRIQQAAISNWFRLA >LR134233.1|VEC90955.1|924953_925439_-|fimbrial-subunit MERFIVKRVLILTLLITQFACADNLTFHGKLINPPACTINNGETLEVSFGSVIIDNIDGVNYLTEIPWTLTCDSSFRDDALTFTLSYLGTATPYSANALTTNVPELGIELQQNGTVFPPGTSLTIDESSLPTLKAVPVKQPGKEPAEGDFEAFATLQVDYQ >LR134233.1|VEC90965.1|936485_937283_+|beta-lactamase MALRIRVLLENHKGTGADKSLKARPGLSLLVEDESTSVLFDTGPDGSFMQNALAMGIDLSDVSAVVLSHGHYDHCGGVPWLPDNCRVICHPDIARERYAAMTFLGITRKIKKLSCEVDYSRYRMMYTRDPLPIGENFIWSGEIPVVAPEAYGIFGGHDAEPDSILDEGVLIYQSTKGLVIITGCGHRGIANIVRHCQNITGIKRIYALVGGFHLRCASPFTLWRVRRFLQEQKPEKLCGCHCTGAWGRLWLPEITAPATGDVLRF >LR134233.1|VEC90966.1|937372_937735_-|6-pyruvoyl-tetrahydrobiopterin-synthase MSTTLYKDFTFEAAHRLPHVPEGHKCGRLHGHSFMVRLEITGEVDPHTGWIMDFADLKAAFKPTYDRLDHYYLNDIPGLSNPTSEVLAKWIWDQVKPVVPLLSAVMVKETCTAGCVYRGE >LR134233.1|VEC90967.1|938399_939968_+|sulfite-reductase-subunit-alpha MAEALRDDLLAANLNVTLVNAGDYKFKQIASEKLLVIVTSTQGEGEPPEEAVALHKYLFSKKAPKLENTAFAVFSLGDTSYEFFCQSGKDFDSKLAELGGERLLDRVDADVEYQAAASEWRARVVDVLKSRAPVAAPSQSVATGAVNDIHTSPYTKDAPLTATLSVNQKITGCNSEKDVRHIEIDLGDSGLRYQPGDALGVWYQNDPALVKELVELLWLKGDEPVTVDGKTLPLAEALEWHFELTVNTANIVENYATLTRSESLLPLVGDKAQLQHYAATTPIVDMVRFSPAQLDAQALIDLLRPLTPRLYSIASAQAEVESEVHITVGVVRYDIEGRARAGGASSFLADRVEEEGEVRVFIEHNDNFRLPANPQTPVIMIGPGTGIAPFRAFMQQRAAEGAEGKNWLFFGNPHFTEDFLYQVEWQRYVKEGLLSRIDLAWSRDQKEKIYVQDKLREQGAELWRWINDGAHIYVCGDARRMAADVEKALLEVIAEFGGMDLESADEYLSELRVERRYQRDVY >LR134233.1|VEC90968.1|939967_941680_+|sulfite-reductase-subunit-beta MSEKHPGPLVVEGKLSDAERMKLESNYLRGTIAEDLNDGLTGGFKGDNFLLIRFHGMYQQDDRDIRAERAAQKLEPRHAMLLRCRLPGGVITTTQWQAIDKFAADNTIYGSIRLTNRQTFQFHGILKKNVKPVHQMLHSVGLDALATANDMNRNVLCTSNPYESQLHAEAYEWAKKISEHLLPRTRAYAEIWLDQEKVATTDEEPILGATYLPRKFKTTVVIPPQNDIDLHANDMNFVAIAENGKLVGFNLLVGGGLSIEHGNKKTYARTASEFGFLSLEHTLAVAEAVVTTQRDWGNRTDRKNAKTKYTLERVGVDTFKEEVERRAGITFEPIRPYEFTGRGDRIGWVKGIDDKWHLTLFIENGRILDYPGRPLKTGLLEIAKIHQGEFRITANQNLIIASVPESQKAKIETLASDHGLMNAVSAQRENSMACVSFPTCPLAMAEAERFLPSFTDKVEAILEKHGIPDEHIVMRVTGCPNGCGRAMLAEIGLVGKAPGRYNLHLGGNRIGTRIPRMYKENITEPDILASLGELIGRWAKEREAGEGFGDFTVRAGIIRPVLDPARDFWE >LR134233.1|VEC90969.1|941755_942490_+|phosphoadenosine-phosphosulfate-reductase MSQLDLNALNELPKVDRVLALAETNAQLETLTAEERVAWALENLPGEYVLSSSFGIQAAVSLHLVNQIRPDIPVILTDTGYLFPETYQFIDELTDKLKLNLKVYRAGESPAWQEARYGKLWEQGVEGIEKYNEINKVEPMNRALKELKAQTWFAGLRREQSGSRAHLPVLAIQRGVFKVLPIIDWDNRTVYQYLQKHGLKYHPLWDQGYLSVGDTHTTRKWEPGMAEEETRFFGLKRECGLHEG >LR134233.1|VEC90970.1|942785_943655_-|secreted-protein-in-the-Sop-family;-transferred-to-eukaryotic-cells MPVTLSFGNHQNYTLNESRLAHLLSADKEKAIHMGGWDKVQDHFRAEKKDHALEVLHSIIHGQGRGEPGEMEVNVEDINKIYAFKRLQHLACPAHQDLFTIKLDASQTQFLLMVGDTVISQSNIKDILNISDDAVIESMSREERQLFLQICEVIGAKMTWHPELLQESISTLRKEVTGNAQIKAAVYEMMRPAEAPDHPLVEWQDSLTEDEKSMLACINAGNFEPTTQFCKIGYQEVQGEVAFSMMHPCISYLLHSYSPFAEFKPTNSGFLKNLIRIITTIMQKKCLLM >LR134233.1|VEC90971.1|944098_946762_+|helicase MSIYHYWGKSRRGETNGGDDYHLLCWHSLDVAAVGYWMVINNIYFIDHYLKKLGIQDKEQAAQFFAWILCWHDIGKFAHSFQQLYRHEALNIFNEPTRHYEKIAHTTLGYVLWNSWLSECPELFPPSSLSVRKSKRVMTLWMPVTTGHHGRPPEAIQELDHFRQQDKDAARDFLLRIKALFPLITLPEAWDEDEGIDQFQQLSWFISAAVVLADWTGSASRYFPRTAEKMPVDTYWQQALAKAQTAITLFPSAANVSAFTGIETLFPFIQHPTPLQQKALELDINVDGAQLFILEDVTGAGKTEAALILAHRLMAAGKAQGLYFGLPTMATANAMFERMANTWLALYQPDSRPSLILAHSARRLMDRFNQSIWSVTLSGTEEPDEAQPDSQGCAAWFADSNKKALLAEVGVGTLDQAMMAVMPFKHNNLRLLGLSNKILLADEIHACDAWMSRILEGLIERQASNGNATILLSATLSQQQRDKLVAAFSRGVRRSVQAPLLGHDDYPWLTQVTQTELISQRVDTRKEVERSVDIGWLHSEEACLERIGEAVEKGNCIAWIRNSVDDAIRIYRQLQLSKVVATENLLLFHSRFAFHDRQRIESQTLNLFGKQSGAQRAGKVIIATQVIEQSLDIDCDEMISDLAPVDLLIQRAGRLQRHIRDRNGLVKKSGQDERETPVLRILAPEWDDAPRENWLSSAMRNSAYVYPDHGRMWLTQRILREQGAIRMPQSARLLIESVYGEDVNMPVGFAKTEQLQEGKFYCDRAFARQMLLNFAPGYCAEISDSLPEKMSTRLAEESVTLWLAKIVDGVVTPYASGEHAWEMSVLRVRQSWWDKHKDEFERLDGEPLRKWCAQQHQDKDFATVVVVTDFAACGYSANEGLIGMMGE >LR134233.1|VEC90972.1|946773_948327_+|CRISPR-associated-protein-CasA MDNFSLLTTPWLPVRFKDGSTGKLAPVDLADENVVDIAATRADLQGAAWQFLLGLLQCSIAPKRYKNWEDIWFDGLHADALHKALAQLEHAFQFGAETPSFMQDFEPLTGEKVSMASLLPETPGAQTTKFNKDHFIKRGVTERFCPHCAALALFSLQLNAPSGGKGYRTGLRGGGPLTTLVELQEYQGERQTPLWRKLWLNVMPQDTADLPLPDQCDATVFPWLAATRTSEQANAVTTPEQVNKLQAYWGMPRRIRLDFATLQSGCCDICGAESDELLGFMTVKNYGVNYDGWRHPLTPYRAPVKDQNAFFSVKPQPGGLIWRDWLGLSQNNQTEANYESPAQVVKVFNARSLTDVKAGIWGFGADFDNMKIRCWYEHHFPLLMTEGLIPDLRKAVQTAARLLSLLRSALKEAWFADAKGARGDFSFIDIDFWNLTQGRFLNLIHDLENGHKPDERLNKWQRELWLFTRHYFDDHVFTNPYESSDLERIMTARKKYFTTSAEKTKRKSRQSKETGGC >LR134233.1|VEC90973.1|948327_948702_+|transposase MSVVTKDDKATLRQWHEELQEKRGLRASLRRSKTVNDACLAEGLHSLLMQTHSLWKNKAPWNVTALAITAALAAHIKFIDEQKSFAAQLGQKKGGDTPVMSKLRFSHLLAVKRRMNCCASYGVR >LR134233.1|VEC90974.1|948698_948887_+|transposase MKLLDGSVNLFSLADDIFCWCQEQNDLLNHHRRQQRPTEFLRIRWALEYYQAGDGDTDNEQD |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LR134233_3 | 953305-953455 | TypeI-E |
I-E
Consensus repeat of LR134233_3
|
2 spacers
spacers of LR134233_3
>3.1|953334|32|LR134233|CRISPRCasFinder GGTTAACCAGGGGTTTTTCCCCACTATTTCGC >3.2|953395|32|LR134233|CRISPRCasFinder AGGGGCGTTCCGCAGTCGACAAGGGCTGAAAA |
cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3 |
CRISPR arrays and Neighbor proteins around LR134233_3
The CRISPR arrays of LR134233_3 >merge|LR134233|3|953305-953455|CRISPRCasFinder GTGTTCCCCGCGCCAGCGGGGATAAACCGGGTTAACCAGGGGTTTTTCCCCACTATTTCGCGTGTTCCCCGCGCCAGCGGGGATAAACCGAGGGGCGTTCCGCAGTCGACAAGGGCTGAAAAATGTTCCCCGCGTCAGCGGGGATAAACAC >LR134233|3|3|953305-953455|CRISPRCasFinder GTGTTCCCCGCGCCAGCGGGGATAAACCG GGTTAACCAGGGGTTTTTCCCCACTATTTCGC GTGTTCCCCGCGCCAGCGGGGATAAACCG AGGGGCGTTCCGCAGTCGACAAGGGCTGAAAA ATGTTCCCCGCGTCAGCGGGGATAAACAC
>LR134233.1|VEC90980.1|952263_952557_+|ssRNA-endonuclease MSMVVVVTENVPPRLRGRLAVWLLEVRAGVYVGDTSKRIREMIWQQITQLGGCGNAVMAWATNTESGFEFQTWGENRRIPVDLDGLRLVSFLPVENQ >LR134233.1|VEC90979.1|951343_952264_+|CRISPR-associated-protein-Cas1 MTFVPLNPIPLKDRTSMIFLQYGQIDVLDGAFVLIDKTGIRTHIPVGSVACIMLEPGTRVSHAAVRLASTVGTLLVWVGEAGVRVYSSGQPGGARADKLLYQAKLALDDDLRLKVVRKMYELRFREPPPARRSVEQLRGIEGARVRATYALLAKQYGVKWHGRNYDPKDWEKGDVVNRCISAATSCLYGISEAAILAAGYAPAIGFIHSGKPLSFVYDIADIIKFESVVPKAFEIAARHPAEPDKEVRLACRDIFRSSKLTGKLIPLIEEVLAAGEIEPPQPAPDMLPPAIPEPESLGDSGHRGHG >LR134233.1|VEC90978.1|950915_951347_+|CRISPR-associated-protein-Cse3-family MQTRPFAPTLSAGQTLRFNLRANPTVCKNGKRHDLLMEAKRQRKTQGDSQDIWSYQQQAALTWLARQGEQNGFTLREASVDAYRQQQIRRGKDRQMIQFSSVDYTGVLVINEPALFLQRLAQGYGKSRAFGCGMMMIKPGDDA >LR134233.1|VEC90977.1|950697_950946_+|CRISPR-associated-protein-Cse3-family MYLSRITLHTSELSPAQLLHLVECGEYVMHQWLWDLFPGGKERQFLYRREELQGAFRFLFSHRNSLPPARFLTCRLAHLRQR >LR134233.1|VEC90976.1|949969_950716_+|CRISPR-associated-protein-Cas5e-family MSQYLVFQLHGPMASWGVDAPGEVRHSHELPSRSALLGLLAAALGIRRDEEERLNAFNRHYQFLLCASGNPRWARDYHTVQMPKEVRKARYFSRREELQDPELLSALISRRDYYTDAWWMIAVSATPDAPYRLAQLQASLQHPVFPLYLGRKSHPLALPLAPQLLEGSAADVLREAYRWYQDQFNALKLPLPRLQNECWWEGEHDGLTASKILRRRDMPLSRQQWLFGERSVNQGPWLRKEDACISQE >LR134233.1|VEC90975.1|948900_949959_+|CRISPR-associated-protein-Cse4 MTTFIQLHLLTAYPAANLNRDDTGAPKTVVLGGATRLRISSQSLKRAWRTSELFEQALAGHIGIRTGRIAREAAQILVDSGIDAKKAVEYVKNIANCFGKVKEDKKPKDELTNAETEQLVHISPAEFEAVKALARRLAEEKRPAIEEEAELLRHDRMAVDIAMFGRMLAKKTDFNVEAACQVAHAFGVSETIIEDDFFTAVDDLRQASAEDAGAGHLGETGFGSALFYTYICIDKDLLVKNLNDNEELANKTLRAFTEAALKVSPTGKQNSFASRAYASWALAEKGTDQPRSLAAAFYEPINGTDQLNVAVKRITSLHKNMNKVYGQRTDTASFDVMNQQGSMEDVLDFICA >LR134233.1|VEC90974.1|948698_948887_+|transposase MKLLDGSVNLFSLADDIFCWCQEQNDLLNHHRRQQRPTEFLRIRWALEYYQAGDGDTDNEQD >LR134233.1|VEC90973.1|948327_948702_+|transposase MSVVTKDDKATLRQWHEELQEKRGLRASLRRSKTVNDACLAEGLHSLLMQTHSLWKNKAPWNVTALAITAALAAHIKFIDEQKSFAAQLGQKKGGDTPVMSKLRFSHLLAVKRRMNCCASYGVR >LR134233.1|VEC90972.1|946773_948327_+|CRISPR-associated-protein-CasA MDNFSLLTTPWLPVRFKDGSTGKLAPVDLADENVVDIAATRADLQGAAWQFLLGLLQCSIAPKRYKNWEDIWFDGLHADALHKALAQLEHAFQFGAETPSFMQDFEPLTGEKVSMASLLPETPGAQTTKFNKDHFIKRGVTERFCPHCAALALFSLQLNAPSGGKGYRTGLRGGGPLTTLVELQEYQGERQTPLWRKLWLNVMPQDTADLPLPDQCDATVFPWLAATRTSEQANAVTTPEQVNKLQAYWGMPRRIRLDFATLQSGCCDICGAESDELLGFMTVKNYGVNYDGWRHPLTPYRAPVKDQNAFFSVKPQPGGLIWRDWLGLSQNNQTEANYESPAQVVKVFNARSLTDVKAGIWGFGADFDNMKIRCWYEHHFPLLMTEGLIPDLRKAVQTAARLLSLLRSALKEAWFADAKGARGDFSFIDIDFWNLTQGRFLNLIHDLENGHKPDERLNKWQRELWLFTRHYFDDHVFTNPYESSDLERIMTARKKYFTTSAEKTKRKSRQSKETGGC >LR134233.1|VEC90971.1|944098_946762_+|helicase MSIYHYWGKSRRGETNGGDDYHLLCWHSLDVAAVGYWMVINNIYFIDHYLKKLGIQDKEQAAQFFAWILCWHDIGKFAHSFQQLYRHEALNIFNEPTRHYEKIAHTTLGYVLWNSWLSECPELFPPSSLSVRKSKRVMTLWMPVTTGHHGRPPEAIQELDHFRQQDKDAARDFLLRIKALFPLITLPEAWDEDEGIDQFQQLSWFISAAVVLADWTGSASRYFPRTAEKMPVDTYWQQALAKAQTAITLFPSAANVSAFTGIETLFPFIQHPTPLQQKALELDINVDGAQLFILEDVTGAGKTEAALILAHRLMAAGKAQGLYFGLPTMATANAMFERMANTWLALYQPDSRPSLILAHSARRLMDRFNQSIWSVTLSGTEEPDEAQPDSQGCAAWFADSNKKALLAEVGVGTLDQAMMAVMPFKHNNLRLLGLSNKILLADEIHACDAWMSRILEGLIERQASNGNATILLSATLSQQQRDKLVAAFSRGVRRSVQAPLLGHDDYPWLTQVTQTELISQRVDTRKEVERSVDIGWLHSEEACLERIGEAVEKGNCIAWIRNSVDDAIRIYRQLQLSKVVATENLLLFHSRFAFHDRQRIESQTLNLFGKQSGAQRAGKVIIATQVIEQSLDIDCDEMISDLAPVDLLIQRAGRLQRHIRDRNGLVKKSGQDERETPVLRILAPEWDDAPRENWLSSAMRNSAYVYPDHGRMWLTQRILREQGAIRMPQSARLLIESVYGEDVNMPVGFAKTEQLQEGKFYCDRAFARQMLLNFAPGYCAEISDSLPEKMSTRLAEESVTLWLAKIVDGVVTPYASGEHAWEMSVLRVRQSWWDKHKDEFERLDGEPLRKWCAQQHQDKDFATVVVVTDFAACGYSANEGLIGMMGE >LR134233.1|VEC90981.1|953469_954516_-|alkaline-phosphatase-isozyme-conversion-aminopeptidase MFSATRRFAVILALGVGFILPAQAASPGPGEIANTQARHIATFFPGRMTGSPAEMLSADYLRQQFTQMGYQSDIRTFNSRFIYTTKDNRKNWHNVTGSTVIAAHEGRVPQQIIIMAHLDTYAPQSDADVDANLGGLTLQGMDDNAAGLGVMLELAARLKDIPTHYGIRFIATSGEEEGKLGAENLLKRMSDAEKKNTLLVINLDNLIVGDKLYFNSGKNTPEAVRTLTRDRALAIARRYGIAANTNPGRNPSYPKGTGCCNDAEVFDKAGISVLSVEATNWNLGKKDGYQQRVKNASFPNGNSWHDVRLDNQQHIDKALPGRIERRSRDVVRIMLPLVKELAKAEKTS >LR134233.1|VEC90982.1|954766_955675_+|Sulfate-adenylyltransferase-subunit-2-1 MDQKRLTHLRQLEAESIHIIREVAAEFANPVMLYSIGKDSSVMLHLARKAFYPGTLPFPLLHVDTGWKFREMYAFRDRTANAYGCELLVHKNPEGVAMGINPFVHGSAKHTDIMKTEGLKQALNKYGFDAAFGGARRDEEKSRAKERIYSFRDRFHRWDPKNQRPELWRNYNGQINKGESIRVFPLSNWTEQDIWQYIWLENIDIVPLYLAAERPVLERDGMLMMVDDDRIDLQPGEVIKKRMVRFRTLGCWPLTGAVESHAQTLPEIIEEMLVSTTSERQGRMIDRDQAGSMELKKRQGYF >LR134233.1|VEC90983.1|955684_957124_+|ATP-sulfurylase MNTILAQQIANEGGVEAWMIAQQHKSLLRFLTCGSVDDGKSTLIGRLLHDTLQIYEDQLSSLHNDSKRHGTQGEKLDLALLVDGLQAEREQGITIDVAYRYFSTEKRKFIIADTPGHEQYTRNMATGASTCDLAILLIDARKGVLEQTRRHSFISTLLGIKHLVVAINKMDLVDYREETFARIREDYLTFAEQLPGDLDIRFVPLSALEGDNVAAQSANMRWYSGPTLLEVLETVDIQRAVDRQPMRFPVQYVNRPNLDFRGYAGTLASGSVKVGERIKVLPSGVESSVARIVTFNGDKEEACAGEAITLVLNDDIDISRGDLLLAANETLAPARHAAIDVVWMAEQPLAPGQSYDVKLAGKKTRARIEAIRYQIDINNLTQRDVESLPLNGIGLVEMTFDEPLALDIYQQNPVTGGLIFIDRLSNVTVGAGMVRELDERGATPPVEYSAFELELNALVRRHFPHWDARDLLGDKHGAA >LR134233.1|VEC90984.1|957110_957716_+|adenosine-5-phosphosulfate-kinase MALHDENVVWHSHPVTVAAREQLHGHRGVVLWFTGLSGSGKSTVAGALEEALHQRGVSTYLLDGDNVRHGLCRDLGFSDADRQENIRRVGEVASLMADAGLIVLTAFISPHRAERQLVKERVGHDRFIEIYVNTPLAICEQRDPKGLYKKARAGELRNFTGIDAIYEAPDSPQVHLNGEQLVTNLVSQLLDLLRRRDIIRS >LR134233.1|VEC90985.1|957766_958090_+|Inner-membrane-protein-ygbE MRNSHNITFTRSDAFMVDDDATSVFPGAVVGFVSWLLALGIPFLLYGPNTLFFFLYTWPFFLALMPVSVIIGIALHLLVKGKILFSIMFTLLAVGALFGALFIWLLG >LR134233.1|VEC90986.1|958280_958592_+|Cell-division-protein-DivIC-(FtsB)-stabilizesFtsL-against-RasP-cleavage MGKLTLLLLALLVWLQYSLWFGKNGIHDYSRVNDDVVAQQATNAKLKARNDQLFAEIDDLNGGQEAIEERARNELSMTKPGETFYRLVPDASKRAATAGQTHR >LR134233.1|VEC90987.1|958610_959321_+|2-C-methyl-D-erythritol-4-phosphate-cytidylyltransferase MAATLLDVCAVVPAAGFGRRMQTECPKQYLSIGNKTILEHSVHALLAHPRVTRVVIAISPGDHRFAQLPLANHPQITVVDGGNERADSVLAGLQAVAKAQWVLVHDAARPCLHQDDLARLLAISENSRVGGILASPVRDTMKRGEPGKNAIAHTVERADLWHALTPQFFPRELLHDCLTRALNEGATITDEASALEYCGFHPALVEGRADNIKVTRPEDLALAEFYLTRTIHQEKA >LR134233.1|VEC90988.1|959320_959800_+|2-C-methyl-D-erythritol-2,4-cyclodiphosphate-synthase MRIGHGFDVHAFGGEGPIIIGGVRIPYEKGLLAHSDGDVALHALTDALLGAAALGDIGKLFPDTDPAFKGADSRELLREAWRRIQAKGYTLGNVDVTIIAQAPKMLPHIPQMRVFIAEDLGCHMDEVNVKATTTEKLGFTGRGEGIACEAVALLMKAAK >LR134233.1|VEC90989.1|959796_960846_+|tRNA-pseudouridine-13-synthase MTEFDNLTWLHGKPQGSGLLKANPEDFVVVEDLGFTPDGEGEHILLRILKNGCNTRFVADALAKFLKIHAREVSFAGQKDKHAVTEQWLCARVPGKEMPDFSAFQLEGCKVLEYARHKRKLRLGALKGNAFTLVLREISDRRDVETRLQAIRDGGVPNYFGAQRFGIGGSNLQGALRWAQSNAPVRDRNKRSFWLSAARSALFNQIVHQRLKKPDFNQVVDGDALQLAGRGSWFVATSEELPELQRRVDEKELMITASLPGSGEWGTQRAALAFEQDAIAQETVLQSLLLREKVEASRRAMLLYPQQLSWNWWDDVTVELRFWLPAGSFATSVVRELINTMGDYAHIAE >LR134233.1|VEC90990.1|960826_961588_+|stationary-phase-survival-protein-SurE MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFDNGDIAVQMGTPTDCVYLGVNALMRPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLNGYQHYDTAAAVTCALLRGLSREPLRTGRILNVNVPDLPLAQIKGIRVTRCGSRHPADKVIPQEDPRGNTLYWIGPPGDKYDAGPDTDFAAVDEGYVSVTPLHVDLTAHSAHDVVSDWLDSVGVGTQW |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LR134233_2 | 952654-953231 | TypeI-E |
I-E
Consensus repeat of LR134233_2
|
9 spacers
spacers of LR134233_2
>2.1|952683|32|LR134233|PILER-CR,CRISPRCasFinder,CRT TCACTTTTTGCACACTCAGTGACGACATTGTA >2.2|952744|32|LR134233|PILER-CR,CRISPRCasFinder,CRT GCAAAAGTCATGGCGTCATTCGGTGCTGATAT >2.3|952805|32|LR134233|PILER-CR,CRISPRCasFinder,CRT GACAAAGGAGATATCCGGTTAGCGCTTTCACA >2.4|952866|32|LR134233|PILER-CR,CRISPRCasFinder,CRT CCTGGCGGCACACTCTACCGCCCGTGGTGACA >2.5|952927|32|LR134233|PILER-CR GCATCGCCTGGCTGAGTCCGTCGCTAACTGCG >2.6|952988|32|LR134233|PILER-CR ACATCTGGGAGGCGTGGCTAAAACTTACAGCT >2.7|953049|32|LR134233|PILER-CR GGCGCGCGTGTTGTTGGTGATTCCGCATATCT >2.8|953110|32|LR134233|PILER-CR CGGGCGCATTGTACGCGATGCAGACCATGCCG >2.9|952927|31|LR134233|CRISPRCasFinder,CRT GCATCGCCTGGCTGAGTCCGTCGCTAACTGC >2.10|952987|32|LR134233|CRISPRCasFinder,CRT ACATCTGGGAGGCGTGGCTAAAACTTACAGCT >2.11|953048|32|LR134233|CRISPRCasFinder,CRT GGCGCGCGTGTTGTTGGTGATTCCGCATATCT >2.12|953109|32|LR134233|CRISPRCasFinder,CRT CGGGCGCATTGTACGCGATGCAGACCATGCCG >2.13|953170|32|LR134233|CRISPRCasFinder,CRT CAGACCGATCCGCGATCATTTTCTGTGGCTGT |
cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3 |
CRISPR arrays and Neighbor proteins around LR134233_2
The CRISPR arrays of LR134233_2 >merge|LR134233|2|952654-953231|PILER-CR,CRISPRCasFinder,CRT GTGTTCCCCGCGCCAGCGGGGATAAACCGTCACTTTTTGCACACTCAGTGACGACATTGTAGTGTTCCCCGCGCCAGCGGGGATAAACCGGCAAAAGTCATGGCGTCATTCGGTGCTGATATGTGTTCCCCGCGCCAGCGGGGATAAACCGGACAAAGGAGATATCCGGTTAGCGCTTTCACAGTGTTCCCCGCGCCAGCGGGGATAAACCGCCTGGCGGCACACTCTACCGCCCGTGGTGACAGTGTTCCCCGCGCCAGCGGGGATAAACCGGCATCGCCTGGCTGAGTCCGTCGCTAACTGCGGTGTTCCCCGGCCAGCGGGGATAAACCGACATCTGGGAGGCGTGGCTAAAACTTACAGCTGTGTTCCCCGCGCCAGCGGGGATAAACCGGGCGCGCGTGTTGTTGGTGATTCCGCATATCTGTGTTCCCCGCGCCAGCGGGGATAAACCGCGGGCGCATTGTACGCGATGCAGACCATGCCGGTGTTCCCCGCGCCAGCGGGGATAAACCGCAGACCGATCCGCGATCATTTTCTGTGGCTGTGTGTTCCCCGCGCCAGCGGGGATAAACCGA >LR134233|2|2|952654-953169|PILER-CR GTGTTCCCCGCGCCAGCGGGGATAAACCG TCACTTTTTGCACACTCAGTGACGACATTGTA GTGTTCCCCGCGCCAGCGGGGATAAACCG GCAAAAGTCATGGCGTCATTCGGTGCTGATAT GTGTTCCCCGCGCCAGCGGGGATAAACCG GACAAAGGAGATATCCGGTTAGCGCTTTCACA GTGTTCCCCGCGCCAGCGGGGATAAACCG CCTGGCGGCACACTCTACCGCCCGTGGTGACA GTGTTCCCCGCGCCAGCGGGGATAAACCG GCATCGCCTGGCTGAGTCCGTCGCTAACTGCG GTGTTCCCCGGCCAGCGGGGATAAACCGA CATCTGGGAGGCGTGGCTAAAACTTACAGCTG TGTTCCCCGCGCCAGCGGGGATAAACCGG GCGCGCGTGTTGTTGGTGATTCCGCATATCTG TGTTCCCCGCGCCAGCGGGGATAAACCGC GGGCGCATTGTACGCGATGCAGACCATGCCGG TGTTCCCCGCGCCAGCGGGGATAAACCG >LR134233|2|2|952654-953231|CRISPRCasFinder GTGTTCCCCGCGCCAGCGGGGATAAACCG TCACTTTTTGCACACTCAGTGACGACATTGTA GTGTTCCCCGCGCCAGCGGGGATAAACCG GCAAAAGTCATGGCGTCATTCGGTGCTGATAT GTGTTCCCCGCGCCAGCGGGGATAAACCG GACAAAGGAGATATCCGGTTAGCGCTTTCACA GTGTTCCCCGCGCCAGCGGGGATAAACCG CCTGGCGGCACACTCTACCGCCCGTGGTGACA GTGTTCCCCGCGCCAGCGGGGATAAACCG GCATCGCCTGGCTGAGTCCGTCGCTAACTGC GGTGTTCCCCGGCCAGCGGGGATAAACCG ACATCTGGGAGGCGTGGCTAAAACTTACAGCT GTGTTCCCCGCGCCAGCGGGGATAAACCG GGCGCGCGTGTTGTTGGTGATTCCGCATATCT GTGTTCCCCGCGCCAGCGGGGATAAACCG CGGGCGCATTGTACGCGATGCAGACCATGCCG GTGTTCCCCGCGCCAGCGGGGATAAACCG CAGACCGATCCGCGATCATTTTCTGTGGCTGT GTGTTCCCCGCGCCAGCGGGGATAAACCGA >LR134233|2|2|952654-953230|CRT GTGTTCCCCGCGCCAGCGGGGATAAACCG TCACTTTTTGCACACTCAGTGACGACATTGTA GTGTTCCCCGCGCCAGCGGGGATAAACCG GCAAAAGTCATGGCGTCATTCGGTGCTGATAT GTGTTCCCCGCGCCAGCGGGGATAAACCG GACAAAGGAGATATCCGGTTAGCGCTTTCACA GTGTTCCCCGCGCCAGCGGGGATAAACCG CCTGGCGGCACACTCTACCGCCCGTGGTGACA GTGTTCCCCGCGCCAGCGGGGATAAACCG GCATCGCCTGGCTGAGTCCGTCGCTAACTGC GGTGTTCCCCGGCCAGCGGGGATAAACCG ACATCTGGGAGGCGTGGCTAAAACTTACAGCT GTGTTCCCCGCGCCAGCGGGGATAAACCG GGCGCGCGTGTTGTTGGTGATTCCGCATATCT GTGTTCCCCGCGCCAGCGGGGATAAACCG CGGGCGCATTGTACGCGATGCAGACCATGCCG GTGTTCCCCGCGCCAGCGGGGATAAACCG CAGACCGATCCGCGATCATTTTCTGTGGCTGT GTGTTCCCCGCGCCAGCGGGGATAAACCG
>LR134233.1|VEC90980.1|952263_952557_+|ssRNA-endonuclease MSMVVVVTENVPPRLRGRLAVWLLEVRAGVYVGDTSKRIREMIWQQITQLGGCGNAVMAWATNTESGFEFQTWGENRRIPVDLDGLRLVSFLPVENQ >LR134233.1|VEC90979.1|951343_952264_+|CRISPR-associated-protein-Cas1 MTFVPLNPIPLKDRTSMIFLQYGQIDVLDGAFVLIDKTGIRTHIPVGSVACIMLEPGTRVSHAAVRLASTVGTLLVWVGEAGVRVYSSGQPGGARADKLLYQAKLALDDDLRLKVVRKMYELRFREPPPARRSVEQLRGIEGARVRATYALLAKQYGVKWHGRNYDPKDWEKGDVVNRCISAATSCLYGISEAAILAAGYAPAIGFIHSGKPLSFVYDIADIIKFESVVPKAFEIAARHPAEPDKEVRLACRDIFRSSKLTGKLIPLIEEVLAAGEIEPPQPAPDMLPPAIPEPESLGDSGHRGHG >LR134233.1|VEC90978.1|950915_951347_+|CRISPR-associated-protein-Cse3-family MQTRPFAPTLSAGQTLRFNLRANPTVCKNGKRHDLLMEAKRQRKTQGDSQDIWSYQQQAALTWLARQGEQNGFTLREASVDAYRQQQIRRGKDRQMIQFSSVDYTGVLVINEPALFLQRLAQGYGKSRAFGCGMMMIKPGDDA >LR134233.1|VEC90977.1|950697_950946_+|CRISPR-associated-protein-Cse3-family MYLSRITLHTSELSPAQLLHLVECGEYVMHQWLWDLFPGGKERQFLYRREELQGAFRFLFSHRNSLPPARFLTCRLAHLRQR >LR134233.1|VEC90976.1|949969_950716_+|CRISPR-associated-protein-Cas5e-family MSQYLVFQLHGPMASWGVDAPGEVRHSHELPSRSALLGLLAAALGIRRDEEERLNAFNRHYQFLLCASGNPRWARDYHTVQMPKEVRKARYFSRREELQDPELLSALISRRDYYTDAWWMIAVSATPDAPYRLAQLQASLQHPVFPLYLGRKSHPLALPLAPQLLEGSAADVLREAYRWYQDQFNALKLPLPRLQNECWWEGEHDGLTASKILRRRDMPLSRQQWLFGERSVNQGPWLRKEDACISQE >LR134233.1|VEC90975.1|948900_949959_+|CRISPR-associated-protein-Cse4 MTTFIQLHLLTAYPAANLNRDDTGAPKTVVLGGATRLRISSQSLKRAWRTSELFEQALAGHIGIRTGRIAREAAQILVDSGIDAKKAVEYVKNIANCFGKVKEDKKPKDELTNAETEQLVHISPAEFEAVKALARRLAEEKRPAIEEEAELLRHDRMAVDIAMFGRMLAKKTDFNVEAACQVAHAFGVSETIIEDDFFTAVDDLRQASAEDAGAGHLGETGFGSALFYTYICIDKDLLVKNLNDNEELANKTLRAFTEAALKVSPTGKQNSFASRAYASWALAEKGTDQPRSLAAAFYEPINGTDQLNVAVKRITSLHKNMNKVYGQRTDTASFDVMNQQGSMEDVLDFICA >LR134233.1|VEC90974.1|948698_948887_+|transposase MKLLDGSVNLFSLADDIFCWCQEQNDLLNHHRRQQRPTEFLRIRWALEYYQAGDGDTDNEQD >LR134233.1|VEC90973.1|948327_948702_+|transposase MSVVTKDDKATLRQWHEELQEKRGLRASLRRSKTVNDACLAEGLHSLLMQTHSLWKNKAPWNVTALAITAALAAHIKFIDEQKSFAAQLGQKKGGDTPVMSKLRFSHLLAVKRRMNCCASYGVR >LR134233.1|VEC90972.1|946773_948327_+|CRISPR-associated-protein-CasA MDNFSLLTTPWLPVRFKDGSTGKLAPVDLADENVVDIAATRADLQGAAWQFLLGLLQCSIAPKRYKNWEDIWFDGLHADALHKALAQLEHAFQFGAETPSFMQDFEPLTGEKVSMASLLPETPGAQTTKFNKDHFIKRGVTERFCPHCAALALFSLQLNAPSGGKGYRTGLRGGGPLTTLVELQEYQGERQTPLWRKLWLNVMPQDTADLPLPDQCDATVFPWLAATRTSEQANAVTTPEQVNKLQAYWGMPRRIRLDFATLQSGCCDICGAESDELLGFMTVKNYGVNYDGWRHPLTPYRAPVKDQNAFFSVKPQPGGLIWRDWLGLSQNNQTEANYESPAQVVKVFNARSLTDVKAGIWGFGADFDNMKIRCWYEHHFPLLMTEGLIPDLRKAVQTAARLLSLLRSALKEAWFADAKGARGDFSFIDIDFWNLTQGRFLNLIHDLENGHKPDERLNKWQRELWLFTRHYFDDHVFTNPYESSDLERIMTARKKYFTTSAEKTKRKSRQSKETGGC >LR134233.1|VEC90971.1|944098_946762_+|helicase MSIYHYWGKSRRGETNGGDDYHLLCWHSLDVAAVGYWMVINNIYFIDHYLKKLGIQDKEQAAQFFAWILCWHDIGKFAHSFQQLYRHEALNIFNEPTRHYEKIAHTTLGYVLWNSWLSECPELFPPSSLSVRKSKRVMTLWMPVTTGHHGRPPEAIQELDHFRQQDKDAARDFLLRIKALFPLITLPEAWDEDEGIDQFQQLSWFISAAVVLADWTGSASRYFPRTAEKMPVDTYWQQALAKAQTAITLFPSAANVSAFTGIETLFPFIQHPTPLQQKALELDINVDGAQLFILEDVTGAGKTEAALILAHRLMAAGKAQGLYFGLPTMATANAMFERMANTWLALYQPDSRPSLILAHSARRLMDRFNQSIWSVTLSGTEEPDEAQPDSQGCAAWFADSNKKALLAEVGVGTLDQAMMAVMPFKHNNLRLLGLSNKILLADEIHACDAWMSRILEGLIERQASNGNATILLSATLSQQQRDKLVAAFSRGVRRSVQAPLLGHDDYPWLTQVTQTELISQRVDTRKEVERSVDIGWLHSEEACLERIGEAVEKGNCIAWIRNSVDDAIRIYRQLQLSKVVATENLLLFHSRFAFHDRQRIESQTLNLFGKQSGAQRAGKVIIATQVIEQSLDIDCDEMISDLAPVDLLIQRAGRLQRHIRDRNGLVKKSGQDERETPVLRILAPEWDDAPRENWLSSAMRNSAYVYPDHGRMWLTQRILREQGAIRMPQSARLLIESVYGEDVNMPVGFAKTEQLQEGKFYCDRAFARQMLLNFAPGYCAEISDSLPEKMSTRLAEESVTLWLAKIVDGVVTPYASGEHAWEMSVLRVRQSWWDKHKDEFERLDGEPLRKWCAQQHQDKDFATVVVVTDFAACGYSANEGLIGMMGE >LR134233.1|VEC90981.1|953469_954516_-|alkaline-phosphatase-isozyme-conversion-aminopeptidase MFSATRRFAVILALGVGFILPAQAASPGPGEIANTQARHIATFFPGRMTGSPAEMLSADYLRQQFTQMGYQSDIRTFNSRFIYTTKDNRKNWHNVTGSTVIAAHEGRVPQQIIIMAHLDTYAPQSDADVDANLGGLTLQGMDDNAAGLGVMLELAARLKDIPTHYGIRFIATSGEEEGKLGAENLLKRMSDAEKKNTLLVINLDNLIVGDKLYFNSGKNTPEAVRTLTRDRALAIARRYGIAANTNPGRNPSYPKGTGCCNDAEVFDKAGISVLSVEATNWNLGKKDGYQQRVKNASFPNGNSWHDVRLDNQQHIDKALPGRIERRSRDVVRIMLPLVKELAKAEKTS >LR134233.1|VEC90982.1|954766_955675_+|Sulfate-adenylyltransferase-subunit-2-1 MDQKRLTHLRQLEAESIHIIREVAAEFANPVMLYSIGKDSSVMLHLARKAFYPGTLPFPLLHVDTGWKFREMYAFRDRTANAYGCELLVHKNPEGVAMGINPFVHGSAKHTDIMKTEGLKQALNKYGFDAAFGGARRDEEKSRAKERIYSFRDRFHRWDPKNQRPELWRNYNGQINKGESIRVFPLSNWTEQDIWQYIWLENIDIVPLYLAAERPVLERDGMLMMVDDDRIDLQPGEVIKKRMVRFRTLGCWPLTGAVESHAQTLPEIIEEMLVSTTSERQGRMIDRDQAGSMELKKRQGYF >LR134233.1|VEC90983.1|955684_957124_+|ATP-sulfurylase MNTILAQQIANEGGVEAWMIAQQHKSLLRFLTCGSVDDGKSTLIGRLLHDTLQIYEDQLSSLHNDSKRHGTQGEKLDLALLVDGLQAEREQGITIDVAYRYFSTEKRKFIIADTPGHEQYTRNMATGASTCDLAILLIDARKGVLEQTRRHSFISTLLGIKHLVVAINKMDLVDYREETFARIREDYLTFAEQLPGDLDIRFVPLSALEGDNVAAQSANMRWYSGPTLLEVLETVDIQRAVDRQPMRFPVQYVNRPNLDFRGYAGTLASGSVKVGERIKVLPSGVESSVARIVTFNGDKEEACAGEAITLVLNDDIDISRGDLLLAANETLAPARHAAIDVVWMAEQPLAPGQSYDVKLAGKKTRARIEAIRYQIDINNLTQRDVESLPLNGIGLVEMTFDEPLALDIYQQNPVTGGLIFIDRLSNVTVGAGMVRELDERGATPPVEYSAFELELNALVRRHFPHWDARDLLGDKHGAA >LR134233.1|VEC90984.1|957110_957716_+|adenosine-5-phosphosulfate-kinase MALHDENVVWHSHPVTVAAREQLHGHRGVVLWFTGLSGSGKSTVAGALEEALHQRGVSTYLLDGDNVRHGLCRDLGFSDADRQENIRRVGEVASLMADAGLIVLTAFISPHRAERQLVKERVGHDRFIEIYVNTPLAICEQRDPKGLYKKARAGELRNFTGIDAIYEAPDSPQVHLNGEQLVTNLVSQLLDLLRRRDIIRS >LR134233.1|VEC90985.1|957766_958090_+|Inner-membrane-protein-ygbE MRNSHNITFTRSDAFMVDDDATSVFPGAVVGFVSWLLALGIPFLLYGPNTLFFFLYTWPFFLALMPVSVIIGIALHLLVKGKILFSIMFTLLAVGALFGALFIWLLG >LR134233.1|VEC90986.1|958280_958592_+|Cell-division-protein-DivIC-(FtsB)-stabilizesFtsL-against-RasP-cleavage MGKLTLLLLALLVWLQYSLWFGKNGIHDYSRVNDDVVAQQATNAKLKARNDQLFAEIDDLNGGQEAIEERARNELSMTKPGETFYRLVPDASKRAATAGQTHR >LR134233.1|VEC90987.1|958610_959321_+|2-C-methyl-D-erythritol-4-phosphate-cytidylyltransferase MAATLLDVCAVVPAAGFGRRMQTECPKQYLSIGNKTILEHSVHALLAHPRVTRVVIAISPGDHRFAQLPLANHPQITVVDGGNERADSVLAGLQAVAKAQWVLVHDAARPCLHQDDLARLLAISENSRVGGILASPVRDTMKRGEPGKNAIAHTVERADLWHALTPQFFPRELLHDCLTRALNEGATITDEASALEYCGFHPALVEGRADNIKVTRPEDLALAEFYLTRTIHQEKA >LR134233.1|VEC90988.1|959320_959800_+|2-C-methyl-D-erythritol-2,4-cyclodiphosphate-synthase MRIGHGFDVHAFGGEGPIIIGGVRIPYEKGLLAHSDGDVALHALTDALLGAAALGDIGKLFPDTDPAFKGADSRELLREAWRRIQAKGYTLGNVDVTIIAQAPKMLPHIPQMRVFIAEDLGCHMDEVNVKATTTEKLGFTGRGEGIACEAVALLMKAAK >LR134233.1|VEC90989.1|959796_960846_+|tRNA-pseudouridine-13-synthase MTEFDNLTWLHGKPQGSGLLKANPEDFVVVEDLGFTPDGEGEHILLRILKNGCNTRFVADALAKFLKIHAREVSFAGQKDKHAVTEQWLCARVPGKEMPDFSAFQLEGCKVLEYARHKRKLRLGALKGNAFTLVLREISDRRDVETRLQAIRDGGVPNYFGAQRFGIGGSNLQGALRWAQSNAPVRDRNKRSFWLSAARSALFNQIVHQRLKKPDFNQVVDGDALQLAGRGSWFVATSEELPELQRRVDEKELMITASLPGSGEWGTQRAALAFEQDAIAQETVLQSLLLREKVEASRRAMLLYPQQLSWNWWDDVTVELRFWLPAGSFATSVVRELINTMGDYAHIAE >LR134233.1|VEC90990.1|960826_961588_+|stationary-phase-survival-protein-SurE MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFDNGDIAVQMGTPTDCVYLGVNALMRPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLNGYQHYDTAAAVTCALLRGLSREPLRTGRILNVNVPDLPLAQIKGIRVTRCGSRHPADKVIPQEDPRGNTLYWIGPPGDKYDAGPDTDFAAVDEGYVSVTPLHVDLTAHSAHDVVSDWLDSVGVGTQW |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LR134233_5 | 3029672-3029779 | Orphan |
NA
Consensus repeat of LR134233_5
|
1 spacers
spacers of LR134233_5
>5.1|3029696|60|LR134233|CRISPRCasFinder CTGTCTCTTGATCAGACCTCCTGATCAAGAGACAGGGCCTGGCTGGGGGTTTTCCATAGA |
CRISPR arrays and Neighbor proteins around LR134233_5
The CRISPR arrays of LR134233_5 >merge|LR134233|5|3029672-3029779|CRISPRCasFinder CAATAAATATCCTCCGGCATAGCCCTGTCTCTTGATCAGACCTCCTGATCAAGAGACAGGGCCTGGCTGGGGGTTTTCCATAGACAATAAATATCCTCCGGCATAGCC >LR134233|5|4|3029672-3029779|CRISPRCasFinder CAATAAATATCCTCCGGCATAGCC CTGTCTCTTGATCAGACCTCCTGATCAAGAGACAGGGCCTGGCTGGGGGTTTTCCATAGA CAATAAATATCCTCCGGCATAGCC
>LR134233.1|VEC93181.1|3028484_3029564_-|iron-ABC-transporter-permease MTCQDKSLLAPGLASTPVIPRYRQIIRKRAVLMLAIALAMMASLMVDVTCGSSGLPLSALWQALFQPEKVNAGIHVIVLDIRLPYALMALLVGMALGLAGAEMQTILNNPLATPFTLGVSSAAAFGAALAIVLGIGIPGIPAAWFIPANAFIFALLSALLLDGITRWTGVAASGVVLFGIALVFTFNALVAIMQFVADEDTLQGLVFWTMGSLVRASWEKLGVLAVVFIAVLFCALRSAWQLTALRLGEERAMSFGIHVRRLRLLSLLRISLLSALAVAFVGPIGFIGLVAPHIARILLGEDHRFYLPGSVLIGGLVLSLASIAAKNSIPGVMVPVGIVTSLVGVPFFLSIVLRHRGSL >LR134233.1|VEC93180.1|3027708_3028485_-|ABC-transporter-ATP-binding-protein MRGLMLRSFSAGYSTQPVIADLNVPLLPRGKITILLGPNGCGKSTLLRSLAGLNNADGEALLDGEDLMSLSFAERAQKVVFLPQSLPQGVHLHVLESIIVVLRASGGRDNAQGRAQILAILEQLGITHLALQYLDQLSGGQRQLVGLAQSLIRRPELLLLDEPLSALDLNYQFHVMDLVRRDTQAQNRVTIVVAHDINIALRHGDHVLMLKDGRLVASGAPETVITAERLAEVYRVRGRVERCSQGKLQVVLDGVIAV >LR134233.1|VEC93179.1|3026865_3027618_+|phosphoglyceromutase MAVTKLVLVRHGESQWNKENRFTGWYDVDLSEKGVSEAKAAGKLLKEEGFSFDFAYTSVLKRAIHTLWNVLDELDQAWLPVEKSWKLNERHYGALQGLNKAETAEKYGDEQVKQWRRGFAVTPPELTKDDERYPGHDPRYAKLSEKELPLTESLALTIDRVIPYWNDTILPRMKSGERVIIAAHGNSLRALVKYLDNMSEDEILELNIPTGVPLVYEFDENFKPLKHYYLGNADEIAAKAAAVANQGKAK >LR134233.1|VEC93178.1|3025757_3026642_+|aldose-1-epimerase MLGCASPEHYPEQTSFLGASIGRYANRIANSRYTFAGETVQLSPSQGENQLHGGPEGFDKRRWQIVNQNDRQVLFALTSDDGDQGFPGHLCATAQYRLTDDNRISITYRATVDKPCPVNLTNHVYFNLDGDRTDVRQHKLQILADEYLPVDESGIPRQGLKSVANTSFDFRMPKVIASEFLADDDQRKVKGYDHAFLLQTQGDGKKPAARLWSQDGKLQMMVYTTAPALQFYSGNYLAGTPSRGPEPYADWQGLALESELLPDSPNHPEWPQPDCILRPGEEYASLTEYQFIPF >LR134233.1|VEC93177.1|3024461_3025610_+|galactokinase MNLKEKTRALFAEIFGYPATHTIQAPGRVNLIGEHTDYNDGFVLPCAIDYQTVISCAPRDDRTVRVIAADYDNQADEFSLDAPIVTHDSQQWSNYVRGVVKHLQQRNNAFGGVDMVISGNVPQGAGLSSSASLEVAVGTVFQQLYHLPLDGAQIALNGQEAENQFVGCNCGIMDQLISALGKKDHALLIDCRTLGAKAVSMPKGVAVVIINSNFKRTLVGSEYNTRREQCETGARFFQQPALRDVSLEAFNAVASELDPVVAKRVRHVLSENARTVEAASALEKGDLQRMGQLMAESHASMRDDFEITVPQIDTLVDIVKATIGDQGGVRMTGGGFGGCVVALIPEDLVPAVQQAVAQQYEAKTGIKETFYVCKPSQGAGQC >LR134233.1|VEC93176.1|3023412_3024459_+|galactose-1-phosphate-uridylyltransferase MTPFNPIDHPHRRYNPLTGQWVLVSPHRAKRPWQGAQETPSQQMLPAHDPDCFLCAGNTRVTGDKNPDYKGTYVFTNDFAALMADTPDAPDSHDPLMRCQSARGTSRVICFSPDHSKTLPELSLPALTEIVRTWQTQTAELGKTYPWVQVFENKGAAMGCSNPHPHGQVWANSFLPNEAEREDRLQKAYFAEQRSPMLVDYVQRELADGSRTVVETEHWLAVVPYWAAWPFETLLLPKTHVLRITDLSDEQRDSLALALKKLTSRYDNLFQCSFPYSMGWHGAPFNGEENAHWQLHAHFYPPLLRSATVRKFMVGYEMLAETQRDLTAEQAAERLRAVSDIHFRESGV >LR134233.1|VEC93175.1|3022385_3023402_+|UDP-glucose-4-epimerase MRVLVTGGSGYIGSHTCVQLLQNGHDVVILDNLCNSKRSVLPVIERLGGKHPTFVEGDIRNEALITEILHDHAIDTVIHFAGLKAVGESVARPLEYYDNNVNGTLRLVSAMRAANVKNLIFSSSATVYGDQPKIPYVESFPTGTPQSPYGKSKLMVEQILTDLQKAQPEWSIALLRYFNPVGAHPSGDMGEDPQGIPNNLMPYIAQVAVGRRESLAVFGNDYPTEDGTGVRDYIHVMDLADGHVVAMEKLADKSGVHIYNLGAGVGSSVLDVVNAFSKACGKPINYHFAPRRDGDLPAYWADASKADRELNWRVTRTLDEMAQDTWHWQSRHPQGYPD >LR134233.1|VEC93174.1|3021254_3022163_+|membrane-protein MAFRFPQIILFLLAAMLFCPSSYAEQKPTAAQEARKTAVEVAVEGMSRAAVAGPTKISLGDKATLNLPEGFTWIPAKEAAVFMREIGNYVDDEYFYGLVFKKEMNGFISIEYDDSGYVKDDDAKNWDADELMDNLRKGTKEANKDRIAKGIEPIEIIGWIEKPTYDATNHRLIWSAAIQDIGTNEPLNEQGVNYNTYLLGREGYFSLNLVTDRGSVDHEIPLAKRILSSVKFNAGQRYADFNESTDKIAEYGLAALIGGIAAKKVGLLAMLGIALLKFWKVTAIGVVAVGALARKLLSRKKD >LR134233.1|VEC93173.1|3019608_3021084_+|Putative-molybdenum-transport-ATP-binding-protein-modF MSSLQISQGTFRLSDTKTLHLDSLTLNAGDSWAFVGANGSGKSALARALAGELPLLTGERQCRFTRITRLSFEQLQKLVSDEWQRNNTDMLSPGEDDTGRTTAEIIQDDVHHPARCAMLAQQFGISALLNRRFKYLSTGETRKALLCQALMSEPELLILDEPFDGLDVTARQQLAQRLTALNQAGMTLALVLNRFDEIPDFVQFAGVLADCTLAETGTTTELLQRALVAQLAHSERLTDVQLPEPDEPSARHALPDGEPRIVLNDGVVSYNDRPILHHLSWRVNPGEHWQIVGPNGAGKSTLLSLITGDHPQGYSNDLTLFGRRRGSGETIWDIKKHIGYVSSSLHLDYRVSTTVRNVILSGYFDSIGIYQAVSDRQRKLAQQWLDILGIDKRTADAPFHSLSWGQQRLALIVRALVKHPTVLILDEPLQGLDPLNRQLVRRFVDVLIRQGETQLLFVSHHAEDAPACITHRLEFVPEGERYKYVSGRCNN >LR134233.1|VEC93172.1|3018752_3019541_+|molybdenum-transport-protein-ModE MQAEILLTLKLQQKLFADPRRISLLKHIALSGSISQGAKDAGISYKSAWDAINDMNQLSEHMLVERATGGKGGGGAVLTRYGQRLIQLYDLLGQIQQKAFDVLSDDDALPLDSLLAAISRFSLQTSARNQWFGTITARNRDQVQQHVDVLLADGKTRLKVALTAQSGERLGLEEGKEVLILLKAPWVGITRDAAVARAADNQLSGTISHIERGAEQCEVLMALPDGQTLCATIPTADAATLKEGDDVIAWFNADRVIIATLC >LR134233.1|VEC93182.1|3029865_3030324_-|transposase-for-IS200 MGDEKSLAHTRWNCKYHIVFAPKYRRQAFYGEKRRAVGSILRKLCEWKNVRILEAECCADHIHMLLEIPPKMSVSSFMGYLKGKSSLMLYEQFGDLKFKYRNREFWCRGYYVDTVGKNTAKIQDYIKHQLEEDKMGEQLSIPYPGSPFTGRK >LR134233.1|VEC93183.1|3030654_3031353_-|membrane-protein MNETIAFRPEGRASTKDVVVNYTLASLDELEQLHPALNLLAHPSIVRGSHDFFVRCYDAGVQHAQAGAAETVAVLPDSLPQERQRSGLLCAAAGRLDALRQPLTHNRLCDLASQFCAGMADVDSETRSGFYTVRSISLPVYRRLLRDQHSHGVCLQQALLHLLAWKSDSPWARQQAQRLLWQGGVLGDKGEFALMTLDDELRERQIEWPGLWSLLAVTGFLAKFPAGPIFAD >LR134233.1|VEC93184.1|3031471_3032086_-|oxaloacetate-decarboxylase-beta-chain MALVPLIQPPIMKALTTETERKIRMVQLRTVSKREKILFPVVLLLLVALLLPDAAPLLGMFCFGNLMRESGVVERLSDTVQNGLINIVTIFLGLSVGAKLVADKFLQPQTLGILLLGVVAFGIGTAAGVLMAKLLNLCSKNKINPLIGSAGVSAVPMAARVSNKVGLESDAQNFLLMHAMGPNVAGVIGSAIAAGVMLKYVLAM >LR134233.1|VEC93185.1|3032238_3032772_-|oxaloacetate-decarboxylase-subunit-beta MESLNALLQGMGLMHLGAGQAIMLLVSLLLLWLAIAKKFEPLLLLPIGFGGLLSNIPEAGMALTALESLLAHHDAGQLAVIAAKLNCAPDVHAIKEALALALPSVQSQMENLAVDMGYTPGVLALFYKVAIGSGVAPLVIFMGVGAMTDFGPLLANPRTLLLGAAAQFGIFATVLGR >LR134233.1|VEC93186.1|3032787_3033924_-|oxaloacetate-decarboxylase-alpha-subunit MALLKAIEAGVDGVDTAISSMSATYGHPATEALVATLAGTKYDTGLDILKLENIAAYFREVRKKYHAFEGQLKGYDSRILVAQVPGGMLTNLESQLKQQNAADKLDQVLAEIPRVREDLGFIPLVTPTSQIVGTQAVLNVLTGERYKTIAKETAGILKGEYGHTPVPVNAALQARVLEGGAPVTCRPADLLKPELAELEADVRRQAQEKGITLAGNAIDDVLTVALFPQIGLKFLENRHNPAAFEPLPQAEAAQPVAKAEKPAASGIYTVEVEGKAFVVKVSDGGDISQLTAAAPAASSAPATAPAGAGTPVTAPLAGNIWKVIAAEGQTVAEGDVLLILEAMKMETEIRAAQAGTVRGIAVKSGDAVSVGDTLMTLA >LR134233.1|VEC93187.1|3033970_3034558_-|Oxaloacetate-decarboxylase-alpha-chain MTVAITDVVLRDAHQSLFATRLRLDDMLPIAAQLDDVGYGSLECWGGATFDACIRFLGEDPWLRLRELKKAMPKTPLQMLLRGQNLLGYRHYADDVVERFVERAVKNGMDVFRVFDAMNDPRNMKAALQAVRSHGAHAQGTLSYTTSPAHTLQTWLDLTEQLLETGVDSIAIKDMSGILTPMAAFELVSEIKKRF >LR134233.1|VEC93188.1|3034573_3034819_-|oxaloacetate-decarboxylase-subunit-gamma MNSSVLLGEGFTLMFLGMGFVLAFLFLLIFAIRGMSAAVNRFFPEPVPVPKAAPAAAPADDFARLKPVIAAAIHHHRRLNP >LR134233.1|VEC93189.1|3034868_3036137_-|cation-transporter MSPAIITLLVLVVAIIIFVSDRLPMGLVAFMVPMALYFTGVIDAKDIFASIVNANVILIVAMCVLGAAFFKTGLAWQSSKILLKYAKTERSLSVLIFLIGGVMSAFVSNSGTVAVLIPIVLGIAASSQIKPIKLLMPLVFGATIGADISIIGSPGNLIAKNTIETFSKGSLSVPFFEYAKIGIPLLIACSIFLYFFGSKLIADRDGNTQSDSQMDYSQIPAWHRNLTLAVFILSIAGMVATDYIKFLPPMHIIACCSAIVLVFAGVLTQKETFNSFETLTVFMLAFMMPLGAALNSTGAGEMIANAVISVTGDSGVMIIMASLWILTWALTQVMSNTAACTLLCPVGWTIAQSIGADPRAVVIAVFIASSVAVCTPMAIPANSMIIGPGNVKFKDFLKPGLAISLVCFIVSMILLPIFYPFY >LR134233.1|VEC93190.1|3036416_3037343_+|LysR-family-transcriptional-regulator MQYRHHLELTWLEDCLALKETLNFSKASASRYVTQPAFSRRIQSLEEWVGTPLFERSKRGVTLTKAGEVFTDQLPELIHSLYTLKSDTLEAAGNKQPSFVFSATHALSFSFVPHLLKQSDKIAKFGSFRLLSDSLNACEKMMRQGDSQFLLCHHHPHMHLNLNKNNFMSIRLGFDTLIPFSKPDSETLKPLWNINNKIQFPYLSFSSQSGLGRIIANTASINRITHNINVAFVADLAATLLAMVRSGDGVAWIPQSLARQDIEAKTIVTAAEKESNLWVPIEIRLYRPAKRMPPDAEELWEIFVEEQI >LR134233.1|VEC93191.1|3037343_3038234_-|LysR-family-transcriptional-regulator MEIRQLEYFVSASLLGNLTRVAERHFVSQPNITIAIKKLETELGVTLFDRKKNKLVLTEEGMFFLQKIEPILIALKNSVAEMKDYRNENGGIVTLGIPPMISLFLFSPLFKHFREIYPEMELSLVEEGTFGLHDKLSAGELDLAIVIINDCPKELVTVPLMTQQHVVCMSDKHPLAKKDSIDWEDLQYEPLILMKKDSWHRKTIIEECTKRGIHTHIFLSSNRVQTNIDLVAKNEGISFILDAVDLKAEKVITKAMTEPTYVTIGLAWKKDKYLSYATRALIKFIEDYIKEHFTIK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
LR134233_4 | 4.6|2871102|26|LR134233|CRT | 2871102-2871127 | 26 | MN694362 | Marine virus AFVG_250M129, complete genome | 9109-9134 | 2 | 0.923 |
LR134233_4 | 4.6|2871102|26|LR134233|CRT | 2871102-2871127 | 26 | NZ_CP042181 | Pseudomonas sp. KBS0802 plasmid unnamed, complete sequence | 54795-54820 | 4 | 0.846 |
LR134233_4 | 4.6|2871102|26|LR134233|CRT | 2871102-2871127 | 26 | NC_003350 | Pseudomonas putida plasmid pWW0, complete sequence | 56547-56572 | 4 | 0.846 |
LR134233_4 | 4.6|2871102|26|LR134233|CRT | 2871102-2871127 | 26 | MF288922 | Bacillus phage Janet, complete genome | 111855-111880 | 4 | 0.846 |
LR134233_4 | 4.6|2871102|26|LR134233|CRT | 2871102-2871127 | 26 | MH593834 | Siphoviridae sp. isolate ctbe1, complete genome | 59906-59931 | 4 | 0.846 |
LR134233_1 | 1.1|935901|32|LR134233|PILER-CR,CRISPRCasFinder,CRT | 935901-935932 | 32 | MF417837 | Uncultured Caudovirales phage clone 8F_7, partial genome | 4030-4061 | 5 | 0.844 |
LR134233_1 | 1.1|935901|32|LR134233|PILER-CR,CRISPRCasFinder,CRT | 935901-935932 | 32 | MF417913 | Uncultured Caudovirales phage clone 9S_2, partial genome | 8426-8457 | 5 | 0.844 |
LR134233_1 | 1.1|935901|32|LR134233|PILER-CR,CRISPRCasFinder,CRT | 935901-935932 | 32 | MF417916 | Uncultured Caudovirales phage clone 7S_13, partial genome | 24771-24802 | 5 | 0.844 |
LR134233_1 | 1.1|935901|32|LR134233|PILER-CR,CRISPRCasFinder,CRT | 935901-935932 | 32 | MF417911 | Uncultured Caudovirales phage clone 9AX_3, partial genome | 31909-31940 | 5 | 0.844 |
LR134233_1 | 1.1|935901|32|LR134233|PILER-CR,CRISPRCasFinder,CRT | 935901-935932 | 32 | MF417918 | Uncultured Caudovirales phage clone 7F_17, partial genome | 3512-3543 | 5 | 0.844 |
LR134233_1 | 1.1|935901|32|LR134233|PILER-CR,CRISPRCasFinder,CRT | 935901-935932 | 32 | MF417915 | Uncultured Caudovirales phage clone 8AX_8, partial genome | 17323-17354 | 5 | 0.844 |
LR134233_1 | 1.1|935901|32|LR134233|PILER-CR,CRISPRCasFinder,CRT | 935901-935932 | 32 | MF417914 | Uncultured Caudovirales phage clone 8S_2, partial genome | 13978-14009 | 5 | 0.844 |
LR134233_1 | 1.1|935901|32|LR134233|PILER-CR,CRISPRCasFinder,CRT | 935901-935932 | 32 | MF417920 | Uncultured Caudovirales phage clone 10F_8, partial genome | 35999-36030 | 5 | 0.844 |
LR134233_1 | 1.1|935901|32|LR134233|PILER-CR,CRISPRCasFinder,CRT | 935901-935932 | 32 | MF417898 | Uncultured Caudovirales phage clone 7AX_6, partial genome | 27407-27438 | 5 | 0.844 |
LR134233_1 | 1.1|935901|32|LR134233|PILER-CR,CRISPRCasFinder,CRT | 935901-935932 | 32 | MF417896 | Uncultured Caudovirales phage clone 3S_18, partial genome | 9076-9107 | 5 | 0.844 |
LR134233_1 | 1.1|935901|32|LR134233|PILER-CR,CRISPRCasFinder,CRT | 935901-935932 | 32 | MF417912 | Uncultured Caudovirales phage clone 10AX_4, partial genome | 16487-16518 | 5 | 0.844 |
LR134233_1 | 1.1|935901|32|LR134233|PILER-CR,CRISPRCasFinder,CRT | 935901-935932 | 32 | MF417910 | Uncultured Caudovirales phage clone 2AX_5, partial genome | 6471-6502 | 5 | 0.844 |
LR134233_4 | 4.6|2871102|26|LR134233|CRT | 2871102-2871127 | 26 | NZ_CP011543 | Corynebacterium mustelae strain DSM 45274 plasmid pCmus45274, complete sequence | 26860-26885 | 5 | 0.808 |
LR134233_4 | 4.6|2871102|26|LR134233|CRT | 2871102-2871127 | 26 | MF288919 | Bacillus phage AaronPhadgers, complete genome | 51881-51906 | 5 | 0.808 |
LR134233_4 | 4.6|2871102|26|LR134233|CRT | 2871102-2871127 | 26 | NZ_MF788070 | Raoultella ornithinolytica strain 23141 plasmid p23141-2, complete sequence | 51537-51562 | 5 | 0.808 |
LR134233_1 | 1.4|936084|32|LR134233|PILER-CR,CRISPRCasFinder,CRT | 936084-936115 | 32 | CP022016 | Salmonella enterica subsp. enterica serovar India str. SA20085604 plasmid unnamed1, complete sequence | 437275-437306 | 6 | 0.812 |
LR134233_4 | 4.6|2871102|26|LR134233|CRT | 2871102-2871127 | 26 | NZ_CP029732 | Citrobacter sp. CRE-46 strain AR_0157 plasmid unnamed4, complete sequence | 42066-42091 | 6 | 0.769 |
LR134233_4 | 4.6|2871102|26|LR134233|CRT | 2871102-2871127 | 26 | MK033137 | UNVERIFIED: Pseudomonas phage UNO-SLW1, complete genome | 39396-39421 | 6 | 0.769 |
LR134233_4 | 4.6|2871102|26|LR134233|CRT | 2871102-2871127 | 26 | KX449362 | Pseudomonas phage UNO-SLW3, complete genome | 36319-36344 | 6 | 0.769 |
LR134233_4 | 4.6|2871102|26|LR134233|CRT | 2871102-2871127 | 26 | KX449363 | Pseudomonas phage UNO-SLW4, complete genome | 36362-36387 | 6 | 0.769 |
LR134233_4 | 4.6|2871102|26|LR134233|CRT | 2871102-2871127 | 26 | NC_047873 | Pseudomonas phage UNO-SLW1, complete genome | 36441-36466 | 6 | 0.769 |
LR134233_4 | 4.6|2871102|26|LR134233|CRT | 2871102-2871127 | 26 | KX449361 | Pseudomonas phage UNO-SLW2, complete genome | 36393-36418 | 6 | 0.769 |
LR134233_2 | 2.13|953170|32|LR134233|CRISPRCasFinder,CRT | 953170-953201 | 32 | NZ_CP020932 | Marinobacter salarius strain SMR5 plasmid pSMR5, complete sequence | 199077-199108 | 7 | 0.781 |
LR134233_1 | 1.8|936328|32|LR134233|CRISPRCasFinder,CRT | 936328-936359 | 32 | JX434031 | Pseudomonas phage JBD24, complete genome | 2242-2273 | 8 | 0.75 |
LR134233_2 | 2.7|953049|32|LR134233|PILER-CR | 953049-953080 | 32 | NC_012520 | Rhodococcus opacus B4 plasmid pROB01, complete sequence | 539213-539244 | 8 | 0.75 |
LR134233_2 | 2.9|952927|31|LR134233|CRISPRCasFinder,CRT | 952927-952957 | 31 | NZ_CP037868 | Hydrogenophaga pseudoflava strain DSM 1084 plasmid pDSM1084, complete sequence | 18058-18088 | 8 | 0.742 |
LR134233_2 | 2.11|953048|32|LR134233|CRISPRCasFinder,CRT | 953048-953079 | 32 | NC_012520 | Rhodococcus opacus B4 plasmid pROB01, complete sequence | 539213-539244 | 8 | 0.75 |
LR134233_3 | 3.1|953334|32|LR134233|CRISPRCasFinder | 953334-953365 | 32 | MN855803 | Bacteriophage sp. isolate 108, partial genome | 10057-10088 | 8 | 0.75 |
LR134233_2 | 2.5|952927|32|LR134233|PILER-CR | 952927-952958 | 32 | NZ_CP037868 | Hydrogenophaga pseudoflava strain DSM 1084 plasmid pDSM1084, complete sequence | 18057-18088 | 9 | 0.719 |
LR134233_2 | 2.5|952927|32|LR134233|PILER-CR | 952927-952958 | 32 | MH651189 | Gordonia phage Schmidt, complete genome | 39255-39286 | 9 | 0.719 |
LR134233_2 | 2.9|952927|31|LR134233|CRISPRCasFinder,CRT | 952927-952957 | 31 | MH651189 | Gordonia phage Schmidt, complete genome | 39255-39285 | 9 | 0.71 |
LR134233_1 | 1.1|935901|32|LR134233|PILER-CR,CRISPRCasFinder,CRT | 935901-935932 | 32 | NZ_CP015745 | Shinella sp. HZN7 plasmid pShin-09, complete sequence | 119225-119256 | 10 | 0.688 |
LR134233_2 | 2.8|953110|32|LR134233|PILER-CR | 953110-953141 | 32 | MG189906 | Stenotrophomonas phage vB_SmaS_DLP_5, complete genome | 42813-42844 | 10 | 0.688 |
LR134233_2 | 2.9|952927|31|LR134233|CRISPRCasFinder,CRT | 952927-952957 | 31 | NC_008771 | Verminephrobacter eiseniae EF01-2 plasmid pVEIS01, complete sequence | 19099-19129 | 10 | 0.677 |
LR134233_2 | 2.12|953109|32|LR134233|CRISPRCasFinder,CRT | 953109-953140 | 32 | MG189906 | Stenotrophomonas phage vB_SmaS_DLP_5, complete genome | 42813-42844 | 10 | 0.688 |
LR134233_2 | 2.13|953170|32|LR134233|CRISPRCasFinder,CRT | 953170-953201 | 32 | LR134122 | Klebsiella aerogenes strain NCTC10006 genome assembly, plasmid: 2 | 18012-18043 | 10 | 0.688 |
LR134233_2 | 2.13|953170|32|LR134233|CRISPRCasFinder,CRT | 953170-953201 | 32 | NZ_AP014640 | Leptolyngbya boryana IAM M-101 plasmid pLBX | 186198-186229 | 10 | 0.688 |
LR134233_2 | 2.13|953170|32|LR134233|CRISPRCasFinder,CRT | 953170-953201 | 32 | NZ_AP018205 | Leptolyngbya boryana NIES-2135 plasmid plasmid2 DNA, complete genome | 470163-470194 | 10 | 0.688 |
LR134233_3 | 3.2|953395|32|LR134233|CRISPRCasFinder | 953395-953426 | 32 | NC_016113 | Streptomyces cattleya NRRL 8057 = DSM 46488 plasmid pSCAT, complete sequence | 1151729-1151760 | 10 | 0.688 |
LR134233_3 | 3.2|953395|32|LR134233|CRISPRCasFinder | 953395-953426 | 32 | NC_017585 | Streptomyces cattleya NRRL 8057 = DSM 46488 plasmid pSCATT, complete sequence | 661429-661460 | 10 | 0.688 |
LR134233_2 | 2.5|952927|32|LR134233|PILER-CR | 952927-952958 | 32 | NC_008771 | Verminephrobacter eiseniae EF01-2 plasmid pVEIS01, complete sequence | 19099-19130 | 11 | 0.656 |
1. spacer 4.6|2871102|26|LR134233|CRT matches to MN694362 (Marine virus AFVG_250M129, complete genome) position: , mismatch: 2, identity: 0.923
ttgcggctgctggtactgtggctgag CRISPR spacer ttgaggctgctggtactgtggcagag Protospacer *** ****************** ***
2. spacer 4.6|2871102|26|LR134233|CRT matches to NZ_CP042181 (Pseudomonas sp. KBS0802 plasmid unnamed, complete sequence) position: , mismatch: 4, identity: 0.846
ttgcggctgctggtactgtggctgag CRISPR spacer gtgcggctgctggtgctgtgcctgac Protospacer *************.***** ****
3. spacer 4.6|2871102|26|LR134233|CRT matches to NC_003350 (Pseudomonas putida plasmid pWW0, complete sequence) position: , mismatch: 4, identity: 0.846
ttgcggctgctggtactgtggctgag CRISPR spacer gtgcggctgctggtgctgtgcctgac Protospacer *************.***** ****
4. spacer 4.6|2871102|26|LR134233|CRT matches to MF288922 (Bacillus phage Janet, complete genome) position: , mismatch: 4, identity: 0.846
ttgcggctgctggtactgtggctgag CRISPR spacer ttgcggctgctggaactgtggttgtt Protospacer ************* *******.**
5. spacer 4.6|2871102|26|LR134233|CRT matches to MH593834 (Siphoviridae sp. isolate ctbe1, complete genome) position: , mismatch: 4, identity: 0.846
ttgcggctgctggtactgtggctgag CRISPR spacer ttgctgctgctggtactgtgactggt Protospacer **** ***************.***.
6. spacer 1.1|935901|32|LR134233|PILER-CR,CRISPRCasFinder,CRT matches to MF417837 (Uncultured Caudovirales phage clone 8F_7, partial genome) position: , mismatch: 5, identity: 0.844
ccagaacgatgacgttgaagtaccgaaaaata CRISPR spacer ccagaacgatgacgttgaggtaccgcgcaaca Protospacer ******************.****** . **.*
7. spacer 1.1|935901|32|LR134233|PILER-CR,CRISPRCasFinder,CRT matches to MF417913 (Uncultured Caudovirales phage clone 9S_2, partial genome) position: , mismatch: 5, identity: 0.844
ccagaacgatgacgttgaagtaccgaaaaata CRISPR spacer ccagaacgatgacgttgaggtaccgcgcaaca Protospacer ******************.****** . **.*
8. spacer 1.1|935901|32|LR134233|PILER-CR,CRISPRCasFinder,CRT matches to MF417916 (Uncultured Caudovirales phage clone 7S_13, partial genome) position: , mismatch: 5, identity: 0.844
ccagaacgatgacgttgaagtaccgaaaaata CRISPR spacer ccagaacgatgacgttgaggtaccgcgcaaca Protospacer ******************.****** . **.*
9. spacer 1.1|935901|32|LR134233|PILER-CR,CRISPRCasFinder,CRT matches to MF417911 (Uncultured Caudovirales phage clone 9AX_3, partial genome) position: , mismatch: 5, identity: 0.844
ccagaacgatgacgttgaagtaccgaaaaata CRISPR spacer ccagaacgatgacgttgaggtaccgcgcaaca Protospacer ******************.****** . **.*
10. spacer 1.1|935901|32|LR134233|PILER-CR,CRISPRCasFinder,CRT matches to MF417918 (Uncultured Caudovirales phage clone 7F_17, partial genome) position: , mismatch: 5, identity: 0.844
ccagaacgatgacgttgaagtaccgaaaaata CRISPR spacer ccagaacgatgacgttgaggtaccgcgcaaca Protospacer ******************.****** . **.*
11. spacer 1.1|935901|32|LR134233|PILER-CR,CRISPRCasFinder,CRT matches to MF417915 (Uncultured Caudovirales phage clone 8AX_8, partial genome) position: , mismatch: 5, identity: 0.844
ccagaacgatgacgttgaagtaccgaaaaata CRISPR spacer ccagaacgatgacgttgaggtaccgcgcaaca Protospacer ******************.****** . **.*
12. spacer 1.1|935901|32|LR134233|PILER-CR,CRISPRCasFinder,CRT matches to MF417914 (Uncultured Caudovirales phage clone 8S_2, partial genome) position: , mismatch: 5, identity: 0.844
ccagaacgatgacgttgaagtaccgaaaaata CRISPR spacer ccagaacgatgacgttgaggtaccgcgcaaca Protospacer ******************.****** . **.*
13. spacer 1.1|935901|32|LR134233|PILER-CR,CRISPRCasFinder,CRT matches to MF417920 (Uncultured Caudovirales phage clone 10F_8, partial genome) position: , mismatch: 5, identity: 0.844
ccagaacgatgacgttgaagtaccgaaaaata CRISPR spacer ccagaacgatgacgttgaggtaccgcgcaaca Protospacer ******************.****** . **.*
14. spacer 1.1|935901|32|LR134233|PILER-CR,CRISPRCasFinder,CRT matches to MF417898 (Uncultured Caudovirales phage clone 7AX_6, partial genome) position: , mismatch: 5, identity: 0.844
ccagaacgatgacgttgaagtaccgaaaaata CRISPR spacer ccagaacgatgacgttgaggtaccgcgcaaca Protospacer ******************.****** . **.*
15. spacer 1.1|935901|32|LR134233|PILER-CR,CRISPRCasFinder,CRT matches to MF417896 (Uncultured Caudovirales phage clone 3S_18, partial genome) position: , mismatch: 5, identity: 0.844
ccagaacgatgacgttgaagtaccgaaaaata CRISPR spacer ccagaacgatgacgttgaggtaccgcgcaaca Protospacer ******************.****** . **.*
16. spacer 1.1|935901|32|LR134233|PILER-CR,CRISPRCasFinder,CRT matches to MF417912 (Uncultured Caudovirales phage clone 10AX_4, partial genome) position: , mismatch: 5, identity: 0.844
ccagaacgatgacgttgaagtaccgaaaaata CRISPR spacer ccagaacgatgacgttgaggtaccgcgcaaca Protospacer ******************.****** . **.*
17. spacer 1.1|935901|32|LR134233|PILER-CR,CRISPRCasFinder,CRT matches to MF417910 (Uncultured Caudovirales phage clone 2AX_5, partial genome) position: , mismatch: 5, identity: 0.844
ccagaacgatgacgttgaagtaccgaaaaata CRISPR spacer ccagaacgatgacgttgaggtaccgcgcaaca Protospacer ******************.****** . **.*
18. spacer 4.6|2871102|26|LR134233|CRT matches to NZ_CP011543 (Corynebacterium mustelae strain DSM 45274 plasmid pCmus45274, complete sequence) position: , mismatch: 5, identity: 0.808
ttgcggctgctggtactgtggctgag CRISPR spacer gtgctgctgctggtactgcggctgct Protospacer *** *************.*****
19. spacer 4.6|2871102|26|LR134233|CRT matches to MF288919 (Bacillus phage AaronPhadgers, complete genome) position: , mismatch: 5, identity: 0.808
ttgcggctgctggtactgtggctgag CRISPR spacer ctgcggctgctgctactgcggctgct Protospacer .*********** *****.*****
20. spacer 4.6|2871102|26|LR134233|CRT matches to NZ_MF788070 (Raoultella ornithinolytica strain 23141 plasmid p23141-2, complete sequence) position: , mismatch: 5, identity: 0.808
ttgcggctgctggtactgtggctgag CRISPR spacer tttgtgctgctggtactgtggctgtt Protospacer ** *******************
21. spacer 1.4|936084|32|LR134233|PILER-CR,CRISPRCasFinder,CRT matches to CP022016 (Salmonella enterica subsp. enterica serovar India str. SA20085604 plasmid unnamed1, complete sequence) position: , mismatch: 6, identity: 0.812
--acgattatttgacgccagaagcagcagatgca CRISPR spacer ccacga--gattgacgccagcagcagcagacgca Protospacer **** . ********** *********.***
22. spacer 4.6|2871102|26|LR134233|CRT matches to NZ_CP029732 (Citrobacter sp. CRE-46 strain AR_0157 plasmid unnamed4, complete sequence) position: , mismatch: 6, identity: 0.769
ttgcggctgctggtactgtggctgag CRISPR spacer aaagggctgctggtactgttgctgat Protospacer . *************** *****
23. spacer 4.6|2871102|26|LR134233|CRT matches to MK033137 (UNVERIFIED: Pseudomonas phage UNO-SLW1, complete genome) position: , mismatch: 6, identity: 0.769
ttgcggctgctggtactgtggctgag CRISPR spacer gccttgctgctgggactgtggctgag Protospacer . . ******** ************
24. spacer 4.6|2871102|26|LR134233|CRT matches to KX449362 (Pseudomonas phage UNO-SLW3, complete genome) position: , mismatch: 6, identity: 0.769
ttgcggctgctggtactgtggctgag CRISPR spacer gccttgctgctgggactgtggctgag Protospacer . . ******** ************
25. spacer 4.6|2871102|26|LR134233|CRT matches to KX449363 (Pseudomonas phage UNO-SLW4, complete genome) position: , mismatch: 6, identity: 0.769
ttgcggctgctggtactgtggctgag CRISPR spacer gccttgctgctgggactgtggctgag Protospacer . . ******** ************
26. spacer 4.6|2871102|26|LR134233|CRT matches to NC_047873 (Pseudomonas phage UNO-SLW1, complete genome) position: , mismatch: 6, identity: 0.769
ttgcggctgctggtactgtggctgag CRISPR spacer gccttgctgctgggactgtggctgag Protospacer . . ******** ************
27. spacer 4.6|2871102|26|LR134233|CRT matches to KX449361 (Pseudomonas phage UNO-SLW2, complete genome) position: , mismatch: 6, identity: 0.769
ttgcggctgctggtactgtggctgag CRISPR spacer gccttgctgctgggactgtggctgag Protospacer . . ******** ************
28. spacer 2.13|953170|32|LR134233|CRISPRCasFinder,CRT matches to NZ_CP020932 (Marinobacter salarius strain SMR5 plasmid pSMR5, complete sequence) position: , mismatch: 7, identity: 0.781
-cagaccgatccgcgatcattttctgtggctgt CRISPR spacer tcacgac-acccgcggtctttttctgtggctgt Protospacer ** . * *.*****.** **************
29. spacer 1.8|936328|32|LR134233|CRISPRCasFinder,CRT matches to JX434031 (Pseudomonas phage JBD24, complete genome) position: , mismatch: 8, identity: 0.75
gtcacgaggtctgacgcggatgtgatg--agtta CRISPR spacer tcggcgaggtctgccgcgaatgtgatggcagt-- Protospacer . .********* ****.******** ***
30. spacer 2.7|953049|32|LR134233|PILER-CR matches to NC_012520 (Rhodococcus opacus B4 plasmid pROB01, complete sequence) position: , mismatch: 8, identity: 0.75
ggcgcgcgtgttgttggtgattccgcatatct CRISPR spacer cccccagatgttgttggtgattcttcatatct Protospacer * *. .***************. *******
31. spacer 2.9|952927|31|LR134233|CRISPRCasFinder,CRT matches to NZ_CP037868 (Hydrogenophaga pseudoflava strain DSM 1084 plasmid pDSM1084, complete sequence) position: , mismatch: 8, identity: 0.742
gcatcgcctggctgagtccgtcgctaactgc CRISPR spacer ccatcgcctcgctgagttcgtcgcccggtga Protospacer ******** *******.******. . **
32. spacer 2.11|953048|32|LR134233|CRISPRCasFinder,CRT matches to NC_012520 (Rhodococcus opacus B4 plasmid pROB01, complete sequence) position: , mismatch: 8, identity: 0.75
ggcgcgcgtgttgttggtgattccgcatatct CRISPR spacer cccccagatgttgttggtgattcttcatatct Protospacer * *. .***************. *******
33. spacer 3.1|953334|32|LR134233|CRISPRCasFinder matches to MN855803 (Bacteriophage sp. isolate 108, partial genome) position: , mismatch: 8, identity: 0.75
ggttaaccaggggtttttccccactatttcgc CRISPR spacer aggtaacgaggggtttttccccaatattgaaa Protospacer .* **** *************** **** .
34. spacer 2.5|952927|32|LR134233|PILER-CR matches to NZ_CP037868 (Hydrogenophaga pseudoflava strain DSM 1084 plasmid pDSM1084, complete sequence) position: , mismatch: 9, identity: 0.719
gcatcgcctggctgagtccgtcgctaactgcg CRISPR spacer ccatcgcctcgctgagttcgtcgcccggtgaa Protospacer ******** *******.******. . ** .
35. spacer 2.5|952927|32|LR134233|PILER-CR matches to MH651189 (Gordonia phage Schmidt, complete genome) position: , mismatch: 9, identity: 0.719
gcatcgcctggctgagtccgtcgctaactgcg CRISPR spacer agatcgccgggctgaatccgtcgctgcactcg Protospacer . ****** ******.*********. . **
36. spacer 2.9|952927|31|LR134233|CRISPRCasFinder,CRT matches to MH651189 (Gordonia phage Schmidt, complete genome) position: , mismatch: 9, identity: 0.71
gcatcgcctggctgagtccgtcgctaactgc CRISPR spacer agatcgccgggctgaatccgtcgctgcactc Protospacer . ****** ******.*********. . *
37. spacer 1.1|935901|32|LR134233|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP015745 (Shinella sp. HZN7 plasmid pShin-09, complete sequence) position: , mismatch: 10, identity: 0.688
ccagaacgatgacgttgaagtaccgaaaaata CRISPR spacer gctcaacgatggcgatgaagtaccgaagttgc Protospacer * *******.** ************.
38. spacer 2.8|953110|32|LR134233|PILER-CR matches to MG189906 (Stenotrophomonas phage vB_SmaS_DLP_5, complete genome) position: , mismatch: 10, identity: 0.688
cgggcgcattgtacgcgatgcagaccatgccg CRISPR spacer agaaacagctgtacgaaatgcagaccatgccg Protospacer *.. ..****** .***************
39. spacer 2.9|952927|31|LR134233|CRISPRCasFinder,CRT matches to NC_008771 (Verminephrobacter eiseniae EF01-2 plasmid pVEIS01, complete sequence) position: , mismatch: 10, identity: 0.677
gcatcgcctggctgagtccgtcgctaactgc CRISPR spacer tcatcgcctcgctgagttcgtcgccgtaaaa Protospacer ******** *******.******.. .
40. spacer 2.12|953109|32|LR134233|CRISPRCasFinder,CRT matches to MG189906 (Stenotrophomonas phage vB_SmaS_DLP_5, complete genome) position: , mismatch: 10, identity: 0.688
cgggcgcattgtacgcgatgcagaccatgccg CRISPR spacer agaaacagctgtacgaaatgcagaccatgccg Protospacer *.. ..****** .***************
41. spacer 2.13|953170|32|LR134233|CRISPRCasFinder,CRT matches to LR134122 (Klebsiella aerogenes strain NCTC10006 genome assembly, plasmid: 2) position: , mismatch: 10, identity: 0.688
cagaccgatccgcgatcattttctgtggctgt CRISPR spacer ccacttttaccgcgatcatgttctctggctgt Protospacer * . .. ********** **** *******
42. spacer 2.13|953170|32|LR134233|CRISPRCasFinder,CRT matches to NZ_AP014640 (Leptolyngbya boryana IAM M-101 plasmid pLBX) position: , mismatch: 10, identity: 0.688
cagaccgatccgcgatcattttctgtggctgt CRISPR spacer gcgactgatccgcgatcgttttctgcccatcg Protospacer ***.***********.*******. *
43. spacer 2.13|953170|32|LR134233|CRISPRCasFinder,CRT matches to NZ_AP018205 (Leptolyngbya boryana NIES-2135 plasmid plasmid2 DNA, complete genome) position: , mismatch: 10, identity: 0.688
cagaccgatccgcgatcattttctgtggctgt CRISPR spacer gcgactgatccgcgatcgttttctgcccatcg Protospacer ***.***********.*******. *
44. spacer 3.2|953395|32|LR134233|CRISPRCasFinder matches to NC_016113 (Streptomyces cattleya NRRL 8057 = DSM 46488 plasmid pSCAT, complete sequence) position: , mismatch: 10, identity: 0.688
aggggcgttccgcagtcgacaagggctgaaaa CRISPR spacer ccgggcggtccgcagtcgacgagggtgaggga Protospacer ***** ************.****. ....*
45. spacer 3.2|953395|32|LR134233|CRISPRCasFinder matches to NC_017585 (Streptomyces cattleya NRRL 8057 = DSM 46488 plasmid pSCATT, complete sequence) position: , mismatch: 10, identity: 0.688
aggggcgttccgcagtcgacaagggctgaaaa CRISPR spacer ccgggcggtccgcagtcgacgagggtgaggga Protospacer ***** ************.****. ....*
46. spacer 2.5|952927|32|LR134233|PILER-CR matches to NC_008771 (Verminephrobacter eiseniae EF01-2 plasmid pVEIS01, complete sequence) position: , mismatch: 11, identity: 0.656
gcatcgcctggctgagtccgtcgctaactgcg CRISPR spacer tcatcgcctcgctgagttcgtcgccgtaaaaa Protospacer ******** *******.******.. . .
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
136706 : 143835
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >LR134233|136706:143835|DBSCAN-SWA CTTACGCGTCGCGGTTCAGCCAGGCCATATACTCCGTGACGCCTTCGGCGACGGTCTTAAAGGGTTTGTCGTAGCCCGCGTTGCGTAGATTGGTTAAATCCGCCTGCGTAAACGCCTGATAGCGACCTTTCAGCTTATCCGGGAACGGAATGTATTCAATGCTGCCTTTTTTATGGTATGCCAGCGTCGCGTCGGCGACGGCCTGGAAAGATTCCGCACGGCCTGTGCCCAGGTTAAAGATGCCGGACTTGCCGCTTTCCAGGAACCACAGGTTAACGGCGGCCACATCGCCCACGTAAACGAAGTCGCGCTTGAAGTTTTCGCTGCCTTCAAACAGTTTCGGGCTTTCGCCGTTGTTTAACTGTGTATTCAGATGAAATGCCACGCTTGCCATGCTGCCTTTATGGCCTTCACGTGGTCCATAGACGTTGAAATAGCGGAAACCGACAATCTGCGAGTTCGCTTCTGGCAGGATCTGGCGCACATATTCATCAAACAGGAACTTAGAATAGCCGTAAACGTTAAGCGGTTTTTCGTATTCGCGCGATTCGATGAAATCAGACGTGCGACCGCCATAGGTGGCGGCCAGAAGAGGCATAGAGGAACGGGATTTCGCGCTCAAGACAATAGTGCAGCAGCTCTTTGGAGTATTGATAGTTATTATCCATCATATACTTGCCGTCCCACTCGGTGGTGGAAGAGCAGGCGCCTTCATGGAAAATAGCTTCGATATCGCCGAGCTCTTCTCCGGACATAATCTGGATCAGGAAATCTTCCTTATCCATATAGTCAGCAATGTTCAGATCCACCAGGTTTACAAACTTGGTGCCGTCTTTCAGGTTATCCACCACCAGAATATCGGTGATACCTTTATCATTCAGGGCCTTAACGATATTGCTGCCGATAAAGCCCCGCGCCGCCGGTAACGATGATCATAACTGTAACCTTTGAATTATGGTGTCAGAGACTGTCTCAGACATGAATGCTCTTATCATAACACTATTCATCCATTCACTGATAACCGTGAGGATACGCGTGATTTATGCTGCATTTTAAAGAAGTGATGACGCGTCATCTCGTATCAACGTATCGCTTTGGGTAATATGTGCTGAAATTTGCCCAGTCTGGAGAATCGCAATGCGTGGGGATTTTTACAAACAGTTAACCAACGATCTGGAAACCGCTCGGGCGGAAGGATTGTTTAAAGAAGAGCGCATTATTACGTCTGCGCAGCAGGCGGATATCACCGTGGCGGATGGAAGCCACGTTATTAACTTTTGCGCCAATAACTATCTCGGGCTGGCGAATCACCCTGAGCTGATTAATGCCGCAAAAGCGGGCATGGACAGCCACGGTTTCGGTATGGCGTCCGTGCGTTTTATCTGCGGCACCCAGGACAGCCACAAAGCGCTGGAGCAAAAGCTGGCAAGCTTCCTCGGCATGGAAGATGCGATTCTGTACTCCTCCTGTTTCGATGCTAACGGCGGCTTGTTTGAGACGTTGCTGGGCGCGGAAGACGCCATTATCTCCGACGCGCTGAACCACGCGTCTATCATTGACGGCGTGCGTTTGTGTAAAGCGAAGCGCTATCGCTATGCCAACAACGATATGGCAGAGCTGGAAGCGCGGCTGAAAGAGGCGCGTGAGGCCGGCGCGCGTCATGTCCTGATCGCCACCGACGGCGTGTTCTCAATGGATGGCGTCATCGCCAATCTGAAAGGCGTCTGTGATCTGGCCGATAAATACGATGCGCTGGTGATGGTTGATGATTCCCACGCGGTAGGCTTTGTCGGCGAAAACGGTCGTGGTTCCCATGAATACTGCGACGTAATGGGCCGCGTAGATATTATTACCGGTACGCTGGGCAAGGCGTTAGGCGGCGCATCCGGCGGCTATACCGCGGCGCGTAAAGAGGTCGTTGAGTGGTTGCGTCAGCGTTCCCGCCCATATCTGTTCTCCAACTCCCTGGCGCCGGCGATCGTGGCGGCTTCTATTAAAGTGTTAGAGATGGTGGAAGCGGGCGCAGAACTGCGCGATCGCTTGTGGGCGAACGCCCGCAGTTCCGTGAGCAGATGTCTGCCGCGGGATTTACGCTGGCAGCGCGGATCACGCCATCATCCCGGTAATGCTGGGAGATGCGGTTGTCGCGCAGAAATTTGCCCGCGAACTGCAAAAAGAGGGCATTTACGTTACCGGTTTCTTCTATCCGGTCGTACCAAAAGGTCAGGCGCGTATTCGTACCCAGATGTCTGCGGCGCATACGCCTGAGCAAATTACGCGTGCGGTAGACGCGTTCACGCGTATTGGTAAACAACTGGGCGTGATTGCCTGAGGATGTAAGATGAAAGCGTTATCCAAACTGAAAGCGGAAGAGGGCATCTGGATGACCGACGTTCCGGAACCGGAAGTCGGCCATAACGATTTGCTGATTAAAATCCGTAAAACAGCCATCTGCGGTACTGACGTTCACATCTATAACTGGGATGACTGGTCGCAAAAAACCATCCCGGTTCCGATGGTCGTGGGGCATGAATATGTCGGCGAAGTGGTCGGCATCGGTCAGGAAGTGAAAGGCTTTAAAATCGGCGATCGCGTCTCCGGCGAAGGTCATATCACCTGCGGTCATTGCCGCAACTGCCGTGGCGGTCGTACTCACCTGTGTCGCAACACCACCGGCGTGGGTGTCAATCGTCCCGGCTGCTTCGCGGAATACCTGGTCATCCCGGCGTTCAATGCGTTTAAAATCCCGGATAACATTTCTGATGATTTAGCCTCTATTTCGACCCGTTTGGTAATGCGGTGCATACGGCGCTGTCTTTCGATCTGGTCGGCGAAGATGTACTGGTATCGGGGGCGGGGCCAATCGGTGTAATGGCCGCCGCAGTGGCGAAACATGTCGGCGCGCGTCATGTGGTGATTACTGACGTCAATGAATACCGTCTGGAGCTGGCGCGTAAAATGGGCGTCACCCGCGCGGTCAACGTCGCGAAAGAGAGCCTGAACGACGTCATGGCGGAGCTGGGAATGACCGAAGGGTTCGATGTGGGTCTGGAGATGTCCGGCGCGCCGCCGGCGTTTCGTACCATGCTGGACACCATGAATCACGGCGGTCGTATTGCGATGCTGGGGATTCCGCCATCAGATATGTCTATCGACTGGACAAAAGTTATCTTTAAGGGCTTGTTCATTAAAGGTATTTATGGTCGTGAGATGTTCGAAACGTGGTACAAAATGGCGGCGCTGATCCAGTCCGGTCTGGATCTGTCACCGATTATCACCCATCGTTTCTCTATTGATGATTTCCAGAAAGGTTTTGACGCCATGCGTTCAGGCCAGTCAGGAAAAGTTATTCTGAGCTGGGATTAATATGTATATACCCAAAATAATTCGAGTTGCTTAAAGGCGGCAAGGGAGTGAGTCCCCAGGAGCATAGATAACTATGTGACTGGGGTGAACGACAAATCTGCCGGGAGCAGATTTGAACGTCGCTTGCGACGGCCCGTCAGGGCGAGGCCCATGGATGGGCCGAGTAATCCGTAGCCAACACATAAGCAACTTGAAGTATGAAGGGTATAGAGACCCGATAATTTTATCGGGTCTGATATACGTTTCATCGTAAAAGGTTGGTGTCATTCGCCAACCTTTTTTGTTAGGGAAAATCTGGAAAGCCGTAAAGAATTGTCATAGACATCAAGCATTCGTAATTGCGCTTTACTCTTATTTTACTCGCTAACGTCACGCTCTACTCTGAGTTTTGTGCTTGCTTTTTACTGTAAAAATTAATTATGGCGGCTTAATAGTTTCTTAATAGAGCCACAGTATAAAGGCAGGGTAAATTAAGGTTTTTCTGGTAATCGTTATGAAAAATAGTAAAACCAAAGTGAGTATCATTGTCCCGTTATATAATGCGGGAGCGGATTTTAATGCTTGCATGGCGTCGTTAATCGCGCAAACGTGGTCGGCGCTGGAAATTATTATTGTGAATGATGGATCGACGGATCATTCCGTTGAGATAGCAAAACATTACGCGGAACATTACCCGCATGTTCGACTGCTTCATCAGGCCAATGCTGGCGCATCTGTCGCCCGTAATCTTGGCCTGCAAGCAGCGACCGGCGATTATGTCGCCTTTGTCGATGCAGATGACCTGGTCTACCCGAAGATGTATGAAACGCTGATGACCATGGCGCTTAATGATGATCTGGACGTTGCGCAGTGCAACGCGGACTGGTGCGTCCGAAAAACCGGGCACGCCTGGCAATCTATTCCGACCGATCGCCTGCGCTCCACCGGGGTATTAAGCGGACCGGATTGGTTGCGTATGGCGTTGGCCTCGCGGCGCTGGACGCATGTTGTCTGGATGGGCGTTTATCGACGTGCGCTAATTATCGATAACAATATTACTTTCGTTCCCGGACTGCATCATCAGGACATATTATGGTCGACGGAAGTTATGTTTAATGCCACGCGCGTACGTTATACCGAACAATCATTATATAAATATTTCCTGCATGATAATTCGGTAAGCCGTTTGCAAAGACAAGGCAATAAAATTCTTAATTATCAGCGGCATTATATTAAAATTACGCGATTATTAGAAAAGCTCAACCGTGATTATGCCCGGCGTATTCCGATTTACCCGGAGTTTCGCCAGCAAATTACCTGGGAAGCGTTACGCGTTTGTCATGCGGTGCGTAAAGAACCTGATATTTTGACCCGCCAGCGTATGATTGCCGAAATTTTTACTTCTGGCATGTATAGACGGATGATGGCTAACGTCCGCAGCGCGAAAGCGGCTTATCAGACGCTGCTCTGGTCTTTCCGGCTGTGGCAATGGCGCGACAAAACTTTGTCGCACCGTCGTATGGCCCGTAAGGCGCTCAATCTGTCTTAGCGCTCACGTTTTTAGGCGCGGCGATTTTCCCCCAGCCTTGCCACTGGTGCTGAAACCAGACAACCACGGAACTTTGCGTAATGCTCTCGCCGATGACGCTGAAAAAGCGCGTAGCGTAGACCGGTTGCAGCGGTTTTTTCGGCTTGCACATCTTCACGCCGCGAAAGGGATTTCGCGGCGCGTCAATTTTCTGCGGCGTTACACCAGGTCGGGACGTATCTACCTGCGGTTCGTTGAGCAGGCTGCCTGGACGTACCAGGGTGATATCCGCCGGCAGGCGATAAACCATCTGTTGCAGCACGCGAACCGTTGCGGGATGCGGATGACCAATCGCGATAGCGGAACCGTTGCGACGGGCCAGTTCGATAGCGCGATTAAACTGACGACGGATATCCGCCTCGTTTTGCGTATCGTCGAGGAACACTTTGCGCTTGATCACTTTCACACCCGTACCGGATGCCGCGCGCATCGCCTGGCTATTGCCAATCGTCATGCTGTCGAGAAAATAGAGATTGTAATGTTCCAGCGCCTGCATAACTTTTTGCATACCGAACAGGCTGGAAGTCATTGCGCTGCCCATGTGGTTATTAAGCCCGACGGCATACGGCACGTTGTTTACCGCCTCGCGGATAATGCGCTCGATCTCATCGCTGCTCATATCTGGTCGCAGCGTATCCTTCTCCAGCGGCTGTTTGCTTAGCGGCGCCATCGGCAGATGGATTAACACCTCATGCCCGCTATTGTGCGCTTTAGTTGCCATTTCGCGCGCGTGCGGCGCGTTGGGCAGTACGGCGACGGAGATGTTTGGCGGCAGCGCCAGAACCTGGTTTTCCGTATGCGGGCGATAGCCAAAATCATCAATCACGATGGCGAGCTTGCCAGCGAAAACGGGATGTGCAAACGCCAGCAAAGTGGCCAGCGTGAGAATGGAGCGACGAAACTGAGGCAAAACTTATCTTCCCAACCACGGCTGTGGATTGACGGCCTGACCCTGGCGACGAATTTCGAAATAAAGCGAAGGTCTGCCCTGACCGCCGCTACTGCCCACAAGGGCGATAGGTTGTCCGGCGCGCACCTGCGCGCCTACGCTGACCAGCGCGCTCTGATTATAGCCGTAAAGGCTCATATCGCCTTTACCATGCTCCACGACCACCACCAGGCCATAGCCTTGCAGCCAGTCCGCCAGAATCACCCTGCCGTCGGCGATGGCTTTGACCTCGGTGCCTTCCGACGCGCCGATCACCATCCCCTTCCAACGTAGCTCACCCTGCAGCCGTTCGCCATAGCGATGCAATATCGGGCCGCGTACCGGCCAGTATGCCTGACCGCGAGGCGCGCCCAGACCGCCGGTGCGGGACATTAGCGAACGCTCGCTTTCCGTCGGCTTATAGGTGCTGCCTTTACGCGAGGCTTCCTGCTGCTTATCGCGTACCGCCTGCGCTTCGCGGGCTTCACGCTCCGCTCGCGCTTTGGCCGCCGCTTCCGCCCGCGCGATACTGTTGCGTAGGCGTGATTCGTTGGCGCGTAGTTCGCTTAACTGTTGCTGGCCTTGCTGAATAGACGACTCCAGTCCGGCAAGCGTTTTCTTACGTTCATTACGCGCCTGCTCCAGCTTCGCCTGCTGGGCGCGCTGTTCATACAGCAACGTTTGTTGTTGGCTTTGCTTCTCTTCCAGCTCGGCTTTTTGCGTGGCGACCTGTTCGCGAGTCTGTTTCAGTTCGGCGATAGTCTCCTGGCGCGCCTGGTTCAGATAGCCGAAGTAGGCCTGCAGGCGTTGCCCACGCTGGCTCTCTTCTCCGCTAAGAATAAGCTGAATGCCGGTGTGTTCGCCCTGGCGGAAAGCGGCGTCCAACTGCGCGGCAAGATTACGCTCCTGGCTGGCTTTTTGCTGCTCCAGTTTGGCAATTGAGGCGTTCATCTCATCAATTTGCGCGTTTAACTGATCCAGAGTGCTTTGCGTTTCGCGCAGCTTACGCGCTGCGGCCGAGATGGCCTCTTCCTGGGCTTTCAGTTGTGCCAGGAGGCTGGCGCGCTGTTGCTGCTGCTGGCGTACGTCGCGTTCTTTCGCGGCGATATCGGCCTGAATAGATTTGAGTTGATCACGGTCATCCGCGTGGGCGGAAAAGGCGCACAACAATACGCCAGCGCTAAGTACGCTGGCGTAGAACAAAGGCCTGACTGAAAACGTTCTTGGTTTCACGGCCTGTTGAATGGTGTTAATCGCCTTTCCCCTCAT
Protein sequences of DBSCAN-SWA_1 >LR134233|136706:143835|137842_138973_+|VEC90151.1|DBSCAN-SWA MRGDFYKQLTNDLETARAEGLFKEERIITSAQQADITVADGSHVINFCANNYLGLANHPELINAAKAGMDSHGFGMASVRFICGTQDSHKALEQKLASFLGMEDAILYSSCFDANGGLFETLLGAEDAIISDALNHASIIDGVRLCKAKRYRYANNDMAELEARLKEAREAGARHVLIATDGVFSMDGVIANLKGVCDLADKYDALVMVDDSHAVGFVGENGRGSHEYCDVMGRVDIITGTLGKALGGASGGYTAARKEVVEWLRQRSRPYLFSNSLAPAIVAASIKVLEMVEAGAELRDRLWANARSSVSRCLPRDLRWQRGSRHHPGNAGRCGCRAEICPRTAKRGHLRYRFLLSGRTKRSGAYSYPDVCGAYA >LR134233|136706:143835|137262_137556_-|VEC90150.1|DBSCAN-SWA MVDNLKDGTKFVNLVDLNIADYMDKEDFLIQIMSGEELGDIEAIFHEGACSSTTEWDGKYMMDNNYQYSKELLHYCLEREIPFLYASSGRHLWRSHV >LR134233|136706:143835|139576_140071_+|VEC90153.1|DBSCAN-SWA MAAAVAKHVGARHVVITDVNEYRLELARKMGVTRAVNVAKESLNDVMAELGMTEGFDVGLEMSGAPPAFRTMLDTMNHGGRIAMLGIPPSDMSIDWTKVIFKGLFIKGIYGREMFETWYKMAALIQSGLDLSPIITHRFSIDDFQKGFDAMRSGQSGKVILSWD >LR134233|136706:143835|136706_137303_-|VEC90149.1|DBSCAN-SWA MPLLAATYGGRTSDFIESREYEKPLNVYGYSKFLFDEYVRQILPEANSQIVGFRYFNVYGPREGHKGSMASVAFHLNTQLNNGESPKLFEGSENFKRDFVYVGDVAAVNLWFLESGKSGIFNLGTGRAESFQAVADATLAYHKKGSIEYIPFPDKLKGRYQAFTQADLTNLRNAGYDKPFKTVAEGVTEYMAWLNRDA >LR134233|136706:143835|139046_139577_+|VEC90152.1|DBSCAN-SWA MKALSKLKAEEGIWMTDVPEPEVGHNDLLIKIRKTAICGTDVHIYNWDDWSQKTIPVPMVVGHEYVGEVVGIGQEVKGFKIGDRVSGEGHITCGHCRNCRGGRTHLCRNTTGVGVNRPGCFAEYLVIPAFNAFKIPDNISDDLASISTRLVMRCIRRCLSIWSAKMYWYRGRGQSV >LR134233|136706:143835|140564_141599_+|VEC90154.1|DBSCAN-SWA MKNSKTKVSIIVPLYNAGADFNACMASLIAQTWSALEIIIVNDGSTDHSVEIAKHYAEHYPHVRLLHQANAGASVARNLGLQAATGDYVAFVDADDLVYPKMYETLMTMALNDDLDVAQCNADWCVRKTGHAWQSIPTDRLRSTGVLSGPDWLRMALASRRWTHVVWMGVYRRALIIDNNITFVPGLHHQDILWSTEVMFNATRVRYTEQSLYKYFLHDNSVSRLQRQGNKILNYQRHYIKITRLLEKLNRDYARRIPIYPEFRQQITWEALRVCHAVRKEPDILTRQRMIAEIFTSGMYRRMMANVRSAKAAYQTLLWSFRLWQWRDKTLSHRRMARKALNLS >LR134233|136706:143835|142551_143835_-|VEC90156.1|DBSCAN-SWA MRGKAINTIQQAVKPRTFSVRPLFYASVLSAGVLLCAFSAHADDRDQLKSIQADIAAKERDVRQQQQQRASLLAQLKAQEEAISAAARKLRETQSTLDQLNAQIDEMNASIAKLEQQKASQERNLAAQLDAAFRQGEHTGIQLILSGEESQRGQRLQAYFGYLNQARQETIAELKQTREQVATQKAELEEKQSQQQTLLYEQRAQQAKLEQARNERKKTLAGLESSIQQGQQQLSELRANESRLRNSIARAEAAAKARAEREAREAQAVRDKQQEASRKGSTYKPTESERSLMSRTGGLGAPRGQAYWPVRGPILHRYGERLQGELRWKGMVIGASEGTEVKAIADGRVILADWLQGYGLVVVVEHGKGDMSLYGYNQSALVSVGAQVRAGQPIALVGSSGGQGRPSLYFEIRRQGQAVNPQPWLGR >LR134233|136706:143835|141585_142548_-|VEC90155.1|DBSCAN-SWA MPQFRRSILTLATLLAFAHPVFAGKLAIVIDDFGYRPHTENQVLALPPNISVAVLPNAPHAREMATKAHNSGHEVLIHLPMAPLSKQPLEKDTLRPDMSSDEIERIIREAVNNVPYAVGLNNHMGSAMTSSLFGMQKVMQALEHYNLYFLDSMTIGNSQAMRAASGTGVKVIKRKVFLDDTQNEADIRRQFNRAIELARRNGSAIAIGHPHPATVRVLQQMVYRLPADITLVRPGSLLNEPQVDTSRPGVTPQKIDAPRNPFRGVKMCKPKKPLQPVYATRFFSVIGESITQSSVVVWFQHQWQGWGKIAAPKNVSAKTD |
8 | Prochlorococcus_phage(16.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
475331 : 515849
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >LR134233|475331:515849|DBSCAN-SWA TTTACGCAAAATTTTCGAAGTATGCCTCCAACGCCTCCAGCTGTTCGCTGGCGTCCTCAATAGCGTTGAATGTGCGCCGAAACTGGTCGTCTGGAGCGTGCTCCTGGAGATACCAGGAGACGTGTTTACGCGCGATTCGGTACCCTTTTGCCTGACCGTAAAAGTCATGCAGTTCCCGAACATGCGTGCAAAGCAAGCGCTTTACCTCTGCCAGCGGCAGCGGCGGAAGCAGCTCCCCAGTGTCCAGATAATGCTGGATTTCCCGAAAGATCCAGGGTCTTCCCTGAGCTGCGCGGCCTATCATCAGGGCATCCGCCCCTGTATAGTCGAGAACAGCTCTGGCTTTATGCGGGTTAGTAATGTCGCCATTCGCGATAATCGGAATGGAAACTTTCTGCTTAACTGCCCGAATACTGTCGTATTCAGCTTCTCCGTTGAACAAACAGGCGCGGGTGCGGCCATGGATGGTCAGAGCCTGAATACCACAATCTTCAGCCAGTTGGGCAATCTCTACGCAGTTACGGTGTTCCGGCGCCCAGCCGGTGCGAATCTTGAGAGTCACAGGAACGTCCACTGCGTTTACGACCCCGATGAGTATAGACTTCACTAAATCCGGGTACTGCAAGAGGGCTGAACCTGCCAGCTTACGATTCACTTTCTTCGCCGGGCACCCCATATTGATATCAATAATTTGGGCGCCGCTTTCCACGTTAATACGTGCGGCATCGGCCATCTCAACAGGGTCGCTACCGGCAATTTGCACCGTGCGAATTCCTGGCTCATCAACGTGCACCATCCGTAAACGAGATTTATCGCTTTCCCACACTTGCGGGTTAGAAGACATCATCTCGGATACTGTCAATCCTGCTCCCATCTCATAGCACAGCGTCCGGAAAGGTCTGTCAGTAATGCCAGCCATGGGCGCTGCGATCAGGCGATTTCTGAGCTGATATTGTCCGATGCGCATGAGTTAAGAAATGACCATACTGTGACTGCAAGGCGGCGTATATTACGCATTTTTTGCACGAGATGAAAGGCCAAACTTTGAACAATCCACTGTTGTAGATCAATGAATCGTGCTTAAAAACAGCAAAATAAATTTATAATTCAGTAAAATCATATGGTTATTTATTGTCGGCGGTTTTGTACGCGCTTACAAATTCCCGTAACTTCTCTCTATTTCTTCGACCAGAAACAAGATAAGCGGAATAATCTGCTTTTTTTATAAGCTAATTACCCCAAATGAATGGGATCGCCGTCTCGTTTCAGGACAAGATCACTTCCCGCTGCGCGTGACTTTTCTCCGTTGCCAGTCGGAAGGGGGCGCACAGATGCGCTTTTGTGTTCATGAATGGCACGGATAAATTTTTCCAGCTGCGTTGAAGAGAGGGTTAGCGGATGCTGCAGAACAACCCGGACAGCATTTACTAAGCACGAGGGCGATGGTTGCGGGCCATTGAAGCGCCAGTACGTTTGTCCGCTGGTAGCAGCTGTTTATCCAGACGGATAGTGACAAAGAAGTTATTTCAGGCATTCATTTGATGAATTTAATGATGTCTCGGTGACAGAAAGGCGTTAAGGCAGTTGTGTGGAGAAAAGCGTGGCCGGGCGACCACGCTGACAAGAATTATTTTTTACGGCCAGTAATGCGGCACCATTCTTCTTTTTCGACGACCGGGTCGAGAGTGAAAAGCTCGGCGTAAGCATCGCAGACGCTTTCCGCCTGGCTGGCGAGAATACCTGAAAGCCCCAGCAGACCGCCCTCAACGGGCAGTACGCTGATTAACGGGGCCAGTTCGCGTAATGGGCCTGCCAGGATGTTAGCGACGACCACGTCGGCTTTCATGGCTTCTGGCTGGTCTTTCGGCAGGTACAGCTCCAGCCGGTCAGACACGCCGTTGCGTTCAGCATTATCGCGACTGGCCTGAATAGCTTGCGGATCGATATCAATACCAATCGCCTTTGCCGCTCCCAGTTTCAGCGCGGCAATCGCCAGAATGCCTGAACCGCAGCCAAAGTCGATCACGGTTTTGCCGTTGAGATCGAGCCCGTCCAGCCATTGCAGACATAAAGAGGTGGTGGGGTGCGTACCCGTACCGAACGCCAGTCCCGGGTCGAGCATCACGTTGACGGCATTCTCGTCCGGAATGTCACGCCAGCTGGGACAAATCCACAGACGCTCGCCAAAACGCATCGGATGGAAATTATCCATCCATTCGCGTTCCCAGTCTTTATCTTCCAGTTGTTCGATTTTATGGGCAAATCCGGCACCCAGCAGCGGATGTTGCTCCAGAATCGCGACAACGTCTTTCATGTCGGTTTCGGCGTCGAACAGACCGATCACATCGGTATCGCCCCACAGACGGGTTTCGCCCGGCAACGGCTCAAAAACCGGCGTATCGTGCGTATCCTGAAAGGTAATGGAAACGGCGCCCGCTTCCATCAGCGCATCACTCAGCTCTTCAGCATTCGCGCCGGTGGTATTCAGTTTCAATTGGATCCATGGCATGGCAAAACTCTTTATTTATCAGTAGACAATACGGTGGCTTGCGAAGCGGAAGAGCCGAATCGGTTTCCGATCAGAAAAGCCAATAAACTTAACAGTAACGCGGGCACAATCGGATGAAAGCCCAGGTACTGAATATTAAATGTGGCAAGTAGGGCATATAGCACGCCGCCCACGATCATGGCGCTTAGCGCGCCTGCCGCGTTTGCCCGCTCCCAGTAAAGGCCCAATACCAGCGGCCATAAGAACACCGCTTCCAGGCCGCCAAACGCCAGCAGGTTGAGCCAAATGATCATCTCTGGCGGTTTCCACGCGGCCAGCAGCAACAGCGCGCCTAGCAGTAATGTAATCACTGCCGACATACGCTTCAGCCGGATCTCATTTTGCATCTGATCGGGGCGCAGGTTGAGATAGAGATCTTTAATGATCGTCGCGGAACTTTGCAGCAGTTGGGCGTTAATTGTTGACATGATCGCCGCCATTGGCGCAGCAAGGAAGATCCCGGCGGCAAACGGCGGCAAAACTTTCACCATTAGCGTTGGGATCACCAGATCCGGCACCGTCAGATCCGGAAGCACCGCGCGACCCAGCGCGCCGGCAAGGTGCATCCCGAACATTAAAATCGCCACGACAATCGTACCGATAATAATGCCCCGGTGTACCGCTTTACTGTCTTTATAGGAGATGCAGCGCACCGCCGTATGCGGCAGGCCAATCACACCAAAACAGACCAGTACCCAAAAAGAGGTCATAAAGGCGGGAGAAAGTATGTCATCGGCGCCCTGCGGCGTTACCAGTTTGGGGTCGAGAGCGTGCAGCGTGTCAACGGCCTGACTTAGACCGCCCGCCGCATGCACCACGCCCACCAGCAGCACTATCGTGCCGACCAGCATCACCAGGCCCTGTAGCGTATCGTTAAGTACGCTGGCGCGAAAGCCGCCAAACGCCGTATAGAGTGCGATACTGACGCCAAATATCAGCAGGCCGGTTTCGTAGGGAATTCCCGCCGCGGTTTCCAGCAGTCGCGCGCCGCCGATAAATTGTACGGTCATTGCGCCAATAAACGCGACCAATAAACTCAGGCTTGCCAACCACACCAGCAGGCGGCTTTGGTAACGCGCGAATAGCATATCGTTAAGCGTGACGGCATTATAGCGACGCGCCAGGATAGCGAATTTTTTACCCAGAATACCCAGTGAAAGCCAGACGGCGGGAAGCTGGATCATCGCCAGCAAGACCCAGCCCAGACCGTATTTATAAGCCGCGCCGGGACCGCCGATAAATGAACTTGCGCTAATATAGGTGGCGGTAAGCGTCATCGCGAGCACGATCCCGCCCATCGAACGGCTGCCAAGAAAATATTCGTTTAAAAAAGTGCCGGCGGTACGCTTGCGCATAGCGTAAATCGAGACCCCGAAGACCACGATCAGATAGGCTACCAGCGGTAGAATGACCTCAAGCTGCATCGTCATCCTCCAGCGAAATATCGCGATAAATGAATTTCACCATCGCCCAGCACAGCAAAATAAAGACCAGCGGCGTCAATAAGCAGGCCATCTCGAACCAGTGCGGCAGACCGGTAATTCCCGGCGAGTCTCCAGGTAAGTAAGCAGCCACTAACCATGCCGCCAGATAGCAAAGGGTCAGCCACAGCGCCCAGCGCGCCTCTTTATGGGCCTGAACAAAACGAGCGTCCATTTTTTGTCCCTTATGGGTGAAGAAAGCGGGAATTGTACATGATGGGGCATCGCTATCCCCAGAAATAAAAAAGGCCGGATTCACCGGCCTTTTGAGATTAACGTGCGATTACTTTTCCTGAAGACCGAGTTTTTTCTCCAGATAGTGGATGTTGGTTCCACCGTGCTGGAAGTGCTCGTCATTCATGATGCGGGTCTGCAGATCGATATTGGTTTTGATACCATCGATAATCAGTTCCTGCAGGGCATTTTTCATACGGGCAATCGCCACGTCGCGGTTTTCACCGTAGCAGATGAGTTTGCCGATCATGGAATCATAGTACGGCGGCACCGTGTAGCCCGCGTAGATATGAGATTCCCAGCGAACGCCAAAGCCGCCAGGCGCATGGAAGCGCGTGATTTTGCCAGGGCTTGGCAGGAAGGTGTTCGGATCTTCGGCATTGATACGGCATTCTACCGCATGGCCTCGAACGACAACTTCGTCCTGTGTGATCGACAGCGGCTGACCCGCCGCGATGCGCAACTGCTCTTTGATCAAATCGACGCCGGTAATCATTTCAGTCACCGGGTGTTCAACCTGAATACGGGTGTTCATTTCGATGAAATAGAACTCGCCGTTTTCGAACAGGAATTCGAACGTCCCTGCGCCGCGATAGCCGATGTCTACGCACGCTTTCGCGCAGCGTTCGCCGATATAGCGACGCAGTTCCGGCGTAATGCCTGGCGCCGGGGCTTCTTCAACCACTTTCTGGTGGCGACGCTGCATGGAACAGTCGCGTTCCGCCAGATAGATAGCGTTGCCCTGGCCGTCCGCCAGCACCTGAATTTCGATGTGGCGTGGATTTTCCAGGTATTTTTCCATGTATACCATGTCGTTGCTGAAAGCCGCTTTCGCTTCCGCTTTGGTCATGGAGATGGACTGCGCCAGTTCAGCATCGCTACGAACCACGCGCATACCGCGGCCGCCGCCGCCGCCGGACGCTTTGATGATCACCGGATAGCCAATACGTTGGCATGAGCGCGGTTCGCATTCATATCGTCGCCCAGCGGGCCGTCAGATCCTGGTACGGTCGGCACGCCCGCTTTTTTCATCGCGGTAATCGCGGACACTTTATCGCCCATCAGGCGGATGGTATCCGCTTTCGGGCCGATAAAGATAAAGCCGGAGCGTTCAACCTGCTCGGCAAAATTGGCGTTCTCAGAAAGGAAGCCGTAACCCGGGTGGATTGCCACCGCGCCGGTGATTTCAGCGGCGCTAATGATAGCCGGGATGTTCAGATAACTTTTTACGGACGGTGCCGGGCCAATACAGACCGTCTCATCCGCCAGCAATACGTGTTTTAAATCGCGATCCGCGCTTGAGTGCACAGCGACGGTCTTGATGCCGAGTTCTTTACAGGCTCGAAGAATACGTAGTGCGATCTCGCCGCGGTTGGCGATGACAATTTTATCCAACATGTTCGCCTCGTTACTCGATGACGACCAGTGGCTCGTCAAATTCTACCGGTTGACCACTTTCGACCAGAATCGCTTTCACAGTACCTGCTTTGTCAGCTTCGATCTGGTTCATCATTTTCATGGCTTCAACGATGCACAGGGTATCGCCCACGTTCACTTTCTGGCCCACTTCGATGAACGCTTTCGCGTCCGGGCTTGGGGTGCGGTAGAAAGTACCAACCATTGGGGAGCGTACGATGTGACCACTGATTTCCGCTGCGGCAGGCGCTTCCATTGCTGGAGTAGCCGCTGGCGCGACAGCGTTAGACAGAGCGGGTTGCTGCATCATCGGCGCCGCATAAGCCTGCTGCATCACCGGAAAACCGGCGTTTGCCGTTGTGCGGCTGATGCGAACAGATTCTTCGCCTTCAGAAATTTCCAGTTCGGAGATACCTGATTCTTCAACCAGCTCGATCAGTTTTTTAATCTTACGAATATCCATGAGTGGGTTCCGTACTCTTTGTTTAGTGTGATTGTGACAGGCGTTTTAGCGCCGTCTGTAAAGCATGTGAATGACTGCCGCGGGTAAAATACCGCGCATCTTTGATTTGCCTGTTATGCCGTAGCACCCGAAGGTGAAATTCATGATTTGCCAGTCATTGTGTGGCTATCTTCTGCCATTTTCGGGCAAAAAACAAAATATACCTTCATTGCTCGATTGTCACCCTTTCTGCGAACGAAAATCGCCGTTGGGCAATGTAAAGTCGCACATTATAACTATTTCGTCGCAATTGGCAGCTAAATACTGGTCTTATCAGGGAAGATAATCAACCCGAAACATAAAAAGGACATTACCTGTTTTGCGTTTTGCAATCGACCGCACAGTTCACCAAACTAGCGCCACCACTGGCGAAACTTCCGGTAACGCAACGCTAAAAGGGCTAACGCTAATGCCGCGTAGATGACCGGTTGCGGTGATAAAATCTTTACCGACCACAGGTAATGTATGGGAGCCAGGATCGCGACAAGATAGACGACGTTATGCAGTGTTTGCCAGCGTTTGCCCAATTTACGCTGCGCAAATTGCGTTGAGGTCAACGTCAGCGCCAATAACACCAGCCAGCTAATGATCCCAAGCGTCAAATAAGGGCGTGAAATCAACTCGCTTCCTAACAACGCCAGGTTATGTATCCCCAGTTCCAGTAAGGCATAGCTGGTTAAATGCAGCGTAGCCCAAACGAAACACCATAATCCTAACAGGCGGCGTGTGCGTATCAATAATGGCTGTTTAGCGTAGCGCGCCAGCGGTGAAACCAGCAAGGTGGCGAGCAAGAATTTGAGCGCGGTTCTGCCGGTAAAGTGTTGGATATCCTTTACCGGGTCTGCGCTGAGTCCACCGTGATTTATCGCCCAAAAGAGCCACAGAAGCGGCAAAAAACCGGCAAGATGCAGACAAACTTTTAGCCAGGTTATCTGTTTTGCTGTCAGACGCACTTAAAAATTCTCCCGCAAATTGAGACCGCGATACAGCGAAGCGACTTCATTGGCGTAGCCGTTAAACAGCAGCGTCGGCTGTCTTTGCACATCGAGGATACCGCCTGAACCAATAAAGCGTTCGGTAGCCTGGGACCAGCGTGGATGATCCACATGCGGGTTCACATTGGCGTAAAAACCATATTCGTTGGGAGCCGACAAATTCCAGGTGGTTGGCGGACGTTCGCGGGTGAGTTTAATGCTAACAACAGATTTAATACCTTTAAAACCATACTTCCATGGAACGATGAGTCGAATAGGCGCGCCGTTTTGCGGGGGTAACGCCTTACCATAGACGCCAACGGTCATCAGAGTCAGCGGATGCATGGCTTCGTCCAGACGTAGCCCTTCGACATAAGGGTATTTCAGTCCGCCGCCAATAAAGCGATCTTTCTGTCCTGGCATATCATCCGGCGCGTATAGCGTTTCGAATGCCACATATTTAGCGTGGCTGGTGGGCTGTGCCTGCGCGAGTAGCTTATATAAAGGGAAACCAATCCACGGCACGACCATGGACCACGCTTCGACGCAGCGCATTCGATAGATACGCTCTTCTAATGGGAAACGATGTGTTAAATCGTCATAATCCAGCGTAAATGGCTTCGCGACTTCCCCGCTGATTTTCAACGTCCACGGTTCGGTTTTCAGACTTCCGGCATTGGCCGCCGGGTCGGCTTTATCAAGGCCAAACTCATAGAAATTGTTGTAGCCCGTCACCTTATCTTCCGGCGTTAACGCTAAATCGCTTCGCCAGGCGGCAGGCTGACTAAACTCAAGCGGTTTACCGGCAGGCGCTTTCGGACGATCGTTGCCTTTAAACCAACTGAAGAGATCGGCCTGCGCCGTTGAGGGTAAGGATAAGGCGGCCGCGCTGATGCCTAATGCTTTTAGCACCTGTCGGCGCTGCATAAAAAAAGCCGATTCCGCAGTCACATCGGCTTCTGTTAATGGACGTATCTTTTTCATCACAGACTCCCGGTCTACCCACCACAACATGGCGTGTAGTGAATATGACAGGGAGTGTGTCAATGTGTAAAAATAAATAGTTAATTAAGTGTAAATTTTTTTATACGTGACGTTATTTTATCTTCACCAGCGTACGGCCCTGAACCTGATTATTGATGATGGCATCGGCGAACTTCGGCGCATCCGCCAGCGTAATCTCTGTGGCGGCCTGAGCATAGAACGATTCCGGCAGATCCTTAACCAAACGCGCCCAGGCTTCTGCGCGACGTGCCGGTGGGGTCATCACCGAGTCCACCCCTTGCAAACGCACATTACGCAGAATGAATGGCATAACGGTGGTCGGCAGCGCGAAGCCGCCCGCCAGGCCGCAGGCGGCCACGCAACCGCCATAGTTCATCTGTGCCAGCACTTTCGCCAGCACTTTATCGCCAACGGTATCAATTGCACCGGCCCACAGCTGTTTTTCCAACGGGCGAGATTCAGCAAACTCATCGCGGCTGAGAATACGGTTAGCTCCCAGGCTCTTAAGATAGCCGTGGGTGCTTTCACGTCCTGATACTGCTGCAACCTGATAGCCCAGCTTATGCAGTAATGCAACGGCAGTGCTGCCCACACCGCCGCTGGCGCCGGTGACGACAACCTCACCGTCTTGCGGGCGAATACCTGCATCTTCAAGCGCCATGACGCACAGCATGGCGGTAAAACCGGCGGTGCCGATGATCATGGCGTTACGGCTGCTCAGTCCCGCAGGCAGCGCGACCAGCCAGTCGCCTTTAACGCGTGCGCGTTCGGCCAGGCCGCCCCAGTGGTTTTCGCCCACGCCCCAGCCAGTCAGTAGTACTTCCTGGCCTGCATGAAAGCGAGGATCTTCGCTTGCGTGAACGGTGCCGGCGAAATCAATACCAGGAATCATCGGAAAATGACGGATAATTTTCCCTTTTCCGGTGATAGCCAGAGCATCTTTATAATTCAGGCTGGACCAGTGGACATCCACCGTCACATCACCTGCCGGCAGTTGACTCTCTTCGAGATGTTGCACGGATGCGAGGGTTTTACCGTCCTGCTGTTCTAAGATCAACGCCTGCATAACCTGTCCTCACTTTACATGGTAGATTGGAAAAATAGCTACGAAGACTATACTCGCTAATTAAAACGTGATGCCGATGCAACGCAATAAATTGCCAGATAGATCCATTTTGGTAGTATGCCTGCTTCATTGCGCGCCGTGGCGAATTCCGCCTTCAGATTCGCTTTTTCATACTGTTTATAACCGTCGGAGTTAACTCAAGGATGCGATTAACGACGAAATTCTCAGCTTTTATCACGCTGCTCACGGGGTTAACGATCTTCGTGACTCTGATAGGTTGCTCGCTAAGCTTTTATAACGCCGTACAGTATAAGTATGTCAGCCGTGTGCAGGCGACGGCAACAGCCATTGATACGCACCTGGTGACCCATGATATTGTCTCGCTGACGCCGCAAATCGACGAGCTCATGATTGCTTCCGATATTGTGCGTGTCGATCTGTTGCAGGGGGAGCGCAGCGTCTACAGCCATTCTCGCGCCAGAGGTTATCGGCCAGCCGGCACCAGCGACATGTATCGCGAGCTGGTCGTACCGCTGATAAAACATCCAGGAATGTCGCTACGTCTGGTTTATCAGGACCCGATGGGTAACTATTTTCACTCACTGATCACCACGGCGCCGCTGACGCTGGCGATCGGTTTTATCGTTCTGATTTTGTTTTTATCGGTGCGCTGGCTGCAGCGGCAGCTTTCCGGGCAGGAGCTTCTGGAAATTCGCTCCACGCGTATTCTGAACGGCGAACGTGGGGCGAATGTCCGCGGTTCTGTTTACGAATGGCCGGCGCGCACCAGTAGCGCGCTTGATGTATTGCTTTCCGAAATTCAGTTTGCTCACGAACAGCGTAGTCGCCTGGACACCCTCATTCGCTCCTACGCCGCGCAGGACACCAAAACCGGGCTCGGTAACCGTCTGTTTTTTGATAACCAACTGGCGACGCTGCTGGAAGATCAGGAAAAAGTGGGAGTGCATGGCGTGGTGATGATGATCCGCTTGCCAGATTTCAATCTTCTGCGCGACAGCCTGGGAGGCAATCAGGCGGAAGAGCAGATGTTTATGCTTATCAATCTGCTATCGACATTTATTATGCGTTACCCAGGCTCGCTGCTGGCGCGTTATCACCGCAGCGACTTTGCCGTGCTGCTACCGCATCGAACGCTGAAAGAAGCGGAAAGTATTGCCGGACAACTTTTAAAAGCTGTCGATGCGCTACCGGCTAACAAAATGCTCGATCGTGACGATATGGTACATATCGGCATTTGCGCCTGGCGCAGCGGACAGTCCACCGAGCAGGTGATGGAACATGCTGAAGCCGCGGCGCGGAACGCCGCCCTGCAGGGGGGAAATAGCTGGGCTATCTACGACGACACGCTGCCGGAAAAAGGGCGCGGCAATGTGCGCTGGCGCACATTGATTGAGCAGATGCTGAACCGGGGCGGACCGCGACTTTATCAAAAACCGGCGGTCACGCGAGAAGGTCGCGTACATCATCGGGAATTGATGTGTCGCATCTATGACGGTAAAGAAGAGGTGAGCTCGGCGGAGTATATGCCGATGGTGTTGCAATTTGGTTTATCGGAAGAGTATGACAGATTACAAATTAGCCGCCTGATTACGTTGCTGGGCTACTGGCCGGATGAAAATCTGGCAATGCAGTTGACGGTTGAGTCGCTGATTCGTCCGCGTTTTCAGCGCTGGTTGCGCGATACGTTAATGCAGTGTGAAAAATCACAGCGAAATCGCATAATTATTGAACTTGCGGAGGCAGATGTTTGTCAACACATCAGCCGGTTACAACCCATCCTTCGCTTAGTCAATGCGCTTGGCGTTCGGGTGGCGGTGACACAGGCGGGTTTGACGCTGGTTAGCACGAGCTGGATTAAAGCGCTCAATGTGGAATTACTCAAGCTGCATCCGAGTCTGGTCAGAAACATTGAGAAGCGAACGGAAAACCAGCTTCTGGTTCAAAGTCTGGTAGAGGCCTGCGCGGGCACGCCTACGCAGGTTTACGCGACGGGCGTCCGTTCGCGCGGCGAGTGGCAGACGCTGACAAAACGCGGCGTGGCTGGCGGGCAGGGTGATTTTTTTGCCTCCTCCCAACTACTTGACACAAACGTGAAAAAATATTCGCAAAGATACTCGGTTTAACCTGCCGTTTAATTTGTTTTCACGTAGAATAACGCGCGCTGCGCCTCATGGGGGTGTGCTTGTCTGCTCGCCAGATTGTTGCAGCAAATATGCAGGTGAATGACCTTTGACAGGTTGCAAACGAAGGCGAGGATTGCTGCTGATTTTTTCAGCCCTGAGGGACTCAGGCCGAGATTTGTAACAAAGGAAACGAACTGCACTAATTTTCACCGTAGCAGATGATTTTTGCGCCTTGTCGCTGCTGCGTGTGGTTGGTAAAGTAAGCGGATTTTGTTTTCCGCCCCAGCTTTCAGGATTATCCCTTAGTATGTTGAAAAAATTTCGTGGCATGTTTTCCAATGACCTGTCCATTGACCTGGGTACCGCGAATACCCTCATTTATGTAAAAGGACAAGGCATCGTATTGAATGAGCCTTCCGTTGTGGCCATTCGTCAGGATCGTGCCGGTTCGCCGAAAAGCGTGGCTGCAGTCGGTCATGATGCGAAGCAGATGCTGGGCCGTACGCCGGGCAATATCGCTGCGATCCGTCCGATGAAAGACGGCGTTATCGCTGACTTCTTTGTGACCGAAAAAATGCTCCAGCACTTTATTAAACAAGTGCACAGCAACAGCTTTATGCGCCCAAGCCCGCGCGTACTGGTATGCGTGCCGGTCGGCGCGACGCAGGTAGAACGCCGCGCGATTCGTGAATCCGCTCAGGGCGCAGGCGCCCGTGAAGTCTTTTTGATCGAAGAGCCGATGGCGGCCGCGATCGGCGCAGGTCTGCCAGTTTCTGAAGCAACCGGTTCAATGGTTGTGGATATCGGTGGTGGTACCACTGAAGTTGCCGTGATCTCCCTGAACGGCGTGGTTTACTCTTCTTCTGTCCGTATTGGTGGTGATCGCTTCGACGAAGCCATCATTAATTACGTTCGTCGTAACTACGGTTCTCTGATCGGTGAAGCCACCGCAGAACGTATTAAACACGAAATCGGTTCCGCTTATCCGGGCGACGAAGTCCGCGAGATTGAAGTGCGTGGCCGTAACCTGGCGGAAGGCGTTCCACGCGGCTTTACCCTGAACTCAAACGAGATTCTGGAAGCGTTGCAGGAACCGTTGACCGGTATCGTCAGCGCGGTAATGGTGGCGCTGGAACAGTGTCCGCCGGAGCTGGCGTCAGACATCTCCGAGCGCGGTATGGTGCTCACCGGCGGCGGCGCGCTGCTGCGTAACCTTGACCGTCTGTTAATGGAAGAGACGGGCATTCCTGTCGTGGTTGCTGAAGATCCGCTGACCTGTGTGGCGCGCGGCGGCGGCAAGGCGCTGGAAATGATCGATATGCACGGCGGCGACCTGTTCAGCGAAGAGTAGTTGGATACGGGCAGGATTATCCCTGGATTGTCCTGCCTCTCCGACGCGAGAATACGCATAGCCTATGAAGCCAATTTTTAGCCGTGGCCCGTCGCTACAGATTCGCCTTATTCTGGCGGTGCTGGTGGCGCTCGGCGTTATCATTGCCGACAGCCGCCTGGGGACGTTCAGTCAAATCCGAACGTACATGGACACTGCCGTCAGTCCTTTCTATTTTATTTCAAACGGCCCTCGTGAACTGCTCGACAGCGTGTCGCAAACGCTGGCTTCTCGCGACCAGCTTGAGCTGGAAAACCGGGCGCTGCGCCAGGAATTACTGCTAAAAAACAGCGACCTGCTGATGCTGGGGCAGTACAAACAGGAGAACGCGCGGTTGCGCGAGCTGCTGGGGTCGCCGCTGCGTCAGGACGAGCAGAAAATGGTGACGCAGGTTATTTCAACGGTTAACGATCCTTACAGCGATCAGGTCGTTATCGATAAAGGCAGCGTTAATGGCGTATATGAAGGTCAACCGGTCATTAGCGACAAAGGTGTCGTAGGGCAGGTTGTCGCCGTGGCGAAACTGACCAGCCGCGTGTTGCTGATTTGCGATGCGACTCATGCGCTGCCGATTCAGGTACTGCGCAATGACATCCGCGTGATTGCCGCCGGTAACGGCTGTACTGACGATCTTCAGCTTGAACATCTGCCCGCCAATACCGATATTCGCGTCGGCGATGTGCTGGTAACGTCCGGACTTGGTGGGCGTTTCCCGGAAGGGTATCCGGTGGCGGTGGTCTCCTCCGTCAAGCTGGACACTCAGCGCGCCTATACCGTCATTCAGGCACGTCCAACCGCGGGTTTGCAGCGTTTGCGCTATCTGCTGCTGTTGTGGGGAGCCGATCGTAACGGCGCCAATCCGATGACGCCGGAAGAGGTTCACCGCGTCGCCAATGAGCGCCTGATGCAGATGATGCCGCAGGTGTTGCCTTCTCCGGATGCGATGGGGCCGCCGGCGCCAGTGCCGGACCCGGCGACGGGGATTACTCAGCCATCTGCGGGCCAGACGGCGCCGGTATCAACGCAACCATCGCCTTCGGGCGCGACCACGCCGCCTGCGCGTGCGCCGGGAGGGTAATGGTGGCGAGCTATCGTAGCCAGGGGCGCTGGGTTATCTGGCTCTCTTTTCTTATTGCGCTGTTGCTGCAAATCATGCCCTGGCCGGACGACATCATTGTTTTCCGGCCAAACTGGGTTTTGCTCATCTTACTGTACTGGATTCTGGCCCTGCCGCACCGCGTAAATGTGGGGACGGGATTTGTGATGGGTGCCATACTTGATCTCATCAGCGGTTCCACACTTGGCGTGCGCGCTCTGTCGATGAGTATTGTCGCCTATCTGGTCGCTCTGAAATTCCAGCTCTTCCGTAACCTGGCGCTCTGGCAGCAGGCGCTGGTGGTAATGTTGCTTTCACTGGCCGTGGACATTATTGTTTTCTGGGCCGAGTTTTTAGTGATCAACGTCTCTTTCAGACCGGAAGTGTTCTGGAGTAGTGTAGTCAATGGCGTGCTGTGGCCCTGGCTCTTTTTGCTGATGCGTAGGGTACGCCAGCAGTTTGCTGTGCAATAAAGGTCGATATGACAACTCTGTATCTTGCTTCCGGTTCCCCGCGTCGCCAGGAATTATTGACTCAACTTGGATTTTCATTCGAGCAGGTCGTGCCTGGTATTGAAGAGCAACGGCGTGCTCAGGAAAGCGCGCAGCAGTATGTTGTTCGGCTGGCGCGCGAAAAGGCGCAGGCTGGCGTGGCGCTGGTTCCCCGTGATCTGCCGGTGTTGGGCGCGGATACGATAGTCGTGTTAAACGGCGAGGTGCTGGAAAAGCCGCGCGACGCCGCTCATGCCGCAGAAATGCTGCGTTTGCTTTCCGGAAATACGCATCAGGTGATGACTGCTGTTGCGCTTGCGGACTCGCAGCAGACGCTGGATTGTCTGGTGGTAACGGAGGTGACTTTCCGCACGCTTTCTGCGCAGGATATTACTGGCTATGTCGCCAGCGGCGAGCCTTTAGATAAAGCAGGTGCATACGGTATTCAGGGGCGGGGTGGCTGTTTTGTCAGGAAGATAAATGGCAGCTATCACGCCGTGGTCGGCTTACCGCTGGTTGAAACGTATGAGTTGTTGAGTCATTTTAACGCACTGCGTGATAAAAGGGATAAACATGACGGCTGAATTGTTGGTAAACGTAACGCCATCGGAAACGCGCGTGGCGTACATTGATGGCGGTATTCTTCAGGAAATCCATATTGAGCGCGAAGCGCGACGCGGAATCGTAGGCAATATCTACAAAGGTCGTGTAAGTCGTGTACTTCCAGGTATGCAGGCGGCTTTTGTAGATATTGGGCTGGATAAAGCCGCATTTCTTCATGCTTCCGACATTATGCCGCATACCGAATGCGTGGCAGGCGACGAGCAAAAACAGTTTACCGTGCGCGATATCTCCGAACTGGTGCGTCAGGGACAGGATCTGATGGTGCAGGTAGTGAAAGATCCGCTCGGCACGAAAGGCGCGCGTTTGACCACGGACATCACGTTGCCCTCCCGTTACCTTGTTTTTATGCCCGGCGCGTCACACGTTGGCGTTTCCCAGCGCATCGAAAGCGAAAGTGAACGCGAGCGCCTGAAAAAGGTGGTGGCGGAATACTGCGACGAACAGGGCGGGTTTATTATCCGTACCGCGGCGGAAGGCGTGTGTGAAGAGGATCTCGCGTCTGACGCCGCCTATCTCAAGCGCGTCTGGACGAAAGTCATGGAGCGCAAAAAGCGTCCGCAAACGCGCTACCAGATGTATGGCGAACTGGCGCTGGCGCAGCGCGTGCTGCGCGATTTCGCCGATGCGCAGTTGGACCGAATTCGCGTCGACTCACGTCTGACCTATGAGTCGCTGCTGGAGTTTACCGCAGAATACATCCCGGAAATGACCAGTAAGCTGGAGCATTACAGTGGCCATCAGCCGATCTTCGATCTTTATGACGTGGAAAATGAGATCCAGCGCGCGCTGGAGCGTAAGGTTGAACTCAAGTCCGGCGGTTATCTGATTATCGACCAAACCGAAGCGATGACGACGGTAGATATCAATACCGGCGCGTTTGTCGGACACCGTAATCTCGACGACACCATTTTTAATACCAATATTGAAGCGACGCAGGCCATTGCCCGCCAGCTACGCCTGCGTAATCTGGGCGGCATTATCATTATCGATTTTATCGACATGAATAATGAAGATCATCGCCGTCGGGTGCTGCATTCGCTGGAACAGGCGTTAAGTAAGGATCGCGTGAAAACCAGTATTAACGGTTTTTCGCCGCTGGGACTGGTGGAAATGACCCGTAAACGGACCCGTGAAAGCGTGGAACATGTGCTTTGTAACGAGTGTCCAACCTGTCATGGCCGCGGGACGGTAAAAACGGTGGAAACCGTCTGCTATGAGATCATGCGTGAAATTGTACGTGTTCATCACGCTTATGACTCCGATCGCTTCCTGGTTTATGCTTCTCCTGCGGTGGCGGAAGCGCTGAAAGGCGAAGAATCACACGCGCTGGCGGAAGTCGAAATCTTTGTCGGCAAACAGGTTAAAGTTCAGGTCGAACCGCTTTATAACCAGGAGCAGTTTGACGTCGTAATGATGTAATTGAGTAGTAGCACCAGCGCATGTAGGCCGGATAAGGCGCAAGCGCCGCCATCCGGCAATATACCTTCAAGGGGTGTGAGTCGTATTTTTTGGCAGACAAGGAGAGACGCGTGAGGCGATTGCCGGGGATTTTATTGCTCACTGGAGCCGCGCTCATCGTCATTGCAGCGCTGCTGGTTAGCGGGCTGCGCCTGGCGTTGCCTCATCTTGACGCCTGGCGTCCAGCTATCCTGAATAAGATTGAGTCTGTGACCGGCGTGCCGTTGGCCGCCAGCCAACTCTCCGCAAGCTGGCAAAATTTTGGCCCTACGCTGGAAGCGCATAATATTCATGCCGCCCTGAAAGATGGCGGCGAACTGTCGATAAAACGCGTTACGCTGGCGCTGGACGTCTGGCAAAGCCTGTTGCATATGCGCTGGCAGTTTCGCGACCTGACCTTCTGGCAACTCAATTTCCGCACCAACACGCCTCTTCAAAGCAGCGACGGCGAAGGCATCGAAACCAGCCGTTTAAGCGATCTTTTCCTGCGTCAGTTCGATCATTTCGATCTGCGCGATAGCCAAATCAGTTTTCTGACGCTTTCCGGACAACGCGCGGAGCTGGCGATCCCACAGCTTACCTGGCTAAACGGCAAAGAGCGTCACCGCGCCGAAGGCGAGGTCAGCCTCTCCAGCCTGACTGGGCAGCACGGCGTGATGCAGGTGCGGATGGATTTACGCGATGACGACGGTTTGCTAAATAACGGCAGAGTCTGGTTACAGGCCGACGATATTGACGTCAAGCCGTGGCTGGGCAAATGGATGCAGGATAATGTGGCGCTCCAGACGGCGCGTTTCAGTCTGGAAGGCTGGATGACGCTCAGTAAAGGCGAAATCGCCGGAGGGGACGTCTGGCTGAAACAAGGCGGCGCAAGCTGGCTGGGCGATAACACGACGCACACGCTGTCTGTCGATAACCTGACGGCGCAGATTAGCCGCGAGCAGCCGGGCTGGCAGTTTTATATTCCGGACACACGGATTACGCTTGATGGTAAACCCTGGCCGAGCGGAGCCTTAACCGTGGCCTGGCTCCCGCAGCAGGACGTCGGCGGCGAGAATCACACACGTAGCGATGAACTACGCATTCGCGCCAGTAATCTTGAACTGGCGGGGCTGGAAGCATTACGTCCGCTGGCGGCTAAACTGTCGCCTGTGCTGGGCGAAATATGGCAGGCTACGCAGCCGAGCGGAAAGATCGCCACACTGGCGCTGGATATTCCGCTACAGGCGACCGAAAAAACGCGTTTTCAGGCATCGTGGGAAAATCTTGCCTGGAAACAGTGGAAGCTGTTGCCCGGCGCGGAACATTTTTCCGGTACGCTGGCGGGAAGCGTAGAAGACGGGCAGATGAAGGTTGCGATGCAGCAGGCTAAAATGCCCTATGAAACCGTTTTTCGCGCGCCGCTGGAAATTGAAAACGGCGTCGCGACGCTTAGCTGGTTGAAAAATGAGAACGGCTTTCAGCTTGATGGCCGCGACATTGACGTCAAAGCAAAGGCGGTACATGCGCGCGGCGGGTTCCGCTATCTGCAACCGACGGGCGACGAACCGTGGCTGGGCATTCTGGCCGGAATCAGTACCGACGATGGATCTCAGGCGTGGCGCTATTTTCCGGAAAACCTGATGGGCAAAGCGCTGGTCGATTACCTTAGCGGCGCGATTCAGGGCGGCGAGGCGGATAACGCCACGTTGGTTTACGGTGGTAACCCGCACCTCTTTCCCTATAAGCATAACGAAGGTCAGTTTGAAGTGCTGGTGCCGCTACGCAATGCGACGTTTGCTTTTCAACCCGACTGGCCGGCGCTAAAAAATCTCAACATTGAACTGGATTTCCTGAACGACGGCCTGTGGATGCGTTCGGATAGCGTCGATTTGGGCGGGGTGAAGGCCAGCAAACTCGCGGCGGCGATTCCGGATTATTCCAAAGAGAAACTGCTCATTGATGCCGATATTAACGGGCCGGGAAAAGCCGTTGGGCCTTATTTTGACGAGACGCCGCTAAAAGACTCGCTTGGCTCGACGCTGGCGGAGCTCCAATTAGATGGCGATGTGAATGCTCGCTTACATCTTGATATTCCGCTGGACGGAGAACAGGTTACCGCCGAGGGCGATGTCTCGTTGCGTAATAACAGTCTGTTTATTAAGCCGCTAAACAGCACGCTCAAAAATCTGAACGGTAAATTCAGCTTTGTGAATGGCGCGCTAAAAAGCGGGCCGCTGACGGCAAACTGGTTTAATCAGCCGCTGAACCTGGATTTCAGTACGACGGAAGGGGCAAAAGCCTATCAGGTCGCCGTCAACCTGAACGGTAACTGGCAGCCAACGCGTATGGGCGTCTTACCGCCGCAACTGAATGACGCCCTGAGCGGCAGCGTGACGTGGAATGGTAAAGTCGGCATCGATCTTCCGTATCACGCTGACACCACTTATCACATCGAACTGAACGGCGATCTGAGAAATGTGAGCAGTCACTTACCTTCTCCGCTAAATAAACCCGCAGGCGAAGCCATTCCGGTGAACATTCAGGCTGACGGCAACCTGAAAAGTTTTGCGCTAACGGGGAGCGCAGGGAGTAAAAATCACTTTAACAGCCGCTGGTTATTAAATCAGAAGCTGACGTTGGATCGGGCTATCTGGACGACGGACAGCCGGACGATTCCGCCATTACCTGCTCAACAAGGCGTTGAGCTCAATCTGCCGGCGCTGGATGGCGCGCAGTGGCTGGCGTTATTCCAGAAAGGCGCGGCGGATAACGTGAGCAGTTCGGCCGAGTTTCCTCAACGCGTCACGTTGCGTACTCCCGCGCTATCGCTGGGTGGCCAGCAGTGGAACAATTTGAGCGTCGTTTCAGCCCCCTCGCTGAACGGAACAAAAATTGAAGCGCAGGGCCGTGAGGTGAACGCCACGCTGCTCATGCGCAACCATGCGCCGTGGCTGGCGAACATTAAGTACCTGTATTACAACCCTGGTGTCGCAAAAACGCACGCCTCGTCACCAACGCCGACATCGCCGTTGGCTTCGGCGAACACGATTAGCTTCCGCGGCTGGCCGGACTTACAGCTTCGCTGCGAAGAGTGCTGGCTGTGGGGGCAAAAATATGGGCGTATTGATGGCGATTTCGCCATCAAAGGCAATACGCTGACTCTGGCGAATGGCCTGATCGATACCGGATTCGCCCGTTTGAAAGCGAACGGCGAGTGGGTGAATGCGCCGGGTAATGAACGAACCTCGCTGAAAGGTAGTTTGCATGGTAGTAACCTCGACACGGCTGCCGGGTTCTTCGGCATCTCGACGCCAATCCAGAACGCGTCTTTTAACGTAGATTACGATTTGCACTGGCGAAACCCGCCCTGGCAACCTGATGAAGCCACGCTCAACGGGATTTTACGTACGCGTCTGGGCAAAGGCGAGTTTACTGATCTCAGTAGCGGTCATGCCGGACAGCTACTGCGGCTGCTCAGTTTTGACGCGTTGCTGCGTAAGCTGCGGTTTGACTTCAGAGATACCTTTAGTGAGGGCTTCTATTTCGACTCTATTCATAGTACCGCGTGGATTAAAGATGGCGTCCTGCATACTGACGATACGCTGGTGGATGGGCTGGAAGCGGATATTGCAATGAAAGGTTCTGTTGATCTGGTGCGTCGTCGTCTTGATATGGAGGCCGTTGTCGCGCCGGAAATTTCCGCCACCGTGGGCGTCGCCGCCGCGTTTGCCGTTAACCCGATTGTCGGCGCGGCGGTATTTGCCGCCAGTAAAGTGTTGGGGCCGCTATGGAGCAAGGTCTCCATTCTGCGTTATCGCATTACCGTCCGGTCGATGCGCCGCAGATCAACGAAGTTCTGCGCCAACCGAGAAAAGAAAGCCAGCAATGATTTGACGGGGGCGAGGAATTGCCCCACTCTCAGTAAATAGTCATGCCGGACGGCGCTGCGTTTATCCGGTTTGCACAATCTAACAACGTAGTCCGGATAAGGCGTGAACCGCCATCCGGCAGAAATCAATGAGTAGCAAAAACGATGAGTCTGAACCTGGTAAGTGAACAATTGCTAGCGGCGAATGGCCTGAACCATCAGGATCTGTTCGCTATTTTGGGCCAACTGGCCGAACGCCGTCTTGATTATGGCGACCTCTATTTCAGTCGAGCTATCACGAATCCTGGGTTTTAGAAGACCGCATCATTAAAGATGGTTCATATAATATCGACCAGGGCGTTGGCGTTCGCGCCATTAGCGGCGAAAAAACCGGTTTTGCTTATGCTGACCAGATAAGCCTCCTGGCGCTGGAGCAGAGTGCGCAGGCAGCGCGAACCATTGTACGCGAGAACGGCGAAGGCAAGGTAAAAACGCTCGCCGCCGTAGCGCATCAGCCGCTCTACACCACCCTTGATCCGCTGCAAAGTATGAGCCGCGAAGAGAAGCTGGATATCCTCAGACGCGTTGATAAAGCGGCGCGAGAAGCCGATAAACGCGTGCAGGAAGTTAACGCCAGCCTGACCGGCGTATATGAATTAATCCTCGTGGCGGCGACCGACGGGACGCTGGCGGCGGATGTCCGTCCACTGGTGCGGTTGTCCGTTAGCGTGCAGGTGGAAGAAGACGGTAAACGCGAGCGCGGCGCCAGCGGCGGCGGCGGTCGCTTTGGTTATGAGTATTTTCTTGCCGATCTCGACGGCGAGGTGCGCGCCGATGAGTGGGCGAAAGAAGCGGTACGCATGGCGCTGGTTAATCTCTCCGCGGTCGCTGCGCCAGCGGGGACGTTACCGGTGGTTCTGGGTGCCGGGTGGCCGGGCGTATTGCTGCACGAAGCGGTCGGGCACGGGCTGGAAGGTGATTTCAACCGTCGTGGGACGTCTGTGTTTAGCGGTCAGATCGGCGAGCAGGTTGCCTCCGCGCTTTGCACCGTAGTGGATGACGGCACAATGATGAACCGTCGTGGCTCCGTTGCTATCGATGATGAAGGTACGCCAGGCCAGTACAACGTATTGATTGAAAATGGCGTGCTGAAAGGATACATGCAGGACAAGCTGAACGCGCGCCTGATGGGCGCTACGCCGACCGGTAACGGGCGTCGCGAATCTTATGCGCATCTGCCGATGCCGCGTATGACGAATACCTATATGTTGGCGGGGCAGTCAACGCCGCAGGAAATTATCGAATCCGTTGAGTACGGCATCTATGCGCCTAACTTTGGCGGCGGTCAGGTGGATATCACCTCCGGCAAGTTTGTGTTCTCTACCTCGGAAGCGTATCTGATTGAAAACGGAAAAGTCACGACGCCGGTGAAGGGCGCGACGTTGATTGGATCAGGCATTGAAACGATGCAACAGATCTCCATGGTCGGCAACGACCTTAAGCTGGATAACGGGGTGGGGGTCTGCGGTAAAGAGGGGCAAAGTCTGCCGGTAGGCGTAGGCCAGCCGACGCTGAAAGTCGATAACCTGACGGTTGGCGGCACCGCATAACTGTCCGCCTTATCAGGCCTGTGGTGTGTGCCGCCCAGGCCTGATAAGCGCAGCGCCACCAGGCAATGTCTGCGTCGGGTGGCATCTGACCTACAATCGTCTTCAGCAATTACCCGGTGTTTTTATTTTTCTTTCCCTCGCCCGTGCATTCCCTGAAACAGGTGTGCGACATCGACAAAATAGTCAGTTAACGCGTTAATCACTACCTGCACTTTAAGCGGCAGCTTATCTTTTTCGGTGTACAGCGCGTACACCGGACGAGGATCGGACTGATAGCGAGGCAGCAGGATCTCCAGATCGCCACGATTGATCTCATCGATCACCCACATCAGCGGCACATAGGCGATTCCGGTTCCGGCGGTCAGCCAGCGTACCAGCGTCATCGGATCGTTAGTGACGAACCGCCCCTGTGGAATCAGTCGGGTAGAAATTCCTTCCGGCGCGATAAGTTCAAACTCATTATCCGGGCGCACGCTATATTCCAGCCAGGAATGGCTACTGAGATCGGCGGGTTTTTCCGGGACGCCGTACTGAGCGAGGTACGGCTTGGCGGCGCACACAACCATCGGCATCGCGCCGAGTCGACGGGAAAACAGGCTGGAGTCTTGCAGCGCGCCCACGCGGATCACCACGTCCAGCCCGTCGGCAATTAAATCCGGGGCCGGAATTCCTGTCACCAGATTGACCGCCAGACCGGGATACTCTTTTAATAATTTTGCGGTAAGCCCGGCCAGCACATTTTGCGCCATGGTTGAAGAACAGCCGATACGCAATGTACCGATAGGCGTGTTATTAAAAGCGTAAAGCTGCTCGTGAACATCCTGCACTTCATGCAGCATACGACGACAGCCCTGATAATAGATTTTTCCAGCCTCGGTGAGCCCAATGCTGCGTGTGCTGCGGTTGAGCAATTTAACCTGAAGCTCATCTTCCAGTTTCGCCACAGTCTGGCTGATGGACGATACGCTCATTTGCAGCTGTCTGGCTGCGGCGGTAAAAGAGCCAAACTCGACAACTTTGGCGAATACCGACATTCGTTTTAATCGTTCCATTATTCACTCTGGCTTAAAAGTGATTTAGATCACATAATATAGATAACAGCATAACAATTACGTTAATATACTATTATTCGAATAAGAGCTATGTCGCTCTCGCCTGGCGTCTCTTGCCCTTCTCTGCGGCGATGTAGCTTAACAATCTATCGCCTGCTCTCTCTCAGAAATCAAGGTCAACATGAGTCTGTTTCCCGTTATCGTGGTGTTTGGTCTTTCCTTCCCACCGATATTTTTCGAATTGCTTTTGTCACTGGCTATTTTTTGGCTGGTGCGTCGGATGCTTGTGCCGACGGGTATCTATGATTTTGTCTGGCATCCGGCTTTGTTCAATACCGCGCTGTATTGTTGTCTGTTTTATTTAATATCGCGCCTGTTCGTTTGAGGTTGAAGTGAAAACACTAACAAGAAAACTCTCCCGTACGGCCATTACCCTCGTACTGGTTATTCTGGCCTTTATCGCCATTTTCCGCGCCTGGGTGTATTACACCGAATCTCCCTGGACGCGCGATGCGCGTTTTAGCGCCGATGTCGTCGCTATTGCCCCGGATGTCGCGGGCCTGATTACCCATGTAAACGTGCATGATAACCAACTGGTGAAAAAGGATCAGGTCCTGTTTACCATCGATCAGCCACGCTATCAAAAAGCGCTGGCGGAAGCGGAAGCTGATGTAGCCTATTATCAGGTCCTGGCGCAGGAAAAACGCCAGGAAGCCGGTCGCCGTAATCGCCTCGGCGTACAGGCGATGTCCCGGGAAGAGATCGATCAGGCGAACAACGTCTTGCAGACGGTACTGCATCAACTGGCTAAAGCCCAGGCGACGCGCGATCTGGCTAAACTCGATCTGGAACGCACCGTCATTCGCGCGCCTGCCGACGGCTGGGTAACTAACCTCAATGTTTATGCCGGTGAATTTATTACCCGCGGTTCCACAGCCGTCGCCCTGGTGAAAAAGAACTCTTTTTACGTGCAAGCCTATATGGAAGAGACCAAGCTGGAAGGCGTGCGTCCGGGATATCGTGCCGAAATCACGCCGCTTGGCAGCAACCGGGTATTAAAAGGTACCGTCGATAGCGTGGCGGCAGGGGTAACGAATGCCAGCAGTACCAGCGATGCGAAAGGGATGGCGACGATAGACTCTAACCTCGAATGGGTGCGTCTGGCCCAGCGTGTCCCGGTGCGTATCCGGCTCGACGAGCAGCAGGGGAATTTGTGGCCTGCCGGTACGACGGCAACCGTCGTGATCACCGGTAAGCAGGATCGTGACGCAAGCCAGGACTCCTTCTTCCGTAAACTGGCGCACCGGCTGCGCGAATTCGGTTAATCGTCATGGGCATTTTCTCCATTGCCAACCAGCACATTCGTTTTGCGGTAAAGCTGGCGTGCGCCATTGTGCTGGCATTATTTATCGGCTTCCATTTTCAACTGGAGACGCCGCGTTGGGCGGTGCTGACCGCCGCGATCGTGGCGGCAGGCCCGGCATTTGCCGCCGGCGGGGAACCCTATTCCGGCGCTATCCGTTATCGCGGAATGTTGCGTATTATCGGGACGTTTATTGGCTGTATCGCCGCGCTTATCATTATTATCTCTATGATCCGCGCGCCGCTATTGATGATTCTGGTGTGCTGTGTCTGGGCCGGTTTTTGTACCTGGATCTCCTCTTTAGTACGGATCGAAAACTCTTATGCGTGGGGACTGTCAGGTTATACTGCGCTGATCATTGTGATTACCATTCAGACGGAGCCGCTGCTTACGCCGCAATTTGCGCTTGAGCGCTGTAGCGAAATCGTCATCGGCATTGGCTGCGCTATTTTAGCGGATCTGCTTTTCTCGCCGCGATCGATAAAGCAGGAGGTTGATCGCGAACTGGACAGTTTACTGGTAGCGCAATATCAGCTAATGCAACTTTGTATTAAGCATGGCGACAGCGAAGAAGTCGACAACGCCTGGGGCGATCTGGTGCGACGCACGGCGGCGCTGGAGGGAATGCGAAGTAATCTGAATATGGAATCTTCGCGCTGGGTACGCGCTAACCGCCGCCTGAAAGCGTTGAATACGCTTTCGCTCACGCTGATTACGCAATCCTGTGAAACTTATCTTATTCAAAACACGCGGCCGGAACTGATTACCGATACATTCCGGGAACTGTTCGAGACGCCGGTTGAAACGGTACAGGACGTTCACAGGCAGTTAAAACGGATGCGGCGTGTCATTGTCTGGACCGGCGAGCGCGAAACGCCCGTCACGCTTTATAGCTGGGTAGGCGCAGCGACGCGCTATCTGCTGCTTAAACGCGGCGTGATCAGCAACACCAAAATCAGCGCGACGGAAGAAGAGATCCTGCAAGGCGAGCCCGTAGTTAAAGTGGAATCCGCTGAACGTCACCATGCGATGGTGAACTTCTGGCGCACCACGCTTTCCTGCATACTGGGGACATTATTCTGGCTATGGACCGGCTGGACATCGGGTAATGGCGCGATGGTGATGATTGCCGTGGTGACCTCGCTGGCGATGCGTTTACCCAATCCGCGCATGGTCTGCATTGATTTTATTTATGGCACTCTGGCGGCGTTGCCGCTGGGATTGCTCTATTTCCTGGTGATTATTCCCAATACGCAGCAAAGTATGCTGTTGTTATGTTTGAGCCTGGCGGTATTGGGGTTCTTTATCGGCATTGAGGTCCAGAAACGGCGTCTGGGGTCGATGGGGGCGCTGGCCAGTACGATTAACATTATTGTGTTGGATAATCCGATGACGTTCCATTTCAGCCAGTTCCTCGACAGCGCGTTGGGGCAAATTGTCGGCTGTATGCTGGCGTTTATCGTCATCCTGCTGGTGCGTGATAAATCCAAAGATCGAACCGGTCGCGTGCTGCTTAATCAGTTTGTTTCCGCTGCGGTCTCGGCAATGACGACCAACGTGGTACGTCGTAAAGAAAACCGTCTGCCTGCGTTATATCAGCAACTCTTTTTGCTGATGAATAAATTCCCTGGCGATTTGCCGAAGTTTCGCCTGGCGCTAACGATGATCATTGCTCATCAACGTCTGCGCGACGCGCCGATACCGGTTAATGAGGATCTGTCAGTGTTCCACCGCCAGCTTCGTCGTACTGCAGACCATGTTATTTCCGCCGGAAGCGATGATAAACGGCGACGCTACTTTGGTCAGTTGCTGGATGAGCTGGATATCTATCAGGAGAAATTACGTATCTGGGAGGCGCCGCCGCAGGTGACGGAACCGGTTAAACGCCTGACGGGCATGTTACATAAATACCAGAACGCGCTTACCGATAGCTAACCCACAAGAACCGATGCCAAAAGCGTCGGTTTTTTTATGACTATACTTATTTGAGATACAAAAACAGCGCAAGAGTGCTCCGGGGCTAAAGCGTTTCTGGAAGTGAAACCTCTGCATAATGATGCTTAGTAGTAAAACAGACAATTGTCTGTCGGGGATGGTGGCTGGCTATGAATGTCTATACTTTTGATTTTAATGATATTAAAAACCAAAGCGATTTTTATCGCGAGTTTACGCAGACGTTTGGTCTTGCCAGTGAGAAAGTAAGCGATCTTGATACATTATGGGATGCGGTAATGAGCGATATCTTACCCCTACCGCTAGAGATTGAGTTCGTTCACCTGCCTGATAAACTGCGCAGGCGCTACGGGGCGCTGATTTTACTCTTTGATGAAGCGGAAGAAGAGCTGGAGGGGCGGCTACGGTTTAATGTCCGGCATTGAAAAAGCATAAAAAAGCCCCCGGCAGGCGGGGGCAAGTCGTCGGGTTAGACGACGAGGGTTTACTTATACAACTCGGCGGTGGCGTGCCAGTTATCACCTTCTTTTAATTCGATGATGCGATAGCTATTAGCGCCTGCTTTTTCAGCTTTAGCGACGATCTCCTGGCGCATATCCATTGGCGTACTGCCAATCTGCGAGACGGAAATGGTGCCCATCGGTTGCAGATTCTGCGCCTGTTCCGCGTTAACCTGATGTACTGCGGCATTTGCGCCGAAAGAGAGCAGAGTGGCCAGGCCGAGTGATGCGATAATGTATTTCGTTTTCATGGTCTTTTATTCCTTTATAGGGTCAACCTCGCGGATTAATGGGCTTTTCTTATGCGATATCTCCGCGAAGCAATGACACGTTATATGCGATGGCAGGTGAATTTTATTTGTACAGTTCGGCAGTGGCGTGCCAGTTGCTGCCGCTACGCGCTTCGGTAATGTGGTAGGCCGTAGCGCCTTGTTCATCCGCTTTTTTGCTCAGCATGGCGTTCATATCCATAGGCGATGAGCCGATAGCGCTGACTGAAACCGATCCGATTGCTTCACGGTTTTGCGCTTGCTCTGCGCTGATAGGTTCTGCGGCGAAAGCGCCAAAAGAGAGAACGGACAGAATACTTAATGTTGCAACAGTGGTTTTGATTTTCATGATTTTTACCTCGTCGAATTCCTTTACTGGGGTCTTTGTTTCGTGACCCTCATCACAAAATTAAGTATACACTAATCACGCGATTAATTAATACCACGCTAATTGTTTCTGCTTTTTAAAGACATTAAAAATTTTATTTTTTTATAACGTTATGGAATTAAAATTGCTGTTTTTCGCCGCTGTCATTTAAAAAATTAAATAAAATCAATGTCTTAATAAATTTGACATTTTGTGCGTTTTTTTGATTTGTCATTCATGTGGTGAATGTTGAAGGGCATGATTTGCCAAATGATAGCCGTACAGGAAAGCGGCGAGGGGAGAGGACAAAAAGGTGCCTCCGCCGTACGACGACGGAGGAAGTGGGACTAAAGCTCTTGTTCAAAAAGTTCCAGAATGGCTTCGTACAGATCTCTTACGGAAAAGCCGCTGGCAGGCGTAGTGAAGATAGTATCGTCGCCGGCAATCGTGCCGAGAATACCTTCTGCTTTCCCCAGTGAATCCAGCAGGCGTGCAATGAGTTGTGCCGCGCCGGGGCTGGTATGAATCACCACGACAGCGTCGTTATAGTCGATATCCAGCACCAGATTTTTCAACGGACTGGATGTTGTTGGGACGCCCAGTTCAGCCGGAAGACAGTACACCATTTCCATTTTTGCGTTACGTGTACGCACTGCGCCGAACTTAGTCAACATGCGCGAGACCTTGGACTGATTGATATTTTCAAAGCCCTGATCCTGCAACGCGAGGACGATTTCGCCCTGTGAGCTGAATTTTTCTTCTTTGAGGAGCGCTTTAAACGCTCTTACTAGTTCTTCTTGTTTAGCCGAGCTTCGCATAAGTCACCCGTAATATGGCAGTAGAAACAACATTATTGTGCATACAGATGAATTTTTATGCAAACAGTCTGCCTTGTTTAAGGCTGAAATAATGTTATGAAAGAGCGGAATTTTATCAAATTTCGTTATCAAGAAACATGCCTGTGTCACGCCTCGCAAATAAGATTAAAAGTAAATTAATTGTTATCAAAATGATGTTGTTTTGAGAGGATAGTAGGGTATATTGTCACCACCTGTAGGAACGTTGCTCTCTGGTTGCAGTCGATCCGGATTACGCAAATTAAATGCATAAAAGCCAAAATTGCGCGACTCCGCATTCTTGATGAGTGAGGATTGTAATCATTGAATTTGTGAATTAAGGTCGCCGCCGCGGAGCAATAGACACTTAGCTAATCATATAATAAGGAGTTTAGGATGAAAGTCGCAGTCCTCGGCGCTGCTGGTGGTATCGGTCAGGCGCTGGCATTACTTTTAAAAAACCAACTGCCTTCAGGTTCAGAACTCTCCCTGTACGACATCGCTCCAGTGACTCCCGGTGTGGCCGTTGATTTGAGCCACATCCCCACCGCTGTAAAAATCAAAGGTTTCTCCGGTGAAGACGCAACCCCGGCGCTTGAAGGCGCTGACGTAGTACTGATTTCTGCGGGTGTGGCGCGTAAGCCGGGTATGGACCGTTCCGACCTGTTTAACGTTAACGCTGGCATCGTGAAAAACCTGGTGCAGCAGATCGCTAAAACCTGCCCGAAAGCGTGCGTGGGCATTATCACCAACCCGGTGAACACCACCGTTGCGATTGCGGCGGAAGTGCTGAAAAAAGCAGGCGTATACGACAAAAACAAACTGTTTGGCGTCACCACGCTGGATATCATCCGCTCTAATACCTTTGTTGCAGAGCTGAAAGGTAAGCTGCCAACGGAAGTTGAAGTTCCGGTGATTGGCGGACACTCCGGCGTGACGATTCTGCCGCTGCTGTCGCAGATTCCAGGCGTAAGTTTTACCGAACAAGAAGCGGCCGAGCTGACTAAACGTATTCAGAACGCCGGTACTGAAGTCGTCGAGGCGAAAGCCGGCGGCGGATCGGCAACCCTCTCTATGGGTCAGGCTGCCGCACGTTTCGGTCTTTCTCTGGTTCGCGCTCTGCAGGGCGAAAAAGGCGTGGTGGAATGCGCCTATGTGGAAGGCGACGGTCAGTATGCCCGTTTCTTCTCTCAGCCGCTGCTGCTGGGTAAAAACGGTGTAGAAGAGCGTAAATCCATCGGCACACTGAGCGCTTCGAGCAACATTCGCTGGACGCTATGCTGGATACGCTGAAAAAAGATATTCAGTTGGGTGAAGATTTTATTAATAAATAAGCTGTTCCTGCCCGATGCCGGGGCTTTTGCTCCGGCTTCTTTGACTGTAAACAAGCCACGAAATCAAGTACTAAATCTGCTAAACTCCTCACGTCTTATCGGTTATTAACGAGAACGTCGTGAAAAAGATCCAAAGAACGCAAACCCGCGATCACATTACGCAAATGCTTCGCTATGAAATTCTTTCCGGCAATATAAAAGCGGGAGAAGAACTCGCGCAGGAAAGTATTGCGGAGCAGCTCGGTCTTTCACGAATGCCGGTTCGAGAAGCGCTACAGTCACTCGAACAGGAAGGTTTTTTAATCCGACTGCCCAACAGGCACATGCAGGTCGCGCATCTTGAGGCCGATCGCGTGAGTCATATTTTTCGCGTGATTGCCGCGATGGCCGCCGAAATGTTTTCGCTGATACCCAGCGAAGTAGGCGATGCCCTCCTGATACGTGCGCAAGCGCTGGCTGTCGCAGAAGACAAATCCTGCGAACTGGAGTGTCATGCGATGCTGATTTCTTATGTGAATAACCGCTATCTGGAAAAGGTGTATCAGCAGTTCCTGGACGGTTATGTCTCTTATGTTATTTTGCATCTGAAAAAAGATAATCAGGAGTCGGCGCAGCTTTTCGCTGAACTGGCTGACGTCATACGCCAGGGGCGGCGTGATGAGATTGGGCAAGTTATGCAGCGTTATTTCCTCTCTTTAGCAGAAATAATGCGTCAACATATGAAGGATTGGGAAAGTGCAGAAGCCTAAGTTAGGGAAGATTAAGCTTCTTTCAGCGAAAGAACAGGTCGCGGCAGTGCTGCGTAAAGCGATTCTTTCGAGGGAGTTAGTGGAAGGGCAGGAGATTACGTTAGAAGGCATTGCCAGAATGGTCGGCGTGTCGAGTATGCCAGTGCGCGAGGCGTTTCAGATTCTGGCGGCGGATGGGCTGATAAAAGTGCGCCCTAATAAAGGGGCGGTGGTGTTGGGCATCAATGAACAAACGATCCGTGAACACTATGAAATTCGCGCGCTGTTAGAAAGTGAGGCGGTGGCTAAAGCTTCTCGTCCCGGCACCGATATCTCGCGGATAGCCCAGGTGCATTATGCCGCTGAAAAGGCTCTCGCGGAAAATAACTCGGCGGAATACTCCGACCTGAATCAGGCATTTCATATGGAAATATGGAATGTTGCTGGCAACGAAAAAATGAAAATGTTGCTCTGTAATATGTGGAACGGCCTGTCGATGGGCCATAAGGTCACCGAAGAAGAGTATGCCGTGATTTCCATTCAGGAACATAAAAGTATTCTGCAAGCTCTTGAACTTCATGATGAAACATTGGCACGCCAGCGCATGCGCGAACACATTATTCGCTCGATGGAGAATATGCTGACCCGCTATGTTGGTGACCCTTCTGCCTGATCTTTTCGCCTGCGTGAGCATCGCAGGCGCTTCAACCAATTCTTCAATTCTATTAATCACGTCCAGTAACGATCTAACCACGATCTTGATGTAGATCTCACTCCTGCATCTTTCCTGAGAAACGATTTGACTTAGTTTTTGTATGGGAATAATTTCTCCCTGTCAACATTGGATACAATATCATATATCATTGATTGGATAATTATAAATATGGAACCCATTACCCTTACCTTATGTTTATTGGTGTTCGCTATCGTCATGTTTGTATGGGAAAAAGTGCCGCTAGCCGTCACTTCCATGATTGTCTGCGTGGCCTTAGTGATCACCGGCGTTCTGAATATCAAACAGGCCTTTGCTGGCTTTATCGACACTAACGTCATCCTCTTTGTCGCGATGTTTATCGTGGGCGGCGCGTTGTTCGAGACCGGTATGGCAAACAAAGTGGGCGGCGTCATCACGCGCTTCGCGAAAACAGAAAAACAGCTGATCTTCACCATTATGGTGGTGGTGGGTTTGATGTCTGGCGTCCTTTCTAACACCGGTACCGCTGCGGTTTTGATTCCGGTGGTGATCGGCGTGGCGGCAAAATCGGGCTTCTCGCGTTCTCGTCTACTGATGCCGCTGGTTTTCGCCGCTGCGTTGGGCGGTAACCTGTCGTTAATTGGCGCGCCGGGAAATCTTATCGCGCAGTCTGCGCTACAGAATATTGGCGGCGGCTTTGGCTTTTTTGAATACGCAAAAATCGGTCTCCCGATGCTCATCTGCGGCATTCTCTACTTCCTGACTATTGGCTATCGCTTTCTGCCGAATAATGCGACCGGCGGTGAGGTGGGGAGCGTCGGGGAGCAACGTGACTACAGCCATGTTCCACAGTGGAAACAACGGCTCTCACTCGTGGTGTTGATCGCAACGATCCTCGGCATGATCTTTGAGAAGAAGATCGGTGTCAGCCTGGCCGTGACCGGTTGTATTGGCGCTCTTGTGCTGGTAGTAAGCGGCGTACTCACGGAAAAACAGGCCTATAAAGCCATTGATTCACAGACTATTTTTATCTTTGGCGGCACCCTGGCGCTGGCAAAAGCGCTGGAAATGACCGGGGCAGGCAAGCTGGTTGCGGACTATGTAATCGGCATGCTGGGGCAAAACTCCTCACCGTTCATGTTGCTGATCGCCGTTTTCGCCTTATCCGTTGTGATGACCAACTTTATGTCCAATACCGCAACGACAGCGCTATTGGTGCCGGTGAGTTTGTCCATTGCCGCAGGCATGGGGGCCGACCCGCGTGCGGTTCTGATGGCGACGGTCATTGGCGGTTCCTGCGCCTACGCAACTCCTATCGGTATGCCTGCCAACATGATGGTGCTCTCCGCGGGTGGCTATAAATTTGTCGACTACGCTAAGGCCGGTATCCCCCTGATTATCGTTTCAACCATTGTCAGCTTGATCCTGCTGCCAATCCTTTTCCCTTTTCATCCGTAATTAACGGAAGCTGTTCTGACTAAGGAGCATTTATGAGTAAAAGTGAACAGATATCCCATATGACCGACGTTATGGCCAAGTTTGTGGGATATACCGGCAAAGTTTTGCCTGATGATGTCACTGCAAAACTGGAAGATTTGCATAAGAAAGAGACCAGTAAACTGGCCGACGTCATCTTTACTACCATGATTGAGAACCAGCGTCTGGCGAAAGAGCTGGATCGACCTTCCTGCCAGGACACCGGTGTCATTCAGTTTCTGGTGGAGTGCGGGACGAACTTTCCGCTGATTGGCGAGCTGGAAGCGTTGTTGCGTGAGGCGGTGATCAAAGCAACTGTAGATTCTCCGCTGCGCCACAACAGCGTAGAAACTTTTGATGAATACAACACCGGTAAAAACGTGGGTAAAGGCACGCCGACGGTCTTCTGGGAGATCGTTCCCAATTCCGATCAGTGCAGCATTTATACCTATATGGCAGGCGGTGGTTGTTCTCTGCCGGGGAAAGCGATGGTGCTGATGCCGGGTGCAGGCTATGAAGGTGTGACCCGCTTTGTACTGGATGTGATGACCAGCTACGGCCTCAACGCGTGTCCGCCGCTGCTGGTTGGCGTGGGCGTAGCGACCTCTGTTGAAACGGCGGCACTGCTATCGAAAAAAGCGCTGATGCGTCCGATCGGTTCGCATAACGAGAATGAGCGTGCAGCCTCGCTGGAAAAAATGCTGGAAGACGGCATTAACAAAATTGGCCTGGGGCCGCAGGGGATGTCCGGTAATACCTCCGTTATGGGCGTGAACATTGAAAACACCGCGCGTCATCCTTCTACCATCGGCGTGGCGGTAAACGTCGGCTGCTGGTCGCACCGTAAAGGGCATATCGTTTTCGATAAAGATTTGAATTACACCATTACGTCTCACAGCGGAGTGAATTTCTGATGACTAAAAAAATCCTGACTACGCCAATTAAAGATGAAGATTTAGCGGATATAAGGCTGGCGATATTATTTATCTCAACGGTCATATTGTTACCTGCCGTGACGTAGCCCATCGCCGTTTAATTGAAGGTGGTCGCGAACTTCCGGTAGATGTTCGCGGCGGCGCTATTTACACGCTGGCCCCATTGTCCGCCCAATTAAAGGCGAAGACGATAAGTTTGAAATGGTCTCCGTTGGACCAACCACCAGCATGCGTATGGAGAAATTCGAAAAAGAGTTTATTGCGCAAACCGGCGTGAAATTAATTGTGGTAAAGGCGGTATGGGTAAAGGTACCGAAGAGGGCTGTGCTGAACACAAAGCGCTGCACTGCGTATTCCCTGCCGGTTGTGCGGTTGTAGCCGCCGTGTGCGTGGAAGAAATTGAAGATGCCCAATGGCGCGATCTGGGGATGCCGGAAACGCTGTGGGTCTGTCGGTTAAAGAGTTTGGCCCGCTGATTGTTTCTATTGATACCCATGGGAATAATCTGTTTGAACAGAACAAAATTATTTTTAATCAGCGTAAAGAGATCGTCGCTGATGAAATATGCCAGAACGTAAGTTTCATCAAATAAACCTGATTACGTAAATTGCACGATAATGTGGTAATACGTTATGACGATAATAATAACCAAGGGCTCTGGCCGGGGGCGGCTCAGGGAAATCCACCCTTATTATTTCATCGTCATAAATAGCCCCATTATCGTGCCTCCTGCCTGAAGGCTATTTTATGACTAATGCAGCTCTGCTATTAGGTGAAGGCTTCACCCTGATGTTCCTCGGCATGGGGTTCGTGCTGGCGTTTCTGTTCCTGCTGATTTTCGCCATCCGCGGCATGTCCGCCGTGATTACTCGCTTCTTCCCGGAACCCGTCGCCGCGCCCGCACCGCGCGCCGTTCCCCGCCGTGGACGACTTCACCCGCTTAAAGCCGGTGATTGCCGCCGCCATTCACCATCACCGCCACCACGTTTAATTTCCCGGAGGGATCCATGACCATTGCCATTACCGACGTCGTCCTGCGTGACGCCCACCAGTCCCTTTTCGCCACCCGCCTGCGTCTTGACGATATGCTGCGATTGCCGCTCAGCTCGACGACGTGGGCTACGGGTCGCTGGAGTGCTGGGGCGGCGCCACCTTTGACGCCTGTATCCGCTTTCTCGGCGAAGACCCGTGGCTGCGCCTGCGCGAACTCAAAAAAGCCATGCCGAAAACCCCGCTGCAGATGCTGCTGCGCGGCCAGAACCTGCTCGGCTACCGCCACTACGCCGATGATGTGGTGGAGCGCTTCGTCGAACGCGCGGTGAAAAACGGCATGGACGTGTTCCGCGTCTTCGATGCCATGAACGACCCGCGCAATATGAAAGCCGCACTGCAGGCGGTGCGCAGCCACGGCGCGCACGCCCAGGGCACGCTCTCGTACACCACCAGCCCGGCGCACACCCTGCAGACCTGGCTGGATTTAACGGGAGCAACTGCTGGAAACCGGCGTCGATTCCATCGCCATCAAGGATATGTCCGGCATTCTCACGCCGATGGCGGCGTATGAGCTGGTCAGCGAAATCAAAAAACGTTTTGAGGTACGCCTGCATCTGCACTGTCACGCCACCACCGGGATGGCGGAGATGGCCCTGCTGAAGGCCATTGAAGCGGGCGTCGACGGCGTGGACACGGCGATTTCCTCCATGAGCGCCACCTACGGCCACCCGGCCACCGAAGCGCTGGTGGCGACGCTTGCCGGCACAAAATATGACACCGGTCTGGATATCCTGAAACTGGAAAACATCGCCGCCTACTTCCGCGAGGTGCGCAAAAAGTATCACGCCTTTGAAGGCCAGCTGAAAGGCTACGACAGCCGTATCCTCGTGGCGCAGGTGCCGGGCGGGATGCTGACCAACCTCGAAAGCCAGCTGAAGCAGCAGAACGCGGCGGACAAACTCGACCAGGTGCTGGCGGAAATCCCCCCGCGTGCGCGAGGACCTCGGCTTCATCCCGCTGGTGACCCCCACCTCGCAGATTGTCGGCACCCAGGCGGTGCTCAACGTCCTGACCGGCGAACGCTACAAAACCATTGCCAAAGAAACGGCAGGCATTCTGAAAGGGGAATATGGCCGCACGCCAGCGCCGGTGAATGCCGCGTTACAGGCGCGGGTGCTGGAAGGGGCTGAACCGGTGACCTGCCGCCCGGCGGATTTACTGAAGCCGGAACTGGCGCAACTGGAAGCCGACGTCAGGCGCCAGGCGCAGGAGAAGGGTATTACTCTGGCGGGAAACGCCATCGACGACGTGCTCACCGTGGCGCTGTTCCCGCAGATTGGCCTCAAATTCCTTGAGAACCGCCACAACCCGGCGGCGTTTGAGCCGGTACCGCAGGCGGAAGCCGCGCAGCCGGTGGCAAAAGCAGAGAAGCCTGCCGCTTCCGGTATCTACACCGTGGAAGTGGAAGGCAAAGCCTTTGTGGTGAAGGTCAGCGACGGCGGCGATATCAGCCAGCTCACTGCGGCTGCACCTGCTGCCTCTTCTGCTCCTGCCACCGCCCCGGCAGGCGCCGGCACCCCGTCACCGCCCCGCTGGCGGGCAACATCTGGAAGGTGATTGCCACCGAGGGCCAGACGGTGGCGGGAAGGCGATGTGCTGCTGATTCTGGAGGCCATGAAGTGGAAACCGAAATCCGCGCCGCGCAGGGCCGGGACGGTACGCGTATCGCGGTGAAGTTCGGCGACGCGGTCTCCGTCGGCGACACCCTGATGACGCTGGCGTAAGGACGAGGAGCGGAAATGGAAAGTCTGAACGCCCTGATTCAGGGGATGGGCCTGATGCACCTCGGCATCGGCCAGGCCATCATGCTGCTGGTGAGCCTGCTGCTGCTGTGGCTGGCGATTGCGAAGAAGTTCGAGCCGCTGCTGCTGCTGCCGATGGCTTCGGCGGCCTGCTCTCCAATATCCCGGAAGCGGGCATGGCGCTGACCGCGCTGGAGAGCCTGCTGGCGCACCACGACGCCGGGCAACTGGCGGTGATTGCCGCGAAGCTCAACTGCGCGCCGGACGTGCACGCGATTAAAGAGGCATTAGCGCTGGCGCTGCCGTCGGTGCAGGGACAGATGGAGAACCTGGCGGTGGACATGGGCTACACGCCGGGGGTGCTGGCGCTGTTCTATAAAGTGGCGATTGGCTCCGGCGTCGCGCCGCTGGTCATCTTTATGGGCGTGGGGGGCGATGACCGACTTCGGCCCGCTGCTGGCCAACCCGCGCACCCTGCTGCTCGGCGCGGCGGCGCAGTTCGGCATCTTCGCCACCGTGCTGGGGGCGCTGACGCTGAACTACTTCGGCCTGATTGCCTTCACCCTGCCGCAGGCGGCGGCCATCGGTATTATCGGCGGCGCGGACGGCCCGACGGCGATTTATCTGTCGGGCAAGCTGGCGCCGGAGCTGCTGGGGGCCATCGCGGTGGCGGCGTACTCGTATATGGCGCTGGTGCCGTTAATCCAGCCGCCGATTATGAAGGCGCTGACCACGGAGACGGAGCGGAAAATCCGCATGGTGCAGCTGCGCACCGTCAGCAAGCGGGAAAAAATCCTCTTCCCGGTGGTGTTGCTCATGCTGGTGGCGCTGCTGCTGCCGGACGCCGCGCCGCTGCTGGGGATGTTCTGCTTTGGCAATCTGATGCGTGAAAGCGGCGTGGTGGAGCGTCTGAGCGACACGGTGCAGAACGGGCTGATTAATATCGTGACCATCTTCCTCGGGCTGTCGGTGGGCGCGAAGCTGGTGGCGGACAAGTTCCTGCAGCCGCAGACGCTGGGCATTCTGCTGCTGGGGGTGGTTGCCTTTGGTATCGGGACGGCGGCCGGGGTGCTGATGGCGAAGCTGCTGAACCTGTGCAGTAAGAACAAAATCAACCCGCTTATCGGTTCGGCGGGGGTGTCGGCAGTGCCGATGGCGGCACGCGTATCGAACAAAGTGGGGCTGGAATCAGACCCGCAGAACTTCCTGCTGATGCACGCGATGGGCCCGAACGTGGCGGGGGTTATCGGTTCGGCCATTGCCGCGGGCGTGATGCTGAAATATGTGCTGGCGATGTAATCCAATCTAATCGGGAGCAGTGATTCTGCTCCCGCAAAATGAGGACACAGTAATGAGTGAAATGGTTGCGTTTCGGCAAGGTACATCCATGCCATCCAGAGAAACCATCTTGCGTTATGTTGTTGAGACGGTGAATCAGATAACTGAGCTTGAGCCAGCGCTGCATTTGCTGCCGTGGTCTGGCGTCAACTCAGCAATATATGAGCAACGTTTTGCACAGTGTTACGACGAGGGGCTTTGTGCCGCACAGACGTCGGCACCGAATGTTCCGCAGGGGATTCTGCCATCGACCGATTGGGCTCAGGGTATCGGGTTACTCTGTTTTGCCGCAGGGTATATGAGCGCAGGCGAGCGGCCATTAACACACAATCAGTTATGCGATTTCGTCAAACAGGCGGCTGTTGGTTTATCGCCGATTGAGGGGGAGGCGGCGAGCGGTTTCTCGACGGTACGTAGTATTGCATTACCCGTATTCAGGCGATTACAACGCGATGGTCACGCGTCTCGGGTCCTGCTGTTACAAACGTTGCTGCATCTGGTCGCCTGGAAAAGCGCATCGCAGTATGCGCGTCAACAGGCGCAACGACTGCTCTGGATGGGCGGCATTTTGGGGGAAGGGGGGGAGCATAGCCTGCTTGCACTGGATAAGGCGCTACGTGAAGAAGCCGTCGGGGAGAAAAGTCTCCCCGCTCTGCTGATCTTCACGAGTTTCCTGGCGCATTTTCCGGCAGGGCCAGTCTTTATTGACTAGCCGTCGAACGACGCAGGCAAAAGCTGCGTCGTTTTAGTTCGACGCCGGGTATTCCTGCACCGTCACCTGGAACGTGAGCTGCTTATCATCCCGCATTACCACGACCGGAATGACGGAGCCCGGGCGGATTTCCGCCACCTGATCCATCGTCTCCAGTGCGGACACGGCGGGTTTATTATTGACCGAAATAATCAAATCATTAACCTGAATACCGGCAAGCGCGGCGGGGCCGTTTGGCGTCACTTCATTAACGACAATGCCCTGAATCGGGTCCATGCCGCTACCCTGCTGCGCGTGCAGCGGCGCGATTTCTCGTCCGCCAATACCGATATAGCCGCGAATCACGCGACCGTCGCGGATAAGCTTATCCATAATTTTCGTGGCTAGCTGGAAGGGAATCGCAAAACCAAGGCCTTCCGGCGTTTCGCCATCGTTACTCTTATCAAAAGAGAGGGTGTTGATCCCCATCAGTTCGCCTAACGAGTTGACCAGCGCGCCGCCGGAATTACCGTGGTTAATCGAGGCGTCGGTCTGGAGAAAATTCTGGTCGCCCCGTCGGGTTCAGGCCGATACGACCCGTTGCGCTGATGATTCCCTGGGTGATGGTCTGTCCCAGATTATATGGGTTGCCGATAGCCAGTACGACGTCGCCAATATGCGGTGTACGCTTTGTATTAATCGGGATGGTAGGCAGCCCGCCAGTGGCGTTGATCTTCAGCACCGCCAGATCGGTAAGCGAATCGGAGCCAACCAGTAGCGCTTCAAAGACGCGGCCATCCTGTAGCGCGACGATAATCTGATCGGCATCGTTAATCACGTGCTTGTTGGTAATAATATAACCGCGTTGATCCATGATCACGCCGGAACCCAGCGTGCGGATCTCCAGTTGATTATGCGCGGTACTGTTCATACTGCGGTTATAGACGTTGACGACGGCAGGCGCGGCGCGGCGAACCGCAAAATTATAACTGGCTGGCGTCTCATCGGTACTGTCGAATTGCGGGACGGCGATAGGATTAATTTTGCGCAAAGAAGGCATCACGGCAACAGAATAGCGCCGACAATTAAACCTATTGCGACCGAACGTAAGAGCTTCACAACATGATGGAGGCGTCGTTAAAAAAGGGAACGGCAGCAGCATACCATGAGTTAACCGGACATCACATCGCAGGCTGATGCCCGGTTTAAGACAGGATTAGCGCAGCAATAGATAAATGTTCTCGTTGCCGCGTACTACCTGAAGAGCAATGATGGACGGTTTTGCCGCCATCACTTTGCGCATTTCGGCGATAGAACTGATGCGATCGCGATTAACGCCGATGATAACATCATCTTTTGCAAACCGGCCTGCGCGGCAGGACTGCTTTTTTCGACGCTATCAACCTTAACGCCTTTCGTCCCGTCTTTCAGTTGGCCGTCGCTCAACGTCGCGCCTTGCAACGCCGGGGCGATCATTTCGGCGCTGGCGGAAGAAGAGGTATTGGAATCCAGCGTGACTTCCACCTCCAGCGGCTTACCATCGCGCAGCAGGCCCAGCTTCACTTTCGTGCCCGGTTCGGTGGTGGCGATACGCGAACGCAGTTCGGCAAAGCTATTCAGCGGCTTACCGTTAAGACTGATATCACGTCTCGGATTTCACCCCGGCCTTCGCCGAACCTGAATTGGGTAAACCTCGCTGACAAAAGCGCCACGCTGAACGTTCAGTTTGAATGCCTTAGCGATATCAGCGGTCATTTCAGTGCCTTTAATTCCCAGCAATCCGCGTTTGATTTCGCCGAACTGAATCAACTGCTGCGCCAGCGTCTGCGCCATATTGGAAGGAATAGCAAAGCCAATGCCGATGCTCCCGCCCCCTGGCGCGAGGATCGCGGTATTAATCCCGATCAGCTCGCCGTTCAGGTTAAGCAGCGCGCCGCCGGAGTTGCCGCGGTTAATAGAGGCATCGGTTTGAATAAAGTTCTCAAAGCCCTTCCAGATTAAGCCCGCTGCGTCCCAGCGCTGAAATAATCCCGGAGGTGGCGGTTTGTCCAAGACCAAACGGATTACCGACCGCCACGGCGAAATCGCCGACGCGGAGTTTGTCGGAATCGGCGATGGCAATTTGCGTTAACTTGCTGGGATTCTGAATTTGTAACAGAGCGATATCGCTCTGGTCGTCGCCGCCGATCAGCTTCGCGTCGAATTCGCGTCCGTCATTCAGTTGAATACTGATCTTCTGTGCCTGATTAATCACATGATTATTGGTTAATACATAGCCTTTCGCGGCATCGATAATCAC
Protein sequences of DBSCAN-SWA_2 >LR134233|475331:515849|505628_506294_+|VEC90511.1|DBSCAN-SWA MQKPKLGKIKLLSAKEQVAAVLRKAILSRELVEGQEITLEGIARMVGVSSMPVREAFQILAADGLIKVRPNKGAVVLGINEQTIREHYEIRALLESEAVAKASRPGTDISRIAQVHYAAEKALAENNSAEYSDLNQAFHMEIWNVAGNEKMKMLLCNMWNGLSMGHKVTEEEYAVISIQEHKSILQALELHDETLARQRMREHIIRSMENMLTRYVGDPSA >LR134233|475331:515849|481001_481472_-|VEC90487.1|DBSCAN-SWA MDIRKIKKLIELVEESGISELEISEGEESVRISRTTANAGFPVMQQAYAAPMMQQPALSNAVAPAATPAMEAPAAAEISGHIVRSPMVGTFYRTPSPDAKAFIEVGQKVNVGDTLCIVEAMKMMNQIEADKAGTVKAILVESGQPVEFDEPLVVIE >LR134233|475331:515849|514085_514610_-|VEC90525.1|protease|DBSCAN-SWA MPSLRKINPIAVPQFDSTDETPASYNFAVRRAAPAVVNVYNRSMNSTAHNQLEIRTLGSGVIMDQRGYIITNKHVINDADQIIVALQDGRVFEALLVGSDSLTDLAVLKINATGGLPTIPINTKRTPHIGDVVLAIGNPYNLGQTITQGIISATGRIGLNPTGRPEFSPDRRLD >LR134233|475331:515849|505012_505642_+|VEC90510.1|DBSCAN-SWA MKKIQRTQTRDHITQMLRYEILSGNIKAGEELAQESIAEQLGLSRMPVREALQSLEQEGFLIRLPNRHMQVAHLEADRVSHIFRVIAAMAAEMFSLIPSEVGDALLIRAQALAVAEDKSCELECHAMLISYVNNRYLEKVYQQFLDGYVSYVILHLKKDNQESAQLFAELADVIRQGRRDEIGQVMQRYFLSLAEIMRQHMKDWESAEA >LR134233|475331:515849|509168_509318_+|VEC90516.1|DBSCAN-SWA MGLSVKEFGPLIVSIDTHGNNLFEQNKIIFNQRKEIVADEICQNVSFIK >LR134233|475331:515849|506504_507773_+|VEC90512.1|DBSCAN-SWA MEPITLTLCLLVFAIVMFVWEKVPLAVTSMIVCVALVITGVLNIKQAFAGFIDTNVILFVAMFIVGGALFETGMANKVGGVITRFAKTEKQLIFTIMVVVGLMSGVLSNTGTAAVLIPVVIGVAAKSGFSRSRLLMPLVFAAALGGNLSLIGAPGNLIAQSALQNIGGGFGFFEYAKIGLPMLICGILYFLTIGYRFLPNNATGGEVGSVGEQRDYSHVPQWKQRLSLVVLIATILGMIFEKKIGVSLAVTGCIGALVLVVSGVLTEKQAYKAIDSQTIFIFGGTLALAKALEMTGAGKLVADYVIGMLGQNSSPFMLLIAVFALSVVMTNFMSNTATTALLVPVSLSIAAGMGADPRAVLMATVIGGSCAYATPIGMPANMMVLSAGGYKFVDYAKAGIPLIIVSTIVSLILLPILFPFHP >LR134233|475331:515849|489167_489659_+|VEC90494.1|DBSCAN-SWA MVASYRSQGRWVIWLSFLIALLLQIMPWPDDIIVFRPNWVLLILLYWILALPHRVNVGTGFVMGAILDLISGSTLGVRALSMSIVAYLVALKFQLFRNLALWQQALVVMLLSLAVDIIVFWAEFLVINVSFRPEVFWSSVVNGVLWPWLFLLMRRVRQQFAVQ >LR134233|475331:515849|489667_490261_+|VEC90495.1|DBSCAN-SWA MTTLYLASGSPRRQELLTQLGFSFEQVVPGIEEQRRAQESAQQYVVRLAREKAQAGVALVPRDLPVLGADTIVVLNGEVLEKPRDAAHAAEMLRLLSGNTHQVMTAVALADSQQTLDCLVVTEVTFRTLSAQDITGYVASGEPLDKAGAYGIQGRGGCFVRKINGSYHAVVGLPLVETYELLSHFNALRDKRDKHDG >LR134233|475331:515849|481864_482464_-|VEC90488.1|DBSCAN-SWA MRLTAKQITWLKVCLHLAGFLPLLWLFWAINHGGLSADPVKDIQHFTGRTALKFLLATLLVSPLARYAKQPLLIRTRRLLGLWCFVWATLHLTSYALLELGIHNLALLGSELISRPYLTLGIISWLVLLALTLTSTQFAQRKLGKRWQTLHNVVYLVAILAPIHYLWSVKILSPQPVIYAALALALLALRYRKFRQWWR >LR134233|475331:515849|509473_509737_+|VEC90517.1|DBSCAN-SWA MTNAALLLGEGFTLMFLGMGFVLAFLFLLIFAIRGMSAVITRFFPEPVAAPAPRAVPRRGRLHPLKAGDCRRHSPSPPPRLISRRDP >LR134233|475331:515849|503070_503541_-|VEC90508.1|DBSCAN-SWA MRSSAKQEELVRAFKALLKEEKFSSQGEIVLALQDQGFENINQSKVSRMLTKFGAVRTRNAKMEMVYCLPAELGVPTTSSPLKNLVLDIDYNDAVVVIHTSPGAAQLIARLLDSLGKAEGILGTIAGDDTIFTTPASGFSVRDLYEAILELFEQEL >LR134233|475331:515849|475331_476297_-|VEC90481.1|tRNA|DBSCAN-SWA MRIGQYQLRNRLIAAPMAGITDRPFRTLCYEMGAGLTVSEMMSSNPQVWESDKSRLRMVHVDEPGIRTVQIAGSDPVEMADAARINVESGAQIIDINMGCPAKKVNRKLAGSALLQYPDLVKSILIGVVNAVDVPVTLKIRTGWAPEHRNCVEIAQLAEDCGIQALTIHGRTRACLFNGEAEYDSIRAVKQKVSIPIIANGDITNPHKARAVLDYTGADALMIGRAAQGRPWIFREIQHYLDTGELLPPLPLAEVKRLLCTHVRELHDFYGQAKGYRIARKHVSWYLQEHAPDDQFRRTFNAIEDASEQLEALEAYFENFA >LR134233|475331:515849|484759_486700_+|VEC90491.1|DBSCAN-SWA MRLTTKFSAFITLLTGLTIFVTLIGCSLSFYNAVQYKYVSRVQATATAIDTHLVTHDIVSLTPQIDELMIASDIVRVDLLQGERSVYSHSRARGYRPAGTSDMYRELVVPLIKHPGMSLRLVYQDPMGNYFHSLITTAPLTLAIGFIVLILFLSVRWLQRQLSGQELLEIRSTRILNGERGANVRGSVYEWPARTSSALDVLLSEIQFAHEQRSRLDTLIRSYAAQDTKTGLGNRLFFDNQLATLLEDQEKVGVHGVVMMIRLPDFNLLRDSLGGNQAEEQMFMLINLLSTFIMRYPGSLLARYHRSDFAVLLPHRTLKEAESIAGQLLKAVDALPANKMLDRDDMVHIGICAWRSGQSTEQVMEHAEAAARNAALQGGNSWAIYDDTLPEKGRGNVRWRTLIEQMLNRGGPRLYQKPAVTREGRVHHRELMCRIYDGKEEVSSAEYMPMVLQFGLSEEYDRLQISRLITLLGYWPDENLAMQLTVESLIRPRFQRWLRDTLMQCEKSQRNRIIIELAEADVCQHISRLQPILRLVNALGVRVAVTQAGLTLVSTSWIKALNVELLKLHPSLVRNIEKRTENQLLVQSLVEACAGTPTQVYATGVRSRGEWQTLTKRGVAGGQGDFFASSQLLDTNVKKYSQRYSV >LR134233|475331:515849|488115_489168_+|VEC90493.1|DBSCAN-SWA MKPIFSRGPSLQIRLILAVLVALGVIIADSRLGTFSQIRTYMDTAVSPFYFISNGPRELLDSVSQTLASRDQLELENRALRQELLLKNSDLLMLGQYKQENARLRELLGSPLRQDEQKMVTQVISTVNDPYSDQVVIDKGSVNGVYEGQPVISDKGVVGQVVAVAKLTSRVLLICDATHALPIQVLRNDIRVIAAGNGCTDDLQLEHLPANTDIRVGDVLVTSGLGGRFPEGYPVAVVSSVKLDTQRAYTVIQARPTAGLQRLRYLLLLWGADRNGANPMTPEEVHRVANERLMQMMPQVLPSPDAMGPPAPVPDPATGITQPSAGQTAPVSTQPSPSGATTPPARAPGG >LR134233|475331:515849|479642_480527_-|VEC90485.1|DBSCAN-SWA MIIKASGGGGGRGMRVVRSDAELAQSISMTKAEAKAAFSNDMVYMEKYLENPRHIEIQVLADGQGNAIYLAERDCSMQRRHQKVVEEAPAPGITPELRRYIGERCAKACVDIGYRGAGTFEFLFENGEFYFIEMNTRIQVEHPVTEMITGVDLIKEQLRIAAGQPLSITQDEVVVRGHAVECRINAEDPNTFLPSPGKITRFHAPGGFGVRWESHIYAGYTVPPYYDSMIGKLICYGENRDVAIARMKNALQELIIDGIKTNIDLQTRIMNDEHFQHGGTNIHYLEKKLGLQEK >LR134233|475331:515849|509025_509202_+|VEC90515.1|DBSCAN-SWA MGKGTEEGCAEHKALHCVFPAGCAVVAAVCVEEIEDAQWRDLGMPETLWVCRLKSLAR >LR134233|475331:515849|509840_510290_+|VEC90518.1|DBSCAN-SWA MGYGSLECWGGATFDACIRFLGEDPWLRLRELKKAMPKTPLQMLLRGQNLLGYRHYADDVVERFVERAVKNGMDVFRVFDAMNDPRNMKAALQAVRSHGAHAQGTLSYTTSPAHTLQTWLDLTGATAGNRRRFHRHQGYVRHSHADGGV >LR134233|475331:515849|491830_495670_+|VEC90497.1|DBSCAN-SWA MRRLPGILLLTGAALIVIAALLVSGLRLALPHLDAWRPAILNKIESVTGVPLAASQLSASWQNFGPTLEAHNIHAALKDGGELSIKRVTLALDVWQSLLHMRWQFRDLTFWQLNFRTNTPLQSSDGEGIETSRLSDLFLRQFDHFDLRDSQISFLTLSGQRAELAIPQLTWLNGKERHRAEGEVSLSSLTGQHGVMQVRMDLRDDDGLLNNGRVWLQADDIDVKPWLGKWMQDNVALQTARFSLEGWMTLSKGEIAGGDVWLKQGGASWLGDNTTHTLSVDNLTAQISREQPGWQFYIPDTRITLDGKPWPSGALTVAWLPQQDVGGENHTRSDELRIRASNLELAGLEALRPLAAKLSPVLGEIWQATQPSGKIATLALDIPLQATEKTRFQASWENLAWKQWKLLPGAEHFSGTLAGSVEDGQMKVAMQQAKMPYETVFRAPLEIENGVATLSWLKNENGFQLDGRDIDVKAKAVHARGGFRYLQPTGDEPWLGILAGISTDDGSQAWRYFPENLMGKALVDYLSGAIQGGEADNATLVYGGNPHLFPYKHNEGQFEVLVPLRNATFAFQPDWPALKNLNIELDFLNDGLWMRSDSVDLGGVKASKLAAAIPDYSKEKLLIDADINGPGKAVGPYFDETPLKDSLGSTLAELQLDGDVNARLHLDIPLDGEQVTAEGDVSLRNNSLFIKPLNSTLKNLNGKFSFVNGALKSGPLTANWFNQPLNLDFSTTEGAKAYQVAVNLNGNWQPTRMGVLPPQLNDALSGSVTWNGKVGIDLPYHADTTYHIELNGDLRNVSSHLPSPLNKPAGEAIPVNIQADGNLKSFALTGSAGSKNHFNSRWLLNQKLTLDRAIWTTDSRTIPPLPAQQGVELNLPALDGAQWLALFQKGAADNVSSSAEFPQRVTLRTPALSLGGQQWNNLSVVSAPSLNGTKIEAQGREVNATLLMRNHAPWLANIKYLYYNPGVAKTHASSPTPTSPLASANTISFRGWPDLQLRCEECWLWGQKYGRIDGDFAIKGNTLTLANGLIDTGFARLKANGEWVNAPGNERTSLKGSLHGSNLDTAAGFFGISTPIQNASFNVDYDLHWRNPPWQPDEATLNGILRTRLGKGEFTDLSSGHAGQLLRLLSFDALLRKLRFDFRDTFSEGFYFDSIHSTAWIKDGVLHTDDTLVDGLEADIAMKGSVDLVRRRLDMEAVVAPEISATVGVAAAFAVNPIVGAAVFAASKVLGPLWSKVSILRYRITVRSMRRRSTKFCANREKKASNDLTGARNCPTLSK >LR134233|475331:515849|495774_495924_+|VEC90498.1|protease|DBSCAN-SWA MSLNLVSEQLLAANGLNHQDLFAILGQLAERRLDYGDLYFSRAITNPGF >LR134233|475331:515849|477850_479302_-|VEC90483.1|DBSCAN-SWA MQLEVILPLVAYLIVVFGVSIYAMRKRTAGTFLNEYFLGSRSMGGIVLAMTLTATYISASSFIGGPGAAYKYGLGWVLLAMIQLPAVWLSLGILGKKFAILARRYNAVTLNDMLFARYQSRLLVWLASLSLLVAFIGAMTVQFIGGARLLETAAGIPYETGLLIFGVSIALYTAFGGFRASVLNDTLQGLVMLVGTIVLLVGVVHAAGGLSQAVDTLHALDPKLVTPQGADDILSPAFMTSFWVLVCFGVIGLPHTAVRCISYKDSKAVHRGIIIGTIVVAILMFGMHLAGALGRAVLPDLTVPDLVIPTLMVKVLPPFAAGIFLAAPMAAIMSTINAQLLQSSATIIKDLYLNLRPDQMQNEIRLKRMSAVITLLLGALLLLAAWKPPEMIIWLNLLAFGGLEAVFLWPLVLGLYWERANAAGALSAMIVGGVLYALLATFNIQYLGFHPIVPALLLSLLAFLIGNRFGSSASQATVLSTDK >LR134233|475331:515849|512872_513571_+|VEC90523.1|DBSCAN-SWA MSEMVAFRQGTSMPSRETILRYVVETVNQITELEPALHLLPWSGVNSAIYEQRFAQCYDEGLCAAQTSAPNVPQGILPSTDWAQGIGLLCFAAGYMSAGERPLTHNQLCDFVKQAAVGLSPIEGEAASGFSTVRSIALPVFRRLQRDGHASRVLLLQTLLHLVAWKSASQYARQQAQRLLWMGGILGEGGEHSLLALDKALREEAVGEKSLPALLIFTSFLAHFPAGPVFID >LR134233|475331:515849|513604_514039_-|VEC90524.1|protease|DBSCAN-SWA MGINTLSFDKSNDGETPEGLGFAIPFQLATKIMDKLIRDGRVIRGYIGIGGREIAPLHAQQGSGMDPIQGIVVNEVTPNGPAALAGIQVNDLIISVNNKPAVSALETMDQVAEIRPGSVIPVVVMRDDKQLTFQVTVQEYPASN >LR134233|475331:515849|507805_508705_+|VEC90513.1|DBSCAN-SWA MSKSEQISHMTDVMAKFVGYTGKVLPDDVTAKLEDLHKKETSKLADVIFTTMIENQRLAKELDRPSCQDTGVIQFLVECGTNFPLIGELEALLREAVIKATVDSPLRHNSVETFDEYNTGKNVGKGTPTVFWEIVPNSDQCSIYTYMAGGGCSLPGKAMVLMPGAGYEGVTRFVLDVMTSYGLNACPPLLVGVGVATSVETAALLSKKALMRPIGSHNENERAASLEKMLEDGINKIGLGPQGMSGNTSVMGVNIENTARHPSTIGVAVNVGCWSHRKGHIVFDKDLNYTITSHSGVNF >LR134233|475331:515849|501739_502012_+|VEC90505.1|DBSCAN-SWA MNVYTFDFNDIKNQSDFYREFTQTFGLASEKVSDLDTLWDAVMSDILPLPLEIEFVHLPDKLRRRYGALILLFDEAEEELEGRLRFNVRH >LR134233|475331:515849|499601_501569_+|VEC90504.1|DBSCAN-SWA MGIFSIANQHIRFAVKLACAIVLALFIGFHFQLETPRWAVLTAAIVAAGPAFAAGGEPYSGAIRYRGMLRIIGTFIGCIAALIIIISMIRAPLLMILVCCVWAGFCTWISSLVRIENSYAWGLSGYTALIIVITIQTEPLLTPQFALERCSEIVIGIGCAILADLLFSPRSIKQEVDRELDSLLVAQYQLMQLCIKHGDSEEVDNAWGDLVRRTAALEGMRSNLNMESSRWVRANRRLKALNTLSLTLITQSCETYLIQNTRPELITDTFRELFETPVETVQDVHRQLKRMRRVIVWTGERETPVTLYSWVGAATRYLLLKRGVISNTKISATEEEILQGEPVVKVESAERHHAMVNFWRTTLSCILGTLFWLWTGWTSGNGAMVMIAVVTSLAMRLPNPRMVCIDFIYGTLAALPLGLLYFLVIIPNTQQSMLLLCLSLAVLGFFIGIEVQKRRLGSMGALASTINIIVLDNPMTFHFSQFLDSALGQIVGCMLAFIVILLVRDKSKDRTGRVLLNQFVSAAVSAMTTNVVRRKENRLPALYQQLFLLMNKFPGDLPKFRLALTMIIAHQRLRDAPIPVNEDLSVFHRQLRRTADHVISAGSDDKRRRYFGQLLDELDIYQEKLRIWEAPPQVTEPVKRLTGMLHKYQNALTDS >LR134233|475331:515849|514840_515113_-|VEC90526.1|protease|DBSCAN-SWA MKLGLLRDGKPLEVEVTLDSNTSSSASAEMIAPALQGATLSDGQLKDGTKGVKVDSVEKSSPAAQAGLQKMMLSSALIAIASVLSPKCAK >LR134233|475331:515849|515531_515849_-|VEC90527.1|protease|DBSCAN-SWA MIIDAAKGYVLTNNHVINQAQKISIQLNDGREFDAKLIGGDDQSDIALLQIQNPSKLTQIAIADSDKLRVGDFAVAVGNPFGLGQTATSGIISALGRSGLNLEGL >LR134233|475331:515849|502071_502338_-|VEC90506.1|DBSCAN-SWA MKTKYIIASLGLATLLSFGANAAVHQVNAEQAQNLQPMGTISVSQIGSTPMDMRQEIVAKAEKAGANSYRIIELKEGDNWHATAELYK >LR134233|475331:515849|498452_498656_+|VEC90502.1|DBSCAN-SWA MSLFPVIVVFGLSFPPIFFELLLSLAIFWLVRRMLVPTGIYDFVWHPALFNTALYCCLFYLISRLFV >LR134233|475331:515849|490250_491720_+|VEC90496.1|DBSCAN-SWA MTAELLVNVTPSETRVAYIDGGILQEIHIEREARRGIVGNIYKGRVSRVLPGMQAAFVDIGLDKAAFLHASDIMPHTECVAGDEQKQFTVRDISELVRQGQDLMVQVVKDPLGTKGARLTTDITLPSRYLVFMPGASHVGVSQRIESESERERLKKVVAEYCDEQGGFIIRTAAEGVCEEDLASDAAYLKRVWTKVMERKKRPQTRYQMYGELALAQRVLRDFADAQLDRIRVDSRLTYESLLEFTAEYIPEMTSKLEHYSGHQPIFDLYDVENEIQRALERKVELKSGGYLIIDQTEAMTTVDINTGAFVGHRNLDDTIFNTNIEATQAIARQLRLRNLGGIIIIDFIDMNNEDHRRRVLHSLEQALSKDRVKTSINGFSPLGLVEMTRKRTRESVEHVLCNECPTCHGRGTVKTVETVCYEIMREIVRVHHAYDSDRFLVYASPAVAEALKGEESHALAEVEIFVGKQVKVQVEPLYNQEQFDVVMM >LR134233|475331:515849|502441_502705_-|VEC90507.1|DBSCAN-SWA MKIKTTVATLSILSVLSFGAFAAEPISAEQAQNREAIGSVSVSAIGSSPMDMNAMLSKKADEQGATAYHITEARSGSNWHATAELYK >LR134233|475331:515849|487007_488051_+|VEC90492.1|DBSCAN-SWA MLKKFRGMFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAAVGHDAKQMLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMIDMHGGDLFSEE >LR134233|475331:515849|511956_512820_+|VEC90522.1|DBSCAN-SWA MTDFGPLLANPRTLLLGAAAQFGIFATVLGALTLNYFGLIAFTLPQAAAIGIIGGADGPTAIYLSGKLAPELLGAIAVAAYSYMALVPLIQPPIMKALTTETERKIRMVQLRTVSKREKILFPVVLLMLVALLLPDAAPLLGMFCFGNLMRESGVVERLSDTVQNGLINIVTIFLGLSVGAKLVADKFLQPQTLGILLLGVVAFGIGTAAGVLMAKLLNLCSKNKINPLIGSAGVSAVPMAARVSNKVGLESDPQNFLLMHAMGPNVAGVIGSAIAAGVMLKYVLAM >LR134233|475331:515849|510709_511336_+|VEC90519.1|DBSCAN-SWA MREDLGFIPLVTPTSQIVGTQAVLNVLTGERYKTIAKETAGILKGEYGRTPAPVNAALQARVLEGAEPVTCRPADLLKPELAQLEADVRRQAQEKGITLAGNAIDDVLTVALFPQIGLKFLENRHNPAAFEPVPQAEAAQPVAKAEKPAASGIYTVEVEGKAFVVKVSDGGDISQLTAAAPAASSAPATAPAGAGTPSPPRWRATSGR >LR134233|475331:515849|482464_483469_-|VEC90489.1|DBSCAN-SWA MKKIRPLTEADVTAESAFFMQRRQVLKALGISAAALSLPSTAQADLFSWFKGNDRPKAPAGKPLEFSQPAAWRSDLALTPEDKVTGYNNFYEFGLDKADPAANAGSLKTEPWTLKISGEVAKPFTLDYDDLTHRFPLEERIYRMRCVEAWSMVVPWIGFPLYKLLAQAQPTSHAKYVAFETLYAPDDMPGQKDRFIGGGLKYPYVEGLRLDEAMHPLTLMTVGVYGKALPPQNGAPIRLIVPWKYGFKGIKSVVSIKLTRERPPTTWNLSAPNEYGFYANVNPHVDHPRWSQATERFIGSGGILDVQRQPTLLFNGYANEVASLYRGLNLRENF >LR134233|475331:515849|503955_504852_+|VEC90509.1|DBSCAN-SWA MKVAVLGAAGGIGQALALLLKNQLPSGSELSLYDIAPVTPGVAVDLSHIPTAVKIKGFSGEDATPALEGADVVLISAGVARKPGMDRSDLFNVNAGIVKNLVQQIAKTCPKACVGIITNPVNTTVAIAAEVLKKAGVYDKNKLFGVTTLDIIRSNTFVAELKGKLPTEVEVPVIGGHSGVTILPLLSQIPGVSFTEQEAAELTKRIQNAGTEVVEAKAGGGSATLSMGQAAARFGLSLVRALQGEKGVVECAYVEGDGQYARFFSQPLLLGKNGVEERKSIGTLSASSNIRWTLCWIR >LR134233|475331:515849|480523_480991_-|VEC90486.1|DBSCAN-SWA MLDKIVIANRGEIALRILRACKELGIKTVAVHSSADRDLKHVLLADETVCIGPAPSVKSYLNIPAIISAAEITGAVAIHPGYGFLSENANFAEQVERSGFIFIGPKADTIRLMGDKVSAITAMKKAGVPTVPGSDGPLGDDMNANRAHANVLAIR >LR134233|475331:515849|497341_498271_-|VEC90501.1|DBSCAN-SWA MERLKRMSVFAKVVEFGSFTAAARQLQMSVSSISQTVAKLEDELQVKLLNRSTRSIGLTEAGKIYYQGCRRMLHEVQDVHEQLYAFNNTPIGTLRIGCSSTMAQNVLAGLTAKLLKEYPGLAVNLVTGIPAPDLIADGLDVVIRVGALQDSSLFSRRLGAMPMVVCAAKPYLAQYGVPEKPADLSSHSWLEYSVRPDNEFELIAPEGISTRLIPQGRFVTNDPMTLVRWLTAGTGIAYVPLMWVIDEINRGDLEILLPRYQSDPRPVYALYTEKDKLPLKVQVVINALTDYFVDVAHLFQGMHGRGKEK >LR134233|475331:515849|476957_477839_-|VEC90482.1|DBSCAN-SWA MPWIQLKLNTTGANAEELSDALMEAGAVSITFQDTHDTPVFEPLPGETRLWGDTDVIGLFDAETDMKDVVAILEQHPLLGAGFAHKIEQLEDKDWEREWMDNFHPMRFGERLWICPSWRDIPDENAVNVMLDPGLAFGTGTHPTTSLCLQWLDGLDLNGKTVIDFGCGSGILAIAALKLGAAKAIGIDIDPQAIQASRDNAERNGVSDRLELYLPKDQPEAMKADVVVANILAGPLRELAPLISVLPVEGGLLGLSGILASQAESVCDAYAELFTLDPVVEKEEWCRITGRKK >LR134233|475331:515849|498663_499596_+|VEC90503.1|DBSCAN-SWA MKTLTRKLSRTAITLVLVILAFIAIFRAWVYYTESPWTRDARFSADVVAIAPDVAGLITHVNVHDNQLVKKDQVLFTIDQPRYQKALAEAEADVAYYQVLAQEKRQEAGRRNRLGVQAMSREEIDQANNVLQTVLHQLAKAQATRDLAKLDLERTVIRAPADGWVTNLNVYAGEFITRGSTAVALVKKNSFYVQAYMEETKLEGVRPGYRAEITPLGSNRVLKGTVDSVAAGVTNASSTSDAKGMATIDSNLEWVRLAQRVPVRIRLDEQQGNLWPAGTTATVVITGKQDRDASQDSFFRKLAHRLREFG >LR134233|475331:515849|483581_484556_-|VEC90490.1|DBSCAN-SWA MQALILEQQDGKTLASVQHLEESQLPAGDVTVDVHWSSLNYKDALAITGKGKIIRHFPMIPGIDFAGTVHASEDPRFHAGQEVLLTGWGVGENHWGGLAERARVKGDWLVALPAGLSSRNAMIIGTAGFTAMLCVMALEDAGIRPQDGEVVVTGASGGVGSTAVALLHKLGYQVAAVSGRESTHGYLKSLGANRILSRDEFAESRPLEKQLWAGAIDTVGDKVLAKVLAQMNYGGCVAACGLAGGFALPTTVMPFILRNVRLQGVDSVMTPPARRAEAWARLVKDLPESFYAQAATEITLADAPKFADAIINNQVQGRTLVKIK >LR134233|475331:515849|495947_496121_-|VEC90499.1|DBSCAN-SWA MRYGGERFYLAFAVLAYNGSRCLRTLLQRQEAYLVSISKTGFFAANGANANALVDII >LR134233|475331:515849|496157_497219_+|VEC90500.1|protease|DBSCAN-SWA MSREEKLDILRRVDKAAREADKRVQEVNASLTGVYELILVAATDGTLAADVRPLVRLSVSVQVEEDGKRERGASGGGGRFGYEYFLADLDGEVRADEWAKEAVRMALVNLSAVAAPAGTLPVVLGAGWPGVLLHEAVGHGLEGDFNRRGTSVFSGQIGEQVASALCTVVDDGTMMNRRGSVAIDDEGTPGQYNVLIENGVLKGYMQDKLNARLMGATPTGNGRRESYAHLPMPRMTNTYMLAGQSTPQEIIESVEYGIYAPNFGGGQVDITSGKFVFSTSEAYLIENGKVTTPVKGATLIGSGIETMQQISMVGNDLKLDNGVGVCGKEGQSLPVGVGQPTLKVDNLTVGGTA >LR134233|475331:515849|511518_511707_+|VEC90521.1|DBSCAN-SWA MESLNALIQGMGLMHLGIGQAIMLLVSLLLLWLAIAKKFEPLLLLPMASAACSPISRKRAWR >LR134233|475331:515849|508704_508812_+|VEC90514.1|DBSCAN-SWA MTKKILTTPIKDEDLADIRLAILFISTVILLPAVT >LR134233|475331:515849|479291_479534_-|VEC90484.1|DBSCAN-SWA MDARFVQAHKEARWALWLTLCYLAAWLVAAYLPGDSPGITGLPHWFEMACLLTPLVFILLCWAMVKFIYRDISLEDDDAA >LR134233|475331:515849|511332_511503_+|VEC90520.1|DBSCAN-SWA MIATEGQTVAGRRCAADSGGHEVETEIRAAQGRDGTRIAVKFGDAVSVGDTLMTLA |
47 | Organic_Lake_phycodnavirus(50.0%) | tRNA,protease | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1055233 : 1062602
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >LR134233|1055233:1062602|DBSCAN-SWA GATGACTGACAGTGAACTGATGCGGCTTAGCGAGCAGGTCGGGCTGGCGTTAAAAGCGCGCGGGGCGACCGTAACGACAGCAGAGTCCTGTACCGGCGGCTGGCTGGCGAAAGCCATTACCGACATCGCCGGAAGCTCCGCCTGGTTTGAACGCGGCTTTGTCACCTATAGTAATGAAGCCAAAGCGCAGATGATTGGCGTGCGTGAGGAGACTCTGGCGCAGCATGGCGCGGTCAGCGAACCCGTAGTGGTCGAAATGGCGATCGGGGCGCTGAAGGCCGCTCGCGCCGATTTTGCTGTCGCTATTAGCGGGATTGCCGGGCCGGATGGCGGCAGTGAAGAAAAGCCGGTTGGTACGGTGTGGTTTGCTTTCGCCAGCGTCAGCGGAGAAGGGATTACGCGTCGGGAATGCTTCAGCGGCGACCGTGAATCGGTGCGTCGACAGGCGACGACGTACGCGCTACAAACCCTGTGGCAACAATTTCTACAAAACACTTGATACTGTATGAGCATACAGTATAATTGCTTCAACAGTACGAATTCACTATCCGGTTCAATACCAAGTTGCATGACAGGAGTAATAATGGCTATCGACGAAAACAAACAGAAAGCGTTGGCGGCAGCACTGGGCCAAATTGAAAAGCAATTTGGTAAAGGCTCCATCATGCGTCTGGGTGAAGACCGTTCTATGGATGTGGAAACTATCTCCACCGGTTCGCTTTCACTGGACATCGCACTCGGTGCGGGCGGTCTGCCGATGGGGCGTATCGTCGAAATTTACGGGCCGGAATCTTCCGGTAAAACGACCCTGACGCTGCAGGTGATTGCCGCTGCGCAGCGTGAAGGTAAAACCTGTGCGTTTATCGATGCGGAACACGCGCTTGACCCTGTTTACGCACGCAAGCTGGGCGTCGATATCGATAACCTGCTCTGCTCTCAGCCGGATACCGGCGAGCAGGCGCTGGAAATCTGTGACGCGCTGGCGCGTTCAGGCGCGGTGGACGTCATTGTGGTCGACTCCGTAGCGGCGCTAACGCCGAAAGCGGAAATCGAAGGCGAAATCGGCGACTCTCACATGGGCCTCGCGGCGCGTATGATGAGCCAGGCGATGCGTAAGCTGGCGGGGAACCTGAAACAGTCCAATACGCTGCTGATCTTCATCAACCAGATCCGTATGAAGATTGGCGTGATGTTCGGTAACCCGGAAACCACCACCGGTGGTAACGCGCTGAAATTCTACGCCTCCGTTCGTCTTGATATCCGTCGTATTGGCGCGGTGAAAGAGGGCGATAATGTCGTGGGTAGCGAAACGCGTGTGAAAGTGGTGAAAAACAAAATCGCCGCGCCGTTTAAGCAGGCCGAGTTCCAGATCCTCTACGGTGAAGGCATCAACTTCTATGGCGAACTGGTTGACCTGGGCGTGAAAGAGAAGCTGATCGAGAAAGCGGGCGCATGGTACAGCTACAACGGCGAGAAGATTGGCCAGGGTAAAGCGAACGCGACTACCTGGCTGAAAGAGAACCCGGCGACAGCGAAAGAGATCGAGAAAAGGGTTCGTGAATTACTGTTGAGTAATCAGAATGCCACGCCCGATTTCGCCGTTGACGATAGCGAAGGCGTTGCAGAAACCAACGAAGATTTTTAATCATCAGGTGATGGTGGGTCGGACAAAACGAAGCCGCCATTCAGCAAAACGTTAAATTACCATCAAGGGCTGCATTTATGCGGCCCTTTTGGCATTTCTCCCTGTTAAGGTTTTTTATGAGTGAACCCACATCGCGCCGTCCCGCTTATGCTCGTCTCCTTGATCGTGCGGTACGTATTCTTGCTGTCCGCGATCACAGCGAGCAGGAATTACGGCGTAAACTCTCCGCGCCGGTGATGGGCAAAAATGGGCCGGAAGAGATTGATGCGACGGCAGACGATTATGAACGCGTGATCGCCTGGTGCCACGAACATCACTATCTTGATGACGAGCGCTTCGTGATGCGCTTTATCGCCAGTCGTAGCCGCAAAGGCTATGGCCCGGCGCGTATTCGCCAGGAGTTGAATCAGAAAGGCATTTCCCGTGAATCGACTGAAAAAGCGATGCGTGAATGTGAAATTGACTGGAGTGAAATGGCACGTGAACAGGCCGTTCGCAAATATGGCGAGCCGCTCCCTTCAAACTTTTCAGAAAAGGTTAAGGTACAGCGCTTTTTGCTCTATCGCGGGTATCTGATGGACGATATCCAGCAAATATGGCGAAATTTTGCAGATTGAGCGCATACGGGATTTTACTTCCCGGTAAAGAAAACTTATCTTATTCCCACTTTTTCCGTCTGGACGCGGGTTGCTTAACGGCGAACAGCGTCTGCCGACGAATATTCCTTAAATACACAAAATCATTCAAGCCGCATCAAAGCGGCAAGTGAGCCTCGGGAACGCGTGACCGGGAAACTCGCAGCCAACGCAGAGGCGGCCTGAAGGATGAAGTGTAGCTTGATTTCAGGATAATTATGAGCAAGAGCACCGCTGAGATCCGTCAGGCGTTTCTCGACTTTTTCCATAGTAAAGGGCATCAGGTAGTTGCCAGCAGTTCCCTGGTGCCTAACAATGACCCCACTCTGTTGTTTACCAATGCCGGGATGAACCAGTTCAAGGATGTTTTCCTTGGGCTCGATAAACGTAATTATTCCCGTGCAACCACTTCCCAGCGTTGCGTACGCGCGGGGGTAAACACAACGATCTGGAAAACGTCGGATACACTGCACGTCACCATACTTTCTTCGAAATGCTGGGCAACTTCAGCTTCGGCGATTACTTCAAACACGACGCCATTCAGTTCGCATGGGAATTACTGACCGGTGAAAACTGGTTTGCTCTGCCAAAAGAGCGTTTATGGGTCACCGTTTATGAAACCGACGACGAAGCCTATGAGATCTGGGAAAAAGAAGTCGGCATCCCGCGCGAGCGTATCATCCGTATCGGCGACAATAAAGGCGCGCCATACGCATCGGATAACTTCTGGCAGATGGGCGACACCGGTCCCTGCGGTCCGTGCACCGAAATCTTCTACGATCATGGCGACCACATCTGGGGCGGCCCTCCGGGAAGCCCGGAAGAAGACGGCGATCGCTATATTGAGATCTGGAACATCGTCTTTATGCAGTTCAACCGTCAGGCTGACGGGACGATGGAACCATTGCCGAAACCGTCTGTAGATACCGGTATGGGCTGGAGCGTATTGCCGCGGTACTGCAACACGTTAACTCTAACTATGACATCGACCTGTTCCGCACGTTGATTGAAGCGGTAGCGAAAGTCACCGGCGCTACCGATCTGGGTAATAAATCGCTGCGCGTTATCGCCGACCACATCCGTTCCTGCGCATTCCTGGTTGCCGATGGCGTGCTGCCGTCGAATGAAAACCGTGGCTATGTGCTGCGTCGCATTATTCGTCGCGCGGTGCGTCACGGCAACATGCTGGGCGCGAAAGAGACCTTCTTCTACAAGCTGGTTGGGCCGCTGATTGAGGTAATGGGCTCCGCAGGCGAAGAGTTGAAACGTCAGCAGGCGCAGGTCGAGCAGGTTCTGAAAACGGAAGAAGAGCAGTTCGCCCGTACGCTGGAGCGCGGTCTGGCGCTGCTGGATGAAGAACTGGCGAAATTGCAGGGTGATACCCTGGACGGCGAAACCGCTTTCCGTCTGTATGATACTTATGGTTTCCCGGTCGATCTAACGGCGGACGTCTGCCGTGAGCGCAACATCAAAGTCGACGAAGCCGGTTTTGAAGCCGCTATGGAAGAGCAACGCCGTCGCGCGCGCGAAGCTAGCGGTTTTGGCGCGGACTACAACGCGATGATCCGCGTTGACAGCGCTTCTGAATTTAAAGGTTATGACCATCTGGAACTGAACGGCAAAGTGACCGCGCTGTTTGTCGATGGTAAAGCGGTTGAAGTGATTAACGCGGGACAGGAAGCGGTTGTCGTGCTGGATCAGACCCCATTCTACGCGGAATCAGGCGGTCAGGTTGGCGATAAAGGCGAGCTGAAAGGCGCTGGATTTACTTTCGCTGTGGACGACACGCAAAAATATGGTCAGGCGATTGGTCACCTCGGCAAATTGTCTGCGGGCGCTCTGAAAGTGGGCGATGTGGTACAGGCTGACGTAGATGAAGCGCGCCGCGCACGTATCCGTCTGAACCACTCCGCCACGCACCTGATGCACGCGGCGTTGCGCCAGGTGCTTGGGACACACGTTGCGCAGAAAGGCTCGCTGGTTAGCGATAAAGTGCTGCGCTTCGACTTCTCGCACAATGAAGCGATGAAGTCGTCGGAGATTCGTTGAGGTTGAAGATTTGGTGAATGCGCAAATTCGCCGCAACCTGCCGATTGAAACGAACATCATGGATCTCGACGCGGCGAAGGCGAAAGGGGCCATGGCGCTGTTTGGCGAGAAATACGACGAACGTGTACGCGTTCTGAGCATGGGCGATTTCTCTACCGAACTGTGCGGCGGTACTCACGCCAGTCGTACCGGTGATATCGGTCTATTCCGTATTATCTCTGAGTCCGGCACCGCAGCGGGCATTCGTCGTATTGAGGCGGTAACCGGTGAAGGCGCTATGGCCACCGTTCACGCGCAAAGCGATCGCTTAAACGATATTGCGCATCTGCTGAAGGGCGACAGCCCAGAATCTGGGCGACAAGTGCGTGCCGTGCTGGAACGTACGCGTCAGCTGGAAAAGAGTTGCAGCAGTTGAAGGACCAGGCCGCAGCGCAGGAGAGTGCAAATCTTTCCAGTAAAGCGGTTGATCTCAACGGCGTGAAGCTCCTGGTCAGCGAGCTTGCTGGTATTGAGCCGAAAATGCTGCGAACCATGGTTGATGATCTGAAAAATCAACTGGGGTCTACCGTTATCGTACTGGCAACGGTTTGTTGAAGGTAAGGTTTCTCTGATTGCGGGCGTGTCGAAGGATGTGACCGACCGGTTAAAGCAGGGGAATTGATTTGGATGGTCGCTCAGCAGGTGGGCGGTAAGGGTGGCGGTCGTCCGGACATGGCGCAAGCCGGTGGTACGGATGCGGCAGCTCTGCCTGCAGCGTTAGCCAGTGTACAAGGCTGGGTCAGCGCAAAATTGCAATAATTATAAGCGTTCGTTGGCGCCGCAGCTCGTGCTGCGGCGTTTACGTCAACGCTATCGACAACGATAAAGTCAGGTTGAAGTTTTGTATATCGGCTAAACTTAGGTTTAACAGAATGTAATGCCATGACTGCTTACACCTAAGGTGTTTGTCATCGCTTACTTTTTGGCGTTATATGATGGATAATGCCGGGATACAGAGAGACCCGACTCTTTTAATCTTTCAAGGAGCAAAGAATGCTGATTCTGACTCGTCGAGTTGGTGAGACCCTCATGATTGGCGATGAGGTCACCGTGACAGTTTTAGGGGTGAAGGGCAACCAGGTGCGTATTGGCGTGAACGCCCCGAAAGAAGTTTCTGTCCATCGTGAAGAGATCTACCAGCGTATCCAGGCTGAAAAATCCCAGCAGTCCAGTTACTAAGGTTTTCGCGTCTCACTTTTCGGGTGAGACGCAGCCTCTCTTCTCGCTCGCCCGCGTTGTCCCTTCATGACTTTTTTCAACCTCACTGTTTGATATGGTGCGCTTTTGGCGGCCAGCGTGTCATCAAATTATCTGTTGATAAAACACTCTTTTTGGCTTTTTTACAGACTAATTGAGCGTGAAGTGTGCAAACGATAAAAGCTCTGGAAAAATTGTTTGACTTATAAGTCCCAGAAAGTAATATGTGCGCCACGCAGCGACGATGAGCAGAACAAGTTCTTCGAAGCACTCGTAAGAGGCGTGTGGTGAGGTGGCCGAGAGGCTGAAGGCGCTCCCCTGCTAAGGGAGTATGCGGTCAAAAGCTGCATCCGGGGTTCGAATCCCCGCCTCACCGCCATTTGCATCCGTAGCTCAGCTGGATAGAGTACTCGGCTACGAACCGAGCGGTCGGAGGTTCGAATCCTCCCGGATGCACCATCTCTTACTTGATACGGCTTTCCCGCGGTATCAAAATCTGCAGTAAAGTAAGTTTCCCGATGCATCCGTAGCTCAGCTGGATAGAGTACTCGGCTACGAACCGAGCGGTCGGAGGTTCGAATCCTCCCGGATGCACCATCTCTTACTTGATACGGCTTTCACCGCGGTATCAATATCGGCAGTAAAGCAAGTTTTCCGATGCATCCGTAGCTCAGCTGGATAGAGTACTCGGCTACGAACCGAGCGGTCGGAGGTTCGAATCCTCCCGGATGCACCATATTCTCCGTACTTTCAGTGATGAAGATACGGATAAGATAGCGGTAATCACCGCTGGCACCAGGAAGGATAACGTTGCGTCAGCAACGGCCCGAAGGGCGAGCCGTAAGGCGAGTAATCCTCCCGGATGCACCATCTCTTACTTGATACGGCTTTCACCGCGGTATCAATATCAGCAGTAAAGCAAGTTTCCCAATGCATCCGTAGCTCAGCTGGATAGAGTACTCGGCTACGAACCGAGCGGTCGGAGGTTCGAATCCTCCCGGATGCACCATATTCTCCGTACTTTCAGCAATGCAGATAAGATAGCGGTAATCACCGCTGGCATCAGAAGGATAACGTTGCGTCAGCAACGGCCCATAAGGTGAATAATCCTCCCGGCTACGTCACTGCCCTGCAATCGCTTTTCCCCTTATCCTGAATATACCTCTTCATCTCTATTAGCGACAAAATCAACCAATTGCGATAAAGTACTGTCTGGTTATTCGGTTTGCGAGAACACGATGTACGCACGTTATGCTGGTTTGATTTTTGATATGGACGGTACTCTCCTGGATACTGAACCCACGCATCGCAAAGCGTGGCGTGAGGTTTTGGGGCGCTATGGTCTTCGTTTTGATGAACAGGCAATGGTCGCGCTTAACGGTTCGCCAACCTGGCTTATCGCCCAGTCAATCATTGAGCTTAACCATGCCGATCTTGATCCCCTGTCACTGGCGCGTGAGAAAACCGACGCGGTAAAAAGCATCCTGCTGGACTGCGTAGAACCGCTGCCGCTGGTTGAGGTGGTTAAAGCGTGGCATGGGCGGCGTCCCATGTCGGTCGGTACCGGTAGCGAAAGCGCTATCGCGGAAGCCCTGTTAGCGCATCTGGGGCTGCGTCGTTATTTTGATGCGGTTGTGGCGGCAGATCACGTACAGCATCACAAACCTGCGCCTGACACCTTCCTACTGTGCGCCCAGCGGATGGGCGTCATGCCGACGCAGTGCGTGGTATTTGAAGATGCCGATTTTGGTTTACAGGCGGCGCGGGCGGCGGGAATGGATGCCGTGGATGTCCGACTATTGTGA
Protein sequences of DBSCAN-SWA_3 >LR134233|1055233:1062602|1058514_1059561_+|VEC91100.1|tRNA|DBSCAN-SWA MIEAVAKVTGATDLGNKSLRVIADHIRSCAFLVADGVLPSNENRGYVLRRIIRRAVRHGNMLGAKETFFYKLVGPLIEVMGSAGEELKRQQAQVEQVLKTEEEQFARTLERGLALLDEELAKLQGDTLDGETAFRLYDTYGFPVDLTADVCRERNIKVDEAGFEAAMEEQRRRAREASGFGADYNAMIRVDSASEFKGYDHLELNGKVTALFVDGKAVEVINAGQEAVVVLDQTPFYAESGGQVGDKGELKGAGFTFAVDDTQKYGQAIGHLGKLSAGALKVGDVVQADVDEARRARIRLNHSATHLMHAALRQVLGTHVAQKGSLVSDKVLRFDFSHNEAMKSSEIR >LR134233|1055233:1062602|1055233_1055731_+|VEC91096.1|DBSCAN-SWA MTDSELMRLSEQVGLALKARGATVTTAESCTGGWLAKAITDIAGSSAWFERGFVTYSNEAKAQMIGVREETLAQHGAVSEPVVVEMAIGALKAARADFAVAISGIAGPDGGSEEKPVGTVWFAFASVSGEGITRRECFSGDRESVRRQATTYALQTLWQQFLQNT >LR134233|1055233:1062602|1058005_1058518_+|VEC91099.1|tRNA|DBSCAN-SWA MLGNFSFGDYFKHDAIQFAWELLTGENWFALPKERLWVTVYETDDEAYEIWEKEVGIPRERIIRIGDNKGAPYASDNFWQMGDTGPCGPCTEIFYDHGDHIWGGPPGSPEEDGDRYIEIWNIVFMQFNRQADGTMEPLPKPSVDTGMGWSVLPRYCNTLTLTMTSTCSAR >LR134233|1055233:1062602|1055815_1056877_+|VEC91097.1|DBSCAN-SWA MAIDENKQKALAAALGQIEKQFGKGSIMRLGEDRSMDVETISTGSLSLDIALGAGGLPMGRIVEIYGPESSGKTTLTLQVIAAAQREGKTCAFIDAEHALDPVYARKLGVDIDNLLCSQPDTGEQALEICDALARSGAVDVIVVDSVAALTPKAEIEGEIGDSHMGLAARMMSQAMRKLAGNLKQSNTLLIFINQIRMKIGVMFGNPETTTGGNALKFYASVRLDIRRIGAVKEGDNVVGSETRVKVVKNKIAAPFKQAEFQILYGEGINFYGELVDLGVKEKLIEKAGAWYSYNGEKIGQGKANATTWLKENPATAKEIEKRVRELLLSNQNATPDFAVDDSEGVAETNEDF >LR134233|1055233:1062602|1060593_1060779_+|VEC91104.1|DBSCAN-SWA MLILTRRVGETLMIGDEVTVTVLGVKGNQVRIGVNAPKEVSVHREEIYQRIQAEKSQQSSY >LR134233|1055233:1062602|1062035_1062602_+|VEC91105.1|DBSCAN-SWA MYARYAGLIFDMDGTLLDTEPTHRKAWREVLGRYGLRFDEQAMVALNGSPTWLIAQSIIELNHADLDPLSLAREKTDAVKSILLDCVEPLPLVEVVKAWHGRRPMSVGTGSESAIAEALLAHLGLRRYFDAVVAADHVQHHKPAPDTFLLCAQRMGVMPTQCVVFEDADFGLQAARAAGMDAVDVRLL >LR134233|1055233:1062602|1060227_1060359_+|VEC91103.1|tRNA|DBSCAN-SWA MVAQQVGGKGGGRPDMAQAGGTDAAALPAALASVQGWVSAKLQ >LR134233|1055233:1062602|1056954_1057494_+|VEC91098.1|DBSCAN-SWA MRPFWHFSLLRFFMSEPTSRRPAYARLLDRAVRILAVRDHSEQELRRKLSAPVMGKNGPEEIDATADDYERVIAWCHEHHYLDDERFVMRFIASRSRKGYGPARIRQELNQKGISRESTEKAMRECEIDWSEMAREQAVRKYGEPLPSNFSEKVKVQRFLLYRGYLMDDIQQIWRNFAD >LR134233|1055233:1062602|1059963_1060155_+|VEC91102.1|tRNA|DBSCAN-SWA MQQLKDQAAAQESANLSSKAVDLNGVKLLVSELAGIEPKMLRTMVDDLKNQLGSTVIVLATVC >LR134233|1055233:1062602|1059571_1059976_+|VEC91101.1|tRNA|DBSCAN-SWA MVNAQIRRNLPIETNIMDLDAAKAKGAMALFGEKYDERVRVLSMGDFSTELCGGTHASRTGDIGLFRIISESGTAAGIRRIEAVTGEGAMATVHAQSDRLNDIAHLLKGDSPESGRQVRAVLERTRQLEKSCSS |
10 | Pseudomonas_phage(16.67%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
1573664 : 1587464
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >LR134233|1573664:1587464|DBSCAN-SWA GGTGAGGTACGGACCAAAGCGCCGCTGGACAGCCCGGCATTCACTGGAAACGCCGACCACACCGACGCCGCCAGGCGATGCTAAAGGGCTTCAGACAACAAACGCGGAGTTTGTCCGCAAACTGATTGCCGCGCTGGTTGGTTCCGTACTGGAGCCACTGGACACCCTGCAGGAACTGGCTGACGCGTTGGGAAATGATCCGAACTTTGCCACCACGGTACTGAATAAACTGGCGGGCAAGCAGCCGCTGGACGAAACCCTGACGGCGCTGTCAGGAAAAAGTGTTGACGGTCTTATCGAATACGTTGGTTTGCGAGAAACCATAAGTCGTGCCGCCGATGCATTACAAAAATCACAGAATGGCGGCGATATTCCGGACAAGGATTTGTTTGTGCGTCGTATCGGTGCCGCGCGAGCGTTTGATGGCGTAGTTACTATCGGCTGTGATGATAATCCGTGGACGACGGCGGAGTTTATCGTCTGGCTGGAGTCTCAGGGCGCATTCAATCACCCTTACTGGATGTGTCGTGGCTCCTGGTCTTACGCTTATAACAAAATCATCACGGATACTGGCTGCGGTAATATCTGTCTCGCTGGCGCAGTGATTGAGGTAATGGGAGTGCGTGGCGCGATGACTATTCGGGTGACAACGTCCCATTCAGTATCTGGTTGGTGATACGTGGGTGACAGCCCCAAGTGTATAAGAAGGAATAATTATGACAGCGGAAAAAAAATAAAAAGAACAAACAGTTTTTAAATATTAAAAATTTCATTCCGTATGCACCGGAACCAGATGACACATTATTCGCCGATGCGGCGTATCTTAAATCAGAGGATGGTCAGGACTGGTATGGGTGCCAGCAATTATTTTCAGCAGACACGCTGAAAATTACCTACGACGATAACGATGTTATTACGTGTATTACGCGTGATGTTTCCGGGCTGTGGCCTGCTGGCCAGAGCGTTGCAGAGTTGCCTGATACGGATGAAAACCGTCGCGCTGATATTCATGCTGCTGGCAGTTTAAAGACGGTAAAGTCGTTCAAAGGGTTTATTCGCCGGAAGAGCTGCGCAGGCAGGCAGAATCGAAAATTGAACGCCCGGGCGTTGATACCGGATGATCTGGTCATCGTGGAAAGCGACCCTGAAAAAATCGACACTTTAGCTGTAAAATGACAGTCCCGCCATCCGGTCATCATAACGGATTTTTCTTCTGCACCTTCTGAAGCCCGCCATGTCAGGACGACCATGAATCCGCCGATAACCTTATTGTGAAATTAAGACCAGGAAGAGATGATGTCTGCCGGACAGATACTATATGTAAATTTATAAAGGTTTTTTGTTATGCCCTTTCATATTGGAAGCGGGTGTCTTCCCGCCACCATCAGTAATCGCCGCATTTATCGTATTGCCTGGTCTGATACCCCCCCTGAAATGAGTTCCTGGGGAAAAAATGAAGGAATTTTTTTGCTCAACGCACCAGACTGAAGCGCTGGAGTGCATCTGGACGATTTGTCACCCGCCGGCCGGAACGACGCGGGAGGATGTGGTCAGCAGATTTGAACTGCTCAGGACGCTCGCGTATGACGGATGGGAGGAAAACATTCATTCCGGCCTGCACGGGGAAAACTACTTCTGTATTCTGGATGAAGACAGTCAGGAGATGTTGTCAGTCACCCTTGATGATGCCGGGAACTATACCGTAAATTGCCAGGGGTACAGTGAAACCCATCGCCTCACCCTGGACACAGCACAGGGTGAGGAGGGCACAGGACACGCGGAAGGGGCATCCGGGACATTCAGGACATCCTTCCTCCCTGCCACAACGGCTCCACAGACGCCAGCAGAGTATGATGCTGTCTGGTCAGCGTGGAGAAGGGCTGCACCCGCAGAAGAGTCACGCGGCCGTGCAGCAGTGGTACAGAAAATGCGTGCCTGCCTGAATAATGGCAATGCAGTGCTTAACGTGGGAGAATCAGGTCTTACCACCTTGCCAGACTGTTTACCCGCGCATATTACCACACTGGTTATTCCTGATAATAATCTGACCAGCCTGCCGGCGCTGCCGCCAGAACTGCGGACGCTGGAGGTCTCTGGTAACCAGCTGACTAGCCTGCCGGTGCTGCCGCCAGGACTACTGGAACTGTCGATCTTTAGTAACCCGCTGACCCACCTGCCGGCGCTGCCGTCAGGACTATGTAAGCTGTGGATCTTTGGTAATCAACTGACCAGCCTGCCGGTGTTGCCGCCAGGGCTACAGGAGCTGTCGGTATCTGATAACCAACTGGCCAGCCTGCCGGCGCTGCCGTCAGAATTATGTAAGCTGTGGGCCTATAATAACCAGCTGACCAGCCTGCCGACGTTGCCGTCAGGGCTACAGGAGCTGTCGGTATCTGATAACCAACTGGCCAGCCTGCCGACGCTGCCGTCAGAATTATATAAGCTGTGGGCCTATAATAATCGGCTGACCAGCCTGCCGGCGCTGCCGTCAGAATTATGTAAGCTGTGGGCCTATAATAACCAGCTGACCAGCCTGCCGACGTTGCCGTCAGGGCTACAGGAGCTGTCGGTATCTGATAACCAACTGGCCAGCCTGCCGACGCTGCCGTCAGAATTATATAAGCTGTGGGCCTATAATAATCGGCTGACCAGCCTGCCGGCGTTGCCGTCAGGACTGAAGGAGCTGATTGTATCTGGTAACCGGCTGACCAGTCTGCCGGTGCTGCCGTCAGAACTGAAGGAGCTGATGGTATCTGGTAACCGGCTGACCAGCCTGCCGATGCTGCCGTCAGGACTACTGTCGCTGTCGGTCTATCGTAACCAGCTGACCCGCCTGCCGGAAAGTCTCATTCATCTGTCTTCAGAGACAACCGTAAATCTGGAAGGGAACCCACTGTCTGAACGTACTTTGCAGGCGCTGCGGGAGATCACCAGCGCGCCTGGCTATTCAGGCCCCATAATACGATTCGATATGGCGGGAGCCTCCGCCCCCCGGGAAACTCGGGCACTGCACCTGGCGGCCGCTGACTGGCTGGTGCCTGCCCGGGAGGGGGAACCGGCTCCTGCAGACAGATGGCATATGTTCGGACAGGAAGATAACGCCGACGCCTTCAGCCTCTTCCTGGACAGACTGAGTGAGACGGAAAACTTCATAAAGGACGCGGGGTTTAAGGCACAGATATCGTCCTGGCTGGCACAACTGGCTGAAGATGAGGCGTTGAGAGCAAACACCTTTGCTATGGCAACAGAGGCAACCTCAAGCTGCGAGGACCGGGTCACATTTTTTTTGCACCAGATGAAGAACGTACAGCTGGTACATAATGCAGAAAAAGGGCAATACGATAACGATCTCGCGGCGCTGGTTGCCACGGGGCGTGAGATGTTCCGTCTGGGAAAACTGGAACAGATTGCCCGGGAAAAGGTCAGAACGCTGGCTCTCGTTGATGAAATTGAGGTCTGGCTGGCGTATCAGAATAAGCTGAAGAAATCACTCGGGCTGACCAGCGTGACGTCAGAAATGCGTTTCTTTGACGTATCCGGCGTGACGGTTACAGACCTTCAGGACGCGGAGCTTCAGGTGAAAGCCGCTGAAAAAAGCGAGTTCAGGGAGTGGATACTGCAGTGGGGGCCGTTACACAGAGTGCTGGAGCGCAAAGCGCCGGAACGCGTTAACGCGCTTCGTGAAAAGCAAATATCGGATTATGAGGAAACGTACCGGATGCTGTCTGACACAGAGCTGAGACCGTCTGGGCTGGTCGGTAATACCGATGCAGAGCGCACTATCGGAGCAAGAGCGATGGAGAGCGCGAAAAAGACATTTTTGGATGGCCTGCGACCTCTTGTGGAGGAGATGCTGGGGAGCTATCTGAACGTTCAGTGGCGTCGTAACTGATGCACCAGGTGAATGAGGTGCGGTGCGACAAAGATATTCCCGGACGAACAACATCAGACAGTACGGATGATGTACAGGTGAAATATCCGGATGACGGCTAATCAGGCGTATCAGCAGTTAGCAAAGCTGGGTGTTGTTGAACATCGTGAGCGTTACAGTCGCTCCGCGATTAACGGCATTAAAAAATTCTGGTCGCTGACGGCAAAAGGCTGCATGTTCGGCAAAAACATCACCAGCCCGGCAAACCCTCGCGAGACGCAGCCGCATTTCTTCGAGTCCAAATTCCCTGAGCTGCTGAAGCTGCTCGATACCGTTCATTGAGGTGATCGTGAGAGCGTTACTGACCCCTGAAATTGCTCCTCGTATGGGCGTTGTATTGTTCAGGCCGGGATCGGAACTGATGCCCCTGTTTATGCAGGGGCGTGTTCTGCTTGAACCAGAGCCGGAGCAATTTTCATCTTTCGCCAGCGGCGCGGTCCCGGCGGTATCACAGCCGCTGGCGGATGATCCTGCTGTTCGTGATGTGTTCTGTAATGAGTCGGTTATCTATCGTGCTGGTGGTCTGGATAGTCTGGAAAGCTGGCTACTCCGGGGGAATGGCTGTCAGTGGCCGCATTCAGACTGGCACAGCGAACAGATGACAACCATGCGCCACGCTCCGGGGGCAATCCGACTGTGCTGGCACTGCGATAACCTGCTGCGCGAACAGTTTACGGAACGGCTGAAATCAATAGCTGTGGAGAACACGACAAAATGGGTTTTATCGGTTGTTTGTCGTGATCTGGGTTTTGACGATATGCACGCAGTTACTCTCCCGGAACTGTGCTGGTGGATGGTACGCAATAACCTGGCAGAAGTCTTACCGGAGAGCGCTGCGAGAAAAGCATTAAGGATGCCGAAGGCAATTGTCCAGTCAGCTACCCGTGAAAGTGAAATTGTTCCCTCGGTGCTGGCCACCAGCATTGTACAGGATAAGGCGAAAAAGGTACTGGCGCTCAGGGTTGATCCGGAATCGCCGGAAAGCTTCATGTTACGTCCGAAACGCCGTCGATGGGTCAATGAGAGATATACCCGCTGGGTTAAATCCCAGCCGTGCACCTGCTGCGGGAAGCAGGCGGATGATCCGCACCACCTGATAGGCTACGGTCAGGGAGGGATGGGAACAAAGGCGCATGACCTCTTTGTGCTGCCGTTGTGCAGAACGCATCACAATGAGTTACATGCGGACACCGTGGCATTCGAAGAGAAATACGGCTCTCAACTGGAGTTGATATTTCGTTTTATCGATCGCGCGCTGGCCTGATTTGGTGGAGAAAGTTGATGCGTGATATTCAGATGGTTCTTGAGCGTTGGGGGGCATGGGCGGCGAGTGATAGCTCAGGAGTGGACTATTCGCCTATAGCTGCTGGGTTTAAAGGGCTTCTTCCCTATACCTGCAAGACACGTGTGGCTTGTTCTGATAATGACGCATTAATTGTTGAGGGGTGTCTTGCTCGTTTAAAGCAAAAAAGGCCTGATGAGCATTCGCTTCTTGTGGCACATTATTTATACAGAATATCCAAGCGTAAGATTGCAAAGGTGCGTGGGAAGGATGAGAAATTGGTACGTATAGAAATACAGTTAGCCGAAGGATTTATTGATGGCTGCCTTTCAATGCTGGATCTAACATTAGATATGGACGTTTAATAATACGCCCCATGCAGGGGCGTATTATTTACTGGATGAATGACATTTGATTAATATATTTTATCATTAACTCTCTGGGAGTAGTGCTCCAAAATTTTAGGTGTTCATAATCAGTATAAACATTTGTGAACTTTTTAAGTTCCTCATCACTTTTAGGGGCAAATCGCTCATTAAGAGTAATGCTGTCTTTAATTATTCCGAAGAATATAATTGATATTTCAGGGATTTGCGATCTTCTCTCATAAGAATAAGGTATTTCTTTTATGTTAAGCAGGTCTGCTAAATAAGATATAGATTCAGCGGTAATTTCTTGTTTTAGTTTTTCCTTGTCACCAAAACAATGTATGAAGACAGCTACGACAACGATCATATTTATAAAAGGATCATTGCTTTTGCACTCATTGTCGTTTAACAGTCGGAAAATGTTAAGATTGCGTGAAAGGTTTGTGTTTCACGTAAGGATAAGTTGGTTCGTTGAATTAAATCACAAATAAAGCTACCAACTAAGCTGTTGATTTTATTTAGTAATGTAGTTTCACCTACAAGGTGATCCCAATATATAACCGAGGTTTTACATACGTTATGCCCATTTATTAAGCATGTGTCTGGAAGAGTAATGGTATATTTTATAAACTTGTCAAGATACTTTTGTGAGTTAATGCTATAACCATAAATATGATTTATAGATGCTTTTAGTTGTTCAGTGTTTGTAACTAAAATAAAAAATACGTTATTGATATCAAAGATGTGTTTTATTGTTTCAATGATATTTGTTGAAAAACTTGGCTTACATCGGTCTAATTCATCAATAATTATCACTATCTTTTGCTTGTTTGATATATCTTCAATGCAAGATTTTAGAGAGTTTATGTTTTTCTCTGATTCCATGTGGTCTTCAAGTATATTTTCAATAGTCCCATCTATTGCTGCATTGCTTGCTTTCTTCATCGCATCTTGGAATTCTTCGGCAACTTCACTAGCCTCCTGTCGTAAAAACCAACCTGCACCAGTTTTTAGTACAGTTTTTAAACCAAATCGAATTGCAGGAAGAGATCTCTTAATGAAGTTTTGTTTTTCCTCCTCCGGCAAAATGCTCGCAATAGCTGATGTTATTAGAAGTAATGGAGATTCTGCATGATCCCCTTTAAAGGCATCAACATAGACAACTTTAGACTCAGTTTCTTGCTCGATAATGAGATTCTTCAGTTTGATACTAAATTCTGATTTCCCTGTCCCCCATGCACCGTCTATTACCAGTGGTGAAATGTCTGCCTCTGGTTTTAGCAATTTGATGATATTTTCAGCGATGTTTCTTCGTTGGAACTCGTCACGTTCAGTGAATGATAGTGTTTCTAACATAATATAAGCCTGTTAACTCCTACAGTAAAAAACGCATAATACGACTTTGGGATTAAAAATCATTAACGCGGTCCGCAAAAATTCTTGTAATCTGCTAAGAGTGGTTACTTTGCCACGCAGCTTAAACCCGCCGTCGAGCGGGTTTTGTCGTTTCTGGGCCTGGGATTCGTTGGGCCTGGCCTATCCCGCAGTTATCGATTGGCTCGGCTTCTTTTACGTTTCCGCTTCTGATTTGCGGTACGTGGTATTCCCTCAATTTGCACCTGCTGTATCAGCGAGGTGAGAGATAACTACAAATGCCTCATAACCCAAATACCTGGCCTGATTTACTTGAATTGTTACAGAGCTGGTGGCGTGGAGACACACCGCTGGGCGCAGTCATTATGTCGATTGTTATGGCTGGTTTGCGCATCGCCTATTTTGGCAGTGGTGGCGGCTGGAAGCGAAAAATGCTCGAGATTTTGCTCTGTGGCGCTCTGACACTGACCTTTGCATCTGCTCTTGAGTATGTCGGATGGATGAGCCGCAACGAAGCACGCGCGTTTGAGGACATGAATCCGAAAGACGGCCTTGATGAAATGCTGGTCAGCGTCAACGCCTCCCGGCCAGCCAAAACCACAATCTAGGAGAACACTCAAGATGAGTGAACGTGAAATTCGCTGTTACAGCGGCGAGGTGCGCGCAGAAACGCACGACAGCGAGCCCAGCCGGATCATCGGGTATGGTTCGGTCTTTGACAGTCGTTCTGAACTGATTTTCGGTTCGTTTCGCGAAATCATCCGGCCCGGTGCGTTTGATGAAGTGCTGAATGACGATGTACGGGCGTTGTTCAACCATGACCCCAATTTTATCCTGGGTCGCAGAAGTGCGGGCACGCTGGCACTGACGGTTGATGAGCGGGGGCTGCGTTATGACATCACCGCGCCAGAAACTCAGACAATCCGTGATCTGGTGTTGGCACCAATGCAGCGCGGGGATATCAACCAGTCCTCTTTTGCATTTCGCGTCGCCCGCGACGGAGAGGAATGGTACCAGGACGAGGATGGTGTGGTGATTCGTGAGATTACCCGTTTTTCCCGTCTGCTGGATGGCAGACCTGTACAAGCACCTGCGATTCTGCGATCAGAACGTGGGGCGATACTGTTGCTGAATGTGTGATTGATAAGCTTTGTCCGTCACATACGGTTGTTGTTTTTGCTTATCCGGAAGGAAAAGAGAATGCACAGAATTGATACGCCCACTGCGCAAAAAGATAAATTTGGTCAGGGGAAAAACGGATTTACGAATGGTGATCCCGCCACGGGCCGCCGCGCAACGGATCTCAACAGTGATATGTGGGATGCAGTCCAGAAAGAGGTCTGTACTGTTATTGAAGCCGCCGGCATACCACTCAGTAAAGGCGAACATACGCAGCTTCACGCCGCCATTGGCAGGTTGATCTATGAACAGGTTAAAACCCGTCTTGAAAAAAATCAGAATGGCGCGGACATCCCGAATAAGCCGCTGTTTCTCCAGAACGTTGGTTTAGTAGACGTTCTTTTTAAAGGTGACGGGCGATTCCTCGCGGGAACGTTTGTCAGTGACGCAATTGACCGAACATCAATCGGCGCGCGTGCGGCTACAGGCTGTCAGTTTATGCGTGCGCATCAGGCCCCCGACGCGCCAGACCAGGTAAGTTTCTGGCAAATTATTACCCTTAGCGAGGTGGTAAGTCCGACCACTGTTGTGGATGTTCTTGCAGTCAGTGGCAATAACGTATTGTTTGGTCACGGAACAGGAACGGGTATTACCTCATGGCGTCAAGTGGCGATGCTGGAGGGGGGCGCCTTTACGGGGGGTATTTCTGCACCAAATATGCGTGGCGATACCCTAGTTACGGTTGGGGATGGCACTGGTGGGATGGCTAAAGGTGACGTTGATGGTGCAGGTTTTAATGGTAACAATCTGAACATTAAGTCATGGAATGGTATCGGATTTCAGAACTCAGAAGACCTGGCTATCCGGGCATATATCAGCACCCGACTCGGTGTTATCGCAGATGCTGAAAACTTGCAGGCTGGAAGTGCGATATTCAACAAAAATGGCGATGTATACGGTGATATATGGGGCGGTGGTAGCGGACCGGGCTGGCTGAGCGCGTTTGTGGCTTCTAAGCCAGCGCGTCAGTACATCACCATGGTTGGTGTGTACCAGAACGACAAAACAAAGCCATTTATGCTTCATGATGATGGGTCTGGTGTATTCCTTGCTACAACTGACATGCTGAGTGGGTATGTTCAGTCAATTCGATTCGGTGCCGTTGAGCATGGAAACTTATATCGTTCGCCCGGATTTGCAGACCAGTTAGGTTACGTCATTACAGGTGTTGAGAATGGAGACTCGAACGATACACCAGACAGGATCCAACGACGCCTGTTACAGCTTAAAGTGAATGGCCAGTGGTATACGGTGGGAGCATAAAAATGAGGCATTTTAAAAATTTCACTAAAACAACGGAATTAACCCTTGTTCAGCAGAAATTATCAGAGAATTGCAGTATTCAGTTTATCCAGGATGAATCAGGTGTTGACTGGTATGTGTTACAGAAGTTATTCCAGCCCGATACGCTGAAAATCCAGTATGACAAAACAGGGCTGATTATTGCTGCGGATAAAGATGCGACTAAGCTATTTCCGCTGAATTGCTCTGTTGTGGAATTCGCTGATACTGATATTCCCGATGGTTTTCAGCCTGGGAATTTTACCTACAGCAACGGCGTTATTGCCCCTGTTCAGATTGATTATGTTGCTTTAGCTACGGCAGAGCGCGACAGGCGTATGACGTCGGTCACAGCAAAAATCAATCAGCTTGTAGAAGCTCAGGACGATGGCGATATAACGGCTGCTGAATTGTCCGAACTCACTGCGCTACGTGAATACCGAACAAAGCTGCGCCGACTGGCGCTAGATGTTGCACCTGATATTAACTGGCCGGAATATCTGAATAAATAGATAAAAATCATGTGCTCGAAACGTTCAATTGAAATCGGCCGCACAACGGGACGTGGGCAACGGCGCGAATCAGAACCCGGACATTTCTTCATTTGTTCTCAGGGACAACCCTGCCGGGATCTATACCTCTCTGCCCGGTGGCGCAATCTTTCAGGCTTTAAATTGCTTTGTTCCCGGGAATAGCCCTTCAGGGTATGTTTTTCCATTACCAACAACATTTCCTTATGTATTCAAGGCTGCCACGCTTACCCCTCAGGATGCACAAGGTGATATTACGATATCCCCAACATATTTGATTGAGAACAACGGTGCTATTCGTATATTTAATCCATCCGGTGGCGATAATTCCATATCTGTAATTTACATGGGGTATTGACATGAGCATAATATGTTTTTCCTCGACATCAAACACATATTTTTCAGAGGAGAACATTCCAGCTTATAAGGCCAGTGGGTTGTGGTCGAATGATTTTTATGCTCTGACCGATGAAGAAGTTGATAAGTATTATATGAAAACTCCACCAAAGGGAATGTATCTCGGTTCATCAAATGGACGTATTGCCTGGGTTTGTATTCCACCTCCAACGCAAGATGATTTAATTTCTGCCGCAAATCAGGAAAAGAAGAAGAGAATCGATCAGGCCAACGAGCACATGAACAGCAGGCGGTGGCCGGGCAAAGCGGCGCTCGGGCGCCTGACGGGCGACGAACTGGCGCAGTACAATCTGTGGCTGGATTATCTCGATGCGCTGAAAGCGGTTGATACATCGGTTGCACAAAATATAGCATGCCCCATCAGAAAAGCTATACACATAAAATAATTTGCATTATTGTTGTAGTTTTATAAAATAAAAAGAGGGGCAAGCCCCTCTGTTTATTTTGAAATCTGCTTTTTCACTTCCTCAATAAACCACCCTGAGCCTTCTGGTGTTAGGTGCGCATTGTCATATTGTATAGGGTAAGCAATTCTATTCCCTATTATTGCTTTGCAATAACTTTCTGTACACATAGTCTCAAGTGGTGAAATATAAGTAAGGGAATGCTCTTTCGCCAGCTCTCTAAGATATTTGTCATTATCTCTCAGATTACGGGTTTCATCCGTCATGCTCCATGGAACGGTCCTTCCGGAATTAATCCCCATATCCTCAATAGTATCAATCATCGTTTTCTTCCAAACCGGGAACGGGCCAACAATGATGATGTTTTTTACTTTGTTATCCTTCAGGAATTTAATAGTCTCAGGCAGATAATCACGCATAGGATAGACAGGCCACAACGCGGACATTAATACCGTAGTAGGCTTGTTGTCTGAAATTTCCTTGGCTACCATATCATTGATGTCTTTGCAATACGGCCTGTCATCTTTTTGAAGCCCAATGATTGGTGGGCACAAGCTTGCAGTTCTCTGCGTAATGTTAAGTGAATTTCCAAATACCGATTTCAGCCCTGGCATAAGATGTGCGGCATGCGAGTCACCCCATACAACAAAAGACTTTTCAGTCATTTTATCCTGACATTTTGAGAATGCTGAATAATCTTGATCTGGATTGAGGAAGCAAATATCAGGCCTCCAGGGAGAGTTGTCCATACGATACTCAACGACTTGCTTTAGGGTGTCTGAAAATCTAAAACTAACACCTTTTGTGAACATGACAAACAAGCATAGAGCCAGAGTAGAGGAAAATAATACAATATTAAATTGTAGCTTAACTCGTTTCCTGAGCGTGTTTTCAATTGTTCTGTATGAAATATCCCCAAGCGCAAAAGACACAATTACACCAAAGAAAATATTTATAGCGCTGAACTCAATATCATAATGTTTCATCGCTACAATTACAGGCCAGTGCCATAAATAAACTGAGTAAGAAATCTTACCCACCCACTGTGCAATTCTGTTTGATGTGAATAAAGAGTTCTGTTTGTTAGCCAGAATAACCATCGAAGCACCCAGCACGGGAGCGAGTGCTGAATAGCTTGGCCAGTAACCATTACTGTGCAGTATAACAACAGCAACAACAATGAGGACTATCCCATAAACCTCACAATGCTTTATCCATTCTGGCATTTTGTAACGTACAGATGCTATGTATACAAGGCCGCCAGCCAGCATTTCCCATGCCCTGGTAGGGATAAGATAGAAAATATCTTCTTTGGTTCCAGTCACTCGCATAAGTGTAATTGCAAGTGACATGGCTAAAATTACAGATAATGAGAGTCCAACAGGAAACCGTAATTTTTTAACTATAATGACTAATAAAGGATATAAAATATAAAATTGCCACTCAACTGATAGTGACCAGGTGTGAAGCAAAAAGTTGAACTCTGATGATGAGTCGAAATAACTAGAGTGAATTGCGTAATAATTATTTGAATAAAAAAGTAAAGACGATATTGCGTTCTTGCTAAGTGCTTCGTATTCATTTGTACTTAGTGTAAATAAACCGAAAATCATCAACAAGAGAATTGCAAATACCAGAGCTGGTACAATTCTTAGGAATCTTGCAATATAAAAATCAAGTACTCCTTTGTGGTCTACGCGTTCAAGGACAATTCCAGTCATAAGAAAACCAGAAATTACAAAGAAAAACATCTACACCTATAAAGCCACCTGACACATAAGGTACACCGAAGTGATACAGCACAACACTTATCAATGCAAAAGCTCTTAGCCCATTTATATCGAGTCTGAATTTCTTGTAGATCATTTTTTACCTATAATTCCAAGGTGCCACACAAGACGAAATTAAGATAATGGATGATAGACTTTTTTATAAAAGTCGTCACCTAAAATTGACCTGTTACATCAACTACATCAACTACATCAAAGAAGGTCACTGACCAGCGCATCTTGCTGGTTATTGATACGTCCGTTTCGCCACGCGACGACTCATTGCTGGTTTGCATAGATGGAGAGGAGTTCAGGATTAAGCGGTACCGGACACATTCGCAATCGCACCCTGTAAATTTTGAGAATGGGAAAAGGGAACGTTTACGAAAATGAAAATCCGCCACAGCGCGATCAAAACGTAACTATGGCGGATATTGAAAAACTTTACTCGCTCACTAACCAGAATTCAGCCTCTTCAAACATTTCCTGAACAGTACGGCTTATCTGTTCCTTCTCATGCTTGCTGGCGTCAGTATTGATCGCCGGTAGTGTCATCATCGGTTTAACCCGAACATCAGCATCGGGGAAGATCCGGTGAACCCTCTTAGTCAATTCTCCCAGAATGATATCTTTTGCTCCGGGCAAACCATCAATATTGAAATAGCCTGGTATGCTTCGATACAGCAGGAACCTAATGGCCGGAAGACCGTCACCACACAGCGGTTTGTCCAGCAACTGAGCAAGGTTAACTGGAACTGGACGATGAAGCAGGCTAACGAATGGATCGAGTGGTATGTGACAACATTCCGCGATGTATCAACGCAGGAAGGGGAGACGTACCTTTCAGCTGTTCAATCCAAACGGAGGACTATAGCCATGGGCTTCCCTTCACCTGCCAGTGATTACGTTGAAACAAGGATCTCCCTCGATCAGCAGCTAATCAGCCAGCCCGCAGCGACTTATTTCATGCGGGCATCGCGTTCACATTTCAGGGAAGGGATAATCCAGGGGGCGCTACTTGTTGTGGACGCGTCGCTTTCAGCCTGTGATGGCTCGCTGCTGATATGCGCGATAGATGGGGAATTCAGGATCAAGCGATACCGAACTCACCCTCAGCCCCACCTGATAAATCTGGAGAACGGGAGAAAGGAAGCGCTGCCAGAAGATGGTGATGGCTACAATTCTTCACACGCAATATTTGGAGTTATCACGTACATCATTAATGATGCCAGGAACGCGGAGTTTGATGACTGCCCGGTGATGTGA
Protein sequences of DBSCAN-SWA_4 >LR134233|1573664:1587464|1586130_1586295_-|VEC91604.1|DBSCAN-SWA MIYKKFRLDINGLRAFALISVVLYHFGVPYVSGGFIGVDVFLCNFWFSYDWNCP >LR134233|1573664:1587464|1575116_1577555_+|VEC91592.1|DBSCAN-SWA MKEFFCSTHQTEALECIWTICHPPAGTTREDVVSRFELLRTLAYDGWEENIHSGLHGENYFCILDEDSQEMLSVTLDDAGNYTVNCQGYSETHRLTLDTAQGEEGTGHAEGASGTFRTSFLPATTAPQTPAEYDAVWSAWRRAAPAEESRGRAAVVQKMRACLNNGNAVLNVGESGLTTLPDCLPAHITTLVIPDNNLTSLPALPPELRTLEVSGNQLTSLPVLPPGLLELSIFSNPLTHLPALPSGLCKLWIFGNQLTSLPVLPPGLQELSVSDNQLASLPALPSELCKLWAYNNQLTSLPTLPSGLQELSVSDNQLASLPTLPSELYKLWAYNNRLTSLPALPSELCKLWAYNNQLTSLPTLPSGLQELSVSDNQLASLPTLPSELYKLWAYNNRLTSLPALPSGLKELIVSGNRLTSLPVLPSELKELMVSGNRLTSLPMLPSGLLSLSVYRNQLTRLPESLIHLSSETTVNLEGNPLSERTLQALREITSAPGYSGPIIRFDMAGASAPRETRALHLAAADWLVPAREGEPAPADRWHMFGQEDNADAFSLFLDRLSETENFIKDAGFKAQISSWLAQLAEDEALRANTFAMATEATSSCEDRVTFFLHQMKNVQLVHNAEKGQYDNDLAALVATGREMFRLGKLEQIAREKVRTLALVDEIEVWLAYQNKLKKSLGLTSVTSEMRFFDVSGVTVTDLQDAELQVKAAEKSEFREWILQWGPLHRVLERKAPERVNALREKQISDYEETYRMLSDTELRPSGLVGNTDAERTIGARAMESAKKTFLDGLRPLVEEMLGSYLNVQWRRN >LR134233|1573664:1587464|1580896_1581226_+|VEC91598.1|holin|DBSCAN-SWA MPHNPNTWPDLLELLQSWWRGDTPLGAVIMSIVMAGLRIAYFGSGGGWKRKMLEILLCGALTLTFASALEYVGWMSRNEARAFEDMNPKDGLDEMLVSVNASRPAKTTI >LR134233|1573664:1587464|1581818_1583060_+|VEC91600.1|DBSCAN-SWA MHRIDTPTAQKDKFGQGKNGFTNGDPATGRRATDLNSDMWDAVQKEVCTVIEAAGIPLSKGEHTQLHAAIGRLIYEQVKTRLEKNQNGADIPNKPLFLQNVGLVDVLFKGDGRFLAGTFVSDAIDRTSIGARAATGCQFMRAHQAPDAPDQVSFWQIITLSEVVSPTTVVDVLAVSGNNVLFGHGTGTGITSWRQVAMLEGGAFTGGISAPNMRGDTLVTVGDGTGGMAKGDVDGAGFNGNNLNIKSWNGIGFQNSEDLAIRAYISTRLGVIADAENLQAGSAIFNKNGDVYGDIWGGGSGPGWLSAFVASKPARQYITMVGVYQNDKTKPFMLHDDGSGVFLATTDMLSGYVQSIRFGAVEHGNLYRSPGFADQLGYVITGVENGDSNDTPDRIQRRLLQLKVNGQWYTVGA >LR134233|1573664:1587464|1586792_1587464_+|VEC91605.1|DBSCAN-SWA MNPLSQFSQNDIFCSGQTINIEIAWYASIQQEPNGRKTVTTQRFVQQLSKVNWNWTMKQANEWIEWYVTTFRDVSTQEGETYLSAVQSKRRTIAMGFPSPASDYVETRISLDQQLISQPAATYFMRASRSHFREGIIQGALLVVDASLSACDGSLLICAIDGEFRIKRYRTHPQPHLINLENGRKEALPEDGDGYNSSHAIFGVITYIINDARNAEFDDCPVM >LR134233|1573664:1587464|1583967_1584411_+|VEC91602.1|tail|DBSCAN-SWA MSIICFSSTSNTYFSEENIPAYKASGLWSNDFYALTDEEVDKYYMKTPPKGMYLGSSNGRIAWVCIPPPTQDDLISAANQEKKKRIDQANEHMNSRRWPGKAALGRLTGDELAQYNLWLDYLDALKAVDTSVAQNIACPIRKAIHIK >LR134233|1573664:1587464|1584464_1586180_-|VEC91603.1|DBSCAN-SWA MFFFVISGFLMTGIVLERVDHKGVLDFYIARFLRIVPALVFAILLLMIFGLFTLSTNEYEALSKNAISSLLFYSNNYYAIHSSYFDSSSEFNFLLHTWSLSVEWQFYILYPLLVIIVKKLRFPVGLSLSVILAMSLAITLMRVTGTKEDIFYLIPTRAWEMLAGGLVYIASVRYKMPEWIKHCEVYGIVLIVVAVVILHSNGYWPSYSALAPVLGASMVILANKQNSLFTSNRIAQWVGKISYSVYLWHWPVIVAMKHYDIEFSAINIFFGVIVSFALGDISYRTIENTLRKRVKLQFNIVLFSSTLALCLFVMFTKGVSFRFSDTLKQVVEYRMDNSPWRPDICFLNPDQDYSAFSKCQDKMTEKSFVVWGDSHAAHLMPGLKSVFGNSLNITQRTASLCPPIIGLQKDDRPYCKDINDMVAKEISDNKPTTVLMSALWPVYPMRDYLPETIKFLKDNKVKNIIIVGPFPVWKKTMIDTIEDMGINSGRTVPWSMTDETRNLRDNDKYLRELAKEHSLTYISPLETMCTESYCKAIIGNRIAYPIQYDNAHLTPEGSGWFIEEVKKQISK >LR134233|1573664:1587464|1573664_1574339_+|VEC91591.1|tail|DBSCAN-SWA MRYGPKRRWTARHSLETPTTPTPPGDAKGLQTTNAEFVRKLIAALVGSVLEPLDTLQELADALGNDPNFATTVLNKLAGKQPLDETLTALSGKSVDGLIEYVGLRETISRAADALQKSQNGGDIPDKDLFVRRIGAARAFDGVVTIGCDDNPWTTAEFIVWLESQGAFNHPYWMCRGSWSYAYNKIITDTGCGNICLAGAVIEVMGVRGAMTIRVTTSHSVSGW >LR134233|1573664:1587464|1578875_1579241_+|VEC91596.1|DBSCAN-SWA MRDIQMVLERWGAWAASDSSGVDYSPIAAGFKGLLPYTCKTRVACSDNDALIVEGCLARLKQKRPDEHSLLVAHYLYRISKRKIAKVRGKDEKLVRIEIQLAEGFIDGCLSMLDLTLDMDV >LR134233|1573664:1587464|1581239_1581758_+|VEC91599.1|head,protease|DBSCAN-SWA MSEREIRCYSGEVRAETHDSEPSRIIGYGSVFDSRSELIFGSFREIIRPGAFDEVLNDDVRALFNHDPNFILGRRSAGTLALTVDERGLRYDITAPETQTIRDLVLAPMQRGDINQSSFAFRVARDGEEWYQDEDGVVIREITRFSRLLDGRPVQAPAILRSERGAILLLNV >LR134233|1573664:1587464|1577645_1577876_+|VEC91594.1|DBSCAN-SWA MTANQAYQQLAKLGVVEHRERYSRSAINGIKKFWSLTAKGCMFGKNITSPANPRETQPHFFESKFPELLKLLDTVH >LR134233|1573664:1587464|1583062_1583590_+|VEC91601.1|tail|DBSCAN-SWA MRHFKNFTKTTELTLVQQKLSENCSIQFIQDESGVDWYVLQKLFQPDTLKIQYDKTGLIIAADKDATKLFPLNCSVVEFADTDIPDGFQPGNFTYSNGVIAPVQIDYVALATAERDRRMTSVTAKINQLVEAQDDGDITAAELSELTALREYRTKLRRLALDVAPDINWPEYLNK >LR134233|1573664:1587464|1577554_1577656_+|VEC91593.1|DBSCAN-SWA MHQVNEVRCDKDIPGRTTSDSTDDVQVKYPDDG >LR134233|1573664:1587464|1577883_1578858_+|VEC91595.1|DBSCAN-SWA MRALLTPEIAPRMGVVLFRPGSELMPLFMQGRVLLEPEPEQFSSFASGAVPAVSQPLADDPAVRDVFCNESVIYRAGGLDSLESWLLRGNGCQWPHSDWHSEQMTTMRHAPGAIRLCWHCDNLLREQFTERLKSIAVENTTKWVLSVVCRDLGFDDMHAVTLPELCWWMVRNNLAEVLPESAARKALRMPKAIVQSATRESEIVPSVLATSIVQDKAKKVLALRVDPESPESFMLRPKRRRWVNERYTRWVKSQPCTCCGKQADDPHHLIGYGQGGMGTKAHDLFVLPLCRTHHNELHADTVAFEEKYGSQLELIFRFIDRALA >LR134233|1573664:1587464|1579649_1580600_-|VEC91597.1|DBSCAN-SWA MLETLSFTERDEFQRRNIAENIIKLLKPEADISPLVIDGAWGTGKSEFSIKLKNLIIEQETESKVVYVDAFKGDHAESPLLLITSAIASILPEEEKQNFIKRSLPAIRFGLKTVLKTGAGWFLRQEASEVAEEFQDAMKKASNAAIDGTIENILEDHMESEKNINSLKSCIEDISNKQKIVIIIDELDRCKPSFSTNIIETIKHIFDINNVFFILVTNTEQLKASINHIYGYSINSQKYLDKFIKYTITLPDTCLINGHNVCKTSVIYWDHLVGETTLLNKINSLVGSFICDLIQRTNLSLRETQTFHAILTFSDC |
15 | Salmonella_phage(35.71%) | holin,tail,protease,head | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
1659520 : 1668689
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >LR134233|1659520:1668689|DBSCAN-SWA GATGATTGAATTTAACCATGTTAGTAAAACCTTCGGCGATCAACAGGCTGTTAGCGACCTCAATTTGCACTTTAGCGAAGGCAGCTTTTCGGTGTTAATTGGCACCTCCGGCTCGGGAAAATCGACCACTCTGAAGATGATTAACCGGCTGGTAGAGCATGATAGCGGAACGATCCGTTTTGCCGGGGAAGAGATCCGCAGCCTGCCGGTGCTTGAACTGCGCCGTCGCATGGGCTATGCCATTCAGTCTATCGGTCTTTTTCCCCACTGGACGGTGGCGCAAAATATCGCCACCGTACCGCAACTACAAAAGTGGTCGCGTGCGCGGATTAACGATCGTATTGATGAACTGATGGCATTATTGGGTCTGGAAAGCGCGCTGCGCGATCGTTATCCGCATCAGCTTTCCGGCGGGCAACAGCAGCGGGTCGGCGTTGCGCGGGCGCTGGCTGCCGATCCGCAGGTATTGCTGATGGACGAGCCTTTCGGCGCGCTTGATCCGGTAACGCGCGGCGCATTGCAGCAGGAGATGACCCGCATTCATCAGCTGCTGGGGCGCACCATCGTACTGGTGACGCACGACATCGACGAGGCGCTACGCCTCGCCGACCATCTGGTGCTGATGGACGGGGGCAATGTTATCCAACAGGGATCGCCACTTTCTATGCTGACCTCGCCGGAAAATGATTTCGTGCAGGCATTTTTTGGCCGCAGCGAGCTGGGCGTAAGGCTGCTTTCGTTACGTAGCGTAGGCGATTATGTACGTCGGCATGAACAGCTCAGCGGCGACGCGCTGGTGGAAGAGATGACGCTACGCGATGCGCTATCGATGTTTGTCGCCCGTCGGTGCGACGTCCTGCCGGTGGCGAATCAGCAGGGCGAGCCCTGCGGTACGCTCCATTTCCGCGATCTGCTTTCGGAGACGTCCCCCCGTGAAACGACTGTGTGATCCGCTTCTCTGGCTTATTGTTCTGTTCTTGCTTCTGCTGTTTGGATTGCCTTATAGCCAGCCGTTCTTCGCCGCGCTGTTTCCCGATTTACCGCGCCCGGTCTACCAACAGGAGAGTTTTGCCGCCCTCGCGCTCGCCCATTTCTGGTTGGTGGGCATCTCAAGTCTGTTTGCCGTCGTGGTGGGCGTCGGCGCAGGGATTGCGGTCACGCGAGAAAGTGGGAAAGAGTTTCGTCCCCTGGTGGAGACTATCGCCGCCGTCGGGCAGACCTTTCCGCCGGTCGCGGTACTGGCGATCGCGGTACCCGTCATGGGTTTTGGTCAGCAACCGGCCATTATCGCCTTGATCCTGTATGGGGTGCTGCCCATCCTGCAGGCGACCCTGGCCGGGCTGGGCGCGGTGCCTGCCAGCGTGATGAGCGTTGCCAGCGGTATGGGAATGAGCCGTCGCCAACAGTTGTATCAGGTTGAGCTGCCGCTGGCCGCGCCGGTGATTCTGGCGGGCATCCGAACCTCGGTGATTATCAATATTGGTACGGCGACCATCGCTTCAACGGTGGGGGCCAGTACGTTAGGCACGCCGATCATTATCGGGCTTAGCGGCTTTAATACGGCCTATGTTATCCAGGGGGCGCTGCTGGTGGCGCTGGCGGCGATCATTATCGATCGCCTGTTTGAAAGGCTGACGCGCGCGCTTACCCGGCACGCAAAATAAAACTGTAACCTGCCAGCATCACGCCGCCGATACCGCCAATAGCCATCAGCAGGAAAAGGGCGATCACCCCGATTTTCGCTACGCGCATTATGTACTCCTTATGTTAATAAAAGGAGTATACATTAAAGCGAATTTGTTAGCTGCTGTTTAAACGCCAAGGGGATGAATGTCGCGTCCTTGGGCGCGCCAGGCCAAAAGTTGCTGCTGCTGCGCCAGCGTCTGGTTTTCTCCGCACCATACCAGTAACGTCTTGCCGTCAAACAGTTCCGGACGGAACTGGCTAAGCGAGTGCGCCAGCACGTCGATTCGCCATCCCTGTTGGCTGGCGACCCAACCTTCCAGCCACAGGCGGGTGGTATCATGGATATTCCAGCCGATCACCAACGCATCTTTTCCCTGTTTCTTACGCGCAGACGCCAGGCAGAGCGCAATATAGTTGATTAGGATACCGTCAAGAATGCCGAGCAGCGCCTGAAGGGCGGGTTGTTGGCACTGTAACCGTCGTCGCAGCGGGACGAACAGGTTAGTGGTCAATGTTTGGGCTGGATAATCCTGACCGCGTTCTTTGACCCATAACCGTAAACTGTGCAGATTACTGCTTTGCAGGTAGTGCAGCAGGATCTCCTGCTGTTCGCGCCAGCCGTTAGGTTGTTCGCTACTGTCGCTACTGAGCAGCACTTTGACTTTGCTGACCTGAACGCCGTTATCTATCCAGCGCTTGATTTCGCGGATTCTGTCGATATCGGCATCGTTAAACAGACGATGACCGCCATCCGTTCGCTGTGGTTTTAAAAGTCCATAACGTCTCTGCCACGCGCGCAACGTGACAGGATTGATATCACAAAGCAAGCCACTTCACCAATTGTGTAAAGCGCCATCGTTTCACCCTTGCTCGCGAGGTCCCAGTTTAACTTTAGACGCTCTTTTAGGAACCAGGAAGTTTTGCCTGTTTTTTATGCATTAAAACGCGAAGTAGCGGGTTGCGGCGCGGCGTTTAAGTGATCGTATTCACGAATTCATATTTTTATGCAACAGTTCAAAGAAAGTTAATCGTACTCAATGTATGTTACGCGCTTTTAATTGAAGTGTGGTTTGCGGGTATGTACGAGTTTAATCTGGTGTTGCTGCTGCTTCAGCAGATGTGCGTGTTTCTGGTCATTGCGTGGCTAATGAGTAAAACGCGCCTGTTCATCCCGCTTATGCAGGTCACGGTTCGTCTGCCGCACAAGCTTCTGTGTTACGTCACGTTTTCTATCTTCTGCATTATGGCACTTATTTTGGGCTGCATATCGAAGATTCGATTGCCAATACCCGCGCGATTGGCGCGGTGATGGGTGGCCTACTCGGCGGGCCGGTCGTCGGCGGGCTGGTCGGTCTGACCGGTGGGTTACATCGGTATTCTATGGGCGGCATGACGGCGCTGAGCTGTATGATTTCCACCATCGTCGAAGGGCTGCTGGGCGGGTTGGTACACAGCGTTCTCATACGTCGCGGACGCCCGGACAAAGTGTTTAGCCCGCTGACGGCGGGAGCAATTACGTGTGTTGCCGAACTGGTGCAGATGCTGATCATTTTACTGATAGCCAGGCCGTTTGACGATGCCCTGCATCTGGTCAGTAATATTGCCGCGCCGATGATGGTGACGAATACCGTTGGCGCCGCGCTGTTTATGCGTATTTTGCTCGATAAGCGCGCCATGTTCGAAAAATATACCTCGGCATTTTCTGCTACCGCGCTGAAGGTCGCCGCGTCAACGGAGGGGATTCTGCGCCAGGGATTTAACGAAGTGAACAGTATGAAGGTGGCGCAGGTGTTATATCAGGAGCTGGATATCGGCGCCGTCGCCATCACCGATCGTGAAAAACTGCTGGCTTTTACTGGTATTGGCGACGATCACCATCTACCGGGCAAACCCATTTCATCAGGTTATACGCTGAAAGCAATTGAAACCGGAGAGGTGGTTTATGCCGATGGCAACGAAGTGCCGTATCGTTGTTCGCTCCACCCGCAGTGTAAACTCGGTTCGACGCTGGTGATCCCGCTGCGTGGCGAAAATCAGCGAGTCATGGGCACCATTAAACTGTACGAAGCGAAAAACCGACTGTTCAGCTCAATTAATCGCACCCTGGGAGAGGGGATTGCGCAGCTTTTATCCGCGCAGATCCTGGCCGGGCAGTATGAACGGCAGAAGGCGTTGCTGACGCAGTCAGAGATCAAGCTGTTGCACGCGCAGGTGAATCCGCATTTTCTGTTTAACGCGCTCAATACCATTAAAGCGGTGATTCGTCGCGACAGCGAACAGGCCAGCCAACTGGTGCAGTACTTGTCGACCTTTTTTCGCAAAAATTTAAAACGCCCGTCGGAAATCGTCACGCTGGCGGATGAGATTGAACACGTGAACGCTTATCTGCAAATTGAAAAAGCGCGTTTTCAGTCGCGTCTGCAGGTACAGCTTGATGTTCCATCGACGCTTTCACGTCAGAAATTGCCTGCGTTTACATTACAGCCGATTGTTGAGAACGCCATTAAACATGGCACGTCGCAACTGCTTGATACCGGCAACGTCGCTATTCGCGCCCGGCGCGAAGGGCAGCATTTGATGTTAGATATTGAAGATAATGCGGGACTGTATCAGCCTTCCGCCGGCAGTAGCGGGCTGGGGATGAGTCTGGTTGATAAACGTCTGCGCGAACACTTTGGCGATGATTATGGTATTAGCGTGGCCTGCGAGCCGGACTGTTTTACCCGAATTACATTACGACTTCCACTGGAGGAGGACGCATGATTAAAGTGCTGATTGTGGATGATGAGCCGTTAGCGCGGGAAAATTTGCGGATTTTGCTCCAGGGGCAGGATGACATTGAGATTGTGGGAGAGTGCGCGAACGCGGTAGAAGCGATTGGCGCGGTACATAAGCTGCGACCTGATGTGCTGTTTCTGGATATTCAGATGCCGCGTATCAGTGGACTGGAGATGGTAGGAATGCTTGACCCGGAACACCGCCCGTATATCGTTTTTTTAACCGCGTTTGACGAATACGCCATCAAAGCCTTTGAAGAACACGCTTTTGATTATCTGCTCAAGCCGATAGAGGAGAAACGGCTGGAGAAAACGTTACATCGTCTGCGTCAGGAGCGCAGTAAACAGGATGTTTCGTTGTTGCCGGAAAACCAGCAGGCGCTTAAATTCATTCCCTGTACCGGACACAGCCGGATCTATTTGTTGCAAATGGATGATGTCGCCTTTGTCAGTAGCCGTATGAGCGGCGTTTATGTGACCAGCAGTGAAGGTAAAGAGGGGTTTACCGAGCTGACGCTGCGCACGCTGGAAAGCCGGACGCCGCTACTGCGTTGTCATCGTCAGTTTCTGGTGAATATGGCCCATTTGCAGGAAATTCGGCTGGAGGATAATGGGCAGGCAGAGCTGATTTTACGCAACGGCCTGACGGTGCCGGTAAGCCGTCGCTATCTGAAAAGTTTAAAAGAGGCGATTGGCCTGTAAAAGACTGTTAGAATATCGTTTTGCCATAGAAACGACCGAAGGCCTCATGCTGAGTAACGATATTCTGCGTAGCGTGCGCTACATTTTAAAAGCTAATAATACCGATCTGGCGCGTATCCTGGCGCTGGGTAACGTTGATGCTACGCCGGAGCAGATAGCAATCTGGTTGCGCAAAGAAGAGGAAGAGGGGTTTCAGCGTTGCCCGGATATCGTGTTGTCCTCATTTCTCAATGGCCTCATTTATGAAAAACGCGGCAAAGATGAGGCGGCGCCTGCATTGACGGCGGAACGTCGTATCAACAACAATATTGTGCTGAAAAAGCTGCGTATTGCCTTTTCGCTAAAAACAGATGATATCCTGGCGATACTTACCGGTCAGTTGTTTCGTGTCTCAATGCCAGAGATCACCGCGATGATGCGCGCGCCGGACCATAAGAACTTCCGCGAATGCGGCGATCAGTTTATGCGTTATTTTCTGCGCGGTCTGGCGGCCCGTGAACACGCGGCGAAGTAATTCTGCGGTATTGTTCCCGGCAGCGTCCTGTCTGACCGGGAAAACGCATTATTATACTAATTGATTCTATGATACCCGCTCTCTTCCAACAGTTTCTGCGAGCGAATCATTGACAGATAGTACGCGGAACAGTTGTCAATTGATGATCCTGGCAATTTACAGAGGTCGCTTATTTTTGCCTGGGTAAAATCAATATCCACATATTCCGTAGCATAGCTATCATAATAGTCGATTCGTTCAGTCAAACCCGGCATACCCTGATAAGCTTCGCCGACTTGACTCAGCATTTTTTGTGCTTCTTCTTTATTATTGGCTTTCAGGGTTTTATAAAGTATTTTATGTTCAGATATTTGACGTAAAACGATATCCCCTTTGTAGTAATAGGTTAACTTGATTTCAATACCGTTGAGATTACCTACATAGCGTTGTGTTTCCTTTGACTCTTTGTTAGCGGCTATCTTCTTGATAAACGCTGTCATGTTATTTTGCTTTTCCTGGGGAGTATCGTTTTTCTGATCGCAGCCAGTTATGCTAACCGATAGAGAGAGCGCGAATAGTGGCAGTGCCATAAGGCGTAGAACCCGCATAACAATTCCTTGTCGTTAAGTATTGGTGTGGCCAGGAATTCAGGGATTATAGGCTTTGGCGAGGGGACTTACAGCGAGGCTGTCTTTTTTCGGAATTCATAAAGAAAAGACGCTGCCGAAGCAGCGCCCTGAGCGACTTTACCAGTCGATGCAATACATTATGCCTGCCAGTTATTTCGCTTCTTTAAAACCAGCAGCTTCCAGCAGCGTCTGGGTTTGTTTCATGCTGATACCTTTGCTGGTATCGCCGGACACCATCGTCCCTGAGATTTGCTGTAACGCTTTAAAGTCCACTTTTTCCATATCCACAGAGACGTTTTCCTGGGCATAGGTATCTTCATAGGTTAATTTTTCTTCCACTCCGGCGATATTTTTATATTTCGCGCTCAGCGGATCGAGAATTTTGGCGGCATCTTCTTTCGTTTTAGCGCCTACAGTGGCATAGCTGATTTTACTTTCAGACGTCTGCTTAATGATTTTGTCACCTTTATAGGTGTAAGTAATTGAAATTTCTGTCCCCGCCAGGTTTGCGTTAAAGGTCTTTGATTCTTCTTTATCGCCACAGCCAGCAAGAGAGAACACCAGTACGGAAGCCAAAGCCGCGGACAATAACTTGCCAGAAATTTTCATCTAAAACTCCATTTTATATAATGATTGGGTTTTTAAAATAATTTCAATGAATTAATTTAACCCAGTAATAGCAATGTATCAGGGAGAGATAGAATATGACTTTTAGCCGTTATTTAGCAGTCCGGATATGGAGTCTTAACGCTATTGCTTATTAAGGAAAAAGTTAAAACACGCGGATGGGGTGATATGCCAGTCAGGATTAAGCGGTTAAAAAAGCCGGAGCATGCTCCGGCTTGTTGCTTATTTCACCTGTTGGCCAGGCTTCGCGCCGTCATCAGGGCTTAACAGGAAGATATCTTTCCCGCCAGGTCCTGCGGCCATCACCATTCCCTCGGAGACGCCAAAGCGCATTTTGCGCGGCGCGAGGTTGGCGACCATTACCGTCTGGCGGCCAATCAGCGCCTGCGGGTCCGGGTAGGCGGAACGAATGCCGGAGAAGACGTTACGCTTCTCGCCGCCCAGATCCAGCGTCAGGCGCAGCAATTTGTCAGAACCGTCTACGAACTCAGCATTTTCAATCAATGCTACGCGCAGGTCAATTTTGGCGAAATCGTCAAAGGTGATGGTTTCCTGAATCGGGAAGTCGGCTAACGGGCCGGTAACCGGCGCAGCTGCGGCTTTCACCTCTTCTTTAGACGCTTCAACCAGCGCTTCAACTTGCTTCATGTCGATGCGATTGTAGAGCGCCTTAAAGGTGTTGACCTTGTGACCGAGCAGCGGCTGTTCGATGGCATCCCAGTTCAACTCACTGTTCAGGAAGGCTTCAACGCGTTCAGAAAGCGTCGGCAGTACCGGTTTCAGATACGTCATCAGCACGCGGAACAGGTTGATGCCCATTGAGCAAATGGCCTGCAGGTCCGCGTCGCGGCCTTCTTGTTTAGCCACCACCCACGGCGCTTGCTCGTCAACATAACGGTTAGCGATGTCGGCCAGCGCCATAATCTCACGGATAGCTTTACCGAATTCACGGCTTTCCCATGCTTCGCCAATCACCGCAGCGGCGTCAGTAAAGGTTTTATACAATTGCGGATCGGCCAGTTCAGCCGCCAGCACGCCGTCGAAACGCTTATTGATAAAACCGGCGTTACGCGAGGCCAGGTTGACTACTTTATTGACGATATCGGCATTGACGCGCTGGACAAAGTCTTCCAGGTTCAGGTCGATGTCATCAATGCGTGAAGAAAGCTTCGCGGTGTAGTAGTAGCGCAGGCTGTCGGCGTCAAAGTGTTTCAGCCAGGTGCTGGCCTTAATAAAGGTGCCGCGAGACTTAGACATCTTCGCGCCGTTCACCGTCACGTAACCGTGAACGAACAGGTTGGTCGGCTTACGGAAGTGGCTGCCTTCCAGCATGGCAGGCCAGAACAGGCTGTGGAAATAGACGATGTCTTTGCCGATAAAGTGGTACAGCTCGGCATCGGAGTCTTTTTTCCAGTACTCATCAAAACTGGTCGTGTCACCGCGCTTATCGCACAGATTTTTGAAGGAGCCCATATAGCCAATCGGCGCGTCCAGCCAGACGTAGAAATATTTGCCCGGCGCGTTCGGGATTTCGAAACCAAAATACGGCGCATCGCGGGAAATGTCCCACTGCTGGAGGCCGGATTCAAACCACTCCTGCATTTTGTTCGCCACCTGCTCCTGCAGCGCGCCGCTGCGGGTCCACGCCTGCAGCATTTCGCTGAATGACGGCAGATCAAAGAAAAAGTGCTCGGAGTCACGCATTACCGGCGTCGCGCCGGACACCACGGATTTCGGTTCGATAAGTTCGGTCGGGCTGTAGGTCGCGCCGCACACTTCACAGTTATCGCCGTACTGGTCTGCGGATTTACATTTCGGGCAGGTGCCTTTCACAAATCGGTCCGGCAGGAACATGCCTTTTTCCGGATCGTAGAGTTGAGAGATGGTGCGGTTCTTAATAAAACCGTTCTCTTTCAGGCGCGTATAAATCAGCTCGGACAGCTCGCGATTCTCGTCGCTGTGCGTTGAGTGGTAGTTGTCGTAGCTAATATTAAAACCGGCGAAATCGGTCTGGTGCTCCTGGCTCATTTCACCGATCATTTGCTCCGGCGTAATACCAAGCTGCTGCGCTTTCAGCATGATCGGCGTGCCATGAGCGTCATCGGCACAGATGAAGTTAACCTCATGGCCGCGCATTCGCTGGTAACGGACCCAGACATCAGCCTGGATGTGCTCCAGCATATGGCCGAGGTGGATAGAGCCGTTGGCGTACGGCAGCGCGCACGTTACCAGAATTTTCTTCGCGACTTGAGTCAT
Protein sequences of DBSCAN-SWA_5 >LR134233|1659520:1668689|1661163_1661271_-|VEC91681.1|DBSCAN-SWA MRVAKIGVIALFLLMAIGGIGGVMLAGYSFILRAG >LR134233|1659520:1668689|1664730_1665198_+|VEC91685.1|DBSCAN-SWA MLSNDILRSVRYILKANNTDLARILALGNVDATPEQIAIWLRKEEEEGFQRCPDIVLSSFLNGLIYEKRGKDEAAPALTAERRINNNIVLKKLRIAFSLKTDDILAILTGQLFRVSMPEITAMMRAPDHKNFRECGDQFMRYFLRGLAAREHAAK >LR134233|1659520:1668689|1665254_1665785_-|VEC91686.1|DBSCAN-SWA MRVLRLMALPLFALSLSVSITGCDQKNDTPQEKQNNMTAFIKKIAANKESKETQRYVGNLNGIEIKLTYYYKGDIVLRQISEHKILYKTLKANNKEEAQKMLSQVGEAYQGMPGLTERIDYYDSYATEYVDIDFTQAKISDLCKLPGSSIDNCSAYYLSMIRSQKLLEESGYHRIN >LR134233|1659520:1668689|1661330_1662032_-|VEC91682.1|DBSCAN-SWA MLCDINPVTLRAWQRRYGLLKPQRTDGGHRLFNDADIDRIREIKRWIDNGVQVSKVKVLLSSDSSEQPNGWREQQEILLHYLQSSNLHSLRLWVKERGQDYPAQTLTTNLFVPLRRRLQCQQPALQALLGILDGILINYIALCLASARKKQGKDALVIGWNIHDTTRLWLEGWVASQQGWRIDVLAHSLSQFRPELFDGKTLLVWCGENQTLAQQQQLLAWRAQGRDIHPLGV >LR134233|1659520:1668689|1663964_1664684_+|VEC91684.1|DBSCAN-SWA MIKVLIVDDEPLARENLRILLQGQDDIEIVGECANAVEAIGAVHKLRPDVLFLDIQMPRISGLEMVGMLDPEHRPYIVFLTAFDEYAIKAFEEHAFDYLLKPIEEKRLEKTLHRLRQERSKQDVSLLPENQQALKFIPCTGHSRIYLLQMDDVAFVSSRMSGVYVTSSEGKEGFTELTLRTLESRTPLLRCHRQFLVNMAHLQEIRLEDNGQAELILRNGLTVPVSRRYLKSLKEAIGL >LR134233|1659520:1668689|1662417_1663968_+|VEC91683.1|DBSCAN-SWA MLRHVFYLLHYGTYFGLHIEDSIANTRAIGAVMGGLLGGPVVGGLVGLTGGLHRYSMGGMTALSCMISTIVEGLLGGLVHSVLIRRGRPDKVFSPLTAGAITCVAELVQMLIILLIARPFDDALHLVSNIAAPMMVTNTVGAALFMRILLDKRAMFEKYTSAFSATALKVAASTEGILRQGFNEVNSMKVAQVLYQELDIGAVAITDREKLLAFTGIGDDHHLPGKPISSGYTLKAIETGEVVYADGNEVPYRCSLHPQCKLGSTLVIPLRGENQRVMGTIKLYEAKNRLFSSINRTLGEGIAQLLSAQILAGQYERQKALLTQSEIKLLHAQVNPHFLFNALNTIKAVIRRDSEQASQLVQYLSTFFRKNLKRPSEIVTLADEIEHVNAYLQIEKARFQSRLQVQLDVPSTLSRQKLPAFTLQPIVENAIKHGTSQLLDTGNVAIRARREGQHLMLDIEDNAGLYQPSAGSSGLGMSLVDKRLREHFGDDYGISVACEPDCFTRITLRLPLEEDA >LR134233|1659520:1668689|1665956_1666415_-|VEC91687.1|DBSCAN-SWA MKISGKLLSAALASVLVFSLAGCGDKEESKTFNANLAGTEISITYTYKGDKIIKQTSESKISYATVGAKTKEDAAKILDPLSAKYKNIAGVEEKLTYEDTYAQENVSVDMEKVDFKALQQISGTMVSGDTSKGISMKQTQTLLEAAGFKEAK >LR134233|1659520:1668689|1660451_1661183_+|VEC91680.1|DBSCAN-SWA MKRLCDPLLWLIVLFLLLLFGLPYSQPFFAALFPDLPRPVYQQESFAALALAHFWLVGISSLFAVVVGVGAGIAVTRESGKEFRPLVETIAAVGQTFPPVAVLAIAVPVMGFGQQPAIIALILYGVLPILQATLAGLGAVPASVMSVASGMGMSRRQQLYQVELPLAAPVILAGIRTSVIINIGTATIASTVGASTLGTPIIIGLSGFNTAYVIQGALLVALAAIIIDRLFERLTRALTRHAK >LR134233|1659520:1668689|1666655_1668689_-|VEC91688.1|tRNA|DBSCAN-SWA MTQVAKKILVTCALPYANGSIHLGHMLEHIQADVWVRYQRMRGHEVNFICADDAHGTPIMLKAQQLGITPEQMIGEMSQEHQTDFAGFNISYDNYHSTHSDENRELSELIYTRLKENGFIKNRTISQLYDPEKGMFLPDRFVKGTCPKCKSADQYGDNCEVCGATYSPTELIEPKSVVSGATPVMRDSEHFFFDLPSFSEMLQAWTRSGALQEQVANKMQEWFESGLQQWDISRDAPYFGFEIPNAPGKYFYVWLDAPIGYMGSFKNLCDKRGDTTSFDEYWKKDSDAELYHFIGKDIVYFHSLFWPAMLEGSHFRKPTNLFVHGYVTVNGAKMSKSRGTFIKASTWLKHFDADSLRYYYTAKLSSRIDDIDLNLEDFVQRVNADIVNKVVNLASRNAGFINKRFDGVLAAELADPQLYKTFTDAAAVIGEAWESREFGKAIREIMALADIANRYVDEQAPWVVAKQEGRDADLQAICSMGINLFRVLMTYLKPVLPTLSERVEAFLNSELNWDAIEQPLLGHKVNTFKALYNRIDMKQVEALVEASKEEVKAAAAPVTGPLADFPIQETITFDDFAKIDLRVALIENAEFVDGSDKLLRLTLDLGGEKRNVFSGIRSAYPDPQALIGRQTVMVANLAPRKMRFGVSEGMVMAAGPGGKDIFLLSPDDGAKPGQQVK >LR134233|1659520:1668689|1659520_1660468_+|VEC91679.1|DBSCAN-SWA MIEFNHVSKTFGDQQAVSDLNLHFSEGSFSVLIGTSGSGKSTTLKMINRLVEHDSGTIRFAGEEIRSLPVLELRRRMGYAIQSIGLFPHWTVAQNIATVPQLQKWSRARINDRIDELMALLGLESALRDRYPHQLSGGQQQRVGVARALAADPQVLLMDEPFGALDPVTRGALQQEMTRIHQLLGRTIVLVTHDIDEALRLADHLVLMDGGNVIQQGSPLSMLTSPENDFVQAFFGRSELGVRLLSLRSVGDYVRRHEQLSGDALVEEMTLRDALSMFVARRCDVLPVANQQGEPCGTLHFRDLLSETSPRETTV |
10 | Enterobacteria_phage(66.67%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
1735591 : 1746096
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >LR134233|1735591:1746096|DBSCAN-SWA TATGCCCGCGACTAAATTCTCCCGACGTACCCTCCTGACGGCAGGTTCTGCGCTTGCTGTTCTTCCTTTTCTGCGCGCCTTGCCGGTACAGGCGCGTGAACCTCGCGAGACCGTCGATATTAAGGATTATCCGGCGGATGACGGTATCGCCTCGTTCAAACAGGCCTTCGCCGACGGACAGACCGTGGTCGTACCGCCAGGATGGGTGTGTGAAAATATCAATGCGGCGATAACGATTCCGGCGGGAAAAACGCTGCGGGTACAGGGCGCGGTGCGTGGGAATGGCCGGGGACGGTTTATTTTGCAGGACGGGTGTCAGGTGGTGGGGGAGCAGGGCGGCAGTCTGCACAATGTGACGCTGGATGTTCGCGGGTCGGACTGTGTGATTAAAGGCGTGGCGATGAGCGGCTTTGGCCCCGTCGCGCAAATTTTCATCGGTGGTAAGGAACCGCAGGTGATGCGTAATCTCATTATCGATGACATCACCGTTACCCACGCCAACTACGCCATTCTCCGCCAGGGATTTCATAACCAAATGGACGGCGCGCGGATTACGCATAGCCGCTTTAGCGATTTGCAGGGGGACGCCATTGAGTGGAATGTCGCGATTCACGACCGCGACATCCTGATTTCCGATCATGTCATCGAACGCATTGATTGTACCAATGGCAAAATCAACTGGGGGATCGGCATCGGGCTGGCGGGTAGCACCTATGACAACAGTTATCCTGAAGACCAGGCAGTAAAAAACTTTGTGGTGGCCAATATTACCGGATCTGATTGCCGACAGCTTGTGCACGTAGAAAATGGCAAACATTTCGTCATTCGCAATGTCAAAGCCAAAAACATCACGCCCGATTTCAGTAAAAATGCGGGTATTGATAACGCAACGATCGCAATTTATGGCTGTGATAATTTCGTCATTGATAATATTGATATGACGAATAGTGCCGGGATGCTCATCGGCTATGGCGTCGTTAAAGGAAAATACCTGTCAATTCCGCAAAACTTTAAATTAAACGCTATTCGGTTGGATAATCGCCAGGTTGCTTATAAATTACGCGGCATTCAAATTTCCTCCGGCAACACCCCCTCTTTTGTCGCCATCACCAATGTACGGATGACGCGTGCTACGCTGGAACTGCATAATCAACCGCAGCACCTCTTTCTGCGTAATATCAACGTGATGCAAACTTCAGCGATTGGCCCGGCGTTAAAAATGCATTTCGATTTGCGTAAAGATGTCCGTGGTCAATTTATGGCCCGCCAGGACACGCTGCTTTCCCTCGCTAATGTTCATGCCATCAATGAAAACGGGCAGAGTTCCGTGGATATCGACAGGATTAATCACCAAACCGTGAATGTCGAAGCAGTGAATTTTTCGCTGCCGAAGCGGGGAGGGTAGGTACCGCTATTTTTACGAAAATTCCTGGGAAAAAGTTGTTCATACTTAATGTTATGGTGCCGACTAAGACGTAATGTAAAGCGTGCCATCATTATCCCTGGCAGCAGAGTAATTCATGCTGGCGAAAACAAGCTAAAGAGCTATAATTCAGCAACCATTTTACAGGTGGAAGAAACAATGATGAATTTGAAAGCAGTTATACCGGTAGCGGGTTTGGGTATGCATATGTTGCCTGCCACCAAGGCAATCCCAAAAGAGATGCTACCGATCGTCGACAAGCCAATGATTCAGTACATTGTCGATGAGATTGTGGCTGCAGGGATCAAAGAAATCGTACTGGTGACTCACGCGTCTAAAAACGCCGTTGAGAACCACTTCGACACCTCTTATGAACTTGAATCACTTCTTGAACAGCGCGTTAAGCGTCAGCTTTTGGCGGAAGTGCAATCTATCTGCCCACCGGGCGTAACGATTATGAACGTTCGCCAGGCGCAGCCGTTAGGGCTGGGGCACTCTATTCTGTGCGCGCGTCCGGTCGTGGGCGATAACCCTTTCATTGTGGTACTCCCGGATATTATTATCGATGATGCTACCGCCGATCCGCTGCGCTATAACCTTGCGGCGATGGTGGCGCGTTTCAATGAAACGGGTCGCAGCCAGGTGCTGGCGAAGCGCATGAAAGGTGATTTATCGGAGTATTCCGTTATCCAGACGAAAGAACCTCTGGATAATGAAGGCAAAGTCAGCCGGATTGTGGAGTTTATCGAAAAACCGGATCAGCCGCAGACGCTGGATTCCGATTTGATGGCGGTAGGCCGTTATGTGCTTTCAGCCGACATCTGGGCGGAACTGGAAAGAACCGAACCGGGCGCCTGGGGCCGCATCCAGCTCACCGATGCCATTGCTGAACTGGCGAAAAAACAGTCGGTTGACGCTATGCTAATGACGGGTGACAGCTATGACTGCGGTAAAAAAATGGGCTACATGCAGGCATTTGTGAAGTACGGGCTGCGCAACCTGAAAGAAGGAGCGAAGTTCCGTAAGAGCATAGAGCAGCTTCTGCATAAATAAGTATTAACAACCGTGATAAATGGTTGGTGATAAACATAATAACGGCAGTGAACATTCGAAGCGGCAAGTTGGCTGAAACGAGTGTTGACTGCCGTTTTAGTTTTGTATAAAGGGCTTAAGTAACAAGGGGTTATCTGGAGCATTTTAATGCTGATTTTATAAGATTAATCCTTGTTTCCGGATGCAATTAATAAGACAATTAGCGTTTAAGTTTTAGTGAGCTTTGCCCTGCTGGGCGAGGTTTGTAACAAGTCGATATGTACGCAGTGCACTGGTAGCTGATGAGCCAGGGGCGGTAGCGTGTGTAACGACTTGAGCAATTAATTTTTATTGGCAAATTAAATACCACATTAAATACGCCTTATGGAATAGAAAAGTGAAGATACTTATTACTGGCGGGGCAGGTTTTATTGGATCAGCTGTTGTCCGCCATATTATTAAGAATACACAGGACACTGTAGTTAATATTGATAAATTAACCTACGCCGGTAATCTTGAATCCCTTTCTGATATTTCTGAAAGTAATCGCTACAATTTTGAACACGCGGATATTTGTGATTCCGCTGAAATAACGCGTATTTTTGAGCAGTACCAGCCGGACGCGGTGATGCATTTGGCTGCGGAAAGTCATGTGGACCGTTCGATTACCGGGCCGGCAGCATTTATTGAAACCAATATCGTCGGCACCTATGTACTTCTTGAAGTTGCGCGTAAATACTGGTCTGCGCTTGGCGAAGATAAAAAAAATAATTTTCGTTTTCATCATATTTCCACTGATGAAGTTTACGGCGATTTACCGCATCCTGATGAAGTTGAAAACAGCGTTACGCTGCCGTTATTTACTGAAACGACGGCATATGCGCCAAGTAGCCCCTATTCTGCGTCAAAAGCATCCAGCGATCATTTAGTCCGTGCCTGGCGGCGTACCTATGGTCTACCAACGATCGTTACCAATTGTTCTAATAACTATGGCCCTTATCACTTCCCTGAAAAACTGATTCCGTTGGTCATTTTGAACGCACTGGAAGGAAAGCCTTTGCCAATTTATGGCAAAGGGGATCAGATTCGCGATTGGCTATATGTAGAAGATCACGCTCGCGCGCTTCATATGGTAGTGACTGAAGGCAAGGCGGGGGAGACTTATAACATTGGTGGCCACAATGAGAAGAAAAATCTCGATGTGGTATTTACCATCTGTGATCTGCTGGACGAGATTGTACCCAAAGCGACTTCTTATCGTGAACAAATCACTTATGTCGCGGATCGTCCGGGCCATGATCGTCGTTATGCCATTGATGCAGGTAAAATTAGCCGTGAATTAGGCTGGAAACCGCTGGAGACCTTTGAAAGCGGTATTCGTAAAACAGTGGAATGGTACCTTGCAAATACTCAATGGGTAAACAATGTTAAAAGTGGGGCGTATCAGAGTTGGATAGAACAGAACTATGAAGGACGCCAGTAATGAATATCTTACTTTTTGGTAAGACAGGGCAAGTAGGCTGGGAGTTGCAACGTTCTCTGGCACCAGTAGGGAATCTGATTGCCCTGGATGTCCATTCAAAAGAGTTTTGCGGTGATTTTAGTAATCCGAAAGGCGTTGCCGAAACCGTTCGTAAGCTTCGTCCCGATGTGATTGTTAACGCAGCAGCACATACTGCAGTAGATAAAGCAGAGTCTGAACCAGAACTGGCGCAGTTACTTAACGCCACCAGTGTGGAAGCCATCGCTAAAGCAGCCAACGAAACTGGCGCATGGGTAGTGCATTATTCAACCGATTATGTATTTCCTGGTACCGGCGATATCCCATGGCAGGAAACGGACGCTACGTCGCCGCTGAATGTCTATGGCAAGACCAAACTGGCGGGAGAAAAGGCCCTGCAGGATAACTGCCCTAAGCATCTTATCTTCCGCACCAGTTGGGTTTATGCAGGTAAGGGCAATAATTTCGCAAAGACAATGCTTCGTCTGGCGAAAGAGCGTCAGACACTTTCAGTCATCAACGATCAGTACGGTGCGCCAACCGGTGCAGAATTACTGGCTGACTGCACGGCTCATGCGATCCGTGTGGCGTTAAAGAAACCAGAAGTCGCAGGTCTTTACCATCTGGTTGCCGGGGGAACCACAACCTGGCATGACTACGCGGCCTTAGTCTTTGACGAGGCGCGCAAAGCAGGGATAACGCTTGCGCTGACTGAGCTTAATGCTGTGCCGACCAGCGCCTACCCGACGCCGGCGAGCAGACCAGGCAATTCGCGTCTCAATACTGAAAAGTTTCAGCGTAATTTTGACCTTATTCTGCCGCAATGGGAATTAGGAGTTAAGCGCATGCTGACTGAAATGTTTACGACGACAACCATCTGATAAATTTAAATGCCCATCAGGGCATTTTCTATGAATGAGAAATGGAAATGAAAATGCGTAAGGGCATTATTTTAGCGGGGGGCTCCGGCACCCGTCTTTATCCGGTGACCATGGCGGTAAGTAAGCAATTGCTACCAATTTATGATAAACCGATGATTTACTATCCCCTTTCCACGCTTATGCTGGCAGGCATTCGGGACATCCTGATCATCAGTACGCCACAGGACACGCCGCGTTTTCAACAACTGCTGGGAGACGGCAGCCAGTGGGGGCTGAATCTTCAATATAAAGTACAGCCAAGCCCGGATGGCTTAGCACAGGCGTTTATTATTGGTGAAGAGTTCATTGGTAATGATGATTGTGCATTAGTACTGGGTGACAATATCTTCTATGGTCATGATTTACCAAAGTTAATGGAAGCTGCCGTTAATAAAGAAAGTGGTGCTACCGTCTTTGCTTATCATGTAAACGATCCGGAGCGCTACGGTGTGGTTGAGTTTGACCAAAGTGGCACAGCCGTTAGTCTGGAGGAAAAACCGTTACAACCGAAGAGTAATTACGCGGTAACGGGGCTGTATTTTTATGATAATAGCGTGGTGGAGATGGCGAAAAATCTTAAGCCTTCCGCTCGCGGTGAGTTAGAAATCACGGATATTAACCGTATCTATATGGATCAGGGAAGATTGTCTGTCGCCATGATGGGGCGCGGTTATGCCTGGCTGGATACAGGGACGCATCAGAGTTTGATAGAGGCCAGTAATTTTATTGCAACCATCGAAGAACGCCAGGGGCTAAAAGTGTCCTGCCCGGAAGAGATCGCATTTCGTAAAAATTTTATAAATGCACAACAGGTTATAGAACTGGCCGGGCCATTATCAAAAAATGATTATGGCAAATATTTGCTGAAGATGGTGAAAGGTTTATAAGTGATGATTGTGATTAAAACAGCAATACCAGATGTTTTGATCTTAGAGCCTAAAGTTTTTGGCGATGAGAGGGGATTCTTTTTTGAAAGTTATAACCAGCAGACCTTTGAAGAGTTGATTGGACGTAAAGTTACATTTGTTCAAGATAATCATTCAAAATCCAAAAAGAACGTACTCAGAGGGCTACATTTTCAGAGAGGAGAAAATGCACAGGGGAAGTTAGTTCGTTGTGCTGTCGGTGAGGTTTTTGATGTTGCGGTCGATATCCGAAAAGAATCGCCTACTTTTGGTCAATGGGTTGGCGTAAATCTATCTGCTGAGAATAAGCGACAGCTTTGGATTCCAGAAGGTTTTGCTCATGGTTTTGTTACTCTTAGTGAGTATGCAGAGTTTCTGTACAAAGCAACTAATTATTACTCACCTTCATCGGAAGGTAGCATTTTATGGAATGATGAGACAATAGGTATTGAATGGCCTTTTTCTCAGCTGCCTGAGCTTTCAGCAAAAGATGCTGCAGCACCTTTACTGCATCAAGCCTTGTTAACAGAGTAAGCATCGTGTCTCATATTATTAAGATTTTTCCATCAAATATTGAATTTTCCGGTAGAGAGGATGAATCAATCCTCGATGCTGCGCTATCGGCTGGCATCCATCTTGAACATAGCTGCAAAGCGGGTGATTGTGGTATCTGTGAGTCCGATTTGTTGGCGGGAGAAGTTGTTGACTCCAAAGGTAATATTTTTGGACAGGGTGATAAAATACTAACCTGCTGCTGTAAACCTAAAACCGCCCTTGAGCTAAATGCGCATTTTTTTCCTGAACTAGTTGGACAGACAAAAAAAATTGTCCCATGCAAGGTAAATAGTGCTGTACTGGTTTCAGGCGATGTTATGGCTTTGAAGTTACGCACACCACCAACAGCAAAAATTGGCTTCCTTCCAGGGCAGTATATCAATTTACATTATAAAGGTGTAACTCGCAGTTATTCTATCGCTAATAGTGATGAGTCGAATGGTATTGAGTTGCATGTAAGGAATGTTCCCAATGGTCAGATGAGCTCGCTCATTTTTGGGGAGTTACAAGAAAATACTCTTATGCGCATTGAAGGACCTTGCGGAACATTTTTTATTCGTGAAAGTGACAGACCTATAATCTTCCTTGCAGGCGGTACTGGATTCGCTCCAGTTAAATCAATGGTTGAGCATCTCATTCAGGGAAAATGTCGTCGTGAGATCTACATCTACTGGGGAATGCAAGATAGTAAAGATTTTTACTCTGCATTACCGCAGCAGTGGAGTGAACAGCACGACAACGTTCATTATATCCCTGTTGTTTCTGGTGATGACGCCGAATGGGGGGAAGAAAGGGATTTGTCCATCATGCCGTGATGGATGATTTTGATTCTCTAGAGTTCTTCGATATATATGCATGTGGTTCACCTGTGATGATCGATGCCAGTAAAAAGGACTTTATGATGAAAAATCTCTCTGTAGAACATTTCTATTCTGATGCATTTACCGCATCTAATAATATTGAGGATAATTTATGAAAGCGGTCATCCTGGCTGGTGGACTTGGTACCAGACTAAGTGAAGAAACAATTGTAAAACCAAAACCGATGGTAGAAATTGGTGGCAAGCCTATTCTTTGGCACATTATGAAAATGTATTCTGTGCATGGTATCAAGGATTTTATTATCTGCTGTGGTTATAAAGGATATGTGATTAAAGAATATTTTGCGAACTACTTCCTTCACATGTCAGATGTAACATTCCATATGGCTGAAAACCGTATGGAAGTTCACCATAAACGTGTTGAACCATGGAATGTCACATTGGTTGATACGGGTGATTCTTCAATGACTGGTGGTCGTCTGAAACGTGTTGCTGAATACGTAAAAGATGACGAGGCTTTCCTGTTTACTTATGGTGATGGCGTTGCCGACCTTGATATCAAAGCGACTATCGATTTCCATAAGGCTCACGGTAAGAAAGCGACTTTAACAGCTACTTTTCCACCAGGACGCTTTGGCGCATTAGATATCCGAGCTGGTCAGGTCCGGTCATTCCAGGAAAAACCGAAAGGCGATGGGGCAATGATCAATGGTGGTTTCTTTGTGTTGAATCCATCGGTTATCGATCTCATCGATAACGATGCAACAACCTGGGAACAAGAGCCATTAATGACATTGGCACAACAGGGGGAGTTAATGGCTTTTGAACACCCAGGTTTCTGGCAGCCGATGGATACCCTACGTGATAAAGTTTACCTCGAAGGGCTGTGGGAAAAAGGTAAAGCTCCGTGGAAAACCTGGGAGTAACTAGATGATTGATAAAAATTTTTGGCAAGGTAAACGTGTATTCGTTACCGGCCATACTGGCTTTAAAGGAAGCTGGCTTTCGCTATGGCTGACTGAAATGGGTGCAATTGTAAAAGGCTATGCACTTGATGCGCCAACTGTTCCAAGTTTATTTGAGATAGTGCGTCTTAATGATCTTATGGAATCTCATATTGGCGACATTCGTGATTTTGAAAAGCTGCGCAATTCTATTGCAGAATTTAAGCCAGAAATTGTTTTCCATATGGCAGCCCAGCCTTTAGTGCGCCTATCTTATGAACAGCCAATCGAAACATACTCAACAAATGTTATGGGTACTGTCCATTTGCTTGAAGCAGTTAAGCAAGTAGGTAACATCAAGGCAGTCGTAAATATCACCAGTGATAAGTGCTACGACAATCGTGAGTGGGTGTGGGGCTATCGTGAGAACGAACCCATGGGAGGGTACGATCCATACTCTAATAGTAAAGGTTGTGCAGAATTAGTCGCGTCTGCATTCCGGAACTCATTCTTCAATCCTGCAAATTATGAGCAACATGGCGTTGGTTTGGCGTCTGTGAGGGCTGGTAATGTCATAGGCGGAGGCGATTGGGCTAAAGACCGTTTAATTCCCGATATTCTGCGCTCATTTGAAAATAACCAGCAGGTTATTATTCGAAACCCATATTCTATCCGTCCCTGGCAGCATGTACTGGAGCCTCTTTCTGGTTACATTGTGGTGGCGCAACGCTTATATACAGAAGGTGCTAAGTTTTCTGAAGGATGGAATTTCGGCCCGCGTGATGAAGATGCGAAGACGGTCGAATTTATTGTTGACAAGATGGTCACGCTTTGGGGTGATGATGCAAGCTGGTTACTGGATGGTGAGAATCATCCTCATGAGGCACATTACCTGAAACTGGATTGCTCTAAAGCAAATATGCAATTAGGATGGCATCCGCGTTGGGGATTGACTGAAACACTTGGTCGCATCGTAAAATGGCATAAAGCATGGATTCGCGGCGAAGATATGTTGATTTGTTCAAAGCGTGAAATCAGCGACTATATGTCTGCAACTACTCGTTAAGAAAATAAGTTTAAGGAATCAAAGTAATGACAGCAAATAACCTGCGTGAGCAAATCTCTCAGCTTGTCGCTCAGTATGCGAATGAGGCATTGAGCCCGAAACCTTTTGTTGCAGGTACAAGCGTTGTGCCTCCTTCCGGGAAGGTTATTGGTGCCAAAGAGTTACAATTGATGGTTGAGGCGTCTCTTGATGGATGGCTAACTACTGGTCGTTTCAATGATGCCTTTGAGAAAAAACTTGGGGAATTTATTGGGGTTCCTCATGTTTTAACGACTACATCTGGCTCTTCGGCAAACTTGCTGGCACTGACTGCGCTGACTTCCCCAAAATTAGGCGAGCGTGCTCTCAAACCTGGTGATGAGGTTATTACTGTCGCTGCTGGCTTCCCGACTACAGTTAACCCGGCGATCCAGAATGGTTTAATACCGGTATTCGTGGATGTTGATATCCCGACATATAATATCGATGCTTCTCTTATTGAAGCTGCAGTTACTGAGAAATCAAAAGCGATAATGATCGCTCATACACTCGGTAATGCATTTAACCTGAGTGAAGTTCGTCGGATTGCCGATAAATATAACTTATGGTTGATTGAAGACTGCTGTGATGCCCTTGGGACGACTTATGAAGGCCAGATGGTAGGTACCTTTGGTGACATCGGAACCGTTAGTTTTTATCCGGCTCACCATATCACAATGGGTGAAGGCGGTGCTGTATTCACCAAGTCAGGTGAACTGAAGAAAATTATTGAGTCGTTCCGTGACTGGGGCCGGGATTGTTATTGTGCGCCAGGATGCGATAACACCTGCGGTAAACGTTTTGGTCAGCAATTGGGGTCACTTCCTCAAGGCTATGATCACAAATATACTTATTCCCACCTCGGATATAATCTCAAAATCACGGACATGCAGGCAGCATGTGGTCTGGCTCAGTTAGAACGCGTGGAAGAGTTTGTAGAGCAGCGTAAAGCTAACTTTTCCTATCTGAAACAGGGCTTGCAATCTTGCACTGAATTCCTCGAATTACCAGAAGCAACAGAGAAATCAGATCCATCCTGGTTTGGCTTCCCTATCACCCTGAAAGAAACTAGCGGTGTTAACCGTGTCGAACTGGTGAAATTCCTTGATGAAGCAAAAATCGGTACACGTTTACTGTTTGCTGGAAATCTGATTCGCCAACCGTATTTTGCTAATGTGAAATATCGTGTAGTGGGTGAGTTGACAAATACCGACCGTATAATGAATCAAACGTTCTGGATTGGTATTTATCCAGGCTTGACTACAGAGCATTTAGATTATGTAGTTAGCAAGTTTGAAGAGTTCTTTGGTTTGAATTTCTAA
Protein sequences of DBSCAN-SWA_6 >LR134233|1735591:1746096|1742898_1743672_+|VEC91759.1|DBSCAN-SWA MKAVILAGGLGTRLSEETIVKPKPMVEIGGKPILWHIMKMYSVHGIKDFIICCGYKGYVIKEYFANYFLHMSDVTFHMAENRMEVHHKRVEPWNVTLVDTGDSSMTGGRLKRVAEYVKDDEAFLFTYGDGVADLDIKATIDFHKAHGKKATLTATFPPGRFGALDIRAGQVRSFQEKPKGDGAMINGGFFVLNPSVIDLIDNDATTWEQEPLMTLAQQGELMAFEHPGFWQPMDTLRDKVYLEGLWEKGKAPWKTWE >LR134233|1735591:1746096|1741356_1741905_+|VEC91756.1|DBSCAN-SWA MIVIKTAIPDVLILEPKVFGDERGFFFESYNQQTFEELIGRKVTFVQDNHSKSKKNVLRGLHFQRGENAQGKLVRCAVGEVFDVAVDIRKESPTFGQWVGVNLSAENKRQLWIPEGFAHGFVTLSEYAEFLYKATNYYSPSSEGSILWNDETIGIEWPFSQLPELSAKDAAAPLLHQALLTE >LR134233|1735591:1746096|1739527_1740427_+|VEC91754.1|DBSCAN-SWA MNILLFGKTGQVGWELQRSLAPVGNLIALDVHSKEFCGDFSNPKGVAETVRKLRPDVIVNAAAHTAVDKAESEPELAQLLNATSVEAIAKAANETGAWVVHYSTDYVFPGTGDIPWQETDATSPLNVYGKTKLAGEKALQDNCPKHLIFRTSWVYAGKGNNFAKTMLRLAKERQTLSVINDQYGAPTGAELLADCTAHAIRVALKKPEVAGLYHLVAGGTTTWHDYAALVFDEARKAGITLALTELNAVPTSAYPTPASRPGNSRLNTEKFQRNFDLILPQWELGVKRMLTEMFTTTTI >LR134233|1735591:1746096|1742740_1742902_+|VEC91758.1|DBSCAN-SWA MDDFDSLEFFDIYACGSPVMIDASKKDFMMKNLSVEHFYSDAFTASNNIEDNL >LR134233|1735591:1746096|1740474_1741353_+|VEC91755.1|DBSCAN-SWA MKMRKGIILAGGSGTRLYPVTMAVSKQLLPIYDKPMIYYPLSTLMLAGIRDILIISTPQDTPRFQQLLGDGSQWGLNLQYKVQPSPDGLAQAFIIGEEFIGNDDCALVLGDNIFYGHDLPKLMEAAVNKESGATVFAYHVNDPERYGVVEFDQSGTAVSLEEKPLQPKSNYAVTGLYFYDNSVVEMAKNLKPSARGELEITDINRIYMDQGRLSVAMMGRGYAWLDTGTHQSLIEASNFIATIEERQGLKVSCPEEIAFRKNFINAQQVIELAGPLSKNDYGKYLLKMVKGL >LR134233|1735591:1746096|1744782_1746096_+|VEC91761.1|DBSCAN-SWA MTANNLREQISQLVAQYANEALSPKPFVAGTSVVPPSGKVIGAKELQLMVEASLDGWLTTGRFNDAFEKKLGEFIGVPHVLTTTSGSSANLLALTALTSPKLGERALKPGDEVITVAAGFPTTVNPAIQNGLIPVFVDVDIPTYNIDASLIEAAVTEKSKAIMIAHTLGNAFNLSEVRRIADKYNLWLIEDCCDALGTTYEGQMVGTFGDIGTVSFYPAHHITMGEGGAVFTKSGELKKIIESFRDWGRDCYCAPGCDNTCGKRFGQQLGSLPQGYDHKYTYSHLGYNLKITDMQAACGLAQLERVEEFVEQRKANFSYLKQGLQSCTEFLELPEATEKSDPSWFGFPITLKETSGVNRVELVKFLDEAKIGTRLLFAGNLIRQPYFANVKYRVVGELTNTDRIMNQTFWIGIYPGLTTEHLDYVVSKFEEFFGLNF >LR134233|1735591:1746096|1738442_1739528_+|VEC91753.1|DBSCAN-SWA MKILITGGAGFIGSAVVRHIIKNTQDTVVNIDKLTYAGNLESLSDISESNRYNFEHADICDSAEITRIFEQYQPDAVMHLAAESHVDRSITGPAAFIETNIVGTYVLLEVARKYWSALGEDKKNNFRFHHISTDEVYGDLPHPDEVENSVTLPLFTETTAYAPSSPYSASKASSDHLVRAWRRTYGLPTIVTNCSNNYGPYHFPEKLIPLVILNALEGKPLPIYGKGDQIRDWLYVEDHARALHMVVTEGKAGETYNIGGHNEKKNLDVVFTICDLLDEIVPKATSYREQITYVADRPGHDRRYAIDAGKISRELGWKPLETFESGIRKTVEWYLANTQWVNNVKSGAYQSWIEQNYEGRQ >LR134233|1735591:1746096|1735591_1736995_+|VEC91751.1|DBSCAN-SWA MPATKFSRRTLLTAGSALAVLPFLRALPVQAREPRETVDIKDYPADDGIASFKQAFADGQTVVVPPGWVCENINAAITIPAGKTLRVQGAVRGNGRGRFILQDGCQVVGEQGGSLHNVTLDVRGSDCVIKGVAMSGFGPVAQIFIGGKEPQVMRNLIIDDITVTHANYAILRQGFHNQMDGARITHSRFSDLQGDAIEWNVAIHDRDILISDHVIERIDCTNGKINWGIGIGLAGSTYDNSYPEDQAVKNFVVANITGSDCRQLVHVENGKHFVIRNVKAKNITPDFSKNAGIDNATIAIYGCDNFVIDNIDMTNSAGMLIGYGVVKGKYLSIPQNFKLNAIRLDNRQVAYKLRGIQISSGNTPSFVAITNVRMTRATLELHNQPQHLFLRNINVMQTSAIGPALKMHFDLRKDVRGQFMARQDTLLSLANVHAINENGQSSVDIDRINHQTVNVEAVNFSLPKRGG >LR134233|1735591:1746096|1743676_1744756_+|VEC91760.1|DBSCAN-SWA MIDKNFWQGKRVFVTGHTGFKGSWLSLWLTEMGAIVKGYALDAPTVPSLFEIVRLNDLMESHIGDIRDFEKLRNSIAEFKPEIVFHMAAQPLVRLSYEQPIETYSTNVMGTVHLLEAVKQVGNIKAVVNITSDKCYDNREWVWGYRENEPMGGYDPYSNSKGCAELVASAFRNSFFNPANYEQHGVGLASVRAGNVIGGGDWAKDRLIPDILRSFENNQQVIIRNPYSIRPWQHVLEPLSGYIVVAQRLYTEGAKFSEGWNFGPRDEDAKTVEFIVDKMVTLWGDDASWLLDGENHPHEAHYLKLDCSKANMQLGWHPRWGLTETLGRIVKWHKAWIRGEDMLICSKREISDYMSATTR >LR134233|1735591:1746096|1737172_1738066_+|VEC91752.1|DBSCAN-SWA MMNLKAVIPVAGLGMHMLPATKAIPKEMLPIVDKPMIQYIVDEIVAAGIKEIVLVTHASKNAVENHFDTSYELESLLEQRVKRQLLAEVQSICPPGVTIMNVRQAQPLGLGHSILCARPVVGDNPFIVVLPDIIIDDATADPLRYNLAAMVARFNETGRSQVLAKRMKGDLSEYSVIQTKEPLDNEGKVSRIVEFIEKPDQPQTLDSDLMAVGRYVLSADIWAELERTEPGAWGRIQLTDAIAELAKKQSVDAMLMTGDSYDCGKKMGYMQAFVKYGLRNLKEGAKFRKSIEQLLHK >LR134233|1735591:1746096|1741910_1742741_+|VEC91757.1|DBSCAN-SWA MSHIIKIFPSNIEFSGREDESILDAALSAGIHLEHSCKAGDCGICESDLLAGEVVDSKGNIFGQGDKILTCCCKPKTALELNAHFFPELVGQTKKIVPCKVNSAVLVSGDVMALKLRTPPTAKIGFLPGQYINLHYKGVTRSYSIANSDESNGIELHVRNVPNGQMSSLIFGELQENTLMRIEGPCGTFFIRESDRPIIFLAGGTGFAPVKSMVEHLIQGKCRREIYIYWGMQDSKDFYSALPQQWSEQHDNVHYIPVVSGDDAEWGEERDLSIMP |
11 | Enterobacteria_phage(37.5%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_7 |
1939512 : 1987624
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >LR134233|1939512:1987624|DBSCAN-SWA TTTACATGCGTTCTACGGTTTCGATACCCAACGTATCCAGACCCAGTTTCAGCGTCTTCGCCGTCAGTTGCGCCAGTTTTAGGCGGCTATTGCGAACCGCGTCATTCTCGGCGCTGAGAATAGGGCAGTGCTCGTAGAAGCCGGAGAACAGCCCGGCGACGTCATACAGATATGCGCACATGACGTGCGGCGTACCTTCACGCGCGACCACCGTCAGGGTTTCTTCAAACTGCAGCAGACGGGCAGCAAGCTGCGCTTCGCGATCTTCGCTAATGATAACCGGCGCGCTCGCCAGCGCCTGCTCGTCGATGTTCGCTTTACGGAAAACAGACAGCACACGCGTATAGGCATACTGCATATAAGGTGCGGTATTGCCTTCGAAAGCCAACATATTATCCCAGTCGAAGATATAGTCAGTGGTGCGGTTTTTTGACAGATCGGCGTATTTCACCGCGCCAATACCGACGGCGTTAGCCAGTTTTTCCAGCTCGTCAGCCGGCATATCCGGGTTCTTCTCCGCCACCAGACGACGCGCGCGCTCCAGCGCCTCATCCAGCAGGTCCGCCAGTTTCACCGTACCGCCCGCGCGAGTTTTAAACGGTTTACCGTCTTTACCCAACATCATGCCGAACATGTGGTGTTCCAGCGGAACGGAATCCGGCACATAACCGGCTTTACGCACAATGGTCCAAGCCTGCATCAGGTGCTGGTGCTGACGGGAGTCAATGTAGTAGAGCACGCGGTCGGCGTGCAGCGTTTCGTAACGGTATTTGGCGCAGGCGATATCGGTAGTGGTATACAGATAACCGCCATCTTTCTTCTGAATAATGACGCCCATCGGGTCGCCTTCCTTGTTTTTAAACTCATCAAGGAAAACCACCGTTGCGCCTTCGCTTTCAACCGCCAGGCCTTTGGCTTTCAGATCGGCGACAATGCCCGGCAGCATCGGATTATACAGACTTTCGCCCATCACATCGTCACGCGTCAGAGTGACGTTCAGGCGATCATAGGTGATCTGGTTTTGCGTCATGGTGATATCAACCAGCTTACGCCACATCTCACGGAAATAGGTATCACCGCTCTGTAATTTGACGACGTAGTTACGCGCGCGCTCGGCAAACGCTTCATCTTCGTCATAATGCTTTTTCGCATCGCGGTAAAATCCTTCGAGATCCGCCAGCGCCATGTCGCCCGCGTTTTCCTGCTGCTGTTTTTCCAGCCAGGCGATAAGCATACCGAACTGGGTGCCCCAGTCGCCGACGTGGTTAGCGCGAATAACATGATGGCCCAGAAACTCCAGCGTGCGCACTGCCGCATCGCCGATGATCGTGGAGCGTAGATGACCAACGTGCATCTCTTTCGCGACGTTGGGCGCGGAGTAGTCAACGACAATCGTCTGACGAGTCGGCTGAGAAACGCCCAGACGATCGGAGGTCAGCGCCTGTTGCACCTGTTCAGCGAGAAAAGCCGGTTCGAGGAAAATATTGATAAAGCCAGGGCCGGCGATTTCAACCTTGCTGGCGATGCCGCTGAGATCCAGATGAGTCAGCACCTGCTCTGCAAGTTGTCGGGGGGCCATACCCAGTTTTTTCGCAACTGCCATCATGCCGTTGGCCTGATAGTCGCCGAACTGAACTTTGGCTGACTGACGAACCTGGGGCTCGCAATCTGCAGGCGCGCCTGCAGCAATCATGGCCTGACTAACTTTTTCTGAGAGAAGAGCCTGAATATTCACCGGAATACCTTACGTTTATGAAGCGGGCTGTATTGCCCGCGCCAGTTTAAGAATTAGGGCGGGAGTATACTGCAAATGCCCGTTGGCGTCAGCATTGCGCGCGCGGCGCGTCGCCTGAAATATACGCAACAATAATCTGCATAAGCATTCAATAAACGCCCAATCAAATCAAAGGTTGGTTGCCGGGGCGCAACAGCGTAGATTAGCGGTTTTCACAGCAAGAAGAGCGTCCTGATATGGCGAACTGGCAACACATTGATGAACTGCATGATATTTCCGCAGATTTACCTCGATTCACTCTGGCGTTCAGAGAACTTTCCACTCGCCTTGGTCTGCAGATTAGCGCCCTTGAGGCCGATCACATTTCATTGCGCTGTCACCAGAATACGACGGCGGAGCGTTGGCGTCGCGGTTTTGAACAATGCGGCGAGCTACTGTCGGAAAATATCATCAACGGCAGGCCGATTTGTCTGTTTAAACTGCATGAACCGGTATGTGTGGAACACTGGCGGTTTTCTGTTATTGAATTACCGTGGCCGGGAGAAAAACGCTATCCACACGAAGGGTGGGAGCATATTGAAATCGTGCTGCCCGGCGAGCCGGAGACCTTGAACGCGAGGGCGCTGGCGTTGTTGTCGGATGAGGGGCTTAGTCAGCCGGGGATCGTGGTTAAAACCAGTTCGCCACAGGGCGAGCATGAACGTTTGCCAAACCCTACGCTGGCCGTGACCGATGGCCGCGTCACGGTAAAATTTCATCCATGGTCGATTGAAGCGATTGTCGCCAGCGAACAGGCGGCCCATTAACCTGCCGAGCGGAGTGAATATGGCGTTACTTGAGATCTGTTGTTACAGCATGGAATGCGCGCTCACCGCGCAGCGAAACGGCGCGGATCGTATCGAACTGTGCGCCGCGCCGAAAGAAGGGGGGCTTACGCCTTCTTTCGGCGTCTTACGCAGCGTACGCGAGCATATTACGATTCCCGTACATCCGATTATTCGTCCTCGCGGCGGGGATTTTTACTACACTGACGGCGAATTTGCCGCCATGCTGGAAGATATCCGCCTCGTCAGAGAGTTGGGGTTTCCCGGGCTGGTTACTGGTGTGTTGACCGTTGATGGGGATGTCGATATGTCGCGAATGGAAAAAATAATGGCGGCGGCCGGACCGCTGGCGGTGACATTCCACCGCGCCTTCGATATGTGCGCTAATCCCTTCAATGCGCTAAAGAATCTGGCTGACGCAGGCGTAGCAAGAGTACTGACTTCCGGACAAAAAGCCGATGCGGCGCAAGGTTTATCAATAATTATGGAACTTATTGCCCAGGGGGATGCTCCAATCATTATGGCTGGTGCGGGGGTTCGTGCAAATAACCTGCAGAATTTCCTCGATGCCGGAGTACGGGAAGTACACAGTTCCGCCGGAGTCTTACTGCCTTCGCCGATGCGCTATCGCAATCAGGGGTTATCGATGTCTGCCGATATTCAGGCGGACGAGTATTCTCGCTATAGAGTAGAGGGTGCGGCGGTCGCTGAAATGAAAGGAATCATTGTTCGCCATCAGGCCAAATGATTTTTACCGTTGCATCATGTCGCCCAATATGATGCTTGCTCGTACCAGGCCCCTGCCAATTCAACAGGGGCCTTTTTTTGTCTATGGAAAACCCCCAGCTAGGCTGGGGGTTCCGGAAAGCTTTCAGCTTTAAGCCAGTTATTAAAACCCCTTTTGATTTGTTAAAACATCTTGCGGTCTGGCAACTGCAAAAGTTCAACAAGAAATCAAAAGGGGGTCCCAATGGGGGACGAAAAGAGCTTAGCGCACACCCGATGGAACTGTAAATATCACATAGTTTTCGCGCCCAAATACCGAAGACAAGCGTTCTATGGAGAGAAGCGTAGGGCAGTAGGTAGCATATTAAGAAAATTGTGTGAATGGAAAAACGTACGAATCCTGGAAGCGGAATGTTGTGCAGATCATATTCACATGCTTCTGGAGATCCCGCCGAAGATGAGTGTGTCGAGCTTCATGGGATATCTGAAGGGTAAAAGTAGTCTGATGCTTTACGAGCAGTTTGGGGATCTAAAATTCAAATACAGGAACAGAGAGTTCTGGTGCAGAGGGTACTATGTCGATACGGTGGGTAAGAACACGGCGAAGATACAGGACTACATAAAGCACCAGCTTGAAGAGGATAAAATGGGTGAGCAATTATCGATCCCTTATCCGGGCAGCCCGTTTACGGGCCGTAAGTAACGAAGTTTGATGCAAATGTCAGATCGTATGCGCCTGTTAGGGCGCGGCTGGTAAGAGAGCCTTACAGGCGCATCAGAAAAACCTCCGGCTATGCCGGAGGATATTTATTCTCCTTCATATTTCAAGCCGCAGCAGCGTTGGCCGCCTCGCGCACCCCGGTCACGTAACTACCTACGTTCCCGGGGATTTGCTGCGTTGCCGCCTTGCTGCAACTTGAACTATTTTGGAGAAACGGTGCTATACGTAACGGTCATTCGGCAATCCATCACGGTTTACGCGCAATCAGCACCGCACGCTGCGGGGCGGGATAGCCTTCCACCGTCTTGCTGCGATCGTTCGGGTCAAGAAAATCGGCCAGCGACTCGGTAACCATCCATTCGGTACGGCGCTGCTCTTCGGTGGTAGTGACGCAAACGTCAGCGATACGCACATCAATAAAGCCGCATTTTTCCAGCCATTTTTTTAATGCCGGCGCGGAAGGAATAAAATAGACGTTACGCATCTGCGCGTAGCGGTCACCCGGCACCAAAACGGTATTTTCGTCGCCGTCGACGACCAGCGTTTCCAGTACCAGTTCACCCTCGTTGACCAACTGATCTTTTAACTGCCACAGATGCTCCAGCGGCGAGCGACGGTGATAAAGCACCCCCATCGAGAAGACGGTATCAAATGCTTTCAGCGCCGGGAGTTGTTCAATGCCTAACGGCAGCAGATGCGCGCGCTGATCGTTACCCAACAGTTTACGCACCGCCTCAAACTGGCAGAGGAAAAGCTGCGTAGGATCGATGCCGACCGCCAGATGTGCGCCCGCGCCAATCATCCGCCACAGGTGATAGCCGCTGCCGCAGCCGACATCAAGGATCGTCCGGCCGGTTAAATCTGACAGATGCGGCAGTACGCGATCCCACTTCCAGTCGGAGCGCCATTCGGTATCAATATCTACGCCATAAAGCGAGAACGGGCCTTTACGCCACGGCATCAGATTACGCAGCAATGTATCAATACGCTTTAGCTGACCTTCGCTGAGCGGCGTTTCGCTTTCCGCCGTGACGCTGTGCAGTAAATCCAGCCGCCAGGGCGTCATTTCCGGTAAAAATTCTACCGCGTTCGACCACTGTTTAAACAAGCCATGCTGCTGCTCGCGTTGCCAGGCGGCAATCTGCGCGGGCAATGTTTCCAGCCAGTGGGAGAGATGATTTTTAGCAATGAGCTGATAAAAGTTACCAAACTCGATCATGCGGCAACTCCGGCTTTTAACGCGACAAGCGACCCGAAATTAAAGCACTGGAACCACAGTTCACTGTGTTCGAAACCCGCCTTACGCAGGCGCGATTTATGTGTTTCCACCGAGTCGGTGAGCATCACGTTTTCCAGCATACTGCGCTTCTGGCTGATCTCCAGTTCGCTGTACCCATTAGCGCGTTTAAAATCGTGGTGCATGTTGAATAACAGCTCGCCCACTTTCGCGTCTTCAAAGCTAAATTTTTCCGACAGTACCAGCGCGCCGCCGGGATTCAACCCCTGGTAAATCTTATCCAGCAATGCCTGGCGTTCGGCAGGTTCAAGGAATTGCAGGGTAAAATTCAGCACCACCATCGAGGCGTTTTCAATAGTGATATCGCGGATATCGCCTTCAACGACTTCAACCGGCGTAGGCGCTTTATACGCATCAATATGACGGCGGCAGCGTTCAATCATGGCCGGGGAGTTATCGACGGCGATAATGCGACAATGCTCATGGCGGATATTACGCCGAACAGACAGCGTCGCGGCGCCCAGCGAGCAGCCCAGATCGTAGACCTGCGTGTTCGGCTGCACAAAGCGTTCAGCCAACATGCCAATCATGGAGATGATGTTGGAGTAGCCTGGTACCGAACGCTGAATCATGTCGGGGAAAACTTCGGCTACTCGTTCATCAAAAGTCCAGTCGCCGAGACGGGCGATAGGCGCAGAAAAAAGCGTGTCACGGTGAGACATAACGTATACATCCGGGAAAAATAAAGGCGCGTATTGTGCGCTAATACAGGGAGAAAACCAACTCCCAGGGCATATACCACAGGTTTGCCAGCACCATCAGCAACAATGCGCACCAGGTCGCGCTCATCCCCGCCCGCCGCCAGCGGAACAGGCGGTGGTGAAAACCGTAATAATGCATCAGCCGACCGGCGATTAAAATGATGCCGCAGATATGCACCATCCACGTTTCGGCGCCGTTCATTTCCATAAACAGCAGCAGCACCAGCGCGACAGGAATATATTCCACCGCGTTGCCATGAATACGGATGGCGCTTTGTAATTCGCTAAAACCGCCGTCGCCATAGGCGACACGGTACTGCATTCGCAAGCGAACGACATTAAAAGAGAACTTCATTAATAATAACGCACCCAAAACGGCATACAGCGCGCTTACCATACAAACTCCCTTTTAAAATGGCTGATGGCACTGTCTATGATAGGGGGCATTTTCAGAAAAGAGAAGATTGCTGTGGAATGGACGGAGCAGTGCCGATTTCCGGTAAGACGCTGCGTAAATCATGCCAAAGCGTATTCACCAGTTCCGGCGCCTGCGCGATATCCGGAGTATGTAAAAACAGAAAAGGCGTGGTGGTCTGCCGCCACTGCGGGAGTTTCTGCAACCAGGCGGCGAAAAACTCACGGTTTTGCGCCATATTGTCGCTACCGATAAAACGCACCATCGGATGGCTGGCCGTCACGACCGCATGTACCGGGACTTTGGGTTTTTTTCGCTGCGCGTCGCGGACGGCCTCGCTGTGGGGATGGGCGGCATGTACCGGACGACTATCCAGGATCACGCGGTTTACGCCGCGCGCGTGTAACCCGCGATTGAGCCGCTGTTCATCCTCACCTTTGTCAAAAAACACGGATGACGAACTTCTACGCCATAGGTAAACGTGGCGGGGAGCGCATCAAGAAACTGCCATAAGGCGGGGAGATCGCGCGGTCCAAAGGCGGCGGGCAACTGCAGCCAGTATTGGCCAATACGCGTTTCCAGCGGCGCCAGGCGGGTAAAAAACGCCTGTACCAGATCGTCGCAATGGCGCAGCGCGGCCTGGTGAGAAATGGTCGCGGGAAATTTAAAGCAAAAACGGAAATCGTCCGTAGTTTGCGCATACCAGCGATCGACAATTTCCGCTTTCGGCAGCGCGTAAAGCGTGGTGTTGCCCTCCACGCAGTTAAAGTGGCGGGCATACTCTTCCAGACTGGTGATGCCGAGCCGCGCCCATTTCGGGTGCGACCATTGCGGCAAGCCAATGTAGATCATAAGGCGTTTAAAATATCCTCTACGCTGCGTACCCGGCCAATGCGCGGGAAAATATGCGTCATACTGCTCTGATGCTGCTTACTGCTGGCGGCGCTACAGGCATCTTCGGCAATAATAAGGTTAAATCCCAACTCCCAGGCGTTACGCGCTGTGGATTCGACGCCGATATTGGTTGAGATCCCGCACAATATGATGGTGTCGATACCCCGGCGGCGAAGCTGTAATTCCAGATCGGTGCCGTAAAACGCGCCCCACTGGCGTTTAGTCACTTCCAGATCGCTGTCCTTTTTACCCAATGCCGTAGGCCAGGTCCACCAGTTTTCCGGCAACGCGTGCGCAGGCGTTGCGGCATCCACGGGTTGTTTTAACGCTTCGGCATAATCATCAGACCATCCGACGCGTACCATAACCACGGGCGAGCCGTTGGCACGACACTTTTCCGCCAGCCGCGCGGCGCGGGCGACCACCTCATTTGCCGTATACGGGCCGCCGGCAAACGGAAGAATTCCTTCCTGTAAATCAATAACCACCAGGGCGGTTGTGGTCGCGTTGAGTTTCAGCATGATAATCACTCTCCTAAAAATAGCGGATTTCACTACACCTTAAGCGCGATTAAAGAGGAAAGCGGCGACAAATATTGTTAATTTTTGTGAGCATACGCAATACCCGCGTGAAAAACCCGGGTACAGTCAGTATGAACCAGGCGTCGATACCGCATGAAACGGGCAAGTTATCGCTCCCATGCTGGATTGACTCTTATCGTGCGGCAGAATTCCAGTATAATAGCCGCCTTTTTTCATCCAGTTGTGACATACAGCTAAAGCTGCGACTTCGTCGCCTGCATGGCAGGCAACAATCCGCCTGCGGCTAAGTTAAGGGATATCTCATGCGTACAGAATATTGCGGACAGCTACGTCTGTCCCACGTGGGCAGCAGGTGACTCTGTGTGGTTGGGTCAACCGTCGTCGTGATCTTGGCAGCCTGATCTTTATCGATATGCGCGACCGCGAAGGTATTGTGCAGGTGTTCTTCGATCCGGATCGTGCGGACGCGTTAAAGCTGGCCTCTGAACTGCGTAATGAGTTCTGCATTCAGGTTACGGGCACCGTGCGTGCGCGTGACGCGAAAAACGTCAATGCGGATATGGCGACCGGCGAAATTGAAGTGCTGGCGTCCTCTCTCACTATCATCAACCGCGCAGACTCACTGCCGCTTGACGCTAACCACGTTAATACCGAAGAGGCGCGTCTCAAGTACCGCTATCTGGATTTGCGTCGTCCGGAAATGGCGCAGCGCCTGAAAACTCGCGCCAAAATTACCAGCCTGGTGCGTCGTTTTATGGACGATCACGGTTTCCTTGATATTGAAACGCCGATGCTGACCAAAGCCACGCCGGAAGGCGCGCGCGACTATCTGGTGCCTTCGCGCGTGCACAAAGGTAAATTCTACGCGCTGCCGCAGTCGCCGCAGCTGTTCAAACAGCTCCTGATGATGTCCGGTTTCGACCGTTACTATCAGATAGTCAAATGCTTCCGTGATGAAGACTTACGTGCTGACCGTCAGCCGGAGTTTACTCAGATCGACGTCGAGACCTCCTTCATGACCGCGCCACAGGTGCGCGAAGTGATGGAAGCGCTGGTGCGCCATTTATGGCTGGAAGTGAAAGGCGTGGATCTGGGGGATTTCCCGGTCATGACGTTTGCCGAAGCGGAACGTCGTTACGGTTCCGACAAACCAGACCTGCGTAACCCGATGGAGCTGGTAGATGTCGCTGACCTGCTGAAATCGGTAGAGTTCGCGGTCTTCGCGGGCCCGGCTAACGATCCGAAAGGCCGCGTGGCAGCGCTGCGTGTGCCGGGCGGCGCACAGCTTAGCCGTAAGCAGATCGACGATTACGGTAACTTTGTTAAGATCTACGGCGCGAAAGGCCTGGCATATATCAAAGTTAACGAGCGCGCGAAAGGTCTGGATGGGATTAACAGTCCGGTGGCCAAGTTCCTGACCGCCGACATCGTCGAGGCTATCCTTGAACGTACCGGCGCGCAGGACGGCGACATGATCTTCTTCGGCGCAGATAACAAAAAAGTGGTTGCCGATGCGCTGGGCGCGCTGCGTCTGAAACTGGGCAAAGACCTGAGCCTGACCGACGAAGACAAATGGGCGCCGTTGTGGGTGATTGACTTCCCGATGTTCGAAGACGACGGCGAAGGCGGTCTGACCGCGATGCACCATCCGTTCACCGCCCCGCGTGACATGACGGCGTCTGAACTGAAAACTGCGCCGGAAGAAGCCGTCGCTAACGCTTACGATATGGTGATTAACGGCTATGAAGTGGGCGGCGGTTCGGTGCGTATTCACAACGGTGAAATGCAGCAAACCGTATTTGGTATTCTCGGTATCAATGAGCAGGAGCAGCGCGAGAAGTTCGGCTTCCTGTTGGATGCGTTGAAATACGGTACGCCGCCGCACGCGGGCCTGGCGTTTGGTCTGGACCGTCTGACCATGCTGTTGACCGGCACCGATAATATTCGTGATGTTATCGCCTTCCCGAAAACGACCGCCGCCGCGTGTCTGATGACCGAAGCGCCGAGTTTCGCCAACCAGGCAGCGTTGACGGAGTTGGGTATTCAGGTTGTGAAGAAAGCCGAGAATAACTGATATGAACAATGTACACGACATCATTCGCGTGATATCGCGGCGGCCCGCAAGGGGGAGCCTCATCAATGCGCGGCGTCACCCAACAGCCAACAACGAGGTCGCGTGAAGGATAAAGTGTACAAGCGCCCCGTTTCGGTCCTGGTGGTTATTTTCGCGCAGGATACGAAACGGGTGCTAATGTTGCAGCGACGCGACGACCCGGATTTTTGGCAGTCAGTGACTGGCAGCATAGAAGAAGGGGAGACCGCGTTGCAGGCCGCCGTACGTGAAGTCAAAGAGGAGGTCACGATTGATGTTGTGGCAGAGCAACTGACCTTAATCGACTGCCAACGTACGGTGGAGTTTGAAATTTTTTCACATTTACGTCATCGCTATGCGCCGGGCGTCATGCACAATACAGAATTCTGGTTCTGCCTTGCGTTACCGCATGAGCGGCAGGTGATATTCACTGAACATCTGACGTACCAGTGGCTTGATGCGCCTGACGCGGCGGCGCTTACCAAGTCCTGGAGTAACCGGCAAGCGATTGAAGAGTTTGTCATTAACGTCGCCTGAAAAGGCGCGCTTTTTTTGAGGAAATATTTATGGCAGGTCATAGTAAATGGGCCAACACCAGACATCGTAAAGCTGCGCAGGATGCCAAGCGCGGTAAAATCTTCACTAAAATTATTCGTGAACTGGTAACGGCAGCTAAATTGGGCGGCGGCGATCCGGATGCCAACCCGCGTCTGCGCGCGGCCGTAGATAAAGCGCTGGCCAACAACATGACCCGCGACACTCTGAACCGTGCCATCGCTCGCGGTGTGGGTGGTGATGAAGACTCAAACATGGAAACCATCATCTATGAAGGTTATGGTCCTGGCGGCACGGCCATTATGATTGAGTGTCTGAGTGACAACCGTAACCGTACCGTAGCGGAAGTGCGTCATGCTTTCAGCAAGTGCGGCGGTAATCTGGGTACCGATGGTTCAGTGGCGTACCTGTTCAGTAAGAAAGGGGTCATCTCCTTTGAGAAAGGCGATGAAGACACCATCATGGAAGCCGCTCTGGAAGCTGGCGCGGAAGATGTTGTGACCTATGATGATGGCGCGATTGATGTTTACACTGCCTGGGAAGAGATGGGCAAAGTGCGTGACGCGCTGGAAGCGGCGGGTCTGAAAGCGGACAGCGCCGAAGTGTCCATGATCCCGTCAACCAAAGCGGATATGGATGCGGAAACCGCGCCGAAACTGCTACGTCTGATCGATATGCTGGAAGACTGTGACGATGTCCAGGAAGTCTACCATAACGGTGAGATCTCTGATGAGGTGGCGGCGACCCTGTAATGGTGCTGTTAAACGCCGGCGAGCAGGGGGGATAGCGTGTCGATTATTCTCGGTATTGACCCCGGCTCGCGCATTACCGGTTATGGCGTTATCCGCCAGGTAGGCAGACAACTGACCTATCTGGGCAGCGGATGTATTCGCACCAAAGTCGATGATTTGCCGTCTCGTTTGAAGCTCATATACGCAGGCGTGACGGAAATCATCACGCAGTTCCAGCCGGACTATTTTGCTATCGAACAAGTGTTTATGGCGAAAAATGCCGACTCAGCCCTCAAGCTGGGACAGGCGCGCGGCGTCGCGATTGTCGCCGCGGTCAATCAGGAACTGCCTGTGTTTGAATACGCGGCACGTCAGGTGAAGCAAACTGTCGTCGGTATTGGTAGCGCGGAAAAAAGCCAGGTACAGCATATGGTGCGCACGTTGCTCAAACTGCCCGCTAACCCGCAGGCCGATGCCGCGGATGCGCTGGCTATCGCCATTACGCATTGCCATGTTAGCCAGAACGCGATGCAAATGAGCGAGAGCCGACTCAATCTGGCGCGGGGAAGGTTGCGCTAAATCATTCTCCCTCAGCCTCGCGTTGAGGGGTTTCAATTCCACAGGCGCTAATTAATTCTAAATTGGGATGATGCCACAGGCTGGCTGGCGTCACCGTCTTACGATCCCACGGGATTGAACCTAAAAACCAGAACTTCCAGAAGGTGAGTTTTGCATCAGGATTGCTGTGAAGTAATTCTTCGAATGTCTCAATATCGCCTACCGGAATACACAATGCTTCTTTATAGATATCAAACACAAATTTCGAGCAGAACTGGCGCGATGACTCATATTTAAACCCCGTGTGATAAAACTTATTCAGCCGGGCGGGAACCTGTTCCATGATGGCGAGTTTCTGTTCGACCGTCAGGCCGCCGCACAGGCGACGAACCGCGTAACGTTGTCCGGCTGAACGTTGTATAAACAGTGACAGCGTGGTGACGGTTGAGAGGGGGACGCGACTTTCCGCCACCAGATAGTCATCACCATTATGACCAATAATAATCCCAACATGGTTGCTCCAGCACTGTGAAGCGGTGGATATCTGGCCGAAAAGCGCGGCACCTATACAGGTAAAGACGATATCCCCGGTTTCATATTGGACAGAATAATGTTTTTTCATCTCTGCATTCCATTATTTGAAATAAATTCCTTTTATTTATTCTACAGGGTAGTTAGCATTATACCAGATAAATGACAAGCCTTAGTGTCATCGAAATATGTATGGTGAATATTTCATTTATTGGAATTAATGTCGCAAATATATTTGGGGGGGATTAAGGTGGTAAATTTCTCATCATCTGATCTCTCATATTGACAGGGAGTATGACGTGGTTGATGATAAACCAATGAGAATAGGTTGGGTATTTTATTTTTCTTATATTTTTGTTGCTATTATCTTTCATCTGTTTATTAGTTTTTGTTATTGTTTGAGTATTGATATGGCAGAGGCTAATACTGTTGTTTTTTTATTAAAGCCAGGGACGATTTCATTATTATTTTTGCTTCTCCCTGCCAGGCGGTTCAGGACCAGACTGTTGGCAACATTGAGTTCAGTTTTCATTACCTTAGTATTTAACCAGTGGCATTTGGTTGCCGGGAATAAAGAACTGGTTTTGTGCCTGCAGGCAGCGTGCTTTATGGCTTTCCTGGCTATGACATCGGTAAAAAAATCCGGGTGGATGATTAGCGCATCATTATTTTTGGTTTGTGCGGCCGGAACGATTCGCCAGTGCTGGCTGGAACAGCTGTTTAACGTTGCCGATATATATATTGTTGATGATGGGCGGAGCTGTGGCGCCAGCGGCCACTGCTTCCAGTATATTGCAGCAAAAGGCCGTGGGTTAGCAGCAAAACGACAGGCACTTTTTAGTAGCGAAGAGTATGTGAATATTTATTATAGTTATTCAGAAGGGATACCAGCCGTGAATTCGATGGCATGAAAAAATGAGTTTGTACAATATTTATTGTGTCATGGCGAATTAAAATCTGTTGCGCGCGACGATAAAACAACATGCGATTAAATTCAGCGCTAGTGGTTAATATTATTAAATCTATTCTCGGGTCTGTTTTTTTGGAGGCCCGATGTTCCGCTGTTGGCATTCGTACTGGATACATTATTAGTATTATCAGGATACTCATTATTGAGACTACCGAACGTGCGCCACGGCATGTTGGCAAAAAGTTATTCGCGTAGGACGCAAAAAGTTGTCAATGAGTGTACGACTTATTTTGTACGCAGCGGGGAGCGTACCCGTCGTTAACCATCGCGGGTTGTCGGAGTAGAGAAATATCGGGATACGTGTTTTTCATCTGGATAAACATCCAGTCTTTTTTTATGATAGTGCATTATCTTTTCAGCCCTTTACGCAGGAGCGTCATGTGATAGGCAGACTCAGAGGCATTATTCTCGAAAAACAACCCCCGATCGTGCTGCTGGAGACAGGGGGCGTAGGCTATGAAGTACATATGCCTATGACCTGCTTTTACGAGCTGCCGGAGGCCGGGCAGGAGGCAATCGTCTTCACCCACTTTGTGGTGCGTGAAGATGCGCAGTTGCTGTACGGATTCAACAATAAGCAAGAGCGCACGCTGTTTAAAGAGCTAATTAAAACGAATGGCGTCGGGCCTAAGCTGGCGCTGGCGATTCTCTCCGGGCATGTCGGCGCAACAGTTCGTCAATGCGGTTGAGCGCGAAGAGCTTGGCGCACTGGTGAAGCTGCCGGGTATCGGCAAGAAAACTGCCGAAAGGCTGATTGTCGAAATGAAAGACCGCTTTAAAGGGCTACATGGCGATCTCTTTACGCCAGCGGTCGATCTGGTATTGACGTCGCCGGCCAGCCCAACCTCGGAAGATGCAGAACAAGAAGCAGTTGCTGCGCTGGTGGCGCTGGGTTATAAACCGCAAAAGCCAGTCGGATGGTAAGCAAGATCGCCCGTCCGGATGCAAGCAGTGAAACACTGATTCGCGACGCGTTACGCGCCGCGTTATGAGGTAAAGGATGATAGAAGCAGATCGCCTGATTTCAGCAGGCGCCACGATCGCAGAAGACGTCGCCGATCGCGCCATTCGCCCGAAGTTATTGGCGGAATATGTCGGCCAACCGCAGGTGCGCTCCCAGATGGAGATCTTTATTCAGGCGGCAAAACTGCGTGGCGACGCACTAGATCACTTGCTGATTTTCGGCCCGCCAGGGTTAGGTAAGACGACTCTGGCGAATATCGTTGCCAATGAAATGGGCGTCAATCTCCGAACCACGTCCGGCCCTGTGCTGGAAAAAGCAGGCGATCTGGCGGCTATGCTCACCAATCTTGAACCGCATGATGTGCTGTTTATTGATGAGATTCACCGGCTTTCACCGGTGGTGGAAGAGGTACTCTATCCGGCGATGGAAGATTACCAGCTGGATATTATGATTGGCGAAGGGCCTGCTGCACGTTCAATTAAGATCGATCTGCCGCCGTTTACGCTCATTGGCGCGACGACGCGAGCAGGTTCGCTGACCTCGCCGCTACGTGACCGTTTTGGTATTGTGCAGCGCCTGGAGTTCTACCAGGTGCCCGATCTGCAGCACATCGTCGGGCGCAGCGCAAGGCACATGGGCTGGAGATGAGCGATGACGGCGCGCTGGAAGTGGCGCGTCGGGCGCGCGGTACGCCGCGTATAGCCAACCGCCTGCTCAGACGTGTGCGTGATTTCGCCGAGGTGAAGCATGACGGGGCTATTTCAGCGGAAATTGCCGCGCAGGCGCTGGATATGCTGAACGTCGATGCGGAAGGGTTTGACTATATGGATCGCAAGCTGCTGCTGGCGGTTATTGATAAATTCTTCGGTGGACCGGTCGGCCTGGATAACCTGGCGGCGGCGATTGGCGAAGAGCGTGAAACCATTGAGGATGTGCTGGAGCCGTATCTGATTCAGCAAGGCTTTTTGCAGCGCACGCCGCGTGGGCGTATGGCAACGGTGCGCGCCTGGAACCATTTCGGGATCACGCCGCCAGAGATGCCTTAGCGCCTGAAGTCGCTGCGCTCATCAGACCTGGGCGATTTTGTAGGCCCGGTAAGCGCAGCGCCACCGGGCAACACACGCTTAGCTTGCCTGCTTTTTCATCATACTGAAGATAAACAGCAGCGCGGCACACAGCACTACGGATGGCCCTGCCGGGGTATCATAAAAGGCTGAGAAGGTCAGTCCACCTGTAACCGCTATCATACCTACACCAACGGCGACGCCCGCCATCTGTTCCGGCGTACGGGCAAAGCGACGCGCGGTTGCGGCGGGGATAATCAGCAGCGACGTAATGATCAGCGCTCCGACAAACTTCATCGCCACACCAATGGTTAACGCTGTTACCAGCATCAATAGTAGCTTCACGCGCTGTAACTTCACGCCATCGACAAAGGCAAGATCCGGACTGATAGTCATTGACAGCAAGTTGCGCCACTGCCAGAAGATAATGGCCAGCACAATAACGACCCCAATAGCGATCGAGATAAGATCTTCCGGTGTGACGGCCAGTAAATCGCCAAACAGATAAGCCATTAAATCAACGCGGACGTTGGACATCAGGCTGACCACGACCAGTCCTAAAGACAAGGCGCTGTGCGCCATAATGCCCAGTAAGGTATCAATCGCGAGGTGAGGACGCTTCTCCAGCCATACCAGACCGGCGGCCAGCAGCAGCGTAACGGCGATTACCGCATAAAAGGGGTTAACGTCCAGCAACAGGCCAAACGCCACGCCCAGAAGCGACGCGTGAGCCAGCGTATCGCCAAAATAGGACATCCGACGCCAGACCACAAATGAGCCCAAAGGACCAGCGGCGCACGCCAGCATCATCCCGGCCAGCCAGCCGGGCAGTAATAATTCAATCATGAGTGACCATTTCCCCGGCGCAGTACAATACGACCCTGTAAATCGTGGCGATGATTATGATGATGGCGATAAATCCCTAATTGCTCCGCGCCTCGCGGGCCAAACATAGAGATAAATTCCGGATGCATAGACACCACTTCCGGCGCGCCGGAACAGCAAATATGATGGTTCAGGCATAACACTTCATCCGTCTTTGCCATGACCAGATGTAGATCATGCGACACCATCAGCACGGCGCAATCGAGTTCACGGCGCAGCTGATCGATAAGGTCGTATAACGCGACCTGGCCGTTGACATCCACGCCTTGCGTCGGCTCATCAAGTACCAGCAACTGAGGCCTGTTAAGCAGAGCACGCGCCAGCAGTACGCGCTGTGTCTCACCGCCAGAGAGTTTTTGCATGGGCGCGTCAATCAAATGTCCGGCCTGAACGCGTTTAAGCGCCGGGAGAATATCCGTTTTTTGTGTGCCGGGACGTAAACGTAAAAATCGGTTTACCGTCAGCGGAAGCGTGGTATCGAGATAGAGCTTTTGCGGGACATAGCCGATACGGAGTTGCCCGTTGCGCTTGATCACCCCTTCATCAGGGGCTACCAGTCCTAAAACCACGCGTACAAGCGTTGACTTCCCCGCGCCGTTAGGGCCGAGAAGCGTTAAAATTTTTCCGGGGCTCAATTCAAGCGACACGTCAGAGAGGACGCGGCGTTGACCAAATGAGACCGAGACGTTTTCCCAGTGAAACTAAACTTGTCATGTTAATTTTAGGCTTGCAGAAGTGATAGAATGTTATAATATCACATTTCACACATTCATTACGATGATTAGTCGCATTATGTTACAGAAAAATACGCTTCTTTTCGCAGCATTATCCGCCCGCGCTTTGGGGGAGTGCAACCCAGGCTGCCGACGCCGCCGTTGTCGCTTCGCTTAAACCGCTTGGGTTTATCGCTTCCGCCATTGCTGATGGCGTTACGGATACACAAGTATTACTTCCGGATGGGGCTTCCGAGCATGATTATTCATTGCGTCCATCAGACGTAAAACGCTTACAGGGCGCGGACTTAGTCGTCTGGGTTGGCCCGGAAATGGAAGCCTTTATGGAGAAGTCGGTCAGGAATATTCCTGATAATAAGCAGGTTACCATTGCGCAACTTGCCGATGTAAAACCGTTACTCATGAAGGGCGCGGATGATGATGAGGATGAACATGCGCATACGGGCGCCGATGAGGAAAAAGGTGACGTACATCACCATCACGGCGAATATAACATGCATCTTTGGCTCTCCCCAGAGATAGCGCGGGCTACAGCGGTTGCAATCCATGAAAAATTAGTGGAACTTATGCCGCAAAGTCGAGCCAAACTCGACGCCAACCTGAAGGATTTTGAGGCACAATTAGCCGCAACCGATAAACAGGTCGGTAAACGAGCTCGCGCCGCTCAAGGGGAAAGGGTATTTCGTTTTTCATGACGCCTACGGTTACTACGAAAAACACTACGGACTCACCCCGCTCGGTCACTTTACCGTGAACCCTGAGATACAACCTGGCGCGCAGCGTTTACATGAAATAAGAACACAGTTGGTTGAGCAAAAAGCAACCTGCGTTTTTGCTGAGCCACAATTCAGGCCAGCGGTCGTGGAAGCCGTGGCGAGAGGGACATCCGTTCGAATGGGAACACTGGACCCCCTCGGGACGAACATTAAACTGGGTAAAACAAGCTATTCAGCGTTTTTAAGCCAATTAGCCAACCAGTATGCGAGCTGCCTGAAAGGAGATTAACGAGGAAGTGAATACGTGCAACAGATAGCCCGCTCTGTCGCCCTGGCATTTAATAATCTGCCCCGACCCCACCGCGTTATGCTGGGGTCACTTACCGTTCTGACACTGGCCGTCGCCGTATGGCGGCCCTATGTTTACCACCCAGAATCCGCACCAATCGTTAAAACTATTGAACTGGAGAAAAGCGAGATTCGTTCCCTCTTACCGGAGGCCAGCGAACCCATCGATCAGGCCGCGCAGGAAGATGAAGCTATTCCTCAGGATGAGCTGGACGATAAAACCGCAGGCGAAGTCGGCGTCCATGAATACGTCGTCTCCACAGGCGATACGTTAAGCAGCATTCTGAATCAGTACGGCATCGATATGAGCGATATTAGCCGACTTGCCGCTTCTGATAAGGAGCTGCGCAATCTGAAAATTGGCCAACAGCTTTCCTGGACACTGACTGCCGATGGCGATTTACAGCGTCTGACATGGGAAGTCTCCCGCCGTGAAACGCGTACCTACGATCGCACTGCCAACGGTTTTTAAAATGAGCAGTGAAATGCAGCAGGGGGACTGGGTTAACAGTCTGCTGAAAGGCACGGTAGGGGGGAGCTTTGTCGCCAGCGCGAAAGAGGCAGGTTTAACCAGCAGCGAAATCAGCGCAGTGATAAAAGCCATGCAGTGGCAGATGGATTTTCGCAAGCTGAAAAAGGGCGATGAATTTTCGGTTCTGATGTCGCGCGAGATGCTGGATGGCAAGCGTGAACAGAGTCAGTTGTTGGGCGTGCGGATGCGTTCCGATGGTAAAGATTACTACGCCATTCGCGCCGCTGACGGTAAATTCTATGACCGTAACGGTGTTGGCCTGGCGAAAGGCTTTTTACGCTTCCCGACCGCCAAACAGTTCCGCATCTCCTCCAATTTCAATCCGCGCCGTCTGAACCCGGTTACTGGACGCGTTGCGCCGCATCGTGGCGTTGACTTTGCGATGCCGCAGGGTACGCCGGTGCTGTCGGTGGGGGATGGCGAGGTCGTGGTCGCTAAACGTAGCGGCGCTGCCGGTTACTACATTGCGATTCGTCATGGACGCACCTACACCACACGTTACATGCACTTGCGTAAGCTGCTGGTGAAACCGGGGCAAAAAGTGAAACGTGGCGATCGTATTGCGCTTTCTGGTAACACCGGGCGTTCCACAGGGCCGCATCTGCATTATGAGGTATGGATCAACCAGCAAGCCGTTAACCCTCTGACAGCAAAATTGCCGCGCACGGAAGGTCTGACGGGGTCAGATCGTCGTGAATACCTGGCACAGGTGAAAGAGGTTCTGCCACAACTGCGCTTCGATTAACAAATGCGCTGACAGAGCCGGTACGCGATGTGTGCCGGCTTTTTTGTTTTGTGTGAGACGCAGACGTCGCTACACTATTCACAATTCCTTTTCGCGTCAGCAGACCCTGGAAAAGCATGGAAACCAAAAAAAATAATAGTGAGTATATCCCTGAATTCGAAAAATCCTTTCGCTATCCGCAGTATTGGGGCGCCTGGTTGGGCGCGGCGGCAATGGCGGGGATCGCATTAACACCGGCATCATTCCGTGACCCTTTGCTGGCGACGCTGGGGCGCTTTGCCGGACGGCTGGGGAAGAGTTCTCGTCGCCGGGCGCTAATTAATCTGTCTTTGTGCTTTCCGCAGCGTAGCGAAGCTGAGCGCGAAGCGATTGTCGATGAGATGTTCGCCACCGCGCCACAGGCAATGGCGATGATGGCTGAGTTGGCGATGCGCGGTCCGAAAAAAATTCAACAGCGTGTTGACTGGGAAGGTCTGGAAATCATTGAGGAGATGCGTCGTAACGACGAAAAAGTCATTTTTCTCGTACCGCATGGCTGGGGCGTCGACATTCCAGCCATGCTGATGGCCTCTCAGGGGCAAAAAATGGCGGCGATGTTTCATAATCAGGGTAATCCGGTTTTTGACTATATCTGGAACACAGTGCGTCGGCGTTTCGGCGGACGTTTGCATGCGCGTAATGACGGGATTAAACCCTTTATTCAGTCTGTTCGTCAGGGCTACTGGGGTTACTACCTGCCGGACCAGGATCACGGCCCGGAGCATAGTGAATTCGTTGATTTCTTTGCGACATACAAAGCGACGCTGCCTGCAATTGGTCGGCTGATGAAAGTGTGCCGCGCACGCGTGATACCGCTTTTCCCGGTGTATAATGGTAAAACGCATCGCCTGACTATCCAGATTCGCCCGCCAATGGACGATCTGCTCACGGCTGACGACCACACTATCGCCAGACGGATGAACGAAGAGGTCGAAATTTTTGTCGGCCCGCATCCGGAACAGTACACCTGGATCCTGAAGCTGCTCAAAACCCGCAAGCCAGGCGAGATTCAGCCGTATAAGCGTAAAGATCTTTATCCCATCAAATAAATAAAGCCTCTCGTAAGAGAGGCTTTATGCTGACAAACCCTGTACTACCTGATGAACAGGTGTGGGGGAGTTTTACTCCACGGTCAAAATACGCGTGGTATTGGTTGAGCCGACGGTGCTCATGACATCGCCCTGAGTCACGATAACCAGGTCGCCGGAAACCAGATACCCTTTATCGCGCAGCAGATTAACAGCTTCATGTGCCGCGACAACGCCATCAGCCGCGCTGTCAAAATGCACCGGCGTTACTCCGCGGTAGAGCGCGGTCAGGTTCAGCGTGCGTTCATGGCGCGACATGGCGAAAATCGGCAGGCCGGAGCTGATACGGGAAGTCATTAGCGCGGTACGACCGGATTCCGTCATGGTGATGATCGCGGTAACGCCTTTCAGATGGTTTGCCGCATACATCGCAGACATGGCAATAGCTTCTTCAACGTTGTCGAACTGCACGTCGAGACGGTGTTTAGACACATTGATGCTGGGGATTTTTTCTGCGCCCAGGCAGACGCGCGCCATTGCGGCAACGGTTTCAGAAGGATACTGACCGGCTGCGGTTTCGGCAGACAGCATAACTGCATCCGTACCATCCAGGACGGCGTTCGCCACGTCCATCACTTCCGCGCGGGTCGGCATCGGGTTGGTGATCATCGACTCCATCATCTGCGTTGCGGTGATGACTGCGCGGTTTAGCTGACGCGCACGGCGAATCAGCGCCTTCTGGATACCAACCAGCTCCGGATCGCCGATTTCAACGCCCAGGTCGCCACGTGCGACCATCACAACGTCAGAGGCCAGAATGATATCGTCCATGGCGTTTTGGTCGCATACCGCTTCGGCGCGTTCGACTTTAGCCACAATTTTCGCGTCGCAGCCGGCGTCGCGTGCCAGGCGGCGTGCATAGTTCAGATCTTCGCCGCAGCGCGGGAAGGAGACGGCCAGATAGTCAACGCCTATCAGCGCAGCGGTTTGAATATCGGCTTTGTCTTTTTCGGTCAGCGCTTCAGCAGAAAGCCCGCCGCCGAGCTTGTTAATGCCTTTATTGTTAGACAGCGGGCCGCCGACGGTGACCTCGGTAAAGACTTTCATGCCCTGGACTTCAAGCACTTTCAACTGCACTCGACCATCGTCAAGCAGCAGGATATCGCCAGGAACGACGTCTGCCGGCAACCCTTTGTAATCAATGCCAACTTTTTCTTTGTCGCCTTCGCCTTTACCCAGGTTAGCGTCTAACAGAAATTTGTCCCCGATGTTGAGGAACACTTTGCCTTCTTTAAAAGTGGAAACGCGAATTTTTGGCCCTTGCAGGTCGCCTAAAATAGCCACATGACGCCCCAGTTTGGCGGCAATCTCACGGACTTTATCAGCACGCATTTTATGATCTTCCGGCGAGCCGTGAGAGAAGTTCATACGTACGACGTTTGCGCCTGCGGCGATAACCTTCTCAAGGTTGTTATCGCGGTCAGTTGCCGGGCCTAACGTGGTAACGATTTTGGTTCTGCGAAGCCTTCTGGACATGTAATACTCCGTTGACTAAAACAACTTGGTGTTGCGTGAACATTGATCCGGTCGTTCTGAAACCTTAACGGTATAAAATGAACGTCCCCGGATGACGAAAAGGGTAGAACATTGTTATTGCTTTTAGCGGTCATCACTCTTGATGAGTAATTCTTTATCAAAACGCGATTCCTTGAGCGCTTCCTTGACACGCTTCAAGTTATCTCTGAATTTTGCCCCGCGACGCAAAGTAAATCCCGTTGCCAGCACATCTATCACGGTCAGCTGAGCAAGTCGAGAGACCATGGGCATATAAATGTCTGTATCTTCCGGTACGTCGAGGGTAATGGCCAGCGTGGCTTCGCGCGCCAGCGGCGTTCCAGCAGAGGTCAGGGCGATCACCATCGCATCGTTTTCCCGCGCCAACTGCGCCAGTTCCACCAGGCTCTTGGTTCTGCCAGTATGCGAAATGAGCACGACGACGTCATCATCGCTACAATTCATACAACTCATCCGTTGCAGCACGACATCGTCGGAGTAAATCACCGGTACATTGAAGCGAAAAAACTTATTCATGGCGTCATGCGCGACCGCGGCGGAGGAACCAAGACCAAAAAAGGCGATTTTTTTTGCCTGGGTCAGTAAATCGACGGCGCGGTTGACGGCCGATTTATCCAGCGACTGGCGAACATGGTCGAGACTGGCCATAGCGGACTCGAAGATTTTCCCTGTATAGGCCTCGACGCTGTCATCTTCATCCACATTGCGATTAACATAGGGGGTGCCATTTGCCAGACTTTGCGCCAGATGCAGTTTAAAATCAGGAAAGCCGCGGGTATTCATGCTGCGACAGAAACGGTTCACCGTTGGCTCGCTAACGTTAGCTTCCTGGGCCAGCATGGCAATGCTTAAGTGAATCGATCTGCCTGGGGCGGCCAGAATAACATCGGCGACTTTTCGTTCAGATTTGCTTAAATGTTCCAGTTGAGACTGGACTTTTTCCAGCATATTCATGATTAGCAAGGCTCATGGGTATTAGCGATTTCAATGATGCGCGAAACCGAGCGCGTTTTGTTAGATATTACTCCTGGCGACTGCCGAAGGGGGCAAAATGGCTAAAAAAGGTGTCGTTTTTTTTCATTACATGACCGACGTCGGATTTTAAGTTCCAGCTTGTGTGGAAAAGCGACAATTTTATTAGGCGTTTTGCCGATTATATTGCCAATCAAACGCCGGTTCATCCATGAAAAGGCGTTTACGGTTTCCGTAATCTCGTAAAAGCAGTACAGTGCTGTGTAATAAAATTACAACGATATCCTGGCTAAAGTACCAGGAGATTAACTAAGGAGAATGACATGGCGGTAACGCAAACAGCCCAGGCATGTGACCTGGTCATTTTCGGCGCGAAAGGCGACCTGGCGCGCCGGAAATTGCTGCCTTCCCTGTATCAGCTTGAGAAAGCCGGCCAGATCCATCCGGATACCCGTATCATTGGGGTGGGGCGCGCGGACTGGGATAAAGAAGCCTATACCCACGTTGTGCGTGAAGCGCTGGAAACCTTCATGAAGGAAAAAATTGATGAAGGCTTGTGGGATACGCTGAGCGGCCGTCTGGATTTTTGTAATCTTGACGTTAACGATACGCCTGCGTTTAGTCGCCTGGGCGATATGCTGGATCAAAAAAACCGCACCACCATTAACTATTTTGCGATGCCGCCCAGCACCTTTGGCGCTATTTGCAAAGGGTTAGGAGAGGCTAAACTCAACGCCAAACCGGCGCGCGTCGTGATGGAAAAACCGTTGGGTACTTCGCTGGCGACCTCGCGTGAGATTAACGATCGGGTCGGCGAATACTTTGAAGAGTGCCAGGTATATCGTATCGACCACTATCTGGGTAAAGAAACGGTTCTCAACCTGCTGGCGCTGCGTTTTGCCAACTCGTTATTCGTTAATAACTGGGATAACCGCACTATCGATCACGTTGAGATTACCGTAGCGGAAGAGGTGGGGATTGAAGGGCGCTGGGGATACTTTGACCAGGCCGGTCAGATGCGCGATATGATCCAGAACCACTTGCTGCAAATTCTCTGCATGATTGCCATGTCACCGCCGTCTGACCTGAGCGCCGACAGTATTCGCGATGAAAAAGTCAAAGTGTTGAAATCGCTGCGCCGTATTGATCGCTCCAACGTGCGTGAAAAAACGGTTCGTGGTCAATACACCGCTGGCTTTGCGCAGGGTCAAAAGGTGCCGGGCTATCTGGAAGAAGAGGGCGCGAATAAAAGCAGCAACACCGAAACGTTCGTCGCGATCCGCGTCGACATCGATAACTGGCGTTGGGCGGGCGTGCCATTCTATCTGCGTACCGGCAAGCGTCTGCCAACCAAGTGCTCTGAAGTCGTGGTTTATTTCAAGACGCCTGAACTGAATCTCTTTAAAGAGTCCTGGCAAGACCTGCCGCAGAACAAATTGACGATTCGCCTGCAGCCGGATGAAGGCGTAGATATTCAGGTGCTTAACAAAGTACCGGGGCTGGATCATAAGCATAATCTGCAGATCACGAAGCTTGATCTGAGCTACTCCGAGACCTTTAATCAGACGCACCTGGCGGATGCATATGAACGCCTGCTGCTGGAAACGATGCGTGGCATTCAGGCGCTGTTTGTCCGCCGTGACGAAGTGGAAGAAGCCTGGAAGTGGGTGGACTCCATTACCGAAGCATGGGCGATGGACAACGACGCGCCGAAGCCGTATCAGGCGGGCACCTGGGGACCGGTAGCGTCCGTGGCGATGATTACCCGTGACGGTCGTTCGTGGAATGAGTTTGAGTAAATTTGCCACTCACTCTTAGGTGGTATTTTACCGGTAACATGATCTAACACAGATTGTAGAATCATTTTTGCACTTTTAAGCCTCGTGTGGATTCACCTGCGAGGCTTTTTTTATTACACTGCCTGAAACGATTTTGCCCCATTATCACCGGACATCATGTATTTCACTGTCTTTCCACATTGATGAAACGCATGGTAAACCCGGTAGCCGGACAGATAAATTTCAGGAGCCTCTATGAATCCTAATTTGTTACGCGTAACACAGCGTATTGTCGAACGCTCGCAGCAGACCCGAGAAGCCTATCTTGCCCGCATTGAGCAGGCGAAAACTGCCACGGTCCACCGATCTCAACTGGCCTGCGGCAACCTGGCGCATGGCTTCGCCGCCTGTCAGCCAGAGGACAAAGCCTCGCTGAAAAGTATGTTGCGCAATAATATCGCCATCATCACCTCCTATAATGACATGCTCTCTGCGCATCAACCGTATGAACATTATCCGGAAATTATTCGTCAGGCTCTGCATTCCGTGAATGCGGTTGGTCAGGTCGCAGGCGGCGTACCGGCAATGTGCGATGGCGTTACGCAAGGGCAGGATGGCATGGAGTTGTCATTACTCAGCCGCGAAGTGATAGCGATGTCGGCAGCAGTTGGCCTCTCTCACAATATGTTTGACGGCGCGTTATTCCTCGGCGTATGCGACAAAATAGTTCCGGGGCTGGCGATGGCCGCGCTCTCTTTTGGTCATTTACCCGCGATTTTTGTTCCGTCAGGCCCGATGGCGAGCGGCCTGCCGAATAAAGAAAAAGTCCGTATTCGTCAGCTATATGCGGAAGGAAAAGTTGACAGAATGGCGCTGCTGGAGTCGGAAGCCGCCTCTTACCATGCGCCGGGCACCTGTACATTTTACGGCACCGCCAACACCAACCAGATGGTGGTGGAGTTTATGGGAATGCAGTTGCCGGGTTCTTCGTTTGTGCATCCGGATGCGCCGCTGCGCGAGGCATTGACTGCCGCTGCCGCACGTCAGGTAACACGTCTTACCGGCAACGGCAATACGTGGATGCCGCTTGGTAAAATGATCGACGAAAAAGTCGTGGTGAACGGCATTGTCGCCTTGCTGGCTACCGGCGGCTCCACCAACCACACCATGCATCTGGTTGCAATGGCGCGCGCGGCGGGAATTCTGATCAACTGGGATGACTTCTCGGATTTGTCGGAAGTGGTTCCGTTGATGGCGCGCCTGTACCCGAACGGTCCGGCAGACATTAACCACTTCCAGGCGGCGGGCGGCGTACCGGTATTGATGCGTGAGCTGCTCAATGCCGGATTGCTGCACGAAGACGTTAATACTGTCGCAGGCTTCGGCCTGAAACGCTATACGCTGGAGCCCTGGCTCAACAACGGCGAGCTGGACTGGCGTGAAGGCGCGGAAAGGTCACTGGATAACGATGTCATTGCCTCTTTTGATAAGCCGTTCTCTCCTCACGGCGGTACTAAGGTGCTAAGCGGTAATCTGGGGCGCGCAGTAATGAAGACGTCTGCGGTACCGGTTGAAAACCAGATCATTGAAGCGCCTGCCATGGTATTTGAAAGTCAGCATGATGTGCTGCCTGCGTTTGACGCGGGCCTGCTTGACCGGGATTGTGTCGTTGTCGTGCGTCATCAGGGACCAAAAGCGAATGGAATGCCAGAATTACATAAACTCATGCCGCCACTTGGTGTATTATTGGACCGCCGTTTCAAAATCGCGTTAGTTACTGATGGACGACTTTCAGGCGCTTCGGGTAAAGTGCCTTCAGCTATCCACGTAACGCCGGAAGCCTACGATGGCGGCTTACTGGCAAAAGTGCGCGATGGCGACATCATTCGCGTGAATGGGCAGACAGGTGAGCTAACTCTGCTGGTCGACGAGGCGGAACTTGCCGCTCGTCAGCCTCATATTCCGGACCTGAGCGCGTCGCGCGTCGGAACGGGGCGTGAGTTGTTTGGCGCGCTGCGCGAAAAGCTGTCGGGTGCGGAGCAGGGCGCAACCTGTATCACTTTTTAAGATGACACACTAGTAATCAGGCGAGAGAAGAATTCCGATGAAAAACTGGAAAACAAGTGCAGAAGCAATCCTGACCACCGGCCCGGTTGTCCCGGTCATTGTAGTCAATAAACTGGAGCACGCGGTGCCGATGGCTAAAGCGCTGGTGGCCGGGGGCGTTCGCGTTCTGGAAGTGACTTTACGTACGGCCTGCGCGATGGATGCTATTCGCGCTATCGCTAAAGACGTGCCGGAAGCGATTGTCGGCGCCGGAACCGTTCTCAATCCGCAGCAGTTGGCGGAGGTGACGGAAGCGGGCGCGCAGTTTGCGATTAGCCCGGGACTGACTGAGCCACTGCTGAAAGCCGCGACGGCAGGCACTATCCCATTGATTCCCGGTATTAGCACCGTTTCTGAACTGATGTTGGGCATGGACTATGGTCTGAAAGAGTTCAAATTCTTCCCGGCGGAAGCGAATGGCGGCACTAAAGCGTTGCAGGCGATTGCCGGTCCGTTCTCTCAGGTACGTTTCTGCCCAACCGGCGGCATCTCTCCGGCAAACTATCGTGACTATCTGGCGCTGAAAAGCGTGTTGTGCATCGGCGGTTCCTGGCTGGTGCCGGCCGACGCGCTGGAAGCGGGTGATTACGATCGCATCACCAAACTGGCGCGCGAAGCGGTAGAAGGCGCGAAACAGTAAGCCGTTAAATGCCCGATGGCGCTTGCTTATCGGGCTTACGAGTGGCGATCAGGCAGGTCTGATAAAAATGCGCTAACGTCGCCATCAGGCGATGGCGCATTAGCCTTTTACCGTCACGCGGCTGGCGGCCTTTTTCGCTCTTATCACCGCTTCTTCAACGTTTTCACCTGTCGCTAACGCTACACCAAGACGACGGCTGCCGTCGATCTCAGGCTTACCAAACAGCCGTACCTGCACTCCGGCCCCTACCGCCGTGTGTACATTATCAAACGTCACATTTTGACTGGTAAGCTGTGGCAGAATCACGGCCGAGGCAGCGGGACCATACTGGCGAATAGCGCCTATGGGCATTCCCAGAAAGGCGCGCACATGCAGCGCAAACTCAGAAAGATCCTGAGAAATCAACGTCACCATTCCGGTATCGTGCGGGCGAGGGGAGACTTCGCTGAAAATGACTTCATCGCCACAGACGAAGAGTTCAACGCCGAACAGGCCATGCCCGCCTAACGCCAGTACCACATGACGCGCAATCTCTTGCGCCCGCTTCAGCGCCAGTTCGCTCATCTGCTGTGGCTGCCAGGATTCGCGATAGTCGCCATCTTGCTGACGATGACCGACTGGCGCGCAGAAATGCACGCCATCGACGGCGCTAACGGTGAGCAGCGTAATTTCAAAATCAAATTTAACCACGCCTTCCACAATCACGCGACCCGCGCCAGCGCGTCCGCCCTGTTGAGCATACTCCCATGCCTGCGCGAGCTGTTCGGCCGAGCGGATAAAGCTCTGGCCTTTGCCGGAAGAGCTCATGACCGGTTTGACGATGCAAGGAAAACCCACTGCGGCTACCGCATCATGAAAACTGGCCTCACTGTCGGCAAAGCGATACGTCGATGTCGGCAGACCTAATTCTTCTGCGGCCAGGCGACGGATCCCTTCGCGGTTCATCGTGAGCTGCGTTGCACGGGCGCAAGGCACGACATTCAGCCCTTCGTCCTCCAGCTCACGCAGCGTATCGGTGGCGATCGCTTCTATTTCCGGCACGATATAATGCGGTTTTTCCTCTGTAATCACATGACGTAGCGCCTCGCCGTCCAGCATATTAATGACGTGTGAACGGTGAGCCACATGCATGGCGGGAGCATCAGGATAGCGATCGACGGCGATAACCTCGATCCCCAGGCGTTGGCATTCAATCGCCACCTCTTTTCCCAATTCACCTGCCCCTAATAACATCACCCGCGTTGCTGCCGGACGCAGCGCAGTGCCTAATAGCGTCATATTTCTGTCCCCTTATTTACCTGCGCGCAGTATATACGAAACCGTTTGCGTATGTCTTTTCGAATAGATTGCTTTTCTTGCTGTCTTGCAATATACTGTATATAAACACAGGTAAATAAGGGGCTGCAAAATGGCGGTTGAAGTTAAATACGTAGTCATTCGCGAGGGTGAGGAGAAGATGTCATTTACCAGTAAAAAGGAAGCCGACGCCTGGGACAAAATGCTCGATACTGCCGATCTTCTTGATACCTGGCTTGAGCAGTCGCCAGTCGTGCTGGAAGATGGGCAGCGCGAAGCGTTGTCGCTGTGGCTGGCGGAACATAAAGAGGTGTTAAGCACTATCCTCAAAACCGGTAAGTTGCCTTCTCCGCAGGCGGTTGAAAAGGACGCCGCGAGTAAAACGAAAAAGCAAGCCGCCTGAACCCGTACTTGCGCTCCCGGCGTTTTGTACCATGCTTTTCCTGAATCATTGTGCTAAGGAGAAAAGTATGAATAAAAGAGGAGCGTTGTTGAGCCTGCTGCTGTTATCCGCCAGCGTATCGGCATTTGCGGCATCCACCGAGAGTAAATCGGTTAAGTTTCCGCAGTGTGAAGGGCTGGATGCCGCCGGGATTGCGGCAAGCGTGAAGCGCGACTACCAGCAAAATCGTATCGTGCGGTGGGCCGACGACCAAAAAAAGGTCGGGCAGGCTGACCCGGTCGCGTGGGTTAATGTGCAGGACGTTGTAGGTCAAAATGATAAATGGACAGTTCCGCTAACTGTTCGCGGTAAAAGTGCCGATATTCATTATCAGGTCATCGTTGATTGCAAAGCCGGCAAGGCGGAATATAAGCCCCGCTAGCAGCGTAATTTGCGCTTCTTTTGCCGGAACAGACAATTCCTGACATCCCCTCAGGTACGCTTGACCATTATTGGTCTGTATTTGAGGGGTACTATGGCTAACTGGTTGAACCAATTACAATCACTTCTTGGGCAAAAAGGCGCTTCCGCATCGTCTTCCGGTGAACAGGGGTTAAATAAACTGCTGGTTCCCGGCGCGCTGGGCGGTCTGGCTGGACTGTTGGTCGCCAATAAGTCTTCGCGCAAATTATTAACTAAATACGGTACTGGCGCTTTGCTGGTGGGCGGCGGCGCGGTGGCGGGTTCCGTATTGTGGAATAAGTACAAAGATAAAGTACGCGCTGCGCATCAGGGGGAGCCGCAATTCGGCAGCCAAAGTACGCCGCTGGATGTTCGCACTGAGCGACTGATCCTGGCGCTGGTTTTTGCTGCAAAAAGCGATGGGCATATTGATGCCAAAGAACGCGCGGCGATTGAGCACCAACTGCGAGAATCAGGTGTGGAAGAGCAAGGGCGTGTGTTTATCGAGAAAGCTATTGAGCAGCCGCTTGATCCACAACGTCTGGCACAGGGCGTTCGCAATGAAGAAGAAGCCCTGGAAATCTACTTTCTGAGCTGCGCCGCCATTGATATTGACCACTTTATGGAACGTAGCTATCTGAATGCGTTAGGAGACGCGCTAAAAATTCCTCAGGAGGTTCGGGACGGTATCGAGCAGGACTTGCAGCAGCAAAAACAGGCGCTACCAGGCTGATAACCTGCCATTAAGTAACATGTTTCGCTTGCATCATGCGTGTCTTTTGCCACCCTTATAGGATGGATAGTTTACGTCATTCATGTTGCAAGATAGCGGCGAAGCAGTGACATCTCAGGCGCTTACCGAGGTGAGCGGACCGGGACTACCATGCTAAGCCGCTGTGTTTGCCGCGTGATATATAACAAATATTAACGCAAAAGAAAAAACGACATGTTGCCAAAAGCCAATCGAATTCCCTATGCCATGACCGTACATGGCGATACGCGCATTGATAATTATTACTGGCTGCGAGATGACACTCGCTCGCAGCCGGAAGTCCTTGATTACCTGCATCAGGAAAATGAGTATGGCCGGAAGGTCATGTCCTCTCAGCAGGCGTTACAGGACCGCATTCTAAAAGAAATTATCGATCGTATCCCGCCCAGAGAAGTTTCCGCTCCGTATGTGAAAAATGGCTATCGCTACCGTTATATCTATGAACCCGGCTGCGAATATGCCATCTATCAACGACAATCGGCGTTAAGCGAAGAGTGGGATGTGTGGGAAACCTTGCTCGATGCGAACCAGCGGGCCGCGCACAGCGAATTTTATACGCTCGGTGGACTTGCCATTACGCCGGATAATACCATCATGGCGCTGGCAGAAGATTATTTATCCCGTCGTCAGTATGGGTTGCGTTTTCGTAACCTCGAAAGCGGTAACTGGTATCCGGAACTGCTGGATAACGTTGCGCCTGAATTTGTCTGGGCCAATGATTCCCTGACCCTTTACTATGTGCGTAAGCATAAGAAGACGCTGCTGCCCTATCAGGTTTGGCGGCACACGATTGGCACTCCGTCATCGCAAGATGAACTGGTATATGAGGAGAAAGACGATACCTTTTATGTCAGCCTGCATAAAACCACTTCGCAGCATTATGTGGTAATTCATCTTGCCAGCGCCACCACTAGCGAAGTGTTATTACTTGACGCGGAACTGGCCGATGCCGAGCCGTTTTCATTCTTACCGCGCCGCAAAGACCACGAATATAGTCTCGATCACTATCAACATAAGTTTTACCTGCGCTCTAACCGGAACGGTAAAAACTTTGGGTTGTACCGTACCCGCGTGCGCAATGAAAACGCCTGGGAAGAGCTGATCCCTCCGCGCGAGCATATTATGCTGGAAGGGTTTACCCTGTTTACCGACTGGTTAGTGGTCGAAGAGCGTCAACGGGGGCTTACCAGCCTGCGGCAAATTAACCGTAAAACCCGTGAAGTGATAGGCATTGCCTTTGACGATCCGGCTTACGTGACGTGGCTTGCCTATAATCCCGAACCTGAGACCTCCCGGCTGCGTTACGGCTATTCTTCAATGACGACGCCAGATACCTTGTTTGAACTGGATATGGATACCGGAGAACGACGGGTACTTAAACAGACGGAAGTGCCTGGGTTTGATTCTGGCTGTTATCAGAGCGAACACCTGTGGATCACCGCACGCGACGGCGTCGAAGTGCCAGTATCGCTGGTTTATCATCAGAAGTATTTTCGTAAAGGGCAAAATCCGCTTCTGGTTTACGGCTACGGATCTTACGGTTCCAGTATTGACGCCGACTTCAGCAGCAGCCGACTGAGCTTGCTGGATCGTGGCTTTGTTTACGCAATCGTACACGTTCGCGGCGGCGGTGAGCTGGGGCAGCAGTGGTATGAAGATGGCAAATTCCTCAAAAAGCGGAATACTTTTAATGACTATCTTGATGCCTGCGATGCCTTATTAAAACTGGGTTACGGTTCACCGTCGCTGTGTTATGGGATGGGCGGGAGCGCGGGCGGAATGCTAATGGGCGTCGCTATCAACGAACGCCCCGAGCTTTTCCACGGCGTTATTGCCCAGGTGCCCTTTGTTGATGTATTAACCACGATGCTGGATGAGTCGATCCCATTAACGACAGGAGAGTTTGAAGAGTGGGGGAACCCGCAGGATATTGAGTATTATAACTATATGAAAAGCTATAGTCCTTATGACAATGTCAAAGCGCAGGACTATCCGCACCTGCTGGTGACGACAGGATTGCACGATTCCCAGGTGCAATACTGGGAACCGGCGAAGTGGGTGGCAAAATTACGCGAGCTAAAAACAGACCAACGTCTGCTGCTGCTATGTACGGATATGGACTCCGGGCACGGTGGTAAGTCGGGGCGGTTTAAATCCTACGAAGGCGTCGCGCTGGAGTTCGCCTTTTTAATCGGCCTGGCGCAGGGAACCTTACATAGCGCATAGACGCGCGCGCCCGCAACGGCTTGTGAGCGTCGCGGATTATTGTGGTGTTCCATTGCTGCGTTGCTCGCCAGCCTGAACATCTTCCAGGTAGTGTTTAAGCGTCAGGCGAAGCTCCGGGCTCATATTATCAAGGTTATTAAAAAGCCAGCGTAAATAGCCCGGATCGCGTTTTGCGACTTCAGATACCGCCTTACCGCGGTATTTACCGAACGGGAAAGTGGTTAACAGCGCAGGACGGCCCGTAATATTGACCATCTCTTCCGCGGTCCAGCCGGTGGTTCGCATAATATCAATCAGCAAGGCGGCGGTGATATAGCAATCATAAAGCGCCCGGTGATGGTGCAGCCCCGGCGGCGTTTGTACGCTTAGTTTGCGCGATTTATAGAGCGCCATATTGCTGTATTTTATTCCCGGCCACAGGCGTCGCGACAACTTCATGGTACAGATCCACTCACCCGGCAATTCAGGTAATACGCGTCTGTCGAAACTGGCATTGTGAGCGACATACCACTCACTACCGTAATAAAGCGGTATGACATCTTCAATCCACGGCTTATCGGCGACCATGGCTTCGGTAATACGGTGTATCGCCATCGCCTGCGGCGTAATAGGGCGATCGGGGCGTATCAGGTGACTCATGGGATTGACAATGTTGCCATCAATGACATCAACAGAGGCTATCTCTACGATCCCGCCCTGCAGCCCGCAGGTTTCCGTGTCTATAATCCGCAACATGATTCATTCCTCACCGAAAGACCTTAGCGTAATGGAATGATATCGCCTTGCCAACCCTCCGTTGTGCGCAGGCCCGTCAGTAATAGCGAACCGCAATCGGCACGAACAATAAGTTGCCCGCTTTCATCCCACAGAGCGCTACTGCCACAGGCATTGGCCATCAGTACGGCCAGCGCATATTTATGCGAAAAACGTTGTAAGCGTGAGGTAGAAGCATGGAGTTCAGGTTCGTTAAGACATTGACTGGTTGTGAATAACGTAAATGAGGGATCTATATCACCACCTTCTGGCTGCTCATCCACCACGTTGATAGCGCTTCGCTGCCTGGCAATACAGGCGCCATGACTTTTGTGAAACATGAGCGGCGACGTTAGCCATGGGGCAAAAATAGCGATGCCTTTTACGAAGCGGCAGTTATGCTCAACAGGCATACCCACAATGATAGTCATATGGTGGGTATCGGCCGCATGAGTGAGCGGCTGTAAAAGCGCCTCGTCAGGCGGGGCGGGCAACGGTTTATTTCTTTCGTCGCAGCCTAATAGCGACAGCGACGGAAAGACCAGCAGTTCGCACTGTTGACGAGCCGCAAGCTCAATATATTCCAGATGATGGGCGACATGCTCCGCCGGCGAGGCGTTCAGGGGCGCATACTGCGCAGCAGCAATTTTCCAGGAGGACATAATGACTTCCTTTTCATAGTCTCTACAGCCATTCTAAAAAGCATAGGTTAAGAATCGCCTTATAATGTAGTCGTCTGCATGTAAGGAAGTGTAACGTTATATAATTTTTATTTAACTTTGGGTTCGTAAGGGAGTCGGGATAGTGATACGGAAGCCAGACGATGGGCAATCAGCCGCTCACGAAACCAGGCGCGTAGATGCTCTGGCTGCTCACGTTCGACCGCCTCCGCGACAACTGGCATATTGTAGCGCTCTTTAAATGCCACCCCGGCTGCCGCCAGGTCCACATTGACTTTATCCATTTCCGCTTGTTCAAGCTGAGCCAGATTTGTTTTCATACGTGTCTCCTTTTTTGTTGCCGCCAGAAGGTACTAAGAGTCCGCGGTAATGGCAAGATGGAAATGATGATCCTCGACTCATTGATTGTTGAAGGTTATTCTTTGTAACGTAGGTATAACAAAAAGGAATAGTGAATATGGCTTCTTCTGCACCATCGCGACGTTTAGCTTTACTGCTGCTGGCATCGACATTTGCGACGCCAGCGGCCTGGGCACATGCGCACCTGACGCATCAGTATCCAGCGGCGAATGCTGCCGTTACGGCCTCGCCACAGGCGCTGACCCTGAACTTTTCTGAAGGGATTGAGCCAGGGTTCAGCGGCGCAACCATTACTGGCCCTCAGCAAGAGCTCATCAAAACGCGCCCGGCAAAGCGAAATGAACAGGATAAAACGCAGTTGATTATCCCGCTTGAGCAGCCGTTAAAATCTGGCGCTTACACGGTAGACTGGCACGTTGTGTCGGTGGATGGACATAAAACAAAAGGGAAATACACCTTCAGCGTGAAATAAATGATGCTGACATTCGTCTGGATAACTCTCCGATTTATTCATTTTGCTAGTGTGATGCTGGTCTACGGCTGCGCGCTTTACGGCGCCTGGCTGGCACCCGCATCAATTCGTCGTTTAATGACGCGTCGATTTTTACATCTGCAACGACATGCCGCCGCCTGGAGCGTTATCAGCGCGGCTTTCATGCTGGCGATTCAGGGCGGGCTGATGGGCGGCGGCTGGCCCGATGTTTTTTCCGTCTCGGTGTGGGGCGCGGTACTGCAAACCCGCTTTGGCGCGGTCTGGATATGGCAAATTATCCTCGCGCTGGTCACGCTGGCGGTGGTAGTCATTGCGCCGATAAAAATGCAACGACGGCTTCTTATTCTCACCGTTGCTCAGTTTATCCTGCTGGCAGGCGTTGGACATGCGACGATGCGCGACTGTGTAGCGGGAACATTACAGCAGATTAACCATGCTCTGCATTTACTCTGTGCCGCTGCCTGGTTTGGTGGGCTGTTGCCAGTGGTTTATTGTATGCGCATGGCTCAGGGACGCTGGCGTCAACATGCTATTAGCGCCATGATGCGTTTTTCTCGTTATGGTCACTTTTTTGTGGCGGGCGTATTGCTCACAGGCATCGGCAACACGCTATTTATCACGGGATTTACGGCTATCTGGCAGACCACCTATGGACAGTTGCTTTTGTTAAAATGTGCGCTGGTGGTGCTTATGGTAGCAATTGCGCTGACGAATCGGTATGTTCTCGTACCACGTATGCGACAGGAAAATCCCCGGACTGACCTATGGTTTGTCAGGATGACGCAAATTGAATGGGGAGTTGGAGGCATAGTTCTGGCGATCGTCAGCCTGTTTGCAACCCTCGAACCTTTTTGATGGACTGGCATAACGAATGAAAAAAATACTCCTTCCGGCGCTTCTGCTGGCCACTTCGGGCGTAGCGTTGGCGGCGCCGCAGGTGATTACCGTAAGTCGTTTTGAAGTAGGAAAAGACAAGTGGGCGTTTAATCGGGAAGAGGTCATGTTGACCTGTCGGCCTGGCCAGGCGCTCTATGTGATCAACCCCAGTACGCTGGTGCAGTATCCCTTGAATGCCATTGCCGAACAGCAAGTAGCGGAGGGTAAAACGCGCGCTCAGCCTATTGCCGTCATTCAAATCGATAACCCGGCGAAGCCCGGTGAGAAAATGAGTCTGGCGCCGTTTATCGAACGTGCGCAAAAGCTTTGTGATCCATCCAATAGCTGACTGATTTTTAATAAAAAACCGTAAACCTTCACGAAAAGGCTTACGGTTTTTTTTATCTCTGATGACAGACAAAACGCCAGGTTTTTTCAATCACCTTCGTCACAAACTGGAAAACCTGGCGTCGTCATCTATTCTTAAAGGGCAAGGCGATTTAGCCTGCATTAATGCCAACTTTTAGCGCACGGCTCTCTCCCAAGAGCCATTTCCCTGGACCGAATACAGGAATCGTATTCGGTCTCTTTTTATTTGGATTATAAATCAATGGGTTATGTGTTTCCCCTCGAAATTCCTCGAAATTTCCTCGAATTTCTGTATTCCGGTCTTTTTGGTTATATCACATCCAAATCCAGTTTAACATTTCTTTTACAACAAAATCAGAGCATCACGTAAGCTTTATTATCGCGTTCATCGAGATAGAGTTTCGTGGTGTTCTCTGAGGTGTGGCCCAGGAGTTTTTGAGCGAATTCCTCGCCGCGTTCGTCTTTGTACAACCGCCCGAGCAGGCTACGGATCTCGTGAAATGTCGGTGGGTTATCACTACATTTAACGTCTGAAATTTTTCTGGCTTTTACAAATTTCTTTGTCAGCCCATCCGGGTGAATATTCCCGGTCGGGCTATTTTTCCTGATTCCGGCACTGATCATGAAATCGGTTCTGCTTACCAGTCGGCAGCGATCGATAACCGTCCCGGACGTAACCCTGGCGCCTGAAGGGTCAGGGATAGGGGGAATGCTATCTTCATTCCTGTCTTTATCTGGGTTACGTGTAGGCGACCATCAACGATATCACTAAACTTCATGTTAACTATATCCTCACGGCGTTGGCCAGTAACCAGCGCGAGATCCATCGCAAGAGGAAACCACACAGGCAGATATTCTGCCGCCGTCCGCGTGGCGTTATATGTTTCCAGTTGCAGGCGTTCCCTGGCAACCTTAATTTCAGGTGCTCGGGTTGGCTCAACCGGATTCGTTGTTATTCTTCCTTCCACAATTGCCTCACGAAACATGTCAGATAGTACAGACCTCATCGCCCCCGCCATCGTGTTTTTACCTTCCGCGATCCATGACTCCAGAAACTCAGCAATATGCCGCGTGGTCACCTCTGCCAGTATCATTTCCCCCATTTTTTCGCGTACGGTCGCTAATTGATTACTGCGAATCTTGTAGGTATTAACCGACAGACTCCGGCGCTGTAATAAAACCTCATAGCGATCAATCCATGCGGACACAGTGAATGAGTCCGTTCCTTTTAGTTTTTCAATAAGCGCCACTGGCGTGTGGTTTTGCGCTATGAAGTTGTTTGCCTCTATGGCCTGTGTGATAGCGTCCCTGCGGGCGATCTGACCGAGCGGAAATTCCTTGTCAGTTACCGGGTTACGCCAGAAAAAAGATTTACTGGCCTTACGGTAGGTGAGGTACCTCGGAAGGTTAGCATCGTACTTTTTTCGACTCACTGATCAACTTCTCCAGCAATGCACTCGGTTAACGCCATCGCACTCTGGTACATGAATTCGATTTTCCGGCCTTCTTCTGCCGTAAGCACGGTTATTCCTGTCCGGGCGCACTCTTCCAGAAAGGTTTTCTCTTCTTCTTTTCCTGCACTGGTACGGCGGTTAAACTCCGGTGCGATGATGAAGCGTTTACTGAATTCCTCTGGTTCCAGTACCCGGCAGTGAAAAGCCGTTCCTGTATCAAGAGACTTTGTTTTCTCCGTGTCCACGGGGGCATTTTTGCGCCAAAGATAAATTGCTGGTGTATCTTCGATATCATCAAGCTGTGATTTACTGACGCCGGGGCCAGCGTGATACGCCTCGTTAGGGATGTCATAGTAAATACCTGGCTGTATATCATCAGGGACAGTGAAATTTCCGTTTTCTACGGGATCTGCCGCTTCGCCAGCTTCATCACCGCCAGTACCTGATCCACCGTCCGTTGTAATTTCCTGCCCTGTATCGCCAGCCGTTTCCTGCTGGTTACTCTCTTTCGGCGTTTCTCCATCTCTTTCTGTTCTGGCTTCCGTTTTTTCGGTCTGGTTTGAGGGGGGCGGGAATAGCGCTGATACATCGAAAGTCCCGTCCGCGTTTCTGGTGACAGCCTCCGGCTCTGCTGCTGGTTGTTTTTCCTCTGGCACCACTTCTTCTTTTTCACCCTGATTTGAGGCGCTGTAATTGTTATGAACCCACTTCGGATCGTCCGGGTCGCTGATGCCTTCGACATATTCACCGCGCGCGGCTGCCAGTTGTTTACCAACATCAACCGGGTTTTTGGGTGGAATGTTTTTACGTGCTTCGTGCAGTTCTGCCCGTATTTTCTGGTAGCCTGCTTCTGTCTGGCTTACAGGTGGCTCATTCTCCAGCGGCTGCGGGTCCGGATGATGTTCAGTTGTGTCCTGTTCCACTGCTTCAGGCGTTGCTGGTTCATCTGCCAGTTCGCCTGTCGGTTGCGGTTTTTCTTCATCACACTGAAATCTCCCTGTCTCAATATCCCGCAGACATTTGCCCGCCTGACGAAGCCTTGCTGCATTTTCTTCATGGGTTGTTGGGGTGTTATCAGGCACATATTCGTACCAGTCCGGATCGCGAACACCATGAACGGCAAGAAAGCTTTCGCACCACGTCCGGCGAAGATCAGGATTACCGGGCTGGAGAATTACACCAGATGTGGCGTTGCACTGAAACTGGATCTAGTGGCGAATCCAGGACAGCTTGAGCTAGAACGTCATGCCGCCCGATCCGCAGCGTGGCTTTTTGTGACTAAAGGGTGTCTGAAATATTCCGGCGACCTGGTACGTGTTACGCAGATCATCAACGGAGGGTAGAACGGCATCGGTGATCGGCGGGAGCGCTTTGAGAAAGCAAAATCGGTGCTGGTATGAATCTGTTATCTGCTCTTCTGAAAAGATACTGGTTGCAGCTGGTGTTTATTTTGCTGATGGCTGGTGCGTTTATCGCCGGTAATGTCTGGAGTGACAGGGGCTGGCAAAAAAAATGGGCAGATCGCGACAGCGCTGAATCCTCTCAGGAAGTCAACGCCCAGACCGCCGCCCGTATTATTGAACAGGGCCGCGTTATTGCCCGTGATGAGGCTGTGAAAGATGCACAAGCGCAAGCCGCTAAATCTGCTGCCACTGCTGCTGGCCTGTCTGCCACTGTTAGCCAGCTGCGTACCGAAGCAAAAAAACTTGCCACCCGCCTGGACGCCGCAAAGCACACCGCAAATCTTGCCGCTGCCGTCAGAAGCAAAACAACCAACGCCGACGCCAGAATGCTTGCCAACATGCTCGGAGATATTGCAGAAGAAGCTAAACATTATGCTGGAATCGCTGACGAGCGCTACCGGGCAGGAATGACGTGTGAACGAGTATATGATTCGGTGAGAGAGTCAAATAATTACAGGAGGCATTGAAACTCCCCCTGTAATATTGCTGTAAAAAAGTGACTACATATCATCAGATGGAACCAGATGAATAAGAACAGGTTTTTCACCAGATGAAACTGATAAGTACTCACTCAGTTTTGATATAGCTGAAATCTGTCTGAATAACCTGTCGGGGTGCTGGAATAACAACTTTCCGGAAATTCTTCTGCAATGGATTTTACTTTTAGTGACCATTCGCCTCCTTATCTGTAGAGGTGGGTAACGAATTTAAAAAGCATTCTGCTTACTTAGGGGGAACATCCTGATGACTGCCTGCAATATTGCAAATTCCATTTTCATTGTATGAACCACCTGAATCAAGGCACTCATCTTCCATCAGGAATTTCTGCGACCACATACCTGCATAAAAAACGATAATAATGGCTACGATAATAGTGATGATGTTTTTCATTTATGTTCTCTGTGTGTTGTTATTGAAAATGATAATCAATATCGCAAAATGAAATAAATAATCATTAAGTGGTAGTTGTTGAGAATTGTTCGCATTTTAAAAAGGTACTCCCGGCGGGGCGGTCTGCCACGGGACGGCAGCGGCGCGGGATTTGGCGCATTTTTGATTTTTCATCCATCATCATCATGTTGTAACCCTCTGTTTTAATGTCATTTATTTTTAAAAGATGATGGTTTGTATGTTTTTTGTTCATTATATTTTGTTTTTCCGGGGGAGGGCGCGCTAAGAAACAGCCCCAGAGGTAAAAATGGATGACGAACCGAAGAACCTCAAATGCAATATCTGTCAGCTTGCCGCTATTACAGGGTTACATCGACAGACGGTTGTCAGTCGCCTCTCGGGCGTTCCCCTGGCACTGGGAAGCAATGAAAAAAACAAGCTGTATCTCCTGACGGATGTGATCCGCGTACTGATGGAAACGCCCGTTTCCCAGGCTGCTGAACATCAGGACCCGAATAAAATGACTCCAAAAGAGCGTAAGAACTGGTTTGACTCCGAAAAGGGGCGTTAATACCACGGGCAATCATACTAACCAGTTCGGCGGTTATATCAATTCATACTGGGGAGATTCCAATCACACTTCATTTCAGCCTGGAGGTGGTGCGTGGACACAGGCCGCTGGCGACCATGCGCATACAGTTTATATCGGAGGACACGAGCACACCATGTATATCGGTCCACACGGACACGTCGTTATTGTGGACGCAGACGGTAATGCGGAAACCTTTGGTCTTATGGACGGCGGTGTGGATGCTGCTATTACGGCATATTTCGGGTCGCAATTACAGGAACGGGTACAGCAAAATATCATCCGTGAATACCTGGGGGAACAGCCCGTCGGCACCGCCTTTGTTATTGAAACGGGTAACAGTAAACATCCGTGGCTGGTTCACGCCCCGACGATGCGCGTTCCGCTGATTATTGACGGCACCGACGCGGTTTATAATGCAACACGGGCTGCGTTACTGGCAATTTTTCAGCACAATAAAAGCGCCGGAGAAGACCGGAAAATTACATCTGTTGCATTACCTGCAATGGGGGCCGGATGTAGTCAGGTCCCCCCGGACAGCGTCGCCCGGCAAATTGTACTGATATAGCCCCTCCTGATATTCCCTCCAGTCATATTGCTGTTTTTGACGCTGAAACCCAAACGTGGAGTCTGCAGGAGGATCACCGCGGCGAGACGGTTTACGACACAACAACCGGCAATCAGGTTTATATCTCCGATCTTGGTCCGCTACCTGAAAACGTCACATCAGTTTCACCAGGTGGTGGATACAAAAAATGGGATAGTAAGGCTCAGGTCTGGATGAATGATGAAGCTGCGGAGGCCGCAGCCAGACTTCGTGAAGCTGAAGGAACGAAAAACAGACTCCTGCAAATAGCGTCTGAAAAAATCGCGCCGTTACAGGATGCAGTGGATCTGGACGAAGCAACCAATAAAGAAAAAGCTTCTCTTCTGGCATGGAGAAAGTACCGGGTACAGGTAAACCGTGTTGATACTTTAAAGCCTGTCTGGCCGGAGAAACCAGCCAGTAGTTTATAATTTGTCAGGAAAGCTCAGGCCTTATTTATAGCAAATATGAAGAAGGCCTGTCTGTCATAACTGATATGGTTACTGGCTAGTATATTAAATTTATACTCAATAACCTCTACACATTTTAAACCAATCTTCAGGGAAGGGTATGTCAGCAGGCCAAAGATTACACCACTTTTTAGGCATTGGCTTAATTGTTTCTTCTTTTTTATGATCTTGAGAGTCTGCCGCTATTGTAAGAGCAGGATATAGTGAAGATGGTAATATTAAAACCATTGCTAAAAATACGCTCTTAACGTGTTTCATAATATGTTACCTGTTAAATTGTGGCACACTATCCTTACGGTTACAGCATCCTTTACTATAGATATTAAACGTTATTCATTACCATCAGGTGAGTAAATAAAAACCATTTATAAAATATTTAACTTAAATAAAAATGATAAGCACTATTATATTTTATTTTCAATGTAAATTAATTCATGTGAAAGTGATTTCATGTGCTTTTCAAATCATGGCTGGCTCGCCCCCCCGGAGGAACAGGCCAGTTAACATTATCAGGAGTGTTTGTTATATCAATCAACTTCACTTCATTCTTGTAAGCCAGCCACACTAACAGTTTTGATCCTGTCCGCGTATTGTGGACACAGCCTTAAGCGAGGTTCTGGTTTTCAAATTGTTCCGGACTGAGATCGCCGCAGACCCTGACGGCGCAGGCTGGACGCTATCGTTTTACATTGTACCTCAGGCCCCAGGTTCACTTCCTCAGCAAGCCGGGGCGCACCGTAGCGCTGTTTTGCCTGGCTGAAGGCCTCTCTTACGACATTGTCGCAGTCCTGCCGGAACTGCTGGTGGGTATTAACGACTGTGCGCCTTGTTTGCCAATGCTGATTCCGAACTCATTCTGACGCCGATCATCATTGCCGTGTTGCTCGCGCCTGGTTTCAGCAAAAGCATCACGCTGGCTATACTGCATTTCTTTTTCCGCAGGGAGAAGTGAGTAAATGAGTATGGAAATGGAGCTTTCCCTATAATGGCGGCTGAAGCTAATCATGTGTCATGTATTGCAAATGTCAAAGTGATAAATAAAGTTTCAACAGTCTGTGTTTTTTGTGAAAACGACTCGCTATGGTGCTAACCCTATATGAGGGACTGATTGAACATCAGATATATCCCGGCTTATTTAATACTGCTTACTATGAGGGTAATAATATGTTTTCTCTTATTAAATCACGTAAAGTTTTATTCATAACAACTCTTCTGCTGTTGTCGTCTGCTGCCCAGCCAGCGTTCGCCCTGGGGTGCATAGGGGACGGTGGTAGCACTGCAGACTGTGCAAAGGTTTGTTCCCTGGCGGCAGGTTTAGGCCCCCTTGGTTTTCTCATTTGCTTTTAATGTCTGAGAAGTGGTTATCCTGAGCCTTTCCTGTTTATGTAGTGAAGGCCCATTCAGAACATCATATTTAAACTATCGTAATGGAACAAAGAGTTACTTTAAAGAGCTCTTTGTTCCTTTAACCTTGATGAAGATGGTCAACGTCTGGTTTTACAATTCCGATACAGACAGGATAGCTGCTATGTTTGTTATCCATCTGGAAGGTTATCTGGCTGATAGCATCAAGCGCCAGTTGTGGCCATTGCTTCAGTACCCAGTGTGCCACTACAGTGCCACTACATTGGTGGCCGTAGGCCAAAACTGACGCCGGAGCAATGGGCGCAGGCAGGGTGCCTTATTAGGGCAGGAGTACCGCGACAGCAGGTAGCGATTATTTATGATGCGGGGCTGTCGACGCTGTACAGAAAATTCCCTGTGTTAGGGTAATGCAGTATTTCCACGGTGCGTGTCACCCAAATATGTGCTGTGGATGCTTTATAGGGTGTGCCATTTTTGTGTCAAACATGGTGAAGCATCATTCTATTTCTAACATCATGTGCCATTGAGTTGAGCAATGTGAATGCGGAAATGTGTATTTGAAACAGTTAGTTAAATGTGATTCTACTAATTCGTAATGCGAAGGTTGTAAGTTTGATTCCTGTTATTGCACCATTTCAATGTTACTGTAACTTCCGTGTTAGCCCATAACCATTCCAATTTCTACAAGACAGGTGCTTGCTATATATTTACCAATCAGTCATTTTAAAAGTGCGTCATAACCGGAGAACTGTCCTCGTCAGGAGGCCTGTTTTCAGAAAGGCAATATCCTGTTTCGACAGTAGCTTATATGTGACCGGGTTAGCAGCACATTCCCCTCCGCCACATTCCAGTACAGTTGAAATAAGGTTGGCAGTAATCAGTAACAGAAATACCCAACCGACAATACTGAACAGGTCCGGATTAATATTCAATGAACGAAATGCAACATTCATGCTACTGATTGCCAGAATTACCGCAACAGCAATAATAATCAGAATTGACGTAATCAGAGTCCAGGTATAAAAGTGTAATCCAAAAAATGCAGAACCATATCCTGTGTCACCGGGCATGATATGCAAACATATTTGCCGGGATGCCATAACTCCGGTCAGGATACTGCCAATAATTACCATCCCGTAATGTATACCTCGTAGACCAAAGCAGAGGTTAAATAAAAAGCCAAAACCGGCGATGATGAGGCCAGCCCGCTGTAATAAGCAAATTGGACAAGGCAGCTCATGCCTGACGAGTTGATAATAAAAGGCTACAACCAGAGCAACGCATATCCCAAGGAGTCCCACCTGATTAATCAGATGAACAAATGTTGTTGGATGATGATGCGTTTTCATACACACTCCTTAAAATGACAAAGTCAGCGAATTGGTCATATGATGAATGAATGATAAAATAGTGACGATGAGTAAAACGGCCCACAGAATGTAGCTGACTTTATCCTTTCTGTTTATGGCCCCCATTGCCACGCTCAGTGCAAGCAAGAATGGCAGAAACATGAACATAAAAAACTCCTTTATTACATTGCTTGTTAATATAATCGCTGTGTATTACCGGGTCTGTATTCCAGCTTATCCACCCCGCATTAGCCGGAGCTCAGTAAGGCATCAGGAATTATTTGCGTTATTATAAATCGATTATTTCTGTGATGAGTTATCCCGGCATCTGCTGGTTACAGGGATATTTCTGGCGACAAGAGTAAGAAATATCCCGATAACTAATGGTATAATAAGCAGCATTGAGTGTGTTGAAATAGCATATCCATAAATATAGTCTTCCAGACTTACCGCAATCAGCGGAAAGATAAGAAATACAAGCGAAGCCTGGAAGGCATTAGCCTTTTGCTGAAGTGCAAAGTAGCACAGGATACCAAAAACTCCGGCAAAAGCCCCGAGATACAGGGTAGCTAATATTGAGTGTACTGAGAAGGTTGATACTTGTGGTCTTTCAAAAAACCATCCTGTCGCAGAAAGTATCAACCCAGCTAAAAGGCACGGGAGCGCATTAAATGTGATAACAGAGACAGTACAACTTCTTTTCTTACATTGTGTATATATTATGGCATGGATTAACACAGCAGAAATAAGCGCAGTGATACCCTGCCAGTGACTCTCTGTACTTGTATTCGTTTCTTCAAGAAGTATCCCCGTCAATGCAGTGATTGCGATAGTTAAGCCCGCGATCTGCATTAGTTTCGCTTTTTCATTTAGAAACAAAACCGATGCTATCAAAACGGCCACAGGCATATTCGCAAAGATAATGGCGGCAAGCCCAGAGTTGACATAGGTTTCACCATAAATCATTAGTGAGAAAGGAATGCAAAAATAAAAGATGCATATCACAAACTGGAATAATCGTTGTCCGGGAGGAAACAACAGTGTTTTTTTTCTTAACCATGCAATGATGATTAAAAAAGGTGCTGCAAACATAAACCTCATTCCGGTTGCAAACACTGGAGGGATAGTTTCAACGGCAATTCGCATAGCCAACCATGTAGTCCCCCAGGTCAGTGAAACTAGCATGAATAATATAGATATCGACACCTTGCGCATAAGATCCTCTCCATAAAAAAATTAATTTAGCTTTTTAGCTAGAAAAAAGTAGCGTTTTCTTCACTGTGTTTTTTATATGTTTAGATAAAAATTTCTCTATCAGAAGACGCAGTTATGCTGAATGAGAAAAATAAGAAATGACTCAGGCTGTTACCACACGATTGTCCACTGCGCCTGCAAGACCTGGCAGCAGTTGTCGATTTAATATCTATTCTTGCTGGAAGGCATAAAACGCCTTGAAGATGAAGAGCTCATCATTAGTCGGGTTCCCTTGTTGAGCCAGGACAAGTTTAACTTATCGCTTTCAACCCTTACGATGATAAATGATCTTATCCACCCATAATGGACATACGGCTAAATGAGTAAATTCTCAAGTAAGAGGCAGGATCTGTATCAACCAGTAAGAAGTCCCAGAAACAGCACCGGCTGAATTCCGTCAGGAAGCTCTGAAACTCGCTGAATGTACGTTGCCCGCGAACGATTGTTAATCAGCACCATAGCAAGAATAATTGTTTGTTTCACAAGGAGATAGAAAAAATTATAGCTTTGCGCCGGGGCGGAGGCAGGAGGAGATAACTCGTTGTAATAGAATGTTTTTATATGGCGACCTTAAGAGAAATGCTTTCGCCTCGGTATACCACCCGCTATGAAGACCTGCTTCAGGTTAAGTGGCAGGTTTATCCCGATTTTCACATTCCTGATGTGATGGAACCGCAGCCACATCGTATGCAAGAACGGGCTGCGGCAAGCTGGCGATCGTTCGATAGTGCGAGTATTAAATGGTTGCCAGTTGCGGCGGATTTTACTTGTTCAAGATGACTAACCAATGTGTTTAATCTGAAACTAGCCACCTGTTCAACTTTTCGCGATCGCTTTTGTTGGCATCACTATTAAGCCTTCGTCTACATGGGTATTTGGCATCGGGAGTCCCCCAATGGAAGAGGGGAGTGATGTATTGGGCAGTTAATTTGTTTGGTCGGGGGATTTATACGCAAAATGGCCCGAAATATGTCCGAATTAAAACGTAAAAAGTGATAACTAATTGAATCTCTTAGAATCAAAAATAACATATTCCAATCGTTAAAATCACTTAATTAATATGTAACATTATGAATTTAAAGAGAAATATCAATGGTTGCCGTAAGCGGGAATCGTATTCGGTCTCTTTTTTTCTTATGAATCAGTTGCTTATGATCAATAATCCGAAATTTCCGAAATTTTCCAGAATTTCTGTATTCCGGGCTTTTTCGTTATATCAGATCCAAATCCAGTTTAACATTTCTTTTACAGCAAAATCAGAGCATCACGTAAGCTTTATTATCGCGTTCATCGAGATAGAGTTTCGTGGTGTTCTCGGAAGTATGTCCCAGGAGTTTCTGAGCAAACCTTTCCCCGTAAGCATCTTTATACAGTCGTCCGGATAAGCTGCGGATCTCGTGAAATGTCGGTGGGTTATCACTAAATTTATTCCGGTAAATTTTCGTGCCGCAACAAATTTTTTAGTCAGGCTGTCAGGATGAATACTCCCGTTCGGGCTGTTTTTGCGTATGCCAGCGCTGATAAGGTATTCGCTACGGCTTACCAGGCGGCAACGCTCAATCACCGTACTGAGGCGTAAACCCATGGAAGGAAGGTTGAGCGAAAGTGGCAGAGCGATTTTCATCCCGGTCTTGATTTGCTTAACGTACAGTCGCTCATCAATAATGTGACTGAACTGAGACTGTAAGTAAAATTGTGTAATTGCCTGTTTTTGATATGTTCACTCCAGTAACGGAGACAGGCAATTATGGACGAAAAGAAACTCAAAGCCCTTGCGGTTGAACTGGCTAAAGGCCTTAAAACCGAAGCCGACCTCAATTCGTTTTCCCGTATGCTGGCGAAATTAACCGTCGAAACGGTGCTCAATGCGGAGCTGGCTGACCACCCCGGGTATGAGAAAAATGCGCCCAAAACAGGCTCAAACACCCGTAATGGCTACTCGTCCAAAACGGTGTTGTGCGATGACGGCGAGATAGAACTCAACACACCGCGTGAGCGTGAAAACACCTTCGAACCACAGCTGCTTAAGAAGCACCAGACGCGCATTACGCAGATGGACAGCCAAATTTTATCCCTCTACGCCAAAGGCATGACTACCCGCGAAATCGTCGCCACCTGCAAAGAGATGTACGACGCCAATGTGTCACCCTCGCTGATATCTAAAGTCACCGACGCCGTAAAAGAGCAGGTCACTGAGAGGCAAAACCGGCAACCGGATGCGCTGTACCCCATTGTTTATATGGACTGCATTGTCGTCAAAGTCCGTCAGAATGGCAGCGTAATCAACAAAGCCGTCTTCCTGGCACTGGGTATCAACACCGAAGGCCAGAAAGAGTTACTGGGTATGTGGCTGGCCGAAAATGAAGGCGAAAAGTTCTGGCTAAGCGTGCTGACGGAGCTGAAAACCCGGGGCGTTCAGGACATCCTGATTGCCTGTGTGGATGGTCTGAATGGCTTCCCGGATGCGATAAACAGCGTCTTCCCGCAGCCCCATATCCAGCTGTACAGTATCCACATGGTGCGTAACAGCCTGAAATACGTAGCCTGGAAAGGCTACAAAGTCGCCACCAGCGGCCTCAGGCCCCGTCCGAAGAGGCGGCACTGA
Protein sequences of DBSCAN-SWA_7 >LR134233|1939512:1987624|1951324_1951924_-|VEC92010.1|DBSCAN-SWA MKKHYSVQYETGDIVFTCIGAALFGQISTASQCWSNHVGIIIGHNGDDYLVAESRVPLSTVTTLSLFIQRSAGQRYAVRRLCGGLTVEQKLAIMEQVPARLNKFYHTGFKYESSRQFCSKFVFDIYKEALCIPVGDIETFEELLHSNPDAKLTFWKFWFLGSIPWDRKTVTPASLWHHPNLELISACGIETPQREAEGE >LR134233|1939512:1987624|1941482_1942052_+|VEC91997.1|DBSCAN-SWA MANWQHIDELHDISADLPRFTLAFRELSTRLGLQISALEADHISLRCHQNTTAERWRRGFEQCGELLSENIINGRPICLFKLHEPVCVEHWRFSVIELPWPGEKRYPHEGWEHIEIVLPGEPETLNARALALLSDEGLSQPGIVVKTSSPQGEHERLPNPTLAVTDGRVTVKFHPWSIEAIVASEQAAH >LR134233|1939512:1987624|1969142_1969802_+|VEC92030.1|DBSCAN-SWA MANWLNQLQSLLGQKGASASSSGEQGLNKLLVPGALGGLAGLLVANKSSRKLLTKYGTGALLVGGGAVAGSVLWNKYKDKVRAAHQGEPQFGSQSTPLDVRTERLILALVFAAKSDGHIDAKERAAIEHQLRESGVEEQGRVFIEKAIEQPLDPQRLAQGVRNEEEALEIYFLSCAAIDIDHFMERSYLNALGDALKIPQEVRDGIEQDLQQQKQALPG >LR134233|1939512:1987624|1968695_1969049_+|VEC92029.1|DBSCAN-SWA MNKRGALLSLLLLSASVSAFAASTESKSVKFPQCEGLDAAGIAASVKRDYQQNRIVRWADDQKKVGQADPVAWVNVQDVVGQNDKWTVPLTVRGKSADIHYQVIVDCKAGKAEYKPR >LR134233|1939512:1987624|1946317_1946782_-|VEC92004.1|DBSCAN-SWA MIYIGLPQWSHPKWARLGITSLEEYARHFNCVEGNTTLYALPKAEIVDRWYAQTTDDFRFCFKFPATISHQAALRHCDDLVQAFFTRLAPLETRIGQYWLQLPAAFGPRDLPALWQFLDALPATFTYGVEVRHPCFLTKVRMNSGSIAGYTRAA >LR134233|1939512:1987624|1943040_1943499_+|VEC91999.1|transposase|DBSCAN-SWA MGDEKSLAHTRWNCKYHIVFAPKYRRQAFYGEKRRAVGSILRKLCEWKNVRILEAECCADHIHMLLEIPPKMSVSSFMGYLKGKSSLMLYEQFGDLKFKYRNREFWCRGYYVDTVGKNTAKIQDYIKHQLEEDKMGEQLSIPYPGSPFTGRK >LR134233|1939512:1987624|1967028_1968207_-|VEC92027.1|DBSCAN-SWA MTLLGTALRPAATRVMLLGAGELGKEVAIECQRLGIEVIAVDRYPDAPAMHVAHRSHVINMLDGEALRHVITEEKPHYIVPEIEAIATDTLRELEDEGLNVVPCARATQLTMNREGIRRLAAEELGLPTSTYRFADSEASFHDAVAAVGFPCIVKPVMSSSGKGQSFIRSAEQLAQAWEYAQQGGRAGAGRVIVEGVVKFDFEITLLTVSAVDGVHFCAPVGHRQQDGDYRESWQPQQMSELALKRAQEIARHVVLALGGHGLFGVELFVCGDEVIFSEVSPRPHDTGMVTLISQDLSEFALHVRAFLGMPIGAIRQYGPAASAVILPQLTSQNVTFDNVHTAVGAGVQVRLFGKPEIDGSRRLGVALATGENVEEAVIRAKKAASRVTVKG >LR134233|1939512:1987624|1939512_1941246_-|VEC91996.1|tRNA|DBSCAN-SWA MNIQALLSEKVSQAMIAAGAPADCEPQVRQSAKVQFGDYQANGMMAVAKKLGMAPRQLAEQVLTHLDLSGIASKVEIAGPGFINIFLEPAFLAEQVQQALTSDRLGVSQPTRQTIVVDYSAPNVAKEMHVGHLRSTIIGDAAVRTLEFLGHHVIRANHVGDWGTQFGMLIAWLEKQQQENAGDMALADLEGFYRDAKKHYDEDEAFAERARNYVVKLQSGDTYFREMWRKLVDITMTQNQITYDRLNVTLTRDDVMGESLYNPMLPGIVADLKAKGLAVESEGATVVFLDEFKNKEGDPMGVIIQKKDGGYLYTTTDIACAKYRYETLHADRVLYYIDSRQHQHLMQAWTIVRKAGYVPDSVPLEHHMFGMMLGKDGKPFKTRAGGTVKLADLLDEALERARRLVAEKNPDMPADELEKLANAVGIGAVKYADLSKNRTTDYIFDWDNMLAFEGNTAPYMQYAYTRVLSVFRKANIDEQALASAPVIISEDREAQLAARLLQFEETLTVVAREGTPHVMCAYLYDVAGLFSGFYEHCPILSAENDAVRNSRLKLAQLTAKTLKLGLDTLGIETVERM >LR134233|1939512:1987624|1958907_1959879_+|VEC92021.1|DBSCAN-SWA METKKNNSEYIPEFEKSFRYPQYWGAWLGAAAMAGIALTPASFRDPLLATLGRFAGRLGKSSRRRALINLSLCFPQRSEAEREAIVDEMFATAPQAMAMMAELAMRGPKKIQQRVDWEGLEIIEEMRRNDEKVIFLVPHGWGVDIPAMLMASQGQKMAAMFHNQGNPVFDYIWNTVRRRFGGRLHARNDGIKPFIQSVRQGYWGYYLPDQDHGPEHSEFVDFFATYKATLPAIGRLMKVCRARVIPLFPVYNGKTHRLTIQIRPPMDDLLTADDHTIARRMNEEVEIFVGPHPEQYTWILKLLKTRKPGEIQPYKRKDLYPIK >LR134233|1939512:1987624|1979238_1979406_-|VEC92044.1|DBSCAN-SWA MKNIITIIVAIIIVFYAGMWSQKFLMEDECLDSGGSYNENGICNIAGSHQDVPPK >LR134233|1939512:1987624|1947718_1949440_+|VEC92006.1|tRNA|DBSCAN-SWA MTLCGWVNRRRDLGSLIFIDMRDREGIVQVFFDPDRADALKLASELRNEFCIQVTGTVRARDAKNVNADMATGEIEVLASSLTIINRADSLPLDANHVNTEEARLKYRYLDLRRPEMAQRLKTRAKITSLVRRFMDDHGFLDIETPMLTKATPEGARDYLVPSRVHKGKFYALPQSPQLFKQLLMMSGFDRYYQIVKCFRDEDLRADRQPEFTQIDVETSFMTAPQVREVMEALVRHLWLEVKGVDLGDFPVMTFAEAERRYGSDKPDLRNPMELVDVADLLKSVEFAVFAGPANDPKGRVAALRVPGGAQLSRKQIDDYGNFVKIYGAKGLAYIKVNERAKGLDGINSPVAKFLTADIVEAILERTGAQDGDMIFFGADNKKVVADALGALRLKLGKDLSLTDEDKWAPLWVIDFPMFEDDGEGGLTAMHHPFTAPRDMTASELKTAPEEAVANAYDMVINGYEVGGGSVRIHNGEMQQTVFGILGINEQEQREKFGFLLDALKYGTPPHAGLAFGLDRLTMLLTGTDNIRDVIAFPKTTAAACLMTEAPSFANQAALTELGIQVVKKAENN >LR134233|1939512:1987624|1950024_1950765_+|VEC92008.1|DBSCAN-SWA MAGHSKWANTRHRKAAQDAKRGKIFTKIIRELVTAAKLGGGDPDANPRLRAAVDKALANNMTRDTLNRAIARGVGGDEDSNMETIIYEGYGPGGTAIMIECLSDNRNRTVAEVRHAFSKCGGNLGTDGSVAYLFSKKGVISFEKGDEDTIMEAALEAGAEDVVTYDDGAIDVYTAWEEMGKVRDALEAAGLKADSAEVSMIPSTKADMDAETAPKLLRLIDMLEDCDDVQEVYHNGEISDEVAATL >LR134233|1939512:1987624|1975224_1975578_+|VEC92037.1|DBSCAN-SWA MKKILLPALLLATSGVALAAPQVITVSRFEVGKDKWAFNREEVMLTCRPGQALYVINPSTLVQYPLNAIAEQQVAEGKTRAQPIAVIQIDNPAKPGEKMSLAPFIERAQKLCDPSNS >LR134233|1939512:1987624|1962728_1964204_+|VEC92024.1|DBSCAN-SWA MAVTQTAQACDLVIFGAKGDLARRKLLPSLYQLEKAGQIHPDTRIIGVGRADWDKEAYTHVVREALETFMKEKIDEGLWDTLSGRLDFCNLDVNDTPAFSRLGDMLDQKNRTTINYFAMPPSTFGAICKGLGEAKLNAKPARVVMEKPLGTSLATSREINDRVGEYFEECQVYRIDHYLGKETVLNLLALRFANSLFVNNWDNRTIDHVEITVAEEVGIEGRWGYFDQAGQMRDMIQNHLLQILCMIAMSPPSDLSADSIRDEKVKVLKSLRRIDRSNVREKTVRGQYTAGFAQGQKVPGYLEEEGANKSSNTETFVAIRVDIDNWRWAGVPFYLRTGKRLPTKCSEVVVYFKTPELNLFKESWQDLPQNKLTIRLQPDEGVDIQVLNKVPGLDHKHNLQITKLDLSYSETFNQTHLADAYERLLLETMRGIQALFVRRDEVEEAWKWVDSITEAWAMDNDAPKPYQAGTWGPVASVAMITRDGRSWNEFE >LR134233|1939512:1987624|1973589_1973820_-|VEC92034.1|DBSCAN-SWA MKTNLAQLEQAEMDKVNVDLAAAGVAFKERYNMPVVAEAVEREQPEHLRAWFRERLIAHRLASVSLSRLPYEPKVK >LR134233|1939512:1987624|1974332_1975208_+|VEC92036.1|DBSCAN-SWA MMLTFVWITLRFIHFASVMLVYGCALYGAWLAPASIRRLMTRRFLHLQRHAAAWSVISAAFMLAIQGGLMGGGWPDVFSVSVWGAVLQTRFGAVWIWQIILALVTLAVVVIAPIKMQRRLLILTVAQFILLAGVGHATMRDCVAGTLQQINHALHLLCAAAWFGGLLPVVYCMRMAQGRWRQHAISAMMRFSRYGHFFVAGVLLTGIGNTLFITGFTAIWQTTYGQLLLLKCALVVLMVAIALTNRYVLVPRMRQENPRTDLWFVRMTQIEWGVGGIVLAIVSLFATLEPF >LR134233|1939512:1987624|1975839_1975968_+|VEC92038.1|DBSCAN-SWA MGYVFPLEIPRNFLEFLYSGLFGYITSKSSLTFLLQQNQSIT >LR134233|1939512:1987624|1979713_1979977_+|VEC92045.1|terminase|DBSCAN-SWA MDDEPKNLKCNICQLAAITGLHRQTVVSRLSGVPLALGSNEKNKLYLLTDVIRVLMETPVSQAAEHQDPNKMTPKERKNWFDSEKGR >LR134233|1939512:1987624|1955673_1956363_-|VEC92017.1|DBSCAN-SWA MSLELSPGKILTLLGPNGAGKSTLVRVVLGLVAPDEGVIKRNGQLRIGYVPQKLYLDTTLPLTVNRFLRLRPGTQKTDILPALKRVQAGHLIDAPMQKLSGGETQRVLLARALLNRPQLLVLDEPTQGVDVNGQVALYDLIDQLRRELDCAVLMVSHDLHLVMAKTDEVLCLNHHICCSGAPEVVSMHPEFISMFGPRGAEQLGIYRHHHNHRHDLQGRIVLRRGNGHS >LR134233|1939512:1987624|1946778_1947345_-|VEC92005.1|DBSCAN-SWA MLKLNATTTALVVIDLQEGILPFAGGPYTANEVVARAARLAEKCRANGSPVVMVRVGWSDDYAEALKQPVDAATPAHALPENWWTWPTALGKKDSDLEVTKRQWGAFYGTDLELQLRRRGIDTIILCGISTNIGVESTARNAWELGFNLIIAEDACSAASSKQHQSSMTHIFPRIGRVRSVEDILNAL >LR134233|1939512:1987624|1954891_1955677_-|VEC92016.1|DBSCAN-SWA MIELLLPGWLAGMMLACAAGPLGSFVVWRRMSYFGDTLAHASLLGVAFGLLLDVNPFYAVIAVTLLLAAGLVWLEKRPHLAIDTLLGIMAHSALSLGLVVVSLMSNVRVDLMAYLFGDLLAVTPEDLISIAIGVVIVLAIIFWQWRNLLSMTISPDLAFVDGVKLQRVKLLLMLVTALTIGVAMKFVGALIITSLLIIPAATARRFARTPEQMAGVAVGVGMIAVTGGLTFSAFYDTPAGPSVVLCAALLFIFSMMKKQAS >LR134233|1939512:1987624|1945964_1946312_-|VEC92003.1|DBSCAN-SWA MILDSRPVHAAHPHSEAVRDAQRKKPKVPVHAVVTASHPMVRFIGSDNMAQNREFFAAWLQKLPQWRQTTTPFLFLHTPDIAQAPELVNTLWHDLRSVLPEIGTAPSIPQQSSLF >LR134233|1939512:1987624|1961517_1962387_-|VEC92023.1|DBSCAN-SWA MNMLEKVQSQLEHLSKSERKVADVILAAPGRSIHLSIAMLAQEANVSEPTVNRFCRSMNTRGFPDFKLHLAQSLANGTPYVNRNVDEDDSVEAYTGKIFESAMASLDHVRQSLDKSAVNRAVDLLTQAKKIAFFGLGSSAAVAHDAMNKFFRFNVPVIYSDDVVLQRMSCMNCSDDDVVVLISHTGRTKSLVELAQLARENDAMVIALTSAGTPLAREATLAITLDVPEDTDIYMPMVSRLAQLTVIDVLATGFTLRRGAKFRDNLKRVKEALKESRFDKELLIKSDDR >LR134233|1939512:1987624|1949542_1949995_+|VEC92007.1|DBSCAN-SWA MKDKVYKRPVSVLVVIFAQDTKRVLMLQRRDDPDFWQSVTGSIEEGETALQAAVREVKEEVTIDVVAEQLTLIDCQRTVEFEIFSHLRHRYAPGVMHNTEFWFCLALPHERQVIFTEHLTYQWLDAPDAAALTKSWSNRQAIEEFVINVA >LR134233|1939512:1987624|1964438_1966250_+|VEC92025.1|DBSCAN-SWA MNPNLLRVTQRIVERSQQTREAYLARIEQAKTATVHRSQLACGNLAHGFAACQPEDKASLKSMLRNNIAIITSYNDMLSAHQPYEHYPEIIRQALHSVNAVGQVAGGVPAMCDGVTQGQDGMELSLLSREVIAMSAAVGLSHNMFDGALFLGVCDKIVPGLAMAALSFGHLPAIFVPSGPMASGLPNKEKVRIRQLYAEGKVDRMALLESEAASYHAPGTCTFYGTANTNQMVVEFMGMQLPGSSFVHPDAPLREALTAAAARQVTRLTGNGNTWMPLGKMIDEKVVVNGIVALLATGGSTNHTMHLVAMARAAGILINWDDFSDLSEVVPLMARLYPNGPADINHFQAAGGVPVLMRELLNAGLLHEDVNTVAGFGLKRYTLEPWLNNGELDWREGAERSLDNDVIASFDKPFSPHGGTKVLSGNLGRAVMKTSAVPVENQIIEAPAMVFESQHDVLPAFDAGLLDRDCVVVVRHQGPKANGMPELHKLMPPLGVLLDRRFKIALVTDGRLSGASGKVPSAIHVTPEAYDGGLLAKVRDGDIIRVNGQTGELTLLVDEAELAARQPHIPDLSASRVGTGRELFGALREKLSGAEQGATCITF >LR134233|1939512:1987624|1944732_1945476_-|VEC92001.1|tRNA|DBSCAN-SWA MSHRDTLFSAPIARLGDWTFDERVAEVFPDMIQRSVPGYSNIISMIGMLAERFVQPNTQVYDLGCSLGAATLSVRRNIRHEHCRIIAVDNSPAMIERCRRHIDAYKAPTPVEVVEGDIRDITIENASMVVLNFTLQFLEPAERQALLDKIYQGLNPGGALVLSEKFSFEDAKVGELLFNMHHDFKRANGYSELEISQKRSMLENVMLTDSVETHKSRLRKAGFEHSELWFQCFNFGSLVALKAGVAA >LR134233|1939512:1987624|1956698_1957145_+|VEC92018.1|DBSCAN-SWA MRPSDVKRLQGADLVVWVGPEMEAFMEKSVRNIPDNKQVTIAQLADVKPLLMKGADDDEDEHAHTGADEEKGDVHHHHGEYNMHLWLSPEIARATAVAIHEKLVELMPQSRAKLDANLKDFEAQLAATDKQVGKRARAAQGERVFRFS >LR134233|1939512:1987624|1975952_1976222_-|VEC92039.1|integrase|DBSCAN-SWA MISAGIRKNSPTGNIHPDGLTKKFVKARKISDVKCSDNPPTFHEIRSLLGRLYKDERGEEFAQKLLGHTSENTTKLYLDERDNKAYVML >LR134233|1939512:1987624|1970015_1972067_+|VEC92031.1|DBSCAN-SWA MLPKANRIPYAMTVHGDTRIDNYYWLRDDTRSQPEVLDYLHQENEYGRKVMSSQQALQDRILKEIIDRIPPREVSAPYVKNGYRYRYIYEPGCEYAIYQRQSALSEEWDVWETLLDANQRAAHSEFYTLGGLAITPDNTIMALAEDYLSRRQYGLRFRNLESGNWYPELLDNVAPEFVWANDSLTLYYVRKHKKTLLPYQVWRHTIGTPSSQDELVYEEKDDTFYVSLHKTTSQHYVVIHLASATTSEVLLLDAELADAEPFSFLPRRKDHEYSLDHYQHKFYLRSNRNGKNFGLYRTRVRNENAWEELIPPREHIMLEGFTLFTDWLVVEERQRGLTSLRQINRKTREVIGIAFDDPAYVTWLAYNPEPETSRLRYGYSSMTTPDTLFELDMDTGERRVLKQTEVPGFDSGCYQSEHLWITARDGVEVPVSLVYHQKYFRKGQNPLLVYGYGSYGSSIDADFSSSRLSLLDRGFVYAIVHVRGGGELGQQWYEDGKFLKKRNTFNDYLDACDALLKLGYGSPSLCYGMGGSAGGMLMGVAINERPELFHGVIAQVPFVDVLTTMLDESIPLTTGEFEEWGNPQDIEYYNYMKSYSPYDNVKAQDYPHLLVTTGLHDSQVQYWEPAKWVAKLRELKTDQRLLLLCTDMDSGHGGKSGRFKSYEGVALEFAFLIGLAQGTLHSA >LR134233|1939512:1987624|1977027_1978134_-|VEC92041.1|DBSCAN-SWA MPDNTPTTHEENAARLRQAGKCLRDIETGRFQCDEEKPQPTGELADEPATPEAVEQDTTEHHPDPQPLENEPPVSQTEAGYQKIRAELHEARKNIPPKNPVDVGKQLAAARGEYVEGISDPDDPKWVHNNYSASNQGEKEEVVPEEKQPAAEPEAVTRNADGTFDVSALFPPPSNQTEKTEARTERDGETPKESNQQETAGDTGQEITTDGGSGTGGDEAGEAADPVENGNFTVPDDIQPGIYYDIPNEAYHAGPGVSKSQLDDIEDTPAIYLWRKNAPVDTEKTKSLDTGTAFHCRVLEPEEFSKRFIIAPEFNRRTSAGKEEEKTFLEECARTGITVLTAEEGRKIEFMYQSAMALTECIAGEVDQ >LR134233|1939512:1987624|1968337_1968628_+|VEC92028.1|DBSCAN-SWA MAVEVKYVVIREGEEKMSFTSKKEADAWDKMLDTADLLDTWLEQSPVVLEDGQREALSLWLAEHKEVLSTILKTGKLPSPQAVEKDAASKTKKQAA >LR134233|1939512:1987624|1942071_1942818_+|VEC91998.1|DBSCAN-SWA MALLEICCYSMECALTAQRNGADRIELCAAPKEGGLTPSFGVLRSVREHITIPVHPIIRPRGGDFYYTDGEFAAMLEDIRLVRELGFPGLVTGVLTVDGDVDMSRMEKIMAAAGPLAVTFHRAFDMCANPFNALKNLADAGVARVLTSGQKADAAQGLSIIMELIAQGDAPIIMAGAGVRANNLQNFLDAGVREVHSSAGVLLPSPMRYRNQGLSMSADIQADEYSRYRVEGAAVAEMKGIIVRHQAK >LR134233|1939512:1987624|1978164_1978395_+|VEC92042.1|DBSCAN-SWA MNGKKAFAPRPAKIRITGLENYTRCGVALKLDLVANPGQLELERHAARSAAWLFVTKGCLKYSGDLVRVTQIINGG >LR134233|1939512:1987624|1982579_1982702_+|VEC92047.1|DBSCAN-SWA MFVIHLEGYLADSIKRQLWPLLQYPVCHYSATTLVAVGQN >LR134233|1939512:1987624|1953460_1953727_+|VEC92013.1|DBSCAN-SWA MSAQQFVNAVEREELGALVKLPGIGKKTAERLIVEMKDRFKGLHGDLFTPAVDLVLTSPASPTSEDAEQEAVAALVALGYKPQKPVGW >LR134233|1939512:1987624|1943764_1944736_-|VEC92000.1|tRNA|DBSCAN-SWA MIEFGNFYQLIAKNHLSHWLETLPAQIAAWQREQQHGLFKQWSNAVEFLPEMTPWRLDLLHSVTAESETPLSEGQLKRIDTLLRNLMPWRKGPFSLYGVDIDTEWRSDWKWDRVLPHLSDLTGRTILDVGCGSGYHLWRMIGAGAHLAVGIDPTQLFLCQFEAVRKLLGNDQRAHLLPLGIEQLPALKAFDTVFSMGVLYHRRSPLEHLWQLKDQLVNEGELVLETLVVDGDENTVLVPGDRYAQMRNVYFIPSAPALKKWLEKCGFIDVRIADVCVTTTEEQRRTEWMVTESLADFLDPNDRSKTVEGYPAPQRAVLIARKP >LR134233|1939512:1987624|1953803_1954415_+|VEC92014.1|DBSCAN-SWA MIEADRLISAGATIAEDVADRAIRPKLLAEYVGQPQVRSQMEIFIQAAKLRGDALDHLLIFGPPGLGKTTLANIVANEMGVNLRTTSGPVLEKAGDLAAMLTNLEPHDVLFIDEIHRLSPVVEEVLYPAMEDYQLDIMIGEGPAARSIKIDLPPFTLIGATTRAGSLTSPLRDRFGIVQRLEFYQVPDLQHIVGRSARHMGWR >LR134233|1939512:1987624|1972825_1973482_-|VEC92033.1|DBSCAN-SWA MSSWKIAAAQYAPLNASPAEHVAHHLEYIELAARQQCELLVFPSLSLLGCDERNKPLPAPPDEALLQPLTHAADTHHMTIIVGMPVEHNCRFVKGIAIFAPWLTSPLMFHKSHGACIARQRSAINVVDEQPEGGDIDPSFTLFTTSQCLNEPELHASTSRLQRFSHKYALAVLMANACGSSALWDESGQLIVRADCGSLLLTGLRTTEGWQGDIIPLR >LR134233|1939512:1987624|1953183_1953492_+|VEC92012.1|DBSCAN-SWA MIGRLRGIILEKQPPIVLLETGGVGYEVHMPMTCFYELPEAGQEAIVFTHFVVREDAQLLYGFNNKQERTLFKELIKTNGVGPKLALAILSGHVGATVRQCG >LR134233|1939512:1987624|1972103_1972802_-|VEC92032.1|DBSCAN-SWA MLRIIDTETCGLQGGIVEIASVDVIDGNIVNPMSHLIRPDRPITPQAMAIHRITEAMVADKPWIEDVIPLYYGSEWYVAHNASFDRRVLPELPGEWICTMKLSRRLWPGIKYSNMALYKSRKLSVQTPPGLHHHRALYDCYITAALLIDIMRTTGWTAEEMVNITGRPALLTTFPFGKYRGKAVSEVAKRDPGYLRWLFNNLDNMSPELRLTLKHYLEDVQAGEQRSNGTPQ >LR134233|1939512:1987624|1952132_1952744_+|VEC92011.1|DBSCAN-SWA MVDDKPMRIGWVFYFSYIFVAIIFHLFISFCYCLSIDMAEANTVVFLLKPGTISLLFLLLPARRFRTRLLATLSSVFITLVFNQWHLVAGNKELVLCLQAACFMAFLAMTSVKKSGWMISASLFLVCAAGTIRQCWLEQLFNVADIYIVDDGRSCGASGHCFQYIAAKGRGLAAKRQALFSSEEYVNIYYSYSEGIPAVNSMA >LR134233|1939512:1987624|1945516_1945912_-|VEC92002.1|DBSCAN-SWA MVSALYAVLGALLLMKFSFNVVRLRMQYRVAYGDGGFSELQSAIRIHGNAVEYIPVALVLLLFMEMNGAETWMVHICGIILIAGRLMHYYGFHHRLFRWRRAGMSATWCALLLMVLANLWYMPWELVFSLY >LR134233|1939512:1987624|1950801_1951323_+|VEC92009.1|DBSCAN-SWA MSIILGIDPGSRITGYGVIRQVGRQLTYLGSGCIRTKVDDLPSRLKLIYAGVTEIITQFQPDYFAIEQVFMAKNADSALKLGQARGVAIVAAVNQELPVFEYAARQVKQTVVGIGSAEKSQVQHMVRTLLKLPANPQADAADALAIAITHCHVSQNAMQMSESRLNLARGRLR >LR134233|1939512:1987624|1983774_1983933_-|VEC92049.1|DBSCAN-SWA MFMFLPFLLALSVAMGAINRKDKVSYILWAVLLIVTILSFIHHMTNSLTLSF >LR134233|1939512:1987624|1978448_1978982_+|VEC92043.1|DBSCAN-SWA MNLLSALLKRYWLQLVFILLMAGAFIAGNVWSDRGWQKKWADRDSAESSQEVNAQTAARIIEQGRVIARDEAVKDAQAQAAKSAATAAGLSATVSQLRTEAKKLATRLDAAKHTANLAAAVRSKTTNADARMLANMLGDIAEEAKHYAGIADERYRAGMTCERVYDSVRESNNYRRH >LR134233|1939512:1987624|1966287_1966929_+|VEC92026.1|DBSCAN-SWA MKNWKTSAEAILTTGPVVPVIVVNKLEHAVPMAKALVAGGVRVLEVTLRTACAMDAIRAIAKDVPEAIVGAGTVLNPQQLAEVTEAGAQFAISPGLTEPLLKAATAGTIPLIPGISTVSELMLGMDYGLKEFKFFPAEANGGTKALQAIAGPFSQVRFCPTGGISPANYRDYLALKSVLCIGGSWLVPADALEAGDYDRITKLAREAVEGAKQ >LR134233|1939512:1987624|1973957_1974332_+|VEC92035.1|DBSCAN-SWA MASSAPSRRLALLLLASTFATPAAWAHAHLTHQYPAANAAVTASPQALTLNFSEGIEPGFSGATITGPQQELIKTRPAKRNEQDKTQLIIPLEQPLKSGAYTVDWHVVSVDGHKTKGKYTFSVK >LR134233|1939512:1987624|1980131_1980563_+|VEC92046.1|tail|DBSCAN-SWA MYIGPHGHVVIVDADGNAETFGLMDGGVDAAITAYFGSQLQERVQQNIIREYLGEQPVGTAFVIETGNSKHPWLVHAPTMRVPLIIDGTDAVYNATRAALLAIFQHNKSAGEDRKITSVALPAMGAGCSQVPPDSVARQIVLI >LR134233|1939512:1987624|1959951_1961394_-|VEC92022.1|DBSCAN-SWA MSRRLRRTKIVTTLGPATDRDNNLEKVIAAGANVVRMNFSHGSPEDHKMRADKVREIAAKLGRHVAILGDLQGPKIRVSTFKEGKVFLNIGDKFLLDANLGKGEGDKEKVGIDYKGLPADVVPGDILLLDDGRVQLKVLEVQGMKVFTEVTVGGPLSNNKGINKLGGGLSAEALTEKDKADIQTAALIGVDYLAVSFPRCGEDLNYARRLARDAGCDAKIVAKVERAEAVCDQNAMDDIILASDVVMVARGDLGVEIGDPELVGIQKALIRRARQLNRAVITATQMMESMITNPMPTRAEVMDVANAVLDGTDAVMLSAETAAGQYPSETVAAMARVCLGAEKIPSINVSKHRLDVQFDNVEEAIAMSAMYAANHLKGVTAIITMTESGRTALMTSRISSGLPIFAMSRHERTLNLTALYRGVTPVHFDSAADGVVAAHEAVNLLRDKGYLVSGDLVIVTQGDVMSTVGSTNTTRILTVE >LR134233|1939512:1987624|1957999_1958791_+|VEC92020.1|DBSCAN-SWA MQQGDWVNSLLKGTVGGSFVASAKEAGLTSSEISAVIKAMQWQMDFRKLKKGDEFSVLMSREMLDGKREQSQLLGVRMRSDGKDYYAIRAADGKFYDRNGVGLAKGFLRFPTAKQFRISSNFNPRRLNPVTGRVAPHRGVDFAMPQGTPVLSVGDGEVVVAKRSGAAGYYIAIRHGRTYTTRYMHLRKLLVKPGQKVKRGDRIALSGNTGRSTGPHLHYEVWINQQAVNPLTAKLPRTEGLTGSDRREYLAQVKEVLPQLRFD >LR134233|1939512:1987624|1984065_1984980_-|VEC92050.1|DBSCAN-SWA MRKVSISILFMLVSLTWGTTWLAMRIAVETIPPVFATGMRFMFAAPFLIIIAWLRKKTLLFPPGQRLFQFVICIFYFCIPFSLMIYGETYVNSGLAAIIFANMPVAVLIASVLFLNEKAKLMQIAGLTIAITALTGILLEETNTSTESHWQGITALISAVLIHAIIYTQCKKRSCTVSVITFNALPCLLAGLILSATGWFFERPQVSTFSVHSILATLYLGAFAGVFGILCYFALQQKANAFQASLVFLIFPLIAVSLEDYIYGYAISTHSMLLIIPLVIGIFLTLVARNIPVTSRCRDNSSQK >LR134233|1939512:1987624|1983150_1983765_-|VEC92048.1|DBSCAN-SWA MKTHHHPTTFVHLINQVGLLGICVALVVAFYYQLVRHELPCPICLLQRAGLIIAGFGFLFNLCFGLRGIHYGMVIIGSILTGVMASRQICLHIMPGDTGYGSAFFGLHFYTWTLITSILIIIAVAVILAISSMNVAFRSLNINPDLFSIVGWVFLLLITANLISTVLECGGGECAANPVTYKLLSKQDIAFLKTGLLTRTVLRL >LR134233|1939512:1987624|1954411_1954813_+|VEC92015.1|DBSCAN-SWA MSDDGALEVARRARGTPRIANRLLRRVRDFAEVKHDGAISAEIAAQALDMLNVDAEGFDYMDRKLLLAVIDKFFGGPVGLDNLAAAIGEERETIEDVLEPYLIQQGFLQRTPRGRMATVRAWNHFGITPPEMP >LR134233|1939512:1987624|1957470_1957986_+|VEC92019.1|DBSCAN-SWA MQQIARSVALAFNNLPRPHRVMLGSLTVLTLAVAVWRPYVYHPESAPIVKTIELEKSEIRSLLPEASEPIDQAAQEDEAIPQDELDDKTAGEVGVHEYVVSTGDTLSSILNQYGIDMSDISRLAASDKELRNLKIGQQLSWTLTADGDLQRLTWEVSRRETRTYDRTANGF >LR134233|1939512:1987624|1986766_1987624_+|VEC92051.1|transposase|DBSCAN-SWA MDEKKLKALAVELAKGLKTEADLNSFSRMLAKLTVETVLNAELADHPGYEKNAPKTGSNTRNGYSSKTVLCDDGEIELNTPRERENTFEPQLLKKHQTRITQMDSQILSLYAKGMTTREIVATCKEMYDANVSPSLISKVTDAVKEQVTERQNRQPDALYPIVYMDCIVVKVRQNGSVINKAVFLALGINTEGQKELLGMWLAENEGEKFWLSVLTELKTRGVQDILIACVDGLNGFPDAINSVFPQPHIQLYSIHMVRNSLKYVAWKGYKVATSGLRPRPKRRH >LR134233|1939512:1987624|1976236_1977031_-|VEC92040.1|integrase|DBSCAN-SWA MSRKKYDANLPRYLTYRKASKSFFWRNPVTDKEFPLGQIARRDAITQAIEANNFIAQNHTPVALIEKLKGTDSFTVSAWIDRYEVLLQRRSLSVNTYKIRSNQLATVREKMGEMILAEVTTRHIAEFLESWIAEGKNTMAGAMRSVLSDMFREAIVEGRITTNPVEPTRAPEIKVARERLQLETYNATRTAAEYLPVWFPLAMDLALVTGQRREDIVNMKFSDIVDGRLHVTQIKTGMKIAFPLSLTLQAPGLRPGRLSIAADW |
56 | Bacillus_virus(10.53%) | tail,terminase,integrase,tRNA,transposase | attL 1975800:1975822|attR 1986128:1986150 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_8 |
3223629 : 3230620
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >LR134233|3223629:3230620|DBSCAN-SWA GTTAAGCCGTAATAGCCGGGATCACCGACTCTGCGCCAGTATTAATGAAGATCTTTTCTCCCCGTAACACCCGCTCGCCGTCCGCCTGAAATACCCGTAAGGTATGGTTATCAATAAACTCCGCCCTTCCCTCAATCACATCGACATTGTCCAGGTCGGCAAGATTATGAAAATTTTTGTCGCGTAAAAAATTTACGACCGCCGCTTTGCGTTGCATGGCGACTGAAAAATCGCCCTCACGTTCCGCATCATGCACCAGCGTTTTCGTTGGAATACAGCCAATGTTGATGCAGGTTCCGCCGAACATACTGGCGGACTGTTCGATAATCGCCACGCGCCATCCCGTCTTCGCCAGAGTTGTCGCCAACGTTTTTCCTGCCTTGCCAAACCAATAATGAGCGCCTGATACTGTGTCATCTTCTTTCTCCGCCAAATCACCATCTGTTGTCAGGACAAGTATAAATAACGTCAAAAAAGACAGGGTTATCCGCAACGCGCCATTTTTTGTCTTATCGTCTAATTGGCGGTTAGCAAAAACAAGGGTAGAAATGACATTGAGTCCGAAATATAGTGATCAGAGACAGAAATGGTTATCTCTAGTTTTGAAAATAAATCTGTTCTGGAGTCTATATGGATGCGCTTAGTCGGCTATTGACGCTTAACGCCCCTCAGGGCTCAATCGACAAGAATTGCCCGTTAGGCGGCGACTGGCAGTTACCGCACGCCGCTGGCGAGCTATCGGTGATTCGCTGGCATACCGTTACACAGGGCGAAGCGCAACTGGAAATACCTACCGGCGACGCCATGATACTTACGCCGGGAAAGGTGGTGATTCTGCCGCAAAACTCCGCCCACCGTTTGCGCCAGTCTGGCGAAGCACCGACACATATCGTGTGCGGTAGTCTACGTCTGCACACGACGTCACGTTATTTCCTTACCGCGTTGCCGGAAGTGTTATGTCTTGCCCCGCCGCCGCATAGCCCCGCCAGTATCTGGCTTAACGCCGCCATTTTGCTGTTACAGCAGGAGTCAGAACGCCATTTACCTGGCGCTGATGTCTTATACAGTCAGCAGTGCGCCACGCTGTTTACCCTCGCCGTTCGCGACTGGCTATCGCAGGCTGGCACAGCGAAAAGCGTGCTCAATTTATTGCTGCATCCCCGGCTGGGCCGCGTGATCCTTCACATGCTGGAAACACCGGCGCATCCCTGGACGGTCGAAACGCTGGCGCAACGGGTACACATGTCGCGGGCGAGCTTCGCCCAGCTATTTCGCGACGTGTCCGCTACAACGCCCTTAGCCGTGTTAACAACGCTGCGCCTGCAAATCGCCGCCCAGACCCTGTCGCGTGAAGCGCTGCCGGTGATGGTGATTGCCGAGTCGGTAGGTTATGCGAGCGAATCTTCTTTTCACAAAGCTTTTGTCCGTGAATTCGGTTGTACACCAGGCGAGTACCGCAAACGGGTCAACGCGCTCGGACGATAAACGTATACTGAACCCGCCTACAGTTTGACGCCGCAGGCGGGCGACACACGTTATGCCGCAAGGCGACGGTGCCTGCGCAGCCACATCAGCTTGTAGGCTGCCGGAATGATGAACAGCGACAGCAACGGCGCGGTAATACGGCTCATCACCTCAGAACCCGCGCCGGTCCCCCATAAAACAGGTAGCAGACCGGCGATAATCACCGCTACCGTCATGGCTTTTGGCCGCACCCGGAGCACCGCCCCGTGGTGAAGCGCCTCATCGAGTTTTTCTGCCGAAAAGGTGTTGGGAGCTTGCGCTTTGAAGGCCGCGCCTGCTGGATCACCGGCGATCCCATCATGCTGCGCAGAGCCATCAGTAATTGACTCTCTAATACGATGCGCTATACGCCGCCGGGTAAAACCATTACTCTTCAGATAAAAGAAGCGGACGACCAGATACATATTATTGTTGAAAACCCCGGTACGCCAATCGCGCCGGAACATTTGCCGCGCCTTTTCGATTGTTTTTATCGCATTGATCCCTCCCGCCAGCGTAAAGGGGAAGGCAGCGGTATAGGCCTAGCAATTGTAAAATCCATCGTCACTGCGCACCGGGGAAAAGTTTCTGTCACCTCCGATACGCGCGCTACTCGTTTTATTTTGACATTACCCAAACACCTCCGTTGAACTGTAGCGATAGTGAAAGGCGCTATCGCTAACCAGCAAAATAATCTGACATCATCGTCATTATCCCGTCACGCGCCAGACTCAACGACTGTTTTATTATTTCTCTTATTAACCCACACCGGAGAAATAAGTTGAATAAGTTTACATCCCTTTTAGCTGTCGTTTTTTTATTTCAGCTGGCGTAAAAGTCGCAGAGAAAATGGAATCCCGCCAATTATCCGCCACCGCGCATGAAATGATGAATAACGGTGACGCCTTCGCGCACCAGCAAATGGCTTAGACACAGAAAACCAGCGCCCCGACTCCTGTGGAAAATGCGACACCCTTTTCTGAAATGGACGACTATGAAAAAGCAATGGTCATCCATCAATCGATGAATAATGCCCATTCTTTCGCCCATGAGATCCAGGCTGAGGAGCACCATAAACAGATTAAACACAATTAGTTATTTGCATATTAGCGTCACGTACTGCGGATGGCACAACGTCGCCCCCGCAGAAGCTTACGGATTAACATAATTTTGAGAGGTTAAATGGCGCGCCCTGCAGGATTCGAACCTGCGACCCACGGCTTAGATGTTCCTGGAACTACCTGAGCTAACAACAAGTTATCCCATTACCACGGCGCTCACTCGCCCCATCTTCGAAAAACATGCAAAGCTTTGCAAGCCGATGCAAAGCTTTGTGTGTACCGTTTTTGTCCCAACCCACTCAGCAATCAGTAGCCTCAATCGATCGATAACAACGATCAATTAATAATATAACAATAAGTTTAAACTATCAAATATCACATTATTGATCGTTTATATCGATCAAATCAATTTGTAGTGCTACACTCCAGACCTTTCCGAATCCGCTGATTTTCATAATGTTGAAGTTATTCGCTAAGTACACATCGATCGGTGTTCTTAACACGCTTATTCATTGGGGCGTATTTGCTTTTTGTGTGTATGGGATGCATACGCATCAGGCGTTGGCGAACTTTTCCGGTTTTGTTATCGCTGTATCGTTCAGCTTCTATGCCAATGCGCGCTTCACCTTTAACGCCAGCACCACCACGCTTCGCTACATGATGTACGTGGGGTTTATGGGAACACTGAGCGCCGTTGTTGGCTGGATGGCTGACAAATGTTCCCTGCCGCCACTCCTTACTCTTGTCACCTTTTCAGCTATCAGCCTGATATGCGGATTCATTTATTCAAATTTCATTGTCTTTAGGGATGCGAAATGAAAGTCTCGTTAGTCGTTCCGGTCTTCAATGAAGAAGCCACGATACCTATTTTCTATAAAACGGTTCGCGAGTTTGAAGAGCTAAAACCGTATGAAGTTGAGATTGTTTTCATCAACGACGGAAGCAAAGATGCGACTGAATCAATAATTAACAAAATAGCTGCATCTGATCCGCTCGTTATTCCGCTTTCGTTTACGCGAAACTTCGGTAAAGAACCTGCTCTTTTCGCGGGTCTCGACCATGCAACCGGGGATGCGGTCATTCCTATTGATGTCGATTTACAGGATCCGATAGAAGTTATCCCCCATCTCATTGAGAAGTGGCAGGCTGGCGCGGATATGGTGCTGGCTAAGCGCTCAGACCGCTCAACTGACGGGCGCATGAAGCGTAAGACAGCTGAATGGTTTTATAAGCTGCACAATAAAATCAGCAATCCGAAAATTGAAGAGAACGTCGGTGATTTCCGGTTAATGAGCCGTGAAGTTGTCGAAAATATTAAACAGATGCCAGAACGTAATCTGTTTATGAAAGGCGTTCTGTCATGGGTGGGCGGCAAGACTGATGTAGTTGAATACGCCCGCGCTGAGCGTGTTGCCGGCGATTCAAAATTCAATGGCTGGAAATTATGGAATCTGGCGCTGGAAGGAATAACTTCTTTCTCAACATTTCCGCTCCGCATATGGACTTACATTGGATTGTTTATTGCAGGTATGTCATTCCTTTACGGTGCATGGATGATTATTGATAAATTAATATTTGGAAATAATGTTCCTGGCTACCCGTCTCTTCTTGTTTCTGTACTTTTTCTGGGTGGCGTTCAATTGATAGGAATAGGTATTCTTGGGGAATATATTGGCAGGATTTACATAGAAACAAAAAAAAGGCCTAAGTACATAATTAAGAATGAGAAGAAAAATGGTTAACAATAGATTAAAAATGGTAATCGCTATCCTTATAGTTTTCTCGTTGGTGTATTCAATAGGATTTATCACACCAATGAACTCTGATGACTATACTTATGCCCTAAGAGAGCTTTCGCTTTCTAGCGTAAAAATGCACTATTTGGGATGGAGCGGAAGGGTTGTGTCAGATACGATCAGCACATCTCTATTAAAGTTTTTCTCCCCGCATATTTACAATGCAATAAACTCAGCAGCGCTAACATTAATGGTGTTGTGCTGGACAATGATCCCAGCTACATTAACAAAGTCGTCACCGTCCCCATATGTGATGATTTTCTTATTTTTCTTATACTTCGTTGCAAATCCAGCCCTCGGTCAAACTAACTTCTGGCTTGTTGGGTCAGCAAATTACTTATGGACCAACATGTTCATTGCCATTTATATACTAATTTCTATATATTTAAGCAATGGTAAAAAATCCAATTTAATACTCTTTGTATATGCCATATCATCGATATTTGCAGGTTGCTCAAATGAAAATACATCTCTTGTAGTTGTATTAATTTCTGTAGCATATTTCTTTATAATGAATAGAAATAAATATTTACTGATTGGCGTATTCGGATCTGCAATAGGCGCGGGGGTTCTCCTGCTGGCTCCGGGAAACCTATCCCGTGCGTCTACAATACAAGACTGGTACAATCAACCACTTGCATGGAGGGTTCTTGAGCACTTTTCAGAAAGGCTGCCATCAGCAATGGGGGCATATTGGCAGGTCTATATTGCATTTATAATACTACTAATCTCTGTAGTGTTATCAAGGAATAGTAGCAGCAAACTTATGTTTGGAAGCTTTTTATTCATGCTGGGTGCAATTGCTGCAAATGTTGCGTTCCTTGCATCTCCTGCAATGCCCAGCAGGGCTCTCAATGGAGCACTCTGCTTTATGATTCTTTCGATTTCCTTTGTTGCTCATTCTGCATTTACGAAATTTAACAAGGCTTCCATTTATTTATCTGTTACTACATATGCAATGGCTTTTTTATATTTCATCCCATCATACATCCTCTATTACTCATCAATTAAATCAATAAGCAAACAAACGGAGATTAGGGAAGAAATAATAGACAGGGCGAAACATAACAAACAAGATCAAGCGATAATTCCTGATTATTACTTTCCACCGGTACTGCATGCAGGCCCAAGTCTTGATACGTTTAACAGTGAAGCAATGTCTAGATACTACGGGATTGATTTGAAAATAACTGCTCCAGGATTTTTCGACTATTCACGAGCCTTTAATTTCAAGCCTCTTAATATTAACGCAAAAATATGTAACAACGTTTATATAAATCATTATGGATATATAAACAGCAAATGGGCATTAAAACCTTTGTAATATTTGAGTTTAATAAAAATCCTGCTGATTCTCTTGATGAGAATACAGCGATGTTCATTAGCTTTAAAACAAAGGATGGAAAAATTATAAATGCAGATGTCGACAAGAAGACATTCCAAATAGACGGAAGATGGCTTTCCGGGCGGGCGATTAACGGCATAGATTCAAATGAATTAGAATCAATTACATCTGGGACATGGGACGTTAGAACAGGGGCGAGGACAAATGAGAATATTACGGAAATAATTAAATAACATTCATGTATAATTAATGTTCCCCGCCACCAATGGGTAACGGGCGGCGGGGATTCCTACATATCGTATGTCACAAATGCCCCATCAGGTTTCTTAGCAAGAAGTCGCAATGCACCATCAGTCCCAAAAAAGAACCCAACGGAGCCATAGTGCATTAATCCAGACTCCGGCAACTGACCAATACACGCAAGGATACGCATGTGCCCAAAGCCCATCCGGTTCAACTCCACCACAGATGCAGGAACACCGTCTTTTAGTGCGGATAAACTGCCCCCTACTGGTTTCACTACCGGGTAAAGGAACCTGACAATCTGCGGTCAATGTGAATCGCCACAGGTTTAACAGACACCTCAGAGCCATTTAAGATGACTTAAAGAGAGGTGCCTATGAGCGGTAAACGTTATCCCGAAGAGTTTAAAATTGAAGCGGTAAGACAGGTCGTTGAGCGTGGCCATTCTGTTTCCAGCGCCGCAACACATCTCGATATTACTACCCACAGTCTTTACGCCAGGATAAAGAAGTACGGGTCGAACTCCTCCACCAATAAAGAAATAAAGAGCAGTCAGATGATCAGACCGGAATCCGCCGACTCCCGAAAGAGTTGAAGCGGGTTACTGACGAGCGGGACATATAAAAAAAAGCCGCAATAGATTCAATTGGTCACTGCAACCATGAATGACAAAAGATAGAGTCCTGAATGCGCTTCTGATGGCCGTGTGGCGACGTAATCACCCGCTATGAATGGCAGTCGCTCCTGAAACCGCACGGACTGGAGAGCAGAATGAGCCGTCGTGGCAACTGTCATGACAATGCGGTTGCAGAAAGCTTTTTCCAGCTACTGAAGCGTGAACGGATAAAGGAAAAGATCTACGGAACGAGAGACGAAGCCAGAAGCGATATTTTTGATTACATAGAAATGTTTTATAACAGTAGACGCCGGCATGGTTCGAGCGAGCAGATGCCACCGGCTGAATATGAAAACCTATATCATCAACGACTCAGAAGTGTCCAGGTTATCCGTGGTGATTCACAGATATGCTCACCTTGCGCCTAA
Protein sequences of DBSCAN-SWA_8 >LR134233|3223629:3230620|3223629_3223995_-|VEC93384.1|DBSCAN-SWA MATTLAKTGWRVAIIEQSASMFGGTCINIGCIPTKTLVHDAEREGDFSVAMQRKAAVVNFLRDKNFHNLADLDNVDVIEGRAEFIDNHTLRVFQADGERVLRGEKIFINTGAESVIPAITA >LR134233|3223629:3230620|3227007_3227934_+|VEC93388.1|DBSCAN-SWA MKVSLVVPVFNEEATIPIFYKTVREFEELKPYEVEIVFINDGSKDATESIINKIAASDPLVIPLSFTRNFGKEPALFAGLDHATGDAVIPIDVDLQDPIEVIPHLIEKWQAGADMVLAKRSDRSTDGRMKRKTAEWFYKLHNKISNPKIEENVGDFRLMSREVVENIKQMPERNLFMKGVLSWVGGKTDVVEYARAERVAGDSKFNGWKLWNLALEGITSFSTFPLRIWTYIGLFIAGMSFLYGAWMIIDKLIFGNNVPGYPSLLVSVLFLGGVQLIGIGILGEYIGRIYIETKKRPKYIIKNEKKNG >LR134233|3223629:3230620|3229622_3229781_-|VEC93390.1|DBSCAN-SWA MGFGHMRILACIGQLPESGLMHYGSVGFFFGTDGALRLLAKKPDGAFVTYDM >LR134233|3223629:3230620|3229952_3230171_+|VEC93391.1|transposase|DBSCAN-SWA MSGKRYPEEFKIEAVRQVVERGHSVSSAATHLDITTHSLYARIKKYGSNSSTNKEIKSSQMIRPESADSRKS >LR134233|3223629:3230620|3230347_3230620_+|VEC93392.1|transposase|DBSCAN-SWA MSRRGNCHDNAVAESFFQLLKRERIKEKIYGTRDEARSDIFDYIEMFYNSRRRHGSSEQMPPAEYENLYHQRLRSVQVIRGDSQICSPCA >LR134233|3223629:3230620|3227926_3229312_+|VEC93389.1|DBSCAN-SWA MVNNRLKMVIAILIVFSLVYSIGFITPMNSDDYTYALRELSLSSVKMHYLGWSGRVVSDTISTSLLKFFSPHIYNAINSAALTLMVLCWTMIPATLTKSSPSPYVMIFLFFLYFVANPALGQTNFWLVGSANYLWTNMFIAIYILISIYLSNGKKSNLILFVYAISSIFAGCSNENTSLVVVLISVAYFFIMNRNKYLLIGVFGSAIGAGVLLLAPGNLSRASTIQDWYNQPLAWRVLEHFSERLPSAMGAYWQVYIAFIILLISVVLSRNSSSKLMFGSFLFMLGAIAANVAFLASPAMPSRALNGALCFMILSISFVAHSAFTKFNKASIYLSVTTYAMAFLYFIPSYILYYSSIKSISKQTEIREEIIDRAKHNKQDQAIIPDYYFPPVLHAGPSLDTFNSEAMSRYYGIDLKITAPGFFDYSRAFNFKPLNINAKICNNVYINHYGYINSKWALKPL >LR134233|3223629:3230620|3224258_3225113_+|VEC93385.1|DBSCAN-SWA MDALSRLLTLNAPQGSIDKNCPLGGDWQLPHAAGELSVIRWHTVTQGEAQLEIPTGDAMILTPGKVVILPQNSAHRLRQSGEAPTHIVCGSLRLHTTSRYFLTALPEVLCLAPPPHSPASIWLNAAILLLQQESERHLPGADVLYSQQCATLFTLAVRDWLSQAGTAKSVLNLLLHPRLGRVILHMLETPAHPWTVETLAQRVHMSRASFAQLFRDVSATTPLAVLTTLRLQIAAQTLSREALPVMVIAESVGYASESSFHKAFVREFGCTPGEYRKRVNALGR >LR134233|3223629:3230620|3225163_3225598_-|VEC93386.1|DBSCAN-SWA MFRRDWRTGVFNNNMYLVVRFFYLKSNGFTRRRIAHRIRESITDGSAQHDGIAGDPAGAAFKAQAPNTFSAEKLDEALHHGAVLRVRPKAMTVAVIIAGLLPVLWGTGAGSEVMSRITAPLLSLFIIPAAYKLMWLRRHRRLAA >LR134233|3223629:3230620|3226648_3227011_+|VEC93387.1|DBSCAN-SWA MLKLFAKYTSIGVLNTLIHWGVFAFCVYGMHTHQALANFSGFVIAVSFSFYANARFTFNASTTTLRYMMYVGFMGTLSAVVGWMADKCSLPPLLTLVTFSAISLICGFIYSNFIVFRDAK |
9 | Salmonella_phage(57.14%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|