Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
CP028151 | Salmonella enterica subsp. enterica serovar Enteritidis str. RM2968 chromosome, complete genome | 2 crisprs | WYL,DinG,DEDDh,cas3,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK | 0 | 8 | 8 | 0 |
CP028153 | Salmonella enterica subsp. enterica serovar Enteritidis str. RM2968 plasmid pRM2968-2, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
CP028154 | Salmonella enterica subsp. enterica serovar Enteritidis str. RM2968 plasmid pRM2968-3, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
CP028152 | Salmonella enterica subsp. enterica serovar Enteritidis str. RM2968 plasmid pRM2968-1, complete sequence | 0 crisprs | cas14j | 0 | 0 | 1 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP028151_1 | 2944818-2945395 | TypeI-E |
I-E
Consensus repeat of CP028151_1
|
9 spacers
spacers of CP028151_1
>1.1|2944847|32|CP028151|CRISPRCasFinder,CRT TATTTATAAGCGTGTCATCTATGCAACCCAAC >1.2|2944908|32|CP028151|CRISPRCasFinder,CRT ACCTGCCCGACCCAATAAGGGGGCCCTCGTGA >1.3|2944969|32|CP028151|CRISPRCasFinder,CRT GGCCGCTGGTCAAATTCCCAATCTGAGCAATC >1.4|2945030|32|CP028151|CRISPRCasFinder,CRT ATAGCCCCGGCAGCGATAGCTAAACCAGTTCC >1.5|2945091|32|CP028151|CRISPRCasFinder,CRT GCCTCAAAATCTCTCGGTGAGATGTAAGCGTC >1.6|2945152|32|CP028151|CRISPRCasFinder,CRT ACCAGTGGTCAGCGGCGGATGAATTTGCCCTG >1.7|2945213|32|CP028151|CRISPRCasFinder,CRT GAGAATGCTCATGCGCGTGAGCGCCATATATT >1.8|2945274|32|CP028151|CRISPRCasFinder,CRT CATGGCAATTTTACGGCGGACGTGCTCGCTCT >1.9|2945335|32|CP028151|CRISPRCasFinder,CRT AGGCGGACCGAAAAACCGTTTTCAGCCAACGT >1.10|2944848|33|CP028151|PILER-CR TATTTATAAGCGTGTCATCTATGCAACCCAACC >1.11|2944909|33|CP028151|PILER-CR ACCTGCCCGACCCAATAAGGGGGCCCTCGTGAC >1.12|2944970|33|CP028151|PILER-CR GGCCGCTGGTCAAATTCCCAATCTGAGCAATCC >1.13|2945031|33|CP028151|PILER-CR ATAGCCCCGGCAGCGATAGCTAAACCAGTTCCC >1.14|2945092|33|CP028151|PILER-CR GCCTCAAAATCTCTCGGTGAGATGTAAGCGTCC >1.15|2945153|33|CP028151|PILER-CR ACCAGTGGTCAGCGGCGGATGAATTTGCCCTGC >1.16|2945214|33|CP028151|PILER-CR GAGAATGCTCATGCGCGTGAGCGCCATATATTC >1.17|2945275|33|CP028151|PILER-CR CATGGCAATTTTACGGCGGACGTGCTCGCTCTC >1.18|2945336|33|CP028151|PILER-CR AGGCGGACCGAAAAACCGTTTTCAGCCAACGTC |
cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3 |
CRISPR arrays and Neighbor proteins around CP028151_1
The CRISPR arrays of CP028151_1 >merge|CP028151|1|2944818-2945395|CRISPRCasFinder,CRT,PILER-CR GTGTTTATCCCCGCTGACGCGGGGAACACTATTTATAAGCGTGTCATCTATGCAACCCAACCGGTTTATCCCCGCTGGCGCGGGGAACACACCTGCCCGACCCAATAAGGGGGCCCTCGTGACGGTTTATCCCCGCTGGCGCGGGGAACACGGCCGCTGGTCAAATTCCCAATCTGAGCAATCCGGTTTATCCCCGCTGGCGCGGGGAACACATAGCCCCGGCAGCGATAGCTAAACCAGTTCCCGGTTTATCCCCGCTGGCGCGGGGAACACGCCTCAAAATCTCTCGGTGAGATGTAAGCGTCCGGTTTATCCCCGCTGGCGCGGGGAACACACCAGTGGTCAGCGGCGGATGAATTTGCCCTGCGGTTTATCCCCGCTGGCGCGGGGAACACGAGAATGCTCATGCGCGTGAGCGCCATATATTCGGTTTATCCCCGCTGGCGCGGGGAACACCATGGCAATTTTACGGCGGACGTGCTCGCTCTCGGTTTATCCCCGCTGGCGCGGGGAACACAGGCGGACCGAAAAACCGTTTTCAGCCAACGTCGGTTTATCCCCGCTGGCGCGGGGAACAC >CP028151|1|1|2944818-2945395|CRISPRCasFinder GTGTTTATCCCCGCTGACGCGGGGAACAC TATTTATAAGCGTGTCATCTATGCAACCCAAC CGGTTTATCCCCGCTGGCGCGGGGAACAC ACCTGCCCGACCCAATAAGGGGGCCCTCGTGA CGGTTTATCCCCGCTGGCGCGGGGAACAC GGCCGCTGGTCAAATTCCCAATCTGAGCAATC CGGTTTATCCCCGCTGGCGCGGGGAACAC ATAGCCCCGGCAGCGATAGCTAAACCAGTTCC CGGTTTATCCCCGCTGGCGCGGGGAACAC GCCTCAAAATCTCTCGGTGAGATGTAAGCGTC CGGTTTATCCCCGCTGGCGCGGGGAACAC ACCAGTGGTCAGCGGCGGATGAATTTGCCCTG CGGTTTATCCCCGCTGGCGCGGGGAACAC GAGAATGCTCATGCGCGTGAGCGCCATATATT CGGTTTATCCCCGCTGGCGCGGGGAACAC CATGGCAATTTTACGGCGGACGTGCTCGCTCT CGGTTTATCCCCGCTGGCGCGGGGAACAC AGGCGGACCGAAAAACCGTTTTCAGCCAACGT CGGTTTATCCCCGCTGGCGCGGGGAACAC >CP028151|1|1|2944818-2945395|CRT GTGTTTATCCCCGCTGACGCGGGGAACAC TATTTATAAGCGTGTCATCTATGCAACCCAAC CGGTTTATCCCCGCTGGCGCGGGGAACAC ACCTGCCCGACCCAATAAGGGGGCCCTCGTGA CGGTTTATCCCCGCTGGCGCGGGGAACAC GGCCGCTGGTCAAATTCCCAATCTGAGCAATC CGGTTTATCCCCGCTGGCGCGGGGAACAC ATAGCCCCGGCAGCGATAGCTAAACCAGTTCC CGGTTTATCCCCGCTGGCGCGGGGAACAC GCCTCAAAATCTCTCGGTGAGATGTAAGCGTC CGGTTTATCCCCGCTGGCGCGGGGAACAC ACCAGTGGTCAGCGGCGGATGAATTTGCCCTG CGGTTTATCCCCGCTGGCGCGGGGAACAC GAGAATGCTCATGCGCGTGAGCGCCATATATT CGGTTTATCCCCGCTGGCGCGGGGAACAC CATGGCAATTTTACGGCGGACGTGCTCGCTCT CGGTTTATCCCCGCTGGCGCGGGGAACAC AGGCGGACCGAAAAACCGTTTTCAGCCAACGT CGGTTTATCCCCGCTGGCGCGGGGAACAC >CP028151|1|1|2944820-2945395|PILER-CR 
GTTTATCCCCGCTGACGCGGGGAACACT ATTTATAAGCGTGTCATCTATGCAACCCAACCG GTTTATCCCCGCTGGCGCGGGGAACACA CCTGCCCGACCCAATAAGGGGGCCCTCGTGACG GTTTATCCCCGCTGGCGCGGGGAACACG GCCGCTGGTCAAATTCCCAATCTGAGCAATCCG GTTTATCCCCGCTGGCGCGGGGAACACA TAGCCCCGGCAGCGATAGCTAAACCAGTTCCCG GTTTATCCCCGCTGGCGCGGGGAACACG CCTCAAAATCTCTCGGTGAGATGTAAGCGTCCG GTTTATCCCCGCTGGCGCGGGGAACACA CCAGTGGTCAGCGGCGGATGAATTTGCCCTGCG GTTTATCCCCGCTGGCGCGGGGAACACG AGAATGCTCATGCGCGTGAGCGCCATATATTCG GTTTATCCCCGCTGGCGCGGGGAACACC ATGGCAATTTTACGGCGGACGTGCTCGCTCTCG GTTTATCCCCGCTGGCGCGGGGAACACA GGCGGACCGAAAAACCGTTTTCAGCCAACGTCG GTTTATCCCCGCTGGCGCGGGGAACAC
>CP028151.1|AWP51456.1|2943756_2944803_+|aminopeptidase MFSATRRFAVILALGVGFILPAQAASPGPGEIANTQARHIATFFPGRMTGSPAEMLSADYLRQQFTQMGYQSDIRTFNSRFIYTTKDNRKNWHNVTGSTVIAAHEGRVPQQIIIMAHLDTYAPQSDADVDANLGGLTLQGMDDNAAGLGVMLELAARLKDIPTHYGIRFIATSGEEEGKLGAENLLKRMSDAEKKNTLLVINLDNLIVGDKLYFNSGKNTPEAVRTLTRDRALAIARRYGIAANTNPGRNPSYPKGTGCCNDAEVFDKAGISVLSVEATNWNLGKKDGYQQRVKNASFPNGNSWHDVRLDNQQHIDKALPGRIERRSRDVVRIMLPLVKELAKAEKTS >CP028151.1|AWP51455.1|2942597_2943506_-|sulfate-adenylyltransferase-subunit-2 MDQKRLTHLRQLEAESIHIIREVAAEFANPVMLYSIGKDSSVMLHLARKAFYPGTLPFPLLHVDTGWKFREMYAFRDRTANAYGCELLVHKNPEGVAMGINPFVHGSAKHTDIMKTEGLKQALNKYGFDAAFGGARRDEEKSRAKERIYSFRDRFHRWDPKNQRPELWRNYNGQINKGESIRVFPLSNWTEQDIWQYIWLENIDIVPLYLAAERPVLERDGMLMMVDDDRIDLQPGEVIKKRMVRFRTLGCWPLTGAVESHAQTLPEIIEEMLVSTTSERQGRMIDRDQAGSMELKKRQGYF >CP028151.1|AWP51454.1|2941148_2942588_-|sulfate-adenylyltransferase MNTILAQQIANEGGVEAWMIAQQHKSLLRFLTCGSVDDGKSTLIGRLLHDTLQIYEDQLSSLHNDSKRHGTQGEKLDLALLVDGLQAEREQGITIDVAYRYFSTEKRKFIIADTPGHEQYTRNMATGASTCDLAILLIDARKGVLDQTRRHSFISTLLGIKHLVVAINKMDLVDYCEETFARIREDYLTFAEQLPGDLDIRFVPLSALEGDNVAAQSANMRWYSGPTLLEVLETVDIQRAVDRQPMRFPVQYVNRPNLDFRGYAGTLASGSVKVGERIKVLPSGVESSVARIVTFDGDKEEACAGEAITLVLNDDIDISRGDLLLAANETLAPARHAAIDVVWMAEQPLAPGQSYDVKLAGKKTRARIEAIRYQIDINNLTQRDVESLPLNGIGLVEMTFDEPLALDIYQQNPVTGGLIFIDRLSNVTVGAGMVRELDERGATPPVEYSAFELELNALVRRHFPHWDARDLLGDKHGAA >CP028151.1|AWP51453.1|2940556_2941162_-|adenylyl-sulfate-kinase MALHDENVVWHSHPVTVAAREQLHGHCGVVLWFTGLSGSGKSTVAGALEEALHQRGVSTYLLDGDNVRHGLCRDLGFSDADRQENIRRVGEVASLMADAGLIVLTAFISPHRAERQLVKERVGHDRFIEIYVNTPLAICEQRDPKGLYKKARAGELRNFTGIDAIYEAPDSPQVHLNGEQLVTNLVSQLLDLLRRRDIIRS >CP028151.1|AWP51452.1|2940182_2940539_-|DUF3561-domain-containing-protein MPGMVKVTGFNMRNSHNITFTRSDAFMVDDDATSAFPGAVVGFVSWLLALGIPFLLYGPNTLFFFLYTWPFFLALMPVSVIIGIALHLLVKGKILFSIMFTLLAVGALFGALFIWLLG >CP028151.1|AWP51451.1|2939680_2939992_-|cell-division-protein-FtsB MGKLTLLLLALLVWLQYSLWFGKNGIHDYSRVNDDVVAQQATNAKLKARNDQLFAEIDDLNGGQEAIEERARNELSMTKPGETFYRLVPDASKRAATAGQTHR 
>CP028151.1|AWP51450.1|2938951_2939662_-|2-C-methyl-D-erythritol-4-phosphate-cytidylyltransferase MAATLLDVCAVVPAAGFGRRMQTECPKQYLSIGNKTILEHSVHALLAHPRVTRVVIAISPGDHRFAQLPLANHPQITVVDGGNERADSVLAGLQAVAKAQWVLVHDAARPCLHQDDLARLLTISENSRVGGILASPVRDTMKRGEPGKNAIAHTVERADLWHALTPQFFPRELLHDCLTRALNEGATITDEASALEYCGFHPALVEGRADNIKVTRPEDLALAEFYLTRTIHQEKA >CP028151.1|AWP51449.1|2938472_2938952_-|2-C-methyl-D-erythritol-2,4-cyclodiphosphate-synthase MRIGHGFDVHAFGGEGPIIIGGVRISYEKGLLAHSDGDVALHALTDALLGAAALGDIGKLFPDTDPAFKGADSRELLREAWRRIQAKGYTLGNVDVTIIAQAPKMLPHIPQMRVFIAEDLGCHMDDVNVKATTTEKLGFTGRGEGIACEAVALLMKAAK >CP028151.1|AWP51448.1|2937426_2938476_-|tRNA-pseudouridine(13)-synthase-TruD MTEFDNLTWLHGKPQGSGLLKANPEDFVVVEDLGFTPDGEGEHILLRILKNGCNTRFVADALAKFLKIHAREVSFAGQKDKHAVTEQWLCARVPGKEMPDFSAFQLEGCKVLEYARHKRKLRLGALKGNAFTLVLREISDRRDVETRLQAIRDGGVPNYFGAQRFGIGGSNLQGALRWAQSNAPVRDRNKRSFWLSAARSALFNQIVHQRLKKPDFNQVVDGDALQLAGRGSWFVATSEELPELQRRVDEKELMITASLPGSGEWGTQRAALAFEQDAIAQETVLQSLLLREKVEASRRAMLLYPQQLSWNWWDDVTVELRFWLPAGSFATSVVRELINTMGDYAHIAE >CP028151.1|AWP51447.1|2936684_2937446_-|5'/3'-nucleotidase-SurE MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFDNGDIAVQMGTPTDCVYLGVNALMRPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLNGYQHYDTAAAVTCALLRGLSREPLRTGRILNVNVPDLPLAQVKGIRVTRCGSRHPADKVIPQEDPRGNTLYWIGPPGDKYDAGPDTDFAAVDEGYVSVTPLHVDLTAHSAHDVVSDWLDSVGVGTQW >CP028151.1|AWP51457.1|2945491_2945785_-|type-I-E-CRISPR-associated-endoribonuclease-Cas2 MSMVVVVTENVPPRLRGRLAVWLLEVRAGVYVGDTSKRIREMIWQQITQLGGVGNVVMAWATNTESGFEFQTWGENRRIPVDLDGLRLVSFLPVENQ >CP028151.1|AWP51458.1|2945784_2946705_-|type-I-E-CRISPR-associated-endonuclease-Cas1 MTFVPLNPIPLKDRTSMIFLQYGQIDVLDGAFVLIDKTGIRTHIPVGSVACIMLEPGTRVSHAAVHLASTVGTLLVWVGEAGVRVYSSGQPGGARADKLLYQAKLALDDDLRLKVVRKMYELRFREPPPARRSVEQLRGIEGSRVRATYALLAKQYGVKWHGRNYDPKDWEKGDVVNRCISAATSCLYGISEAAILAAGYAPAIGFIHSGKPLSFVYDIADIIKFESVVPKAFEIAARHPAEPDKEVRLACRDIFRSSKLTGKLIPLIEEVLAAGEIEPPQPAPDMLPPAIPEPESLGDSGHRGHG 
>CP028151.1|AWP51459.1|2946701_2947352_-|type-I-E-CRISPR-associated-protein-Cas6/Cse3/CasE MYLSRITLHTSELSPAQLLHLVERGEYVMHQWLWDLFPGGKERQFLYRREELQGAFRFFVLSQEQPAASTIFDVQTRPFAPMLSAGQTLRFNLRANPTICKNGKRHDLLMEAKRQRKTQGDSQDIWSYQQQAALEWLARQGEQNGFTLREASVDAYRQQQIRREKSRQMIQFSSVDYTGVLVINEPALFLQRLAQGYGKSRAFGCGMMMIKPGDDA >CP028151.1|AWP51460.1|2947333_2948080_-|type-I-E-CRISPR-associated-protein-Cas5/CasD MSQYLVFQLHGPMASWGVDAPGEVRHSHELPSRSALLGLLAAALGIRRDEEERLNTFNRHYQFLLCASGNPRWARDYHTVQMPKEVRKARYFSRREELQDPELLSALISRRDYYTDAWWMIAVSATPDAPYTLAQLQAALQHPVFPLYLGRKSHPLALPLAPQLLEGNAADVLREAYRWYQDQFNALKLTLPGLQNECWWEGEHDGLTANKILRRRDMPLSRQQWLFGERSVNQGPWLRKEDACISQE >CP028151.1|AWP51461.1|2948090_2949149_-|type-I-E-CRISPR-associated-protein-Cas7/Cse4/CasC MTTFIQLHLLTAYPAANLNRDDTGAPKTVVLGGATRLRISSQSLKRAWRTSELFEQALAGHIGIRTGRIAREAAQILVDSGIDAKKAVEYVKNIANCFGKVKEDKKPKDELTNAETEQLVHISPAEFEAVKALARRLAEEKRPATEEEAELLRHDRMAVDIAMFGRMLAKKTDFNVEAACQVAHAFGVSETIIEDDFFTAVDDLRQASAEDAGAGHLGETGFGSALFYTYICIDKDLLVKNLNGNEELANKTLRAFTEAALKVSPTGKQNSFASRAYASWALAEKGTDQPRSLAAAFYEPINGTDQLNVAVKRITALHENMNEVYAQETAFKNFNVMNQQGSMKDVLDFICA >CP028151.1|AWP51462.1|2949162_2949717_-|type-I-E-CRISPR-associated-protein-Cse2/CasB MSVVTKDDKATLRQWHDELQEKRGLRASLRRSKTVNDACLAEGLHSLLMQTHSLWKNKAPWNVTALAITAALAAHIKFIDEQKSFAAQLGQKKGGDTPVMSKLRFSHLLAVKTPDELLRQLRRAVKLLDGSVNLFSLADDIFCWCQEQNDLLNHHRRQQRPTEFLRIRWALEYYQAGDTDNEQD >CP028151.1|AWP51463.1|2949713_2951270_-|type-I-E-CRISPR-associated-protein-Cse1/CasA MDNFSLLTTPWLPVRFKDGSTGKLAPVDLADENVVDIAATRADLQGAAWQFLLGLLQCSIAPKRYKNWEDIWFDGLHADVLHKALAPLEHAFQFGAESPSFMQDFEPLSGEKVSIASLLPEIPGAQTTKFNKDHFVKRGVTERFCPHCAALALFSLQLNAPAGGKGYRTGLRGGGPLTTLVELQEYQGERQTPIWRKLWLNVMPQDTADLPLPDQCDATVFPWLAATRTSEQANAVTTPEQVNKLQAYWGMPRRIRLDFATLQSGCCDICGAESDELLGFMTVKNYGVNYDGWRHPLTPYRAPVKDQNAFFSVKPQPGGLIWRDWLGLSQNNQTEANYESPAQVVKVFNARSLTDVKAGIRGFGADFDNMKIRCWYEHHFPLLMTEGLIPDLRKAVQTAARLLSLLRSALKEAWFTNAKDARGDFSFIDIDFWNLTQGRFLNLIHDLENGHKPDERLNKWQRELWLFTRCYFDDHVFTNPYESSDLERIMKARKKYFTSSAEKQSAKAAKAKKQEAAE 
>CP028151.1|AWP51464.1|2951281_2953945_-|CRISPR-associated-helicase/endonuclease-Cas3 MSIYHYWGKSRRGETDGGDDYHLLCWHSLDVAAVGYWMVINNIYFIDHYLKKLGIQDKEQAAQFFAWILCWHDIGKFAHSFQQLYRHEALNIFNEPTRHYEKIAHTTLGYMLWNSWLSECPELFPPSSLSVRKSKRVMALWMPVTTGHHGRPPEAIQELDHFRQQDKDAARDFLLRIKALFPLITLPEAWDEDEGIDQFQQLSWFISAAVVLADWTGSASRYFPRTAEKMPVDTYWQQALAKAQTAITLFPSAANVSAFTGIETLFPFIQHPTPLQQKALELDINVDGAQLFILEDVTGAGKTEAALILAHRLMAAGKAQGLYFGLPTMATANAMFERMANTWLALYQPDSRPSLILAHSARRLMDRFNQSIWSVTLSGTEEPDEAQPYSQGCAAWFADSNKKALLAEVGVGTLDQAMMAVMPFKHNNLRLLGLSNKILLADEIHACDAWMSRILEGLIERQASNGNATILLSATLSQQQRDKLVAAFSRGVRRSVQAPLLGHDDYPWLTQVTQTELISQRVDTRKEVERCVDIGWLHSEEACLERIGEAVEKGNCIAWIRNSVDDAIRIYRQLQLSKVVATENLLLFHSRFAFHDRQRIESQTLNLFGKQSGAQRAGKVIIATQVIEQSLDIDCDEMISDLAPVDLLIQRAGRLQRHIRDRNGLVKKSGQDERETPVLRILAPEWDDAPRENWLSSAMRNSAYVYPDHGRMWLTQRILREQGTIRMPQSARLLIESVYGEDVNMPVGFAKTEQLQEGKFYCDRAFAGQMLLNFAPGYCAEISDSLPEKMSTRLAEESVTLWLAKIVDSVVTPYASGEHAWEMSVLRVRQSWWNKHKDEFEKLDGEPLRKWCAQQHQDKDFATVIVVTDFAACGYSANEGLIGMMGE >CP028151.1|AWP53110.1|2954066_2954273_+|hypothetical-protein MFFRNIYPTNNYYKFFRLFLLEEALKEASSNAFCINQNLKKTTICGESSTFCLNYRHHKGLPTINVLY >CP028151.1|AWP53109.1|2954388_2955342_+|SPI-1-type-III-secretion-system-effector-SopD MPVTLSFGNHQNYTLNESRLAHLLSADKEKAIHMGGWDKVQDHFRAEKKDHALEVLHSIIHGQGRGEPGEMEVNVEDINKIYAFKRLQHLACPAHQDLFTIKMDASQTQFLLMVGDTVISQSNIKDILNISDDAVIESMSREERQLFLQICEVIGAKMTWHPELLQESISTLRKEVTGNAQIKAAVYEMMRPAEAPDHPLVEWQDSLTEDEKSMLACINAGNFEPTTQFCKIGYQEVQGEVAFSMMHPCISYLLHTYSPFAEFKPTNSGFLKKLNQDYNDYHAKKMFIDVILEKLYLTHERSLHIGKDGCSRNILLT |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP028151_2 | 2961547-2962063 | TypeI-E |
I-E
Consensus repeat of CP028151_2
|
8 spacers
spacers of CP028151_2
>2.1|2961576|32|CP028151|CRISPRCasFinder,CRT GGCTACACGCAAAAATTCCAGTCGTTGGCGCA >2.2|2961637|32|CP028151|CRISPRCasFinder,CRT CCGATTAAGATCCGCAGTCTGCATCAGTAACT >2.3|2961698|32|CP028151|CRISPRCasFinder,CRT CGATTCTACGGCAACAGGCCAGGCTGCGACCG >2.4|2961759|32|CP028151|CRISPRCasFinder,CRT ATGACATGACCGTGACTAGCGAGAATGTCCCA >2.5|2961820|32|CP028151|CRISPRCasFinder,CRT TCATGCGCTATAAAAATCAGACTGTCACATGC >2.6|2961881|32|CP028151|CRISPRCasFinder,CRT TGATTATTGACGACAACAGCACAGACCGGCAG >2.7|2961942|32|CP028151|CRISPRCasFinder,CRT AATAATCGGCAATTTGTCCTGGACAGGCACGG >2.8|2962003|32|CP028151|CRISPRCasFinder,CRT GAATCTGGAGGCCAACAGCGCGGCGAAATCCT >2.9|2961637|34|CP028151|PILER-CR CCGATTAAGATCCGCAGTCTGCATCAGTAACTCG >2.10|2961698|34|CP028151|PILER-CR CGATTCTACGGCAACAGGCCAGGCTGCGACCGCG >2.11|2961759|34|CP028151|PILER-CR ATGACATGACCGTGACTAGCGAGAATGTCCCACG >2.12|2961820|34|CP028151|PILER-CR TCATGCGCTATAAAAATCAGACTGTCACATGCCG >2.13|2961881|34|CP028151|PILER-CR TGATTATTGACGACAACAGCACAGACCGGCAGCA >2.14|2961942|34|CP028151|PILER-CR AATAATCGGCAATTTGTCCTGGACAGGCACGGCA >2.15|2962003|34|CP028151|PILER-CR GAATCTGGAGGCCAACAGCGCGGCGAAATCCTCA |
cas3 |
CRISPR arrays and Neighbor proteins around CP028151_2
The CRISPR arrays of CP028151_2 >merge|CP028151|2|2961547-2962063|CRISPRCasFinder,CRT,PILER-CR ACGGCTATCCTTGTTGGCGCGGGGAACACGGCTACACGCAAAAATTCCAGTCGTTGGCGCACGGTTTATCCCCGCTGGCGCGGGGAACACCCGATTAAGATCCGCAGTCTGCATCAGTAACTCGGTTTATCCCCGCTGGCGAGGGGAACACCGATTCTACGGCAACAGGCCAGGCTGCGACCGCGGTTTATCCCCGCTGGCGCGGGGAACACATGACATGACCGTGACTAGCGAGAATGTCCCACGGTTTATCCCCGCTGGCGCGGGGAACACTCATGCGCTATAAAAATCAGACTGTCACATGCCGGTTTATCCCCGCTGGCGCGGGGAACACTGATTATTGACGACAACAGCACAGACCGGCAGCAGTTTATCCCCGCTGGCGCGGGGAACACAATAATCGGCAATTTGTCCTGGACAGGCACGGCAGTTTATCCCCGCTGGCGCGGGGAACACGAATCTGGAGGCCAACAGCGCGGCGAAATCCTCAGTTTATCCCCGCTGGCGCGGGGAACAC >CP028151|2|2|2961547-2962063|CRISPRCasFinder ACGGCTATCCTTGTTGGCGCGGGGAACAC GGCTACACGCAAAAATTCCAGTCGTTGGCGCA CGGTTTATCCCCGCTGGCGCGGGGAACAC CCGATTAAGATCCGCAGTCTGCATCAGTAACT CGGTTTATCCCCGCTGGCGAGGGGAACAC CGATTCTACGGCAACAGGCCAGGCTGCGACCG CGGTTTATCCCCGCTGGCGCGGGGAACAC ATGACATGACCGTGACTAGCGAGAATGTCCCA CGGTTTATCCCCGCTGGCGCGGGGAACAC TCATGCGCTATAAAAATCAGACTGTCACATGC CGGTTTATCCCCGCTGGCGCGGGGAACAC TGATTATTGACGACAACAGCACAGACCGGCAG CAGTTTATCCCCGCTGGCGCGGGGAACAC AATAATCGGCAATTTGTCCTGGACAGGCACGG CAGTTTATCCCCGCTGGCGCGGGGAACAC GAATCTGGAGGCCAACAGCGCGGCGAAATCCT CAGTTTATCCCCGCTGGCGCGGGGAACAC >CP028151|2|2|2961547-2962063|CRT ACGGCTATCCTTGTTGGCGCGGGGAACAC GGCTACACGCAAAAATTCCAGTCGTTGGCGCA CGGTTTATCCCCGCTGGCGCGGGGAACAC CCGATTAAGATCCGCAGTCTGCATCAGTAACT CGGTTTATCCCCGCTGGCGAGGGGAACAC CGATTCTACGGCAACAGGCCAGGCTGCGACCG CGGTTTATCCCCGCTGGCGCGGGGAACAC ATGACATGACCGTGACTAGCGAGAATGTCCCA CGGTTTATCCCCGCTGGCGCGGGGAACAC TCATGCGCTATAAAAATCAGACTGTCACATGC CGGTTTATCCCCGCTGGCGCGGGGAACAC TGATTATTGACGACAACAGCACAGACCGGCAG CAGTTTATCCCCGCTGGCGCGGGGAACAC AATAATCGGCAATTTGTCCTGGACAGGCACGG CAGTTTATCCCCGCTGGCGCGGGGAACAC GAATCTGGAGGCCAACAGCGCGGCGAAATCCT CAGTTTATCCCCGCTGGCGCGGGGAACAC >CP028151|2|2|2961610-2962063|PILER-CR GTTTATCCCCGCTGGCGCGGGGAACAC CCGATTAAGATCCGCAGTCTGCATCAGTAACTCG GTTTATCCCCGCTGGCGAGGGGAACAC CGATTCTACGGCAACAGGCCAGGCTGCGACCGCG GTTTATCCCCGCTGGCGCGGGGAACAC ATGACATGACCGTGACTAGCGAGAATGTCCCACG 
GTTTATCCCCGCTGGCGCGGGGAACAC TCATGCGCTATAAAAATCAGACTGTCACATGCCG GTTTATCCCCGCTGGCGCGGGGAACAC TGATTATTGACGACAACAGCACAGACCGGCAGCA GTTTATCCCCGCTGGCGCGGGGAACAC AATAATCGGCAATTTGTCCTGGACAGGCACGGCA GTTTATCCCCGCTGGCGCGGGGAACAC GAATCTGGAGGCCAACAGCGCGGCGAAATCCTCA GTTTATCCCCGCTGGCGCGGGGAACAC
>CP028151.1|AWP51469.1|2960650_2961448_-|MBL-fold-metallo-hydrolase MALRIRVLLENHKGAGADKSLKARPGLSLLVEDESTSILFDTGPDGSFMQNALAMGIDLSDVSAVVLSHGHYDHCGGVPWLPDNSRIICHPDIARERYAAMTFLGITRKIKKLSCEVDYSRYRMMYTRDPLPIGKNFIWSGEIPVVAPEAYGIFGGHDAEPDSILDEGVLIYQSTKGLVIITGCGHRGIANIVRHCQNITGIKRIYALVGGFHLRCASPFTLWRVRRFLQEQKPEKLCGCHCTGAWGRLWLPEITAPATGDVLRF >CP028151.1|AWP51468.1|2960200_2960563_+|6-carboxytetrahydropterin-synthase-QueD MSTTLYKDFTFEAAHRLPHVPEGHKCGRLHGHSFMVRLEITGEVDPHTGWIMDFADLKAAFKPTYDRLDHYYLNDIPGLSNPTSEVLAKWIWDQVKPVVPLLSAVMVKETCTAGCVYRGE >CP028151.1|AWP53112.1|2960101_2960272_-|6-pyruvoyl-tetrahydrobiopterin-synthase MPFRHVRQAVSGFKGEIFIQRGGHDELSGNKKTAVGYRKAHFLSLTKTILRLVTGN >CP028151.1|AWP51467.1|2957977_2959777_-|NADPH-dependent-assimilatory-sulfite-reductase-flavoprotein-subunit MTTPAPLTGLLPLNPEQLARLQAATTDLTPEQLAWVSGYFWGVLNPRSGVVAVTPVPERKMPGVTLISASQTGNARRVAEALRDDLLAANLNVTLVNAGDYKFKQIASEKLLVIVTSTQGEGEPPEEAVALHKFLFSKKAPKLENTAFAVFSLGDTSYEFFCQSGKDFDSKLAELGGERLLDRVDADVEYQAAASEWRARVVDVLKSRAPVAAPSQSVATGAVNDIHTSPYTKDAPLIATLSVNQKITGRNSEKDVRHIEIDLGDSGLRYQPGDALGVWYQNDPALVKELVELLWLKGDEPVTVDGKTLPLAEALEWHFELTVNTANIVENYATLTRSESLLPLVGDKAQLQHYAATTPIVDMVRFSPAQLDAEALIGLLRPLTPRLYSIASAQAEVESEVHVTVGVVRYDIEGRARAGGASSFLADRVEEEGEVRVFIEHNDNFRLPANPQTPVIMIGPGTGIAPFRAFMQQRAADGAEGKNWLFFGNPHFTEDFLYQVEWQRYVKEGVLSRIDLAWSRDQKEKIYVQDKLREQGAELWCWINDGAHIYVCGDARRMAADVEKALLEVIAEFGGMDLESADEYLSELRVERRYQRDVY >CP028151.1|AWP51466.1|2956265_2957978_-|assimilatory-sulfite-reductase-(NADPH)-hemoprotein-subunit 
MSEKHPGPLVVEGKLSDAERMKLESNYLRGTIAEDLNDGLTGGFKGDNFLLIRFHGMYQQDDRDIRAERAEQKLEPRHAMLLRCRLPGGVITTTQWQAIDKFAADNTIYGSIRLTNRQTFQFHGILKKNVKPVHQMLHSVGLDALATANDMNRNVLCTSNPYESQLHAEAYEWAKKISEHLLPRTRAYAEIWLDQEKVATTDEEPILGQTYLPRKFKTTVVIPPQNDIDLHANDMNFVAIAENGKLVGFNLLVGGGLSIEHGNKKTYARTASEFGYLPLEHTLAVAEAVVTTQRDWGNRTDRKNAKTKYTLERVGLETFKAEVERRAGIKFEPIRPYEFTGRGDRIGWVKGIDNNWHLTLFIENGRILDYPGRPLKTGLLEIAKIHQGEFRITANQNLIIASVPESQKAKIETLARDHGLMNAVSAQRENSMACVSFPTCPLAMAEAERFLPSFTDKVEAILEKHGIPDEHIVMRVTGCPNGCGRAMLAEIGLVGKAPGRYNLHLGGNRIGTRIPRMYQENITEPDILASLDELIGRWAKEREAGEGFGDFTVRAGIIRPVLDPARDFWE >CP028151.1|AWP53111.1|2956090_2956309_+|hypothetical-protein MVSANASTRSTFGSSFRAFRSSLDISSSRYTLPDGASLIRPTRAVGRIRHLCRHPAIQVIPRNHARDRAPGE >CP028151.1|AWP51465.1|2955429_2956164_-|phosphoadenosine-phosphosulfate-reductase MSKLDLNALNELPKVDRVLALAETNAQLETLTAEERVAWALENLPGEYVLSSSFGIQAAVSLHLVNQIRPDIPVILTDTGYLFPETYQFIDELTDKLKLNLKVYRAGESPAWQEARYGKLWEQGVEGIEKYNEINKVEPMNRALKELKAQTWFAGLRREQSGSRAHLPVLAIQRGVFKVLPIIDWDNRTVYQYLQKHGLKYHPLWDQGYLSVGDTHTTRKWEPGMAEEETRFFGLKRECGLHEG >CP028151.1|AWP53109.1|2954388_2955342_+|SPI-1-type-III-secretion-system-effector-SopD MPVTLSFGNHQNYTLNESRLAHLLSADKEKAIHMGGWDKVQDHFRAEKKDHALEVLHSIIHGQGRGEPGEMEVNVEDINKIYAFKRLQHLACPAHQDLFTIKMDASQTQFLLMVGDTVISQSNIKDILNISDDAVIESMSREERQLFLQICEVIGAKMTWHPELLQESISTLRKEVTGNAQIKAAVYEMMRPAEAPDHPLVEWQDSLTEDEKSMLACINAGNFEPTTQFCKIGYQEVQGEVAFSMMHPCISYLLHTYSPFAEFKPTNSGFLKKLNQDYNDYHAKKMFIDVILEKLYLTHERSLHIGKDGCSRNILLT >CP028151.1|AWP53110.1|2954066_2954273_+|hypothetical-protein MFFRNIYPTNNYYKFFRLFLLEEALKEASSNAFCINQNLKKTTICGESSTFCLNYRHHKGLPTINVLY >CP028151.1|AWP51464.1|2951281_2953945_-|CRISPR-associated-helicase/endonuclease-Cas3 
MSIYHYWGKSRRGETDGGDDYHLLCWHSLDVAAVGYWMVINNIYFIDHYLKKLGIQDKEQAAQFFAWILCWHDIGKFAHSFQQLYRHEALNIFNEPTRHYEKIAHTTLGYMLWNSWLSECPELFPPSSLSVRKSKRVMALWMPVTTGHHGRPPEAIQELDHFRQQDKDAARDFLLRIKALFPLITLPEAWDEDEGIDQFQQLSWFISAAVVLADWTGSASRYFPRTAEKMPVDTYWQQALAKAQTAITLFPSAANVSAFTGIETLFPFIQHPTPLQQKALELDINVDGAQLFILEDVTGAGKTEAALILAHRLMAAGKAQGLYFGLPTMATANAMFERMANTWLALYQPDSRPSLILAHSARRLMDRFNQSIWSVTLSGTEEPDEAQPYSQGCAAWFADSNKKALLAEVGVGTLDQAMMAVMPFKHNNLRLLGLSNKILLADEIHACDAWMSRILEGLIERQASNGNATILLSATLSQQQRDKLVAAFSRGVRRSVQAPLLGHDDYPWLTQVTQTELISQRVDTRKEVERCVDIGWLHSEEACLERIGEAVEKGNCIAWIRNSVDDAIRIYRQLQLSKVVATENLLLFHSRFAFHDRQRIESQTLNLFGKQSGAQRAGKVIIATQVIEQSLDIDCDEMISDLAPVDLLIQRAGRLQRHIRDRNGLVKKSGQDERETPVLRILAPEWDDAPRENWLSSAMRNSAYVYPDHGRMWLTQRILREQGTIRMPQSARLLIESVYGEDVNMPVGFAKTEQLQEGKFYCDRAFAGQMLLNFAPGYCAEISDSLPEKMSTRLAEESVTLWLAKIVDSVVTPYASGEHAWEMSVLRVRQSWWNKHKDEFEKLDGEPLRKWCAQQHQDKDFATVIVVTDFAACGYSANEGLIGMMGE >CP028151.1|AWP51470.1|2962359_2963031_-|7-carboxy-7-deazaguanine-synthase-QueE MQYPINEMFQTLQGEGYFTGVPAIFIRLQGCPVGCAWCDTKHTWDKLSDREVSLFSILAKTKESDKWGAASSEDLLAVINRQGYTARHVVITGGEPCIHDLMPLTDLLEKSGFSCQIETSGTHEVRCTPNTWVTVSPKVNMRGGYDVLSQALERANEIKHPVGRVRDIEALDELLATLSDDKPRVIALQPISQKEDATRLCIETCIARNWRLSMQTHKYLNIA >CP028151.1|AWP51471.1|2963166_2964465_-|enolase MSKIVKVIGREIIDSRGNPTVEAEVHLEGGFVGMAAAPSGASTGSREALELRDGDKSRFLGKGVTKAVGAVNGPIAQAILGKDAKDQAGIDKIMIDLDGTENKSNFGANAILAVSLANAKAAAAAKGMPLYEHIAELNGTPGKYSMPVPMMNIINGGEHADNNVDIQEFMIQPVGAKTVKEAIRMGSEVFHHLAKVLKGKGMNTAVGDEGGYAPNLGSNAEALAVIAEAVKAAGYELGKDITLAMDCAASEFYKDGKYVLAGEGNKAFTSEEFTHFLEELTKQYPIVSIEDGLDESDWDGFAYQTKVLGDKIQLVGDDLFVTNTKILKEGIEKGIANSILIKFNQIGSLTETLAAIKMAKDAGYTAVISHRSGETEDATIADLAVGTAAGQIKTGSMSRSDRVAKYNQLIRIEEALGEKAPYNGRKEIKGQA >CP028151.1|AWP51472.1|2964547_2966185_-|CTP-synthetase 
MTTNYIFVTGGVVSSLGKGIAAASLAAILEARGLNVTIMKLDPYINVDPGTMSPIQHGEVFVTEDGAETDLDLGHYERFIRTKMSRRNNFTTGRIYSDVLRKERRGDYLGATVQVIPHITNAIKERVLEGGEGHDVVLVEIGGTVGDIESLPFLEAIRQLAVDIGREHALFMHLTLVPYLAAAGEVKTKPTQHSVKELLSIGIQPDILICRSDRAVPANERAKIALFCNVPEKAVISMKDVDSIYKIPGLLKSQGLDDYICKRFSLNCPEANLSEWEQVIYEEANPAGEVTIGMVGKYIELPDAYKSVIEALKHGGLKNRVTVNIKLIDSQDVETRGVEILKDLDAILIPGGFGYRGVEGKIATARYARENNIPYLGICLGMQVALIEFARNVAGMDNANSTEFVPDCKYPVVALITEWRDEDGNVEVRSEKSDLGGTMRLGAQQCQLSDDSLVRQLYGASTIVERHRHRYEVNNMLLKQIEAAGLRVAGRSGDDQLVEIIEVPNHPWFVACQFHPEFTSTPRDGHPLFAGFVKAANEHQKRQAK >CP028151.1|AWP51473.1|2966412_2967213_-|nucleoside-triphosphate-pyrophosphohydrolase MTTNHQIDRLLTLMQRLRDPENGCPWDKEQTFASIAPYTLEETYEVLDAIAREDFDDLRGELGDLLFQVVFYAQMAQEEGRFDFNDICAAISDKLERRHPHVFGELSADNSEEALVRWEQIKTEERAQKAQHSALDDIPRSLPALMRAQKIQKRCSNVGFDWTTLGPVVDKVYEEIDEVMFEARQAVVDQAKLEEEMGDLLFATVNMARHLGTKAELALQKANDKFERRFREVERIVAARGLEMTGVDLETMEEVWQEVKRQEIDL >CP028151.1|AWP51474.1|2967820_2968408_+|fimbrial-protein-SteA MKSSHFCKLAVTASLVMGIVSGAQAAGSNTAKVTFLGNIVDSPCSVTLDTEDQTVNMGSSIGNGTLSNGKTTINNARTFHIDLEGCTWATEKNMNVVFTTGSGTTAATGATDNLALMKTDGTGAISNVSLAIGDAGKNNIKLGDTYTQAIADLDGDTILDEKQSLNFTAWLVGAATGTVGTGEFSSAANVTISYL >CP028151.1|AWP51475.1|2968487_2971187_+|PapC/FimD-family-outer-membrane-usher-protein 
MMNNTWKSVLCPIACGVGMLLSLSPYSASGKDIEFNTDFLDVKNRDNVNIAQFSRKGFILPGVYLLQIKINGQTLPQEFPVNWVIPEHDPQGSEVCAEPELVTQLGIKPELAEKLVWITHGERQCLAPDSLKGMDFQADLGHSTLLVNLPQAYMEYSDVDWDPPARWDNGIPGIILDYNINNQLRHDQESGSEEQSISGNGTLGANLGAWRLRADWQASYDHRDDDENTSTLHDQSWSRYYAYRALPTLGAKLTLGESYLQSDVFDSFNYIGASVVSDDQMLPPKLRGYAPEIVGIARSNAKVKVSWQGRVLYETQVPAGPFRIQDLNQSVSGTLHVTVEEQNGQTQEFDVNTASVPFLTRPGMVRYKMALGRPQDWDHHPITGTFASAEASWGVTNGWSLYGGAIGESNYQAVALGSGKDLGVVGAVAVDITHSIAHMPQDDGFDGETLQGNSYRISYSRDFDEIDSRLTFAGYRFSEKNFMSMSDYLDAKTYHHLNAGHEKERYTVTYNQNFREQGMSAYFSYSRSTFWDSPDQSNYNLSLSWYFDLGSIKNLSASLNGYRSEYNGDKDDGVYISLSVPWGNDSISYNGTFNGSQHRNQLGYSGHSQNGDNWQLHVGQDEQGAQADGYYSHQGALTDIDLSADYEEGSYRSLGMSLRGGMTLTTQGGALHRGSLAGSTRLLVDTDGIADVPVSGNGSPTSTNIFGKAVIADVGSYSRSLARIDLNKLPEKAEATKSVVQITLTEGAIGYRHFDVVSGEKMMAVFRLADGDFPPFGAEVKNERQQQLGLVADDGNAWLAGVKAGETLKVFWDGAAQCEASLPSTFTPELLANALLLPCKMLEGQPPTAPQKSSPLPAQPLIQEHTQTDGQPAAPVATTTQTPPIPLADNHAVNRKDME >CP028151.1|AWP51476.1|2971199_2971973_+|fimbrial-protein MNKTNHFKRQALIASVLLAAPLVSHSAIVPDRTRVIFNGNENSITVTLKNGNATLPYLAQAWLEDDKFAKDTRYFTALPPLQRIEPKSDGQVKVQPLPAAASLPQDRESLFYFNVREIPPKSDKPNTLQLALQTRIKFFYRPVAVARQVDKTHPWQTKLTLTYQGDGVIFDNPTPFYLVISNAGSKENETASGFKNLLIAPREKVTSPIKGASLGSSPVVGYVDDYGGHRLLVFTCSGNTCKVNEEKTRDAEKKANK >CP028151.1|AWP51477.1|2971992_2972499_+|fimbrial-protein MTMLTRWKMLVLLCGGFVTGTEAAGTKTVQLELHLVVTQPPPCTVGGASVEFGDVLTTKVGDASQTKPVGYSLNCDGRASDYLKLQIQGTTTTISGEQVLQTSVQGLGIRIQQAGNKQLVPVGITDWLNFTLSGSNGPELEAVPVKEPTTQLAGGDFNASATLVVDYQ >CP028151.1|AWP51478.1|2972513_2972984_+|fimbrial-protein MKRVLILTLLITQFACADNLTFHGKLINPPACTINNGEMLEVSFGSVIIDNIDGVNYLTEIPWTLTCDSSFRDDALTFTLSYLGTATPYSAKALTTSVPELGIELQQNGTVFPPGTSLTINESSLPTLKAVPVKQPGKEPAEGDFEAFATLQVDYQ >CP028151.1|AWP51479.1|2972980_2973517_+|fimbrial-protein-SteF MNRIFQTAGHLIGGVMLWAVCNTLPAATPNVHYSGKLVAGACNLVVDNDTMATVDFHTIGSDNFDASGQTTPVPFTLSLQDCKTALANGVLVTFQGVEDSTLPGLLALEPSSEASGFAIGVETAAQQPVSINATVGTAFVLKEGITTINLQARLQKYAGEEVMPGEFSGSATVSFEYQ |
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
CP028151_2 | 2.3|2961698|32|CP028151|CRISPRCasFinder,CRT | 2961698-2961729 | 32 | MN694003 | Marine virus AFVG_250M677, complete genome | 17629-17660 | 8 | 0.75 |
CP028151_2 | 2.6|2961881|32|CP028151|CRISPRCasFinder,CRT | 2961881-2961912 | 32 | MG592432 | Vibrio phage 1.050.O._10N.286.48.A6, partial genome | 21687-21718 | 8 | 0.75 |
CP028151_2 | 2.6|2961881|32|CP028151|CRISPRCasFinder,CRT | 2961881-2961912 | 32 | MG592431 | Vibrio phage 1.049.O._10N.286.54.B5, partial genome | 21426-21457 | 8 | 0.75 |
CP028151_2 | 2.10|2961698|34|CP028151|PILER-CR | 2961698-2961731 | 34 | MN694003 | Marine virus AFVG_250M677, complete genome | 17627-17660 | 8 | 0.765 |
CP028151_1 | 1.1|2944847|32|CP028151|CRISPRCasFinder,CRT | 2944847-2944878 | 32 | NZ_MG266000 | Clostridioides difficile strain 7032985 plasmid pCD-ISS1, complete sequence | 5501-5532 | 9 | 0.719 |
CP028151_1 | 1.3|2944969|32|CP028151|CRISPRCasFinder,CRT | 2944969-2945000 | 32 | MK449011 | Streptococcus phage Javan92, complete genome | 36157-36188 | 9 | 0.719 |
CP028151_1 | 1.3|2944969|32|CP028151|CRISPRCasFinder,CRT | 2944969-2945000 | 32 | MK448835 | Streptococcus phage Javan93, complete genome | 36157-36188 | 9 | 0.719 |
CP028151_1 | 1.3|2944969|32|CP028151|CRISPRCasFinder,CRT | 2944969-2945000 | 32 | MK448836 | Streptococcus phage Javan95, complete genome | 37400-37431 | 9 | 0.719 |
CP028151_1 | 1.3|2944969|32|CP028151|CRISPRCasFinder,CRT | 2944969-2945000 | 32 | MK448825 | Streptococcus phage Javan639, complete genome | 37400-37431 | 9 | 0.719 |
CP028151_1 | 1.7|2945213|32|CP028151|CRISPRCasFinder,CRT | 2945213-2945244 | 32 | KY006853 | Erythrobacter phage vB_EliS_R6L, complete genome | 41418-41449 | 9 | 0.719 |
CP028151_2 | 2.6|2961881|32|CP028151|CRISPRCasFinder,CRT | 2961881-2961912 | 32 | NC_047790 | Pseudoalteromonas phage C5a, complete genome | 34441-34472 | 9 | 0.719 |
CP028151_2 | 2.8|2962003|32|CP028151|CRISPRCasFinder,CRT | 2962003-2962034 | 32 | CP006879 | Rhizobium gallicum bv. gallicum R602 plasmid pRgalR602b, complete sequence | 405613-405644 | 9 | 0.719 |
CP028151_1 | 1.16|2945214|33|CP028151|PILER-CR | 2945214-2945246 | 33 | KY006853 | Erythrobacter phage vB_EliS_R6L, complete genome | 41417-41449 | 10 | 0.697 |
CP028151_2 | 2.8|2962003|32|CP028151|CRISPRCasFinder,CRT | 2962003-2962034 | 32 | NZ_CP049244 | Rhizobium pseudoryzae strain DSM 19479 plasmid unnamed3, complete sequence | 699963-699994 | 10 | 0.688 |
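The Spacer_region and Identity columns above appear to follow simple arithmetic: the region runs from the spacer start to start + length - 1, and identity is (length - mismatches) / length reported to three decimals. The sketch below reproduces the tabulated values under that assumption. The function names are hypothetical and not part of the database, and half-up rounding is assumed because 23/32 = 0.71875 is listed as 0.719.

```python
from decimal import Decimal, ROUND_HALF_UP
from typing import Tuple


def spacer_region(start: int, length: int) -> Tuple[int, int]:
    """Inclusive coordinates covered by a spacer (assumed convention)."""
    return start, start + length - 1


def identity(length: int, mismatches: int) -> float:
    """Fraction of matching positions, rounded half-up to three decimals.

    Half-up rounding is an assumption inferred from the table values.
    """
    ratio = Decimal(length - mismatches) / Decimal(length)
    return float(ratio.quantize(Decimal("0.001"), rounding=ROUND_HALF_UP))


# Spacer 2.3 (start 2961698, length 32, 8 mismatches)
print(spacer_region(2961698, 32))  # (2961698, 2961729)
print(identity(32, 8))             # 0.75
```

The same two calls with length 34 and 8 mismatches give the values listed for spacer 2.10 (region end 2961731, identity 0.765).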
1. spacer 2.3|2961698|32|CP028151|CRISPRCasFinder,CRT matches to MN694003 (Marine virus AFVG_250M677, complete genome) position: , mismatch: 8, identity: 0.75
cgattctacggcaacaggccaggctgcgaccg CRISPR spacer ggcgagcacggcaacagcccaggctgcgatcg Protospacer * .********** ***********.**
2. spacer 2.6|2961881|32|CP028151|CRISPRCasFinder,CRT matches to MG592432 (Vibrio phage 1.050.O._10N.286.48.A6, partial genome) position: , mismatch: 8, identity: 0.75
tgattattgacgacaacagcacagaccggcag CRISPR spacer ttataattgactacaacagcacagagcagatt Protospacer * ** ****** ************* *.*
3. spacer 2.6|2961881|32|CP028151|CRISPRCasFinder,CRT matches to MG592431 (Vibrio phage 1.049.O._10N.286.54.B5, partial genome) position: , mismatch: 8, identity: 0.75
tgattattgacgacaacagcacagaccggcag CRISPR spacer ttataattgactacaacagcacagagcagatt Protospacer * ** ****** ************* *.*
4. spacer 2.10|2961698|34|CP028151|PILER-CR matches to MN694003 (Marine virus AFVG_250M677, complete genome) position: , mismatch: 8, identity: 0.765
cgattctacggcaacaggccaggctgcgaccgcg CRISPR spacer ggcgagcacggcaacagcccaggctgcgatcgcg Protospacer * .********** ***********.****
5. spacer 1.1|2944847|32|CP028151|CRISPRCasFinder,CRT matches to NZ_MG266000 (Clostridioides difficile strain 7032985 plasmid pCD-ISS1, complete sequence) position: , mismatch: 9, identity: 0.719
tatttataagcgtgtcatctatgcaacccaac CRISPR spacer aatttataatcatgtcatctatgccataattc Protospacer ******** *.************ *. *
6. spacer 1.3|2944969|32|CP028151|CRISPRCasFinder,CRT matches to MK449011 (Streptococcus phage Javan92, complete genome) position: , mismatch: 9, identity: 0.719
ggccgctggtcaaattcccaatctgagcaatc CRISPR spacer tacatcttgacaaattcccaatctgagcgact Protospacer .* ** * ******************.*..
7. spacer 1.3|2944969|32|CP028151|CRISPRCasFinder,CRT matches to MK448835 (Streptococcus phage Javan93, complete genome) position: , mismatch: 9, identity: 0.719
ggccgctggtcaaattcccaatctgagcaatc CRISPR spacer tacatcttgacaaattcccaatctgagcgact Protospacer .* ** * ******************.*..
8. spacer 1.3|2944969|32|CP028151|CRISPRCasFinder,CRT matches to MK448836 (Streptococcus phage Javan95, complete genome) position: , mismatch: 9, identity: 0.719
ggccgctggtcaaattcccaatctgagcaatc CRISPR spacer tacatcttgacaaattcccaatctgagcgact Protospacer .* ** * ******************.*..
9. spacer 1.3|2944969|32|CP028151|CRISPRCasFinder,CRT matches to MK448825 (Streptococcus phage Javan639, complete genome) position: , mismatch: 9, identity: 0.719
ggccgctggtcaaattcccaatctgagcaatc CRISPR spacer tacatcttgacaaattcccaatctgagcgact Protospacer .* ** * ******************.*..
10. spacer 1.7|2945213|32|CP028151|CRISPRCasFinder,CRT matches to KY006853 (Erythrobacter phage vB_EliS_R6L, complete genome) position: , mismatch: 9, identity: 0.719
gagaatgctcatgcgcgtgagcgccatatatt CRISPR spacer cgaaatgatcatgcgcgtcagcgccattgcgt Protospacer ..**** ********** ******** *
11. spacer 2.6|2961881|32|CP028151|CRISPRCasFinder,CRT matches to NC_047790 (Pseudoalteromonas phage C5a, complete genome) position: , mismatch: 9, identity: 0.719
tgattattgacgacaacagcacagaccggcag CRISPR spacer agcttattgacgaaaacggcacagacaccaaa Protospacer * ********** ***.******** *.
12. spacer 2.8|2962003|32|CP028151|CRISPRCasFinder,CRT matches to CP006879 (Rhizobium gallicum bv. gallicum R602 plasmid pRgalR602b, complete sequence) position: , mismatch: 9, identity: 0.719
gaatctggaggccaacagcgcggcgaaatcct CRISPR spacer gaatctggagggcgacagcgcggtcgaccctg Protospacer *********** *.*********. .* .*.
13. spacer 1.16|2945214|33|CP028151|PILER-CR matches to KY006853 (Erythrobacter phage vB_EliS_R6L, complete genome) position: , mismatch: 10, identity: 0.697
gagaatgctcatgcgcgtgagcgccatatattc CRISPR spacer cgaaatgatcatgcgcgtcagcgccattgcgtt Protospacer ..**** ********** ******** *.
14. spacer 2.8|2962003|32|CP028151|CRISPRCasFinder,CRT matches to NZ_CP049244 (Rhizobium pseudoryzae strain DSM 19479 plasmid unnamed3, complete sequence) position: , mismatch: 10, identity: 0.688
gaatctggaggccaacagcgcggcgaaatcct CRISPR spacer gtggtcataggccatcagcgcggcgatatccc Protospacer * . ... ****** *********** ****.
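The mismatch and identity values listed for each hit are consistent with a position-by-position comparison of the spacer and its protospacer, with identity = (length - mismatches) / length. The following is a minimal Python sketch of that arithmetic, assuming an ungapped comparison of equal-length sequences; the function name `spacer_identity` is illustrative and is not taken from any of the tools named in this report.

```python
from typing import Tuple


def spacer_identity(spacer: str, protospacer: str) -> Tuple[int, float]:
    """Return (mismatches, identity) for two equal-length, ungapped sequences."""
    if len(spacer) != len(protospacer):
        raise ValueError("sequences must have the same length")
    # Count positions where the two sequences differ (case-insensitive).
    mismatches = sum(a != b for a, b in zip(spacer.lower(), protospacer.lower()))
    identity = (len(spacer) - mismatches) / len(spacer)
    return mismatches, identity


# Hit 13 above: 33-nt PILER-CR spacer 1.16 against Erythrobacter phage vB_EliS_R6L.
mm, ident = spacer_identity(
    "gagaatgctcatgcgcgtgagcgccatatattc",
    "cgaaatgatcatgcgcgtcagcgccattgcgtt",
)
print(mm, f"{ident:.3f}")
```

For hit 13 this gives 10 mismatches over 33 positions, which agrees with the reported identity of 0.697.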
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
931792 : 939105
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP028151|931792:939105|DBSCAN-SWA TATGCGTGCTAAGGGAAAGAAATTTAAAAAGCGTTATCTGGTCATTATTTTAATTCTTTTAGTGGGGGGGATGGCTGGCTGGCGAATGATAAATGCCCCGCTGCCAACTTATCAGACATTAATCGTGCGGCCAGGCGATCTTGAACAGAGTGTACTGGCGACTGGAAAACTGGACGCGTTGCGTAAAGTGGATGTCGGCGCGCAGGTGAGCGGACAGTTGAAAACGCTGCTGGTCTCCATTGGCGATAACGTTAAAAAAGATCAGCTACTCGGCGTGATTGACCCAGATCAGGCGGAGAACCAGATAAAAGAGGTCGAGGCCACCTTGATGGAGCTGAACGCGGAGCGTCAGCAGGCAGCCGCTGAGTTAAAGCTGGCGCGGGTTACGCTGGCGCGCCAGCAGCAGTTAGCTAAGACTCAGGCGGTATCGCAACAGGATCTGGATACCGCGGCGACGGAGATGGCGGTTAAACAGGCGCGTATTGGCACCATAGATGCCCAGATCAAACGTAATCGGGCCTCGTTGGACACCGCGAAAACCAACCTGGAATATACCCGTATTGTCGCCCCCATGGCGGGGGAAGTGACGCAAATCACTACCCTGCAAGGACAAACGGTGATTGCAGCTCAGCAGGCGCCCAATATTCTGACGCTGGCGGATATGAGCACCATGCTGGTAAAAGCGCAGGTCTCGGAAGCGGACGTGATCCATCTTCGGGCGGGGCAGAAAGCATGGTTCACCATTGCAGGCGATCCGCAAACGCGCTATGAAGGCGTTTTAAAAGATATTCTGCCGACGCCGGAAAAGATCAACGACGCTATTTTTTATTACGCCCGGTTTGAAGTGCCGAATCCCAAAAGAATCTTGCGTCTTGATATGACCGCACAGGTTTATATTCAACTCATGGATGTCAAAAATGTGCTGATTATTCCTCTCGCCGCGCTTGGCGAACCGGTGGGCGGCAATCGTTATAAAGTGGCGCTGTTGCGTAACGGCGAAAAACGTGAGCGCGAAGTGGTCATTGGCGAGCGTAACGATACAGACGTGGAAGTGGTTAAAGGTCTGGAAGCGGGCGATGAGGTGATCATCGGCGAAAGCAGGCCAGGAGCGACGCCATGACGGCATTGCTTGAACTGTGCAATGTGAGTCGTAGCTACCCCTCCGGAGAAGAGCAGGTGGCGGTGTTGAAAGATATCTCCCTGCAAATCCACGCCGGGGAGATGGTGGCGATCGTCGGCGTTTCCGGTTCTGGAAAATCAACGCTGATGAATATCCTCGGGTGCCTGGATAAACCGACCAGCGGCACTTATCGGGTGGCGGGGCGGGACGTCTCGACGCTGGACCCGGACGCGCTGGCGCAGCTGCGGCGTGAGCATTTTGGCTTTATCTTTCAGCGCTACCATCTGTTGTCGCATTTAACGGCAGCGCAAAATGTTGAAATCCCCGCCGTCTACGCCGGCATTGAACGCAAAAAACGCCAGGCGCGCGCCAGAGAGTTGCTTTTGCGGCTGGGATTAAGCGATCGCGTCGATTACCCACCTTCACAGCTTTCTGGCGGACAGCAGCAGCGTGTCAGTATTGCCCGCGCTCTGATGAACGGTGGACAGGTGATTCTGGCAGATGAGCCGACCGGCGCGCTGGATAGCCATTCCGGCGAAGAGGTGATGGCGATTTTGCGCCAACTGCGCGATCGCGGACATACGGTGATCATTGTGACGCACGATCCGCTGATTGCCGCCCAGGCGGAGCGGATTATTGAAATTCACGATGGCAAGATTGTCCATAATCCGCCCGCGGAGGAAAAGAAACGCGAACAGGGCGTTGACGCTGCCGTAGTTAATACGGCTCCCGGCTGGCGGCAATTTGCCAGCAGCTTTCGCGAAGCGCTGTCAATGGCGTGGTTAGCGATGGCCGCTAACAAAATGCG
TACTTTACTGACCATGCTGGGAATTATTATCGGTATTGCGTCGGTGGTGTCGATTGTGGTGGTCGGCGACGCCGCAAAACAGATGGTACTGGCGGATATCCGCGCTATGGGCACTAACACGATTGATATTCATCCAGGCAAAGATTTTGGCGACGACAATCCGCAGTATCGACAGGCGCTGAAATATGACGATCTGGTCGCTATTCAGAAACAGCCGTGGGTTAACTCTGCGACGCCCAGCGTTTCAAAGAGCTTACGTCTTCGCTATGGCAATATTGATATTGCCGTAAATGCTAATGGCGTCAGTGGCGATTATTTTAACGTTTACGGCATGTCCTTTAGGGAGGGGAACACCTTCAATTCTGTACAGCAACAGGATCGCGCGCAGGTGGTGGTGCTGGATGCCAACACGCGACGCCAGCTATTTCCAAATAAAGCGAATGTCGTAGGGGAAGTGGTGCTGGTGGGTAATATGCCGGTTATTGTTATTGGCGTGGCGGAAGAGAAACCGTCCATGTACGGCAATAGCAATCTGTTGCAAGTTTGGTTGCCCTATAGCACGATGTCAGATCGCATAATGGGTCAGTCATGGCTTAACTCGATCACCGTTCGTGTGAAAGATGGCGTTGATAGCGATCAGGCTGAACAGCAGCTTACCCGCCTGCTCACCTTACGCCACGGTAAAAAAGACTTCTTCACCTGGAATATGGACAGCGTTCTGAAAACGGCTGAAAAAACCACCTATACTCTTCAGTTATTTCTGACGCTGGTGGCCGTCATTTCGCTGGTTGTCGGCGGCATTGGCGTTATGAATATTATGCTGGTTTCCGTCACCGAGCGAACGCGTGAAATCGGCATCCGTATGGCGGTAGGCGCGCGCGCCAGCGATGTGCTACAGCAGTTTCTTATTGAAGCGGTGCTGGTTTGCCTGGTTGGGGGAGCGCTGGGGATTAGCTTGTCGATGTTCATCGCATTTATGCTACAGCTTTTCCTGCCCGGCTGGGAGATCGGTTTTTCACTGACTGCGCTGGCGAGCGCGTTTTTATGTTCGACGTTTACCGGGATACTGTTTGGCTGGCTACCGGCGAGAAACGCGGCGCGACTGGACCCGGTGGATGCGCTGGCAAGGGAGTAATCTCGCTGACGTCATTTGTCGGCCTGATAAGATGCAACAGTGTCGCTATCAGGCACTGGCGTTAAATAAAAATGCCAGCCGATCGGGCTGGCATTTTGCCTCCTGGATGTACACAATGAGACAGAGGAGCTATGCAACGGCCTCTGCTTCGATGGGCACGATGACGCTGGCGTGATTGCCTTTTGGCCCCTGGTGGACATCAAACCGGACAGACTGTCCGGCTTTAAGCGTTCTGTAACCATCCATTTGAATGGTGGAATAATGGGCGAAAATATCCTCGCCGCCGCCTTCAGGGCAGATGAAACCAAACCCTTTGGCATTGTTGAACCACTTTACAGTACCCGTTTCCATGCTTCGACATCCTTCGTAAATCTTATATAAGTAAGATGGAATGAACCGGTGACGGAGTGGGGGCTGTTCAAAACCTCACCAACTCTCGACATTACAATTTAGAGAAATCAGGCGAGGCGTCAAGCATCAGGCAGGGGGGATCGGGTAAAAATGAATCAAAAATTTGAAGCAGTTAACGCTATTGCCGGGAATGTGACAGATGTCGCGGATGGTACTGATAGATGTTAGTTATCTATCAATTGAGGTAGATTGATTGTGTGCATAGACTCTGGTCAGCGGCAGATTTTCCTGCCGACAACCGTAACCGATAATGACGACTGACAATGGGTAAGACGAACGATTGGCTGGATTTTGACCAGTTGGTGGAAGATAGCGTGCGCGACGCGCTAAAACCGCCATCTATGTATAAAGTGATATTAGTCAATGATGATTACACTCCGATGGAGTTTGTTATTGACGTGTTACAAAAATTCTTTTCTTATGATGTAGAACGTGCAACGCAATTGATGCTTGC
AGTTCACTATCAAGGCAAAGCCATCTGCGGCGTGTTCACCGCCGAGGTGGCGGAAACCAAAGTGGCGATGGTGAACAAGTATGCAAGGGAGAACGAGCATCCGTTGCTGTGTACGCTGGAAAAAGCCTGAATGCAGGTATAAAAATTGGGGGAGGTGCCTATGCTCAATCAAGAACTGGAACTCAGTTTAAACATGGCTTTCGCCAGAGCGCGCGAGCACCGTCATGAGTTTATGACCGTCGAGCATCTGTTGCTGGCGCTGCTCAGCAACCCATCGGCTCGCGAAGCGCTGGAAGCATGCTCCGTGGATCTGGTGGCGCTCCGTCAGGAACTCGAAGCCTTCATTGAACAAACCACACCCGTACTGCCTGCCAGTGAAGAAGAGCGTGATACGCAGCCGACGTTAAGTTTCCAGCGTGTCCTGCAGCGTGCCGTCTTCCATGTTCAGTCTTCCGGGCGTAGTGAAGTGACTGGCGCGAATGTGCTGGTGGCTATCTTTAGCGAACAGGAATCACAGGCGGCTTATCTGCTGCGCAAGCATGAAGTGAGCCGTCTGGATATCGTGAACTTTATTTCTCACGGGACGCGAAAAGACGAACCGAGCCAATCTTCCGATCTCGGCAATCAGCCAACTGGCGACGAACAAGCTGGCGGGGAGGAACGTATGGAAAACTTCACGACGAATCTTAACCAACTTGCTCGCGTGGGCGGCATCGATCCGCTGATTGGTCGTGAAAAAGAACTTGAACGCGCGATCCAGGTCTTGTGTCGTCGCCGTAAAAATAACCCGTTGCTGGTAGGGGAATCCGGCGTCGGCAAAACGGCGATTGCCGAAGGGCTGGCCTGGCGTATCGTGCAGGGCGATGTGCCGGAAGTGATGGCCGATTGCACCATTTACTCTCTGGATATCGGTTCGCTGCTGGCGGGCACCAAATACCGCGGCGATTTTGAAAAACGGTTTAAGGCGTTGCTGAAACAGCTTGAGCAGGATACCAACAGCATCCTGTTTATCGATGAAATCCATACCATTATCGGCGCTGGCGCGGCGTCGGGCGGACAGGTGGATGCGGCAAATCTGATTAAACCGCTGCTTTCCAGCGGCAAGATCCGGGTGATCGGCTCAACGACCTATCAGGAATTCAGCAATATTTTTGAGAAAGACCGTGCATTAGCGCGCCGTTTCCAGAAAATTGATATTACCGAGCCTTCGGTGGAAGAGACGGTGCAAATTATCAACGGCTTGAAACCTAAGTACGAAGCGCACCACGACGTGCGTTATACCGCGAAAGCGGTGCATGCGGCGGTCGAGTTGGCGGTAAAATATATCAATGACCGCCATCTGCCGGATAAAGCCATTGACGTGATTGACGAAGCGGGCGCTCGGGCGCGTCTGATGCCGGTGAGCAAACGTAAGAAAACGGTCAACGTGGCGGATATTGAGTCCGTAGTGGCGCGAATTGCGCGAATTCCTGAAAAGAGCGTCTCGCAGAGCGATCGCGATACGCTGAAGAACCTGGGCGATCGTCTGAAAATGCTGGTCTTCGGCCAGGATAACGCGATTGAGGCGCTGACCGAAGCTATTAAGATGAGTCGTGCCGGTCTGGGCCATGAGCATAAACCTGTCGGCTCATTCTTGTTCGCCGGGCCAACTGGCGTAGGGAAAACTGAAGTTACGGTACAGCTTTCAAAAGCGCTGGGTATTGAGCTGTTGCGCTTCGATATGTCCGAATATATGGAGCGTCATACGGTGAGCCGTTTGATCGGCGCGCCTCCGGGATACGTCGGTTTCGACCAGGGCGGGCTGCTGACGGATGCGGTGATTAAGCATCCTCATGCGGTGCTGTTGCTGGATGAGATCGAAAAAGCGCACCCGGATGTCTTTAACCTGCTGCTGCAGGTGATGGATAACGGTACGCTGACCGATAACAATGGCCGTAAGGCGGATTTCCGCAACGTGGTGCTGGTGATGACCACCAACGCCGGCGTGCGAGAAACCGAAC
GTAAATCTATTGGTCTTATTCATCAGGACAACAGTACCGATGCGATGGGCGAGATCAAGAAAGTGTTTACGCCGGAGTTCCGTAACCGTCTCGACAACATTATTTGGTTCGATCATCTGTCTGGCGAGGTGATTCATCAGGTTGTCGATAAGTTTATCGTCGAGTTGCAGGCTCAGTTGGATCAGAAAGGCGTCTCTCTGGAAGTCAGTCAGGAAGCGCGCGACTGGCTGGCGGAAAAGGGCTATGACCGGGCGATGGGCGCACGACCGATGGCGCGTGTGATTCAGGATAACCTGAAAAAACCGCTGGCCAATGAGTTGCTGTTTGGATCGCTGGTTGATGGCGGACAGGTCACCGTCGCGCTGGATAAAGAGAAAAATGCGTTGACGTATGGCTTCCAGAGTGCGCAAAAGCACAAGCCGGAAGCCGCGCATTAATCTTCGTTTCACTGCCGTACAAACCGGGCCTTAGCGCCCGGTTTTTTTACGCCGGCTAATAGTTAGCCTGATGGCGTTGTGCTCGCGGTAGGCGGACAAGGCGCTTGTAAATTGCGTCATCATCCGGTAATCGACGGGTCAAATGCGAAAAAAAAGCCCGACCCGCTGCGTGAAAAGGGATGCTGAAGCCGCTGTTACCAATGCCAAAAAAACTAATCCTCTTCTCTGAAAAGAGTAGTCTTACACTTAGCACAAAGAATTCTTCTGTGATTATCCCATACCTGTATCAAACCAACATCGCTGAATTTCTCGTGACCACAACTGTAGCAGGATATTTTGCTCATATTGTGTTTTTTAATATATTCATCTTTGGTGGGAAGTTGCTTCCAGTAGCCTATCAGTCCGGGCATAACCTCTCCTTGAGTGGGTTATGTAATTTGCAAATTATGATAATAATTATATAAGAAATAGTTCAGTATGAAATTATTACTTATCACATTTTCGAAAGCGAATGCTATTAGCATCACACTTTCTACGTTGTTATCAACGATAATGCTTTCGTACAGGTAGCTTAGCTGTTGGCAGTGCTATACCCCGTCCAATCATCCATCAACGCCACCCGTTTAGGCCATAACGTCCCACGCTGATACGCTGCTTCAGCCTTATCTGCCAACTGGTGCGCCAACGCATGTTCAATAACCTCACGTTGATAATCCGTTGCTTCACCAGCCCACTCACGGAAAGTAGAACGGAAGCCATGTTGCGTTAAGTCGATATATCCCATTCGTTTCAATACAGCCAATAACGACATATCAGAAAGTGTTTCAGCGCGAGGGGCAGGGAATACATGATTGTTATCTTTTAATCGTGGTAAATCTTTTAACAAATCAACAGCAGCATCAGACAGAGGAACTCGGTGCTTTTTTACTACCGAGAGATACTTATGCAA
Protein sequences of DBSCAN-SWA_1 >CP028151|931792:939105|935879_938156_+|AWP49579.1|protease|DBSCAN-SWA MLNQELELSLNMAFARAREHRHEFMTVEHLLLALLSNPSAREALEACSVDLVALRQELEAFIEQTTPVLPASEEERDTQPTLSFQRVLQRAVFHVQSSGRSEVTGANVLVAIFSEQESQAAYLLRKHEVSRLDIVNFISHGTRKDEPSQSSDLGNQPTGDEQAGGEERMENFTTNLNQLARVGGIDPLIGREKELERAIQVLCRRRKNNPLLVGESGVGKTAIAEGLAWRIVQGDVPEVMADCTIYSLDIGSLLAGTKYRGDFEKRFKALLKQLEQDTNSILFIDEIHTIIGAGAASGGQVDAANLIKPLLSSGKIRVIGSTTYQEFSNIFEKDRALARRFQKIDITEPSVEETVQIINGLKPKYEAHHDVRYTAKAVHAAVELAVKYINDRHLPDKAIDVIDEAGARARLMPVSKRKKTVNVADIESVVARIARIPEKSVSQSDRDTLKNLGDRLKMLVFGQDNAIEALTEAIKMSRAGLGHEHKPVGSFLFAGPTGVGKTEVTVQLSKALGIELLRFDMSEYMERHTVSRLIGAPPGYVGFDQGGLLTDAVIKHPHAVLLLDEIEKAHPDVFNLLLQVMDNGTLTDNNGRKADFRNVVLVMTTNAGVRETERKSIGLIHQDNSTDAMGEIKKVFTPEFRNRLDNIIWFDHLSGEVIHQVVDKFIVELQAQLDQKGVSLEVSQEARDWLAEKGYDRAMGARPMARVIQDNLKKPLANELLFGSLVDGGQVTVALDKEKNALTYGFQSAQKHKPEAAH >CP028151|931792:939105|938368_938566_-|AWP49580.1|DBSCAN-SWA MPGLIGYWKQLPTKDEYIKKHNMSKISCYSCGHEKFSDVGLIQVWDNHRRILCAKCKTTLFREED >CP028151|931792:939105|938727_939105_-|AWP49581.1|integrase|DBSCAN-SWA MHKYLSVVKKHRVPLSDAAVDLLKDLPRLKDNNHVFPAPRAETLSDMSLLAVLKRMGYIDLTQHGFRSTFREWAGEATDYQREVIEHALAHQLADKAEAAYQRGTLWPKRVALMDDWTGYSTANS >CP028151|931792:939105|934983_935205_-|AWP49577.1|DBSCAN-SWA METGTVKWFNNAKGFGFICPEGGGEDIFAHYSTIQMDGYRTLKAGQSVRFDVHQGPKGNHASVIVPIEAEAVA >CP028151|931792:939105|932907_934854_+|AWP49575.1|DBSCAN-SWA 
MTALLELCNVSRSYPSGEEQVAVLKDISLQIHAGEMVAIVGVSGSGKSTLMNILGCLDKPTSGTYRVAGRDVSTLDPDALAQLRREHFGFIFQRYHLLSHLTAAQNVEIPAVYAGIERKKRQARARELLLRLGLSDRVDYPPSQLSGGQQQRVSIARALMNGGQVILADEPTGALDSHSGEEVMAILRQLRDRGHTVIIVTHDPLIAAQAERIIEIHDGKIVHNPPAEEKKREQGVDAAVVNTAPGWRQFASSFREALSMAWLAMAANKMRTLLTMLGIIIGIASVVSIVVVGDAAKQMVLADIRAMGTNTIDIHPGKDFGDDNPQYRQALKYDDLVAIQKQPWVNSATPSVSKSLRLRYGNIDIAVNANGVSGDYFNVYGMSFREGNTFNSVQQQDRAQVVVLDANTRRQLFPNKANVVGEVVLVGNMPVIVIGVAEEKPSMYGNSNLLQVWLPYSTMSDRIMGQSWLNSITVRVKDGVDSDQAEQQLTRLLTLRHGKKDFFTWNMDSVLKTAEKTTYTLQLFLTLVAVISLVVGGIGVMNIMLVSVTERTREIGIRMAVGARASDVLQQFLIEAVLVCLVGGALGISLSMFIAFMLQLFLPGWEIGFSLTALASAFLCSTFTGILFGWLPARNAARLDPVDALARE >CP028151|931792:939105|935528_935849_+|AWP49578.1|protease|DBSCAN-SWA MGKTNDWLDFDQLVEDSVRDALKPPSMYKVILVNDDYTPMEFVIDVLQKFFSYDVERATQLMLAVHYQGKAICGVFTAEVAETKVAMVNKYARENEHPLLCTLEKA >CP028151|931792:939105|934834_935029_+|AWP49576.1|DBSCAN-SWA MRWQGSNLADVICRPDKMQQCRYQALALNKNASRSGWHFASWMYTMRQRSYATASASMGTMTLA >CP028151|931792:939105|931792_932911_+|AWP49574.1|DBSCAN-SWA MRAKGKKFKKRYLVIILILLVGGMAGWRMINAPLPTYQTLIVRPGDLEQSVLATGKLDALRKVDVGAQVSGQLKTLLVSIGDNVKKDQLLGVIDPDQAENQIKEVEATLMELNAERQQAAAELKLARVTLARQQQLAKTQAVSQQDLDTAATEMAVKQARIGTIDAQIKRNRASLDTAKTNLEYTRIVAPMAGEVTQITTLQGQTVIAAQQAPNILTLADMSTMLVKAQVSEADVIHLRAGQKAWFTIAGDPQTRYEGVLKDILPTPEKINDAIFYYARFEVPNPKRILRLDMTAQVYIQLMDVKNVLIIPLAALGEPVGGNRYKVALLRNGEKREREVVIGERNDTDVEVVKGLEAGDEVIIGESRPGATP |
8 | Dickeya_phage(16.67%) | protease,integrase | attL 920530:920544|attR 939323:939337 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
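The protein records in a prophage region carry their coordinates, strand, accession and (where present) a phage-related keyword in the FASTA header. The sketch below shows one way such a header could be split and checked against the keyword list quoted in the annotation note. The field layout is inferred from the records shown in this report, not from documented format, and the names `ProteinHeader`, `parse_header` and `is_phage_related` are hypothetical.

```python
from typing import NamedTuple, Optional

# Keywords quoted in the annotation note of this report.
PHAGE_KEYWORDS = {
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
}


class ProteinHeader(NamedTuple):
    contig: str
    region_start: int
    region_end: int
    start: int
    end: int
    strand: str
    accession: str
    keyword: Optional[str]


def parse_header(header: str) -> ProteinHeader:
    """Split a header such as '>CP028151|931792:939105|935879_938156_+|AWP49579.1|protease|DBSCAN-SWA'."""
    fields = header.strip().lstrip(">").split("|")
    contig, region, coords, accession = fields[:4]
    # Assumed layout: an optional keyword sits between the accession and the trailing tool name.
    extra = fields[4:-1]
    keyword = extra[0] if extra else None
    region_start, region_end = (int(x) for x in region.split(":"))
    start, end, strand = coords.split("_")
    return ProteinHeader(contig, region_start, region_end,
                         int(start), int(end), strand, accession, keyword)


def is_phage_related(record: ProteinHeader) -> bool:
    """True when the header carries one of the listed phage-related keywords."""
    return record.keyword in PHAGE_KEYWORDS


hit = parse_header(">CP028151|931792:939105|935879_938156_+|AWP49579.1|protease|DBSCAN-SWA")
plain = parse_header(">CP028151|931792:939105|938368_938566_-|AWP49580.1|DBSCAN-SWA")
print(hit.accession, hit.keyword, is_phage_related(hit))
print(plain.accession, plain.keyword, is_phage_related(plain))
```

Records without a keyword field, such as AWP49580.1, are treated here as not phage-related; whether the original tool applies the same rule is an assumption.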
DBSCAN-SWA_2 |
1010423 : 1021648
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >CP028151|1010423:1021648|DBSCAN-SWA TCTACTGCGCTGCTACCATGCTTTGGGGCAGTGATGGGGCATAGCGGGAAAGTGCCTGGTTGAGCAGAGAAACCTGTTCTGCGCTCTTCTCTGACATCCACTTTCCATACACCTTGTAAACCATCTGTGCATCGGTATGCCCCATCTGCGTTGCTATAAAGTTTGGGTTAGCACCAGCTGATAATGACCAGCATGCATAAGGGTTCATGCCCCATTGCATAGCCTGCATGACGATAGCCATACAGTCGGCTGGTTTCCCTGCAAGGTGTGCCGGTACCGTCACCTGTGAGTCTGCCATCAGGTTAGCGAAAGCTGTTAACTGACCCAGTGCCTGAACGTTAAAAATTGCGTTACTGGCAGAAATGGTGTTTGGTGCCTGCTGCTCAGTGGTAACAATATTTGTGTTTTCCATGATTTTCCCCTTATGCCTGTATGTACGGTGTACAGGGAACGCCTGACTGTTACCGGATTGAACTGAAAAATGTTTATGGTGTACAGGAGAATCTGATCTCATACCGACAGGCATCGCTGGGGGCATGGGTAGCGATTGCTGGTGGCGGCGATCCTTATGAAGTGGCTTACGCTATCTATAAAGCCGTGCCAGATATCTCCGTACTGACGAATGATGTAGTGAATCCATCAGGCGCTGCGGTGGATAAAAAAACGATACCGATCATTGTGTATCCGGATACGTATCACGTGCCGTTTGTAGTGCCATCATCACAAAACGTTACGCTTTTAATCACCTGGAATACAGCCTCAACCAGCTATATCGATCCAACCGGGATTGAAAAAGCAGTGCAGCAAAGCATTGCTGATTACATTAACGGAATTGCAACGGGTGAACCAATAAACATTTTCCTGATTCGGGATATTTTTCTTAATCAGGTTAAGGGGCTTGTATCTTCAAACCTTGTATCAATGATTGATATTCAGGTTGGAATAAACGGAAAAATTGTCCCACCTGCAACCGACTCCAGCCTGGTTTATGGTGATACTTACGCCTATTTTTCCACTTCATCTTCACAAATTCAGGTTAAGCAATATGGCAGCTCTTCTTGAAAGCATTATTCCGGCCTACCCCTATACGCAATATAATGACGATCCGGATATAGTTGCCTTTTTTGATGCTTATAACAAACTGGCACAGGGGTATCTTGATTACTTTAACAACCTGAATTTACCTTGCTGGACCTCCCCGGCGATTACCGGTGAGTTGCTGGACTGGATTGCGGCGGGTATTTATGGGGAATCACGCCCCTTGCTTCAAATCTCCGAGGATGCCATTGCTCGTGGGGCGTATAACACTATTGAGTACAATAATGTCGCGTATGCAAAACTGAGAAATTATGTTCCCGGCTCAGCGTCATATGTTCCGGACGACTATTTTAAACGGATACTGACATGGAATTTTTATAAAGGCGATGGTTCGCACTTCTGTATCAACTGGTTCAAACGACGGCTTGCACGCTTTATACATGGAGCTAACGGAATAGACCCACCTGTACAGTCCACTTTTGATATTAGTGTAATGCCCGATAAGGGCATTTTTTTTGTCTCCATTCCTGACTATGGCGATGGTGTCGGACACTTTCTTAAAGATGCAATTGACCAGTCGCTGGTGAAACTCCCTTTTATTTATACCTATTCGGTAACGGTGGTTGAGCAATGATTATTGGATTCGGAAATAATGTCGTCTCCTCACTGGCGGCTGATATTACCGCCAGCCAGACGACCATTCAGGTGATGCCTGGTGTGGGAGCGATGTTTGCTAATTTGCTGACCAGCGATTATGCAAACAGCTCAAACCCTCTTAAAACTTACGCCAAAATTACACTGACAGACGCAAAAGAAACAGTTTTTGAGGTATGCCATCTGACAGCAGTTAATAATGACATGCTGACGGTTATTCGCGGTCAGGAAGGT
ACAACAGCGAAGGGATGGTCACTGAATGACGTTATAGCGAATTTTGCGACGCGAGGATCTGAAAATCAGTTTGTACAAATTGAAGAGCTCCAGAGTGGGCATTATGTCGCTGGTGTGGCCGGAGGTACAGAAAATAATCTGACGCTGGAGTTACCAGCAACTTATTTCGTCAATGGTGGAGTTGACTGGACATTGCGCACTCCACTTGTGGTTATTCCGGCGCTAAACAATACCGGAGCCAGCACTCTGCAACTGACGATGGGAGGACGTGTGCTTGGCATATTCCCACTATACAAGGGGAATAAAGCAGAGTTATCGGCCAATGATATTATTAAAGATATTCCTGTCTTATGCGTTCTGGATAATACAAAAACCTATTTTTCTGTGCTTAATCCCCTGGAGATTTATTTGGGATCACGGTATTTGCAGAAGGACCAGAACCTGTCCGACGTACCGGATAAGGCCAAAGGTCGCTCCAGTCTTGAGGTCTACAGCAAAACCGAAAGTGATGAAAACTACATGGCTAAAAGCCAGTGTGGTGCGGATATCCCGAATAAGCCGCTGTTTGTACAAAATATCGGAGCGCTCCCTGCATCAGGTACGGCTGTTGCAGCGAACAGACTGGCATCACGCGGCGCGCTTCCGGCACTGACTGGTACGACAAGAGGCAGCGATAGTGGCCTGATAATGGGCGAGGTTTACAACAATGGCTATCCGACGCAATACGGAAATATTTTACGTCTGACCGGAACCGGTGATGGGGAAATCCTCATTGGCTGGAGCGGGACAAATGGTGCGCCAGCGCCCGCATATATTCGCAGCCATCGAGATACCGCCGATGCTGAGTGGTCCGAATGGGCAATGCTTTACACCACACTAAACCCACCTCCGGATTCGCATCCAGTAGGGGCGGCGATTGCATGGCCATCTGATGCTACTCCGGCAGGTTACGCTCTGATGCAGGGGCAGTCCTTCGATAAATCTGCTTACCCGTTACTGGCTATAGCGTATCCGTCCGGCGTTATCCCTGACATGAGAGGCTGGACAATAAAGGGTAAGCCCATCAGTGGACGTGCCGTATTGTCGCAAGAAATGGACGGCAATAAATCGCACTCGCACACCGCGCGGGCGCAGGTTACTGACTTAGGGACAAAATCTACCTCATCCTTTGATTACGGCACGAAATCGACCAATACCACGGGCAATCATACTCACCAGTTCGGCGGTTATATCAATTCATACTGGGGAGATTCCAATCACACCTCATTTCAGCCAGGAGGTGGTGCATGGACACAGGCCGCTGGCGACCATGCACATACAGTTTATATCGGAGGACATGAGCACACCATGTATATAGGTCCACACGGACACGTCGTTATTGTGGACGCAGACGGTAATGCGGAAACCACGGTTAAAAACATTGCATTTAACTACATAGTGAGGCTGGCATAATGACTTTTAAAATGAGCGAACAGGCGCAGACAATTAAAATTTTCAATCTGCGTTCAGATACTAACGAATTTATTGGTGCAGGTGATGCGTATATTCCGCCGCACACAGGACTACCGGCAAACTGTACTGATATCGCCCCTCCTGATATTCCCGCCAGTCATATTGCTATATTTGACGCTGAAACCCAGACATGGAGTTTGCATGAGGATCACCGCGGCGAGATGGTTTACGACACAACAACCGGCAATCAGGTTTATATCTCCGCTCCTGGTCCGTTGCCCGAAAATGTCACATCAGTTTCACCAGGTGGTGAATACCAGAAATGGGATGGTAAGGCTAAGGTCTGGGTAAAAGACGAAGCGGCTGAAAAAGCAGCGCAGCTTCGTCAGGCGGAAGAAACCAAAAGCCGTCTTTTGCAAATGGCATCTGAAAAAATCGCGCCATTGCAGGATGCAGTTGATCTTGGAATCGCAACAGATGATGAGAAAGCGCGGCTCGACGAATGGAAAAAATATAGGGTGCTGGTAAACCGGATGGATACAGCCGCC
CCTGACTGGCCGGAAAGACCAGCCAGCCAGTAGGCGTTACTGTGGTAATGCAGGCCACCTGATTGCTGTGAAGGTGGCCTCATCTGAAACACCTGTTAAGTCCAGCGATTTAAGAACATTGATATAATTCATCGGCAGAATCAAAGCTGCTTTATTTTCATCACTGATAATTCCCAACGTTAATTCTGTGCGCCAGTCCTTAATGGCATGATCTGCATCGTTAAGTAATTTCTGCCGGGTGACTTCTGCCCGCCAAAATTGAACGCCCGGGCGCTGATGCCGGATGATCTGGTCATCGTGGAAAGCGACCCTGAAAAAATCGACACTTTAGCTGTAAAATGACAGTCCCGCCATCCGGTCATCATAACGGATTTTTCTTCTGCACCTTCTGAAGCCCGCCATGGCAGGACGACCATGAATCCGCCGATAACCTTATTGTGAAATTAAGACCAGGAAGAGATGATGTCTGTCGGACAGATACTATATGTAAATTTATAAAGGTTTTTTGTTATGCCCTTTCATATTGGAAGCGGATGTCTTCCCGCCATCATCAGTAACCGCCGCATTTATCGTATTGCCTGGTCTGATACCCCCCCTGAAATGAGTTCCTGGGAAAAAATGAAGGAATTTTTTTGCTCAACGCACCAGACTGAAGCGCTGGAGTGCATCTGGACGATTTGTCACCCGCCGGCCGGAACGACGCGGGAGGATGTGGTCAGCAGATTTGAACTGCTCAGGACGCTCGCGTATGACGGATGGGAGGAAAACATTCATTCCGGCCTGCACGGGGAAAACTACTTCTGTATTCTGGATGAAGACAGTCAGGAGATATTATCAGTCACCCTGGATGACGTCGGGAACTATACCGTAAATTGCCAGGGGTACAGTGAAACACATCACTTAACCATGGCAACAGAACCGGGAGTGGAACGCACAGATATAACTTACAACCTAACCAGTGATATTGATGCTGCGGCCTATCTGGAGGAATTGAAACAGAATCCAATTATAAATAATAAAATAATGAATCCGGTAGGGCAGTGTGAGTCATTAATGACTCCTGTAAGCAATTTTATGAATGAAAAAGGGTTCGATAATATTCGTTATCGAGGTATATTTATCTGGGATAAACCAACAGAGGAAATACCAACAAATCATTTTGCAGTGGTTGGAAATAAAGAAGGGAAAGACTATGTGTTTGATGTTTCAGCCCATCAGTTTGAAAATAGAGGTATGAGTAATCTGAATGGCCCATTAATTCTTTCAGCAGATGAATGGGTGTGTAAATATAGAATGGCAACAAGAAGGAAACTTATTTATTATACTGATTTTAGTAATTCAAGTATAGCAGCTAATGCCTATGATGCATTACCACGAGAATTAGAATCAGAATCTATGGCAGGGAAAGTTTTTGTTACATCACCGAGATGGTTTAATACCTTTAAAAAGCAAAAATATTCCTTAATAGGTAAAATGTAAGCGCACCGTGGAGGACGTCTGTCAGAACCCTGTCAATCCGGCGATGATAGTGTCCACTTAAATTTTGATGGACACTATCACAGATGACAGAGTCCACAGCCCGGCGCAAACACGGCTGTCGGTCAGGAAAGAGAAAAGCCAGTCGCTGTACGACTGGATACAGGCGCAGTTGAAAACGTTGTCGGTGCATGCGGAGATGGCGAAGGAGTTCGGTTACATGCTGAAGCAGTGTGATGCGTTGAGCGTGTCCTCTGCAGCGACGGTCGGGTGGAGATCGACAACAACATCTGTGAAAACGCCTTACGGTGCGTGGCGCTGGGCCGACGTAACTATCTGTTCTTCGGCTCAGACAGGAGCGGCGAGGCAGCGGCGATCATCTACAGCCTGCTGGGTACGTGCAAACTAAACGGCGTAGAGTCCGAGGCATGGTTACGCGACGTGCTGTGGAAAATCAGCGACTGGTCATCGAACCGGGTGCACGAACTGCTGCCCTGGAACCTCGAAACCGTAAAATAATCCTT
ACGCTACGTCCTAAACGGGACGCTTACTGAGTTATGAGGGTGAGGTGGTTTTCTTAATGAGCATCTGAACGCCGGTTTGTGGGATAGGGCAGGTGGGACGGATTCGGGACAGTCATCCGTTTTTAACTATTGGCATGGATTGGTATCTTTTCGCATCATGGGACGTGTGAGCGCAGGTATGACGCGGTATGTTATTGACTTAGAATGTGGTTCCAGGAACTGGCGCATCCATACCCGAGCAGTTTCCGGTGTCAGTGCCAGCGGGCGACGATCGTGAATGTCTACCAGACCTTTATCGGCTGCGGAGGTAACAATCAGGAATCCCTCTGCTTCATCACCGCGCTCAAACGGCGTACTGCCAATGGCAGCCATGAATATCGGCTTCCCGTCCTTTCTGTGAATGAAATACGGCTGTTTTTTGTCGCCTTCCTTCTTCCACTCGAACCATCTATCGGCAAAACAGATAGCCCGGCCATGCTGCCATAGTGGCTTAAACATTCTGCTGGAGGCCGCTGTCGCGACACGGGCGTTAATAAGTGGAGCTTTATCCCACCATCCGGGAGCGTAACCCCAAATCACCGGGTCGAGATGTAATTGCTCGTCGCGTTCGCTCAATAGCAGGACTTTAGTCCCGGGCGCCACGTTATACCGGCCTATAGGCTGAGGGTCATAAGCAATATTACGATCGGCTTCGTCGGCCAGATATGCCAGATATTCTTCACGGGTCTGTGCTTGTGCAAAGCGTCCACACATATGAAACCTCCAGTCGTCAGACTGAAAGTATAGGGCAGGGGGAAAATGTGGCGCGCTCCGGTAATGATTTACAGGGAATTTATACAGTAATTCGAAATGAAAATTTGTGTATTGATGAATTCCGAAACTGAGCAGTGAGCAAACTGATGAATCAATCGCAAATTGCGTCGAATCTCGACGGCTTTATTCCCCAGTTTCACCCCATAGCTTCCCCGTAGGAAATTGAGCCATAAAAAAAACAGCCCTGACAGGCTGGTTTTTAAGGGGAATTTTGGTCGGCACGAGAGGATTTGAACCTCCGACCCCCGACCCCCCATGTTGGCGGTAGCCTGAATAATTATCTTGGTAAAGGTTAACTATCATAAAATGGTACACCAGTCTTTCCAGGAGGAGGAGTGTAAAGGTTTTGGCTCATAAACACCGTCATAGTAAAGACCACAAATACTGCAATCTTCATTTTTATATTTATCATTGAATGCTAAGGTAAGCTTGTCATAGACATCAATATTTGATTGCCATTCAGGTGAGGTATTCATTGAATCATAAATTAATACTTTTTTCACTTCACTATCATAGCCTACAATGCACTCTGCATGTAGACATTCCGAGCCAAGTGAAGGCCTGATCAACATCAGTGGTCCATGGTTTTGTAATTCGTAAGAAATAAAACTCTCAAAGTCATCTTCTACAGTACTATTGAAAACTTCTGCCTGTAAAGCTTCTTTAATGTTTGTAAAAAGAGAGGATTCAGGTATTTTTTGCACATTCATTTCTTTTAATAAATTCTCACATTCTATTATATCAATACCCTCTAATATATTATTAAACGCTTGATTATCAGAAGTGATGTCCTCTAATGCACTATAATTATTGCCATCTCTGGATTTGATAACATTTAATGAGCAAACCCAGCAATTATTGTAGAGGGTGTTGCCTGTGGTATCAAACTGGCTTTGAAAGTTACGATGGTGAATAATCTGACCCTGAGGAGTTGCTGTATTACTTCTGTAAACGCTGCCTAAACTATTTTGAATGTGTCTTAACATAATATACTCGCCGAATAGTAATTTTGTTAATGTAATTATATACTACAGTGTGGATATTAATACAATTCTTTTGTTGTTAATTATTATTTATGAAATTAATTGAAAGTGAATAAGTTAGAGGTGTTTGTTGGCCTTAAAATTACATTTGTTGAGGGGGCTTATATGATATGTTTTTATTGTATTGTCGCATTTTTCTTAAGC
TGAATCCGGATTTTGGGGAGGTGGCTAAATGTAAATGACGTGGTTTAAGATAAATCTATTTTTAATAAGCTATCTGTTCAAATTTTCGCGATCGCTTTTGTTGGTATCACTATTCAAGCAGTTTGCCTGCATCGGCTTCACCCTCACTTCGGCATCAGGGAAAATCTGGTGCACCTGCTTCGTCAGTTCGGCCAGAATGATCTCGCTGGCCCCTTCGAGTCCTTCAACATTACGCTTGTCATAAACCAGTTCTACGAACATACGTGTTTCGCTAATAACTGTTTGTATATACAGTATTTTTGCTTTGGCGGTTTTGTCTGTCAAGGCATGAACCACTTGTTTTTAAATTTTGGGGAACATACTGCGGGCGTGTTTGTTATCGATTTTCCCTGCAGGGCTGATGGGGTCTGGCGTTGACTAAAATTATGTGTGGGGCATGGATGGGGCAAAAGTGGTCTGTGAAGTTCGTTAAAGTTCGTTAATCAAGCTTCATCTCGATCTCGCTCATCCCTTGTTTAAAGCGCTCCTGGACGATCTTTATCGATTTTAAAAACTATGAGTACATATTATAAAAATGTAGCAAATAGGCCGTTTGTGCCTGAAAAGATGAACATTCTGCGTAGCGCGATTTGCGCAACAGGAATAGACTGGAGTCGACACTCTACACAAAGATGCGAAAGGTTTTTTATGACACAACAGCCACAAGCCAAATACCGCCATGACTATCGCGCGCCGGATTACCAGATTACTGATATTGACTTGACCTTTGACCTCGATGCCGAAAAAACCGTGGTCACCGCAATAAGCCAGGCTGTTCGTCATAGCGCGCCTGATGCGCCTCTTCGCCTTGATGGGGAAGATTTAACGCTGGTATCTATCCACGTCAACGATGCGCCGTGGACAGCATATAAGGAAGAAGAGGGCGCGCTTATCATCAGCGACCTGCCAGAGCGTTTTACGTTACGCATTGTCAACGAGATAAGTCCGGCGGCGAATACGGCGCTGGAAGGATTGTACCAGTCCGGCGATGCGCTCTGTACCCAGTGTGAAGCGGAGGGCTTCCGCCATATTACCTGGTATCTTGACCGCCCGGACGTACTGGCGCGATTTACCACCAAAATTATTGCCGATAAAAGCAAATATCCGTTCCTGCTCTCCAATGGCAACCGTGTTGCACAGGGCGAGCTGGAGAATGGCCGTCACTGGGTTCAGTGGCAAGATCCGTTCCCGAAACCGTGTTATCTGTTTGCGCTGGTGGCCGGTGATTTTGACGTGCTGCGCGATACCTTTACCACCCGCTCCGGGCGTGACGTCGCATTAGAACTGTACGTTGACCGTGGCAATCTGGATCGCGCGCCGTGGGCAATGACCTCGCTGAAAAATTCCATGAAATGGGATGAAGCGCGTTTTGGGCTCGAATATGACCTCGACATCTATATGATTGTCGCGGTGGATTTCTTTAATATGGGCGCGATGGAGAATAAAGGTCTCAATATCTTTAACTCCAAATACGTGCTGGCGCGAACCGATACCGCGACGGATAAAGATTATCTCGATATTGAGCGCGTGATAGGCCATGAGTATTTCCACAACTGGACCGGCAACCGCGTCACCTGCCGCGACTGGTTCCAGTTGAGCCTTAAAGAGGGGCTAACCGTGTTCCGCGATCAGGAGTTTAGCTCTGATTTGGGGTCACGCGCGGTGAACCGCATCAGTAACGTGCGTACCATGCGCGGTTTACAATTCGCGGAAGACGCCAGCCCGATGGCGCATCCTATCCGCCCGGATAAAGTAATCGAAATGAATAACTTCTACACCCTCACCGTTTATGAAAAGGGCGCGGAAGTCATTCGCATGATCCACACGTTGCTGGGTGAGGAAAATTTCCAGAAGGGGATGCAGCTTTATTTTGAGCGCCATGACGGCAGCGCCGCGACGTGTGATGACTTCGTACAGGCGATGGAAGATGCTTCTAATGTCGATTTGTCCCATTTCCGCCG
CTGGTACAGTCAGTCCGGCACGCCGATTGTAACGGTAAAAGATGATTATAATCCGGAAACCGAGCAGTACACGTTGACCATCAGCCAGCGCACTCCGGCGACGGCGGATCAGGCGGAGAAGCAGCCGCTGCATATTCCATTCGCCATCGAACTGTACGATAACGAAGGCAACGTCATTCCGTTGCAAAAAGGCGGTCACCCGGTCAACGCCGTGCTGAACGTCACGCAGGCGGAGCAGACATTTACCTTCGATAATGTTTACTTCCAGCCTGTTCCGGCCTTGCTGTGCGAGTTTTCAGCGCCGGTGAAACTGGAATATAAATGGAGCGATCAGCAGTTGACGTTCCTGATGCGCCATGCGCGCAATGATTTCTCCCGTTGGGATGCGGCGCAAAGCCTGCTGGCCACATACATTAAACTGAATGTGGCGCGTCATCAGCAGGGGCAACCGCTATCGCTTCCGGTGCATGTCGCTGATGCGTTCCGTGCAGTACTGTTGGATGAGAAAATCGATCCGGCGTTGGCCGCAGAAATTTTAACGCTGCCTTCGGCCAATGAAATTGCGGAGCTGTTTGAGGTCATTGACCCGATCGCCATTGCGCAAGTTCGTGAAGCGCTAACGCGTACGCTGGCGGCAGAACTGGCGGATGAGTTCCTGGCTATCTATAACGCCAATCATCTGGATGAGTATCGTGTTGATCACGGCGATATCGGTAAGCGCACGCTGCGCAATGCTTGCCTGCGCTTCCTGGCGTTCGGCGAGACGGAGCTGGCTAATACGCTGGTCAGCAAACAGTATCGCGACGCCAATAATATGACCGATGCGCTGGCGGCCCTGTCTGCTGCGGTGGCGGCGCAGTTGCCGTGCCGCGATACGCTGATGCAGGAGTATGACGATAAGTGGCATCAGGACGGCCTGGTGATGGATAAATGGTTTATCCTGCAATCCACAAGCCCGGCGGAAAATGTACTGGAAACCGTACGCGGCCTGCTCAAACACCGTTCTTTCAGTATGAGCAACCCGAACCGCGTCCGTTCATTAATTGGCGCGTTTGCTGGCAGCAACCCGGCGGCGTTCCATGCGCAAGACGGTAGCGGATACCAGTTCCTGGTCGAGATGCTGACCGATCTGAATAGCCGTAACCCGCAGGTAGCATCTCGCCTCATTGAACCGCTGATTCGTCTGAAACGTTACGATGAAAAGCGTCAGGAGAAAATGCGTGCGGCGCTGGAGCAGTTAAAAGGACTGGAGAATCTTTCCGGCGATCTGTACGAGAAGATAACTAAAGCGTTAGCCTGA
Protein sequences of DBSCAN-SWA_2 >CP028151|1010423:1021648|1012090_1013800_+|AWP49638.1|tail|DBSCAN-SWA MIIGFGNNVVSSLAADITASQTTIQVMPGVGAMFANLLTSDYANSSNPLKTYAKITLTDAKETVFEVCHLTAVNNDMLTVIRGQEGTTAKGWSLNDVIANFATRGSENQFVQIEELQSGHYVAGVAGGTENNLTLELPATYFVNGGVDWTLRTPLVVIPALNNTGASTLQLTMGGRVLGIFPLYKGNKAELSANDIIKDIPVLCVLDNTKTYFSVLNPLEIYLGSRYLQKDQNLSDVPDKAKGRSSLEVYSKTESDENYMAKSQCGADIPNKPLFVQNIGALPASGTAVAANRLASRGALPALTGTTRGSDSGLIMGEVYNNGYPTQYGNILRLTGTGDGEILIGWSGTNGAPAPAYIRSHRDTADAEWSEWAMLYTTLNPPPDSHPVGAAIAWPSDATPAGYALMQGQSFDKSAYPLLAIAYPSGVIPDMRGWTIKGKPISGRAVLSQEMDGNKSHSHTARAQVTDLGTKSTSSFDYGTKSTNTTGNHTHQFGGYINSYWGDSNHTSFQPGGGAWTQAAGDHAHTVYIGGHEHTMYIGPHGHVVIVDADGNAETTVKNIAFNYIVRLA >CP028151|1010423:1021648|1014858_1015827_+|AWP49641.1|DBSCAN-SWA MPFHIGSGCLPAIISNRRIYRIAWSDTPPEMSSWEKMKEFFCSTHQTEALECIWTICHPPAGTTREDVVSRFELLRTLAYDGWEENIHSGLHGENYFCILDEDSQEILSVTLDDVGNYTVNCQGYSETHHLTMATEPGVERTDITYNLTSDIDAAAYLEELKQNPIINNKIMNPVGQCESLMTPVSNFMNEKGFDNIRYRGIFIWDKPTEEIPTNHFAVVGNKEGKDYVFDVSAHQFENRGMSNLNGPLILSADEWVCKYRMATRRKLIYYTDFSNSSIAANAYDALPRELESESMAGKVFVTSPRWFNTFKKQKYSLIGKM >CP028151|1010423:1021648|1018417_1018609_-|AWP49644.1|DBSCAN-SWA MFVELVYDKRNVEGLEGASEIILAELTKQVHQIFPDAEVRVKPMQANCLNSDTNKSDRENLNR >CP028151|1010423:1021648|1019035_1021648_+|AWP49645.1|DBSCAN-SWA 
MTQQPQAKYRHDYRAPDYQITDIDLTFDLDAEKTVVTAISQAVRHSAPDAPLRLDGEDLTLVSIHVNDAPWTAYKEEEGALIISDLPERFTLRIVNEISPAANTALEGLYQSGDALCTQCEAEGFRHITWYLDRPDVLARFTTKIIADKSKYPFLLSNGNRVAQGELENGRHWVQWQDPFPKPCYLFALVAGDFDVLRDTFTTRSGRDVALELYVDRGNLDRAPWAMTSLKNSMKWDEARFGLEYDLDIYMIVAVDFFNMGAMENKGLNIFNSKYVLARTDTATDKDYLDIERVIGHEYFHNWTGNRVTCRDWFQLSLKEGLTVFRDQEFSSDLGSRAVNRISNVRTMRGLQFAEDASPMAHPIRPDKVIEMNNFYTLTVYEKGAEVIRMIHTLLGEENFQKGMQLYFERHDGSAATCDDFVQAMEDASNVDLSHFRRWYSQSGTPIVTVKDDYNPETEQYTLTISQRTPATADQAEKQPLHIPFAIELYDNEGNVIPLQKGGHPVNAVLNVTQAEQTFTFDNVYFQPVPALLCEFSAPVKLEYKWSDQQLTFLMRHARNDFSRWDAAQSLLATYIKLNVARHQQGQPLSLPVHVADAFRAVLLDEKIDPALAAEILTLPSANEIAELFEVIDPIAIAQVREALTRTLAAELADEFLAIYNANHLDEYRVDHGDIGKRTLRNACLRFLAFGETELANTLVSKQYRDANNMTDALAALSAAVAAQLPCRDTLMQEYDDKWHQDGLVMDKWFILQSTSPAENVLETVRGLLKHRSFSMSNPNRVRSLIGAFAGSNPAAFHAQDGSGYQFLVEMLTDLNSRNPQVASRLIEPLIRLKRYDEKRQEKMRAALEQLKGLENLSGDLYEKITKALA >CP028151|1010423:1021648|1011467_1012094_+|AWP49637.1|DBSCAN-SWA MAALLESIIPAYPYTQYNDDPDIVAFFDAYNKLAQGYLDYFNNLNLPCWTSPAITGELLDWIAAGIYGESRPLLQISEDAIARGAYNTIEYNNVAYAKLRNYVPGSASYVPDDYFKRILTWNFYKGDGSHFCINWFKRRLARFIHGANGIDPPVQSTFDISVMPDKGIFFVSIPDYGDGVGHFLKDAIDQSLVKLPFIYTYSVTVVEQ >CP028151|1010423:1021648|1010854_1011484_+|AWP49636.1|DBSCAN-SWA MYGVQGTPDCYRIELKNVYGVQENLISYRQASLGAWVAIAGGGDPYEVAYAIYKAVPDISVLTNDVVNPSGAAVDKKTIPIIVYPDTYHVPFVVPSSQNVTLLITWNTASTSYIDPTGIEKAVQQSIADYINGIATGEPINIFLIRDIFLNQVKGLVSSNLVSMIDIQVGINGKIVPPATDSSLVYGDTYAYFSTSSSQIQVKQYGSSS >CP028151|1010423:1021648|1014384_1014633_-|AWP49640.1|tail|DBSCAN-SWA MRHQRPGVQFWRAEVTRQKLLNDADHAIKDWRTELTLGIISDENKAALILPMNYINVLKSLDLTGVSDEATFTAIRWPALPQ >CP028151|1010423:1021648|1010423_1010834_-|AWP49635.1|DBSCAN-SWA MENTNIVTTEQQAPNTISASNAIFNVQALGQLTAFANLMADSQVTVPAHLAGKPADCMAIVMQAMQWGMNPYACWSLSAGANPNFIATQMGHTDAQMVYKVYGKWMSEKSAEQVSLLNQALSRYAPSLPQSMVAAQ >CP028151|1010423:1021648|1017460_1018147_-|AWP49643.1|DBSCAN-SWA 
MLRHIQNSLGSVYRSNTATPQGQIIHHRNFQSQFDTTGNTLYNNCWVCSLNVIKSRDGNNYSALEDITSDNQAFNNILEGIDIIECENLLKEMNVQKIPESSLFTNIKEALQAEVFNSTVEDDFESFISYELQNHGPLMLIRPSLGSECLHAECIVGYDSEVKKVLIYDSMNTSPEWQSNIDVYDKLTLAFNDKYKNEDCSICGLYYDGVYEPKPLHSSSWKDWCTIL >CP028151|1010423:1021648|1013799_1014381_+|AWP49639.1|tail|DBSCAN-SWA MTFKMSEQAQTIKIFNLRSDTNEFIGAGDAYIPPHTGLPANCTDIAPPDIPASHIAIFDAETQTWSLHEDHRGEMVYDTTTGNQVYISAPGPLPENVTSVSPGGEYQKWDGKAKVWVKDEAAEKAAQLRQAEETKSRLLQMASEKIAPLQDAVDLGIATDDEKARLDEWKKYRVLVNRMDTAAPDWPERPASQ >CP028151|1010423:1021648|1016474_1017101_-|AWP49642.1|DBSCAN-SWA MCGRFAQAQTREEYLAYLADEADRNIAYDPQPIGRYNVAPGTKVLLLSERDEQLHLDPVIWGYAPGWWDKAPLINARVATAASSRMFKPLWQHGRAICFADRWFEWKKEGDKKQPYFIHRKDGKPIFMAAIGSTPFERGDEAEGFLIVTSAADKGLVDIHDRRPLALTPETARVWMRQFLEPHSKSITYRVIPALTRPMMRKDTNPCQ |
11 | Salmonella_phage(40.0%) | tail | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_3 |
1187776 : 1243509
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >CP028151|1187776:1243509|DBSCAN-SWA TTTACATGCGTTCTACGGTTTCGATACCCAACGTATCCAGACCCAGTTTCAGCGTCTTCGCCGTCAGTTGCGCCAGTTTTAGGCGGCTATTGCGAACCGCGTCATTCTCGGCGCTGAGAATAGGGCAGTGCTCGTAGAAGCCGGAGAACAGCCCGGCGACGTCATACAGATACGCGCACATGACGTGCGGCGTACCTTCACGCGCGACCACCGTCAGGGTTTCTTCAAACTGCAGCAGACGGGCAGCGAGTTGCGCTTCGCGATCTTCGCTAATGATAACCGGCGCGCTCGCCAGCGCCTGCTCGTCGATGTCCGCTTTACGGAAAACAGACAGCACACGCGTATAGGCATACTGCATATAAGGAGCGGTATTGCCTTCGAAAGCCAACATATTATCCCAGTCGAAGATATAGTCAGTGGTGCGGTTTTTTGACAGATCGGCGTATTTCACCGCGCCAATACCGACGGCGTTAGCCAGTTTTTCCAGCTCGTCAGCCGACATATCCGGGTTCTTCTCCGCCACCAGACGACGCGCGCGCTCCAGCGCCTCATCCAGCAGGTCCGCCAGTTTCACCGTACCGCCCGCGCGAGTTTTAAACGGTTTACCGTCTTTACCCAACATCATGCCGAACATGTGGTGTTCCAGCGGAACGGAATCCGGCACATAACCGGCTTTACGCACAATGGTCCAGGCCTGCATCAGGTGCTGGTGCTGACGGGAGTCAATGTAGTAGAGCACGCGGTCGGCGTGCAGCGTTTCGTAACGGTATTTGGCGCAGGCGATATCGGTAGTGGTATACAGATAACCGCCATCTTTCTTCTGAATAATGACGCCCATCGGGTCGCCTTCCTTGTTTTTAAACTCATCAAGGAAAACCACCGTTGCGCCTTCGCTTTCAACCGCCAGGCCTTTGGCTTTCAGATCGGCGACGATGCCCGGCAGCATCGGATTATACAGACTTTCGCCCATCACATCGTCACGCGTCAGAGTGACGTTCAGACGATCATAGGTGATCTGGTTTTGCGTCATCGTGATATCAACCAGCTTACGCCACATCTCACGGAAATAGGTATCACCGCTCTGCAATTTGACGACGTAGTTACGCGCGCGCTCGGCAAACGCTTCATCTTCGTCATAATGCTTTTTCGCATCGCGGTAGAATCCTTCGAGATCCGCCAGCGCCATGTCGCCCGCGTTTTCCTGCTGCTGTTTTTCCAGCCAGGCGATAAGCATACCGAACTGGGTGCCCCAGTCGCCGACGTGGTTAGCGCGAATAACATGATGGCCCAGAAACTCCAGCGTGCGCACTGCCGCATCGCCGATAATCGTGGAGCGTAGATGACCGACGTGCATCTCTTTCGCGACGTTGGGCGCGGAGTAGTCGACGACAATCGTCTGACGAGTCGGCTGAGAAACGCCCAGACGATCGGAGGTCAGCGCCTGTTGCACCTGTTCAGCGAGAAAAGCCGGTTCGAGGAAAATATTGATAAAGCCAGGGCCGGCGATTTCAACCTTGCTGGCGATGCCGCTGAGATCCAGATGAGTCAGCACCTGCTCTGCAAGTTGTCGGGGGGCCATACCCAGTTTTTTCGCAACTGCCATCATGCCGTTGGCCTGATAGTCGCCGAACTGAACTTTGGCTGACTGACGAACCTGGGGCTCGCAATCTGCAGGCGCGCCTGCAGCAATCATGGCCTGACTAACTTTTTCTGAGAGAAGAGCCTGAATATTCACCGGAATACCTTACGTTTATGAAGCGGACTGTATTGCCCGCGCCAGTTTAAGAATTAGGGCGGGAGTATACTGCAAATGCCCGTTGGCGTCAGCATTGCGCGCGCGGCGCGTCGCCTGAAATATACGCAACAATAATCTGCATAAGCATTCAATAAACGCCCAATCAAATCAAAGGTTGGTTGCCGGGGCGC
AACAGCGTAAATTAGCGGTTTTCACAGCAAGAAGAGCGTCCTGATATGGCGAACTGGCAACACATTGATGAACTGCATGATATTTCCGCAGATTTACCGCGATTCACTCTGGCGTTCAGAGAACTTTCCACTCGCCTTGGTCTGCAGATTAGCGCCCTTGAGGCCGATCACATTTCATTGCGCTGTCACCAGAATACGACGGCGGAGCGTTGGCGTCGCGGTTTTGAACAATGCGGTGAGCTACTGTCGGAAAATATCATCAACGGCAGGCCGATTTGTCTGTTTAAACTGCATGAACCGGTATGTGTGGAACACTGGCGGTTTTCTGTTATCGAATTACCGTGGCCGGGAGAAAAACGCTATCCACACGAGGGGTGGGAGCATATTGAAATCGTGCTGCCCGGCGAGCCGGAGACCTTGAACGCGAGGGCGCTGGCGTTGTTGTCGGATGAGGGGCTTAGTCAGCCGGGGATAGTGGTTAAAACCAGTTCGCCACAGGGCGAGCATGAACGTTTGCCAAACCCTACGCTGGCCGTGACCGATGGCCGCATCACGGTAAAATTTCATCCATGGTCGATTGAAGCGATTGTCGCCAGCGAACAGGCGGCCCATTAACCTGCCGAGCGGAGTGAATATGGCGTTACTTGAGATCTGTTGTTACAGCATGGAATGCGCGCTCACCGCGCAGCGAAACGGCGCGGATCGTATCGAACTGTGCGCCGCGCCGAAAGAAGGCGGGCTTACGCCTTCGCTGGGCGTCTTACGCAGCGTGCGCGAGCATATTACGATTCCCGTACATCCGATTATTCGTCCTCGCGGCGGGGATTTTTACTACACTGACGGCGAATTTGCCGCCATGCTGGAAGATATTCGCCTCGTCAGAGAGTTGGGTTTTCCCGGGCTGGTGACTGGCGTGTTGACCGTTGATGGGGATGTCGATATGTCGCGAATGGAAAAAATAATGGCGGCGGCCGGACCGCTGGCGGTGACATTCCACCGCGCCTTCGATATGTGTGCTAATCCCTTCAATGCGCTAAAGAATCTGGCTGACGCAGGCGTAGCAAGAGTACTGACTTCCGGACAAAAAGCCGATGCGGCGCAAGGTTTATCAATAATTATGGAACTTATTGCCCAGGGGGATGCTCCAACCATTATGGCTGGTGCGGGGGTTCGTGCAAATAACCTGCAGAATTTCCTCGATGCCGGAGTACGGGAAGTACACAGTTCCGCCGGAGTCTTACTGCCTTCGCCGATGCGCTATCGCAATCAGGGGTTATCGATGTCTGCCGATATACAGGCGGACGAGTATTCTCGCTATAGGGTAGAGGGTGCGGCGGTCGCTGAAATGAAAGGAATCATTGTTCGCCATCAGGCCAAATGATTTTTACCGTTGCATCATGTCGCCCAATATGATGCTTGCTCGTACCAGGCCCCTGCCAATTCAACAGGGGCCTTTTTTTCTCCTTCATATTTCAAGCCGCAGCAGCGTTGGCCGCCTCGCGCACCCCGGTCACGTAGCTACCTACGTTCCCGGGGATTTGCTGCGTTGCCGCCTTGCTGCAACTTGAACTATTTTGGAGAAACGGTGCTATACGTAACGGTCATTCGGCAATCCATCACGGTTTACGCGCAATCAGCACCGCGCGCTGCGGGGCGGGGTAGCCTTCCACCGTCTTGCTGCGATCGTTCGGGTCAAGAAAATCGGCCAGCGACTCGGTAACCATCCATTCGGTACGGCGCTGCTCTTCGGTGGTGGTGACGCAAACGTCAGCGATACGCACATCAATAAAGCCGCATTTTTCCAGCCATTTTTTTAATGCCGGCGCGGAAGGAATAAAATAGACGTTACGCATCTGCGCGTAGCGGTCACCCGGCACCAGTACGGTATTTTCGTCGCCGTCGACGACCAGCGTTTCCAGTACCAGTTCACCCTCGTTGACCAACTGATCTTTTAACTGCCACAGATGCTCCAGCGGCGAGCGACGGTGATAAAGCACCCC
CATCGAGAAGACGGTATCAAATGCTTTCAGCGCCGGGAGTTGTTCAATGCCCAACGGCAGCAGATGCGCGCGCTGATCGTTACCCAACAGTTTACGCACCGCCTCAAACTGGCAGAGGAAAAGCTGTGTAGGATCGATGCCGACCGCCAGATGTGCGCCCGCGCCAATCATCCGCCACAGGTGATAGCCGCTGCCGCAGCCGACATCAAGGATCGTCCGGCCGGTTAAATCTGACAGATGCGGCAGTACGCGATCCCACTTCCAGTCGGAGCGCCATTCGGTATCAATATCTACGCCATAAAGCGAGAACGGGCCTTTACGCCACGGCATCAGATTACGCAGCAATGTATCAATACGCTTTAGCTGACCTTCGCTGAGCGGCGTTTCGCTTTCAGCCGTGACGCTGTGCAGTAAATCCAGCCGCCAGGGCGTCATTTCCGGTAAAAATTCTACCGCGTTCGACCACTGTTTAAACAAGCCATGCTGCTGCTCGCGTTGCCAGGCGGCAATCTGCGCGGGCAATGTTTCCAGCCAGTGGGAGAGATGATTTTTAGCAATGAGCTGATAAAAGTTACCAAACTCGATCATGCGGCAACTCCGGCTTTTAACGCGACAAGCGACCCGAAGTTAAAGCACTGGAACCACAGTTCACTATGTTCGAAACCCGCCTTACGCAGGCGCGCTTTATGTGTTTCCACCGAGTCGGTGAGCATCACGTTTTCCAGCATACTGCGCTTCTGGCTGATCTCCAGTTCGCTGTACCCATTCGCGCGTTTAAAATCGTGGTGCATGTTGAATAACAGCTCGCCCACTTTCGCGTCTTCAAAGCTAAATTTTTCCGACAGTACCAGCGCGCCGCCGGGATTCAACCCCAGGTAAATCTTATCCAGCAATGCCTGGCGTTCGGCAGGTTCAAGGAATTGCAGGGTAAAATTCAGCACCACCATCGAGGCGTTTTCAATAGTGATATCGCGGATATCGCCTTCAACGACTTCAACCGGCGTAGGCGCTTTATACGCATCAATATGGCGGCGGCAGCGTTCAATCATGGCCGGGGAGTTATCGACGGCGATAATGCGACAATGCTCATGGCGGATATTACGCCGAACAGACAGCGTCGCGGCGCCCAGCGAGCAGCCCAGATCGTAGACCTGCGTGTTCGGCTGCACAAAGCGTTCAGCTAACATGCCGATCATGGAGATGATGTTGGAGTAGCCTGGTACCGAACGCTGAATCATGTCGGGGAAAACTTCGGCTACTCGTTCATCAAAAGTCCAGTCGCCGAGACGGGCGATAGGCGCAGAAAAAAGCGTGTCGCGGTGAGACATAACGTATACATCCGGGAAAAATAAAGGCGCATATTGTGCGCTAATACAGGGAGAAAACCAACTCCCAGGGCATATACCACAGGTTTGCCAGCACCATCAGCAACAATGCGCACCAGGTAGCGCTCATCCCCGCCCGCCGCCAGCGGAACAGGCGGTGGTGAAAACCGTAATAATGCATCAGCCGACCGGCGATTAAAATGATGCCGCAGATATGCACCATCCACGTTTCGGCGCCGTTCATTTCCATAAACAGCAGCAGCACCAGCGCGACAGGAATATATTCCACCGCGTTGCCATGAATACGGATGGCGCTTTGTAATTCGCTAAAACCGCCGTCGCCATAGGCGACACGGTACTGCATTCGCAAGCGAACGACATTAAAAGAGAACTTCATTAATAATAACGCACCCAAAACGGCATACAGCGCGCTTACCATACAAACTCCCTTTTAAAATGGCTGATGGCACTGTCTATGATAGGGGGCATTTTCAGAAAAGAGAAGATTGCTGTGGAATGGACGGAGCAGTGCCGATTTCCGGTAAGACGCTGCGTAAATCATGCCAAAGTGTATTCACCAGTTCCGGCGCCTGCGCGATATCCGGAGTATGTAAAAACAGAAAAGGCGTGGTGGTCTGCCGCCACTGCGGGAGTTTCTGCAACCAGGCGGCGAAA
AACTCACGGTTTTGCGCCATATTATCGCTACCGATAAAACGCACCATCGGATGGCTAGCCGTCACGACAGCATGTACCGGAACTTTGGGTTTTTTTCGCTGCGCGTCGCGGACGGCCTCGCTGTGGGGATGGGCGGCATGTACCGGACGACTATCCAGGATCACGCGGTTTACGCCGCGCGCGTGTAACCCGCGATTGAGCCGCTGTTCATCCTCACCTTTGTCAAAAAAACACGGATGACGAACTTCTACGCCATAGGTAAACGTGGCGGGGAGCGCATCAAGAAACTGCCATAAGGCGGGGAGATCGCGCGGTCCAAAGGCGGCGGGCAACTGCAGCCAGTATTGGCCAATACGCGTTTCCAGCGGCGCCAGGCGGGTAAAAAACGCCTGTACCAGATCGTCGCAATGGCGCAGCGCGGCCTGGTGAGAAATGGTCGCGGGAAATTTAAAGCAAAAACGGAAATCGTCCGTAGTTTGCGCATACCAGCGATCGACAATTTCCGCTTTCGGCAGCGCGTAAAGCGTGGTGTTGCCCTCCACGCAGTTAAAGTGGCGGGCATACTCTTCCAGACTGGTGATGCCGAGCCGCGCCCATTTCGGGTGCGACCATTGCGGCAAGCCAATGTAGATCATAAGGCGTTTAAAATATCCTCTACGCTGCGTACCCGGCCAATGCGCGGGAAAATATGCGTCATACTGCTCTGATGCTGCTCACTGCTGGCGGCGCTACAGGCATCTTCGGCAATAATAAGGTTAAATCCCAACTCCCAGGCGTTACGCGCTGTGGATTCGACGCCGATATTGGTTGAGATCCCGCACAATATGATGGTGTCGATACCCCGGCGGCGAAGCTGTAATTCCAGATCGGTGCCGTAAAACGCGCCCCACTGGCGTTTAGTCACTTCCAGATCGCTGTCCTTTTTACCCAATGCCGTAGGCCAGGTCCACCAGTTTTCCGGCAACGCGTGCGCAGGCGTTGCGGCATCCACGGGTTGTTTTAACGCTTCGGCATAATCATCAGACCATCCGACGCGTACCATAACCACGGGCGAGCCGTTGGCACGACACTTTTCCGCCAGCCGCGCGGCGCGGGCGACCACCTCATTTGCCGTATACGGGCCGCCGGCAAACGGAAGAATTCCTTCCTGTAAATCAATAACCACCAGGGCGGTTGTGGTCGCGTTGAGTTTCAGCATGATAATCACTCTCCTAAAAATAGCGTATTTCACTACACCTTAAGCGCGATTAAAGAGGAAAGCGGTGACAAATATTGTTAATTTTTGTGAGCATACGCAATACCCGCGTGAAAAACCCGGGTACAGTCAGTATGAACCAAGTGTCGATACCGCATGAAACGGGCAAGTTATCGCTCCCATGCTGGATTGACTCTTATCGTGCGGCAGAATTTCCAGTATAATAGCCGCCTTTTTTCATCCAGTTGTGACATACAGCTAAAGCTGCGACTTCGTCGCCTGCATGGCAGGCAACAATCCGCCTGCGGCTAAGTTAAGGGATATCTCATGCGTACAGAATATTGCGGACAGCTACGTCTGTCCCACGTGGGGCAGCAGGTGACTCTGTGTGGTTGGGTCAACCGTCGTCGTGATCTTGGCAGCCTGATCTTTATCGATATGCGCGACCGCGAAGGTATTGTGCAGGTGTTCTTCGATCCGGATCGTGCGGACGCGTTAAAGCTGGCCTCTGAACTGCGTAATGAGTTCTGCATTCAAGTTACGGGCACCGTGCGTGCGCGTGACGCGAAAAACGTCAATGCGGATATGGCGACCGGCGAAATTGAAGTGCTGGCGTCCTCTCTCACTATCATCAACCGCGCAGACTCACTGCCGCTTGACGCTAACCACGTTAATACCGAAGAGGCGCGTCTCAAGTACCGCTATCTGGATTTGCGTCGTCCGGAAATGGCGCAGCGCCTGAAAACTCGCGCCAAAATTACCAGCCTGGTGCGTCGTTTTATGGACGATCACGGTTTCCTTGATA
TTGAAACGCCGATGCTGACCAAAGCCACGCCGGAAGGCGCGCGCGACTATCTGGTGCCTTCGCGCGTGCACAAAGGTAAATTCTACGCGCTGCCGCAGTCGCCGCAGCTGTTCAAACAGCTCCTGATGATGTCCGGTTTCGACCGTTACTATCAGATAGTCAAATGCTTCCGTGATGAAGACTTACGTGCTGACCGTCAGCCGGAGTTTACTCAGATCGACGTCGAGACCTCCTTCATGACCGCGCCACAGGTGCGCGAAGTGATGGAAGCGCTGGTGCGCCATTTATGGCTGGAAGTGAAAGGCGTGGATCTGGGGGATTTCCCGGTCATGACGTTTGCCGAAGCGGAACGTCGTTACGGTTCCGACAAACCAGACCTGCGTAACCCGATGGAACTGGTAGATGTCGCTGACCTGCTGAAATCGGTAGAGTTCGCGGTCTTCGCGGGCCCGGCTAACGATCCGAAAGGCCGCGTGGCAGCGCTGCGTGTGCCTGGCGGGGCGCAGCTTAGCCGTAAGCAGATCGACGATTACGGTAACTTTGTTAAGATCTACGGCGCGAAAGGCCTGGCGTATATCAAAGTTAACGAGCGCGCGAAAGGTCTGGACGGGATTAACAGTCCGGTGGCCAAGTTCCTGACCGCCGACATCGTCGACGCTATCCTTGAACGTACCGGCGCGCAGGACGGCGACATGATCTTCTTCGGCGCAGATAACAAAAAAGTGGTTGCCGATGCGCTGGGCGCGCTGCGTCTGAAACTGGGCAAAGACCTGAGCCTGACCGACGAAGACAAATGGGCGCCGCTGTGGGTGATTGACTTCCCGATGTTTGAAGACGACGGCGAAGGCGGTCTGACCGCGATGCACCATCCGTTCACCGCCCCGCGTGACATGACGGCGTCTGAACTGAAAACTGCGCCGGAAGAAGCCGTCGCTAACGCTTACGATATGGTGATTAACGGCTATGAAGTGGGCGGCGGTTCGGTGCGTATTCACAACGGTGAAATGCAGCAAACCGTATTTGGTATTCTCGGTATCAATGAGCAGGAGCAGCGCGAGAAGTTCGGCTTCCTGTTGGATGCGTTGAAATACGGTACGCCGCCGCATGCGGGCCTGGCGTTTGGTCTGGACCGTCTGACCATGCTGTTGACCGGCACCGACAATATTCGTGATGTTATCGCCTTCCCGAAAACGACCGCCGCCGCGTGTCTGATGACCGAAGCGCCGAGTTTCGCCAACCAGGCAGCGTTGACGGAGTTGGGGATTCAGGTTGTGAAGAAAGCCGAGAATAACTGATATGAACAATGTACACGACATCATTCGCGTGATATCGCGGCGGCCCGCAAGGGCGAGTCTCATCAATGCACGGCGTCACTCAACAGCCAACAACGAGGTCGCGTGAAGGATAAAGTGTACAAGCGCCCCGTTTCGGTCCTGGTGGTTATTTTCGCGCAGGATACGAAACGGGTGCTAATGTTGCAGCGACGCGACGACCCGGATTTTTGGCAGTCAGTGACTGGCAGCATAGAAGAAGGGGAGACCGCGTTGCAGGCCGCCGTGCGTGAAGTCAAAGAGGAGGTCACGATTGATGTTGCGGCAGAGCAACTGACCTTAATCGACTGCCAACGTACGGTGGAGTTTGAAATTTTTTCACATTTACGTCATCGCTATGCGCCGGGCGTTATGCACAATACAGAATTCTGGTTCTGCCTTGCGTTACCGCATGAGCGGCAGGTGATATTCACTGAACATCTGACGTACCAGTGGCTTGATGCGCCTGACGCGGCGGCGCTTACCAAGTCCTGGAGTAACCGGCAAGCGATTGAAGAGTTTGTCATTAACGTCGCCTGAAAAGGCGCGCTTTTTTTGAGGAAATATTTATGGCAGGTCATAGTAAATGGGCCAACACCAGACATCGTAAAGCTGCGCAGGATGCCAAGCGCGGTAAAATCTTCACTAAAATTATTCGTGAACTGGTAACGGCAGCTAAATTGGGCGGCG
GCGATCCGGATGCCAACCCGCGTCTGCGCGCGGCCGTAGATAAAGCGCTGGCCAACAACATGACCCGCGACACTCTGAACCGTGCCATCGCTCGCGGTGTGGGTGGTGATGAAGACTCGAACATGGAAACCATCATCTATGAAGGTTATGGTCCTGGCGGCACGGCCATTATGATTGAGTGTCTGAGTGACAACCGTAACCGTACCGTAGCGGAAGTGCGTCATGCTTTCAGCAAGTGCGGCGGTAATCTGGGTACCGATGGCTCAGTGGCGTACCTGTTCAGTAAGAAAGGGGTCATCTCCTTTGAGAAAGGCGATGAAGACACCATCATGGAAGCCGCTCTGGAAGCTGGCGCGGAAGATGTTGTGACCTATGATGATGGCGCGATTGATGTTTACACTGCCTGGGAAGAGATGGGCAAAGTGCGTGACGCGCTGGAAGCGGCGGGTCTGAAAGCGGACAGCGCCGAAGTGTCCATGATCCCGTCAACCAAAGCGGATATGGATGCGGAAACCGCGCCGAAACTGCTACGTCTGATCGATATGCTGGAAGACTGTGACGATGTCCAGGAAGTCTACCATAACGGTGAGATCTCTGATGAGGTGGCGGCGACCCTGTAATGGTGCTGTTAAACGCCGGCGGGCAGGGGGGATAGCGTGTCGATTATTCTCGGTATTGACCCCGGCTCGCGCATTACCGGTTATGGCGTTATCCGCCAGGTAGGCAGACAACTGACCTATCTGGGCAGCGGATGTATTCGCACCAAAGTCGATGATTTGCCGTCTCGTTTGAAGCTCATATACGCAGGCGTGACGGAAATCATCACGCAGTTCCAGCCGGACTATTTTGCTATCGAACAGGTGTTTATGGCGAAGAACGCCGATTCAGCCCTCAAGCTGGGACAGGCGCGCGGCGTCGCGATTGTCGCCGCGGTCAATCAGGAACTGCCCGTGTTTGAATACGCGGCGCGTCAGGTGAAGCAAACCGTCGTCGGTATTGGTAGCGCGGAAAAAAGCCAGGTACAGCATATGGTGCGCACGTTGCTCAAACTGCCCGCTAACCCGCAGGCAGACGCCGCAGATGCGCTGGCTATCGCCATTACGCATTGCCATGTTAGCCAGAACGCGATGCAAATGAGCGAGAGCCGACTCAATCTGGCGCGGGGAAGGTTGCGCTAAATCATTCTCCCTCAGCCTCGCGTTGAGGGGTTTCAATTCCACAGGCGCTAATTAATTCTAAATTGGGATGATGCCACAGGCTGGCGGGCGTCACCGTCTTACGATCCCACGGGATTGAACCTAAAAACCAGAATTTCCAGAAGGTGAGTTTCGCATCAGGATTGCTGTGAAGTAATTCTTCGAATGTCTCAATATCGCCTACCGGAATACACAATGCTTCTTTATAGATATCAAACACAAATTTCGAGCAGAACTGGCGCGATGACTCGTATTTAAACCCCGTGTGATAAAACTTATTCAGCCGGGCGGGAACCTGCTCCATGATGGCGAGTTTCTGCTCGACCGTCAGGCCGCCGCGCAGACGGCGAACGGCGTAACGTTGTCCGGCTGAACGTTGTATAAACAGTGACAGCGTGGTGACGGTTGAGAGGGGGACGCGACTTTCCGCCACCAGATAGTCATCACCATTATGACCAATAATAATCCCAACATGGTTGCTCCAGCACTGTGAAGCGGTGGATATCTGGCCGAAAAGCGCGGCACCTATACAGGTAAAGACGATATCCCCGGTTTCATATTGGACAGAATAATGTTTTTTCATCTCTGCATTCCATTATTTGAAATAAATTCCTTTTATTTATTCTACAGGGTAGTTAGCATTATACCAGATAAATGACAAGCCTTAGTGTCATCGAAATATGTATGGTGAATATTTCATTTATTGGAATTAATGTCGCAAATATATTTGGGGGGGATTAAGGTGGTAAATTTCTCATCATCTGATCTCTCATATTGACAGGGAGTATGACGTGGTTGATGATA
AACCAATGAGAATAGGTTGGGTATTTTATTTTTCTTATATTTTTGTTGCTATTATCTTTCATCTGTTTATTAGTTTTTGTTATTGTTTGAGTATTGATATGGCAGAGGCTAATACTGTTGTTTTTTTATTAAAGCCAGGGACGATTTCATTATTATTTTTGCTTCTCCCTGCCAGGCGGTTCAGGACCAGACTGTTGGCAACATTGAGTTCAGTTTTCATTACCTTAGTATTTAACCAGTGGCATTTGGTTGCCGGGAATAAAGAACTGGTTTTGTGCCTGCAGGCAGCGTGCTTTATGGCTTTCCTGGCTATGACATCGGTAAAAAAATCCGGGTGGATGATTAGCGCATCATTATTTTTGGTTTGTGCGGCCGGAACGATTCGCCAGTGCTGGCTGGAACAGCTGTTTAACGCTGCCGATATATATATTGTTGATGATGGGCGGAGCAGTGGCGCCAGCGGCCACTGCTTCCAGTATATTGCAGCAAAAGGCCGTGGGTTAGCAGCAAAACGACAGGCACTTTTTAGTAGCGAAGAGTATGTGAATATTTATTATAGTTATTCAGAAGGGATGCCAGCCGTGAATTTCGATGGCATGAAAAATGAGTTTGTACAATATTTATTGTGTCATGGCGAATTAAAATCTGTTGCGCGCGACGATAAAACAACATGCGATTAAATTCAGCGCTAGTGGTTAATATTATTAAATCTATTCTCGGGTCTGTTTTTTTGGAGGCCCGATGTTCCGCTGTTGGCATTCGTACTGGATACATTATTAGTGTTATCAGGATACTCATTATTGAAGACTACCGAACGTGCGCCACGGCATGTTGGCAAAAAGTTGTTCGCGTAGGACGCAAAAAGTTGTCAATGAGTGTACGACTTATTTTGTACGCAGCGGGGAGCGTACCCGTCGTTAACCATCGCGGGTTGTCGGAGTAGAGAAATATCGGGATACGTGTTTTTCATCTGGATAAACATCCAGTCTTTTTTTATGATAGTGCATTATCTTTTCAGCCCTTTACGCAGGAGCGTCATGTGATAGGCAGACTCAGAGGCATTATTCTCGAAAAACAACCCCCGATCGTGCTGCTGGAGACAGGGGGAGTAGGCTATGAAGTACATATGCCTATGACCTGCTTTTACGAGCTGCCGGAGGCCGGGCAGGAGGCAATCGTCTTCACCCACTTCGTGGTGCGTGAAGATGCGCAGTTGCTGTACGGATTTAACAATAAGCAAGAGCGCACGCTATTTAAAGAGCTAATTAAAACGAATGGCGTCGGGCCTAAGCTGGCGCTGGCGATCCTCTCCGGCATGTCGGCGCAACAGTTCGTCAATGCGGTTGAGCGCGAAGAGCTGGGTGCGCTGGTGAAGCTGCCGGGTATCGGCAAGAAAACCGCCGAAAGGCTGATTGTCGAAATGAAAGACCGCTTTAAAGGGCTACATGGCGATCTCTTTACGCCAGCGGTCGATCTGGTATTGACGTCGCCGGCCAGCCCAACCTCGGAAGATGCAGAACAAGAAGCAGTTGCTGCGCTGGTGGCGCTGGGTTATAAACCGCAAGAAGCCAGTCGGATGGTAAGCAAGATCGCCCGTCCGGATGCAAGCAGTGAAACACTGATTCGCGACGCGTTACGCGCCGCGTTATGAGGTAAAGGATGATAGAAGCAGATCGCCTGATTTCAGCAGGCGCCACGATCGCAGAAGACGTCGCCGATCGCGCCATTCGCCCGAAGTTATTGGCGGAATATGTCGGCCAACCGCAGGTGCGCTCCCAGATGGAGATCTTTATTCAGGCGGCAAAACGGCGTGGCGACGCATTAGATCACTTGCTGATTTTCGGCCCGCCAGGGTTAGGTAAGACGACTCTGGCGAATATCGTTGCCAATGAAATGGGCGTCAATCTCCGAACCACGTCCGGCCCGGTGCTGGAAAAAGCAGGCGATCTGGCGGCTATGCTCACCAATCTTGAACCGCATGATGTGCTGTTTATTGATGA
GATTCACCGGCTTTCACCGGTGGTGGAAGAGGTACTCTATCCGGCGATGGAAGATTACCAGTTGGATATTATGATTGGCGAAGGGCCTGCCGCACGTTCAATTAAGATCGATCTGCCGCCGTTTACGCTCATTGGCGCGACGACGCGCGCCGGGTCGTTAACGTCCCCGCTACGTGACCGTTTTGGTATTGTGCAGCGCCTGGAGTTTTACCAGGTGCCCGATCTGCAGCACATCGTCGGGCGCAGCGCAAGGCACATGGGGCTTGAGATGAGCGATGACGGCGCGCTGGAAGTGGCGCGTCGGGCGCGCGGTACGCCGCGTATAGCCAACCGTCTGCTCAGACGTGTGCGTGATTTCGCCGAGGTGAAGCATGACGGGGCTATTTCAGCGGAAATTGCCGCGCAGGCGCTGGATATGCTGAACGTCGATGCGGAAGGGTTTGATTATATGGATCGCAAGCTACTGCTGGCGGTTATTGATAAATTCTTCGGTGGTCCGGTGGGACTGGATAACCTGGCGGCGGCGATTGGCGAAGAGCGTGAAACCATTGAGGATGTGCTGGAGCCGTATCTGATTCAGCAAGGCTTTTTGCAGCGCACGCCGCGTGGGCGTATGGCAACGGTGCGCGCCTGGAACCATTTCGGGATCACGCCGCCAGAGATGCCTTAGCGCCTGAAGTCGCTGCGCTCATCAGACCTGGGCGATTTTGTAGGCCCGGTAAGCGCAGCGCCACCGGGCAACACACGCTTAGCTTGCCTGCTTTTTCATCATACTGAAGATAAACAGCAGCGCGGCACACAGCACTACGGATGGCCCTGCCGGGGTATCATAAAAGGCTGAGAAGGTCAGTCCACCTGTAACCGCTATCATACCTACACCAACGGCGACGCCCGCCATCTGCTCCGGCGTACGGGCAAAGCGACGCGCGGTTGCGGCGGGGATAATCAGCAGCGACGTAATGATCAGCGCTCCGACAAACTTCATCGCCACGCCAATAGTTAACGCCGTTACCAGCATCAATAGCAGCTTCACGCGCTGTAACTTCACGCCATCGACAAAGGCAAGATCCGGACTGATAGTCATTGACAGCAAGTTGCGCCACTGCCAGAAGAGAATGGCCAGCACAATAACGACCCCAATAGCGATCGAGATAAGATCTTCCGGCGTGACGGCCAGCAAATCGCCAAACAGATAAGCCATTAAATCAACGCGGACGTTGGACATCAGGCTGACCACGACCAGTCCTAAAGATAAGGCGCTGTGCGCCATAATGCCCAGTAAGGTATCAATCGCGAGGTGAGGGCGTTTCTCCAGCCATACCAGACCGGCGGCCAGCAGCAGCGTAACGGCGATTACCGCATAAAAGGGGTTAACGTCCAGCAACAGGCCAAACGCCACGCCCAGAAGCGACGCGTGAGCCAGCGTATCGCCAAAATAGGACATCCGACGCCAGACCACAAATGAGCCCAATGGACCAGCGGCGCAGGCCAGCATCATCCCGGCCAGCCAGCCGGGCAGTAATAATTCAATCATGAGTGACCATTTCCCCGGCGCAGTGCAATACGGCCCTGTAAATCGTGGCGATGATTATGATGATGGCGATAAATCCCTAATTGCTCCGCGCCTCGCGGGCCAAACATAGAGATAAATTCCGGATGCATAGACACCACTTCCGGCGCGCCGGAACAGCAAATATGATGGTTCAGGCATAACACTTCATCCGTCTTTGCCATGACCAGATGTAGATCATGCGACACCATCAGCACGGCGCAATCGAGTTCACGGCGCAGCTGATCGATAAGGTCGTATAACGCGACCTGGCCGTTGACATCCACGCCTTGCGTCGGCTCATCAAGTACCAGCAACTGAGGCCTGTTAAGCAGAGCACGCGCCAGCAGTACACGCTGTGTCTCACCGCCAGAGAGTTTTTGCATGGGCGCGTCAATCAAATGTCCGGCCTGAACGCGTTTAAGCGCCGGGAGAATATCCGTTTTTTGTG
TGCCGGGACGTAAACGTAAAAATCGGTTTACCGTCAGCGGAAGCGTGGTATCGAGATAGAGCTTTTGCGGGACATAGCCGATACGGAGTTGCCCGTTGCGCTTGATCACCCCTTCATCAGGGGCTACCAGTCCTAAAACCACGCGTACAAGCGTTGACTTCCCCGCGCCGTTAGGGCCGAGAAGCGTTAAAATTTTTCCGGGGCTCAATTCAAGCGACACGTCAGAGAGGACGCGGCGTTGACCAAATGAGACCGAGACGTTTTCCAGTGAAACTAAACTTGTCATGTTAATTTTAGGCTTGCAGAAGTGATAGAATGTTATAATATCACATTTCACACATTCATTACGATGATTAGTCGCATTATGTTACAGAAAAATACGCTTCTTTTCGCAGCATTATCCGCCGCGCTTTGGGGGAGTGCAACCCAGGCTGCCGACGCCGCCGTTGTCGCTTCGCTTAAACCGCTTGGGTTTATCGCTTCCGCCATTGCTGATGGCGTTACGGATACACAAGTATTACTTCCGGATGGGGCTTCCGAGCATGATTATTCATTGCGTCCATCAGACGTAAAACGCTTACAGGGCGCGGACTTAGTCGTCTGGGTTGGCCCGGAAATGGAAGCCTTTATGGAGAAGTCGGTCAGGAATATTCCTGATAATAAGCAGGTTACCATTGCGCAACTTGCCGATGTAAAACCGTTACTCATGAAGGGCGCGGATGATGATGAGGATGAACATGCGCATACGGGCGCCGATGAGGAAAAAGGTGACGTACATCACCATCACGGCGAATATAACATGCATCTTTGGCTCTCCCCAGAGATAGCGCGGGCTACAGCGGTTGCAATCCATGAAAAATTAGTGGAACTTATGCCGCAAAGTCGAGCCAAACTCGACGCCAACCTGAAGGATTTTGAGGCACAATTAGCCGCAACCGATAAACAGGTCGGTAACGAGCTCGCGCCGCTCAAGGGGAAAGGGTATTTCGTTTTTCATGACGCCTACGGTTACTACGAAAAACACTACGGACTCACCCCGCTCGGTCACTTTACCGTGAACCCTGAGATACAACCTGGCGCGCAGCGTTTACATGAAATAAGAACACAGTTGGTTGAGCAAAAAGCAACCTGCGTTTTTGCTGAGCCACAATTCAGGCCAGCGGTCGTGGAAGCCGTGGCGAGAGGGACATCCGTTCGAATGGGAACACTGGACCCCCTCGGGACGAACATTAAACTGGGTAAAACAAGCTATTCAGCGTTTTTAAGCCAATTAGCCAACCAGTATGCGAGCTGCCTGAAAGGAGATTAACGAGGAAGTGAATACGTGCAACAGATAGCCCGCTCTGTCGCCCTGGCATTTAATAATCTGCCCCGACCCCACCGCGTTATGCTGGGGTCACTTACCGTTCTGACACTGGCCGTCGCCGTATGGCGGCCCTATGTTTACCACCCAGAATCCGCACCAATCGTTAAAACTATTGAACTGGAGAAAAGCGAGATTCGTTCCCTCTTACCGGAGGCCAGCGAACCCATCGATCAGGCCGCGCAGGAAGATGAAGCTATTCCTCAGGATGAGCTGGACGATAAAACCGCAGGCGAAGTCGGCGTCCATGAATACGTCGTCTCCACAGGCGATACGTTAAGCAGCATTCTGAATCAGTACGGCATCGATATGAGCGATATTAGCCGACTTGCCGCTTCTGATAAGGAGCTGCGCAATCTGAAAATTGGCCAACAGCTTTCCTGGACACTGACTGCCGATGGCGATTTACAGCGTCTGACATGGGAAGTCTCCCGCCGTGAAACGCGTACCTACGATCGCACTGCCAACGGTTTTAAAATGAGCAGTGAAATGCAGCAGGGGGACTGGGTTAACAGTCTGCTGAAAGGCACGGTAGGGGGGAGCTTTGTCGCCAGCGCGAAAGAGGCAGGTTTAACCAGCAGCGAAATCAGCGCAGTGATAAAAGCCATGCAGTGGCAGATGGATTTTCGCAAGCTGA
AAAAGGGCGATGAATTTTCGGTTCTGATGTCGCGCGAGATGCTGGATGGCAAGCGTGAACAGAGTCAGTTGTTGGGCGTGCGGATGCGTTCCGATGGTAAAGATTACTACGCCATTCGCGCCGCTGACGGTAAATTCTATGACCGTAACGGTGTTGGCCTGGCGAAAGGCTTTTTACGCTTCCCGACCGCCAAACAGTTCCGCATCTCCTCCAATTTCAATCCGCGCCGTCTGAACCCGGTTACTGGACGCGTTGCGCCGCATCGTGGCGTTGACTTTGCGATGCCGCAGGGTACGCCGGTGCTGTCGGTGGGGGATGGCGAGGTCGTGGTCGCTAAACGTAGCGGCGCTGCCGGTTACTACATTGCGATTCGTCATGGACGCACCTACACCACACGTTATATGCACTTGCGTAAGCTGCTGGTGAAACCGGGGCAAAAAGTGAAACGTGGCGATCGTATTGCGCTTTCTGGTAACACCGGGCGCTCCACAGGGCCGCATCTGCATTATGAGGTATGGATCAACCAGCAAGCCGTTAACCCTCTAACAGCAAAATTGCCGCGCACGGAAGGTCTGACGGGGTCAGATCGTCGTGAATACCTGGCACAGGTGAAAGAGGTTCTGCCACAACTGCGCTTCGATTAACAAATGCGCTGACAGAGCCGGTACGCGATGTGTGCCGGCTTTTTTGTTTTGTGTGCGGCGCAGACGTCGCTACACTATTCACAATTCCTTTTCGCGTCAGCAGACCCTGGAAAAGCATGGAAACCAAAAAAAATAATAGTGAGTATATCCCTGAATTCGAAAAATCCTTTCGCTATCCGCAGTATTGGGGCGCCTGGTTGGGCGCGGCGGCAATGGCGGGGATTGCATTAACACCGGCATCATTCCGTGACCCTTTGCTGGCGACGCTGGGGCGCTTTGCCGGACGGCTGGGGAAGAGTTCTCGTCGCCGGGCGCTAATTAATCTGTCTTTGTGCTTTCCGCAGCGTAGCGAAGCTGAGCGCGAAGCGATTGTCGATGAGATGTTCGCCACCGCGCCACAGGCAATGGCGATGATGGCTGAGTTGGCGATGCGCGGTCCGAAAAAAATTCAACAGCGTGTTGACTGGGAAGGTCTGGAAATCATTGAGGAGATGCGTCGTAACGACGAAAAAGTCATTTTTCTCGTACCGCATGGCTGGGGCGTCGACATTCCAGCTATGCTGATGGCCTCTCAGGGGCAAAAAATGGCGGCGATGTTTCATAATCAGGGTAATCCGGTTTTTGACTATATCTGGAACACCGTGCGTCGGCGTTTTGGCGGACGTTTGCATGCGCGCAATGACGGGATTAAACCCTTTATTCAGTCTGTTCGTCAGGGCTACTGGGGTTACTACCTGCCGGACCAGGATCATGGCCCGGAGCATAGTGAATTCGTTGATTTCTTTGCGACATACAAAGCGACGCTGCCCGCGATTGGTCGGCTGATGAAAGTGTGCCGCGCACGCGTGATACCGCTTTTCCCGGTGTATAATGGTAAAACGCATCGCCTGACTATCCAGATTCGCCCGCCAATGGACGATCTGCTCACGGCTGACGACCACACTATCGCCAGACGGATGAACGAAGAGGTCGAAATTTTTGTCGGCCCGCATCCGGAACAGTACACCTGGATCCTGAAGCTGCTCAAAACCCGCAAGCCAGGCGAGATTCAGCCGTATAAGCGTAAAGATCTTTATCCCATCAAATAAATAAAGCCTCTCGTAAGAGAGGCTTTATGCTGACAAACCCTGTACTACCTGATGAACAGGCGTGGGGGAGTTTTACTCCACGGTCAAGATACGCGTGGTATTGGTTGAACCGACGGTGCTCATGACATCGCCCTGGGTCACGATAACCAGGTCGCCGGAAACCAGATACCCTTTATCGCGCAGCAGATTAACAGCTTCATGTGCCGCGACAACGCCATCAGCCGCGCTGTCAAAATGCACCGGCGTTACTCCGCGGTAGAGCGCGGTC
AGGTTCAGCGTGCGTTCATGGCGCGACATGGCGAAAATCGGCAGGCCGGAGCTGATACGGGAAGTCATTAGCGCGGTACGACCGGATTCCGTCATGGTGATGATCGCGGTAACGCCTTTCAGATGGTTTGCCGCATACATCGCAGACATGGCAATAGCTTCTTCAACGTTGTCGAACTGCACGTCGAGACGGTGTTTAGACACATTGATGCTGGGGATTTTTTCTGCGCCCAGGCAGACGCGCGCCATTGCGGCAACGGTTTCAGAAGGATACTGACCGGCTGCGGTTTCGGCAGACAGCATAACTGCATCCGTGCCATCCAGGACGGCGTTCGCCACGTCCATCACTTCCGCGCGGGTCGGCATCGGGTTGGTGATCATCGACTCCATCATCTGCGTTGCGGTGATGACTGCACGGTTTAGCTGACGCGCACGGCGAATCAGCGCCTTCTGGATACCAACCAGCTCCGGATCGCCGATTTCAACGCCCAGGTCGCCACGTGCGACCATCACAACGTCAGAGGCCAGAATGATATCGTCCATGGCGTTTTGGTCGCATACCGCTTCGGCGCGTTCGACTTTAGCCACAATTTTCGCGTCGCAGCCGGCGTCGCGTGCCAGGCGGCGTGCGTAGTTCAGATCTTCGCCGCAGCGCGGGAAGGAGACGGCCAGATAGTCAACGCCTATCAGCGCAGCGGTTTGAATATCGGCTTTGTCTTTTTCGGTCAGCGCTTCAGCAGAAAGCCCGCCGCCGAGCTTGTTAATGCCTTTATTGTTAGACAGCGGGCCGCCGACGGTGACCTCGGTAAAGACTTTCATGCCCTGGACTTCAAGCACTTTCAACTGCACTCGACCATCGTCAAGCAGCAGGATATCGCCAGGAACGACGTCTGCCGGCAACCCTTTGTAATCAATGCCGACTTTTTCTTTGTCGCCTTCGCCTTTACCCAGGTTAGCGTCTAACAGAAATTTGTCCCCGATGTTGAGGAACACTTTGCCTTCTTTAAAAGTGGAAACGCGAATTTTTGGCCCTTGCAGGTCGCCTAAAATAGCCACATGACGCCCCAGTTTGGCGGCAATCTCACGGACTTTATCAGCACGCATTTTATGATCTTCCGGCGAGCCGTGAGAGAAGTTCATACGTACTACGTTTGCGCCCGCGGCGATAACCTTCTCAAGGTTGTTATCGCGGTCAGTTGCCGGGCCTAACGTGGTAACGATTTTGGTTCTGCGAAGCCTTCTGGACATGTAATACTCCGTTGACTAAAACAACTTGGTGTTGCGTGAACATTGATCCGGTCGTTCTGAAACCTTAACGGTATAAAATGAACGTCCCCGGATGACGAAAAGGGTAGAACATTGTTATTGCTTTTAGCGGTCATCACTCTTGATGAGTAATTCTTTATCAAAACGCGATTCCTTGAGCGCTTCCTTGACACGCTTCAAGTTATCTCTGAATTTTGCCCCGCGACGCAAAGTAAATCCCGTTGCCAGCACATCTATCACGGTCAGCTGAGCAAGTCGAGAGACCATGGGCATATAAATGTCTGTATCTTCCGGTACGTCGAGGGTAATGGCCAGCGTGGCTTCGCGCGCCAGCGGCGTTCCAGCGGAGGTCAGGGCGATCACCATCGCATCGTTTTCCCGCGCCAACTGCGCCAGTTCCACCAGGCTCTTGGTTCTGCCAGTATGCGAAATGAGCACGACGACGTCATCATCGCTACAATTCATACAGCTCATCCGTTGCAGCACGATATCGTCGGAGTAAATCACCGGCACATTGAAGCGAAAAAACTTATTCATGGCGTCATGCGCGACCGCGGCGGAGGAACCAAGACCAAAAAAGGCGATTTTTTTTGCCTGGGTCAGTAAATCGACCGCGCGGTTGACGGCCGATTTATCCAGCGACTGGCGAACATGGTCGAGACTGGCCATAGCGGACTCGAAGATTTTTCCTGTATAGGCCTCGACGCTGTCATCTTCATCCACATTGCGATTAAC
ATAGGGGGTGCCATTTGCCAGACTTTGCGCCAGATGCAGTTTAAAATCAGGAAAGCCGCGGGTATTCATGCTGCGACAGAAACGGTTCACCGTTGGCTCGCTAACGTTAGCTTCCTGGGCCAGCATGGCAATGCTTAAGTGAATCGATCTGCCTGGGGCGGCCAGAATAACATCGGCGACTTTTCGTTCAGATTTGCTTAAATGTTCCAGTTGAGACTGGACTTTTTCCAGCATATTCATGATTAGCAAGGCTCATGGGTATTAGCGATTTCAATGATGCGCGAAACCGAGCGCGTTTTGTTAGATATTACTCCTGGCGGCTGTCAAAGGGGGCAAAATGGCTAAAAAAGGTGTCGTTTTTTTTCATTACATGACCGACGTCGGATTTTAAGTTCCAGCTTGTGTGGAAAAGCGACAATTTTATTAGGCGTTTTGCCGATTATATTGCCAATCAAACGCCGGTTCACCCATGAAAAGGCGTTTACGGTTTCCGTAATCTCGTAAAAGCAGTACAGTGCTGTGTAATAAAATTACAACGATATCCTGGCTAAAGTACCAGGAGATTAACTGAGGAGAATGACATGGCGGTAACGCAAACAGCCCAGGCATGTGACCTGGTCATTTTCGGCGCGAAAGGCGACCTGGCGCGCCGGAAACTGCTGCCTTCCCTGTATCAGCTTGAGAAAGCCGGCCAGATCCATCCGGATACCCGTATCATTGGGGTGGGGCGCGCGGACTGGGATAAAGAAGCTTATACCCACGTCGTGCGTGAGGCGCTGGAAACCTTCATGAAGGAAAAAATTGATGAAGGCTTGTGGGATACGCTGAGCGGCCGTCTGGATTTTTGTAATCTTGACGTTAACGATACGCCTGCGTTTAGTCGCCTGGGCGATATGCTGGATCAAAAAAACCGCACCACCATTAACTATTTTGCGATGCCGCCCAGCACCTTTGGCGCTATTTGCAAAGGGTTAGGAGAGGCTAAACTCAACGCCAAACCGGCGCGCGTCGTGATGGAAAAACCGTTGGGTACTTCGCTGGCGACCTCGCGTGAGATTAACGATCGGGTCGGCGAATACTTTGAAGAGTGCCAGGTGTATCGTATCGACCACTATCTGGGCAAAGAAACGGTTCTCAACCTGCTGGCGCTGCGTTTTGCCAACTCGTTATTCGTTAATAACTGGGATAACCGCACTATCGATCACGTTGAGATTACCGTAGCGGAAGAGGTGGGGATTGAAGGGCGCTGGGGATATTTTGACCAGGCCGGTCAGATGCGCGATATGATCCAGAACCACTTGCTGCAAATTCTCTGCATGATTGCCATGTCACCGCCGTCTGACCTGAGCGCCGACAGTATTCGCGATGAAAAAGTCAAAGTGTTGAAATCGTTGCGCCGTATTGATCGCTCCAACGTGCGTGAAAAAACGGTTCGTGGTCAATACACCGCGGGCTTTGCGCAGGGTCAAAAGGTGCCGGGCTATCTGGAAGAAGAGGGCGCGAATAAAAGCAGCAACACCGAAACTTTCGTCGCGATCCGCGTCGACATCGATAACTGGCGTTGGGCGGGCGTGCCATTCTATCTGCGCACCGGCAAGCGTCTGCCAACCAAGTGCTCTGAAGTCGTGGTTTATTTCAAGACGCCTGAACTGAATCTCTTTAAAGAGTCCTGGCAAGACCTGCCGCAGAACAAATTGACGATTCGCCTGCAGCCGGATGAAGGCGTAGATATTCAGGTGCTTAACAAAGTACCGGGGCTGGATCATAAGCATAATCTGCAGATCACGAAGCTTGATCTGAGCTACTCCGAGACCTTTAATCAGACGCACCTGGCGGATGCATATGAGCGCCTGCTGCTGGAAACGATGCGTGGCATTCAGGCGCTGTTTGTCCGCCGTGACGAAGTGGAAGAAGCCTGGAAGTGGGTGGACTCCATTACCGAAGCATGGGCGATGGACAACGACGCGCCGAAGCCGTATCAGGCGGGCACCTGGGGACCG
GTAGCGTCCGTGGCGATGATTACCCGTGACGGTCGTTCGTGGAATGAGTTTGAGTAAATTTGCCACTCACTCTTAGGTGGTATTTTACCGGTAACATGATCTAACACAGATTGTAGAATCATTTTTGCACTTTTAAGCCTCGTGTGGATTCACCTGCGAGGCTTTTTTTATTACACTGCCTGAAACGATTTTGCCCCATTATCCCCGGACATCATGTATTTCACTGTCTTTCCACATTGATGAAACGCATGGTAAACCCGGTAGCCGGACAGATAAATTTCAGGAGCCTCTATGAATCCTAATTTGTTACGCGTAACACAGCGTATTGTCGAACGCTCGCAGCAGACCCGAGAAGCCTATCTTGCCCGCATTGAGCAGGCGAAAACCGCCACGGTCCACCGATCTCAACTGGCCTGCGGCAACCTGGCGCATGGCTTCGCCGCCTGTCAGCCAGAGGACAAAGCCTCGCTGAAAAGTATGTTGCGCAATAATATCGCCATTATTACCTCCTACAATGACATGCTCTCTGCGCATCAACCGTATGAACATTATCCGCAGATTATTCGTCAGGCCCTGCATTCCGTGAATGCGGTAGGTCAGGTCGCAGGCGGCGTACCGGCAATGTGCGATGGCGTTACGCAAGGGCAGGATGGCATGGAGTTGTCATTACTCAGCCGCGAAGTGATAGCGATGTCGGCAGCAGTAGGCCTCTCTCACAATATGTTTGACGGCGCGTTATTCCTCGGCGTATGCGACAAAATCGTTCCGGGGCTGGCGATGGCCGCGCTCTCTTTTGGTCATTTACCCGCGATTTTTGTTCCGTCAGGCCCGATGGCGAGCGGCCTGCCGAATAAAGAAAAAGTCCGTATTCGTCAGCTATATGCGGAAGGAAAAGTAGACAGAATGGCGCTGCTGGAGTCGGAAGCCGCCTCTTACCATGCGCCGGGCACCTGTACATTTTACGGCACCGCCAACACCAACCAGATGGTGGTGGAGTTTATGGGAATGCAGTTGCCGGGTTCTTCGTTTGTGCATCCGGATGCGCCGCTGCGCGAGGCATTGACTGCCGCTGCCGCACGTCAGGTAACACGTCTTACCGGCAACGGCAATACGTGGATGCCGCTCGGTAAAATGATCGACGAAAAAGTCGTGGTGAACGGCATTGTCGCGCTGCTGGCTACCGGCGGCTCCACCAACCACACCATGCATCTGGTTGCAATGGCGCGCGCGGCGGGCATTCTGATCAACTGGGATGACTTCTCGGATTTGTCGGAAGTGGTTCCGTTGATGGCGCGTCTGTACCCGAACGGTCCGGCGGACATTAACCACTTCCAGGCGGCGGGCGGCGTACCGGTATTGATGCGTGAGCTGCTCAATGCCGGATTGCTGCACGAAGACGTTAACACTGTCGCAGGCTTCGGCCTGAAACGCTATACGCTGGAGCCCTGGCTCAACAACGGCGAGCTGGACTGGCGTGAAGGCGCGGAAAGGTCACTGGATAACGATGTCATTGCCTCTTTTGATAAGCCGTTCTCTCCTCACGGCGGCACTAAGGTGCTAAGCGGTAATCTGGGGCGCGCAGTAATGAAGACGTCTGCGGTACCGGTTGAAAACCAGATCATTGAAGCGCCTGCCATGGTATTTGAAAGTCAGCATGATGTGCTGCCTGCGTTTGACGCGGGTCTGCTTGATCGGGATTGTGTCGTTGTCGTGCGTCATCAGGGACCAAAAGCGAATGGAATGCCAGAATTACATAAACTCATGCCGCCACTTGGTGTATTATTGGACCGCCGTTTCAAAATCGCGTTAGTTACTGATGGACGACTTTCAGGCGCTTCGGGTAAAGTGCCTTCAGCTATCCACGTAACGCCGGAAGCCTACGATGGCGGCTTACTGGCAAAAGTGCGCGATGGCGACATCATTCGCGTGAATGGGCAGACAGGTGAGTTAACTCTGCTGGTCGACGAGGCGGAACTTGCCGCTCGTCAGCCTCATATTCC
GGACCTGAGCGCGTCGCGCGTCGGAACGGGGCGTGAGTTGTTTGGCGCGCTGCGCGAAAAGCTGTCGGGTGCGGAGCAGGGCGCAACCTGTATCACTTTTTAAGATGACACACTAGTAATCAGGCGAGAGAAGAATTCCGATGAAAAACTGGAAAACAAGTGCAGAAGCAATCCTGACCACCGGCCCGGTTGTCCCGGTCATTGTAGTCAATAAACTGGAGCACGCGGTGCCGATGGCTAAAGCGCTGGTGGCCGGGGGCGTTCGCGTTCTGGAAGTGACTTTACGTACGGCCTGCGCGATGGATGCTATTCGCGCTATCGCTAAAGACGTGCCGGAAGCGATTGTCGGCGCCGGAACCGTTCTCAATCCGCAGCAGTTGGCGGAGGTGACGGAAGCGGGCGCGCAGTTTGCGATTAGCCCGGGACTGACTGAGCCACTGCTGAAAGCCGCGACGGCAGGCACTATCCCATTGATTCCCGGTATTAGCACCGTTTCTGAACTGATGTTGGGCATGGACTATGGTCTGAAAGAGTTCAAATTCTTCCCGGCGGAAGCGAATGGCGGCACTAAAGCGTTGCAGGCGATTGCCGGTCCGTTCTCTCAGGTACGTTTCTGCCCAACCGGCGGCATCTCTCCGGCAAACTATCGTGACTATCTGGCGCTGAAAAGCGTGTTGTGCATCGGCGGTTCCTGGCTGGTGCCGGCCGACGCGCTGGAAGCGGGTGATTACGATCGCATCACCAAACTGGCGCGCGAAGCGGTAGAAGGCGCGAAACAGTAAGCCGTTAAATGCCCGATGGCGCTTGCTTATCGGGCTTACGAGTGGCGATCAGGCAGGTCTGATAAAAATGCGCTAACGTCGCCATCAGGCGATGGCGCATTAGCCTTTTACCGTCACGCGGCTGGCGGCCTTTTTCGCTCTTATCACCGCTTCTTCAACGTTTTCACCTGTCGCTAACGCTACACCAAGACGACGGCTGCCGTCGATCTCAGGCTTACCAAACAGCCGTACCTGCACTCCGGCCCCTACCGCCGTGTGTACATTATCAAACGTCACATTTTGACTGGTAAGCTGCGGCAGAATCACGGCCGAGGCAGCGGGACCATACTGGCGAATAGCGCCTATGGGCATTCCCAGAAAGGCGCGCACATGCAGCGCAAACTCAGAAAGATCCTGAGAAATCAACGTCACCATTCCGGTATCGTGCGGGCGAGGGGAGACTTCGCTGAAAATGACTTCATCGCCACAGACGAAGAGTTCAACGCCGAACAGGCCATGCCCGCCTAACGCCAGTACCACATGACGCGCAATCTCTTGCGCCCGCTTCAGCGCCAGTTCGCTCATCTGCTGTGGCTGCCAGGATTCGCGATAGTCGCCATCTTGCTGACGATGACCGACTGGCGCGCAGAAATGCACGCCATCGACGGCGCTAACGGTGAGCAGCGTAATTTCAAAATCAAATTTAACCACGCCTTCCACAATCACGCGACCCGCGCCAGCGCGTCCGCCCTGTTGAGCATACTCCCATGCCTGCGCGAGCTGTTCGGCCGAGCGGATAAAGCTCTGGCCTTTGCCGGAAGAGCTCATGACCGGTTTGACGATGCAAGGAAAACCCACTGCGGCTACCGCATCATGAAAACTGGCCTCACTGTCGGCAAAGCGATACGTCGATGTCGGCAGACCTAATTCTTCTGCGGCCAGGCGACGGATCCCTTCGCGGTTCATCGTGAGCTGCGTTGCACGGGCGCAAGGCACGACATTCAGCCCTTCGTCCTCCAGCTCACGCAGCGTATCGGTGGCGATCGCTTCTATTTCCGGCACGATATAATGCGGTTTTTCCTCTGTAATCACATGACGTAGCGCCTCGCCGTCCAGCATATTAATGACGTGTGAACGGTGAGCCACATGCATGGCGGGAGCATCAGGATAGCGATCGACGGCGATAACCTCGATCCCCAGGCGTTGGCATTCAATCGCCACTTCTTTTCCCAATTCACC
TGCCCCTAATAACATCACCCGCGTTGCTGCCGGACGCAGCGCAGTGCCTAATAGCGTCATATTTCTGTCCCCTTATTTACCTGCGCGCAGTATATACGAAAACGTTTGCGTATGTCTTTTCGAATAGATTGCTTTTCTTGCTGTCTTGCAATATACTGTATATAAACACAGGTAAATAAGGGGCTGCAAAATGGCGGTTGAAGTTAAATACGTAGTCATTCGCGAGGGTGAGGAGAAGATGTCATTTACCAGTAAAAAGGAAGCCGACGCCTGGGACAAAATGCTCGATACTGCCGATCTTCTTGATACCTGGCTTGAGCAGTCGCCAGTCGTGCTGGAAGATGGGCAGCGCGAAGCGTTGTCGCTGTGGCTGGCGGAACATAAAGAGGTGTTAAGCACTATCCTCAAAACCGGTAAGTTGCCTTCTCCGCAGGCGGTTGAAAAGGACGCCGCGAGTAAAACGAAAAAGCAAGCCGCCTGAACCCGTACTTGCGCTCCCGGCGTTTTGTACCATGCTTTTCCTGAATCATTGTGCTAAGGAGAAAAGTATGAATAAAAGAGGAGCGTTGTTGAGCCTGCTGCTGTTATCCGCCAGCGTATCGGCATTTGCGGCATCCACCGAGAGTAAATCGGTTAAGTTTCCGCAGTGTGAAGGGCTGGATGCCGCCGGGATTGCGGCAAGCGTGAAGCGCGACTACCAGCAAAATCGTATCGTGCGGTGGGCCGACGACCAAAAAAAGGTCGGGCAGGCTGACCCGGTCGCGTGGGTTAATGTGCAGGACGTTGTAGGTCAAAATGATAAATGGACAGTTCCGCTAACTGTTCGCGGTAAAAGTGCCGATATTCATTATCAGGTCATCGTTGATTGCAAAGCCGGCAAGGCGGAATATAAGCCCCGCTAGCAGCGTAATTTGCGCTTCTTTTGCCGGAACAGACAATTCCTGACATCCCCTCAGGTACGCTTGACCATTATTGGTCTGTATTTGAGGGGTACTATGGCTAACTGGTTGAACCAATTACAATCACTTCTTGGGCAAAAAGGCGCTTCCGCATCGTCTTCCGGTGAACAGGGGTTAAATAAACTGCTGGTTCCCGGCGCGCTGGGCGGTCTGGCTGGACTGTTGGTCGCCAATAAGTCTTCGCGCAAATTATTAACTAAATACGGTACTGGCGCTTTGCTGGTGGGCGGCGGCGCGGTGGCGGGTTCCGTATTGTGGAATAAGTACAAAGATAAAGTACGCGCTGCGCATCAGGGGGAGCCGCAATTCGGCAGCCAAAGTACGCCGCTGGATGTTCGCACTGAGCGACTGATCCTGGCGCTGGTTTTTGCTGCAAAAAGCGATGGGCATATTGATGCCAAAGAACGCGCGGCGATTGAGCACCAACTGCGAGAATCAGGTGTGGAAGAGCAAGGGCGTGTGTTTATCGAGAAAGCTATTGAGCAGCCGCTTGATCCACAACGTCTGGCACAGGGCGTTCGTAATGAAGAAGAAGCCCTGGAAATCTACTTCCTGAGCTGCGCCGCCATTGATATTGACCACTTTATGGAACGTAGCTATCTGAATGCGTTAGGAGACGCACTAAAAATTCCTCAGGAGGTTCGGGACGGTATCGAGCAGGACTTGCAGCAGCAAAAACAGGCGCTACCAGGCTGATAACCTGCCATTAAGTAACATGTTTCGCTTGCATCATGCGTGTCTTTTGCCACCCTTATAGGATGGATAGTTTACGTCATTCATGTTGCAAGATAGCGGCGAAGCAGTGACATCTCAGGCGCTTACCGAGGTGAGCGACCGGGACTACCATGCTACGCCGCTGTGTTTGCCGCGTGATATATAACAAATATTAACGCAAAAGAAAAAACGACATGTTGCCAAAAGCCAATCGAATTCCCTATGCCATGACCGTACATGGCGATACGCGCATTGATAATTATTACTGGCTGCGAGATGACACTCGCTCGCAGCCGGAAGTCCTTGATTACCTGCATCAGGAAAATG
AGTATGGCCGGAAGGTCATGTCCTCTCAGCAGGCGTTACAGGACCGCATTCTAAAAGAAATTATCGATCGCATCCCGCCCAGAGAAGTTTCCGCTCCGTATGTGAAAAATGGCTATCGCTACCGTTATATCTATGAACCCGGCTGCGAATATGCCATCTATCAACGACAATCGGCGTTAAGCGAAGAGTGGGATGTGTGGGAAACCTTGCTCGATGCGAACCAGCGGGCCGCGCACAGCGAATTTTATACGCTCGGCGGGCTTGCCATTACGCCGGATAATACCATCATGGCGCTGGCAGAAGATTATTTATCCCGTCGTCAGTATGGGTTGCGTTTTCGTAACCTCGAAAGCGGTAACTGGTATCCGGAACTGCTGGATAACGTTGCGCCTGAGTTTGTCTGGGCCAATGATTCCCTGACCCTTTACTATGTGCGTAAGCATAAGAAGACGCTGCTGCCCTATCAGGTTTGGCGGCACACGATTGGCACTCCGTCATCGCAAGATGAACTGGTATATGAAGAAAAAGACGATACCTTTTATGTCAGCCTGCATAAAACCACTTCGCAGCATTATGTGGTGATTCATCTTGCCAGCGCCACCACTAGCGAAGTGCTATTACTTGACGCGGAACTGGCCGATGCCGAGCCGTTTTCATTCTTACCGCGTCGCAAAGACCATGAATATAGTCTCGATCACTATCAACATAAGTTTTACCTGCGTTCTAACCGGAACGGTAAAAACTTTGGGTTGTACCGTACCCGCGTGCGCAATGAAAACGCCTGGGAAGAGCTGATCCCTCCGCGCGAGCATATTATGCTGGAAGGGTTTACCCTGTTTACCGACTGGTTAGTGGTCGAAGAGCGTCAACGGGGGCTTACCAGCCTGCGGCAAATTAACCGTAAAACCCGTGAAGTGATAGGCATCGCCTTTGACGATCCGGCTTACGTGACGTGGCTTGCCTATAATCCCGAACCTGAGACCTCCCGGCTGCGTTACGGCTATTCTTCAATGACGACGCCAGATACCTTGTTTGAACTGGATATGGATACCGGAGAACGACGGGTACTTAAACAGACGGAAGTGCCTGGGTTTGATTCTGGCTGTTATCAGAGCGAACACCTGTGGATCACCGCGCGCGACGGCGTCGAAGTGCCGGTATCGCTGGTTTATCATCAGAAGTATTTTCGTAAAGGGCAAAATCCGCTTCTGGTTTACGGCTACGGATCTTACGGTTCCAGTATTGACGCCGACTTCAGCAGCAGCCGACTGAGCTTGCTGGATCGTGGCTTTGTTTACGCAATCGTACACGTTCGCGGCGGCGGTGAGCTGGGGCAGCAGTGGTATGAAGATGGCAAATTCCTCAAAAAGCGGAATACTTTTAATGACTATCTTGATGCCTGCGATGCCTTATTAAAACTGGGTTACGGTTCGCCGTCGCTGTGTTACGGGATGGGCGGGAGCGCGGGCGGAATGCTAATGGGCGTCGCTATCAACGAACGCCCCGAGCTTTTCCACGGCGTTATTGCCCAGGTACCCTTTGTTGATGTATTAACCACGATGCTGGATGAGTCGATCCCACTAACGACAGGAGAGTTTGAAGAGTGGGGGAACCCGCAGGATATTGAGTATTATGACTATATGAAAAGCTATAGTCCTTATGACAATGTCAAAGCGCAGGACTATCCGCACCTGCTGGTGACGACAGGATTGCACGATTCCCAGGTGCAATACTGGGAACCGGCGAAGTGGGTGGCAAAATTACGCGAGCTAAAAACGGATCAACGTCTGCTGCTGCTATGTACGGATATGGACTCCGGGCACGGTGGTAAGTCGGGGCGGTTTAAATCCTACGAAGGCGTCGCGCTGGAGTTCGCCTTTTTAATCGGCCTGGCGCAGGGAACCTTACATAGCGCATAGACGCGCGCGCCCGCAACGGCTTGTGAGCGTCGCGGGTTATTGTGGTGTTCCATTGCTGCGTTGCTCGCCAGCCTGAACATC
TTCCAGGTAGTGTTTAAGCGTCAGGCGAAGTTCCGGGCTCATATTATCAAGGTTATTAAAAAGCCAGCGTAAATAGCCCGGATCGCGTTTTGCGACTTCAGATACCGCCTTACCGCGGTATTTACCGAACGGGAAAGTGGTTAACAGCGCAGGACGGCCCGTGATATTGACCATTTCTTCCGCGGTCCAGCCGGTGGTTCGCATAATATCAATCAGCAAGGCGGCGGTGATATAGCAATCATAAAGCGCCCGGTGATGATGCAGCCCCGGTGGCGTTTGTACGCTTAGTTTGCGCGATTTATAGAGCGACATATTGCTGTATTTTATTCCCGGCCACAGGCGTCGCGACAACTTCATGGTACAGATCCACTCACCCGGCAATTCAGGTAATACGCGTCTGTCGAAACTGGCATTGTGAGCGACATACCACTCACTACCGTAATAAAGCGGTATGACATCTTCAATCCACGGCTTATCGGCGACCATGGCTTCGGTAATACGGTGTATCGCCATCGCCTGCGGCGTAATAGGGCGATCGGGGCGTATCAGGTGACTCATGGGATTGACAATGTTGCCATCAATGACATCAACAGAGGCTATCTCTACGATCCCGCCCTGCAGCCCGCAGGTTTCCGTGTCTATAATCCGCAACATGATTCATTCCTCACCGAAAGACCTTAGCGTAATGGAATGATATCGCCTTGCCAACCCTCCGTTGTGCGCAGGCCCGTCAGTAATAGCGAACCGCAATCGGCACGAACAATAAGTTGCCCGCTTTCATCCCACAGCGCGCTACTTCCACAGGCATTGGCCATCAGTACGGCCAGCGCATATTTATGCGAAAAACGTTGTAAGCGTGAGGTAGAAGCATGGAGTTCAGGTTCGTTAAGACATTGACTGGTTGTGAATAACGTAAATGAGGGATCTATATCACCACCTTCTGGCTGCTCATCCACCACGTTGATAGCGCTTCGCTGCCTGGCAATACAGGCGCCATGACTTTTGTGAAACATGAGCGGCGACGTTAGCCATGGGGCAAAAATAGCGATGCCTTTTACGAAGCGGCAGTTATGCTCAACAGGCATACCCACAATGATAGTCATATGGTGGGTATCGGCCGCATGAGTGAGCGGCTGTAAAAGCGCCTCGTCAGGCGGGGCGGGCAACGATTTATTCCTTTCGTCGCAGCCTAATAGCGACAGTGACGGAAAGACCAGCAGTTCGCACTGTTGACGAGCCGCAAGCTCAATATATTCCAGATGATGGGCGACATGCTCCGCCGGCGAGGCGTTCAGGGGCGCATACTGCGCAGCAGCAATTTTCCAGGAGGACATAATGACTTCCTTTTCATAGTCTCTACAGCCATTCTAAAAAGCATAGGTTAAGAATCACCTTATAATGTAGTCGTCTGCATGTAAGGAAGTGTAACGTTATATAATTTTTATTTAACTTTGGGTTCGTAAGGGAGTCGGGATAGTGATACGGAAGCCAGACGATGGGCAATCAGCCGCTCGCGAAACCAGGCGCGTAGGTGCTCTGGCTGCTCACGTTCGACCGCCTCCGCGACAACTGGCATATTGTAGCGCTCTTTAAATGCCACCCCGGCTGCCGCCAGGTCCACATTGACTTTATCCATTTCCGCTTGTTCAAGCTGAGCCAGATTCGTTTTCATACGCGTCTCCTTTTTTGTTGCCGCCAGAAGGTACTAAGAGTCCGCGGTAATGGCAAGATGGAAATGATGATCCTCGACTCATTGATTGTTGAAGGTTATTCTTTGTAACGTAGGTATAACAAAAAGGAATAGTGAATATGGCTTCTTCTGCACCATCGCGACGTTTAGCTTTACTGCTGCTGGCATCGACATTTGCGACGCCAGCGGCCTGGGCACATGCGCACCTGACGCATCAGTATCCAGCGGCGAATGCTGCCGTTACGGCCTCGCCACAGGCGCTGACCCTGAACTTTTCTGAAGGGATTGAGCCAGGGTTCAGCGGCGCAAC
CATTACTGGCCCTCAGCAAGAGCTCATCAAAACGCGCCCGGCAAAGCGAAATGAACAGGATAAAACGCAGTTGATTATCCCGCTTGAGCAGCCGTTAAAATCTGGCGCTTACACGGTAGACTGGCACGTTGTGTCGGTGGATGGACATAAAACAAAAGGGAAATACACCTTCAGCGTGAAATAAATGATGCTGACATTCGTCTGGATAACTCTCCGATTTATTCATTTTGCTAGTTTGATGCTGGTCTACGGCTGCGCGCTTTACGGCGCCTGGCTGGCACCCGCATCAATTCGTCGTTTAATGACGCGTCGATTTTTACATCTGCAACGACATGCCGCCGCCTGGAGCGTTATCAGCGCGGCTTTTATGCTGGCGATTCAGGGCGGACTGATGGGCGGCGGCTGGCCCGATGTTTTTTCCGTCTCGGTGTGGGGCGCGGTACTGCAAACCCGCTTTGGCGCGGTCTGGATATGGCAAATTATCCTCGCGCTGGTCACGCTGGCGGTGGTAGTCATTGCGCCGGTAAAAATGCAACGACGGCTTCTTATTCTCACCGTTGCTCAGTTTATCCTGCTGGCAGGCGTTGGACATGCGACGATGCGCGACGGTGTAGTGGGAACATTACAGCAGATTAACCATGCTCTGCATTTACTCTGTGCCGCAGCCTGGTTTGGTGGGCTGTTGCCAGTGGTTTATTGTATGCGCATGGCTCAGGGACGCTGGCGTCAACATGCTATTAGCGCCATGATGCGTTTTTCTCGTTATGGTCACTTTTTTGTGGCGGGCGTATTGCTCACAGGCATTGGCAACACGCTATTTATCACGGGATTTACCGCTATCTGGCAGACCACCTATGGACAGTTGCTTTTGTTAAAATGTGCGCTGGTCGTGCTTATGGTAGCAATTGCGCTGACGAATCGGTATGTTCTCGTACCACGTATGCGACAGGAAAATCCCCGGACTGACCTATGGTTTGTCAGGATGACGCAAATTGAATGGGGAGTTGGAGGCATAGTTCTGGCGATCGTCAGCCTGTTTGCAACCCTCGAACCTTTTTGATGGACTGGCATAACGAATGAAAAAAATACTCCTTCCGGCGCTTCTGCTGGCCACTTCGGGCGTAGCGTTGGCGGCGCCGCAGGTGATTACCGTAAGTCGTTTTGAAGTAGGAAAAGACAAGTGGGCGTTTAATCGGGAAGAGGTCATGTTGACCTGTCGGCCTGGCCAGGCGCTCTATGTGATCAACCCCAGTACGCTGGTGCAGTATCCCTTGAATGCCATTGCCGAACAGCAAGTAGCGGAGGGTAAAACGCGTGCTCAGCCTATTGCCGTCATTCGAATCGATAACCCGGCGAAGCCCGGTGAGAAAATGAGTCTGGCGCCGTTTATCGAACGTGCGCAAAAGCTTTGTGATCCATCCAATAGCTGACTGATTTTTAATAAAAAACCGTAAACCTTCACGAAAAGGCTTACGGTTTTTTTATCTCTGATAACAGACAAAACGCCAGGTTTTTTCAATCACCTTCGTCGCAAACTGGAAAACCTGGCGTCGTCATCTATTCTTAAAGGGCAAGGCGATTTAGCCTGCATTAATGCCAACTTTTAGCGCACGGCTCTCTCCCAAGAGCCATTTCCCTGGACCGAATACAGGAATCGTATTCGGTCTCTTTTTATTTTTCTTATAAAACAGTTACTTATGATCAATGATCCGAAATTTTCCGAAATTTTTCCGAATTTCTGTATTCCGGTCTTTTTGGTTATATCACAATCAAATTAAATTTAACATTTATTTCACAACGAAAATTGGAGTATTAGAGCATCATATAAGCTTTATCATCACGCTCATCGAGATAGAGTTTCGTGGTGTTCGCTGATGTGTGGCCCAGGAGTTTTTGGGCGAACACCTCGCCGTGCTCGTTTTTGTACAGCCGCCCGGCCAGACTTCGGATCTCGTGAAATGTCGGTGGATTATTGCTGAAGTTAACACCGGAGGCTTTTC
TTGCTTTTACAAATGTCTTTGTCAATCCATCCGGATGAATATTCCCGGTTGGGCTATTTTTCCTGATTCCGGCACTGATCATGAAATCAGTGCGGCTTACCAGTCGGCAGCGATCGATTACCGTTCCCAGACGTAACCCCGTCGCCCGAAGTGTCAGGGAGAGGGGAATGGCTATTTTCATTCCGGTTTTAATCTGAGTTACGTATAAGCGGTTGTCAAAAACATCACTAAATTTCATATTTACGATATCCTCCCTACGTTGACCAGTAACGAGCGCTAAATCCATCGCGAGAGGGAACCATGCAGGCATATGCTCTGCTGCCGCTCGTGTGGCGTTATACGTTTCCAGTTGCAGGCGTTCCCTGGCCACCTTAATCTCTGGTATCCGGGTTGCTTCCACCGGGTTTTTCACAATATGCCCTTCGACAATAGCCTCTCTGAACATGTCAGATAGAACTGATCTCATTGCTCCCGCCATAGTGTTTTTTCCCTCGGTTATCCACGACTCAAGAAACTTGGCAATGTGCCTGGTTGTTACTTCTGCCAGTATTATTTCCCCCATTTTTTCGCGTACGGTCGCTAATTGATTACCGCGAATCTTGTAGGTATTAACCGACAGACTCCGGCGCTGTAATAAAACCTCATAGCGATCAATCCATGCGGACACAGTGAATGAGTCAGTTCCTTTTAGCTTTTCAATAAGCGCCACTGGTGTGTGGTTTTGCGCTATGAAGTTGTTTGCCTCTATGGCCTGTGTGATAGCGTCCCTGCGGGCGATCTGACCGAGCGGAAATTCCTTGTCAGTTAGCGGGTTACGCCAGAAAAAAGATTTACTGGCCTTACGGTAGGTGAGGTTCCTCGGAAGGTTAGCATCGTACTTTTTTCGACTCACTGATCAACTTCTCCAGCAATGCACTCGGTTTTCCGGTGCGCCCGTTTGGGTGGTGCTGTTCAAGCACAAGTCCCACTTTATTCGGCTTGATATAAAACGCGTCCGGATCAACCCGATACGTCCTGCCATGTAATACTGGAGTCGGGTAAATATTTCCGTTTCGCGCCCATCGTCTTAGTGTTGTGAGAGGTGGTGGATCATCGGGATAATTTAATTCACCCCAGGTTTCAAGTCTCACAAAGCTCATAGTCATGTCTCTTTACTTCATGACCGCCGCCAACTATACGGTGTGGCGGTCGGTCAGGGTTGAACATCAATGATCAGGGTAAAATTTAAAGGACTGCTGACCGCCGCCCGGTAAAACTTTTACATCTCCGGCGCGCCGTCCTGTAAATCCTGCCCAATGCGCGGCGCGTATCCTGTCAGCCTGCTCTTCAGTCAGGCAGGGTCCCGGAAACGGGAAGCGAGCTTACCCCACTTACTAAAAGAGGATGGAACTGGCTGACGTAAAACACGATTTATTTGTATCCATAAATAGCGAAATGTAACGTTTTGGTTATATTTAAAAGAGAGAAAATGGTCAGCAATAACTTTAATTGTTTGAATTAACAGATAATTAATCTACCAGACTGAGTGATACAGAATATTTTTACATGAGGGGTACAAATGAGACTTAAGTTGATCGTTAAAAGTTTTGCGCTGGCGGGGCTACTCTCTTCCACTGCGCTGACACCTTTATTTGCACAGGAAGCCCCAAAAGGTGCCACTGCTTCAACCAAGCAAGCTAACGATGCGCTTTATAACCAACTTCCTTTCTCTGATAACACCGATTTCACGAATGCCCATAAAGGCTTTATCGCTGGTTTACCTGAAGAGGTGATTAAGGGAGAGCAAGGGAATGTCATCTGGAATCCACAGCAGTACGCTTTCATAAAAGAAGGGGAAAAATCTCCTGACACTGTTAACCCTAGTCTGTGGCGTCAGTCCCAGCTAATCAATATCAGTGGCTTGTTTGAAGTCACAGACGGCGTCTACCAGATTCGTAACCTTGATTTATCCAACATGACGATTATCGAAGGTAAAGAGGGGATTACGGTTGTCGATCCGC
TGGTTTCTGCGGAAACAGCCAAAGCCGGTATGGATTTGTATTTCAAAAACCGTGGCAATAAGCCTGTTGTCGCCATCATTTATACTCATAGCCATGTTGACCACTATGGCGGTGTGCGTGGCGTTGTCGATGAAGCGGACGTGAAATCCGGCAAGGTGAAAGTGTATGCGCCTGCTGGCTTTATGGAGGCAGCAGTAGCCGAGAATATTATGGCCGGCAACGTGATGAGCCGCCGTGCCAGCTATATGTATGGCAACCTCCTGAAACCAGATGCCTCCGGCCAGGTTGGCGCCGGACTGGGGACGACCACCTCTGCGGGGACGGTGACACTGATTGCGCCCACTAATATCATCGATAAAGACGGCCAGAAAGAAGTGATTGATGGCCTGACTTACGACTTTATGCTGGCCCCTGGTTCGGAAGCCCCTTCGGAAATGCTGTGGTTCATCGAAGAGAAGAAACTCATCGAAGCCGCAGAGGACGTCACTCACACCCTGCATAACACTTACTCGCTACGTGGCGCAAAAATTCGTGAGCCGTTGCCGTGGTCGAAATATATCAACGAAGCTATAGTGCGTTGGGGTGACAAAGCTGAAATTATTATGGCCCAGCACCACTGGCCGACCTGGGGTAACGAGAATGTTGTTGGTCTGCTGAAAAGCCAGCGAGACCTGTATCGTTATATCAATGACCAGACTCTGCGCATGGCCAATGAAGGTCTGACTCGCGACGAAATAGCGGCCAACTTTAAACTACCGGATAGCCTGGCAAAAACCTGGGCCAACCGCGGCTATTACGGCTCCATCAGCCATGACGTAAAAGCAACGTATGTGCTGTATCTCGGTTGGTTCGATGGCAATCCGGCAACCCTTGATGAGCTGCCACCCGAAGAAGCGGCCAAGAAATTTGTTGAATACATGGGCGGTGCCGATGCGATTCTTCAGAAAGCTAAAGCAGACTTTGACCAGGGGAACTACCGTTGGGTTGCTCAGGTGGTGAGTAAGGTCGTGTTTGCCGATCCAAATAACCAGAATGCACGTAACCTTGAAGCCGATGCGCTGGAGCAATTGGGGTATCAGGCTGAATCTGGTCCATGGCGTAACTTCTACCTGACCGGTGCGCAGGAGCTGCGTAACGGTGTGGTTAAAGGTCCGACGCCAAATACAGCAAGTCCGGATACCGTTCGGGCGATGACCCCTGAAATGTTCTTCGACTTTCTGGCTGTACATATCAACGGTGAAAAAGCGGGTAATGCCCGGGCGGTATTTAATATTGACCTTGGCAGCGACGGCGGAAAGTACAAGCTTGAGCTGGAAAATGGCGTGCTGAACCACACGGCTAATGCTGAAGCGAAAGATGCTGATGCCACGATTACTCTGAACCGTGACACGCTGAATAAAATTATCCTGAAGGAAGAAACTCTGAAGCAGGCTCAAGATAAAGGAGAAGTCAACGTTACCGGTAATGCTGCGAAACTGGATGAGATGCTGGGCTATATGGACAAGTTTGAGTTCTGGTTCAATATAGTTACACCATAAATAGATTCCCTGCGGCGTCAATGCTGCAGGGAAGTTACTTCAGACAATTCTGTACGTTTTTTATACTCTATTTTCCCTTCATTCTTTATATCTTGCTTCATCTTATGTATTTGCTGCTGAAGAACATGGCCCTGATACCAGTCAGTTCTGATTCTGTTATGCACAGCCTTTTTCATCAGATGACAGTAACTGGTTGTTGCGTGATTCAATGGCCTGCGAGTCTGGCCAACATGCTTTTCGATGCCGGTTGGCCATGATGCCAGTATGGTTAACTGGCATCATGGCAGCATAATTTTGCCGGATAAGTCAACCGCAGCGATGTTAATCGTCCTGATTATCATCTGCATCACTGTCACAGTGACTGCACCAGTAACGAGGAGAGACTGCGATCGAACCGGCCAGACAGAGAGGAGGTAGTTGTCTTCATTGCTAAGTAAGAGACCTTGGGGGATGAATCTCCAT
CACCTGTGATGTGTCAGACAACCTCAATGTACCCGCACTTAATACCTGCGCCGGCGGTTTTTTTAATGTCCGGGAAATGAGCATGTCAAAAAATAACCAGTTATAAGATTATAAATAGAACACAGAGAAAATGTCATTGCATATGGTCAAAAAATAGACATATTTATTGATGATGATAATTAATAGTCTACTATATATTCATGTTGAGAATGAAGATGCTTTAAAAATGCTCAAGTTCGTTATCTATGGAGACACCGTGAAAAATTTAAATAAAACATTCACTTGTAAATATGCTGTTATTCGCCGTGATGACATGACAGTAATTGCTGAAATGGATTTTTTTCCTGACTGCAACAGGTCATTGATGTATCGGGATGGCCGCTATGTCCGGTTTCTGCCGTTGTTGCAAAATGACATCATGGGGAGCGATACCCTGATTAATGAGCTGACTATCAGGGCCGGTTATCATGAATAATCATCCTTTGTTATACTCGTCTGCGGGCTGAACTCCCAATCTACTGCGCCAACGGAGAGAACGATGGCGCATTTACAACTGGTCAAGCAAACCTCATCAGGGCTTCTGCTCCCGGCGACGCCGGAGAGTGGGGATTTCCTGCGCTCAGTAAAAATCGGTGAGTGGATACACGCCGATTTTAAACGTGTCCGCAACTACGCCTTTCATAAACGATTTTTTAAACTCCTTCAGCTTGGTTTCGACTACTGGATGCCAACGGGCGGCACGGTCACATCGCGGGAACAGAAACTTATCTCCGGGTTCGTTAATTTTCTTTGCGACTCCGCAGGCCAGGAATATACCCCGGCCCTTAACGAGGCGGCGGAACAGTACCTCCATAACGTAGCTACCCTGCGAACCGGGGACGTCGCCCTTCTTAAGTCTTTCGATGCCTTCCGGGAATGGGTAACCGTTCAGGCCGGGTTTTATACCGAGCATTTTTATCCGGATGGCAGTCGCGGGCGCCGGGCGAAATCCATAGCATTCGCCAGTATGGACGAAACCGAGTTTCAACAGGTCTATAAAGCTGTGCTGAACGTCCTGTGGAACTGGATTCTGTTTCGTAAATTTTCCTCTCTGGAAGAAGTTGAAAATGTGGCCGCGCATCTGCTGGAGTTCGCATGAAAATGACATGGTTTCAGCATCCGGTGTGTACCACCGAAGAGGCGGATGAGCTGGTGGCGGGATACCGGCGCCGTGGCGTGAAGGTTGAGCGTTACGGTGAGGCGGAGGTGCTGGAACTTGAGAGCAATAATACTCCGCAACGTTGGACGGTTGAGGAGCTGAAAGAAATCAGGATCGCTGCACTGGCGGATCTGCGTGCGCTAAAAAAGCTGGAGGCGGCATGACATTCGAATCCTACTTTGCCGATCATCTCCGTGCTCGCTGGTAGCAGTTGTGCTTATACCATTTCCGGGTTCCATCCTGATCGATTACCGGATTTTGAAAAACTACGTGAAGATAACGGGCGGTACCGTATGAATACACAATATCTGGAATATGTGCGACAGCAGCTCATCGTGGCGACTGCAGATCTGAGTGGCGCCACGAAAGGTCAGTTGCAGGCATGGCTGGAGAACGCCCAGCTCTATACGAAAAACTATCCCCGAAAAAAACAGCGTATCAGGGATGAAGTGACCGGAAAAATGATAACGCTGAATAATCCACCGATTGCTGGTAAGCAATCACTGGCGAAAGGAAGCGCAATTCCGCTTGTGCAGCCCGTAGAATACTCCACTTCCTCATGGCGCCGTGCGCTTTTGTCACTCGAAGAACATAATAAGGCCTGGCTATTGTGGAATTACAGTGAAAACACCTGCTGGGAATATCAGGTCACTGTAACTCGATGGGCTTGGGAAAAATTCAGCCAGCAGTTGGAAGGGAAGCGAGTTGCGAAGAAGACTTTAGCACGGTTGCGCCAGCTCATCTGGCTTGCTGCGCAGGATGTGAAGGCGGAACTGGCCAGACGTGAGACGTATGAGTACCA
AACGTTGGCGGAACTGATGGGCGTGGCAAAATCTACCTGGACAGAGACGTACATGTCTCATTGGTTAGTAATGCGTAACAGCTTTAAACGGCTTGATAGTGATGCGCTTATCTCCGTAACACGATCGCGTTCACAACAAAAGGCGACAAATTTGGATATAAGTCTTGCAAAACCGAACTGAAATACATATATTTCATGTAAATTTGATATCATGCCTAAAATATGCAAGCCTGCTGAGGAACGGGATTTTTGTATTAATCAGCAATAGGAATTGATGATGTTTTATCGTGATTTATTTCAAGTTTTTGGTCCCGACCCGTTGTATAAGGAAGAAGAAGGAATTGCCATCCTTCGTGAGCAATATGGGATCGAAGCTCCAGAACAAATTTTTAAGCAAATTTATTGTGGGTTATCTAATAATTCTGAATTTCAAACCTTGTATGGGCATCTAAATCTTAAATCACTGAAGTGGGATTTGGTCAGATTGAAAACAGCAGAGTTTACAAAGTTTGGCAGAAATGCCACATATCCTGATTACATGCTCGAGATTTCAGAAGACTTTAATGCCTGCGGCAGCAAGTTTTGCATTGATGCCCGTGAAGAGGTTGCAAACCATTGGCTTAAATTCGGTACATGGGCTGAACCACCGATGTTTATTGAGCGTTCGCTTATTATTCCTGGAGAGAGCGGCTTACACCTTATGGAGGGTCATACAAGATTAGGTACTTTATTGGGGGCTATTAAGTACAAATTTGTGCAGTTAGCTGATACTCATGAACTTTATATAGCCTCGCAGAAATAGTTTAGAGAGGATTGCTCAACACCCTCGGTAAGAATTGCCACACATCAAACTAATGTTAGGGTATCTTCGTCCACAGAGTCGAAATGGCCTTATTTACATCTTCCTGGCTTTTCGCCGGTTTTTTTATTCAGCCCTCGGAAATCATCATCTACACGCTTCGTTGTTAAAACCCCGCCCGAGGGCCTCTCACCCTTACAAATACAGCGCCATCCAAGCTATCGGGGGCGAGGCTTATGAAAATGCACAACGATCCCCATTCAATGGACTCACAATCTATTTTTGCGCTGATTGCGAGTCTGACTTTTTTCGATTGTGAGTCGTTACAGATAGCCGCCGGGCCAGACACCACAACGGTACCAGGTGGCGTTATGTGCTAGAAACCGAAATTCTTGAACATCTCATTACTACTCATATCGTTGGCTGGCGCACGGTTCATCGACTCCATCTGCTTGATCAACCCTATGCTAGCACCAACAGGCCCCCCGACGCTGAAACCGACGCTGCTGGAGCGGGTTTTGCTGGTGCTACTGCTGTGGGTTTGACTACGGGTGTTACAGTTGCTTTGGTGTGTCCGTGTGAGACTATTTTCGCTGAGCTGGCTATCTTCACTTTGGGTGCAGTTTCCCTGTGACTGGCTGTGACTGTCTCCTGTGGTGTGGTTGCGGCTATCCGTGTGCCACTCGCTATAGCCTCCCGCTGGCGCAATGATTCCGACCGATGTACCGTGGGTGGCATAGGGAGGCATGACCATGCCACTACAGCCACCGAGCAGTAACGCGGTGCCGACGATAAAAATGAACTTACCCAGAGCGGTGTTTTTCATTAGTTCCTACCTATTTCTTCTCGTTGGTTATTTCTATGCCCTGCCTTGGCAGGGCTCGCTATTATTCATTCTTCAACTGTGAGATCAAGTTATTTTAATTAAATCGGTATTATTCTTAATTGTGTTCTTTATGAATAAATATCCTCCGGCTATGCCGGAGGATATTTATTATTTCCCCTCATAACTGAGAGGCCCCACACAACCAGAGGGGGATGAATGTCCGAATCGATTTCTGGTACTGGGTTAGCAGGTGGCATCCTGACAGGAGCCAGTGTCTATGGACTGCTGACCTGTATTAGCTCAGACCTGAACTGGTTACTGTGTTGTAGCATCGTGGGATTTTGCATTTTTTGATGAGTGTCAATTACTAAATTC
GTAGGCGATTCTTGGTGGTGATGTGTGACCCATCTCTTTTAAAATGATATTGGTATACTCGACTACCGGGCCTCTTGGATTACTGTCTTCTTTGTCCTGAAGGTGAGTCAACGCGTGTACAACTTCGTGAATAAATGAGCGTGTTGTATCAAATTGTTGTGGGCCATCATTACTTTCATAGTACTCTGGTATTGAATCATCGTCTGTATCATCCAGGTTGAGGGCAATCACTTTTCTGCCTTCTGAACTCTCCAGGTCCTCATCAGTTACGGTAGTACCAAAGTTTTCTCCGGCTCCCAGCAACCAGCGTTGTTCTACATCACGCAATTCCTGATCGTAGGCATAATTCATCAGTCTGCGGAATGTCCCGCTTTGAGTGTATGCATCTTCAAGTATGCGTGATAGCACCTCACGGCATTCATCATAGGTATCATCATCAATTTCGATATCAGGATCCATTCCTCCTGGTCCAGAGATAAGGTATTCAGCAAGACACATTGGTTCCAGCCTGGCTTTATCATCGGTAGCAAGACCATCATGTTGGAGGCGTAATTGCGAAGGATTATCTTGGTGTTCTGGAAGGTCTGGAAATACCTTGCTGTCATGAGGATGGGATAATCCATATGTTGACATCATATTATTGATAAATATTGGTTTAATTCCCGTTGGCATGATGAGTTACACATCCTTTTTATTACATGGAATTAACATTCTATAAATAGCATGTTTTTGTCAAACAGAATTCACTCAGCACGCAATCAATTAAACTAAAAGCTAAATTTGCAGTATTTGTGCCTCACCTCCATTAAAATTGTACTCTGCGTGATTTTACTTTCAGATTCTGCAACCACAGGCAATCCTGTTTTACAAGATATTAAACCCTGCAACCCAACCATTTCACTCACTCTAGTTACCATCCGAAATCATCGGAGGTGAGGCTTATGAAAATGAATGACAAGACTCCTGAATTCTGGGCTGCGGTTTTGACCGGACTCAAAAATGCGTGGCCCCAGATACTTGGGGCGTTAATGGCCGGACTCATTGCCTACGGCCGACTGATATACGACGGCGCCACCCGTAAAAATAAATGGCTTGAGGGCGTCCTGTGTGGCGCTCTTTCCTTATGTGTCACCAGTGCGCTTGATGTGGTAGGCCTGCCGGTTTCCATTTCGCCTTTCGTTGGCGGAATTATTGGCTTTGTCGGTGTGGACAAGCTGCGCGAAATCGCAATTAGCGCACTCAAAAAACGTGCAGGGGTTAATGATGAGAATCAGTGAAAAAGGCATTACCCTAATCAAAGAGTTTGAAGGTTGTAGCCTGACAGCTTATCCGGACCCGGGAACGGGGGGAGATCCCTGGACGATTGGTTATGGCTGGACCCACTCTGTTGACGGTAAGCCAGTTAAGCCCGGAATGATGATTGACGAGGCTACTGCCGAGCGCTTGCTTAACACTGGTTTAGTCGGTTATGAAAATGATGTGTCCAGACTGGTTAAGGTCAAGTTGACGCAAGGCCAGTTTGATGCGCTGGTGTCGTTCGCGTACAACCTCGGCGCCCGGACATTATCCTCATCAACTCTGCTGCGGAAGCTAAACGCTGGTGATTACGCTGGCGCCGCTGATGAGTTCCTGCGCTGGAATAAGGCTGGTGGCAAAGTACTGAACGGGCTTACCCGTCGGCGTGAGGCGGAGCGTGCTCTGTTCCTGTCATGATGTTCAACTGGAAAACGATGTTTGTTGGCCTGTTGCTTGTCTCGCTAATTGTTGCCGGTCGGCTGGCAAATCACTACCGAAATAACGCCATCACCTACAAAGAGCAGCGCGATACCGTTACTCATAGGCTGACGCTGGCGAACGCGACAATTACCGACATGACTAAGCGCCAGCGTGACGTTGCCGCCCTCGATGAAAAATACACGAAGGAATTAGCCGATGCGAAAGCTGAGAATGATGCTTTGCGCGATGATGTTGCCGCTGGCCGCCGTCGCCTGTACGTCAAC
GCAACATGCCCCGCAGTGCCGACAGGTAAATCCACCTCCACCGCCCGCATGGATAATGCAGCCAGCCCCAGACTGGCAGACTCCGCTCAACGGGATTATTTCGCCCTCAAAGAGCGAGTGAAGACGATGCAAAAGCAACTGGAAGGGGCGCAGGCGTACATTCGCACCCAATGCCACGGTAATGCAGGAAAAACTAGTAACCAATGGTGACTGTATTAAAAAGGTACTCCCGGGCAGGGGGCGCCACGGGTGGCTTCGGGCTCGCGGGAATCGGCTGATTTTTGATTTTTTAGTCTCTGTCAGCACTGAAAAATAACCTTAAAAATCAATACATTTACTGTTTTCAGTGTCGAGGTGGTACGTTTTTTGTTCGACACTGAACTCCATTTTCACCGTATACAGGAAAAGAGCACGACTGTGGATCAGGAAATTAAAAGCCTCGAATTAAACATCACACAGCTTTCGGCCATCACTGGTGCACACCGACAGACCATCGCCAGCAGGCTGAAGGGCGTAAAAACCTCAGGTGGGAACGGTAGTAACCTGAAAATCTACCGGCTGGTGGATATTCTGACCGCCATGATGACGATGCCGGCTGTTACCGGGGAGAATGACCCCAATAAGATGAAACCCTCAGATCGACGGGCATGGTTTCAGTCGGAAATGACGCGTATTGAGCTGGAAAAGGAGATGAGAACTCTGATCCCGGCCAGCGAGGTGCTGAGCGTTTAAGGTAGGTGATGCTGATTATACCTCCCTTCTTGTTGGTCCTTCATACCGTTTTAACGACTATCTGAATGCTTACGTGATGATTGGTGCAGCAAACGGACATATTAAGGATAACTGGGGAAATTCTGACAATAAAACCGCCTTTGCTTATGGGGCAGGTATTCAGCTTAACCCGGTTGAAAATATTGCCGTTAATGCGTCTTATGAGCATACAAGTTTTTCCACTGATGCTGACAGTGACGTCAAAGCTGGAACCTAGGTGCTTGGCGTAGGTTACAGCTTCTGACCTTTAACATCGATACAGATTTAATGCCCTCCAGTGAGAGGGCTTTTTTATGGGTAAAACGAAATTATGACGATATGGCTATGTTGCTGTTATTTCTCAATGACACCACAGGCAAAACGTGCACCGCCACCACCCAGTGGAGCAGGTTTATCGGAGTAATTGTCACCGCCTTTATGGATCATCAATGAGTGACCTTTCAGTTCTGACAGTGATTTAAGGCGTGGTGCCAGTAACGGATACGTGGCTGTACCATCTGCATTGACAACCAGTCCAGGCAGATCCCCCAAATGCCCTTTGTCATTATATGGGCCAAGATGTTTCCCGGTTTTTTCGGGGTCAAGATGTCCTCCGGCCATGAGCGCCGGAACCTCTTTACCGTCTTTCATTCCCGGCATACAACTTGGGTTTGTGTGGACATGGAAGCCGTGAATTCCTGGCGTAAGACCATTTAGGTGAGGAGTGAAAAGCAGACCGTAAGGTGTCTCTGAAACTGTGATTTCACCTATGTTTTCTCCTGTTCCGCTGGACAGGGCATCGTTCATCTTTACAGTCAGGGTATTCTCTGCCATTGCTGAACAACTGATGAGCGCACCAGCTACCAGCGACAATATTGTGTATTTCATTAGTTACCTCGTTTTTTGGTTGTATCGTAAATACCATTAATAAAAGCAGGTATATTTTTGCAAGATAAATAATAAAGGATCTCTCATATATGCAGGATATACCACAGGAAACCTTGAGCGAGACCACCAAAGCGGAGCAGTCCGCGAAGGTGGATTTGTGGGAATTTGATTTAACCGCGATTGGCGGTGAGCGCTTTTTCTTCTGTAACGAACCGAACGAAAAAGGCGAGCCGTTAACCTGGCAGGGGAGGCAGTACGAACCGTACCCGATACAGGTACAGGATTTTGAGATGAACGGGAAAGGCGCATCTCCCCGCCCGAACCTCGTTGTTGCCAATCTCTTTGGTCTGGTCACGGGGATG
GCGGAGGATTTGCAAAGTCTCGTCGGCGCGTCAGTGGTAAGGCATCAGGTTTACAGCAAGTTTCTTGATGCGGTGAATTTCAGTAACGGCAATCCGGGCGCTGACCCGGAGCAGGAGGCGGTAGCGCGCTATAACGTGGAGCAGTTGTCAGAACTGGATTCATCAACTGCTACCATTATTCTGGCATCACCGGCAGAAACCGACGGTTCTGTGGTGCCGGGGCGTACCATGCTGGCGGACTCCTGTCCGTGGGATTACCGGGATGAAAACTGCGGATACGACGGCCCGCCCGTGGCCGATGAGTTCGATAAGCCCACCTCAGACCCGAAAAAGGATAAATGCAGCCACTGCATGAAAGGCTGTGAAATGCGTAACAATCTGGTGAATGCCGGATTTTTCGCTTCCATCAACAAACTGTCTTAACAGGTTCCCATGATTAACGATGACATTCTGGCACATGCCCGACAGTGTGCGCCTGCGGAATCGTGCGGTTATGTGGTCAGAACAGCACAGGGAGAGCGGTATTTTCCGTGTGAAAATCTGTCTGCTGAACCCACGATGTATTTTCGTATATCCCCGGAGGATTACCTGAATGCCCGGAACCGCGGCGACATCGTGGCGCTGGTACACAGCCATCCTGACGGTAAGCCCTGTCTCAGCAGTGCGGATCGTACCCTCCAGATACAAAGCGGAACGGTCTGCAGAGCGTGCTGATTAATAACACGCCGGTGGTGGACGCGGACGGTAACAGTAATATTCACGGCGTGACCGTGGTATATCAGGTGGGGGAGACACCACAGGCACCGCTGGAAGGTTTTGAGGCTTCCGGCGCGGAAACGGTGCTGGGTGTGGAAGTGAAACACGATAATCCCGTTACCCGTACTGTTGTCTCAGAGAATGTCGACCGGCTACGCTTCACCTTTGGTGTACAGATGCTGCAGGAGACCATGGACAAGGGGGACCGTAACCCGTCCTCCGTGAATCTGCTGATACAGTTTCAGCGTAGCGGGATCTGGAACACAGAATTTGATATCACTATTAACGGCAAGATCACAACACAATATCTGGCATCGGTAGTGGCTGATAATTTACCGCCGCGCCCGTTCAGTGTCCGCATGGTCAGGGTGACACCGGACAGCACCACCGACAGGCTTCAGAACAAAACGCTGTGGTCGTCGTATACGGAAATCATCGATATCCGGCAGGGTTATCCTGGCACAGCGGTTGCCGGTCTGCTGGTGGATGCGGAACAGTTCGGCAGCCAGCAGGTCACGCGTAACTACCACCTGCGCGGACGTATTTTTCAGGTCCCCTCAAACTATGACCCGGATACCCGCACATATACCGGCCTGTGGGACGGGGCGTTTAAACCGGCGTACACGAATAACCCGGCGTGGTGCACGATGGATAAACTGACCCACCCCCGTTACGGGCTGGGCAGGCGTATCGGGGGGGCGGATGTGGATAAATGGGCGCTGTACGCCATCGCGCAGTACTGCGATCAACCGGTGCCGGACGGATTTGGCGGCACGGAACCCCGCATGACGCTTAATGCGTATATTACCACCCAGCGTAAGGCGTATGACGTTCTGGCGGATTTCTGCTCGGTGATGCGTTGTATGCCGGTATGGAATGGCCGCAAAATGACCTTCATCCAGGACCGCCCCTCCGATAAAGCATGGACCTACACCAACGGTAACGTGGTGGGCGGGCGCTTTAAATACAGCTTCAGTGCCCTGAAAGACCGCCATAACGCGATAGAAGTGAGATACACCGATCCGCTGAATGGCTGGCAAACCTCCACGGAGCTGGTGGAAGACCATGCCTCACAGGCCCGTTATGGACGCAATCTGCTGAAAATGGACGCGTTCGGCTGTACCTCACGTGGACAGGCGCACCGGACGGGGTTGTGGGTGATGATGACGGAGCTGCTGGAAACGCAGACCGTGGATTTTTCTGTCGGTGCGGAAGGTCTGCGTCATACACCGGGCGATATTATT
GAGGTCTGCGACAACGATTACGCCGGGGCGTCGGTCGGTGGGCGTATCACTGACCTGGATATTTCCACCCGCACGCTGACGCTTGACCGGGAAATAACACTACCGGAAAGCGGCGCCACCACGCTGAATATTGTCGGGCCTGACGGTAAGCCGTTCAGTACGGAGATTCAGTCGCAGCCCGCACCGGATCGGGTGGTAACGAAAGTCCTGCCGGAAACCGTGCAGCCATACAGTATCTGGGGGCTGAAACTGCCCTCCCTGAAGCGCCGCCTTTTCCGTTGCGTGCGTATTAAGGAGAATGACGACGGCACATACGCCATCACTGCCTTGCAGCACGTTCCGGAAAAAGAGTCCATCGTGGACAACGGGGCGCACTTTGACCCGTTACCGGGGACCACCAACAGCATTATTCCGCCCGCTGTGCAGCATCTGACAGTCAGCACGGATAACGACAGTACCCTGTATCAGGCCAAAGCGAAGTGGGGCACGCCGCGGGTGGTAAAAGATGTGCGTTTTGTGGTGAGGCTGACCACAGGCAGTGGGAACGAGGGCGATCCGGTTCGTCTGGTGACAACGGCGACGACCAGCGAAACGGAGTACGCCTTCCACGAACTGCCACTGGGTGACTACACGCTGACAGTCAGGGCAATAAACGGTTACGGGCAGCAGGGTGAACCGGCGTCCGTGGCATTCAGTATTCAGGCACCGGAAGCGCCATCCACGATTGAGATGACGCCGGGTTATTTTCAGATAACGGTGACGCCGCACCAGACTGTCTACGATGCCAGTGTGCAGTATGAGTTCTGGTACTCCGCCACGCAACTGGCGACTGCCGCCGATATTCAGTCAAAAGCACAGTATCTGGGCGTCGGGTCATTCTGGATAAAGGATGGACTGAAACCACTGCATGATGCCTGGTTTTACGTGCGCAGTGTAAATCTGGCTGGAAAATCAGTATTTGCGGAGGCATCCGGACGTCCGGGGGATGACGCGAAAGGGTATCTGGATTTTTTTAAGGGACTGATTACGGAGACGTATCTTGGTACGGAGTTGCTGAAAAAAAATTGACCTGACGGAGGATAACGCCAGCAAACTGCAACAATTTTCGAAGGAGTGGCAGGACGCTAACGATAAATGGAGCGCCACGTGGGGCGTCAAAATAGAGCAGACCAAAGACGGCAAATATTATGTGGCCGGACTTGGACTGAGCATGGAAGACACGCCTGACGGGAAGATAAGCCAGTTCCTGGTGGCGGCGGATCGCATTGCTTATATTAACCCGGCAAACGGAAACGAGACGCCCGGATTCGTCATGCAGGGCGACCAGATAATCATGAATGAGGCGTTCCTGAAATACCTGAGCGCGCCGACCATTACCAGTGGCGGGAATCCTCCGGCATTTTCCCTGACGCCGGATGGAAAGCTGACTGCGAAAAATGCGGATATCAGCGGCCATATCAACGCTGTATCTGGCTCGTTTACGGGAGAAATCAATGCCACCTCCGGTAAGTTTTCTGGCGTGATAGAAGCAAGAGAGTTTGTCGGTGATATCTGCGGCTCAAAAGTCATGCAGGGCGTGAGCATCAGGGCGACGAACGACGAACGCAGCACCTCAACACGGTATACCGACAGCGCCACCTATCAGATAGGGAAAACCATCACGGTGATGGCTAACTGTGAGCGTAACGGTGGCACCGGTGCCATCACCGTCACGATAAATATTAACGGCCAGGTGAAAACGGCGGAGGTTATGCCGTATACCGCAGGGATTCCGGCCATGTATCAGACCGTCGTCTTTTCGGTCTACACCACTTCACCTGTCGTGGATATCAGCGTCTCTCTGAGGGTCCGTGGGCAGTACACCACGTCTGCTTCCGTCTGGCCGCTGGTGATGGTTTCCCGGTCGGGGAGCAACTTCACAAACTGACCGGATTTACGACACAACAACCGGCAATCAGGTTTATATCTCCGAACTCGGCCCGTTGCCCGAAAACGTCACA
TCAGTTTCACCAGACGGTGAATACCAGAAATGGGATGGTAAGGCGTGGGTGAAGGATGAAGCGGCTGAAAAAGCAGCGCAGCTTCGTCAGGCGGAAGAAACCAAAAGCAGGCTCCTGCAAATGGCATCTGAAAAAATCGCGCCGCTTCAGGATGCGGTTGATCTTGGACTCGCAACAGATGATGAGAAAGCGCAGCTCGACGAATGGAAAAAATACAGGGTGCTGGTAAACCGGGTGGATACCTTAAATCCTGACTGGCCGGAGAAACCATCTTAGTTATAAAAATATAGCTATGTAGTAGAGATTGCTGCTATATGTTATATAGCAGCAATGGCTATTATTTTGATGGTTGAGTGTATAATTTTAGCACTGGTAAATGACGGTTTAGCTCCAGAGTTAGTTCCTGGGAAAAATTATGGATACTATTGGTTCATATTAATCAGGAAGAGGCTCGCATATTTTTTGGTGTTTCTGTGTTCAGGGAGTGTTTTGGATATATTTATTAGCAATGTTTTCTAGTATTTGTTGGAATTGCTGTGGAGTCGGCATAGCACACTCATTAAACAGCGGTACAACTTCTGTCATAATGATCTTCTCCGCATAGATGTTAAAGGATGCCTTCTGATGTTGACTGGTAAACATATGTGGATACTTACTGTATGCTGATGTAATCAACGGAGTTATAAAAGGATTGGCTCCTGCGCCGCTTGGCGTAAAAACGTCATTCTTGGTTGCTCCGGGCAGACCTGCATTTTGCGCAGCCTCGCCAATTTCCTTAAGAAAAGGCGCTATGTCGATTCCTTTGCTGATGAGCAAGTTACAATACTGATCTTTATACTTGCTGTAAACTGCCGATAGTATAGCTTCACGGGTCTGGCTGGCGTATGCGGGGTCTTTACTCGCGCTACCTCTAATATCTATATCATGGAGCGTTTGAAGCATAAAGTTTTTAACGACTTTATTTGTCAACACTGCCCGGCCCTCAGATGCGCTTCCTCGGTGAAAGTGTGTTGCAGAAGATTCAGTGTTCTTATGCGAAATAAAACGTTCCGATAATTTTGAATTTAATTTGATGAAGTGATTTTTTACTGCGAGAATACTTTTTGCTAAAGAATTTTTCTCGGTTGATTTTTCTTTTAGTGGTGTGGTTTCCTGTTTTTGGATTCTAAAGTTATGGGGAAATAAAGTTATTTTTGTCACGGTAATGCTCCTTTTATATGTACATAACTCATTTATATATAGATAGCAGGAATACTTTTATTTTTTATAGCAAATGCTATGTCCATCTGATTGATGAATTAGAAAAAATCGGCAGATTAGATTAATGCTCAAATAGTACTATTTTTATTTTCCAGAAACTTTCAAAAAATGTCCTTTTCGCTCAGGAGGAGCCTTGCTGTTCTGGCATTGAAATGGAGCGTGAGCTGATCGTTGAGCGTACCAGCGCCGGGCTGGCAGCGGCAAGGGAGCAGGGGCAAATTGGTGGCCGACGACCGAAACTAACAACGGAACAATGGGCGCAGCCGGGCGCCTCATTAGGGCCGGAGTACCGTGACAGCAGGTAGCGATTATTTATGATGTTGGCGTTTCGACGTTGTACAGAAAATTTCCGGCGAGTAACATGCCTTTGATGCCATTCATGGGAAAATAACACTCCTTTAATCAGCCATAAGAGTTAATTATATAAACTTAAATGGATTTGTTAATTGAATTTCATCAAGATTAACTCTATATAAAAGATTTATTGTCTTTGTTATGGCGTCTATATATTCTGCGCTATTCATTATAGCAGTGTATTTATCGTATGCTTCGAAGCAATGATTTTCATTGATATATTCACTGCTATTGTTTTTATAAATTTCATATGACTCTTCCTTTTTGAAAATGCGCGCATCCAGCCAAGGGTCAAGTATATATTCCGATATTCCTTTTGAATCAGTAAAAGTCAAAAAAACGACCACATGATTCCCTCCTGACGCGCTATTATACATAAGGGATGT
GCTTATTCTGGCATCAAATACATTATTTTTCGAAAATCCTATTCCGGTTAGTCTTTGTGGTATATATTTTGCAATAATCGCACCTAAAATTAGCGACATATCAGCACAGTTCCCTGTGTTATATTTTATTGAATCAATGGAAGAATATATTAGACTCAGAGATATTGGGTTTTCTGGTCTTTCAGCCATAGCTCTGTCAAAAGAGTTTGCTTTTATAGTTTGCAGAAATTTCTTGTGATGTTCTCTTTGTAAATTAAGAGCGTCGTATTTTTCCTGCGTATCAATTTTAAGCCTGTCAATTTGCATAGCATCATAACTGTCCGATTTCTTTATTATTGTTCTGACAAAAACGGTACATTCTGAAGCAGCATTAATAATACAGGAGGATAAATCAACTGAAGTACTCTGTTGGTTGTTACAGGCGCTCACTCCAGTTGTGTCGAATTTTTCTGTAGCTACATGTCCAACATTTATAATCAATTACATTTCCCTGTATTCAAAATTAAGCGATATTCTTATGTCACGACAACCTAACAGGGACTGACTACAAAGATCTTTAAAGAAACGTTCATTATTGTATCTATTGATGAACTCCCTTCAGATTTAGAGGTTGTGCCCATGGTCGAAAAGCTCCTCTACGCTCATATACTTAACTTATCGGGGACATGGTAGGTTAGGATGTGAAAACACAGGGACAGTTACTGTCACTGCGAAGGCAGCAATAAGGAGGCCTATCCTGTACGAACTGTGGAAAACTATATCTTGTTCACGATCACCTGCATCGTTGACGATGCGCGATCCTGGAGGTTTGACGACTGCCCGGTATGATCGGTGCCGCAGCCACGTCGTATGCAGGAACGGCCTGCGGCAAACTGGCGATCGTTCGATAGTGCGAATATTGAATGGTTGCCAGTCGCGGCGGATTCTACTGGTTAAGAATGGCTAATCAATGTGTTTAATCTGAAACCAGGCATCTGTTCAACTTTTCGTGATCGCTTTTGTTGGCATCACTATTAAGCCTTCGTCTACATGGGCATTTGGCATCGGGAGGCCACCAGTGGAAGAGGTCGAGTGATATATTGGGCAGTGAGTTTGATTGGTCGGGGGATTTATACGCAAAATGGTTCGAAATATGTCAGAATTAAAACGTAAAAAGTGATAACTAATTGAATCTATTAGAATCAAAAATAACGTATTCTAATCGTTAAAATCACTTAATTAATATGTAACATTGTGAATTTAAAGAGAAAGATCAATTGTTGCCGTAAACAGGAATCGTATTCGGTCTCTTTTTATTTGGATTATAAATCAATGGGTTATGTGTTTCCCCTCGAAATTCCTCGAAATTTCCTCGAATTTCTGTATTCCGGTCTTTTTGGTTATATCACATCCAAATCCAGTTTAACATTTCTTTTACAACAAAATCAGAGCATCACGTAAGCTTTATTATCGCGTTCATCGAGATAGAGTTTCGTGGTGTTCTCTGAGGTGTGGCCCAGGAGTTTTTGAGCGAATTCCTCGCCGCGTTCGTCTTTGTACAACCGCCCGAGCAGGCTACGGATCTCGTGAAATGTCGGTGGGTTATCACTACATTTAACGTCTGAAATTTTTCTGGCTTTTACAAATTTCTTTGTCAGCCCATCCGGGTGAATATTCCCGGTCGGGCTATTTTTCCTGATTCCGGCACTGATCATGAAATCGGTTCTGCTTACCAGTCGGCAGCGATCGATAACCGTCCCGGACGTAACCCTGGCGCCTGAAGGGTGTCTGAAATATTCCGGCGACCTGGTACGTGTTACGCAGATCAT
Protein sequences of DBSCAN-SWA_3 >CP028151|1187776:1243509|1217568_1219620_+|AWP49849.1|DBSCAN-SWA MLPKANRIPYAMTVHGDTRIDNYYWLRDDTRSQPEVLDYLHQENEYGRKVMSSQQALQDRILKEIIDRIPPREVSAPYVKNGYRYRYIYEPGCEYAIYQRQSALSEEWDVWETLLDANQRAAHSEFYTLGGLAITPDNTIMALAEDYLSRRQYGLRFRNLESGNWYPELLDNVAPEFVWANDSLTLYYVRKHKKTLLPYQVWRHTIGTPSSQDELVYEEKDDTFYVSLHKTTSQHYVVIHLASATTSEVLLLDAELADAEPFSFLPRRKDHEYSLDHYQHKFYLRSNRNGKNFGLYRTRVRNENAWEELIPPREHIMLEGFTLFTDWLVVEERQRGLTSLRQINRKTREVIGIAFDDPAYVTWLAYNPEPETSRLRYGYSSMTTPDTLFELDMDTGERRVLKQTEVPGFDSGCYQSEHLWITARDGVEVPVSLVYHQKYFRKGQNPLLVYGYGSYGSSIDADFSSSRLSLLDRGFVYAIVHVRGGGELGQQWYEDGKFLKKRNTFNDYLDACDALLKLGYGSPSLCYGMGGSAGGMLMGVAINERPELFHGVIAQVPFVDVLTTMLDESIPLTTGEFEEWGNPQDIEYYDYMKSYSPYDNVKAQDYPHLLVTTGLHDSQVQYWEPAKWVAKLRELKTDQRLLLLCTDMDSGHGGKSGRFKSYEGVALEFAFLIGLAQGTLHSA >CP028151|1187776:1243509|1228835_1229063_+|AWP49862.1|DBSCAN-SWA MKMTWFQHPVCTTEEADELVAGYRRRGVKVERYGEAEVLELESNNTPQRWTVEELKEIRIAALADLRALKKLEAA >CP028151|1187776:1243509|1203231_1203987_-|AWP49834.1|DBSCAN-SWA MTSLVSLENVSVSFGQRRVLSDVSLELSPGKILTLLGPNGAGKSTLVRVVLGLVAPDEGVIKRNGQLRIGYVPQKLYLDTTLPLTVNRFLRLRPGTQKTDILPALKRVQAGHLIDAPMQKLSGGETQRVLLARALLNRPQLLVLDEPTQGVDVNGQVALYDLIDQLRRELDCAVLMVSHDLHLVMAKTDEVLCLNHHICCSGAPEVVSMHPEFISMFGPRGAEQLGIYRHHHNHRHDLQGRIALRRGNGHS >CP028151|1187776:1243509|1233431_1233911_+|AWP53042.1|lysis|DBSCAN-SWA MFVGLLLVSLIVAGRLANHYRNNAITYKEQRDTVTHRLTLANATITDMTKRQRDVAALDEKYTKELADAKAENDALRDDVAAGRRRLYVNATCPAVPTGKSTSTARMDNAASPRLADSAQRDYFALKERVKTMQKQLEGAQAYIRTQCHGNAGKTSNQW >CP028151|1187776:1243509|1198880_1199480_-|AWP49830.1|DBSCAN-SWA MKKHYSVQYETGDIVFTCIGAALFGQISTASQCWSNHVGIIIGHNGDDYLVAESRVPLSTVTTLSLFIQRSAGQRYAVRRLRGGLTVEQKLAIMEQVPARLNKFYHTGFKYESSRQFCSKFVFDIYKEALCIPVGDIETFEELLHSNPDAKLTFWKFWFLGSIPWDRKTVTPASLWHHPNLELISACGIETPQREAEGE >CP028151|1187776:1243509|1221142_1221373_-|AWP49852.1|DBSCAN-SWA MKTNLAQLEQAEMDKVNVDLAAAGVAFKERYNMPVVAEAVEREQPEHLRAWFRERLIAHRLASVSLSRLPYEPKVK >CP028151|1187776:1243509|1209071_1209941_-|AWP49840.1|DBSCAN-SWA 
MNMLEKVQSQLEHLSKSERKVADVILAAPGRSIHLSIAMLAQEANVSEPTVNRFCRSMNTRGFPDFKLHLAQSLANGTPYVNRNVDEDDSVEAYTGKIFESAMASLDHVRQSLDKSAVNRAVDLLTQAKKIAFFGLGSSAAVAHDAMNKFFRFNVPVIYSDDIVLQRMSCMNCSDDDVVVLISHTGRTKSLVELAQLARENDAMVIALTSAGTPLAREATLAITLDVPEDTDIYMPMVSRLAQLTVIDVLATGFTLRRGAKFRDNLKRVKEALKESRFDKELLIKSDDR >CP028151|1187776:1243509|1191317_1192289_-|AWP49822.1|tRNA|DBSCAN-SWA MIEFGNFYQLIAKNHLSHWLETLPAQIAAWQREQQHGLFKQWSNAVEFLPEMTPWRLDLLHSVTAESETPLSEGQLKRIDTLLRNLMPWRKGPFSLYGVDIDTEWRSDWKWDRVLPHLSDLTGRTILDVGCGSGYHLWRMIGAGAHLAVGIDPTQLFLCQFEAVRKLLGNDQRAHLLPLGIEQLPALKAFDTVFSMGVLYHRRSPLEHLWQLKDQLVNEGELVLETLVVDGDENTVLVPGDRYAQMRNVYFIPSAPALKKWLEKCGFIDVRIADVCVTTTEEQRRTEWMVTESLADFLDPNDRSKTVEGYPAPQRAVLIARKP >CP028151|1187776:1243509|1222777_1223131_+|AWP49855.1|DBSCAN-SWA MKKILLPALLLATSGVALAAPQVITVSRFEVGKDKWAFNREEVMLTCRPGQALYVINPSTLVQYPLNAIAEQQVAEGKTRAQPIAVIRIDNPAKPGEKMSLAPFIERAQKLCDPSNS >CP028151|1187776:1243509|1232961_1233414_+|AWP49866.1|DBSCAN-SWA MMRISEKGITLIKEFEGCSLTAYPDPGTGGDPWTIGYGWTHSVDGKPVKPGMMIDEATAERLLNTGLVGYENDVSRLVKVKLTQGQFDALVSFAYNLGARTLSSSTLLRKLNAGDYAGAADEFLRWNKAGGKVLNGLTRRREAERALFLS >CP028151|1187776:1243509|1197098_1197551_+|AWP49827.1|DBSCAN-SWA MKDKVYKRPVSVLVVIFAQDTKRVLMLQRRDDPDFWQSVTGSIEEGETALQAAVREVKEEVTIDVAAEQLTLIDCQRTVEFEIFSHLRHRYAPGVMHNTEFWFCLALPHERQVIFTEHLTYQWLDAPDAAALTKSWSNRQAIEEFVINVA >CP028151|1187776:1243509|1205025_1206345_+|AWP49836.1|DBSCAN-SWA MQQIARSVALAFNNLPRPHRVMLGSLTVLTLAVAVWRPYVYHPESAPIVKTIELEKSEIRSLLPEASEPIDQAAQEDEAIPQDELDDKTAGEVGVHEYVVSTGDTLSSILNQYGIDMSDISRLAASDKELRNLKIGQQLSWTLTADGDLQRLTWEVSRRETRTYDRTANGFKMSSEMQQGDWVNSLLKGTVGGSFVASAKEAGLTSSEISAVIKAMQWQMDFRKLKKGDEFSVLMSREMLDGKREQSQLLGVRMRSDGKDYYAIRAADGKFYDRNGVGLAKGFLRFPTAKQFRISSNFNPRRLNPVTGRVAPHRGVDFAMPQGTPVLSVGDGEVVVAKRSGAAGYYIAIRHGRTYTTRYMHLRKLLVKPGQKVKRGDRIALSGNTGRSTGPHLHYEVWINQQAVNPLTAKLPRTEGLTGSDRREYLAQVKEVLPQLRFD >CP028151|1187776:1243509|1193517_1194336_-|AWP49825.1|DBSCAN-SWA 
MIYIGLPQWSHPKWARLGITSLEEYARHFNCVEGNTTLYALPKAEIVDRWYAQTTDDFRFCFKFPATISHQAALRHCDDLVQAFFTRLAPLETRIGQYWLQLPAAFGPRDLPALWQFLDALPATFTYGVEVRHPCFFDKGEDEQRLNRGLHARGVNRVILDSRPVHAAHPHSEAVRDAQRKKPKVPVHAVVTASHPMVRFIGSDNMAQNREFFAAWLQKLPQWRQTTTPFLFLHTPDIAQAPELVNTLWHDLRSVLPEIGTAPSIPQQSSLF >CP028151|1187776:1243509|1204050_1205010_+|AWP49835.1|DBSCAN-SWA MISRIMLQKNTLLFAALSAALWGSATQAADAAVVASLKPLGFIASAIADGVTDTQVLLPDGASEHDYSLRPSDVKRLQGADLVVWVGPEMEAFMEKSVRNIPDNKQVTIAQLADVKPLLMKGADDDEDEHAHTGADEEKGDVHHHHGEYNMHLWLSPEIARATAVAIHEKLVELMPQSRAKLDANLKDFEAQLAATDKQVGNELAPLKGKGYFVFHDAYGYYEKHYGLTPLGHFTVNPEIQPGAQRLHEIRTQLVEQKATCVFAEPQFRPAVVEAVARGTSVRMGTLDPLGTNIKLGKTSYSAFLSQLANQYASCLKGD >CP028151|1187776:1243509|1219656_1220355_-|AWP49850.1|DBSCAN-SWA MLRIIDTETCGLQGGIVEIASVDVIDGNIVNPMSHLIRPDRPITPQAMAIHRITEAMVADKPWIEDVIPLYYGSEWYVAHNASFDRRVLPELPGEWICTMKLSRRLWPGIKYSNMSLYKSRKLSVQTPPGLHHHRALYDCYITAALLIDIMRTTGWTAEEMVNITGRPALLTTFPFGKYRGKAVSEVAKRDPGYLRWLFNNLDNMSPELRLTLKHYLEDVQAGEQRSNGTPQ >CP028151|1187776:1243509|1227870_1228176_+|AWP49860.1|DBSCAN-SWA MMIINSLLYIHVENEDALKMLKFVIYGDTVKNLNKTFTCKYAVIRRDDMTVIAEMDFFPDCNRSLMYRDGRYVRFLPLLQNDIMGSDTLINELTIRAGYHE >CP028151|1187776:1243509|1225259_1227239_+|AWP49858.1|DBSCAN-SWA MRLKLIVKSFALAGLLSSTALTPLFAQEAPKGATASTKQANDALYNQLPFSDNTDFTNAHKGFIAGLPEEVIKGEQGNVIWNPQQYAFIKEGEKSPDTVNPSLWRQSQLINISGLFEVTDGVYQIRNLDLSNMTIIEGKEGITVVDPLVSAETAKAGMDLYFKNRGNKPVVAIIYTHSHVDHYGGVRGVVDEADVKSGKVKVYAPAGFMEAAVAENIMAGNVMSRRASYMYGNLLKPDASGQVGAGLGTTTSAGTVTLIAPTNIIDKDGQKEVIDGLTYDFMLAPGSEAPSEMLWFIEEKKLIEAAEDVTHTLHNTYSLRGAKIREPLPWSKYINEAIVRWGDKAEIIMAQHHWPTWGNENVVGLLKSQRDLYRYINDQTLRMANEGLTRDEIAANFKLPDSLAKTWANRGYYGSISHDVKATYVLYLGWFDGNPATLDELPPEEAAKKFVEYMGGADAILQKAKADFDQGNYRWVAQVVSKVVFADPNNQNARNLEADALEQLGYQAESGPWRNFYLTGAQELRNGVVKGPTPNTASPDTVRAMTPEMFFDFLAVHINGEKAGNARAVFNIDLGSDGGKYKLELENGVLNHTANAEAKDADATITLNRDTLNKIILKEETLKQAQDKGEVNVTGNAAKLDEMLGYMDKFEFWFNIVTP >CP028151|1187776:1243509|1241380_1242181_-|AWP49870.1|DBSCAN-SWA 
MIINVGHVATEKFDTTGVSACNNQQSTSVDLSSCIINAASECTVFVRTIIKKSDSYDAMQIDRLKIDTQEKYDALNLQREHHKKFLQTIKANSFDRAMAERPENPISLSLIYSSIDSIKYNTGNCADMSLILGAIIAKYIPQRLTGIGFSKNNVFDARISTSLMYNSASGGNHVVVFLTFTDSKGISEYILDPWLDARIFKKEESYEIYKNNSSEYINENHCFEAYDKYTAIMNSAEYIDAITKTINLLYRVNLDEIQLTNPFKFI >CP028151|1187776:1243509|1193069_1193465_-|AWP49824.1|DBSCAN-SWA MVSALYAVLGALLLMKFSFNVVRLRMQYRVAYGDGGFSELQSAIRIHGNAVEYIPVALVLLLFMEMNGAETWMVHICGIILIAGRLMHYYGFHHRLFRWRRAGMSATWCALLLMVLANLWYMPWELVFSLY >CP028151|1187776:1243509|1224567_1224846_-|AWP49857.1|DBSCAN-SWA MTMSFVRLETWGELNYPDDPPPLTTLRRWARNGNIYPTPVLHGRTYRVDPDAFYIKPNKVGLVLEQHHPNGRTGKPSALLEKLISESKKVRC >CP028151|1187776:1243509|1235428_1236124_+|AWP49868.1|tail|DBSCAN-SWA MQDIPQETLSETTKAEQSAKVDLWEFDLTAIGGERFFFCNEPNEKGEPLTWQGRQYEPYPIQVQDFEMNGKGASPRPNLVVANLFGLVTGMAEDLQSLVGASVVRHQVYSKFLDAVNFSNGNPGADPEQEAVARYNVEQLSELDSSTATIILASPAETDGSVVPGRTMLADSCPWDYRDENCGYDGPPVADEFDKPTSDPKKDKCSHCMKGCEMRNNLVNAGFFASINKLS >CP028151|1187776:1243509|1221510_1221885_+|AWP49853.1|DBSCAN-SWA MASSAPSRRLALLLLASTFATPAAWAHAHLTHQYPAANAAVTASPQALTLNFSEGIEPGFSGATITGPQQELIKTRPAKRNEQDKTQLIIPLEQPLKSGAYTVDWHVVSVDGHKTKGKYTFSVK >CP028151|1187776:1243509|1228239_1228839_+|AWP49861.1|DBSCAN-SWA MAHLQLVKQTSSGLLLPATPESGDFLRSVKIGEWIHADFKRVRNYAFHKRFFKLLQLGFDYWMPTGGTVTSREQKLISGFVNFLCDSAGQEYTPALNEAAEQYLHNVATLRTGDVALLKSFDAFREWVTVQAGFYTEHFYPDGSRGRRAKSIAFASMDETEFQQVYKAVLNVLWNWILFRKFSSLEEVENVAAHLLEFA >CP028151|1187776:1243509|1223513_1224593_-|AWP49856.1|integrase|DBSCAN-SWA MSRKKYDANLPRNLTYRKASKSFFWRNPLTDKEFPLGQIARRDAITQAIEANNFIAQNHTPVALIEKLKGTDSFTVSAWIDRYEVLLQRRSLSVNTYKIRGNQLATVREKMGEIILAEVTTRHIAKFLESWITEGKNTMAGAMRSVLSDMFREAIVEGHIVKNPVEATRIPEIKVARERLQLETYNATRAAAEHMPAWFPLAMDLALVTGQRREDIVNMKFSDVFDNRLYVTQIKTGMKIAIPLSLTLRATGLRLGTVIDRCRLVSRTDFMISAGIRKNSPTGNIHPDGLTKTFVKARKASGVNFSNNPPTFHEIRSLAGRLYKNEHGEVFAQKLLGHTSANTTKLYLDERDDKAYMML >CP028151|1187776:1243509|1229192_1229882_+|AWP49863.1|DBSCAN-SWA 
MNTQYLEYVRQQLIVATADLSGATKGQLQAWLENAQLYTKNYPRKKQRIRDEVTGKMITLNNPPIAGKQSLAKGSAIPLVQPVEYSTSSWRRALLSLEEHNKAWLLWNYSENTCWEYQVTVTRWAWEKFSQQLEGKRVAKKTLARLRQLIWLAAQDVKAELARRETYEYQTLAELMGVAKSTWTETYMSHWLVMRNSFKRLDSDALISVTRSRSQQKATNLDISLAKPN >CP028151|1187776:1243509|1198357_1198879_+|AWP49829.1|DBSCAN-SWA MSIILGIDPGSRITGYGVIRQVGRQLTYLGSGCIRTKVDDLPSRLKLIYAGVTEIITQFQPDYFAIEQVFMAKNADSALKLGQARGVAIVAAVNQELPVFEYAARQVKQTVVGIGSAEKSQVQHMVRTLLKLPANPQADAADALAIAITHCHVSQNAMQMSESRLNLARGRLR >CP028151|1187776:1243509|1240178_1240901_-|AWP49869.1|DBSCAN-SWA MTKITLFPHNFRIQKQETTPLKEKSTEKNSLAKSILAVKNHFIKLNSKLSERFISHKNTESSATHFHRGSASEGRAVLTNKVVKNFMLQTLHDIDIRGSASKDPAYASQTREAILSAVYSKYKDQYCNLLISKGIDIAPFLKEIGEAAQNAGLPGATKNDVFTPSGAGANPFITPLITSAYSKYPHMFTSQHQKASFNIYAEKIIMTEVVPLFNECAMPTPQQFQQILENIANKYIQNTP >CP028151|1187776:1243509|1232648_1232978_+|AWP53041.1|holin|DBSCAN-SWA MNDKTPEFWAAVLTGLKNAWPQILGALMAGLIAYGRLIYDGATRKNKWLEGVLCGALSLCVTSALDVVGLPVSISPFVGGIIGFVGVDKLREIAISALKKRAGVNDENQ >CP028151|1187776:1243509|1214582_1215761_-|AWP49845.1|DBSCAN-SWA MTLLGTALRPAATRVMLLGAGELGKEVAIECQRLGIEVIAVDRYPDAPAMHVAHRSHVINMLDGEALRHVITEEKPHYIVPEIEAIATDTLRELEDEGLNVVPCARATQLTMNREGIRRLAAEELGLPTSTYRFADSEASFHDAVAAVGFPCIVKPVMSSSGKGQSFIRSAEQLAQAWEYAQQGGRAGAGRVIVEGVVKFDFEITLLTVSAVDGVHFCAPVGHRQQDGDYRESWQPQQMSELALKRAQEIARHVVLALGGHGLFGVELFVCGDEVIFSEVSPRPHDTGMVTLISQDLSEFALHVRAFLGMPIGAIRQYGPAASAVILPQLTSQNVTFDNVHTAVGAGVQVRLFGKPEIDGSRRLGVALATGENVEEAVIRAKKAASRVTVKG >CP028151|1187776:1243509|1187776_1189510_-|AWP49818.1|tRNA|DBSCAN-SWA 
MNIQALLSEKVSQAMIAAGAPADCEPQVRQSAKVQFGDYQANGMMAVAKKLGMAPRQLAEQVLTHLDLSGIASKVEIAGPGFINIFLEPAFLAEQVQQALTSDRLGVSQPTRQTIVVDYSAPNVAKEMHVGHLRSTIIGDAAVRTLEFLGHHVIRANHVGDWGTQFGMLIAWLEKQQQENAGDMALADLEGFYRDAKKHYDEDEAFAERARNYVVKLQSGDTYFREMWRKLVDITMTQNQITYDRLNVTLTRDDVMGESLYNPMLPGIVADLKAKGLAVESEGATVVFLDEFKNKEGDPMGVIIQKKDGGYLYTTTDIACAKYRYETLHADRVLYYIDSRQHQHLMQAWTIVRKAGYVPDSVPLEHHMFGMMLGKDGKPFKTRAGGTVKLADLLDEALERARRLVAEKNPDMSADELEKLANAVGIGAVKYADLSKNRTTDYIFDWDNMLAFEGNTAPYMQYAYTRVLSVFRKADIDEQALASAPVIISEDREAQLAARLLQFEETLTVVAREGTPHVMCAYLYDVAGLFSGFYEHCPILSAENDAVRNSRLKLAQLTAKTLKLGLDTLGIETVERM >CP028151|1187776:1243509|1206461_1207433_+|AWP49838.1|DBSCAN-SWA METKKNNSEYIPEFEKSFRYPQYWGAWLGAAAMAGIALTPASFRDPLLATLGRFAGRLGKSSRRRALINLSLCFPQRSEAEREAIVDEMFATAPQAMAMMAELAMRGPKKIQQRVDWEGLEIIEEMRRNDEKVIFLVPHGWGVDIPAMLMASQGQKMAAMFHNQGNPVFDYIWNTVRRRFGGRLHARNDGIKPFIQSVRQGYWGYYLPDQDHGPEHSEFVDFFATYKATLPAIGRLMKVCRARVIPLFPVYNGKTHRLTIQIRPPMDDLLTADDHTIARRMNEEVEIFVGPHPEQYTWILKLLKTRKPGEIQPYKRKDLYPIK >CP028151|1187776:1243509|1234805_1235339_-|AWP49867.1|DBSCAN-SWA MKYTILSLVAGALISCSAMAENTLTVKMNDALSSGTGENIGEITVSETPYGLLFTPHLNGLTPGIHGFHVHTNPSCMPGMKDGKEVPALMAGGHLDPEKTGKHLGPYNDKGHLGDLPGLVVNADGTATYPLLAPRLKSLSELKGHSLMIHKGGDNYSDKPAPLGGGGARFACGVIEK >CP028151|1187776:1243509|1202449_1203235_-|AWP49833.1|DBSCAN-SWA MIELLLPGWLAGMMLACAAGPLGSFVVWRRMSYFGDTLAHASLLGVAFGLLLDVNPFYAVIAVTLLLAAGLVWLEKRPHLAIDTLLGIMAHSALSLGLVVVSLMSNVRVDLMAYLFGDLLAVTPEDLISIAIGVVIVLAILFWQWRNLLSMTISPDLAFVDGVKLQRVKLLLMLVTALTIGVAMKFVGALIITSLLIIPAATARRFARTPEQMAGVAVGVGMIAVTGGLTFSAFYDTPAGPSVVLCAALLFIFSMMKKQAS >CP028151|1187776:1243509|1223188_1223308_+|AWP53039.1|DBSCAN-SWA MITDKTPGFFNHLRRKLENLASSSILKGQGDLACINANF >CP028151|1187776:1243509|1190335_1191082_+|AWP49820.1|DBSCAN-SWA MALLEICCYSMECALTAQRNGADRIELCAAPKEGGLTPSLGVLRSVREHITIPVHPIIRPRGGDFYYTDGEFAAMLEDIRLVRELGFPGLVTGVLTVDGDVDMSRMEKIMAAAGPLAVTFHRAFDMCANPFNALKNLADAGVARVLTSGQKADAAQGLSIIMELIAQGDAPTIMAGAGVRANNLQNFLDAGVREVHSSAGVLLPSPMRYRNQGLSMSADIQADEYSRYRVEGAAVAEMKGIIVRHQAK 
>CP028151|1187776:1243509|1210282_1211758_+|AWP49842.1|DBSCAN-SWA MAVTQTAQACDLVIFGAKGDLARRKLLPSLYQLEKAGQIHPDTRIIGVGRADWDKEAYTHVVREALETFMKEKIDEGLWDTLSGRLDFCNLDVNDTPAFSRLGDMLDQKNRTTINYFAMPPSTFGAICKGLGEAKLNAKPARVVMEKPLGTSLATSREINDRVGEYFEECQVYRIDHYLGKETVLNLLALRFANSLFVNNWDNRTIDHVEITVAEEVGIEGRWGYFDQAGQMRDMIQNHLLQILCMIAMSPPSDLSADSIRDEKVKVLKSLRRIDRSNVREKTVRGQYTAGFAQGQKVPGYLEEEGANKSSNTETFVAIRVDIDNWRWAGVPFYLRTGKRLPTKCSEVVVYFKTPELNLFKESWQDLPQNKLTIRLQPDEGVDIQVLNKVPGLDHKHNLQITKLDLSYSETFNQTHLADAYERLLLETMRGIQALFVRRDEVEEAWKWVDSITEAWAMDNDAPKPYQAGTWGPVASVAMITRDGRSWNEFE >CP028151|1187776:1243509|1220378_1221035_-|AWP49851.1|DBSCAN-SWA MSSWKIAAAQYAPLNASPAEHVAHHLEYIELAARQQCELLVFPSLSLLGCDERNKSLPAPPDEALLQPLTHAADTHHMTIIVGMPVEHNCRFVKGIAIFAPWLTSPLMFHKSHGACIARQRSAINVVDEQPEGGDIDPSFTLFTTSQCLNEPELHASTSRLQRFSHKYALAVLMANACGSSALWDESGQLIVRADCGSLLLTGLRTTEGWQGDIIPLR >CP028151|1187776:1243509|1243125_1243509_-|AWP53043.1|integrase|DBSCAN-SWA MICVTRTRSPEYFRHPSGARVTSGTVIDRCRLVSRTDFMISAGIRKNSPTGNIHPDGLTKKFVKARKISDVKCSDNPPTFHEIRSLLGRLYKDERGEEFAQKLLGHTSENTTKLYLDERDNKAYVML >CP028151|1187776:1243509|1231686_1232373_-|AWP49865.1|protease|DBSCAN-SWA MPTGIKPIFINNMMSTYGLSHPHDSKVFPDLPEHQDNPSQLRLQHDGLATDDKARLEPMCLAEYLISGPGGMDPDIEIDDDTYDECREVLSRILEDAYTQSGTFRRLMNYAYDQELRDVEQRWLLGAGENFGTTVTDEDLESSEGRKVIALNLDDTDDDSIPEYYESNDGPQQFDTTRSFIHEVVHALTHLQDKEDSNPRGPVVEYTNIILKEMGHTSPPRIAYEFSN >CP028151|1187776:1243509|1227256_1227448_-|AWP49859.1|DBSCAN-SWA MNHATTSYCHLMKKAVHNRIRTDWYQGHVLQQQIHKMKQDIKNEGKIEYKKRTELSEVTSLQH >CP028151|1187776:1243509|1200740_1201352_+|AWP49831.1|DBSCAN-SWA MIGRLRGIILEKQPPIVLLETGGVGYEVHMPMTCFYELPEAGQEAIVFTHFVVREDAQLLYGFNNKQERTLFKELIKTNGVGPKLALAILSGMSAQQFVNAVEREELGALVKLPGIGKKTAERLIVEMKDRFKGLHGDLFTPAVDLVLTSPASPTSEDAEQEAVAALVALGYKPQEASRMVSKIARPDASSETLIRDALRAAL >CP028151|1187776:1243509|1213841_1214483_+|AWP49844.1|DBSCAN-SWA 
MKNWKTSAEAILTTGPVVPVIVVNKLEHAVPMAKALVAGGVRVLEVTLRTACAMDAIRAIAKDVPEAIVGAGTVLNPQQLAEVTEAGAQFAISPGLTEPLLKAATAGTIPLIPGISTVSELMLGMDYGLKEFKFFPAEANGGTKALQAIAGPFSQVRFCPTGGISPANYRDYLALKSVLCIGGSWLVPADALEAGDYDRITKLAREAVEGAKQ >CP028151|1187776:1243509|1211992_1213804_+|AWP49843.1|DBSCAN-SWA MNPNLLRVTQRIVERSQQTREAYLARIEQAKTATVHRSQLACGNLAHGFAACQPEDKASLKSMLRNNIAIITSYNDMLSAHQPYEHYPQIIRQALHSVNAVGQVAGGVPAMCDGVTQGQDGMELSLLSREVIAMSAAVGLSHNMFDGALFLGVCDKIVPGLAMAALSFGHLPAIFVPSGPMASGLPNKEKVRIRQLYAEGKVDRMALLESEAASYHAPGTCTFYGTANTNQMVVEFMGMQLPGSSFVHPDAPLREALTAAAARQVTRLTGNGNTWMPLGKMIDEKVVVNGIVALLATGGSTNHTMHLVAMARAAGILINWDDFSDLSEVVPLMARLYPNGPADINHFQAAGGVPVLMRELLNAGLLHEDVNTVAGFGLKRYTLEPWLNNGELDWREGAERSLDNDVIASFDKPFSPHGGTKVLSGNLGRAVMKTSAVPVENQIIEAPAMVFESQHDVLPAFDAGLLDRDCVVVVRHQGPKANGMPELHKLMPPLGVLLDRRFKIALVTDGRLSGASGKVPSAIHVTPEAYDGGLLAKVRDGDIIRVNGQTGELTLLVDEAELAARQPHIPDLSASRVGTGRELFGALREKLSGAEQGATCITF >CP028151|1187776:1243509|1230876_1231326_-|AWP49864.1|DBSCAN-SWA MKNTALGKFIFIVGTALLLGGCSGMVMPPYATHGTSVGIIAPAGGYSEWHTDSRNHTTGDSHSQSQGNCTQSEDSQLSENSLTRTHQSNCNTRSQTHSSSTSKTRSSSVGFSVGGPVGASIGLIKQMESMNRAPANDMSSNEMFKNFGF >CP028151|1187776:1243509|1192285_1193029_-|AWP49823.1|DBSCAN-SWA MSHRDTLFSAPIARLGDWTFDERVAEVFPDMIQRSVPGYSNIISMIGMLAERFVQPNTQVYDLGCSLGAATLSVRRNIRHEHCRIIAVDNSPAMIERCRRHIDAYKAPTPVEVVEGDIRDITIENASMVVLNFTLQFLEPAERQALLDKIYLGLNPGGALVLSEKFSFEDAKVGELLFNMHHDFKRANGYSELEISQKRSMLENVMLTDSVETHKARLRKAGFEHSELWFQCFNFGSLVALKAGVAA >CP028151|1187776:1243509|1206235_1206463_-|AWP49837.1|DBSCAN-SWA MLFQGLLTRKGIVNSVATSAPHTKQKSRHTSRTGSVSAFVNRSAVVAEPLSPVPGIHDDLTPSDLPCAAILLLEG >CP028151|1187776:1243509|1221885_1222761_+|AWP49854.1|DBSCAN-SWA MMLTFVWITLRFIHFASLMLVYGCALYGAWLAPASIRRLMTRRFLHLQRHAAAWSVISAAFMLAIQGGLMGGGWPDVFSVSVWGAVLQTRFGAVWIWQIILALVTLAVVVIAPVKMQRRLLILTVAQFILLAGVGHATMRDGVVGTLQQINHALHLLCAAAWFGGLLPVVYCMRMAQGRWRQHAISAMMRFSRYGHFFVAGVLLTGIGNTLFITGFTAIWQTTYGQLLLLKCALVVLMVAIALTNRYVLVPRMRQENPRTDLWFVRMTQIEWGVGGIVLAIVSLFATLEPF >CP028151|1187776:1243509|1189746_1190316_+|AWP49819.1|DBSCAN-SWA 
MANWQHIDELHDISADLPRFTLAFRELSTRLGLQISALEADHISLRCHQNTTAERWRRGFEQCGELLSENIINGRPICLFKLHEPVCVEHWRFSVIELPWPGEKRYPHEGWEHIEIVLPGEPETLNARALALLSDEGLSQPGIVVKTSSPQGEHERLPNPTLAVTDGRITVKFHPWSIEAIVASEQAAH >CP028151|1187776:1243509|1216249_1216603_+|AWP49847.1|DBSCAN-SWA MNKRGALLSLLLLSASVSAFAASTESKSVKFPQCEGLDAAGIAASVKRDYQQNRIVRWADDQKKVGQADPVAWVNVQDVVGQNDKWTVPLTVRGKSADIHYQVIVDCKAGKAEYKPR >CP028151|1187776:1243509|1229978_1230503_+|AWP53040.1|DBSCAN-SWA MFYRDLFQVFGPDPLYKEEEGIAILREQYGIEAPEQIFKQIYCGLSNNSEFQTLYGHLNLKSLKWDLVRLKTAEFTKFGRNATYPDYMLEISEDFNACGSKFCIDAREEVANHWLKFGTWAEPPMFIERSLIIPGESGLHLMEGHTRLGTLLGAIKYKFVQLADTHELYIASQK >CP028151|1187776:1243509|1195223_1196996_+|AWP49826.1|tRNA|DBSCAN-SWA MRTEYCGQLRLSHVGQQVTLCGWVNRRRDLGSLIFIDMRDREGIVQVFFDPDRADALKLASELRNEFCIQVTGTVRARDAKNVNADMATGEIEVLASSLTIINRADSLPLDANHVNTEEARLKYRYLDLRRPEMAQRLKTRAKITSLVRRFMDDHGFLDIETPMLTKATPEGARDYLVPSRVHKGKFYALPQSPQLFKQLLMMSGFDRYYQIVKCFRDEDLRADRQPEFTQIDVETSFMTAPQVREVMEALVRHLWLEVKGVDLGDFPVMTFAEAERRYGSDKPDLRNPMELVDVADLLKSVEFAVFAGPANDPKGRVAALRVPGGAQLSRKQIDDYGNFVKIYGAKGLAYIKVNERAKGLDGINSPVAKFLTADIVDAILERTGAQDGDMIFFGADNKKVVADALGALRLKLGKDLSLTDEDKWAPLWVIDFPMFEDDGEGGLTAMHHPFTAPRDMTASELKTAPEEAVANAYDMVINGYEVGGGSVRIHNGEMQQTVFGILGINEQEQREKFGFLLDALKYGTPPHAGLAFGLDRLTMLLTGTDNIRDVIAFPKTTAAACLMTEAPSFANQAALTELGIQVVKKAENN >CP028151|1187776:1243509|1207505_1208948_-|AWP49839.1|DBSCAN-SWA MSRRLRRTKIVTTLGPATDRDNNLEKVIAAGANVVRMNFSHGSPEDHKMRADKVREIAAKLGRHVAILGDLQGPKIRVSTFKEGKVFLNIGDKFLLDANLGKGEGDKEKVGIDYKGLPADVVPGDILLLDDGRVQLKVLEVQGMKVFTEVTVGGPLSNNKGINKLGGGLSAEALTEKDKADIQTAALIGVDYLAVSFPRCGEDLNYARRLARDAGCDAKIVAKVERAEAVCDQNAMDDIILASDVVMVARGDLGVEIGDPELVGIQKALIRRARQLNRAVITATQMMESMITNPMPTRAEVMDVANAVLDGTDAVMLSAETAAGQYPSETVAAMARVCLGAEKIPSINVSKHRLDVQFDNVEEAIAMSAMYAANHLKGVTAIITMTESGRTALMTSRISSGLPIFAMSRHERTLNLTALYRGVTPVHFDSAADGVVAAHEAVNLLRDKGYLVSGDLVIVTQGDVMSTVGSTNTTRILTVE >CP028151|1187776:1243509|1199706_1200381_+|AWP53038.1|DBSCAN-SWA 
MRIGWVFYFSYIFVAIIFHLFISFCYCLSIDMAEANTVVFLLKPGTISLLFLLLPARRFRTRLLATLSSVFITLVFNQWHLVAGNKELVLCLQAACFMAFLAMTSVKKSGWMISASLFLVCAAGTIRQCWLEQLFNAADIYIVDDGRSSGASGHCFQYIAAKGRGLAAKRQALFSSEEYVNIYYSYSEGMPAVNFDGMKNEFVQYLLCHGELKSVARDDKTTCD >CP028151|1187776:1243509|1191059_1191269_+|AWP49821.1|DBSCAN-SWA MFAIRPNDFYRCIMSPNMMLARTRPLPIQQGPFFLLHISSRSSVGRLAHPGHVATYVPGDLLRCRLAAT >CP028151|1187776:1243509|1197580_1198321_+|AWP49828.1|DBSCAN-SWA MAGHSKWANTRHRKAAQDAKRGKIFTKIIRELVTAAKLGGGDPDANPRLRAAVDKALANNMTRDTLNRAIARGVGGDEDSNMETIIYEGYGPGGTAIMIECLSDNRNRTVAEVRHAFSKCGGNLGTDGSVAYLFSKKGVISFEKGDEDTIMEAALEAGAEDVVTYDDGAIDVYTAWEEMGKVRDALEAAGLKADSAEVSMIPSTKADMDAETAPKLLRLIDMLEDCDDVQEVYHNGEISDEVAATL >CP028151|1187776:1243509|1201360_1202371_+|AWP49832.1|DBSCAN-SWA MIEADRLISAGATIAEDVADRAIRPKLLAEYVGQPQVRSQMEIFIQAAKRRGDALDHLLIFGPPGLGKTTLANIVANEMGVNLRTTSGPVLEKAGDLAAMLTNLEPHDVLFIDEIHRLSPVVEEVLYPAMEDYQLDIMIGEGPAARSIKIDLPPFTLIGATTRAGSLTSPLRDRFGIVQRLEFYQVPDLQHIVGRSARHMGLEMSDDGALEVARRARGTPRIANRLLRRVRDFAEVKHDGAISAEIAAQALDMLNVDAEGFDYMDRKLLLAVIDKFFGGPVGLDNLAAAIGEERETIEDVLEPYLIQQGFLQRTPRGRMATVRAWNHFGITPPEMP >CP028151|1187776:1243509|1194332_1194899_-|AWP53037.1|DBSCAN-SWA MLKLNATTTALVVIDLQEGILPFAGGPYTANEVVARAARLAEKCRANGSPVVMVRVGWSDDYAEALKQPVDAATPAHALPENWWTWPTALGKKDSDLEVTKRQWGAFYGTDLELQLRRRGIDTIILCGISTNIGVESTARNAWELGFNLIIAEDACSAASSEQHQSSMTHIFPRIGRVRSVEDILNAL >CP028151|1187776:1243509|1215891_1216182_+|AWP49846.1|DBSCAN-SWA MAVEVKYVVIREGEEKMSFTSKKEADAWDKMLDTADLLDTWLEQSPVVLEDGQREALSLWLAEHKEVLSTILKTGKLPSPQAVEKDAASKTKKQAA >CP028151|1187776:1243509|1210124_1210310_-|AWP49841.1|DBSCAN-SWA MPGLFALPPCHSPQLISWYFSQDIVVILLHSTVLLLRDYGNRKRLFMGEPAFDWQYNRQNA >CP028151|1187776:1243509|1216696_1217356_+|AWP49848.1|DBSCAN-SWA MANWLNQLQSLLGQKGASASSSGEQGLNKLLVPGALGGLAGLLVANKSSRKLLTKYGTGALLVGGGAVAGSVLWNKYKDKVRAAHQGEPQFGSQSTPLDVRTERLILALVFAAKSDGHIDAKERAAIEHQLRESGVEEQGRVFIEKAIEQPLDPQRLAQGVRNEEEALEIYFLSCAAIDIDHFMERSYLNALGDALKIPQEVRDGIEQDLQQQKQALPG |
60 | Salmonella_phage(17.39%) | integrase,tRNA,tail,protease,lysis,holin | attL 1233917:1233934|attR 1244630:1244647 |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
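The keyword rule described above can be sketched in a few lines of Python. This is only an illustration, not the site's own code: the function name `phage_keyword` is hypothetical, the keyword tuple is copied from the note (which says "such as", so the real list is probably longer; headers in this page also carry `holin`, for instance), and the header layout `contig|region|start_end_strand|protein_id|[annotation]|DBSCAN-SWA` is inferred from the FASTA headers shown on this page.

```python
from typing import Optional

# Keywords copied from the note above. The page says "such as", so this is
# an assumed, probably incomplete, list.
PHAGE_KEYWORDS = ("capsid", "head", "integrase", "plate", "tail", "fiber",
                  "coat", "transposase", "portal", "terminase", "protease",
                  "lysin", "tRNA")


def phage_keyword(header: str) -> Optional[str]:
    """Return the listed keyword carried by a FASTA header, or None.

    Assumed layout (inferred from this page, not documented):
    >contig|region|start_end_strand|protein_id|[annotation]|DBSCAN-SWA
    """
    fields = header.lstrip(">").strip().split("|")
    # Anything between the protein id (index 3) and the trailing tool tag
    # is treated as an annotation field; most headers have none.
    for annotation in fields[4:-1]:
        for keyword in PHAGE_KEYWORDS:
            if annotation.lower() == keyword.lower():
                return keyword
    return None
```

Called on a header such as `>CP028151|1187776:1243509|1243125_1243509_-|AWP53043.1|integrase|DBSCAN-SWA`, the function returns the matching keyword. Headers without an annotation field, or with one that is not in the assumed list, return `None`.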
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
1462634 : 1473421
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >CP028151|1462634:1473421|DBSCAN-SWA TGTGACCGCTTTTTCAACCCTGAATGTTTTGCCCGCCGCCCAGCTCAATAACCTTACTGAGCTGGGCTATCTTGAGATGACGCCTGTTCAGGCCGCAGCATTACCCGTCATTCTGGCGGGTAATGATGTGCGTGTGCAGGCCAGGACCGGTAGCGGCAAAACGGCGGCGTTTGGTCTTGGGCTCTTGCATCGAATTGACGTCACTCTGTTCCAGACACAGGCATTAGTGCTGTGCCCGACGCGGGAGCTGGCGGATCAGGTTGCCGGAGAGTTACGTCGCCTGGCCCGTTTTCTGCCAAATACCAAAATTCTGACCTTGTGTGGCGGGCAACCCTTTGGCGCACAGCGCGACTCGCTTCAGCACGCTCCGCATATCATTGTCGCGACGCCGGGGCGACTGCTGGATCATTTACAAAAAGAAACCGTATCGCTGGATGCGCTGCATATTCTGGTAATGGATGAAGCAGACCGAATGCTGGACATGGGATTCAGTGACGCCATTGATGAGGTGATCCGCTTTGCGCCTGCGACGCGCCAGACGTTATTGTTTTCAGCAACCTGGCCTGAGGCCATCGCGGCGATTAGCGGTCGTGTACAGCAGCAGCCAATACGTATTGAAATCGATACGGTAGATGCGCTACCGGCTATCGAACAACAGTTCTTCGAAACGTCTGCGCATGAAAAAATTTCGCTGCTACAAACGTTGCTTAGCCAGCATCAGCCAGCGTCCTGCGTGGTATTTTGCAATACCAAAAAAGATTGTCAGGCCGTTTGTGATGCGCTTAATGCGGTAGGACAAAGCGCGTTGGCGCTCCACGGCGATCTGGAACAACGCGACCGCGACCAGACGTTGGTGCGTTTTGCAAACGGTAGCGCGCGCATTCTGGTTGCCACCGACGTTGCCGCGCGAGGATTAGACATTAAATCGCTCGAACTGGTGGTTAACTATGAACTGGCCTGGGACCCGGAGGTGCATGTCCATCGTATTGGCCGTACGGCGCGCGCGGGAAGCAGCGGCCTGGCGATCAGTTTCTGCGCGCCGGAAGAGGCGCAGCGGGCGAATATTCTTTCAGAAATGCTGCAACTCAAGCTGAACTGGCTGAATGCGCCCGCCCGGCAGCCGTCACTCCCTCTGGCCGCAGAGATGGCTACCCTATGCATTGACGGCGGCAAAAAAGCGAAAATGCGTCCGGGAGATATTTTGGGCGCGCTGACCGGCGATATTGGATTAGACGGGGCGGATATTGGCAAAATTAACGTGCATCCAATGCACGTTTACGTCGCCGTACGTCAAGCAGTAGCGCAAAAAGCCTGGAAGCAGTTGCAAAACGGGAAGATCAAAGGCAAGTCATGCCGGGTACGGCTATTAAAATGATCGATAGCATGGCGTCTCAGCGGAGTGCTGAGACGCCTGCAGATTATTTCACTTCGATAACATCAAGCCGCAACGCCTCTAAGGCGGTGTCATCTTCTTCCGGCTGCCAGCCAGCGGGCTGCAAGGGAATCTCTTCACGATCGAACGCTAAATCGCCGCCGTCGACGACCTCGGAACCGTGAGTGATTCCTTTGAAATCGAACAGGTTAGTGTCACAAAGGTGAGACGGCACGACATTCTGCATGGCGCTAAACATCGTCTCGATCCGTCCAGGATAGCGCTTATCCCAGTCGCGTAGCATGTCGGCAATCACCTGGCGTTGCAGGTTTGGTTGCGAACCGCACAGATTACAAGGAATGATAGGGAAGGCTTTGGCCTCAGCAAAACGGACAATATCTTTCTCGCGGCAGTAAGCCAGCGGGCGGATCACGATATGTTTGCCGTCATCGCTCATCAGTTTCGGCGGCATCCCTTTCATTTTTCCGCCATAGAACATATTCAGAAACAGGGTTTGCAGAATATCGTCGCGATGGTGGCCCAGGGCGATTTTG
GTCGCGCCCAGTTCAGTCGCCGTACGATACAGGATACCCCGACGCAAACGCGAGCACAGCGAGCAGGTGGTTTTTCCTTCCGGAATCTTTTCTTTCACAATGCCGTAGGTGTTTTCTTCGACGATTTTATATTCTACGCCCAGCTGCTCAAGGTAGGCTGGCAGGATATGTTCCGGAAAACCTGGCTGCTTTTGATCGAGGTTGACGGCGACCAGTGAAAAATTGATCGGGGCGCTTTGCTGCAAATTACGTAAAATTTCCAGCATCGTATAGCTATCTTTGCCGCCAGAAAGGCAAACCATAATGCGATCGCCTTCTTCAATCATATTAAAATCGGCAATCGCTTCGCCAACGTTACGGCGCAGGCGCTTTTGCAACTTGTTGAGGTTGTATTGTTCTTTCTTTGTATTCTTTTGAATTTCTTGCATTATTTCAGTTCTCTGGTACTAAATGGGGCAAATTGGGGGCAAACTTTGCAACTACGATAACCGTGCATTCAACATCGCTACTTGTTCGTCGTTCATGTCATCAATCCACATACCGTAAATTTCATACACCATCTGCGCAGTTTCATGCCCCATTTGGCTGGCGATAAATGCCGGGTTCGCTCCTGCCGTCAACAGCCAGCAGGCAAAAGTATGTCGCGTATGGTACGGATTACGGCGGCGAATACCAGCACGTTTTACTGCCGCATTCCACCTTGCCCCTAAACTGCTTACCGAGTAATAAGGTTTCTGTTTTCCTTTACTCATCCTGGGCATGAACCTGTTCAGACTTCCGCAATACCATTAACAGGTTCTGGATATAGCTGTTTTCCTGTTCCATTTCGAGAACGGTAATATGCCTGCGTTGAGCGCGGGTGGCGTGATGCAAATATTTTTTGTCTGTGGCAGCGTAAGTAAAAATGTGCAACACGCGCTGGGGGAACGGCAGGGTGGCGACGGAAACTTCGCAGTCCTGACATTCGTCGTTGTCACTGGTGTCTGTCCCGCTTTCTTCGGTGCCAGCGTCGTCGGCGCATTCCTCCGTTTTTTCGTCGTGGGCGTCAGTAGCGGGCGTGCCGGGGATGAGCATAAATGTTTTGCCATCTTCGCCGCCCAGCTCATATTTCTGACAGAAGGTGGTATCAAAACTTCCCTCCGGCGGAAGTTCGTTAACAGCGGGGGAATTAACGCGAACCGGTTTTGCAAAATCTGTCGGTTCGAATCCGGCGTCAAGCATTGATACGGCAAGGCGTGCTGTGGCAGCGGCTTCGCTGCTTTCAGTTTTCCACCAGAAAGCAAAAGGGAAGCCGAGACGTTTCCGGGCGCTTTCATTTTTAACCTTAATATAATATGAGTATTCTGTTTTATTGTCGCTCATTATCATTACCCTTATTACAAATCATACTTAATGAAGACTTTCATTTTTCATTGAGCAGAATGCGTTCGTGACGAGCTCTTTACACTCCATCAGTTTCAACAATGAAATAAAATTTTCGGGAGGAGCTTCGTTCATTTTTAACAGCATTATTGCGGCTGTTATTGACCTGTCATATGCGCCTGTTTCACTGTCTGTATATGCAACAAGAGTTCTGGAAACACTTTCATCGTCACAGTCCCGGGCATAAGAAACAACACAGGGCATATTGTTTTCGATACATAACGTACGAATTTTTCCTGAAAGCTGGCGGAGCTCTGCAATTACTGATTCAGGTACATTTTTCATATAGATTCCTTTTTTCAGGTTGAGTGAATTCCTGCCATTGCAGGCATATTTAAAAACAGGATGGTTTAAACGATTACTGTTCTGTTACCATGATTCAGCTTTGGCAGGCAGACCATTTCTGTTCAGCCAGACTTTTACCATGCAATCGGTAATACATCTTTGCGTTGTTAAATCACGGATATATAAACGGCGTTTTTTAATGTTATTCGCTGAGGCGATATAAGTACGACCATCATGGATAACATAATCTCCCGGCGTAATACACTGACGCGGTATTTCATCCGTTCCGAAGT
GATGAGCAATCATAGCCCCTCCATTTCTGGTAAATAAATTTTGTGGTGCGGTGCCTGGTGCCTCCAGGTGACATTAACCAGTTAACAATTAATGCCGACTTAAACCACCCATACTGATTCAGGGAGTTTTAACTGTGCCGCGTGCGCTTAGCCGCATTCACCGCATCACAAAATTCACTTTAAAAAGGGCGGACATCAGAAAGGACTAAGAAAAACTGATGCCGCCAAGTACTACACACAGCATTATTGTCGCAGTGGCAACTACAACCGGAGGCGCACTTCCACTATTTGGATTTACAGACAAGACCGACTCAGAAAACATCAGAAATGCGCCTTCGTGTTGTGCCCGGCTTTATTTAACCACCTCCGGGCTTCGGTGGTCTCGGCTATACCCCTACAGCGAGAACCTGTGTTAACATTTCAATACCCTTACAGTTGAGAGTTATTGATATGTCAGAAACCGCTCTGGTTATCGTAAAATTCCTAATTGGTAAATCCGTCGGACAATTTATGCTCACAGTGGCTTTATTTTTCTTAATTATCATCTTCATTCCTAGAGATATTACGGAGCTTATTGAGGCGCGTAGAGATTTACCATATGCCGTTCAGATTTTTAGTTTTGCTGTGGCTTACCTTATAGTGCTGATCCTCAAAGTCACTGGTTATTTTTTCGTGTCGGCGCTGCCGTTGTGCCAGCGTAGGGGCAGGGCAAAACGCATGTTAAAAACGCTTAATTCATTGAGTACTGAACAGCTGTTTTTACTTGAACCCTTTCTTAAAACTCATTCTCCCACTTTCCGGGCGTCCTGGGATAACCCTGATGCAGATGCTCTGGTTAAGGCAGGTATCGTTCGTCCGGCTGGTTCGTGTATCGACGGTGTTTCTGTGATGTTCAAAATCGAACCCGAGTATGAGTCGTTAATGCTTTCCACCTGGAATCCCTGCACAAAACGGTTCGATATTAGCCGTTAGCTGAAAGCGCCAGCAGAAACTCACTGAAACTGAGTGCTTCTTCTCCTTCGTCAAGGCTTTCAAAGTATTCTTCGTAAGCCTTTTCCATGATTGTGTCGAAATCCATATCACTCACCTGAGTTTCTTTCTAACCAGCGACGTGCGCCTGTTTCAGTTTTAAACGTCCTGCTTCTGGTGTACGTCATGGCGGTGAACGTTCCATCCTGGTTGGGGAACACGCCACACACCAGGGATTCGTTATTGCCGAGGTGGATTTTTTGCAGCTTGTCCATTATCACCCCGGATAATACTGTTCCTGTAGCTCGCATTGAGCCAAAAACATATCCCATCCATTGTCGCGCAATGCTTGGAGAGCTGCGCTAAAGCTGACGTTGCAACTATCCATCAAATACTGAATCACTTCATTCCTTCCCATCTTTCCCTCTCCCCTTAACGCCGGGTGGCGGAACTAAAACCTACAGCGCCGTGCTGTTTCTGAGATTATATTAGCGATATTCATATAGTTGATCAAGATAAATATGCATATATATCATAAATATGATCTATCCTAATGAAAATAAATGTGTTTTATCTGATGCAAGAGGGGGAGGGAGGAGCTTTAGCCAAAAGAAAACCGCCGGGAGAGGCGGTTTGATGTGGTTGGTTCGTCACTGATTTTTTAGGCGCTTTTGTGCAGCGAGCATGTTCTGGAAAGCCTCTTTATATAGCTCATTCTGACCTTTAAGTCGGTCAATGAGTTTTTCTTTCTCAGATTCAGGGAGTATATCAAAAAGGTTTAGTAAATCAGCCTGTTGTCTGCTCACCATTCGCCAGCCACCACCTTCGAAGTTGTCATCGTAAGTACCAGAAGAACGAACGTAGTTCATTAGATCGGCCAAATCCGGTCGTAACTCTTCGGGTTTAACTCTCAATAGAACAGAAAATTTTAAGGCCGCATCAGTGTTGAGAGGTGCCTTACCGTTCAAATAGTGACTGACGGTAGATTGTGTCTCAAAGCCCATAAGATCAGCGGCGATCTCCTGAGTAAG
TTTGAGGTCTCGCTTTTTGGCGTCCCAGATGGCGCGTAAGCGCTGGGTAGCTTCTGGTGGAGCTATTTCTTCACGTTTTTTTCTCATGCGCTCATCTTATGAATGTGACTCATAAACTCAAACTGATATAAGTATTGATCATTTAAATTAGTATGGTTAATATTTAGCGAGAATTACTAAGGTGACCTTTATGACGTTAGATGAATATTTGAAAAAAAATCGTGTACGACAGTCTTGTTTGGCCACGCTGGCTGGTTGTTCGCAATCGATGATTAGCCTTGTTACTACTGGCCGTAGTCAGTTAAGCCCCGAAAAGGTATTGCGTATCGCAGAGGCTACGAATTTCGAGGTTACACCTCATGAACTCCGGCCTGATAACACTTACGGAGCTGAGGAGGATGACGGGGTTAACCATTTATTCGACCCGCCACTACCTGGACAAGGCAGAACGTTGTGGGGATGTGTACCAGGCGGGCAGAAGAGGGGGGATTTTCCCGTCAGAAGAGGCTTATCGTGCCTGGAAGAAACAGGCGAAAGTGGACGCTGACCTGATTTGGAAGCTGCCTGACGGTGAGGTACGTCGTTACGACAGGCACCACAACGTAATTTGTCGTGAGTGTCGTAAAAGCGAGTACATGCAGCGGGTACTGGCGTTTTATCGGGGAAACTTTCAGGAGGTGCTGTTGTGAGCCAAATTAACAATCGGAACTGCGTGAAGTGAAAGAGAAAGCATAATCCAAATATGAATAATTAAATTTAGTGATGTAAATAAACTTTAATCCTTAACCGGATGGATTCCTGCACGCTCAGAACACCAGGAGACCGCCCGAAAGGGCGGTAGCTCCATTGCTTAATTGTCTAAAATCGTGCTAAATCTTTTTATTACCATTAAGAAAGTTATGACAGTGATAAAAAAGGATGTATAGGCTAAAAAGCTAACGATATATGCGGGTGCGCGAAAATACCATTTCATTAAATCCACTGCATTGTCAGGCAGGAAATATATTATTACTGAAAATATAACCAATACTATTGAAGTTAATATTGCATAAGCGACGTTGTGGCATAACTGCTCATATATGGTTTTGTTAGTGTTTAATGATATCAATTTGTCTCTTGATTTGTTTCCTTCAATTATATCTGATATCTTAGTAATGGTTTTTTGTTTTTGTTCATAAATCATTATTACTGCACTCATTAATAGTGCTGTTGTAATAGCCCCGAAGTTAACGAAGACGGAAGCAATTGCCGGTTTCATTATTCCGTATGTCCAGCACAGAACGAAAGAAAGAGATAGAGGAACTATAAAATGTACGGTAATGTCGCTCATCAACATTGTTCCACGCTGATCTGACATTGTTTTGTAGTGTTTTATTATTACACCCAGCACATTTATTTTATTCATATAATCACCCCTTTATTTCCACAGTGCAGTTCTTCCAATATATCATTGGAAAGGTTTTTTATCGTGTCATGAAGTGCTGTTAGATCAGGTATACCTGTTAATGGGTCGATTTTTAGATCATTATCATCTAACTCTGCTGAAATTCCTTTTTTTAGTATGGTATCATAATTGAAAACGACAGTCCGACTGCCGAGCTGTAAGCTTACTTTTATTGCATCACATTTATCTTCAATAATCTCAATGATGTTTCCTATATTCTTGTTTCTTAAATCCCTGAAACTTCCGAATATGCCATCGTTTGCTTTTATTATTAAGTCTGTCTTGATGTTTGTTTTGTTTTTACCAAAGGAATCAGCAATATCTTCTGGTGCTTTATATCCTTGAGCTTTAATTTGTTTCAATTCAGAATTGAGAATGTATTGAGGGATTTTCTTATGATGTAATGGATTGATTCTTGCTTCTAGTTGAAATTGTTTTTTTAGATATTCAGTGATAGAATCAGAAAGAACACCTCGAGCAGAAATATTATCGCATGAATGGAATGCAATAATTCCTTCTTCAAGATTATCTGGTAGATATATTAAAATATAACGCT
CTTTGAGTGTTACATCATAAGCAGTTGTTCTGTAATGGACTTTTTTGAGTTTTACATCTTTTATTTCACTGCTTTCTCCATATTTTCCAACTTTTATATAACCATATATGATTTTTTTTGTGTTATCAAAGTGAAGTTTAGTATGTTGTTCCAGAGATATTTTAGTTTTTGAAACGCCGAACTCGATGGGGGTGTTTTTATAAAGAGTAAAATAATCAACAAAAAGTTCATATGCCGTTTTTTTATTACTTAAACCTAAGTCATTAAGTTTTTTGCTGGCTCGACTGCCTTTATGGGTCAATACGCGGAATGAATAGAAATTAACGCTGTGCATGAAAAATCCTTTTGAGTATACAGGAATATACTAGAACATAAATGTATGCAATGCATAAAGGAAAAGCTACCGCAGGGCGAATTCACCCACCGATAGCTCTTAAATGATTGTTTTCAAGCGATAATACATAAATTTTGGTCTGTGTAAAGAGGGATGTATCGTCAGGGCAAGGGAGATGTATTAAGGGGTATTGGTAAAATTTGCGTTGGGGATAAAAACGGTTTGCGGGAAAAGGAGAGTTAAGTAGAATTGTTGCGGGTGCTTGAGGCTATCTGCCTCAGGCATGAACACCAAAAGGCAGATAGAGAAAAGCCCCAGTTAACATTACGCGTCCGGCAAGACGCTTAACATTAATCTGAGGCCAATTTCATGCTTTGCACATGTAGGTTAGCCTCTTACGTGCCGAAAGGCAAGGAGAAGCAGGCTATGAAGCAGCAAAAGGCGATGTTAATCGCCCTGATCGTCATCTGTTTAACCGTCATAGTGACGGCACTGGTAACGAGGAAAGACCTCTGCGAGGTACGAATCCGAACCGGCCAGACGGAGGTCGCTGTCTTCACAGCTTACGAACCTGAGGAGTAAGAGACCTGGCGGGGAGAAATCCCCGCCACCTCTGACGTGTCAGGCATCCTCAACGCACCCACACTTAACCCGCTTCGGCGGGTTTTTTGTTACCCGTAAAATAAAAATTCATAAAAATGATCAACTTTCAGATTGGTTGCGCAACAAGTGAAAAATGTCCTTGCTGGTGAACATAAAATAAGCAAATTTATATAATGAAATAAATAGTCGCAGTGTTTATATTTCCCGCCTCAACAGAAATCGCGTTGAAATCGCACCTTTTCATTTTTCCTTAGTTGTCTGGAGGTAACGTGAAAAAACTCAAGGATTTATTAGAGTTAGATGAAGACGGGCTTTATGCAGTACGTGTAAAAAATGGTGAAATCTCATTCTGTACGCTAATTCCTGACGACCATCTGATTCTGTCTGTTGAAGCGTTTATTGATTATCTGATAAGACTGGGTTTCACTGTCAGTTATTAATGTTTTAATATGTTACGGCTGACCTGAACAATCAGCAACCTACAGCGCCACCGGAGAGAACGATGGCGCATCTACAACTTGTTAAACAAACCTCATCAGGGCTTCTGCTCCCGGCAACGCCGGAGAGTGAGGACTTCCTGCGCTCAGTAAAAATCGGTGCGTGGATACACGCCGATTTTAAGCGAGTACGTAACTACGCGTTCCACAAACGTTTTTTCAAGCTCCTTCAGCTTGGTTTCGATTACTGGACTCCGATCGGCGGGGCGATCCTGCCTCAGGAACAGGAGCTGATTACCGGCTTTGTCGATTTCCTGTGTGAGTCAGCAGCGCAGGGCCACAGTCCCGCACTCAGTGACGCGGCGGAACAGTACCTGCATAAGGTTGCTGTCAACCGAACGCTCGATGTTGCGCTGCTCAAGTCCTTCGACGCTTTCCGCGAGTGGGTAACCATTCAGGCCGGGTTTTATACTGAGCATTATTATCCGGATGGCAGCCGTGGGCGCCGGGCGAAATCCATCGCTTTTGCGAATATGGACGAAACCGAGTTTCAGCAGGTTTATAAGGCCGTACTGAACGTCCTGTGGAACTGGATTCTGTTTCGTAAATTCTCCTCTCCGGAAGAG
GTCGAAAATGTCGCAGCGCAACTGCTGGAGTTTGCGTAATGGCGGATTTACGTAAAGCGGCGCGGGGCCTGATGTGTACGGTAAGAATTCCCGGCCATTGCAACCATAATCCTGAAACGTCCGTACTGGCACATTACCGGCTGGCGGGTACGTGCGGAACGGCGACAAAACCAAACGATATGCAGGCAGCAATTGCCTGTAGCTCGTGCCACGATATTGTCGATGGGCGGGTAAAAATCGACGACTTCACGAAAACAGAAATTCGCCTGATGCACGCAGAGGGCGTTTTCCGCACGCAGGAAATCTGGAGAGAGAAAGGCATTTTATGATTTACCCAACAAACACCGGAAAAAGCGGAGAACACCTTCGTCTCAGCACGCTGGAAAGTGTGTGGATTCAGGGGAAATTGCGTATGTGGGGGCGCTGGTCATACATTGGTGGCGGCAAAACAGGGAATATGTTTAACCAGTTGCTGGCGTCCAAAAAACTGACGAAGACGGCCATTAACGATGCTTTGCGCCGTATGAAAAAAGCGGGGCTGGAGAAACCTGAACTGGAAGTGTTCCTGAGAGAGATGATCAACGGAAAGCAAAAAAGCTGGCTGGCACACTGTACGGATACGGAAGCGCTGATTATCGATCGGGTTGTAGGCGAGGTACTGACGGATCATCCGGGGCTGCTTGGTATCCTGAACCAGCGTTACGTGGGGCGGGGGATGAGTAAGAGAAGGATGGCCGAGTTACTAAACGAACAGTACCCAGAGTGGGCGTTGATTACATGCCGACGCCGTGTTGAGCAGTGGTTGAGTATCGCTGAGTTCATTTTGTATTCACCTATGAGAAAAGCGTTCGATTATGCTTAA
Protein sequences of DBSCAN-SWA_4 >CP028151|1462634:1473421|1468918_1469248_+|AWP50093.1|DBSCAN-SWA MNSGLITLTELRRMTGLTIYSTRHYLDKAERCGDVYQAGRRGGIFPSEEAYRAWKKQAKVDADLIWKLPDGEVRRYDRHHNVICRECRKSEYMQRVLAFYRGNFQEVLL >CP028151|1462634:1473421|1465948_1466266_-|AWP50089.1|DBSCAN-SWA MKNVPESVIAELRQLSGKIRTLCIENNMPCVVSYARDCDDESVSRTLVAYTDSETGAYDRSITAAIMLLKMNEAPPENFISLLKLMECKELVTNAFCSMKNESLH >CP028151|1462634:1473421|1471998_1472598_+|AWP50097.1|DBSCAN-SWA MAHLQLVKQTSSGLLLPATPESEDFLRSVKIGAWIHADFKRVRNYAFHKRFFKLLQLGFDYWTPIGGAILPQEQELITGFVDFLCESAAQGHSPALSDAAEQYLHKVAVNRTLDVALLKSFDAFREWVTIQAGFYTEHYYPDGSRGRRAKSIAFANMDETEFQQVYKAVLNVLWNWILFRKFSSPEEVENVAAQLLEFA >CP028151|1462634:1473421|1464051_1464987_-|AWP50087.1|tRNA|DBSCAN-SWA MQEIQKNTKKEQYNLNKLQKRLRRNVGEAIADFNMIEEGDRIMVCLSGGKDSYTMLEILRNLQQSAPINFSLVAVNLDQKQPGFPEHILPAYLEQLGVEYKIVEENTYGIVKEKIPEGKTTCSLCSRLRRGILYRTATELGATKIALGHHRDDILQTLFLNMFYGGKMKGMPPKLMSDDGKHIVIRPLAYCREKDIVRFAEAKAFPIIPCNLCGSQPNLQRQVIADMLRDWDKRYPGRIETMFSAMQNVVPSHLCDTNLFDFKGITHGSEVVDGGDLAFDREEIPLQPAGWQPEEDDTALEALRLDVIEVK >CP028151|1462634:1473421|1472597_1472888_+|AWP50098.1|DBSCAN-SWA MADLRKAARGLMCTVRIPGHCNHNPETSVLAHYRLAGTCGTATKPNDMQAAIACSSCHDIVDGRVKIDDFTKTEIRLMHAEGVFRTQEIWREKGIL >CP028151|1462634:1473421|1462634_1464008_+|AWP50086.1|DBSCAN-SWA MTAFSTLNVLPAAQLNNLTELGYLEMTPVQAAALPVILAGNDVRVQARTGSGKTAAFGLGLLHRIDVTLFQTQALVLCPTRELADQVAGELRRLARFLPNTKILTLCGGQPFGAQRDSLQHAPHIIVATPGRLLDHLQKETVSLDALHILVMDEADRMLDMGFSDAIDEVIRFAPATRQTLLFSATWPEAIAAISGRVQQQPIRIEIDTVDALPAIEQQFFETSAHEKISLLQTLLSQHQPASCVVFCNTKKDCQAVCDALNAVGQSALALHGDLEQRDRDQTLVRFANGSARILVATDVAARGLDIKSLELVVNYELAWDPEVHVHRIGRTARAGSSGLAISFCAPEEAQRANILSEMLQLKLNWLNAPARQPSLPLAAEMATLCIDGGKKAKMRPGDILGALTGDIGLDGADIGKINVHPMHVYVAVRQAVAQKAWKQLQNGKIKGKSCRVRLLK >CP028151|1462634:1473421|1469409_1469964_-|AWP50094.1|DBSCAN-SWA MNKINVLGVIIKHYKTMSDQRGTMLMSDITVHFIVPLSLSFVLCWTYGIMKPAIASVFVNFGAITTALLMSAVIMIYEQKQKTITKISDIIEGNKSRDKLISLNTNKTIYEQLCHNVAYAILTSIVLVIFSVIIYFLPDNAVDLMKWYFRAPAYIVSFLAYTSFFITVITFLMVIKRFSTILDN 
>CP028151|1462634:1473421|1468178_1468646_-|AWP50092.1|DBSCAN-SWA MRKKREEIAPPEATQRLRAIWDAKKRDLKLTQEIAADLMGFETQSTVSHYLNGKAPLNTDAALKFSVLLRVKPEELRPDLADLMNYVRSSGTYDDNFEGGGWRMVSRQQADLLNLFDILPESEKEKLIDRLKGQNELYKEAFQNMLAAQKRLKNQ >CP028151|1462634:1473421|1469960_1470893_-|AWP50095.1|DBSCAN-SWA MHSVNFYSFRVLTHKGSRASKKLNDLGLSNKKTAYELFVDYFTLYKNTPIEFGVSKTKISLEQHTKLHFDNTKKIIYGYIKVGKYGESSEIKDVKLKKVHYRTTAYDVTLKERYILIYLPDNLEEGIIAFHSCDNISARGVLSDSITEYLKKQFQLEARINPLHHKKIPQYILNSELKQIKAQGYKAPEDIADSFGKNKTNIKTDLIIKANDGIFGSFRDLRNKNIGNIIEIIEDKCDAIKVSLQLGSRTVVFNYDTILKKGISAELDDNDLKIDPLTGIPDLTALHDTIKNLSNDILEELHCGNKGVII >CP028151|1462634:1473421|1467009_1467531_+|AWP50091.1|DBSCAN-SWA MSETALVIVKFLIGKSVGQFMLTVALFFLIIIFIPRDITELIEARRDLPYAVQIFSFAVAYLIVLILKVTGYFFVSALPLCQRRGRAKRMLKTLNSLSTEQLFLLEPFLKTHSPTFRASWDNPDADALVKAGIVRPAGSCIDGVSVMFKIEPEYESLMLSTWNPCTKRFDISR >CP028151|1462634:1473421|1466350_1466572_-|AWP50090.1|DBSCAN-SWA MIAHHFGTDEIPRQCITPGDYVIHDGRTYIASANNIKKRRLYIRDLTTQRCITDCMVKVWLNRNGLPAKAESW >CP028151|1462634:1473421|1467638_1467794_-|AWP53057.1|DBSCAN-SWA MQKIHLGNNESLVCGVFPNQDGTFTAMTYTRSRTFKTETGARRWLERNSGE >CP028151|1462634:1473421|1465303_1465921_-|AWP50088.1|DBSCAN-SWA MSDNKTEYSYYIKVKNESARKRLGFPFAFWWKTESSEAAATARLAVSMLDAGFEPTDFAKPVRVNSPAVNELPPEGSFDTTFCQKYELGGEDGKTFMLIPGTPATDAHDEKTEECADDAGTEESGTDTSDNDECQDCEVSVATLPFPQRVLHIFTYAATDKKYLHHATRAQRRHITVLEMEQENSYIQNLLMVLRKSEQVHAQDE >CP028151|1462634:1473421|1472884_1473421_+|AWP50099.1|DBSCAN-SWA MIYPTNTGKSGEHLRLSTLESVWIQGKLRMWGRWSYIGGGKTGNMFNQLLASKKLTKTAINDALRRMKKAGLEKPELEVFLREMINGKQKSWLAHCTDTEALIIDRVVGEVLTDHPGLLGILNQRYVGRGMSKRRMAELLNEQYPEWALITCRRRVEQWLSIAEFILYSPMRKAFDYA >CP028151|1462634:1473421|1471262_1471475_+|AWP50096.1|DBSCAN-SWA MLCTCRLASYVPKGKEKQAMKQQKAMLIALIVICLTVIVTALVTRKDLCEVRIRTGQTEVAVFTAYEPEE |
15 | Escherichia_phage(72.73%) | tRNA | NA |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
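The protein FASTA headers in the region above (for example `>CP028151|1462634:1473421|1464051_1464987_-|AWP50087.1|tRNA|DBSCAN-SWA`) appear to pack the contig accession, the prophage region bounds, the gene coordinates with strand, and the protein accession into pipe-separated fields. The sketch below splits one header into those parts. The field meanings are inferred from the headers on this page rather than from any documented format, and `ProteinHeader` and `parse_header` are hypothetical names introduced for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ProteinHeader:
    contig: str                # e.g. "CP028151"
    region_start: int          # prophage region bounds ("start:end" field)
    region_end: int
    start: int                 # gene coordinates ("start_end_strand" field)
    end: int
    strand: str                # "+" or "-"
    protein_id: str            # e.g. "AWP50087.1"
    annotation: Optional[str]  # optional keyword such as "tRNA"


def parse_header(header: str) -> ProteinHeader:
    """Split one header line into its assumed fields."""
    fields = header.lstrip(">").strip().split("|")
    contig, region, coords, protein_id = fields[:4]
    region_start, region_end = (int(x) for x in region.split(":"))
    start, end, strand = coords.split("_")
    # A sixth field means an annotation sits before the trailing tool tag.
    annotation = fields[4] if len(fields) > 5 else None
    return ProteinHeader(contig, region_start, region_end,
                         int(start), int(end), strand, protein_id, annotation)
```

With the fields separated, checks such as confirming that every gene lies inside its region (`region_start <= start` and `end <= region_end`) become one-liners.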
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
1913016 : 1923002
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >CP028151|1913016:1923002|DBSCAN-SWA TTCATCGTACCGGAACCCTCAGCGGCTCTACCGGGACAAATCCCGGATGGGGAACGGGATTACGCGTGAGGATGTCAGATTCCCGCCCGGCGTCGTCATACCAGGTCGCAGCCAGTACCAGTGCAGGCAGAACATCATCAGGCGTTCGCAATGCAGTACGTTCAACCTGTGCCAGTCGTGCAGAAATATCGCGATTGAGATCCGTCCGCATAACGGAAATTTGCTGGAAAAGCACATCATCCTGGATACGCAACTGCTCCTGGTCAATCGCAGCATTGAGCGCGGTCCGGATAGCTTTCAGATCTTCATAATTCGGTGGAGAGCTGCCATTACTGACTGTCTGTACACCATCCAGCGCCGGGTGCATGACAATGATAATGTCTGAGTCACGGCCTGTTCCTGCAGGCTGATTTACGCCCCGGACACCAGGTACATCACGCGGCTGCTTCAGTGTTGTCACGGCGTGGGCGGCTGTGCTGATGGCTGTTGTCCTGATGGCGGCTGCGATCATATTGCGTTGCATTTTCTGTTTCGCAGCAGATCCGGAGTCAGTGGGCCAGGTGCCACGGGGGGAAAGACCGGGATCAAGCGTGATACCTGACATCGTTTTTATCATCGTGACCAGATCCGATGTACTGCCTCTGAGCCTGTCACCGGAGCGCCAGGCTTTTTGCAGTGCGTTAACGAAATCACTTGCGGCGCTCGGTGGCATCAGAATGACAGACAAATCCCCCTGTAACAGCCGCATTGCGGCAGACACGCCGGAGTCAACCATCCTGAAAGCATCGGCAACATCGCCCAGCATGGAGGCAGCATCGGCAATAACATCGTTCTGGATAAAATCAGAAATACCTGACAACGAGAATGTGGAAAACATACTGTCAATCGCATCGTCGAAAAGCCCGCCTGATGTTTCCAGGCGCTTCGCCGTTGCCATTCCTGCCACCGGAAAAGAAAGTTCACCACTTTCCACAAACTGAAAGGAGACACGACACATGCGCCCTTCTGTACTGCTGTGAGTGATCCTGACCTGTCCGTCAATGCTGCCCTGCATTTCGCCATACTGCGGATGGACCAGCGTACCAGGGCCTGCGGTTTCAATGGCACTAATAAGACGATCCCGCCTGTCTGCGTAATCATCACCGACAAGATAAGCATTTATCGTCAGGCGGCGCGTGGCGCGGCCTAAATCCTCCGTCCAGGGCTTATCCCTGTTCGGATATTCATGCACCTGTACGCGGCGTCCAAACGTACTTTCATCATCTTCAACGGAGACCAGCATACCTCCGGTGCGCTTGGTGGTATAAAAACCGACAAACGGTTTGTTGGTGTACGGGTCACGCAGGATGCGGGTGCCGATACGGTCAACGATGGTGTAACCCCGTTTGAAGTTACCAAACGCAATGGCTTTCGCATCGGCGGCGATATCCGGCATCTGTTCGTTTTCAGCGATACCGTAACCCGCCAGTGAGGATGGCTGTCCCAGCTCCAGCCCCGGACGCCACAGATACCAGCACCATGACATAGGCCAGTAACGAAATCAGGGGGCGGTGTGTCGCATCACCGCGTCGGTAAAACATCAGAACGATGACTATTACCCCACAAATTACGGCATTCAGAACTGCAGAAGGGTCATTTGCTACCATCTGCTCCCCCTCCCCTGATACGAGAAAGAATACTGAACAGGCTGTTCAGATCCTGACTGTTAAGAAAAGTGAGAAACTTTATACACATTGCAGAAATAATCACTGCACCAAGTGCATCCAGTGGTTTTTCATAATGCGTTATTGCCGCAAGCTTAGTACCTATCAGCCCGGCGCCAAGCACTCCCACAATAAATGATGTAATAAAATAAGCGACCAGCCTGATGCGTCCGATGTTGGTTGCCGTGGCGACATAAAACACCGCGCCGGCAAAAGCACCGAA
TACCACACCATAATCAGTTCCGGTTGCCAGACCGAATACACTGGCCCCCATTAATCCACCAGCCAACACTGTCGCACTGGATACAGGTTCGGACATTCATCCCCCTCTGGTTATGTGGGTCCTCTCAGTTATGAGGGGAAATAAAAAAGGCCGCCCGAAGGCAGCCTCAAAGTATTTAGATTTTTTACAGCGATGATTGGTGGAACCGAACTGCAAGGAAGTAAAGTCCCAGCAAAGTTGTTGTTATGATATTCATTAAGATAAATATATAAACGCCAACAAATACCGTTTTAAGCCACAAGATTGCATCAGGAGAACAGAATGTGAGCAGCCACAAATGGAAAGGCTTCCCAATCAGAATAGAAATCATCCCTAAGCAAAATAACATAAAACTCACAAGAGCTAGATAACCAAAAAGGTAACAAACAAAGCGCCTGCGCGTCAGTTCTACAGTAAGCTTCTGCCCTCGGAATTTCTCTACTAGAGTCGGAGGTACGCCCGCCATTACTTCGTCGATCGAAGAGCTAGAAAAAGTAGAAACCGCAGCCAGTGCTGCGATATAAAAACCAATCAAGACTTGAAGTAACCCATTAACCTGAAGCAGGAGTCCGTTAGTCTCGATTAAAGAAATTTTGCTAGCGTGAAAATAATAAACAATAGTGACGATTAGAGAAACTGCAGCTGGTATTTTGTAATCATACCAGTCCTTTTCCTCATGCTTGATGCGGAGATAACTCAGCGGTGAAAAAAGTTTCATATGAAACTCCCGTTAGAGCAACCCTATCATTTTTGTTTCAAGCTGCAGATGTACTGTGCTTTCACATTGGTTGATGAGGTTACCTAATATTACCCTCTCACTTTTAGTGAACAGTTTTGTGGCAGCATCTTCGTTACGGTCAAGATCCAAACTGGCTTGCTTGCCATCTTTTGAGTAACTAATTGAAACCTTGGTATATCCAGACTGCTGCCCTTTCTTTCTTAAAATCTCTAACAACCTTTCTTTATCTTTCAATGGCGGCTGTCTAATGATTTTATACTTTACGGACCTTTCTGAGAGCTCAGTGTACGCCGTTTGGTCCAATCCACCTTTCCTTCTTGTACTCACAAGTTTAACGTTATGAATCTTTGCACCTTTTAATGCATCCATCAGCGTTTGTGAACCATGAGAATAGATTTCCAGCTTTGGGCGGTGCTGGCACATACCTTTAGTTGCAGGATTTTTAAACTCACATCCAGCGAAGGCTTCTCTGAGCATAGCATTTAAAAATGGCTCAAGAACTGACTTACTGATACCGGGGACAGATTCAACGAGAGTTTTGTGGTGATCGGCAGTGTTTTTGACAACATCTGTGGATATTACAATGTGACAAGAAACTGCGATACCTTCACCGGCAAGCTTAGGTTCTACTCTAAGGTTGCCTGTTGTTAACTCACCAAAAACAGGGTCAGAACCATTTTTATCACAAAGCTGGATAAGTAGAGTCGCCTGGCTATCCCCAATAGAATATTTCATCTCCGAAATCCTAAGCGCTCTAGACCTATGATTGTATAACTTTACAGCATTCCCTGAGCTCACCAGTACCTTCAATTTCTTGAGTATGTCTTCAATGGGAATACTCGGCGCCGCTGCGTGTGTAGGCGTAAAAGCAAAGTCAAAAAAGGAAACCCAACGTTCGTTATTAGAGAGCACAATGTCACCACTTAATTTTAGTTCTTGTCCATACAATAACTTAGCAAGACAAAAAATAAAGTGATCACAAATAAAAAACCCACTCGGAGGCGGGTTCTTGAGACTTATCAACGATAGACATACAAAGCCCATCGTTGGGAGAATCTTATCCATATTTTTTGAAATATGCAAGCATCATGTCGCTATCTTCGTTGAAAATCTTTCATCTTGTCACCTTTCTTAATTGCGCTTCTGCATATGCTTCTTCCTGCCAGCATTTTGTAACCAGTTTATCAATGACGTCTGCATATCCTTTGTACCACTGATAATC
CGTCAGATCCGGTACCAGTTTCTGGACATGGTGCCGCGCCAGTGTGGTTGGTAAACGACTAAACCGGTTTCCATTGCAACGCCCACAAATCTTATAAACAGGCGCGCCATGAAGCCGAGTTCTTTTTTCATCCAAGACAATACCTTTACCCTTACACCCTCTGCACACCGTGCTGACTTCTCCCTTACTGAGTAAAAATGGCGAAATAGTGTGGTCTCACGATGTTGCCGCCCAGATTGACAAGAGCGGCGATTCATGGCGCAACGGTAATCATGAATTGATGGCTGGGGTCACATTCTCTCTGCGACGCGCTCTTGAGCAAGCAGAAGTGTTACCGGAAGAAAGCAACTGGATATGGCCTTTTTCAATCATACATGGATCTAACAAGCAATTTCGGCAAGTTGCTCGCCAAGTCGCTCTCGAAGAGCCTCTGAAAGTTGCATGCGAACTTTGTCAGGCATGCCATCAGCATCACAATAACAGCACCGATCAATCATGTCGAAAGCAGACTTGTAAAAATTTTGCTGCTGCTGATCGGTAAGACTGTTGAACAACGAGGTCATGACGATTTTGCTCAGGGCGTAATTCAGATCTTTTTCATCAAAGGTCATTTTGTTTTCCTTCTGTGTTTGAGAGCTATGAAGGATACCACCGCGCCTGATGTGGTTAAAAGCAGGCACTCAACGAATTGCTGTGTGTAGTCTTGTCGGCGTCCGGCTCTTCCAACAACAGGAGGAAGGCGACAGTGTTCTGCCGTGACGCCGACCTTTTTACACAACAGAACCGCGCGACGGGCTCATTACCCAATCCACCCGGAAAGCTGTTACAGCAGGTGCTCTTTTCTGTTTTGTGGAGAAACCAACGTAAAACGCCAGTGCAGAGGCGTTAAGGAAACAAAATGCATAACCCATTTTTTAAAAACATGCTGATATATCGCTTCAGCCGCGACTTTAACATCGACATAGACTCTCTTGATAAGAAACTTGAGCTGTTTCGCTTCTCACCATGCGGAAGCCAGGATATGGCAAAAAGCGGATGGTTTTCACCATTAGTCCAGTATTCAGATGTGCTATATCATGCAGTCAATAACCAGTTACTTTTGGTTATTCGTCGTGAAGAAAAAATCATACCTAAACAGACGATCGCCGATGAGATTAATAAGAAGGTTTCCACGCTTGAGCAGGAGCAAGGCCGCCGTCTTAAGAAAACTGAGAAAGACTCTATTCGTGATGAGGTTCTTCATTCCCTGTTACCCAGGGCGTTTACTAAAAACAGTCTGGTTCGTATTTGGATAAACACTGCAGCCGGGTTTATCGTTGTTGATACATCCAGCATCAAGCGCGCAGAAGATTCTCTCGCCCTGCTTCGTAAAACCCTAGGCTCCTTGCCAGTCGTGCCGCTGACGATGGAAAACCCTATCGAGCTCACGCTAACTGAGTGGGTTCGTAGCGAAGCGGCTCCTTCCGGGTTCTCCATCGGCGATGAAGCGGTCCTTAAAGCTATTCTCGAAGATGGTGGCACAGGCCAGTTTAAAAAGCAGGATCTTGCCTGCGATGAAATTCTTACCCATATCGAAGCTGGAAAGGTAGTTACCCAGATTTCAATGGAATGGCAGCAGCGTATCAGTTTCACCCTTTCCTGTGACGGTATATTGAAACGCGTAAAATTTGCAGATCAGTTAATCAGCCAGAACGACGATATCGACAGGGAAGATGTCGTGCAGCGATTTGATGCTGACATCATGCTGATGACAGGCGAACTCAGTAATCTTATTTCCGATTTAACGGCTGCCCTCGACGGCGAAGCCAAGCGATAACTCATAGCGGCAATTAACCCCACATCGTGGGTTGGGTTGCTGTACTCAAAATTCACGCGGTGCAGCGCGAAGTAAATTATAAGGAGAACCAACGATGAGTTTTATTCAAACACTTTCAGGTAAACAATTTGATTATCTCAGCGCAACTATTGACGACATTGATATTGAAGATATCGCCGTGGCG
CTTTCCAATATTTGCCGCTTCTCCGGACATCTCCCTGAATTTTATAGCGTGGCGCAGCATTCCGTACTGTGCAGCCAGCTTGTATCACCGGAGTTTGCCTTTGAAGCCCTGATGCACGACGCAGCCGAAGCGTATTGCCAGGATATCCCTGCACCATTAAAAGCGTTACTGCCTGATTATCGCGAGATTGAGAAACGTACCGATCAACTGATCCGCTTTAAGTTTGGCTTGCCACTGGAAGAAGCCAGCGTAGTGAAGTATGCAGATCTGACCATGCTGGCAACTGAACGCCGCGATCTGGATATTGATGACAGTATTCCCTGGGTAATACTGGAAGGAATCCCCCCGACAGATTTATTCGAAATCTACCCCCTTCGCCCCGGTCAGGCTTTCGGCCTGTTTATGGCCCGCTTTAATGAGCTGATGGAGCTACGCCAATGTGCTGCATAAAAGATAAAGAGTCTGTAGTGAAGGCAATCAGATCAAGACGTTTGTGGGAGCGCGTTGAAGGCGGTGCAGCATGAACATCGACAAACAGGCGCTGCGTGAAGAGTTCCGCTACATGCGAGACCACTACAACGATCCGGCAGACCGTAAGAAACAGGAAATATATATCGCTGCTGAAGCGCTGCTGGATGAGCTGGATAAAAAACAGCAATACATCAAACTCCGCGACCAGGAGAACGAGGATATTGCGCTAACGGTAGGGAAGCTGCGAGTTGAGCTGGAAGCCGCAGAGAATAGGATCGCTGAGGTTGAATCACAGCGGCGAATGGCTTTCATGGCCTGTAATCGGTGGGCAGATAAGTTCAGAGAGGCAGAGAAGCGGGTTGCTGAGCTGGATGCTAAACCTGTAGCGTGGGAGTGTGGAGAAAACATAATTCTATTTAATCCTGACACCGTTGAGGCGTATGCAAAACGTGCGGAAATATCACCTAAACCGCTCTACGCCGCCCCGCCAGTGCCGGTAGTGCCGGATGATGGCCGCGCTGAATTTGAGGCGTGGATGCTTAAGAAGTGGGGGCGCGAGCGGCAAGAATATGATTTCGCAATGGGTAAGTTTCTTCATGGCGAAAACTACGCTGACAGCTACACGCGGCATATGTGGAAGGCATGGACAGAAAGCCGCGTCGCCCTGCTTCAGAGTAAATATCGCGACCTGTCACAACCAGTAGATCATCAGATTTCCGAATACGAGAAAATAATGGGGCAGGCTGGCAACTCTCCGGTAATTCCGGGTGTTTATCTGGTTGATATTAATACCGACCACCAGCACTGATATTTGATGTTATAGCCCGGGTGCAGCCGGGCTTTGTGGAGAAAAATAAATGTCACGAATGATCCCCTTACTCGACTGGGCCAATGAGGAGTTCGGAGCGCAAGCACCAAGTGAGCGTATCCTTAAGAAATACGCTAAAGGCAAAATGATGATACCTCCAGCTGTTAAAGTAGGTCGTTACTGGATGGTAGACCGTAATGCTCGATTTGTTGGTACGCTTGCCGAACCGAAAATTCCGGCAAACGCCAGTCCAAGATTACAACGGATTATTGCAGATGGCTGCTAGACCACGTTCTCACAAAATTTCAATTCCGAATCTATACTGCAAGCTTGATAAGCGGACGGGCAAGATTTATTGGCAATATAAACATCCTGTTTCCGGACGCTTTCACAGCTTGGGTACTGATGAAGTGGAAGCTAAAAAGGTTGCATCCGAAGCGAACACGATCATTGCAGAACAAAGAACCAGGCAGGTTCTTAGTGTTAATGACCGTCTTGCCAGAATGAAAGGCAGAAGAACGGACATTACTGTCACTGAGTGGATTGATAAGTATATTGAAATTCAGAACGAACGGTTAAAACACCGTGAACTCAGACCTAATTCTTATCGACAGAAAGCAAAACCAGTTAGGTTATTTCGCGAACATTGCGGTATGCAATATTTGAAAGATATTTCCGCATTGGATATCTCTGAGATAACGGATGCAGTTAAGGCTGAAGG
CCATAATCGTATGGCGCAAGTTGTTCGCATGGTTTTGATTGATGTATTCAAAGAAGCGCAACATAACGGTCATGTCCCTCCAGGCTATAACCCTGCCCTGGCGACCAAGCAGCCGAGAAACAAAGTCACTCGTCAGCGTCTTTCTCTGGAAGAGTGGAAAACTATTTATGAAGCTGCCGAAAAGCAAGAACCATACCTCCAGTGTGGAATGTTGCTCGCGATAATAACAGGTCAGCGTTTGGGCGATATCTGTAACATGAAGTTTAAAGACATATGGGACGATATGCTCCATGTCGAACAGGAAAAAACAGGATCGCGTTTAGCCATACCATTGGACTTGAAATGTGAAGCGCTGGGTTTAACTCTTCGGGACGTTGTATCTAAATGCCGGGATGCAGTCATCAGTAAATATCTTGTGCATTTCAGACATACCACCTCACAAGCAAACCGCGGTGCTCAGGTTTCAACCAGTTCTTTAACTTCAACATTCAAAAAAGCACGTGACAGAAGTGGACTGAAATGGGATAAGGGATCCCCACCCACTTTTCACGAACAGAGATCATTATCAGAACGCTTGTACAGAGAACAAGGTGTCGACACGCAAAAATTACTCGGCCATAAATCAAGAAAAATGACAGACAAATATAATGATGACAGAGGAAAAGATTGGGTGATCGTCAACACAAAAACAGGGTGAAATCATAAGGGTTTTGGGGAAACGTTTTGGGGAAAGTTTTGGGGAAGAACTTTAAGAAGGTAATAAAAACGGGAACCCTTCGGCTCCCGTTCTTGTTTAACCCAAAAACTGGATTACATGTTTTCGATGATCGCGTCACCAAACTCTGAACATTTCAGCAGCTTAGCGCCGTCCATCAGACGTTCGAAGTCATAGGTCACGGTCTTCGCGGCAATCGCGCCTTCCATACCTTTAACAATCAGGTCTGCGGCTTCGAACCACTGCATGTGGCGCAGCATCATCTCCGCAGACAGAATAATGGAGCCTGGGTTGACTTTATCCTGGCCCGCATATTTCGGCGCAGTCCCGTGAGTCGCTTCAAACAACGCGCATTCGTCGCCGATATTCGCACCCGGTGCAATACCGATACCGCCAACCTGAGCCGCCAGGGCATCGGAAATATAGTCACCGTTCAGGTTCATACAGGCGATAACGTCGTATTCCGCCGGACGCAACAGGATTTGTTGCAGGAACGCATCGGCGATAACGTCTTTAACCACGATCTCTTTACCCGTGTTCGGGTTCTTAATTTTCAGCCACGGACCGCCGTCAATAAGCTCACCGCCGAACTCGTCGCGCGCCAACTGGTAGCCCCAGTCTTTAAACGCGCCTTCGGTGAACTTCATGATATTGCCTTTATGTACCAGCGTGACGGAATCGCGATCGTTGGTAATAGCGTATTCAATCGCCGCGCGAACCAGACGTTTAGTGCCTTCTTCAGAACACGGTTTGATACCGATACCGCAGTGCTCCGGGAAGCGAATTTTCTTCACGCCCATCTCTTCGCGCAGGAACTTGATCACTTTCTCAGCGTCTGCAGAGTCAGCTTTCCATTCAATTCCGGCGTAGATGTCTTCAGAGTTTTCACGGAAGATGACCATATCGGTCAGCTCCGGATGTTTCACCGGACTTGGCGTGCCCTGATAGTAACGAACGGGACGCAGACAAACGTACAGGTCAAGCTCCTGACGCAGGGCAACGTTCAGAGAGCGGATACCGCCGCCAACCGGCGTGGTCAATGGACCTTTAATCGCTACGCGGTAGTCACGAATTAAATCAAGGGTTTCAGCGGGAAGCCAGACATCCTGGCCGTAAACCTGTGTGGATTTTTCTCCGGTGTAAATTTCCATCCAGGAAATTTTACGCTCGCCTTTATAGGCTTTCTCGACGGCGGCATCGACTACTTTAAGCATGGCTGGAGTGACATCAACACCGATACCATCGCCTTCAATAAACGGGATAATCGGATTCTCAGGAACGTTGA
GTTTGCCGTTTTGCAGGGTGATCTTCTTACCTTCCACCGGAACAACTACTTTGCTTTCCAT
Protein sequences of DBSCAN-SWA_5 >CP028151|1913016:1923002|1920495_1921638_+|AWP50532.1|integrase|DBSCAN-SWA MAARPRSHKISIPNLYCKLDKRTGKIYWQYKHPVSGRFHSLGTDEVEAKKVASEANTIIAEQRTRQVLSVNDRLARMKGRRTDITVTEWIDKYIEIQNERLKHRELRPNSYRQKAKPVRLFREHCGMQYLKDISALDISEITDAVKAEGHNRMAQVVRMVLIDVFKEAQHNGHVPPGYNPALATKQPRNKVTRQRLSLEEWKTIYEAAEKQEPYLQCGMLLAIITGQRLGDICNMKFKDIWDDMLHVEQEKTGSRLAIPLDLKCEALGLTLRDVVSKCRDAVISKYLVHFRHTTSQANRGAQVSTSSLTSTFKKARDRSGLKWDKGSPPTFHEQRSLSERLYREQGVDTQKLLGHKSRKMTDKYNDDRGKDWVIVNTKTG >CP028151|1913016:1923002|1920269_1920506_+|AWP50531.1|DBSCAN-SWA MSRMIPLLDWANEEFGAQAPSERILKKYAKGKMMIPPAVKVGRYWMVDRNARFVGTLAEPKIPANASPRLQRIIADGC >CP028151|1913016:1923002|1915710_1916784_-|AWP50526.1|DBSCAN-SWA MDKILPTMGFVCLSLISLKNPPPSGFFICDHFIFCLAKLLYGQELKLSGDIVLSNNERWVSFFDFAFTPTHAAAPSIPIEDILKKLKVLVSSGNAVKLYNHRSRALRISEMKYSIGDSQATLLIQLCDKNGSDPVFGELTTGNLRVEPKLAGEGIAVSCHIVISTDVVKNTADHHKTLVESVPGISKSVLEPFLNAMLREAFAGCEFKNPATKGMCQHRPKLEIYSHGSQTLMDALKGAKIHNVKLVSTRRKGGLDQTAYTELSERSVKYKIIRQPPLKDKERLLEILRKKGQQSGYTKVSISYSKDGKQASLDLDRNEDAATKLFTKSERVILGNLINQCESTVHLQLETKMIGLL >CP028151|1913016:1923002|1921751_1923002_-|AWP50533.1|DBSCAN-SWA MESKVVVPVEGKKITLQNGKLNVPENPIIPFIEGDGIGVDVTPAMLKVVDAAVEKAYKGERKISWMEIYTGEKSTQVYGQDVWLPAETLDLIRDYRVAIKGPLTTPVGGGIRSLNVALRQELDLYVCLRPVRYYQGTPSPVKHPELTDMVIFRENSEDIYAGIEWKADSADAEKVIKFLREEMGVKKIRFPEHCGIGIKPCSEEGTKRLVRAAIEYAITNDRDSVTLVHKGNIMKFTEGAFKDWGYQLARDEFGGELIDGGPWLKIKNPNTGKEIVVKDVIADAFLQQILLRPAEYDVIACMNLNGDYISDALAAQVGGIGIAPGANIGDECALFEATHGTAPKYAGQDKVNPGSIILSAEMMLRHMQWFEAADLIVKGMEGAIAAKTVTYDFERLMDGAKLLKCSEFGDAIIENM >CP028151|1913016:1923002|1917321_1917552_-|AWP50528.1|DBSCAN-SWA MTFDEKDLNYALSKIVMTSLFNSLTDQQQQNFYKSAFDMIDRCCYCDADGMPDKVRMQLSEALRERLGEQLAEIAC >CP028151|1913016:1923002|1919722_1920220_+|AWP53073.1|DBSCAN-SWA MACNRWADKFREAEKRVAELDAKPVAWECGENIILFNPDTVEAYAKRAEISPKPLYAAPPVPVVPDDGRAEFEAWMLKKWGRERQEYDFAMGKFLHGENYADSYTRHMWKAWTESRVALLQSKYRDLSQPVDHQISEYEKIMGQAGNSPVIPGVYLVDINTDHQH >CP028151|1913016:1923002|1915125_1915698_-|AWP50525.1|DBSCAN-SWA 
MKLFSPLSYLRIKHEEKDWYDYKIPAAVSLIVTIVYYFHASKISLIETNGLLLQVNGLLQVLIGFYIAALAAVSTFSSSSIDEVMAGVPPTLVEKFRGQKLTVELTRRRFVCYLFGYLALVSFMLFCLGMISILIGKPFHLWLLTFCSPDAILWLKTVFVGVYIFILMNIITTTLLGLYFLAVRFHQSSL >CP028151|1913016:1923002|1914647_1915037_-|AWP50524.1|DBSCAN-SWA MSEPVSSATVLAGGLMGASVFGLATGTDYGVVFGAFAGAVFYVATATNIGRIRLVAYFITSFIVGVLGAGLIGTKLAAITHYEKPLDALGAVIISAMCIKFLTFLNSQDLNSLFSILSRIRGGGADGSK >CP028151|1913016:1923002|1917839_1918757_+|AWP50529.1|DBSCAN-SWA MHNPFFKNMLIYRFSRDFNIDIDSLDKKLELFRFSPCGSQDMAKSGWFSPLVQYSDVLYHAVNNQLLLVIRREEKIIPKQTIADEINKKVSTLEQEQGRRLKKTEKDSIRDEVLHSLLPRAFTKNSLVRIWINTAAGFIVVDTSSIKRAEDSLALLRKTLGSLPVVPLTMENPIELTLTEWVRSEAAPSGFSIGDEAVLKAILEDGGTGQFKKQDLACDEILTHIEAGKVVTQISMEWQQRISFTLSCDGILKRVKFADQLISQNDDIDREDVVQRFDADIMLMTGELSNLISDLTAALDGEAKR >CP028151|1913016:1923002|1913016_1914540_-|AWP50523.1|capsid|DBSCAN-SWA MSWCWYLWRPGLELGQPSSLAGYGIAENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRDPYTNKPFVGFYTTKRTGGMLVSVEDDESTFGRRVQVHEYPNRDKPWTEDLGRATRRLTINAYLVGDDYADRRDRLISAIETAGPGTLVHPQYGEMQGSIDGQVRITHSSTEGRMCRVSFQFVESGELSFPVAGMATAKRLETSGGLFDDAIDSMFSTFSLSGISDFIQNDVIADAASMLGDVADAFRMVDSGVSAAMRLLQGDLSVILMPPSAASDFVNALQKAWRSGDRLRGSTSDLVTMIKTMSGITLDPGLSPRGTWPTDSGSAAKQKMQRNMIAAAIRTTAISTAAHAVTTLKQPRDVPGVRGVNQPAGTGRDSDIIIVMHPALDGVQTVSNGSSPPNYEDLKAIRTALNAAIDQEQLRIQDDVLFQQISVMRTDLNRDISARLAQVERTALRTPDDVLPALVLAATWYDDAGRESDILTRNPVPHPGFVPVEPLRVPVR >CP028151|1913016:1923002|1916833_1917148_-|AWP50527.1|DBSCAN-SWA MSPFLLSKGEVSTVCRGCKGKGIVLDEKRTRLHGAPVYKICGRCNGNRFSRLPTTLARHHVQKLVPDLTDYQWYKGYADVIDKLVTKCWQEEAYAEAQLRKVTR >CP028151|1913016:1923002|1918851_1919391_+|AWP50530.1|DBSCAN-SWA MSFIQTLSGKQFDYLSATIDDIDIEDIAVALSNICRFSGHLPEFYSVAQHSVLCSQLVSPEFAFEALMHDAAEAYCQDIPAPLKALLPDYREIEKRTDQLIRFKFGLPLEEASVVKYADLTMLATERRDLDIDDSIPWVILEGIPPTDLFEIYPLRPGQAFGLFMARFNELMELRQCAA |
12 | Salmonella_phage(50.0%) | capsid,integrase | attL 1908767:1908780|attR 1921673:1921686 |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation contains a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
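The keyword rule in the legend above can be sketched as a simple text match. This is a minimal illustration, not the tool's actual implementation. The function names (`phage_keywords_in`, `flag_header`), the case-insensitive substring matching, and the assumption that the annotation sits between the accession and the trailing `DBSCAN-SWA` field of each `|`-separated header are all hypothetical choices made for this example.

```python
# Keywords copied from the legend; 'tRNA' is lowercased because matching
# below is case-insensitive (an assumption, not documented behavior).
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "trna",
)


def phage_keywords_in(annotation):
    """Return the legend keywords found in an annotation string.

    Uses plain substring matching, so unrelated words that happen to
    contain a keyword (e.g. 'template' contains 'plate') also match.
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw in text]


def flag_header(header):
    """Return the legend keywords found in one '>'-prefixed protein header.

    Assumed layout (inferred from the records on this page):
    contig|region|start_end_strand|accession|[annotation|]DBSCAN-SWA
    """
    fields = header.lstrip(">").strip().split("|")
    # Everything between the accession (index 3) and the final source tag
    # is treated as annotation text; it may be empty.
    annotation = " ".join(fields[4:-1])
    return phage_keywords_in(annotation)


if __name__ == "__main__":
    print(flag_header(
        ">CP028151|1913016:1923002|1920495_1921638_+|AWP50532.1|integrase|DBSCAN-SWA"
    ))
    print(flag_header(
        ">CP028151|1913016:1923002|1920269_1920506_+|AWP50531.1|DBSCAN-SWA"
    ))
```

A header with no annotation field yields an empty list, which under this sketch would correspond to an uncolored protein.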
DBSCAN-SWA_6 |
2033169 : 2044139
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >CP028151|2033169:2044139|DBSCAN-SWA GTCAACCGAAGAACTGGTTGAGCGGGTCTTCTTTCTCTTCTCCACCATCAACTTTCACCTTTGTTCTGGCGGCCGGGGTGAGACCGAATTCGACCAGGTAACTTTTAAACCGTCGATCGGCATCAGCCAGCATCGCAACAGCCGGGTTCGCCTTAATCAAAAATCCCCCTTCAGTCTGGACTGTATAAGTTCTCCCTTCGTCCGCGATCGTCAGACGAAGCTGAAGGATATCTGCATAGATATCGCAAAGACGCTCCAGCGCCAGTGAATCGGCAACTGTAAGAATACCCATGCCATCAAGTAAAACTGTGAGCCTGCCCCACGCAACTTTTCCCCAGTCGCTAAGATGTGCTGGCGGGCTGGGGATTTCTTTTGCTGGTTGGGGTTCTTTATCGTTGAGTTTACGTTTGCCCGGGTTGCCGGTAACCACTTTCAGGTGGGTCGGTTTCGGGCGCCGTCCTGCCATCGGAACCTCCCGGAAAAAAACTTTTCATTTCGCGGTTGTGCACACAGAGGGGGGCGGGCGGTCACGCAGGCACAAAGCTGTGAACTTTTAACCCGCCCACCTCCTTCATAGCTGCCACACATATGAGAATTGTTATCGTCTGAACCAGTGCGATGCACGGTCAAGTGGAATACCGTTCTCGTCACAGCCCACGACGACACCACGTTTCTCCATTCGTTGTTTCGTAGAGTCGTGGTGCTGCTTACACAACCCCTGCCAGTTCTTCCGGCTCCAGAATAGCTTTTGTGCCTTCGCTATCGCTTCGGCGTTTCCACTATTCAGTGCCTCTTTCAGTTTGTGCGGAATGATATGATCGACCACCGTTGCCGCCGTCACTCTTCCCTGCTCATGACACATGGCACACAACGGATGAGTACGAAGGAACAGGAGGCGCTCCCGGTCCCATTTGCTGCCGTAGATACGGAGCTCTTTTTTCACATCCGCCCCTATGTGTTGCGGGGCTCCCCGGTCCGGTTTTGTCATGTGGATACCTTGTTGTATAGTCGTCACCATAAACCGCATGGAGAAATGATATGGCTGTCAAAATATTCAGAAGCGATCATCAACCAGCCAGCGAATTAGCTTACCGTAACTGGCTTCGTGATAACCCTGATGGATTTGTCGTGAATGCCTTAAAGAGCGCCAGTGGACAGAACACGAAAAGTGATAAGCGCTTTACCCGAATTCACCGGGCCAAATGTAAAACAATTAATCCGCTGCTGAGTTCAACGGAAAAATCTGGCTTCACCACAGGGCGATACCAAAAGCTTTGCGCCATAAATTTTGACGCCGTTAATAGCGAGGCCCGGATAGTTACTGGCCTGCCGACCATTAAACCATGCCGCTGCGTCAAATGAAGTTACGTTATTATGATGAGTCTGCCCATGGTGATGGCAATAGAAAAGGCCGCCATAATATGCGGCCTTTGGTTACTATCAGCTAACAGATAAAACACAACCCTCAGGAGCCACCTGGAAGAGTTATACCTAACCGGATAGCTGGCAACCTCTGCTTTATACTGGTATTGGCTGGCAGAGGTAAAATAAGAGTAGTTTGTTTTATTTTATCCACATTGATATATCAACGATGATGGCAGTTATTCTTATAAATAGAGAATTGTTTAATCATTCGACTACAAAGGCAGCATTATATCCTGTCAATACTGGGTTATTTTCATGAGGTGTGCCAGTTTTTAACGTCTGGTTACGCTGCGTTGATACATGAGATCTTTTTCTTCAGTACCATGGTATGTGACATACGTCCGTATATCCCCTTATAGGACATTTTGTGCTCTTTATGACACCCCGCAGGCCGGAACCGTAACCGTCCTGCGGGAATTTTTTATCTGCACTGCGTCCGGATGTACTCCTGCAAATACTTCAGTTTTTCCTGATCGCTGATGATTCCGGCGCGGATATCGA
GAACGTTTTGTCCAGCAACTGGAGAGAGTTCGACGGTGGCAGCATTGCCCACGCGGCGGGTACTGGCGGTTTTGGTTGTGGTTGGCACTGTACAGCGTCCTTCGACACGCACCCGGCCACCAGCAGCAAGGAGGCGCTGCAAATCAGTATTCCTGTTTTGTGCATCAGCTAGTTCCTTTGTGTATTTTGCATCGAGGGCGGCAACGTCACGCTGGCGCACCTCCATATCGCTGATAGTCTCGTTCGCCAGCGTCAGACTATGACTGGCGGTATCACGCTGCGATTTATACTGAATGGCGTTGTCGCGGTAATGGTCTGTTGTCCATGCAAGTAAGGGGGAGTGATATGGCTACTTTGACGAAACAGGAAAAAGCCTGGGTAAAGAAACTCAATAAGCTACTGGCGGAGTGTCCCTCAAATCGGATCGCGTTTGCGACGACTGGCGATTGTGAAGTATCGCTGTTTGATGCGACTCGCTATGACGAAATTTTTGATGAAGTAGAGAAGGGGAAAAGCGAATTTATCCCTTCCGCTATGCGTATCGGCGCAACCTTTAATGAGTGCCTGACATTTCCTAACCAGGTTGAAAGCACGGCAGGCTGAGGACTAACCCATGACCACTATTACCAAAGAATGGCTACAGCAAACTATCGCTGAATTTGAAAACACTCGCGACGATATTCCGTTTGGTCTCGACGATGATGACGCCAAAATTTTGTTGGTGTTGAAGCGGGCGCTGGCATCGCTGGATGCTGAGCCCGTGCGATACCTGAATAAATTTTCCGGTACATGCGTGACGTTAGAGCAGCAGTCAAACGCTGCAGATGATGTTGCCGTGTATATACCGCTCTACACAGCACCGCCAGCGCCGGTAGTGCCGGAAGAGGCTTACAGCGACGACTGCCCTGACTTATACGCCAGTCAGCCGGAAGCTTGGGCTGCTGGCTGGAACGCTTGTCGCGCTGCCCTGCTTCATGGTGCCGAACCTGTAAGCCAAACTTACGAGTTGCCACAAACGCAGTTTGAGCAGGTTGCTGACCTCTACGAAATGCAATTTGATGATGGGCGCACCTGCGCATTCCACACAGATGGTGCAAAAGCTGCTCAGTGGTTGCTCGCATGCGATGGTAATAAGGTGCAGGAATACGTCAGGCTTGAGCGCTATCATGAGGCTCTTATTGGCAACTCTCCGGTAATTCCGGATGATTGGGTTATGGTGCCGAAGAAACTAACTGCTGAGAACGGTGCCAAGAGTTTGCTATCCGGTGAGTTTTTAGAAACTACTTTTATAAGCTTTCCTGAATGCTTGGCCGACGAAGAATGCGAAAGCTGCGACGGCAGCGGGCGAATTAAAATTGAGGTTCCTGTCAGCTGGACGACGATTAAGGCTATTTGGAATAAAGGCGTTGAACATTTTCGTAGCAGCACCGCAACAGGGGACAACTAATTTATGAATAACTTGATGATCGACCTTGAAACTATGGGTAAAAAACCTAACGCGCCTGTTGTCTCCATCGGTGCTGTGTTCTTCGATCCGCAAAGTGGTGAAATTGGACCTGAGTTCTATACCGCCGTTAGCCTTGAAAGCGCAATGGAACAAGGTGCCGTTCCTGATGGCGATACCATTCTATGGTGGTTAAGACAAAGCCCGGAAGCGCGAGCGGCTATTTGCGCTGATGCAGTATCTGTTACGACCGCGCTTATTGAGTTCAATGACTTTATCACCTGTCACGCCGACGATTTGAAATACCTGAAGGTATGGGGTAACGGTGCCAATTTCGATAACGTTATCCTGCGTGGCGCTTTCGAACGTGCCAGCCTCCCCTGCCTGTGGAATTACCGGAACGATCATGACGTCCGCACGATGGTTACTTTGGGTCGTGCAATCGGCTTCGATCCCAAACGTGACATGCCGTTCGAAGGCGATATGCACAACGCGCTGGCTGATGCCAGGCATCAGGCGAAATACGTTTCAGCTATCTGGCAGAAACTGATCCCGC
CCACCAGCAACAATATCTGATTTAAACCGGGTGCAGCCGGTTAGATGGAGAAGCAACTCATGAGCGATCGCTTCCTGACTGAGGAGGAACTGGAAGATGCTACAGGAGCAAGCCAGAAGTCACTCCAGAAAGAAGTATTAACGCTGAACGGTATTTATTTTATAGAACGCCGGGACGGTTCAATCAGAACAACCTGGTATCATATAAATCACCCAGTTTCGCGCCTTCTTCCACCAGCAGGGTATCAGCCTGTACCAGGCATGAATTTTGACGCTATAGAGAGTTAACATGGGTCGCAAACGTGCGCCCGGTAATGAGTGGATGCCAAAGGGTGTATTCTTTCGCCCTTCTGGTTACTACTGGAAACCGGGAGGATCAACAGAAAATATAGCTCCAGCTGATGCAACTAAAGCTGAGGTCTGGGTGGCTTACGAAAAAAAAGTTGAGGGTAGAAAAAACAGAATTACATTCACACAATTATGGCGAAAATTTCTTGCCAGTGCCGATTATGCTGATCTGGCCCCAAGAACGCAGAAAGATTATCTGGCACATGAGAAATATATACTTGCCGTATTTGGTGATGCCGAAGCTAAAGCAATAAAGCCAGAACATATCCGGCGTTATATGGATGCCCGTGGGCAAAAAAGCCGTGTCCAGGCGAATCATGAACACAGCTCTATGTCGCGCGTATTTCGTTGGAGTTATCAACGTGGTTATGTTCCTGGTAATCCTTGCGTTGGTGTGGATAAGTTTCCTAAGCCTCAACGCGATCGATATATTACCGATGAAGAGTACAGAGCGATATATAATAACGCAACGCCAGCCGTCAGGGCTGCAATGGAAATAGCTTATTTATGTGCTGCCAGAGTTTCTGATGTATTGAAAATGAACTGGAATCAAATACTGGAGAAAGGAATTTTTATTCAGCAAGGAAAAACCGGAGTTAAACAAATTAAATCCTGGACAGATCGCTTACGTGATGCCGTTGAAATATGTCGTGAATGGGGAGAGGAAGGCCCTGTTATCAGGACTATGTATGGCGAGCGTTATTCTTATAAAGGATTTAACGAGGCGTGGAGAAAGGCGCGAAAGGCTGCGGGGGATGATCTGGGACGTCCTCTTGACTGCACTTTCCACGATCTAAAGGCAAAGGGGATTTCAGACTATGAGGGAACGGCGAAAGACAAGCAGAAGTACAGTGGCCACAAAACCGAATCCCAGGTTCTTGTTTACGATCGCAAGGTGAAAATGAGCCCAACCCTGGACAGGAAGCGTTGAGCTTTTCGATGTGCGCCAGTAAAAATTCTGGCGTTTTTTTCTCACCGAATTTTCTCATTTTTTCTCAACGTGATTTTCATCACTATAAGAAAATCACGTAAGTGCTTGAATAGTGGCGGAGAGAGAGGGATTCGAACCCTCGGCGGAGTTACCCCCGCAACGGTTTTCGAGACCGGTCCGTTCAGCCGCTCCGGCATCTCTCCGTATATTGCAATGATGCCAGGTAATTTGGCATTTTAACAGACCCTATTCGGGTAATTTTGTTCAAGTGACGAGTTTACGAGCAAAACGATGATTAAGTGGCCCTGGAAAGCACAAGAAATAACCCAGAACGAAGACTGGCCGTGGGATGATGCGCTGGCTATACCTCTTCTGGTAAACCTCACCGCGCAAGAACAGGCTCGGCTTATTGCGCTAGCCGAACGTTTTTTGCAGCAGAAAAGACTGGTAGCGCTACAGGGATTTGAGCTCGACTCGTTAAAAAGTGCACGTATTGCGTTAATTTTTTGCTTACCGATCCTGGAGCTCGGTATTGAGTGGCTTGATGGTTTTCATGAAGTGCTCATTTATCCCGCGCCCTTTGTGGTAGATGATGAATGGGAAGATGACATAGGTCTGGTGCATAGCCAGCGTGTCGTACAGTCGGGGCAAAGCTGGCAACAAGGGCCCATCATTCTGAACTGGCTGGATATCCAGGACTCGTTCGATGCTTCGGGTTTCAA
CCTCATTATTCATGAAGTCGCGCACAAACTGGATATGCGTAATGGCGATCGCGCCAGCGGCATCCCTTTCATCCCGTTACGCGATGTGGCTGGCTGGGAACACGATCTCCACGCGGCAATGAATAATATTCAGGATGAAATCGATCTTGTTGGCGAAAGCGCTGCCAGTATAGATGCCTATGCCGCCACCGACCCTGCAGAATGTTTTGCCGTGTTGTCAGAGTATTTTTTCAGCGCGCCAGAACTGTTTGCTCCACGTTTCCCGGCACTATGGCAGCGTTTTTGCCAGTTCTATCGCCAGGATCCTTCTCAGCGCTTACGGGTAAGCGCTGCCGAAGGCGACTACGGCGAGGAATCCGAACATTAATTCCTCACTTTGTGGGTTAATTAACCAATTGAATTGGCGCGTTAATTTTACTGTTGACACGTTATAGCCGGCCCAGTATTATGCGCCTCGTTGAAACAATTCCTCTGTAGTTCAGTCGGTAGAACGGCGGACTGTTAATCCGTATGTCACTGGTTCGAGTCCAGTCAGAGGAGCCAAATTTAGGGAAGCAGACGTTCAGTGACGTCTGCTTTCTGCATTTATATCAACTGGTTATGCCCTTCTTCAGGTTCACCCTCGTTCACTAAAAACCACTCGAAGCCATACCCTTTTGCTGGTAAAGCTGGTTCGATTTGCGTTTTACCAGCACGCGGGGGGAACCGTCATGTCACTGACTGATACTAAAGTAAAAAATGCCAGACCAGCGGAAAAGGCCGTCAAGCTCACTGACGGGTTTGGCCTCTATCGATTCAAAATACTGGCAGACAGGCTATCGCTTCAATGGCAAACAGAAGGTGTTTTCTATTGGGGTTTACCTTGCGGTTTCTCTTACTGATGCCAGACAACGCCGTGACGAGGTCAAAAGGCTGCTGGCTCAGGGGATTGACCCGAACGCAAAAAACAGGCTGATGAAAAAATCCTTCAGGAAAAGCGCGATAAAACCCGCTCGTCCCGTGTCGTCGCCAAAAGCTGATGCGCCATAATTCTGCCTATGATATTGACGGAAATCTTTTCGCCTGCACCAGAAATTTATCTGCCATTTCCGCTACCGGCGTCAGACTGCCTGTATCAACCATTTTTACAAAATATTTCACGTCTAAAGTTCATTCTGCCCCCTGCCCTTAATCTCTACGGCGTTATGTCTCAGAATTATTTGCCAAGTGCCTGCCAGTTTTTCACGTTTCATCAGACGCCGGCACATAGCCATTGCGGTAAGGTCACAGCATTTGACTTGTGCAATTACAGACAAAGTTGCGCCATGCCGGGGCAAAGTAGAAATTAGATCAAAACTTCAACGCTTTGTTGTTTTTGTCAGCAAACAAACGCGCAACCTTATTTCCCCCTTTGACAAGCCGATCGCACATCGTTACTATACGCCCCGTTCACACGATTCCTCTGTAGTTCAGTCGGTAGAACGGCGGACTGTTAATCCGTATGTCACTGGTTCGAGTCCAGTCAGAGGAGCCAATTTAAGGGAAGCAGACGTTCACTGACGTCTGCTTTCTGCATTTCTATCAGTTGGTTATCCCTTCTTGCACGTTCACCCTCGTTCACTAAAAACCACTCGAAGCCATATCATTTTGATGGTAAAAATGCTGGTAATGCTGGTTCGATTTGCCTTTTACCAACAATCGAGGGGGTGTTTTCATGTCACTAACTGATATTAAAGCAAAAAATGCAAAACCCCTTGAGAAGGAATATAAGCTAACCGATGGCTTTGGTATGTTCCTCCGTGTTACCCCTAAAGGTTCGAAATACTGGCAAATGGCCTACCGTTTCGAAGGAAAGCAAAAACTCTTCTCTATTGGTGTTTACCCTGCTGTTTCTCTTTCTGACGCAAGACAACGCCGTGATGAGGCCAGAAGGCTTCTTGCTCAGGGCATTGACCCTAATGCAAAGAAACAGGCAGAGGTTAAAGAGCTAAAAGCCAAGCGTGATAAGACACGCTCTTTCA
GTATTGTAGCTAAAGCTTGGTTCTCTACGAAAACCAAATGGTCAGAAGATTACGGTGATTCCGTATGGAAGCGCCTTGAAACCTATGTCTTCCCGGCAATTGGCGATAAAGATGTTACCGAACTGGATACGGGTGATCTGCTGGTTCCAGTCAAAAAGGTTGAGGCACTTGGCTATCTTGAAGTTGCCATGCGCATTCAACAATACATTACTGCGATCCTGCGTCATGCCGTACAGCAAAAACTCATACGCCATAACCCCGCTTACGATATGGAAGGTGCTGTTCAGAAACCACAGACCGAACATCGTCCCGCGCTGGAACTGGAAGAAATCCCTCAACTACTTAAAAAAATTACCGAATACAAAGGCCGCAGACTAACCATACTGGCAATCCAGCTCAATCTGATGATTTTCATTCGTTCCAGTGAACTTCGTTTCGCTCGCTGGTCTGAAATTGATTTCAAAAGTAAGTTGTGGGTGATACCCGAACAGCGTGAAGCGATTGAAAATGTCAAACATTCCACTCGTGGGGCCAAAATGAAGCGTAAGCACTTCGTTCCCCTTTGTAATCAAGCCATGAGGATACTGAAAGAGATCCAACAACTAACTTATGAAGAAGGTCATAATGACGGATTAATCTTTACTGGCTGTTATGACTCGTTTAAGCCCATGAGCGAAAACACCATCAACAAAGCCCTGCGCAATATGGGCTATGACACGAAGCAGGACATTTGTGGTCATGGCTTCCGCACACTGGCCTGTAGTGCCTTAATTGAGTCTGGTTTGTGGTCAGAAGATGCTGTGGAGCTTCAAATGAGCCATAAGGAAAGCAACAGCGTTCGTGCTGCTTACACCCATAAAGCCAAGCACCTTGACCAGCGACGCCTGATGCTTCAGTGGTGGGCTGACTACCTTGATGCAAGCAGAAACGGTATGGTCAGGCCGTTTGAATTTGCCACAGATGCTCGTTTTGGGTTAGAAAACCAAGCCTAACCGCTTGATAACAAATCACATAGCAGCATGTAACCCTTTGTTTTGAATATATTTGAATAGCCATCATTGGTTGATGGCTATGACTTCATACCGTAGAATATTGGCCAAATTTGAATGAAAGCTCACTGCGAGCAGATGCCGGAAAAATTTTATGATCAAAATAACTTCTGATATGCCAGAACTACAATCAATGAATATCACTGAAGAAAATATCAGCAAGCTTAAATCATTGTTTCCTGAAGCATTTAATGAAGGAAACATTGATTTTGATGTTTTAAAGCAGCTTTTGGGTGAAAATGTTGACGAAAAAGAAGAGCGCTATGGACTGAACTGGCATGGCAAACGTCAGGCTCGCCAACTTGCTCTCACCCCTTCCCGTGGCACCTTACGCCCATGTAAAGATGAAAGTGTTGATTGGGATAACACCAAAAATATTATGATCGAGGGCGATAACCTTGAGGTGTTAAAACTCCTTCAAAAAAGTCATGCCGGAAAAGTTAAGCTAATTTATATCGACCCACCGTATAACACCGGGAAAGACTTTGTTTATCCTGATAACTTCCAAGATAATATGAAAAATTATCTGGAGATTACTGGGCAAACAGAAGATGGCGTGCGTATAAGTTCTAATTCAGAAACCAGTGGGCGTTATCACACAGACTGGTTGAATATGATGTATCCACGTATTAAACTTGCAAGAAATCTTCTCAAAGAGGATGGGTTTATATTTTTAAGCATTGATGACAATGAAGTCAATAACTTGAAATTAATGTGCGATGATATTTTTGGGCAAGAAAATTTTGTTGCCAATGTTATTTGGCAAAAAAAATATAGCACAAAAGCAGATTCAAAAAACTTTTCGGAGTCACATGACTATATTCTTTGCTACAAAAAATCAGACCAATCAAAAATACTCGGTTTACCAAGAAGTAAGCAACAAGAATCAACATACAAGAATTTAGATAATGACCCTAGGGGTGTATGGGCTTCAGATAATCTTCTCAGAAC
TGAAGTCAGAGATTATGCAGTTTTTGGTATCACATCCCCCTCGGGGATGGAACATTATCCTCCCGCTGGTAGTAGTTGGAGATTTAACAAGGATAAAATTGAAGAGTTAATTTCTGATAACAGAATATGGTTTGGCGAAGATGGTAATAACAAGCCGCGACTAAAAAGATTTCGCTCAGAGGTGAGAGACACTATTCCTCCACAAACTCTTTGGGGATTTGAGCATGTTGGGCACACAGATGAAGGTACAAAACAGTTAGCAGAACTATTTGATAGCACACGTTCACCATTTCCTAATCCAAAACCCGTAAGGTTGCTACAGCGGATTGTACAAATAGCGACCAAAGAAAATGATATTATCATGGATTTTTTTGCGGGAAGTGGTACTACAGGGCAAGCAGTGTACGAATTGAACGAGACGGATAAACAAGATAGAAAGTTCATTCTGGTTCAACTCCCTGAAACACTATCTAAAAGCAATAAGGAAGATCTTTCTGCAATCTACTTCTGCGAAGAGCTTGATAAGCCATTGTATCTATCTGAATTAACGAAAGAACGCCTCCGCCGTGCAGGTAATAAGGTTAAAGCAATTAATCCAGACTGGAATGGAGATACCGGTTTTCGTGTGTTCAAACTTGATACATCCAACATCCGACCGTGGGAAGCGAGGGCTGAAACACTTTCAGAACAGCTTGATGCCTATGTAAGCCCAATCCTCGAAGGACGTAGTGAAGAAGATTTACTCACAGAGTTGATGCTCAAACGTGGCATTGACCTGAGTGTAAACATTGAAACCCGTCAGTTTGATGGATTAACAGTTTCTAGCGTCGAAGGTGGCAAATTGTTTACCTGCTTTGCCAGTCAAATCCCAGTCTCTTCAGTTGAAGGACTGACTAAGGGGATCATCGACTGGCATAAGAGTTTGAAGGCTGGCAAAGATACTGTCTGCTATTTTCTTGATGACGCATTTGAAAATAACGTCGCTAAAACAAATCTTTGCGCCATTCTGGAACAGCATGGCCTGACTAACTTGCACAGCCTGTAA
Protein sequences of DBSCAN-SWA_6 >CP028151|2033169:2044139|2033766_2034156_-|AWP50646.1|DBSCAN-SWA MTKPDRGAPQHIGADVKKELRIYGSKWDRERLLFLRTHPLCAMCHEQGRVTAATVVDHIIPHKLKEALNSGNAEAIAKAQKLFWSRKNWQGLCKQHHDSTKQRMEKRGVVVGCDENGIPLDRASHWFRR >CP028151|2033169:2044139|2033169_2033634_-|AWP50645.1|terminase|DBSCAN-SWA MAGRRPKPTHLKVVTGNPGKRKLNDKEPQPAKEIPSPPAHLSDWGKVAWGRLTVLLDGMGILTVADSLALERLCDIYADILQLRLTIADEGRTYTVQTEGGFLIKANPAVAMLADADRRFKSYLVEFGLTPAARTKVKVDGGEEKEDPLNQFFG >CP028151|2033169:2044139|2038663_2039461_+|AWP50654.1|DBSCAN-SWA MIKWPWKAQEITQNEDWPWDDALAIPLLVNLTAQEQARLIALAERFLQQKRLVALQGFELDSLKSARIALIFCLPILELGIEWLDGFHEVLIYPAPFVVDDEWEDDIGLVHSQRVVQSGQSWQQGPIILNWLDIQDSFDASGFNLIIHEVAHKLDMRNGDRASGIPFIPLRDVAGWEHDLHAAMNNIQDEIDLVGESAASIDAYAATDPAECFAVLSEYFFSAPELFAPRFPALWQRFCQFYRQDPSQRLRVSAAEGDYGEESEH >CP028151|2033169:2044139|2035014_2035389_-|AWP50648.1|lysis|DBSCAN-SWA MQYKSQRDTASHSLTLANETISDMEVRQRDVAALDAKYTKELADAQNRNTDLQRLLAAGGRVRVEGRCTVPTTTKTASTRRVGNAATVELSPVAGQNVLDIRAGIISDQEKLKYLQEYIRTQCR >CP028151|2033169:2044139|2037382_2038372_+|AWP50653.1|integrase|DBSCAN-SWA MGRKRAPGNEWMPKGVFFRPSGYYWKPGGSTENIAPADATKAEVWVAYEKKVEGRKNRITFTQLWRKFLASADYADLAPRTQKDYLAHEKYILAVFGDAEAKAIKPEHIRRYMDARGQKSRVQANHEHSSMSRVFRWSYQRGYVPGNPCVGVDKFPKPQRDRYITDEEYRAIYNNATPAVRAAMEIAYLCAARVSDVLKMNWNQILEKGIFIQQGKTGVKQIKSWTDRLRDAVEICREWGEEGPVIRTMYGERYSYKGFNEAWRKARKAAGDDLGRPLDCTFHDLKAKGISDYEGTAKDKQKYSGHKTESQVLVYDRKVKMSPTLDRKR >CP028151|2033169:2044139|2035439_2035697_+|AWP50649.1|DBSCAN-SWA MATLTKQEKAWVKKLNKLLAECPSNRIAFATTGDCEVSLFDATRYDEIFDEVEKGKSEFIPSAMRIGATFNECLTFPNQVESTAG >CP028151|2033169:2044139|2034206_2034530_+|AWP50647.1|DBSCAN-SWA MAVKIFRSDHQPASELAYRNWLRDNPDGFVVNALKSASGQNTKSDKRFTRIHRAKCKTINPLLSSTEKSGFTTGRYQKLCAINFDAVNSEARIVTGLPTIKPCRCVK >CP028151|2033169:2044139|2039832_2040123_+|AWP50655.1|DBSCAN-SWA MPDQRKRPSSSLTGLASIDSKYWQTGYRFNGKQKVFSIGVYLAVSLTDARQRRDEVKRLLAQGIDPNAKNRLMKKSFRKSAIKPARPVSSPKADAP >CP028151|2033169:2044139|2042237_2044139_+|AWP50657.1|DBSCAN-SWA 
MIKITSDMPELQSMNITEENISKLKSLFPEAFNEGNIDFDVLKQLLGENVDEKEERYGLNWHGKRQARQLALTPSRGTLRPCKDESVDWDNTKNIMIEGDNLEVLKLLQKSHAGKVKLIYIDPPYNTGKDFVYPDNFQDNMKNYLEITGQTEDGVRISSNSETSGRYHTDWLNMMYPRIKLARNLLKEDGFIFLSIDDNEVNNLKLMCDDIFGQENFVANVIWQKKYSTKADSKNFSESHDYILCYKKSDQSKILGLPRSKQQESTYKNLDNDPRGVWASDNLLRTEVRDYAVFGITSPSGMEHYPPAGSSWRFNKDKIEELISDNRIWFGEDGNNKPRLKRFRSEVRDTIPPQTLWGFEHVGHTDEGTKQLAELFDSTRSPFPNPKPVRLLQRIVQIATKENDIIMDFFAGSGTTGQAVYELNETDKQDRKFILVQLPETLSKSNKEDLSAIYFCEELDKPLYLSELTKERLRRAGNKVKAINPDWNGDTGFRVFKLDTSNIRPWEARAETLSEQLDAYVSPILEGRSEEDLLTELMLKRGIDLSVNIETRQFDGLTVSSVEGGKLFTCFASQIPVSSVEGLTKGIIDWHKSLKAGKDTVCYFLDDAFENNVAKTNLCAILEQHGLTNLHSL >CP028151|2033169:2044139|2036544_2037114_+|AWP50651.1|DBSCAN-SWA MNNLMIDLETMGKKPNAPVVSIGAVFFDPQSGEIGPEFYTAVSLESAMEQGAVPDGDTILWWLRQSPEARAAICADAVSVTTALIEFNDFITCHADDLKYLKVWGNGANFDNVILRGAFERASLPCLWNYRNDHDVRTMVTLGRAIGFDPKRDMPFEGDMHNALADARHQAKYVSAIWQKLIPPTSNNI >CP028151|2033169:2044139|2037138_2037381_+|AWP50652.1|DBSCAN-SWA MEKQLMSDRFLTEEELEDATGASQKSLQKEVLTLNGIYFIERRDGSIRTTWYHINHPVSRLLPPAGYQPVPGMNFDAIES >CP028151|2033169:2044139|2035707_2036541_+|AWP50650.1|DBSCAN-SWA MTTITKEWLQQTIAEFENTRDDIPFGLDDDDAKILLVLKRALASLDAEPVRYLNKFSGTCVTLEQQSNAADDVAVYIPLYTAPPAPVVPEEAYSDDCPDLYASQPEAWAAGWNACRAALLHGAEPVSQTYELPQTQFEQVADLYEMQFDDGRTCAFHTDGAKAAQWLLACDGNKVQEYVRLERYHEALIGNSPVIPDDWVMVPKKLTAENGAKSLLSGEFLETTFISFPECLADEECESCDGSGRIKIEVPVSWTTIKAIWNKGVEHFRSSTATGDN >CP028151|2033169:2044139|2040787_2042086_+|AWP50656.1|DBSCAN-SWA MSLTDIKAKNAKPLEKEYKLTDGFGMFLRVTPKGSKYWQMAYRFEGKQKLFSIGVYPAVSLSDARQRRDEARRLLAQGIDPNAKKQAEVKELKAKRDKTRSFSIVAKAWFSTKTKWSEDYGDSVWKRLETYVFPAIGDKDVTELDTGDLLVPVKKVEALGYLEVAMRIQQYITAILRHAVQQKLIRHNPAYDMEGAVQKPQTEHRPALELEEIPQLLKKITEYKGRRLTILAIQLNLMIFIRSSELRFARWSEIDFKSKLWVIPEQREAIENVKHSTRGAKMKRKHFVPLCNQAMRILKEIQQLTYEEGHNDGLIFTGCYDSFKPMSENTINKALRNMGYDTKQDICGHGFRTLACSALIESGLWSEDAVELQMSHKESNSVRAAYTHKAKHLDQRRLMLQWWADYLDASRNGMVRPFEFATDARFGLENQA |
13 | Salmonella_phage(54.55%) | lysis,terminase,integrase | attL 2037222:2037235|attR 2046233:2046246 |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation contains a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
DBSCAN-SWA_7 |
2158869 : 2169376
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >CP028151|2158869:2169376|DBSCAN-SWA ATTAGAAATTTAAACCAAAAAACTCTTCAAATTTGCTAACTACATAATCTAAATGCTCTGTAGTCAAGCCAGGATAAATACCAATCCAGAACGTTTGATTCATTATACGGTCGGTATTTGTCAACTCACCCACTACACGATATTTCACATTAGCAAAATACGGTTGGCGAATCAGATTTCCAGCAAACAGTAAACGTGTACCGATTTTTGCTTCATCAAGGAATTTCACCAGTTCGACACGGTTAACACCGCTAGTTTCTTTCAGGGTGATAGGGAAGCCAAACCAGGATGGGTCTGATTTCTCTGTTGCTTCTGGTAATTCGAGGAATTCAGTGCAAGATTGCAAGCCCTGTTTCAGATAGGAAAAGTTAGCTTTACGCTGCTCTACAAACTCTTCTACGCGCTCCAACTGAGCCAGACCACATGCTGCCTGCATGTCCGTGATTTTGAGATTATATCCGAGGTGGGAATAAGTATATTTGTGATCATAGCCTTGAGGAAGTGATCCCAATTGCTGACCAAAACGTTTACCGCAGGTGTTATCGCATCCTGGCGCACAATAACAATCCCGGCCCCAGTCACGGAACGACTCAATAATTTTCTTCAGTTCACCTGACTTGGTGAATACAGCACCGCCTTCACCCATTGTGATATGGTGAGCCGGATAAAAACTAACGGTTCCGATGTCACCAAAGGTACCTACCATCTGGCCTTCATAAGTCGTCCCAAGGGCATCACAGCAGTCTTCAATCAACCATAAGTTATATTTATCGGCAATCCGACGAACTTCACTCAGGTTAAATGCATTACCGAGTGTATGAGCGATCATTATCGCTTTTGATTTCTCAGTAACTGCAGCTTCAATGAGAGAGGCATCGATATTATATGTCGGGATATCAACATCCACGAATACCGGTATTAAACCATTCTGGATCGCCGGGTTAACTGTAGTCGGGAAGCCAGCAGCGACAGTAATAACCTCATCACCAGGTTTGAGAGCACGCTCGCCTAATTTTGGGGAAGTCAGCGCAGTCAGTGCCAGCAAGTTTGCCGAAGAGCCAGATGTAGTCGTTAAAACATGAGGAACCCCAATAAATTCCCCAAGTTTTTTCTCAAAGGCATCATTGAAACGACCAGTAGTTAGCCATCCATCAAGAGACGCCTCAACCATCAATTGTAACTCTTTGGCACCAATAACCTTCCCGGAAGGAGGCACAACGCTTGTACCTGCAACAAAAGGTTTCGGGCTCAATGCCTCATTCGCATACTGAGCGACAAGCTGAGAGATTTGCTCACGCAGGTTATTTGCTGTCATTACTTTGATTCCTTAAACTTATTTTCTTAACGAGTAGTTGCAGACATATAGTCGCTGATTTCACGCTTTGAACAAATCAACATATCTTCGCCGCGAATCCATGCTTTATGCCATTTTACGATGCGACCAAGTGTTTCAGTCAATCCCCAACGCGGATGCCATCCTAATTGCATATTTGCTTTAGAGCAATCCAGTTTCAGGTAATGTGCCTCATGAGGATGATTCTCACCATCCAGTAACCAGCTTGCATCATCACCCCAAAGCGTGACCATCTTGTCAACAATAAATTCGACCGTCTTCGCATCTTCATCACGCGGGCCGAAATTCCATCCTTCAGAAAACTTAGCACCTTCTGTATATAAGCGTTGCGCCACCACAATGTAACCAGAAAGAGGCTCCAGTACATGCTGCCAGGGACGGATAGAATATGGGTTTCGAATAATAACCTGCTGGTTATTTTCAAATGAGCGCAGAATATCGGGAATTAAACGGTCTTTAGCCCAATCGCCTCCGCCTATGACATTACCAGCCCTCACAGACGCCAAACCAACGCCATGTTGCTCATAATTTGCAGGATTGAAGAATGAGTTCCGGAATGCAGACGCGACTAA
TTCTGCACAACCTTTACTATTAGAGTATGGATCGTACCCTCCCATGGGTTCGTTCTCACGATAGCCCCACACCCACTCACGATTGTCGTAGCACTTATCACTGGTGATATTTACGACTGCCTTTATGTTACCTACTTGCTTAACTGCTTCAAGCAAATGGACAGTACCCATAACATTTGTTGAGTATGTTTCGATTGGCTGTTCATAAGATAGGCGCACTAAAGGCTGGGCTGCCATATGGAAAACAATTTCTGGCTTAAATTCTGCAATAGAATTGCGCAGCTTTTCAAAATCACGAATGTCGCCAATATGAGATTCCATAAGATCACTAAGACGCACTATCTCAAATAAACTTGGAACAGTTGGCGCATCAAGTGCATAGCCTTTTACAATTGCACCCATTTCAGTCAGCCATAGCGAAAGCCAGCTTCCTTTAAAGCCAGTATGGCCGGTAACGAATACACGTTTACCTTGCCAAAAATTTTTATCAATCATCTACTTACTCCCAGGTTTTCCACGGAGCTTTACCTTTTTCCCACAGCCCTTCAAGGTAAACTTTATCACGTAGGGTATCCATCGGCTGCCAGAAACCTGGGTGTTCAAAAGCCATTAACTCCCCCTGTTGTGCCAATGTCATTAATGGCTCTTGTTCCCAGGTTGTTGCATCGTTATCGATGAGATCGATAACCGATGGATTCAACACAAAGAAACCACCATTGATCATTGCCCCATCGCCTTTCGGTTTTTCCTGGAATGACCGGACCTGACCAGCTTGGATATCTAATGCGCCAAAGCGTCCTGGTGGAAAAGTAGCTGTTAAAGTCGCTTTCTTACCGTGAGCCTTATGGAAATCGATAGTCGCTTTGATATCAAGGTCGGCAACGCCATCACCATAAGTAAACAGGAAAGCCTCGTCATCTTTTACGTATTCAGCAACACGTTTCAGACGACCACCAGTCATTGAAGAATCACCCGTATCAACCAATGTGACATTCCATGGTTCAACACGTTTATGGTGAACTTCCATACGATTTTCAGCCATATGGAATGTTACATCTGACATGTGAAGGAAGTAGTTCGCAAAATATTCTTTAATCACATATCCTTTATAACCACAGCAGATAATAAAATCCTTGATACCATGCACAGAATACATTTTCATAATGTGCCAAAGAATAGGCTTGCCACCAATTTCTACCATCGGTTTTGGTTTTACAATTGTTTCTTCACTTAGTCTGGTACCAAGTCCACCAGCCAGGATGACCGCTTTCATAAATTATCCTCAATATTATTTAGATGCGGTAAATGCATCAGAATAGAAATGTTCTACAGAGAGATTTTTCATCATAAAGTCCTTTTTACTGGCATCGATCATCACAGGTGAACCACATGCATATATATCGAAGAACTCTAGAGAATCAAAATCATCCATCACAGCATGATGGACAAATCCCTTTCTTCCCCCCCATTCGGCGTCATCACCAGAAACAACAGGGATATAATGAACGTTGTCGTGCTGTTCACTCCACTGCTGCGGTAATGCAGAGTAAAAATCTTTACTATCTTGCATTCCCCAGTAGATGTAGATCTCACGACGACATTTTCCCTGAATGAGATGCTCAACCATTGATTTAACTGGAGCGAATCCAGTACCGCCTGCAAGGAAGATTATAGGTCTGTCACTTTCACGAATAAAAAATGTTCCGCAAGGTCCTTCAATGCGCATAAGAGTATTTTCTTGTAACTCCCCAAAAATGAGAGAGCTCATCTGACCATTGGGAACATTCCTTACATGCAACTCAATACCATTCGACTCATCACTATTAGCGATAGAATAACTGCGAGTTACACCTTTATAATGTAAATTGATATACTGCCCTGGAAGGAAGCCAATTTTTGCTGTTGGTGGTGTGCGTAACTTCAAAGTCATAACATCGCCTGAAACCAGTACAGCACTATTTACCTTGCATGGGACAATTTTTTTTGTCTGTCCAGCTAGT
TCAGGAAAAAAATGCGCATTTAGCTCAAGGGCGGTTTTAGGTTTACAGCAGCAGGTTAGTATTTTATCACCCTGTCCAAAAATATTACCTTTGGAGTCAACAACTTCTCCCGCCAACAAATCGGACTCACAGATACCACAATCACCCGCTTTGCAGCTATGTTCAAGATGGATGCCAGCCGATAGCGCAGCATCGAGGATTGATTCATCCTCTCTACCGGAAAATTCAATATTTGATGGAAAAATCTTAATAATATGAGACACGATGCTTACTCTGTTAACAAGGCTTGATGCAGTAAAGGTGCTGCAGCATCTTTTGCTGAAAGCTCAGGCAGCTGAGAAAAAGGCCATTCAATACCTATTGTCTCATCATTCCATAAAATGCTACCTTCCGATGAAGGTGAGTAATAATTAGTTGCTTTGTACAGAAACTCTGCATACTCACTAAGAGTAACAAAACCATGAGCAAAACCTTCTGGAATCCAAAGCTGTCGCTTATTCTCAGCAGATAGATTTACGCCAACCCATTGACCAAAAGTAGGCGATTCTTTTCGGATATCGACCGCAACATCAAAAACCTCACCGACAGCACAACGAACTAACTTCCCCTGTGCATTTTCTCCTCTCTGAAAATGTAGCCCTCTGAGTACGTTCTTTTTGGATTTTGAATGATTATCTTGAACAAATGTAACTTTACGTCCAATCAACTCTTCAAAGGTCTGCTGGTTATAACTTTCAAAAAAGAATCCCCTCTCATCGCCAAAAACTTTAGGCTCTAAGATCAAGACATCTGGTATTGCTGTTTTAATCACAATCATCACTTATAAACCTTTCACCATCTTCAGCAAATATTTGCCATAATCATTTTTTGATAATGGCCCGGCCAGTTCTATAACCTGTTGTGCATTTATAAAATTTTTACGAAATGCGATCTCTTCCGGGCAGGACACTTTTAGCCCCTGGCGTTCTTCGATGGTTGCAATAAAATTACTGGCCTCTATCAAACTCTGATGCGTCCCTGTATCCAGCCAGGCATAACCGCGCCCCATCATGGCGACAGACAATCTTCCCTGATCCATATAGATACGGTTAATATCCGTGATTTCTAACTCACCGCGAGCGGAAGGCTTAAGATTTTTCGCCATCTCCACCACGCTATTATCATAAAAATACAGCCCCGTTACCGCGTAATTACTCTTCGGTTGTAACGGTTTTTCCTCCAGACTAACGGCTGTGCCACTTTGGTCAAACTCAACCACACCGTAGCGCTCCGGATCGTTTACATGATAAGCAAAGACGGTAGCACCACTTTCTTTATTAACGGCAGCTTCCATTAACTTTGGTAAATCATGACCATAGAAGATATTGTCACCCAGTACTAATGCACAATCATCATTACCAATGAACTCTTCACCAATAATAAACGCCTGTGCTAAGCCATCCGGGCTTGGCTGTACTTTATATTGAAGATTCAGCCCCCACTGGCTGCCGTCTCCCAGCAGTTGTTGAAAACGCGGCGTGTCCTGTGGCGTACTGATGATCAGGATATCCCGAATGCCTGCCAGCATAAGCGTGGAAAGGGGATAGTAAATCATCGGTTTATCATAAATTGGTAGCAATTGCTTACTTACCGCCATGGTCACCGGATAAAGACGGGTGCCGGAGCCCCCCGCTAAAATAATGCCCTTACGCGTTTTCATTTCCATTTCTCATTCATAGAAAATGCCCTGATGGGCATTTAAATTTATCAGATGGTTGTCGTCGTAAACATTTCAGTCAGCATGCGCTTAACTCCTAATTCCCATTGCGGCAGAATAAGGTCAAAATTACGCTGAAACTTTTCAGTATTGAGACGCGAATTGCCTGGTCTGCTCGCCGGCGTCGGGTAGGCGCTGGTCGGCACAGCATTAAGCTCAGTCAGCGCAAGCGTTATCCCTGCTTTGCGCGCCTCGTCAAAGACTAAGGCCGCGTAGTCATGCCAGGTTGTGGTTCCCCCGGCAA
CCAGATGGTAAAGACCTGCGACTTCTGGTTTCTTTAACGCCACACGGATCGCATGAGCCGTGCAGTCAGCCAGTAATTCTGCACCGGTTGGCGCACCGTACTGATCGTTGATGACTGAAAGTGTCTGACGCTCTTTCGCCAGACGAAGCATTGTCTTTGCGAAATTATTGCCCTTACCTGCATAAACCCAACTGGTGCGGAAGATAAGATGCTTAGGGCAGTTATCCTGCAGGGCCTTTTCTCCCGCCAGTTTGGTCTTGCCATAGACATTCAGCGGCGACGTAGCGTCCGTTTCCTGCCATGGGATATCGCCGGTACCAGGAAATACATAATCGGTTGAATAATGCACTACCCATGCGCCAGTTTCGTTGGCTGCTTTAGCGATGGCTTCCACACTGGTGGCGTTAAGTAACTGCGCCAGTTCTGGTTCAGACTCTGCTTTATCTACTGCAGTATGTGCTGCTGCGTTAACAATCACATCGGGACGAAGCTTACGAACGGTTTCGGCAACGCCTTTCGGATTACTAAAATCACCGCAAAACTCTTTTGAATGGACATCCAGGGCAATCAGATTCCCTACTGGTGCCAGAGAACGTTGCAACTCCCAGCCTACTTGCCCTGTCTTACCAAAAAGTAAGATATTCATTACTGGCGTCCTTCATAGTTCTGTTCTATCCAACTCTGATACGCCCCACTTTTAACATTGTTTACCCATTGAGTATTTGCAAGGTACCATTCCACTGTTTTACGAATACCGCTTTCAAAGGTCTCCAGCGGTTTCCAGCCTAATTCGCGGCTAATTTTACCTGCATCAATGGCATAACGACGATCATGGCCCGGACGATCCGCGACATAAGTGATTTGTTCACGATAAGAAGTCGCTTTGGGTACAATCTCGTCCAGCAGATCACAGATGGTAAATACCACATCGAGATTTTTCTTCTCATTGTGGCCACCAATGTTATAAGTCTCCCCCGCCTTGCCTTCAGTCACTACCATATGAAGCGCGCGAGCATGATCTTCTACATATAGCCAATCGCGAATCTGATCCCCTTTGCCATAAATTGGCAAAGGCTTTCCTTCCAGTGCGTTCAAAATGACCAACGGAATCAGTTTTTCAGGGAAGTGATAAGGGCCATAGTTATTAGAACAATTGGTAACGATCGTTGGTAGACCATAGGTACGCCGCCAGGCACGGACTAAATGATCGCTGGATGCTTTTGACGCAGAATAGGGGCTACTTGGCGCATATGCCGTCGTTTCAGTAAATAACGGCAGCGTAACGCTGTTTTCAACTTCATCAGGATGCGGTAAATCGCCGTAAACTTCATCAGTGGAAATATGATGAAAACGAAAATTATTTTTTTTATCTTCGCCAAGCGCAGACCAGTATTTACGCGCAACTTCAAGAAGTACATAGGTGCCGACGATATTGGTTTCAATAAATGCTGCTGGCCCGGTAATCGAACGGTCCACATGACTTTCCGCAGCCAAATGCATCACCGCGTCCGGCTGGTACTGCTCAAAAATACGCGTTATTTCAGCGGAATCACAAATATCCGCGTGTTCAAAATTGTAGCGATTACTCTCAGAAATATCAGAAAGGGATTCAAGATTACCGGCGTAGGTTAATTTATCAATATTAACTACAGTGTCCTGTGTATTCTTAATAATATGGCGGACAACAGCTGATCCAATAAAACCTGCCCCGCCAGTAATAAGTATCTTCACTTTTCTATTCCATAAGGCGTATTTAATGTGGTATTTAATTTGCCAATAAAAATTAATTGCTCAAGTCGTTACACACGCTACCGCCCCTGGCTCATCAGCTACCAGTGCACTGCGTACATATCGACTTGTTACAAACCTCGCCCAGCAGGGCAAAGCTCACTAAAACTTAAACGCTAATTGTCTTATTAATTGCATCCGGAAACAAGGATTAATCTTATAAAATCAGCATTAAAATGCTCCAGATAACCCCTTGTTACTTAAGCCCTTTA
TACAAAACTAAAACGGCAGTCAACACTCGCTTCAGCCAACTTGCCGCTTCGAATGTTCACTGCCGTTATTATGTTTATCACCAACCATTTATCACGGTTGTTAATACTTATTCATGCAAAAGCTGCTCTATGCTCTTACGGAACTTCGCTCCTTCTTTCAGGTTGCGCAGCCCGTACTTCACAAATGCCTGCATGTAGCCCATTTTTTTACCGCAGTCATAGCTGTCACCCGTCATTAGCATCGCGTCAACCGACTGTTTTTTCGCCAGTTCTGCAATGGCATCGGTGAGCTGGATACGGCCCCAGGCGCCCGGTTCGGTTCTTTCCAGTTCCGCCCAGATGTCGGCTGAAAGCACATAACGGCCTACCGCCATCAAATCGGAATCCAGCGTCTGCGGCTGATCCGGTTTTTCGATAAACTCCACAATCCGGCTGACTTTGCCTTCATTATCCAGAGGTTCTTTCGTCTGGATAACGGAATACTCCGATAAATCACCTTTCATGCGCTTCGCCAGCACCTGGCTGCGACCCGTTTCATTGAAACGCGCCACCATCGCCGCAAGGTTATAGCGCAGCGGATCGGCGGTAGCATCATCGATAATAATATCCGGGAGTACCACAATGAAAGGGTTATCGCCCACGACCGGACGCGCGCACAGAATAGAGTGCCCCAGCCCTAACGGCTGCGCCTGGCGAACGTTCATAATCGTCACGCCCGGTGGGCAGATAGATTGCACTTCCGCCAAAAGCTGGCGCTTAACGCGCTGCTCAAGAAGTGATTCAAGTTCATAAGAGGTGTCGAAGTGGTTCTCAACGGCGTTTTTAGACGCGTGAGTCACCAGTACGATTTCTTTGATCCCTGCAGCCACAATCTCATCGACAATGTACTGAATCATTGGCTTGTCGACGATCGGTAGCATCTCTTTTGGGATTGCCTTGGTGGCAGGCAACATATGCATACCCAAACCCGCTACCGGTATAACTGCTTTCAAATTCATCATTGTTTCTTCCACCTGTAAAATGGTTGCTGAATTATAGCTCTTTAGCTTGTTTTCGCCAGCATGAATTACTCTGCTGCCAGGGATAATGATGGCACGCTTTACATTACGTCTTAGTCGGCACCATAACATTAAGTATGAACAACTTTTTCCCAGGAATTTTCGTAAAAATAGCGGTACTTACCCTCCCCGCTTCGGCAGCGAAAAATTCACTGCTTCGACATTCACGGTTTGGTGATTAATCCTGTCGATATCCACGGAACTCTGCCCGTTTTCATTGATGGCATGAACATTAGCGAGGGAAAGCAGCGTGTCCTGGCGGGCCATAAATTGACCACGGACATCTTTACGCAAATCGAAATGCATTTTTAACGCCGGGCCAATCGCTGAAGTTTGCATCACGTTGATATTACGCAGAAAGAGGTGCTGCGGTTGATTATGCAGTTCCAGCGTAGCACGCGTCATCCGTACATTGGTGATGGCGACAAAAGAGGGGATGTTGCCGGAGGAAATTTGAATGCCGCGTAATTTATAAGCAACCTGGCGATTATCCAACCGAATAGCGTTTAATTTAAAGTTTTGCGGAATTGACAGGTATTTTCCTTTAACGACGCCATAGCCGATGAGCATCCCAGCACTATTCGTCATATCAATATTATCAATGACGAAATTATCACAGCCATAAATGGCGATCGTTGCGTTATCAATACCCGCATTTTTACTGAAATCGGGCGTGATGTTTTTGGCTTTGACATTGCGAATGACGAAATGTTTGCCATTTTCTACGTGCACCAGCTGTCGGCAATCAGATCCGGTAATATTGGCCACCACAAAGTTTTTTACTGCCTGATCTTCAGGATAACTGTTGTCATAGGTGCTACCCGCCAGCCCGATGCCGATCCCCCAGTTGATTTTGCCATTGGTACAATCAATGCGTTCGATGACATGATCGGAAATCAGGATGTCGCGGTCGTGAATCGCGACATTCCACTCAATGGCGTC
CCCCTGCAAATCGCTAAAGCGGCTATGCGTAATCCGCGCGCCGTCCATTTGGTTATGAAATCCCTGGCGGAGAATGGCGTAGTTGGCGTGGGTAACGGTGATGTCATCGATAATGAGATTACGCATCACCTGCGGTTCCTTACCGCCGATGAAAATTTGCGCGACGGGGCCAAAGCCGCTCATCGTCACGCCTTTAATCACACAGTCCGACCCGCGAACATCCAGCGTCACATTGTGCAGACTGCCGCCCTGCTCCCCCACCACCTGACACCCGTCCTGCAAAATAAACCGTCCCCGGCCATTCCCACGCACCGCGCTCTGTACCCGCAGCGTTTTTCCCGCCGGAATCGTTATCGCCGCATTGATATTTTCACACACCCATCCTGGCGGTACGACCACGGTCTGTCCGTCGGCGAAGGCCTGTTTGAACGAGGCGATACCGTCATCCGCCGGATAATCCTTAATATCGACGGTCTCGCGAGGTTCACGCGCCTGTACCGGCAAGGCGCGCAGAAAAGGAAGAACAGCAAGCGCGGAACCTGCCGTCAGGAGGGTACGTCGGGAGAACTTATTCACGGGCAT
Protein sequences of DBSCAN-SWA_7 >CP028151|2158869:2169376|2167972_2169376_-|AWP50777.1|DBSCAN-SWA MPVNKFSRRTLLTAGSALAVLPFLRALPVQAREPRETVDIKDYPADDGIASFKQAFADGQTVVVPPGWVCENINAAITIPAGKTLRVQSAVRGNGRGRFILQDGCQVVGEQGGSLHNVTLDVRGSDCVIKGVTMSGFGPVAQIFIGGKEPQVMRNLIIDDITVTHANYAILRQGFHNQMDGARITHSRFSDLQGDAIEWNVAIHDRDILISDHVIERIDCTNGKINWGIGIGLAGSTYDNSYPEDQAVKNFVVANITGSDCRQLVHVENGKHFVIRNVKAKNITPDFSKNAGIDNATIAIYGCDNFVIDNIDMTNSAGMLIGYGVVKGKYLSIPQNFKLNAIRLDNRQVAYKLRGIQISSGNIPSFVAITNVRMTRATLELHNQPQHLFLRNINVMQTSAIGPALKMHFDLRKDVRGQFMARQDTLLSLANVHAINENGQSSVDIDRINHQTVNVEAVNFSLPKRGG >CP028151|2158869:2169376|2160209_2161289_-|AWP50770.1|DBSCAN-SWA MIDKNFWQGKRVFVTGHTGFKGSWLSLWLTEMGAIVKGYALDAPTVPSLFEIVRLSDLMESHIGDIRDFEKLRNSIAEFKPEIVFHMAAQPLVRLSYEQPIETYSTNVMGTVHLLEAVKQVGNIKAVVNITSDKCYDNREWVWGYRENEPMGGYDPYSNSKGCAELVASAFRNSFFNPANYEQHGVGLASVRAGNVIGGGDWAKDRLIPDILRSFENNQQVIIRNPYSIRPWQHVLEPLSGYIVVAQRLYTEGAKFSEGWNFGPRDEDAKTVEFIVDKMVTLWGDDASWLLDGENHPHEAHYLKLDCSKANMQLGWHPRWGLTETLGRIVKWHKAWIRGEDMLICSKREISDYMSATTR >CP028151|2158869:2169376|2165439_2166525_-|AWP50775.1|DBSCAN-SWA MKILITGGAGFIGSAVVRHIIKNTQDTVVNIDKLTYAGNLESLSDISESNRYNFEHADICDSAEITRIFEQYQPDAVMHLAAESHVDRSITGPAAFIETNIVGTYVLLEVARKYWSALGEDKKNNFRFHHISTDEVYGDLPHPDEVENSVTLPLFTETTAYAPSSPYSASKASSDHLVRAWRRTYGLPTIVTNCSNNYGPYHFPEKLIPLVILNALEGKPLPIYGKGDQIRDWLYVEDHARALHMVVTEGKAGETYNIGGHNEKKNLDVVFTICDLLDEIVPKATSYREQITYVADRPGHDRRYAIDAGKISRELGWKPLETFESGIRKTVEWYLANTQWVNNVKSGAYQSWIEQNYEGRQ >CP028151|2158869:2169376|2161293_2162067_-|AWP50771.1|DBSCAN-SWA MKAVILAGGLGTRLSEETIVKPKPMVEIGGKPILWHIMKMYSVHGIKDFIICCGYKGYVIKEYFANYFLHMSDVTFHMAENRMEVHHKRVEPWNVTLVDTGDSSMTGGRLKRVAEYVKDDEAFLFTYGDGVADLDIKATIDFHKAHGKKATLTATFPPGRFGALDIQAGQVRSFQEKPKGDGAMINGGFFVLNPSVIDLIDNDATTWEQEPLMTLAQQGELMAFEHPGFWQPMDTLRDKVYLEGLWEKGKAPWKTWE >CP028151|2158869:2169376|2164540_2165440_-|AWP50774.1|DBSCAN-SWA 
MNILLFGKTGQVGWELQRSLAPVGNLIALDVHSKEFCGDFSNPKGVAETVRKLRPDVIVNAAAHTAVDKAESEPELAQLLNATSVEAIAKAANETGAWVVHYSTDYVFPGTGDIPWQETDATSPLNVYGKTKLAGEKALQDNCPKHLIFRTSWVYAGKGNNFAKTMLRLAKERQTLSVINDQYGAPTGAELLADCTAHAIRVALKKPEVAGLYHLVAGGTTTWHDYAALVFDEARKAGITLALTELNAVPTSAYPTPASRPGNSRLNTEKFQRNFDLILPQWELGVKRMLTEMFTTTTI >CP028151|2158869:2169376|2163614_2164493_-|AWP53084.1|DBSCAN-SWA MKTRKGIILAGGSGTRLYPVTMAVSKQLLPIYDKPMIYYPLSTLMLAGIRDILIISTPQDTPRFQQLLGDGSQWGLNLQYKVQPSPDGLAQAFIIGEEFIGNDDCALVLGDNIFYGHDLPKLMEAAVNKESGATVFAYHVNDPERYGVVEFDQSGTAVSLEEKPLQPKSNYAVTGLYFYDNSVVEMAKNLKPSARGELEITDINRIYMDQGRLSVAMMGRGYAWLDTGTHQSLIEASNFIATIEERQGLKVSCPEEIAFRKNFINAQQVIELAGPLSKNDYGKYLLKMVKGL >CP028151|2158869:2169376|2158869_2160183_-|AWP50769.1|DBSCAN-SWA MTANNLREQISQLVAQYANEALSPKPFVAGTSVVPPSGKVIGAKELQLMVEASLDGWLTTGRFNDAFEKKLGEFIGVPHVLTTTSGSSANLLALTALTSPKLGERALKPGDEVITVAAGFPTTVNPAIQNGLIPVFVDVDIPTYNIDASLIEAAVTEKSKAIMIAHTLGNAFNLSEVRRIADKYNLWLIEDCCDALGTTYEGQMVGTFGDIGTVSFYPAHHITMGEGGAVFTKSGELKKIIESFRDWGRDCYCAPGCDNTCGKRFGQQLGSLPQGYDHKYTYSHLGYNLKITDMQAACGLAQLERVEEFVEQRKANFSYLKQGLQSCTEFLELPEATEKSDPSWFGFPITLKETSGVNRVELVKFLDEAKIGTRLLFAGNLIRQPYFANVKYRVVGELTNTDRIMNQTFWIGIYPGLTTEHLDYVVSKFEEFFGLNF >CP028151|2158869:2169376|2163062_2163614_-|AWP50773.1|DBSCAN-SWA MMIVIKTAIPDVLILEPKVFGDERGFFFESYNQQTFEELIGRKVTFVQDNHSKSKKNVLRGLHFQRGENAQGKLVRCAVGEVFDVAVDIRKESPTFGQWVGVNLSAENKRQLWIPEGFAHGFVTLSEYAEFLYKATNYYSPSSEGSILWNDETIGIEWPFSQLPELSAKDAAAPLLHQALLTE >CP028151|2158869:2169376|2162082_2163057_-|AWP50772.1|DBSCAN-SWA MSHIIKIFPSNIEFSGREDESILDAALSAGIHLEHSCKAGDCGICESDLLAGEVVDSKGNIFGQGDKILTCCCKPKTALELNAHFFPELAGQTKKIVPCKVNSAVLVSGDVMTLKLRTPPTAKIGFLPGQYINLHYKGVTRSYSIANSDESNGIELHVRNVPNGQMSSLIFGELQENTLMRIEGPCGTFFIRESDRPIIFLAGGTGFAPVKSMVEHLIQGKCRREIYIYWGMQDSKDFYSALPQQWSEQHDNVHYIPVVSGDDAEWGGRKGFVHHAVMDDFDSLEFFDIYACGSPVMIDASKKDFMMKNLSVEHFYSDAFTASK >CP028151|2158869:2169376|2166901_2167795_-|AWP50776.1|DBSCAN-SWA 
MMNLKAVIPVAGLGMHMLPATKAIPKEMLPIVDKPMIQYIVDEIVAAGIKEIVLVTHASKNAVENHFDTSYELESLLEQRVKRQLLAEVQSICPPGVTIMNVRQAQPLGLGHSILCARPVVGDNPFIVVLPDIIIDDATADPLRYNLAAMVARFNETGRSQVLAKRMKGDLSEYSVIQTKEPLDNEGKVSRIVEFIEKPDQPQTLDSDLMAVGRYVLSADIWAELERTEPGAWGRIQLTDAIAELAKKQSVDAMLMTGDSYDCGKKMGYMQAFVKYGLRNLKEGAKFRKSIEQLLHE |
10 | Enterobacteria_phage(37.5%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotations match specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
DBSCAN-SWA_8 |
2236573 : 2245744
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >CP028151|2236573:2245744|DBSCAN-SWA TATGACTCAAGTCGCGAAGAAAATTCTGGTAACGTGCGCGCTGCCGTACGCCAACGGCTCTATCCACCTCGGCCATATGCTGGAGCACATCCAGGCTGATGTCTGGGTCCGTTACCAGCGAATGCGCGGCCATGAGGTTAACTTCATCTGTGCCGATGACGCTCATGGCACGCCGATCATGCTGAAAGCGCAGCAGCTTGGTATTACGCCGGAGCAAATGATCGGTGAAATGAGCCAGGAGCACCAGACCGATTTCGCCGGTTTTAATATTAGCTACGACAACTACCACTCAACGCACAGCGACGAGAATCGCGAGCTGTCCGAGCTGATTTATACGCGCCTGAAAGAGAACGGTTTTATTAAGAACCGCACTATCTCTCAACTCTACGATCCGGAAAAAGGCATGTTCCTGCCGGACCGATTTGTGAAAGGCACCTGCCCGAAATGTAAATCCGCGGACCAGTACGGCGATAACTGTGAAGTCTGCGGCGCAACTTACAGCCCGACCGAACTTATCGAGCCGAAATCCGTGGTGTCCGGCGCGACGCCGGTAATGCGTGACTCCGAGCACTTTTTCTTTGATCTGCCGTCATTCAGCGAAATGCTGCAGGCGTGGACCCGCAGCGGCGCGCTGCAGGAGCAGGTGGCGAACAAAATGCAGGAGTGGTTTGAATCCGGCCTGCAACAGTGGGACATTTCCCGCGACGCGCCGTATTTTGGTTTCGAAATCCCGAACGCGCCGGGCAAATATTTCTACGTCTGGCTGGACGCGCCGATTGGCTATATGGGCTCCTTCAAAAATCTGTGCGATAAGCGCGGTGACACGACCAGTTTTGATGAGTACTGGAAAAAAGACTCCGACGCCGAGCTGTATCACTTTATCGGCAAAGACATCGTCTATTTCCACAGCCTGTTCTGGCCTGCCATGCTGGAAGGCAGCCACTTCCGTAAGCCGACCAACCTGTTCGTTCACGGTTACGTGACGGTGAACGGCGCGAAGATGTCTAAGTCTCGCGGCACCTTTATTAAGGCCAGCACCTGGCTGAAACACTTTGACGCCGACAGCCTGCGCTACTACTACACCGCGAAGCTTTCTTCACGCATTGATGACATCGACCTGAACCTGGAAGACTTTGTCCAGCGCGTCAATGCCGATATCGTCAATAAAGTAGTCAACCTGGCATCCCGTAACGCCGGTTTTATCAATAAGCGTTTCGACGGCGTGCTGGCGGCTGAACTGGCCGATCCGCAATTGTACAAAACCTTTACTGACGCCGCTGCGGTGATTGGCGAAGCATGGGAAAGCCGTGAATTCGGCAAAGCTATCCGTGAGATTATGGCGCTGGCCGACGTCGCTAACCGTTATGTTGACGAGCAAGCGCCGTGGGTGGTGGCTAAACAGGAAGGCCGCGACGCTGACCTGCAGGCCATTTGCTCGATGGGCATCAACCTGTTCCGCGTGCTGATGACGTATCTGAAACCGGTACTGCCGACGCTTTCTGAACGCGTTGAAGCCTTCCTGAACAGCGAACTGAACTGGGATGCCATCGAACAGCCGCTGCTCAGTCACAAGGTCAACACCTTTAAGGCGCTCTACAATCGCATCGACATGAAGCAAGTTGAAGCGCTGGTTGAAGCGTCTAAAGAAGAGGTGAAAGCCGCAGCCGCACCGGTTACCGGCCCGTTAGCCGACTTCCCGATTCAGGAAACCATCACCTTTGACGATTTCGCCAAAATTGACCTGCGCGTAGCATTGATTGAAAACGCTGAGTTCGTGGAAGGCTCCGACAAATTGCTGCGTCTGACGCTGGATCTGGGCGGCGAGAAGCGTAACGTCTTCTCCGGCATTCGTTCCGCCTACCCGGACCCGCAGGCGCTGATCGGCCGCCAGACGGTAATGGTCGCCAACCTCGCGCCGCGCAAAAT
GCGCTTTGGCGTCTCCGAGGGAATGGTGATGGCCGCAGGCCCTGGCGGGAAAGATATCTTCCTGTTAAGCCCTGATGACGGCGCGAAGCCTGGCCAACAGGTGAAATAAGCAACAAGCCGGAGCATGCTCCGGCTTTTTTAACCGCTTAATCCTGACTGGCATATCGCCTCCTCCGCACGTTTTAACTTTTTCCTTAATAAGCAATAGCGCTAAGACTCCATATCCGGACTGCTAAATAACGGCTAAAAGTCATATTCTATCTCTCCCTGATACATTGCTATTACTGGGTTAAATTAATTCATTGAAATTATTTTAAAAGCCCAATTATTATATAAAATGGAGTTTTAGATGAAAATTTCTGGCAAGTTATTGTCCACGGCTTTGGCTTCCGTACTGGTGTTCTCTCTTGCTGGCTGTGGCGATAAAGAAGAATCAAAGACCTTTAACGCAAACCTGGCGGGGACAGAAATTTCAATTACTTACACCTATAAAGGTGACAAAATCATTAAGCAGACGTCTGAAAGTAAAATCAGCTATGCCACTGTAGGCGCTAAAACGAAAGAAGATGCCGCCAAAATTCTCGATCCGCTGAGCGCGAAATATAAAAATATCGCCGGAGTGGAAGAAAAATTAACCTATGAAGATACCTATGCCCAGGAAAACGTCTCTGTGGATATGGAAAAAGTGGACTTTAAAGCGTTACAGCAAATCTCAGGGACGATGGTGTCCGGCGATACCAGCAAAGGTATCAGCATGAAACAAACCCAGACGCTGCTGGAAGCTGCTGGTTTTAAAGAAGCGAAATAACTGGCAGGCATAATGTATTGCATCGACTGGTAAAGTCGCTCAGGGCGCTGCTTCGGCAGCGTCTTTTCTTTATGAATTCCGAAAAAAGACAGCCTCGCTGTAAGTCCCCTCGCCAAAGCCTATAATCCCTGAATTCCTGGCCACACCAATACTTAACGACAAGGAATTGTTATGCAGGTTCTACGTCTTATGGCACTGCCACTATTCGCGCTCTCTCTATCGGTTAGCATAACTGGCTGCGATCAGAAAAACGATACTCTCCAGGGAAAGCAAAATAACATGACAGCGTTTATCAAGAAGATAGCCGCTAGCAAAGAGTCAGAGGAAACACAACGCTATGTAGGTAATCTCAACGGTATTGAAATCAAGTTAACCTATTACTACAAAGGGGATATCGTTTTACGTCAAATATCTGAACATAAACTACTTTATAAGACCCTGAAAGCCAATAATAAAGAAGAAGCACAAAAAATGCTGAGTCAAGTCGGCGAAGCTTATCAGGGTATGCCGGATTTGACTGAACGAATCGACTATTATGATAGCTATGCTACGGAATATGTGGATATTGATTTTACCCAGGCAAAAATAAGCGACCTCTGTAAATTGCCAGGATCATCAATTGACAACTGTTCCGCGTACTATCTGTCAATGATTCGCTCGCAGAAACTGTTGGAAGAGAGCGGGTATCATAGAATCAATTAGTATAATAATGCGTTTTCCCGGTCAGACAGGACGCTGCCGGGAACAATACCGCAGAATTACTTCGCCGCGTGTTCACGGGCCGCCAGACCGCGCAGAAAATAACGCATAAACTGATCGCCGCATTCGCGGAAGTTCTTATGGTCCGGCGCGCGCATCATCGCGGTGATCTCTGACATTGAGACACGAAACAACTGACCGGTAAGTATCGCCAGGATATCATCTGTTTTTAGCGAAAAGGCAATACGCAGCTTTTTCAGCACAATATTGTTGTTGATACGACGTTCCGCCGTCAATGCAGGCGCCGCCTCATCTTTGCCGCGTTTTTCATAAATGAGGCCATTGAGAAATGAGGACAACACGATATCCGGGCAACGCTGAAACCCCTCTTCCTCTTCTTTGCGCAACCAGATTGCTATCTGCTCCGGCGTAGCATCAACGTTACCCAGCGCCAGGATACGCGCCAGATCGGTATTATTAGCTTTTAAAATGT
AGCGCACGCTACGCAGAATATCGTTACTCAGCATGAGGCCTTCGGTCGTTTCTATGGCAAAACGATATTCTAACAGTCTTTTACAGGCCAATCGCCTCTTTTAAACTTTTCAGATAGCGACGGCTTACCGGCACCGTCAGGCCGTTGCGTAAAATCAGCTCTGCCTGCCCATTATCCTCCAGCCGAATTTCCTGCAAATGGGCCATATTCACCAGAAACTGACGATGACAACGCAGTAGCGGCGTCCGGCTTTCCAGCGTGCGCAGCGTCAGCTCGGTAAACCCCTCTTTCCCTTCACTGCTGGTCACATAAACGCCGCTCATACGGCTACTGACAAAGGCGACATCATCCATTTGCAACAAATAGATCCGGCTGTGTCCGGTACAGGGAATGAATTTAAGCGCCTGCTGGTTTTCCGGCAACAACGAAACATCCTGTTTACTGCGCTCCTGACGCAGACGATGTAACGTTTTTTCCAGCCGTTTCTCCTCTATCGGCTTGAGCAGATAATCAAAAGCGTGTTCTTCAAAGGCTTTGATGGCGTATTCGTCAAACGCGGTTAAAAAAACGATATACGGGCGGTGTTCCGGATCAAGCATTCCTACCATCTCCAGTCCACTGATACGCGGCATCTGAATATCCAGAAACAGCACATCAGGTCGCAACTTATGTACCGCGCCAATCGCTTCTACCGCGTTCGCGCACTCTCCCACAATCTCAATGTCATCCTGCCCCTGGAGCAAAATCCGCAGATTTTCCCGCGCTAACGGCTCATCATCCACAATCAGCACTTTAATCATGCGTCCTCCTCCAGTGGAAGTCGTAATGTAATTCGGGTAAAACAGTCCGGCTCGCAGGCCACGCTAATACCATAATCATCGCCAAAGTGTTCGCGCAGACGTTTATCAACCAGACTCATCCCCAGCCCGCTACTGCCGGCGGAAGGCTGATACAGTCCCGCATTATCCTCAATATCTAACATCAAATGCTGCCCTTCGCGCCGGGCGCGAATAGCGACGTTGCCGGTATCAAGCAGTTGCGACGTGCCATGTTTAATGGCGTTCTCAACAATCGGCTGTAATGTAAACGCAGGCAATTTCTGACGTGAAAGCGTCGATGGAACATCAAGCTGTACCTGCAGACGCGACTGAAAACGCGCTTTTTCAATTTGCAGATAAGCGTTTACGTGTTCAATTTCATCCGCCAGCGTGACGATTTCCGACGGGCGTTTTAAATTTTTGCGAAAAAAGGTCGACAAGTACTGCACCAGTTGGCTGGCCTGTTCGCTGTCGCGGCGAATCACCGCTTTAATGGTATTGAGCGCGTTAAACAGAAAATGCGGGTTCACCTGCGCGTGCAACAGCTTGATCTCTGACTGCGTCAGCAACGCCTTCTGCCGTTCATACTGCCCGGCCAGGATCTGCGCGGATAAAAGCTGCGCAATACCCTCTCCCAGGGTGCGGTTAATTGAGCTAAACAGCCGGTTTTTCGCTTCGTACAATTTAATGGTGCCCATGACTCGCTGATTTTCGCCACGCAGCGGGATCACCAGCGTCGAGCCGAGTTTACACTGCGGGTGTAGCGAACAGCGATACGGCACTTCGTTGCCATCGGCATAAACCACCTCTCCGGTTTCAATTGCTTTCAGCGTATAACCTGATGAAATGGGTTTGCCCGGTAGATGGTGATCGTCGCCAATACCAGTAAAAGCCAGCAGTTTTTCGCGATCGGTGATGGCGACGGCGCCAATATCCAGCTCCTGATATAACACCTGCGCCACCTTCATACTGTTCACTTCGTTAAATCCCTGACGCAGAATCCCCTCCGTTGACGCGGCGACCTTCAGCGCGGTAGCAGAAAATGCCGAAGTATATTTTTCGAACATGGCGCGCTTATCGAGCAAAATACGCATGAACAGCGCGGCGCCAACGGTATTCGTCACCATCATCGGCGCGGCAATATTACTGACCAGATGCAAGGCATCGTCAAACGGCCTGGCT
ATCAGTAAAATGATCAGCATCTGCACCAGTTCGGCAATACACGTAATTGCTCCCGCCGTCAGCGGGCTAAACACTTTGTCCGGGCGTCCGCGACGTATGAGAACGCTGTGTACCAACCCGCCCAGCAGCCCTTCGACGATGGTGGAAATCATACAGCTCAGCGCCGTCATGCCGCCCATAGAATACCGATGTAACCCACCGGTCAGACCGACCAGCCCGCCGACGACCGGCCCGCCGAGTAGGCCGCCCATCACCGCGCCAATTGCGCGGGTATTGGCAATCGAATCTTCGATATGCAGCCCAAAATAAGTGCCCATGATGCAGAAGATAGAAAACGTGACGTAACACAGAAGCTTGTGCGGCAGACGAACCGTGACCTGCATAAGCGGGATGAACAGACGCGTTTTACTCATTAGCCACGCAATGACCAGAAACACGCACATCTGCTGAAGCAGCAGCAACACCAGATTAAACTCGTACATACCCGCAAACCACACTTCAATTAAAAGCGCGTAACATACATTGAGTACGATTAACTTTCTTTGAACTGTTGCATAAAAATATGAATTCGTGAATACGATCACTTAAACACCCCGCCGCAACCCGCTACTTCGCGTTTTAATGCATAAAAAACAGGCAAAACTTCCTGGTTCGTAAAAGAGCGTCTAAAGTTAAACCGGGACCTCGCGAGCAAGGGTGAAACGATGGCGCTTTACACAATTGGTGAAGTGGCTTTGCTTTGTGATATCAATCCTGTCACGTTGCGCGCGTGGCAGAGACGTTATGGACTTTTAAAACCACAGCGAACGGATGGCGGTCATCGTCTGTTTAACGATGCCGATATCGACAGAATCCGCGAAATCAAGCGCTGGATAAATAACGGCGTCCAGGTCAGCAAAGTCAAAGTGCTGCTCAGTAGCGACAGTAGCGAACAACCTAACGGCTGGCGCGAACAGCAGGAGATCCTGCTGCACTACCTGCAAAGCAGTAATCTGCACAGTTTACGGTTATGGGTCAAAGAACGCGGTCAGGATTATCCAGCCCAAACATTGACCACTAACCTGTTCGTCCCGCTGCGGCGACGATTACAGTGCCAACAACCTGCCCTTCAGGCGCTGCTCGGCATTCTTGACGGTATCCTGATCAACTATATTGCGCTCTGCCTGGCGTCTGCGCGTAAGAAACAGGGAAAAGATGCGTTGGTGATCGGCTGGAATATCCATGATACCACCCGCCTGTGGCTGGAAGGTTGGGTCGCCAGCCAACAGGGATGGCGAATCGACGTGCTGGCGCATTCGCTTAGCCAGTTCCGCCCGGAACTGTTTGACGGCAAGACGTTACTGGTATGGTGCGGAGAAAACCAGACGCTGGCGCAGCAGCAGCAACTCCTGGCATGGCGCGCCCAGGGACGCGACATTCATCCCCTTGGCGTTTAAACAGCAGCTAACAAATTCGCTTTAATGTATACTCCTTTTATTAACATAAGGAGTACATAATGCGCGTAGCGAAAATCGGGGTGATCGCCCTTTTCCTGCTGATGGCTATTGGCGGTATCGGCGGCGTGATGCTGGCAGGTTACAGTTTTATTTTGCGTGCCGGGTAAGCGCGCGCGTCAGCCTTTCAAACAGGCGATCGATAATGATCGCCGCCAGCGCCACCAGCAGCGCCCCCTGGATAACATAGGCCGTATTAAAGCCGCTAAGCCCGATAATGATCGGCGTGCCTAACGTACTGGCCCCCACTGTTGAAGCGATGGTCGCCGTACCAATATTGATAATCACCGAGGTTCGGATGCCCGCCAGAATCACCGGCGCGGCCAGCGGCAGTTCAACCTGATACAACTGTTGGCGACGGCTCATTCCCATACCGCTGGCAACGCTCATCACGCTGGCAGGCACCGCGCCCAGCCCGGCCAGGGTCGCCTGCAGGATGGGCAACACTCCATACAGGATCAAGGCGATAATGGCTGGTTGCTGACCAAAACCCATGACGGGTACCGC
GATCGCCAGTACCGCGACCGGGGGAAAGGTCTGCCCGACGGCGGCGATAGTCTCCACCAGGGGACGAAACTCCTTCCCACTTTCTCGCGTGACCGCAATCCCTGCGCCGACGCCCACCACGACGGCAAACAGACTTGAGATGCCCACCAACCAGAAATGGGCGAGCGCGAGGGCGGCAAAACTCTCCTGTTGGTAGACCGGGCGCGGTAAATCGGGAAACAGCGCGGCGAAGAACGGCTGGCTATAAGGCAATCCAAACAGCAGAAGCAAGAACAGAACAATAAGCCAGAGAAGCGGATCACACAGTCGTTTCACGGGGGGACGTCTCCGAAAGTAGATCGCGGAAATGGAGCGTACCGCAGGGCTCGCCCTGCTGATTCGCCACCGGCAGGACGTCGCACCGACGGGCGACAAACATCGATAGCGCATCGCGTAGCGTCATCTCTTCCACCAGCGCGTCGCCGCTGAGCTGTTCATGCCGACGTACATAATCGCCTACGCTACGTAACGAAAGCAGCCTTACGCCCAGCTCGCTGCGGCCAAAAAACGCCTGCACGAAATCATTTTCCGGCGAGGTCAGCATAGAAAGTGGCGATCCCTGTTGGATAACATTGCCCCCGTCCATCAGCACCAGATGGTCGGCGAGGCGTAGCGCCTCGTCGATGTCGTGCGTCACCAGTACGATGGTGCGCCCCAGTAGCTGATGAATGCGGGTCATCTCCTGCTGCAATGCGCCGCGCGTTACCGGATCAAGCGCGCCGAAAGGCTCGTCCATCAGCAATACCTGCGGATCGGCAGCCAGCGCCCGCGCAACGCCGACCCGCTGCTGTTGCCCGCCGGAAAGCTGATGCGGATAACGATCGCGCAGCGCGCTTTCCAGACCCAATAATGCCATCAGTTCATCAATACGATCGTTAATCCGCGCACGCGACCACTTTTGTAGTTGCGGTACGGTGGCGATATTTTGCGCCACCGTCCAGTGGGGAAAAAGACCGATAGACTGAATGGCATAGCCCATGCGACGGCGCAGTTCAAGCACCGGCAGGCTGCGGATCTCTTCCCCGGCAAAACGGATCGTTCCGCTATCATGCTCTACCAGCCGGTTAATCATCTTCAGAGTGGTCGATTTTCCCGAACCGGAGGTGCCAATTAACACCGAAAAGCTGCCTTCGCTAAAGTGCAAATTGAGGTCGCTAACAGCCTGTTGATCGCCGAAGGTTTTACTGACATGGTTAAATTCAATCAT
Protein sequences of DBSCAN-SWA_8 >CP028151|2236573:2245744|2243202_2243934_+|AWP50837.1|DBSCAN-SWA MALYTIGEVALLCDINPVTLRAWQRRYGLLKPQRTDGGHRLFNDADIDRIREIKRWINNGVQVSKVKVLLSSDSSEQPNGWREQQEILLHYLQSSNLHSLRLWVKERGQDYPAQTLTTNLFVPLRRRLQCQQPALQALLGILDGILINYIALCLASARKKQGKDALVIGWNIHDTTRLWLEGWVASQQGWRIDVLAHSLSQFRPELFDGKTLLVWCGENQTLAQQQQLLAWRAQGRDIHPLGV >CP028151|2236573:2245744|2238847_2239306_+|AWP50832.1|DBSCAN-SWA MKISGKLLSTALASVLVFSLAGCGDKEESKTFNANLAGTEISITYTYKGDKIIKQTSESKISYATVGAKTKEDAAKILDPLSAKYKNIAGVEEKLTYEDTYAQENVSVDMEKVDFKALQQISGTMVSGDTSKGISMKQTQTLLEAAGFKEAK >CP028151|2236573:2245744|2240578_2241298_-|AWP50835.1|DBSCAN-SWA MIKVLIVDDEPLARENLRILLQGQDDIEIVGECANAVEAIGAVHKLRPDVLFLDIQMPRISGLEMVGMLDPEHRPYIVFLTAFDEYAIKAFEEHAFDYLLKPIEEKRLEKTLHRLRQERSKQDVSLLPENQQALKFIPCTGHSRIYLLQMDDVAFVSSRMSGVYVTSSEGKEGFTELTLRTLESRTPLLRCHRQFLVNMAHLQEIRLEDNGQAELILRNGLTVPVSRRYLKSLKEAIGL >CP028151|2236573:2245744|2239477_2240008_+|AWP50833.1|DBSCAN-SWA MQVLRLMALPLFALSLSVSITGCDQKNDTLQGKQNNMTAFIKKIAASKESEETQRYVGNLNGIEIKLTYYYKGDIVLRQISEHKLLYKTLKANNKEEAQKMLSQVGEAYQGMPDLTERIDYYDSYATEYVDIDFTQAKISDLCKLPGSSIDNCSAYYLSMIRSQKLLEESGYHRIN >CP028151|2236573:2245744|2243993_2244101_+|AWP50838.1|DBSCAN-SWA MRVAKIGVIALFLLMAIGGIGGVMLAGYSFILRAG >CP028151|2236573:2245744|2236573_2238607_+|AWP50831.1|tRNA|DBSCAN-SWA MTQVAKKILVTCALPYANGSIHLGHMLEHIQADVWVRYQRMRGHEVNFICADDAHGTPIMLKAQQLGITPEQMIGEMSQEHQTDFAGFNISYDNYHSTHSDENRELSELIYTRLKENGFIKNRTISQLYDPEKGMFLPDRFVKGTCPKCKSADQYGDNCEVCGATYSPTELIEPKSVVSGATPVMRDSEHFFFDLPSFSEMLQAWTRSGALQEQVANKMQEWFESGLQQWDISRDAPYFGFEIPNAPGKYFYVWLDAPIGYMGSFKNLCDKRGDTTSFDEYWKKDSDAELYHFIGKDIVYFHSLFWPAMLEGSHFRKPTNLFVHGYVTVNGAKMSKSRGTFIKASTWLKHFDADSLRYYYTAKLSSRIDDIDLNLEDFVQRVNADIVNKVVNLASRNAGFINKRFDGVLAAELADPQLYKTFTDAAAVIGEAWESREFGKAIREIMALADVANRYVDEQAPWVVAKQEGRDADLQAICSMGINLFRVLMTYLKPVLPTLSERVEAFLNSELNWDAIEQPLLSHKVNTFKALYNRIDMKQVEALVEASKEEVKAAAAPVTGPLADFPIQETITFDDFAKIDLRVALIENAEFVEGSDKLLRLTLDLGGEKRNVFSGIRSAYPDPQALIGRQTVMVANLAPRKMRFGVSEGMVMAAGPGGKDIFLLSPDDGAKPGQQVK 
>CP028151|2236573:2245744|2241294_2242980_-|AWP50836.1|DBSCAN-SWA MYEFNLVLLLLQQMCVFLVIAWLMSKTRLFIPLMQVTVRLPHKLLCYVTFSIFCIMGTYFGLHIEDSIANTRAIGAVMGGLLGGPVVGGLVGLTGGLHRYSMGGMTALSCMISTIVEGLLGGLVHSVLIRRGRPDKVFSPLTAGAITCIAELVQMLIILLIARPFDDALHLVSNIAAPMMVTNTVGAALFMRILLDKRAMFEKYTSAFSATALKVAASTEGILRQGFNEVNSMKVAQVLYQELDIGAVAITDREKLLAFTGIGDDHHLPGKPISSGYTLKAIETGEVVYADGNEVPYRCSLHPQCKLGSTLVIPLRGENQRVMGTIKLYEAKNRLFSSINRTLGEGIAQLLSAQILAGQYERQKALLTQSEIKLLHAQVNPHFLFNALNTIKAVIRRDSEQASQLVQYLSTFFRKNLKRPSEIVTLADEIEHVNAYLQIEKARFQSRLQVQLDVPSTLSRQKLPAFTLQPIVENAIKHGTSQLLDTGNVAIRARREGQHLMLDIEDNAGLYQPSAGSSGLGMSLVDKRLREHFGDDYGISVACEPDCFTRITLRLPLEEDA >CP028151|2236573:2245744|2244081_2244813_-|AWP50839.1|DBSCAN-SWA MKRLCDPLLWLIVLFLLLLFGLPYSQPFFAALFPDLPRPVYQQESFAALALAHFWLVGISSLFAVVVGVGAGIAVTRESGKEFRPLVETIAAVGQTFPPVAVLAIAVPVMGFGQQPAIIALILYGVLPILQATLAGLGAVPASVMSVASGMGMSRRQQLYQVELPLAAPVILAGIRTSVIINIGTATIASTVGASTLGTPIIIGLSGFNTAYVIQGALLVALAAIIIDRLFERLTRALTRHAK >CP028151|2236573:2245744|2244796_2245744_-|AWP50840.1|DBSCAN-SWA MIEFNHVSKTFGDQQAVSDLNLHFSEGSFSVLIGTSGSGKSTTLKMINRLVEHDSGTIRFAGEEIRSLPVLELRRRMGYAIQSIGLFPHWTVAQNIATVPQLQKWSRARINDRIDELMALLGLESALRDRYPHQLSGGQQQRVGVARALAADPQVLLMDEPFGALDPVTRGALQQEMTRIHQLLGRTIVLVTHDIDEALRLADHLVLMDGGNVIQQGSPLSMLTSPENDFVQAFFGRSELGVRLLSLRSVGDYVRRHEQLSGDALVEEMTLRDALSMFVARRCDVLPVANQQGEPCGTLHFRDLLSETSPRETTV >CP028151|2236573:2245744|2240064_2240532_-|AWP50834.1|DBSCAN-SWA MLSNDILRSVRYILKANNTDLARILALGNVDATPEQIAIWLRKEEEEGFQRCPDIVLSSFLNGLIYEKRGKDEAAPALTAERRINNNIVLKKLRIAFSLKTDDILAILTGQLFRVSMSEITAMMRAPDHKNFRECGDQFMRYFLRGLAAREHAAK |
10 | Enterobacteria_phage(66.67%) | tRNA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotations match specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
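The protein records in the prophage regions above use pipe-delimited headers such as `>CP028151|2236573:2245744|2236573_2238607_+|AWP50831.1|tRNA|DBSCAN-SWA`. The following Python sketch shows one way such a header could be split into fields and checked against the phage-related keyword list. It is only an illustration, not the tool's own implementation. The names `parse_header` and `has_phage_keyword` are hypothetical, and treating the optional field before the trailing `DBSCAN-SWA` tag as the keyword annotation is an assumption drawn from the records shown here.

```python
import re

# Keyword list taken from the report text; matching is a simple
# case-insensitive substring test (an assumption, not the tool's rule).
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "trna",
)

# contig | region_start:region_end | gene_start_gene_end_strand | protein_id
# | optional annotation tag | DBSCAN-SWA
HEADER_RE = re.compile(
    r"^>?(?P<contig>[^|]+)\|(?P<region_start>\d+):(?P<region_end>\d+)\|"
    r"(?P<gene_start>\d+)_(?P<gene_end>\d+)_(?P<strand>[+-])\|"
    r"(?P<protein_id>[^|]+)(?:\|(?P<tag>[^|]+))?\|DBSCAN-SWA$"
)


def parse_header(header):
    """Split one protein header into a dict of named fields."""
    match = HEADER_RE.match(header.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {header!r}")
    fields = match.groupdict()
    for key in ("region_start", "region_end", "gene_start", "gene_end"):
        fields[key] = int(fields[key])
    return fields


def has_phage_keyword(text):
    """Return True if text contains any phage-related keyword."""
    if not text:
        return False
    lowered = text.lower()
    return any(keyword in lowered for keyword in PHAGE_KEYWORDS)


if __name__ == "__main__":
    record = parse_header(
        ">CP028151|2236573:2245744|2236573_2238607_+|AWP50831.1|tRNA|DBSCAN-SWA"
    )
    print(record["protein_id"], record["tag"], has_phage_keyword(record["tag"]))
```

Headers without an annotation field, such as `>CP028151|2236573:2245744|2240578_2241298_-|AWP50835.1|DBSCAN-SWA`, parse with `tag` set to `None` and are reported as having no keyword hit.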
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
18115 : 44421
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP028152|18115:44421|DBSCAN-SWA CATGAGTCAGCCACCGTTACCTGCTGTCTGTACGCAGGCAGCGTCCGCCCTGCTGCCGGTGGCCATTGATTACCCGGCGGCCCTCGCACTGCGCCAGATGGCAATGCAGCATGACGACTACCCGAAATATCTGCTGGCACCGGAGGTAAGTGCCCTGCTCCACTACGTCCCTGATCTGCATCGCAGGATGCTGCTGGCCACTCTATGGAATACCGGCGCACGGATTAACGAGGCGCTGGCCCTGACACGGGGAGACTTTTCGCTGGCACCACCTTATCCGTTTGTGCAGCTGGCCACTCTGAAACAGCGGGCGGAAAAGGCGGCCAGAACGGCAGGACGGATGCCGTCCGGCAGCCAGCCCCACCGCCTGGTGCCGCTTTCTGATAACCAGTATGTCAGCGAACTGCAGATGATGGTGGCCACGCTGAAAATCCCGCTGGAGCGTCGTAACAGACGTACCGGCAGAACGGAAAAGGCGCGCCTCTGGGAGATCACCGACCGGACGGTCCGTACCTGGATTGGGGAGGCGGTTGAGGCCGCAGCGGCTGACGACGTGACGTTCTCAGTCCCGGTGACACCCCATACATTCCGCCACTCCTATGCGATGCACATGCTGTACGCCGGCATACCGCTGAAGGTGCTGCAGGCGCTGATGGGACACAAATCGGTGAGCTCGACGGAAGTGTACACGAAAGTGTTTGCGCTTGATGTTGCCGCACGACACCGGGTGCAGTTTCAGATGCCGGGTGCTGATGCAGTGGCTATGCTCAAAGGAGGTTCATAGAGACGTGTATGCATTTTTCAGCCTTTCGCCTGCAACAGGCCATCCGGAACCGGGAGTTTACGCCGTTTTATCAGCCCATTGTCTGCGCCACAGGAGGGGAGGTGGTGGGCTGCGAAATGCTGGCCCGCTGGCTGCATCCGCAGAAGGGCCTGCTGAGCGCCGGGAACTTTATTCCCGCCATTGAAGCCACCGGCCTCGGCGGAGCGCTGCTGCGCGGACTGGCCGACGAGGTCTGCGGGGACGGGCAGGACCTTGCCCGCAGTGCCGGCCGCAGGCTGATGATGACGCTGAATCTCAGCCTGAGCCTGGTTATGACGCCGCTTTTCCGTCCGCACCTGCTGGCCCTCAGCATCCGGCTGGAGCAGGCCGGCATGACCCCCGTCTTTGAAATCACCGAGCGGGAAGATATCCGCGCCTTTCCTCAGGCGGCGGTATTCCGCCAGCTTGCCGCCGGGGGGCTGCGGTTCGCCGTGGACGACTTCGGCACCGGTCATGCCGGTCCCGCCAGCACGGTGGCTGATCGGATGATTGCGCGCACCGTCAGTCTGGCCCGGTGCCAGGGGGCCCGGGTCATTGCCGAAGGTATCGAGACCCCGGCGCAGGCGGCGCGCCTGCGGGACGCGGGGGGCGATTACCTGCAGGGCTGGCACTGCGGCGCCCCGATGCCCTTCGGGCTGTTTCATTTCCGGCTGACGCAAAAAAGCCAGCCGGCCTTTGGTTAAGTTATGCGCACAGGAGTCATTTCTATGAACCAGCCGTTGTTATTCCGGACCGTTGCCGGCAGGCAGAGCAACCATGAAGGGTTTTACATGCCCCCGGGCATCCGCCATCTGGGCACCCTGTCCCTGTACCGGGCCGTGGCCTGGTGGGGGCTGTTTCTGGGCCGGGAGTTTACCCGCGATGACGTCAGCGAGGCGTTCAGCATTGAGCCCCGCCGTGCCAGCGGTATCCTCAATTACATCTGTAACCGTCACAATGACGACGACATTTGCTTTGATTCCCGCCTGCACCCCGTGCGGGGAGGCCGCGCCCAGCTCGTGGTGCGTATCCGGGCAGTGGAATCCCGCCCGGACACCATCCGCCGGCAGCGGACAGACCGGCCGGGCGGGAAGGTGAGTGACCGGCAGTATGACCGCCAGATGGCGC
ACTGGCTGCTGTCCCGCCCGGCCGGCGGTGATACGGCAAAACTAGCCGCCTGGCAGGCAGCCTGCCCGGTCCGGGAGGCATCATGCTGATTATCGTCAGTGACTGTCAGTTTACCCGCCTGGCGCTGACGCGCCTGCTCGCACATCTGGACCCCGTGAACATGAGTGTTGCCCGGTGGTTACAGACCGCGCCGCCGGCAGGCAGCCATGTGCTGCTGGCGGCCTCTCCCGGCATGCTGGCATCGCTGGTCCCGGCCTGCCATCACGCCAGGACATCACTGTCCCTGAAACTGGCGCTGCTGGGCTCAGGAGGACAGGCTGTCTTCCTGAACACCCTCGGTCTCAGGCCGGACTGCCTGCTGCCGCGCACCGCCTCCGCGACTCAGCTGAAGGCCGCGGTCAGCAGCTGGCTGCGCCGTGCGCGGGGCTGGCGGCAGGCGACGACGGAGAGTCTCTCCCTGCGCGAGCGACAGGCGCTGTGTGCCACGCTCGCCGGCCTCTCCGCGCGCACGGCGCCCGGCTCGGCATCAGCCCCAAAACCTTTTACGGCCATCGCCGCAGCGCGCTGAACCGCCTGGAGCTGCTGGCCAATCCGGTGATCGGCCTGCCTGACCCTGCGGCGCACTACAGCATGATGCTCCACCCGGGGTCGCCGCCCTTTTTCAGCAGCCGGACGACCGGCGCCGGTAGCCACAACGTTTCTGCACATCACCCCGTGCACCTTCCGGTAACACACCGTTCTCATTATACGGACATGACACTATGAAATTATCCTCCTGTACTCCCCGGCGTGGACGGGGTGTACCTGGGCCACATTGTGGCTGCAGGCCACCACACCGCTGGTTTTAACGCTCACGCCGGCCATGACGGCCCGGGCGGACATCGCGTCCCGGATCGGGCAGAGCCCGTTCTCTGCCGACCGGGATGCCGCCCTCGCCGGGATCCGGACCGTGCCTTATACCCTGAAAAAGGGGGAGACGGTGGCGCAGGCGCACGGCCTGACCGTCCCACAGCTGAAAAAACTGAACGGGCTCCGCACTTTCGCCCGCGGCTTTGACCACCTGCAGGCCGGCGACGAGCTTGACGTGCCGGCGGTCCCGCTGACCGGCGGGAAAGGTGACAATAACCGCCATGACGCCCGCGGTCCGTTTGCTGCTGACCGGGAAAATGAGGACGCGCAGGCGCAGCAGATGGCCGGCATGGCCTCACAGGCCGGCAGCTTCCTCGCCAGCCATCCGGACGGTCAGGCCGCCGCCGGGATGGTTACCGTCACTTCCGGCCGTCTGACATGCTCGGGGCGAACATCTTCCTCGACTATGACCTCTCCCGCGACCACGCCCGCGCCGGCTTCGGCGGCGAATACTGGCGCGACTTCCTTAAGCTTTCCGCCAACGCCTATGTCGGCCTGACCGGCTGGAAAACCTCCCCGGATGTGGAGGACTACGAAGAGCGTCCGGCCAGCGGCTGGGACCTGCGTGCTGAAGGCTACCTGCCGTCGTACCCGCAGCTCGGGGCGAAGATGGTGTATGAGCAGTATTACGGAAATGAAGTGGGCCTGTTCGGGAAGGATGAGCGGCAGAAGAACCCGCACGCCCTGACCGCCGGGGTCAGCTGGACTCCGGTGCCGCTGCTGAAGCTCAGCGCGGAGCAGCGCGCCGGCAAGGCCGGGGAGCATGACACCCGCTTCGGGGCAGAAGCGAGTTACCGCATTGGTGACAGCCTGCGCAGCCAGCTGGATCCGGATGCCGTGGGCGCCCTGCGCAGCCTGGCGGGCAGCCGCTACGACCTGACTGACCGTAACAATGACATTATCCTCGAGTACCGTAAGCAGGAGGTGACGTGCCAGTAATTTCACGATGCAATTGTGCTTCAATCACCGGGATTTTCGGGATTCTGACCGCACCATCCAGCACTTTTACCCCAACAGCTTGCCGTGTTTGCGCTTGAACATAGGGAAGCGGGCTTTTAGCTTCGGATTGAAGCAGTTGGAAAAGGCAACAT
CGAGATTGATCACCGCCTGTTGCAACGCAATAGAGTCATATTCCTTTAGCCATGCGTATTTACGGAATTTACGGAATTTTTTCGCTACAGCCAGAAGCGGTTTAATGTCTTTACGCGGGGTTAAACTTACGCCGTGTCGTTGATAAGCGTGTTTCTTGATATGCAGAGATTTGCTGTACACAAAACGGACTGCACCGAACTGGGCGTTAAGGTGCTCCGCCTGTTCCGGTGTCGGATATATGCATACTTTCGTGGCTCTTAACATATTCAGCGCTCATTAATATATTGTATTTATTTTAACATGTTAAGTGAAAAATTCAATTGAGTAATCACCACGAATCACTGGAAGGCTTTCTTCGTAAACGGCATAGTGTAAGTAAGCTGGTTGTGCATTTAATCTTTACGACAAAATACCGCTGCAAGCTGTTTGACGGGCAGATTATCGCTCAGTTACGTGATGCTTTTGGTTCGGCTGCGGCAAAGTTGGAGTGCGAAATTATCGAGATGGATGGTGAACAAGATCACGTCCATCTGCTGATAGCGTATCCGCTAAAACTGGGGGTCAGCGTGATGGTAAATAATTTAAAGTCGGTATCGTCGCGGCTTCTCCGTCAGCAGAATACCCACTTGCGGATGCAGAGCAAAACCGGGCTGCTGTGGTCTCGTTCTTACTTTGCCTGTAGCGCCGGTGGTGCCACGATAGAAACGCTTAAAGCATACGTGCTCAGACAGAATACGCCGGAGTAGCCTGCCCTAAAGCCCTTCGGGCTTTTCGCCTTATATCCCCGCCCGCATTGGGCGAGGGTTTACAGCGGATCTTGCTAAGGCCTTCTATTAACTTACCATTCATAAAATGAATATTTAAAAAAGTTATCAATCGTGTTTTTCATCATAAGCCCTGACATAAAATTCCCCTGATGATGAGAAGTGTTTTTTTATTTCTTGTAAGCTAGGAAGTATGTTGGATGTTTTCCAAGAATCAATGAATTGTACATATGGTTTTTTATCATATTCACAGATAACCACAGCGTTGAAATCATGTCCACATTCGCCAATTATTGGAGCTATGTAGTTCTCTGCGCTAATGATAAAAGAACTTTCCCTCTTAATGTCATAACGGTTTAATATTTCTTTTTCCACCTCATCGATGGAATTTAAACAATATGATTGGTTAAGCTCATGACCAAAAATGATTTTATCAACAATGGCTCTGTCTAATCCCACTGTAGGAGAGGCAGTGTTCTTCGAACTTAATATATCCTGTCGGTGATGTATGTTATAATGTAAACACGCTGATAATAAAACGCAGTTCCCAGAGTAATGGACGTATGTATTATTTACATATCTAGCCATTTGTTTTAAATTATCTCGACTCAAATTTTGTGGATACAAAATTCTCTCTTTTGATTCAATACAAAGCGCATCTTTAATTCTCTTGACTTCATTTGAATCATTATTATTGATATTTTTTGAATTTATACGTGATATTATATCTTGGGATGACGCACTACCAGAAACTCTCATAGTTACACTACCTCAATAAAATGCATATTACAAAACATTAAATATGATATTTTTTCAAAAAACGCTCTTGCTTATGTAATTTTCACCATTACAGTGCGGACATATCAATATGCATGAGCGTTATTCCTTTACGGTGCCGGAATTCGTCAGTAAGGGGGGAGCCTTCAATGAAAATTTGTAATTTATTGATTAATAAATTAATGAGCATTTAAAATAGCTGTTTAACGGCGTTTACTGTTCCGTTGCTCCCCAAACCCATACTTACTCTGTCATCAAACGATAAAACGGTTCCTCACGTAAAGCCTGTCTCTGCATTTCACCACCATCACGCCCACTTCGTAGTTCATTACGATAACTGAGATATTTCCAATTTTCAGGATGAACGTCTGACTCAGGACACTGTCCTGAAATAACCCCATTTTCGGATAGTCTGGATTCCAGACACTCTATAAATTGCCGTGTCTTGTGGAGAAACG
ACGCACTGTACTGCGAATTTTCCTGGTCTGGTTTTATATACAACGTGAACTGAGCGCCCAGGCTAACACGGGCTTGTTGAACGACCTTCTCCATATCGGTCACTTTCCACTTATCTACCGGACTGTCCTCTGAAAACAGCAATCCGGACAGCGCTTGAAATGCTTGTGGCACCATATCCCTGAGCACACTGATGTGGAACTTGTCACCGGCAAATTTGCCCTGAGACTGAGGTGATTCTCGACGAGCATGGATGAAAACATCGTAACCATGGTTATTCAGCTGAAAACCTTGGCTCATAGCAAAGAAACCACTCTGGCGCATCCCTGAATAGTCAGGCACATCCAGCGCCTCTTTAAAGTGGGATACCGGCATCTTCCGCATTTGGTTGTGCAAGGAGTTGAAATTATTTTTCAGGTGCTTATTTGTAGATGGTATTTCCGCCCCATCATAAGCAGCTACAATATTCAAAGGAGGGATGTTTAGATTTAGATTAGGCCTATTTATGGGCATGGGAAATCTCCTTTGCGATATGGATATATCGCCATATTATTAGATATAAATTCTCAGTTTTTTTTAATAAAACGATTAGTTATCCAGAAAACCTCATTTTTTATCAGGCGAAAAAATGATATTTCGGTCCGTTCCGAGCAGAAAGTTAAAGAAGGGTAATGGCAAAAGTTTAGATGCACTACTTTGCGGGCCTAGGGTGACGGAAAAATCTGGGGTTCCGGAGTATAACCCCTTATAGAGCTAGGCCGCTCATACCACTTCTGGAATAGATTCTTAGTATCTATGAGTTGAGTACCCTCATGTTTATTATTCTTTTTATCCTGTTTGTGTCAGCAGTTGCATCATCACTTAATCTCAGCTTACTAAGCTGGCTTGCAAAGTCTTGGGATCCACAATTTACAATGCTTTCGATTTTGAGTTTAGTATTTGGAGGGAAAAGCATCTCTGCCTCTCCCTTAAAATGTGCAACATCTCCGAGTATTCTACCTTTATGTCCTTTTTCTAGGTATATGTTGAGAATAGTGTCATTTATCCATGCCTTATCTGGCGATGTACTCATAAAAGCTTTATCTATTATTATATTACCTATAGTAGTGTATTCCTTCAGCACATCCGATAATGCGGGCTTATCAAGCTTCAGGCCCCGGTATACGACTCTGTGATCTGTTTCGGGAAGTGAACTTAATCCCTCCGCAATATCATTTATATAAACTGACATGGCATTTTTAAACTCCTCATCACTGGGTTCATTTGTGGAAAGATAGTCTCTGGAGAGCAGAGTTTCTTTTGCCTGTGTTTCAGGATAATCATCCCCACGTAAATATTTATTAATCACACTGTACCCTTGAGCTGAATAGTACCTCAGAGCTTGAATCTGCTTTGATTCCTCTACAATCGCCCATTTTGATTTTGGTCGAGATGAATTACCTCCCATCATCGGAGGAGGCGGTGGCGGTGGCATCATATTGTTGACAGGAGCTCTTCTATAACCGTCGCCTTCATAGGCCAGCGTCCGAGCAGCGCAAAGCTGTGTCAGTATCGGATTTTCGTCATACTCCAGCAGCAGACGGGAAACCAGCGTATCGGCTTCACCCAGTTCATCAGGAAAGTGGTGGAACATCAGGACTTGGCGGCACAGGCGATGGAGGCGGATCTCAAAGCCGTAGTTATACAGGGAGAAGGGATCCTGGCGGGCGAGCCAGCTGTTCTGAGCAGTGAATGCAGGCGGTACCTGTGGATCTACACCACGTTCGCCGTAGTCAAACACTAGGGTGAACAGCCACTGTACCGCGGGTGTGGCGCTAGTCCAGAGGTACAGATCGGCGGCGGGGGTCGCGTTGCCATACTGTACCTTGCTGAGATAGCGCATGGCGCTGCGATCGCGTCCGGCCTCGTTCCCATTGAGGTCCACATTGTCACCGTTCTCCGCCAGGTAGGAGTAATAGATATGCTCGCCGGCAGGGGTCACCGACTCCTCAACCAGCCATTG
CGCCGTATGAGAGGCGGCCTGCGGATCGCTGAGGCGTGCTGCGGCGGTTTTCCCCAGCAGGTGCAGGATGCCGTTACTGTCATGCAGTAACCAGAAATCATCGCCGTTGCTGTTGCCCACCCAGTACTCCAGGCGATAAAAACTGCTCTCCGTGCGGGGCTGATAGCGGGTCACCGTGTAGCTTTGCGGGAACGATACGTCACCGTACGCGAAGCAGGTGACGGGATTGGGGGCATCACCGGTGCTGAGCGTTTGAACCAGCACTTCTCCGTCCGGCCCCAGAAACTCATCGCTGTCGTTATACTGCGGCACGCCATGGCTGGTGCGGCGGGCAATGCTCATTGTCGCGCAGGACCAGCCCACGCCGAAGGGGCCATTGCCGCCACCGCTGCTGTAGTGCAGCGCCAGCGCAGGCGCAAAGCCGCGTTCGGCGCTGATGGGCAGAGGCAGCGTTATACTGGCTAGGCCGTCAGGGCCTGACTGACTCAGCGCCTTGCCCCCTTTTGGCAGGAAAGGGGGAGTGATCAGCGCTAAAGTGGCAGATGAAAAACCATTTAGTATCAACATACACTATCTCCTGAAACTGGCCGTCTGACGATGGCCAGGTATATTTTCTGCTTACAGCGTCTGCTCTACGGCCTGTTCAAAAGTGCTGCCGCCGTACAGCGCGGTAGAATGCACCTGAACAATGACGTCGCTCAGATTCTGCAGAATGGTCTGCTGCTCACTACTCTGGTAGCGCGGGAAGCTAAACTGCCGGCTGGCACGCAGAGTACCCGCAATCAACTGTTCCACCTGGGGGAACGGTAATCGCTAACTGTCGGGCAAAGGTATTCAGTGCTTCAAATGGCGTATAGTCGGCGGTTTTCAGCCCCAGATTGGTCTGGTAAAAAGAGGTCAGGTCAGTGGCGGTACTGTTGTCCGGCTCGGTCGGCAGTGTCTGCTGCTCTGGTGACAGCCGGCTGGCGAGCTCCATGGCATAGTTACTGTTATCGCCGGTCAGCGAGTTGCTCAGGAACCACGGCCAGTCAATGCTGCTCTCGCTGATAATATCTTCCAGCGACACATCGGTATTCAGCAGAGATAACTCCACCTGGTCGTGTGGAAAATGATACGGCAGCAGGGTCGGTATTTGCTGGTTAATGGCATCATTAACCACCATCAGGGTGGCCAGGTCCGGCCGACGCTCTGCCTGTCCCTGAATACCGCCGACAGCGGAACTGCTCAGGTTCGTGCCATTGTCCAGTGTCTGCTGCAAGGTCCCCCGAACTCTGCGGGTCAATTGATTTTCTCGGAACATGCAGCGGATATACTGCGCCTGGCCCATCGCCAGGTCCCAGGCTTTACCTCCGCGGGAGCCCGGCCATGACATATTGGCGTTAATAAACTGTTTACGGGTCTGACGGACAATATCAAACACGGAATAATAATTATATTTTGCAAAATTCCCTGCCGGGACCCGGATGGCGGTTTCGACCTGTGAAAGTGCCGGACTGGTGGTCTGATTCATATTCATAAATAATGATGACTCCTGAAAATAAACAGAATGAAATCCTGAAGACTGATAATGTCTGCAGGGGAATTATTTTGTATTGCCAGAAAATGAGTGGCTTATATTGAGTTTATTTATGTGCTATTTTTCTGCTGTGCGGAATCGCGGTTCCCGGTGTGCAAAGATGGCACAACAAATTAACTCAGGAATAAACAACGTTCAGCGCTCCGGCGTCGGGTCAGGCCGGGAACAACCTGACCGCTGGCGTGGTCCCATTTCAGGAATTCATTCGCTGCGCCGTCATAGTCACCGTTATTCAGTTTCTTCAGTAAAGTGGCGTGAGCCAGCGCGTTAATGCCCATGTTTTAGGTAATCCACCAGAGCATCAAACTGATTCTGTGTCAGATTGACGGCGACCAGCCGGTTTACTCCATGCTCCGCGGTGAACTACCGCTATGGAGGATGGGGATTGCTGACAATCTGAATCTGGCAAATGCCGTGAATACAG
GTGTTGCGGCCCTTACGCTGCATAAGGTCAGAAGGTGGACTGTTTCAGTTCCTGCTGTATCAGTTTTATAGCTTTACGATAATCTGGTGTCTCCCGTTTCTTGGTCGGGTAATACAAGGAGGTACAGAGCGCTACGCCTTTGATGTGCAGTGCGTGATCTGTTGATAATCCCAGAGCCCGACAGACTCTGACGGGGATGAGTAACATCGCCAGCCCTTGTTGTAACCGGTACAGTGAACTGAACAAATCGACGTTATCAAAACTGAATGCAGGGTTGGTAATACCTAGTGTCTGTTCAAAAAAATGGTATATGGTGTCCAGATTAAAATTTTTAGCCCCCTCATGAAAAAGTACAGGTGTTCCGGCCAGCCTGTTGATATCAGGTTTGCCGCAGAGAAAGAATTTTTTAGGAATAAATAACATGACCCCACCCTCCACTGATGTCCGGCAGACAAGCGATTCCCGATGAAAATAATTTCTGGCAGAAATAACAATGCAGTTGTTTGTCTGACACAGTTCTTCTATTATTTGGCTGTTAACGGCTCTCCCCATTATATTTGTTTTTTGGCCGGAAATGGTCAGTGCTGAAATGATCAGATTTTTTAAACTTTCCGGATAAATTTCGTCAAATATTATTTCTAGTTGTTTCGTTTTACCCGTAGGTCCGATTTCCTGCTCCAGTGCATGTAAGAAAATATAATGGGATTTTACTTTTCGATAAATAGTTTGTGCAAATTCGGTTGGGATAAGAGTGCCATTCTTCCGTATAAAGAGTCTTTGTTTCAGCTCTCTTTCCAGGTCTGAAATAACCCTGCTCAGCGGGGTTCGGGTGATATACAGTACTGATGTTGCGATACTGAAGGAACCTGTTTCCATCAGTGTTATGAAAATTTTTAATTTTTTATTAATCAAGAAATCCATAATATCTCCTTTTTTTTTCAGAATAATCCTGAAAAAATGTTTTGATGTGCAAAAACAGGATGGCGGTGACCTGTTTTTGCACAGACTGATTAGTTCTTACTAATCATGGTGTTACTGGATAGGATTATGGGCGATAGATTTTATGCATACCCTGACTTTATTTCTGAGTGGATTTAATTGACCTGTTATGATGTATAGCTGGATAACAACGATGTATACCCATGATCAATGAAGATGCTTGGTTTCAAAAATAATCAGGATCATGCTGATGGCTATGAATGTTGAGTTGACCGCAGAACAAAAAATCGCTCTTGAAGCATTACAACGAAAAAGTCGCGATCCCATAAATGAACAGAACCGATGTACGACCGGGCCGGAATTAAACGGGAGGCGTTATTAGGCCTGGCGCACTACAGGACAGAATGCAGAGCCTTTTTATAGGTTCAGTCGAACGAAAACTCACGCCAGGCGAGAAAGGTTGTCTGAAAAATCGATTAGTGTGCACTAAATAGCTTCGCAAAATAGTAGATCACTGAAAGGGAACTCAGCCCGGATTGTGCGATCTGATCAATCGCCAAACATCACAAATCACCAACCGGACTGAGCGATGCCGATCATAGCAGCAATTCCTGATGAAGAACGACAACTGATGCGCAAAGAGGCCCAACAAACCTACGATAAAAACCATGCGCGACGGCTTATCGCCATGCTGATGCTGCATCAGGGAATGACCGTCACTGACGTTGCCAGATTACTCTGTGCCGCCCGTTCATCCGTTGGTCGATGGATAAACTGGTTTACTTTACATGGTGTTGAAGGACTAAAAAGCCTCAGACCCGGACGTGCTCCACGCTGGCCGGTCGCTGATATCCTTCAGCTCTTGCCACTATTGGTACAGCGTTCCCCGAAGGATTTTGGCTGGCTGCGCTCACGCTGGAGCACGGAGCTTCTGGTACTCGTCATCAACCGACTTTTTGATGTGACGCTGCACCGCTCCACCCTGCACCGATATCTGAGGCAGGCAGATATGGTCTGGCGCAGGGCAGCGCCAACGCTGAAAATCAAAGATCCGCACTATG
AAGAAAAACGGCTTGTCATTGATCAGGCGCTGGCTCAGGAGCAGACTGCACATCCTGTGTTTTATCAGGATGAAGTCGACATCGACCTGAACCCAAAAATTGGCGCTGACTGGATGCCCAAAGGGCAACAGAAACGCATCGCCACGCCGGGACAGAACCAGAAGCATTATCTGGCAGGCGCACTGCATTCAGGTACGGGACGGGTTCACTACGTCAGTGGCAGCAGCAAGAGTTCTGATTTATTTATCAGTCTGTTAGAAACGTTACGCCGTACATACAGGCGGGCGAAAACCATCACACTGGTGGCGGATAACTACATCATCCATAAAAGCCGCAAAGTGGAACGCTGGCTGGAAGAAAACCCGAAGTTCCGGTTGCTGTTCCTGCCGATGTATTCGCCGTGGCTGAATCCGATAGAGCGACAGTGGTTGTCGTTACACGAAACCATAACGAGAAATCATCAGTGTCGGTACATGTGGCAGTTACTGAAACAGGTAGCACAATTTATGAATGCCGCTTCACTGTTTCCCGGCAATCAACAGGGTCTGGCTAAAGTGGAGCGGTAATATGCGAAGCTATTTGTAGCGGCTCGCAATCGAAATCCCCCCGACTCGCAATCAGCGCCAATTTTGGCTCACATTAACTGCAAATGAAAAACGCCACCGGCTCACATTAGGTGATAATGGGCTCGCATTTATCCTGCCCGGCTCGCAATCCATTCCACCGGGCTCACAATAACTGCAAATCAACATCGTCATATTGGCAGACCGACCGCCAGCATTCTGTCGCGTAGTTCGGCCGTGCGGGCAGGGTCTAACTGATTATTAAATAATGCCCTGATTTCAGCTGGTTTTGCATCGAACTGCTCCACCATATCCTTCAGCCGGTAACCATGTCGCGTGAGCGTCGGCTCCAGGTCATCCAGCTGACGTGACAGCACGGATTCAACCAGAGTGGCAGTAATCGGTTTTTCGCCCGTCTGGTATCCTGCTTCCATCGCCAGCGTCAGGTGCAACTGGACCTGCAGCGGTGTTCGCAGTTTCATCGCCAGCAGGTCAACGGCTTCGGTCGTTAGAATATCTTCCGGCTTACCCTTACCGGTGCTGGTCTTCAGCAGCCACTGGATATATTCACGCTGACTGCCGGTGATCCCGTCCAGTGTAAAAATATCGGTACGATAGCCAATTTCCTCCATCGTCGGACGACGCAGGTCATTACGCAGTTTTGGGTGACCGGCCAGAACGACGGACAGCCTACCACCGCCGTCTTCAACCACTTCCATCAGCCGTTTGAGCCCCGTTAGTGTATTACCGTTCAGGTCATGGGCTTCATCAACAAACAGCGCCACCGGGCGTTTGCCTTTTTTGACCAGTTCCTGCAGCTCACGCTCGCGGCGTTCACCTTGTTTGTGGATCTGCACCTGTTTATCCTGTGCCAGGTCATAAAACAGGGCATTAATCAGCGTGGCGAGCCGGACACTCTGTTTGTCCACCGACAGGGAGCGGGCGACAATGATTTTATTTTCATCCAGCAGTTGCTGCTGCAGGCGTCGCAGAGTGACGGTTTTACCGCTTCCCACCACGCCACAGACCGCTATCAGTCGTCCTTCGCGTATCGCACCCTTAATGTCTTTCATCAGCTGCTTATGGTGGGCAGTTTCATAATACCCGGCCTGTTCAATAGACTGCGTCAGTCCATAATGCTCCATCACTTCAACCCGCATGGTCCTCTCCTGATTGTCTGCTACGAAAATAGTCTCTGATGCGGGCGAGTACTTCACTACGGTTAAGCGTCTCAGTCAGAATACTGTCGATAAACGCCCGGTCTTCATCCGACATTCTGGCCAGCGGTATGGCCAGATCGTCGGCAATTGCCAGTTTTGCCGCAATAACGGTGGGAAAGTGGTATTCAAATTTTCGGGTATCAAAGGGCTGATGTGGCAACGCATCAATGCGCTGCTGAGTGTCATCACTGACCACGCGGAGATCGCTGCCGGACAA
TGCGCTGATGGGGATGTTCAGCTGCCTGGCCAGAGCATGGATACGATCGGCCCGTTCAGCCGCTTTACCGCGTCTGAACGTCCGGTAACGGTGCAGCGGCACAGGCCCCGACACCGGATAATAAGGCCCCCATGTTTCACCGGTAAACTCCACATACATCTCTTCGTCAAACAGCCCCCACAGCAGAATGACGGTTTCACCCGCCATATCCGGTTCAACTTCCCATGCGGTGCCGTCTATCGTTATCCGCGCATCAACACCCACCTTGCGGGATTCCGGCTCCCGGGCGAACCGACAGTATTGTTCCCAGCTGCACATATCCCTCACACCTTCCTGGCCGATATTTGCCAGCCAGTCCTCCAGTCGGGAGTGTTTCTCGCTGCGGTGACGCTGCGCGTTATAACGGCTCAGATAGTTCCAGAGCCATTCGTTGGCCTGCAGCTCCGTTTCCGGTTTATGGAAATGGTACAGAGTTTCATGTGCCTCCTTGACGGTCCGGAAAGGACGTTCAACTTTACCTTTGGACCGTGCCGTCGTCCGGGGGCCGTCCTTGCCTGCCGGCGTATGCGTCAGCCAGTCAACCTTCAGGGACTGCATCACATTCTGGAATACATGGTTTTTGGCCACCGGTCCATTATCGAGGTACAGCAATTTCGGGCGGCCCTGAAAAGGAAAATCAGACCTGGTTTTTGGGGCCATGGCGTTAAAGAGAAAACGCAGCGCAGATTCCGCATCCTCACCGTACACACACCGGTATTCCTGGTATGCGACACCGCTCCGGTCATCCACTACGCTGAAGAGCATCAGCGTGGGTTCGCCCCGGGCAGGATCTACCCAGTCAGGCCGCTCAATATGTTTGAGATCGGATGGCGACATATCAAACTGCCAACAGTCATTGCTGTTTTCAGCCTGAAATCTTACCGCCGGAGGCTCCCGTAACAGGCGGGGCTGGTCGAGACGCCAGCGGGAAAGCCAGCGATTGACGGTCTGCTTACGAAGTAAGCCTTTGGGACTTTTAATCAATCCCTGGACCGTCTCCACGCCATGCTCTTCCAGCAGCTGTATTGCCCGTCCGGTCGACAGATGGCGGCCGGATTTATTTGTGGTTCGTAATTTCAGTGCAGCAATCAGTTCGCAGTAATGCTCCAGCTCGGACGGTGGCAGTATCCGGGGTTGACCATGATCACTGCGGTGGGCCGTTCGCGGTTTCAAAACAAGGTGAAGAGCGCGATAAACCGTCGTGACTGAAATACCATACAGCTGGGCGGTAGCGGCAATCTGATTGGCACGTTCAGGGCTTTTTGGCGGCAGGCGATCGAGCCGTTGCCGTAGTTGCAACAGGGAGTCAGACGGAATCGCGCTACGCCGGCTGCTCATAATTTTCCAGCGCCCGGTAGACTGACGCCCTCCCAATTCCGAGCTGGCGGGCGATGGCTGTGGCTCCCATCTTCTCAATGGTATACAGACGATATACCTCAGCAGGATCTATGGACGGTTTCCTTCCCCGGTATACGCCCCTGGCTTTCGCCGCGGCAATGCCTTCCATCTGGCGTTCACGTCTCAGATTGGTTTCAAACTCAGCGAAAACACCCAGCATATCGAGGAAGGCTTTGCCTGCAGCTGAACGCGTGTCCACTGGCTGTTCTGTTGCCCTGAGCGTTACGCCCTGTTGATTCAGGGCATACACAATGTCCTGCAGGTCCTTAATGCTGCGGGCCAGGCGATCCACGCGTGTCACCATCAACGTATCACCAGGGCGCAGGAACTCCAGCAGCAGCTGCAACTCGCTCCTTCCGGTCCGGCCGCTTCCGCTGGCTTTTTCTGCACGAATAATTTCACAACCAGCAGCGCGGAGAATTTGTGTCTGCAGAGTAAGATCCTGATCGCTGGTTGAAACGCGGGCATAGCCGTAAAGTGCCATAATGGTGAAAACTGTCTCGTTTAACTCTAGATGATGAAAACGTAACACCGGAATGGGATAAAAACAACCCAAATGAGAC
AGAAAAACTGTCTCACCAGACTGTGCGCTTTGGGTGTACTCAAAAAATACATTTTCATTCCAGAGAAAATTTAATGAGCTCTTAATGTAATGCGAGGCTGACTGTATTCGGACTCGGACTGCTTCAGTTATACTTCGAAAAACTCAACCTTACCGCCAGTAAGGTGATACATACTGCCAACGATTTTAATTTTCTTCTCATCTTCCAGTTGTTTCAGCACCGGGCTGTTTTTGCGGATGTTTTCGATAGTCAGCTCGACATTTTTTCGTGCCACCGCATCGACAAAATCATAATTGCTACCTTTACGCTCACCGCTATACTCCGTTTTTGCAATCGCGGGTTTAATTTCATCCAGCAGCCCCGTCAGGTTACCCAGTTCAGCATTATCAATAGCACAGCGTACTGCGCCACAGCGGGTGTGTCCTATAACCAGCACTACTTTCGCGCCGGCAACTGCACAGGCAAACTCCATGCTGCCCAGCATATCGCGATTACTGATATTACCCGCTACGCGGGAATTAAACGTCTCACCGATCCCCGCATCCAGCACAATTTCCGCCGGCGCACGTGAGTCAATGCAGCTAAGGATCACCGCAGCAGGATACTGACCCGCAATACTGTTGCGCTTCTGTGCCAGGTAATCATGTTTAGCCGGGCGATTTTCCCGGAAGCGCAGGTTACCCTGTTTAAAATGTTCAATGACTGCATCTGGAGTCATACCGTCACGTTCCTCTTTGCTCAGTGATGCGGCGAAAGAGATAGTTGGTACCGATAAGGCCGCCAGCCCGGTGACAGACAAAGCAGATACTGCAAGAGTTTGTTTCAGGATTGCACGTCGTGAAGGTTGTGCTGGTTGGTTTTGTTCCATTATTTGCTCCTATTTACTGAGTTAACTCCAATTGCTGCGATTATACGTAGGTAAACGTTAATCAAACCAAAACGACAATTTTTAACCAGTATGAATAAATATCTGAGACCAGTCGAACTGGAAAATAGAATGCTGGCAGTCGGTTTGCCGATATGACGAAGTTGACTTGCAGTTATTGTGAGCCCGGTGGAATGGATTGCGAGCCGGGCAGGATAAATGCGAGCCCATTGTCACTTAATGTGAGCCGGTGGCATTTTTCATTTGCAGTTAATGTGAGCCGAAATTTGCGCTGATTGCGAGTCGGGGGGATTTCGATTGCGAGCCGCTACAACTAGGAACTCTGTGTGTTCCGAACTTCATAAGTCTGGAGCGTTCATCTCATTCTTTGAGTGTCCGTTTGGTAATTCCGGTTGTCGCGTCACAGTTATGATACGGAGAAATCAGGTGTGGCTCTCAGTCTGGTCTGGCTTGAACGTGGCCATCCCAGAGCCAGTGCTGTTGCTGATGTACTTTCAGCCGCAGGCTTTCCTGATTTCGGGAAGTAGCTCCAGCAACTGGCCAAAGAACCTTCGCAACGTTGAATAAGCCTGATTTATTTTCACTGACTTATGTGATTAATGCTAATGTACGCATACATTAGCATTAATCATGCATTTAATCCCCTGCGCCTTTCTGTCTGTTCTGATTTTTCTGGTAGTCCTGCTTCACTCTCCCGATATTCTTCAGAACTGTCTGCCAGTGGTCCTCTGCCTGCTGCCGGTAGCGTTCGGCAACGGCAGCAATAATGTCAGGATCACTGTCATGGCTGCGGTGCATGCGTCCCTTCTGGCCAAAGATATCAACAATAATATTCTGGCTGCCCTGCCTGTCGAAAAGATGAGCGCACAGGCGCTTTACAACCAGCGGAGTATAGTCCAGCCCGGCCTCATCAGCGATCTGGAAGAAGAAGTACAGCATCTCATACGTTTTGTAGCGCTTGACGGCCGCACTGAGTCTGGCTGCCCGTTGTGGTGCTTCCCAGGTGTCATCCCTGTAGTTACGCAGTTCTGCCTGGTAGTCGGATTCCAGGTCTTCCTCTGTATACCGGTGGTGGCGGAAAAAGTATTCCCTGGCCCGTTTGACGACTGATTCA
TCCTGCGGCATGTCTGTCTCTGGTTAATGCTAACGTATGCGTACATTAGCATTAACACCAGTACCTGGCAAGGAGAGTGGCGTGCATCGTACCTGTGGTGCATCCCGCCCGGGCTCCGGACATGCTGTGCCAGCAATTCGACCGGCGGTGCGCTGTCCTGGTGCCCTTTTTCGCTACCCTGTCACGTCTTTAATTTGTTTTTTAGTTAGTTTATTTGTTAGTTTGTTTGTGTATAATATAAGCATACCCTTTATGCGGAGACTGACAGATGCGTCCGGCGACATACGAACCAGAACAGATTATTGAAGCAGGGCTGGCCCTGCAGGCTGAAGGACGGAATATCACCGGGTTCGCACTACGTAACCAGGTGGGTGGCGGCAATCCGACACGTCTCCGCCAGATATGGGACGAATACCAGGCTTCACAGAGCACGGTCGTCACTGAACCCGTTGCCGAGCTGCCAGTGGAAGTGGCTGAAGAAGTGAAGGCCGTCTCCGCCGCGCTGTCCGAACGCATCACCCAGCTGGCGACAGAACTGAATGACAAGGCGGTCCGGGCTGCAGAACGCCGGGTTGCGGAAGTCACGCGTGCTGCCGGTGAACAGACCGCACAGGCAGAGCGGGAGCTGGCCGACGCCGCGCAGACAGTCGACGACCTGGAAGAAAAACTGGATGAACTGCAGGACAGATATGACAGTTTGACGCTGGCGCTGGAGTCAGAACGTTCACTGCGTCAGCAGCATGATGTGGAGATGGCCCAGCTGAAAGAGCGTCTTGCGGCCGCTGAAGAGAATACCCGTCAGCGAGAGGAACGGTATCAGGAGCAGAAGACAGTGCTGCAGGATGCGCTTAATGCGGAGCAGGCACAGCACAAAAACACGCGGGAAGACCTGCAGAAACGACTGGAGCAAATTTCTGCCGAAGCTAATGCGCGTACAGAAGAACTGAAGTCTGAACGCGATAAAGTCAATACTCTCCTTGCCCGCCTTGAATCGCAGGAAAATGCGCTGGCCTCAGAACGTCAGCAGCATCTGGCCACCCGCGAAACGCTGCAGCAACGCCTCGAGCAGGCCATCGCTGACACGCAGGCGCGCGCCGGTGAGATTGCACTTGAACGTGACAGAGTCAGCAGCCTCACCGCAAGGCTGGAATCGCAGGAAAAGGCCTCCTCGGAGCAACTGGTGCGTATGGGCAGTGAAATAGCCAGTCTGACAGATCGTTGCACACAGCTGGAAAACCAGCGTGATGATGCCCGTCTGGAGACGATGGGGGAGAAAGAAACGGTCGCGGCACTGCGTGGTGAGGCTGAAGCCCTGAAGCGTCAGAACCAGTCACTGATGGCGGCGCTTTCAGGCAATAAACAGACCGGTGGCCAGAATGCGTGATGATGACAATTCAGACTCCTGTGATGAAGCGCCCCGGGCGGAGGATAAATGCTACACCAAAGGGCGGCTTCGGGATGAGTTCAGGATGAAGCCAGCCCCTGGTGCGGAACCGGTCAGAATGTACAAAAGTCCTTACGGCGGGAAATACGGCGTATGGCGGCTGGCTGACTGCGTCCCGATGCGGGCGAAACGCCCTCAGACAGAAAAACAGCGGCTGGCCAGCACACGTCTGGGCCTGCAGGCCCGGATGAAGAGTGAGCGTGGCAGGTTCGCCATGCTGGCACATACCTGGTTAGCACTGGGCCCCGTCTTCCTCGATACCGAAACGACCGGACTTGATGCCGGCGCGCAGGCGCTGGAAATCGGTCTGGTGAATGCCCGCGGTGAGAGGATATTTGAGACCCGCCTGAAACCCACAGTCGGCATCGATCCGGCAGCAGCTGCAGTACAAATCGTTACGGAACCATATCGCTTTCCGGTGCTGTCAGTCAGGCTGGCCTGAGCTGGATCGGGGTGGCTCATTCCGCCGTCACTGACGCCGTTATGACAGCACAGGTGGTAAACAATATTGCCGGATACTGGCATGAACTCCAGTGTGAAATGAATGATGACGCAG
GGAGTGAGCCAGCATAACGCTGCTCTCTGGCTGGTCAGACAGTTGCAGGGATTAATTCAGGAGTCTGAAATGGAGAATATGTTGAAAACTATGCGTGAGCATTTCCCCGTTCGAACAGTGGTGATCGAACAGTCTCTGGATATGTTTGGTGTCAAACGAAATGAGGACGGGACGTTACTCTTCCCCCGTCACTCCGATAAACGTTTTCCCGTTGCACGTATCCGTAAGCGACTGGCGGCTTATGGCTGGCGAGGAGAACGCCTGAATAAGGAGGTGGCGCGAATCGTCGCGGGCCCTGATATCCTCCCTGAGAGGAGGGACTACAGCATTATTTTCCCTTACGGATGGGAGAAGAAGCAGCTGACGCGTGAGATGCGGCGAATGTTTCTCCGTAAGACGGGCTGACGTGTGCTGATTATGCCTGATGAATTACATCATTTTTCTGACTTGCGGGAGAAGCCGTGACATTGACAGAAAAATCAGGTCATCTGGCGTGGTGTGCCCTGGTGGCGCTTGCGCTGGCCAGGCAGGATGGTGGCGTTCTCTCGCCGGCACAGGAAAACCTCTTTCTCACCCGCTGGCTGGCGACTGCGCTGAAGCAACGGCGTTTTTCTCGTGATGTGACGCCCGATATCGAGTGGCTTTTGAAACAGGGACGTCAGATGGGCGTCAGCGCCAAGCTGGCCAGCAAGCTGAATTACCTCTGGCGTTCATGCACCGGTGAGTTGTCCGAGCAGAACGACCTGTTCCGGCTGACGTATGCTCTGGAAACAGCGAAAGATATGCACTGGAACTACCGTCTGCTGAGCGATCGGGAATGGTCCGGCCGGAATGCTGTGGCGCTCAGTGCAGGCGTGAACGGCATTTATCTCTCACAGGCCAAACTCGATGTCGGTTTTAACGACAGCGGCCGACAGATAAATTCGCTGACAGCACGTTTGACAGGGAACGTAGCGGGGGTAATGAAGCTGTTTGATCGCTGTGGCTGGCTGGCAGAGCCTGATGCCTCCCTGCCCCACCAGTATTCGCTGATGGCCGGGCAGGGAGTGCCAGAAAAGGGGGATTAGTGGCGGATCTGCGCCAGGTCGTCTTCAGTCAGACCGGTCATTTTCATGACGGTATTGCGGTCAATGCCGTTCTGGAGCATGGTACGGGCTATTTTCAGAGTCGCTTCGCGCTCCCCTTCTGAACGGCCTTTTTCAATACCACGTTGTTCACCGAGCTGAATCCCCTTCTCGATGCCCTTCTGTTCGAGCTGTTGTGCGATGGTCATAAGTGCGTCTCCGTGTTGCGGCACACGCTGTGCCAGTTCGCGTACAAAGGCTTCGGCGTCGGATGTTTCGCCTGCCTGCACTATATAGTGTACCAGCGATATTACCTGCGATGAAGACAGATATCCGGCCAGCAAAATGGGCGCCAGCCGGTCAACCAGTTCTGCCAGGTCCCGCTGATGAATATGTTTCTGCAATAAAGTCAGGGCGGCCATGCTACGGTGGCCGGCGATTTCATCATCCGGAATGACCGTAACGTCTACCAGCGGGAAAGCGCTGCTGTAGAGTTTGCCTGCCAGCGCCGTATCGTCAAATTCGTCCAGCCAGCGGGTGGAGTACGGATACGGGCTGCGTTTCCCCGTATAGAACAGCACTGGTATCACCAGTGGCAATTTTTTATGCCCTGCCTCCAGGTGGCGCTGCATGGCGGCCACCGCATAGCGTATCAGGCGGAAAGCCATATGTTTGTCAGGTGTTGACTGGTGTTCAACCAGGACATGAATATATCCGTCGCCGGCTGTGGTTTTCAGGCTGTAGAGGACGTCGCTGAAATACTGGCGGAGGTCATCCTCAACAAACGAGCCTGATTCCAGTTTCAGTGTACTGAGATCGCAGATGGCACGCAGCTCTGCCGGCAGATGCAGCTCCATAAAATCCCGGGCAATGTCAGGTTGTGTCAGAAACTGCCGGAATGTCGCGTCGTGGGGAGTCGGCGTAGTGTTTTTCTTT
TTCATCAGTCCATGCTCTGAAAAATGACCCGGTCATAGTATCACAGGCAAAAAATGCTGACATTGCGAGGAACGGAATTGCAATTCCTTCAGGGGTTGTCCCCTCTTTCTTTTTTAATCATCAGCTTGCTTGCCCGGAGATATCCATGTCCTGGCAGAAACAGGTACTGCTGAACTATACCCCAATGTGACAGGCTGACGCGCCCTTTGGGCTTGTCCAACCCTTCGCCTGCTCAGATGCAAGATCTGAACAGACTCACGGTTTGCGCTGTGGGTCAACAAAATGATTGTGGATGTAGTGCCCTCTTCTCCATTTAATCTTCGGTGCCCCCTGTGAGAGCAAAATCAAAATATTATGATTTATTTCTTGCAACATGATTCTTTTTGTTAAAAATGACAAAAAACATCATGAGGTGATAAATGGAAAATATTGAGCAGTTACGGAAAGTCGCCACACGTGCAGGTAAACTTCTGACATCCTTGAGCGAAAGTATCCGCCAGCAAAAAGAGGAATTGAAGCTTACCGAGTTCTATCAGGAATACAGCAAAGCCGCCCTATATAAGCTGCCTAAGCTAAGCAAAGGGAGTGTTGAGTATGCTGTGGCCGAAATGGAAGCCAGTGGCTATATTTTTAAAAAGAAACCTTCCGGCAATACGATGAAGTACGCGATGACGATTCAGAACGTCATCGATCTGTATTTCCATCGTAAAGTGCCCAAGTACAGAGATCGCTTCGATAAAGCGTTCACTATATTTGTATGTAACCTTAAAGGTGGCGGTTCCAAGACAGTTTCTACCGCTTCATTATCCCATGCCTTTCGGGCTCATCCTCAGCTTCTGTTCGAAGACCTGCGTATCCTGGCCATCGATTTCGATCCGCAGGCATCTCTGACAATGTTCCTGAGCCATGAGAATTCAGTGGGCCTGGTTGAAAATACGGCTGCTCAGGCCATGCTCCAGAACGTGTCTCGTGAAGAATTACTTTCCGATTTCATCGTGTCGTCAATTATCCCGGGCGTTGATGTTATTCCCGCTTCCATTGACGATGCATTCCTTGCTGAAGGCTGGAAGGGATTGTGTGAGGAACATCTGCCTGGGCAAAATATTCATGCAGTGCTTAAAGAGAACATTATCGACAAGCTCCGGTATGACTACGACTTTATCTTCCTTGATAGTGGTCCTCACCTCGATGCGTTCCTGAAAAACTGCATCGGAGCGGCAGACCTGATGTTGACTCCACTTCCTCCGGCAACGGTTGATTTTCATTCTTCCCTGAAGTTTGTAGCCAGCCTTCCTGCGCTTATTGATTCGATCGAACAGGATGGACACACCTGCAATCTCATCGGAAATGTAGGTTTTATGTCCAAAATCCTGAACAAGTCTGATCACAAAATTTGCCATAGTCAGGCCAAAGAAGTATTCGGTGCCGATATGCTCGACATGGTTCTGCCGAGGCTGGATGGCTTCGAGCGATGCGGGGAGACTTTTGACACAGTTATTTCCGCAAATCCAGCAACCTATGATGGAAGCACCGAGGCGCTTAAAAGCGCAAAATCAGCAGCGGAAGACTTTGCGAAGGCCGTATTTGATCGCATTGAGTTCATTCGTACTAACGGAGGTATGTGATGTCGAATGAGAGAAGAAAAACTATTGGCAGACAGTTAAATACCCAGGCGTCAATGGTTGAGATGACTGACACCCAAAGAAGTCAGGTATTTACCCTAAAAACCGGCAGGAAGATAACGTTCAGGTTTGTTCGGGTACCTGCATCGGACGTTGAAAGTAAGACATTCGTAAACCAGGAAACCAACGGAAGAGATCAACTTGCTCTGACCAGGGAGTCCCTGAAATCTATTATCCAGACGATTAAGTTTCAGCAGTTCTTCCCCTGTATTGGCATTCAGCAAGGTGAAAGGATTGAGATACTGGATGGTTCAAGACGGCGAGCCTCAGCGATTTATATCCGTACAGGCCTTGATGTAATGGTTACCAATGAACGTT
TATCAGCTGATGAAGCTCGCCATCTGGCTAAAGATGTCCAGACAGCTAAAGAGCATAACCTTCGGGAGATCGGCTTAAGATTGATGGCTCTGAAAGAGTCTGGATTCAATCAAAAGGAAATTGCTGAACTGGAGGGGTTATCCCAGGCTAAGGTCACCAGGGCACTGCAGGCGGCAGCAGTACCGCAAGAATTGATTTCTCTGTTCCCGGTTCAGTCTGAGCTGTCGTTTAGTGACTATAAAATTCTGTTAGAGGTTAATGAAAAACTCAGTGAAAAGGGTTTAACGTCTGAAGGGCTCATTCAGTCTGTCTCAGATCAGCATGATGCAATTTTGAGTGACTACGAGCGACCAGACGACGAACAGAAAGCCAGTATACTGAAATTGATTTCTCAGGCCTCTCAGGCATTGATAGCTCCGCCTCCTAAGGAAAAGTCAGTAATTTCCGCACTCTGGACCTTTGAAGAAAAGGACAAGTTCGCGCGCAAGCGTGTGAAAGGGCGTACGCTGACTTATGAGTTTAGCCGTATGTCAAAAGTGGTTCAGGATGAACTGGACAAAGCTATCAACGAGGTCCTTGAGAGAAATTTGAGTCAATAATTTCGCCACGGCGATTTCACATCTAACTTATTGTAATTAAGTAATTTATATTGAGAATTTCAAGGTGAAATCGCCACAATTTCAAGTCAATTTCGCTGCAGGAATATCTGCCCAACGTGTCGTATAAGCTGGCGAAAGTAGTTCCCTTTTCATCTGCCACTCGGGGGCAATCCCCCGACCGGCGAACCATATTTTCCCCTTGCCCGGATGGTTAATCCCATCCAGGACTTGCATCAGCTGCTCGCTCCGTTCCCGCGGCTGTACTTCATCAAATAAATTAAGCTGCGATACCCCGGTTTGCGTGAAATCGTTCAGCATACAGCCTGCCTTTGCGTAACGGTGACCATCCACCCAGATCCTGTCCAGAGCTCTGACGGCAGCGGCAATGATGTCCCGTGTATCCTGCGTGGGGATGAGCAGTTTTTCACTGGCCAGATTACCGTAGTAGGGCTCAGTTACGGCGAACGGCGATGTTTTCACAAACACGGCAATATGCCTGCAGAACTGCCGTTCCCCACGCAGCTTTTCGGCGGCACGTTCAGCGTGCTGACAGACTGCCTGCCGCATCGCTTCATAAGTCGTGACTCGTTCTCCAAAGCTTCTGCTGCAGACTATCTGCTGTTTGGGGGGCGGTGCTTCCTCCAGGGAAATACAGCTTTCTCCATTGAGTTCCCGGACCGTTCTCTCCAGAACCACATTAAAGTTTTTCCTGATGAACGTTGGATTCGCGCGGGCCAGCTGCAGTGCTGTGGTGATCCCCATTGTGTTCAGCTTCTTTGAGATCCTGCGGCCCACACCCCAGATTTCCTCCACCGGTTGCAGTGACAGCAGTTTCTCCGTTCGTTTCTGATTGTGTAACGTCAGGGCAAGCACGCCTCCGAACTGTGACCATTCTTTCGATGCCCATTGCGCGCTTTTAGCCAGCGTTTTGGTCGGTCCCATTCCGACGCCGATGGTCAGCCCTGTGCCGGAACGAACGTGCTCACGCAGCTGTCGGCCGAAATCCTCGAAATCGATGCAACTGTCTATACCGCGAATATCCAGAAACATCTCGTCAATGGAGTACTGCTCAACGCGTGGCGCCAGCTCTTCCAGGTGAACCATGACCCTGTTCGACATTGATGCGTAGAGGGCGTAATTACTGGAGAACGCGATAACCGGCTCCGGAAATTTCGCCGACCTCAGTTGAAACCACGGCACCCCCATTTTTATGCCCAGTTTTTTTGCTTCAGCGCTGCGCGCGATTACACAACCATCGTTGTTACTCAGGACCACCACAGACCTGTCACGCAAATCGGGGCGAAAGACCTTTTCGCAACTGGCATAGAACGAGTTGACGTCAGCCAAAGCGAACATCAGTCTGTGCTCCGGGTTTTGTGTATAAAAGCCGTCACCACCCCG
AAAATCTGCAGTTCTTCCGGATAAAGCGTGGGGTAGGCCGGATTCATCGGCAGCAGGGCGATACGGGGCTTCAACTGCAGGCGCTTAACGGTGAACTCGCCATCTGTTTCCGCGATGACAATATCCCCCTGCATGGGGTTCTCAGCTTTGTCGACAACCATTAAATCGCCGGAATGCAGACCCATTTCTTTCATCGAATCCCCGATAGCCCTGACGAAGAATGTTGCAGCCGGACGCCGTATACAGTAGGCATTCAGATCCAGTTCCTCCTCCGTGTAATCGGCGGCCGGCGACGGAAACCCCGCCGGGCAACGTTCGGTGAACAGCGGGGCAGTGGACTGTACCGGCTCCTGTTCGGGTGCTACAAGCAGTAACAT
Protein sequences of DBSCAN-SWA_1 >CP028152|18115:44421|21958_22309_-|AWP53207.1|transposase|DBSCAN-SWA MLRATKVCIYPTPEQAEHLNAQFGAVRFVYSKSLHIKKHAYQRHGVSLTPRKDIKPLLAVAKKFRKFRKYAWLKEYDSIALQQAVINLDVAFSNCFNPKLKARFPMFKRKHGKLLG >CP028152|18115:44421|24835_26611_-|AWP53211.1|DBSCAN-SWA MLILNGFSSATLALITPPFLPKGGKALSQSGPDGLASITLPLPISAERGFAPALALHYSSGGGNGPFGVGWSCATMSIARRTSHGVPQYNDSDEFLGPDGEVLVQTLSTGDAPNPVTCFAYGDVSFPQSYTVTRYQPRTESSFYRLEYWVGNSNGDDFWLLHDSNGILHLLGKTAAARLSDPQAASHTAQWLVEESVTPAGEHIYYSYLAENGDNVDLNGNEAGRDRSAMRYLSKVQYGNATPAADLYLWTSATPAVQWLFTLVFDYGERGVDPQVPPAFTAQNSWLARQDPFSLYNYGFEIRLHRLCRQVLMFHHFPDELGEADTLVSRLLLEYDENPILTQLCAARTLAYEGDGYRRAPVNNMMPPPPPPPMMGGNSSRPKSKWAIVEESKQIQALRYYSAQGYSVINKYLRGDDYPETQAKETLLSRDYLSTNEPSDEEFKNAMSVYINDIAEGLSSLPETDHRVVYRGLKLDKPALSDVLKEYTTIGNIIIDKAFMSTSPDKAWINDTILNIYLEKGHKGRILGDVAHFKGEAEMLFPPNTKLKIESIVNCGSQDFASQLSKLRLSDDATADTNRIKRIINMRVLNS >CP028152|18115:44421|36312_37428_+|AWP53221.1|DBSCAN-SWA MRPATYEPEQIIEAGLALQAEGRNITGFALRNQVGGGNPTRLRQIWDEYQASQSTVVTEPVAELPVEVAEEVKAVSAALSERITQLATELNDKAVRAAERRVAEVTRAAGEQTAQAERELADAAQTVDDLEEKLDELQDRYDSLTLALESERSLRQQHDVEMAQLKERLAAAEENTRQREERYQEQKTVLQDALNAEQAQHKNTREDLQKRLEQISAEANARTEELKSERDKVNTLLARLESQENALASERQQHLATRETLQQRLEQAIADTQARAGEIALERDRVSSLTARLESQEKASSEQLVRMGSEIASLTDRCTQLENQRDDARLETMGEKETVAALRGEAEALKRQNQSLMAALSGNKQTGGQNA >CP028152|18115:44421|34171_34912_-|AWP53219.1|DBSCAN-SWA MEQNQPAQPSRRAILKQTLAVSALSVTGLAALSVPTISFAASLSKEERDGMTPDAVIEHFKQGNLRFRENRPAKHDYLAQKRNSIAGQYPAAVILSCIDSRAPAEIVLDAGIGETFNSRVAGNISNRDMLGSMEFACAVAGAKVVLVIGHTRCGAVRCAIDNAELGNLTGLLDEIKPAIAKTEYSGERKGSNYDFVDAVARKNVELTIENIRKNSPVLKQLEDEKKIKIVGSMYHLTGGKVEFFEV >CP028152|18115:44421|42724_43999_-|AWP53227.1|DBSCAN-SWA 
MFALADVNSFYASCEKVFRPDLRDRSVVVLSNNDGCVIARSAEAKKLGIKMGVPWFQLRSAKFPEPVIAFSSNYALYASMSNRVMVHLEELAPRVEQYSIDEMFLDIRGIDSCIDFEDFGRQLREHVRSGTGLTIGVGMGPTKTLAKSAQWASKEWSQFGGVLALTLHNQKRTEKLLSLQPVEEIWGVGRRISKKLNTMGITTALQLARANPTFIRKNFNVVLERTVRELNGESCISLEEAPPPKQQIVCSRSFGERVTTYEAMRQAVCQHAERAAEKLRGERQFCRHIAVFVKTSPFAVTEPYYGNLASEKLLIPTQDTRDIIAAAVRALDRIWVDGHRYAKAGCMLNDFTQTGVSQLNLFDEVQPRERSEQLMQVLDGINHPGKGKIWFAGRGIAPEWQMKRELLSPAYTTRWADIPAAKLT >CP028152|18115:44421|23829_24555_-|AWP53210.1|DBSCAN-SWA MPINRPNLNLNIPPLNIVAAYDGAEIPSTNKHLKNNFNSLHNQMRKMPVSHFKEALDVPDYSGMRQSGFFAMSQGFQLNNHGYDVFIHARRESPQSQGKFAGDKFHISVLRDMVPQAFQALSGLLFSEDSPVDKWKVTDMEKVVQQARVSLGAQFTLYIKPDQENSQYSASFLHKTRQFIECLESRLSENGVISGQCPESDVHPENWKYLSYRNELRSGRDGGEMQRQALREEPFYRLMTE >CP028152|18115:44421|31756_33421_-|AWP53217.1|transposase|DBSCAN-SWA MSSRRSAIPSDSLLQLRQRLDRLPPKSPERANQIAATAQLYGISVTTVYRALHLVLKPRTAHRSDHGQPRILPPSELEHYCELIAALKLRTTNKSGRHLSTGRAIQLLEEHGVETVQGLIKSPKGLLRKQTVNRWLSRWRLDQPRLLREPPAVRFQAENSNDCWQFDMSPSDLKHIERPDWVDPARGEPTLMLFSVVDDRSGVAYQEYRCVYGEDAESALRFLFNAMAPKTRSDFPFQGRPKLLYLDNGPVAKNHVFQNVMQSLKVDWLTHTPAGKDGPRTTARSKGKVERPFRTVKEAHETLYHFHKPETELQANEWLWNYLSRYNAQRHRSEKHSRLEDWLANIGQEGVRDMCSWEQYCRFAREPESRKVGVDARITIDGTAWEVEPDMAGETVILLWGLFDEEMYVEFTGETWGPYYPVSGPVPLHRYRTFRRGKAAERADRIHALARQLNIPISALSGSDLRVVSDDTQQRIDALPHQPFDTRKFEYHFPTVIAAKLAIADDLAIPLARMSDEDRAFIDSILTETLNRSEVLARIRDYFRSRQSGEDHAG >CP028152|18115:44421|22365_22791_+|AWP53208.1|transposase|DBSCAN-SWA MSNHHESLEGFLRKRHSVSKLVVHLIFTTKYRCKLFDGQIIAQLRDAFGSAAAKLECEIIEMDGEQDHVHLLIAYPLKLGVSVMVNNLKSVSSRLLRQQNTHLRMQSKTGLLWSRSYFACSAGGATIETLKAYVLRQNTPE >CP028152|18115:44421|20126_20612_+|AWP53205.1|DBSCAN-SWA MLIIVSDCQFTRLALTRLLAHLDPVNMSVARWLQTAPPAGSHVLLAASPGMLASLVPACHHARTSLSLKLALLGSGGQAVFLNTLGLRPDCLLPRTASATQLKAAVSSWLRRARGWRQATTESLSLRERQALCATLAGLSARTAPGSASAPKPFTAIAAAR >CP028152|18115:44421|18115_18898_+|AWP53202.1|integrase|DBSCAN-SWA 
MSQPPLPAVCTQAASALLPVAIDYPAALALRQMAMQHDDYPKYLLAPEVSALLHYVPDLHRRMLLATLWNTGARINEALALTRGDFSLAPPYPFVQLATLKQRAEKAARTAGRMPSGSQPHRLVPLSDNQYVSELQMMVATLKIPLERRNRRTGRTEKARLWEITDRTVRTWIGEAVEAAAADDVTFSVPVTPHTFRHSYAMHMLYAGIPLKVLQALMGHKSVSSTEVYTKVFALDVAARHRVQFQMPGADAVAMLKGGS >CP028152|18115:44421|40463_41669_+|AWP53225.1|DBSCAN-SWA MENIEQLRKVATRAGKLLTSLSESIRQQKEELKLTEFYQEYSKAALYKLPKLSKGSVEYAVAEMEASGYIFKKKPSGNTMKYAMTIQNVIDLYFHRKVPKYRDRFDKAFTIFVCNLKGGGSKTVSTASLSHAFRAHPQLLFEDLRILAIDFDPQASLTMFLSHENSVGLVENTAAQAMLQNVSREELLSDFIVSSIIPGVDVIPASIDDAFLAEGWKGLCEEHLPGQNIHAVLKENIIDKLRYDYDFIFLDSGPHLDAFLKNCIGAADLMLTPLPPATVDFHSSLKFVASLPALIDSIEQDGHTCNLIGNVGFMSKILNKSDHKICHSQAKEVFGADMLDMVLPRLDGFERCGETFDTVISANPATYDGSTEALKSAKSAAEDFAKAVFDRIEFIRTNGGM >CP028152|18115:44421|26792_27560_-|AWP53212.1|DBSCAN-SWA MNMNQTTSPALSQVETAIRVPAGNFAKYNYYSVFDIVRQTRKQFINANMSWPGSRGGKAWDLAMGQAQYIRCMFRENQLTRRVRGTLQQTLDNGTNLSSSAVGGIQGQAERRPDLATLMVVNDAINQQIPTLLPYHFPHDQVELSLLNTDVSLEDIISESSIDWPWFLSNSLTGDNSNYAMELASRLSPEQQTLPTEPDNSTATDLTSFYQTNLGLKTADYTPFEALNTFARQLAITVPPGGTVDCGYSACQPAV >CP028152|18115:44421|41665_42643_+|AWP53226.1|DBSCAN-SWA MMSNERRKTIGRQLNTQASMVEMTDTQRSQVFTLKTGRKITFRFVRVPASDVESKTFVNQETNGRDQLALTRESLKSIIQTIKFQQFFPCIGIQQGERIEILDGSRRRASAIYIRTGLDVMVTNERLSADEARHLAKDVQTAKEHNLREIGLRLMALKESGFNQKEIAELEGLSQAKVTRALQAAAVPQELISLFPVQSELSFSDYKILLEVNEKLSEKGLTSEGLIQSVSDQHDAILSDYERPDDEQKASILKLISQASQALIAPPPKEKSVISALWTFEEKDKFARKRVKGRTLTYEFSRMSKVVQDELDKAINEVLERNLSQ >CP028152|18115:44421|43998_44421_-|AWP53228.1|DBSCAN-SWA MLLLVAPEQEPVQSTAPLFTERCPAGFPSPAADYTEEELDLNAYCIRRPAATFFVRAIGDSMKEMGLHSGDLMVVDKAENPMQGDIVIAETDGEFTVKRLQLKPRIALLPMNPAYPTLYPEELQIFGVVTAFIHKTRSTD >CP028152|18115:44421|39107_40049_-|AWP53224.1|transposase|DBSCAN-SWA 
MKKKNTTPTPHDATFRQFLTQPDIARDFMELHLPAELRAICDLSTLKLESGSFVEDDLRQYFSDVLYSLKTTAGDGYIHVLVEHQSTPDKHMAFRLIRYAVAAMQRHLEAGHKKLPLVIPVLFYTGKRSPYPYSTRWLDEFDDTALAGKLYSSAFPLVDVTVIPDDEIAGHRSMAALTLLQKHIHQRDLAELVDRLAPILLAGYLSSSQVISLVHYIVQAGETSDAEAFVRELAQRVPQHGDALMTIAQQLEQKGIEKGIQLGEQRGIEKGRSEGEREATLKIARTMLQNGIDRNTVMKMTGLTEDDLAQIRH >CP028152|18115:44421|29572_30610_+|AWP53215.1|transposase|DBSCAN-SWA MPIIAAIPDEERQLMRKEAQQTYDKNHARRLIAMLMLHQGMTVTDVARLLCAARSSVGRWINWFTLHGVEGLKSLRPGRAPRWPVADILQLLPLLVQRSPKDFGWLRSRWSTELLVLVINRLFDVTLHRSTLHRYLRQADMVWRRAAPTLKIKDPHYEEKRLVIDQALAQEQTAHPVFYQDEVDIDLNPKIGADWMPKGQQKRIATPGQNQKHYLAGALHSGTGRVHYVSGSSKSSDLFISLLETLRRTYRRAKTITLVADNYIIHKSRKVERWLEENPKFRLLFLPMYSPWLNPIERQWLSLHETITRNHQCRYMWQLLKQVAQFMNAASLFPGNQQGLAKVER >CP028152|18115:44421|35566_36055_-|AWP53220.1|DBSCAN-SWA MPQDESVVKRAREYFFRHHRYTEEDLESDYQAELRNYRDDTWEAPQRAARLSAAVKRYKTYEMLYFFFQIADEAGLDYTPLVVKRLCAHLFDRQGSQNIIVDIFGQKGRMHRSHDSDPDIIAAVAERYRQQAEDHWQTVLKNIGRVKQDYQKNQNRQKGAGD >CP028152|18115:44421|19623_20133_+|AWP53204.1|DBSCAN-SWA MRTGVISMNQPLLFRTVAGRQSNHEGFYMPPGIRHLGTLSLYRAVAWWGLFLGREFTRDDVSEAFSIEPRRASGILNYICNRHNDDDICFDSRLHPVRGGRAQLVVRIRAVESRPDTIRRQRTDRPGGKVSDRQYDRQMAHWLLSRPAGGDTAKLAAWQAACPVREASC >CP028152|18115:44421|38113_38449_+|AWP53222.1|DBSCAN-SWA MENMLKTMREHFPVRTVVIEQSLDMFGVKRNEDGTLLFPRHSDKRFPVARIRKRLAAYGWRGERLNKEVARIVAGPDILPERRDYSIIFPYGWEKKQLTREMRRMFLRKTG >CP028152|18115:44421|27733_27898_-|AWP53213.1|DBSCAN-SWA MGINALAHATLLKKLNNGDYDGAANEFLKWDHASGQVVPGLTRRRSAERCLFLS >CP028152|18115:44421|28071_28965_-|AWP53214.1|DBSCAN-SWA MDFLINKKLKIFITLMETGSFSIATSVLYITRTPLSRVISDLERELKQRLFIRKNGTLIPTEFAQTIYRKVKSHYIFLHALEQEIGPTGKTKQLEIIFDEIYPESLKNLIISALTISGQKTNIMGRAVNSQIIEELCQTNNCIVISARNYFHRESLVCRTSVEGGVMLFIPKKFFLCGKPDINRLAGTPVLFHEGAKNFNLDTIYHFFEQTLGITNPAFSFDNVDLFSSLYRLQQGLAMLLIPVRVCRALGLSTDHALHIKGVALCTSLYYPTKKRETPDYRKAIKLIQQELKQSTF >CP028152|18115:44421|30798_31824_-|AWP53216.1|DBSCAN-SWA 
MKYSPASETIFVADNQERTMRVEVMEHYGLTQSIEQAGYYETAHHKQLMKDIKGAIREGRLIAVCGVVGSGKTVTLRRLQQQLLDENKIIVARSLSVDKQSVRLATLINALFYDLAQDKQVQIHKQGERRERELQELVKKGKRPVALFVDEAHDLNGNTLTGLKRLMEVVEDGGGRLSVVLAGHPKLRNDLRRPTMEEIGYRTDIFTLDGITGSQREYIQWLLKTSTGKGKPEDILTTEAVDLLAMKLRTPLQVQLHLTLAMEAGYQTGEKPITATLVESVLSRQLDDLEPTLTRHGYRLKDMVEQFDAKPAEIRALFNNQLDPARTAELRDRMLAVGLPI >CP028152|18115:44421|33404_33965_-|AWP53218.1|DBSCAN-SWA MALYGYARVSTSDQDLTLQTQILRAAGCEIIRAEKASGSGRTGRSELQLLLEFLRPGDTLMVTRVDRLARSIKDLQDIVYALNQQGVTLRATEQPVDTRSAAGKAFLDMLGVFAEFETNLRRERQMEGIAAAKARGVYRGRKPSIDPAEVYRLYTIEKMGATAIARQLGIGRASVYRALENYEQPA >CP028152|18115:44421|21331_21892_+|AWP53206.1|DBSCAN-SWA MLGANIFLDYDLSRDHARAGFGGEYWRDFLKLSANAYVGLTGWKTSPDVEDYEERPASGWDLRAEGYLPSYPQLGAKMVYEQYYGNEVGLFGKDERQKNPHALTAGVSWTPVPLLKLSAEQRAGKAGEHDTRFGAEASYRIGDSLRSQLDPDAVGALRSLAGSRYDLTDRNNDIILEYRKQEVTCQ >CP028152|18115:44421|38505_39111_+|AWP53223.1|DBSCAN-SWA MTLTEKSGHLAWCALVALALARQDGGVLSPAQENLFLTRWLATALKQRRFSRDVTPDIEWLLKQGRQMGVSAKLASKLNYLWRSCTGELSEQNDLFRLTYALETAKDMHWNYRLLSDREWSGRNAVALSAGVNGIYLSQAKLDVGFNDSGRQINSLTARLTGNVAGVMKLFDRCGWLAEPDASLPHQYSLMAGQGVPEKGD >CP028152|18115:44421|22917_23568_-|AWP53209.1|DBSCAN-SWA MRVSGSASSQDIISRINSKNINNNDSNEVKRIKDALCIESKERILYPQNLSRDNLKQMARYVNNTYVHYSGNCVLLSACLHYNIHHRQDILSSKNTASPTVGLDRAIVDKIIFGHELNQSYCLNSIDEVEKEILNRYDIKRESSFIISAENYIAPIIGECGHDFNAVVICEYDKKPYVQFIDSWKTSNILPSLQEIKKHFSSSGEFYVRAYDEKHD >CP028152|18115:44421|37513_37930_+|AWP53246.1|DBSCAN-SWA MKPAPGAEPVRMYKSPYGGKYGVWRLADCVPMRAKRPQTEKQRLASTRLGLQARMKSERGRFAMLAHTWLALGPVFLDTETTGLDAGAQALEIGLVNARGERIFETRLKPTVGIDPAAAAVQIVTEPYRFPVLSVRLA >CP028152|18115:44421|18906_19620_+|AWP53203.1|DBSCAN-SWA MHFSAFRLQQAIRNREFTPFYQPIVCATGGEVVGCEMLARWLHPQKGLLSAGNFIPAIEATGLGGALLRGLADEVCGDGQDLARSAGRRLMMTLNLSLSLVMTPLFRPHLLALSIRLEQAGMTPVFEITEREDIRAFPQAAVFRQLAAGGLRFAVDDFGTGHAGPASTVADRMIARTVSLARCQGARVIAEGIETPAQAARLRDAGGDYLQGWHCGAPMPFGLFHFRLTQKSQPAFG |
28 | Escherichia_phage(27.27%) | transposase,integrase | attL 16950:16964|attR 50429:50443 |
Homologous phage analysis in the prophage region
Bacterial proteins shown in color are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
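The keyword-based highlighting described in the note above could be approximated as follows. This is only a minimal sketch: the keyword list is taken from the note, but the matching rule (case-insensitive substring search) and the function names `matched_keywords` and `is_highlighted` are assumptions for illustration, not the tool's documented behavior.

```python
# Keywords copied from the note above; everything else here is hypothetical.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def matched_keywords(annotation):
    """Return the phage-related keywords found in an annotation string.

    Assumption: matching is a case-insensitive substring search.
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in text]


def is_highlighted(annotation):
    """True if the annotation contains at least one phage-related keyword."""
    return bool(matched_keywords(annotation))


if __name__ == "__main__":
    # Annotation labels as they appear in the protein headers of this report.
    for label in ("transposase", "integrase", "hypothetical protein"):
        print(label, is_highlighted(label), matched_keywords(label))
```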
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|