Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP007598 | Salmonella enterica subsp. enterica serovar Enteritidis str. 77-1427 chromosome, complete genome | 2 crisprs | cas3,DEDDh,WYL,DinG,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e | 0 | 14 | 8 | 0 |
NZ_CP007599 | Salmonella enterica subsp. enterica serovar Enteritidis str. 77-1427 plasmid pCFSAN000111_01, complete sequence | 2 crisprs | NA | 0 | 3 | 0 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP007598_1 | 4565718-4566295 | TypeI-E |
I-E
Consensus repeat of NZ_CP007598_1
|
9 spacers
spacers of NZ_CP007598_1
>1.1|4565747|32|NZ_CP007598|CRISPRCasFinder,CRT TATTTATAAGCGTGTCATCTATGCAACCCAAC >1.2|4565808|32|NZ_CP007598|CRISPRCasFinder,CRT ACCTGCCCGACCCAATAAGGAGGCCCTCGTGA >1.3|4565869|32|NZ_CP007598|CRISPRCasFinder,CRT GGCCGCTGGTCAAATTCCCAATCTGAGCAATC >1.4|4565930|32|NZ_CP007598|CRISPRCasFinder,CRT ATAGCCCCGGCAGCGATAGCTAAACCAGTTCC >1.5|4565991|32|NZ_CP007598|CRISPRCasFinder,CRT GCCTCAAAATCTCTCGGTGAGATGTAAGCGTC >1.6|4566052|32|NZ_CP007598|CRISPRCasFinder,CRT ACCAGTGGTCAGCGGCGGATGAATTTGCCCTG >1.7|4566113|32|NZ_CP007598|CRISPRCasFinder,CRT GAGAATGCTCATGCGCGTGAGCGCCATATATT >1.8|4566174|32|NZ_CP007598|CRISPRCasFinder,CRT CATGGCAATTTTACGGCGGACGTGCTCGCTCT >1.9|4566235|32|NZ_CP007598|CRISPRCasFinder,CRT AGGCGGACCGAAAAACCGTTTTCAGCCAACGT >1.10|4565749|32|NZ_CP007598|PILER-CR TATTTATAAGCGTGTCATCTATGCAACCCAAC >1.11|4565810|32|NZ_CP007598|PILER-CR ACCTGCCCGACCCAATAAGGAGGCCCTCGTGA >1.12|4565871|32|NZ_CP007598|PILER-CR GGCCGCTGGTCAAATTCCCAATCTGAGCAATC >1.13|4565932|32|NZ_CP007598|PILER-CR ATAGCCCCGGCAGCGATAGCTAAACCAGTTCC >1.14|4565993|32|NZ_CP007598|PILER-CR GCCTCAAAATCTCTCGGTGAGATGTAAGCGTC >1.15|4566054|32|NZ_CP007598|PILER-CR ACCAGTGGTCAGCGGCGGATGAATTTGCCCTG >1.16|4566115|32|NZ_CP007598|PILER-CR GAGAATGCTCATGCGCGTGAGCGCCATATATT >1.17|4566176|32|NZ_CP007598|PILER-CR CATGGCAATTTTACGGCGGACGTGCTCGCTCT >1.18|4566237|32|NZ_CP007598|PILER-CR AGGCGGACCGAAAAACCGTTTTCAGCCAACGT |
cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3 |
CRISPR arrays and Neighbor proteins around NZ_CP007598_1
The CRISPR arrays of NZ_CP007598_1 >merge|NZ_CP007598|1|4565718-4566295|CRISPRCasFinder,CRT,PILER-CR GTGTTTATCCCCGCTGACGCGGGGAACACTATTTATAAGCGTGTCATCTATGCAACCCAACCGGTTTATCCCCGCTGGCGCGGGGAACACACCTGCCCGACCCAATAAGGAGGCCCTCGTGACGGTTTATCCCCGCTGGCGCGGGGAACACGGCCGCTGGTCAAATTCCCAATCTGAGCAATCCGGTTTATCCCCGCTGGCGCGGGGAACACATAGCCCCGGCAGCGATAGCTAAACCAGTTCCCGGTTTATCCCCGCTGGCGCGGGGAACACGCCTCAAAATCTCTCGGTGAGATGTAAGCGTCCGGTTTATCCCCGCTGGCGCGGGGAACACACCAGTGGTCAGCGGCGGATGAATTTGCCCTGCGGTTTATCCCCGCTGGCGCGGGGAACACGAGAATGCTCATGCGCGTGAGCGCCATATATTCGGTTTATCCCCGCTGGCGCGGGGAACACCATGGCAATTTTACGGCGGACGTGCTCGCTCTCGGTTTATCCCCGCTGGCGCGGGGAACACAGGCGGACCGAAAAACCGTTTTCAGCCAACGTCGGTTTATCCCCGCTGGCGCGGGGAACAC >NZ_CP007598|1|1|4565718-4566295|CRISPRCasFinder GTGTTTATCCCCGCTGACGCGGGGAACAC TATTTATAAGCGTGTCATCTATGCAACCCAAC CGGTTTATCCCCGCTGGCGCGGGGAACAC ACCTGCCCGACCCAATAAGGAGGCCCTCGTGA CGGTTTATCCCCGCTGGCGCGGGGAACAC GGCCGCTGGTCAAATTCCCAATCTGAGCAATC CGGTTTATCCCCGCTGGCGCGGGGAACAC ATAGCCCCGGCAGCGATAGCTAAACCAGTTCC CGGTTTATCCCCGCTGGCGCGGGGAACAC GCCTCAAAATCTCTCGGTGAGATGTAAGCGTC CGGTTTATCCCCGCTGGCGCGGGGAACAC ACCAGTGGTCAGCGGCGGATGAATTTGCCCTG CGGTTTATCCCCGCTGGCGCGGGGAACAC GAGAATGCTCATGCGCGTGAGCGCCATATATT CGGTTTATCCCCGCTGGCGCGGGGAACAC CATGGCAATTTTACGGCGGACGTGCTCGCTCT CGGTTTATCCCCGCTGGCGCGGGGAACAC AGGCGGACCGAAAAACCGTTTTCAGCCAACGT CGGTTTATCCCCGCTGGCGCGGGGAACAC >NZ_CP007598|1|1|4565718-4566295|CRT GTGTTTATCCCCGCTGACGCGGGGAACAC TATTTATAAGCGTGTCATCTATGCAACCCAAC CGGTTTATCCCCGCTGGCGCGGGGAACAC ACCTGCCCGACCCAATAAGGAGGCCCTCGTGA CGGTTTATCCCCGCTGGCGCGGGGAACAC GGCCGCTGGTCAAATTCCCAATCTGAGCAATC CGGTTTATCCCCGCTGGCGCGGGGAACAC ATAGCCCCGGCAGCGATAGCTAAACCAGTTCC CGGTTTATCCCCGCTGGCGCGGGGAACAC GCCTCAAAATCTCTCGGTGAGATGTAAGCGTC CGGTTTATCCCCGCTGGCGCGGGGAACAC ACCAGTGGTCAGCGGCGGATGAATTTGCCCTG CGGTTTATCCCCGCTGGCGCGGGGAACAC GAGAATGCTCATGCGCGTGAGCGCCATATATT CGGTTTATCCCCGCTGGCGCGGGGAACAC CATGGCAATTTTACGGCGGACGTGCTCGCTCT CGGTTTATCCCCGCTGGCGCGGGGAACAC AGGCGGACCGAAAAACCGTTTTCAGCCAACGT CGGTTTATCCCCGCTGGCGCGGGGAACAC 
>NZ_CP007598|1|1|4565720-4566295|PILER-CR GTTTATCCCCGCTGACGCGGGGAACACTA TTTATAAGCGTGTCATCTATGCAACCCAACCG GTTTATCCCCGCTGGCGCGGGGAACACAC CTGCCCGACCCAATAAGGAGGCCCTCGTGACG GTTTATCCCCGCTGGCGCGGGGAACACGG CCGCTGGTCAAATTCCCAATCTGAGCAATCCG GTTTATCCCCGCTGGCGCGGGGAACACAT AGCCCCGGCAGCGATAGCTAAACCAGTTCCCG GTTTATCCCCGCTGGCGCGGGGAACACGC CTCAAAATCTCTCGGTGAGATGTAAGCGTCCG GTTTATCCCCGCTGGCGCGGGGAACACAC CAGTGGTCAGCGGCGGATGAATTTGCCCTGCG GTTTATCCCCGCTGGCGCGGGGAACACGA GAATGCTCATGCGCGTGAGCGCCATATATTCG GTTTATCCCCGCTGGCGCGGGGAACACCA TGGCAATTTTACGGCGGACGTGCTCGCTCTCG GTTTATCCCCGCTGGCGCGGGGAACACAG GCGGACCGAAAAACCGTTTTCAGCCAACGTCG GTTTATCCCCGCTGGCGCGGGGAACAC
>NZ_CP007598.1|WP_000490481.1|4564656_4565703_+|aminopeptidase MFSATRRFAVILALGVGFILPAQAASPGPGEIANTQARHIATFFPGRMTGSPAEMLSADYLRQQFTQMGYQSDIRTFNSRFIYTTKDNRKNWHNVTGSTVIAAHEGRVPQQIIIMAHLDTYAPQSDADVDANLGGLTLQGMDDNAAGLGVMLELAARLKDIPTHYGIRFIATSGEEEGKLGAENLLKRMSDAEKKNTLLVINLDNLIVGDKLYFNSGKNTPEAVRTLTRDRALAIARRYGIAANTNPGRNPSYPKGTGCCNDAEVFDKAGISVLSVEATNWNLGKKDGYQQRVKNASFPNGNSWHDVRLDNQQHIDKALPGRIERRSRDVVRIMLPLVKELAKAEKTS >NZ_CP007598.1|WP_000372384.1|4563497_4564406_-|sulfate-adenylyltransferase-subunit-CysD MDQKRLTHLRQLEAESIHIIREVAAEFANPVMLYSIGKDSSVMLHLARKAFYPGTLPFPLLHVDTGWKFREMYAFRDRTANAYGCELLVHKNPEGVAMGINPFVHGSAKHTDIMKTEGLKQALNKYGFDAAFGGARRDEEKSRAKERIYSFRDRFHRWDPKNQRPELWRNYNGQINKGESIRVFPLSNWTEQDIWQYIWLENIDIVPLYLAAERPVLERDGMLMMVDDDRIDLQPGEVIKKRMVRFRTLGCWPLTGAVESHAQTLPEIIEEMLVSTTSERQGRMIDRDQAGSMELKKRQGYF >NZ_CP007598.1|WP_001092251.1|4562048_4563488_-|sulfate-adenylyltransferase-subunit-CysN MNTILAQQIANEGGVEAWMIAQQHKSLLRFLTCGSVDDGKSTLIGRLLHDTLQIYEDQLSSLHNDSKRHGTQGEKLDLALLVDGLQAEREQGITIDVAYRYFSTEKRKFIIADTPGHEQYTRNMATGASTCDLAILLIDARKGVLDQTRRHSFISTLLGIKHLVVAINKMDLVDYCEETFARIREDYLTFAEQLPGDLDIRFVPLSALEGDNVAAQSANMRWYSGPTLLEVLETVDIQRAVDRQPMRFPVQYVNRPNLDFRGYAGTLASGSVKVGERIKVLPSGVESSVARIVTFDGDKEEACAGEAITLVLNDDIDISRGDLLLAANETLAPARHAAIDVVWMAEQPLAPGQSYDVKLAGKKTRARIEAIRYQIDINNLTQRDVESLPLNGIGLVEMTFDEPLALDIYQQNPVTGGLIFIDRLSNVTVGAGMVRELDERGATPPVEYSAFELELNALVRRHFPHWDARDLLGDKHGAA >NZ_CP007598.1|WP_001173663.1|4561456_4562062_-|adenylyl-sulfate-kinase MALHDENVVWHSHPVTVAAREQLHGHRGVVLWFTGLSGSGKSTVAGALEEALHQRGVSTYLLDGDNVRHGLCRDLGFSDADRQENIRRVGEVASLMADAGLIVLTAFISPHRAERQLVKERVGHDRFIEIYVNTPLAICEQRDPKGLYKKARAGELRNFTGIDAIYEAPDSPQVHLNGEQLVTNLVSQLLDLLRRRDIIRS >NZ_CP007598.1|WP_001537530.1|4561082_4561406_-|DUF3561-family-protein MRNSHNITFTRSDAFMVDDDATSAFPGAVVGFVSWLLALGIPFLLYGPNTLFFFLYTWPFFLALMPVSVIIGIALHLLVKGKILFSIMFTLLAVGALFGALFIWLLG >NZ_CP007598.1|WP_000517480.1|4560580_4560892_-|cell-division-protein-FtsB 
MGKLTLLLLALLVWLQYSLWFGKNGIHDYSRVNDDVVAQQATNAKLKARNDQLFAEIDDLNGGQEAIEERARNELSMTKPGETFYRLVPDASKRAATAGQTHR >NZ_CP007598.1|WP_000741653.1|4559851_4560562_-|2-C-methyl-D-erythritol-4-phosphate-cytidylyltransferase MAATLLDVCAVVPAAGFGRRMQTECPKQYLSIGNKTILEHSVHALLAHPRVTRVVIAISPGDHRFAQLPLANHPQITVVDGGNERADSVLAGLQAVAKAQWVLVHDAARPCLHQDDLARLLTISENSRVGGILASPVRDTMKRGEPGKNAIAHTVERADLWHALTPQFFPRELLHDCLTRALNEGATITDEASALEYCGFHPALVEGRADNIKVTRPEDLALAEFYLTRTIHQEKA >NZ_CP007598.1|WP_001219253.1|4559372_4559852_-|2-C-methyl-D-erythritol-2,4-cyclodiphosphate-synthase MRIGHGFDVHAFGGEGPIIIGGVRISYEKGLLAHSDGDVALHALTDALLGAAALGDIGKLFPDTDPAFKGADSRELLREAWRRIQAKGYTLGNVDVTIIAQAPKMLPHIPQMRVFIAEDLGCHMDDVNVKATTTEKLGFTGRGEGIACEAVALLMKAAK >NZ_CP007598.1|WP_000134246.1|4558326_4559376_-|tRNA-pseudouridine(13)-synthase-TruD MTEFDNLTWLHGKPQGSGLLKANPEDFVVVEDLGFTPDGEGEHILLRILKNGCNTRFVADALAKFLKIHAREVSFAGQKDKHAVTEQWLCARVPGKEMPDFSAFQLEGCKVLEYARHKRKLRLGALKGNAFTLVLREISDRRDVETRLQAIRDGGVPNYFGAQRFGIGGSNLQGALRWAQSNAPVRDRNKRSFWLSAARSALFNQIVHQRLKKPDFNQVVDGDALQLAGRGSWFVATSEELPELQRRVDEKELMITASLPGSGEWGTQRAALAFEQDAIAQETVLQSLLLREKVEASRRAMLLYPQQLSWNWWDDVTVELRFWLPAGSFATSVVRELINTMGDYAHIAE >NZ_CP007598.1|WP_001221538.1|4557584_4558346_-|5'/3'-nucleotidase-SurE MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFDNGDIAVQMGTPTDCVYLGVNALMRPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLNGYQHYDTAAAVTCALLRGLSREPLRTGRILNVNVPDLPLAQVKGIRVTRCGSRHPADKVIPQEDPRGNTLYWIGPPGDKYDAGPDTDFAAVDEGYVSVTPLHVDLTAHSAHDVVSDWLDSVGVGTQW >NZ_CP007598.1|WP_001518648.1|4566391_4566685_-|type-I-E-CRISPR-associated-endoribonuclease-Cas2 MSMVVVVTENVPPRLRGRLAVWLLEVRAGVYVGDTSKRIREMIWQQITQLGGVGNVVMAWATNTESGFEFQTWGENRRIPVDLDGLRLVSFLPVENQ >NZ_CP007598.1|WP_000144830.1|4566684_4567605_-|type-I-E-CRISPR-associated-endonuclease-Cas1 
MTFVPLNPIPLKDRTSMIFLQYGQIDVLDGAFVLIDKTGIRTHIPVGSVACIMLEPGTRVSHAAVHLASTVGTLLVWVGEAGVRVYSSGQPGGARADKLLYQAKLALDDDLRLKVVRKMYELRFREPPPARRSVEQLRGIEGSRVRATYALLAKQYGVKWHGRNYDPKDWEKGDVVNRCISAATSCLYGISEAAILAAGYAPAIGFIHSGKPLSFVYDIADIIKFESVVPKAFEIAARHPAEPDKEVRLACRDIFRSSKLTGKLIPLIEEVLAAGEIEPPQPAPDMLPPAIPEPESLGDSGHRGHG >NZ_CP007598.1|WP_000281483.1|4567601_4568252_-|type-I-E-CRISPR-associated-protein-Cas6/Cse3/CasE MYLSRITLHTSELSPAQLLHLVERGEYVMHQWLWDLFPGGKERQFLYRREELQGAFRFFVLSQEQPAASTIFDVQTRPFAPMLSAGQTLRFNLRANPTICKNGKRHDLLMEAKRQRKTQGDSQDIWSYQQQAALEWLARQGEQNGFTLREASVDAYRQQQIRREKSRQMIQFSSVDYTGVLVINEPALFLQRLAQGYGKSRAFGCGMMMIKPGDDA >NZ_CP007598.1|WP_000085107.1|4568233_4568980_-|type-I-E-CRISPR-associated-protein-Cas5/CasD MSQYLVFQLHGPMASWGVDAPGEVRHSHELPLRSALLGLLAAALGIRRDEEERLNTFNRHYQFLLCASGNPRWARDYHTVQMPKEVRKARYFSRREELQDPELLSALISRRDYYTDAWWMIAVSATPDAPYTLAQLQAALQHPVFPLYLGRKSHPLALPLAPQLLEGNAADVLREAYRWYQDQFNALKLTLPGLQNECWWEGEHDGLTANKILRRRDMPLSRQQWLFGERSVNQGPWLRKEDACISQE >NZ_CP007598.1|WP_000206417.1|4568990_4570049_-|type-I-E-CRISPR-associated-protein-Cas7/Cse4/CasC MTTFIQLHLLTAYPAANLNRDDTGAPKTVVLGGATRLRISSQSLKRAWRTSELFEQALAGHIGIRTGRIAREAAQILVDSGIDAKKAVEYVKNIANCFGKVKEDKKPKDELTNAETEQLVHISPAEFEAVKALARRLAEEKRPATEEEAELLRHDRMAVDIAMFGRMLAKKTDFNVEAACQVAHAFGVSETIIEDDFFTAVDDLRQASAEDAGAGHLGETGFGSALFYTYICIDKDLLVKNLNGNEELANKTLRAFTEAALKVSPTGKQNSFASRAYASWALAEKGTDQPRSLAAAFYEPINGTDQLNVAVKRITALHENMNEVYAQETAFKNFNVMNQQGSMKDVLDFICA >NZ_CP007598.1|WP_000117945.1|4570062_4570617_-|type-I-E-CRISPR-associated-protein-Cse2/CasB MSVVTKDDKATLRQWHDELQEKRGLRASLRRSKTVNDACLAEGLHSLLMQTHSLWKNKAPWNVTALAITAALAAHIKFIDEQKSFAAQLGQKKGGDTPVMSKLRFSHLLAVKTPDELLRQLRRAVKLLDGSVNLFSLADDIFCWCQEQNDLLNHHRRQQRPTEFLRIRWALEYYQAGDTDNEQD >NZ_CP007598.1|WP_000368579.1|4570613_4572170_-|type-I-E-CRISPR-associated-protein-Cse1/CasA 
MDNFSLLTTPWLPVRFKDGSTGKLAPVDLADENVVDIAATRADLQGAAWQFLLGLLQCSIAPKRYKNWEDIWFDGLHADVLHKALAPLEHAFQFGAESPSFMQDFEPLSGEKVSIASLLPEIPGAQTTKFNKDHFVKRGVTERFCPHCAALALFSLQLNAPAGGKGYRTGLRGGGPLTTLVELQEYQGERQTPIWRKLWLNVMPQDTADLPLPDQCDATVFPWLAATRTSEQANAVTTPEQVNKLQAYWGMPRRIRLDFATLQSGCCDICGAESDELLGFMTVKNYGVNYDGWRHPLTPYRAPVKDQNAFFSVKPQPGGLIWRDWLGLSQNNQTEANYESPAQVVKVFNARSLTDVKAGIRGFGADFDNMKIRCWYEHHFPLLMTEGLIPDLRKAVQTAARLLSLLRSALKEAWFTNAKDARGDFSFIDIDFWNLTQGRFLNLIHDLENGHKPDERLNKWQRELWLFTRCYFDDHVFTNPYESSDLERIMKARKKYFTSSAEKQSAKAAKAKKQEAAE >NZ_CP007598.1|WP_000029737.1|4572181_4574845_-|CRISPR-associated-helicase/endonuclease-Cas3 MSIYHYWGKSRRGETDGGDDYHLLCWHSLDVAAVGYWMVINNIYFIDHYLKKLGIQDKEQAAQFFAWILCWHDIGKFAHSFQQLYRHEALNIFNEPTRHYEKIAHTTLGYMLWNSWLSECPELFPPSSLSVRKSKRVMALWMPVTTGHHGRPPEAIQELDHFRQQDKDAARDFLLRIKALFPLITLPEAWDEDEGIDQFQQLSWFISAAVVLADWTGSASRYFPRTAEKMPVDTYWQQALAKAQTAITLFPSAANVSAFTGIETLFPFIQHPTPLQQKALELDINVDGAQLFILEDVTGAGKTEAALILAHRLMAAGKAQGLYFGLPTMATANAMFERMANTWLALYQPDSRPSLILAHSARRLMDRFNQSIWSVTLSGTEEPDEAQPYSQGCAAWFADSNKKALLAEVGVGTLDQAMMAVMPFKHNNLRLLGLSNKILLADEIHACDAWMSRILEGLIERQASNGNATILLSATLSQQQRDKLVAAFSRGVRRSVQAPLLGHDDYPWLTQVTQTELISQRVDTRKEVERCVDIGWLHSEEACLERIGEAVEKGNCIAWIRNSVDDAIRIYRQLQLSKVVATENLLLFHSRFAFHDRQRIESQTLNLFGKQSGAQRAGKVIIATQVIEQSLDIDCDEMISDLAPVDLLIQRAGRLQRHIRDRNGLVKKSGQDERETPVLRILAPEWDDAPRENWLSSAMRNSAYVYPDHGRMWLTQRILREQGTIRMPQSARLLIESVYGEDVNMPVGFAKTEQLQEGKFYCDRAFAGQMLLNFAPGYCAEISDSLPEKMSTRLAEESVTLWLAKIVDSVVTPYASGEHAWEMSVLRVRQSWWNKHKDEFEKLDGEPLRKWCAQQHQDKDFATVIVVTDFAACGYSANEGLIGMMGE >NZ_CP007598.1|WP_001145541.1|4575288_4576242_+|SPI-1-type-III-secretion-system-effector-SopD MPVTLSFGNHQNYTLNESRLAHLLSADKEKAIHMGGWDKVQDHFRAEKKDHALEVLHSIIHGQGRGEPGEMEVNVEDINKIYAFKRLQHLACPAHQDLFTIKMDASQTQFLLMVGDTVISQSNIKDILNISDDAVIESMSREERQLFLQICEVIGAKMTWHPELLQESISTLRKEVTGNAQIKAAVYEMMRPAEAPDHPLVEWQDSLTEDEKSMLACINAGNFEPTTQFCKIGYQEVQGEVAFSMMHPCISYLLHTYSPFAEFKPTNSGFLKKLNQDYNDYHAKKMFIDVILEKLYLTHERSLHIGKDGCSRNILLT 
>NZ_CP007598.1|WP_000039870.1|4576329_4577064_-|phosphoadenosine-phosphosulfate-reductase MSKLDLNALNELPKVDRVLALAETNAQLETLTAEERVAWALENLPGEYVLSSSFGIQAAVSLHLVNQIRPDIPVILTDTGYLFPETYQFIDELTDKLKLNLKVYRAGESPAWQEARYGKLWEQGVEGIEKYNEINKVEPMNRALKELKAQTWFAGLRREQSGSRAHLPVLAIQRGVFKVLPIIDWDNRTVYQYLQKHGLKYHPLWDQGYLSVGDTHTTRKWEPGMAEEETRFFGLKRECGLHEG |
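
The spacer records above use pipe-delimited FASTA headers such as `>2.5|4582720|33|NZ_CP007598|CRISPRCasFinder,CRT`. A minimal Python sketch for reading them follows. It assumes the five fields are spacer ID, start coordinate, spacer length, contig accession, and the detecting tool(s). That reading is inferred from the data on this page, not from documented format rules, and the names `SpacerHeader` and `parse_spacer_header` are hypothetical.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class SpacerHeader:
    """One spacer header, under the assumed five-field layout."""
    spacer_id: str          # e.g. "2.5" (array number, then spacer index)
    start: int              # start coordinate on the contig
    length: int             # spacer length in bp
    contig: str             # contig accession, e.g. "NZ_CP007598"
    tools: Tuple[str, ...]  # tool(s) that reported the spacer

    @property
    def end(self) -> int:
        # Inclusive end coordinate, assuming start is the first spacer base.
        return self.start + self.length - 1


def parse_spacer_header(header: str) -> SpacerHeader:
    """Split a header such as '>2.5|4582720|33|NZ_CP007598|CRISPRCasFinder,CRT'."""
    fields = header.strip().lstrip(">").split("|")
    if len(fields) != 5:
        raise ValueError(f"expected 5 pipe-separated fields, got {len(fields)}")
    spacer_id, start, length, contig, tools = fields
    return SpacerHeader(spacer_id, int(start), int(length), contig,
                        tuple(tools.split(",")))


hit = parse_spacer_header(">2.5|4582720|33|NZ_CP007598|CRISPRCasFinder,CRT")
print(f"{hit.contig}:{hit.start}-{hit.end}")
```

Under these assumptions the computed region for spacer 2.5 agrees with the `Spacer_region` value listed for it in the hit table on this page (4582720-4582752).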
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP007598_2 | 4582447-4582964 | TypeI-E |
I-E
Consensus repeat of NZ_CP007598_2
|
8 spacers
spacers of NZ_CP007598_2
>2.1|4582476|32|NZ_CP007598|CRISPRCasFinder,CRT GGCTACACGCAAAAATTCCAGTCGTTGGCGCA >2.2|4582537|32|NZ_CP007598|CRISPRCasFinder,CRT CCGATTAAGATCCGCAGTCTGCATCAGTAACT >2.3|4582598|32|NZ_CP007598|CRISPRCasFinder,CRT CGATTCTACGGCAACAGGCCAGGCTGCGACCG >2.4|4582659|32|NZ_CP007598|CRISPRCasFinder,CRT ATCAAACATGGAAACCCCTTTAATGAGAGCAA >2.5|4582720|33|NZ_CP007598|CRISPRCasFinder,CRT TCAGGAACGCGCGGCGGAAGAGCTTGGTGTTTG >2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT GCTGCCTTTCCCGGAGTTCCGGCCCCTAAATT >2.7|4582843|32|NZ_CP007598|CRISPRCasFinder,CRT TCATGCGCTATAAAAATCAGACTGTCACATGC >2.8|4582904|32|NZ_CP007598|CRISPRCasFinder,CRT GAATCTGGAGGCCAACAGCGCGGCGAAATCCT >2.9|4582537|34|NZ_CP007598|PILER-CR CCGATTAAGATCCGCAGTCTGCATCAGTAACTCG >2.10|4582598|34|NZ_CP007598|PILER-CR CGATTCTACGGCAACAGGCCAGGCTGCGACCGCG >2.11|4582659|34|NZ_CP007598|PILER-CR ATCAAACATGGAAACCCCTTTAATGAGAGCAACG >2.12|4582720|35|NZ_CP007598|PILER-CR TCAGGAACGCGCGGCGGAAGAGCTTGGTGTTTGCG >2.13|4582782|34|NZ_CP007598|PILER-CR GCTGCCTTTCCCGGAGTTCCGGCCCCTAAATTGG >2.14|4582843|34|NZ_CP007598|PILER-CR TCATGCGCTATAAAAATCAGACTGTCACATGCCG >2.15|4582904|34|NZ_CP007598|PILER-CR GAATCTGGAGGCCAACAGCGCGGCGAAATCCTCA |
cas3,cas8e,cse2gr11,cas7 |
CRISPR arrays and Neighbor proteins around NZ_CP007598_2
The CRISPR arrays of NZ_CP007598_2 >merge|NZ_CP007598|2|4582447-4582964|CRISPRCasFinder,CRT,PILER-CR ACGGCTATCCTTGTTGGCGCGGGGAACACGGCTACACGCAAAAATTCCAGTCGTTGGCGCACGGTTTATCCCCGCTGGCGCGGGGAACACCCGATTAAGATCCGCAGTCTGCATCAGTAACTCGGTTTATCCCCGCTGGCGAGGGGAACACCGATTCTACGGCAACAGGCCAGGCTGCGACCGCGGTTTATCCCCGCTGGCGCGGGGAACACATCAAACATGGAAACCCCTTTAATGAGAGCAACGGTTTATCCCCGCTGGCGCGGGGAACACTCAGGAACGCGCGGCGGAAGAGCTTGGTGTTTGCGGTTTATCCCCGCTGGCGCGGGGAACACGCTGCCTTTCCCGGAGTTCCGGCCCCTAAATTGGGTTTATCCCCGCTGGCGCGGGGAACACTCATGCGCTATAAAAATCAGACTGTCACATGCCGGTTTATCCCCGCTGGCGCGGGGAACACGAATCTGGAGGCCAACAGCGCGGCGAAATCCTCAGTTTATCCCCGCTGGCGCGGGGAACAC >NZ_CP007598|2|2|4582447-4582964|CRISPRCasFinder ACGGCTATCCTTGTTGGCGCGGGGAACAC GGCTACACGCAAAAATTCCAGTCGTTGGCGCA CGGTTTATCCCCGCTGGCGCGGGGAACAC CCGATTAAGATCCGCAGTCTGCATCAGTAACT CGGTTTATCCCCGCTGGCGAGGGGAACAC CGATTCTACGGCAACAGGCCAGGCTGCGACCG CGGTTTATCCCCGCTGGCGCGGGGAACAC ATCAAACATGGAAACCCCTTTAATGAGAGCAA CGGTTTATCCCCGCTGGCGCGGGGAACAC TCAGGAACGCGCGGCGGAAGAGCTTGGTGTTTG CGGTTTATCCCCGCTGGCGCGGGGAACAC GCTGCCTTTCCCGGAGTTCCGGCCCCTAAATT GGGTTTATCCCCGCTGGCGCGGGGAACAC TCATGCGCTATAAAAATCAGACTGTCACATGC CGGTTTATCCCCGCTGGCGCGGGGAACAC GAATCTGGAGGCCAACAGCGCGGCGAAATCCT CAGTTTATCCCCGCTGGCGCGGGGAACAC >NZ_CP007598|2|2|4582447-4582964|CRT ACGGCTATCCTTGTTGGCGCGGGGAACAC GGCTACACGCAAAAATTCCAGTCGTTGGCGCA CGGTTTATCCCCGCTGGCGCGGGGAACAC CCGATTAAGATCCGCAGTCTGCATCAGTAACT CGGTTTATCCCCGCTGGCGAGGGGAACAC CGATTCTACGGCAACAGGCCAGGCTGCGACCG CGGTTTATCCCCGCTGGCGCGGGGAACAC ATCAAACATGGAAACCCCTTTAATGAGAGCAA CGGTTTATCCCCGCTGGCGCGGGGAACAC TCAGGAACGCGCGGCGGAAGAGCTTGGTGTTTG CGGTTTATCCCCGCTGGCGCGGGGAACAC GCTGCCTTTCCCGGAGTTCCGGCCCCTAAATT GGGTTTATCCCCGCTGGCGCGGGGAACAC TCATGCGCTATAAAAATCAGACTGTCACATGC CGGTTTATCCCCGCTGGCGCGGGGAACAC GAATCTGGAGGCCAACAGCGCGGCGAAATCCT CAGTTTATCCCCGCTGGCGCGGGGAACAC >NZ_CP007598|2|2|4582510-4582964|PILER-CR GTTTATCCCCGCTGGCGCGGGGAACAC CCGATTAAGATCCGCAGTCTGCATCAGTAACTCG GTTTATCCCCGCTGGCGAGGGGAACAC CGATTCTACGGCAACAGGCCAGGCTGCGACCGCG GTTTATCCCCGCTGGCGCGGGGAACAC 
ATCAAACATGGAAACCCCTTTAATGAGAGCAACG GTTTATCCCCGCTGGCGCGGGGAACAC TCAGGAACGCGCGGCGGAAGAGCTTGGTGTTTGCG GTTTATCCCCGCTGGCGCGGGGAACAC GCTGCCTTTCCCGGAGTTCCGGCCCCTAAATTGG GTTTATCCCCGCTGGCGCGGGGAACAC TCATGCGCTATAAAAATCAGACTGTCACATGCCG GTTTATCCCCGCTGGCGCGGGGAACAC GAATCTGGAGGCCAACAGCGCGGCGAAATCCTCA GTTTATCCCCGCTGGCGCGGGGAACAC
>NZ_CP007598.1|WP_001207998.1|4581550_4582348_-|MBL-fold-metallo-hydrolase MALRIRVLLENHKGAGADKSLKARPGLSLLVEDESTSILFDTGPDGSFMQNALAMGIDLSDVSAVVLSHGHYDHCGGVPWLPDNSRIICHPDIARERYAAMTFLGITRKIKKLSCEVDYSRYRMMYTRDPLPIGKNFIWSGEIPVVAPEAYGIFGGHDAEPDSILDEGVLIYQSTKGLVIITGCGHRGIANIVRHCQNITGIKRIYALVGGFHLRCASPFTLWRVRRFLQEQKPEKLCGCHCTGAWGRLWLPEITAPATGDVLRF >NZ_CP007598.1|WP_000108313.1|4581100_4581463_+|6-carboxytetrahydropterin-synthase-QueD MSTTLYKDFTFEAAHRLPHVPEGHKCGRLHGHSFMVRLEITGEVDPHTGWIMDFADLKAAFKPTYDRLDHYYLNDIPGLSNPTSEVLAKWIWDQVKPVVPLLSAVMVKETCTAGCVYRGE >NZ_CP007598.1|WP_000210932.1|4578877_4580677_-|NADPH-dependent-assimilatory-sulfite-reductase-flavoprotein-subunit MTTPAPLTGLLPLNPEQLARLQAATTDLTPEQLAWVSGYFWGVLNPRSGVVAVTPVPERKMPGVTLISASQTGNARRVAEALRDDLLAANLNVTLVNAGDYKFKQIASEKLLVIVTSTQGEGEPPEEAVALHKFLFSKKAPKLENTAFAVFSLGDTSYEFFCQSGKDFDSKLAELGGERLLDRVDADVEYQAAASEWRARVVDVLKSRAPVAAPSQSVATGAVNDIHTSPYTKDAPLIATLSVNQKITGRNSEKDVRHIEIDLGDSGLRYQPGDALGVWYQNDPALVKELVELLWLKGDEPVTVDGKTLPLAEALEWHFELTVNTANIVENYATLTRSESLLPLVGDKAQLQHYAATTPIVDMVRFSPAQLDAEALIGLLRPLTPRLYSIASAQAEVESEVHVTVGVVRYDIEGRARAGGASSFLADRVEEEGEVRVFIEHNDNFRLPANPQTPVIMIGPGTGIAPFRAFMQQRAADGAEGKNWLFFGNPHFTEDFLYQVEWQRYVKEGVLSRIDLAWSRDQKEKIYVQDKLREQGAELWCWINDGAHIYVCGDARRMAADVEKALLEVIAEFGGMDLESADEYLSELRVERRYQRDVY >NZ_CP007598.1|WP_001290670.1|4577165_4578878_-|assimilatory-sulfite-reductase-(NADPH)-hemoprotein-subunit MSEKHPGPLVVEGKLSDAERMKLESNYLRGTIAEDLNDGLTGGFKGDNFLLIRFHGMYQQDDRDIRAERAEQKLEPRHAMLLRCRLPGGVITTTQWQAIDKFAADNTIYGSIRLTNRQTFQFHGILKKNVKPVHQMLHSVGLDALATANDMNRNVLCTSNPYESQLHAEAYEWAKKISEHLLPRTRAYAEIWLDQEKVAITDEEPILGQTYLPRKFKTTVVIPPQNDIDLHANDMNFVAIAENGKLVGFNLLVGGGLSIEHGNKKTYARTASEFGYLPLEHTLAVAEAVVTTQRDWGNRTDRKNAKTKYTLERVGLETFKAEVERRAGIKFEPIRPYEFTGRGDRIGWVKGIDNNWHLTLFIENGRILDYPGRPLKTGLLEIAKIHQGEFRITANQNLIIASVPESQKAKIETLARDHGLMNAVSAQRENSMACVSFPTCPLAMAEAERFLPSFTDKVEAILEKHGIPDEHIVMRVTGCPNGCGRAMLAEIGLVGKAPGRYNLHLGGNRIGTRIPRMYQENITEPDILASLDELIGRWAKEREAGEGFGDFTVRAGIIRPVLDPARDFWE 
>NZ_CP007598.1|WP_000039870.1|4576329_4577064_-|phosphoadenosine-phosphosulfate-reductase MSKLDLNALNELPKVDRVLALAETNAQLETLTAEERVAWALENLPGEYVLSSSFGIQAAVSLHLVNQIRPDIPVILTDTGYLFPETYQFIDELTDKLKLNLKVYRAGESPAWQEARYGKLWEQGVEGIEKYNEINKVEPMNRALKELKAQTWFAGLRREQSGSRAHLPVLAIQRGVFKVLPIIDWDNRTVYQYLQKHGLKYHPLWDQGYLSVGDTHTTRKWEPGMAEEETRFFGLKRECGLHEG >NZ_CP007598.1|WP_001145541.1|4575288_4576242_+|SPI-1-type-III-secretion-system-effector-SopD MPVTLSFGNHQNYTLNESRLAHLLSADKEKAIHMGGWDKVQDHFRAEKKDHALEVLHSIIHGQGRGEPGEMEVNVEDINKIYAFKRLQHLACPAHQDLFTIKMDASQTQFLLMVGDTVISQSNIKDILNISDDAVIESMSREERQLFLQICEVIGAKMTWHPELLQESISTLRKEVTGNAQIKAAVYEMMRPAEAPDHPLVEWQDSLTEDEKSMLACINAGNFEPTTQFCKIGYQEVQGEVAFSMMHPCISYLLHTYSPFAEFKPTNSGFLKKLNQDYNDYHAKKMFIDVILEKLYLTHERSLHIGKDGCSRNILLT >NZ_CP007598.1|WP_000029737.1|4572181_4574845_-|CRISPR-associated-helicase/endonuclease-Cas3 MSIYHYWGKSRRGETDGGDDYHLLCWHSLDVAAVGYWMVINNIYFIDHYLKKLGIQDKEQAAQFFAWILCWHDIGKFAHSFQQLYRHEALNIFNEPTRHYEKIAHTTLGYMLWNSWLSECPELFPPSSLSVRKSKRVMALWMPVTTGHHGRPPEAIQELDHFRQQDKDAARDFLLRIKALFPLITLPEAWDEDEGIDQFQQLSWFISAAVVLADWTGSASRYFPRTAEKMPVDTYWQQALAKAQTAITLFPSAANVSAFTGIETLFPFIQHPTPLQQKALELDINVDGAQLFILEDVTGAGKTEAALILAHRLMAAGKAQGLYFGLPTMATANAMFERMANTWLALYQPDSRPSLILAHSARRLMDRFNQSIWSVTLSGTEEPDEAQPYSQGCAAWFADSNKKALLAEVGVGTLDQAMMAVMPFKHNNLRLLGLSNKILLADEIHACDAWMSRILEGLIERQASNGNATILLSATLSQQQRDKLVAAFSRGVRRSVQAPLLGHDDYPWLTQVTQTELISQRVDTRKEVERCVDIGWLHSEEACLERIGEAVEKGNCIAWIRNSVDDAIRIYRQLQLSKVVATENLLLFHSRFAFHDRQRIESQTLNLFGKQSGAQRAGKVIIATQVIEQSLDIDCDEMISDLAPVDLLIQRAGRLQRHIRDRNGLVKKSGQDERETPVLRILAPEWDDAPRENWLSSAMRNSAYVYPDHGRMWLTQRILREQGTIRMPQSARLLIESVYGEDVNMPVGFAKTEQLQEGKFYCDRAFAGQMLLNFAPGYCAEISDSLPEKMSTRLAEESVTLWLAKIVDSVVTPYASGEHAWEMSVLRVRQSWWNKHKDEFEKLDGEPLRKWCAQQHQDKDFATVIVVTDFAACGYSANEGLIGMMGE >NZ_CP007598.1|WP_000368579.1|4570613_4572170_-|type-I-E-CRISPR-associated-protein-Cse1/CasA 
MDNFSLLTTPWLPVRFKDGSTGKLAPVDLADENVVDIAATRADLQGAAWQFLLGLLQCSIAPKRYKNWEDIWFDGLHADVLHKALAPLEHAFQFGAESPSFMQDFEPLSGEKVSIASLLPEIPGAQTTKFNKDHFVKRGVTERFCPHCAALALFSLQLNAPAGGKGYRTGLRGGGPLTTLVELQEYQGERQTPIWRKLWLNVMPQDTADLPLPDQCDATVFPWLAATRTSEQANAVTTPEQVNKLQAYWGMPRRIRLDFATLQSGCCDICGAESDELLGFMTVKNYGVNYDGWRHPLTPYRAPVKDQNAFFSVKPQPGGLIWRDWLGLSQNNQTEANYESPAQVVKVFNARSLTDVKAGIRGFGADFDNMKIRCWYEHHFPLLMTEGLIPDLRKAVQTAARLLSLLRSALKEAWFTNAKDARGDFSFIDIDFWNLTQGRFLNLIHDLENGHKPDERLNKWQRELWLFTRCYFDDHVFTNPYESSDLERIMKARKKYFTSSAEKQSAKAAKAKKQEAAE >NZ_CP007598.1|WP_000117945.1|4570062_4570617_-|type-I-E-CRISPR-associated-protein-Cse2/CasB MSVVTKDDKATLRQWHDELQEKRGLRASLRRSKTVNDACLAEGLHSLLMQTHSLWKNKAPWNVTALAITAALAAHIKFIDEQKSFAAQLGQKKGGDTPVMSKLRFSHLLAVKTPDELLRQLRRAVKLLDGSVNLFSLADDIFCWCQEQNDLLNHHRRQQRPTEFLRIRWALEYYQAGDTDNEQD >NZ_CP007598.1|WP_000206417.1|4568990_4570049_-|type-I-E-CRISPR-associated-protein-Cas7/Cse4/CasC MTTFIQLHLLTAYPAANLNRDDTGAPKTVVLGGATRLRISSQSLKRAWRTSELFEQALAGHIGIRTGRIAREAAQILVDSGIDAKKAVEYVKNIANCFGKVKEDKKPKDELTNAETEQLVHISPAEFEAVKALARRLAEEKRPATEEEAELLRHDRMAVDIAMFGRMLAKKTDFNVEAACQVAHAFGVSETIIEDDFFTAVDDLRQASAEDAGAGHLGETGFGSALFYTYICIDKDLLVKNLNGNEELANKTLRAFTEAALKVSPTGKQNSFASRAYASWALAEKGTDQPRSLAAAFYEPINGTDQLNVAVKRITALHENMNEVYAQETAFKNFNVMNQQGSMKDVLDFICA >NZ_CP007598.1|WP_001199961.1|4583260_4583932_-|7-carboxy-7-deazaguanine-synthase-QueE MQYPINEMFQTLQGEGYFTGVPAIFIRLQGCPVGCAWCDTKHTWDKLSDREVSLFSILAKTKESDKWGAASSEDLLAVINRQGYTARHVVITGGEPCIHDLMPLTDLLEKSGFSCQIETSGTHEVRCTPNTWVTVSPKVNMRGGYDVLSQALERANEIKHPVGRVRDIEALDELLATLSDDKPRVIALQPISQKEDATRLCIETCIARNWRLSMQTHKYLNIA >NZ_CP007598.1|WP_000036734.1|4584067_4585366_-|phosphopyruvate-hydratase 
MSKIVKVIGREIIDSRGNPTVEAEVHLEGGFVGMAAAPSGASTGSREALELRDGDKSRFLGKGVTKAVGAVNGPIAQAILGKDAKDQAGIDKIMIDLDGTENKSNFGANAILAVSLANAKAAAAAKGMPLYEHIAELNGTPGKYSMPVPMMNIINGGEHADNNVDIQEFMIQPVGAKTVKEAIRMGSEVFHHLAKVLKGKGMNTAVGDEGGYAPNLGSNAEALAVIAEAVKAAGYELGKDITLAMDCAASEFYKDGKYVLAGEGNKAFTSEEFTHFLEELTKQYPIVSIEDGLDESDWDGFAYQTKVLGDKIQLVGDDLFVTNTKILKEGIEKGIANSILIKFNQIGSLTETLAAIKMAKDAGYTAVISHRSGETEDATIADLAVGTAAGQIKTGSMSRSDRVAKYNQLIRIEEALGEKAPYNGRKEIKGQA >NZ_CP007598.1|WP_000210863.1|4585448_4587086_-|CTP-synthase-(glutamine-hydrolyzing) MTTNYIFVTGGVVSSLGKGIAAASLAAILEARGLNVTIMKLDPYINVDPGTMSPIQHGEVFVTEDGAETDLDLGHYERFIRTKMSRRNNFTTGRIYSDVLRKERRGDYLGATVQVIPHITNAIKERVLEGGEGHDVVLVEIGGTVGDIESLPFLEAIRQLAVDIGREHALFMHLTLVPYLAAAGEVKTKPTQHSVKELLSIGIQPDILICRSDRAVPANERAKIALFCNVPEKAVISMKDVDSIYKIPGLLKSQGLDDYICKRFSLNCPEANLSEWEQVIYEEANPAGEVTIGMVGKYIELPDAYKSVIEALKHGGLKNRVTVNIKLIDSQDVETRGVEILKDLDAILIPGGFGYRGVEGKIATARYARENNIPYLGICLGMQVALIEFARNVAGMDNANSTEFVPDCKYPVVALITEWRDEDGNVEVRSEKSDLGGTMRLGAQQCQLSDDSLVRQLYGASTIVERHRHRYEVNNMLLKQIEAAGLRVAGRSGDDQLVEIIEVPNHPWFVACQFHPEFTSTPRDGHPLFAGFVKAANEHQKRQAK >NZ_CP007598.1|WP_000210451.1|4587313_4588114_-|nucleoside-triphosphate-pyrophosphohydrolase MTTNHQIDRLLTLMQRLRDPENGCPWDKEQTFASIAPYTLEETYEVLDAIAREDFDDLRGELGDLLFQVVFYAQMAQEEGRFDFNDICAAISDKLERRHPHVFGELSADNSEEALVRWEQIKTEERAQKAQHSALDDIPRSLPALMRAQKIQKRCSNVGFDWTTLGPVVDKVYEEIDEVMFEARQAVVDQAKLEEEMGDLLFATVNMARHLGTKAELALQKANDKFERRFREVERIVAARGLEMTGVDLETMEEVWQEVKRQEIDL >NZ_CP007598.1|WP_000842512.1|4588720_4589308_+|fimbrial-protein MKSSHFCKLAVTASLVMGIVSGAQAAGSNTAKVTFLGNIVDSPCSVTLDTEDQTVNMGSSIGNGTLSNGKTTINNARTFHIDLEGCTWATEKNMNVVFTTGSGTTAATGATDNLALMKTDGTGAISNVSLAIGDAGKNNIKLGDTYTQAIADLDGDTILDEKQSLNFTAWLVGAATGTVGTGEFSSAANVTISYL >NZ_CP007598.1|WP_077907486.1|4589387_4592087_+|fimbrial-biogenesis-outer-membrane-usher-protein 
MMNNTWKSVLCPIACGVGMLLSLSPYSASGKDIEFNTDFLDVKNRDNVNIAQFSRKGFILPGVYLLQIKINGQTLPQEFPVNWVIPEHDPQGSEVCAEPELVTQLGIKPELAEKLVWITHGERQCLAPDSLKGMDFQADLGHSTLLVNLPQAYTEYSDVDWDPPARWDNGIPGIILDYNINNQLRHDQESGSEEQSISGNGTLGANLGAWRLRADWQASYDHRDDDENTSTLHDQSWSRYYAYRALPTLGAKLTLGESYLQSDVFDSFNYIGASVVSDDQMLPPKLRGYAPEIVGIARSNAKVKVSWQGRVLYETQVPAGPFRIQDLNQSVSGTLHVTVEEQNGQTQEFDVNTASVPFLTRPGMVRYKMALGRPQDWDHHPITGTFASAEASWGVTNGWSLYGGAIGESNYQAVALGSGKDLGVVGAVAVDITHSIAHMPQDDGFDGETLQGNSYRISYSRDFDEIDSRLTFAGYRFSEKNFMSMSDYLDAKTYHHLNAGHEKERYTVTYNQNFREQGMSAYFSYSRSTFWDSPDQSNYNLSLSWYFDLGSIKNLSASLNGYRSEYNGDKDDGVYISLSVPWGNDSISYNGTFNGSQHRNQLGYSGHSQNGDNWQLHVGQDEQGAQADGYYSHQGALTDIDLSADYEEGSYRSLGMSLRGGMTLTTQGGALHRGSLAGSTRLLVDTDGIADVPVSGNGSPTSTNIFGKAVIADVGSYSRSLARIDLNKLPEKAEATKSVVQITLTEGAIGYRHFDVVSGEKMMAVFRLADGDFPPFGAEVKNERQQQLGLVADDGNAWLAGVKAGETLKVFWDGAAQCEASLPSTFTPELLANALLLPCKMLEGQPPTAPQKSSPLPAQPLIQEHTQTDGQPAAPVATTTQTPPIPLADNHAVNRKDME >NZ_CP007598.1|WP_001044459.1|4592099_4592873_+|fimbria/pilus-periplasmic-chaperone MNKTNHFKRQALIASVLLAAPLVSHSAIVPDRTRVIFNGNENSITVTLKNGNATLPYLAQAWLEDDKFAKDTRYFTALPPLQRIEPKSDGQVKVQPLPAAASLPQDRESLFYFNVREIPPKSDKPNTLQLALQTRIKFFYRPVAVARQVDKTHPWQTKLTLTYQGDGVIFDNPTPFYLVISNAGSKENETASGFKNLLIAPREKVTSPIKGASLGSSPVVGYVDDYGGHRLLVFTCSGNTCKVNEEKTRDAEKKANK >NZ_CP007598.1|WP_000178270.1|4592892_4593399_+|fimbrial-protein MTMLTRWKMLVLLCGGFVTGTEAAGTKTVQLELHLVVTQPPPCTVGGASVEFGDVLTTKVGDASQTKPVGYSLNCDGRASDYLKLQIQGTTTTISGEQVLQTSVQGLGIRIQQAGNKQLVPVGITDWLNFTLSGSNGPELEAVPVKEPTTQLAGGDFNASATLVVDYQ >NZ_CP007598.1|WP_000832393.1|4593413_4593884_+|fimbrial-protein MKRVLILTLLITQFACADNLTFHGKLINPPACTINNGEMLEVSFGSVIIDNIDGVNYLTEIPWTLTCDSSFRDDALTFTLSYLGTATPYSAKALTTSVPELGIELQQNGTVFPPGTSLTINESSLPTLKAVPVKQPGKEPAEGDFEAFATLQVDYQ >NZ_CP007598.1|WP_001079646.1|4593880_4594417_+|fimbrial-protein MNRIFQTAGHLIGGVMLWAVCNTLPAATPNVHYSGKLVAGACNLVVDNDTMATVDFHTIGSDNFDASGQTTPVPFTLSLQDCKTALANGVLVTFQGVEDSTLPGLLALEPSSEASGFAIGVETAAQQPVSINATVGTAFVLKEGITTINLQARLQKYAGEEVMPGEFSGSATVSFEYQ |
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP007598_2 | 2.5|4582720|33|NZ_CP007598|CRISPRCasFinder,CRT | 4582720-4582752 | 33 | NZ_CP032236 | Yersinia ruckeri strain NHV_3758 plasmid pYR4, complete sequence | 63403-63435 | 4 | 0.879 |
NZ_CP007598_2 | 2.5|4582720|33|NZ_CP007598|CRISPRCasFinder,CRT | 4582720-4582752 | 33 | NZ_LN681230 | Yersinia ruckeri strain CSF007-82 plasmid pYR3, complete sequence | 91355-91387 | 4 | 0.879 |
NZ_CP007598_2 | 2.12|4582720|35|NZ_CP007598|PILER-CR | 4582720-4582754 | 35 | NZ_CP032236 | Yersinia ruckeri strain NHV_3758 plasmid pYR4, complete sequence | 63403-63437 | 5 | 0.857 |
NZ_CP007598_2 | 2.12|4582720|35|NZ_CP007598|PILER-CR | 4582720-4582754 | 35 | NZ_LN681230 | Yersinia ruckeri strain CSF007-82 plasmid pYR3, complete sequence | 91353-91387 | 5 | 0.857 |
NZ_CP007598_2 | 2.13|4582782|34|NZ_CP007598|PILER-CR | 4582782-4582815 | 34 | NZ_CP044178 | Salmonella enterica subsp. enterica serovar Concord strain AR-0407 plasmid pAR-0407-1 | 18967-19000 | 5 | 0.853 |
NZ_CP007598_2 | 2.13|4582782|34|NZ_CP007598|PILER-CR | 4582782-4582815 | 34 | CP053324 | Salmonella enterica subsp. salamae serovar 40:c:e,n,z15 strain 2013K-0524 plasmid unnamed, complete sequence | 25136-25169 | 5 | 0.853 |
NZ_CP007598_2 | 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT | 4582782-4582813 | 32 | NZ_CP044178 | Salmonella enterica subsp. enterica serovar Concord strain AR-0407 plasmid pAR-0407-1 | 18969-19000 | 6 | 0.812 |
NZ_CP007598_2 | 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT | 4582782-4582813 | 32 | CP053324 | Salmonella enterica subsp. salamae serovar 40:c:e,n,z15 strain 2013K-0524 plasmid unnamed, complete sequence | 25138-25169 | 6 | 0.812 |
NZ_CP007598_2 | 2.13|4582782|34|NZ_CP007598|PILER-CR | 4582782-4582815 | 34 | NZ_LN890526 | Salmonella enterica subsp. enterica serovar Weltevreden strain 2511STDY5712385 plasmid 3, complete sequence | 31716-31749 | 6 | 0.824 |
NZ_CP007598_2 | 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT | 4582782-4582813 | 32 | NZ_LN890526 | Salmonella enterica subsp. enterica serovar Weltevreden strain 2511STDY5712385 plasmid 3, complete sequence | 31718-31749 | 7 | 0.781 |
NZ_CP007598_2 | 2.3|4582598|32|NZ_CP007598|CRISPRCasFinder,CRT | 4582598-4582629 | 32 | MN694003 | Marine virus AFVG_250M677, complete genome | 17629-17660 | 8 | 0.75 |
NZ_CP007598_2 | 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT | 4582782-4582813 | 32 | NZ_CP053022 | Sphingobium yanoikuyae strain YC-XJ2 plasmid p-A-Sy, complete sequence | 329022-329053 | 8 | 0.75 |
NZ_CP007598_2 | 2.10|4582598|34|NZ_CP007598|PILER-CR | 4582598-4582631 | 34 | MN694003 | Marine virus AFVG_250M677, complete genome | 17627-17660 | 8 | 0.765 |
NZ_CP007598_1 | 1.1|4565747|32|NZ_CP007598|CRISPRCasFinder,CRT | 4565747-4565778 | 32 | NZ_MG266000 | Clostridioides difficile strain 7032985 plasmid pCD-ISS1, complete sequence | 5501-5532 | 9 | 0.719 |
NZ_CP007598_1 | 1.3|4565869|32|NZ_CP007598|CRISPRCasFinder,CRT | 4565869-4565900 | 32 | MK449011 | Streptococcus phage Javan92, complete genome | 36157-36188 | 9 | 0.719 |
NZ_CP007598_1 | 1.3|4565869|32|NZ_CP007598|CRISPRCasFinder,CRT | 4565869-4565900 | 32 | MK448835 | Streptococcus phage Javan93, complete genome | 36157-36188 | 9 | 0.719 |
NZ_CP007598_1 | 1.3|4565869|32|NZ_CP007598|CRISPRCasFinder,CRT | 4565869-4565900 | 32 | MK448836 | Streptococcus phage Javan95, complete genome | 37400-37431 | 9 | 0.719 |
NZ_CP007598_1 | 1.3|4565869|32|NZ_CP007598|CRISPRCasFinder,CRT | 4565869-4565900 | 32 | MK448825 | Streptococcus phage Javan639, complete genome | 37400-37431 | 9 | 0.719 |
NZ_CP007598_1 | 1.7|4566113|32|NZ_CP007598|CRISPRCasFinder,CRT | 4566113-4566144 | 32 | KY006853 | Erythrobacter phage vB_EliS_R6L, complete genome | 41418-41449 | 9 | 0.719 |
NZ_CP007598_1 | 1.10|4565749|32|NZ_CP007598|PILER-CR | 4565749-4565780 | 32 | NZ_MG266000 | Clostridioides difficile strain 7032985 plasmid pCD-ISS1, complete sequence | 5501-5532 | 9 | 0.719 |
NZ_CP007598_1 | 1.12|4565871|32|NZ_CP007598|PILER-CR | 4565871-4565902 | 32 | MK449011 | Streptococcus phage Javan92, complete genome | 36157-36188 | 9 | 0.719 |
NZ_CP007598_1 | 1.12|4565871|32|NZ_CP007598|PILER-CR | 4565871-4565902 | 32 | MK448835 | Streptococcus phage Javan93, complete genome | 36157-36188 | 9 | 0.719 |
NZ_CP007598_1 | 1.12|4565871|32|NZ_CP007598|PILER-CR | 4565871-4565902 | 32 | MK448836 | Streptococcus phage Javan95, complete genome | 37400-37431 | 9 | 0.719 |
NZ_CP007598_1 | 1.12|4565871|32|NZ_CP007598|PILER-CR | 4565871-4565902 | 32 | MK448825 | Streptococcus phage Javan639, complete genome | 37400-37431 | 9 | 0.719 |
NZ_CP007598_1 | 1.16|4566115|32|NZ_CP007598|PILER-CR | 4566115-4566146 | 32 | KY006853 | Erythrobacter phage vB_EliS_R6L, complete genome | 41418-41449 | 9 | 0.719 |
NZ_CP007598_2 | 2.5|4582720|33|NZ_CP007598|CRISPRCasFinder,CRT | 4582720-4582752 | 33 | NZ_CP031947 | Ruegeria sp. AD91A plasmid unnamed1, complete sequence | 143751-143783 | 9 | 0.727 |
NZ_CP007598_2 | 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT | 4582782-4582813 | 32 | NZ_CP048340 | Escherichia coli strain 142 plasmid p142_C, complete sequence | 2410-2441 | 9 | 0.719 |
NZ_CP007598_2 | 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT | 4582782-4582813 | 32 | NZ_LR130559 | Escherichia coli strain MS14385 isolate MS14385 plasmid 5 | 41882-41913 | 9 | 0.719 |
NZ_CP007598_2 | 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT | 4582782-4582813 | 32 | NZ_CP020518 | Escherichia coli strain 222 plasmid unnamed2, complete sequence | 13450-13481 | 9 | 0.719 |
NZ_CP007598_2 | 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT | 4582782-4582813 | 32 | NZ_CP020497 | Escherichia coli strain 103 plasmid unnamed2, complete sequence | 37140-37171 | 9 | 0.719 |
NZ_CP007598_2 | 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT | 4582782-4582813 | 32 | NZ_CP040921 | Escherichia coli strain FC853_EC plasmid p853EC2, complete sequence | 32060-32091 | 9 | 0.719 |
NZ_CP007598_2 | 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT | 4582782-4582813 | 32 | CP053252 | Escherichia coli strain SCU-204 plasmid pSCU-204-5, complete sequence | 19381-19412 | 9 | 0.719 |
NZ_CP007598_2 | 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT | 4582782-4582813 | 32 | NZ_CP042622 | Escherichia coli strain NCYU-26-73 plasmid pNCYU-26-73-7, complete sequence | 2614-2645 | 9 | 0.719 |
NZ_CP007598_2 | 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT | 4582782-4582813 | 32 | NZ_LT985302 | Escherichia coli strain ECOR 39 genome assembly, plasmid: RCS82_pI | 11943-11974 | 9 | 0.719 |
NZ_CP007598_2 | 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT | 4582782-4582813 | 32 | NZ_CP028194 | Escherichia coli strain CFSAN018748 plasmid pGMI14-004_3, complete sequence | 15383-15414 | 9 | 0.719 |
NZ_CP007598_2 | 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT | 4582782-4582813 | 32 | NZ_CP024865 | Escherichia coli strain AR_0015 plasmid unitig_3_pilon, complete sequence | 22646-22677 | 9 | 0.719 |
NZ_CP007598_2 | 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT | 4582782-4582813 | 32 | AP019710 | Escherichia coli O145:H28 122715 plasmid pO145_122715_2 DNA, complete genome | 4361-4392 | 9 | 0.719 |
NZ_CP007598_2 | 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT | 4582782-4582813 | 32 | NZ_CP024829 | Escherichia coli strain CREC-544 plasmid pCREC-544_3, complete sequence | 2221-2252 | 9 | 0.719 |
NZ_CP007598_2 | 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT | 4582782-4582813 | 32 | NZ_CP009861 | Escherichia coli strain ECONIH1 plasmid pECO-b75, complete sequence | 2868-2899 | 9 | 0.719 |
NZ_CP007598_2 | 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT | 4582782-4582813 | 32 | CP025877 | Escherichia coli strain 503458 plasmid p503458_49, complete sequence | 18343-18374 | 9 | 0.719 |
NZ_CP007598_2 | 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT | 4582782-4582813 | 32 | NZ_CP023368 | Escherichia coli strain 1428 plasmid p48, complete sequence | 4914-4945 | 9 | 0.719 |
NZ_CP007598_2 | 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT | 4582782-4582813 | 32 | NZ_CP032259 | Escherichia coli strain AR_0067 plasmid unnamed2, complete sequence | 23402-23433 | 9 | 0.719 |
NZ_CP007598_2 | 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT | 4582782-4582813 | 32 | NZ_CP037450 | Escherichia coli strain ATCC 25922 plasmid unnamed, complete sequence | 15851-15882 | 9 | 0.719 |
NZ_CP007598_2 | 2.8|4582904|32|NZ_CP007598|CRISPRCasFinder,CRT | 4582904-4582935 | 32 | CP006879 | Rhizobium gallicum bv. gallicum R602 plasmid pRgalR602b, complete sequence | 405613-405644 | 9 | 0.719 |
NZ_CP007598_2 | 2.4|4582659|32|NZ_CP007598|CRISPRCasFinder,CRT | 4582659-4582690 | 32 | NZ_LR134399 | Listeria monocytogenes strain NCTC7974 plasmid 2, complete sequence | 103231-103262 | 10 | 0.688 |
NZ_CP007598_2 | 2.8|4582904|32|NZ_CP007598|CRISPRCasFinder,CRT | 4582904-4582935 | 32 | NZ_CP049244 | Rhizobium pseudoryzae strain DSM 19479 plasmid unnamed3, complete sequence | 699963-699994 | 10 | 0.688 |
NZ_CP007598_2 | 2.12|4582720|35|NZ_CP007598|PILER-CR | 4582720-4582754 | 35 | NZ_CP031947 | Ruegeria sp. AD91A plasmid unnamed1, complete sequence | 143751-143785 | 11 | 0.686 |
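The spacer identifiers in the rows above appear to pack five pipe-delimited fields into one string (spacer index, start coordinate, length, contig accession, detection tools), and the location column is consistent with `start` to `start + length - 1`. A minimal Python sketch under that assumption follows; the names `SpacerId`, `parse_spacer_id` and `location` are illustrative and not part of the database.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SpacerId:
    index: str      # e.g. "2.6"; assumed to mean array number, then spacer number
    start: int      # 1-based start coordinate on the contig
    length: int     # spacer length in nucleotides
    contig: str     # contig accession, e.g. "NZ_CP007598"
    tools: tuple    # detection tools that reported the spacer


def parse_spacer_id(text):
    """Split an identifier such as '2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT'."""
    index, start, length, contig, tools = text.strip().split("|")
    return SpacerId(index, int(start), int(length), contig, tuple(tools.split(",")))


def location(spacer):
    """Rebuild the 'start-end' string, assuming an inclusive end coordinate."""
    end = spacer.start + spacer.length - 1
    return f"{spacer.start}-{end}"


sp = parse_spacer_id("2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT")
print(location(sp))  # 4582782-4582813
```

The inclusive-end convention is inferred from the rows themselves (for example, start 4582598 with length 34 is listed as 4582598-4582631); it is not documented in this page.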
1. spacer 2.5|4582720|33|NZ_CP007598|CRISPRCasFinder,CRT matches to NZ_CP032236 (Yersinia ruckeri strain NHV_3758 plasmid pYR4, complete sequence) position: , mismatch: 4, identity: 0.879
tcaggaacgcgcggcggaagagcttggtgtttg CRISPR spacer tcaggaacgcgcagcggaagagcttggtaaatg Protospacer ************.***************. **
2. spacer 2.5|4582720|33|NZ_CP007598|CRISPRCasFinder,CRT matches to NZ_LN681230 (Yersinia ruckeri strain CSF007-82 plasmid pYR3, complete sequence) position: , mismatch: 4, identity: 0.879
tcaggaacgcgcggcggaagagcttggtgtttg CRISPR spacer tcaggaacgcgcagcggaagagcttggtaaatg Protospacer ************.***************. **
3. spacer 2.12|4582720|35|NZ_CP007598|PILER-CR matches to NZ_CP032236 (Yersinia ruckeri strain NHV_3758 plasmid pYR4, complete sequence) position: , mismatch: 5, identity: 0.857
tcaggaacgcgcggcggaagagcttggtgtttgcg CRISPR spacer tcaggaacgcgcagcggaagagcttggtaaatgcc Protospacer ************.***************. ***
4. spacer 2.12|4582720|35|NZ_CP007598|PILER-CR matches to NZ_LN681230 (Yersinia ruckeri strain CSF007-82 plasmid pYR3, complete sequence) position: , mismatch: 5, identity: 0.857
tcaggaacgcgcggcggaagagcttggtgtttgcg CRISPR spacer tcaggaacgcgcagcggaagagcttggtaaatgcc Protospacer ************.***************. ***
5. spacer 2.13|4582782|34|NZ_CP007598|PILER-CR matches to NZ_CP044178 (Salmonella enterica subsp. enterica serovar Concord strain AR-0407 plasmid pAR-0407-1) position: , mismatch: 5, identity: 0.853
gctgcctttcccggagttccggcccct----aaattgg CRISPR spacer ggtgcctttcccggagttccggccccttctcaaa---- Protospacer * ************************* ***
6. spacer 2.13|4582782|34|NZ_CP007598|PILER-CR matches to CP053324 (Salmonella enterica subsp. salamae serovar 40:c:e,n,z15 strain 2013K-0524 plasmid unnamed, complete sequence) position: , mismatch: 5, identity: 0.853
gctgcctttcccggagttccggcccct----aaattgg CRISPR spacer ggtgcctttcccggagttccggccccttctcaaa---- Protospacer * ************************* ***
7. spacer 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT matches to NZ_CP044178 (Salmonella enterica subsp. enterica serovar Concord strain AR-0407 plasmid pAR-0407-1) position: , mismatch: 6, identity: 0.812
gctgcctttcccggagttccggcccctaaatt CRISPR spacer ggtgcctttcccggagttccggccccttctca Protospacer * ************************* .
8. spacer 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT matches to CP053324 (Salmonella enterica subsp. salamae serovar 40:c:e,n,z15 strain 2013K-0524 plasmid unnamed, complete sequence) position: , mismatch: 6, identity: 0.812
gctgcctttcccggagttccggcccctaaatt CRISPR spacer ggtgcctttcccggagttccggccccttctca Protospacer * ************************* .
9. spacer 2.13|4582782|34|NZ_CP007598|PILER-CR matches to NZ_LN890526 (Salmonella enterica subsp. enterica serovar Weltevreden strain 2511STDY5712385 plasmid 3, complete sequence) position: , mismatch: 6, identity: 0.824
gctgcctttcccggagttccggcccct----aaattgg CRISPR spacer ggtgccttttccggagttccggccccttctcaaa---- Protospacer * *******.***************** ***
10. spacer 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT matches to NZ_LN890526 (Salmonella enterica subsp. enterica serovar Weltevreden strain 2511STDY5712385 plasmid 3, complete sequence) position: , mismatch: 7, identity: 0.781
gctgcctttcccggagttccggcccctaaatt CRISPR spacer ggtgccttttccggagttccggccccttctca Protospacer * *******.***************** .
11. spacer 2.3|4582598|32|NZ_CP007598|CRISPRCasFinder,CRT matches to MN694003 (Marine virus AFVG_250M677, complete genome) position: , mismatch: 8, identity: 0.75
cgattctacggcaacaggccaggctgcgaccg CRISPR spacer ggcgagcacggcaacagcccaggctgcgatcg Protospacer * .********** ***********.**
12. spacer 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT matches to NZ_CP053022 (Sphingobium yanoikuyae strain YC-XJ2 plasmid p-A-Sy, complete sequence) position: , mismatch: 8, identity: 0.75
gctgcctttcccggagttccggcccctaaatt--- CRISPR spacer tatgcctttcccggctttccggccc---aactgac Protospacer ************ ********* **.*
13. spacer 2.10|4582598|34|NZ_CP007598|PILER-CR matches to MN694003 (Marine virus AFVG_250M677, complete genome) position: , mismatch: 8, identity: 0.765
cgattctacggcaacaggccaggctgcgaccgcg CRISPR spacer ggcgagcacggcaacagcccaggctgcgatcgcg Protospacer * .********** ***********.****
14. spacer 1.1|4565747|32|NZ_CP007598|CRISPRCasFinder,CRT matches to NZ_MG266000 (Clostridioides difficile strain 7032985 plasmid pCD-ISS1, complete sequence) position: , mismatch: 9, identity: 0.719
tatttataagcgtgtcatctatgcaacccaac CRISPR spacer aatttataatcatgtcatctatgccataattc Protospacer ******** *.************ *. *
15. spacer 1.3|4565869|32|NZ_CP007598|CRISPRCasFinder,CRT matches to MK449011 (Streptococcus phage Javan92, complete genome) position: , mismatch: 9, identity: 0.719
ggccgctggtcaaattcccaatctgagcaatc CRISPR spacer tacatcttgacaaattcccaatctgagcgact Protospacer .* ** * ******************.*..
16. spacer 1.3|4565869|32|NZ_CP007598|CRISPRCasFinder,CRT matches to MK448835 (Streptococcus phage Javan93, complete genome) position: , mismatch: 9, identity: 0.719
ggccgctggtcaaattcccaatctgagcaatc CRISPR spacer tacatcttgacaaattcccaatctgagcgact Protospacer .* ** * ******************.*..
17. spacer 1.3|4565869|32|NZ_CP007598|CRISPRCasFinder,CRT matches to MK448836 (Streptococcus phage Javan95, complete genome) position: , mismatch: 9, identity: 0.719
ggccgctggtcaaattcccaatctgagcaatc CRISPR spacer tacatcttgacaaattcccaatctgagcgact Protospacer .* ** * ******************.*..
18. spacer 1.3|4565869|32|NZ_CP007598|CRISPRCasFinder,CRT matches to MK448825 (Streptococcus phage Javan639, complete genome) position: , mismatch: 9, identity: 0.719
ggccgctggtcaaattcccaatctgagcaatc CRISPR spacer tacatcttgacaaattcccaatctgagcgact Protospacer .* ** * ******************.*..
19. spacer 1.7|4566113|32|NZ_CP007598|CRISPRCasFinder,CRT matches to KY006853 (Erythrobacter phage vB_EliS_R6L, complete genome) position: , mismatch: 9, identity: 0.719
gagaatgctcatgcgcgtgagcgccatatatt CRISPR spacer cgaaatgatcatgcgcgtcagcgccattgcgt Protospacer ..**** ********** ******** *
20. spacer 1.10|4565749|32|NZ_CP007598|PILER-CR matches to NZ_MG266000 (Clostridioides difficile strain 7032985 plasmid pCD-ISS1, complete sequence) position: , mismatch: 9, identity: 0.719
tatttataagcgtgtcatctatgcaacccaac CRISPR spacer aatttataatcatgtcatctatgccataattc Protospacer ******** *.************ *. *
21. spacer 1.12|4565871|32|NZ_CP007598|PILER-CR matches to MK449011 (Streptococcus phage Javan92, complete genome) position: , mismatch: 9, identity: 0.719
ggccgctggtcaaattcccaatctgagcaatc CRISPR spacer tacatcttgacaaattcccaatctgagcgact Protospacer .* ** * ******************.*..
22. spacer 1.12|4565871|32|NZ_CP007598|PILER-CR matches to MK448835 (Streptococcus phage Javan93, complete genome) position: , mismatch: 9, identity: 0.719
ggccgctggtcaaattcccaatctgagcaatc CRISPR spacer tacatcttgacaaattcccaatctgagcgact Protospacer .* ** * ******************.*..
23. spacer 1.12|4565871|32|NZ_CP007598|PILER-CR matches to MK448836 (Streptococcus phage Javan95, complete genome) position: , mismatch: 9, identity: 0.719
ggccgctggtcaaattcccaatctgagcaatc CRISPR spacer tacatcttgacaaattcccaatctgagcgact Protospacer .* ** * ******************.*..
24. spacer 1.12|4565871|32|NZ_CP007598|PILER-CR matches to MK448825 (Streptococcus phage Javan639, complete genome) position: , mismatch: 9, identity: 0.719
ggccgctggtcaaattcccaatctgagcaatc CRISPR spacer tacatcttgacaaattcccaatctgagcgact Protospacer .* ** * ******************.*..
25. spacer 1.16|4566115|32|NZ_CP007598|PILER-CR matches to KY006853 (Erythrobacter phage vB_EliS_R6L, complete genome) position: , mismatch: 9, identity: 0.719
gagaatgctcatgcgcgtgagcgccatatatt CRISPR spacer cgaaatgatcatgcgcgtcagcgccattgcgt Protospacer ..**** ********** ******** *
26. spacer 2.5|4582720|33|NZ_CP007598|CRISPRCasFinder,CRT matches to NZ_CP031947 (Ruegeria sp. AD91A plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.727
tcaggaacgcgcggcggaagagcttggtgtttg CRISPR spacer ctttgcccgtgcggcggaagaccttggtgtttc Protospacer .. * **.*********** **********
27. spacer 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT matches to NZ_CP048340 (Escherichia coli strain 142 plasmid p142_C, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
28. spacer 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT matches to NZ_LR130559 (Escherichia coli strain MS14385 isolate MS14385 plasmid 5) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
29. spacer 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT matches to NZ_CP020518 (Escherichia coli strain 222 plasmid unnamed2, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
30. spacer 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT matches to NZ_CP020497 (Escherichia coli strain 103 plasmid unnamed2, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
31. spacer 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT matches to NZ_CP040921 (Escherichia coli strain FC853_EC plasmid p853EC2, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
32. spacer 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT matches to CP053252 (Escherichia coli strain SCU-204 plasmid pSCU-204-5, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
33. spacer 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT matches to NZ_CP042622 (Escherichia coli strain NCYU-26-73 plasmid pNCYU-26-73-7, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
34. spacer 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT matches to NZ_LT985302 (Escherichia coli strain ECOR 39 genome assembly, plasmid: RCS82_pI) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
35. spacer 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT matches to NZ_CP028194 (Escherichia coli strain CFSAN018748 plasmid pGMI14-004_3, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
36. spacer 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT matches to NZ_CP024865 (Escherichia coli strain AR_0015 plasmid unitig_3_pilon, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
37. spacer 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT matches to AP019710 (Escherichia coli O145:H28 122715 plasmid pO145_122715_2 DNA, complete genome) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
38. spacer 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT matches to NZ_CP024829 (Escherichia coli strain CREC-544 plasmid pCREC-544_3, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
39. spacer 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT matches to NZ_CP009861 (Escherichia coli strain ECONIH1 plasmid pECO-b75, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
40. spacer 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT matches to CP025877 (Escherichia coli strain 503458 plasmid p503458_49, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
41. spacer 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT matches to NZ_CP023368 (Escherichia coli strain 1428 plasmid p48, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
42. spacer 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT matches to NZ_CP032259 (Escherichia coli strain AR_0067 plasmid unnamed2, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
43. spacer 2.6|4582782|32|NZ_CP007598|CRISPRCasFinder,CRT matches to NZ_CP037450 (Escherichia coli strain ATCC 25922 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
44. spacer 2.8|4582904|32|NZ_CP007598|CRISPRCasFinder,CRT matches to CP006879 (Rhizobium gallicum bv. gallicum R602 plasmid pRgalR602b, complete sequence) position: , mismatch: 9, identity: 0.719
gaatctggaggccaacagcgcggcgaaatcct CRISPR spacer gaatctggagggcgacagcgcggtcgaccctg Protospacer *********** *.*********. .* .*.
45. spacer 2.4|4582659|32|NZ_CP007598|CRISPRCasFinder,CRT matches to NZ_LR134399 (Listeria monocytogenes strain NCTC7974 plasmid 2, complete sequence) position: , mismatch: 10, identity: 0.688
atcaaacatggaaacccctttaatgagagcaa CRISPR spacer ctaaaacatggaaaccactgtaatgacgaatc Protospacer * ************* ** ****** ..
46. spacer 2.8|4582904|32|NZ_CP007598|CRISPRCasFinder,CRT matches to NZ_CP049244 (Rhizobium pseudoryzae strain DSM 19479 plasmid unnamed3, complete sequence) position: , mismatch: 10, identity: 0.688
gaatctggaggccaacagcgcggcgaaatcct CRISPR spacer gtggtcataggccatcagcgcggcgatatccc Protospacer * . ... ****** *********** ****.
47. spacer 2.12|4582720|35|NZ_CP007598|PILER-CR matches to NZ_CP031947 (Ruegeria sp. AD91A plasmid unnamed1, complete sequence) position: , mismatch: 11, identity: 0.686
tcaggaacgcgcggcggaagagcttggtgtttgcg CRISPR spacer ctttgcccgtgcggcggaagaccttggtgtttctc Protospacer .. * **.*********** ********** .
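The mismatch and identity values in the alignments above are consistent with a simple rule: count the alignment columns where spacer and protospacer carry the same base, subtract that count from the spacer length to get the mismatches, and divide it by the spacer length to get the identity. The Python sketch below assumes that rule; it is inferred from the listed numbers rather than taken from the database's documentation, and `alignment_stats` is a hypothetical helper name.

```python
def alignment_stats(spacer_row, protospacer_row):
    """Return (mismatches, identity) for two equal-length alignment rows.

    Gaps are written as '-'. A column counts as a match only when both rows
    hold the same base; gap columns never count as matches.
    """
    if len(spacer_row) != len(protospacer_row):
        raise ValueError("alignment rows must have equal length")
    spacer_len = sum(1 for base in spacer_row if base != "-")
    matches = sum(
        1
        for s, p in zip(spacer_row, protospacer_row)
        if s == p and s != "-"
    )
    return spacer_len - matches, round(matches / spacer_len, 3)


# Ungapped example: spacer 2.5 against the Yersinia ruckeri plasmid pYR4.
print(alignment_stats("tcaggaacgcgcggcggaagagcttggtgtttg",
                      "tcaggaacgcgcagcggaagagcttggtaaatg"))  # (4, 0.879)

# Gapped example: spacer 2.13 against the serovar Concord plasmid.
print(alignment_stats("gctgcctttcccggagttccggcccct----aaattgg",
                      "ggtgcctttcccggagttccggccccttctcaaa----"))  # (5, 0.853)
```

Under this rule, spacer bases that face a gap count as mismatches, which is why the gapped alignment of the 34-nt spacer reports 5 mismatches.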
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
2546103 : 2553416
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP007598|2546103:2553416|DBSCAN-SWA TATGCGTGCTAAGGGAAAGAAATTTAAAAAGCGTTATCTGGTCATTATTTTAATTCTTTTAGTGGGGGGGATGGCTGGCTGGCGAATGATAAATGCCCCGCTGCCAACTTATCAGACATTAATCGTGCGGCCAGGCGATCTTGAACAGAGTGTACTGGCGACTGGAAAACTGGACGCGTTGCGTAAAGTGGATGTCGGCGCGCAGGTGAGCGGCCAGTTGAAAACGCTGCTGGTCTCCATTGGCGATAACGTTAAAAAAGATCAGCTACTCGGCGTGATTGACCCAGATCAGGCGGAGAACCAGATAAAAGAGGTCGAGGCCACCTTGATGGAGCTGAACGCGGAGCGTCAGCAGGCAGCCGCTGAGTTAAAGCTGGCGCGGGTTACGCTGGCGCGCCAGCAGCAGTTAGCTAAGACTCAGGCGGTATCGCAACAGGATCTGGATACCGCGGCGACGGAGATGGCGGTTAAACAGGCGCGTATTGGCACCATAGATGCCCAGATCAAACGTAATCGGGCCTCGTTGGACACCGCGAAAACCAACCTGGAATATACCCGTATTGTCGCCCCCATGGCGGGGGAAGTGACGCAAATCACTACCCTGCAAGGACAAACGGTGATTGCAGCTCAGCAGGCGCCCAATATTCTGACGCTGGCGGATATGAGCACCATGCTGGTAAAAGCGCAGGTCTCGGAAGCGGACGTGATCCATCTTCGGGCGGGGCAGAAAGCATGGTTCACCATTGCAGGCGATCCGCAAACGCGCTATGAAGGCGTTTTAAAAGATATTCTGCCGACGCCGGAAAAGATCAACGACGCTATTTTTTATTACGCCCGGTTTGAAGTGCCGAATCCCAAAAGAATCTTGCGTCTTGATATGACCGCACAGGTTTATATTCAACTCATGGATGTCAAAAATGTGCTGATTATTCCTCTCGCCGCGCTTGGCGAACCGGTGGGCGGCAATCGTTATAAAGTGGCGCTGTTGCGTAACGGCGAAAAACGTGAGCGCGAAGTGGTCATTGGCGAGCGTAACGATACAGACGTGGAAGTGGTTAAAGGTCTGGAAGCGGGCGATGAGGTGATCATCGGCGAAAGCAGGCCAGGAGCGACGCCATGACGGCATTGCTTGAACTGTGCAATGTGAGTCGTAGCTACCCCTCCGGAGAAGAGCAGGTGGCGGTGTTGAAAGATATCTCCCTGCAAATCCACGCCGGGGAGATGGTGGCGATCGTCGGCGTTTCCGGTTCTGGAAAATCAACGCTGATGAATATCCTCGGGTGCCTGGATAAACCGACCAGCGGCACTTATCGGGTGGCGGGGCGGGACGTCTCGACGCTGGACCCGGACGCGCTGGCGCAGCTGCGGCGTGAGCATTTTGGCTTTATCTTTCAGCGCTACCATCTGTTGTCGCATTTAACGGCAGCGCAAAATGTTGAAATCCCCGCCGTCTACGCCGGCATTGAACGCAAAAAACGCCAGGCGCGCGCCAGAGAGTTGCTTTTGCGGCTGGGATTAAGCGATCGCGTCGATTACCCACCTTCACAGCTTTCTGGCGGACAGCAGCAGCGTGTCAGTATTGCCCGCGCTCTGATGAACGGTGGACAGGTGATTCTGGCAGATGAGCCGACCGGCGCGCTGGATAGCCATTCCGGCGAAGAGGTGATGGCGATTTTGCGCCAACTGCGCGATCGCGGACATACGGTGATCATTGTGACGCACGATCCGCTGATTGCCGCCCAGGCGGAGCGGATTATTGAAATTCACGATGGCAAGATTGTCCATAATCCGCCCGCGGAGGAAAAGAAACGCGAACAGGGCGTTGACGCTGCCGTAGTTAATACGGCTCCCGGCTGGCGGCAATTTGCCAGCAGCTTTCGCGAAGCGCTGTCAATGGCGTGGTTAGCGATGGCCGCTAACAAA
ATGCGTACTTTACTGACCATGCTGGGAATTATTATCGGTATTGCGTCGGTGGTGTCGATTGTGGTGGTCGGCGACGCCGCAAAACAGATGGTACTGGCGGATATCCGCGCTATGGGCACTAACACGATTGATATTCATCCAGGCAAAGATTTTGGCGACGACAATCCGCAGTATCGACAGGCGCTGAAATATGACGATCTGGTCGCTATTCAGAAACAGCCGTGGGTTAACTCTGCGACGCCCAGCGTTTCAAAGAGCTTACGTCTTCGCTATGGCAATATTGATATTGCCGTAAATGCTAATGGCGTCAGTGGCGATTATTTTAACGTTTACGGCATGTCCTTTAGGGAGGGGAACACCTTCAATTCTGTACAGCAACAGGATCGCGCGCAGGTGGTGGTGCTGGATGCCAACACGCGACGCCAGCTATTTCCAAATAAAGCGAATGTCGTAGGGGAAGTGGTGCTGGTGGGTAATATGCCGGTTATTGTTATTGGCGTGGCGGAAGAGAAACCGTCCATGTACGGCAATAGCAATCTGTTGCAAGTTTGGTTGCCCTATAGCACGATGTCAGATCGCATAATGGGTCAGTCATGGCTTAACTCGATCACCGTTCGTGTGAAAGATGGCGTTGATAGCGATCAGGCTGAACAGCAGCTTACCCGCCTGCTCACCTTACGCCACGGTAAAAAAGACTTCTTCACCTGGAATATGGACAGCGTTCTGAAAACGGCTGAAAAAACCACCTATACTCTTCAGTTATTTCTGACGCTGGTGGCCGTCATTTCGCTGGTTGTCGGCGGCATTGGCGTTATGAATATTATGCTGGTTTCCGTCACCGAGCGAACGCGTGAAATCGGCATCCGTATGGCGGTAGGCGCGCGCGCCAGCGATGTGCTACAGCAGTTTCTTATTGAAGCGGTGCTGGTTTGCCTGGTTGGGGGAGCGCTGGGGATTAGCTTGTCGATGTTCATCGCATTTATGCTACAGCTTTTCCTGCCCGGCTGGGAGATCGGTTTTTCACTGACTGCGCTGGCGAGCGCGTTTTTATGTTCGACGTTTACCGGGATACTGTTTGGCTGGCTACCGGCGAGAAACGCGGCGCGACTGGACCCGGTGGATGCGCTGGCAAGGGAGTAATCTCGCTGACGTCATTTGTCGGCCTGATAAGATGCAACAGTGTCGCTATCAGGCACTGGCGTTAAATAAAAATGCCAGCCGATCGGGCTGGCATTTTGCCTCCTGGATGTACACAATGAGACAGAGGAGCTATGCAACGGCCTCTGCTTCGATGGGCACGATGACGCTGGCGTGATTGCCTTTTGGCCCCTGGTGGACATCAAACCGGACAGACTGTCCGGCTTTAAGCGTTCTGTAACCATCCATTTGAATGGTGGAATAATGGGCGAAAATATCCTCGCCGCCGCCTTCAGGGCAGATGAAACCAAACCCTTTGGCATTGTTGAACCACTTTACAGTACCCGTTTCCATGCTTCGACATCCTTCGTAAATCTTATATAAGTAAGATGGAATGAACCGGTGACGGAGTGGGGGCTGTTCAAAACCTCACCAACTCTCGACATTACAATTTAGAGAAATCAGGCGAGGCGTCAAGCATCAGGCAGGGGGGATCGGGTAAAAATGAATCAAAAATTTGAAGCAGTTAACGCTATTGCCGGGAATGTGACAGATGTCGCGGATGGTACTGATAGATGTTAGTTATCTATCAATTGAGGTAGATTGATTGTGTGCATAGACTCTGGTCAGCGGCAGATTTTCCTGCCGACAACCGTAACCGATAATGACGACTGACAATGGGTAAGACGAACGATTGGCTGGATTTTGACCAGTTGGTGGAAGATAGCGTGCGCGACGCGCTAAAACCGCCATCTATGTATAAAGTGATATTAGTCAATGATGATTACACTCCGATGGAGTTTGTTATTGACGTGTTACAAAAATTCTTTTCTTATGATGTAGAACGTGCAACGCAATTGATG
CTTGCAGTTCACTATCAAGGCAAAGCCATCTGCGGCGTGTTCACCGCCGAGGTGGCGGAAACCAAAGTGGCGATGGTGAACAAGTATGCAAGGGAGAACGAGCATCCGTTGCTGTGTACGCTGGAAAAAGCCTGAATGCAGGTATAAAAATTGGGGGAGGTGCCTATGCTCAATCAAGAACTGGAACTCAGTTTAAACATGGCTTTCGCCAGAGCGCGCGAGCACCGTCATGAGTTTATGACCGTCGAGCATCTGTTGCTGGCGCTGCTCAGCAACCCATCGGCTCGCGAAGCGCTGGAAGCATGCTCCGTGGATCTGGTGGCGCTCCGTCAGGAACTCGAAGCCTTCATTGAACAAACCACACCCGTACTGCCTGCCAGTGAAGAAGAGCGTGATACGCAGCCGACGTTAAGTTTCCAGCGTGTCCTGCAGCGTGCCGTCTTCCATGTTCAGTCTTCCGGGCGTAGTGAAGTGACTGGCGCGAATGTGCTGGTGGCTATCTTTAGCGAACAGGAATCACAGGCGGCTTATCTGCTGCGCAAGCATGAAGTGAGCCGTCTGGATATCGTGAACTTTATTTCTCACGGGACGCGAAAAGACGAACCGAGCCAATCTTCCGATCTCGGCAATCAGCCAACTGGCGACGAACAAGCTGGCGGGGAGGAACGTATGGAAAACTTCACGACGAATCTTAACCAACTTGCTCGCGTGGGCGGCATCGATCCGCTGATTGGTCGTGAAAAAGAACTTGAACGCGCGATCCTGGTCTTGTGTCGTCGCCGTAAAAATAACCCGTTGCTGGTAGGGGAATCCGGCGTCGGCAAAACGGCGATTGCCGAAGGGCTGGCCTGGCGTATCGTGCAGGGCGATGTGCCGGAAGTGATGGCCGATTGCACCATTTACTCTCTGGATATCGGTTCGCTGCTGGCGGGCACCAAATACCGCGGCGATTTTGAAAAACGGTTTAAGGCGTTGCTGAAACAGCTTGAGCAGGATACCAACAGCATCCTGTTTATCGATGAAATCCATACCATTATCGGCGCTGGCGCGGCGTCGGGCGGACAGGTGGATGCGGCAAATCTGATTAAACCGCTGCTTTCCAGCGGCAAGATCCGGGTGATCGGCTCAACGACCTATCAGGAATTCAGCAATATTTTTGAGAAAGACCGTGCATTAGCGCGCCGTTTCCAGAAAATTGATATTACCGAGCCTTCGGTGGAAGAGACGGTGCAAATTATCAACGGCTTGAAACCTAAGTACGAAGCGCACCACGACGTGCGTTATACCGCGAAAGCGGTGCGTGCGGCGGTCGAGTTGGCGGTAAAATATATCAATGACCGCCATCTGCCGGATAAAGCCATTGACGTGATTGACGAAGCGGGCGCTCGGGCGCGTCTGATGCCGGTGAGCAAACGTAAGAAAACGGTCAACGTGGCGGATATTGAGTCCGTAGTGGCGCGAATTGCGCGAATTCCTGAAAAGAGCGTCTCGCAGAGCGATCGCGATACGCTGAAGAACCTGGGCGATCGTCTGAAAATGCTGGTCTTCGGCCAGGATAACGCGATTGAGGCGCTGACCGAAGCTATTAAGATGAGTCGTGCCGGTCTGGGCCATGAGCATAAACCTGTCGGCTCATTCTTGTTCGCCGGGCCAACTGGCGTAGGGAAAACTGAAGTTACGGTACAGCTTTCAAAAGCGCTGGGTATTGAGCTGTTGCGCTTCGATATGTCCGAATATATGGAGCGTCATACGGTGAGCCGTTTGATCGGCGCGCCTCCGGGATACGTCGGTTTCGACCAGGGCGGGCTGCTGACGGATGCGGTGATTAAGCATCCTCATGCGGTGCTGTTGCTGGATGAGATCGAAAAAGCGCACCCGGATGTCTTTAACCTGCTGCTGCAGGTGATGGATAACGGTACGCTGACCGATAACAATGGCCGTAAGGCGGATTTCCGCAACGTGGTGCTGGTGATGACCACCAACGCCGGCGTGCGAGAAAC
CGAACGTAAATCTATTGGTCTTATTCATCAGGACAACAGTACCGATGCGATGGGCGAGATCAAGAAAGTGTTTACGCCGGAGTTCCGTAACCGTCTCGACAACATTATTTGGTTCGATCATCTGTCTGGCGAGGTGATTCATCAGGTTGTCGATAAGTTTATCGTCGAGTTGCAGGCTCAGTTGGATCAGAAAGGCGTCTCTCTGGAAGTCAGTCAGGAAGCGCGCGACTGGCTGGCGGAAAAGGGCTATGACCGGGCGATGGGCGCACGACCGATGGCGCGTGTGATTCAGGATAACCTGAAAAAACCGCTGGCCAATGAGTTGCTGTTTGGATCGCTGGTTGATGGCGGACAGGTCACCGTCGCGCTGGATAAAGAGAAAAATGCGTTGACGTATGGCTTCCAGAGTGCGCAAAAGCACAAGCCGGAAGCCGCGCATTAATCTTCGTTTCACTGCCGTACAAACCGGGCCTTAGCGCCCGGTTTTTTTACGCCGGCTAATAGTTAGCCTGATGGCGTTGTGCTCGCGGTAGGCGGACAAGGCGCTTGTAAATTGCGTCATCATCCGGTAATCGACGGGTCAAATGCGAAAAAAAAGCCCGACCCGCTGCGTGAAAAGGGATGCTGAAGCCGCTGTTACCAATGCCAAAAAAACTAATCCTCTTCTCTGAAAAGAGTAGTCTTACACTTAGCACAAAGAATTCTTCTGTGATTATCCCATACCTGTATCAAACCAACATCGCTGAATTTCTCGTGACCACAACTGTAGCAGGATATTTTGCTCATATTGTGTTTTTTAATATATTCATCTTTGGTGGGAAGTTGCTTCCAGTAGCCTATCAGTCCGGGCATAACCTCTCCTTGAGTGGGTTATGTAATTTGCAAATTATGATAATAATTATATAAGAAATAGTTCAGTATGAAATTATTACTTATCACATTTTCGAAAGCGAATGCTATTAGCATCACACTTTCTACGTTGTTATCAACGATAATGCTTTCGTACAGGTAGCTTAGCTGTTGGCAGTGCTATACCCCGTCCAATCATCCATCAACGCCACCCGTTTAGGCCATAACGTCCCACGCTGATACGCTGCTTCAGCCTTATCTGCCAACTGGTGCGCCAACGCATGTTCAATAACCTCACGTTGATAATCCGTTGCTTCACCAGCCCACTCACGGAAAGTAGAACGGAAGCCATGTTGCGTTAAGTCGATATATCCCATTCGTTTCAATACAGCCAATAACGACATATCAGAAAGTGTTTCAGCGCGAGGGGCAGGGAATACATGATTGTTATCTTTTAATCGTGGTAAATCTTTTAACAAATCAACAGCAGCATCAGACAGAGGAACTCGGTGCTTTTTTACTACCGAGAGATACTTATGCAA
Protein sequences of DBSCAN-SWA_1 >NZ_CP007598|2546103:2553416|2550190_2552467_+|WP_000934059.1|protease|DBSCAN-SWA MLNQELELSLNMAFARAREHRHEFMTVEHLLLALLSNPSAREALEACSVDLVALRQELEAFIEQTTPVLPASEEERDTQPTLSFQRVLQRAVFHVQSSGRSEVTGANVLVAIFSEQESQAAYLLRKHEVSRLDIVNFISHGTRKDEPSQSSDLGNQPTGDEQAGGEERMENFTTNLNQLARVGGIDPLIGREKELERAILVLCRRRKNNPLLVGESGVGKTAIAEGLAWRIVQGDVPEVMADCTIYSLDIGSLLAGTKYRGDFEKRFKALLKQLEQDTNSILFIDEIHTIIGAGAASGGQVDAANLIKPLLSSGKIRVIGSTTYQEFSNIFEKDRALARRFQKIDITEPSVEETVQIINGLKPKYEAHHDVRYTAKAVRAAVELAVKYINDRHLPDKAIDVIDEAGARARLMPVSKRKKTVNVADIESVVARIARIPEKSVSQSDRDTLKNLGDRLKMLVFGQDNAIEALTEAIKMSRAGLGHEHKPVGSFLFAGPTGVGKTEVTVQLSKALGIELLRFDMSEYMERHTVSRLIGAPPGYVGFDQGGLLTDAVIKHPHAVLLLDEIEKAHPDVFNLLLQVMDNGTLTDNNGRKADFRNVVLVMTTNAGVRETERKSIGLIHQDNSTDAMGEIKKVFTPEFRNRLDNIIWFDHLSGEVIHQVVDKFIVELQAQLDQKGVSLEVSQEARDWLAEKGYDRAMGARPMARVIQDNLKKPLANELLFGSLVDGGQVTVALDKEKNALTYGFQSAQKHKPEAAH >NZ_CP007598|2546103:2553416|2546103_2547222_+|WP_001201751.1|DBSCAN-SWA MRAKGKKFKKRYLVIILILLVGGMAGWRMINAPLPTYQTLIVRPGDLEQSVLATGKLDALRKVDVGAQVSGQLKTLLVSIGDNVKKDQLLGVIDPDQAENQIKEVEATLMELNAERQQAAAELKLARVTLARQQQLAKTQAVSQQDLDTAATEMAVKQARIGTIDAQIKRNRASLDTAKTNLEYTRIVAPMAGEVTQITTLQGQTVIAAQQAPNILTLADMSTMLVKAQVSEADVIHLRAGQKAWFTIAGDPQTRYEGVLKDILPTPEKINDAIFYYARFEVPNPKRILRLDMTAQVYIQLMDVKNVLIIPLAALGEPVGGNRYKVALLRNGEKREREVVIGERNDTDVEVVKGLEAGDEVIIGESRPGATP >NZ_CP007598|2546103:2553416|2552679_2552877_-|WP_001117984.1|DBSCAN-SWA MPGLIGYWKQLPTKDEYIKKHNMSKISCYSCGHEKFSDVGLIQVWDNHRRILCAKCKTTLFREED >NZ_CP007598|2546103:2553416|2549839_2550160_+|WP_000520789.1|protease|DBSCAN-SWA MGKTNDWLDFDQLVEDSVRDALKPPSMYKVILVNDDYTPMEFVIDVLQKFFSYDVERATQLMLAVHYQGKAICGVFTAEVAETKVAMVNKYARENEHPLLCTLEKA >NZ_CP007598|2546103:2553416|2553038_2553416_-|WP_001539594.1|integrase|DBSCAN-SWA MHKYLSVVKKHRVPLSDAAVDLLKDLPRLKDNNHVFPAPRAETLSDMSLLAVLKRMGYIDLTQHGFRSTFREWAGEATDYQREVIEHALAHQLADKAEAAYQRGTLWPKRVALMDDWTGYSTANS >NZ_CP007598|2546103:2553416|2547218_2549165_+|WP_000125875.1|DBSCAN-SWA 
MTALLELCNVSRSYPSGEEQVAVLKDISLQIHAGEMVAIVGVSGSGKSTLMNILGCLDKPTSGTYRVAGRDVSTLDPDALAQLRREHFGFIFQRYHLLSHLTAAQNVEIPAVYAGIERKKRQARARELLLRLGLSDRVDYPPSQLSGGQQQRVSIARALMNGGQVILADEPTGALDSHSGEEVMAILRQLRDRGHTVIIVTHDPLIAAQAERIIEIHDGKIVHNPPAEEKKREQGVDAAVVNTAPGWRQFASSFREALSMAWLAMAANKMRTLLTMLGIIIGIASVVSIVVVGDAAKQMVLADIRAMGTNTIDIHPGKDFGDDNPQYRQALKYDDLVAIQKQPWVNSATPSVSKSLRLRYGNIDIAVNANGVSGDYFNVYGMSFREGNTFNSVQQQDRAQVVVLDANTRRQLFPNKANVVGEVVLVGNMPVIVIGVAEEKPSMYGNSNLLQVWLPYSTMSDRIMGQSWLNSITVRVKDGVDSDQAEQQLTRLLTLRHGKKDFFTWNMDSVLKTAEKTTYTLQLFLTLVAVISLVVGGIGVMNIMLVSVTERTREIGIRMAVGARASDVLQQFLIEAVLVCLVGGALGISLSMFIAFMLQLFLPGWEIGFSLTALASAFLCSTFTGILFGWLPARNAARLDPVDALARE >NZ_CP007598|2546103:2553416|2549294_2549516_-|WP_000447499.1|DBSCAN-SWA METGTVKWFNNAKGFGFICPEGGGEDIFAHYSTIQMDGYRTLKAGQSVRFDVHQGPKGNHASVIVPIEAEAVA |
7 | Dickeya_phage(16.67%) | integrase,protease | attL 2534841:2534855|attR 2553634:2553648 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
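The protein headers in this record appear to follow a fixed pipe-separated layout: contig, prophage region, gene coordinates with strand, protein accession, an optional phage keyword, and the tool name. A minimal sketch of reading that layout is shown below. The field order is inferred from the headers in this record rather than from any tool documentation, and `ProteinHeader` and `parse_header` are hypothetical names introduced only for illustration.

```python
from typing import NamedTuple, Optional


class ProteinHeader(NamedTuple):
    """Fields of one protein header line (layout inferred from this record)."""
    contig: str
    region_start: int
    region_end: int
    gene_start: int
    gene_end: int
    strand: str
    accession: str
    keyword: Optional[str]  # e.g. 'protease', 'integrase', 'tail'; None if absent


def parse_header(header: str) -> ProteinHeader:
    """Split a header such as
    '>NZ_CP007598|2546103:2553416|2550190_2552467_+|WP_000934059.1|protease|DBSCAN-SWA'
    into its parts. Assumes the keyword field is present only when the
    header has six pipe-separated fields."""
    fields = header.strip().lstrip(">").split("|")
    contig, region, gene, accession = fields[:4]
    keyword = fields[4] if len(fields) == 6 else None
    region_start, region_end = (int(x) for x in region.split(":"))
    gene_start, gene_end, strand = gene.split("_")
    return ProteinHeader(contig, region_start, region_end,
                         int(gene_start), int(gene_end), strand,
                         accession, keyword)
```

A protein whose `keyword` is not `None` corresponds to a colored (keyword-matched) entry. Keywords outside the quoted list, such as 'holin' and 'lysis', also occur in this record, so the sketch reports the field as found rather than checking it against a fixed list.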
DBSCAN-SWA_2 |
2625126 : 2635920
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP007598|2625126:2635920|DBSCAN-SWA TATGTACGGTGTACAGGGAACGCCTGACTGTTACCGGATTGAACTGAAAAATGTTTATGGTGTACAGGAGAATCTGATCTCATACCGACAGGCATCGCTGGGGGCATGGGTAGCGATTGCTGGTGGCGGCGATCCTTATGAAGTGGCTTACGCTATCTATAAAGCCGTGCCAGATATCTCCGTACTGACGAATGATGTAGTGAATCCATCAGGCGCTGCGGTGGATAAAAAAACGATACCGATCATTGTGTATCCGGATACGTATCACGTGCCGTTTGTAGTGCCATCATCACAAAACGTTACGCTTTTAATCACCTGGAATACAGCCTCAACCAGCTATATCGATCCAACCGGGATTGAAAAAGCAGTGCAGCAAAGCATTGCTGATTACATTAACGGAATTGCAACGGGTGAACCAATAAACATTTTCCTGATTCGGGATATTTTTCTTAATCAGGTTAAGGGGCTTGTATCTTCAAACCTTGTATCAATGATTGATATTCAGGTTGGAATAAACGGAAAAATTGTCCCACCTGCAACCGACTCCAGCCTGGTTTATGGTGATACTTACGCCTATTTTTCCACTTCATCTTCACAAATTCAGGTTAAGCAATATGGCAGCTCTTCTTGAAAGCATTATTCCGGCCTACCCCTATACGCAATATAATGACGATCCGGATATAGTTGCCTTTTTTGATGCTTATAACAAACTGGCACAGGGGTATCTTGATTACTTTAACAACCTGAATTTACCTTGCTGGACCTCCCCGGCGATTACCGGTGAGTTGCTGGACTGGATTGCGGCGGGTATTTATGGGGAATCACGCCCCTTGCTTCAAATCTCCGAGGATGCCATTGCTCGTGGGGCGTATAACACTATTGAGTACAATAATGTCGCGTATGCAAAACTGAGAAATTATGTTCCCGGCTCAGCGTCATATGTTCCGGACGACTATTTTAAACGGATACTGACATGGAATTTTTATAAAGGCGATGGTTCGCACTTCTGTATCAACTGGTTCAAACGACGGCTTGCACGCTTTATACATGGAGCTAACGGAATAGACCCACCTGTACAGTCCACTTTTGATATTAGTGTAATGCCCGATAAGGGCATTTTTTTTGTCTCCATTCCTGACTATGGCGATGGTGTCGGACACTTTCTTAAAGATGCAATTGACCAGTCGCTGGTGAAACTCCCTTTTATTTATACCTATTCGGTAACGGTGGTTGAGCAATGATTATTGGATTCGGAAATAATGTCGTCTCCTCACTGGCGGCTGATATTACCGCCAGCCAGACGACCATTCAGGTGATGCCTGGTGTGGGAGCGATGTTTGCTAATTTGCTGACCAGCGATTATGCAAACAGCTCAAACCCTCTTAAAACTTACGCCAAAATTACACTGACAGACGCAAAAGAAACAGTTTTTGAGGTATGCCATCTGACAGCAGTTAATAATGACATGCTGACGGTTATTCGCGGTCAGGAAGGTACAACAGCGAAGGGATGGTCACTGAATGACGTTATAGCGAATTTTGCGACGCGAGGATCTGAAAATCAGTTTGTACAAATTGAAGAGCTCCAGAGTGGGCATTATGTCGCTGGTGTGGCCGGAGGTACAGAAAATAATCTGACGCTGGAGTTACCAGCAACTTATTTCGTCAATGGTGGAGTTGACTGGACATTGCGCACTCCACTTGTGGTTATTCCGGCGCTAAACAATACCGGAGCCAGCACTCTGCAACTGACGATGGGAGGACGTGTGCTTGGCATATTCCCACTATACAAGGGGAATAAAGCAGAGTTATCGGCCAATGATATTATTAAAGATATTCCTGTCTTATGCGTTCTGGATAATACAAAAACCTATTTTTCTGTGCTTAATCCCCTGGAGATTTATTTGGGATCACGGTATTTGCA
GAAGGACCAGAACCTGTCCGACGTACCGGATAAGGCCAAAGGTCGCTCCAGTCTTGAGGTCTACAGCAAAACCGAAAGTGATGAAAACTACATGGCTAAAAGCCAGTGTGGTGCGGATATCCCGAATAAGCCGCTGTTTGTACAAAATATCGGAGCGCTCCCTGCATCAGGTACGGCTGTTGCAGCGAACAGACTGGCATCACGCGGCGCGCTTCCGGCACTGACTGGTACGACAAGAGGCAGCGATAGTGGCCTGATAATGGGCGAGGTTTACAACAATGGCTATCCGACGCAATACGGAAATATTTTACGTCTGACCGGAACCGGTGATGGGGAAATCCTCATTGGCTGGAGCGGGACAAATGGTGCGCCAGCGCCCGCATATATTCGCAGCCATCGAGATACCGCCGATGCTGAGTGGTCCGAATGGGCAATGCTTTACACCACACTAAACCCACCTCCGGATTCGCATCCAGTAGGGGCGGCGATTGCATGGCCATCTGATGCTACTCCGGCAGGTTACGCTCTGATGCAGGGGCAGTCCTTCGATAAATCTGCTTACCCGTTACTGGCTATAGCGTATCCGTCCGGCGTTATCCCTGACATGAGAGGCTGGACAATAAAGGGTAAGCCCATCAGTGGACGTGCCGTATTGTCGCAAGAAATGGACGGCAATAAATCGCACTCGCACACCGCGCGGGCGCAGGTTACTGACTTAGGGACAAAATCTACCTCATCCTTTGATTACGGCACGAAATCGACCAATACCACGGGCAATCATACTCACCAGTTCGGCGGTTATATCAATTCATACTGGGGAGATTCCAATCACACCTCATTTCAGCCAGGAGGTGGTGCATGGACACAGGCCGCTGGCGACCATGCACATACAGTTTATATCGGAGGACATGAGCACACCATGTATATAGGTCCACACGGACACGTCGTTATTGTGGACGCAGACGGTAATGCGGAAACCACGGTTAAAAACATTGCATTTAACTACATAGTGAGGCTGGCATAATGACTTTTAAAATGAGCGAACAGGCGCAGACAATTAAAATTTTCAATCTGCGTTCAGATACTAACGAATTTATTGGTGCAGGTGATGCGTATATTCCGCCGCACACAGGACTACCGGCAAACTGTACTGATATCGCCCCTCCTGATATTCCCGCCAGTCATATTGCTATATTTGACGCTGAAACCCAGACATGGAGTTTGCATGAGGATCACCGCGGCGAGATGGTTTACGACACAACAACCGGCAATCAGGTTTATATCTCCGCTCCTGGTCCGTTGCCCGAAAATGTCACATCAGTTTCACCAGGTGGTGAATACCAGAAATGGGATGGTAAGGCTAAGGTCTGGGTAAAAGACGAAGCGGCTGAAAAAGCAGCGCAGCTTCGTCAGGCGGAAGAAACCAAAAGCCGTCTTTTGCAAATGGCATCTGAAAAAATCGCGCCATTGCAGGATGCAGTTGATCTTGGAATCGCAACAGATGATGAGAAAGCGCGGCTCGACGAATGGAAAAAATATAGGGTGCTGGTAAACCGGATGGATACAGCCGCCCCTGACTGGCCGGAAAGACCAGCCAGCCAGTAGGCGTTACTGTGGTAATGCAGGCCACCTGATTGCTGTGAAGGTGGCCTCATCTGAAACACCTGTTAAGTCCAGCGATTTAAGAACATTGATATAATTCATCGGCAGAATCAAAGCTGCTTTATTTTCATCACTGATAATTCCCAACGTTAATTCTGTGCGCCAGTCCTTAATGGCATTATCTGCATCGTTAAGTAATTTCTGCCGGGTGACTTCTGCCCGCCAAAATTGAACGCCCGGGCGCTGATGCCGGATGATCTGGTCATCGTGGAAAGCGACCCTGAAAAAATCGACACTTTAGCTGTAAAATGACAGTCCCGCCATCCGGTCATCATAACGGATTTTTCTTCTGCACCTTCTGAAGCCCGCCATGGCAGGACGACCATGAATCCGCCGAT
AACCTTATTGTGAAATTAAGACCAGGAAGAGATGATGTCTGTCGGACAGATACTATATGTAAATTTATAAAGGTTTTTTGTTATGCCCTTTCATATTGGAAGCGGATGTCTTCCCGCCATCATCAGTAACCGCCGCATTTATCGTATTGCCTGGTCTGATACCCCCCCTGAAATGAGTTCCTGGGAAAAAATGAAGGAATTTTTTTGCTCAACGCACCAGACTGAAGCGCTGGAGTGCATCTGGACGATTTGTCACCCGCCGGCCGGAACGACGCGGGAGGATGTGGTCAGCAGATTTGAACTGCTCAGGACGCTCGCGTATGACGGATGGGAGGAAAACATTCATTCCGGCCTGCACGGGGAAAACTACTTCTGTATTCTGGATGAAGACAGTCAGGAGATATTATCAGTCACCCTGGATGACGTCGGGAACTATACCGTAAATTGCCAGGGGTACAGTGAAACACATCACTTAACCATGGCAACAGAACCGGGAGTGGAACGCACAGATATAACTTACAACCTAACCAGTGATATTGATGCTGCGGCCTATCTGGAGGAATTGAAACAGAATCCAATTATAAATAATAAAATAATGAATCCGGTAGGGCAGTGTGAGTCATTAATGACTCCTGTAAGCAATTTTATGAATGAAAAAGGGTTCGATAATATTCGTTATCGAGGTATATTTATCTGGGATAAACCAACAGAGGAAATACCAACAAATCATTTTGCAGTGGTTGGAAATAAAGAAGGGAAAGACTATGTGTTTGATGTTTCAGCCCATCAGTTTGAAAATAGAGGTATGAGTAATCTGAATGGCCCATTAATTCTTTCAGCAGATGAATGGGTGTGTAAATATAGAATGGCAACAAGAAGGAAACTTATTTATTATACTGATTTTAGTAATTCAAGTATAGCAGCTAATGCCTATGATGCATTACCACGAGAATTAGAATCAGAATCTATGGCAGGGAAAGTTTTTGTTACATCACCGAGATGGTTTAATACCTTTAAAAAGCAAAAATATTCCTTAATAGGTAAAATGTAAGCGCACCGTGGAGGACGTCTGTCAGAACCCTGTCAATCCGGCGATGATAGTGTCCACTTAAATTTTGATGGACACTATCACAGATGACAGAGTCCACAGCCCGGCGCAAACACGGCTGTCGGTCAGGAAAGAGAAAAGCCAGTCGCTGTACGACTGGATACAGGCGCAGTTGAAAACGTTGTCGGTGCATGCGGAGATGGCGAAGGAGTTCGGTTACATGCTGAAGCAGTGTGATGCGTTGAGCGTGTCCTCTGCAGCGACGGTCGGGTGGAGATCGACAACAACATCTGTGAAAACGCCTTACGGTGCGTGGCGCTGGGCCGACGTAACTATCTGTTCTTCGGCTCAGACAGGAGCGGCGAGGCAGCGGCGATCATCTACAGCCTGCTGGGTACGTGCAAACTAAACGGCGTAGAGTCCGAGGCATGGTTACGCGACGTGCTGTGGAAAATCAGCGACTGGTCATCGAACCGGGTGCACGAACTGCTGCCCTGGAACCTCGAAACCGTAAAATAATCCTTACGCTACGTCCTAAACGGGACGCTTACTGAGTTATGAGGGTGAGGTGGTTTTCTTAATGAGCATCTGAACGCCGGTTTGTGGGATAGGGCAGGTGGGACGGATTCGGGACAGTCATCCGTTTTTAACTATTGGCATGGATTGGTATCTTTTCGCATCATGGGACGTGTGAGCGCAGGTATGACGCGGTATGTTATTGACTTAGAATGTGGTTCCAGGAACTGGCGCATCCATACCCGAGCAGTTTCCGGTGTCAGTGCCAGCGGGCGACGATCGTGAATGTCTACCAGACCTTTATCGGCTGCGGAGGTAACAATCAGGAATCCCTCTGCTTCATCACCGCGCTCAAACGGCGTACTGCCAATGGCAGCCATGAATATCGGCTTCCCGTCCTTTCTGTGAATGAAATACGGCTGTTTTTTGTCGCCTT
CCTTCTTCCACTCGAACCATCTATCGGCAAAACAGATAGCCCGGCCATGCTGCCATAGTGGCTTAAACATTCTGCTGGAGGCCGCTGTCGCGACACGGGCGTTAATAAGTGGAGCTTTATCCCACCATCCGGGAGCGTAACCCCAAATCACCGGGTCGAGATGTAATTGCTCGTCGCGTTCGCTCAATAGCAGGACTTTAGTCCCGGGCGCCACGTTATACCGGCCTATAGGCTGAGGGTCATAAGCAATATTACGATCGGCTTCGTCGGCCAGATATGCCAGATATTCTTCACGGGTCTGTGCTTGTGCAAAGCGTCCACACATATGAAACCTCCAGTCGTCAGACTGAAAGTATAGGGCAGGGGGAAAATGTGGCGCGCTCCGGTAATGATTTACAGGGAATTTATACAGTAATTCGAAATGAAAATTTGTGTATTGATGAATTCCGAAACTGAGCAGTGAGCAAACTGATGAATCAATCGCAAATTGCGTCGAATCTCGACGGCTTTATTCCCCAGTTTCACCCCATAGCTTCCCCGTAGGAAATTGAGCCATAAAAAAAACAGCCCTGACAGGCTGGTTTTTAAGGGGAATTTTGGTCGGCACGAGAGGATTTGAACCTCCGACCCCCGACCCCCCATGTTGGCGGTAGCCTGAATAATTATCTTGGTAAAGGTTAACTATCATAAAATGGTACACCAGTCTTTCCAGGAGGAGGAGTGTAAAGGTTTTGGCTCATAAACACCGTCATAGTAAAGACCACAAATACTGCAATCTTCATTTTTATATTTATCATTGAATGCTAAGGTAAGCTTGTCATAGACATCAATATTTGATTGCCATTCAGGTGAGGTATTCATTGAATCATAAATTAATACTTTTTTCACTTCACTATCATAGCCTACAATGCACTCTGCATGTAGACATTCCGAGCCAAGTGAAGGCCTGATCAACATCAGTGGTCCATGGTTTTGTAATTCGTAAGAAATAAAACTCTCAAAGTCATCTTCTACAGTACTATTGAAAACTTCTGCCTGTAAAGCTTCTTTAATGTTTGTAAAAAGAGAGGATTCAGGTATTTTTTGCACATTCATTTCTTTTAATAAATTCTCACATTCTATTATATCAATACCCTCTAATATATTATTAAACGCTTGATTATCAGAAGTGATGTCCTCTAATGCACTATAATTATTGCCATCTCTGGATTTGATAACATTTAATGAGCAAACCCAGCAATTATTGTAGAGGGTGTTGCCTGTGGTATCAAACTGGCTTTGAAAGTTACGATGGTGAATAATCTGACCCTGAGGAGTTGCTGTATTACTTCTGTAAACGCTGCCTAAACTATTTTGAATGTGTCTTAACATAATATACTCGCCGAATAGTAATTTTGTTAATGTAATTATATACTACAGTGTGGATATTAATACAATTCTTTTGTTGTTAATTATTATTTATGAAATTAATTGAAAGTGAATAAGTTAGAGGTGTTTGTTGGCCTTAAAATTACATTTGTTGAGGGGGCTTATATGATATGTTTTTATTGTATTGTCGCATTTTTCTTAAGCTGAATCCGGATTTTGGGGAGGTGGCTAAATGTAAATGACGTGGTTTAAGATAAATCTATTTTTAATAAGCTATCTGTTCAAATTTTCGCGATCGCTTTTGTTGGTATCACTATTCAAGCAGTTTGCCTGCATCGGCTTCACCCTCACTTCGGCATCAGGGAAAATCTGGTGCACCTGCTTCGTCAGTTCGGCCAGAATGATCTCGCTGGCCCCTTCGAGTCCTTCAACATTACGCTTGTCATAAACCAGTTCTACGAACATACGTGTTTCGCTAATAACTGTTTGTATATACAGTATTTTTGCTTTGGCGGTTTTGTCTGTCAAGGCATGAACCACTTGTTTTTAAATTTTGGGGAACATACTGCGGGCGTGTTTGTTATCGATTTTCCCTGCAGGGCTGATGGGGTCTGGCGTTGACTAAAATTATG
TGTGGGGCATGGATGGGGCAAAAGTGGTCTGTGAAGTTCGTTAAAGTTCGTTAATCAAGCTTCATCTCGATCTCGCTCATCCCTTGTTTAAAGCGCTCCTGGACGATCTTTATCGATTTTAAAAACTATGAGTACATATTATAAAAATGTAGCAAATAGGCCGTTTGTGCCTGAAAAGATGAACATTCTGCGTAGCGCGATTTGCGCAACAGGAATAGACTGGAGTCGACACTCTACACAAAGATGCGAAAGGTTTTTTATGACACAACAGCCACAAGCCAAATACCGCCATGACTATCGCGCGCCGGATTACCAGATTACTGATATTGACTTGACCTTTGACCTCGATGCCGAAAAAACCGTGGTCACCGCAATAAGCCAGGCTGTTCGTCATAGCGCGCCTGATGCGCCTCTTCGCCTTGATGGGGAAGATTTAACGCTGGTATCTATCCACGTCAACGATGCGCCGTGGACAGCATATAAGGAAGAAGAGGGCGCGCTTATCATCAGCGACCTGCCAGAGCGTTTTACGTTACGCATTGTCAACGAGATAAGTCCGGCGGCGAATACGGCGCTGGAAGGATTGTACCAGTCCGGCGATGCGCTCTGTACCCAGTGTGAAGCGGAGGGCTTCCGCCATATTACCTGGTATCTTGACCGCCCGGACGTACTGGCGCGATTTACCACCAAAATTATTGCCGATAAAAGCAAATATCCGTTCCTGCTCTCCAATGGCAACCGTGTTGCACAGGGCGAGCTGGAGAATGGCCGTCACTGGGTTCAGTGGCAAGATCCGTTCCCGAAACCGTGTTATCTGTTTGCGCTGGTGGCCGGTGATTTTGACGTGCTGCGCGATACCTTTACCACCCGCTCCGGGCGTGACGTCGCATTAGAACTGTACGTTGACCGTGGCAATCTGGATCGCGCGCCGTGGGCAATGACCTCGCTGAAAAATTCCATGAAATGGGATGAAGCGCGTTTTGGGCTCGAATATGACCTCGACATCTATATGATTGTCGCGGTGGATTTCTTTAATATGGGCGCGATGGAGAATAAAGGTCTCAATATCTTTAACTCCAAATACGTGCTGGCGCGAACCGATACCGCGACGGATAAAGATTATCTCGATATTGAGCGCGTGATAGGCCATGAGTATTTCCACAACTGGACCGGCAACCGCGTCACCTGCCGCGACTGGTTCCAGTTGAGCCTTAAAGAGGGGCTAACCGTGTTCCGCGATCAGGAGTTTAGCTCTGATTTGGGGTCACGCGCGGTGAACCGCATCAGTAACGTGCGTACCATGCGCGGTTTACAATTCGCGGAAGACGCCAGCCCGATGGCGCATCCTATCCGCCCGGATAAAGTAATCGAAATGAATAACTTCTACACCCTCACCGTTTATGAAAAGGGCGCGGAAGTCATTCGCATGATCCACACGTTGCTGGGTGAGGAAAATTTCCAGAAGGGGATGCAGCTTTATTTTGAGCGCCATGACGGCAGCGCCGCGACGTGTGATGACTTCGTACAGGCGATGGAAGATGCTTCTAATGTCGATTTGTCCCATTTCCGCCGCTGGTACAGTCAGTCCGGCACGCCGATTGTAACGGTAAAAGATGATTATAATCCGGAAACCGAGCAGTACACGTTGACCATCAGCCAGCGCACTCCGGCGACGGCGGATCAGGCGGAGAAGCAGCCGCTGCATATTCCATTCGCCATCGAACTGTACGATAACGAAGGCAACGTCATTCCGTTGCAAAAAGGCGGTCACCCGGTCAACGCCGTGCTGAACGTCACGCAGGCGGAGCAGACATTTACCTTCGATAATGTTTACTTCCAGCCTGTTCCGGCCTTGCTGTGCGAGTTTTCAGCGCCGGTGAAACTGGAATATAAATGGAGCGATCAGCAGTTGACGTTCCTGATGCGCCATGCGCGCAATGATTTCTCCCGTTGGGATGCGGCGCAAAGCCTGCTGGCCACATACATTAAACTGAATGTGG
CGCGTCATCAGCAGGGGCAACCGCTATCGCTTCCGGTGCATGTCGCTGATGCGTTCCGTGCAGTACTGTTGGATGAGAAAATCGATCCGGCGTTGGCCGCAGAAATTTTAACGCTGCCTTCGGCCAATGAAATTGCGGAGCTGTTTGAGGTCATTGACCCGATCGCCATTGCGCAAGTTCGTGAAGCGCTAACGCGTACGCTGGCGGCAGAACTGGCGGATGAGTTCCTGGCTATCTATAACGCCAATCATCTGGATGAGTATCGTGTTGATCACGGCGATATCGGTAAGCGCACGCTGCGCAATGCTTGCCTGCGCTTCCTGGCGTTCGGCGAGACGGAGCTGGCTAATACGCTGGTCAGCAAACAGTATCGCGACGCCAATAATATGACCGATGCGCTGGCGGCCCTGTCTGCTGCGGTGGCGGCGCAGTTGCCGTGCCGCGATACGCTGATGCAGGAGTATGACGATAAGTGGCATCAGGACGGCCTGGTGATGGATAAATGGTTTATCCTGCAATCCACAAGCCCGGCGGAAAATGTACTGGAAACCGTACGCGGCCTGCTCAAACACCGTTCTTTCAGTATGAGCAACCCGAACCGCGTCCGTTCATTAATTGGCGCGTTTGCTGGCAGCAACCCGGCGGCGTTCCATGCGCAAGACGGTAGCGGATACCAGTTCCTGGTCGAGATGCTGACCGATCTGAATAGCCGTAACCCGCAGGTAGCATCTCGCCTCATTGAACCGCTGATTCGTCTGAAACGTTACGATGAAAAGCGTCAGGAGAAAATGCGTGCGGCGCTGGAGCAGTTAAAAGGACTGGAGAATCTTTCCGGCGATCTGTACGAGAAGATAACTAAAGCGTTAGCCTGA
Protein sequences of DBSCAN-SWA_2 >NZ_CP007598|2625126:2635920|2629130_2630099_+|WP_001674638.1|DBSCAN-SWA MPFHIGSGCLPAIISNRRIYRIAWSDTPPEMSSWEKMKEFFCSTHQTEALECIWTICHPPAGTTREDVVSRFELLRTLAYDGWEENIHSGLHGENYFCILDEDSQEILSVTLDDVGNYTVNCQGYSETHHLTMATEPGVERTDITYNLTSDIDAAAYLEELKQNPIINNKIMNPVGQCESLMTPVSNFMNEKGFDNIRYRGIFIWDKPTEEIPTNHFAVVGNKEGKDYVFDVSAHQFENRGMSNLNGPLILSADEWVCKYRMATRRKLIYYTDFSNSSIAANAYDALPRELESESMAGKVFVTSPRWFNTFKKQKYSLIGKM >NZ_CP007598|2625126:2635920|2628071_2628653_+|WP_000143167.1|tail|DBSCAN-SWA MTFKMSEQAQTIKIFNLRSDTNEFIGAGDAYIPPHTGLPANCTDIAPPDIPASHIAIFDAETQTWSLHEDHRGEMVYDTTTGNQVYISAPGPLPENVTSVSPGGEYQKWDGKAKVWVKDEAAEKAAQLRQAEETKSRLLQMASEKIAPLQDAVDLGIATDDEKARLDEWKKYRVLVNRMDTAAPDWPERPASQ >NZ_CP007598|2625126:2635920|2625126_2625756_+|WP_000274547.1|DBSCAN-SWA MYGVQGTPDCYRIELKNVYGVQENLISYRQASLGAWVAIAGGGDPYEVAYAIYKAVPDISVLTNDVVNPSGAAVDKKTIPIIVYPDTYHVPFVVPSSQNVTLLITWNTASTSYIDPTGIEKAVQQSIADYINGIATGEPINIFLIRDIFLNQVKGLVSSNLVSMIDIQVGINGKIVPPATDSSLVYGDTYAYFSTSSSQIQVKQYGSSS >NZ_CP007598|2625126:2635920|2625739_2626366_+|WP_000729406.1|DBSCAN-SWA MAALLESIIPAYPYTQYNDDPDIVAFFDAYNKLAQGYLDYFNNLNLPCWTSPAITGELLDWIAAGIYGESRPLLQISEDAIARGAYNTIEYNNVAYAKLRNYVPGSASYVPDDYFKRILTWNFYKGDGSHFCINWFKRRLARFIHGANGIDPPVQSTFDISVMPDKGIFFVSIPDYGDGVGHFLKDAIDQSLVKLPFIYTYSVTVVEQ >NZ_CP007598|2625126:2635920|2632689_2632881_-|WP_000497441.1|DBSCAN-SWA MFVELVYDKRNVEGLEGASEIILAELTKQVHQIFPDAEVRVKPMQANCLNSDTNKSDRENLNR >NZ_CP007598|2625126:2635920|2631732_2632419_-|WP_001525490.1|DBSCAN-SWA MLRHIQNSLGSVYRSNTATPQGQIIHHRNFQSQFDTTGNTLYNNCWVCSLNVIKSRDGNNYSALEDITSDNQAFNNILEGIDIIECENLLKEMNVQKIPESSLFTNIKEALQAEVFNSTVEDDFESFISYELQNHGPLMLIRPSLGSECLHAECIVGYDSEVKKVLIYDSMNTSPEWQSNIDVYDKLTLAFNDKYKNEDCSICGLYYDGVYEPKPLHSSSWKDWCTIL >NZ_CP007598|2625126:2635920|2633307_2635920_+|WP_000193790.1|DBSCAN-SWA 
MTQQPQAKYRHDYRAPDYQITDIDLTFDLDAEKTVVTAISQAVRHSAPDAPLRLDGEDLTLVSIHVNDAPWTAYKEEEGALIISDLPERFTLRIVNEISPAANTALEGLYQSGDALCTQCEAEGFRHITWYLDRPDVLARFTTKIIADKSKYPFLLSNGNRVAQGELENGRHWVQWQDPFPKPCYLFALVAGDFDVLRDTFTTRSGRDVALELYVDRGNLDRAPWAMTSLKNSMKWDEARFGLEYDLDIYMIVAVDFFNMGAMENKGLNIFNSKYVLARTDTATDKDYLDIERVIGHEYFHNWTGNRVTCRDWFQLSLKEGLTVFRDQEFSSDLGSRAVNRISNVRTMRGLQFAEDASPMAHPIRPDKVIEMNNFYTLTVYEKGAEVIRMIHTLLGEENFQKGMQLYFERHDGSAATCDDFVQAMEDASNVDLSHFRRWYSQSGTPIVTVKDDYNPETEQYTLTISQRTPATADQAEKQPLHIPFAIELYDNEGNVIPLQKGGHPVNAVLNVTQAEQTFTFDNVYFQPVPALLCEFSAPVKLEYKWSDQQLTFLMRHARNDFSRWDAAQSLLATYIKLNVARHQQGQPLSLPVHVADAFRAVLLDEKIDPALAAEILTLPSANEIAELFEVIDPIAIAQVREALTRTLAAELADEFLAIYNANHLDEYRVDHGDIGKRTLRNACLRFLAFGETELANTLVSKQYRDANNMTDALAALSAAVAAQLPCRDTLMQEYDDKWHQDGLVMDKWFILQSTSPAENVLETVRGLLKHRSFSMSNPNRVRSLIGAFAGSNPAAFHAQDGSGYQFLVEMLTDLNSRNPQVASRLIEPLIRLKRYDEKRQEKMRAALEQLKGLENLSGDLYEKITKALA >NZ_CP007598|2625126:2635920|2630746_2631373_-|WP_000334547.1|DBSCAN-SWA MCGRFAQAQTREEYLAYLADEADRNIAYDPQPIGRYNVAPGTKVLLLSERDEQLHLDPVIWGYAPGWWDKAPLINARVATAASSRMFKPLWQHGRAICFADRWFEWKKEGDKKQPYFIHRKDGKPIFMAAIGSTPFERGDEAEGFLIVTSAADKGLVDIHDRRPLALTPETARVWMRQFLEPHSKSITYRVIPALTRPMMRKDTNPCQ >NZ_CP007598|2625126:2635920|2626362_2628072_+|WP_000583382.1|tail|DBSCAN-SWA MIIGFGNNVVSSLAADITASQTTIQVMPGVGAMFANLLTSDYANSSNPLKTYAKITLTDAKETVFEVCHLTAVNNDMLTVIRGQEGTTAKGWSLNDVIANFATRGSENQFVQIEELQSGHYVAGVAGGTENNLTLELPATYFVNGGVDWTLRTPLVVIPALNNTGASTLQLTMGGRVLGIFPLYKGNKAELSANDIIKDIPVLCVLDNTKTYFSVLNPLEIYLGSRYLQKDQNLSDVPDKAKGRSSLEVYSKTESDENYMAKSQCGADIPNKPLFVQNIGALPASGTAVAANRLASRGALPALTGTTRGSDSGLIMGEVYNNGYPTQYGNILRLTGTGDGEILIGWSGTNGAPAPAYIRSHRDTADAEWSEWAMLYTTLNPPPDSHPVGAAIAWPSDATPAGYALMQGQSFDKSAYPLLAIAYPSGVIPDMRGWTIKGKPISGRAVLSQEMDGNKSHSHTARAQVTDLGTKSTSSFDYGTKSTNTTGNHTHQFGGYINSYWGDSNHTSFQPGGGAWTQAAGDHAHTVYIGGHEHTMYIGPHGHVVIVDADGNAETTVKNIAFNYIVRLA |
9 | Escherichia_phage(37.5%) | tail | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
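Each prophage region ends with a summary row such as `9 | Escherichia_phage(37.5%) | tail | NA |`. The sketch below reads such a row into named fields. It assumes the first number is the protein count for the region (it matches the number of protein records listed for each region here) and treats the percentage as an opaque value, since the record does not say how it is computed. `parse_summary_row` and the dictionary keys are hypothetical names.

```python
import re

# count | Phage_name(percent%) | keyword,keyword | att sites or NA |
ROW_RE = re.compile(
    r"^\s*(\d+)\s*\|\s*(\S+?)\((\d+(?:\.\d+)?)%\)\s*\|\s*([^|]*?)\s*\|\s*(.*?)\s*\|?\s*$"
)
# att sites look like 'attL 2534841:2534855|attR 2553634:2553648'
ATT_RE = re.compile(r"(attL|attR)\s+(\d+):(\d+)")


def parse_summary_row(row: str) -> dict:
    """Parse one prophage summary row into a dictionary."""
    match = ROW_RE.match(row)
    if match is None:
        raise ValueError(f"unrecognized summary row: {row!r}")
    count, phage, percent, keywords, att = match.groups()
    return {
        "protein_count": int(count),          # assumed meaning, see lead-in
        "best_hit_phage": phage,
        "best_hit_percent": float(percent),   # meaning not stated in the record
        "keywords": [k for k in keywords.split(",") if k],
        # Empty when the att column is 'NA'
        "att_sites": {name: (int(a), int(b)) for name, a, b in ATT_RE.findall(att)},
    }
```

Rows without attachment sites, like the one for this region, yield an empty `att_sites` mapping instead of raising an error.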
DBSCAN-SWA_3 |
2837785 : 2850396
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP007598|2837785:2850396|DBSCAN-SWA ATTAGAGCATCATATAAGCTTTATCATCACGCTCATCGAGATAGAGTTTCGTGGTGTTCGCTGATGTGTGGCCCAGGAGTTTTTGGGCGAACACCTCGCCGTGCTCGTTTTTGTACAGCCGCCCGGCCAGACTTCGGATCTCGTGAAATGTCGGTGGATTATTGCTGAAGTTAACACCGGAGGCTTTTCTTGCTTTTACAAATGTCTTTGTCAATCCATCCGGATGAATATTCCCGGTTGGGCTATTTTTCCTGATTCCGGCACTGATCATGAAATCAGTGCGGCTTACCAGTCGGCAGCGATCGATTACCGTTCCCAGACGTAACCCCGTCGCCCGAAGTGTCAGGGAGAGGGGAATGGCTATTTTCATTCCGGTTTTAATCTGAGTTACGTATAAGCGGTTGTCAAAAACATCACTAAATTTCATATTTACGATATCCTCCCTACGTTGACCAGTAACGAGCGCTAAATCCATCGCGAGAGGGAACCATGCAGGCATATGCTCTGCTGCCGCTCGTGTGGCGTTATACGTTTCCAGTTGCAGGCGTTCCCTGGCCACCTTAATCTCTGGTATCCGGGTTGCTTCCACCGGGTTTTTCACAATATGCCCTTCGACAATAGCCTCTCTGAACATGTCAGATAGAACTGATCTCATTGCTCCCGCCATAGTGTTTTTTCCCTCGGTTATCCACGACTCAAGAAACTTGGCAATGTGCCTGGTTGTTACTTCTGCCAGTATTATTTCCCCCATTTTTTCGCGTACGGTCGCTAATTGATTACCGCGAATCTTGTAGGTATTAACCGACAGACTCCGGCGCTGTAATAAAACCTCATAGCGATCAATCCATGCGGACACAGTGAATGAGTCAGTTCCTTTTAGCTTTTCAATAAGCGCCACTGGTGTGTGGTTTTGCGCTATGAAGTTGTTTGCCTCTATGGCCTGTGTGATAGCGTCCCTGCGGGCGATCTGACCGAGCGGAAATTCCTTGTCAGTTAGCGGGTTACGCCAGAAAAAAGATTTACTGGCCTTACGGTAGGTGAGGTTCCTCGGAAGGTTAGCATCGTACTTTTTTCGACTCACTGATCAACTTCTCCAGCAATGCACTCGGTTTTCCGGTGCGCCCGTTTGGGTGGTGCTGTTCAAGCACAAGTCCCACTTTATTCGGCTTGATATAAAACGCGTCCGGATCAACCCGATACGTCCTGCCATGTAATACTGGAGTCGGGTAAATATTTCCGTTTCGCGCCCATCGTCTTAGTGTTGTGAGAGGTGGTGGATCATCGGGATAATTTAATTCACCCCAGGTTTCAAGTCTCACAAAGCTCATAGTCATGTCTCTTTACTTCATGACCGCCGCCAACTATACGGTGTGGCGGTCGGTCAGGGTTGAACATCAATGATCAGGGTAAAATTTAAAGGACTGCTGACCGCCGCCCGGTAAAACTTTTACATCTCCGGCGCGCCGTCCTGTAAATCCTGCCCAATGCGCGGCGCGTATCCTGTCAGCCTGCTCTTCAGTCAGGCAGGGTCCCGGAAACGGGAAGCGAGCTTACCCCACTTACTAAAAGAGGATGGAACTGGCTGACGTAAAACACGATTTATTTGTATCCATAAATAGCGAAATGTAACGTTTTGGTTATATTTAAAAGAGAGAAAATGGTCAGCAATAACTTTAATTGTTTGAATTAACAGATAATTAATCTACCAGACTGAGTGATACAGAATATTTTTACATGAGGGGTACAAATGAGACTTAAGTTGATCGTTAAAAGTTTTGCGCTGGCGGGGCTACTCTCTTCCACTGCGCTGACACCTTTATTTGCACAGGAAGCCCCAAAAGGTGCCACTGCTTCAACCAAGCAAGCTAACGATGCGCTTTATAACCAACTTCCTTTCTCTGATAACACCGATTTCACGAATGC
CCATAAAGGCTTTATCGCTGGTTTACCTGAAGAGGTGATTAAGGGAGAGCAAGGGAATGTCATCTGGAATCCACAGCAGTACGCTTTCATAAAAGAAGGGGAAAAATCTCCTGACACTGTTAACCCTAGTCTGTGGCGTCAGTCCCAGCTAATCAATATCAGTGGCTTGTTTGAAGTCACAGACGGCGTCTACCAGATTCGTAACCTTGATTTATCCAACATGACGATTATCGAAGGTAAAGAGGGGATTACGGTTGTCGATCCGCTGGTTTCTGCGGAAACAGCCAAAGCCGGTATGGATTTGTATTTCAAAAACCGTGGCAATAAGCCTGTTGTCGCCATCATTTATACTCATAGCCATGTTGACCACTATGGCGGTGTGCGTGGCGTTGTCGATGAAGCGGACGTGAAATCCGGCAAGGTGAAAGTGTATGCGCCTGCTGGCTTTATGGAGGCAGCAGTAGCCGAGAATATTATGGCCGGCAACGTGATGAGCCGCCGTGCCAGCTATATGTATGGCAACCTCCTGAAACCAGATGCCTCCGGCCAGGTTGGCGCCGGACTGGGGACGACCACCTCTGCGGGGACGGTGACACTGATTGCGCCCACTAATATCATCGATAAAGACGGCCAGAAAGAAGTGATTGATGGCCTGACTTACGACTTTATGCTGGCCCCTGGTTCGGAAGCCCCTTCGGAAATGCTGTGGTTCATCGAAGAGAAGAAACTCATCGAAGCCGCAGAGGACGTCACTCACACCCTGCATAACACTTACTCGCTACGTGGCGCAAAAATTCGTGAGCCGTTGCCGTGGTCGAAATATATCAACGAAGCTATAGTGCGTTGGGGTGACAAAGCTGAAATTATTATGGCCCAGCACCACTGGCCGACCTGGGGTAACGAGAATGTTGTTGGTCTGCTGAAAAGCCAGCGAGACCTGTATCGTTATATCAATGACCAGACTCTGCGCATGGCCAATGAAGGTCTGACTCGCGACGAAATAGCGGCCAACTTTAAACTACCGGATAGCCTGGCAAAAACCTGGGCCAACCGCGGCTATTACGGCTCCATCAGCCATGACGTAAAAGCAACGTATGTGCTGTATCTCGGTTGGTTCGATGGCAATCCGGCAACCCTTGATGAGCTGCCACCCGAAGAAGCGGCCAAGAAATTTGTTGAATACATGGGCGGTGCCGATGCGATTCTTCAGAAAGCTAAAGCAGACTTTGACCAGGGGAACTACCGTTGGGTTGCTCAGGTGGTGAGTAAGGTCGTGTTTGCCGATCCAAATAACCAGAATGCACGTAACCTTGAAGCCGATGCGCTGGAGCAATTGGGGTATCAGGCTGAATCTGGTCCATGGCGTAACTTCTACCTGACCGGTGCGCAGGAGCTGCGTAACGGTGTGGTTAAAGGTCCGACGCCAAATACAGCAAGTCCGGATACCGTTCGGGCGATGACCCCTGAAATGTTCTTCGACTTTCTGGCTGTACATATCAACGGTGAAAAAGCGGGTAATGCCCGGGCGGTATTTAATATTGACCTTGGCAGCGACGGCGGAAAGTACAAGCTTGAGCTGGAAAATGGCGTGCTGAACCACACGGCTAATGCTGAAGCGAAAGATGCTGATGCCACGATTACTCTGAACCGTGACACGCTGAATAAAATTATCCTGAAGGAAGAAACTCTGAAGCAGGCTCAAGATAAAGGAGAAGTCAACGTTACCGGTAATGCTGCGAAACTGGATGAGATGCTGGGCTATATGGACAAGTTTGAGTTCTGGTTCAATATAGTTACACCATAAATAGATTCCCTGCGGCGTCAATGCTGCAGGGAAGTTACTTCAGACAATTCTGTACGTTTTTTATACTCTATTTTCCCTTCATTCTTTATATCTTGCTTCATCTTATGTATTTGCTGCTGAAGAACATGGCCCTGATACCAGTCAGTTCTGATTCTGTTATGCACAGCCTTTTTCATCAGATGACAGTAACTGGTTG
TTGCGTGATTCAATGGCCTGCGAGTCTGGCCAACATGCTTTTCGATGCCGGTTGGCCATGATGCCAGTATGGTTAACTGGCATCATGGCAGCATAATTTTGCCGGATAAGTCAACCGCAGCGATGTTAATCGTCCTGATTATCATCTGCATCACTGTCACAGTGACTGCACCAGTAACGAGGAGAGACTGCGATCGAACCGGCCAGACAGAGAGGAGGTAGTTGTCTTCATTGCTAAGTAAGAGACCTTGGGGGATGAATCTCCATCACCTGTGATGTGTCAGACAACCTCAATGTACCCGCACTTAATACCTGCGCCGGCGGTTTTTTTAATGTCCGGGAAATGAGCATGTCAAAAAATAACCAGTTATAAGATTATAAATAGAACACAGAGAAAATGTCATTGCATATGGTCAAAAAATAGATATATTTATTGATGATGATAATTAATAGTCTCCTATATATTCATGTTGAGAATGAAGATGCTTTAAAAATGCTCAAGTTCGTTATCTATGGAGACACCGTGAAAAATTTAAATAAAACATTCACTTGTAAATATGCTGTTATTCGCCGTGATGACATGACAGTAATTGCTGAAATGGATTTTTTTCCTGACTGCAACAGGTCATTGATGTATCGGGATGGCCGCTATGTCCGGTTTCTGCCGTTGTTGCAAAATGACATCATGGGGAGCGATACCCTGATTAATGAGCTGACTATCAGGGCCGGTTATCATGAATAATCATCCTTTGTTATACTCGTCTGCGGGCTGAACTCCCAATCTACTGCGCCAACGGAGAGAACGATGGCGCATTTACAACTGGTCAAGCAAACCTCATCAGGGCTTCTGCTCCCGGCGACGCCGGAGAGTGGGGATTTCCTGCGCTCAGTAAAAATCGGTGAGTGGATACACGCCGATTTTAAACGTGTCCGCAACTACGCCTTTCATAAACGATTTTTTAAACTCCTTCAGCTTGGTTTCGACTACTGGATGCCAACGGGCGGCACGGTCACATCGCGGGAACAGAAACTTATCTCCGGGTTCGTTAATTTTCTTTGCGACTCCGCAGGCCAGGAATATACCCCGGCCCTTAACGAGGCGGCGGAACAGTACCTCCATAACGTAGCTACCCTGCGAACCGGGGACGTCGCCCTTCTTAAGTCTTTCGATGCCTTCCGGGAATGGGTAACCGTTCAGGCCGGGTTTTATACCGAGCATTTTTATCCGGATGGCAGTCGCGGGCGCCGGGCGAAATCCATAGCATTCGCCAGTATGGACGAAACCGAGTTTCAACAGGTCTATAAAGCTGTGCTGAACGTCCTGTGGAACTGGATTCTGTTTCGTAAATTTTCCTCTCTGGAAGAAGTTGAAAATGTGGCCGCGCATCTGCTGGAGTTCGCATGAAAATGACATGGTTTCAGCATCCGGTGTGTACCACCGAAGAGGCGGATGAGCTGGTGGCGGGATACCGGCGCCGTGGCGTGAAGGTTGAGCGTTACGGTGAGGCGGAGGTGCTGGAACTTGAGAGCAATAATACTCCGCAACGTTGGACGGTTGAGGAGCTGAAAGAAATCAGGATCGCTGCACTGGCGGATCTGCGTGCGCTAAAAAAGCTGGAGGCGGCATGACATTCGAATCCTACTTTGCCGATCATCTCCGTGCTCGCTGGTAGCAGTTGTGCTTATACCATTTCCGGGTTCCATCCTGATCGATTACCGGATTTTGAAAAACTACGTGAAGATAACGGGCGGTACCGTATGAATACACAATATCTGGAATATGTGCGACAGCAGCTCATCGTGGCGACTGCAGATCTGAGTGGCGCCACGAAAGGTCAGTTGCAGGCATGGCTGGAGAACGCCCAGCTCTATACGAAAAACTATCCCCGAAAAAAACAGCGTATCAGGGATGAAGTGACCGGAAAAATGATAACGCTGAATAATCCACCGATTGCTGGTAAGCAATCACTGGCGAAAGGAAGCGCAATTCCGCTTGTGCAG
CCCGTAGAATACTCCACTTCCTCATGGCGCCGTGCGCTTTTGTCACTCGAAGAACATAATAAGGCCTGGCTATTGTGGAATTACAGTGAAAACACCTGCTGGGAATATCAGGTCACTGTAACTCGATGGGCTTGGGAAAAATTCAGCCAGCAGTTGGAAGGGAAGCGAGTTGCGAAGAAGACTTTAGCACGGTTGCGCCAGCTCATCTGGCTTGCTGCGCAGGATGTGAAGGCGGAACTGGCCAGACGTGAGACGTATGAGTACCAAACGTTGGCGGAACTGATGGGCGTGGCAAAATCTACCTGGACAGAGACGTACATGTCTCATTGGTTAGTAATGCGTAACAGCTTTAAACGGCTTGATAGTGATGCGCTTATCTCCGTAACACGATCGCGTTCACAACAAAAGGCGACAAATTTGGATATAAGTCTTGCAAAACCGAACTGAAATACATATATTTCATGTAAATTTGATATCATGCCTAAAATATGCAAGCCTGCTGAGGAACGGGATTTTTGTATTAATCAGCAATAGGAATTGATGATGTTTTATCGTGATTTATTTCAAGTTTTTGGTCCCGACCCGTTGTATAAGGAAGAAGAAGGAATTGCCATCCTTCGTGAGCAATATGGGATCGAAGCTCCAGAACAAATTTTTAAGCAAATTTATTGTGGGTTATCTAATAATTCTGAATTTCAAACCTTGTATGGGCATCTAAATCTTAAATCACTGAAGTGGGATTTGGTCAGATTGAAAACAGCAGAGTTTACAAAGTTTGGCAGAAATGCCACATATCCTGATTACATGCTCGAGATTTCAGAAGACTTTAATGCCTGCGGCAGCAAGTTTTGCATTGATGCCCGTGAAGAGGTTGCAAACCATTGGCTTAAATTCGGTACATGGGCTGAACCACCGATGTTTATTGAGCGTTCGCTTATTATTCCTGGAGAGAGCGGCTTACACCTTATGGAGGGTCATACAAGATTAGGTACTTTATTGGGGGCTATTAAGTACAAATTTGTGCAGTTAGCTGATACTCATGAACTTTATATAGCCTCGCAGAAATAGTTTAGAGAGGATTGCTCAACACCCTCGGTAAGAATTGCCACACATCAAACTAATGTTAGGGTATCTTCGTCCACAGAGTCGAAATGGCCTTATTTACATCTTCCTGGCTTTTCGCCGGTTTTTTTATTCAGCCCTCGGAAATCATCATCTACACGCTTCGTTGTTAAAACCCCGCCCGAGGGCCTCTCACCCTTACAAATACAGCGCCATCCAAGCTATCGGGGGCGAGGCTTATGAAAATGCACAACGATCCCCATTCAATGGACTCACAATCTATTTTTGCGCTGATTGCGAGTCTGACTTTTTTCGATTGTGAGTCGTTACAGATAGCCGCCGGGCCAGACACCACAACGGTACCAGGTGGCGTTATGTGCTAGAAACCGAAATTCTTGAACATCTCATTACTACTCATATCGTTGGCTGGCGCACGGTTCATCGACTCCATCTGCTTGATCAACCCTATGCTAGCACCAACAGGCCCCCCGACGCTGAAACCGACGCTGCTGGAGCGGGTTTTGCTGGTGCTACTGCTGTGGGTTTGACTACGGGTGTTACAGTTGCTTTGGTGTGTCCGTGTGAGACTATTTTCGCTGAGCTGGCTATCTTCACTTTGGGTGCAGTTTCCCTGTGACTGGCTGTGACTGTCTCCTGTGGTGTGGTTGCGGCTATCCGTGTGCCACTCGCTATAGCCTCCCGCTGGCGCAATGATTCCGACCGATGTACCGTGGGTGGCATAGGGAGGCATGACCATGCCACTACAGCCACCGAGCAGTAACGCGGTGCCGACGATAAAAATGAACTTACCCAGAGCGGTGTTTTTCATTAGTTCCTACCTATTTCTTCTCGTTGGTTATTTCTATGCCCTGCCTTGGCAGGGCTCGCTATTATTCATTCTTCAACTGTGAGATCAAGTTATTTTAATTAAATCGGTA
TTATTCTTAATTGTGTTCTTTATGAATAAATATCCTCCGGCTATGCCGGAGGATATTTATTATTTCCCCTCATAACTGAGAGGCCCCACACAACCAGAGGGGGATGAATGTCCGAATCGATTTCTGGTACTGGGTTAGCAGGTGGCATCCTGACAGGAGCCAGTGTCTATGGACTGCTGACCTGTATTAGCTCAGACCTGAACTGGTTACTGTGTTGTAGCATCGTGGGATTTTGCATTTTTTGATGAGTGTCAATTACTAAATTCGTAGGCGATTCTTGGTGGTGATGTGTGACCCATCTCTTTTAAAATGATATTGGTATACTCGACTACCGGGCCTCTTGGATTACTGTCTTCTTTGTCCTGAAGGTGAGTCAACGCGTGTACAACTTCGTGAATAAATGAGCGTGTTGTATCAAATTGTTGTGGGCCATCATTACTTTCATAGTACTCTGGTATTGAATCATCGTCTGTATCATCCAGGTTGAGGGCAATCACTTTTCTGCCTTCTGAACTCTCCAGGTCCTCATCAGTTACGGTAGTACCAAAGTTTTCTCCGGCTCCCAGCAACCAGCGTTGTTCTACATCACGCAATTCCTGATCGTAGGCATAATTCATCAGTCTGCGGAATGTCCCGCTTTGAGTGTATGCATCTTCAAGTATGCGTGATAGCACCTCACGGCATTCATCATAGGTATCATCATCAATTTCGATATCAGGATCCATTCCTCCTGGTCCAGAGATAAGGTATTCAGCAAGACACATTGGTTCCAGCCTGGCTTTATCATCGGTAGCAAGACCATCATGTTGGAGGCGTAATTGCGAAGGATTATCTTGGTGTTCTGGAAGGTCTGGAAATACCTTGCTGTCATGAGGATGGGATAATCCATATGTTGACATCATATTATTGATAAATATTGGTTTAATTCCCGTTGGCATGATGAGTTACACATCCTTTTTATTACATGGAATTAACATTCTATAAATAGCATGTTTTTGTCAAACAGAATTCACTCAGCACGCAATCAATTAAACTAAAAGCTAAATTTGCAGTATTTGTGCCTCACCTCCATTAAAATTGTACTCTGCGTGATTTTACTTTCAGATTCTGCAACCACAGGCAATCCTGTTTTACAAGATATTAAACCCTGCAACCCAACCATTTCACTCACTCTAGTTACCATCCGAAATCATCGGAGGTGAGGCTTATGAAAATGAATGACAAGACTCCTGAATTCTGGGCTGCGGTTTTGACCGGACTCAAAAATGCGTGGCCCCAGATACTTGGGGCGTTAATGGCCGGACTCATTGCCTACGGCCGACTGATATACGACGGCGCCACCCGTAAAAATAAATGGCTTGAGGGCGTCCTGTGTGGCGCTCTTTCCTTATGTGTCACCAGTGCGCTTGATGTGGTAGGCCTGCCGGTTTCCATTTCGCCTTTCGTTGGCGGAATTATTGGCTTTGTCGGTGTGGACAAGCTGCGCGAAATCGCAATTAGCGCACTCAAAAAACGTGCAGGGGTTAATGATGAGAATCAGTGAAAAAGGCATTACCCTAATCAAAGAGTTTGAAGGTTGTAGCCTGACAGCTTATCCGGACCCGGGAACGGGGGGAGATCCCTGGACGATTGGTTATGGCTGGACCCACTCTGTTGACGGTAAGCCAGTTAAGCCCGGAATGATGATTGACGAGGCTACTGCCGAGCGCTTGCTTAACACTGGTTTAGTCGGTTATGAAAATGATGTGTCCAGACTGGTTAAGGTCAAGTTGACGCAAGGCCAGTTTGATGCGCTGGTGTCGTTCGCGTACAACCTCGGCGCCCGGACATTATCCTCATCAACTCTGCTGCGGAAGCTAAACGCTGGTGATTACGCTGGCGCCGCTGATGAGTTCCTGCGCTGGAATAAGGCTGGTGGCAAAGTACTGAACGGGCTTACCCGTCGGCGTGAGGCGGAGCGTGCTCTGTTCCTGTCATGATGTTCAACTGGAAAACGATGT
TTGTTGGCCTGTTGCTTGTCTCGCTAATTGTTGCCGGTCGGCTGGCAAATCACTACCGAAATAACGCCATCACCTACAAAGAGCAGCGCGATACCGTTACTCATAGGCTGACGCTGGCGAACGCGACAATTACCGACATGACTAAGCGCCAGCGTGACGTTGCCGCCCTCGATGAAAAATACACGAAGGAATTAGCCGATGCGAAAGCTGAGAATGATGCTTTGCGCGATGATGTTGCCGCTGGCCGCCGTCGCCTGTACGTCAACGCAACATGCCCCGCAGTGCCGACAGGTAAATCCACCTCCACCGCCCGCATGGATAATGCAGCCAGCCCCAGACTGGCAGACTCCGCTCAACGGGATTATTTCGCCCTCAAAGAGCGAGTGAAGACGATGCAAAAGCAACTGGAAGGGGCGCAGGCGTACATTCGCACCCAATGCCACGGTAATGCAGGAAAAACTAGTAACCAATGGTGACTGTATTAAAAAGGTACTCCCGGGCAGGGGGCGCCACGGGTGGCTTCGGGCTCGCGGGAATCGGCTGATTTTTGATTTTTTAGTCTCTGTCAGCACTGAAAAATAACCTTAAAAATCAATACATTTACTGTTTTCAGTGTCGAGGTGGTACGTTTTTTGTTCGACACTGAACGCCATTTTCACCGTATACAGGAAAAGAGCACGACTGTGGATCAGGAAATTAAAAGCCTCGAATTAAACATCACACAGCTTTCGGCCATCACTGGTGCACACCGACAGACCATCGCCAGCAGGCTGAAGGGCGTAAAAACCTCAGGTGGGAACGGTAGTAACCTGAAAATCTACCGGCTGGTGGATATTCTGACCGCCATGATGACGATGCCGGCTGTTACCGGGGAGAATGACCCCAATAAGATGAAACCCTCAGATCGACGGGCATGGTTTCAGTCGGAAATGACGCGTATTGAGCTGGAAAAGGAGATGAGAACTCTGATCCCGGCCAGCGAGGTGCTGAGCGTTTAAGGTAGGTGATGCTGATTATACCTCCCTTCTTGTTGGTCCTTCATACCGTTTTAACGACTATCTGAATGCTTACGTGATGATTGGTGCAGCAAACGGACATATTAAGGATAACTGGGGAAATTCTGACAATAAAACCGCCTTTGCTTATGGGGCAGGTATTCAGCTTAACCCGGTTGAAAATATTGCCGTTAATGCGTCTTATGAGCATACAAGTTTTTCCACTGATGCTGACAGTGACGTCAAAGCTGGAACCTAGGTGCTTGGCGTAGGTTACAGCTTCTGACCTTTAACATCGATACAGATTTAATGCCCTCCAGTGAGAGGGCTTTTTTATGGGTAAAACGAAATTATGACGATATGGCTATGTTGCTGTTATTTCTCAATGACACCACAGGCAAAACGTGCACCGCCACCACCCAGTGGAGCAGGTTTATCGGAGTAATTGTCACCGCCTTTATGGATCATCAATGAGTGACCTTTCAGTTCTGACAGTGATTTAAGGCGTGGTGCCAGTAACGGATACGTGGCTGTACCATCTGCATTGACAACCAGTCCAGGCAGATCCCCCAAATGCCCTTTGTCATTATATGGGCCAAGATGTTTCCCGGTTTTTTCGGGGTCAAGATGTCCTCCGGCCATGAGCGCCGGAACCTCTTTACCGTCTTTCATTCCCGGCATACAACTTGGGTTTGTGTGGACATGGAAGCCGTGAATTCCTGGCGTAAGACCATTTAGGTGAGGAGTGAAAAGCAGACCGTAAGGTGTCTCTGAAACTGTGATTTCACCTATGTTTTCTCCTGTTCCGCTGGACAGGGCATCGTTCATCTTTACAGTCAGGGTATTCTCTGCCATTGCTGAACAACTGATGAGCGCACCAGCTACCAGCGACAATATTGTGTATTTCATTAGTTACCTCGTTTTTTGGTTGTATCGTAAATACCATTAATAAAAGCAGGTATATTTTTGCAAGATAAATAATAAAGGATCTCTCATATATGCAGG
ATATACCACAGGAAACCTTGAGCGAGACCACCAAAGCGGAGCAGTCCGCGAAGGTGGATTTGTGGGAATTTGATTTAACCGCGATTGGCGGTGAGCGCTTTTTCTTCTGTAACGAACCGAACGAAAAAGGCGAGCCGTTAACCTGGCAGGGGAGGCAGTACGAACCGTACCCGATACAGGTACAGGATTTTGAGATGAACGGGAAAGGCGCATCTCCCCGCCCGAACCTCGTTGTTGCCAATCTCTTTGGTCTGGTCACGGGGATGGCGGAGGATTTGCAAAGTCTCGTCGGCGCGTCAGTGGTAAGGCATCAGGTTTACAGCAAGTTTCTTGATGCGGTGAATTTCAGTAACGGCAATCCGGGCGCTGACCCGGAGCAGGAGGCGGTAGCGCGCTATAACGTGGAGCAGTTGTCAGAACTGGATTCATCAACTGCTACCATTATTCTGGCATCACCGGCAGAAACCGACGGTTCTGTGGTGCCGGGGCGTACCATGCTGGCGGACTCCTGTCCGTGGGATTACCGGGATGAAAACTGCGGATACGACGGCCCGCCCGTGGCCGATGAGTTCGATAAGCCCACCTCAGACCCGAAAAAGGATAAATGCAGCCACTGCATGAAAGGCTGTGAAATGCGTAACAATCTGGTGAATGCCGGATTTTTCGCTTCCATCAACAAACTGTCTTAA
Protein sequences of DBSCAN-SWA_3 >NZ_CP007598|2837785:2850396|2846920_2847250_+|WP_001574216.1|holin|DBSCAN-SWA MNDKTPEFWAAVLTGLKNAWPQILGALMAGLIAYGRLIYDGATRKNKWLEGVLCGALSLCVTSALDVVGLPVSISPFVGGIIGFVGVDKLREIAISALKKRAGVNDENQ >NZ_CP007598|2837785:2850396|2843464_2844154_+|WP_001097218.1|DBSCAN-SWA MNTQYLEYVRQQLIVATADLSGATKGQLQAWLENAQLYTKNYPRKKQRIRDEVTGKMITLNNPPIAGKQSLAKGSAIPLVQPVEYSTSSWRRALLSLEEHNKAWLLWNYSENTCWEYQVTVTRWAWEKFSQQLEGKRVAKKTLARLRQLIWLAAQDVKAELARRETYEYQTLAELMGVAKSTWTETYMSHWLVMRNSFKRLDSDALISVTRSRSQQKATNLDISLAKPN >NZ_CP007598|2837785:2850396|2838839_2839118_-|WP_001575998.1|DBSCAN-SWA MTMSFVRLETWGELNYPDDPPPLTTLRRWARNGNIYPTPVLHGRTYRVDPDAFYIKPNKVGLVLEQHHPNGRTGKPSALLEKLISESKKVRC >NZ_CP007598|2837785:2850396|2845958_2846645_-|WP_001574215.1|DBSCAN-SWA MPTGIKPIFINNMMSTYGLSHPHDSKVFPDLPEHQDNPSQLRLQHDGLATDDKARLEPMCLAEYLISGPGGMDPDIEIDDDTYDECREVLSRILEDAYTQSGTFRRLMNYAYDQELRDVEQRWLLGAGENFGTTVTDEDLESSEGRKVIALNLDDTDDDSIPEYYESNDGPQQFDTTRSFIHEVVHALTHLQDKEDSNPRGPVVEYTNIILKEMGHTSPPRIAYEFSN >NZ_CP007598|2837785:2850396|2849700_2850396_+|WP_001152416.1|tail|DBSCAN-SWA MQDIPQETLSETTKAEQSAKVDLWEFDLTAIGGERFFFCNEPNEKGEPLTWQGRQYEPYPIQVQDFEMNGKGASPRPNLVVANLFGLVTGMAEDLQSLVGASVVRHQVYSKFLDAVNFSNGNPGADPEQEAVARYNVEQLSELDSSTATIILASPAETDGSVVPGRTMLADSCPWDYRDENCGYDGPPVADEFDKPTSDPKKDKCSHCMKGCEMRNNLVNAGFFASINKLS >NZ_CP007598|2837785:2850396|2842199_2842448_+|WP_000911593.1|DBSCAN-SWA MLKFVIYGDTVKNLNKTFTCKYAVIRRDDMTVIAEMDFFPDCNRSLMYRDGRYVRFLPLLQNDIMGSDTLINELTIRAGYHE >NZ_CP007598|2837785:2850396|2845148_2845598_-|WP_000798708.1|DBSCAN-SWA MKNTALGKFIFIVGTALLLGGCSGMVMPPYATHGTSVGIIAPAGGYSEWHTDSRNHTTGDSHSQSQGNCTQSEDSQLSENSLTRTHQSNCNTRSQTHSSSTSKTRSSSVGFSVGGPVGASIGLIKQMESMNRAPANDMSSNEMFKNFGF >NZ_CP007598|2837785:2850396|2849077_2849611_-|WP_000877926.1|DBSCAN-SWA MKYTILSLVAGALISCSAMAENTLTVKMNDALSSGTGENIGEITVSETPYGLLFTPHLNGLTPGIHGFHVHTNPSCMPGMKDGKEVPALMAGGHLDPEKTGKHLGPYNDKGHLGDLPGLVVNADGTATYPLLAPRLKSLSELKGHSLMIHKGGDNYSDKPAPLGGGGARFACGVIEK 
>NZ_CP007598|2837785:2850396|2837785_2838865_-|WP_000087636.1|integrase|DBSCAN-SWA MSRKKYDANLPRNLTYRKASKSFFWRNPLTDKEFPLGQIARRDAITQAIEANNFIAQNHTPVALIEKLKGTDSFTVSAWIDRYEVLLQRRSLSVNTYKIRGNQLATVREKMGEIILAEVTTRHIAKFLESWITEGKNTMAGAMRSVLSDMFREAIVEGHIVKNPVEATRIPEIKVARERLQLETYNATRAAAEHMPAWFPLAMDLALVTGQRREDIVNMKFSDVFDNRLYVTQIKTGMKIAIPLSLTLRATGLRLGTVIDRCRLVSRTDFMISAGIRKNSPTGNIHPDGLTKTFVKARKASGVNFSNNPPTFHEIRSLAGRLYKNEHGEVFAQKLLGHTSANTTKLYLDERDDKAYMML >NZ_CP007598|2837785:2850396|2844250_2844775_+|WP_001574213.1|DBSCAN-SWA MFYRDLFQVFGPDPLYKEEEGIAILREQYGIEAPEQIFKQIYCGLSNNSEFQTLYGHLNLKSLKWDLVRLKTAEFTKFGRNATYPDYMLEISEDFNACGSKFCIDAREEVANHWLKFGTWAEPPMFIERSLIIPGESGLHLMEGHTRLGTLLGAIKYKFVQLADTHELYIASQK >NZ_CP007598|2837785:2850396|2847233_2847686_+|WP_000984586.1|DBSCAN-SWA MMRISEKGITLIKEFEGCSLTAYPDPGTGGDPWTIGYGWTHSVDGKPVKPGMMIDEATAERLLNTGLVGYENDVSRLVKVKLTQGQFDALVSFAYNLGARTLSSSTLLRKLNAGDYAGAADEFLRWNKAGGKVLNGLTRRREAERALFLS >NZ_CP007598|2837785:2850396|2839531_2841511_+|WP_001237395.1|DBSCAN-SWA MRLKLIVKSFALAGLLSSTALTPLFAQEAPKGATASTKQANDALYNQLPFSDNTDFTNAHKGFIAGLPEEVIKGEQGNVIWNPQQYAFIKEGEKSPDTVNPSLWRQSQLINISGLFEVTDGVYQIRNLDLSNMTIIEGKEGITVVDPLVSAETAKAGMDLYFKNRGNKPVVAIIYTHSHVDHYGGVRGVVDEADVKSGKVKVYAPAGFMEAAVAENIMAGNVMSRRASYMYGNLLKPDASGQVGAGLGTTTSAGTVTLIAPTNIIDKDGQKEVIDGLTYDFMLAPGSEAPSEMLWFIEEKKLIEAAEDVTHTLHNTYSLRGAKIREPLPWSKYINEAIVRWGDKAEIIMAQHHWPTWGNENVVGLLKSQRDLYRYINDQTLRMANEGLTRDEIAANFKLPDSLAKTWANRGYYGSISHDVKATYVLYLGWFDGNPATLDELPPEEAAKKFVEYMGGADAILQKAKADFDQGNYRWVAQVVSKVVFADPNNQNARNLEADALEQLGYQAESGPWRNFYLTGAQELRNGVVKGPTPNTASPDTVRAMTPEMFFDFLAVHINGEKAGNARAVFNIDLGSDGGKYKLELENGVLNHTANAEAKDADATITLNRDTLNKIILKEETLKQAQDKGEVNVTGNAAKLDEMLGYMDKFEFWFNIVTP >NZ_CP007598|2837785:2850396|2842511_2843111_+|WP_000940753.1|DBSCAN-SWA MAHLQLVKQTSSGLLLPATPESGDFLRSVKIGEWIHADFKRVRNYAFHKRFFKLLQLGFDYWMPTGGTVTSREQKLISGFVNFLCDSAGQEYTPALNEAAEQYLHNVATLRTGDVALLKSFDAFREWVTVQAGFYTEHFYPDGSRGRRAKSIAFASMDETEFQQVYKAVLNVLWNWILFRKFSSLEEVENVAAHLLEFA >NZ_CP007598|2837785:2850396|2847703_2848183_+|WP_001541990.1|lysis|DBSCAN-SWA 
MFVGLLLVSLIVAGRLANHYRNNAITYKEQRDTVTHRLTLANATITDMTKRQRDVAALDEKYTKELADAKAENDALRDDVAAGRRRLYVNATCPAVPTGKSTSTARMDNAASPRLADSAQRDYFALKERVKTMQKQLEGAQAYIRTQCHGNAGKTSNQW >NZ_CP007598|2837785:2850396|2843107_2843335_+|WP_000784710.1|DBSCAN-SWA MKMTWFQHPVCTTEEADELVAGYRRRGVKVERYGEAEVLELESNNTPQRWTVEELKEIRIAALADLRALKKLEAA |
15 | Salmonella_phage(33.33%) | integrase,lysis,holin,tail | attL 2837621:2837650|attR 2857242:2857271 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotations match phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_4 |
3077873 : 3092149
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_CP007598|3077873:3092149|DBSCAN-SWA TGTGACCGCTTTTTCAACCCTGAATGTTTTGCCCGCCGCCCAGCTCAATAACCTTACTGAGCTGGGCTATCTTGAGATGACGCCTGTTCAGGCCGCAGCATTACCCGTCATTCTGGCGGGTAATGATGTGCGTGTGCAGGCCAGGACCGGTAGCGGCAAAACGGCGGCGTTTGGTCTTGGGCTCTTGCATCGAATTGACGTCACTCTGTTCCAGACACAGGCATTAGTGCTGTGCCCGACGCGGGAGCTGGCGGATCAGGTTGCCGGAGAGTTACGTCGCCTGGCCCGTTTTCTGCCAAATACCAAAATTCTGACCTTGTGTGGCGGGCAACCCTTTGGCGCACAGCGCGACTCGCTTCAGCACGCTCCGCATATCATTGTCGCGACGCCGGGGCGCCTGCTGGATCATTTACAAAAAGAAACCGTATCGCTGGATGCGCTGCATATTCTGGTAATGGATGAAGCAGACCGAATGCTGGACATGGGATTCAGTGACGCCATTGATGAGGTGATCCGCTTTGCGCCTGCGACGCGCCAGACGTTATTGTTTTCAGCAACCTGGCCTGAGGCCATCGCGGCGATTAGCGGTCGTGTACAGCAGCAGCCAATACGTATTGAAATCGATACGGTAGATGCGCTACCGGCTATCGAACAACAGTTCTTCGAAACGTCTGCGCATGAAAAAATTTCGCTGCTACAAACGTTGCTTAGCCAGCATCAGCCAGCGTCCTGCGTGGTATTTTGCAATACCAAAAAAGATTGTCAGGCCGTTTGTGATGCGCTTAATGCGGTAGGACAAAGCGCGTTGGCGCTCCACGGCGATCTGGAACAACGCGACCGCGACCAGACGTTGGTGCGTTTTGCAAACGGTAGCGCGCGCATTCTGGTTGCCACCGACGTTGCCGCGCGAGGATTAGACATTAAATCGCTCGAACTGGTGGTTAACTATGAACTGGCCTGGGACCCGGAGGTGCATGTCCATCGTATTGGCCGTACGGCGCGCGCGGGAAGCAGCGGCCTGGCGATCAGTTTCTGCGCGCCGGAAGAGGCGCAGCGGGCGAATATTCTTTCAGAAATGCTGCAACTCAAGCTGAACTGGCTGAATGCGCCCGCCCGGCAGCCGTCACTCCCTCTGGCCGCAGAGATGGCTACCCTATGCATTGACGGCGGCAAAAAAGCGAAAATGCGTCCGGGAGATATTTTGGGCGCGCTGACCGGCGATATTGGATTAGACGGGGCGGATATTGGCAAAATTAACGTGCATCCAATGCACGTTTACGTCGCCGTACGTCAAGCAGTAGCGCAAAAAGCCTGGAAGCAGTTGCAAAACGGGAAGATCAAAGGCAAGTCATGCCGGGTACGGCTATTAAAATGATCGATAGCATGGCGTCTCAGCGGAGTGCTGAGACGCCTGCAGATTATTTCACTTCGATAACATCAAGCCGCAACGCCTCTAAGGCGGTGTCATCTTCTTCCGGCTGCCAGCCAGCGGGCTGCAAGGGAATCTCTTCACGATCGAACGCTAAATCGCCGCCGTCGACGACCTCGGAACCGTGAGTGATTCCTTTGAAATCGAACAGGTTAGTGTCACAAAGGTGAGACGGCACGACATTCTGCATGGCGCTAAACATCGTCTCGATCCGTCCAGGATAGCGCTTATCCCAGTCGCGTAGCATGTCGGCAATCACCTGGCGTTGCAGGTTTGGTTGCGAACCGCACAGATTACAAGGAATGATAGGGAAGGCTTTGGCCTCAGCAAAACGGACAATATCTTTCTCGCGGCAGTAAGCCAGCGGGCGGATCACGATATGTTTGCCGTCATCGCTCATCAGTTTCGGCGGCATCCCTTTCATTTTTCCGCCATAGAACATATTCAGAAACAGGGTTTGCAGAATATCGTCGCGATGGTGGCCCAGGGCGATT
TTGGTCGCGCCCAGTTCAGTCGCCGTACGATACAGGATACCCCGACGCAAACGCGAGCACAGCGAGCAGGTGGTTTTTCCTTCCGGAATCTTTTCTTTCACAATGCCGTAGGTGTTTTCTTCGACGATTTTATATTCTACGCCCAGCTGCTCAAGGTAGGCTGGCAGGATATGTTCCGGAAAACCTGGCTGCTTTTGATCGAGGTTGACGGCGACCAGTGAAAAATTGATCGGGGCGCTTTGCTGCAAATTACGTAAAATTTCCAGCATCGTATAGCTATCTTTGCCGCCAGAAAGGCAAACCATAATGCGATCGCCTTCTTCAATCATATTAAAATCGGCAATCGCTTCGCCAACGTTACGGCGCAGGCGCTTTTGCAACTTGTTGAGGTTGTATTGTTCTTTCTTTGTATTCTTTTGAATTTCTTGCATTATTTCAGTTCTCTGGTACTAAATGGGGCAAATTGGGGGCAAACTTTGCAACTACGATAACCGTGCATTCAACATCGCTACTTGTTCGTCGTTCATGTCATCAATCCACATACCGTAAATTTCATACACCATCTGCGCAGTTTCATGCCCCATTTGGCTGGCGATAAATGCCGGGTTCGCTCCTGCCGTCAACAGCCAGCAGGCAAAAGTATGTCGCGTATGGTACGGATTACGGCGGCGAATACCAGCACGTTTTACTGCCGCATTCCACCTTGCCCCTAAACTGCTTACCGAGTAATAAGGTTTCTGTTTTCCTTTACTCATCCTGGGCATGAACCTGTTCAGACTTCCGCAATACCATTAACAGGTTCTGGATATAGCTGTTTTCCTGTTCCATTTCGAGAACGGTAATATGCCTGCGTTGAGCGCGGGTGGCGTGATGCAAATATTTTTTGTCTGTGGCAGCGTAAGTAAAAATGTGCAACACGCGCTGGGGGAACGGCAGGGTGGCGACGGAAACTTCGCAGTCCTGACATTCGTCGTTGTCACTGGTGTCTGTCCCGCTTTCTTCGGTGCCAGCGTCGTCGGCGCATTCCTCCGTTTTTTCGTCGTGGGCGTCAGTAGCGGGCGTGCCGGGGATGAGCATAAATGTTTTGCCATCTTCGCCGCCCAGCTCATATTTCTGACAGAAGGTGGTATCAAAACTTCCCTCCGGCGGAAGTTCGTTAACAGCGGGGGAATTAACGCGAACCGGTTTTGCAAAATCTGTCGGTTCGAATCCGGCGTCAAGCATTGATACGGCAAGGCGTGCTGTGGCAGCGGCTTCGCTGCTTTCAGTTTTCCACCAGAAAGCAAAAGGGAAGCCGAGACGTTTCCGGGCGCTTTCATTTTTAACCTTAATATAATATGAGTATTCTGTTTTATTGTCGCTCATTATCATTACCCTTATTACAAATCATACTTAATGAAGACTTTCATTTTTCATTGAGCAGAATGCGTTCGTGACGAGCTCTTTACACTCCATCAGTTTCAACAATGAAATAAAATTTTCGGGAGGAGCTTCGTTCATTTTTAACAGCATTATTGCGGCTGTTATTGACCTGTCATATGCGCCTGTTTCACTGTCTGTATATGCAACAAGAGTTCTGGAAACACTTTCATCGTCACAGTCCCGGGCATAAGAAACAACACAGGGCATATTGTTTTCGATACATAACGTACGAATTTTTCCTGAAAGCTGGCGGAGCTCTGCAATTACTGATTCAGGTACATTTTTCATATAGATTCCTTTTTTCAGGTTGAGTGAATTCCTGCCATTGCAGGCATATTTAAAAACAGGATGGTTTAAACGATTACTGTTCTGTTACCATGATTCAGCTTTGGCAGGCAGACCATTTCTGTTCAGCCAGACTTTTACCATGCAATCGGTAATACATCTTTGCGTTGTTAAATCACGGATATATAAACGGCGTTTTTTAATGTTATTCGCTGAGGCGATATAAGTACGACCATCATGGATAACATAATCTCCCGGCGTAATACACTGACGCGGTATTTCATCCGTTCCGA
AGTGATGAGCAATCATAGCCCCTCCATTTCTGGTAAATAAATTTTGTGGTGCGGTGCCTGGTGCCTCCAGGTGACATTAACCAGTTAACAATTAATGCCGACTTAAACCACCCATACTGATTCAGGGAGTTTTAACTGTGCCGCGTGCGCTTAGCCGCATTCACCGCATCACAAAATTCACTTTAAAAAGGGCGGACATCAGAAAGGACTAAGAAAAACTGATGCCGCCAAGTACTACACACAGCATTATTGTCGCAGTGGCAACTACAACCGGAGGCGCACTTCCACTATTTGGATTTACAGACAAGACCGACTCAGAAAACATCAGAAATGCGCCTTCGTGTTGTGCCCGGCTTTATTTAACCACCTCCGGGCTTCGGTGGTCTCGGCTATACCCCTACAGCGAGAACCTGTGTTAACATTTCAATACCCTTACAGTTGAGAGTTATTGATATGTCAGAAACCGCTCTGGTTATCGTAAAATTCCTAATTGGTAAATCCGTCGGACAATTTATGCTCACAGTGGCTTTATTTTTCTTAATTATCATCTTCATTCCTAGAGATATTACGGAGCTTATTGAGGCGCGTAGCGATTTACCATATGCCGTTCAGATTTTTAGTTTTGCTGTGGCTTACCTTATAGTGCTGATCCTCAAAGTCACTGGTTATTTTTTCGTGTCGGCGCTGCCGTTGTGCCAGCGTAGGGGCAGGGCAAAACGCATGTTAAAAACGCTTAATTCATTGAGTACTGAACAGCTGTTTTTACTTGAACCCTTTCTTAAAACTCATTCTCCCACTTTCCGGGCGTCCTGGGATAACCCTGATGCAGATGCTCTGGTTAAGGCAGGTATCGTTCGTCCGGCTGGTTCGTGTATCGACGGTGTTTCTGTGATGTTCAAAATCGAACCCGAGTATGAGTCGTTAATGCTTTCCACCTGGAATCCCTGCACAAAACGGTTCGATATTAGCCGTTAGCTGAAAGCGCCAGCAGAAACTCACTGAAACTGAGTGCTTCTTCTCCTTCGTCAAGGCTTTCAAAGTATTCTTCGTAAGCCTTTTCCATGATTGTGTCGAAATCCATATCACTCACCTGAGTTTCTTTCTAACCAGCGACGTGCGCCTGTTTCAGTTTTAAACGTCCTGCTTCTGGTGTACGTCATGGCGGTGAACGTTCCATCCTGGTTGGGGAACACGCCACACACCAGGGATTCGTTATTGCCGAGGTGGATTTTTTGCAGCTTGTCCATTATCACCCCGGATAATACTGTTCCTGTAGCTCGCATTGAGCCAAAAACATATCCCATCCATTGTCGCGCAATGCTTGGAGAGCTGCGCTAAAGCTGACGTTGCAACTATCCATCAAATACTGAATCACTTCATTCCTTCCCATCTTTCCCTCTCCCCTTAACGCCGGGTGGCGGAACTAAAACCTACAGCGCCGTGCTGTTTCTGAGATTATATTAGCGATATTCATATAGTTGATCAAGATAAATATGCATATATATCATAAATATGATCTATCCTAATGAAAATAAATGTGTTTTATCTGATGCAAGAGGGGGAGGGAGGAGCTTTAGCCAAAAGAAAACCGCCGGGAGAGGCGGTTTGATGTGGTTGGTTCGTCACTGATTTTTTAGGCGCTTTTGTGCAGCGAGCATGTTCTGGAAAGCCTCTTTATATAGCTCATTCTGACCTTTAAGCCGGTCAATGAGTTTTTCTTTCTCAGATTCAGGGAGTATATCAAAAAGGTTTAGTAAATCAGCCTGTTGTCTGCTCACCATTCGCCAGCCACCACCTTCGAAGTTGTCATCGTAAGTACCAGAAGAACGAACGTAGTTCATTAGATCGGCCAAATCCGGTCGTAACTCTTCGGGTTTAACTCTCAATAGAACAGAAAATTTTAAGGCCGCATCAGTGTTGAGAGGTGCCTTACCGTTCAAATAGTGACTGACGGTAGATTGTGTCTCAAAGCCCATAAGATCAGCGGCGATCTCCTGAGT
AAGTTTGAGGTCTCGCTTTTTGGCGTCCCAGATGGCGCGTAAGCGCTGGGTAGCTTCTGGTGGAGCTATTTCTTCACGTTTTTTTCTCATGCGCTCATCTTATGAATGTGACTCATAAACTCAAACTGATATAAGTATTGATCATTTAAATTAGTATGGTTAATATTTAGCGAGAATTACTAAGGTGACCTTTATGACGTTAGATGAATATTTGAAAAAAAATCGTGTACGACAGTCTTGTTTGGCCACGCTGGCTGGTTGTTCGCAATCGATGATTAGCCTTGTTACTACTGGCCGTAGTCAGTTAAGCCCCGAAAAGGTATTGCGTATCGCAGAGGCTACGAATTTCGAGGTTACACCTCATGAACTCCGGCCTGATAACACTTACGGAGCTGAGGAGGATGACGGGGTTAACCATTTATTCGACCCGCCACTACCTGGACAAGGCAGAACGTTGTGGGGATGTGTACCAGGCGGGCAGAAGAGGGGGGATTTTCCCGTCAGAAGAGGCTTATCGTGCCTGGAAGAAACAGGCGAAAGTGGACGCTGACCTGATTTGGAAGCTGCCTGACGGTGAGGTACGTCGTTACGACAGGCACCACAACGTAATTTGTCGTGAGTGTCGTAAAAGCGAGTACATGCAGCGGGTACTGGCGTTTTATCGGGGAAACTTTCAGGAGGTGCTGTTGTGAGCCAAATTAACAATCGGAACTGCGTGAAGTGAAAGAGAAAGCATAATCCAAATATGAATAATTAAATTTAGTGATGTAAATAAACTTTAATCCTTAACCGGATGGATTCCTGCACGCTCAGAACACCAGGAGACCGCCCGAAAGGGCGGTAGCTCCATTGCTTAATTGTCTAAAATCGTGCTAAATCTTTTTATTACCATTAAGAAAGTTATGACAGTGATAAAAAAGGATGTATAGGCTAAAAAGCTAACGATATATGCGGGTGCGCGAAAATACCATTTCATTAAATCCACTGCATTGTCAGGCAGGAAATATATTATTACTGAAAATATAACCAATACTATTGAAGTTAATATTGCATAAGCGACGTTGTGGCATAACTGCTCATATATGGTTTTGTTAGTGTTTAATGATATCAATTTGTCTCTTGATTTGTTTCCTTCAATTATATCTGATATCTTAGTAATGGTTTTTTGTTTTTGTTCATAAATCATTATTACTGCACTCATTAATAGTGCTGTTGTAATAGCCCCGAAGTTAACGAAGACGGAAGCAATTGCCGGTTTCATTATTCCGTATGTCCAGCACAGAACGAAAGAAAGAGATAGAGGAACTATAAAATGTACGGTAATGTCGCTCATCAACATTGTTCCACGCTGATCTGACATTGTTTTGTAGTGTTTTATTATTACACCCAGCACATTTATTTTATTCATATAATCACCCCTTTATTTCCACAGTGCAGTTCTTCCAATATATCATTGGAAAGGTTTTTTATCGTGTCATGAAGTGCTGTTAGATCAGGTATACCTGTTAATGGGTCGATTTTTAGATCATTATCATCTAACTCTGCTGAAATTCCTTTTTTTAGTATGGTATCATAATTGAAAACGACAGTCCGACTGCCGAGCTGTAAGCTTACTTTTATTGCATCACATTTATCTTCAATAATCTCAATGATGTTTCCTATATTCTTGTTTCTTAAATCCCTGAAACTTCCGAATATGCCATCGTTTGCTTTTATTATTAAGTCTGTCTTGATGTTTGTTTTGTTTTTACCAAAGGAATCAGCAATATCTTCTGGTGCTTTATATCCTTGAGCTTTAATTTGTTTCAATTCAGAATTGAGAATGTATTGAGGGATTTTCTTATGATGTAATGGATTGATTCTTGCTTCTAGTTGAAATTGTTTTTTTAGATATTCAGTGATAGAATCAGAAAGAACACCTCGAGCAGAAATATTATCGCATGAATGGAATGCAATAATTCCTTCTTCAAGATTATCTGGTAGATATATTAAAATATAAC
GCTCTTTGAGTGTTACATCATAAGCAGTTGTTCTGTAATGGACTTTTTTGAGTTTTACATCTTTTATTTCACTGCTTTCTCCATATTTTCCAACTTTTATATAACCATATATGATTTTTTTTGTGTTATCAAAGTGAAGTTTAGTATGTTGTTCCAGAGATATTTTAGTTTTTGAAACGCCGAACTCGATGGGGGTGTTTTTATAAAGAGTAAAATAATCAACAAAAAGTTCATATGCCGTTTTTTTATTACTTAAACCTAAGTCATTAAGTTTTTTGCTGGCTCGACTGCCTTTATGGGTCAATACGCGGAATGAATAGAAATTAACGCTGTGCATGAAAAATCCTTTTGAGTATACAGGAATATACTAGAACATAAATGTATGCAATGCATAAAGGAAAAGCTACCGCAGGGCGAATTCACCCACCGATAGCTCTTAAATGATTGTTTTCAAGCGATAATACATAAATTTTGGTCTGTGTAAAGAGGGATGTATCGTCAGGGCAAGGGAGATGTATTAAGGGGTATTGGTAAAATTTGCGTTGGGGATAAAAACGGTTTGCGGGAAAAGGAGAGTTAAGTAGAATTGTTGCGGGTGCTTGAGGCTATCTGCCTCAGGCATGAACACCAAAAGGCAGATAGAGAAAAGCCCCAGTTAACATTACGCGTCCGGCAAGACGCTTAACATTAATCTGAGGCCAATTTCATGCTTTGCACATGTAGGTTAGCCTCTTACGTGCCGAAAGGCAAGGAGAAGCAGGCTATGAAGCAGCAAAAGGCGATGTTAATCGCCCTGATCGTCATCTGTTTAACCGTCATAGTGACGGCACTGGTAACGAGGAAAGACCTCTGCGAGGTACGAATCCGAACCGGCCAGACGGAGGTCGCTGTCTTCACAGCTTACGAACCTGAGGAGTAAGAGACCTGGCGGGGAGAAATCCCCGCCACCTCTGACGTGTCAGGCATCCTCAACGCACCCACACTTAACCCGCTTCGGCGGGTTTTTTGTTACCCGTAAAATAAAAATTCATAAAAATGATCAACTTCCAGATTGGTTGCGCAACAAGTGAAAAATGTCCTTGCTGGTGAACATAAAATAAGCAAATTTATATAATGAAATAAATAGTCGCAGTGTTTATATTTCCCGCCTCAACAGAAATCGCGTTGAAATCGCACCTTTTCATTTTTCCTTAGTTGTCTGGAGGTAACGTGAAAAAACTCAAGGATTTATTAGAGTTAGATGAAGACGGGCTTTATGCAGTACGTGTAAAAAATGGTGAAATCTCATTCTGTACGCTAATTCCTGACGACCATCTGATTCTGTCTGTTGAAGCGTTTATTGATTATCTGATAAGACTGGGTTTCACTGTCAGTTATTAATGTTTTAATATGTTACGGCTGACCTGAACAATCAGCAACCTACAGCGCCACCGGAGAGAACGATGGCGCATCTACAACTTGTTAAACAAACCTCATCAGGGCTTCTGCTCCCGGCAACGCCGGAGAGTGAGGACTTCCTGCGCTCAGTAAAAATCGGTGCGTGGATACACGCCGATTTTAAGCGAGTACGTAACTACGCGTTCCACAAACGTTTTTTCAAGCTCCTTCAGCTTGGTTTCGATTACTGGACTCCGATCGGCGGGGCGATCCTGCCTCAGGAACAGGAGCTGATTACCGGCTTTGTCGATTTCCTGTGTGAGTCAGCAGCGCAGGGCCACAGTCCCGCACTCAGTGACGCGGCGGAACAGTACCTGCATAAGGTTGCTGTCAACCGAACGCTCGATGTTGCGCTGCTCAAGTCCTTCGACGCTTTCCGCGAGTGGGTAACCATTCAGGCCGGGTTTTATACTGAGCATTATTATCCGGATGGCAGCCGTGGGCGCCGGGCGAAATCCATCGCTTTTGCGAATATGGACGAAACCGAGTTTCAGCAGGTTTATAAGGCCGTACTGAACGTCCTGTGGAACTGGATTCTGTTTCGTAAATTCTCCTCTCCGGAA
GAGGTCGAAAATGTCGCAGCGCAACTGCTGGAGTTTGCGTAATGGCGGATTTACGTAAAGCGGCGCGGGGCCTGATGTGTACGGTAAGAATTCCCGGCCATTGCAACCATAATCCTGAAACGTCCGTACTGGCACATTACCGGCTGGCGGGTACGTGCGGAACGGCGACAAAACCAAACGATATGCAGGCAGCAATTGCCTGTAGCTCGTGCCACGATATTGTCGATGGGCGGGTAAAAATCGACGACTTCACGAAAACAGAAATTCGCCTGATGCACGCAGAGGGCGTTTTCCGCACGCAGGAAATCTGGAGAGAGAAAGGCATTTTATGATTTACCCAACAAACACCGGAAAAAGCGGAGAACACCTTCGTCTCAGCACGCTGGAAAGTGTGTGGATTCAGGGGAAATTGCGTATGTGGGGGCGCTGGTCATACATTGGTGGCGGCAAAACAGGGAATATGTTTAACCAGTTGCTGGCGTCCAAAAAACTGACGAAGACGGCCATTAACGATGCTTTGCGCCGTATGAAAAAAGCGGGGCTGGAGAAACCTGAACTGGAAGTGTTCCTGAGAGAGATGATCAACGGAAAGCAAAAAAGCTGGCTGGCACACTGTACGGATACGGAAGCGCTGATTATCGATCGGGTTGTAGGCGAGGTACTGACGGATCATCCGGGGCTGCTTGGTATCCTGAACCAGCGTTACGTGGGGCGGGGGATGAGTAAGAGAAGGATGGCCGAGTTACTAAACGAACAGTACCCAGAGTGGGCGTTGATTACATGCCGACGCCGTGTTGAGCAGTGGTTGAGTATCGCTGAGTTCATTTTGTATTCACCTATGAGAAAAGCGTTCGATTATGCTTAAAAAATCATTTGCAAAATGAGCCACAAACTGCTTCAATTCCAGTACGCTTCGCAAAGCTGTATCGCGAGGCTAATGACAGACATGAACGCATTTTGAAACCCGCCATCGTGCGGGTTTTGTCGTTTCTGCAATACAGAAAAATATTCGCCAGTGTAATCCGGGTTGTTAAACATGGTGCGTTTTAAATATGTTTTATGTACATTAAATTAATGTGAAATGTTTTGATAAAATAAAAATGTAATAATAACTTTACGTTTATTGACACAATGAATTGTTGAAACGCCTGTTCTGACTCGTATTATTTTCATCGGTCCGAAGGGGATGATGGATAACCTTCTTCTGCCCCCGAGGATCAGAGAGCCGGTTTTTTTGTGCATCCTGGAAAATTCACGTGAAGGAACCGGCTCTCAACCAGAGAAGAAGAGGCGTTTTTTTCGATACAACTATCGTAATTACTCCGTTGGTGGTGCTGGCACGTAAAGTGTGGATAGTGCGTTTATGATGAGTGCATGATGTATATCCTGATACAGCATCCGGTTTGTGGGGGCGGAGAAGCCCCGATGGACTCAGTGCCACAATTTTTTATTGCATTCAGATAGCGTACTGTGAATCGGATAAATGAGAATGTCAGTGTGTTGGTAATGCGGGGTTCTCAGTGCGCTATCTGAATGCAGTGAAATCTGCTCTGAGCAGAGCTAAACAGCATTGTCTGCGTTTGATCAATTTGTAGCGGGTCATAGTGGCTGACTAAAGACTCTCCGGGGCATCCCGGCACTGCATTTATTACTAAAAATCTTCATATCACAGAGGCAGAACATACGGAAAATTCTTGTCAATACAACACCTGACACAGCAATATTTTTCGGGAGTCCCCGGCGCCTCAGGTTTTTTATCGCCATCAATAAAACTATAATAATAACTCCATGTTATGATTACCACCTCTCTCTCATGAGGTGGTTTTTTTATTCCCGCAAATTGCAGAAATAAGATGGAGTCATCAGAATATGCCCTGATTGTATTTTGTCTTTTTTGAATTAATGCAAAACATTTGGAATAAATAAACATCTAATGATAAATTTACATTTCTTGACGCGACTGCTTGTTGAAATGAAATTTTTATGATTTATTATTGTCG
ACAGTTTGGCGGAGGTGACTGGCAGATTTCTCCACTCCGTCGAATAAGAGAGTTGATTCTTTATACCTCCTGAGTCGTCTGATTAAAGAATCATCACTCGATTTGGCATTAAGGTGAAATTAAGATTCCATTGATATAGGTATCGTTCTTACTCTTTGTGGTGCAGGCATATGGATATGGGGTGGTTACATTAATGTTCCTTTAATTTAACCTCCTGATAGTGAATCAGGCACCGCAATTTTTTTACTGCATTCAGATGGCGTACTGCAAAAAAACGGTCATTGCTTGCGCCACTGTCTGATGCTCTTGTAACACATACGGGATTTGTGGTACGCCATCTGAATGCAGTGAAACCCACATAAAGTGGGGCATAAACAGGATATGAGGTGCGTTTATTTTCGTCTGCGGGTCATGGTGACTGACCAACGGCCCTCCGGAGATAATTCCGGCACTGCATTATTTATTGAGGTGTTCCCCAGTGCGGGGGTGACCGGGAAAAATGTTCTGCCGATGGTCACAGACACATACCGGGCTAATATGTGTTTTCGGGAGGCACCCGACACCTCTACTGTTTTTCCAGTCGATAACTATAAAACATGCTTCAGATATTGAGCACCGCCTCCCGTGAGGCGGTTTTTTTTATTCCGGGAAAAAGTTCTTCCCGCCATATAATAAAGTTAACGTTTTCAGACCAGGGTGCGGGAAGTATCCGGGGCGGGAAATAATGAATTAAAAAAGAAGCGCGGCTGTCGGATTTAAGCCGCGGGACAATGTCCGTGATAGATAGTTGAAAAATTTCAGGCTATCCCTTTCGGGAGGTCGCCATTATTTTACTCATAACAAAATAAGACCGGAACCCCGGAAACAACCTTATTTTCCGGTAAGGCTTATTTCATTCCCCGCGCCACGCCCGGCGCACATTCATAACTAACCACGGAGCCTTTCAGGGGTGAGCTTACGGGATGGTCAGTGTGACTTTCTCTGTGGGCTGGTCACCCCCGGGCGCAGGCTCACCCACTAAAAGGAAAAGTCACGATGTTAGGTATTTTCAGAAAGAAAACCCGCAAGGCTATTGTTGAAGTGAAGAAGATGGAGAACCGGGATGCGGTGGAGGCGACCGTCTGGGGCGCATATTCCATTGCATACGCCGATGGCACCTGTGACGCGAAAGAAATTGCAGTGCTGGAGAAAACCATCGCGGCACTTCCTGCCTTTGCGCCGTTCTCCGGCGAGATTGCCCAGATGAGTGCCAATATCCGCGCCCGCTATGAGGCGTCACCGCGTAGCGCGAATGCTCAGGCACTGCGTGAGCTGGCTGACGTGGCAGGAACAGCAGAAGCGGTTGATGTGCTGTGCCTGTACTGGCCTCGTTCCTGTCCCGCCTTGCTGACTACAACGGTAAACCGCTGGATGCGCTGTGTGCAGTGGTGATGTCGGTGCTGTCAGTGAAATTTCTGACCTTCATTCATGACCAGGACATTTCATCGCTGACCGGGGTTTTTTCACGGATGCGGGGAGGAGGGAGTGGTCATGGAAAGTAATCTGACCGGCACACTGAATGCGGGCCTGTGCCTGGTGACAGTGCTGGCCCTTTTTCTCTACCGCCGGAACGGCGCCAGATACAAACCGGGAATAGCCTGGCTGTCGTACCTGCTGATGCTGGGCTATGCGCTGGTTCCGTTCCGTTTTCTGGCCGGACATTACCCGTCTTCATCCTGGCCTGTGGTGCTGATGAACGCGCTGTTCTGCGGGCTGGTGCTGTGGGCGCGGGGTAATGTGTCGAAAATACTTTCACTGCTGAGGCTGCGATGAAACCGAAGGACGAAATTTTTGATGAAATTCTGGGTAAGGAAGGCGGCTACGTCAACCATCCGGACGATAAAGGCGGGCCGACAAAATGGGGTATTACGGAAAAAGTTGCCCGCGCCCACGGATACCGTGGTGATATGCGCAATTTAACCCGTGGACAGGCGCTGGAAATTCTGGAGACCGACTACTGG
TACGGTCCCCGCTTTGACCGGGTGGCGAAGGCCTCGCCGGATGTTGCTGCCGAACTGTGTGACACGGGCGTGAACATGGGGCCGTCGGTGGCAGCGAAAATGTTGCAGCGCTGGCTGAACGTGTTCAACCAGGGCGGGAGGCTGTATCCGGATATGGATACGGACGGGCGCATCGGGCCGCGAACCCTTAACGCGTTACGTGTTTATCTGGAAAAGCGCGGTAAGGATGGCGAGCGTGTACTGCTGGTGGCGCTGAACTGCACGCAGGGGGAGCGCTATCTGGAGCTGGCGGAAAAGCGGGAGGCTGACGAGTCGTTTGTCTATGGCTGGATGAAAGAGCGCGTATTGATATGA
Protein sequences of DBSCAN-SWA_4 >NZ_CP007598|3077873:3092149|3088123_3088660_+|WP_000640113.1|DBSCAN-SWA MIYPTNTGKSGEHLRLSTLESVWIQGKLRMWGRWSYIGGGKTGNMFNQLLASKKLTKTAINDALRRMKKAGLEKPELEVFLREMINGKQKSWLAHCTDTEALIIDRVVGEVLTDHPGLLGILNQRYVGRGMSKRRMAELLNEQYPEWALITCRRRVEQWLSIAEFILYSPMRKAFDYA >NZ_CP007598|3077873:3092149|3087836_3088127_+|WP_000774470.1|DBSCAN-SWA MADLRKAARGLMCTVRIPGHCNHNPETSVLAHYRLAGTCGTATKPNDMQAAIACSSCHDIVDGRVKIDDFTKTEIRLMHAEGVFRTQEIWREKGIL >NZ_CP007598|3077873:3092149|3082248_3082770_+|WP_000004762.1|DBSCAN-SWA MSETALVIVKFLIGKSVGQFMLTVALFFLIIIFIPRDITELIEARSDLPYAVQIFSFAVAYLIVLILKVTGYFFVSALPLCQRRGRAKRMLKTLNSLSTEQLFLLEPFLKTHSPTFRASWDNPDADALVKAGIVRPAGSCIDGVSVMFKIEPEYESLMLSTWNPCTKRFDISR >NZ_CP007598|3077873:3092149|3086501_3086714_+|WP_000882662.1|DBSCAN-SWA MLCTCRLASYVPKGKEKQAMKQQKAMLIALIVICLTVIVTALVTRKDLCEVRIRTGQTEVAVFTAYEPEE >NZ_CP007598|3077873:3092149|3080542_3081160_-|WP_001676915.1|DBSCAN-SWA MSDNKTEYSYYIKVKNESARKRLGFPFAFWWKTESSEAAATARLAVSMLDAGFEPTDFAKPVRVNSPAVNELPPEGSFDTTFCQKYELGGEDGKTFMLIPGTPATDAHDEKTEECADDAGTEESGTDTSDNDECQDCEVSVATLPFPQRVLHIFTYAATDKKYLHHATRAQRRHITVLEMEQENSYIQNLLMVLRKSEQVHAQDE >NZ_CP007598|3077873:3092149|3091603_3092149_+|WP_000802786.1|DBSCAN-SWA MKPKDEIFDEILGKEGGYVNHPDDKGGPTKWGITEKVARAHGYRGDMRNLTRGQALEILETDYWYGPRFDRVAKASPDVAAELCDTGVNMGPSVAAKMLQRWLNVFNQGGRLYPDMDTDGRIGPRTLNALRVYLEKRGKDGERVLLVALNCTQGERYLELAEKREADESFVYGWMKERVLI >NZ_CP007598|3077873:3092149|3083417_3083885_-|WP_001227859.1|DBSCAN-SWA MRKKREEIAPPEATQRLRAIWDAKKRDLKLTQEIAADLMGFETQSTVSHYLNGKAPLNTDAALKFSVLLRVKPEELRPDLADLMNYVRSSGTYDDNFEGGGWRMVSRQQADLLNLFDILPESEKEKLIDRLKGQNELYKEAFQNMLAAQKRLKNQ >NZ_CP007598|3077873:3092149|3087237_3087837_+|WP_000940751.1|DBSCAN-SWA MAHLQLVKQTSSGLLLPATPESEDFLRSVKIGAWIHADFKRVRNYAFHKRFFKLLQLGFDYWTPIGGAILPQEQELITGFVDFLCESAAQGHSPALSDAAEQYLHKVAVNRTLDVALLKSFDAFREWVTIQAGFYTEHYYPDGSRGRRAKSIAFANMDETEFQQVYKAVLNVLWNWILFRKFSSPEEVENVAAQLLEFA >NZ_CP007598|3077873:3092149|3087004_3087175_+|WP_000734094.1|DBSCAN-SWA 
MKKLKDLLELDEDGLYAVRVKNGEISFCTLIPDDHLILSVEAFIDYLIRLGFTVSY >NZ_CP007598|3077873:3092149|3084157_3084487_+|WP_001676916.1|DBSCAN-SWA MNSGLITLTELRRMTGLTIYSTRHYLDKAERCGDVYQAGRRGGIFPSEEAYRAWKKQAKVDADLIWKLPDGEVRRYDRHHNVICRECRKSEYMQRVLAFYRGNFQEVLL >NZ_CP007598|3077873:3092149|3081187_3081505_-|WP_000800272.1|DBSCAN-SWA MKNVPESVIAELRQLSGKIRTLCIENNMPCVVSYARDCDDESVSRTLVAYTDSETGAYDRSITAAIMLLKMNEAPPENFISLLKLMECKELVTNAFCSMKNESLH >NZ_CP007598|3077873:3092149|3082877_3083033_-|WP_085981757.1|DBSCAN-SWA MQKIHLGNNESLVCGVFPNQDGTFTAMTYTRSRTFKTETGARRWLERNSGE >NZ_CP007598|3077873:3092149|3091325_3091607_+|WP_000445513.1|holin|DBSCAN-SWA MESNLTGTLNAGLCLVTVLALFLYRRNGARYKPGIAWLSYLLMLGYALVPFRFLAGHYPSSSWPVVLMNALFCGLVLWARGNVSKILSLLRLR >NZ_CP007598|3077873:3092149|3079290_3080226_-|WP_001156217.1|tRNA|DBSCAN-SWA MQEIQKNTKKEQYNLNKLQKRLRRNVGEAIADFNMIEEGDRIMVCLSGGKDSYTMLEILRNLQQSAPINFSLVAVNLDQKQPGFPEHILPAYLEQLGVEYKIVEENTYGIVKEKIPEGKTTCSLCSRLRRGILYRTATELGATKIALGHHRDDILQTLFLNMFYGGKMKGMPPKLMSDDGKHIVIRPLAYCREKDIVRFAEAKAFPIIPCNLCGSQPNLQRQVIADMLRDWDKRYPGRIETMFSAMQNVVPSHLCDTNLFDFKGITHGSEVVDGGDLAFDREEIPLQPAGWQPEEDDTALEALRLDVIEVK >NZ_CP007598|3077873:3092149|3085199_3086132_-|WP_000556389.1|DBSCAN-SWA MHSVNFYSFRVLTHKGSRASKKLNDLGLSNKKTAYELFVDYFTLYKNTPIEFGVSKTKISLEQHTKLHFDNTKKIIYGYIKVGKYGESSEIKDVKLKKVHYRTTAYDVTLKERYILIYLPDNLEEGIIAFHSCDNISARGVLSDSITEYLKKQFQLEARINPLHHKKIPQYILNSELKQIKAQGYKAPEDIADSFGKNKTNIKTDLIIKANDGIFGSFRDLRNKNIGNIIEIIEDKCDAIKVSLQLGSRTVVFNYDTILKKGISAELDDNDLKIDPLTGIPDLTALHDTIKNLSNDILEELHCGNKGVII >NZ_CP007598|3077873:3092149|3084648_3085203_-|WP_001033796.1|DBSCAN-SWA MNKINVLGVIIKHYKTMSDQRGTMLMSDITVHFIVPLSLSFVLCWTYGIMKPAIASVFVNFGAITTALLMSAVIMIYEQKQKTITKISDIIEGNKSRDKLISLNTNKTIYEQLCHNVAYAILTSIVLVIFSVIIYFLPDNAVDLMKWYFRAPAYIVSFLAYTSFFITVITFLMVIKRFSTILDN >NZ_CP007598|3077873:3092149|3091147_3091336_+|WP_001688615.1|DBSCAN-SWA MPVLASFLSRLADYNGKPLDALCAVVMSVLSVKFLTFIHDQDISSLTGVFSRMRGGGSGHGK >NZ_CP007598|3077873:3092149|3081589_3081811_-|WP_000560208.1|DBSCAN-SWA 
MIAHHFGTDEIPRQCITPGDYVIHDGRTYIASANNIKKRRLYIRDLTTQRCITDCMVKVWLNRNGLPAKAESW >NZ_CP007598|3077873:3092149|3077873_3079247_+|WP_000123686.1|DBSCAN-SWA MTAFSTLNVLPAAQLNNLTELGYLEMTPVQAAALPVILAGNDVRVQARTGSGKTAAFGLGLLHRIDVTLFQTQALVLCPTRELADQVAGELRRLARFLPNTKILTLCGGQPFGAQRDSLQHAPHIIVATPGRLLDHLQKETVSLDALHILVMDEADRMLDMGFSDAIDEVIRFAPATRQTLLFSATWPEAIAAISGRVQQQPIRIEIDTVDALPAIEQQFFETSAHEKISLLQTLLSQHQPASCVVFCNTKKDCQAVCDALNAVGQSALALHGDLEQRDRDQTLVRFANGSARILVATDVAARGLDIKSLELVVNYELAWDPEVHVHRIGRTARAGSSGLAISFCAPEEAQRANILSEMLQLKLNWLNAPARQPSLPLAAEMATLCIDGGKKAKMRPGDILGALTGDIGLDGADIGKINVHPMHVYVAVRQAVAQKAWKQLQNGKIKGKSCRVRLLK |
19 | Escherichia_phage(66.67%) | tRNA,holin | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotations match phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_5 |
3745726 : 3756230
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NZ_CP007598|3745726:3756230|DBSCAN-SWA ATTAGAAATTTAAACCAAAAAACTCTTCAAATTTGCTAACTACATAATCTAAATGCTCTGTAGTCAAGCCAGGATAAATACCAATCCAGAACGTTTGATTCATTATACGGTCGGTATTTGTCAACTCACCCACTACACGATATTTCACATTAGCAAAATACGGTTGGCGAATCAGATTTCCAGCAAACAGTAAACGTGTACCGATTTTTGCTTCATCAAGGAATTTCACCAGTTCGACACGGTTAACACCGCTAGTTTCTTTCAGGGTGATAGGGAAGCCAAACCAGGATGGGTCTGATTTCTCTGTTGCTTCTGGTAATTCGAGGAATTCAGTGCAAGATTGCAAGCCCTGTTTCAGATAGGAAAAGTTAGCTTTACGCTGCTCTACAAACTCTTCTACGCGCTCCAACTGAGCCAGACCACATGCTGCCTGCATGTCCGTGATTTTGAGATTATATCCGAGGTGGGAATAAGTATATTTGTGATCATAGCCTTGAGGAAGTGATCCCAATTGCTGACCAAAACGTTTACCGCAGGTGTTATCGCATCCTGGCGCACAATAACAATCCCGGCCCCAGTCACGGAACGACTCAATAATTTTCTTCAGTTCACCTGACTTGGTGAATACAGCACCGCCTTCACCCATTGTGATATGGTGAGCCGGATAAAAACTAACGGTTCCGATGTCACCAAAGGTACCTACCATCTGGCCTTCATAAGTCGTCCCAAGGGCATCACAGCAGTCTTCAATCAACCATAAGTTATATTTATCGGCAATCCGACGAACTTCACTCAGGTTAAATGCATTACCGAGTGTATGAGCGATCATTATCGCTTTTGATTTCTCAGTAACTGCAGCTTCAATGAGAGAGGCATCGATATTATATGTCGGGATATCAACATCCACGAATACCGGTATTAAACCATTCTGGATCGCCGGGTTAACTGTAGTCGGGAAGCCAGCAGCGACAGTAATAACCTCATCACCAGGTTTGAGAGCACGCTCGCCTAATTTTGGGGAAGTCAGCGCAGTCAGTGCCAGCAAGTTTGCCGAAGAGCCAGATGTAGTCGTTAAAACATGAGGAACCCCAATAAATTCCCCAAGTTTTTTCTCAAAGGCATCATTGAAACGACCAGTAGTTAGCCATCCATCAAGAGACGCCTCAACCATCAATTGTAACTCTTTGGCACCAATAACCTTCCCGGAAGGAGGCACAACGCTTGTACCTGCAACAAAAGGTTTCGGGCTCAATGCCTCATTCGCATACTGAGCGACAAGCTGAGAGATTTGCTCACGCAGGTTATTTGCTGTCATTACTTTGATTCCTTAAACTTATTTTCTTAACGAGTAGTTGCAGACATATAGTCGCTGATTTCACGCTTTGAACAAATCAACATATCTTCGCCGCGAATCCATGCTTTATGCCATTTTACGATGCGACCAAGTGTTTCAGTCAATCCCCAACGCGGATGCCATCCTAATTGCATATTTGCTTTAGAGCAATCCAGTTTCAGGTAATGTGCCTCATGAGGATGATTCTCACCATCCAGTAACCAGCTTGCATCATCACCCCAAAGCGTGACCATCTTGTCAACAATAAATTCGACCGTCTTCGCATCTTCATCACGCGGGCCGAAATTCCATCCTTCAGAAAACTTAGCACCTTCTGTATATAAGCGTTGCGCCACCACAATGTAACCAGAAAGAGGCTCCAGTACATGCTGCCAGGGACGGATAGAATATGGGTTTCGAATAATAACCTGCTGGTTATTTTCAAATGAGCGCAGAATATCGGGAATTAAACGGTCTTTAGCCCAATCGCCTCCGCCTATGACATTACCAGCCCTCACAGACGCCAAACCAACGCCATGTTGCTCATAATTTGCAGGATTGAAGAATGAGTTCCGGAATGCAGACGCGAC
TAATTCTGCACAACCTTTACTATTAGAGTATGGATCGTACCCTCCCATGGGTTCGTTCTCACGATAGCCCCACACCCACTCACGATTGTCGTAGCACTTATCACTGGTGATATTTACGACTGCCTTTATGTTACCTACTTGCTTAACTGCTTCAAGCAAATGGACAGTACCCATAACATTTGTTGAGTATGTTTCGATTGGCTGTTCATAAGATAGGCGCACTAAAGGCTGGGCTGCCATATGGAAAACAATTTCTGGCTTAAATTCTGCAATAGAATTGCGCAGCTTTTCAAAATCACGAATGTCGCCAATATGAGATTCCATAAGATCACTAAGACGCACTATCTCAAATAAACTTGGAACAGTTGGCGCATCAAGTGCATAGCCTTTTACAATTGCACCCATTTCAGTCAGCCATAGCGAAAGCCAGCTTCCTTTAAAGCCAGTATGGCCGGTAACGAATACACGTTTACCTTGCCAAAAATTTTTATCAATCATCTACTTACTCCCAGGTTTTCCACGGAGCTTTACCTTTTTCCCACAGCCCTTCAAGGTAAACTTTATCACGTAGGGTATCCATCGGCTGCCAGAAACCTGGGTGTTCAAAAGCCATTAACTCCCCCTGTTGTGCCAATGTCATTAATGGCTCTTGTTCCCAGGTTGTTGCATCGTTATCGATGAGATCGATAACCGATGGATTCAACACAAAGAAACCACCATTGATCATTGCCCCATCGCCTTTCGGTTTTTCCTGGAATGACCGGACCTGACCAGCTTGGATATCTAATGCGCCAAAACGTCCTGGTGGAAAAGTAGCTGTTAAAGTCGCTTTCTTACCGTGAGCCTTATGGAAATCGATAGTCGCTTTGATATCAAGGTCGGCAACGCCATCACCATAAGTAAACAGGAAAGCCTCGTCATCTTTTACGTATTCAGCAACACGTTTCAGACGACCACCAGTCATTGAAGAATCACCCGTATCAACCAATGTGACATTCCATGGTTCAACACGTTTATGGTGAACTTCCATACGATTTTCAGCCATATGGAATGTTACATCTGACATGTGAAGGAAGTAGTTCGCAAAATATTCTTTAATCACATATCCTTTATAACCACAGCAGATAATAAAATCCTTGATACCATGCACAGAATACATTTTCATAATGTGCCAAAGAATAGGCTTGCCACCAATTTCTACCATCGGTTTTGGTTTTACAATTGTTTCTTCACTTAGTCTGGTACCAAGTCCACCAGCCAGGATGACCGCTTTCATAAATTATCCTCAATATTATTTAGATGCGGTAAATGCATCAGAATAGAAATGTTCTACAGAGAGATTTTTCATCATAAAGTCCTTTTTACTGGCATCGATCATCACAGGTGAACCACATGCATATATATCGAAGAACTCTAGAGAATCAAAATCATCCATCACAGCATGATGGACAAATCCCTTTCTTCCCCCCCATTCGGCGTCATCACCAGAAACAACAGGGATATAATGAACGTTGTCGTGCTGTTCACTCCACTGCTGCGGTAATGCAGAGTAAAAATCTTTACTATCTTGCATTCCCCAGTAGATGTAGATCTCACGACGACATTTTCCCTGAATGAGATGCTCAACCATTGATTTAACTGGAGCGAATCCAGTACCGCCTGCAAGGAAGATTATAGGTCTGTCACTTTCACGAATAAAAAATGTTCCGCAAGGTCCTTCAATGCGCATAAGAGTATTTTCTTGTAACTCCCCAAAAATGAGAGAGCTCATCTGACCATTGGGAACATTCCTTACATGCAACTCAATACCATTCGACTCATCACTATTAGCGATAGAATAACTGCGAGTTACACCTTTATAATGTAAATTGATATACTGCCCTGGAAGGAAGCCAATTTTTGCTGTTGGTGGTGTGCGTAACTTCAAAGTCATAACATCGCCTGAAACCAGTACAGCACTATTTACCTTGCATGGGACAATTTTTTTTGTCTGTCCAGCT
AGTTCAGGAAAAAAATGCGCATTTAGCTCAAGGGCGGTTTTAGGTTTACAGCAGCAGGTTAGTATTTTATCACCCTGTCCAAAAATATTACCTTTGGAGTCAACAACTTCTCCCGCCAACAAATCGGACTCACAGATACCACAATCACCCGCTTTGCAGCTATGTTCAAGATGGATGCCAGCCGATAGCGCAGCATCGAGGATTGATTCATCCTCTCTACCGGAAAATTCAATATTTGATGGAAAAATCTTAATAATATGAGACACGATGCTTACTCTGTTAACAAGGCTTGATGCAGTAAAGGTGCTGCAGCATCTTTTGCTGAAAGCTCAGGCAGCTGAGAAAAAGGCCATTCAATACCTATTGTCTCATCATTCCATAAAATGCTACCTTCCGATGAAGGTGAGTAATAATTAGTTGCTTTGTACAGAAACTCTGCATACTCACTAAGAGTAACAAAACCATGAGCAAAACCTTCTGGAATCCAAAGCTGTCGCTTATTCTCAGCAGATAGATTTACGCCAACCCATTGACCAAAAGTAGGCGATTCTTTTCGGATATCGACCGCAACATCAAAAACCTCACCGACAGCACAACGAACTAACTTCCCCTGTGCATTTTCTCCTCTCTGAAAATGTAGCCCTCTGAGTACGTTCTTTTTGGATTTTGAATGATTATCTTGAACAAATGTAACTTTACGTCCAATCAACTCTTCAAAGGTCTGCTGGTTATAACTTTCAAAAAAGAATCCCCTCTCATCGCCAAAAACTTTAGGCTCTAAGATCAAGACATCTGGTATTGCTGTTTTAATCACAATCATCACTTATAAACCTTTCACCATCTTCAGCAAATATTTGCCATAATCATTTTTTGATAATGGCCCGGCCAGTTCTATAACCTGTTGTGCATTTATAAAATTTTTACGAAATGCGATCTCTTCCGGGCAGGACACTTTTAGCCCCTGGCGTTCTTCGATGGTTGCAATAAAATTACTGGCCTCTATCAAACTCTGATGCGTCCCTGTATCCAGCCAGGCATAACCGCGCCCCATCATGGCGACAGACAATCTTCCCTGATCCATATAGATACGGTTAATATCCGTGATTTCTAACTCACCGCGAGCGGAAGGCTTAAGATTTTTCGCCATCTCCACCACGCTATTATCATAAAAATACAGCCCCGTTACCGCGTAATTACTCTTCGGTTGTAACGGTTTTTCCTCCAGACTAACGGCTGTGCCACTTTGGTCAAACTCAACCACACCGTAGCGCTCCGGATCGTTTACATGATAAGCAAAGACGGTAGCACCACTTTCTTTATTAACGGCAGCTTCCATTAACTTTGGTAAATCATGACCATAGAAGATATTGTCACCCAGTACTAATGCACAATCATCATTACCAATGAACTCTTCACCAATAATAAACGCCTGTGCTAAGCCATCCGGGCTTGGCTGTACTTTATATTGAAGATTCAGCCCCCACTGGCTGCCGTCTCCCAGCAGTTGTTGAAAACGCGGCGTGTCCTGTGGCGTACTGATGATCAGGATATCCCGAATGCCTGCCAGCATAAGCGTGGAAAGGGGATAGTAAATCATCGGTTTATCATAAATTGGTAGCAATTGCTTACTTACCGCCATGGTCACCGGATAAAGACGGGTGCCGGAGCCCCCCGCTAAAATAATGCCCTTACGCGTTTTCATTTCCATTTCTCATTCATAGAAAATGCCCTGATGGGCATTTAAATTTATCAGATGGTTGTCGTCGTAAACATTTCAGTCAGCATGCGCTTAACTCCTAATTCCCATTGCGGCAGAATAAGGTCAAAATTACGCTGAAACTTTTCAGTATTGAGACGCGAATTGCCTGGTCTGCTCGCCGGCGTCGGGTAGGCGCTGGTCGGCACAGCATTAAGCTCAGTCAGCGCAAGCGTTATCCCTGCTTTGCGCGCCTCGTCAAAGACTAAGGCCGCGTAGTCATGCCAGGTTGTGGTTCCCCCGG
CAACCAGATGGTAAAGACCTGCGACTTCTGGTTTCTTTAACGCCACACGGATCGCATGAGCCGTGCAGTCAGCCAGTAATTCTGCACCGGTTGGCGCACCGTACTGATCGTTGATGACTGAAAGTGTCTGACGCTCTTTCGCCAGACGAAGCATTGTCTTTGCGAAATTATTGCCCTTACCTGCATAAACCCAACTGGTGCGGAAGATAAGATGCTTAGGGCAGTTATCCTGCAGGGCCTTTTCTCCCGCCAGTTTGGTCTTGCCATAGACATTCAGCGGCGACGTAGCGTCCGTTTCCTGCCATGGGATATCGCCGGTACCAGGAAATACATAATCGGTTGAATAATGCACTACCCATGCGCCAGTTTCGTTGGCTGCTTTAGCGATGGCTTCCACACTGGTGGCGTTAAGTAACTGCGCCAGTTCTGGTTCAGACTCTGCTTTATCTACTGCAGTATGTGCTGCTGCGTTAACAATCACATCGGGACGAAGCTTACGAACGGTTTCGGCAACGCCTTTCGGATTACTAAAATCACCGCAAAACTCTTTTGAATGGACATCCAGGGCAATCAGATTCCCTACTGGTGCCAGAGAACGTTGCAACTCCCAGCCTACTTGCCCTGTCTTACCAAAAAGTAAGATATTCATTACTGGCGTCCTTCATAGTTCTGTTCTATCCAACTCTGATACGCCCCACTTTTAACATTGTTTACCCATTGAGTATTTGCAAGGTACCATTCCACTGTTTTACGAATACCGCTTTCAAAGGTCTCCAGCGGTTTCCAGCCTAATTCGCGGCTAATTTTACCTGCATCAATGGCATAACGACGATCATGGCCCGGACGATCCGCGACATAAGTGATTTGTTCACGATAAGAAGTCGCTTTGGGTACAATCTCGTCCAGCAGATCACAGATGGTAAATACCACATCGAGATTTTTCTTCTCATTGTTGCCACCAATGTTATAAGTCTCCCCCGCCTTGCCTTCAGTCACTACCATATGAAGCGCGCGAGCATGATCTTCTACATATAGCCAATCGCGAATCTGATCCCCTTTGCCATAAATTGGCAAAGGCTTTCCTTCCAGTGCGTTCAAAATGACCAACGGAATCAGTTTTTCAGGGAAGTGATAAGGGCCATAGTTATTAGAACAATTGGTAACGATCGTTGGTAGACCATAGGTACGCCGCCAGGCACGGACTAAATGATCGCTGGATGCTTTTGACGCAGAATAGGGGCTACTTGGCGCATATGCCGTCGTTTCAGTAAATAACGGCAGCGTAACGCTGTTTTCAACTTCATCAGGATGCGGTAAATCGCCGTAAACTTCATCAGTGGAAATATGATGAAAACGAAAATTATTTTTTTTATCTTCGCCAAGCGCAGACCAGTATTTACGCGCAACTTCAAGAAGTACATAGGTGCCGACGATATTGGTTTCAATAAATGCTGCTGGCCCGGTAATCGAACGGTCCACATGACTTTCCGCAGCCAAATGCATCACCGCGTCCGGCTGGTACTGCTCAAAAATACGCGTTATTTCAGCGGAATCACAAATATCCGCGTGTTCAAAATTGTAGCGATTACTCTCAGAAATATCAGAAAGGGATTCAAGATTACCGGCGTAGGTTAATTTATCAATATTAACTACAGTGTCCTGTGTATTCTTAATAATATGGCGGACAACAGCTGATCCAATAAAACCTGCCCCGCCAGTAATAAGTATCTTCACTTTTCTATTCCATAAGGCGTATTTAATGTGGTATTTAATTTGCCAATAAAAATTAATTGCTCAAGTCGTTACACACGCTACCGCCCCTGGCTCATCAGCTACCAGTGCACTGCGTACATATCGACTTGTTACAAACCTCGAGCAGGGCAAAGCTCACTAAAACTTAAACGCTAATTGTCTTATTAATTGCATCCGGAAACAAGGATTAATCTTATAAAATCAGCATTAAAATGCTCCAGATAACCCCTTGTTACTTAAGCCCTTTA
TACAAAACTAAAACGGCAGTCAACACTCGCTTCAGCCAACTTGCCGCTTCGAATGTTCACTGCCGTTATTATGTTTATCACCAACCATTTATCACGGTTGTTAATACTTATTCATGCAAAAGCTGCTCTATGCTCTTACGGAACTTCGCTCCTTCTTTCAGGTTGCGCAGCCCGTACTTCACAAATGCCTGCATGTAGCCCATTTTTTTACCGCAGTCATAGCTGTCACCCGTCATTAGCATCGCGTCAACCGACTGTTTTTTCGCCAGTTCTGCAATGGCATCGGTGAGCTGGATACGGCCCCAGGCGCCCGGTTCGGTTCTTTCCAGTTCCGCCCAGATGTCGGCTGAAAGCACATAACGGCCTACCGCCATCAAATCGGAATCCAGCGTCTGCGGCTGATCCGGTTTTTCGATAAACTCCACAATCCGGCTGACTTTGCCTTCATTATCCAGAGGTTCTTTCGTCTGGATAACGGAATACTCCGATAAATCACCTTTCATGCGCTTCGCCAGCACCTGGCTGCGACCCGTTTCATTGAAACGCGCCACCATCGCCGCAAGGTTATAGCGCAGCGGATCGGCGGTAGCATCATCGATAATAATATCCGGGAGTACCACAATGAAAGGGTTATCGCCCACGACCGGACGCGCGCACAGAATAGAGTGCCCCAGCCCTAACGGCTGCGCCTGGCGAACGTTCATAATCGTCACGCCCGGTGGGCAGATAGATTGCACTTCCGCCAAAAGCTGGCGCTTAACGCGCTGCTCAAGAAGTGATTCAAGTTCATAAGAGGTGTCGAAGTGGTTCTCAACGGCGTTTTTAGACGCGTGAGTCACCAGTACGATTTCTTTGATCCCTGCAGCCACAATCTCATCGACAATGTACTGAATCATTGGCTTGTCGACGATCGGTAGCATCTCTTTTGGGATTGCCTTGGTGGCAGGCAACATATGCATACCCAAACCCGCTACCGGTATAACTGCTTTCAAATTCATCATTGTTTCTTCCACCTGTAAAATGGTTGCTGAATTATAGCTCTTTAGCTTGTTTTCGCCAGCATGAATTACTCTGCTGCCAGGGATAATGATGGCACGCTTTACATTACGTCTTAGTCGGCACCATAACATTAAGTATGAACAACTTTTTCCCAGGAATTTTCGTAAAAATAGCGGTACTTACCCTCCCCGCTTCGGCAGCGAAAAATTCACTGCTTCGACATTCACGGTTTGGTGATTAATCCTGTCGATATCCACGGAACTCTGCCCGTTTTCATTGATGGCATGAACATTAGCGAGGGAAAGCAGCGTGTCCTGGCGGGCCATAAATTGACCACGGACATCTTTACGCAAATCGAAATGCATTTTTAACGCCGGGCCAATCGCTGAAGTTTGCATCACGTTGATATTACGCAGAAAGAGGTGCTGCGGTTGATTATGCAGTTCCAGCGTAGCACGCGTCATCCGTACATTGGTGATGGCGACAAAAGAGGGGATGTTGCCGGAGGAAATTTGAATGCCGCGTAATTTATAAGCAACCTGGCGATTATCCAACCGAATAGCGTTTAATTTAAAGTTTTGCGGAATTGACAGGTATTTTCCTTTAACGACGCCATAGCCGATGAGCATCCCAGCACTATTCGTCATATCAATATTATCAATGACGAAATTATCACAGCCATAAATGGCGATCGTTGCGTTATCAATACCCGCATTTTTACTGAAATCGGGCGTGATGTTTTTGGCTTTGACATTGCGAATGACGAAATGTTTGCCATTTTCTACGTGCACCAGCTGTCGGCAATCAGATCCGGTAATATTGGCCACCACAAAGTTTTTTACTGCCTGATCTTCAGGATAACTGTTGTCATAGGTGCTACCCGCCAGCCCGATGCCGATCCCCCAGTTGATTTTGCCATTGGTACAATCAATGCGTTCGATGACATGATCGGAAATCAGGATGTCGCGGTCGTGAATCGCGACATTCCACTCAATGGCGTC
CCCCTGCAAATCGCTAAAGCGGCTATGCGTAATCCGCGCGCCGTCCATTTGGTTATGAAATCCCTGGCGGAGAATGGCGTAGTTGGCGTGGGTAACGGTGATGTCATCGATAATGAGATTACGCATCACCTGCGGTTCCTTACCGCCGATGAAAATTTGCGCGACGGGGCCAAAGCCGCTCATCGTCACGCCTTTAATCACACAGTCCGACCCGCGAACATCCAGCGTCACATTGTGCAGACTGCCGCCCTGCTCCCCCACCACCTGACACCCGTCCTGCAAAATAAACCGTCCCCGGCCATTCCCACGCACCGCGCCCTGTACCCGCAGCGTTTTTCCCGCCGGAATCGTTATCGCCGCATTGATATTTTCACACACCCATCCTGGCGGTAAGACCACGGTCTGTCCGTCGGCGAAGGCCTGTTTGAACGAGGCGATACCGTCATCCGCCGGATAATCCTTAATATCGACGGTCTCGCGAGGTTCACGCGCCTGTACCGGCAAGGCGCGCAGAAAAGGAAGAACAGCAAGCGCGGAACCTGCCGTCAGGAGGGTACGTCGGGAGAACTTATTCACGGGCAT
Protein sequences of DBSCAN-SWA_5 >NZ_CP007598|3745726:3756230|3748939_3749914_-|WP_000018223.1|DBSCAN-SWA MSHIIKIFPSNIEFSGREDESILDAALSAGIHLEHSCKAGDCGICESDLLAGEVVDSKGNIFGQGDKILTCCCKPKTALELNAHFFPELAGQTKKIVPCKVNSAVLVSGDVMTLKLRTPPTAKIGFLPGQYINLHYKGVTRSYSIANSDESNGIELHVRNVPNGQMSSLIFGELQENTLMRIEGPCGTFFIRESDRPIIFLAGGTGFAPVKSMVEHLIQGKCRREIYIYWGMQDSKDFYSALPQQWSEQHDNVHYIPVVSGDDAEWGGRKGFVHHAVMDDFDSLEFFDIYACGSPVMIDASKKDFMMKNLSVEHFYSDAFTASK >NZ_CP007598|3745726:3756230|3753755_3754649_-|WP_000981469.1|DBSCAN-SWA MMNLKAVIPVAGLGMHMLPATKAIPKEMLPIVDKPMIQYIVDEIVAAGIKEIVLVTHASKNAVENHFDTSYELESLLEQRVKRQLLAEVQSICPPGVTIMNVRQAQPLGLGHSILCARPVVGDNPFIVVLPDIIIDDATADPLRYNLAAMVARFNETGRSQVLAKRMKGDLSEYSVIQTKEPLDNEGKVSRIVEFIEKPDQPQTLDSDLMAVGRYVLSADIWAELERTEPGAWGRIQLTDAIAELAKKQSVDAMLMTGDSYDCGKKMGYMQAFVKYGLRNLKEGAKFRKSIEQLLHE >NZ_CP007598|3745726:3756230|3749919_3750471_-|WP_000973709.1|DBSCAN-SWA MMIVIKTAIPDVLILEPKVFGDERGFFFESYNQQTFEELIGRKVTFVQDNHSKSKKNVLRGLHFQRGENAQGKLVRCAVGEVFDVAVDIRKESPTFGQWVGVNLSAENKRQLWIPEGFAHGFVTLSEYAEFLYKATNYYSPSSEGSILWNDETIGIEWPFSQLPELSAKDAAAPLLHQALLTE >NZ_CP007598|3745726:3756230|3750471_3751350_-|WP_000857535.1|DBSCAN-SWA MKTRKGIILAGGSGTRLYPVTMAVSKQLLPIYDKPMIYYPLSTLMLAGIRDILIISTPQDTPRFQQLLGDGSQWGLNLQYKVQPSPDGLAQAFIIGEEFIGNDDCALVLGDNIFYGHDLPKLMEAAVNKESGATVFAYHVNDPERYGVVEFDQSGTAVSLEEKPLQPKSNYAVTGLYFYDNSVVEMAKNLKPSARGELEITDINRIYMDQGRLSVAMMGRGYAWLDTGTHQSLIEASNFIATIEERQGLKVSCPEEIAFRKNFINAQQVIELAGPLSKNDYGKYLLKMVKGL >NZ_CP007598|3745726:3756230|3754826_3756230_-|WP_001144948.1|DBSCAN-SWA MPVNKFSRRTLLTAGSALAVLPFLRALPVQAREPRETVDIKDYPADDGIASFKQAFADGQTVVLPPGWVCENINAAITIPAGKTLRVQGAVRGNGRGRFILQDGCQVVGEQGGSLHNVTLDVRGSDCVIKGVTMSGFGPVAQIFIGGKEPQVMRNLIIDDITVTHANYAILRQGFHNQMDGARITHSRFSDLQGDAIEWNVAIHDRDILISDHVIERIDCTNGKINWGIGIGLAGSTYDNSYPEDQAVKNFVVANITGSDCRQLVHVENGKHFVIRNVKAKNITPDFSKNAGIDNATIAIYGCDNFVIDNIDMTNSAGMLIGYGVVKGKYLSIPQNFKLNAIRLDNRQVAYKLRGIQISSGNIPSFVAITNVRMTRATLELHNQPQHLFLRNINVMQTSAIGPALKMHFDLRKDVRGQFMARQDTLLSLANVHAINENGQSSVDIDRINHQTVNVEAVNFSLPKRGG 
>NZ_CP007598|3745726:3756230|3751397_3752297_-|WP_001023658.1|DBSCAN-SWA MNILLFGKTGQVGWELQRSLAPVGNLIALDVHSKEFCGDFSNPKGVAETVRKLRPDVIVNAAAHTAVDKAESEPELAQLLNATSVEAIAKAANETGAWVVHYSTDYVFPGTGDIPWQETDATSPLNVYGKTKLAGEKALQDNCPKHLIFRTSWVYAGKGNNFAKTMLRLAKERQTLSVINDQYGAPTGAELLADCTAHAIRVALKKPEVAGLYHLVAGGTTTWHDYAALVFDEARKAGITLALTELNAVPTSAYPTPASRPGNSRLNTEKFQRNFDLILPQWELGVKRMLTEMFTTTTI >NZ_CP007598|3745726:3756230|3745726_3747040_-|WP_000126349.1|DBSCAN-SWA MTANNLREQISQLVAQYANEALSPKPFVAGTSVVPPSGKVIGAKELQLMVEASLDGWLTTGRFNDAFEKKLGEFIGVPHVLTTTSGSSANLLALTALTSPKLGERALKPGDEVITVAAGFPTTVNPAIQNGLIPVFVDVDIPTYNIDASLIEAAVTEKSKAIMIAHTLGNAFNLSEVRRIADKYNLWLIEDCCDALGTTYEGQMVGTFGDIGTVSFYPAHHITMGEGGAVFTKSGELKKIIESFRDWGRDCYCAPGCDNTCGKRFGQQLGSLPQGYDHKYTYSHLGYNLKITDMQAACGLAQLERVEEFVEQRKANFSYLKQGLQSCTEFLELPEATEKSDPSWFGFPITLKETSGVNRVELVKFLDEAKIGTRLLFAGNLIRQPYFANVKYRVVGELTNTDRIMNQTFWIGIYPGLTTEHLDYVVSKFEEFFGLNF >NZ_CP007598|3745726:3756230|3752296_3753382_-|WP_000697848.1|DBSCAN-SWA MKILITGGAGFIGSAVVRHIIKNTQDTVVNIDKLTYAGNLESLSDISESNRYNFEHADICDSAEITRIFEQYQPDAVMHLAAESHVDRSITGPAAFIETNIVGTYVLLEVARKYWSALGEDKKNNFRFHHISTDEVYGDLPHPDEVENSVTLPLFTETTAYAPSSPYSASKASSDHLVRAWRRTYGLPTIVTNCSNNYGPYHFPEKLIPLVILNALEGKPLPIYGKGDQIRDWLYVEDHARALHMVVTEGKAGETYNIGGNNEKKNLDVVFTICDLLDEIVPKATSYREQITYVADRPGHDRRYAIDAGKISRELGWKPLETFESGIRKTVEWYLANTQWVNNVKSGAYQSWIEQNYEGRQ >NZ_CP007598|3745726:3756230|3748150_3748924_-|WP_000648783.1|DBSCAN-SWA MKAVILAGGLGTRLSEETIVKPKPMVEIGGKPILWHIMKMYSVHGIKDFIICCGYKGYVIKEYFANYFLHMSDVTFHMAENRMEVHHKRVEPWNVTLVDTGDSSMTGGRLKRVAEYVKDDEAFLFTYGDGVADLDIKATIDFHKAHGKKATLTATFPPGRFGALDIQAGQVRSFQEKPKGDGAMINGGFFVLNPSVIDLIDNDATTWEQEPLMTLAQQGELMAFEHPGFWQPMDTLRDKVYLEGLWEKGKAPWKTWE >NZ_CP007598|3745726:3756230|3747066_3748146_-|WP_000565913.1|DBSCAN-SWA 
MIDKNFWQGKRVFVTGHTGFKGSWLSLWLTEMGAIVKGYALDAPTVPSLFEIVRLSDLMESHIGDIRDFEKLRNSIAEFKPEIVFHMAAQPLVRLSYEQPIETYSTNVMGTVHLLEAVKQVGNIKAVVNITSDKCYDNREWVWGYRENEPMGGYDPYSNSKGCAELVASAFRNSFFNPANYEQHGVGLASVRAGNVIGGGDWAKDRLIPDILRSFENNQQVIIRNPYSIRPWQHVLEPLSGYIVVAQRLYTEGAKFSEGWNFGPRDEDAKTVEFIVDKMVTLWGDDASWLLDGENHPHEAHYLKLDCSKANMQLGWHPRWGLTETLGRIVKWHKAWIRGEDMLICSKREISDYMSATTR |
10 | Enterobacteria_phage(37.5%) | NA | NA |
Homologous phage analysis in the prophage region. Bacterial proteins shown in color are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
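The keyword-based flagging described for the homologous phage analysis can be illustrated with a short sketch. It is not the database's own implementation: the function name `phage_keyword_hits`, the case-insensitive substring matching, and the example annotation strings are all assumptions made for illustration. Only the keyword list is taken from the text above.

```python
# Keywords as listed in the report text; everything else here is hypothetical.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def phage_keyword_hits(annotation):
    """Return the phage-related keywords found in a protein annotation.

    Assumption: matching is a case-insensitive substring search. The real
    pipeline may use different rules (for example whole-word matching).
    """
    lowered = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in lowered]


if __name__ == "__main__":
    # Illustrative annotation strings, not taken from the report.
    for text in ("phage tail fiber protein", "DNA polymerase"):
        print(text, "->", phage_keyword_hits(text))
```

A protein would be flagged (colored) whenever the returned list is non-empty. Substring matching is deliberately simple and can over-match, so a stricter tokenizer may be preferable in practice.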
DBSCAN-SWA_6 |
3823427 : 3832598
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NZ_CP007598|3823427:3832598|DBSCAN-SWA TATGACTCAAGTCGCGAAGAAAATTCTGGTAACGTGCGCGCTGCCGTACGCCAACGGCTCTATCCACCTCGGCCATATGCTGGAGCACATCCAGGCTGATGTCTGGGTCCGTTACCAGCGAATGCGCGGCCATGAGGTTAACTTCATCTGTGCCGATGACGCTCATGGCACGCCGATCATGCTGAAAGCGCAGCAGCTTGGTATTACGCCGGAGCAAATGATCGGTGAAATGAGCCAGGAGCACCAGACCGATTTCGCCGGTTTTAATATTAGCTACGACAACTACCACTCAACGCACAGCGACGAGAATCGCGAGCTGTCCGAGCTGATTTATACGCGCCTGAAAGAGAACGGTTTTATTAAGAACCGCACTATCTCTCAACTCTACGATCCGGAAAAAGGCATGTTCCTGCCGGACCGATTTGTGAAAGGCACCTGCCCGAAATGTAAATCCGCGGACCAGTACGGCGATAACTGTGAAGTCTGCGGCGCAACTTACAGCCCGACCGAACTTATCGAGCCGAAATCCGTGGTGTCCGGCGCGACGCCGGTAATGCGTGACTCCGAGCACTTTTTCTTTGATCTGCCGTCATTCAGCGAAATGCTGCAGGCGTGGACCCGCAGCGGCGCGCTGCAGGAGCAGGTGGCGAACAAAATGCAGGAGTGGTTTGAATCCGGCCTGCAACAGTGGGACATTTCCCGCGACGCGCCGTATTTTGGTTTCGAAATCCCGAACGCGCCGGGCAAATATTTCTACGTCTGGCTGGACGCGCCGATTGGCTATATGGGCTCCTTCAAAAATCTGTGCGATAAGCGCGGTGACACGACCAGTTTTGATGAGTACTGGAAAAAAGACTCCGACGCCGAGCTGTATCACTTTATCGGCAAAGACATCGTCTATTTCCACAGCCTGTTCTGGCCTGCCATGCTGGAAGGCAGCCACTTCCGTAAGCCGACCAACCTGTTCGTTCACGGTTACGTGACGGTGAACGGCGCGAAGATGTCTAAGTCTCGCGGCACCTTTATTAAGGCCAGCACCTGGCTGAAACACTTTGACGCCGACAGCCTGCGCTACTACTACACCGCGAAGCTTTCTTCACGCATTGATGACATCGACCTGAACCTGGAAGACTTTGTCCAGCGCGTCAATGCCGATATCGTCAATAAAGTAGTCAACCTGGCATCCCGTAACGCCGGTTTTATCAATAAGCGTTTCGACGGCGTGCTGGCGGCTGAACTGGCCGATCCGCAATTGTACAAAACCTTTACTGACGCCGCTGCGGTGATTGGCGAAGCATGGGAAAGCCGTGAATTCGGCAAAGCTATCCGTGAGATTATGGCGCTGGCCGACGTCGCTAACCGTTATGTTGACGAGCAAGCGCCGTGGGTGGTGGCTAAACAGGAAGGCCGCGACGCTGACCTGCAGGCCATTTGCTCGATGGGCATCAACCTGTTCCGCGTGCTGATGACGTATCTGAAACCGGTACTGCCGACGCTTTCTGAACGCGTTGAAGCCTTCCTGAACAGCGAACTGAACTGGGATGCCATCGAACAGCCGCTGCTCAGTCACAAGGTCAACACCTTTAAGGCGCTCTACAATCGCATCGACATGAAGCAAGTTGAAGCGCTGGTTGAAGCGTCTAAAGAAGAGGTGAAAGCCGCAGCCGCACCGGTTACCGGCCCGTTAGCCGACTTCCCGATTCAGGAAACCATCACCTTTGACGATTTCGCCAAAATTGACCTGCGCGTAGCATTGATTGAAAACGCTGAGTTCGTGGAAGGCTCCGACAAATTGCTGCGTCTGACGCTGGATCTGGGCGGCGAGAAGCGTAACGTCTTCTCCGGCATTCGTTCCGCCTACCCGGACCCGCAGGCGCTGATCGGCCGCCAGACGGTAATGGTCGCCAACCTCGCGCCGCGCAA
AATGCGCTTTGGCGTCTCCGAGGGAATGGTGATGGCCGCAGGCCCTGGCGGGAAAGATATCTTCCTGTTAAGCCCTGATGACGGCGCGAAGCCTGGCCAACAGGTGAAATAAGCAACAAGCCGGAGCATGCTCCGGCTTTTTTAACCGCTTAATCCTGACTGGCATATCGCCTCCTCCGCACGTTTTAACTTTTTCCTTAATAAGCAATAGCGCTAAGACTCCATATCCGGACTGCTAAATAACGGCTAAAAGTCATATTCTATCTCTCCCTGATACATTGCTATTACTGGGTTAAATTAATTCATTGAAATTATTTTAAAAGCCCAATTATTATATAAAATGGAGTTTTAGATGAAAATTTCTGGCAAGTTATTGTCCACGGCTTTGGCTTCCGTACTGGTGTTCTCTCTTGCTGGCTGTGGCGATAAAGAAGAATCAAAGACCTTTAACGCAAACCTGGCGGGGACAGAAATTTCAATTACTTACACCTATAAAGGTGACAAAATCATTAAGCAGACGTCTGAAAGTAAAATCAGCTATGCCACTGTAGGCGCTAAAACGAAAGAAGATGCCGCCAAAATTCTCGATCCGCTGAGCGCGAAATATAAAAATATCGCCGGAGTGGAAGAAAAATTAACCTATGAAGATACCTATGCCCAGGAAAACGTCTCTGTGGATATGGAAAAAGTGGACTTTAAAGCGTTACAGCAAATCTCAGGGACGATGGTGTCCGGCGATACCAGCAAAGGTATCAGCATGAAACAAACCCAGACGCTGCTGGAAGCTGCTGGTTTTAAAGAAGCGAAATAACTGGCAGGCATAATGTATTGCATCGACTGGTAAAGTCGCTCAGGGCGCTGCTTCGGCAGCGTCTTTTCTTTATGAATTCCGAAAAAAGACAGCCTCGCTGTAAGTCCCCTCGCCAAAGCCTATAATCCCTGAATTCCTGGCCACACCAATACTTAACGACAAGGAATTGTTATGCAGGTTCTACGTCTTATGGCACTGCCACTATTCGCGCTCTCTCTATCGGTTAGCATAACTGGCTGCGATCAGAAAAACGATACTCTCCAGGGAAAGCAAAATAACATGACAGCGTTTATCAAGAAGATAGCCGCTAGCAAAGAGTCAGAGGAAACACAACGCTATGTAGGTAATCTCAACGGTATTGAAATCAAGTTAACCTATTACTACAAAGGGGATATCGTTTTACGTCAAATATCTGAACATAAACTACTTTATAAGACCCTGAAAGCCAATAATAAAGAAGAAGCACAAAAAATGCTGAGTCAAGTCGGCGAAGCTTATCAGGGTATGCCGGGTTTGACTGAACGAATCGACTATTATGATAGCTATGCTACGGAATATGTGGATATTGATTTTACCCAGGCAAAAATAAGCGACCTCTGTAAATTGCCAGGATCATCAATTGACAACTGTTCCGCGTACTATCTGTCAATGATTCGCTCGCAGAAACTGTTGGAAGAGAGCGGGTATCATAGAATCAATTAGTATAATAATGCGTTTTCCCGGTCAGACAGGACGCTGCCGGGAACAATACCGCAGAATTACTTCGCCGCGTGTTCACGGGCCGCCAGACCGCGCAGAAAATAACGCATAAACTGATCGCCGCATTCGCGGAAGTTCTTATGGTCCGGCGCGCGCATCATCGCGGTGATCTCTGACATTGAGACACGAAACAACTGACCGGTAAGTATCGCCAGGATATCATCTGTTTTTAGCGAAAAGGCAATACGCAGCTTTTTCAGCACAATATTGTTGTTGATACGACGTTCCGCCGTCAATGCAGGCGCCGCCTCATCTTTGCCGCGTTTTTCATAAATGAGGCCATTGAGAAATGAGGACAACACGATATCCGGGCAACGCTGAAACCCCTCTTCCTCTTCTTTGCGCAACCAGATTGCTATCTGCTCCGGCGTAGCATCAACGTTACCCAGCGCCAGGATACGCGCCAGATCGGTATTATTAGCTTTTAAAA
TGTAGCGCACGCTACGCAGAATATCGTTACTCAGCATGAGGCCTTCGGTCGTTTCTATGGCAAAACGATATTCTAACAGTCTTTTACAGGCCAATCGCCTCTTTTAAACTTTTCAGATAGCGACGGCTTACCGGCACCGTCAGGCCGTTGCGTAAAATCAGCTCTGCCTGCCCATTATCCTCCAGCCGAATTTCCTGCAAATGGGCCATATTCACCAGAAACTGACGATGACAACGCAGTAGCGGCGTCCGGCTTTCCAGCGTGCGCAGCGTCAGCTCGGTAAACCCCTCTTTCCCTTCACTGCTGGTCACATAAACGCCGCTCATACGGCTACTGACAAAGGCGACATCATCCATTTGCAACAAATAGATCCGGCTGTGTCCGGTACAGGGAATGAATTTAAGCGCCTGCTGGTTTTCCGGCAACAACGAAACATCCTGTTTACTGCGCTCCTGACGCAGACGATGTAACGTTTTTTCCAGCCGTTTCTCCTCTATCGGCTTGAGCAGATAATCAAAAGCGTGTTCTTCAAAGGCTTTGATGGCGTATTCGTCAAACGCGGTTAAAAAAACGATATACGGGCGGTGTTCCGGATCAAGCATTCCTACCATCTCCAGTCCACTGATACGCGGCATCTGAATATCCAGAAACAGCACATCAGGTCGCAACTTATGTACCGCGCCAATCGCTTCTACCGCGTTCGCGCACTCTCCCACAATCTCAATGTCATCCTGCCCCTGGAGCAAAATCCGCAGATTTTCCCGCGCTAACGGCTCATCATCCACAATCAGCACTTTAATCATGCGTCCTCCTCCAGTGGAAGTCGTAATGTAATTCGGGTAAAACAGTCCGGCTCGCAGGCCACGCTAATACCATAATCATCGCCAAAGTGTTCGCGCAGACGTTTATCAACCAGACTCATCCCCAGCCCGCTACTGCCGGCGGAAGGCTGATACAGTCCCGCATTATCCTCAATATCTAACATCAAATGCTGCCCTTCGCGCCGGGCGCGAATAGCGACGTTGCCGGTATCAAGCAGTTGCGACGTGCCATGTTTAATGGCGTTCTCAACAATCGGCTGTAATGTAAACGCAGGCAATTTCTGACGTGAAAGCGTCGATGGAACATCAAGCTGTACCTGCAGACGCGACTGAAAACGCGCTTTTTCAATTTGCAGATAAGCGTTTACGTGTTCAATTTCATCCGCCAGCGTGACGATTTCCGACGGGCGTTTTAAATTTTTGCGAAAAAAGGTCGACAAGTACTGCACCAGTTGGCTGGCCTGTTCGCTGTCGCGGCGAATCACCGCTTTAATGGTATTGAGCGCGTTAAACAGAAAATGCGGGTTCACCTGCGCGTGCAACAGCTTGATCTCTGACTGCGTCAGCAACGCCTTCTGCCGTTCATACTGCCCGGCCAGGATCTGCGCGGATAAAAGCTGCGCAATACCCTCTCCCAGGGTGCGGTTAATTGAGCTAAACAGCCGGTTTTTCGCTTCGTACAATTTAATGGTGCCCATGACTCGCTGATTTTCGCCACGCAGCGGGATCACCAGCGTCGAGCCGAGTTTACACTGCGGGTGTAGCGAACAGCGATACGGCACTTCGTTGCCATCGGCATAAACCACCTCTCCGGTTTCAATTGCTTTCAGCGTATAACCTGATGAAATGGGTTTGCCCGGTAGATGGTGATCGTCGCCAATACCAGTAAAAGCCAGCAGTTTTTCGCGATCGGTGATGGCGACGGCGCCAATATCCAGCTCCTGATATAACACCTGCGCCACCTTCATACTGTTCACTTCGTTAAATCCCTGACGCAGAATCCCCTCCGTTGACGCGGCGACCTTCAGCGCGGTAGCAGAAAATGCCGAAGTATATTTTTCGAACATGGCGCGCTTATCGAGCAAAATACGCATGAACAGCGCGGCGCCAACGGTATTCGTCACCATCATCGGCGCGGCAATATTACTGACCAGATGCAAGGCATCGTCAAACGGCCTG
GCTATCAGTAAAATGATCAGCATCTGCACCAGTTCGGCAATACACGTAATTGCTCCCGCCGTCAGCGGGCTAAACACTTTGTCCGGGCGTCCGCGACGTATGAGAACGCTGTGTACCAACCCGCCCAGCAGCCCTTCGACGATGGTGGAAATCATACAGCTCAGCGCCGTCATGCCGCCCATAGAATACCGATGTAACCCACCGGTCAGACCGACCAGCCCGCCGACGACCGGCCCGCCGAGTAGGCCGCCCATCACCGCGCCAATTGCGCGGGTATTGGCAATCGAATCTTCGATATGCAGCCCAAAATAAGTGCCCATGATGCAGAAGATAGAAAACGTGACGTAACACAGAAGCTTGTGCGGCAGACGAACCGTGACCTGCATAAGCGGGATGAACAGACGCGTTTTACTCATTAGCCACGCAATGACCAGAAACACGCACATCTGCTGAAGCAGCAGCAACACCAGATTAAACTCGTACATACCCGCAAACCACACTTCAATTAAAAGCGCGTAACATACATTGAGTACGATTAACTTTCTTTGAACTGTTGCATAAAAATATGAATTCGTGAATACGATCACTTAAACACCCCGCCGCAACCCGCTACTTCGCGTTTTAATGCATAAAAAACAGGCAAAACTTCCTGGTTCGTAAAAGAGCGTCTAAAGTTAAACCGGGACCTCGCGAGCAAGGGTGAAACGATGGCGCTTTACACAATTGGTGAAGTGGCTTTGCTTTGTGATATCAATCCTGTCACGTTGCGCGCGTGGCAGAGACGTTATGGACTTTTAAAACCACAGCGAACGGATGGCGGTCATCGTCTGTTTAACGATGCCGATATCGACAGAATCCGCGAAATCAAGCGCTGGATAAATAACGGCGTCCAGGTCAGCAAAGTCAAAGTGCTGCTCAGTAGCGACAGTAGCGAACAACCTAACGGCTGGCGCGAACAGCAGGAGATCCTGCTGCACTACCTGCAAAGCAGTAATCTGCACAGTTTACGGTTATGGGTCAAAGAACGCGGTCAGGATTATCCAGCCCAAACATTGACCACTAACCTGTTCGTCCCGCTGCGGCGACGATTACAGTGCCAACAACCTGCCCTTCAGGCGCTGCTCGGCATTCTTGACGGTATCCTGATCAACTATATTGCGCTCTGCCTGGCGTCTGCGCGTAAGAAACAGGGAAAAGATGCGTTGGTGATCGGCTGGAATATCCATGATACCACCCGCCTGTGGCTGGAAGGTTGGGTCGCCAGCCAACAGGGATGGCGAATCGACGTGCTGGCGCATTCGCTTAGCCAGTTCCGCCCGGAACTGTTTGACGGCAAGACGTTACTGGTATGGTGCGGAGAAAACCAGACGCTGGCGCAGCAGCAGCAACTCCTGGCATGGCGCGCCCAGGGACGCGACATTCATCCCCTTGGCGTTTAAACAGCAGCTAACAAATTCGCTTTAATGTATACTCCTTTTATTAACATAAGGAGTACATAATGCGCGTAGCGAAAATCGGGGTGATCGCCCTTTTCCTGCTGATGGCTATTGGCGGTATCGGCGGCGTGATGCTGGCAGGTTACAGTTTTATTTTGCGTGCCGGGTAAGCGCGCGCGTCAGCCTTTCAAACAGGCGATCGATAATGATCGCCGCCAGCGCCACCAGCAGCGCCCCCTGGATAACATAGGCCGTATTAAAGCCGCTAAGCCCGATAATGATCGGCGTGCCTAACGTACTGGCCCCCACTGTTGAAGCGATGGTCGCCGTACCAATATTGATAATCACCGAGGTTCGGATGCCCGCCAGAATCACCGGCGCGGCCAGCGGCAGTTCAACCTGATACAACTGTTGGCGACGGCTCATTCCCATACCGCTGGCAACGCTCATCACGCTGGCAGGCACCGCGCCCAGCCCGGCCAGGGTCGCCTGCAGGATGGGCAACACTCCATACAGGATCAAGGCGATAATGGCTGGTTGCTGACCAAAACCCATGACGGGTAC
CGCGATCGCCAGTACCGCGACCGGGGGAAAGGTCTGCCCGACGGCGGCGATAGTCTCCACCAGGGGACGAAACTCCTTCCCACTTTCTCGCGTGACCGCAATCCCTGCGCCGACGCCCACCACGACGGCAAACAGACTTGAGATGCCCACCAACCAGAAATGGGCGAGCGCGAGGGCGGCAAAACTCTCCTGTTGGTAGACCGGGCGCGGTAAATCGGGAAACAGCGCGGCGAAGAACGGCTGGCTATAAGGCAATCCAAACAGCAGAAGCAAGAACAGAACAATAAGCCAGAGAAGCGGATCACACAGTCGTTTCACGGGGGGACGTCTCCGAAAGTAGATCGCGGAAATGGAGCGTACCGCAGGGCTCGCCCTGCTGATTCGCCACCGGCAGGACGTCGCACCGACGGGCGACAAACATCGATAGCGCATCGCGTAGCGTCATCTCTTCCACCAGCGCGTCGCCGCTGAGCTGTTCATGCCGACGTACATAATCGCCTACGCTACGTAACGAAAGCAGCCTTACGCCCAGCTCGCTGCGGCCAAAAAACGCCTGCACGAAATCATTTTCCGGCGAGGTCAGCATAGAAAGTGGCGATCCCTGTTGGATAACATTGCCCCCGTCCATCAGCACCAGATGGTCGGCGAGGCGTAGCGCCTCGTCGATGTCGTGCGTCACCAGTACGATGGTGCGCCCCAGTAGCTGATGAATGCGGGTCATCTCCTGCTGCAATGCGCCGCGCGTTACCGGATCAAGCGCGCCGAAAGGCTCGTCCATCAGCAATACCTGCGGATCGGCAGCCAGCGCCCGCGCAACGCCGACCCGCTGCTGTTGCCCGCCGGAAAGCTGATGCGGATAACGATCGCGCAGCGCGCTTTCCAGACCCAATAATGCCATCAGTTCATCAATACGATCGTTAATCCGCGCACGCGACCACTTTTGTAGTTGCGGTACGGTGGCGATATTTTGCGCCACCGTCCAGTGGGGAAAAAGACCGATAGACTGAATGGCATAGCCCATGCGACGGCGCAGTTCAAGCACCGGCAGGCTGCGGATCTCTTCCCCGGCAAAACGGATCGTTCCGCTATCATGCTCTACCAGCCGGTTAATCATCTTCAGAGTGGTCGATTTTCCCGAACCGGAGGTGCCAATTAACACCGAAAAGCTGCCTTCGCTAAAGTGCAAATTGAGGTCGCTAACAGCCTGTTGATCGCCGAAGGTTTTACTGACATGGTTAAATTCAATCAT
Protein sequences of DBSCAN-SWA_6 >NZ_CP007598|3823427:3832598|3831650_3832598_-|WP_000569168.1|DBSCAN-SWA MIEFNHVSKTFGDQQAVSDLNLHFSEGSFSVLIGTSGSGKSTTLKMINRLVEHDSGTIRFAGEEIRSLPVLELRRRMGYAIQSIGLFPHWTVAQNIATVPQLQKWSRARINDRIDELMALLGLESALRDRYPHQLSGGQQQRVGVARALAADPQVLLMDEPFGALDPVTRGALQQEMTRIHQLLGRTIVLVTHDIDEALRLADHLVLMDGGNVIQQGSPLSMLTSPENDFVQAFFGRSELGVRLLSLRSVGDYVRRHEQLSGDALVEEMTLRDALSMFVARRCDVLPVANQQGEPCGTLHFRDLLSETSPRETTV >NZ_CP007598|3823427:3832598|3826331_3826862_+|WP_001197951.1|DBSCAN-SWA MQVLRLMALPLFALSLSVSITGCDQKNDTLQGKQNNMTAFIKKIAASKESEETQRYVGNLNGIEIKLTYYYKGDIVLRQISEHKLLYKTLKANNKEEAQKMLSQVGEAYQGMPGLTERIDYYDSYATEYVDIDFTQAKISDLCKLPGSSIDNCSAYYLSMIRSQKLLEESGYHRIN >NZ_CP007598|3823427:3832598|3826918_3827386_-|WP_000950414.1|DBSCAN-SWA MLSNDILRSVRYILKANNTDLARILALGNVDATPEQIAIWLRKEEEEGFQRCPDIVLSSFLNGLIYEKRGKDEAAPALTAERRINNNIVLKKLRIAFSLKTDDILAILTGQLFRVSMSEITAMMRAPDHKNFRECGDQFMRYFLRGLAAREHAAK >NZ_CP007598|3823427:3832598|3827432_3828152_-|WP_000598637.1|DBSCAN-SWA MIKVLIVDDEPLARENLRILLQGQDDIEIVGECANAVEAIGAVHKLRPDVLFLDIQMPRISGLEMVGMLDPEHRPYIVFLTAFDEYAIKAFEEHAFDYLLKPIEEKRLEKTLHRLRQERSKQDVSLLPENQQALKFIPCTGHSRIYLLQMDDVAFVSSRMSGVYVTSSEGKEGFTELTLRTLESRTPLLRCHRQFLVNMAHLQEIRLEDNGQAELILRNGLTVPVSRRYLKSLKEAIGL >NZ_CP007598|3823427:3832598|3825701_3826160_+|WP_000703145.1|DBSCAN-SWA MKISGKLLSTALASVLVFSLAGCGDKEESKTFNANLAGTEISITYTYKGDKIIKQTSESKISYATVGAKTKEDAAKILDPLSAKYKNIAGVEEKLTYEDTYAQENVSVDMEKVDFKALQQISGTMVSGDTSKGISMKQTQTLLEAAGFKEAK >NZ_CP007598|3823427:3832598|3830056_3830788_+|WP_001240421.1|DBSCAN-SWA MALYTIGEVALLCDINPVTLRAWQRRYGLLKPQRTDGGHRLFNDADIDRIREIKRWINNGVQVSKVKVLLSSDSSEQPNGWREQQEILLHYLQSSNLHSLRLWVKERGQDYPAQTLTTNLFVPLRRRLQCQQPALQALLGILDGILINYIALCLASARKKQGKDALVIGWNIHDTTRLWLEGWVASQQGWRIDVLAHSLSQFRPELFDGKTLLVWCGENQTLAQQQQLLAWRAQGRDIHPLGV >NZ_CP007598|3823427:3832598|3823427_3825461_+|WP_000195340.1|tRNA|DBSCAN-SWA 
MTQVAKKILVTCALPYANGSIHLGHMLEHIQADVWVRYQRMRGHEVNFICADDAHGTPIMLKAQQLGITPEQMIGEMSQEHQTDFAGFNISYDNYHSTHSDENRELSELIYTRLKENGFIKNRTISQLYDPEKGMFLPDRFVKGTCPKCKSADQYGDNCEVCGATYSPTELIEPKSVVSGATPVMRDSEHFFFDLPSFSEMLQAWTRSGALQEQVANKMQEWFESGLQQWDISRDAPYFGFEIPNAPGKYFYVWLDAPIGYMGSFKNLCDKRGDTTSFDEYWKKDSDAELYHFIGKDIVYFHSLFWPAMLEGSHFRKPTNLFVHGYVTVNGAKMSKSRGTFIKASTWLKHFDADSLRYYYTAKLSSRIDDIDLNLEDFVQRVNADIVNKVVNLASRNAGFINKRFDGVLAAELADPQLYKTFTDAAAVIGEAWESREFGKAIREIMALADVANRYVDEQAPWVVAKQEGRDADLQAICSMGINLFRVLMTYLKPVLPTLSERVEAFLNSELNWDAIEQPLLSHKVNTFKALYNRIDMKQVEALVEASKEEVKAAAAPVTGPLADFPIQETITFDDFAKIDLRVALIENAEFVEGSDKLLRLTLDLGGEKRNVFSGIRSAYPDPQALIGRQTVMVANLAPRKMRFGVSEGMVMAAGPGGKDIFLLSPDDGAKPGQQVK >NZ_CP007598|3823427:3832598|3830935_3831667_-|WP_000824854.1|DBSCAN-SWA MKRLCDPLLWLIVLFLLLLFGLPYSQPFFAALFPDLPRPVYQQESFAALALAHFWLVGISSLFAVVVGVGAGIAVTRESGKEFRPLVETIAAVGQTFPPVAVLAIAVPVMGFGQQPAIIALILYGVLPILQATLAGLGAVPASVMSVASGMGMSRRQQLYQVELPLAAPVILAGIRTSVIINIGTATIASTVGASTLGTPIIIGLSGFNTAYVIQGALLVALAAIIIDRLFERLTRALTRHAK >NZ_CP007598|3823427:3832598|3830847_3830955_+|WP_001261696.1|DBSCAN-SWA MRVAKIGVIALFLLMAIGGIGGVMLAGYSFILRAG >NZ_CP007598|3823427:3832598|3828148_3829834_-|WP_000272845.1|DBSCAN-SWA MYEFNLVLLLLQQMCVFLVIAWLMSKTRLFIPLMQVTVRLPHKLLCYVTFSIFCIMGTYFGLHIEDSIANTRAIGAVMGGLLGGPVVGGLVGLTGGLHRYSMGGMTALSCMISTIVEGLLGGLVHSVLIRRGRPDKVFSPLTAGAITCIAELVQMLIILLIARPFDDALHLVSNIAAPMMVTNTVGAALFMRILLDKRAMFEKYTSAFSATALKVAASTEGILRQGFNEVNSMKVAQVLYQELDIGAVAITDREKLLAFTGIGDDHHLPGKPISSGYTLKAIETGEVVYADGNEVPYRCSLHPQCKLGSTLVIPLRGENQRVMGTIKLYEAKNRLFSSINRTLGEGIAQLLSAQILAGQYERQKALLTQSEIKLLHAQVNPHFLFNALNTIKAVIRRDSEQASQLVQYLSTFFRKNLKRPSEIVTLADEIEHVNAYLQIEKARFQSRLQVQLDVPSTLSRQKLPAFTLQPIVENAIKHGTSQLLDTGNVAIRARREGQHLMLDIEDNAGLYQPSAGSSGLGMSLVDKRLREHFGDDYGISVACEPDCFTRITLRLPLEEDA |
10 | Enterobacteria_phage(66.67%) | tRNA | NA |
Homologous phage analysis in the prophage region. Bacterial proteins shown in color are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
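The protein records in these prophage regions use pipe-delimited FASTA headers such as `>NZ_CP007598|3823427:3832598|3823427_3825461_+|WP_000195340.1|tRNA|DBSCAN-SWA`. The sketch below shows one way to split such a header into its parts. The field interpretation (contig, region `start:end`, gene `start_end_strand`, protein accession, optional keyword, tool) is inferred from the records shown here, and the names `ProteinHeader` and `parse_header` are hypothetical.

```python
from typing import NamedTuple, Optional


class ProteinHeader(NamedTuple):
    contig: str
    region_start: int
    region_end: int
    gene_start: int
    gene_end: int
    strand: str
    protein_id: str
    keyword: Optional[str]  # e.g. "tRNA"; None when the header has no keyword field
    tool: str


def parse_header(header: str) -> ProteinHeader:
    """Parse a pipe-delimited protein header (layout inferred, not documented)."""
    fields = header.strip().lstrip(">").split("|")
    if len(fields) not in (5, 6):
        raise ValueError(f"unexpected number of fields: {len(fields)}")
    region_start, region_end = (int(x) for x in fields[1].split(":"))
    gene_start, gene_end, strand = fields[2].split("_")
    keyword = fields[4] if len(fields) == 6 else None
    return ProteinHeader(
        contig=fields[0],
        region_start=region_start,
        region_end=region_end,
        gene_start=int(gene_start),
        gene_end=int(gene_end),
        strand=strand,
        protein_id=fields[3],
        keyword=keyword,
        tool=fields[-1],
    )


if __name__ == "__main__":
    h = parse_header(
        ">NZ_CP007598|3823427:3832598|3823427_3825461_+|WP_000195340.1|tRNA|DBSCAN-SWA"
    )
    # Check that the gene lies inside the reported prophage region.
    print(h.protein_id, h.region_start <= h.gene_start <= h.gene_end <= h.region_end)
```

Parsing the coordinates this way makes it straightforward to confirm that each gene falls inside its prophage region or to sort the proteins by position, since the records are not listed in genomic order.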
DBSCAN-SWA_7 |
4069032 : 4075099
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >NZ_CP007598|4069032:4075099|DBSCAN-SWA CATGGATAGCCTTAACGACGATAAAATTAACCGGCAAAGCAGTGACCTGGAAGTTGAAAGCGAAGAAAAACAAAGCGGTAAAGAGATTGAAGTGGATGAAGATCGTCTTCCTTCCCGCGCCATGGCGATTCATGAACATATTCGCCAGGATGGTGAAAAAGAGATGGAACGCGATGCGATGGCTTTGCTCTGGTCAGCCATTGCCGCAGGACTTTCTATGGGGGCATCACTCCTGGCGAAAGGGATTTTCCACGTGCAGCTTGAAGGCGTTCCCGGCGGCTTTTTACTGGAAAATCTCGGCTATACCTTTGGTTTTATCATTGTCATCATGGCCCGCCAGCAATTATTTACTGAAAATACCGTTACCGCCGTGTTGCCGGTAATGCAAAATCCCACTCTGAGTAACGTTGGCCTGCTGATGCGCTTGTGGGGCGTAGTCTTATTGGGCAACCTTATTGGCACCGGGGTTGCGGCGTGGGCATTTGAATATATGCCTATATTTGATGAAGAGACCCGCGACGCCTTTGTCAAAATTGGTATGGAGGTCATGAAAAATAGCCCAACGGAGATGTTTGCCAACGCGATTATCTCTGGCTGGATCATCGCCACAATGGTATGGATGTTTCCTGCAGCAGGCGGGGCAAAGATTGTGGTCATTATTTTGATGACCTGGCTTATCGCGCTGGGCGATACCACCCATATTGTCGTCGGTTCCGTTGAAATTTTGTATTTGGTTTTCAACGGCACGCTGCCCTGGAGCGACTTTCTCTGGCCCTTCGCCCTTCCCACACTTGCCGGAAATATCTGCGGCGGCACTTTTATTTTCGCATTGATGAGCCACGCGCAGATCCGCAACGATATGAGCAATAAGCGTAAGGAAGAAGCGAGGCTACGCGGCGAGCGCCTGGAACGGGAGCGGAAAAAAGCGGAAAAACAGCGCTGAGTGGCGACGGTTTAACCAGTCAGGCGGCGCTACACTTACTCCGGCGAATAAAAAACGCTATACTGGCGCCGCGTTGTCCCCTTAGTTAAATGGATATAACGAGCCCCTCCTAAGGGCTAGTTGCAGGTTCGATTCCTGCAGGGGACACCATGACCTCTTTTACCCACCTCTCACAAATCTCAGCAACCCCACACCACACAAGGCGTTGCAGCGTTTTTGATGTCTTATGATTTCCAGCGAATAGTGCCTATAAACATAACGAAAAGCATCCCTTTAAAGTACCCTCTCGCTGTTAGAACTTAACCAAAAGTTAGCACCTTCACTTTCCGACAGATATAGGCCCAGAACTTCTTTTTTGCCTTCCAGATTCAGTGCCAGAACGGTATAAACCGCCTTGCTCTGATAACGGCCATCCTCACGGTTTTTATAATGAATAGCGTCCAGCCAGACGACGGGATAAACCTTCTCCAGCGGGCGCTGTTGCCACTGTTTTAGTTCAGGGATAACTTTATCGGTACTGCACTGACGGTGGCGGTTGAAACGCTGAAGGCATAAAGATCTTCACTCTCCCGGCCGATGTCCTGATAACTCATCTCCAGCGCAAACAGTCGAATGATATTGCGCTCGATCTCGTCGTACAGGGGGTCTGAAGCTTCTTCACCAGTTATGGCTCAAAAGTGCCGTTACGATTGCGCGGAGTTGCCAGTTCAAAACTGCCTGTTGGGGCTTTAATGGCTTTTTGCCGGAACCATTTTTACGGTTTGCCTCAACATCCTGAGCCAGATGGGAATCAAGTTCAGCAGACAGGGTAGACTCGGTTAAATACTTGATTAATGGCGTTAAGATGCCATCTTTGCCCGTTAATGCCTGGCCGGACCTGAAGGGCTTTAAGTGCTTTGTCGAAATCGAAGGGATGGGACATGTGCCATTCTTTTTTATTTTATGTTACTAAAATTATACAGAATTTTTAACGCTCCCCC
TCCCCCCAGCACTTCCACCCTTTCAAGTACCTCTCCCTGAAAAAAAATGCAAAGCCTTGTAAGACGATGCAAAGCTTTACATGTCCCGTTTTTATTCCAAGACGCTTGGCAATCAGCAATACCAATTGATCGATAACATCGATCAATATATTAAAACTCAATAGTTTAAAACTATTAAAAATACAATTATTGATCGCTTATATCGATCAAACCAATTTGTAGTGCTACACTCCAGACCTTTCTGAATCTGATAATTTTCATAATGTTGAAGTTATTCGCTAAGTACACATCGATCGGTGTTCTTAACATGCTCATTCATTGGAGAGTATTTGCTTTCTGTATGTATGGGATGCATACGCATCAGGCGCTGACGAACTTTTCCGATTTTGTTATCGCTGTATCGTTCAGCTTCTATGCTAATGCGCGCTTCACCTTTAATGCCAGCACTACTGCAATTCGCTACATGATGTATATGGGATTCATGGGAGCACTGAGCGCTGTTGTTGGATGGATGGCTGACCAATGTTCTTTGCCACCATTGGTTACCCTCATCACTTTCTCGGCAATTAGCCTGGTATGCGGCTTTATCTATTCCAGATTCTTTATCTTCAGGGATAAAAATGAAAATCTCTCTTGTCGTCCTAGTTTTTAACGAAGAAGACACGATACCGATTTTCTATAGAACGGTACATGAGTTTAATGAACTTGAAAAATATAAAGTTGAGATTATTTTTATTAATGACGGAAGTAAAGATGTGACAGAGTCAATAATTAAAATAATAGCCGTATCTGATCCACTCGTCATTCCGTTTTCGTTTACACGAAACTTCGGGAATGATGCAAGATGCACAACCATTTTATTAATCTTTTTTTAAAATTGAGGTAATTTAAGTTGGAACACTTAAAATACAGACCTGATATAGATGGATTACGCGCAATAGCGGTTTTATCTGTGGTAATATTCCATTATTTCCCATCATTATTGCCGGGTGGGTTTGTTGGAGTAGATATATTCTTTGTGATATCTGGATACCTTATAACATCAATAATATTAAAATCTGCATCAAACAAATCATTTTCATACCTTGATTTCTATAAAAGGAGAGTGCTTAGAATATTTCCAGCATTATCCATAGTTCTTGTATCATGTCTTATTGTTGGCTGGGTTTATTTATTCCAGGATGATTACAAATTACTTGGTAAGCATGTTTTTAGCGGCTCATTCTTTATATCAAACTTTACTCTTTGGAGTGAGTCTGGCTATTTTGATTCAAAGTCATACCTTAAACCTTTACTACATTTGTGGTCACTGGGAATTGAAGAGCAATTTTATATAATATGGCCAGTAGTTATATTGCTATGCTTTAGAAGCAAAAACCATAACAGAAACATAGTATTATCATGCGCAACTATATTTATAATTAGCTATGCGATTAGCATTTTTACAATGGCATCTGATGGCGGAGCTAATTACTACTCTCCCGCATCAAGATTTTGGGAGTTAATGGCTGGAGCGATTATATCCACATTGAGATTTATAGGAATAAACACTTCGTTATCAAAATTAATGTCCCTGTTAGGAATTATACTAATCGCATTATCAATAACCATGATAGATGAAAAGATGTCATTTCCTGGATATATAGCAATAATCCCAGTACTTGGCGCCTCTCTTATAATAGCATCTAATGGTAATGATTTAGTTGTGTCGAAATTGCTTAGTGTTAGGCCTGTTGTTTTCTTTGGTCTTATTAGCTATCCTCTTTATTTGTGGCATTGGCCTATTTATTCATTCTATCGTTCAATATTTGCTGGTTCACCAGACTACCATGAATTAATTCTTCTTTTATTATCATCGTTCTTTTTGGCGATATTAACTTATTATTTAATTGAAAAACCACTGAGAAATGCCAGAAATAAATATATCACAGCAATATTATTAGCATTATCAGTATTTGGGACAGGTTTAATTGGCGCATTTATTTTTCATATAAATGGAG
TTAAAGACAGGGAAATCAATAAATCAGCAGGTGAATATGCTTCTGTTACTGACGTGTACAATTATTATAAATATGGAGAACTACTCCGTGGAGGGATATGCCACTCAGTACAACTTACTGCTGCCATATCCAATGGATGTATAAAAAATGGCAAGCATAATATATTTATCATTGGTGATTCTTATGCGGCGGCTCTTTTCAATGGACTTTCTCATTATATAGATAATAAAGGTTCTGATTATATAATAAGCCAAATGACAGATGGTAACGCTCCTCCTCTATTTGTTGACGGTAAAGATGATTTACAGAGAAGTGTCATCACTCTAAACAATAATAGAATTAATGAAATTAAACGTGTTCAGCCTGAGGTGGTTCTGCTGACATGGTCAGTTCGAGGAACAAATGGAGTACATGATAAAAAGTTAGCAATTGATACGTTATCATTAACCATAAAAAAAATTAAAGAGGCATCCCCTGACTCAAGGATTATTTTCATTGGACCAGTCCCGGAATGGAATGCAAATTTAGTTAAAATAATATCTAACTACCTGAGTGAGTTTAAAAAAACCCCACCATTGTATATGACATATGGATTAAATAGTGAAATAAGCGAGTGGGACTCTTACTTTAGTAACAATGTTCCAAAAATGGGAATTGAATATATATCAGCATACAAAGCATTATGTAACGAAAGTGGATGTCTTACAAGAGTTGGTAATGGTCCTGATTTTATCACTGCCGTTGATTGGGGACATTTAACAAAGCCTGGTTCTGATTTCCTTTTTAATAAAATCGGAAATAAAATAATCAAATAGATAGGCTGTTACTATTACATATAAATCCAATATAGAACCTGCCAGTCATACTGTGTAACTGCCACTATATTAACGGTGATCGCTCAGGCGGTCACCGAACTCGATAATAAAGCGACTCATTGCCAGCGACCAGTCCTGGATCGGCATACTCCATTTTTTGACGCCCCTTTGACCAACTCTTTTACTGCGGCGGTGATCTTAAGTATCAGCATGGGCGACACATCAGCGTTGTACATCTCTTTGAAGGTGGCGATGATTTCGCGGGTAGTCATGTCATGCCTTTGGCATAAAGGGATAAATCTGACTGTCCATCTCGTAATACGCGTCTGATGCTCCTTAATCAGCTGCAGTTCAAAGGTATTTTCACGGTCACGCAGCATGTTCAGCACTATCTCGCCGCCATCATACAGCACCGTTTTTAATGAGTAGCCATTGTGGGAGTTTGAGCTCGTTTTAGGAGCATTTTTCTCATGCCCGAGATGGTCAGCCAGCTCAGCATTGAGCGCCGTTTCGACGGTTAATGTCGTCAGCATATGGGAAAACTGACTGAGGTCGGCTTCAGTTTTAAGACCTTTAGCCAGTTCAGCCGCAAGAGCTGTGAGTTTCTTTTCGTCCATAATTGCCTTTCTCCTTTGCTGGAGTGAATATATAATAATCAGACAATTACATAATTTTTAATTACAATCTCCCAATACTTATATTTTACCCCCATATTCATTACACGTTCAAAGTGTTGTTAACGTTAATCTAACTAATTATCCATTAGGTATTTCCCCTAAAGACCTTAACGAAGCCGAATTATTTTTGCATGTGACACAAAACATGAGTAATCTTTTATAAATTTATCTGACACGATATTAGAACTAACGACATCTCAACTACCATACAATCTTTATAGTTAACTATTAAGGATGCGGGGTCTGACAGATTACCAGATGGTGTTTGTTTTATGTCAATGAGTCTCAAACCATTAATAATGCAACCACTTTCTACAAACAATTAGTTAGTGTTAAAGTTTTAGTATCGATTACAGAAATATATGTGAATGCCTTCCCATATGTGTACCATAGATCACCAGACCCTGCGCAATCTTGCATAAGAATGTTCGATACATAACTTCCCCAGCCATCCATACCAAGACCAACACTCTATGAACCAATGGCAACGAGACCATCGACTAAATGA
TTTGTTGGTAGTTGGTAGTTGGTAGTTGGTAGTTGGTAGTTGGTAGTTGGTGTACCGGATATTCTGATAGCGAATAGTTGCCTGGTCTATCTTTTTCATTTCCGATATCTGCCCAAAGGTTGAAGCAGTCCCACAAAAAGTATAT
Protein sequences of DBSCAN-SWA_7 >NZ_CP007598|4069032:4075099|4071574_4071829_+|WP_000703599.1|DBSCAN-SWA MKISLVVLVFNEEDTIPIFYRTVHEFNELEKYKVEIIFINDGSKDVTESIIKIIAVSDPLVIPFSFTRNFGNDARCTTILLIFF >NZ_CP007598|4069032:4075099|4069032_4069974_+|WP_000377779.1|DBSCAN-SWA MDSLNDDKINRQSSDLEVESEEKQSGKEIEVDEDRLPSRAMAIHEHIRQDGEKEMERDAMALLWSAIAAGLSMGASLLAKGIFHVQLEGVPGGFLLENLGYTFGFIIVIMARQQLFTENTVTAVLPVMQNPTLSNVGLLMRLWGVVLLGNLIGTGVAAWAFEYMPIFDEETRDAFVKIGMEVMKNSPTEMFANAIISGWIIATMVWMFPAAGGAKIVVIILMTWLIALGDTTHIVVGSVEILYLVFNGTLPWSDFLWPFALPTLAGNICGGTFIFALMSHAQIRNDMSNKRKEEARLRGERLERERKKAEKQR >NZ_CP007598|4069032:4075099|4074946_4075099_-|WP_115367774.1|DBSCAN-SWA MYFLWDCFNLWADIGNEKDRPGNYSLSEYPVHQLPTTNYQLPTTNYQQII >NZ_CP007598|4069032:4075099|4074758_4074902_-|WP_105789228.1|DBSCAN-SWA MDGWGSYVSNILMQDCAGSGDLWYTYGKAFTYISVIDTKTLTLTNCL >NZ_CP007598|4069032:4075099|4071216_4071606_+|WP_001576268.1|DBSCAN-SWA MLKLFAKYTSIGVLNMLIHWRVFAFCMYGMHTHQALTNFSDFVIAVSFSFYANARFTFNASTTAIRYMMYMGFMGALSAVVGWMADQCSLPPLVTLITFSAISLVCGFIYSRFFIFRDKNENLSCRPSF >NZ_CP007598|4069032:4075099|4071846_4073769_+|WP_000400616.1|DBSCAN-SWA MEHLKYRPDIDGLRAIAVLSVVIFHYFPSLLPGGFVGVDIFFVISGYLITSIILKSASNKSFSYLDFYKRRVLRIFPALSIVLVSCLIVGWVYLFQDDYKLLGKHVFSGSFFISNFTLWSESGYFDSKSYLKPLLHLWSLGIEEQFYIIWPVVILLCFRSKNHNRNIVLSCATIFIISYAISIFTMASDGGANYYSPASRFWELMAGAIISTLRFIGINTSLSKLMSLLGIILIALSITMIDEKMSFPGYIAIIPVLGASLIIASNGNDLVVSKLLSVRPVVFFGLISYPLYLWHWPIYSFYRSIFAGSPDYHELILLLLSSFFLAILTYYLIEKPLRNARNKYITAILLALSVFGTGLIGAFIFHINGVKDREINKSAGEYASVTDVYNYYKYGELLRGGICHSVQLTAAISNGCIKNGKHNIFIIGDSYAAALFNGLSHYIDNKGSDYIISQMTDGNAPPLFVDGKDDLQRSVITLNNNRINEIKRVQPEVVLLTWSVRGTNGVHDKKLAIDTLSLTIKKIKEASPDSRIIFIGPVPEWNANLVKIISNYLSEFKKTPPLYMTYGLNSEISEWDSYFSNNVPKMGIEYISAYKALCNESGCLTRVGNGPDFITAVDWGHLTKPGSDFLFNKIGNKIIK |
6 | Salmonella_virus(50.0%) | NA | NA |
Homologous phage analysis in the prophage region. Bacterial proteins shown in color are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_8 |
4304485 : 4404431
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >NZ_CP007598|4304485:4404431|DBSCAN-SWA CTCACATAAACAGATAAAATGCCTGGGTCAGCGCCGTATAACTTTCAGAATAATGCTGGTCTGACCCGCGAATTACTAATCTGTCGCTAAAGCACTCCCCGGCTTGCGGGGAGAATGCGAGCAAAACCCGGTGCGGCAAGCGCGCTTCGTTTTCCGCAACGTCAGTACGTAGCCGCAAATGCCATCCCATGTTGAGCGCTTGCTGCGTAAAAGCGTTACCTATTTGTTCCGGCAAGACTACGCAAAAAAAGCCATCTTCGGTAATACAATCCGCCGCAATAGCCAATAACGTCTGGTGATCCAGCGTAGCGGTGTAACGCGCCTGCTCCCGCTGCGGCGTGGCGCATTCCACACCGGGTTCGTAATAAGGCGGGTTGCTGATAATAAGATCAAAGCGTACCGTCTGGCGAGGCGCCCAACTCTGAATATCGTCAGTATGCACCGTTATTCGATGCGGCCAGGGAGAGTGAGCCACGTTTTCCTGCGCTTGCATGGCAGCCCCGGCATCCAGTTCCACGGCGTCAATAGGCACATTGTCATCCGTCCGCTGCGCCAGCATTAACGCCAGCAGACCGCTTCCAGTACCGATATCCAGAATTCGCTTTACATCCGCGACCGGCGCCCACGCCCCGAGTAAAATACCGTCAGTTCCCACTTTCATCGCACACCGATCGTGTGCGACAAAAAACTGTTTAAACGTAAATCCGTTACGACGAAGAACGGAGCCAGACTGAGACATGTAAAAACAACCTTGCAAAAAACGGCGACAGCGCCGAGTGAAAACAATACCTGAGAAGGGATATCCATACAAACAGATGAAGATTGCGGCCGTAACGTCTATAATCAGCGCCCCACACAGAGGTAGAACATGACTGTAACGACTTTTTCCGAACTTGAACTCGACGAAAGCCTGCTGGATGCCCTCCAGGATAAAGGTTTCACTCGCCCGACCGCCATTCAGGCTGCCGCCATTCCGCCTGCGCTCGATGGCCGTGATGTACTCGGTTCTGCGCCGACAGGCACCGGTAAAACGGCGGCATATCTGCTGCCGGCGTTGCAGCACCTGCTCGACTTTCCGCGTAAAAAATCGGGGCCGCCGCGCATTCTGATCCTGACGCCAACCCGCGAACTGGCAATGCAGGTTGCCGATCACGCTCGTGAACTGGCTAAGCATACTCATCTGGATATCGCCACGATTACCGGCGGCGTGGCCTATATGAACCATGCTGAAGTGTTTAGCGAAAACCAGGATATCGTGGTCGCCACGACGGGCCGTCTGCTGCAATATATAAAAGAAGAGAACTTTGACTGCCGGGCGGTTGAAACACTGATCCTTGACGAAGCTGACCGGATGCTCGACATGGGCTTCGCTCAGGACATTGAACATATCGCTGGTGAGACCCGCTGGCGTAAACAGACGATGCTGTTCTCCGCCACGCTGGAAGGCGATGCGATTAAAGATTTTGCCGAGCGCCTGCTGGAAGACCCGGTTGAGGTTTCCGCCAATCCGTCAACCCGCGAGCGTAAGAAAATTCACCAGTGGTACTACCGTGCCGACAATTTCGAACATAAAGTTGCGCTGTTGAAGCACCTGCTAAAACAAGACGATGCAACCCGTTCTATCGTGTTTGTACGTAAACGTGAACGCGTCCATGAACTGGCCGAAACGCTGCGTCTGGCAGGGATCAACAATTGCTATCTTGAAGGTGAGATGGCGCAGATCAAACGTAACGAAGGTATTAAGCGTCTCACCGACGGTCGCGTTAACGTGCTGGTCGCGACCGACGTCGCGGCACGTGGGATTGATATTCCTGACGTCAGCCACGTCATTAACTTCGACATGCCGCGCAGTGGCGATACCTATCTGCATCGCATTGGTCGTACAGGCCGTGCTGGCCGCAAAGGCACTGCAATCTCGCTGGT
AGAAGCCCACGACCATCTGTTGCTGCTGAAAATCGGTCGCTACATCGAAGAGCCGCTAAAAGCGCGCGTCATTGATGAGCTTCGCCCGACCACGCGTGCGCCCAGCGAAAAGCTGACCGGCAAACCGTCGAAAAAAGTGCTGGCGAAACGCGCCGAGAAGAAAAAAGAGAAAGAAAAAGAGAAGCCGCGCGTGAAGAAACGTCACCGCGATACAAAAAATATCGGTAAGCGCCGCAAGCCAAGCGGTACAAAAATGCAGGAGCAATCCAGCGAAGAGTGATGATAATGCCGGGTTCGTTAACCCGGCATTTTTTTATCTTCCACGGTAAAACACGCCTCCATACACTGTATAAATATCTGCATTGCCGGGCTGACCGCTTTCCCGGCATGATGGGCGCACAGCGCCATGATCGATAGCGACGGCGCGCCAAACGGCAGCTCTTTCAACTGACCGGTACTAAGTTCTCGTTCAACGGTAAAACGTGGTAAAAAGCTGATGCCCAGGTTAGCAGCCACACACTGTTTAATACTTTCAATACTCCACAGTTCAATGGTGTTTTCCAGCGTAATCCGCCTCTGCCGTAGCGTACTTTCAAATAGCTGGCGGAAGACACATTGCGGTTCATTGATGATGAAGCTGCAGGGGATATGCTGATCTGGCTGCGTAAAATCAGCATCCTGTAGCAGTGGAGAGGCCACCAGCGCCAGCGACTGCTCCCCCAGTTGCTGCATCGTGAGCGCGTCATCATTACCTACGCGATAAAAAACGCCCAGATCGACCTCGTCATTCAGCAACGCATCACGTATCACATAGCAATTCAGCGACTGTAGCGACAATTTGACATTCGGCGCTCGCAGCTTAAAACGTTGTAGAACCTGCGGCATTTTGTACGCCAGTAAGGTTTCACCTGTCGCCACGCGCAGCTCTCCGCCGGGTTCTGCATCCTGCCTTGCGGCTTCACGAATCAACTCCATCACACGCGTCAGTTCATGGATATGCGGCATCAGCTTTTTCCCCTCAGTGGTCAGGCACATTCGGCGCCCTATTTTCTCGAACAGTTGCAGGGAAAACTCACGTTCAAGCTGTTGAATATGAAAAGTTACCGTCGACTGAGTGCAGCACAGCTTTTGCGAAGCCCGCAGAAACGAGCCCTCTTCCACTACCGTTTTCAGCGTAATAAACCGACGCAGATCCATAACCACCAACCTATCGAAAATCTCGAATTCAGAATTAAAAAACATTCATTTTTTTAAATATTTCATATCGGGTACTGTCTGCTAAAACAGAGGAGATGACAAGTGACACCGATGCTTTTAAGCGCTTTCTGGACTTATACATTAATCACCGCCCTTACCCCCGGCCCAAACAATATTCTCGCGCTTAGCGCAGCAACTGCACATGGTTTTCGGCAGAGTATTCGTGTACTGGCAGGTATGAGTCTGGGATTTTTGGTCGTCATGCTGCTGTGCGCCGGGATCGCGTTTTCCCTTGCGGTTATCGATCCGGCAATCATTCATTTGCTCAGTTGGGTAGGGGCTGCGTATATCCTGTGGCTGGCGTGGAAAATTGCCACCAGTCCAGCGGCGGATGAGAAGGTCAGACCAAAACCGGTTGGCTTTTGGGTAAGCTTTGGCCTGCAGTTTGTGAACGTCAAAATTATTCTGTACGGCATTACCGCTCTGTCTACATTTGTTTTACCGCAAACGCAGGCGCTGAACTGGGTCATCGGCGTCAGTATATTGCTGGCATTGATCGGCACGTTCGGTAATGTGTGCTGGGCGCTGGCGGGACATCTGTTTCAGCGGGCGTTTCGCCATTATGGTCGCCAGTTAAATATCATTCTGGCGTTACTGCTGGTGTACTGCGCGGTGCGGATTTTTTACTGACCCAAATAAAACAAAAGCGGAAAAGGCCCGGCCTCTTCCGCTTGTTATTACCGGATAAGCCTTACAGGCTTTCGGTAAAGGTACGAGCGATAACGTCGCGCTGCTGTTCCGGC
GTCAGGGAGTTAAAACGCACCGCATAACCAGAAACACGAATAGTCAGCTGCGGGTATTTTTCCGGATGCTTAACCGCGTCTTCCAGAGTTTCACGACGCAGAACGTTAACGTTCAGATGCTGACCGCCTTCAACGCGTACTTCCGGTTTAACTTCCATCGGAATTTCCCGGTATTCAATCTCGCCCAGTTTGCTGACAGCCACAACTTCATCTTCAGCAAAGCCGGATTTTGCAACGATGCAGCGCGCTTCGCCTTTTTCGCTGTCCAGCAGCCAGAAGGAATTCAGCAGGTCGTCGTTAGCGGCTTTAGTAATCTGGATACCTGTAATCATGTGATGCCTCCCCAGGCAAAATTATTTGATTTCGGTCGGCCTAATCCCGGCCAATTGGTAAAACCATTGTTGCTTGAGTGTATATATACCCTTCGATCACCCTTCATTCTTTGATTTAAATCAATAAAAACCACACCATAAAAAGGGTGATGAAGGGAATTTATTGTTTTACATCAACTTACACCATTCAGACCTATAAACTTTTTACATAAATTTTCAACTATCTTTTGTAGGTAACCGTTTTTGTAGCACGCAGGATTTTGCACTGCAGAAAGAAGCAGTTAAGCTAAGGGAACTGGAAATTCGCAGGAGAGCGAGATGGCGACTGAGTTAACCTGGCACGATGTGCTGGCTGATGAGAAACAGCAACCTTACTTTATCAATACGCTTCACACTGTTGCTGGCGAGCGTCAGTCGGGTATCACGGTCTACCCGCCGCAAAAAGATGTGTTTAACGCCTTTCGCTTCACGGAACTGGGCGACGTTAAGGTGGTGATCCTGGGACAAGATCCTTACCACGGCCCCGGCCAGGCACATGGTCTGGCGTTTTCCGTGCGCCCTGGGATCGCTCCCCCGCCGTCATTAGTGAATATGTACAAAGAGCTGGAAGCCTCCATTCCCGGCTTTGTTCGCCCAGCGCACGGTTATCTGGAAAGCTGGGCGCGCCAGGGCGTACTGTTACTGAATACGGTGCTCACGGTACGGGCAGGCCAGGCGCACTCTCATGCCAGCCTGGGGTGGGAAACCTTTACGGATAAAGTGATAAGCCTGATTAACCAACACCGCGAAGGGGTCGTCTTCTTGCTGTGGGGCTCGCACGCCCAGAAAAAAGGCGCGATTATTGATCCACAGCGCCACCATATTCTGAAAGCTCCGCATCCCTCGCCACTCTCTGCGCACCGCGGATTTTTTGGCTGTAACCATTTTGCGCTAACGAACCAATGGCTGGAACAACACGGCGAGAAAACTATCGACTGGACGCCAGTATTACCGGCTGAGAGTGAATAACGTTGGCGGCTCGCCGTTTTATCGCCCGGATAACAGCGCGAAAGCGCCTTGTCTGACCGGGCGGTAGGAAGTACATAATCGGCCCGGCGCGCGTTGCGTCACCGGGCTGCAGCCCTTACGCTTTATTCTGTCGCCACCACTCGGCGAGCAGCACGCCTGTCGCAACCGACACATTCAGGCTTTCCACGTTCCCGGTGCCGTTAATTTTCACGCACAGATCGTCGGGTTCGCGCGCCGCTTCCGGCAGGTAGTCATATTCACGGCCTAAGACCAGCACCATCTTTTCCGGCAGGGTCGTACTAAAGAGCGCCTGACCGCGATCGCTGGACGTCGTCACCACGGTGTACCCCGCCTGACGAAAATCATCCAGTACGTCAACAATGCTTTCACCAGTAATCGGCTGGACATGCTCCGCCCCGCCTTCCGCAGTACGAATCGCCGCACCGGACTCCAGCAACGCCGCATCCTGTACCACCACGCCTTTCACGCCGAAGTGCGCGCAGCTACGCATCATGCCGCCCAGATTATGCGGGTTAGCGACATCTTCCAACGCCAGCACGCAGTCTTGATCCGCCGCCTGCTTTACCCATTGCTTAACGGTCGTACCGTTACGCTTTTTAATCAGGAAACACACGCCACCGTGGTGCTCAGTCCCGGAAG
CTTTTGCCAGTTCGGCTTCATCCACCACATGGTAGGCTTTACGATTCGCCGCCATCCAGCGCAGCGCCTCTTTAAAACGCGGCGTTACGCTTTGGATAAACCATGCGCGCACGATGGCGTCAGGACGGCTCTGGAATAGCGCCTGGCAGGCGTTTTCACCGTAGACGCGCGTCTCTTCCGCACGCTGGCGACGCAAAACTTCAGGGTCAATAAAACTTTTACCGCTAATACCACCATGATCGACTTTCTCAGGCGTTTCATCGCCTGGCGCGCGGGACACGGTGCGCCACGGCGAAGTTTCATGTTTACGGTCACGGCTCTGATTATTTCGCTCATCACGGGCGGGGCGACGTCCACCGTCAGTGCGAGACTTTGCTGGGCGGCCTCCCCCTTTCCCGGTACGCGGGTTATGGGTGCGTTTATCAGAGTCATCATCACTACGGACATACATCACTTTGACTTTGCCGCTTTTGTTTTTTAATTCATCGTTCATGCTTTTCTCCACCAGCGCTGCGCGAAGCGCGAAGATTACCCGATGTGCATCCGGTTAGCCATGATTTCGTTTAAAAGCTTGCGACTATTGTACTCATTGAATAAAACACATTGTTGTTTAGCACAGACTCCTCGATAATATGTAACATTATAGAAACATCCCGCGCGTAGCGGGACGTCTTCCGACGTATTCAGAGGTTAGCTATGAACACCGTTTGTACCCATTGTCAGGCCATTAACCGCATCCCCGACGATCGGCTTCAGGATGCGGCAAAATGTGGACGCTGCGGTCATGAGTTGTTTGACGGAGAAGTCATTAATGCGACCGGCGAAACGCTGGACAAACTGTTGAAGGACGATCTTCCAGTGGTGATCGACTTTTGGGCCCCCTGGTGCAGCCCTTGCCGTAACTTTGCGCCAATCTTCGAGGATGTCGCAGAAGAGCGCAGCGGTAAAGTCCGTTTCGTCAAAGTGAATACCGAGGCGGAACGGGAACTCAGCGCCCGTTTCGGTATCCGCAGCATCCCGACGATTATGATTTTTAAACATGGACAGGTGGTCGATATGCTTAATGGCGCAGTGCCTAAAGCGCCTTTCGACAGCTGGCTGAACGAAGCGCTGTAAACGCCACGGGGCGCATCTTGTGCCCCGTTTTCACCTCTGCCACAATAGCGTTTTTCTCCGCAAACGCTTTCTATGACCAATAACGCTGTCCTTCAGTTACGCGCCGAACGGCTTGCGCGCGCTACTCGCCCTTTTCTTGCCAGAGGCAATCGGGTTCGCCGCTGTCAACGCTGCCTGCTGCCGCTAAAATCTTGCCTGTGTGATACGTTAACGCCTTCTCAGGCGAAAAGCCGTTTTTGTCTGGTCATGTTTGACACTGAACCGATGAAACCCAGCAATACCGGCCGGCTGATAGCCGATATTTTGCCGGATACCGCGGCGTTTCAGTGGTCGCGTACCGAGCCGCCACAGGCGTTGCTGGAGTTGGTACAGCATCCGGATTATCAACCGATAGTGGTCTTTCCTGCGTCCTACGCCGGCGAAGCGCGCGAGGTCATTTCCACCCCGCCCGCCGGGAAACCGCCGCTGTTTATTATGCTGGACGGCACCTGGCCCGAAGCACGTAAAATGTTTCGTAAAAGCCCTTATCTGGATCATCTGCCGGTCATTTCCGTCGATTTATCGCGTCTTTCAGCTTATCGTCTGCGTGAAATCCATGCCGAAGGGCAATATTGTACCGCCGAAGTCGCGATAGCGTTGTTAGATCTGGCGGGAGATACCGAGGCGGCAACCAGTTTAGGTGAACATTTCACCCGTTTTAAAACGCGCTATCTGGCAGGAAAAACGCAACATCCGGGAAACGTCACAGCATAAAATAGCGAAAGCGTTTAAAATTATCCGGTCACTTCTGTGTAAGGGAAACCGGTATGAGCCAGCAAGGACTGGAAGCGCTACTGCGGCCAAAATCGATCGCGGTGATTGGCGCATCAATGAAGCCCCACCGC
GCGGGTTACCTGATGATGCGTAACTTGTTGGCGGGCGGATTCAATGGCCCCGTCCTTCCCGTGACGCCCGCCTGGAAAGCCGTTTTAGGCGTCATGGCCTGGCCGGATATCGCCAGTCTCCCTTTCACCCCCGATCTGGCTATTTTATGCACTAACGCCAGCCGTAACCTGGCATTACTGGACGCGCTTGGCGCGAAAGGGTGTAAAACGTGCATTATCCTTTCTGCTCCCACGTCGCAACATGAAGAACTTCTTGCCTGTGCCCGGCATTATAAAATGCGTCTGCTGGGTCCAAACAGTCTTGGGCTCCTCGCGCCGTGGCAAGGGCTGAATGCCAGCTTTTCTCCCGTCCCGATTAAACAGGGCAAGCTCGCTTTTATTTCCCAGTCTGCCGCCGTTTCCAATACTATTCTTGACTGGGCGCAACAGCGTGAAATGGGCTTTTCCTACTTTATCGCGCTGGGCGATAGCCTGGATATTGATGTCGATGAACTACTGGACTATCTGGCGCGCGACAGCAAGACCAGCGCGATTTTGCTCTATCTGGAACAGTTAAGCGACGCCCGCCGTTTTGTTTCCGCCGCCCGTAGCGCTTCACGTAACAAACCGATTCTGGTGATTAAAAGCGGCCGAAGCCCGGCAGCCCAGCGTTTACTTAATACCAGCGCGGGAATGGACCCTGCGTGGGATGCGGCCATCCAGCGCGCAGGCCTGCTGCGAGTCCAGGATACGCACGAGCTTTTTTCCGCCGTCGAAACACTGAGCCATATGCGTCCGCTACGCGGCGACAGACTGATGATCATCAGCAATGGCGCAGCGCCTGCCGCGCTGGCGTTAGATGAGTTGTGGTCGCGTAACGGCAAGCTGGCGACGTTGAGCGAAGAGACCTGCCTGCAACTACGGCAGGCGCTTCCCGCGCACATTGATATTGCCAATCCGCTGGATCTGTGTGATGACGCCAGCAGCGAACATTACGTCAAAACGCTGGATATCCTGCTCGCCAGTCAGGATTTTGACGCGCTTATGGTTATCCACTCTCCCAGCGCTGCCGCGCCGGGTACGGAAAGCGCCCATGCCCTGATCGAGACGATTAAGCGCCACCCCAGAGGCAAGCTTGTTACGCTGCTGACAAACTGGTGCGGCGAGTTCTCGTCTCAGGAGGCAAGACGGCTATTCAGCGAAGCCGGATTACCAACCTACCGTACGCCGGAAGGCACGATTACCGCGTTTATGCATATGGTGGAATACCGGCGTAACCAGAAGCAACTGCGAGAAACGCCAGCGTTGCCGAGTAACCTGACGTCCAATACCGCTGAGGCGCATAATCTGTTACAGCGGGCGATTGCGGAAGGCGCCGCCTCACTGGATACCCATGAAGTACAGCCGATTTTACACGCCTATGGGCTGCACACGCTCCCAACCTGGATTGCCAGCGACAGCGCTGAAGCGGTGCATATCGCCGAACAGATAGGCTATCCGGTAGCTCTCAAGCTGCGCTCGCCCGACATTCCGCATAAATCTGAAGTTCAGGGGGTCATGCTTTACCTGCGGACCGCAAGCGAGGTACAACAGGCCGCGAACGCCATTTTTGATCGTGTAAAGATGGCCTGGCCGCAAGCGCGGATTCACGGTTTGCTGGTACAAAGCATGGCTAACCGCGCCGGCGCGCAGGAGCTTCGTGTGGTGGTCGAGCACGATCCGGTGTTTGGTCCTTTGATTATGTTGGGTGAAGGCGGCGTAGAGTGGCGTCCGGAAGAGCAGGCTGTCGTCGCGCTGCCGCCGCTCAACATGAACCTGGCGCGCTATCTGGTGATTCAGGGCATTAAACAGCGGAAAATTCGCGCCCGTAGCGCGCTGCGTCCGCTGGATATTGTCGGTTTAAGCCAATTGCTGGTCCAGGTTTCAAACCTGATTGTCGACTGCCCGGAAATTCAGCGTCTGGATATCCATCCGCTGCTGGCTTCCGCCAGTGAGTTTACCGCGCTGGATGTGACGCTGGA
TATTGCCCCGTTTGATGGCGATAACGAAAGTCGACTTGCGGTACGCTCCTATCCCCACCAGCTTGAAGAGTGGGTGGAGATGAAAAATGGCGATCGCTGCCTGTTCCGTCCTATCCTGCCGGAAGATGAGCCCCAACTGCGACAATTCATCGCACAGGTCACCAAAGAGGATCTTTACTACCGTTATTTCAGCGAGATCAACGAATTCACCCATGAAGATTTAGCCAACATGACGCAGATCGACTACGATCGAGAAATGGCCTTTGTGGCCGTGAGGCGGATGGACAATGCTGAAGAGATCCTCGGCGTAACGCGCGCGATCTCCGATCCTGACAACGTAGATGCCGAATTTGCCGTATTGGTACGTTCAGATCTCAAAGGGTTGGGTTTAGGACGCCGTTTAATGGAGAAATTGATTGCCTATACTCGCGATCACGGATTGAAGCGGCTGAACGGTATTACGATGCCAAACAATCGCGGCATGGTCGCGCTGGCCAGAAAACTGGGATTTCAGGTCGATATTCAGCTCGAAGAGGGCATCGTGGGATTGACGCTGAATCTGGCCAAATGTGATGAATCGTGAGTAATGTACTGGAAATGTTGACCACTTTAACGGGTACTGATGTTATTATTGCTCGCTTATGTCGTCTGCATAGCACAGAGGACCCTTCAATGAACAGAGAAGAAATGCACTGTGATGTTGTCAAAATTTAAGCGTAATAAACATCAACAACACCTTGCCCAACTCCCTAAGATTTCTCAGTCAGTTGATGATGTAGATTTCTTTTATACTCCTGCTACTTTTCGGGAGACACTGCTGGAGAAAATCGCCAGCGCGACGCAGCGTATTTGTATCGTTGCCCTGTATCTGGAACAGGACGATGGGGGTAAAGGTATACTCGACGCGCTCTATGCGGCTAAACGGCAGCGTCCTGAACTGGACGTGAGGGTGCTGGTCGACTGGCATCGCGCGCAACGCGGGCGTATTGGCGCAGCGGCCTCGAATACCAATGCAGACTGGTATTGCCGACTGGCGCAGGAAAACCCCGGCATTGATGTCCCGGTTTACGGCGTACCGATTAATACGCGAGAAGCGCTTGGCGTACTGCATTTTAAAGGCTTTATCATTGATGATAGCGTCCTGTACAGCGGCGCCAGCCTGAACGATGTCTATCTCCATCAGCATGACAAATATCGCTACGATCGCTACCAGCTGATTCGCAATCGGCAAATGGCAGACATCATGTTTGACTGGGTGACGCAAAATCTGATGAATGGCCGTGGTGTGAATCGTCTGGATAATACCCAGCGGCCGAAAAGCCCGGAGATTAAAAACGATATCCGCTTATACCGTCAGGAGCTGCGTGATGCGTCATATCATTTTCAGGGTGACGCTAATGACGAGCAACTCTCCGTTACGCCGCTGGTCGGGCTTGGCAAATCCAGTCTGCTGAACAAAACCATTTTCCATCTCATGCCCTGTGCGGAGCATAAGCTCACCATTTGTACCCCTTATTTCAATCTACCAGCGGTGCTGGTGCGGAATATTATTCAGCTACTGCGCGACGGAAAGAAAGTCGAAATCATTGTTGGCGATAAAACCGCGAATGATTTTTACATTCCGGAAGATGAACCGTTCAAGATCATTGGCGCGTTGCCCTATCTCTATGAGATTAACCTGCGCCGCTTCCTGAGCCGTTTACAGTATTACGTCAATACCGATCAGCTTGTGGTGCGTCTGTGGAAAGATGACGACAATACCTATCATTTGAAAGGGATGTGGGTAGATGATAAATGGATGCTACTCACCGGCAACAATCTGAATCCCCGCGCCTGGCGTCTGGATCTGGAAAACGCCATTCTGATCCACGATCCTAAGCAGGAGCTTGCCCCTCAGCGGGAAAAAGAGCTGGAACTTATCCGAACGCACACAACCATCGTAAAACATTATCGTGACCTGCAGAGCATTGCCGACTATCCGATAAAAGTGCGTAAATTGA
TTCGCCGTCTGCGCCGGATCCGCATCGACAGATTAATTAGCCGTATCCTGTAATCGAAGCCCCGTCCTGTACGGGGTTTTCTCTTAGGAGTGAACTGGTGCGCGTGTTGATTCCTTTTACGGTGTTGTTTCTCTCCGGGTGCAGCCATCTGGCTAACGACCACTGGAGCGGTCAGGATAAAGCCCAGCATTTTATGGCATCAGCGATGCTATCGGCAGCCGGTAATGAGTATGCGCGTCACCAGGGAGTAAGCCCTGACCGCAGCGCAGCGATAGGGCTGATGTTTTCTTTGAGTCTCGGAGCCTCAAAAGAACTCTGGGACAGCCGCCCGGAAGGCAGCGGCTGGAGCTGGAAAGATTTTGTCTGGGATGTCGCTGGCGCGACAACCGGTTACGCTATCTGGCAAATGGCGCGATACTAAAGGCGTAAGCCCTTTCCTTTACGGTGCAACATCAACGAAACCAGGAAAGCCAGAACGGCCATCGCGGTAACATACCAGAAGAACGAACTCTCCATACCGACAGATTTTAGCGACAGCGCGACATATTCCGCAGAGCCGCCAAATAACGCGTTTGCTACCGCGTAAGATAAACCGACGCCCAGGGCGCGAACCTGCGCCGGAAACATCTCTGCTTTCAGGATCCCGCTAATCGAGGTATAGAAACTCACAATCAATAGCGCGCACATCACTAAAGCAAATGCGGCATAAGGCGACGTCACATTCTGCAACGCGGAGAGAATTGGAACGGTGAAAAGCGTCGCGAGTGCGCCAAAACATAGCATAGAGGTGCGCCTGCCAATCTTATCGGATAACGCGCCGAATAACGGCTGCACAAACATGAAAACACACAGCGCGACCGTCATAATGCCGCTGGCGACATTGGCATGCATACCTGCTGTATTTACCAGATATTTCTGCATATAGGTGGTGAAGGTATAAAAGCACAGCGAGCCTGCCGCGGTGAAACCAAGCACCATCAAAAAGGCTTTGCGATTACGCCACAAACCTTTCAACGAACCGGCTTCTTTTAATGCGCGAGTCTCATGCTTCGAGGTTTCATCCAGCTGGCGACGTAGCCATAACGCCACAACCGCCAGCACGGCGCCCAGCGCAAAGGGAATACGCCATCCCCAGGCACGAAGCTCAGCATCCTCCAGAACCTGCTGCAAAATCACAACGACCAGCAATGCCAGTAGTTGGCCACCGATAAGCGTAACGTACTGAAACGACGCGAAAAATCCCTTACGACCTTCAAGCGCAACTTCACTCATATATGTCGCGCTGGTCCCATACTCTCCCCCTACCGAGAGCCCCTGAAATAACCTTGCAATAAGTAGCAGCGCCGGTGCCCAGGTGCCGATGGTATCGTAACCAGGTAAGCAGGCAATCACCAGCGACCCCATGCACATCATGCACACCGAAATCAACATTGAGGCTTTGCGCCCGCGTCGATCGGCAATGCGTCCAAACAGCCAACCACCGATAGGCCGCATTAAAAAACCGGCGGCAAAGACGCCTGCCGTTTGCAAAAGCTGAGTAGTGGTATTACCAGAAGGAAAGAAGATATGTGCAAAATAGAGCGAGCAGAAAGAGTAGACGTAGAAATCAAACCATTCCACTAAGTTTCCGGAAGAGGCCCCCACGATAGCCCATATGCGTCGCCGCATCTCCGGAGCCTCAGCGTTATTTTTATCTATACGCTGAATAATTTCTGCCATACTGTTCTCCTTGTTGCCAGGCTTAATCGGCCTTTTATTTTGGAGATTCAGATTACGCGAGACCCAAAAGTAAATAAAGTGTTAATTATTTGTTTGCTAATTTTTATCCCCACCGTGCCTGATGCTGGCGTTTCCGTTTCAGATAGTCTTCGTCCTGACGGATTTTATCCCACCAACCTTTGAAAACCGTTGCCGCCGCGGCCTTAATAGGATCATTACCCCGTGTATTCAGGCTGCCGTCTTCTGCATACTTTTTACCGCCTTTATAGTTAGCGTATCGC
CTGGCGCGGGTGTATCCCATCTGAATAAATTTACGCGCCATGTCCATACCAACGAAATCATCCTGCTGCCGGTAAGCTTCGAACAACTGGTAGACCTGTTCTGCGGATTTCATCGCCGATGCTTCATCTTTATAGCGCCAGAAAGGAAGAATTTCGCTTTTGTAGGGTTCAACCAGTAGCACCCCCTGCTCGCCTCGTCCAACCTGATACAATTCAGGCTGCTGGCGGAAGTCAATGCTGGAAAAGTCCTGCTGGTAGTTAAAAGGTTTGATAGCCAATGAAAATCCTCACTGCTTATATAAGCACGTGTAAAGGATAGTTCATCGTTAACGATTGGGTGGGATGTACTCTTGTCGGTTCTACAAGAACACCAGCTTCGTAGGGGGGATAATTCCTGTAGGCCGGATAAGGCGGAGCCGCCATCCGGCAGAAACAGACAAAAACAAAAGGCCCTGTCTTCCGACAGGGCCCTTCGTTTTATTTGATGCCTGGCAGTTCCCTACTCTCGCATGGGGAGACCCCACACTACCATCGGCGCTACGGCGTTTCACTTCTGAGTTCGGCATGGGGTCAGGTGGGACCACCGCGCTACTGCCGCCAGGCAAATTCTTTGTGCTCTGTACGCAATTCTTTATCGCATCAGCGGCGTTGCCTGCGCTCGTAAACTCAGTCACATACTTCTGTATGCTCCTTCCTTTCCTTCGCTTGCCGCCTTGCTGCTGCGCAAATTATTGCGTACTCAACTCTGAATCACTGCTGAAAATCGTCTTCTCATCCGCCAAAACATCTTCGGCGTTGTAAGGTTAAGCCTCACGGTTCATTAGTACCGGTTAGCTCAACGCATCGCTGCGCTTACACACCCGGCCTATCAACGTCGTCGTCTTCAACGTTCCTTCAGGAGACTCTAAGTCTCAGGGAGAACTCATCTCGGGGCAAGTTTCGTGCTTAGATGCTTTCAGCACTTATCTCTTCCGCATTTAGCTACCGGGCAGTGCCATTGGCATGACAACCCGAACACCAGTGATGCGTCCACTCCGGTCCTCTCGTACTAGGAGCAGCCCCCCTCAGTTCTCCAGCGCCCACGGCAGATAGGGACCGAACTGTCTCACGACGTTCTAAACCCAGCTCGCGTACCACTTTAAATGGCGAACAGCCATACCCTTGGGACCTACTTCAGCCCCAGGATGTGATGAGCCGACATCGAGGTGCCAAACACCGCCGTCGATATGAACTCTTGGGCGGTATCAGCCTGTTATCCCCGGAGTACCTTTTATCCGTTGAGCGATGGCCCTTCCATTCAGAACCACCGGATCACTATGACCTGCTTTCGCACCTGCTCGCGCCGTCACGCTCGCAGTCAAGCTGGCTTATGCCATTGCACTAACCTCCTGATGTCCGACCAGGATTAGCCAACCTTCGTGCTCCTCCGTTACTCTTTAGGAGGAGACCGCCCCAGTCAAACTACCCACCAGACACTGTCCGCAACCCGGGTAACGGGTCCACGTTAGAACATCAAACATTAAAGGGTGGTATTTCAAGGTCGGCTCCATGCAGACTGGCGTCCACACTTCAAAGCCTCCCACCTATCCTACACATCAAGGCTCAATGTTCAGTGTCAAGCTATAGTAAAGGTTCACGGGGTCTTTCCGTCTTGCCGCGGGTACACTGCATCTTCACAGCGAGTTCAATTTCACTGAGTCTCGGGTGGAGACAGCCTGGCCATCATTACGCCATTCGTGCAGGTCGGAACTTACCCGACAAGGAATTTCGCTACCTTAGGACCGTTATAGTTACGGCCGCCGTTTACCGGGGCTTCGATCAGGAGCTTCGCTTGCGCTGACCCCATCAATTAACCTTCCGGCACCGGGCAGGCGTCACACCGTATACGTCCACTTTCGTGTTTGCACAGTGCTGTGTTTTTAATAAACAGTTGCAGCCAGCTGGTATCTTCGACTGACTTCAGCTCCATGAGTAAATCACTTCACCTACGTGTCAGCGTG
CCTTCTCCCGAAGTTACGGCACCATTTTGCCTAGTTCCTTCACCCGAGTTCTCTCAAGCGCCTTGGTATTCTCTACCTGACCACCTGTGTCGGTTTGGGGTACGATTTGATGTTACCTGATGCTTAGAGGCTTTTCCTGGAAGCAGGGCATTTGTTGCTTCAGCACCGTAGTGCCTCGTCGTCACGCCTCAGTGTTAAAGTGAACCGGATTTACCTGGAACACACACCTACACGCTTAAACCGGGACAACCGTCGCCCGGCCAACATAGCCTTCTCCGTCCCCCCTTCGCAGTAACACCAAGTACGGGAATATTAACCCGTTTCCCATCGACTACGCCTTTCGGCCTCGCCTTAGGGGTCGACTCACCCTGCCCCGATTAACGTTGGACAGGAACCCTTGGTCTTCCGGCGAGCGGGCTTTTCACCCGCTTTATCGTTACTTATGTCAGCATTCGCACTTCTGATACCTCCAGCAACCCTCACAGGCCACCTTCGCAGGCTTACAGAACGCTCCCCTACCCAACAACACACAGTGTCGCTGCCGCAGCTTCGGTGCATGGTTTAGCCCCGTTACATCTTCCGCGCAGGCCGACTCGACCAGTGAGCTATTACGCTTTCTTTAAATGATGGCTGCTTCTAAGCCAACATCCTGGCTGTCTGGGCCTTCCCACATCGTTTCCCACTTAACCATGACTTTGGGACCTTAGCTGGCGGTCTGGGTTGTTTCCCTCTTCACGACGGACGTTAGCACCCGCCGTGTGTCTCCCGTGATAACATTCTCCGGTATTCGCAGTTTGCATCGGGTTGGTAAGCCGGGATGGCCCCCTAGCCGAAACAGTGCTCTACCCCCGGAGATGAATTCACGAGGCGCTACCTAAATAGCTTTCGGGGAGAACCAGCTATCTCCCGGTTTGATTGGCCTTTCACCCCCAGCCACAGGTCATCCGCTAATTTTTCAACATTAGTCGGTTCGGTCCTCCAGTTAGTGTTACCCAACCTTCAACCTGCCCATGGCTAGATCACCGGGTTTCGGGTCTATACCCTGCAACTTAACGCCCAGTTAAGACTCGGTTTCCCTCCGGCTCCCCTATTCGGTTAACCTTGCTACAGAATATAAGTCGCTGACCCATTATACAAAAGGTACGCAGTCACACCCAAAGGGTGCTCCCACTGCTTGTACGTACACGGTTTCAGGTTCTTTTTCACTCCCCTCGCCGGGGTTCTTTTCGCCTTTCCCTCACGGTACTGGTTCACTATCGGTCAGTCAGGAGTATTTAGCCTTGGAGGATGGTCCCCCCATATTCAGACAGGATACCACGTGTCCCGCCCTACTCATCGAGCTCACAGCACATGCGCTTTTGTGTACGGGGCTGTCACCCTGTATCGCGCGCCTTTCCAGACGCTTCCACTAACACACATGCTGATTCAGGCTCTGGGCTCCTCCCCGTTCGCTCGCCGCTACTGGGGGAATCTCGGTTGATTTCTTTTCCTCGGGGTACTTAGATGTTTCAGTTCCCCCGGTTCGCCTCATTAACCTATGGATTCAGTTAATGATAGTGTGACGAGTCACACTGGGTTTCCCCATTCGGGTATCGCCGGTTATAACGGTTCATATCACCTTACCGGCGCTTATCGCAGATTAGCACGCCCTTCATCGCCTCTGACTGCCAGGGCATCCACCGTGTACGCTTAGTCGCTTAACCTCACAACCCGAAGATGTTTCTTTCGATTCATCATCGTGTTGCGAAAATTTGAGAGACTCACGAACAACTTTCGTTGTTCTGTGTTTCAATTTTCAGCTTGATCCAGATTTTTAAAGAGCAAATATCTCAAACGTGACTCGTAAGTCAGTTTTGAGATATTAAGGCAGGTGACTTTCACTCACAAACCAGCAAGTGGCGTCCCCTAGGGGATTCGAACCCCTGTTACCGCCGTGAAAGGGCGGTGTCCTGGGCCTCTAGACGAAGGGGACGTATCAGTCTGCTTCGCAAGACGCCTTG
CTATTTACTTTTCATCAGACAATCTGTGTGAGCACTGCAAAGTACGCTTCTTTAAGGTAAGGAGGTGATCCAACCGCAGGTTCCCCTACGGTTACCTTGTTACGACTTCACCCCAGTCATGAATCACAAAGTGGTAAGCGCCCTCCCGAAGGTTAAGCTACCTACTTCTTTTGCAACCCACTCCCATGGTGTGACGGGCGGTGTGTACAAGGCCCGGGAACGTATTCACCGTGGCATTCTGATCCACGATTACTAGCGATTCCGACTTCATGGAGTCGAGTTGCAGACTCCAATCCGGACTACGACGCACTTTATGAGGTCCGCTTGCTCTCGCGAGGTCGCTTCTCTTTGTATGCGCCATTGTAGCACGTGTGTAGCCCTGGTCGTAAGGGCCATGATGACTTGACGTCATCCCCACCTTCCTCCAGTTTATCACTGGCAGTCTCCTTTGAGTTCCCGGCCTAACCGCTGGCAACAAAGGATAAGGGTTGCGCTCGTTGCGGGACTTAACCCAACATTTCACAACACGAGCTGACGACAGCCATGCAGCACCTGTCTCACAGTTCCCGAAGGCACAAATCCATCTCTGGATTCTTCTGTGGATGTCAAGACCAGGTAAGGTTCTTCGCGTTGCATCGAATTAAACCACATGCTCCACCGCTTGTGCGGGCCCCCGTCAATTCATTTGAGTTTTAACCTTGCGGCCGTACTCCCCAGGCGGTCTACTTAACGCGTTAGCTCCGGAAGCCACGCCTCAAGGGCACAACCTCCAAGTAGACATCGTTTACGGCGTGGACTACCAGGGTATCTAATCCTGTTTGCTCCCCACGCTTTCGCACCTGAGCGTCAGTCTTTGTCCAGGGGGCCGCCTTCGCCACCGGTATTCCTCCAGATCTCTACGCATTTCACCGCTACACCTGGAATTCTACCCCCCTCTACAAGACTCAAGCCTGCCAGTTTCGAATGCAGTTCCCAGGTTGAGCCCGGGGATTTCACATCCGACTTGACAGACCGCCTGCGTGCGCTTTACGCCCAGTAATTCCGATTAACGCTTGCACCCTCCGTATTACCGCGGCTGCTGGCACGGAGTTAGCCGGTGCTTCTTCTGCGGGTAACGTCAATTGCTGCGGTTATTAACCACAACACCTTCCTCCCCGCTGAAAGTACTTTACAACCCGAAGGCCTTCTTCATACACGCGGCATGGCTGCATCAGGCTTGCGCCCATTGTGCAATATTCCCCACTGCTGCCTCCCGTAGGAGTCTGGACCGTGTCTCAGTTCCAGTGTGGCTGGTCATCCTCTCAGACCAGCTAGGGATCGTCGCCTTGGTGAGCCGTTACCTCACCAACAAGCTAATCCCATCTGGGCACATCTGATGGCAAGAGGCCCAAAGGTCCCCCTCTTTGGTCTTGCGACGTTATGCGGTATTAGCCACCGTTTCCAGTAGTTATCCCCCTCCATCAGGCAGTTTCCCAGACATTACTCACCCGTCCGCCACTCGTCAGCGAAGCAGCAAGCTGCTTCCTGTTACCGTTCGACTTGCATGTGTTAGGCCTGCCGCCAGCGTTCAATCTGAGCCATGATCAAACTCTTCAATTTAAAAGTTTGATGCTCAAAGAATTAAACTTCGTAATGAATTACGTGTTCACTCTTGAGACTTGGTATTCATTTTTCGTCCGAGGACGTTAAGAATCCGTATCTTCGAGTGCCCACACAGATTGTCTGATAAATTGTTAAAGAGCAGGTGCGACGCGGCTTACAGCTCACCGTCGCGAGGTGGCGTATATTACGCTTTCCTCTTTTTGAGTCAAGCCTTTATTTTCGCTTTCCTCACAGGTTTCTTAACGAAGCCTTTCTGACCCGGCGGCCTGTATGCCGTTGTTCCGTGTCAGTGGTGGCGCATTATAGGGAGTTATTAGAGCCTGACAAGACCTAAATGCAAAAAAAAGCTCAACCGTTCACTTTTCAAACAACATTTGAACCAAAAGCCTATTTTCGCC
TGGTTTTTAAACAAAAACGAGCCCGTCAGGGCCCGTTTTATTCAAATTTGTGACTTACTGCACTGCCACAATACGATCATCACTGGCTTCAAGGCGAATCACTTTGCCAGGAACCAGTTCACCAGACAGGATTTGCTGCGCCAGTGGGTTTTCGATCTGCTGTTGAATAGCACGTTTTAACGGACGCGCCCCGTAGACCGGATCGTAACCGTTGGCGCTCAACAGTTTCAGCGCCTCGTCGGAGATATGGATTTCATAACCACGTTCTTCCAGACGTTTGTACAGACGCTGCAGCTGGATCTGAGCAATAGAAGCGATGTGTTGCTCACCTAACGGATGGAATACCACAACTTCATCAATACGGTTGATAAATTCCGGACGGAAGTTTTGGCTAACCACACCCAGCACCATCTCTTTCATTCGACCGTAATCCAGTTCGCCAAAGCGCTCCTGAATGAGATCGGAACCGAGGTTAGAAGTCATAATGACCACCGTATTACGGAAGTCGACCGTTCTCCCCTGCCCGTCAGTCAGCCGACCGTCGTCCAGCACCTGCAACAGAATGTTGAATACATCCGGATGCGCTTTTTCTACTTCATCCAACAAGATGACGGAATAAGGACGACGGCGTACCGCTTCCGTCAAATAACCGCCTTCTTCATAGCCGACATATCCCGGAGGCGCCCCGACCAAACGAGACACGGAGTGTTTCTCCATAAACTCGGACATATCGATACGCACCATCGCGTCGTCGCTGTCGAACATAAAGTTAGCCAGCGCTTTACACAGTTCGGTTTTACCGACCCCCGTCGGCCCCAGGAACAGGAAGGAACCAATCGGACGGTTCGGATCGGACAGCCCAGCACGGCTACGACGTATGGCGTTCGATACTGCTTCAACGGCTTCATTCTGCCCGATCACACGGCTGTGTAACTCTTGTTCCATACGCAACAGTTTTTCACGTTCGCCTTCCAGCATTCTGGCAACCGGAATACCGGTCCAGCGCGCCAGCACTTCGGCAATTTCCGCATCCGTTACTTTGTTACGTAACAGACGCATGGTTTTACCTTCCGACTGGGTCGCGGCTTCCAGCTGTTTTTCCAGCTCCGGAATTTTGCCGTACTGCAGTTCAGACATTCGCGCCAGGTCGCCAACGCGACGCGCCTGCTCGATGGCGATCTTCGCCTGCTCCAGCTCCGCTTTAATAGTTTGCGTCCCAGAAAGCGATGCTTTTTCCGCTTTCCACTCTTCTTCCAGCTCAGAGTACTGACGCTCTTTATCGTCCAGTTCTTCGTTGAGCATATCGAGACGTTTTTTACTCGCCTCGTCAGACTCTTTCATCAACGCCTGCTGTTCCAGTTTGAGCTGAATAATCCGGCGGTCGAGTCTGTCCAGCTCCTCCGGCTTAGAGTCAATCTGCATACGAATGCTGGATGCCGCTTCATCGATGAGGTCGATAGCTTTATCCGGCAACTGACGGTCGGCGATATAGCGGTGAGATAATGTTGCCGCCGCCACAATCGCCGGGTCAGTGATCTGCACATGGTGGTGCAGCTCGTAACGTTCTTTCAGGCCACGCAGAATAGCGATGGTGTCTTCCACTGAAGGTTCTGCGACAAACACTTTTTGAAAACGACGTTCCAGCGCCGCGTCTTTTTCAATGTACTGACGGTATTCATCCAGCGTGGTGGCGCCCACACAGTGCAGTTCACCACGCGCCAGCGCCGGTTTCAGCATATTCCCGGCGTCCATGGCGCCATCAGCCTTACCTGCGCCAACCATCGTGTGCAGCTCGTCAATAAACAGGATGACGTTGCCTTCCTGTTTCGCCAGATCGTTCAGCACGCCTTTCAGACGCTCTTCAAATTCGCCGCGATATTTCGCCCCCGCCACCAGCGCGCCCATATCCAGCGCCAGTACGCGGCGACCTTTTAAGCCTTCAGGGACTTCACCGTTAATAATGCGCTGCGCCAGTCCTTCAACAATTGCCGTTTTACCGA
CCCCTGGCTCACCGATTAACACCGGGTTGTTTTTGGTACGACGTTGCAGTACCTGAATGGTACGGCGAATTTCTTCATCACGACCAATTACCGGGTCAAGCTTGCCTTGTTCAGCCCGCTCGGTCAGATCGACGGTATATTTTTTCAAGGCCTGACGTTGGTCTTCAGCCCCTTGATCGTTCACGCTTTCACCTCCGCGCATCTGTTCAATTGCCTGAGTGATATTGGCCGTGGTCGCACCGGCTGATTTCAGCAGATCGGTTAACGTGCCGCGAGACTCAAGCGCCGCCAGAACAAACAGCTCTGACGAAATAAAATTGTCTCCCCGTTTTTGCGCCAGCTTGTCGCAAAGGTTCAGTACGCGTACCAGTTCCGAAGAAGGCTGTACGTCGCCGCCGGTGCCTTCCACCTGCGGTAAACGGCTCAGCGCCTGATCGATAGCGGTGCGCAACTGGCCAGCATTAATGCCGGCGGAGGTTAATAAAGGACGTATCGATCCCCCTTCCTGGTTCAGCAAGGCGCTCATTAAATGAAGAGGTTCGATGAATTGGTTGTCGTGCCCCAGCGCGAGCGACTGGGCATCGGCAAGAGCAAGCTGGAATTTGTTAGTAAGACGATCCAGACGCATAACTCCTCCCATAACAGGTCAAAATTGCTACTGGAGATTAAATGAGGTCATCCCTCAATTATTCAAGGTGAATGACCTGAATTATGTGAAAAGAAAGTTACCCGTACCGGATCGTCTTGATTCTTTAGGTTATATCAGCCAAATAAAACTTGCCATACGACCTGTCGTCTTGTCGCGACGATAAGAGAAGAAAGTCTCACTTTCGCTAAAGGTGCAGCGATCCCCGCCATAGACATGTTCAACGCCGGTATTCGCCAGACGTTGGCATGCAAGCTGATAAATATCCGCCAGAAATTTTTCCCCGTGCGGCAAAAATGCGCTATCGGCTTGCGCATCTTTTGCTAAAAATGCGTCACGCACTTCCGGCCCCACTTCAAACGCGGTCGGGCCGATCGCCGGACCTAACCAGGCGATGATATTTTCTGGTTTGTCAGCAAAGCAGGTGACAGTTTCCTCCAGTACGCCTTCACATAATCCGCGCCAGCCCGCATGAGCCGCCGCCACTTCTGTTCCCTCACGGTTACAGAATAGCACAGGCAGGCAGTCTGCCGTCATTACCGCGCAGACGGTTCCTGGCGTATTGCTGTAAGACGCATCCGCACGTTTGGAGGCATAAGGTTCTCCGGTAAGCCTCAACACATTTTTACCGTGAACCTGTTCAAGCCAGACAGGTTTCGAAGGCAGATTGCCCGCCGCAAACAGACGCTTGCGGTTCTCTTCAACATGCTCCGGGTTATCGCCGCAATGCGCGCCCAGATTCAGCGAGTCGTAAGGCGGTAAACTCACCCCGCCAATACGGGTAGAACTACAGGCTGCGACGCCTTTCGGCAGCGGCCACTGCGGGACAATCAGTGCATTCATAACCAGTCCACATCATCTTTATGATCTTCAAAATCGGCGCGCATCGCATCGATAAGGTCCACCATATCTTGTGGAATCGGCGCGTGCCACTCCATTTCGATACCCGATACAGGATGGTAGAGACGCAGCATTGTTGCATGCAGCGCCTGGCGGTCAAACTTACGCAGCGTGGAAATAAATTCTTCCGATGCGCCTTTCGGTGGACGTGGACGACCACCGTAGACCTGATCGCCCACTAACGGGTGCGTAATATGCGCCATATGCACGCGGATCTGGTGCGTACGCCCGGTCTCCAGACGCAACCTAAGACGCGTGTGCACACGGAAATGTTCCATAATGCGGTAGTGGGTCACCGCCGGTTTGCCCATGGGATGCACCGACATATGGGTACGTTTGGTCGGGTGACGACTGATGGGTTCATTCACCGTTCCTCCCGCCGTCATATGTCCGATCGCGACCGCTTCATATTCGCGGGTGATCTCGCGCAGCTGCAAAGACTCCACCAGGCGCG
TTTGCGCCGGAACGGTTTTGGCCACCACCATCAGCCCCGTCGTGTCCTTATCCAGACGATGCACAATGCCCGCGCGCGGGACATCCGCAATCGGCGGATAATAATGTAGCAACGCGTTCAGTACCGTACCGTCCGGGTTACCCGCGCCAGGATGTACCACCAGATCGCGTGGTTTGTTGATAACCAGAATATCGTCATCTTCATAAACAATATCTAACGGGATATCCTGCGCTTCAAAACGAATCTCTTCGTCGATCTCCGCATCAATAGCCACCCGCTCGCCGCCTAACACTTTTTCTTTTGGCTTATCGCAAAGTTGACCATTGACCAACACGCGCTGATTCAAAATCCATTCTTTTATACGCGAACGTGAATAATCCGGGAACATTTCGGCCAAAGCCTGATCTAAGCGTTGACCGAGTTGGTTTTCGGAGACCGTTGCGGTGAGTTGTACTCGTTGTGCCATATACAGCTTCTTCGTTTAACGTTGGGTTTTACGGCTTTGCCGTTTAATATAGTGTGCTATTGTAGCTGGTCTTAACCGGGAGCAGGAACAGAGAATCTCCCGTAAAACATTTTGAGGAAAGTCAAAACGTCATGACGCGCATGAAATATCTGGTGGCAGCAGCCACGTTGAGCCTGTTTCTGGCGGGTTGCTCGGGTTCAAAGGAAGAGGTGCCCGATAATCCGCCGAATGAAATCTACGCGACTGCTCAGCAAAAGCTGCAGGACGGTAACTGGAAACAGGCAATAACGCAATTGGAAGCGTTAGATAATCGTTATCCATTCGGACCGTATTCTCAGCAGGTGCAGTTGGATCTTATCTACGCCTATTACAAAAACGCCGATTTGCCGCTAGCGCAGGCCGCCATCGATCGTTTTATGCGTCTTAATCCAACGCACCCTAATATTGACTATGTCATGTACATGCGCGGCCTGACGAACATGGCGCTGGATGATAGCGCGCTGCAAGGTTTCTTTGGCGTCGATCGCAGCGATCGCGATCCGCAACATGCGCGGGCGGCGTTCAATGACTTTTCGAAACTGGTACGCAGCTATCCTAATAGCCAGTACACCACCGACGCCACTAAGCGTCTGGTATTCCTGAAAGACCGTCTGGCGAAATATGAATATTCCGTCGCAGAGTACTATACCGCGCGCGGCGCATGGGTTGCGGTCGTAAACCGGGTAGAAGGTATGCTACGCAATTATCCTGATACGCAGGCCACGCGCGATGCGTTGCCGCTGATGGAAAACGCCTATCGTCAGATGCAGCTGAATGCGCAGGCTGACAAAGTCGCGAAAATTATTGCCGCTAACAGCAAAAACACCTGAAGCAATACCAACAAGCAAAACGGCAGCCTTCCGGCTGCCGTTTTTTTATTCGATATCTGATGCAACAGAAATGATTTCACTGCAACGCATCCAGTCAAATATGGCCTGCTTTCAGCCATTCCTCAAGTAAAAGATCGCTTCTTCCTGCGATGCCTCACAAAAAGTTCGCCTTGACAAAAAGTGACAAAATAATGTGATTTACATCACACATTTTGACATTAAGAACGGTATGCTGGAATCACCAAGACGGGAAAGACAAGAGGTAAAATTTATGACAATGAACATTACCAGTAAACAAATGGAAATTACTCCGGCAATCCGCCAACATGTCGCAGACCGTCTCGCCAAACTGGAAAAATGGCAAACTCACCTGATTAATCCTCATATCATTCTGTCTAAAGAGCCACAGGGCTTTATTGCTGACGCCACCATTAATACACCGAACGGACATCTGGTCGCCAGCGCAAAACACGAAGATATGTACACCGCTATTAATGAATTGATCAACAAGCTGGAACGGCAGCTCAATAAAGTGCAGCATAAAGGCGAAGCCCGTCGTGCCACCGCATCGGTGAAAGATGCCAGCTTCGTCGAAGCAGAAGAAGAGTAGTCCCTTACATTGAGTGTATCGCCAACGCGCCTTCGGGCGCGTTTTTTGTTGACAG
GATTAAAACAGTACGGGTACTGTACTTACGTACACAAAGGAAACAGTTATGAAGCTAACCCGGTTTTTCTTCGCATTCTTTTTTATCTTCCCCTGACTGGGAGGCGTTTCGTCATGTGATAAAGAATGCGAAGACGAACAAGAAGGCCTCCCCCACCGGGAGGCCTTTTTTATTGATAACAAAAAAGGCAACACTATGACATCGGAAAACCCATTACTGGCGCTGCGAGATAAAATCAGCGCTTTAGACGAAGAGTTACTGGCCTTACTGGCAAAACGACGCGCGCTGGCGATTGAAGTGGGACAAGCAAAACTACTGTCGCATCGTCCGGTTCGGGATATCGATCGTGAACGCGCGCTGCTGGACAGACTCATCCATCTCGGTAAAGCCCACCATCTCGACGCACACTACATTACCCGTCTGTTCCAGCTTATCATTGAAGACTCCGTGCTTACTCAGCAGGCGCTGCTGCAACAACATCTGAATAATACTCACCCTCATTCGGCACGTATTGCGTTTCTTGGGCCGAAAGGCTCCTATTCTCATCTCGCGGCGCGCCAGTATGCTGCACGCCATTTTGAGCAATTTATTGAGAGCGGCTGCGCAAAATTCGCCGATATTTTTCATCAGGTCGAAACCGGCCAGGCCGATTACGCCGTGGTTCCGATAGAGAACACCAGCTCCGGCGCTATCAACGATGTGTACGACTTATTGCAACACACCAGTCTGTCGATTGTCGGTGAGATGACTGTCACTATCGATCACTGCGTGCTGGTTTCCGGCGCTACAGATCTGAATACCATCGAAACGGTGTACAGCCATCCGCAGCCGTTTCAGCAGTGCAGTAAATTTTTGAGCCGCTATCCGCACTGGAAAATCGACTATACCGAGAGTACGTCGGCAGCGATGGAAAAAGTCGCGCAGGCAAACTCTCCGCGCGTCGCGGCGCTCGGCAGCGAGGCAGGCGGCATGTTGCACGGTTTACAGGTGCTGGAACGCATTGCCGCAAACCAGACGCAGAATATCACCCGCTTTCTGGTACTGGCGCGCAAAGCCATCAACGTTTCCGATCAGGTTCCGGCAAAAACCACTCTGTTAATCGCCACCGGGCAGCAAGCTGGCGCGCTGGTCGAAGCGCTGCTGGTGCTGCGTAACCACAATCTCATCATGACGAAACTGGAGTCGCGCCCCATTCACGGCAATCCGTGGGAAGAGATGTTTTATCTCGATATTCAGGCGAACCTGGAGTCGCAGGTAATGCAAAGCGCGCTAAAAGAGCTGGGCGAGATCACGCGCTCAATGAAAGTGCTTGGCTGCTATCCCAGCGAAAACGTCGTGCCGGTAGAACCTGCCTGACGTTCAATAAAACGGCTTTTCTTCATCCCTGCGACGGCTACCTGCAGGGTGAAGATGGCGCCGGAGAGCGGATAATCCGCCACTTCCTGAGCAGACATATTTTCTCTGGTAGTTGTAATAAACAGCGTTTTCATATCTGCGCCGCCAAAGCAAACCATCGTCGGACAACGTACCGGCAACCGGTACTCTTCCAGCTGCTCTCCTTGCGGTGAGAAACGCGCCACGCGCCAGCCGTCAAACATCGCGCTCCAGTAGCACCCTTCGCTATCCATCGCCGCGCCGTCGGGAAGTCCTTCTCCTTCGCCAAAATGGCGAAAAAGTTCACGCTTGCCCGGATCGCCATGCTTATCAAGTAACGTGCGGTATATCACGCCATTCGGCGTATCGGAGGTATACATCCACTGGTTATCCGGGCTGAACGCCAGGCCGTTGTGTCCCTGGATATCGCACTGGATCACTTTTGCCGTCAGGTCATGATCGATCCGCATCAGCAACGCGCCATTATAGTCACCTGGCGCCCAGAACGTTCCGGCATAAAAACGTCCATCGCCATCAGTGCCGCCATCGTTGAAACGCGCCAGTTTCGTATTAGAGGGATTGTCGCAAACCTTACGCTGCAATAATCCATTTTTATCGGCCAGCCA
GATAGCATGACGCATGGCCACAATAAATCCGCCCTGCTCACGCAAGGCAAAACAGCCCACCTCTTCCGGAAACGCCAGCACGCTATGCGTCCCGCTCGCCGGATGGTAACGATGGATCTCCTGTTCCAGGATATCGGTCCAATACAGCGCGCTTTCATCCTCGCTCCATGTGGGGCATTCTGGTAAATGTCCGGCATAATCAAATAAAACGTGCGGCGTAGTCATAACGGTTCCTTAAACGAAAAAAGCCAGCAAAGCTGGCTTTTAGTATAGATGTCATCATTATGGTCGGCTGTCATTCGCCTGACGCAATAACACACGGCTTTCATTCTGGAAGCGTCTGGCATAATCGCCAAACCAGTGTTCAACTTTGCGAAAACTGTCGATAAAAGCCTGCTTATCACCTTGTTCCAGTAACCCGATCGCATCGCCAAAACGTTTATAGTAACGCTTGATAAGCGCCAGATTGCGCTCCGACGACATAATAATGTCCGCATACAGCTGCGGGTCCTGGGCGAACAGACGCCCGACCATCGCCAGCTCCAGTCGATAAATCGGCGATGATAGCGCCAGAAGCTGCTCAAGCTGGACGTTCTCTTCCGCCAGATGCAGCCCATAAGCGAAGGTAGCAAAATGGCGCAACGCCTGGATAAAAGCCATGTTCTGATCGTGCTCGACAGCGCTAATTCGGTGCAACCGAGCGCCCCACACCTGGATTTGCTCAAGGAACCACTGATACGCTTCCGGTTGACGCCCGTCACACCAGACCACCACCTGCTTCGCCAGGCTCCCGCTGTCCGGGCCAAACATCGGATGCAAGCCCAACACGGGGCCATCATGGGCCGCCAACATTGCCTGTAACGGATCGCTTTTCACCGATGCCAGATCGACCAGAATACAGTCGGACGGCAGGGGCGGCAGTTGCGCTATGACCTGTTCAGTAACATGAATCGGCACGCTGACGATCACCATTCCGGCATCGGCGACAATGTCCCTGGCGCGCGGCCAGTCCTGCTGTTCCAGAATACGGACCTGATAGCCCGACAGCGTGAGCATTTTTTCAAACAGACGCCCCATCTGTCCGCCGCCGCCCACAATGACGACCGGACGCAGAGAAGGACAAAGCGTTTTGAACCCCTTATCATTTTCGCTGGAGTAAGATTCACGCATTACCCGGCGCAGGACATCTTCAATGAGATCGGGCGGGACACCGATCGCTTCTGCTTCCGCCCGTCGTGAAGCCAGCATAGAGGCCTCACGCTCCGGCACGTAAATAGGCAGGCCAAAACGGCTTTTCACCTCGCCGACTTTGGCAACCAGTTCCAGGCGCTTAGCCAGTAAATTCAACAACGCTTTATCGACATCATCTATTTGATCGCGTAACGCGGTCAATTCAGCAACCATAACAACCTCTTATGCGACGCGCACCGCCAGCTGGCCGCTCAAATCTTTATGAATTTCACGTAACAGGGCATCGGTCATCTCCCAGCTAATACAAGCATCGGTGACGGAAACGCCATACTTCATTTCGCTGCGCGGCTGTTCGGAAGACTGATTACCCTCATGAATATTACTTTCAATCATTAAGCCAATGATTGAACGATTGCCATCTTTAATCTGCGCAACCACAGATTCGGCAACGGCTGGCTGGCGGCGATAATCTTTATTGGAGTTACCATGACTGCAATCTACCATCAGCGAAGGACGTAGTCCCGCCTGTTCCATCTCTTTTTCACACTGAGCGACATCTGCCGGGCTATAGTTTGGCGCTTTGCCGCCACGCAAAATCACATGGCCATGCGGATTTCCCTGGGTTTGCAATAACGCAACCTGACCGGCCTGGTTAATGCCAACAAAACGATGAGGTTGCGCAGCGGCGCGCATGGCGTTAATCGCTGTCGCCAGGCTGCCATCCGTGCCGTTTTTAAAGCCGACCGGCATAGAAAGACCAGACGCCATTTCGCGGTGGGTTTGCGATTCGGTTGTGCGCGCGCCTATCG
CCGACCAGCTAAACAGATCGCCCAGGTATTGCGGGCTGTTCGGATCCAACGCTTCGGTCGCCAATGGCAACCCCATATTCACCAGTTCCACCAGTAGCTGACGCGCTATTTTCAACCCGGCTTCCACATCAAATGAGCCATCCATGTGAGGATCGTTAATCAGCCCTTTCCAGCCGACGGTAGTTCGCGGCTTTTCAAAATAGACGCGCATTACCAGATAGAGGCTATCGCTGACCTCTGCGGCAAGGGCTTTAAATCGACGGGCATATTCCAGAGCGGTTTCAGGATCGTGAATAGAACAAGGACCGCATACCACCAACAGACGCGGATCGCGCCCGGCAATAATGTCAGAAATGATTCCCCGGGACTGCGCTATCTGCGCTTCCTGCGCCAGGCTCAACGGAAAGGCCGCTTTAAGCTGCTCCGGCGTCATTAATACCTGTTCATCGGTGATACGTACGTTATTCAGCGCGTCTTTTTGCATGATGGCGATCCTGTCTTTTCTTGTTTGCGATAGTTGATCCTCCACGAGGATGTTTAAACCATACCACAAAAAGTAAAGATTTCAATCCATATTACGTAAAATAAACTTTACACCATGCGCAATCGTCGATTAACCCCCTCCATTTCGTATCAAAAACTTTACACAACAAGTGAATTATTTTATTCCCAGCCTCAGAAAAACGGTAAAATGAGCGATACTCGCTATTTTTTGATTATTTTTTCTCCGCACACATATTTTTACTTTCTGATACAGGTTGATTAATACCATTATCACAAGAATTTAATATCAGGGACGCCACTTAAATTATTCATATTATCAATAAGACATAATGACGATATGGCAAAATAATGTTATTTTGCTACAAACAATCACGGATGTATTTTAATGCTTTACGACTGAAGATGGTGGGGACTTGTTAATGCGTTTTTCTCACCGATTTATCCTTCTGTTATCGCTATTATTAGCAAGCTTGCCTCTGTATGCTCAACGCGTCACGGAAGAAGAGAAATCCGTACGCGCCATCGTCTCCGGTATCGTCAGTTATACACACTGGCCGGCGTTATCCGGGCCGCCAAGGCTATGCATATTTTCATCTGCGCGTTTTGTCAGGGTACTTAGCGAAGAGGCTGATTGGGCTTTCCCTTATCAACCGCTTGTCATCCGTACCACACAAGAAGCCCTGAGCGCCCGCTGTGATGGTTTTTACTTTGGCAATGAATCGCCAGCTTATCAGGTGGAGTTAACGCGTCATTATCCAGTTAATGCCTTGTTATTAATTGCCGAGCAAAATACCGAATGTATTATTGGCAGCGCATTTTGTCTGATCATTAACAATGATGAAGTCAAATTTTCCGTCAACCTGGACTCCCTCTCGCATAGCGGCGTGAGAGTAAATCCAGAAGTATTAATGCTTGCACGGAATCAAAAGCATGAATAAGGAATTTTCTCTGTCCAGACCAACATTTAAACGCACACTACGGCGGATTAGTATAATCAGCGTGCTGCTTACAATGACATTGATCTGGCTATTAATTTGCGTTGCGTCTGTCCTTACGCTCAAACAGTATGCGCAAAAAAATCTCGATTTGACCGCCGCCACAATGGCCCATAGCCTTGAAGCGGCACTGGTATTTTCCGATAACGCGGCCGCAGCGGAAACGCTCGCCACACTGGGACGCCAGGGACAATTTTCAGCGGCGGAGGTCCGCGATAAAAATGGCCGTACTATCGCCTCATGGCGCTATGATGCGCGAGCCGCAGACGATAAGCTCATCGGCTTAATTAGCCACTGGCTTTTTCCATTGCCGGTATCACAACCCGTCTGGCACAACGGCAGGGCCATCGGCGAAGTACGGCTTGTCGCCCGCGACAGCCTTATTGGTCATTTTATCTGGCTATCGCTGGCAGTGCTGACAGGATGTATTCTGCTGGCATCCGGCATTGCCCTGCTGCTCACGCGTTATTTGCACAATGGCGTTGTGGATGCGCTGCAAAA
TATTACTGAAGTTGTACACGACGTTCGCACTAACCGAAATTTTTCACGCCGGGTACCTGATGAGCGTATTGCGGAATTTCACCTGTTTGCGCAGGATTTCAATAGCCTTCTGGATGAGATGGAAGAATGGCAGCTACGGCTTCAGGCTAAAAATGCCCAGTTACTACGTACCGCGCTGCACGATCCGCTGACGGGGCTTGCCAATCGCGCGGCATTTCGCAGCTGTATTAACGCGCTGATGAAGGACAATTCCGCTCGTAGCAGTTCGGCATTGTTATTTCTGGATGGCGATAACTTTAAATATATTAATGATACCTGGGGACATGCGGCAGGCGACCGCGTACTTATAGAGGTTGCCAAAAGATTAGCGGAATTCGGTGGTAGCCGTTATCAAACTTACCGACTCGGCGGCGATGAATTTGCGATGGTGCTTTACGATGTACATTCGGAGTATGAAGTACAACGTATTTGCGCAGCGCTATCCCAGGCGTTTAATCGACCTTTTGAACTGCATAACGGCCAGCGGATAACGATGACCCTGAGTATTGGCTTTGCGCTGACATGGGAACATGCCACTGCCGAAAAACTACAAGAACTGGCCGATCGAAATATGTATCAGGCTAAACACCGGCGTGCGGAACGCTCGCTAAATTAAGGAACGGGCCTGGCCGTTTCTGACTCAGCGTTGAGACCACAGACCCAGCCTAATCATGCAACCGTCCATGGCGTGGAGCAACCCCATAAAAAAGGGGCCAGCGTAATGCCAGCCCCTTCTTTTCTACAAGCTTTCGGATGTTGCGAAAGCGCGTTCTTAGTTAAGACGCTCTTTGATACGAGCCGCTTTACCAGTACGCTCACGCAGGTAGTACAGTTTAGCTTTACGAACAGCACCACGACGTTTAACAGCAATGCTGTCAACTACCGGAGAGTGAGTCTGGAAGACACGCTCAACGCCTTCGCCGTTGGAAATTTTACGAACAGTGAATGCAGAGTGCAGACCGCGGTTACGAATAGCGATAACCACGCCCTCGAATGCCTGCAGACGTTTCTTGGTACCTTCAACAACCCATACTTTCACTTCCACGGTATCGCCCGGACGGAAGGAAGGTACGTTCTGCTTCATCTGCTCTTGTTCAAGTTGCTTAATAATGTTGCTCATAATTTAATCTCTTATCCTGGGTAAACTGATATTCGGGGGCCTATGCCATCCCATCATGTTTATGCTGCTGTTGTGCGTGTTCTGTTTTGAACTCCGCCAGCAACCTTGCTTGCTCTTCAGTCAGAGCCAGGTTTTCCAGAAGTTCAGGTCTTCTAAGCCAGGTTCGGCCCAGCGACTGTTTCAAGCGCCAGCGACGTATCTCGGCATGGTTTCCCGACAGCAATACTGGCGGTACTTCCATCCCCTCTAACACTTCAGGGCGCGTATAGTGCGGACAATCCAGCAACCCATCAGCAAACGAATCTTCGATTGCTGATGCCTCATGCCCCAGAACCCCCGGTATAAACCGGGCGACGGAGTCAATCAGCGTCATTGCCGGTAGTTCGCCACCGCTGAGAACGTAATCGCCAATTGACCATTCTTCGTCAATTTCGGTCTGAATTACGCGCTCATCTACGCCTTCGTAGCGACCACACACCAGAATAAGCTTCTGATTCGTGGCCAGCTCGCTAACGCCCGCTTGATCAAGCTTGCGTCCCTGAGGCGACAGATAAATCACTTTAGCGCCTTCACCTGCCGCGGCTTTTGCTGCGTGAATGGCGTCCCGCAAGGGTTGCACCATCATTAACATCCCCGGTCCGCCGCCGTAAGGACGGTCGTCCACGGTACGGTGCCGGTCATGCGCGAAGTCGCGAGGACTCCAGCTTTGGATGTTCAGCAGGCCTTTTTTTACTGCCCGGCCAGTTACCCCGTAATCGGTAATTGCGCGGAACATTTCAGGAAACAGGCTAACGATGCCAATAAACACAAGCCAATCCCCATAACGCCATTTATTACCGTT
TATTCGGTGGTTTAAAAACCAGGATCCCAGTCTACTTCGATAGTACGAGTAGCGAGATCGACTTTCTTGATAACCTGCCCATCGAGGAACGGTACTAGTCGTTCCTTGATACCAAATGCATCTTTCAGGTTCGCCTTAATGACGAGAACGTCATTCGACCCGGTTTCCATCATGTCGATAACTTTACCGAGATCGTAGCCTTCAGCGGTGACTACCTGGCAACCCATCAGGTCTTTCCAGTAATAGTCTCCCTCCTCAAGCGCAGGAAGTTGCGAGGAATCCACGACAATTTCGCAATTGGTCAGAAGATTCGCGGCATCTCGATCGTCAACGCCTTTCAGCTTGATGATCAGATCCTGATTGTGGTGCTTCCAGCTTTCCAGCTGTACCTGCTGCCACTGACCCGCCTTCTGGATAAACCAGGGCTGATAGTCAAAAATGCTTTCGGCGTCTTCAGTGGAGGAAAACACTCTGAGCCAACCACGGATACCGTATGAAGAACCCATTTTCCCCAGTACAACCGGTTCAGCGGGTACTTGTGCGGCGAGTTGCTTGCTCATCATGACCACCGTGACAGATTAAGCTGCTTTTTTTACTTCTTTGATCAGCGCCGCAACGCGATCGGAAATGGTTGCGCCCTGGCCAACCCAGTGAGCGATGCGATCCAGATCCAGGCGGGTGCCTTCTTCTTTTTCGCTAGCGATTGGGTTGAAGAAACCAACGCGCTCAATGAAGCGACCGTTGCGTGCATTACGGCTGTCGGTGACAACAACCTGGTAGAACGGACGCTTTTTAGCGCCGTGACGAGCTAAACGAATAGTTACCATAACATCCTCTTGTGTGAATAAAACAACCGGGCCCCATCGAGGAACGGAGCCCGGCGTCATATTAAAAGCCCGAAAATTTTACTGATTTCTGGGAGAATTGCAATCAACAGTTAATAACTCTGCTGTAGAAGGCCGTCGGCGGCGCAGCGAGTTCGGTGCCGGCGTGCGCGCAGCCACCGCAGCGTACACGCAGTACGTGAGGATGGCGAGCACACCCCGGTGCCGAAATAGCAAGTGAGCCAGGCCGGGAAATTAGCGACCCGGGAAGCCTGGCGGCATCATCCCCTTCATACTGCGCATCATCTTCGCCATCCCGCCCTTCTTCATCTTCTTCATCATGCGCTGCATGTCGTCGAACTGTTTCAGAAGGCGGTTAACGTCCTGCACCTGCATCCCACAGCCCTGCGCGATACGACGCTTGCGGGAACCTTTGATGATTTCCGGCTTCGCGCGCTCTTTCAGCGTCATCGAATTGATGATCGCTTCCATACGCACCAGCACCTTGTCATCCATCTGCGATTTAACGTTGTCCGGAATCTGCCCCATGCCCGGTAATTTGCCCATCAGACTGGCCATACCGCCCATGTTTTTCATCTGTTTGAGCTGTTCCAGGAAGTCGTTCAGGTCGAAACCGTCGCCTTTCTTCAGTTTAGTCGCCAGCTTTTCAGCCTGTGCGCGGTCAACTTTGCTTTCGATATCTTCGATAAGAGACAGTACGTCGCCCATACCGAGAATACGCGAGGCGATACGATCCGGGTGGAACGGTTCCAGCGCGTCGGTTTTCTCGCCGACACCGAGGAATTTAATCGGCTTGCCGGTGATATGACGAATAGAGAGCGCCGCACCGCCACGAGCATCACCATCAACTTTGGTCAGCACCACGCCGGTTAACGGCAGCGCTTCGTTAAAGGCTTTTGCGGTATTCGCCGCATCCTGACCGGTCATCGCATCGACGACAAACAGCGTTTCTACTGGCTTGATAGAAGCGTGGACCTGTTTGATTTCGTCCATCATCGCTTCGTCAACGTGCAGACGACCGGCGGTATCCACCAGCAGCACGTCGTAGAATTTGAGCTTCGCTTCTTTCAGCGCGGCGTTGACGATATCAACCGGTTTCTGGCCGACATCAGACGGGAAGAAATCCACGCCAACCTGTTCAGCCAGCGTTTCGAG
CTGTTTGATCGCCGCCGGGCGATAGACGTCGGCAGAGACGACCAGCACTTTCTTCTTGTGCTTCTCGCGCAAGAATTTACCCAACTTACCGACGCTGGTGGTTTTACCCGCCCCCTGCAGACCCGCCATCAATACTACGGCTGGCGGCTGCGCAGCTAAATTCAGCGTCTGGTTCTCTTCGCCCATCGCCGCAACCAGTTCGCTACGGACAATCTTGACGAACTCCTGCCCTGGGGTCAGGCTCTTGTTAACTTCATGACCAACCGCTTTCTCTTTTACGCGATTGATGAACTCACGCACTACCGGCAGCGCAACGTCAGCCTCCAGCAGCGCCATGCGCACTTCGCGCAGCGTCTCTTTAACGTTGTCTTCAGTAAGGCGCCCACGGCCACTGATATTGCGCAGCGTGCGCGACAAACGATCGGTTAAATTATCAAACATTGTCTCTCGCCTGGGGTGGAAACGGTTGGTCGCCGCAGCGACACAGTTACAGAATTTCGCCACAGTATACCATGAAGCCGTCTTTGTTGTTATGCAACGGTTGGAGCTGCGGTCACGTAACGCTATACTGCTTCTCTTTCTCACTGGTCAACTGTCGACACAACTATGCCCGTTTTTGCTTTACTTGCCCTTGTCGCCTACTCCTTCAGCCTCGCGCTGATCGTTCCCGGACTGTTGCAAAAAAACAGCGGCTGGCGGCGTATGGCTATTCTTTCTGCGGTCATCGCGCTGGTGTGCCATGCTGTCGCGCTGGAATCGCGTATTCTGCCCGGCGGCGACAGTGGACAAAACCTGAGCCTGCTGAATGTCGGTTCGCTGGTCAGTCTGATGATCTGCACGGTGATGACCATTGTCGCGTCGCGTAATCGCGGCTGGCTACTTCTGCCCATTGTCTATGCTTTTGCGCTGATTAACCTGGCCTTCGCCACGTTCATGCCTAATGAATATATTACGCATCTGGAAGCGACCCCAGGCATGATGGTGCACATCGGTCTGTCGCTTTTCTCTTACGCGACGCTGATAATCGCCGCGCTGTATGCGCTACAACTGGCGTGGATCGACTATCAGTTGAAGAACAAAAAGCTGGCGTTTAGTAATGAAATGCCGCCATTGATGAGCATTGAGCGCAAAATGTTTCATATCACGCAGATTGGCGTCGTACTGCTGACCCTTACTCTGTGTACCGGCCTGTTTTACATGCATAACCTGTTCAGTACTGAAAATATTGATAAAGCCGTGCTCTCTATTGTGGCATGGTTTGTCTATATCGTTCTGCTGTGGGGGCATTACCATGAGGGCTGGCGGGGTCGGCGGGTCGTGTGGTTTAACGTCGCAGGCGCGGGTATTCTGACGCTGGCCTATTTTGGTAGCCGCATATTACAGCAATTTGTAAGCTAAGCTTTTAAAGGAGTTTCCCCTGGAACACATCTCCACCACCACGCTGATTATCATATTAATCATCATGGTGGTCATCTCGGCCTATTTTTCCGGTTCCGAAACTGGAATGATGACGCTAAACCGTTACCGTTTACGCCATATGGCGAAGCAAGGTAATCGCTCGGCAAAACGGGTAGAAAAATTACTGCGTAAACCCGACCGACTGATAAGCCTGGTGCTGATCGGCAATAACCTGGTCAACATTCTTGCCTCGGCGCTCGGCACGATTGTGGGTATGCGGCTTTACGGCGACGCTGGCGTGGCTATCGCCACCGGGGTACTGACGTTTGTCGTGTTAGTTTTTGCCGAAGTCCTGCCCAAAACTATCGCCGCGCTCTATCCGGAAAAAGTGGCCTATCCCAGCAGCTTCCTGCTGGCGCCGTTACAGATACTGATGATGCCGCTGGTGTGGTTACTTAACACCATCACCCGCCTGCTGATGCGTCTGATGGGAATTAAAACGGACATCGTGGTTAGCGGGTCGTTAAGCAAAGAGGAATTACGCACTATCGTGCATGAATCCCGTTCGCAAATCTCTCGCCGCAATCAGGATATGCTGCTGT
CGGTTCTCGACCTGGAAAAGGTCAGCGTGGATGACATCATGGTGCCGCGTAATGAAATTATCGGGATCGATATCAATGACGACTGGAAGTCTATCGAGCGGCAGCTCACCCACTCGCCACACGGACGCATTGTGCTCTATCGCGATTCGCTGGATGACGCCATCAGTATGCTGCGCGTGCGTGAAGCCTGGCGGTTAATGGCCGAGAAAAAAGAGTTCACCAAAGAGATGATGCTGCGCGCCGCCGATGAAATCTATTACGTCCCGGAAGGTACGCCGCTCAGTACACAACTGATTAAATTTCAGCGCAATAAAAAGAAAGTCGGACTGGTCGTCAACGAATATGGCGATATCCAGGGGCTGGTCACCGTGGAAGATATTCTCGAGGAGATCGTTGGCGACTTCACCACATCGATGTCGCCGACCCTGGCGGAAGAGGTCACGCCGCAAAACGACGGTTCGGTCATTATTGACGGTACCGCCAACGTCCGGGAAATCAATAAAGCGTTTAACTGGCACCTGCCGGAAGACGACGCGCGCACCGTCAACGGGGTGATTCTGGAGGCTCTGGAGGAAATTCCGGTTGCTGGCACGCGCGTGCGCATTGAGCAGTACGATATCGATATTCTCGATGTACAGGAAAATATGATTAAGCAGGTAAAGGTTGTACCGGTAAAGCCGCTGCGCGAGAGTGTGGCGGAGTAACACAATGGCGAACTCCGGTTCGCCTTTTTTATCGCCGTCTTTTAAGATTCCATCAATTCCTGCCATTTACTTAACAATGCAATAATGTCTTTTCAGGCATTCAGAAAGGATATTGATGTTATGTCAGAAGATTATGTAATTGAATGGGATAAGAATTTTGCAGACGACCTCAACGTCGTCGCCAACGTTTTTTTATCACACAACCCAACCTTATGGCCCACTATTTTCTCTCAACTTTCGACGCAACCTGAAATCTTTGAAGACGAAGATGAAGACGAATATGGGTTGCAGGACGTCCTGGATTGTAGCGGTGGCGATCTCGGAAATAACGAGCTGGCACAGGCATTTTTACAAGTTCTACGCGGCGAAAGATTTATTCACCTCGTTGACTGGAAAGGTGAAGATGAAGAAGGCGAACTGGCTAATTTTGCCGCAGATCGCTTTTATGAACTCACAAAAAACCTGACGAATTCTGAGGAATTAAGAAATCTTCTTGTCGAAATAACCCAAGAGGATGAAATCTCCGATGTCTGTGAAGCCGGAGACCGTTACCTTGACGAGATCTTTGAACGCATCCAGACCGAACTCAATAAGAGGGGCTTCCAGATTTTTGATCTGAATGAAGGATCTGACACCTATAATGTGGTCGTTCTGCCAATGAGCGAATATAAAAAAATAGAGGACTTCAACACGCCGTGGCTGGAAGTGCAGGATTTTTTAAGCTAAAAAAATGGCCGACTTTACATCGGCCATCAGACTTTTACTTCGCCTTCGCTACAGTCACCATCGCCGCGCGAATGGTACGACCGTTCAGCGTATAGCCTTTCTGCATAATACCCAGCACATTGCCTGCCGGAACCTCTTCCGACTCCACCATCGCAATCGCCTGATGCACGTTCGGGTCCAGCGGCACGTTGGTCTCAGCAATCACTTCCACGCCGAACTTACGCACCACATCCAGCATAGACTTCAGCGTCAGCTCAATCCCTTCGACCATTGCCGCCATATCCGGATTGGCTTTGTCTGCGACTTCCAGCGCGCGATCCAGGCTATCGATGACCGGCAGCAATTCGTTGACGAACTTCTCGAGAGCGAATTTATGCGCTTTTTCGATATCCTGTTCAGTACGACGGCGCAGGTTTTCCATTTCCGCTTTGATGCGCAACACGGTGTCGCGTTCGCGAGTCTGGGCTTCTGCAAGCTGAACTTCCAGATTCGCAATTTTTTCATCGCGCGGATCCACCTGCTCAGCAGAATCGTTTGGTTCAACTGCCTCAACCTCTTCGTGCTGATCCA
TGATAATTTCTTCCGGGGCTTGCCCCTCAGGCGTTTTCTGTTCTTTACTACTCATGAATTTCTCCGCGTTTTTTTTCGCATTCATCTCGCTAACTTCGCTTATTATGGGGATCAGATTCAGGGTTTCAAGGGAAGTACTCACATTGTCATCAATCTTCGCTACAAGGACCTCAGAAAAATGAATAATCATTTCAAGTGTATCGGCATTGTGGGGCATCCGCGTCATCCCACCGCACTCACGACACATGAAATGCTCTACCGTTGGCTATGCGCTCAGGGTTATGAGGTCATTGTGGAGCAACAAATCGCCCACGAATTACAGCTAAAAAATGTGCCAACCGGCACGCTGGCGGAAATTGGTCAACAGGCGGATCTGGCGGTAGTCGTGGGCGGCGACGGCAATATGCTTGGCGCAGCGCGTACGCTGGCGCGTTATAACATCAATGTGATTGGCATTAACCGCGGTAATCTTGGCTTCCTGACCGATCTCGATCCGGACAACGCCCTGCAACAATTATCTGATGTGCTGGAAGGACGCTATATCTCCGAAAAACGTTTTCTGCTGGAAGCGCAGGTCTGCCAACAGGACCGCCAGAAGCGTATCAGCACAGCCATTAATGAAGTGGTGCTGCATCCGGGAAAGGTGGCGCATATGATCGAATTCGAGGTTTATATTGATGAAACCTTCGCTTTTTCACAGCGATCTGATGGTCTGATCATTTCTACGCCGACCGGCTCCACCGCCTATTCGCTCTCCGCAGGCGGCCCTATTCTGACCCCTTCACTGGATGCCATCACCCTGGTGCCGATGTTCCCGCATACCTTGTCGGCGCGCCCGTTAGTCATTAACAGCAGTAGCACGATTCGTTTGCGTTTTTCGCATCGCCGTAGCGATCTGGAAATTAGCTGCGACAGCCAGATAGCCCTGCCTATTCAGGAAGGAGAAGATGTGCTGATTCGCCGCTGCGACTACCATCTCAACCTGATACATCCCAAAGACTACAGCTATTTCAATACATTAAGCACAAAACTCGGCTGGTCAAAAAAATTATTCTAATTTTACGTTAGCCTCTTTACTGTATAAAAAACCAGTTTATACTGTATTTAATTACAGTTATGGTTTTTCATACAGGAAAACAGCTATGTTGGCGCAACTGACCATCAGCAATTTTGCTATCGTTCGCGAGCTTGAGATCGATTTTCAGAGCGGAATGACCGTCATTACTGGTGAAACCGGCGCGGGTAAATCTATTGCGATTGATGCCCTGGGTCTGTGTCTGGGCGGTCGCGCTGAGGCCGATATGGTCCGTACCGGAGCGACCCGCGCCGATTTGTGTGCCCGCTTTGCGCTGAAAGACACGCCTGCGGCATTGCGCTGGCTGGAAGAAAACCAGCTTGAAGAAGGTCGCGAATGCTTACTTCGCCGCGTTATCAGCAGCGATGGCCGCTCACGGGGCTTTATTAACGGCACCGCTGTGCCCCTGTCTCAATTGCGCGAGCTCGGTCAGTTGCTGATTCAAATTCATGGTCAACATGCGCACCAGCAGCTTACCAAACCCGAACAGCAAAAATCGCTGCTGGATAGCTATGCCAACGAAGCCGCTTTAGCGCAGCAGATGGCTGCCCGCTACCAATTATGGCATCAAAGCTGCCGCGATCTTGCCCATCACCAGCAACAAAGCCAGGAACGCGCCGCGCGCGCAGAACTGCTGCAATATCAGTTAAAAGAGTTGAACGACTTTAATCCGCAGGCCGGTGAATTCGAGCAAATTGACGAAGAGTACAAACGGCTGGCCAACAGCGGGCAGTTGCTTACGACCAGCCAGAACGCTCTGGCGCTACTCGCGGACGGCGAAGACGTTAACCTGCAAAGCCAGTTATATAGCGCAAAACAACTGGTTAGCGAGCTGGTAGGTATGGACAGCAAGCTTTCCGGCATCCTGGATATGCTGGAAGAAGCCACTATCCAACTCACCGAAGCCAGCGATGAATTACGCCAC
TATTGCGAGCGTCTGGATTTAGACCCCAATCGCCTGTTTGAGCTTGAACAGCGCATCGCTAAACAAATTTCGCTGGCGCGAAAACATCACGTCAGCCCGGAAGCGCTGCCGCAGCTTTATCAGTCACTGCTCGAAGAACAGCAGCAGCTCGACGATCAGGCCGATTCGCTGGAAACGCTAACGCTGGCGGTTAATAAGCATCACCAACAAGCGCTGGAAACGGCGCAAGCGCTGCATCAACAGCGTCAGTTTTACGCCCAGGAGCTGGGGCAGTTGATCACTGAAAGTATGCATTTACTTTCGATGCCGCACGGTCTGTTTACTATCGACGTGAAATTTGATGAGCATCATCTGAGCAACGACGGCGCCGATCGCGTGGAGTTTAAAGTGACGACAAACCCGGGTCAGCCGCTGCAGCCTATCGCTAAAGTCGCGTCTGGCGGTGAACTGTCGCGCATTGCGCTGGCTATTCAGGTGATTACCGCGCGCAAAATGGAAACGCCGGCGCTAATTTTTGACGAGGTCGATGTTGGCATCAGCGGCCCAACCGCCGCCGTGGTCGGTAAACTCCTGCGTCAGCTTGGCGAATCCACTCAGGTGATGTGCGTCACTCACCTGCCACAGGTAGCCGGGTGCGGTCACCAGCACTTCTTTGTCAGTAAAGAAACAGACGGGGCAATGACCGAAACCCATATGCAACCGTTGGATAAACGTGCGCGTTTACAGGAACTGGCCCGCCTGTTGGGAGGAAGTGAAGTTACGCGTAATACGCTGGCAAACGCAAAAGAACTGCTGGCGGCGTAAACTTTTTTATCTCCCGAAGGTCTGAGTTCACAACAAAAACGCCGTAAGACCCGACAGCAAAAGGTTTTAAAGTGATGAAAGGTCTATTATCATCGGCATATTAATATGAGCCACGTACTGCTCAGGCCCGAAAAGGAACCCAATCACTATGCGCTGTAAAACGCTGACTGCTGCCGCAGCAGTACTCCTGATGTTGACCGCAGGCTGTTCCACTCTGGAGCGAGTGGTTTACCGACCTGACATCAATCAGGGGAACTATCTTACACCTACGGATGTCGCCAAAGTGCGTGTTGGTATGACGCAACAACAGGTTGCGTATGCTTTAGGCACGCCGATGATGACCGATCCTTTTGGTACGAACACCTGGTTTTATGTCTTCCGCCAGCAGCCAGGACATGAGAACGTGACGCAGCAGACTCTGACGCTCACCTTTAACAGCAGCGGCGTGTTAACCAATATTGATAACAAACCGGCGTTGACGAAGTAATTTTGTGATACCGCCTGATGAGGTTACGTTCATCAGGCCTGCAAACCTCTCCCCCGCCATTTGGTACGAATTTGCCTGATGGCGCTACGTTTATCTGGCTTACAGGCACGCCTGTGCTTCAGGTGTGGTACGGCTTCCACACCGTTATCCGGCAAAAGAGGCTAATTATCTGCCAGCCGATTTTTCGGCGCGCTGGCGGCGTAACGCTTTTGGATCGGCGATTAAGGGACGGTAAATTTCAACCCGATCGCCATCCTGCACCGTATCGGTGAGTTTTACCGGACGACTGTAGATACCTACTTTATTTTTCGCCAGATCGATATCCGTGCGCAATTCCAGCAAGCCGGAAGCGCGAATCGCCTCTTCGACCGTCGCGCCCTCCTCCAGCGTTACGCGCTGCAAATACTGCTTTTCCGGCAGCGCATAAGCCACTTCAACCACAAGTTTATCCGGCACGGTAAACCTCTTTGGCGCGAACCGTAAACGCCTGCACCATGTTGGACGCTAACTCTTTGAATATACGACCAAACGCCAGCTCAATGAGTTTATTGGTAAACTCAAAATCAAGCTGAAACTCAATGCGGCACGCCTCGGGGCTGAGCGGGGTAAATTTCCAGCCGCCGATCAGCTTTTTAAAGGGGCCATCCACCAGATGCATCAGAATACTCTGGTTACGCGTTAGCTGATTACGCGTCGTAAACGTCTTGCT
GATACCCGCCTTCGACACGTCAACAGCCGCCGTCATCTGTGCTGGCGATGACTCCAGGACGCGACTACCAACACACCCAGGCAAAAACTGGGGATACGATTGCACGTCATTCACTAACTGATACATCTGTTCCGCACTGTAGGGGACTAAAGCAGTCCGACTAATCTGAGGCATAGCAATTCCCATCAACATAAATCGTGTAAATAATACCACTTATCAGCGAGTAAAAAAAATGCTGTCCCCAACCGTCATGCCTTAATTGTGCGTGCATAGCCCAACTCATGCTAAGATAGATTTTTGCACCTCATCAGGACGTAATGGGGTGTTTTCGATATCAGATTACCTATGAATTCACGACACTTATGACGAAGAAAAAAGCACATAAACCGGGCTCAGCCACCATCGCGCTGAATAAGCGCGCCCGGCACGAATACTTTATCGAAGAAGAGTTCGAGGCTGGTCTCGCCCTGCAAGGCTGGGAAGTAAAATCTCTGCGCGCAGGAAAAGCCAATATCGGCGACAGTTACGTTATCCTCAAAGATGGCGAGGCCTGGCTGTTTGGCGCAAACTTTACGCCGATGGCGGTCGCCTCGACACATGTCGTATGCGACCCAACCCGAACCCGCAAACTGTTGTTAAATCAGCGTGAGCTTGATTCACTGTATGGCCGTATTAATCGTGAAGGTTATACTGTCGTCGCGCTTTCTCTGTACTGGAAAAATGCCTGGTGCAAAGTGAAGATCGGCGTGGCGAAAGGTAAAAAACAACATGATAAACGTTCCGATCTTAAAGAACGTGAGTGGCAACTGGATAAAGCACGCATTATGAAAAATGCAGGCCGATAATAGACATTTTTTGTGACCAGCTTCTCAGTATGAAGCTGGTCAATGTTCATATCTCATGCATGCCTCGAAAATATATTCTCTGTGTCTGACGCTTATCAATAGCAATAAATACCACAACAAGAATATATTCTTCGTTTAAAAAATTTTTATAGCCTAATAACACAATACTTCAAATGATTTTTATAAAATTTAATATTGCTGAATACCTTCACAAAAGGTATTGTTTTTATGTTAAAAAAGAAAAAGAAGAAGAAATTAAAATTCATTATAAAACATAAAGATAAATAAAAAACTATCATGCGTTGTTAGCCTGCCCCACCTCTCCTTAACACAAACTCCGTTATATTTCAGACGCACTAACATCTTTATCAATAGATCTTAATTTGCAGAAAGATTTTTTTCTGACTATATCATCTAATACGAAAGCACTAGTCAGGCACAAAAAACAAAGGATTATTCGGCGAGAAAACCAGACCTTCACCTACGCTCATAAAAAAGAATATGGCTACGGAAATTCATCTCTCATGATGAACGGGAAGGCTCGTCTACGCATTTTGCCCTGAACGTCGTGCCCGTTAATTAATACACAGAGCAAATCCATCAGGAGCTAATTTATGCGTCTACTCGCCGTGGTTTCGAAATTGACTGGCGTCTCCACCACTGTGGAATCCTCAGCGGTCACTCTTAACGCCCCGTCAATTGTTAAATTATCAGTGGCCCGGGAGGAAATTAGTCAACTTACGCGCATTAATCAGGATCTGGTGGTGACGCTCCATTCCGGCGAAACGATCACGATTAAAAACTTTTACGTTACCAACGATCTGGGCGCAAGCCAGCTGGTACTGGCGGAAAACGATGGCACGTTATGGTGGGTAGAAAATCCGCAAGCCGGGCTACATTTTGAACAAATCGCTGATATTAATGAACTGCTGGTCACTTCCGGCGCTTCCCATGAAGCAGGCGGCGCCGTTTGGCCGTGGGTACTGGCTGGCGCGGTGGCGGCTGGCGGCATTGCCGCTATCGCGTCTTCCGGCGGCGGCGATTCCCACCATCATTCGGATGGCGATAATCCGCCCCCCGATAACACCAATCCTGACGGTAATCCCCCTGATAACAGCAATCCCGGCGGCAGTACCCCCAACGGCAATACTCCAG
GTAGCAGTAATCCTGTAGATACTACCCCGCCTCTCGCTCCCGGCGAATTATTGATTTCAGCGGACGGAAAAACGGTAAGCGGGCAAGCCGAAGCGGGCAGTACCATTACCATCAAAGATCCCTCAGGCAACGTCGTTGGCGAGGGCAAAGCGGATAGCGACGGTAAATTTAGTATTGATCTGACAGCACCACAGATTAGCGGCGAACAACTTACCGTGACCGCGACTGACGATGCCGGCAATACCGGCCCATCCGCAACCATCGATGCGCCCAACATTCCTCTCCCCGATACACCAGCTATCACCGCCGCTATCGATGATGCCGCTCCCCTCACCAGCACGCTGAGCAATAACCAGTTTACGAACGACAATACCCCCACTCTGGAGGGCACCGGCAGCGCAGGCACAGTCATCCATATTTACGCCAATGGTCAGGAAATAGGCTCAACAACGGTTGATACCAGCGGAAACTGGCATTTTGCCATTACCAGCGCGCTAGCGGATGGGGAAAATCATTTCACCGCCATTGCGACTAACGTGAAAGGCGAAAGTAGCGAATCAGCCCGCTTTACGCTGACTATCGACACACTCATCCCCGATGCCCCACGTGTTGAACTGATTGCCGATAACACCGGTTTGCTCACCGGGCCGCTACAGAATAATGACCGAACTGACGAGGCAAAACCGCTATTTTCCGGGCAGGGAGAGGCAGGCAATACCATCACGATTAAAGAGGGTTCAACCGTTATCGGCAGCGCTACCGTAGACGAAAATGGACGCTGGACCTTTACGCCGACTACGCCGTTAAGCGATGGCGAACATACCTTTACCGTCGAACAAAGCGACAAAGCCGGAAACACGAGCCGCGTGACGACAACGCCTACTATCATTGTGGACACCACGCCGCCTGACGCCGCTATCATTGATAATGTTGCGAAAGACGGCACAACCGTTAGCGGCACCGCTGAAGCTGGCAGTACTGTGTCGATCTATGACCCGGCGGGAAATTACCTGGGCTCCACGATTACCGGAGAAAATAACCACTTCAGCATCACGCTGAATCCGGCTCAGACCCACGGCGAGCGTCTGGAAGCGCGTATTCAGGACGCCGTCGGTAACATCGGCCCCGCCACGGAGTTTACCGCTTCTGACTCACAGTATCCTGCCCAGCCGACTATCCTTACCGTGACGGATGACGCTGGCGCCGTTACCGGGCTGCTGAAAAATGGCGATGCCACAGATGATAACCGCCCAACCCTCAGCGGTACTGCTGAACCAGGCAGTACGATATCGATTAACGATAATGGCTTTCCTGTAGCGACCTTTCCGCCCATTGTCGCTGACGCTGACGGCAAATGGAGCTTTACCCCCTCGCTGGCGCTTGCCGATGGCGACCATGTCTTTACCGCTACCGCGACCAACGATCGCGGCACCAGCGGGCAGTCCGTCTCCTTTACCATTGATATCGACACGCAGCCGCCGGTGCTGGAAGGCCTGGCGGTTAGCGACGTCGGCGACAGACTCACCGGCACTACGGAAGCTGGCAGTACTGTGGTTATCAAAGACAGCCTGGGAAATACGCTCGGGAGTGGAACGGCAGGCGACGACGGTACCTTCTCAATAGGTATTAGCCCGGCGAAAATTAACGGCGAAACATTAAGCATTAGCGTTACCGATAAAGCCGCGAATAGCGGTCCGGTAGAAACGCTGAACGCGCCGGATAAAACTGCGCCTGCGGCACCGGACGGTCTTACCGTGGCGACCGACGGTCTGTCCGTAAGCGGTCAGGCGGAAGCCGGGGCAACGGTCACTATCCGCGACAGTAGCAACACCGTACTTGGCAGCGCCGTCGCTAATGGCAACGGACAATTTATCGTTCCGCTGAATACGGCGCAGACTAACGGCCAGGCGCTTATCGCTACCGCCACCGATGTCGCGAAAAACGAAAGCGCCGCCGCGACGGTTATCGCGCCGGACAGTACCGCGCCGGAAATGCCGAAA
AACGTGGTAATTAGTGAGGATGGCACCAGTATCAGCGGCACCGCCGAACCGGGTAGCGCCATCACGATCGCCACGCCGGACGGCAAGCCGCTTGGCAGCGGCAAAGCAGATGGCGAAGGTCATTTTACCCTTCCCCTCGTCCCCGCACAGACCAACGGCGAACAGGTTACCGTCACCGCCACCGACAGCGCCAACAACGTCAGCCCGCCAACCACAGCGCAAGCGCCCGATATCACCGCCCCGGATAAGCCCATTATCACTCAGGTACTGGACGATGTTGAAAGCTTCACCGGGCCGCTGGTTAACGGACAAACCACCAACGACAACCGTCCCACCCTTAGCGGTACGGCGGAGGCCGGCGCGCGTGTCGAAGTCTTTGATAACGGCGTTTCACTGGGACTCGCCACGCTACAGCCCAACGGTGCCTGGACGTTTACGCCGTCGCAAAATTTAGGTGAAGGCGCGCATCGACTGACCGTAATCGCAACCGACGCTAAAGGCAATGCCAGTCAGGCCGCGTCATTCGACCTGGTGGTCGATACGCAATCGCCGCAGCAACCGGTAATCACCTTCATTACAGATGATGCGCCGGGTATTCTCGGTAGCGTCGCGCATCTGGGGCTCACTAACGACAGCACGCCAACGATTAATGGTACAGGTGAACCGGGTTCCACAGTACACCTGTATCAGAATGGCGCCCGGATAGCAGATATTATCGTCGGTAATTCCGGCGTCTGGAGCTACGCTTACACCACGGCCTCGCCACTGGCGGACGACACCTACACCTTTACCGTGACGGCCAGCGACAGTAACGGCAACACCACGCCTTTTTCGACCGATTTTACGATTACCATTGATACCCAGGCCCCTGCCGCCCCCGGCGTTATCGGCGTAGCTGACGGCGACGGAAATACGATTGATACCAATCAGATTACCCAGGAATCCCAGCCCCGGTTGAGCGGTAGCGGCACCGCAGGCGACACAATCATCCTTTACGATAATGGCAATGCCATAGGTCAGGCGCTGGTCGGCACGGACGGGCGCTGGCAGTTTACGCCGCCTGCCGCGCTGGGCGACGGCGACCACCATCTGACCGCTCGCGCCAACGATCCGGCGGGGAACGAAAGTCCCGAATCCATCAGCTTTACCCTACGCATCGATACCCAGGCGCCGGATGCGCCGCAGATCGTGTCAGCCGCCATCACCGGCGGAGAAGGCGAGGTGCTACTGGCAAACGGCAGTATTACCAATCAGCGTATGCCGACCCTCAGCGGCACCGGCGAACCCGGCACCATCATCACCCTGTACAATAACGGCGTAGAACTGGCTACCGTCCAGGTCAATCCACAGGGTAGCTGGACCTATCCGCTAACCCGTAATCTGAGCGAAGGGTTAAACATCCTGACGGCCACCGCCACGGATGCCGCAGGCAATAGTAGCCCGACCTCCGGCGTTTTCTCCGTTACCCTTGATACCCAGCCTCCAGCGCAGCCTGACGCGCCGCTAATCAGCGATAACGTCGCGCCGGTTATTGGCAACATCGGCAATAATGGCGCAACGAACGATACCACGCCGACCTTCAGCGGCACGGGAGAGATCGGCAGCACGATAATTCTCTACAATAATGGCAGTGAAATTGGTCGCACAACGGTAGGCGATAACGGTAGCTGGAACTTTACGCCTGCGGCACTGACGCCAGAAACCTATACCATTACCGTCACGGAAACCGATAGAGCGGGCAATATCAGTCCACCTTCCGCCTCAGTCACTTTTACGCTAGACACCACTGCGCCCGCCAATCCGGTTATCACTTTTGCCGAAGATAACGTCGGCGAAGTCCAGGATACTATTGTCAGCGGCGCAACCACTGACGACAATACACCGGTCATTCACGGCACTGGCGACATCGGCAGCATTATTACGCTCTATAATGGCAGCAGCGTTTTAGGCGTAGTCACCGTCGATGAGACCGGCACCTGGACGCTAGCGGTGACCAGCGC
GCTGCCGGATGGCGTCTACACCCTGACTGCCATTGCCGCCGATGCCGCCGGAAACAGCAGCGGCGTATCGAACAGCTTTACCTTCACCGTCGACACCGTTCCGTTGCAGCCGCCCGTCGTCAATGAAATCCTTGACGATGTTGCGCCAGTGACCGGGCCATTAACCGATGGCGCCTTTACTAACGATCGGACGCTGACTATTAACGGCAGCGGCGAAAACGGCAGCACCGTCACGATTTACGACAATGGCGTGGCAATCGGGACGGCGCTCGTCACCGACGGGGTCTGGACATTCAATACGCCCGAATTATCAGAAGCCAGCCATGCGCTAACCTTCAGCGCGACTGACGATGCTGGAAATACCACGGCGCAAACCCAGCCGATCACTATTACCGTGGATATCACTGCCCCGCCCGCGCCAACGATCCAGACGGTGGACGACGATGGCACGCGCGTCGCCGGACTTGCCGATCCTTACGCTACCGTTGAAATTCACCATGCCGATGGCACCCTGGTCGGCAGCGCTGTCGCTAATGGCACCGGTGAATTCGTCGTTACGCTCAGTCCGGCGCAAACCGATGGCGGTACGCTGACGGCAATTGCTATCGATCGCGCGGGGAATAACGGCCCGGCTACGAATTTTCCCGCTTCCGACAGCGGTCTGCCCGCCGTCCCGGCCATCACGGCGATTGAAGATGATGTCGGGAGCGTACAGGGGAATATTGCGGCGGGCGGCGCCACGGACGACACCATGCCGACGCTGCGCGGCACCACGGATATCGGCTCTACCGTTGAAGTTTTCATTGATGGCGATTCGGCAGGCTTTGCCACCGTTGACGCCAGCGGGAACTGGATCTTTGAGATCGCGACGCCATTAAGCGAAAGCACACATTACTTCACCGTCCAGGCAACCAATGCGAATGGCCCGGGCGGCCTGTCCGCACCGGTCGGGATCACTGTCGATCTTAGCGCGCCGGCGCAACCGGTTATTACCAGCGCAACGGATGATGTCCCCGGCATGACCGGTACGCTGGATAACGGCGCGCTCACCAATGATTCACGCCCGACGCTCAACGGAACGGGAGAAGCTGGCGCCACGATCCGCATTCTGGATAACGGCGTAGAAATCGGTTCCGCCACGGTAGATCAAAGCGGCAACTGGCGCTTCACCCCGAACGCGCCGCTAGAGAGCAACGCGCACATCTTTACCGCCGTGGCGACCGATCCCGCTGGCAATAGCGGCCAGCCTTCGGACGGCTTTACGCTGAACATTGACGCGCAGGCGCCAGATGTGCCGGTTATCACGTCCGTGATTGACGATAACAATCAACCGACCGTTCCGGTGTTACCGGGGCAATCCACCGACGATCGGCAGCCAATACTGAACGGAACTGGCGAACCTGGCGCGACAATCACCATTTTTGATAACGGTACGCCGCTTGGCACGGCTCAGGTAGGCGAAAACGGTAGCTGGACCTTCCCGGTGCCCCGCAATTTGTCAGAGGGAAGCCATAATCTGACGGTTAGCGCTACCGATCCGGCGGGCAATACCAGCGCGGTCTCCGCGCCGTGGACGATCGTGGTCGATATTACGCCTCCGGCGATCCCGGTTCTCACCTCCGTCGTGGATGACCAGCCCGGAATTACCGGCAACCTGGTTAGCGGGCAGCTAACGAACGATGCGACGCCCACCCTGAACGGGCGCGGAGAGGCAGGCGCGACGATTAATGTCTATCTTGACGGTAATCCCGCGTCCATCGGTACCACGACGGTGAATAGCGACGGCACGTGGAGTTTCACGCCGCAGACGCCGCTTGCAAACGGTAGCCACACGTTCACCCTTAGCGCCACCGATCCGGCGGGTAATAGCAGCTCGGTGTCCAGCGGATTTGTGCTGACGATTGACGCCACACCGCCCGCCGCGCCGGTTATCGCCAGCGTGGCAGATAATACGGCGCCGGTAACGGGCATCGTCCCCAACGGCGGCTCGACGAACGAAA
CCCGACCAACACTCTCGGGTACCGGTGAGGCGGGTACAACCATCTCGATTTATAATGGCAGCGCGCTGGTCGGTACGGCGCAAGTTCAGGCCAACGGTAGCTGGAGCTTTACACCGTCTACCTCGCTGGGCGCGGGCGTCTGGAACCTGACGGCGACAGCAACCGATGCGGCAGGCAATACCAGCGCCGCGTCCGAAATACGCTCGTTTACTATTGATACCACGGCTCCCGCCGCGCCTGTTATTGATACGGTCTACGACGGTACGGGCCCCATTACCGGCAATCTGAGTTCAGGACAGATCACAGACGAGGCGCGCCCTGTCATTAGCGGCACCCGTGAAGCCAACACAACTATTCGTCTCTACGATAACGGCACGCTGCTGGCTGAAATTCCCGCCGACAATAGCAGTAGCTGGCGCTACACGCCCGACGCCTCGCTGGCGACGGGCAACCATGTAATTACCGTCATTGCCGTTGATGCCGCAGGCAACGCCAGCCCCGTTTCGGACAGCGTTAATTTCGTCGTCGATACTACGCCGCCGCTGACGCCGGTAATCACATCAGTCAGTGACGATCAGGCGCCAGGCCTCGGCACGATCGCGAACGGCCAAAATACCAACGATCCTACGCCAACCTTCAGCGGCACCGCAGAAGCCGGCGCCACGATCACGCTCTATGAAAATGGTACGGTCATTGGCACGACAACGGCTCAGCCTGACGGCGCGTGGAGCGTCTCCACCTCAACGCTGGCAAGCGGAACGCACGTCATCACCGCCGTCGCCACCGATGCCGCAGGAAACAGCAGCCCGAACAGTACGGCTTTCACCCTGACGGTCGATACCACCGCGCCGCAAACGCCAATCCTGACGTCCGTGGTGGATGACGTCGCGGGTGGGGTCACAGGAAATCTCGCTAATGGTCAGATAACCAATGATAACCGCCCCACGCTGAACGGCACTGCCGAAGCGGGCAGCGTGGTCAGTATCTATGATGGCGACACTCTGCTTGGCGTCACCTCGGCTAACGCGAGCGGCGCGTGGAGCTTCACGCCGACGACAGGGTTAAACGACGGCACGCGCACATTAACAGTGACCGCCACCGACCCGGCAGGCAACGTTAGCCCGGCCACCAGCGGTTTTACTATCGTGGTCGATACCCTTGCGCCAACGGTTCCGCTTATAACCAGCATCGTTGATGATGTCCCGAACAATACCGGCGCCATTGGCAATGGACAATCGACCAACGACACACAGCCGACGCTCAACGGTACTGCGGAAGCCAACAGCGCGGTAAGCATCTTCGATAATAGCGCGCTGGTCGCGACCGTGAACGCCAATGCCAGCGGCAACTGGAGCTGGACGCCAACCGCCGCGCTCGGCCAGGGAAGTCACGCCTATAGCGTTAGCGCCGCCGATGCGGCTGGCAACGTTAGCGCCGCTTCGCCATCGATAACGATTATCGTGGATACCATTGCGCCCGGCGCGCCCGGCAACCTGGTCATCAATGCTACCGGTAATCGGGTGACGGGCACCGCGGAAGCAGGCAGTACAGTGACGATTACCTCTGATACTGGTGTGGTACTGGGAACCGCCACCGCCGACGGTACAGGCAGCTTCACCGCCACACTCACGCCCGCGCAGACCAATGGTCAGCCGCTACTGGCATTTGCCCAGGATAAAGCAGGCAACACAGGCATTGCCGCCGGATTTACCGCGCCCGATACGCGTGTGCCGGAAGCACCGATCATCACCAACGTAGTAGATGATGTGGGTATTTATACCGGCGCTATCGCCAACGGCCAGGTCACTAATGACGCACAACCCACATTGAATGGTACCGCTCAGGCGGGCGCCACGGTGAGCATTTATAACAACGGGGCGCTGCTCGGCACCACCACGGCGAACGCCAGCGGAAACTGGAGCTTTACCCCGACAGGCAATTTGACCGAAGGCAGCCACGCCTTCACCGCCACCGCGACTAACGCCAACGGTACAGGCAGC
GTCTCCACCGCCGCGACGGTGATTGTCGATACGCTGGCGCCCGGTACGCCGTCAGGTACGCTCAGCGCCGATGGCGGTTCACTTTCCGGGCTGGCAGAGGCAAACAGCACCGTAACCGTCACGCTGACGGGGGGCGTGACGCTCACCACCACCGCTGGCAGCAACGGCGCATGGTCTCTCACCTTGCCGACAAAACAAATTGAAGGTCAGCTCATTAACGTGACGGCCACTGACGCTGCGGGTAACGCCTCCGGCACGTTAGGCATTACCGCGCCGGTTCTGCCGCTGGCGGCAAGGGATAACATCACCAGCCTTGATCTGACCTCTACCGCCGTCACCAGCACGCAAAGCTATTCGGATTACGGCCTGCTGCTGGTTGGCGCGCTTGGCAATGTCGCCTCGGTTTTGGGTAACGATACCGCTCAGGTTGAGTTCACCATTGCTGAAGGTGGTACGGGCGACGTCACCATCGATGCCGCCGCAACGGGAATCGTGCTTTCGCTGCTCAGTACTCAGGAGATTGTGGTACAGCGCTACGACACCAGCCTCGGCGCCTGGACGACGATCGTCAACACCGCCGTTGGCGACTTCGCGAATTTGCTTACCCTGACCGGGAGCGGCGTTACCCTGAACCTGAGCGGCCTGGGCGAAGGCCAGTACCGGGTACTCACTTATAACACCAGTCTGCTCGCCACCGGGTCATATACCAGCCTGGATGTCGATGTACACCAGACCAGCGCAGGTATTATTAGCGGGCCAACCATCAGTACCGGCAACGTCATGGCTGATGATACCGCGCCGACGGGCACCACGGTCACCGCCATCACCAACGCCAACGGCGTCAGTACGCCGGTCGGCGCGGGCGGCGTGGATATCCAGGGACAATACGGCACGCTGCACATTAATCAGGACGGCAGTTACACCTACACGCTGGCTAAGCCCACGGCAGGATACGGACATAAAGAGAGCTTCACCTACACCATTACCCAGAATGGCGTCGGTAGCAGCGCCGCGCAACTGGTTATCAATCTGGGTCCCGCGCCTGTACCGGGCAGCGTGATAGCGACAGACAATAACGCCTCGCTGGTCTTTGATACTCACGTTAGCTACGTCAACAACGGTCCCTCGACACAAAGCGGCGTCACGGTATTAAGCGTCGGACTTGGTAATGTACTGAACGCGAATCTGCTTGATGATATGACTAATCCGATCATCTTTAACGTTGAAGAAGGCGCTACGCGAACCATGACGTTACAGGGAACCGTCGGCGGCGTCTCACTGGTTTCCACGTTCGATCTGTACGTTTATCGCTTCAACGATGCCATTCAGCAATATGAGCAGTTCCGGGTGGAAAAGGGCTGGATTAACACCCTGCTGTTAGCCGGACAGTCCCAGCCGCTGACCCTGACGTTGCCTGGCGGCGAATACCTGTTCGTGCTGAATACCGCCAGCGGCATTAGCGTCCTCACTGGCTATACGTTGGCGATTTCCCAGGACCACACCTATGCCGTTGACAGTATCACCGCCAACACCACCGGCAACGTACTGACCAATGATGTCGTCCCTACGGACGCCCTCCTCACTGAAGTAAACGGCGTGGCGATTGCGGCGACCGGCACAACGGAGGTAAACGGGCTGTATGGCTCGCTCATCATTGACGCAAGAGGCAACTATACCTACACGCTGAAGAACGGCGTCGGCGCCGACAGCATTAAAACGCCGGACAGCTTTATCTATACGGTCAAAGCGCCAAACGGCGATACCGATACGGCCTCGCTCAATATCACGCCAACCGCCAGGGCGCTGGATGCGATTAATGATGTCAGCGATACCCTCAGCGTCGCCACGCTTCAGGATACCGCTGCCTGGCTGGACTCCAGCGTCGGCAGCGCCAGTTGGGGGCTACTCGGCAAATCGGGCAGCGGGAGCGGCACCTTTGACGTTGCAACGGGCACCGTACTTAAAGGCGCGTCACTGGTCTTTGATGTCTCCACGCTCAT
TACGCTGGGCAATCTGAATATTAGCTGGGCCATTCAGGAGAACGGGACCGTCATACGCAACGGAACCGTTCCGGTGGCGAATATCACGCTGGGCAGCGCGACGGTGACCGTCAACCTGAGCGGTCTGGAGCTGGATGCCGGAACGTACACGCTTAACTTTACCGGCACCAATACCCTGGCCGGGGCGGCGACGATCACGCCACGCGTCATCGGCACCACCGTCGATCTGGATAATTTTGAAACGTCCGGAACGCATACCGTTCTCGGCAATATTTTTGACGGCAGCGACGCGGCGGGGGCGATGGATCAGCTTAATACGGTGAATACCCGCCTGAGCATTAGCGGGTATAACGGCAGCGCCGCCACGCTGGACGCCGCGGCGAATACCACCAGCGCCACGATTCAGGGACATTACGGCACATTGCAAATTAACCTCGATGGCGCTTACACCTACACCCTGAATAATGGCGTCGCGATGTCGTCCATCACCAGTAAAGAGGTCTTTACCTATCAACTGGATGACAAGATGGGTCATACGGATAGCGCCACATTGACCATTGATATGGCGCCACAAATCGTCAGTACCAACCAAAACGATGTTCTCATCGGCTCCGCCTATGGCGATACGCTGATTTACCATCTGTTAAACGGCGCGGACGCGACCGGCGGCAACGGCGTCGATCGCTGGCAAAACTTCTCTACCGCGCAGGGCGACAAGATCGATATCCACGAACTGCTGACCGGCTGGGATCACCAGGCGGCGACGCTGGGTAACTTTGTGCAGGTTCATACCAGCGGCGCCAATACGGTGATATCCGTCGATCGCGACGGCGCCGGCAGTGCGTTTAAATCCACTGACCTTGTCACTCTGGAGAATGTGCAGCTCACGCTAAATGATCTGTTGCAGAACAACCACCTGATAACCGGCGGTTGATAAAAAAGCCCCGAACGCCACGACGTTCGGGGCAACAAGCGGTGATATTTAAAAGGGATAAACTATGGGAAGAGTCGCGCCTGTCGCCATCGTACTGGCATTTGCTTTGTTTCATCACCAGCCCCGGGGCGCAGAAGCGCCGCCAATGATTACATCCGAAGGATTAGCAACGGACCAGATGCTTCCTTCGCTGGATGGCTCCGCCGCTGAGTTGCCGCTCAGCGCCGCCGCGCCGGGCAACCTGACGCTCAATGACGCGGTCAATCGCGCCGTTAACTGGCATCCTTCTATTCGCGAAGCCGTCGGCAAACTGCTCGCACAGAATGAACAAATAGAGGTCGCCAAATCGAAATATTATCCGCAAGTCAGCGCTGGCGTGAACAATGGGTACAGCAATACCTACACCGATCACGGGTATAGCCCCTCGCTGGTGCTGTCGGTATCGCAAATGCTTTATGACTTCGGCAAAGTGGCAAGCCAGGTTCGCGCCGAAACCGCAGGCGCAGCGCAGCAACAGGCCAATGTGTTGCTCAGTATTGATACCGTCGCGCACGAAACCGCCAACGCCATTGTACAAACGCAGAGCTGGCAGCAAATGGTAGACGCGGCGGAAGAACAGCTCGTCGCGCTGGACAGCATCGGTAAACTTATCCGGCAGCGCAGCGATGAAGGCGCTACGTCGCTATCTGACGTGGTGCAAACCGAAGCCAGAATCGAATCCGCCCGCTCGCAACTGGCGCAGTATCAGGCCAATCTCGACAGTGCGAAAGCCTCACTGATGAGCTGGCTGGGCTGGAATTCGCTTAACGGCATCAATAATGACTTCCCGGCGAAACTTGCTCGCAGCTGTGAGACGGCGACTCCCGACGATCGACTGGTGCCCGCGGTACTGGCCGCCTGGGCGCAGGCTAACGTTGCGCGGGCGAATCTGGACTATGCCAGCGCGCAAATGACGCCGACGATTTCGCTCGAACCTTCTGTACAACATTATCTTAATGATAAATATCCCAGTCATGAAGTGCTGGATAAAACCCAGTATTCCACCTGGGTGAAAGTTGAGATGCCG
CTTTACCAGGGCGGAGGGCTGACTGCCCGACGCAACGCCGCCAGCCATGCGGTAGACGCGGCTCAGTCGACCATTCAGCGCACCCGGCTTGATGTCCGCCAGAAACTGATGGAGGCGCGCAGCCAGGCAATGAGTCTTGCCAGCGCGTTACAAATCCTTCGTCGGCAACAGCAGCTTAGCGAACGCACGCGCGAACTGTATCAGCAGCAATACCTTAACCTCGGTTCCCGCCCGCTGCTCGACGTGCTTAATGCGGAGCAGGAAGTTTACCAGGCGCGTTTTGCCGAACTGCAAACGGAAAGCCAGTTGCATCAGTTGCAACTGAACTGTCTGTATAACACCGGCGCGCTTCGTCAGGCGTTCGCGTTAAATCATCGCAGCATTCAATCCGTGGAGATCCAGCCATGACCCGCGCCGCCCCCGATGTAGAAGAGGTACTCAGTGAGCGCGCGCTAAGCCAATGGGCGCAGGCCATCAGCTATGTCGCCGGTCATTATCGCGTTGCCTGTTCTCCCGGTTCAATTCAGGCCAACGCCCCGTGGTTTAGGGGTAAAAGCAGAACGACCGCCTTAACGCAGCTCGCGCGGCAGGCGGGTTTATCCTTTCATGCGCCGGACATAGACAAAACGGCATTTAGTCAGTGGCGATTGCCGTTGGTTGTCGAGCTCCGGGACGGGCAGTTATTGGTCATCGAGCATGTTAATGGCGAAGATGCGGTAGACGTTTTTGTGATTGAAGAAGAGGGTCAGCGTAACCGCCTGACGCTCAGCGAGTTATTGCCGGAGATCCTCTATGTTGCCGCGCTACGCCCGCTATCGGCGCTCAAGGATAGCCGCGTCGATCGCTATATCTCCCGCTTTAAACCCGACTGGATGCGCGAGCTGGTTCTGCAGGATATTCGCCCTTATTTACCGGTAATGGTCGCCGCCTTTCTGATTAACGTGCTGTCGCTGGCCGGGATTGTGTTCTCTATGCAGGTTTACGATCGGGTGATCCCCGCTCAGTCATATCCTACCCTGTACGTGCTCTCCTTCGGCGTGCTGGTAGCGGTGCTGTTCGGTTTTCTGCTGCGTGAAGCGCGTACGCACATTATGGACGTGCTCGGTAAACGCGCCGATATGCGCATTTCCGATCGGGTATTCGGTCATGCGTTAAGGCTGCGCAACAGCGCTATCCCCCGTTCCACCGGCAGTTTTATCTCGCAGTTACGCGAACTAGAGCAGATCCGCGAGATGATTACCTCTTCAACGCTGGCGACCATTGTCGATCTGCCTTTCTTTTTCCTGTTTATGATAGTCCTGGCGATTATTGCCCCACCGCTGGCATGGATAGCGCCCGTGGCGGCTCTGCTGATGATCCTGCCGGGCGTCGCACTGCAAAAAAAACTGGCTGTCCTCGCCAACCAGGCCGCCCACGAAGCGACGCTACGCAATGCAGTCTTGGTCGAAAGCGTTCAGGGGCTGGAGGATATTAAACTGATGCAGGCGGAGAACCGCTTTCTTCAGCAGTGGAATAGCTATATCCGTATTACCGGCGAGTCGGGTCTACGCACCCGCAAACTCACGCAAGGGCTTATCAGTTGGGGGATGTCGGTACAAAGTCTGGTGTATGCCGCAGTGATTATGTTCGGCGCGCCGATGGTCATTGAAGGCAGTATGACTACCGGCGCCGTGGTCGCCGCGTCGATGCTCGGCTCACGGATGATCGCGCCAATGGCTAACCTGTGCGGCGTGCTGGCGCGCTGGCAGCAGGTGAAAGCGGCGAAGATGGGTCTCGACAACATTATGCAACTGCCCACCGAGACACAGCATGACGATAGCCTGATACACCGCGACATCCTCCACGGGCATTACCTCTTTGAAAATGCTCAGTTTCGTTACCATAACGACGATCAACGCATACCGCTGCGCCTCGTGCGTCTGGAGATCATGCCAGGCGAGCGGATTGCGATACTGGGGCGTAACGGCGCGGGTAAATCCACACTTTTACAGGCGATGGCG
GGCGGCCTGGAGATGATTCAGGGAGATGCCCGGCTTGATAATCTCAGCTTGTCGCACATCGATATGGCGGATTTACGCCGCAACATCGGCTTTCTTAGCCAGAACGCGCGGCTTTTCTTCGGCACCCTACGAGAGAACCTGACGCTCGGCGCGCCGCACGCCAACGATGAACAGATTTTTGACGCGCTCGAAGTCAGCGGCGGCGCCGTCTTCGTCAGGCGGCTGGCAAAAGGGTTGGATCATCCCATTATGGAGGGCGGTAACGGCCTGTCCGGTGGGCAGCGCCAGTCCCTGTTGCTGGCGAGAATGCTGCTGCGCTCCCCCAATATTGTGCTACTTGATGAACCCAGCGCCTCGCTGGACGAGCATACGGAACGAGAGTTTATTCAGCGGTTACATCAGTGGCTTGGCAACCGTACCCTGGTCGTCGCGACGCACCGGGTACCAATACTGGAGCTGGTTGAGCGTGTCGTTGTCCTGAAAGAAGGACAACTGGTGATGGATGCGCCAAAAGCGCAGGCGCTTAACGCGGATCGGATGCAAAGTCACCGTCGGGAGTGGAAAAATGAAAATCAATCAGCATGATGCCGCGATGGACGATCCCGATATTCAGCGTGAACGGGCGTTTTCCGGCGCGGGTCGTATTGTTCTGATCTGCTCACTGTTATTTCTCATTCTCGGCATCTGGGCGTGGTTTGGCCGACTGGATGAGGTTTCCACCGGCAACGGGAAAGTGATCCCCAGTTCACGCGAACAGGTTCTGCAGTCGCTGGATGGCGGCATTCTGGCGCAGTTGACGGTGCGGGAAGGCGACAGAGTTCAGGCTAACCAGATTGTCGCCCGGCTTGATCCGACGCGTCTGGCGTCCAATGTGGGTGAAAGTGCGGCAAAATATCGCGCTTCACTCGCCTCCAGCGCACGGTTAACCGCGGAAGTCAACGACTTACCTCTCGCCTTCCCCGCTGAGCTGAACGGCTGGCCGGATCTGATTGCCGCAGAGACGCGTCTCTATAAAAGCCGCCGCGCGCAGCTGGCCGATACCGAAGCCGAGCTACGGGATGCGCTGGCGTCGGTTAATAAAGAGCTGGCCATTACCCAGCGTCTGGAGAAAAGCGGCGCGGCCAGTCATGTTGAAGTGCTGCGCCTGCAACGACAAAAAAGCGATTTAGGCTTAAAAATTACCGATCTGCGCTCACAATATTATGTGCAGGCACGCGAAGCGTTATCAAAAGCGAACGCTGAGGTCGATATGCTCTCCGCCATTTTAAAAGGACGCGAGGATTCCGTCACCCGCCTTACCATACGTTCGCCGGTACGCGGCATTGTTAAAAATATCCAGGTCACGACGATTGGCGGCGTGATCCCGCCTAACGGTGAGATGATGGAGATAGTGCCGGTAGACGATCGTCTGTTGATTGAAACCCGCCTTTCGCCGCGTGATATCGCCTTTATTCATCCCGGCCAACGCGCATTGGTTAAAATTACTGCTTACGATTACGCCATTTACGGCGGGCTTGACGGCGTGGTGGAGACCATTTCACCGGATACCATTCAGGATAAAGTGAAACCGGAAATTTTCTACTATCGCGTGTTTATCCGCACCCACCAGGACTATCTACAAAATAAATCAGGACGCCGTTTTTCGATTGTTCCAGGCATGATCGCCACGGTGGATATCAAAACCGGTGAAAAAACCATTGTCGACTATTTAATCAAACCGTTTAATCGCGCGAAAGAAGCGCTGCGCGAGCGGTAAATCGTTGAAGATAAGAAGATACGGGGGCTGGTCACGCGTGTCACAAGTCTGTTATACTTACCTTACACATTGGGGCTGATTCTGGATTCGACGGGATTTGCGAAACCCAAGGTGCATGCCGAGGGGCGGTTGGCCTCGTAAAAAGCCGCAAAAAAATAGTCGCAAACGACGAAACCTACGCTTTAGCAGCTTAATAACCTGCTTAGAGCCCTCTCTCCCTAGCCTCCGCTCTTAGGACGGGGAT
CAAGAGAGGTCAAACCCAAAAGAGATCGCGTGGATGCCCTGCCTGGGGTTGAAGCGTTAAAACGAATCAGGCTAGTCTGGTAGTGGCGTGTCCGTCCGCAGGTGCCAGGCGAATGTAAAGACTGACTAAGCATGTAGTACCGAGGATGTAGGAATTTCGGACGCGGGTTCAACTCCCGCCAGCTCCACCAAATAAAACAAGGGGTTACGTGAAAACGTAGCCCCTTTTTCTTTGGTAGTGGCGGCAAAATGGCGACAGCGTGTCGGACTGGCGGCAACAAAAAACCCGCCATTAAAGCGGGTTCTGTTCAGAAACTCATGTGGCCTTGACCACCTTTTCCCGGATGTGGTGGCGCAGAAGAAATTAGATTCGGTTTTATAATGTGCCGTACAAACGTTTCATGAGTAACGAACGTACAGCCACAATTTATATTTTGGCACTGGTTGTAACGTTCTTTTGTCGTAGCTGATACCTGAAAGCTGCTTCTGGTATGTGCTGAATGACCGCACTCTGGACAGTTCATCATTGCTGTTATCCCACCACTTTTGCCGTAATCGCAATAATGATACATCATCATTCAATGTTGAGAACCAATTATTCAATTTCAAGATCATCAATCTTTACTTCAAGCTCCATGCTGGTCGTAAATCCATTATCTGGGCTGACCGAATGCATCAAGGTGGTAATGGTCCATTCGGCATCGTCTATGGGCTGCTTAAACCCCGTCACCTTCACCGGCATTTCGGTATAGAGATCGGCCCGCCCTTCAGCGAGCTGCAGGGAAAATGAAGCTACCCCGCGCTGCAGACGTTCCCACTGCATTTTTGCTGCGCGCTCTGCATTGCTGCGGTTAGCGTAGGTACGATTAAGAACCAGCACGTTTTCATCCGTTCCCACCAGATAATCCCCCTGTTTTGCTTCCGGCTCTTTAGGTTTGGTGGTTTTCTTTCGGCGACGCTTAACTTTGGTTGTCTCTTTTTTCCTGGGTTCACGGGTATGCAGCCAGCTGGCGATAACGCCGGTATAGGCGCCACGATCAGCCAGGGTAAAACGATGACCGTCACCGGCTTTACGCTCAATAGTGATAACCGGCAGCGGCTTACCGCTCGCCGTTCTTCCCTGCCCCTGGCGGATAAACAGCAGGTTTCCATCCTTAACGGAAGCAATCGCCCCATACTGTCTCGCCAGTTTCATCAGAAAACTTGCATCACTTTCATTGGTCTGGTCCATATGATCCAGCGCCTTATCCGTCAGGTCTTTACCCAGCGCCACTTTGAGGTTATGCCGGGCGGCGATGTCCTTGACCACATCCCCCACCGTTGTCTGATGCCATGATTTTTCGCGCCGTGTATTGAGGGTTTCACGGAAATCTGCGCTACGCGCCCTGATGGTCAGCCGGTCAGGGGCACCGCTGTGTTCAATTTCATCTACGGTAAAAGCCCCTTTAGGGAAAAGCGGCTGGCCTTTCCAGCCCAGCGCCAGCTGAATCACAGCCCCACGTCGCGGCAGGGCGATCAGCCCGTCGGCGTCGTCCAGCTCCAGATCAAGCTGGTCCGCTTCAAAGCCCCGGTTATCCGTCAGTGTCAGACTCATCAGGCGGGTATCCAGCACGGTCGTCACGTCCTTACCTTCAATGACGATACTGAAAGCCGGGCTTTTGCTGTTCAGATTCAGGAGATCAGAATTAACGTTCACTGCAGCAATCCTCCAACCGTGTTTTTAATCTCCCCAATAGCAGAAGCTGCAGAGTCCTGCAGGTTGCTGAGCTGGTCACTCAGGCTCCCGAACATGTCAGAGAGCGACTCATCAACCCGTTTGAGGGTGATCGTAAACTCAATGCGCCTGGGCATTCCGCTGGCAAAAAACTCCGTCTTTGTCTGGCTCAGACTCTCAATAACAAACATGCCGTAAATGGTTCCACTGCCTTGAATCAAAGGCCATGCCTTGCCCAGTTCAGCCATTTGTTCAAGCGCCAGTAATGACAGTCTGCCGCCGGTCAC
TTCCGGCAGCAGAACCCCGGATAGTGTCAGTGAGTCGTTATCCGGGCCAAGAAACTGCGTTGACGGACGGCGGTTCACCCGGCTGTTGGCGGCGTGTCGCCAGCTGCGCTGATACTGCAGTTCCTGATAAGGGACAGTGCGCAGCATAAATACATATAAACCCAGTACCATCATCATGATTCATACCCCCCCTGATCGCTGAAATTGCTGCGCGCTTTTGCCCTGGCCCGGCGCTCCCGTTCGTCAAGCTGGCGTGCCACTTCACGGGCAATATCCTGCGCGTTCTGCCCAGGCTGAGCGACAATATGAATGGGCGCATTTATCTCATAACGAATAACCGGCGGCGGGCTATCTGCCTTAGCAAGCGGGGGCTGGTATGCCCTCGCAGGCAAACTGAACGGATGAAGCGGAGCCGCTTCTGCAGGTGTCGCAGCTACCCCCATCACGCCAGCAACGACAGAGGCCAGCGCAGCAGTACGCCGCCTGCTGGTAACATTTGCCGGTCCGTTCACAATTTCAGGACCATTTTCTCCGACAATGCCAAACTGCCCGCGTGGAATGATCCCGCCCGTGTCGTACATCCCCGCGTAAGCCGGGAACCCGCCTGGCGGAAGCACCACTTTGCCGTCACTGTTCACTGTGGCGGACTGCTGCTGCGTAACCTGCGCAGGTAGTTTCGCCTTTGCCGCCTCCTTACTAACAATACCGAGCTTTTCCAGCAGCCATGACACACCGGATTTAAGTGACTCAAGTGGGTGCATCACCATATTCAGACCTTCCGCCAGCGCCTCACCAAACCGACGCCCCATTGCAGCTGCGCTGTTCAGTTCTTCGGCAGTGGATTTAACCGGGGTAAGTAAATCATTGAACCAGCCCCACAAGGCCTGCACCCTGTCACCAATCCACTGAAACACGGGTCTGAGCGGCTCAAAAGCGGCGCTGATGGGCGCAGCAGCGGCTCTGAACCCTTCCACCACGCCCCCCAGAAATGCACTGATGGGCTGCCAGTATTTCCAGATAACCAGCGCCACGCCTGCCAGTGCAGCCACAACCAGCCCTACAGGACTGAGCAGCGCACCCAGCAGGCTACCAACAGCAAATAATGCCACGCGCAGCAAAGCCAGCGGACCAGAGATCAACAAACGCAGCACGCTACCTGTACGTGTGGCAGCGGCGGCTGCAGAAGGTAACGCTTTAACTGACAGCATGGACAGGCCAAACCGGATAACCGCCAGCGGTCCCAGCACAGCAGCCACCGCCACTGCCAGCGCCCCCAACCCAACAGTAATGGCTGCCGTAGCTGCCGCCACTTTCATCAGCGTGCCAGCCAGCACGGGATTCTGCTCAACCCAGCGACGCAACGCCCCGGTCACGCGCTTAACCATGCCCATAATATCCATCAGCGGCTGGCGCAGCGTTTCCCCTAGGCTGCTGAAAGCGTTTTGCGCGCCCGTCTTAACCAGCAACCACTGCGCAGACAATGAATCCTTGTTAATGTCGGATTCTTTCTGCATGGAGCCATTAGCACCACTGCCTGATGTGAGTTTCAGCTGGCGCTGCAGCTCCGGCAGGTTGTTAGCCAGCTTTGCCGCATCATCGCCAAACTCTTTGCCAAAAATCATTGTCATGGCTGACAGGCGTTTATCCTGCGGCAGATTGTTGACCTTCTCCAGAACCCGCTGAATTGTGCCCATGGCATCGGTGGTCATCTGCTTTTCAATCTCCGCCGGATTGAGTTGCAACAGATTCATGCCTTCAAAAAATCGTTTACTTTGCATGGTAGCAATGGACAGTTCACGCACCATGGCATTAGAGGCGCTGGCGGCGATTTCCGGGGCAGCCCCAAGAGAAAGGAATGTTGAACCCAGCGCCGCGGCCTTTCGGAAGTCAAGGCGGTCAGCCACGCCCCCCATACGCTGCAGGACGTTGATAATGTCCCCACCCTTTGACATGGCGTTATCGTCCAGGTAGTTCAGCGCATCGCACAGTTGTTCAATATT
GCGCGTCGGAACTTTATAGAGCTGCGCGATTTTCCCCAGTCCTTCTGCCAGTTCATCTGCGGGCAGCTCAAAGGCCGTTGCCGCTTTTGCCGCCGTGGATGCAAAAGCCAGCAGGTCACGTTTCTGCTCTTCGTAAGGATCGTCCTGATTGGTCACCCCCATGCGAGCACCACCTTCAACCAGCGCGGCATAGTCTATAGCGCCGTTCTCCATCGGCAGCTGTTCGCTGGCGGCCTTGATGGCATCCTGCATGTCATAAAACTGTTTTGTGCGGTTGCCATTATCGTCCCGCAGCCCGTTTACCTGCTTTGCCACGCCTTTCATGGCATCTTCCATGCTGGCGTAACTCTTAACTGCTGCCACAACAGGTGCGCCCATTGCCACCCCCGCAGCCGTAGTGGTAGCCCCTGCCCCGGCGATGCGATCCCGCACCTCAAGACGGCGTGAATACTGATCGCGGACGGCGTTCATTCGCGCCTGCTGTTCGCCCAGGCGTTTAAGGGATTTCTGCTGCCGGTCCAGGGCCTGCCGGGTTTCGTCGGCATTCTGCCGCAGTTCCCGCTGCACACTACTGAGTTTTTTCGTGTCCAGTCCGGCTTCATTGAGCGCAAGACGCTGGCGCTGCACCGACTGACGTAGACCGTTATATTTGCTCTGTAACTCCGTAACGCGGTTTTTTGCCTGCTCAAGCAGCCGTGCCTGCGCCGCCGTCGGGCGATTGGTAGCAGAGAATTGCGTGGCAAGTTTCGCAGTTTCTTCGCGTGCGGCTTTCAGGCTGTTACCGGTGACTGCCAGCTGCGCGCTGGCCTTGCGGAAACCGTCAATACGGCCCGCCTGAGCATCTAATTCTTTTAAACGGGCGCGGCTTTGCTGAATGGCTGCAGCCAGCTCTTTTGAGCTGGCCTGCGCGGATCGAAATGGGCGGGTGAGCTTGTCAACCGCATTAAGAATCACCTGCAGACGCAGGTTGTTGTCACTCATCGCTGGCCCCGCTTCGCTGAATTGCCTTATGCCGCCACGCCAGCACCTCAGTCAGCGGCATAACGTCAGTGATGGATGGCGACCAGTGAAAGATGGTGGCGATGTCCGCCACAAGATCATCAATCGTCAGGCTGTCGGTAAACCGGCAAGCACCGACCTCGGCAACAAAAAAGTCACCACCTCTACCGACAGCGCGGTGAGATCGGCGGGGTCCAGCTCTGCCATTTCCTGCGCGGTCAGCGTCGGGGTGGAGATTCGTGGAATCACAGTCATCATTGCGCCCACGTCCATATCCATAATGGCCTGCAGACGGGTGCCACGCAGTGCGCCGGACTGAGGCTTGCGCAGCACAATTTCGGTAATTTCAGCTTTACCGCGCATGATGGGAGTATCCAGTTTTACGGTCTTTTCAGTCAGCTTGTCGCTCATGTTCGTATCCTGTTAATGAAATACTGGCGCGGCTGCCCGCGCCGTTAAGGTTAATCAGAGGCCGAGGGCATTACGGTGTTCTTCCATCAGGTCCATGCCGCCAACGATTTCTACCATGTTGACCAGATCGACCTCATAGAGCACCTCACCATTAATGGTCAGCTTCGCGTAGCTGTTGGTACTGCTGACTTTGGTGCTGCTGCTCTCGCCGGTTTTCCACTCGCCGGAATCCACTTCTTTATGGCGCCCGCGCACAACCAGCTCAACGGCCTGCACTTCGCCGGTATCGTCACGCTGAATGGAACCGGTGAAACGCAGCTGGATGCCGTCAACGGTTGCCTTGCCCATCTGCTTGAATAACAGCAGCTCGGTACCGCCGATTGAAAATTCCGTGTCCAGTGCACCGTCATCCAGCCCCATGTCCACATCCACTGCGCCCGGCATACCGCCGCCGCGATACTTCTCAAACTTGCGGGTAAATTTCGGCAGGGTCAGAGACTCAACGATCCCCTGCCAGTTGTTCCCGTCGTTGAACAGGTTCAGGTGTTTTAACTTGCGTGGTAAAGCCATGATTCCCCCTTATGCAGCGA
CACGGCTGGCAAAATCGACCAGGTAACGATCGGTGATGCGCTGGCGCAGCATCAGGTTTTCAAGCGGAGGCACCGGCGTGTAGTCATAATCGATGGTCAGTTTCCCGGCTTTAAGGGCGTCTTTATCGTTAACAGACTCATCCAGCCAGCAGTCACCACCAATCAGGTATCCCTGGTTGACCAGACTGCGCATCTTGGCGCGTAGTCCTTCAATAATGTCGCGGGCCAGCGACGGATTAAGCACGCCATCCACCGCCCACATGTGCGCCTCCGCCATAGTGTCAGCCAGCACCTGCGCCGTACGGGTGTAGTTCTCAAAAGCAAACAGCGGATCGTCACTGAGACAACGGGAACCCCAGAAGCGGAAGCCGTCTTTGCGGATCAATGTGGTGACGTCATTTTTGTTCAGCAGTCCCGCATCGGTTGCCGGGTCCTGCAGATCCCAGAACACATCAGCGGAAATGCCGGTGACACCGTTCACACCCACATTGGACAGGGTTTTATGCCAGCCGATCTGCTCGTCGATTTTGGCACGCAGGCCGAGCGCACGGGCGGAGGCGTAAGCCGTCGCGTCTGCTTTCAGCACGGTGTCAAAGTTGATGAAGTCAGGCCAGATCAGCATTCCCTCGCGCTGACTGAAATTCTCGCGATAGGCAATAGCTTCCTCCACCGTTTTGCAGCCATTAGCAGCAAGGTAGGCAAACCCGCGCAAGCTTTGCGCCACGCCCAGCAGTTCAGTAGCAACGGCCTGAGTGTCATGTCCCGGCCCCCCAAGAATGCACGGCTTGACACCGAGCTGCGACTGCGCCGACAGTAGCGCTTTCATGCCCGTTTTCTTACCGTCGGAAGTTACGCCGCCGATAATATTGGAGGTGGTTTCCGCTTCGGTTTCGCCCTGCGCCACACGCACAACGACAGTCACGGGTTTTGCCTGATCTGCAATCGCGTCCAGCGAGCGGGCCAGCGTGCCGGACTCCCCCGCTTTACCGCTGGCGGTGAGCACATCAGTCAGCAGGACCGGCTTATTGAGGGGGAACACGGACGCATCAGCATCATCGCCGGTGCAGACCATGCCCACGATGGCAGTGCTCACCGTGGTAATAGGTCGGGTGCCCTCGTTGATTTCAACAACGCGCACCCCGTGGTGGTAATCCTGAGCCATAAGGCAGTCTCTCCGGTTGACAGGGATACCTTATGTTCTGGTTGCCAGGCGTGCGGCGCACGCATTTCACGATGTGTCAGTGCTGGTACAATATCGCCACTTTCAACGCGACTGATTTACAGGGAATTTCTTGTAAAGAGTGGAAATGCTAACATCAAAAAGTAATCCAACACGATGACGAGTTTCACCGGCGGCAAGCAACCGTCCGGCCTGCTCCCATTGCTCCGGAGTGAGCTTTGGACGCCTGCCACCGACTCGCCCTTGCGCCCTCGCAGCGGCAAGTCCGGCGCGGAAGTGGACGCTGCAGACTGCCCTGATGATTGCCATGGTGATGGCTCCTGGTGATACAGGGATGGTAAGATTTTACCCGTTTCGGTGGATTGTCAGGCTAAGGCCGAAACCACCCGAAATTACTTAACGATGCAGGTAATGCTATTAAGGACTGGCGCACAGAATTAACGTTGGGAATTATCAGTGATGAAAATAAAGCAGCTTTAATTCTGTGGATGAATTATATCAACGTTCTTAAATCGCTGGACTTAACTGGCGTTTCAGACGAGGCCACTTTCACAGCAATCAGGTGGCCTTCATTACCACGGGAGTGATTTACTGGCTATCAATATTCCTGCTCATCAGTTTCTTAGGCCATCTACACATTTATTTTGGTACGAGTTAAAATATTGCAAAAAATATCAAAGCTTATTATTTTTTCTTTAGGTAAATTTTCGCTCAAAAAACTTAATTGTTTATTCAATGATGATACAGCGTGAACTATGCTGGAAATGAAGGAAGTCAACAGTATGGATAATCTGAATATTCACGGGTGACATTATGAGAC
ATCGTATATTTTTCCCATTGCTTCTGGTGTTGTCGGCTACAGCCTTTTCGACATCGGCGATGGCTGCCAGTGATTCAAAACCTCCACCAGATAATACAAAACACTCTTCCAGTGGCTGGCCGCCAATGCCTGCCCCATATATTCGCCCACCATGGTGTGACAAATGGCCACAAGATATAGAGAAACCACTGGAGTGGTGTCAGATTTGTGGTTGTTAATTTTTTACAGAAAACTATAACAACCATATCAGGACTGATGGTATATCTGGTCATTAAGTTTCATCAGTTCTTACTGAACTGAATATTACTTCAGGCTGGCATGTTTTATTAACGTCAGCCTGATTTTTCACAACGTCATATTACTGGCTTACAGGTACATCAGGCCAGTCAGGATTTGTGGTATCTACCCGGTTTACCAACACCCTGTATTTTTTCCACTCGTCGAGCTGCACTTTCTCATCATCTGTTGCGATTTCAAGATCAACAGCATCCTGTAATGGCGCGATTTTCTCCGCTGCTGTTTGCAAAAGACGGCTTTTGGTTCCTTCAGCTTCACGAAGTCTGGCCGCTGTTTCAGCCACTTCGTCTTTTACCCACGCCTTACCATTCCATTTCTGATATTCACCATCAGGTGAAACTGATGTGACGTTTTCAGGTAGCGGACCGGGAGAGGAGATATAAACCTGATTGCCGGTTGTTGTGTCGTAAACCGTCTCGCCACGGTGGTCCTCATGCAGACTCCACATTTCGGTTTCAGCGTCAAATACAGCAATATGACTGGCGGGAATATCAGGGGGTGCAATATCCGTACAGTTTGCCGGTAATCCCGTGTGTGGCGGTATATACGCATCACCTGCCCCAATAAATTCATTCGTATCTGAACGCAGGTTATAAATCGTAATTGTCCGCGCTTTGCTGCTCATTTTAAAAGTCATTATGCAAGCCTCACTATGTAATTAAATGCAATGTTTTTAACCGTTGTTTCCTCATTACCGTCTGCGTCCACAATAACGACGTGGCCGTGTGGGCCGATATATACAGTGTGGTCATGCGGACCAATCCATGTGGTATGTGCGTGATCGCCATTCCAGCTTGTTAACTGGTCATTGCCATCATGCTGGACGCGAATTTTTCCACCGATTGAATCACCACCGTATGTGCCGCCCGCCGAATGGTTATGACCGCCTGTTGTATTAGAGCTCTTCGTTCCGTAATCAAATGACGACGTACCTTTCGTTCCCAGATCGGTATCCTGCGCTCTGGCGCTGTGACTGTGCGATTTGTTGCCATCCAGCTCCTGAGATAGCACAGCGCGCCCGCTCGCGGGCTTGCCTTTGATTGTCCAGCCGCGCATGTCCGGAATAGTGCCGGATGGATATGCTATTGCCAGTAAGGGATAAACGTTCTTATCAAATGCTTGCCCCTGCATCAGGGCGTAACCAGCAGGGATGGTGTCAGACGGCCACGGAAGGGGAACGCCGGGCGGACACGACATCAGGGGACGCCAGGTAAACCCGGAATCATTGGTTGCGTCCCAGCGCGCGCCAATCCAGGTGTTACCAACTGAGTCTGTTATCAGAACGGCGCAGCCGTCTACGCAATCGCGTTTGGCAATCACTTCCAGGAAAACATAATCGTTTACCGGCAATATTGTTGCCGGAGCTTTATTGCTGGCAAACCGGCGAGCACCAGACGGCAGAATGCTCAAATCATCGGCGGTACTTACCCACATAACGGGCTGAGCACCAAGACCGGTCTGGGTGTCGTCTATAGCTCTTTTGGCGGTGGAAATACGGATGTCGTCACCAGCGGCTACAGTGTCTGGTGTGGTGCCGACGTTGAGTGTAGCGCTGTTGCCAAGCTGGAGGGACTGACGGGCCAGCGCTTTATCTGGCACATCATTCAAGTTCTGGTCTTTTTGCATGGCGCCAGTGGCCAGATTTATCGTTTCGCCTAAACCGAGGTATGCAAGAAGACCAGCTACATCTTTTCCGCTTAAAT
TCGTCAGTGTATTGTCCAGCGGTTGCTTTCCAGCCAGAGCGTTTATCATCGTCGTGGCAAAGTTCGGGTCATTCCCCAGAGCCGCTGCCAGCTCGTTCAGCGTATCCAGTGCTGCAGGCGCAGAATCCACTATTGCCGCGATAGACGATGCCACAAATTCCGTGTTTGCAATCTGTTTAGTGCTGTTACCCGCCGCTGGCGTCGGTACCTTTGGAATCCCTGTGAGTGTAGGGCTGTCCTTCTGCGCATACTGTGAATGCGGGTCCGGCGCAGCAAGATGCTTTGCCATCAGGTCGTCTACATATACCTTCAGCTCCAGCGCCTTATCATCTACATATTTACGGGTTGCCAGCACTACTGCAGTGTCAATTTTCAGGGTGATGTTATCGGTGCTGCTGGTAATCAGTACCATGCGCACGGTCTGCGTACGTCCGCTCCCTTCTGTCAGCTGCGGCTTGTAGCTCTCAGGGCAGTTACCCACGGCGATCAGTGCGCCGGTTTCATCAAACAGGCCGACCTCACGAATCCACCATCCCCCCTCAGTTTCCGGGATCACTTGCTCAGCAATAATCTGGCTGCTGTTCTGCGGGTCGATATACAGCATATTCAGCGCTGCTCGACGCTTCTCAGCAACTAACGCGGTCTGTTGCGCGCTGGGTGTGGGCAGCACACCGCCACCGTCGCCCACCGCCATATGGGTAATTTTCAGCGGGACACCGAGCGCGGCGGCGCTTGCCAGTTTCGCCGCGCCGATCTCCGTCAGCAGGGTATAAAATTTTGCGCTCATGGATTCACTCTCATTGTGTCAATAACATGGACCGCCCCGCCCTCATAAGCGGTGCCGCCGGAAATAATGGTTTCGTTGATATACGGGTAGATCGTGATTTCTTCGCCGGTGTAGGTGGCTGCCCCCACAAAATACGGGCCACCTGTCTGCAGGTTGATGGACATACCAACCAGATGACGGCTGCATGGTTTGGCATCGCTTATCAGGCGCTCAAGCTCCAGATAGGTGTCTTCGGTGATACCATGATCCTGTACGCCAATATCCAGACGGAACGTTCCCGGCGTTTCGCCGGTCTGCCACCACTCAATGATGCGGATCAGGAAGCCGAACGGCTCCACCACGCGCCGCACGGCGCGGGTTGTCCCCTTGTGCTGATGGATATAAAAAGCGTCCTGCACAACGCGGCGCTTGACGCTTTCTGTCCAGCTCTCATCCCAGCGGTCAACAGAAAACGCCCAGGCCAGATAAGGCAGGAATCTGATCGGGCAGGTTGCCGGGTTCCACAAATCACGCAGCGATACCTGCAGATCGGAAATCCCGCTGCAGGTCTGCGCCAGTCGGCGCTCAAGCGGCGACGAACCCGGCGGCAACAGACTATTCATCCGTGCCCCCGTTGGTAACGCTCCATTCAGTACAGGATGCCGCCTGCGTCTTATCCAGCACCACATCCTCCAGAGGGGACGTTAGCTCCACACGCTGGACGCCCTCCACGTGCAGCGCGGCATAAATGGCGCTGCGGCGGATATCACGTCCCAGCCTCGTCTGACTGGCGATGTACTTCTGCAGGCTGGCTTTTGCCGCCGCCATAACAGGCTCCGCTTCCGGCCCCGGATAAAGAAAAATGGTAGCCTCCACCCGGTACGGTATGATCTCCGCACTACGAACCGTCAGACGGTCAGCCACCGGGCGTACACTCTCACTGTTCAGGGCTTTTTCAACCACATCCAGCAGATCTTTTACTGCTGTACCGTCACCCTCCCGGCTCAGTACGGTAAGTACCACCTCTGCAGGAGCCGGACTGGTTGCGCTGGCATCTGCCACACGTCCGTCCGCACTTCTGGCGTGAAATTCATAGGCTCCCGTCGGGCCAGCAACGGACAGTCCCTCAAATGCTGCAGGGATGCGCTGGCGCAGCGCCTCATCATCTTCCATCACTGCGGCGACCGGCGGTACTGCATCATTATCAGCAGGCACTACCGTCAGACGTTTCACG
TTGCAGTTGGCTGCCAGCTGCTCAAGATCATTTCCCATCGAATAGGCCACCATCACCGCCTGCGCAGCTTCGTTAATACGCTGGCGCAGCAGGATTTCGCGGTATGTGCTTTCCTGCAGCAGCTTGGTGACGGGTTCAGATTCCAGCGCCAGCGTGCGCCGCACCGCGTCCTGTTCATCCACAGGATAAAGAGCCACAAAAGCGGCCTTGCGCTCAGCCAGCAGCGTCTCAAAATCCGGCACGTCCACTATCTGCGGCGGCGGTAACCGGGAAAGGTCAATGACTGCCATTGTCTGCTCCTGTTGATATGGAAAGGGAAACCGGTGCTCCGTTATTACTATGTCCCGTAAGCTCAACCACCATAGAGCCGTCAAAATTGCCGTTGATGGTGATGGAGTCCAGCGTAAGGCGCGGCTCCCAGCGGTTCAGCGCCACATAGACTGCAGACATAATCTGCAGGCGCAGTGCCGGGTTCTGCGGCTGGTCAATCAGCGCGGACAGCAGCGAACCGTATTCCCGTCGGGCAATGCGGCTACCCTGCGGTGTCAGCAGAATATCCCGCACCGACTGGCGCAGATGGTCTGTATCCGCAAGGGCCTGCCCGTCATTCCGGCTCATACCGATATACGGCGTCATACCGGACCTCCCGATGTATCCCCGCCTGACTTAACGCCAGTGTGACCGTGTTTATCCATCACGACCCCGTTAGAACTCATTGCGCCGCCGCCCTGGGTGACGCCGCCGTTGATCACCACCTCGCTGTTAATGCGTGTTGTGTCAGCCTCCACCACAAACTCACCGGTTTTGAGGGTGATATTGTCCGCCGCCTCGATCACCATGGATTTGATACCCCGGACATGCCACCGCCCGGTGGCGGGTTCGTACTCAAACCAGCCCCCGTCCGGGTACTCCGTCACGCAGCCGTCCACAGAATCCGACGGCGGCGCAAACTGATTGGAGTAGATGGCGGGCAGCACAAAAGCGGTTTCCAGATTGCCGCCCATGCTCAGCACCACCACCTGCTCATCCGGCGACGGGCACCACCATGTACGGGCACTACCGGCACGCAGCGTCAGCCAGTTAATCCAGTTAGTTTCAAGCTCACCCACTTTCACCCGGCACAGCCAGTTTTCCCTGTCCACTTCGGTCACGGTGCCGGTGCGGATCAGGTTGGTGATAAGGCGCATGATTTCTGTGAGTTGTGCGTTCATTTCATAAGATTCTCACATTAGAATGAGTTACAAACTTGTGGTCCTTTGTATGGTCTACAGCACAAAAAGGAAGCCTTATGCTTACTGTCGGAATATATGGATTCAATATCACCAAAGTGACTCATTTCTCTTTTGGCACTATGTTTCCGACGTGTAAATCCATCTCAGAAATAATAAAAAAAATGAAATCCCGCGATGAATTACACCTTACAGCATTTCTCGAACTAGATATAAATGACGCCAACGAATGCCGAGATATACTATTTCATCTAACAGCAATATTATCCTTCATTGAACAGCGCCCCGTATCATTTGGCTACTCATTAAGAAAGCATGAAAGCATGGGCAATCTTGATGACGACTACCCTAAACTCATTAACATAGCGTATAGCATTAAGAGCACGGGAATAATAATCAAAGAAGATTATTATTCAAAAAACTCCAGAAGATATTTTATAGAGGCTGCATTAAATAAAATAATCATTGAAAAAGATCGTCACTACTCCACCTTACTTCATAAGAACGTACAGGTCTTTTCCACCCCTCAAAGATACATTGATGTATCATACTACCTTTTATTTTCAGGCTTAGAGTCCATAGCTCGCCAGCGAGAGAATGACCTTAGTAATAACGCACCGTCAGTATTGTACAAATATCTTTCGAAATTTAAATTTGATATAAAACAGCAAGACAACAAAAGACCACCGCGTTCATTGGACATTTATAGTGGTTTAAGGAATGCACTATTCCACAATGGGGAATATCAAACTGCCCCCATGAAAAGAAATG
GTACTGAATGCACATTCCTTCTTAAGGATTACTACTCCTATTTCAGGCGTCTAAACAGCCTTGTAATTTTAAAAGAGGCTAATTTTGAAGACGGTAAAATTAATTGGGATTTTGTGAACTACAGACACTATTTTAAATAAGCTAGGCTATCAACCACCGTAACAGCGTGTCGCATGTCAACGCTGCCACCTCATCGTTGATGCCCAGCAGGTGGCGCTCTGCGTAACGGACCTCCGGGCCTTTACGGCTGACGCGATCGCGCAGGCCGTAATGGTGAACACGGGCAATACGCTGCACCTTGCCATCAAACTGCACGCTGGCGGAGTCCGCACTGGCTGCGGTTTTCAGGTATTTAGTGGTGCGCAGTTTTGCAAACATCTGGCGCTTGATGCGCCCCTTCTTGCTGCGGGCCGTCACCCGGCGTGGCTCATAGCCGCTGCCGTCGGGATTACGCTGCAACCTGATGTTCTGCTGTTGCGACCTGCGCAGCTGCTGCGCCAGCTGCCGCATCATACGACTGCGCGCGGTAGGCTCCAGATTTGCCAGCAGTGCCATCAGCCAGTCATCCACCCTCTGCAACTCATCCACGTTTCACCGTCCACATTTCTTCGGGTTCGTCCGGCTCCGTCACCGCTTCAACGCTCGACACGCTACCGTCAGTGTTGACCAGCACGCGCTCTGTAAGCTGCAGGTTCAGGCTGATATCGCACACATCGTTGCGCAGAATATCCACGTCAAAGGTGAACAGTTTTTCACGCAGCTCCGGGTTGTTGATGGCGTCCGGCTGGCTGGCACTGAGCCACAGCAGCACGGGAGCCATCAGCAGATTCTGGTCGCCGCTGAAATCCTCGATCACCACGTTCAGGGTGTAGCGGTACTCCCATGACATGGAGCTGGCCCCGGTTGCCACCAGTGAGCCGTTATCAACGAAAAGGTGCAGCTTGTCCGGGTTATTGCGGACATAGGGCACAGCTTTATTCAGGGCGTTGCGTAAGGACTGCGGCTTGTTCACTGTCTCGCTCCTGACACGCAATGATCGTGTCCACTTTGTCAGCACAGACCGCCCAAGCGGCCTCAGTTTCATCCAGCACCTGGTTCAGATCGCCGTTACTGCGCGGCGCTGACCTGTCCAAGCGGCACTGCGTCACTCTGGGACAGCCACTCACGGTAAGCTGCACCTCCGGCGAGGGCCGGACGCTCCCGCAGCCGGATAATGTCAGCAGGCAAAGGAGCGTCAGCCCAGCGGCGCAAATCCTCGTTTTCACGTTTCAGTTCCTCGATCCGGCGTTGTCGTTGTCTCAGCTGTGCGCTGGTCTGTTCTGCTTCGGCATAGAGCCGCGCCTGCTCCCGGTTGTTGGTTTCAGCCAGAATGGACAGACCGATCAGCTGGCTATTTTTCTTCGTCAGTTCCTGCGCTTTGCTTTTCAGCGCCGCGAGCTGCGTTTCGATGGTGTGGCTGGCACTGTTAAGCCGCCACGACTGCCAGCCCAGCGCAACGAATGCCAGCGCCACCACTACTGCCAGCGCACGCGTCATAGTCCAGCTCCTTTAAGGCACCAGGCCATCTCCCGCGCACGGCGGTTATCCAGCCCCTGATTAAATACACCTTTCACATAAACCCAGCGCGGCAGCTGAAGGCAGGCATCCGCCCAGCGCCGCTGGTTCAGCAACTTAACCAGCGTGGAGCTGCAGGCGTTGCCGGTGCCCACGTTGAAAGCAAACGACACCACCGCGTCATAGACCTTTTGCGGCATCGGCTGCACCACACATTTATCCAGTGCTCGCTCCACGCGCAGCACGTTGGTGATAAGTCCCTGCGCCGCCTGCCGTTCCGTGATGGTTTTTCCAGGCACCACACCGGATGTATTGCCGATCCCGTCAGTCCAGACGCCCGCGCTGCACTGATAAGGCTGCAGGCGGCATCCCTCGTAATCGGCGATCAGTTTCAGCCCCTCAACGGAGGTATGAAGCGACTGAAATCCGGGCAGCGTGGCTACGAT
AGCCAGCACCGCCCCGACAAGGCAGCGCTTAACGATTGAAGGATTCATATTCCCCCCGCGAAATCTTGCCGCCACGTAACAATTTGAAAGACTGGTGTTTGTAGTACCAGTTGATAGCCAGCATCAGCACACCAATCAGTACGCCACCAACCGTTGACGCATCCTTGAGCGACAGATCGCCCAGCCATGCCAGCAGCACGGCAATGCAGTAAGTGATAAAGGCGCTGATTCGTTCAAGCGTCATAATTCAGTCCCATAGCTGGACGGTCTGCGCCGTGGTTGACGCCGTAATGTCCGGCAGTTCCACCTGCAGCCCGTGCGGTAAAAATGGGCCGTACTCAGCCAGCCCCGGATTTGCCTGCAGAACCTGCTCAGTGACACCCTGCGTGCGCCCGTAATGACGCCAGCAAAGCGCGTCCACCGTGTCATACTGATGCGCACGCACTTTCATCAGATAAGCTCCACCGTACAGTGCGGTGTATCCTGCACCCGGCTGATGGCCCAGCGGGCATCACGCCACAGATCGCCGCTGGCCTCCGCCAGCTCCTCCCCTCGCTTCACGCCTGACACCGTGGCGTCATAGTCCTGATAACGCTCATTAAGCACAGCGCGCGCCCAGCAAAAAACAGCGTTGTGGTAGTGCCGGATGCGCTCGCTTTTACCGTCCAGCATTTCTGCGGGAACCTCAGCAAGTGTCCGCCAGCCCTGCATCTGCTGACGGTTGCGGAAGTCGTACAGCTCAGCGTTAACCTCAGAGATCGCCGTCAGCACGACCTGCTTTAAACGCTGCTGCGTCACCGTGCCGTCAGTGCGCATCACACTGCGAAATTCCGACAGGTCCACATCAGGCCAGAACGGCGTATTTTTGATGACCTCCGCCTGTTCCGGTGCCTGTTCGGGCGCAACAAACTTCATGCGGCTTTCTCCTGAATAAGTGGGCGGTGGACGGAATTTTGATGTAGCAGTGCCTTTCGCCATCCCGTGCCGCCCGTGCGCGGGGCACGTTCGTTAGCGGCTGTCATTGCGCAGTCTGCGCTCCAGCTGCTGCTTTTCTTTTTTCACACCGCAGCGGGGATCAAGCTGCAGCGCATGGTTAAGGTGATTCAGGGCAGACGCCGGGTTGCTTTCGCTCAGTACAGCACCGATGGCTTTATGCAGGCGCGCCCGCGACTGGTCCGGCATATCCAGATCGGTGGTCAGGTCCAGCGTCTGCAGAAGCAGATCGGCATCAAAACCGGCAGCGGCAAGCAGAGCGCTTTGCGCCGCGTCTGCCATTTCTTCTGCCAGCACGGTCTGCACGTTACGGTTGCCCAGTGGCATCACCCAGCCATGGCGCAGCGCATGGCGCCCGATTTCGAGCGCACCGGCATAATCACCGGCGTCGATACGCCACAGCATCACGTACATCAGCACGTCATCCTGCTGCGCACCTCCGGCAGCCAGCACGCCCTCCGCCCAGGCGGAATATTTCGGCAGCAGTTCCACCTTGATTTCCGCCTTTTTCACCGTGGACTGGACGCCCTTGAGGCGGCGGCGGTCTTCTGCCAGCTGCAGCAGCATCAGGTCATAGCCCGACGCATGGCGAACACTGCCGCCCTCACGGGCGGCCTGTTCAGCCTGAATGCGCAGGCGGTGCTGCCGTGCGGGACTCAGGCTCATGCGTTATTCCCCACTTTCCGGTGCGGCAGGCGCGCTGAAATCACCGATTTCGATGTTTTCTACCAGCGCCGCGCAGCGGTAGTCCTCGACCACATACACCTCGTTGACGGACTCAAAGTTTTCAATCCGGTCACGTTTCGGGTTGTCGATAACAGAACGGCGGCGGGTATCTTCCTGCCAGTAGATGGACAGGTTATCCAGACGGGTGATCAGCAGGGCATTTGCCGGGAAATAAGGCGCACGCACGGCCTGCAGGCCGCCCATACGTTTCTGGCTGATGATCAGATCGGCGGCAATTTTCTCGCTGTTGTCCTGCTCTTTGTTGACCAGCGGGAAATACT
TGTCAGACAGCAGTTCACGTCCGCAGACAACAACCAGATCGTCATCATCCTGATAAACCGCGTCGATCAGCTCGTTGACGGCATCCATCACCACGGCGTCCAGGTTGGCATAGTCGCCGCCCTTGCCCACTTTGACCGCGCCTGCAGTCGTTGCACCGTCTTTTGTGGTGCTGCCCATAACATGATCCGGCGCGTCTTCGCGGATTTTCTGTAACCAGCCTTTATTGACGTCCTGCAGCAGCGGGTTTTCAGCACGATTTGAGGTTTTGGCGCGCTTCACGCCGTTAAAGCCGATCATGATGCGGTCCAGCGCCTGACGCTTGACGATGGCGTTGCGGATACGCACCTGGAAGTCCTGGAACTTGGCCCACAGGTCCAGTTTTGCGTAGGTCAGCACCGTATCAAAGTTGGTCTGTTCGCATTTGTATTCCACGTCTTCCATCAGCGTCGGATCGGTAGGCTCGCGCTCTTTGGTGGTGGTATCGGTGGTTCCGGCAATGGTGCTGCCAACGCCCAGCCCCAGCAACTGTCCTGACTGTTCAGTGACCGGCGTGATGTTAATCAGCGTCAGGAAAGCGGCGGACTGCTGGATCTGGTCTTCCAGCGTCTGCTGCACGGACGGCTCCACGGTAAACTTGCTGGAGAGTTCTTCAACCTCCACACTGTTCAGGCGCGCCAACTGCTGCAGGTAAGCGTTAAAGGCAAAGCGGGTTTTCTTTTTCATCGGGTTTTATGCTCCATCAGCAATTGGTCAGGGTGCCTGCCGGTGCGTCACCGCCCGGCGCGCGCTGGCGGTAGTCTTTACGGCTGTCTTCGCTGCTAAGCTTCTGCTCTAGCTCGGCAAAGGCGGCCAGCTGTTCCTGCAGGGAGGACTCCAGCTCAGAAAGGCGTTTGTCCTGTTCGGACAGGGATTTATCAGTGCGCTCGCTCAGGTTCTGCTGCTCGGAGGCGACCAGTTCCACGGCTTTATGCACATCGGAGAAACGCGCCTCATCGGTCTGCTCTTTTTTGGTGAACAGCGCGGTGACGCGGGCAAAGAGGGACGGCTTTTCGTCCTGGACCTCTTCCAGTTCGATCAGCGTTTCGACAGCTTCCGAAAACAGGTTTTCAGGGTTCTGCTTACGGTTTGCCAGCGGATTATGTGCGGCGCTGGCGCTGAAAGCCAGCATTTCGGTGCCAAGGCTCGCAGGATCGTCCGTCGCACCCAGCCCCACAAGGTAGGCTTTGCCGGTGTCGGCAAACTTCGTGCTGACCTCCATGGAGGTGAAAAGCTTCTGGCCTTTTTTCACCAGTTCCACCAGGGCGTCAGTGGGTTCGATATCGGCATAAAGTGCCATCTTGCCCGCCAGCGGGCCGTCCTGGATTTCTTCTGCAACCAGCCCCGTCACCCTGCCATAGCGGTTAAAAGTGCTCTCCGGCAGATAAGACTTGATGTGCTCAAGGTTAATCAGCGCGGTATAGACCGTCGGGTTATAGCTGGCAGCCATCTGTACCAGCCATTCACGCTGGATCTCGCGCCCGTCAGTGGTGGCACCTTCCACCCCGATGCGGAAACGCTTTGCTTTCACTGTCATGAGCCGTGCTCCGTTAGAAATAACTTACTGGAGCCTTATGTTTGCGGTGATAGGGGGAGTGAGACAACGCGCTGTATTTGTACGGTAAACCACACAAACCGCAGCCGGGGAAAGCCGCCATCCAAGGCCGTATGTTTGGGCCATGAACACGACACTGCCCCCCGCAGACCTCGATCCCCGTAGGCAGGCCATGCTGCTGTACTTTCAGGGATACCGCGTAGCCCGCATTGCTGAAATGCTGGGCGAAAAAGTTGCAACCGTTCACAGCTGGAAAAAACGCGACAAGTGGGGCGACTATGGGCCGCTGGATCAGATGCAGCTCACCACCGCCGCACGCTACTGCCAGCTCATTATGAAGGAGCATAAAGAAGGGAAAGATTTCAAAGAAATTGACCTGCTGGCGCGCCAGTCGGAGCGCCACGCGCG
GATCGGCAAGTTTAACAATGGTGGCAACGAAGCCGACTTAAACCCTAACGTCGCCAACCGCAACAAAGGCCCACGCCGTCAGCCGGAAAAGAATGTTTTCACTGATGAACAGATCGAAAAGCTGGAAGAAGTCTTCCACGCCTCTATGTTCGACTATCAGCGTCACTGGTTTGAAGCCGGGAAAACAAACCGCATCCGCAACCTGCTCAAGTCGCGCCAGATTGGCGCCACGTTTTATTTTGCCCGTGAAGCATTGATTGACGCCCTGCTGACCGGACGCAACCAGATTTTCCTTTCTGCCAGTAAGGCACAGGCGCACGTCTTTAAGCAGTACATCATCGACTTTGCAAAAGAGGTGGATGTTGAGCTGAAAGGCGATCCCATGGTGTTACCCAATGGGGCCGCTTTGTACTTTCTCGGCACCAACGCCCGCACGGCGCAGAGCTATCACGGTAATCTGTACCTGGATGAATATTTCTGGATACCAAAATTCCAGGAACTGCGCAAAGTGGCTTCCGGTATGGCCATTCACAAAAAATGGCGACAAACCTATTTTTCCACGCCCTCCAGCCTGACCCACAGTGCTTATCCGTTCTGGTCCGGTGCGCTGTTTAACCGGGGCCGTGCCAAAGCGGACAAGGTGGATATTAACCTGACCCACAGCAACCTTGCGCGCGGCCTGCTCTGCCCTGACGGGCAGTACCGCCAGATCGTCACCGTGGAGGATGCGGTGCGCGGCGGCTGTAACCTGTTCGACCTCGACCAGCTGCGCATGGAGTACAGCCCGGACGAATACCAGAACCTGCTGATGTGTGAGTTCGTGGACGATCTCGCGTCCGTGTTCCCGCTCAGCGAGCTGCAGGCGTGCATGGTGGACAGCTGGGAAGTCTGGACCGATTTTCAGGCGCTGGCGCTGCGCCCGTTTGGCTGGCGCGAAGTGTGGATCGGTTATGACCCGGCAAAAGGTACGCAGAACGGTGACAGCGCAGGCTGCGTGGTTATGGCACCGCCAACTGTACCTGGCGGGAAGTTCCGAATTCTGGAGCGTCATCAGTGGCGCGGGATGGACTTTCGCGCCCAGGCTGATGCTATCAAAAAACTGACGCAGCAGTACAACGTGACCTATATCGGCATCGACTCGACCGGCGTCGGGCACGGTGTTTATGAGAACGTAAAAGCGTTCTTTCCTGCCGTGCGGGAGTTTGTCTACAACCCTAACGTCAAAAATGCCCTGGTGCTCAAGGCGTACGACATTATCAGCCACCGCCGTCTGGAGTTTGACGCTGGGCACACCGACATTGCGCAGTCCTTTATGGCTATCCGCCGCGCCACCACCGCCAGCGGCAACCGTCCAACCTATGAAGCCAGCCGCAGCGAAGAGGCCAGCCACGCAGATTTGGCCTGGGCAACGATGCATGCACTGTTTAACGAACCGCTGCAGGGCGAATCCGCCAATACCAGCAACATTGTGGAGATTTTTTGATGAGTGAGCTCGAAGCCTTAACCAGCACAACGCCAACAGAAGATATGGCGCCTAAAAACGCAGACGTAACTGCCGAGGCTTTCAGCTTTGGTGATCCAATCCCGGTGCTGGACCGCCGCGAGCTGCTGGACTATGTGGAATGCGTGCAGATGGACCGCTGGTATGAGCCGCCGGTCAGCTTTGACGGGCTGGCGCGAACCTACCGTGCCGCAGTGCATCACAGCTCTCCCATTGCGGTAAAACGCAACATTCTGACCAGCACCTTTATCCCGCATCCACTTCTTAGCCAGCAGGCGTTCAGCCGGTTTGTGCAGGACTATCTGGTATTCGGTAATGCCTATCTGGAGAAGCGCACCAACCGACTCGGCGGCATTCTGTCGCTGGAGCCATCACTGGCGAAATACACCCGCCGCGGGATCGATTTAGACACCTACTGGTTTGTGCAATACGGCCTAACCACGCCCCCCTACGAGTTCACCAAAGGCAGCATCTTTCACCTGATGGAGCCGGATTTAAA
CCAGGAGATTTACGGTCTGCCGGAATATCTGTCAGCTATCCCTTCCGCCCTGCTGAATGAGTCCGCAACACTGTTCCGCCGGAAGTACTACATTAACGGTAGCCACGCAGGCTTCATCATGTACATGACTGACGCCGCGCAGAACCAGGAGGACGTGAACAACATCCGCCAGGCAATGAAAAGCGCCAAAGGGCCGGGCAACTTCCGCAACCTGTTTATGTACTCGCCCAACGGTAAAAAGGACGGTATCCAAATCATTCCACTTTCGGAGGTTGCAGCTAAGGATGAGTTTTTGAACATCAAGAACGTGAGCCGCGATGACATGATGGCAGCACACCGCGTTCCGCCGCAGATGATGGGGATTATGCCGAGTAATGTTGGGGGGTTTGGGGATGTGGAGAAGGCAGCGAAAGTATTTGTACGTAATGAACTATTGCCTTTACAGAAGCGTCTAACTGAACTTAATTCTTGGCTAAACGATGAAGTAATCAAATTTGAAGCATATTCATATGATATAACCTAGATAACATAAAGGTTGAATTTAAAGACCATCTTATTCATATTTAAAACATAATAATTAAATCTCTCGCTAAGCCTGAAAATTCAAGCTTAGCGAACTCTGCACTTTACAAACCTTTTTCCATACCCTGAATATAAGCAGCAATCTTGGTTTTAGTCTTTGCGTCTTTGAAGTACCTATTAAATGAAAATGAAGCATCATCACGCTGGGCTTTTTCGACCATTGCACTTATATATTTTATCGCAGCAGGAACAAACGTAATTGCTTTTTGATAATTCAATCTATCAACGCCTTTAGCATCACAGATTTGTCCTACCGCAAATAACACATGATAAGCACCATCAATTAAAAACATATGCGCAGAATTGAATTTCTCCTCTTTCCTGATAGAGGATTGCAATAATTTCTTCTTATTTTCAATAACTGAAAGTACTTTGATAGAAGCCAGAAGCTCATCAGCCATTAATTCATCTGTAAATACAGTTTCATAAAGATCTGAGAAAATCCTCCCCCTATCCTTTTTAGCTACTTCAGGTAAATCTAGCGAATAAGCTAAATGAGCTTGTCCGGCACTCAAAGCATCAACTCTGACACTTTTAGGCTGGTTCGAATGCTGCCCATCCTTTCTATCATAAAAAAGCCCCATGCCCTCAAAAGCTTCCTCTAATTTTTTCTGAATATCATCATTTGAACGAAGGTCTCGACTTTTAATTGGAGTTTGACTGTTCGTTGACTCAGCAATAGCCAGACTAACTGGTTGAGATTTAGTTTCAATTATCCTTACTAATATTAAAACATCTTCCAGTCTTTCTTCGGAATTTAAGCTAGCTTCAAATAATGCATTAGATGTCTGCCCTCCATTCACGATTTGGATATTTTTTAATTCTACCAATGGAGCTCTTTTCCCTTTTATATAAGAAAAAGAGTCACAAGTTACAGTAATTCCATTATTCAAATACCAAAATAATGGGCTGCGATCTGATAATGCAGTTTCAATTATGCGTCTGTTTATTTTATTTGTTCGACTTAAATAAACTCTTACATTATCATTAAAAATTTCTTTCCTAACCTCCTTTGGATTTTCAGGGTTTGTAATTATCCTTACAATTTCAGAAGCTTCAACAGTACATATTAACCCTCTAATACTGCCATCGGTACGGTCAAAATAGTCCTTATCCACTATCTGTAACTGCTCATCAATAACGCTATTCTTTCTTTCAACAAAATAATTAACAATAGTGTCCAAACTATGATGGTGAACATTGAAATATTTATATTTACTTAATGACGCATTGGCTCTTTCTTTTTCTCCGTTCTGCATTTCCATTGTGTTACCACAAAAGTGAACTTCAATGGAAGGATTACTTTTCTCAAGTGCAGCCCAAATTTCTTTAATTTTATTCCATAATATCGGATTGCAAGTTTTCTCGAGAGATTTATTTAAATCCAACAAGTCATCAAAGAATGACACTAACTTATC
GATTTCATTGCTGGGAAAATTCTTTTTAGTATTCTCAAAGGTATCAGCATATTTAAACTGAAATATATGGATAGAATTCCTGCCATCACGGTCATCGACATAAACAGCATCAACACCTCTATCCATTGAACCATCAGTTATGGCATCTTCAGCTTCTTCATCAGATACATTTAGTAACGTTGCCACCATCAAAATTGGAAAAGCTTTTTGTGGTTTATCGATTCCATTCTCAGGATCCAAGTATGCTTGTACTTTGTGATGAAGTGTATTCCAATCTAACAAATTAGCCATCGAATCACTCTTACTTTTGGGATAATAGAAGTTAAAGAAATTTTACTGATTTTATACCCTGTGATTCGTATGGAGCAACTGAAACAATATTTGTTTATACAATTATACATATCAGCGCGCGCTCGTATCCCCGCCACGCCTGCCCGCTTTATGTAATGGTTTTCATGCACCTGCATGATCTACGCAAAAGCCCGCCAGAACTGGCGGGCCTTAACACAAAAGATCCTCAAACGATCATGCGATCTCATGCAGCATAGACATGCGCGTTTATGCAGAATGTGCAAAATCGTAACATAGTCCGTAAGCGTGAAACCTAGAACGTGACAGCCTTGTCAAAGCCAGAAATAATTGTATAAGAAATAGACGAGTTATCAGCCTTGTTCACTTTGAACTTGGCACCTTTGTAAGCGATAACATCACTTCCCTTAGAATCTACAGAAAAATCTGTTGTAAATGCTGCACGAGCCATATCGTTTGCAAATTCACGATAGGTGAACTTCATTACACCGCCTGCATTTCCATTGTATTCGATAGTCTTAACCAATGAGTTACTCACTCGACACAGCCCATCAGGAACATGTTTGATAGAAATTTCTGATGCAGTATAAGAAGTGCCATTTGGCGGTGATATCTCATTTTTTGCAGCATCGTAACTAACATAATCAACATAGTTACCGATTTGCCCATAGAGATTTTTTAACGCAACAGCTTGAGGGTTATGATAATTGCGGTAAATTCCATTCCCCTCACTGCAATATGTACCAGCAGCGATAGAAGACAATGCACCATTAGCCGCACCAAGTTCTAGTACGTCCGTTTTAAATCCAGTAGCAGATGTGATAATGGGATCGCCCATGTAGGCGGTAGCACTTTGCCCAATAGCAGGCTTCACCACTTCAATAGTAGTGATATTTCGGTTAGAAGCATGTGGCACGCAACCAGTTAGGATTACAGCAAGAGATATTGTTAACGCTACATTATTAATTTTCATTTTTAGCCTATTATTCTTTTCTTGACGAAAAACAAGGCGATATCTGATTGACATCGCCTCTCACTCATATGTAACCCTTTTTGATTAGTAAAAACAAGCGTCTATTGACAAAATCAATGTAGCCAGCTGTCGTCTTCCCACACCTTCTGCATAATTTTCATCACTTGTTTTCTTTCTTCATCCAGTTGCAATCCGGTCAGTTCCACACCGTTAGAGCTACCTTTGCAGATACGAATTACCGTTTTGGGATACAGGGGGCGCAGATTGCGGTAAAGCTCGGATTCAAGGGCGTCCAGAGTAGACTGGCTAATCTTCTGCTCTTTATCGATCATTATTTCAATGCGCATAAAAGTCACCTCAATTGATGACATCCATTGAGCGGTTGTATTCGTGGGTTCTGATTTTTGCCATGAGTTCATCTGTCAGTTCAGAAACCCACTGCAGGGCCAGCCCCTTCTCTTCATCACTACACTCACTAGCTGCTACAAGCTTAAGAAAAAAATCAATGCGCTGGAGCTTCAAAGACTCCAAAAAATAGTCCTGCATCTTTCCTCCTATGACACCACAAGAAATACTGTATATATAACCACTGTTTATATTTACAGTATATAATAATCTTACTGATGTAAAACGTTTTTTACGTTCATCAGCCTGATATGCTTGGTATTATTAAGAGCACGAATTGTTAACCCGCGGAATTAATACAGGTTCCGCCACT
TATCATCTTCCTTCAGACGCTGGTTCCGATAGAAGATACGCAGGCCTTCTCCTGATGGAATACTGCCACCGCGAAGGAGCAAATCGACTTCTTTCTCGCTGCCATCAAATCCTCTGGACTTCAGTTCATAGACGAGCTGCTGATGCTGATGATCTGTAATTCGCTGTTTGTAGTCTTTACGCCTTTTCGGTTTAACCAGGCGCAACCTTGCTGCCAGCTCCCGGCGATCTTGTTTGCTCATACTGTGCAGGTAATCGTGCAATTCCTTGTCATCCATGCGGGTAATGTCCGTTCTGGTGCCCCCATCAGCTGATTTATCTTTCTCCTGTTGGTTCAAATTTTCAGCAAGGGGACAGTTATTGCCACGAGTCCAAGGGGCGCAAGCGCCCTGGTCGGCTGCCGCCTCCTGAACGTCAACGGCCTTACGAACCATTTTCCACTTCACTGCATGAGTGCAGATCTTACCCTCTGCAATGGGTGACCAGATGCCATAAATACGAATGCCGTGATCGCCATAGGCGGTCGGCTCTTCGTTGATTTCATAAGCGGTTCTGATAAGGTGATATTTACGGGGAACCAGTACGCCGCCCTGCTTCATGATGTAGGTGGCAAAACAACCAGCATCAGCTGCAGCCAGAATGGCATCAAGGCGCGGGTTATCCAGTACCGGCGCACCTGCTTTTTTGTCACCCTGTTGCCTTGCCGCCTGACCAGCCAGCAATCGCAGTTCACGGTAAGCCTGACGCCCCGGAATGCCAAAGAAGCGGAATTGCTGAACACGATGCAGAGACGCCCAGGCATTAACGTATTCAGCGTTATCACGCAGAGATTTACCCGTTTCCTTGCTGATCTCGCCAGCCAGACCACGCCCGTCAATGTTCTTACTGATGTATTTCGCGATGTAGCTAGTCGGCGTTCCTTTGCGCGGGTTTATCAGCTCAGACTTAAAGCGTGGCCCCGTGTTATTACCCAGCTCCTCGCGGTCTTCACGGATAGCAAACTTACGCAACAATGCAGTAATGGCGCGGCGGTCTTTTTTGCGCATGAAACACAACAGGTGCCAGTGAACTGTGCCGTCATGATGCGGCTCAGCCACCCGCACGCCATACCAGCGCAATCCGGCTTTGTGCATCGCCTTACGAAATGCAGCAAACATGCCGACCAGATAATCACTGCTTTGTCTTACCGTCGCATTTGTCCAGGTCGGGTTGGGCCTGCCGTTATTTAGCGTGGAATGGAAACGTGACGGACAGGTGATGGTGTAGAAAACGGCGCAGTCACCGCGCATTTCCGCGATAAGCTCCAGACCTTTAACACAGGCCATCATCTCATTGCGGCGATGCGCAGGGTTGCTGCTGCTGGCGTTTACCACATCCTCCATGTCCAGCGTGTCGCCGTCTTCGTTCACCAGTTCATGAGAACGGAAAAACTCCAGCGACTTACGGCGCTGCTCACGTTTATGCATCACGACTTCATAGCTGACATAGGGAGATGCTTTTTTGCTGACCAGGCAAACAGCACGCAACTGCTCTTCCCGCCATTCGCAACGCATCTTCCATAATTTCCGGTACCACCAATCGGCGCACAACATACGCGCCAGCGAACCCGGAATGAGTTCATAGGGCACGGGTTTACGGCGGTTTCTTTTCCGGCGGAGTTGCTCAAACGCAGGCGGTATGACATCCAGTCGCAGGGTTTCTGCTGCCACCTTTTCCCATGTCTTGCGGATTTCTTCTGGCTTAACATCATCGGTGGCATACAAATCACCACAAGCGGCATCAAGACACATACTCATATGCGCAGCGACAAGAGTAGACAGGCGTTTCACCTGATCCTGACTCATTTCAGGCAAGATCAGCAGGCCGTCCAGCCCTTCATGGCTTGCCATAAAGCGAAAAGAAGTGGATAGCTGACTGTCGCGTACATGCTCCAGTCGTTCCAGACATGGCTTAATCGTCTCACGCAAATAGCGGGAATAAGCCTTTGGCCTGCCCAGGCTG
CTGAAGTATTCAATACGTTGCATCAGCGGCTTGCTGATATGGGAAGGCTGGGCGTTAACGTCCGCCAGTATGACCATGTCCGGATTAAAACGCTGCTGCTCATGCGCCAGCTTTGCCCGGCTAATGAGCTTATCCTGCTCCATTTCGCGCTGGACAGGATCACGGGATTCATTAAAGAAATAACGCTCCCAGACCTGATCACTCAGTGCCTCGCGGCGCAGTTGTTCCTGCTCGTTATCGGCAGCGTACAGAGTGATCAGGTTTGAAAGCGCAGAAACCGGCGCAACTTCCGCCGGGTCCAGATAAGGGTTAATAGCCTTTTTCGGGCTGTTCCATGAGAATGCTGCGGCGGCCTCGTTAAAGCCGCTGCAGTTGTTCATATCAGCATGGCTCATGCACGCACTCCGTACACGGCAGAACTATCCACGCCACGCGAAGGCTCAAATCCCACCCAGCAGCGTGCCCCAGAAACAGCGATGATTTCTGTTGCAGATTTACTCTCACCAGCTGCTACGCCGATGCTGCGTTTTGCCTTGATGTAGTGGTGAGTAAAATTGCGATACAGCGAACGGATCAGGGATGTGTCACTGTTAGAAACAATGACCGGATGTCCTTCTGATGACCGATGTTCAAGAACGGATGCCAGGTGATACTGGTCATCTTCAGTGAAACCATCAGTGTGATAGCCGGAAAACGTACCGTCATACGGCGGATCGCAATACACCACATCCCCCGCCTTCAACATCGCCAGCGTTTCATCAAAGCTGGCGCAGATAAACGTTGCTCGCTGGGCCTTTTCTGCAAATGCGCGAATTTCTTTTTCAGGGAAATACGGATTTTTATAATTACCGTAGGGAATGTTGAAATGCCCGCTCTTGTTATAGCGACATAAACCACGGTAACCGTGACGATTGAGATACAGGAAATATACCGCTTTCATGAAATCAGTAATTTCAGTTGAGTAATTAAACTCCTGCCTTATGTTGTAATAAGCCACCTCCCTGTTTGCGATCTCAAATAAAACTCTGGCGCGAGATATAAACGATTCACAATCAGCAGCAACCTTTTTATAGAGGTTGATTAAATCAGGATTAATATCCGCAACAAGATAACTGGGGTAATCCGTCTCCATCATCACAGCACAGGAACCCGCGAAAGGTTCAACCAGTCGCGGGCCAGCAGGAAGATGTTTTTTCAGTTCGGACATAATGGTAGTTTTATTACCCGCCCATTTCAGGATGGTGCTCATACAGCACCTCCGTTGTAATGTTTGCCTTTCAGCTCTGCGATTTCTTGGCAAGTAATGCAAAGCTGCACACCCGGAATGGCGCGGCGTCGTGCTGGCGGAATTGGTGCTTCACACTCAATGCAAAGCACGCGGGACACGCCCGGCGTTTTGGCACGGGCAGCACGGATATGACGCTGACGTTCTTCTTCAACGCGCTGTTGTACGAGATCCATTGCATCAGCCATTAGTGGATCTCCTGCGCTTCGTTCTGGATTGCTTCAGCAGTCACACGCAGCAGCTCTGCCGCTTCGACGTGGTTTAGCTGGCGGGATGTGATATGACACGCCAGGCTATCAAGGCGAGCTGCCATTGCTTCAGCCCTTGCCCGGCGTTCTTCCAGACGAGCCTCTGTCAGTAAAATATTAAGCCCTGCGTCATCCGGTCCGGCTTTGGTCGTGAGGGTTTCAATATTACGCATAATCAATTCTCCTGAATTTAGATAAAGGGATGCCCGGCGGGTTTACGCCATTAATTTCATTAGTTGGTTAATTCGGCATGGTTAGCCGTCTTGGAAATAAGCTCACCACTGCACGAAAATGATTCATTGCTTTAATCAACTCCCGCTTTTCATCAGTGGTCAGCTCATTAATGCTGATGCTATGACGTTCAGCTGGAATTTTTGCCATAAAGAATATGGCAGCCAGTGCCCGTTTATTTTGTTCATTATTGATATCCCGTGGATTACGCATATCTTTAATAAACCGCTCAAGCT
CTGACTCAATATTAAGGCCAAAAACTTTCGCCCTTAATTCCGCAATGTGATTAAGTCCATTCAGGCGTTCGCCGGGGCTTAATGGAACAGTCGCCGCAGCGCCTTCAATAGCCATTTGTTCCCCCGTTTTTTCGTTGATAGTTCTGCCAGCAATTCATCTTGTGAACGGCACGGATGCCAGCGTTTACCATCCTTACCCATTATCCAGCCGTGACCGTAGTGCATTGCCGGACTTTGTTTTACCAGCAGCGATGCAAATGATGGTTCTTTCGTCAGCATAAGCACCTCACAGCAAACCGAATGAAGCACCGAGGCCAGTCACGGTATCAACTGCACTCGCCATCGCAGGGTTAGCCTGTAAACGGGCCTGCAATGAAACAGCAGCCAGCGCCATCAGTCGTGTTACAGAGTTAATGCTGCTGATAGCATCACGACGACCTGCACTGGTTTTTACATCGCCAGATACCGCACCTGCAGCAACACGCCCGATCTCTGCGGTTGCACTCATGACGTAATGTGGCAGTTTCTCTTTTGCCACCTCATTAATCGGTACACATGGCAGGCAGTGAATCTGAGCCAGAAAACCATCTACCAGCGTTGAATCTTCAGTCAGATCGGTAAGCAACCAGATTTCTGGTGCAGTTAATAAATGAGGTTGAGCTGGGTTCAGCTTGTTCCGCAGAATCTGCACATTCATGCCTGCACGTTCTGCCAGTTGCACCAGGTTGTGGCGCAGTGCAAAAGCCCTACAGGCTTCATCGAAATGCGGATGTTTGGAAATCTTGTAATCAAACATGGTGCCCCCCTTAGAAAGTTCCCATAATTGAACTTACTTACCAACAATGACTCGGAAGTTGGAATGACCGAGGGATTCACGGACCTGATCGGTTTTGTACATCAGATAACGCAGGCTTACGCGGCCTTTGTTTTTTTCTTTCTTGACCATGTACTTAGCAAGTTGACCATGGTGAATTTTTTGGTAAACAGAGCCGCGGGAAATACCCTCCCACTCTGCGAACTCTGCAGGCGTAGCCATCTCTTTTGGTACACGAATTGAAATATCAGTACTCATAGTGCAGTATCTCTTACTTTGTGTGCGTGTTAGTTCGTTTTAGCCCGTCTCTTAAACTCTCACATCAAGAGACATGAAGACATTACGATCTTGATTCAAGATTGTCAAATGGAGATCACCAATGTTAAACATCAGAATGGGTTCCGATACGGGAGGTAAGGCAGCTATTGAAAGGCTGCTTGAGGCTTATGGATTCACAACTAAGCAGGCATTAAGTGAGCACCTGAATGTCTCAAAAAGCACTATGGCAAACAGAGTGTTACGTGACAGCTTTCCTGCTGACTGGATAATTCAGTGCGCACTAGAAACCGGTGTTTCGTTGCTTTGGTTAGCTACAGGACAGGGAAGCATGAAAGGAGGAGCTGAGCCTGAGAAAAGTTCTCATAATGAGAACAAACAAGCAATTAAACCGTTATCCAAACTCATAACTCCAGCTATTCCTAAAGGAACCCTGGAGAATGGACAACTCAGTATTGATGAAGAGATTTTCCTAGACCACAGCATATTACCTGCAGATTATGAAGAATCGATGTTCTTAGAAACCCCTACTGATTGTTATCTCATCGATAAATCAATTAAACAGGTCAGCAATGGATTCTGGCTTATCAATATTGATGGAATGATTATTGTTGCAAAAATCATGCGGATTCCCGGCAATAAGATTGTAGTAAATCAAGATGAAGCGTCTTTCGAGTGCTCTACTGATGATGTGGAAGTTATTGGGCGTGCAGTCAAAGTAATAAAGAGTATCTAAACATGACTGTCAGAAAACAGCCAAACGGTAAATGGTTGTGCGAGTGCTATCCCAATGGACGCAATGGCAAGCGCGTGCGTAAGCAATTTGCTACGAAAGGCGAAGCCATTGCTTTTGAAAGCTTCACAATGGAAGAAGTGAACAAAAAACCATGGCTGGGGGAAAAGGAAGATC
GGCGACACCTATCAGAATTAATTGAGCAGTGGTATTCCCTGTATGGTCAAACACTCGCAGACCCCAAGCGCCTCATGGCGAAACTTAGAATTATCTGTAATGGTCTAGGCGATCCCATCGCCTCAGAACTAACAGCCGGTGACTTTACGAAATACCGCGAAGCACGGTTAAAAGGTGAAGTACGAAATGAAGATGGCACGCTTATGTCGCCCGTTAAGCCCCGCACGGTAAACCTTGAACAGCGCAATCTATCATCGGTGTTCGGTACATTAAAAAAACTAGGACACTGGTCAGCACCAAACCCGCTGGCAGGACTTCCGACCTTCAAAATTACCGAAGGTGAGCTGGCTTTTCTTTCCGTGGACGAAATCAAGCGCCTGTTGGCTGCATGTGCTGAATCTCAAAGCCCGAGCCTACTAATGATTGCAAAAATATGCTTAGCTACCGGCGCACGGTGGAGTGAAGCCGAAAACCTTCAGGGCCACCAAATATCGAAATACCGAATTACTTATACAAAGACAAAGGGCAAGAAAAACAGAACAGTACCAATATCTCAAGATCTGTATCACGAACTCCCCAAAAACAGAGGGAAGTTATTCACGCCATGCAGAAAATCTTTTGAGCGTGCAGTAAAAAGAGCAGGTATTGACTTGCCAGAGGGCCAGTGCACGCATGTATTGCGCCATACATTCGCTAGTCACTTTATGATGAATGGCGGAAATATTCTTGTATTGAGAGATATTTTAGGGCACTCAGATATAAAAATGACAATGGTCTATGCCCATTTCGCACCAGAACATCTTGAAGATGCTGTTACTAAAAACCCCTTATTTAACTTAAAGTGATAAATAAAAATGCATATTCAACAAGAACTCGATGAAGAACTTAATAATCTTTTTGACACTATTAGAAAAAAATCAAGTATTCGACCACCAATTGAGATTGAAAAAAACCTTACTTTGATAGATGACTTCGCTCTAAAATGCAGTAAATTCCGGGGTTGTTTAGTAGATTACATCCAGGAAAATGATAACAGGTTAAGTTTACGCTTGCGCAATAGACTTAGAGCTGTAGATATCATGCAGAAAGAAATCGTCTCGTGTTTAGAGTGTTTTTTATCAGGGGATATTAAGTCGGCATATGACTCATTTGAAAGTATGCTAGAGCCACGAACTATATCTCGTCATATTGAAAATATATGCATACCTCTTTCTGACTTATGTAATGAAGATAAACCATTATTCCGCGTTAGAAAATCTGATACGCCACTTACATCAAGAAGAGATATGTTTCATATTCCGTTCAGTCAGCGTCACTTTGTTAGAGCACAGAGATTTTCAGTTGCTGGTCTACCCTGTTTATATTTAGGAACATCTCTTTATATATGCTGGAGAGAAATGGATAAGCCAGATTTTGATAAGCTATATATATCTGCCTACAAGATCGATAAAAATAATGACTCAAAGGTACTAAATATAGGACCTGATTTTTTATATAAACAGAGATCTATATTAGAGTCAAAAAGAAAGAACAAATATGATTTCAATACCAAACTCTCATATTTAGCACTTTGGCCTTTGATAATTGCATGCAATTATTTAAAAAAATATGACAATGCTTCCTTTGTACAAGAATATATTATCCCCAATCTTTTGATGCAATGGATCAGTCGGAACAGCAATGAGAATGTTGTTGGTATAGCCTACCGCTCAACAAAATTACCTGCTAATGCCTTAGGTAGCAGAGGAATAAATGTGGTACTTCCTCCAAAAGTGCGTTACGAGGAGATGGCCAATAATGAATTTTGTCCAAATCTAGCGAAAATTTTCAAATTCACATTGCCTGTATCTTGGCAGGTCCTAAAAACAGTTGAGTACGTGCCTGAATCAGTTGCACAATCCGATCGAGAGAATCTCAGCAGAAGGCTACGAAGAAGAAAAAATCGTGAGCTAACAGGAAGCATAGATGATGAAATTTTGAACATCTATAATTTAACTGAC
TTTTATAAACTCGAAACTTGTATGGATGAAATTCAAGTATATGCCCATATTAAACCATGATAGTAATGGCGACATTTTGGCGGCAGAGCATTAAAAGCCTATAAAACGGACAAACACCAAATAACATTAACAATATGTTTTCAAAAGAAATTTACTTTTTTTGTTATAATAAAAATGGTATGTAGGAATTTCGGACGCGGGTTCAACTCCCGCCAGCTCCACCAAATATAACAAGGGGTTACGTGAAAACGTAGCCCCTTTTTAATGTCCAGTGTCCACTTAGCGTCCACCTAAGATCAGGTGACCAGATAGGGATAAAAAATGCCAACACCTGAAAGTTTGTGGAAAATTCTGGCTTCACCCATTTCGACAAAGACGGCTGTTAGGTATGGATTCCTCTCTGTAGCAATTGTCCTTTCTATCCGCTTTATACTCCCAATACTTAAGGACATTGCGCCTCTAACTGAACAAGACTTGCCGGGGTATTCCTATGCAGCGCATTTTGCTCTAACGATTTTGTTTTCACTTATTGCGGAAACTCTAACCTTTGAACTAGTAGCTTGGATGTATCTCAAGCTCGAAAAATTCATAGAAAGGAGGAAATTAAGAAAAGAACTCGCCCGCAAACAAGAGATTGAGAGCAAGGAAATCAAGAGAATCCAAGAAAAGTTTATTGAAAATTTCATACTGGCATGGCCCCTCATAGCCCCCCGCTATAAATATTTTGTCTATGAGCTACGAAGGGGCCACAAGGGCTACTCAGATACGAATTCGGCCATCAATTACCTTAGACAACAAGGTTGGATTCACCCAATAACTGAACTACCTTATAGCCATTGCCTTTATCGGTTAGACGAACTCATTTTTGAAGCCCTAAATAGGATAGAGTCAGCCGCAACAGCAACAGACTCCTAGCTTTGCATAGCTGCCCGCTATGCAAACTAGTGAGCCAAATAAATTTAGACTTCAGGCAAAGCCTGCTGGCTGAATCGCCACGGATAATCTAGACATTTCCGAGCCGTTGATAAGATTGTTTTTCATATTCCTTCCACGTCAGCCCTCTTAATGTGGTGGATTTCATGCACCTGTAGCAATCGTAGCGATCCGGATCAATACTTGCTAGGACGATGAAACACTCCACGGGAGCTCATGCAAACCCGTGCGCGGATTTGCGTTTCTGGCTAGCCAAAACCCCACCAGCACTTTCAACACGGCTCCATCCTCAATGGAAATCAAATCTTATCCTTATCAACCCGTTAAGAATGTGGTTAAATCATAACCTTACGTCTAATATCCATTCATGAAGGGTAAACATGGAATCGAACGATCCTGGAAAGCTGATTTGGCATGTCGCCTGCGATGAATCAGGGATAGATGGTCAGCGATTTTATGGATTTGGTAGCTTATGGATGAAATACCAGAGGCGTGGCGATTTTGCGCGTATTGTCCGCGAACTTCGTGAGAAACATAACTGTAGCGATGAAATTAAATGGCAGAAAGCTCATTCCAAGCGTAACGCTGCATTCTATCAGGATTTAATCGAGACCTTCTTCAAACACCCTTGGCTAGCTTTTCATTGCATAGTCGTGGAAAAGTCGAAGGTAGAAAAATCTTTCCACGGTGGCGACTATGATCTTGCAATGCGTAAGCATTTTGGCAAGCTAATAGAGACTAAAATTGGCAATGTCATCAAAGCCCACCCCGACAGAGAGTGTGAGTTTCGTGTTGAAGTAGATCCATTACCATCCCGCTACAAAAAAGCTGATGAGGAATTCCATGTAATCACCAACCACACTTTGGCACGTAGGTTTGGGCGCAAAGATATTATCAAAAGCGTAGTAAGTAAAGACTCAAAGGCGTCAGAACATATTCAGATAGCAGATTTTTTGCTAGGTGCGGTCATGTGCGCTTATCAAGGTAAAGCAACTTCTGAGGCAAAGCTAGCCGTCGCAAACAATGTTGCATCCTACTTGGGATGGGACTCACTGATGCATGATACT
TGGCCAACAGAACGTAAATTCAATATTTGGTTTTTCTTCGACAGGTCAAAAGGACCAAGGGATATTGTCACCCAAGAAGTTAAGCTCACTTACGCCCTTCCCAACACACGAAAATAGTAGATCCGACCTCTCAGCCGGCACGGTTGGAGTCCCAGTCATTCGACGAAGTTACCAACTAGGCGGTATCAACTTTCGGGGGGCCGCCTCTAGATCCCCAAAATCTTAAACTCGTTATAAGTAGCATACCCCGTAACCAATATGACTTCAACAAATCCTTAAGAAAATCATAATTTAACAAGATACAACTTATCTATCTTATCTATCTCAACCTAAAAACTATTGACCTATAAATGTGCCCTAATTTTAGCTATCTAAGTAGAGTTGTGAAACGTCCTCTCTTGGCCTTGCCACAAGACGGACTAAAAGGAAGCGACAACACTTTAACTGCCATTCAGGACAGTTAGTCAACACGCAAAACCCCGCCTCTCAATAATCAACTTAAAAATCAAAAAATATACAAAACCCCTTCCATATCAACACCATCGCCACTCTTTTGCGCCTTCGCGCAAAAAATAAAAATCCCCAACAAACTCAACCACCTCCCCCACCAATTACAAGGACGCCAGCAAATAAAACAAGGGGCAACGTGAAAACGTAGCCCCTTTTTGTTGTCTATGGAAAACCCCCAGCTAGGCTGGGGGTTCCGGAAAGCTTTCAGCTTTAAGCCAGTTATTAAAACCCCTTTTGATTTGTTAAAACATCTTGCGGTCTGGCAACTGCAAAAGTTCAACAAGAAATCAAAAGGGGGTCCCAATGGGGGACGAAAAGAGCTTAGCGCACACCCGATGGAACTGTAAATATCACATAGTTTTCGCGCCCAAATACCGAAGACAAGCTGTAAATAGACCCGTTTTAGTTCCATACATTTTTTGAGTTCCCGGCCAAATAATGGTGCTGTCGGTATTCCTCCGGTGTCATATTGTTCAGCGATTCATACGGACGTTCACAGTTATATTCTGATAACCATTTTTCCGTGATTTCACGTACTTCATTCAGCGTTCTGAACAGATAAAAATCGAGTATTTCTGTGCGATATGTCCTGTTAAAACGCTCAATGAAAGCATTTTGTGTCGGCTTACCCGGCTGGATAAACACCAGTTTTACGGCATGTTTCTCTGCCCATTCAGTCAGTGCCAGAGAGATAAATTCTGGGCCGTTATCCATGCGAAGCATGGCCGGATAGCCACGGTTTGCCGCGATCCTGTCAAGCACCCGGACCACTCGTGGGGCTGGCAGATTCCGATCTATTTCAATCGACAACGCCTCACGGTTAAAGTCATCAACGACATTGAACGTGCGAAAACGACGCCCACAGACCAGGGCATCATGCATAAAATCAACAGACCAGCTCTGGTTCAGCGCTTCCGGCGTGGCCAGTGGCGAAGGGTTACGCACCGGCAGCCGTTGTTTGCCCTTACGGCGAAAATTCAGCTTCAGCAGACAATAAATACGGTGGATCCTTTTGTGATTCCACATGTATCCCTGCCGCCGCAGAACCTGAAAAAGCTTCGGAAAACCGTATCGTGGATACCGTTCAGCTGCCGCCTGCAGTGCGGTAATAACGGGTTCGTCACGTGTGTTATCCGGACGGTAATGATAAACCGTTCTGCTCAGGTTCAGGCTCCGGCAGGCCTGACGGATACTGAGTCCGAATGCCGTTATCAGATGAGTGACCAGCTCACGCTTAAAGGCTGGTTTTAAAGCTTTTTTTCGATAACGTCTTTCAGCGCCCGGTTCTCAAGGCTCAGGTCGGCAAACATCTGTTTGAGGCGTCGGTTCTCGTCCTCAAGATCTTTTATCTTTTTAATATCAGAAGGTTCCATGCCGCCGTATCTGGACTTCCAGTTATAGTAGGTGGCCTCAGAGATACCGGCCTCCCGGCAGACATCTTTAACGGTTCGTCCAGCTTCAACCGACTTAATCACAGTGATGATCTGATG
CTCAGTAAAACGGGCTTTACGCAT
Protein sequences of DBSCAN-SWA_8 >NZ_CP007598|4304485:4404431|4308355_4308739_-|WP_000627811.1|DBSCAN-SWA MITGIQITKAANDDLLNSFWLLDSEKGEARCIVAKSGFAEDEVVAVSKLGEIEYREIPMEVKPEVRVEGGQHLNVNVLRRETLEDAVKHPEKYPQLTIRVSGYAVRFNSLTPEQQRDVIARTFTESL >NZ_CP007598|4304485:4404431|4309862_4310900_-|WP_000997368.1|tRNA|DBSCAN-SWA MNDELKNKSGKVKVMYVRSDDDSDKRTHNPRTGKGGGRPAKSRTDGGRRPARDERNNQSRDRKHETSPWRTVSRAPGDETPEKVDHGGISGKSFIDPEVLRRQRAEETRVYGENACQALFQSRPDAIVRAWFIQSVTPRFKEALRWMAANRKAYHVVDEAELAKASGTEHHGGVCFLIKKRNGTTVKQWVKQAADQDCVLALEDVANPHNLGGMMRSCAHFGVKGVVVQDAALLESGAAIRTAEGGAEHVQPITGESIVDVLDDFRQAGYTVVTTSSDRGQALFSTTLPEKMVLVLGREYDYLPEAAREPDDLCVKINGTGNVESLNVSVATGVLLAEWWRQNKA >NZ_CP007598|4304485:4404431|4362811_4364992_+|WP_000196151.1|DBSCAN-SWA MTRAAPDVEEVLSERALSQWAQAISYVAGHYRVACSPGSIQANAPWFRGKSRTTALTQLARQAGLSFHAPDIDKTAFSQWRLPLVVELRDGQLLVIEHVNGEDAVDVFVIEEEGQRNRLTLSELLPEILYVAALRPLSALKDSRVDRYISRFKPDWMRELVLQDIRPYLPVMVAAFLINVLSLAGIVFSMQVYDRVIPAQSYPTLYVLSFGVLVAVLFGFLLREARTHIMDVLGKRADMRISDRVFGHALRLRNSAIPRSTGSFISQLRELEQIREMITSSTLATIVDLPFFFLFMIVLAIIAPPLAWIAPVAALLMILPGVALQKKLAVLANQAAHEATLRNAVLVESVQGLEDIKLMQAENRFLQQWNSYIRITGESGLRTRKLTQGLISWGMSVQSLVYAAVIMFGAPMVIEGSMTTGAVVAASMLGSRMIAPMANLCGVLARWQQVKAAKMGLDNIMQLPTETQHDDSLIHRDILHGHYLFENAQFRYHNDDQRIPLRLVRLEIMPGERIAILGRNGAGKSTLLQAMAGGLEMIQGDARLDNLSLSHIDMADLRRNIGFLSQNARLFFGTLRENLTLGAPHANDEQIFDALEVSGGAVFVRRLAKGLDHPIMEGGNGLSGGQRQSLLLARMLLRSPNIVLLDEPSASLDEHTEREFIQRLHQWLGNRTLVVATHRVPILELVERVVVLKEGQLVMDAPKAQALNADRMQSHRREWKNENQSA >NZ_CP007598|4304485:4404431|4398235_4399252_+|WP_000218402.1|integrase|DBSCAN-SWA MTVRKQPNGKWLCECYPNGRNGKRVRKQFATKGEAIAFESFTMEEVNKKPWLGEKEDRRHLSELIEQWYSLYGQTLADPKRLMAKLRIICNGLGDPIASELTAGDFTKYREARLKGEVRNEDGTLMSPVKPRTVNLEQRNLSSVFGTLKKLGHWSAPNPLAGLPTFKITEGELAFLSVDEIKRLLAACAESQSPSLLMIAKICLATGARWSEAENLQGHQISKYRITYTKTKGKKNRTVPISQDLYHELPKNRGKLFTPCRKSFERAVKRAGIDLPEGQCTHVLRHTFASHFMMNGGNILVLRDILGHSDIKMTMVYAHFAPEHLEDAVTKNPLFNLK >NZ_CP007598|4304485:4404431|4341013_4341805_+|WP_001537507.1|DBSCAN-SWA 
MPVFALLALVAYSFSLALIVPGLLQKNSGWRRMAILSAVIALVCHAVALESRILPGGDSGQNLSLLNVGSLVSLMICTVMTIVASRNRGWLLLPIVYAFALINLAFATFMPNEYITHLEATPGMMVHIGLSLFSYATLIIAALYALQLAWIDYQLKNKKLAFSNEMPPLMSIERKMFHITQIGVVLLTLTLCTGLFYMHNLFSTENIDKAVLSIVAWFVYIVLLWGHYHEGWRGRRVVWFNVAGAGILTLAYFGSRILQQFVS >NZ_CP007598|4304485:4404431|4366714_4366933_-|WP_000980500.1|DBSCAN-SWA MMNCPECGHSAHTRSSFQVSATTKERYNQCQNINCGCTFVTHETFVRHIIKPNLISSAPPHPGKGGQGHMSF >NZ_CP007598|4304485:4404431|4397236_4397479_-|WP_000102105.1|DBSCAN-SWA MSTDISIRVPKEMATPAEFAEWEGISRGSVYQKIHHGQLAKYMVKKEKNKGRVSLRYLMYKTDQVRESLGHSNFRVIVGK >NZ_CP007598|4304485:4404431|4395879_4396113_-|WP_001244219.1|DBSCAN-SWA MRNIETLTTKAGPDDAGLNILLTEARLEERRARAEAMAARLDSLACHITSRQLNHVEAAELLRVTAEAIQNEAQEIH >NZ_CP007598|4304485:4404431|4361405_4362815_+|WP_000533863.1|DBSCAN-SWA MGRVAPVAIVLAFALFHHQPRGAEAPPMITSEGLATDQMLPSLDGSAAELPLSAAAPGNLTLNDAVNRAVNWHPSIREAVGKLLAQNEQIEVAKSKYYPQVSAGVNNGYSNTYTDHGYSPSLVLSVSQMLYDFGKVASQVRAETAGAAQQQANVLLSIDTVAHETANAIVQTQSWQQMVDAAEEQLVALDSIGKLIRQRSDEGATSLSDVVQTEARIESARSQLAQYQANLDSAKASLMSWLGWNSLNGINNDFPAKLARSCETATPDDRLVPAVLAAWAQANVARANLDYASAQMTPTISLEPSVQHYLNDKYPSHEVLDKTQYSTWVKVEMPLYQGGGLTARRNAASHAVDAAQSTIQRTRLDVRQKLMEARSQAMSLASALQILRRQQQLSERTRELYQQQYLNLGSRPLLDVLNAEQEVYQARFAELQTESQLHQLQLNCLYNTGALRQAFALNHRSIQSVEIQP >NZ_CP007598|4304485:4404431|4389033_4390698_-|WP_001284991.1|DBSCAN-SWA MANLLDWNTLHHKVQAYLDPENGIDKPQKAFPILMVATLLNVSDEEAEDAITDGSMDRGVDAVYVDDRDGRNSIHIFQFKYADTFENTKKNFPSNEIDKLVSFFDDLLDLNKSLEKTCNPILWNKIKEIWAALEKSNPSIEVHFCGNTMEMQNGEKERANASLSKYKYFNVHHHSLDTIVNYFVERKNSVIDEQLQIVDKDYFDRTDGSIRGLICTVEASEIVRIITNPENPKEVRKEIFNDNVRVYLSRTNKINRRIIETALSDRSPLFWYLNNGITVTCDSFSYIKGKRAPLVELKNIQIVNGGQTSNALFEASLNSEERLEDVLILVRIIETKSQPVSLAIAESTNSQTPIKSRDLRSNDDIQKKLEEAFEGMGLFYDRKDGQHSNQPKSVRVDALSAGQAHLAYSLDLPEVAKKDRGRIFSDLYETVFTDELMADELLASIKVLSVIENKKKLLQSSIRKEEKFNSAHMFLIDGAYHVLFAVGQICDAKGVDRLNYQKAITFVPAAIKYISAMVEKAQRDDASFSFNRYFKDAKTKTKIAAYIQGMEKGL >NZ_CP007598|4304485:4404431|4377190_4377796_-|WP_001086804.1|tail|DBSCAN-SWA 
MNSLLPPGSSPLERRLAQTCSGISDLQVSLRDLWNPATCPIRFLPYLAWAFSVDRWDESWTESVKRRVVQDAFYIHQHKGTTRAVRRVVEPFGFLIRIIEWWQTGETPGTFRLDIGVQDHGITEDTYLELERLISDAKPCSRHLVGMSINLQTGGPYFVGAATYTGEEITIYPYINETIISGGTAYEGGAVHVIDTMRVNP >NZ_CP007598|4304485:4404431|4395652_4395880_-|WP_000752613.1|DBSCAN-SWA MADAMDLVQQRVEEERQRHIRAARAKTPGVSRVLCIECEAPIPPARRRAIPGVQLCITCQEIAELKGKHYNGGAV >NZ_CP007598|4304485:4404431|4307706_4308294_+|WP_000188410.1|DBSCAN-SWA MTPMLLSAFWTYTLITALTPGPNNILALSAATAHGFRQSIRVLAGMSLGFLVVMLLCAGIAFSLAVIDPAIIHLLSWVGAAYILWLAWKIATSPAADEKVRPKPVGFWVSFGLQFVNVKIILYGITALSTFVLPQTQALNWVIGVSILLALIGTFGNVCWALAGHLFQRAFRHYGRQLNIILALLLVYCAVRIFY >NZ_CP007598|4304485:4404431|4371380_4371500_-|WP_000763316.1|tail|DBSCAN-SWA MADIATIFHWSPSITDVMPLTEVLAWRHKAIQRSGASDE >NZ_CP007598|4304485:4404431|4364999_4366163_+|WP_012543392.1|DBSCAN-SWA MDDPDIQRERAFSGAGRIVLICSLLFLILGIWAWFGRLDEVSTGNGKVIPSSREQVLQSLDGGILAQLTVREGDRVQANQIVARLDPTRLASNVGESAAKYRASLASSARLTAEVNDLPLAFPAELNGWPDLIAAETRLYKSRRAQLADTEAELRDALASVNKELAITQRLEKSGAASHVEVLRLQRQKSDLGLKITDLRSQYYVQAREALSKANAEVDMLSAILKGREDSVTRLTIRSPVRGIVKNIQVTTIGGVIPPNGEMMEIVPVDDRLLIETRLSPRDIAFIHPGQRALVKITAYDYAIYGGLDGVVETISPDTIQDKVKPEIFYYRVFIRTHQDYLQNKSGRRFSIVPGMIATVDIKTGEKTIVDYLIKPFNRAKEALRER >NZ_CP007598|4304485:4404431|4386123_4387890_+|WP_001098395.1|terminase|DBSCAN-SWA MNTTLPPADLDPRRQAMLLYFQGYRVARIAEMLGEKVATVHSWKKRDKWGDYGPLDQMQLTTAARYCQLIMKEHKEGKDFKEIDLLARQSERHARIGKFNNGGNEADLNPNVANRNKGPRRQPEKNVFTDEQIEKLEEVFHASMFDYQRHWFEAGKTNRIRNLLKSRQIGATFYFAREALIDALLTGRNQIFLSASKAQAHVFKQYIIDFAKEVDVELKGDPMVLPNGAALYFLGTNARTAQSYHGNLYLDEYFWIPKFQELRKVASGMAIHKKWRQTYFSTPSSLTHSAYPFWSGALFNRGRAKADKVDINLTHSNLARGLLCPDGQYRQIVTVEDAVRGGCNLFDLDQLRMEYSPDEYQNLLMCEFVDDLASVFPLSELQACMVDSWEVWTDFQALALRPFGWREVWIGYDPAKGTQNGDSAGCVVMAPPTVPGGKFRILERHQWRGMDFRAQADAIKKLTQQYNVTYIGIDSTGVGHGVYENVKAFFPAVREFVYNPNVKNALVLKAYDIISHRRLEFDAGHTDIAQSFMAIRRATTASGNRPTYEASRSEEASHADLAWATMHALFNEPLQGESANTSNIVEIF >NZ_CP007598|4304485:4404431|4348769_4349252_+|WP_001518569.1|DBSCAN-SWA 
MTKKKAHKPGSATIALNKRARHEYFIEEEFEAGLALQGWEVKSLRAGKANIGDSYVILKDGEAWLFGANFTPMAVASTHVVCDPTRTRKLLLNQRELDSLYGRINREGYTVVALSLYWKNAWCKVKIGVAKGKKQHDKRSDLKEREWQLDKARIMKNAGR >NZ_CP007598|4304485:4404431|4391011_4391689_-|WP_001673609.1|DBSCAN-SWA MKINNVALTISLAVILTGCVPHASNRNITTIEVVKPAIGQSATAYMGDPIITSATGFKTDVLELGAANGALSSIAAGTYCSEGNGIYRNYHNPQAVALKNLYGQIGNYVDYVSYDAAKNEISPPNGTSYTASEISIKHVPDGLCRVSNSLVKTIEYNGNAGGVMKFTYREFANDMARAAFTTDFSVDSKGSDVIAYKGAKFKVNKADNSSISYTIISGFDKAVTF >NZ_CP007598|4304485:4404431|4367001_4368102_-|WP_001102269.1|DBSCAN-SWA MNVNSDLLNLNSKSPAFSIVIEGKDVTTVLDTRLMSLTLTDNRGFEADQLDLELDDADGLIALPRRGAVIQLALGWKGQPLFPKGAFTVDEIEHSGAPDRLTIRARSADFRETLNTRREKSWHQTTVGDVVKDIAARHNLKVALGKDLTDKALDHMDQTNESDASFLMKLARQYGAIASVKDGNLLFIRQGQGRTASGKPLPVITIERKAGDGHRFTLADRGAYTGVIASWLHTREPRKKETTKVKRRRKKTTKPKEPEAKQGDYLVGTDENVLVLNRTYANRSNAERAAKMQWERLQRGVASFSLQLAEGRADLYTEMPVKVTGFKQPIDDAEWTITTLMHSVSPDNGFTTSMELEVKIDDLEIE >NZ_CP007598|4304485:4404431|4341824_4343111_+|WP_127172650.1|DBSCAN-SWA MEHISTTTLIIILIIMVVISAYFSGSETGMMTLNRYRLRHMAKQGNRSAKRVEKLLRKPDRLISLVLIGNNLVNILASALGTIVGMRLYGDAGVAIATGVLTFVVLVFAEVLPKTIAALYPEKVAYPSSFLLAPLQILMMPLVWLLNTITRLLMRLMGIKTDIVVSGSLSKEELRTIVHESRSQISRRNQDMLLSVLDLEKVSVDDIMVPRNEIIGIDINDDWKSIERQLTHSPHGRIVLYRDSLDDAISMLRVREAWRLMAEKKEFTKEMMLRAADEIYYVPEGTPLSTQLIKFQRNKKKVGLVVNEYGDIQGLVTVEDILEEIVGDFTTSMSPTLAEEVTPQNDGSVIIDGTANVREINKAFNWHLPEDDARTVNGVILEALEEIPVAGTRVRIEQYDIDILDVQENMIKQVKVVPVKPLRESVAE >NZ_CP007598|4304485:4404431|4382435_4382651_-|WP_000171565.1|DBSCAN-SWA MTLERISAFITYCIAVLLAWLGDLSLKDASTVGGVLIGVLMLAINWYYKHQSFKLLRGGKISRGEYESFNR >NZ_CP007598|4304485:4404431|4347359_4347698_+|WP_001203445.1|DBSCAN-SWA MRCKTLTAAAAVLLMLTAGCSTLERVVYRPDINQGNYLTPTDVAKVRVGMTQQQVAYALGTPMMTDPFGTNTWFYVFRQQPGHENVTQQTLTLTFNSSGVLTNIDNKPALTK >NZ_CP007598|4304485:4404431|4332689_4333811_-|WP_000225188.1|DBSCAN-SWA 
MVAELTALRDQIDDVDKALLNLLAKRLELVAKVGEVKSRFGLPIYVPEREASMLASRRAEAEAIGVPPDLIEDVLRRVMRESYSSENDKGFKTLCPSLRPVVIVGGGGQMGRLFEKMLTLSGYQVRILEQQDWPRARDIVADAGMVIVSVPIHVTEQVIAQLPPLPSDCILVDLASVKSDPLQAMLAAHDGPVLGLHPMFGPDSGSLAKQVVVWCDGRQPEAYQWFLEQIQVWGARLHRISAVEHDQNMAFIQALRHFATFAYGLHLAEENVQLEQLLALSSPIYRLELAMVGRLFAQDPQLYADIIMSSERNLALIKRYYKRFGDAIGLLEQGDKQAFIDSFRKVEHWFGDYARRFQNESRVLLRQANDSRP >NZ_CP007598|4304485:4404431|4380987_4381419_-|WP_001039958.1|tail|DBSCAN-SWA MNKPQSLRNALNKAVPYVRNNPDKLHLFVDNGSLVATGASSMSWEYRYTLNVVIEDFSGDQNLLMAPVLLWLSASQPDAINNPELREKLFTFDVDILRNDVCDISLNLQLTERVLVNTDGSVSSVEAVTEPDEPEEMWTVKRG >NZ_CP007598|4304485:4404431|4337218_4337566_-|WP_000065257.1|DBSCAN-SWA MSNIIKQLEQEQMKQNVPSFRPGDTVEVKVWVVEGTKKRLQAFEGVVIAIRNRGLHSAFTVRKISNGEGVERVFQTHSPVVDSIAVKRRGAVRKAKLYYLRERTGKAARIKERLN >NZ_CP007598|4304485:4404431|4348143_4348620_-|WP_000242603.1|DBSCAN-SWA MVLFTRFMLMGIAMPQISRTALVPYSAEQMYQLVNDVQSYPQFLPGCVGSRVLESSPAQMTAAVDVSKAGISKTFTTRNQLTRNQSILMHLVDGPFKKLIGGWKFTPLSPEACRIEFQLDFEFTNKLIELAFGRIFKELASNMVQAFTVRAKEVYRAG >NZ_CP007598|4304485:4404431|4345549_4347211_+|WP_000880965.1|DBSCAN-SWA MLAQLTISNFAIVRELEIDFQSGMTVITGETGAGKSIAIDALGLCLGGRAEADMVRTGATRADLCARFALKDTPAALRWLEENQLEEGRECLLRRVISSDGRSRGFINGTAVPLSQLRELGQLLIQIHGQHAHQQLTKPEQQKSLLDSYANEAALAQQMAARYQLWHQSCRDLAHHQQQSQERAARAELLQYQLKELNDFNPQAGEFEQIDEEYKRLANSGQLLTTSQNALALLADGEDVNLQSQLYSAKQLVSELVGMDSKLSGILDMLEEATIQLTEASDELRHYCERLDLDPNRLFELEQRIAKQISLARKHHVSPEALPQLYQSLLEEQQQLDDQADSLETLTLAVNKHHQQALETAQALHQQRQFYAQELGQLITESMHLLSMPHGLFTIDVKFDEHHLSNDGADRVEFKVTTNPGQPLQPIAKVASGGELSRIALAIQVITARKMETPALIFDEVDVGISGPTAAVVGKLLRQLGESTQVMCVTHLPQVAGCGHQHFFVSKETDGAMTETHMQPLDKRARLQELARLLGGSEVTRNTLANAKELLAA >NZ_CP007598|4304485:4404431|4368098_4368584_-|WP_000980411.1|tail|DBSCAN-SWA MMMVLGLYVFMLRTVPYQELQYQRSWRHAANSRVNRRPSTQFLGPDNDSLTLSGVLLPEVTGGRLSLLALEQMAELGKAWPLIQGSGTIYGMFVIESLSQTKTEFFASGMPRRIEFTITLKRVDESLSDMFGSLSDQLSNLQDSAASAIGEIKNTVGGLLQ >NZ_CP007598|4304485:4404431|4382654_4382858_-|WP_000868184.1|tail|DBSCAN-SWA 
MKVRAHQYDTVDALCWRHYGRTQGVTEQVLQANPGLAEYGPFLPHGLQVELPDITASTTAQTVQLWD >NZ_CP007598|4304485:4404431|4343231_4343837_+|WP_001287926.1|DBSCAN-SWA MSEDYVIEWDKNFADDLNVVANVFLSHNPTLWPTIFSQLSTQPEIFEDEDEDEYGLQDVLDCSGGDLGNNELAQAFLQVLRGERFIHLVDWKGEDEEGELANFAADRFYELTKNLTNSEELRNLLVEITQEDEISDVCEAGDRYLDEIFERIQTELNKRGFQIFDLNEGSDTYNVVVLPMSEYKKIEDFNTPWLEVQDFLS >NZ_CP007598|4304485:4404431|4330602_4331763_+|WP_000200077.1|DBSCAN-SWA MTSENPLLALRDKISALDEELLALLAKRRALAIEVGQAKLLSHRPVRDIDRERALLDRLIHLGKAHHLDAHYITRLFQLIIEDSVLTQQALLQQHLNNTHPHSARIAFLGPKGSYSHLAARQYAARHFEQFIESGCAKFADIFHQVETGQADYAVVPIENTSSGAINDVYDLLQHTSLSIVGEMTVTIDHCVLVSGATDLNTIETVYSHPQPFQQCSKFLSRYPHWKIDYTESTSAAMEKVAQANSPRVAALGSEAGGMLHGLQVLERIAANQTQNITRFLVLARKAINVSDQVPAKTTLLIATGQQAGALVEALLVLRNHNLIMTKLESRPIHGNPWEEMFYLDIQANLESQVMQSALKELGEITRSMKVLGCYPSENVVPVEPA >NZ_CP007598|4304485:4404431|4385147_4385981_-|WP_000216276.1|capsid|DBSCAN-SWA MTVKAKRFRIGVEGATTDGREIQREWLVQMAASYNPTVYTALINLEHIKSYLPESTFNRYGRVTGLVAEEIQDGPLAGKMALYADIEPTDALVELVKKGQKLFTSMEVSTKFADTGKAYLVGLGATDDPASLGTEMLAFSASAAHNPLANRKQNPENLFSEAVETLIELEEVQDEKPSLFARVTALFTKKEQTDEARFSDVHKAVELVASEQQNLSERTDKSLSEQDKRLSELESSLQEQLAAFAELEQKLSSEDSRKDYRQRAPGGDAPAGTLTNC >NZ_CP007598|4304485:4404431|4381514_4381943_-|WP_000196199.1|lysis|DBSCAN-SWA MTRALAVVVALAFVALGWQSWRLNSASHTIETQLAALKSKAQELTKKNSQLIGLSILAETNNREQARLYAEAEQTSAQLRQRQRRIEELKRENEDLRRWADAPLPADIIRLRERPALAGGAAYREWLSQSDAVPLGQVSAAQ >NZ_CP007598|4304485:4404431|4330455_4330503_+|WP_010989056.1|DBSCAN-SWA MKLTRFFFAFFFIFP >NZ_CP007598|4304485:4404431|4383415_4384066_-|WP_000059173.1|terminase|DBSCAN-SWA MSLSPARQHRLRIQAEQAAREGGSVRHASGYDLMLLQLAEDRRRLKGVQSTVKKAEIKVELLPKYSAWAEGVLAAGGAQQDDVLMYVMLWRIDAGDYAGALEIGRHALRHGWVMPLGNRNVQTVLAEEMADAAQSALLAAAGFDADLLLQTLDLTTDLDMPDQSRARLHKAIGAVLSESNPASALNHLNHALQLDPRCGVKKEKQQLERRLRNDSR >NZ_CP007598|4304485:4404431|4378683_4379043_-|WP_000189373.1|DBSCAN-SWA MTPYIGMSRNDGQALADTDHLRQSVRDILLTPQGSRIARREYGSLLSALIDQPQNPALRLQIMSAVYVALNRWEPRLTLDSITINGNFDGSMVVELTGHSNNGAPVSLSISTGADNGSH 
>NZ_CP007598|4304485:4404431|4333820_4334891_-|WP_001168062.1|DBSCAN-SWA MQKDALNNVRITDEQVLMTPEQLKAAFPLSLAQEAQIAQSRGIISDIIAGRDPRLLVVCGPCSIHDPETALEYARRFKALAAEVSDSLYLVMRVYFEKPRTTVGWKGLINDPHMDGSFDVEAGLKIARQLLVELVNMGLPLATEALDPNSPQYLGDLFSWSAIGARTTESQTHREMASGLSMPVGFKNGTDGSLATAINAMRAAAQPHRFVGINQAGQVALLQTQGNPHGHVILRGGKAPNYSPADVAQCEKEMEQAGLRPSLMVDCSHGNSNKDYRRQPAVAESVVAQIKDGNRSIIGLMIESNIHEGNQSSEQPRSEMKYGVSVTDACISWEMTDALLREIHKDLSGQLAVRVA >NZ_CP007598|4304485:4404431|4380548_4380995_-|WP_000343949.1|DBSCAN-SWA MDELQRVDDWLMALLANLEPTARSRMMRQLAQQLRRSQQQNIRLQRNPDGSGYEPRRVTARSKKGRIKRQMFAKLRTTKYLKTAASADSASVQFDGKVQRIARVHHYGLRDRVSRKGPEVRYAERHLLGINDEVAALTCDTLLRWLIA >NZ_CP007598|4304485:4404431|4311103_4311523_+|WP_001098733.1|DBSCAN-SWA MNTVCTHCQAINRIPDDRLQDAAKCGRCGHELFDGEVINATGETLDKLLKDDLPVVIDFWAPWCSPCRNFAPIFEDVAEERSGKVRFVKVNTEAERELSARFGIRSIPTIMIFKHGQVVDMLNGAVPKAPFDSWLNEAL >NZ_CP007598|4304485:4404431|4327892_4328873_-|WP_000079130.1|DBSCAN-SWA MAQRVQLTATVSENQLGQRLDQALAEMFPDYSRSRIKEWILNQRVLVNGQLCDKPKEKVLGGERVAIDAEIDEEIRFEAQDIPLDIVYEDDDILVINKPRDLVVHPGAGNPDGTVLNALLHYYPPIADVPRAGIVHRLDKDTTGLMVVAKTVPAQTRLVESLQLREITREYEAVAIGHMTAGGTVNEPISRHPTKRTHMSVHPMGKPAVTHYRIMEHFRVHTRLRLRLETGRTHQIRVHMAHITHPLVGDQVYGGRPRPPKGASEEFISTLRKFDRQALHATMLRLYHPVSGIEMEWHAPIPQDMVDLIDAMRADFEDHKDDVDWL >NZ_CP007598|4304485:4404431|4305352_4306687_+|WP_000219174.1|DBSCAN-SWA MTVTTFSELELDESLLDALQDKGFTRPTAIQAAAIPPALDGRDVLGSAPTGTGKTAAYLLPALQHLLDFPRKKSGPPRILILTPTRELAMQVADHARELAKHTHLDIATITGGVAYMNHAEVFSENQDIVVATTGRLLQYIKEENFDCRAVETLILDEADRMLDMGFAQDIEHIAGETRWRKQTMLFSATLEGDAIKDFAERLLEDPVEVSANPSTRERKKIHQWYYRADNFEHKVALLKHLLKQDDATRSIVFVRKRERVHELAETLRLAGINNCYLEGEMAQIKRNEGIKRLTDGRVNVLVATDVAARGIDIPDVSHVINFDMPRSGDTYLHRIGRTGRAGRKGTAISLVEAHDHLLLLKIGRYIEEPLKARVIDELRPTTRAPSEKLTGKPSKKVLAKRAEKKKEKEKEKPRVKKRHRDTKNIGKRRKPSGTKMQEQSSEE >NZ_CP007598|4304485:4404431|4368580_4371388_-|WP_001282768.1|tail|DBSCAN-SWA 
MSDNNLRLQVILNAVDKLTRPFRSAQASSKELAAAIQQSRARLKELDAQAGRIDGFRKASAQLAVTGNSLKAAREETAKLATQFSATNRPTAAQARLLEQAKNRVTELQSKYNGLRQSVQRQRLALNEAGLDTKKLSSVQRELRQNADETRQALDRQQKSLKRLGEQQARMNAVRDQYSRRLEVRDRIAGAGATTTAAGVAMGAPVVAAVKSYASMEDAMKGVAKQVNGLRDDNGNRTKQFYDMQDAIKAASEQLPMENGAIDYAALVEGGARMGVTNQDDPYEEQKRDLLAFASTAAKAATAFELPADELAEGLGKIAQLYKVPTRNIEQLCDALNYLDDNAMSKGGDIINVLQRMGGVADRLDFRKAAALGSTFLSLGAAPEIAASASNAMVRELSIATMQSKRFFEGMNLLQLNPAEIEKQMTTDAMGTIQRVLEKVNNLPQDKRLSAMTMIFGKEFGDDAAKLANNLPELQRQLKLTSGSGANGSMQKESDINKDSLSAQWLLVKTGAQNAFSSLGETLRQPLMDIMGMVKRVTGALRRWVEQNPVLAGTLMKVAAATAAITVGLGALAVAVAAVLGPLAVIRFGLSMLSVKALPSAAAAATRTGSVLRLLISGPLALLRVALFAVGSLLGALLSPVGLVVAALAGVALVIWKYWQPISAFLGGVVEGFRAAAAPISAAFEPLRPVFQWIGDRVQALWGWFNDLLTPVKSTAEELNSAAAMGRRFGEALAEGLNMVMHPLESLKSGVSWLLEKLGIVSKEAAKAKLPAQVTQQQSATVNSDGKVVLPPGGFPAYAGMYDTGGIIPRGQFGIVGENGPEIVNGPANVTSRRRTAALASVVAGVMGVAATPAEAAPLHPFSLPARAYQPPLAKADSPPPVIRYEINAPIHIVAQPGQNAQDIAREVARQLDERERRARAKARSNFSDQGGYES >NZ_CP007598|4304485:4404431|4315104_4316460_+|WP_000949286.1|DBSCAN-SWA MLSKFKRNKHQQHLAQLPKISQSVDDVDFFYTPATFRETLLEKIASATQRICIVALYLEQDDGGKGILDALYAAKRQRPELDVRVLVDWHRAQRGRIGAAASNTNADWYCRLAQENPGIDVPVYGVPINTREALGVLHFKGFIIDDSVLYSGASLNDVYLHQHDKYRYDRYQLIRNRQMADIMFDWVTQNLMNGRGVNRLDNTQRPKSPEIKNDIRLYRQELRDASYHFQGDANDEQLSVTPLVGLGKSSLLNKTIFHLMPCAEHKLTICTPYFNLPAVLVRNIIQLLRDGKKVEIIVGDKTANDFYIPEDEPFKIIGALPYLYEINLRRFLSRLQYYVNTDQLVVRLWKDDDNTYHLKGMWVDDKWMLLTGNNLNPRAWRLDLENAILIHDPKQELAPQREKELELIRTHTTIVKHYRDLQSIADYPIKVRKLIRRLRRIRIDRLISRIL >NZ_CP007598|4304485:4404431|4384069_4385131_-|WP_000730760.1|capsid|DBSCAN-SWA MKKKTRFAFNAYLQQLARLNSVEVEELSSKFTVEPSVQQTLEDQIQQSAAFLTLINITPVTEQSGQLLGLGVGSTIAGTTDTTTKEREPTDPTLMEDVEYKCEQTNFDTVLTYAKLDLWAKFQDFQVRIRNAIVKRQALDRIMIGFNGVKRAKTSNRAENPLLQDVNKGWLQKIREDAPDHVMGSTTKDGATTAGAVKVGKGGDYANLDAVVMDAVNELIDAVYQDDDDLVVVCGRELLSDKYFPLVNKEQDNSEKIAADLIISQKRMGGLQAVRAPYFPANALLITRLDNLSIYWQEDTRRRSVIDNPKRDRIENFESVNEVYVVEDYRCAALVENIEIGDFSAPAAPESGE >NZ_CP007598|4304485:4404431|4372396_4373569_-|WP_000046107.1|tail|DBSCAN-SWA 
MAQDYHHGVRVVEINEGTRPITTVSTAIVGMVCTGDDADASVFPLNKPVLLTDVLTASGKAGESGTLARSLDAIADQAKPVTVVVRVAQGETEAETTSNIIGGVTSDGKKTGMKALLSAQSQLGVKPCILGGPGHDTQAVATELLGVAQSLRGFAYLAANGCKTVEEAIAYRENFSQREGMLIWPDFINFDTVLKADATAYASARALGLRAKIDEQIGWHKTLSNVGVNGVTGISADVFWDLQDPATDAGLLNKNDVTTLIRKDGFRFWGSRCLSDDPLFAFENYTRTAQVLADTMAEAHMWAVDGVLNPSLARDIIEGLRAKMRSLVNQGYLIGGDCWLDESVNDKDALKAGKLTIDYDYTPVPPLENLMLRQRITDRYLVDFASRVAA >NZ_CP007598|4304485:4404431|4338418_4338967_-|WP_000043266.1|DBSCAN-SWA MSKQLAAQVPAEPVVLGKMGSSYGIRGWLRVFSSTEDAESIFDYQPWFIQKAGQWQQVQLESWKHHNQDLIIKLKGVDDRDAANLLTNCEIVVDSSQLPALEEGDYYWKDLMGCQVVTAEGYDLGKVIDMMETGSNDVLVIKANLKDAFGIKERLVPFLDGQVIKKVDLATRTIEVDWDPGF >NZ_CP007598|4304485:4404431|4316824_4318126_-|WP_000807815.1|DBSCAN-SWA MAEIIQRIDKNNAEAPEMRRRIWAIVGASSGNLVEWFDFYVYSFCSLYFAHIFFPSGNTTTQLLQTAGVFAAGFLMRPIGGWLFGRIADRRGRKASMLISVCMMCMGSLVIACLPGYDTIGTWAPALLLIARLFQGLSVGGEYGTSATYMSEVALEGRKGFFASFQYVTLIGGQLLALLVVVILQQVLEDAELRAWGWRIPFALGAVLAVVALWLRRQLDETSKHETRALKEAGSLKGLWRNRKAFLMVLGFTAAGSLCFYTFTTYMQKYLVNTAGMHANVASGIMTVALCVFMFVQPLFGALSDKIGRRTSMLCFGALATLFTVPILSALQNVTSPYAAFALVMCALLIVSFYTSISGILKAEMFPAQVRALGVGLSYAVANALFGGSAEYVALSLKSVGMESSFFWYVTAMAVLAFLVSLMLHRKGKGLRL >NZ_CP007598|4304485:4404431|4387889_4388930_+|WP_001542203.1|portal|DBSCAN-SWA MSELEALTSTTPTEDMAPKNADVTAEAFSFGDPIPVLDRRELLDYVECVQMDRWYEPPVSFDGLARTYRAAVHHSSPIAVKRNILTSTFIPHPLLSQQAFSRFVQDYLVFGNAYLEKRTNRLGGILSLEPSLAKYTRRGIDLDTYWFVQYGLTTPPYEFTKGSIFHLMEPDLNQEIYGLPEYLSAIPSALLNESATLFRRKYYINGSHAGFIMYMTDAAQNQEDVNNIRQAMKSAKGPGNFRNLFMYSPNGKKDGIQIIPLSEVAAKDEFLNIKNVSRDDMMAAHRVPPQMMGIMPSNVGGFGDVEKAAKVFVRNELLPLQKRLTELNSWLNDEVIKFEAYSYDIT >NZ_CP007598|4304485:4404431|4399804_4400467_+|WP_000360326.1|DBSCAN-SWA MDKPDFDKLYISAYKIDKNNDSKVLNIGPDFLYKQRSILESKRKNKYDFNTKLSYLALWPLIIACNYLKKYDNASFVQEYIIPNLLMQWISRNSNENVVGIAYRSTKLPANALGSRGINVVLPPKVRYEEMANNEFCPNLAKIFKFTLPVSWQVLKTVEYVPESVAQSDRENLSRRLRRRKNRELTGSIDDEILNIYNLTDFYKLETCMDEIQVYAHIKP >NZ_CP007598|4304485:4404431|4335841_4337062_+|WP_001030985.1|DBSCAN-SWA 
MNKEFSLSRPTFKRTLRRISIISVLLTMTLIWLLICVASVLTLKQYAQKNLDLTAATMAHSLEAALVFSDNAAAAETLATLGRQGQFSAAEVRDKNGRTIASWRYDARAADDKLIGLISHWLFPLPVSQPVWHNGRAIGEVRLVARDSLIGHFIWLSLAVLTGCILLASGIALLLTRYLHNGVVDALQNITEVVHDVRTNRNFSRRVPDERIAEFHLFAQDFNSLLDEMEEWQLRLQAKNAQLLRTALHDPLTGLANRAAFRSCINALMKDNSARSSSALLFLDGDNFKYINDTWGHAAGDRVLIEVAKRLAEFGGSRYQTYRLGGDEFAMVLYDVHSEYEVQRICAALSQAFNRPFELHNGQRITMTLSIGFALTWEHATAEKLQELADRNMYQAKHRRAERSLN >NZ_CP007598|4304485:4404431|4343871_4344462_-|WP_001518875.1|DBSCAN-SWA MSSKEQKTPEGQAPEEIIMDQHEEVEAVEPNDSAEQVDPRDEKIANLEVQLAEAQTRERDTVLRIKAEMENLRRRTEQDIEKAHKFALEKFVNELLPVIDSLDRALEVADKANPDMAAMVEGIELTLKSMLDVVRKFGVEVIAETNVPLDPNVHQAIAMVESEEVPAGNVLGIMQKGYTLNGRTIRAAMVTVAKAK >NZ_CP007598|4304485:4404431|4374765_4375341_-|WP_000143187.1|tail|DBSCAN-SWA MTFKMSSKARTITIYNLRSDTNEFIGAGDAYIPPHTGLPANCTDIAPPDIPASHIAVFDAETEMWSLHEDHRGETVYDTTTGNQVYISSPGPLPENVTSVSPDGEYQKWNGKAWVKDEVAETAARLREAEGTKSRLLQTAAEKIAPLQDAVDLEIATDDEKVQLDEWKKYRVLVNRVDTTNPDWPDVPVSQ >NZ_CP007598|4304485:4404431|4324461_4327035_-|WP_001235093.1|DBSCAN-SWA MRLDRLTNKFQLALADAQSLALGHDNQFIEPLHLMSALLNQEGGSIRPLLTSAGINAGQLRTAIDQALSRLPQVEGTGGDVQPSSELVRVLNLCDKLAQKRGDNFISSELFVLAALESRGTLTDLLKSAGATTANITQAIEQMRGGESVNDQGAEDQRQALKKYTVDLTERAEQGKLDPVIGRDEEIRRTIQVLQRRTKNNPVLIGEPGVGKTAIVEGLAQRIINGEVPEGLKGRRVLALDMGALVAGAKYRGEFEERLKGVLNDLAKQEGNVILFIDELHTMVGAGKADGAMDAGNMLKPALARGELHCVGATTLDEYRQYIEKDAALERRFQKVFVAEPSVEDTIAILRGLKERYELHHHVQITDPAIVAAATLSHRYIADRQLPDKAIDLIDEAASSIRMQIDSKPEELDRLDRRIIQLKLEQQALMKESDEASKKRLDMLNEELDDKERQYSELEEEWKAEKASLSGTQTIKAELEQAKIAIEQARRVGDLARMSELQYGKIPELEKQLEAATQSEGKTMRLLRNKVTDAEIAEVLARWTGIPVARMLEGEREKLLRMEQELHSRVIGQNEAVEAVSNAIRRSRAGLSDPNRPIGSFLFLGPTGVGKTELCKALANFMFDSDDAMVRIDMSEFMEKHSVSRLVGAPPGYVGYEEGGYLTEAVRRRPYSVILLDEVEKAHPDVFNILLQVLDDGRLTDGQGRTVDFRNTVVIMTSNLGSDLIQERFGELDYGRMKEMVLGVVSQNFRPEFINRIDEVVVFHPLGEQHIASIAQIQLQRLYKRLEERGYEIHISDEALKLLSANGYDPVYGARPLKRAIQQQIENPLAQQILSGELVPGKVIRLEASDDRIVAVQ >NZ_CP007598|4304485:4404431|4375340_4377194_-|WP_001274649.1|tail|DBSCAN-SWA 
MSAKFYTLLTEIGAAKLASAAALGVPLKITHMAVGDGGGVLPTPSAQQTALVAEKRRAALNMLYIDPQNSSQIIAEQVIPETEGGWWIREVGLFDETGALIAVGNCPESYKPQLTEGSGRTQTVRMVLITSSTDNITLKIDTAVVLATRKYVDDKALELKVYVDDLMAKHLAAPDPHSQYAQKDSPTLTGIPKVPTPAAGNSTKQIANTEFVASSIAAIVDSAPAALDTLNELAAALGNDPNFATTMINALAGKQPLDNTLTNLSGKDVAGLLAYLGLGETINLATGAMQKDQNLNDVPDKALARQSLQLGNSATLNVGTTPDTVAAGDDIRISTAKRAIDDTQTGLGAQPVMWVSTADDLSILPSGARRFASNKAPATILPVNDYVFLEVIAKRDCVDGCAVLITDSVGNTWIGARWDATNDSGFTWRPLMSCPPGVPLPWPSDTIPAGYALMQGQAFDKNVYPLLAIAYPSGTIPDMRGWTIKGKPASGRAVLSQELDGNKSHSHSARAQDTDLGTKGTSSFDYGTKSSNTTGGHNHSAGGTYGGDSIGGKIRVQHDGNDQLTSWNGDHAHTTWIGPHDHTVYIGPHGHVVIVDADGNEETTVKNIAFNYIVRLA >NZ_CP007598|4304485:4404431|4382857_4383322_-|WP_000673537.1|head|DBSCAN-SWA MKFVAPEQAPEQAEVIKNTPFWPDVDLSEFRSVMRTDGTVTQQRLKQVVLTAISEVNAELYDFRNRQQMQGWRTLAEVPAEMLDGKSERIRHYHNAVFCWARAVLNERYQDYDATVSGVKRGEELAEASGDLWRDARWAISRVQDTPHCTVELI >NZ_CP007598|4304485:4404431|4312329_4314990_+|WP_000082648.1|DBSCAN-SWA MSQQGLEALLRPKSIAVIGASMKPHRAGYLMMRNLLAGGFNGPVLPVTPAWKAVLGVMAWPDIASLPFTPDLAILCTNASRNLALLDALGAKGCKTCIILSAPTSQHEELLACARHYKMRLLGPNSLGLLAPWQGLNASFSPVPIKQGKLAFISQSAAVSNTILDWAQQREMGFSYFIALGDSLDIDVDELLDYLARDSKTSAILLYLEQLSDARRFVSAARSASRNKPILVIKSGRSPAAQRLLNTSAGMDPAWDAAIQRAGLLRVQDTHELFSAVETLSHMRPLRGDRLMIISNGAAPAALALDELWSRNGKLATLSEETCLQLRQALPAHIDIANPLDLCDDASSEHYVKTLDILLASQDFDALMVIHSPSAAAPGTESAHALIETIKRHPRGKLVTLLTNWCGEFSSQEARRLFSEAGLPTYRTPEGTITAFMHMVEYRRNQKQLRETPALPSNLTSNTAEAHNLLQRAIAEGAASLDTHEVQPILHAYGLHTLPTWIASDSAEAVHIAEQIGYPVALKLRSPDIPHKSEVQGVMLYLRTASEVQQAANAIFDRVKMAWPQARIHGLLVQSMANRAGAQELRVVVEHDPVFGPLIMLGEGGVEWRPEEQAVVALPPLNMNLARYLVIQGIKQRKIRARSALRPLDIVGLSQLLVQVSNLIVDCPEIQRLDIHPLLASASEFTALDVTLDIAPFDGDNESRLAVRSYPHQLEEWVEMKNGDRCLFRPILPEDEPQLRQFIAQVTKEDLYYRYFSEINEFTHEDLANMTQIDYDREMAFVAVRRMDNAEEILGVTRAISDPDNVDAEFAVLVRSDLKGLGLGRRLMEKLIAYTRDHGLKRLNGITMPNNRGMVALARKLGFQVDIQLEEGIVGLTLNLAKCDES >NZ_CP007598|4304485:4404431|4304485_4305223_-|WP_000083345.1|tRNA|DBSCAN-SWA 
MSQSGSVLRRNGFTFKQFFVAHDRCAMKVGTDGILLGAWAPVADVKRILDIGTGSGLLALMLAQRTDDNVPIDAVELDAGAAMQAQENVAHSPWPHRITVHTDDIQSWAPRQTVRFDLIISNPPYYEPGVECATPQREQARYTATLDHQTLLAIAADCITEDGFFCVVLPEQIGNAFTQQALNMGWHLRLRTDVAENEARLPHRVLLAFSPQAGECFSDRLVIRGSDQHYSESYTALTQAFYLFM >NZ_CP007598|4304485:4404431|4379695_4380547_+|WP_000958562.1|DBSCAN-SWA MLTVGIYGFNITKVTHFSFGTMFPTCKSISEIIKKMKSRDELHLTAFLELDINDANECRDILFHLTAILSFIEQRPVSFGYSLRKHESMGNLDDDYPKLINIAYSIKSTGIIIKEDYYSKNSRRYFIEAALNKIIIEKDRHYSTLLHKNVQVFSTPQRYIDVSYYLLFSGLESIARQRENDLSNNAPSVLYKYLSKFKFDIKQQDNKRPPRSLDIYSGLRNALFHNGEYQTAPMKRNGTECTFLLKDYYSYFRRLNSLVILKEANFEDGKINWDFVNYRHYFK >NZ_CP007598|4304485:4404431|4379039_4379618_-|WP_001672413.1|plate|DBSCAN-SWA MNAQLTEIMRLITNLIRTGTVTEVDRENWLCRVKVGELETNWINWLTLRAGSARTWWCPSPDEQVVVLSMGGNLETAFVLPAIYSNQFAPPSDSVDGCVTEYPDGGWFEYEPATGRWHVRGIKSMVIEAADNITLKTGEFVVEADTTRINSEVVINGGVTQGGGAMSSNGVVMDKHGHTGVKSGGDTSGGPV >NZ_CP007598|4304485:4404431|4335330_4335849_+|WP_001212379.1|DBSCAN-SWA MRFSHRFILLLSLLLASLPLYAQRVTEEEKSVRAIVSGIVSYTHWPALSGPPRLCIFSSARFVRVLSEEADWAFPYQPLVIRTTQEALSARCDGFYFGNESPAYQVELTRHYPVNALLLIAEQNTECIIGSAFCLIINNDEVKFSVNLDSLSHSGVRVNPEVLMLARNQKHE >NZ_CP007598|4304485:4404431|4329004_4329742_+|WP_000197660.1|DBSCAN-SWA MTRMKYLVAAATLSLFLAGCSGSKEEVPDNPPNEIYATAQQKLQDGNWKQAITQLEALDNRYPFGPYSQQVQLDLIYAYYKNADLPLAQAAIDRFMRLNPTHPNIDYVMYMRGLTNMALDDSALQGFFGVDRSDRDPQHARAAFNDFSKLVRSYPNSQYTTDATKRLVFLKDRLAKYEYSVAEYYTARGAWVAVVNRVEGMLRNYPDTQATRDALPLMENAYRQMQLNAQADKVAKIIAANSKNT >NZ_CP007598|4304485:4404431|4371514_4371817_-|WP_001280963.1|tail|DBSCAN-SWA MSDKLTEKTVKLDTPIMRGKAEITEIVLRKPQSGALRGTRLQAIMDMDVGAMMTVIPRISTPTLTAQEMAELDPADLTALSVEVVTFLLPRSVLAGLPTA >NZ_CP007598|4304485:4404431|4347863_4348154_-|WP_001112990.1|DBSCAN-SWA MPDKLVVEVAYALPEKQYLQRVTLEEGATVEEAIRASGLLELRTDIDLAKNKVGIYSRPVKLTDTVQDGDRVEIYRPLIADPKALRRQRAEKSAGR >NZ_CP007598|4304485:4404431|4318229_4318685_-|WP_000985655.1|DBSCAN-SWA 
MAIKPFNYQQDFSSIDFRQQPELYQVGRGEQGVLLVEPYKSEILPFWRYKDEASAMKSAEQVYQLFEAYRQQDDFVGMDMARKFIQMGYTRARRYANYKGGKKYAEDGSLNTRGNDPIKAAAATVFKGWWDKIRQDEDYLKRKRQHQARWG >NZ_CP007598|4304485:4404431|4306704_4307604_-|WP_001675040.1|DBSCAN-SWA MDLRRFITLKTVVEEGSFLRASQKLCCTQSTVTFHIQQLEREFSLQLFEKIGRRMCLTTEGKKLMPHIHELTRVMELIREAARQDAEPGGELRVATGETLLAYKMPQVLQRFKLRAPNVKLSLQSLNCYVIRDALLNDEVDLGVFYRVGNDDALTMQQLGEQSLALVASPLLQDADFTQPDQHIPCSFIINEPQCVFRQLFESTLRQRRITLENTIELWSIESIKQCVAANLGISFLPRFTVERELSTGQLKELPFGAPSLSIMALCAHHAGKAVSPAMQIFIQCMEACFTVEDKKMPG >NZ_CP007598|4304485:4404431|4403319_4404431_-|WP_089113803.1|transposase|DBSCAN-SWA MRKARFTEHQIITVIKSVEAGRTVKDVCREAGISEATYYNWKSRYGGMEPSDIKKIKDLEDENRRLKQMFADLSLENRALKDVIEKKPLKPAFKRELVTHLITAFGLSIRQACRSLNLSRTVYHYRPDNTRDEPVITALQAAAERYPRYGFPKLFQVLRRQGYMWNHKRIHRIYCLLKLNFRRKGKQRLPVRNPSPLATPEALNQSWSVDFMHDALVCGRRFRTFNVVDDFNREALSIEIDRNLPAPRVVRVLDRIAANRGYPAMLRMDNGPEFISLALTEWAEKHAVKLVFIQPGKPTQNAFIERFNRTYRTEILDFYLFRTLNEVREITEKWLSEYNCERPYESLNNMTPEEYRQHHYLAGNSKNVWN >NZ_CP007598|4304485:4404431|4396485_4396686_-|WP_000956190.1|DBSCAN-SWA MLTKEPSFASLLVKQSPAMHYGHGWIMGKDGKRWHPCRSQDELLAELSTKKRGNKWLLKALRRLFH >NZ_CP007598|4304485:4404431|4396693_4397203_-|WP_000460848.1|DBSCAN-SWA MFDYKISKHPHFDEACRAFALRHNLVQLAERAGMNVQILRNKLNPAQPHLLTAPEIWLLTDLTEDSTLVDGFLAQIHCLPCVPINEVAKEKLPHYVMSATAEIGRVAAGAVSGDVKTSAGRRDAISSINSVTRLMALAAVSLQARLQANPAMASAVDTVTGLGASFGLL >NZ_CP007598|4304485:4404431|4377788_4378697_-|WP_000268332.1|plate|DBSCAN-SWA MAVIDLSRLPPPQIVDVPDFETLLAERKAAFVALYPVDEQDAVRRTLALESEPVTKLLQESTYREILLRQRINEAAQAVMVAYSMGNDLEQLAANCNVKRLTVVPADNDAVPPVAAVMEDDEALRQRIPAAFEGLSVAGPTGAYEFHARSADGRVADASATSPAPAEVVLTVLSREGDGTAVKDLLDVVEKALNSESVRPVADRLTVRSAEIIPYRVEATIFLYPGPEAEPVMAAAKASLQKYIASQTRLGRDIRRSAIYAALHVEGVQRVELTSPLEDVVLDKTQAASCTEWSVTNGGTDE >NZ_CP007598|4304485:4404431|4316504_4316828_+|WP_001264473.1|DBSCAN-SWA MRVLIPFTVLFLSGCSHLANDHWSGQDKAQHFMASAMLSAAGNEYARHQGVSPDRSAAIGLMFSLSLGASKELWDSRPEGSGWSWKDFVWDVAGATTGYAIWQMARY >NZ_CP007598|4304485:4404431|4339486_4340848_-|WP_000460052.1|DBSCAN-SWA 
MFDNLTDRLSRTLRNISGRGRLTEDNVKETLREVRMALLEADVALPVVREFINRVKEKAVGHEVNKSLTPGQEFVKIVRSELVAAMGEENQTLNLAAQPPAVVLMAGLQGAGKTTSVGKLGKFLREKHKKKVLVVSADVYRPAAIKQLETLAEQVGVDFFPSDVGQKPVDIVNAALKEAKLKFYDVLLVDTAGRLHVDEAMMDEIKQVHASIKPVETLFVVDAMTGQDAANTAKAFNEALPLTGVVLTKVDGDARGGAALSIRHITGKPIKFLGVGEKTDALEPFHPDRIASRILGMGDVLSLIEDIESKVDRAQAEKLATKLKKGDGFDLNDFLEQLKQMKNMGGMASLMGKLPGMGQIPDNVKSQMDDKVLVRMEAIINSMTLKERAKPEIIKGSRKRRIAQGCGMQVQDVNRLLKQFDDMQRMMKKMKKGGMAKMMRSMKGMMPPGFPGR >NZ_CP007598|4304485:4404431|4349866_4361341_+|WP_001237694.1|DBSCAN-SWA MRLLAVVSKLTGVSTTVESSAVTLNAPSIVKLSVAREEISQLTRINQDLVVTLHSGETITIKNFYVTNDLGASQLVLAENDGTLWWVENPQAGLHFEQIADINELLVTSGASHEAGGAVWPWVLAGAVAAGGIAAIASSGGGDSHHHSDGDNPPPDNTNPDGNPPDNSNPGGSTPNGNTPGSSNPVDTTPPLAPGELLISADGKTVSGQAEAGSTITIKDPSGNVVGEGKADSDGKFSIDLTAPQISGEQLTVTATDDAGNTGPSATIDAPNIPLPDTPAITAAIDDAAPLTSTLSNNQFTNDNTPTLEGTGSAGTVIHIYANGQEIGSTTVDTSGNWHFAITSALADGENHFTAIATNVKGESSESARFTLTIDTLIPDAPRVELIADNTGLLTGPLQNNDRTDEAKPLFSGQGEAGNTITIKEGSTVIGSATVDENGRWTFTPTTPLSDGEHTFTVEQSDKAGNTSRVTTTPTIIVDTTPPDAAIIDNVAKDGTTVSGTAEAGSTVSIYDPAGNYLGSTITGENNHFSITLNPAQTHGERLEARIQDAVGNIGPATEFTASDSQYPAQPTILTVTDDAGAVTGLLKNGDATDDNRPTLSGTAEPGSTISINDNGFPVATFPPIVADADGKWSFTPSLALADGDHVFTATATNDRGTSGQSVSFTIDIDTQPPVLEGLAVSDVGDRLTGTTEAGSTVVIKDSLGNTLGSGTAGDDGTFSIGISPAKINGETLSISVTDKAANSGPVETLNAPDKTAPAAPDGLTVATDGLSVSGQAEAGATVTIRDSSNTVLGSAVANGNGQFIVPLNTAQTNGQALIATATDVAKNESAAATVIAPDSTAPEMPKNVVISEDGTSISGTAEPGSAITIATPDGKPLGSGKADGEGHFTLPLVPAQTNGEQVTVTATDSANNVSPPTTAQAPDITAPDKPIITQVLDDVESFTGPLVNGQTTNDNRPTLSGTAEAGARVEVFDNGVSLGLATLQPNGAWTFTPSQNLGEGAHRLTVIATDAKGNASQAASFDLVVDTQSPQQPVITFITDDAPGILGSVAHLGLTNDSTPTINGTGEPGSTVHLYQNGARIADIIVGNSGVWSYAYTTASPLADDTYTFTVTASDSNGNTTPFSTDFTITIDTQAPAAPGVIGVADGDGNTIDTNQITQESQPRLSGSGTAGDTIILYDNGNAIGQALVGTDGRWQFTPPAALGDGDHHLTARANDPAGNESPESISFTLRIDTQAPDAPQIVSAAITGGEGEVLLANGSITNQRMPTLSGTGEPGTIITLYNNGVELATVQVNPQGSWTYPLTRNLSEGLNILTATATDAAGNSSPTSGVFSVTLDTQPPAQPDAPLISDNVAPVIGNIGNNGATNDTTPTFSGTGEIGSTIILYNNGSEIGRTTVGDNGSWNFTPAALTPETYTITVTETDRAGNISPPSASVTFTLDTTAPANPVITFAEDNVGEVQDTIVSGATTDDN
TPVIHGTGDIGSIITLYNGSSVLGVVTVDETGTWTLAVTSALPDGVYTLTAIAADAAGNSSGVSNSFTFTVDTVPLQPPVVNEILDDVAPVTGPLTDGAFTNDRTLTINGSGENGSTVTIYDNGVAIGTALVTDGVWTFNTPELSEASHALTFSATDDAGNTTAQTQPITITVDITAPPAPTIQTVDDDGTRVAGLADPYATVEIHHADGTLVGSAVANGTGEFVVTLSPAQTDGGTLTAIAIDRAGNNGPATNFPASDSGLPAVPAITAIEDDVGSVQGNIAAGGATDDTMPTLRGTTDIGSTVEVFIDGDSAGFATVDASGNWIFEIATPLSESTHYFTVQATNANGPGGLSAPVGITVDLSAPAQPVITSATDDVPGMTGTLDNGALTNDSRPTLNGTGEAGATIRILDNGVEIGSATVDQSGNWRFTPNAPLESNAHIFTAVATDPAGNSGQPSDGFTLNIDAQAPDVPVITSVIDDNNQPTVPVLPGQSTDDRQPILNGTGEPGATITIFDNGTPLGTAQVGENGSWTFPVPRNLSEGSHNLTVSATDPAGNTSAVSAPWTIVVDITPPAIPVLTSVVDDQPGITGNLVSGQLTNDATPTLNGRGEAGATINVYLDGNPASIGTTTVNSDGTWSFTPQTPLANGSHTFTLSATDPAGNSSSVSSGFVLTIDATPPAAPVIASVADNTAPVTGIVPNGGSTNETRPTLSGTGEAGTTISIYNGSALVGTAQVQANGSWSFTPSTSLGAGVWNLTATATDAAGNTSAASEIRSFTIDTTAPAAPVIDTVYDGTGPITGNLSSGQITDEARPVISGTREANTTIRLYDNGTLLAEIPADNSSSWRYTPDASLATGNHVITVIAVDAAGNASPVSDSVNFVVDTTPPLTPVITSVSDDQAPGLGTIANGQNTNDPTPTFSGTAEAGATITLYENGTVIGTTTAQPDGAWSVSTSTLASGTHVITAVATDAAGNSSPNSTAFTLTVDTTAPQTPILTSVVDDVAGGVTGNLANGQITNDNRPTLNGTAEAGSVVSIYDGDTLLGVTSANASGAWSFTPTTGLNDGTRTLTVTATDPAGNVSPATSGFTIVVDTLAPTVPLITSIVDDVPNNTGAIGNGQSTNDTQPTLNGTAEANSAVSIFDNSALVATVNANASGNWSWTPTAALGQGSHAYSVSAADAAGNVSAASPSITIIVDTIAPGAPGNLVINATGNRVTGTAEAGSTVTITSDTGVVLGTATADGTGSFTATLTPAQTNGQPLLAFAQDKAGNTGIAAGFTAPDTRVPEAPIITNVVDDVGIYTGAIANGQVTNDAQPTLNGTAQAGATVSIYNNGALLGTTTANASGNWSFTPTGNLTEGSHAFTATATNANGTGSVSTAATVIVDTLAPGTPSGTLSADGGSLSGLAEANSTVTVTLTGGVTLTTTAGSNGAWSLTLPTKQIEGQLINVTATDAAGNASGTLGITAPVLPLAARDNITSLDLTSTAVTSTQSYSDYGLLLVGALGNVASVLGNDTAQVEFTIAEGGTGDVTIDAAATGIVLSLLSTQEIVVQRYDTSLGAWTTIVNTAVGDFANLLTLTGSGVTLNLSGLGEGQYRVLTYNTSLLATGSYTSLDVDVHQTSAGIISGPTISTGNVMADDTAPTGTTVTAITNANGVSTPVGAGGVDIQGQYGTLHINQDGSYTYTLAKPTAGYGHKESFTYTITQNGVGSSAAQLVINLGPAPVPGSVIATDNNASLVFDTHVSYVNNGPSTQSGVTVLSVGLGNVLNANLLDDMTNPIIFNVEEGATRTMTLQGTVGGVSLVSTFDLYVYRFNDAIQQYEQFRVEKGWINTLLLAGQSQPLTLTLPGGEYLFVLNTASGISVLTGYTLAISQDHTYAVDSITANTTGNVLTNDVVPTDALLTEVNGVAIAATGTTEVNGLYGSLIIDARGNYTYTLKNGVGADSIKTPDSFIYTVKAPNGDTDTASLNITPTARALDAINDVSDTLSVATLQDTAAWLDS
SVGSASWGLLGKSGSGSGTFDVATGTVLKGASLVFDVSTLITLGNLNISWAIQENGTVIRNGTVPVANITLGSATVTVNLSGLELDAGTYTLNFTGTNTLAGAATITPRVIGTTVDLDNFETSGTHTVLGNIFDGSDAAGAMDQLNTVNTRLSISGYNGSAATLDAAANTTSATIQGHYGTLQINLDGAYTYTLNNGVAMSSITSKEVFTYQLDDKMGHTDSATLTIDMAPQIVSTNQNDVLIGSAYGDTLIYHLLNGADATGGNGVDRWQNFSTAQGDKIDIHELLTGWDHQAATLGNFVQVHTSGANTVISVDRDGAGSAFKSTDLVTLENVQLTLNDLLQNNHLITGG >NZ_CP007598|4304485:4404431|4401720_4402524_+|WP_000445376.1|DBSCAN-SWA MESNDPGKLIWHVACDESGIDGQRFYGFGSLWMKYQRRGDFARIVRELREKHNCSDEIKWQKAHSKRNAAFYQDLIETFFKHPWLAFHCIVVEKSKVEKSFHGGDYDLAMRKHFGKLIETKIGNVIKAHPDRECEFRVEVDPLPSRYKKADEEFHVITNHTLARRFGRKDIIKSVVSKDSKASEHIQIADFLLGAVMCAYQGKATSEAKLAVANNVASYLGWDSLMHDTWPTERKFNIWFFFDRSKGPRDIVTQEVKLTYALPNTRK >NZ_CP007598|4304485:4404431|4344585_4345464_+|WP_001059151.1|DBSCAN-SWA MNNHFKCIGIVGHPRHPTALTTHEMLYRWLCAQGYEVIVEQQIAHELQLKNVPTGTLAEIGQQADLAVVVGGDGNMLGAARTLARYNINVIGINRGNLGFLTDLDPDNALQQLSDVLEGRYISEKRFLLEAQVCQQDRQKRISTAINEVVLHPGKVAHMIEFEVYIDETFAFSQRSDGLIISTPTGSTAYSLSAGGPILTPSLDAITLVPMFPHTLSARPLVINSSSTIRLRFSHRRSDLEISCDSQIALPIQEGEDVLIRRCDYHLNLIHPKDYSYFNTLSTKLGWSKKLF >NZ_CP007598|4304485:4404431|4330013_4330352_+|WP_000178449.1|DBSCAN-SWA MTMNITSKQMEITPAIRQHVADRLAKLEKWQTHLINPHIILSKEPQGFIADATINTPNGHLVASAKHEDMYTAINELINKLERQLNKVQHKGEARRATASVKDASFVEAEEE >NZ_CP007598|4304485:4404431|4327164_4327896_-|WP_000992636.1|DBSCAN-SWA MNALIVPQWPLPKGVAACSSTRIGGVSLPPYDSLNLGAHCGDNPEHVEENRKRLFAAGNLPSKPVWLEQVHGKNVLRLTGEPYASKRADASYSNTPGTVCAVMTADCLPVLFCNREGTEVAAAHAGWRGLCEGVLEETVTCFADKPENIIAWLGPAIGPTAFEVGPEVRDAFLAKDAQADSAFLPHGEKFLADIYQLACQRLANTGVEHVYGGDRCTFSESETFFSYRRDKTTGRMASFIWLI >NZ_CP007598|4304485:4404431|4400728_4401322_+|WP_001142974.1|DBSCAN-SWA MPTPESLWKILASPISTKTAVRYGFLSVAIVLSIRFILPILKDIAPLTEQDLPGYSYAAHFALTILFSLIAETLTFELVAWMYLKLEKFIERRKLRKELARKQEIESKEIKRIQEKFIENFILAWPLIAPRYKYFVYELRRGHKGYSDTNSAINYLRQQGWIHPITELPYSHCLYRLDELIFEALNRIESAATATDS >NZ_CP007598|4304485:4404431|4392387_4394802_-|WP_000017507.1|DBSCAN-SWA 
MSHADMNNCSGFNEAAAAFSWNSPKKAINPYLDPAEVAPVSALSNLITLYAADNEQEQLRREALSDQVWERYFFNESRDPVQREMEQDKLISRAKLAHEQQRFNPDMVILADVNAQPSHISKPLMQRIEYFSSLGRPKAYSRYLRETIKPCLERLEHVRDSQLSTSFRFMASHEGLDGLLILPEMSQDQVKRLSTLVAAHMSMCLDAACGDLYATDDVKPEEIRKTWEKVAAETLRLDVIPPAFEQLRRKRNRRKPVPYELIPGSLARMLCADWWYRKLWKMRCEWREEQLRAVCLVSKKASPYVSYEVVMHKREQRRKSLEFFRSHELVNEDGDTLDMEDVVNASSSNPAHRRNEMMACVKGLELIAEMRGDCAVFYTITCPSRFHSTLNNGRPNPTWTNATVRQSSDYLVGMFAAFRKAMHKAGLRWYGVRVAEPHHDGTVHWHLLCFMRKKDRRAITALLRKFAIREDREELGNNTGPRFKSELINPRKGTPTSYIAKYISKNIDGRGLAGEISKETGKSLRDNAEYVNAWASLHRVQQFRFFGIPGRQAYRELRLLAGQAARQQGDKKAGAPVLDNPRLDAILAAADAGCFATYIMKQGGVLVPRKYHLIRTAYEINEEPTAYGDHGIRIYGIWSPIAEGKICTHAVKWKMVRKAVDVQEAAADQGACAPWTRGNNCPLAENLNQQEKDKSADGGTRTDITRMDDKELHDYLHSMSKQDRRELAARLRLVKPKRRKDYKQRITDHQHQQLVYELKSRGFDGSEKEVDLLLRGGSIPSGEGLRIFYRNQRLKEDDKWRNLY >NZ_CP007598|4304485:4404431|4373671_4373896_-|WP_000974843.1|DBSCAN-SWA MAIIRAVCSVHFRAGLAAARAQGRVGGRRPKLTPEQWEQAGRLLAAGETRHRVGLLFDVSISTLYKKFPVNQSR >NZ_CP007598|4304485:4404431|4396180_4396522_-|WP_000963474.1|DBSCAN-SWA MAIEGAAATVPLSPGERLNGLNHIAELRAKVFGLNIESELERFIKDMRNPRDINNEQNKRALAAIFFMAKIPAERHSISINELTTDEKRELIKAMNHFRAVVSLFPRRLTMPN >NZ_CP007598|4304485:4404431|4394798_4395656_-|WP_000104187.1|DBSCAN-SWA MSTILKWAGNKTTIMSELKKHLPAGPRLVEPFAGSCAVMMETDYPSYLVADINPDLINLYKKVAADCESFISRARVLFEIANREVAYYNIRQEFNYSTEITDFMKAVYFLYLNRHGYRGLCRYNKSGHFNIPYGNYKNPYFPEKEIRAFAEKAQRATFICASFDETLAMLKAGDVVYCDPPYDGTFSGYHTDGFTEDDQYHLASVLEHRSSEGHPVIVSNSDTSLIRSLYRNFTHHYIKAKRSIGVAAGESKSATEIIAVSGARCWVGFEPSRGVDSSAVYGVRA >NZ_CP007598|4304485:4404431|4311595_4312276_+|WP_000183642.1|tRNA|DBSCAN-SWA MTNNAVLQLRAERLARATRPFLARGNRVRRCQRCLLPLKSCLCDTLTPSQAKSRFCLVMFDTEPMKPSNTGRLIADILPDTAAFQWSRTEPPQALLELVQHPDYQPIVVFPASYAGEAREVISTPPAGKPPLFIMLDGTWPEARKMFRKSPYLDHLPVISVDLSRLSAYRLREIHAEGQYCTAEVAIALLDLAGDTEAATSLGEHFTRFKTRYLAGKTQHPGNVTA >NZ_CP007598|4304485:4404431|4397600_4398233_+|WP_000932273.1|DBSCAN-SWA 
MLNIRMGSDTGGKAAIERLLEAYGFTTKQALSEHLNVSKSTMANRVLRDSFPADWIIQCALETGVSLLWLATGQGSMKGGAEPEKSSHNENKQAIKPLSKLITPAIPKGTLENGQLSIDEEIFLDHSILPADYEESMFLETPTDCYLIDKSIKQVSNGFWLINIDGMIIVAKIMRIPGNKIVVNQDEASFECSTDDVEVIGRAVKVIKSI >NZ_CP007598|4304485:4404431|4309057_4309747_+|WP_000179978.1|DBSCAN-SWA MATELTWHDVLADEKQQPYFINTLHTVAGERQSGITVYPPQKDVFNAFRFTELGDVKVVILGQDPYHGPGQAHGLAFSVRPGIAPPPSLVNMYKELEASIPGFVRPAHGYLESWARQGVLLLNTVLTVRAGQAHSHASLGWETFTDKVISLINQHREGVVFLLWGSHAQKKGAIIDPQRHHILKAPHPSPLSAHRGFFGCNHFALTNQWLEQHGEKTIDWTPVLPAESE >NZ_CP007598|4304485:4404431|4338985_4339234_-|WP_000256453.1|DBSCAN-SWA MVTIRLARHGAKKRPFYQVVVTDSRNARNGRFIERVGFFNPIASEKEEGTRLDLDRIAHWVGQGATISDRVAALIKEVKKAA >NZ_CP007598|4304485:4404431|4371871_4372387_-|WP_001207651.1|tail|DBSCAN-SWA MALPRKLKHLNLFNDGNNWQGIVESLTLPKFTRKFEKYRGGGMPGAVDVDMGLDDGALDTEFSIGGTELLLFKQMGKATVDGIQLRFTGSIQRDDTGEVQAVELVVRGRHKEVDSGEWKTGESSSTKVSSTNSYAKLTINGEVLYEVDLVNMVEIVGGMDLMEEHRNALGL >NZ_CP007598|4304485:4404431|4337606_4338374_-|WP_000469804.1|tRNA|DBSCAN-SWA MFIGIVSLFPEMFRAITDYGVTGRAVKKGLLNIQSWSPRDFAHDRHRTVDDRPYGGGPGMLMMVQPLRDAIHAAKAAAGEGAKVIYLSPQGRKLDQAGVSELATNQKLILVCGRYEGVDERVIQTEIDEEWSIGDYVLSGGELPAMTLIDSVARFIPGVLGHEASAIEDSFADGLLDCPHYTRPEVLEGMEVPPVLLSGNHAEIRRWRLKQSLGRTWLRRPELLENLALTEEQARLLAEFKTEHAQQQHKHDGMA >NZ_CP007598|4304485:4404431|4381939_4382455_-|WP_001069923.1|DBSCAN-SWA MNPSIVKRCLVGAVLAIVATLPGFQSLHTSVEGLKLIADYEGCRLQPYQCSAGVWTDGIGNTSGVVPGKTITERQAAQGLITNVLRVERALDKCVVQPMPQKVYDAVVSFAFNVGTGNACSSTLVKLLNQRRWADACLQLPRWVYVKGVFNQGLDNRRAREMAWCLKGAGL >NZ_CP007598|4304485:4404431|4391802_4392036_-|WP_001217571.1|DBSCAN-SWA MRIEIMIDKEQKISQSTLDALESELYRNLRPLYPKTVIRICKGSSNGVELTGLQLDEERKQVMKIMQKVWEDDSWLH >NZ_CP007598|4304485:4404431|4331723_4332632_-|WP_000210984.1|DBSCAN-SWA 
MTTPHVLFDYAGHLPECPTWSEDESALYWTDILEQEIHRYHPASGTHSVLAFPEEVGCFALREQGGFIVAMRHAIWLADKNGLLQRKVCDNPSNTKLARFNDGGTDGDGRFYAGTFWAPGDYNGALLMRIDHDLTAKVIQCDIQGHNGLAFSPDNQWMYTSDTPNGVIYRTLLDKHGDPGKRELFRHFGEGEGLPDGAAMDSEGCYWSAMFDGWRVARFSPQGEQLEEYRLPVRCPTMVCFGGADMKTLFITTTRENMSAQEVADYPLSGAIFTLQVAVAGMKKSRFIERQAGSTGTTFSLG >NZ_CP007598|4304485:4404431|4392046_4392235_-|WP_001154433.1|DBSCAN-SWA MQDYFLESLKLQRIDFFLKLVAASECSDEEKGLALQWVSELTDELMAKIRTHEYNRSMDVIN |
91 | Salmonella_phage(76.36%) | capsid,tRNA,integrase,lysis,portal,transposase,terminase,head,tail,plate | attL 4322855:4322869|attR 4403336:4403350 |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those annotated with a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
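The protein records in the prophage region above carry their annotation in a pipe-delimited header, for example `NZ_CP007598|4304485:4404431|4377190_4377796_-|WP_001086804.1|tail|DBSCAN-SWA`. The following is a minimal sketch of how such a header could be split into fields. The field meanings (contig, prophage region, gene start/end/strand, protein accession, optional keyword, prediction tool) are inferred from the records on this page rather than from a published format description, and the names `ProphageProtein` and `parse_header` are hypothetical.

```python
from typing import NamedTuple, Optional


class ProphageProtein(NamedTuple):
    contig: str
    region_start: int
    region_end: int
    gene_start: int
    gene_end: int
    strand: str
    accession: str
    keyword: Optional[str]  # e.g. "tail", "capsid"; None when no keyword is present
    tool: str


def parse_header(header: str) -> ProphageProtein:
    """Split one pipe-delimited prophage protein header into named fields.

    Assumed layout (inferred from the records, not from a specification):
    contig|region_start:region_end|gene_start_gene_end_strand|accession[|keyword]|tool
    """
    fields = header.lstrip(">").strip().split("|")
    if len(fields) not in (5, 6):
        raise ValueError(f"unexpected number of fields: {len(fields)}")
    contig, region, gene, accession = fields[:4]
    tool = fields[-1]
    # The keyword field is only present for proteins flagged as phage-related.
    keyword = fields[4] if len(fields) == 6 else None
    region_start, region_end = (int(x) for x in region.split(":"))
    gene_start, gene_end, strand = gene.split("_")
    return ProphageProtein(
        contig, region_start, region_end,
        int(gene_start), int(gene_end), strand,
        accession, keyword, tool,
    )


tail = parse_header(
    ">NZ_CP007598|4304485:4404431|4377190_4377796_-|WP_001086804.1|tail|DBSCAN-SWA"
)
plain = parse_header(
    "NZ_CP007598|4304485:4404431|4395652_4395880_-|WP_000752613.1|DBSCAN-SWA"
)
```

Here `tail.keyword` is the string from the fifth field and `plain.keyword` is `None`. The sketch does not validate the keyword against the list in the legend, because the records also use values such as `lysis` that the legend does not spell out.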
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP007599_1 | 28446-28558 | Orphan |
NA
Consensus repeat of NZ_CP007599_1
|
2 spacers
spacers of NZ_CP007599_1
>1.1|28470|21|NZ_CP007599|CRISPRCasFinder AGCACCATAGACACACACAAT >1.2|28515|20|NZ_CP007599|CRISPRCasFinder AGCACACAAAAAACCTTGAC |
CRISPR arrays and Neighbor proteins around NZ_CP007599_1
The CRISPR arrays of NZ_CP007599_1 >merge|NZ_CP007599|1|28446-28558|CRISPRCasFinder GACACACAAAACGACACACACAATAGCACCATAGACACACACAATGTCACACAAAAGAACACACATAAAAGCACACAAAAAACCTTGACAACACACAAAACGACACACACAAT >NZ_CP007599|1|1|28446-28558|CRISPRCasFinder GACACACAAAACGACACACACAAT AGCACCATAGACACACACAAT GTCACACAAAAGAACACACATAAA AGCACACAAAAAACCTTGAC AACACACAAAACGACACACACAAT
>NZ_CP007599.1|WP_000963832.1|27955_28291_-|hypothetical-protein MAIERKNVISIRLTDEEYQPFKELLEHTDIGKSEFFRALILNRISELPVKPKPTTDYKRCLFLMNKTSNNLNQIAHRLNLDHNKGIISSSLYERALNTLINIRDLLQGALK >NZ_CP007599.1|WP_000586931.1|24761_27959_-|hypothetical-protein MIIRYGGGNHGIAEYLENGRKAEREYTRDELDHRFIIDGDLATTNKIIESIEDKGQERYLHITLSFAENEISEETLKNVTSDFRKMLMNAYHTDEYSFYAEAHLPKIKNIVDNKTGELVERKPHIHIVIPRTNLVTGNSLNPPGDLTNLKTQTQLDSVQEYLNNKYNLISPKDSVRVSDENYANVLSRVKGDLYRERNNEVKKEIFSRVTNENINSMDAFKNLLMQYGDVKIRNEGKTNQYFAVRLKDDKKYTNFKNPLFSSSFIEKRELPLVKPTPAQINNNVSTWLDKTSHEIKHIYSKSSKTREHYKALSEPEKKSFLKNRIDDYDKRYKLNTKNSERTQGRSCGNKFSIESTSRFSRTKTRVGVPRLPQRGLVYGIHGRGKPPESIRILQDNEQRNLANRMENRSHSDSAMRRNADRRFSERGIKNIGISTPLHEALFQKLNNDAERNELSTMAEIRKEIDPERFLSATAREFNVVPGEHAISKAKDGSPRFAVGNRNLNASDFLTKYINLSWSDAKSFLLKTYGEQLSEKTFEPVATIRKLSYEESRERFSSFKKTDISLRNIIRAEKKNMYNELREMRQQIYSLPKENRDVAKGVLVYKKITTLERLDDMYAAGRSFINQYHKDWNEDKNAMKAVDKLKNYLNKENENSISGAEIELSLEKAVQAQKRLQELQQTNTRLKDLVMDKQESKIVYRDQKTESPVFTDKGDFVVAGKNPSKEEIGIMLEYSKEKFGGVLKLTGSEEFKKECALVAAERDMNIILRPEKYQQMLLEHKAELESKAMQQTEQESQQQTQVDNEISKADVHQNQAPVEEAKQEIYLVSHTNVHEQNEGVIFHSKEAAYSYFEEDKKMAIEKYEANPYLGNGFEGALLVSKTIAASELSSYPEKLTAEFDKPYEILADSYKEYTRTPVYVVSFPKDELNQPVKKFESLSEAIEYKNNTAMLHDLDKREVIVKSVTREELGMMGEQLAIKEAEPVPRHELEKAQGRDVNEQDGLILDAIDRYQAKFEAEGLAFNRQETEAELLHHDFTREVAEDRLEQQFVQEKAEQQNQQQSEQER >NZ_CP007599.1|WP_001096505.1|24381_24684_+|hypothetical-protein MAKLPKKIDQEIKPLDLRQFVSSAAQKPASASAVTRINLTMTDDDLDTAQEFQNEYGASRAEVIRAALAALRMVSAEERRMLFNDIRKNSPKAGRPTVNK >NZ_CP007599.1|WP_000516225.1|23017_23677_+|peptidyl-arginine-deiminase MGKIITVAGHKGGIGKSTVLCSLCVCVIIKGKTACFLETDSQGSIKDFIEERKTNDRLSEIPYFECYTDIPAMARKLAARFDYVFVDTPGMKSPAFVKALSCADILFTFIEPGAGIEINTLGRLVFDIKTAQAGVNPSMKAWIVLNKCSTNPSDSEASELRRQLNDDPDWLPVPRQRIYMRTAHKKAYNGGMGVHEYDDKRGNKARGEIELLLKETGIL >NZ_CP007599.1|WP_000018884.1|22087_22903_+|tyrosine-type-recombinase/integrase 
MSHLPAKIFQISRSDAWNLTISPDAARNMRDLAAQIGAGMPKYLIAPEVSLLLSYIDDLSIRMYFETLWNTGARLNEALALKPADFMLTPTRQYPQPVVILKTLKQRIKEAERRPGRPTTRHPPLHPDPRYRKPPEPAVRLVPVLDAGYALRMREYLATWKKRIKHQPVWDITSRQTPLNWINTAVARAERDGVTFSVPVTPHTFRHSYAMHLTMSGVPPRVLQSLLGHRYARSTEIYTRVFSLDVLTGSGLSFTCDPQLAKQLLGYPDVP >NZ_CP007599.1|WP_000312658.1|21179_21428_-|hypothetical-protein MAWYIDSKNVKKNTNEKANAKYEKIWKPGEKPDHAGIYRCQACGYEDLINRQCNKLPPCSNCEKKAHKNNTWKLLVRAVDAE >NZ_CP007599.1|WP_000717995.1|20674_20968_+|hypothetical-protein MKKIFFKNDTVFPVNNFFVALTLVSSLLNSVGIFFPMISHSGNVIEMCFFVMFSVLLAAFSFVSTMGLANAIAPIKNAFPLLLLVIVFITNYNIYLR >NZ_CP007599.1|WP_001166027.1|20277_20646_+|transcriptional-regulator MQIKPIKPIKTEQDYEAALRAVEPMFDNEPPADTPEGDFFEIMCVLINEYEKKHYPIEAPDPVEAIKFRMEQQGLTVKDLEPAIGKSNRVYEVLNRKRNLTLPMIRKLHTMFGIPLKSLVGG >NZ_CP007599.1|WP_000691019.1|19969_20272_+|type-II-toxin-antitoxin-system-HigB-family-toxin MKIIAIKTLRDFWTANPDAEQPLKAWVDEASKAEWKSPAEIKEQYRSASILKNRRVVFNIKGNDYRLIVAIAYQRGWMFIKFIGTHKEYDKIDAETVSLE >NZ_CP007599.1|WP_000528103.1|18964_19480_+|hypothetical-protein MGNQLIMELKKLDSSFYQNNPVVLEALDFDAKTNSWIGGDKVRGHGIVQIQLHDLTFAIPVRSHIRHNDCYIIERDKGRNDIRGMGLDYSKAMLITDPTYVSADIFLLRNKKAAKDLLSKEAHVTKQFSQYVERYVEAVRKNDKNILRRDYRFSTLINYHAELGLTAPTTE >NZ_CP007599.1|WP_001190621.1|28636_29044_+|hypothetical-protein MQSIDSFLSALCEAIRNEKLKSKQAAFNLYLQEILEAKKTYTWEQICTYINQNTDSTLAVRAYRNMVERAKKKRLSQEKTNNESGVANKLPPQGDKSTQATKEPAIKGFFSQREKERKLEYDASATMAKFEDKYK >NZ_CP007599.1|WP_001097938.1|29053_29815_+|conjugal-transfer-protein-TraL MNTSINFILQGKGGVGKSFATSILSQYFIDEKQLENVVVADTDPVNTTTAKVKRLNAEIIKIVENNNIVQSKFDSMFESILEGGNINFVIDNGASTFLPLLQYFDDNCVMDMFNEVEQDVYIHTIIVGGQAQADTIEGFENIVKLVKGTKVKIIVWINEFQGEPILSGKHITETNFFEKNKDVIAGAILIKDRKSDAFDTDIKKLTANSMTLTEALESKEFGLMAKSRLKRVFNDVYVQLDAIYDPESVEANA >NZ_CP007599.1|WP_001279862.1|29811_30516_+|hypothetical-protein 
MSDIELQQPERAVHEAEDQINDFVVDVFKKTGVRISKDDPVLSLLFLHEKIQKKQSDLLKDDFTTLADAFKSVLSSLEEENIQRFRKIVDTCGDLDNEIKEAIEHGKNEINETAVLAKEKLNNEIIDIISLLRKNQEELNNSYKKSIAEFSKNTKPFSKSTAIALCVACTIGISAAFSGAFWYVAQSQKEQALQFYASGYMDMQKLTKETISLLPKEQQKIATAKLNALESRSR >NZ_CP007599.1|WP_000466317.1|30544_30982_-|type-II-toxin-antitoxin-system-HicB-family-antitoxin MFFSVGVETPKDDHTAYGITVPAFDRFDFGCVSAADTQSEIPVMAREAILAIVEEMVLSGSYSVDDIHDDGCLTYAANQDYSHCDSWFVIDVDLSEIEGKQQRINIALPDVLIRRIDGFVRESGGVYRDRSHFLAQAARHELSYK >NZ_CP007599.1|WP_000833471.1|31006_31189_-|type-II-toxin-antitoxin-system-HicA-family-toxin MKSADLLKELIAAGCELKRHKASSHQIWWSPITGKTFPVPHPKKDLPLGTVRSIRKMAGI >NZ_CP007599.1|WP_000545932.1|31621_31900_-|hypothetical-protein MHFTTFLKKHFDIEKIVGTSDSGHDTESIYVYEKGNDCEPLFILHESWINAEIKKSGIWTVGDIYSMLEHGKEYTEFELREMIKKGKVTSKY >NZ_CP007599.1|WP_000855123.1|31957_32287_-|hypothetical-protein MKTLTFNNGTVSVGDVFVSSWGYEQTNVNFYQVISVHGKKTVTVQEVRASVLLTRSMSGYKTPLLNDFCGEPLKRRVRDCYSVPAIEIESFEMAYKTQPEEKHEFTSYY >NZ_CP007599.1|WP_001704452.1|32725_32968_-|transcriptional-regulator MLTENTPENIKRLRKKIGLTQKECAEIFSMTPRTWRRKEEPVGTVSGTALTPVEFKYLQLLAGEHPEYVLCKRDKPKCSE >NZ_CP007599.1|WP_000868277.1|32957_33224_-|hypothetical-protein MKVRKGDRQYYLNKEGNMFHLVKRVKTFSKSATLGKTKATVKTVADLVFNEKAFDTIDFSRDGLRENDKEIVSMMIQEMNNERGKNVN >NZ_CP007599.1|WP_000749828.1|33274_33499_-|hypothetical-protein MKKSTFPVIVSTTGHAFSVARVTLCTICLKHEKTGKDYVVIFTDSNNIRDYKTGVVPCFGELYQEDVDLIVGKS |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP007599_2 | 35418-35526 | Orphan |
NA
Consensus repeat of NZ_CP007599_2
|
1 spacer
spacers of NZ_CP007599_2
>2.1|35449|47|NZ_CP007599|CRISPRCasFinder CAAAAAGGGACAGATTTATGGTTCTTGGCGTTTTTATCTGGGGACAA |
CRISPR arrays and Neighbor proteins around NZ_CP007599_2
The CRISPR arrays of NZ_CP007599_2
>merge|NZ_CP007599|2|35418-35526|CRISPRCasFinder
ATTTATGGTCAAAAAGGGACAGATTTATGGTCAAAAAGGGACAGATTTATGGTTCTTGGCGTTTTTATCTGGGGACAAATTTATGGTCAAAGAGGGACAGATTTATGGT
>NZ_CP007599|2|2|35418-35526|CRISPRCasFinder
ATTTATGGTCAAAAAGGGACAGATTTATGGT CAAAAAGGGACAGATTTATGGTTCTTGGCGTTTTTATCTGGGGACAA ATTTATGGTCAAAGAGGGACAGATTTATGGT
>NZ_CP007599.1|WP_000432875.1|33861_34185_-|hypothetical-protein MENYAKFVATEILNQLGGNCFIAMTGAKNFAYFDEDGECGLSFRLPSKFAMNGINLVKIKLTFSDTYQVTFSRVRGAMVKEVSTFDNVYCDQLECLFNEQTGLATRL >NZ_CP007599.1|WP_000866038.1|33594_33819_-|hypothetical-protein MKVLIGNINIRNHHMLLELAGIAGFAGSVEYTSEISASIDLMDDSFRRKVGISDSEILKMLEAFVENNFSIKLV >NZ_CP007599.1|WP_000749828.1|33274_33499_-|hypothetical-protein MKKSTFPVIVSTTGHAFSVARVTLCTICLKHEKTGKDYVVIFTDSNNIRDYKTGVVPCFGELYQEDVDLIVGKS >NZ_CP007599.1|WP_000868277.1|32957_33224_-|hypothetical-protein MKVRKGDRQYYLNKEGNMFHLVKRVKTFSKSATLGKTKATVKTVADLVFNEKAFDTIDFSRDGLRENDKEIVSMMIQEMNNERGKNVN >NZ_CP007599.1|WP_001704452.1|32725_32968_-|transcriptional-regulator MLTENTPENIKRLRKKIGLTQKECAEIFSMTPRTWRRKEEPVGTVSGTALTPVEFKYLQLLAGEHPEYVLCKRDKPKCSE >NZ_CP007599.1|WP_000855123.1|31957_32287_-|hypothetical-protein MKTLTFNNGTVSVGDVFVSSWGYEQTNVNFYQVISVHGKKTVTVQEVRASVLLTRSMSGYKTPLLNDFCGEPLKRRVRDCYSVPAIEIESFEMAYKTQPEEKHEFTSYY >NZ_CP007599.1|WP_000545932.1|31621_31900_-|hypothetical-protein MHFTTFLKKHFDIEKIVGTSDSGHDTESIYVYEKGNDCEPLFILHESWINAEIKKSGIWTVGDIYSMLEHGKEYTEFELREMIKKGKVTSKY >NZ_CP007599.1|WP_000833471.1|31006_31189_-|type-II-toxin-antitoxin-system-HicA-family-toxin MKSADLLKELIAAGCELKRHKASSHQIWWSPITGKTFPVPHPKKDLPLGTVRSIRKMAGI >NZ_CP007599.1|WP_000466317.1|30544_30982_-|type-II-toxin-antitoxin-system-HicB-family-antitoxin MFFSVGVETPKDDHTAYGITVPAFDRFDFGCVSAADTQSEIPVMAREAILAIVEEMVLSGSYSVDDIHDDGCLTYAANQDYSHCDSWFVIDVDLSEIEGKQQRINIALPDVLIRRIDGFVRESGGVYRDRSHFLAQAARHELSYK >NZ_CP007599.1|WP_001279862.1|29811_30516_+|hypothetical-protein MSDIELQQPERAVHEAEDQINDFVVDVFKKTGVRISKDDPVLSLLFLHEKIQKKQSDLLKDDFTTLADAFKSVLSSLEEENIQRFRKIVDTCGDLDNEIKEAIEHGKNEINETAVLAKEKLNNEIIDIISLLRKNQEELNNSYKKSIAEFSKNTKPFSKSTAIALCVACTIGISAAFSGAFWYVAQSQKEQALQFYASGYMDMQKLTKETISLLPKEQQKIATAKLNALESRSR >NZ_CP007599.1|WP_001208090.1|35867_36872_+|replication-initiation-protein 
MALRNNKRNDCNDVQSLLTQGNQLLEGAYDITLIEMRLLYLALTKIDSRKPQPASEYTLFAKEYRDAFSLDSKNCYEQLKSAASSLGSKPIVTYEWNETKKRIDAVKRFWFSSIRYGVGNSESDITLRFSDSVSQYLYELKSEFTQMNLEHMVKLDTPFSFRLYSWLYKYKNLSRNKKESGVISTDPITIEWMKERTGLTGKYPVYKDFKKRVLDPAVDIINANTNLSVTYEGIKSGKRIESVVFTYLVENETTHGKKMAVKPLRPRMPSRPRVVKGSLAEATWARNCLNVMRGYLVALKEYDHTLKLSSSDVAKVVVWCEIIGEPFDEDFWKK >NZ_CP007599.1|WP_000612441.1|37333_37672_-|helix-turn-helix-transcriptional-regulator MIPKRLKEAREAANFSQEKLAQLVDIESVNSRSRISNYESGRFTPPFEFIQKVAKVLDYPEGYFYTSDDDFAELLLLIHRGHSYNELKRSIKVINEAKILVEKLQDCLNTNN >NZ_CP007599.1|WP_000332248.1|37870_38158_+|helix-turn-helix-transcriptional-regulator MAYYENMRYDLLNKIFPDLTPVQAQCVLMYSFGMSSLEISGCVGVSRQMIDKNLHAAAKKMNVNNLIALKPAVVIGILLEVLASLPVKDDLTNED >NZ_CP007599.1|WP_001056371.1|38198_38432_-|EexN-family-lipoprotein MNMKKTMLIPLTALIFILTGCNEKVYDVDYYVNNIKEAEQMQKKCESGEVANQNCENARNALKQINRKKTISSMFAH >NZ_CP007599.1|WP_000844296.1|38444_39086_-|conjugal-transfer-protein-TrbJ MKSVYRYILFILSLFITHNTIAAIPVVDPASIAKTVEEGVTRAKEAAANLQQLKEQYEQTIKYAEEQKRRLEGFTDFSNGFDSAESYMKTRLSDLTDHSKENVNSLRDKYNLKSESNLAQTRYDSILKQIDFYEKFNKSLLERANRMQNLQNSFSHANTPQQKADLANQLNTEKLTLEMQIKQYDIAERQLASQAAAEYEQNRQSTISAMFKH |
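
The spacer coordinates listed for NZ_CP007599_2 can be cross-checked against the array layout. The sketch below is illustrative only. It assumes that the space-separated segments in the array record alternate repeat, spacer, repeat, and that coordinates are 1-based and inclusive. Both assumptions are inferred from the values on this page, not stated by the database, and the function name `spacer_coordinates` is hypothetical.

```python
def spacer_coordinates(array_start, segments):
    """Return (start, end, sequence) for every spacer in an array.

    Assumes segments alternate repeat, spacer, repeat, ... and that
    coordinates are 1-based and inclusive (an inference from the page).
    """
    coords = []
    pos = array_start
    for index, segment in enumerate(segments):
        if index % 2 == 1:  # odd-indexed segments are taken to be spacers
            coords.append((pos, pos + len(segment) - 1, segment))
        pos += len(segment)
    return coords


# Segments copied from the NZ_CP007599_2 array record (35418-35526).
segments = (
    "ATTTATGGTCAAAAAGGGACAGATTTATGGT "
    "CAAAAAGGGACAGATTTATGGTTCTTGGCGTTTTTATCTGGGGACAA "
    "ATTTATGGTCAAAGAGGGACAGATTTATGGT"
).split()

spacers = spacer_coordinates(35418, segments)
array_length = sum(len(s) for s in segments)

for start, end, seq in spacers:
    print(f"spacer at {start}-{end}, length {len(seq)}")
print("array length:", array_length)
```

Under these assumptions the single spacer lands on 35449-35495, which agrees with the spacer header `>2.1|35449|47|...`, and the summed segment lengths agree with the array span 35418-35526.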
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP007599_1 | 1.1|28470|21|NZ_CP007599|CRISPRCasFinder | 28470-28490 | 21 | NZ_CP038337 | Escherichia coli O157:H7 strain LSU61 plasmid pLSU61-2, complete sequence | 33096-33116 | 0 | 1.0 |
NZ_CP007599_1 | 1.1|28470|21|NZ_CP007599|CRISPRCasFinder | 28470-28490 | 21 | NZ_CP007599 | Salmonella enterica subsp. enterica serovar Enteritidis str. 77-1427 plasmid pCFSAN000111_01, complete sequence | 28470-28490 | 0 | 1.0 |
NZ_CP007599_1 | 1.1|28470|21|NZ_CP007599|CRISPRCasFinder | 28470-28490 | 21 | NZ_CP027450 | Escherichia coli strain 2014C-3097 plasmid unnamed1, complete sequence | 14997-15017 | 0 | 1.0 |
NZ_CP007599_1 | 1.2|28515|20|NZ_CP007599|CRISPRCasFinder | 28515-28534 | 20 | NZ_CP038409 | Escherichia coli O157:H7 strain 86-24 plasmid p86-24-2, complete sequence | 25476-25495 | 0 | 1.0 |
NZ_CP007599_1 | 1.2|28515|20|NZ_CP007599|CRISPRCasFinder | 28515-28534 | 20 | NZ_CP040110 | Escherichia coli O157:H7 strain MB9-1 plasmid pMB9_3, complete sequence | 34063-34082 | 0 | 1.0 |
NZ_CP007599_1 | 1.2|28515|20|NZ_CP007599|CRISPRCasFinder | 28515-28534 | 20 | NZ_CP008807 | Escherichia coli O157:H7 str. SS17 plasmid pSS17, complete sequence | 30209-30228 | 0 | 1.0 |
NZ_CP007599_1 | 1.2|28515|20|NZ_CP007599|CRISPRCasFinder | 28515-28534 | 20 | NZ_CP007599 | Salmonella enterica subsp. enterica serovar Enteritidis str. 77-1427 plasmid pCFSAN000111_01, complete sequence | 28515-28534 | 0 | 1.0 |
NZ_CP007599_1 | 1.2|28515|20|NZ_CP007599|CRISPRCasFinder | 28515-28534 | 20 | NZ_CP027450 | Escherichia coli strain 2014C-3097 plasmid unnamed1, complete sequence | 15042-15061 | 0 | 1.0 |
NZ_CP007599_1 | 1.2|28515|20|NZ_CP007599|CRISPRCasFinder | 28515-28534 | 20 | NC_011351 | Escherichia coli O157:H7 str. EC4115 plasmid pEC4115, complete sequence | 30213-30232 | 0 | 1.0 |
NZ_CP007599_1 | 1.2|28515|20|NZ_CP007599|CRISPRCasFinder | 28515-28534 | 20 | NZ_CP015845 | Escherichia coli O157:H7 strain FRIK2455 plasmid p35K, complete sequence | 30856-30875 | 0 | 1.0 |
NZ_CP007599_1 | 1.2|28515|20|NZ_CP007599|CRISPRCasFinder | 28515-28534 | 20 | NZ_CP042629 | Escherichia coli strain NCYU-25-82 plasmid pNCYU-25-82-2, complete sequence | 19632-19651 | 0 | 1.0 |
NZ_CP007599_1 | 1.2|28515|20|NZ_CP007599|CRISPRCasFinder | 28515-28534 | 20 | NZ_CP041625 | Escherichia coli O157:H7 strain ATCC 43888 plasmid p35K_like, complete sequence | 24061-24080 | 0 | 1.0 |
NZ_CP007599_1 | 1.2|28515|20|NZ_CP007599|CRISPRCasFinder | 28515-28534 | 20 | NZ_CP024283 | Escherichia albertii strain 2014C-4356 plasmid unnamed1, complete sequence | 22093-22112 | 0 | 1.0 |
NZ_CP007599_1 | 1.2|28515|20|NZ_CP007599|CRISPRCasFinder | 28515-28534 | 20 | NZ_CP038303 | Escherichia coli O157:H7 strain SS TX 313-1 plasmid pTX313-2, complete sequence | 33465-33484 | 0 | 1.0 |
NZ_CP007599_1 | 1.2|28515|20|NZ_CP007599|CRISPRCasFinder | 28515-28534 | 20 | NZ_CP038337 | Escherichia coli O157:H7 strain LSU61 plasmid pLSU61-2, complete sequence | 33052-33071 | 0 | 1.0 |
NZ_CP007599_1 | 1.2|28515|20|NZ_CP007599|CRISPRCasFinder | 28515-28534 | 20 | NZ_CP038317 | Escherichia coli O157:H7 strain NE92 plasmid pNE92-2, complete sequence | 34690-34709 | 0 | 1.0 |
NZ_CP007599_2 | 2.1|35449|47|NZ_CP007599|CRISPRCasFinder | 35449-35495 | 47 | NZ_CP042629 | Escherichia coli strain NCYU-25-82 plasmid pNCYU-25-82-2, complete sequence | 12677-12723 | 0 | 1.0 |
NZ_CP007599_2 | 2.1|35449|47|NZ_CP007599|CRISPRCasFinder | 35449-35495 | 47 | NZ_CP038337 | Escherichia coli O157:H7 strain LSU61 plasmid pLSU61-2, complete sequence | 26072-26118 | 0 | 1.0 |
NZ_CP007599_2 | 2.1|35449|47|NZ_CP007599|CRISPRCasFinder | 35449-35495 | 47 | NZ_CP007599 | Salmonella enterica subsp. enterica serovar Enteritidis str. 77-1427 plasmid pCFSAN000111_01, complete sequence | 35449-35495 | 0 | 1.0 |
NZ_CP007599_2 | 2.1|35449|47|NZ_CP007599|CRISPRCasFinder | 35449-35495 | 47 | NZ_KY446064 | Escherichia coli strain GD81 plasmid pGD81-1, complete sequence | 44873-44919 | 1 | 0.979 |
NZ_CP007599_2 | 2.1|35449|47|NZ_CP007599|CRISPRCasFinder | 35449-35495 | 47 | NZ_CP041625 | Escherichia coli O157:H7 strain ATCC 43888 plasmid p35K_like, complete sequence | 17106-17152 | 1 | 0.979 |
NZ_CP007599_2 | 2.1|35449|47|NZ_CP007599|CRISPRCasFinder | 35449-35495 | 47 | NZ_CP028594 | Escherichia coli strain 150 plasmid pTA150-2, complete sequence | 26813-26859 | 1 | 0.979 |
NZ_CP007599_2 | 2.1|35449|47|NZ_CP007599|CRISPRCasFinder | 35449-35495 | 47 | NZ_CP020924 | Salmonella enterica subsp. enterica strain 16A242 plasmid unnamed2, complete sequence | 30622-30668 | 1 | 0.979 |
NZ_CP007599_2 | 2.1|35449|47|NZ_CP007599|CRISPRCasFinder | 35449-35495 | 47 | NZ_CP028605 | Escherichia coli strain 144 plasmid pTA144-2, complete sequence | 29261-29307 | 1 | 0.979 |
NZ_CP007599_2 | 2.1|35449|47|NZ_CP007599|CRISPRCasFinder | 35449-35495 | 47 | NZ_CP038307 | Escherichia coli O157:H7 strain SS NE 1040-1 plasmid pNE1040-3, complete sequence | 27490-27536 | 1 | 0.979 |
NZ_CP007599_2 | 2.1|35449|47|NZ_CP007599|CRISPRCasFinder | 35449-35495 | 47 | NZ_CP038311 | Escherichia coli O157:H7 strain Show KS 470-1 plasmid pKS470-3, complete sequence | 29282-29328 | 1 | 0.979 |
NZ_CP007599_2 | 2.1|35449|47|NZ_CP007599|CRISPRCasFinder | 35449-35495 | 47 | NZ_CP038303 | Escherichia coli O157:H7 strain SS TX 313-1 plasmid pTX313-2, complete sequence | 28403-28449 | 1 | 0.979 |
NZ_CP007599_2 | 2.1|35449|47|NZ_CP007599|CRISPRCasFinder | 35449-35495 | 47 | NZ_AP018807 | Escherichia coli strain E2863 plasmid pE2863-5, complete sequence | 30676-30722 | 1 | 0.979 |
NZ_CP007599_2 | 2.1|35449|47|NZ_CP007599|CRISPRCasFinder | 35449-35495 | 47 | NZ_CP022051 | Escherichia coli O157 strain FDAARGOS_293 plasmid unnamed1, complete sequence | 11602-11648 | 1 | 0.979 |
NZ_CP007599_2 | 2.1|35449|47|NZ_CP007599|CRISPRCasFinder | 35449-35495 | 47 | NZ_CP038317 | Escherichia coli O157:H7 strain NE92 plasmid pNE92-2, complete sequence | 29628-29674 | 1 | 0.979 |
NZ_CP007599_2 | 2.1|35449|47|NZ_CP007599|CRISPRCasFinder | 35449-35495 | 47 | NZ_CP038409 | Escherichia coli O157:H7 strain 86-24 plasmid p86-24-2, complete sequence | 30507-30553 | 1 | 0.979 |
NZ_CP007599_2 | 2.1|35449|47|NZ_CP007599|CRISPRCasFinder | 35449-35495 | 47 | NZ_CP038367 | Escherichia coli O157:H7 strain F6667 plasmid pF6667-2, complete sequence | 23541-23587 | 1 | 0.979 |
NZ_CP007599_2 | 2.1|35449|47|NZ_CP007599|CRISPRCasFinder | 35449-35495 | 47 | CP044351 | Escherichia coli strain 194195 plasmid p194195_1, complete sequence | 26599-26645 | 1 | 0.979 |
NZ_CP007599_2 | 2.1|35449|47|NZ_CP007599|CRISPRCasFinder | 35449-35495 | 47 | NZ_CP015845 | Escherichia coli O157:H7 strain FRIK2455 plasmid p35K, complete sequence | 3084-3130 | 1 | 0.979 |
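
The Spacer_Info, Spacer_region and Identity columns in the table above are mutually consistent, and a short sketch can make the relationship explicit. It assumes the Spacer_Info field is laid out as `id|start|length|contig|tools` and that Identity is the fraction of matching positions rounded to three decimals. Both are inferences from the values shown, not documented behaviour of the database, and the names `SpacerInfo`, `parse_spacer_info` and `identity` are hypothetical.

```python
from typing import NamedTuple, Tuple


class SpacerInfo(NamedTuple):
    spacer_id: str
    start: int
    length: int
    contig: str
    tools: Tuple[str, ...]


def parse_spacer_info(field: str) -> SpacerInfo:
    """Split a Spacer_Info field such as '2.1|35449|47|NZ_CP007599|CRISPRCasFinder'."""
    spacer_id, start, length, contig, tools = field.strip().lstrip(">").split("|")
    return SpacerInfo(spacer_id, int(start), int(length), contig, tuple(tools.split(",")))


def spacer_region(info: SpacerInfo) -> Tuple[int, int]:
    """1-based inclusive region implied by start and length (assumed convention)."""
    return info.start, info.start + info.length - 1


def identity(spacer_length: int, mismatches: int) -> float:
    """Fraction of matching positions, rounded to three decimals (assumed definition)."""
    return round((spacer_length - mismatches) / spacer_length, 3)


info = parse_spacer_info("2.1|35449|47|NZ_CP007599|CRISPRCasFinder")
region = spacer_region(info)
one_mismatch = identity(info.length, 1)
perfect = identity(info.length, 0)
print(region, one_mismatch, perfect)
```

With these definitions, spacer 2.1 maps to region 35449-35495, and one mismatch over 47 positions gives 46/47, which rounds to the 0.979 reported in the table.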
1. spacer 1.1|28470|21|NZ_CP007599|CRISPRCasFinder matches NZ_CP038337 (Escherichia coli O157:H7 strain LSU61 plasmid pLSU61-2, complete sequence), position: 33096-33116, mismatch: 0, identity: 1.0
agcaccatagacacacacaat  CRISPR spacer
agcaccatagacacacacaat  Protospacer
*********************
2. spacer 1.1|28470|21|NZ_CP007599|CRISPRCasFinder matches NZ_CP007599 (Salmonella enterica subsp. enterica serovar Enteritidis str. 77-1427 plasmid pCFSAN000111_01, complete sequence), position: 28470-28490, mismatch: 0, identity: 1.0
agcaccatagacacacacaat  CRISPR spacer
agcaccatagacacacacaat  Protospacer
*********************
3. spacer 1.1|28470|21|NZ_CP007599|CRISPRCasFinder matches NZ_CP027450 (Escherichia coli strain 2014C-3097 plasmid unnamed1, complete sequence), position: 14997-15017, mismatch: 0, identity: 1.0
agcaccatagacacacacaat  CRISPR spacer
agcaccatagacacacacaat  Protospacer
*********************
4. spacer 1.2|28515|20|NZ_CP007599|CRISPRCasFinder matches NZ_CP038409 (Escherichia coli O157:H7 strain 86-24 plasmid p86-24-2, complete sequence), position: 25476-25495, mismatch: 0, identity: 1.0
agcacacaaaaaaccttgac  CRISPR spacer
agcacacaaaaaaccttgac  Protospacer
********************
5. spacer 1.2|28515|20|NZ_CP007599|CRISPRCasFinder matches NZ_CP040110 (Escherichia coli O157:H7 strain MB9-1 plasmid pMB9_3, complete sequence), position: 34063-34082, mismatch: 0, identity: 1.0
agcacacaaaaaaccttgac  CRISPR spacer
agcacacaaaaaaccttgac  Protospacer
********************
6. spacer 1.2|28515|20|NZ_CP007599|CRISPRCasFinder matches NZ_CP008807 (Escherichia coli O157:H7 str. SS17 plasmid pSS17, complete sequence), position: 30209-30228, mismatch: 0, identity: 1.0
agcacacaaaaaaccttgac  CRISPR spacer
agcacacaaaaaaccttgac  Protospacer
********************
7. spacer 1.2|28515|20|NZ_CP007599|CRISPRCasFinder matches NZ_CP007599 (Salmonella enterica subsp. enterica serovar Enteritidis str. 77-1427 plasmid pCFSAN000111_01, complete sequence), position: 28515-28534, mismatch: 0, identity: 1.0
agcacacaaaaaaccttgac  CRISPR spacer
agcacacaaaaaaccttgac  Protospacer
********************
8. spacer 1.2|28515|20|NZ_CP007599|CRISPRCasFinder matches NZ_CP027450 (Escherichia coli strain 2014C-3097 plasmid unnamed1, complete sequence), position: 15042-15061, mismatch: 0, identity: 1.0
agcacacaaaaaaccttgac  CRISPR spacer
agcacacaaaaaaccttgac  Protospacer
********************
9. spacer 1.2|28515|20|NZ_CP007599|CRISPRCasFinder matches NC_011351 (Escherichia coli O157:H7 str. EC4115 plasmid pEC4115, complete sequence), position: 30213-30232, mismatch: 0, identity: 1.0
agcacacaaaaaaccttgac  CRISPR spacer
agcacacaaaaaaccttgac  Protospacer
********************
10. spacer 1.2|28515|20|NZ_CP007599|CRISPRCasFinder matches NZ_CP015845 (Escherichia coli O157:H7 strain FRIK2455 plasmid p35K, complete sequence), position: 30856-30875, mismatch: 0, identity: 1.0
agcacacaaaaaaccttgac  CRISPR spacer
agcacacaaaaaaccttgac  Protospacer
********************
11. spacer 1.2|28515|20|NZ_CP007599|CRISPRCasFinder matches NZ_CP042629 (Escherichia coli strain NCYU-25-82 plasmid pNCYU-25-82-2, complete sequence), position: 19632-19651, mismatch: 0, identity: 1.0
agcacacaaaaaaccttgac  CRISPR spacer
agcacacaaaaaaccttgac  Protospacer
********************
12. spacer 1.2|28515|20|NZ_CP007599|CRISPRCasFinder matches NZ_CP041625 (Escherichia coli O157:H7 strain ATCC 43888 plasmid p35K_like, complete sequence), position: 24061-24080, mismatch: 0, identity: 1.0
agcacacaaaaaaccttgac  CRISPR spacer
agcacacaaaaaaccttgac  Protospacer
********************
13. spacer 1.2|28515|20|NZ_CP007599|CRISPRCasFinder matches NZ_CP024283 (Escherichia albertii strain 2014C-4356 plasmid unnamed1, complete sequence), position: 22093-22112, mismatch: 0, identity: 1.0
agcacacaaaaaaccttgac  CRISPR spacer
agcacacaaaaaaccttgac  Protospacer
********************
14. spacer 1.2|28515|20|NZ_CP007599|CRISPRCasFinder matches NZ_CP038303 (Escherichia coli O157:H7 strain SS TX 313-1 plasmid pTX313-2, complete sequence), position: 33465-33484, mismatch: 0, identity: 1.0
agcacacaaaaaaccttgac  CRISPR spacer
agcacacaaaaaaccttgac  Protospacer
********************
15. spacer 1.2|28515|20|NZ_CP007599|CRISPRCasFinder matches NZ_CP038337 (Escherichia coli O157:H7 strain LSU61 plasmid pLSU61-2, complete sequence), position: 33052-33071, mismatch: 0, identity: 1.0
agcacacaaaaaaccttgac  CRISPR spacer
agcacacaaaaaaccttgac  Protospacer
********************
16. spacer 1.2|28515|20|NZ_CP007599|CRISPRCasFinder matches NZ_CP038317 (Escherichia coli O157:H7 strain NE92 plasmid pNE92-2, complete sequence), position: 34690-34709, mismatch: 0, identity: 1.0
agcacacaaaaaaccttgac  CRISPR spacer
agcacacaaaaaaccttgac  Protospacer
********************
17. spacer 2.1|35449|47|NZ_CP007599|CRISPRCasFinder matches NZ_CP042629 (Escherichia coli strain NCYU-25-82 plasmid pNCYU-25-82-2, complete sequence), position: 12677-12723, mismatch: 0, identity: 1.0
caaaaagggacagatttatggttcttggcgtttttatctggggacaa  CRISPR spacer
caaaaagggacagatttatggttcttggcgtttttatctggggacaa  Protospacer
***********************************************
18. spacer 2.1|35449|47|NZ_CP007599|CRISPRCasFinder matches NZ_CP038337 (Escherichia coli O157:H7 strain LSU61 plasmid pLSU61-2, complete sequence), position: 26072-26118, mismatch: 0, identity: 1.0
caaaaagggacagatttatggttcttggcgtttttatctggggacaa  CRISPR spacer
caaaaagggacagatttatggttcttggcgtttttatctggggacaa  Protospacer
***********************************************
19. spacer 2.1|35449|47|NZ_CP007599|CRISPRCasFinder matches NZ_CP007599 (Salmonella enterica subsp. enterica serovar Enteritidis str. 77-1427 plasmid pCFSAN000111_01, complete sequence), position: 35449-35495, mismatch: 0, identity: 1.0
caaaaagggacagatttatggttcttggcgtttttatctggggacaa  CRISPR spacer
caaaaagggacagatttatggttcttggcgtttttatctggggacaa  Protospacer
***********************************************
20. spacer 2.1|35449|47|NZ_CP007599|CRISPRCasFinder matches NZ_KY446064 (Escherichia coli strain GD81 plasmid pGD81-1, complete sequence), position: 44873-44919, mismatch: 1, identity: 0.979
caaaaagggacagatttatggttcttggcgtttttatctggggacaa  CRISPR spacer
taaaaagggacagatttatggttcttggcgtttttatctggggacaa  Protospacer
.**********************************************
21. spacer 2.1|35449|47|NZ_CP007599|CRISPRCasFinder matches NZ_CP041625 (Escherichia coli O157:H7 strain ATCC 43888 plasmid p35K_like, complete sequence), position: 17106-17152, mismatch: 1, identity: 0.979
caaaaagggacagatttatggttcttggcgtttttatctggggacaa  CRISPR spacer
taaaaagggacagatttatggttcttggcgtttttatctggggacaa  Protospacer
.**********************************************
22. spacer 2.1|35449|47|NZ_CP007599|CRISPRCasFinder matches NZ_CP028594 (Escherichia coli strain 150 plasmid pTA150-2, complete sequence), position: 26813-26859, mismatch: 1, identity: 0.979
caaaaagggacagatttatggttcttggcgtttttatctggggacaa  CRISPR spacer
taaaaagggacagatttatggttcttggcgtttttatctggggacaa  Protospacer
.**********************************************
23. spacer 2.1|35449|47|NZ_CP007599|CRISPRCasFinder matches NZ_CP020924 (Salmonella enterica subsp. enterica strain 16A242 plasmid unnamed2, complete sequence), position: 30622-30668, mismatch: 1, identity: 0.979
caaaaagggacagatttatggttcttggcgtttttatctggggacaa  CRISPR spacer
taaaaagggacagatttatggttcttggcgtttttatctggggacaa  Protospacer
.**********************************************
24. spacer 2.1|35449|47|NZ_CP007599|CRISPRCasFinder matches NZ_CP028605 (Escherichia coli strain 144 plasmid pTA144-2, complete sequence), position: 29261-29307, mismatch: 1, identity: 0.979
caaaaagggacagatttatggttcttggcgtttttatctggggacaa  CRISPR spacer
taaaaagggacagatttatggttcttggcgtttttatctggggacaa  Protospacer
.**********************************************
25. spacer 2.1|35449|47|NZ_CP007599|CRISPRCasFinder matches NZ_CP038307 (Escherichia coli O157:H7 strain SS NE 1040-1 plasmid pNE1040-3, complete sequence), position: 27490-27536, mismatch: 1, identity: 0.979
caaaaagggacagatttatggttcttggcgtttttatctggggacaa  CRISPR spacer
taaaaagggacagatttatggttcttggcgtttttatctggggacaa  Protospacer
.**********************************************
26. spacer 2.1|35449|47|NZ_CP007599|CRISPRCasFinder matches NZ_CP038311 (Escherichia coli O157:H7 strain Show KS 470-1 plasmid pKS470-3, complete sequence), position: 29282-29328, mismatch: 1, identity: 0.979
caaaaagggacagatttatggttcttggcgtttttatctggggacaa  CRISPR spacer
taaaaagggacagatttatggttcttggcgtttttatctggggacaa  Protospacer
.**********************************************
27. spacer 2.1|35449|47|NZ_CP007599|CRISPRCasFinder matches NZ_CP038303 (Escherichia coli O157:H7 strain SS TX 313-1 plasmid pTX313-2, complete sequence), position: 28403-28449, mismatch: 1, identity: 0.979
caaaaagggacagatttatggttcttggcgtttttatctggggacaa  CRISPR spacer
taaaaagggacagatttatggttcttggcgtttttatctggggacaa  Protospacer
.**********************************************
28. spacer 2.1|35449|47|NZ_CP007599|CRISPRCasFinder matches NZ_AP018807 (Escherichia coli strain E2863 plasmid pE2863-5, complete sequence), position: 30676-30722, mismatch: 1, identity: 0.979
caaaaagggacagatttatggttcttggcgtttttatctggggacaa  CRISPR spacer
taaaaagggacagatttatggttcttggcgtttttatctggggacaa  Protospacer
.**********************************************
29. spacer 2.1|35449|47|NZ_CP007599|CRISPRCasFinder matches NZ_CP022051 (Escherichia coli O157 strain FDAARGOS_293 plasmid unnamed1, complete sequence), position: 11602-11648, mismatch: 1, identity: 0.979
caaaaagggacagatttatggttcttggcgtttttatctggggacaa  CRISPR spacer
taaaaagggacagatttatggttcttggcgtttttatctggggacaa  Protospacer
.**********************************************
30. spacer 2.1|35449|47|NZ_CP007599|CRISPRCasFinder matches NZ_CP038317 (Escherichia coli O157:H7 strain NE92 plasmid pNE92-2, complete sequence), position: 29628-29674, mismatch: 1, identity: 0.979
caaaaagggacagatttatggttcttggcgtttttatctggggacaa  CRISPR spacer
taaaaagggacagatttatggttcttggcgtttttatctggggacaa  Protospacer
.**********************************************
31. spacer 2.1|35449|47|NZ_CP007599|CRISPRCasFinder matches NZ_CP038409 (Escherichia coli O157:H7 strain 86-24 plasmid p86-24-2, complete sequence), position: 30507-30553, mismatch: 1, identity: 0.979
caaaaagggacagatttatggttcttggcgtttttatctggggacaa  CRISPR spacer
taaaaagggacagatttatggttcttggcgtttttatctggggacaa  Protospacer
.**********************************************
32. spacer 2.1|35449|47|NZ_CP007599|CRISPRCasFinder matches NZ_CP038367 (Escherichia coli O157:H7 strain F6667 plasmid pF6667-2, complete sequence), position: 23541-23587, mismatch: 1, identity: 0.979
caaaaagggacagatttatggttcttggcgtttttatctggggacaa  CRISPR spacer
taaaaagggacagatttatggttcttggcgtttttatctggggacaa  Protospacer
.**********************************************
33. spacer 2.1|35449|47|NZ_CP007599|CRISPRCasFinder matches CP044351 (Escherichia coli strain 194195 plasmid p194195_1, complete sequence), position: 26599-26645, mismatch: 1, identity: 0.979
caaaaagggacagatttatggttcttggcgtttttatctggggacaa  CRISPR spacer
taaaaagggacagatttatggttcttggcgtttttatctggggacaa  Protospacer
.**********************************************
34. spacer 2.1|35449|47|NZ_CP007599|CRISPRCasFinder matches NZ_CP015845 (Escherichia coli O157:H7 strain FRIK2455 plasmid p35K, complete sequence), position: 3084-3130, mismatch: 1, identity: 0.979
caaaaagggacagatttatggttcttggcgtttttatctggggacaa  CRISPR spacer
taaaaagggacagatttatggttcttggcgtttttatctggggacaa  Protospacer
.**********************************************
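
The match line under each spacer/protospacer pair above marks identical positions with `*` and differing positions with `.`. A minimal sketch of how such a line could be derived is given below. It assumes an ungapped, position-by-position comparison of two equal-length sequences, and the function name `match_line` is hypothetical. How the database actually generates these lines is not stated on this page.

```python
def match_line(spacer: str, protospacer: str) -> str:
    """Return '*' for each identical position and '.' for each mismatch.

    Assumes an ungapped comparison of equal-length sequences.
    """
    if len(spacer) != len(protospacer):
        raise ValueError("sequences must have the same length")
    return "".join(
        "*" if a.lower() == b.lower() else "."
        for a, b in zip(spacer, protospacer)
    )


# Spacer 2.1 and the one-mismatch protospacer shown in entries 20-34.
spacer = "caaaaagggacagatttatggttcttggcgtttttatctggggacaa"
protospacer = "taaaaagggacagatttatggttcttggcgtttttatctggggacaa"

line = match_line(spacer, protospacer)
mismatches = line.count(".")

print(spacer, " CRISPR spacer")
print(protospacer, " Protospacer")
print(line)
print("mismatches:", mismatches)
```

Counting the `.` characters recovers the mismatch value reported for each hit, here a single mismatch at the first position.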
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|