Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP019416 | Salmonella enterica subsp. enterica serovar Nitra strain S-1687 chromosome, complete genome | 2 crisprs | WYL,DinG,DEDDh,cas3,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK | 0 | 15 | 8 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP019416_1 | 2967086-2967602 | TypeI-E |
I-E
Consensus repeat of NZ_CP019416_1
|
8 spacers
spacers of NZ_CP019416_1
>1.1|2967115|32|NZ_CP019416|CRISPRCasFinder,CRT TATTTATAAGCGTGTCATCTATGCAACCCAAC >1.2|2967176|32|NZ_CP019416|CRISPRCasFinder,CRT ACCTGCCCGACCCAATAAGGGGGCCCTCGTGA >1.3|2967237|32|NZ_CP019416|CRISPRCasFinder,CRT GGCCGCTGGTCAAATTCCCAATCTGAGCAATC >1.4|2967298|32|NZ_CP019416|CRISPRCasFinder,CRT ATAGCCCCGGCAGCGATAGCTAAACCAGTTCC >1.5|2967359|32|NZ_CP019416|CRISPRCasFinder,CRT GCCTCAAAATCTCTCGGTGAGATGTAAGCGTC >1.6|2967420|32|NZ_CP019416|CRISPRCasFinder,CRT ACCAGTGGTCAGCGGCGGATGAATTTGCCCTG >1.7|2967481|32|NZ_CP019416|CRISPRCasFinder,CRT GAGAATGCTCATGCGCGTGAGCGCCATATATT >1.8|2967542|32|NZ_CP019416|CRISPRCasFinder,CRT AGGCGGACCGAAAAACCGTTTTCAGCCAACGT >1.9|2967117|32|NZ_CP019416|PILER-CR TATTTATAAGCGTGTCATCTATGCAACCCAAC >1.10|2967178|32|NZ_CP019416|PILER-CR ACCTGCCCGACCCAATAAGGGGGCCCTCGTGA >1.11|2967239|32|NZ_CP019416|PILER-CR GGCCGCTGGTCAAATTCCCAATCTGAGCAATC >1.12|2967300|32|NZ_CP019416|PILER-CR ATAGCCCCGGCAGCGATAGCTAAACCAGTTCC >1.13|2967361|32|NZ_CP019416|PILER-CR GCCTCAAAATCTCTCGGTGAGATGTAAGCGTC >1.14|2967422|32|NZ_CP019416|PILER-CR ACCAGTGGTCAGCGGCGGATGAATTTGCCCTG >1.15|2967483|32|NZ_CP019416|PILER-CR GAGAATGCTCATGCGCGTGAGCGCCATATATT >1.16|2967544|32|NZ_CP019416|PILER-CR AGGCGGACCGAAAAACCGTTTTCAGCCAACGT |
cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3 |
CRISPR arrays and Neighbor proteins around NZ_CP019416_1
The CRISPR arrays of NZ_CP019416_1 >merge|NZ_CP019416|1|2967086-2967602|CRISPRCasFinder,CRT,PILER-CR GTGTTTATCCCCGCTGACGCGGGGAACACTATTTATAAGCGTGTCATCTATGCAACCCAACCGGTTTATCCCCGCTGGCGCGGGGAACACACCTGCCCGACCCAATAAGGGGGCCCTCGTGACGGTTTATCCCCGCTGGCGCGGGGAACACGGCCGCTGGTCAAATTCCCAATCTGAGCAATCCGGTTTATCCCCGCTGGCGCGGGGAACACATAGCCCCGGCAGCGATAGCTAAACCAGTTCCCGGTTTATCCCCGCTGGCGCGGGGAACACGCCTCAAAATCTCTCGGTGAGATGTAAGCGTCCGGTTTATCCCCGCTGGCGCGGGGAACACACCAGTGGTCAGCGGCGGATGAATTTGCCCTGCGGTTTATCCCCGCTGGCGCGGGGAACACGAGAATGCTCATGCGCGTGAGCGCCATATATTCGGTTTATCCCCGCTGGCGCGGGGAACACAGGCGGACCGAAAAACCGTTTTCAGCCAACGTCGGTTTATCCCCGCTGGCGCGGGGAACAC >NZ_CP019416|1|1|2967086-2967602|CRISPRCasFinder GTGTTTATCCCCGCTGACGCGGGGAACAC TATTTATAAGCGTGTCATCTATGCAACCCAAC CGGTTTATCCCCGCTGGCGCGGGGAACAC ACCTGCCCGACCCAATAAGGGGGCCCTCGTGA CGGTTTATCCCCGCTGGCGCGGGGAACAC GGCCGCTGGTCAAATTCCCAATCTGAGCAATC CGGTTTATCCCCGCTGGCGCGGGGAACAC ATAGCCCCGGCAGCGATAGCTAAACCAGTTCC CGGTTTATCCCCGCTGGCGCGGGGAACAC GCCTCAAAATCTCTCGGTGAGATGTAAGCGTC CGGTTTATCCCCGCTGGCGCGGGGAACAC ACCAGTGGTCAGCGGCGGATGAATTTGCCCTG CGGTTTATCCCCGCTGGCGCGGGGAACAC GAGAATGCTCATGCGCGTGAGCGCCATATATT CGGTTTATCCCCGCTGGCGCGGGGAACAC AGGCGGACCGAAAAACCGTTTTCAGCCAACGT CGGTTTATCCCCGCTGGCGCGGGGAACAC >NZ_CP019416|1|1|2967086-2967602|CRT GTGTTTATCCCCGCTGACGCGGGGAACAC TATTTATAAGCGTGTCATCTATGCAACCCAAC CGGTTTATCCCCGCTGGCGCGGGGAACAC ACCTGCCCGACCCAATAAGGGGGCCCTCGTGA CGGTTTATCCCCGCTGGCGCGGGGAACAC GGCCGCTGGTCAAATTCCCAATCTGAGCAATC CGGTTTATCCCCGCTGGCGCGGGGAACAC ATAGCCCCGGCAGCGATAGCTAAACCAGTTCC CGGTTTATCCCCGCTGGCGCGGGGAACAC GCCTCAAAATCTCTCGGTGAGATGTAAGCGTC CGGTTTATCCCCGCTGGCGCGGGGAACAC ACCAGTGGTCAGCGGCGGATGAATTTGCCCTG CGGTTTATCCCCGCTGGCGCGGGGAACAC GAGAATGCTCATGCGCGTGAGCGCCATATATT CGGTTTATCCCCGCTGGCGCGGGGAACAC AGGCGGACCGAAAAACCGTTTTCAGCCAACGT CGGTTTATCCCCGCTGGCGCGGGGAACAC >NZ_CP019416|1|1|2967088-2967602|PILER-CR GTTTATCCCCGCTGACGCGGGGAACACTA TTTATAAGCGTGTCATCTATGCAACCCAACCG GTTTATCCCCGCTGGCGCGGGGAACACAC CTGCCCGACCCAATAAGGGGGCCCTCGTGACG GTTTATCCCCGCTGGCGCGGGGAACACGG CCGCTGGTCAAATTCCCAATCTGAGCAATCCG GTTTATCCCCGCTGGCGCGGGGAACACAT AGCCCCGGCAGCGATAGCTAAACCAGTTCCCG GTTTATCCCCGCTGGCGCGGGGAACACGC CTCAAAATCTCTCGGTGAGATGTAAGCGTCCG GTTTATCCCCGCTGGCGCGGGGAACACAC CAGTGGTCAGCGGCGGATGAATTTGCCCTGCG GTTTATCCCCGCTGGCGCGGGGAACACGA GAATGCTCATGCGCGTGAGCGCCATATATTCG GTTTATCCCCGCTGGCGCGGGGAACACAG GCGGACCGAAAAACCGTTTTCAGCCAACGTCG GTTTATCCCCGCTGGCGCGGGGAACAC
>NZ_CP019416.1|WP_000490481.1|2966024_2967071_+|aminopeptidase MFSATRRFAVILALGVGFILPAQAASPGPGEIANTQARHIATFFPGRMTGSPAEMLSADYLRQQFTQMGYQSDIRTFNSRFIYTTKDNRKNWHNVTGSTVIAAHEGRVPQQIIIMAHLDTYAPQSDADVDANLGGLTLQGMDDNAAGLGVMLELAARLKDIPTHYGIRFIATSGEEEGKLGAENLLKRMSDAEKKNTLLVINLDNLIVGDKLYFNSGKNTPEAVRTLTRDRALAIARRYGIAANTNPGRNPSYPKGTGCCNDAEVFDKAGISVLSVEATNWNLGKKDGYQQRVKNASFPNGNSWHDVRLDNQQHIDKALPGRIERRSRDVVRIMLPLVKELAKAEKTS >NZ_CP019416.1|WP_000372384.1|2964865_2965774_-|sulfate-adenylyltransferase-subunit-CysD MDQKRLTHLRQLEAESIHIIREVAAEFANPVMLYSIGKDSSVMLHLARKAFYPGTLPFPLLHVDTGWKFREMYAFRDRTANAYGCELLVHKNPEGVAMGINPFVHGSAKHTDIMKTEGLKQALNKYGFDAAFGGARRDEEKSRAKERIYSFRDRFHRWDPKNQRPELWRNYNGQINKGESIRVFPLSNWTEQDIWQYIWLENIDIVPLYLAAERPVLERDGMLMMVDDDRIDLQPGEVIKKRMVRFRTLGCWPLTGAVESHAQTLPEIIEEMLVSTTSERQGRMIDRDQAGSMELKKRQGYF >NZ_CP019416.1|WP_076731084.1|2963416_2964856_-|sulfate-adenylyltransferase-subunit-CysN MNTILAQQIANEGGVEAWMIAQQHKSLLRFLTCGSVDDGKSTLIGRLLHDTLQIYEDQLSSLHNDSKRHGTQGEKLDLALLVDGLQAEREQGITIDVAYRYFSTEKRKFIIADTPGHEQYTRNMATGASTCDLAILLIDARKGVLDQTRRHSFISTLLGIKHLVVAINKMDLVDYREETFARIREDYLTFAEQLPGDLDIRFVPLSALEGDNVAAQSANMRWYSGPTLLEVLETVDIQRAVDRQPMRFPVQYVNRPNLDFRGYAGTLASGSVKVGERIKVLPSGVESSVARIVTFDGDKEEACAGEAITLVLNDDIDISRGDLLLAANETLAPARHAAIDVVWMGDQPLAPGQSYDVKLAGKKTRARIEAICYQIDINNLTQRDVESLPLNGIGLVEMTFDEPLALDIYQQNPVTGGLIFIDRLSNVTVGAGMVRELDERGATPSVEYSAFELELNALVRRHFPHWDARDLLGDKHGAA >NZ_CP019416.1|WP_001173663.1|2962824_2963430_-|adenylyl-sulfate-kinase MALHDENVVWHSHPVTVAAREQLHGHRGVVLWFTGLSGSGKSTVAGALEEALHQRGVSTYLLDGDNVRHGLCRDLGFSDADRQENIRRVGEVASLMADAGLIVLTAFISPHRAERQLVKERVGHDRFIEIYVNTPLAICEQRDPKGLYKKARAGELRNFTGIDAIYEAPDSPQVHLNGEQLVTNLVSQLLDLLRRRDIIRS >NZ_CP019416.1|WP_001683461.1|2962450_2962774_-|DUF3561-family-protein MRNSHNITFTRSDAFMVDDDATCAFPGAVVGFVSWLLALGIPFLLYGPNTLFFFLYTWPFFLALMPVSVIIGIALHLLVKGKILFSIMFTLLAVGALFGALFIWLLG >NZ_CP019416.1|WP_000517480.1|2961948_2962260_-|cell-division-protein-FtsB MGKLTLLLLALLVWLQYSLWFGKNGIHDYSRVNDDVVAQQATNAKLKARNDQLFAEIDDLNGGQEAIEERARNELSMTKPGETFYRLVPDASKRAATAGQTHR >NZ_CP019416.1|WP_000741653.1|2961219_2961930_-|2-C-methyl-D-erythritol-4-phosphate-cytidylyltransferase MAATLLDVCAVVPAAGFGRRMQTECPKQYLSIGNKTILEHSVHALLAHPRVTRVVIAISPGDHRFAQLPLANHPQITVVDGGNERADSVLAGLQAVAKAQWVLVHDAARPCLHQDDLARLLTISENSRVGGILASPVRDTMKRGEPGKNAIAHTVERADLWHALTPQFFPRELLHDCLTRALNEGATITDEASALEYCGFHPALVEGRADNIKVTRPEDLALAEFYLTRTIHQEKA >NZ_CP019416.1|WP_001219244.1|2960740_2961220_-|2-C-methyl-D-erythritol-2,4-cyclodiphosphate-synthase MRIGHGFDVHAFGGEGPIIIGGVRIPYEKGLLAHSDGDVALHALTDALLGAAALGDIGKLFPDTDPAFKGADSRELLREAWRRIQAKGYTLGNVDVTIIAQAPKMLPHIPQMRVFIAEDLGCHMDDVNVKATTTEKLGFTGRGEGIACEAVALLMKAAK >NZ_CP019416.1|WP_000134246.1|2959694_2960744_-|tRNA-pseudouridine(13)-synthase-TruD MTEFDNLTWLHGKPQGSGLLKANPEDFVVVEDLGFTPDGEGEHILLRILKNGCNTRFVADALAKFLKIHAREVSFAGQKDKHAVTEQWLCARVPGKEMPDFSAFQLEGCKVLEYARHKRKLRLGALKGNAFTLVLREISDRRDVETRLQAIRDGGVPNYFGAQRFGIGGSNLQGALRWAQSNAPVRDRNKRSFWLSAARSALFNQIVHQRLKKPDFNQVVDGDALQLAGRGSWFVATSEELPELQRRVDEKELMITASLPGSGEWGTQRAALAFEQDAIAQETVLQSLLLREKVEASRRAMLLYPQQLSWNWWDDVTVELRFWLPAGSFATSVVRELINTMGDYAHIAE >NZ_CP019416.1|WP_076731083.1|2958952_2959714_-|5'/3'-nucleotidase-SurE MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFDNGDIAVQMGTPTDCVYLGVNALMRPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLNGYQHYDTAGAVTCALLRGLSREPLRTGRILNVNVPDLPLAQIKGIRVTRCGSRHPADKVIPQEDPRGNTLYWIGPPGDKYDAGPDTDFAAVDEGYVSVTPLHVDLTAHSAHDVVSDWLDSVGVGTQW >NZ_CP019416.1|WP_001518648.1|2967698_2967992_-|type-I-E-CRISPR-associated-endoribonuclease-Cas2 MSMVVVVTENVPPRLRGRLAVWLLEVRAGVYVGDTSKRIREMIWQQITQLGGVGNVVMAWATNTESGFEFQTWGENRRIPVDLDGLRLVSFLPVENQ >NZ_CP019416.1|WP_000144832.1|2967991_2968912_-|type-I-E-CRISPR-associated-endonuclease-Cas1 MTFVPLNPIPLKDRTSMIFLQYGQIDVLDGAFVLIDKTGIRTHIPVGSVACIMLEPGTRVSHAAVRLASTVGTLLVWVGEAGVRVYSSGQPGGARADKLLYQAKLALDDDLRLKVVRKMYELRFREPPPARRSVEQLRGIEGSRVRATYALLAKQYGVKWHGRNYDPKDWEKGDVVNRCISAATSCLYGISEAAILAAGYAPAIGFIHSGKPLSFVYDIADIIKFESVVPKAFEIAARHPAEPDKEVRLACRDIFRSSKLTGKLIPLIEEVLAAGEIEPPQPAPDMLPPAIPEPESLGDSGHRGHG >NZ_CP019416.1|WP_076731085.1|2968908_2969559_-|type-I-E-CRISPR-associated-protein-Cas6/Cse3/CasE MYLSRITLHTSELSPAQLLHLVERGEYVMHQWLWDLFPGSKERQFLYRREELQGAFRFFVLSQEQPAASAIFDVQTRPFAPTLSAGQTLRFNLRANPTVCKNGKRHDLLMEAKRQRKTQGDSQDIWSYQQQAALEWLARQGEQNGFTLREASVDAYRQQQIRREKSRQMIQFSSVDYTGVLVINEPALFLQRLAQGFGKSRAFGCGMMMIKPGDDA >NZ_CP019416.1|WP_076731086.1|2969540_2970287_-|type-I-E-CRISPR-associated-protein-Cas5/CasD MSQYLVFQLHGPMASWGVDAPGEVRHSHELPSRSALLGLLAAALGIRRDEEERLNTFNRHYQFLLCSSGNPRWARDYHTVQMPKEVRKARYFSRREELQDPELLSALISRRDYYTDAWWMIAVSATPDAPYTLAQLQAALQHPVFPLYLGRKSHPLALPLAPQLLDGRAPDVLREAYRWYQDQFNTLKLTLPGLQNECWWEGEHDGLTANKILRRRDMPLSRQQWLFGERSVNQGPWLRKEDACISQE >NZ_CP019416.1|WP_076731087.1|2970297_2971356_-|type-I-E-CRISPR-associated-protein-Cas7/Cse4/CasC MTTFIQLHLLTAYPAANLNRDDTGAPKTVVLGGATRLRISSQSLKRAWRTSELFEQALAGHIGIRTGRIAREAAQILVDSGIDAKKAVEYVKNIANCFGKVKEDKKPKDELTNAETEQLVHISPAEFEAVKALARRLAEEKRPATEEEAELLRHDRMAVDIAMFGRMLAKKTDFNVEAACQVAHAFGVSETIIEDDFFTAVDDLRQASAEDAGAGHLGETGFGSALFYTYICIDKDLLVKNLNDNEELANKTLRAFTEAALKVSPTGKQNSFASRAYASWALAEKGTDQPRSLAAAFYEPINGTDQLNVAVKRITALHENMNEVYAQETAFKNFNVMNQQGSMKDVLDFICA >NZ_CP019416.1|WP_076731088.1|2971369_2971924_-|type-I-E-CRISPR-associated-protein-Cse2/CasB MSVVTKDDKATLRQWHEELQEKRGLRASLRRSKTVNDACLAEGLHSLLMQTHSLWKNKAPWNVTALAITAALAAHIKFIDEQKSFAAQLGQKKGGDTPVMSKLRFSHLLAVKTPDELLRQLRRAVKLLDGSVNLFSLADDIFCWCQEQNDLLNHHRRQQRPTEFLRIRWALEYYQAGDTDNEQD >NZ_CP019416.1|WP_000368582.1|2971920_2973477_-|type-I-E-CRISPR-associated-protein-Cse1/CasA MDNFSLLTTPWLPVRFKDGSTGKLAPVDLADENVVDIAATRADLQGAAWQFLLGLLQCSIAPKRYKNWEDIWFDGLHADVLHKALAPLEHAFQFGAESPSFMQDFEPLSGEKVSIASLLPEIPGAQTTKFNKDHFVKRGVTERFCPHCAALALFSLQLNAPAGGKGYRTGLRGGGPLTTLVELQEYQGERQTPLWRKLWLNVMPQDTADLPLPDQCDATVFPWLAATRTSEQANAVTTPEQVNKLQAYWGMPRRIRLDFATLQSGCCDICGAESDELLGFMTVKNYGVNYDGWRHPLTPYRAPVKDQNAFFSVKPQPGGLIWRDWLGLSQNNQTEANYESPAQVVKVFNARSLTDVKAGIWGFGADFDNMKIRCWYEHHFPLLMTEGLIPDLRKAVQTAARLLSLLRSALKEAWFANAKDARGDFSFIDIDFWNLTQGRFLNLIHDLENGHKPDERLNKWQRELWLFTRHYFDDHVFTNPYESSDLERIMTARKKYFTTSAEKQSAKAAKAKKQEAAE >NZ_CP019416.1|WP_076731089.1|2973488_2976152_-|CRISPR-associated-helicase/endonuclease-Cas3 MSIYHYWGKSRRGETNGGDDYHLLCWHSLDVAAVGYWMVINNIYFIDHYLKKLGIQDKEQAAQFFAWILCWHDIGKFAHSFQQLYRHEALNIFNEPTRHYEKIAHTTLGYMLWNSWLSECPELFPPSSLSARKSKRVMALWMPVTTGHHGRPPDAIQELDHFRQQDKDAARDFLLRIKALFPLITLPEAWDEDEGIDQFQQLSWFISAAVVLADWTGSASRYFPRTAEKMPVDTYWQQALAKAQTAITLFPSAANVSAFTGIETLFPFIQHPTPLQQKALELDINVDGAQLFILEDVTGAGKTEAALILAHRLMAAGKAQGLYFGLPTMATANAMFERMANTWLALYQPDSRPSLILAHSARRLMDRFNQSIWSVTLSGTEEPDEAQPYSQGCAAWFADSNKKALLAEVGVGTLDQAMMAVMPFKHNNLRLLGLSNKILLADEIHACDAWMSRILEGLIERQASNGNATILLSATLSQQQRDKLVAAFSRGVRRSVQAPLLGHDDYPWLTQVTQTELISQRVDTRKEVERSVDIGWLHSEEACLERIGEAVEKGNCIAWIRNSVDDAIRIYRQLQLSKVVATENLLLFHSRFAFHDRQRIESQTLNLFGKQSGAQRAGKVIIATQVIEQSLDIDCDEMISDLAPVDLLIQRAGRLQRHIRDRNGLVKKSGQDERETPVLRILAPEWDDAPRENWLSSAMRNSAYVYPDHGRMWLTQRILREQGTIRMPQSARLLIESVYGEDVNMPVGFAKTEQLQEGKFYCDRAFAGQMLLNFAPGYCAEISDSLPEKMSTRLAEESVTLWLAKIVDSVVTPYASGEHAWEMSVLRVRQSWWNKHKDEFEKLDGEPLRKWCAQQHQDKDFATVIVVTDFAACGYSANEGLIGMMGE >NZ_CP019416.1|WP_073998915.1|2976595_2977549_+|SPI-1-type-III-secretion-system-effector-SopD MPVTLSFGNHQNYTLNESRLAHLLSADKEKAIHMGGWDKVQDHFRAEKKDHALEVLHSIIHGQGRGEPGEMEVNVEDINKIYAFKRLQHLACPAHQDLFTIKMDASQTQFLLMVGDTVISQSNIKDILNISDDAVIESMSREERQLFLQICEVIGAKMTWHPELLQESISTLRKEVTGNAQIKAAVYEMMRPAEAPDHPLVEWQDLLTADEKSMLACINAGNFEPTTQFCKIGYQEVQGEVAFSMMHPCISYLLHTYSPFAEFKPTNSGFLKKLNQDYNDYHAKKMFIDVILEKLYLTHERSLHIGKDGCSRNILLV >NZ_CP019416.1|WP_000080390.1|2977636_2978371_-|phosphoadenosine-phosphosulfate-reductase MSQLDLNALNELPKVDRVLALAETNAQLETLTAEERVAWALENLPGEYVLSSSFGIQAAVSLHLVNQIRPDIPVILTDTGYLFPETYQFIDELTDKLKLNLKVYRAGESPAWQEARYGKLWEQGVEGIEKYNEINKVEPMNRALKELKAQTWFAGLRREQSGSRAHLPVLAIQRGVFKVLPIIDWDNRTVYQYLQKHGLKYHPLWDQGYLSVGDTHTTRKWEPGMAEEETRFFGLKRECGLHEG |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP019416_2 | 2983754-2984393 | TypeI-E |
I-E
Consensus repeat of NZ_CP019416_2
|
10 spacers
spacers of NZ_CP019416_2
>2.1|2983783|32|NZ_CP019416|CRISPRCasFinder,CRT GGCTACACGCAAAAATTCCAGTCGTTGGCGCA >2.2|2983844|32|NZ_CP019416|CRISPRCasFinder,CRT CCGATTAAGATCCGCAGTCTGCATCAGTAACT >2.3|2983905|32|NZ_CP019416|CRISPRCasFinder,CRT CGATTCTACGGCAACAGGCCAGGCTGCGACCG >2.4|2983966|32|NZ_CP019416|CRISPRCasFinder,CRT ATCAAACATGGAAACCCCTTTAATGAGAGCAA >2.5|2984027|33|NZ_CP019416|CRISPRCasFinder,CRT TCAGGAACGCGCGGCGGAAGAGCTTGGTGTTTG >2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT GCTGCCTTTCCCGGAGTTCCGGCCCCTAAATT >2.7|2984150|32|NZ_CP019416|CRISPRCasFinder,CRT TCATGCGCTATAAAAATCAGACTGTCACATGC >2.8|2984211|32|NZ_CP019416|CRISPRCasFinder,CRT TGATTATTGACGACAACAGCACAGACCGGCAG >2.9|2984272|32|NZ_CP019416|CRISPRCasFinder,CRT AATAATCGGCAATTTGTCCTGGACAGGCACGG >2.10|2984333|32|NZ_CP019416|CRISPRCasFinder,CRT GAATCTGGAGGCCAACAGCGCGGCGAAATCCT >2.11|2983844|34|NZ_CP019416|PILER-CR CCGATTAAGATCCGCAGTCTGCATCAGTAACTCG >2.12|2983905|34|NZ_CP019416|PILER-CR CGATTCTACGGCAACAGGCCAGGCTGCGACCGCG >2.13|2983966|34|NZ_CP019416|PILER-CR ATCAAACATGGAAACCCCTTTAATGAGAGCAACG >2.14|2984027|35|NZ_CP019416|PILER-CR TCAGGAACGCGCGGCGGAAGAGCTTGGTGTTTGCG >2.15|2984089|34|NZ_CP019416|PILER-CR GCTGCCTTTCCCGGAGTTCCGGCCCCTAAATTGG >2.16|2984150|34|NZ_CP019416|PILER-CR TCATGCGCTATAAAAATCAGACTGTCACATGCCG >2.17|2984211|34|NZ_CP019416|PILER-CR TGATTATTGACGACAACAGCACAGACCGGCAGCA >2.18|2984272|34|NZ_CP019416|PILER-CR AATAATCGGCAATTTGTCCTGGACAGGCACGGCA >2.19|2984333|34|NZ_CP019416|PILER-CR GAATCTGGAGGCCAACAGCGCGGCGAAATCCTCA |
cas3,cas8e,cse2gr11,cas7 |
CRISPR arrays and Neighbor proteins around NZ_CP019416_2
The CRISPR arrays of NZ_CP019416_2 >merge|NZ_CP019416|2|2983754-2984393|CRISPRCasFinder,CRT,PILER-CR ACGGCTATCCTTGTTGGCGCGGGGAACACGGCTACACGCAAAAATTCCAGTCGTTGGCGCACGGTTTATCCCCGCTGGCGCGGGGAACACCCGATTAAGATCCGCAGTCTGCATCAGTAACTCGGTTTATCCCCGCTGGCGAGGGGAACACCGATTCTACGGCAACAGGCCAGGCTGCGACCGCGGTTTATCCCCGCTGGCGCGGGGAACACATCAAACATGGAAACCCCTTTAATGAGAGCAACGGTTTATCCCCGCTGGCGCGGGGAACACTCAGGAACGCGCGGCGGAAGAGCTTGGTGTTTGCGGTTTATCCCCGCTGGCGCGGGGAACACGCTGCCTTTCCCGGAGTTCCGGCCCCTAAATTGGGTTTATCCCCGCTGGCGCGGGGAACACTCATGCGCTATAAAAATCAGACTGTCACATGCCGGTTTATCCCCGCTGGCGCGGGGAACACTGATTATTGACGACAACAGCACAGACCGGCAGCAGTTTATCCCCGCTGGCGCGGGGAACACAATAATCGGCAATTTGTCCTGGACAGGCACGGCAGTTTATCCCCGCTGGCGCGGGGAACACGAATCTGGAGGCCAACAGCGCGGCGAAATCCTCAGTTTATCCCCGCTGGCGCGGGGAACAC >NZ_CP019416|2|2|2983754-2984393|CRISPRCasFinder ACGGCTATCCTTGTTGGCGCGGGGAACAC GGCTACACGCAAAAATTCCAGTCGTTGGCGCA CGGTTTATCCCCGCTGGCGCGGGGAACAC CCGATTAAGATCCGCAGTCTGCATCAGTAACT CGGTTTATCCCCGCTGGCGAGGGGAACAC CGATTCTACGGCAACAGGCCAGGCTGCGACCG CGGTTTATCCCCGCTGGCGCGGGGAACAC ATCAAACATGGAAACCCCTTTAATGAGAGCAA CGGTTTATCCCCGCTGGCGCGGGGAACAC TCAGGAACGCGCGGCGGAAGAGCTTGGTGTTTG CGGTTTATCCCCGCTGGCGCGGGGAACAC GCTGCCTTTCCCGGAGTTCCGGCCCCTAAATT GGGTTTATCCCCGCTGGCGCGGGGAACAC TCATGCGCTATAAAAATCAGACTGTCACATGC CGGTTTATCCCCGCTGGCGCGGGGAACAC TGATTATTGACGACAACAGCACAGACCGGCAG CAGTTTATCCCCGCTGGCGCGGGGAACAC AATAATCGGCAATTTGTCCTGGACAGGCACGG CAGTTTATCCCCGCTGGCGCGGGGAACAC GAATCTGGAGGCCAACAGCGCGGCGAAATCCT CAGTTTATCCCCGCTGGCGCGGGGAACAC >NZ_CP019416|2|2|2983754-2984393|CRT ACGGCTATCCTTGTTGGCGCGGGGAACAC GGCTACACGCAAAAATTCCAGTCGTTGGCGCA CGGTTTATCCCCGCTGGCGCGGGGAACAC CCGATTAAGATCCGCAGTCTGCATCAGTAACT CGGTTTATCCCCGCTGGCGAGGGGAACAC CGATTCTACGGCAACAGGCCAGGCTGCGACCG CGGTTTATCCCCGCTGGCGCGGGGAACAC ATCAAACATGGAAACCCCTTTAATGAGAGCAA CGGTTTATCCCCGCTGGCGCGGGGAACAC TCAGGAACGCGCGGCGGAAGAGCTTGGTGTTTG CGGTTTATCCCCGCTGGCGCGGGGAACAC GCTGCCTTTCCCGGAGTTCCGGCCCCTAAATT GGGTTTATCCCCGCTGGCGCGGGGAACAC TCATGCGCTATAAAAATCAGACTGTCACATGC CGGTTTATCCCCGCTGGCGCGGGGAACAC TGATTATTGACGACAACAGCACAGACCGGCAG CAGTTTATCCCCGCTGGCGCGGGGAACAC AATAATCGGCAATTTGTCCTGGACAGGCACGG CAGTTTATCCCCGCTGGCGCGGGGAACAC GAATCTGGAGGCCAACAGCGCGGCGAAATCCT CAGTTTATCCCCGCTGGCGCGGGGAACAC >NZ_CP019416|2|2|2983817-2984393|PILER-CR GTTTATCCCCGCTGGCGCGGGGAACAC CCGATTAAGATCCGCAGTCTGCATCAGTAACTCG GTTTATCCCCGCTGGCGAGGGGAACAC CGATTCTACGGCAACAGGCCAGGCTGCGACCGCG GTTTATCCCCGCTGGCGCGGGGAACAC ATCAAACATGGAAACCCCTTTAATGAGAGCAACG GTTTATCCCCGCTGGCGCGGGGAACAC TCAGGAACGCGCGGCGGAAGAGCTTGGTGTTTGCG GTTTATCCCCGCTGGCGCGGGGAACAC GCTGCCTTTCCCGGAGTTCCGGCCCCTAAATTGG GTTTATCCCCGCTGGCGCGGGGAACAC TCATGCGCTATAAAAATCAGACTGTCACATGCCG GTTTATCCCCGCTGGCGCGGGGAACAC TGATTATTGACGACAACAGCACAGACCGGCAGCA GTTTATCCCCGCTGGCGCGGGGAACAC AATAATCGGCAATTTGTCCTGGACAGGCACGGCA GTTTATCCCCGCTGGCGCGGGGAACAC GAATCTGGAGGCCAACAGCGCGGCGAAATCCTCA GTTTATCCCCGCTGGCGCGGGGAACAC
>NZ_CP019416.1|WP_001208011.1|2982857_2983655_-|MBL-fold-metallo-hydrolase MALRIRVLLENHKGAGADKSLKVRPGLSLLVEDESTSILFDTGPDGSFMQNALAMGIDLSDVSAVVLSHGHYDHCGGVPWLPDNSRIICHPDIARERYAAMTFLGITRKIKKLSCEVDYSRYRMMYTRDPLPIGENFIWSGEIPVVAPEAYGIFGGHDAEPDSILDEGVLIYQSTKGLVIITGCGHRGIANIVRHCQNITGIKRIYALVGGFHLRCASPFTLWRVRRFLQEHKPEKLCGCHCTGAWGRLWLPEITAPATGDVLRF >NZ_CP019416.1|WP_000108313.1|2982407_2982770_+|6-carboxytetrahydropterin-synthase-QueD MSTTLYKDFTFEAAHRLPHVPEGHKCGRLHGHSFMVRLEITGEVDPHTGWIMDFADLKAAFKPTYDRLDHYYLNDIPGLSNPTSEVLAKWIWDQVKPVVPLLSAVMVKETCTAGCVYRGE >NZ_CP019416.1|WP_076731092.1|2980184_2981984_-|NADPH-dependent-assimilatory-sulfite-reductase-flavoprotein-subunit MTTPAPLTGLLPLNPEQLARLQAATTDLTPEQLAWVSGYFWGVLNPRSGAVAVTPAPEGKMPGVTLISASQTGNARRVAEALRDDLLAANLNVTLVNAGDYKFKQIASEKLLVIVTSTQGEGEPPEEAVALHKFLFTKKAPKLENTAFAVFSLGDTSYEFFCQSGKDFDSKLAELGGERLLDRVDADVEYQAAASEWRARVVDVLKSRAPVAAPSQSVATGAVNDIHTSPYTKDAPLTATLSVNQKITGRNSEKDVRHIEIDLGDSGLRYQPGDALGVWYQNDPALVKELVELLWLKGDEPVTVDGKTLPLAEALEWHFELTVNTANIVENYATLTRSESLLPLVGDKAQLQHYAATTPIVDMVRFSPAQLDAEALIGLLRPLTPRLYSIASAQAEVESEVHVTVGVVRYDIEGRARAGGASSFLADRVEEEGEVRVFIEHNDNFRLPANPQTPVIMIGPGTGIAPFRAFMQQRAADGAEGKNWLFFGNPHFTEDFLYQVEWQRYVKEGVLSRIDLAWSRDQKEKIYVQDKLREQGAELWRWINDGAHIYVCGDARRMAADVEKALLEVIAEFGGMDLESADEYLSELRVERRYQRDVY >NZ_CP019416.1|WP_076731091.1|2978472_2980185_-|assimilatory-sulfite-reductase-(NADPH)-hemoprotein-subunit MSEKHPGPLVVEGKLSDAERMKLESNYLRGTIAEDLNDGLTGGFKGDNFLLIRFHGMYQQDDRDIRAERAEQKLEPRHAMLLRCRLPGGVITTTQWQAIDKFAADNTIYGSIRLTNRQTFQFHGILKKNVKPVHQMLHSVGLDALATANDMNRNVLCTSNPYESQLHAEAYEWAKKISEHLLPRTRAYAEIWLDQEKVAITDEEPILGQTYLPRKFKTTVVIPPQNDIDLHANDMNFVAIAENGKLVGFNLLVGGGLSIEHGNKKTYARTASEFGYLPLEHTLAVAEAVVTTQRDWGNRTDRKNAKTKYTLERVGLETFKAEVERRAGIKFEPIRPYEFTGRGDRIGWVKGIDNNWHLTLFIENGRILDYPGRPLKTGLLEIAKIHQGEFRITANQNLIIASVPESQKAKIETLARDHGLMNAVSAQRENSMACVSFPTCPLAMAEAERFLPSFTDKVEAILEKHGIPDEHIVMRVTGCPNGCGRAMLAEIGLVGKAPGRYNLHLGGNRIGTRIPRMYQENITEPDILASLGELIGRWAKEREAGEGFGDFTVRAGIIRPVLDPARDFWE >NZ_CP019416.1|WP_000080390.1|2977636_2978371_-|phosphoadenosine-phosphosulfate-reductase MSQLDLNALNELPKVDRVLALAETNAQLETLTAEERVAWALENLPGEYVLSSSFGIQAAVSLHLVNQIRPDIPVILTDTGYLFPETYQFIDELTDKLKLNLKVYRAGESPAWQEARYGKLWEQGVEGIEKYNEINKVEPMNRALKELKAQTWFAGLRREQSGSRAHLPVLAIQRGVFKVLPIIDWDNRTVYQYLQKHGLKYHPLWDQGYLSVGDTHTTRKWEPGMAEEETRFFGLKRECGLHEG >NZ_CP019416.1|WP_073998915.1|2976595_2977549_+|SPI-1-type-III-secretion-system-effector-SopD MPVTLSFGNHQNYTLNESRLAHLLSADKEKAIHMGGWDKVQDHFRAEKKDHALEVLHSIIHGQGRGEPGEMEVNVEDINKIYAFKRLQHLACPAHQDLFTIKMDASQTQFLLMVGDTVISQSNIKDILNISDDAVIESMSREERQLFLQICEVIGAKMTWHPELLQESISTLRKEVTGNAQIKAAVYEMMRPAEAPDHPLVEWQDLLTADEKSMLACINAGNFEPTTQFCKIGYQEVQGEVAFSMMHPCISYLLHTYSPFAEFKPTNSGFLKKLNQDYNDYHAKKMFIDVILEKLYLTHERSLHIGKDGCSRNILLV >NZ_CP019416.1|WP_076731089.1|2973488_2976152_-|CRISPR-associated-helicase/endonuclease-Cas3 MSIYHYWGKSRRGETNGGDDYHLLCWHSLDVAAVGYWMVINNIYFIDHYLKKLGIQDKEQAAQFFAWILCWHDIGKFAHSFQQLYRHEALNIFNEPTRHYEKIAHTTLGYMLWNSWLSECPELFPPSSLSARKSKRVMALWMPVTTGHHGRPPDAIQELDHFRQQDKDAARDFLLRIKALFPLITLPEAWDEDEGIDQFQQLSWFISAAVVLADWTGSASRYFPRTAEKMPVDTYWQQALAKAQTAITLFPSAANVSAFTGIETLFPFIQHPTPLQQKALELDINVDGAQLFILEDVTGAGKTEAALILAHRLMAAGKAQGLYFGLPTMATANAMFERMANTWLALYQPDSRPSLILAHSARRLMDRFNQSIWSVTLSGTEEPDEAQPYSQGCAAWFADSNKKALLAEVGVGTLDQAMMAVMPFKHNNLRLLGLSNKILLADEIHACDAWMSRILEGLIERQASNGNATILLSATLSQQQRDKLVAAFSRGVRRSVQAPLLGHDDYPWLTQVTQTELISQRVDTRKEVERSVDIGWLHSEEACLERIGEAVEKGNCIAWIRNSVDDAIRIYRQLQLSKVVATENLLLFHSRFAFHDRQRIESQTLNLFGKQSGAQRAGKVIIATQVIEQSLDIDCDEMISDLAPVDLLIQRAGRLQRHIRDRNGLVKKSGQDERETPVLRILAPEWDDAPRENWLSSAMRNSAYVYPDHGRMWLTQRILREQGTIRMPQSARLLIESVYGEDVNMPVGFAKTEQLQEGKFYCDRAFAGQMLLNFAPGYCAEISDSLPEKMSTRLAEESVTLWLAKIVDSVVTPYASGEHAWEMSVLRVRQSWWNKHKDEFEKLDGEPLRKWCAQQHQDKDFATVIVVTDFAACGYSANEGLIGMMGE >NZ_CP019416.1|WP_000368582.1|2971920_2973477_-|type-I-E-CRISPR-associated-protein-Cse1/CasA MDNFSLLTTPWLPVRFKDGSTGKLAPVDLADENVVDIAATRADLQGAAWQFLLGLLQCSIAPKRYKNWEDIWFDGLHADVLHKALAPLEHAFQFGAESPSFMQDFEPLSGEKVSIASLLPEIPGAQTTKFNKDHFVKRGVTERFCPHCAALALFSLQLNAPAGGKGYRTGLRGGGPLTTLVELQEYQGERQTPLWRKLWLNVMPQDTADLPLPDQCDATVFPWLAATRTSEQANAVTTPEQVNKLQAYWGMPRRIRLDFATLQSGCCDICGAESDELLGFMTVKNYGVNYDGWRHPLTPYRAPVKDQNAFFSVKPQPGGLIWRDWLGLSQNNQTEANYESPAQVVKVFNARSLTDVKAGIWGFGADFDNMKIRCWYEHHFPLLMTEGLIPDLRKAVQTAARLLSLLRSALKEAWFANAKDARGDFSFIDIDFWNLTQGRFLNLIHDLENGHKPDERLNKWQRELWLFTRHYFDDHVFTNPYESSDLERIMTARKKYFTTSAEKQSAKAAKAKKQEAAE >NZ_CP019416.1|WP_076731088.1|2971369_2971924_-|type-I-E-CRISPR-associated-protein-Cse2/CasB MSVVTKDDKATLRQWHEELQEKRGLRASLRRSKTVNDACLAEGLHSLLMQTHSLWKNKAPWNVTALAITAALAAHIKFIDEQKSFAAQLGQKKGGDTPVMSKLRFSHLLAVKTPDELLRQLRRAVKLLDGSVNLFSLADDIFCWCQEQNDLLNHHRRQQRPTEFLRIRWALEYYQAGDTDNEQD >NZ_CP019416.1|WP_076731087.1|2970297_2971356_-|type-I-E-CRISPR-associated-protein-Cas7/Cse4/CasC MTTFIQLHLLTAYPAANLNRDDTGAPKTVVLGGATRLRISSQSLKRAWRTSELFEQALAGHIGIRTGRIAREAAQILVDSGIDAKKAVEYVKNIANCFGKVKEDKKPKDELTNAETEQLVHISPAEFEAVKALARRLAEEKRPATEEEAELLRHDRMAVDIAMFGRMLAKKTDFNVEAACQVAHAFGVSETIIEDDFFTAVDDLRQASAEDAGAGHLGETGFGSALFYTYICIDKDLLVKNLNDNEELANKTLRAFTEAALKVSPTGKQNSFASRAYASWALAEKGTDQPRSLAAAFYEPINGTDQLNVAVKRITALHENMNEVYAQETAFKNFNVMNQQGSMKDVLDFICA >NZ_CP019416.1|WP_001199961.1|2984689_2985361_-|7-carboxy-7-deazaguanine-synthase-QueE MQYPINEMFQTLQGEGYFTGVPAIFIRLQGCPVGCAWCDTKHTWDKLSDREVSLFSILAKTKESDKWGAASSEDLLAVINRQGYTARHVVITGGEPCIHDLMPLTDLLEKSGFSCQIETSGTHEVRCTPNTWVTVSPKVNMRGGYDVLSQALERANEIKHPVGRVRDIEALDELLATLSDDKPRVIALQPISQKEDATRLCIETCIARNWRLSMQTHKYLNIA >NZ_CP019416.1|WP_000036734.1|2985496_2986795_-|phosphopyruvate-hydratase MSKIVKVIGREIIDSRGNPTVEAEVHLEGGFVGMAAAPSGASTGSREALELRDGDKSRFLGKGVTKAVGAVNGPIAQAILGKDAKDQAGIDKIMIDLDGTENKSNFGANAILAVSLANAKAAAAAKGMPLYEHIAELNGTPGKYSMPVPMMNIINGGEHADNNVDIQEFMIQPVGAKTVKEAIRMGSEVFHHLAKVLKGKGMNTAVGDEGGYAPNLGSNAEALAVIAEAVKAAGYELGKDITLAMDCAASEFYKDGKYVLAGEGNKAFTSEEFTHFLEELTKQYPIVSIEDGLDESDWDGFAYQTKVLGDKIQLVGDDLFVTNTKILKEGIEKGIANSILIKFNQIGSLTETLAAIKMAKDAGYTAVISHRSGETEDATIADLAVGTAAGQIKTGSMSRSDRVAKYNQLIRIEEALGEKAPYNGRKEIKGQA >NZ_CP019416.1|WP_000210863.1|2986877_2988515_-|CTP-synthase-(glutamine-hydrolyzing) MTTNYIFVTGGVVSSLGKGIAAASLAAILEARGLNVTIMKLDPYINVDPGTMSPIQHGEVFVTEDGAETDLDLGHYERFIRTKMSRRNNFTTGRIYSDVLRKERRGDYLGATVQVIPHITNAIKERVLEGGEGHDVVLVEIGGTVGDIESLPFLEAIRQLAVDIGREHALFMHLTLVPYLAAAGEVKTKPTQHSVKELLSIGIQPDILICRSDRAVPANERAKIALFCNVPEKAVISMKDVDSIYKIPGLLKSQGLDDYICKRFSLNCPEANLSEWEQVIYEEANPAGEVTIGMVGKYIELPDAYKSVIEALKHGGLKNRVTVNIKLIDSQDVETRGVEILKDLDAILIPGGFGYRGVEGKIATARYARENNIPYLGICLGMQVALIEFARNVAGMDNANSTEFVPDCKYPVVALITEWRDEDGNVEVRSEKSDLGGTMRLGAQQCQLSDDSLVRQLYGASTIVERHRHRYEVNNMLLKQIEAAGLRVAGRSGDDQLVEIIEVPNHPWFVACQFHPEFTSTPRDGHPLFAGFVKAANEHQKRQAK >NZ_CP019416.1|WP_000210451.1|2988742_2989543_-|nucleoside-triphosphate-pyrophosphohydrolase MTTNHQIDRLLTLMQRLRDPENGCPWDKEQTFASIAPYTLEETYEVLDAIAREDFDDLRGELGDLLFQVVFYAQMAQEEGRFDFNDICAAISDKLERRHPHVFGELSADNSEEALVRWEQIKTEERAQKAQHSALDDIPRSLPALMRAQKIQKRCSNVGFDWTTLGPVVDKVYEEIDEVMFEARQAVVDQAKLEEEMGDLLFATVNMARHLGTKAELALQKANDKFERRFREVERIVAARGLEMTGVDLETMEEVWQEVKRQEIDL >NZ_CP019416.1|WP_000842512.1|2990150_2990738_+|fimbrial-protein MKSSHFCKLAVTASLVMGIVSGAQAAGSNTAKVTFLGNIVDSPCSVTLDTEDQTVNMGSSIGNGTLSNGKTTINNARTFHIDLEGCTWATEKNMNVVFTTGSGTTAATGATDNLALMKTDGTGAISNVSLAIGDAGKNNIKLGDTYTQAIADLDGDTILDEKQSLNFTAWLVGAATGTVGTGEFSSAANVTISYL >NZ_CP019416.1|WP_000981789.1|2990817_2993517_+|fimbrial-biogenesis-outer-membrane-usher-protein MMNNTWKSVLCPIACGVGMLLSASPYSASGKDIEFNTDFLDVKNRDNVNIAQFSRKGFILPGVYLLQIKINGQTLPQEFPVNWVIPEHDPQGSEVCAEPELVTQLGIKPELAEKLVWITHGERQCLAPDSLKGMDFQADLGHSTLLVNLPQAYMEYSDVDWDPPARWDNGIPGIILDYNINNQLRHDQESGSEEQSISGNGTLGANLGAWRLRADWQASYDHRDDDENTSTLHDQSWSRYYAYRALPTLGAKLTLGESYLQSDVFDSFNYIGASVISDDQMLPPKLRGYAPEIVGIARSNAKVKVSWQGRVLYETQVPAGPFRIQDLNQSVSGTLHVTVEEQNGQTQEFDVNTASVPFLTRPGMVRYKMALGRPQDWDHHPITGTFASAEASWGVTNGWSLYGGAIGESNYQAVALGSGKDLGVVGAVAVDITHSIAHMPQDDGFDGETLQGNSYRISYSRDFDEIDSRLTFAGYRFSEKNFMSMSDYLDAKTYHHLNAGHEKERYTVTYNQNFREQGMSAYFSYSRSTFWDSPDQSNYNLSLSWYFDLGSIKNLSASLNGYRSEYNGDKDDGVYISLSVPWGNDSISYNGTFNGNQHRNQLGYSGHSQNGDNWQLHVGQDEQGAQADGYYSHQGALTDIDLSADYEEGSYRSLGMSLRGGMTLTTQGGALHRGSLAGSTRLLVDTDGIADVPVSGNGSPTSTNIFGKAVIADVGSYSRSLARIDLNKLPEKAEATKSVVQITLTEGAIGYRHFDVVSGEKMMAVFRLADGDFPPFGAEVKNERQQQLGLVADDGNAWLAGVKAGETLKVFWDGAAQCEASLPPTFTPELLANALLLPCKMLEGQPPTAPQKSSPLPAQPLIQEHTQTDGQPAAPVATTTQTPPIPLADNHAVNRKDME >NZ_CP019416.1|WP_001044459.1|2993529_2994303_+|fimbria/pilus-periplasmic-chaperone MNKTNHFKRQALIASVLLAAPLVSHSAIVPDRTRVIFNGNENSITVTLKNGNATLPYLAQAWLEDDKFAKDTRYFTALPPLQRIEPKSDGQVKVQPLPAAASLPQDRESLFYFNVREIPPKSDKPNTLQLALQTRIKFFYRPVAVARQVDKTHPWQTKLTLTYQGDGVIFDNPTPFYLVISNAGSKENETASGFKNLLIAPREKVTSPIKGASLGSSPVVGYVDDYGGHRLLVFTCSGNTCKVNEEKTRDAEKKANK >NZ_CP019416.1|WP_000178265.1|2994322_2994829_+|fimbrial-protein MTMLTRWKMLVLLCGGFVAGTEAAGTKTVQLELHLVVTQPPPCTVGGASVEFGDVLTTKVGDASQTKPVGYSLNCDGRASDYLKLQIQGTTTTISGEQVLQTSVQGLGIRIQQAGNKQLVPVGITDWLNFTLSGSNGPELEAVPVKEPSTQLAGGDFNASATLVVDYQ >NZ_CP019416.1|WP_000832395.1|2994843_2995314_+|fimbrial-protein MKRVLILTLLITQFACADNLTFHGKLINPPACTINNGETLEVSFGSVIIDNIDGVNYLTEIPWTLTCDSSFRDDALTFTLSYLGTATPYSAKALTTSVPELGIELQQNGTVFPPGTSLTINESSLPTLKAVPVKQPGKEPAEGDFEAFATLQVDYQ >NZ_CP019416.1|WP_076731093.1|2995310_2995847_+|fimbrial-protein MNRIFQTAGHLIGGVMLWAVCNTLPAATPNVHYSGKLVAGACNLVVDNDTMATVDFHTIGSDNFDASGQTTPVPFTLSLQDCKTALANGVLVTFQGVEDSTLPGLLALEPSSEASGFAIGVETAAQQPVSINATVGTAFMLKEGITTINLQARLQKYAGEDVMPGEFKGSATVSFEYQ |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP019416_2 | 2.5|2984027|33|NZ_CP019416|CRISPRCasFinder,CRT | 2984027-2984059 | 33 | NZ_CP032236 | Yersinia ruckeri strain NHV_3758 plasmid pYR4, complete sequence | 63403-63435 | 4 | 0.879 |
NZ_CP019416_2 | 2.5|2984027|33|NZ_CP019416|CRISPRCasFinder,CRT | 2984027-2984059 | 33 | NZ_LN681230 | Yersinia ruckeri strain CSF007-82 plasmid pYR3, complete sequence | 91355-91387 | 4 | 0.879 |
NZ_CP019416_2 | 2.14|2984027|35|NZ_CP019416|PILER-CR | 2984027-2984061 | 35 | NZ_CP032236 | Yersinia ruckeri strain NHV_3758 plasmid pYR4, complete sequence | 63403-63437 | 5 | 0.857 |
NZ_CP019416_2 | 2.14|2984027|35|NZ_CP019416|PILER-CR | 2984027-2984061 | 35 | NZ_LN681230 | Yersinia ruckeri strain CSF007-82 plasmid pYR3, complete sequence | 91353-91387 | 5 | 0.857 |
NZ_CP019416_2 | 2.15|2984089|34|NZ_CP019416|PILER-CR | 2984089-2984122 | 34 | NZ_CP044178 | Salmonella enterica subsp. enterica serovar Concord strain AR-0407 plasmid pAR-0407-1 | 18967-19000 | 5 | 0.853 |
NZ_CP019416_2 | 2.15|2984089|34|NZ_CP019416|PILER-CR | 2984089-2984122 | 34 | CP053324 | Salmonella enterica subsp. salamae serovar 40:c:e,n,z15 strain 2013K-0524 plasmid unnamed, complete sequence | 25136-25169 | 5 | 0.853 |
NZ_CP019416_2 | 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT | 2984089-2984120 | 32 | NZ_CP044178 | Salmonella enterica subsp. enterica serovar Concord strain AR-0407 plasmid pAR-0407-1 | 18969-19000 | 6 | 0.812 |
NZ_CP019416_2 | 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT | 2984089-2984120 | 32 | CP053324 | Salmonella enterica subsp. salamae serovar 40:c:e,n,z15 strain 2013K-0524 plasmid unnamed, complete sequence | 25138-25169 | 6 | 0.812 |
NZ_CP019416_2 | 2.15|2984089|34|NZ_CP019416|PILER-CR | 2984089-2984122 | 34 | NZ_LN890526 | Salmonella enterica subsp. enterica serovar Weltevreden strain 2511STDY5712385 plasmid 3, complete sequence | 31716-31749 | 6 | 0.824 |
NZ_CP019416_2 | 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT | 2984089-2984120 | 32 | NZ_LN890526 | Salmonella enterica subsp. enterica serovar Weltevreden strain 2511STDY5712385 plasmid 3, complete sequence | 31718-31749 | 7 | 0.781 |
NZ_CP019416_2 | 2.3|2983905|32|NZ_CP019416|CRISPRCasFinder,CRT | 2983905-2983936 | 32 | MN694003 | Marine virus AFVG_250M677, complete genome | 17629-17660 | 8 | 0.75 |
NZ_CP019416_2 | 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT | 2984089-2984120 | 32 | NZ_CP053022 | Sphingobium yanoikuyae strain YC-XJ2 plasmid p-A-Sy, complete sequence | 329022-329053 | 8 | 0.75 |
NZ_CP019416_2 | 2.8|2984211|32|NZ_CP019416|CRISPRCasFinder,CRT | 2984211-2984242 | 32 | MG592432 | Vibrio phage 1.050.O._10N.286.48.A6, partial genome | 21687-21718 | 8 | 0.75 |
NZ_CP019416_2 | 2.8|2984211|32|NZ_CP019416|CRISPRCasFinder,CRT | 2984211-2984242 | 32 | MG592431 | Vibrio phage 1.049.O._10N.286.54.B5, partial genome | 21426-21457 | 8 | 0.75 |
NZ_CP019416_2 | 2.12|2983905|34|NZ_CP019416|PILER-CR | 2983905-2983938 | 34 | MN694003 | Marine virus AFVG_250M677, complete genome | 17627-17660 | 8 | 0.765 |
NZ_CP019416_1 | 1.1|2967115|32|NZ_CP019416|CRISPRCasFinder,CRT | 2967115-2967146 | 32 | NZ_MG266000 | Clostridioides difficile strain 7032985 plasmid pCD-ISS1, complete sequence | 5501-5532 | 9 | 0.719 |
NZ_CP019416_1 | 1.3|2967237|32|NZ_CP019416|CRISPRCasFinder,CRT | 2967237-2967268 | 32 | MK449011 | Streptococcus phage Javan92, complete genome | 36157-36188 | 9 | 0.719 |
NZ_CP019416_1 | 1.3|2967237|32|NZ_CP019416|CRISPRCasFinder,CRT | 2967237-2967268 | 32 | MK448835 | Streptococcus phage Javan93, complete genome | 36157-36188 | 9 | 0.719 |
NZ_CP019416_1 | 1.3|2967237|32|NZ_CP019416|CRISPRCasFinder,CRT | 2967237-2967268 | 32 | MK448836 | Streptococcus phage Javan95, complete genome | 37400-37431 | 9 | 0.719 |
NZ_CP019416_1 | 1.3|2967237|32|NZ_CP019416|CRISPRCasFinder,CRT | 2967237-2967268 | 32 | MK448825 | Streptococcus phage Javan639, complete genome | 37400-37431 | 9 | 0.719 |
NZ_CP019416_1 | 1.7|2967481|32|NZ_CP019416|CRISPRCasFinder,CRT | 2967481-2967512 | 32 | KY006853 | Erythrobacter phage vB_EliS_R6L, complete genome | 41418-41449 | 9 | 0.719 |
NZ_CP019416_1 | 1.9|2967117|32|NZ_CP019416|PILER-CR | 2967117-2967148 | 32 | NZ_MG266000 | Clostridioides difficile strain 7032985 plasmid pCD-ISS1, complete sequence | 5501-5532 | 9 | 0.719 |
NZ_CP019416_1 | 1.11|2967239|32|NZ_CP019416|PILER-CR | 2967239-2967270 | 32 | MK449011 | Streptococcus phage Javan92, complete genome | 36157-36188 | 9 | 0.719 |
NZ_CP019416_1 | 1.11|2967239|32|NZ_CP019416|PILER-CR | 2967239-2967270 | 32 | MK448835 | Streptococcus phage Javan93, complete genome | 36157-36188 | 9 | 0.719 |
NZ_CP019416_1 | 1.11|2967239|32|NZ_CP019416|PILER-CR | 2967239-2967270 | 32 | MK448836 | Streptococcus phage Javan95, complete genome | 37400-37431 | 9 | 0.719 |
NZ_CP019416_1 | 1.11|2967239|32|NZ_CP019416|PILER-CR | 2967239-2967270 | 32 | MK448825 | Streptococcus phage Javan639, complete genome | 37400-37431 | 9 | 0.719 |
NZ_CP019416_1 | 1.15|2967483|32|NZ_CP019416|PILER-CR | 2967483-2967514 | 32 | KY006853 | Erythrobacter phage vB_EliS_R6L, complete genome | 41418-41449 | 9 | 0.719 |
NZ_CP019416_2 | 2.5|2984027|33|NZ_CP019416|CRISPRCasFinder,CRT | 2984027-2984059 | 33 | NZ_CP031947 | Ruegeria sp. AD91A plasmid unnamed1, complete sequence | 143751-143783 | 9 | 0.727 |
NZ_CP019416_2 | 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT | 2984089-2984120 | 32 | NZ_CP048340 | Escherichia coli strain 142 plasmid p142_C, complete sequence | 2410-2441 | 9 | 0.719 |
NZ_CP019416_2 | 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT | 2984089-2984120 | 32 | NZ_LR130559 | Escherichia coli strain MS14385 isolate MS14385 plasmid 5 | 41882-41913 | 9 | 0.719 |
NZ_CP019416_2 | 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT | 2984089-2984120 | 32 | NZ_CP020518 | Escherichia coli strain 222 plasmid unnamed2, complete sequence | 13450-13481 | 9 | 0.719 |
NZ_CP019416_2 | 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT | 2984089-2984120 | 32 | NZ_CP020497 | Escherichia coli strain 103 plasmid unnamed2, complete sequence | 37140-37171 | 9 | 0.719 |
NZ_CP019416_2 | 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT | 2984089-2984120 | 32 | NZ_CP040921 | Escherichia coli strain FC853_EC plasmid p853EC2, complete sequence | 32060-32091 | 9 | 0.719 |
NZ_CP019416_2 | 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT | 2984089-2984120 | 32 | CP053252 | Escherichia coli strain SCU-204 plasmid pSCU-204-5, complete sequence | 19381-19412 | 9 | 0.719 |
NZ_CP019416_2 | 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT | 2984089-2984120 | 32 | NZ_CP042622 | Escherichia coli strain NCYU-26-73 plasmid pNCYU-26-73-7, complete sequence | 2614-2645 | 9 | 0.719 |
NZ_CP019416_2 | 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT | 2984089-2984120 | 32 | NZ_LT985302 | Escherichia coli strain ECOR 39 genome assembly, plasmid: RCS82_pI | 11943-11974 | 9 | 0.719 |
NZ_CP019416_2 | 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT | 2984089-2984120 | 32 | NZ_CP028194 | Escherichia coli strain CFSAN018748 plasmid pGMI14-004_3, complete sequence | 15383-15414 | 9 | 0.719 |
NZ_CP019416_2 | 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT | 2984089-2984120 | 32 | NZ_CP024865 | Escherichia coli strain AR_0015 plasmid unitig_3_pilon, complete sequence | 22646-22677 | 9 | 0.719 |
NZ_CP019416_2 | 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT | 2984089-2984120 | 32 | AP019710 | Escherichia coli O145:H28 122715 plasmid pO145_122715_2 DNA, complete genome | 4361-4392 | 9 | 0.719 |
NZ_CP019416_2 | 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT | 2984089-2984120 | 32 | NZ_CP024829 | Escherichia coli strain CREC-544 plasmid pCREC-544_3, complete sequence | 2221-2252 | 9 | 0.719 |
NZ_CP019416_2 | 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT | 2984089-2984120 | 32 | NZ_CP009861 | Escherichia coli strain ECONIH1 plasmid pECO-b75, complete sequence | 2868-2899 | 9 | 0.719 |
NZ_CP019416_2 | 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT | 2984089-2984120 | 32 | CP025877 | Escherichia coli strain 503458 plasmid p503458_49, complete sequence | 18343-18374 | 9 | 0.719 |
NZ_CP019416_2 | 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT | 2984089-2984120 | 32 | NZ_CP023368 | Escherichia coli strain 1428 plasmid p48, complete sequence | 4914-4945 | 9 | 0.719 |
NZ_CP019416_2 | 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT | 2984089-2984120 | 32 | NZ_CP032259 | Escherichia coli strain AR_0067 plasmid unnamed2, complete sequence | 23402-23433 | 9 | 0.719 |
NZ_CP019416_2 | 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT | 2984089-2984120 | 32 | NZ_CP037450 | Escherichia coli strain ATCC 25922 plasmid unnamed, complete sequence | 15851-15882 | 9 | 0.719 |
NZ_CP019416_2 | 2.8|2984211|32|NZ_CP019416|CRISPRCasFinder,CRT | 2984211-2984242 | 32 | NC_047790 | Pseudoalteromonas phage C5a, complete genome | 34441-34472 | 9 | 0.719 |
NZ_CP019416_2 | 2.10|2984333|32|NZ_CP019416|CRISPRCasFinder,CRT | 2984333-2984364 | 32 | CP006879 | Rhizobium gallicum bv. gallicum R602 plasmid pRgalR602b, complete sequence | 405613-405644 | 9 | 0.719 |
NZ_CP019416_2 | 2.4|2983966|32|NZ_CP019416|CRISPRCasFinder,CRT | 2983966-2983997 | 32 | NZ_LR134399 | Listeria monocytogenes strain NCTC7974 plasmid 2, complete sequence | 103231-103262 | 10 | 0.688 |
NZ_CP019416_2 | 2.10|2984333|32|NZ_CP019416|CRISPRCasFinder,CRT | 2984333-2984364 | 32 | NZ_CP049244 | Rhizobium pseudoryzae strain DSM 19479 plasmid unnamed3, complete sequence | 699963-699994 | 10 | 0.688 |
NZ_CP019416_2 | 2.14|2984027|35|NZ_CP019416|PILER-CR | 2984027-2984061 | 35 | NZ_CP031947 | Ruegeria sp. AD91A plasmid unnamed1, complete sequence | 143751-143785 | 11 | 0.686 |
1. spacer 2.5|2984027|33|NZ_CP019416|CRISPRCasFinder,CRT matches to NZ_CP032236 (Yersinia ruckeri strain NHV_3758 plasmid pYR4, complete sequence) position: , mismatch: 4, identity: 0.879
tcaggaacgcgcggcggaagagcttggtgtttg CRISPR spacer tcaggaacgcgcagcggaagagcttggtaaatg Protospacer ************.***************. **
2. spacer 2.5|2984027|33|NZ_CP019416|CRISPRCasFinder,CRT matches to NZ_LN681230 (Yersinia ruckeri strain CSF007-82 plasmid pYR3, complete sequence) position: , mismatch: 4, identity: 0.879
tcaggaacgcgcggcggaagagcttggtgtttg CRISPR spacer tcaggaacgcgcagcggaagagcttggtaaatg Protospacer ************.***************. **
3. spacer 2.14|2984027|35|NZ_CP019416|PILER-CR matches to NZ_CP032236 (Yersinia ruckeri strain NHV_3758 plasmid pYR4, complete sequence) position: , mismatch: 5, identity: 0.857
tcaggaacgcgcggcggaagagcttggtgtttgcg CRISPR spacer tcaggaacgcgcagcggaagagcttggtaaatgcc Protospacer ************.***************. ***
4. spacer 2.14|2984027|35|NZ_CP019416|PILER-CR matches to NZ_LN681230 (Yersinia ruckeri strain CSF007-82 plasmid pYR3, complete sequence) position: , mismatch: 5, identity: 0.857
tcaggaacgcgcggcggaagagcttggtgtttgcg CRISPR spacer tcaggaacgcgcagcggaagagcttggtaaatgcc Protospacer ************.***************. ***
5. spacer 2.15|2984089|34|NZ_CP019416|PILER-CR matches to NZ_CP044178 (Salmonella enterica subsp. enterica serovar Concord strain AR-0407 plasmid pAR-0407-1) position: , mismatch: 5, identity: 0.853
gctgcctttcccggagttccggcccct----aaattgg CRISPR spacer ggtgcctttcccggagttccggccccttctcaaa---- Protospacer * ************************* ***
6. spacer 2.15|2984089|34|NZ_CP019416|PILER-CR matches to CP053324 (Salmonella enterica subsp. salamae serovar 40:c:e,n,z15 strain 2013K-0524 plasmid unnamed, complete sequence) position: , mismatch: 5, identity: 0.853
gctgcctttcccggagttccggcccct----aaattgg CRISPR spacer ggtgcctttcccggagttccggccccttctcaaa---- Protospacer * ************************* ***
7. spacer 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT matches to NZ_CP044178 (Salmonella enterica subsp. enterica serovar Concord strain AR-0407 plasmid pAR-0407-1) position: , mismatch: 6, identity: 0.812
gctgcctttcccggagttccggcccctaaatt CRISPR spacer ggtgcctttcccggagttccggccccttctca Protospacer * ************************* .
8. spacer 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT matches to CP053324 (Salmonella enterica subsp. salamae serovar 40:c:e,n,z15 strain 2013K-0524 plasmid unnamed, complete sequence) position: , mismatch: 6, identity: 0.812
gctgcctttcccggagttccggcccctaaatt CRISPR spacer ggtgcctttcccggagttccggccccttctca Protospacer * ************************* .
9. spacer 2.15|2984089|34|NZ_CP019416|PILER-CR matches to NZ_LN890526 (Salmonella enterica subsp. enterica serovar Weltevreden strain 2511STDY5712385 plasmid 3, complete sequence) position: , mismatch: 6, identity: 0.824
gctgcctttcccggagttccggcccct----aaattgg CRISPR spacer ggtgccttttccggagttccggccccttctcaaa---- Protospacer * *******.***************** ***
10. spacer 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT matches to NZ_LN890526 (Salmonella enterica subsp. enterica serovar Weltevreden strain 2511STDY5712385 plasmid 3, complete sequence) position: , mismatch: 7, identity: 0.781
gctgcctttcccggagttccggcccctaaatt CRISPR spacer ggtgccttttccggagttccggccccttctca Protospacer * *******.***************** .
11. spacer 2.3|2983905|32|NZ_CP019416|CRISPRCasFinder,CRT matches to MN694003 (Marine virus AFVG_250M677, complete genome) position: , mismatch: 8, identity: 0.75
cgattctacggcaacaggccaggctgcgaccg CRISPR spacer ggcgagcacggcaacagcccaggctgcgatcg Protospacer * .********** ***********.**
12. spacer 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT matches to NZ_CP053022 (Sphingobium yanoikuyae strain YC-XJ2 plasmid p-A-Sy, complete sequence) position: , mismatch: 8, identity: 0.75
gctgcctttcccggagttccggcccctaaatt--- CRISPR spacer tatgcctttcccggctttccggccc---aactgac Protospacer ************ ********* **.*
13. spacer 2.8|2984211|32|NZ_CP019416|CRISPRCasFinder,CRT matches to MG592432 (Vibrio phage 1.050.O._10N.286.48.A6, partial genome) position: , mismatch: 8, identity: 0.75
tgattattgacgacaacagcacagaccggcag CRISPR spacer ttataattgactacaacagcacagagcagatt Protospacer * ** ****** ************* *.*
14. spacer 2.8|2984211|32|NZ_CP019416|CRISPRCasFinder,CRT matches to MG592431 (Vibrio phage 1.049.O._10N.286.54.B5, partial genome) position: , mismatch: 8, identity: 0.75
tgattattgacgacaacagcacagaccggcag CRISPR spacer ttataattgactacaacagcacagagcagatt Protospacer * ** ****** ************* *.*
15. spacer 2.12|2983905|34|NZ_CP019416|PILER-CR matches to MN694003 (Marine virus AFVG_250M677, complete genome) position: , mismatch: 8, identity: 0.765
cgattctacggcaacaggccaggctgcgaccgcg CRISPR spacer ggcgagcacggcaacagcccaggctgcgatcgcg Protospacer * .********** ***********.****
16. spacer 1.1|2967115|32|NZ_CP019416|CRISPRCasFinder,CRT matches to NZ_MG266000 (Clostridioides difficile strain 7032985 plasmid pCD-ISS1, complete sequence) position: , mismatch: 9, identity: 0.719
tatttataagcgtgtcatctatgcaacccaac CRISPR spacer aatttataatcatgtcatctatgccataattc Protospacer ******** *.************ *. *
17. spacer 1.3|2967237|32|NZ_CP019416|CRISPRCasFinder,CRT matches to MK449011 (Streptococcus phage Javan92, complete genome) position: , mismatch: 9, identity: 0.719
ggccgctggtcaaattcccaatctgagcaatc CRISPR spacer tacatcttgacaaattcccaatctgagcgact Protospacer .* ** * ******************.*..
18. spacer 1.3|2967237|32|NZ_CP019416|CRISPRCasFinder,CRT matches to MK448835 (Streptococcus phage Javan93, complete genome) position: , mismatch: 9, identity: 0.719
ggccgctggtcaaattcccaatctgagcaatc CRISPR spacer tacatcttgacaaattcccaatctgagcgact Protospacer .* ** * ******************.*..
19. spacer 1.3|2967237|32|NZ_CP019416|CRISPRCasFinder,CRT matches to MK448836 (Streptococcus phage Javan95, complete genome) position: , mismatch: 9, identity: 0.719
ggccgctggtcaaattcccaatctgagcaatc CRISPR spacer tacatcttgacaaattcccaatctgagcgact Protospacer .* ** * ******************.*..
20. spacer 1.3|2967237|32|NZ_CP019416|CRISPRCasFinder,CRT matches to MK448825 (Streptococcus phage Javan639, complete genome) position: , mismatch: 9, identity: 0.719
ggccgctggtcaaattcccaatctgagcaatc CRISPR spacer tacatcttgacaaattcccaatctgagcgact Protospacer .* ** * ******************.*..
21. spacer 1.7|2967481|32|NZ_CP019416|CRISPRCasFinder,CRT matches to KY006853 (Erythrobacter phage vB_EliS_R6L, complete genome) position: , mismatch: 9, identity: 0.719
gagaatgctcatgcgcgtgagcgccatatatt CRISPR spacer cgaaatgatcatgcgcgtcagcgccattgcgt Protospacer ..**** ********** ******** *
22. spacer 1.9|2967117|32|NZ_CP019416|PILER-CR matches to NZ_MG266000 (Clostridioides difficile strain 7032985 plasmid pCD-ISS1, complete sequence) position: , mismatch: 9, identity: 0.719
tatttataagcgtgtcatctatgcaacccaac CRISPR spacer aatttataatcatgtcatctatgccataattc Protospacer ******** *.************ *. *
23. spacer 1.11|2967239|32|NZ_CP019416|PILER-CR matches to MK449011 (Streptococcus phage Javan92, complete genome) position: , mismatch: 9, identity: 0.719
ggccgctggtcaaattcccaatctgagcaatc CRISPR spacer tacatcttgacaaattcccaatctgagcgact Protospacer .* ** * ******************.*..
24. spacer 1.11|2967239|32|NZ_CP019416|PILER-CR matches to MK448835 (Streptococcus phage Javan93, complete genome) position: , mismatch: 9, identity: 0.719
ggccgctggtcaaattcccaatctgagcaatc CRISPR spacer tacatcttgacaaattcccaatctgagcgact Protospacer .* ** * ******************.*..
25. spacer 1.11|2967239|32|NZ_CP019416|PILER-CR matches to MK448836 (Streptococcus phage Javan95, complete genome) position: , mismatch: 9, identity: 0.719
ggccgctggtcaaattcccaatctgagcaatc CRISPR spacer tacatcttgacaaattcccaatctgagcgact Protospacer .* ** * ******************.*..
26. spacer 1.11|2967239|32|NZ_CP019416|PILER-CR matches to MK448825 (Streptococcus phage Javan639, complete genome) position: , mismatch: 9, identity: 0.719
ggccgctggtcaaattcccaatctgagcaatc CRISPR spacer tacatcttgacaaattcccaatctgagcgact Protospacer .* ** * ******************.*..
27. spacer 1.15|2967483|32|NZ_CP019416|PILER-CR matches to KY006853 (Erythrobacter phage vB_EliS_R6L, complete genome) position: , mismatch: 9, identity: 0.719
gagaatgctcatgcgcgtgagcgccatatatt CRISPR spacer cgaaatgatcatgcgcgtcagcgccattgcgt Protospacer ..**** ********** ******** *
28. spacer 2.5|2984027|33|NZ_CP019416|CRISPRCasFinder,CRT matches to NZ_CP031947 (Ruegeria sp. AD91A plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.727
tcaggaacgcgcggcggaagagcttggtgtttg CRISPR spacer ctttgcccgtgcggcggaagaccttggtgtttc Protospacer .. * **.*********** **********
29. spacer 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT matches to NZ_CP048340 (Escherichia coli strain 142 plasmid p142_C, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
30. spacer 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT matches to NZ_LR130559 (Escherichia coli strain MS14385 isolate MS14385 plasmid 5) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
31. spacer 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT matches to NZ_CP020518 (Escherichia coli strain 222 plasmid unnamed2, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
32. spacer 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT matches to NZ_CP020497 (Escherichia coli strain 103 plasmid unnamed2, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
33. spacer 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT matches to NZ_CP040921 (Escherichia coli strain FC853_EC plasmid p853EC2, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
34. spacer 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT matches to CP053252 (Escherichia coli strain SCU-204 plasmid pSCU-204-5, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
35. spacer 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT matches to NZ_CP042622 (Escherichia coli strain NCYU-26-73 plasmid pNCYU-26-73-7, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
36. spacer 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT matches to NZ_LT985302 (Escherichia coli strain ECOR 39 genome assembly, plasmid: RCS82_pI) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
37. spacer 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT matches to NZ_CP028194 (Escherichia coli strain CFSAN018748 plasmid pGMI14-004_3, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
38. spacer 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT matches to NZ_CP024865 (Escherichia coli strain AR_0015 plasmid unitig_3_pilon, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
39. spacer 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT matches to AP019710 (Escherichia coli O145:H28 122715 plasmid pO145_122715_2 DNA, complete genome) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
40. spacer 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT matches to NZ_CP024829 (Escherichia coli strain CREC-544 plasmid pCREC-544_3, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
41. spacer 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT matches to NZ_CP009861 (Escherichia coli strain ECONIH1 plasmid pECO-b75, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
42. spacer 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT matches to CP025877 (Escherichia coli strain 503458 plasmid p503458_49, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
43. spacer 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT matches to NZ_CP023368 (Escherichia coli strain 1428 plasmid p48, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
44. spacer 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT matches to NZ_CP032259 (Escherichia coli strain AR_0067 plasmid unnamed2, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
45. spacer 2.6|2984089|32|NZ_CP019416|CRISPRCasFinder,CRT matches to NZ_CP037450 (Escherichia coli strain ATCC 25922 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.719
gctgcctttcccggagttccggcccctaaatt CRISPR spacer gtgccctttaccggagttccggccccttctca Protospacer *. ***** ***************** .
46. spacer 2.8|2984211|32|NZ_CP019416|CRISPRCasFinder,CRT matches to NC_047790 (Pseudoalteromonas phage C5a, complete genome) position: , mismatch: 9, identity: 0.719
tgattattgacgacaacagcacagaccggcag CRISPR spacer agcttattgacgaaaacggcacagacaccaaa Protospacer * ********** ***.******** *.
47. spacer 2.10|2984333|32|NZ_CP019416|CRISPRCasFinder,CRT matches to CP006879 (Rhizobium gallicum bv. gallicum R602 plasmid pRgalR602b, complete sequence) position: , mismatch: 9, identity: 0.719
gaatctggaggccaacagcgcggcgaaatcct CRISPR spacer gaatctggagggcgacagcgcggtcgaccctg Protospacer *********** *.*********. .* .*.
48. spacer 2.4|2983966|32|NZ_CP019416|CRISPRCasFinder,CRT matches to NZ_LR134399 (Listeria monocytogenes strain NCTC7974 plasmid 2, complete sequence) position: , mismatch: 10, identity: 0.688
atcaaacatggaaacccctttaatgagagcaa CRISPR spacer ctaaaacatggaaaccactgtaatgacgaatc Protospacer * ************* ** ****** ..
49. spacer 2.10|2984333|32|NZ_CP019416|CRISPRCasFinder,CRT matches to NZ_CP049244 (Rhizobium pseudoryzae strain DSM 19479 plasmid unnamed3, complete sequence) position: , mismatch: 10, identity: 0.688
gaatctggaggccaacagcgcggcgaaatcct CRISPR spacer gtggtcataggccatcagcgcggcgatatccc Protospacer * . ... ****** *********** ****.
50. spacer 2.14|2984027|35|NZ_CP019416|PILER-CR matches to NZ_CP031947 (Ruegeria sp. AD91A plasmid unnamed1, complete sequence) position: , mismatch: 11, identity: 0.686
tcaggaacgcgcggcggaagagcttggtgtttgcg CRISPR spacer ctttgcccgtgcggcggaagaccttggtgtttctc Protospacer .. * **.*********** ********** .
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
934626 : 941939
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP019416|934626:941939|DBSCAN-SWA TATGCGTGCTAAGGGAAAGAAATTTAAAAAGCGTTATCTGGTCATTATTTTAATTCTTTTAGTGGGGGGGATGGCTGGCTGGCGAATGATAAATGCGCCGCTGCCAACTTATCAGACATTAATCGTGCGGCCAGGCGATCTTGAACAGAGTGTACTGGCGACTGGAAAACTGGACGCGTTGCGTAAAGTGGATGTCGGCGCGCAGGTGAGCGGCCAGTTGAAAACGCTGCTGGTCTCCATTGGCGATAACGTTAAAAAAGATCAGCTACTCGGCGTGATTGACCCAGATCAGGCGGAGAACCAGATAAAAGAGGTCGAGGCCACCCTGATGGAGCTGAACGCGGAGCGTCAGCAGGCAGCCGCTGAGTTAAAGTTGGCGCGGGTTACGCTGGCGCGCCAGCAGCAGTTAGCTAAGACTCAGGCGGTATCGCAACAGGATCTGGATACCGCGGCGACGGAGATGGCGGTTAAACAGGCGCGTATTGGCACCATAGATGCCCAGATCAAACGTAATCGGGCCTCGTTGGACACCGCGAAAACCAACCTGGAATATACCCGTATTGTCGCCCCCATGGCGGGGGAAGTGACGCAAATCACTACCCTGCAAGGACAAACGGTGATTGCAGCTCAGCAGGCGCCCAATATTCTGACGCTGGCGGATATGAGCACCATGCTGGTAAAAGCGCAGGTCTCGGAAGCGGACGTGATCCATCTTCGGGCGGGGCAGAAAGCATGGTTCACCATTGCAGGCGATCCGCAAACGCGCTATGAAGGCGTTTTAAAAGATATTCTGCCGACGCCGGAAAAGATCAACGACGCTATTTTTTATTACGCCCGGTTTGAAGTGCCGAATCCCAAAAGAATCTTGCGTCTTGATATGACCGCGCAGGTTTATATTCAACTCATGGATGTCAAAAATGTGTTGATTATTCCTCTCGCCGCGCTTGGCGAACCGGTGGGCGGCAATCGTTATAAAGTGGCGCTGTTGCGTAACGGCGAAAAACGTGAGCGCGAAGTGGTCATTGGCGAGCGTAACGATACAGACGTGGAAGTGGTTAAAGGTCTGGAAGCGGGCGATGAGGTGATCATCGGCGAAAGCAGGCCAGGAGCGACGCCATGACGGCATTGCTTGAACTGCGCAATGTGAGTCGCAGCTACCCCTCCGGAGAAGAGCAGGTGGCGGTGTTGAAAGATATCTCCCTGCAAATCCACGCCGGGGAGATGGTGGCGATCGTCGGCGTTTCCGGTTCTGGAAAATCAACGCTGATGAATATCCTCGGGTGCCTGGATAAACCGACCAGCGGCACTTATCGGGTGGCGGGGCGGGACGTCTCGACGCTGGACCCGGACGCGCTAGCGCAGCTGCGGCGTGAGCATTTTGGCTTTATCTTTCAGCGCTACCATCTGTTGTCGCATTTAACGGCAGCGCAAAATGTTGAAATCCCCGCCGTCTACGCCGGCATTGAACGCAAAAAACGCCAGGCGCGCGCCAGAGAGTTACTTCTGCGGCTGGGATTAAGCGATCGCGTCGATTACCCGCCTTCACAGCTTTCTGGCGGACAGCAGCAGCGTGTCAGTATTGCCCGCGCGCTGATGAACGGTGGACAGGTGATTCTGGCAGATGAGCCGACCGGCGCGCTGGATAGCCATTCCGGCGAAGAGGTGATGGCGATTTTGCGCCAACTGCGCGATCGCGGACATACGGTGATCATTGTGACGCACGATCCGCTGATTGCCGCCCAGGCGGAGCGGATTATTGAAATTCACGATGGCAAGATTGTCCATAATCCGCCCGCGGAGGAAAAGAAACGCGAACAGGGCGTTGACGCTGCCGTAGTTAATACGGCTCCCGGCTGGCGGCAATTTGCCAGCAGCTTTCGCGAAGCGCTGTCAATGGCGTGGTTAGCGATGGCCGCTAACAAAATGCGTACTTTACTGACCATGCTGGGAATTATTATCGGTATTGCGTCGGTGGTGTCGATTGTGGTGGTCGGCGACGCCGCAAAACAGATGGTACTGGCGGATATCCGCGCTATGGGCACTAACACGATTGATATTCATCCAGGCAAAGATTTTGGCGACGACAATCCGCAGTATCGACAGGCGCTGAAATATGACGATCTGGTGGCTATTCAGAAACAGCTGTGGGTTAACTCTGCGACGCCCAGCGTTTCAAAGAGCCTACGTCTTCGCTATGGCAATATTGATATTGCCGTAAATGCTAATGGCGTCAGTGGCGATTATTTTAACGTTTACGGCATGTCCTTTAGGGAGGGGAACACCTTCAATGATGTACAGCAACAGGATCGCGCGCAGGTGGTGGTGCTGGATGCCAACACGCGACGCCAGCTATTTCCAAATAAAGCGAATGTCGTAGGGGAAGTGGTGCTGGTGGGTAATATGCCGGTTATTGTTATTGGCGTGGCGGAAGAGAAACCGTCCATGTACGGCAATAGCAATCTGTTGCAAGTTTGGTTGCCCTATAGCACGATGTCAGATCGCATAATGGGTCAGTCATGGCTTAACTCGATCACCGTTCGTGTGAAAGATGGCGTTGATAGCGATCAGGCTGAACAGCAGCTTACTCGCCTGCTCACCTTACGCCACGGTAAAAAAGACTTCTTCACCTGGAATATGGACAGCGTCCTGAAAACGGCTGAAAAAACCACCTATACTCTTCAGTTATTTCTGACGCTGGTGGCCGTCATTTCGCTGGTTGTCGGCGGCATTGGCGTTATGAATATTATGCTGGTTTCCGTCACCGAGCGAACGCGTGAAATCGGCATCCGTATGGCGGTAGGCGCGCGCGCCAGCGATGTGCTACAGCAGTTTCTTATTGAAGCGGTGCTGGTTTGCCTGGTTGGGGGAGCGCTGGGGATTAGCTTGTCGATGTTCATCGCATTTATGCTACAGCTTTTCCTGCCCGGCTGGGAGATCGGTTTTTCACTGACTGCGCTGGCGAGCGCGTTTTTATGTTCGACGTTTACCGGGATACTGTTTGGCTGGCTACCGGCGAGAAACGCGGCGCGACTGGACCCGGTGGATGCGCTGGCAAGGGAGTAATCTTGCTGACGTCATTTGTCGGCCTGATAAGATGCAACAGTGTCGCTATCAGGCACTGGCGTTAAATAAAAATGCCAGCCGATCGGGCTGGCATTTTGCCTCCTGGATGTACACAATGAGACAGAGGAGCTATGCAACGGCCTCTGCTTCGATGGGCACGATGACGCTGGCGTGATTGCCTTTTGGCCCCTGGTGGACATCAAACCGGACAGACTGTCCGGCTTTAAGCGTTCTGTAACCATCCATTTGAATGGTGGAATAATGGGCGAAAATATCCTCGCCGCCGCCTTCAGGGCAGATGAAACCAAACCCTTTGGCATTGTTGAACCACTTTACAGTACCCGTTTCCATGCTTCGACATCCTTCGTAAATCTTATATAAGTAAGATGGAATGAACCGGTGACGGAGTGGGGGCTGTTCAAAACCTCACCAACTCTCGACATTACAATTTAGAGAAATCAGGCGAGGCGTCAAGCATCAGGCAGGGGGGATCGGGTAAAAATGAATCAAAAATTTGAAGCAGTTAACGCTATTGCCGGGAATGTGACAGATGTCGCGGATGGTACTGATAGATGTTAGTTATCTATCAATTGAGGTAGATTGATTGTGTGCATAGACTCTGGTCAGCGGCAGATTTTCCTGCCGACAACCGTAACCGATAATGACGACTGACAATGGGTAAGACGAACGATTGGCTGGATTTTGACCAGTTGGTGGAAGATAGCGTGCGCGACGCGCTAAAACCGCCATCTATGTATAAAGTGATATTAGTCAATGATGATTACACTCCGATGGAGTTTGTTATTGACGTGTTACAAAAATTCTTTTCTTATGATGTAGAACGTGCAACGCAATTGATGCTTGCAGTTCACTATCAAGGCAAAGCTATCTGCGGCGTGTTTACCGCCGAGGTGGCGGAAACCAAAGTGGCGATGGTGAACAAGTATGCAAGGGAGAACGAGCATCCGTTGCTGTGTACGCTGGAAAAAGCCTGAATGCAGGTATAAAAATTGGGGGAGGTGCCTATGCTCAATCAAGAACTGGAACTCAGTTTAAACATGGCTTTCGCCAGAGCGCGCGAGCACCGTCATGAGTTTATGACCGTCGAGCATCTGTTGCTGGCGCTGCTCAGCAACCCATCGGCTCGCGAAGCGCTGGAAGCATGCTCCGTGGATCTGGTGGCGCTCCGTCAGGAGCTCGAAGCCTTCATTGAACAAACCACACCCGTACTGCCTGCCAGTGAAGAAGAGCGTGATACGCAGCCGACGTTAAGTTTCCAGCGTGTCCTGCAACGTGCCGTCTTCCACGTTCAGTCTTCCGGGCGTAGTGAAGTGACTGGCGCGAATGTGCTGGTGGCTATCTTTAGCGAACAGGAATCACAGGCGGCTTATCTGCTGCGTAAGCATGAAGTGAGCCGTCTGGATATCGTGAACTTTATTTCTCACGGGACGCGAAAAGACGAACCGAGCCAATCTTCCGATCTCGGCAATCAGCCAACTGGCGACGAACAAGCTGGCGGGGAGGAACGTATGGAAAACTTCACGACGAATCTTAACCAGCTTGCTCGCGTGGGCGGCATCGATCCGCTGATTGGTCGTGAAAAAGAACTGGAACGCGCGATCCAGGTCTTGTGTCGTCGCCGTAAAAATAACCCGTTGCTGGTAGGGGAATCCGGCGTCGGCAAAACGGCGATTGCCGAAGGGCTGGCCTGGCGTATCGTGCAGGGCGATGTGCCGGAAGTGATGGCCGATTGCACCATTTACTCGCTGGATATCGGTTCGCTGCTGGCGGGCACCAAATACCGCGGCGATTTTGAAAAACGGTTTAAGGCGTTGCTGAAACAGCTTGAGCAGGATACCAACAGCATCCTGTTTATCGATGAAATCCATACCATTATCGGCGCTGGCGCGGCGTCGGGCGGACAGGTGGATGCGGCAAATCTGATTAAACCGCTGCTTTCCAGCGGTAAGATCCGGGTGATCGGCTCAACGACCTATCAGGAATTCAGCAATATTTTTGAGAAAGACCGGGCATTAGCGCGCCGTTTCCAGAAAATTGATATTACCGAGCCTTCGGTGGAAGAGACGGTGCAAATTATCAACGGCTTGAAACCTAAGTACGAAGCGCACCACGACGTGCGTTATACCGCGAAAGCGGTGCGTGCGGCGGTCGAGCTGGCGGTAAAATATATCAATGACCGCCATCTGCCGGATAAAGCCATTGACGTGATTGACGAAGCGGGCGCTCGGGCGCGTCTGATGCCGGTGAGCAAACGTAAGAAAACGGTCAACGTGGCGGATATTGAGTCCGTAGTGGCGCGAATTGCGCGAATTCCTGAAAAGAGCGTCTCGCAGAGCGATCGCGATACGCTGAAGAACCTGGGCGATCGTCTGAAAATGCTGGTCTTCGGCCAGGATAACGCGATTGAGGCGCTGACCGAAGCTATTAAGATGAGTCGTGCGGGTCTGGGCCATGAGCATAAACCTGTCGGCTCATTCTTGTTCGCCGGGCCAACTGGCGTAGGGAAAACTGAAGTTACGGTACAGCTTTCAAAAGCGCTGGGTATTGAGCTGTTGCGCTTCGATATGTCTGAATATATGGAGCGTCATACGGTGAGCCGTTTGATCGGCGCACCTCCGGGATACGTCGGTTTCGACCAGGGCGGGCTGCTGACGGATGCGGTGATTAAGCATCCTCATGCGGTGCTGTTGCTGGATGAGATCGAAAAAGCACACCCGGATGTCTTTAACCTGCTGCTGCAGGTGATGGATAACGGTACGCTGACCGATAACAACGGCCGCAAGGCGGATTTCCGCAACGTGGTGCTGGTGATGACCACCAACGCTGGCGTGCGAGAAACCGAACGTAAATCTATTGGTCTTATTCATCAGGACAACAGTACCGATGCGATGGGCGAGATCAAGAAAGTGTTTACGCCGGAGTTCCGTAACCGTCTCGACAACATTATTTGGTTCGATCATCTGTCTGGCGAGGTGATTCATCAGGTTGTCGATAAGTTTATCGTCGAGTTGCAGGCTCAGTTGGATCAGAAAGGCGTCTCTCTGGAAGTCAGTCAGGAAGCGCGCGACTGGCTGGCGGAAAAGGGCTATGACCGGGCGATGGGCGCACGACCAATGGCGCGTGTGATTCAGGATAACCTGAAAAAACCGCTGGCCAATGAGTTGTTGTTTGGATCGCTGGTTGACGGCGGACAGGTCACCGTCGCGCTGGATAAAGAGAAAAATGCGTTGACGTATGACTTCCAGAGTGCGCAAAAGCACAAGCCGGAAGCCGCGCACTAATCTTCGTTTCACTGTCGTACAAACCGGGCCTTAGCGCCCGGTTTTTTTACGCCGGATAATAGTTAGCCTGATAGCGTTGTGCTCGCGGTAGGCGGATAAGGCGCTTGTAAATTGCGTCATCATCCGGTAATCGACGGGTCAAATGCGAAAAAAAAGTCCGACCCGCTGCGTGAAAAGGGATGCTGAAGCCGCTGTTACCAATGCCAAAAAAACTAATCCTCTTCTCTGAAAAGAGTAGTCTTACACTTAGCACAAAGAATTCTTCTATGATTATCCCATACCTGTATCAAACCAACATCGCTGAATTTCTCGTGACCACAACTGTAGCAGGATATTTTACTCATATTGTGTTTTTTAATATATTCATCTTTGGTTGGAAGTTGCTTCCAGTAGCCTATCAGTCCGGGCATAACCTCTCCTTGAGTGGGTTATGTAATTTGCAAATTATGATAATAATTATATAAGAAATAGTTCAGTATGAAATTATTACTTATCACATTTTCGAAAGCGAATGCTATTAGCATCACACTTTCTACGTTGTTATCAACGATAATGCTTTCGTACAGGTAGCTTAGCTGTTGGCAGTGCTATACCCCGTCCAATCATCCATCAACGCCACCCGTTTAGGCCATAACGTCCCACGCTGATACGCTGCTTCAGCCTTATCTGCCAACTGGTGCGCCAACGCATGTTCAATAACCTCACGTTGATAATCCGTTGCTTCACCAGCCCACTCACGGAAAGTAGAACGGAAGCCATGTTGCGTTAAGTCGATATATCCCATTCGTTTCAATACAGCCAATAACGACATATCAGAAAGTGTTTCAGCGCGAGGGGCAGGGAATACATGATTGTTATCTTTTAATCGTGGTAAATCTTTTAACAAATCAACAGCAGCATCAGACAGATGAACTCGGTGCTTTTTTACTACCGAGAGATACTTATGCAA
Protein sequences of DBSCAN-SWA_1 >NZ_CP019416|934626:941939|938713_940990_+|WP_000934063.1|protease|DBSCAN-SWA MLNQELELSLNMAFARAREHRHEFMTVEHLLLALLSNPSAREALEACSVDLVALRQELEAFIEQTTPVLPASEEERDTQPTLSFQRVLQRAVFHVQSSGRSEVTGANVLVAIFSEQESQAAYLLRKHEVSRLDIVNFISHGTRKDEPSQSSDLGNQPTGDEQAGGEERMENFTTNLNQLARVGGIDPLIGREKELERAIQVLCRRRKNNPLLVGESGVGKTAIAEGLAWRIVQGDVPEVMADCTIYSLDIGSLLAGTKYRGDFEKRFKALLKQLEQDTNSILFIDEIHTIIGAGAASGGQVDAANLIKPLLSSGKIRVIGSTTYQEFSNIFEKDRALARRFQKIDITEPSVEETVQIINGLKPKYEAHHDVRYTAKAVRAAVELAVKYINDRHLPDKAIDVIDEAGARARLMPVSKRKKTVNVADIESVVARIARIPEKSVSQSDRDTLKNLGDRLKMLVFGQDNAIEALTEAIKMSRAGLGHEHKPVGSFLFAGPTGVGKTEVTVQLSKALGIELLRFDMSEYMERHTVSRLIGAPPGYVGFDQGGLLTDAVIKHPHAVLLLDEIEKAHPDVFNLLLQVMDNGTLTDNNGRKADFRNVVLVMTTNAGVRETERKSIGLIHQDNSTDAMGEIKKVFTPEFRNRLDNIIWFDHLSGEVIHQVVDKFIVELQAQLDQKGVSLEVSQEARDWLAEKGYDRAMGARPMARVIQDNLKKPLANELLFGSLVDGGQVTVALDKEKNALTYDFQSAQKHKPEAAH >NZ_CP019416|934626:941939|941561_941939_-|WP_001531374.1|DBSCAN-SWA MHKYLSVVKKHRVHLSDAAVDLLKDLPRLKDNNHVFPAPRAETLSDMSLLAVLKRMGYIDLTQHGFRSTFREWAGEATDYQREVIEHALAHQLADKAEAAYQRGTLWPKRVALMDDWTGYSTANS >NZ_CP019416|934626:941939|935741_937688_+|WP_000125890.1|DBSCAN-SWA MTALLELRNVSRSYPSGEEQVAVLKDISLQIHAGEMVAIVGVSGSGKSTLMNILGCLDKPTSGTYRVAGRDVSTLDPDALAQLRREHFGFIFQRYHLLSHLTAAQNVEIPAVYAGIERKKRQARARELLLRLGLSDRVDYPPSQLSGGQQQRVSIARALMNGGQVILADEPTGALDSHSGEEVMAILRQLRDRGHTVIIVTHDPLIAAQAERIIEIHDGKIVHNPPAEEKKREQGVDAAVVNTAPGWRQFASSFREALSMAWLAMAANKMRTLLTMLGIIIGIASVVSIVVVGDAAKQMVLADIRAMGTNTIDIHPGKDFGDDNPQYRQALKYDDLVAIQKQLWVNSATPSVSKSLRLRYGNIDIAVNANGVSGDYFNVYGMSFREGNTFNDVQQQDRAQVVVLDANTRRQLFPNKANVVGEVVLVGNMPVIVIGVAEEKPSMYGNSNLLQVWLPYSTMSDRIMGQSWLNSITVRVKDGVDSDQAEQQLTRLLTLRHGKKDFFTWNMDSVLKTAEKTTYTLQLFLTLVAVISLVVGGIGVMNIMLVSVTERTREIGIRMAVGARASDVLQQFLIEAVLVCLVGGALGISLSMFIAFMLQLFLPGWEIGFSLTALASAFLCSTFTGILFGWLPARNAARLDPVDALARE >NZ_CP019416|934626:941939|934626_935745_+|WP_001201751.1|DBSCAN-SWA MRAKGKKFKKRYLVIILILLVGGMAGWRMINAPLPTYQTLIVRPGDLEQSVLATGKLDALRKVDVGAQVSGQLKTLLVSIGDNVKKDQLLGVIDPDQAENQIKEVEATLMELNAERQQAAAELKLARVTLARQQQLAKTQAVSQQDLDTAATEMAVKQARIGTIDAQIKRNRASLDTAKTNLEYTRIVAPMAGEVTQITTLQGQTVIAAQQAPNILTLADMSTMLVKAQVSEADVIHLRAGQKAWFTIAGDPQTRYEGVLKDILPTPEKINDAIFYYARFEVPNPKRILRLDMTAQVYIQLMDVKNVLIIPLAALGEPVGGNRYKVALLRNGEKREREVVIGERNDTDVEVVKGLEAGDEVIIGESRPGATP >NZ_CP019416|934626:941939|941202_941400_-|WP_001117984.1|DBSCAN-SWA MPGLIGYWKQLPTKDEYIKKHNMSKISCYSCGHEKFSDVGLIQVWDNHRRILCAKCKTTLFREED >NZ_CP019416|934626:941939|938362_938683_+|WP_000520789.1|protease|DBSCAN-SWA MGKTNDWLDFDQLVEDSVRDALKPPSMYKVILVNDDYTPMEFVIDVLQKFFSYDVERATQLMLAVHYQGKAICGVFTAEVAETKVAMVNKYARENEHPLLCTLEKA >NZ_CP019416|934626:941939|937817_938039_-|WP_000447499.1|DBSCAN-SWA METGTVKWFNNAKGFGFICPEGGGEDIFAHYSTIQMDGYRTLKAGQSVRFDVHQGPKGNHASVIVPIEAEAVA |
7 | Dickeya_phage(16.67%) | protease | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1013649 : 1024443
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP019416|1013649:1024443|DBSCAN-SWA TATGTACGGTGTACAGGGAACGCCTGACTGTTACCGGATTGAACTGAAAAATGTTTATGGTGTACAGGAGAATCTGATCTCATACCGACAGGCATCGCTGGGGGCATGGGTAGCGATTGCTGGTGGCGGCGATCCTTATGAAGTGGCTTACGCTATCTATAAAGCCGTGCCAGATATCTCCGTACTGACGAATGATGTAGTGAATCCATCAGGCGCTGCGGTGGATAAAAAAACGATACCGATCATTGTGTATCCGGATACGTATCACGTGCCGTTTGTAGTGCCATCATCACAAAACGTTACGCTTTTAATCACCTGGAATACAGCCTCAACCAGCTATATCGATCCAACCGGGATTGAAAAAGCAGTGCAGCAAAGCATTGCTGATTACATTAACGGAATTGCAACGGGTGAACCAATAAACATTTTCCTGATTCGGGATATTTTTCTTAATCAGGTTAAGGGGCTTGTATCTTCAAACCTTGTATCAATGATTGATATTCAGGTTGGAATAAACGGAAAAATTGTCCCACCTGCAACCGACTCCAGCCTGGTTTATGGTGATACTTACGCCTATTTTTCCACTTCATCTTCACAAATTCAGGTTAAGCAATATGGCAGCTCTTCTTGAAAGCATTATTCCGGCCTACCCCTATACGCAATATAATGACGATCCGGATATAGTTGCCTTTTTTGATGCTTATAACAAACTGGCACAGGGGTATCTTGATTACTTTAACAACCTGAATTTACCTTGCTGGACCTCCCCGGCGATTACCGGTGAGTTGCTGGACTGGATTGCGGCGGGTATTTATGGGGAATCACGCCCCTTGCTTCAAATCTCCGAGGATGCCATTGCTCGTGGGGCGTATAACACTATTGAGTACAATAATGTCGCGTATGCAAAACTGAGAAATTATGTTCCCGGCTCAGCGTCATATGTTCCGGACGACTATTTTAAACGGATACTGACATGGAATTTTTATAAAGGCGATGGTTCGCACTTCTGTATCAACTGGTTCAAACGACGGCTTGCACGCTTTATACATGGAGCTAACGGAATAGACCCACCTGTACAGTCCACTTTTGATATTAGTGTAATGCCCGATAAGGGCATTTTTTTTGTCTCCATTCCTGACTATGGCGATGGTGTCGGACACTTTCTTAAAGATGCAATTGACCAGTCGCTGGTGAAACTCCCTTTTATTTATACCTATTCGGTAACGGTGGTTGAGCAATGATTATTGGATTCGGAAATAATGTCGTCTCCTCACTGGCGGCTGATATTACCGCCAGCCAGACGACCATTCAGGTGATGCCTGGTGTGGGAGCGATGTTTGCTAATTTGCTGACCAGCGATTATGCAAACAGCTCAAACCCTCTTAAAACTTACGCCAAAATTACACTGACAGACGCAAAAGAAACAGTTTTTGAGGTATGCCATCTGACAGCAGTTAATAATGACATGCTGACGGTTATTCGCGGTCAGGAAGGTACAACAGCGAAGGGATGGTCACTGAATGACGTTATAGCGAATTTTGCGACGCGAGGATCTGAAAATCAGTTTGTACAAATTGAAGAGCTCCAGAGTGGGCATTATGTCGCTGGTGTGGCCGGAGGTACAGAAAATAATCTGACGCTGGAGTTACCAGCAACTTATTTCGTCAATGGTGGAGTTGACTGGACATTGCGCACTCCACTTGTGGTTATTCCGGCGCTAAACAATACCGGAGCCAGCACTCTGCAACTGACGATGGGAGGACGTGTGCTTGGCATATTCCCACTATACAAGGGGAATAAAGCAGAGTTATCGGCCAATGATATTATTAAAGATATTCCTGTCTTATGCGTTCTGGATAATACAAAAACCTATTTTTCTGTGCTTAATCCCCTGGAGATTTATTTGGGATCACGGTATTTGCAGAAGGACCAGAACCTGTCCGACGTACCGGATAAGGCCAAAGGTCGCTCCAGTCTTGAGGTCTACAGCAAAACCGAAAGTGATGAAAACTACATGGCTAAAAGCCAGTGTGGTGCGGATATCCCGAATAAGCCGCTGTTTGTACAAAATATCGGAGCGCTCCCTGCATCAGGTACGGCTGTTGCAGCGAACAGACTGGCATCACGCGGCGCGCTTCCGGCACTGACTGGTACGACAAGAGGCAGCGATAGTGGCCTGATAATGGGCGAGGTTTACAACAATGGCTATCCGACGCAATACGGAAATATTTTACGTCTGACCGGAACCGGTGATGGGGAAATCCTCATTGGCTGGAGCGGGACAAATGGTGCGCCAGCGCCCGCATATATTCGCAGTCATCGAGATACCGCCGATGCTGAGTGGTCCGAATGGGCAATGCTCTACACCACACTAAACCCACCTCCGGATTCGCATCCAGTAGGGGCGGCGATTGCATGGCCGTCTGATGCTACTCCGGCAGGTTACGCTCTGATGCAGGGGCAGTCCTTCGATAAATCTGCTTACCCGTTACTGGCTATAGCGTATCCGTCCGGCGTTATCCCTGACATGAGAGGCTGGACAATAAAGGGTAAGCCCATCAGTGGACGTGCCGTATTGTCGCAAGAAATGGACGGCAATAAATCGCACTCGCACACCGCGCGGGCGCAGGTTACTGACTTAGGGGCAAAATCTACCTCATCCTTTGATTACGGCACGAAATCGACCAATACCACGGGCAATCATACTCACCAGTTCGGCGGTTATATCAATTCATACTGGGGAGATTCCAATCACACCTCATTTCAGCCAGGAGGTGGTGCATGGACACAGGCCGCTGGCGACCATGCACATACAGTTTATATCGGAGGACATGAGCACACCATGTATATCGGTCCACACGGACACGTCGTTATTGTGGACGCAGACGGTAATGCGGAAACCACGGTTAAAAATATTGCATTTAACTACATAGTGAGGCTGGCATAATGACTTTTAAAATGAGCGAACAGGCGCAGACAATTAAAATTTTTAATCTTCGTTCAGATACAAACGAATTTATTGGCGCAGGTGATGCATATATCCCGCCGCACACGGGATTACCGGCAAACTGTACTGATCTCGCCCCTCCTGATATTCCCTCCAGTCATATTGCTGTATTTGACGCTGAAACCCAGACGTGGAGTTTGCATGAGGATCACCGCGGCGAGATGGTTTACGACACAACAACCGGCAATCAGGTTTATATCTCCGCTCCTGGTCCGTTGCCCGAAAATGTCACATCAGTTTCACCAGGTGGTGAATACCAGAAATGGGATGGTAAGGCTAAGGTCTGGGTAAAAGACGAAGCGGCTGAAAAAGCAGCGCAGCTTCGTCAGGCGGAAGAAACCAAAAGCCGTCTTTTGCAAATGGCATCTGAAAAAATCGCGCCATTGCAGGATGCAGTTGATCTTGGAATCGCAACAGATGATGAGAAAGCGCGGCTCGACGAATGGAAAAAATATAGGGTGCTGGTAAACCGGATGGATACAGCCGCCCCTGACTGGCCGGAAAGACCAGCCAGCCAGTAGGCGTTACTGTGGTAATGCAGGCCACCTGATTGCTGTGAAGGTGGCCTCATCTGAAACACCTGTTAAGTCCAGCGATTTAAGAACATTGATATAATTCATCGGCAGAATCAAAGCTGCTTTATTTTCATCACTGATAATTCCCAACGTTAATTCTGTGCGCCAGTCCTTAATGGCATTATCTGCATCGTTAAGTAATTTCTGCCGGGTGACTTCTGCCCGCCAAAATTGAACGCCCGGGCGCTGATGCCGGATGATCTGGTCATCGTGGAAAGCGACCCTGAAAAAATCGACACTTTAGCTGTAAAATGACAGTCCCGCCATCCGGTCATCATAACGGATTTTTCTTCTGCACCTTCTGAAGCCCGCCATGGCAGGACGACCATGAATCCGCCGATAACCTTATTGTGAAATTAAGACCAGGAAGAGATGATGTCTGTCGGACAGATACTATATGTAAATTTATAAAGGTTTTTTGTTATGCCCTTTCATATTGGAAGCGGATGTCTTCCCGCCATCATCAGTAACCGCCGCATTTATCGTATTGCCTGGTCTGATACCCCCCCTGAAATGAGTTCCTGGGAAAAAATGAAGGAATTTTTTTGCTCAACGCACCAGACTGAAGCGCTGGAGTGCATCTGGACGATTTGTCACCCGCCGGCCGGAACGACGCGGGAGGATGTGGTCAGCAGATTTGAACTGCTCAGGACGCTCGCGTATGACGGATGGGAGGAAAACATTCATTCCGGCCTGCACGGGGAAAACTACTTCTGTATTCTGGATGAAGACAGTCAGGAGATATTATCAGTCACCCTGGATGACGTCGGGAACTATACCGTAAATTGCCAGGGGTACAGTGAAACACATCACTTAACCATGGCAACAGAACCGGGAGTGGAACGCACAGATATAACTTACAACCTAACCAGTGATATTGATGCTGCGGCCTATCTGGAGGAATTGAAACAGAATCCAATTATAAATAATAAAATAATGAATCCGGTAGGGCAGTGTGAGTCATTAATGACTCCTGTAAGCAATTTTATGAATGAAAAAGGGTTCGATAATATTCGTTATCGAGGTATATTTATCTGGGATAAACCAACAGAGGAAATACCAACAAATCATTTTGCAGTGGTTGGAAATAAAGAAGGGAAAGACTATGTGTTTGATGTTTCAGCCCATCAGTTTGAAAATAGAGGTATGAGTAATCTGAATGGCCCATTAATTCTTTCAGCAGATGAATGGGTGTGTAAATATAGAATGGCAACAAGAAGGAAACTTATTTATTATACTGATTTTAGTAATTCAAGTATAGCAGCTAATGCCTATGATGCATTACCACGAGAATTAGAATCAGAATCTATGGCAGGGAAAGTTTTTGTTACATCACCGAGATGGTTTAATACCTTTAAAAAGCAAAAATATTCCTTAATAGGTAAAATGTAAGCGCACCGTGGAGGACGTCTGTCAGAACCCTGTCAATCCGGCGATGATAGTGTCCACTTAAATTTTGATGGACACTATCACAGATGACAGAGTCCACAGCCCGGCGCAAACACGGCTGTCGGTCAGGAAAGAGAAAAGCCAGTCGCTGTACGACTGGATACAGGCGCAGTTGAAAACGTTGTCGGTGCATGCGGAGATGGCGAAGGAGTTCGGTTACATGCTGAAGCAGTGTGATGCGTTGAGCGTGTCCTCTGCAGCGACGGTCGGGTGGAGATCGACAACAACATCTGTGAAAACGCCTTACGGTGCGTGGCGCTGGGCCGACGTAACTATCTGTTCTTCGGCTCAGACAGGAGCGGCGAGGCAGCGGCGATCATCTACAGCCTGCTGGGTACGTGCAAACTAAACGGCGTAGAGTCCGAGGCATGGTTACGCGACGTGCTGTGGAAAATCAGCGACTGGTCATCGAACCGGGTGCACGAACTGCTGCCCTGGAACCTCGAAACCGTAAAATAATCCTTACGCTACGTCCTAAACGGGACGCTTACTGAGTTATGAGGGTGAGGTGGTTTTCTTAATGAGCATCTGAACGCCGGTTTGTGGGATAGGGCAGGTGGGACGGATTCGGGACAGTCATCCGTTTTTAACTATTGGCATGGATTGGTATCTTTTCGCATCATGGGACGTGTGAGCGCAGGTATGACGCGGTATGTTATTGACTTAGAATGTGGTTCCAGGAACTGGCGCATCCATACCCGAGCAGTTTCCGGTGTCAGTGCCAGCGGGCGACGATCGTGAATGTCTACCAGACCTTTATCGGCTGCGGAGGTAACAATCAGGAATCCCTCTGCTTCATCACCGCGCTCAAACGGCGTACTGCCAATGGCAGCCATGAATATCGGCTTCCCGTCCTTTCTGTGAATGAAATACGGCTGTTTTTTGTCGCCTTCCTTCTTCCACTCGAACCATCTATCGGCAAAACAGATAGCCCGGCCATGCTGCCATAGTGGCTTAAACATTCTGCTGGAGGCCGCTGTCGCGACACGGGCGTTAATAAGTGGAGCTTTATCCCACCATCCGGGAGCGTAACCCCAAATCACCGGGTCGAGATGTAATTGCTCGTCGCGTTCGCTCAATAGCAGGACTTTAGTCCCGGGCGCCACGTTATACCGGCCTATAGGCTGAGGGTCATAAGCAATATTACGATCGGCTTCGTCGGCCAGATATGCCAGATATTCTTCACGGGTCTGTGCTTGTGCAAAGCGTCCACACATATGAAACCTCCAGTCGTCAGACTGAAAGTATAGGGCAGGGGGAAAATGTGGCGCGCTCCGGTAATGATTTACAGGGAATTTATACAGTAATTCGAAATGAAAATTTGTGTATTGATGAATTCCGAAACTGAGCAGTGAGCAAACTGATGAATCAATCGCAAATTGCGTCGAATCTCGACGGCTTTATTCCCCAGTTTCACCCCATAGCTTCCCCGTAGGAAATTGAGCCATAAAAAAAACAGCCCTGACAGGCTGGTTTTTAAGGGGAATTTTGGTCGGCACGAGAGGATTTGAACCTCCGACCCCCGACCCCCCATGTTGGCGGTAGCCTGAATAATTATCTTGGTAAAGGTTAACTATCATAAAATGGTACACCAGTCTTTCCAGGAGGAGGAGTGTAAAGGTTTTGGCTCATAAACACCGTCATAGTAAAGACCACAAATACTGCAATCTTCATTTTTATATTTATCATTGAATGCTAAGGTAAGCTTGTCATAGACATCAATATTTGATTGCCATTCAGGTGAGGTATTCATTGAATCATAAATTAATACTTTTTTCACTTCACTATCATAGCCTACAATGCACTCTGCATGTAGACATTCCGAGCCAAGTGAAGGCCTGATCAACATCAGTGGTCCATGGTTTTGTAATTCGTAAGAAATAAAACTCTCAAAGTCATCTTCTACAGTACTATTGAAAACTTCTGCCTGTAAAGCTTCTTTAATGTTTGTAAAAAGAGAGGATTCAGGTATTTTTTGCACATTCATTTCTTTTAATAAATTCTCACATTCTATTATATCAATACCCTCTAATATATTATTAAACGCTTGATTATCAGAAGTGATGTCCTCTAATGCACTATAATTATTGCCATCTCTGGATTTGATAACATTTAATGAGCAAACCCAGCAATTATTGTAGAGGGTGTTGCCTGTGGTATCAAACTGGCTTTGAAAGTTACGATGGTGAATAATCTGACCCTGAGGAGTTGCTGTATTACTTCTGTAAACGCTGCCTAAACTATTTTGAATGTGTCTTAACATAATATACTCGCCGAATAGTAATTTTGTTAATGTAATTATATACTACAGTGTGGATATTAATACAATTCTTTTGTTGTTAATTATTATTTATGAAATTAATTGAAAGTGAATAAGTTAGAGGTGTTTGTTGGCCTTAAAATTACATTTGTTGAGGGGGCTTATATGATATGTTTTTATTGTATTGTCGCATTTTTCTTAAGCTGAATCCGGATTTTGGGGAGGTGGCTAAATGTAAATGACGTGGTTTAAGATAAATCTATTTTTAATAAGCTATCTGTTCAAATTTTCGCGATCGCTTTTGTTGGTATCACTATTCAAGCAGTTTGCCTGCATCGGCTTCACCCTCACTTCGGCATCAGGGAAAATCTGGTGCACCTGCTTCGTCAGTTCGGCCAGAATGATCTCGCTGGCCCCTTCGAGTCCTTCAACATTACGCTTGTCATAAACCAGTTCTACGAACATACGTGTTTCGCTAATAACTGTTTGTATATACAGTATTTTTGCTTTGGCGGTTTTGTCTGTCAAGGCATGAACCACTTGTGTTTAAATTTTGGGGAACATACTGCGGGCGTGTTTGTTATCGATTTTCCCTGCAGGGCTGATGGGGTCTGGCGTTGACTAAAATTATGTGTGGGGCATGGATGGGGCAAAAGTGGTCTGTGAAGTTCGTTAAAGTTCGTTAATCAAGCTTCATCTCGATCTCGCTCATCCCTTGTTTAAAGCGCTCCTGGACGATCTTTATCGATTTTAAAAACTATGAGTACATATTATAAAAATGTAGCAAATAGGCCGTTTGTGCCTGAAAAGATGAACATTCTGCGTAGCGCGATTTGCGCAACAGGAATAGACTGGAGTCGACACTCTACACAAAGATGCGAAAGGTTTTTTATGACACAACAGCCACAAGCCAAATACCGCCATGACTATCGCGCGCCGGATTACCAGATTACTGATATTGACTTGACCTTTGACCTCGATGCCGAAAAAACCGTGGTCACCGCAATAAGCCAGGCTGTTCGTCATGGCGCGCCTGATGCGCCTCTTCGCCTTGATGGGGAAGATTTAACGCTGGTATCTATCCACGTCAACGATGCGCCGTGGACAGCATATAAAGAAGAAGAGGGCGCGCTTGTCATCAGCGACCTGCCAGAGCGTTTTACGTTACGCATTGTCAACGAGATAAGTCCGGCGGCGAATACGGCGCTGGAAGGATTGTACCAGTCAGGCGATGCGCTCTGTACCCAGTGTGAAGCGGAGGGCTTCCGCCATATTACCTGGTATCTTGACCGCCCGGACGTACTGGCGCGATTTACCACCAAAATTATTGCCGATAAAAGCAAATATCCGTTCCTGCTCTCCAATGGCAACCGTGTTGCACAGGGCGAGCTGGAGAATGGCCGTCACTGGGTTCAGTGGCAAGATCCGTTCCCGAAACCGTGTTATCTGTTTGCGCTGGTGGCCGGTGATTTTGACGTGCTGCGCGATACCTTTACCACCCGCTCCGGGCGTGAAGTCGCATTAGAACTGTACGTTGACCGTGGCAATCTGGATCGTGCGCCGTGGGCAATGACCTCGCTGAAAAATTCCATGAAATGGGATGAAGCGCGTTTTGGGCTCGAATATGACCTCGACATCTATATGATTGTCGCGGTGGATTTCTTTAATATGGGCGCGATGGAGAATAAAGGTCTCAATATCTTTAACTCCAAATACGTGCTGGCGCGAACCGATACCGCGACGGATAAAGATTATCTCGATATTGAGCGCGTGATAGGCCATGAGTATTTCCACAACTGGACCGGCAACCGCGTCACCTGCCGCGACTGGTTCCAGTTGAGCCTGAAAGAGGGGCTAACCGTGTTCCGCGATCAGGAGTTTAGCTCTGATTTGGGGTCACGCGCGGTGAACCGCATCAGTAACGTGCGTACCATGCGCGGTTTACAATTCGCGGAAGACGCCAGCCCGATGGCGCATCCTATCCGCCCGGATAAAGTAATCGAAATGAATAACTTCTACACCCTCACCGTTTATGAAAAGGGCGCGGAAGTCATTCGCATGATCCACACGTTGCTGGGTGAGGAAAATTTCCAGAAGGGGATGCAGCTTTATTTTGAGCGCCATGACGGCAGCGCCGCGACGTGTGATGACTTCGTACAGGCGATGGAAGATGCTTCTAATGTCGATTTGTCCCATTTCCGCCGCTGGTACAGTCAGTCCGGCACGCCGATTGTAACGGTAAAAGATGATTATAATCCGGAAACCGAGCAGTACACGTTGACCATCAGCCAGCGCACTCCGGCGACGGCGGATCAGGCGGAGAAGCAGCCGCTGCATATTCCATTCGCCATCGAACTGTACGATAACGAAGGCAACGTCATTCCGTTGCAAAAAGGCGGTCACCCGGTCAACGCCGTGCTGAACGTCACGCAGGCGGAGCAGACATTTACCTTCGATAATGTTTACTTCCAGCCTGTTCCGGCCTTGCTGTGCGAGTTTTCAGCGCCGGTGAAGCTGGAATATAAATGGAGCGATCAGCAGTTGACGTTCCTGATGCGCCATGCGCGCAATGATTTCTCCCGTTGGGATGCGGCGCAAAGCCTGCTGGCCACATACATTAAACTGAATGTGGCGCGTCATCAGCAGGGGCAACCGCTATCGCTTCCGGTGCATGTCGCTGATGCGTTCCGCGCAGTACTGTTGGATGAGAAAATCGATCCGGCGTTGGCCGCAGAAATTTTAACGCTGCCTTCGGCCAATGAAATTGCGGAGCTGTTTGAGGTCATTGACCCGATCGCCATTGCGCAAGTTCGTGAAGCGCTAACGCGTACGCTGGCGGCAGAACTGGCGGATGAGTTCCTGGCTATCTATAACGCCAATCATCTGGATGAGTATCGTGTTGATCACGGCGATATCGGTAAGCGCACGCTGCGCAATGCTTGCCTGCGCTTCCTGGCGTTTGGCGAGACGGAGCTGGCTAATACGCTGGTCAGCAAACAGTATCGCGACGCCAATAATATGACCGATGCGCTGGCGGCCCTGTCTGCTGCGGTGGCGGCGCAGTTGCCGTGCCGCGATACGCTGATGCAGGAGTATGACGATAAGTGGCATCAGGACGGCCTGGTGATGGATAAATGGTTTATCCTGCAATCCACAAGCCCGGCGGAAAATGTACTGGAAACCGTACGCGGCCTGCTCAAACACCGTTCTTTCAGTATGAGCAACCCGAACCGCATCCGTTCATTAATTGGCGCGTTTGCTGGCAGCAACCCGGCGGCGTTCCATGCGCAAGACGGTAGCGGATACCAGTTCCTGGTCGAGATGCTGACCGATCTGAATAGCCGTAACCCGCAGGTAGCATCTCGCCTCATTGAACCGCTGATTCGTCTGAAACGTTATGATGATAAGCGTCAGGAGAAAATGCGTGCGGCGCTGGAGCAGTTAAAAGGACTGGAGAATCTTTCCGGCGATCTGTACGAGAAGATAACTAAAGCGTTAGCCTGA
Protein sequences of DBSCAN-SWA_2 >NZ_CP019416|1013649:1024443|1017653_1018622_+|WP_001674638.1|DBSCAN-SWA MPFHIGSGCLPAIISNRRIYRIAWSDTPPEMSSWEKMKEFFCSTHQTEALECIWTICHPPAGTTREDVVSRFELLRTLAYDGWEENIHSGLHGENYFCILDEDSQEILSVTLDDVGNYTVNCQGYSETHHLTMATEPGVERTDITYNLTSDIDAAAYLEELKQNPIINNKIMNPVGQCESLMTPVSNFMNEKGFDNIRYRGIFIWDKPTEEIPTNHFAVVGNKEGKDYVFDVSAHQFENRGMSNLNGPLILSADEWVCKYRMATRRKLIYYTDFSNSSIAANAYDALPRELESESMAGKVFVTSPRWFNTFKKQKYSLIGKM >NZ_CP019416|1013649:1024443|1016594_1017176_+|WP_076730918.1|tail|DBSCAN-SWA MTFKMSEQAQTIKIFNLRSDTNEFIGAGDAYIPPHTGLPANCTDLAPPDIPSSHIAVFDAETQTWSLHEDHRGEMVYDTTTGNQVYISAPGPLPENVTSVSPGGEYQKWDGKAKVWVKDEAAEKAAQLRQAEETKSRLLQMASEKIAPLQDAVDLGIATDDEKARLDEWKKYRVLVNRMDTAAPDWPERPASQ >NZ_CP019416|1013649:1024443|1019269_1019896_-|WP_000334547.1|DBSCAN-SWA MCGRFAQAQTREEYLAYLADEADRNIAYDPQPIGRYNVAPGTKVLLLSERDEQLHLDPVIWGYAPGWWDKAPLINARVATAASSRMFKPLWQHGRAICFADRWFEWKKEGDKKQPYFIHRKDGKPIFMAAIGSTPFERGDEAEGFLIVTSAADKGLVDIHDRRPLALTPETARVWMRQFLEPHSKSITYRVIPALTRPMMRKDTNPCQ >NZ_CP019416|1013649:1024443|1020255_1020942_-|WP_001525490.1|DBSCAN-SWA MLRHIQNSLGSVYRSNTATPQGQIIHHRNFQSQFDTTGNTLYNNCWVCSLNVIKSRDGNNYSALEDITSDNQAFNNILEGIDIIECENLLKEMNVQKIPESSLFTNIKEALQAEVFNSTVEDDFESFISYELQNHGPLMLIRPSLGSECLHAECIVGYDSEVKKVLIYDSMNTSPEWQSNIDVYDKLTLAFNDKYKNEDCSICGLYYDGVYEPKPLHSSSWKDWCTIL >NZ_CP019416|1013649:1024443|1013649_1014279_+|WP_000274547.1|DBSCAN-SWA MYGVQGTPDCYRIELKNVYGVQENLISYRQASLGAWVAIAGGGDPYEVAYAIYKAVPDISVLTNDVVNPSGAAVDKKTIPIIVYPDTYHVPFVVPSSQNVTLLITWNTASTSYIDPTGIEKAVQQSIADYINGIATGEPINIFLIRDIFLNQVKGLVSSNLVSMIDIQVGINGKIVPPATDSSLVYGDTYAYFSTSSSQIQVKQYGSSS >NZ_CP019416|1013649:1024443|1014885_1016595_+|WP_076730917.1|tail|DBSCAN-SWA MIIGFGNNVVSSLAADITASQTTIQVMPGVGAMFANLLTSDYANSSNPLKTYAKITLTDAKETVFEVCHLTAVNNDMLTVIRGQEGTTAKGWSLNDVIANFATRGSENQFVQIEELQSGHYVAGVAGGTENNLTLELPATYFVNGGVDWTLRTPLVVIPALNNTGASTLQLTMGGRVLGIFPLYKGNKAELSANDIIKDIPVLCVLDNTKTYFSVLNPLEIYLGSRYLQKDQNLSDVPDKAKGRSSLEVYSKTESDENYMAKSQCGADIPNKPLFVQNIGALPASGTAVAANRLASRGALPALTGTTRGSDSGLIMGEVYNNGYPTQYGNILRLTGTGDGEILIGWSGTNGAPAPAYIRSHRDTADAEWSEWAMLYTTLNPPPDSHPVGAAIAWPSDATPAGYALMQGQSFDKSAYPLLAIAYPSGVIPDMRGWTIKGKPISGRAVLSQEMDGNKSHSHTARAQVTDLGAKSTSSFDYGTKSTNTTGNHTHQFGGYINSYWGDSNHTSFQPGGGAWTQAAGDHAHTVYIGGHEHTMYIGPHGHVVIVDADGNAETTVKNIAFNYIVRLA >NZ_CP019416|1013649:1024443|1021212_1021404_-|WP_000497441.1|DBSCAN-SWA MFVELVYDKRNVEGLEGASEIILAELTKQVHQIFPDAEVRVKPMQANCLNSDTNKSDRENLNR >NZ_CP019416|1013649:1024443|1014262_1014889_+|WP_000729406.1|DBSCAN-SWA MAALLESIIPAYPYTQYNDDPDIVAFFDAYNKLAQGYLDYFNNLNLPCWTSPAITGELLDWIAAGIYGESRPLLQISEDAIARGAYNTIEYNNVAYAKLRNYVPGSASYVPDDYFKRILTWNFYKGDGSHFCINWFKRRLARFIHGANGIDPPVQSTFDISVMPDKGIFFVSIPDYGDGVGHFLKDAIDQSLVKLPFIYTYSVTVVEQ >NZ_CP019416|1013649:1024443|1021830_1024443_+|WP_000193784.1|DBSCAN-SWA MTQQPQAKYRHDYRAPDYQITDIDLTFDLDAEKTVVTAISQAVRHGAPDAPLRLDGEDLTLVSIHVNDAPWTAYKEEEGALVISDLPERFTLRIVNEISPAANTALEGLYQSGDALCTQCEAEGFRHITWYLDRPDVLARFTTKIIADKSKYPFLLSNGNRVAQGELENGRHWVQWQDPFPKPCYLFALVAGDFDVLRDTFTTRSGREVALELYVDRGNLDRAPWAMTSLKNSMKWDEARFGLEYDLDIYMIVAVDFFNMGAMENKGLNIFNSKYVLARTDTATDKDYLDIERVIGHEYFHNWTGNRVTCRDWFQLSLKEGLTVFRDQEFSSDLGSRAVNRISNVRTMRGLQFAEDASPMAHPIRPDKVIEMNNFYTLTVYEKGAEVIRMIHTLLGEENFQKGMQLYFERHDGSAATCDDFVQAMEDASNVDLSHFRRWYSQSGTPIVTVKDDYNPETEQYTLTISQRTPATADQAEKQPLHIPFAIELYDNEGNVIPLQKGGHPVNAVLNVTQAEQTFTFDNVYFQPVPALLCEFSAPVKLEYKWSDQQLTFLMRHARNDFSRWDAAQSLLATYIKLNVARHQQGQPLSLPVHVADAFRAVLLDEKIDPALAAEILTLPSANEIAELFEVIDPIAIAQVREALTRTLAAELADEFLAIYNANHLDEYRVDHGDIGKRTLRNACLRFLAFGETELANTLVSKQYRDANNMTDALAALSAAVAAQLPCRDTLMQEYDDKWHQDGLVMDKWFILQSTSPAENVLETVRGLLKHRSFSMSNPNRIRSLIGAFAGSNPAAFHAQDGSGYQFLVEMLTDLNSRNPQVASRLIEPLIRLKRYDDKRQEKMRAALEQLKGLENLSGDLYEKITKALA |
9 | Escherichia_phage(37.5%) | tail | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1226300 : 1238911
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP019416|1226300:1238911|DBSCAN-SWA ATTAGAGCATCATATAAGCTTTATCATCACGCTCATCGAGATAGAGTTTCGTGGTGTTCTCTGATGTGTGGCCCAGGAGTTTTTGGGCGAACACCTCGCCGTGCTCGTTTTTGTACAGCCGCCCGGCCAGACTTCGGATCTCGTGAAATGTCGGTGGATTATTGCTGAAGTTAACACCGGAGGCTTTTCTTGCTTTTACAAATGTCTTTGTCAATCCATCCGGATGAATATTCCCGGTTGGGCTATTTTTCCTGATTCCGGCACTGATCATGAAATCAGTGCGGCTTACAAGTCGGCAGCGATCGATTACCGTTCCCAGACGTAACCCCGTCGCCCGAAGTGTCAGGGAGAGGGGAATGGCTATTTTCATTCCGGTTTTAATCTGAGTGACGTATAAGCGGTTGTCAAAAACATCACTAAATTTCATATTTACGATATCCTCCCTACGTTGACCAGTAACGAGCGCTAAATCCATCGCGAGAGGGAACCATGCAGGCATATGCTCTGCTGCCGCTCGTGTGGCGTTATACGTTTCCAGTTGCAGGCGTTCCCTGGCCACCTTAATCTCTGGTATCCGGGTTGCTTCCACCGGGTTTTTCACAATATGCCCTTCGACAATAGCCTCTCTGAACATGTCAGATAGAACTGATCTCATTGCTCCCGCCATAGTGTTTTTTCCCTCGGTTATCCACGACTCAAGAAACTTGGCAATGTGCCTGGTTGTTACTTCTGCCAGTATTATTTCCCCCATTTTTTCGCGTACGGTCGCTAATTGATTACCGCGAATCTTGTAGGTATTAACCGACAGACTCCGGCGCTGTAATAAAACCTCATAGCGATCAATCCATGCGGACACAGTGAATGAGTCAGTTCCTTTTAGCTTTTCAATAAGCGCCACTGGTGTGTGGTTTTGCGCTATGAAGTTGTTTGCCTCTATGGCCTGTGTGATAGCGTCCCTGCGGGCGATCTGACCGAGCGGAAATTCCTTGTCAGTTAGCGGGTTACGCCAGAAAAAAGATTTACTGGCCTTACGGTAGGTGAGGTTCCTCGGAAGGTTAGCATCGTACTTTTTTCGACTCACTGATCAACTTCTCCAGCAATGCACTCGGTTTTCCGGTGCGCCCGTTTGGGTGGTGCTGTTCAAGCACAAGTCCCACTTTATTCGGCTTGATATAAAACGCGTCCGGATCAACCCGATACGTCCTGCCATGTAATACTGGAGTCGGGTAAATATTTCCGTTTCGCGCCCATCGTCTTAGTGTTGTGAGAGGTGGTGGATCATCGGGATAATTTAATTCACCCCAGGTTTCAAGTCTCACAAAGCTCATAGTCATGTCTCTTTACTTCATGACCGCCGCCAACTATACGGTGTGGCGGTCGGTCAGGGTTGAACATCAATGATCAGGGTAAAATTTAAAGGACTGCTGACCGCCGCCCGGTAAAACTTTTACATCTCCGGCGCGCCGTCCTGTAAATCCTGCCCAATGCGCGGCGCGTATCCTGTCAGCCTGCTCTTCAGTCAGGCAGGGTCCCGGAAACGGGAAGCGAGCTTACCCCACTTACTAAAAGAGGATGGAACTGGCTGACGTAAAACACGATTTATTTGTATCCATAAATAGCGAAATGTAACGTTTTGGTTATATTTAAAAGAGAGAAAATGGTCAGCAATAACTTTAATTGTTTGAATTAACAGATAATTAATCTACCAGACTGAGTGATACAGAATATTTTTACATGAGGGGTACAAATGAGACTTAAGTTGATCGTTAAAAGTTTTGCGCTGGCGGGGCTACTCTCTTCCACTGCGCTGACACCTTTATTTGCACAGGAAGCCCCAAAAGGTGCCACTGCTTCAACCAAGCAAGCTAACGATGCGCTTTATAACCAACTTCCTTTCTCTGATAACACCGATTTCACGAATGCCCATAAAGGCTTTATCGCTGGTTTACCTGAAGAGGTGATTAAGGGAGAGCAAGGGAATGTCATCTGGAATCCACAGCAGTACGCTTTCATAAAAGAAGGGGAAAAATCTCCTGACACTGTTAACCCTAGTCTGTGGCGTCAGTCCCAGCTAATCAATATCAGTGGCTTGTTTGAAGTCACAGACGGCGTCTACCAGATTCGTAACCTTGATTTATCCAACATGACGATTATCGAAGGTAAAGAGGGGATTACGGTTGTCGATCCGCTGGTTTCTGCGGAAACAGCCAAAGCCGGTATGGATTTGTATTTCAAAAACCGTGGCAATAAGCCTGTTGTCGCCATCATTTATACTCATAGCCATGTTGACCACTATGGCGGTGTGCGTGGCGTTGTCGATGAAGCGGACGTGAAATCCGGCAAGGTGAAAGTGTATGCGCCTGCTGGCTTTATGGAGGCAGCAGTAGCCGAGAATATTATGGCCGGCAACGTGATGAGCCGCCGTGCCAGCTATATGTATGGCAACCTCCTGAAACCAGATGCCTCCGGCCAGGTTGGCGCCGGACTGGGGACGACCACCTCTGCGGGGACGGTGACACTGATTGCGCCCACTAATATCATCGATAAAGACGGCCAGAAAGAAGTGATTGATGGCCTGACTTACGACTTTATGCTGGCCCCTGGTTCGGAAGCCCCTTCGGAAATGCTGTGGTTCATCGAAGAGAAGAAACTCATCGAAGCCGCAGAGGACGTCACTCACACCCTGCATAACACTTACTCGCTACGTGGCGCAAAAATTCGTGAGCCGTTGCCGTGGTCGAAATATATCAACGAAGCTATAGTGCGTTGGGGTGACAAAGCTGAAATTATTATGGCCCAGCACCACTGGCCGACCTGGGGTAACGAGAATGTTGTTGGTCTGCTGAAAAGCCAGCGAGACCTGTATCGTTATATCAATGACCAGACTCTGCGCATGGCCAATGAAGGTCTGACTCGCGACGAAATAGCGGCCAACTTTAAACTACCGGATAGCCTGGCAAAAACCTGGGCCAACCGCGGCTATTACGGCTCCACCAGCCATGACGTAAAAGCAACGTATGTGCTGTATCTCGGTTGGTTCGATGGCAATCCGGCAACCCTTGATGAGCTGCCACCCGAAGAAGCGGCCAAGAAATTTGTTGAATACATGGGCGGTGCCGATGCGATTCTTCAGAAAGCTAAAGCAGACTTTGACCAGGGGAACTACCGTTGGGTTGCTCAGGTGGTGAGTAAGGTCGTGTTTGCCGATCCAAATAACCAGAATGCACGTAACCTTGAAGCCGATGCGCTGGAGCAATTGGGGTATCAGGCTGAATCTGGTCCATGGCGTAACTTCTACCTGACCGGTGCGCAGGAGCTGCGTAACGGTGTGGTTAAAGGTCCGACGCCAAATACAGCAAGTCCGGATACCGTTCGGGCGATGACCCCTGAAATGTTCTTCGACTTTCTGGCTGTACATATCAACGGTGAAAAAGCGGGTAATGCCCGGGCGGTATTTAATATTGACCTTGGCAGCGACGGCGGAAAGTACAAGCTTGAGCTGGAAAATGGCGTGCTGAACCACACGGCTAATGCTGAAGCGAAAGATGCTGATGCCACGATTACTCTGAACCGTGACACGCTGAATAAAATTATCCTGAAGGAAGAAACTCTGAAGCAGGCTCAAGATAAAGGAGAAGTCAACGTTACCGGTAATGCTGCGAAACTGGATGAGATGCTGGGCTATATGGACAAGTTTGAGTTCTGGTTCAATATAGTTACACCATAAATAGATTCCCTGCGGCGTCAATGCTGCAGGGAAGTTACTTCAGACAATTCTGTACGTTTTTTATACTCTATTTTCCCTTCATTCTTTATATCTTGCTTCATCTTATGTATTTGCTGCTGAAGAACATGGCCCTGATACCAGTCAGTTCTGATTCTGTTATGCACAGCCTTTTTCATCAGATGACAGTAACTGGTTGTTGCGTGATTCAATGGCCTGCGAGTCTGGCCAACATGCTTTTCGATGCCGGTTGGCCATGATGCCAGTATGGTTAACTGGCATCATGGCAGCATAATTTTGCCGGATAAGTCAACCGCAGCGATGTTAATCGTCCTGATTATCATCTGCATCACTGTCACAGTGACTGCACCAGTAACGAGGAGAGACTGCGATCGAACCGGCCAGACAGAGAGGAGGTAGTTGTCTTCATTGCTAAGTAAGAGACCTTGGGGGATGAATCTCCATCACCTGTGATGTGTCAGACAACCTCAATGTACCCGCACTTAATACCTGCGCCGGCGGTTTTTTTAATGTCCGGGAAATGAGCATGTCAAAAAATAACCAGTTATAAGATTATAAATAGAACACAGAGAAAATGTCATTGCATATGGTCAAAAAATAGATATATTTATTGATGATGATAATTAATAGTCTCCTATATATTCATGTTGAGAATGAAGATGCTTTAAAAATGCTCAAGTTCGTTATCTATGGAGACACCGTGAAAAATTTAAATAAAACATTCACTTGTAAATATGCTGTTATTCGCCGTGATGACATGACAGTAATTGCTGAAATGGATTTTTTTCCTGACTGCAACAGGTCATTGATGTATCGGGATGGCCGCTATGTCCGGTTTCTGCCGTTGTTGCAAAATGACATCATGGGGAGCGATACCCTGATTAATGAGCTGACTATCAGGGCCGGTTATCATGAATAATCATCCTTTGTTATACTCGTCTGCGGGCTGAACTCCCAATCTACTGCGCCAACGGAGAGAACGATGGCGCATTTACAACTGGTCAAGCAAACCTCATCAGGGCTTCTGCTCCCGGCGACGCCGGAGAGTGGGGATTTCCTGCGCTCAGTAAAAATCGGTGAGTGGATACACGCCGATTTTAAACGTGTCCGCAACTACGCCTTTCATAAACGATTTTTTAAACTCCTTCAGCTTGGTTTCGACTACTGGATGCCAACGGGCGGCACGGTCACATCGCGGGAACAGAAACTTATCTCCGGGTTCGTTAATTTTCTTTGCGACTCCGCAGGCCAGGAATATACCCCGGCCCTTAACGAGGCGGCGGAACAGTACCTCCATAACGTAGCTACCCTGCGAACCGGGGACGTCGCCCTTCTTAAGTCTTTCGATGCCTTCCGGGAATGGGTAACCGTTCAGGCCGGGTTTTATACCGAGCATTTTTATCCGGATGGCAGTCGCGGGCGCCGGGCGAAATCCATAGCATTCGCCAGTATGGACGAAACCGAGTTTCAACAGGTCTATAAAGCTGTGCTGAACGTCCTGTGGAACTGGATTCTGTTTCGTAAATTTTCCTCTCTGGAAGAAGTTGAAAATGTGGCCGCGCATCTGCTGGAGTTCGCATGAAAATGACATGGTTTCAGCATCCGGTGTGTACCACCGAAGAGGCGGATGAGCTGGTGGCGGGATACCGGCGCCGTGGCGTGAAGGTTGAGCGTTACGGTGAGGCGGAGGTGCTGGAACTTGAGAGCAATAATACTCCGCAACGTTGGACGGTTGAGGAGCTGAAAGAAATCAGGATCGCTGCACTGGCGGATCTGCGTGCGCTAAAAAAGCTGGAGGCGGCATGACATTCGAATCCTACTTTGCCGATCATCTCCGTGCTCGCTGGTAGCAGTTGTGCTTATACCATTTCCGGGTTCCATCCTGATCGATTACCGGATTTTGAAAAACTACGTGAAGATAACGGGCGGTACCGTATGAATACACAATATCTGGAATATGTGCGACAGCAGCTCATCGTGGCGACTGCAGATCTGAGTGGCGCCACGAAAGGTCAGTTGCAGGCATGGCTGGAGAACGCCCAGCTCTATACGAAAAACTATCCCCGAAAAAAACAGCGTATCAGGGATGAAGTGACCGGAAAAATGATAACGCTGAATAATCCACCGATTGCTGGTAAGCAATCACTGGCGAAAGGAAGCGCAATTCCGCTTGTGCAGCCCGTAGAATACTCCACTTCCTCATGGCGCCGTGCGCTTTTGTCACTCGAAGAACATAATAAGGCCTGGCTATTGTGGAATTACAGTGAAAACACCTGCTGGGAATATCAGGTCACTGTAACTCGATGGGCTTGGGAAAAATTCAGCCAGCAGTTGGAAGGGAAGCGAGTTGCGAAGAAGACTTTAGCACGGTTGCGCCAGCTCATCTGGCTTGCTGCGCAGGATGTGAAGGCGGAACTGGCCAGACGTGAGACGTATGAGTACCAAACGTTGGCGGAACTGATGGGCGTGGCAAAATCTACCTGGACAGAGACGTACATGTCTCATTGGTTAGTAATGCGTAACAGCTTTAAACGGCTTGATAGTGATGCGCTTATCTCCGTAACACGATCGCGTTCACAACAAAAGGCGACAAATTTGGATATAAGTCTTGCAAAACCGAACTGAAATACATATATTTCATGTAAATTTGATATCATGCCTAAAATATGCAAGCCTGCTGAGGAACGGGATTTTTGTATTAATCAGCAATAGGAATTGATGATGTTTTATCGTGATTTATTTCAAGTTTTTGGTCCCGACCCGTTGTATAAGGAAGAAGAAGGAATTGCCATCCTTCGTGAGCAATATGGGATCGAAGCTCCAGAACAAATTTTTAAGCAAATTTATTGTGGGTTATCTAATAATTCTGAATTTCAAACCTTGTATGGGCATCTAAATCTTAAATCACTGAAGTGGGATTTGGTCAGATTGAAAACAGCAGAGTTTACAAAGTTTGGCAGAAATGCCACATATCCTGATTACATGCTCGAGATTTCAGAAGACTTTAATGCCTGCGGCAGCAAGTTTTGCATTGATGCCCGTGAAGAGGTTGCAAACCATTGGCTTAAATTCGGTACATGGGCTGAACCACCGATGTTTATTGAGCGTTCGCTTATTATTCCTGGAGAGAGCGGCTTACACCTTATGGAGGGTCATACAAGATTAGGTACTTTATTGGGGGCTATTAAGTACAAATTTGTGCAGTTAGCTGATACTCATGAACTTTATATAGCCTCGCAGAAATAGTTTAGAGAGGATTGCTCAACACCCTCGGTAAGAATTGCCACACATCAAACTAATGTTAGGGTATCTTCGTCCACAGAGTCGAAATGGCCTTATTTACATCTTCCTGGCTTTTCGCCGGTTTTTTTATTCAGCCCTCGGAAATCATCATCTACACGCTTCGTTGTTAAAACCCCGCCCGAGGGCCTCTCACCCTTACAAATACAGCGCCATCCAAGCTATCGGGGGCGAGGCTTATGAAAATGCACAACGATCCCCATTCAATGGACTCACAATCTATTTTTGCGCTGATTGCGAGTCTGACTTTTTTCGATTGTGAGTCGTTACAGATAGCCGCCGGGCCAGACACCACAACGGTACCAGGTGGCGTTATGTGCTAGAAACCGAAATTCTTGAACATCTCATTACTACTCATATCGTTGGCTGGCGCACGGTTCATCGACTCCATCTGCTTGATCAACCCTATGCTAGCACCAACAGGCCCCCCGACGCTGAAACCGACGCTGCTGGAGCGGGTTTTGCTGGTGCTACTGCTGTGGGTTTGACTACGGGTGTTACAGTTGCTTTGGTGTGTCCGTGTGAGACTATTTTCGCTGAGCTGGCTATCTTCACTTTGGGTGCAGTTTCCCTGTGACTGGCTGTGACTGTCTCCTGTGGTGTGGTTGCGGCTATCCGTGTGCCACTCGCTATAGCCTCCCGCTGGCGCAATGATTCCGACCGATGTACCGTGGGTGGCATAGGGAGGCATGACCATGCCACTACAGCCACCGAGCAGTAACGCGGTGCCGACGATAAAAATGAACTTACCCAGAGCGGTGTTTTTCATTAGTTCCTACCTATTTCTTCTCGTTGGTTATTTCTATGCCCTGCCTTGGCAGGGCTCGCTATTATTCATTCTTCAACTGTGAGATCAAGTTATTTTAATTAAATCGGTATTATTCTTAATTGTGTTCTTTATGAATAAATATCCTCCGGCTATGCCGGAGGATATTTATTATTTCCCCTCATAACTGAGAGGCCCCACACAACCAGAGGGGGATGAATGTCCGAATCGATTTCTGGTACTGGGTTAGCAGGTGGCATCCTGACAGGAGCCAGTGTCTATGGACTGCTGACCTGTATTAGCTCAGACCTGAACTGGTTACTGTGTTGTAGCATCGTGGGATTTTGCATTTTTTGATGAGTGTCAATTACTAAATTCGTAGGCGATTCTTGGTGGTGATGTGTGACCCATCTCTTTTAAAATGATATTGGTATACTCGACTACCGGGCCTCTTGGATTACTGTCTTCTTTGTCCTGAAGGTGAGTCAACGCGTGTACAACTTCGTGAATAAATGAGCGTGTTGTATCAAATTGTTGTGGGCCATCATTACTTTCATAGTACTCTGGTATTGAATCATCGTCTGTATCATCCAGGTTGAGGGCAATCACTTTTCTGCCTTCTGAACTCTCCAGGTCCTCATCAGTTACGGTAGTACCAAAGTTTTCTCCGGCTCCCAGCAACCAGCGTTGTTCTACATCACGCAATTCCTGATCGTAGGCATAATTCATCAGTCTGCGGAATGTCCCGCTTTGAGTGTATGCATCTTCAAGTATGCGTGATAGCACCTCACGGCATTCATCATAGGTATCATCATCAATTTCGATATCAGGATCCATTCCTCCTGGTCCAGAGATAAGGTATTCAGCAAGACACATTGGTTCCAGCCTGGCTTTATCATCGGTAGCAAGACCATCATGTTGGAGGCGTAATTGCGAAGGATTATCTTGGTGTTCTGGAAGGTCTGGAAATACCTTGCTGTCATGAGGATGGGATAATCCATATGTTGACATCATATTATTGATAAATATTGGTTTAATTCCCGTTGGCATGATGAGTTACACATCCTTTTTATTACATGGAATTAACATTCTATAAATAGCATGTTTTTGTCAAACAGAATTCACTCAGCACGCAATCAATTAAACTAAAAGCTAAATTTGCAGTATTTGTGCCTCACCTCCATTAAAATTGTACTCTGCGTGATTTTACTTTCAGATTCTGCAACCACAGGCAATCCTGTTTTACAAGATATTAAACCCTGCAACCCAACCATTTCACTCACTCTAGTTACCATCCGAAATCATCGGAGGTGAGGCTTATGAAAATGAATGACAAGACTCCTGAATTCTGGGCTGCGGTTTTGACCGGACTCAAAAATGCGTGGCCCCAGATACTTGGGGCGTTAATGGCCGGACTCATTGCCTACGGCCGACTGATATACGACGGCGCCACCCGTAAAAATAAATGGCTTGAGGGCGTCCTGTGTGGCGCTCTTTCCTTATGTGTCACCAGTGCGCTTGATGTGGTAGGCCTGCCGGTTTCCATTTCGCCTTTCGTTGGCGGAATTATTGGCTTTGTCGGTGTGGACAAGCTGCGCGAAATCGCAATTAGCGCACTCAAAAAACGTGCAGGGGTTAATGATGAGAATCAGTGAAAAAGGCATTACCCTAATCAAAGAGTTTGAAGGTTGTAGCCTGACAGCTTATCCGGACCCGGGAACGGGGGGAGATCCCTGGACGATTGGTTATGGCTGGACCCACTCTGTTGACGGTAAGCCAGTTAAGCCCGGAATGATGATTGACGAGGCTACTGCCGAGCGCTTGCTTAACACTGGTTTAGTCGGTTATGAAAATGATGTGTCCAGACTGGTTAAGGTCAAGTTGACGCAAGGCCAGTTTGATGCGCTGGTGTCGTTCGCGTACAACCTCGGCGCCCGGACATTATCCTCATCAACTCTGCTGCGGAAGCTAAACGCTGGTGATTACGCTGGCGCCGCTGATGAGTTCCTGCGCTGGAATAAGGCTGGTGGCAAAGTACTGAACGGGCTTACCCGTCGGCGTGAGGCGGAGCGTGCTCTGTTCCTGTCATGATGTTCAACTGGAAAACGATGTTTGTTGGCCTGTTGCTTGTCTCGCTAATTGTTGCCGGTCGGCTGGCAAATCACTACCGAAATAACGCCATCACCTACAAAGAGCAGCGCGATACCGTTACTCATAGGCTGACGCTGGCGAACGCGACAATTACCGACATGACTAAGCGCCAGCGTGACGTTGCCGCCCTCGATGAAAAATACACGAAGGAATTAGCCGATGCGAAAGCTGAGAATGATGCTTTGCGCGATGATGTTGCCGCTGGCCGCCGTCGCCTGTACGTCAACGCAACATGCCCCGCAGTGCCGACAGGTAAATCCACCTCCACCGCCCGCATGGATAATGCAGCCAGCCCCAGACTGGCAGACTCCGCTCAACGGGATTATTTCGCCCTCAAAGAGCGAGTGAAGACGATGCAAAAGCAACTGGAAGGGGCGCAGGCGTACATTCGCACCCAATGCCACGGTAATGCAGGAAAAACTAGTAACCAATGGTGACTGTATTAAAAAGGTACTCCCGGGCAGGGGGCGCCACGGGTGGCTTCGGGCTCGCGGGAATCGGCTGATTTTTGATTTTTTAGTCTCTGTCAGCACTGAAAAATAACCTTAAAAATCAATACATTTACTGTTTTCAGTGTCGAGGTGGTACGTTTTTTGTTCGACACTGAACGCCATTTTCACCGTATACAGGAAAAGAGCACGACTGTGGATCAGGAAATTAAAAGCCTCGAATTAAACATCACACAGCTTTCGGCCATCACTGGTGCACACCGACAGACCATCGCCAGCAGGCTGAAGGGCGTAAAAACCTCAGGTGGGAACGGTAGTAACCTGAAAATCTACCGGCTGGGGGATATTCTGACCGCCATGATGACGATGCCGGCTGTTACCGGGGAGAATGACCCCAATAAGATGAAACCCTCAGATCGACGGGCATGGTTTCAGTCGGAAATGACGCGTATTGAGCTGGAAAAGGAGATGAGAACTCTGATCCCGGCCAGCGAGGTGCTGAGCGTTTAAGGTAGGTGATGCTGATTATACCTCCCTTCTTGTTGGTCCTTCATACCGTTTTAACGACTATCTGAATGCTTACGTGATGATTGGTGCAGCAAACGGACATATTAAGGATAACTGGGGAAATTCTGACAATAAAACCGCCTTTGCTTATGGGGCAGGTATTCAGCTTAACCCGGTTGAAAATATTGCCGTTAATGCGTCTTATGAGCATACAAGTTTTTCCACTGATGCTGACAGTGACGTCAAAGCTGGAACCTAGGTGCTTGGCGTAGGTTACAGCTTCTGACCTTTAACATCGATACAGATTTAATGCCCTCCAGTGAGAGGGCTTTTTTATGGGTAAAACGAAATTATGACGATATGGCTATGTTGCTGTTATTTCTCAATGACACCACAGGCAAAACGTGCACCGCCACCACCCAGTGGAGCAGGTTTATCGGAGTAATTGTCACCGCCTTTATGGATCATCAATGAGTGACCTTTCAGTTCTGACAGTGATTTAAGGCGTGGTGCCAGTAACGGATACGTGGCTGTACCATCTGCATTGACAACCAGTCCAGGCAGATCCCCCAAATGCCCTTTGTCATTATATGGGCCAAGATGTTTCCCGGTTTTTTCGGGGTCAAGATGTCCTCCGGCCATGAGCGCCGGAACCTCTTTACCGTCTTTCATTCCCGGCATACAACTTGGGTTTGTGTGGACATGGAAGCCGTGAATTCCTGGCGTAAGACCATTTAGGTGAGGAGTGAAAAGCAGACCGTAAGGTGTCTCTGAAACTGTGATTTCACCTATGTTTTCTCCTGTTCCGCTGGACAGGGCATCGTTCATCTTTACAGTCAGGGTATTCTCTGCCATTGCTGAACAACTGATGAGCGCACCAGCTACCAGCGACAATATTGTGTATTTCATTAGTTACCTCGTTTTTTGGTTGTATCGTAAATACCATTAATAAAAGCAGGTATATTTTTGCAAGATAAATAATAAAGGATCTCTCATATATGCAGGATATACCACAGGAAACCTTGAGCGAGACCACCAAAGCGGAGCAGTCCGCGAAGGTGGATTTGTGGGAATTTGATTTAACCGCGATTGGCGGTGAGCGCTTTTTCTTCTGTAACGAACCGAACGAAAAAGGGGAGCCGTTAACCTGGCAGGGGAGGCAGTACGAACCGTACCCGATACAGGTACAGGATTTTGAGATGAACGGGAAAGGCGCATCTCCCCGCCCGAACCTCGTTGTTGCCAATCTCTTTGGTCTGGTCACGGGGATGGCGGAGGATTTGCAAAGTCTCGTCGGCGCGTCAGTGGTAAGGCATCAGGTTTACAGCAAGTTTCTTGATGCGGTGAATTTCAGTAACGGCAATCCGGGCGCTGACCCGGAGCAGGAGGCGGTAGCGCGCTATAACGTGGAGCAGTTGTCAGAACTGGATTCATCAACTGCTACCATTATTCTGGCATCACCGGCAGAAACCGACGGTTCTGTGGTGCCGGGGCGTACCATGCTGGCGGACTCCTGTCCGTGGGATTACCGGGATGAAAACTGCGGATACGACGGCCCGCCCGTGGCCGATGAGTTCGATAAGCCCACCTCAGACCCGAAAAAGGATAAATGCAGCCACTGCATGAAAGGCTGTGAAATGCGTAACAATCTGGTGAATGCCGGATTTTTCGCTTCCATCAACAAACTGTCTTAA
Protein sequences of DBSCAN-SWA_3 >NZ_CP019416|1226300:1238911|1236218_1236698_+|WP_001541990.1|lysis|DBSCAN-SWA MFVGLLLVSLIVAGRLANHYRNNAITYKEQRDTVTHRLTLANATITDMTKRQRDVAALDEKYTKELADAKAENDALRDDVAAGRRRLYVNATCPAVPTGKSTSTARMDNAASPRLADSAQRDYFALKERVKTMQKQLEGAQAYIRTQCHGNAGKTSNQW >NZ_CP019416|1226300:1238911|1233663_1234113_-|WP_000798708.1|DBSCAN-SWA MKNTALGKFIFIVGTALLLGGCSGMVMPPYATHGTSVGIIAPAGGYSEWHTDSRNHTTGDSHSQSQGNCTQSEDSQLSENSLTRTHQSNCNTRSQTHSSSTSKTRSSSVGFSVGGPVGASIGLIKQMESMNRAPANDMSSNEMFKNFGF >NZ_CP019416|1226300:1238911|1228046_1230026_+|WP_076730933.1|DBSCAN-SWA MRLKLIVKSFALAGLLSSTALTPLFAQEAPKGATASTKQANDALYNQLPFSDNTDFTNAHKGFIAGLPEEVIKGEQGNVIWNPQQYAFIKEGEKSPDTVNPSLWRQSQLINISGLFEVTDGVYQIRNLDLSNMTIIEGKEGITVVDPLVSAETAKAGMDLYFKNRGNKPVVAIIYTHSHVDHYGGVRGVVDEADVKSGKVKVYAPAGFMEAAVAENIMAGNVMSRRASYMYGNLLKPDASGQVGAGLGTTTSAGTVTLIAPTNIIDKDGQKEVIDGLTYDFMLAPGSEAPSEMLWFIEEKKLIEAAEDVTHTLHNTYSLRGAKIREPLPWSKYINEAIVRWGDKAEIIMAQHHWPTWGNENVVGLLKSQRDLYRYINDQTLRMANEGLTRDEIAANFKLPDSLAKTWANRGYYGSTSHDVKATYVLYLGWFDGNPATLDELPPEEAAKKFVEYMGGADAILQKAKADFDQGNYRWVAQVVSKVVFADPNNQNARNLEADALEQLGYQAESGPWRNFYLTGAQELRNGVVKGPTPNTASPDTVRAMTPEMFFDFLAVHINGEKAGNARAVFNIDLGSDGGKYKLELENGVLNHTANAEAKDADATITLNRDTLNKIILKEETLKQAQDKGEVNVTGNAAKLDEMLGYMDKFEFWFNIVTP >NZ_CP019416|1226300:1238911|1231979_1232669_+|WP_001097218.1|DBSCAN-SWA MNTQYLEYVRQQLIVATADLSGATKGQLQAWLENAQLYTKNYPRKKQRIRDEVTGKMITLNNPPIAGKQSLAKGSAIPLVQPVEYSTSSWRRALLSLEEHNKAWLLWNYSENTCWEYQVTVTRWAWEKFSQQLEGKRVAKKTLARLRQLIWLAAQDVKAELARRETYEYQTLAELMGVAKSTWTETYMSHWLVMRNSFKRLDSDALISVTRSRSQQKATNLDISLAKPN >NZ_CP019416|1226300:1238911|1232765_1233290_+|WP_001574213.1|DBSCAN-SWA MFYRDLFQVFGPDPLYKEEEGIAILREQYGIEAPEQIFKQIYCGLSNNSEFQTLYGHLNLKSLKWDLVRLKTAEFTKFGRNATYPDYMLEISEDFNACGSKFCIDAREEVANHWLKFGTWAEPPMFIERSLIIPGESGLHLMEGHTRLGTLLGAIKYKFVQLADTHELYIASQK >NZ_CP019416|1226300:1238911|1235748_1236201_+|WP_000984586.1|DBSCAN-SWA MMRISEKGITLIKEFEGCSLTAYPDPGTGGDPWTIGYGWTHSVDGKPVKPGMMIDEATAERLLNTGLVGYENDVSRLVKVKLTQGQFDALVSFAYNLGARTLSSSTLLRKLNAGDYAGAADEFLRWNKAGGKVLNGLTRRREAERALFLS >NZ_CP019416|1226300:1238911|1226300_1227380_-|WP_076730932.1|integrase|DBSCAN-SWA MSRKKYDANLPRNLTYRKASKSFFWRNPLTDKEFPLGQIARRDAITQAIEANNFIAQNHTPVALIEKLKGTDSFTVSAWIDRYEVLLQRRSLSVNTYKIRGNQLATVREKMGEIILAEVTTRHIAKFLESWITEGKNTMAGAMRSVLSDMFREAIVEGHIVKNPVEATRIPEIKVARERLQLETYNATRAAAEHMPAWFPLAMDLALVTGQRREDIVNMKFSDVFDNRLYVTQIKTGMKIAIPLSLTLRATGLRLGTVIDRCRLVSRTDFMISAGIRKNSPTGNIHPDGLTKTFVKARKASGVNFSNNPPTFHEIRSLAGRLYKNEHGEVFAQKLLGHTSENTTKLYLDERDDKAYMML >NZ_CP019416|1226300:1238911|1227354_1227633_-|WP_001575998.1|DBSCAN-SWA MTMSFVRLETWGELNYPDDPPPLTTLRRWARNGNIYPTPVLHGRTYRVDPDAFYIKPNKVGLVLEQHHPNGRTGKPSALLEKLISESKKVRC >NZ_CP019416|1226300:1238911|1237592_1238126_-|WP_000877926.1|DBSCAN-SWA MKYTILSLVAGALISCSAMAENTLTVKMNDALSSGTGENIGEITVSETPYGLLFTPHLNGLTPGIHGFHVHTNPSCMPGMKDGKEVPALMAGGHLDPEKTGKHLGPYNDKGHLGDLPGLVVNADGTATYPLLAPRLKSLSELKGHSLMIHKGGDNYSDKPAPLGGGGARFACGVIEK >NZ_CP019416|1226300:1238911|1231622_1231850_+|WP_000784710.1|DBSCAN-SWA MKMTWFQHPVCTTEEADELVAGYRRRGVKVERYGEAEVLELESNNTPQRWTVEELKEIRIAALADLRALKKLEAA >NZ_CP019416|1226300:1238911|1234473_1235160_-|WP_001574215.1|DBSCAN-SWA MPTGIKPIFINNMMSTYGLSHPHDSKVFPDLPEHQDNPSQLRLQHDGLATDDKARLEPMCLAEYLISGPGGMDPDIEIDDDTYDECREVLSRILEDAYTQSGTFRRLMNYAYDQELRDVEQRWLLGAGENFGTTVTDEDLESSEGRKVIALNLDDTDDDSIPEYYESNDGPQQFDTTRSFIHEVVHALTHLQDKEDSNPRGPVVEYTNIILKEMGHTSPPRIAYEFSN >NZ_CP019416|1226300:1238911|1231026_1231626_+|WP_000940753.1|DBSCAN-SWA MAHLQLVKQTSSGLLLPATPESGDFLRSVKIGEWIHADFKRVRNYAFHKRFFKLLQLGFDYWMPTGGTVTSREQKLISGFVNFLCDSAGQEYTPALNEAAEQYLHNVATLRTGDVALLKSFDAFREWVTVQAGFYTEHFYPDGSRGRRAKSIAFASMDETEFQQVYKAVLNVLWNWILFRKFSSLEEVENVAAHLLEFA >NZ_CP019416|1226300:1238911|1230714_1230963_+|WP_000911593.1|DBSCAN-SWA MLKFVIYGDTVKNLNKTFTCKYAVIRRDDMTVIAEMDFFPDCNRSLMYRDGRYVRFLPLLQNDIMGSDTLINELTIRAGYHE >NZ_CP019416|1226300:1238911|1238215_1238911_+|WP_001152416.1|tail|DBSCAN-SWA MQDIPQETLSETTKAEQSAKVDLWEFDLTAIGGERFFFCNEPNEKGEPLTWQGRQYEPYPIQVQDFEMNGKGASPRPNLVVANLFGLVTGMAEDLQSLVGASVVRHQVYSKFLDAVNFSNGNPGADPEQEAVARYNVEQLSELDSSTATIILASPAETDGSVVPGRTMLADSCPWDYRDENCGYDGPPVADEFDKPTSDPKKDKCSHCMKGCEMRNNLVNAGFFASINKLS >NZ_CP019416|1226300:1238911|1235435_1235765_+|WP_001574216.1|holin|DBSCAN-SWA MNDKTPEFWAAVLTGLKNAWPQILGALMAGLIAYGRLIYDGATRKNKWLEGVLCGALSLCVTSALDVVGLPVSISPFVGGIIGFVGVDKLREIAISALKKRAGVNDENQ |
15 | Salmonella_phage(33.33%) | holin,integrase,tail,lysis | attL 1226136:1226165|attR 1245757:1245786 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
1466846 : 1481122
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_CP019416|1466846:1481122|DBSCAN-SWA TGTGACCGCTTTTTCAACCCTGAATGTTTTGCCCGCCGCCCAGCTCAATAACCTTACTGAGCTGGGCTATCTTGAGATGACGCCTGTTCAGGCCGCAGCATTACCCGTCATTCTGGCGGGAAATGATGTGCGTGTGCAGGCCAGGACCGGTAGCGGTAAAACGGCGGCGTTTGGTCTTGGGCTCTTGCATCGAATTGATGTCACTCTGTTCCAGACACAGGCATTAGTGCTGTGCCCGACGCGGGAGCTGGCGGATCAGGTAGCCGGGGAGTTACGTCGCCTGGCCCGTTTTCTGCCAAATACCAAAATTCTGACCTTGTGTGGCGGGCAGCCCTTTGGCGTACAGCGCGACTCGCTTCAGCACGCTCCGCATATCATTGTCGCGACGCCGGGGCGCCTGCTGGATCATTTACAAAAAGAAACCGTATCGCTGGATGCGCTGCACATTCTGGTAATGGATGAAGCAGACCGAATGCTGGACATGGGATTCAGTGACGCCATTGATGAGGTGATCCGCTTTGCGCCTGCGACGCGCCAGACGTTGTTGTTTTCAGCAACCTGGCCTGAGGCCATTGCGGCGATTAGCGGTCGTGTACAGCAGCAGCCAATACGTATTGAAATCGATACGGTAGATGCGCTACCGGCTATCGAACAACAGTTCTTCGAAACGTCTGCGCATGAAAAAATTTCGCTGCTACAAACGTTGCTTAGCCAGCATCAGCCAGCGTCCTGCGTGGTATTTTGCAATACCAAAAAAGATTGTCAGGCCGTTTGTGATGCGCTTAATGCGGTAGGACAAAGCGCGTTGGCGCTCCACGGCGATCTGGAACAACGCGACCGCGACCAGACGTTGGTGCGTTTTGCAAACGGTAGCGCGCGCATTCTGGTTGCCACCGACGTTGCCGCGCGAGGATTAGACATTAAATCGCTCGAACTGGTGGTTAACTATGAACTGGCCTGGGACCCGGAGGTGCATGTCCATCGTATTGGCCGTACGGCGCGCGCGGGAAGCAGCGGCCTGGCGATCAGTTTCTGCGCGCCGGAAGAGGCGCAGCGGGCGAATATTCTTTCAGAAATGCTGCAACTCAAGCTGAACTGGCTGAATGCGCCCGCCCGGCAGCCGTTACTCCCTCTGGCCGCAGAGATGGCTACCCTATGCATTGACGGCGGCAAAAAAGCGAAAATGCGTCCGGGAGATATTTTGGGCGCGCTGACCGGCGATATTGGATTAGACGGGGCGGATATCGGCAAAATTAATGTGCATCCAATGCACGTTTACGTCGCCGTACGTCAAGCAGTAGCGCAAAAAGCCTGGAAGCAGTTGCAAAACGGGAAGATCAAAGGCAAGTCATGCCGGGTACGGCTATTAAAATGATCGATAGCATGGCGTCTCAGCGGAGTGCTGAGACGCCTGCAGATTATTTCACTTCGATAACATCAAGCCGCAACGCCTCTAAGGAGGTGTCATCTTCTTCCGGCTGCCAGCCAGCGGGCTGCAAGGGAATCTCTTCACGATCGAACGCTAAATCGCCGCCGTCGACGACCTCGGAACCGTGAGTGATCCCTTTGAAATCGAACAGGTTAGTGTCACAAAGGTGAGACGGCACGACATTCTGCATGGCGCTAAACATCGTCTCGATCCGTCCAGGATAGCGCTTATCCCAGTCGCGTAGCATGTCGGCAATCACCTGGCGTTGCAGGTTGGGTTGCGAACCGCACAGATTACAAGGAATGATAGGAAAGGCTTTGGCCTCAGCAAAACGGACAATATCTTTCTCGCGGCAGTAAGCCAGCGGGCGGATCACAATATGTTTGCCGTCATCGCTCATCAGCTTCGGCGGCATCCCTTTCATTTTTCCGCCATAGAACATATTCAGAAACAGGGTTTGCAGAATATCGTCGCGGTGGTGGCCCAGGGCGATTTTGGTCGCGCCCAGTTCAGTCGCCGTACGATACAGGATACCCCGACGCAAACGCGAGCACAGCGAGCAGGTGGTTTTTCCTTCCGGAATCTTCTCTTTCACAATGCCGTAGGTGTTTTCTTCGACGATTTTATATTCTACGCCCAGCTGCTCAAGGTAGGCTGGCAGGATATGCTCCGGGAAACCTGGCTGCTTTTGATCGAGGTTGACGGCGACCAGTGAAAAATTGATCGGGGCGCTTTGCTGCAAATTACGTAAAATTTCCAGCATCGTATAGCTATCTTTGCCGCCAGAAAGGCAAACCATAATGCGATCGCCTTCTTCAATCATATTAAAATCGGCAATCGCTTCGCCAACGTTACGGCGCAGGCGCTTTTGCAACTTGTTGAGGTTGTATTGTTCTTTCTTTGTATTCTTTTGAATTTCTTGCATTATTTCAGTTCTCTGGTACTAAATGGGGCAAATTGGGGGCAAACTTTGCAACTACGATAACCGTGCATTCAACATCGCTACTTGTTCGTCGTTCATGTCATCAATCCACATACCGTAAATTTCATACACCATCTGCGCAGTTTCATGCCCCATTTGGCTGGCGATAAATGCCGGGTTCGCTCCTGCCGTCAACAGCCAGCAGGCAAAAGTATGTCGCGTATGGTACGGATTACGGCGGCGAATACCAGCACGTTTTACTGCCGCATTCCACCTTGCCCCTAAACTGCTTACCGAGTAATAAGGTTTCTGTTTTCCTTTACTCATCCTGGGCATGAACCTGTTCAGACTTCCGCAATACCATTAACAGGTTCTGGATATAGCTGTTTTCCTGTTCCATTTCGAGAACGGTAATATGCCTGCGTTGAGCGCGGGTGGCGTGATGCAAATATTTTTTGTCTGTGGCAGCGTAAGTAAAAATGTGCAACACGCGCTGGGGGAACGGCAGGGTGGCGACGGAAACTTCGCAGTCCTGACATTCGTCGTTGTCACTGGTGTCTGTCCCGCTTTCTTCGGTGCCAGCGTCGTCGGCGCATTCCTCCGTTTTTTCGTCGTGGGCGTCAGTAGCGGGCGTGCCGGGGATGAGCATAAATGTTTTGCCATCTTCGCCGCCCAGCTCATATTTCTGACAGAAGGTGGTATCAAAACTTCCCTCCGGCGGAAGTTCGTTAACAGCGGGGGAATTAACGCGAACCGGTTTTGCAAAATCTGTCGGTTCGAATCCGGCGTCAAGCATTGATACGGCAAGGCGTGCTGTGGCAGCGGCTTCGCTGCTTTCAGTTTTCCACCAGAAAGCAAAAGGGAAGCCGAGACGTTTCCGGGCGCTTTCATTTTTAACCTTAATATAATATGAGTATTCTGTTTTATTGTCGCTCATTATCATTACCCTTATTACAAATCATACTTAATGAAGACTTTCATTTTTCATTGAGCAGAATGCGTTCGTGACGAGCTCTTTACACTCCATCAGTTTCAACAATGAAATAAAATTTTCGGGAGGAGCTTCGTTCATTTTTAACAGCATTATTGCGGCTGTTATTGACCTGTCATATGCGCCTGTTTCACTGTCTGTATATGCAACAAGAGTTCTGGAAACACTTTCATCGTCACAGTCCCGGGCATAAGAAACAACACAGGGCATATTGTTTTCGATACATAACGTACGAATTTTTCCTGAAAGCTGGCGGAGCTCTGCAATTACTGATTCAGGTACATTTTTCATATAGATTCCTTTTTTCAGGTTGAGTGAATTCCTGCCATTGCAGGCATATTTAAAAACAGGATGGTTTAAACGATTACTGTTCTGTTACCATGATTCAGCTTTGGCAGGCAGACCATTTCTGTTCAGCCAGACTTTTACCATGCAATCGGTAATACATCTTTGCGTTGTTAAATCACGGATATATAAACGGCGTTTTTTAATGTTATTCGCTGAGGCGATATAAGTACGACCATCATGGATAACATAATCTCCCGGCGTAATACACTGACGCGGTATTTCATCCGTTCCGAAGTGATGAGCAATCATAGCCCCTCCATTTCTGGTAAATAAATTTTGTGGTGCGGTGCCTGGTGCCTCCAGGTGACATTAACCAGTTAACAATTAATGCCGACTTAAACCACCCATACTGATTCAGGGAGTTTTAACTGTGCCGCGTGCGCTTAGCCGCATTCACCGCATCACAAAATTCACTTTAAAAAGGGCGGACATCAGAAAGGACTAAGAAAAACTGATGCCGCCAAGTACTACACACAGCATTATTGTCGCAGTGGCAACTACAACCGGAGGCGCACTTCCACTATTTGGATTTACAGACAAGACCGACTCAGAAAACATCAGAAATGCGCCTTCGTGTTGTGCCCGGCTTTATTTAACCACCTCCGGGCTTCGGTGGTCTCGGCTATACCCCTACAGCGAGAACCTGTGTTAACATTTCAATACCCTTACAGTTGAGAGTTATTGATATGTCAGAAACCGCTCTGGTTATCGTAAAATTCCTAATTGGTAAATCCGTCGGACAATTTATGCTCACAGTGGCTTTATTTTTCTTAATTATCATCTTCATTCCTAGAGATATTACGGAGCTTATTGAGGCGCGTAGCGATTTACCATATGCCGTTCAGATTTTTAGTTTTGCTGTGGCTTACCTTATAGTGCTGATCCTCAAAGTCACTGGTTATTTTTTCGTGTCGGCGCTGCCGTTGTGCCAGCGTAGGGGCAGGGCAAAACGCATGTTAAAAACGCTTAATTCATTGAGTACTGAACAGCTGTTTTTACTTGAACCCTTTCTTAAAACTCATTCTCCCACTTTCCGGGCGTCCTGGGATAACCCTGATGCAGATGCTCTGGTTAAGGCAGGTATCGTTCGTCCGGCTGGTTCGTGTATCGACGGTGTTTCTGTGATGTTCAAAATCGAACCCGAGTATGAGTCGTTAATGCTTTCCACCTGGAATCCCTGCACAAAACGGTTCGATATTAGCCGTTAGCTGAAAGCGCCAGCAGAAACTCACTGAAACTGAGTGCTTCTTCTCCTTCGTCAAGGCTTTCAAAGTATTCTTCGTAAGCCTTTTCCATGATTGTGTCGAAATCCATATCACTCACCTGAGTTTCTTTCTAACCAGCGACGTGCGCCTGTTTCAGTTTTAAACGTCCTGCTTCTGGTGTACGTCATGGCGGTGAACGTTCCATCCTGGTTGGGGAACACGCCACACACCAGGGATTCGTTATTGCCGAGGTGGATTTTTTGCAGCTTGTCCATTATCACCCCGGATAATACTGTTCCTGTAGCTCGCATTGAGCCAAAAACATATCCCATCCATTGTCGCGCAATGCTTGGAGAGCTGCGCTAAAGCTGACGTTGCAACTATCCATCAAATACTGAATCACTTCATTCCTTCCCATCTTTCCCTCTCCCCTTAACGCCGGGTGGCGGAACTAAAACCTACAGCGCCGTGCTGTTTCTGAGATTATATTAGCGATATTCATATAGTTGATCAAGATAAATATGCATATATATCATAAATATGATCTATCCTAATGAAAATAAATGTGTTTTATCTGATGCAAGAGGGGGAGGGAGGAGCTTTAGCCAAAAGAAAACCGCCGGGAGAGGCGGTTTGATGTGGTTGGTTCGTCACTGATTTTTTAGGCGCTTTTGTGCAGCGAGCATGTTCTGGAAAGCCTCTTTATATAGCTCATTCTGACCTTTAAGCCGGTCAATGAGTTTTTCTTTCTCAGATTCAGGGAGTATATCAAAAAGGTTTAGTAAATCAGCCTGTTGTCTGCTCACCATTCGCCAGCCACCACCTTCGAAGTTGTCATCGTAAGTACCAGAAGAACGAACGTAGTTCATTAGATCGGCCAAATCCGGTCGTAACTCTTCGGGTTTAACTCTCAATAGAACAGAAAATTTTAAGGCCGCATCAGTGTTGAGAGGTGCCTTACCGTTCAAATAGTGACTGACGGTAGATTGTGTCTCAAAGCCCATAAGATCAGCGGCGATCTCCTGAGTAAGTTTGAGGTCTCGCTTTTTGGCGTCCCAGATGGCGCGTAAGCGCTGGGTAGCTTCTGGTGGAGCTATTTCTTCACGTTTTTTTCTCATGCGCTCATCTTATGAATGTGACTCATAAACTCAAACTGATATAAGTATTGATCATTTAAATTAGTATGGTTAATATTTAGCGAGAATTACTAAGGTGACCTTTATGACGTTAGATGAATATTTGAAAAAAAATCGTGTACGACAGTCTTGTTTGGCCACGCTGGCTGGTTGTTCGCAATCGATGATTAGCCTTGTTACTACTGGCCGTAGTCAGTTAAGCCCCGAAAAGGTATTGCGTATCGCAGAGGCTACGAATTTCGAGGTTACACCTCATGAACTCCGGCCTGATAACACTTACGGAGCTGAGGAGGATGACGGGGTTAACCATTTATTCGACCCGCCACTACCTGGACAAGGCAGAACGTTGTGGGGATGTGTACCAGGCGGGCAGAAGAGGGGGGATTTTCCCGTCAGAAGAGGCTTATCGTGCCTGGAAGAAACAGGCGAAAGTGGACGCTGACCTGATTTGGAAGCTGCCTGACGGTGAGGTACGTCGTTACGACAGGCACCACAACGTAATTTGTCGTGAGTGTCGTAAAAGCGAGTACATGCAGCGGGTACTGGCGTTTTATCGGGGAAACTTTCAGGAGGTGCTGTTGTGAGCCAAATTAACAATCGGAACTGCGTGAAGTGAAAGAGAAAGCATAATCCAAATATGAATAATTAAATTTAGTGATGTAAATAAACTTTAATCCTTAACCGGATGGATTCCTGCACGCTCAGAACACCAGGAGACCGCCCGAAAGGGCGGTAGCTCCATTGCTTAATTGTCTAAAATCGTGCTAAATCTTTTTATTACCATTAAGAAAGTTATGACAGTGATAAAAAAGGATGTATAGGCTAAAAAGCTAACGATATATGCGGGTGCGCGAAAATACCATTTCATTAAATCCACTGCATTGTCAGGCAGGAAATATATTATTACTGAAAATATAACCAATACTATTGAAGTTAATATTGCATAAGCGACGTTGTGGCATAACTGCTCATATATGGTTTTGTTAGTGTTTAATGATATCAATTTGTCTCTTGATTTGTTTCCTTCAATTATATCTGATATCTTAGTAATGGTTTTTTGTTTTTGTTCATAAATCATTATTACTGCACTCATTAATAGTGCTGTTGTAATAGCCCCGAAGTTAACGAAGACGGAAGCAATTGCCGGTTTCATTATTCCGTATGTCCAGCACAGAACGAAAGAAAGAGATAGAGGAACTATAAAATGTACGGTAATGTCGCTCATCAACATTGTTCCACGCTGATCTGACATTGTTTTGTAGTGTTTTATTATTACACCCAGCACATTTATTTTATTCATATAATCACCCCTTTATTTCCACAGTGCAGTTCTTCCAATATATCATTGGAAAGGTTTTTTATCGTGTCATGAAGTGCTGTTAGATCAGGTATACCTGTTAATGGGTTGATTTTTAGATCATTATCATCTAACTCTGCTGAAATTCCTTTTTTTAGTATGGTATCATAATTGAAAACGACAGTCCGACTGCCGAGCTGTAAGCTTACTTTTATTGCATCACATTTATCTTCAATAATCTCAATGATGTTTCCTATATTCTTGTTTCTTAAATCCCTGAAACTTCCGAATATGCCATCGTTTGCTTTTATTATTAAGTCTGTCTTGATGTTTGTTTTGTTTTTACCAAAGGAATCAGCAATATCTTCTGGTGCTTTATATCCTTGAGCTTTAATTTGTTTCAATTCAGAATTGAGAATGTATTGAGGGATTTTCTTATGATGTAATGGATTGATTCTTGCTTCTAGTTGAAATTGTTTTTTTAGATATTCAGTGATAGAATCAGAAAGAACACCTCGAGCAGAAATATTATCGCATGAATGGAATGCAATAATTCCTTCTTCAAGATTATCTGGTAGATATATTAAAATATAACGCTCTTTGAGTGTTACATCATAAGCAGTTGTTCTGTAATGGACTTTTTTGAGTTTTACATCTTTTATTTCACTGCTTTCTCCATATTTTCCAACTTTTATATAACCATATATGATTTTTTTTGTGTTATCAAAGTGAAGTTTAGTATGTTGTTCCAGAGATATTTTAGTTTTTGAAACGCCGAACTCGATGGGAGTGTTTTTATAAAGAGTAAAATAATCAACAAAAAGTTCATATGCCGTTTTTTTATTACTTAAACCTAAGTCATTAAGTTTTTTGCTGGCTCGACTGCCTTTATGGGTCAATACGCGGAATGAATAGAAATTAACGCTGTGCATGAAAAATCCTTTTGAGTATACAGGAATATACTAGAACATAAATGTATGCAATGCATAAAGGAAAAGCTACCGCAGGGCGAATTCACCCACCGATAGCTCTTAAATGATTGTTTTCAAGCGATAATACATAAATTTTGGTCTGTGTAAAGAGGGATGTATCGTCAGGGCAAGGGAGATGTATTAAGGGGTATTGGTAAAATTTGCGTTGGGGATAAAAACGGTTTGCGGGAAAAGGAGAGTTAAGTAGAATTGTTGCGGGTGCTTGAGGCTATCTGCCTCAGGCATGAACACCAAAAGGCAGATAGAGAAAAGCCCCAGTTAACATTACGCGTCCGGCAAGACGCTTAACATTAATCTGAGGCCAATTTCATGCTTTGCACATGTAGGTTAGCCTCTTACGTGCCGAAAGGCAAGGAGAAGCAGGCTATGAAGCAGCAAAAGGCGATGTTAATCGCCCTGATCGTCATCTGTTTAACCGTCATAGTGACGGCACTGGTAACGAGGAAAGACCTCTGCGAGGTACGAATCCGAACCGGCCAGACGGAGGTCGCTGTCTTCACAGCTTACGAACCTGAGGAGTAAGAGACCTGGCGGGGAGAAATCCCCGCCACCTCTGACGTGTCAGGCATCCTCAACGCACCCACACTTAACCCGCTTCGGCGGGTTTTTTGTTACCCGTAAAATAAAAATTCATAAAAATGATCAACTTTCAGATTGGTTGCGCAACAAGTGAAAAATGTCCTTGCTGGTGAACATAAAATAAGCAAATTTATATAATGAAATAAATAGTCGCAGTGTTTATATTTCCCGCCTCAACAGAAATCGCGTTGAAATCGCACCTTTTCATTTTTCCTTAGTTGTCTGGAGGTAACGTGAAAAAACTCAAGGATTTATTAGAGTTAGATGAAGACGGGCTTTATGCAGTACGTGTAAAAAATGGTGAAATCTCATTCTGTACGCTAATTCCTGACGACCATCTGATTCTGTCTGTTGAAGCGTTTATTGATTATCTGATAAGACTGGGTTTCACTGTCAGTTATTAATGTTTTAATATGTTACGGCTGACCTGAACAATCAGCAACCTACAGCGCCACCGGAGAGAACGATGGCGCATCTACAACTTGTTAAACAAACCTCATCAGGGCTTCTGCTCCCGGCAACGCCGGAGAGTGAGGACTTCCTGCGCTCAGTAAAAATCGGTGCGTGGATACACGCCGATTTTAAGCGAGTACGTAACTACGCGTTCCACAAACGTTTTTTCAAGCTCCTTCAGCTTGGTTTCGATTACTGGACTCCGATCGGCGGGGCGATCCTGCCTCAGGAACAGGAGCTGATTACCGGCTTTGTCGATTTCCTGTGTGAGTCAGCAGCGCAGGGCCACAGTCCCGCACTCAGTGACGCGGCGGAACAGTACCTGCATAAGGTTGCTGTCAACCGAACGCTCGATGTTGCGCTGCTCAAGTCCTTCGACGCTTTCCGCGAGTGGGTAACCATTCAGGCCGGGTTTTATACTGAGCATTATTATCCGGATGGCAGCCGTGGGCGCCGGGCGAAATCCATCGCTTTTGCGAATATGGACGAAACCGAGTTTCAGCAGGTTTATAAGGCCGTACTGAACGTCCTGTGGAACTGGATTCTGTTTCGTAAATTCTCCTCTCCGGAAGAGGTCGAAAATGTCGCAGCGCAACTGCTGGAGTTTGCGTAATGGCGGATTTACGTAAAGCGGCGCGGGGCCTGATGTGTACGGTAAGAATTCCCGGCCATTGCAACCATAATCCTGAAACGTCCGTACTGGCACATTACCGGCTGGCGGGTACGTGCGGAACGGCGACAAAACCAAACGATATGCAGGCAGCAATTGCCTGTAGCTCGTGCCACGATATTGTCGATGGGCGGGTAAAAATCGACGACTTCACGAAAACAGAAATTCGCCTGATGCACGCAGAGGGCGTTTTCCGCACGCAGGAAATCTGGAGAGAGAAAGGCATTTTATGATTTACCCAACAAACACCGGAAAAAGCGGAGAACACCTTCGTCTCAGCACGCTGGAAAGTGTGTGGATTCAGGGGAAATTGCGTATGTGGGGGCGCTGGTCATACATTGGTGGCGGCAAAACAGGGAATATGTTTAACCAGTTGCTGGCGTCCAAAAAACTGACGAAGACGGCCATTAACGATGCTTTGCGCCGTATGAAAAAAGCGGGGCTGGAGAAACCTGAACTGGAAGTGTTCCTGAGAGAGATGATCAACGGAAAGCAAAAAAGCTGGCTGGCACACTGTACGGATACGGAAGCGCTGATTATCGATCGGGTTGTAGGCGAGGTACTGACGGATCATCCGGGGCTGCTTGGTATCCTGAACCAGCGTTACGTGGGGCGGGGGATGAGTAAGAGAAGGATGGCCGAGTTACTAAACGAACAGTACCCAGAGTGGGCGTTGATTACATGCCGACGCCGTGTTGAGCAGTGGTTGAGTATCGCTGAGTTCATTTTGTATTCACCTATGAGAAAAGCGTTCGATTATGCTTAAAAAATCATTTGCAAAATGAGCCACAAACTGCTTCAATTCCAGTACGCTTCGCAAAGCTGTATCGCGAGGCTAATGACAGACATGAACGCATTTTGAAACCCGCCATCGTGCGGGTTTTGTCGTTTCTGCAATACAGAAAAATATTCGCCAGTGTAATCCGGGTTGTTAAACATGGTGCGTTTTAAATATGTTTTATGTACATTAAATTAATGTGAAATGTTTTGATAAAATAAAAATGTAATAATAACTTTACGTTTATTGACACAATGAATTGTTGAAACGCCTGTTCTGACTCGTATTATTTTCATCGGTCCGAAGGGGATGATGGATAACCTTCTTCTGCCCCCGAGGATCAGAGAGCCGGTTTTTTTGTGCATCCTGGAAAATTCACGTGAAGGAACCGGCTCTCAACCAGAGAAGAAGAGGCGTTTTTTTCGATACAACTATCGTAATTACTCCGTTGGTGGTGCTGGCACGTAAAGTGTGGATAGTGCGTTTATGATGAGTGCATGATGTATATCCTGATACAGCATCCGGTTTGTGGGGGCGGAGAAGCCCCGATGGACTCAGTGCCACAATTTTTTATTGCATTCAGATAGCGTACTGTGAATCGGATAAATGAGAATGTCAGTGTGTTGGTAATGCGGGGTTCTCAGTGCGCTATCTGAATGCAGTGAAATCTGCTCTGAGCAGAGCTAAACAGCATTGTCTGCGTTTGATCAATTTGTAGCGGGTCATAGTGGCTGACTAAAGACTCTCCGGGGCATCCCGGCACTGCATTTATTACTAAAAATCTTCATATCACAGAGGCAGAACATACGGAAAATTCTTGTCAATACAACACCTGACACAGCAATATTTTTCGGGAGTCCCCGGCGCCTCAGGTTTTTTATCGCCATCAATAAAACTATAATAATAACTCCATGTTATGATTACCACCTCTCTCTCATGAGGTGGTTTTTTTATTCCCGCAAATTGCAGAAATAAGATGGAGTCATCAGAATATGCCCTGATTGTATTTTGTCTTTTTTGAATTAATGCAAAACATTTGGAATAAATAAACATCTAATGATAAATTTACATTTCTTGACGCGACTGCTTGTTGAAATGAAATTTTTATGATTTATTATTGTCGACAGTTTGGCGGAGGTGACTGGCAGATTTCTCCACTCCGTCGAATAAGAGAGTTGATTCTTTATACCTCCTGAGTCGTCTGATTAAAGAATCATCACTCGATTTGGCATTAAGGTGAAATTAAGATTCCATTGATATAGGTATCGTTCTTACTCTTTGTGGTGCAGGCATATGGATATGGGGTGGTTACATTAATGTTCCTTTAATTTAACCTCCTGATAGTGAATCAGGCACCGCAATTTTTTTACTGCATTCAGATGGCGTACTGCAAAAAAACGGTCATTGCTTGCGCCACTGTCTGATGCTCTTGTAACACATACGGGATTTGTGGTACGCCATCTGAATGCAGTGAAACCCACATAAAGTGGGGCATAAACAGGATATGAGGTGCGTTTATTTTCGTCTGCGGGTCATGGTGACTGACCAACGGCCCTCCGGAGATAATTCCGGCACTGCATTATTTATTGAGGTGTTCCCCAGTGCGGGGGTGACCGGGAAAAATGTTCTGCCGATGGTCACAGACACATACCGGGCTAATATGTGTTTTCGGGAGGCACCCGACACCTCTACTGTTTTTCCAGTCGATAACTATAAAACATGCTTCAGATATTGAGCACCGCCTCCCGTGAGGCGGTTTTTTTTATTCCGGGAAAAAGTTCTGCCCGCCATATAATAAAGTTAACGTTTTCAGACCAGGGTGCGGGAAGTATCCGGGGCGGGAAATAATGAATTAAAAAAGAAGCGCGGCTGTCGGATTTAAGCCGCGGGACAATGTCCGTGATAGATAGTTGAAAAATTTCAGGCTATCCCTTTCGGGAGGTCGCCATTATTTTACTCATAACAAAATAAGACCGGAACCCCGGAAACAACCTTATTTTCCGGTAAGGCTTATTTCATTCCCCGCGCCACGCCCGGCGCACATTCATAACTAACCACGGAGCCTTTCAGGGGTGAGCTTACGGGATGGTCAGTGTGACTTTCTCTGTGGGCTGGTCACCCCCGGGCGCAGGCTCACCCACTAAAAGGAAAAGTCACGATGTTAGGTATTTTCAGAAAGAAAACCCGCAAGGCTATTGTTGAAGTGAAGAAGATGGAGAACCGGGATGCGGTGGAGGCGACCGTCTGGGGCGCATATTCCATTGCATACGCCGATGGCACCTGTGACGCGAAAGAAATTGCAGTGCTGGAGAAAACCATCGCGGCACTTCCTGCCTTTGCGCCGTTCTCCGGCGAGATTGCCCAGATGAGTGCCAATATCCGCGCCCGCTATGAGGCGTCACCGCGTAGCGCGAATGCTCAGGCACTGCGTGAGCTGGCTGACGTGGCAGGAACAGCAGAAGCGGTTGATGTGCTGTGCCTGTACTGGCCTCGTTCCTGTCCCGCCTTGCTGACTACAACGGTAAACCGCTGGATGCGCTGTGTGCAGTGGTGATGTCGGTGCTGTCAGTGAAATTTCTGACCTTCATTCATGACCAGGACATTTCATCGCTGACCGGGGTTTTTTCACGGATGCGGGGAGGAGGGAGTGGTCATGGAAAGTAATCTGACCGGCACACTGAATGCGGGCCTGTGCCTGGTGACAGTGCTGGCCCTTTTTCTCTACCGCCGGAACGGCGCCAGATACAAACCGGGAATAGCCTGGCTGTCGTACCTGCTGATGCTGGGCTATGCGCTGGTTCCGTTCCGTTTTCTGGCCGGACATTACCCGTCTTCATCCTGGCCTGTGGTGCTGATGAACGCGCTGTTCTGCGGGCTGGTGCTGTGGGCGCGGGGTAATGTGTCGAAAATACTTTCACTGCTGAGGCTGCGATGAAACCGAAGGACGAAATTTTTGATGAAATTCTGGGTAAGGAAGGCGGCTACGTCAACCATCCGGACGATAAAGGCGGGCCGACAAAATGGGGTATTACGGAAAAAGTTGCCCGCGCCCACGGATACCGTGGTGATATGCGCAATTTAACCCGTGGACAGGCGCTGGAAATTCTGGAGACCGACTACTGGTACGGTCCCCGCTTTGACCGGGTGGCGAAGGCCTCGCCGGATGTTGCTGCCGAACTGTGTGACACGGGCGTGAACATGGGGCCGTCGGTGGCAGCGAAAATGTTGCAGCGCTGGCTGAACGTGTTCAACCAGGGCGGGAGGCTGTATCCGGATATGGATACGGACGGGCGCATCGGGCCGCGAACCCTTAACGCGTTACGTGTTTATCTGGAAAAGCGCGGTAAGGATGGCGAGCGTGTACTGCTGGTGGCGCTGAACTGCACGCAGGGGGAGCGCTATCTGGAGCTGGCGGAAAAGCGGGAGGCTGACGAGTCGTTTGTCTATGGCTGGATGAAAGAGCGCGTATTGATATGA
Protein sequences of DBSCAN-SWA_4 >NZ_CP019416|1466846:1481122|1472390_1472858_-|WP_001227859.1|DBSCAN-SWA MRKKREEIAPPEATQRLRAIWDAKKRDLKLTQEIAADLMGFETQSTVSHYLNGKAPLNTDAALKFSVLLRVKPEELRPDLADLMNYVRSSGTYDDNFEGGGWRMVSRQQADLLNLFDILPESEKEKLIDRLKGQNELYKEAFQNMLAAQKRLKNQ >NZ_CP019416|1466846:1481122|1480576_1481122_+|WP_000802786.1|DBSCAN-SWA MKPKDEIFDEILGKEGGYVNHPDDKGGPTKWGITEKVARAHGYRGDMRNLTRGQALEILETDYWYGPRFDRVAKASPDVAAELCDTGVNMGPSVAAKMLQRWLNVFNQGGRLYPDMDTDGRIGPRTLNALRVYLEKRGKDGERVLLVALNCTQGERYLELAEKREADESFVYGWMKERVLI >NZ_CP019416|1466846:1481122|1470160_1470478_-|WP_000800272.1|DBSCAN-SWA MKNVPESVIAELRQLSGKIRTLCIENNMPCVVSYARDCDDESVSRTLVAYTDSETGAYDRSITAAIMLLKMNEAPPENFISLLKLMECKELVTNAFCSMKNESLH >NZ_CP019416|1466846:1481122|1476809_1477100_+|WP_000774470.1|DBSCAN-SWA MADLRKAARGLMCTVRIPGHCNHNPETSVLAHYRLAGTCGTATKPNDMQAAIACSSCHDIVDGRVKIDDFTKTEIRLMHAEGVFRTQEIWREKGIL >NZ_CP019416|1466846:1481122|1471221_1471743_+|WP_000004762.1|DBSCAN-SWA MSETALVIVKFLIGKSVGQFMLTVALFFLIIIFIPRDITELIEARSDLPYAVQIFSFAVAYLIVLILKVTGYFFVSALPLCQRRGRAKRMLKTLNSLSTEQLFLLEPFLKTHSPTFRASWDNPDADALVKAGIVRPAGSCIDGVSVMFKIEPEYESLMLSTWNPCTKRFDISR >NZ_CP019416|1466846:1481122|1468263_1469199_-|WP_001156218.1|tRNA|DBSCAN-SWA MQEIQKNTKKEQYNLNKLQKRLRRNVGEAIADFNMIEEGDRIMVCLSGGKDSYTMLEILRNLQQSAPINFSLVAVNLDQKQPGFPEHILPAYLEQLGVEYKIVEENTYGIVKEKIPEGKTTCSLCSRLRRGILYRTATELGATKIALGHHRDDILQTLFLNMFYGGKMKGMPPKLMSDDGKHIVIRPLAYCREKDIVRFAEAKAFPIIPCNLCGSQPNLQRQVIADMLRDWDKRYPGRIETMFSAMQNVVPSHLCDTNLFDFKGITHGSEVVDGGDLAFDREEIPLQPAGWQPEEDDTSLEALRLDVIEVK >NZ_CP019416|1466846:1481122|1475977_1476148_+|WP_000734094.1|DBSCAN-SWA MKKLKDLLELDEDGLYAVRVKNGEISFCTLIPDDHLILSVEAFIDYLIRLGFTVSY >NZ_CP019416|1466846:1481122|1473130_1473460_+|WP_001676916.1|DBSCAN-SWA MNSGLITLTELRRMTGLTIYSTRHYLDKAERCGDVYQAGRRGGIFPSEEAYRAWKKQAKVDADLIWKLPDGEVRRYDRHHNVICRECRKSEYMQRVLAFYRGNFQEVLL >NZ_CP019416|1466846:1481122|1471850_1472006_-|WP_085981757.1|DBSCAN-SWA MQKIHLGNNESLVCGVFPNQDGTFTAMTYTRSRTFKTETGARRWLERNSGE >NZ_CP019416|1466846:1481122|1477096_1477633_+|WP_000640113.1|DBSCAN-SWA MIYPTNTGKSGEHLRLSTLESVWIQGKLRMWGRWSYIGGGKTGNMFNQLLASKKLTKTAINDALRRMKKAGLEKPELEVFLREMINGKQKSWLAHCTDTEALIIDRVVGEVLTDHPGLLGILNQRYVGRGMSKRRMAELLNEQYPEWALITCRRRVEQWLSIAEFILYSPMRKAFDYA >NZ_CP019416|1466846:1481122|1475474_1475687_+|WP_000882662.1|DBSCAN-SWA MLCTCRLASYVPKGKEKQAMKQQKAMLIALIVICLTVIVTALVTRKDLCEVRIRTGQTEVAVFTAYEPEE >NZ_CP019416|1466846:1481122|1480120_1480309_+|WP_001688615.1|DBSCAN-SWA MPVLASFLSRLADYNGKPLDALCAVVMSVLSVKFLTFIHDQDISSLTGVFSRMRGGGSGHGK >NZ_CP019416|1466846:1481122|1480298_1480580_+|WP_000445513.1|holin|DBSCAN-SWA MESNLTGTLNAGLCLVTVLALFLYRRNGARYKPGIAWLSYLLMLGYALVPFRFLAGHYPSSSWPVVLMNALFCGLVLWARGNVSKILSLLRLR >NZ_CP019416|1466846:1481122|1466846_1468220_+|WP_000123700.1|DBSCAN-SWA MTAFSTLNVLPAAQLNNLTELGYLEMTPVQAAALPVILAGNDVRVQARTGSGKTAAFGLGLLHRIDVTLFQTQALVLCPTRELADQVAGELRRLARFLPNTKILTLCGGQPFGVQRDSLQHAPHIIVATPGRLLDHLQKETVSLDALHILVMDEADRMLDMGFSDAIDEVIRFAPATRQTLLFSATWPEAIAAISGRVQQQPIRIEIDTVDALPAIEQQFFETSAHEKISLLQTLLSQHQPASCVVFCNTKKDCQAVCDALNAVGQSALALHGDLEQRDRDQTLVRFANGSARILVATDVAARGLDIKSLELVVNYELAWDPEVHVHRIGRTARAGSSGLAISFCAPEEAQRANILSEMLQLKLNWLNAPARQPLLPLAAEMATLCIDGGKKAKMRPGDILGALTGDIGLDGADIGKINVHPMHVYVAVRQAVAQKAWKQLQNGKIKGKSCRVRLLK >NZ_CP019416|1466846:1481122|1469515_1470133_-|WP_001676915.1|DBSCAN-SWA MSDNKTEYSYYIKVKNESARKRLGFPFAFWWKTESSEAAATARLAVSMLDAGFEPTDFAKPVRVNSPAVNELPPEGSFDTTFCQKYELGGEDGKTFMLIPGTPATDAHDEKTEECADDAGTEESGTDTSDNDECQDCEVSVATLPFPQRVLHIFTYAATDKKYLHHATRAQRRHITVLEMEQENSYIQNLLMVLRKSEQVHAQDE >NZ_CP019416|1466846:1481122|1476210_1476810_+|WP_000940751.1|DBSCAN-SWA MAHLQLVKQTSSGLLLPATPESEDFLRSVKIGAWIHADFKRVRNYAFHKRFFKLLQLGFDYWTPIGGAILPQEQELITGFVDFLCESAAQGHSPALSDAAEQYLHKVAVNRTLDVALLKSFDAFREWVTIQAGFYTEHYYPDGSRGRRAKSIAFANMDETEFQQVYKAVLNVLWNWILFRKFSSPEEVENVAAQLLEFA >NZ_CP019416|1466846:1481122|1473621_1474176_-|WP_001033796.1|DBSCAN-SWA MNKINVLGVIIKHYKTMSDQRGTMLMSDITVHFIVPLSLSFVLCWTYGIMKPAIASVFVNFGAITTALLMSAVIMIYEQKQKTITKISDIIEGNKSRDKLISLNTNKTIYEQLCHNVAYAILTSIVLVIFSVIIYFLPDNAVDLMKWYFRAPAYIVSFLAYTSFFITVITFLMVIKRFSTILDN >NZ_CP019416|1466846:1481122|1470562_1470784_-|WP_000560208.1|DBSCAN-SWA MIAHHFGTDEIPRQCITPGDYVIHDGRTYIASANNIKKRRLYIRDLTTQRCITDCMVKVWLNRNGLPAKAESW >NZ_CP019416|1466846:1481122|1474172_1475105_-|WP_000556390.1|DBSCAN-SWA MHSVNFYSFRVLTHKGSRASKKLNDLGLSNKKTAYELFVDYFTLYKNTPIEFGVSKTKISLEQHTKLHFDNTKKIIYGYIKVGKYGESSEIKDVKLKKVHYRTTAYDVTLKERYILIYLPDNLEEGIIAFHSCDNISARGVLSDSITEYLKKQFQLEARINPLHHKKIPQYILNSELKQIKAQGYKAPEDIADSFGKNKTNIKTDLIIKANDGIFGSFRDLRNKNIGNIIEIIEDKCDAIKVSLQLGSRTVVFNYDTILKKGISAELDDNDLKINPLTGIPDLTALHDTIKNLSNDILEELHCGNKGVII |
19 | Escherichia_phage(66.67%) | holin,tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
2015480 : 2068904
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NZ_CP019416|2015480:2068904|DBSCAN-SWA ATTAATTACACACTACCTGATACCCGGTATCATTTCTCAGCATCGACCAGATTATTCTCGCGTTTTTGTTAGCGACCGCCACGGTCGTTTTATTAAATCCGCGCCGTTCCTTTAACTGGTTAACCCACTGATTCATATGACCATCATTGTTATTCGTGGCAACCCTGACGACAGCGCGGGCACCATGAATAAAAAGTGTCCGCAGATGCTTGTCGCCTTTTTTCGTCATATTCATCAGCACCTGCCTGTCGCCACTCGAATGCTGGCGTGGAACCAGACCCAGCCATGCAGCAAAGTGGCGACCATTCTTAAATTCAGTTCCTTTGCCAATAGCAGCAACAACGGCCGTGGCCGTTTTAGGACCAATGCCTTTAACTTTGGCGATACGCGGACAGGCTTCTGATTGCCTGAATACTGTTTCAATTTCCTTATCAAAAAAATGGATCCGACGCCCGAGATCGTTAAAGAGATCATAGAGTTCGGCAATTGTTCTGCGCATACGGGAACTTAGACCGTTTTCTGCATCTTCAAGGATAAGAGGAATAGCACGGCGAGCTCTGGAGACAGCACTGCCAATGGGGATCCCCCGGTCAAGTAACAGCCCCCTTATTTGACAGACTGTAGCAGTGCGGTGATTGACAATACGCTGCCTTGCCCGGTGTAAAGCCTGGATATCCTGCTGTTCGGGGCTTTTTGGCGGCACGAACTGCATTGTCGGTTGCATCAGAGCCACTGCGATGGCCTGTGCATCATTACCATCATTTTTTTGCCCGCGGACAAAGGGTCTTACATACTGAGGACTGATGACCTTTACTTTGTGCCCCAGTTTCTCAAACTCACGCTGCCAGTAAAATGCCCCGGTGGACGCTTCGATCCCAATCAGACATGCAGGAATATTTGCCAGCGTCTGGAGCAATTCTTTTCGGCCAGTGCGTTTCGTATAAACCGGTTTGCCGGCCTGGTTTAACCCGCAGAGCTGAAAAACATTTTTAGCCAGATCAATACCCAGAAATACGATATTCATGGTGATTCTCCTGGTGAGCACATTTCGTACAGTTAACCGCAGCGGGAGGAAGTAGTCCATCCCATTAACTGGAAGAAAAAATTTGCCGGGCCGGGCGTGACGGAACTGCGGCGTCTGCGGCAACTGGAGGATGAGAATCAGCGGCTGAAGAAGCTGGTGGCTGACCTGAGTCTGGACAAGGAGATGCTGCAGGAGGTACTGAACTGACCAGAACACCATGATAAACACTGACAACAAATTAAAAGCGTAGTATTTTTGATTTGCGGCAACAAAATTAATAAGAGTCATGGTAGCTAAAAAAACTCTTTTTCTTTTCCTGTACAAATAAGATAAAAATCGCTTCATCCACCACCGCAAGAATATGTATATTATCTGTTATTACTATTTCGATAACACAATATCAAATTAAATATATCTCTATTACCTATCCATTAGTGTAATTTTCTTAAACTGAAAATTTATCATTTACAGCCCAAACTCTTACGCATTTTTTCTTTTAATGGCTCGGCTACTTCTGGACGTAATTTTGAGTATAAATTTTGTAGATTTTCACAGCCAGCAGGTTGGCCTGATGATAAAGGCTCTATTTTTGTCAGGATATTCCACAAAATATAAATTGCAAATGCGATTAGCATTATACTTAAAAAACAAATACGGATACCCCGAGCATTAAACAACGCTGGGACATTTGATGCTTCTTGTTTAACGGATGGCACATAATTTACGACTGGAGCAGTAACCTCAGGAATAACAAGTTGTTTTCCATACGGAACGAAGTATTCAGAAATATGCTCTGGGCTATAATGCTGTTCAACACGTTTCCACTCATCTGACGGAATGAGTAAGAACGATAATGTATCACTAAAAGTGCTGTTCTGATATACCCGCAAACTATATCCACTATTAGCCAATATATGCTCAATGCGCACCAGTTCTGTATCAAAACCATCACTTCCCATTGGGTAGTGATGAAACACATACTGCGTATCAACCAAATTTATCGTTGCTGTTTCAATCTCACTGTCATAATCGAATTTGACCGTGACAGGCTTCCCTCCCTGATTATCTGAAAGTGCCTCAAGCAGGCTAAAAGTCCCCTCTTTCCAGTCAATATTAATTCCGTAGATACTCATAAACAGCGCCTGGTAGTCCTCTAATGCAGACTCAGACAGACACGGTTCTTTATGTTTCTCCAAAAAAAACTTACGCCTTAAAGAGGCTTGTTCATCATCATTGGCAGCAAAAAAATCAGAAAACTCATCCTGTTCTGGTAATTGCATCGTGGTTCCATCCAATATTTTATTAATCAAAAATCGATCCCAGCAGCTAATTTTCCCTGCTGTCTATGATAGAATAGAATATGTGATCTCAAAGCGTCAAGATGACAAGACCAAACGAGCACGCATCCATTCAATAAATGAATAAACCCTGTTAATACAAATAGTTTTAAAAATGATTTATCTGGACAACTGAAATTATCTTTATACTGATAGCAGTACAGCCATTTCTGTAAGTTTAAATGACAGATAAAAACACGGACAAATCGTTTTACGTTATTTCCAATATTTTCAATGTCTGCCCTCTGTTTTTTTACGTAATCCCTATGTTGTCGCGACGGGAACAATAATAGCGAATTCGCGTATGACGTTTTAGGGGCATTCATAAAAGAGTTTTTTGAAAACATGAAGGGAGATGAACGGAAAAAGGTTCGTTAAGGCTGGATGCGATCACGTTCGCTTACGGCGAACGTAACCCTTTTTACCTGGCTGCATTTTTCTTCTTCTTTTCGTCCTCAACATGCTTATCAACAGGTGAGTCTTTTAAGCGATTGTTTTGCATGATGTCATCTTCAAGCGATTTTAGATAGCTTGCATCAGTCAGCAAGGCTATTGCCAGCGTTTTCGCATAAAATTGCGCGTGGCTTCGTGTCGTAGGATTTACCGCGTAGGTTCTGGCAAGCTCATGTCCAGGGGAGGCGGTATCACCACTGAAAAAGATTTCGCCTCTTTTCCTCCATCCTCCAGCAATCCATTTCCTTTAAGATACCCTCACAGGATCCGTGTGAAGCAACGGCAGTTAAACTGTTTTGTCACCAGAGATGTGAGATGGGCGTTGCAAAAGAGTGTGAGCTTGTGAGCTAAAGAAGATAAAAGTGGATGCCCGACGCGCAGCAAAATGGTAAAAAACAGCATCAGTAGTAAGAAAAATCGAGCGAGTGAAAGAAATTTTAATGCAGGGATTTATGTATGACGAAATCAAAAACGCTTTGTCAGTGATTGATTTTGTTGAATGATTGGCGGAGAGAGGGGGATTTGAACCCCCGGTAGAGTTGCCCCTACTCCGGTTTTCGAGACCGTCCCAATATTCTTATTTAACATTAACTTATCTAATAACATGAGAAAAAAGTAGCATTTTTGTACGCAGTATTTTCAGAAAGTTAGCCTACTTATCAAAATCATTTTCTCACCATGATAGACTATTTTTTAAACAAAACCATCTCCTTTATTGACTTCCCACAACAACATGCGCCATAACATCTTCGTGGCATCTGGTGGTTTCCGTCCTTAATCAGTTATGGGATTCCTACAGGTTCACCGGATGCCACAACCTTCCCTCATGCTTCTAGTTAGCGCGGTAATCCCGTTTTTTAACTCCCTTCCGGTTAGCCGATAACAGAATCCAGTACAGCCCGTGTATCGACAGCCCCACATCATAAATAATCGCTACCTGCTGTCGCGGTATTCCTGCCCCAATCAGACGCCCGGCCTGCGCCCATTGCTCAGGTGATAACTTTGGTCTTCGCCCGCCAATCTCCCCTGTCCCCCTGCAATACAATAAAAACATATCGTATAAGAAGTATAAAATACTTATTCAAAAATGTAATTTTAAGCCCCCCCTAACCAAGTAAAAACTATCGTTTCAGATAGCTATGGCACGAAAATACGGTAGCAAATCTTCATACAAAAATCTTCCAATTTATTAAACTAAGGTTAAAACCCGATATCTTATCAATCTCAAATCATGGTATGTTATATTAATAGCGTAAGGGTTGAAAAATGTTTTCTCGAGTCAGAGGTTTTCTTTCATGCCAGAACTATTCTCATACTGCAACTCCAGCTATTACTCTGCCTTCATCAGGTAGTGCAAACTTTGCCGGAGTTGAATATCCTTTATTGCCATTAGATCAGCACACCCCCCTACTTTTTCAATGGTTTGAACGAAACCCAAGCAGGTTTGGGGAAAACCAGATCCCAATTATTAATACTCAACAAAACCCCTATCTCAATAATATTATCAACGCCGCTATAATAGAAAAAGAGAGAACTATCGGGGTTTTAGTTGATGGGAATTTTAGTGCTGGACAAAAGAAAGCATTAGCAAAGCTGGAAAAACAATATGAAAATATAAAGGTTATCTATAATTCCGACCTGGATTATAGCATGTATGACAAGAAACTATCAGATATTTATTTAGAAAACATCGCTAAAATTGAAGCTCAACCAGCAAATGTCAGAGATGAATATCTGCTTGGTGAGATAAAAAAGAGTTTAAATGAAGTTTTAAAGAACAATCCAGAGGAGTCCCTTGTTTCGTCCCATGATAAACGCTTGGGACATGTACGGTTTGATTTTTACAGAAATCTTTTTTTATTAAAAGGAAGTAATGCTTTTCTGGAGGCGGGCAAACATGGCTGCCATCACCTGCAACCTGGAGGTGGCTGCATATATCTTGATGCTGATATGTTACTTACAGGTAAACTCGGCACTTTGTATTTACCTGATGGTATTGCTGTTCATGTAAGTCGTAAAGGTAATAGCATGAGTCTTGAAAATGGGATTATCGCCGTTAACCGCAGCGAGCATCCGGCATTGAAAAAAGGACTTGAAATTATGCACAGTAAACCTTATGGCGATCCATACATTGATGGTGTCTGCGGTGGGCTAAGGCATTATTTTAATTGTTCTATACGGCACAATTATGAAGAGTTTTGTAATTTTATAGAATTTAAGCATGAACATATTTTTATGGATACCAGCAGTTTGACTATCAGCTCCTGGAGATAATTATTTGCAAACGTATGATATAAACGCGAGCAATGTTTCGGCAAAGCTAAACGTAAAGACCATACTGACGATTTCGATCGATATCGTGGATGTGACGCGCGTACCTCCATTAAATTGGATGACCTGCGTGCCGTGGTGAAAATCACTCATCCGGTTAACTCCGTGGTTAAGGGGTGAGTATATTTTCAGGTCAGTACACAAGAGGGGGCTATTTGTACCGGCTGTCAGGTTGATGGCACAACGACAGGAAAAAGAAAAGGCGGGTAATAAACCCGCCTGAATATTTAGCGTGGTATATCCGGCCAGTCAGGCGCAGATGTATCCACCCTGTTTACCATTACGCTATAAAGTTCCCATGCTTCCAGCCGTTTAATCTCTTCATCTGTGGCAATTTTTAGTTTTACTGCCCGTGCCAGTGGTGCGATGGCTGATTCAGCCTCAGCAAGGCGGCGAACTTTTTCAGCCTCCGCCTTTTTACGCAGCTCTTCCGGAGAATAACCCCGTTGAACGACTTTACCGTCCTGATATAACCACGTACCATCACCACGGCAATCATCAGGGCAGTCAGCAGCGTCTATTTCCGCAACAGACATATTAACCGGCCACAACATTGATACAGAATATGTGTTTCCTCGTTGCGGGACTGGCTTATTAACAACACCCCAGATAACCCCCTCATGGTCGTACATTATTTTTGCAGTATCATCAGAAAATAATGACTGACACTCATACCAGTCCTGTCCATCGTCCGACTCCAGAAAATATGCACCTATATTTATTTCAGCCTGAGTTTTATCCCTGTTTACGGGCGCGTCAATAAGTCTGAAATTTTTGATATTCTGATATTTTTTCATTATACCGTTCCCCCTTGTACGGTATACCACTGACTCCCGACTCGTTTTTGCAAAGGCGCATAATTAATACCATCAATATTTTCGCCCTGATAATCTTTCCAGACGGAGGTAACTACATATCCGGGAGTGTTAGGCCAGGAACCTGCATTGTTCCAGGTAGTCACTGATGTGCCAGCCCCTAACTGAACATCTAAGACGAGATTATTATTAATCCAGGTACTTAACCAGCCATTTCCCCATAGCGAACCAAAGATGTCGCCGTTATTCTGATAGATAGCCCCGCTTGCACGAAGCGTGTTAGCGGTGATATCTCCATTGACCGTAAAGACAATCGAACCGTCAGGATTTCGCTGGCTGTACAGATGCCATCCCTGATCGTCGTCCAGTTCAATAACTGTTGGCCTGTTTGCGTCGCCCCATAAATTAAACGTGGCTGTCATTGTCGAATTATTATTGCTCGTCAGTGACAGTTTTTTTGCGTTGCCTGCGCGTACGGCACCATTAGTGAGAACATCTACTGACATGTGCAGCCCGGAATTGTCGATATAACCAACCAGAGCATTATTGGCATAAATCCCCAGAACGCCGTCACTGTGCCACTTAAACCCTGTATCGTTATCTCCGAATACAATCGAATTACCACCCAGCGCATTATCAGTACCAATGCCTAACGGACCGTTTAGCCTCCCTCCATTAACTGACAGTGCCTCAACGTCACCGGCTGGGGGTTTCATCAGACTATTAAACAGTGTATATGTCTGACCGCTGGTTGAGTTTCCCGGCTGAACTGATGAATATTCAGGCGTACTGTGCAGCGTGACATTTGCATTACCGGTGTAATCATATTGCGCAATTAACCAGTACGCATACTGGCCGATATTAATATAAATATCGTAGGTGTCGCCTGATGTATTAACCCATGCGACCTCGTTAGCAGCAGAAGGTGAACGCCTCCATAATGTGGCGGTTATTCCAACAGGTGAACCATTACCGGCACGCAATACCAGTTCGCTGATTGCCGCCTGTTCAGGTGAACCAGCGTTAAACCCCGCCCCACCGTACAGTTTAATCACCGCAGTTGATGTAGCCTGCGGCATTACAACCGTGGCGATTTTGAACCAGCCTGATTCGCCAAGTGTAATGGTGGTTGACGTTACCGCACCGATGGTTCTCGCAAATTGTTTTTTGTCAGGAATATCGCCGCCGTTCTGCGATTTTTGCAGGGCCCCTGCAGCGAGATTTATCGTTTCTCCTAAACCGACGTTCTGGAGAAACAGCGGCTTATTCGGGATGTCCGCGCCATTCTGATTTTTTTCAAGACGGGTTTTAACCTGTTCATCGATCAGCCTGCCAATGGCGGCGTGAAGCTGCGTATGTTCGCCTTTACTGAGTGGTATGCCGGCGGCTTCAATAACAGTACAGACCTCTTCCTGGACTGCATCCCACATATCACTGTTGAGATCCGTTGCGCGGCGGCCCGTGGCGGGATCACCATTCGTAAATCCGTTTTTTCCCTGACCAAATTTATCTTTTTGCGCAGTGGGCGTATCAATTCTGTGCATTCTCTTTTCCTTCCGGATAAGCAAAAACAACAACCGTATGTGACGGACAAAGCTTATCAATCACACATTCAGCAACAGTATCGCCCCACGTTCTGATCGCAGAGTCGCAGGTGCTTGTACAGGTCTGCCAGCTGATGTTCGCATCAGCCGGAATATTCACACGCCAGTAGTAACGCCAGAATTCCCCCCATTCAGGATCGGGTGTGCTGTCGAGATTCTGAAACTGCTCAATGGTGGCAGCGGTATACCCCAACGCATCAAGCTGTTCCCGATAAAACCTCTCGTTTATACCGCCAGCAACATTTGCCTTTGCATCCAGCCGTTGCTGGCGCTGCTGTAATGTCTGAACGCCTTCCGGTGCACAGGAATCAGGCAGGCCATACAGCTGTTCATAACGGTCTATCAGTTCTGTGGTTCTGGCCGGATCAATTTCAGCCATCAGTTCATCCGCTCTCTGATGTACCCGGTTCAGCGACGGCGCCAGCCCTTCAATCAGTGGGTTTTCTCCGTCCCAGGCAGGCCCTTCCGGCAGAAGGTGATAAAGTAACTGCGTATATTCGTCCTGTAATGCCATAGTTATCCGTTCTCCCCGGTATAGGTGGCCCAGGTTATATTCCCCAGGACAGGAAGTTCAGTTTTTCCCAGCACCACATCTGCCGCCGGCACACGCAGCTGATGTGCCACTTCCCCGGTCGCCAGACTTATCGCCTCGCTGATTCGCGAAACATAAATTTTTCCGGACGGCGCGCCATCACGCAGCATCAGCGCATTTAGCTCCGCAATAATGGCAGTACGAATTTCCGGGGTATCTTTGGCCAGTGCGACTGTTACCGGAATGCTTTTTTCAGTGGCAGCGAAAACAAAGAGTCCGCCGCCAGCAACAGGTGCCAGCGGCAAAATATGGTCACGTACAGCCTTAACGAGATCGTCGCCAGGAGCCGGATTCACCGGGTTACTGGTAGCCACCATCACACCAACGGTGCCGGTCCCCTTATAATGGCGGAATGTCCACGCACGGGTTATTCCCGCGATTTCCTTTGCCCAGATGACGTAATCAGGATCAGCGCCCCCCTGTGGTATCCAGTAATAGCGTTCCATGACGCGCGCGCGCCACGTTTCAGGCTCCTCTGTATCAGCCCCCCCGGTCAGAGTGTCAGCGTAACCTGTAGAAGGAATACCAGTAATCGGCGTGCCAAGGCGTAACGCCGTACCATCGTCAGTATTACCGGCAGTTCCCGCCACATCAGCAATAACCGGCACACGTAACAGGCCGCCGGAAGCTTTCACCGTCTGCAGGGTCGTGAATGTAACCTGATCATCCCGCTGAATCTGTGTACCCGCGGGGATCTCCGGCGTTCCGGCAATACCATCCCAGCGTGCAAATCCCTTCGCAGATACGGCATTTTTCCTGGGACAACGCTTAATCCTCGCGTGACGGTAAAGCCAGTCCTCATCACACATATCAGGCAGCATATTCCGGGCCAGATAATCGATATAACCATACAGCGTATGTACGGCAGCAGCCTGTACCCGGCTGTAAACCTCGGCATCCATGCGACGTAACACAACATCCTGCTGAAAACGGGTCAGTAAATCGCTGCGAATGGTAGCAATCAACTGAGGAAGTTCAGGACGTGCAAATTGACTGTCAGCCATTAAGTTTGCTCCATATATCATCGAATGTAATATTGTGAATTACCCCGTCCCGCTGATATATCGTCACGCCAGCTGCCAGGGTATCTGTTCCTGTGCGTTCAGATGTCACATCAATACGTGCCGCCACGCCATCGTCTGTCATCCACGCCAGCGCCTGCTGCATGTATTCGCGGGCATCCTGCGGCGTTTTATTGGTGAGTTTGCGGCGTTTCAGCAGGTAGAGGCGGGAACCGATGCGGTCATTCTGAACAGCAGGCCAGGTGTCCCCCCACCAGCCGTATGGCTGTGGGGTCCTGTCATCCCGCTCCGCCCGGCGCCAGGTAAAAAGAGAAATCACCACTGCCCGCGTCAGAAAGTCGAGCGAAGCCGTGGCATCCTTACGGATTCCATTAACATAAAGGATCATGGTGTCAGCTCATGGGTTGGCCAGGCTTATCGGTTATACCGCCGCCATCGCCATTTTCTTTATGGGTATGACCGTTATAGGTCGTGCGCATTTCAGCCATCGTTTTTCCACTGCTGTCACAGTTGTCCCTGATATCGCCAGTGGATTCGATCGGCATTTCAAAACGTGCTTTAGTGGCATTCGTGAAAATAACTGGCTTTCCGCCGCCATTTACCACTATTCCGGCTCGGGTTAATGTGACCGACTGCCCCTGATCGTCATATAGCGCGACTTCCCCGCGCGCCAGCCCTTTCAGTCTGAAGCGGCGGTCAGCCACAACCACAGCCACTCCGTGCGAACGGCCACCGCCGGGAAACAATACCACCGCTTCTGCGCCATTCTGTGCTGCAGAGGTGAAACCGTAAGGTTCAAGATGCTCCACATTCTCTTTTTTTTCACCGGCAATAAGTTTCAGTCCGGCAGTCTGGCATTTTCTGACGGTATCAATCGCGGTAATGACTGCGCGCGTTATCATGTTCTGAAGAGGATGGTTAGCCATCAGAAATCTGCCTCCTCACTGACTTTTTTCTTCGCTTTCGGCCTGAATGGTTCAGGAAGATAGGCATCCGCAGGCCCCACCCGGATTTCGGTCAGGGTGCCGTTATTGTCCTGGCTGTACGTCACTTCGGCGATCACCAGCGTTTCATTGTCAAAACCGTTCAGCGGGTCATACACCACCACGGCCTGATTCGGTTTCCACAATTCGCCATTCCCCTGTCTCCATCCCTGTACGGTATAGGTGGTTTCCAGCGTTTTCGCCGCACGCTGACGGGCTTCAAATTCACAGCGGGATTTGCAGCTGTCAGTTGTGGCAGTTCCTGACTGCTGAATGGTGTGGGGACGATACCGCGTGACGCCTGCATCACCAGTACTCTGCCGGATAGCAGCAATGGTTGCCTCGCCGAAATCGTCATCCGTACCAGGACGCTGCCCCGTAACCAGATAACTGGAGAAACGCTCACGAACACTACGCTCGGTATCACAGGAAAGAATATTTTCGCCAAGTACCAGTGCCGTGGCTGCTTTCATACTGCCCGGCCTGCCGAGAACCAGCCGTCCCCGTTCGTCGTCATATGCCAGCGCCTGAGCCTGTCCAAGCAGCCTGTTCAGACAGTCCACAACCGTTTCACCATGTTCCGGCTGAGCCTCAATAACGGCGGCTGCCGGCGCGCCTGCATCAACAACGTCCACACCGAATGGCCGGGCAAGTGCGCTGGCGATCAGGAATAAATTTTTCCCGTTATGCTGTGCAGGCGATGCAGAACAGTCGATAAGATCTGCCGTTTTGCTGCGCCCGACAATGCCCGTCATAATGGTCTGCGCATCATAACGTAGCGGTAACGCCTCAACCCAGCCGGTAATAACTAAATCATCGCCAATGAGTACCTCTACAGCGTCACCATTTTTTACTGGCGGTACGTCTTCTCCACCAGGCCACTGCCGGGTGATCGAGACATTAAAGTCCCGGGCAATACGGTCAATGCCCGCACTTATCCGTACTGACGTCCATCCTCCCCAGTCACGCCCGTTGACGCGTAAAAAAACCGTATTATTCATCGTACCGGAACCCTCAGCGGCTCAACCGGGATAAATCCCGGATGGGGAACGGGATTACGAGTGAGGATATCAGATTCCCGCCCGGCGTCGTCATACCAGGCTGCAGCCAGTACCAGTGCAGGCAGAACATCATCAGGCGTTCGCAATGCAGTACGTTCAACCTGTGCCAGTCGTGCAGAAATATCGCGATTGAGATCCGTCCGCATAACGGAAATTTGCTGGAAAAGCACATCATCCCGGATACGCAACTGCTCCTGGTCAATCGCAGCATTGAGCGCGGTCCGGATAGCTTTCAGATCTTCATAATTCGGTGGAAAGCTGCCATTACTGACTGTCTGTACACCATCCAGCGCCGGGTGCATGACAGTGATAATGTCTGAGTCACGGCCTGTTCCTGCAGGCTGATTTACGCCCCGGACATCAGGTACATCACGCGGCTGCTTCAGTGTTGTCACGGCGTGGACGGCTGTGCTGATGGCTGTTGTCCTGATGGCGGCTGCGATCATATTGCGTTGCATTTTCTGTTTCGCAGCAGATCCGGAGTCAGTGGGCCAGGTGCCACGGGGGGAAAGACCGGGATCAAGGGTGATACCTGACATCGTTTTTATCATCGTGACCAGATCCGATGTACTGCCTCTGAGCCTGTCACCTGAGCGCCAGGCTTTTTGCAGTGCGTTAACGAAATCACTTGCGGCGCCCGGTGGCATCAGAATGACAGACAAATCCCCCTGTAACAGCCGCATTGCGGCAGACACGCCGGAGTCAACCATCCTGAAAGCATCGGCAACATCGCCCAGCATGGAGGCAGCATCGGCAATGACATCGTTCTGGATAAAATCAGAAATACCTGACAACGAGAATGTGGAAAACATACTGTCAATCGCATCGTCGAAAAGCCCGCCTGATGTTTCCAGGCGCTTCGCCGTTGCCATTCCCGCCACAGGAAAAGAAAGTTCACCACTTTCCACAAACTGAAAGGAGACACGACACATGCGCCCTTCTGTACTGCTGTGAGTGATCCTGACCTGTCCGTCAATGCTGCCCTGCATTTCGCCATACTGCGGATGGACCAGCGTACCAGGGCCTGCGGTTTCAATGGCACCAATAAGACGATCCCGCCTGTCTGCGTAATCATCACCGACAAGATAAGCATTTATCGTCAGGCGGCGCGTGGCGCGACCTAAATCCTCCGTCCAGGGCTTATCCCTGTTCGGATATTCATGTACCTGTACGCGGCGTCCAAACGTGCTTTCATCATCTTCAACGGAGAAAGGCACTCCACGAAATGATGCATCACGCAGGCGCCCGCGCCAGCCAGTTGAGGAGAAAAAAGCCATATTTACCCCATAAGAAAACCTGCCGGAGCAGGTTTATCGTGATGTACGAAAGGGTGAGTAACCCACATCATGGCTGATGTTCATCAATGGATTACCGGATTTCGGTATATCAGTCACACGCATACCTTGTGGTGCATTCTCAAATGTCACTTTGAGTTCGCTGCGCTGCGTTGATGGCGGGACAGCTCGCCCGAGTACGCCAGAACGCCGGGTCAGTGGCACATAAGGTTGATAACGCCCCTGCGGAATCGGGGTGTCCATACCAAGAAGCTCTTTGAGTCTGGGAATAAAACCGTTATACCCGCGTTCACGCTCCTTCGTTTGCAGCTTCTGTACAGCGAATGCGCCAGCATCCATACCCGCATCCTTTGCCCCCTGCTCCAGATCCTTAAGCTCTTTAAAGAGTGACACCGCCACGCCAATTGTCAGCGTCATGGCCCCCATCCGGCCAATTTTACCCAGCAGACCGGAAAGCCGTCCGGCCAGCAGGACGGACTGCTGCAGGGCACCAATGGTCCTGACGGTAAAAGAACCAGCCATAACCAGACCAACCCCTTCAATCACCGTCTCCCATCCGCCCATCTCCTGCGCAACGTTATCGACCCCCTGCCATACCGCCTTAATCACCGGAGCAACATCGTCCCAGTTCTCAATGATCAGCATAGCGCCGGCCACCAGCGCCGCAATGGCGACTTTCGCCGGAGAGAGGTTAATGACACTGTTCAGGATTTTGACAGCCCGGGACAGGCTGCCAATGGATACGCCAACAGCCAGCAGCGCCGCGCCGAACTTCGCCGCAGACTGAACCAGTTCAGGATTCGCGCGAACGAATGTCCGGAGCTGCTCCAGGTAAGGCATGACCGCTTCTGCAGCTTCGTTAATGGCGGGCAGGAAGGTATCGCCCAGCGTTACCGAAATCGCATTGACGCTGTTTTTCAGCAGAACCAGCTGGTTTTCTGTTGTGGACGCGCGGGATGCGTATTCCTTCTGCATCGAGCCGCCATATTCCTGGGCATCAGCCACACGATCAAAATTGGTGCGTAACAAATCCAGGTTGGTCAGCAGCGGGGCAATCGCGCTAAGTGACTCCTTGCCAAACAGCGCATTCATGACGGCGGCCTGTTTAGCTTTTGGCACTTTCGCGAGCGAGTCCAGCACCTTCAGCATGGCCCCGCGCGAATCCTTTTGCATATCCTCAGCGAGTTTCCGGGGATTCAGCTTCAGGAAAGCCATAGCCTGTTTCTGGGCTTTGGTTGCCGAATTACCTGCGGTTAACGACAGCATGAAGTTTTTGATGCCGGTGGAGGCAATTTCTGATTCAACCCCCATCCCGGCAATGGTGGCGCCCATCGCGGCAATTTCGCCGGATGCCACTCCGGCAACACCGCCCAGCGGACCAATCCGCGTCACGATATCAGAAATTTTCTTCGCATTTGCCGGGCCGGTATTCCCCAGATAGTTGATTTTATCGGCCAGGACAACCACGTCTTCCTGCGTCAGTCTGAACGCTGTCCGCCACTGCGCCATCATCTGACCGGACTCTTCGGCAGTGGTATCAAACGCCACACCCATTTTCACTGCGTCGTTCGCAAACTGCATCAAATCGCCGCGGGCAATGCCTGCCTGCCCGCCCGCCGCCACGATCTCTGCAATTCCCTCCGCCGCCATCGGTAACTGTGTGGACAGCGTCAGGATATCGTCACTCATCTGCGCGAATGCTTTTTTATCATCCAGGCCGTCAACCACCTTCCGGATGTCAGCCATTTTTGACTCAAAGCCGATCGCAGCATTCACGGGCAGCGCCAGCGCCCCAAGAACAGCGGTCCCGGCAGCGGCAGCACCGATCGCCAGCCCGGCCATTTCTTTCTGAAATCCCTTCAGTTCCCGCTGCATCCCTTTCAGCGGACCCGATAACTGGTCAACGGCAGTGATAATGGCCTTTAACTGGAAACTGTCAGCCATGCTTCATTTCCTCATTGATACGGACAGCCTCCGACTCCAGCTCCAGAAAATCGGATATCGCCGCCCGCCGGAGCTCCAGGGGATTTATTCGCCAGAAGTATGCGGTGTTGTAGACCCGCTTTCTGAGTCCTCCTCCGTCTCCGACCGGGTAAAAAAATTGAGGATCAACATACAGGCTTTGAAAATATCCAGTTTTGCCAGTTGCGCTGCCGAGGAGCGTGGAATACCTGCCAGCACAGGGATATATTTCAGAGCAACCGAACTGTCCAGCCGGACGCCGCCATCACCGGAAACGGTGAACGGAAAACCAATGGCTTCGATTTCATCGTAGGACGGTTCGCGCAGCTCCAGCACATGAAGCTTTTCGTTATGCGCCATAATCGGCTTTTTGAGCACAAGTTCTTTTATCACTGGTAAAATCCCTCCTCACCGTGGAACTCAAGATCCACGGTGCCCTCTTCCGGGTTATGGTTGGCTTCGCCGTGCAGCCAGGCGTTTGAGAGAACATACACCTGACCATTTGCCAGCTCTGATGTGATGGTCATGACATCAGAAGACGTAATTTTATCGACCGGGAAGTTTTTCGGCACTTTGGCGGTCACCTTCGTATACGGTGCCCGGCTGGTTTCCTTGTAGTCAACGGAACCATCCAGGCCAATCACGTCGTCACGAACTTTGGTGTTCATGGGGACTTCAATCCCTCCGGTTACCGACAGTTGCTGTCCGTCGATTTTGAAATATGTTGTTCCCGCAATTTTTCCCATTATGCAGCCTCCTCGCTGTACTGCAGACGGAACTGGTTAAGCACCGCAAACACACGTAACTGATTGACATAATCAGGCGGAAACAGCACATCAAGGCGGTTCGAATCGTTCGCGTTACGCTCAACTATCAGATGTTGCTGGAACAGATCGAAGTTTTCCACGATGCCTTCCCGCTCCAGCTGGCGATATGTTGATCCCAGCTCACCACGGATAACGGCAGGCGTGACAATGGCCTGACCAGGCCCGAAACGCGTACCATCATTAGCAAGTTTATGGCGCCCGTATTTACTGGTAATAACAGATTTCAGACGGCGCAACACATAAGCACTGGTATGCAGCGTCTCGCTGTCAAGGTAGCTGTTATCCGCCACACCATACGCATTTTTCCTGTACGTCGTGATATCCCGCTGAATACGCAGCACGCCGCTTTCCACATACGCCGTTGCCACACCGTGGGAAAGTAACGTCTGCTGTTCAGTCGTCGTGAAGCGTTTGCCTTTCGGTGCCGGCAGCATGTCCACCAGTTCCCCGGTCTGGGTCGGGCGCGCCGGATCGTTACGGATAAAAACCGCAGCACGGGCAGTACGGCTTGCAGCCAGTTCATCAGCAGGCGTCTGGGTGTCTTTCTCATAGCCCGCCAGGGTGATGTGCTGCAGGTTAAACTGGTCACCCGCGGCCACAAGCTCCGACAGCGTCCCCGTCTTCGCCGTATAAACGTGACCATACAACTGCCGGATATAACTCCAGCGACCGCTGGAATCATTCATTTCAGTTGCCATCGTGTTCACCGATGCCGTGTCGTTAAACGGAAGGCCGATATAATCGAACGGCTCATCTCCCATCGCTGCCACCGCGTCGTTAAGAGCTGGCGCACCAGCCCCCTTCACGCCGCTGGCAACCGTAATATTCACACCTGCCGGTAACACCTCCCCACCGCCAAAGCCGTAATAATTGAGAGTGACCGGAATTTCATTTCCATATAACCCCTTGTGGCGCGCAGTCAGTGTCACCACCCCCGCTTCTGATGTTGCCGTAAAGGGAAGATCAGGGTTTGCATTGACCGCATCCTTAATGCTCACAGCCACCGCCGCAGCGTCATCACCGCTGGTCACGGGAGCCTGAACGCGGGTTCGGCCGGTATAGACATTCACCGTTCCGGTTTCCGTCGCTTCGCCAGTTACCGTCAAAGCGACGGTTGCTGCCGCGCCTGTGGATTCAGGTACGGCAATGACATACAGTTCGCCAAATGGATCGGTCTTACGGTACGCCCCGACCATACGGGCCAGCTGGCTTCCTGCACCGCAAATCTGACGGGCATAATCAACCGATGACACCAGAACAAGACTGTTGACGGCAATTGACGCATCATTGCTGGCGTGACCAATCAGCAGTGATGCCCCGCTGTCCCGGGCGGTATTTGCCGCCGAGTTATCCATCTCGGCATAAAACAGCGGAACCCGCGTATCTGACGGGATGGAATTAAAACTAATCGCCATTTGTTTTCACCTTTTTATTCGTGCGCCGGACATCACCAGCGGCCTCGCGGCGCAGCCAGTAGTTATTCTCATCAACATTTCGACCTCCTTCAGGTAAAAGGTCGCCACGGGCCGGATCGGGAACCGATCGCCCTTTTGCGGGTTTCACAAACATGGTTTATTCCTGAAATGTAATTTCGGTGTGGTGCTCGATGTCGCCATCTGGCCCGGTACCGGGTTCGATAAAATCAACATCAATACTGAGCGTTTTAAGGTCGGGCAGGCCGTCCAGATCATCCTGCTGGCGGGTGTCTGTTTCGGTAATTTCATACTTCACCGTGAAGTCGAACTGGTAATACAGTTCGTGGCGGTTCAGATCGAGAAGCATCCCACCCGCATACTGAATTTCATGCGCCTGCGGGTCCGGTTCCCACCCCAGCAGCGCCTTCCAGATTTCCTGCCTGACGTCGTGAACTGCGTCGTAAGAAGCCCACTGCCCTTTTTCATCCCGTTCGTTGCTGAGTACCACGATGACGGAAAAACCCTCCGTCAAATCCTGCCAGTAGTCGGTCTGCGATTTCTGCTCACCCGTGACGTCTTCGGCTGGCACAACATACGCGGCTGGTAGCCTGAGCTTTCCGGCCTCCGGTATCGCTTTAAACTGCGCTGCGCCACCCACACGGTTTTCAAACCGAGGGCAACGGCTGCGAAGTGCCGCAATAATCGGGGTTAATTTCATTTTTTCTTCCTTCGCTGAGGACGGAGTGATTTTCGCAATTCGCGGGAGAGCACATAACGTGTCCAGCTGCGGCGTTTATCCAGAACCTCAGTCATGTAGTTGTTACGTGGTGCCACACGCCAGCCGCTGCCGCCTGATGCGCCTCGATGATGGCCTTTCTTACGCTTCGCCCCACGGCGAACACCGTAGAACAGAAAGGCAGGGTAAAATGCGCCTGAGATCGGGCGGTTCCCTTCCCCGTTCTTCTGATTAGGGGCAATTTTCACCATCAATCCAGGACGGCGACTGGATGCCCGCGGAACGTAATACCCGATGGAGCGTGCCAGTTTTCCCGTTCTGTATGAAGGATTATCGCCGGGTCCGGAACGCCCCCGCTTCATGACCAGGCGGCGGGCATCGCGCATATGTACCTGCCCGATACTGACAAACGCCCGGCGCATACGCGCCCGGTTAAAAACAAGCGTTTCCGGCTGTTCAAAATCAACGTGTAAATATGCTTTCTGCGGCATAGTCACTCCCGTTATCGGTACCCAGCTCTTCGCACTCGAGCAACAGAAAGCGGCGTTTACTGTTCAGATCACGGACCCGTTTAACCCGATAAGAAATATCGTCGTGGAGCACTTCATGATCGGCGGTGATACCGCGGCGAAAACGGATGGTGAAATAGTGCGTCACCCTGTTTTCTATCTGCACAGCCCCCTGATAAGCTGCCGCGCCGGGTTGCGCTTTTTTGGCCCACGCCCGGATCTGCTCCGGGTACGTCGGCGTTACGCCAAAGTCATCAGCCGGAACATCGACACGCCGCCGGATAACAATGCGCTGGTCAAGTTCGCCTGGGTCGGGCAAAAGGTATGTGGCGCTGGCCTGCGCCTGCCTGAGTTTCATAGCGGGATGTACCTGTATGGACCGACGAGCCAGTTAAAGCTCATCGGCAGTTCGACTTTCTCCACTTCGGTGACGGTAGAACGATTCTCGTAGAAGTGCGTCACCAGAAGCAGCATCCCCATCCTGATGTCATCCGAGAGAATAAGCCCCTCCGGATCGTCGGCTGGCACCCCCGCCTCCGCCGTATAAAGTCTCCGGTTCAGAAAATTTTCTGTCCTGGCCTGAACCGCCCGCCCCAACAGCTCAAGAAATTTATCTTCATCGGCGTAATCCTCATCCAGCCTGAGCTGCAACTTGATCTCCTCAGGAGAAAGCAACATAGGATCCTCCTGCGCCCGCCGGGTGGCGGGCACAAAAAAACCGCTTAACGCGGCATGGTTTGTTCAGAGGTGAGAGGGATTAGCTGCTTGCCGAGCCTTTGCCCACCAGCGCTTTAATCGCAGAGGTATCTTCGAGAATACAGTCAAAGCGATGGAACGCCAGGAAGCCGGTCTGGTCGAATTCCGCGTAACGCTCCACCAGGCGCTTCAGGATCATGTAGCGAACACGGCGGATAATGAAGCGGTCGAAGTCGCCACAGAACATGAATTTTTTGCCCGCGCCAATATCATCGATCTCCTGATCAATAACGTACGGAACATTCAGCACTGATGCTGGCGCCACGCCGACGATATCAGGCAGCCAGAGTGGACGGCCCTGACCGTCTTCCATCTCGCTGATGAGTTTCAGCGTATTGTCATTGAACGCCAGGCGGAACTTCGGCCCGCGGCGGTACGCCGGATCAATACTGTGTTTCAGCGCCAGAATCTCTTGCCATTTAACAGCTCCGGCAGCGGCCGTCTGCGTAGTGCCGGTTACGGATGCTTTCAGACCCTTAGGCTGTTTTGGCGTACCGGTGCCGGTCCCCTGAATAAGGTAACGCGCTTCACCGCGGCCAATGCGCTCCGCAATACGGCGGGCGAGATAGGCTTCCATGTCGATCGCACTGTCCTGTAGCAGCTCGTTGGATACGCGGATAATTTTGGATGTCATTTTCAGCGCGCCCAGACTATCCATACCGAATTCGGTATCTTCTTCACCCGCTTCTTCGTTTTCACCCAGCAGCACACCCACTTCAGCGGTACCATCAGCAGTGGCCCATTCCATAGTGCGCCCATCGGATGTAGCGAGGATCTGCGCCACGCTGGCAATACCACCGTAGGATTTCATCTGTTCGACCACTTTCGCCAGGAAGGTATCAGGCACGGTATAGCCGCCCTTTTCATCCGGCGCCACACCCTGCGCACGCAGTTCGCGTAAGGCTTTGCGCTCTTCGGAACTCAGTTCGCTGGCGCCGTGACGCATCCATTTATCAAAAATCTGGCCGCGTTTTTCGTCCTGCTGCGGGTCTTTATCAGGATCCTGATTATTGCGCTGCTCTTCCTCGTTTTCATCAACGTAGGTCTGGTCCTGGCGGCGCAGCTCTTCTTCGCGGGCGATGCGCTCGTCGAGTGCTTCCAGTTCAGATTTTGCCTTGTTCCATTCGGTACGCTGCTCATCCGTCCATGGGTTATCGCCGATTTTTTCGTTCAGCGCGCGCATGTCGGTCGCGATGGTGTTACGTTTTTGTTTCAGTTCATGCAATTTCATGGTTTTTCCTTACGCGTTAAGAAGGGTCAGGACGCGCTCACGCGCCATTCGTTGGTTAATGGCTTTCTGCAGTGCGCTACTATCGCGCGCCTCCTGCCAGGCTTTCATAGAGCGGACGGCGGAATCTGCCTCCTGATACGCCGGATATGTCACAGGGCTGACATCCAGCAGACGGGAAAAACGGGTAATCTCACGAATCACCACACCATCCTCGTCCTGGTACCATTCCTCTCCGTCGCGGGCGACGCGAAATGCAAAAGAGGACTGGTTGATATCCCCGCGCTGCATTGGTGCCAGCACCAGATCACGGATTGTCTGAGTTTCTGGCGCGGTGATGTCATAACGCAGCCCCCGCTCATCAACCGTCAGTGCCAGCGTGCCCGCACTTCTGCGACCCAGGATAAAATTGGGGTCATGGTTGAACAACGCCCGTACATCGTCATTCAGCACTTCATCAAACGCACCGGGCCGGATGATTTCGCGAAACGAACCGAAAATCAGTTCAGAACGACTGTCAAAGACCGAACCATACCCGATGATCCGGCTGGGCTCGCTGTCGTGCGTTTCTGCGCGCACCTCGCCGCTGTAACAGCGAATTTCACGTTCACTCATCTTGAGTGTTCTCCTGGGTTGTGGATTTGGCTGGCCGGGAGGCGTTAACGCTGACCAGCATTTCATCAAGGCCGTCTTTCGGATTCATATCCTCAAACGCGCGTGCTTCGTTGCGGCTCATCCAGCCATCGGTGATAGCGAAGTGATAGAACTCCGCGCGCTCTTTGGCAGTACCGCGCAATAAACCCGCCAGGTTAAAGCGCACGTAATACCCGGCTTCCCGTTCGGCGCGGGTGAACAACCGACGGTTAAGCTCCTGCTCCCAGTTCGTCACCCACGGCATCATTGTGTAGCGAACAAACTGAATCGCCTGTTCGGAAATATTGGAGAAGGTGGCTTTTTCGAGGTCGTTGATCATGTGTGCCGGCACGTTGAAAATCCCGGCAATCATGGAACGGTTGAGCTTCATCATGTCGATGAGCTGGGCATCGACTGGGGAAACCGTCAGCGCTTTATAATCCAGTTCAGCCGGGAGCAACATTGTCCTGTTTTCCTGGCTGCGCAGCATCGCCGTGGCTTTTTGCCACATCTCTTTCAGCCTTTTCCAGGAGCCGTCATTCAATTCTCCTTTTACTGAAACTATGCCCGCTGGTCTGGCGTTACCGCTGAAAAAACTTTCCGTGTATTTCTGCCCGCTCATACCCATACCGATGGTTTCGGCGTGCTGAAGAACCGGACTGAGCCCCATTTTCTGATCGTTACCCAACGCCCTGACGTGGATCATGTCATCAGGATTAATGGCAAAGGAACCTTCTTCGTTATACACGCCGTAGGTATAGCGCCCGCCGGTGTTCAGCAGCGTTGTTTCCCACGGCATACAGGCTTCAAGGCCAGTGACTTCACCGGTTCGGCGGTGACGGAGAACTCTGGTATATCCGTTACCCCAGCCCAGAATGTGACGTTGTTTGAGCTCGCGCCATTTATAGCTGGTCTGCCAGGAATTGGGTTCGTCATGAACCAGGTAAAAGGCAGGATGGTCGCGGGCAGTTTCAACCTTCTTCCCGGTTCGCCGCATGACGTGCAGGGGCATCTGCGCAACGTTGGATGAAATAACGTAGATACATGCATACACCGCCGCCAGTTTCATTGCCGTCCGGGGGTTAACAATCACATCACCATTAAAGATCCCGTCGTTTTCGACCGCTTCAACGGTGACCGGAACAGCTGGGTTTTCCAGCGAGTTGCTTCTGAAAATGGCATCAATCAGCATGTTTTTATTCTCCTGGCCGCCAGCAGTGCCCACAGCAGCAGGCCACTACCACCAGCCATAAGAGCAACCGCCGCGCCAAATTTGAGGTAAATACCTCCCACCAGCGCGCCGAAGCCTGCCACCCCGGCCACATCGATAATTAGTGATTTCACAGGAATAACAGTTCCTCATCAGGATCGAGGTTAGAAAGAAAGTCTTTCGGCTCGTTCAGCATTGCGCGGCCAACCCCCATCATCAGGCCAACTGCACCATCAATTTTGTTGCCTGCGCCTTCTTTCACCGGGCGAACAACATCGTCGCTACCAGGCAGGTACTTGCCAACCACGTTCGAAATACACCAGGTCATCAAGGGATTACCGTCATGATGGAATCGGCCAGCAGCGATCGCAGCCTCAATCTCACGCATCGGGTCGCTCATGTTCGTGTAGTTCTGGGTAATGGTGACAGGTTCAAGCCCTTCATCCTGCAGCATATGAGATAAGCCTGTTGCACCGTAGGGGTCAATCGGACTCGCGGCTATCTTCACCGTTTCCCGTAATTTCAGGATCGCTTCCAGGATAAGGCGGTAATCCACTTCTGCACCGTCTGACGGAACCAGCACGCCCTGATTAACAAAAGACTGGTAACGGTCTGCAATAGTTTTCAACGCCGGGTCCGTGGCGTAGACGGTGTCTTCCGGTACCCAGAACATAGGCGAAACGCAGTAATAATGACTCAGGCCGTCTATTTCACGGCGGAATACCGGCACCACTGCATTAAGGTCAAGTTTTGATGCCAGGTCGATGCCGGGATAACACTCCTCACCTGCAAAATCGGACAGTCTGAGCGTTTTGTCTGCTGCGGTCATCCACTTCTGCAGGTTGTAGTAAGCTGCTTTAGAACTCACCCATTTGTTGAAATGCTTGGTGAGTATTTTATTGGTCTGGCCTGGCGTGGACATCGCCAGCAACTGTTTAGCCTTGAGGAATCCCTCTTTCACCGAAATGTTGTAATTCGGGTTGGCTTTGATCAGAGCTTCCGGCTGTGTCCAGTCATCGTCATCATCCAGGGTATAGATGATCCCGAAAATTGCCTCGTTTTCACCACCCTCCCGGATGCGCTCCAGTATCTCGACCACCTGAGTACGTTTTTCATAGCAAGGCGAGGCAATATCAAAGCCTGCCGTGGTGATGATCAGCGTGATGGGCTGCTCCCTCGCCCCCATCCCGGTAGTCATTGTGGTGTATAGCGCGTCAGTATCATGCTCGTGGTACTCATCGATGATCGCACATGATGGTGAGTCGCCATCTCCAGGGTCACCGATAATTGGCGCGAACAGGGAACCATCCGGGCGAGTCATTTTCTTTGCCCAGGGTTTGATACAGAACTTCTGACGCAACGCCGGCAGCTTTTTCACCATCGCCAGTGCAGGCGCAAAAACTTTCCAGGCTTGTTTTTCCGTTGTGGCACCACAGTAAACTTCCGCTGCGTACTCGCCATCTGCACAGAACATATAGTTACCGACGGCGGCCGCAATCGCCGATTTCCCATTTTTACGCGGTACCTCGATGTAAATCTCAGTGAAGCGGCGAAAACCGGTATCCTTGCGCACCCAGCCAAACGGCACGCCCAGCGCAAATTTTTGCCAGGGTTCAAATTCTATCCGCAACTTCCGGCGAGCCCACTCTCCGGAGGTGTGCGGCATTTTCTGGGAAAAGCGAAGGAAACGTTCTGCTTTATTTTTATCGAAGCGGTAAGGCCAATGCGGATCTTTGGCACGTTCCAGGTCGTCAAGATGTCGCTGACAGGCAAGCATGGTTAACCGGCAGGCCAGTATCTTCCCGTTCACGACGTCCCGCGCATACTGGTTCGCCGCATTGACGTTCGGATATGTAGCCATCAGTCAAACTCATCAAATTCATTCCCTTCATCGTCCGGATCATTTTTTCCGCTGGTCATTCTTATGCGGCTGAGCGGGTCTAACCCGAGAAGTGAACCCAGACGGGCGAGCTGCGAAACGGAGTCATTACGGACATTGACTGCAGGGTGTTTTTTCTCACCACCCATTTCACTTGATACGGTCAGGCCTTCTTCCGCGATGACTTTTTCGGCCTCAATCATCAAGTGAAACGCATTGCAGTACGCCAGGAGTAGCGGCGCGTCTTCAAGATCAAAAACGCCCCGCTCAATTAAAATTTTGCTCTGCGTTTTCCAGATGCGGATCGCGATATCACTCATTAACTCTTCCGGCGGTGCGATCCTGGTCAGCTTGCTTTTCTGGCCCGAAGGCAAATTGCGCTTACGGCCACCACCGGAAGATCTCACAACAGCACCCATCAAAACCTCCAGTTCAATAGGTTGAACCTTCCGGAAAAAAGTTTCTTATTTTTGGCGCGTAAAAATTTGATGAGGCGGGCAGTCCGGAAGACGTCAGGCCACAGGGATTTACCCCGCCCCTCCCCTCTGGCTGTGGAAACTGGTTTTTATTTCAGCCGTTCACGAGCCGTCTTCGCCTTATGGCAGGGCCAGCACAGACTCTGCAGATTACTGTCGGCATCAGTTCCGCCATGCGCTTTAGGGATAATGTGGTCAACAGTTTTCGCCTCACGCACCACACCGGCACGCAGACATAACTGGCATAAACCTTTATCACGCTTCAGTATGCGCTCACGGATAACGTCCCACTTCGAACCATAACCGCGTTGATGACGGGATTGTCCTGGCTTGTATTGTTTCCAGCCCTCGCTTCTGTGACTTTCACAATATCCGGATGGATCTGTTGTTGTACTGCGACAGCCGCGAACACGACAGGATTTAGGTGTTCTTGGTGGCATATAAACTCCGGTAAAAAGCACCGCTAGTGCGGGGCTATGGGATTGTTGGTTGACTCTCTCACCGAGTTGTAAATACGCTCACACGTCATTCCTGCGCGGTAGCTTTCGTCAGATCGCTCAGCATAATATCGAGCTTCTTCTGCAAGGCGTCCGAGCATGTCGGCGAGCACTGCGGCGTCGGCTCCGGCTGTTTTGCTTCTGACGGCAGCGGCAAGATCTGAGGTGTGCTTTGCGGCGTCCAGGCGGGCGGCAAGCTTTGTTGCTTCGGTACGCAGCTGGCTAACAGTGGCAGACAGGCCAGCAGCAGTGGCAGCAGATTTAGCGGCTTGTGCTTGTGCATCTTTTACAGCCTCATCACGGGCAATTATGCGCCCTTGTTCAATCCAGCGTGCGGCAGTCTGCGCGTTCGCTTCCTGTGAAGATTCCATGCTATTGCGGTCAGCCCACTTCTTTTGCCAGCCCCGATCACTCCAGATGATCCCGGCAAGAAATGCACCAGCCAACATCAGCAAAACAATGATTGTTTTCCACCGCGCCTTAACGAAAGCAAAGACCGCTGTCATACCAGCAACGCCGCCCGCGCTTTGTTATAACGACTATTTCTGTCAGCCAGTCCATTCTGGCCACCGTTGATGATCTGCGTTACACGGACAACATCACCTGAATACATCAGGCAACCACGTAATGTGAAAAACCATGCAGCAGAACGGGCTGCATGCTTCTCCTGTGCCAGCAACTCTGGTGTGCTGATCAGATCAAGCTTCAGCGCAGCTCCGCATTTGACGTAGTTCTCGCGGCCGGTGATTTGAAGCAGACCACGCCCGCGATACTTCCAGCCATCTCCGGCGTCTTTGTTACCCATGCGGCCACCGTAAACCAGATTGGCTATTTGCGGCTGGTGGGCAACCTGGCGACCATCAATACGCCCCAGCATTTCACACTGATACGGCGTCAGGCGTTTACCAAACGTCTTTTTCAGCGCCTCCACCGAATAATTGAAGCTTTCCTTCAGAACAGTAAATCCTGCTGATTCATGTCCCGTTTGTGCAATAAACATGGCCTGATCATTAACTGCTGTAATACCAAACTCTTTCATTGCCGCATCAATGTGCGGAAACCAGCGTGCAGAAAGCCCGGCGCTAATACCAGCCGCCTGCTGAAATTGTTGTTGATTCATCAGTGCCTCAGTGCATCGACCAGACGCGCCACATTACCGCGAGCCCACAGCACAGCGGCGCAGATAAGGATATTCACCATCACCCCCAGCCAGTGGGATGATTCATATAAACCAAAAACAAACCGGAAAGGGACGCTGGCATATACCAGCACCATGACATAGGCCAGTAACGAAATCAGGGGGCGGTGTGTCGCATCACCGCGTCGGTAAAACATCAGAACGATGACTATCACCCCACAAATTACGGCATTCAGAGCTGCAGAAGGGTCATTTGCTACCATCTGATCCCCCTCCCCTGATACGAGAGAGAATACTGAACAGGGTGTTCAGATCCTGACTGTTGAGAAAAGTGAGAAACTTTATACACATTGCAGAAATAATCACTGCGCCAAGTGCATCCAGTGGTTTTTCATAATGCGTTATTGCCGCAAGCTTAGTACCTATCAGCCCGGCACCAAGCACTCCCACAATAAATGATGTAATAAAATAAGCGACCAGCCTGATGCGTCCGATGTTGGTTGCCGTGGCGACATAAAACACCGCGCCGGCAAAAGCGCCGAATACCACACCATAATCAGTTCCGGTTGCCAGACCGAATACACTGGCCCCCATTAATCCCCCAGCCAACACTGTCGCACTGGATACAGGTTCGGACATTCATCCCCCTCTGGTTATGTGGGTCCTCTCAGTTATGAGGGGAAATAAAAAAGGCCGCCCGAAGGCAGCCTAAAAGTATTTAGATTTTTTACAGCGATGATTGGTGGAACCGAACTGCAAGGAAGTAAAGTCCCAGCAAAGTTGTTGTTATGATATTCATTAAGATGAATATATAAACGCCAACAAATACCGTTTTAAGCCACAAGATTGCATCAGGAGAACAGAATGTGAGCAGCCACAAATGGAAAGGCTTCCCAATCAGAATAGAAATCATCCCTAAGCAAAATAACATAAAGCTCACAAGAGCTAGATAACCAAAAAGGTAACAAACAAAGCGCCTGCGCGTCAGTTCTACAGTAAGCTTCTGCCCTCGGAATTTCTCTACTAGAGTCGGAGGTACGCCCGCCATTACTTCGTCGATCGAAGAGCTAGAAAAAGTAGAAACCGCAGCCAGTGCTGCGATATAAAAACCAATCAAGACTTGAAGTAACCCATTAACCTGAAGCAGGAGTCCGTTAGTCTCGATTAAAGAAATTTTGCTAGCGTGAAAATAATAAACAATAGTGACGATTAGAGACACTGCAGCTGGTATTTTGTAATCATACCAGTCCTTTTCCTCATGCTTGATGCGGAGATAACTCAGCGGTGAAAAAAGTTTCATATGAAACTCCCGTTAGAGCAACCCTATCATTTTTGTTTCAAGCTGCAGATGTACTGTGCTTTCACATTGGTTGATGAGGTTACCTAATATTACCCTCTCACTTTTAGTGAACAGTTTTGTGGCAGCATCTTCGTTACGGTCAAGATCCAAACTGGCTTGCTTGCCATCTTTTGAGTAACTAATTGAAACCTTGGTATATCCAGACTGCTGCCCTTTCTTTCTTAAAATCTCTAACAACCTTTCTTTATCTTTCAATGGCGGCTGTCTAATGATTTTATACTTTACGGACCTTTCTGAGAGCTCAGTGTACGCCGTTTGGTCCAATCCACCTTTCCTTCTTGTACTCACAAGTTTAACGTTATGAATCTTTGCACCTTTTAATGCATCCATCAGCGTTTGTGAACCATGAGAATAGATTTCCAGCTTTGGGCGGTGCTGGCACATACCTTTAGTTGCAGGATTTTTAAACTCACATCCAGCGAAGGCTTCTCTGAGCATAGCATTTAAAAATGGCTCAAGAACTGACTTACTGATACCGGGGACAGATTCAACGAGAGTTTTGTGGTGATCGGCAGTGTTTTTGACAACATCTGTGGATATTACAATGTGACAAGAAACTGCGATACCTTCACCGGCAAGCTTAGGTTCTACTCTAAGGTTGCCTGTTGTTAACTCACCAAAAACAGGGTCAGAACCATTTTTATCACAAAGCTGGATAAGTAGAGTCGCCTGGCTATCCCCAATAGAATATTTCATCTCCGAAATCCTAAGCGCTCTAGACCTATGATTGTATAACTTTACAGCACTCCCTGAGCTCACCAGTACCTTCAATTTCTTGAGTATGTCTTCAATGGGAATACTCGGCGCCGCTGCGTGTGTAGGCGTAAAAGCAAAGTCAAAAAAGGAAACCCAACGTTCGTTATTAGAGAGCACAATGTCACCACTTAATTTTAGTTCTTGTCCATACAATAACTTAGCAAGACAAAAAATAAAGTGATCACAAATAAAAAACCCACTCGGAGGCGGGTTCTTGAGACTTATCAACGATAGACATACAAAGCCCATCGTTGGGAGAATCTTATCCATATTTTTTGAAATATGCAAGCATCATGTCGCTATCTTCGTTGAAAATCTTTCATCTTGTCACCTTTCTTAATTGCGCTTCTGCATATGCTTCTTCCTGCCAGCATTTTGTAACCAGTTTATCAATGACGTCTGCATATCCTTTGTACCACTGATAATCCGTCAGATCCGGTACCAGTTTCTGGACATGGTGCCGCGCCAGTGTGGTTGGTAAACGACTAAACCGGTTTCCATTGCAACGCCCACAAATCTTATAAACAGGCGCGCCATGAAGCCGGGTTCTTTTTTCATCCAAGACAATACCTTTACCCTTACACCCTCTGCACACCGTGCTGACTTCTCCCTTACCATGACAATGCTGACATAGTTCCTTCTCCCACTCCTCTTTGATGACGGGTTCACCATTTCTGGAGTGTTTCACCACTTCACGTAATACATGATGAAATCCCGTACCAGCACAATGCTCACAGCGAGCCTTGCTTGCAGCAGATCTGGAATAATCAGCAAATGCAAAATTCACGAGGTAAGGAACAATCTGTAACCGAGTTTCTTCACTCAATTTATTCAGTGTCGGGTTATCCAGTGCCATCGCGTAATTGAGCAGACCTTCAATCGCAAACTGAGGATCCTGAACACCAACTTTTGCCAGGAATAAGGCAAAACCCAGTGGTGCTTTCGACTGCACCATCCCCTGCGCCGCCATTACATCCGTAATCGTCAAAGATTCGGAGGCTGTCGCCGGAGCGTCATCGCTCAATTTTGGAGATTTTGGTGAGTAATATTTTGGTAAGGCTTCAAGGTTCATGCGTGTTCTCCATTTACGCCAGCACGCCAATTGCCAGTGCGCGATCGATAAAACGAAATATCAACTCCAGTTGAGAGCCGTATTTCTCTTCGAATGCCACGGTGTCCGCATGTAACTCATTGTGATGCGTTCTGCACAACGGCAGCACAAAGAGGTCATGCGCCTTTGTTCCCATCCCTCCCTGACCGTAGCCTATCAGGTGGTGCGGATCATCCGCCTGCTTCCCGCAGCAGGCGCACGGCTGGGATTTAACCCAGCGGGTATATCTCTCATTGACCCATCGACGGCGTTTCGGACGTAACATGAAGCTTTCCGGCGATTCCGGATCAACCCTGAGCGCCAGTACCTTTTTCGCCTTATCCTGTACAATGCTGGTGGCCAGCACCGAGGGAACAATTTCACTTTCACGGGTAGCTGACTGGACAATTGCCTTCGGCATCCTTAATGCTTTTCTCGCAGCGCTCTCCGGTAAGACTTCTGCCAGGTTATTGCGTACCATCCACCAGCACAGTTCCGGGAGCGTAACTGCGTGCATATCGTCAAAACCCAGATCACGACAAACAACCGATAAAATCCATTTTGTCGTGTTCTCCACAGCTATTGATTTCAGCCGTTCCGTAAACTGTTCGCGCAGCAGGTTATCGCAGTGCCAGCACAGTCGGATTGCCCCCGGGGCGTGGCGCATGGTTGTCATCTGTTCACTGTGCCAGTCTGAATGCGGCCACTGACAGCCATTCCCCCGGAGTAGCCAGCTTTCCAGACTATCCAGACCACCAGCACGATAGATAACCGACTCATTACAGAACACATCACGAACTGCAGGATCATCCGCCAGCGGCTGTGATACCGCCGGGACCGCGCCGCAGGCGAAAGATGAATATTGCTCCGGCTCTGGTTCAAGCAGAACACGCCCCTGCATAAACAGGGGCATCAGTTCCGATCCCGGCCTGAACAATACAACGCCCATACGAGGAGCAATTTCAGGGGTCAGTAACGCTCTCACGATCACCTCAATGAACGGTATCGAGCAGCTTCAGCAGCTCAGGGAATTTGGACTCGAAGAAATGCGGCTGCGTCTCGCGAGGGTTTGCCGGGCTGGTGATGTTTTTGCCGAACATGCAGCCTTTCGCCGTCAGCGACCAAAATTTTTTAATGCCGTTAATCGCGGAGCGACTGTAACGCTCACGATGTTCAACAACACCCAGCTTCGCTAACTGCTGATACGCCTGATTAGCCGTCATCCGGATACCATGCTGTTTTAACAGCGCGCTCAGTGCCAGCGTCGGGCGGCTTGAACCATCCGGCGCGCCAGCCGGAGCATCAATGGCATATTGTGGCGCCAGGTTAGGTAGTCCCACTGCCTCCTGGAGTTTCTGGCACGCGCCCAGTACCGATGAATTGGACAGGTTTAACTCTTTGCGCATAAAACCCAGCAGAATCACCCCCGCCTGCATCTTATCGGCAGCCATACCAGAAGATGTTTGTGGCGCACTGGTAATCCGATCGAACGTGCGGATCACCTTGAGATGGAAAGACGGGCTGATCCACATTGCATAAGCAAACACCAGTTCTTTGCATACGTATGTACCTTGTTCAGCACCACCGCGAACAGTATTTACTGGAGCACGTGCCAAACTTCGGGCATCACTACCGCCCTGAAAAAATCTAACTGATTGATTTTGTTCCGAGGATGGAATTCCGCCCTCGGTGAAAAGTTGCTCAATCAGCTCACGGGTTTGCTTATTATCAAGCCAGTATTTCGGACGGTATTTCTGCTCTCCACCCGCAGCACGGTGCAAATCGTTAAGACAATAGCGCCCATGAACGTCGCGGCGAACTTCGATACCATCAATGACCATTAAATTATTCATGCTTCTTTCTCCATGTTCAGGCAGCTGCACCCGCCCCTGTTTCAAATTTCGTGATCGTGATTTCTACCTTCCCCTTCGGGAAAACTGGTCCCCACTCCACCAGCATTCTTTTCACCTGGCTGTCGTCTTCCCACACACCCGCGTGGGTCAGGGCGTCAAACAGCGCCTTGTTGTAGTTGTCCAGATCCCTGATCCGCTTGTCTGGCGGATACAGGGTGATTTCCACCGCTGCCGGGGCTGTTGAAGGTTTTGGCAGACGGCGTAACTGCTCAATCACTGCCGCGCACGCCTCGCTCTGATACTTCCGGCCGCTGGCGCTGACCATGTGACGACCTTTAAGCGGTCCCTTATTCGGAGCCCGCCAGTAGGTGTTAACGCTGGGTGGAAATGGCAATGTCAGTTTCATGCCGCCCCCTACAGAATCGCCACGAGATCTTTCGCTTTCTCGCGGGTACTGCCCTTACTTGATATTGAGCGGCGGGCGCTGATATAGCGGATTTCAAAGCCGTGCTGCGAGTACAGGTCGATAACACGCGGAGATGTGGAATTGCCGATCACGACTCTGGCACCTCGCTGATGAGCAGCAACACAACATTCCGCCAGCGCGATATGATCATCCCAGGTAAAGCCACCAGCGGCGTAGTGAGTAAAACCATCCTTGCCGGGCATCGGTTCGTAGGGTGGATCGCAGTACACAACGTCACCCTCTCCCGCAAGTGCCAGCGTCCGGCGAAATCCTGCCGCCATGAATACGCAGTTGTGCGCCATTTCGGTAAATGCCCTGATTTCTTCTTCCGGGAAATAAGGCGACGGGTATTTGCCCCAGCCAACGTTAAACTGGTTGTTGCGGTTGTACCGGATCAGGCCATTGAAGCAGTGACGATTAAGGAAAAGGAAAGCGGCGGCGCGTTCCGGAGCGTCCATCACCTGAGCATTGAACTCTTCCCGTAGCGCCATATAGCTTTCAGCGTCATTGAGACGGTCAAACATTACCCTAGCATGTCTTATCACCGCACCAGGTACAACAGCCAGCATCTGATACAGATTGATAAGGTCGGTATTCACATCGGCCAGCAGGAAGCAGGCGTGTTTGTCTGAGTTGAGAAACACCGAACCACCGCCGACAAACGGTTCAATCAGGCGCATACCTGCCGGAATGACCTGGTAAAGATCGGGTAACATGGAATATTTACCGCCTGCCCATTTCAGGAACGGGCGACGCCAGGTGCGCGAAGCGGATTCATCAACAGATATCGTTACATCACTGCTTTCAATACAAACGGATTCACTGGTCATTCCACGTTCCCCACAAATCGCCCTGCCAGATAAACCTGCTCTACGGGTCTGATTCTCTTTTTGATGCGGTTCAGACACTTCTGACGGCGGCGTAAAAATTCCTCGCGTTCGTAGATAAACTGGCTTTCGCTAAATGCCCCCAGCCACACTATCGATGCACGATTAAATAACCCTCTGGATTCCAGTTCTTCAGCGCGGCTTACCAGCTTTGCAATCACCATCGGATCACCCGAACCTGTATGACCTGACCTTTTCTGTCGTAACGTCAAAATCACTGCCTCGTCTTTACCCGCGTAATAGCGAAATACATCTCCATCCAGGCGGCGGTTAACACGCTTACACCGATACAACTTGCTCGCTGAACGCTGCACATCAAAGCGTGCGTATTCAGGAAAAGCATCGGCAATCTCATTTGATGTCAGCCCCGGATTAAGCTCGATAAACGCCTGCACTTTTGCTAACAGACTCATCCTCTGAATCCCTCCGGAATCGTTGTATCAACCGGACCAAAAGCCATCACATCGCGCTTCTTCGCACCCCAGTCAGCGCGTTTAGGCCGCCCCTTCTGATCCCAGCGGGTAGCGCTTTGCAGATAGCTCTCGAATTTCTTCGGACCGAACAGCGTTTCCGGGCGCATGTACTGGTACTGCTCGTCGTTCTCGTGCCAGTGCTCATGCTTCAGGTCGATAACCAGTTGCAGATCAGCAACGCTGTACCCCTCACGCAGTCGGGCACGGATGTTTTCCAGGGAGGTTTTTGACTTCTGATACCGGGAGCCACTTACCTGGTTCAGGTGGGTTAAAACCTCAATCGCCTGATCGGTGAGCGTCACTTCGGGGTCGGGTTGCTCAGCAACCGGACAAAAAGGGGTTTTACTCTCCTGTGTAATCTCCTGAGTAGTCTCTGTGTAATCTCCTGTAGGAAAGATTGTGGGATCGCCACATGCTTGTTCGTTGGGTTGCCCCGTACTTGTTTGCGGGGTTTCCACATTCTTGTTTGCGGCATCACCGCAATCTGGTTTGCGGGGTTGCACCAATCCAGATTGCGGGGTTTCCACATTCTGGAAGTCCCGCAATCTGGTTTTCTCCTGAGCGCCCTTTTTCACTGGCTTCGCCGTTTCCAGCAACAGCGCTTCAAGCCGTGCGGTGTTGATGCGGTAGTGCATGGTGGCAGGTACACCGCGACGGGCTTCTTCCAGTACACCAAGTGCTACCAGACGTTTACGTGCTGTTTCCTGTTCGTCGCGGGTTAGCGCCGTTTCACTGGCAATATCAGCCTGTGTTTTGTACATCCAGCCGCCATCCATCCGGTTATGCCAGTAAACAAGCTGGGACAGGAATACTGCCGCAACCGGGCCAGCTTTTACCTTCCCGGCTTTCAGTTTTGCAAAAGCAGGGTTGTAGGCGATGGGGCGATCGAGTAATTGAATAAGGGAACTCATGCAGCACCTCCGAGATGCTTCATGTTTTTGCCGGAACGAAAGGCAATAAGCGGCATGTTGACGCGGTAATTACGCCCAAGAGGCTCACAGACAACCTTCTGACATTCGCGATCGACCAGGCTAATACGCAGAACGTACCCTTCTGGTGTGCTGTACCACTGTCCTGGACGAGGGCAATGAAAACGTTGGCTGGTGAACCGTTTAAAAATATTCCGGATCATTTGCGCCCCCTTACCTCTGAACGGTTCAGTGTCATATTGATAAGGCTCGCAAGCGCCGCAGCGTCATTGATGCGATCGTGCAGGCTTACAGCCAGCGGAGATTCCGCTTTTTCCAGCATGGGATAAAGCTGCTGTAACCAAACCTGATGAATGGATGAAATGTAGGAATAAAGAACGCTGGCATTATGTGCTGCATCGCTCAGCACCGATGGAGTTGAAAGCTGTTTCTCCATCTGGTTAAAGGCATCGATGTATGCCTCTTTAAACTGAGCAGCACGTTTACCAGTGAAACCCATTGCCAAAAACGCAAAACCATCGCGTGTGATTTGATAGCATGGGAGTTTGCGAGTACCGCCGTTGGGCTGATTTATTGAAATCGATGTAAACGCAAAATTGCGTTTACGAAATAGAGTGGAACATTCGAGAGATTCTATTTTACGGATAACATCAGCGTGACGCTTGATGAAGTAGTCGGCAACAGCCAGGGAGGAAGTAACGGCTTGGCCGTTAATAATTCTAATTTCAGGTTGAGCGAGGGTTGGGACAGTAGCCATAGTGGCAGCCTCTATGTTGAATTCAATGAACTCACCACCAAGGCTTTCCACGACCTTATAGGTGGTGAGACGTACAGGGGTGGAAATACCGGTCAACATAGAACCCGGCCCAGCCGAAACTGGCCCTGCACGCCCCACCATAATTTGGGCGTAACAATGCTCATGACAAGAAAAAACCGCATGAGCGCGGTTGTGCTCTATATTGAATTCCGGGTTTCCACGCCCGGCACCCGCTTTATGAGGTGCCTGAACAGTGTAACGTCCCGGAATTGCAGAATCAATGTGTTCCTGGCGCTTCACACTCAACAAAATCACGCCTGAATTTCCACAAAGGGCTAAAACACTCATGCGGATAGCCCTTGCGCAGATAGATAACGCGCTCAGTTTCTGGTTCCCAGCGAATGACATGGACATAAAGTCCCCTTCCATCCCGAAACCAGCGGTTAAGTTCCTGCACGATTCATCCCCCACGGTCAGGCTGTGTTCCCTGTGGTTACGCACGACCAGGCTATTTGATAATCTGCATTCATGACGCAACGGCCGGTACTCATACATCCCCGGTTGTTGCGACAAACGGTTATTTACCGTTAAACTGTTCATGCGTTGGTTTTCTCCATAAAATTTGACGCCACGGCGCCCGGAGCTGCACACTCGCGGGCGTCACCCTTTTCTGGCACGCAAAAAACTCTGTATACCAGTGTCGAATGCTGTTGCAGCTTTGCGATCGCCTGATACAACTCCTCATCAATCACGGCTTTTTCATGTGGCTCAATAACGCCATCTTCGATAGCCACCCTGATTTGCTGGGAATAACTGGTGATCTGCTCAATCGCTTCCAGCAGGCGCTGATTAATATCTGCGTTATCCACTTCTTCCATATCTGCCAGCGGAACAAAAACGCCACCTGATGCCCTGGCTACTGAATGTGCCAGGTGATAGGTTCCTCCGGCACGTTGCAGTACCAGCGCCCACCCAATCGGGAAGATCTGATCACCACCAGTACGCAGGCGGTTAAACAGAGCATCTTTGGTGACATCCAGCCATTCCGCAGCTTCTTCATAACCGCCATGCAGACTGGAAATCGTCTTTTTAATCGCAGCCACCAGCCAGCGGGGCTGCTTTTCAACTTTCCATTCAGGTTCATGTCCCACGGATCTACTCCTTCTGCTGTGGTGGCGGTCAAATCGCCGAATCACTAAGCTGATATCTGTTTGGATACAAAATTTGCATCTCGCTAATTTCTCCGGCGTAAAATTGAGCCAGGCGCTCAGCAAGCTCTGTTGAAGGAGCCTGCTCGCATCTTTCAACCCGGCTTAATGTTGCAGGATCAACCTGAACCCCTTTAGCGACGTGCTGTAACGTATAACCATGCGATTTCCGCAATTTTCTCAATGGTGATTGCATAAAACCTCCTTCTTTTGCGTATGTCGCATGTTATTTCATACAGCAAACTTGCGCAAGTTGATTTGCACAATGCGCAAAAAATTAATGTAATGAACGCATGAATATAGGAAACCGTGTCAGACAACTTCGCCGCGCGAAGAACATGAAAATTGCTGAGCTAGCAGAAGCGATCGGCGTGGATGCCGCAAACATCTCTCGTCTGGAGACTGGCAAGCAAAAGCAATTTACCGAACAAACACTTTCTAGGCTGGCTGACTGCTTAAGTGTTGATATAGCAGAACTCTTTACCTCAGACCCAAAAGGTAATACTGTATGTAAACACAGTGATATGAGGAAGGATTCAGCTAACGTGAAGGATTTGTTCCGTATCGAGATACTGGATGTCAGTGCAAGCGCCGGTAATGGACTCATTCAGGGCGGTGATGTTATCGATGTAATCCATGCTATCGAATATAACAAGGACAAAGCACTAGCTATGTTCGGCGGGCGTCCTGCCGCTGAGCTTAAAGTGATTAACGTGCGTGGTGACAGCATGGCACCAACAATTGAACCCGGAGATCTTATTTTTGTCGATATAAGCATCAACCAGTTCGATGGTGATGGCATCTATGTCTTTGGCTTTGATGATAAAATATACGTAAAAAGGCTACAGATGATCCCCGATAAATTACTGGTGATATCTGATAACACTAACTACAGGGAATGGAGTATTACCAAAGACAACGAGTGCAGGTTCGGTGTTTTTGGCAAGGTTCTGATAAGCCAGACGCAGTCACTCAAACGACACAATTAATAGAAAGCGTCGACAAGGCCACCATTATGGTGGCTTTTTTTTTGACTCAAAATTGCATATATCGCAATTTTATACTTGCGCAATGTGCAATTTAAATGTAATTTGCACTCATAGAGCAGCGAACAGGCAGGACGCCCACGAAGTAGCCGCCGGTGGCATACGAATGACCGGATGATTCGCAATGATGCATTACACAGGAGTTCAGATGAATAACTACTACACATGCTCCTTCTGCGGGGTCAGTGAGCTGGATGCAAAAAAGCTCATCGCTAAAGGAAGTAAGGACGAGCCAGCTATCTGTTCCGAGTGCGTTGTTTCATGCGTAAACATTCTTATCAACTACGCAGCCGTGATAAAGCCAGTAAAACTGAACGTCACGAAGGGGGAATGATGCTGGTTAATGCAAAGAAAAGCGCCATCACCCAACGGCGCTTTAAGGGAAATGAAGTAGTTCAGGTGTTCCTTATTAGTGCTTGCCTTCTTGTGGGCTGTGAACGTCGGAATTTCGCAAATCCTGAATATGCCTTTGCAAGTCTGAAGCCAGGCTCTCAGCCATCTCAGGAGTAAGTACTAAAAATTGCGTTTCCTGAGCAGACTCAATTGGTTGCATCGGTGATGAGAGAAACTGGAATTTCACTACCAGAGCGTCGTAACCAGGAAGCGGTCCAGCCTGCCAGCCGGTTACGGGAAAGACAGGAATATCATCCTTCTGAGACATGTTAAGCCCTTATGTTGTTGGGGAATTAAAAAGATTACCCGAATCCTTACTGTTGGGGAATAGCAAGGTCCACCGAGCCTGATGTGGTGAAAAGACAGGCACACAACGATGAGAGCATTGACGAGCAAGGCATAAGTGCTGGTTCGATTCCAGACAGTCCTGTTTAGTCAGGAGGGTTGGGCAGAGAAAAGGTCCGTTCAATTCGGACACCGGCAGTGCTCTCTTCGTTGTGGTAATCCGCGAAATGGCGCGGCGGTAAGTATGGCGGGTTTTTCTCCATTTGTACCCAGTAGGACACCGGGTTGTCAGGTTGACCATGCGCCTGAGTGACAGCCCCACCACAACGTTATTGCTGTGTGAAGTCTTGTCGGCGTCCGGCTCTTCCAACAACAGGAGGAAGGCGACAGTGTTCTGCCGTGACGCCGACCTTTTTACACAACAGAAAAGAGCATCTCCGCGCGACGGGCTCATTACCCAATCCACCCGGAAAGCTGTTACAGCAGGTGCTCTTTTCTGTTTTGTGGAGAAACCAACGTAAAACGCCAGTGCAGAGGCGTTAAGGAAACAAAATGCATAACCCATTTTTTAAAAATATGCTGATATATCGCTTCAGCCGCGACTTTAACATCGACATAGACTCTCTTGATAAGAAACTTGAGCTGTTTCGCTTCTCACCATGCGGAAGCCAGGATATGGCAAAAAGCGGATGGTTTTCACCATTAGTCCAGTATTCAGATGTGCTATATCATGCAGTCAATAACCAGTTACTTTTGGTTATTCGTCGTGAAGAAAAAATCATACCTAAACAAACGATCGCCGATGAGATTAATAAGAAGGTTTCCACGCTTGAGCGAGAGCAAGGCCGCCGTCTTAAGAAAACTGAGAAAGACTCTATTCGTGATGAGGTTCTTCATTCCCTGTTACCCAGGGCGTTTACTAAAAACAGTCTGGTTCGTATTTGGATAAACACTGCAGCCGGGTTTATCGTTGTTGATACATCCAGCATCAAGCGCGCAGAAGATTCTCTCGCCCTGCTTCGTAAAACCCTAGGTTCCTTGCCAGTCGTGCCGCTGACGATGGAAAACCCTATCGAGCTCACGCTAACTGAGTGGGTTCGTAGCGAAGCGGCTCCTTCCGGGTTCTCCATCGGCGATGAAGCGGTCCTTAAAGCTATTCTCGAAGATGGTGGCACAGGCCGGTTTAAAAAGCAGGATCTTGCCTGCGATGAAATTCTTACCCATATCGAAGCTGGGAAGGTAGTTACCCAGATTTCAATGGAATGGCAGCAGCGTATCAGTTTCACCCTTTCTTGTGACGGTATATTGAAACGCATAAAATTTGCAGATCAGTTAATCAGCCAGAACGACGATATCGACAGCGAAGATGTCGTGCAGCGATTTGATGCTGACATTACGCTGATGACAGGCGAACTCAGTAATCTTATTTCTGATTTAACGGCTGCCCTCGGCGGCGAAGCCAAGCGATAACTCATAGCGGCAATTAACCCCACATCGTGGGTTGGGTTGCTGTACTCAAAATTCACGCGGTGCAGCGCGAAATAAATTATAAGGAGAACCAACGATGAGTTTTATTCAAACACTTTCAGGTAAACAATTTGATTATCTCAGCGCAACTATTGACGACATTGATATTGAAGATATCGCCGTGGCGCTTTCCAATATTTGCCGCTTCTCCGGACATCTCCCTGAGTTTTATAGCGTGGCGCAGCATTCCGTACTGTGCAGCCAGCTTGTATCACCGGAGTTTGCCTTTGAAGCCCTGATGCACGACGCAGCCGAAGCGTATTGCCAGGATATCCCTGCCCCATTAAAAGCGTTACTGCCTGATTATCGCGAGATTGAGAAACGTACCGATCAACTGATCCGCTTTAAGTTTGGCTTGCCACTGGAAGAAGCCAGCGTAGTGAAGTATGCAGATCTGACCATGCTGGCAACTGAACGCCGCGATCTGGATATTGATGACAATATTCCCTGGGTAATACTGGAAGGTATCCCCCCGACAGATTTATTCGAAATCTACCCACTTCGCCCCGGTCAGGCTTTCGGCCTGTTTATGGCCCGCTTTAATGAACTGATGGAGCTACGCCAATGTGCTGCATAAAAGATAAAGAGTCTGTAGTGAAGGCAATCAGATCAAGACTTTTGTGGGAGCGCGTTGAAGGCGGTGCAGCATGAAACTGGAAATGTACACCCTGGACGGATCGGTGATTGTTGATAGCAATCTGGTAACACAGTTCTATCCAGACTACAAAAGCGGCGGCGAGTTAACAGTCATCGAGACTATTTCAGCCACCGGAGAAACTTTCACCGTGAGGGTTAAACACTCGTTTCTTCAGGTAACAAGTGCGCTGGCTACGGCGTGGAGCGTTGATGAAAAGAAAGCTGAAGGAGCAGCTCAATGAGCAACCGAATTCGCAATGCTCAGGTATTCGACGCGCGCACCGGTGAGTACCCGGTTGATATGTACATTCGCTGGATCATTGGTGGTGAACTTGATTTTGATGCCAATTATCAGCGCGGGTATGTCTGGGGGCATGAAGAGCAGCAGGCATTCTTAAACGCAGTTATTTCTGGTTTTCCTATCGGCTCAGTGGCGCTGGCAAAGGCACCTGACTGGTGTTCGCGTGAACTTCCTTACATAGAAGTTGTTGACGGCAAGCAGCGTCTCACAACCCTGAAAAAACTCATCACCAATGAGATTCCAATCATCTTGGCTGATGGTCCGCTTTACTGGCGAGATATGACTCGTGCGGAGCAATTGGCATTTGGGCGTCGTCCACTCCCAGCAGTTGTGCTGGATGAGGTGACGTACAAGGATCGCCTGGCTTATTTCATGGCGGTGAACTTTACAGGTGTTCCGCAAAGCGAAGAGCACAAGCGACACGTAATGCAACTAATGGAGGCTGCCCAATGAGCAACATCGACAAACTCAATGACCATGAACTGGTTGATCTGAAAAACGCTATCGAAAGAGAGCTTAAACGACGCGCTGATGGGCCAAAAGTCACCACGTATTATGTCGTCTCCTGCATAACTGATGCTCAGCATTTTACTGATTTGGACTGCGCTTTACGTTGCTTAAAAAGTGTCACCGAAAACCTTATGGAGTGGGTAACGGAATCCCCAGAAAACCGGGATTACGTCAATCAATGCACAGGCATTGTTGGGGCAAAACTCCAGGTGAAGGAGATGAATCTCGATCACTTCAACATGCGCGTAGCAGAAAAATATTTCGACGATATTTGTTATCCACAGGAGACAGCCCAATGAGCAACATCGATAAACAGGCGCTGCGTATTGAGAGGGGGTGAGATGTGTAACTTCCACGAACGCAAAGTTCGTCGAACTGAATATTACCAACGTTTCGTTTTTGGCTGGAAGCTGCGTCCCTGCACGGCGTGTAACGGCAGTGGTTATTACGACCACAACGGTAGTCCGAAATGCTCGTCCTGCAATGGCACAGGCAAAGAGCGGTACAAACCAAATTAGTAGAATTACGGCAGAAATTAAACGCCGCTGGCATCGGCAAGGGGAAGTGACTATGTGGAGAGGATTAAATCGCGGCGGCAGCCAGATGATTCTAACTGCCTACGAATACGATCCGGAAACCGAAAAATCTCAGTCTGTTTACCTTCTACGGCATCACAGCAAGGTTAAGAAGACCACGCTTGAACAGAAGCTGACAGTTAAGAACGACGCCTTCGGACGGTTTAAGCCTTTCGTTGAACTTGAAGATTTTCCGGAAGGGCTTAGCGAACGCGAAGCAATGCTGAAATTAGCTGACTGGCTGCACCGACTTAGTGTGGCTATCGAAGATAACTGGAGTACACCATGACCACTATTACCAAAGAATGGCTACAGCAAACTATCGCTGAATTTGAAAACACTCGCGACGATATTCCGTTTGGCCTGAGCGATGACGACGCCAAAGTTCTTATTGTGCTGAAGCGTGCGCTGGCATCGCTGGAAGCTGAGCCAGCCGGATATCACGTAATCAAAGAGTGCGGAAAGGTTGGCTGTAGTGTTGCAACGCTTGAGGAAGCCGAGAAAACGCGAGATTTCTGGAATAAAAAGTGGACTATCAGACCGTATTTTTACACCGCCCAGCCAGTACAGGAAACTGGCGTTTACAATGATGTGCTAAATATCATCAGCCTGTTGGAAAACAACGAATGGGCTGAGCACTGCACGAGTACAGTTTTAGGTTCACTTCTGGAATCAGAAATAACGCGTCTGGTTGGCAAAGAACAGTCAGCGCCGGTAGTCACTTTCTATCGCGATGGCGTTGAAGCCGCCGCCAAATGGATAGATCAGCAGCGTGAGGCATACGACAGCGAGCATGGATGGTCTGATCCTGATACCGGAGCGTTCGAGTTCGGCAATGATGCCCAACGCGGATATTCATCCACCCTGGAAGAATTGGCCGAAGGCATTCGCGCTCTGCATCCAAATGCTGGCAACTCTCCGGTAATTCCGGATGGTTGGATAAGCTGTAGTGAGCGGATGCCAGAAGACGAGCAAGAAGTAATTGTTCATAACAAGTTGGGATACCGTTATGTTTCATATTTTGATGAGCATTCTGGACTATTTTTTGACATGCGAGGCGGCAATCAGATGAACTGTATTGAGCACATCTTTGTTACACACTGGATGCCAGTGCCAGCAGCACCAAAACCGGAGATAAATAACGAATGAACAACTTAATGGTCGACCTTGAAACGATGGGTAAAAAACCTAACGCGCCTGTTGTCTCCATCGGTGCTGTGTTCTTCGATCCGCAAAGTGGTGAAATTGGACCTGAGTTCTATACCGCCGTTAGCCTTGAAAGCGCAATGGAACAAGGTGCCGTTCCTGATGGCGATACCATTCTATGGTGGTTAAGACAAAGCCCGGAAGCGCGAGCGGCTATTTGCGCTGATGCAGTATCTGTTACGACCGCGCTTATTGAGTTCAATGACTTTATCACCTGTCACGCCGACGATTTGAAATACCTGAAGGTATGGGGTAACGGTGCCAATTTCGATAACGTTATCCTGCGTGGCGCTTTCGAACGTGCCAGCCTCCCCTGCCTGTGGAATTACCGGAACGATCATGACGTCCGCACGATGGTTACTTTGGGTCGTGCAATCGGCTTCGATCCCAAACGTGACATGCCGTTCGAAGGCGATATGCACAACGCGCTGGCTGATGCCAGGCATCAGGCGAAATACGTTTCAGCTATCTGGCAGAAACTGATCCCGCCCACCAGCAACAATATCTGATTTAAACCGGGTGCAGCCGGTTAGATGGAGAAGCAACTCATGAGCGATCGCTTCCTGACTGAGGAGGAACTGGAAGATGCTACAGGAGCAAGCCAGAAGTCACTCCAGAAAGAAGTATTAACGCTGAACGGTATTTATTTTATAGAACGCCGGGACGGTTCAATCAGAACAACCTGGTATCATATAAATCACCCAGTTTCGCGCCTTCTTCCACCAGCAGGGTATCAGCCTGTACCAGGCATGAATTTTGACGCTATAGAGAGTTAACATGGGTCGCAAACGTGCGCCCGGTAATGAGTGGATGCCAAAGGGTGTATTCTTTCGCCCTTCTGGTTACTACTGGAAACCGGGAGGATCAACAGAAAATATAGCTCCAGCTGATGCAACTAAAGCTGAGGTCTGGGTGGCTTACGAAAAAAAAGTTGAGGGTAGAAAAAACAGAATTACATTCACACAATTATGGCGAAAATTTCTTGCCAGTGCCGATTATGCTGATCTGGCCCCAAGAACGCAGAAAGATTATCTGGCACATGAGAAATATATACTTGCCGTATTTGGTGATGCCGAAGCTAAAGCAATAAAGCCAGAACATATCCGGCGTTATATGGATGCCCGTGGGCAAAAAAGCCGTGTCCAGGCGAATCATGAACACAGCTCTATGTCGCGCGTATTTCGTTGGAGTTATCAACGTGGTTATGTTCCTGGTAATCCTTGCGTTGGTGTGGATAAGTTTCCTAAGCCTCAACGCGATCGATATATTACCGATGAAGAGTACAGAGCGATATATAATAACGCAACGCCAGCCGTCAGGGCTGCAATGGAAATAGCTTATTTATGTGCTGCCAGAGTTTCTGATGTATTGAAAATGAACTGGAATCAAATACTGGAGAAAGGAATTTTTATTCAGCAAGGAAAAACCGGAGTTAAACAAATTAAATCCTGGACAGATCGCTTACGTGATGCCGTTGAAATATGTCGTGAATGGGGAGAGGAAGGCCCTGTTATCAGGACTATGTATGGCGAGCGTTATTCTTATAAAGGATTTAACGAGGCGTGGAGAAAGGCGCGAAAGGCTGCGGGGGATGATCTGGGACGTCCTCTTGACTGCACTTTCCACGATCTAAAGGCAAAGGGGATTTCAGACTATGAGGGAACGGCAAAAGACAAGCAGAAGTACAGTGGCCACAAAACCGAATCCCAGGTTCTTGTTTACGATCGCAAGGTGAAAATGAGCCCAACCCTGGACAGGAAGCGTTGAGCTTTTCGATGTGCGCCAGTTAAAATTCTGGCGTTTTTTTCTCACCGAATTTTCTCATTTTTTCTCAACGTGATTTTCATCACTATAAGAAAATCACGTAAGTGCTTGAATAGTGGCGGAGAGAGAGGGATTCGAACCCTCGGCGGAGTTACCCCCGCAACGGTTTTCGAGACCGGTCCGTTCAGCCGCTCCGGCATCTCTCCGTATATTGCAATGATGCCAGGTAATTTGGCATTTTAACAGACCCTATTCGGGTAATTTTGTTCAAGTGACGAGTTTACGAGCAAAACGATGATTAAGTGGCCCTGGAAAGCACAAGAAATAACCCAGAACGAAGACTGGCCGTGGGATGATGCGCTAGCTATACCTCTTCTGGTAAACCTCACCGCGCAAGAACAGGCTCGGCTTATTGCGCTAGCCGAACGTTTTTTGCAGCAGAAAAGACTGGTAGCGCTACAGGGATTTGAGCTCGACTCGTTAAAAAGTGCACGTATTGCGTTAATTTTTTGCTTACCGATCCTGGAGCTCGGTATTGAGTGGCTTGATGGTTTTCATGAAGTGCTCATTTATCCCGCGCCCTTTGTAGTAGATGATGAATGGGAAGATGACATAGGTCTGGTGCACAGCCAGCGTGTCGTACAGTCGGGGCAAAGCTGGCAACAAGGGCCCATCATTCTGAACTGGCTGGATATCCAGGACTCGTTCGATGCTTCGGGTTTCAACCTCATTATTCATGAAGTCGCGCACAAACTGGATATGCGTAATGGCGATCGCGCCAGCGGCATCCCTTTCATCCCGTTGCGCGATGTGGCTGGCTGGGAACACGATCTCCACGCGGCAATGAATAATATTCAGGATGAAATCGACCTTGTTGGCGAAAGCGCTGCCAGTATAGATGCCTATGCCGCCACCGACCCTGCAGAATGTTTTGCCGTGTTGTCAGAGTATTTTTTCAGCGCGCCAGAACTGTTTGCTCCACGTTTCCCGGCACTATGGCAGCGTTTTTGCCAGTTCTATCGCCAGGATCCTTCTCAGCGCTTACGGGTAAGCGCTGACGAAGGCGACTACGGCGAGGAATCCGAACATTAATTCCTCACTTTGTGGGTTAATTAACCAATTGAATTGGCGCGTTAATTTTACTGTTGACACGTTATAGCGGGCCCAGTATTATGCGCCTCGTTGAAACAATTCCTCTGTAGTTCAGTCGGTAGAACGGCGGACTGTTAATCCGTATGTCACTGGTTCGAGTCCAGTCAGAGGAGCCAAATTTAGGGAAGCAGACGTTCAGTGACGTCTGCTTTCTGCATTTATATCAACTGGTTATCCCCTTCTTCAGGTTCACTCTCGTTTACTAAAAACCACTCGAAGCTATACCCTTTTGCTGGTAAAGCTGGTTCGATTTGCGTTTTACCAGCACGCGGGGGGAACCGTCATGTCACTTACTGATACTAAAGTAAAAAATGCCAGACCAGCGGAAAAGGCCGTCAAGCTCACTGACGGGTTTGGCCTCTATCGATTCAAAATACTGGCAGACAGGCTATCGCTTCAATGGCAAACAGAAGGTGTTTTCTATTGGGGTTTACCCTGCGGTTTCTCTTGCTGATGCCAGACAACGCCGTGACGAGGCCAAAAGGCTGCTGGCTCAGGGGATTGACCCGAACGCAAAAAACAGGCTGATGAAAAAATCCTTCAGGAAAAGCGCGATAAAACCCGCTCGTCCCGTGTCGTCGCCAAAAGCTGATGCGCCATAATTCTGCCTATGATATTGACGGAAAACTTTTCGCCTGCACCAGAAATTTATCTGCCATTTCCGCTACCGGCGTCAGACTGCCTGTATCAACCATTTTCACAAAATATTTCACGTCTAAAGTTCATTCTACCCCCTGCCCTTAATCTCTACGGCGTTATGTCTCAGAATTATTTGCCAAGTGCCTGCCAGTTTTTCACGTTTCATCAGACGCTGGTACATAGCCATTGCGGTAAGGTCACAGCATTTGACTTGTGCAATTACAGACAAAGTTGCGCCATGCCGGAGCAAAGTAGAAATTAGATCAAAACTTCAACGCTTTGTTGTTTTTGTCAGCAAACAAACGCGCAACCTTATTTCCCCCTTTGACAAGCCGATCGCACATCGTTACTATGCGCCCCGTTCACACGATTCCTCTGTAGTTCAGTCGGTAGAACGGCGGACTGTTAATCCGTATGTCACTGGTTCGAGTCCAGTCAGAGGAGCCAAATTAAGGAAAGCAGACGTTCACTGACGTCTGCTTTCTGCATTTATATCAACTGGTTATTCCCTTCTTCAGGTTCACTCTCGTTCACTAAAAACCACTCGAAGCCATACCCTTTTGCTGGTAAAAATGCTGGTAAAGCTGGTTCGATTTGTGTTTTACCAGCACGCGGAGGGAACCGTCATGTCACTTACTGATACCAAAGTAAAAAATACCAGACCATCGGAAAAGGCCGTCAAGCTCACTGACGGGTTTGGCCTCTATCTGCTGGTGCATCCTAACGGTTCAAAATACTGGCAGTTAGGCTATCGCTTTGATGGCAAACAGAAGGTGTTTTCCATTGGGGTTTACCCTGCGGTTTCACTTGCCGATGCCAGACAACGCCGGGACGAGGCCAAAAGGCTGCTGACTCAGGGGATTGACCCGAACGCTAAAAAACAGGCTGATGAAAAAGTCCTTCAGGAGAAGCGGGATAAAACCCGCTCGTTCCGTGTCGTCGCCAAAAGCTGGTTTGCCACCAAAACAAAATGGTCAGAAGATTACGCCGATACTGTCTGGAAGCGCCTTGAAACCTATGTCTTCCCGGATATAGGCGACAGAAACGTTTCAGAACTGGATACGGGTGATCTGCTTGTCCCGGTCAAAAAAGCGGAAACACTCGGCTACCTTGAAATTGCCATGCGGATTAAGCAATACATCACCGCGATCCTACGTCATGCCGTCCAGCAAAAGCTTATGCGTCATAATCCGGCCTATGATATGGAAGGCGCTGTCCAGAAACCAGAGACGGAACACCGCCCTGCACTGGAGCTGGAAGAGATCCCGCTACTGCTTGAACGTATTGATGCCTACAAAGGTCGTGGACTGACTACGCTAGCGATTAAACTCAATCTGCTGATCTTCATTCGTTCCAGCGAACTTCGTTTCGCCCGGTGGTCGGAAATCGACTTCAAAAGTAAGTTATGGGTGATCCCCGAACAGCGGGAAGCGATTGAAAACGTCAAGCACTCGACTCGTGGGGCTAAAATGAAACGTCAGCACTTCGTTCCCCTTTGCAGGCAGGCTCTCAAGATACTGAAAGAGATCCGCCAGCTTACCTATGAAGAAGGTAACGAAGCAGAATTAATTTTCACTGGCTGTTATGATTCATTCAAACCCATGAGTGAAAACACTATTAACAAGGCGCTACGTAAGATGGGCTATGACACCACACAGGACATCTGCGGTCATGGTTTCCGCACACTGGCGTGTAGTGCCTTAATTGAGTCTGGTCTATGGTCAGAAGATGCTGTAGAGCTTCAAATGAGCCATAAGGAAAGTAACAGCGTTCGCGCAGCCTATACCCATAAGGCCAAGCATCTTGACCAACGCCGCCTGATGCTCCAGTGGTGGGCTGATTTTCTTGATGAGAATCGGTATGAGATGGTCAGGCCGTTTGAGTTTGCTCAGAAACAATAATCACCTTCCTACAAGCAAACAATCGCATTAAATCAAAAAATACAGGGAGGTTTTACTCTCCCTGTTACTTATCACTTACGCCCTTTGTTTGATTGTGCCAATACCGAAGCAGCAAGCTGCTTCGTTAATTCGCTAGAGCGTGGGTTATCAAGGGCCGATGATGCCTTATGTTCCATTTTGCCACTGGTCTGGTTTGACGTCCCACGTTGAGAAAGCGCGGAACCCGCAAGGCTTTTCTGGATGGCTGAGGCATTAGGATCAGTCAGCGTTTTTGCAGCAGTAGAAGCCACATTTTCAGAAGTCTGTTTTACATTTTTACTCATTAAACGCATCCCCAAAAGTCATGAACATCACTTATGTAGTGTCTTCATCAAAAAAATCACTATATGCAGTTATAACCATATTCTTTATAAGCTTTAATTCCACTAAAGACAACATCATTGCTGTATAAAATCACAGTATACGTGTAGTTAATTGATTTTATTTATAAAAATATTTTGTCTAACAGGAACAAGATACGCTAAAAGCTCATTGTCATCCTGAAATTGTTGCCTTCTGAAAATCAGTGGGGAGGCCGTGCTCATAATACGCGAGGAATATCATTGAACATGCGTTAGCTCACCAGTTAGAAGATAAGGCCAAAGCAGAGTATCAACATGGGACGCTATGGGCTAAGCATGTGGCATTGATGGATGATTGGACGGGGTATCGCACAAGTGGTATCCAATCTGTTTATCAATACAGTATTATATATTTACAACATGATTATCATGCTTAGGTAATTATTTTAAATAACGTGTCTTAATCGGCATTTATTGCATTATAGTCGTTATTTTCCAGTTCATCCATTGAAAATGTTTTTCTTTCGGAGTCTCGGAGACGTTTTAAACTATTTTCAAGAACACCGACTTTTTCTAGATATTTCTCTATGTCTTTAAATTTTGATTCCTTTATTATCTCATATATAGGGGATAATATCATGTTTTTCCTTACAACATCATCTTCTTTCATAGACCGATTATAAGCATTATTTACTCTCAATACAGTTTCCCTAAGATCATCTACAAAAATGAATATATTATTGAGATCATCATCTAATTTTTTTCGCTCTTCTAATAGTTCATTATTTCGCGCTTCTTGGTTTTTGACAATTATATCAAGCCGTTCAAGTTTTTCTTTTAACTCGCTCGTATTGTATTGAGATTCAGATTTTATTCTTTCAAGTTCTGCTTTATGTTTTGCTTCACCTTCAATCATATCTATTTCATGTTGACGTTCAATCACTCTAATAGAGTTCATACTTTTTACCCTACGAGAGTCCCTTTTGGAGGAAAGAGTACAGATAATATCTGTCCGACTTAAGATGGAAAAATTCTATCGCTCTTGATAAAAAAGGCATTATTATGGTTATGATTATACCGTAAACAATAGGTTGCCAAAAGTTAATGCCCTGCATTTGAACTATCGACGTCCTATACTCAATGTTCCCTTTTCCAAACATAATAAATAATATCTTATCCCAATTATCAATACAGAAGGCTGAAATAACATATACATATAATGGTGTCGTGAAACGCTCGATCAAGGAACTTTTTATGTTATTGAAAAATGATTCCTCTGATGTTTTCTCATTGCTCATATCAATCTTCTTATAGAAAAATTAGTTTAAAAAGGGACGAACGAAATACTCGTCAACTCTAAATTTATTATCTGTGAATGTATGAATGGCATCAATACCTTCATGACATTTCCCTATTAATTCTCTAAGTATATTTTGCCTTTCAGTTTCTGTGCTTTGACTTACTTCATCCACTACATTAATCAATGGAAAACTTAATTTATTCCTATAGATATTTTTAATTATTTCAAAAGACTCTTTAAAATCATCTCTGATCTTCCATCTATGGCGTTCAATAATATTGGAACTGATAGTTACAATTTCAATTTGTGTATTAGACAGTCTATTGGTAATTTTCTCCAACCGAGTTTCCAATATATCATTTGGAATTCTATTGGATGTCATTGATAACAAAACGATAACCAATTTATGCAAATCAATTCTATATTGAGGATACAGTTCAAATATCAATTTTTTTGCATCATTATATGCTTCCTCATCATTTTTACTTTTGAAATAGCTCTTAGCCTTAAAAGCGCCATAAATCGCAGAACCGGCGATCATCACATTAGATATAGTATTAATCCAATCTGTAGTATTACTCATTTCCCACCATATCGTATACATAAAACAATAATCGATTATACACATTTCTATACCTAATAATAAATATCAAATGACATTCATTTTAATGATATTAATGCAAATTTCATAATGAAGGGGAATGTATCCATTAAGAATATTGGAGCAAAAATGACCAACTAAACGCGTAAATTTATTGTCCACGCGCCAAGCAACAGTGGTCATTCCGGGCACGCCAAAAGACTAAGCACCTTTACCAGCGGAGCTAAATACTTCAATGATGGAGATCATGTTCAGGTGTTGAAGTATAGAGTTAGAACGGCCTGATAAGAAAGTCCTCTTTTATCGATATATTATTAGGTAAAATCAATCGCCTACAAGTACATAGATAGTGTAGTATACACCATATGTAAATATCTCAGGGAGTTAATCTGATGTCTATAAGCACCACAATGTCAAATATCAACAGAATACAAAAAGACATTGCTAGTCTACAAAAACAACTTTCTGATGAGCAGCGTAAAGAGGCCCAACTTTCAGGAAAAATCAATCAAATAAAGCGTAGCGTCACTAAGTCAACGTCTTTAAGCACATTGAATTCCAAAATGTCAGAGATCTCTCGTCATAAAAATGATATTTCAAGATGTAACTCTAAAAAAGCAGATATTAATAAAAAAATAACAGCAAAAACTGGAGACTTACACCGTTATCAATTACAACTCATTAAAGAGCAAGAGAATGACCAGAAAAAAAGAATTGCTGCACAGAAGAAACTTGAAAAAGAACAATTAGATTACCAGAAGAAAATCACACGAGAATTGAAATCGCAGAAAAGGGCAATATCTCCAAGCCATAAATTCAATCGTCCAGTCATCAACAAATCTGATGAGACAGAAGATATTACAGAGAGTTATGATGTTTTTATCTCTCATGCAACAGAAGATAAAGATAGTTTTGTCCGTCCCCTTGCTGAGTTGTTAAGAGCAAAAGGGATAAATGTATGGTATGACGAATTCTCTTTAGGCTGGGGTAAAAGTCTACGTAAGACAATCGATTATGGATTAGCAAATTCTCGGTTTGGAGTTGTTGTTTTATCTAAATCCTTTATCAAAAAGGACTGGACAGAGTATGAATTAAATGGTTTGACTGCTAGAGAGATGAGCGGTGAAAACCAAGTAATACTGCCTATCTGGCACGAAGTATCTAAATCTGATATTTTAAAATTCAGCCCTACACTAGTAGATAAAATGGCATTAAATACATCGATAAATACCATTGATGAAATTGCTGAACAGCTTGAATCATTATTGAAATGAAATAAATATTTAAGCGAAAGGGAACTCTCCATTGGAGAGAATACATATCACTTGGGAAAACTGAATCAACTCATTGAGCTGTAAACTACTCCCTATATATTAGACAATCAGATATATTATAAAAGTGAATATCTTTAAAAGACACTTATTAAATAATATGTTATTTATTAAAAGTTAAATTTTATTATAATGTTTAAACACAAACATTAATTCCTCAACAGTTAACATTATTGGCATGCATATAATCTTCTGTAATTACTATAAAAGTGATCATTCTTTCCACCACCACAACGTTGATCATCATATTGAGCAATTAACGTTCCTGATAAATAGCTGTATTTTGAGCTATATGCCGATCCCGGACAAGAACATCCCCTTGTATCGCTGTTTGGTATTACACACTTATAGCCCGATACTTGGTCTGTGATCGTGATATAAATACCTTTTAATCTATCACCACTTACTGTTTGCCACTTACCACTGACGCAGGATAAAACCTTACCCATACTGTCTCGGCCCACCAGACCATTAGGCGTACATTTCGCATTTAAAGTCGCCTGATCATTAAAATAGTCATTGTGCCTGAACGATATAATTCATGGCAACATTCTTAGGTCTTGTTTCATTACCTGCCCGAACGACTCTCGAAGCATCAAACGTATATTTCCTTACTTCATTGCTTTTATGACCTTCATCACCTTCACCTAACGCACCACTATCCGCGAAAGCGCCACTCACATATTTACCGATGTTTCCACTCTGTGATACATCAGCCGCCATATGACCTGTAATATTTTGAATCTGATCATCCTGATAAGAATTAATAACTCTTCCTGAGTCAAGTCCTCTTCCCGAATCCAGCCCTCGAATAAATACTCCTCTTAAATCAGGAAGTCTTCCTCTTGGATAGGCTTTTGCTAACTGAGGATATAGGGCAGTATTAAAAGATTGTCCCCTGCATATTAACCAACCTACAGGCGCTGTATTTGAAGGCCAAGGAACAGGTGAACCAACTGGCAGGTTACTGACAACAGACCATTTTCCATTAACGCAGGATAATATATTTCCCATGTTGTCTTTAGATATCAGGCCTATACTTGGACATGTCCCACCAGTTGTATTCGTCGATGTAGGTTGTAGAGTATGTCCCTGAATATTTCCAGATGCTGCAATACTACCAGTGCTTCCTGTCCCCGGAGTAATAGCAAAATCACCTTGGTTGGTTTGTTTCCCAAAAACCTGAAGCCTTGTTTCAGTACTGGTGCCACCATTTCGCCAGATGGTTAGCGGCTTAACAGTATCTAATCTGAACTCATAATCGTTAGCATCTCCACCACCAATACTGAAACTATCACCACCACCGTTATGACCATTAAATTGTCCACCGGATGTGATACTTTTACCTGTATCTAATGTTCCACTAATCGTTCCATTACCAATCGTAGACAGCCCTCCACTAATTTTCAGCGTAGTGAGATCTGTTCGATTGCTTGCCATATGAATAGATAAGGTCTTATCCTTAGACATGGTTATTTCATAGTCGTTATTAACAGCATCGCCGCCAAAATGTATGGCATCACCATAACCATTATGCGCTATTATTTCTTGTGCTGCGATTATTTTCCCTGCGGAAGAAACATCCCCACCAAAGATCCCCGTTCCCGATGCGGTAATGTTCTTGGCATTATTAATATCGTTAGTGCCCATATTGAGATTTCCAGTCATAGGCAATGTTCCATCGCGACGCAAGTAAACAGAATACATAGAACTATCATAGCCCACACGGTAAACCAGTAATCCTGCTTTATTAATTGCCGGGTAATCAGTAGACTTTTCTGTCCACACTCCATTATATCCTGATGCCTGAGCAGGCGATTTCGTCATACCGCTATCAATACCTGCTGATTGCATCGATTTTCCTAACAGGTCGTATCGCGTTTTTCCACCTTCGACCCAAGGTATCGTCGTAGCAATCAAACCATTGATAACATAGTTTGGTGATACGCCTGATCGTTTTAACAGGATTTTATAAGAAGACTTTTGGGCATTAACTCCCGCATATGTAGAGGGCAAGAGACTTTCATTCACCAGCGTCTGGTAAGTGATTTCCCAGCCATTAGCCGTACAGGTTCTTGGTCCCGGATCGCTACTTTGACTGCGTGAAGATGACAGTGTGGAAAGCTTATTATAGCGAATGCTAATATAACGGTTAACTGCTTCGCCAACCTGTTTCATTTGAAAGCCTACCGCCTGTGCCATTACTGCTTCCTGTTCTTTCCTCATGTCCTGAAACTTCATAAAACCAATAAGAGAACCTATACCTAAAACGATGGTGATCTCTAATAGAGTAAATCCTCTTTTCTTTATCATAATTATATCTCCCATCTGAAAATATGATCCAAAAAGGGGCAT
Protein sequences of DBSCAN-SWA_5 >NZ_CP019416|2015480:2068904|2053317_2054235_+|WP_000551790.1|DBSCAN-SWA MHNPFFKNMLIYRFSRDFNIDIDSLDKKLELFRFSPCGSQDMAKSGWFSPLVQYSDVLYHAVNNQLLLVIRREEKIIPKQTIADEINKKVSTLEREQGRRLKKTEKDSIRDEVLHSLLPRAFTKNSLVRIWINTAAGFIVVDTSSIKRAEDSLALLRKTLGSLPVVPLTMENPIELTLTEWVRSEAAPSGFSIGDEAVLKAILEDGGTGRFKKQDLACDEILTHIEAGKVVTQISMEWQQRISFTLSCDGILKRIKFADQLISQNDDIDSEDVVQRFDADITLMTGELSNLISDLTAALGGEAKR >NZ_CP019416|2015480:2068904|2040090_2040630_-|WP_000127618.1|DBSCAN-SWA MTAVFAFVKARWKTIIVLLMLAGAFLAGIIWSDRGWQKKWADRNSMESSQEANAQTAARWIEQGRIIARDEAVKDAQAQAAKSAATAAGLSATVSQLRTEATKLAARLDAAKHTSDLAAAVRSKTAGADAAVLADMLGRLAEEARYYAERSDESYRAGMTCERIYNSVRESTNNPIAPH >NZ_CP019416|2015480:2068904|2047613_2048087_-|WP_076731005.1|DBSCAN-SWA MSLLAKVQAFIELNPGLTSNEIADAFPEYARFDVQRSASKLYRCKRVNRRLDGDVFRYYAGKDEAVILTLRQKRSGHTGSGDPMVIAKLVSRAEELESRGLFNRASIVWLGAFSESQFIYEREEFLRRRQKCLNRIKKRIRPVEQVYLAGRFVGNVE >NZ_CP019416|2015480:2068904|2056599_2057463_+|WP_000208076.1|DBSCAN-SWA MTTITKEWLQQTIAEFENTRDDIPFGLSDDDAKVLIVLKRALASLEAEPAGYHVIKECGKVGCSVATLEEAEKTRDFWNKKWTIRPYFYTAQPVQETGVYNDVLNIISLLENNEWAEHCTSTVLGSLLESEITRLVGKEQSAPVVTFYRDGVEAAAKWIDQQREAYDSEHGWSDPDTGAFEFGNDAQRGYSSTLEELAEGIRALHPNAGNSPVIPDGWISCSERMPEDEQEVIVHNKLGYRYVSYFDEHSGLFFDMRGGNQMNCIEHIFVTHWMPVPAAPKPEINNE >NZ_CP019416|2015480:2068904|2065610_2066492_+|WP_000028416.1|DBSCAN-SWA MSISTTMSNINRIQKDIASLQKQLSDEQRKEAQLSGKINQIKRSVTKSTSLSTLNSKMSEISRHKNDISRCNSKKADINKKITAKTGDLHRYQLQLIKEQENDQKKRIAAQKKLEKEQLDYQKKITRELKSQKRAISPSHKFNRPVINKSDETEDITESYDVFISHATEDKDSFVRPLAELLRAKGINVWYDEFSLGWGKSLRKTIDYGLANSRFGVVVLSKSFIKKDWTEYELNGLTAREMSGENQVILPIWHEVSKSDILKFSPTLVDKMALNTSINTIDEIAEQLESLLK >NZ_CP019416|2015480:2068904|2015480_2016503_-|WP_001028172.1|transposase|DBSCAN-SWA MNIVFLGIDLAKNVFQLCGLNQAGKPVYTKRTGRKELLQTLANIPACLIGIEASTGAFYWQREFEKLGHKVKVISPQYVRPFVRGQKNDGNDAQAIAVALMQPTMQFVPPKSPEQQDIQALHRARQRIVNHRTATVCQIRGLLLDRGIPIGSAVSRARRAIPLILEDAENGLSSRMRRTIAELYDLFNDLGRRIHFFDKEIETVFRQSEACPRIAKVKGIGPKTATAVVAAIGKGTEFKNGRHFAAWLGLVPRQHSSGDRQVLMNMTKKGDKHLRTLFIHGARAVVRVATNNNDGHMNQWVNQLKERRGFNKTTVAVANKNARIIWSMLRNDTGYQVVCN >NZ_CP019416|2015480:2068904|2041511_2041901_-|WP_001294874.1|holin|DBSCAN-SWA MSEPVSSATVLAGGLMGASVFGLATGTDYGVVFGAFAGAVFYVATATNIGRIRLVAYFITSFIVGVLGAGLIGTKLAAITHYEKPLDALGAVIISAMCIKFLTFLNSQDLNTLFSILSRIRGGGSDGSK >NZ_CP019416|2015480:2068904|2032357_2032918_-|WP_000779215.1|DBSCAN-SWA MKLTPIIAALRSRCPRFENRVGGAAQFKAIPEAGKLRLPAAYVVPAEDVTGEQKSQTDYWQDLTEGFSVIVVLSNERDEKGQWASYDAVHDVRQEIWKALLGWEPDPQAHEIQYAGGMLLDLNRHELYYQFDFTVKYEITETDTRQQDDLDGLPDLKTLSIDVDFIEPGTGPDGDIEHHTEITFQE >NZ_CP019416|2015480:2068904|2052496_2052748_-|WP_000078504.1|DBSCAN-SWA MSQKDDIPVFPVTGWQAGPLPGYDALVVKFQFLSSPMQPIESAQETQFLVLTPEMAESLASDLQRHIQDLRNSDVHSPQEGKH >NZ_CP019416|2015480:2068904|2056309_2056603_+|WP_000267991.1|DBSCAN-SWA MWRGLNRGGSQMILTAYEYDPETEKSQSVYLLRHHSKVKKTTLEQKLTVKNDAFGRFKPFVELEDFPEGLSEREAMLKLADWLHRLSVAIEDNWSTP >NZ_CP019416|2015480:2068904|2042574_2043648_-|WP_000357930.1|DBSCAN-SWA MDKILPTMGFVCLSLISLKNPPPSGFFICDHFIFCLAKLLYGQELKLSGDIVLSNNERWVSFFDFAFTPTHAAAPSIPIEDILKKLKVLVSSGSAVKLYNHRSRALRISEMKYSIGDSQATLLIQLCDKNGSDPVFGELTTGNLRVEPKLAGEGIAVSCHIVISTDVVKNTADHHKTLVESVPGISKSVLEPFLNAMLREAFAGCEFKNPATKGMCQHRPKLEIYSHGSQTLMDALKGAKIHNVKLVSTRRKGGLDQTAYTELSERSVKYKIIRQPPLKDKERLLEILRKKGQQSGYTKVSISYSKDGKQASLDLDRNEDAATKLFTKSERVILGNLINQCESTVHLQLETKMIGLL >NZ_CP019416|2015480:2068904|2064636_2065200_-|WP_000072670.1|DBSCAN-SWA MSNTTDWINTISNVMIAGSAIYGAFKAKSYFKSKNDEEAYNDAKKLIFELYPQYRIDLHKLVIVLLSMTSNRIPNDILETRLEKITNRLSNTQIEIVTISSNIIERHRWKIRDDFKESFEIIKNIYRNKLSFPLINVVDEVSQSTETERQNILRELIGKCHEGIDAIHTFTDNKFRVDEYFVRPFLN >NZ_CP019416|2015480:2068904|2058068_2058296_+|WP_001527041.1|DBSCAN-SWA MSDRFLTEEELEDATGASQKSLQKEVLTLNGIYFIERRDGSIRTTWYHINHPVSRLLPPAGYQPVPGMNFDAIES >NZ_CP019416|2015480:2068904|2035441_2036044_-|WP_000003793.1|head,protease|DBSCAN-SWA MSEREIRCYSGEVRAETHDSEPSRIIGYGSVFDSRSELIFGSFREIIRPGAFDEVLNDDVRALFNHDPNFILGRRSAGTLALTVDERGLRYDITAPETQTIRDLVLAPMQRGDINQSSFAFRVARDGEEWYQDEDGVVIREITRFSRLLDVSPVTYPAYQEADSAVRSMKAWQEARDSSALQKAINQRMARERVLTLLNA >NZ_CP019416|2015480:2068904|2063047_2063299_-|WP_000042271.1|DBSCAN-SWA MSKNVKQTSENVASTAAKTLTDPNASAIQKSLAGSALSQRGTSNQTSGKMEHKASSALDNPRSSELTKQLAASVLAQSNKGRK >NZ_CP019416|2015480:2068904|2036036_2037131_-|WP_077905357.1|portal|DBSCAN-SWA MKLAAVYACIYVISSNVAQMPLHVMRRTGKKVETARDHPAFYLVHDEPNSWQTSYKWRELKQRHILGWGNGYTRVLRHRRTGEVTGLEACMPWETTLLNTGGRYTYGVYNEEGSFAINPDDMIHVRALGNDQKMGLSPVLQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKGELNDGSWKRLKEMWQKATAMLRSQENRTMLLPAELDYKALTVSPVDAQLIDMMKLNRSMIAGIFNVPAHMINDLEKATFSNISEQAIQFVRYTMMPWVTNWEQELNRRLFTRAEREAGYYVRFNLAGLLRGTAKERAEFYHFAITDGWMSRNEARAFEDMNPKDGLDEMLVSVNASRPAKSTTQENTQDE >NZ_CP019416|2015480:2068904|2067065_2068904_-|WP_076731007.1|tail|DBSCAN-SWA MPLFGSYFQMGDIIMIKKRGFTLLEITIVLGIGSLIGFMKFQDMRKEQEAVMAQAVGFQMKQVGEAVNRYISIRYNKLSTLSSSRSQSSDPGPRTCTANGWEITYQTLVNESLLPSTYAGVNAQKSSYKILLKRSGVSPNYVINGLIATTIPWVEGGKTRYDLLGKSMQSAGIDSGMTKSPAQASGYNGVWTEKSTDYPAINKAGLLVYRVGYDSSMYSVYLRRDGTLPMTGNLNMGTNDINNAKNITASGTGIFGGDVSSAGKIIAAQEIIAHNGYGDAIHFGGDAVNNDYEITMSKDKTLSIHMASNRTDLTTLKISGGLSTIGNGTISGTLDTGKSITSGGQFNGHNGGGDSFSIGGGDANDYEFRLDTVKPLTIWRNGGTSTETRLQVFGKQTNQGDFAITPGTGSTGSIAASGNIQGHTLQPTSTNTTGGTCPSIGLISKDNMGNILSCVNGKWSVVSNLPVGSPVPWPSNTAPVGWLICRGQSFNTALYPQLAKAYPRGRLPDLRGVFIRGLDSGRGLDSGRVINSYQDDQIQNITGHMAADVSQSGNIGKYVSGAFADSGALGEGDEGHKSNEVRKYTFDASRVVRAGNETRPKNVAMNYIVQAQ >NZ_CP019416|2015480:2068904|2028011_2029940_-|WP_076731001.1|tail|DBSCAN-SWA MADSFQLKAIITAVDQLSGPLKGMQRELKGFQKEMAGLAIGAAAAGTAVLGALALPVNAAIGFESKMADIRKVVDGLDDKKAFAQMSDDILTLSTQLPMAAEGIAEIVAAGGQAGIARGDLMQFANDAVKMGVAFDTTAEESGQMMAQWRTAFRLTQEDVVVLADKINYLGNTGPANAKKISDIVTRIGPLGGVAGVASGEIAAMGATIAGMGVESEIASTGIKNFMLSLTAGNSATKAQKQAMAFLKLNPRKLAEDMQKDSRGAMLKVLDSLAKVPKAKQAAVMNALFGKESLSAIAPLLTNLDLLRTNFDRVADAQEYGGSMQKEYASRASTTENQLVLLKNSVNAISVTLGDTFLPAINEAAEAVMPYLEQLRTFVRANPELVQSAAKFGAALLAVGVSIGSLSRAVKILNSVINLSPAKVAIAALVAGAMLIIENWDDVAPVIKAVWQGVDNVAQEMGGWETVIEGVGLVMAGSFTVRTIGALQQSVLLAGRLSGLLGKIGRMGAMTLTIGVAVSLFKELKDLEQGAKDAGMDAGAFAVQKLQTKERERGYNGFIPRLKELLGMDTPIPQGRYQPYVPLTRRSGVLGRAVPPSTQRSELKVTFENAPQGMRVTDIPKSGNPLMNISHDVGYSPFRTSR >NZ_CP019416|2015480:2068904|2021420_2022983_-|WP_076731000.1|tail|DBSCAN-SWA MHRIDTPTAQKDKFGQGKNGFTNGDPATGRRATDLNSDMWDAVQEEVCTVIEAAGIPLSKGEHTQLHAAIGRLIDEQVKTRLEKNQNGADIPNKPLFLQNVGLGETINLAAGALQKSQNGGDIPDKKQFARTIGAVTSTTITLGESGWFKIATVVMPQATSTAVIKLYGGAGFNAGSPEQAAISELVLRAGNGSPVGITATLWRRSPSAANEVAWVNTSGDTYDIYINIGQYAYWLIAQYDYTGNANVTLHSTPEYSSVQPGNSTSGQTYTLFNSLMKPPAGDVEALSVNGGRLNGPLGIGTDNALGGNSIVFGDNDTGFKWHSDGVLGIYANNALVGYIDNSGLHMSVDVLTNGAVRAGNAKKLSLTSNNNSTMTATFNLWGDANRPTVIELDDDQGWHLYSQRNPDGSIVFTVNGDITANTLRASGAIYQNNGDIFGSLWGNGWLSTWINNNLVLDVQLGAGTSVTTWNNAGSWPNTPGYVVTSVWKDYQGENIDGINYAPLQKRVGSQWYTVQGGTV >NZ_CP019416|2015480:2068904|2032914_2033427_-|WP_001135695.1|DBSCAN-SWA MPQKAYLHVDFEQPETLVFNRARMRRAFVSIGQVHMRDARRLVMKRGRSGPGDNPSYRTGKLARSIGYYVPRASSRRPGLMVKIAPNQKNGEGNRPISGAFYPAFLFYGVRRGAKRKKGHHRGASGGSGWRVAPRNNYMTEVLDKRRSWTRYVLSRELRKSLRPQRRKKK >NZ_CP019416|2015480:2068904|2051012_2051237_-|WP_001191666.1|DBSCAN-SWA MQSPLRKLRKSHGYTLQHVAKGVQVDPATLSRVERCEQAPSTELAERLAQFYAGEISEMQILYPNRYQLSDSAI >NZ_CP019416|2015480:2068904|2033799_2034123_-|WP_000927251.1|head,tail|DBSCAN-SWA MLLSPEEIKLQLRLDEDYADEDKFLELLGRAVQARTENFLNRRLYTAEAGVPADDPEGLILSDDIRMGMLLLVTHFYENRSTVTEVEKVELPMSFNWLVGPYRYIPL >NZ_CP019416|2015480:2068904|2030347_2030704_-|WP_000515952.1|tail|DBSCAN-SWA MGKIAGTTYFKIDGQQLSVTGGIEVPMNTKVRDDVIGLDGSVDYKETSRAPYTKVTAKVPKNFPVDKITSSDVMTITSELANGQVYVLSNAWLHGEANHNPEEGTVDLEFHGEEGFYQ >NZ_CP019416|2015480:2068904|2024631_2025045_-|WP_000605050.1|DBSCAN-SWA MILYVNGIRKDATASLDFLTRAVVISLFTWRRAERDDRTPQPYGWWGDTWPAVQNDRIGSRLYLLKRRKLTNKTPQDAREYMQQALAWMTDDGVAARIDVTSERTGTDTLAAGVTIYQRDGVIHNITFDDIWSKLNG >NZ_CP019416|2015480:2068904|2054329_2054869_+|WP_057517787.1|DBSCAN-SWA MSFIQTLSGKQFDYLSATIDDIDIEDIAVALSNICRFSGHLPEFYSVAQHSVLCSQLVSPEFAFEALMHDAAEAYCQDIPAPLKALLPDYREIEKRTDQLIRFKFGLPLEEASVVKYADLTMLATERRDLDIDDNIPWVILEGIPPTDLFEIYPLRPGQAFGLFMARFNELMELRQCAA >NZ_CP019416|2015480:2068904|2049054_2049279_-|WP_000620702.1|DBSCAN-SWA MIRNIFKRFTSQRFHCPRPGQWYSTPEGYVLRISLVDRECQKVVCEPLGRNYRVNMPLIAFRSGKNMKHLGGAA >NZ_CP019416|2015480:2068904|2059578_2060376_+|WP_000598921.1|DBSCAN-SWA MIKWPWKAQEITQNEDWPWDDALAIPLLVNLTAQEQARLIALAERFLQQKRLVALQGFELDSLKSARIALIFCLPILELGIEWLDGFHEVLIYPAPFVVDDEWEDDIGLVHSQRVVQSGQSWQQGPIILNWLDIQDSFDASGFNLIIHEVAHKLDMRNGDRASGIPFIPLRDVAGWEHDLHAAMNNIQDEIDLVGESAASIDAYAATDPAECFAVLSEYFFSAPELFAPRFPALWQRFCQFYRQDPSQRLRVSADEGDYGEESEH >NZ_CP019416|2015480:2068904|2039132_2039570_-|WP_000501481.1|terminase|DBSCAN-SWA MGAVVRSSGGGRKRNLPSGQKSKLTRIAPPEELMSDIAIRIWKTQSKILIERGVFDLEDAPLLLAYCNAFHLMIEAEKVIAEEGLTVSSEMGGEKKHPAVNVRNDSVSQLARLGSLLGLDPLSRIRMTSGKNDPDDEGNEFDEFD >NZ_CP019416|2015480:2068904|2058297_2059287_+|WP_000532847.1|integrase|DBSCAN-SWA MGRKRAPGNEWMPKGVFFRPSGYYWKPGGSTENIAPADATKAEVWVAYEKKVEGRKNRITFTQLWRKFLASADYADLAPRTQKDYLAHEKYILAVFGDAEAKAIKPEHIRRYMDARGQKSRVQANHEHSSMSRVFRWSYQRGYVPGNPCVGVDKFPKPQRDRYITDEEYRAIYNNATPAVRAAMEIAYLCAARVSDVLKMNWNQILEKGIFIQQGKTGVKQIKSWTDRLRDAVEICREWGEEGPVIRTMYGERYSYKGFNEAWRKARKAAGDDLGRPLDCTFHDLKAKGISDYEGTAKDKQKYSGHKTESQVLVYDRKVKMSPTLDRKR >NZ_CP019416|2015480:2068904|2039716_2040067_-|WP_001135228.1|DBSCAN-SWA MPPRTPKSCRVRGCRSTTTDPSGYCESHRSEGWKQYKPGQSRHQRGYGSKWDVIRERILKRDKGLCQLCLRAGVVREAKTVDHIIPKAHGGTDADSNLQSLCWPCHKAKTARERLK >NZ_CP019416|2015480:2068904|2041989_2042562_-|WP_000765639.1|DBSCAN-SWA MKLFSPLSYLRIKHEEKDWYDYKIPAAVSLIVTIVYYFHASKISLIETNGLLLQVNGLLQVLIGFYIAALAAVSTFSSSSIDEVMAGVPPTLVEKFRGQKLTVELTRRRFVCYLFGYLALVSFMLFCLGMISILIGKPFHLWLLTFCSPDAILWLKTVFVGVYIFILMNIITTTLLGLYFLAVRFHQSSL >NZ_CP019416|2015480:2068904|2033398_2033803_-|WP_000776844.1|head|DBSCAN-SWA MKLRQAQASATYLLPDPGELDQRIVIRRRVDVPADDFGVTPTYPEQIRAWAKKAQPGAAAYQGAVQIENRVTHYFTIRFRRGITADHEVLHDDISYRVKRVRDLNSKRRFLLLECEELGTDNGSDYAAESIFTR >NZ_CP019416|2015480:2068904|2063777_2064275_-|WP_001084817.1|DBSCAN-SWA MNSIRVIERQHEIDMIEGEAKHKAELERIKSESQYNTSELKEKLERLDIIVKNQEARNNELLEERKKLDDDLNNIFIFVDDLRETVLRVNNAYNRSMKEDDVVRKNMILSPIYEIIKESKFKDIEKYLEKVGVLENSLKRLRDSERKTFSMDELENNDYNAINAD >NZ_CP019416|2015480:2068904|2057459_2058029_+|WP_001061370.1|DBSCAN-SWA MNNLMVDLETMGKKPNAPVVSIGAVFFDPQSGEIGPEFYTAVSLESAMEQGAVPDGDTILWWLRQSPEARAAICADAVSVTTALIEFNDFITCHADDLKYLKVWGNGANFDNVILRGAFERASLPCLWNYRNDHDVRTMVTLGRAIGFDPKRDMPFEGDMHNALADARHQAKYVSAIWQKLIPPTSNNI >NZ_CP019416|2015480:2068904|2032189_2032354_-|WP_000497739.1|DBSCAN-SWA MFVKPAKGRSVPDPARGDLLPEGGRNVDENNYWLRREAAGDVRRTNKKVKTNGD >NZ_CP019416|2015480:2068904|2025049_2025583_-|WP_001273650.1|plate|DBSCAN-SWA MANHPLQNMITRAVITAIDTVRKCQTAGLKLIAGEKKENVEHLEPYGFTSAAQNGAEAVVLFPGGGRSHGVAVVVADRRFRLKGLARGEVALYDDQGQSVTLTRAGIVVNGGGKPVIFTNATKARFEMPIESTGDIRDNCDSSGKTMAEMRTTYNGHTHKENGDGGGITDKPGQPMS >NZ_CP019416|2015480:2068904|2026637_2027978_-|WP_000863817.1|DBSCAN-SWA MAFFSSTGWRGRLRDASFRGVPFSVEDDESTFGRRVQVHEYPNRDKPWTEDLGRATRRLTINAYLVGDDYADRRDRLIGAIETAGPGTLVHPQYGEMQGSIDGQVRITHSSTEGRMCRVSFQFVESGELSFPVAGMATAKRLETSGGLFDDAIDSMFSTFSLSGISDFIQNDVIADAASMLGDVADAFRMVDSGVSAAMRLLQGDLSVILMPPGAASDFVNALQKAWRSGDRLRGSTSDLVTMIKTMSGITLDPGLSPRGTWPTDSGSAAKQKMQRNMIAAAIRTTAISTAVHAVTTLKQPRDVPDVRGVNQPAGTGRDSDIITVMHPALDGVQTVSNGSFPPNYEDLKAIRTALNAAIDQEQLRIRDDVLFQQISVMRTDLNRDISARLAQVERTALRTPDDVLPALVLAAAWYDDAGRESDILTRNPVPHPGFIPVEPLRVPVR >NZ_CP019416|2015480:2068904|2022969_2023557_-|WP_001207832.1|DBSCAN-SWA MALQDEYTQLLYHLLPEGPAWDGENPLIEGLAPSLNRVHQRADELMAEIDPARTTELIDRYEQLYGLPDSCAPEGVQTLQQRQQRLDAKANVAGGINERFYREQLDALGYTAATIEQFQNLDSTPDPEWGEFWRYYWRVNIPADANISWQTCTSTCDSAIRTWGDTVAECVIDKLCPSHTVVVFAYPEGKENAQN >NZ_CP019416|2015480:2068904|2052235_2052421_+|WP_001067433.1|DBSCAN-SWA MNNYYTCSFCGVSELDAKKLIAKGSKDEPAICSECVVSCVNILINYAAVIKPVKLNVTKGE >NZ_CP019416|2015480:2068904|2030703_2032200_-|WP_001007993.1|tail|DBSCAN-SWA MAISFNSIPSDTRVPLFYAEMDNSAANTARDSGASLLIGHASNDASIAVNSLVLVSSVDYARQICGAGSQLARMVGAYRKTDPFGELYVIAVPESTGAAATVALTVTGEATETGTVNVYTGRTRVQAPVTSGDDAAAVAVSIKDAVNANPDLPFTATSEAGVVTLTARHKGLYGNEIPVTLNYYGFGGGEVLPAGVNITVASGVKGAGAPALNDAVAAMGDEPFDYIGLPFNDTASVNTMATEMNDSSGRWSYIRQLYGHVYTAKTGTLSELVAAGDQFNLQHITLAGYEKDTQTPADELAASRTARAAVFIRNDPARPTQTGELVDMLPAPKGKRFTTTEQQTLLSHGVATAYVESGVLRIQRDITTYRKNAYGVADNSYLDSETLHTSAYVLRRLKSVITSKYGRHKLANDGTRFGPGQAIVTPAVIRGELGSTYRQLEREGIVENFDLFQQHLIVERNANDSNRLDVLFPPDYVNQLRVFAVLNQFRLQYSEEAA >NZ_CP019416|2015480:2068904|2019125_2019347_-|WP_001526483.1|DBSCAN-SWA MFLLYCRGTGEIGGRRPKLSPEQWAQAGRLIGAGIPRQQVAIIYDVGLSIHGLYWILLSANRKGVKKRDYRAN >NZ_CP019416|2015480:2068904|2055166_2055682_+|WP_000071068.1|DBSCAN-SWA MSNRIRNAQVFDARTGEYPVDMYIRWIIGGELDFDANYQRGYVWGHEEQQAFLNAVISGFPIGSVALAKAPDWCSRELPYIEVVDGKQRLTTLKKLITNEIPIILADGPLYWRDMTRAEQLAFGRRPLPAVVLDEVTYKDRLAYFMAVNFTGVPQSEEHKRHVMQLMEAAQ >NZ_CP019416|2015480:2068904|2043697_2044450_-|WP_076731003.1|DBSCAN-SWA MNLEALPKYYSPKSPKLSDDAPATASESLTITDVMAAQGMVQSKAPLGFALFLAKVGVQDPQFAIEGLLNYAMALDNPTLNKLSEETRLQIVPYLVNFAFADYSRSAASKARCEHCAGTGFHHVLREVVKHSRNGEPVIKEEWEKELCQHCHGKGEVSTVCRGCKGKGIVLDEKRTRLHGAPVYKICGRCNGNRFSRLPTTLARHHVQKLVPDLTDYQWYKGYADVIDKLVTKCWQEEAYAEAQLRKVTR >NZ_CP019416|2015480:2068904|2051334_2052030_+|WP_001020644.1|DBSCAN-SWA MNIGNRVRQLRRAKNMKIAELAEAIGVDAANISRLETGKQKQFTEQTLSRLADCLSVDIAELFTSDPKGNTVCKHSDMRKDSANVKDLFRIEILDVSASAGNGLIQGGDVIDVIHAIEYNKDKALAMFGGRPAAELKVINVRGDSMAPTIEPGDLIFVDISINQFDGDGIYVFGFDDKIYVKRLQMIPDKLLVISDNTNYREWSITKDNECRFGVFGKVLISQTQSLKRHN >NZ_CP019416|2015480:2068904|2034202_2035432_-|WP_000766103.1|capsid|DBSCAN-SWA MKLHELKQKRNTIATDMRALNEKIGDNPWTDEQRTEWNKAKSELEALDERIAREEELRRQDQTYVDENEEEQRNNQDPDKDPQQDEKRGQIFDKWMRHGASELSSEERKALRELRAQGVAPDEKGGYTVPDTFLAKVVEQMKSYGGIASVAQILATSDGRTMEWATADGTAEVGVLLGENEEAGEEDTEFGMDSLGALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGRGEARYLIQGTGTGTPKQPKGLKASVTGTTQTAAAGAVKWQEILALKHSIDPAYRRGPKFRLAFNDNTLKLISEMEDGQGRPLWLPDIVGVAPASVLNVPYVIDQEIDDIGAGKKFMFCGDFDRFIIRRVRYMILKRLVERYAEFDQTGFLAFHRFDCILEDTSAIKALVGKGSASS >NZ_CP019416|2015480:2068904|2037247_2037406_-|WP_000838395.1|DBSCAN-SWA MKSLIIDVAGVAGFGALVGGIYLKFGAAVALMAGGSGLLLWALLAARRIKTC >NZ_CP019416|2015480:2068904|2037402_2039133_-|WP_000257219.1|terminase|DBSCAN-SWA MATYPNVNAANQYARDVVNGKILACRLTMLACQRHLDDLERAKDPHWPYRFDKNKAERFLRFSQKMPHTSGEWARRKLRIEFEPWQKFALGVPFGWVRKDTGFRRFTEIYIEVPRKNGKSAIAAAVGNYMFCADGEYAAEVYCGATTEKQAWKVFAPALAMVKKLPALRQKFCIKPWAKKMTRPDGSLFAPIIGDPGDGDSPSCAIIDEYHEHDTDALYTTMTTGMGAREQPITLIITTAGFDIASPCYEKRTQVVEILERIREGGENEAIFGIIYTLDDDDDWTQPEALIKANPNYNISVKEGFLKAKQLLAMSTPGQTNKILTKHFNKWVSSKAAYYNLQKWMTAADKTLRLSDFAGEECYPGIDLASKLDLNAVVPVFRREIDGLSHYYCVSPMFWVPEDTVYATDPALKTIADRYQSFVNQGVLVPSDGAEVDYRLILEAILKLRETVKIAASPIDPYGATGLSHMLQDEGLEPVTITQNYTNMSDPMREIEAAIAAGRFHHDGNPLMTWCISNVVGKYLPGSDDVVRPVKEGAGNKIDGAVGLMMGVGRAMLNEPKDFLSNLDPDEELLFL >NZ_CP019416|2015480:2068904|2048083_2049058_-|WP_000096529.1|DBSCAN-SWA MSSLIQLLDRPIAYNPAFAKLKAGKVKAGPVAAVFLSQLVYWHNRMDGGWMYKTQADIASETALTRDEQETARKRLVALGVLEEARRGVPATMHYRINTARLEALLLETAKPVKKGAQEKTRLRDFQNVETPQSGLVQPRKPDCGDAANKNVETPQTSTGQPNEQACGDPTIFPTGDYTETTQEITQESKTPFCPVAEQPDPEVTLTDQAIEVLTHLNQVSGSRYQKSKTSLENIRARLREGYSVADLQLVIDLKHEHWHENDEQYQYMRPETLFGPKKFESYLQSATRWDQKGRPKRADWGAKKRDVMAFGPVDTTIPEGFRG >NZ_CP019416|2015480:2068904|2030024_2030351_-|WP_000588852.1|tail|DBSCAN-SWA MIKELVLKKPIMAHNEKLHVLELREPSYDEIEAIGFPFTVSGDGGVRLDSSVALKYIPVLAGIPRSSAAQLAKLDIFKACMLILNFFTRSETEEDSESGSTTPHTSGE >NZ_CP019416|2015480:2068904|2061701_2062976_+|WP_001680077.1|integrase|DBSCAN-SWA MSLTDTKVKNTRPSEKAVKLTDGFGLYLLVHPNGSKYWQLGYRFDGKQKVFSIGVYPAVSLADARQRRDEAKRLLTQGIDPNAKKQADEKVLQEKRDKTRSFRVVAKSWFATKTKWSEDYADTVWKRLETYVFPDIGDRNVSELDTGDLLVPVKKAETLGYLEIAMRIKQYITAILRHAVQQKLMRHNPAYDMEGAVQKPETEHRPALELEEIPLLLERIDAYKGRGLTTLAIKLNLLIFIRSSELRFARWSEIDFKSKLWVIPEQREAIENVKHSTRGAKMKRQHFVPLCRQALKILKEIRQLTYEEGNEAELIFTGCYDSFKPMSENTINKALRKMGYDTTQDICGHGFRTLACSALIESGLWSEDAVELQMSHKESNSVRAAYTHKAKHLDQRRLMLQWWADFLDENRYEMVRPFEFAQKQ >NZ_CP019416|2015480:2068904|2045460_2046321_-|WP_076731004.1|DBSCAN-SWA MNNLMVIDGIEVRRDVHGRYCLNDLHRAAGGEQKYRPKYWLDNKQTRELIEQLFTEGGIPSSEQNQSVRFFQGGSDARSLARAPVNTVRGGAEQGTYVCKELVFAYAMWISPSFHLKVIRTFDRITSAPQTSSGMAADKMQAGVILLGFMRKELNLSNSSVLGACQKLQEAVGLPNLAPQYAIDAPAGAPDGSSRPTLALSALLKQHGIRMTANQAYQQLAKLGVVEHRERYSRSAINGIKKFWSLTAKGCMFGKNITSPANPRETQPHFFESKFPELLKLLDTVH >NZ_CP019416|2015480:2068904|2046735_2047617_-|WP_000200166.1|DBSCAN-SWA MTSESVCIESSDVTISVDESASRTWRRPFLKWAGGKYSMLPDLYQVIPAGMRLIEPFVGGGSVFLNSDKHACFLLADVNTDLINLYQMLAVVPGAVIRHARVMFDRLNDAESYMALREEFNAQVMDAPERAAAFLFLNRHCFNGLIRYNRNNQFNVGWGKYPSPYFPEEEIRAFTEMAHNCVFMAAGFRRTLALAGEGDVVYCDPPYEPMPGKDGFTHYAAGGFTWDDHIALAECCVAAHQRGARVVIGNSTSPRVIDLYSQHGFEIRYISARRSISSKGSTREKAKDLVAIL >NZ_CP019416|2015480:2068904|2046337_2046727_-|WP_000779149.1|DBSCAN-SWA MKLTLPFPPSVNTYWRAPNKGPLKGRHMVSASGRKYQSEACAAVIEQLRRLPKPSTAPAAVEITLYPPDKRIRDLDNYNKALFDALTHAGVWEDDSQVKRMLVEWGPVFPKGKVEITITKFETGAGAAA >NZ_CP019416|2015480:2068904|2044463_2045453_-|WP_076731260.1|DBSCAN-SWA MRALLTPEIAPRMGVVLFRPGSELMPLFMQGRVLLEPEPEQYSSFACGAVPAVSQPLADDPAVRDVFCNESVIYRAGGLDSLESWLLRGNGCQWPHSDWHSEQMTTMRHAPGAIRLCWHCDNLLREQFTERLKSIAVENTTKWILSVVCRDLGFDDMHAVTLPELCWWMVRNNLAEVLPESAARKALRMPKAIVQSATRESEIVPSVLATSIVQDKAKKVLALRVDPESPESFMLRPKRRRWVNERYTRWVKSQPCACCGKQADDPHHLIGYGQGGMGTKAHDLFVLPLCRTHHNELHADTVAFEEKYGSQLELIFRFIDRALAIGVLA >NZ_CP019416|2015480:2068904|2041243_2041525_-|WP_076731002.1|holin|DBSCAN-SWA MVANDPSAALNAVICGVIVIVLMFYRRGDATHRPLISLLAYVMVLVYASVPFRFVFGLYESSHWLGVMVNILICAAVLWARGNVARLVDALRH >NZ_CP019416|2015480:2068904|2020851_2021451_-|WP_015701331.1|tail|DBSCAN-SWA MVYRTRGNGIMKKYQNIKNFRLIDAPVNRDKTQAEINIGAYFLESDDGQDWYECQSLFSDDTAKIMYDHEGVIWGVVNKPVPQRGNTYSVSMLWPVNMSVAEIDAADCPDDCRGDGTWLYQDGKVVQRGYSPEELRKKAEAEKVRRLAEAESAIAPLARAVKLKIATDEEIKRLEAWELYSVMVNRVDTSAPDWPDIPR >NZ_CP019416|2015480:2068904|2040626_2041244_-|WP_001075993.1|DBSCAN-SWA MNQQQFQQAAGISAGLSARWFPHIDAAMKEFGITAVNDQAMFIAQTGHESAGFTVLKESFNYSVEALKKTFGKRLTPYQCEMLGRIDGRQVAHQPQIANLVYGGRMGNKDAGDGWKYRGRGLLQITGRENYVKCGAALKLDLISTPELLAQEKHAARSAAWFFTLRGCLMYSGDVVRVTQIINGGQNGLADRNSRYNKARAALLV >NZ_CP019416|2015480:2068904|2050429_2050984_-|WP_023139406.1|DBSCAN-SWA MGHEPEWKVEKQPRWLVAAIKKTISSLHGGYEEAAEWLDVTKDALFNRLRTGGDQIFPIGWALVLQRAGGTYHLAHSVARASGGVFVPLADMEEVDNADINQRLLEAIEQITSYSQQIRVAIEDGVIEPHEKAVIDEELYQAIAKLQQHSTLVYRVFCVPEKGDARECAAPGAVASNFMEKTNA >NZ_CP019416|2015480:2068904|2019559_2020567_+|WP_000492926.1|DBSCAN-SWA MFSRVRGFLSCQNYSHTATPAITLPSSGSANFAGVEYPLLPLDQHTPLLFQWFERNPSRFGENQIPIINTQQNPYLNNIINAAIIEKERTIGVLVDGNFSAGQKKALAKLEKQYENIKVIYNSDLDYSMYDKKLSDIYLENIAKIEAQPANVRDEYLLGEIKKSLNEVLKNNPEESLVSSHDKRLGHVRFDFYRNLFLLKGSNAFLEAGKHGCHHLQPGGGCIYLDADMLLTGKLGTLYLPDGIAVHVSRKGNSMSLENGIIAVNRSEHPALKKGLEIMHSKPYGDPYIDGVCGGLRHYFNCSIRHNYEEFCNFIEFKHEHIFMDTSSLTISSWR >NZ_CP019416|2015480:2068904|2025582_2026641_-|WP_001066630.1|plate|DBSCAN-SWA MNNTVFLRVNGRDWGGWTSVRISAGIDRIARDFNVSITRQWPGGEDVPPVKNGDAVEVLIGDDLVITGWVEALPLRYDAQTIMTGIVGRSKTADLIDCSASPAQHNGKNLFLIASALARPFGVDVVDAGAPAAAVIEAQPEHGETVVDCLNRLLGQAQALAYDDERGRLVLGRPGSMKAATALVLGENILSCDTERSVRERFSSYLVTGQRPGTDDDFGEATIAAIRQSTGDAGVTRYRPHTIQQSGTATTDSCKSRCEFEARQRAAKTLETTYTVQGWRQGNGELWKPNQAVVVYDPLNGFDNETLVIAEVTYSQDNNGTLTEIRVGPADAYLPEPFRPKAKKKVSEEADF >NZ_CP019416|2015480:2068904|2054939_2055170_+|WP_000764235.1|DBSCAN-SWA MKLEMYTLDGSVIVDSNLVTQFYPDYKSGGELTVIETISATGETFTVRVKHSFLQVTSALATAWSVDEKKAEGAAQ >NZ_CP019416|2015480:2068904|2049275_2050433_-|WP_001087406.1|DBSCAN-SWA MNSLTVNNRLSQQPGMYEYRPLRHECRLSNSLVVRNHREHSLTVGDESCRNLTAGFGMEGDFMSMSFAGNQKLSALSICARAIRMSVLALCGNSGVILLSVKRQEHIDSAIPGRYTVQAPHKAGAGRGNPEFNIEHNRAHAVFSCHEHCYAQIMVGRAGPVSAGPGSMLTGISTPVRLTTYKVVESLGGEFIEFNIEAATMATVPTLAQPEIRIINGQAVTSSLAVADYFIKRHADVIRKIESLECSTLFRKRNFAFTSISINQPNGGTRKLPCYQITRDGFAFLAMGFTGKRAAQFKEAYIDAFNQMEKQLSTPSVLSDAAHNASVLYSYISSIHQVWLQQLYPMLEKAESPLAVSLHDRINDAAALASLINMTLNRSEVRGRK >NZ_CP019416|2015480:2068904|2055678_2056038_+|WP_000065085.1|DBSCAN-SWA MSNIDKLNDHELVDLKNAIERELKRRADGPKVTTYYVVSCITDAQHFTDLDCALRCLKSVTENLMEWVTESPENRDYVNQCTGIVGAKLQVKEMNLDHFNMRVAEKYFDDICYPQETAQ >NZ_CP019416|2015480:2068904|2016964_2017783_-|WP_001176778.1|DBSCAN-SWA MQLPEQDEFSDFFAANDDEQASLRRKFFLEKHKEPCLSESALEDYQALFMSIYGINIDWKEGTFSLLEALSDNQGGKPVTVKFDYDSEIETATINLVDTQYVFHHYPMGSDGFDTELVRIEHILANSGYSLRVYQNSTFSDTLSFLLIPSDEWKRVEQHYSPEHISEYFVPYGKQLVIPEVTAPVVNYVPSVKQEASNVPALFNARGIRICFLSIMLIAFAIYILWNILTKIEPLSSGQPAGCENLQNLYSKLRPEVAEPLKEKMRKSLGCK >NZ_CP019416|2015480:2068904|2023559_2024639_-|WP_000785580.1|plate|DBSCAN-SWA MADSQFARPELPQLIATIRSDLLTRFQQDVVLRRMDAEVYSRVQAAAVHTLYGYIDYLARNMLPDMCDEDWLYRHARIKRCPRKNAVSAKGFARWDGIAGTPEIPAGTQIQRDDQVTFTTLQTVKASGGLLRVPVIADVAGTAGNTDDGTALRLGTPITGIPSTGYADTLTGGADTEEPETWRARVMERYYWIPQGGADPDYVIWAKEIAGITRAWTFRHYKGTGTVGVMVATSNPVNPAPGDDLVKAVRDHILPLAPVAGGGLFVFAATEKSIPVTVALAKDTPEIRTAIIAELNALMLRDGAPSGKIYVSRISEAISLATGEVAHQLRVPAADVVLGKTELPVLGNITWATYTGENG |
65 | Salmonella_phage(76.36%) | holin,head,terminase,plate,transposase,portal,integrase,tail,capsid,protease | attL 2060369:2060383|attR 2066695:2066709 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
2175901 : 2186408
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NZ_CP019416|2175901:2186408|DBSCAN-SWA ATTAGAAATTTAAACCAAAAAACTCTTCAAATTTGCTAACTACATAATCTAAATGCTCTGTAGTCAAGCCAGGATAAATACCAATCCAGAACGTTTGATTCATTATACGGTCGGTATTTGTCAACTCACCCACTACACGATATTTCACATTAGCAAAATACGGTTGGCGAATCAGATTTCCAGCAAACAGTAAACGTGTACCGATTTTTGCTTCATCAAGGAATTTCACCAGTTCGACACGGTTAACACCGCTAGTTTCTTTCAGGGTGATAGGGAAGCCAAACCAGGATGGATCTGATTTCTCTGTTGCTTCTGGTAATTCGAGGAATTCAGTGCAAGATTGCAAGCCCTGTTTCAGATAGGAAAAGTTAGCTTTACGCTGCTCTACAAACTCTTCTACGCGCTCCAACTGAGCCAGACCACATGCTGCCTGCATGTCCGTGATTTTGAGATTATATCCGAGGTGGGAATAAGTATATTTGTGATCATAGCCTTGAGGAAGTGATCCCAATTGCTGACCAAAACGTTTACCGCAGGTGTTATCGCATCCTGGCGCACAATAACAATCCCGGCCCCAGTCACGGAACGACTCAATAATTTTCTTCAGTTCACCTGACTTGGTGAATACAGCACCGCCTTCACCCATTGTGATATGGTGAGCCGGATAAAAACTAACGGTTCCGATGTCACCAAAGGTACCTACCATCTGGCCTTCATAAGTCGTCCCAAGGGCATCACAGCAGTCTTCAATCAACCATAAGTTATATTTATCGGCAATCCGACGAACTTCACTCAGGTTAAATGCATTACCGAGTGTATGAGCGATCATTATCGCTTTTGATTTCTCAGTAACTGCAGCTTCAATGAGAGAGGCATCGATATTATATGTCGGGATATCAACATCCACGAATACCGGTATTAAACCATTCTGGATCGCCGGGTTAACTGTAGTCGGGAAGCCAGCAGCGACAGTAATAACCTCATCACCAGGTTTGAGAGCTCGCTCGCCTAATTTTGGGGAAGTCAGCGCAGTCAGTGCCAGCAAGTTTGCCGAAGAGCCAGATGTTGTCGTTAAAACATGAGGAACCCCAATAAATTCCCCAAGTTTTTTTTCAAAGGCATCATTGAAACGACCAGTAGTTAGCCATCCATCAAGAGACGCCTCAACCATCAATTGTAACTCTTTGGCACCAATAACCTTCCCGGAAGGAGGCACAACGCTTGTACCTGCAACAAAAGGTTTCGGGCTCAATGCCTCATTCGCATACTGAGCGACAAGCTGAGAGATTTGCTCACGCAGGTTATTTGCTGTCATTACTTTGATTCCTTAAACTTATTTTCTTAACGAGTAGTTGCAGACATATAGTCGCTGATTTCACGCTTTGAACAAATCAACATATCTTCGCCGCGAATCCATGCTTTATGCCATTTTACGATGCGACCAAGTGTTTCAGTCAATCCCCAACGCGGATGCCATCCTAATTGCATATTTGCTTTAGAGCAATCCAGTTTCAGGTAATGTGCCTCATGAGGATGATTCTCACCATCCAGTAACCAGCTTGCATCATCACCCCAAAGCGTGACCATCTTGTCAACAATAAATTCGACCGTCTTCGCATCTTCATCACGCGGGCCGAAATTCCATCCTTCAGAAAACTTAGCACCTTCTGTATATAAGCGTTGCGCCACCACAATGTAACCAGAAAGAGGCTCCAGTACATGCTGCCAGGGACGGATAGAATATGGGTTTCGAATAATAACCTGCTGGTTATTTTCAAATGAGCGCAGAATATCGGGAATTAAACGGTCTTTAGCCCAATCGCCTCCGCCTATGACATTACCAGCCCTCACAGACGCCAAACCAACGCCATGTTGCTCATAATTTGCAGGATTGAAGAATGAGTTCCGGAATGCAGACGCGACTAATTCTGCACAACCTTTACTATTAGAGTATGGATCGTACCCTCCCATGGGTTCGTTCTCACGATAGCCCCACACCCACTCACGATTGTCGTAGCACTTATCACTGGTGATATTTACGACTGCCTTTATGTTACCTACTTGCTTAACTGTTTCAAGCAAATGGACAGTACCCATAACATTTGTTGAGTATGTTTCGATTGGCTGTTCATAAGATAGGCGCACTAAAGGCTGGGCTGCCATATGGAAAACAATTTCTGGCTTAAATTCTGCAATAGAATTGCGCAGCTTTTCAAAATCACGAATGTCGCCAATATGAGATTCCATAAGATCATTAAGACGCACTATCTCAAATAAACTTGGAACAGTTGGCGCATCAAGTGCATAGCCTTTTACAATTGCACCCATTTCAGTCAGCCATAGCGAAAGCCAGCTTCCTTTAAAGCCAGTATGGCCGGTAACGAATACACGTTTACCTTGCCAAAAATTTTTATCAATCATCTAGTTACTCCCAGGTTTTCCACGGAGCTTTACCTTTTTCCCACAGCCCTTCGAGGTAAACTTTATCACGTAGGGTATCCATCGGCTGCCAGAAACCTGGGTGTTCAAAAGCCATTAACTCCCCCTGTTGTGCCAATGTCATTAATGGCTCTTGTTCCCAGGTTGTTGCATCGTTATCGATGAGATCGATAACCGATGGATTCAACACAAAGAAACCACCATTGATCATTGCCCCATCGCCTTTCGGTTTTTCCTGGAATGACCGGACCTGACCAGCTCGGATATCTAATGCGCCAAAGCGTCCTGGTGGAAAAGTAGCTGTTAAAGTCGCTTTCTTACCGTGAGCCTTATGGAAATCGATAGTCGCTTTGATATCAAGGTCGGCAACGCCATCACCATAAGTAAACAGGAAAGCCTCGTCATCTTTTACGTATTCAGCAACACGTTTCAGACGACCACCAGTCATTGAAGAATCACCCGTATCAACCAATGTGACATTCCATGGTTCAACACGTTTATGGTGAACTTCCATACGGTTTTCAGCCATATGGAATGTTACATCTGACATGTGAAGGAAGTAGTTCGCAAAATATTCTTTAATCACATATCCTTTATAACCACAGCAGATAATAAAATCCTTGATACCATGCACAGAATACATTTTCATAATGTGCCAAAGAATAGGCTTGCCACCAATTTCTACCATCGGTTTTGGTTTTACAATTGTTTCTTCACTTAGTCTGGTACCAAGTCCACCAGCCAGGATGACCGCTTTCATAAATTATCCTCAATATTATTTAGATGCGGTAAATGCATCAGAATAGAAATGTTCTACAGAGAGATTTTTCATCATAAAGTCCTTTTTACTGGCATCGATCATCACAGGTGAACCACATGCATATATATCGAAGAACTCTAGAGAATCAAAATCATCCATCACGGCATGATGGACAAATCCCTTTCTTCCCCCCCATTCGGCGTCATCACCAGAAACAACAGGGATATAATGAACGTTGTCGTGCTGTTCACTCCACTGCTGCGGTAATGCAGAGTAAAAATCTTTACTATATTGCATTCCCCAGTAAATGTAGATCTCACGACGACATTTTCCCTGAATGAGATGCTCAACCATTGATTTAACTGGAGCGAATCCAGTACCGCCTGCAAGGAAGATTATAGGTCTGTCACTTTCACGAATAAAAAATGTTCCGCAAGGCCCTTCAATGCGCATAAGAGTATTTTCTTGTAACTCCCCAAAAATGAGCGAACTCATCTGACCATTGGGAACATTCCTTACATGCAACTCAATACCATTCGACTCATCACTATTAGCGATAGAATAACTGCGAGTTACACCTTTATAATGTAAATTGATATACTGCCCTGGAAGGAAGCCAATTTTTGCTGTTGGTGGTGTGCGTAACTTCAAAGTCATAACATCGCCTGAAACCAGTACAGCACTATTTACCTTGCATGGGACAATTTTTTTTGTCTGTCCAGCTAGTTCAGGAAAAAAATGCGCATTTAGCTCAAGGGCGGTTTTAGGTTTACAGCAGCAGGTTAGTATTTTATCACCCTGTCCAAAAATATTACCTTTGGAGTCAACAACTTCTCCCGCCAACAAATCGGACTCACAGATACCACAATCACCCGCTTTGCAGCTATGTTCAAGATGGATACCAGCCGATAGCGCAGCATCGAGGATTGATTCATCCTCTCTACCGGAAAATTCAATATTTGATGGAAAAATCTTAATAATATGAGACACGATGCTTACTCTGTTAACAAGGCTTGATCCAGTAAAGGTGCTGCAGCATCTTTTGCTGAAAGCTCAGGCAGCTGAGAAAAAGGCCATTCAATACCTATTGCCTCATCATTCCATAGAATGCTACCTTCCGATGAAGGTGAGTAATAATTAGTTGCTTTGTACAGAAACTCTGCATACTCACTAAGAGTAACAAAACCATGAGCAAAACCTTCTGGAATCCAAAGCTGTCGCTTATTCTCAGCAGACAGATTTACACCAACCCATTGACCAAAAGTAGGCGATTCTTTTCGGATATCGACCGCAACATCAAAAACCTCACCGACAGCACAACGAACTAACTTCCCCTGTGCATTTTCTCCTCTCTGAAAATGTAGCCCTCTGAGTACGTTCTTTTTGGATTTTGAATGATTATCTTGAACAAATGTAACTTTACGTCCAATCAACTCTTCAAAGGTCTGCTGGTTATAACTTTCAAAAAAGAATCCCCTCTCATCGCCAAAAACTTTAGGCTCTAAGATCAAGACATCTGGTATTGCTGTTTTAATCACAATCATCACTTATAAACCTTTCACCATCTTCAGCAAATATTTGCCATAATCATTTTTTGATAATGGCCCGGCCAGTTCTATAACCTGTTGTGCATTTATAAAATTTTTACGAAATGCGATCTCTTCCGGGCAGGACACTTTTAGCCCCTGGCGTTCTTCGATGGTTGCAATAAAATTACTGGCCTCTATCAAACTCTGATGCGTCCCTGTATCCAGCCAGGCATAACCGCGCCCCATCATAGCGACAGACAATCTTCCCTGCTCCATATAGATACGGTTAATATCAGTGATTTCTAACTCACCGCGAGCGGAAGGCTTAAGATTTTTCGCCATCTCCACCACGCTATTATCATAAAAATACAGCCCCGTTACCGCGTAATTACTCTTCGGTTGTAATGGTTTTTCTTCCAGACTAACGGCTGTGCCCTTTTGGTCAAACTCAACCACACCGTAGCGCTCCGGATCGTTTACATGATAAGCGAAGACGGTAGCACCACTTTCTTTATTAACGGCAGCTTCCATTAACTTTGGTAAATCATGACCATAGAAGATATTGTCACCCAGCACTAATGCACAATCATCATGACCAATGAACTCTTCACCAATAATAAACGCCTGTGCTAAGCCATCCGGGCTTGGCTGTACTTTATATTGAAGATTCAGCCCCCACTGGCTGCCGTCTCCCAGCAGTTGTTGAAAACGCGGCGTGTCCTGTGGCGTACTGATGATCAGGATATCCCGAATGCCTGCCAGCATAAGCGTGGAAAGGGGATAGTAAATCATCGGTTTATCATAAATTGGTAGCAATTGCTTACTTACCGCCATGGTCACCGGATAAAGACGGGTGCCGGAGCCCCCCGCTAAAATAATGCCCTTACGCGTTTTCATTTCCATTTCTCATTCATAGAAAATGCCCTGATGGGCATTTAAATTTATTAGATGGTTGTCGTCGTAAACATTTCAGTCAGCATACGCTTAACTCCTAATTCCCATTGAGGCAGAATAAGGTCAAAATTACGCTGAAACTTTTCAGTATTGAGACGCGAATTGCCTGGTCTGCTCGCCGGCGTCGGGTAGGCGCTGGTCGGCACAGCATTAAGCTCAGTCAGCGCAAGCGTTATCCCTGCTTTGCGCGCCTCGTCAAAGACTAAGGCCGCGTAGTCATGCCAGGTTGTGGTTCCCCCGGCAACCAGATGGTAAAGACCTGCGACTTCTGGTTTATTTAACGCCACACGGATCGCATGCGCCGTACAGTCAGCCAGTAATTCCGCACCGGTTGGCGCACCGTACTGATCGTTAATGACTGAAAGTGTCTGACGCTCTTTCGCCAGACGAAGCATTGTCTTTGCGAAATTATTGCCCTTACCTGCATAAACCCAACTGGTGCGGAAGATAAGGTGTTTAGGGCAGTTATCCTGCAGGGCCTTTTCTCCCGCCAGTTTGGTTTTGCCATAGACATTCAGCGGCGACGTAGCGTCCGTTTCCTGCCATGGGATATCGCCGGTACCAGGAAATACATAATCGGTTGAATAATGCACTACCCATGCGCCAGTTTCGTTGGCTGCTTTAGCGATGGCTTCCACACTGGTGGCGTTAAGTAACTGCGCCAGTTCTGGTTCAGACTCTGCTTTATCTACTGCAGTATGGGCTGCTGCGTTAACAATCACATCGGGACGAAGCTTACGAACGGTTTCGGCAACGCCTTTCGGATTACTAAAATCACCGCAAAACTCTTTTGAATGGACATCCAGGGCAATCAGATTCCCTACCGGTGCCAGAGAACGTTGCAACTCCCAGCCTACTTGCCCTGTCTTACCAAAAAGTAAGATATTCATTACTGGCGTCCTTCATAGTTCTGTTCTATCCAACTCTGATACGCCCCACTTTTAACATTGTTTACCCATTGAGTATTTGCAAGGTACCATTCCACTGTTTTACGAATACCGCTTTCAAAGGTCTCCAGCGGTTTCCAGCCTAATTCGCGGCTAATTTTACCTGCATCAATGGCATAACGACGATCATGGCCCGGACGATCCGCGACATAAGTGATTTGTTCACGATAAGAAGTCGCTTTGGGTACAATCTCATCCAGCAGATCACAGATGGTAAATACCACATCGAGATTTTTCTTCTCATTGTGTCCACCAATGTTATAAGTCTCCCCTGCCTTGCCTTCAGTCACTACCATATGAAGCGCGCGAGCATGATCTTCTACATATAGCCAATCGCGAATCTGATCCCCTTTGCCATAAATTGGCAAAGGCTTTCCTTCCAGTGCGTTCAAAATGACCAACGGAATCAGTTTTTCAGGGAAGTGATAAGGGCCATAGTTATTAGAACAATTGGTAACGATCGTTGGTAGACCATAGGTACGCCGCCAGGCACGGACTAAATGATCGCTGGATGCTTTTGACGCAGAATAGGGGCTACTTGGCGCATATGCCGTCGTTTCAGTAAATAACGGCAGCGTAACGCTGTTTTCAACTTCATCAGGATGCGGTAAATCGCCGTAAACTTCATCAGTGGAAATATGATGAAAACGAAAATTATTTTTTTTATCTTCGCCAAGGGCAGACCAGTATTTACGCGCAACTTCAAGAAGTGCATAGGTGCCGACGATATTGGTTTCAATAAATGCTGCTGGCCCGGTAATCGAACGGTCCACATGACTTTCCGCAGCCAAATGCATCACCGCGTCCGGCTGGTACTGCTCAAAAATACGCGTTATTTCAGCGGAATCACAAATATCCGCGTGTTCAAAATTGTAGCGATTACTTTCAGAAATATCAGAAAGGGATTCAAGATTACCGGCGTAGGTTAATTTATCAATATTAACTACAGTGTCCTGTGTATTCTTAATAATATGGCGGACAACAGCTGATCCAATAAAACCTGCCCCGCCAGTAATAAGTATCTTCACTTTTCTATTCCATAAGGCGTATTTAATGTGGTATTTAATTTGCCAATAAAAATTAATTGCTCAAGTCGTTACACACGCTACCGCCCCTGGCTCATCAGCTACCTGTGCACTGCGTACATATCGACTTGTTACAAACCTCGCCCAGCAGGGCAAAGCTCACTAAAACTTAAACGCTAATTGTCTTATTAATTGCATCCGGAAACAAGGATTAATCTTATAAAATCAGCATTAAAATGCTCCAGATAACCCCTTGTTACTTAAGCCCTTTATACAAAACTAAAACGGCAGTCAACACTCGTTTCAGCCAACTTGCCGCTTCGAATGTTCACTGCCGTTATTATGTTTATCACCAACCATTTATCACGGTTGTTAATACTTATTCATGCAAAAGCTGCTCTATGCTCTTACGGAACTTCGCTCCTTCTTTCAGGTTGCGCAGCCCGTACTTCACAAATGCCTGCATGTAGCCCATTTTTTTACCGCAGTCATAGCTGTCACCCGTCATTAGCATCGCGTCAACCGACTGTTTTTTCGCCAGTTCTGCAATGGCATCGGTGAGCTGGATACGGCCCCAGGCGCCCGGTTCGGTTCTTTCCAGTTCCGCCCAGATGTCGGCTGAAAGCACATAACGGCCTACCGCCATCAAATCGGAATCCAGCGTCTGCGGCTGATCCGGTTTTTCGATAAACTCCACAATCCGGCTGACTTTGCCTTCATTATCCAGAGGTTCTTTCGTCTGGATAACGGAATACTCCGATAAATCACCTTTCATGCGCTTCGCCAGCACCTGGCTGCGACCCGTTTCATTGAAACGCGCCACCATCGCCGCAAGGTTATAGCGCAGCGGATCGGCGGTAGCATCATCGATAATAATATCCGGGAGTACCACAATGAAAGGGTTATCGCCTACGACCGGACGCGCGCACAGAATAGAGTGCCCCAGCCCTAACGGCTGCGCCTGGCGAACGTTCATAATCGTCACGCCCGGTGGGCAGATAGATTGCACTTCCGCCAAAAGCTGGCGCTTAACGCGCTGCTCAAGAAGTGATTCAAGTTCATAAGAGGTGTCGAAGTGGTTCTCAACGGCGTTTTTAGACGCGTGAGTCACCAGTACGATTTCTTTGATCCCTGCAGCCACAATCTCATCGACAATGTACTGAATCATTGGCTTGTCGACGATCGGTAGCATCTCTTTTGGGATTGCCTTGGTGGCAGGCAACATATGCATACCCAAACCCGCTACCGGTATAACTGCTTTCAAATTCATCATTGTTTCTTCCACCTGTAAAATGGTTGCTGAATTATAGCTCTTTAGCTTGTTTTCGCCAGCATGAATTACTCTGCTGCCAGGGATAATGATGGCACGCTCTACATTACGTCTTAGTCGGCACCATAACATTAAGTATGAACAACTTTTTCCCAGGAATTTTCGTAAAAATAGCGGTACTTACCCTCCCCGCTTCGGCAGCGAAAAATTCACTGCTTCGACATTCACGGTTTGGTGATTAATCCTGTCGATATCCACGGAACTCTGCCCGTTTTCATTGATGGCATGAACATTAGCGAGGGAAAGCAGCGTGTCCTGGCGGGCCATAAATTGACCACGGACATCTTTACGCAAATCGAAATGCATTTTTAACGCCGGGCCAATCGCTGAAGTTTGCATCACGTTGATATTACGCAGAAAGAGGTGCTGCGGTTGATTATGCAGTTCCAGCGTAGCACGCGTCATCCGTACATTGGTGATGGCGACAAAAGAGGGGGTGTTGCCGGAGGAAATTTGAATGCCGCGTAATTTATAAGCAACCTGGCGATTATCCAACCGAATAGCGTTTAATTTAAAGTTTTGCGGAATTGACAGGTATTTTCCTTTAACGACGCCATAGCCGATGAGCATCCCAGCACTATTCGTCATATCAATATTATCAATGACGAAATTATCACAGCCATAAATGGCGATCGTTGCGTTATCAATACCCGCATTTTTACTGAAATCGGGCGTGATGTTTTTGGCTTTGACATTGCGAATGACGAAATGTTTGCCATTTTCTACGTGCACCAGCTGTCGGCAATCAGATCCGGTAATATTGGCCACCACAAAGTTTTTTACTGCCTGATCTTCAGGATAACTGTTGTCATAGGTGCTACCCGCCAGCCCGATGCCGATCCCCCAGTTGATTTTGCCATTGGTACAATCAATGCGTTCGATGACATGATCGGAAATCAGGATGTCGCGGTCGTGAATCGCGACATTCCACTCAATGGCGTCCCCCTGCAAATCGCTAAAGCGGCTATGCGTAATCCGCGCGCCGTCCATTTGGTTATGAAATCCCTGGCGGAGAATGGCGTAGTTGGCGTGGGTAACGGTGATGTCATCGATAATGAGATTACGCATCACCTGCGGTTCCTTACCGCCGATGAAAATTTGCGCGACGGGGCCAAAGCCGCTCATCGTCACGCCTTTAATCACACAGTCCGACCCGCGAACATCCAGCGTCACATTGTGCAGACTGCCGCCCTGCCCCCCCACCACCTGACACCCGTCCTGCAAAATAAACCGTCCCCGGCCATTCCCACGCACCGTGCCCTGTACCCGCAGCGTTTTTCCCGCCGGAATCGTTATCGCCGCATTGATATTTTCACACACCCATCCTGGCGGTACGACCACGGTCTGTCCGTCGGCGAAGGCCTGTTTGAACGAGGCGATACCGTCATCCGCCGGATAATCCTTAATATCGACGGTCTCGCGAGGTTCACGCGCCTGTACCGGCAAGGCGCGCAGAAAAGGAAGAACAGCAAGCGCAGAACCTGCCGTCAGGAGAGTACGTCGGGAGAATTTAGTCGCGGGCAT
Protein sequences of DBSCAN-SWA_6 >NZ_CP019416|2175901:2186408|2182471_2183557_-|WP_000697840.1|DBSCAN-SWA MKILITGGAGFIGSAVVRHIIKNTQDTVVNIDKLTYAGNLESLSDISESNRYNFEHADICDSAEITRIFEQYQPDAVMHLAAESHVDRSITGPAAFIETNIVGTYALLEVARKYWSALGEDKKNNFRFHHISTDEVYGDLPHPDEVENSVTLPLFTETTAYAPSSPYSASKASSDHLVRAWRRTYGLPTIVTNCSNNYGPYHFPEKLIPLVILNALEGKPLPIYGKGDQIRDWLYVEDHARALHMVVTEGKAGETYNIGGHNEKKNLDVVFTICDLLDEIVPKATSYREQITYVADRPGHDRRYAIDAGKISRELGWKPLETFESGIRKTVEWYLANTQWVNNVKSGAYQSWIEQNYEGRQ >NZ_CP019416|2175901:2186408|2181572_2182472_-|WP_001023662.1|DBSCAN-SWA MNILLFGKTGQVGWELQRSLAPVGNLIALDVHSKEFCGDFSNPKGVAETVRKLRPDVIVNAAAHTAVDKAESEPELAQLLNATSVEAIAKAANETGAWVVHYSTDYVFPGTGDIPWQETDATSPLNVYGKTKLAGEKALQDNCPKHLIFRTSWVYAGKGNNFAKTMLRLAKERQTLSVINDQYGAPTGAELLADCTAHAIRVALNKPEVAGLYHLVAGGTTTWHDYAALVFDEARKAGITLALTELNAVPTSAYPTPASRPGNSRLNTEKFQRNFDLILPQWELGVKRMLTEMFTTTTI >NZ_CP019416|2175901:2186408|2183933_2184827_-|WP_000981469.1|DBSCAN-SWA MMNLKAVIPVAGLGMHMLPATKAIPKEMLPIVDKPMIQYIVDEIVAAGIKEIVLVTHASKNAVENHFDTSYELESLLEQRVKRQLLAEVQSICPPGVTIMNVRQAQPLGLGHSILCARPVVGDNPFIVVLPDIIIDDATADPLRYNLAAMVARFNETGRSQVLAKRMKGDLSEYSVIQTKEPLDNEGKVSRIVEFIEKPDQPQTLDSDLMAVGRYVLSADIWAELERTEPGAWGRIQLTDAIAELAKKQSVDAMLMTGDSYDCGKKMGYMQAFVKYGLRNLKEGAKFRKSIEQLLHE >NZ_CP019416|2175901:2186408|2175901_2177215_-|WP_000126349.1|DBSCAN-SWA MTANNLREQISQLVAQYANEALSPKPFVAGTSVVPPSGKVIGAKELQLMVEASLDGWLTTGRFNDAFEKKLGEFIGVPHVLTTTSGSSANLLALTALTSPKLGERALKPGDEVITVAAGFPTTVNPAIQNGLIPVFVDVDIPTYNIDASLIEAAVTEKSKAIMIAHTLGNAFNLSEVRRIADKYNLWLIEDCCDALGTTYEGQMVGTFGDIGTVSFYPAHHITMGEGGAVFTKSGELKKIIESFRDWGRDCYCAPGCDNTCGKRFGQQLGSLPQGYDHKYTYSHLGYNLKITDMQAACGLAQLERVEEFVEQRKANFSYLKQGLQSCTEFLELPEATEKSDPSWFGFPITLKETSGVNRVELVKFLDEAKIGTRLLFAGNLIRQPYFANVKYRVVGELTNTDRIMNQTFWIGIYPGLTTEHLDYVVSKFEEFFGLNF >NZ_CP019416|2175901:2186408|2185004_2186408_-|WP_001111845.1|DBSCAN-SWA MPATKFSRRTLLTAGSALAVLPFLRALPVQAREPRETVDIKDYPADDGIASFKQAFADGQTVVVPPGWVCENINAAITIPAGKTLRVQGTVRGNGRGRFILQDGCQVVGGQGGSLHNVTLDVRGSDCVIKGVTMSGFGPVAQIFIGGKEPQVMRNLIIDDITVTHANYAILRQGFHNQMDGARITHSRFSDLQGDAIEWNVAIHDRDILISDHVIERIDCTNGKINWGIGIGLAGSTYDNSYPEDQAVKNFVVANITGSDCRQLVHVENGKHFVIRNVKAKNITPDFSKNAGIDNATIAIYGCDNFVIDNIDMTNSAGMLIGYGVVKGKYLSIPQNFKLNAIRLDNRQVAYKLRGIQISSGNTPSFVAITNVRMTRATLELHNQPQHLFLRNINVMQTSAIGPALKMHFDLRKDVRGQFMARQDTLLSLANVHAINENGQSSVDIDRINHQTVNVEAVNFSLPKRGG >NZ_CP019416|2175901:2186408|2177241_2178321_-|WP_000565905.1|DBSCAN-SWA MIDKNFWQGKRVFVTGHTGFKGSWLSLWLTEMGAIVKGYALDAPTVPSLFEIVRLNDLMESHIGDIRDFEKLRNSIAEFKPEIVFHMAAQPLVRLSYEQPIETYSTNVMGTVHLLETVKQVGNIKAVVNITSDKCYDNREWVWGYRENEPMGGYDPYSNSKGCAELVASAFRNSFFNPANYEQHGVGLASVRAGNVIGGGDWAKDRLIPDILRSFENNQQVIIRNPYSIRPWQHVLEPLSGYIVVAQRLYTEGAKFSEGWNFGPRDEDAKTVEFIVDKMVTLWGDDASWLLDGENHPHEAHYLKLDCSKANMQLGWHPRWGLTETLGRIVKWHKAWIRGEDMLICSKREISDYMSATTR >NZ_CP019416|2175901:2186408|2180094_2180646_-|WP_000973708.1|DBSCAN-SWA MMIVIKTAIPDVLILEPKVFGDERGFFFESYNQQTFEELIGRKVTFVQDNHSKSKKNVLRGLHFQRGENAQGKLVRCAVGEVFDVAVDIRKESPTFGQWVGVNLSAENKRQLWIPEGFAHGFVTLSEYAEFLYKATNYYSPSSEGSILWNDEAIGIEWPFSQLPELSAKDAAAPLLDQALLTE >NZ_CP019416|2175901:2186408|2178325_2179099_-|WP_000648784.1|DBSCAN-SWA MKAVILAGGLGTRLSEETIVKPKPMVEIGGKPILWHIMKMYSVHGIKDFIICCGYKGYVIKEYFANYFLHMSDVTFHMAENRMEVHHKRVEPWNVTLVDTGDSSMTGGRLKRVAEYVKDDEAFLFTYGDGVADLDIKATIDFHKAHGKKATLTATFPPGRFGALDIRAGQVRSFQEKPKGDGAMINGGFFVLNPSVIDLIDNDATTWEQEPLMTLAQQGELMAFEHPGFWQPMDTLRDKVYLEGLWEKGKAPWKTWE >NZ_CP019416|2175901:2186408|2179114_2180089_-|WP_023200991.1|DBSCAN-SWA MSHIIKIFPSNIEFSGREDESILDAALSAGIHLEHSCKAGDCGICESDLLAGEVVDSKGNIFGQGDKILTCCCKPKTALELNAHFFPELAGQTKKIVPCKVNSAVLVSGDVMTLKLRTPPTAKIGFLPGQYINLHYKGVTRSYSIANSDESNGIELHVRNVPNGQMSSLIFGELQENTLMRIEGPCGTFFIRESDRPIIFLAGGTGFAPVKSMVEHLIQGKCRREIYIYWGMQYSKDFYSALPQQWSEQHDNVHYIPVVSGDDAEWGGRKGFVHHAVMDDFDSLEFFDIYACGSPVMIDASKKDFMMKNLSVEHFYSDAFTASK >NZ_CP019416|2175901:2186408|2180646_2181525_-|WP_000857529.1|DBSCAN-SWA MKTRKGIILAGGSGTRLYPVTMAVSKQLLPIYDKPMIYYPLSTLMLAGIRDILIISTPQDTPRFQQLLGDGSQWGLNLQYKVQPSPDGLAQAFIIGEEFIGHDDCALVLGDNIFYGHDLPKLMEAAVNKESGATVFAYHVNDPERYGVVEFDQKGTAVSLEEKPLQPKSNYAVTGLYFYDNSVVEMAKNLKPSARGELEITDINRIYMEQGRLSVAMMGRGYAWLDTGTHQSLIEASNFIATIEERQGLKVSCPEEIAFRKNFINAQQVIELAGPLSKNDYGKYLLKMVKGL |
10 | Enterobacteria_phage(37.5%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_7 |
2253605 : 2262776
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >NZ_CP019416|2253605:2262776|DBSCAN-SWA TATGACTCAAGTCGCGAAGAAAATTCTGGTAACGTGCGCGCTGCCGTACGCCAACGGCTCTATCCACCTCGGCCATATGCTGGAGCACATCCAGGCTGATGTCTGGGTCCGTTACCAGCGAATGCGCGGCCATGAGGTTAACTTCATCTGTGCCGATGACGCTCATGGCACGCCGATCATGCTGAAAGCGCAGCAGCTTGGTATTACGCCGGAGCAAATGATCGGTGAAATGAGCCAGGAGCACCAGACCGATTTCGCCGGTTTTAATATTAGCTACGACAACTACCACTCAACGCACAGCGACGAGAATCGCGAGCTGTCCGAGCTGATTTATACGCGCCTGAAAGAGAACGGTTTTATTAAGAACCGCACTATCTCTCAACTCTACGATCCGGAAAAAGGCATGTTCCTGCCGGACCGATTTGTGAAAGGCACCTGCCCGAAATGTAAATCCGCGGACCAGTACGGCGATAACTGTGAAGTCTGCGGCGCAACTTACAGCCCGACCGAACTTATCGAGCCGAAATCCGTGGTGTCCGGCGCGACGCCGGTAATGCGTGACTCCGAGCACTTTTTCTTTGACCTGCCGTCATTCAGCGAAATGTTGCAGGCGTGGACCCGCAGCGGCGCGCTGCAGGAGCAGGTGGCGAACAAAATGCAGGAGTGGTTTGAATCCGGCCTGCAACAGTGGGACATTTCCCGCGACGCGCCGTATTTTGGTTTCGAAATCCCGAACGCGCCGGGCAAATATTTCTACGTCTGGCTGGACGCGCCGATTGGCTATATGGGCTCCTTCAAAAATCTGTGCGATAAGCGCGGTGACACGACCAGTTTTGATGAGTACTGGAAAAAAGACTCCGACGCCGAGCTGTACCACTTTATCGGCAAAGACATCGTCTATTTCCACAGCCTGTTCTGGCCTGCCATGCTGGAAGGCAGCCACTTCCGTAAGCCGACCAACCTGTTCGTTCACGGTTACGTGACGGTGAACGGCGCGAAGATGTCTAAGTCTCGCGGCACCTTTATTAAGGCCAGCACCTGGCTGAAACACTTTGACGCCGACAGCCTGCGCTACTACTACACCGCGAAGCTTTCTTCACGCATTGATGACATCGACCTGAACCTGGAAGACTTTGTCCAGCGCGTCAATGCCGATATCGTCAATAAAGTAGTCAACCTGGCCTCGCGTAACGCCGGTTTTATCAATAAGCGTTTCGACGGCGTGCTGGCGGCTGAACTGGCCGATCCGCAATTGTATAAAACCTTTACTGACGCCGCTGCGGTGATTGGCGAAGCATGGGAAAGCCGTGAATTCGGTAAAGCTATCCGTGAGATTATGGCGCTGGCCGACATCGCTAACCGTTATGTTGACGAGCAAGCGCCGTGGGTGGTGGCTAAACAAGAAGGCCGCGACGCGGACCTGCAGGCCATTTGCTCAATGGGCATCAACCTGTTCCGCGTGCTGATGACGTATCTGAAACCGGTACTGCCGACGCTTTCTGAACGCGTTGAAGCCTTCCTGAACAGTGAGTTGAACTGGGATGCCATCGAACAGCCGCTGCTCGGTCACAAGGTCAACACCTTTAAGGCGCTCTACAATCGCATCGACATGAAGCAAGTTGAAGCGCTGGTTGAAGCGTCTAAAGAAGAGGTGAAAGCCGCAGCCGCGCCGGTTACCGGCCCGTTAGCCGACTTCCCGATTCAGGAAACCATCACCTTTGACGATTTCGCCAAAATTGACCTGCGCGTAGCATTGATTGAAAATGCTGAGTTCGTAGACGGTTCTGACAAATTGCTGCGCCTGACGCTGGATCTGGGCGGCGAGAAGCGTAACGTCTTCTCCGGCATTCGTTCCGCCTACCCGGACCCGCAGGCGCTGATTGGCCGCCAGACGGTAATGGTCGCCAACCTCGCGCCGCGCAAAATGCGCTTTGGCGTCTCCGAGGGAATGGTGATGGCCGCAGGACCTGGCGGGAAAGATATCTTCCTGTTAAGCCCTGATGACGGCGCGAAGCCTGGCCAACAGGTGAAATAAGCAACAAGCCGGAGCATGCTCCGGCTTTTTTAACCGCTTAATCCTGACTGGCATATCACCCCATCCGCGTGTTTTAACTTTTTCCTTAATAAGCAATAGCGTTAAGACTCCATATCCGGACTGCTAAATAACGGCTAAAAGTCATATTCTATCTCTCCCTGATACATTGCTATTACTGGGTTAAATTAATTCATTGAAATTATTTTAAAAACCCAATCATTATATAAAATGGAGTTTTAGATGAAAATTTCTGGCAAGTTATTGTCCGCGGCTTTGGCTTCCGTACTGGTGTTCTCTCTTGCTGGCTGTGGCGATAAAGAAGAATCAAAGACCTTTAACGCAAACCTGGCGGGGACAGAAATTTCAATTACCTACACCTATAAAGGTGACAAAATCATTAAGCAGACGTCTGAAAGTAAAATCAGCTATGCCACTGTAGGCGCTAAAACGAAAGAAGATGCCGCCAAAATTCTCGATCCGCTGAGCGCGAAATATAAAAATATCGCCGGAGTGGAAGAAAAATTAACCTATGAAGATACCTATGCCCAGGAAAACGTCTCTGTGGATATGGAAAAAGTGGACTTTAAAGCGTTACAGCAAATCTCAGGGACGATGGTGTCCGGCGATACCAGCAAAGGTATCAGCATGAAACAAACCCAGACGCTGCTGGAAGCTGCTGGTTTTAAAGAAGCGAAATAACTGGCAGGCATAATGTATTGCATCGACTGGTAAAGTCGCTCAGGGCGCTGCTTCGGCAGCGTCTTTTCTTTATGAATTCCGAAAAAAGACAGCCTCGCTGTAAGTCCCCTCGCCAAAGCCTATAATCCCTGAATTCCTGGCCACACCAATACTTAACGACAAGGAATTGTTATGCAGGTTCTACGTCTTATGGCACTGCCACTATTCGCGCTCTCTCTATCGGTTAGCATAACTGGCTGCGATCAGAAAAACGATACTCTCCAGGGAAAGCAAAATAACATGACAGCGTTTATCAAGAAGATAGCCGCTAGCAAAGAGTCAGAGGAAACACAACGCTATGTAGGTAATCTCAACGGTATTGAAATCAAGTTAACCTATTACTACAAAGGGGATATCGTTTTACGTCAAATATCTGAACATAAACTACTTTATAAGACCCTGAAAGCCAATAATAAAGAAGAAGCACAAAAAATGCTGAGTCAAGTCGGCGAAGCTTATCAGGGTATGCCGGGTTTGACTGAACGAATCGACTATTATGATAGCTATGCTACGGAATATGTGGATATTGATTTTACCCAGGCAAAAATAAGCGACCTCTGTAAATTGCCAGGATCATCAATTGACAACTGTTCCGCGTACTATCTGTCAATGATTCGCTCGCAGAAACTGTTGGAAGAGAGCGGGTATCATAGAATCAATTAGTATAATAATGCGTTTTCCCGGTCAGACAGGACGCTGCCGGGAACAATACCGCAGAATTACTTCGCCGCGTGTTCACGGGCCGCCAGACCGCGCAGAAAATAACGCATAAACTGATCGCCGCATTCGCGGAAGTTCTTATGGTCCGGCGCGCGCATCATCGCGGTGATCTCTGGCATTGAGACACGAAACAACTGACCGGTAAGTATCGCCAGGATATCATCTGTTTTTAGCGAAAAGGCAATACGCAGCTTTTTCAGCACAATATTGTTGTTGATACGACGTTCCGCCGTCAATGCAGGCGCCGCCTCATCTTTGCCGCGTTTTTCATAAATGAGGCCATTGAGAAATGAGGACAACACGATATCCGGGCAACGCTGAAACCCCTCTTCCTCTTCTTTGCGCAACCAGATTGCAATCTGCTCCGGCGTAGCATCAACGTTACCCAGCGCCAGGATACGCGCCAGATCGGTATTATTAGCTTTTAAAATGTAGCGCACGCTACGCAGAATATCGTTACTCAGCATGAGGCCTTCGGTCGTTTCTATGGCAAAACGATATTCTAACAGTCTTTTACAGGCCAATCGCCTCTTTTAAACTTTTCAGATAGCGACGGCTTACCGGCACCGTCAGGCCGTTGCGTAAAATCAGCTCTGCCTGCCCATTATCCTCCAGCCGAATTTCCTGCAAATGGGCCATATTCACCAGAAACTGACGATGACAACGCAGTAGCGGCGTCCGGCTTTCCAGCGTGCGCAGCGTCAGCTCGGTAAACCCCTCTTTCCCTTCACTGCTGGTCACATAAACGCCGCTCATACGGCTACTGACAAAGGCGACATCATCCATTTGCAACAAATAGATCCGGCTGTGTCCGGTACAGGGAATGAATTTAAGCGCCTGCTGGTTTTCCGGCAACAACGAAACATCCTGTTTACTGCGCTCCTGACGCAGACGATGTAACGTTTTTTCCAGCCGTTTCTCCTCTATCGGCTTGAGCAGATAATCAAAAGCGTGTTCTTCAAAGGCTTTGATGGCGTATTCGTCAAACGCGGTTAAAAAAACGATATACGGGCGGTGTTCCGGATCAAGCATTCCTACCATCTCCAGTCCACTGATACGCGGCATCTGAATATCCAGAAACAGCACATCAGGTCGCAACTTATGTACCGCGCCAATCGCTTCTACCGCGTTCGCGCACTCTCCCACAATCTCAATGTCATCCTGCCCCTGGAGCAAAATCCGCAGATTTTCCCGCGCTAACGGCTCATCATCCACAATCAGCACTTTAATCATGCGTCCTCCTCCAGTGGAAGTCGTAATGTAATTCGGGTAAAACAGTCCGGCTCGCAGGCCACGCTAATACCATAATCATCGCCAAAGTGTTCGCGCAGACGTTTATCCACCAGACTCATCCCCAGCCCGCTACTGCCGGCGGAAGGCTGATACAGTCCCGCATTATCCTCAATATCTAACATCAAATGCTGCCCTTCGCGCCGGGCGCGAATAGCGACGTTGCCGGTATCAAGCAGTTGCGACGTGCCATGTTTAATGGCGTTCTCAACAATCGGCTGTAATGTAAACGCAGGCAATTTCTGACGTGAAAGCGTCGATGGAACATCAAGCTGTACCTGCAGACGCGACTGAAAACGCGCTTTTTCAATTTGCAGATAAGCGTTTACGTGTTCAATTTCATCCGCCAGCGTGACGATTTCCGACGGGCGTTTTAAATTTTTGCGAAAAAAGGTCGACAAGTACTGCACCAGTTGGCTGGCCTGTTCGCTGTCGCGGCGAATCACCGCTTTAATGGTATTGAGCGCGTTAAACAGAAAATGCGGGTTCACCTGCGCGTGCAACAGCTTGATCTCTGACTGCGTCAGCAACGCCTTCTGCCGTTCATACTGCCCGGCCAGGATCTGCGCGGATAAAAGCTGCGCAATACCCTCTCCCAGGGTACGGTTAATTGAGCTGAACAGTCGGTTTTTCGCTTCGTACAATTTAATGGTGCCCATGACTCGCTGATTTTCGCCACGCAGCGGGATCACCAGCGTCGAACCGAGTTTACACTGCGGGTGTAGCGAACAGCGATACGGCACTTCGTTGCCATCGGCATAAACCACCTCTCCGGTTTCAATTGCTTTCAGCGTATAACCTGATGAAATGGGTTTGCCCGGTAGATGGTGATCGTCGCCAATACCAGTAAAAGCCAGCAGTTTTTCGCGATCGGTGATGGCGACGGCGCCAATATCCAGCTCCTGATATAACACCTGCGCCACCTTCATACTGTTCACTTCGTTAAATCCCTGACGCAGAATCCCCTCCGTTGACGCGGCGACCTTCAGCGCGGTAGCAGAAAATGCCGAAGTATATTTTTCGAACATGGCGCGCTTATCGAGCAAAATACGCATAAACAGCGCGGCGCCAACGGTATTCGTCACCATCATCGGCGCGGCAATATTACTGACCAGATGCAGGGCATCGTCAAACGGCCTGGCTATCAGTAAAATGATCAGCATCTGCACCAGTTCGGCAACACACGTAATTGCTCCCGCCGTCAGCGGGCTAAACACTTTGTCCGGGCGTCCGCGACGTATGAGAACGCTGTGTACCAACCCGCCCAGCAGCCCTTCGACGATGGTGGAAATCATACAGCTCAGCGCCGTCATGCCGCCCATAGAATACCGATGTAACCCACCGGTCAGGCCGACCAGCCCGCCGACGACCGGCCCGCCGAGTAGGCCGCCCATCACCGCGCCAATCGCGCGGGTATTGGCAATCGAATCTTCGATATGTAGCCCAAAATAAGTGCCCATAATGCAGAAGATAGAAAACGTGACGTAACACAGAAGCTTGTGCGGCAGACGAACCGTGACCTGCATAAGCGGGATGAACAGGCGCGTTTTACTCATTAGCCACGCAATGACCAGAAACACGCACATCTGCTGAAGCAGCAGCAACACCAGATTAAACTCGTACATACCCGCAAACCACACTTCAATTAAAAGCGCGTAACATACATTGAGTACGAGTAACTTTCTTTGAACTGTTGCATAAAAATATGAATTCGTGAATACGATCACTTAAACGCCGCGCCGCAACCCGCTACTTCGCGTTTTAATGCATAAAAAACAGGCAAAACTTCCTGGTTCCTAAAAGAGCGTCTAAAGTTAAACCGGGACCTCGCGAGCAAGGGTGAAACGATGGCGCTTTACACAATTGGTGAAGTGGCTTTGCTTTGTGATATCAATCCTGTCACGTTGCGCGCGTGGCAGAGACGTTATGGACTTTTAAAACCACAGCGAACGGATGGCGGTCATCGTCTGTTTAACGATGCCGATATCGACAGAATCCGCGAAATCAAGCGCTGGATAGATAACGGCGTCCAGGTCAGCAAAGTCAAAGTGCTGCTCAGTAGCGACAGTAGCGAACAACCTAACGGCTGGCGCGAACAGCAGGAGATCCTGCTGCACTACCTGCAAAGCAGTAATCTGCACAGTTTACGGTTATGGGTCAAAGAACGCGGTCAGGATTATCCTGCCCAAACATTGACCACTAACCTGTTCGTCCCACTGCGGCGACGATTACAGTGCCAACAACCCGCCCTTCAGGCGCTGCTCGGCATTCTTGACGGTATCCTGATCAACTATATTGCGCTCTGCCTGGCGTCTGCGCGTAAGAAACAGGGAAAAGATGCGTTGGTGATCGGCTGGAATATCCATGATACCACCCGCCTGTGGCTGGAAGGTTGGGTCGCCAGCCAACAGGGATGGCGAATCGACGTGTTGGCGCATTCGCTTAGCCAGTTCCGCCCGGAACTTTTTGACGGCAAGACGTTACTGGTATGGTGCGGAGAAAACCAGACGCTGGCGCAGCAGCAGCAACTCCTGGCATGGCGCGCCCAGGGACACGACATTCATCCCCTTGGCGTTTAAACAGCAGCTAACAAATTCGCTTTAATGTATACTCCTTTTATTAACATAAGGAGTACATAATGCGCGTAGCGAAAATCGGGGTGATCGCCCTTTTCCTGCTGATGGCTATTGGCGGTATCGGCGGCGTGATGCTGGCAGGTTACAGTTTTATTTTGCGTGCCGGGTAAGCGCGCGCGTCAGCCTTTCAAACAGGCGATCGATAATGATCGCCGCCAGCGCCACCAGCAGCGCCCCCTGGATAACATAGGCCGTATTAAAGCCGCTAAGCCCGATAATGATCGGCGTGCCTAACGTACTGGCCCCCACCGTTGAAGCGATGGTCGCCGTACCAATATTGATAATCACCGAGGTTCGGATGCCCGCCAGAATCACCGGCGCGGCCAGCGGCAGCTCAACCTGATACAACTGTTGGCGACGGCTCATTCCCATACCGCTGGCAACGCTCATCACGCTGGCAGGCACCGCGCCCAGCCCGGCCAGGGTCGCCTGCAGGATGGGCAACACCCCATACAGGATCAAGGCGATAATGGCCGGTTGCTGACCAAAACCCATGACGGGTACCGCGATCGCCAGTACCGCGACCGGGGGAAAGGTCTGCCCGACGGCGGCGATAGTCTCCACCAGGGGACGAAACTCTTTCCCACTTTCTCGCGTGACCGCCATCCCTGCGCCGACGCCCACCACGACGGCAAACAGACTTGAGATGCCCACCAACCAGAAATGGGCGAGCGCGAGGGCGGCAAAACTCTCCTGTTGGTAGACCGGGCGCGGTAAATCGGGAAACAGCGCGGCGAAGAACGGCTGGCTATAAGGCAATCCAAACAGCAGAAGCAAGAACAGAACAATAAGCCAGAGAAGCGGATCACACAGTCGTTTCACGGGGGGACGTCTCCGAAAGCAGATCGCGGAAATGGAGCGTACCGCAGGGCTCGCCCTGCTGATTCGCCACCGGCAGGACGTCGCACCGACGGGCGACAAACATCGATAGCGCATCGCGTAGCGTCATCTCTTCCACCAGCGCGTCGCCGCTGAGCTGTTCATGCCGACGTACATAATCGCCTACGTTACGTAACGAAAGCAGCCTTACGCCCAGCTCGCTGCGGCCAAAAAATGCCTGCACGAAATCATTTTCCGGCGAGGTCAGCATAGAAAGCGGCGATCCCTGTTGGATAACGTGGCCCCCGTCCATCAGCACCAGATGGTCGGCGAGGCGTAGCGCCTCGTCGATGTCGTGCGTCACCAGTACGATGGTGCGCCCCAGCAGCTGATGAATGCGGGTCATCTCCTGCTGCAATGCGCCGCGCGTTACCGGATCAAGCGCGCCGAAAGGCTCGTCCATCAGCAATACCTGCGGATCGGCAGCCAGCGCCCGCGCAACGCCGACCCGCTGCTGTTGCCCGCCGGAAAGCTGATGCGGATAGCGATCGCGCAGCGCGCTTTCCAGACCCAATAATGCCATCAGTTCGTCAATACGATCGTTAATCCGTGCACGCGACCACTTTTGTAGTTGCGGTACGGTGGCGATATTTTGCGCCACCGTCCAGTGGGGAAAAAGGCCGATAGACTGAATGGCATAGCCCATGCGACGACGCAGTTCAAGCACCGGCAGGCTGCGGATCTCTTCCCCGGCAAAACGGATCGTTCCGCTATCATGCTCTACCAGCCGGTTAATCATCTTCAGAGTGGTCGATTTTCCCGAACCGGAGGTGCCAATTAACACCGAAAAGCTGCCTTCGCTAAAGTGCAAATTGAGGTCGCTAACAGCCTGTTGATCGCCGAAGGTTTTACTAACATGGTTAAATTCAATCAT
Protein sequences of DBSCAN-SWA_7 >NZ_CP019416|2253605:2262776|2261828_2262776_-|WP_000569165.1|DBSCAN-SWA MIEFNHVSKTFGDQQAVSDLNLHFSEGSFSVLIGTSGSGKSTTLKMINRLVEHDSGTIRFAGEEIRSLPVLELRRRMGYAIQSIGLFPHWTVAQNIATVPQLQKWSRARINDRIDELMALLGLESALRDRYPHQLSGGQQQRVGVARALAADPQVLLMDEPFGALDPVTRGALQQEMTRIHQLLGRTIVLVTHDIDEALRLADHLVLMDGGHVIQQGSPLSMLTSPENDFVQAFFGRSELGVRLLSLRNVGDYVRRHEQLSGDALVEEMTLRDALSMFVARRCDVLPVANQQGEPCGTLHFRDLLSETSPRETTV >NZ_CP019416|2253605:2262776|2261113_2261845_-|WP_000824857.1|DBSCAN-SWA MKRLCDPLLWLIVLFLLLLFGLPYSQPFFAALFPDLPRPVYQQESFAALALAHFWLVGISSLFAVVVGVGAGMAVTRESGKEFRPLVETIAAVGQTFPPVAVLAIAVPVMGFGQQPAIIALILYGVLPILQATLAGLGAVPASVMSVASGMGMSRRQQLYQVELPLAAPVILAGIRTSVIINIGTATIASTVGASTLGTPIIIGLSGFNTAYVIQGALLVALAAIIIDRLFERLTRALTRHAK >NZ_CP019416|2253605:2262776|2260234_2260966_+|WP_001240417.1|DBSCAN-SWA MALYTIGEVALLCDINPVTLRAWQRRYGLLKPQRTDGGHRLFNDADIDRIREIKRWIDNGVQVSKVKVLLSSDSSEQPNGWREQQEILLHYLQSSNLHSLRLWVKERGQDYPAQTLTTNLFVPLRRRLQCQQPALQALLGILDGILINYIALCLASARKKQGKDALVIGWNIHDTTRLWLEGWVASQQGWRIDVLAHSLSQFRPELFDGKTLLVWCGENQTLAQQQQLLAWRAQGHDIHPLGV >NZ_CP019416|2253605:2262776|2257610_2258330_-|WP_000598637.1|DBSCAN-SWA MIKVLIVDDEPLARENLRILLQGQDDIEIVGECANAVEAIGAVHKLRPDVLFLDIQMPRISGLEMVGMLDPEHRPYIVFLTAFDEYAIKAFEEHAFDYLLKPIEEKRLEKTLHRLRQERSKQDVSLLPENQQALKFIPCTGHSRIYLLQMDDVAFVSSRMSGVYVTSSEGKEGFTELTLRTLESRTPLLRCHRQFLVNMAHLQEIRLEDNGQAELILRNGLTVPVSRRYLKSLKEAIGL >NZ_CP019416|2253605:2262776|2255879_2256338_+|WP_000703137.1|DBSCAN-SWA MKISGKLLSAALASVLVFSLAGCGDKEESKTFNANLAGTEISITYTYKGDKIIKQTSESKISYATVGAKTKEDAAKILDPLSAKYKNIAGVEEKLTYEDTYAQENVSVDMEKVDFKALQQISGTMVSGDTSKGISMKQTQTLLEAAGFKEAK >NZ_CP019416|2253605:2262776|2257096_2257564_-|WP_000950413.1|DBSCAN-SWA MLSNDILRSVRYILKANNTDLARILALGNVDATPEQIAIWLRKEEEEGFQRCPDIVLSSFLNGLIYEKRGKDEAAPALTAERRINNNIVLKKLRIAFSLKTDDILAILTGQLFRVSMPEITAMMRAPDHKNFRECGDQFMRYFLRGLAAREHAAK >NZ_CP019416|2253605:2262776|2258326_2260012_-|WP_000272850.1|DBSCAN-SWA MYEFNLVLLLLQQMCVFLVIAWLMSKTRLFIPLMQVTVRLPHKLLCYVTFSIFCIMGTYFGLHIEDSIANTRAIGAVMGGLLGGPVVGGLVGLTGGLHRYSMGGMTALSCMISTIVEGLLGGLVHSVLIRRGRPDKVFSPLTAGAITCVAELVQMLIILLIARPFDDALHLVSNIAAPMMVTNTVGAALFMRILLDKRAMFEKYTSAFSATALKVAASTEGILRQGFNEVNSMKVAQVLYQELDIGAVAITDREKLLAFTGIGDDHHLPGKPISSGYTLKAIETGEVVYADGNEVPYRCSLHPQCKLGSTLVIPLRGENQRVMGTIKLYEAKNRLFSSINRTLGEGIAQLLSAQILAGQYERQKALLTQSEIKLLHAQVNPHFLFNALNTIKAVIRRDSEQASQLVQYLSTFFRKNLKRPSEIVTLADEIEHVNAYLQIEKARFQSRLQVQLDVPSTLSRQKLPAFTLQPIVENAIKHGTSQLLDTGNVAIRARREGQHLMLDIEDNAGLYQPSAGSSGLGMSLVDKRLREHFGDDYGISVACEPDCFTRITLRLPLEEDA >NZ_CP019416|2253605:2262776|2261025_2261133_+|WP_001261696.1|DBSCAN-SWA MRVAKIGVIALFLLMAIGGIGGVMLAGYSFILRAG >NZ_CP019416|2253605:2262776|2253605_2255639_+|WP_000195330.1|tRNA|DBSCAN-SWA MTQVAKKILVTCALPYANGSIHLGHMLEHIQADVWVRYQRMRGHEVNFICADDAHGTPIMLKAQQLGITPEQMIGEMSQEHQTDFAGFNISYDNYHSTHSDENRELSELIYTRLKENGFIKNRTISQLYDPEKGMFLPDRFVKGTCPKCKSADQYGDNCEVCGATYSPTELIEPKSVVSGATPVMRDSEHFFFDLPSFSEMLQAWTRSGALQEQVANKMQEWFESGLQQWDISRDAPYFGFEIPNAPGKYFYVWLDAPIGYMGSFKNLCDKRGDTTSFDEYWKKDSDAELYHFIGKDIVYFHSLFWPAMLEGSHFRKPTNLFVHGYVTVNGAKMSKSRGTFIKASTWLKHFDADSLRYYYTAKLSSRIDDIDLNLEDFVQRVNADIVNKVVNLASRNAGFINKRFDGVLAAELADPQLYKTFTDAAAVIGEAWESREFGKAIREIMALADIANRYVDEQAPWVVAKQEGRDADLQAICSMGINLFRVLMTYLKPVLPTLSERVEAFLNSELNWDAIEQPLLGHKVNTFKALYNRIDMKQVEALVEASKEEVKAAAAPVTGPLADFPIQETITFDDFAKIDLRVALIENAEFVDGSDKLLRLTLDLGGEKRNVFSGIRSAYPDPQALIGRQTVMVANLAPRKMRFGVSEGMVMAAGPGGKDIFLLSPDDGAKPGQQVK >NZ_CP019416|2253605:2262776|2256509_2257040_+|WP_001197951.1|DBSCAN-SWA MQVLRLMALPLFALSLSVSITGCDQKNDTLQGKQNNMTAFIKKIAASKESEETQRYVGNLNGIEIKLTYYYKGDIVLRQISEHKLLYKTLKANNKEEAQKMLSQVGEAYQGMPGLTERIDYYDSYATEYVDIDFTQAKISDLCKLPGSSIDNCSAYYLSMIRSQKLLEESGYHRIN |
10 | Enterobacteria_phage(66.67%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_8 |
2504854 : 2510907
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >NZ_CP019416|2504854:2510907|DBSCAN-SWA CATGGATAGCCTTAACGACGATAAAATTAACCGGCAAAGCAGTGACCTGGAAGTTGAAAGCGAAGAAAAACAAAGCGGTAAAGAGATTGAAGTGGATGAAGATCGTCTTCCTTCCCGCGCCATGGCGATTCATGAACATATTCGCCAGGATGGTGAAAAAGAGATGGAACGCGATGCGATGGCTTTGCTCTGGTCAGCCATTGCCGCAGGACTTTCTATGGGGGCATCACTCCTGGCGAAAGGGATTTTCCACGTGCAGCTTGAAGGCGTTCCCGGCGGCTTTTTACTGGAAAATCTCGGCTATACCTTTGGTTTTATCATTGTCATCATGGCCCGCCAGCAATTATTTACTGAAAATACCGTTACCGCCGTATTGCCGGTAATGCAAAATCCCACTCTGAGTAACGTTGGCCTGCTGATGCGCTTATGGGGCGTAGTCTTATTGGGCAACCTTATTGGCACCGGGGTTGCGGCGTGGGCATTTGAATATATGCCTATATTTGATGAAGAGACCCGCGACGCCTTTGTCAAAATTGGTATGGAGGTCATGAAAAATAGCCCAACGGAGATGTTTGCCAACGCGATTATCTCTGGCTGGATCATCGCCACAATGGTATGGATGTTTCCTGCTGCAGGCGGGGCAAAAATTGTGGTCATTATTTTGATGACCTGGCTTATCGCGCTGGGCGATACCACCCATATTGTCGTCGGTTCCGTTGAAATTTTGTATTTGGTTTTCAACGGCACGCTGCCCTGGAGCGACTTTTTCTGGCCCTTCGCCCTTCCCACACTTGCCGGAAATATCTGCGGCGGCACCTTTATTTTCGCATTAATGAGCCACGCGCAGATCCGCAACGATATGAGCAATAAGCGTAAGGAAGAAGCGAGGCTACGCGGCGAGCGCCTGGAACGGGAGCGGAAAAAAGCGGAAAAACAGCGCTGAGTGGCGACGGTTTAACCAGTCAGGCGGCGCTACACTTACTCCGGCGAATAAAAAACGCTATACTGGCGCCGCGTTGTCCCCTTAGTTAAATGGATATAACGAGCCCCTCCTAAGGGCTAGTTGCAGGTTCGATTCCTGCAGGGGACACCATGACCTCTTTTACCCACCTCTCACAAATCTCAGCAACCCCACACCACACAAGGCGTTGCAGCGTTTTTGATGTCTTATGATTTCCAGCGAATAGTGCCTATAAACATAACGAAAAGCATCCCTTTAAAGTACCCTCTCGCTGTTAGAACTTAACCAAAAGTTAGCACCTTCACTTTCCGACAGATATAGGCCCAGAACTTCTTTTTTGCCTTCCAGATTCAGTGCCAGAACGGTATAAACCGCCTTGCTCTGATAACGGCCATCCTCACGGTTTTTATAATGAATAGCGTCCAGCCAGACGACGGGATAAACCTTCTCCAGCGGGCGCTGTTGCCACTGTTTTAGTTCAGGGATAACTTTATCGGTACTGCACTGACGGTGGCGGTTGAAACGCTGAAGGCATAAAGATCTTCACTCTCCCGGCCGATGTCCTGATAACTCATCTCCAGCGCAAACAGTCGAATGATATTGCGCTCGATCTCGTCGTACAGGGGGTCTGAAGCTTCTTCACCAGTTATGGCTCAAAAGTGCCGTTACGATTGCGCGGAGTTGCCAGTTCAAAACTGCCTGTTGGGGCTTTAATGGCTTTTTGCCGGAACCATTTTTACGGTTTGCCTCAACATCCTGAGCCAGATGGGAATCAAGTTCAGCAGACAGGGTAGACTCGGTTAAATACTTGATTAATGGCGTTAAGATGCCATCTTTGCCCGTTAATGCCTGGCCGGACCTGAAGGGCTTTAAGTGCTTTGTCGAAATCGAAGGGATGGGACATGTGCCATTCTTTTTTATTTTATGTTACTAAAATTATACAGAATTTTTAACGCTCCCCCTCCCCCCAGCACTTCCACCCTTTCAAGTACCTCTCCCTGAAAAAAAATGCAAAGCCTTGTAAGACGATGCAAAGCTTTACATGTCCCGTTTTTATTCCAAGACGCTTGGCAATCAGCAATACCAATTGATCGATAACATCGATCAATATATTAAAACTCAATAGTTTAAAACTATTAAAAATACAATTATTGATCGCTTATATCGATCAAACCAATTTGTAGTGCTACACTCCAGACCTTTCTGAATCTGATAATTTTCATAATGTTGAAGTTATTCGCTAAGTACACATCGATCGGTGTTCTTAACATGCTCATTCATTGGAGAGTATTTGCTTTCTGTATGTATGGGATGCATACGCATCAGGCGCTGACGAACTTTTCCGATTTTGTTATCGCTGTATCGTTCAGCTTCTATGCTAATGCGCGCTTCACCTTTAATGCCAGCACTACTGCAATTCGCTACATGATGTATATGGGATTCATGGGAGCACTGAGCGCTGTTGTTGGATGGATGGCTGACCAATGTTCTTTGCCACCATTGGTTACCCTCATCACTTTCTCGGCAATTAGCCTGGTATGCGGCTTTATCTATTCCAGATTCTTTATCTTCAGGGATAAAAATGAAAATCTCTCTTGTCGTCCTAGTTTTTAACGAAGAAGACACGATACCGATTTTCTATAGAACGGTACATGAGTTTAATGAACTTGAAAAATATAAAGTTGAGATTATTTTTATTAATGACGGAAGTAAAGATGTGACAGAGTCAATAATTAAAATAATAGCCGTATCTGATCCACTCGTCATTCCGTTTTCGTTTACACGAAACTTCGGGAATGATGCAAGATGCACAACCATTTTATTAATCTTTTTTTAAAATTGAGGTAATTTAAGTTGGAACACTTAAAATACAGACCTGATATAGATGGATTACGCGCAATAGCGGTTTTATCTGTGGTAATATTCCATTATTTCCCATCATTATTGCCGGGTGGGTTTGTTGGAGTAGATATATTCTTTGTGATATCTGGATACCTTATAACATCAATAATATTAAAATCTGCATCAAACAAATCATTTTCATACCTTGATTTCTATAAAAGGAGAGTGCTTAGAATATTTCCAGCATTATCCATAGTTCTTGTATCATGTCTTATTGTTGGCTGGGTTTATTTATTCCAGGATGATTACAAATTACTTGGTAAGCATGTTTTTAGCGGCTCATTCTTTATATCAAACTTTACTCTTTGGAGTGAGTCTGGCTATTTTGATTCAAAGTCATACCTTAAACCTTTACTACATTTGTGGTCACTGGGAATTGAAGAGCAATTTTATATAATATGGCCAGTAGTTATATTGCTATGCTTTAGAAGCAAAAACCATAACAGAAACATAGTATTATCATGCGCAACTATATTTATAATTAGCTATGCGATTAGCATTTTTACAATGGCATCTGATGGCGGAGCTAATTACTACTCTCCCGCATCAAGATTTTGGGAGTTAATGGCTGGGGCGATTATATCCACATTGAGATTTATAGGAATAAACACTTCGTTATCAAAATTAATGTCCCTGTTAGGAATTATACTAATCGCATTATCAATAACCATGATAGATGAAAAGATGTCATTTCCTGGATATATAGCAATAATCCCAGTACTTGGCGCCTCTCTTATAATAGCATCTAATGGTAATGATTTAGTTGTGTCGAAATTGCTTAGTGTTAGGCCTGTTGTTTTCTTTGGTCTTATTAGCTATCCTCTTTATTTGTGGCATTGGCCTATTTATTCATTCTATCGTTCAATATTTGCTGGTTCACCAGACTACCATGAATTAATTCTTCTTTTATTATCATCGTTCTTTTTGGCGATATTAACTTATTATTTAATTGAAAAACCACTGAGAAATGCCAGAAATAAATATATCACAGCAATATTATTAGCATTATCAGTATTTGGGACAGGTTTAATTGGCGCATTTATTTTTCATATAAATGGAGTTAAAGACAGGGAAATCAATAAATCAGCAGGTGAATATGCTTCTGTTACTGACGTGTACAATTATTATAAATATGGAGAACTACTCCGTGGAGGGATATGCCACTCAGTACAACTTACTGCTGCCATATCCAATGGATGTATAAAAAATGGCAAGCATAATATATTTATCATTGGTGATTCTTATGCGGCGGCTCTTTTCAATGGACTTTCTCATTATATAGATAATAAAGGTTCTGATTATATAATAAGCCAAATGACAGATGGTAACGCTCCTCCTCTATTTGTTGACGGTAAAGATGATTTACAGAGAAGTGTCATCACTCTAAACAATAATAGAATTAATGAAATTAAACGTGTTCAGCCTGAGGTGGTTCTGCTGACATGGTCAGTTCGAGGAACAAATGGAGTACATGATAAAAAGTTAGCAATTGATACGTTATCATTAACCATAAAAAAAATTAAAGAGGCATCCCCTGACTCAAGGATTATTTTCATTGGACCAGTCCCGGAATGGAATGCAAATTTAGTTAAAATAATATCTAACTACCTGAGTGAGTTTAAAAAAACCCCACCATTGTATATGACATATGGATTAAATAGTGAAATAAGCGAGTGGGACTCTTACTTTAGTAACAATGTTCCAAAAATGGGAATTGAATATATATCAGCATACAAAGCATTATGTAACGAAAGTGGATGTCTTACAAGAGTTGGTAATGGTCCTGATTTTATCACTGCCGTTGATTGGGGACATTTAACAAAGCCTGGTTCTGATTTCCTTTTTAATAAAATCGGAAATAAAATAATCAAATAGATAGGCTGTTACTATTACATATAAATCCAATATAGAACCTGCCAGTCATACTGTGTAACTGCCACTATATTAACGGTGATCGCTCAGGCGGTCACCGAACTCGATAATAAAGCGACTCATTGCCAGCGACCAGTCCTGGATCGGCATACTCCATTTTTTGACGCCCCTTTGACCAACTCTTTTACTGCGGCGGTGATCTTAAGTATCAGCATGGGCGACACATCAGCGTTGTACATCTCTTTGAAGGTGGCGATGATTTCGCGGGTAGTCATGTCATGCCTTTGGCATAAAGGGATAAATCTGACTGTCCATCTCGTAATACGCGTCTGATGCTCCTTAATCAGCTGCAGTTCAAAGGTATTTTCACGGTCACGCAGCATGTTCAGCACTATCTCGCCGCCATCATACAGCACCGTTTTTAATGAGTAGCCATTGTGGGAGTTTGAGCTCGTTTTAGGAGCATTTTTCTCATGCCCGAGATGGTCAGCCAGCTCAGCATTGAGCGCCGTTTCGACGGTTAATGTCGTCAGCATATGGGAAAACTGACTGAGGTCGGCTTCAGTTTTAAGACCTTTAGCCAGTTCAGCCGCAAGAGCTGTGAGTTTCTTTTCGTCCATAATTGCCTTTCTCCTTTGCTGGAGTGAATATATAATAATCAGACAATTACATAATTTTTAATTACAATCTCCCAATACTTATATTTTACCCCCATATTCATTACACGTTCAAAGTGTTGTTAACGTTAATCTAACTAATTATCCATTAGGTATTTCCCCTAAAGACCTTAACGAAGCCGAATTATTTTTGCATGTGACACAAAACATGAGTAATCTTTTATAAATTTATCTGACACGATATTAGAACTAACGACATCTCAACTACCATACAATCTTTATAGTTAACTATTAAGGATGCGGGGTCTGACAGATTACCAGATGGTGTTTGTTTTATGTCAATGAGTCTCAAACCATTAATAATGCAACCACTTTCTACAAACAATTAGTTAGTGTTAAAGTTTTAGTATCGATTACAGAAATATATGTGAATGCCTTCCCATATGTGTACCATAGATCACCAGACCCTGCGCAATCTTGCATAAGAATGTTCGATACATAACTTCCCCAGCCATCCATACCAAGACCAACACTCTATGAACCAATGGCAACGAGACCATCGACTAAATGATTTGTTGGTAGTTGGTAGTTGGTAGTTGGTAGTTGGTGTACCGGATATTCTGATAGCGAATAGTTGCCTGGTCTATCTTTTTCATTTCCGATATCTGCCCAAAGGTTGAAGCAGTCCCACAAAAAGTATAT
Protein sequences of DBSCAN-SWA_8 >NZ_CP019416|2504854:2510907|2507396_2507651_+|WP_000703599.1|DBSCAN-SWA MKISLVVLVFNEEDTIPIFYRTVHEFNELEKYKVEIIFINDGSKDVTESIIKIIAVSDPLVIPFSFTRNFGNDARCTTILLIFF >NZ_CP019416|2504854:2510907|2510739_2510907_-|WP_105789229.1|DBSCAN-SWA MYFLWDCFNLWADIGNEKDRPGNYSLSEYPVHQLPTTNYQLPTNHLVDGLVAIGS >NZ_CP019416|2504854:2510907|2507038_2507428_+|WP_001576268.1|DBSCAN-SWA MLKLFAKYTSIGVLNMLIHWRVFAFCMYGMHTHQALTNFSDFVIAVSFSFYANARFTFNASTTAIRYMMYMGFMGALSAVVGWMADQCSLPPLVTLITFSAISLVCGFIYSRFFIFRDKNENLSCRPSF >NZ_CP019416|2504854:2510907|2510580_2510724_-|WP_105789228.1|DBSCAN-SWA MDGWGSYVSNILMQDCAGSGDLWYTYGKAFTYISVIDTKTLTLTNCL >NZ_CP019416|2504854:2510907|2504854_2505796_+|WP_000377777.1|DBSCAN-SWA MDSLNDDKINRQSSDLEVESEEKQSGKEIEVDEDRLPSRAMAIHEHIRQDGEKEMERDAMALLWSAIAAGLSMGASLLAKGIFHVQLEGVPGGFLLENLGYTFGFIIVIMARQQLFTENTVTAVLPVMQNPTLSNVGLLMRLWGVVLLGNLIGTGVAAWAFEYMPIFDEETRDAFVKIGMEVMKNSPTEMFANAIISGWIIATMVWMFPAAGGAKIVVIILMTWLIALGDTTHIVVGSVEILYLVFNGTLPWSDFFWPFALPTLAGNICGGTFIFALMSHAQIRNDMSNKRKEEARLRGERLERERKKAEKQR >NZ_CP019416|2504854:2510907|2507668_2509591_+|WP_000400616.1|DBSCAN-SWA MEHLKYRPDIDGLRAIAVLSVVIFHYFPSLLPGGFVGVDIFFVISGYLITSIILKSASNKSFSYLDFYKRRVLRIFPALSIVLVSCLIVGWVYLFQDDYKLLGKHVFSGSFFISNFTLWSESGYFDSKSYLKPLLHLWSLGIEEQFYIIWPVVILLCFRSKNHNRNIVLSCATIFIISYAISIFTMASDGGANYYSPASRFWELMAGAIISTLRFIGINTSLSKLMSLLGIILIALSITMIDEKMSFPGYIAIIPVLGASLIIASNGNDLVVSKLLSVRPVVFFGLISYPLYLWHWPIYSFYRSIFAGSPDYHELILLLLSSFFLAILTYYLIEKPLRNARNKYITAILLALSVFGTGLIGAFIFHINGVKDREINKSAGEYASVTDVYNYYKYGELLRGGICHSVQLTAAISNGCIKNGKHNIFIIGDSYAAALFNGLSHYIDNKGSDYIISQMTDGNAPPLFVDGKDDLQRSVITLNNNRINEIKRVQPEVVLLTWSVRGTNGVHDKKLAIDTLSLTIKKIKEASPDSRIIFIGPVPEWNANLVKIISNYLSEFKKTPPLYMTYGLNSEISEWDSYFSNNVPKMGIEYISAYKALCNESGCLTRVGNGPDFITAVDWGHLTKPGSDFLFNKIGNKIIK |
6 | Salmonella_virus(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|