Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NC_016785 | Corynebacterium diphtheriae CDCE 8392, complete sequence | 2 crisprs | cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,csa3,cas3,WYL,cas4,DinG | 0 | 0 | 1 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NC_016785_1 | 39026-39783 | TypeI-E |
I-E
Consensus repeat of NC_016785_1
|
12 spacers
spacers of NC_016785_1
>1.1|39054|33|NC_016785|CRISPRCasFinder,CRT GCCACCAATCAGCCCTACGAATCGGCAAACACA >1.2|39115|33|NC_016785|CRISPRCasFinder,CRT,PILER-CR CTCGGAGGAGCGGGCACCGACGTTACGGAGCTT >1.3|39176|33|NC_016785|CRISPRCasFinder,CRT,PILER-CR CTTAAAATCGGCTCGCTATTTCTCCGGGTACGA >1.4|39237|33|NC_016785|CRISPRCasFinder,CRT,PILER-CR GATTGATAAGACAGCGCTAGACCTGCCTGCACC >1.5|39298|33|NC_016785|CRISPRCasFinder,CRT,PILER-CR CAAGCAGAAGAGCGCGACGGCAATGTGGTTCCC >1.6|39359|33|NC_016785|CRISPRCasFinder,CRT,PILER-CR GCGTGTTTGATGGAGTGCAGTTCTGGGAATTAG >1.7|39420|33|NC_016785|CRISPRCasFinder,CRT,PILER-CR CCCGCACCAATCGTTTGTGGATCGGTATCCAAC >1.8|39481|33|NC_016785|CRISPRCasFinder,CRT,PILER-CR GCACCTGTATAAGTCATTATTTATTTCCCTTTC >1.9|39542|33|NC_016785|CRISPRCasFinder,CRT,PILER-CR CATCGGCCGGTAGTGGGTCGCCTTCCGCGCCCC >1.10|39603|33|NC_016785|CRISPRCasFinder,CRT,PILER-CR CCAAACCCATTCCCACCAGAATTACGCTCATTC >1.11|39664|32|NC_016785|CRISPRCasFinder,CRT,PILER-CR GACGACCGTGGGAACCGCGAGGATAAATAGCA >1.12|39724|32|NC_016785|CRISPRCasFinder,CRT,PILER-CR GACGACCGTGGGAACCGCGAGGATAAATAGCA |
cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2 |
CRISPR arrays and Neighbor proteins around NC_016785_1
The CRISPR arrays of NC_016785_1 >merge|NC_016785|1|39026-39783|CRISPRCasFinder,CRT,PILER-CR GTTTTCCCCGCACCAGCGGGGATGAGCCGCCACCAATCAGCCCTACGAATCGGCAAACACAGTTTTCCCCGCACCAGCGGGGATGAGCCCTCGGAGGAGCGGGCACCGACGTTACGGAGCTTGTTTTCCCCGCACCAGCGGGGATGAGCCCTTAAAATCGGCTCGCTATTTCTCCGGGTACGAGTTTTCCCCGCACCAGCGGGGATGAGCCGATTGATAAGACAGCGCTAGACCTGCCTGCACCGTTTTCCCCGCACCAGCGGGGATGAGCCCAAGCAGAAGAGCGCGACGGCAATGTGGTTCCCGTTTTCCCCGCACCAGCGGGGATGAGCCGCGTGTTTGATGGAGTGCAGTTCTGGGAATTAGGTTTTCCCCGCACCAGCGGGGATGAGCCCCCGCACCAATCGTTTGTGGATCGGTATCCAACGTTTTCCCCGCACCAGCGGGGATGAGCCGCACCTGTATAAGTCATTATTTATTTCCCTTTCGTTTTCCCCGCACCAGCGGGGATGAGCCCATCGGCCGGTAGTGGGTCGCCTTCCGCGCCCCGTTTTCCCCGCACCAGCGGGGAGGAGCCCCAAACCCATTCCCACCAGAATTACGCTCATTCGTTTTCCCCGCACCAGCGGGGATGAGCCGACGACCGTGGGAACCGCGAGGATAAATAGCAGGTTTTCCCGCACCAGCGGGGATGAGCCGACGACCGTGGGAACCGCGAGGATAAATAGCAGGTTTTCCCGCACCAGCGGGGATGAGCC >NC_016785|1|1|39026-39783|CRISPRCasFinder GTTTTCCCCGCACCAGCGGGGATGAGCC GCCACCAATCAGCCCTACGAATCGGCAAACACA GTTTTCCCCGCACCAGCGGGGATGAGCC CTCGGAGGAGCGGGCACCGACGTTACGGAGCTT GTTTTCCCCGCACCAGCGGGGATGAGCC CTTAAAATCGGCTCGCTATTTCTCCGGGTACGA GTTTTCCCCGCACCAGCGGGGATGAGCC GATTGATAAGACAGCGCTAGACCTGCCTGCACC GTTTTCCCCGCACCAGCGGGGATGAGCC CAAGCAGAAGAGCGCGACGGCAATGTGGTTCCC GTTTTCCCCGCACCAGCGGGGATGAGCC GCGTGTTTGATGGAGTGCAGTTCTGGGAATTAG GTTTTCCCCGCACCAGCGGGGATGAGCC CCCGCACCAATCGTTTGTGGATCGGTATCCAAC GTTTTCCCCGCACCAGCGGGGATGAGCC GCACCTGTATAAGTCATTATTTATTTCCCTTTC GTTTTCCCCGCACCAGCGGGGATGAGCC CATCGGCCGGTAGTGGGTCGCCTTCCGCGCCCC GTTTTCCCCGCACCAGCGGGGAGGAGCC CCAAACCCATTCCCACCAGAATTACGCTCATTC GTTTTCCCCGCACCAGCGGGGATGAGCC GACGACCGTGGGAACCGCGAGGATAAATAGCA GGTTTTCCCGCACCAGCGGGGATGAGCC GACGACCGTGGGAACCGCGAGGATAAATAGCA GGTTTTCCCGCACCAGCGGGGATGAGCC >NC_016785|1|1|39026-39783|CRT GTTTTCCCCGCACCAGCGGGGATGAGCC GCCACCAATCAGCCCTACGAATCGGCAAACACA GTTTTCCCCGCACCAGCGGGGATGAGCC CTCGGAGGAGCGGGCACCGACGTTACGGAGCTT GTTTTCCCCGCACCAGCGGGGATGAGCC CTTAAAATCGGCTCGCTATTTCTCCGGGTACGA GTTTTCCCCGCACCAGCGGGGATGAGCC GATTGATAAGACAGCGCTAGACCTGCCTGCACC GTTTTCCCCGCACCAGCGGGGATGAGCC 
CAAGCAGAAGAGCGCGACGGCAATGTGGTTCCC GTTTTCCCCGCACCAGCGGGGATGAGCC GCGTGTTTGATGGAGTGCAGTTCTGGGAATTAG GTTTTCCCCGCACCAGCGGGGATGAGCC CCCGCACCAATCGTTTGTGGATCGGTATCCAAC GTTTTCCCCGCACCAGCGGGGATGAGCC GCACCTGTATAAGTCATTATTTATTTCCCTTTC GTTTTCCCCGCACCAGCGGGGATGAGCC CATCGGCCGGTAGTGGGTCGCCTTCCGCGCCCC GTTTTCCCCGCACCAGCGGGGAGGAGCC CCAAACCCATTCCCACCAGAATTACGCTCATTC GTTTTCCCCGCACCAGCGGGGATGAGCC GACGACCGTGGGAACCGCGAGGATAAATAGCA GGTTTTCCCGCACCAGCGGGGATGAGCC GACGACCGTGGGAACCGCGAGGATAAATAGCA GGTTTTCCCGCACCAGCGGGGATGAGCC >NC_016785|1|1|39087-39783|PILER-CR GTTTTCCCCGCACCAGCGGGGATGAGCC CTCGGAGGAGCGGGCACCGACGTTACGGAGCTT GTTTTCCCCGCACCAGCGGGGATGAGCC CTTAAAATCGGCTCGCTATTTCTCCGGGTACGA GTTTTCCCCGCACCAGCGGGGATGAGCC GATTGATAAGACAGCGCTAGACCTGCCTGCACC GTTTTCCCCGCACCAGCGGGGATGAGCC CAAGCAGAAGAGCGCGACGGCAATGTGGTTCCC GTTTTCCCCGCACCAGCGGGGATGAGCC GCGTGTTTGATGGAGTGCAGTTCTGGGAATTAG GTTTTCCCCGCACCAGCGGGGATGAGCC CCCGCACCAATCGTTTGTGGATCGGTATCCAAC GTTTTCCCCGCACCAGCGGGGATGAGCC GCACCTGTATAAGTCATTATTTATTTCCCTTTC GTTTTCCCCGCACCAGCGGGGATGAGCC CATCGGCCGGTAGTGGGTCGCCTTCCGCGCCCC GTTTTCCCCGCACCAGCGGGGAGGAGCC CCAAACCCATTCCCACCAGAATTACGCTCATTC GTTTTCCCCGCACCAGCGGGGATGAGCC GACGACCGTGGGAACCGCGAGGATAAATAGCA GGTTTTCCCGCACCAGCGGGGATGAGCC GACGACCGTGGGAACCGCGAGGATAAATAGCA GGTTTTCCCGCACCAGCGGGGATGAGCC
>NC_016785.1|WP_014306231.1|35987_38789_+|CRISPR-associated-helicase/endonuclease-Cas3 MSVVDQADQWRRARSVQAYSLWAKSGSEDLYLRLPQHLIDAACVAEWLWNNWVSDSLKSTLSAAWRLPAEEVGRLYTFYAGTHDVGKATISFQRLVEKTSHGNYLLGPVREAGLSLQWTLNEGEGKKFPHGMASALIIAAWLEKHDIDPSSAARLSFIADAHHGFASDEELYRSHEDTLDYYPPEWLVVHAEILDSMAEITDIGETLEELADQSTPSAPAMQIMTGLVIMADWIASDEKAFPYVCDSSQHDRVVEGMSHVNLPPAWVPTDVPDNVETLFRDTFTWPDSYQVRPVQRAAVAVARAVQDPTLIIIEAPTGEGKTEAGLATSHILGQKTGAQGIFFAAPTMSTANGLFERTKNWAQCTSSRGEVASLYLAHSKNKLSLPFQSLRFTSIGEDDHLEKHGSVVASQWLSGRHRGILSDFVVGTVDQVLMMALQVRFSMLRHVGLAGKIVIIDEVHAYDAYMSQYLYLTLQWLAKYGVSVILMSATLPPQQRARLVNAYASQVCKKADASALNSDAYPLITAVNKKGISVTEVPQENSDTTIKIRCIDDSLPALGGMFSDLLVDGGIALVICNTIRRAQQAYDSLKAIFPDEVELHHAAFIATQRSEKEDALRESLGPHASRGEDRPWRRIVVATQVAEQSLDIDADVLVTDIAPIDLIIQRAGRLHRHERPHSDRPEILGQPQIFIRGINNEEISGEVPEFDGGAAAIYGEKILYATVAYLPDEFHRPSDVPKLVKNVYSDTPCIPENWKEQWEQACVKAKENYEKSVRKAQTFSFPQPHMARTLRDLFKQQHSNSVDKNEESGSAQVRDAEFSIEVVALLKNEYGYHPFGRKEEIENGRELTWKEAEALAGNTVRLPARMTRRDSDFNAVIDSLEAQTPPEWQRSGLLKGQVALLFDERGEARVGRFLVRYTNERGLEVEVCPKEDA >NC_016785.1|WP_155760674.1|35292_35463_+|hypothetical-protein MPSSIPSAAPQSTVIVTVIEAESVEPIEPHMRAAWKAMSKRRKDTLAYIRHTINAL >NC_016785.1|WP_014306229.1|34354_35170_+|bifunctional-hydroxymethylpyrimidine-kinase/phosphomethylpyrimidine-kinase MTPHILTIAGSDPSGGAGIQADLKSIMAAGGYGMAAITALTAQNTCGVTAIHTPPTEFLSQQLRAISDDITIHAIKIGMIGSSDAATAIATWLDQLHHTPIVVLDPVMVATSGSVLGERHYFEPLLHHATVITPNLPELAVLANNHDPEQAEHVARSLAEQYDCAVLLKGGHRHGTNDLGNTWITASGPQFHAPSPRIHTTNTHGTGCSLSSALATRLAIEPPEPALHWATTWLNGAIAHGSDLNVGHGNGPVDHSYRLGEYSANCNTIES >NC_016785.1|WP_014306228.1|33347_34358_+|ThiF-family-adenylyltransferase MLDELERQRVARQLRLPGFGIEQQEKLNKGRVLVIGAGGLGSPALQSLAAAGVGSIRLVDNDTVDVSNIQRQILFGVGDVGRSKVHVAAERLRAIQPGIRIDARTERLTAHNAHELAEGCDVILDGSDTFATKFLCGDLAEITGIPLVWGSVLQFEGHMGVFTREVGLRDLFPEAPTQGLNCADAGVLGATTAVIANLMATETIKILAGIGTVQPGAVTTYNALTSTFRTYTVGRDPLRSAARTLYTWTLPNEYELIDVREPHEIEHTPSGAHITLPQSMWNDTTAIQHALDNITTDNVVVVCASGIRSAAFIEQFAHLNPHLTFHNVPSGINELP 
>NC_016785.1|WP_014306227.1|32562_33348_+|thiazole-synthase MLTIADRSFQSHLIMGTGGASSFDTLEKSLIASGTELTTVAMRRHAAHTGAHGESVFELMQRLNITPLPNTAGCRTARDAILTAQLAREALKTSWIKVEVIADDTTLLPDVLELIDATETLTNDGFTVLAYTSDDPVVAQRLEDAGAAAVMPLGSPIGTGLGILNPHNIELICSRATVPVLLDAGIGTASDATLAMELGCSGVLLASAINRCINPITMATAMKHAVEAGRLAREAGRIPRREHAVASSSFEGLASWADEVL >NC_016785.1|WP_014302657.1|32360_32561_+|sulfur-carrier-protein-ThiS MDIYINDTLTTIESPQLTEIISNHCNGIRPGIAVAINQRVIPRSQWDTTTVTAGDHLDILTAVQGG >NC_016785.1|WP_014306226.1|31288_32377_+|glycine-oxidase-ThiO MKIAVVGGGIVGLSTAFELSTRGYNVHVFDPNPASGASHFAGGMLAPAAEVQFQQDPLFPLMKRAGKLWPDMVRWVAQHTNLPTGYRTEGTLVVAADRADAEHLKQLRATQEAAGMDVRPIATRQARGLEPALGPRLSAAVHIPNDTQVAPRVFLTALLDALDDCGVEVTKEKITDLEPLYQQFDVVVLAAGLGAQHLSPIPLALRPVRGDILRVQTEPGAVNMVVRGWVNDRPIYIIPRANGEIAIGATSREDERDLPSVEGIYDLLRDAIRVVPGIVDSSLIEANVGVRPGTPDDLPYLGWASDRLIISTGYFRHGILLSSLGAHVTACLIDGTDPGIDLTACAPDRHHNERGTTHGHLH >NC_016785.1|WP_014306225.1|30623_31292_+|thiamine-phosphate-synthase MLPTPRWGRDFDPRCYFVTGTGSVDHIVDVARQAARAGAGLIQVRSKPIAARDLYILGREVARAVAEVNPRTRVLIDDRVDVALALMNNGEHIHGVHVGQDDLPVRHVRALLGDNAIIGLTTGTLELVRASRQVAEVIDYIGAGPFRPTPTKDSGRAPVGLAGYPPLVAESLVPVVAIGDVRPEDAADLAATGVAGVAIVRALMNSQDVATDVKLVLKGFAQ >NC_016785.1|WP_014306224.1|28840_30640_+|phosphomethylpyrimidine-synthase-ThiC MSAASANSATNPSAWENSEIHPKHSYSPIVSGDLEVPETEIQLDDSPTGPNDPVRIYRTRGPECDPTVGLKPLRAQWIDNREDTEEYAGRERNLADDGRSAQRRGAASLEWKGVKPAPRRAKQGKRVTQMHYARQGIITKEMEFVALREHMDPEFVRSEIARGRAIIPNNINHPESEPMIIGRKFLTKINANIGNSAVTSSIEEEVSKLRWATRWGADTVMDLSTGDDIHTTREWIIRNSPVPIGTVPIYQALEKVNGVAEDLTWEIFRDTVIEQCEQGVDYMTIHAGVLLAYIPLTTKRVTGIVSRGGSIMAGWCLAHHKESFLYEHFDELCEIFAQYDVAFSLGDGLRPGSVADANDAAQFAELKTIGELARRAWEYDVQVMIEGPGHVPLNMVQENNELEQKWAHDAPFYTLGPLVTDIAPGYDHITSAIGAAHIAMGGTAMLCYVTPKEHLGLPNRDDVKTGVITYKLAAHAADVAKGHPGARAWDDAMSKARFEFRWHDQFALSLDPDTAIAYHDETLPAEPAKTAHFCSMCGPKFCSMRISQDIRDMFADKITDLGIPQVGGDAEAGMSAKSEEFVAQGSQLYSEVRDNAAHA >NC_016785.1|WP_014307720.1|27147_27765_+|rhomboid-family-intramembrane-serine-protease 
MVHTEVMNALKRAYGQAPATAVLCALTILIYLLTVVESRSIEHNLSDSWIADHWTLYGPYSHGLGWLRMVGTVFLHSGPTHLALNMFMLFFFGREIEHYLGSGRFTLAYIVSGIGASATVLLMDPLAPTVGASGAVYGLMAIFVAMSYRLRRDLTAPLILIAVNVGYSLLMDGVSLWGHLGGLLTGCVLGIVLVIAQTTRGGKRG >NC_016785.1|WP_014306232.1|40010_41684_+|type-I-E-CRISPR-associated-protein-Cse1/CasA MASLLEIRWILALDTQGNQISVGIRDIFAGEVQVAYIQGESPAQDYAVMRLLLGIFWRAHSMDIDMDVEPFDFLEWFNKMRRQLARKGKDQVVLDYLDRYADRFELFDAKAPFMQVAGLHVASGEYKHITTIIPEAQDEYFSMRGGKERDSVSIEEAARWLVYVHAFDYSGIKSGAVGDSRVKGGRGYPIGTGWTGMTGGTLVKGEDLLDTLLLNTTLETLNDPEDRPVWERTAYGPGERAVVGENSQPQGPADLATWQSRRIRLIPEGDRVVGVIVCNGDKIPDAGANVLDDPMTPYRFSTNKSKKDHDVYYPRPYDVERTMWKALDALVVAETDGGFSAKEKAPKRPKNLASLAELAVQKANVPAVLNVDLVSVEYGPQASSVATTYASRMSMPVVLLLTEAKHLRGKVREIARATTQSAVALGQFSGNLLDAAGGEYAFQPAITDRVLAELEPRFNDWLERLRDISPEQAIQDNESLTDLEKAWQHTARSVIDQHARILLRGAGPKALAGRIQYRDADDHKGRVVSAAGYYRMLQRKLDEVLPLTVRKQEKEGE >NC_016785.1|WP_014306233.1|41687_42314_+|type-I-E-CRISPR-associated-protein-Cse2/CasB MAENKDALRAAVGATCHRLQEAYLGDRNSARHHSARATLAELRHGATVDIRKNPLGLEKVLFAMVGDFSDRLVGHGDNPSPSEEAAFVALTLFGVHMQSATSPVHIPQVSFASACGRLHSLGTSDSIKPRVDAMLLASQEQARLVHIRSLVTLLRANNIGFDYGLLARDLRALNDPKKRAGIQLRWGRDFAIGHFRNFNSTTNSTQTA >NC_016785.1|WP_014306234.1|42328_43462_+|type-I-E-CRISPR-associated-protein-Cas7/Cse4/CasC MTLVVDIHALQTVPPSLINRDDMGAPKTAIFGGVPRQRVSSQSWKRAVRRFFEDEIDKSTVGLRSRKLPENIVSRAIALGSDLTEDQILDGVRQLFKAAKISLVEPKAPKKGEELPEDAEKYPTTGYLLFLSPYQLDRAAQAVVDKNGEKFTKAEAEDILDTKHSVDMALFGRMLADAPAYNIDASVQVAHAISVHESQPEFDYFTAVDDVVEDAEETGAGMIGTTQMMSSTLYRFATVNVDGLVKNLEDTELAHEAVRMFIRAFAESMPTGKQNSFANNTLPELMYVAVRDTRSVSLVNAFEEPVAAADGSRRQAAATALAREERDIEEAYGMKPLASFVVALGELGTDFEELAEKVTLTELGEAVIRTLAAQEAK >NC_016785.1|WP_014302665.1|43466_44171_+|type-I-E-CRISPR-associated-protein-Cas5/CasD MSYSLLLLLKGPLQSWGDESRYNTRATGNTPSKSGVIGLLAAAQGRQRTDSIEDLVALDFAVRVDQSGTLLRDYQTAQPWQKAPKANASLVTRHFLSDAAFVAAIGSEDKELLEGLQGHLRQPTYPLFLGRRSCPAPVNLDLGIVDKPVVEALMAHDTWHATSAHQQERSRDVELPIYRDGKPGEHGVPRQDVPLSFAQEHRRYGWRTVVYAGSKSIVNEKGTANDPFFEAVIS 
>NC_016785.1|WP_014306235.1|44172_44838_+|type-I-E-CRISPR-associated-protein-Cas6/Cse3/CasE MATFTKVLINPARRQGRKYLTNPHMLHAAVMHGFPPDCDTSTERILWRLDERGHEHILYVVGPEKPTIDHIVEEAGWDVRPPQSADYDRLLSQLTKGQKWNFELVANPTHTIPVQGGKRGKVVAHSTAAKQLEWLHRKAESMGVSFGTLENSSAQVVGKKTLDFHRSRPNGERGDRVHLVTARFSGQLEVVDADKLRATLIGGVGRAKGYGCGLLTLARGQ >NC_016785.1|WP_049791608.1|44892_45780_+|type-I-E-CRISPR-associated-endonuclease-Cas1 MESRLSFLYVERAVINRDGNALTIKDQRGIAHVPATQLAVVLLGPGTKVTYAAMALLGDAGASVVWVGEKGVRYYAHGRPPAKTSRFAEAHARLWSNQRTRLRCARRMYDMRFPGEEISQLSLSQLRGREGARMKKIYAAEAQRTGVVWTRRNYDPQDFESGDPINRALTEGSAALYGIAHAVIVGLGFIPSLGIVHTGTDRAFVYDIADLYKAEISIPVAFEAVAAIPSGDDLNVRARIRDKVVSTRLMQRMVHDLQDLMEIPEEDAYSDVDLMLWSELEVIAAGVNLRSLLLV >NC_016785.1|WP_014306237.1|45809_46157_+|type-I-E-CRISPR-associated-endoribonuclease-Cas2 MIVLVVSAVPAGLRGDLTKWLMEISPGVFVGNPSARIRDLLWERTTELSKDGRALLIYSSNNEQGMEYRTHRHEWEPTDFDGLTLMMRPDNTKPSSYQRRTGWSIARNVRKNRGN >NC_016785.1|WP_003849883.1|46164_46434_-|cell-division-protein-CrgA MPKSKINHSSTAYTAGSTANRTPVKINSTGTPLWYKIIMFGLILAGLLWLIVNYLAGQDISFMTELGPWNYGIGFGLFIIGLLMTMGWR >NC_016785.1|WP_014306238.1|46550_48572_-|Stk1-family-PASTA-domain-containing-Ser/Thr-kinase MTDIVLVDRYRLGDVIGTGGMSEVYEATDVLLGRKVAVKMLRADLARDVNFRERFRREAQNSGKLNHPAIVAVYDTGETPRAGLNTPYIVMELVNGRTLRDIVREDGPLTPSQAAHTLIPVCHALQVSHDAGIIHRDIKPANVMITNTGAVKIMDFGIARALDDATSAMTQTSAVIGTAQYLSPEQARGKLADARSDVYALGCVLYETLTGKPPFEGETPFAVAYQHVQEDPVKPSEYIADLSPTAAINVDAVVLTAMSKHPGDRYQTAQEMCADLERLERNAVTDAARHYVTPTSFATQDPASTTVVPVTQVTELDHAEAGAGIGAGVVPAGAVAGSAAVAGAGGAHAAPRSSNRGLRILAAILAVLVLAVGAGFAIDHFGGGPFSQRSTVTIPKLQNSTQQDAVNQLEKLGLQVNVIEEPNPDIPRGKVIRTNPTDGSNVQRNSTVRLTISSGKEITEVPDLSGKNTADAVKILEAAGLLLDPTVREDSSDTVPKGEIIEVSPAAGSQVSRGSKVSITVSTGVETVRVPVITGMKWDQAEGNLTSLGFKPEVVRVDSVEPAGTVVAVPDEGAEVPKGSRVTVQISNGAMFTVPEITRQTIGDAVRILHDAGWNGNASRLIQAAKVPTVAVTDQNLIASQLPTPGTALRKDAPIEIRLYEFNLAALVPPAQH >NC_016785.1|WP_014306239.1|48568_50098_-|serine/threonine-protein-kinase 
MTQSQSPDPALQALVGSDYALQWVVGNGGMSTVWLADDLRNQREVAIKVLRPEFSDNEEFLSRFRNEALASEHIDSDNVVRTYDYREVTDDMGRTLCFIVMEYVRGESLADMLARKGRLEEDLALDVLEQAAHGLSIIHRMGMVHRDIKPGNLLITQNGQVKITDFGIAKAAAAVPLTRTGMVVGTAQYVSPEQAQGRDVTAATDVYSLGVVGYEMLVGQRPFTGDSSVSVAIAHINQAPPAMPTSVSAPARELIGIALRKDPAHRYADGNELALAVSATRMGQRPPQPKSAPLQHIAPQPAPTESTYALGATAQPTTVIPATGQVPAAPTAAPAAAAAYPASTVIPAGTPRQEPEKQSSGWGAGIVVGALAALLLGTAAWAASQGMFDDLFDKTSQSSESSVPPPPVTATVTETPTPQITTVVPEPLPTSSPEPTPSETKRTPDESHPSSDHQLPSVRPSHNGRPSSQAPHAPHAPTQDADQPAESPAPDTLDSLIENLNKLNQGGAQ |
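
The spacer records listed for NC_016785_1 use pipe-delimited FASTA headers. A minimal Python sketch for reading them follows. The field meanings (spacer ID, start coordinate, spacer length, contig accession, detecting tools) are an assumption inferred from the values shown, not a documented specification, and the names `Spacer` and `parse_spacers` are hypothetical.

```python
from typing import NamedTuple


class Spacer(NamedTuple):
    spacer_id: str          # e.g. "1.1" (array number, spacer index); assumed meaning
    start: int              # assumed: start coordinate of the spacer on the contig
    length: int             # assumed: spacer length in nucleotides
    contig: str             # contig accession, e.g. "NC_016785"
    tools: tuple[str, ...]  # tools that reported the spacer
    sequence: str


def parse_spacers(text: str) -> list[Spacer]:
    """Parse whitespace-separated '>header SEQUENCE' pairs as shown in the table."""
    tokens = text.split()
    spacers = []
    for header, seq in zip(tokens[0::2], tokens[1::2]):
        if not header.startswith(">"):
            raise ValueError(f"expected a FASTA header, got {header!r}")
        spacer_id, start, length, contig, tools = header[1:].split("|")
        spacers.append(
            Spacer(spacer_id, int(start), int(length), contig,
                   tuple(tools.split(",")), seq)
        )
    return spacers


# First two spacers of NC_016785_1, copied from the record above.
example = (
    ">1.1|39054|33|NC_016785|CRISPRCasFinder,CRT "
    "GCCACCAATCAGCCCTACGAATCGGCAAACACA "
    ">1.2|39115|33|NC_016785|CRISPRCasFinder,CRT,PILER-CR "
    "CTCGGAGGAGCGGGCACCGACGTTACGGAGCTT"
)
spacers = parse_spacers(example)

# Distance between the end of one spacer and the start of the next;
# under the assumed field meanings this is the length of the repeat between them.
repeat_gap = spacers[1].start - (spacers[0].start + spacers[0].length)
```

For these two records the gap equals the length of the repeat `GTTTTCCCCGCACCAGCGGGGATGAGCC` shown in the array listing, which supports reading the second and third header fields as start coordinate and length.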
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NC_016785_2 | 1465451-1465563 | Orphan |
NA
Consensus repeat of NC_016785_2
|
2 spacers
spacers of NC_016785_2
>2.1|1465474|22|NC_016785|CRISPRCasFinder TTCTGCTGCCGGCTTAGGCGCA >2.2|1465519|22|NC_016785|CRISPRCasFinder AGCCGCTTGTTGCGGAGTTGCT |
CRISPR arrays and Neighbor proteins around NC_016785_2
The CRISPR arrays of NC_016785_2 >merge|NC_016785|2|1465451-1465563|CRISPRCasFinder GCTGGCTTTGGAGCCGGCTTCTTTTCTGCTGCCGGCTTAGGCGCAGCTGGTTTAGGAGCCGCAGGCTTAGCCGCTTGTTGCGGAGTTGCTGCTGGCTTAGGCGCTGCAGGTTT >NC_016785|2|2|1465451-1465563|CRISPRCasFinder GCTGGCTTTGGAGCCGGCTTCTT TTCTGCTGCCGGCTTAGGCGCA GCTGGTTTAGGAGCCGCAGGCTT AGCCGCTTGTTGCGGAGTTGCT GCTGGCTTAGGCGCTGCAGGTTT
>NC_016785.1|WP_014302055.1|1462343_1462787_-|30S-ribosome-binding-factor-RbfA MVDHARAARLAKRIQTIVATAIEREVKDRRLEYVTVTDTRVTGDLHDATVYYTVRGRTIDDQPDLKAAAEALQRARGQLRKIVGDQLSVRFTPTLSFELDTVPEASAHMEDLLARARARDLELAELKKNAQPAGDAHPYKDDDAMND >NC_016785.1|WP_193350843.1|1461718_1462303_-|DHH-family-phosphoesterase MCVAGHERPDADAVGSVCALVLGLQQAGCHAQGFIGQDAPIGASFMWLPGAHAINLATELPPCDLIITVDYAALSRLGALQKSIVEAKQPVIVLDHHATNHGFGDLNLVDAQAESATSIVYDLCQALGVEITPDIAQCLYSGLVADTGGFRWGRPRIHDLAQTLVEHGADIQQINHNLFDAYSLDDRRPFARCC >NC_016785.1|WP_173664309.1|1461471_1461765_-|hypothetical-protein MRIHWMIEGHSRDAVESIADYVRGLYGVDLAVVLKEYSPETWTISLRSDSWDVAAIARKCGGGGHVHAAGMTLHGTEQEIIGSIMAAVAAAAETAKL >NC_016785.1|WP_014307016.1|1460155_1461475_-|MATE-family-efflux-transporter MNTSLNDTADRSAHITAVTVLALALPSLGVLAATPLYLLLDTAVVGGLGTVALAALGAGTVIYSQVTTQLTFLSYGTTARSARLYGAGKQGEAVYEGVQATWIALLVGAVLATILFFGAPTFAWWLTGNREVANNAGHWLRITAFGVPMILAIMAGNGWLRGIQNTRAPLVFTLAGVIPGACAVPFFVHWWGLVGSAWANLMGTSITAVLFVGCLARYHRGSWRPQWRIMKTQLVLGRDLILRSFSFQVSFLSAAAVAGRFGAESLAAHQVLMQLWGFLTLVLDSLAIAGQTLTGAALGAGSAAVARAVGEKSIRYSTFFGVVLAAVFTVGWSVIPQVFTRDTNVLNVMAGPWWQLVALIALGGVVFALDGILLGASDAAFLRTVSIASVVCGFLPGVWLALIFDAGLVGVWWGLIAFLCIRLGTCWWRFRSMKWAGVS >NC_016785.1|WP_003851917.1|1459302_1460109_-|metallophosphoesterase MVQTLWAVADLHAAVRANGGRIDTIQPHDPSDWLIVAGDVAERTSVVIDVLHELRQRFATVIWVPGNHELFCRSSDRFQGRAKYDELVRRCRQIDVLTPEDPYPVFHGVTVVPLFTLYDYSFRPEGLTIEAALQSAHDKQLVLTDQFAIAPFVDIRAWCWDRLAYSVHRLSRERGPKILINHWPLVQEPVSELPIPEIGLWCGTRHTRSWPVRYSAITVVYGHLHVPNERIIDGVRHVEVSLGYPHQWSQNIEDRSWPFPVMTSEVVA >NC_016785.1|WP_010935090.1|1458613_1459306_-|4'-phosphopantetheinyl-transferase-superfamily-protein MIAGNNSEYALDSRLFPQSARSTALLVPRHTPDLSNFNRLHVLEKAQVKNAVAVRRAEFGDARWCAHQSLRKLGLYDHPAILRGERGMPLWPVGIAGSLTHTEGLRAAVVAPTTEVASMGIDAEIAEELPGGILGSIARPNEIAMLDDLRARGLLFADRLLFCAKEATYKAWFPITQRWLDFDQAEIDIRADGTFISYLLIRPTPFPFIEGKWAIHDGYVVATTVIPAMG >NC_016785.1|WP_014307015.1|1457714_1458617_+|tRNA-pseudouridine(55)-synthase-TruB 
MNDALANSGLVIVDKPEGMTSHDVVSKIRRTFSTKKVGHAGTLDPMATGVLVLGLERGTKFLAHMVASTKSYAATIRLGAATTTDDREGETIASASPDQLAAITETKISDAVKQFRGSIMQRPAAVSAIKIDGKRAHQRVREGEKVEIPARPVTISRYDILEIRRDAAFIDIDVEVDCSSGTYIRSLARDLGEELGVGGHLTALRRTQVGPFTLDNAVTLEKLEENPHVSLTLDQALAASYPVLSVSEKEASDLAMGKWLTPRGLKGIHAAVDPHGRAIALVKEQGKRLATIFVARPSTL >NC_016785.1|WP_014307014.1|1456720_1457692_-|bifunctional-riboflavin-kinase/FAD-synthetase MDQVDIWHRLEDIPADLKASVITIGVFDGVHRGHRTLVAAATDRAQALGVPSVLVTFNPHPLSVLRPDKMPPLLGTVNQRADLAESLGVDHMFAMNFTAELSHLSPEEFFCSVIKDKLNAQAVVVGKNFTFGYKAAGTTDTLKALGEKYGVEIYVLDLLTENGDVVSSTAIRSDLLEGNIRRANWGLGREFSVHGDVVRGAGRGGKELGFPTANLYFPDSIALPEDGVYAGWLTVTSSAPIDGDMVRGVRYPAAISVGHNPTFGDKRRSVESFVLDRHADLYGHSIVVEFVDRIRPMVKFDGIDELLVAIENDVTQTRAILHI >NC_016785.1|WP_014307013.1|1455701_1456661_-|nucleoside-hydrolase MKKIILDLDTGIDDALALAYTLGSPELDLIGVTATYGNVLVETGVRNDLALLELFGRSDVPVFAGEPHALAKDGFEVLEISAFIHGKNGIGEAEVAEPVGVVQELSAVDFLIESVERYGDELIIVPTGAMTNIAAAMKKSETFARDAQIVFMGGALTVPGNVSQWAEANVNQDPEAADIMVRNAGDITMVGLDVTLQTLLTYAETATWRTLGTPAGNFLADATDYYIKAYDTTAPHLGGCGLHDPLAVGVAIDPSLVTLLPINLKVDTEGPTRGRTIGDEVRLNDPHKNCKVAVGVDVDRFLKEFMERITRVAQGSNAR >NC_016785.1|WP_003851911.1|1455260_1455530_-|30S-ribosomal-protein-S15 MALSTEQKKSILAEYGLHETDTGSPEAQVALLSARINQLTEHLKFHKHDHHSRRGLLLLVGRRKGLLKYLADNNVDRYRDLIARLGLRR >NC_016785.1|WP_003851923.1|1465898_1466231_-|YlxR-family-protein MPSDSNQRSQQRIRTCIARRRPLPEASLLRVVALKGPDESAAVRVIPDPQRKMGGRGAWISPTLEALELAEKRRAFNRALRVSAVVDTGHVREYLAGLTARPNIVRKTEH >NC_016785.1|WP_003851924.1|1466486_1467485_-|transcription-termination/antitermination-protein-NusA MNIDVQALKAIEADRNIAVDELLETIARALLFAYQEYKDTNTVENSRARVDINSVTGHVSVIVSELDEDGVVTTEYDDTPENFGRVGAQAVRDAIVRRLREAETLKAYDAYSEYEGRVVSGIVQADIFANEKGIVVIHLGTEVDGQDGILIPAEQIPGESFKHGDRVKAYVVGINRTPRDLQINLSRTHPELVRRLFELEVPEVADGSVEIIGIAREAGHRSKVAVKATVKGLNAKGACIGPRGQRVNNIMNELGGEKIDIIDFDDDPAKFVGNALAPSKVVHVEITDAEAQTAQVTVPDYQLSLAIGKEGQNARLAARLTGWKIDIRSDAS >NC_016785.1|WP_010935095.1|1467481_1468036_-|ribosome-maturation-factor-RimP 
MAFPTVEVLTELVTPVVAQHNMDLEGIRINKAGKKSLVAVSVDSDFRPDLDQLELVSNQISEVFDAGEAAGELSFGAGYTLEVGTPGLDQPLASARRWRRNRHRLVALEVEGKKSVERIGALNDDETAVIVVKRRGKKLVVRSVQLAENTQAVVEIEFAKPAEDELALTALEFDQALDRGEENK >NC_016785.1|WP_010935096.1|1468087_1468996_+|DUF4439-domain-containing-protein MREVIFLQRSTTSAISAICLLGCVVACDISPSPDPNATLIDLAAIAHNDAAVLQSKNSALAQQRHADSEELISEIQRLCGTNSEGKLPESCSNEIVQGAVDKQAVSLKETVTESDAATRSAQAIVSAIDSAPSESLGLLGQQLVDLVRAGAAAPQSGIAQLNPRNEINKGTSKDDLIHDRESLKKALDWEYSAIYGLGVALAHSPAGTRTAVSDAITAHRDRVELLESSFAESFPNETIPRPEAAYEFSGYPEPHDAQSSRAFFDSLEADSAAWWLHALSESHSATWRALCASLAAQSAARR >NC_016785.1|WP_014307020.1|1469074_1470832_-|proline--tRNA-ligase MITRLSTLFLRTLREDPADAEVPSHKLLVRAGYIRRTAPGVYTWLPLGLRTLRKVETVVREEMDAIGAQELLFPALLPREPYEQTHRWTEYGDSLFRLKDRKGGDYLLGPTHEEMFASAVKDMYSSYKDFPVTLYQIQTKYRDEERPRAGILRGREFVMKDSYSFDMSDAGLEDSYQRHREAYQRILDRLGVEYVICAATSGAMGGSASEEFLAVSDNGEDTFVRATEGPYAANVEAVVTQPGVERPLEQAPEAVEYETPHAETIEALVQWAQSAGVTVEDRSVAAADTLKCLLVKITQPGAEEAELAGILLPGDREVDMKRLEASVEPAEVELASEEDFKNKPFLVKGYVGPRALNAHGVKVLADPRVVSGTSWIAGADAVEHHVVGLTMGRDFTVDGYIEAAEIREGDPAPEGQGTLTLARGIEVGHIFQLGRKYTEAFDVQILDESGKRAIPTMGSYGIGVSRLMAVLAEQRHDETGLNWPLEVAPYQVHVVVANKDKEAIEAGDALVAALDSHGIEVLFDDRPKVSPGVKFKDAELLGMPFVVVLGRAFKDGNIELRERGQETVLVSADEIVDTVVAKLNR >NC_016785.1|WP_014307021.1|1470863_1471610_+|peroxide-stress-protein-YaaA MLIVLPPSETKAVGGKNPAIDFDSLHFPTLNPIRKEIAHDLARLDISQAQEILKLSQKLLPEAQRNIELFQSPTMPAVLRYTGVLYDALDATSLPSSTWEHLAIGSALFGVVMANDNIPHYRLSGAAKLPCADASVPTLKRRWGNAISTALSDTNEVILDLRSGTYQQLGKVKHAITVRVESEDSDGKRSVISHFNKHYKGQLARALLLHDVTPDPDHVIEDLIGMTQECGFAVEHSKPHELTVVITQ >NC_016785.1|WP_014307022.1|1471629_1472454_-|uroporphyrinogen-III-C-methyltransferase MVYRIMKKFTEISSRNRKLFKLAGMFSSSESWTASVSLIGGGPGAWDLITVRGMHRLQQADVILADHLGPASELAQLCDVSTKDIIDVSKLPYGKQVAQSKINELLIEHAQAGKKVARLKGGDPYIFGRGFEELQACAKHGIACEVVPGVTSAVSVPALAGIPITQRGVVHSFTVISGHVPPQHPQSLNDWEALARTGGTLSVIMGVKNAGAIAQALIDAGRGADTPVAVVQEGSTENQKSFKTTLAQLGQAMKDNDIKPPAVYVIGEVAGLQA >NC_016785.1|WP_010935099.1|1472452_1473799_+|YdiU-family-protein 
MPLTFAHSFADAVPSLSVPWRAEQWPDPRIIVFNHELGQELGIDEASLLSNITQQGHAQAYSGHQFGQFNPLLGDGRALVLGDCAQTNAPHGQFEISLKGSGPTPFARRGDGRATLGPMLREYLISEALHGLGIPTTRSLAVITTGTDVQRERLLPGAVVVRVAQNHLRVGSVQCAAMRDDDSLAALVRYALSSHEDGVSDAEAAGLLLRRVSTSQAKLVAQWMRFGFVHGVMNTDNVTLSGQTIDFGPCAFIDSFHPQAVFSSIDSHGRYSFGRQPSIMGWNIARLAEALLPLMSIDEARNIVHEFPDMYRRSWLDEMADAVGISPDDDHAAGVLDDLVALLDVHRPDYAQFMRSLSDGTTVSLFPWAQQWEAEVSSLRVAAPRNPVYVPRNYLVEDALEHAMNADMSVFSLLVEAGKNPYLREKRFEKLENPAPETWQDYVTYCGT >NC_016785.1|WP_014307023.1|1473802_1474927_-|GTP-binding-protein MSHSTPVTVLSGFLGSGKTTLLNQMLSNRESKKIAVIVNDFSEINIDAALIAGEGHLERGEDKFVELTNGCICCTLRDDLVQSVGALASSGDYDHIVIESTGISEPMPVAATFEWVWDDGTRLADIAPIDTMATLVDASQFLTYMGKKTYLTDRDLGATEDDERTIADLLVDQVEFADKIYITKSDLVDDERYHATKALVRRMNPRASIDKLVNGRVITASGENRNAINDLLGAMCYDEETARTYQGYVAELDNPHTPETEEYGISSFVFKGDRPFDRQRLIAALRSTRGIVRSKGHCWISDRIDMVQVWHQAGPDLRIAPAGYWQSAGITPSNEIVVIGVNFDHAQAQQLLNDAMLSDSEVQQLLSTADAAKS >NC_016785.1|WP_003851932.1|1474946_1475456_-|methylated-DNA--[protein]-cysteine-S-methyltransferase MHAEYGFVSTPDGQFCVVTDSMTHKILASGWTENVSELVGLIHRDLRPLSMREAEVSLSIQNVISAYYDGRFARILTVPLLQQATDFRMSVWSALRRIPSGSPVSYATLARMSGHEGAVRAAASACANNPVALFVPCHRVIRSDGSYGGFRYGLAVKRSLLTREVAHNK |
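
The neighbor-protein records carry their coordinates in the FASTA header. The following Python sketch parses one header and computes its distance to a CRISPR array. It assumes the header layout is contig accession, protein accession, `start_end_strand`, and description, and that protein and array coordinates share the same contig coordinate system. Both points are inferred from the records above rather than documented, and `Neighbor`, `parse_neighbor`, and `gap_to_array` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Neighbor:
    contig: str
    protein_id: str
    start: int
    end: int
    strand: str
    description: str


def parse_neighbor(header: str) -> Neighbor:
    """Parse a header such as '>NC_016785.1|WP_...|1465898_1466231_-|YlxR-family-protein'."""
    # Split at most three times so a '|' inside the description would be preserved.
    contig, protein_id, location, description = header.lstrip(">").split("|", 3)
    start, end, strand = location.split("_")
    return Neighbor(contig, protein_id, int(start), int(end), strand, description)


def gap_to_array(n: Neighbor, array_start: int, array_end: int) -> int:
    """Difference between the nearest protein and array boundary coordinates (0 if they overlap)."""
    if n.end < array_start:
        return array_start - n.end
    if n.start > array_end:
        return n.start - array_end
    return 0


# Two neighbors of the orphan array NC_016785_2 (1465451-1465563), copied from the record.
rbfa = parse_neighbor(
    ">NC_016785.1|WP_014302055.1|1462343_1462787_-|30S-ribosome-binding-factor-RbfA"
)
ylxr = parse_neighbor(
    ">NC_016785.1|WP_003851923.1|1465898_1466231_-|YlxR-family-protein"
)
upstream_gap = gap_to_array(rbfa, 1465451, 1465563)
downstream_gap = gap_to_array(ylxr, 1465451, 1465563)
```

Under these assumptions, the nearest listed protein on each side of the orphan array can be found by taking the minimum gap over all parsed neighbors.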
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
140991 : 175272
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NC_016785|140991:175272|DBSCAN-SWA GCTAACCAATCACCTTGAACTGAGCCCGCCTGCGCTCCGCCTCTGACGCCTCATGAACTGTAGACGCTTCCTCGGCGCGTTTCCTCTCACTTTCCATATGCGCTTCCATCGCGCCGGGAATGGCATCAAGGCCTTCTTCCCAGAGATGCGCATAGATATCCAGGGTCATCGCAGCACTGGAGTGGCCGAGCATAAGTTGTACTGTTTTCACATCAGCACCGGCTGCAATGGCGATGGATGCAGCAGTGTGGCGTAACTCGTAGGTGTCGAGGTCACCAATCCCAGTCCAGATGCACAGGTTTTTCCACACCACCCTCCATCGGGCGGTAGTCCAGACTTTTCCGCGTTCGTCAGGGATCAGCCAGTCATCAGGGTCTTTGCCTTGAGCGTTGCGATCTAGGAGCACCAGGATTTCGCCACCGATGGGTACATCTCGGTGGTTTCGTGTTTTTGTTGAGTCTTCACGGCCTAAGTCATCAACGTCACGGCGGATCATGAGACGTCCGCGTACTGGGTCCAGATCTTTGACTTTGAGTCCTTTTGCTTCTCCTGGTCTTAGACCGGTCATGATGAGGACGCGTAGGAGGAGTTTTGCTTGTTCGGTGGGTGCTTGTCTGATGAGTTCGTCGACTTCTGTGATTTTGAGGTAGCGGCGTTCTGATTTCTTTTGTTTTGGTAGGTCGCCGGTTTTGGTGGGGTTTTGGTGGATGACTCCGAGTTCCACTGCGAGGTCGAGGATTCCGTGGATGATGAGGCCGACTTTGCGCATGGCTGATTCGCTGAGTGCTCGCGGTGGTTGGCTGGCTGGCACGCTTTTCATTGTTGAAAGTGTGGGGATCCAAGCGTTGATGACTGAGCGTTGAATGTGTGCACAGGGGGTTTGTCCCCATTGTGGTCGGATGTGGACGTTCCAGTAGCTGAGGTAGTCGCGTTTGGTTTTGTCTGAGATGTTTCCTTTTGATGCGATCCAGGGTTCCCATAGGTCGGAGAGTGTGATGTCGACTTTGTCTTTGGTGATCCAGGTGCCGTCGGCTTGTCCGACTTCTGCGCGGGCTGCCCAGAGTTCTGCCTCGTCGCGGGTCTCGAATGTTTTTGTTGCTTCGCGCCCGTTCTCAATCCAGACGGTTTGCCAGCGCTTGCCGACGCCCCATCGCGCTGATCGGATACGTTTCGTTTTGGATGTGGTGTTGGGGTTTCTTTTTGTCCAGAGGTCACGGACGGTAGCCATGGGGTAGACTTCTTTCTTGCTTAGTTCTTTAGAAGGGGCTGGGCATTGCCCTTCACCGGGTCTTGCTTGCCGGCGGACAGGTGAAGGGCACTTGGCTGTCTATTGCTTATGCAGAGATGTCACGGTCGCTAGACCCTCGTTCGTCGATTGATGCTTTTTGATGTTGTTGGAGAGTAAGTCGATCGGGTCGATGTAGACGTTTTCAAGAAGTACGTCGACAGGCTCGTTGATGTTCCAGCCGCGGCCTGCGATTTGCATTCGTAGGCTGCGGTAGCGTTTGTAGTCGATGAGTTCGAGTTCGTGGGCACGCCGGACGATTGCTTGGATGGAGTAGCCCCATTGAGCTTTGAGCTCGGCGTAGTCCTTGAGCGTTAGCTCTGGAGTAATTACAGGCGTGAGGAGGGCGCGTGGCATGAGGAAGCTGGCGGCGAAAATATTCGCTTCCTTTTCCATCTGGGATTTGTCGGATCGGAGTGTGTTTGCGTGAAGTATCAAGTGAGCGAGTTCGTGTGCGAGGCTGAACCGATAGCGGTCTCCACTTCGTTGTTGATTGAGTGCGATGATCCTCAATGTCGAGTCGGTCGGCGTTGAAACTCCGTCAAAGTTGGTCTCCTCAACGACGTAGTCAGGCAGTGAGGTAACGAGGATTCCGTGGTTGTTTAACACTGTGGTGAGGTTTGGGATCATTGAATCTTGATC
AATCCCAAAGTGATCCCGGGTTTGAGAGGCGAATTCTTCCAGCATGCGGAGGCTCAGCTCGCCTTCGAGATCGGCAATGGGAATCGTGGGTAGGTCTGATGTCACGTCGGGGTACCGTTCACGAAGGTAATGTTCGGTGATGGAAAAGGCTGCAGCTATCTTGTCAGCGTTTGCACGGCCTTCGCGTGCTGAGCGGAAGAGTAGATCGGGTGCTGCGTAGAAGTGGATGGATGCCTCGAAATATTCAGCACTGACTCCGGTTTGAAACCGTGCGGTGTTGATGAGTTTCGTGCTGGGGGGTCGGTTTTCTCGCTCGACACTGCTCAAGGTGGATTGGGCGATGCCTAATTGCTCGGCGAATTCTCCCTGGGATCGTCCGAGGAGCAAGCGTAGGTGTTTCAGGTTCGAGCTGAGGGTTGGCATGAGTTGGCTTTCTAGCTTCCGAAAACGAAGCGGGGTTCGTCTGCGTGGGTGGGGATGAACATTGCAGTTGTCAGTGATTCAGGTTTTGTGAGTAGTGGAATTTTCTCAGACAACATGGTGGCTTCTTCGAGAGGGCCTTCGTTGTCGAACATCTGGAGCGAGATCCGTTTGAGCGCGTCGGTTCCAGGCTCTGCATATTTCCAGAAGAGGACGATGCGAGATCGACGCTCGATGAGCGGCCTTTGTGATGATTCCTCGTTCGTTCCATGGCGAAATCCTTCCGGCCTGCGGAAAACGAGGGTGACATCGTCGTCCTCGAAGCGGAGGCTGAGTGGAAACATGGATTCGTCGATCGTGAGCCCGCCGAGTCCGCTTTCGTGGAATTGCTTGTGAATGGCTACCACCATGGGATCCCTTGTCAGGTTGCGGTACCGGTTGATGTAGTCGCCGTAGGGGAGCAAGTCCCGCATGCGCCGTCCATCGGCCAGCGCCTCCCGTAGGGCATCATGGTCAGCTTCAAGATTGCGGAGTACCCGCTTCATTGTGAATAGGTTTTCGCCGGTAGTGTGGGCCATGCCATATTCCTTTCTTATAACAGTTCGAGTGATATTTTAACAAAATATCGAAAATATCACACCGATTGTTACATATGAGAAAGTGATTAAGGCATTAACATGCCCTCTTCTCCGGCTGAATTCTTCCTCGCTCATACATCCCCATCCACAAAACGAGTAGATGAGGTGTAACACCCAACTCTGCAGCCATCGCCGCAGCAGAACCCTCACACTCAAGCCCTACAGCCTCCACATCACCAACATCCAAAAGCTGCTGTGCTGCCCACTCATCAGCCTGCCGCTCCGCGCCGGGAGTTGAGCAGTTATGCCCGTAGTGGGCATGTCCTAGTTCGTGGGCGATGGCGCAGCGGCGGGTGACTGGGTCGAGGCCAATTTTGATAAAGATTGTTTGTGTTGGTGGGTGAAAGCAAGCGTTGAGGGTGCTTCCCAGCTTGCTCGTCTCGATCACTGTGATGCCCATGGTGTGGGCGAGTGTTTCGAGGGCTGCTTCCACAGGGCTCATGAAGTGCTCCTAGGTGTAGTTTTCTTCGAGTGGGTCGGTGGCCTTTTGCGCTGCAATTGGCTCTGTTCCGGCGTTGATGCCGTCGAGGATGGCATCGTAGTCAGGCTCAGTGACATGCGGGGGTGCGGTGGGGGAGTTGTTGCCGCGTTTTGCTTTGCGACGGGTGTCCAACTCGCTGATTGGTTTCTCCGCCCAGTCGGCTTGCCCGGCTTTCATTCGTCGGAGAATCTCGGCGACTAGATCTTCATCACTAGCGTCCATGATTCCGCTCGACGTTGAAAATGCTTTGAGATCGAACTCGGTCAGCACATCGAGCGCGAGCAGCGCTGGGACGACGCTGACTTGGTATGCGCGTGCGATCTTCACTGCCGTTTCAACGGTAACCGTGCCATCGCGCAACTGCCTAGCGAGTGTGGCTACTGGGATTCCGGAATTGATTGCAATCTGACGGTCGGAATCAGTTCCGCGAAAGCTTCTTAGCCAGCTTGTTAGTTGCTCCA
TGCGTCAATTATGACACAACAATGTCCCGCCAGCAATACATTACAAACCGGAGGGTATTGACATGTGTATCATGAATGGTATAGCGTGTTTCGTGAACGAGATAGAAAGGCGTTGAACATGGCAAATGTAAAGATCAAAATCCGTGACGGACTCATCGACCGCCTCCGAAACATGAGCGGAATCACCAGCGACGAAGCCTTCGCCCGAACCATCGGAACCAGCCGAAGCACACTCGTCGACGTCAAAACCGGCGAACGCGAACCATCCCTCGCATTCGCAATCGGAATCGCCCAAGCCTTCGGCCTAGGCCTATCCGAAATCGTCACCTGGGAAACCGAAACCACAGCAGCCTAAACGCGCCGGGAACTAGGCGTCGAAAAGCATAAGAAAGGAACTACTCATGGCTTGGGAAAGGATCGGCGAAGATCTGCATGGTCGTCGAAAAGTAATCGACCTCACTGGAGTGACACCGAAGCAACAACGTCCACGCCAAGTTAAGGCACGCCAAGCACTTGAGCGAGCAATCGCACAATTGAAAAGAGAAGAAACCGGGTTCAGCGTTTATGACGCAGCGGGCGTACCAGAGGAATGGCTACCGCTGATCGCGGTAAGACTCGAAGAGGTCTTTTGTACGTTGCTCCGTTTCCAACGCGAGGTCAACGCCGTCGTAGCAAGCGACCCAGACTTCGCCACCGTCCGCAAGCTCGAAAACCTCAAAGGGGTCGAACTGCTCCAGCAACGGACCGCCGGGGAAATCTTGGAGACTGCCTTGCTCTTGTCCGAATGGGCGCCGGTCGAGTCGGTCGCGGTAGTGCGCGGAATGATTCAAACGCTCGGTGATACGGAAGTGGACGGAAGCGACATCTGAGGTCGCCAGCGTGTAGCGCTGACCACCATAAACAAGAACAACTTTAGACATATCAGCAGCATATCTGTGCCAGGAAGGGGTCGCCAGCATGAGTAATAGCGATGGCATGCGGAGATTAATCGACCAGCTCAACGTCGAGTTTTGCAGGAAACAGTGCGAGATTCGGAAGAGCAAAATATCTGCTGCAAAGCAGGAGCTAGACGGCTCCGCACCAATGCAGAGTTTATTCAGGCTTCGAATCAGCGGAGCTCCGCTGGTTGAACTTCGGGTCGTCCCATTCGAAAAAGATAGGTTCACTGTCATCCATGCTGACGTCAAGGACCCGGTAACCGTACATGCTGTCACCATTGAAGAGATCCATCGCGTGATGACGCTCCTTTTCCGTCAGACGCTCCGGAATCTCGGGGTCAGCAGGATCGGGAAAGACGAATTGGACATCGTATTCAGGCTTGATCCACAGGCTCGGGAAACGAGCAGTCCGAAGATTAAGCGGTACACCGCCCTCTTGGAGGGTATTCACCGCGTAAACAATCCAGGGGAACGTCCAGTACGGCATCTCGTATTCGTAACCATTGATATTAAGCGTGAAGGTCTTTTCGTCCATATCCGAAGACTACTGCGATACCGCAAGGTGAATCGGCTTATGCGTAAAAACATTGTAAACAACACACAAAGAGGACGTGTTGTGCGCAGGGGAAGAAAGAATCACCGTGGACACACCGAACGTAGAGATTTCAATACCAGACGACAGTGAGTTTTATGAGCGACAGTACGAGATCCTGAAGCGTGAGCTACTCGCTGCTGAGCTGGGATTCAATCGCTCGGATTCGCGTTTCGAGAATTGCAAACTGTGTAGTAATGACCGAACTAGTGCGGCGAGTGAACTGGATGCACTGGTTGTAAGCCTCTGCCGGCTTCATCAAACCAGTCTTAATTTCATACTCCAGATCACGAAGTTGGCTTACCAAATCAGCGCTAGTCGACAGTCCTTTCTCTCGAAGCAAAAACTCGATGTCTTCGTGACTAAAACCATCATTCGGATTGTCGGAGGTGCCCATCAAATTTCATCTTTCCTTCTAGGAAACATTTCTAAGAAAAGTCAAGCACATAAATAAACGAAATAAAGCGAAA
TTCTGAAAGGAGTGGAAGGGGTGGAACTGCAGCTATTTAACTTTCGTGGAAAGCGAGTACGCGTCCTTACAAATCAGGACGGAGAGCCTCAATGGGTTGGCAAAGATATTTGCGAGATCTTAGAGATCAAAAACTCTAGGGATGCACTATCTCGTATCGATCCAGAGGGGGTCGGCATTGCCGACACCCTTACACCAGGAGGCGTGCAAAAGCTCAGAGTCGTAAATGAATCAGGACTATACGAACTCCTTTTCCAGTCTCGCGTCCCTCAAGCAAAAGAATTCCGTCGGTGGGTTACCGGGGAAGTTCTGCCAGAAATTCGACGCCACGGAATGTACGCAACAACAGCCACGGTAGAGCAAATGCTCGCTGATCCAACAACGGCAATCAAACTTCTGGAACAGATCAAGCAAGAGCGCGACCAACGTAAAGCACTCGAAGCGCAAGCAGCGATTGATAAACCAAAGGTCATGTTCGCTGATGCTGTCGCGGAAGCTAACACTGACATCTTGGTACGTGACTTAGCGAAGATCTTGCGCGGCAATGGTATTGAGGTCGGCGGAAATCGTCTTTTCGCGTGGCTTCGAAAGCACAAATACCTCATGGACGGGCCAAGCCACATTAAGCACACACCAACGCAAAAAGCGATGGAACTCGGACTGTTCAAAATCAAAGAAACCGTCGTGACCAGATCCGACGGAAGATCATCGATCACCGTAACTCCGAAGGTTACGGGAAAAGGGCAACGGTACTTCGTCGAAAGATTCCTTGATGGGAGATTCGACATCGACGATATCAAGACAAATAAAAACCGACCTGTTGCACCAGGTCGGAAATGACACTCAAGGAAAAGCATCTATGGGAAATACTACCACCATCGACCGATGGCTGTCGCCATCGCAGGCTGCGGAAATCATTCCATATTCCGCATGGCAAATCCGGAAGTTCTGTCGACAGGGGATTCTGCCGCACTCCAAACGACCTGGATCAAAGCAGAACCGCATCATGATCAAGCACTCAGACCTCGTCAATTTCATCCAGCAAGGAGCAGCAGCATGACAATGCGCACTTACAAAAACCCATACCCAGACAGCGAAGATGCTGTCGAAATCCGCTTCGATCATTGCCGTGAAGATATTGCAAAAGCAGCAAAAGAGTACTGGCGAGAAATGACAGAAGCAGAACTCGATGACCTTCAAGAAGAAATCATGCGTGCGCTTGCCGTCAGCGAGTGGCAAAACATCTGGCTTACAAGTGCCGCATTCATCACAGTTCTTGCCTACCATTCCCACGATTAGAGGAAGAAAAAATGATTCTTTTAGTGCTCATAAACATCGTCTTTTCAATGCTCCTGCTTTTCAACCTGACTGATGCAAATCGAAAGATTGAGGACGCGACGAAGCGCCTAGACATGCTCAACGAGGATGTCGACGTGCAGAGCGTGAAAATCCTTGATCTCTACGGGATCGCACAGATTCCACCAATGCCAGATGAAGAGGCAGCGTCTCGAATCTTCACAGGGGTGCGGAGATGAACCGGACAGCTTTAGAAGAACTGCACCAAGCTCTTATATCTGAGGCAGAAGCAATGCGCGCTGGTGAGTATTTCCTAGGCGTCGGAATCGTTGATGCATATGCACAGCAGCTTCGGGAAGCGATCGACTCTCATGACGAATAACGAAGGAATGTGCAAGCACTGTGGAGCAGCCGTGCTCTTTGTTAAAGACGATCGATGGCACGTTTTTGATGCAAAGCCGTCGGAAGAAGGCGAATGGCGGATACGGTCACGGTGGGTGGCTAATGTCGCGGCAGAAAAAACGACAGTGGCGCGTTTAGCGGAGTCGAAATTGCGCCGAGCGCGTCTCATGGGAGAGCCACTTTTTCGCCCGCATTTCCAGACATGCCCTGCGAACCCCCGCGCAAAAATTCTGCAGGAGAAGCGGAGACATGGATGATGCATCAATCGCAGATGAATGGTGGGAGAACCTACCAGAACGAAGA
AAAATACAAATCCATCACTGGATCGTCTCGCCTAGGCAAATCACTATCCAGGAACTGCCTGGTCAGCTTGCGCTAATAGAAGGAACAGAACAATGACTAAAACACAGCTAGTGTGCGACAAAACCCAATTGAACTGGGGGCTGAAAGCGGCAAAAGCAATTGCTGGGAAGAGCCCATATGACATCATCCAGATGAGAGTCTCACCAGACCGCGATTATCTGTATATCTGTGCAGTTAATGACAAGGCAACGCTGGTCGCCAAGGTGGAGCTTCTCGTCGCGAATGTCAGTAGTGAAGAAGATGAGATCATCACGATCGATAAAGCAAAAGTCCCAGCATTGATTCTCGCGACTGCTGAGACCGGGAAGAAATCGGAAGACTCGCGGCCTATGGCAGGGATCTGCATTCGAGGGAAAGAAGTTGATTTCACCGACGAAAATGGGGCTGGGCGCGGAGTTGATCTTACGACGATTCACCGGAACGATTCTGCAGAAATCGGAGATCCTGTCCGCACAATCATTAGAGTCAAACAACAGCTTGCTGAAACATCGCCTTGCGATGTGACGCCATCGCCAGCTCAGATAACAGAACTAGCTCGGGCGACGCGATACCTAGGAGGCAAGCCCAAGCTGAGTATGCGCTCGTATGTGCACGAGGGAACTGAATCGCATCGACTTGTGGCTGAGGCGACATTCTGGACCTTGTCAGTACTCAACGTCCCAGAAGTATTGCTCGAAACCGAAAAAGATAAAGCCAATATCGTCGATGCAGCACCAATCGGAGGGATCTCATGAAACATATTGACGCAGCGATCTGTCGCCGAAAGTCAGAAGGTAAAACCGCGGAGCCTAACTACTGGGATTCCCAAGGCCCGCGAGAATCAACTGTAGCGATGTTAGAACGCCACAAAATAGCTAGAAAAAAGTGCATGACCTGCACACTTCTGGCCGAGTGTGAGAAGATGCTTTCCGATTTTGAAAAAGAAGAACTCAGAGTCGACGGAGTTGTAGCAGGGAGATACTGCGATGTCTCGCACAGAAACGGCTCGAGTATCGATGTCTTGCGACATTGCAAGCACTGCAATGTCCGATTGATACCGCAGGGTGGCCCACGGGGAAAAGAACCCAGCGGGGCTCGAAAACACCGGGGAGAAGGGCTCTGTGCGGTTTGCTATCCCCTTTTCTCAAGAAAACAACGCTAAACACAATGACTAGCAAAGGAAATAGACCATGGCATGGAGCAGAGTCGGCGACAACATCGCCACACATCCGCTCATGTCGAGGCTCCTTACCTCATGCGAATTCGACCACTCGCTCAAAAATGAGGCGTTTGGTGCGCTCGTCCAGCTCACAACTGTGTCGGCTGCGCATCTCACTGACTACATCATTGAGTATGGGCTCATGGCGCAAATCGCACCTGGACGTGAAAAGCAACTGATTGACGTTCTTGTAGATGCCGGAATGCTTTTTCGCGATGAAGTTGACGGGCGCAAGGTTCTACGAATCGTCGATGACAATGAGCTCCTCCACAATCGCAGCCGGGATGAAGTCGAGATCGACCGCCGTCGCGCCGCTGATAAGCGCAACCCGGCTTTAATCCCCGCAGTTCGCTACCGCGACGGCGACCAATGCCGCTGGTGCGGGAAAACCGTCGACTGGCGGGATCGGAAAAGCTGGCGCGCAGCGACGATTGACTCACTTAATGAGCATCGGGAATCAACCGTGGATACGCTGGTTGTCGCCTGCAAGAGCTGCAACTCAAAACGTGGCGCAGGTGAAGAACTTCAACTATTGCCAACCCCTACGAGAGAAAAGGTACATTACACAGACCACACAATCGACTGGATTAATCGTTCCGAGTGGGCTCAGCATGAGGGCATTCATCTCGAGCCACGCCAAACACACCTGGACATCGGACAGCAGATCACAACACCGGCAGCGCCGTCGGAGCAGCAGCAGGTAGGTCAAGCAGCAGCGCCTTTGGAGGCAGCAGCGCGGGCC
CATCGAGCAGCGCCTGATGTTGAAGCGCCATTCGTTAGTGATCCGCTTGATGAGGCTCCAGACTGGGTCAAACAGTCTCTTGTCAACGATCACGGTCAGGCAGCAGCGCCTTCAATGGCAGCAGCGCCGCGTGAACATGATCATGCACCGGCAGCGCCGTCGGAGCAGCAGCAGGTGGGTCAAGCAGCAGCGCCCTTGGAGGCAGCAGCGCGGGCCCATCGAGCAGCGCCTGGCGTTGAAGCGCCACCGCATAACCACGTTGATAACGAAACGGTAAAACCTAGCACGGATCTAGAACAAATCACAGATCGGTGGGGTGACGGATCTAGATCTCTCGGGACGGGACGGGACGGGAATGGACAGGCAGGGACGGTAGCTAATCGACGTCGGAGACGTAGGGGTCGCCGTGGAGGTGGAAGGAATAAAGCTCATGGATAGTTGGAAGTTACATTCTTTAGGGAAAGCTCTGTATGAGTTGGAGAAGTTAGGCCCTTTGCTTGATGATCTTTTACTCCCTTCTCAGTGCGGTTATTCTGAAGGAAGGGGAGGTTCTGGGCAAGGGTCGCGCCCTCCATTGAGGATTCCAATTCTGGATGTGAAGTGGGAGACGGAGCGTCTACTGACTCATTGGGCATGGGGCTGCGCGGTGAAGCTGAATGTGGTCCCGCCTTATTCGAGATCCGTGCATAGCGTTGCGGCATGGCTTCAGTGCCACTTGATTGATATCGGAGATCTTGATGAAGCTGATGTTATTGCGGAGCAGGTGATCAGCCAGTCGGAGCTGCTAAGTGAGATGTTTTCTTCCGATGATGATGGAGCAATCACATCGCCTAAGCAGGGGACATGTAGAGAGGTCGCCGCGATCTGCAAAGGGCTCGGCTATGGAACATCAAAGACCACGATTCACCGGTGGGCACACGAGGGGGCGATAGCCTCACAAACAATGGAAGATGGCCGTGTCATAGTGGACCTACAAGAGGTGTTAGACAAGCTGGCTACCTGCAATAATGCAATGTAGTTACCGCGTGGGACACCCAATATAGTAAGCTAACGCTCGAATCTTCTGGGTCCAGATCACAAGCATGTGACTGGGCCTTTTGTTATGCATCAACGATGCAACACGAGGGGAGGACAAGACATGGGGTTTGACTCTAGAGCTGCGAAAAAACTCAAAGCAATGTTCAAACAACAATGCCGTGATGCAGGTGCCGTATGCTGGCTCTGCGGACAACCCATCAACTACGATGCCCCACCGAACAGCAGAGACTCATTCGAGCCGGATCACTTCTACCCGCAGGCAACGCACCCAGAGCTGGCAGAAGACCCAGAGAATCTTCGACCATCGCACTGCTCCTGCAACAGATCCCGCAAAGACGGAGTACCAGCCCCCAGCCTCGGGAGCCTATCCGAGCAATGGTGAGCAAACACCAGCTAGGGAGGGGGTATCAAGATCACGGGAACGAAATCCACGGACGGGTGAAGGGTGGGTCAGCTGGCCTCTCTCCCCGCAGATACCCCCCTTACCTCGACCCTCAATAACACGGTGAACCCAACGATCGGAGGTTGAAGTCATTTGGCTACTAATTCGCGCCGGGTAGGCGAACTGGAGCAAGCATTTTGTGATTCGATCGCTGCGTTGGAGGCAGACGGCATTGCGGTGCCGGAAAAGTATTCAGCAGTGGTGATGCTCGGAAGACTCTACGCATTCAACATTGACGAATCCGTGAACACCGACACCGAGCAAGCTACCAAAGCGCTGTACCTAGGTCCGCACTTGATGGGTGTCATGAAGCTGCTTGGCACGGCTCCAACAGAGGCAAGCGGGGAGGATAAATCCAGCGCACCATCCAACGTTGTCGCTGATCGAATGAAGGTGCTCGAAATCATGAAGGAGTACAAGAAGCAAAATGGGGGCTAAAGGAAAAACGGAACCACGGTTTTTCCCTCCACCGCTTCGTGAGCTGACTTCAGAAACGTCAGCGGGATTCGAGGTCATCGAA
TTTGCAAAGCTTCTTGGCATCGAGTTGTACCCGTTCCAGAAATGGGCGCTGATCCATGGCCTAGAGCTTCTCGAAGATGGTTCATTCCGTTGGCGAGTCGTCGTTATTGAGGTCGCCAGGCAAAACGGAAAGACCATGCTGATGGTTGTTCTCGGCCTCTGGAGAATCTTCCAATATGGAGCATCTCGCGTGCTTTCTGCCGCGCAATCGCTCAGCGATGCCGAAGACACATTGAACGAAGCATTTCTAATCGCAGCGTGGAACCCCGTGCTGCGAACATTCCTGCCAGACAATCCGCGAAGTGAGGGAGAAGATGACAAATTCAATGGCGCGTGGCGACCCCGCGCAAACGGCAAAGCTTCGATGAAGCTGGCATCAGCACCTGTCCCTGGGATCTTGGACGTTGCAAAAACCATGCCGATCTGGTCACTAGCTGTCACATCGCGCAAAGGCGGACGCTCCAAGTCAGTTGATCTCGCGCTCCTCGACGAGCTCCGCGAACATCTCGACTGGGAAGCCTGGAACGCTATCGTCCCAACATCAAGAAACCGCCCACAATCACAAGTCTGGGGATTCTCCAACGCAGGAGATCAATCAAGTGTCGTCCTACGATCACTTCGAGAATCTGCGATTAGGCAGATCGATGACGGAAACACCACATCAAAAACAGCGTTTTTCTCATGGTCAGCAGACCCAGAAGCAAGCATCCTAGATCCAGAAGCCCACGCACAGGCCAACCCATCAATGGGGTACTCCAACATCACTGCCGAATCAATCATGGCAGAAGCAGAAGACGCACTATCCGGCGACAATGAAGCCGGATTCCGCGCCGAAGCATTATGCCAGTGGCAGCAAGTCATCACCCCAGGCAAGATCCCAACGAAAATCTGGGAATCTCTCACAGATCCAGAATCCCACCGAGCAACAGATGCACCCGTACACATCGGGATCGATGTCGCCTCCGATGGACGATTCTCCCACATCGCGATCGCATCACAACGCGAAGACGGGCTGTGGCACATCGAGATCATCGCGTCTCGCGCCGGGTTTAAGTGGGTGCCTGAATGGCTTGGGCGTCGAAAGGCAGAGGCATGGTTTCCTGGCAAGGTTGGAATGCAGATCAAAGGTTCTGCATCAGCTAGTTTGGCTCCATTGGTGGAAGAAGCTGGTATTGAGGTCATCCCGTGGCAAGGAACATCGATGTCTGCATCGGTTCTTGGGTTCATCGATGAAATTCGGAATCGAGGGCTCAGGCACAGATCACAGCCAATCCTCAATGTCTCTATTGAAGGTGCAATTGATCGACGTCTGGGTGACATTTCGATCTGGGATCGTGTGAAGTCTGCGACTGATGTTTCTCCAACAGTGGCTGCGAACATTGCGTGGTGGATGGCCACACGCCCTGACGATGACCAATTTGTTTCTGCTTATGCAGATGAAGACTACGACACAGCTATCGATTATGACGGCGATGATTACGACGATGACGATTACTTGCTCATTGTCTAGGAAGGAGGCATCGTGGGATTTCTCCAGAGAATAGGATTGCTGCCACAGGTGACAACGACACCAACGCAGCACGAGCTTTTAGCGCCGCTTCTTGATACATACATCGGTGTGGTCACAGAGATGCCAACAGAAGAGCTGTTCGCAGAACAGCCTCACTTGCGGACGGTCACGACGTTTATCGCGCGAGCGATTTCATCAACATCGTTGCATGTGTATCGCCGAGATTCAGACGGTGGCCGGCATCGCGTCCGCGACTCGGATCTGGCAAAACTCATGCGCAGGTCATCGAAGACTGAGCTCATGCAGGACATGTTGAATGGCTCCATCTTGGATCTGTGTCTGTATGACGAGTTTATTTGGGTGGCCATGGAAGATAGCGATTCAGGGGAGTGGGAACTGCATAGGATCCCACCGACGTGGATCAAACAACGCAAGCACTCAGATCCGTGGACGCTGGAATGGATGGGGATCATTGACGCGAAGACTGGGCA
GCAGATCAAGATCCCAGCAGAACGCATTATCCACGTGCATGGGTACAACCCAACGTCAGCGTCTCGCGGGCTTACTCCAGTCGTTGCGCTTCGCGAGACGCTGAAGGAGCAGCTGGAGTCAGCGGCATATCGTGGGCAGCTCTGGCGCAACGGGCCGCGTCTAGGTGGCGTGATCACACGGCCAAAGGACGCGAAGTGGGATTCAACATCGCGTAAGCGGTTTAAGGCTGCATGGCAATCGCAATACTCTGGGCGTGGTTCAGGTGCTGGCGGAACACCGATCTTGGAAGACGGAATGCAGTTCGTGCCAGCGCATCTGAAAGCACAAGACGAACAGGTTGTTGAGATGACCAAGCTGTCATTGCAGACGGTCGCGAGCATCTACCATGTCAATCCTGTCATGGTCGGGCTTCTTGATAACGCGAATTACTCGAATGTGCGAGAATTCCGACGATCACTGTATGGCGACTCTCTAGGGCCCATCATCAAACAAGTTGAGGGCGTGATCAATGAATTTTTGCGACCCATGATTGACGATGATGATGCGGTGTACGTCGAGTTCAATCTCGATGAAAAGCTACGAGCCAGCTTTGAGGAAAAAGCGGCAGTGACATCAACTGCGGTCGGTGGCCCATGGATGACAAGGAACGAAGCTCGTGCAATGAACAACCTTCCGGCAATTGACGGTGGAGAAGATCTGATTACTCCTTTGAATGTCACTACAGAGAACAGCGAAAGCAATAACGATGTGGGATTGGAGGAAGAATCATGACTATCCATGTCGTTATAGGCCCTCCTTGTTCTGGGAAATCAAGCTTTGTTGAGGCTAATGCTCCAGCTGGGATTGGGCGCTTCGATTTCGACAACATTGCGGGAACTGTTGCAGGGCAAGACGTTAAAAATGCATCACCGAATCCAGTTGCCAATGCCGTGTTGGCGATGCGGCGTGGGCTGATGGGGTGGCTTCTTGACGTTGAACTTGATCCACCAGAGTTTTGGTTGATTAACGCGCAACCATCGCCAGCACTGATAGCAGCTCTGAGTGCACGTGGGGCAACGTTTCATCTTTGTGACCCAGGGATGGAGGAATGCCTCGCACGTGCTGCCAGAGATGGGCGCCCTCAATCTGTCGAAGATCGCATTCGGCAGTGGTACGACAATCCACCTGAGCTACCTACGAAAGGAGGGCACGAAGTGAAAACGAAAAGCTATGACGTTTCGATCGACGAGACACAAACAGAGGGAACAATCACCGCATACGCCAGCGTATTCGGAAACGTTGACTCCTACGGTGACGTTGTCATTCCAGGTGCTTTCGAAGAGACACTGGGTGAATGGCAGAAGTCAGGGAATACAATCCCGCTGTTGTATGGGCATGATTTCAAAGATCCGTTCTCGAACATCGGTGGAGTGACATCGGCTGTTGAAGATGCACACGGGCTCAAAATCACTGCGCAGCTTGACCTCGACAATCCCAAAGCGAGGCAGGTCTACAACCTGTTGAAAGCAAAACGACTCTCACAAATGAGTTTTGCCTTCGATGTTGTTGAAGGCTCGTGGGGAGAACGAGAACAGCAGGAAGTTTACGAGCTGAAAAAGGTCAAGCTCTACGAGGTTTCTGTCGTTCCAATCGGAGCGAATCAGGAGACGAGCATCATTGATGTGAAAGCAGTGGTAGCGGAAGAACTCGCGAATATGCTCGATTCACGTCAACTAACAAAATCAATCAGCAACAAGCCTCCTAAACAGGGGGCTTTTGCTCTATCTGCGCTCGATGCGCATCTCAGCATATTGGAAAAGGAGAACCGATGAACCTCAAGGACCTACTCGCACACCGCGAGAATCTGATGGACAGCGCAAAGCGAGCACGCAGTGCGATCACCGACGATATGGACCCAGCAGACGCAGCGCAGGCGGTGGAGAACGTCAAGAGCATCATTAGCGAGATCGAATCTACTGACGAAGCCATTGCTGCACGTCGAGGGGTTTCTGATGTCACA
CAGAAGCTCAAGGGCCTGACGATCACCGAGCGTGGCACTGAGAACGATTCTGCAGCTTCTCGTTCCCTTGGAGAGCACTTTGTTAAGGCTGCAGGTGACCGACTCAAGAACCAAGCGGCAGGTGCGCATATTGAATATTCAGTTCCTGAATATCAGGTGAAGGAAGATGCTCACAGCTCCCCGAAGGATCTTGTCGAGGGATGGGGAACCTTCTACCAGCGTGGCATCATCAACCAGCGTCGCGAGCGCCTTGTCGCTGCTGATTTGATGGGATCGGCGGTCGTAACCGCATCGACAGTGAAATACATTGTTGAAAAGGCTAACCGTATTGCATCCGGTGCTCCGGCAACAGTCGCAGAAGGAACAAAGAAGCCATATGTAAAGTATGCCGACTTCGATGTTGTCACCGAGTCACTGTCGAAGGTCGCAGCACTGGCAAAGTTCACCGACGAGATGATCGAGGATTACGACTTTGTCGCAAGCTGGATCAACAACAATCTGGTCTATGACCTGTCTGTTGTGGAAGAAAAGCAACTCATTGATGGCGACGGGCGCGGTTCGAACATCAAGGGGCTGCTGAACCGAGAAGGTATCCAGACGCATAAATCTGCAAAGCAAGCAGATTGGTTTAACGATCTGTTCAAAGCCAAGAACAAGGTGTCGCAGGCAACCAACTTGGAAGCAGACGGCATCATGATCAACCCAGTCAACTATGAGGCGCTGCGGTTGACCAAGGACGGCAATGGCCAGTACATCGCAGGTGGACCTTTCCAAGGACAGTATGGCAACGGCAACATCCTCATTGATCCACCTCTATGGGGAATCAAGACAGTTGTGTCCAATGCTGTTCCAGCAGGAACCGCAATTGTCGGAGCATTCCGACAAGGAGCAACCGTCTTGCGCAAGGGTGGTGTCCGCGTCGATTCCGCGAACACGAACGCCGATGACTTCGAGAACAACCTCGTCACGCTGCGCGCAGAGGAACGTCTCGGCCTGATGGTGCCACTCCCAGCAGCATTCGTGAAGGTCACTCTCGAAGAAACTACAGAGGAACTCTAATGCGACGCGAATACGAAGTCACAACACCATACGGGCAGAAGCTAACGCTTGAAATGAGTGAAGATTACAAGAACGCTCATTGGCCTGATGCCGTGCTACTCGAAGACACTTGGCCTGCTGCCTCGTCGTTCGAAGCAGCAGAGACCAAGAGGAAAACGCCAACACGGAATCGAGCACAGAAGCCAGAAGCCGACAAGTAACACACACGGGAAAGGGGAACAATGAATGCGATAACCCCAACGCCAGGAATCGACAAAGAGGCCTTTGACCGCGCAGCAAACGCGGTCCGCAGGCTTTGCGGATGGCACATTTTCCCCATCATCGAGGAGACCATCACACTGGACTCTCCAGGCGATAGTCTCCTTGTCCTTCCCACGAAACACCTCGTCGAGATCATCAACGTCACCATCGACGGAACCACATACCCACTGAGCGATTTCCGAAGCAGCCCAGACGGGCTACTCGTCAAACGCCACGGGAGATTCCCAAGAGGAATAGCGATGGTCACTGTCACCATGAAACACGGATATGAGAAACCGACAGAGATCCTCGGAGTCATCAACGACATGGCACGCCGAGCCAACGAGTCAAACCTCACGCAGCTCAACGTCGGAGGAATCTCAGTCGGAGCAACGAACTCAGCCACACCACAATCATCTGAATGGAGAATCGTCGACGAACTACGACTAGGACCACTGCCATGAGCATCATCTTCAACCAGCGAATCGAGATTATTCGCGCCGGGGAAAAGCGGTCTGTGTACTCGTCCGATGTTATGGAGGATTGGGATAATCCAGTGGTTCTCCCTGTGGAGGTTCCCGTGTCGATTCAACCGGTTTCTTCAACAGAATCAGATGCCACAGCGAATCGCAGCTATGTGACGTCACGATTCAGGTTGTTTTCTCCGCCTGGGATAGACATCCCACAGTTGAAAGCGAAAG
ACCGTGTGCGAATTGGGTTGCTTGTCTTGGATGTCGTGGGAGATCCGGCACGGTGGCCGCATCCATTAAAACCGGCGACTGTGCATCATGTGGAAGCAGATTTGGAGGTGCATCGTGGGTAAATACGACAAGCAATTCCAACAGTTGAATCGCAACCCGAAGATTGCGCAGGCGTTGAAGAACCGTGCGGAGAAGATTCGCGTAGCTGCTCAGCGGATTTCTGATGCAGAAGGTGGGACAGCTCATTACCGCGTGGTGTCAGGCGTGCGTCCAGGAGGCCGTGCGTATGCCTATGTCGTTTCGGACAATCGTGATGAGGAATTCGGAACGGAGAAGACGAAACGAATCGGAGCGCTCCGGAGGGCAGCACGTGGTGGATGAAAAAGCCGTGATTATCAGCAAACTATCGAAGCTGGGGTTTCCAGTGTATTCAGAATTGCCGCATGATTTCGAAGAAAAGCATCTGCCAGTGCTCTGGGTTCAGCATGTGGGGCCTGCAGCTAGGAGGCAGGCGATCAACTCTAGGGGGATCGATTACGTTGACCTCGATATTGATCTCTTCGTCTCACTAGATATGTGGCACACAGGTGCAGCAATGGAACTAGCGCAGACGATACGAACACATATGCATCGATTCCGCGAGGGCATGCTCAAAGTCCTCGACACAGGGAGGCCAATAGCACGCCCTGATTTCAATTCCACAATTCGCCGGTGCGGACTGACCATCACGGTCGCTGTGCCGGCTTAACCAATTTTTCCGTAAGGAGAGTAAGAACATGACTGATTTTGATATTGATACATCTGCTGCCAACTACGCTGACGAGCTTGCATTGCTCGGCGTGACAGGAGCGATGAGCTACGCCCCAAAGGGAACGCAGATGCCAGAAACAATCGCACCTTTGAACCCACCATTTGTGGATTTTGGTTGGCTGTCTGATGGGGGAATCACAGAGTCTCAGAACGAGGAACGAAACGACTGGACACCATTCCAGTCAACCAACCCAATTCGTGGACAGGTAACCAAACAGGATTTCCAGTTCAAGACGGTCGTATGGTCGATCAGCGGCCTTGCCAATGCAATGTATTACGGTGTGCCTGAATCTGACATGCGTTTTGACCAAGAAACAGGCGTCACGACCTTTGAACAGGGCAAGGAACTTCCACCAGACTTCAAGTTCGGCCTCGTCGTGGATATTGTCGATGGCAAGAAAGCTCGGCGACACTGCATGCCGAACGTCTCGGTTGTTGAGCGCGGCGACATCGTCTACAGCAAGGACGATCTTGTTGGCTATGAAATGACATTCCGCGCCAGCTACGACCCAGTCGCTGGATATGCAGTGCGTCGCATGTTCAAGGAAGGCTGGAAGCCAGGACACGCTGGGACGACTCTCACCGACGAAAACAAAGACGCATCGCTAAGTGATTGGTCGAACACACTCGATGAATCCGAAGTCAGAAAACAGAGCAAGACCGTCACCCTGCCAAAAGGTGCGACCGGTGGCACGTTTACCGTTTCTATCAACGGCAAAGCTTCCGCAGCGATTAACCACGATGCAACAGGCACTGCCATGAAGTTGGTGCTTAACAAAGTTGATGGCGGGGAATCTGCCAAAGTTACAGGCCGTGCAGGCGGCCCATACACCATCACAGGTGTTGAAGGAGAAATCACTGCTGACGGCACAAACCTCACAGGTAGTGACACTCAGGACATCTCGATTAACTAACAACCATGGCTCTACGTTTTTGGCAGACCGACGTGGAGCCATACCCATAGGTCTGCTAACCCAATTTTTCATCACATCTACATAGGAGGTCTGCCATGACCATCGATCTCAACGCCATGCTCGCAAAGCGCGCCGAAGTCCTCGGAGAAGGAAACAAGTTTGACGTCAAACTCGGCGACAAGGCGTTCTACTTCGTCGCCCCTGAACTTGCATCATCTGAGTGGAATGATCGCCACCAAGCATTCCTCGAAGACATCCGAGACGGGCTGATGACAT
CAGAAACAGCCCGCGAAGAATTCCTCGGACTAGCGCTCGAAGACCAAGCAGAAGAATTCGCAGAGGCAGCCGATGAAATCGGTGTTGACCCATTCATCATCGCACAGATGGCATTCCAGGAGCACGCCGAATACGTGGGAAAAACCCAATCCCAGAAGCCCTTAAATCGCACCCAGAGGCGTGCGAAGCAGCGCTAATCGCAGAATACGGGCACGACTACGTAGCAGCTTTCTGGCGCGAAGAAATCACAACCAGAAAGCTGCTCGTACTCATTAGCCACCTGCCAGAAAATTCAGCACTCCACAGGAGCCAGCAGCGCGAGACATTCGACGGGAACCTGTGGAACCAAGAACTCTCCATGATGTGGGAGATCAGCAACCTCATCAGGATCACCAATTTTCTGTTAGAACGCAGCCAGGCTAAAAACCCCAACCACGTTAAAGCCCCAAAGCTGAAACTCTATCCGTGGTCACCAGACCAAGACATCAAGCATTACGGAAAGGTGGACGAAGAGGATCAAGTCGACGCCGTGAACTTCCTCCTGGGGCTTTCCCCACCACCGCAATAAGGAGGAAGAACAATGGACAGCGACGCAACATATGTGCCGATTCTTGCCTCATTCGACGGCTTCTTTAAATCCATCGACAAAAATGCGGAAAAAGCTGGACAACAAGCAGCAACCACATTCGCAGAATCGATGGAGCGCAACCTCCAACGCGCAGAACGAGCCGCAGAAAAAGCCGGAACTGTCTTCGAACGCGCCCACAACCGCGCAGCAGACGCAGCAGCAAAGACCCAAATCGCAGAGCTCAAACTCCTTGAAGTCAAAGAAAAACAAGGCGCGAAAGCCTCCGAGGTTGCAGCAGCAGAAGCAAAAGTAGAAAAAGCGCGCCGGGATCAGGAGGCAGCTGATAAGGCTGTAGCAAAGGCTGCAAAATCGTTGGCATCTGCACAAGATGATGTTGCTAAGGCAACGCAGCAAGCCGGCGATGCGATGGAAGTTAACGCTGAAAAAGCTGGGCTGTTCTCAAAGATGACAGGTGGGCTTGGCGACAAGCTTGGGGCGTTGCCTGCGCTTGCTGCGGGTGCTGTTGCGGGGTTCGCTGGTTTCGCGGCGATCAAGGAAACGCTGTTGGATGTTGGTTCAGCATTTGACAGCGCTTATGACACGATCCGTATCGGCACGGGCGCATCGGGTGAGGCATTCGCTGGTTTGCAGCAGTCGATGCGCAATGTAGCAGCCAATAACATCGGAATTGGCGATGATATGGAGGCTGTGGGAACTGCGCTTGCAGACATTAACACTCGTCTGGGATTGACTGGCGCACCGCTGGAAAAGATGACGGCGCAGATGCTGCAGTTACAGCACATGGGTGTTGATGCGGACATTAACGCTGTATCCCAAGCGCTTAACGGGTTCGGCATTGAAGCAGATGCGATGCCAGCAGCGCTTGATTCGCTGTTCCAAGTTTCGCAGGCGACAGGTTTGACCATCACGGAGTTGTCGAATTCTGCGGTAAAAGCTGGCCCTCAGCTTCGACAGTTTGGTTTTAGCATGGCTGATTCAGCAGCTTTGGTGGGCCAGCTCGACAAAGCAGGCGTGAACGCTGATGGTGTGCTGTCGAAGATGTCGAAAGCACTGACAACGTTCGCGGCCGAGGGCAAGGATGCACCAAAGGCGTTGAATGAGACGATCACCTCGATTGAACAGCTGGTCAAAGCTGGTAATTCTCAAGGTGCTATCAATCTTGCCGAAGGAATCTTTGGGGCAAAGGGTGCAGCGCAGTTTGTTGATGCTGTGCAAACTGGAACCCTTTCTGTTGAAGATTTCATGAGTGCGACAGGCGCGACAAATGACACAATTTCTGGGTTAGCTGCTGAAACTGCGTCTTTTAAGGAGCATTGGCATCAGTTCAAGATGCAAGCGATGCTGGCTATCGAGCCTGTGGCAACAGCAGTGTTTAACATGCTGACTCCTGCCATCCTCAATCTGAAAGATG
GGTTTACCTCAGCTATCGAGTTTGTCGAGAACACCCTTGTCCCAGGTTTTAAAAAGATCCCAGACGTACTAGCAGTAACAGCGCAGTGGCTTGAAGACAACAGGAACAAACTGATTGCTCTTGCAGTTGCAGTATCTCCGATTGTTGTTCCTTTCCTCGTCGGGTTGGCTGCGAAGTGGACTGCTGCAGGTGTTGCAGCAACAGTATCAGCAGCAAAGCAAGCGCAAGCGTGGGTACTGACGAAGATTGAGGCAGCTCAAGCAAGTGCTGCAAATATCGCTGCGCTGTGGACAACAGGTGCGGGATGGATCAAAGCAGGCGCGCAAGCAACACTCGGAGCTGGTCAGATTGCTGGTGCATGGCTACTGACTAAAGCACAGTCTGGTGTATCTATTGTGGCGTCGATCGCTGCGGTGGGCATTGGCTGGGTCACGACTGGAATCCAAGCTCTCGCAGGAGCAGCGCAGGTAGCAGCAGCATGGGTAATCGGGCTTGGTCCAATTGCGTGGGTCACTGCAGCAATTGCTGCGGTGGGAGCTGGACTTGTGTGGTTCTTCACTCAAACGGAGACCGGAAAGCAAGCGTGGCAGAGCTTCACCTCTGCGCTACAAGCTGGGTGGGATCTATTTTCCTCAGCATTGCGTACAGGCTGGGAGATGCTCAAAACTGCAGTGTTTGATGCGTGGACAGCCCGTGTGGAGACACTTAAAAGCGTCTGGGAGGCAACAACCGGGGCTATCGGTGCTGGTTGGGAATGGCTCAAAGGTGCCCTGTACAGCGGATGGATGTGGATCTCTGGCAACGTTATCGAAGGGTTCAAAGCTGGACTTAGCGGTCTCAGAGATTTCTTTTCCTCGGTTGTCAATGGGATCAAATCTACCTGGGCGACGCTTCGATCAGCACTGGCAAAACCAGTGAACTTCATGATTAACACTGTCTATAACGGTGGCATTCTCAAGGCTTGGAATGTTATCGCGGGGCTGTTACCAGGGCTGAAACAAGGCAACCCACTCGCAGGAATCCCAGAGCACGCAACAGGTGGCCGAATCGCAGGACCAGGAACCGGAACATCAGACGATGTTCTGATGTGGGGCTCGAATGGTGAGCACATGCTCACAGCGAAGGAAGTCCAACGCGCCGGTGGACACAATGCGATCTACTTCATGCGTGATTTGATCGCAAGCAGAACCCCATTCACGTGGGACGGCGGAAGATTCATCGCTGAACACCGCAAGTCTGTCAACGACTACGGCTCCGAGGTAAAACGCCGAGGAATCGGAAACGTTGATGACAACGGACTGTTCTCAATGCTGCCAAAGTTCAAAGACGGCGGAGCAATCCGCCCAATGTGGGAACTCCAACTCGAAAACGGCCATAAGGCCGCAAAATCGAGAAACGGCAACCCATATACATGGGGCTACGAGGACTGCTCTGGTTACATGTCTGCCATTGCTGACGCGATCCTCCACGGAGGACGCGGAAGACGAGCATGGGCCACGGGTTCCTTCCCAGGTGGTCAACCATGGGCGCCAAGTCTGGGAAAGGGCTTTAGCGTTGGTGTCCATGACAACCCAGGAGGCCCAGGCGGTGGACACACTGCAGGCACCCTCACAGGCGTCGGGCCATATTCAACCGTCAACGTCGAATCGGGAGGAAGCCATGGAAACGTCGCATACGGTGGACCAGCTATTGGCGCAGACCATGCGCAATTCAATGGTGTGCGTCCAGGCAGATTCCACCTAGCGATCGGAGCAGACGGAGCATTCGAAACTGCAGGAAGCGGCGGAGTCTCACCACAAGCACAGCGAAGCATGATCGCTAAAGCCTTCGGTGGTGTCATCAGCACTGTCATGGACCCCATCGCCGCAAAACTGCCAAGCCCACCACCGGCATGGCAAGCAATCCCACGAGGCGCATATGACTCTGGGAAAGACGCACTTGTCAAAGGCGTCGACAATGCTGTCAACTCGATCGAAGATTCACTAGCAACCGTCTACCGA
GGAATCTCAAAGATCCCGAATCTGCTCAAAGAAAAGGGACCGAGAGGAATCGAAAAGAAAGCAAAAATCTACGATAGAGGTGGCATTCTCGGACACGGAGAAATAGCAATCAACCACGGAGCACCAGAGCGCATCCTCCCACCAGCACTGACAGCCAGCTTCGACAAATTCACTACCGTTGTGCCACAGGTCGCAGACCGATTCGGCATCATCGCAGACAAACTTCTCGGAACGAAAATCAGGGAAACCCCAACCGCGCCGGGAAGCTTGAATGCACAAGTGGATATGGAGAAGCTGAATGATCAGCTAAGCGGCCTTCATAGCATGGCAGATAAGGTTGTTCCTGTTCTGGAGACCGTCAATGCGATGGGCATTGAAGGGTTGAAGGGGCTCACCCACATCACGGGTGCATGGAAGGAATACGTAAGCGTGGAAAATTCTGTGGCTAAGGCTGCAGAGGATTCGAAGGCTGCGAATGAAGCCCTAGCGACAGCAAGGAAAGAGCTTGCAGATCTTGAGCGTGAAATCGCGAAGAAGGGACCGAATGCCAAGGGCCAAGCAGAAGATTCAAAGAAGCTAGCAGAGGCTAGGAAGAAAGTTTCTGATGCGGAGTCGAAGGCTGTAGATTCTTCATCTGCGCTGGAAAAGGCGCTGGGTGATGTTGCTGTCGGGCAAATCAAGCTTGCGTTGTCTGTTGTGAGTGCTATCGCTGAGATCGGCAAGGCTATTGCGGGAGCATTCACTGCTGTGTACGAGGGGCAATCACGTGGCTGGTCGCTGGTGTCGCAGATGGCGGAAGAAGTTGAGAAGGGCCGTCAGAAGCTGTCGGAGATGCGCATTGAAAATGCGAATTTGACGATTCAGCAGATTAAGGCGATCAACGATCTTCGGATTGCTCAGTGGGATACTCACCGTGCAGCTTTAAACGGTGCGTTGGGTATTGCGCAAGCGCAGGCTGAGTTGGATAAGCGGCGTCGAGAAGGGATCATGCTGGGTGCTTCTGGTATTGATGCGATGGCTCGTGCCATGGATCGGTACAGGAAAACAGGAATCTTCGCGATTGAGGAAGTTACCTGGTACACCGAGGAGCAAAAACGCAAGATTAAAGCTGCTGAGTGGAAGGTTCATGAGGCTCGGATCCAATCAGCAATTGATCAGCTTGATGCACAAGGCAAAGTAGAACTTGCTGGTCTAGCTGCTGCCGAGGCAACACTGAAGCAGCACACCGCAGTACGGTTGCTTGAACTCTCAGCGCAGAAGTTATCGGCGCAGGCGGCATCTTTTTATGGTGTCACGGCCCAGGGCGCTACTGCGCTTGAGCGGATGAACCAAGGAAAGGCACTGCAAGGCAAGGGTGCTGGTTCGATTTTTGGTGGAATTTTGAAAGGCTTGGGAGGCGCAGCGGGAGGCGCTGCTGCTGGCTTTGCTCTTGGTGGCCCTGTTGGAGCACTCCTTGGTGGCATCGCCGGCTTGCTTTTTGGCGGTGCAGGTCAGGTGATGAATGGTATCGCTGAGATCAAGCAGGGCCGTATGCAGGAATCCACGTACAAAAAGTACGCGGAGGAAGAACTAAAGAAGCTTCCACCGGAGGTTCAAAAAGAGATTCATTCTGCTGGAGTCATGGGTGGTATCGCATCGTTCTTCGGTGGCAATGGCGCTATGATTGCTTCGACACCGTTTGAGGCGATGAAGATCAAGAACCAGTTTGCGACGTGGGACTACGAGAAGAAACTGCAATCGATGGAGCATGATTCATCGATACAGAAGCAGCTTCTGGCAACACAGCGAGCTCAAATTGAGCAGCGTCTTGCAGCTCAAAAGGCAGCGTTGGAAGCAGAAAGAGATGCCGCTGGTTACCATGCTGCAGCAGGAGAAGCAGATAACGAGGGAGTGCGCCAAGCTTATGACGCGCTCGCGCAGGATTCTGCGCGCAGAGCTCGTGATCTCGCTGAAACCGCGCAGCGTGGGAAATCGCAGCTCGATGAAATAGTCGGTCGTCT
GAAAGATCTAGCAGATCGATCAGCGGCAAATATTATCAATCAAGGAATCAACCAAACTGTCGTGGTCAAACTCGAAAAAACCGGCAGGCTCCATACAGACCAGGACATTGCGAAGGCGCTGGAAGAAACGCTTAACGCAGTGAAAGGTCTAGAGGCTCGTGTCGAGATGGTCGAGGCTGCCCGCGCACCGAGTGCATTGGAAGCAGCGCACTCGCAGCTTTAGGAGGTCATAATGCAGGTGATGTTTTACTCACCATCAGGTCAAAAGCTCTTTGGTGAAAATGGATCACCTGCATTCCTCCTGAAAGGGGGAATCACAGATTTAAAGGGGGCAATAGAAGCACGAACAACCGTGATCCCAGAAGTGCCGGGCCAAGTCTTTGATGGGGTCACGATTAAGCCGTTTACGTTCGGTGTGACCATGGTTGTGTATCCAACGAAAGAAATGCCAATGGAGAAAGCAATGCGCAAGACCAGGCAGATCTTCTCGGCATTCGATTACTCGACGTGCAGGATCGATGGAACTGGGATACCTGTGGGGCTGAAGGTGCGGCTGGATTCGATCATCTCCGCGCCATCGCAGGACACAGCGAAGGAGATTGCAGAGGAAATCACGATCACACTCGCTGCTGACGAAGGAATCTTTTGGGCAGATCGAAAAGAAGCCAGCGGAAAGATTGACATCGTCAATTATGGTGACTGTGATCTCTGGCCAGAATACCGATGGCAAAAAACCACGACGATCACATTGCCAAGCGGAGCAAAGATTAATCTTCCTGAAACACCCACGCCACGAGTCTTGCGGACGACACGACATCAAGTGTCGATCACTGATCTAGATGGGAAACCAGACCGCGAGCTGCTCAAAAAGCTTGGAACAGTGTGGCCGGAAGGCGTGCCGAGGAAAGAAACGAAGACATACGAGATCACCCAAAACGGTTACATCATGTGGCGTATTGGGTATCTAGACCCATGGGGGCTGTAAATGGACGTATCGCAGTGGGAGCAATTCGCAAAGCACCGATCTGCTGTCATCCGTGATTACGGAATGTGGATCGGACTCATGGATAACAACATGGAACCCATCGTGGACATGCCAGCACCTGTCAGCATCGATGCCCCTATCACCAGAATGACCCCATCATCATGCAAAGCAGTATTCAAGACGCGAGTCGATGGGCACATTCACCCAATGGTGGATTACCTGATTGCCGAAAATCTGGCGAAAGTCGACGAGCAAGGCCAACTCATCGCAGCGGCACAAGACGCAGTATTCCTAGCGATCGAAGTAGCACAGGGGATCAGAAACGTCTACAAAGGCGTATTCACAGTCGCATTGGGAGACCAAGAATCACCAACACTGCTCGAATTCAACGGGGTCTGCGAAATCCAATGGACGCTCGGCGCACTCCCATGCCCATCAGCACCATATTCATGGACCGGTAAATGGGTCGATCTCAATCAGGACTGGGCTGGAAAATGGTCAAAGACTAGGACGATGGCAGACATCAAAGTCGCAGAAGTTGCCGACGGATTCACCTTGTCAGGGGCTGCCGACACCACAATCGGAAAACTCATTAGCCAGTCACTGCAAGCAATCAACACGATGTTGAAATCGAAAGGCCGAAGCCTGCCGATCGCAGTCAAACCAGTTACATCGAATCCAACATCACCACAGCTAGTGATCCGTCCAACGGACAGATTCATCTGGGACGAAATCAGCGACCTGGCTTTAGCTGCAGGCGTGACCGTGAAATGCAAAACATGGTGGCCCAGCGACCCACCAGTGCCGGGACTTGACCTGAAAGAACCCATCGTCGTCATCGAGATCACTCAGGAGGAATGATGGACCCAGTAATAAAACTCGTCGCTCATGCACAAGAAATGTCCATGACAATCCCCAGAAGACAAGCCTGCATGGTCTACGGAAAACTCAACGTCACACTCAAAAAAGGAGACGTACAAGAAGAACGAGACAAGCGCGTCACAGACGGTTACGT
CCACATCCCAGAAGATATGCCTCAAGGCCGATTCGACTTCGGGTTCACCCGAGAAGACGCAGACGTCAACATCGGAACAGGACAATCAACCTACGAGTTAGCTCTCGACACAGCAGCACGAAGAATCACAGGGCAAGTCCTATTCGAGCAAGACATCATCGCGCCGGGGTTTGGTGATTGGCGGCCGCTGATTGATTTTTCGTGTGGGGATCTTGTCGGTGTTCGTATCTGGGGGAAAGAGCTGGTGCTGCCGGTCACATCAATTACTCGCGAAACAACAGGGTGGCGTGCGCATGTTGGCGGGCAGCTGATCAATGATCGCCGCAAAATCATTTCTGAGAATCGAAAAATCCTTGCTGACATTGAATCAGAGCGCAGAGAGCGAATGAACGAGATCGGAGCAATCTCATCTGTGGCGTCTGCAGCTTCGGTTGCAGCAAAGGACGCTGACGCGAAGGCGGAGACAGCCGATGGGAAAGCCGAGGATGCGCTGGAGAAGTGGAGACGACAAAAAGACCAACTGGATAAAGTCCAGTCAGATCTGATTGAAAAAATACACAGTGGAACCGAATCCAGGATCGTGGGTTGAATGAGCTAGCCGCTCAGCAGGAGGCTATGAAGCGCTACGTTGACCTATCGAGGCCTTCCTCTGCCACAGTATCAACGTGGGACCCTGTGTGGGCCGGCCCCGTACACGTCAGCTACCCATCCAACAATCATATCCAGCTCTATCTCAAAGACTCGCCTTACACAATCGGAGCATCAATACTGGGTATCGCCCGCGTCAATGCACTCCGCGGGTACTCATTTTCCTTCACCGCAGATATGACCGCTGGCCAGACATTCACGCCACAGGTCGGCGGATTCGAAGCCTTTAACCAAGTATCTGTCACGGTGCACCCGATCGTGAATTTCGCAGCCATTTTGACCGAAGAACGCAGAAAGAGAGGCCTGCAATAATGCCTAGACTTGCAGGAAAATTGGAAACAATAACAAACACACCGTCGAGGGTTCGTGAGGTGCTTTTGCGTGCTGCACGCACCCGCACAGCCGGCAAAGCAGTCATCGTGGATGAACCCGTTCGCGTGATAGTCAACGAATCAGGTGAGTTCACGGCGGATACCGCGCCGGGTGCTGCGGTGTTGGTGCTGGTGGGGGCGGATTTTATGGCCCGTGAGTCAATACCCCTGCTGGTCGCTGAGGGGATGACCACAATCGCGGAGGCGATGGAAGCTGCGGAGGATTTCACGCCGGAGGTGCACGACAAATTGGCTGAGCTGGCGGCTGAGGTGGCTCGTGGTGTGAAATCCACGGGTGAGGCAGTGAAGTCTGTGTCTGCTGATCGTGAGAAGACAGAAAAGTCTGCTGCTGCTGCTGAGGAATCTGCGGCTACGGCTAGTAAGGATGCTCAGACGGCGTTGTCTGCGTGGCAGCAGTTGAAGGCGCGTCTTGATCAGTGGGAGGCACGGTCACAGCAGCTGGAGAAGTGGCAGCCACAGTACGAGTGGCTGAAAGAAAACGCAGGGAAGTCTTTTGCTGGGGTGCAGGAGAAGATCGCAGAAGCCGCACAGGCCCTGATCTTGCAGGTCAAGCGTGATGCGGAGTCTGCGAAACGGGACGCAGCGAACGCGGGACAGCATTCTTTGAAAGCGCAGGCAGCTGCTAAATCTGTGGAGTCGGCGGCGCAGTCTGCTGTCGATGCTGTGGTGAAGAAAATTCTTGACGGTGCGCCACAGGCTTATGACACGCTGAAAGAAGTCGCGGAGGAGCTTTCGTCTCAAAAGAGTGCTGCGGCCGCGCTGATCAAGCAGCTTTCCGAGAAAGCGTCACAAGCGGATGTGGCGACGCTTGCCCAGAAAATCACGAATTTGGGTATTGATGGGGTTCGCGGCTTATCTGACGCTTTGGCCAGTAAGGCGAATACGAACCATACGCACACTACCTCACAGATCATGGGGCTTAGCCAGCTGGAAAGTAAGTATTTCACCATTGAGGCTGAG
ATTGGTAAAAAGGCTGGAAAAGAAGAGATCGCCGACATGGCCACCAAATCCGATGTGCAGGATCTACAAGCCAAGGTGGGCACTATCCCGGAGATACGGGTGGTGTCGCAGATGCCGACACGTCCAGATAACTCGACTATTTATCTGGTTCGGTGATTGGTGATGGCTGGCTTGAGAGTCGGGGGTAAGACCCCTAGCAAGATTTACTACGGACGAGCTTCGGTAAGACAGGTGTATTACGGCGCTGCTCGGGTTTGGCCCGAGATCCCAGCATGGGATTATGGGCGTTACTACAGCCTTGGTGATGTAGTCGAGTATGAAGGTGGGATTTACCGATGCATCCAGTATCATAAATCGGGTATACAACACACCCCCATAGTCCAGTGGCTATGGATACCTATTTAATAACTGGTAGCGCTGAACCCGGAAGCGTCGTTTTCTGTGGCTGATGATCAACGTTGAGTAACCCCCGCACCCGCATAGGGTGCATTTTTTATACCCTCACCAACCGAGGTGGGGGCTATGCATAGAAAGGAACGTCCCCGTGAAAAACTGGGAAACCCTAGAACCAGACAAATACAACCTCCTCACCAAAAACTTCAGCCCAGGCAGAGGCGGCGAATCAATCAAATTCATCACCCTCCACCACATGGCCATGGTAGGTGGTGTCGACGAATGCGTCCGAGTCTGGTCACAACGACCCGCATCCGCACACTACTGCATCGGCCCTACCGGTGAAATCGGCCAAGCAGTCAACGACTGGGACACCGCATGGGCAAACGCAAACCTATTGTCTAACCAACGCTCCATCGCAATTGAGCACTCCAACTCTGCCGGTGCATCACAAGACTGGCCAATAGGGGAGAAAACCCTCGAAGAAGGAGCGCACCTTGTAGCCGCACTGTGCCGCTACTACGGATTAGGACGACCAGAATCCGGAAAGAATATCCGATTCCACTGCACCGAATCCGGTGGCGCTACTTCCTGCCCGTACCACCTACGACCAGGACACAAGTACCACGATTTATATATGAGCCGAGCACAGTGGTGGTACGACAACCCCGCAGGTGGACACACACAAAACACAATGGTTTCAAAGGAGAAGAAGAACATGAATGAGGCATACTCGCGTGACATCAAAGCACAGCTGACAGGAAGCGAAGACCTAGGACAGTACCCCGGCTGGGGTCAGCTGGGAGGTCGTACCATCGTTGATGCGTTGGGTGCGATTGGTGAAAAACTGGGAATGGATGGCTTTTACGATACAAAGGCAGGCAAATAATGACAGTTGAGTTTTGGAAAGATTTAGCTGAGCGTGCGATCAAGACATTTGCGCAGGCACTGCTTGCTGTGCTGGCTGTCGGGGTGCCGATATGGGAGCTGGACTGGTCAGGTGCTTTTGGCATTGCTGCTACGGCCACAGTGATTAGTGTTTTGACATCGATTGCCTCGATCAGTGTCGGTACTCAGGGGACTGCTAGTGCGGTGTCGTCTCCTGCACGACACCGTAAGGAGTAGAACTGATGCCTATTGAGCATTTGCCTTCTCGTGTCCAGCCGACTGCCCGCCGTGTGCGGGCTTTTTTGATGACTGATTCCACGGCGCTACTGCTGCTTGCTGTGGTGCAGGCTGCGATGGGGTTGTATTATCTCCCTGGGGTTTTGGGGGATCCGTTGCGGTGGCAGCGGCCGGTGGAATCGATCATGCCGATTATTGCGTGGGCATGGGTCCACCTCGCGGTTGGTGGGCTATGCGCCGTTGCTGCGGTGACTGATAGGTGGCATGTGGATATCGTTGCTCTCGCTCTTGCGACCGGTCTCAATTTGTCGTGGGCTTTTAGTCTGCTTGCTGCGTCTGTAGAGCACAATCAGTCAGTGCTATGGCTTGTTGGAGTTCTTATTCTTGCCATGACGGTTTCGCTGATGTGGGCTGTGTGGCGTGGTAAGCGTGGGGATATTCCTTTTGCTAAGGAAGGGGGAGCTGCATGAGTGTTGTAGCTG
CTTTTTTAAGTGGTGTGGGTGCGCTTGTCACGGCGTTAGGTGGTGTGTTGATCGGTGTGGTGAAAGCCAGGTCGGATACACATACTGCGAAAGGTTCGCGCATGGACGTGCTGGAGGCACGCATTGACAAAATGCAGGCTGATTTAGATGATGAGCGTGCGCGTCGCCGTGGTGTGGAGGTAGATAATCACCGGCTGCGTATGGCATTGGTAACCGCGGTAAAGCATCTAGAGCAGCTCATACGCTGGGCTGACGGTGGTGCGAAGCCACCTAGGCCTGATGATATTGATCTAGATGAGATTAAAAGCCTGTTGAAGGCGTAAAAAGATGGCCCCCTTTGGTTGAGATGATACACCTAGGCTAAAGGGGGCTATCTTTGCGTGTGGTACACCTGATCTGGGCCCGGTTCATGTTGTGGTGGTCAACGCTGGGGTAACCGGCGTTGCGTATCCAGTGGCTACACTCAGGTTGTAATGATTGGGATGATGTACCTGATCTGAGAGCGATTAAAAACTCATTGAGGAGTAGGTCCCGATTGGTTTTTGCTAGTGAAGCTTAGCTAGCTTTCCCCATGTAACCAATCTATCAAAAAAGGGCATTGATTTCAGAGCACCCTTATAATTAGGATAGCTTTACCTAATTATTTTATGAGTCCTGGTAAGGGGATACGTTGTGAGCAGAAAACTGTTTGCGTCAATCTTAATAGGGGCGCTACTGGGGATAGGGGCCCCACCTTCAGCCCATGCAGGCGCTGATGATGTTGTTGATTCTTCTAAATCTTTTGTGATGGAAAACTTTTCTTCGTACCACGGGACTAAACCTGGTTATGTAGATTCCATTCAAAAAGGTATACAAAAGCCAAAATCTGGTACACAAGGAAATTATGACGATGATTGGAAAGGGTTTTATAGTACCGACAATAAATACGACGCTGCGGGATACTCTGTAGATAATGAAAACCCGCTCTCTGGAAAAGCTGGAGGCGTGGTCAAAGTGACGTATCCAGGACTGACGAAGGTTCTCGCACTAAAAGTGGATAATGCCGAAACTATTAAGAAAGAGTTAGGTTTAAGTCTCACTGAACCGTTGATGGAGCAAGTCGGAACGGAAGAGTTTATCAAAAGGTTCGGTGATGGTGCTTCGCGTGTAGTGCTCAGCCTTCCCTTCGCTGAGGGGAGTTCTAGCGTTGAATATATTAATAACTGGGAACAGGCGAAAGCGTTAAGCGTAGAACTTGAGATTAATTTTGAAACCCGTGGAAAACGTGGCCAAGATGCGATGTATGAGTATATGGCTCAAGCCTGTGCAGGAAATCGTGTCAGGCGATCAGTAGGTAGCTCATTGTCATGCATAAATCTTGATTGGGATGTCATAAGGGATAAAACTAAGACAAAGATAGAGTCTTTGAAAGAGCATGGCCCTATCAAAAATAAAATGAGCGAAAGTCCCAATAAAACAGTATCTGAGGAAAAAGCTAAACAATACCTAGAAGAATTTCATCAAACGGCATTAGAGCATCCTGAATTGTCAGAACTTAAAACCGTTACTGGGACCAATCCTGTATTCGCTGGGGCTAACTATGCGGCGTGGGCAGTAAACGTTGCGCAAGTTATCGATAGCGAAACAGCTGATAATTTGGAAAAGACAACTGCTGCTCTTTCGATACTTCCTGGTATCGGCAGCGTAATGGGCATTGCAGACGGTGCCGTTCACCACAATACAGAAGAGATAGTGGCACAATCAATAGCTTTATCGTCTTTAATGGTTGCTCAAGCTATTCCATTGGTAGGAGAGCTAGTTGATATTGGTTTCGCTGCATATAATTTTGTAGAGAGTATTATCAATTTATTTCAAGTAGTTCATAATTCGTATAATCGTCCCGCGTATTCTCCGGGGCATAAAACGCAACCATTTCTTCATGACGGGTATGCTGTCAGTTGGAACACTGTTGAAGATTCGATAATCCGAACTGGTTTTCAAGGGGAGAGTGGGCACGACAT
AAAAATTACTGCTGAAAATACCCCGCTTCCAATCGCGGGTGTCCTACTACCGACTATTCCTGGAAAGCTGGACGTTAATAAGTCCAAGACTCATATTTCCGTAAATGGTCGGAAAATAAGGATGCGTTGCAGAGCTATAGACGGTGATGTAACTTTTTGTCGCCCTAAATCTCCTGTTTATGTTGGTAATGGTGTGCATGCGAATCTTCACGTGGCATTTCACAGAAGCAGCTCGGAGAAAATTCATTCTAATGAAATTTCGTCGGATTCCATAGGCGTTCTTGGGTACCAGAAAACAGTAGATCACACCAAGGTTAATTCTAAGCTATCGCTATTTTTTGAAATCAAAAGCTGA
Protein sequences of DBSCAN-SWA_1 >NC_016785|140991:175272|156725_157976_+|WP_014306312.1|capsid|DBSCAN-SWA MNLKDLLAHRENLMDSAKRARSAITDDMDPADAAQAVENVKSIISEIESTDEAIAARRGVSDVTQKLKGLTITERGTENDSAASRSLGEHFVKAAGDRLKNQAAGAHIEYSVPEYQVKEDAHSSPKDLVEGWGTFYQRGIINQRRERLVAADLMGSAVVTASTVKYIVEKANRIASGAPATVAEGTKKPYVKYADFDVVTESLSKVAALAKFTDEMIEDYDFVASWINNNLVYDLSVVEEKQLIDGDGRGSNIKGLLNREGIQTHKSAKQADWFNDLFKAKNKVSQATNLEADGIMINPVNYEALRLTKDGNGQYIAGGPFQGQYGNGNILIDPPLWGIKTVVSNAVPAGTAIVGAFRQGATVLRKGGVRVDSANTNADDFENNLVTLRAEERLGLMVPLPAAFVKVTLEETTEEL >NC_016785|140991:175272|172440_172905_+|WP_014306322.1|DBSCAN-SWA MPIEHLPSRVQPTARRVRAFLMTDSTALLLLAVVQAAMGLYYLPGVLGDPLRWQRPVESIMPIIAWAWVHLAVGGLCAVAAVTDRWHVDIVALALATGLNLSWAFSLLAASVEHNQSVLWLVGVLILAMTVSLMWAVWRGKRGDIPFAKEGGAA >NC_016785|140991:175272|150146_151355_+|WP_014306309.1|DBSCAN-SWA MAWSRVGDNIATHPLMSRLLTSCEFDHSLKNEAFGALVQLTTVSAAHLTDYIIEYGLMAQIAPGREKQLIDVLVDAGMLFRDEVDGRKVLRIVDDNELLHNRSRDEVEIDRRRAADKRNPALIPAVRYRDGDQCRWCGKTVDWRDRKSWRAATIDSLNEHRESTVDTLVVACKSCNSKRGAGEELQLLPTPTREKVHYTDHTIDWINRSEWAQHEGIHLEPRQTHLDIGQQITTPAAPSEQQQVGQAAAPLEAAARAHRAAPDVEAPFVSDPLDEAPDWVKQSLVNDHGQAAAPSMAAAPREHDHAPAAPSEQQQVGQAAAPLEAAARAHRAAPGVEAPPHNHVDNETVKPSTDLEQITDRWGDGSRSLGTGRDGNGQAGTVANRRRRRRGRRGGGRNKAHG >NC_016785|140991:175272|169876_171073_+|WP_014306319.1|DBSCAN-SWA MPRLAGKLETITNTPSRVREVLLRAARTRTAGKAVIVDEPVRVIVNESGEFTADTAPGAAVLVLVGADFMARESIPLLVAEGMTTIAEAMEAAEDFTPEVHDKLAELAAEVARGVKSTGEAVKSVSADREKTEKSAAAAEESAATASKDAQTALSAWQQLKARLDQWEARSQQLEKWQPQYEWLKENAGKSFAGVQEKIAEAAQALILQVKRDAESAKRDAANAGQHSLKAQAAAKSVESAAQSAVDAVVKKILDGAPQAYDTLKEVAEELSSQKSAAAALIKQLSEKASQADVATLAQKITNLGIDGVRGLSDALASKANTNHTHTTSQIMGLSQLESKYFTIEAEIGKKAGKEEIADMATKSDVQDLQAKVGTIPEIRVVSQMPTRPDNSTIYLVR >NC_016785|140991:175272|154438_155689_+|WP_014306311.1|portal|DBSCAN-SWA 
MGFLQRIGLLPQVTTTPTQHELLAPLLDTYIGVVTEMPTEELFAEQPHLRTVTTFIARAISSTSLHVYRRDSDGGRHRVRDSDLAKLMRRSSKTELMQDMLNGSILDLCLYDEFIWVAMEDSDSGEWELHRIPPTWIKQRKHSDPWTLEWMGIIDAKTGQQIKIPAERIIHVHGYNPTSASRGLTPVVALRETLKEQLESAAYRGQLWRNGPRLGGVITRPKDAKWDSTSRKRFKAAWQSQYSGRGSGAGGTPILEDGMQFVPAHLKAQDEQVVEMTKLSLQTVASIYHVNPVMVGLLDNANYSNVREFRRSLYGDSLGPIIKQVEGVINEFLRPMIDDDDAVYVEFNLDEKLRASFEEKAAVTSTAVGGPWMTRNEARAMNNLPAIDGGEDLITPLNVTTENSESNNDVGLEEES >NC_016785|140991:175272|145318_145786_+|WP_041734806.1|DBSCAN-SWA MAWERIGEDLHGRRKVIDLTGVTPKQQRPRQVKARQALERAIAQLKREETGFSVYDAAGVPEEWLPLIAVRLEEVFCTLLRFQREVNAVVASDPDFATVRKLENLKGVELLQQRTAGEILETALLLSEWAPVESVAVVRGMIQTLGDTEVDGSDI >NC_016785|140991:175272|146009_146390_-|WP_003850198.1|DBSCAN-SWA MDEKTFTLNINGYEYEMPYWTFPWIVYAVNTLQEGGVPLNLRTARFPSLWIKPEYDVQFVFPDPADPEIPERLTEKERHHAMDLFNGDSMYGYRVLDVSMDDSEPIFFEWDDPKFNQRSSADSKPE >NC_016785|140991:175272|171461_172199_+|WP_014306320.1|DBSCAN-SWA MKNWETLEPDKYNLLTKNFSPGRGGESIKFITLHHMAMVGGVDECVRVWSQRPASAHYCIGPTGEIGQAVNDWDTAWANANLLSNQRSIAIEHSNSAGASQDWPIGEKTLEEGAHLVAALCRYYGLGRPESGKNIRFHCTESGGATSCPYHLRPGHKYHDLYMSRAQWWYDNPAGGHTQNTMVSKEKKNMNEAYSRDIKAQLTGSEDLGQYPGWGQLGGRTIVDALGAIGEKLGMDGFYDTKAGK >NC_016785|140991:175272|159287_159665_+|WP_003850238.1|DBSCAN-SWA MVDEKAVIISKLSKLGFPVYSELPHDFEEKHLPVLWVQHVGPAARRQAINSRGIDYVDLDIDLFVSLDMWHTGAAMELAQTIRTHMHRFREGMLKVLDTGRPIARPDFNSTIRRCGLTITVAVPA >NC_016785|140991:175272|155685_156729_+|WP_003850232.1|head,protease|DBSCAN-SWA MTIHVVIGPPCSGKSSFVEANAPAGIGRFDFDNIAGTVAGQDVKNASPNPVANAVLAMRRGLMGWLLDVELDPPEFWLINAQPSPALIAALSARGATFHLCDPGMEECLARAARDGRPQSVEDRIRQWYDNPPELPTKGGHEVKTKSYDVSIDETQTEGTITAYASVFGNVDSYGDVVIPGAFEETLGEWQKSGNTIPLLYGHDFKDPFSNIGGVTSAVEDAHGLKITAQLDLDNPKARQVYNLLKAKRLSQMSFAFDVVEGSWGEREQQEVYELKKVKLYEVSVVPIGANQETSIIDVKAVVAEELANMLDSRQLTKSISNKPPKQGAFALSALDAHLSILEKENR >NC_016785|140991:175272|145035_145272_+|WP_003850195.1|DBSCAN-SWA MANVKIKIRDGLIDRLRNMSGITSDEAFARTIGTSRSTLVDVKTGEREPSLAFAIGIAQAFGLGLSEIVTWETETTAA >NC_016785|140991:175272|143351_143915_-|WP_010934088.1|DBSCAN-SWA 
MAHTTGENLFTMKRVLRNLEADHDALREALADGRRMRDLLPYGDYINRYRNLTRDPMVVAIHKQFHESGLGGLTIDESMFPLSLRFEDDDVTLVFRRPEGFRHGTNEESSQRPLIERRSRIVLFWKYAEPGTDALKRISLQMFDNEGPLEEATMLSEKIPLLTKPESLTTAMFIPTHADEPRFVFGS >NC_016785|140991:175272|144009_144417_-|WP_010934089.1|DBSCAN-SWA MSPVEAALETLAHTMGITVIETSKLGSTLNACFHPPTQTIFIKIGLDPVTRRCAIAHELGHAHYGHNCSTPGAERQADEWAAQQLLDVGDVEAVGLECEGSAAAMAAELGVTPHLLVLWMGMYERGRIQPEKRAC >NC_016785|140991:175272|161497_167140_+|WP_014306316.1|DBSCAN-SWA MDSDATYVPILASFDGFFKSIDKNAEKAGQQAATTFAESMERNLQRAERAAEKAGTVFERAHNRAADAAAKTQIAELKLLEVKEKQGAKASEVAAAEAKVEKARRDQEAADKAVAKAAKSLASAQDDVAKATQQAGDAMEVNAEKAGLFSKMTGGLGDKLGALPALAAGAVAGFAGFAAIKETLLDVGSAFDSAYDTIRIGTGASGEAFAGLQQSMRNVAANNIGIGDDMEAVGTALADINTRLGLTGAPLEKMTAQMLQLQHMGVDADINAVSQALNGFGIEADAMPAALDSLFQVSQATGLTITELSNSAVKAGPQLRQFGFSMADSAALVGQLDKAGVNADGVLSKMSKALTTFAAEGKDAPKALNETITSIEQLVKAGNSQGAINLAEGIFGAKGAAQFVDAVQTGTLSVEDFMSATGATNDTISGLAAETASFKEHWHQFKMQAMLAIEPVATAVFNMLTPAILNLKDGFTSAIEFVENTLVPGFKKIPDVLAVTAQWLEDNRNKLIALAVAVSPIVVPFLVGLAAKWTAAGVAATVSAAKQAQAWVLTKIEAAQASAANIAALWTTGAGWIKAGAQATLGAGQIAGAWLLTKAQSGVSIVASIAAVGIGWVTTGIQALAGAAQVAAAWVIGLGPIAWVTAAIAAVGAGLVWFFTQTETGKQAWQSFTSALQAGWDLFSSALRTGWEMLKTAVFDAWTARVETLKSVWEATTGAIGAGWEWLKGALYSGWMWISGNVIEGFKAGLSGLRDFFSSVVNGIKSTWATLRSALAKPVNFMINTVYNGGILKAWNVIAGLLPGLKQGNPLAGIPEHATGGRIAGPGTGTSDDVLMWGSNGEHMLTAKEVQRAGGHNAIYFMRDLIASRTPFTWDGGRFIAEHRKSVNDYGSEVKRRGIGNVDDNGLFSMLPKFKDGGAIRPMWELQLENGHKAAKSRNGNPYTWGYEDCSGYMSAIADAILHGGRGRRAWATGSFPGGQPWAPSLGKGFSVGVHDNPGGPGGGHTAGTLTGVGPYSTVNVESGGSHGNVAYGGPAIGADHAQFNGVRPGRFHLAIGADGAFETAGSGGVSPQAQRSMIAKAFGGVISTVMDPIAAKLPSPPPAWQAIPRGAYDSGKDALVKGVDNAVNSIEDSLATVYRGISKIPNLLKEKGPRGIEKKAKIYDRGGILGHGEIAINHGAPERILPPALTASFDKFTTVVPQVADRFGIIADKLLGTKIRETPTAPGSLNAQVDMEKLNDQLSGLHSMADKVVPVLETVNAMGIEGLKGLTHITGAWKEYVSVENSVAKAAEDSKAANEALATARKELADLEREIAKKGPNAKGQAEDSKKLAEARKKVSDAESKAVDSSSALEKALGDVAVGQIKLALSVVSAIAEIGKAIAGAFTAVYEGQSRGWSLVSQMAEEVEKGRQKLSEMRIENANLTIQQIKAINDLRIAQWDTHRAALNGALGIAQAQAELDKRRREGIMLGASGIDAMARAMDRYRKTGIFAIEEVTWYTEEQKRKIKAAEWKVHEARIQSAIDQLDAQGKVELAGLAAAE
ATLKQHTAVRLLELSAQKLSAQAASFYGVTAQGATALERMNQGKALQGKGAGSIFGGILKGLGGAAGGAAAGFALGGPVGALLGGIAGLLFGGAGQVMNGIAEIKQGRMQESTYKKYAEEELKKLPPEVQKEIHSAGVMGGIASFFGGNGAMIASTPFEAMKIKNQFATWDYEKKLQSMEHDSSIQKQLLATQRAQIEQRLAAQKAALEAERDAAGYHAAAGEADNEGVRQAYDALAQDSARRARDLAETAQRGKSQLDEIVGRLKDLADRSAANIINQGINQTVVVKLEKTGRLHTDQDIAKALEETLNAVKGLEARVEMVEAARAPSALEAAHSQL >NC_016785|140991:175272|159031_159298_+|WP_014306313.1|DBSCAN-SWA MGKYDKQFQQLNRNPKIAQALKNRAEKIRVAAQRISDAEGGTAHYRVVSGVRPGGRAYAYVVSDNRDEEFGTEKTKRIGALRRAARGG >NC_016785|140991:175272|152824_154426_+|WP_003850227.1|DBSCAN-SWA MGAKGKTEPRFFPPPLRELTSETSAGFEVIEFAKLLGIELYPFQKWALIHGLELLEDGSFRWRVVVIEVARQNGKTMLMVVLGLWRIFQYGASRVLSAAQSLSDAEDTLNEAFLIAAWNPVLRTFLPDNPRSEGEDDKFNGAWRPRANGKASMKLASAPVPGILDVAKTMPIWSLAVTSRKGGRSKSVDLALLDELREHLDWEAWNAIVPTSRNRPQSQVWGFSNAGDQSSVVLRSLRESAIRQIDDGNTTSKTAFFSWSADPEASILDPEAHAQANPSMGYSNITAESIMAEAEDALSGDNEAGFRAEALCQWQQVITPGKIPTKIWESLTDPESHRATDAPVHIGIDVASDGRFSHIAIASQREDGLWHIEIIASRAGFKWVPEWLGRRKAEAWFPGKVGMQIKGSASASLAPLVEEAGIEVIPWQGTSMSASVLGFIDEIRNRGLRHRSQPILNVSIEGAIDRRLGDISIWDRVKSATDVSPTVAANIAWWMATRPDDDQFVSAYADEDYDTAIDYDGDDYDDDDYLLIV >NC_016785|140991:175272|152019_152337_+|WP_003850222.1|DBSCAN-SWA MHQRCNTRGGQDMGFDSRAAKKLKAMFKQQCRDAGAVCWLCGQPINYDAPPNSRDSFEPDHFYPQATHPELAEDPENLRPSHCSCNRSRKDGVPAPSLGSLSEQW >NC_016785|140991:175272|146577_146847_-|WP_014306307.1|DBSCAN-SWA MMGTSDNPNDGFSHEDIEFLLREKGLSTSADLVSQLRDLEYEIKTGLMKPAEAYNQCIQFTRRTSSVITTQFAILETRIRAIESQLSSE >NC_016785|140991:175272|147775_147976_+|WP_003850203.1|DBSCAN-SWA MGNTTTIDRWLSPSQAAEIIPYSAWQIRKFCRQGILPHSKRPGSKQNRIMIKHSDLVNFIQQGAAA >NC_016785|140991:175272|172198_172435_+|WP_014306321.1|holin|DBSCAN-SWA MTVEFWKDLAERAIKTFAQALLAVLAVGVPIWELDWSGAFGIAATATVISVLTSIASISVGTQGTASAVSSPARHRKE >NC_016785|140991:175272|158676_159039_+|WP_010934108.1|DBSCAN-SWA MSIIFNQRIEIIRAGEKRSVYSSDVMEDWDNPVVLPVEVPVSIQPVSSTESDATANRSYVTSRFRLFSPPGIDIPQLKAKDRVRIGLLVLDVVGDPARWPHPLKPATVHHVEADLEVHRG >NC_016785|140991:175272|146937_147756_+|WP_003850201.1|DBSCAN-SWA 
MELQLFNFRGKRVRVLTNQDGEPQWVGKDICEILEIKNSRDALSRIDPEGVGIADTLTPGGVQKLRVVNESGLYELLFQSRVPQAKEFRRWVTGEVLPEIRRHGMYATTATVEQMLADPTTAIKLLEQIKQERDQRKALEAQAAIDKPKVMFADAVAEANTDILVRDLAKILRGNGIEVGGNRLFAWLRKHKYLMDGPSHIKHTPTQKAMELGLFKIKETVVTRSDGRSSITVTPKVTGKGQRYFVERFLDGRFDIDDIKTNKNRPVAPGRK >NC_016785|140991:175272|149009_149711_+|WP_003850214.1|DBSCAN-SWA MTKTQLVCDKTQLNWGLKAAKAIAGKSPYDIIQMRVSPDRDYLYICAVNDKATLVAKVELLVANVSSEEDEIITIDKAKVPALILATAETGKKSEDSRPMAGICIRGKEVDFTDENGAGRGVDLTTIHRNDSAEIGDPVRTIIRVKQQLAETSPCDVTPSPAQITELARATRYLGGKPKLSMRSYVHEGTESHRLVAEATFWTLSVLNVPEVLLETEKDKANIVDAAPIGGIS >NC_016785|140991:175272|151347_151935_+|WP_014306310.1|DBSCAN-SWA MDSWKLHSLGKALYELEKLGPLLDDLLLPSQCGYSEGRGGSGQGSRPPLRIPILDVKWETERLLTHWAWGCAVKLNVVPPYSRSVHSVAAWLQCHLIDIGDLDEADVIAEQVISQSELLSEMFSSDDDGAITSPKQGTCREVAAICKGLGYGTSKTTIHRWAHEGAIASQTMEDGRVIVDLQEVLDKLATCNNAM >NC_016785|140991:175272|140991_142218_-|WP_014306303.1|integrase|DBSCAN-SWA MATVRDLWTKRNPNTTSKTKRIRSARWGVGKRWQTVWIENGREATKTFETRDEAELWAARAEVGQADGTWITKDKVDITLSDLWEPWIASKGNISDKTKRDYLSYWNVHIRPQWGQTPCAHIQRSVINAWIPTLSTMKSVPASQPPRALSESAMRKVGLIIHGILDLAVELGVIHQNPTKTGDLPKQKKSERRYLKITEVDELIRQAPTEQAKLLLRVLIMTGLRPGEAKGLKVKDLDPVRGRLMIRRDVDDLGREDSTKTRNHRDVPIGGEILVLLDRNAQGKDPDDWLIPDERGKVWTTARWRVVWKNLCIWTGIGDLDTYELRHTAASIAIAAGADVKTVQLMLGHSSAAMTLDIYAHLWEEGLDAIPGAMEAHMESERKRAEEASTVHEASEAERRRAQFKVIG >NC_016785|140991:175272|157975_158176_+|WP_003850234.1|DBSCAN-SWA MRREYEVTTPYGQKLTLEMSEDYKNAHWPDAVLLEDTWPAASSFEAAETKRKTPTRNRAQKPEADK >NC_016785|140991:175272|161275_161485_+|WP_010934113.1|DBSCAN-SWA MMWEISNLIRITNFLLERSQAKNPNHVKAPKLKLYPWSPDQDIKHYGKVDEEDQVDAVNFLLGLSPPPQ >NC_016785|140991:175272|172901_173240_+|WP_010934122.1|DBSCAN-SWA MSVVAAFLSGVGALVTALGGVLIGVVKARSDTHTAKGSRMDVLEARIDKMQADLDDERARRRGVEVDNHRLRMALVTAVKHLEQLIRWADGGAKPPRPDDIDLDEIKSLLKA >NC_016785|140991:175272|167902_168763_+|WP_010934115.1|DBSCAN-SWA 
MDVSQWEQFAKHRSAVIRDYGMWIGLMDNNMEPIVDMPAPVSIDAPITRMTPSSCKAVFKTRVDGHIHPMVDYLIAENLAKVDEQGQLIAAAQDAVFLAIEVAQGIRNVYKGVFTVALGDQESPTLLEFNGVCEIQWTLGALPCPSAPYSWTGKWVDLNQDWAGKWSKTRTMADIKVAEVADGFTLSGAADTTIGKLISQSLQAINTMLKSKGRSLPIAVKPVTSNPTSPQLVIRPTDRFIWDEISDLALAAGVTVKCKTWWPSDPPVPGLDLKEPIVVIEITQEE >NC_016785|140991:175272|144426_144918_-|WP_014306305.1|DBSCAN-SWA MEQLTSWLRSFRGTDSDRQIAINSGIPVATLARQLRDGTVTVETAVKIARAYQVSVVPALLALDVLTEFDLKAFSTSSGIMDASDEDLVAEILRRMKAGQADWAEKPISELDTRRKAKRGNNSPTAPPHVTEPDYDAILDGINAGTEPIAAQKATDPLEENYT >NC_016785|140991:175272|173589_175272_+|WP_003850266.1|DBSCAN-SWA MSRKLFASILIGALLGIGAPPSAHAGADDVVDSSKSFVMENFSSYHGTKPGYVDSIQKGIQKPKSGTQGNYDDDWKGFYSTDNKYDAAGYSVDNENPLSGKAGGVVKVTYPGLTKVLALKVDNAETIKKELGLSLTEPLMEQVGTEEFIKRFGDGASRVVLSLPFAEGSSSVEYINNWEQAKALSVELEINFETRGKRGQDAMYEYMAQACAGNRVRRSVGSSLSCINLDWDVIRDKTKTKIESLKEHGPIKNKMSESPNKTVSEEKAKQYLEEFHQTALEHPELSELKTVTGTNPVFAGANYAAWAVNVAQVIDSETADNLEKTTAALSILPGIGSVMGIADGAVHHNTEEIVAQSIALSSLMVAQAIPLVGELVDIGFAAYNFVESIINLFQVVHNSYNRPAYSPGHKTQPFLHDGYAVSWNTVEDSIIRTGFQGESGHDIKITAENTPLPIAGVLLPTIPGKLDVNKSKTHISVNGRKIRMRCRAIDGDVTFCRPKSPVYVGNGVHANLHVAFHRSSSEKIHSNEISSDSIGVLGYQKTVDHTKVNSKLSLFFEIKS >NC_016785|140991:175272|167149_167902_+|WP_003850245.1|DBSCAN-SWA MQVMFYSPSGQKLFGENGSPAFLLKGGITDLKGAIEARTTVIPEVPGQVFDGVTIKPFTFGVTMVVYPTKEMPMEKAMRKTRQIFSAFDYSTCRIDGTGIPVGLKVRLDSIISAPSQDTAKEIAEEITITLAADEGIFWADRKEASGKIDIVNYGDCDLWPEYRWQKTTTITLPSGAKINLPETPTPRVLRTTRHQVSITDLDGKPDRELLKKLGTVWPEGVPRKETKTYEITQNGYIMWRIGYLDPWGL >NC_016785|140991:175272|142317_143340_-|WP_014306304.1|DBSCAN-SWA MPTLSSNLKHLRLLLGRSQGEFAEQLGIAQSTLSSVERENRPPSTKLINTARFQTGVSAEYFEASIHFYAAPDLLFRSAREGRANADKIAAAFSITEHYLRERYPDVTSDLPTIPIADLEGELSLRMLEEFASQTRDHFGIDQDSMIPNLTTVLNNHGILVTSLPDYVVEETNFDGVSTPTDSTLRIIALNQQRSGDRYRFSLAHELAHLILHANTLRSDKSQMEKEANIFAASFLMPRALLTPVITPELTLKDYAELKAQWGYSIQAIVRRAHELELIDYKRYRSLRMQIAGRGWNINEPVDVLLENVYIDPIDLLSNNIKKHQSTNEGLATVTSLHKQ >NC_016785|140991:175272|147972_148215_+|WP_003850204.1|DBSCAN-SWA 
MTMRTYKNPYPDSEDAVEIRFDHCREDIAKAAKEYWREMTEAELDDLQEEIMRALAVSEWQNIWLTSAAFITVLAYHSHD >NC_016785|140991:175272|160736_161114_+|WP_014306315.1|DBSCAN-SWA MTIDLNAMLAKRAEVLGEGNKFDVKLGDKAFYFVAPELASSEWNDRHQAFLEDIRDGLMTSETAREEFLGLALEDQAEEFAEAADEIGVDPFIIAQMAFQEHAEYVGKTQSQKPLNRTQRRAKQR >NC_016785|140991:175272|152490_152835_+|WP_003850225.1|DBSCAN-SWA MATNSRRVGELEQAFCDSIAALEADGIAVPEKYSAVVMLGRLYAFNIDESVNTDTEQATKALYLGPHLMGVMKLLGTAPTEASGEDKSSAPSNVVADRMKVLEIMKEYKKQNGG >NC_016785|140991:175272|158197_158680_+|WP_003850235.1|DBSCAN-SWA MNAITPTPGIDKEAFDRAANAVRRLCGWHIFPIIEETITLDSPGDSLLVLPTKHLVEIINVTIDGTTYPLSDFRSSPDGLLVKRHGRFPRGIAMVTVTMKHGYEKPTEILGVINDMARRANESNLTQLNVGGISVGATNSATPQSSEWRIVDELRLGPLP >NC_016785|140991:175272|159693_160641_+|WP_014306314.1|DBSCAN-SWA MTDFDIDTSAANYADELALLGVTGAMSYAPKGTQMPETIAPLNPPFVDFGWLSDGGITESQNEERNDWTPFQSTNPIRGQVTKQDFQFKTVVWSISGLANAMYYGVPESDMRFDQETGVTTFEQGKELPPDFKFGLVVDIVDGKKARRHCMPNVSVVERGDIVYSKDDLVGYEMTFRASYDPVAGYAVRRMFKEGWKPGHAGTTLTDENKDASLSDWSNTLDESEVRKQSKTVTLPKGATGGTFTVSINGKASAAINHDATGTAMKLVLNKVDGGESAKVTGRAGGPYTITGVEGEITADGTNLTGSDTQDISIN >NC_016785|140991:175272|148226_148451_+|WP_003850206.1|DBSCAN-SWA MILLVLINIVFSMLLLFNLTDANRKIEDATKRLDMLNEDVDVQSVKILDLYGIAQIPPMPDEEAASRIFTGVRR >NC_016785|140991:175272|148447_148594_+|WP_003850207.1|DBSCAN-SWA MNRTALEELHQALISEAEAMRAGEYFLGVGIVDAYAQQLREAIDSHDE |
40 | Corynebacterium_phage(93.75%) | capsid,integrase,holin,protease,portal,head | attL 137434:137456|attR 175556:175578 |
Homologous phage analysis in the prophage region: a colored bacterial protein is one whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
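The prophage protein records above use pipe-delimited FASTA headers, and the legend describes a keyword rule for highlighting phage-related proteins. The following is a minimal sketch of how such a header could be split into fields and checked against the keywords quoted in the legend. The field meanings (contig, prophage region, gene coordinates with strand, protein accession, optional annotation, prediction method) are inferred from the records shown here and are assumptions, as are the names `ProphageProtein`, `parse_header`, and `matched_keywords`. The substring matching is illustrative only and is not confirmed to be the database's own procedure.

```python
from typing import List, NamedTuple, Optional

# Keywords quoted in the legend; the legend says "such as", so this list
# is probably not exhaustive (for example, 'holin' is not included).
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


class ProphageProtein(NamedTuple):
    """Hypothetical container for one header; field meanings are assumed."""
    contig: str
    region_start: int
    region_end: int
    gene_start: int
    gene_end: int
    strand: str
    protein_id: str
    annotation: Optional[str]
    method: str


def parse_header(header: str) -> ProphageProtein:
    """Split a header such as
    '>NC_016785|140991:175272|140991_142218_-|WP_014306303.1|integrase|DBSCAN-SWA'
    into its pipe-delimited fields. The annotation field is optional."""
    fields = header.strip().lstrip(">").split("|")
    if len(fields) not in (5, 6):
        raise ValueError(f"unexpected number of fields: {len(fields)}")
    contig, region, gene, protein_id = fields[:4]
    annotation = fields[4] if len(fields) == 6 else None
    method = fields[-1]
    region_start, region_end = (int(x) for x in region.split(":"))
    gene_start, gene_end, strand = gene.split("_")
    return ProphageProtein(
        contig, region_start, region_end,
        int(gene_start), int(gene_end), strand,
        protein_id, annotation, method,
    )


def matched_keywords(annotation: Optional[str]) -> List[str]:
    """Return the legend keywords found in the annotation
    (case-insensitive substring match, an assumed matching rule)."""
    if not annotation:
        return []
    lowered = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in lowered]


if __name__ == "__main__":
    rec = parse_header(
        ">NC_016785|140991:175272|140991_142218_-|WP_014306303.1|integrase|DBSCAN-SWA"
    )
    print(rec.protein_id, rec.strand, matched_keywords(rec.annotation))
```

Headers without an annotation field, which are the majority in this region, parse with `annotation` set to `None` and therefore match no keyword.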
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|