Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
CP029164 | Escherichia coli strain 104 chromosome, complete genome | 4 crisprs | WYL,cas3,csa3,PD-DExK,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,DinG | 1 | 15 | 8 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP029164_1 | 136157-136296 | Orphan |
NA
Consensus repeat of CP029164_1
|
1 spacers
spacers of CP029164_1
>1.1|136201|52|CP029164|CRISPRCasFinder GCACTGAACTCGTAGGCCTGATAAGACGCGACAGCGTCGCATCAGGCAAGGC |
CRISPR arrays and Neighbor proteins around CP029164_1
The CRISPR arrays of CP029164_1 >merge|CP029164|1|136157-136296|CRISPRCasFinder TGTTATTGTCGGATGCGGCGTGAACGCCTTATCCGACCTACACAGCACTGAACTCGTAGGCCTGATAAGACGCGACAGCGTCGCATCAGGCAAGGCTGTTATTGTCGGATGCGACGTGAACGCCTTATCCGACCTACACA >CP029164|1|1|136157-136296|CRISPRCasFinder TGTTATTGTCGGATGCGGCGTGAACGCCTTATCCGACCTACACA GCACTGAACTCGTAGGCCTGATAAGACGCGACAGCGTCGCATCAGGCAAGGC TGTTATTGTCGGATGCGACGTGAACGCCTTATCCGACCTACACA
>CP029164.1|AWH67990.1|134628_136140_+|cytosol-aminopeptidase MEFSVKSGSPEKQRSACIVVGVFEPRRLSPIAEQLDKISDGYISALLRRGELEGKPGQTLLLHHVPNVLSERILLIGCGKERELDERQYKQVIQKTINTLNDTGSMEAVCFLTELHVKGRNNYWKVRQAVETAKETLYSFDQLKTNKSEPRRPLRKMVFNVPTRRELTSGERAIQHGLAIAAGIKAAKDLGNMPPNICNAAYLASQARQLADSYSKNVITRVIGEQQMKELGMHSYLAVGQGSQNESLMSVIEYKGNASEDARPIVLVGKGLTFDSGGISIKPSEGMDEMKYDMCGAAAVYGVMRMVAELQLPINVIGVLAGCENMPGGRAYRPGDVLTTMSGQTVEVLNTDAEGRLVLCDVLTYVERFEPEAVIDVATLTGACVIALGHHITGLMANHNPLAHELIAASEQSGDRAWRLPLGDEYQEQLESNFADMANIGGRPGGAITAGCFLSRFTRKYNWAHLDIAGTAWRSGKAKGATGRPVALLAQFLLNRAGFNGEE >CP029164.1|AWH67989.1|133261_134362_-|LPS-export-ABC-transporter-permease-LptF MIIIRYLVRETLKSQLAILFILLLIFFCQKLVRILGAAVDGDIPANLVLSLLGLGVPEMAQLILPLSLFLGLLMTLGKLYTESEITVMHACGLSKAVLVKAAMILAVFTAIVAAVNVMWAGPWSSRHQDEVLAEAKANPGMAALAQGQFQQATNGSSVLFIESVDGSDFKDVFLAQIRPKGNARPSVVVADSGHLTQLRDGSQVVTLNQGTRFEGTALLRDFRITDFQDYQAIIGHQAVALDPNDTDQMDMRTLWNTDTDRARAELNWRITLVFTVFMMALMVVPLSVVNPRQGRVLSMLPAMLLYLLFFLIQTSLKSNGGKGKLDPTLWMWTVNLIYLALAIVLNLWDTVPVRRLRASFSRKGAV >CP029164.1|AWH67988.1|132179_133262_-|LPS-export-ABC-transporter-permease-LptG MQPFGVLDRYIGKTIFTTIMMTLFMLVSLSGIIKFVDQLKKAGQGSYDALGAGMYTLLSVPKDVQIFFPMAALLGALLGLGMLAQRSELVVMQASGFTRMQVALSVMKTAIPLVLLTMAIGEWVAPQGEQMARNYRAQAMYGGSLLSTQQGLWAKDGNNFVYIERVKGDEELGGISIYAFNENRRLQSVRYAAAAKFDPEHKVWRLSQVDESDLTNPKQITGSQTVSGTWKTNLTPDKLGVVALDPDALSISGLHNYVKYLKSSGQDAGRYQLNMWSKIFQPLSVAVMMLMALSFIFGPLRSVPMGVRVVTGISFGFVFYVLDQIFGPLTLVYGIPPIIGALLPSASFFLISLWLLMRKS >CP029164.1|AWH67987.1|130516_132019_+|DUF853-domain-containing-protein MSEPLLIARTPDTELFLLPGMANRHGLITGATGTGKTVTLQKLAESLSEIGVPVFMADVKGDLTGVAEEGTSSEKLLARLKNIGVNDWQPHTNPVVVWDIFGEKGHPVRATVSDLGPLLLARLLNLNDVQSGVLNIIFRIADDQGLLLLDFKDLRAITQYIGDNAKSFQNQYGNISSASVGAIQRGLLSLEQQGAAHFFGEPMLDIKDWMRTDANGKGVINILSAEKLYQMPKLYAASLLWMLSELYEQLPEAGDLEKPKLVFFFDEAHLLFNDAPQVLLDKIEQVIRLIRSKGVGVWFVSQNPSDIPDNVLGQLGNRVQHALRAFTPKDQKAVKAAAQTMRANPTFDTEKAIQELGTGEALISFLDAKGSPSVVERAMVIAPCSRMGPVTEDERNGLINHSPVYGKYEDDVDRESAYEMLQKGFQASIEQQNNPPAKGKEVAVDDGILGGLKDILFGTTGPRGGKKDGVVQTMAKSAARQVTNQIVRGMLGSLLGGRRR >CP029164.1|AWH67986.1|129440_130439_+|LacI-family-DNA-binding-transcriptional-regulator MRNHRISLQDIATLAGVTKMTVSRYIRSPKKVAKETGERIAKIMEEINYIPNRAPGMLLNAQSYTLGILIPSFQNQLFADILAGIESVTSVHNYQTIIANYNYDRDSEEESVINLLSYNIDGIILSEKYHTIRTVKFLRSATIPVVELMDVQGERLDMEVGFDNRQAAFDMVCTMLDKRVRRKILYLGSKDDTRDEQRYQGYCDAMMLHNLSPLRMNPRAISSIHLGMQLMRDALSANPDLDGVFCTNDDIAMGALLLCRERNLAVPEQISIAGFHGLEIGRQMIPSLASVITPRFDIGRMAAQMLLSKIKNNDHNHNTVDLGYQIYHGNTL >CP029164.1|AWH67985.1|128054_129374_+|gluconate-permease MPLIIIAAGVALLLILMIGFKVNGFIALVLVAAVVGFAEGMDAQAVLHSIQNGIGSTLGGLAMILGFGAMLGKLISDTGAAQRIATTLIATFGKKRVQWALVITGLVVGLAMFFEVGFVLLLPLVFTIVASSGLPLLYVGVPMVAALSVTHCFLPPHPGPTAIATIFEANLGTTLLYGFIITIPTVIVAGPLFSKLLTRFEKAPPEGLFNPHLFSEEEMPSFWNSIFAAVIPVILMAIAAVCEITLPKTNTVRLFFEFVGNPAVALFIAIVIAIFTLGRRNGRTIEQIMDIIGDSIGAIAMIVFIIAGGGAFKQVLVDSGVGQYISHLMTGTTLSPLLMCWTVAALLRIALGSATVAAITTAGVVLPIINVTHADPALMVLATGAGSVIASHVNDPGFWLFKGYFNLTVGETLRTWTVMETLISIMGLLGVLAINAVLH >CP029164.1|AWH67984.1|127225_127990_+|gluconate-5-dehydrogenase MNDLFSLAGKNILITGSAQGIGFLLATGLGKYGAQIIINDITAERAELAVEKLHQEGIQAVAAPFNVTHKHEIDAAVEHIEKDIGPIDVLVNNAGIQRRHPFTEFPEQEWNDVIAVNQTAVFLVSQAVTRHMVERKAGKVINICSMQSELGRDTITPYAASKGAVKMLTRGMCVELARHNIQVNGIAPGYFKTEMTKALVEDEAFTAWLCKRTPAARWGDPQELIGAAVFLSSKASDFVNGHLLFVDGGMLVAV >CP029164.1|AWH67983.1|126170_127202_+|L-idonate-5-dehydrogenase MQVKTQSCVVAGKKTVAVTEQTIDWNNNGTLVQITRGGICGSDLHYYQEGKVGNFMIKAPMVLGHEVIGKVIHSDSSKLHEGQTVAINPSKPCGHCKYCIEHNENQCTEMRFFGSAMYFPHVDGGFTRYKMVETSQCVPYPAKADEKVMAFAEPLAVAIHAAHQAGELQGKRVFISGVGPIGCLIVSAVKTLGAAEIVCADVSPRSLSLGKEMGADVLVNPQNDDMDHWKAEKGYFDVSFEVSGHPSSVNTCLEVTRARGVMVQVGMGGAMAEFPMMTLIGKEISLKGSFRFTSEFNTAVSWLANGVINPLPLLSAEYPFTDLEEALRFAGDKTQAAKVQLVF >CP029164.1|AWH67982.1|125390_125954_-|thermosensitive-gluconokinase MAGESFILMGVSGSGKTLIGSKVAALLSAKFIDGDDLHPAKNIDKMSQGIPLSDEDRLPWLERLNDASYSLYKKNETGFIVCSSLKKQYRDILRKGSPHVHFLWLDGDYETILARMQRRAGHFMPVALLKSQFEALERPQADEQDIVRIDINHDIANVTEQCRQAVLAIRQNRICAKEGSASDQRCE >CP029164.1|AWH67981.1|124367_125387_+|NAD(P)-dependent-alcohol-dehydrogenase MSMIKSYAAKEAGGELEVYEYDPGELRPQDVEVQVDYCGICHSDLSMIDNEWGFSQYPLVAGHEVIGRVVALGSAAQDKGLQVGQRVGIGWTARSCGHCDACISGNQINCEQGAVPTIMNRGGFAEKLRADWQWVIPLPENIDIESAGPLLCGGITVFKPLLMHHITATSRVGVIGIGGLGHIAIKLLHAMGCEVTAFSSNPAKEQEVLAMGADKVVNSRDPQALKTLAGQFDLIINTVNVSLDWQPYFEALTYGGNFHTVGAVLTPLPVPAFTLIAGDRSVSGSATGTPYELRKLMRFAARSKVAPTTELFPMSKINDAIQHVRDGKARYRVVLKADF >CP029164.1|AWH67991.1|136397_136841_+|DNA-polymerase-III-subunit-chi MKNATFYLLDNDTTVDGLSAVEQLVCEIAAERWRSGKRVLIACEDEKQAYRLDEALWARPAESFVPHNLAGEGPRGGAPVEIAWPQKRSSSPRDILISLRTSFADFATAFTEVVDFVPYEDSLKQLARERYKAYRVAGFNLNTATWK >CP029164.1|AWH67992.1|136840_139696_+|valine--tRNA-ligase MEKTYNPQDIEQPLYEHWEKQGYFKPNGDESQESFCIMIPPPNVTGSLHMGHAFQQTIMDTMIRYQRMQGKNTLWQVGTDHAGIATQMVVERKIAAEEGKTRHDYGREAFIDKIWEWKAESGGTITRQMRRLGNSVDWERERFTMDEGLSNAVKEVFVRLYKEDLIYRGKRLVNWDPKLRTAISDLEVENRESKGSMWHIRYPLADGAKTADGKDYLVVATTRPETLLGDTGVAVNPEDPRYKDLIGKYVILPLVNRRIPIVGDEHADMEKGTGCVKITPAHDFNDYEVGKRHALPMINILTFDGDIRESAQVFDTKGNESDVYSSEIPAEFQKLERFAARKAVVAAVDALGLLEEIKPHDLTVPYGDRGGVVIEPMLTDQWYVRADVLAKPAVEAVENGDIQFVPKQYENMYFSWMRDIQDWCISRQLWWGHRIPAWYDEAGNVYVGRNEDEVRKENNLGADVALRQDEDVLDTWFSSALWTFSTLGWPENTDALRQFHPTSVMVSGFDIIFFWIARMIMMTMHFIKDENGKPQVPFHTVYMTGLIRDDEGQKMSKSKGNVIDPLDMVDGISLPELLEKRTGNMMQPQLADKIRKRTEKQFPNGIEPHGTDALRFTLAALASTGRDINWDMKRLEGYRNFCNKLWNASRFVLMNTEGQDCGFNGGEMTLSLADRWILAEFNQTIKAYREALDSFRFDIAAGILYEFTWNQFCDWYLELTKPVMNGGTEAELRGTRHTLVTVLEGLLRLAHPIIPFITETIWQRVKVLCGITADTIMLQPFPQYDASQVDEAALADTEWLKQAIVAVRNIRAEMNIAPGKPLELLLRGCSADAERRVNENRGFLQTLARLESITVLPADDKGPVSVTKIVDGAELLIPMAGLINKEDELARLAKEVAKIEGEISRIENKLANEGFVARAPEAVIAKEREKLEGYAEAKAKLIEQQAVIAAL >CP029164.1|AWH67993.1|139742_140930_-|DUF898-domain-containing-protein MNDVNIGKDNSRHSFVFTGKGGEYFLICLVNFSLTIITLGIYGPWALIKCRRYIYQHVTLKGQPFSYKGTGGAIFVSMLLIVVVYLLSISCFAGQHFALGLFLFALLICGIPCMAVKSLQYQANMTSLNDIRFGFNCSMMRAWWVMLGLPVLLALVFWFALYLIAQVTTSIGGLFFNLVALSLLSAIGLGVVHGITYSKWMPLLGNNATFGIHKFSIQVNVKECIKGCMLAILTMVPFIIVIGIMIAPVFQQLMMMTMLGRSDAGSEFVLQYYPQIMASYFLYFVAILVFASYLYVTLRNLFLNNLTLANGTIRFHSSVTAIGMLLRMLAVLMGSSITCGLAYPWLKMWMVSWIANNTHVQGDLDSLELTNDDKPQDSGSLMWISRGIMPYVPFI >CP029164.1|AWH67994.1|141122_141626_+|N-acetyltransferase MNVVASPALRLRKLTVADNPAIAHVIRQVSAEYGLTADKGYTVADPNLDELYQVYSQPGHAYWVVEYEGEVVGGGGIAPLTGSESDICELQKMYFLPAIRGKGLAKKLALMAMEQAREMGFKRCYLETTAFLKEAIALYEHLGFEHIDYALGCTGHVDCEVRMLRKL >CP029164.1|AWH67995.1|141802_142219_-|ribonuclease-E-inhibitor-RraB MANPEQLEEQREETRLIIEELLEDGSDPDALYTIEHHLSADDLETLEKAAVEAFKLGYEVTDPEELEVEDGDIVICCDILSECALNADLIDAQVEQLMTLAEKFDVEYDGWGTYFEDPNGEDGDDEDFVDEDDDGIRH >CP029164.1|AWH67996.1|142380_143385_+|ornithine-carbamoyltransferase MSGFYHKHFLKLLDFTPAELNSLLQLAAKLKADKKSGKEEARLTGKNIALIFEKDSTRTRCSFEVAAYDQGARVTYLGPSGSQIGHKESIKDTARVLGRMYDGIQYRGYGQEIVETLAEYAGVPVWNGLTNEFHPTQLLADLLTMQEHLPGKAFNEMTLVYAGDARNNMGNSMLEAAALTGLDLRLVAPQACWPEAALVAECSALAQKHGGKITLTEDIASGVKGADFIYTDVWVSMGEPKEKWAERIALLRDYQVNSKMMALTGNSQVKFLHCLPAFHDEQTTLGKKMAAELGLYGGMEVTDEVFESPASIVFDQAENRMHTIKAVMVATLAK >CP029164.1|AWH67997.1|143430_143883_-|YhcH/YjgK/YiaL-family-protein MIVGNIHHLQSWLPEELREAIEYIKSHVSDETAKGKHAIDGDRLFYLISEDTTEPGELRRAEYHARYLDIQIVLKGQEGMTFSTQPAGVPETDWLADKDIAFIGQGIDEKTVILNEGDFVVFYPGEVHKPLCAVGAPAQVRKAVVKLLKS >CP029164.1|AWH67998.1|144560_145781_+|arginine-deiminase MEKHYVGSEIGQLRSVMLHRPNLSLKRLTPSNCQELLFDDVLSVERAGEEHDIFANTLRQQGIEVLLLTDLLTQTLDIPEAKSWLLETQISDYRLGPTFATDVRTWLAEMSHRDLARHLSGGLTYSEIPASIKNMVVDTHDINDFIMKPLPNHLFTRDTSCWIYNGVSINPMAKPARQRETNNLRAIYRWHPQFAGGEFIKYFGDENINYDHATLEGGDVLVIGRGAVLIGMSERTTPQGIEFLAQALFKHRQAERVIAVELPKHRSCMHLDTVMTHIDIDTFSVYPEVVRPDVNCWTLTPDGHGGLKRTQESTLLHAIEKALGIDQVRLITTGGDAFEAEREQWNDANNVLTLRPGVVVGYERNIWTNEKYDKAGITVLPIPGDELGRGRGGARCMSCPLHRDGI >CP029164.1|AWH67999.1|145791_146736_+|carbamate-kinase MENKPTLVIALGGNALLKRGEPLEAEIQRKNIDLAAKTIAQLTQHWRVVLVHGNGPQVGLLALQNSAYAHVAPYPLDILGAESQGMIGYMLQQALKNQLPQREISVLLTQVEVDANDPAFSNPTKYIGPIYDHAQTQVLQAEKGWVFKADGHSFRRVVPSPQPKRIVERDAIQTLIAHDHLVICNGGGGVPVVEKADGYHGIEAVIDKDLSAALLASQIHADALLILTDADAVYLDWGKPTQRPLAQVTPELLNEMQFDAGSMGPKVTACAKFVSQCRGIAGIGSLADGPEILAGDKGTLIRLDTPITTLDPFL >CP029164.1|AWH68000.1|146746_147751_+|ornithine-carbamoyltransferase MATSLKNRNFLKLLDYTPAEIQYLIDLAINLKAAKKSGNEKQTLVGKNIALIFEKSSTRTRCAFEVAAFDQGAQVTYIGPSGSQIGHKESMKDTARVLGRMYDGIEYRGYGQNIVEELGEFAGVPVWNGLTNEFHPTQILADLMTMLEHAPGKTLPELSFAYLGDARNNMGNSLMVGAAKMGMDIRLVAPKSFWPDEALVTQCREIASVTGARITLTEDVEEGVYDVDFLYTDVWVSMGEPKEAWAERVSLMTPYQINQQVITATRNPEVKFMHCLPAFHNEHTTVGREIEMAYGLKGLEVTDEVFESAHSIVFDEAENRMHTIKAVMVATLGD |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP029164_3 | 2002885-2003401 | Unclear |
I-E
Consensus repeat of CP029164_3
|
8 spacers
spacers of CP029164_3
>3.1|2002914|32|CP029164|PILER-CR,CRISPRCasFinder,CRT ACGTAACAAAACAACAGCAAAATATTATCGAC >3.2|2002975|32|CP029164|PILER-CR,CRISPRCasFinder,CRT GCTCCGCCGGTTTGATCTCCGGTTTGCGCTGT >3.3|2003036|32|CP029164|PILER-CR,CRISPRCasFinder,CRT GCAATTAATTTAGTTCCAGATGCTGCGAAAGA >3.4|2003097|32|CP029164|PILER-CR,CRISPRCasFinder,CRT TTGGTCACTCGTCAAAAGTCGAGACGGTCGAA >3.5|2003158|32|CP029164|CRISPRCasFinder,CRT AATGATTGATATAAATCTGTGTACGGTGTCCG >3.6|2003219|32|CP029164|CRISPRCasFinder,CRT,PILER-CR CTAGGATAAATTAAAAGACAAAATTGCAGCAA >3.7|2003280|32|CP029164|CRISPRCasFinder,CRT,PILER-CR GAGCGACCAGTATCAAGATCGACAGGTTTTGC >3.8|2003341|32|CP029164|CRISPRCasFinder,CRT,PILER-CR ATCGATATGTACGTTAGCGAGGGGATCACGCA |
CRISPR arrays and Neighbor proteins around CP029164_3
The CRISPR arrays of CP029164_3 >merge|CP029164|3|2002885-2003401|PILER-CR,CRISPRCasFinder,CRT,PILER-CR GAGTTCCCCACGTCAGCGGGGATAAACCGACGTAACAAAACAACAGCAAAATATTATCGACGAGTTCCCCACGTCAGCGGGGATAAACCGGCTCCGCCGGTTTGATCTCCGGTTTGCGCTGTGAGTTCCCCACGTCAGCGGGGATAAACCGGCAATTAATTTAGTTCCAGATGCTGCGAAAGAGAGTTCCCCACGTCAGCGGGGATAAACCGTTGGTCACTCGTCAAAAGTCGAGACGGTCGAAGAGTTCCCCACGTCAGCGGGGATAAACCGAATGATTGATATAAATCTGTGTACGGTGTCCGGAGTTCCCCGCGCCAGCGGGGATAAACCGCTAGGATAAATTAAAAGACAAAATTGCAGCAAGAGTTCCCCGCGGCAGCGGGGATAAACCGGAGCGACCAGTATCAAGATCGACAGGTTTTGCGAGTTCCCCGCGCCAGCGGGGATAAACCGATCGATATGTACGTTAGCGAGGGGATCACGCAGAGTTCCCCGCACCAGCGGGGATAAACCA >CP029164|3|1|2002885-2003157|PILER-CR GAGTTCCCCACGTCAGCGGGGATAAACCG ACGTAACAAAACAACAGCAAAATATTATCGAC GAGTTCCCCACGTCAGCGGGGATAAACCG GCTCCGCCGGTTTGATCTCCGGTTTGCGCTGT GAGTTCCCCACGTCAGCGGGGATAAACCG GCAATTAATTTAGTTCCAGATGCTGCGAAAGA GAGTTCCCCACGTCAGCGGGGATAAACCG TTGGTCACTCGTCAAAAGTCGAGACGGTCGAA GAGTTCCCCACGTCAGCGGGGATAAACCGAATGATTGATATAAATCTGTGTACGGTGTCCGGAGTTCCCCGCGCCAGCGGGGATAAACCG CTAGGATAAATTAAAAGACAAAATTGCAGCAA GAGTTCCCCGCGGCAGCGGGGATAAACCG GAGCGACCAGTATCAAGATCGACAGGTTTTGC GAGTTCCCCGCGCCAGCGGGGATAAACCG ATCGATATGTACGTTAGCGAGGGGATCACGCA >CP029164|3|3|2002885-2003401|CRISPRCasFinder GAGTTCCCCACGTCAGCGGGGATAAACCG ACGTAACAAAACAACAGCAAAATATTATCGAC GAGTTCCCCACGTCAGCGGGGATAAACCG GCTCCGCCGGTTTGATCTCCGGTTTGCGCTGT GAGTTCCCCACGTCAGCGGGGATAAACCG GCAATTAATTTAGTTCCAGATGCTGCGAAAGA GAGTTCCCCACGTCAGCGGGGATAAACCG TTGGTCACTCGTCAAAAGTCGAGACGGTCGAA GAGTTCCCCACGTCAGCGGGGATAAACCG AATGATTGATATAAATCTGTGTACGGTGTCCG GAGTTCCCCGCGCCAGCGGGGATAAACCG CTAGGATAAATTAAAAGACAAAATTGCAGCAA GAGTTCCCCGCGGCAGCGGGGATAAACCG GAGCGACCAGTATCAAGATCGACAGGTTTTGC GAGTTCCCCGCGCCAGCGGGGATAAACCG ATCGATATGTACGTTAGCGAGGGGATCACGCA GAGTTCCCCGCACCAGCGGGGATAAACCA >CP029164|3|1|2002885-2003401|CRT GAGTTCCCCACGTCAGCGGGGATAAACCG ACGTAACAAAACAACAGCAAAATATTATCGAC GAGTTCCCCACGTCAGCGGGGATAAACCG GCTCCGCCGGTTTGATCTCCGGTTTGCGCTGT GAGTTCCCCACGTCAGCGGGGATAAACCG GCAATTAATTTAGTTCCAGATGCTGCGAAAGA GAGTTCCCCACGTCAGCGGGGATAAACCG TTGGTCACTCGTCAAAAGTCGAGACGGTCGAA GAGTTCCCCACGTCAGCGGGGATAAACCG AATGATTGATATAAATCTGTGTACGGTGTCCG GAGTTCCCCGCGCCAGCGGGGATAAACCG CTAGGATAAATTAAAAGACAAAATTGCAGCAA GAGTTCCCCGCGGCAGCGGGGATAAACCG GAGCGACCAGTATCAAGATCGACAGGTTTTGC GAGTTCCCCGCGCCAGCGGGGATAAACCG ATCGATATGTACGTTAGCGAGGGGATCACGCA GAGTTCCCCGCACCAGCGGGGATAAACCA >CP029164|3|2|2003190-2003401|PILER-CR ACGTAACAAAACAACAGCAAAATATTATCGAC GAGTTCCCCACGTCAGCGGGGATAAACCG GCTCCGCCGGTTTGATCTCCGGTTTGCGCTGT GAGTTCCCCACGTCAGCGGGGATAAACCG GCAATTAATTTAGTTCCAGATGCTGCGAAAGA GAGTTCCCCACGTCAGCGGGGATAAACCG TTGGTCACTCGTCAAAAGTCGAGACGGTCGAA GAGTTCCCCACGTCAGCGGGGATAAACCGAATGATTGATATAAATCTGTGTACGGTGTCCGGAGTTCCCCGCGCCAGCGGGGATAAACCG CTAGGATAAATTAAAAGACAAAATTGCAGCAA GAGTTCCCCGCGGCAGCGGGGATAAACCG GAGCGACCAGTATCAAGATCGACAGGTTTTGC GAGTTCCCCGCGCCAGCGGGGATAAACCG ATCGATATGTACGTTAGCGAGGGGATCACGCA GAGTTCCCCGCACCAGCGGGGATAAACCA
>CP029164.1|AWH69691.1|2001873_2002545_+|7-carboxy-7-deazaguanine-synthase-QueE MQYPINEMFQTLQGEGYFTGVPAIFIRLQGCPVGCAWCDTKHTWEKLEDREVSLFSILAKTKESDKWGAASSEDLLAVIGRQGYTARHVVITGGEPCIHDLLPLTDLLEKNGFSCQIETSGTHEVRCTPNTWVTVSPKLNMRGGYEVLSQALERANEIKHPVGRVRDIEALDELLATLTDDKPRVIALQPISQKDDATRLCIETCIARNWRLSMQTHKYLNIA >CP029164.1|AWH69690.1|1999347_2000646_+|enolase MSKIVKIIGREIIDSRGNPTVEAEVHLEGGFVGMAAAPSGASTGSREALELRDGDKSRFLGKGVTKAVAAVNGPIAQALIGKDAKDQAGIDKIMIDLDGTENKSKFGANAILAVSLANAKAAAAAKGMPLYEHIAELNGTPGKYSMPVPMMNIINGGEHADNNVDIQEFMIQPVGAKTVKEAIRMGSEVFHHLAKVLKAKGMNTAVGDEGGYAPNLGSNAEALAVIAEAVKAAGYELGKDITLAMDCAASEFYKDGKYVLAGEGNKAFTSEEFTHFLEELTKQYPIVSIEDGLDESDWDGFAYQTKVLGDKIQLVGDDLFVTNTKILKEGIEKGIANSILIKFNQIGSLTETLAAIKMAKDAGYTAVISHRSGETEDATIADLAVGTAAGQIKTGSMSRSDRVAKYNQLIRIEEALGEKAPYNGRKEIKGQA >CP029164.1|AWH69689.1|1997623_1999261_+|CTP-synthetase MTTNYIFVTGGVVSSLGKGIAAASLAAILEARGLNVTIMKLDPYINVDPGTMSPIQHGEVFVTEDGAETDLDLGHYERFIRTKMSRRNNFTTGRIYSDVLRKERRGDYLGATVQVIPHITNAIKERVLEGGEGHDVVLVEIGGTVGDIESLPFLEAIRQMAVEIGREHTLFMHLTLVPYMAASGEVKTKPTQHSVKELLSIGIQPDILICRSDRAVPANERAKIALFCNVPEKAVISLKDVDSIYKIPGLLKSQGLDDYICKRFSLNCPEANLSEWEQVIFEEANPVSEVTIGMVGKYIELPDAYKSVIEALKHGGLKNRVSVNIKLIDSQDVETRGVEILKGLDAILVPGGFGYRGVEGMITTARFARENNIPYLGICLGMQVALIDYARHVANMENANSTEFVPDCKYPVVALITEWRDENGNVEVRSEKSDLGGTMRLGAQQCQLVDDSLVRQLYNAPTIVERHRHRYEVNNMLLKQIEDAGLRVAGRSGDDQLVEIIEVPNHPWFVACQFHPEFTSTPRDGHPLFAGFVKAASEFQKRQAK >CP029164.1|AWH69688.1|1996604_1997396_+|nucleoside-triphosphate-pyrophosphohydrolase MNQIDRLLTIMQRLRDPENGCPWDKEQTFATIAPYTLEETYEVLDAIAREDFDDLRGELGDLLFQVVFYAQMAQEEGRFDFNDICAAISDKLERRHPHVFADGSAGNSREVLARWEQIKTEERAQKAQHSALDDIPSSLPALMRAQKIQKRCANVGFDWTTLGPVVDKVYEEIDEVMYEARQVVVDQAKLEEEMGDLLFATVNLARHLGTKAEIALQKANEKFERRFREVERIVAARGLEMTGVDLETMEEVWQQVKRQEIDL >CP029164.1|AWH69687.1|1994274_1996509_+|GTP-pyrophosphokinase MVAVRSAHINKAGEFDPEKWIASLGITSQKSCECLAETWAYCLQQTQGHPDASLLLWRGVEMVEILSTLSMDIDTLRAALLFPLADANVVSEDVLRESVGKSVVNLIHGVRDMAAIRQLKATHTDSVSSEQVDNVRRMLLAMVDDFRCVVIKLAERIAHLREVKDAPEDERVLAAKECTNIYAPLANRLGIGQLKWELEDYCFRYLHPTEYKRIAKLLHERRLDREHYIEEFVGHLRAEMKAEGVKAEVYGRPKHIYSIWRKMQKKNLAFDELFDVRAVRIVAERLQDCYAALGIVHTHYRHLPDEFDDYVANPKPNGYQSIHTVVLGPGGKTVEIQIRTKQMHEDAELGVAAHWKYKEGAAAGGARSGHEDRIAWLRKLIAWQEEMADSGEMLDEVRSQVFDDRVYVFTPKGDVVDLPAGSTPLDFAYHIHSDVGHRCIGAKIGGRIVPFTYQLQMGDQIEIITQKQPNPSRDWLNPNLGYVTTSRGRSKIHAWFRKQDRDKNILAGRQILDDELEHLGISLKEAEKHLLPRYNFNDVDELLAAIGGGDIRLNQMVNFLQSQFNKPSAEEQDAAALKQLQQKSYTPQNRSKDNGRVVVEGVGNLMHHIARCCQPIPGDEIVGFITQGRGISVHRADCEQLAELRSHAPERIVDAVWGESYSAGYSLVVRVVANDRSGLLRDITTILANEKVNVLGVASRSDTKQQLATIDMTIEIYNLQVLGRVLGKLNQVPDVIDARRLHGS >CP029164.1|AWH69686.1|1992925_1994227_+|23S-rRNA-(uracil(1939)-C(5))-methyltransferase-RlmD MAQFYSAKRRTTTRQIITVSVNDLDSFGQGVARHNGKALFIPGLLPQENAEVTVTEDKKQYARAKVVRRLSDSPERETPRCPHFGVCGGCQQQHASVDLQQRSKSAALARLMKHEVSEVIADVPWGYRRRARLSLNYLPKTQQLQMGFRKAGSSDIVDVKQCPILVPQLEALLPKVRACLGSLQAIRHLGHVELVQATSGTLMILRHTAPLSSADREKLERFSHSEGLDLYLAPDSEILETVSGEMPWYDSNGLRLTFSPRDFIQVNAGVNQKMVARALEWLDVQPEDRVLDLFCGMGNFTLPLATQAASVVGVEGVPALVEKGQQNARLNGLQNVTFYHENLEEDVTKQPWAKNGFDKVLLDPARAGAAGVMQQIIKLEPIRIVYVSCNPATLARDSEALLKAGYTIARLAMLDMFPHTGHLESMVLFSRVK >CP029164.1|AWH69685.1|1990112_1992869_-|signal-transduction-histidine-kinase MTNYSLRARMMILILAPTVLIGLLLSIFFVVHRYNDLQRQLEDAGASIIEPLAVSTEYGMSLQNRESIGQLISVLHRRHSDIVRAISVYDENNRLFVTSNFHLDPSSMQLGSNVPFPRQLTVTRDGDIMILRTPIISESYSPDESPSSDAKNSQNMLGYIALELDLKSVRLQQYKEIFISSVMMLFCIGIALIFGWRLMRDVTGPIRNMVNTVDRIRRGQLDSRVEGFMLGELDMLKNGINSMAMSLAAYHEEMQHNIDQATSDLRETLEQMEIQNVELDLAKKRAQEAARIKSEFLANMSHELRTPLNGVIGFTRLTLKTELTPTQRDHLNTIERSANNLLAIINDVLDFSKLEAGKLILESIPFPLRSTLDEVVTLLAHSSHDKGLELTLNIKSDVPDNVIGDPLRLQQIITNLVGNAIKFTENGNIDILVEKRALSNTKVQIEVQIRDTGIGIPERDQSRLFQAFRQADASISRRHGGTGLGLVITQKLVNEMGGDISFHSQPNRGSTFWFHINLDLNPNIIIEGPSTQCLAGKRLAYVEPNSAAAQCTLDILSETPLEVVYSPTFSALPPAHYDMMLLGIAVTFREPLTMQHERLAKAVSMTDFLMLALPCHAQVNAEKLKQDGIGACLLKPLTPTRLLPALTEFCHHKQNTLLPVTDESKLAMTVMAVDDNPANLKLIGALLEDMVQHVELCDSGHQAVERAKQMPFDLILMDIQMPDMDGIRACELIHQLPHQQQTPVIAVTAHAMAGQKEKLLGAGMSDYLAKPIEEERLHNLLLRYKPGSGISSRVVTPEVNEIVVNPNATLDWQLALRQAAGKTDLARDMLQMLLDFLPEVRNKVEEQLVGENPEGLVDLIHKLHGSCGYSGVPRMKNLCQLIEQQLRSGTKEEDLEPELLELLDEMDNVAREASKILG >CP029164.1|AWH69684.1|1988541_1989882_+|glucarate-dehydratase MSSQFTTPVVTEMQVIPVAGHDSMLMNLSGAHAPFFTRNIVIIKDNSGHTGVGEIPGGEKIRKTLEDAIPLVVGKTLGEYKNVLTLVRNTFADRDAGGRGLQTFDLRTTIHVVTGIEAAMLDLLGQHLGVNVASLLGDGQQRSEVEMLGYLFFVGNRKATPLPYQSQPDDKCDWYRLRHEEAMTPDAVVRLAEAAYEKYGFNDFKLKGGVLAGEEEAESIVALAQRFPQARITLDPNGAWSLNEAIKIGKYLKGSLAYAEDPCGAEQGFSGREVMAEFRRATGLPTATNMIATDWRQMGHTLSLQSVDIPLADPHFWTMQGSVRVAQMCHEFGLTWGSHSNNHFDISLAMFTHVAAAAPGKITAIDTHWIWQEGNQRLTKEPLEIKGGLVQVPEKPGLGVEIDMDQVMKAHELYQKHGLGARDDAMGMQYLIPGWTFDNKRPCMVR >CP029164.1|AWH69683.1|1987180_1988521_+|glucarate-dehydratase MTTQSSPVITDMKVIPVAGHDSMLLNIGGAHNAYFTRNIVVLTDNAGHTGIGEAPGGEVIYQTLVKAIPMVLGQEVARLNKVVQQVHKGNQAADFDTFGKGAWTFELRVNAVAALEAALLDLLGQALNVPVCELLGPGKQRDAITVLGYLFYIGDRTKTDLPYLENTSGNHEWYQLRHQKAMNSEAVVRLAEASQDRYGFKDFKLKGGVLPGEQEIDTVRALKKRFPDARITVDPNGAWLLDEAISLCKGLNDVLTYAEDPCGAEQGFSGREVMAEFRRATGLPVATNMIATNWREMGHAVMLNAVDIPLADPHFWTLSGAVRVAQLCDDWGLTWGCHSNNHFDISLAMFTHVGAAAPGNPTAIDTHWIWQEGDCRLTKNPLEIKNGKIAVPDAPGLGVELDWEQVQKAHEAYKRLPGGARNDAGPMQYLIPGWTFDRKRPVFGRH >CP029164.1|AWH69682.1|1985826_1987179_+|MFS-transporter MSSLSQAASSVEKRTNARYWIVVMLFIVTSFNYGDRATLSIAGSEMAKDIGLDPVGMGYVFSAFSWAYVIGQIPGGWLLDRFGSKRVYFWSIFIWSMFTLLQGFVDIFSGFGIIVALFTLRFLVGLAEAPSFPGNSRIVAAWFPAQERGTAVSIFNSAQYFATVIFAPIMGWLTHEVGWSHVFFFMGGLGIVISFIWLKVIHEPNQHPGVNQKELEYIAAGGALINMDQQNTKVKVPFSVKWGQIKQLLGSRMMIGVYIGQYCINALTYFFITWFPVYLVQARGMSILKAGFVASVPAVCGFIGGVLGGIISDWLMRRTGSLNIARKTPIVMGMLLSMVMVFCNYVNVEWMIIGFMALAFFGKGIGALGWAVMADTAPKEISGLSGGLFNMFGNISGIVTPIAIGYIVGTTGSFNGALIYVGVHALIAVLSYLVLVGDIKRIELKPVAGQ >CP029164.1|AWH69692.1|2004038_2005517_-|sugar-kinase MSKKYIIGIDGGSQSTKVVMYDLEGNVVCEGKGLLQPMHTPDADTAEHPDDDLWASLCFASHDLMSQFAGNKEDIVGIGLGSIRCCRALLKADGTPAAPLISWQDARVTRPYEHTNPDVAYVTSFSGYLTHRLTGEFKDNIANYFGQWPVDYKTWAWSEDTAVMEKFNIPRQMLFDVQMPGTVLGHITRQAALATHFPAGLPVVCTTSDKPVEALGAGLLDDETAVISLGTYIALMMNGKALPKDPVAYWPIMSSIPQTLLYEGYGIRKGMWTVSWLRDMLGESLIQDAKAQDLSPEDLLNKKASCVPPGCNGLMTVLDWLTNPWEPYKRGIMIGFDSSMDYAWIYRSILESVALTLKNNYDNMCNEMNHFAKHVIITGGGSNSDLFMQIFADVFNLPARRNAINGCASLGAAINTAVGLGLYPDYATAVDKMVRVKDIFMPVESNAKRYDAMNKGIFKDLTKHTDVILKKSYEVMHGELGNADSIQSWSNA >CP029164.1|AWH69693.1|2005543_2006821_-|MFS-transporter MQHNSYRRWITLAIISFSGGVSFDLAYLRYIYQIPMAKFMGFSNTEIGLIMSTFGIAAIILYAPSGVIADKFSHRKMITSAMIITGLLGLLMATYPPLWVMLCIQVAFAITTILMLWSVSIKAASLLGDHSEQGKIMGWMEGLRGVGVMSLAVFTMWVFSRFAPDDSASLKTVIIIYSVVYILLGILCWFFVSDNNNLRSTNNEEKQSFQLSDILAVLRISTTWYCSMVIFGVFTIYAILSYSTNYLTEMYGMSLVAASYMGIVINKIFRALCGPLGGIITTYSKVKSPTRVIQILSVLGLLALTALLVTNSNPQSVAMGIGLILLLGFTCYASRGLYWACPGEARTPSYIMGTTVGICSVIGFLPDVFVYPIIGYWQDTLPAAEAYRNMWLMGMAALAMVIIFTFLLFQKIRTADSAPAMASSK >CP029164.1|AWH69694.1|2007139_2007925_+|KR-domain-containing-protein MSIESLNAFSMDFFSLKGKTAIVTGGNSGLGQAFAMALAKAGANIFIPSFVKDNGETKEMIEKQGVEVDFMQVDITAEGAPQKIIAACCERFGTVDILVNNAGICKLNKVLDFGRADWDPMIDVNLTAAFELSYEAAKIMIPQKSGKIINICSLFSYLGGQWSPAYSATKHALAGFTKAYCDELGQYNIQVNGIAPGYYATDITLATRSNPETNQRVLDHIPANRWGDTQDLMGAAVFLASQASNYVNGHLLVVDGGYLVR >CP029164.1|AWH69695.1|2007994_2009449_+|FAD-binding-oxidoreductase MSLSRAAIVDQLKEIVGADRVITDETVLKKNSIDRFRKFPDIHGIYTLPIPAAVVKLGSTEQVSRVLNFMNAHKINGVPRTGASATEGGLETVVENSVVLDGSAMNQIINIDIENMQATAQCGVPLEVLENALREKGYTTGHSPQSKPLAQMGGLVATRSIGQFSTLYGAIEDMVVGLEAVLADGTVTRIKNVPRRAAGPDIRHIIIGNEGALCYITEVTVKIFKFTPENNLFYGYVLEDMKTGFNILREIMVEGYRPSIARLYDAEDGTQHFTHFADGKCVLIFMAEGNPRIAKATGEGIAEIVARYPQCQRVDSKLIETWFNHLNWGPDKVAAERVQILKTGNMGFTTEVSGCWSCIHEIYESVINRIRTEFPHADDITMLGGHSSHSYQNGTNMYFVYDYNVVDCKPEEEIDKYHNPLNKIICEETIRLGGSMVHHHGIGKHRVHWSKLEHGSAWALLEGLKKQFDPNGIMNTGTIYPIEK >CP029164.1|AWH69696.1|2009542_2010880_+|MFS-transporter MNTSPVRMDDLPLNRFHCRIAALTFGAHLTDGYVLGVIGYAIIQLTPAMQLTPFMAGMIGGSALLGLFIGSLVLGWISDHIGRQKIFTFSFLLITLASFLQFFATTPEHLIGLRILIGIGLGGDYSVGHTLLAEFSPRRHRGVLLGAFSVVWTVGYVLASIAGHHFISESPEAWRWLLASAALPALLITLLRWGTPESPRWLLRQGRFAEAHAIVHRYFGPHVLLGDEVATATHKHIKTLFSSRYWRRTAFNSVFFVCLVIPWFVIYTWLPTIAQTIGLEDALTASLMLNALLIVGALLGLVLTHLLAHRKFLLGSFLLLAATLVVMACLPSGSSLTLLLFVLFSTTISAVSNLVGILPAESFPTDIRSLGVGFATAMSRLGAAVSTGLLPWVLAQWGMQVTLLLLAAVLLVGFVVTWLWAPETKALPLVAAGNVGGANEHSVSV >CP029164.1|AWH69697.1|2010857_2011637_+|electron-transfer-flavoprotein-subunit-beta/FixA-family-protein MNILLAFKAEPDAGMLAEKEWQAAAQGNSGPDVSLLRSLLGADEQAAAALLLAQRKNGTSMSLTALSMGDERALHWLRYLMALGFEEAVVLETAADLRFASEFVARHIAEWQRQNPLDLIITGCQSSEGQNGQTPFLLAEMLGWPCFTQVERFTLDAPFITLEQRTEHGLRCCRVRLPAVIAVRQCGEVALPVPGMRQRMAAGKAEIIRKTVAAELPAMQCLQLARAEQRRGATLIDGQTVAEKAQKLWQNYLRQRMQP >CP029164.1|AWH69698.1|2011633_2012494_+|electron-transfer-flavoprotein-subunit-alpha/FixB-family-protein MNITIVTINQENAAIASWLAAQDFSGCTLAHWQIEPQPVVAEQVLDALVEQWQRTPADVVLFPPGTFGDELSTRLAWRLHGASICQVTSLDIPTVSMRKSHWGNALTATLQTEKRPLCLSLARQAGALKNATLPSGMQQLNIVPGAPPDWLISVENLKNVTRDPLAEARRVLVVGQGGEADNQEIAMLVEKLGAEVGYSRARVMNGGVDAEKVIGISGHLLAPEVCIVVGASGAAALMAGVRNSKFVVAINHDASAAVFSQADVGVVDDWKAVLEALVTTIHADCQ >CP029164.1|AWH69699.1|2012640_2013216_-|glycerol-3-phosphate-responsive-antiterminator MPLLHLLRQNPVIAAVKDNASLQLAIDSECQFISVLYGNICTISQIVKKIKNAGKYAFIHVDLLEGASNKEVVIQFLKLVTEADGIISTKASMLKAARAEGFFCIHRLFIVDSISFHNIDKQVAQSNPDCIEILPGCMPKVLGWVTEKIRQPLIAGGLVCDEEDARNAINAGVVALSTTNTGVWTLAKKLL >CP029164.1|AWH69700.1|2013232_2013493_-|ferredoxin-family-protein MSVARNLWRVADAPHIVPADSVERQTVQRLINACPAGLFSLTPEGDLRVDYRGCLECGTCRLLCDESTLQQWRYPPSGFGITYRFG >CP029164.1|AWH69701.1|2013483_2014755_-|FAD-dependent-oxidoreductase MEDDCDIIIIGAGIAGTACALRCARAGLSVLLVERAEIPGSKNLSGGRLYTHALAELLPQFHLTAPLERRITHESLSLLTPDGATTFSSLQPGGESWSVLRARFDPWLVAEAEKEGVECIPGATVDALYEENGRVCGVICGDDILRARYVVLAEGANSVLAERHGLVTRPAGEAMALGIKEVLSLETSAIEERFHLENNEGAALLFSGGICDDLPGGAFLYTNQQTLSLGIVCPLSSLTQSRVPASELLARFKTHPAVRPLIKNTESLEYGAHLVPEGGLHSMPVQYAGNGWLLLGDALRSCVNTGISVRGMDMALTGTQAAAQTLISACQHREPQNLFALYHHNVERSLLWDVLQRYQHVPALLQRPGWYRAWPALMQDISRDLWDQGDKPVPPLRQLFWRHLRRHGLWHLAGDVIRSLRCL |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP029164_4 | 2029103-2030107 | TypeI-E |
I-E
Consensus repeat of CP029164_4
|
16 spacers
spacers of CP029164_4
>4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT CATTGAAAACATTGCCTTTATTTTATTTTTTG >4.2|2029193|32|CP029164|PILER-CR,CRISPRCasFinder,CRT GTGCCGCCGCTGGGCACTTCCTTCCCGTGAGT >4.3|2029254|32|CP029164|PILER-CR,CRISPRCasFinder,CRT CGGACGGTGGGAATATAGAAAATCCGTCCACC >4.4|2029315|32|CP029164|PILER-CR,CRISPRCasFinder,CRT TTATACTCTTTTCATCGACTAAGGAGGGGAGG >4.5|2029376|32|CP029164|PILER-CR,CRISPRCasFinder,CRT CAATAACGCAGCATCCAGGAAGCTGTTTCCGC >4.6|2029437|32|CP029164|PILER-CR,CRISPRCasFinder,CRT CGATCGGTGAAGAGGTCCGCGAAATACTCACT >4.7|2029498|32|CP029164|PILER-CR,CRISPRCasFinder,CRT GCGATAGTTGATTCAGCCGCGCCAGCGAATGT >4.8|2029559|32|CP029164|PILER-CR,CRISPRCasFinder,CRT GGGCCGCCGCGAATTTACACACGATTCAATAC >4.9|2029620|32|CP029164|PILER-CR,CRISPRCasFinder,CRT AACTGGTGCGCGACGGCTGGCTAACAAAACGA >4.10|2029681|32|CP029164|PILER-CR,CRISPRCasFinder,CRT CGTGGCTGCGCTGGCCGTTGCAGCAGTTTGAT >4.11|2029742|32|CP029164|PILER-CR,CRISPRCasFinder,CRT CGTAAACGCCCCGTCGCCATTAATTTCGGGGT >4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT TGGGATGAGCAAATAACGTCGTTTCCTAGAAA >4.13|2029864|32|CP029164|PILER-CR,CRISPRCasFinder,CRT CCGCCGTGCCAGTGATCCTCATACGGCCTGTT >4.14|2029925|32|CP029164|PILER-CR,CRISPRCasFinder,CRT AAATTAAGAACGGCGTAAACGACGGCAGCATG >4.15|2029986|32|CP029164|PILER-CR,CRISPRCasFinder,CRT GTGATGATTCAGAGCAGACATTAGCCCGCGCT >4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT GGTAAAAACACGGTCTGAACCGACATTCATGT |
cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas3 |
CRISPR arrays and Neighbor proteins around CP029164_4
The CRISPR arrays of CP029164_4 >merge|CP029164|4|2029103-2030107|PILER-CR,CRISPRCasFinder,CRT GTGTTCCCCGCGCCAGCGGGGATAAACCGCATTGAAAACATTGCCTTTATTTTATTTTTTGGTGTTCCCCGCGCCAGCGGGGATAAACCGGTGCCGCCGCTGGGCACTTCCTTCCCGTGAGTGTGTTCCCCGCGCCAGCGGGGATAAACCGCGGACGGTGGGAATATAGAAAATCCGTCCACCGTGTTCCCCGCGCCAGCGGGGATAAACCGTTATACTCTTTTCATCGACTAAGGAGGGGAGGGTGTTCCCCGCGCCAGCGGGGATAAACCGCAATAACGCAGCATCCAGGAAGCTGTTTCCGCGTGTTCCCCGCGCCAGCGGGGATAAACCGCGATCGGTGAAGAGGTCCGCGAAATACTCACTGTGTTCCCCGCGCCAGCGGGGATAAACCGGCGATAGTTGATTCAGCCGCGCCAGCGAATGTGTGTTCCCCGCGCCAGCGGGGATAAACCGGGGCCGCCGCGAATTTACACACGATTCAATACGTGTTCCCCGCGCCAGCGGGGATAAACCGAACTGGTGCGCGACGGCTGGCTAACAAAACGAGTGTTCCCCGCGCCAGCGGGGATAAACCGCGTGGCTGCGCTGGCCGTTGCAGCAGTTTGATGTGTTCCCCGCGCCAGCGGGGATAAACCGCGTAAACGCCCCGTCGCCATTAATTTCGGGGTGTGTTCCCCGCGCCAGCGGGGATAAACCGTGGGATGAGCAAATAACGTCGTTTCCTAGAAAGTGTTCCCCGCGCCAGCGGGGATAAACCGCCGCCGTGCCAGTGATCCTCATACGGCCTGTTGTGTTCCCCGCGCCAGCGGGGATAAACCGAAATTAAGAACGGCGTAAACGACGGCAGCATGCTGTTCCCCGCGCCAGCGGGGATAAACCGGTGATGATTCAGAGCAGACATTAGCCCGCGCTGTGTTCCCCGCGCCAGCGGGGATAAACCGGGTAAAAACACGGTCTGAACCGACATTCATGTGTGTTCCCCGCGTCAGCGGGGATAAACCG >CP029164|4|3|2029103-2030107|PILER-CR GTGTTCCCCGCGCCAGCGGGGATAAACCG CATTGAAAACATTGCCTTTATTTTATTTTTTG GTGTTCCCCGCGCCAGCGGGGATAAACCG GTGCCGCCGCTGGGCACTTCCTTCCCGTGAGT GTGTTCCCCGCGCCAGCGGGGATAAACCG CGGACGGTGGGAATATAGAAAATCCGTCCACC GTGTTCCCCGCGCCAGCGGGGATAAACCG TTATACTCTTTTCATCGACTAAGGAGGGGAGG GTGTTCCCCGCGCCAGCGGGGATAAACCG CAATAACGCAGCATCCAGGAAGCTGTTTCCGC GTGTTCCCCGCGCCAGCGGGGATAAACCG CGATCGGTGAAGAGGTCCGCGAAATACTCACT GTGTTCCCCGCGCCAGCGGGGATAAACCG GCGATAGTTGATTCAGCCGCGCCAGCGAATGT GTGTTCCCCGCGCCAGCGGGGATAAACCG GGGCCGCCGCGAATTTACACACGATTCAATAC GTGTTCCCCGCGCCAGCGGGGATAAACCG AACTGGTGCGCGACGGCTGGCTAACAAAACGA GTGTTCCCCGCGCCAGCGGGGATAAACCG CGTGGCTGCGCTGGCCGTTGCAGCAGTTTGAT GTGTTCCCCGCGCCAGCGGGGATAAACCG CGTAAACGCCCCGTCGCCATTAATTTCGGGGT GTGTTCCCCGCGCCAGCGGGGATAAACCG TGGGATGAGCAAATAACGTCGTTTCCTAGAAA GTGTTCCCCGCGCCAGCGGGGATAAACCG CCGCCGTGCCAGTGATCCTCATACGGCCTGTT GTGTTCCCCGCGCCAGCGGGGATAAACCG AAATTAAGAACGGCGTAAACGACGGCAGCATG CTGTTCCCCGCGCCAGCGGGGATAAACCG GTGATGATTCAGAGCAGACATTAGCCCGCGCT GTGTTCCCCGCGCCAGCGGGGATAAACCG GGTAAAAACACGGTCTGAACCGACATTCATGT GTGTTCCCCGCGTCAGCGGGGATAAACCG >CP029164|4|4|2029103-2030107|CRISPRCasFinder GTGTTCCCCGCGCCAGCGGGGATAAACCG CATTGAAAACATTGCCTTTATTTTATTTTTTG GTGTTCCCCGCGCCAGCGGGGATAAACCG GTGCCGCCGCTGGGCACTTCCTTCCCGTGAGT GTGTTCCCCGCGCCAGCGGGGATAAACCG CGGACGGTGGGAATATAGAAAATCCGTCCACC GTGTTCCCCGCGCCAGCGGGGATAAACCG TTATACTCTTTTCATCGACTAAGGAGGGGAGG GTGTTCCCCGCGCCAGCGGGGATAAACCG CAATAACGCAGCATCCAGGAAGCTGTTTCCGC GTGTTCCCCGCGCCAGCGGGGATAAACCG CGATCGGTGAAGAGGTCCGCGAAATACTCACT GTGTTCCCCGCGCCAGCGGGGATAAACCG GCGATAGTTGATTCAGCCGCGCCAGCGAATGT GTGTTCCCCGCGCCAGCGGGGATAAACCG GGGCCGCCGCGAATTTACACACGATTCAATAC GTGTTCCCCGCGCCAGCGGGGATAAACCG AACTGGTGCGCGACGGCTGGCTAACAAAACGA GTGTTCCCCGCGCCAGCGGGGATAAACCG CGTGGCTGCGCTGGCCGTTGCAGCAGTTTGAT GTGTTCCCCGCGCCAGCGGGGATAAACCG CGTAAACGCCCCGTCGCCATTAATTTCGGGGT GTGTTCCCCGCGCCAGCGGGGATAAACCG TGGGATGAGCAAATAACGTCGTTTCCTAGAAA GTGTTCCCCGCGCCAGCGGGGATAAACCG CCGCCGTGCCAGTGATCCTCATACGGCCTGTT GTGTTCCCCGCGCCAGCGGGGATAAACCG AAATTAAGAACGGCGTAAACGACGGCAGCATG CTGTTCCCCGCGCCAGCGGGGATAAACCG GTGATGATTCAGAGCAGACATTAGCCCGCGCT GTGTTCCCCGCGCCAGCGGGGATAAACCG GGTAAAAACACGGTCTGAACCGACATTCATGT GTGTTCCCCGCGTCAGCGGGGATAAACCG >CP029164|4|2|2029103-2030107|CRT GTGTTCCCCGCGCCAGCGGGGATAAACCG CATTGAAAACATTGCCTTTATTTTATTTTTTG GTGTTCCCCGCGCCAGCGGGGATAAACCG GTGCCGCCGCTGGGCACTTCCTTCCCGTGAGT GTGTTCCCCGCGCCAGCGGGGATAAACCG CGGACGGTGGGAATATAGAAAATCCGTCCACC GTGTTCCCCGCGCCAGCGGGGATAAACCG TTATACTCTTTTCATCGACTAAGGAGGGGAGG GTGTTCCCCGCGCCAGCGGGGATAAACCG CAATAACGCAGCATCCAGGAAGCTGTTTCCGC GTGTTCCCCGCGCCAGCGGGGATAAACCG CGATCGGTGAAGAGGTCCGCGAAATACTCACT GTGTTCCCCGCGCCAGCGGGGATAAACCG GCGATAGTTGATTCAGCCGCGCCAGCGAATGT GTGTTCCCCGCGCCAGCGGGGATAAACCG GGGCCGCCGCGAATTTACACACGATTCAATAC GTGTTCCCCGCGCCAGCGGGGATAAACCG AACTGGTGCGCGACGGCTGGCTAACAAAACGA GTGTTCCCCGCGCCAGCGGGGATAAACCG CGTGGCTGCGCTGGCCGTTGCAGCAGTTTGAT GTGTTCCCCGCGCCAGCGGGGATAAACCG CGTAAACGCCCCGTCGCCATTAATTTCGGGGT GTGTTCCCCGCGCCAGCGGGGATAAACCG TGGGATGAGCAAATAACGTCGTTTCCTAGAAA GTGTTCCCCGCGCCAGCGGGGATAAACCG CCGCCGTGCCAGTGATCCTCATACGGCCTGTT GTGTTCCCCGCGCCAGCGGGGATAAACCG AAATTAAGAACGGCGTAAACGACGGCAGCATG CTGTTCCCCGCGCCAGCGGGGATAAACCG GTGATGATTCAGAGCAGACATTAGCCCGCGCT GTGTTCCCCGCGCCAGCGGGGATAAACCG GGTAAAAACACGGTCTGAACCGACATTCATGT GTGTTCCCCGCGTCAGCGGGGATAAACCG
>CP029164.1|AWH69713.1|2028712_2029006_+|type-I-E-CRISPR-associated-endoribonuclease-Cas2 MSMVVVVTENVPPRLRGRLAIWLLEVRAGVYVGDTSKRIREMIWQQISQLAGCGNVVMAWATNTESGFEFQTWGENRRIPVDLDGLRLVSFLPVDNQ >CP029164.1|AWH69712.1|2027792_2028716_+|type-I-E-CRISPR-associated-endonuclease-Cas1 MTFVPLSPIPLKDRTSMIFLQYGQIDVLDGAFVLIDKTGIRTHIPVGSVACIMLEPGTRVSHAAVHLAATVGTLLVWVGEAGVRVYSSGQPGGARADKLLYQAKLALTEDLRLKVVRKMYELRFREPPPARRSVEQLRGIEGSRVRQTYALLAKQYGVKWNGRKYDPKDWEKGDVVNRCISAATSCLYGISEAAVLAAGYAPAIGFIHSGKPLSFVYDIADIIKFDSVVPKAFEIAARQPAEPDKEVRLACRDIFRSSKLTGKLIPLIEEVLAAGEIEPPQPAPDMLPPAIPEPETLGDSGHRGRGG >CP029164.1|AWH69711.1|2027145_2027796_+|type-I-E-CRISPR-associated-protein-Cas6/Cse3/CasE MYLSRITLHTGQLSPAQLLHLVDRGEYVMHQWLWDLFPGGKERQFLYRREELQGAFRFFVLSQERPAESDIFTIECRPFAPELRTGQSLCFNLRANPTICKAGKRHDLLMEAKRQVRGQVEGSDVWLHQQQAALDWLAAQGERSGFTLLDTSVDAYRQQQLRRENSRQLIQFSSVDYTGMLTVTDPGLFLQRLSQGYGKSRAFGCGLMLIKPGAEA >CP029164.1|AWH69710.1|2026417_2027164_+|type-I-E-CRISPR-associated-protein-Cas5/CasD MSQYLIFQLHGPMASWGVDAPGEVRHTHELPSRSALLGLLAAGVGIRRDDTERLNAFNRHYSLVVCASRNPRWARDYHTVQMPKEVRKARYFSRREELSAPDLLSAIISRRDYYTDAWWMVAVATTPDAPYSLEQLQDGLRHPVFPLYLGRKSHPLALPLAPLLLEGNASDVLRNAYQQYQDSFRELKVSLPKLQDECWWEGEHDGLVASKILRRRDVPLNRQQWLFGERTINQGPWLSKEEPCTSQE >CP029164.1|AWH69709.1|2025351_2026407_+|type-I-E-CRISPR-associated-protein-Cas7/Cse4/CasC MTTFIQLHLLTAYPAANLNRDDTGAPKTVVLGGATRLRVSSQSLKRAWRTSALFEQALAGHIGIRSGRIAREAATILIEKGIEEKKAIEWAAKIADYLGKAKNDKKPKDPLTNAETEQLVHISPAEFDAVKALAHQLAEEKRAPKEEDLALLRKDRMAVDIAMFGRMLANKPEFNVEAACQVAHAFGVSETIVEDDFFTAVDDLRQASEDAGAGHLGETGFGSALFYTYICIDKDLLVENLGGDEALANQTLRAFTEAALKVSPTGKQNSFASRAYASWALAEKGTEQPRSLAAAFYEPINGTLQLDVAVQRITTLRENMNTVYEQKTECASFDVMNKQGSMKDVLDFICA >CP029164.1|AWH69708.1|2024800_2025337_+|type-I-E-CRISPR-associated-protein-Cse2/CasB MSIVKEEHKATLRKWHEELQEKRGNRASLRRSTTVNDVCLSEGFRSLLMQTHTLWKIESQEWRFTALALVAAVAANVKAIDERQPFAAQLAAVMSEGRFTRLSAVKTPDELLRQLRRAVKLLNGSVNLISLADDIFRWCQESDDLLNHHRRQQRPTEFIRIRWALEYYQAGDADNEQN >CP029164.1|AWH69707.1|2020486_2023144_+|CRISPR-associated-helicase/endonuclease-Cas3 MTFFYFWGKTRRGEKDGGEDYHLLCWHSLDVAAMGYLMVKSNCFGLAGYFRQLGFADTELAAQFFAWLLCWHDIGKFARSFQQLYLHPQLKVPEGARKNYEKISHSTLGYWLWNHYLSECQELLPSSSLSPRKLKRVMEMWMPMTTGHHGRPPERMDELDNFLPEDKGAARDFLLAIKVLFPLIEIPTFWDDDEGVELIKQLSWYISATVVLADWTGSSTRYFPRVAQAMDIEDYWQKTLVQAQNALTVFPPKAESAPFTGINTLFPFIENPTPLQQKVLDLDISQPGPQLFILEDVTGAGKTEAALILAHRLMAAGKAQGLFFGLPTMATANAMYDRLVKTWLAFYSPESRPSLVLAHSARTLMDRFNESLWSGDLVGSEEPDEQTFSQGCAAWFADSNKKALLAEIGVGTLDQAMMAVMPFKHNNLRLLGLSNKILLADEIHACDAYMSCILEGLIERQARGGNSVILLSATLSQQQRDKLVAAFARGIEGQQEAPLLEKDDYPWLTHVTKSDVHSHRVATRKDVERSVSVGWLHSEQECIARIESAVSQGKCIAWIRNSVDDAIQVHRQLLARGVIPASSLSLFHSRFAFSDRQRIETETLARFGKEDCSQRAGKVLICTQVLEQSVDCDLDEMISDLAPVDLLIQRAGRLQRHIRDINGLLKRDGKDERSPPEFLILAPVWDDSPGDEWFGSAMRNSAYVYPDHGRIWLTQRVLREQGAIQMPHAARLLIESVYGEDVAMPEGFARSEQEQVGKYYCDRAMAKKFVLNFKPGYAANINDYLPEKLSTRLAEESVSLWLATCIDGVVKPYATGAHAWEMSVVRVRRSWWKKHRDEFSLLEGDAFRQWCIEQRQDPEMANVILVTDDESCGYSAMEGLTGKVG >CP029164.1|AWH69706.1|2020096_2020249_+|Hok/Gef-family-protein MLTKYALVAIIVLCCTVLGFTLMVGDSLCELSIRERGMEFKAVLAYESKK >CP029164.1|AWH69705.1|2019098_2019833_+|phosphoadenosine-phosphosulfate-reductase MSKLDLNALNELPKVDRILALAETNAELEKLDAEGRVAWALDNLPGEYVLSSSFGIQAAVSLHLVNQIRPDIPVILTDTGYLFPETYRFIDELTDKLKLNLKVYRATESAAWQEARYGKLWEQGVEGIEKYNEINKVEPMNRALKELNAQTWFAGLRREQSGSRANLPVLAIQRGVFKVLPIIDWDNRTIYQYLQKHGLKYHPLWDEGYLSVGDTHTTRKWEPGMAEEETRFFGLKRECGLHEG >CP029164.1|AWH69704.1|2017312_2019025_+|assimilatory-sulfite-reductase-(NADPH)-hemoprotein-subunit MSEKHPGPLVVEGKLTDAERMKLESNYLRGTIAEDLNDGLTGGFKGDNFLLIRFHGMYQQDDRDIRAERAEQKLEPRHAMLLRCRLPGGVITTKQWQAIDKFAGENTIYGSIRLTNRQTFQFHGILKKNVKPVHQMLHSVGLDALATANDMNRNVLCTSNPYESQLHAEAYEWAKKISEHLLPRTRAYAEIWLDQEKVATTDEEPILGQTYLPRKFKTTVVIPPQNDIDLHANDMNFVAIAENGKLVGFNLLVGGGLSIEHGNKKTYARTASEFGYLPLEHTLAVAEAVVTTQRDWGNRTDRKNAKTKYTLERVGVETFKAEVERRAGIKFEPIRPYEFTGRGDRIGWVKGIDDNWHLTLFIENGRILDYPGRPLKTGLLEIAKIHKGDFRITANQNLIIAGVPESEKAKIEKVAKESGLMNAVTPQRENSMACVSFPTCPLAMAEAERFLPSFIDNIDNLMAKHGVSDEHIVMRVTGCPNGCGRAMLAEVGLVGKAPGRYNLHLGGNRIGTRIPRMYKENITEPEILASLDELIGRWAKEREAGEGFGDFTVRAGIIRPVLDPARDLWD >CP029164.1|AWH69714.1|2030188_2031226_-|aminopeptidase MFSALRHRTAALALGVCFILPVHASSPKPGDFANTQARHIATFFPGRMTGTPAEMLSADYIRQQFQQMGYRSDIRTFNSRYIYTARDNRKSWHNVTGSTVIAAHEGKAPQQIIIMAHLDTYAPLSDADADANLGGLTLQGMDDNAAGLGVMLELAERLKNTPTEYGIRFVATSGEEEGKLGAENLLKRMSDTEKKNTLLVINLDNLIVGDKLYFNSGVKTPEAVRKLTRDRALAIARSHGIAATTNPGLNKNYPKGTGCCNDAEVFDKAGIAVLSVEATNWNLGNKDGYQQRAKTAAFPAGNSWHDVRLDNQQHIDKALPGRIERRCRDVMRIMLPLVKELAKAS >CP029164.1|AWH69715.1|2031477_2032386_+|sulfate-adenylyltransferase-subunit-2 MDQKRLTHLRQLEAESIHIIREVAAEFSNPVMLYSIGKDSSVMLHLARKAFYPGTLPFPLLHVDTGWKFREMYEFRDRTAKAYGCELLVHKNPEGVAMGINPFVHGSAKHTDIMKTEGLKQALNKYGFDAAFGGARRDEEKSRAKERIYSFRDRFHRWDPKNQRPELWHNYNGQINKGESIRVFPLSNWTEQDIWQYIWLENIDIVPLYLAAERPVLERDGMLMMIDDNRIDLQPGEVIKKRMVRFRTLGCWPLTGAVESNAQTLPEIIEEMLVSTTSERQGRVIDRDQAGSMELKKRQGYF >CP029164.1|AWH69716.1|2032387_2033815_+|sulfate-adenylyltransferase MNTALAQQIANEGGVEAWMIAQQHKSLLRFLTCGSVDDGKSTLIGRLLHDTRQIYEDQLSSLHNDSKRHGTQGEKLDLALLVDGLQAEREQGITIDVAYRYFSTEKRKFIIADTPGHEQYTRNMATGASTCELAILLIDARKGVLDQTRRHSFISTLLGIKHLVVAINKMDLVDYSEETFTRIREDYLTFAGQLPGNLDIRFVPLSALEGDNVASQSESMPWYSGPTLLEVLETVEIQRVVDAQPMRFPVQYVNRPNLDFRGYAGTLASGRVEVGQRVKVLPSGVESNVARIVTFDGDREEAFSGEAITLVLTDEIDISRGDLLLAADEALPAVQSASVDVVWMAEQPLSPGQSYDIKIAGKKTRARVDGIRYQVDINNLTQREVENLPLNGIGLVDLTFDEPLVLDRYQQNPVTGGLIFIDRLSNVTVGAGMVHEPVSQATAAPSEFSAFELELNALVRRHFPHWGARDLLGDK >CP029164.1|AWH69717.1|2033814_2034420_+|adenylyl-sulfate-kinase MALHDENVVWHSHPVTPQQREQHHGHRGVVLWFTGLSGSGKSTVAGALEEALHKLGVSTYLLDGDNVRHGLCSDLGFSDADRKENIRRVGEVANLMVEAGLVVLTAFISPHRAERQMVRERVGEGRFIEVFVDTPLAICEARDPKGLYKKARAGELRNFTGIDSVYEAPESAEIHLNGEQLVTNLVQQLLDLLRQNDIIRS >CP029164.1|AWH69718.1|2034469_2034793_+|hypothetical-protein MRNSHNITLTNNDSLTEDEETTWSLPGAVVGFISWLFALAMPMLIYGSNTLFFFIYTWPFFLALMPVAVVVGIALHSLMDGKLRYSIVFTLVTVGIMFGALFMWLLG >CP029164.1|AWH69719.1|2034986_2035298_+|cell-division-protein-FtsB MGKLTLLLLAILVWLQYSLWFGKNGIHDYTRVNNDVAAQQATNAKLKARNDQLFAEIDDLNGGQEALEERARNELSMTRPGETFYRLVPDASKRAQSAGQNNR >CP029164.1|AWH69720.1|2035316_2036027_+|2-C-methyl-D-erythritol-4-phosphate-cytidylyltransferase MATTHLDVCAVVPAAGFGRRMQTECPKQYLSIGNQTILEHSVHALLAHPRVKHVVIAISPGDSRFAQLPLANHPQITVVDGGEERADSVLAGLKAAGDAQWVLVHDAARPCLHQDDLARLLALSETSRTGGILAAPVRDTMKRAETGKNAIAHTVDRNGLWHALTPQFFPRELLHDCLTRALNEGATITDEASALEYCGFHPQLVEGRADNIKVTRPEDLALAEFYLTRTIHQENT >CP029164.1|AWH69721.1|2036026_2036506_+|2-C-methyl-D-erythritol-2,4-cyclodiphosphate-synthase MRIGHGFDVHAFGGEGPIIIGGVRIPYEKGLLAHSDGDVALHALTDALLGAAALGDIGKLFPDTDPAFKGADSRELLREAWRRIQAKGYALGNVDVTIIAQAPKMLPHIPQMRVFIAEDLGCHMDDVNVKATTTEKLGFTGRGEGIACEAVALLIKATK >CP029164.1|AWH69722.1|2036502_2037552_+|tRNA-pseudouridine(13)-synthase-TruD MIEFDNLTYLHGKPQGTGLLKANPEDFVVVEDLGFEPDGEGEHILVRILKNGCNTRFVADALAKFLKIHAREVSFAGQKDKHAVTEQWLCARVPGKEMPDLSAFQLEGCQVMEYARHKRKLRLGALKGNAFTLVLREVSNRDDVEQRLIDICVKGVPNYFGAQRFGIGGSNLQGALRWAQTNTPVRDRNKRSFWLSAARSALFNQIVAERLKKADVNQVVDGDALQLAGRGSWFVATTEELAELQRRVNDKELMITAALPGSGEWGTQREALAFEQAAVAEETELQTLLVREKVEAARRAMLLYPQQLSWNWWDDVTVEIRFWLPAGSFATSVVRELINTTGDYAHIAE >CP029164.1|AWH69723.1|2037532_2038294_+|5'/3'-nucleotidase-SurE MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFENGDIAVQMGTPTDCVYLGVNALMRPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLDGHKHYDTAAAVTCSILRALCKEPLRTGRILNINVPDLPLDQIKGIRVTRCGTRHPADQVIPQQDPRGNTLYWIGPPGGKCDAGPGTDFAAVDEGYVSITPLHVDLTAHSAQDVVSDWLNSVGVGTQW |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP029164_5 | 3135822-3135945 | Orphan |
NA
Consensus repeat of CP029164_5
|
1 spacers
spacers of CP029164_5
>5.1|3135865|38|CP029164|CRISPRCasFinder CGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAA |
CRISPR arrays and Neighbor proteins around CP029164_5
The CRISPR arrays of CP029164_5 >merge|CP029164|5|3135822-3135945|CRISPRCasFinder CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTACGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAACGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA >CP029164|5|5|3135822-3135945|CRISPRCasFinder CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA CGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAA CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA
>CP029164.1|AWH70716.1|3135381_3135687_-|monooxygenase MATLLQLHFAFNGPFGDAMVEQLEPLAESINQEPGFLWKVWTESEKNHEAGGIYLFTDEKSALAYLEKHTARLKNLGVEEVVAKVFDVNEPLSQINQAKLA >CP029164.1|AWH70715.1|3133645_3135256_-|FAD-NAD(P)-binding-protein MKKIAIVGAGPTGIYTLFSLLQQQTPLSISIFEQADEAGVGMPYSDEENSKMMLANIASIEIPPIYCTYLEWLQKQEASHLQRYGVKKETLHDRQFLPRILLGEYFRDQFLRLVDQARQQKFAVAVYESCQVTDLQITNAGVMLATNQDLPSETFDLAVIATGHVWPDEEEATRTYFPSPWSGLMEAKVDACNVGIMGTSLSGLDAAMAVAIQHGSFIEDDKQHVVFHRDNASEKLNITLMSRTGILPEADFYCPIPYEPLHIVTDQALNAEIQKGEVGLLDRVFRLIVEEIKFADPDWSQRIALESLNVDSFAQAWFAERKQRDPFDWAEKNLQEVERNKREKHTVPWRYVILRLHEAVQEIVPHLNEHDHKRFSKGLARVFIDNYAAIPSESIRRLLALREAGIIHILALGEDYEMEINESRTVLKTEDNSYSFDVFIDARGQRPLKVKDIPFPGLREQLQKTGDEIPDVGEDYTLQQPEEIRGRVAFGALPWLMHDQPFVQGLTACAEIGEAIAQAVVKPASRARRRLSFNQM >CP029164.1|AWH70714.1|3132827_3133640_+|hypothetical-protein MIITRADLREWRIGAVMYRWFLRHFPRGGSYADIHHALIEEGYTDWAESLVEYAWKKWLADENFAHQEVSSMQKLATDPGERAFCSQFARSDDHARIGCCEDNARIATAGYAAQIASMGYSVRIGSVGFNSHIGSSGERARVAVTGNSSRISSAGDSSRIANTGMRVRVCTLGERCHVASNGDLVQIASFGANARIANSGDNVHIIASGEDSTIVSTGVVDSIILGPGGSAALAYHDGERVRFAVAIEGENNIRAGVRYRLNEQHQFVEC >CP029164.1|AWH70713.1|3132038_3132824_+|thiosulfate-reductase-cytochrome-B-subunit MNPSQHAEQFQSQLANYVPLFTPQFWPVWLIIAGLLLVGMWLVLGLHALLRARGVKKSANDHGEKVYLYSKAVRLWHWSNALLFVLLLASGLINHFALVGATTVKSLVAVHEVCGFLLLACWLGFVLINAVGGNGHHYRIRRQGWLERAAKQTRFYLFGIMQGEEHPFPATTQSKFNPLQQVAYVGVMYGLLPLLLLTGLLCLYPQAVGDVFPGVRYWLLQAHFALAFISLFFIFGHLYLCTTGRTPHETFKSMVDGYHRH >CP029164.1|AWH70712.1|3131373_3132042_+|4Fe-4S-dicluster-domain-containing-protein MSFTRRKFVLGMGTVIFFTGSASSLLANTMQEKKVRYAMIHDESRCNGCNICARACRKTNHVPAQGSRLSIAHIPVTDNDNETQYHFFRQSCQHCEDAPCIDVCPTGASWRDEQGIVRVEKSQCIGCSYCIGACPYQVRYLNPVTKVADKCDFCAESRLAKGFPPICVSACPEHALIFGREDSPEIQAWLQENKYYQYQLPGAGKPHLYRRFGQHLIKKENV >CP029164.1|AWH70711.1|3130662_3131310_+|hypothetical-protein MGEMNHQDELPLAKVSEVDEAKRQWLQGMRHPVDTVTEPEPAEILAEFIRQHSAAGQLVARAVFLSPPYSVAEEELSVLLENIKQNGDHADIACMTGSQDDYYYSTQAMSENYAAMSLQVVEQDICRAIAHAVRFECQTYPRPYKVAMLMQAPYYFQEAQIEAAIAAMDIAPEYADIRQVESSTAVLYLFSERFMTYGKAYGLCEWFEVEQFQNP >CP029164.1|AWH70710.1|3128556_3130659_+|aldehyde-ferredoxin-oxidoreductase MANGWTGNILRVNLTTGNITLEDSSKFKSFVGGMGFGYKIMYDEVPPGTKPFDEANKLVFATGPLTGSGAPCSSRVNITSLSTFTKGNLVVDAHMGGFFAAQMKFAGYDVIIIEGKAKSPVWLNIKDDKVSLEKADLLWGKGTRATTEEICRLTSPETCVAAIGQAGENLVPLSGMLNSRNHSGGAGTGAIMGSKNLKAIAVEGTKGVNIADRQEMKRLNDYMMTELIGANNNHVVPSTPQSWAEYSDPKSRWTARKGLFWGAAEGGPIETGEIPPGNQNTVGFRTYKSVFDLGPAAEKYTVKMSGCHSCPIRCMTQMNIPRVKEFGVPSTGGNTCVANFVHTTIFPNGPKDFEDKDDGRVIGNLVGLNLFDDYGLWCNYGQLHRDFTYCYSKGVFKRVLPAEEYAEIRWDQLEAGDVNFIKDFYYRLAHRVGELSHLADGSYAIAERWNLGEEYWGYAKNKLWSPFGYPVHHANEASAQVGSIVNCMFNRDCMTHTHINFIGSGLPLKLQREVAKELFGSEDAYDETKNYTPINDAKIKYAKWSLLRVCLHNAVTLCNWVWPMTVSPLKSRNYRGDLALEAKFFKAITGEDMTQEKLDLAAERIFTLHRAYTVKLMQTKDMRNEHDLICSWVFDKDPQIPVFTEGTDKMDRDDMHASLTMFYKEMGWDPQLGCPTRKTLQRLGLEDIAADLAAHNLLPA >CP029164.1|AWH70709.1|3127909_3128536_+|4Fe-4S-dicluster-domain-containing-protein MTPVDRPLLNIGLTRLEFLRISGKGLAGLTIAPALLSLLGCKQEDIDSGTVGLINTPKGVLVTQRARCTGCHRCEISCTNLNDGSVGTFFSRIKIHRNYFFGDNGVGSGGGLYGDLNYTADTCRQCKEPQCMNVCPIGAITWQQKEGCITVDHKRCIGCSACTTACPWMMATVNTESKKSSKCVLCGECANACPTGALKIIEWKDITV >CP029164.1|AWH70708.1|3127244_3127454_+|fumarate-hydratase-FumD MGNRTKEDELYREMCRVVGKVVLEMRDLGQEPKHIVIAGVLRTALANKRIQRSELEKQAMETVINALVK >CP029164.1|AWH70707.1|3125275_3126688_-|pyruvate-kinase MKKTKIVCTIGPKTESEEMLAKMLDAGMNVMRLNFSHGDYAEHGQRIQNLRNVMSKTGKTAAILLDTKGPEIRTMKLEGGNDVSLKAGQTFTFTTDKSVIGNSEMVAVTYEGFTTDLSVGNTVLVDDGLIGMEVTAIEGNKVICKVLNNGDLGENKGVNLPGVSIALPALAEKDKQDLIFGCEQGVDFVAASFIRKRSDVIEIREHLKAHGGENIHIISKIENQEGLNNFDEILEASDGIMVARGDLGVEIPVEEVIFAQKMMIEKCIRARKVVITATQMLDSMIKNPRPTRAEAGDVANAILDGTDAVMLSGESAKGKYPLEAVSIMATICERTDRVMNSRLEFNNDNRKLRITEAVCRGAVETAEKLDAPLIVVATQGGKSARAVRKYFPDATILALTTNEKTAHQLVLSKGVVPQLVKEITSTDDFYRLGKELALQSGLAHKGDVVVMVSGALVPSGTTNTASVHVL >CP029164.1|AWH70717.1|3136259_3137516_+|hypothetical-protein MGSDAKNLMSDGNVQIVKTGEVIGATQLTEGELIVEAGGRAENTVVTGAGWLKVATGGIAKCTQYGNNGTLSVSDGAIATDIVQSEGGAISLSTLATVNGRHPEGEFSVDKGYACGLLLENGGNLRVLEGHRAEKIILDQEGGLLVNGTTSAVVVDEGGELLVYPGGEASNCEINQGGVFMLAGKANDTLLAGGTMNNLGGEDSDTIVENGAIYRLGTDGLQLYSSGKTQNLSVNVGGRAEVHAGTLENAVIQGGTVILLSPTSADENFVVEEDRAPVELTGSVALLDGASMIIGYGADLQQSTITVQQGGVLILDGSTVKGDSVTFSVGNINLNGGKLWLITDAATQVHLKVKRLRGEGAICLQTSAKEISPDFINVKGEVTGDIHVEITDASRQTLCNALKLQPDEDGIGATLQPA >CP029164.1|AWH70718.1|3137556_3138930_-|multidrug-resistance-protein-MdtK MQKYISEARLLLALAIPVILAQIAQTAMGFVDTVMAGGYSATDMAAVAIGTSIWLPAILFGHGLLLALTPVIAQLNGSGRRERIAHQVRQGFWLAGFVSVLIMLVLWNAGYIIRSMENIDPALADKAVGYLRALLWGAPGYLFFQVARNQCEGLAKTKPGMVMGFIGLLVNIPVNYIFIYGHFGMPELGGVGCGVATAAVYWVMFLAMVSYIKRARSMRDIRNEKGTAKPDPAVMKRLIQLGLPIALALFFEVTLFAVVALLVSPLGIVDVAGHQIALNFSSLMFVLPMSLAAAVTIRVGYRLGQGSTLDAQTAARTGLMVGVCMATLTAIFTVSLREQIALLYNDNPEVVTLAAHLMLLAAVYQISDSIQVIGSGILRGYKDTRSIFYITFTAYWVLGLPSGYILALTDLVVEPMGPAGFWIGFIIGLTSAAIMMMLRMRFLQRLPSAIILQRAAR >CP029164.1|AWH70719.1|3139144_3139786_+|riboflavin-synthase MFTGIVQGTAKLVSIDEKPNFRTHVVELPDHMLEGLETGASVAHNGCCLTVTEINGNHVSFDLMKETLRITNLGDLKVGDWVNVERAAKFSDEIGGHLMSGHIMTTAEVAKILTSENNRQIWFKVQDSQLMKYILYKGFIGIDGISLTVGEVTPTRFCVHLIPETLERTTLGKKKLGARVNIEIDPQTQAVVDTVERVLAARENAMNQPGTEA >CP029164.1|AWH70720.1|3139825_3140974_-|cyclopropane-fatty-acyl-phospholipid-synthase MSSSCIEEVSVPDDNWYRIANELLSRAGIAINGSAPADIRVKNPDFFKRVLQEGSLGLGESYMDGWWECDRLDMFFSKVLRAGLENQLPHHFKDTLRIAGARLFNLQSKKRAWIVGKEHYDLGNDLFSRMLDPFMQYSCAYWKDADNLESAQQAKLKMICEKLQLKPGMRVLDIGCGWGGLAHYMASNYDVSVVGVTISAEQQKMAQERCEGLDVTILLQDYRDLNDQFDRIVSVGMFEHVGPKNYDTYFAVVDRNLKPEGIFLLHTIGSKKTDLNVDPWINKYIFPNGCLPSVRQIAQSSEPHFVMEDWHNFGADYDTTLMAWYERFLAAWPEIADNYSERFKRMFTYYLNACAGAFRARDIQLWQVVFSRGVENGLRVAR >CP029164.1|AWH70721.1|3141264_3142476_-|Bcr/CflA-family-multidrug-efflux-MFS-transporter MQPGKRFLVWLAGLSVLGFLATDMYLPAFAAIQADLQTPASAVSASLSLFLAGFAAAQLLWGPLSDRYGRKPVLLIGLTIFALGSLGMLWVENAATLLVLRFVQAVGVCAAAVIWQALVTDYYPSQKVNRIFATIMPLVGLSPALAPLLGSWLLVHFSWQAIFATLFAITVVLILPIFWLKPTTKARNNSQDGLTFTDLLRSKTYRGNVLIYAACSASFFAWLTGSPFILSEMGYSPAVIGLSYVPQTIAFLIGGYGCRAALQKWQGKQLLPWLLVLFAVSVIATWAAGFISHVSLVEILIPFCVMAIANGAIYPIVVAQALRPFPHATGRAAALQNTLQLGLCFLASLVVSWLISISTPLLTTTSVMLSTVVLVALGYMMQRCEEVDCQNHGNAEVAHSESH >CP029164.1|AWH70722.1|3142588_3143521_+|LysR-family-transcriptional-regulator MWSEYSLEVVDAVARNGSFSAAAQELHRVPSAVSYTVRQLEEWLAVPLFERRHRDVELTAAGAWFLKEGRSVVKKMQITRQQCQQIANGWRGQLAIAVDNIVRPERTRQMIVDFYRHFDDVELLVFQEVFNGVWDALSDGRVELAIGATRAIPVGGRYAFRDMGMLSWSCVVASHHPLALMDGPFSDDTLRNWPSLVREDTSRTLPKRITWLLDNQKRLVVPDWESSATCISAGLCIGMVPTHFAKPWLNEGKWVALELENPFPDSACCLTWQQNDMSPALTWLLEYLGDSETLNKEWLREPEETPATGD >CP029164.1|AWH70723.1|3143517_3144543_-|PurR-family-transcriptional-regulator MATIKDVAKRANVSTTTVSHVINKTRFVAEETRNAVWAAIKELHYSPSAVARSLKVNHTKSIGLLATSSEAAYFAEIIEAVEKNCFQKGYTLILGNAWNNLEKQRAYLSMMAQKRVDGLLVMCSEYPEPLLAMLEEYRHIPMVVMDWGEAKADFTDAVIDNAFEGGYMAGRYLIERGHREIGVIPGPLERNTGAGRLAGFMKAMEEAMIKVPESWIVQGDFEPESGYRAMQQILSQPHRPTAVFCGGDIMAMGALCAADEMGLRVPQDVSLIGYDNVRNARYFTPALTTIHQPKDSLGETAFNMLLDRIVNKREEPQSIEVHPRLIERRSVADGPFRDYRR >CP029164.1|AWH70724.1|3144530_3144755_-|hypothetical-protein MISVFTTSPFRQDRPKFHAYTICVLAIDPFLTLRVVFPAYRNTFVVRKVCKGKRLPCDFAGAEVRVWSEMEWQQ >CP029164.1|AWH70725.1|3144841_3144931_+|YnhF-family-membrane-protein MSTDLKFSLVTTIIVLGLIVAVGLTAALH >CP029164.1|AWH70726.1|3145096_3146266_+|MFS-transporter MKINYPLLALAIGAFGIGTTEFSPMGLLPVIARGVDVSIPAAGMLISAYAVGVMVGAPLMTLLLSHRARRSALIFLMAIFTLGNVLSAIAPDYMTLMLSRILTSLNHGAFFGLGSVVAASVVPKHKQASAVATMFMGLTLANIGGVPAATWLGETIGWRMSFLATAGLGVISMVSLFFSLPKGGAGARPEVKKELAVLMRPQVLSALLTTVLGAGAMFTLYTYISPVLQSITHATPVFVTAMLVLIGVGFSIGNYLGGKLADRSVNGTLKGFLLLLMVIMLAIPFLARNEFGAAISMVVWGAATFAVVPPLQMRVMRVASEAPGLSSSVNIGAFNLGNALGAAAGGAVISAGLGYSFVPVMGAIVAGLALLLVFMSARKQPEAVCVANS |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CP029164_2 | 2.1|917746|59|CP029164|CRISPRCasFinder | 917746-917804 | 59 | CP029164.1 | 1712076-1712134 | 2 | 0.966 |
1. spacer 2.1|917746|59|CP029164|CRISPRCasFinder matches to position: 1712076-1712134, mismatch: 2, identity: 0.966
gaaggcagagggagacagtctgcgcggtgagataggcggtgtatacagagatgcccgtg CRISPR spacer gaaggcagagggagacagtctgcgtggtgagataggtggtgtatacagagatgcccgtg Protospacer ************************.***********.**********************
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_CP038506 | Escherichia coli strain 28Eco12 plasmid p28Eco12, complete sequence | 11835-11866 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_CP019283 | Escherichia coli strain 13P484A plasmid p13P484A-3, complete sequence | 42147-42178 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | CP042641 | Escherichia coli strain NCYU-24-74 plasmid pNCYU-24-74-3, complete sequence | 36520-36551 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_CP034821 | Salmonella sp. SSDFZ54 plasmid pTB502, complete sequence | 1557-1588 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_CP027443 | Escherichia coli strain 2013C-3252 plasmid unnamed1, complete sequence | 9506-9537 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_CP027575 | Escherichia coli strain 2013C-4081 plasmid unnamed2 | 94756-94787 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_CP027223 | Escherichia coli strain 2015C-3101 plasmid unnamed2 | 65534-65565 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_CP030188 | Salmonella enterica strain SA20094620 plasmid pSA20094620.3, complete sequence | 61916-61947 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_CP027590 | Escherichia coli strain 2014C-3011 plasmid unnamed2 | 64165-64196 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_CP039862 | Salmonella enterica subsp. enterica serovar 1,4,[5],12:i:- strain PNCS014880 plasmid p16-6773.2, complete sequence | 20286-20317 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_CP033632 | Escherichia coli isolate ECCNB12-2 plasmid pTB-nb1, complete sequence | 119284-119315 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_AP018798 | Escherichia coli strain E2855 plasmid pE2855-2, complete sequence | 85099-85130 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | CP012494 | Escherichia coli strain CFSAN004177 plasmid pCFSAN004177G_03, complete sequence | 46688-46719 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | CP027320 | Escherichia coli strain 2014C-3084 plasmid unnamed1 | 42116-42147 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NC_013370 | Escherichia coli O111:H- str. 11128 plasmid pO111_2, complete sequence | 87237-87268 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_CP026475 | Escherichia coli strain KBN10P04869 plasmid pKBN10P04869B, complete sequence | 81853-81884 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | MN510445 | Escherichia coli strain JIE250 plasmid pJIE250_3, complete sequence | 70500-70531 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | MN510447 | Escherichia coli strain TZ20_1P plasmid pTZ20_1P, complete sequence | 47861-47892 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_CP012491 | Escherichia coli strain CFSAN004176 plasmid pCFSAN004176P_03, complete sequence | 62997-63028 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | MH422554 | Escherichia phage P1, complete genome | 87194-87225 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NC_050152 | Enterobacteria phage P7, complete genome | 93995-94026 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | MH445380 | Escherichia virus P1 isolate transconjugant 2(L-II), complete genome | 58645-58676 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NC_031129 | Salmonella phage SJ46, complete genome | 77363-77394 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | MH445381 | Escherichia virus P1, complete genome | 56870-56901 | 0 | 1.0 |
CP029164_6 | 6.1|3774268|40|CP029164|CRISPRCasFinder | 3774268-3774307 | 40 | NZ_CP041417 | Escherichia coli strain STEC711 plasmid pSTEC711_1, complete sequence | 47951-47990 | 1 | 0.975 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NZ_CP032392 | Salmonella enterica subsp. enterica serovar Dublin strain CVM 34981 plasmid p34981_2, complete sequence | 28190-28221 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NZ_CP022965 | Salmonella enterica subsp. enterica serovar Pullorum strain QJ-2D-Sal plasmid pQJDsal2, complete sequence | 29440-29471 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NC_007208 | Salmonella enterica OU7025 plasmid pOU1113, complete sequence | 19749-19780 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NZ_HG970001 | Salmonella enterica subsp. enterica serovar Gallinarum str. 287/91 plasmid pSG, complete sequence | 38393-38424 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NZ_CP032386 | Salmonella enterica subsp. enterica serovar Dublin strain CVM N53043 plasmid pN53043_2, complete sequence | 3653-3684 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NZ_CP032388 | Salmonella enterica subsp. enterica serovar Dublin strain CVM N45955 plasmid pN45955_1, complete sequence | 42554-42585 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NZ_CP030208 | Salmonella enterica strain SA19992307 plasmid pSA19992307.1, complete sequence | 4890-4921 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NZ_CP032450 | Salmonella enterica subsp. enterica serovar Dublin strain USMARC-69838 plasmid pSDU1-USMARC-69838, complete sequence | 68762-68793 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NC_011204 | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 plasmid pCT02021853_74, complete sequence | 31020-31051 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NC_019106 | Salmonella enterica subsp. enterica serovar Dublin plasmid pSD_77, complete sequence | 75580-75611 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NC_019112 | Salmonella enterica subsp. enterica serovar Pullorum plasmid pSPUV, complete sequence | 20322-20353 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NZ_CP012348 | Salmonella enterica subsp. enterica serovar Pullorum str. ATCC 9120 plasmid pCFSAN000725_01, complete sequence | 42630-42661 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NZ_CP019180 | Salmonella enterica subsp. enterica serovar Dublin str. ATCC 39184 plasmid pATCC39184, complete sequence | 59477-59508 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NZ_CP032394 | Salmonella enterica subsp. enterica serovar Dublin strain CVM 22453 plasmid p22453_2, complete sequence | 17037-17068 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NC_010422 | Salmonella enterica subsp. enterica serovar Dublin plasmid pOU1115, complete sequence | 19784-19815 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NZ_CP032381 | Salmonella enterica subsp. enterica serovar Dublin strain USMARC-69807 plasmid pSDU2-USMARC-69807, complete sequence | 47366-47397 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NZ_CP032447 | Salmonella enterica subsp. enterica serovar Dublin strain USMARC-69840 plasmid pSDU1-USMARC-69840, complete sequence | 58031-58062 | 2 | 0.938 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_KP453775 | Klebsiella pneumoniae strain ST11 plasmid pKP12226, complete sequence | 17440-17471 | 2 | 0.938 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_CP017632 | Escherichia coli strain SLK172 plasmid pSLK172-1, complete sequence | 122310-122341 | 2 | 0.938 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_CP030285 | Escherichia coli strain E308 plasmid pLKSZ04, complete sequence | 6029-6060 | 2 | 0.938 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_CP036204 | Escherichia coli strain L725 plasmid punnamed2, complete sequence | 15121-15152 | 2 | 0.938 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | MN510446 | Escherichia coli strain SvETEC plasmid pSvP1_F, complete sequence | 131402-131433 | 2 | 0.938 |
CP029164_5 | 5.1|3135865|38|CP029164|CRISPRCasFinder | 3135865-3135902 | 38 | NZ_CP043437 | Enterobacter sp. LU1 plasmid unnamed | 113727-113764 | 2 | 0.947 |
CP029164_4 | 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029132-2029163 | 32 | NZ_CP022140 | Salmonella enterica subsp. salamae serovar 55:k:z39 str. 1315K plasmid unnamed1, complete sequence | 27735-27766 | 6 | 0.812 |
CP029164_4 | 4.5|2029376|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029376-2029407 | 32 | NZ_CP015205 | Rhodococcus sp. 008 plasmid pR8C2, complete sequence | 47221-47252 | 7 | 0.781 |
CP029164_4 | 4.5|2029376|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029376-2029407 | 32 | NZ_CP025960 | Rhodococcus qingshengii strain djl-6-2 plasmid pDJL1, complete sequence | 25136-25167 | 7 | 0.781 |
CP029164_3 | 3.1|2002914|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2002914-2002945 | 32 | MK448716 | Streptococcus phage Javan249, complete genome | 37558-37589 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_LT904880 | Salmonella enterica subsp. enterica serovar Typhi strain ty3-193 genome assembly, plasmid: 3 | 100861-100892 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_LT904895 | Salmonella enterica subsp. enterica serovar Typhi strain ERL12960 genome assembly, plasmid: 2 | 32455-32486 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP012254 | Cronobacter sakazakii strain NCTC 8155 plasmid pCS1, complete sequence | 90639-90670 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029647 | Salmonella enterica subsp. enterica serovar Typhi strain 311189_217186 plasmid pHCM2, complete sequence | 24048-24079 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_LT904853 | Salmonella enterica subsp. enterica serovar Typhi strain TY585 genome assembly, plasmid: 2 | 2158-2189 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NC_003385 | Salmonella enterica subsp. enterica serovar Typhi str. CT18 plasmid pHCM2, complete sequence | 69550-69581 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029859 | Salmonella enterica subsp. enterica serovar Typhi strain 343077_285138 plasmid pHCM2, complete sequence | 33422-33453 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029847 | Salmonella enterica subsp. enterica serovar Typhi strain 343078_273110 plasmid pHCM2, complete sequence | 64890-64921 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029916 | Salmonella enterica subsp. enterica serovar Typhi strain 343076_202113 plasmid pHCM2, complete sequence | 3001-3032 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029893 | Salmonella enterica subsp. enterica serovar Typhi strain 343076_252143 plasmid pHCM2, complete sequence | 30210-30241 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029876 | Salmonella enterica subsp. enterica serovar Typhi strain 343076_227128 plasmid pHCM2, complete sequence | 21803-21834 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029884 | Salmonella enterica subsp. enterica serovar Typhi strain 311189_268186 plasmid pHCM2, complete sequence | 5429-5460 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029961 | Salmonella enterica subsp. enterica serovar Typhi strain 343078_251131 plasmid pHCM2, complete sequence | 76887-76918 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029849 | Salmonella enterica subsp. enterica serovar Typhi strain 343078_211126 plasmid pHCM2, complete sequence | 9632-9663 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029857 | Salmonella enterica subsp. enterica serovar Typhi strain 343077_286126 plasmid pHCM2, complete sequence | 17697-17728 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029929 | Salmonella enterica subsp. enterica serovar Typhi strain 311189_216103 plasmid pHCM2, complete sequence | 73527-73558 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029865 | Salmonella enterica subsp. enterica serovar Typhi strain 343077_228157 plasmid pHCM2, complete sequence | 87633-87664 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_LT883154 | Salmonella enterica subsp. enterica serovar Typhi strain ERL12148 genome assembly, plasmid: 2 | 32810-32841 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029905 | Salmonella enterica subsp. enterica serovar Typhi strain 311189_231186 plasmid pHCM2, complete sequence | 30728-30759 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029851 | Salmonella enterica subsp. enterica serovar Typhi strain 343078_203125 plasmid pHCM2, complete sequence | 44207-44238 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP040567 | Salmonella enterica subsp. enterica serovar Typhimurium strain SAP17-7299 plasmid pCFSAN059543, complete sequence | 17468-17499 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029889 | Salmonella enterica subsp. enterica serovar Typhi strain 343076_294172 plasmid pHCM2, complete sequence | 17968-17999 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029891 | Salmonella enterica subsp. enterica serovar Typhi strain 343076_253155 plasmid pHCM2, complete sequence | 18575-18606 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029898 | Salmonella enterica subsp. enterica serovar Typhi strain 343077_213147 plasmid pHCM2, complete sequence | 21136-21167 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029951 | Salmonella enterica subsp. enterica serovar Typhi strain 311189_205186 plasmid pHCM2, complete sequence | 47286-47317 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029867 | Salmonella enterica subsp. enterica serovar Typhi strain 343077_228140 plasmid pHCM2, complete sequence | 33258-33289 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029901 | Salmonella enterica subsp. enterica serovar Typhi strain 343076_232188 plasmid pHCM2, complete sequence | 32901-32932 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029921 | Salmonella enterica subsp. enterica serovar Typhi strain 311189_282186 plasmid pHCM2, complete sequence | 100446-100477 | 8 | 0.75 |
CP029164_4 | 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029132-2029163 | 32 | NZ_KP453775 | Klebsiella pneumoniae strain ST11 plasmid pKP12226, complete sequence | 70246-70277 | 8 | 0.75 |
CP029164_4 | 4.10|2029681|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029681-2029712 | 32 | NC_009717 | Xanthobacter autotrophicus Py2 plasmid pXAUT01, complete sequence | 288380-288411 | 8 | 0.75 |
CP029164_4 | 4.10|2029681|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029681-2029712 | 32 | NZ_CP010657 | Phaeobacter piscinae strain P71 plasmid pP71_a, complete sequence | 57627-57658 | 8 | 0.75 |
CP029164_4 | 4.13|2029864|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029864-2029895 | 32 | NZ_CP042263 | Litoreibacter sp. LN3S51 plasmid unnamed2, complete sequence | 173595-173626 | 8 | 0.75 |
CP029164_3 | 3.1|2002914|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2002914-2002945 | 32 | NZ_CP044347 | Escherichia coli strain P225M plasmid pP225M-CTX-M-55, complete sequence | 121862-121893 | 9 | 0.719 |
CP029164_3 | 3.1|2002914|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2002914-2002945 | 32 | NZ_CP032533 | Bacillus megaterium NCT-2 plasmid pNCT2_5, complete sequence | 17401-17432 | 9 | 0.719 |
CP029164_3 | 3.6|2003219|32|CP029164|CRISPRCasFinder,CRT,PILER-CR | 2003219-2003250 | 32 | AP014399 | Uncultured Mediterranean phage uvMED isolate uvMED-CGF-C110A-MedDCM-OCT-S26-C20, *** SEQUENCING IN PROGRESS ***, 2 ordered pieces | 35411-35442 | 9 | 0.719 |
CP029164_3 | 3.6|2003219|32|CP029164|CRISPRCasFinder,CRT,PILER-CR | 2003219-2003250 | 32 | AP013383 | Uncultured Mediterranean phage uvMED DNA, complete genome, group G15, isolate: uvMED-CGR-C110A-MedDCM-OCT-S24-C13 | 29583-29614 | 9 | 0.719 |
CP029164_3 | 3.7|2003280|32|CP029164|CRISPRCasFinder,CRT,PILER-CR | 2003280-2003311 | 32 | NZ_CP021994 | Cryobacterium sp. LW097 plasmid unnamed1 | 33387-33418 | 9 | 0.719 |
CP029164_4 | 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029132-2029163 | 32 | NZ_CP011404 | Lactobacillus salivarius str. Ren plasmid pR1, complete sequence | 100759-100790 | 9 | 0.719 |
CP029164_4 | 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029132-2029163 | 32 | NZ_CP020859 | Lactobacillus salivarius strain ZLS006 plasmid unnamed1, complete sequence | 48192-48223 | 9 | 0.719 |
CP029164_4 | 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029132-2029163 | 32 | NZ_CP020859 | Lactobacillus salivarius strain ZLS006 plasmid unnamed1, complete sequence | 248214-248245 | 9 | 0.719 |
CP029164_4 | 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029132-2029163 | 32 | NZ_CP017108 | Lactobacillus salivarius strain CICC23174 plasmid pLS_1 sequence | 140326-140357 | 9 | 0.719 |
CP029164_4 | 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029132-2029163 | 32 | NZ_CP017110 | Lactobacillus salivarius strain CICC23174 plasmid pLS_3, complete sequence | 20889-20920 | 9 | 0.719 |
CP029164_4 | 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029132-2029163 | 32 | NZ_CP029460 | Clostridium novyi strain 150557 plasmid pCN2, complete sequence | 45232-45263 | 9 | 0.719 |
CP029164_4 | 4.10|2029681|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029681-2029712 | 32 | NZ_CP014942 | Rhodococcus sp. BH4 plasmid, complete sequence | 418665-418696 | 9 | 0.719 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_AP023208 | Escherichia coli strain TUM18781 plasmid pMTY18781-3, complete sequence | 12113-12164 | 10 | 0.808 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | MT230402 | Escherichia coli strain DH5alpha plasmid pESBL87, complete sequence | 272-323 | 10 | 0.808 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_AP023207 | Escherichia coli strain TUM18781 plasmid pMTY18781-2, complete sequence | 31252-31303 | 10 | 0.808 |
CP029164_3 | 3.1|2002914|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2002914-2002945 | 32 | NZ_CP045561 | Acinetobacter nosocomialis strain AC1530 plasmid pAC1530, complete sequence | 141094-141125 | 10 | 0.688 |
CP029164_3 | 3.1|2002914|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2002914-2002945 | 32 | NZ_CP014478 | Acinetobacter pittii strain AP_882 plasmid pNDM-AP_882, complete sequence | 140150-140181 | 10 | 0.688 |
CP029164_3 | 3.8|2003341|32|CP029164|CRISPRCasFinder,CRT,PILER-CR | 2003341-2003372 | 32 | NZ_CP024940 | Paraburkholderia hospita strain mHSR1 plasmid pmHSR1_P, complete sequence | 1163456-1163487 | 10 | 0.688 |
CP029164_4 | 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029132-2029163 | 32 | NC_018511 | Bacillus thuringiensis HD-789 plasmid pBTHD789-5, complete sequence | 60-91 | 10 | 0.688 |
CP029164_4 | 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029132-2029163 | 32 | NC_001446 | Bacillus thuringiensis sv israelensis HI4 plasmid pTX14-3, complete sequence | 4548-4579 | 10 | 0.688 |
CP029164_4 | 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029132-2029163 | 32 | NZ_CP009345 | Bacillus thuringiensis HD1002 plasmid 6, complete sequence | 1165-1196 | 10 | 0.688 |
CP029164_4 | 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029132-2029163 | 32 | NZ_CP053657 | Bacillus cereus strain CTMA_1571 plasmid p.1, complete sequence | 105460-105491 | 10 | 0.688 |
CP029164_4 | 4.6|2029437|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029437-2029468 | 32 | NZ_CP031468 | Paraburkholderia caffeinilytica strain CF1 plasmid p1, complete sequence | 166221-166252 | 10 | 0.688 |
CP029164_4 | 4.6|2029437|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029437-2029468 | 32 | NZ_CP014580 | Burkholderia sp. OLGA172 plasmid pOLGA1, complete sequence | 2849-2880 | 10 | 0.688 |
CP029164_4 | 4.10|2029681|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029681-2029712 | 32 | NZ_CP048816 | Caulobacter rhizosphaerae strain KCTC 52515 plasmid unnamed | 34863-34894 | 10 | 0.688 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 4115-4166 | 11 | 0.788 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 4114-4165 | 11 | 0.788 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 4114-4165 | 11 | 0.788 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 4114-4165 | 11 | 0.788 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | MT230312 | Escherichia coli strain DH5alpha plasmid pESBL31, complete sequence | 152-203 | 11 | 0.788 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 208982-209033 | 12 | 0.769 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 44-95 | 12 | 0.769 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 219476-219527 | 12 | 0.769 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 43-94 | 12 | 0.769 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP044307 | Escherichia coli strain C27A plasmid pC27A-2, complete sequence | 18332-18383 | 12 | 0.769 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 210310-210361 | 12 | 0.769 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 43-94 | 12 | 0.769 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 191265-191316 | 12 | 0.769 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 43-94 | 12 | 0.769 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP044147 | Escherichia coli O157 strain AR-0428 plasmid pAR-0428-2 | 7393-7444 | 12 | 0.769 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | CP044351 | Escherichia coli strain 194195 plasmid p194195_1, complete sequence | 84217-84268 | 12 | 0.769 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 3373-3424 | 13 | 0.75 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 3372-3423 | 13 | 0.75 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP044307 | Escherichia coli strain C27A plasmid pC27A-2, complete sequence | 15000-15051 | 13 | 0.75 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 3372-3423 | 13 | 0.75 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 3372-3423 | 13 | 0.75 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_AP023209 | Escherichia coli strain TUM18781 plasmid pMTY18781-4, complete sequence | 70-121 | 14 | 0.731 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP010208 | Escherichia coli strain M11 plasmid B, complete sequence | 30159-30210 | 14 | 0.731 |
1. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP038506 (Escherichia coli strain 28Eco12 plasmid p28Eco12, complete sequence) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
2. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP019283 (Escherichia coli strain 13P484A plasmid p13P484A-3, complete sequence) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
3. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to CP042641 (Escherichia coli strain NCYU-24-74 plasmid pNCYU-24-74-3, complete sequence) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
4. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP034821 (Salmonella sp. SSDFZ54 plasmid pTB502, complete sequence) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
5. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP027443 (Escherichia coli strain 2013C-3252 plasmid unnamed1, complete sequence) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
6. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP027575 (Escherichia coli strain 2013C-4081 plasmid unnamed2) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
7. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP027223 (Escherichia coli strain 2015C-3101 plasmid unnamed2) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
8. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP030188 (Salmonella enterica strain SA20094620 plasmid pSA20094620.3, complete sequence) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
9. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP027590 (Escherichia coli strain 2014C-3011 plasmid unnamed2) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
10. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP039862 (Salmonella enterica subsp. enterica serovar 1,4,[5],12:i:- strain PNCS014880 plasmid p16-6773.2, complete sequence) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
11. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP033632 (Escherichia coli isolate ECCNB12-2 plasmid pTB-nb1, complete sequence) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
12. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_AP018798 (Escherichia coli strain E2855 plasmid pE2855-2, complete sequence) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
13. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to CP012494 (Escherichia coli strain CFSAN004177 plasmid pCFSAN004177G_03, complete sequence) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
14. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to CP027320 (Escherichia coli strain 2014C-3084 plasmid unnamed1) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
15. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NC_013370 (Escherichia coli O111:H- str. 11128 plasmid pO111_2, complete sequence) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
16. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP026475 (Escherichia coli strain KBN10P04869 plasmid pKBN10P04869B, complete sequence) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
17. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to MN510445 (Escherichia coli strain JIE250 plasmid pJIE250_3, complete sequence) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
18. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to MN510447 (Escherichia coli strain TZ20_1P plasmid pTZ20_1P, complete sequence) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
19. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP012491 (Escherichia coli strain CFSAN004176 plasmid pCFSAN004176P_03, complete sequence) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
20. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to MH422554 (Escherichia phage P1, complete genome) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
21. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NC_050152 (Enterobacteria phage P7, complete genome) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
22. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to MH445380 (Escherichia virus P1 isolate transconjugant 2(L-II), complete genome) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
23. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NC_031129 (Salmonella phage SJ46, complete genome) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
24. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to MH445381 (Escherichia virus P1, complete genome) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
25. spacer 6.1|3774268|40|CP029164|CRISPRCasFinder matches to NZ_CP041417 (Escherichia coli strain STEC711 plasmid pSTEC711_1, complete sequence) position: , mismatch: 1, identity: 0.975
gcgctgcgggtcatttttgaaattacccccgctgtgctgt CRISPR spacer gcgctgcgggtcattcttgaaattacccccgctgtgctgt Protospacer ***************.************************
26. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP032392 (Salmonella enterica subsp. enterica serovar Dublin strain CVM 34981 plasmid p34981_2, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
27. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP022965 (Salmonella enterica subsp. enterica serovar Pullorum strain QJ-2D-Sal plasmid pQJDsal2, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
28. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NC_007208 (Salmonella enterica OU7025 plasmid pOU1113, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
29. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_HG970001 (Salmonella enterica subsp. enterica serovar Gallinarum str. 287/91 plasmid pSG, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
30. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP032386 (Salmonella enterica subsp. enterica serovar Dublin strain CVM N53043 plasmid pN53043_2, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
31. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP032388 (Salmonella enterica subsp. enterica serovar Dublin strain CVM N45955 plasmid pN45955_1, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
32. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP030208 (Salmonella enterica strain SA19992307 plasmid pSA19992307.1, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
33. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP032450 (Salmonella enterica subsp. enterica serovar Dublin strain USMARC-69838 plasmid pSDU1-USMARC-69838, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
34. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NC_011204 (Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 plasmid pCT02021853_74, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
35. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NC_019106 (Salmonella enterica subsp. enterica serovar Dublin plasmid pSD_77, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
36. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NC_019112 (Salmonella enterica subsp. enterica serovar Pullorum plasmid pSPUV, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
37. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP012348 (Salmonella enterica subsp. enterica serovar Pullorum str. ATCC 9120 plasmid pCFSAN000725_01, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
38. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP019180 (Salmonella enterica subsp. enterica serovar Dublin str. ATCC 39184 plasmid pATCC39184, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
39. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP032394 (Salmonella enterica subsp. enterica serovar Dublin strain CVM 22453 plasmid p22453_2, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
40. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NC_010422 (Salmonella enterica subsp. enterica serovar Dublin plasmid pOU1115, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
41. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP032381 (Salmonella enterica subsp. enterica serovar Dublin strain USMARC-69807 plasmid pSDU2-USMARC-69807, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
42. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP032447 (Salmonella enterica subsp. enterica serovar Dublin strain USMARC-69840 plasmid pSDU1-USMARC-69840, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
43. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_KP453775 (Klebsiella pneumoniae strain ST11 plasmid pKP12226, complete sequence) position: , mismatch: 2, identity: 0.938
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtgaaaacacgttctgaaccgacattcatgt Protospacer ***.******** *******************
44. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP017632 (Escherichia coli strain SLK172 plasmid pSLK172-1, complete sequence) position: , mismatch: 2, identity: 0.938
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtgaaaacacgttctgaaccgacattcatgt Protospacer ***.******** *******************
45. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP030285 (Escherichia coli strain E308 plasmid pLKSZ04, complete sequence) position: , mismatch: 2, identity: 0.938
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctaaaccgtcattcatgt Protospacer ****************.***** *********
46. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP036204 (Escherichia coli strain L725 plasmid punnamed2, complete sequence) position: , mismatch: 2, identity: 0.938
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctaaaccgtcattcatgt Protospacer ****************.***** *********
47. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to MN510446 (Escherichia coli strain SvETEC plasmid pSvP1_F, complete sequence) position: , mismatch: 2, identity: 0.938
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacgttatgaaccgacattcatgt Protospacer ************ * *****************
48. spacer 5.1|3135865|38|CP029164|CRISPRCasFinder matches to NZ_CP043437 (Enterobacter sp. LU1 plasmid unnamed) position: , mismatch: 2, identity: 0.947
cggacgcaggatggtgcgttcaattggactcgaaccaa CRISPR spacer cagacgcagaatggtgcgttcaattggactcgaaccaa Protospacer *.*******.****************************
49. spacer 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP022140 (Salmonella enterica subsp. salamae serovar 55:k:z39 str. 1315K plasmid unnamed1, complete sequence) position: , mismatch: 6, identity: 0.812
cattgaa-aacattgcctttattttattttttg CRISPR spacer -attaaatcacattccctctattttattttttc Protospacer ***.** ***** ***.*************
50. spacer 4.5|2029376|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP015205 (Rhodococcus sp. 008 plasmid pR8C2, complete sequence) position: , mismatch: 7, identity: 0.781
caataacgcagcatccaggaagctgtttccgc CRISPR spacer cgcgagcgcagcatcgaagaagctgtttccac Protospacer *. *.********* *.************.*
51. spacer 4.5|2029376|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP025960 (Rhodococcus qingshengii strain djl-6-2 plasmid pDJL1, complete sequence) position: , mismatch: 7, identity: 0.781
caataacgcagcatccaggaagctgtttccgc CRISPR spacer cgcgagcgcagcatcgaagaagctgtttccac Protospacer *. *.********* *.************.*
52. spacer 3.1|2002914|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to MK448716 (Streptococcus phage Javan249, complete genome) position: , mismatch: 8, identity: 0.75
acgtaacaaaacaacagcaaaatattatcgac CRISPR spacer taataacaaaacgacagcaaaagattatagtt Protospacer .*********.********* ***** * .
53. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_LT904880 (Salmonella enterica subsp. enterica serovar Typhi strain ty3-193 genome assembly, plasmid: 3) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
54. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_LT904895 (Salmonella enterica subsp. enterica serovar Typhi strain ERL12960 genome assembly, plasmid: 2) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
55. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP012254 (Cronobacter sakazakii strain NCTC 8155 plasmid pCS1, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttatca Protospacer ******** *****.******** ** .*.
56. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029647 (Salmonella enterica subsp. enterica serovar Typhi strain 311189_217186 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
57. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_LT904853 (Salmonella enterica subsp. enterica serovar Typhi strain TY585 genome assembly, plasmid: 2) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
58. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NC_003385 (Salmonella enterica subsp. enterica serovar Typhi str. CT18 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
59. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029859 (Salmonella enterica subsp. enterica serovar Typhi strain 343077_285138 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
60. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029847 (Salmonella enterica subsp. enterica serovar Typhi strain 343078_273110 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
61. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029916 (Salmonella enterica subsp. enterica serovar Typhi strain 343076_202113 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
62. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029893 (Salmonella enterica subsp. enterica serovar Typhi strain 343076_252143 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
63. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029876 (Salmonella enterica subsp. enterica serovar Typhi strain 343076_227128 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
64. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029884 (Salmonella enterica subsp. enterica serovar Typhi strain 311189_268186 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
65. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029961 (Salmonella enterica subsp. enterica serovar Typhi strain 343078_251131 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
66. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029849 (Salmonella enterica subsp. enterica serovar Typhi strain 343078_211126 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
67. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029857 (Salmonella enterica subsp. enterica serovar Typhi strain 343077_286126 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
68. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029929 (Salmonella enterica subsp. enterica serovar Typhi strain 311189_216103 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
69. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029865 (Salmonella enterica subsp. enterica serovar Typhi strain 343077_228157 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
70. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_LT883154 (Salmonella enterica subsp. enterica serovar Typhi strain ERL12148 genome assembly, plasmid: 2) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
71. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029905 (Salmonella enterica subsp. enterica serovar Typhi strain 311189_231186 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
72. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029851 (Salmonella enterica subsp. enterica serovar Typhi strain 343078_203125 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
73. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP040567 (Salmonella enterica subsp. enterica serovar Typhimurium strain SAP17-7299 plasmid pCFSAN059543, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
74. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029889 (Salmonella enterica subsp. enterica serovar Typhi strain 343076_294172 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
75. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029891 (Salmonella enterica subsp. enterica serovar Typhi strain 343076_253155 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
76. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029898 (Salmonella enterica subsp. enterica serovar Typhi strain 343077_213147 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
77. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029951 (Salmonella enterica subsp. enterica serovar Typhi strain 311189_205186 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
78. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029867 (Salmonella enterica subsp. enterica serovar Typhi strain 343077_228140 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
79. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029901 (Salmonella enterica subsp. enterica serovar Typhi strain 343076_232188 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
80. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029921 (Salmonella enterica subsp. enterica serovar Typhi strain 311189_282186 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
81. spacer 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_KP453775 (Klebsiella pneumoniae strain ST11 plasmid pKP12226, complete sequence) position: , mismatch: 8, identity: 0.75
cattgaaaacattgcctttattttattttttg CRISPR spacer tcgctaaaacattggctttatttaatttttta Protospacer . . ********* ******** *******.
82. spacer 4.10|2029681|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NC_009717 (Xanthobacter autotrophicus Py2 plasmid pXAUT01, complete sequence) position: , mismatch: 8, identity: 0.75
cgtggctgcgctggccgttgcagcagtttgat CRISPR spacer cctggccgcgcttgccgttgcagcaggaccgt Protospacer * ****.***** ************* . .*
83. spacer 4.10|2029681|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP010657 (Phaeobacter piscinae strain P71 plasmid pP71_a, complete sequence) position: , mismatch: 8, identity: 0.75
cgtggctgcgctggccgttgcagcagtttgat-- CRISPR spacer tgtggcggcgctggcagttgcagcg--ctggttc Protospacer .***** ******** ********. .**.*
84. spacer 4.13|2029864|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP042263 (Litoreibacter sp. LN3S51 plasmid unnamed2, complete sequence) position: , mismatch: 8, identity: 0.75
ccgccgtgccagtgatcctcatacggcctgtt CRISPR spacer cggtgcagccagtgatcgtcatacagcctgtg Protospacer * *. ********** ******.******
85. spacer 3.1|2002914|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP044347 (Escherichia coli strain P225M plasmid pP225M-CTX-M-55, complete sequence) position: , mismatch: 9, identity: 0.719
acgtaacaaaacaacagcaaaatattatcgac CRISPR spacer taagaacaaaacaagagcgaaatattatgcat Protospacer . ********** ***.********* *.
86. spacer 3.1|2002914|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP032533 (Bacillus megaterium NCT-2 plasmid pNCT2_5, complete sequence) position: , mismatch: 9, identity: 0.719
acgtaacaaaacaacagcaaaatattatcgac CRISPR spacer atgtaacaaaaaaacagtaaaatatggtgaga Protospacer *.********* *****.******* .* ..
87. spacer 3.6|2003219|32|CP029164|CRISPRCasFinder,CRT,PILER-CR matches to AP014399 (Uncultured Mediterranean phage uvMED isolate uvMED-CGF-C110A-MedDCM-OCT-S26-C20, *** SEQUENCING IN PROGRESS ***, 2 ordered pieces) position: , mismatch: 9, identity: 0.719
ctaggataaattaaaagacaaaattgcagcaa CRISPR spacer gtcagataaattaaaagagataattgcaaatt Protospacer * .************** * *******.
88. spacer 3.6|2003219|32|CP029164|CRISPRCasFinder,CRT,PILER-CR matches to AP013383 (Uncultured Mediterranean phage uvMED DNA, complete genome, group G15, isolate: uvMED-CGR-C110A-MedDCM-OCT-S24-C13) position: , mismatch: 9, identity: 0.719
ctaggataaattaaaagacaaaattgcagcaa CRISPR spacer gtcagataaattaaaagagataattgcaaatt Protospacer * .************** * *******.
89. spacer 3.7|2003280|32|CP029164|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP021994 (Cryobacterium sp. LW097 plasmid unnamed1) position: , mismatch: 9, identity: 0.719
gagcgaccagtatcaagatcgacaggttttgc CRISPR spacer gagcgaccagcagcaagatcgaccagacctca Protospacer **********.* ********** .* ..*
90. spacer 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP011404 (Lactobacillus salivarius str. Ren plasmid pR1, complete sequence) position: , mismatch: 9, identity: 0.719
cattgaaaacattgcctttattttattttttg CRISPR spacer tttaataaacattgccttaattttattattaa Protospacer . * . ************ ******** ** .
91. spacer 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP020859 (Lactobacillus salivarius strain ZLS006 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.719
cattgaaaacattgcctttattttattttttg CRISPR spacer tttaataaacattgccttaattttattattaa Protospacer . * . ************ ******** ** .
92. spacer 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP020859 (Lactobacillus salivarius strain ZLS006 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.719
cattgaaaacattgcctttattttattttttg CRISPR spacer tttaataaacattgccttaattttattattaa Protospacer . * . ************ ******** ** .
93. spacer 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP017108 (Lactobacillus salivarius strain CICC23174 plasmid pLS_1 sequence) position: , mismatch: 9, identity: 0.719
cattgaaaacattgcctttattttattttttg CRISPR spacer tttaataaacattgccttaattttattattaa Protospacer . * . ************ ******** ** .
94. spacer 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP017110 (Lactobacillus salivarius strain CICC23174 plasmid pLS_3, complete sequence) position: , mismatch: 9, identity: 0.719
cattgaaaacattgcctttattttattttttg CRISPR spacer tttaataaacattgccttaattttattattaa Protospacer . * . ************ ******** ** .
95. spacer 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP029460 (Clostridium novyi strain 150557 plasmid pCN2, complete sequence) position: , mismatch: 9, identity: 0.719
cattgaaaacattgcctttattttattttttg CRISPR spacer aaaaaagaacattgtatttattttatttttaa Protospacer * .*.*******. ************** .
96. spacer 4.10|2029681|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP014942 (Rhodococcus sp. BH4 plasmid, complete sequence) position: , mismatch: 9, identity: 0.719
cgtggctgcgctggccgttgcagcagtttgat CRISPR spacer ggtggcggcgctggccgctgcagcggcgggca Protospacer ***** **********.******.*. *
97. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_AP023208 (Escherichia coli strain TUM18781 plasmid pMTY18781-3, complete sequence) position: , mismatch: 10, identity: 0.808
gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer acgggcgactcgtaggcctgataagacgcgccagcgtcgcatcaggcaccga Protospacer .*. .*********************** ***************** *
98. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to MT230402 (Escherichia coli strain DH5alpha plasmid pESBL87, complete sequence) position: , mismatch: 10, identity: 0.808
-gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer ggtaacg-gtttgtaggcctgataagacgcgacagcgtcgcatcaggcattga Protospacer *.* .* ..*.************************************* *
99. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_AP023207 (Escherichia coli strain TUM18781 plasmid pMTY18781-2, complete sequence) position: , mismatch: 10, identity: 0.808
--gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer tggcaacgg--ctgtaggcctgataagacgcgacagcgtcgcatcaggcattga Protospacer *** .*. ..************************************* *
100. spacer 3.1|2002914|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP045561 (Acinetobacter nosocomialis strain AC1530 plasmid pAC1530, complete sequence) position: , mismatch: 10, identity: 0.688
acgtaacaaaacaacagcaaaatattatcgac CRISPR spacer taataaaaaaacaacagcaaaattttgaaaat Protospacer .*** **************** **. .*.
101. spacer 3.1|2002914|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP014478 (Acinetobacter pittii strain AP_882 plasmid pNDM-AP_882, complete sequence) position: , mismatch: 10, identity: 0.688
acgtaacaaaacaacagcaaaatattatcgac CRISPR spacer taataaaaaaacaacagcaaaattttgaaaat Protospacer .*** **************** **. .*.
102. spacer 3.8|2003341|32|CP029164|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP024940 (Paraburkholderia hospita strain mHSR1 plasmid pmHSR1_P, complete sequence) position: , mismatch: 10, identity: 0.688
atcgatatgtacgttagcgaggggatcacgca CRISPR spacer ggcgatatttacgttagcgacgggactgacct Protospacer . ****** *********** ****... *
103. spacer 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NC_018511 (Bacillus thuringiensis HD-789 plasmid pBTHD789-5, complete sequence) position: , mismatch: 10, identity: 0.688
cattgaaaacattgcctttattttattttttg CRISPR spacer tcgttcttacatttcttttattttattttttt Protospacer . * ***** *.***************
104. spacer 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NC_001446 (Bacillus thuringiensis sv israelensis HI4 plasmid pTX14-3, complete sequence) position: , mismatch: 10, identity: 0.688
cattgaaaacattgcctttattttattttttg CRISPR spacer tcgttcttacatttcttttattttattttttt Protospacer . * ***** *.***************
105. spacer 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP009345 (Bacillus thuringiensis HD1002 plasmid 6, complete sequence) position: , mismatch: 10, identity: 0.688
cattgaaaacattgcctttattttattttttg CRISPR spacer tcgttcttacatttcttttattttattttttt Protospacer . * ***** *.***************
106. spacer 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP053657 (Bacillus cereus strain CTMA_1571 plasmid p.1, complete sequence) position: , mismatch: 10, identity: 0.688
cattgaaaacattgcctttattttattttttg CRISPR spacer accttaaaacatttcctttgttttatttagat Protospacer .* ******** *****.********
107. spacer 4.6|2029437|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP031468 (Paraburkholderia caffeinilytica strain CF1 plasmid p1, complete sequence) position: , mismatch: 10, identity: 0.688
cgatcggtgaagaggtccgcgaaatactcact CRISPR spacer taatcggtgacgacgtccgcgaaatcggagat Protospacer ..******** ** *********** . *
108. spacer 4.6|2029437|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP014580 (Burkholderia sp. OLGA172 plasmid pOLGA1, complete sequence) position: , mismatch: 10, identity: 0.688
cgatcggtgaagaggtccgcgaaatactcact CRISPR spacer taatcggtgacgacgtccgcgaaatcggagat Protospacer ..******** ** *********** . *
109. spacer 4.10|2029681|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP048816 (Caulobacter rhizosphaerae strain KCTC 52515 plasmid unnamed) position: , mismatch: 10, identity: 0.688
cgtggctgcgctggccgttgcagcagtttgat CRISPR spacer cgtggctgcgctggccgtggccgcttgccagg Protospacer ****************** ** ** ....
110. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 11, identity: 0.788
-gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer ggcac-aagtttgtaggcatgataagacgcgccagcgtcgcatcaggcatctg Protospacer **** .*..*.****** ************ *****************
111. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 11, identity: 0.788
-gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer ggcac-aagtttgtaggcatgataagacgcgccagcgtcgcatcaggcatctg Protospacer **** .*..*.****** ************ *****************
112. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 11, identity: 0.788
-gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer ggcac-aagtttgtaggcatgataagacgcgccagcgtcgcatcaggcatctg Protospacer **** .*..*.****** ************ *****************
113. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 11, identity: 0.788
-gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer ggcac-aagtttgtaggcatgataagacgcgccagcgtcgcatcaggcatctg Protospacer **** .*..*.****** ************ *****************
114. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to MT230312 (Escherichia coli strain DH5alpha plasmid pESBL31, complete sequence) position: , mismatch: 11, identity: 0.788
gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer gctcgggtctngtaggcctgataagacgcgtcagcgtcgcatcaggcttcaa Protospacer ** * *. ** ******************* **************** .
115. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 12, identity: 0.769
----gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer aaaaacagtgg----gtaggcctgataagacgcgtcagcgtcgcatcaggcatctg Protospacer .** **. ******************* *****************
116. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 12, identity: 0.769
-gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer cgtac-agatttgtaggcatgataagacgcgtcagcgtcgcatcaggcatcag Protospacer *.** ..*.*.****** ************ ***************** .
117. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 12, identity: 0.769
----gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer aaaaacagtgg----gtaggcctgataagacgcgtcagcgtcgcatcaggcatctg Protospacer .** **. ******************* *****************
118. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 12, identity: 0.769
-gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer cgtac-agatttgtaggcatgataagacgcgtcagcgtcgcatcaggcatcag Protospacer *.** ..*.*.****** ************ ***************** .
119. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP044307 (Escherichia coli strain C27A plasmid pC27A-2, complete sequence) position: , mismatch: 12, identity: 0.769
-gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer cgtac-agatttgtaggcatgataagacgcgtcagcgtcgcatcaggcatcag Protospacer *.** ..*.*.****** ************ ***************** .
120. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 12, identity: 0.769
----gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer aaaaacagtgg----gtaggcctgataagacgcgtcagcgtcgcatcaggcatctg Protospacer .** **. ******************* *****************
121. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 12, identity: 0.769
-gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer cgtac-agatttgtaggcatgataagacgcgtcagcgtcgcatcaggcatcag Protospacer *.** ..*.*.****** ************ ***************** .
122. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 12, identity: 0.769
----gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer aaaaacagtgg----gtaggcctgataagacgcgtcagcgtcgcatcaggcatctg Protospacer .** **. ******************* *****************
123. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 12, identity: 0.769
-gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer cgtac-agatttgtaggcatgataagacgcgtcagcgtcgcatcaggcatcag Protospacer *.** ..*.*.****** ************ ***************** .
124. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP044147 (Escherichia coli O157 strain AR-0428 plasmid pAR-0428-2) position: , mismatch: 12, identity: 0.769
---gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer agtgcagt---tttgtaggcatgataagacgcgccagcgtcgcatcaggcatccg Protospacer *** * .*.****** ************ *****************
125. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to CP044351 (Escherichia coli strain 194195 plasmid p194195_1, complete sequence) position: , mismatch: 12, identity: 0.769
---gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer agtgcagt---tttgtaggcatgataagacgcgccagcgtcgcatcaggcatccg Protospacer *** * .*.****** ************ *****************
126. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 13, identity: 0.75
gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer ttcgcgggctcgtaggcatgataagacgcgccagcgtcgcatcaggcacctg Protospacer . .*..********* ************ *****************
127. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 13, identity: 0.75
gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer ttcgcgggctcgtaggcatgataagacgcgccagcgtcgcatcaggcacctg Protospacer . .*..********* ************ *****************
128. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP044307 (Escherichia coli strain C27A plasmid pC27A-2, complete sequence) position: , mismatch: 13, identity: 0.75
gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer ttcgcgggctcgtaggcatgataagacgcgccagcgtcgcatcaggcacctg Protospacer . .*..********* ************ *****************
129. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 13, identity: 0.75
gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer ttcgcgggctcgtaggcatgataagacgcgccagcgtcgcatcaggcacctg Protospacer . .*..********* ************ *****************
130. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 13, identity: 0.75
gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer ttcgcgggctcgtaggcatgataagacgcgccagcgtcgcatcaggcacctg Protospacer . .*..********* ************ *****************
131. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_AP023209 (Escherichia coli strain TUM18781 plasmid pMTY18781-4, complete sequence) position: , mismatch: 14, identity: 0.731
gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer ggttcaggtccgtaggcatgataagacgcgtcagcgtcgcatcaggcatcgg Protospacer * .......******* ************ ***************** *
132. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP010208 (Escherichia coli strain M11 plasmid B, complete sequence) position: , mismatch: 14, identity: 0.731
gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer gggtgcagattgtaggcatgataagacgcgtcagcgtcgcatcaggcatcag Protospacer * .. *. *.****** ************ ***************** .
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
108496 : 123827
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP029164|108496:123827|DBSCAN-SWA AATGAGCAGAAAAACCCAACGTTACTCTAAAAAGTTCAAAGCCGAAGCTGTCAGAACGGTTCTTGAAAATCAACTTTCGATCAGTGAAGGCGCTTCCCGATTATCTCTTCCTGAAGGCACTTTAGGACAATGGGTTACCGCCGCCAGAAAAGGGCTCGGTACTCCTGGTTCCCGCACGGTGGCTGAACTGGAATCTGAAATTCTGCAACTGCGTAAGGCGTTAAATGAAGCTCGCCTTGAGCGAGATATATTAAAAAAAGCAACTGTAGATTTTGCACAGGAGTCGCTGAAAAATACGCGTTAATCGAACAATGGCGACAACAATTTCCCATTGAAGCGATGTGTCAGGTATTTGGTGTATCCAGGAGCGGTTATTACAACTGGGTACAGCATGAACCCTCAGACAGAAAACAAAGTGATGAGCGGCTAAAACTGGAGATTAAGGTGGCACATATCCGCACTCGCGAAACATATGGAACCCGGCGGCTCCAGACGGAGCTGGCAGAGAATGGCATCATCGTTGGTCGTGACCGACTGGCACGTCTTCGTAAGGAGCTAAGGCTACGCTGTAAGCAGAAACGCAAGTTCAGAGCGACTACGAACTCGAACCACAATCTGCCAGTTGCGCCAAATCTGCTGAACCAGACGTTCGCTCCTACAGCACCAAATCAGGTCTGGGTGGCGGACCTGACGTATGTTGCCACACAGGAGGGATGGTTGTACCTCGCTGGCATCAAAGATGTTTATACGTGCGAAATTGTCGGCTACGCCATGGGAGAGCGCATGACAAAAGAGCTGACAGGTAAAGCCCTGTTTATGGCGCTCAGGAGCCAGCGCCCACCTGCCGGGCTAATCCACCACTCTGATCGAGGTTCACAGTACTGCGCATACGATTACCGGGTCATACAGGAGCAGTTTGGTCTGAAAACATCAATGTCGCGTAAAGGTAACTGTTACGACAACGCTCCGATGGAAAGCTTCTGGGGAACGCTGAAAAATGAGAGCCTGAGCCACTATCGTTTTAATAACCGGGATGAAGCCATCTCAGTAATACGGGAATACATTGAGATTTTCTACAATCGTCAGCGTCGTCACTCTCGTCTGGGGAATATCTCCCCGGCAGCCTTCAGGGAAAAATATCATCAGATGGCTGCTTAAAAAAAGAACAAATGGTAGTGTCCGCTATTGCCAGTACACCTCAACATTCCACCATGCATTCCGATTAACGCCGCATAGCCAGTTGAACTTTGCTACTTTGTGAGAGGTAGTACCTTCTATCCAGTGCGAATTTAATTAATGGAATAAATGATTATGAGTGAAAATGATACAATCCCAAAGAAGTCTACAAGTCAGATTAACAAAGCGGTATTCTTTACATCTGCTTTGCTAATTTTCCTTCTTGTCGCCTTTGCCGCCGTATTCCCGGATGTCGCCGACAAAAATTTTAAACTACTTCAGCAACAAATCTTCACGAATGCCAGCTGGTTCTACATCCTTGCTGTGGCCCTGATTTTACTGAGTGTCACGTTCCTTGGACTCTCACGCTACGGTGATATCAAGCTGGGCCCGGACCATGCGCAGCCTGATTTCAGCTACCACTCCTGGTTTGCGATGCTTTTTTCGGCAGGGATGGGGATCGGCCTGATGTTCTTTGGCGTTGCCGAACCTGTAATGCATTATCTTTCGCCACCCGTTGGCACTCCAGAAACCGTTGCGGCAGCCAAGGAAGCAATGCGTCTGACCTTTTTCCACTGGGGACTGCACGCATGGGCAATTTATGCCATTGTGGCGCTGATTCTGGCGTTCTTCAGTTACCGTCACGGTCTGCCTTTAACTCTGCGCTCCGCACTCTATCCCATTATTGGCGATCGCATATACGGACCTGTAGGACATGCGGTTGATATTTTCGCTGTTATAGGCACGGTCTTTGGCGTTGCGACATCACTGGGTTACGGTGTTTTGCAGGTGAATGCCGGTTTGAACCATCTTTTCGGGGTGCCCATCAATGAAACGGTGCAGGTAATTCTGATCGTGGTCATCACGGGGTTAGCGACGATTTCAGTGGTGTCCGGTCTGGATAAGGGAATACGTATCCTGTCTGAACTCAATCTGGGTCTGGCTTTGTTGCTCCTGGCGCTGGTCCTGTGTCTGGGACCAACCGTGCTTCTGCTGAAGTCATTTGTGGAAAATACGGGCGGTTATCTTTCGGAACTGGTGAGTAAAACGTTCAACCTTTACGCGTATGAGCCCAAGTCGAGCAACTGGCTGGGGGGCTGGACATTACTGTACTGGGGATGGTGGCTTTCATGGTCGCCGTTTGTGGGGATGTTCATCGCACGGGTCTCCCGCGGGCGAACCATTCGCGAGTTTGTCACCGGCGTGCTGTTTGTTCCCGCGGGTTTTACGCTAATGTGGATGACGGTGTTTGGTAACAGCGCGATCTATCTCATTATGAACCAGGGGGCCACAGACCTCGCCAATACTGTTCAGCAGGATGTGTCGCTGGCCCTGTTTAATTTCCTGGAGCATTTCCCGTTCTCTTCTGTGCTGTCATTCATTGCAATGGCGATGGTCATCGTCTTCTTTGTAACGTCTGCTGATTCGGGGGCAATGGTTGTGGATACTCTGGCATCAGGTGGAGTGGCAAACACACCCGTCTGGCAGCGAATATTCTGGGCCTCGCTCATGGGCATTGTTGCAATTGCGCTTCTCCTTGCCGGAGGGCTAAGTGCGCTGCAAACGGTGACAATAGCGAGTGCATTGCCCTTCTCAGTGATCTTACTAATATCCATATACGGACTTTTAAAAGCTTTGCGCCGGGATTTGACCAAGCGTGAAAGCCTGAGCATGGCGACAATTGCTCCTACGGCTGCACGTAACCCAATTCCTTGGCAGAGAAGGTTACGCAATATCGCGTATCTGCCGAAGCGATCTCTTGTGAAACGTTTTATGGACGACGTTATCCAGCCCGCCATGACGCTGGTTCAGGAGGAACTGAACAAGCAGGGGACGATAAGCCACATTAGTGATGCAGTCGACGATCGTATTCGTCTTGAAGTCGATTTGGGCAACGAGCTGAATTTTATATATGAAGTGAGGCTTCGCGGGTATATCTCACCGACCTTCGCGCTCGCCGCAATGGATAATGATGAGCAGCAGAGTGAACAACATCGATATTATCGCGCTGAGGTTTATCTCAAAGAAGGCGGTCAAAATTATGATGTGATGGGCTGGAACCAGGAACAGCTGATTAATGACATACTGGACCAGTACGAAAAACACCTGCACTTCCTGCACCTGGTTCGTTAATAGCAACATGCCGTCCTGGGGGCGGCAATTATTATCTCGGCCGCAATATGAGGGAATGCAGAATGATTTCACGCTGGAAATGGATGCTGAAGCAGACAATTAAAAAACTATGGTTCAGGGCAACGTTATTCGCAATTGTCGCGATAATAACGGCCCTTTTATCAATTCTTTTTAAATCAATGATACCTGAGTCGGTTTCCGTGAAGGTTGGTGCGGAAGCAGTCGATAACATTCTGAACATACTGGCATCGAGTATGCTGGCAGTGACCACATTTTCGCTGAGTATCATGGTCACAGCCTACGGTTCAGCCACTACTAATGTGACTCCCAGAGCTACGCGTTTAGTTGTTGAAGACGTCACCACACAAAATGTACTGGCCACCTTCATCGGTTCTTTTCTCTTGAAGTGGTCAACAAAAACTGGCCACCGCGTTAGAGTTTTTCCAGTATCGATTTTCCGATTCGTTTGGGGTAACCCACCGTTATATTCGTGCGGTCTTAGTGCGCTGTAATATCCAACGATATAGTCCGTTATGGCGTGAGCTGCCTCGCTGAAGCTTACGTAACCCACCACCGGCATCCATTCGTTCTTCAGACTCCTGAAGAAGCGTTCCATTGGGCTGTTATCCCTGCAGTTTCCGCGCCGGCTCATACTCTGTCTGATCTGGTATCGCCACAATAACTGCCGGAACTGCCTGCTCGTATAATGACTGCCCTGATCGCTGTGGAACATCACCCCGCCGGGCTTACCACGGGTTTCCCATGCCATTTCCAGCGCTTTCATGGTGAGCCTGCTGTCCGGCGAGAACGACATGGCCCAGCCCACTGGTTTTCTTGCGAACAGGTCGAGAACAACGGCGAGGTACGCCCAGCGCTTACCCCTCCAGATATAGGTCACATCACCGCACCACACCTGATTTGGCTCGGTCACGGCGAACTGCCTTTCAAGGTAGTTAGGGATAGCAACATGTTCATGACCACCACGTTTATACCGGTGAGTCGGCTGCTGACAGCTGACCAGCCCCAGCTCTTTCATGAGCCTGCCAGCAAGCCAGCGTCCCATCTGGTAGCCTCTCCGGGTTGCCATTGTGGCGATGCTTCTTGCTCCGGCCGAACCGTGGCTGATGCCATGTAGCTCAAGTACCTGACTGCGTAATACAGCCCGTCTGCCGTCTGGTTTTTCAGGACGGTTTTTCCAGTATCTGTAGCTGCTGCGATGAACCCCGAACACATGGCAGAGTGTGACCACAGGATAATGCGCTCTGAGTTTCCCGATTATCGAGAACTGTTCAGGGAGTCTGACATCAAGAGCGCGGTAGCCTTTTTAATATTTCATTCTCCATTTCAATGCGTTGTAGCTTTTTCCTCAGCTTACGTATTTCGATTTGTTCTGGTGTTATCGGAGAGGCTTTTGGTGTTTTGCCCTGACGCTCATCACGCAGTTGTTTGACCCATCTTGTCATTGTGGAAAGGCCAACATCCATAGCTTTCGCGGCATCTGCCACCGTGTATTTCTGGTCAACAACCAGTTGAGCGGATTCGCGTTTAAACTCTGCGCTGAAATTTCTTTTTTTCATTGGAGCACCTGTGTTGTTCTGAGGTGAGCATATCACCTCTGTTCAGGTGGCCAAATTCAGTGTGCCACTTCAGCCCTGATTCGACTTTTACAATGACTTATCTGGACGCATGAAGAACGTCAGGAATATACCAATAACAAGCAATGCGACGGAACCGTAGAATGGTAAACTCCAGTTGCCTGTTTTGTCGATAATAATGCCAAAGGCGATAGGTGAAATAATGTCGGCGACAGCGGAACCGGCGTTCATCAATCCGCTGGCAATTCCCACATACTTCGGCGTAATATCCATCGGGACAGCCCGGATAGGCCCAATGGTCAGCTCATCGGCCAGCAATGTCGTGAACTGTGAGACGAAACAGCGTACACAGTTTGAATGTATCTACTTTTCCCAATACTGGGCGAAGGGAGATTTTATTGCGAAGCGCGCGCCAATTGGTCAGTGGGAACCTTATTCGGAAGAGTCACTACTTGGCATCATCGTCACCAGCGTCTGTCGTATTAAAGTCGCGATGCTAAAACCTGAACCTCCCAGAGATCCCCATATCCCGCTGATGGGGGATTTTAACTAAAACTTAGCGCTATCGCTGCGTATCGTAACTATCCCGTATTACGTAGCGTAAATTATGACGCCATATGGGGAACAAATACACAGAGGAAGAAAAAGATTAGATTGGTTTTAAGGGATGTTTTTATCATAAATGGCACTGCTAAAATTATGAGATTATCGAAGATAAAATTCTTGATCATACATAAAAGGCCTCCTGAATTAAATCAAGAGGCGTATAGTTTTTACATTACTGATGTATGTTTAATGATCTCACTGATTTTCACTTTCCTGTCCTGCGATCTTGAAAGAGTGGCAGCATCAGCAGTAGCAATCGCCGACATGGCCGCCTCACCATTGAGAAGGTCAATATAATCTTCTTCAGGTTTTGCACCACAGAGGATATTATGGAGGAATAACGTCTCCTTTCTTATTAAACTGGCAAGCCATAATGGTGTTTTTTTTCCTGGATGACCATATGCTATAGCGCCATCCATTTCTGAGGTCATATTGCCTTTCCGACGATCATCATCTTCTTCTTGTGTTTCATGGACCAAAAAATGCTTTGTCTGACCGCCAATCCTAAGTGACCCTGCTGTTTCTTGCATATCAATTTTAATAGAGCCTTTAGTTCCATTGATGATGACATAATGTTCCGGCCAGTTAAATGCACTCCCCCACTCTAAGGTTGCTAGTTTTCCTGACGGGAATTCCAAGGTCATAAATAACATATCATCTTCATTGCCAAATCCTGGACCAGAATGGGCCAAATTTCCACCAATCATAGTAACCGTCTCTGGTATTTCTCCAAGTAAATGCTGAACACAATCTAACTCATGTATATGATGATATAGATGTCCACCAGATTGTTCTTTCATCTTTTTCCAGGAAAGTCTCTCTTGTTTGTTTTCCCAGCCATTTCTCTTAGTATGACATGATAATATTTCGCCGATAACACCTTCTTTAATTAACTTCCGTGCATATTGAACCCCATTGAAAAAATTCATAATATGCCCGGCCATAAAGGTCACACCAGCTTCTTTACACGCTTTGACCATATCCACACAATCTTCATAACTTAATGCAATTGGTTTTTCACAAAAAACATGCTTCTTATTCTTTGCTGCTTTAATTACTGGTTCTTTATGCAGATAATTTGGGGTGGCTACGATCACGCAATCGACTAATTTACTTGAGACTAAAGCATCCAAGCTTGACATATTGATACACTGCAATTCACGGGCAATATTTTCTCCATTTTCAGGATCGTATACACATGTAATTTTTGCATTATCATGCATATTCATAAAACGAGCTAATTCAGCGCCAAAGTATCCAACACCAACAACGCCATAATTAATCATAAAGCCTCCAATCATTTAGCCACGGATAGTTTATAAATTTTACCTGGAATATCAAAACCAACTAACAAAATTAATAAGGCAGAGAATGCAACCGTAACAATGAATAATGAAACACCTAAGCCATAATATCCTGAAATGTATGTAGCTAATACAGGTGCGGCCATTCCTCCAGTTGCCCCTAAGTTATAAATAAGACCGGTCCCTAATCCTCTTAATTTTGTTGGAAAGTAATCATATATAAATTTTGGAACCAACCCTGCAATACCTAAATTTGTAAACATTAATCCAAAGAGACATAATCCTATAAGAGAAGAGTTTTTCACAGAAATAAAAAAAAGAGGACAAAGGAAAATAAATGAAGTTATTAGACCGACTACAAAGGCTTTTTTTACACCAATCTTATCACCAACAAAACCAAAAAATATTGTACCTGTCAGTGTTCCTAAACCTGCTATTGTCATCAGAGTTGAAATGACCACTGTATTAACTCCATTATCTGCCAGGTAGGAAGGAAGTAGTCCGTTTATCGGCCAGTTTGCACCAAATAGACAAAAACAGACGAGGAAAACGATCATAGAGATTGAAAGATGTGGTTTTCTGAAGACAGACAAAAATGTTGATTTATCCTTATATTTATCTTCAATCCACTCCTGACTTTCTGGAGCACTTTTTCTGATCCAAAGAACTAGTAAAACTGGTAACAGGCCTATAAAAAAAGAGTTTCTCCATCCATATACTTCAGCAAACTGAGGGATTATTTGTGCCGCAATAATATTTCCAACAGAAAAACCACTTACCAAAAAAGCACTAGCTTTAGATTGAAGATTTTTAGGCCAACTTTCTACCGCATAAGTTGAAGCACATGCATATTCACCAGACATCCCTAAGCCAACAATAAAACGGCAAACTGCGAGCATATATAAGTTTGTAGCAATACCGCTAAGGCCTGTTCCGACTGAGTAAATGAAAATTGCCCACATCATCATTGGCTTACGACCATATTTATCAGCCATGGCACCAAAAAAACCACCTCCAATAGGTCTGGCTATGAAGGCCACTGTCCCTATTAAAGTAGCCTGAATATCCGTAATGCCAAGATCTGCTTTTATAATATGAAGAATGTAAAATATCATCATAAAATCAAAGCCATCAAATACATATCCAAGCCATGCGGAAAAAAGAGCTTTCCGTTGTGGTGGATTAACTTGTTTATACCATGCTGTTGCCATATTTGTTTCCTGTGTTATTCTAATATAGCCCTTAAAGTAAATATTAGGGAGCCTCAACAATATCCAGAAATAATTATATTGTCTTCAAATACGCCAGTACTATTGATTGTTATCAATACCTAATCAGTACAATCTTTTCCTCCATTTCTTCTTAACATATTTCACTCTATGAAAGATAAAACTGATACCCACCTACAGTTTTGGGTCACCAACAAAAAACAATGTTCGCCAGATATTCCACCACTGGCACAGTTCAACCCTGCTTCAAAAATTGACTTATTCATTGTGCTCATTTTCGTAGTCCGTTCAGGTTGAATAAAACTCCCTTCGTCAAATGAGGAAAGACTGAGAGGTTATTACTTTAGCTTACGTTATAGCTGTTTTCCTTTGCAGTATTTCTTTGCTTTTCTGTATACCTTTATACCTGTTATACCAGATCAAAAAACAAGCAATCCACATAACAAAACGCGTTTTGTTACTGATGTCACAAATTGAGCATATTTTTGTAGCTGATAGTTTGTTACAAACACCAGAACCAAGATTAATGCCGATCAGTTAAGGATCAGTTGACCGATCCAGTGGCTGTGTAAGAATCCGGAAACGCTCACTTGTTTCCGGATTTTTTTATGCACATTGGACAGGCTCTTGATCTGGTATCCCGTTACGATTCTCTGCGTAACCCACTGACTTCTCTGGGGGATTACCTCGACCCCGAACTCATCTCTCGTTGCCTTGCCGAATCAGGTACTGTAACGCTACGCAAGCGCCGTCTTCCCCTCGAAATGATGGTCTGGTGTATTGTTGGCATGGCGCTTGAGCGTAAAGAACCTCTTCACCAGATTGTGAATCGCCTGGACATCATGCTGCCGGGCAATCGCCCCTTCGTTGCCCCCAGTGCCGTTATTCAGGCCCGCCAGCGCCTGGGAAGTGAGGCTGTCCGCCGCGTGTTCACGAAAACAGCGCAGCTCTGGCATAACGCCACGCCGCATCCGCACTGGTGCGGCCTGACCCTGCTGGCCATCGATGGTGTGTTCTGGCGCACACCGGATACACCAGAGAACGATGCAGCCTTCCCCCGCCAGACACATGCCGGGAACCCGGCGCTCTACCCGCAGGTCAAAATGGTCTGCCAGATGGAACTGACCAGCCATCTGCTGACGGCTGCAGCCTTCGGCACGATGAAGAACAGCGAAAATGAGCTTGCTGAGCAACTTATAGAACAAACCGGCGATAACACTCTGACGTTAATGGATAAAGGTTATTACTCACTGGGACTGTTAAATGCCTGGAGCCTGGCGGGAGAACACCGCCACTGGATGATACCTCTCAGAAAGGGAGCGCAATATGAAGAGATCAGAAAACTGGGTAAAGGCGATCATCTGGTGAAGCTGAAAACCAGCCCGCAGGCACGAAAAAAGTGGCCGGGACTGGGAAATGAAGTGACTGCCCGCCTGCTGACCGTGACGCGCAAAGGAAAAGTCTGCCATCTGCTGACGTCGATGACGGACGCCATGCGCTTCCCCGGAGGAGAAATGGGGGATCTGTACAGTCATCGCTGGGAAATCGAACTGGGATACAGGGAGATAAAACAGACGATGCAACGGAGCAGGCTGACGCTGAGAAGTAAAAAGCCGGAGCTTGTGGAGCAAGAGCTGTGGGGTGTCTTACTGGCTTATAATCTGGTGAGATATCAGATGATTAAAATGGCGGAACATCTGAAAGGTTACTGGCCGAATCAACTGAGTTTCTCAGAATCATGCGGAATGGTGATGAGAATGCTGATAACATTGCAGGGCGCTTCACCGGGACGTATACCGGAGCTGATGCGCGATCTTGCAAGTATGGGACAACTTGTGAAATTACCGACAAGAAGGGAAAGGGCCTTCCCGAGAGTGGTAAAGGAGAGGCCCTGGAAATACCCCACAGCCCCGAAAAAGAGCCAGTCAGTTGCTTAACTGACTGGCATTACAGAACCAAGATGCCTATTTCGTTTCAGCATCAACTGTTAAATTATCTGGTTGCCAACGAGAATACTGAGCTTTTTTGTCCCCAAGACTGACCGTCAACGCCACAATCTTTGACTGTTTGACATTGTCATGCCCGCAGGCCAGCGGATCTAAAGACATTAGACTGCTTGCATCAGCCAGTTGCGTATCTGGCTATGTCCCTTAATTGGTAAGGGTTCGAGGAACTCAACATTAACGCATTTTTTAGCGCATCACCGACAGTGTATATGAACGATTGCACTCTGACGATGGTTGATTTATTTGGTTGCGTAGGGGCTAGCTAGTCTGGTACCGGAGGCTAGATCTGAAGTACTGACCCCAGGAATAACGCCGGGCTAACTCATATTCTGACCCATTGTCAGCCCGCGTGTTTACTGCTCTAGCTCATTTCCTGTTGTCAGGCGTTCAATTCCCAACTGCAGGTTATGGCCCGGTGCACAAACACCTGACGGCAGGTATCACTTACCCGAAGAAGGTAAAGCCCTCAAGAGGCAGTAACTCAGCCAACCTCTCCTCCGGCCACTCCGGCAGACGCGTCAGGACGTCCGTCAGCCAAGCATGTGGCTCCAGACTGCGGTTCCCAGAAGGCTCATTATCTGTGCCGCGCGGTTCCCGGCCACCAGTGAACCAGCGAACAACCAGGCCTTTCGTCCCATAACGACCGGACGGATAGCACGCTCGCAGATATTGTTGTGTGTGCCTTGAAATGCACGATCTTTCCGTCGGTGAAAGTCCGCCCCGGCTAACTCTCGCTCCGACTGGAAGGACAAGAAGCGGTCCAGGAGGTAACAAAAGGGCTGAATCTCTCCCTGAAACGACTTCTTCAGGAAGTCAACGGGTTATGCGGGCCGTAACGTGAGTGAACACTGAGCAGCCATCGAAATGCCGTCGAGGGTGCTGACACGCCACAATAACGGGGAAGGCCGACGATGACCGGGAAGAGAGTGAGAAAGGCATCGGTCATACCCACCGGGGTTGTGGTGTCAGCATGCATAACAATTGATCGATCGTGCAACAAGAGAAATCCATAACGGTGTCTGTCCGGGTGACGGGCAACAACGCCTCGCGAGGGGGTGAATCGGTTGTTATGGATGGCAGATGGGGTGTAGTAGTGAAAATACTGAGTAATGTCAGAGGAGCGAAGGCCCCCTCTTGCAGTACCAGTGTAACATGCAGGAAGAGCCCGGCGATTGCCATCAGCCTCACAACCCCGCTAGAACCGGTTCAGAACCTACAGAAAACACTACAGGAGAAAGCAAAGTGAACCGCCCCGTGTTTCCTGTTTATCATTTTCTGGTCAGTGCCGCGATTCTGGTATTCGTGGTGATTTTCTGGCGGACACACCATCGCGACCACCGTAACTGGCTGGCTCTACGGCTATTCGTATTATGCTCTGTAAACCGTTGGCCATTGCGGATGGTTAAAGGAACAGTTGTGGGGACAACTGACACATTACGTGAAATGCAGCGTTATGAACAACTGAGTAGCACGGGGCTGACAACTGGAAAATCCAGCCGGGAGCACCGTTATATGATACGATTGTTATTGTCACTGGTGAGAGTGTGCGCAGGGATTATATGTCAGTGTATGACTATCCCGTACCAACCACACCGTGGCTGAATACGGCACCCGGTTTATTTATTGACGATTATACCTCGACAGCCTCCAGTACAGTGTCTTCCCTGAGCCGGACACTGATTTATGACTATGAGCAGAACCCTGATTCCGGCAACAATGTGGTGGCGCTGGCAGCAAAAGCAGGATACAGCACATGGTGGATATCCAATCAAGGAAAACTGGGGGGCATGACACACGCATCTCTGTTATTGCTTCTGATGCGGAGCATGCCACTTTCCTCAAGAAAGGCAGCTTCGCTTCCCGTAAAACAGATGACAAACTGTTGTTACAGGAAACAGAACGTGCGCTGGCGGATACATCCTCTCCGAAGATAATTTTCCTGCACATGATGGGTTCTCATCCAAATCCGTGTGACAGCCTTAACTCCTAGCCGAATAATTACCTGGAGCAGTATCCCCGAAAAATTGCCTGTTACCTCGCCAGCATCAGTAAACTGGATAACTTTCTTGGCCAGCTTGATGGTATCCTTCGCCGGTACTCACGTCATTTCGCCATGCTTTACTTTTCTGGCCTTGGGCTGTCGGTCAGCGACAGTGCCAATCCTGTTCATCATTATGGTCATGTGCAGGGAGGCTACAGTGTGCCACTGATTATTACCGCCAGTGACATAACGTCTCATCAGCCCGTCAGTAGAAAAATCAGTGCCCGTCATTTCGCAGGTATTTTTCAGTGGATGACCGGTATTTGTACTGAAAATATACCGCCATTCAATCCGCTGACAGACGAAGATAACTAACCTGTTATGGTTTTTAAGGGAGAGAGGAATATACCGGCAGACAGTTTGAAACCTCAGCCACTTATTCTTCCTGATCACAGGTAATCATATATGGCAGACTGTAATACAGTCGCTTGTACTGAAAAGTATTCTGTATAAAATATCCGTATTCATATCAGCACAAGGGACCATCAAATCCCTGCAGCACTCCTTACGTCAGTTATCCCGCATCACCCTGTGAGTACAGGATTTTTTATGAGGTTACTAGACTGGCCCCCTGAATCTCCAGACAACCAATATCACTTATTTAAGTGATAGTCTTAATACTAGTTTTTAGACTAGTCATTGGAGAGCAGATGATTGATGTCTTAGGACCGGAGAAACGCAGACGGCGTACCACACAGGAAAAGATCGCAATTGTTCAGCAGAGCTTTGAACCAGGGATGACGGTCTCCCTCGTTGCCCGGCAACATGGTGTAGCAGCCAGCCAGTTATTTCTCTGGCGTAAGCAATACCAGGAAGGAAGTCTTACTGCTGTCGCCGCCGGAGAACAGGTTGTTCCTGCCTCTGAACTTGCTGCCGCCATGAAGCAGATTAAAGAACTCCAGCGCCTGCTCGGCAAGAAAACGATGGAAAATGAACTCCTCAAAGAAGCCGTTGAATATGGACGGGCAAAAAAGTGGATAGCGCACGCGCCCTTATTGCCCGGGGATGGGGAGTAAGCTTAGTCAGCCGTTGTCTCCGGGTGTCGCGTGCGCAGTTGCACGTCATTCTCAGACGAACCGATGACTGGATGGATGGCCGCCGCAGTCGTCACACTGATGATACGGATGTGCTTCTCCGTATACACCATGTTATCGGAGAGCTGCCCACGTATGGTTATCGTCGGGTATGGGCGCTGCTTCGCAGACAGGCAGAACTTGATGGTATGCCTGCGATCAATGCCAAACGTGTTTACCGGATCATGCGCCAGAATGCGCTGTTGCTTGAGCGAAAACCTGCTGTACCGCCATCGAAACGGGCACATACAGGCAGAGTGGCTGTGAAAGAAAGTAATCAGCGATGGTGCTCTGACGGGTTCGAGTTCCGCTGTGATAACGGAGAAAAACTGCGAGTCACGTTCGCGCTGGACTGCTGTGACCGTGAGGCACTGCACTGGGCGGTCACTACGGGCGGCTTCAACAGTGAAACAGTACAGGACGTCATGCTGGGAGCGGTGGAACGCCGCTTCGGCAACGAGCTTCCGGCGTCTCCAGTAGAGTGGCTGACGGATAATGGTTCATGCTACCGGGCTAATGAAACACGGCAGTTTGCCCGGATGTTGGGGCTTGAACCGAAGAGCACGGCGGTGCGGAGTCCGGAGAGTAACGGCATAGCAGAGAGCTTCGTGAAAACGATAAAGCGTGACTACATCAGTGTCATGCCCAAACCAGACGGGTTAACGGCAGCAAAGAACCTTGCAGAGGCGTTCGAGCATTATAACGAATGGCATCCGCATAGTGCACTGGGTTATCGCTCGCCACGGGAATATCTACGGCAGCAAGCCAGTAATGGGTTAAGTGATAACAGGTGTCTGGAAATATAGGGGCAAATCCAGTTACTTTTTTATTGAATGTATATAAATAATGAATAACCGCGGATTCTGATAATCCCCCCTGCAGGGAGCGGTAAATAGTAAATACATCAGTTAATACCGTTTGTCTTTTTTAACAAAGAAAATAATCCTAGAATAATTCCCGGGCATTTGCCCGGGATGATTACTGTTTTAATGGATTATTAATCTTTGCATATTCAAATGGACTGATAAACCTTTCTCTGTTTACATCCAGAAAATCGGCCCACCACTGTAGCATCAATCGCCGTTCTTCCAGATGCTCTGCTTTATGGATATACGCGGCCCTCACTGAATTTCGCGCCATGTGGCTCATCTGGCGTTCAACAGCATCACGAGACCACAGACCTGATTCGACCAATGAACTACAGGCCATTGTTCGAAAGCCATGACCACAAACCTCTACTTTTGTATCATACCCCATGACCCGTAACGCACTATTTACCGTATTCTCACTCATGGGTTTGTGCGAATCGTGATCACCAATAAATATCAAGTCATGGGCCCCATAAAACTGTTTTATCTGCTTTAAAATTGCAAGAGCTTGCGTTGAAAGAGGCACTAGATGCGTTGTACGCATTTTTGAGCCTCTATGGGAATGTTTCACTCCAGGAATAGGCTCCCGCTCCGGTGGGATAGTCCATATAGACGCTTCGAAATCGATCTCTGACCAACGAGCAAAACGCAGCTCACTGGACCGAATAAAGATCAGCAAAGTGAGTTCTATCGCCCATCGGGTTAGCGGCCTACCAGTATAGCTATCTATTTTTGTAAGCAACTCAGGGATGCGCTTTAATTCAAGCGCGGGACGATGTTGTCGATTACAGGAAGCAACCGCCCCAGCCATCTCTTGTGCCGGGTTATAATCAATTAACCCACTTTGCACTGCATAGCGCATGATGGCTGTAGTGCGCTGCTGAAGACGAGCGGCCACTTCAAGACGTCCAGACATTTCTACGGCCTTAATAGGTGCTAATAAATCTCGAGTTTTTAACTCAGCGATATTACGTTCACCAAGCGCTGCAAAAAGATTATCTTCAAGACTTTTTAGCACACGATGGGCGTGATCTTCAGACCACTTTTTATTGGTGCCATGCCACTCAATCGCGACTTCTTTAAAGGTTCGTGCTTTACTCTGTTCAACCTTATCATTTTTCTTTTTGTCTCCCGGATCGACGCCATTCGCAAGCAGCTTACGCGTCTCGTCACGACGTACTCTGGCATCCGCTAGTGTGATTTCAGGATAAACCCCAAGTGCCAGCATTTTTTGCTTTCCCTCATAACGGTACTGCAAACGCCAGTACTTAGAACCATTTGGATGGACAAGCAGATGCAT
Protein sequences of DBSCAN-SWA_1 >CP029164|108496:123827|108496_109652_+|AWH67968.1|transposase|DBSCAN-SWA MSRKTQRYSKKFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARKGLGTPGSRTVAELESEILQLRKALNEARLERDIFKKSNCRFCTGVAEKYALIEQWRQQFPIEAMCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >CP029164|108496:123827|122636_123827_-|AWH72403.1|integrase|DBSCAN-SWA MHLLVHPNGSKYWRLQYRYEGKQKMLALGVYPEITLADARVRRDETRKLLANGVDPGDKKKNDKVEQSKARTFKEVAIEWHGTNKKWSEDHAHRVLKSLEDNLFAALGERNIAELKTRDLLAPIKAVEMSGRLEVAARLQQRTTAIMRYAVQSGLIDYNPAQEMAGAVASCNRQHRPALELKRIPELLTKIDSYTGRPLTRWAIELTLLIFIRSSELRFARWSEIDFEASIWTIPPEREPIPGVKHSHRGSKMRTTHLVPLSTQALAILKQIKQFYGAHDLIFIGDHDSHKPMSENTVNSALRVMGYDTKVEVCGHGFRTMACSSLVESGLWSRDAVERQMSHMARNSVRAAYIHKAEHLEERRLMLQWWADFLDVNRERFISPFEYAKINNPLKQ >CP029164|108496:123827|117146_118475_+|AWH67974.1|transposase|DBSCAN-SWA MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEEIRKLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPGGEMGDLYSHRWEIELGYREIKQTMQRSRLTLRSKKPELVEQELWGVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLITLQGASPGRIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >CP029164|108496:123827|113694_113952_+|AWH67971.1|DBSCAN-SWA MVSSSASNVVNCETKQRTQFECIYFSQYWAKGDFIAKRAPIGQWEPYSEESLLGIIVTSVCRIKVAMLKPEPPRDPHIPLMGDFN >CP029164|108496:123827|119349_119544_-|AWH67976.1|DBSCAN-SWA MLHDRSIVMHADTTTPVGMTDAFLTLFPVIVGLPRYCGVSAPSTAFRWLLSVHSRYGPHNPLTS >CP029164|108496:123827|109800_111804_+|AWH67969.1|holin|DBSCAN-SWA MIMSENDTIPKKSTSQINKAVFFTSALLIFLLVAFAAVFPDVADKNFKLLQQQIFTNASWFYILAVALILLSVTFLGLSRYGDIKLGPDHAQPDFSYHSWFAMLFSAGMGIGLMFFGVAEPVMHYLSPPVGTPETVAAAKEAMRLTFFHWGLHAWAIYAIVALILAFFSYRHGLPLTLRSALYPIIGDRIYGPVGHAVDIFAVIGTVFGVATSLGYGVLQVNAGLNHLFGVPINETVQVILIVVITGLATISVVSGLDKGIRILSELNLGLALLLLALVLCLGPTVLLLKSFVENTGGYLSELVSKTFNLYAYEPKSSNWLGGWTLLYWGWWLSWSPFVGMFIARVSRGRTIREFVTGVLFVPAGFTLMWMTVFGNSAIYLIMNQGATDLANTVQQDVSLALFNFLEHFPFSSVLSFIAMAMVIVFFVTSADSGAMVVDTLASGGVANTPVWQRIFWASLMGIVAIALLLAGGLSALQTVTIASALPFSVILLISIYGLLKALRRDLTKRESLSMATIAPTAARNPIPWQRRLRNIAYLPKRSLVKRFMDDVIQPAMTLVQEELNKQGTISHISDAVDDRIRLEVDLGNELNFIYEVRLRGYISPTFALAAMDNDEQQSEQHRYYRAEVYLKEGGQNYDVMGWNQEQLINDILDQYEKHLHFLHLVR >CP029164|108496:123827|119678_120146_+|AWH67978.1|DBSCAN-SWA MQYQCNMQEEPGDCHQPHNPARTGSEPTENTTGESKVNRPVFPVYHFLVSAAILVFVVIFWRTHHRDHRNWLALRLFVLCSVNRWPLRMVKGTVVGTTDTLREMQRYEQLSSTGLTTGKSSREHRYMIRLLLSLVRVCAGIICQCMTIPYQPHRG >CP029164|108496:123827|121190_122463_+|AWH67979.1|transposase|DBSCAN-SWA MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEAVEYGRGKKVDSARALIARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFRCDNGEKLRVTFALDCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNELPASPVEWLTDNGSCYRANETRQFARMLGLEPKSTAVRSPESNGIAESFVKTIKRDYISVMPKPDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQQASNGLSDNRCLEI >CP029164|108496:123827|115302_116520_-|AWH67973.1|DBSCAN-SWA MATAWYKQVNPPQRKALFSAWLGYVFDGFDFMMIFYILHIIKADLGITDIQATLIGTVAFIARPIGGGFFGAMADKYGRKPMMMWAIFIYSVGTGLSGIATNLYMLAVCRFIVGLGMSGEYACASTYAVESWPKNLQSKASAFLVSGFSVGNIIAAQIIPQFAEVYGWRNSFFIGLLPVLLVLWIRKSAPESQEWIEDKYKDKSTFLSVFRKPHLSISMIVFLVCFCLFGANWPINGLLPSYLADNGVNTVVISTLMTIAGLGTLTGTIFFGFVGDKIGVKKAFVVGLITSFIFLCPLFFISVKNSSLIGLCLFGLMFTNLGIAGLVPKFIYDYFPTKLRGLGTGLIYNLGATGGMAAPVLATYISGYYGLGVSLFIVTVAFSALLILLVGFDIPGKIYKLSVAK >CP029164|108496:123827|118502_118646_-|AWH67975.1|DBSCAN-SWA MSLDPLACGHDNVKQSKIVALTVSLGDKKAQYSRWQPDNLTVDAETK >CP029164|108496:123827|119559_119790_+|AWH67977.1|DBSCAN-SWA MSVRVTGNNASRGGESVVMDGRWGVVVKILSNVRGAKAPSCSTSVTCRKSPAIAISLTTPLEPVQNLQKTLQEKAK >CP029164|108496:123827|114004_114130_-|AWH67972.1|DBSCAN-SWA MIKNFIFDNLIILAVPFMIKTSLKTNLIFFFLCVFVPHMAS >CP029164|108496:123827|114172_115291_-|AWH72402.1|DBSCAN-SWA MINYGVVGVGYFGAELARFMNMHDNAKITCVYDPENGENIARELQCINMSSLDALVSSKLVDCVIVATPNYLHKEPVIKAAKNKKHVFCEKPIALSYEDCVDMVKACKEAGVTFMAGHIMNFFNGVQYARKLIKEGVIGEILSCHTKRNGWENKQERLSWKKMKEQSGGHLYHHIHELDCVQHLLGEIPETVTMIGGNLAHSGPGFGNEDDMLFMTLEFPSGKLATLEWGSAFNWPEHYVIINGTKGSIKIDMQETAGSLRIGGQTKHFLVHETQEEDDDRRKGNMTSEMDGAIAYGHPGKKTPLWLASLIRKETLFLHNILCGAKPEEDYIDLLNGEAAMSAIATADAATLSRSQDRKVKISEIIKHTSVM >CP029164|108496:123827|111748_111907_+|AWH67970.1|holin|DBSCAN-SWA MTYWTSTKNTCTSCTWFVNSNMPSWGRQLLSRPQYEGMQNDFTLEMDAEADN |
14 | Acinetobacter_phage(25.0%) | transposase,holin,integrase | attL 103956:103970|attR 128728:128742 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
355346 : 412633
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >CP029164|355346:412633|DBSCAN-SWA TTCATTTTTGTGTCCTGCAAGCCCCGCGATAGCCAGAGTGTACCACGCCTCCCGTGAACAACGCCGCGCTGTCCAGGGCATCGGCTCTTTTCAGAGGAATTGTTTGATGGCTACCCTCACCACTGGCGTGGTTCTTCTTCGCTGGCAACTTCTTAGTGCCGTAATGATGTTTCTGGCCAGCACGCTCAACATCCGTTTTCGTCGGTCTGATTATGTCGGGCTTGCAGTGATCAGCAGCGGTCTGGGCGTGGTTTCTGCCTGCTGGTTCGCAATGGGGTTGCTTGGCATCACAATGGCGGATATCACCGCCATCTGGCACAACATCGAGTCGGTGATGATAGAAGAGATGAATCAGACACCGCCACAATGGCCAATGATTTTGACTTGATATGTAGAAGCCTCCAAAACGGAGGCTTCTTTTTTACGGCTGGCAGATGTTTTAATCATCCACCTTAAAACAATATAACCTATTGTTTTAATGAAAAATCAGAACGGAATATCGTCATCAAAGTCCATCGGCGGCTCATTAGACGGTGCTGCCGGAGCAGACTGCTGCGGGCGAGACTGCGCGCCGCCGCTGAACTGATTGCCGCCCTGCGGCTGCTGAGGCTGACCCCAACCGCCCTGCGGCTGACCACCACCGATATTGCCACCTGCAGGAGCGCCACCACCCTGACGACCACCCAACATCTGCATGGTGCCGCCAACGTTCACCACGACTTCTGTGGTGTAGCGATCCTGACCGGATTGATCGGTCCATTTGCGGGTACGCAGCTGACCTTCGATATAAACCTGAGAACCTTTACGCAGATATTCGCTGGCCACTTCTGCCAGTTTGCCGAACAGCACAACGCGGTGCCATTCAGTCTGCTCTTTCATCTCGCCGGTCGCTTTATCACGCCAGGATTCGGAAGTAGCCAGCGTAATGTTGGCAACTGCGCCACCATTTGGCATGTAGCGTACTTCCGGGTCCTGACCCAGATTACCAACGAGAATAACCTTGTTTACGCCTCTGCTGGCCATGTTCGTGTCTCCTGAAAAAAAATCGTTCTGAATAAGTGTAAACGCGCGATTGTACCATTACCAATAGCGCTTTTACTATGTTGTGACCTCGGTTCCGGGAAACAAACCTGGCCAGACATTGTTACACAACACTCCGGGTAATGCATTCCAATACTGTATATTCATTCAGGTCAATTTGTGTCATAATTAACCGTTTGTGATCGCCGGTAGCACCATGCCACCGGGCAAAAAAGCGTTTAATCCGGGAAAGGTGAATGGATAAGATCGAAGTTCGGGGCGCCCGCACCCATAATCTCAAAAACATCAACCTCGTTATCCCCCGCGACAAACTCATTGTCGTGACCGGGCTTTCGGGTTCTGGCAAATCCTCGCTCGCTTTCGACACCTTATATGCCGAAGGGCAGCGCCGTTACGTTGAATCCCTTTCCGCCTACGCGCGGCAGTTTCTGTCACTGATGGAAAAGCCGGACGTCGATCATATTGAGGGGCTTTCTCCTGCCATCTCGATTGAGCAGAAATCGACGTCTCATAACCCGCGTTCTACGGTGGGGACAATCACCGAAATCCACGACTATTTGCGTTTGTTGTTCGCCCGCGTCGGTGAGCCGCGCTGCCCGGACCACGATGTACCGCTGGCGGCGCAAACCGTCAGTCAGATGGTGGATAACGTGTTGTCCCAGCCGGAAGGTAAACGCCTGATGCTGCTCGCGCCAATCATTAAAGAGCGCAAAGGCGAACACACCAAAACGCTGGAAAATCTGGCAAGCCAGGGTTACATCCGTGCTCGTATTGATGGCGAAGTCTGCGATCTTTCCGATCCGCCGAAACTGGAACTGCAAAAGAAACATACCATTGAAGTGGTGGTTGATCGCTTCAAGGTGCGTGACGATCTCACCCAACGTCTGGCAGAGTCGTTTGAAACCGCGCTGGAGCTTTCCGGTGGTACCGCGGTAGTTGCCGATATGGACGATCCAAAAGCGGAAGAGCTGCTATTTTCCGCCAACTTCGCCTGCCCAATTTGCGGCTACAGTATGCGTGAACTGGAACCACGCCTGTTTTCGTTTAACAACCCGGCGGGTGCCTGCCCGACCTGTGACGGCCTTGGCGTACAGCAATATTTCGATCCTGACCGCGTGATCCAGAATCCGGAACTGTCGCTGGCAGGTGGTGCGATCCGAGGCTGGGATCGCCGCAACTTCTATTACTTCCAGATGCTGAAATCGCTGGCAGATCACTATAAGTTCGACGTCGAAGCGCCGTGGGGCAGCCTGAGCGCGAACGTGCATAAAGTGGTGTTGTACGGTTCTGGCAAAGAAAACATTGAATTCAAATACATGAACGATCGTGGCGATACCTCCATTCGTCGTCATCCGTTCGAAGGCGTGCTGCACAATATGGAGCGCCGTTATAAAGAGACGGAATCCAGTGCGGTACGTGAAGAATTAGCCAAGTTTATCAGCAATCGTCCGTGCGCCAGCTGCGAAGGAACCCGTCTGCGTCGGGAAGCACGCCACGTGTATGTCGAGAATACGCCGCTGCCCGCCATCTCCGACATGAGCATCGGTCATGCGATGGAATTCTTCAACAATCTCAAACTCGCAGGTCAGCGAGCGAAGATTGCGGAAAAAATTCTTAAAGAGATTGGCGATCGCCTGAAATTCCTCGTTAACGTAGGCCTGAATTACCTGACACTTTCCCGCTCGGCAGAAACGCTTTCTGGCGGTGAAGCACAGCGTATCCGTCTGGCGAGCCAGATTGGTGCGGGCCTGGTTGGCGTTATGTATGTGCTGGACGAGCCGTCTATCGGCCTGCACCAGCGCGATAACGAGCGCCTGTTGGGTACGCTTATCCATCTGCGCGATCTCGGTAATACCGTGATTGTGGTGGAGCACGACGAAGACGCGATTCGCGCCGCTGACCATGTGATCGACATTGGCCCGGGCGCAGGTGTACACGGCGGTGAAGTGGTCGCAGAAGGTCCACTGGAAGCGATTATGGCAGTGCCTGAGTCGTTGACCGGGCAGTACATGAGCGGTAAACGCAAGATTGAAGTGCCGAAGAAACGCGTTCCGGCGAATCCGGAAAAAGTGCTGAAGCTGACAGGCGCACGTGGTAATAACCTGAAAGACGTGACGCTGACGCTGCCAGTCGGTCTGTTTACCTGCATCACAGGGGTTTCAGGTTCCGGTAAATCGACGCTGATTAACGACACACTGTTCCCGATTGCCCAACGCCAGTTGAATGGTGCGACCATCGCCGAACCGGCACCGTATCGCGATATTCAGGGGCTGGAGCATTTCGATAAAGTGATCGATATCGACCAAAGCCCAATTGGTCGTACTCCGCGTTCTAACCCGGCGACCTATACCGGCGTGTTTACGCCTGTGCGCGAACTGTTTGCGGGCGTACCGGAATCCCGTGCGCGCGGCTATACGCCGGGACGTTTCAGCTTTAACGTCCGTGGCGGGCGCTGCGAAGCCTGTCAGGGCGACGGCGTGATCAAAGTGGAGATGCACTTCCTGCCGGACATTTACGTACCGTGCGACCAGTGCAAAGGTAAACGCTATAACCGTGAAACGCTGGAGATTAAGTACAAAGGCAAAACCATCCACGAAGTGCTGGATATGACCATCGAAGAGGCGCGTGAGTTCTTTGATGCGGTGCCAGCTCTGGCGCGTAAGCTGCAAACGTTGATGGACGTTGGCCTGACGTACATTCGCCTCGGGCAGTCCGCAACCACGCTTTCTGGTGGTGAAGCCCAGCGCGTGAAGCTGGCGCGTGAGCTGTCAAAACGCGGCACCGGGCAGACGCTGTATATTCTTGATGAGCCGACCACCGGTCTGCACTTCGCCGATATTCAGCAACTGCTCGACGTACTGCATAAACTGCGCGATCAGGGCAACACCATTGTGGTGATTGAGCACAATCTCGACGTGATCAAAACCGCTGACTGGATTGTCGACCTGGGACCGGAAGGCGGAAGTGGCGGCGGCGAAATCCTCGTCTCCGGTACGCCAGAAACCGTCGCGGAGTGCGAAGCTTCGCATACGGCGCGCTTCCTCAAGCCGATGCTGTAATCGTTAAGGCCGCTTTCTGAGCGGCCTTTTCCTTTCAGAGTTGCACCAGCAATTTACGTTTTTCTTCCGGCAGCAAATTCACCGCCTGCTGATAAGACGCATCCACCAGATAATAGATTTGCGAATCCGGCAGCGAACCGTCGAGATAAACAGTGCTCCAGTGCGCTTTGTTCAGATGGCGGCTCGGACGCACATCACTGTGCTGCTGACGTAACAACTCTGCCAGCTCCGGGCTGGTTTTCAGCGAAACAGCTGGGCGATTTTCAACCTCTTTCACCATCGCAAAAAGCACATCTTCAACTTTGATCTGCGTCGCTTTCCAGTCGCTGTGAACGCTCTGTTCCGCGCCAGCTTTAGCCATGCAATATTGTAGCAACTCCGAAATGGTCATTTTTTACTCCCCTTGTAGTGTCGCGATAATGCGACGCGATCCGCCGTGGATGCGATGTTCCCCCAGCCAGATGCCTTGCCAGGTGCCGGTCTGAATACGCCCTTTATGCACCGGCAATACAAGCGATGTTCCCAGCATTGAGGATTTGATATGAGAAGGCATATCGTCTGCTCCCTCATAGTCATGCTCATAATTTCCGTTGTCGGGAACGGTGCGGAGGAAAAAACGCTCCATGTCGTGGCGTACGGTGGGATCGCAGTTCTCATTAAGTGTCAGAGAGGCGGAGGTATGTTGCAGCAACAGATGCAGTAAACCGATGTTAACGCGCGGCATATCAGCCAGCTGATTCAGAATTTCATCCGTTACCAGATGAAACCCACGAGATTTGGCGCTAAGCGTGAGCGTCTTTTGATACCACATGTGCTGCTCCTTGATAAAACTCTCTTAATCAGTTTGCAGCAAGACAGCGAAAGGATAAAGGTGTGATTAAAAAAACAGCATTCGGGAGAGTGTCACGCTCTCCCGCTCTGTCAGTATTCTGAATTGACGATCACCTCTTCACCAAACGCACCCGCTTGTGGCAAGGGTTTGTAGGTAGAGTTGGAGGCGCGCAGAATGCGGATACCACGAGCGCCGACATCGCGTGCGGCGGTAATATCGTTATCGGAATCGCCATAAAAAATTCGGATATTTTTATCCTGCAGCCATTGCGATTTTGTATTTTGCCCTGGTTTATCACCCGCAAAGATCACCGGATTCATGTTGGTGGCAGGAATATGAAAATTATCCGCCAGCGTTTTTGAAACCGTTTCTGTTTTCGTCGGGCTACGACCAGTCACAAAGAAGATCGCGTCACCGCGGCGTACATGCATATCAATCAGCTGGCGAGCGACCTCTTTTGGAATGCTAAATTCATCCCAGCCATTGTTCATTTTTTCCCAGAACACAGGATTTTTCAGATAATCTTCGCTTTCTGGCGAGAAGTTTTTTTTGCCGCGCCAGAAGCCCGGACTGGAAAAAAGTACCGTGTCATCGATATCAAACCCCACCGCCATTGGCGGACGCCCTGCGAGGCTATTTTCAATTTGTGCGACCGAAACCCAATGAATTGGTGCCTGTTCAGCAAGCCTGGCAACGTTAGTACCAGGGTTAAGCGGTGAAGGAGATGAGGCCAGGGCAACAGCGGAACTGTTTAGAGCGAACAATAAGCAAACGGCACTGATTGCCTGTGTGATCTTGCGCATATTTTTCCCTAAATAGTCAGTATGTTGAAACTTTTTATTGTGAGATTTGTTGCAAAAACCATCTGACCATAACGCCAGCAGCAGGGGAAAGGAAGGATTATCTGCGGTTTTTGTGATGAAAAGCAGGATGATTATATTTAACCCAGTAAATACGTTCCTGTCGTTTTTTGTAAAAAAAATGTTTATTACGCGATGTAAGAATACATTAATTTTGCGTTTATTATGATGAATCCTCTGCAAATGAGCGAAATTTGATAGAACATTTATCTGCTGTACGATTTAATAAACTTAATGATACAGCATTACAAAAACAATCGAAGTTTATAAAGATGATTTCTGATTGACCAGCCCCCTACCCTACAATGGTGACTAATTTTATTACTCCCGAAGGAGATGATGATATGAATATATCAAATGTGAATAGCAACAACACGACATCTTTACCTGTAGAGCTAGATACACTAAATAACAAAGGTATCTCTTATGACAAAGATTTCTCTTATGCCAAAGATCTCTTTTTATATATAGAAACACAGTTAAAAATTGCAAAAGATTTTTGTAGACCTGGAGAAGAAGTATCAAGTTCTATTGCAAGTACAGTTTTTCACGCATTTATTGATTTAGTTAACAAAATCAGGGGTAAGAAAGATTGTATGTATATTTTCACGCTTTGCTGTTTTGCCGAGGAGGTTAAAGGTGATTATTCTCATTACAGGACCTTTTTATTTGATATTGGTAATCAATATAAGGTTAAACTTACACAAAGCGGAAAAAAAGAGTTCTCTTTAACTTTAGAATTTAACGATACTATAATTGAATCTCAGATAGTCACAGGCAATAAAGCAAAGCATATTCTTGAGGATATAGAAAAATTCTATCGTAACAAACCCGATACTTATTATTAAATTTAATAATGGTTAGATGTGCAAAAATTATTATTCCACACGGTAAAACAGGGAAAAGCACTCCAGAACATTGGGATAAACTTAACAAGTTAGAACTAACAACCTTTTATTACCTCTCACGTAGAACGATGGCATCAAAAAAATCATCTATAAAGCAAGAGTGAAAGCGACGACAGTTTAATTTCACTGCAGGCTGGGTAACTCCAGCCTGCTTTCCTGCATTACATCACCGCAGCAAACGCCTTTGCCACACGTTGTACATTTGCCGTATTTAACCCAGCGACACACATGCGACCGCTGGCGATGAGATAGACACCAAATTCTTCACGTAGTCGGTCAACCTGAGCGGCACTTAAACCGGTATAACTGAACATACCGCGCTGATTAAGCAGATAATCGAAATTGCGTTCTGGTATCTCTGTGCTCAATACCTTCACCAGTTCCTGACGCATCGCCAGAATGCGAGTACGCATCTCTTCTACTTCCGCCAGCCAGCTGGCTTTCAATGCCTCGTCATTCAGCACCGCAGCCACCACCTGCGCACCAAAATTCGGCGGGCTGGAGTAGTTGCGGCGAACTGTTGCTTTCAATTGCCCCAGTACGCGGCCTGCGGCTTCGGCATCTTCACACAGAACAGAAAGTCCGCCGACGCGCTCGCCGTAAAGGGAGAAAATTTTCGAGAACGAATTGCTCACCAGAGCGGGTAATCCAGCGCTGGCAATGGCGCGAATGGCGTAGGCATCCTCTTCCATACCGGCACCAAATCCTTGATAGGCAATATCAAGAAATGGGATAAGCTCGCGGGCTTTGAGAATTTCAATCACCGCATCCCATTGGTCATTAGTGAGATCGGCACCCGTTGGGTTGTGGCAACATGGATGCAGCAACACAATACTGCGGGCAGGTAATGTTTTCAGCGTCACCAACAGGTCATTAAAGCGCACGCCGTTAGTCGCTTCGTCATACCAGGGGTAAGTACTTACTTCGAATCCAGCCCCGGCGAATATTGCTACGTGGTTTTCCCATGTAGGATCGCTGACCCAGACGCCTGATTCCGGGAAGTAGCGTTTCAGGAAATCCGCGCCCACTTTCAATGCCCCTGAGCCGCCAAGGGTTTGAATGGTTGCTACGCGCTGTTGTTGCAGTACCGGATGGTCGGCACCAAACAGCAGCGGCGCAATAGCATGGCGATAGCTGTTAAGCCCTTCCATCGGTAAATAAAGCGAAGCGCCATGAGGCTGCGCATTCAGGCGCGCTTCCGCATCCGCCACGGCTTTCAGTTGTGGAATAATTCCGTCTTCGTTGTAGTACAGACCGATACTTAAATTCACTTTGTCGCTGCGAGGGTCTTCTTTAAAACGCTCCATAAGCGTAAGAATCGGGTCGCCAGCGTAGGCGTCAACTTTTTGAAACACGCGATGGTTCTCCAGGTTTACGGGCAGGTGGTTAAAACACAATAAACCGGAAGAAGGCGAAGATCGAGTGGATGTTCAGGGGCGAACGGCAATTAGCAACAGAGTGAGACTCATGACAAACGTACATCCGCCAGAGGCACGGCCTTCATAAGAGCATGAAAGAATATCAACTTATTGAATTGGTAGGATTTTATTGGCCGGATAAGGCATTCACGCCGCATCCGGCACAGACAATCAAATATTACAGAACGATTAATCCACGTATTTCATCGCGACCCTTGAAGTCAGGCGCGTAATAAGTTCGTAAGCGCTCACTTTTGTCATTTCAGCGATACGTTCTACGGGCAAACCTTCGCCCCATAAAATGACCGGGTCCCCGGCTTTGTCCTGCGCCTGTGGACCTAAGTCTACGCAGATCATATCCATCGCGACGCGCCCGACAATCGGCACTTCGCGACCGTTCACCAGCACTGGCGTACCGGACGGCGCGGCGCGCGGATAACCATCGCCATAGCCCATCGCGACTACGCCAAGGCGGGTATCACGTTCGCTTACCCAGGTTCCACCATAACCGACAGGCTCTCCGGCTTTATGCTCACGCACAGCAATCAGGCTGGAGGTCAATGACATTACCGGCTGACAGCCAAAATCGGCACCGGTGGAACGATCTTCCAGCGGCGAGACGCCATAAAGAATGATGCCCGGGCGCACCCAGTCAAAATGCGACTGTGGCCACAGCAGAATGCCACCCGACGCGGCAATGGAACGTTGACCTGGTTTGCCTTCACAAAAGGTATTAAAGATAGCGAGTTGTTTCTCGGTTGCGCCGCATTTTGGTTCATCCGCGCGCGCAAAATGGCTGACGATATTCACCGGCTGACGAACGTTTTTGCACTGGGTCAGGCGATGATAAAACGCCTCAGCCTGTTCCGGCCTTACGCCCAGACGGTGCATACCGGTATCGAGTTTCATCCAGACGGTAACCGGCTCGTCCAGGCTAGCCTCTTCCAGCGCAGCCAGCTGTTCTTCGTTATGCACGGCGGTATGAAAATGTTGCGCAGAAATCGTCGGCAGATCTCTGGCATCAAAAAAGCCTTCGAGTAACAGTACAGGTTTGGTGATTCCCCCCGCACGCAGTCGCAGAGCTTCTTCGAGACGGGCTACGCCAAAGGCGTCAGCATCGGGGAGCGTTCGCGCGGTCTCAAGAAGACCGTGACCATAAGCGTTCGCTTTCACCACCGCAACCATTTTACTGGCAGGCGCCAGTTCACGAAGACGTTGCAGGTTGTGTCGCAGAGCGCGGCGGTTAATCACAACAGTTGCCGCTTGCATTTGTGTTCCTTGATAAGTGTTTGCTTTAATTACCTAATTCATAAAATAATTATTATTCGTCGTCGTACTGCGGCCCCGCATAGTTGTCGAAGCGCGACCATTGACCGTTAAAGGTCAGGCGTACCGTCCCGATTGGGCCGTTACGTTGTTTACCGATAATAATTTCCGCGATGCCTTTTAAATCACTGTTTTCGTGATACACCTCATCACGATAGATAAACATGATCAAGTCCGCATCCTGCTCGATAGAGCCAGATTCACGCAGGTCGGAGTTGACCGGGCGTTTGTCGGCACGTTGTTCCAGAGAACGGTTCAACTGGGACAGCGCCACCACCGGCACGTTCAGTTCTTTCGCCAGTGCTTTCAGCGAGCGGGAGATTTCTGCAATTTCCAGCGTACGGTTATCGGAAAGCGCCGGTACGCGCATCAGTTGCAGGTAGTCGATCATGATAAGCCCGATGCCGCCGTGTTCACGGGCAATACGGCGTGCGCGGGAACGCACTTCCGTTGGCGTCAAGCCGGAGGAATCATCGATATAGATATTACGTTTTTCGAGCAAAATACCCATGGTGCCGGAAATGCGTGCCCAGTCTTCATCATCGAGCTGACCGGTACGGATTTTAGTCTGGTCAACGCGCGACAGCGACGCCAGAGAACGCATCATGATCTGTTCTGATGGCATCTCCAGCGAGAAGATAAGCACCGGTTTATCCTGCAACATCGCCGCGTTTTCGACGAGGTTCATCGCAAATGTCGTTTTACCCATCGACGGACGCGCGGCAACGATGATCAAATCCGACGGCTGCAAGCCAGCGGTTTTTTTGTTGAGATCGTCATAACCGGTATTTACCCCGGTAACGCCATCGTGTGGCTGCTGAAACAACTGCTCAATACGCGCCACGGTTGCGTCGAGCACATCGGCGATGTTCTTCGGCCCTTCGTCTTTGTTTGCACGACTTTCGGCAATTTTAAAGACGCGGGATTCAGCAAGGTCGAGCAGATCTTCACTGGTACGCCCTTGTGGATCAAAACCAGCTTCGGCAATCTCGTTAGCAACGGAGATCATCTCACGAACAACGGCACGTTCGCGCACGATATCCGCATATGCACTGATGTTCGCCGCACTTGGCGTATTTTTTGACAGCTCTGCCAGATAAGCAAAACCACCGACGCTATCGAGTTGCCCCTGGCGTTCCAGCGATTCCGCAAGGGTAATCAGATCGATAGGACTACCGCTTTCCTGCAAACGCGCCATTTCAGTAAAGATATGACGGTGTGGGCGGGTGTAAAAATCATCTGCCACCACACGCTCAGCTACATCATCCCAGCGTTCGTTATCCAGCATTAAACCGCCCAACACCGACTGCTCCGCTTCGATCGAGTGCGGAGGCACTTTCAGCCCGGCAACTTGTGGATCGCGTTCGCGGGGTTCAGCCTGCTGTTTGTTGAAGGGTTTATTTCCTGCCATAGTGAATGGAGTTACCGAGATAAAGAATGGGTCGAAACTTTACCATATGAAGCAGACCCTGACGATACGTTCTGGAGGACACATGGCAACACGAATTGAATTTCACAAGCACGGTGGCCCGGAAGTACTTCAAGCCGTAGAGTTCACTCCTGCCGATCCGGCGGAGAATGAAATCCAGGTCGAAAATAAAGCCATCGGCATCAATTTTATCGACACGTATATCCGCAGCGGCCTTTACCCGCCGCCATCGCTACCCAGCGGATTAGGCACCGAAGCGGCAGGCATCGTGAGTAAAGTCGGCAGTGGTGTAAAGCATATTAAGGCAGGCGATCGCGTAGTCTATGCACAATCGGCATTAGGCGCTTACAGCTCTGTGCATAACATTAATGCGGATAAAGCGGCGATTCTGCCTGCGGCAATTTCTTTTGAGCAAGCTGCGGCATCCTTCCTGAAAGGCTTAACGGTTTATTATCTGCTGCGCAAAACCTATGAAATTAAACCCGATGAGCAGTTCCTGTTCCACGCAGCGGCTGGCGGCGTTGGCTTGATTGCCTGCCAGTGGGCAAAAGCCCTTGGCGCGAAACTTATCGGCACCGTAGGAACCGCGCAAAAAGCGCAGAGTGCGCTAAAAGCGGGCGCGTGGCAGGTTATTAACTATCGTGAAGAGAATCTGGTCGAGCGGTTAAAAGAGATCACCGGTGGCAAGAAAGTGCGCGTGGTGTACGATTCCGTGGGCAGAGACACCTGGGAACGGTCGCTGGATTGCCTGCAACGCCGCGGCTTAATGGTGAGTTTTGGCAACTCATCAGGTGCGGTTACCGGTGTGAACTTAGGCATTCTCAATCAGAAAGGCTCGTTGTATGTGACACGCCCTTCCCTGCAAGGCTATATCACCACGCGGGAGGAATTAACCGAGGCCAGTAATGAACTGTTCTCTTTGATTGCCAGCGGTGTGATTAAGGTCGATGTCGCCGAGCAGCAGAAATATCCGCTGAAGGATGCGCAGCGTGCGCATGAGATTCTGGAAAGCCGGGCGACGCAAGGTTCCAGCCTGTTGATTCCATAAAAGAAATAGGGCTTCCACCTGGGAAGCCCTTTCTTTTTATAGTTCGGCTGTATGTAGGGTACAGCACGATGAATCTGTTAGAGGCGCAATAGTGACAGATTTGATTATCAATTCCTATTTTGTTCTAAGGATAAAACCTTAGGTTGTGATCGTCCGCACAATCCCTTAGTAACGCCAACGGTCATAGCGCTGATATTTCGGCACTTTTGGTGCTTTAATCACCTTAATAACCCACACTACCGCAATCGCCAATAACAGCCACGGCAGCAACTTAATCATCAATGCCAGCATACCGCCGAGGAACATAATGGCCGTCGCCACAACCAGCGCAGCGATAATGCCCAGCAACGAAACGCCGGTGACCATCAGCATGACAAAAAAGCCAATCACAAAAAGTAGTTCCAGCATGATGCTCTCCCAAATATGAAATCTCTTGCAGGCATTACAAGAATCATGCCAAAAATAATCTATTGATTTAACAGCAAAACGCCCCGCGACGGTGCGCAGGGCGTGGTGAATTTGACTACTTTTTGGTGAAAAGTTAACGCTTATCCGCCACCAGTTTGAGCGCGTGTTCCAGCACATTAATGTCTGCACCCGCTTTATGGGCATTTTCACTTAAATAACGCCGCCACTGCCGCGCGCCAGGAATACCCTGGAACAAGCCCAGCATATGCCGGGTAATATGGCCGAGATACGTACCCTGGCTGAGTTCACGCTCAATGTACGGATACATGGCGCGCACTACCGCCACCGGATCGGCATCAATATCCGAGGAACCAAAAATCTCTCGGTCTACCGCCGCCAGAATACCCGGATTCTGATACGCCTCGCGCCCGACCATCACGCCATCCATATGTTGCAAATGCGCTTTAGCTTCTTCCAGCGACTTGATACCACCGTTAATCGACATCGTCAGATGCGGAAAGTCACGCTTCAGTTGATACACACGCGGATAATCGAGCGGCGGGATCTCACGGTTTTCTTTCGGACTTAACCCAGAAAGCCAGGCTTTACGTGCGTGGATGATGAACATCTCACACTCACCTTTGCCGGAAACGGTGTTGATGAAATCGCAGAGAAATTCATAGCTGTCCTGATCATCGATGCCAATACGCGTCTTCACCGTCACCGGAATCGACACCACATCGCGCATCGCTTTCACGCAGTCGGCAACCAGCTGCGCATTACCCATCAGACACGCACCAAACATGCCGTTCTGCACCCGGTCAGACGGGCAGCCGACATTCAGGTTGATCTCATCGTATCCACGCGCTTCTGCCAGCTTCGCACACTGTGCCAGCGCCGCCGGATCGCTACCCCCGAGTTGCAACGCTACCGGATGTTCTTCTTCACTGTACGCCAGGTAATCACCTTTACCGTGAATAATCGCCCCTGTGGTCACCATTTCGGTATACAGCAACGTATTGCGGGAAAGCAGACGCAGGAAATAACGGCAATGTCTGTCCGTCCAGTCAAGCATAGGAGCAATGCTAAACCGAGAATTCCAGTAAACACCAGTTTTTTCAGGCATCACGCTGGTTTGATTAATTTTTTGTGTTTCATGATTATCGTGCATTTTTGAACATTTCAGGCTATTTTTCTCGCGTTAGGTTCCCGCCAGGTTCCCACGTTTTATGGGAACCCGAAATAACGAGGTCGTGTAATGGCGTACTATAACATAGAGAAACGACTAAAATCCGATGGCACACCACGCTATCGCTGTAATGTGATTATCAAAGAAAAAGGTGTTATCACTTACAGGGAAAGCAAAACATTCCCTAAACATGCTCATGCCAAAACATGGGGCACACAGAAAGTGATGGAATTAGATCTATATGGCATTCCATCATCAAATGCAGTTGACGGACTTACAGTCCGTGACTTACTACACAAATATTTAAATGACCCAAATGCCGGAGGTAAAGCAGGCCGTACTAAAAGATATGTGCTGGAACTGCTTATGGATAGTGACATCTCCGCGATCAAACTATCTGAACTGACAGAAAATGACGTAATTGAACATTGCAGGCTAAGAAACAACGCTGGCGCAGGTCCAGCAACAGTCAGCCACGATGTTAGTTATCTTGGCAGTGTTCTGGATGCGGCCAAACCTGTATACGGAATTAATTACACATCAAACCCGGCGAAAAGTGCTCGTCCATATCTACTTAAACTTGGTTTGATTGGTAAATCAAACCGTCGTAATCGTAGACCAGCATCTGATGAACTGGACATGCTCATTGAAGGCCTTCAACAACGATCTACTCATAAATGCTCAAAAATTCCGTTCGTTGATATCCTCAAATTTTCTGTGTGGTCCTGTATGCGAATCGGAGAAGTATGCCGGTTACGATGGGAAGATCTCGACCAGGAACAAAAATCTATACTCGTAAGAGACAGGAAAGATCCACGCAAAAAGGAAGGCAACCACATGAAAGTAGCCTTGCTTGGGGAAGCCTGGGATATCGTCCAACGACAGCCCCAAAAATCGGAATTCATTTTTCCATATAACAGCACTTCTGTTACTGCGGGATTTCAGAGGGTAAGAAGCAAATTAGGTATTAAGGATCTGCGATACCATGATTTGCGTAGAGAAGGGGCAAGTCGCTTATTTGAGGCTGGTTTTAGTATTGAGGAAGTAGCCCAGGTTACAGGGCATCGTTCATTAAACGTGCTATGGCAGGTATATACCGAACTGTATCCGAAATCTTTACATAATCGTTTTGAAGAGCTCCAAAGGAGCAGAAATAAGACCCCTTGACACTGTTTATCCATACAGTTAAAAATAATACTGTATACAAACACAGTATAGAGGGACTTTTATGCGTATTGAAATCTGCATAGCCAAAGAAAAAATGACTAAAATGCCAACCGGTGCTGTGGATGCGTTAAAGGAAGAATTAACCCGACGCATCAGTAAACGTTATGACGATGTAGAGGTGATCGTAAAAGCCACCAGCAACGATGGCCTTTCTGTTACACGCACCGCAGATAAGGATTCTGCAAAAACTTTTGTTCAGGAGACTCTGAAAGATACCTGGGAATCTGCTGACGAGTGGTTTGTTCACTAATTAACACGTAAAATCGGTAACGGCTGGAAATCATTCAATACTCGCACTATCGAAAGTTCGCCAGCCAGCCGTGGCACGTTCTTGCATACGACGTGCTACGGTTTCATTTATCTCCGACTGGAAACTTCTTATACAAAGTCGATACGCCAACATCATAAATGATCGCCACCTTCTGGCGAGGAACTCCTGATGCAATTAGTCGCCCGGACTGCGCCCATTGTTCTGGTGAAAGTTTGGGACGGCGACCGCCAATTCGTCCCTGTGCGCGAGCAGCTTCCAGTCCTGCTTTTGTTCGTTCAACAATCAGTTCACGTTCCATTTCAGCCAGGGCACCCATCACATGAAAGAAAAAGCGCCCCATTGGGGTACTGGTATCAATTGAATCCGTCAGACTACGAAAGTTGATGCCTCGTTCGCGCAACTCCTCCACCAGCACGACAAGATGCCGCATACTGCGCCCCAGTCGGTCCAGTTTCCAGACCACCAGCGTATCACCTGCCGATAATGTCCTGAGCAGTTTTTTCAGTCCCGGCCTTTCGGACTTTGTACCGCTTATCTTGTCTTCAAAAATCAGCTCGCATCCTGCACAGTTCAGCGCATTACGTTGTAGATCTGTGTTCTGGTCATTTGTTGACACACGTACATAGCCAATAAGCATGGTAGATCTCCCTGACAAAAGCAGGAATGATGCCATTTGCTCGTTATTTCTGCATTTTCATAAACGTTGGTTTGGGAGAAGGCTCAGCGTTACCCGTTGGCGTACCTGTTCCGTGGCCTTCGGCCACACCGCCGACAGGCTGGTTGAAATGCAATGGCGCTGCCTTTGATAAGGTGAAATATCCCCATCTTGCTACAGCATATCCATCAGGGAAACTACCTGATCTCCGTGGTGAGTTTATTCGTGGATGGGATGACGGGCGTGGTATTGATGCAGGACGTGCTTTATTGAGCATTCAGACTGGGATGCTGGAAAAACACCGCCATATTGTTGTTGCCAACGATAGGTATGATTCAAAAGAGGAATGGGAACTGGCGACAATCTTCAGAAGAGCATATACGCAAGGCCGGGGGCTTGATGCTGCCGATGCCGGAGGAACTCTGATTCCATCACCAACGCTACATACACGAGGGAGTATTGGTAACACAGGTGGGAGCGAAACCCGTCCACGAAATATTGCATTTAACTATATCGTGAGAGCTGCATAATGGATAAAGCCGTATTAAATAGCGAACTTATTGCCACGAAGGCGGGGAATATTACCGTCTATAACTATGATGGTGAAACACGGGAATATATTTCCACTTCAAATGAATATCTTGCCGTTGGTGTCGGTATCCCGGCATATTCCTGTTTAGATGCTCCTGGCACATATAAGGCTGGTTATGCAATCTGCCGTTCAGTAGATTTAAACTCATGGGAATATATGCCAGACCATCGCGGTGAAATTATCTATAGCACCGAAACAGGAGAAGCAAAAGAAATCACAGTTCCGGGTGATTACCCTGAAAATACAACCACTATCGTCCCGTTAACGCCATATGATAAATGGGATGGTGAGAAATGGGTGACAGATTCTGAGGCACAACACGGTGCCGCAGTAGAAGCGGCAGAAGCACAGCGCCAGTCACTGATTGATGCTGCAATGGCTTCCATCAGTCTGATTCAGCTGAAATTGCAGGCCGCACGTAAACTGACGCAGGCAGAAACAACCCGACTTAACGCCGTGCTGGATTACATTGACGCGGTGACGGCAACAGATACCAGCACCGCGCCGGATGTCATCTGGCCTGAACTGCCGGAGGCGTAGGCCATTCAATATCTGGCGCACTGGAGGTATCAACCAGTTCCAGTGCGTCCAGATAATCCAGCCATAAATTATATTGCGCCAGTTCCTCACCTTTCAGACGACCAATAGCGGCTTTACCAGGCCATTGCCTGCTATTCATGTGCTCGTTGGCTTCATTAATAAGTTTTCCTTTTTTCAACTCTGCCAATGCAATAAGGTTTTCTTTTGATAAAGGTGGTTGCTCTGTCAAAACCGGATATCCCTCCTGATTGCTGACTATTTTCATGCCATTATCCTGACCATCCAGTAGTGACAGCCATTCATCCGTGGTTATCTCAACAGCATCTGAAGGTGCTTTATTTAAATCGGTAAAAAAACCATTTTCTTTTTGTGAATAGAAGTATCTGTCCATTTATCAATCTCCAAAAGCAATCCAGTAAGCAAAAGGATTTATTCCTTGCTCAGTCACAGATGACATCAGGGAAAATTGCGATGGTGAAACAGGTAATGCTGCAAAACTAACCATTGTTGACACACCTAACCGTGCATTATCGTACGATGCAACAACACAATAATTGGTATTGCTGAAAGATATCGGCAGGGTGATATTTACAGGTGAGCCTAATGGCCCTGATGCTGATATTCCCATTTGAATGATGGTTCCATCAGGCAATTTTCTCCAGCGATTAGAACTCGGATTTCTTTTCCAGGCTGACATATCCGGTATCTGATTTTCTCCTGTACCCACATCCCTTTTCGCCGCTTCTCCCAAACCAAGGTTTTCGAGAGCTGTTTTCACCGTGCCATCCGATTTGATATCGCCAAACGGATTCTTGCGGCTTAACAGCAGCGCACGAAGCGCGGTAAGCAACTGGTCGTGCCGCGCCTTCTCCAGGCTGGCACCGGATGCCTCCACCACGCTACAAAGTTCTTCCTGCAACATGTCAAAGTAGTCATCATCCAGATCGGTGGCAGGTGTGCCGGTCTGGGGGTTACCACGGGTAAAACCGTTCTTACCCGCGCCGAACTTATCCTTCTGCGCGGTTTTCGTGTCTATACGATGCATGGATTACTCCGGATATTTAAAAATTACGTAGGTATGCGAAGGGCAGAGTTTGTTAAGCACGCACTCGACAACGGTGTCGCCCCAGATACGCAGTGCGGAATCACAGGGATCGCCACATGTCATCCAGGTGGTGTTGGTGGCGGCTGGCATGTTGACCTGCCAGTAATACCGCCATTCCGGCGCATTCACCGCGTCAGTACAGGCCGATGAGCAGGTGAACGTACCTTTATCGTATCGCGTGATAGTGGCGTCTGGTCTGCCCAGGGCAGCAAGCTGTGCAAGGTAAAAATCCTCATTGATGCCGCCCGCCAGATTAACCTTCGCATCCAGCCGTTGCTGACGCTGGCGAAGGGTCTGTGTCCCTGCGGGAATACATTCATCCGGCAGACCGCACAGACGCTCCCAGCGGTTTATCAGTTCAGTGGTGGTGCGCGGATCCAGCTCCCGCATCAGGGCATCCGCACGCTGATGAACACGGGTTAATGACGGTGCCGCACCTGCAATCGCCGGATCGCTGGCTGACCACGCCGGACCGGGCGGCAGCAGTGCCGACAACAGACGGATGTAATCATCGTTTGTCACGTCCATGAAATCGTCCCCAGAACCGCCAGTTCATTTTTTGCAATGGAGATATTGTCTGCCGGTGCAAGCAACTGATGGCTGTATTCCCCGTTCGCACCGGAAATCGCCTCACTGATACGCGATACCTTCAGTTCTCCCTGCGGATAACCATCACGCAGCAGGAACGAACGCAACTCCGCGGTGATGGCAGCCCGTATTTCCGGTGTGTCCGGCGTCACACGGATATGAAAATCCACCGTATGTGCCACCGGCCTGAACACATACAAATCAGAGCCTGCCACCGGGGCCAGTGGCCCGATATGTTGTCTTGCCGCCGTTTCCGTTGATTCTTCCGGAATGGGATTAATCAGGTCACTGCTGGCAATCATCACACCGACAGTTCCCGTTCCCATCAAGTGACGGTATGTCCATGCGCGGGTAATGCCGGGCACTTCTTTAGCCCAGACGACATAGTCCCCGTCAGCCCCGCCCTGAGGCGTCCAGTAATACCGCTCAATGACGCGGGCGCGCCACGTTTCCAGCTCTTCAGTATCAAATCCGCCTGTCAGGGTGTCAGCCACACCGGAAGACGGCAGACCATTCACCGGCGTGACCAGGATTAATGCCGTACCGTCGTCAGCGTTACCGACCGCACCTGCACTTGAGCAGGCGATCGGCACGCGCAGGACACCACCGGAGCTGGTTGCATCGGCAGTTGCCGTGTACTGAACCAGGTCATCGCGCTGAATAACACTCCCGGCAGTCACCTTCAGGCCATCGCTGACACCTTCCCAGCGCATATACCCGCTGGCAGCCGTGGCCCCCTTGCGCGGACACCGTTTCATCGCAGCATGTCGCGCCAGCCAGGACTCATCGCACAGGTCAGGCAGCATGTTCATTGCCAGATAATCGATGTAACCGTAAACCGTATGCAGCGCCGCCGCATACACCTTTGCCCGCACGTCTTCATCCATGCGCCGGAGCGTGTCGCTGACGTCCAGCCTGGCGAATAAATCGTTACGGAGCATACTGATATTTTCTGCCAGCGTCGGGCGCTGAAATTCACTGTCCGCCATGCGTTATCGCACTCCACAGATCATCAAAAGAAATCATTACCGGTCCGTCACGACGCCAGAGAGTGATACTGTTACCCAGTTCATTAATCCCGGTGCGGCGGATATCCAGATCAATACGGGACACCACGCCGTCATCAATCATCCATTGCAGGCATTCGCGGATATACCCCCTTACCGTCTGCACCAGCTGATTGGTCAGTTTGCTGCGCTGAAGCAGCCACAGTCGGGAGCCGTAACGGTCATTCTGTACCGCAGGCCAGGTATCCCCCCACCATCCCATCGGGACGTCGGCATTGTCATCAGGCTCCGCCCGCCGCCAGGTAAACAGGGAAATCACCACGGCGCGGGTCAGCGGATCCAGCGGTGCGCTGGCGCAGGTGCGTTTACCGTTCACCGTCAGCCACAGTTCCATCATGCCTCCATCGCTTTATCAGGTTTGTCGGTGTTACTGCCCTGACCGTTCTCTCTGTGACGATGCCCGTTATAGGCAAGCCGCATCGCTGACATGGTGGTGCCGCCGGAGTCGCACAGGTCTTTCACCTGTCCTGTCACTTCCAGGTCCATTTCAAAACGTGCTTTAGGTGAATTGCGAAACGTGATCGTTTTACCAGCACCGTCCACCACGATCCCCTCCCGGGTCAGCGTCACGGACTGCCCCTGATCGTCATAGACAGCCACCTCACCCGTCTGCAGCCCTTTCAGGCGGTAGCGCCGGTCCGACACCGTAACAACCACCGCATGAGAACGGTCGCCATCCGGAAACAACACCACCGCTTCCGCACCGCTGTTTGCCCTTGCGGTAAAACCGTAGGGTTCAAGATGTTCAACCCCGGCTTTGGGTTCACCGGCAATCAGGGACACATCCACGGTCTGACATTTCGTGGCGGCACTGATGCTTTTCACCACTGCCCGCCCAATCAGGCCGAGGAGTTGTCGCTGCATGGCTTCAATCGTACTCATCAGAACGGGTCCTCCTGTACTCTGGCTTTTTTCTTTTTCCGCGCGCCGGGGGCTTCGGGTTCAGGCAGATAAGCATCAGGCGGGCCGACACGGATTTCCGTCAGGGTGCCGTTCTGGTCCTGAGTAAACGTGACTTCCGAGACAAGCAGTTCGGTATTGTCGAAACCACAGACCGGATCGAAGACAATCACCCGCTGGTTGGGCTGCCACAGCGTACCGTTACCCTGTCGCCAGCCCTGCACCACATAGGTGGTTTCATCCGTCCGCGCCGCCCGTTGTCGGGCTTCAAAGTCCGCACGGGCAATACAGCCTGCCCCCGTAGCCTGCCCTGTCTGCCTGATATACATCGGACGGTAACGGGCAATAAATGCGTCCTCTGTGCGGGCCCGCAGCGCGGTGGTGGTGGCCTCACCGAAATCATCGTCGTTTCCGGCACGCTGCCCCGCCACCTGGTAAACAGAAAACCGCTCCCGGATACTCTTCTCCGTATCGCAGGAAAGGATGTTTTCCCCGAGTACCAGCGCAGTATGTGCCCGCGTTGAGCCAATACCGCCAATCACCAGCCTGCCGTGCGGGTCGTCGTAAGCCAGTGCCTGCTGCTGACCGAGTATTTTGTTGATTACCTCAATCACCGTTTCACCGTGATCAGGCTGGACATCAGGAATAACACCCGACGGCGCACCGTTGTTCACCACCTCAATGCCGAAAGGCGCAGCAAGCGCCTGCGCAATCTGTACCAGCGATCGTCCATTAAACTGTGTCGGTTCGGCTGCACAGTCAATCAGGTCAGCGGTCAGACTGCGTCCGGCAATACCGGTGCTGACCGAACGGGCATCGTAACGAACGGGCGTCGCCTCCACCCAGCCGGTGATCACCAGCTCATCACCAATCAGCACCTCCACTTTTGAACCGTTTTTAATGCGCGGCTGAAGCGTGGTGATACCCTCATCTCCCGGCCACTGGCGGGTGATCTCCACACTGAAATCCCGCGCCAGCCGTTCAATACCGGCACCGATGCGCACCGATGTCCAGCCATTCCACTCCCGGCCATTTACCCGTAGCGTGACATTGTCGTTCATTGCACTGGCACCTTCAGAGGGATCACCGGCACAAAGCCGGGATGCGTAATGGCATTACGCCGGATAATGTCCGCGTCACGCGCCGCGTTATCAAACCAGGTCGCCGCCAGCACCAGCGCGGGTAAAACCTCATCCGGTGTGCGCTGAATGATCCGTGCAGACTGTTCAAGGCGCGTGTTGATATCCGCATTCAGATCTGCTTTCACCCGGCGCAGCGCCAGAAACAGCGCATCGCTGGTTGTACGGGACAACTCCTTATCAATTGCCGTATTCAGTGTGTCGCGAATGTCAGTCAGTTCTTCCCACGTCGGCAGGTCAACCGTGTTTTTCACCGCCGGTGCATTGTTCAGTGCCGGATGCGTGACGGAAGGCCAGCCAGTGCTCTGCGCGGGTGTTGTTGCCTGCCCCACTGCGGAATTCTGCATCACCGCGGAAGTTGTTGGCGCAGGCAATCGGGTGACGGCATACGCCGCTTCGCTGATTGCGGTCGTACGAAGGGTGCTGGCAACCACGTTACGCTGCTGCGTCGCCGTGGCGGTGGTTTTACTGTCCGTTTTCCAGACGCCGCGCGGTTGCAGATCGCTGCCGAGGCTGACACCGGAAAGCGTTTTGATCATGGTGACCAGGTCGCTGGCGTTACCATAAAGGCGTTTCCCGGTACGCCACATTTTCTGCACCTGCTCAACGAAATTTTTGCCTGACGATGGCGGCGGCAGAAGTACCGAGATATCCCCCTGCAACAGCCTGGCGGCATCCGATACGGCAGAATCCACCACTTTCATCGCATCAGAAACATACCCCAGCATTATGCTGGCATTACCGATAACGTCGTTCTGCACGAAATCCGCCACACCATCGATACTGAAACCGCTGAAGCTGTCACTGATGCAGTCATCCAGTGCAGAACAGGATGACATCAGCGTCTGCGCCGTCGCCGCACCTGATGTGGGGTAAGAGAGTTCTCCTGCTTCGACAAACTTCAGGTCAAAGCGGACAATACGCCCTTCACTTTTCGATGTGCTGACCCGAACTTCCCCGTCAACACAGACTTTCAGCTCACCATATGTCGGGTGGACAAGCGTGCCGGGACCGGGTTTATTCAGCGCGTCAATCAGGCGATCGCGCTGGTCAAAGCAGTCATCTCCCACCACATAAGCTGTGATGGACGGGCGGAAAGTGACTTTTCCCAGATCTTCGGTATAGGGCTTGTCGCGGTTCGGGTATTCATGTGTTTCCACACGGCGACCGGTTCCCGCACTTTCTTCTTCAACCTTAAACGGTACGCCGCGAAATGACGCATCCTGAAGCCTGTCTTTCCACGTCATATAAACTCCGGATACAAAAAACCCGCCAAATCTGCTTTGTCAGTTATTTACATCGCAGAAGATGTGGCGGGAACCTAATATTTTTAATTACTATCTGAGTTGAACATCAATGGAATAAATATCACCACTCTTTATAAATTTAGAATCTGTCCTTTCATCAAAAGATTCAAATGACTGTACCTTTAAAAACTTTTTCATTTTATTTTCAAAAATACTTTCATTAACACCAGTTAAATATTTGAACGCTCTACCAGCAAGGACCTCATTACTTAAATCCATTGTGTTTTTATTGTCTTTGAAAAACCAAACAATAACCTTTTGTGGGCATGATGGATTATAAACAGATATATAAAACTGCGGCTCATATTTTTCATCAGCGTCATCACTAAGCATTTCTTCAGAAGATAATTCTCTTCTGAATTCATATTGCCGCTTAGTTATTCCTTCATCCTTTATTATCTCTTGCTTAACTGGTGCAATACCTATAGAAGAGATTAATTCTGACTCATTAAAGCTGAACTTACACTCTTCCGCAGCCAAGTTAAAAGATAAAAGTGCAGATATAAAAAAAACAAAGATACGCATAATCATCCCTTCAATCATTTGTAAGGAATGATTATATTAACTACTTAAAGCTGAAAACCCAAATTATGCCAGACAAAAACACATTAATCATTTTGTACACTACCTGAACCGCGTATAGCCAACATCATGGCTGACATCAAAACCGCTGGATCGCGTTTCCATAACCCGCATACCCGGAGGCGAATTCACAAAAGATACCTTGATCTCACCATCAACTTTTGGCACAGAAGCTTTGTTAATCATGAAGGGATTCGAGCCTGTGGCATCGGAGGCGTTGTTTGACTGAGCCGGATCCACCGCCGGATAAGGTGTGTATCCCCGCGCCGGTATTCCCGTCCCATAAGCATCATAAGCACCCGCGCCCCACTGCGCAGAGTTAATGGCATCGACCGTGTCACCGGAACTGTCGGTAAACCACTCAATAATTGGCTTCAGCTTGTCCCACATATCCTGAAACCACTTAACAACCGGTCCCCAGTTATTGATCACCATCCCCAGCGGCGACCAGGCAAAAACCTTCTTCAGAAGTTCCCAACCTGCCTCAAAATAAGGACCAATGGTTTCCCAGAGCTTCTTGAAATAAGGTCCGACAACATCCCAGTTAGTGATAATTAATCCCGCAGCCAGAGCAATCGCCGTCGCAATCATGCCAATCGGCGTCATCGACATAATCCTGCTGACAATACTGATGGCACTGCCCACGCCCATCAATCCCAGTTTCAGAATCGCAAGACCGGCAGCAAGCCCGACGACGCCGCGAATAACCCGGGGATTTTCATCCGCAAACTTCGTGAATTTCTCCCCCAACTCCCCCAGCCATTGTGTGATATTTTTAGCGTCACCAGAAAATGCGCCGCCAATAGCCGCAAGGCCGTTAGTTGCGGTCCCTGTCATTGCCTCCCACAGGTTGGACAGCGTACCAAGCTGTGCCTGAACACGTTTATTCAGGCTGGCCTGTTTATTCATCTTCTGCTGGATCTGATCGTAGCCATCCTTTCCTTTATCGATTAGTGCATTGACCACCTGAAGGGTTTCGGCATCATCACCAAATATTGCCTTAAGTACACCTGTTCGCTTAACGTCGGTCAGTTTTCGCAGCTTTGCCAGTTGCCTGAACATGTTATCAAGACCGCCAAAACTTCCTTTGCCGTCAGTAAAATCGAGCTGTACCCCGAGTTTCTGGCGGGCCATAACTTTATTAACGTCCCTGATTTTCTTAACGCTTAATCCGGACTGGATAACTTTTCGCAGGGCATTACCTGCCGACTCCCCGTTCATCCCCATCTGATCCATCATGACGCTGATGGGGGCAAGGCTCTGTGCAGCCTGAAGACCATCCTTGTTCACCATCTTCAGAACAGAACTGGTTTTAGTGAAGAAGGACAACATGTTGGTATCGTCAACGCCCAGATAAAACGCCTTCTGGATAGTGTCGAACAGCCCCATCATGTCTTCTGACGCCGTTCCGGTAGCATCCTGCATCTTTGCAGCAAACTCAGCAGCCGCTTCCGGTGTTTTTTTCAGTTGTACCGCAAGATAAGCTGTCGCTTTACCCACACCACCCAGAATGTTTTCTGCCGGGATCCCCTGACGCACCAGCATCTGCATCATGTTCTGGAAATCAGCCGTTGTACCAGGTAGCTGGTTACCCAGGCCAATAGCCAGTTTATTGATGTCCTGAAAGCTCTTTCCAACCTCGCCGTTCGCATCCATCATGGCGACTTTCAGCCCGGTGGCGGCGTTTTCCTGATCGGCATAAGATTTCAGGGAAAGCGTCAGACCCGCTGCCAGTCCGCCCCCAAGCGCCAGCCCACCCTGTGACGCTTCTTCCGCCTGGCGTTTAAATCCCCGGATTTTCTTTTGCATTTTCGACAGCGCGGGAGAAAGCCTGTCGACACCGGTGATCAACGCCTTAAGCTCAAATTCAGCCATGTGTGCGTTTCTCCTGCTCTATCCTGTTTGCCTGACTGACCAGTAAGGGAATTTCACTGATCGGCATATTCAGCAATTCGAAAGGATTAATGCGCCAGTAGCTGGCGCAGTCAAAGAAGCGATCAGTGAGGTATTCAGCCGTCAGGCCTGGAGGAAAAAACCAGCCACAAGCCACGCCGCCGCATTCAGGTCTGCCGGAGACATCTGGTCGACAGAGCTTTGCGGCACTTTCGCCAGCCGCACAATGTATTTCGACACCACATGCGCCAGAAGTTTGACTGACTCATCCTGATTCATCTGGTAGGGATACCCCAGCTCGCGGACATCCTTCCCGGTGGGCTCATCAAACTCCAGTACGGAGAGTGTCTCGCCATGAGCAGTAATCGGTTTCTTTAACTCAAGCTCTTTCATTACTGGTAATCCCCTTCTTCACCGTGGAACTCAAGATCAACCGTGCCTTCTTCGGCATTATGGTTCGCTTCGCCGTGCAGCCAGGCAGACGACAGTACATAGACCTGACCGTTCGCCAGCTCGGCAGTGATAGTCATCTCATCAGACGAGGTGATTTTGTTCACCGGAAAATTCTTCGGCACCTTGAAAGTCCCTTTGACATAAGGCGCACGGTGAGTTTCCTTGCGGTCCACTGAACCGTCCAGGCCGATGATGTCATCATTGACCGTCCTGTTCATGGGCACCTCAATGCCGCCGGTCAGCGATAGCTGTTGACCGTCAATTTTGAAATAACAGGTTCCCCCGATACGGGCCATTATGCAGACTCCTCTGAATACTGAAGACGGAACTGGTTAACCACGGCAAAGACACGCAACTGGTTAACATAGTCAGGCGGGAACAGCGTGTTCAGGCGGTTCGGATCGCTGGCATCACGCTCCACAACCAGGTACTGCTTAAACAGTTCGTAGTTTTCCACGATCCCCGCACGCTCAAGCTGACGGTAGGTTGCCAGCAGTTCCCCTTTGATCACCGCCGGTGTGACAATCGCCTGACCGGGACCAAAGCGGGTACCGTCGCTGGCAAGCTTGTGACGCCCGTACTTACTGGTAATGACGGATTTCAGTTTGCGCAGCACATACGCGCTGGTATGCAGCGTCTCGCTGTCGAGGTAGCTGTTATCCGCAACACCGTAAGCGTTTTTCCTGTACGTGGTGACATCACGCTGAATGCGCAGCACCCCGCTTTCGACATACGCCGTTGCCACGCCATAAGACAGCAGGGTCTGTTGTTCGGTCATCGTGAACCGTTTCCCCTTCGGCGCAGGCAGCATACCCACCAGCTCACCGGTCTGCGTGGGACGTGCCGGATCGTTGCGAATAAACACCGCTGCGCGGGCGGTACGGCTTGCCGCCAGCTCGTCGGCAGGCGTCTGGGTCTCTTTTTCGTATCCCGCCAGGGTAATGTGCTGCTGGTTAAACTGGTCACCTGCGGTCACCAGTTCTGACAGCGTGCCGATCTTTGCCGTATACACATGACCATACAGCTGACGCGCATAGCTCCAGCGACCGCTGGTATCGTTCATCTCGGTCACCAGCGTGTTAACGGAGGCTGTGTCGTTGAACGGCAGGCCGATATAATCAAACGGCTCATCCGCCATTGCAGCCACCGCGCCGGTGAGAACAGGAGCGCCCGTTCCGGCGGTACCCGTCGCCACGGCAATCTGTACGCCCGCTGGCAGCACTTCGCCCCCACCAAAGCCGTAGTAATTGAGGCTGACAGGAATTTCATTCCCGCAAAGCCCCTTATGACGCGCGGTCAGTGTGACCACGCCTGCCGAAGATGAGGCAGTAAACGGCAGGGTCGGAACGGCATTGATGGCATCTTTGATACTGCTGGCAATCGTCGCGACGTTATCGCCGTTGGTCACCGGTGCCTGCACGCGGGTACGTCCCACATAAACATTCACCGTGCCGGTTTCGGTTGCCGCGCCGGTCACCGTCAGCGTAACCGTTGCCGCCGCGCCCGTGGCTTCCGGAACGGCAATCACATACAGTTCACCAAACGGGTCGGTCTGGCGATAAGCCTCGACCATACGCGCCAGCTGACTTCCCGCACCACAAATCTGGCGTGCATAGTCTGCCGATGGCATCAGCACCAGACTGTTGGCAACAATCTCTGCACCGTTATTGGCATGACCAATCAGCAGCGATGCTCCGCTGTCCTGTGCAGTATTCGCCGCCGAGTTATCCATTTCCGCATAAAAAATCGGAACCAGCGTATTCGACGGAATGGTGTTAAAGCTTATCGTCATCGGTATTCACCTTTTTATTCACGCGCCGGATATCACCCGCTGCTTCACGGCGCAGCCAGTAGTTGTTCTCATCAACATTTCGCCCCTCGGCGGGCAAAAGGTCGCCGCGGGCAGGGTCAGGAACTGACCGCCCTTTAACAGGTTTCACAAACATGAAGATTCTCAGGAAGGAAGGGTTATTTCGGTGTGATGTTCGATATCGCCGTCAGGCCCGTTACCGGGATCGAGATAATCAACATCAATCGCCAGCGTTCGCAGTTCATCCAGACTGTTCAGGTCATCCTGCTGGCGGGTATCGTCTTCGGTCAGCTCGCTGATGACCGAAAAATCGAACTGATAAATCAGCTCATGACGATTCAGATCCAGCAGCGTGCCGCCGTCATAGGTAATCGGGTTACCGCACGCTTCCGGGTTCCAGCCCAGCAGAGCCTTAAAGAGCATCTGCCGGACATCGTCCACCACATCATACGAGGCAAACTGACCGCGCTCATCACGCCCGTTACTCAGTATGACAACCACGGAGAAGCCCTCTTTCAGCTCCTGCCAGTAGTCGGTCTGGCTTTTGTTTTCTCCCGGAGAGTCATCACCCGGTACCACATACGCCGCCGGGAGTCTCAGCTTTCCGACCTCCGGCAGATTTTTGAACTGTGCCGCGCCTGCCACCCGTTTTTCAAAATACGGGCAGCGGGCACGCAGCGCAGCAATAACAGGCGTCAGTTTCATCTGTGTCGTCGCTCCGGCTTCAGTGATTTACGCAATTCCCGCGCCAGAAAATAGCGTGTCCAGCTGCGGTTCTTTTCAAGAGTTTCCACCATAAAGTTATTACGTGGAGCCAGCCGCCAGCCGCTGCCACCGGATGCACCACGATGATGACTACGACGACGTTTTGCTCCTCCCCGGACACCAAAAAACAGAAACGCCGGATAGAAGTCACCAGAGATCATCCGGTTCCCCTTCCCGTTGCGCTGGTTAGGGGCAATGCGTGTCATAAAACCGGCTCGCTTTTTACTGGCTCTCGGCACCATGTAACCAATCGAACGAGCCAGGCGTCCGGTCTGATAACCGGGGTTTTCACCCGGTGCCGACCGCGCACGGCGCATCACCAGCCGACGGGCATCACGCATATGACGCTGCCCAATCGTGACAAACGCCCTCCGGACACGGGCGCGGTTAAAGCGCATCTCCGCGGGCTGCTGAACATCAACGTGAAAAAAGGGAGTCGCCATTGCTGCCTCCGTGACTCTGCGTAAATTCGCCCAGTTCCGTACACTCCAGCAGCAGAAAGCGCCGCGCCCCGTTCAGATCGCGCTGACGTTTCACCCGGTACACACTGTCATCACAGACCACCTCATAATCAGCAGTGATCCCCCGGCGGTAGCGAATGGTGATGTAATGGGTGATGGCATCTCCGGTCTGCGCGGTTTCCTGCCAGGTGGTGGCACTGGTCTGGATAACCTTCGCCCATGCCCGGAACGCAACCGGGTATTGAGGCTCCACGCCAAAGTTATCCGCGGGCATATCCACCCGCTGGCGGATCAGGACGCGTTTATTCAGTTCGCCGGGGTCCGGCAGAATGTAGGTTGCGCTGGTCTGCGCCTGACGAATTTTCATAGTGGTATAAGGCGATAAGGAGCAACCAACCAGTTAAAACTCATTGGCAACTCCATTTTCTCAACGTCTGTAACCGTTGAGCGGTTTTCGTAGAAATGGCTGACAAGTAGCAGGAGCGCAAGCTTCACATCATCAGATATCACAAGCCCATCAGAATCATCCGCAGGCCTGTCATCTGCGGTTGCATACAACGTACGGTTAAGGAAGTTTTCCGTCCGACTCTGAGCGGCCTTCCCAAGCAGTTCAAGCAACTCATCTTCATCAGAGAAATCATCATCCAGACGAAGCTGAAGCTTAATCTCTTCCATTTTTAACAGCATAAAACCTCCTGTGCCCGCCAGAACGCGGGCACAAAAAAACCGCATTACGCGGCGTGCTGTATTACGTAAAAAGACTAATCAACCACCAACGCTACCTTTCCCCACCAGCGCTTTAATGGCAGAGGTGTCTTCCAGGATACAGTCAAAACGATGGAAGGCCAGAAAACCGGTCTGATCATATTCCGCGTAACGCTCAACCAGACGTTTAAGAATCATGTATCGCACACGACGGATAATGAAGCGATCAAAGTCACCACAGAACATGAATTTTTTACCCGCCCCGATATCATCAATTTCCTGATCAATGACATACGGTACATTCAACACTGAAGCAGGTGCCACACCAACAATATCCGGCAACCATAAAGGGCGTCCCTGACCGTCTTCCATCTCACTGATCAGTTTCAGCGTATTATCGTTAAACGCCAGGCGGAATTTCGGTCCGCGACGATATGCAGGATCAATGCTGTGTTTCAGAGCCAGAATTTCCTGCCATTTCACCGCATTTGCCGCGGCAGTCTGTGTTGTGCCGGTCACTGATGCTGCCAGCCCTTTGGGTTGTTTAGGCGTACCAGCACCAGTTCCCTGAATCAGATAACGGGCTTCACCACGACCAATACGTTCAGCAATGCGACGGGCAAGATAAGCTTCCATATCGATCGCGCTGTCCTGCAACAACTCATTAGACACACGAATGATTTTCGATGTCATTTTGAGCGCCCCAAGGCTTCCCATACCGAAATCGGTGTCTTCTTCACCGGCTTCTTCATTTTCGCCCAGCAGAACACCAACTTCGGAAGTACCATCAGCTGTTGCCCACTCCATAGTGCGACCGTCAGAAGTGGTCAGAATCTGCGCCACACTGGCGATGCCACCGTAGGATTTCATCTTCTCAACAACTTTCGCCAGGAATGTTTCTGGTACGGTATATCCGCCCTTTTCATCCTGAGCTACACCCTGAGCACGAAGTTCACGCAACGCCTTTCGTTCTTCTGATGTCAGCTCACTGGCACCGTGACGCATCCACTTATCAAAAACCTGAGCTCGTTTCTCATCCTGTTGCGGATTGTTTTCCGGATCAAGATTCTGACGCTGCTCTTCCTCATTGCTTTCAATGTACGCCTGATCCTGACGACGCAGTTCTTCTTCGCGTGCAATTCGTTCATCAAGCGCTTCCAGTTCGGATTTTGCTTTGTTCCACTCAGTGCGCTGCTCTTCCGTCCATGCGTTATCACCAATTTTTTCATTCAGGGCGCGCATGTCAGTTGCGATAGTATTACGTTTCTGTTTCAGTTCATGCAGTTTCATGATGTTTCCTTTACGCGTTAAGAAGGGTCAGGACGCGTTCACGCGCCATACGTTGATTAATGGCTTTCTGTAGCGCGCCGCTGTTGCGCGCCTCCTGCCATGCTTTCATGGAGCGAACAGCCGAGTCAGCCTCCTGATAGGCAGGATATGTCACAGGACTGACATCCAGCAGACGGGAAAAGCGGGTTATCTCGCGAATAACAACCCCATCCTCATCCTGATACCACTCCTCACCGTCACGGGCGACACGGAAAGCGAAAGATGACTGGTTAATATCTCCACGTTGCATCGGGGCCAGCACCAGATCACGAATGGTCTGTGTCTCCGGAGCCTGGATGTCATAGCGTAATCCGCGCTCATCAACTGAAAGATTCAGCGTGCCTGCTGCACTACGCCCAAGAATAAAATTAGGATCGTGATTAAACAGTGCGCGTACATCATCACCAAGCACATCGTCAAAAGCGCCGGGCCGGATGATTTCGCGGAATGAACCGAATATCAGCTCAGAACGACAGTCAAACACCGATCCATAACCGATAATGTGCGCCGGGTTATCGTCATGCCTCTCAGCACGCACCTCACCGCTGTAACAACGGATTTCACGGTCATTCATTGGTTTTTCCCTCATCATTTTTTGGGGGCTTAAAATCTCCTGCCGGGTTAGCAGCATTCACGCTTACCAGCATCTCATCCAGCCCTTCAACCGGATTCATATCCTCGAATGCGCGGGCCTCATTACGGCTCATCCATCCATCGGTAATAGCGAAGTGATAGAATTGCGCGCGCTCCTGCGGAGTTCCGCGTAAAAGCCCCGTCAGATTGAACCTGACGTAATACCCGGCGGCTAACTCAGCGCGGGTAAACAAGCGACGGTTAAGCTCCTGCTCCCAGTTCGTCACCCACGGCATCATCGTGTAGCGGACAAACTGAATCGCCTGCGCAGAAATATTGGAGAAGGTGGCTTTTTCGAGGTCATTAATCATGTGCGCAGGAATATTGAAAATACCGGCGATCATTGAACGGTTCAGCTTCATCATGTCAATGATCTGAGCGTCAACTGGCGACACAGTCAGTGCCTTGTAATCCAGATCGGCTGGCAGCAGCATGGTTTTGTTTTCCTGGCGGCGTAACGCCTGCGATGCCTTCTGCCACTGATCTTTAAGCCAGCCCCAGCTTTCCTTATTGAGTCCGCTTTTAACGGATACTATCCCCGCCGGACGGGCATTACCGCTGAAGAAGCTTTCTGTGTACTTCTGACCGCTCATCCCCATGCCTATTGTTTCGGCATGTTGCATAATCGGACTCAGCCCCATCTTCTGATTATTACCCAGCGCACGGATGTGGATCATATCGTCCGGACTGATCGCAAACGCCCCATATTCGTTGTACAAACCGTAGGTATATCGGCCACCAGTATTCATCAGCGTCGTTTCCCACGGCATACAGCAATCCAGGGATATGACTTCACCGCGACGATTACGTTTCACCCAGGTATACCCATTCCCCCAGCCAAGGATGTGACGTTGCTTCAGTTCGCGCCATTTGTAGCTGGTTTGCCAGGTATTGGGCTCATCATGAACCAGATAAAACGCCGGATGATCGCGTGCGGGTTCAACCTTCCCCTTGTGCCTGCGCATAACATGCAACGGCATCTGGGCAAGGCTGGAAGACAGGACATAGATACAGGAATACACCGCAGCCAGTTTCATCGCAGTCTCAGGACTGACATAAACGTCTGCCCGGAACAGCCCATCAGTATCAACGGCATCACCGGTTATCGGGGTGGAAGGATTCTCCAGTGATTTACTTCTGAACAGAGCATCAAGCAGCACGCGTCCCCCTTCTGGCCATAGCCAGTGCGCCCACCAGCAGTAAAGCACCGGACAAAATCAGAGCCGGAGCCATACCAAACTGCAGGTAAACCCCGCACGTAAGCAGGCCAAAACCAGCCAGCCCGATAACATCAGCAATTAGTGATTTCATAGAATTAAGAGATCATCGTCCGGATCAAGAGATGAGAGGAAATCGTCAGGTTCTTTGAGCATTGCCCGACCGATCGTCATAATCAGTGCAACCGCACCATCGATTTTGTTTTCCGCCTGCTCCTTGACGGGCTTCACTAAATCATCGTTACCAGGCATGTTTTTGCCGACCACATTGCCGATACACCAGGTCATGATGGGATTGCCGTCATGATGAAAGCGCCCCGATTCAATCGCTGCTTCCAGCTCTTTCATCGGGTCGGACATATTGGCGAAGTTCTGGACGATAGTAACGGGATTCAGGTCTTCATCAGCAAGGTCATGTGACAGCCCGGTCGCCCCGAAAGGGTCGATGGGTGACTCACTGACCGGGCTGATTTTGTTCGCCGCTTTGGCCTCTTCGAGGATGTAGCGATAATCCACCTCTGCACCATCGGTAACGGTCAGAACGCCCATTTCCACCCATTTCTGAAAGCGTTCGGCTGTCCGGCGATCTTCATTTTTCTCGACGCTGTACACCGTGTCATACGGTACCCAGAAACGCGGGGCCACACTGTAGTAATGCGTTTTACCGTCAATCTCGCGGGTATAAAGTCGCGCCATGCTGTTCATATCCAGCTTACGCGCCAGGTCAAAGGCCAGAATGCACGGCTGCCCCTCGAACTGCTCAAGAGTCAGTGATTTATCCTCGCAGCTCTGCCAGCTCACCAGGTTGAAATACGCCGAACGCGCCGACACCCAGATATTGAGGTGTTTTGTTTTAAAGACGTTTGCCAGACGGGCGTTATTTTTCGCACGTTGCTGCTGGCTTAACAAAAACTCGCGATAAACCGACACACCGATATTCGGGTTAGCTTTTTCAAGTACCTGCGGGTCGGTCCAGTCGTCGCCTTCGTCAACGGTATAGATGATCCCGAACAGTTCATCGTTGGGCACCGAGCCGTTGAGCATCTCGATGACTTCCCGCCGTTTGTCGTAGCACGGCCCCTCAATGTTGTACCCGGCGGTGGTAATGGCCCACATCAGTGGCTGGCGTCGCGCCCCCATCCCGGTAAGCATCGTGGTGTAAAGCGCATCTGTGGCGTGCTCGTGATATTCATCCACCACGGCACAGTGGGGTGATGAACCATCACCGGGGTTACCGATCAGCGGTTCAAACCGCGCACCATCCTCCGGACGGTTCATGTTTGAGGCGTTAACCTCAATCCCGAACGCTTCCGTCAGCATGGGTGTGCGTTTACACATCAGTCGTGCCGGACGAAAGACTTCCCATGCCTGTTTCTCCGTCGTGGCACCGGAATACACTTCCGCGCCGAACTCGTTATCACAGGCAAAACAATACAGGGCGACACCGGCAGAGATTGCCGATTTGCCGTTCTTACGGGGGATTTCGGTATACACCTCCCGGAAGCGGCGCAGCCGGGAGCCTTTATTGACCCAGCCAAACGCGCAGCAGATCACAAAGAGCTGCCACGGCTCCAGCGTGATGGGCATCCTCTTGAATGCCCACTCACCCTTGGTGTGCGGCAACAGCTGAATAAATTTGGCGGCCCGTTCAGCCAGGTCCTTGTCGAAGCGGTAACGAAACGACTTACTTTTTTCCGCCATCAGGTCATCAAGATGGCGCTGGCAGGCCTGAATCACAAACTGGCAGGCCACAATCTTTCCGCGCACGACATCACGGGCATACTGATTGGCAGCATTTACGTTGGGGTAAGATTTCCGGCTCATGATTCGATGATTTTCAGATTGTCAGAAACGGGTTAGTGGCTTTCTTCTTCCCCGCCAGGCCAATCAGACGCTGGCGGCTGCTGGGGTCGAGTCCGAGCATTGCCCCCGTGCTGCTCATCTCGGACTCCTGTTCTTTCTTGGCGGTCAGCTCCGGATTTTTGACCCTGCCGCCCATTGCACCGGTGATGGTGTTGCCCTGGCTGGCAATATTTTTCACGGCACGTCGCCAGAACTCATAGGCTACGCACCACCGCTCAAGCACCGCGAGGTCAGTCACGCACAGCAGGCCCTGACCGCAGAGTTCTTTGGTTGTCAGTTGCCACATGATCGTGGCGAGAGGGAGATTTTCTTCTGCGAACCACTCCGGTGGCTCAACACCTTTGATGGGCGTAAAAACAGGTTCATCTTTATTCAGGGCTCGCTTGCCGGGGTTTCCGGCCAGCGCCTTGCGCGCCGTTGGCTTGGGGCGACGCCCGGAACGCCCCGCCGTTCCAGCCATATGCGGCACTCCTGGTTAAATTTCATTTTTCGCGGGTATAAAAAAACGATGGGGCGGGCAGTCCGGAAGACGTCAGGTCACAGAGATTTGACCCGCCCCTCCCCTCAGACAGTTGAGAATTATTATCACTTAAGTCGTTCACGGGCCGTCTTCGCCTTATGACACGGCCAGCACAGACTCTGCAGATTACAGTCGGCATCAGTGCCGCCATGCGCTTTAGGGATGATGTGGTCAACGGTTTTCGCCTCACGCACCACACCAGCACGCAGACATAACTGACACAGGCCTTTGTCACGCTTCAGGACACCCGCGCGGATACTGTCCCACTTCGAACCATAACCGCGCTGATGACGGGATTGTCCTGGCTTGTATTGCTTCCAGCCCTCGCTTTTGTGGCTTTCGCAATAGCCTGACGGATCAGTCGTGGTATTGCGGCAACCGCGAAAACGGCAGGCTTTTGGGGTTCGTGGTGGCATTTATATCCCCTCTTTGGTGCACGCTTACAATGCGTAAAAAAGCCTCGCATTAGCGAGGCTCGTTTATATCTTGAAGGTGAATACTTATTGTCTTATCTATCCACGGGAAACATTAAGATTATTACACCCGTTAGTTGGGAAATAAAACAAAATGCAGGTGGTTTATTTATTCTTTGCTGATTGCTTTCTGAATGGCATCGGCTAATGGTTCAATTAAGCCAATAGCTGCTTTTAATTCCTGTTCAAATTTTGCACTTGTAGCCTGACCACCTGCAGAAGAAAGTGAGGCCTTAACTAATTCAAGGGCAGCTTGAACAGCTATAACCCTCTGTTTTTGAGCTTGAGTTACAGGCTGAGCACCTATTCCTGAAGGGAAATAATTCTCTAACATTACAACCTCCGTATTAAAAAGTGAGGTTACACATTACCCTTAAGATTACTCTTAGTGAAGAGGTATCTCATAATTATCACCCTTACCAATGACGCTTGATGAAATTTGTAATGGACTGGCTCTTATTTCAACGCAACCACTTACCGCGCGCCAGATGCTTAACCTCAAACATTAGCAATGAGATGTTTAATCTGAATCCACTCCAGAAGTAATCACCACTCTGTCTACAGGGCCTGATGTGAAGGATGATGAGTAAAATTATCGCTATCATCGAAGGCATTGCGTCCTGATGTACTCCTGCAGGTAGTTAACCTGTGCGGTTATCCTGTCGATTCCACTTCGGAGACGGTAATAATTGAGTTCAGCATCTGCTGTAAGTCCTGGGCTTTCTCCATCGCCCATGCCGCTGGCTCTGGTCGTTGATTTTGCACAGGTGGCGGCGACTTGCAGGCGCTTACGCCCAGAAGAAACATCAGCACGGAGACTTTCGATAGTCGCGTTAGCATCAGCAAGCTCCTTTGTGTATCTGGCGTCAAGTTCTGCTACGTCACGTTGACGCTTCTGCATATCAGAGATAGTCGCCATAGCCGAATCTAATGCCATAGCGTTTTCGTCGCGCTGTTTTTTGTATTCAATGGCTTTATTGTGGTAATGATTCGCTGACCAGACAAGACCACCAGCAATACAAGCAATAAACGTTAAAATGAGCGCCCAATAACTCATCTTCATACCAGCAGCGCCTCCCGCGCCTTGTTGTATCGAATCTTACGATCCTCAATACCGTTCAGACCGCCGTTAATGATGCGCGTAACACGAGTAATATCGGCACCGTAGACCATGCAGCCTTTAGATGTGTAGAACCATGCAGCTGAGCGCGCGGCCTGTAGCTCCTGCTCCAGTTGTTCAGGTGAAGTCACCAGATCTAACTTCAGCGCCGCGCCACAGATGCGATAATTATGGAGGCCAGTGATTTGAATTAATCCTCTACCGCGATATTTCCAGCCATCACCGGGTGCTTTGTTACCCAGCCGGTTGCTATACACCAGATTGGCAATAGCATCCTGACGAGCTGCATGTCCGGATGTTCTGCCAAGGGCATCAGCCTGCTGCTGTGTGATCCTCTTTCCGAACGTCGCCACCAGCGCAGATGGTGTGTAGTTAAAATTTTCAACTAAGGCGCGAAACCCCATCGACTCATGGCCTACCTGAGCGATAAACATCGCCTGATCCGCTGGTGCTGTAATGCCGAATTCCTTCATCGCCGCATCAATGTGCGGAAACCAGCGCGCAGCCAGCCCGGCGCTAATACCAGCCGCCTTTTGAAATAATTGTTGGTTCATTAGTGCCTCAGATGATCAACCAGACGTGCAACGTTGCCTCTGACGGCCACCAGCACGGAAAGAAAAATAGTGTTCGCCACGATAATGGGCCATGAGGAATGGGGATAAATCCCACAGAGATAGGCCAACGGAACAGCACTGTATGTAACAGTAATCAGCCAGGCTAAACGTGAAACCCAAGGACGATGCCGCGAATCACCACGACGATAAAACATCAGAGTAATAACAACACAAGCACATAACAGCGCATTTATAGTTGCTGTCGGGTCATTTAGCTCCACCCGAACCTCCCCGGCGCGTTATGAGCGCCACCAGCGAGCCGATATCCTGATTATTCAGGAACGTCAGGATTTTAACGGCTAAAGCAGAGACGATTACGGCACCAATAGCATCCAGAGGTTTATCACTGTATCCGGTCAAGTTCGCCAGCTTGGAGCCAACCAACCCAGAGCAAAGAATCCCGGCAATATATGACACGATAAAATATGCCAGTCGGCGCGATGCACTCAGATCTGCTGCTGTTGCTATGTAGAATACAGCCCCTGCAAATGCGCCAAATACAACGCCGTAATCAGTTCCGGTCAGCAGTCCATAAACACTGGCACCCGTCAGGGCACCACCAGCCAGCCCAGTACCGGAAATCGGATCGGACATTTAGCCCCCTCTTAATTGCTGTTGGTCCTCTCAGATATGAGGGGAAGGGATCTTAATGACAGTCTGTTTATTATTTCAGTCAAACACTACCCTGTTGATGATTTCTCAGAAGCGAACTTGACTCCCAGGGGAAACTCAACTTTCCGTTAAAACCACCAGCAGACATTCGTTCAATTTCCACAGAAATATCACTGAGCCGTTCTTCAAGCTCTGCTTTTTCTTTTACCAGACGGTTATAGCGGCTTAGATGAAGCTTTTGCTGCTCCAGCCAGTCTTCAAGCTGTTCAACAGTCATACCAGGGTTAAAAAAATATGGCTGCTGCTTTTCGCCCTGCATTATTGACCTCCAGAAAAGCAAAAACCCCGCCGAAGCGAGGTTTGTTATGATTTCGTTAACGGCAGACATACAAAGCCCATCGTTAGGAAAATCCTAACCAGATTTTTTGAAAAATGCAAGAATCATGTCGCTATCTTCGGCGAAAATCATTTATCTCGTCACTTTTCTTAATTGCGCCTCAGCATATGCTTCTTCCTGCCAGCACTTTGTCACCAGTTTATCAATGACATCTGCATATCCTTTGTACCACTGATAATCCGTCAGGTCTGGTACCAGCTTCTGGACATGATGCCGCGCCAGTGTGGTTGGTAAACGGCTAAACCGGTTTCCATTGCAACGCCCACAAATCTTATAAACAGGCGTGCCATGAAGCCGGGTCCTTTTTTCATCCAGGACAATACCTTTACCCTTACACCCTCTGCACGCTGTGCTGACTTCTCCCTTACCATGACAATGCTGACATAGTTCCTTCACCCACTCTTCCTTGATAACAGATTCACCGCTTCTGGAGTGTTTCACCACTTCGCGCAATACATTATGAAATCCAGTACCAGCACAATGCTCACAGCGAGCCTTACTTGCCGCAGACCTGGAATAATCAGCAAAGGCAAAATTCACAAGGTAAGGGATGAGCTGTAACCGGGTTTCTTCACTCAATTTATTCAATGTCGGATTATCCAGTGCCATCGCGTAATTGAGCAGACCTTCAATCGCAAACTGAGGATCCTGAACACCAACTTTTGCCAGGAATAAGGCAAACCCAAGCGGTGCTTTCGACTGCACCATCCCCTGCGCAGCCATCACATCCGTAATTGTTAAACCACCAGAGCCTGTCGCCGGTGCGTCATCGCTCAGTTTTGGAGATTTCGGGGAGTAATATTTCGGTAAGGCTTCAAGGTTCATGCTCGTTCTCCACTTACGCCAGTACGCCTATTGCCAGCGCACGATCGATAAAACGAAATATCAGCTCCAGCTGGGAGCCATACTTCTCTTCAAATGCCACGGTATCCGCATGCAGCTCGTCGTGATGCTTTCTGCACAAAGGCAACACAAAAAGGTCATGCGCTTTTGTTCCCATTCCACCCTGACCGTGACCTATCAGGTGGTGGGGATCATCAGCGGGCTTTCCACAACATGCACACGGCTGTGTCTTAACCCAGCGCGTGTACTTTTCATTAACCCAGCGGCGACGTTTTGGGCGTAACATAAAAGACTCCGGCGACTCCGGATCCACTTTCAGCGCCAGCACCTTTTTCGCCTTATCCTGGATGATGCTGGTGGCAGGAACCGAAGGCACAAGGTCACTTTCCCGGGTAACAGACGGCACAACAGGCTTCGGTAATCTCAGTGCCTTACGGGCTGCACTTTCCGGTAAGGCATCCGCCAGATCATTACGAATCAGCCACCAGCACAGTTCCGGCATTGTCACAACGTGACTGTCATCAAAACCGAGATCCCGACGCACAACAGACAACACCCAGCGGGCACAGTTATCCGTTGCCATTGATTCCAGCCGTTCCGTGAACTGATCGCGCAGTTGGTTATCGCAGTGCCAGCACAGACGGATTGCACCCGGAGCGTGTCGCATTGTGGTCATGTTCTCGCTGTGCCAGTCGGAATGAGGCCACTGGCAGCCTTTTTCACGAAGTAACCAGCTTTCAAGACATTCCACGCCACCAGCACGACGGATCACTGCCTCATTGCGGAACACGGCCCGAACGGCAGGATCATCCGCCAGCGGTTGTGATGCCGCCGGAACGGCACCACTGGCGAAAGATGAATAACGCTCCGGCTCAGGCTCCAGCAGGACACGCCCCTGCATAAACAGGGGCATCAGCTCTGAACCTGGCCTGAACAATACGATCCCCATACGCGGGGCAATTTCAGGGGTCAGTAGTGCTCTCACGGTCACCTCAATGAACGGTATCGAGCAGCTTTAACAGCTCAGGGAATCGGGATTCGAAGAAATGCGGCTGCGTCTCGCGCGGATTTGCGGGACTGGTGATGTTCTTGCCGAACATGCAGCCTTTCGCCGTCAGCGACCAGAATTTTTTGATGTTGTTAATCCCGGTACGGCTGTATCGTTCGCGCTGCTCGACGATCCCCAGCTTCACCATCTGGTGATATGCCTGATTAGCCGTCAGGCGGATACCATACTGTTTCAGCAGTGCACTCAGTGACAGTGTCGGGCGACTTGAGCCATCGTGTGCATCAGCAGGAGCATCAATGGCATAGCGCGGTGCCAGATTCGGTAAGCCAACAGCCTCCTGGAGTTTCTGACAGGCACCAAGCACTGAAGAGTTAGACAGGTTTAATTCCCGGCGCATAAAGTCCAGCAGAATCACACCAGCCTGCATCTTGTCAGCAGCCTGTCCGGATAATTTTTCCGGTGCGCTGGTTACCATGTCGAAAGTACGGATCACCTTCAGATGGAATGACGGGCTGATCCACATTGCATAGGCATACACCAGTTCCTTGCAGACATACGTTCCCCGTTCATTTCCCCCATGAATCACACTCACCGGGTCAACACCCAAATTCTGGGTGTTGGTCAATTCATGAACAAGCTCAACAGTTTGTTGGCTGGAAAGAAACTTTCCCGGCTCCTTGGTTCTGGCATTTGCACCAGATGCTACTGCTGCGCGATGCAGATCGTTCAGGCTGTAACGCCCATAAGCATCACGACGAACTTCAATACCATCAATGACCATCAGATTATTCATACTTCGTTTCTCCTCTTAATCAGGCAGCTGCACCCGCCGTTTTCTCGTACTTACTGATAGTGATCTCGACCTTCCCTTCCGGGATAACCGGTCCCCACTCCACCAGCATTCTTTTCACCTGACTGTCGTCTTCCCACACCCCCGCGTGGGTCAGGGCGTCAAACAGCGCCTTGTTATAGTTGTCCAGATCGCGGATCCGGTTATCCGGAGGAAACAACACGATCTCCACTGAAGCAGGTGCCGACGTTGGTTTCGGCAGACGACGTAACTGCTCAACTATTGCTGCGCACGCCGCGCTCTGGAATTTTCGCCCCGCCGCGCTTATCAGGCTCTTACCAGCAAACGCCCCTTTGTTGGGGTGTCGCCAGTACGTGTTCACGCTGGGCGGAAAAGGCAGGATCAGCTTCATACTTTCAGGCCTCTCTCATGTAACCAGTGGGCTGCACGCAGCCTTGCGTTTTCCTCACCGGCAAGCAGTGAGCGGATAATCCCGACCGCCTCGCTGTCGTCGTCCTTCACCGCGGTATGAAGCGTGATGCCCCGGGCCACGCCACGCTTTATCGTGATGACGCCTTTTTTCTCCAGTGCGCGAAGATGCTCCACCGCTGCATTCACTGAACGGTATCCCAGCATGGTTGCCACCTCCTGATTGGTTGGCGGGAAGCCACGTTCTTTCTGATAAGAAATCAGCATATCCAGCACCTGCTGCTGGCATTGAGTTAACGTCGTCATGCCGCCATCTCCCTGACCAGTTTTTCCGCCTGCTGGCGAACCTGCGCCAGAAACGCCTCACCACATGCCTCAAGTTCATCGCGCCCGATGTAGCTGATTGCCTGTCCCTTCCAGGTCTTGTCAAAAACAGCAATAGCACCAGCGAAAAAAGCTCCTGTCGGTACCTGCTTCTCGTCTTTCGGGATAAACCAGACAGGCAGTTCAAAACCAATACGCCCGCGAATAAAAGCAATATGATCTGCATCTTCCGGCCACCACACTTCGCTGGTGGCAGCTTTGATCAGGAAAACATAGCGCCCGCCCTTATCACGCATGGCACTGGCATGTTTCATGATGTAACGCATGCCGGTGATGTATTGCCCCTCATGCTGACTGGCGCGGCTGTATGGAGGATTACCAAAGGCAGCACCTTTAAGCTCCACAAGGCGTTCTGACCAGTCATGCGCCAGCGCGTTGTCTTCCGCCGTGTAATACGCGGCACATTTGGCGTTATCACCGTCAGTGAACAGATCCAGAACAAACGGGCCAAACAGGGTGTTAATTCCCCAGAAAATGTTGTCCGGCGTGCGCCACTGATCGCCCACTTCCTTCAGTTCATGGGCTGGTTTGTTCCGCAGTTCCACCAGCGCCTGGCAATATTTATTACTCATTAAGCCCCCACGTAATTCCCTGACAGATACCACTCATCACCCGATACAGCGCGCTTGCTGCTTTTCCGTAAACACTGCTCACGACGCGCCAGAAAATTGTTTCGTTCTGGCTGGGAGTGGCTTTCACGGAATGCCGCCATCCACACCGTTGCAGCACGACGGTATAAGCCCCTGGACTCCAGTTCTTCCGCCTGGCGGGTCAGGCACAAAATCACCCGGGGATCGTTAGTGCCGACATAGAAATTGCGCACAGGTCTGGTTTCACGAACTGGTTGTGGTTCCGGCTCCTGCGCTCTCTCAGTCAGGCGTGGGAAATGTCTGCGTGTATCTCCTTCACAACGGTGAGCCACACGCCCACTCTGACGTAACTTGCTTGCTGACTGCAGAACGCGCTGCCGTGAGTAACCTGCAAAAGCATCCGCAATGTCTCCGGAAGTACACCCCGGATGGGCTTCAATGAATTTCTGAACGTCATTCAAAAGACTCATGATCACCCCCTGAATCCTGCCGGGATCTGGCTGTAGTCCACGTTGTCGTAACTGGCTTTGAAGTACGGGTCTTCGCGTTTTTCGGTGTACGTGCTGACGGACGGCGATAAGCGCAGGGAAAGCTCATCCCATTTTTCCCGCAGCTTCGACGGGCTGAGCACGTTACGGCACCAGAACGGATCGCGACTGACGCGGCTGTACATCTCGCAGATTTGTTTGTGAGTACGACCATCCTGCACACACATCAGGCGAATTTCGTTTGCCCAGGCTGTCCAGTTAGGTTCTTTGGGACGAACCACCTCGCCGTCACATTCGGCGGCCTGCTCGTACAGGGCGATGATTTTTTTCCAGAGCCACTGTGCGCAGGTCAAATCATCCTGCGTTCCCCACTGGCGCTTTTTAGGGCTGAATACAACCGCATCAGGATAGCGAGTTAAAAAATCCTGTTCAGCCTTCTGCGTGTCCGGTTGCGAAGCGTCCGGACGAGAAGGTTTTTTATCTGACGGATCATGTTTTGATTTTACTGACGGATCCCCGCCAGATTCTGACGGGTGAAAACCCGCTTTTTTGCCAGATTTCGACGCATCAAATTTTGACGGGTCAGATTTTGATGCGTCAGATTTTGACGGATCAGAATCTGACAGTTGAGAAAATGCCGCTGCCTGAAGCTTCGCAACGTTAAGCTGATAAACATTCGACGCATTGCGGTTACCCTGGCGACGCGCCTTACGCGTTAACCAGCCTTCTGCTTCCAGCCGTGCGATAGCCGTTCTGACGGTACTCATCCCCGCGCCAATCTGGCGGGCAATGGTTTCAATTGATGGCCAGCACACACCTTCGTCATTACTGAAATCAGCCAGGCGGGCCATAATTGCCACACTGGATAATTTCATGCCTGACGCTGCGCAACCATCCCATACATAGCCGGTTAATTTAGTGCTCATGACCGACCTCTATTTCCCTGAATTTACGACGAAACTGCTCGAGCGGGCTGAAGCACTCATGCTCATAGCCTTCGCGGAGGTAGATAACCAGTTGTGTTTCCGGCTCCCAACGAATGACTCTGACGGGCACTCCGTAGTGATCTTTGAACCAGCGGTTAACTTCTCGCAAAGGACTGTCTCCTTCTGCCGGTTGAAATCACCCACAGCCCACTCTGCAAAGCTGTGGGTTACAATTTCCCTGTCACCTGGTACATTTACTGCATAGCAATACTCCACCTTCGCTTTTCCACCCGGTACAGGAAGTGCAATCAGTTGCGAGCGACGGTAGTGTGTTGTTAAACTGTTCATGCGTTAGTTTCTCCACAACCAGAAGCAATCGACGCCACGACGCCCGGAGCTGCACACTCGCGGGCGTTACTCTTTTCCGGCGCACAAAAAACACGAAATAACAGTGTTAAATGCTCCTGCCACTTCGCCATTACTTGGTAGCTGTTCTCTTCGATTTGCTCACGCTCAGCCCGGTCAATAACTCCATCAGCAGTTGCCTTGCGTAAGTACTGGGAATGCTTGCCAATCCATTCTATTGACTCCATCAGCCGCTGATTAATGTCACCATTGTCAATGTCATCAATGTCCACCAGCGGCACAAACACCCCATTACTACGACGCGCTATCGCATCCGTTACATGCCTGGTACCACTGGCATCCTGTAAAACCATGGCCCACTCAAGTGGAAAAATTTGATCCCCACCGCTACGCAGTCTGTTATGCAATTGATCTTTCGCTGGGGTGATATCATCAGATTTATACAAACCAAGAATTTCCGCTGCTTCCTCATAGCCATGAGGTAAATCAGCAATCGTTCTTCGTATTGCTGCCACCAGCCATGCTGGTTGCTTATCAACTTTCCATTCAGGTTCTTTACCCACGGTTAATTCCTCGTTTCTGTGGTTACGTTTACGCAGTTGAACCGCTAACTTTTGAATAGCACTCAGGTAATCCATCATTTGGATTGGGGTAAATATCAGGACGCAGTTCATGAGGGGTCACGGACCAGTTTCCCAACTCACAAAGTTGTAAAACCCGTTCTGACGGGACTTGATTGTTAATTACCCAATTGGCGACGGATTGAGTGGACTTAAAACCAAAGCGACGGGCTACTTCAGATAAAGATTTTCCCGCAGCCTTTACTGCTTTCTCTGTGTAGTTTTGAGATGACATACCTTTCTCCTCTGAAATTCATAGGGATGATGCTACTTAAAATAGCAAATTGCAACTACTTAAAATAGAAATGACTAGCGCGTGCGATGTGAGTAATCTTCTACCTATGGTAGAAGAACAGAAGTATCCAGATTTCGCCAAGAGACTAAACGAATTGATGACAATCAAAGGAATCTCTGTCACTCAACTCAAAAGTCTTGTGGGCGTTACATATGAAATGGCTCGTCGATACACAATCGGCGCAGCGAAGCCACGTGTCTCTGTCATGAGTAAACTTGCGTTGGCTCTTGGAGTATCAGCTTCATATCTAGAATATGGTGTTGGAGATAGAGAGGAATGTAAGGAAATGGCAAGCATCCCCAATCCAACAAAGCCCGATGTATACAGGATAGAAGTTTTGGATCTTAGCGTTAGCGCAGGACCTGGGACCTATATGCTTTCAGACTATGTTGATGTGCTCTACGCCATTGAGTTCACAACTGAACATGCCCGTTCTCTTTTCGGAAACCGTTCTCAGAATGATATAAAAGTTATGACGGTAAATGGAGATAGTATGTCCCCAACTCTTGTTTCCGGGGATCGATTGTTTGTCGACATTTCCGTTCGCCACTTCCAGACTGATGGAGTTTACTCTTTCGTTTACGGTAAGACTTTCCATGTTAAACGTCTACAAATGCAAGGCAACAAACTAGCTGTTCTTTCGGATAACCCCGCCTATGAGAAATGGTACATTGATGAGAAGTCGCAAGATCAGCTTTATGTTATGGGTAAGGCACTGATTCATGAGTCGATTAAATATAATCGACTTTAATTAGGCATCTAGCAATCGTTAAAGCAAACGAGCACAGATGATAAAAGATAAGGATAGAAAATCTTTATACAAATTTTATTTGGCACATTGGTCCATAAAAATGGATTAACACTGAGCTGTTACCCTAAGTAATAAAAGATAAAGACCATATGTTATCTTTGGGTGCGTTACTAACACAAAATAAAACATAAATAATAGACAATTCTATACAACAGGATACGATAATGATTAATGAACGTACTGAAGCGACAGATGGGGTAGCAGATATGATTTCCACCAATACAAAATATTTAGTATGGAACAACAAAGGTGGTGTAGGAAAAACTTTTCTTACATATAATCTTGCCGTTGAGTTTGCTATATCTCATCCGGATCAAGATGTTGTGGTTATTGACTCATGCCCTCAATCAAACGTTTCAGAAATTATTCTTGGTGGCAATGGTACCGGGGAAGAAAATCTAAATAAATTGCGAGACAGAAATGTTACAATCGCAGGTTATATCAAGGAGCGTTTTAGCAAATCTCCTTTGTCTCGTTTAGGAAATGAATCTTCTTACTTTGTACGAGCCCATGATGTTAATGCAAAAATGCCAGAGAACTTATATATTCTTCCTGGTGATGTCGATCTTGATATCTGTTCACGCTTAATATCTCACATTGGCTCATCCCCAGTAAAAGAAGCATGGAAGAAAAGCCGATCTTTGCTGGTAGATCTAATAGCATCTTTTGAAGCCGATAAAAACATCTCTGACAGAGCAAAAACATTTTTTATTGATTGTAATCCAAGTTTTGCCAGCTACACAGAATTGGGAGTAGTCGCGGCAAATAGAATAATTATCCCTTGCACTGCCGATGCTGCATCAATTCGCGGAATAAAAAACCTTGTTAAACTTATTTATGGAGTGTCTATTGACAAGTCAGAACAAGATGAAATGTTCTTAGATTTCAACAAAGAAGCAAAGCAAAACCTTATCGAACTACCTGAACTACACCTTTTCGTACAAAACCGCTCAAGAACTAATGAAAGTGATGCAGCAAAAGCATTCAAATCACATGCAGAAGAGATCAAAAGAATCACGGATGACCTGTTAAATACACATCCTCATCTGTTCACAAATGTGGCTACTTTCGAGAGAGTTCAAAATGTCAAAGATGGTAATACTCTTGCAGCAATAATAAACCATGAGGGATGCCCTTTAAGTAGGCTGCAGCATAAGAGTTACACTATCTATGGTATGGCGACCCAAGCTAATAGAGCACAAATTGAAGCACTAGAATCTGATGTTTCAACAGTAGTCAAGTGTCTGTAATACAATAACCATGATTTAGAACGAACATAATTCTCGCCCGACCAACCGGTCGGGTTTTATACTCGTAGCCTGCTTAATAACACTGTTCCTGACCCGTTGTATCCATAGTTCCCACTATTTTCCCCCAAATTATTACTATCTAATTAACCTAATAATTAAACCAAAACAAAAGACACTCGATATTTTCTATACGTAACTACACCACATCACCCCATCTGTGAAAAACTTTTTAATTCATAACGTTATATAGAAAACTAAAAATTAAATACACACTTCTACTTTTTGTTGTTGATTTTTGCTACTTTTAGTAGCATCATTATCTGAACAAACAACAGACTGTTGTTAACGAAAGGTTGGTTGTAACACGGCGTATGGCACATGCGTCGTTAGCGGTCTGGTGACGTTAAAGGGGACAATCCACTCCTTGCTCGGGCAAACAAACCAGGTAGCCGGAATGTGCAAGTCAATGATGATGCTGATAAGACGCCTAACCAGCGTGGCGATTCGGTTTGACGCCTGGGAAGAGACCAGGGTGCAACGATGAGGGCATTTATGGAACCGCGACAAAGTGTGGTGCCGTAACTGGCTAAGTGCTCTCAGCGTTGTGGTGAATGCGCAGGCTGATGCGCGAAAGACATTGCAGCTATTGCGGAAAAGAGCTGTTCGGCGGGGCAATTAAACGCCCGTGAGAGTCTGAAATAACCGCAAGCCGGAGATCAGCACCGGTCACCACAACAGCCACTGCTTTGGCGGTACCAGTTTGTACACTTGCTTCCGGCTGGTACCGCTCTTTTTACAAAACAGAGAAGAGCATCACCGGACGACGGGCTCATAACCCAATCCACCCGGGCGGCTGCCACCGCAGGTGTTCTTCTCTGTTTTGTGGAGAAACCAACCGACCTTGCAGGGTCGATATGATGAGGAGCAGCAAAATGGCTAGCGAACGCAGTACTGATGTGCAGGCATTTATCGGGGAGCTGGACGGCGGCGTATTTGAAACCAAAATCGGCGCAGTTCTCAGTGAAGTCGCTTCCGGTGTGATGAACACGAAAACCAAAGGTAAGGTCTCGCTCAACCTGGAAATCGAACCATTTGATGAGAACCGTGTGAAAATCAAACATAAACTCTCATATGTTCGCCCGACTAACCGCGGGAAAATTTCCGAAGAAGACACCACCGAAACGCCGATGTATGTCAATCGCGGTGGTCGCCTGACTATTCTGCAGGAAGACCAGGGACAATTACTGACTCTTGCCGGTGAACCTGACGGAAAACTCCGCGCAGCAGGTCATTAATATCGTTCTTAATTAACCGATTATTTATCTCATCACTGAATATCTTTATATAGTGAGGACTTATTATGTCTCAGAACTTAGACGCAACCGCAATTAATCAAATCCATGCCCTTATTTCTGCTCAGGGTGTTAATGAAATTATCAGTAAGATTGGTGCCGATGCTGTGGCATTGCCTGAGAATTTCCGCATTCATGATCTGGAAAAATTTAATTTAAATCGCTTCCGTTTCCGTGGTGCGCTTTCCACTGCCAGTATCGATGACTTTACCCGTTATTCTAAAGATCTTGCAGATGAAGGCACCCGCTGCTTTATCGATGCCGATAATATGCGTGCCGTCAGTGTGCTTAACCTGGGTACTATTGATGAACCAGGTCACGCAGATAACACCGCCACTCTCAAACTGAAAAAGACAGCACCGTTCTCTGCCCTGTTGTCTGTTAACAGCGAGCGTAACTCCCAGAAATCACTGGCAGAATGGATTGAAGACTGGGCCGACTACCTTGTGGGCTTTGATGCTAATGGTGACGCCATTCAGGCAACAAAAGCGGCGGCGGCAATCCGTAAAATCACGATTGAAGCAAACCAGACCGCTGATTTTGAAGATAATGACTTCAGCGGCAAACGCTCCCTGATGGAATCTGTCGAAGCGAAGACCAAAGACATTATGCCAGTGGCATTTGAATTTAAATGCGTTCCATTTGAAGGCCTGAAAGAACGTCCGTTTAAATTACGACTCAGCATTATCACTGGCGATCGTCCTGTACTGGTTCTGCGCATTATTCAGCTGGAAGCGGTGCAGGAAGAAATGGCTAACGAATTTCGTGATCTGCTTGTTGAGAAATTCAAAGACAGCAAAGTAGAAACCTTTATTGGTACTTTCACCGCCTGATTTCATTACTGCAAATGCCCCTGCGGGGGCATTTATGGAAACGTAATTAACTCAATAATCACCGGATGGTGAGAGCTTCCTTTTAGCAGAATTCAACGCGGTGCAGCGCATATAAAGTGGAGAACGAAATGTCATTTATTAAAACTTTTTCCGGGAAGCATTTTTATTATGACAAGATAAATAAAGACGACATCGTGATTAACGATATCGCAGTTTCCCTTTCAAATATCTGTCGCTTTGCAGGACATCTTTCACACTTCTACAGTGTCGCCCAGCATGCGGTGCTTTGCAGCCAGTTGGTGCCGCAGGAATTTGCTTTTGAAGCGTTAATGCATGATGCAACAGAAGCATATTGCCAGGACATCCCCGCACCACTGAAACGCCTTCTTCCTGACTATAAACGGATGGAAGAAAAAATAGACGCCGTAATCCGTGAGAAATACGGGTTACCTCCTGTTATGAGCACGCCAGTGAAATATGCCGATCTCATTATGCTGGCAACCGAACGCCGCGATCTCGGGCTTGATGATGGCTCTTTCTGGCCTGTACTGGAAGGTATCCCGGCGACAGAGATGTTCAAAGTTATTCCACTGTCACCAGGCCATGCCTACGGGATGTTTATGGAACGTTTTAACGAGTTATCGGAGTTACGCAAATGCGCATGAATGTTTTCGAAATGGAGGGGTTTCTTCGCGGGAAATGTGTACCGCGAGATCTGAAAGTGAACGAAACAAATGCTGAGTACCTGGTACGTAAGTTCGACGCGCTTGAAGCTAAATGCGAGACGCTGGCGGCGGAGAATGCGCGGCTGAATAAATTTATCGTACAGAGTTGCTATGTGTTTAATGGCGAGCAGGATGAAATATCTGATGCGTATATCTGCGCAACAGACGGAGGTATGCCGCAAATTCCAGCCACCGATGCTTTTCTGGCTGAAATTCGTGCGGCGGCTCGCAACGAGGGGATTAACTATACCGCAAGTCGTCTTGCTGCTGCTTTCAACCACGGATTTATCAATAAGTCTTTACGTGAAGTTTTCGACGTTACGCGCATGATTCTGTCAGCGAAAGAAGAGTTGGCTAATGAACCGCATCCGATTGATGGCCTGTCTGGTGAATATGCGGAAAAATCCCTAGAAGAATGGGCGGAACAGCTTCGCAAAGGAGTCATCCAGTGAGCAAGATTGACTATCAGGCACTGCGTGGGGCGGCAGTAGCAATTGAAACAGTAGCAACGCCTCAAAAATTGCTGGCATTTCGTATGAAAGTCACACCTCAGGTTGTGCTGGCACTGCTGGATGAACGGGAAAGAAACCAGCAATACATCAAACGCCGCGACCAAGAGAACGAGGATATTGCGCTTACTGTTGGGAAGCTGAGAGTTGAGCTTGAGGAGACAAAATCAAAACTCAACGAGCAGCGTGAGTATTACGAGGGCGTTATCTCGGATGGAAGTAAGCGCATAGCAGAGTTAGAAAGTGATTCTCAGGCACAAAAGTTAGTTGAAGCAATCATTGTTGCGATAGAAAACGAACAGGAACGTCTTTTTGATGAAGATTACCTAATGGATTCGAAAGAATGCATTGACGTAATTCGTGAAGAAGTAAAGCGATGGAATGATTCCCGCGCCGCTGGCATTCGCATCAAAGGAGAGTGAGATGGCAACTTTAACAAAAAAAGAACGGGCATGGTTGAACGAATTACAGGACGTTCTTGATCGCTGCCCATCACCGAAAAAAATTGGTTTTTACACCATTGGCGATAAAAGCATTTACCTGTATGACCTGCGCCGCATGGATGAAATCATGGAGGCTCTTGATAATCGTTCGTCAATGGATTGGTGTGTTGCTGTTCATGATATGAATGCCGGATTTGATGAAAAGATTTTATTCCCCTCATCAGTTGAAAGTACAGCAGGATAAGGACTAACACATGACCACTATTACCAAAGAGCGACTGCTGACAATCAGGCAGTGGCGCGAAACATACGGACCTGGTAGCAACGTTGTACTGCCAGCAGAAGAAGCGGAAGAACTGGCACGAATTGCTCTGGCATCGCTGGAAGCAGAGCCGATAGGTTTCCGTTGCAGGCGCAATGATAACCTTGGTGATTGGAGTTACGTATATCATCGAGAGCCAGATGATTTTGAGCGCAAACATTTAGTGATAGAGGGCATTTACGCCGCCCCTCCAGCACCGGTAGTGCCTGAAGAAGCAACTCCGGAAAACGTAGAAATGCTCTCTGGCTATGTTTCAACGTACAAATTAACCGATAGCGAGCGCGATATTGCTGCCGAAATATGGAACGCCTGCCGCGCCGCCATGCTTCAGTCCGGAAACTTTCGGGAAAACAAGAATTCGTCAACCAATAATTTTCGGGAAATCGCGGAAACGTCAACCAACTATCCGGCAATTCCTAGTGAGGTGTTGTCCGCAATCCTGAAGGTTGCCAGGATTCGTGCCGATTTCGATGATTTTGACGGTGACAGGCGAGGTATCGGTGATTGTCTGGATGAGGCTGAGCAAGAGCTTATCGTTACCATTAACAAATATGCCAGTCAGTTGGCAGCAGAACCTATAGCGCCTAATGACGTTCGAGAGCAGACAGCCATTCCACAAGTTCCGGTAACTCCGGATGGTTGGATAAGCTGTAGTGAGCGAATGCCGGAAAAGAATCAGAACGTGCTTATTTCGGTGAATTTCGATAGCTCTCTGGTTGAACCGCTAATATGCTCCGCACGCTATACCGGAAGCACCTTTCGGCGCGGAGATGCAACGATTAAGCCGGGTAATGGTATTGAGCAAGCAACTCACTGGATGCCGCTACCGGAACCGCCGCAGGAGGTGAAGTGATGAACAACTTAATGATCGACCTTGAGACGATGGGGAAAAATAAGGATGCACCGATCGTTTCCATTGGCGCGGTGTTCTTCACTCCAGAAACCGGAGACATCGGACAAGAATTCTATACGGTTGTTAGCCTGGAAAGTGCTATGGGGCAAGGAGCTACACCTGACGGCGATACCATCCTGTGGTGGTTGAAACAAAGCCCTGAAGCACGAGCTGCAATCTGTATTGATGATACTTTGTCGATCAGCGATGCTCTCTCAGAACTAAATCATTTCATTAACCGGCACGCAGACAATACGAAATATTTAAAAGTCTGGGGTAACGGAGCCACCTTCGACAACGTAATTTTACGTGGAGCTTATGAGCGAGCAGGACAAATCTGCCCGTGGGCATACTGGAATGACCACGATGTACGCACGATCGTTACGCTTGGGCGTTCCATCGGATTCGACCCCAAAATGGACATGCCTTTCGATGGCGAACGGCACAACGCCCTGGCTGATGCCCGTCATCAGGCAAAATATGTTTCCGCTATCTGGCAGAAATTAATTCCTGCCACCAGCACAGAATTATGATTTTCCCGGGTGCAGCCGGTTTTGATGGAGAAAATTATGAACACCTTGTTTTTACTGATGGCTGAATTCAATACCCCTAACATTGAACTCTCAGCAGTTAGCCAAAAGTACTTTGGCATGAGTCCAGCCACGGCAGAAGCAAAAGCAAACGCTTGTAAGTTGCCCGTTCCAACATATCGCATCGGCACATCACAAAAAGCAAAACGTTGCATCAATATTCAGGATCTTGCGGAATACATAGACAAAAGACGAGAAGAAGGACGTATCGAGTGGGAACAGGTCAGAACAAGCAAACAGAAGGGCAAAGAACATCACTAAAGAAAAAACCCGCCTAAAGGCGGGTTTTCAAAAAGCACCAGCTATGATCATGCTGCTTTGCGACGACGAAGCTTACCCTGCTGCTCTTTACCAGAGACAGTAGCGTGAGTGAACGCATTAGGAGCAGCCTTCATCAGAACTTCAACAGCAGCACCCATACCTGCGAATGCTTTCATTGTGTCGAACTTAACCTGTGGCTTGGTTGCTTTTTGATCTTTCATAGAAAACTCCCGAGACAGTAAAGGCGTCTCTAACCCTTTCTTTAAAGCTAGCTTGTTTCGCTAACTTATGCCAATCGATCATGTCGATTGGTGACATCGTTTCTTAGTAGTTTAAGCACAAAACGACTGCCATAGATGTACCTTTAAGGTAATCTGGACGGGTATCCTACAATTTGTAGACCCTTCTCGTCTATACCTACTGAGCAAATTTAAGAAAGATATCCTGCAGCTCATCAATGACTGCCGACATCACATAACCGCACTGTTCCATGCGGAAACCAAAAGACTCGTAATACTGCACCAGTTCTGGTACTGGCTCTACAATGTGGACAACTTTACATTCAACAGCTTTACAAAATATAAAAGCACTCATAAGAGTGAGTAAAACCATGCGCCCTTTCAATGGGTGAGATTCATCTTCTCTAGAAAACCTTTCGATCATATGGATACGAAAGATGTTTTCTTCAACCCCATAAACACAAATTGCTGCTCCTGATGGTATTCCCTGAACCCGACCTTGCTGAACAAGTTTTATGCAGAACTCATACTTTTCTCTGGAGTTGCCATAGGTACTTAACGCATAGTCCCATTCAAGCTCACCATAGCCACCACACAGAATCTTGTAATCATCATCACTGAGCGGACCAACAGCAAGAGGTAAGCCGACATGATCAATAATCAACTGGATATTGTTACGTACTGATTGACCTATCTCGTCCAGGGTAAGCATCATGGCCTCTCAAGCGGAACACTAAAAGTCGCATTATATCTCATTCTTAAGCCGCGTATGGATTACACCTTGAAATGAAAACGCCGGGTTCCCAATAGGCTCCCGCAGAGTGTATAACTACTTGTTTTTCAACAACGGTACATCCTATCAAGCATCGGGGCAATCGAGAAACGGTAAGCGGGATAAACAGTGTGTGATTCTGCTGGCATGGCGACTTTTTAAGCATCTGGTAAATTGGGGGCGCTACTATAGCATAACGAATATGCGATCAGGCAATGTGACAGCCATGACACCCTGTTCTCGGCGTTTAAGCGAGAACTATGGTAAAGTAAGGACATTCTTAACCCCACTGTCGAGGTGCCCAATGGAAAAGACCACAACGCAGGAGTTATTAGCGCAGGCTGAAAAAATCTGCGCACAGCGTAATGTGCGCCTGACCCCACAGCGCCTCGAAGTGTTGCGCCTGATGAGTCTGCAGGATGGCGCTATCAGCGCTTATGATCTGCTTGATTTGCTGCGCGAAGCTGAACCGCAAGCCAAGCCGCCAACGGTTTATCGCGCGCTGGAGTTTCTGCTTGAACAAGGTTTTGTGCATAAGGTGGAATCCACCAACAGTTATGTGCTCTGTCATCTGTTCGATCAGCCCACCCATACTTCAGCCATGTTTATTTGCGATCGCTGCGGCGCGGTGAAAGAAGAATGTGCTGAAGGCGTGGAAGACATCATGCATACGCTGGCGGCAAAAATGGGTTTTGCCCTGCGGCATAATGTTATTGAAGCACATGGATTATGTTCAGCATGTGTAGAAGTAGAAGCGTGTCGTCATCCTGAACAGTGCCATCATGACCACTCTATTCAGGTGAAAAAGAAACCTCGCTAAAAGGGTGTACATCCTTGTACATGTCGGGCAGGAGGGATTAATTACCAGCGGTAATCATGGCGTTTTTCCCAACTATCAACCTCTTTTTCTGCCTGATCTTTCTGATAACCGTAGCGTTCCTGGATTTTACCCACTAACTGATCACGTTTTCCTTCAATGATCGTCATATCATCATCGGTCAGTTTGCCCCATTGCTCTTTCACTTTACCTTTAAACTGTTTCCAGTTACCGCCGGCTTCATCTTTATTCATAATCAAGACCTCATCGTTAGGTTGTGAATGAGAGTACGTTCACTTTTCTTCTGAACGTGAGATTAAGTGTAGTCAGCAATTTGGCTTAGGATTATTTATTCAGAATTTTTAACCGTCACGTTGCGACAAACCAGGTATCGTTACGCCAGTGACGCCGCCAGATAGCCGCCAGAGAAAGCCCGCGTAACGCCAGAAAGACGGTTAACGCCAGCCACAAACCATGATTCCCCAGCCACGGCAGCGTAAGGAGCGTCAGCGCAAAACCTGCGGCGGCCACCGCCATACTGTTACGCATTTCGGCGGCACGCGTTGCGCCTATAAACATGCCGTCCAGCAGATAACACCAGACGCCGACCAACGGCAAAATCACCTGCCAGATAAGATAGCGGTCAGCCAGCTGCTGAATCTGGGTTAACGACGTCAGCAACGCAATGATGTGTTCCCCTGCCAGCAAATAAACCACCGAAAACAGTAACGCTACGATCCCCGACTGGCGGCACGCTGCCCGCCAGACATCCAGCAACTGGCTACCGTCGCGCGCACCATATGCCTGACCGGAATGTGCTTCAACCGCGTAGGCAAAACCATCCAGCGCATAGGCGGTAAAGGTGAGTAGCGTCATCAGAACCGCGTTAACAGCGATAATGTCACTCCCCAGTCGTGCGCCAAGTACGGTGATCGCGCCGAAACAGAGTTGCAACAACAGCGAGCGCAGCATGATATCGCGGTTAAGCGCCAGCAAGCGACGGAAATTTCCTCGCCAGGCAGTTTTCAGCATTTCGCCGGAGATTCCGCGTAGTTTGAGGATTTTACGCACCATTAGCAGACCAATCAGCAATGTTGCATATTCCGCAATAACCGTCGCCAGCGCCGCGCCCTGCACGTTCATATGCAGCCCCATCACCAGCCAGACATCCAGCACAATGTTGAGGATATTGCCGACCACTAACAAAATTACTGGCGCACGGGCATATTGCACGCCGAGTAACCAACCAAGTAATACCAGATTCGCCAGCGACGCCGGTGCGCTTAACCAGCGGATTTCAAGAAAGCGCCGCGCCTGTTCTAGCACCGCTTCGCTGCCGCCAACAATATGCAGCGCCAGATCGATAATCGGCGTACGCAGCAGCGCAATTAACGCCCCAGCCCCCAACGCCAACAGCAACGGTTGCACCAGCGCACGGGCTAATGCCTGAGGATTTTTGGCACCATAAGCCTGCGCAGTCAGCCCGGTGGTGCTCATGCGTAAAAACAGCAACAACATAAAGAGAAAGCTGGTCGCCGTTGCACCAACCGCCACGCCGCCCAAATAAACCGGACTATCAAGATGACCAATTACCGCCGTATCGACCAGTCCCAGCAACGGAACGGTGATATTGGAGAAAATCATGGGTAAGGCGAGATGCCAGAGTGCTTTATCAGATGAAGTGAGGAATGCCATGCAGACAAGCCTGATGAAGAGAGATGAAAAACAAACCGCGATACCAGGCGGCATCGCGGTCTCAGAGATATGTTACAGCCAGTCGCCGTTGCGAATAACCCCAACCGCCAGCCCTTCAATGGTGAAGCTCTGCTGACGAAGGTCAACGACAATTGGTTTAAACTCGCTGTTTTCTGGCAACAGTTCGACTTTATTGCCCTGTTTTTTCAGGCGCTTAACGGTAACTTCGTCATCAATACGTGCGACAACGACCTGACCGTTACGTACATCCTGAGTTTTATGCACTGCCAGCAAATCGCCATCCATAATGCCGATATCTTTCATCGACATCCCGCTGACACGCAGCAGGAAATCAGCATTCGGCTTGAACAAGGAAGGGTCGACCTGATAATGACCTTCAATATGCTGTTGCGCCAGAAGCGGTTCACCGGCAGCCACACGACCTACCAGCGGCAACCCTTCTTCCTCTTCCTGCAATAAACGAATCCCGCGTGATGCGCCGGAAACAATTTCAATAACGCCTTTGCGTGCCAGCGCCTTCAGATGTTCTTCAGCCGCGTTTGGGGAACGGAACCCCAAACGCTGCGCGATTTCCGCACGCGTCGGCGGCATACCTGTCTGGCTGATGTGATCACGGATGAGATCAAACACCTCTTGTTGCCTGGCCGTTAACGCTTTCAT
Protein sequences of DBSCAN-SWA_2 >CP029164|355346:412633|370443_370998_-|AWH68214.1|DBSCAN-SWA MLIGYVRVSTNDQNTDLQRNALNCAGCELIFEDKISGTKSERPGLKKLLRTLSAGDTLVVWKLDRLGRSMRHLVVLVEELRERGINFRSLTDSIDTSTPMGRFFFHVMGALAEMERELIVERTKAGLEAARAQGRIGGRRPKLSPEQWAQSGRLIASGVPRQKVAIIYDVGVSTLYKKFPVGDK >CP029164|355346:412633|373771_374830_-|AWH68220.1|plate|DBSCAN-SWA MADSEFQRPTLAENISMLRNDLFARLDVSDTLRRMDEDVRAKVYAAALHTVYGYIDYLAMNMLPDLCDESWLARHAAMKRCPRKGATAASGYMRWEGVSDGLKVTAGSVIQRDDLVQYTATADATSSGGVLRVPIACSSAGAVGNADDGTALILVTPVNGLPSSGVADTLTGGFDTEELETWRARVIERYYWTPQGGADGDYVVWAKEVPGITRAWTYRHLMGTGTVGVMIASSDLINPIPEESTETAARQHIGPLAPVAGSDLYVFRPVAHTVDFHIRVTPDTPEIRAAITAELRSFLLRDGYPQGELKVSRISEAISGANGEYSHQLLAPADNISIAKNELAVLGTISWT >CP029164|355346:412633|367807_368845_-|AWH68211.1|tRNA|DBSCAN-SWA MHDNHETQKINQTSVMPEKTGVYWNSRFSIAPMLDWTDRHCRYFLRLLSRNTLLYTEMVTTGAIIHGKGDYLAYSEEEHPVALQLGGSDPAALAQCAKLAEARGYDEINLNVGCPSDRVQNGMFGACLMGNAQLVADCVKAMRDVVSIPVTVKTRIGIDDQDSYEFLCDFINTVSGKGECEMFIIHARKAWLSGLSPKENREIPPLDYPRVYQLKRDFPHLTMSINGGIKSLEEAKAHLQHMDGVMVGREAYQNPGILAAVDREIFGSSDIDADPVAVVRAMYPYIERELSQGTYLGHITRHMLGLFQGIPGARQWRRYLSENAHKAGADINVLEHALKLVADKR >CP029164|355346:412633|397074_397728_-|AWH68252.1|DBSCAN-SWA MSNKYCQALVELRNKPAHELKEVGDQWRTPDNIFWGINTLFGPFVLDLFTDGDNAKCAAYYTAEDNALAHDWSERLVELKGAAFGNPPYSRASQHEGQYITGMRYIMKHASAMRDKGGRYVFLIKAATSEVWWPEDADHIAFIRGRIGFELPVWFIPKDEKQVPTGAFFAGAIAVFDKTWKGQAISYIGRDELEACGEAFLAQVRQQAEKLVREMAA >CP029164|355346:412633|403434_403797_+|AWH68260.1|DBSCAN-SWA MASERSTDVQAFIGELDGGVFETKIGAVLSEVASGVMNTKTKGKVSLNLEIEPFDENRVKIKHKLSYVRPTNRGKISEEDTTETPMYVNRGGRLTILQEDQGQLLTLAGEPDGKLRAAGH >CP029164|355346:412633|368932_370030_+|AWH68212.1|integrase|DBSCAN-SWA MAYYNIEKRLKSDGTPRYRCNVIIKEKGVITYRESKTFPKHAHAKTWGTQKVMELDLYGIPSSNAVDGLTVRDLLHKYLNDPNAGGKAGRTKRYVLELLMDSDISAIKLSELTENDVIEHCRLRNNAGAGPATVSHDVSYLGSVLDAAKPVYGINYTSNPAKSARPYLLKLGLIGKSNRRNRRPASDELDMLIEGLQQRSTHKCSKIPFVDILKFSVWSCMRIGEVCRLRWEDLDQEQKSILVRDRKDPRKKEGNHMKVALLGEAWDIVQRQPQKSEFIFPYNSTSVTAGFQRVRSKLGIKDLRYHDLRREGASRLFEAGFSIEEVAQVTGHRSLNVLWQVYTELYPKSLHNRFEELQRSRNKTP >CP029164|355346:412633|410301_410511_-|AWH68270.1|DBSCAN-SWA MNKDEAGGNWKQFKGKVKEQWGKLTDDDMTIIEGKRDQLVGKIQERYGYQKDQAEKEVDSWEKRHDYRW >CP029164|355346:412633|400117_400381_-|AWH68257.1|DBSCAN-SWA MSSQNYTEKAVKAAGKSLSEVARRFGFKSTQSVANWVINNQVPSERVLQLCELGNWSVTPHELRPDIYPNPNDGLPECYSKVSGSTA >CP029164|355346:412633|355346_355493_-|AWH68197.1|integrase|DBSCAN-SWA MPAKKNHASGEGSHQTIPLKRADALDSAALFTGGVVHSGYRGACRTQK >CP029164|355346:412633|359479_359836_-|AWH68201.1|DBSCAN-SWA MTISELLQYCMAKAGAEQSVHSDWKATQIKVEDVLFAMVKEVENRPAVSLKTSPELAELLRQQHSDVRPSRHLNKAHWSTVYLDGSLPDSQIYYLVDASYQQAVNLLPEEKRKLLVQL >CP029164|355346:412633|359839_360256_-|AWH68202.1|DBSCAN-SWA MWYQKTLTLSAKSRGFHLVTDEILNQLADMPRVNIGLLHLLLQHTSASLTLNENCDPTVRHDMERFFLRTVPDNGNYEHDYEGADDMPSHIKSSMLGTSLVLPVHKGRIQTGTWQGIWLGEHRIHGGSRRIIATLQGE >CP029164|355346:412633|384983_386213_-|AWH68236.1|capsid|DBSCAN-SWA MKLHELKQKRNTIATDMRALNEKIGDNAWTEEQRTEWNKAKSELEALDERIAREEELRRQDQAYIESNEEEQRQNLDPENNPQQDEKRAQVFDKWMRHGASELTSEERKALRELRAQGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQILTTSDGRTMEWATADGTSEVGVLLGENEEAGEEDTDFGMGSLGALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGRGEARYLIQGTGAGTPKQPKGLAASVTGTTQTAAANAVKWQEILALKHSIDPAYRRGPKFRLAFNDNTLKLISEMEDGQGRPLWLPDIVGVAPASVLNVPYVIDQEIDDIGAGKKFMFCGDFDRFIIRRVRYMILKRLVERYAEYDQTGFLAFHRFDCILEDTSAIKALVGKGSVGG >CP029164|355346:412633|408147_408429_+|AWH68267.1|DBSCAN-SWA MNTLFLLMAEFNTPNIELSAVSQKYFGMSPATAEAKANACKLPVPTYRIGTSQKAKRCINIQDLAEYIDKRREEGRIEWEQVRTSKQKGKEHH >CP029164|355346:412633|383136_383697_-|AWH68232.1|DBSCAN-SWA MKLTPVIAALRARCPYFEKRVAGAAQFKNLPEVGKLRLPAAYVVPGDDSPGENKSQTDYWQELKEGFSVVVILSNGRDERGQFASYDVVDDVRQMLFKALLGWNPEACGNPITYDGGTLLDLNRHELIYQFDFSVISELTEDDTRQQDDLNSLDELRTLAIDVDYLDPGNGPDGDIEHHTEITLPS >CP029164|355346:412633|399504_400089_-|AWH68256.1|DBSCAN-SWA MGKEPEWKVDKQPAWLVAAIRRTIADLPHGYEEAAEILGLYKSDDITPAKDQLHNRLRSGGDQIFPLEWAMVLQDASGTRHVTDAIARRSNGVFVPLVDIDDIDNGDINQRLMESIEWIGKHSQYLRKATADGVIDRAEREQIEENSYQVMAKWQEHLTLLFRVFCAPEKSNARECAAPGVVASIASGCGETNA >CP029164|355346:412633|355451_355733_+|AWH68198.1|DBSCAN-SWA MATLTTGVVLLRWQLLSAVMMFLASTLNIRFRRSDYVGLAVISSGLGVVSACWFAMGLLGITMADITAIWHNIESVMIEEMNQTPPQWPMILT >CP029164|355346:412633|395536_396346_-|AWH68249.1|DBSCAN-SWA MNNLMVIDGIEVRRDAYGRYSLNDLHRAAVASGANARTKEPGKFLSSQQTVELVHELTNTQNLGVDPVSVIHGGNERGTYVCKELVYAYAMWISPSFHLKVIRTFDMVTSAPEKLSGQAADKMQAGVILLDFMRRELNLSNSSVLGACQKLQEAVGLPNLAPRYAIDAPADAHDGSSRPTLSLSALLKQYGIRLTANQAYHQMVKLGIVEQRERYSRTGINNIKKFWSLTAKGCMFGKNITSPANPRETQPHFFESRFPELLKLLDTVH >CP029164|355346:412633|391067_391292_-|AWH68242.1|DBSCAN-SWA MLENYFPSGIGAQPVTQAQKQRVIAVQAALELVKASLSSAGGQATSAKFEQELKAAIGLIEPLADAIQKAISKE >CP029164|355346:412633|393365_393623_-|AWH72414.1|DBSCAN-SWA MQGEKQQPYFFNPGMTVEQLEDWLEQQKLHLSRYNRLVKEKAELEERLSDISVEIERMSAGGFNGKLSFPWESSSLLRNHQQGSV >CP029164|355346:412633|361442_361985_+|AWH68205.1|DBSCAN-SWA MVTNFITPEGDDDMNISNVNSNNTTSLPVELDTLNNKGISYDKDFSYAKDLFLYIETQLKIAKDFCRPGEEVSSSIASTVFHAFIDLVNKIRGKKDCMYIFTLCCFAEEVKGDYSHYRTFLFDIGNQYKVKLTQSGKKEFSLTLEFNDTIIESQIVTGNKAKHILEDIEKFYRNKPDTYY >CP029164|355346:412633|392899_393286_-|AWH68246.1|DBSCAN-SWA MSDPISGTGLAGGALTGASVYGLLTGTDYGVVFGAFAGAVFYIATAADLSASRRLAYFIVSYIAGILCSGLVGSKLANLTGYSDKPLDAIGAVIVSALAVKILTFLNNQDIGSLVALITRRGGSGGAK >CP029164|355346:412633|388192_389926_-|AWH68239.1|terminase|DBSCAN-SWA MSRKSYPNVNAANQYARDVVRGKIVACQFVIQACQRHLDDLMAEKSKSFRYRFDKDLAERAAKFIQLLPHTKGEWAFKRMPITLEPWQLFVICCAFGWVNKGSRLRRFREVYTEIPRKNGKSAISAGVALYCFACDNEFGAEVYSGATTEKQAWEVFRPARLMCKRTPMLTEAFGIEVNASNMNRPEDGARFEPLIGNPGDGSSPHCAVVDEYHEHATDALYTTMLTGMGARRQPLMWAITTAGYNIEGPCYDKRREVIEMLNGSVPNDELFGIIYTVDEGDDWTDPQVLEKANPNIGVSVYREFLLSQQQRAKNNARLANVFKTKHLNIWVSARSAYFNLVSWQSCEDKSLTLEQFEGQPCILAFDLARKLDMNSMARLYTREIDGKTHYYSVAPRFWVPYDTVYSVEKNEDRRTAERFQKWVEMGVLTVTDGAEVDYRYILEEAKAANKISPVSESPIDPFGATGLSHDLADEDLNPVTIVQNFANMSDPMKELEAAIESGRFHHDGNPIMTWCIGNVVGKNMPGNDDLVKPVKEQAENKIDGAVALIMTIGRAMLKEPDDFLSSLDPDDDLLIL >CP029164|355346:412633|393773_394526_-|AWH68247.1|DBSCAN-SWA MNLEALPKYYSPKSPKLSDDAPATGSGGLTITDVMAAQGMVQSKAPLGFALFLAKVGVQDPQFAIEGLLNYAMALDNPTLNKLSEETRLQLIPYLVNFAFADYSRSAASKARCEHCAGTGFHNVLREVVKHSRSGESVIKEEWVKELCQHCHGKGEVSTACRGCKGKGIVLDEKRTRLHGTPVYKICGRCNGNRFSRLPTTLARHHVQKLVPDLTDYQWYKGYADVIDKLVTKCWQEEAYAEAQLRKVTR >CP029164|355346:412633|372119_372539_-|AWH68217.1|tail|DBSCAN-SWA MDRYFYSQKENGFFTDLNKAPSDAVEITTDEWLSLLDGQDNGMKIVSNQEGYPVLTEQPPLSKENLIALAELKKGKLINEANEHMNSRQWPGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPAVQAR >CP029164|355346:412633|375241_375790_-|AWH68222.1|plate|DBSCAN-SWA MSTIEAMQRQLLGLIGRAVVKSISAATKCQTVDVSLIAGEPKAGVEHLEPYGFTARANSGAEAVVLFPDGDRSHAVVVTVSDRRYRLKGLQTGEVAVYDDQGQSVTLTREGIVVDGAGKTITFRNSPKARFEMDLEVTGQVKDLCDSGGTTMSAMRLAYNGHRHRENGQGSNTDKPDKAMEA >CP029164|355346:412633|372542_373193_-|AWH68218.1|DBSCAN-SWA MHRIDTKTAQKDKFGAGKNGFTRGNPQTGTPATDLDDDYFDMLQEELCSVVEASGASLEKARHDQLLTALRALLLSRKNPFGDIKSDGTVKTALENLGLGEAAKRDVGTGENQIPDMSAWKRNPSSNRWRKLPDGTIIQMGISASGPLGSPVNITLPISFSNTNYCVVASYDNARLGVSTMVSFAALPVSPSQFSLMSSVTEQGINPFAYWIAFGD >CP029164|355346:412633|392631_392913_-|AWH68245.1|holin|DBSCAN-SWA MELNDPTATINALLCACVVITLMFYRRGDSRHRPWVSRLAWLITVTYSAVPLAYLCGIYPHSSWPIIVANTIFLSVLVAVRGNVARLVDHLRH >CP029164|355346:412633|355831_356368_-|AWH68199.1|DBSCAN-SWA MASRGVNKVILVGNLGQDPEVRYMPNGGAVANITLATSESWRDKATGEMKEQTEWHRVVLFGKLAEVASEYLRKGSQVYIEGQLRTRKWTDQSGQDRYTTEVVVNVGGTMQMLGGRQGGGAPAGGNIGGGQPQGGWGQPQQPQGGNQFSGGAQSRPQQSAPAAPSNEPPMDFDDDIPF >CP029164|355346:412633|410626_412006_-|AWH68271.1|DBSCAN-SWA MPPGIAVCFSSLFIRLVCMAFLTSSDKALWHLALPMIFSNITVPLLGLVDTAVIGHLDSPVYLGGVAVGATATSFLFMLLLFLRMSTTGLTAQAYGAKNPQALARALVQPLLLALGAGALIALLRTPIIDLALHIVGGSEAVLEQARRFLEIRWLSAPASLANLVLLGWLLGVQYARAPVILLVVGNILNIVLDVWLVMGLHMNVQGAALATVIAEYATLLIGLLMVRKILKLRGISGEMLKTAWRGNFRRLLALNRDIMLRSLLLQLCFGAITVLGARLGSDIIAVNAVLMTLLTFTAYALDGFAYAVEAHSGQAYGARDGSQLLDVWRAACRQSGIVALLFSVVYLLAGEHIIALLTSLTQIQQLADRYLIWQVILPLVGVWCYLLDGMFIGATRAAEMRNSMAVAAAGFALTLLTLPWLGNHGLWLALTVFLALRGLSLAAIWRRHWRNDTWFVAT >CP029164|355346:412633|362206_363400_-|AWH68206.1|DBSCAN-SWA MFQKVDAYAGDPILTLMERFKEDPRSDKVNLSIGLYYNEDGIIPQLKAVADAEARLNAQPHGASLYLPMEGLNSYRHAIAPLLFGADHPVLQQQRVATIQTLGGSGALKVGADFLKRYFPESGVWVSDPTWENHVAIFAGAGFEVSTYPWYDEATNGVRFNDLLVTLKTLPARSIVLLHPCCHNPTGADLTNDQWDAVIEILKARELIPFLDIAYQGFGAGMEEDAYAIRAIASAGLPALVSNSFSKIFSLYGERVGGLSVLCEDAEAAGRVLGQLKATVRRNYSSPPNFGAQVVAAVLNDEALKASWLAEVEEMRTRILAMRQELVKVLSTEIPERNFDYLLNQRGMFSYTGLSAAQVDRLREEFGVYLIASGRMCVAGLNTANVQRVAKAFAAVM >CP029164|355346:412633|363652_364732_-|AWH68207.1|DBSCAN-SWA MQAATVVINRRALRHNLQRLRELAPASKMVAVVKANAYGHGLLETARTLPDADAFGVARLEEALRLRAGGITKPVLLLEGFFDARDLPTISAQHFHTAVHNEEQLAALEEASLDEPVTVWMKLDTGMHRLGVRPEQAEAFYHRLTQCKNVRQPVNIVSHFARADEPKCGATEKQLAIFNTFCEGKPGQRSIAASGGILLWPQSHFDWVRPGIILYGVSPLEDRSTGADFGCQPVMSLTSSLIAVREHKAGEPVGYGGTWVSERDTRLGVVAMGYGDGYPRAAPSGTPVLVNGREVPIVGRVAMDMICVDLGPQAQDKAGDPVILWGEGLPVERIAEMTKVSAYELITRLTSRVAMKYVD >CP029164|355346:412633|404815_405352_+|AWH68262.1|DBSCAN-SWA MSFIKTFSGKHFYYDKINKDDIVINDIAVSLSNICRFAGHLSHFYSVAQHAVLCSQLVPQEFAFEALMHDATEAYCQDIPAPLKRLLPDYKRMEEKIDAVIREKYGLPPVMSTPVKYADLIMLATERRDLGLDDGSFWPVLEGIPATEMFKVIPLSPGHAYGMFMERFNELSELRKCA >CP029164|355346:412633|366282_367266_+|AWH68209.1|DBSCAN-SWA MATRIEFHKHGGPEVLQAVEFTPADPAENEIQVENKAIGINFIDTYIRSGLYPPPSLPSGLGTEAAGIVSKVGSGVKHIKAGDRVVYAQSALGAYSSVHNINADKAAILPAAISFEQAAASFLKGLTVYYLLRKTYEIKPDEQFLFHAAAGGVGLIACQWAKALGAKLIGTVGTAQKAQSALKAGAWQVINYREENLVERLKEITGGKKVRVVYDSVGRDTWERSLDCLQRRGLMVSFGNSSGAVTGVNLGILNQKGSLYVTRPSLQGYITTREELTEASNELFSLIASGVIKVDVAEQQKYPLKDAQRAHEILESRATQGSSLLIP >CP029164|355346:412633|386223_386826_-|AWH68237.1|head,protease|DBSCAN-SWA MNDREIRCYSGEVRAERHDDNPAHIIGYGSVFDCRSELIFGSFREIIRPGAFDDVLGDDVRALFNHDPNFILGRSAAGTLNLSVDERGLRYDIQAPETQTIRDLVLAPMQRGDINQSSFAFRVARDGEEWYQDEDGVVIREITRFSRLLDVSPVTYPAYQEADSAVRSMKAWQEARNSGALQKAINQRMARERVLTLLNA >CP029164|355346:412633|408846_409380_-|AWH68268.1|DBSCAN-SWA MLTLDEIGQSVRNNIQLIIDHVGLPLAVGPLSDDDYKILCGGYGELEWDYALSTYGNSREKYEFCIKLVQQGRVQGIPSGAAICVYGVEENIFRIHMIERFSREDESHPLKGRMVLLTLMSAFIFCKAVECKVVHIVEPVPELVQYYESFGFRMEQCGYVMSAVIDELQDIFLKFAQ >CP029164|355346:412633|392017_392632_-|AWH68244.1|DBSCAN-SWA MNQQLFQKAAGISAGLAARWFPHIDAAMKEFGITAPADQAMFIAQVGHESMGFRALVENFNYTPSALVATFGKRITQQQADALGRTSGHAARQDAIANLVYSNRLGNKAPGDGWKYRGRGLIQITGLHNYRICGAALKLDLVTSPEQLEQELQAARSAAWFYTSKGCMVYGADITRVTRIINGGLNGIEDRKIRYNKAREALLV >CP029164|355346:412633|371545_372148_+|AWH68216.1|tail|DBSCAN-SWA MDKAVLNSELIATKAGNITVYNYDGETREYISTSNEYLAVGVGIPAYSCLDAPGTYKAGYAICRSVDLNSWEYMPDHRGEIIYSTETGEAKEITVPGDYPENTTTIVPLTPYDKWDGEKWVTDSEAQHGAAVEAAEAQRQSLIDAAMASISLIQLKLQAARKLTQAETTRLNAVLDYIDAVTATDTSTAPDVIWPELPEA >CP029164|355346:412633|397727_398222_-|AWH68253.1|DBSCAN-SWA MIMSLLNDVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTRRHFPRLTERAQEPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARREQCLRKSSKRAVSGDEWYLSGNYVGA >CP029164|355346:412633|384581_384905_-|AWH68235.1|head,tail|DBSCAN-SWA MLLKMEEIKLQLRLDDDFSDEDELLELLGKAAQSRTENFLNRTLYATADDRPADDSDGLVISDDVKLALLLLVSHFYENRSTVTDVEKMELPMSFNWLVAPYRLIPL >CP029164|355346:412633|405342_405864_+|AWH68263.1|DBSCAN-SWA MRMNVFEMEGFLRGKCVPRDLKVNETNAEYLVRKFDALEAKCETLAAENARLNKFIVQSCYVFNGEQDEISDAYICATDGGMPQIPATDAFLAEIRAAARNEGINYTASRLAAAFNHGFINKSLREVFDVTRMILSAKEELANEPHPIDGLSGEYAEKSLEEWAEQLRKGVIQ >CP029164|355346:412633|382957_383128_-|AWH68231.1|DBSCAN-SWA MFVKPVKGRSVPDPARGDLLPAEGRNVDENNYWLRREAAGDIRRVNKKVNTDDDKL >CP029164|355346:412633|380852_381122_-|AWH68228.1|tail|DBSCAN-SWA MKELELKKPITAHGETLSVLEFDEPTGKDVRELGYPYQMNQDESVKLLAHVVSKYIVRLAKVPQSSVDQMSPADLNAAAWLVAGFFLQA >CP029164|355346:412633|360366_361080_-|AWH68203.1|DBSCAN-SWA MRKITQAISAVCLLFALNSSAVALASSPSPLNPGTNVARLAEQAPIHWVSVAQIENSLAGRPPMAVGFDIDDTVLFSSPGFWRGKKNFSPESEDYLKNPVFWEKMNNGWDEFSIPKEVARQLIDMHVRRGDAIFFVTGRSPTKTETVSKTLADNFHIPATNMNPVIFAGDKPGQNTKSQWLQDKNIRIFYGDSDNDITAARDVGARGIRILRASNSTYKPLPQAGAFGEEVIVNSEY >CP029164|355346:412633|376865_378194_-|AWH68224.1|DBSCAN-SWA MTWKDRLQDASFRGVPFKVEEESAGTGRRVETHEYPNRDKPYTEDLGKVTFRPSITAYVVGDDCFDQRDRLIDALNKPGPGTLVHPTYGELKVCVDGEVRVSTSKSEGRIVRFDLKFVEAGELSYPTSGAATAQTLMSSCSALDDCISDSFSGFSIDGVADFVQNDVIGNASIMLGYVSDAMKVVDSAVSDAARLLQGDISVLLPPPSSGKNFVEQVQKMWRTGKRLYGNASDLVTMIKTLSGVSLGSDLQPRGVWKTDSKTTATATQQRNVVASTLRTTAISEAAYAVTRLPAPTTSAVMQNSAVGQATTPAQSTGWPSVTHPALNNAPAVKNTVDLPTWEELTDIRDTLNTAIDKELSRTTSDALFLALRRVKADLNADINTRLEQSARIIQRTPDEVLPALVLAATWFDNAARDADIIRRNAITHPGFVPVIPLKVPVQ >CP029164|355346:412633|396365_396755_-|AWH68250.1|DBSCAN-SWA MKLILPFPPSVNTYWRHPNKGAFAGKSLISAAGRKFQSAACAAIVEQLRRLPKPTSAPASVEIVLFPPDNRIRDLDNYNKALFDALTHAGVWEDDSQVKRMLVEWGPVIPEGKVEITISKYEKTAGAAA >CP029164|355346:412633|381121_381478_-|AWH68229.1|tail|DBSCAN-SWA MARIGGTCYFKIDGQQLSLTGGIEVPMNRTVNDDIIGLDGSVDRKETHRAPYVKGTFKVPKNFPVNKITSSDEMTITAELANGQVYVLSSAWLHGEANHNAEEGTVDLEFHGEEGDYQ >CP029164|355346:412633|373196_373781_-|AWH68219.1|DBSCAN-SWA MDVTNDDYIRLLSALLPPGPAWSASDPAIAGAAPSLTRVHQRADALMRELDPRTTTELINRWERLCGLPDECIPAGTQTLRQRQQRLDAKVNLAGGINEDFYLAQLAALGRPDATITRYDKGTFTCSSACTDAVNAPEWRYYWQVNMPAATNTTWMTCGDPCDSALRIWGDTVVECVLNKLCPSHTYVIFKYPE >CP029164|355346:412633|356622_359445_+|AWH68200.1|DBSCAN-SWA MDKIEVRGARTHNLKNINLVIPRDKLIVVTGLSGSGKSSLAFDTLYAEGQRRYVESLSAYARQFLSLMEKPDVDHIEGLSPAISIEQKSTSHNPRSTVGTITEIHDYLRLLFARVGEPRCPDHDVPLAAQTVSQMVDNVLSQPEGKRLMLLAPIIKERKGEHTKTLENLASQGYIRARIDGEVCDLSDPPKLELQKKHTIEVVVDRFKVRDDLTQRLAESFETALELSGGTAVVADMDDPKAEELLFSANFACPICGYSMRELEPRLFSFNNPAGACPTCDGLGVQQYFDPDRVIQNPELSLAGGAIRGWDRRNFYYFQMLKSLADHYKFDVEAPWGSLSANVHKVVLYGSGKENIEFKYMNDRGDTSIRRHPFEGVLHNMERRYKETESSAVREELAKFISNRPCASCEGTRLRREARHVYVENTPLPAISDMSIGHAMEFFNNLKLAGQRAKIAEKILKEIGDRLKFLVNVGLNYLTLSRSAETLSGGEAQRIRLASQIGAGLVGVMYVLDEPSIGLHQRDNERLLGTLIHLRDLGNTVIVVEHDEDAIRAADHVIDIGPGAGVHGGEVVAEGPLEAIMAVPESLTGQYMSGKRKIEVPKKRVPANPEKVLKLTGARGNNLKDVTLTLPVGLFTCITGVSGSGKSTLINDTLFPIAQRQLNGATIAEPAPYRDIQGLEHFDKVIDIDQSPIGRTPRSNPATYTGVFTPVRELFAGVPESRARGYTPGRFSFNVRGGRCEACQGDGVIKVEMHFLPDIYVPCDQCKGKRYNRETLEIKYKGKTIHEVLDMTIEEAREFFDAVPALARKLQTLMDVGLTYIRLGQSATTLSGGEAQRVKLARELSKRGTGQTLYILDEPTTGLHFADIQQLLDVLHKLRDQGNTIVVIEHNLDVIKTADWIVDLGPEGGSGGGEILVSGTPETVAECEASHTARFLKPML >CP029164|355346:412633|409744_410260_+|AWH68269.1|DBSCAN-SWA MEKTTTQELLAQAEKICAQRNVRLTPQRLEVLRLMSLQDGAISAYDLLDLLREAEPQAKPPTVYRALEFLLEQGFVHKVESTNSYVLCHLFDQPTHTSAMFICDRCGAVKEECAEGVEDIMHTLAAKMGFALRHNVIEAHGLCSACVEVEACRHPEQCHHDHSIQVKKKPR >CP029164|355346:412633|406344_406608_+|AWH68264.1|DBSCAN-SWA MATLTKKERAWLNELQDVLDRCPSPKKIGFYTIGDKSIYLYDLRRMDEIMEALDNRSSMDWCVAVHDMNAGFDEKILFPSSVESTAG >CP029164|355346:412633|361126_361306_+|AWH68204.1|DBSCAN-SWA MLQKPSDHNASSRGKEGLSAVFVMKSRMIIFNPVNTFLSFFVKKMFITRCKNTLILRLL >CP029164|355346:412633|374816_375245_-|AWH68221.1|tail|DBSCAN-SWA MMELWLTVNGKRTCASAPLDPLTRAVVISLFTWRRAEPDDNADVPMGWWGDTWPAVQNDRYGSRLWLLQRSKLTNQLVQTVRGYIRECLQWMIDDGVVSRIDLDIRRTGINELGNSITLWRRDGPVMISFDDLWSAITHGGQ >CP029164|355346:412633|386818_388045_-|AWH68238.1|portal|DBSCAN-SWA MLLDALFRSKSLENPSTPITGDAVDTDGLFRADVYVSPETAMKLAAVYSCIYVLSSSLAQMPLHVMRRHKGKVEPARDHPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKRNRRGEVISLDCCMPWETTLMNTGGRYTYGLYNEYGAFAISPDDMIHIRALGNNQKMGLSPIMQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKSGLNKESWGWLKDQWQKASQALRRQENKTMLLPADLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKATFSNISAQAIQFVRYTMMPWVTNWEQELNRRLFTRAELAAGYYVRFNLTGLLRGTPQERAQFYHFAITDGWMSRNEARAFEDMNPVEGLDEMLVSVNAANPAGDFKPPKNDEGKTNE >CP029164|355346:412633|407538_408111_+|AWH68266.1|DBSCAN-SWA MNNLMIDLETMGKNKDAPIVSIGAVFFTPETGDIGQEFYTVVSLESAMGQGATPDGDTILWWLKQSPEARAAICIDDTLSISDALSELNHFINRHADNTKYLKVWGNGATFDNVILRGAYERAGQICPWAYWNDHDVRTIVTLGRSIGFDPKMDMPFDGERHNALADARHQAKYVSAIWQKLIPATSTEL >CP029164|355346:412633|399149_399329_-|AWH68255.1|DBSCAN-SWA MREVNRWFKDHYGVPVRVIRWEPETQLVIYLREGYEHECFSPLEQFRRKFREIEVGHEH >CP029164|355346:412633|406618_407539_+|AWH68265.1|DBSCAN-SWA MTTITKERLLTIRQWRETYGPGSNVVLPAEEAEELARIALASLEAEPIGFRCRRNDNLGDWSYVYHREPDDFERKHLVIEGIYAAPPAPVVPEEATPENVEMLSGYVSTYKLTDSERDIAAEIWNACRAAMLQSGNFRENKNSSTNNFREIAETSTNYPAIPSEVLSAILKVARIRADFDDFDGDRRGIGDCLDEAEQELIVTINKYASQLAAEPIAPNDVREQTAIPQVPVTPDGWISCSERMPEKNQNVLISVNFDSSLVEPLICSARYTGSTFRRGDATIKPGNGIEQATHWMPLPEPPQEVK >CP029164|355346:412633|401416_402502_+|AWH68259.1|DBSCAN-SWA MINERTEATDGVADMISTNTKYLVWNNKGGVGKTFLTYNLAVEFAISHPDQDVVVIDSCPQSNVSEIILGGNGTGEENLNKLRDRNVTIAGYIKERFSKSPLSRLGNESSYFVRAHDVNAKMPENLYILPGDVDLDICSRLISHIGSSPVKEAWKKSRSLLVDLIASFEADKNISDRAKTFFIDCNPSFASYTELGVVAANRIIIPCTADAASIRGIKNLVKLIYGVSIDKSEQDEMFLDFNKEAKQNLIELPELHLFVQNRSRTNESDAAKAFKSHAEEIKRITDDLLNTHPHLFTNVATFERVQNVKDGNTLAAIINHEGCPLSRLQHKSYTIYGMATQANRAQIEALESDVSTVVKCL >CP029164|355346:412633|412024_412633_-|AWH68272.1|DBSCAN-SWA MKALTARQQEVFDLIRDHISQTGMPPTRAEIAQRLGFRSPNAAEEHLKALARKGVIEIVSGASRGIRLLQEEEEGLPLVGRVAAGEPLLAQQHIEGHYQVDPSLFKPNADFLLRVSGMSMKDIGIMDGDLLAVHKTQDVRNGQVVVARIDDEVTVKRLKKQGNKVELLPENSEFKPIVVDLRQQSFTIEGLAVGVIRNGDWL >CP029164|355346:412633|378878_380711_-|AWH68226.1|tail|DBSCAN-SWA MAEFELKALITGVDRLSPALSKMQKKIRGFKRQAEEASQGGLALGGGLAAGLTLSLKSYADQENAATGLKVAMMDANGEVGKSFQDINKLAIGLGNQLPGTTADFQNMMQMLVRQGIPAENILGGVGKATAYLAVQLKKTPEAAAEFAAKMQDATGTASEDMMGLFDTIQKAFYLGVDDTNMLSFFTKTSSVLKMVNKDGLQAAQSLAPISVMMDQMGMNGESAGNALRKVIQSGLSVKKIRDVNKVMARQKLGVQLDFTDGKGSFGGLDNMFRQLAKLRKLTDVKRTGVLKAIFGDDAETLQVVNALIDKGKDGYDQIQQKMNKQASLNKRVQAQLGTLSNLWEAMTGTATNGLAAIGGAFSGDAKNITQWLGELGEKFTKFADENPRVIRGVVGLAAGLAILKLGLMGVGSAISIVSRIMSMTPIGMIATAIALAAGLIITNWDVVGPYFKKLWETIGPYFEAGWELLKKVFAWSPLGMVINNWGPVVKWFQDMWDKLKPIIEWFTDSSGDTVDAINSAQWGAGAYDAYGTGIPARGYTPYPAVDPAQSNNASDATGSNPFMINKASVPKVDGEIKVSFVNSPPGMRVMETRSSGFDVSHDVGYTRFR >CP029164|355346:412633|370091_370340_+|AWH68213.1|DBSCAN-SWA MRIEICIAKEKMTKMPTGAVDALKEELTRRISKRYDDVEVIVKATSNDGLSVTRTADKDSAKTFVQETLKDTWESADEWFVH >CP029164|355346:412633|400487_401192_+|AWH68258.1|DBSCAN-SWA MVEEQKYPDFAKRLNELMTIKGISVTQLKSLVGVTYEMARRYTIGAAKPRVSVMSKLALALGVSASYLEYGVGDREECKEMASIPNPTKPDVYRIEVLDLSVSAGPGTYMLSDYVDVLYAIEFTTEHARSLFGNRSQNDIKVMTVNGDSMSPTLVSGDRLFVDISVRHFQTDGVYSFVYGKTFHVKRLQMQGNKLAVLSDNPAYEKWYIDEKSQDQLYVMGKALIHESIKYNRL >CP029164|355346:412633|390550_390901_-|AWH68241.1|DBSCAN-SWA MPPRTPKACRFRGCRNTTTDPSGYCESHKSEGWKQYKPGQSRHQRGYGSKWDSIRAGVLKRDKGLCQLCLRAGVVREAKTVDHIIPKAHGGTDADCNLQSLCWPCHKAKTARERLK >CP029164|355346:412633|383693_384200_-|AWH68233.1|DBSCAN-SWA MATPFFHVDVQQPAEMRFNRARVRRAFVTIGQRHMRDARRLVMRRARSAPGENPGYQTGRLARSIGYMVPRASKKRAGFMTRIAPNQRNGKGNRMISGDFYPAFLFFGVRGGAKRRRSHHRGASGGSGWRLAPRNNFMVETLEKNRSWTRYFLARELRKSLKPERRHR >CP029164|355346:412633|378284_378797_-|AWH68225.1|DBSCAN-SWA MIEGMIMRIFVFFISALLSFNLAAEECKFSFNESELISSIGIAPVKQEIIKDEGITKRQYEFRRELSSEEMLSDDADEKYEPQFYISVYNPSCPQKVIVWFFKDNKNTMDLSNEVLAGRAFKYLTGVNESIFENKMKKFLKVQSFESFDERTDSKFIKSGDIYSIDVQLR >CP029164|355346:412633|367431_367674_-|AWH68210.1|DBSCAN-SWA MLELLFVIGFFVMLMVTGVSLLGIIAALVVATAIMFLGGMLALMIKLLPWLLLAIAVVWVIKVIKAPKVPKYQRYDRWRY >CP029164|355346:412633|380703_380886_-|AWH68227.1|DBSCAN-SWA MACGWFFPPGLTAEYLTDRFFDCASYWRINPFELLNMPISEIPLLVSQANRIEQEKRTHG >CP029164|355346:412633|364784_366200_-|AWH68208.1|DBSCAN-SWA MAGNKPFNKQQAEPRERDPQVAGLKVPPHSIEAEQSVLGGLMLDNERWDDVAERVVADDFYTRPHRHIFTEMARLQESGSPIDLITLAESLERQGQLDSVGGFAYLAELSKNTPSAANISAYADIVRERAVVREMISVANEIAEAGFDPQGRTSEDLLDLAESRVFKIAESRANKDEGPKNIADVLDATVARIEQLFQQPHDGVTGVNTGYDDLNKKTAGLQPSDLIIVAARPSMGKTTFAMNLVENAAMLQDKPVLIFSLEMPSEQIMMRSLASLSRVDQTKIRTGQLDDEDWARISGTMGILLEKRNIYIDDSSGLTPTEVRSRARRIAREHGGIGLIMIDYLQLMRVPALSDNRTLEIAEISRSLKALAKELNVPVVALSQLNRSLEQRADKRPVNSDLRESGSIEQDADLIMFIYRDEVYHENSDLKGIAEIIIGKQRNGPIGTVRLTFNGQWSRFDNYAGPQYDDE >CP029164|355346:412633|381477_382974_-|AWH68230.1|tail|DBSCAN-SWA MTISFNTIPSNTLVPIFYAEMDNSAANTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGAGSQLARMVEAYRQTDPFGELYVIAVPEATGAAATVTLTVTGAATETGTVNVYVGRTRVQAPVTNGDNVATIASSIKDAINAVPTLPFTASSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIGLPFNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKIGTLSELVTAGDQFNQQHITLAGYEKETQTPADELAASRTARAAVFIRNDPARPTQTGELVGMLPAPKGKRFTMTEQQTLLSYGVATAYVESGVLRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVVERDASDPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA >CP029164|355346:412633|394539_395529_-|AWH68248.1|DBSCAN-SWA MRALLTPEIAPRMGIVLFRPGSELMPLFMQGRVLLEPEPERYSSFASGAVPAASQPLADDPAVRAVFRNEAVIRRAGGVECLESWLLREKGCQWPHSDWHSENMTTMRHAPGAIRLCWHCDNQLRDQFTERLESMATDNCARWVLSVVRRDLGFDDSHVVTMPELCWWLIRNDLADALPESAARKALRLPKPVVPSVTRESDLVPSVPATSIIQDKAKKVLALKVDPESPESFMLRPKRRRWVNEKYTRWVKTQPCACCGKPADDPHHLIGHGQGGMGTKAHDLFVLPLCRKHHDELHADTVAFEEKYGSQLELIFRFIDRALAIGVLA >CP029164|355346:412633|375789_376869_-|AWH68223.1|plate|DBSCAN-SWA MNDNVTLRVNGREWNGWTSVRIGAGIERLARDFSVEITRQWPGDEGITTLQPRIKNGSKVEVLIGDELVITGWVEATPVRYDARSVSTGIAGRSLTADLIDCAAEPTQFNGRSLVQIAQALAAPFGIEVVNNGAPSGVIPDVQPDHGETVIEVINKILGQQQALAYDDPHGRLVIGGIGSTRAHTALVLGENILSCDTEKSIRERFSVYQVAGQRAGNDDDFGEATTTALRARTEDAFIARYRPMYIRQTGQATGAGCIARADFEARQRAARTDETTYVVQGWRQGNGTLWQPNQRVIVFDPVCGFDNTELLVSEVTFTQDQNGTLTEIRVGPPDAYLPEPEAPGARKKKKARVQEDPF >CP029164|355346:412633|391559_392021_-|AWH68243.1|lysis|DBSCAN-SWA MKMSYWALILTFIACIAGGLVWSANHYHNKAIEYKKQRDENAMALDSAMATISDMQKRQRDVAELDARYTKELADANATIESLRADVSSGRKRLQVAATCAKSTTRASGMGDGESPGLTADAELNYYRLRSGIDRITAQVNYLQEYIRTQCLR >CP029164|355346:412633|396751_397078_-|AWH68251.1|DBSCAN-SWA MTTLTQCQQQVLDMLISYQKERGFPPTNQEVATMLGYRSVNAAVEHLRALEKKGVITIKRGVARGITLHTAVKDDDSEAVGIIRSLLAGEENARLRAAHWLHERGLKV >CP029164|355346:412633|403862_404687_+|AWH68261.1|DBSCAN-SWA MSQNLDATAINQIHALISAQGVNEIISKIGADAVALPENFRIHDLEKFNLNRFRFRGALSTASIDDFTRYSKDLADEGTRCFIDADNMRAVSVLNLGTIDEPGHADNTATLKLKKTAPFSALLSVNSERNSQKSLAEWIEDWADYLVGFDANGDAIQATKAAAAIRKITIEANQTADFEDNDFSGKRSLMESVEAKTKDIMPVAFEFKCVPFEGLKERPFKLRLSIITGDRPVLVLRIIQLEAVQEEMANEFRDLLVEKFKDSKVETFIGTFTA >CP029164|355346:412633|398218_399160_-|AWH68254.1|DBSCAN-SWA MSTKLTGYVWDGCAASGMKLSSVAIMARLADFSNDEGVCWPSIETIARQIGAGMSTVRTAIARLEAEGWLTRKARRQGNRNASNVYQLNVAKLQAAAFSQLSDSDPSKSDASKSDPSKFDASKSGKKAGFHPSESGGDPSVKSKHDPSDKKPSRPDASQPDTQKAEQDFLTRYPDAVVFSPKKRQWGTQDDLTCAQWLWKKIIALYEQAAECDGEVVRPKEPNWTAWANEIRLMCVQDGRTHKQICEMYSRVSRDPFWCRNVLSPSKLREKWDELSLRLSPSVSTYTEKREDPYFKASYDNVDYSQIPAGFRG >CP029164|355346:412633|371027_371546_+|AWH68215.1|tail|DBSCAN-SWA MPFARYFCIFINVGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFDKVKYPHLATAYPSGKLPDLRGEFIRGWDDGRGIDAGRALLSIQTGMLEKHRHIVVANDRYDSKEEWELATIFRRAYTQGRGLDAADAGGTLIPSPTLHTRGSIGNTGGSETRPRNIAFNYIVRAA >CP029164|355346:412633|389939_390434_-|AWH68240.1|terminase|DBSCAN-SWA MPHMAGTAGRSGRRPKPTARKALAGNPGKRALNKDEPVFTPIKGVEPPEWFAEENLPLATIMWQLTTKELCGQGLLCVTDLAVLERWCVAYEFWRRAVKNIASQGNTITGAMGGRVKNPELTAKKEQESEMSSTGAMLGLDPSSRQRLIGLAGKKKATNPFLTI >CP029164|355346:412633|384174_384585_-|AWH68234.1|head,tail|DBSCAN-SWA MKIRQAQTSATYILPDPGELNKRVLIRQRVDMPADNFGVEPQYPVAFRAWAKVIQTSATTWQETAQTGDAITHYITIRYRRGITADYEVVCDDSVYRVKRQRDLNGARRFLLLECTELGEFTQSHGGSNGDSLFSR |
77 | Shigella_phage(36.67%) | portal,tRNA,plate,protease,terminase,holin,tail,integrase,lysis,head,capsid | attL 349726:349741|attR 384062:384077 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
934714 : 943640
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >CP029164|934714:943640|DBSCAN-SWA TATGCAGAAAAATGTGACTCCCGGCAGGCGAAAAGGCTGCCCTAATTATCCTCCCGAATTTAAACAGCTGCTCGTTGCTGCCTCCTGTGAACCCGGGATATCCATCTCAAAACTTGCTCTTGAAAATGGCATTAACGCCAATCTGTTGTTCAAATGGCGACAACAATGGCGCGAGGGAAAGCTGCTATTACCTTCTTCAGAGAGCCCCCAGCTACTTCCTGTGACTCTCGATGCAGCTGCCGAACAGCCAGAATCGCTCGCAGAGGACCCGGAAACCCTCAGTATCAGCTGTGAGGTAACGTTCCGGCACGGGACGCTCCGCTTCAATGGCAATGTCAGCGAAAAACTCCTGACTCTGCTGATACAGGAACTGAAACGATGATCCCGTTACCTTCCGGGACCAAAATTTGGCTGGTTGCCGGTATCACCGATATGAGAAATGGCTTCAACGGCCTGGCTGCGAAAGTACAGACGGCGCTGAAAGACGATCCCATATCCGGCCATGTTTTCATTTTCCGGGGCCGCAGCGGCAGTCAGGTTAAACTGCTGTGGTCCACCGGTGACGGGCTGTGCCTCCTGACCAAGCGGCTGGAGCGTGGGCGCTTCGCCTGGCCGTCAGCCCGTGATGGCAAAGTGTTCCTTACGCAGGCGCAGCTGGCGATGCTGCTGGAAGGTATCGACTGGCGACAGCCTAAGCGGCTGCTGACCTCCCTGACCATGCTGTAAATCTCTTTATCCTGGTTGTCACAGAATAAGCCCGGTAAAATACGGGCTTATGAACGACATCTCTTGTAAGCGTCAACGGAGCACCGTATTGACGCTTATTTATTGGTGAGTACTACGTTCCATGGCAGGAGTTCGTCAACACAGTTGGAGGGCCATTCCGGCAGTACGCTCAGAATATGGCGCAGATACGCTTCCGGATCGATACCGTTCAGACGGCAGGTGCCGATCAGCCCGTACAGCAGTGCTCCACGCTCGCCGCCGTGATCGCTACCGAAGAACACGTAATTTTTCTTTCCGAGACAGACTGCACGAAGCGCTCTTTCCGCTGTGTTATTGTCCGCTTCTGCCAAACCATCATCACTGTAATAACAGAGGGCGTCCCACTGATTCAGTACATAGCTGAACGCTTCGCCCAGTCTGGATTTTTTCGACAGCGTGCCATTCTTCTCCACCATCCATTCATGCAGCGACGTCAGTAACGCTTTGCTTCGCTGCTGCCTGGCTGCAAGACGCTCTGACTCTGGTAATCCCCGTATTTCATCCTCGATGGCGTACAGTTCACTGATTCGCTTCAGGGCTTCTTCTGCCGTCGCACTTTTGCTGCTGATGTATACATCGTGGATTTTTCGCCGGGCATGGGCCCAGCACGCAACTTCTGTCAGTGCACCACCTTCACGTTCTGCACTGAACAACCTGTCGTAACCTGTGAACGCATCCGCCTGCAGGATACCCCGGAAGGGGCGAAGGTGTTGCTCCGGGTGTTTCCCCTGCCGGTTCGGCGAGTACGCGAACCAGACCGCTGGAGGAGATGACGAACCCACATTGCGATCATCCCGGACATACGTCCAGATACGCCCTGTTTTCGCCTTTTTCTGACCCGGTGCCAGTACCTTTACCGGTGTGTCATCAGTGTGAACCTTGCGGGTATTCATTACATAACGGTACAGGGCATCATTCACCGGTGTCATTAACTGGCAGCACGCGTCAACCCAGTTGGAGAGTAAGGCCCGGCTCAGTTCGACACCCTGGCGGGCAAAGATTTCACTCTGACGATACAGTGGCAGATGTTCGCAGTATTTTCCCGTTAACACGCGGGCAAGTAATCCGGGGCCCGCGATACCACGCTCTATCGGGCGGGACGGCGCCGGTGCTTCAACAATACAGTCACATTTTGTACAGGCTTTTTTTACCCGTTCTGTGCGGATCACTTTCAGGGCACTGCTCACCAGTTCCAGCTGTTCAGCGCTGACTTCCCCCAGATAATCCAGCTCACCGCCACACTCCGGGCAACAGCTTTCTTCTGGCTCCAGGCGGTGTATTTCACGGGGAAGGTGTGCCGGTAACGGACGACGATGGCGCGACTGTCGCAACTGGCGGGGAACCTGAGGATCGTCTTCCCGCCCACTGTAACGATCGCTGTCCTGTTCACGTTGTTTCAGCAGAGCCTCAGCCAGTTCAACTTCACGACGCAGTTTTTCAGAACGGGTACCGAACAGCATCCGGTGCAGTTTTTCTATCTGAGCCCGCAGATGTTCTATTTCCCGTTCATCTTCTTCGATCTTTTCTTCGGCACGTGCCAGTGCAGAGCGCAGGAAGGCCTCCGTCTCTTCAACCAGACTCAGTTGCTGGTCTTTCTGACGGAGCTGGCTTTCCAGTTCTGCAATGCGAATGAGGTATTTCTGACTCATGGCCGTTTTTATAATGCGGCCAGGCGTTTTTTACAACATTGTCAGTGCGTTAAGGCGGAATGTTTTTGGCTGACGCCAGTCCAGCTTATCGAGGAGCATTGCCAGTTGCGAGCGGGTAATGGATACCTTGCCGTCACGTACCGCAGGCCAGATAAACTGGCCTTCCTCCAGGCGTTTGGTGAACAGGCACAGACCATCAGCATCAGCCCAAAGAATTTTAACGGTGTCACCCCGTCGGCCACGGAAGATAAACAGGTGACCGGAGAAGGGATTATCATTCAGCACATGTTGTACCTGTTCTCCCAGTCCGTTGAAGGATTTACGCATATCGGTAACGCCGGCAACGAGCCAGATACGGGTACCTGATGGGAGTGAGATCATCTTCCCCTCCCGGTCAGTTCACGGATCAACACCGTGAGCAGCTCTGGCGATGGATTTTCCAGCGTCATGTTACCGTGACGGAATTCCACCTTGCAGGAACTGGCACTGACTCTGGTCTGAGTGGAAGTGGATAAAGACGGCGCAATGGCCGCCACAGGTTCTTTCTGCTCATCCGGCGTTATTTCTACAGGTAATAATTCAACGCCAGTGTCAGAAGAGGTCGTTACCGGAAGACGCCGCGAAACACGCCCTTCGTTCTGCCAGAGCCTGAGCCATTTGAAAATAACATTATCATTGACGCCATTTTCACGTGCAATCTGTGCAACACAAGCTCCAGGTTGTGATGCCAGTTCCACCATACGAAGTTTGAATTCATTCGAATAGTTTTTACGAGGTTCTTTTCGCCAGTCCTGTAATTCCATACTTAGATGTCCGTCTATATCAGATGGGCGTCTAAGTTACCAATTCTCGTCTGATGGCTACATACGGCGGTCAGTTTACGCTTACCTTTATCCCGACAGACGACGTATATTTACCAGTAAGGCCAACCCGGTTGCGCCCTTTGAACCAGAAGTCAAACGTCAGGCTGTCATGGCACTCTGTACACGACAAGTATCAGCCAGTGAAATCGCCAGGCGTATTGGTGTCAGTCGTGCGGTATTGTATAAATGGAAAGATAAAATTATCGGCAACAGTGCTTACCAGACTATGCGTAAACATAACGAACCTTCCCTGGAGGCAGAACGCGATGCGTTGCGGGAGGAAGTCGCCCGACTGAATCAGGAAATACGCCGCCGGCAGATGGAGCTGGATATTCTGAAAAAGGCGGAGGAAATCATAAAAAAAGACCCGGGCATCAGTATCAGTCACCTGAACAACAGAGAGAAAACGAAGATCGCTGATGCCCTGAGACAAACATATCCCCTGACAGAATTACTGCATGTTCTGGGCCTTGCCCGTAGCAGTTATTTTTATCACCGGGCTGCACTGAAAGCCGGTGATAAATACGCCACGATACGTACGATGCTGACAGATATATTTAACAGTAATTACCAGTGTTATGGCTATCGTCGCCTGCATGCGATGCTCAGGCATGAGGGGGGGCGGCTATCAGAAAAGGTTGTACGCAGACTTATGGTGGAAGAACAGCTTGTCGTCAGCCGTAACCGTCGTCGCCGCTACAGCTCATATTGCGGAGAAATCGGACCGGCTCCGGATAACCTTATCGCCAGAGATTTTAAGGCGGAGCAACCTAATCAGAAATGGCTGACAGATATCACGGAGTTCCACCTCCCTGCAGGTAAAGTCTGGCTATCATCGGTGGTGGACTGCTTCGATGGAAAAGTTGTGAGCTGGTCTCTCAGTACACGCCCCGATGCTGAACTGGTCAACACTATGCTGGATAGCGCTGTCGAAACGTTAAATGCTGGCGAACGACCGGTGATACACAGTGACAGAGGTGGGCATTATCGCTGGTCAGGCTGGCTGGAAAGAGTGAATGCAGAAGGTCTTATTCGCTCAATGTCCCGCAAAGGATGTTCACCTGATAATGCGGCATGCGAAGGCTTTTTCGGCAGACTGAAAACGGAAATGTATCATGGGCGTAAATGGTCGGGCATCACGCCAGAAAAGTTCATGCAGCAAGTGGATACTTACATCAGATGGTATAACGAGCGGCGTATAAAATTATCGCCGGGTGCAGTCAGCCCCAAAATGTACCGCCAACAATGCGGGCTGGAATGATAAAGCAGTCCAGGAAATCGTCCGCATCCCCAGAACGGTCAAACATCGTGGCGTTGACAACGGGTTCCTGAGTCCGGTGAAGCGCCGGTTACTGGAAAATGTTGTGTACGTCATAACAGAAACGGAGAGAAAAACGGCGATGCAAATCAGGCGGAGAAGGGTGTATCAGCGGTTACTGAAGGTTGACCCGCTGAAGTGCATCCTGTGCGGAGGTCAGATGCGGTTTACGGGGCTGAAGCGGGGCTACCGTCTGGCAGAGCTGGTCATGATGCATGAGCCACTGGCACGACAGCGGGTATACAGCTGAGAGCCGCAGAGGGGAAGTTGCGTCCATTTTTTGAGGAATGGAGCAAAAAATCATCACAGATATAAAAAATCAATCAATGAAAGCCATTTAATTGCCGGTGCATTGTGGATGCAGTCCTCATGGTACGTAACCAGAGTTGAATAAACATCCATTTTTTTGTGTTTTTTTAACCTCCCCGGGTTTTTAATTCCTGTCGATTAAAAGTGTTAGATTATTTGGTTAAGAGTTCTCTCGTCCTATTTATTCAGGCGTTATCACGGATATGCAGATCGCTAAGTCAGTTGTAAGAAAACCAGACAGTGCACATCATTATACCTAGGTCTTCTCTGGAAAAATACTTAGGAATAATTTATGTAAAACAGGTTGTCGTCTTCATGATTTAAACGCAGTACATATATATACTCTATAATAAATACAGGGAGTATCACGAGACGTCTGGATAAGAGGAATAACCAGAATAAATTGGTAAGGAATAAAGACACTTATTTCTATTGAGAAAAAACATTCATCCTGATGTGATTAGTATCAATGAAATAATGTGCTTGTATGGTTGATAAAAAATGCATCACTAAAGAAAAAACAGTATGAATAAGAATATACGAATTTTACAATTTCTGGTCAGTATACTTTATTCTGTGCAGTCTCATTTTTCTGGTGCGCAAACAATACAACTGAATGGTAATGGTATACCTGAAAGTATAACCAGGAGTATTACAGGTGTTGATGGAAACGCGGCGCTTAATATCAGCGTGCCGTATAAAACAAGTTATACTCAAAATATACTATCTGTTGAAAGCAGTATTAACATCAAAGGAGGAACAAGCAATACATCAATCGGTGGTGCAGGTGTCTACGGTGAAAACTTTACGCTAAATAATAATGGTAGTGTTTGGGGAGGAGATGGATATAATGGTGGAATTGCTGTTAGTGGCAACAAAATATCTATAAACAATTACAGAAATGTATATGGGGGTAATGGTCTTGGTGGCTCAGGAAGTAGCGGAGGTGCAGGGTTAAGCGGGGATGATATTATAGTTGATAATTACAGAAGTATATACGGAGGTGATGATGTAGGTGGGACAGGTGGTTCCGGTGTAACCGGTAGCAATATTACAGTGCATAATTCCGGAGGAATATTGGGCGGTAATGGCGTAAACGGTGGTGATGGTATTAATGGTAGTAATCTTTTCATTACTAACGACAACATGATATCTGGAGGATATGGAATAAAACAAGGGGGAGATGCTATTTCTGGAAATCAAATCACTTTGAATAATAACGGTATTGTTCAGGGGGGATATGGCCCCGACGGTGGTTGCTCTGTTTATGGAGAAGATATCCATATTAATAATCATGGTAATCTTTCAGGATTATATAATAGCCAAAAAGATGCTTATAATACATCAATAATTTTTTCTGGCGGGTATAATTCATTAGATATTTATTCAGATTCTGTGATTAATGGTGATATTAAACTAGCTAGTATACCTGTTAATGGTACAAATGAATTAATTATTAAAAACATCAATAACGCAACAGCAATTAATGGTGGGCTAATGATTGGGAATGGCTCATCTGTTTATCTGTCAGGCAAGAACTCCATTTTTAACGGAAATATAAGTATTGATGAAGACGCATCTATGAACCTGTCTGTAGGAAATGCTAATGTTCACGCAAATACTATTACATTAAAAAGTGATTCATGGCTTAATATAGACACATCAATTAAGAACTGGACTCAGGACTATTACACATTATTGTCGTCAGACACAGGTATCTCGATCGCTGATAATAGTCACATTGTACAATACAATGTATTACTGACAGAAGGTGCTGAAAGTTATGTTTATACGTCTTTAAATGACGACGATAACAAACTGATATCCATGCTGAGATGGAATAATACAAAAGGGATGGGATATGGAACCTTTAATATAGAAAAAGATGCCACTCTGAACATAGGCGTTTCTCTTTCCGATAATCTTTCACCTTTATTATATGATGGCTGGGACGGCAAAAGTCTGACAAAATCAGGTAATGGTACTCTTATACTTTCTGCAACAAACAATTATACAGGAAATACAGAGGTTAAATCTGGCGTATTAATTCTTGCTGCACCTGATGCTCTTGGTCGGACTGAGTATTTATATTTATCCCGTGGCGCAGAACTGGATATGAATGGGTATCCTCAGACAATAAGCAAACTACTGACGGCTGCAGGCTCTGTGCTGAACATTCATGGCGGAAGTCTGATACTGAATAATGGAGGAGAATCTGCAGGTACTATTGCAGGGGATGGTTCTCTGAACATAAATGGGGGAATGCTTGATATAACGGGTAATAATCGTAATTTTTCCGGTGTTTTTACCGTGAATAAGGGGGCTCATTTGGCTGTATCCACGGCTGATAATCTGGGGACAGCCTTTGTTGATAACTATGGCACATTAACTCTGAACAGTACATCAGCATGGCAGCTTACCAACAATATCAGTGGTTATGGTAATGTTCGCAAGACGGGAGCAGGTGCACTGAACATTAGCGATAACGCAAAATGGACCGGGATGACAGATATTATTCAGGGGACAGTGATACTGGGGAACGCAGATTCACCGGTGATGCTCGGCAGTAACCAGGTCATTGTTGAAGAGCAGGGCAAACTCTCCGGGTTTGGGGGCGTTGCAGGAAATCTGAGCAATAGTGGTATAGTCGATCTCACTACATATATGCCGGGTAATATACTGACTGTCGGAGGGAATTACACTGGCAGAAATGGACTTATTCTCCTCCAGACAGAAACAGGTGGTGACAATTCGAAAACAGATCGTCTGGTGATTAAAGGTAATGCCAGTGGCCGTACCCGTGTCGCTGTTACTCAGGCCGGTGGTACTGGTGCAGAGACACTTAATGGGATTGAAGTGATTCACGTCAGTGGCAATGCTGATAATGCTGAATTCATTCAGACGGAACGTATTACAGCCGGAGCTTATGATTACATACTGAAACGTGGTCAGGGGATTAACAGCACTAACTGGTATCTGATTAGCAGAAAAGACATTCCTGTACCACAACCTGAAGCTGTACCGGAAAGCCATGATAATAATTTGCGTCCTGAGGCAGGTAGCTATGTTGCCAGTATTGCTGCTGCAAATAATCTGTTTGTAACGAATCTGTATGAACGACAGGGACAGGAGTTGTATATCAGCCACATGACAGGAGAAGAAAATGAAGCAGGTATCTGGATGTATAATAAAGGAAAACATAATCGCTGGCGTGACAACAGTAGTCAGCTGAGAACCCGGGGGAATAGCTACGTTGTGTTAATAGGGGGAGATATAGCTCAGTGGAGCCTGAATGGTACCGATCGCTGGCATACAGGTATGATGGCTGGCTATGGTCATAATAATAACAGTACGAATGCCCTGAGCACCGGATACCATTCGGAAGGAAGAATGAATGGATACACAGCGGGTCTTTATGCAACATGGTATGCCAATGATGAAACACACAATGGTTCTTATCTTGATAGCTGGCTGCAATACAGCTGGTTTGATAATCATATAAATGGAGAACGGCTGCCTGCTGAGTCATGGAAGTCAAAAGGGTTTACGGTATCTCTGGAAGCAGGATATTCATGGAAGGCTGGAGAGTTTACCGACAATTACAAGGGAAGTCATGAATGGTATGTTCAGCCGCAGCTTCAGGTTGTCCGGATGAATGTAAAATCAGACAAATATCATGAAAGTAACGGAACCAGTATTGAAAATACCGGTAACGGAAATATTCTCACCCGCCTGGGAGCAAGAACATGGCTTACCAGCAAAAACGGTAAAAATACGCGGTATGCGGTTCCGTTCAGACCATTTGTGGAGGCACACTGGTTGCACAATAGTCGTGTTTTCGGCACCAGTATGAATGGTGTAAGTATATACCAGGATGGTGCGCGTGATATCGGAGAAATAAATGGTGGTGTTGTGGGAATGATAACACCAGAAGTAGCATTCCGGGCTGATGCAGGCATTCAACTTGGAGAACATGGATACCATAATACATCTGCCATGTTGAGTGTGGAATATCGTTTCTGA
Protein sequences of DBSCAN-SWA_3 >CP029164|934714:943640|935091_935439_+|AWH68713.1|DBSCAN-SWA MIPLPSGTKIWLVAGITDMRNGFNGLAAKVQTALKDDPISGHVFIFRGRSGSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKVFLTQAQLAMLLEGIDWRQPKRLLTSLTML >CP029164|934714:943640|940142_943640_+|AWH68717.1|DBSCAN-SWA MNKNIRILQFLVSILYSVQSHFSGAQTIQLNGNGIPESITRSITGVDGNAALNISVPYKTSYTQNILSVESSINIKGGTSNTSIGGAGVYGENFTLNNNGSVWGGDGYNGGIAVSGNKISINNYRNVYGGNGLGGSGSSGGAGLSGDDIIVDNYRSIYGGDDVGGTGGSGVTGSNITVHNSGGILGGNGVNGGDGINGSNLFITNDNMISGGYGIKQGGDAISGNQITLNNNGIVQGGYGPDGGCSVYGEDIHINNHGNLSGLYNSQKDAYNTSIIFSGGYNSLDIYSDSVINGDIKLASIPVNGTNELIIKNINNATAINGGLMIGNGSSVYLSGKNSIFNGNISIDEDASMNLSVGNANVHANTITLKSDSWLNIDTSIKNWTQDYYTLLSSDTGISIADNSHIVQYNVLLTEGAESYVYTSLNDDDNKLISMLRWNNTKGMGYGTFNIEKDATLNIGVSLSDNLSPLLYDGWDGKSLTKSGNGTLILSATNNYTGNTEVKSGVLILAAPDALGRTEYLYLSRGAELDMNGYPQTISKLLTAAGSVLNIHGGSLILNNGGESAGTIAGDGSLNINGGMLDITGNNRNFSGVFTVNKGAHLAVSTADNLGTAFVDNYGTLTLNSTSAWQLTNNISGYGNVRKTGAGALNISDNAKWTGMTDIIQGTVILGNADSPVMLGSNQVIVEEQGKLSGFGGVAGNLSNSGIVDLTTYMPGNILTVGGNYTGRNGLILLQTETGGDNSKTDRLVIKGNASGRTRVAVTQAGGTGAETLNGIEVIHVSGNADNAEFIQTERITAGAYDYILKRGQGINSTNWYLISRKDIPVPQPEAVPESHDNNLRPEAGSYVASIAAANNLFVTNLYERQGQELYISHMTGEENEAGIWMYNKGKHNRWRDNSSQLRTRGNSYVVLIGGDIAQWSLNGTDRWHTGMMAGYGHNNNSTNALSTGYHSEGRMNGYTAGLYATWYANDETHNGSYLDSWLQYSWFDNHINGERLPAESWKSKGFTVSLEAGYSWKAGEFTDNYKGSHEWYVQPQLQVVRMNVKSDKYHESNGTSIENTGNGNILTRLGARTWLTSKNGKNTRYAVPFRPFVEAHWLHNSRVFGTSMNGVSIYQDGARDIGEINGGVVGMITPEVAFRADAGIQLGEHGYHNTSAMLSVEYRF >CP029164|934714:943640|935534_937127_-|AWH68714.1|transposase|DBSCAN-SWA MSQKYLIRIAELESQLRQKDQQLSLVEETEAFLRSALARAEEKIEEDEREIEHLRAQIEKLHRMLFGTRSEKLRREVELAEALLKQREQDSDRYSGREDDPQVPRQLRQSRHRRPLPAHLPREIHRLEPEESCCPECGGELDYLGEVSAEQLELVSSALKVIRTERVKKACTKCDCIVEAPAPSRPIERGIAGPGLLARVLTGKYCEHLPLYRQSEIFARQGVELSRALLSNWVDACCQLMTPVNDALYRYVMNTRKVHTDDTPVKVLAPGQKKAKTGRIWTYVRDDRNVGSSSPPAVWFAYSPNRQGKHPEQHLRPFRGILQADAFTGYDRLFSAEREGGALTEVACWAHARRKIHDVYISSKSATAEEALKRISELYAIEDEIRGLPESERLAARQQRSKALLTSLHEWMVEKNGTLSKKSRLGEAFSYVLNQWDALCYYSDDGLAEADNNTAERALRAVCLGKKNYVFFGSDHGGERGALLYGLIGTCRLNGIDPEAYLRHILSVLPEWPSNCVDELLPWNVVLTNK >CP029164|934714:943640|937157_937508_-|AWH68715.1|DBSCAN-SWA MISLPSGTRIWLVAGVTDMRKSFNGLGEQVQHVLNDNPFSGHLFIFRGRRGDTVKILWADADGLCLFTKRLEEGQFIWPAVRDGKVSITRSQLAMLLDKLDWRQPKTFRLNALTML >CP029164|934714:943640|934714_935095_+|AWH72442.1|DBSCAN-SWA MQKNVTPGRRKGCPNYPPEFKQLLVAASCEPGISISKLALENGINANLLFKWRQQWREGKLLLPSSESPQLLPVTLDAAAEQPESLAEDPETLSISCEVTFRHGTLRFNGNVSEKLLTLLIQELKR >CP029164|934714:943640|937504_937930_-|AWH68716.1|DBSCAN-SWA MELQDWRKEPRKNYSNEFKLRMVELASQPGACVAQIARENGVNDNVIFKWLRLWQNEGRVSRRLPVTTSSDTGVELLPVEITPDEQKEPVAAIAPSLSTSTQTRVSASSCKVEFRHGNMTLENPSPELLTVLIRELTGRGR |
6 | Stx2-converting_phage(66.67%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
2046461 : 2053601
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >CP029164|2046461:2053601|DBSCAN-SWA ATTAACTCCTTAATTCCGCAATTTCACCTGCGGTCAGATAACGGATCGGGCGGTCACCGAGAATAAAAATCAGCTTTGCCGTTTCCTCCAGCTCTTCCATATTGTTGGCGGCTTCTTGCAGGCTTTCACCACAAACCACTGGGCCATGATTTGCCAGTAAAAAAGCCTGATTGTCTGCTGCCAGTTCCGCCAGATCCTGTGCGATGCGTTTATCGCCCGGTCGGTAATAAGGCACCAGCGGGATATTTCCCATCCGCATCACCACGTATGGTGTGAACGGACGAATAACGTTGCTGCTGTCCAGCCCTTGCAGGCAGGAAAGCGCCGTCGACCATGTGCTGTGCAAATGCACCACCGCTTTACAGCGCGGATTGTTGCGATACAGCGCCAGATGAAAGAGCACCTCTTTCGAGGGTTTGTCACCACTTAACCATTCGCCATCCGCGGCGACTTTGGAAAGCCGCTGCGGATCGAGATTGCCCAGGCATGAACCTGTCGGTGTCGCCAGTAAATTCCCGTCAGGTAAAAGCAGCGACAGATTGCCAGCCGAACCGGTTGCATAGCCGCGCTGAAAGAATGAACTGGCAATCCGCGTCATCTCCTCTCGCAAAGACTGCTCTACTTTTGCGAAATCGCTCATGATAAAAACTCTCTTTGGGCTCGTGAAAAAAAGGCTTCATCACCGAAGTTGCCAGATTTAAGGGCGAGTGAGACAGGCTTATCCAGTGCGTTTACCCACGGCACGCCGGGGGAAATGGTTGGGCCAATATGAAACCCTTTTATTCCCAGGCTCTGTGTGACTACGCCGGAGGTCTCACCGCCTGCGACAATAAAGCGTGTCACGCCTTCCGCTGCTAACCGCGCCGCTAGTTGAGAAAACAGTGTTTCTACTGCCTGACTGGCTTTTTGTGCACCGTATTGCTGTTGAATTGCTGCCAATGCGTCAGTGCTGGCGGTGGCAAAAACCAGTGGAGCAAGTACACTTTCCTGGCCCAGAACCCACTCTGCCAGCTCGTGTGCATAAGCGGCCAGAGTTTCGGTTGAGAGGCAGCGTGCCACATCAACCTCACGGGCTGGTGCAATTTGACGGTAATGTGCCACCTGGCGGTTGGTCATTTGAGAGCATGAACCGGAGAGCACTACGCCGCGCCCAGCGAGCGGACGCCCTGCTTCGCGAGCCTGGTTACCGTTTTCTTGCGCCCACTGCCGGGCCAGGCCAATCGCCAGACCAGAACCGCCCGTTACCAGTGGGGCATCGCGCAAGGCTTCCCCCTGAATTTCCAGATGGTGTTCGGTCAGCGCATCAAGCACCGCGTAGCGATAGCCCTCTTGCTGTAAGCGAGCCAGCTCCTGACGAACGGCATCCACACCTTGTTCGAAAACATGTGCCGAAACGACGCCGCAGCGCCCTGTGGATTGCGCTTCAACCAGACGGGGAAGATAGCTGTCGGTCATGGGATTTACCGGGTGATGGCGCATTCCGGATTCGGCCAGCAGTTGATTCATTACGAACAAATACCCCTGATAAACCGTACGTCCGTTGACCGGCAGGGCCGGAGAGAAGACCGTAAACGGCGTGTCGAGAGCATCCATTAATGCATCGGTAACCGGGCCGATATTACCTTTCGCAGTACTGTCGAAAGTAGAGCAGTATTTGAAATAGATCTGTTTGCAACCTTGCTGTTGCAACCAGCTCAGAGCCGCCAGCGATTGCTGTGTGGCTTCAACCACCGGACAGGAACGCGTTTTCAGGCTGATCACCAGTGCGTCGATTGCTTCCGGCATTTTACCTGTTGGAACACCGTTAATTTGTACCGTTGGTAGACCGTTTTCCACCAGAAAACTGGCGATATCCGTCGCGCCGGTAAAATCATCGGCGATAACGCCAATCTTGATCATGATTTCGCTCCCGGTAGGGTGATGCCAGAGAAAATCTTGATAACTGCGCTATCGTCTTCTTTCCCGTAACCCGCGTTACTGGCGCTGGTGAACATATTCAATGCTGTTGAGGCCAATGGCAGCGGGAAGTGCAGGGCTTTGGCGGTATCGGCAACCAGACCAAGATCCTTAACAAAAATATCGACGGCTGAATGCGGGGTGTAATCGCCATCCACCACATGACGCATCCGGTTTTCGAACATCCATGAATTTCCGGCGGCATTGGTCACGACGTCATACATCACATCCAGCGGGATCCCCGCACGGGCTGCAAGTGCCATCGCTTCGGCTCCGGCAGCAATATGTACGCCCGCTAACAACTGGTGAATAATTTTTACGGTCGAACCCAGTCCCGGTTCTGAACCGATGCGATAAACTTTTCCGGCAACGGCTTCCAGCACGGGTGCCAGTCGTTCAAAGGCAATATCGCTACCGGAGGCCATGACAGTCATTTCACCGTTAGCGGCTTTTACTGCACCACCAGAAACTGGCGCATCCAGCATTTCCAGATCGAATCCAGCCAGAGCGGTAGCAATTTCTTGCGCATCAGCACTGGCGATAGTGGAAGAAACCATTACTGCCGTACCGGGTTTCAGATGTTGTGCAACGCCTTTTCCACCAAACAGCACCTGTTTAACCTGGGTCGCATTGACCACCAGCACCAGCAGAGCGTCCAGTTTTTCGGCAAACGTCGCGGCGTTATCAGAAACCCCGCAAGCACCAGCCTCTTTCAACGTAGCGCAGGCATTGCTGTTCAGGTCTGCGCCCCAGGTAGAAAGACCTGCGCGGACACATGACAGTGCTGCTCCCATTCCCATTGACCCTAAGCCAACGATACCGACATGAAACTCCGATCCCGTTTTCATCTGCTCTCCTTGTTAATTTAAGTGATATTTTGTTTGATGTTGTGAATATAAGCGCGGAAAGATAACGATATGGTGAGCTGATTCACAAAAATTAACATTGTGTGTTATTTTATGTGAACTAAGCGTTAGTTGCGCCGCGCACGTTTCGCAGGCAAATAGCGTAGAATGTCAGCAGGACAAAGGAAGGAGCAAAAGTTGATACCCGTAGAGCGTCGCCAAATCATCCTTGAGATGGTAGCTGAAAAAGGCATTGTTAGTATTGCTGAACTAACGGACAGAATGAATGTGTCACATATGACCATTCGTCGGGATTTACAAAAACTGGAGCAGCAGGGAGCCGTTGTGCTGGTGTCCGGGGGCGTCCAGTCTCCGGGACGGGTGGCGCATGAACCTTCTCATCAGGTAAAAACTGCGCTGGCAATGACGCAAAAAGCGGCTATTGGCAAGCTGGCGGCAAGTCTTGTTCAGCCGGGACGTTGTATCTATCTGGATGCGGGAACGACCACGTTAGCGATTGCACAGCATCTGATTCACATGGAGCCACTCACAGTGGTTACTAACGACTTCGTTATTGCGGACTACTTGCTCGACAACAGTAATTGCACAATTATTCACACTGGTGGCGCAGTGTGTCGGGAAAACCGTTCCTGTGTTGGGGAAGCCGCTGCGACCATGCTGCGTAGCCTGATGATTGATCAGGCTTTTATTTCTGCATCGTCATGGAGTATGCGGGGGATCTCTACGCCAGCAGAAGATAAAGTCACGGTGAAAAGGGCGATTGCCAGTGCCAGCCGTCAGCGAGTTTTGGTCTGTGATGCGACGAAATATGGACAGGTGGCGACATGGCTGGCGTTACCATTAAGCGAGTTTGATCAGATCATTACTGACGACGGTCTGCCGGAGAGTGCCAGCCGCGCGCTGGCGAAGCTGGATCTCTCTTTGCTGATAGCGAAAAATGAATAATGACCTGCAATAACGCCGGGTTACTCATGCTTCACAGAAGAAGCATGAGACTACTTTATTTTATAAAATGACAGCCGCCCGCTTTTCGGCGATCCGGTATCTATATAAATCTGGTTAGCGAACGTCTGAATGTTATCAAACATCATATGTCCAAATATAAAATAATCAGCGCCGTTTATTTGTTGTAACTCGCCATTAAGCGATTTCTGCACACGATCAACAGGCCAGAGTAGTTCTCTCTCCGCTATTTCTTTGCCAAACTGATACTCCTTCCCCGGATAATCTGCATGTGCGATGACATATTTTATGGTGTCGTTAGTGATTTCAATAATATGTGGAAGGTGATGGAATTTCAGCAACAGATCTATTGCCTCTTGTTGCTCTGAATCATTTAAATCGAAAAACCAGTCACCACCGCTGGCAAGCCACATATTGCCATCGCCGGTCGCGAACGCATCCAACGCCATCGCTTCGTGGTTGCCTTTGACGGAGATAAACCAGGGCTGGTTTAACAGGCGCAGCACGTTAAGACTCTCGGGCCCACGATCGATGTTATCGCCGACAGAAATAAGCAAGTCGGTTTCAGGACAAAAAGAGAGTTGATGTAAGCGGGATTGTAATAATTGATAGTCACCATGAATATCACCAACGGCCCATATATGGTGATAGTGATGGGCATTGATTTTTTGGTAGCGTGTAGATGGCATGGTTTTACCCTGTAAAATAAGATTTCCTATTATACAGGGTATTTTTATTTGATTCGTCAGTTGTCGTTAATATTCCCGATAGCAAAAGACTATCGGGAATTGTTATTACACCAGACTCTTCAAGCGATAAATCCACTCCAGCGCCTGACGCGGGGTGAGTGAATCTGGGTCGAGATTTTCCAGTGCTTCGACCGCAGGCGACGTTTCTTCTGGCACTGACAGCAAAGACATTTGCGTACCATCCACTTGCGTCGCGGCGGCGTTCGGCGAAATGCTTTCCAGCTCACGCAGTTTTTGCCGTGCGCGCTTAATAACCTCTTTTGGCACGCCGGCCAGAGCTGCAACCGCCAGGCCGTAGCTTTTGCTCGCTGCGCCATCCTGCACGCTGTGCATAAAGGCAATGGTGTCGCCGTGCTCCAGTGCATCGAGATGCACGTTGGCGACGCCTTCCATTTTCTCCGGTAACTGGGTCAGCTCGAAATAGTGGGTAGCAAACAACGTCAACGCTTTAATCTTATTCGCCAGATTTTCCGCGCACGCCCACGCCAGCGACAGACCATCGTAAGTGGACGTTCCACGCCCAATCTCATCCATCAGCACCAGACTGTACTCGGTGGCGTTATGTAAAATATTGGCGGTTTCGGTCATTTCCACCATAAAGGTTGAACGCCCGGATGCCAGATCATCCGCTGCCCCTACGCGGGTAAAGATGCGATCGATAGGTCCAATCTCGACTTTTTGTGCCGGTACATAGCTGCCGATGTAGGCCATCAGCGCAATCAGTGCGGTCTGGCGCATATAGGTACTTTTACCGCCCATATTCGGACCGGTAATGATCAACATGCGACGCTGCGGCGACAGGTTCAGCGGGTTGGCGATAAACGGCTCATTCAGCACCTGTTCAACCACCGGATGGCGGCCTTCGGTGATGCGAATACCCGGTTTATCAATAAAGGTCGGGCAGGTGTAGTTCAGCGTATAGGCCCGTTCCGCCAGGTTCACCAGCACGTCGAGTTCCGCCAGCGCGCTCGCGCTCTGTTGCAACGCTTCCAGATGCGGCAACAGCAGGTCGAACAACTCTTCATAAAGCTGTTTTTCCAGTGCCAGTGCTTTGCCTTTTGAAGTGAGAACTTTGTCTTCGTACTCTTTTAGCTCTGGAATAATGTAGCGCTCGGCGTTTTTCAGCGTCTGGCGACGCATGTAATTGATGGGTGCCAGATGGCTTTGTCCACGGCTGATCTGAATGTAATAACCATGAACCGCGTTAAAGCCGACTTTCAGGGTGTCCAGGCCAGTACGTTCACGCTCGCGGACTTCCAGACGCTCCAGATAATCGGTCGCGCCGTCAGCCAGCGCGCGCCATTCATCCAGTTCTTCGTTATAACCAGTCGCAATAACACCACCGTCGCGTACCAGCACCGGCGGCGTGTCGATGATTGCTCGCTCCAGCAGGTCGCGCAGTTCGGCAAACTCGCCCATCTTCTCACGCAGTGCTTGTACCGGGGCGCTATCGACACCTTCCAGTTGCGCGCGCAGCTCCGGCAGTTGCTGGAAAGCGTGACGCATACGGGCCAGATCGCGTGGACGAGCAGTTCGTAAAGCCAGACGCGCCAGAATACGTTCCAGGTCGCCGACCTGACGCAGTACCGGCTGCAGCTCGGCGGTGAAATCCTGCAATGCGCCAATAGTTTGCTGGCGCTCAAGCAACACGCGGGTATCGCGCACTGGCATATGCAGCCAGCGTTTCAGCATACGACTACCCATCGGCGTTACAGTACAGTCGAGCACTGAAGCCAGCGTATTTTCCGCACCGCCGGCCAGGTTCTGAGTAATTTCCAGGTTACGACGCGTCGCGGCATCCATAATGATGCTGTCCTGCTGACGTTCCATAGTGATAGAACGAATATGCGGCAGGGTCGTGCGTTGGGTATCTTTCGCATACTGCAACAGACAACCGGCAGCACAAAGTCCGCGTGGTGCGTTCTCCACACCAAAACCGACCAGATCTCGGGTGCCAAATTGCAGGTTCAATTGCTGGCGAGCGGTGTCGATTTCAAACTCCCACAGCGGGCGACGACGCAGGCCGCGGCGACCTTCAATCAGCGACATCTCGGCGAAATCTTCCGCATACAGTAATTCCGCTGGATTCGTACGTTGCAGTTCTGCCGCCATCGTTTCGCGGTCCGCCGGTTCGCTCAGGCGAAAACGCCCGGAGCTGATATCCAGCGTGGCGTAGCCGAACCCCTTGCTGTCCTGCCAGATAGCCGCCAGCAGGTTGTCCTGACGTTCCTGCAACAGCGCTTCATCGCTGATGGTGCCCGGCGTAACGATACGGACAACTTTGCGCTCGACCGGCCCTTTGCTGGTCGCCGGATCGCCAATCTGTTCACAAATAGCAACCGATTCGCCCTGATTCACCAGTTTGGCGAGGTAGTTTTCCACCGCATGGTAGGGAATCCCCGCCATCGGGATCGGCTCTCCCGCCGAAGCACCGCGTTTGGTCAGTGAAATATCCAGCAGTTGCGATGCGCGTTTTGCGTCGTCATAAAACAGTTCATAAAAATCACCCATCCGGTAAAACAGCAGGATCTCGGGATGCTGGGCTTTCAGCTTGAGATACTGCTGCATCATGGGCGTATGGGCATCGAAATTTTCTATTGCACTCAT
Protein sequences of DBSCAN-SWA_4 >CP029164|2046461:2053601|2046461_2047100_-|AWH69733.1|DBSCAN-SWA MSDFAKVEQSLREEMTRIASSFFQRGYATGSAGNLSLLLPDGNLLATPTGSCLGNLDPQRLSKVAADGEWLSGDKPSKEVLFHLALYRNNPRCKAVVHLHSTWSTALSCLQGLDSSNVIRPFTPYVVMRMGNIPLVPYYRPGDKRIAQDLAELAADNQAFLLANHGPVVCGESLQEAANNMEELEETAKLIFILGDRPIRYLTAGEIAELRS >CP029164|2046461:2053601|2049459_2050227_+|AWH69736.1|DBSCAN-SWA MIPVERRQIILEMVAEKGIVSIAELTDRMNVSHMTIRRDLQKLEQQGAVVLVSGGVQSPGRVAHEPSHQVKTALAMTQKAAIGKLAASLVQPGRCIYLDAGTTTLAIAQHLIHMEPLTVVTNDFVIADYLLDNSNCTIIHTGGAVCRENRSCVGEAAATMLRSLMIDQAFISASSWSMRGISTPAEDKVTVKRAIASASRQRVLVCDATKYGQVATWLALPLSEFDQIITDDGLPESASRALAKLDLSLLIAKNE >CP029164|2046461:2053601|2047096_2048359_-|AWH69734.1|DBSCAN-SWA MIKIGVIADDFTGATDIASFLVENGLPTVQINGVPTGKMPEAIDALVISLKTRSCPVVEATQQSLAALSWLQQQGCKQIYFKYCSTFDSTAKGNIGPVTDALMDALDTPFTVFSPALPVNGRTVYQGYLFVMNQLLAESGMRHHPVNPMTDSYLPRLVEAQSTGRCGVVSAHVFEQGVDAVRQELARLQQEGYRYAVLDALTEHHLEIQGEALRDAPLVTGGSGLAIGLARQWAQENGNQAREAGRPLAGRGVVLSGSCSQMTNRQVAHYRQIAPAREVDVARCLSTETLAAYAHELAEWVLGQESVLAPLVFATASTDALAAIQQQYGAQKASQAVETLFSQLAARLAAEGVTRFIVAGGETSGVVTQSLGIKGFHIGPTISPGVPWVNALDKPVSLALKSGNFGDEAFFSRAQREFLS >CP029164|2046461:2053601|2050277_2050934_-|AWH69737.1|DBSCAN-SWA MPSTRYQKINAHHYHHIWAVGDIHGDYQLLQSRLHQLSFCPETDLLISVGDNIDRGPESLNVLRLLNQPWFISVKGNHEAMALDAFATGDGNMWLASGGDWFFDLNDSEQQEAIDLLLKFHHLPHIIEITNDTIKYVIAHADYPGKEYQFGKEIAERELLWPVDRVQKSLNGELQQINGADYFIFGHMMFDNIQTFANQIYIDTGSPKSGRLSFYKIK >CP029164|2046461:2053601|2051039_2053601_-|AWH69738.1|DBSCAN-SWA MSAIENFDAHTPMMQQYLKLKAQHPEILLFYRMGDFYELFYDDAKRASQLLDISLTKRGASAGEPIPMAGIPYHAVENYLAKLVNQGESVAICEQIGDPATSKGPVERKVVRIVTPGTISDEALLQERQDNLLAAIWQDSKGFGYATLDISSGRFRLSEPADRETMAAELQRTNPAELLYAEDFAEMSLIEGRRGLRRRPLWEFEIDTARQQLNLQFGTRDLVGFGVENAPRGLCAAGCLLQYAKDTQRTTLPHIRSITMERQQDSIIMDAATRRNLEITQNLAGGAENTLASVLDCTVTPMGSRMLKRWLHMPVRDTRVLLERQQTIGALQDFTAELQPVLRQVGDLERILARLALRTARPRDLARMRHAFQQLPELRAQLEGVDSAPVQALREKMGEFAELRDLLERAIIDTPPVLVRDGGVIATGYNEELDEWRALADGATDYLERLEVRERERTGLDTLKVGFNAVHGYYIQISRGQSHLAPINYMRRQTLKNAERYIIPELKEYEDKVLTSKGKALALEKQLYEELFDLLLPHLEALQQSASALAELDVLVNLAERAYTLNYTCPTFIDKPGIRITEGRHPVVEQVLNEPFIANPLNLSPQRRMLIITGPNMGGKSTYMRQTALIALMAYIGSYVPAQKVEIGPIDRIFTRVGAADDLASGRSTFMVEMTETANILHNATEYSLVLMDEIGRGTSTYDGLSLAWACAENLANKIKALTLFATHYFELTQLPEKMEGVANVHLDALEHGDTIAFMHSVQDGAASKSYGLAVAALAGVPKEVIKRARQKLRELESISPNAAATQVDGTQMSLLSVPEETSPAVEALENLDPDSLTPRQALEWIYRLKSLV >CP029164|2046461:2053601|2048355_2049264_-|AWH69735.1|DBSCAN-SWA MKTGSEFHVGIVGLGSMGMGAALSCVRAGLSTWGADLNSNACATLKEAGACGVSDNAATFAEKLDALLVLVVNATQVKQVLFGGKGVAQHLKPGTAVMVSSTIASADAQEIATALAGFDLEMLDAPVSGGAVKAANGEMTVMASGSDIAFERLAPVLEAVAGKVYRIGSEPGLGSTVKIIHQLLAGVHIAAGAEAMALAARAGIPLDVMYDVVTNAAGNSWMFENRMRHVVDGDYTPHSAVDIFVKDLGLVADTAKALHFPLPLASTALNMFTSASNAGYGKEDDSAVIKIFSGITLPGAKS |
6 | Escherichia_phage(83.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
2633863 : 2643308
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >CP029164|2633863:2643308|DBSCAN-SWA AATGATTGAATTTAGCCATGTCAGCAAACTGTTCGGCGCACAAAAAGCCGTTAACGATCTCAATCTCAATTTTCAGGAAGGGAGTTTTTCGGTGCTGATTGGCACATCTGGCTCCGGCAAATCCACCACCCTGAAAATGATTAACCGCCTGGTGGAACATGACAGCGGAGAGCTCCGCTTTGCCGGAGAAGAAATTCGCTCGCTGCCAGTACTGGAGTTACGCCGCCGTATGGGCTATGCCATTCAATCTATTGGCCTGTTTCCCCACTGGAGCGTGGCACAAAACATTGCCACCGTGCCGCAATTACAAAAATGGTCGCGGGCGAGGATTGACGATCGTATCGACGAATTAATGGCGCTACTGGGGCTGGAGTCAAATTTACGTGAGCGTTATCCACATCAGCTTTCCGGTGGTCAGCAGCAACGTGTGGGAGTAGCGCGCGCACTGGCTGCCGATCCGCAAGTCTTACTGATGGATGAACCTTTTGGCGCACTGGACCCGGTAACGCGCGGCGCGTTGCAACAAGAGATGACGCGCATTCACCGTTTGCTGGGGCGTACCATTGTGCTGGTCACGCATGATATTGATGAGGCGCTACGGCTGGCAGAACATCTGGTATTGATGGATCACGGTGAAGTGGTGCAGCAGGGGAATCCGCTGACGATGCTGACTCGTCCGGCGAATGATTTTGTCCGCCAGTTTTTTGGACGTAGTGAACTGGGTGTGCGCCTGCTTTCGTTACGTAGTGTGGCGGATTACGTGCGTCGCGAAGAACGGGCAGAAGGTGAGGCACTGGCAGAAGAGATGACGCTGCGCGATGCGCTCTCCCTGTTTGTCGCGCGGGGATGCGAGGTGCTGCCAGTGGTGAACACGCAGGGCCAGCCTAGCGGCACGCTGCATTTTCAGGATCTGCTGGTGGAGGCGTAAGCGTATGAAGATGTTGCGCGATCCGCTGTTCTGGCTCATTGCTTTGTTTGTGGCACTGATTTTCTGGCTGCCTTACAGCCAGCCGCTGTTTGCTGCCTTGTTCCCACAACTGCCACGACCCGTTTATCAGCAAGAAAGTTTTGCAGCTCTGGCACTGGCTCATTTCTGGCTGGTGGGAATTTCGAGTCTGTTTGCGGTGATCATTGGTACTGGTGCCGGAATTGCAGTCACTCGCCCGTGGGGCGCGGAATTTCGCCCACTGGTGGAAACTATTGCCGCCGTTGGGCAGACTTTTCCGCCCGTTGCGGTGCTGGCGATTGCTGTTCCGGTGATCGGCTTTGGTCTGCAACCAGCGATTATCGCCTTGATCCTATACGGCGTGCTGCCCGTCCTGCAGGCGACACTTGCCGGGCTGGGAGCGATTGATGCCAGCGTGACAGAAGTTGCGAAGGGTATGGGAATGAGTCGTGGTCAGCGACTGCGTAAGGTCGAGCTACCGCTGGCGGCTCCGGTGATTCTGGCGGGCGTGCGAACTTCGGTGATTATCAACATTGGTACGGCGACGATCGCCTCAACGGTAGGGGCCAGCACGCTGGGTACGCCCATCATCATCGGGCTTAGCGGATTTAATACCGCGTATGTGATCCAGGGGGCGTTACTGGTGGCACTGGCGGCGATCATCGCAGACCGCCTGTTTGAAAGGCTGGTGCAGGCGCTTAGCCAGCACGCAAAATAAAGGTATAACCTGCGAGCATGACGCCACCAATTCCGCCTAACGCCATAAACAGGAACAGGGCGATGACCCCAATTTTAGCTATGCGCATAATGCACTCCTTATGTTAACGAAAGGATTGTACAGTAAAGCGCATTTGTTAACGAATCATTAAATGCCGAGTGGGAAAATATCATGGCCTTGTTCCTGCCAACTGGTGAGTTGCTGCTGTTGGGCGGAGGTTCGATTTTCACCGCACCACACCAGCAATGTACGGCCTTCGAACAGTTCAGGGCGTAGTTGATTGAGCGAGTGGGCGAGGACATCAATGCGCCATCCTTGTTGACTGGCAATCCAGCCCTCCAGCCACAGACGGGTGGTATCCTGAATATTCCAGCCAACCACCAGCGCATCTTTACCCTGTTTTTTACGTGCCGAAGCCAGACAAATGGCGATGTAGTTGATCAGTACGCCGTCGAGGATCGCCAGCAGCGCCTGGAGAGTCGGTTGTTGGCACTGAAGCCGTCGGCGCAGAGGAATAAACAGATGTGTGGTGAGTGTCTGGGCGGGGTAATCCTGACCGCGCTCTTTGATCCACGTTCGCAGGCTATGCAGATTGCCGCTTTGCAGGTAGGTCAGTAATGTTTCTTGCTGATCGCGCCAGCCGTTCTGCACATCAACATTTTCATTACTGAGCAGCATTTTAACTTTGCTGACCTGCACGCCGTTGTCGATCCAGCGTTTGATCTCGCGGATCCGGTCAATATCGGCATCGTTGAACAGCCGATGACCGCCGTCTGTCCGTTGCGGTTTCAGCAATCCGTAACGCCTCTGCCACGCGCGTAACGTGACAGGGTTAATATCACAAAGCAACGCCACTTCACCAATTGTGTAAAGCGCCATCGTCTCACCCTTGCTCGCGAGGTCCCGGTTTAACTTTAGACGCAGTTTTGCGAACCAGGTAGTTTTGCCCGTTTTTTGTGCATTTATAGGGTGATTTTATTTTTGCCAGGCGATTTTGAGTGATCGTACTCACGAATTCTCATTTTTCTGCAAGAGTTCAAAGAAAGTTAAACGCAGGCAATGTATGTTACGCGTTTTAAAGGGAAGTGTGGTTTGCGGGTATGTACGATTTTAATCTGGTGTTGCTGCTGCTTCAGCAGATGTGCGTTTTTTTAGTCATTGCGTGGTTAATGAGTAAAACGCCATTATTCATACCGTTAATGCAGGTCACGGTTCGTCTGCCGCATAAATTTCTCTGCTACATCGTCTTTTCCATCTTTTGCATTATGGGCACCTGGTTTGGGTTGCACATTGACGATTCTATTGCCAATACCCGTGCGATAGGCGCGGTTATGGGCGGCTTACTCGGCGGTCCGGTCGTCGGTGGGCTAGTTGGCCTGACCGGCGGGTTACATCGATATTCGATGGGGGGCATGACCGCGCTAAGTTGCATGATCTCGACCATCGTTGAAGGATTGCTCGGCGGCCTGGTACACAGCATCCTGATCCGTCGCGGGCGCACTGATAAAGTCTTTAACCCCATTACCGCCGGTGCCGTCACGTTCGTCGCTGAAATGGTGCAAATGTTAATCATCCTGGCGATCGCCCGACCTTATGAAGATGCGGTGCGTCTGGTGAGTAATATTGCTGCGCCAATGATGGTCACCAATACCGTCGGCGCGGCGCTGTTTATGCGTATATTGCTCGATAAACGCGCGATGTTTGAAAAATACACTTCAGCTTTTTCTGCCACTGCGCTGAAAGTGGCGGCCTCGACGGAAGGCATTTTGCGCCAGGGGTTTAACGAAGTGAACAGCATGAAAGTGGCACAGGTGCTGTATCAGGAGCTGGATATTGGTGCAGTCGCGATTACCGATCGAGAGAAATTGCTGGCCTTTACCGGAATTGGTGACGATCACCATTTACCCGGCAAACCGATTTCTTCGACTTACACCTTAAAAGCGATTGAAACCGGTGAAGTGGTCTACGCTGATGGCAACGAAGTACCTTATCGTTGCTCTTTGCATCCGCAATGCAAACTGGGGTCGACGCTGGTAATTCCGTTGCGTGGTGAAAATCAGCGGGTGATGGGCACCATCAAATTGTATGAAGCCAAAAACCGTTTATTCAGTTCAATCAACCGCACGTTGGGTGAGGGGATTGCGCAATTGCTTTCGGCGCAGATCCTTGCCGGGCAATATGAGCGGCAAAAAGCGATGCTCACCCAGTCAGAGATCAAACTGCTTCACGCCCAGGTGAACCCCCATTTTTTGTTTAATGCGCTTAACACCATTAAAGCGGTGATCCGTCGCGACAGCGAACAGGCCAGCCAGCTGGTGCAGTACCTTTCCACTTTTTTCCGCAAAAACTTAAAGCGGCCTTCGGAGTTTGTTACTCTCGCCGACGAAATTGAACATGTGAACGCTTATCTGCAAATTGAAAAGGCGCGCTTCCAGTCGCGGTTGCAGGTCAACATTGCTATTCCGCAAGAATTATCCCAGCAGCAATTGCCCGCGTTTACCCTGCAACCGATAGTGGAAAACGCCATTAAACATGGGACATCACAACTGCTGGATACAGGGCGAGTGGCAATCAGCGCCCGACGTGAGGGGCAACATTTAATGCTGGAGATCGAAGACAATGCCGGTTTGTATCAACCGGTAACCAATGCCAGTGGGCTGGGGATGAATCTGGTGGATAAGCGTTTACGTGAACGGTTTGGCGATGACTATGGGATAAGCGTCGCCTGTGAGCCTGATAGTTACACCCGAATAACGTTACGACTACCATGGAGGGACGAGGCATGATTAAAGTCTTAATTGTCGATGATGAACCGTTAGCACGGGAGAACCTGCGCGTATTTTTGCAGGAGCAGAGCGATATTGAAATCGTTGGAGAGTGTTCAAACGCCGTGGAAGGGATCGGCGCGGTGCATAAACTGCGCCCGGATGTGCTGTTTCTCGATATCCAGATGCCGCGCATCAGTGGTCTGGAAATGGTGGGGATGCTTGACCCGGAACATCGTCCGTATATTGTTTTTCTCACCGCGTTTGACGAATACGCTATTAAAGCTTTTGAAGAACATGCCTTTGACTATCTGCTGAAGCCAATTGATGAAGCGCGACTGGAGAAAACGCTGGCGCGTTTGCGTCAGGAACGCAGCAAGCAGGATGTTTCGCTGTTACCGGAAAATCAACAGGCGCTGAAATTTATCCCTTGTACGGGGCATAGTCGGATTTATTTGCTGCAAATGAAAGATGTGGCATTTGTCAGCAGTCGGATGAGCGGTGTCTACGTTACCAGCCACGAAGGGAAAGAGGGCTTTACCGAATTGACATTACGTACCCTGGAAAGTCGTACACCACTACTGCGCTGCCATCGTCAGTATCTGGTTAACCTCGCGCATTTACAGGAGATTCGTCTGGAAGATAACGGTCAGGCCGAGTTGATTTTGCGTAATGGCTTAACCGTGCCGGTCAGCCGCCGTTATCTGAAAAGCTTAAAAGAGGCGATTGGCCTGTAAAAGACTGCTAAAATGGCTTTTTGCCTCATCAACACCTGAAGGCCTCATGCTAAGTAACGATATTCTGCGCAGCGTGCGCTACATTTTGAAAGCCAATAATAATGACCTGGTGCGTATACTGGCGCTGGGTAATGTCGAAGCCACCGCGGAACAGATTGCCGTCTGGCTACGTAAAGAAGACGAAGAGGGTTTTCAGCGTTGTCCGGACATTGTTTTGTCGTCATTCCTCAATGGCCTGATTTATGAAAAACGCGGCAAGGATGAGTCTGCTCCGGCACTGGAGCCGGAACGTCGCATTAATAACAACATCGTGCTGAAAAAATTACGCATCGCGTTTTCGCTGAAAACCGATGACATTCTGGCGATCCTCACCGAACAGCAGTTCCGCGTTTCGATGCCGGAAATTACAGCGATGATGCGCGCACCGGATCATAAAAACTTCCGCGAATGCGGCGATCAATTTTTACGTTATTTTCTGCGTGGACTGGCAGCGCGCCAGCATGTGAAGAAAAGCTAAGACGGGTATGGCGGCCATGCGAAACATGGCCGCCGACAGATTATTTCACTTCTTTAAAACCAGCGGCTTTCATCACCAGTTCCATTTGCGCCATAGTGATACCTTTTTTGGCATCTTCAGCAGAAACGTTGATTCCTGAAATACCCTGCAGGGCTTTAAAATCCACTTTTTCCATATCGATAGTCACGTTTTCCTGCGCGTAGGTATCGGTATAGGTTAATTTTTCTTCAACACCCGCGATGTTTTTGTATTTGGCGCTTAACGGCTCAAGTGTCTTGGCAGCATCTTCTTTGGTGGTTGCACCAATGGAGGCAAATTGAATTTTGGTTTCAGAAGATTGCTTAAGCACCTTGTCACCTTTGTAGACATAGGTAATGGCAATTTCAGTGCCGTTCAGATTGGCGCTGAATTTCTTCGATTCTTCTTTGTCACCGCAGCCAGCAAGAGAGAAAACCAGAACAGATGCAACAACAAGGGAAAACAGCTTATTGAAAGCCTTCATGTAAAACTCCATTTTATTTAATCAAGGAACTGGTGACTCTCACCAGGGGCTATATAGGATATGCCTAATACCGTGGCGTGAGCAGTCCGGAACTGGAGTAGAACTCTTAGTAAAAAGCACTATTTCATCCTTGTTGCTGAAGCATGGGGAATAATTGTTCGCACAGCAAAACACCGTTATTCATTGCTTCTACCCGTGCCTCGCTTTCTGTATTACGAAATTGTCCCAACATATGTGCCAGCCGATAAAAACCCACCGCGGTGAGGTCATTCGCCAGCAACTCTGCCTGACCAATAGCACTCTGTTCCTGATAGCGCCAGCCGTTATGGAGCAGTTGAATAAGTAATGCCTGGCAGCGCATCAGCAACTGATGAGCAGTAGACGGCACAGGCAGAACGCTGGCAGAAGGTAGTGATGCCACCACAGGCGCAGTTTCTGCGTCCAGCGCCCAGGCACGAGTCTTTGTCATCATTACCTGTGGTTCCAGTGTCAATTGCCCATCAACAAAACTGACAAAGCCAGAAACCAGACACACGGGGTCGTCTGTTTGTTGCAAAAGCGCCGCCATGCGTTCAACGGCATAAGGCGCGCTGGCTGAGGCTGGCAGTGATAACGTCAGCAGATTATCTTCACCTTCGCCGCTGATTACCTGCGCATCCAGCGTCTGCCGACTGCTGTCCCAACCGAGCGAAATGCACTCTTCTACCGGCAGAATAAATAAATTATCGACCTGGTTAAGCGGTCGAATGGCGGCTGGCGGACGCTGGCGTAAATATTCCCGCAAAGCCACAATGCCCGGCTGGCGCAGCGGCGTACTCAGCATTTTCCAGGCATCTGGCGACAGTGGAACAACACTACTCAAGCGGTTGCGGGTAGCTAACAGCAGTTCGCCTTCGGCACTGCGACGCGCTGCTTGTGAAACAATTTGCCCACCTGCCAGTGCGCCAGCCTGAAAGCTAAACAGCCGACGTTTTGCCGCGGGAGCGTTTTCTTGTTCGCTTCTCGGCCAACTACGCGAAAGATGCAAAATACTGCCGGTGTCGGGATCGGTAAACCAGATGCGTAAACCATAATGCTCAATATCCTGCCAGCAACGCATACCTAAAGACACCAGCCGCAGATGATCAAGCTTTGCTTCTCCGGCAATGCCAGAGCCAACGACCGTGCGCCACGGCACAGGAGGAACTTCACCAACACTGTCGCGCCGGGCCATCTCTTGTACGCAATTTAATCGACTGTTTAATGCCGCAAGCTGACGTAAGCATTCTCCGGCATGATAGTGGCTGGCGCGGGCGTGGAAGGCATCAACGCTGGCGCGCAGTTGCCGTAGCGATTCACTCACCCATCGCCAGTTGCAGGCCTCTGCCGCCTGCAATGCGCGGTTGAAAGCGGCCTCGTAGTGAATAAGCGGCTGGCTGATGCCGCTCAGCCATAATGCCTGGCTTAGTTGTTGAACATATTGACGACACGTTTTGCCTTCTTCGCTGGCAAACGGATCGTCAGATGATGTGACGTGTTCGCTGCGCATCTGCCAGATTAAATGAGTCAATTCCGCTTGCTGAGTTTTGGCCTCGACGAAGGCCTGCACAGCCAGTACGACATGTTCGCAAAGTGTGCCTTCAATACAATCACAACGGGCGAAACGAATACTGCTGCGGGAATAAAAACGCACATCACTCATCGGTAAGCGGGCAGAGGGAATTTCGCCAGGCGCACAGAACAACTCAATGGTGATGCCTTTAGCGACCAACGCCTGTGCGCGTTTGCGGGTGGCATCGGGAAGGGTAGCCAGTTCTTCCAGCCAGATTGCCGGATCCCATTCCTCTTCTTTTTCCGTGGGCTGGGCGGTAGCACAAAGTCGTTGATAACTTAACACCAGCATCACGCGATGACGGCACATGCCGCTGGCCCCGCAGGTGCACTGAGTCTCTTTCAGTGCCTGACCGTTCGCCAGCTGGGTACGGACACCGTCACTGAAGGTGGCGATTAAAGCGCCGTTCTCATGGCTGATTTCCGGGACGTTGCCATTTTCCAGTTCCTTAAGGCTGCGCTTAACAAAACCGGCATTGCTTAACGCCGTCAGTGCCTGTGGTGTCAGTTCTAATAATTCCGGACGTAGTGAATTCATGACTGAAGGTTCTCTGCAAGCCATGATGCCAGCTCGCCCGGCGTCATGGCGGCTATTTGTGCGCCGACATTAACCAGCGCCTGGGCCGTATCGCGGTCATAGCAAGGCGTTGCTGTGCTATCGAGCGCTGCCAGTCCCAGCACTTTGATGCCGCTCTGGACACACTTTTTCACCTGATGCGTCAGTAATGATGATGAACCCCCTTCGTAAAAATCGCTCACGAGGATAATGACGCTTTTCGCTGGTTGTTCAATAAGTTGCCGACCATACTCCACGGCACTGGCGATATTGGTTCCGCCGCCTAATTGCACTTTCATTAATAACTCTACCGGATCGGCAACGTCTGCCGTGAGATCAACCACACTGGTGTCAAAAGCGACCAGATGGGTACGAATGCCGGGTAACTGCCATAAACAGGCCGCCATCACCGCAGAGTGGATCACCGAATCGACCATCGAGCCGCTTTGATCAACCAGTAAGACCAGTTGCCATTCTTCGCTTTGGCGTTTAATGCGGCTGTTAAAGCGGGGGGATTCGATATACAACTTGCCGTATTGCGGGTGCCAGTGTTGCAGGTTGGCGCGAAGAGTACTTTTGAAATCAAAGTTTCGCGCCAGTGGAATAAATGAGCGGCGACGGCGATCGCGGACACCAGAAAAAGCCTGACGAACTTCCTTTGCCAGTCGAGCCATAATTTCTTCAACAACCTGGCGCACTATCCGGCGGGCGGCAGCCAGTACTTCGGGATTCATCAGATGTTTGGTGTGCAAAACGGCGCGTAGCAGGCTTTCAGAAGGCTGCATACGTTCCAGCACGTCGAGATTCGTCACCACATCTTCAATGTCGTAGCGCAGTACGGCATCGCTTTCCAGCCGCTCAATCACCTGTTGCGGAAACAGCGTGTGAATACTGTTGATCCACTCAGGGGTGGTGAGATTTGAGCTACCTAATCCACCGGAACGTTCACCACGCTGGAGCCGTTCAGGATCGCGCCCATACAGCCACTCCAGCGCGTGGTCTATCTGCCGGGCGTTGTCATCCAGCCCACAAAGCGTCGTTTCTGCCGCTTCGCCAAGAATTAATCGCCAGCGTTGTAGCTCACGGGTGGTCAGAAGATCGTTCAGTTCAGACAT
Protein sequences of DBSCAN-SWA_5 >CP029164|2633863:2643308|2638308_2639028_+|AWH70264.1|DBSCAN-SWA MIKVLIVDDEPLARENLRVFLQEQSDIEIVGECSNAVEGIGAVHKLRPDVLFLDIQMPRISGLEMVGMLDPEHRPYIVFLTAFDEYAIKAFEEHAFDYLLKPIDEARLEKTLARLRQERSKQDVSLLPENQQALKFIPCTGHSRIYLLQMKDVAFVSSRMSGVYVTSHEGKEGFTELTLRTLESRTPLLRCHRQYLVNLAHLQEIRLEDNGQAELILRNGLTVPVSRRYLKSLKEAIGL >CP029164|2633863:2643308|2639585_2640047_-|AWH70266.1|DBSCAN-SWA MKAFNKLFSLVVASVLVFSLAGCGDKEESKKFSANLNGTEIAITYVYKGDKVLKQSSETKIQFASIGATTKEDAAKTLEPLSAKYKNIAGVEEKLTYTDTYAQENVTIDMEKVDFKALQGISGINVSAEDAKKGITMAQMELVMKAAGFKEVK >CP029164|2633863:2643308|2635506_2635614_-|AWH70261.1|DBSCAN-SWA MRIAKIGVIALFLFMALGGIGGVMLAGYTFILRAG >CP029164|2633863:2643308|2639074_2639545_+|AWH70265.1|DBSCAN-SWA MLSNDILRSVRYILKANNNDLVRILALGNVEATAEQIAVWLRKEDEEGFQRCPDIVLSSFLNGLIYEKRGKDESAPALEPERRINNNIVLKKLRIAFSLKTDDILAILTEQQFRVSMPEITAMMRAPDHKNFRECGDQFLRYFLRGLAARQHVKKS >CP029164|2633863:2643308|2633863_2634790_+|AWH70259.1|DBSCAN-SWA MIEFSHVSKLFGAQKAVNDLNLNFQEGSFSVLIGTSGSGKSTTLKMINRLVEHDSGELRFAGEEIRSLPVLELRRRMGYAIQSIGLFPHWSVAQNIATVPQLQKWSRARIDDRIDELMALLGLESNLRERYPHQLSGGQQQRVGVARALAADPQVLLMDEPFGALDPVTRGALQQEMTRIHRLLGRTIVLVTHDIDEALRLAEHLVLMDHGEVVQQGNPLTMLTRPANDFVRQFFGRSELGVRLLSLRSVADYVRREERAEGEALAEEMTLRDALSLFVARGCEVLPVVNTQGQPSGTLHFQDLLVEA >CP029164|2633863:2643308|2642171_2643308_-|AWH70268.1|DBSCAN-SWA MSELNDLLTTRELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGRDPERLQRGERSGGLGSSNLTTPEWINSIHTLFPQQVIERLESDAVLRYDIEDVVTNLDVLERMQPSESLLRAVLHTKHLMNPEVLAAARRIVRQVVEEIMARLAKEVRQAFSGVRDRRRRSFIPLARNFDFKSTLRANLQHWHPQYGKLYIESPRFNSRIKRQSEEWQLVLLVDQSGSMVDSVIHSAVMAACLWQLPGIRTHLVAFDTSVVDLTADVADPVELLMKVQLGGGTNIASAVEYGRQLIEQPAKSVIILVSDFYEGGSSSLLTHQVKKCVQSGIKVLGLAALDSTATPCYDRDTAQALVNVGAQIAAMTPGELASWLAENLQS >CP029164|2633863:2643308|2634794_2635526_+|AWH70260.1|DBSCAN-SWA MKMLRDPLFWLIALFVALIFWLPYSQPLFAALFPQLPRPVYQQESFAALALAHFWLVGISSLFAVIIGTGAGIAVTRPWGAEFRPLVETIAAVGQTFPPVAVLAIAVPVIGFGLQPAIIALILYGVLPVLQATLAGLGAIDASVTEVAKGMGMSRGQRLRKVELPLAAPVILAGVRTSVIINIGTATIASTVGASTLGTPIIIGLSGFNTAYVIQGALLVALAAIIADRLFERLVQALSQHAK >CP029164|2633863:2643308|2640171_2642175_-|AWH70267.1|DBSCAN-SWA MNSLRPELLELTPQALTALSNAGFVKRSLKELENGNVPEISHENGALIATFSDGVRTQLANGQALKETQCTCGASGMCRHRVMLVLSYQRLCATAQPTEKEEEWDPAIWLEELATLPDATRKRAQALVAKGITIELFCAPGEIPSARLPMSDVRFYSRSSIRFARCDCIEGTLCEHVVLAVQAFVEAKTQQAELTHLIWQMRSEHVTSSDDPFASEEGKTCRQYVQQLSQALWLSGISQPLIHYEAAFNRALQAAEACNWRWVSESLRQLRASVDAFHARASHYHAGECLRQLAALNSRLNCVQEMARRDSVGEVPPVPWRTVVGSGIAGEAKLDHLRLVSLGMRCWQDIEHYGLRIWFTDPDTGSILHLSRSWPRSEQENAPAAKRRLFSFQAGALAGGQIVSQAARRSAEGELLLATRNRLSSVVPLSPDAWKMLSTPLRQPGIVALREYLRQRPPAAIRPLNQVDNLFILPVEECISLGWDSSRQTLDAQVISGEGEDNLLTLSLPASASAPYAVERMAALLQQTDDPVCLVSGFVSFVDGQLTLEPQVMMTKTRAWALDAETAPVVASLPSASVLPVPSTAHQLLMRCQALLIQLLHNGWRYQEQSAIGQAELLANDLTAVGFYRLAHMLGQFRNTESEARVEAMNNGVLLCEQLFPMLQQQG >CP029164|2633863:2643308|2636626_2638312_+|AWH70263.1|DBSCAN-SWA MYDFNLVLLLLQQMCVFLVIAWLMSKTPLFIPLMQVTVRLPHKFLCYIVFSIFCIMGTWFGLHIDDSIANTRAIGAVMGGLLGGPVVGGLVGLTGGLHRYSMGGMTALSCMISTIVEGLLGGLVHSILIRRGRTDKVFNPITAGAVTFVAEMVQMLIILAIARPYEDAVRLVSNIAAPMMVTNTVGAALFMRILLDKRAMFEKYTSAFSATALKVAASTEGILRQGFNEVNSMKVAQVLYQELDIGAVAITDREKLLAFTGIGDDHHLPGKPISSTYTLKAIETGEVVYADGNEVPYRCSLHPQCKLGSTLVIPLRGENQRVMGTIKLYEAKNRLFSSINRTLGEGIAQLLSAQILAGQYERQKAMLTQSEIKLLHAQVNPHFLFNALNTIKAVIRRDSEQASQLVQYLSTFFRKNLKRPSEFVTLADEIEHVNAYLQIEKARFQSRLQVNIAIPQELSQQQLPAFTLQPIVENAIKHGTSQLLDTGRVAISARREGQHLMLEIEDNAGLYQPVTNASGLGMNLVDKRLRERFGDDYGISVACEPDSYTRITLRLPWRDEA >CP029164|2633863:2643308|2635673_2636405_-|AWH70262.1|DBSCAN-SWA MALYTIGEVALLCDINPVTLRAWQRRYGLLKPQRTDGGHRLFNDADIDRIREIKRWIDNGVQVSKVKMLLSNENVDVQNGWRDQQETLLTYLQSGNLHSLRTWIKERGQDYPAQTLTTHLFIPLRRRLQCQQPTLQALLAILDGVLINYIAICLASARKKQGKDALVVGWNIQDTTRLWLEGWIASQQGWRIDVLAHSLNQLRPELFEGRTLLVWCGENRTSAQQQQLTSWQEQGHDIFPLGI |
10 | Enterobacteria_phage(85.71%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
2738115 : 2744445
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >CP029164|2738115:2744445|DBSCAN-SWA TATGCCATTTAAAAAAATCTCCCGACGCACCTTCCTGACGGCAAGCTCGGCGCTTGCCTTCCTCCATACCCCTTTCGCTCGCGCACTTCCCGCCCGACAAAGCGTTAACATTAACGACTACAACCCTCACGACTGGATCGCCTCATTTAAACAAGCCTTCAGCGAAGGGCAAACGGTCGTCGTGCCTGCTGGATTCGTTTGTGACAATATCAACACCGGCATCTTCATTCCTCCTGGAAAAACGTTACACATCCTTGGAAGCCTGCGCGGCAACGGCAGAGGGCGATTTGTCTTACAGGACGGCAGCCAGGTGACAGGGGAGGAGGGCGGCAGTATGCATAACATCACCCTGGATGTGCGTGGTTCTGACTGCACCATCAAAGGGCTGGCGATGAGCGGCTTTGGCCCGGTAACGCAGATTTATATCGGCGGCAAAAACAAACGGGTCATGCGCAACCTGACCATCGATAACCTCACTGTCAGCCACGCTAATTACGCCATCTTACGCCAGGGATTTCATAACCAGATAATTGGTGCCAACATCACCAACTGTAAATTTAGTGATTTACAGGGGGACGCTATTGAATGGAACGTGGCGATTAACGACAGTGATATTTTGATCTCCGACCATGTCATCGAGCGCATCAACTGTACCAACGGCAAAATCAACTGGGGCATTGGCATAGGCCTTGCGGGAAGCACTTACGATAATAATTACCCGGAAGATCAGGCAGTGAAAAACTTTGTCGTGGCGAATATCACGGGATCGGATTGTCGGCAGTTGATCCATGTTGAAAATGGCAAACATTTTGTTATCCGTAATATCAAAGCCCGCAATATCACGCCGGATTTCAGTAAGAAAGCAGGCATTGATAACGCGACAGTTGCTATTTACGGTTGTGACAATTTCGTGATTGATAATATTGAAATGATTAATAGCGCCGGGATGTTAATCGGCTATGGGGTAATTAAAGGCAAATATCTCTCGATACCGCAAAATTTCCGAGTGAATAATATTCAACTGGATAATACCCACCTTGCTTATAAATTGCGCGGTATCCAAATCTCCGCCGGGAATGCCGTCTCCTTTGTGGCGCTGACTAACATTGAGATGAAGCGTGCGTCGCTTGAGCTATACAATAAACCGCAACACCTTTTTATGCGTAATATCAATGTGATGCAGGAATCCTCAGTTGGACCCGCATTGAGCATGAACTTCGACATGCGCAAAGACGTTCGCGGCGTCTTTATGGCGAAAAAAGAAACACTGCTGTCTCTTGCAAATGTTCATGCGGTGAATGAGAAAGGACAAAGCTCCGTCGATATCGACAGGGTTAATCACCATATTGTTAATGTGGAAAAGATTAACTTTAGATTGCCGGAACGGAGAGAGTAGATTTGCGACCATTCCTGGAAAAATGGAGCCATACTTAGGAACAATGCTACTGCAATCCACAACGAAGCGGCGTAACATCACAAGTAATTCAGTAATCAATTCAGGGTAATTGATGCTGGCGAAAAAAATCGAACAAGCTATAATTCAGCAACCATTTTACAGGTGGATGAAATAATGACGAATTTAAAAGCAGTTATTCCTGTAGCGGGTCTTGGGATGCATATGTTGCCTGCCACTAAGGCGATTCCCAAAGAGATGCTACCGATCGTCGACAAACCAATGATTCAGTACATTGTTGACGAGATTGTGGCTGCAGGGATCAAAGAAATCCTCCTGGTAACTCACGCGTCCAAGAACGCGGTCGAAAACCACTTCGACACCTCTTATGAATTAGAATCTCTCCTTGAGCAGCGCGTGAAGCGTCAACTGCTTGCGGAAGTGCAGTCCATCTGTCCACCGGGCGTGACCATTATGAACGTGCGTCAGGGCGAACCTTTAGGTTTGGGCCACTCCATTTTATGTGCACGACCTGCCATTGGTGACAATCCATTTGTCGTGGTGCTGCCAGACGTTGTGATCGACGACGCCAGCGCCGACCCGCTGCGCTACAACCTTGCTGCCATGATTGCGCGCTTCAATGAAACAGGACGCAGCCAGGTGCTGGCAAAACGTATGCCGGGTGACCTCTCTGAATACTCTGTCATTCAGACCAAAGAACCGCTGGATCGCGAAGGTAAAGTCAGTCGCATCGTCGAATTTATCGAAAAACCGGATCAGCCGCAGACGCTGGACTCAGATATCATGGCCGTGGGCCGTTATGTGCTTTCTGCCGATATTTGGCCGGAACTTGAACACACTCAGCCTGGTGCATGGGGGCGTATTCAGCTGACTGATGCCATTGCCGAACTGGCGAAAAAACAGTCCGTTGATGCCATGCTGATGACTGGTGACAGCTACGATTGCGGTAAAAAAATGGGTTATATGCAGGCGTTCGTGAAGTATGGGCTGCGCAACCTCAAAGAAGGGGCGAAGTTCCGCAAAGGGATTGAGAAGCTGTTAAGCGAATAATGAAAATCTGACCGGATGTAACGGTTGATAAGAAAATTATAACGGCAGTGAAGATTCGTGGCGAAAGTAATTTGTTGCGAATATTCCTGCCGTTGTTTTATATAAACAATCAGGATAACAACGAGTTAGCAATAGGATTTTAGTCAAAGTTTTCCAGAATTTTCCTTGTTTCCAGAGCGGATTGGTAAGACAATTAGCGTTTGAATTTTTCGGGTTTAGCGCGAGTGGGTAACGCTCGTCACATCGTAGGCATGCATGCAGTGCTCTGGTAGCTGTAAAGCCAGGGGCGGTAGCGTGCATTAATACCTCTATTAATCAAACTGAGAGCCGCTTATTTCACAGCATGCTCTGAAGTAATATGGAATAAATTAAGTGAAAATACTTGTTACTGGTGGCGCAGGATTTATTGGTTCTGCTGTAGTTCGTCACATTATAAATAATACGCAGGATAGTGTTGTTAATGTCGATAAATTAACGTACGCCGGAAACCTGGAATCACTTGCTGATGTTTCTGATTCTGAACGCTATGTTTTTGAACATGCGGATATTTGCGATGCAGCTGTAATGGCACGGATTTTTGCTCAGCATCAGCCGGATGCAGTGATGCACTTGGCTGCTGAAAGCCATGTTGACCGTTCAATTACAGGCCCTGCGGCATTTATTGAAACCAATATTGTTGGTACTTATGTCCTTTTGGAAGCCGCTCGCAATTACTGGTCTGCTCTTGATAGCGACAAGAAAAATAGCTTCCGTCTTCATCATATTTCTACTGACGAAGTCTATGGTGATTTGCCTCATCCAGATGAAGTAAATAATACAGAAGAATTACCCTTATTTACTGAGACGACAGCTTACGCGCCAAGCAGTCCTTATTCCGCATCCAAAGCATCCAGCGATCATTTAGTCCGCGCGTGGAAACGTACCTATGGTTTACCGACCATTGTGACTAATTGCTCTAACAATTATGGTCCTTATCATTTCCCGGAAAAATTGATTCCATTGGTTATTCTCAATGCTCTGGAAGGTAAAGCATTACCTATTTATGGTAAAGGGGATCAAATTCGCGACTGGCTGTATGTTGAAGATCATGCGCGTGCGTTATATACCGTCGTAACCGAAGGTAAAGCGGGTGAAACTTATAACATTGGTGGGCACAACGAAAAGAAAAACATAGATGTAGTGCTCACTATTTGTGATTTGCTGGATGAGATTGTACCGAAAGAGAAATCTTATCGTGAGCAAATCACTTATATTGCCGATCGTCCGGGACACGATCGCCGTTATGCGATTGATGCTGAGAAGATTGGTCGCGAATTGGGATGGAAACCACAGGAAACGTTTGAGAGCGGGATTCGGAAGACAGTGGAATGGTACCTGTCCAATACAAAATGGGTTGATAATGTGAAAAGTGGTGCCTATCAATCGTGGATTGAAGAGAACTATGAGGGCCGCCAGTAATGAATATCCTCCTTTTTGGCAAAACAGGGCAGGTAGGTTGGGAACTACAGCGTGCTCTGGCACCTCTGGGTAATTTGATTGCTCTTGATGTTCACTCCACTGATTATTGTGGTGATTTTAGTAATTCTGAAGGTGTAGCTGAAACTGTCAAAAAAATTCGCCCTGATGTTATTGTTAATGCGGCTGCTCATACCGCGGTAGATAAGGCTGAGTCAGAACCCGAATTTGCACAATTACTCAATGCGACAAGTGTTGAAGCGATTGCAAAAGCAGCCAATGAGGTCGGTGCATGGGTTATTCACTACTCTACTGACTACGTATTTCCGGGAACCGGTGAAATACCATGGCAGGAGGCGGATGCAACCGCACCGCTGAACGTTTATGGTGAAACCAAGTTAGCCGGGGAAAAAGCATTACAAGAGCATTGTGCGAAGCATCTTATTTTCCGGACCAGCTGGGTCTATGCAGGTAAAGGAAATAACTTCGCCAAAACGATGTTGCGTCTGGCAAAAGAGCGTGAAGAATTAGCTGTTATTAACGATCAGTTTGGTGCGCCAACTGGCGCAGAGTTACTGGCTGATTGTACGGCACATGCCATTCGTGTGGCACTGAATAAACCGGAAGTCGCAGGCTTGTACCATCTGGTAGCTAGTGGTACCACAACGTGGCACGATTATGCTGCGCTGGTTTTTGAAGAGGCGCGCAAAGCAGGCATTCCCCTTGCACTCAACAAGCTCAACGCAGTACCAACAACAGCCTATCCTACACCAGCTCGTCGTCCGCATAACTCTCGCCTTGATACAGAAAAATTTCAGCAGAACTTTGCGCTTGTCTTGCCTGACTGGCAGATTGGCGTGAAACGCATGCTCAACGAATTATTTACGACTACAGCTATTTAATAGTTTTTGCATCTTGTTCGTGATGGTGGAGCAAGATGAATTAAAAGGAATGATGAAATGAAAACGCGTAAAGGTATTATTTTAGCGGGCGGTTCTGGTACTCGTCTTTATCCTGTAACTATGGCTGTCAGTAAACAGCTATTACCGATTTATGATAAACCGATGATCTATTATCCGCTCTCTACACTGATGTTAGCGGGTATTCGCGATATTCTGATTATCAGTACGCCACAGGATACTCCTCGTTTTCAACAACTGCTGGGTGACGGGAGCCAGTGGGGGCTAAATCTTCAGTACAAAGTGCAACCGACTCCAGATGGGCTTGCGCAGGCGTTTATTATCGGTGAAGAGTTTATCGGTGGTGATGATTGTGCTTTGGTTCTTGGTGATAATATCTTCTACGGCCACGATCTGCCGAAGTTAATGGACGTAGCTGTTAACAAAGAAAGTGGTGCAACAGTATTTGCTTATCACGTAAATGATCCTGAACGCTACGGTGTCGTTGAGTTTGATAAAAACGGTACGGCAATAAGCCTGGAAGAAAAACCGTTACAACCAAAAAGTAATTATGCGGTAACCGGGCTTTATTTCTATGACAACGACGTTGTCGAAATGGCGAAAAACCTTAAGCCTTCTTCCCGTGGTGAACTAGAAATTACCGATATTAACCGTATTTATATGGAACAGGGGCGTTTATCTGTTGCCATGATGGGCCGTGGTTATGCATGGCTGGACACGGGGACACATCAAAGCCTGATTGAGGCAAGCAACTTCATTGCCACCATTGAAGAGCGCCAGGGACTAAAGGTTTCCTGCCCAGAAGAAATTGCTTACCGTAAAGGATTTATTGATGCTGAGCAGGTGAAAGTATTAGCTGAACCGCTGAAAAAAAATGCTTATGGTCAGTATCTGCTGAAAATGATTAAAGGTTATTAATTAAATGAACGTAATTAAAACAGAAATTTCTGATGTGCTGATTTTTGAACCAAAAGTTTTTGGCGATGAGCGCGGCTTCTTTTTTGAAAGCTTTAACCAGAAAGTATTTGAAGAAGCTGTAGGCCGCAAAGTTGAATTTGTCCAAGATAACCATTCGAAGTCTAGTAAAGGTGTTTTACGCGGGCTGCATTATCAGTTAGAACCGTATGCGCAAGGGAAATTGGTACGTTGCGCTGTTGGTGAGGTTTTTGACGTAGCGGTTGATATTCGTAAATCGTCACCAACTTTTGGCAAATGGGTTGGGGTGAATTTATCTGCTGAGAATAAGCGCCAGTTGTGGATACCTGAAGGATTAGCACATGGGTTTTTGGTGCTGAGTGAGACGGCGGAGTTTTTATATAAAACGACCAATTATTATCATCCAGAGAGTGATAGAGGGATTATTTGGGATGATCCTGACATTGACGTAAAGTGGCCTTTAAGTATCCATAAACCGATTTTATCTATAAAAGATGAAAAACAAAAGATGTTTAAAGAAATGATAGCGTTGGAGAAATGGAATGGAGAATACTAA
Protein sequences of DBSCAN-SWA_6 >CP029164|2738115:2744445|2742992_2743871_+|AWH70348.1|DBSCAN-SWA MKTRKGIILAGGSGTRLYPVTMAVSKQLLPIYDKPMIYYPLSTLMLAGIRDILIISTPQDTPRFQQLLGDGSQWGLNLQYKVQPTPDGLAQAFIIGEEFIGGDDCALVLGDNIFYGHDLPKLMDVAVNKESGATVFAYHVNDPERYGVVEFDKNGTAISLEEKPLQPKSNYAVTGLYFYDNDVVEMAKNLKPSSRGELEITDINRIYMEQGRLSVAMMGRGYAWLDTGTHQSLIEASNFIATIEERQGLKVSCPEEIAYRKGFIDAEQVKVLAEPLKKNAYGQYLLKMIKGY >CP029164|2738115:2744445|2738115_2739510_+|AWH70343.1|DBSCAN-SWA MPFKKISRRTFLTASSALAFLHTPFARALPARQSVNINDYNPHDWIASFKQAFSEGQTVVVPAGFVCDNINTGIFIPPGKTLHILGSLRGNGRGRFVLQDGSQVTGEEGGSMHNITLDVRGSDCTIKGLAMSGFGPVTQIYIGGKNKRVMRNLTIDNLTVSHANYAILRQGFHNQIIGANITNCKFSDLQGDAIEWNVAINDSDILISDHVIERINCTNGKINWGIGIGLAGSTYDNNYPEDQAVKNFVVANITGSDCRQLIHVENGKHFVIRNIKARNITPDFSKKAGIDNATVAIYGCDNFVIDNIEMINSAGMLIGYGVIKGKYLSIPQNFRVNNIQLDNTHLAYKLRGIQISAGNAVSFVALTNIEMKRASLELYNKPQHLFMRNINVMQESSVGPALSMNFDMRKDVRGVFMAKKETLLSLANVHAVNEKGQSSVDIDRVNHHIVNVEKINFRLPERRE >CP029164|2738115:2744445|2739684_2740578_+|AWH70344.1|DBSCAN-SWA MTNLKAVIPVAGLGMHMLPATKAIPKEMLPIVDKPMIQYIVDEIVAAGIKEILLVTHASKNAVENHFDTSYELESLLEQRVKRQLLAEVQSICPPGVTIMNVRQGEPLGLGHSILCARPAIGDNPFVVVLPDVVIDDASADPLRYNLAAMIARFNETGRSQVLAKRMPGDLSEYSVIQTKEPLDREGKVSRIVEFIEKPDQPQTLDSDIMAVGRYVLSADIWPELEHTQPGAWGRIQLTDAIAELAKKQSVDAMLMTGDSYDCGKKMGYMQAFVKYGLRNLKEGAKFRKGIEKLLSE >CP029164|2738115:2744445|2742035_2742935_+|AWH70347.1|DBSCAN-SWA MNILLFGKTGQVGWELQRALAPLGNLIALDVHSTDYCGDFSNSEGVAETVKKIRPDVIVNAAAHTAVDKAESEPEFAQLLNATSVEAIAKAANEVGAWVIHYSTDYVFPGTGEIPWQEADATAPLNVYGETKLAGEKALQEHCAKHLIFRTSWVYAGKGNNFAKTMLRLAKEREELAVINDQFGAPTGAELLADCTAHAIRVALNKPEVAGLYHLVASGTTTWHDYAALVFEEARKAGIPLALNKLNAVPTTAYPTPARRPHNSRLDTEKFQQNFALVLPDWQIGVKRMLNELFTTTAI >CP029164|2738115:2744445|2743875_2744445_+|AWH70349.1|DBSCAN-SWA MNVIKTEISDVLIFEPKVFGDERGFFFESFNQKVFEEAVGRKVEFVQDNHSKSSKGVLRGLHYQLEPYAQGKLVRCAVGEVFDVAVDIRKSSPTFGKWVGVNLSAENKRQLWIPEGLAHGFLVLSETAEFLYKTTNYYHPESDRGIIWDDPDIDVKWPLSIHKPILSIKDEKQKMFKEMIALEKWNGEY >CP029164|2738115:2744445|2740614_2740878_-|AWH70345.1|DBSCAN-SWA MHATAPGFTATRALHACLRCDERYPLALNPKNSNANCLTNPLWKQGKFWKTLTKILLLTRCYPDCLYKTTAGIFATNYFRHESSLPL >CP029164|2738115:2744445|2740950_2742036_+|AWH70346.1|DBSCAN-SWA MKILVTGGAGFIGSAVVRHIINNTQDSVVNVDKLTYAGNLESLADVSDSERYVFEHADICDAAVMARIFAQHQPDAVMHLAAESHVDRSITGPAAFIETNIVGTYVLLEAARNYWSALDSDKKNSFRLHHISTDEVYGDLPHPDEVNNTEELPLFTETTAYAPSSPYSASKASSDHLVRAWKRTYGLPTIVTNCSNNYGPYHFPEKLIPLVILNALEGKALPIYGKGDQIRDWLYVEDHARALYTVVTEGKAGETYNIGGHNEKKNIDVVLTICDLLDEIVPKEKSYREQITYIADRPGHDRRYAIDAEKIGRELGWKPQETFESGIRKTVEWYLSNTKWVDNVKSGAYQSWIEENYEGRQ |
7 | Enterobacteria_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_7 |
3221809 : 3273222
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >CP029164|3221809:3273222|DBSCAN-SWA GTTATATTTTTTCGATCTCGACCAGATTAGTGTGCTGCGGGTTTCCCTTCGCCAGTGGTGAAGGGCGCAGAGTGGTTAGCGTATTCACACAGCCGCCATGGTCGATTTTATCGCCAGACATATTGGCCTCGTGCCAGGCTCCCTGGCCCATAGCGCTAACCCCAGGGAGAATGCGTGGTGTCACTTTGGCTGGTAGCCGAACTTCGCCACGATGGTTATACACCCGCACCATATCGCCGTTGGCAATCCCACGTTTCTGTGCGTCAATAGGGTTGATCCACACCTCCTGACGGCAGGCAGCCTTCAGGACATCAATGTTGCCGTAGGTCGAGTGAGTACGGGATTTGTAATGGAAACCAAACAGTTGCAGTGGAAAAGTACTACGTTCAGGGGAGTCCCAGCCTTCAAAGGTTGAGGCATAAACCGGCAGTGGGCTTATCACTTCATCTTTTTCCAGTTCCCAGGTACGGGCAATTTCCGCCAGCTTGCTGGAATAAATTTCAATCTTACCGGAAGGCGTTTTAAGTGGATTTGCCTCGGGGTCGTCACGAAATGCTTTGTAGGCGACAAAATGACCATTGGGATCTTTACGCTTATAGATACCCATTTTTTTCAGTTCGTCGTAAGACGGTAACGCCGGATCTTTGGCAAGCATTTTGGCGTACAGATGTTGTAACCATTGTTCCTGCGTGCGACCTTCTGTGAACTTTTGATAGACGTCAGGTCCAAGACGTTTCGCGACCTCACTCAGGATCCAGTAAATCGGTTTGCGTTCGAATTTTTCGCTGGTGACTGGCTGAAGGAAAATGAGATATCCCATGTTACCGGCGTAGTCGTTAGGAATAATATCTTCCTGCTCAACGGTCATCAGGTCTGGCAGCAGAATGTCGGCATATTTTGCCGATGAGGTCATAAAGTTTTCGATGACCACAATCATTTCGCATTTCGATTCGTCCTGCAGAATTTCATGCGTTTTGTTGATGTCAGAATGCTGATTAACCAGGGTATTTCCCGCGTAGTTCCAGATGAACTTAATGGGCACATCCAGTTTATCTTTGCCCCGGACGCCGTCGCGGATTGCCGTCATTTGCGGGCCATGATCGATAGCATCCGTCCAGCTGAAGCAGGAGATTGACGTTTTGACCGGATTATCCAGCACCGGCAGGCGTTCTATGGTAATGGTATAGGTCGATTCACGCGCGCCACTATTCCCGCCGCTGATGCCGACATTGCCCGTCAGAATAGGCAACATCGCAATAGCGCGTGCAGTCAGCTCGCCGTTTGCCTGGCGTTGCGGTCCCCAGCCCTGGCAGATATAAGCGGGTTTTGCTGTGCCAATTTCACGCGCCAGTTTGATGATACGGTCCACCGGGATACCGGTAATTTGCGAAGCCCACTGCGGCGTTTTCGCTGTTTTGTCGTCTCCTTCACCAAGAATATAGGCTTTATAGTGACCATTTTTTGGTGCATCTGCGGGTAAGGTTTTTTCGTCATAGCCGACGCAGTATTTATCGAGAAAAGGTTGATCAACGAGATTTTCGTTAATCAATACCCAGGCAATACCCGCAACCAGCGCGGCATCGGTGCCCGGGCGAATAGGAAGCCATTCGTCTTCACGACCGGCAGCCGTATCGGTATATCGCGGATCGATAACAATCATTTTTGCGTTCGATTTCTCGCGCGCTTTTTCAAGAAGATAAGTGATGCCACCACCGCTCATGCGGGTTTCTGCCGGGTTGTTACCAAACATCACGACCAGCTTGCTGTTTTCAATATCCGTTGTGCTGTTGCCATCATTACTGCCGTAGGTGTAGGGCATGGCACAGGAGATTTGCGCGGTGCTGTAGGAGCCATACTGGTTGAGTGAACCGCCGTAGCAGTTCATCAGGCGTTTGACCGCCGAGGCGGATGGCGAAGAGCGGGTCATATTGCCGCCAACAATCCCCGAAGAGTACTGAATATATACAGCCTCATTGCCATATTGTTCGACGGTTTTTTTCAGGCTACTGGCGATAGTATCCAGGGCTTCATCCCAGCTAATCCGTTCGAATTTGCCTTCGCCGCGTTTGCCCACGCGTTTCATTGGGTAATTCAAGCGGTCGGGATGATTAATACGCCGACGGATGGAGCGACCGCGCAAACAGGCGCGTACTTGATGATTGCCATACTCATCGCTGCCGGTGTTGTCAGTTTCCACCCAGGTCACTTCATTATCTTTAACATGTAGACGAAGTGCACAGCGGCTACCACAGTTGACGGAACAGGCACCCCAGACCACTTTTTCGCTGGCTTGTTGTACCGCTGCCGCTGCACTGCGCAGGGTGAACGGTAAAGAAAAACCGCCTGCAGCCAGCGCCAGAGAACCTATCGCGGTAGATTTAACGAGTGTTCTGCGGCTGATGCCCACCATTTGTTCATTTTTGGACATAACTCACTCCCTGTTCTTTATCGTTATATAAAAGTTTATATATTGAATATTTAGCGCGCTAACAATAGAGGGAGTCTACCTATTTTGGGTTAAGAATTATTAATCCATATCAATAGAAGGGTATGAGTAATAAGGTGGGATTATGTTGTATATTCAAATCGCCGGATGTGTCGTATCCGGCGTGCAGTCGATAATGTATTACTGCGGTTCGGCAGGCGCGCCATCCTGGGTAGACTGCGCGGGAGCAGAGACGTTACCGCTGGTGGTGCGGGTGTAGAGAATTTTATGCGTATCATTAGCGCAATGCCCGACGACCTGGGAATCAGGCTGATCAACCTGGTCATTGGGTACAATACTTAACGTGAAGCTGCTTTCGGGTACGCCATTATTGATAATGCGCTGTGATATATCGCTCTGTATGCGCTCACAGGATCCCGGCGCGGCGAGTACCGCGGGTGAGGCGAGGGCGAGCAGAAGCGCGGCACAGCAGGTTGAGCGTTTCATCATAAGCTCCTTACGCGAAGATAACTTCTTTAAGCATAGCATTTAACGTGTAAAGTACTGTATTTGCTACTATGATTGAGAATCATCTCTACTCTCTGGTGACTGTTGTGAAATACAAATTACTGCCATGCTTACTCGCGATATTCCTCACAGGATGTGACCGCACAGAGGTAACACTTTCATTTACCCCTGAGATGGCCAGTTTCTCTAATGAATTCGATTTTGATCCGCTGCGTGGTCCGGTAAAAGATTTCACTCAGACATTAATGGATGAGCAAGGTGAAGTGACGAAACGTGTTTCGGGGACTTTGTCGGAAGAAGGCTGTTTTGATTCACTCGAATTACTGGATCTGGAAAATAATACCGTGGTTGCCCTGGTACTGGACGCCAATTATTACCGTGATGCCCAGACACTGGAGAAGAGAGTTCGTTTGCAGGGAAAATGCCAGCTAGCAGAATTACCTTCTGCCGGGGTGAGCTGGGAAACCGATGATAATGGCTTTGTGATTAAAGCCAGCAGCAAACAGATGCAGATGGAATATCGCTATGATGATCAGGGGTATCCGCTGGGTAAAACCACGAAAAGTAACGACAAAACATTATCTGTCAGCGCTACGCCATCAACGGATCCGATCAAAAAATTAGATTACACAGCGGTTACTTTACTGAATAATCAAAGGGTTGGTAATGTAAAACAGAGCTGTGAATATGACAGTCACGCCAATCCGGTGGACTGTCAGCTAATCATTGTTGATGAAGGAGTAAAACCCGCCGTCGAACGGGTTTACACCATCAAAAATACTATCGATTATTATTAATGCTATTGTGCGGTCGGCTTCAGGAGAGTCTGACCCGGTGTTTTGTGCTCTGCCAGATACTGATGCTGGAATATGCACATGCGAATGGCATTACGATATTGACCATTAATAAAGAACTCATGCATCAATTCACCTTCAACCGTAAAGCCAAGCTTGCGGTAAATGTGAATCGCTTTTTCATTCTCTTTATCGACGATCAGATACAGCTTATAGAGATTGAGAACGGTGAAGCCATAGTCCATTGCTAATTTTGCGGCACGCGTTGCCAGACCTTTCCCCTGATATTCCGGGGAGATAATTATCTGAAATTCTGCGCGGCGATGAACATGGTTAATTTCCACCAGCTCCACCAGACCGGCTTTTTCGCCGTCACATTCCACCACAAAGCGCCGTTCGCTCTGATCGTGAATATGCTTATCGTACAGATCAGAGAGTTCAACAAAGGCTTCGTAGGGTTCCTCAAACCAGTAACGCATCACACTGGCGTTATTGTCGAGTTGATGTACATAGCGTAAATCTTCACGCTCCAGCGGGCGTAGCTTAACACTGTGGGCGCTTGGCATAACGTGTCCTTACATTCCTTAAATCAATAACAGGTTAGGGGGTAATAACGCGGCCAGTTCGACGGTCCAGGCAGCGCAAAGTATTTGGCTCCCAGTAGGCATTGATGTTGGCGCTTTGCTCACATTTATCGCGGTTATCAAAAGCAGCGTCGGCTTTATCCCACTCTTTTTCAGTGCGTTTATTCACTTTCTGGCGCAGATTACGCGTGTCATTCCATTGCTCTTTTTCCATAGCGGCGCGCTGGCGGCTTTGTGCACTGTCGCCAGACTCAATCACCAGTTTGTTATTTTCGGCATGAACAGTTGTGCTCAATGCCAGTGCGCAAGGCAGCAGAAAAGCGAGCAGGCCGATTCGTTTGCTGAGAGTGATTTTCATAATTCATTCCCTGTATGAATGATTAAAGCTGATTCTACACCATCCACTGCGAACGCAAAACGTTCAAGGAGGGTGTTTATATTGATGATATTATGTCGCCCTATAACTATACGTGATGTCAATAAGTGACAAAGATGATTAAAACAACGTTACTATTTTTTGCTACTGCGCTGTGTGAAATTATTGGATGCTTTCTGCCCTGGTTGTGGTTAAAACGAAACGCCAGTATCTGGCTGTTGCTTCCGGCGGGGATTTCACTGGCGCTGTTTGTCTGGTTGTTAACGTTGCATCCTGCGGCGAGTGGGCGTGTTTACGCCGCTTATGGTGGCGTTTATGTCTGCACGGCGTTGATGTGGCTGCGCGTTGTGGATGGCGTGAAACTGAGTCTTTATGACTGGACGGGGGCGTTGATTGCGCTTTGCGGCATGTTGATCATTGTTGCGGGCTGGGGGCGCACGTAGGAATATAAATCCCTTTTATCAATAAGATACGAGGATGTGTCAGCTGACAAAAGGTATTCTATTTCATCTTTTGTCAACCATTCACGGCGCAAATATACGCCTTTTTTTGTGATCACTCCGGCTTTTTTTGATCTTCATACTTGTATGGTAGTAGCTCAGTTGCGTAGATTTCATGCATCACGACAAGCGATGCAAGGAATCGAACATGAAGATCGTAAAGGCTGAAGTTTTTGTTACCTGTCCGGGGCGTAATTTCGTCACATTAAAAATCACCACTGAGGACGGTATTACGGGCCTTGGGGATGCCACCCTAAATGGACGTGAGCTTTCCGTGGCCTCTTATTTGCAGGATCACCTTTGTCCGCAGCTTATTGGTCGCGATGCACACCGTATCGAAGATATCTGGCAGTTTTTCTATAAAGGTGCTTACTGGCGTCGCGGCCCGGTGACGATGTCGGCCATTTCAGCGGTTGATATGGCGCTGTGGGATATTAAAGCCAAAGCTGCCAACATGCCGCTTTACCAGTTACTCGGCGGCGCGTCTCGTGAAGGGGTGATGGTTTATTGCCATACCACCGGTCACAGTATTGATGAAGCTCTGGATGATTATGCTCGTCATCAGGAGCTGGGATTCAAAGCCATCCGCGTGCAGTGCGGAATCCCTGGTATGAAAACCACCTACGGCATGTCGAAAGGTAAAGGTCTGGCTTATGAACCCGCAACTAAAGGACAGTGGCCGGAAGAGCAGCTGTGGTCGACGGAGAAATACCTCGATTTCATGCCGAAATTGTTTGACGCGGTACGTAACAAGTTTGGTTTTAATGAACATTTGCTGCATGACATGCACCATCGCTTAACGCCTATTGAAGCGGCGCGCTTTGGTAAAAGCATTGAAGATTATCGCATGTTCTGGATGGAAGACCCGACGCCTGCGGAAAACCAGGAATGCTTCCGTCTCATTCGCCAACATACCGTCACACCCATCGCGGTGGGTGAAGTCTTCAACAGCATCTGGGACTGCAAGCAACTGATTGAAGAGCAACTTATCGATTATATCCGTACCACGCTGACCCATGCAGGCGGAATTACCGGTATGCGCCGGATTGCCGATTTTGCTTCGCTGTATCAGGTACGTACTGGCTCACACGGTCCTTCCGATTTGTCACCAGTCTGCATGGCTGCGGCGCTGCACTTTGATCTGTGGGTCCCCAATTTCGGTGTCCAGGAGTACATGGGTTATTCCGAACAAATGCTTGAAGTCTTCCCGCACAACTGGACTTTCGATAACGGCTATATGCATCCGGGAGACAAACCGGGTCTTGGCATCGAATTCGATGAAAAGCTGGCGGCGAAATATCCCTATGAACCTGCTTATCTGCCAGTCGCACGTCTGGAAGATGGCACGCTGTGGAACTGGTAAGGAGTAAGATAATGAAAAGCATTTTAATTGAAAAACCGAATCAACTGGCAATTATCGAACGCGAAATACCCACCCCGTCAGCGGGTGAAGTACGAGTAAAAGTGAAACTTGCCGGAATTTGTGGTTCAGATAGCCATATTTATCGTGGGCATAATCCTTTTGCGAAATATCCGCGCGTCATTGGTCATGAATTCTTTGGCGTCATTGATGCAGTGGGTGAAGGCGTGGAAAGCGCCAGAGTCGGTGAACGCGTTGCTGTCGATCCGGTGGTCAGCTGTGGGCATTGCTATCCGTGCTCTATAGGTAAGCCGAACGTTTGTACGACACTGGCTGTATTAGGTGTGCACGCTGACGGTGGTTTCAGTGAATATGCCGTGGTTCCGGCAAAAAATGCGTGGAAAATTCCTGAAGCAGTGGCCGATCAATATGCGGTGATGATCGAACCTTTTACCATTGCGGCTAACGTTACCGGTCATGGTCAACCGACTGAAAATGATACCGTTCTGGTTTACGGTGCCGGTCCAATCGGCCTGACGATCGTTCAGGTATTAAAAGGCGTCTATAACGTTAAAAATGTGATTGTTGCCGATCGCATTGATGAACGACTGGAAAAAGCGAAAGAGAGCGGGGCAGACTGGGCGATTAATAACAGCCAGACACCGCTTGGCGAGAGTTTCGCTGAAAAAGGCATCAAGCCGACATTAATTATCGATGCGGCTTGTCATCCTTCTATCCTGAAAGAGGCTGTAACGCTGGCTTCTCCAGCGGCACGTATTGTATTGATGGGCTTCTCCAGTGAACCGTCTGAAGTGATTCAGCAAGGAATTACCGGAAAAGAACTCTCTATTTTCTCTTCACGCTTAAATGCAAATAAATTTCCGGTCGTTATCGACTGGTTAAGTAAAGGGTTAATTAAACCAGAAAAATTAATTACCCATACGTTTGATTTCCAGCATGTTGCTGATGCCATTAGTTTATTTGAACAGGATCAAAAACATTGCTGCAAAGTCTTACTCACTTTTTCTGAATAATACCAATAACGGCGAGTAAGTAGTACGCATCTTACCTCTTTTTTAGAGATAACCATTATGACAATAGAAAAACACGAAAGAAGCACTAAGGATTTGGTGAAAGCAGCAGTATCGGGATGGCTGGGCACTGCGCTTGAATTTATGGATTTCAAGAGTCATACGTGTCAACTATTTGATAAATATTAAATTAATTTTTCATTGCTTCGTTATGGGGCATGGTTGGGGCAAACTCGCTTAACTGTGTATTTAACAAAGCTACCTGTGCATTATTGTTTTCAGACATCCATTTTCCGTATACCTGAAATACCATTTGCGCATCTGCATGGCCCATCTGGTTTGCTATAAATGCCGGGTTAGCACCAGCTGTCAGCGACCAGCAGGCATAAGTATGTCTCGACTGATATGATTTTCGATGGCGGAGTCCGGCACGTTTTATCGCTGCGTCCCACATCTGCCTTATTGAGTCAACGGTAAAATGGTCACCATAATTTTTTACTCTCGCTGACACTTCAGGTTGAAAAACAAAGGTGCATTTTTGTTTTTCTGTTCTGCCATACTCTCTGAGGTGAACATCAATGATATGCTCTTTGCTCAGTCTCGTTAATGTCATCTGACTCCGGAGAGCGTCGATTGCTGGCTTAATAAGATGAATGACCCGATTGGTTCCCGCCTGTGTTTTTGGTACCGTGAAACGGTCTTTTGCTAAATTTCTCCTGATCATCATTGTTCCATTTTTCAGATCTATGTCCTCCCATCCAAGTGCACACAGCTCACCAGGGCGAACGCCAGTATAAACAGAAACACACCATAAATTTTTTGCTTGCTGATTTCTGCACGCATCGATAAGACGGATAAATTCCTCCCGCGAAAGAGGATCCGGAATGGTTCTTGATTCCTTTAATGGCGAGATCCCCTTAAACGGATTATCTGGCAGGTAACCGTTATCAACACCAAACTGGAACACGGCGTTAAGATTTGTCATGTAATTATTTACAGTTACAGCCGATCTCCCTGGTTGTGTAACAATATAGTTACTTTTGGGGATCTGGTATCCAGTCAGTAGCTCTTTACGAACCTCCAGTAATTTTTCTTTATTAATCGATGAGGCAAGATTTTTTTCACCGATTATGCTCAGGATATTTTTGATGACGGCACGGTATGTGTTGAGTGATGTTTTGGCGACTTCAGTTTCTTTCAGTGCCAGAAATTTTTCAGCCAGTTCTTTTATGGTTAAATCTTGTCGGGCCTCACCAAATTTTTCCAGATTGCGTGAGGAGGGAAACTGTTTTGCATAGTCGAAAACACCAGTTTTTATTGCGTAACAAACAGAGGAGCGTAGTTCACCTGCAACGCGCCTGTTTTTTGCTGTGTCAGGAACCCCCAGATTTTCCCTGACTCTTACGCCTTTATAAACAAACCAGATACGTAATTTCCCTCCATGGTTTTCCACGCCTGTCGGATATTTCATTTCAACTTCTCTCATTAGTTAGTGTGGCTTTTAGTCAAGTAAGATGACGTCTTGGTCTCGCTGATGCCTGGCGCTCAATCCAGCGATCAATTTCTTCCAGGTTGTAAAAGCATGGACTGTTATCCCATGGCATACTGTCATGAGCGACATGCTTATATTCCCTTCCTTCCATAAACGATTTTTCCCGGGCCTTTTTTAACGTACCTTTTTTTATTCCTTTCAGCGCAATTAACTGCTCTTCGGATACCCATTTGCCGGGAGAGACAATCATGATTACTTCGCTCATCGATTTCTTTATCTCTTACATCAGACGAGCGCCGGTTGCAGAATACCAGTCACAACCGGCGACAGTTGAACATTAAGAATCAGCCTGACTCGGGATCAGTTTTTGCCAGATAACTGAAACGTATTTTGCCTGGTAACGGGCGTCATCAAGTGCATTATGGCGCTCACCTTCGAATGGAATAGCCGTTCTGGCATCGAAGTCTATGGCTTTCCCCAGCTCAACGATTGTGCGTACATCGCGATCGTTGTAGTAACGCCACGGGCAGGGGATCCCCTGCCGTTCGTATGAACGGCGCAAAATCGTGTTGTCGAAGTTGGCTCCATTTCCCCAAACCTGAACAAAAAATTCACTGGAGTTTTCGTCGATAAATTCCCGCAATTGTAACAGTGCATCATCTAACGGGATTTCATCGGTCATAATGGCAGATTGCGCTTCGCGTGATTGCTTAAGCCACCATTTAATGGTGTCCCGATCAATGACTCCGCCAGCAGTTTCCAGATCGATAGTCTTACTAAATTCCGGTCCCATATCTCCGGTTTGCGGATCGAAAAATATTGCACCTATTGAGATGATCGGGGCATCAGGATTTTTTCCCATGGTTTCAAGGTCGATCATTAGATGGTCACACGTCCTGCTGGTGGATGTGATTTCGTGATGACCGTTCACCTTAATTGAGTGATCTGCCGTCTCGCCAGTTTCATTATCGCTATTGTGATGCTGATTGCCGCCAGTGTTCTCCTTGTGTGGATGTTCAGCGCCTTCCATTTCCTCCGGATCATCTTCCTGAACTTCAACCTGATACTCTTCATCGAATGTTTCTTGGTATGTTGCGTCGCCCATCACCGCGCCACAATCAGGGCAGTTGCCGCCGCCGGTCTGACCGCAGGCGGTGCAGACTTTTTCCACTTCCTGTTGCGCCACTGGTTCAGGCTGTTTCGTTTCTGGCTCGTTTTGTAACGCATTTGTGCTGTTTTGTTCCGCTTTTTGGTAGTTCCGTTCCGATTCATGCTGGTTCTGGTTCACAGAATCGCGGGTCTGGATCCCCTTAACCCATTTCGGATCATTCGGGTCACTAATCCCTTCAACAAATTCACCACGTGATGCAGCAAGCAACTTATCGGCGTCAGGCTGGCTGATATTGGCTGCCTGCATAATTTTGTTTACTTCGTCAGCGGTAACTTTTATCGGCTCTGGTTGTTCTGAATCTTCAGCGGTATCTACATTTTGCGGTAAGCCCGTGTATGTGCCATTTTTTCGGGCAAAATATTCTTCTTTTGTGATTTCAGTGGCGCCAGCAGCCAGTGCCTTATCCAGACCAGAAAGTTTGTTTGCGCGACCGTATTTTTCTCCGTCCTTATCTGCGAAGAGGAAATAGAACGGCCCCTCACGCTCTACAGATGGTTCAGCTTCCGGCGCGGTTTCATTTTTTGGGATATCAGATACCTCAGTTTCCACTGCATCAGTTTGTGTTTCTGATGACTGGAGAACATCAACAGTGCCCAGGTCTGTTTCTTCATTCTCAAACACGCCCTTTGTCGTCAGGTATTCGCAGATATATTTGTTCAGTGCTATGGGATCTTTGTGAATGTCGATCGGACGCTCACGGACAAGGCCAAAAATAGTCTGGCGGTCGTAGCGAAGGGCATCAGGCTGTTTGCGCATTGATGCCGAGATACGCTTCCAGTCTTCGCGGTCGTTGTCGATAACTTCTTTTTTTGCCCAGCGATGGATGCTGCCGTCAATGTTTCCGGCATCCACATCACCAGGCCAGAGAGCGTAGGCCAGTTCGTCATCCAGTGTTTTCCATGTCTGCTTGTATTCGCGATGAATGGCAGCAATGACCGGGCTGATTTTTCCTGTTGAATTTTCAGTGTACTGTTGATTGGCTCTGGCGCGGGCGAGATCAACAACAGACGTGTATTTTCCGGTTTCCTTGCGTTCACCTTCGCGACGTTTTTTCCAGATGCGCATCTCTGCCTGAATTTCGGGCCATTTAGCACCAGGAATACATTTATGCTTAACCCACCCAATGGCGTGCAACTTAAGCTCCGGATACATGGCGTTAACTTCTGGCATTTTCATCAACGCTTCAACGATATGTCCGTCGAATGTTGCCATGTCTTCCTGCAACAATTCCTGTGCGCTAATAACCATATCAACGGTGATGTTTTCACATGTGTCGAACTTAACCATGACAGCGTTCTGTACTTCAGGGGCCAGCTTGTCAAAAGTGACGTTCATCGGATCGGATTCAGTCTCAACCGGGACAAAAGAAGCAGACTCCTCATCCCAGCGGTTTTCCTGCATATATTCAGCATCCCATGAATCGAGGGCAGGGCGGGGTATGCCAGGTTTATCCTCGCAGACAATAAATTTATAAGCGCAGTCCTGAGCAGCCGGATAATGTTCCAGGAACTGCCAGTGAAATTTTGCTCGAGCACGGCGTTCGTCGCCAGCTTCAATGGCTGTGGCTACAGCCACAGCGCCTTCTTCCCTTGTTGCCAGTTCGTCAGGAATAGCGGCGCAAATAAAGACTTTACTCATTTTGTTTTAACCTCATGACAGATTTAAGGATGAACAAATCCCTGCCATTGCTGGCATATAAGAATGAAACCGGATATTTATTACGGAACTGTTTTAAAGACCTGCCGGGATTTCGATATTATCCTGGTGAATAACTTTATCGACCGGGTAACAGTTACCGGGAATTTTCTGTTCGGTTGCTGCTGTCATACACTCCTGCATTGTCCTGTGAACACTGACTGCAATATCAACTGGCTCTCCGGAAACAAGAAAAACTGTCAGAACAAGCGCAAATGCTGAATTCATTGTGCACATCCTTTTGGCATCAGACGTAAACGAGCCAGCATTGAAACAATGCATATTTTATTTAATAGCTCCCGTTCTTGTTTTCTCTTGTTAATGGCATCTTCAGTAAATACAGGGTTACTGATAGTGACACCAATTTCAAAACAATCTTCAGACGTATTAACGTTTGGTAATAACGTTTTCATTATCGCGTCCTCAACAATGAATTTTGTGATGCAGTGCCTGGTGCCTCCAGGTGACGTTAACCAGTTAACAATTAACGCCGGATACAGAGAATCCACCCATAACACTGTTTTTGGTTTTAACTGTTCCGCGTGCGCTTAGCCGCATTCACCGCATCACAAAATTCACTTTAAAAACGGCGGCAGAGCAGTCACGGAGTAAAACTGATACCGCCAAACGTCACCAGAAAATTGATAACAGAGGGCGTTGCAGCGGGGTTGTCACTTAAGCGTATGGTCAACCTGACAACTCGGTGTCCTCAACGGGGAAGGAATAACCCTGCCATACTTACCGCCGCGCCATTTCGCGGGTTGCCACAACCGGAAGCGCACGGTCGAATTAAATTTAACGACACCGTACAGTGAGACGAACTTCGCCGTGCGCTTTCGTGTTGTGTGCCTGCTTTTAACCACGTCAGGCGAGGTGGTATCCTTCTTATTCCGAATAACCAAGAAGGAAATCTATATGACTAAAGAAGAATTTGTCTCTTATATTTTTGATAAAACGGTTGAAATGTATGCCGCTACTTACGGGTCCTGTAATCCTCTGAATAAACCAGAGGGGAAAGATGATTTCGACAAAATTTACCGCTTCTTGGAGGACCGCTATATCAAAAGGTTAGAGGACGCAGGGATCAAATCCCCAGTGAAGTCACCATTGTCCTGAGAACTTGCAGGACGTCATGATCGTAACTTCCATCCAAACCGCGACGGCAAATTGCTTCTCGTATTACCGGAAGCAGTTCGCTGGAAATCTCGGTGCATATTTCACCTGATAATACGCCTGGCTCAAGTGAGAATATTGGTGAACTTACGGTCTTGGTCTCGACAGTTTCAGAGTCAGTGCCAACATTATAAAGCTCAACGAAAGCGGTCTTGATTTTCCGGGCCAGATCTTTTGCTGGCTCGCTTGCAATATCTTTCCCGATTTCTCGCAGCACAGAATGCAATGTATGAGCTGCTGTTTTCTGTACATCAGACGGTAAATCTTTAAATTCCATCGTCAGCCTCATCAGTCAGTGTTTCTGGCTAACCAGCAACGCGCGCCAGATTCGGTTTTAAACGTTTTGCTTTTGGTATATGTCATCGCGGTGAACGTACCGTCCTGGTTGGGGAACACGCCACATACCAGAGATTCGCTGTTGCCAAGATCGATAGTATCCATGCTGACCTCATTTCCCCTTAACGCCGGGGTCGCGGAACTGTTTGCTGAGAACACCGTGCGGTGTCTTGATGGAAAGTAATTTAGAATAACCTAACATGAGAGGCAAGTGTTTTTTGTTAGATTGATCTAACAAAAAGAGTGGGCGCAACTAATCACTTGAAAAGAATGTTATTTTATTGATTTATTTTTACGCGCTTTAAGCATTTCTTCGAAGAGTTTGTTGAAGTTTTCTACTCTTGCGCGCATTTCAGACAGCAAGGCTTCCTGCTCGGAAGATGGAAGAGCATCGAATAATTCGATCAATTCTTTGTGGTTGGGAGTTAGCTCTGTTTCCACATGAAGTTCTTGTGCAGGCACTGGTGCCTTGTCTTCGTCACCAAACATTAGCCATGTAGGTGAGCACTTCAGAGCATCCGCTAAAGCAAACAATCGTTTTCCGACTGGCTGGGTTTCGTCTCTTTCCCATTGTGAAATTGTGACGTGAGCAACTCCAGCGAGGCGCGCAGCTTCTCGTTGTGTTAAGCGTAATTCTTTTCGTCGCGCCAGAACTCGCTGGCCTAGGGTTCTTGTATCCATAGTTAGGTAATTCTAATTTTTCTTGACTTAGGTATCCCGCGCACAATAATGTTAGAAAAGTCTAACAAGAGGGGGCTTTGATGCTTAAAGTTGACGCAATTACTTTTTTTGGCAGCAAAACAAAGCTTGCCAATGCCGCAGGAGTGAGACTGGCAAGTGTTGCTGCTTGGGGGATACTGGTTCCTGAAGGTCGCGCGATGCGTCTACAGGAGGCATCTGGCGGGGAGCTTCAGTATGATCCCAAAGTTTATGACGAATATCGTAAGACGAAGCGGGCGGGGCGGTTGAACAATGAAAATCACTCCTGAACAGGCTCGTGAGGCTCTGGATGCCTGGATATGTCGACCAGGAATGACACAGGAGCAGGCGACGATATTAATCACTGAAGCATTCTGGGCTTTGAAAGAGCGCCCGAACATCGATGTTCAGCGTGTCACATATGAAGGTGGCGCGGTTGATCAGCGAGCGCTTAGCGTTAATCGAGTGAAGATATTCGAACGCTGGAAGGCTATCGACACCAGGGATAAGCGTGAAAAGTTCACAGCGCTAGTGCCTGCAATTATGGAGGCTATCCGGATTAGTGATTTCAGGTTGTATCGTGAGATCAGTGATGGAAAAAGTATTACGTACATGATCGCCGGGTTAAATAAAGAATATGGCGATGTGGTGGAATCCGGACTGCTTTTTGCAGATCCTGCCGTTGTAGATCGTGAAACTGACGAACTTATAGAAAAAGCAATTGCTTTCAAGCTTGCGTATCGACAGCAATACCAACAAAAAGCTGGATGGAATTATGAGTCTTCTTTTTGCTGAACGCCCACTGGTTATAAACACGCAGCTGGCAATGAAAATCGGCTTAAACGAAGCCATTGTTTTGCAACAACTGCACTACTGGTTGAGAGATACCAACTCCGGTATGGAATGTGATGGTGTTCGCTGGATTTATAACACAACGGAACAATGGCTGGAACAGTTCCCATTCTGGTCAGAGTCAACGTTAAAGCGCGCGTTTGCAAGTCTGAAAACGCTGGGGCTTTTGCGTTGTGAAAAGCTCAATAAATCAAAGCGCGATATGACCAATTTCTACACGATCAACTATGGGAGCGAGCTTTTAGATGGTGGCAAATTGAACGAATCCATCGGTTCAAAATGCGCCGCTCCATCAGGTCAAAATGACACGATGGAAGAGGTCAAAATGAAACGCTCCATTGGTTCAAAACGACCCAATGTCATCGGGTCAAAATGGCCTGATGATCTTACAGAGAATACAACAGAGATTACTACAGAGAATAAAAACACTTTTCGTCCGGAAGCTTCGCAACCGGACCCGCAGACGACTGAACAGGATTTTTTAACCCGGAACCCCGACGCGGTTGTGTTTAGTGTGAAAAAACGCCAGTGGGGTAGCAGGGAGGATCTGGCGTGTGCGCAGTGGATTTGGGGGCGGATCGTGAACCTTTACGAACAGGCTGCCAGCGACGATGGAGAGATCATGCGACCAAAAGAGCCTAACTGGACAGCCTGGGCCAATGACGTGCGCACAATGCGGATGCTGGATGGCAGAAGCCACAGACAAATTTGCGAAATGTTTGGTCGGGTACAGCGGGATCCATTCTGGGTAAAAAACATCATGAGCCCGTCAAAACTCCGCGAAAAATGGGACGAACTGGTCATCCGCCTGGGACGTTCACCTGTACAGCGTTGTGTGAATCATATTTCTGAACCGGATACAGAAATTCCGCCTGGTTTCAGGGGATAAGTGTTGATTTCAGGTCATGAGATAATTTTAAGGGGGACTTGTGGCAAAAGTTTTTACACAAGAAGAGCGGGAAAAAATTAAAGGGCAGGTTGTTGAACTCGTACGCCGGAGTGGGCGTGAGACGTTACGGCAACTGGAAGTCAAGACAGGTGCGACAAGATATCTGATGAGCGTTCTCGCAAGAGAGCTGGTTGCCAGCGGCGATGTATACAACTCTGGTTACGGGTTATTCCCGTCTGAACAGGCGCGTAAGGACTGGCAAAATGCCCGTAAAAAGCTCTCAAGGGCAAAGCTGAAGAAACCATCTGCGGTTGATCCGGACCTTATCTGGTCATTACCTGACGGAGAAATACGTCGCTACGACAGGCGTCAGAACATAATCTGCCGCGAGTGCCGGAAGAGTGAAGTTATGCAGCGCATACTGGCATTTTATCAGGGATAATGTTAGATATTTTAGACGTTACTAGATTAAAAAGCATTAGTTCAGGAGTGAATTGACATTCTCATTTTTCATGGCACAGGGTAGATCTGGCGTGGTTGTCCGCTTTGTGCCAACAGCGGACGTTACTAACGAAACTGTGAGTTAAAAACGGGAGCTGGTCACCTCCATCCCGCGAAATGGAATTCAAAAATGCCCTCATGTACGGTACTTCTAGTCTGATGTGAGCATTTTCGTGGATTTTTTCGAGCCACTCCAACGAATGATTACATTCAGTTTCGGGCACTAAAACGTCATAAATCCGCAAGTTGAGGAGTAGCGATCAGCCAATTGGTTAAAAAACAGCTTTAATTCCCTTTTTCAAGCAGGCTCCTTGTCGCACGTCTGGTGCATCCAATTCATTTCAGTTACCCTGATGCCATACTGGCACGGACTAGCTCTCTTAATTGGCGCTGGTTTGTTAACCTTGTCCGCTGTCGGGTCTTTGTAAGTTGTAATTCCTAAACTAGCATAATTTTATTATAAGGCCTCTTACTATGCACATTTTATCTGTTTCAAAAAAGGAAGTTGGAAGAGCTGGGGATAGATTAGTTTCTCAGTTTGGTACAGGTGAAAAATATAAAGAGGAAGATGTACAAATTCTTCATGAGTGGAGGATGTTGCATTTATATCCTTTAAGTAAAATACAATTTTATATGGAAAGAGAGGCTATTTCCTTAAACAAAAATGCGTTACTATCCTCTAGAATTAAGAGGATGCCTTCCATTGTTACTAAGCTTTCTAGATTTCCTGATATGAAGTTAAATAAGATGCAGGATCTTGGTGGGTGCCGGGCTATTTTAAATAATCTGGATCAAGTGTATGATTTAGTTAATAAAATAAAATCATCTAAATTCTCACATGAACTAGTTAGAATGGATGATTACATGATAGACGTGAAAGATTCTGGTTATAGAAGCTTTCATATGGTCTATTCATTCCAAAATAAAAAATTTCCATCTTTAAATGGGTTGCGCATTGAAATGCAAATAAGAACAGCTATTCAACATAGCTGGGCTACAGCAGTTGAAATGGTTGGTTTGTTTCGAAAGGAATCATTGAAATCTGGTTTTGGTGATGCAAGATGGCTTAGGTTTTTTGAATTGGTATCAGAGCTTTTTTATAAACTTGAGTATGAAAAAGAACCATCTGGGAGTTATATAAAAATATCAGAAGAATTAAGTTATTTATCCGTCGAGTTGAATGTTTTCGATATATTAGCTGCTTACAATGCCGTAGTTAGCCATATTGAGGGTAGCAAAAAATATGATAAAGGACTTTGCATAATTGTTGTTGATACCATAAAGCGGAATATAAATATAAAGAGCTTCGAAAATCACAATCATGCGAAGGCGGCAGAAGCTTATGTCGAGTCTGAAAAATACTGCGCGGAAAATAAAGGCTGTGAAGTGGCTATGGTTTCTGTTAGTTCAATTAGTGAATTAAAAAATGCATATCCTGCTTACTTTTTAGATACGAAAACTTTTTTAAATTATCTCAGTAGGTATGTTTTTATAAAATAAAAAATGTTATCTGGAGTGGTCTGAATTCTACTAACTTAGTTTTTTCAGACCGCTTCTGACCACAATGGCCTGTGAGGTTCTGCTGCTAGTCAACATAAAATAACAAAGCCGGCAGGTGGTGGTTGTTGTAAAGTTATTCAATGACATTCATGGAGGCTTAAGCGAATTATTTGTCCCTGGTCGGTATGGTGGCAACAGCTTGTTGCGAGAATTTGGCTAATTTGGACTGACTTCCGCTCCTTGCTCAAAGCGGACTAGAAGGTTAGCTTGTGTCGGACTTTGCGTATTAAAAGAAGTGCTGGTGGTGACTAGTGGTTGAGCCCCATTTCCACAGAAAAAATCAGAGAAACTATACCCAATAGTTGTATTGAATCACTGACGAGACAGCCTCATATTCATCAGGACTGGTGTACGTCCAATACAGGAGGTTGTGGTGCTGGTTCTCAAATGTGCGCTGGCTATTACGGCTGTAATGGCGATTTATTGTCTTGCTATTGTTCTTATGGATCGCCTTTCTGACTGATTTCATATTGGCGAGGTAAAGGTAGTTAAGTAGAATGGCTGCGGGTGCTTGAGGCTATCTGCCTCGGGCATGAACACCAAAGGCAGATAGAGAAAAGCCCCAGTTAACATTACGCGTCCTGCAAGACGCTTAACATTAATCTGAGGCCAATTTCATGCTAGACACATGTAGGTTAGCCTCTTACGTGCCGGAAGGCAAGGAGAAGCAGGCTATGAAGCAGCAAAAGGCGATGTTAATCGCCCTGATCGTCATCTGTTTAACCGTCATAGTGACGGCACTGGTAACGAGGAAAGACCTCTGCGAGGTACGAATCCGAACCGGCCAGACGGAGGTCGCTGTCTTCACAGCTTACGAACCTGAGGAGTAAGAGACCCGGCGAGGGAGAAATCCCTCGCCACCTCTGATGTGGCAGGCATCCTCAACGCACCCGCACTTAACCCGCTTCGGCGGGTTTTTGTTTTTATTTTCAACGCGTTTGAAGTTCTGGACGGTGCCGGAATAGAATCAAAAATACTTAAGTAGCGCGCAGGGATAAGAGGGATGGTCCCTTAAAGGGGAGAGCTAATTATCCGGAAGGATTCTGATGATGAACATCGAAGAACTGCGTAAAATTTTTTGTGAAGATGGCCTCTATGCTGTGTGCGTTGAAAATGGAAATCTTGTTAGTCATTACCGCATTATGTGTTTGCGAAAGAATGGGGCTGCGTTAATTAATTTTGTGGATGGTCGAGTGACAGACGGATTTATCTTGCGCGAAGGTGAGTTTGTCACTTCATTACAGGCACTGAAAGAGATCGGAATAAAAGCAGGCTTTTCAGCTTTTGCAGAAGAATAAACTCATCTACAATCTTGCGCGGGGCTGAACTCCCGCTGAGTAACACCGTGCCACCGGAGAAAACCGATGGCACGCAACGTAAAATATTACAATTCTGATAATTCGCCCGTTCTTGCCTGCACGCACGGGCGGTATTCTCACGCATTCAAGTCTGAATGGTTCCAGCACCCTCCATGCACTGCAGAACAGGCCGAATGGCTGATTCATTCTTACTGCAGGCGCGGGTTCGAGGTTAAGAAAGCTCTCAGTCTCGACTATCGGCACTGGATAATCTCTGTCAGGCTGCCTTATTCCGAACGCCCACCACGTGCGTCCCGCACTTTCCAGCAACGGATCTGGAGGTAACGTGCGGGTATTACTTAGACCTGTTCTGGTGCCTGAGCTTGGGCTGGTGGTCCTTAAGCCGGGCCGTGAATCCATACAGATATTTCATAATCCTCGAGTGCTGGTGGAACCGGAACCAAAAAGCATGCGTAATCTGCCATCCGGAGTCGTTCCTGCCGTTCGCCAGCCGCTGGCGGAAGACAAAACATTGCTGCCGTTTTTTAGTAACGAACGGGTGATTCGTGCTGCTGGCGGCGTTGGCGCATTGTCTGACTGGCTATTACGTCATATTACATCCTGCCAGTGGCCTAATGGCGATTACCATCACACTGAAACAGTCATTCACCGTTATGGTACCGGCGCAATGGTGTTGTGCTGGCACTGCGACAACCAACTGCGTGACCAGACATCGGAATCACTGGAGCTGCTTGCTCAACAAAATCTGACAGCATGGGTGATTGACGTCATCCGTCACGCAATAAGCGGTACGCAGGAGCGGGAATTATCTTTGGCTGAATTATCCTGGTGGGCGGTCTGCAATCAGGTGGTGGATGCACTACCTGAGGCTGTATCGCGTCGTTCGCTGGGATTACCAGCGGAAAAAATCTGCTCGGTGTACCGCGAAAGCGACATCGTACCGGGAGAGCAGACCGCCACCAGCATATTGAAACAACGCACAAAAAATCTTGCACCGTTGCCTTACGCCCACCAGCAACAAAAACCACCACAGGAAAAGACGGTGGTAAGCATCAACGTTGATCCAGAGTCTCCGGAATCTTTCATGAAGCTGCCTAAACGTCGCCGCTGGGTTAAGGAGAAATACACTCGTTGGGTTAAGACACAGCCGTGTGCTTGCTGCGGTATGCCAGCCGACGATCCGCATCATCTGATTGGTCACGGGCAGGGCGGAATGGGAACAAAAGCACATGATCTCTTTGTGTTGCCTTTGTGCAGAAAGCATCACAACGAGCTGCATACGGATACAGTGGCATTTGAAGAGAAGTATGGCTCCCAACTGGAGCTGATATTTCGTTTTATCGATCGCGCGCTGGCAACTGGCGTACTGGCGTAAGTGGAGAACGAGCATGAACCTTGAAGCCTTACCAAAATATTACTCCCCAAAATCTCCAAAATTGAGCGATGACGCTCCAGCGACAGGCACCGGTTGTTTAACAATTACGGATGTAATGGCAGCGCAGGGGATGGTGCAGTCGAAAGCACCACTTGGGTTGGCCTTATTTCTGGCAAAAGTTGGTGTTCAGGACCCTCAGTTTGCGATTGAAGGCCTGCTAAATTACGCGATGGCACTGGATAACCCGACATTGAACAAATTGAGTGAAGAAATCCGGTTACAGATTATTCCTTACCTCGTGAATTTTGCCTTTGCTGATTACTCCAGGTCTGCGGCAAGTAAGGCTCGCTGTGAGCATTGTTCAGGTACGGGATTTTATAATGTATTGCGCGAAGTGGTGAAACACTACAGACGCGGGGAATCTGTAATCAAGGAAGAATGGGTGAAGGAACTATGTCAGCATTGCCATGGTAAGGGCGAAGTCAGCACAGCGTGCAGAGGGTGTAAGGGTAAAGGGATTGTTCTGGATGAAAAAAGAACCCGGTTTCATGGCGTACCGGTATATAAGATTTGTGGGCGTTGTAATGGAAACCGATTTAGTCGTTTACCGACCACGCTGGCACGACGTCATGTCCAGAAGCTGGTACCAGACCTGACCGATTATCAGTGGTATAAGGGGTATGCGGACGTCATTGATAAACTGGTAACAAAGTGCTGGCAGGAAGAAGCATACGCGGAAGCGCAATTGAGGAAGGTGACGAGATAAATGATTTTTGCTGAAGATGGCGACATGATGTTTGCATTTTTCAAAAAACATGGATAAGATTCTCTCAACGATGGGCTTTGTGTATCCGACGTTTAGAAAAAAGTAGAAAACCCGCTTATAAGCGGGTTTTTGTGCTTTAAATGGGGCAATAGAGATATTGAATCTCATCCCGGGATAAACATTGGCAGTTGAAGGTCCACGCGAACCATTTATCCGGCAAAATTCCACGCGTAATCCTGTGGTAATTTCTTCTGCATCTCGAAGATTGAGAGCTGAAACGTGAAGCTGGGCATCGATACGCCATCGGATGGGAATATAAGACCTTTGCTGCTTTTGTAGTCAAAGTTTTTGACAATTCCTGTCATTTTAGGGGACAGAAAAACTCCTTAATACTGATAACCTGGTGCACCATACACACGTTCCTGGAGAAAACTACTTTTTTGATAGGGTTGAAGGTGGCTGGATGTCTAAAATAAACATTGCTTCATATGTTCAACTATGAGTTAATGACTGCGTCGGTTTGAAGAACAGACGATATACGAAGTAGTTTACTAAAGCAGTTCTCATTTCAGGTGTTATTCACTTATTCCTTCTTTGAGTCTCTCCAATTAAGTACGAAGTCGTTTCTGTTATGCAAACCATTTATGCCGAAAGGCTCAAGTTAAGGAATGTAGAATGTCAAATAAAATGACTGGTTTAGTAAAATGGTTTAACGCTGATAAAGGTTTCGGCTTTATTTCTCCTGTTGATGGTAGTAAAGATGTGTTTGTGCATTTTTCTGCGATTCAGAATGATAATTATCGAACCTTATTTGAAGGTCAAAAGGTTACCTTCTCTATAGAGAGTGGTGCTAAAGGTCCTGCAGCAGTAAATGTCATCATTACTGATTAAAATTCATCGCTCGTCTGTATACGATAACGAAGAAGGCTGATGCCTGAGTAGAGATACGGACAGAGTAGTGAATATTGGATCTCTTTAATAAAAAGTAAGGAGGTCCAATACATGAAACAATGGCCAGCATATTTGGCAAAATCTTAATCAGGAAAAGTATGCTAACCATTGTGGTGAAGTGCAGGTTTGCTGCATGAATAGTTTTACAGCAGAAGCTAACTGCTGGCATGGCAAAACAAAGTGCGTAAGTGGATGACTCCCACAAAAAGCACCACAATCTCAAACCCGCTCAGGCGGGTTTTTTATTATCTGCTTTAAATATATTATTAAAATATAAAAAATACTTGTTACTAATAAAATCAATCAGGCTACAGCTTTAAGATTTGTCTGGAATACTTTGTTGCAATGAGGGCAGATCAAAAGGGCACCTTTTTGTACTCTTGAAAAACTGTGTTCTGACTCTTGGGTGCAGTTTGGGCAGGAACATTTAACGAGATAATTACGGCGTGATTTTGAGTCTTTACGTTCTGACATAGGCTTTTCCTGTATAAATGGCCGTATACAGTACACTAAATATGAAAACATTTCTCGTATTATTATTTTATATATGACTTTCTTTCAAAATAATTACTCACATTTTTAATGTGTATGTTTCTTTAGCGCCGTTGAGAACAACGTGTGCTGTCAAAACTACCCCGTAGACTCCGATCTTTTCAAACATATTGCACCATCCGTGTACATCGGGGTGAGGATATGAAATCAATGGATAAGTTAACAACAGGTGTTGCCTATGGCACATCGGCGGGTAATGCTGGTTTCTGGGCATTGCAGTTACTCGATAAAGTAACTCCGTCACAGTGGGCTGCAATCGGTGTGCTGGGTAGCCTGGTTTTTGGCCTGCTGACGTATCTGACAAATCTTTATTTCAAGATTAAAGAAGACAGGCGTAAGGCTGCGAGAGGAGAGTAATCCAATGACTCAAGACTATGAACTGGTTGTGAAAGGAGTCCGTAATTTTGAGAATAAAGTTACGGTAACTGTAGCCTTACAGGACAAAGAACGCTTTGACGGTGAAATTTTTGACCTGGATGTCGCCATGGACCGTGTTGAAGGAGCTGCGCTGGAGTTTTATGAGGCAGCAGCCAGAAGGAGCGTCCGGCAAGTCTTCCTGGAAGTAGCAGAAAAATTGTCAGAAAAAGTTGAGTCTTATCTGCAGCATCAGTACTCCTTTAAGATTGAAAATCCTGCCAATAAGCACGAGCGTCCTCATCATAAATATCTATGAACACAAAAATCAGATACGGCCTGTCGGCTGCCGTTCTGGCGCTGATTGGTGCTGGCGCATCTGCTCCTCAGATACTTGACCAGTTTCTGGACGAAAAAGAAGGTAACCACACAATGGCATACCGCGATGGTTCTGGCATATGGACCATCTGTCGGGGTGCCACAGTGGTGGATGGAAAAACCGTTTTTCCCAATATGAAACTGTCGAAGGAAAAATGCGACCAGGTCAACGCCATTGAGCGTGATAAGGCGCTGGCATGGGTGGAGCGCAATATTAAAGTACCACTGACCGAACCACAAAAAGCGGGTATCGCGTCATTTTGTCCCTATAACATTGGCCCCGGTAAGTGTTTCCCGTCGACGTTTTATAAGCGGCTGAATGCTGGTGATCGTAAAGGTGCATGCGAAGCGATTCGCTGGTGGATTAAGGATGGCGGACGCGATTGCCGCATTCGTTCAAATAACTGTTACGGTCAGGTTATTCGTCGTGACCAGGAGAGCGCATTAACCTGCTGGGGGATAGAACAGTGAATCAGATATTCATGGTGATTTTTCTCGTGTTGTCAGGATTTATCGTCGGAAATGTCTGGAGCGACCGAGGATGGCAAAAAAAATGGGCGGAACGTGATGCTGCCGCATTATCACAAGAGGTAAATGCTCAATTTGCTGCTCGAATAATTGAACAGGGGCGAACTATAGCCCGTGATGAGGCTGTTAAAGATGCGCAACAGAAATCTGCTGAAATTTCTGCCAGGGCTGCTTATCTGTCTGATAGTGTTAACCAGTTGCGTGCCGAAGCAAAAAAATATGCCATACGCCTTGACGCAGCGAAGCATACCGCAGATCTTGCCGCTGCCGTCAGAGGCAAAACAACCAAAACCGCCGAAGGAATGCTCACCAACATGCTCGGAGATATTGCAGCAGAAGCTCAGCTTTATGCTGAAATTGCTGACGAACGCTACATCGCAGGAGTGACTTGTCAACAGATCTATGAATCTTTAAGAGATAAAAAGCATCAAATGTAGGGTAATATTAAATCGGAACATTTACATCGCGGAATGTAAAATTTAAATAAAAAGGACTCTTCCATGAGCCAAAATTCCTGAAATCTTAAGGGTAAGATAAAAGGTCTTAATCAGAATGACACGTTTTATTAATAAATAAAGCTATTCTTTCATTGCTGTGTTTTTCTTTACAAAAGTAATCCTTGCTATGGGTGGTTAATCATGCGTTAATGGTGTTCTGGTTTGTTACAAATTTATCTGAAGCAGTCATTTTTATAATTTTATTATTTGTACCTCTTGAGATTTCCTTGTTGGTTTTTCTCTCTGATATTTTTTTTTCGGACCATTCTGCCCAAGGGCTAATTTCTTCAAAAGGTAATAATTATGTCTAACAAAATGACTGGTTTAGTGAAATGGTTTAACCCTGAAAAAGGTTTTGGTTTCATCACGCCGAAAGATGGCAGCAAAGATGTGTTTGTCCATTTCTCAGCAATTCAGAGCAACGATTTCAAAACATTAACTGAGAATCAGGAAGTTGAATTTGGTATTGAGAACGGACCTAAAGGTCCTGCCGCTGTTCATGTAGTGGCGCTTTGAGGTAGACAATATTACAAACCATATTCACTTTAGATGCCCGTGTTGTCATGGTTCCCAGTATAGAACATCATCTTTTGATGTTTCTGACATGAATCCTTTCGGGGCAAAATGTATCTTTTGTAAATCAATGATGATTACATTTGATAATATTTCACAATACTTAAATGCCAGCCGTCTGTCGTTGGATTTAAAAAAGTGAAAATGAAGGCTCCTTCGGGAGCTTTTTTGCTTGGTGTCTATTCGATGGATACTCACATACTACGGTAACATCATGAAAAAAATCATAGTTTTTTTTAACTCTGAACCAGCAGTGGTAGTGTCAGCGATGACTGGAGTTAACACCATCATGCGTGAATATCCAAATGGCGAAAAAACACACCTTACTGTAATGGCCGCAGGGTTTCCATCTCTGACCGGAGATCATAAAGTCATTTATGTAGCCGCGGATCGACATGTTACTTCAGAAGAAATTCTGGAAGCAGCAATAAGGCTCTTGAGTTGATTTGATGCTATTGCATTGATAATTCAGGAAAATTCTCTTTGTCTGTTTGTGTAAAATTTAGACTATCGTATGTTGATTATTGCGATGTTTCATCTTATCTTTTACACGTTTGCACCATATAATCGACTTACTGTGTAACTGGAAAGTCATAACAGACTAAAAGAGGAAATGATGAATATTGAAGACTTAAAAACAAAAGCAGAAGCAGATATTTCTGAATATATAACAAAAAAAATTATTGAACTTAAGAAAAAGACCGGGAAAGAAGTTACCAGTATTCAGTTTACCGCACGGGAAAAAATGACGGGTCTTGAAAGCTATGATGTCAAGATTAATTTAATCTGATGTATTCAATAATAAAGTTTATCCATAAACCTCGTTTTTACGGGGTTTTGTTATATTTGAATGGTTCCGAATATCTAAATCACAATTGTTGATGGTTTTTATTAAACCAATGCAGTCCGGCTCAGGAGTGAGAGAAGCCGGACGTTATGGTTTAGCGTGGTAAGATCTGTGTAGTTTTCTGGATGCTTTCAGTAAATAGTAATGAGTTATCAAAGGCATAGTAATATCTTTGGTGTTCCTGGATATTTGTAACCCATCGGAAAACTCCTGCTTTAGCAAGATTTTCCCTGTATTGTTGAAATGTGATTTCTTTTGATTTCAACTTATTATAGGAGGTCTCTATAAGATGTTTGTTTCTGGAGAATTTAACATTTACAACCTTTTTGAGTCCTTTTACTAACACTATGTTGTCGTTTTCTAACACAATGTGAATATTATCTGTGGCTAAATAGTAAATATAAAGTGAGACATTGTGACGTTTTAGCTCAGAATAAAATAATTCACAGTTTAAATCTTTACGCACTTGATCGAATATTTCTTTAAAAATGGCAGCCTGAGCCATTGGTAAACCTTCCATGTGATACGATGGCGCGTAGTTTGCGTAACAAAGTTGAGCCTTGCTGGCATCCAGGAGGGATATGCAACCGACAGATGTATGTAAGGTCGATGTACTCAAACTTTCATACTTTTCCTCTTTTATGCAGAAAGATTTGAAGTAATATTTTAACCGCTAGATGACGAGCAAACGCATGGAGCGACAAAATGAATAAAGAACAATCTGCTGATGAACTCTCGTTGGATCTGATTCGTGTAAAAAATATGCTTAATAGCACCATTTCTATGAGTTACCCGGATGTTGTAATCGCATGTATAGAACATCAGGTGTCTCTGGAAGCATTCAGGGCAATTGAGGCAGCGTTGGTGAAGCACGATAAGAATTCGAAGGATTATTCCCTGGTGGTTGACTGAGCACCATAACTGCTAATCATTCAAACTATTTAACCTGTGACAGAGTCAATGTCGCATTCTGTCACTGTCAGGCTAATACAGAGCTGCAATTCAACTACTGCAATGTCCTCGTAATTAGGTGAATTTACATATCGTCCTGTTCGGATGCCGGCTGCATTGCTGAAGATGAGGCATTTATGGTTCGCATATTTTCCCCTCATGCTCGTCAGTCCTGTGCGTAGGAAGAAACAGGACACTCACACTAATTTGTGTGGGCATGCTGTGATGTCCTTCTGAATTATTCCTATGCCATTATGTAAAGCGCTGTATCAGATGCTCGTCACGGCTGTCAGGCTGTCGGGTCCTCCCGGTGGGGGCCCCTGCCACGGGGCGGGAGCGTCGCGGAAAAAGGCTAGTTTTTGAAATTTCATTCGTCATCACCACTACTGTAATAGATTGATATTACAGTGTTTTTATTTTTGTGGTGTCGATTTTGATTGTTTTTTGTTCATCACTAACACCGTTTGCCTAAAGTTGTTCGCAAGATGCATGTTTAAAACATTCTGGAGCGGGTATGGATCGAGAGTTAAAAAATCTGACGCTGAATATCAGTCAACTGGCGGCACTGTCAGGTGTACATCGCCAGACTGCTGCGGCAAGGCTGCAAAATCTACCCGTTGCAGGGGGGCATGAAAGCAACCTCAAGCTTTATCGGGTGGTTGATATTGTGTCGGCATTTCTGGCATTACCACCGCCGGTTGCAGAAAGCGAAATGGATGCGCATGAGCGCAAAGCCTGGTATCAGTCTGAACGTGAGCGTCTTAAGTTCGAACAGGAAACGGCACAACTCATTCCGGCCAGTGATGTCAGACGGGAGTTTGCCATCTGGGCAAAAGCGGTCGTGCAGGTGCTGGAGACATTACCGGATATTCTTGAACGTGACTGCGGTCTGCAGCCTGCCGCTGTGAGCCGTGTTCAGTCCATTATTGATGATCTGCGCGATCAGATAGCCCTGCGGGTGACTGAAGCAGGTGCGGATGATGAGGAGGAATTACAGCAGGAGGAGTAATGCTGAATCAGGAAACCGCAAAGGCAGCACGAACCGATTCAGGTTATATCCTTCGCGCACCGAGACGAATGCGGGTTGCTGATGCCGTTGCTCAGTATATGCGGGTGCCCCTGGGGGCAGGGAACTCAGTCCCGTGGGATCCGCTGGTGGCACCGTATGTTATTGAGCCGATGAACTGCCTGGCCTCGCGTGAATACGACGCAGTGATATTTGTTGGCCCGGCACGAACCGGCAAGACTATCGGCCTGATTGACGGCTGGGTGATTTACAACGTGATTTGCGATCCTGCTGATATGCTGATCATTCAGATGACGGAGGAAAAAGCCCGCGAACACTCCAAAAAACGACTCGCCAGAACGTTTCGCGTCAGCCCGGAAGTGGTCAGTCGCCTGAGTCCGAACAAAAATGACAACAACGTTTATGACAGAACATTCCTTGCTGGTAACTACCTGAAAATCGGCTGGCCGTCAGTCAATATCATGTCCTCATCAGATTATAAATGCGTCGCGCTGACGGATTATGACCGTTTTCCGGAAGATATTGATGGCGAGGGGGATGCTTTCTCTCTTGCCTCAAAACGTACCACAACATTTATGTCCAGTGGTATGACGCTGGTGGAGAGTTCCCCCGGCAGGGATGTGAAGGATGTGAAATGGCGACGGACTTCACCGCATGAGGCTCCACCAACCACGGGGATACTGTCGCTCTATAACCGTGGCGATCGCCGTCGCTGGTACTGGCCCTGTCCACACTGTGGTGAGTATTTTCAGCCCTGCGGCGATGTGGTTGCTGGTTTCCGTGATATTGCCGATCCCGTGCTGGCAAGTGAGGCGGCTTATATTCAGTGTCCTTCCTGTTCAGGACGGATTATGCCTGAACAAAAACGTGAGCTGAACGGACGTGGGGTCTGGTTGCGGGATGGTGAATCCATCAATGCGGATGGCAGTCGTTATGGTGATCCCCGACGCTCACGTATTGCGTCATTCTGGATGGAGGGTCCGGCAGCTGCTTACCAGACACTCTCGCAACTCGTTTACAAACTGCTTACTGCAGAACAGGAATACGAGACAACCGGAAGTGAAGAAACACTCAAGACGGTTATCAATACCGACTGGGGATTACCTTATCTTCCCCGCGCCAGCATGGAGCAACGAAAAAGTGAACTGCTTGAGCAGCGGGCAGAGCCAGTTCCTTCCCGCAGTGTGCCGGATGGCGTTAATTTTCTTGTGGCGACAGTGGATGTGCAGGCGGGACGTCATCGCCGTTTTGTGGTTCAGGTAACGGGCTATGGCAGCCGTGGCGAACGCTGGATTATTGATCGTTACAACATCACGCAGTCATTGCGCGGTGACAGCGACGGGGAGAGCCAGCGAATTGATCCGGCCAGCTATCCGGAAGACTGGGATGTCCTGCTGACGGATGTTTTTCATAAAAGCTGGCCGCTGGCCTCCGATCCTTCTCAACAAATGCGACTGATGGCAATGGCGGTGGACTCCGGCGGTGAAGACGGGGTCACTGATAATGCCTATAAATTCTGGCGTCGTTGCCGTCGTGATGGCCTTGGTAAACGTATTTACCTGTTTAAGGGCGACAGCATCCGGCGCGCAAAACTGATCACCCGTACATTCCCTGATAACACCGGACGAACGGGCCGACGGGCGCAGGCCGCAGGTGATGTGCCGCTCTGGCTTCTTCAGACGGATGCACTGAAAGACCGGGTGAATAACGCGTTATGGCGTGACTCGCCAGGTCCCGGCTATGTGCATTTCCCTGACTGGCTGGGGAGCTGGTTTTACGACGAACTGACGTATGAAGAGCGGAGCAGTGACGGGAAATGGAGTAAGCCGGGTCGCGGTGCCAACGAAGCTTTTGACCTGATGGTGTATGCCGAGGCTCTGGTCATTCTGCATGGATACGAAAAGATCCGCTGGCCGGATGCACCGGAGTGGGCGAGCCGGGAAACCTGGCTGGAGTGTGTCCCGGACAGTATCGAACCGTCACCCTCACCGGAACCGGTATCCACGCCTGTTAAAAAACAAAAACGGAAGAAAACAGTAACTGACGATGTTAACCCCTGGCTGACTTCCGGAGGATGGTTATGAACCAGAATGATATCGAAGCCATGATTCAGCGTTATACGGAAGCTGAAATGGCGGTGCTGGACGGAAAATCCGTCACCTTTAATGGTCAGCAGATGACCATGGAAAACTTATCTGAGATCCGGCAGGGACGGCAGGAGTGGGAGCGCCGCCTTGCGGCTCTGATTACACGACGACGGGGGCATCCCGGGTACCGGCTGGCGAGGTTCTGATGGCAATTCTTGATGATGTGATTGGCGTTTTTTCACCAGGATGGAAAGCGGCAAGGCTGCGTTCCCGGGCGGTGATCCAGGCTTATGAGGCCGTAAAAACGACGCGGACACACAAAGCCCGGCGGGAAAACCGAACTGCCGACCAGTTAAGCCAGTACGGGGCCGTGTCGTTACGTGAGCAGGCCCGTTACCTCGATAACAACCACGATCTGGTCATTGGTGTATTTGACAAGCTGGAAGAACGGGTGGTGGGGAAAAACGGGATTATTGTCGAGCCACATCCGGTATTACGCAATGGGGCCATTGCCCGTGATCTGGCTGCGGAGATTCGCACCCGATGGAGTGAATGGTCTGTCAGCCCGGAAGTCACCGGGCAGTTTACCCGTCCGATGCTGGAACGTCTGATGCTGCGTACCTGGCTGCGCGATGGTGAGGTGTTTGCCCAGATGGTTTCCGGGCGCATAAACAGCCTGACGCCTTCTGCCGGTGTTCATTTCTGGCTGGAGGCGCTCGAGCCGGACTTTATTCCCATGACCAGTGATGAGAGCAACAGGCTGAATCAGGGCGTGTTTGTTGATGACTGGGGGCGTCCCGAAAAATATCTGGTGTATAAAAGCCGTCCCGTATCCGGACGGCAGATGGAAACCAAAGAAGTGGATGCAGAGCGAATGCTGCATCTTAAATTTGTTCGCCGTCTGCACCAGATGCGCGGGACGTCTTTATTGTCCGGTGTGCTGATCCGCCTCAGTGCTCTGAAAGAGTATGAAGATTCTGAGCTGACTGCAGCAAGGATCGCCGCTGCTCTGGGGATGTACATCCGGAAAGGCGACGGGCAGAGCTATGAAGCGGATGGTAATGGCAGCAAGGATAAGGAACGCGAGCTTACCATTCAGCCAGGTATTATTTACGATGATCTGAAACCCGGCGAAGAAATCGGAATGGTGAAGTCGGATCGCCCCAATCCTAACCTTGAAACTTTTCGTAATGGTCAGTTGCGTGCCGTGGCGGCGGGCAGTCGTCTGAGTTTTTCCAGTACAGCGCGCAACTATAACGGCACTTACAGCGCCCAGCGTCAGGAGCTGGTTGAATCCACTGATGGCTACCTGATCCTGCAGGACTGGTTTATTGGTGCCGTCACCCGTCCGATGTATCGTGCCTGGCTGAAACAGGCTGTGGCATCCGGTGTTATCAGGCTACCCCGCGATCTTGACCGTTCTTCACTGTATACCGCGGTGTATTCCGGACCGGTGATGCCGTGGATTGACCCTGTTAAGGAGGCTGAGGCCTGGAAAATCCAGATTCGTGGTGGAGCGGCGACAGAATCAGACTGGGTACGTGCTGGTGGTCGTAATCCGGATGATGTCAAACGTCGGCGCAAGGCCGAAATTGATGAAAACCGCAAACTGGATCTGGTATTTGATACCGATCCGGCCAGTGATAAAGGAGGCAGCAGTGCCGCAACGAAACGACATGAGCCGCAGCACACCGACGACCAGTCCGAAGAATAATTCCTGGTTCAGGATGCAGGCTGGTCACCAGAGTGACGCGGATATTTATATTTATGACGAGATTGGTTTCTGGGGTGTTACAGCGAAGCAGTTTATCAGTGATCTGAATGCACTGGGCGATATCACCCACATTAATCTCCATATTAATTCACCGGGTGGCGATGTCTTTGAAGGCATCGCCATTTTTAATGCGCTGAAAACACATGGTGCGTCCATTACCGTTTATGTCGACGGTGTGGCGGCGTCAATGGCGTCGGTCATTGCGATGGTGGGAAACCCGGTCATTATGCCGGAAAACACCTTCATGATGATTCATAAACCATTTGGCTTTACGGGCGGTGATGCGGAGGACATGCGCACCTATGCCGACCTGCTCGATAAAGTTGAGGCGGTTCTGTTACCCGCTTATGCACAGAAAACCGGGAAAACCACCGATGAAATTGCTGCCATGCTGGCGGATGAGACCTGGATGTCCGGTGCCGAATGTCTGGCACATGGATTTGCTGATCAGGTAACGCCAGCCGTTAAGGCAATGGCATGTATTCAGTCAAAACGTACAGAGGAATTTAAAAAGATGCCGGAATCCATTCGAAACATGATTACTCCGCCACGCAACAGTGCTCCACGCGTACAGGATAATGAACCTGAAGCCTCCCGGACGCCAGTGCAGGCAGCAGCACCCGTGGTGGATGAAAACAGCATCCGTGCGCAGGTACTGGCAGAGCAAAAAGCGCGTGTAAACGGTATTAATGATCTGTTTGCCATGTTTGGCGGGCGTTATCAGACGCTGCAGGCTCAGTGTCTTGCCGATCCTGAATGTTCGCTGGAGCAGGCCCGCGAAAAGCTGTTGAACGAGATGGGGCGCGAGTCCACGCCATCCAATAAAAATACCCCGGCTCATATTTATGCCGGTAACGGTAATTTTGTGGGGGACGGGATCCGCCAGGCGCTGATGGCGCGTGCCGGATTTGAAAAAACCGAACGTGATAATGTCTACAACGGGATGACCCTGCGTGAATATGCCCGTATGTCACTGACTGAACGGGGTATTGGGGTTTCCAGTTATAACCCGATGCAGATGGTCGGTGCGGCGTTCACACACAGTACGTCTGACTTCGGTAATATTCTGCTGGATGTTGCGAACAAAGCCATTCTGCAGGGCTGGGAAGATGCCCCTGAAACCTATGAACAGTGGACGCGGAAAGGTCAGTTGTCTGATTTTAAAATTGCCCATCGTGTGGGTATGGGTGGCTTCAGTGCTCTGCGTCAGGTGCGTGAAGGGGCGGAATATAAATACGTCACCACCGGAGATAAACAGGCCACTATTGCACTGGCGACCTATGGCGAGCTGTTCAGTATCACCCGTCAGGCCATTATCAATGATGATCTGAATATGCTGACCGATGTCCCGATGAAACTGGGCCGTGCGGCGAAATCCACTATTGCCGATCTGGTTTATGCCATTCTGACGTCTAACCCGAAAATCTCCACAGATAATGTAAGTCTGTTCGATAAAGCGAAACATGCAAACGTACTGGAGAGCGCTGCAATGGACGTGGCATCGCTGGATAAAGCCCGCCAGTTGATGCGCGTTCAGAAAGAGGGGGAGCGTCATCTGAATATTCGTCCTGCGTTCGTACTGGTACCGACGGCGATGGAGTCTGTTGCTAACCAGGTCATTCGCTCCTCAAGTGTCAAGGGGGCTGACATTAACGCCGGTATTATTAACCCGGTGAAAGATTTTGCGACCGTTATTGCAGAGCCTCGTCTTGATGATAACAGCCAGACCACCTTCTACCTGGCTGCGTCAAAAGGCTCCGATACGATTGAAGTGGCTTATCTCAACGGTGTGGATACGCCATATATTGATCAGATGGAGGGCTTCAGTGTGGATGGCGTGACAACGAAAGTGCGTATTGACGCCGGTGTCGCGCCAGTTGATCACCGCGGTCTGGTGAAATGTACGGCGTAAACGTCGCAGACAACAACTCTGATGGCCCGTAAGGGCTTTTTTTGTACCTGAAATCAGCCCCTGAACGGGGCTGTGCGGAGACAGTTATGGCAAAGAATTTTGTAGAAGAAGGAAAAACGGTGGCGATTGTTGCCAGTGCAGCCATCAGCAGCGGAGATCTGGTGCAGGTGGGTGATGTTTTTGCGGTGGCGCTGACCGATATTCCACAGGGTGAAACAGGCGACGGCATGACCGAAGGTGTGTTTATCCTGCCTAAACTGAAAACGGATGACATGAAAACGGGTAAGAAGGTTTATCTGAAGTCCGGAAAAGTTCAGCTGACTAACAGCGGCTCTGATCCGCTGGTCGGGGTTGTCTGGGCAGATGCCGGAACCAGTGCAGAAGAAGTGCCGGTAAAACTCAATGTCTGATCCCTTTTCCCGGCTGGCAGCGCGTATGGATGCGATCACGGTCAGAAAGATGGGAAAGACAGCCTCGATTAATGATGCCGATATGACTGTGATCCCGGGCGAAACACTGGCAGAGCTGAATGCTCTGTCCGGACCTGCGGTCTCTCTGGTGGTGTTTTCTTCGGGATACCGCCCACGGCGCGGGGATCGCGTTGTTTATGACGGACAACAATGGACGGTCACACGGCATGAACGTTTTAACGGTAAGCCAATGATCTTTATTGAGTAAAGAGGTGTGGGATGAAGGGGCTTGAGAATGCCATCCGCAATCTGAACAGCCTTGATACCCGTATGGTGCCACAGGCCAGCGCATGGGCGATAAACCGTGTGGCACAGAAAGCGGTCTCGGTTGCCACCCGGCAGGTTGCCGGGAATACCGTTGCGGGAGATAACCAGGTGAAAGGGATCCCCCTGAAACTGGTACGTCAGCGTGTCCGGGTGTTTAAAGCCAGTCCGTCAGGAAAAATGACGGCCAGGATCCGCGTTAACCGGGGCAATCTGCCCGCCATTAAGCTGGGGACAGCCCGGGTCAGACTGGCCCGGCGTGGTGGAAAACTGCAGTACCGTGGCAGTGTGCTGAAGGTGGGTAAATATCTTTTCCGGGATGCGTTTATTCAGCAACTGGCGAATGGTCGCTGGCATGTGATGCGGCGTATTGATGGCAAAAATCGTTACCCCATTGATGTGGTGAAAATCCCGCTTTCCGGACCGCTGACACAGGCATTTGAAGATGCCCGCGACCGCATCATTGCTGCGGAAATGCCGAAACAGCTGGGGTATGCACTGAAACAACAACTGAGGTTATGGCTGACCCGATGAACCGACATACACAAATCCGCCAGGCCGTACTGGCACGCCTTCGGGAACAGTGTGGAGACAGCGCCACGTTTTTTGACGGGCTTCCGGCATTTATTGATGCGCAGGAACTGCCTGCCGTGGCGGTGTGGCTGAGTGATGCTCAGTACACCGGAAAAATGACGGATGAAGATGACTGGCAGGCTGTTCTGCATATTGCTGTCTTCATCCGGGCACAGGCACCGGATTCAGAGCTGGATATGTGGATGGAGAGCACCATTTTCCCGGCTCTGAATGATATACCGGCACTTTCCGGACTCATCGACACCCTGATCCCTCTCGGTTTTAACTATCAACGTGATAATGAGATGGCCACCTGGGCGATGGCGGAAATCACGTACCAGATCACGTACACGAATTAAAGGAGGTGGCAATGACCACACCAAATCCACTGGCAAAAACGAAAGGTGCGGGAACGACGTTCTGGATGTACACCGGCAAGGGCGATGCGTTTGCGAACCCTTTATCGGACACTGACTGGCTGCGTCTTGCGATGGTGAAGGATCTGCAACCTGGCGAAATGACCGCTGATGCAGAAGATGACACTTATCTCGATGATGAAGATGCAGACTGGAAAACGACAACACAGGGGCAGAAATCCGTCGGTGATACTTCGGCGACGCTGGCCTGGCGTCCGGGTGACAGCGGACAGAAAAAACTGGTTCAGTTGTTCGACTCCGGTGAAGTCTGCGCGTTTCGTATCAAATATCCCAACGGCACTGTTGATGTTTTCCGTGGCTGGCTGAGCTCACTGGGTAAAACCATTGCCTCAAAAGACGTGATGACCCGCACAGTGAAAATCAGCGGTGTGGGGCGTCCGTATCTGGCAGAGGAAGGCACTGAAACAGTGAGCGTTACCGGGCTGACGGTGGCACCGGCATCTGCCAGTGTAAAAGTGGGAGCAACCACCACGCTGACCTTTACAGTAAAACCTGACGGAGCCAGTGACAAAGCGATCAGTGTGCATTCGACAGATCCACAGACTGCCACGGTGACCCTGAACGGGCTTGTGGCCACGGTGAAAGGCGTGAAGCAGGGCAGTGTCAGCATTGTGGGCATGACTTCTGACGGCGATTTTGTGGCAGTGGCTGCGGTGGCTGTCAGCGCCGCAGGTTAACAGGACGATACTCATCATTTGCCCCGGTTATCCGGGGCTTTTTTGCAGGTGGAGAACATGATGTTTCTGAAACAGGGCACGTTTAATTATGAAAAGCAGTCCGTGGTGCTCAGTGAGCTGTCCGGGCTGCAGAGAATTGAATATCTGGCGTTTGTTCAGCAGCGAACGGCAAAGTTTGATGCCGAAGAGGGAGAACTGCCGGAGGCTGAACGACAGATTGCTTTTCTGCGGATGGGGATGGATATCAATGCCTGGCTGGTTTCCCGCTCACTGTGGAATGCGGAACAGTCTCAGGATGTTGAGACGCTTTGCGCATCCGTTATTACAACATGGTCGTATGATGCCCTGGGAGCGGGGGCGGAGATGGTTCTGTCGCTGAGCGGTATGGGAGCCATTGAGAATGCCGGGGATTTGGAGCATGAGGTGCTGACGCCGGAAAAGTCCTGACGCGGGAAATGCAGTTTGTCATGCGGCTTGCCCGGGAGTTCCGGCGGGCAGACTGGCGGCGGATGCTGTCGGAAATGTCGGCCACTGAGCTTGGTGAGTGGGGCGATTATTTCCGGATGCAGAGCTTCAGTGATGTGTGGATGGATGCGCAGTTTGCCTCGCTGAAGGCATTGATCGTGAGAATGGTGTCCGGTAGCAGTGATGCTGCGGTGGCTGATTTCAGCCTTTTACCGGAAGAGAACGGGATACCGGAGCGAACGGACGAAGAACTGATGCATCTTGGGGAAGGTATTTCCGGAGGTGTGCGTTATGGACCAGATAGCCAACCTGGTCATTGATTTGGGGATTGATGCGGCAGAGTTTAAAAATGAAATTCCCCGTATCAAAAACCTTCTGAATGGTGCAGCCAGCGATGCAGAACGGTCTTCTGCCCGTATGCAGCGTTTTATGGAGCGTCAGACTCAGGCCGCCCGGCAGACAATGCAGGCGGCTTCTTCGGCTGCAACAGCCGCATCCGTCCATGCGCAGACGGTGGAGAAGAGCGCACAGGCTCATGAACGCATGGCCCGCGAGGTGGAGCAAACCCGCCAGCGTATGGAGGCACTGAGCCAGAAAATGCGCGAGGAACAGGCGCAGGCCATGGCTCTGGCGGAGGCTCAGGATAAAGCGGCTGCCGCGTTTTATCGTCAGATTGACAGTGTGAAACAGGCCAGTGCGGGGCTGCAGGAATTACAGCGTATTCAGCAGCAGATCCGACAGGCCAGAAACAGTGGCGGGATTGGTCAGCAGGATTATCTGGCGCTGATTTCTGAGGTTACGGCGAAAACCCGTGTTCTTACGCAGGCTGAGGCAGAGGCTACCCGACAGAAAGTGGCGTTTATCCGTCAGCTTAAAGAGCAGGCAACCCGCCAGAATCTTTCTTCTTCTGAGTTGCTTCGTGCTAAGGCTGCCCAGCTGGGGGTAAGCAGTGCTGCAGAAGTGTATATCCGCAAAATGGAGCAGGCAGGAAAAGCCACGCATTCGCTGGGTCTGAAAAGTGCAGCGGCCCGCCAGGAGATAGGCGTTCTGATAGGTGAACTGGCTCGCGGCAATTTAGGTGCGCTGAGGGGATCCGGGATAACGCTGGCTAACCGTGCCGGATGGATAGACACACTGATGTCACCGAAAGGCATGATGCTGGGCGGGGTTATTGGCGGTATTGCCGCGGCCGTCTATGGTCTGGGTAAAGCCTGGTATGATGGTCAGAAGGAGGGGGAAGAATTTAACCGCCAGTTGTCGCTGACGGGGCATTATGCCGGAGTCACTGCCGGGCAGCTGTGGACGCTCAGTCGTGCTATTTCCGGGAATGGTATTACGCAACATGCTGCAGCCGGTGCGCTGGCTCAGGTGGTGGGGAGTGGTGCATTTCGTGGAAACGATATCGGTATGGTGGCGAGAGCTGCCGCACAGATGGAGCGATCGGTTGGCCAGTCGGTCAGCGATACCATAAATCAGTTTAAGCGGCTGAAGGATGATCCTGTAAATGCCGCGAAGGCTCTGGACAATGAGCTGCATTTTCTTACTGCCACTCAGCTTGAGCAGATACGCGTCCTTGGGGAACAGGGGCGGTCCAGTGATGCGGCACGGATAGCCATGTCTGCACTGGCAGAGGAAACCGGTCGGCGTACTGCGGATATTGATAATAACCTCAATGCGCTGGGCAGTACGCTGAAGTATCTGTCTGATTTATGGAGTAGTTTCTGGGATGCGGCCATGAATATTGGTCGTGAAGACTCGCTGGATGAACAGATTTCCGCTTTACAGGAGAAAGTGTCGCGGGCGAAAAGACTCCCCTGGACGGCATCATCTTCTCAGGTTGAGTACGATCAGCAGCGTCTTAACGAGCTTCAGGAGAAAAAACGCCAGAAGGATTTGCAGGATGCAAAAGAGCAGGCAGAGCGCAATTATCAGGAGCAACAGAAACGCCGTAATGCTGAAAATGCTGCACTGAACCGGATGAATGAAACGGAAGCAGCACGACATCAGCGTGAAATTGCGCGTATTAATGCCATGCAGTACGCCGATCAGGCTGTCAGGGATGCGGCGATACAACGTGAAAATGAACGTTACGAGAAAGCCCTGGCATCCGGTAAGAAAAAAACACGCGAACCCCGTAATGATGAGGCCACCCGGTTATTGCTGCAGTACAGTCAGCAACAGGCACAGGTGGAAGGACAGATTGCTGCTGCCAGACAGTCAGCAGGCATTGCCACGGAAAGGATGACAGAAGCGCATAAACAGCTTCTGGCTCTGCAGCAGCGCATCAGCGACCTGGACGGGAAAAAACTGACGGCAGATGAAAAGAGTGTGCTGGCCCGTAAAGATGAACTGATTCAGGCACTGACGCTGCTGGATGTAAAACAGCAGGAGCTTCAGAAACAGACGGCACTCAACGAGCTGAAGAAAAAAACAATTCAGCTGACCAGTCAACTGGCTGAAGAAGAGCGCGCTCAGCGTCAGCAACATGACCTGGATATCGCCACGGTGGGTATGGGTGATCAGCAGCGGCAGCGATATCAGGTACAACTGAGTCTTCGCCAGAAATACCAGCAACAGCTGGAGCAGTTGAGGCGGGATAGTGAGCAGAAAGGAACATATAACACGGATGACTACAGAAAGGCCGAGCAGGCGCTGACGGAGAGCCTGAACCGACAACTGAATGAGAATCGCCGTTACTGGCAACAGCTTGAAGTTGTGCAGGGTAACTGGAAAAACGGAGTCCTGCGTGCATTTCAGGATTTTACCGTGGATGCAGATAATACGGCAGGAACAGCAGAACAGGTGTTCTCGTCAGCCTTCAGCAACATGGGAAATGGCCTGGCAACTTTTGTCACTACCGGCAAACTCAATTTCAAATCCTTCACCTCTTCTGTGCTGTCAGATATGGCGAAAATCCTGGCGCAGGCAACCATGATGAAATCGATAAAAGGGATTGGCAGTGTACTGGGATTTGATCTCAGCAGCCTTTCCCTGAATGCCAATGGGGGGATTTATCAGTCTGCTGATTTGAGTCGTTACAGTGGCACGGTGGTTAACCGTCCGACGTTTTTTGCTTTTGCAAAAGGCGCGGGTGTGATGGGGGAAGCGGGACCTGAAGCCATTCTGCCACTGCGTCGTGATGCTGACGGTAAGCTGGGGGTTGTGGCGGATATTGGTGGTTCAGGTATGGCGATGTTTGCCCCGCAGTACAACATCGAGATCAATAACGATGGCACGAACGGGCAGATAGGTCCGGCTGCCCTGAAGGTGGTTTATGACCTCGGGAAAAAAGCAGCAGCGGACTTTATGCAACAGCAGGCCCGTGATGGTGGTCGATTAAGTGGAGCATATCGGTAATGGAGACGTTTCACTGGAAAGTGCGCCCGGATATGAATGTGGTATCAGAGCCGAAAGTGGTGACAGTGAAGCTGGGCGATGGTTATGAACAGCGTCGTGTGGCGGGACTGAATAACCAGTTGTCGACTTACAGCGTGACGATACGTGTTCGTAAATGTGAACACCCATCTTTAAAAGCCTTTCTGGAACGGCACGGTGGCGTCCGCGCATTTCAGTGGACGCCACCTTATGACTGGAAGCCGATCAGGGTGGTTTGTCGTAAATGGTCGGCGAGCGTAGGGGCGCTGTGGGTAACCATAACGGCAGATTTTGAACAGGTCGTGGCATAGGAGGCTCTGATGCAGGATATTCCACAGGAAACACATCATGAGACGACACGCCTCACTCAGTCAGCCCTGGTGGTGCTCTGGGAAATCGATCTGACAGAGGTCGGTGGAGAACGTTATTTTTTCTGTAATGAGCAGAACGAAAAAGGTGAGCCGGTTACCTGGCAGGGGCGGCAGTATCAGGCATACCCCATTCAGGGGACGGGATTTGAACTGAATGGCAAGGGCAGTGCTGCCCGTCCGACACTGACGGTTTCTAACCTGCACGGCATGGTCACCGGGATGGCGGAAGACCTGCAGAGTCTGGTCGGCGGAACGGTGGTCAGGCGTAAGGTTTACGCCCGTTTTCTGGATGCGGTGAACTTCGTCAACGGAAACAGCGACGCCGATCCGGAGCAGGAGGTGATCAGCCGCTGGCGCATCGAGCAGTGCAGCGAACTGAGTGCGGTCAGTGCCTCCTTTGTACTGTCCACACCGACGGAAACGGATGGTGCCGTTTTTCCGGGGCGCATCATGCTGGCTAATACCTGCACCTGGACCTATCGCGGTGATGAGTGCGGTTATCACGGTCCGGCTGTCGCGGATGAATATGATCAGCCGACGTCCGATATCACGAAGGATAAATGCAGCAAATGCCTGAATGGCTGTAAGTTTCGCAATAACGTCGGCAACTTTGGCGGCTTCCTTTCCATTAACAAACTTTCGCAGTAAATCCCATGACAGAGACAGAATCAGCGATTCTGGCGCACGCCCGGCGATGTGCGCCAGCGGAGTCGTGCGGCTTCGTGGTGAGAACGCCGGAAGGGGAAAGATATTTTCCCTGCGTGAATATCTCCGGTGAGCCGGAGGCGTATTTCCGGATGTCGCCGGAGGACTGGCTGCGGGCAGAGATGCAGGGTGAGATTGTGGCGCTGGTCCACAGCCACCCCGGTGGTCTGCCCTGGCTGAGTGAGGCCGACCGGCGGCTGCAGGTGCAGAGTGATTTGCCGTGGTGGCTGGTCTGCCGTGGGGAGATTCATAAATTCCGCTGTGTGCCGTATCTCACCGGGCGGCGCTTTGAGCACGGGGTGATGGACTGTTACACGCTGTTCCGGGATGCTTACCATCTGGCGGGGATTGAGATGCCGGATTTTCATCGCGAGGATGACTGGTGGCGTCACGGTCAGAATCTCTATCTGGATAATCTGGAGGCCACAGGGCTGTATCAGGTGCCGTTGTCAGCGGCGCAGCCGGGCGATGTGCTGCTGTGCTGTTTTGGTTCATCGGTGCCGAATCATGCCGCCATTTACTGCGGCGACGGTGAGCTGTTGCACCATATTCCTGAACAACTGAGCAAACGAGAGAGGTATACCGACAAATGGCAGCGACGCACACACTCCCTCTGGCGTCACCGGGCATGGCACGCATCTGCCTTTACGGGGATTTACAACGATTTGGCTGCCGCATCGACCTTCGTGTAAAAACGGGGGCTGAAGCCATCCGGGCGTTGTCCACACAGCTCCCGGCGTTTCGTCAGAAACTGAATGACGGCTGGTATCAGGTGCGCATTGCCGGGCGTGATGCAGGTGAAACCGAATTATCTGCCCGTCTTAATGAGCCGCTGGCAAATGGTGCCGTGATCCACATCGTGCCGCGTCTGGCGGGAGCTAAAAGTGGCGGTGTGTTTCAGGTGGTGTTGGGGGCGGCGCTGATTGCTGTGGCATGGTGGAACCCTGTGGGCTGGCTGGGTGCCGCGGCTGTATCGGGCATGTATGCGGCAGGGGCCAGTATGATCCTGGGTGGTGTGGCCCAGATGCTGGCACCGAAAGCCCGGACGCCCACAGCGACCAGCACGGATAACGGTAAGCAGAACACCTATTTTTCATCACTGGATAACATGGTTGCCCAGGGCAATGTTCTGCCTGTTCTGTACGGTGAAATGCGTGTGGGGTCTCGTGTGGTTTCTCAGGAGATCAGCACGGCAGATGAAGGGGACGGTGGTCAGGTTGTGGTGATTGGCCGCTGATGCAAAATGTTTTATGTGAAACCGCCTCCGGGCGGTTTTGTCGTTTATGGAGCATGACGAATGGGCAAAGGCAGCAGTAAGGGGCATACCCCGCGCGAAGCGAAGGACAACCTGAAATCCACGCAGTTACTGAGTGTGATCGATGCCATCAGCGAAGGGCCGGTTGAAGGTCCGGTGGATGGATTAAAAAGCGTGCTGCTGAACAGTACACCGGTGCTGGACAGTGAGGGGAATACCAACATCGCCGGTGTCACGGTGGTGTTCCGGGCAGGTGAACAGGAGCAGACACCGCCGGAGGGATTTGAATCCTCCGGATCCGAGACGGTGCTGGGTACGGAAGTGAAATACGACACGCCGATCACCCGGACCATCACGTCTGCAAACATCGACCGTCTGCGCTTTACCTTCGGTGTGCAGGCACTGGTGGAAACCACCTCAAAGGGGGACCGGAATCCATCGGAAGTCCGCCTGCTGGTTCAGATACAACGTAACGGTGGCTGGGTGACGGAAAAAGACATCACCATTAAGGGCAAAACCACCTCGCAGTATCTGGCCTCGGTGGTGGTGGGTAACCTGCCGCCGCGCCCGTTTAATATCCGGATGCGCAGGATGACGCCGGACAGCACCACAGACCAGCTGCAGAACAAAACGCTCTGGTCGTCATACACCGAAATCATCGATGTGAAACAGTGCTACCCGAACACGGCACTGGTCGGCGTGCAGGTGGACTCGGAGCAGTTCGGCAGCCAGCAGGTGAGCCGTAATTATCATCTTCGCGGACGCATTCTGCAGGTGCCGTCGAACTATAACCCGCAGACGCGGCAATACAGCGGTATCTGGGACGGAACGTTTAAGCCAGCATACAGCAACAACATGGCCTGGTGTCTGTGGGATATGCTGACCCATCCGCGCTACGGCATGGGGAAACGTCTTGGTGCGGCGGATGTGGACAAATGGGCGCTGTATGTCATCGGCCAGCATTGCGAGCAGTCGGTGCCGGACGGTTTTGGCGGCACGGAGCCGCGCATCACCTGTAATGCGTACCTGACCACACAGCGTAAGGCGTGGGATGTTCTCAGTGATTTCTGCTCGGCGATGCGCTGTATGCCGGTATGGAACGGGCAGACGCTGACGTTCGTGCAGGACCGACCGTCGGATAAGGTGTGGACCTATAACCGCAGTAATGTGGTGATGCCGGATGATGGCGCGCCGTTCCGCTACAGCTTCAGCGCCCTGAAGGACCGCCATAATGCCGTTGAGGTGAACTGGATTGACCCGAATAACGGCTGGGAGACGGCGACAGAGCTTGTGGAGGACACGCAGGCCATTGCCCGTTACGGTCGTAACGTCACGAAGATGGATGCTTTTGGCTGTACCAGCCGGGGGCAGGCACACCGCGCCGGGCTGTGGCTGATTAAAACGGAGCTGCTGGAAACGCAGACCGTGGACTTCAGCGTGGGCGCAGAAGGGCTTCGCCATGTACCGGGCGATGTCATTGAAATCTGCGATGATGACTATGCGGGCATCAGCATCGGCGGGCGCGTGCTGGCGGTGAACAGCCAGACGCGGACACTGACGCTCGACCGTGAAATCACGCTGCCATCCTCCGGTACCACGCTGATAAGCCTGGTTGACGGAAGTGGCAATCCGGTCAGCGTGGAGGTCCAGTCCGTCACCGACGGCGTGAAGGTAAAAGTGAGCCGTGTTCCTGACGGCGTTGCCGGATACAGCGTGTGGGGGCTGAAGCTGCCGACGCTGCGCCAGCGCCTGTTCCGCTGCGTGAGTATCCGTGAGAACGACGACGGCACGTATGCCATCACCGCCGTGCAGCATGTACCGGAAAAAGAAGCCATCGTGGATAACGGGGCGCACTTTGACGGCGACCAGAGCGGCACGGTGAACGGGGTCACGCCGCCAGCGGTGCAGCACCTGACCGCAGAAGTCACCGCAGACAGCGGGGAATATCAGGTGCTGGCGCGCTGGGACACGCCGAAGGTGGTGAAGGGGGTGAGTTTTATGCTTCGCCTGACCGTGGCCGCGGATGACGGCAGTGAGCGGCTGGTCAGCACGGCCCGGACGACGGAAACCACATACCGCTTCACGCAACTGGCGCTGGGGAACTACAGGCTGACAGTCCGGGCGGTAAATGCGTGGGGACAGCAGGGCGATCCGGCATCGGTATCGTTCCGGATTGCCGCACCGGCAGCGCCGTCACAGATTGAGCTGACGCCGGGCTATTTTCAGATAACTGCCACGCCGCATCTTGCGGTTTATGATCCGACGGTACAGTTTGAGTTCTGGTTCTCGGAAACGCGGATTACCGATATCAGGCAGGTTGAAACCACAGCCCGCTACCTTGGCACGGGGCTGTACTGGATAGCCGCCAGTATCAATATCAAACCGGGCCATGATTATTACTTTTATATCCGCAGTGTGAACACCGTTGGCAAATCGGCATTCGTGGAGGCCGTCGGTCGGGCGAGCGATGATGCGGAAGGTTACCTGGATTTTTTCAAAGGCAAGATAACCGAATCCCATCTCGGCAAGGAGCTGCTGGAAAAAGTCGAGCTGACGGAGGATAACGCCAGCAAACTGGAGGAGTTTTCGAAAGAGTGGCAGGACGCTAACGATAAGTGGAATGCCATGTGGGGCGTCAAAATTGAGCAGACCAAAGACGGCAAACATTATGTCGCGGGTATTGGCCTCAGCATGGAGGACACGGAAGAAGGCAAGCTGAGCCAGTTTCTGGTTGCCGCTAACCGTATCGCGTTTATTGACCCGGCAAACGGGAATGAAACGCCGATGTTTGTGGCGCAGGGCAATCAGATATTTATGAACGACGTGTTCCTGAAGCGCCTGACGGCCCCGACCATTACCAGTGGTGGAAATCCACCGGCATTTTCCCTGACGTCAGACGGAAAGCTGACCGCTAAAAATGCGGATATCAGTGGCAGTGTGAATGCGAACGCCGGGACGCTCAACAATGTCACGGTAAATGAAAACTGTACGATTAAGGGCATGCTGGAGGCGACTCAGGTCAGAGGTGACTTCGTTAAAGCTGTATCCAAATCATTCCCGAAACAGGCTGGTACGTGGGGTAACACGGAAACACCAAACGGGACGGTTACAGTCACCATCAGTGATGATCATAACTTTGACCGTCAAATCATTATTCCGCCCATTATCTTTAACGGAATAGCGTATAGCGATCCGGGAAGTGGTAATAACCCGGGAGGTACAAGATACACGGGTTATGGTTTTGAAGTTCGCAAAAACGGTGTATTAATCGCATCCAGAGAAACTAAAGGGGCCATTCCCGGTAGCTACAGTGCGGTTATTGATATGCCGAGTGGCAGGGGAAGCGTCACTCTGGAGTTTAAGGTTTTCCATAAAGGCAATCAGTGGGCAGGTAATATCACCGACTGTACGGTGATTGTGACCAAAAAAGCCGCTTCCGGCATCAGTATTCGTTGAAATTGTTATAACCCATATAAGGGCACCAGAAATGGTGCCTTTTTTATTGCAGAAAAGCGAGAGGTAATTATGCGTAAAGTTTGTGCAGCAATTTTGTCCGCAGCCATTTGTCTGGCCGTATCCGGTGCGCCTGCATGGGCATCTGAACATCAGTCCACGCTGAGCGCGGGGTATCTTCATGCCTCGACCAACGTTCCCGGCAGCGATGATCTGAACGGGATTAACGTGAAATACCGTTATGAGTTTACGGATACGCTGGGGATGGTGACGTCATTCAGCTATGCAGGAGACAAGAATCGCCAGCTGACCCGTTACAGCGATACCCGCTGGCATGAAGATTCTGTGCGTAACCGCTGGTTCAGCGTGATGGCGGGGCCCTCTGTGCGCGTGAATGAATGGTTCAGTGCGTATGCGATGGCGGGCGTGGCTTACAGCCGTGTGTCGACTTTTTCCGGGGATTATCTCCGCGTAACTGACAACAAGGGGAAAACGCATGATGTGCTGACCGGAAGTGATGACGGTCGCCACAGCAACACGTCTCTGGCGTGGTGGGCTGGCGTGCAGTTTAACCCGACCGAATCCGTGGCCATTGATATTGCTTATGAAGGTTCCGGCAGTGGTGACTGGCGCACTGATGGTTTCATCGTGGGTGTCGGTTATAAGTTCTGATTAGCCAGGTAACACAGTGTTATGACAGCCCGCCGTTTCAGGCGGGCTTTTTTGTGGGGTGAATATGGCAGTAAAGATTTCAGGTGTACTGAAAGACGGCACAGGAAAACCGGTACAGAACTGCACAATCCAGCTGAAAGCAAAACGTAACAGCACCACGGTGGTGGTGAACACGCTGGCATCTGAAAATCCGGATGAAGCCGGGCGTTACAGTATGGACGTTGAGTACGGTCAGTACAGCGTTATTCTGTTGGTGGAGGGATTCCCGCCGTCACATGCCGGGACCATCACCGTGTATGAAGATTCTCAACCCGGTACGCTGAATGATTTTCTCGGTGCCATGACGGAGGATGATGCCCGTCCGGAGGCACTGCGCCGTTTTGAGCTGATGGTGGAAGAGGTGGCGCGTAACGCGTCCGCAGTGGCACAGAACACGGCAGCCGCAAAGAAGTCAGCCAGCGATGCCGGCACATCTGCCCGTGAGGCGGCAACCCATGCGACTGATGCTGCAGACTCAGCACGCGCAGCCAGCACGTCAGCCGGACAGGCCGCGTCGTCGGCTCAGTCAGCGTCTTCCAGCGCAGGAACGGCATCAGCAAAGGCCACTGAAGCGGAAAAAAGTGCTGCCGCTGCAGAGTCCTCAAAAAGCGCGGCAGCTACCAGTGCCGGTGCCGCGAAAACGTCTGAAACGAATGCTTCAGCGTCACAACAATCAGCCGCCACTTCTGCATCCACCGCGACCACGAAAGCGTCAGAAGCAGCCACTTCAGCACGGGATGCGTCGGCCTCAAAAGAGGCAGCGAAATCATCAGAAACGAACGCAGCCTCGAGCGCCAGTAGTGCCGCTTCCTCGGCAACGGCGGCAGCAAATTCTGCGAAGGTGGCAAAAACGTCCGAGACGAACGCCAGGTCTTCTGAAACGGCAGCGGGACAGAGCGCCTCAGCTGCGGCAGGCTCAAAAACAGCGGCTGCATTATCTGCCAGTGCCGCGTCAACAAGTGCCGGGCAGGCCTCAGCCAGTGCCACCGCCGCCGGAAAATCGGCAGAAAGTGCTGCATCGTCTGCTTCAACAGCCACAACGAAGGCTGGCGAAGCCACTGAACAGGCCAGCGCAGCAGCGAGTTCTGCTTCCGCAGCGAAGACATCCGAAACGAACGCGAAAGCGTCGGAAACCAGCGCAGAATCCTCAAAAACGGCTGCCGCATCGTCAGCCAGTTCGGCGGCGTCATCGGCATCATCTGCGTCTGCTTCAAAAGATGAGGCGACCAGACAGGCGTCAGCAGCAAAGGGCAGCGCCACGACGGCATCCACGAAGGCGACAGAGGCAGCTGGCAGTGCGACGGCGGCAGCTCAGAGCAAAAGTACGGCGGAATCCGCGGCAACGCGCGCCGAGACAGCGGCAAAACGGGCAGAGGATATTGCATCCGCCGTGGCGCTTGAGGATGCGAGCACGACGAAAAAGGGGATAGTCCAGCTAAGCAGCGCGACCAACAGCACTTCCGAGTCACAGGCGGCAACGCCAAAAGCCGTTAAGGCCGCGTATGAGCTGGCTAACGGGAAATACACCGCACAGGATGCAACGACAGCACAGAAAGGGATAGTTCAGCTTAGCAACGCGACCAACAGCACATCTGAAATGCTGGCGGCAACGCCAAAGTCGGTAAAGGCAGCCTATGACCTTGCTAACGGGAAATATACTGCTCAGGACGCTACGACAGCACAAAAAGGAATTGTCCAGCTCAGTAGTGCAACCAACAGCGCATCTGAAACGCTTGCCGCGACACCGAAAGCAGTGAAAGCAGCTAATGATAATGCGAATGGTCGGGTACCTTCTGCCCGTAAGGTGAATGGTAAGGCGCTTTCAGCGGATATAACACTGACGCCGAAAGATATTGGTACGCTTAACTCAACAACAATGTCATTCAGCGGTGGTGCTGGTTGGTTCAAATTAGCAACGGTAACCATGCCACAGGCGAGTTCTGTTGTTTCAATTACGTTGATTGGTGGCGCGGGATTTAACGTGGGGTCACCTCAACAGGCAGGTATATCTGAACTTGTTTTGCGTGCAGGTAATGGTAATCCGAAGGGGATTACTGGTGCTTTATGGCAGCGCACATCGACAGGGTTTACAAATTTTGCCTGGGTCAATACATCTGGTGATACTTACGATATTTACGTTGCAATCGGAAATTATGCGACTGGTGTAAATATTCAATGGGATTATACCAGTAATGCCAGCGTGACGATTCATACGTCACCAGCATATTCTGCTAATAAGCCGGAAGGGTTAACGGACGGTACAGTTTATTCACTCTATACGCCATCAGAGCAGTTTTATCCGCCTGGCGCACCAATCCCGTGGCCATCAGATACCGTTCCGTCTGGCTATGCCCTGATGCAGGGGCAGGCTTTTGACAAATCTGCATACCCGAAACTTGCAGCGGCTTATCCGTCAGGCGTGATCCCTGATATGCGTGGCTGGACGATTAAGGGCAAACCTGCCAGTGGTCGGGCCGTATTGTCTCAGGAACAGGACGGCATTAAATCGCATACCCACAGCGCCAGCGCATCCAGTACGGATTTGGGGACGAAAACCACATCGTCGTTTGATTACGGCACTAAATCCACGAATAACACTGGTGCGCATACCCATAGTTTAAGTGGCAGCACGAATGCGGCTGGTAATCACAGCCATAGAGATGGCCGTCGATTTAACCCCAGTGTTTTTAAAGATACTTATCAATATGGTTATACAAGCTCAGGTCAAAATACCTGGGGTGTACAAGGCTCAGTAGGTATGTCTACGGGGTGGTTAGCGAATACCAGTACAGATGGTAATCATAGCCACTCACTGTCCGGCACAGCAGCATCTGCAGGTGCACACGCGCATACTGTCGGTATTGGTGCTCATACGCACTCCGTTGCGATTGGTTCACATGGACACACCATCACCGTTAACGCTGCTGGTAACGCGGAAAACACCGTCAAAAACATCGCATTTAACTATATTGTGAGGCTTGCATAATGGCATTCAGAATGAGTGAACAACCACGGACCATAAAAATTTATAATCTGCTGGCCGGAACTAATGAATTTATTGGTGAAGGTGACGCATATATTCCGCCTCATACCGGTCTGCCAGCAAACAGTACCGATATTGCACCGCCAGATATTCCGGCTGGCTTTGTGGCTGTTTTCAACAGTGATGAGGCATCGTGGCATCTCGTTGAAGATCATCGGGGAAAAACCGTCTATGACGTGGCTTCCGGCGACGCGTTATTTATTTCTGAACTTGGCTCGTTACCGGAAAATGTCACCTGGTTATCCCCGGAAGGGGAATATCAGAAGTGGAACGGCACAGCCTGGGTGAAGGATACGGAAGCAGAAAAACTGTTCCGGATCCGGGAGGCGGAAGAAACAAAAAACAGCCTGATGCAGGTAGCCAGTGAGCATATTGCGCCGCTTCAGGATGCTGTGGATCTGGAGATCGCAACGGAGGAAGAAACCTTGTTGCTGGAAGCCTGGAAAAAGTATCGGGTGTTGCTGAACCGTGTTGATACATCAACTGCACCGGATATTGAATGGCCAGCCCTACCGTATGGTAAGATATATAAATTCTATAATTAGAAGTATCTTTCCATTTAAGGCTAGGAAGGGGGGCTTGGAAAACGTAAGGAATCTCACACCGAGATTATTTTTATATATCAGGCGTCTGATTTTTTGCTTTAGCTCTTAAAATGGTTTGCCGCGAGGTTTTGAATTCTTGGGCAATGTCACTTATATTTATACCTGACTTAATTTGCTCTAGTACCCTCTGTATTTGTTCATCATTTAACACAGGTGGGCGACCAAATCGCTTTCCTGCGCCCCGTGCTCTTAATATCCCTGAATGAGTACGCTCAAGTAAAAGGTCTCGCTCAAATTCAGCGACTGCTGAAATTACTTGCATCATCATTTTTCCAGGTGGACTGGTCAGGTCAACGCCCCCCAATGCTAAGCAATGCACTTTGATACCTGCTTTGGTCAGTTGTTCCACCGTTTTACTGATATCCATTGGATTACAACCGAGGCGATCCAGTTTTGTGACAATCAATGCATCACCTCTTTTCAGTCGATCAAGCAACTGGTTAAAACCAGGACGCTCACTGGTTGCTGCTGAGCCGCTAATTTGTTCTTCAATTATTTGCTGAGATTTGATGTTAAAACCTGCACTTTCGATTTCCCGACGTTGATTTTTGGTTGTCTGTTCCAGAGTTGATACCCGACAGTAAGCATAAATTCGAGACATAGTGAGATCTTCTATACGAAATTGGTGTACATATCATAATGCATCTCAGAAAATAATTTTGATTATTTTTGTACATATTTGTATGTACACGTTCGAAAATAAACGAATGCGTATGCAACCCCGTAATTTTGGTGAGACCCAAAATCGATTTTGTGAAAAATGGCCTTAACTCGGTTTGTTTTTCGAGTTCCGGGCGGACTCAAGGAAGAAGAATAGTGTTGCGTGTTATTTTAACCAGATTTCAAGTTGTTTGGTCGTGGAAAAGTGGAGCAAAATGTTGTTAAAGTGGAAAAATGATAAAAAAGTAAGTTTATTATATTACATTTTACCATTTAAATTTTTGTTGTCTTTAAGAACTGATATCGCTGTTTGTAATAATTCTTTGTTATCCAGCCATGACTTTTTCTTTATGTTTCCTTCAATGTAATCAAGCAATGTTCTGGTATTGATAGGTCTTCCCTGTTTTGCTACTTCCACTACAGCATCCCCTAGGATAATTCTTACTTCAGGAAGCTGCGCAGGGAACCACTTTAGGGTGTCTTTTGATTTCAT
Protein sequences of DBSCAN-SWA_7 >CP029164|3221809:3273222|3224847_3225558_+|AWH72530.1|DBSCAN-SWA MKYKLLPCLLAIFLTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEVTKRVSGTLSEEGCFDSLELLDLENNTVVALVLDANYYRDAQTLEKRVRLQGKCQLAELPSAGVSWETDDNGFVIKASSKQMQMEYRYDDQGYPLGKTTKSNDKTLSVSATPSTDPIKKLDYTAVTLLNNQRVGNVKQSCEYDSHANPVDCQLIIVDEGVKPAVERVYTIKNTIDYY >CP029164|3221809:3273222|3249873_3251976_+|AWH70836.1|terminase|DBSCAN-SWA MLNQETAKAARTDSGYILRAPRRMRVADAVAQYMRVPLGAGNSVPWDPLVAPYVIEPMNCLASREYDAVIFVGPARTGKTIGLIDGWVIYNVICDPADMLIIQMTEEKAREHSKKRLARTFRVSPEVVSRLSPNKNDNNVYDRTFLAGNYLKIGWPSVNIMSSSDYKCVALTDYDRFPEDIDGEGDAFSLASKRTTTFMSSGMTLVESSPGRDVKDVKWRRTSPHEAPPTTGILSLYNRGDRRRWYWPCPHCGEYFQPCGDVVAGFRDIADPVLASEAAYIQCPSCSGRIMPEQKRELNGRGVWLRDGESINADGSRYGDPRRSRIASFWMEGPAAAYQTLSQLVYKLLTAEQEYETTGSEETLKTVINTDWGLPYLPRASMEQRKSELLEQRAEPVPSRSVPDGVNFLVATVDVQAGRHRRFVVQVTGYGSRGERWIIDRYNITQSLRGDSDGESQRIDPASYPEDWDVLLTDVFHKSWPLASDPSQQMRLMAMAVDSGGEDGVTDNAYKFWRRCRRDGLGKRIYLFKGDSIRRAKLITRTFPDNTGRTGRRAQAAGDVPLWLLQTDALKDRVNNALWRDSPGPGYVHFPDWLGSWFYDELTYEERSSDGKWSKPGRGANEAFDLMVYAEALVILHGYEKIRWPDAPEWASRETWLECVPDSIEPSPSPEPVSTPVKKQKRKKTVTDDVNPWLTSGGWL >CP029164|3221809:3273222|3249379_3249874_+|AWH70835.1|DBSCAN-SWA MDRELKNLTLNISQLAALSGVHRQTAAARLQNLPVAGGHESNLKLYRVVDIVSAFLALPPPVAESEMDAHERKAWYQSERERLKFEQETAQLIPASDVRREFAIWAKAVVQVLETLPDILERDCGLQPAAVSRVQSIIDDLRDQIALRVTEAGADDEEELQQEE >CP029164|3221809:3273222|3237753_3238155_+|AWH70817.1|DBSCAN-SWA MAKVFTQEEREKIKGQVVELVRRSGRETLRQLEVKTGATRYLMSVLARELVASGDVYNSGYGLFPSEQARKDWQNARKKLSRAKLKKPSAVDPDLIWSLPDGEIRRYDRRQNIICRECRKSEVMQRILAFYQG >CP029164|3221809:3273222|3271406_3272009_+|AWH70853.1|tail|DBSCAN-SWA MAFRMSEQPRTIKIYNLLAGTNEFIGEGDAYIPPHTGLPANSTDIAPPDIPAGFVAVFNSDEASWHLVEDHRGKTVYDVASGDALFISELGSLPENVTWLSPEGEYQKWNGTAWVKDTEAEKLFRIREAEETKNSLMQVASEHIAPLQDAVDLEIATEEETLLLEAWKKYRVLLNRVDTSTAPDIEWPALPYGKIYKFYN >CP029164|3221809:3273222|3240391_3240604_+|AWH70820.1|DBSCAN-SWA MLDTCRLASYVPEGKEKQAMKQQKAMLIALIVICLTVIVTALVTRKDLCEVRIRTGQTEVAVFTAYEPEE >CP029164|3221809:3273222|3262246_3262945_+|AWH70847.1|tail|DBSCAN-SWA MQDIPQETHHETTRLTQSALVVLWEIDLTEVGGERYFFCNEQNEKGEPVTWQGRQYQAYPIQGTGFELNGKGSAARPTLTVSNLHGMVTGMAEDLQSLVGGTVVRRKVYARFLDAVNFVNGNSDADPEQEVISRWRIEQCSELSAVSASFVLSTPTETDGAVFPGRIMLANTCTWTYRGDECGYHGPAVADEYDQPTSDITKDKCSKCLNGCKFRNNVGNFGGFLSINKLSQ >CP029164|3221809:3273222|3251972_3252185_+|AWH70837.1|DBSCAN-SWA MNQNDIEAMIQRYTEAEMAVLDGKSVTFNGQQMTMENLSEIRQGRQEWERRLAALITRRRGHPGYRLARF >CP029164|3221809:3273222|3241418_3242468_+|AWH70823.1|DBSCAN-SWA MRVLLRPVLVPELGLVVLKPGRESIQIFHNPRVLVEPEPKSMRNLPSGVVPAVRQPLAEDKTLLPFFSNERVIRAAGGVGALSDWLLRHITSCQWPNGDYHHTETVIHRYGTGAMVLCWHCDNQLRDQTSESLELLAQQNLTAWVIDVIRHAISGTQERELSLAELSWWAVCNQVVDALPEAVSRRSLGLPAEKICSVYRESDIVPGEQTATSILKQRTKNLAPLPYAHQQQKPPQEKTVVSINVDPESPESFMKLPKRRRWVKEKYTRWVKTQPCACCGMPADDPHHLIGHGQGGMGTKAHDLFVLPLCRKHHNELHTDTVAFEEKYGSQLELIFRFIDRALATGVLA >CP029164|3221809:3273222|3247356_3247512_+|AWH72532.1|DBSCAN-SWA MREYPNGEKTHLTVMAAGFPSLTGDHKVIYVAADRHVTSEEILEAAIRLLS >CP029164|3221809:3273222|3233800_3233992_-|AWH70809.1|DBSCAN-SWA MNSAFALVLTVFLVSGEPVDIAVSVHRTMQECMTAATEQKIPGNCYPVDKVIHQDNIEIPAGL >CP029164|3221809:3273222|3233988_3234177_-|AWH70810.1|DBSCAN-SWA MKTLLPNVNTSEDCFEIGVTISNPVFTEDAINKRKQERELLNKICIVSMLARLRLMPKGCAQ >CP029164|3221809:3273222|3240029_3240233_+|AWH70819.1|DBSCAN-SWA MSPISTEKIRETIPNSCIESLTRQPHIHQDWCTSNTGGCGAGSQMCAGYYGCNGDLLSCYCSYGSPF >CP029164|3221809:3273222|3244878_3245094_+|AWH70826.1|lysis|DBSCAN-SWA MKSMDKLTTGVAYGTSAGNAGFWALQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDRRKAARGE >CP029164|3221809:3273222|3240820_3241072_+|AWH70821.1|DBSCAN-SWA MMNIEELRKIFCEDGLYAVCVENGNLVSHYRIMCLRKNGAALINFVDGRVTDGFILREGEFVTSLQALKEIGIKAGFSAFAEE >CP029164|3221809:3273222|3258541_3258871_+|AWH70844.1|tail|DBSCAN-SWA MQFVMRLAREFRRADWRRMLSEMSATELGEWGDYFRMQSFSDVWMDAQFASLKALIVRMVSGSSDAAVADFSLLPEENGIPERTDEELMHLGEGISGGVRYGPDSQPGH >CP029164|3221809:3273222|3225560_3226121_-|AWH70799.1|DBSCAN-SWA MPSAHSVKLRPLEREDLRYVHQLDNNASVMRYWFEEPYEAFVELSDLYDKHIHDQSERRFVVECDGEKAGLVELVEINHVHRRAEFQIIISPEYQGKGLATRAAKLAMDYGFTVLNLYKLYLIVDKENEKAIHIYRKLGFTVEGELMHEFFINGQYRNAIRMCIFQHQYLAEHKTPGQTLLKPTAQ >CP029164|3221809:3273222|3258146_3258533_+|AWH72533.1|tail|DBSCAN-SWA MFLKQGTFNYEKQSVVLSELSGLQRIEYLAFVQQRTAKFDAEEGELPEAERQIAFLRMGMDINAWLVSRSLWNAEQSQDVETLCASVITTWSYDALGAGAEMVLSLSGMGAIENAGDLEHEVLTPEKS >CP029164|3221809:3273222|3268446_3271407_+|AWH70852.1|DBSCAN-SWA MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDAGTSAREAATHATDAADSARAASTSAGQAASSAQSASSSAGTASAKATEAEKSAAAAESSKSAAATSAGAAKTSETNASASQQSAATSASTATTKASEAATSARDASASKEAAKSSETNAASSASSAASSATAAANSAKVAKTSETNARSSETAAGQSASAAAGSKTAAALSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAASSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKGSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSESQAATPKAVKAAYELANGKYTAQDATTAQKGIVQLSNATNSTSEMLAATPKSVKAAYDLANGKYTAQDATTAQKGIVQLSSATNSASETLAATPKAVKAANDNANGRVPSARKVNGKALSADITLTPKDIGTLNSTTMSFSGGAGWFKLATVTMPQASSVVSITLIGGAGFNVGSPQQAGISELVLRAGNGNPKGITGALWQRTSTGFTNFAWVNTSGDTYDIYVAIGNYATGVNIQWDYTSNASVTIHTSPAYSANKPEGLTDGTVYSLYTPSEQFYPPGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSLSGSTNAAGNHSHRDGRRFNPSVFKDTYQYGYTSSGQNTWGVQGSVGMSTGWLANTSTDGNHSHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA >CP029164|3221809:3273222|3272988_3273222_-|AWH70855.1|DBSCAN-SWA MKSKDTLKWFPAQLPEVRIILGDAVVEVAKQGRPINTRTLLDYIEGNIKKKSWLDNKELLQTAISVLKDNKNLNGKM >CP029164|3221809:3273222|3231235_3233707_-|AWH70808.1|DBSCAN-SWA MSKVFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIVCEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDPMNVTFDKLAPEVQNAVMVKFDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHKCIPGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENSTGKISPVIAAIHREYKQTWKTLDDELAYALWPGDVDAGNIDGSIHRWAKKEVIDNDREDWKRISASMRKQPDALRYDRQTIFGLVRERPIDIHKDPIALNKYICEYLTTKGVFENEETDLGTVDVLQSSETQTDAVETEVSDIPKNETAPEAEPSVEREGPFYFLFADKDGEKYGRANKLSGLDKALAAGATEITKEEYFARKNGTYTGLPQNVDTAEDSEQPEPIKVTADEVNKIMQAANISQPDADKLLAASRGEFVEGISDPNDPKWVKGIQTRDSVNQNQHESERNYQKAEQNSTNALQNEPETKQPEPVAQQEVEKVCTACGQTGGGNCPDCGAVMGDATYQETFDEEYQVEVQEDDPEEMEGAEHPHKENTGGNQHHNSDNETGETADHSIKVNGHHEITSTSRTCDHLMIDLETMGKNPDAPIISIGAIFFDPQTGDMGPEFSKTIDLETAGGVIDRDTIKWWLKQSREAQSAIMTDEIPLDDALLQLREFIDENSSEFFVQVWGNGANFDNTILRRSYERQGIPCPWRYYNDRDVRTIVELGKAIDFDARTAIPFEGERHNALDDARYQAKYVSVIWQKLIPSQADS >CP029164|3221809:3273222|3245098_3245410_+|AWH70827.1|DBSCAN-SWA MTQDYELVVKGVRNFENKVTVTVALQDKERFDGEIFDLDVAMDRVEGAALEFYEAAARRSVRQVFLEVAEKLSEKVESYLQHQYSFKIENPANKHERPHHKYL >CP029164|3221809:3273222|3236034_3236262_+|AWH70814.1|DBSCAN-SWA MLKVDAITFFGSKTKLANAAGVRLASVAAWGILVPEGRAMRLQEASGGELQYDPKVYDEYRKTKRAGRLNNENHS >CP029164|3221809:3273222|3243909_3244125_+|AWH70825.1|DBSCAN-SWA MSNKMTGLVKWFNADKGFGFISPVDGSKDVFVHFSAIQNDNYRTLFEGQKVTFSIESGAKGPAAVNVIITD >CP029164|3221809:3273222|3252184_3253693_+|AWH70838.1|portal|DBSCAN-SWA MAILDDVIGVFSPGWKAARLRSRAVIQAYEAVKTTRTHKARRENRTADQLSQYGAVSLREQARYLDNNHDLVIGVFDKLEERVVGKNGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGRINSLTPSAGVHFWLEALEPDFIPMTSDESNRLNQGVFVDDWGRPEKYLVYKSRPVSGRQMETKEVDAERMLHLKFVRRLHQMRGTSLLSGVLIRLSALKEYEDSELTAARIAAALGMYIRKGDGQSYEADGNGSKDKERELTIQPGIIYDDLKPGEEIGMVKSDRPNPNLETFRNGQLRAVAAGSRLSFSSTARNYNGTYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAWKIQIRGGAATESDWVRAGGRNPDDVKRRRKAEIDENRKLDLVFDTDPASDKGGSSAATKRHEPQHTDDQSEE >CP029164|3221809:3273222|3226994_3227183_+|AWH70802.1|DBSCAN-SWA MSADKRYSISSFVNHSRRKYTPFFVITPAFFDLHTCMVVAQLRRFHASRQAMQGIEHEDRKG >CP029164|3221809:3273222|3247020_3247209_+|AWH70831.1|DBSCAN-SWA MTNHIHFRCPCCHGSQYRTSSFDVSDMNPFGAKCIFCKSMMITFDNISQYLNASRLSLDLKK >CP029164|3221809:3273222|3229466_3229595_+|AWH70805.1|DBSCAN-SWA MTIEKHERSTKDLVKAAVSGWLGTALEFMDFKSHTCQLFDKY >CP029164|3221809:3273222|3247683_3247857_+|AWH70832.1|DBSCAN-SWA MNIEDLKTKAEADISEYITKKIIELKKKTGKEVTSIQFTAREKMTGLESYDVKINLI >CP029164|3221809:3273222|3235225_3235378_-|AWH70812.1|DBSCAN-SWA MDTIDLGNSESLVCGVFPNQDGTFTAMTYTKSKTFKTESGARCWLARNTD >CP029164|3221809:3273222|3261907_3262237_+|AWH70846.1|tail|DBSCAN-SWA METFHWKVRPDMNVVSEPKVVTVKLGDGYEQRRVAGLNNQLSTYSVTIRVRKCEHPSLKAFLERHGGVRAFQWTPPYDWKPIRVVCRKWSASVGALWVTITADFEQVVA >CP029164|3221809:3273222|3238688_3239714_+|AWH70818.1|DBSCAN-SWA MHILSVSKKEVGRAGDRLVSQFGTGEKYKEEDVQILHEWRMLHLYPLSKIQFYMEREAISLNKNALLSSRIKRMPSIVTKLSRFPDMKLNKMQDLGGCRAILNNLDQVYDLVNKIKSSKFSHELVRMDDYMIDVKDSGYRSFHMVYSFQNKKFPSLNGLRIEMQIRTAIQHSWATAVEMVGLFRKESLKSGFGDARWLRFFELVSELFYKLEYEKEPSGSYIKISEELSYLSVELNVFDILAAYNAVVSHIEGSKKYDKGLCIIVVDTIKRNINIKSFENHNHAKAAEAYVESEKYCAENKGCEVAMVSVSSISELKNAYPAYFLDTKTFLNYLSRYVFIK >CP029164|3221809:3273222|3226155_3226497_-|AWH70800.1|DBSCAN-SWA MKITLSKRIGLLAFLLPCALALSTTVHAENNKLVIESGDSAQSRQRAAMEKEQWNDTRNLRQKVNKRTEKEWDKADAAFDNRDKCEQSANINAYWEPNTLRCLDRRTGRVITP >CP029164|3221809:3273222|3255706_3256075_+|AWH70840.1|DBSCAN-SWA MYLKSAPERGCAETVMAKNFVEEGKTVAIVASAAISSGDLVQVGDVFAVALTDIPQGETGDGMTEGVFILPKLKTDDMKTGKKVYLKSGKVQLTNSGSDPLVGVVWADAGTSAEEVPVKLNV >CP029164|3221809:3273222|3263591_3264239_+|AWH70849.1|tail|DBSCAN-SWA MAATHTLPLASPGMARICLYGDLQRFGCRIDLRVKTGAEAIRALSTQLPAFRQKLNDGWYQVRIAGRDAGETELSARLNEPLANGAVIHIVPRLAGAKSGGVFQVVLGAALIAVAWWNPVGWLGAAAVSGMYAAGASMILGGVAQMLAPKARTPTATSTDNGKQNTYFSSLDNMVAQGNVLPVLYGEMRVGSRVVSQEISTADEGDGGQVVVIGR >CP029164|3221809:3273222|3221809_3224236_-|AWH70797.1|DBSCAN-SWA MSKNEQMVGISRRTLVKSTAIGSLALAAGGFSLPFTLRSAAAAVQQASEKVVWGACSVNCGSRCALRLHVKDNEVTWVETDNTGSDEYGNHQVRACLRGRSIRRRINHPDRLNYPMKRVGKRGEGKFERISWDEALDTIASSLKKTVEQYGNEAVYIQYSSGIVGGNMTRSSPSASAVKRLMNCYGGSLNQYGSYSTAQISCAMPYTYGSNDGNSTTDIENSKLVVMFGNNPAETRMSGGGITYLLEKAREKSNAKMIVIDPRYTDTAAGREDEWLPIRPGTDAALVAGIAWVLINENLVDQPFLDKYCVGYDEKTLPADAPKNGHYKAYILGEGDDKTAKTPQWASQITGIPVDRIIKLAREIGTAKPAYICQGWGPQRQANGELTARAIAMLPILTGNVGISGGNSGARESTYTITIERLPVLDNPVKTSISCFSWTDAIDHGPQMTAIRDGVRGKDKLDVPIKFIWNYAGNTLVNQHSDINKTHEILQDESKCEMIVVIENFMTSSAKYADILLPDLMTVEQEDIIPNDYAGNMGYLIFLQPVTSEKFERKPIYWILSEVAKRLGPDVYQKFTEGRTQEQWLQHLYAKMLAKDPALPSYDELKKMGIYKRKDPNGHFVAYKAFRDDPEANPLKTPSGKIEIYSSKLAEIARTWELEKDEVISPLPVYASTFEGWDSPERSTFPLQLFGFHYKSRTHSTYGNIDVLKAACRQEVWINPIDAQKRGIANGDMVRVYNHRGEVRLPAKVTPRILPGVSAMGQGAWHEANMSGDKIDHGGCVNTLTTLRPSPLAKGNPQHTNLVEIEKI >CP029164|3221809:3273222|3236245_3236767_+|AWH70815.1|DBSCAN-SWA MKITPEQAREALDAWICRPGMTQEQATILITEAFWALKERPNIDVQRVTYEGGAVDQRALSVNRVKIFERWKAIDTRDKREKFTALVPAIMEAIRISDFRLYREISDGKSITYMIAGLNKEYGDVVESGLLFADPAVVDRETDELIEKAIAFKLAYRQQYQQKAGWNYESSFC >CP029164|3221809:3273222|3264299_3267713_+|AWH70850.1|DBSCAN-SWA MGKGSSKGHTPREAKDNLKSTQLLSVIDAISEGPVEGPVDGLKSVLLNSTPVLDSEGNTNIAGVTVVFRAGEQEQTPPEGFESSGSETVLGTEVKYDTPITRTITSANIDRLRFTFGVQALVETTSKGDRNPSEVRLLVQIQRNGGWVTEKDITIKGKTTSQYLASVVVGNLPPRPFNIRMRRMTPDSTTDQLQNKTLWSSYTEIIDVKQCYPNTALVGVQVDSEQFGSQQVSRNYHLRGRILQVPSNYNPQTRQYSGIWDGTFKPAYSNNMAWCLWDMLTHPRYGMGKRLGAADVDKWALYVIGQHCEQSVPDGFGGTEPRITCNAYLTTQRKAWDVLSDFCSAMRCMPVWNGQTLTFVQDRPSDKVWTYNRSNVVMPDDGAPFRYSFSALKDRHNAVEVNWIDPNNGWETATELVEDTQAIARYGRNVTKMDAFGCTSRGQAHRAGLWLIKTELLETQTVDFSVGAEGLRHVPGDVIEICDDDYAGISIGGRVLAVNSQTRTLTLDREITLPSSGTTLISLVDGSGNPVSVEVQSVTDGVKVKVSRVPDGVAGYSVWGLKLPTLRQRLFRCVSIRENDDGTYAITAVQHVPEKEAIVDNGAHFDGDQSGTVNGVTPPAVQHLTAEVTADSGEYQVLARWDTPKVVKGVSFMLRLTVAADDGSERLVSTARTTETTYRFTQLALGNYRLTVRAVNAWGQQGDPASVSFRIAAPAAPSQIELTPGYFQITATPHLAVYDPTVQFEFWFSETRITDIRQVETTARYLGTGLYWIAASINIKPGHDYYFYIRSVNTVGKSAFVEAVGRASDDAEGYLDFFKGKITESHLGKELLEKVELTEDNASKLEEFSKEWQDANDKWNAMWGVKIEQTKDGKHYVAGIGLSMEDTEEGKLSQFLVAANRIAFIDPANGNETPMFVAQGNQIFMNDVFLKRLTAPTITSGGNPPAFSLTSDGKLTAKNADISGSVNANAGTLNNVTVNENCTIKGMLEATQVRGDFVKAVSKSFPKQAGTWGNTETPNGTVTVTISDDHNFDRQIIIPPIIFNGIAYSDPGSGNNPGGTRYTGYGFEVRKNGVLIASRETKGAIPGSYSAVIDMPSGRGSVTLEFKVFHKGNQWAGNITDCTVIVTKKAASGISIR >CP029164|3221809:3273222|3267782_3268382_+|AWH70851.1|DBSCAN-SWA MRKVCAAILSAAICLAVSGAPAWASEHQSTLSAGYLHASTNVPGSDDLNGINVKYRYEFTDTLGMVTSFSYAGDKNRQLTRYSDTRWHEDSVRNRWFSVMAGPSVRVNEWFSAYAMAGVAYSRVSTFSGDYLRVTDNKGKTHDVLTGSDDGRHSNTSLAWWAGVQFNPTESVAIDIAYEGSGSGDWRTDGFIVGVGYKF >CP029164|3221809:3273222|3226631_3226958_+|AWH70801.1|DBSCAN-SWA MIKTTLLFFATALCEIIGCFLPWLWLKRNASIWLLLPAGISLALFVWLLTLHPAASGRVYAAYGGVYVCTALMWLRVVDGVKLSLYDWTGALIALCGMLIIVAGWGRT >CP029164|3221809:3273222|3234592_3234880_+|AWH70811.1|DBSCAN-SWA MRFRVVCLLLTTSGEVVSFLFRITKKEIYMTKEEFVSYIFDKTVEMYAATYGSCNPLNKPEGKDDFDKIYRFLEDRYIKRLEDAGIKSPVKSPLS >CP029164|3221809:3273222|3257342_3258086_+|AWH70843.1|tail|DBSCAN-SWA MTTPNPLAKTKGAGTTFWMYTGKGDAFANPLSDTDWLRLAMVKDLQPGEMTADAEDDTYLDDEDADWKTTTQGQKSVGDTSATLAWRPGDSGQKKLVQLFDSGEVCAFRIKYPNGTVDVFRGWLSSLGKTIASKDVMTRTVKISGVGRPYLAEEGTETVSVTGLTVAPASASVKVGATTTLTFTVKPDGASDKAISVHSTDPQTATVTLNGLVATVKGVKQGSVSIVGMTSDGDFVAVAAVAVSAAG >CP029164|3221809:3273222|3230911_3231163_-|AWH70807.1|DBSCAN-SWA MSEVIMIVSPGKWVSEEQLIALKGIKKGTLKKAREKSFMEGREYKHVAHDSMPWDNSPCFYNLEEIDRWIERQASARPRRHLT >CP029164|3221809:3273222|3224434_3224740_-|AWH70798.1|DBSCAN-SWA MKRSTCCAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQPDSQVVGHCANDTHKILYTRTTSGNVSAPAQSTQDGAPAEPQ >CP029164|3221809:3273222|3245936_3246434_+|AWH70829.1|DBSCAN-SWA MNQIFMVIFLVLSGFIVGNVWSDRGWQKKWAERDAAALSQEVNAQFAARIIEQGRTIARDEAVKDAQQKSAEISARAAYLSDSVNQLRAEAKKYAIRLDAAKHTADLAAAVRGKTTKTAEGMLTNMLGDIAAEAQLYAEIADERYIAGVTCQQIYESLRDKKHQM >CP029164|3221809:3273222|3228389_3229409_+|AWH70804.1|DBSCAN-SWA MKSILIEKPNQLAIIEREIPTPSAGEVRVKVKLAGICGSDSHIYRGHNPFAKYPRVIGHEFFGVIDAVGEGVESARVGERVAVDPVVSCGHCYPCSIGKPNVCTTLAVLGVHADGGFSEYAVVPAKNAWKIPEAVADQYAVMIEPFTIAANVTGHGQPTENDTVLVYGAGPIGLTIVQVLKGVYNVKNVIVADRIDERLEKAKESGADWAINNSQTPLGESFAEKGIKPTLIIDAACHPSILKEAVTLASPAARIVLMGFSSEPSEVIQQGITGKELSIFSSRLNANKFPVVIDWLSKGLIKPEKLITHTFDFQHVADAISLFEQDQKHCCKVLLTFSE >CP029164|3221809:3273222|3227163_3228378_+|AWH70803.1|DBSCAN-SWA MKIVKAEVFVTCPGRNFVTLKITTEDGITGLGDATLNGRELSVASYLQDHLCPQLIGRDAHRIEDIWQFFYKGAYWRRGPVTMSAISAVDMALWDIKAKAANMPLYQLLGGASREGVMVYCHTTGHSIDEALDDYARHQELGFKAIRVQCGIPGMKTTYGMSKGKGLAYEPATKGQWPEEQLWSTEKYLDFMPKLFDAVRNKFGFNEHLLHDMHHRLTPIEAARFGKSIEDYRMFWMEDPTPAENQECFRLIRQHTVTPIAVGEVFNSIWDCKQLIEEQLIDYIRTTLTHAGGITGMRRIADFASLYQVRTGSHGPSDLSPVCMAAALHFDLWVPNFGVQEYMGYSEQMLEVFPHNWTFDNGYMHPGDKPGLGIEFDEKLAAKYPYEPAYLPVARLEDGTLWNW >CP029164|3221809:3273222|3246797_3247010_+|AWH70830.1|DBSCAN-SWA MSNKMTGLVKWFNPEKGFGFITPKDGSKDVFVHFSAIQSNDFKTLTENQEVEFGIENGPKGPAAVHVVAL >CP029164|3221809:3273222|3235546_3235954_-|AWH70813.1|DBSCAN-SWA MDTRTLGQRVLARRKELRLTQREAARLAGVAHVTISQWERDETQPVGKRLFALADALKCSPTWLMFGDEDKAPVPAQELHVETELTPNHKELIELFDALPSSEQEALLSEMRARVENFNKLFEEMLKARKNKSIK >CP029164|3221809:3273222|3256929_3257331_+|AWH72534.1|tail|DBSCAN-SWA MNRHTQIRQAVLARLREQCGDSATFFDGLPAFIDAQELPAVAVWLSDAQYTGKMTDEDDWQAVLHIAVFIRAQAPDSELDMWMESTIFPALNDIPALSGLIDTLIPLGFNYQRDNEMATWAMAEITYQITYTN >CP029164|3221809:3273222|3256067_3256343_+|AWH70841.1|DBSCAN-SWA MSDPFSRLAARMDAITVRKMGKTASINDADMTVIPGETLAELNALSGPAVSLVVFSSGYRPRRGDRVVYDGQQWTVTRHERFNGKPMIFIE >CP029164|3221809:3273222|3241138_3241417_+|AWH70822.1|DBSCAN-SWA MARNVKYYNSDNSPVLACTHGRYSHAFKSEWFQHPPCTAEQAEWLIHSYCRRGFEVKKALSLDYRHWIISVRLPYSERPPRASRTFQQRIWR >CP029164|3221809:3273222|3236693_3237713_+|AWH70816.1|DBSCAN-SWA MLSSLRIDSNTNKKLDGIMSLLFAERPLVINTQLAMKIGLNEAIVLQQLHYWLRDTNSGMECDGVRWIYNTTEQWLEQFPFWSESTLKRAFASLKTLGLLRCEKLNKSKRDMTNFYTINYGSELLDGGKLNESIGSKCAAPSGQNDTMEEVKMKRSIGSKRPNVIGSKWPDDLTENTTEITTENKNTFRPEASQPDPQTTEQDFLTRNPDAVVFSVKKRQWGSREDLACAQWIWGRIVNLYEQAASDDGEIMRPKEPNWTAWANDVRTMRMLDGRSHRQICEMFGRVQRDPFWVKNIMSPSKLREKWDELVIRLGRSPVQRCVNHISEPDTEIPPGFRG >CP029164|3221809:3273222|3248008_3248419_-|AWH70833.1|DBSCAN-SWA MAQAAIFKEIFDQVRKDLNCELFYSELKRHNVSLYIYYLATDNIHIVLENDNIVLVKGLKKVVNVKFSRNKHLIETSYNKLKSKEITFQQYRENLAKAGVFRWVTNIQEHQRYYYAFDNSLLFTESIQKTTQILPR >CP029164|3221809:3273222|3229596_3230892_-|AWH70806.1|DBSCAN-SWA MREVEMKYPTGVENHGGKLRIWFVYKGVRVRENLGVPDTAKNRRVAGELRSSVCYAIKTGVFDYAKQFPSSRNLEKFGEARQDLTIKELAEKFLALKETEVAKTSLNTYRAVIKNILSIIGEKNLASSINKEKLLEVRKELLTGYQIPKSNYIVTQPGRSAVTVNNYMTNLNAVFQFGVDNGYLPDNPFKGISPLKESRTIPDPLSREEFIRLIDACRNQQAKNLWCVSVYTGVRPGELCALGWEDIDLKNGTMMIRRNLAKDRFTVPKTQAGTNRVIHLIKPAIDALRSQMTLTRLSKEHIIDVHLREYGRTEKQKCTFVFQPEVSARVKNYGDHFTVDSIRQMWDAAIKRAGLRHRKSYQSRHTYACWSLTAGANPAFIANQMGHADAQMVFQVYGKWMSENNNAQVALLNTQLSEFAPTMPHNEAMKN >CP029164|3221809:3273222|3258842_3261908_+|AWH70845.1|tail|DBSCAN-SWA MDQIANLVIDLGIDAAEFKNEIPRIKNLLNGAASDAERSSARMQRFMERQTQAARQTMQAASSAATAASVHAQTVEKSAQAHERMAREVEQTRQRMEALSQKMREEQAQAMALAEAQDKAAAAFYRQIDSVKQASAGLQELQRIQQQIRQARNSGGIGQQDYLALISEVTAKTRVLTQAEAEATRQKVAFIRQLKEQATRQNLSSSELLRAKAAQLGVSSAAEVYIRKMEQAGKATHSLGLKSAAARQEIGVLIGELARGNLGALRGSGITLANRAGWIDTLMSPKGMMLGGVIGGIAAAVYGLGKAWYDGQKEGEEFNRQLSLTGHYAGVTAGQLWTLSRAISGNGITQHAAAGALAQVVGSGAFRGNDIGMVARAAAQMERSVGQSVSDTINQFKRLKDDPVNAAKALDNELHFLTATQLEQIRVLGEQGRSSDAARIAMSALAEETGRRTADIDNNLNALGSTLKYLSDLWSSFWDAAMNIGREDSLDEQISALQEKVSRAKRLPWTASSSQVEYDQQRLNELQEKKRQKDLQDAKEQAERNYQEQQKRRNAENAALNRMNETEAARHQREIARINAMQYADQAVRDAAIQRENERYEKALASGKKKTREPRNDEATRLLLQYSQQQAQVEGQIAAARQSAGIATERMTEAHKQLLALQQRISDLDGKKLTADEKSVLARKDELIQALTLLDVKQQELQKQTALNELKKKTIQLTSQLAEEERAQRQQHDLDIATVGMGDQQRQRYQVQLSLRQKYQQQLEQLRRDSEQKGTYNTDDYRKAEQALTESLNRQLNENRRYWQQLEVVQGNWKNGVLRAFQDFTVDADNTAGTAEQVFSSAFSNMGNGLATFVTTGKLNFKSFTSSVLSDMAKILAQATMMKSIKGIGSVLGFDLSSLSLNANGGIYQSADLSRYSGTVVNRPTFFAFAKGAGVMGEAGPEAILPLRRDADGKLGVVADIGGSGMAMFAPQYNIEINNDGTNGQIGPAALKVVYDLGKKAAADFMQQQARDGGRLSGAYR >CP029164|3221809:3273222|3262950_3263694_+|AWH70848.1|tail|DBSCAN-SWA MTETESAILAHARRCAPAESCGFVVRTPEGERYFPCVNISGEPEAYFRMSPEDWLRAEMQGEIVALVHSHPGGLPWLSEADRRLQVQSDLPWWLVCRGEIHKFRCVPYLTGRRFEHGVMDCYTLFRDAYHLAGIEMPDFHREDDWWRHGQNLYLDNLEATGLYQVPLSAAQPGDVLLCCFGSSVPNHAAIYCGDGELLHHIPEQLSKRERYTDKWQRRTHSLWRHRAWHASAFTGIYNDLAAASTFV >CP029164|3221809:3273222|3234848_3235214_-|AWH72531.1|DBSCAN-SWA MEFKDLPSDVQKTAAHTLHSVLREIGKDIASEPAKDLARKIKTAFVELYNVGTDSETVETKTVSSPIFSLEPGVLSGEICTEISSELLPVIREAICRRGLDGSYDHDVLQVLRTMVTSLGI >CP029164|3221809:3273222|3256354_3256933_+|AWH70842.1|tail|DBSCAN-SWA MKGLENAIRNLNSLDTRMVPQASAWAINRVAQKAVSVATRQVAGNTVAGDNQVKGIPLKLVRQRVRVFKASPSGKMTARIRVNRGNLPAIKLGTARVRLARRGGKLQYRGSVLKVGKYLFRDAFIQQLANGRWHVMRRIDGKNRYPIDVVKIPLSGPLTQAFEDARDRIIAAEMPKQLGYALKQQLRLWLTR >CP029164|3221809:3273222|3242481_3243234_+|AWH70824.1|DBSCAN-SWA MNLEALPKYYSPKSPKLSDDAPATGTGCLTITDVMAAQGMVQSKAPLGLALFLAKVGVQDPQFAIEGLLNYAMALDNPTLNKLSEEIRLQIIPYLVNFAFADYSRSAASKARCEHCSGTGFYNVLREVVKHYRRGESVIKEEWVKELCQHCHGKGEVSTACRGCKGKGIVLDEKRTRFHGVPVYKICGRCNGNRFSRLPTTLARRHVQKLVPDLTDYQWYKGYADVIDKLVTKCWQEEAYAEAQLRKVTR >CP029164|3221809:3273222|3272079_3272670_-|AWH70854.1|DBSCAN-SWA MSRIYAYCRVSTLEQTTKNQRREIESAGFNIKSQQIIEEQISGSAATSERPGFNQLLDRLKRGDALIVTKLDRLGCNPMDISKTVEQLTKAGIKVHCLALGGVDLTSPPGKMMMQVISAVAEFERDLLLERTHSGILRARGAGKRFGRPPVLNDEQIQRVLEQIKSGINISDIAQEFKTSRQTILRAKAKNQTPDI >CP029164|3221809:3273222|3253541_3255665_+|AWH70839.1|DBSCAN-SWA MMSNVGARPKLMKTANWIWYLIPIRPVIKEAAVPQRNDMSRSTPTTSPKNNSWFRMQAGHQSDADIYIYDEIGFWGVTAKQFISDLNALGDITHINLHINSPGGDVFEGIAIFNALKTHGASITVYVDGVAASMASVIAMVGNPVIMPENTFMMIHKPFGFTGGDAEDMRTYADLLDKVEAVLLPAYAQKTGKTTDEIAAMLADETWMSGAECLAHGFADQVTPAVKAMACIQSKRTEEFKKMPESIRNMITPPRNSAPRVQDNEPEASRTPVQAAAPVVDENSIRAQVLAEQKARVNGINDLFAMFGGRYQTLQAQCLADPECSLEQAREKLLNEMGRESTPSNKNTPAHIYAGNGNFVGDGIRQALMARAGFEKTERDNVYNGMTLREYARMSLTERGIGVSSYNPMQMVGAAFTHSTSDFGNILLDVANKAILQGWEDAPETYEQWTRKGQLSDFKIAHRVGMGGFSALRQVREGAEYKYVTTGDKQATIALATYGELFSITRQAIINDDLNMLTDVPMKLGRAAKSTIADLVYAILTSNPKISTDNVSLFDKAKHANVLESAAMDVASLDKARQLMRVQKEGERHLNIRPAFVLVPTAMESVANQVIRSSSVKGADINAGIINPVKDFATVIAEPRLDDNSQTTFYLAASKGSDTIEVAYLNGVDTPYIDQMEGFSVDGVTTKVRIDAGVAPVDHRGLVKCTA >CP029164|3221809:3273222|3245406_3245940_+|AWH70828.1|DBSCAN-SWA MNTKIRYGLSAAVLALIGAGASAPQILDQFLDEKEGNHTMAYRDGSGIWTICRGATVVDGKTVFPNMKLSKEKCDQVNAIERDKALAWVERNIKVPLTEPQKAGIASFCPYNIGPGKCFPSTFYKRLNAGDRKGACEAIRWWIKDGGRDCRIRSNNCYGQVIRRDQESALTCWGIEQ >CP029164|3221809:3273222|3248619_3248826_+|AWH70834.1|DBSCAN-SWA MNKEQSADELSLDLIRVKNMLNSTISMSYPDVVIACIEHQVSLEAFRAIEAALVKHDKNSKDYSLVVD |
64 | Enterobacteria_phage(36.96%) | portal,terminase,lysis,tail | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_8 |
4030203 : 4084519
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >CP029164|4030203:4084519|DBSCAN-SWA TTTACCCTATAGGTGCTACTGGCCATTCAATATCCGGTGCAGTTGTTGTATTAACACGGTTCAGCAACACCCGATACTTCTTCCAGGCTTCCAGCAACGAGATTTCTTCCTCCGTTGCGATTTCCAGATCTACAGCATCCTGAAGTGGCGCAATATGCTCACTGGCTACCTGCATCAGGCTGTTTTTTGTTTCTTCTGCCTCCCGGATCCGGAACAGTTTTTCTGCTTCCGTATCCTTCACCCAGGCTGTGCCGTTCCACTTCTGAAACTCCCCTTCCGGCGATAACCAGGTAACATTTTCCGCTAACGGACCGAGTTCAGAAATAAATAACGCGTCGCCGGAAGCCACGTCATAGACCGTTTTACCCCGATGGTCTTCAACGAGATGCCACGATGCCTCATCACTGTTGAAAACAGCCACAAAGCCAGCCGGAATATCTGGCGGTGCAATATCGGTACTGTTTGCTGGCAGACCTGTATGAGGCGGAATATATGCGTCACCTTCACCAATAAATTCATTAGTTCCGGCCAGCAGATTATAAATTGTTATGGTCCGTGGTTGTTCACTCATTCTGAATGCCATTATGCAAGCCTCACAATATAGTTAAATGCGATGTTTTTGACGGTGTTTTCCGCGTTACCAGCAGCGTTAACGGTGATGGTGTGTCCATGTGAGCCAATCGCAACCGAGTGCGTATGAGCACCAATACCGACAGTATGTGCGTGTGCACCTGCGCTTGCAGCAGTGCCGGACAGTGAGTGGGTATGAGCACCATCTGATGATGTCTTCCCTGCATTACGAGTCTGGCCACTACCGCTTGTTGTGCTCATAATCCCCGCGCTTGGATTTGAAATCGCGGTATAACCATTAGGGAAAATGCTCGTGTTCGTGCCACCAAATGCACCGGAACTCTTGTGTTGGTGCGCACCGGCACTATTTGCGGTCCCGCTAATACTATGGGTATGCGCCCCGGTGTTATTCGTGGATTTGGTTCCGTAATCAAACGACGATGTGGTTTTCGTCCCCAAATCCGTACTGGATGCGCTGGCGCTGTGGGTATGCGATTTAATGCCGTCCTGTTCCTGAGACAATACGGCCCGACCACTGGCAGGTTTGCCCTTAATCGTCCAGCCACGCATATCAGGGATCACGCCTGACGGATAAGCCGCTGCAAGTTTCGGGTAGGCAGATTTGTCAAAAGCCTGCCCCTGCATCAGGGCATAGCCAGACGGAACGGTATCTGATGGCCACGGGATTGGTGCGCCGACTGGGTAGCTTTCTGGTGGAAGATTTTTCGAGGTATAAACTTCTGCCCAGTCTTCCTCAAAACCATAGCCATCTCTTGAGGAACGGTAGAACAGACCTCCATTTCTGTAATGCGCCTTCATCTGCAGGGTCCGGCAACTTCCGACTCCGGTATAGAAGTTAACCAGAATATAGCTGTCGCCAGAGCGGGTGACATTGTAAGCGCCTGATTCGGCATTCCAGGGAACGCCACCATCCGCATCGGCATATGTATCCGTTGCCCTTCTGGCAAAAGCAGCCACATGCGCGGCGGTTAAAGTAATATCTTTGGAACCATCAAACTCAACACCAGAAACCAGTCTTGGCGTTTGCAGCTTTGTTGCTGTTAATGCATTACCGTTCAGACTTGCGGACAGTTTGGTTCCAATAATCAGTTCGCCGGTTGCGTTATCAATAGCAAACGGTCTTAATGTATTCCAGCCACCATAAACATCACCTTGATTGGTAAGCAGCAGGTAAGTTTTAGCGCCATCATTACGCCATAATGCACCATACTCCCCACCTATCATTCGAATCTGATTACCACCACGCGCTACAATTTCGTCTGTGGCAAAAAGTTTTTTGCACGACAAGTTATCGTTAACGATTAACGAATGAGACTCATAAAAACCACGCCCACTCTTAAAATCAAGGATAACGTCCGCCGCGATACATTCAGTCGCCGGATTTGTTGCCCCAAACTTATAGGTCGTATCATTAACAACGAGATCAGCACCAGGTGCGGATATTGACAGGCCATCTTCGATAAACGCAAAAACAGGGAAAGCAGCGCCATCAACATAGAACACAGAGCGCAAATCATCGCCCTTATTACTCATCATTATTGAGTGGATGGCTCGTTCATTGTTTTGATATTGCCAGAACATTCCATAAGCATAACGCCCCCTGTCAGTCCAGCCACCAGGCATAACAAATCCGTTAAACTCGCAGTTATTCATCGGATCGCCTGCGGTTCGCGTTGCCGTGGTGATAATGACCCTTGATGCCAGTTCGCTTACTGAGCCAGCAGAACGCATAACAACAACAGGGTAATATTTTCCAGATGTTGCACCTGCAGGAGCGTTAACCCGCACATAACGCATACCACGCTTATCAGCAAAGTCTGTTTTACTGACCGCGTTAATGTTGTTCAGGAAGCGTCCCTTATCGGGTATATCAGCGCCGTTCTGGTCTTTCTGCAGACGTTTCTCTGCATTGTCATAGGCTGATTTTACTGCCTTTGGCGTTGCCGCCAGCGTTTCAGACGTACTGTTGGTCGCACTGCTGAGCTGTACTACCCCCTTTTTCGTCGTGCTCGCATCCTCAAGCGCCACGGCGGATGCAATATCCTCTGCCCGTTTTGCCGCTGTCTCGGCGCGCGTTGCCGCGGATTCCGCCGTACTTTTGCTCTGTGCTGCCGCTGCCGCACTGCCAGCTGCCTCTGTCGCCTTCGTGGATGCCGTCGTGGCGCTGCTCTTCGCTGCTGACGCCTGTCTGGTCGCCTCATCTTTTGAAGCAGACGCAGATGATGCCGATGACGCTGCCGAACTGGCGGACGATGCGGCTGCCGTTTTTGAGGATTCTGCACGGGTTTCCGACGCTTTCGCATTCGTTTCGGATGTCTTCGCTGCGGAAGCAGACCTCGCTGCTGCGCTGGCCTGTACAGCGGCTTCGCCAGCCTTCGTTGTGGCTGTTGAAGCGGACGATGCAGCACTTTCTGCCGATTTTCCGGCGGCGGTGGCACTGGCTGAGGCCTGCCCGGCACTTGTTGACGCGGCACTGGCAGACGACGCAGCCGCTGTTTTTGAGCCTGCCGCAGCTGAGGCACTCTGTCCCGCTGCCGTTTCAGAAGCCTTGGCGTTCGTCTCGGACGTTTTTGCCGCCTTCGCAGAATTTGCTGTCGCCGTTGCCGAGGAAGCTGCGCTGCTGGCGCTCGAGGCTGCGCTCGTTTCTGATGATTTTGCCGCCTCTTTTGAAGCCGACGCATCCCGGGCTGAGGTGGCAGCTTCTGACGCTTTCGTGGTCGCGGTGGATGCTGAAGTGGCTGCTGATTGTTGTGACGCTGCCGCATTCGTTTCTGACGTTTTTGCCGCAGCGGCACTGGTGGCCGCCGCGCTTTTTGAGGACTCTGCAGCGGCAGCACTTTTTGATGCTTCAGTGGCCTTTGCTGATGCCGTTCCTGCGCTGGAAGACGCTGACTGAGCCGACGAAGCGGCCTGTCCGGCTGACGTGCTGGCTGCGCGTGCTGAGCCTGCAGCATCAGTCGCATGGGTTGCCGCCTCACGGGCAGATGTGCCGGCATCGCCGGCTGACTTCTTCGCGGCTGCCGTGTTCTGTGCCACCGCGGACGCGTTACGCGCCACCTCTTCCACCATCAGTTCAAAACGGCGCAGAGCCTCAGGACGGGCATCATCCTCTGTCATGGCACCGAGAAAATCATTCAGCGTACCGGGTTGAGAATCTTCATACACGGTAATGGTCCCGGCATGTGATGGCGGGAATCCCTCCACCAACAGACTGACGCTGTACTGACCATACTCGACGTCCATTGTGTAACGCCCGGCTTCATCCGGGTTTTCTGAGGCCACTGTGTTCACCACCACCGTGGTGCTGTTGCGCCTGGCCTTTAGCTGAATGGTGCAGTTTTGTATCGGCTTACCTGCGCCATCTTTCAGTACACCTGAAATCCGTACTGCCATATTCCCCCCACAAAAAAGCCCGCCTGAACCGGCGGGCTGTCATAACACTGTGTTACCTGGCTAATCAGAATTTATAACCGACACCAACGATGAAACTGTTGGTACGCCAGTCACCGCTGCCGGAGCCTTCATAAGCAATATCAATGGCCACGGATTCGGTCGGGTTAAACTGCACGCCAGCTCCCCACGCCAGAGACGTGTTGCTGTGGCGACCGTCATCACTTCCGGTCAGCACGTCGTGCGTTTTCCCCTTGTTGTCAGTTACGCGGAGATAATCCCCGGAGAAAGTCGACACACGGCTGTAAGTCACACCCGCCATCGCATACGCGCTGAACCATTCATTCACGCGCACAGACGGCCCCGCCATCACGCTGAACCAGCGGTTACGAACGGAATCTTCATGCCAGCGGGTATCGCTGTAACGGGTAAGCTGGCGATTCTTGTCTCCTGCATAGCTGAATGACGTCACCATCCCCAGTGTGTCCGTAAACTCATAACGGTATTTCACGTTAATCCCGTTCAGATCATCGCTGCCGGGAACGTTCGTCGAGGCATGAAGATACCCCGCGCTCAGCGTGGACTGATGTTCAGATGCCCATGCAGGCGCACCGGATACGGCCAGACAAATGGCTGCGGACAAAATTGCTGCACAAACTTTACGCATAATTACCTCTCGCTTTTCTGCAATAAAAAAGGCGTCATTCCTGACGCCCTTTATTGGGGTTATAACAATTTCAACGAATACTGATGCCGGAAGCGGCTTTTTTGGTCACAATCACCGTACAGTCGGTGATATTACCTGCCCACTGATTGCCTTTATGGAAAACCTTAAACTCCAGAGTGACGCTTCCCCTGCCACTCGGCATATCAATAACCGCACTGTAGCTACCGGGAATGGCCCCTTTAGTTTCTCTGGATGCGATTAATACACCGTTTTTGCGAACTTCAAAACCATAACCCGTGTATCTTGTACCTCCCGGGTTATTACCACTTCCCGGATCGCTATACGCTATTCCGTTAAAGATAATGGGCGGAATAATGATTTGACGGTCAAAGTTATGATCATCGCTGATGGTGACTGTAACCGTCCCGTTTGGTGTTTCCGTATTACCCCACGTACCAGCCTGTTTCGGGAATGATTTGGATACAGCTTTAACGAAGTCACCTCTGACCTGAGTCGCCTCCAGCATGCCCTTAATCGTACAGTTTTCATTTACCGTGACATTGTTGAGCGTCCCGGCGTTCGCATTCACACTGCCACTGATATCCGCATTTTTAGCGGTCAGCTTTCCGTCCGGTGTCAGGGAAAATGCCGGTGGATTTCCACCGCTGGTAATGGTGGGGGCCGTCAGGCGTTTCAGGAACACGTCGTTCATGAATATCTGATCGCCCTGACCAACAAACATCGGTTTTGTGTTGCCATTCGCAGGATTAACCATCGCAATCCTGTCCGCCGCCAGCAGCACCTGACTCTGCATGCCGTCAGGGGTGTTCTCAATACCGGCACCAATACCCGCGATATAAAGGCGTCCGTCCTGCATCTGCTGCAGCTTCACAGCCCACATGCTGTTCAGGTTATTATTTGTATCAACCTGAACTTTCTGTATCTGCTGGATTGCCGCACTCTGGTCTTCCAGTTTTTTATTGACGGTCTGCGTGATTTCATTGCTGACATTCGTAATGGACGTCCTGATTTCAGCCAGGTCCGGCGCAAGCTGACCGTTATCAATCTGCGTCCACAGCTCCTGGGCCAGATGTGTTTTCCCGATTTCTCCTTTGAAAAAATCCAGGTAACCTTCCGCATCATCGCTCGCCCGACCGACAGCCTCCACGAATGCCGATTTGCCAACAGTGTTCACACTGCGAACGTAAAAATAATAATCATGACCCGGCTTGATATTGATACTGGCAGCTATCCAGTACAGCGCCGTGCCAAGATAGCGGGCTGTAGTTTCAACCTGCCTGATATCGGTAATCCGCTTTTCCGAGAACCAGAACTCAAACTGTACCGTCGGGTCATAAACGGCAAGATGCGGCGTTGCGGTTATCTGAAAATAGCCCGGCGTCAGTTCAATCCGCGACGGCGTCGCCGGTGCGGCAATCCGGAACAATACCGACGCCGGATCGCCCTGCTGTCCCCACGCATTTACCGCCCGGACTGTCAGCCTGTAGTTTCCCAGCGCCAGCTGCCTGAAGCGGTATGTGGTTTCCGTCGTCCGGGCTGTGCTGACCAGCCGCTCACTGCCATCGTCCGCTGTCACGGTCAGGCGAAGCATAAAGCTCACCCCCTTCACCACCTTCGGTGTGTCCCAGCGCGCCAGCACCTGATATTCCCCGCTGTCTGCGGAGACTTCGGCTGTCAGGTGCTGTACCGCTGGCGGCGTGACACCATTCACCGTGCCGCTCAAATCGCCGTCAAAGTGCGCCCCGTTATCCACGATGGCCTCTTTTTCCGGTACATGCTGCACGGCAGTGATGGCATACGTGCCGTCATCGTTCTCACGGATACTCACACAGCGGAACAGGCGCTGGCGCAACGTCGGCAACTTCAGCCCCCATACGCTGTATCCGGCAACGCCGTCAGGAACACGGCTCACTTTTACCTTCACGCCGTCGGTGACGGACTGGACCTCCACGCTGACCGGATTGCCACTTCCGTCAACCAGGCTTATCAGCGTGGTGCCGGAAGATGGCAGCGTGATTTCACGGTCGAGCGTCAGCGTCCGGGTCTGGCTGTTCACCGCCAGCACGCGCCCGCCGGTGCTGATACCGGCATAGTCATCATCGCAGATTTCAATGACATCGCCCGGTACATGGCGAAGCCCTTCAGCACCCACGCTGAAGTCCACGGTCTGCGTTTCCAGCAGCTCCGTTTTAATCAGCCACAGCCCGGCTCGGTGTGCCTGCCCCCGGCTGGTACAGCCAAAGGCATCCATCTTCGTGACGTTACGACCATAACGGGCAATGGCCTGCGTGTCCTCCACAAGCTCTGTCGCCGTCTCCCAGCCGTTATCCGGGTCAATCCAGTTCACCTCAACGGCATTATGGCGGTCCTTGAGGGCGCTGAAGCTGTAGCGGAACGGCGCGCCATCATCCGGCATCACCACATTACTGCGGTTATAGGTCCACACCTTATCCGACGGTCGGTCCTGCACGAACGTCAGCGTCTGCCCGTTCCATACCGGCATACAGCGCATCGCCGAGCAGAAATCACTGAGCACATCCCACGCCTTGCGCTGTGTGGTCAGGTACGCATTACAGGTGATGCGCGGCTCCGTGCCGCCAAAGCCGTCCGGCACTGACTGGTCGCAGTACTGGCCGATGACATACAGCGCCCATTTGTCCACATCCGCCGCACCAAGACGTTTCCCCATGCCGTAGCGTGGGTGGGTCAGCATATCCCACAGACACCAGGCCATGTTATTGCTGTATGCTGGCTTAAACGTTCCGTCCCAGATACCGCTGTATTGCCGCGTCTGCGGGTTATAATTCGACGGCACCTGCAGAATACGCCCGCGCAGATGATAATTACGGCTCACCTGCTGGCTGCCGAACTGCTCCGAGTCCACCTGCACACCGACCAGTGCCGTGTTCGGGTAGCACTGTTTCACATCGATGATTTCGGTGTATGACGACCAGAGCGTTTTGTTCTGCAGCTGGTCTGTGGTGCTGTCCGGCGTCATCCTGCGCATCCGTATACTGAACGGGCGCGGCGGCAGGTTATCCACCACCACCGAGGCCAGATACTGCGAGGTGGTTTTGCCCTTAATGGTGATGTCTTTTTCCGTCACCCAGCCACCGTTACGCTGTATCTGAACCAGCAGGCGGACTTCCGACGGATTCCTGTCACCCTTTGAGGTGGTTTCCACCAGTGCCTGTACACCGAAGGTAAAGCGCAGACGGTCGATGTTTGCCGACGTGATGGTGCGGGTGATCGGCGTGTCATATTTCACTTCCGTACCCAGCACCGTCTCGGAGCCGGAGGATTCAAATCCCTCCGGCGGTGACTGCTCCTGCTCACCGGCCCGGAACACCACCGTGACGCCGGATATATTGGTATTCCCCTCACTGTCCAGCACCGGCGTACTGTTCAGCAGCACGCTTTTTAATCCATCCACCGGACCTTCAACCGGCCCTTCGCTGATGGCATCGATCACACTCAGCAGCTGCGTGGACTTCAGGTTGTCCTTCGCTTCGCGCGGGGTATGCCCCTTACTGCTGCCTTTACCCATTCGTCATGCTCCATAAACGATAAAACCGCCCGGAGGCGGTTTCACATAAAACGTTTTTCATCAGCGACCAATCACCACAACCTGACCACCATCACCTTCATCTGCTGTGCTGATCTCCTGAGATACCACACGTGATCCCACGCGCATTTCACCGTACAGAACGGGCAGAACATTGCCCTGGGCAACCATATTATCCAGTGAAGAAAAATAGGTGTTCTGTTTGCCGTTATCCGTTGTCTGTGTGCTGGGGGTTTTGGGTTTAGGGGCCAGCATCTGTGCAACACCGCCAAGCGTCATACTGGCACCGAGAGAAAACAGCAGATTACTCGCCATAATTCCTACCCCCGGCATCCATATAGCAACCGCCATAACAGCCGCCCCCAGCACAGCCTGAAACACACCGCCACTTTTGGCTCCTGCCAGACGCGGCACGATATGGATCACAGCACCATTTGCCAGCGGTTCATTAAGACGGGCTGATAATTCCGTTTCACCTGTATCACGCCCGGCAATCCGTACCTGATACCAGCCGTCGCTCAGTTTCTGACGAAACGCCGGGAGCTGTGTGGCCAGCGCCCGGATGGCTTCAGCCCCCGTTTTCACACGAAGGTCGATGCGGCGACCAAATCGTTGTAAATCCCCGTAAAGGCAGATGCGCGCCATGCCCGGTGACGCCAGAGGGAGTGTGTGCGTCGCTGCCATTTGTCGGTATACCTCTCTCGTTTGCTCAGTTGTTCAGGAATATGGTGCAGCAGCTCGCCATCACCACAGTAAATGGCGGCATGATTCGGCACCGATGAACCAAAACAGCACAGCAGCACATCGCCCGGTTGTGCTGATGACAACGGCACCTGATACAACCCTGTGGCCTCCAGATTATCCAGATAGAGATTCTGACCGTGACGCCACCAGTCATCCTCGCGATGAAAATCCGGCATCTCAATCCCCGCCAGATGATAAGCATCCCGGAACAGCGTGTAACAGTCCGTCACCCCGTGCTCAAAGCGCCGCCCGGTGAGATGCGGCACACAGCGGAACTTATGAATCGTCCCCCGGCAGACCAGCCACCACGGCAAATCACTCTGCACCTGCAGCCGCCGGTCGGCCTCACTCAGCCAGGGCAGACCACCGGGGTGGCTGTGGACCAGCGCCACAATCTCACCCTGCATTTCTGCCTGCAGCCAGTCTTCCGGCGACATACGGAAATACGCCTCCGGCTCACCGGAGATATTCACGCAGGGGAAATATCTTTCCCCCTCCGGCGTGCTTACCACGAAGCCGCACGACTCCGCTGGCGCACATCGCCGGGCGTGCGCCAGAATCGCTGATTCTGTCTGTGTCATGGGATTTACTGCGAAAGTTTGTTAATGGAAAGGAAGCCGCCAAAGTTGCCGACGTTATTGCGAAACTTACAGCCACTCAGGCATTTGCTGCATTTATCCTTCGTGATATCGGACGTTGGCTGGTCATATTCATCCGCGACCGCCGGACCGTGATAACCGCACTCATCGCCGCGATAGGTCCAGGTGCAGGTGTTGGCCAGCATGATACGTCCCGGAAAAACAGCGCCATCCGTTTCCGTCGGCGTGGACAGTACAAAGGAGGCACTGACCGCGCTCAGTTCGCTGCACTGCTCGATGCGCCAGCGGCTGATCACCTCCTGCTCCGGATCGGCGTCGCTGTTTCCGTTGACGAAGTTCACCGCATCCAGAAAACGGGCGTAAACCTTACGCCGGACCACCGTTCCGCCGACCAGACTCTGCATATCTTCCGCCATCCCGGTGACCATACCGTACAGGTTAGAAACCGTCAGCGTGGGGCGTGTACTGGTGCCTTTGCCATTCAGTTCAAAACCGCTCCCCTGAATGGGATACGGCTGATACTGTCGCCCCTGCCAGGTGACCGGCTCACCTTTTTCGTTCTGCTCATTACAGAAAAAATAACGTTCTCCACCGACCTCTGTCAGGTCGATTTCCCAGAGCACCACGCTGGCCGACTGCTCCGCACGGGTGCATTCATTCAGTGTTTCCTGCCGGATATCCTGCATCAGTTCACCACCTGTTCAAACTCTGCGCTGAACTCAACACGCAGCATACTGACCCGCGACGACCATTTTGCGCAGGTCACCTTTATCTGCCGCCACTCATAAGGCGGCGTCCACAGAAAGGCTTTCCAGCCCCCGTGCTCTTCCAGAAACGACTCCAGTACCGTGGCCTCCTCACGGGGGACAGAAAGCGTCACGCTGTACGTTTTCAGGTTGGCATTCAGCCCGGCAGGCGCTCGCTGGGAATAGCCATCACCAAAGCGCACCTTTCTTACAGAAGGGGCCGAAGCCACATCCATACCGGGTTTCACTTTCCAGCGGAAGGTTTTCATCGTCCACCTCCGGAGAACAGGCCACCATCACGCATCTGTGTCTGAATTTCATCACGGGCACCCTTGCGGGCCATGTCATACACTGCCTTCATCATCTGTGGACCTGGCAGACCATTCGTACCGTCGTTCTGAATCACCACGTGATTGTTCTGATTAAAATTAATGCCTTCAGCCCGCCGCATCTGCGCCGGACTTCCGGCACCGCCCACATAACCACCTTCCGCATAGCCCCGCATCAGGCGGTACAGGTTGCCGACACCAATCCGGCTGGTTGCCTCCTTCGTGAAGACAAATTCACCACGGTGAACAATCCCCGCTGGCTCATATTTGCCGCCGGTTCCCGTAAATCCCCCGGTCGCAAAATGGAATTTCGCCGCAGCTGCCTGAATGGCTGTACCGCCTGACGCGGATGCGCCGCCACCAACAGCCCCGCCAATGGCGCTGCCGATACTCCCGACAATCCCCACCATTGCCTGCTTAAGCAGAATTTCTGTCATCATGGACAGCACGGAACGGGTGAAGCTGCGCCAGTTCTGCTCACTGCCGGTCAGCATCGCCGCCATATTCTGTGCAATACCATCAAAGGTCTGCGTGGCAGCACTTTTAACCTGCGACATACTGTCCGTGGCGCTCTCTTCCCACTCACTCCAGCCGGACTTCAGGCCTGCCATCCAGCTCCCGCGAAGCTGGTCTTCAGCCGCCCAGGTCTTTTTCTGCTCTGACATGACGTTATTCAGCGCCAGCGGATTATCGCCATACTGTTCCTTCAGGCGCTGTTCCGTGGCTTCCCGTTCTGCCTGCCGGTCAGTCAGCCCCCGGCTTTTCGCATCAATGGCGGCCCGTTTTGCCCGTTGCTGCTGTGCGAATTTATCCGCCTGCTGCGCCAGCGCGTTCAGGCGCTCCTGATACGTAACCTTGTCGCCAAGTGCAGCCAGCTGGCGTTTGTACTCCAGCGTCTCATCTTTATGCGCCAGCAGGGATTTCTCCTGTGCAGACAGCTGGCGACGTTGCGCCGCCTCCTCCAGTACCGCGAACTGACTCTCCGCCTTCCACAAATCCCGGCGCTGCTGGCTGATTTTCTCATTTGCTCCGGCATGCCTCTCCAGCGTCCGGAGTTCAGCCTGAAGCGTCAGCAGGGCAGCATGAGCACTGTCTTCCTGACGATCGCCCGCAGACACCTTCACGCCGGACTGTTTCGGCTTTTTCAGCGTCGCTTCATAGTCCTTTTTCGCCGCCGCCATCAGCGTGTTGTAATCTGCCTGCAGAATTTTCCCGTCCTTCAGTGCCTTGTTCAGTTCTTCCTGACGGGCGGTATATTTCTCCAGCGGCGTCTGCAGCCGTTCGTAAGCCTTCTGCGCCTCTTCGGTATATTTCAGCCGTGACGCTTCAGTATCGCTCTGCTGCTGCGCATTTGTGTCCTGTTGACTCTGCTGTTCAGCCTTCTTTCTCGCGGCTTCAAGCGCAAGACGGGCCTTTTCACGATCATCCCAGTAACGCGCCCGCGCTTCATCGTTAACAAAATAATCATCCTTGCGCAGACTCCAGATGTCGTCCGCTTTCTTAAACGCAGCCTCTGCCTTAATCAGCATCTCCTGCGCGGTATCAGGACGACCAATATCCAGCACCGCATCCCACATGGATTTGAATGCCCGCGCTGTCCTGTCTGCCCAGGTCTCCAGCGTACCCATGTTCTCTTTCAGGCGGCGGGTCTGGTCATCAAACCCTTTCGTCGCGGCCTCGTTCGCCGCCTGCAATGCCCCGGCTTCATCGCCGGAACGCTGCAACTGAGCAACATACGCAATCTGCTCCGCCGTCACGTTATGGAACTGGCGTGCCATCGCCGTCAACCCCGACGTCGGGTCTGTGGTCAGCTTCCCGAAGGCTTCAGCGACCTTGTCCACCTCCACACCGGATGCAGAGGAGAAACGCGCCACACTCTGGCTGATGGACGCAATCTGAGCCTCACCGCTTACTCCCGCCTTAACCAGTGCGCTGAGTGACTCGCTGGTCTGGTTAAACGTCAGCCCTGCCACCTGCCCGGCTCTGGACAGGACCAGCATGCGATCTGCCGTCAGACCCGACTGATTGCCGGAAAGGACCAGCGTTTTGTTGAAATCGGACAGGGTTGAGTTGCCCTGATACCAGGCATACGCCAGCGCACCGGTCGCCACCGCCAGCGAGGTGGCCCCCACCATCGGCAGGTTGATCGCACCGGCAAGCCCCCTGAACATGGGGATCATCCCGCCGAAGGAGTCCTTAACCTGCCCCCCCTGTTGCAGCAGGATCAGCCACGGACTTTGCCCGCCTGCAAGCTGCGTGGCCACGTCGGTGAACTGTGCAGGCAGCATACGCATGGCGGCTTTATACTGTCCGACGGAAATCCCCGCTTTCTGTGCAGCCAGCGCCTGCCGGTTCATTGACTGTTCAACGACTGCCGCTGTTTTTTTCGCATCACTTTCCGTACCGGAAAAATGACGCCTGACTCTGGCCATCTGCTCGTCAAATCTGGCCGCATCCAGACTTAAATCAACGACCAGATCGCCTACCGGTTCAGCCATACCGGACTCCTCCTGCGATCCCTTCTGATACTGTCATCAGCATTACGTCATCCTCCGTCATGTCCGCCACATCCGGGGAAGCGGGGATAACTTCATTCCCGTCCGGGCCAAAACGAACGCCTCCGGCAAGCCCTGCCGCTTTCTGCATCAGCACATCATCTTCAGGCTCTTCGTCAGCCTCGCGCCGGTTCAGCAGACTGAAATCCAGCGGATGCATCTCCGGATCGCTGAAAAACAGGCTGAGCACGGTGTACGTCAGCCCGGAAAAGTGCATATCCAGCAGAACATCATGAAAATAATGGGTACTGTAAAAGCGGTGCCAGTCGGCATACTCCGTGGATGACATCCCGGCAAGCATGGCGCGCCAGTCGGGTCGCCCCATCTCACGCGCCAGTTTCAGGGCAAAACTCAGCTCACCGTCGAACACTTTCCCGCAGAAACAGGCACTGCAGGCCCGGCGTCCTCTGCCTGTTCAGGAGCATCATTCACCACAAACTCATACATACCGGACAGCCGGTACACCACGTTTTCAGCATGAGAAATTGCCTCTGTGGGCCAGGTGGTAAGCACTTCCTGCTCAATCTGTTTAACGGCTTCATTCATGGAAGGCAGCTTTGTCTTCTGCGGATGGTTATGCCACAGGGACATCGCTACCACAAAAGCACCGGTTCTGATGGCGTCTTCCACAGTAAACTTCCGGTTGCTGTCTGACTCCGCCTGTTCTGCCTGCCGTTTCATCAGGTCGAGATGCTCAATACGCTGCAGGGCTGACAGTTCAGAAAGCGTGACGGTCACACCGTTATGTTCAAATGATTCGGTTTTCAGGAACATCGCTGACTCTCCGGATTAACTGGCGGTGACGTTGATTTCTGCAACCGCAGCAAACTCACCATTACCAGATACGACCGGAATGTTGACCTTGCCTGCAGCAACACCTTTCACGGTGATGGTCATACCACTGACCGACACGGTGGCTTTTGTTTTATCCGCAGACACCGCACGGAAGCTCTTGTCGGTTGCGCCATCCGGCTGGAATGCCACGGTCAGCGTGGTGCTCTGCCCTTTCACTACGGAAGCACTGGCGGGTGTCACCGTCATGCCGGTCGTCGCCGTCACCGTGCTGCGATCTTCTGCCATCGACGGACGGCCCACATTGGTGACCTTCACCGTGCGGGTAATCACTTCCTTCGCCGTCACCGCCTTACCGATACTGCTGACCCAGCCACGGAACACATCGACCGTGCCGTTCGGGAAGCGGATTTTATAGGCACGGGTATCACCTTCATTAAACCACGCCAGCAGCGCCTGCTGCCCCTGCTCTCCGGGCATCCACGCCAGCGTGAAGCTGGTATCTCCGGCAGATTTCTGCCCCTGCCCGGTCGCAGTCCAGTCTGCATCTTCATCATCGAGATAGCTGTCGTCATAGGACTCAGCGGTCAGTTCGCCGGGCGTCAGGTCTTTAACTTTTGCCAGACGCGACCAGTCAACGTCTGAAAGCGGGTTCGCATAAGGGTCACCGCTCCCCTTATAAACCCACAGGGTGGTTCCGGCACCTTTCACCGGTATTGCTGGATTTGGTACAGGCATATCGTCCTCACATTTCATAGGTAATGACATACGTCAGATCGGCAGAACTCCACAAGCCCGCATCATCGTCGCGCCGGTAGTCATAGCCACTGGCCACCATACTGGTGATCAAATCTGACAGTGCCGGGATATCGCTCATCACCGGATAAATCCGGGACTCCATCCACGCATCCAGCTCTGAATCCGGCACCTGAGCAGGCAGGAAAACTTCAATATGCAGCTCCGCCTGCCAGGTATCGCTGTCCAGCTCTTCGCCCGTGTATTCAGCGCCGGTGAGATAAACGGCAATTGCCGGAAAATCTTCCTCATCAAAAACAGCAGGGCGACCATCAAAAAACGTCGCCCCGGTGTCATGCTTCTCCAGTGCATCCAGTACGGCTGCACGGAGTTCAGTATGTTTCATCGCTTTATTACCATCCTCAGTTGATGCTGCAGCGCATAGCCCAGCTCTTTCGGAAGACGCTCACGCCGTATCCGCTCAATATTCTGTTTAAACGCCGTGGTCAGCGGCACCGCCATCGGGATTTTCACCACATCAATGGGGTAACGGTTTTTCCCGGCCACACGCTGCATGACATGCCAGCGGCCATTTTTCAGTTGCTGAATAAACGCGCCGGGAATACGACGGTTTCCCACCACAAGCACGCTGCCGCCACCTTTCAGGGCTGAACGCTGCCCCTTTTTACGACGCCTGCGTCGGGACAGGACAATCCGCGCGTTACCCAGCTTTATTACGGGCAAATCCCCCCGGTTAACCTTGATTCTGGCCTGCGGATTTTTGACCGTGGCCCTTTTCAGCCAGGCCCTTTCCTTTACCAGTTTCCGGCGTACCTTTGTCTCACGGGCAACCTGTGACGCCGACTGCGATATCGCGGATGAAGCAACGCGGTTAATGGCCATTGCGGCGGCACCGGGCACCGCCGTTCTGCTGATACGGCTGAGGTTTTCAACGGCCTGCTCAAGACCTTTTATGGCCATACATCCCCCTTTCAGCGGCGACGGTTAACGGCAGGCGGTACGCCCCGCCCAAGCCAGAGATGACAGCTTCCGCCATCATCCGGCGAAATCCGGTCTATCCAGAAGTTTTCCTCACCGATGGTCAGCGTGTCGCCGCGCCGCAGCTGCCGCACATCATCAGTCCGGACAAACAGGGACGGGCTGGAGCCTTCAACGCGTACGCCCTGTCCGGCATAGCTGATATTTTCAGGGTCATCAAAAACACCACGTATTACTGCGCCGGACTGCTCACCGGATGTCATGGTGGCTGACGTTCCCATGTACCCGCGTATCGTTTCATCGGCGCGGGCAATGGCAGCATCGAACAGGTTATCGAAATCAGCCACAGCGCCTCCCGTTATTGCATTCTGGCCAGGCCGCGCTCTGTCATTTCAGCTGCCACACCGGCAGAGACACGGAACGCCGTTCCCGGCAGCACAAATGCCACAGCCTCATCCCGCGTGGCGTGAAGTGCATCAGTATGCAGCGTCACCAGTGCCACAACCGTGACCAGATCAGCCGTATCAGTCACGGTATCCGGCTGCGCTGATACCACCTCATTTTCATGTCCGGTCAGCGCATTTTCCGGGCTGACAGATGTGTCCTGACCGGCAGCGTCATCCGTGTCATCAAGCTCCTCTTCCAGCTCTGCCACACGGAGCGCCAGTTCTTCTTTCGTCCCCGTCAGGCTGACATCACGGTTCAGTTGTTCACCCAGCGAGCGGAGACGGGCAATCAGTTCATCTTTCGTCATGGACTCCTCCACAGAGAAACAATGGCCCCGAAGGGCCATGATTACGCCAGTTGTACGGACACGAACTCATCAGGGTCAGCCAGCAGCATCAGCGGTGCTGACTGAATCATGGTGAACTCACGCGCCGGATCGCCGGTGGTCACCCAGTTTTTCGGGTAACGGGCAGAGGCGTTAATGCCTTCGCGCTGTGCGTCCGCATCCTGAATGCAGCCATAGGTGCGCAGACCGCGTGCATGAGTGTTACCCAGCACCATCGTGTTGTCCGGCAGGAAGTTCTTTTTGACGCCGTTTTCCACGTACTGTCCGGAATACACGACGATGGCCACATCGCCATACATCCCCTTATAGGACACCGCTTTACCCAGGTCTTTCACCGCTGTCTCCAGCTCGGAATGAGAGCCGCGACGGGTATCCAGCTTCTCCTTGACGGCGTTGAAGGAACGGAACAGCGCCCAGCCTTTCGGATCAAACACGATGATATTCACCACGCCGCTGGCGTTCAGCGCGTAGGCTTCGATATCGTCGGTCGGGTCATACGTGGACTTGTCACGCTTGCTCCACTCCGTGCCGCCGGACTGCGTGATGTTGTTCGCCGCACTGCGGCCCATATCCACCTCAACCGGATCGAAGGCTTCACCGGTCATGGTGTATTTGCCCTTGAGCACGGCAGAAACTGCCTGCATCTCTTCGACCTGAGCAATGGCCAGCTCTTCGTCACGCATGTTCTGCAGGATGATGCGACGGCGGCGGTAAGCCGGGTCCGCCAGATTCTGCGGATCTTCATCCGGCAGGCGACGCAGGGTCATCTGCGGATTCACTTCATGCTTGGGTTACATGAGTCAATCCCGAAAGAACTATTTGATTTCACTTAGCTTTGTGGTTTATATATCTACGCAACTCCTATCACTGCAATACTTTCTGCAATACAGATATGTGTAGTGAGGACTTGTCTAAAAACCATATTCACGAAGAAAAAGGATGACACGATGCAACCAGAAGATTATGAAGAAAAGCAATACGAGGATGAGCCAGAATCCTATCCAATTGATGAGTTTCAACTTACTACTACGCCCAATGATTTTAATATAATCACAATTATAAGTTTTATTAAATCAAAAGTTTTTAAAATCCCTAACTTTCAAAGGCATTATGTTTGGGATATCAAACGAGCATCAAAACTCATTGAATCTCTTTTAATAGGCCTTCCTATACCTCAAATATTTTTATATGAACAAGATAAAAATGAATTTTTAGTGATAGATGGTCAGCAGAGATTAATGACCCTTTATTACTTTGTAAATGGTGTATTCCCTAGAAAAGAGAAACGTTCTGAACTGAGAAAAATCTTTGAAGATAATGGAAACATTCCAGAAAACATTCTTCACAATGATGAATACTTCACAAAGTTCAACCTAAAACTTGATGGTCTATCAGACACCCAAAAAAACAAGTTTAATGGGAAAAACTATGAAACATTAAACGAATTTCAAACCACTTTAAATCTGGCAACCATACGAAATATGGTAATCAAACCGGTTGCACAAGATTCGGAAGATGGTGCAATGTTTGAAATATTTAACCGCCTAAATAGTGGGGGAATGAACCTTTCTCCTCAAGAAATTCGTATGAGTTTATATCACTCAGACTTTCTTTCAAATCTTGTTTCATTGAATGAAAACAAGACATGGAGAAAAATTCTTTCAAAAAATGTTGTTGACATGCGATTGAGTGATATTGAAGCGATATTGCGCACATTTGCTATGTCCCTTTTTACATCTCAATACAAAAGCTCAGTTAGCGGTTTTTTGAACAATTTCTCAAATTATGCAAAAAACTACGACACTAAAGACATAATTTTATTTAGTAATATATGGAATGAATTTATGGATAGTGTTGATGGAATTGATGAAATCAATTTCAGAACTGGTGGGAATCGTATGAGTATAACTTTATTTGAGTCAATTTTTTATGCTGCAACTTATGACTCATTTAAAGATAAAGATCTAAAAATAAGACAAGTGACGGTGAATTACATCGATAAGCTTAAAAATGATCCTGAATTTCTGACATTTAGTACTGATAAAACAACAAGAAGAGAGCATGTAATTGGACGTCTAGAACGTGCAAGAACAATTTTGGAGGGAATGTAAAATAATGGAGGATATGGGATATACTTCTATTGAAAGTATGTTTAATAACTATAAGTGTTATTATGATTTTCTTCTAACTCATAATGAGATTAGTTTTGCTAATGATTACAAATCACAATTTTCAAAAGTAATGCTATTAGCATGTGCTAGCTATTTTGAAACTTTAGTTGTAACTAAGATACATTGTATGCTTAACCCAAGCCAATGTAATCTTACACACGATTTTATCGATAATAAGGCCCTAACCAGGCAGTATCATACACTCTTTGATTGGAAAAAAAGGAATGCAAACCAGTTTTTTTCCTTTTTTGGCCCAAAATTTAAAGAGTTTATGATTGAGAAGGTAAAATCAAGCACAGAGTTGACCAAGTCTATCTCTGATTTTATGGAAATCGGAGAGCTAAGAAATAAATTAGCACATAATAATTATGCTACTTTTGTATTAGAAAGCACAGCTGAAGAGATTTATAATAAATTTTTAAATGCACATTCATTTGTCTCTCAACTAGATACGTTCAGTACACAGTTTAGAGAGCAAATTGGTGAACAGTAATAGTTTATCTCCAGCAAAAAAAATAACTTGCAAATAATAAAAGCGGTCGCGTGACCGCTTTTTTCTATTTATCATTGCTATTTTCCTTTATTATTTCAGCAATTGACATAACTAAGGCGTCTGCTTTATTTCTAGTCCTATCAAACATATCTCTAATTACATCAATATAATAATGTTCCTCAAAAATATTTACGTCTGACAAAAGTATTGCACAACTCTTGCCCAAAATGTCAGCCAGTTGATCTTTGTTGACTTCCATTGTTCATTCCACGGACAAAAACAGAGAAAGGAAACGACAGAGGCCAAAATGCCCGTTTTCAGCGCCTGTCATTTCCTTTCTTTTCAGGGGGTATTTTAAATAAAAACATAAAGTTACGGCGAAGAAGAACGGAAATGCCTTAAACCGGAAAATTTCCATAAATAGAGAAAAACTGCGCGCCTGACGCCCCGTAGCCTGTCAGATCGCCGGAAAGGACCCGCCAGCCAGAGCGGGCCCTAATTTCATCAACCAATCAGCTTATAGCGACCATCCCGTGCATTGCGGCGTACACGCTCAATCTTGAGGCATAGCGCCGCATCTGGCTTTTTTGGGACAGGTACGCGGCAATATTCAGAAGCGCGAGGAATATTATTTATCCAGTCGATCACTTCACTTAAATACCAGGCCTTACGCCCTTCCGTAACCTGCACACGCTCCGGGAACTCTCCACTAGCCTCAAGGTTTAGCAGTGTACGCCGACTCAGGGTTGTAATTTCCATCACCTGATTCATATCAACAAGGCGCTCGCTTAAACACATTTTGTCAGCGATAGCTTTTAATTCCTCTACAGCTGGATTCGGGTACATCATTTCGGCAATTGGCTTAAGGTCATTGTAATCATTCTGCATTGTATCCCCCTTTACACACGAGCCAGCGGCTGAACAGAAATACCTGAGCCAACAAACGCGGCAACCTTTGCCGACAGTTCTCTTACAGACTCAGGCCAGTTCAGAGCATCAACATTTAAAACACCTGTCTTATAAACCTGTGCCTGTGTTTTTTTCGCTGTGTCGATTTGTACAGCGGAAACATAAACCGCTTTGCCTGCGCTCGTTCCATCCCATACCACTAGCGCACCAGTTGCATCTTCCTGCATCAGTGGCGTAAACGCAGGTATTACCCCTTTATTGGCTGAAAATATCCCCAGCGTAGTAACCAGTGCTTCAGTGCCAGACATGAGTTCAGTGTAATGAGTAGCCATTGCTCCCCCTTAGCCAATGCGAACGGTAACAAAACGATTGATGCGGGCCGGTATTGGCTGTGGTGCTGAATGTGTCTGCACATATTCAATAGCCGGATCACCAGGCACAATATAGTTTTTCGGTGCAAGTTCGGCTTTAGTCAGCCCCATTCGGATTAGCTCCGGATCCTGAATACCGCCATAGGCGACAATCCCCTGAAGAGCCGTATTGCCAAGCACCATCAAATCAGGATCAAGGAAATGTTTTTCAGTTCCGTCCTCGTCGGTATAACGCCCGCTGTATACAACAATCGCAACATCGCCCATATACCCTTTAAAACTCACCGAATCACCAAGGTCTTTAAGGGCCGTTTCCAGTTCGGAATTAGAACCACGACGGGTATCCAGAGCCTCTTTTATCGCTCTGAATGAACGGTATTTCTTCCATACATTACCGCCCATAATGATGATATTAGTGACGCCCTCACTAAATTCTGCGTAGCTCTCAATATCATCATTTGGATCAAAAGTTTCTTTATCCTTACCTGACCACTCAGCACCGCCAGACTGAGTGATGATATTTTGTGGTTTAATATTCCAGTCCAGCTCATAACGTTCAATACCATCGCCCTCAATGATATTTTTCCCCGTTGTGATTGCCTGAACGGCAAGCCATTCAATACGTGCACGAATAGCTTTAGCCTGATTTACAATCGCCTGTTTAACTTTAATATTACGCGCTCCAAAAGCATTGTATTGCTCAGGTGATACACCAGCAGGGCGCACAGCTAACTTATTTGGATCAATGCTGCTTTTCGGCTTCATATAGCCTGGACGAATTGTTTTTGATTCGTACCCTTCGTCACGTGAAACTTTACTACCCACCATAGGAGAACAAAACGCTGCAATTGGGATATTTGGATCGTCGATTGTATCAAGAATAATATCGCGCGATTCAAACATTACCGAGCGAGTAAAAAACAAACTGGTAAACAACGCATTTAGTTGTTTTTGTACATCTACAGCATTAACCACCTGTACAAGCTGGGTAGGCGAATATAAATCAACCATACGCATCCTCTTTGCATTCATTAAAAATAATTGTGGATATATGCTATCACCGATATTTGTCATGCGAATACATGCAACCGAGTGCAATGTTGTATAACGTTTTGGGATGACAACTTCAGCACGGATAATTAGTGTTAATATCTTCACTCCCTTTGGTCTGGATTTATGTAGCATGCCGGAAAATTTATTTTTTTTCCGGCCTTTTTTATTGGCAATATTTAAAACGGAATATCATCTCCCCATTGCTCATTATCTCCCACTGGTGGATGGCTTCCTTGCTGATCTGCCTGTTGTTTTGCTCTGTTCAGTGCGTCAGTAGCCTGACCCTGTTGATCTTTTTTGCCGCCCGGTCGCACCGTTCGCGCACTGATTACGCTGTCTGCGATAACCTGCCAGCCCTGCCGCGTTTCACCGTTCTGTCCAGTCCACTGGCTCACCTGCATATTACCCGCCACGCTCAGGAGTTCGCCTTTGTGATGCTTTGCCAGCGTGTCGGCTTGTCTGCCAAACGCCAGCACAGATAACCACATCGTCGCCGTTCCGTCATCTGCCTGGCTGCATGGCAAAGATACCGCCATCCATGCCAGTGTCATGGGTGTTCCCTTGCTGGTATGTTTTACCTGTGGGTCGTCCACCAGCCGCCCGTAAGCTGCTATTTGCGCCGTCATGCTGCCTGCTCTCAGGACTTAATATTGATGGTTGTCACTTCCTCCGCTTCGGCAATCTCCCGTTCGGTCAGCGTGGCAAAGTTTGCAGCCGCCGTTGTCATGAATGCGCTTATCAGTTCGGGATGTGCTTTCGCGTATCCTTCCCCGGCGTTGCGGTCTATTGCCTTAATCGCCACCCTTAGCCAGTGTTCAGCCATATCAAGGGCGCGGTAATGTGGCTTTATATGTTTGTTCAGTTTTCCTGATGTGTGCATTTTTATTTTTACCCCCTCGTTTAAAAAGTTTTTTGTGCACCACCACCTTGTCTACCTTGTCTACCTGATTAGTTATCAGGCCAGTAATGGCGCGGGTTTCAGGGAGGTAGACAGCCCCAAATAGCTGTCTACCTCATCTCTACCCGTCTCCTTACCTGTCTACAAAAATGGGTAGATAAGGTAGATAACAGGTAGACAGTGAAAAATAGTTATCTACCTGCATTAATACATTGAAATAAAAGTATTTTATTTCAGTCAGGTAGACAAGGTAGATAACCATTGCCATTTTTTATAAAAACGCATCGCAATCATCAGTAGTCGTTGCGTTGGTCTGCGTGACTCCCTTAACTTTTCGCGTAATATATTCATATCCGTAAACTTTCGCCGCTGACCTCATAGCCTTTCCGAACTCATTCACGCTCAAACATTTCCCCTTTCCTGTATATGCCATGAAGGCCATATAGACACGGTAAAGGCTGTTTCTGGTCGTGTACTTCACGGTGTCACCACCACCGCCCATCATTAGCCCACGAGCTTCCTCCAGAAACTCCAGCGCCGCGCAAAGCTCAACAACCGGATCCGTTTGCTGCTTTATTGCCAGAGCTTCATCACCGTCACGCTGTTCCAGTAATAAAGCCCGTGCCTTTTCAGGGTCAGCAAAATTAGCCAGCAAGCGGCGGATAATTACGGGGATTTCTGCCGCTATCTTTTCCGGTAATTCCTTGTCTTTTTCGTCCTCCCTTACAATGTTGTCGAACCGGAAAATCACCCGACGGCGTGACACACCTCCGGCCCGTTCGGTAAAGATCATCGGGTCGTTATTGGTTGCCAGTACCACCGCCCTTATTATCGTCGTGAATCGCTTCTCATATTTCGGGTTAATTTCAACGGGATCGCCTCCCGTGATTTTCTTGATGCCCGTGCCTTCCCCCGTATATTTCGGCTGATCGGCAAGGACGATAAGACGACTCCCGACAACCTGCGCACGCCCTCCTGCATCATCGAGTGATGTCATCTCTGCGCTTACGGTGTTCTGTTTGCCAGCAAGCAGGGTGGCAATATGGGTAAATGTACTCTTACCGCTTCCCCCGTCTCCGGTGGCCTCAATGAACATCTGCCAGTCGTACCGGTTCGCCATAATCATGTACAGCGCGGCACATATACGCATCATCTTGCGCGGGTCTTTTCCGGCTGCGTGCTCAAGCCATTTATGAAAGTTTGGCGCGTTATCGCGGATGTTCTCCCCTGGTGCTGGTGGCGTGTACTCAATGCCGTTGTGCGTGGTGATCCAGTTCTCCGGCGTGTGCGGGGAAAATTCCCCCGTTTTCAGGTCAAGCGCACCATTGGCGAACGGCAGCAAATCGCCGGACGGCTCGCCCATTGGTTCGGCAATAACTTTTAACGCTTCAACGGCGTTATTGATCACGCGTTTGCTGAAAGTGGCCCTGTGCTCTGAATAGATCGCCACCATTTCGCGGCTCAGCTCCATTGTGCTGATCGGACACCATACCCCGCCGCGCCATACGTGAACGATTTCACTTTCCGGATGCACACAAACGCCATCAAAGCGCTCGGCAAGCAGCTGCGCGCGCTCACTGTCTGCCATCTGTGAAAGTTGCGCCTTTTGCTTCGTCGGAAGATTAAGCACCAGGCTTTCCCCACGCTCGCATTCCTCTTTGAGTCGCGGCAACTGGTCGGATAAATCCACTGGGCTGGTGTCAGTAATCCCCGCGTATTCGTGTACGGTCTTCACTCCAGCCACAGCCAGTAACGTAACAATCTGCGTCATGCTGTGCTCTGTGATATGTCCTGCGCGGTAAACACGCACACACTGACGATCTTCATCAATGATCCGGTAATCGGTGATGTTTTTCAGTTGCTCATCAGCCAACACGACAGGCGGCACATCGTCGGCGGCAATATGTTTACCCGCCCATTCCTGCCACTCTTTCGCATGGCTCCACGCATCACTACCTGCAAAGATGATTATCTCTGTCAGTCTGTCGCGTGGCTGTTTTTTTAAGTTCGGTGCCAGTTTCATTTTTTGCCCCTGAATGCGTTAATCATGCTTTTCATTTTCTGGATGTTTCCCCGCGCTTTTTCCCTGCTGATGGGCTTACTGCGGGGTGCGGCATATACCAGGGAAAAATCACGCCGGAACTGATAAACAGGCATCACGCAGTCATAGCTATACCCCTCACGACGGTAAGTGATGCGCCGTTCTGCCACGCCTTTAATCGTTACCGTGCCGCCGTATTTATCGCGGTAAATATCGCCGTTCATAAATTCAGGCCGAGCGGGGCCGCTGGCAATAAAGCCAGAATTTTTCATTTCCATATTATTTATTCCTCGACTTAACTCGACTTATTTGATAGCAGGGCACTATTTATTGCGTCATTGAGTTTTTCTGCTGCTTCATCAATAAGTGACAACAGGCCATAAGCAATATTTGCATCTTCATTGTCATTTATGCAATCAAGCCACATATTTAATATTACTTTTGCTGAATTATTTAAAGTTAATGAACTTTCTGCACATGCTAACAATTTAAAAAAGACTTCCCGTTCTGTATTCATTTAATCCCCCACCAGCTTACTTTCTTCCTCAATCAAAAAACTAGCGACACTTCCCGAAAGACGCGCCAGTAGGCTCACCAGTGCGGATATATCAGCATCTGTAATTTTGTTCGGGTATACCTCAAGAAGGCGGCAAATAATTTCTGTCTGGTGCGCACGTTCAGCGGCTTCGTGTAATGTGATTTCCTGCATTAATGCACCTCTTTTAATTCATACACTGCTGAAATAATGACTTGTGATAAGCCATATTCTGATGATTCGCTTCTCACCGCAGCAATAGCCGTCTGAACATTAACAGCCTTCACATTCTGAGCGATACCAATTGTGTGGCCTATTGGGTTAACAGCTCGGGCAAATACACGGAAGGTTTTAAGCATGACTCACTCCCTGGCGGATTTTTGCAGCGAATACAGCAACACAACCGGACGGGCAACGGCTACGCGCTTCGCGTTCCGTCCAGGCGGTTACGTGGATGATTTGAGATTCTCCGGCACTCAGTGCCAGAAAACGCCACACAAAGGCCGTTTGTGTGTGTACAAGGTGTGGTATATGATTTACAGCAACCATAACGGCTCCTAGTTTACGTTGTTGGTTAGACGCCCCGTATGTGTTCCCAGCACTGCGGGGCGTTGCTCTTTGTATTTCAACAATCCTTTCGGTGTGTTTCATGTTATGAGCGCATGAAACACACGTCAAGGCTTTTTGTATTTCTTTTTTTGTGTATACTGAAACACACCGATGATTAGGAGTTTCAGAAATGGCAACGGCTAACAAAAACGCAAAATCACAACTGACAACTGTCAGAGTCCCACTAGATGTTATGCAAGGGATGGAATCCGTTAAGCTGGACGGTGAAAGCAATGCCGGATTTATCGTAACCGCCATGCGCGGTGAAATCGCCCGCCGCCAGGCAGAAGGAAGCGGAGAAAATCCCCTCGTGTCTTCACTGGATGCCTTAGCTAAGGTCGAACAAATCGGCATCAAGGCAGCGGAGGAAATCGGGCAACTCGTAGCCGTCGCTCGTGAAGAACTCCAGCGGCGTAAAGCCAAAGAATCTGAATAATTAGTATCAGCGCCGTGATGTGAGTAACTACGGCGCATTGCTATGTAAATACTGGCAATAAACAGAAAAGGTAGTTCTACTCCGAATAATTTTATCTGACACTACTCCTGAACTAACATGCGCTTATCTTACAGGATATAAATATAAATCCATAAAATCACGATTAAATAAAGTCGCTCCAAACATAAACCACACCCAACGCTTAACAAGATAGCAACAAACAGGTAAATAACTTGCAGAAATATTTATCGCAAGGATTATCATTATTAATGACAAATCACTTTACCAAGTCATTCCCCTCTCTTATCATAAAGAGAAAGTAATAAATAAGTTAAGGGAGTTAGAATGCTATGAATCTAAAAAAAATAGCCACAAACACAAAAAACAAGATAACAGAAACATTTAATAAACTTATATTAGAGGCATCTAAAACCCCCACACAAGATGAAATTAAAATACTTGAGAGAAGGAGTAAGAAGTTTAATTACTCCTTTTTCTCATACGCAGTCACAGGAGCTATAATAGTTTTTTGCTCTCAACCATTAATTAAATACGCAAACCCAATACTTATTTTATTGAGTGGTCTGTTACTGTCTATCATCATTATCATTCTCAGAATGATTTATATTTCACAAGCGAATGCATCATGGACAACCAAAAAACGCTCACATGTACTAGTTCATTTTCTTTCTGCATGTTTCATAGCATCAACATTGACGTTGCTATATCAGGCTTACGATAATAACATCACACACAAATTGTACTGTAAAAATATACAACAACTTATTGAAAAAAGGATAGAAACAGAAAAAAACATCAGCATATTCAGTGGGATGCAATGCACCCCGGTATATGATTACTCTTTATTTGGATTTAATCTCTTATAAAGAATGTTATTACTGATTTGAGTACAAATTCTCAAATCAGTAATTCATAATATTTTATTCTGAGATAATTTAAACTACCCACTCACCTCGAATCCATGCCTGCACTTCTGAAAGACGATATGCAACAGCAGTGGAACCAATCTTGATCCGCTTAGGAAATTTTCCTTCCTTCTCCAGCTTCCAGCGTGTGCTGTTCGCAAGAGTGGTTAGCTCCCGACATTCTTTCTCACGGATCATTCGGTCAATGTTAGGAATGTACTCCAGACCCTTTTTATCAACAATTGCCATTTTTTTCATGTTAACCAACCTTTTGTTTGAGGATTGTCACTTTTGAATCAGCACCTGCGATGCTATTGAGATATGTAGTCCAGAGTTCCAGAGCATCCAGTTTTTTAGCCATAAACTTACTCCGGTTGTAAACACCTGCCACGCCAGGTAGCGCATGGCCTAACAGTTGTTCTACTACATAAAATTCAACACCGAGATCACTTAGATGAGTAGATAGCGTTCTTCTAAGGTCGTGTAGTGACCATTGTTTTTCATGGCCCAAACGTTTACCGATTTTCCCCCCAATCTTGCTTACGCTTTCTCTAATTCGCAGACTTCCCAGCACATAACCAGTATGTTTTGTCTCTTCGTGAACATCCGTTACCCACTGTCGTAGAATTTCAGGTACTGGTCTGACGATTTCAACACCAGTTTTTGAGTGATCTTTTGGTACAGTCCAAACCCAACTTTCGAGATCCCATTCGCTCCATTCTGATAATCGGGCTTCACTCATTCGACATCCAAATACTGTACAAAGCACAAACATTTTTCGCGTGTATTCAGACATTAGTTTTAAATCAGGCTCGACAAAAATTGCCTTCCAGAGCTGGCCGAGTTCGGCTTCATCCAGAACCCGATCCCGCTTACCTGCAATCTGCCCCACATCACTCATGCGCAAATCCTTTAAAGCATCACACGTCGCGTACTGGCGTACCCGACAAAAACGAAGAGCTAATTTAGTGTCAGAAAAAACATACGCCGCCATAACTGGTGCATTACGTTTAATTCGGTCAAAACAGTCCAGCCATTCATATAGGTGAGTGTCATTTACGGGCAAATGACCGATATAGGGAAAGATATGCTTTCGAAATCTGCCAAGCGTTACAGCATGAGTTTTACGACGCACCTTACAGTAATTTTCATACCAGTAATTTAGTGCATCCTCCACTGTAACCGGCTTTAAGCGTTCTTCAGCCTGAATCTTAATCTGGATACGCGGATCACGTTTGTCAGCCAACCAACCACGGCACTCGTCGCGCTTTTCCCTTGCCTGTTTGAGTGACATATCAGGATATTTACCCAACGTTAGCCAGACCGGAGCAGCCCGGCCACCTGCTAACCTGTAGAAGAAAACAAAGCTCACAGCCCCTTTAGTACTCACACGAATAGAAAGCCCCTTTCCATCAGCAATGGTGATCTGCTTTTCTCTGGGTTTCCCCAGATATCCTTTAAGCGCTTTGTCGCTCAGTTTGTTCTCGCCAGCCATTTTTAGCCCCAAAAAGCAATACAAGCTGCAATACAGAGATGATTGCAACACACAGATAACGAGGAAAATTCAGTGAAAGCACCAGATAAACTTATTCTTTATTATCAAAAGATTAAGTGTAAAAACCAGCAACTACACGAAAGCCTCAGAAAGCCATGCTAAGTGCTTGGGTTTGACATATCCCGGCGTAAATTCAGAGGTGGAGCCGCCACGGGAACGGATAACCTCACCGGAAACAATCGGCGAAACGTACAGCGCCATGTTTACCAGTCCCGGAATTTGTGAGAGATAGACTTTCTCCGTAGTGAAGGGATAGCTCTCACGGAAAAAGAGACGCAGAAACAGCGGATCAAACTTAAATTTCTGCTCATTTGCCGCCAGCAGCTGGGCGGTTGTGTACATCGACATAAAAAAATCCCGTAAAAAAAGCCGCACAGGCGGCCTTTAGTGATGAAGGGTAAAGTTAAACGATGCTGATTGCCGTTCCGGCAAACGCGGTCCGTTTTTTCGTCTCGTCGCTGGCAGCCTCCGGCCAGAGCACATCCTCATAACGGAACGTGCCGGACTTGTAGAACGTCAGCGTGGTGCTGGTCTGGTCAGCATCAACCGCCAGAATGCCAACGGCAGCACCGTCGGTGGTGCCATCCCACGCAACCAGCTTACGGGTGGAGGTATCCAGCATCAGCGGGGTCATTGCAGGCGCTTTCGCACTCAATCCGCCGGGCGCGGTTGCCGTATGTGCCGGGTCACTGTTGCCCAGCGGCTGGTAATGGGTAAAGGTTTCTTTGCTCGTCATAAACATCCCTTACACTGGTGTGTTCAGCAAATCGTTAACGGCATCAGATGCCGGGTTACCTGCAGCCAGCGGTGCCGGTGCCCCCTGCATCAGACGATCCAGCGCAGTGTCACTGCGCGCCTGTGCACTCTGTGGTGCAGCTGCCAGAATGCGGCGGGCCGTTTCCACGGTCATACCGGGGGTTTCGGCCAGCACGCGTGCCTGTTCTTCGCGTCCGTGAGCCTCCTCACAGTTGAGGATCCCCATAATGCGACTGTTTTCTGCCGCAACCGCTGCGGTGATCTGCGCGTTCACGTCCGGCTGCGCCGCGCTGGCGTTCTCGCCCTCCGTCGCTTGCACCACGCCAGTAACGTCAGCCTGCGAAGCAGTGGCTGAAACAGTTGTTGATTGAGTCTCTTTGGTCATTCGCCCTCCTGAGAGACGGGATTTACGTGCATCCAGTGCATCACGCATGACGGTGATCGCATCGGTACTGTTAACAAGTTCATCAGCCAGTCCGGCATCAATGGCCTCCTGACCGCTGTACACTGCAGCCTCGGTATCCAGCACAGCCTGCACGGACAGGCCGGTATATGCCGACACCTTCTGTGCAAACATCTGGCGGGTTGCGTCCATCCGGGACTGCAGTGTCTCCCGGACGTCATCCGGAAGATGGCTGTAGGGATTGCCATCCACCTTATGGCTGCCGCTGTAAATCAGCGTGATTTCCACACCCTGTTTCTCCAGGGCAGCACCGTAATTACTGTGAGCCATCATGACGCCGATGGAGCCTGTCCGGGCGGTCTGCGTGACCAGACGCCGGGAGGCGGCACTGGCAAGCAGCTGACCTGCACTGCAGTTCATGTCGTTGGCCAGCGCCCATACCGGTTTTATGTCACGCACACGGGCGATGATGTCAGCGCAGTCAAATGCCCCCGCCACCATTCCGCCTGGCGTGTCCATATCGAGCAGAATGCCGTCCACCATCGGGTCGCTGGCAGCCTGTTGCAGACGGGCGATAATGCCGTTGTAACCGGTCATCCCCGAGTACGGCTGCAGCGCCCGCGTCCGGCTGACCAGCGTGCCGGACACCGGCAGCACGGCGATGCCGTTCATGACCTGATAACTGCGGGCCTGTCGTGGTCCGTCATCATCACCGGATAATGCCAGCGTCGCGAGTGCCTCCTGGGCAGTCAGGCTGTCGCCGGACACCGCATCCGTCAGGCGGCTGATCCCAAGCTGGCCTGCAAGCGCACAAAAGAAAACCCGCGCATAGGCGGGTTCAAGCATCAGCGGCTCATTAAAGGCCATGCTGGCAATATGCGGGAGATTACGCAGCTCTGCTGTCACTCTTCTCCTCCTCTGTTGATTGTCGCAGCCCGGATTCAAATGCCGCAGCCGCCCAGGCGGGCGGTTTAAGACCAGCCGCGCGGCGCTCCATCGTTTCACGGACCTGCTGGGCAAAAATTTCCTGATAGTCGTCACCGCGTTTCGCGCACTCTTTCTCGTAGGTGCTCAGTCCGGCTTCTATCAGCATCGCCGCTTCCTGAACTTCTTTCAGACCATCGATGGCCATACGACCGGAGCCTATCCAGTCGCAGTTCCCCCAGGCACTGCGGGCTTCCTGAAAACTGAAGCGCGCTTTTGAAGGTAACGTCACCACGCGGCGGGCGATGGCCTCTTCCAGCCAGCACAGAAACATCTGGCTCGCCTGACGGGATGCGACGAATTTTCGCCGCCCCATAAAGTACGCCCACGACTCGTTCGCACTGGCCCGTGCCGTGGAGTAGCTCATCTGGGCGTAATTCCGGGAAAGCTGCTCATACGAGACACCCAGCCCGGCAGCGATATACCGCAACAGTGACTGCTCAAACACGGAGTAGCCGTTATCCGTGTCCTGAGCCGTCTGCAGGTTCAGTGAGTCCCCCGGCATCAGGTGCGGCACTTTTGCGCCTCCCAGACGGACCGGTGCTGCGGCGTAATACGCGGCAATTTCACCAATCCAGCCGGTCAGCTTGTCCCGCTGCTCCTTACTGTTCGCGCCCAGAATAAAATCCATCGCTGACTGCGTATCCAGCTCACTCTCAATGGTGGCGGCATACATCGCCTTCACAATGGCACTCTGCAGCTGCGTGTTCTGCAGCGTGTCGAGCATCTTCATCTGCTCCATCACGCTGTAAAACACATTTGCACCGCGGGTCTGCCCGTCCTCCACGGGTTCAAAGACGTGAATGAACGAGGCACGACCGCCGGGTAACTCACGGGGTATCCATGTCCATTTCTGCGGCATCCAGCCAGGATAGCCGTCCTCGCTGACGTAATATCCCAGCGCCGCACCGCTGTCATTAAGCTGCACACCGGCACGGCAGTTCCGGCTGTCGCCGGTATTGTTCGGGTTGCTGATGCGCTTCGGGCTGACCATCCGGAACTGTGTCCGGAAAAGCCGCGACGAACTGGTATCCCAGGTGGCCTGAACGAACAGTTCACCGTTAAAGGCGTGCATGGCCACACCTTCCCGAATCATCATGGTAAACGTGCGTTTTCGCTCAACGTCAATGCAGCAGCAGTCATCCTCGGCAAACTCTTTCCATGCCGCTTCAACCTCGCGGGAAAAGGCACGGGCTTCTTCCTCCCCGATGCCCAGATAGCGCCAGCTTGGGCGATGACTGAGCCGGAAAAAAGACCCGACGATATGATCCTGATGCAGCTGGATGGCGTTGGCGGCATAGCCGTTATTGCGTACCAGATCGTCTGCGCGGGCATTGCCACGGGTAAAGTTGGGCAGCAGGGCTGCATCCACACTTTCACCCGGTGGGTTCCACGCCCGCAACTGCCCACCAAATCCGCTGCCACCGCCGTGATAACCGGCATATTCACGCAGCGATGTCATGCCGTCCGGCCCCAGAAGGGTGGGAATGGTGGACGTTTTCATACATAAAATCCTGCAGGTCCCCTGCGTCGCTGTGTCATGCCGGTCTGCACTTCCAGCTCCGCAATGTATTTTTTCAGGTCAGACACGGAAGTGGCCGTAAACTCCACTCTCCGTCCGTCTTTCTGTACCGTTGCCACCCGTTTTCCTGTCATCAGGTCATGCAGTGCCGCACGGGCAGCGGCAAGTTCTTCCTGTCGCGTCATTCATCCTCTCCGGATAAGGCACGGGCGTAATCTGCCAGTGTTTTCTTGTTGGTTGCTGCACCATCCTCTTCCTGCAGGCTCGCCAGCAGTGCACTGAGATCCAGCTGCCAGCGGGAAATACTGATGCGCAGCGCCGCCAGCGCATAAACGAAGCAGTCGAGCGCCTCATTGCGTCGCTTTTTGCTGTCCCACAGTATTTTTTTCCTGCCATCCACCCATTTTTCGACCTGCTCTTCAGCAGTCAGTTGCTGCGCTTCGGTCAGATCAAAAATATCCGGGTTATTCGGGAAGTGAACGGCACCGGGAAGCGGTTCATCCCCTCCCGGCGTCAGTGTGAAGCGGTTATAAATCTGCTCTTTCGCGGTATCCGTACCGATTTCGGTAAGGTAAACCCCGTTTTTGTTTCGCTTACGTGGCATGCTGGCCACCGGCTTTCCGTAGACGGATGCCCCTTTAATGGGGATCACCCGGAACAGCCCATGTTTTTTCGAGCGTTCATACACAATGGTCGGGTCAATCCCGCCAGTATCCCAGCAGATACGGGATACCGACATTTCTGCACCATTCCGGCGGGTATAGGTTTTATTGATGGTCTCATCCACACGCAGCAGCGTCTGTTCATCGTCGTGGCGGCCCATAATAATCTGCCGGTCAATCAGCCAGCTTTCCTCACCCGGCCCCCATCCCCATACGCGCATTTCGTAGCGGTCCAGCTGGGAGTCGATACCGGCGGTCAGGTAAGCCACACGGTCAGGAACGGGCGCTGAATAATGCTCTTTCCGCTCTGCCATCACTTCAGCATCCGGACGTTCGCCGATTTTCGCTTCCCACGTCTCACCGAGCGTGGTGTTCACGAAGGTTTTACGTTTTCCCGTATCCCCTTTCGTCTTCATCCAGTCTTTGACAATCTGCACCCAGGTGGTGAACGGGCTGTACGCCGTCCAGATGTGAAAGGTCACACTGTCAGGCGGTTCAATCTCTTCACCGGATGACGAAAACCAGAGAATGCCATCACGGGTCCAGATCCCGGTCTTTTCGCAGATATAACGGGCATCAGTGAAGTCCAGCTCCTGCTGGCGGATGACGCAGGCATTATGTTCGCAGAGATAAAACACGCTGGAGGGATCATCCGGCGTCCATTTGAGGCCAAACGGCGTCTCTTTATCGCCAAATTTAAGGTACTGCTCCTCCCCGCAGTGCGGGCAGGCAACATGAAAACGCATAAAATGCGGGGATTCACTGGCTGCACGCTCAATCTGGCAGGTGCCTCTCACTTTGGGCGTGGAGCCACGGATGGACTTTGGCCAGACCGAGCCTTCAATACGCTTATCGCCCAGGAACGTCGGAGAGCCTTCCTGTTCAATATCCTCATCAAAGGCAGCAAGTTCATCATAACCCGCCACATCCACTGACTTTTCACGGTAGTTTTTTGCCGCTTTACCGCCCAGGCACCAGAAGCCACGACCATTGGAAAAACGCTTCATAGTGAGCGTGTTATCCCGGTGCTTTTTGCCATACCACGGAGCCAGCGCCAGCAGCGACGGAATATCGCGGATGGTCGGCTCAACGTGGGTTTTCATAAAGTTCTCGGCATCACCATCCGTCGGCAACCAGATAAGGGTGTTGCGCTGCTTATGCTCTATGAAGTAGGCATAAACACCCAGCAGCATTTTGGAATAACCAACACGGGCAGACTTCACCACATTCACCTCGCGGATGTAGTCACTGCCCATCGCATTCATGATGGCCCGCTGAAAGGGCAGTGTTTCCCAGCGCCCTTCCTGGTATGCGGATTCTTTTGGGAGATAGTAATTGGCATCCGCCCATTCAACGGCGGTCTGTGGCTCCGGCCTGAACAGTGAGCGAAGCCCGGCGCGGACAAAATGCCGCAGCCTGTTAACCTGACTGTTCGATATATTCACTCAGCAACCCCGGTATCAGTTCATCCAGCGCGGCTGCTTTGTTCATAGCTTTGATGATATCCCGTTTCAGGAAATCAACATGTCGGTTTTCCAGTTCCGGAAAACGCCGCTGCACCGACAGGGGGATCCCGTCGAGAATACTGGCAATTTCACCTGCGATCCGCGACAGCACGAAAGTACAGAATGCGGTTTCCACCACTTCAGCTGAGTCTCTGGCATTCTTCAGTTCCTGTGCGTCGGCCTGCGCACGCGTAAGTCGATGGCGTTCGTACTCAATAGTCCCTGGCTGCAGATCTGTCTCGCTGGCCTGCCGCAGTTCTTCAACTTCCCGGCGCAGCTTTTCGTTCTCAATTTCAGCATCCCTTTCGGCATACCATTTTATAACGGCGGCAGAGTCATAAAGCACCTCATTACCCTTGCCACCGCCTCGCAGAACGGGCATTCCCTGTTCCTGCCAGTTCTGAATGGTACGGATACTCGCACCGAAAATGTCAGCCAGCTGCTTTTTGTTGACTTCCATTGTTCATTCCACGGCCAAAAACAGAGAAAGGAAACGACAAAGGCCCAAAAGTTCGTTTTCAGCACCTGTCGTTTCCTTTCTTTTCAGGGGTTATTTTAAATAAAAACATTAAGTTACGACGAAGAAGAACGGAAACACCTTAAACCGGAAAATTTTCATAAATAGCAAAAACCCGCGAGGTCGCCGCCCCGTAACCTGTCGGATCGCCGGAAAGGACCCGTAAAGTGATAATGATTATCATCTACATATCACAACGTGCGTGGAGGCCATCAAACCACGTCAAATAATCAATTATGACGCAGGTATCGTATTAATTGATCTGCATCAACTTAACGTAAAAACAACTTCAGACAATACAAATCAGCGACACTGAATACGGGGCAACCTCATGTCAACGAAGAACAGAACCCGCAGAACAACAACCCGCAACATCCGCTTTCCTAACCAAATGATTGAACAAATTAACATCGCTCTTGAGCAAAAAGGGTCCGGGAATTTCTCAGCCTGGGTCATTGAAGCCTGCCGTCGGAGACTAACGTCAGAAAAGAGAGCATATACATCAATCCAGAGTGATGATGAATAAACATCCCGGTTTCTTCCACCATCGCACCGGAAAAGCGACTATGAGGGTAACCCTGCGTCTGTCAGCACAGTAAAACCCGGTGTGCATCGTTTTTGATTATTCCCGCACACTCACGCAGAAGGAATTCCCCGTCGGGCTACGGTCATGGTTAATGCGGGAATACGGCGACGATACAGCGCAGCTAAAAGGGTAATGGACAGAAAGAGCGGTTTATTTCATTCCACAGGATTCTGAGTGCCCCCCCCTCCTCCAATAGGCTGAGCATCCACCTATATAGTTTTAATTTTCATCAATCCATTTAACTATCGTTTAATTGTTGTCACATAGGATTCTGCCGTTTTTAACAATGCAGGATAATAAGATGAAAAAAATGTTGTTTTCTGCCGCTCTGGCAATGCTTATTACAGGATGTGCTCAACAGACGTTTACTGTTGGAAACAAACCGACAGCAGTAACACCAAAGGAAACCATCACCCATCATTTCTTCGTTTCGGGAATTGGACAGGAGAAAACTGTTGATGCAGCCAAAATTTGTGGCGGCGCAGAAAATGTTGTTAAAACAGAAACCCAGCAAACATTCGTAAATGGATTTCTCGGTTTTATTACTTTAGGCATTTATACTCCGCTGGAAGCGCGTGTGTATTGCTCACAATAATTGCATGAGTTGCCCATAGATATGGGCAGCTCTATCTGCACTGCTCATTAATATACTTCTGGGTTCCTTCCAGTTGTTTTTGCATAGTAATCAGCCTCTCTCTGAGGGTGAAATAATCCCGTTCAGCGGTGTCTGCCAGTCGGGGGGAGGCTGCATTATCCACGCCGGAGGCGGTGGTGGCTTCACGCACTGACTGACAGACTGCTTTGATGTGCAACCGACGACGACCAGCGGCAACATCATCACGCAGAGCATCATTTTCAGCTTTCGCATCAGCTAACTCCTTCGTGTATTTTGCATCGAGCGCAGCAACATCACGCTGACGCATCTGCATGTCAGTAATTGCCGCGTTCGCCAGCTTCAGTTCTCTGGCATTTTTGTCGCGCTGGGCTTTGTAGGTAATGGCGTTATCACGGTAATGATTAACAGCCCATGACAGGCCGACGATGATGCAGATAACCAGAGCGGAGATAATCGCGGTTACTCTGTTCATTGCTGACCCCACAAACAGATTTCACGCTCAATCTCACGACGAGTCATGAGACCTTTCCATTGCTTACCGCCAGCATATGTCCAGCGACGTAGCTGATCACATGCGCCTTTAATATCACCCTGGTTTATTTTGCGAAGAAGCGTCGATGTTCTGAAATTGCCAGCACCCACGTTGTAGACGAACGAGTAAAGAGCGCCGCGCGTTGTTTCCGGTATATCGACTTTGATGTACGGGTTAATTTGTCTGGCGACAGTGGCAAGGTCTTTATTCAGGAGGGCTTTGCACTCTGCTTCGGTATACGTTTTACCGGGAATGATGTCTTTTCCTGTATGCCCGTGACATACAGTCCATACACCAACTATGTCTTTGTAAGGATTATGTCTCACACCTTCCAGACCATCGTTACCACTTGGGCCAGTGATTAATACAGATGCTATAGCAATAGCCCCGCCACCAATAGCAGCAGCAACAGCTTTTCGTAATGATGGAGGCATTATCCACCTCTCGCAGCCTTGCGCTTATCTTCTTTAATCTTGAAATAAAGGTTTGTCAGGTACGTCAGCAGGCCAAATACCAGGCTACCCAGCACTCCAATTGCCGCCCACTGTGAGGGCGTGACTTTATCTAGCAGCTGTAAAAACCAGTACCCGGCACTACCTGCTGAGGTGCCATAGGCGACACCCGTTGTTAACTTATCCATGGATTTCATAACCCCACCTCGCAGACAAAGCGGGTGTAAATTGAGGGAATACTACGAAACGTAACAGACTCGGAGTCAGTGAATAACTCAGGTATTGGGTTATCAGCTAATATCGAGACTCAAAAAATGGAAAAACCCGCTCGACGGCGGGTTTAAGCTGTGTGACGAAGTAACCACTCTTAACAGCATAACCAATTTTTTACGTACGTAAACCACTAAATGATATTTGCGAGAATGCTACCGAGTATTGAAAACACCACTACAAATACATAAGCAAATCTCAACAAATAACCAACAAATAATTTCCAGTGTTATTTTTAGCTGGTTTAAATTGAACCTTCAAATTATAGAGCACTTATAAATAACAGCCGTTAATATAAATTGGCTAACAGATTTATTTTTATTCAGCCAAGAGCCATGAATAGGATTCGATAGAAAAAAGTTCAGATAAAAATAGAGATCTACTTCACAAATCAAACGAGAAACCAAAACTTACATCTTGAAATAATCACATTGATTAGATGAATATTTATCGCGCAGTGACATCATTTTTTAATAATAGTTCAAAAAAAGGGCTCACGATGAAAAAATTAACAGTGGCAATTTCTGCTGTAGCTGCATCAGTACTGATGGCGATGTCTGCTCAGGCAGCTGAAATTTATAATAAAGACAGTAACAAGCTGGATCTGTACGGGAAAGTTAATGCTAAGCACTACTTCTCCTCTAATGATGCAGATGATGGTGATACTACTTATGCCCGTCTTGGCTTCAAAGGTGAAACCCAAATCAACGATCAACTGACTGGTTTCGGTCAGTGGGAATATGAATTCAAAGGCAACCGCGCTGAATCTCAAGGTTCCTCCAAAGATAAAACCCGTCTTGCCTTCGCTGGCCTGAAATTCGGTGACTACGGCTCCATCGATTACGGCCGTAACTACGGTGTAGCATACGACATCGGTGCGTGGACTGACGTCCTGCCAGAATTCGGTGGTGACACTTGGACTCAAACCGACGTGTTCATGACTCAACGTGCAACTGGTGTTGCAACCTATCGTAACAACGACTTCTTTGGTCTGGTTGATGGTCTGAATTTTGCTGCTCAGTACCAAGGCAAAAACGATCGTAGCGATTTCGATAACTACACTGAAGGTAACGGTGATGGCTTCGGTTTCTCTGCTACCTATGAATACGAAGGATTCGGTATCGGTGCAACTTATGCGAAATCTGATCGTACCGACACTCAAGTTAATGCAGGGAAAGTTCTTCCTGAAGTATTTGCTTCCGGTAAAAATGCAGAAGTTTGGGCCGCAGGTCTGAAATATGACGCTAACAACATTTACCTGGCCACTACCTATTCTGAAACCCAGAATATGACTGTATTTGCTGATCACTTCGTTGCTAATAAAGCCCAAAACTTCGAAGCTGTTGCACAATATCAGTTCGATTTCGGTCTGCGTCCGTCCGTTGCTTACCTGCAATCTAAAGGTAAGGATCTTGGAGTATGGGGCGATCAGGACTTAGTCAAATATGTTGATGTAGGTGCAACCTATTACTTCAACAAAAATATGTCTACTTTCGTTGATTACAAAATCAACCTGCTTGACAAAAATGACTTCACTAAAGCACTCGGTGTAAGCACTGATGACATCGTTGCTGTAGGTCTGGTTTACCAGTTCTAATCTGATTACGAAAAAGATATGTTGCGGGAGGCGTTGCCTCCCCAACATATAAGTGGCTCCCTCAAGCCACTTCCTTTAGAAGCACAACCTTGCTTCTAACTATATAAACCTTCTGTTATATATTACCCTTTATTTTTGGGGGCGTCTCAACGCCCCATTTTTAATAATTTTTAGTAAACAATTGGCATATTAATTAGAGTTATTAACAACGATATCCATCTCTAACCGGATATCTAATGCCATTAACATCCCTTCAATTATACCCTCAGCCTTCTGTAACCTTTTCCCGATATAACCATCAGAGCAGCAATGCTTACCTGCCAGTGACATGAATGTCATACCGACTACATAATAATCTACTAATAAATCGTGCAAATCGCTGTTGTTCTTTTTCAGACGGGCCATGCACCCGCAAATGATCATCGCGTCATCGTCACAACATTGCGGGCGAGATTTTACTTTTGAAGTAATTAATCCCTTAAAACCGGCGGCAATGGACGACCAGGTCACATCTTCATGATTATTAGCCGCCCACGCTCCCCAACGCTCAAGAACCATCTGAATATCACGCATCAACTTACTCCACAAAAATCAGACCAGAACGCCAATTACAAGCAAAAATCAACAAAACAGTATTAGTTGATTGTTATCTCTGACTTCATACTCCTGCTCCTGTCAGGGTTTTGGCGTAATTCTTCAGTATTCGGTAATCGGTCAAAACAGAACCGGGGAAACGATATAAGCGCAGACGCCCCCAGCGGTGGCGAAGAAGTTCTGCCATATTAAACTCAAACATCATTCATTCCCCATTTCGGTGATGGTCAGTTCCAGCCTCCCACCTTTGGTAACAGGCATCTTCACAACGCGGTAATCAACGACCTGAGCATCATCCAGCCAGAAACCTGCTTTGGTGAGTGCGTCAAAAGAGGCTTTTTGCAGATTATCCAGGTCACGGCGACGGCGATCCGGCATGTGGCACTCAATGCGGATTTTCACAGGCATAGCCAGGCCGATATCCAGCATTGCGTTTTTAATGATTCGGGCGACGTTATCGCAGTATGCCTGCCCCTCTGCGCTAACGTGCGTGCGCCCGCGATTATGGCGGTAATAGCGATTATTGCTCGGAGGCCAGGGTAATGTGATGCTGTAGGTATTCACGCCTTAATAACCCCCTCTTTCAGCCAGATAACCTGTGTTCTCGCCATACCTTCCAGCGCGCATTCTTTTGCATATGCAGCATCGACTTGCTTGGCATGCCCTTTACCGGCAATCTTCTGTATGCGCTAAACCTAGATAGAATCCACTCTGTGCACATTGAAGCCCGCTCTATGCTTCCTTTCAGGTATTGAAGGGATTGAGATGGGCTAAGCATTATTGGCCTCCTGCATCAGGAGAAAGACAATCATGGCGGCGCGGAGAGGTCTGGCATCAAATATTGGGCTTACGCCTTTTGCATCCACACACCATTCAGTTAACTGGTCTAAGATAGAAATCCTGTATTTCTCAATAATCGGCCATGAAGCGCTCGGATCATTGCAGTAGTCAGGCAAATGATTTAATGGCTCAAAAGTTGTATCAGCATTTCCGTAATACCATTTGTTGGTGTTATTCCCTGATGTTTCCGGTTTACTTGCCCAAAGGCCTTTAAAAATTATGTCTCCTACCATTCTGTTAATTTCAAAATCACTTAACTGTGAATAATCCATTGTCATTTCCTCGCACGATATCTTAGCCACCGGATATCCCACAGGTGAGCTGTGTAATTGAAGGTTTTTACGTCAGATTCTTTTGGGATTGGCTTGCGTTTATTTCTGGAGCGTTTCGTTGGAAGGTATTTGCAGTTTTCGCAGATGATGTCGGTGAAACTTCGTCGCTGTCGCCTCATGCCGCCCTCCTGACGCCCTGCCCGATCGCCATCAATGCCGCTTTGGATACGGTAGTAAACATCCGTCGAGGACTGATGAACGGTCGCCAAATCAGCAGCATGGAGCCTTTGCTGTTTCCCTTCTTCTCCAGCCCTGTCGATGGTTCGATAAAATTAATCCGTCCATCAGTGATAATGCGAACTTCGTCGACACTCTCCAGAGCCTTGCTGAACCATCCGACTGACATATCCTCTGGCACAAGCATAACTACCGTCTGTCGCTGTTGTATGCACTGCTCAGCGGCTTTTTCCACCCACGGCCTGATATTGCTGTACGGTGGGTTATTCCAGATTGCACCGTGGCTTACCCACTCAGAATTGAGCGCGTCGTCGGCCTCAGTTAGCCAGTGAGCACACAGAGCATTTTTGTCGCTCGCTGCAGAATCCAGCCAGAATCCAAACTCAATATCCAGTGCATCAAAAAGCCAAAGCGGCGTTTGCCAGCAGTCCTTGTCGTGTGCTGGCGTATTTGATTTGATAGTCATGCAGCCCGATCTCCCCATCTCGCTTTCCACTCCAGAGCCAGTCTCGCTTCGTCTGACCACTTAACGCCACGCTCTGTACCGAATGCCTGTATAAGCTCTAATAGCTCCGCAAATTCGCCTACACGCATCCTGCTGGTTGACTGGCCTATTACCACAAAGCCATTCCCGGCAAGGTTAGGAACAACATCCTGCTGCTTTAATGCTGCGGTAAACACACACTTCCAGCTTTCTGCATCCAGCCAGCGACCATGCCATTCAACCTGACGAGAGACGTCACCTAAGCAGGCCCATAGCTTCCTGTTTTGGTCTAAGCTGCGGTTGCGTTCCTGAATGGTTACTACGATTGGTTTGGTTGGGTCTGGAAGGATTTGCTGTACTGCGTGAATAGCGTTTTGCTGATGTGCTGGAGATCGAATTTCAAAGGTTAGTTTTTTCATGACTTCCCTCTCCCCCAAATAAAAAGGCCTGCGATTACCAGCAGGCCTGTTATTAGCTCAGTGATGTAGATGGTCATCTTTTAACTCCATATACCGCCAATACCCGTTTCATCGCGGCACTCTGGCGACACTCCTTAAAAATCAGGTTCGTGCTCATCTTTCCTTCCCGTTCTTCCTTGGTAGCAAACCGGTAATACACCGTTCGCCAGACCTTACCTTCGATAACCAGAAGACCTGCCCGTGCCATTTTAGCCGCGGCCTGATTTATGCTGGTTACTGTTGCGCCTGTTAGCGCGGCAACGTCCGGCGCACAGAAGCTATTATGCGTCCCCAGGTAATGAATAATTGCCTCTTTGCCCGTCATACACTTGCTCCTTTCAGTCCGAACTTAGCTTTGATTTCTGCGATCTTCGCCAGAGCCTGTGCACGATTTAGAGGTCTACCGCCCATGACAGGAAGTTGTTTTACTGGTTCAGGGATCGCCTCACCACGGTTAATTCTCGCAGTCATATGGACAAGCTCATCTGCGGCCTTACGGCGTAATTCCGCATCAGTAAGCGCATTGGCCCGCATGTTCTGATACAGGTTGGTAACCAGCCAGTAGTGCGCGTTTGATTTCCACGGATAAGACTCCGCATCCGGATACAGGCCTCGCTTCCGGCAATACTCGTAAACCATATCAACCAGCTCGCTGACGTTTGGCAGTCCGGCGATAACGGATGCTTCTTCCCGGCACCATGCAACAAACTGCCCGGGTGATGGCAGAAATGGTCGATTCTGCCGACGGGCTACGCGCATTCCTGCGTTAACCTGTTCCATTGTGGTGATCCCGTTTTCCCGGAAAGCCAGCACCCACTGGCGGCGGATTTCGTTCAGTTCGTTCTGGTCCCGGTTAGCCAGGCTCGCCGGGAAAGTTGCCAGTAACTGGCTGAACACACCATTGATGATCTGCGCTACCTGTTGTACCTGCGGCTTTTCGTCGTACTGTTCCGGCATGTTGTTGGCGATCCGACGCATCTGCTCACGGTCAAAGTTAACCATCTGTGCGGCGATGTTTTTCATAAATCCACCCCGTAAATCCAGTCAGTGTTTGTCAGGTCGAGTTTTGGTTTGCTGGCTGTCACGCCTGCCTGTTGCTTGTTACGGTTGATTTCGAGCTGGGTCCACTTGTCGCGGAGTTTGGCCGGACTCAGCACGTTACCGGACCAGAAGTTGTCCTGGCAGGCCCAGCGGAAAAGCACACACATATCGCGGTGGTTACGTCCGTCACGTTCACGCATCAGGCGGATATCGTTAGCCCACCCAGCAAAATTCGGTTTTCTGGCTGATGGTGCGATAGTCTTCACCATGTCAAACATCCACTCTGCGGCGGTCAGGTCTTCTGCTGTCCCCCACTTGCTGCCGCTCTGAATTGCAGCATCCGGTTTAACCACAGAAAGATCGTTTTCTGGCTGGTCAGAGGATTCGCCAGAATTCTCTGACGAATAATCTTTTCTTTTTTCTTTTGTAATAGTGTCTTTTGTGTCCCCCTGTTTTGAGGGATAGCAATCCCCCAATTTGAGGGATGTTTTATCCCTCGTTTTAGGGGATTGTCCCTCGTTTTGAGGGATACACCATTCTGAGATGTTTTTATTTGGTCCAAACATGCCGCCTTGCTGCTTGATAATATTCATTCTGACGAGTTCTAACTTGGCTTCATTGCACCGTTTGACAGGTAACTTTGTAATCTCGCTAAGTTGAGAATCGGTGATTCTGTCCATTGGTTTATTCCACCCATAGGTTTTACGCAGAATGGCAAGCAGCACTTTAAACTGTCGCTTGGTCAGATCTGCGCCCGAATAAGCCTCAAGCAGCATATTTGATAGTCTGGCGTAACCATCATCGAGATCTGCCACATTACGCTCCTGTCCGGCAAAGTTACCTCTGCCGAAGTTGAGTATTTTTGCTGTATTTGTCATAATGACTCCTGTTGATAGATCCAGTAATGACCTCAGAACTCCATCTGGATTTGTTCAGAACGCTCGGTTGCCGCCGGGCGTTTTTTATTGGTGAGAATCGCAGCAACTTGTCGCGCCAATCGAGCCATGTCGTCGTCAACGACCCCCCATTCAAGAACAGCAAGCAGCATTGAGAACTTGGGAATCCAGTCTCTCTTCCACCTGCTGATCTGCGACTTATCAACTCCCACAGCTTCCGCTGTCTTCTCAGTTCCAAGCATTGCGATTTTGTTAAGCAACGCACTCTCGATTCGTAGTGCCTCGTTGCGTTTGTTTGCACGAACCATATGTAAGTATTTCCTTAGATAACAATTGATTGAATGTATGCAAATAAATGCATACACCATAGGTGTGGTTTAATTGGATGCCCTTTTTCAGGGCGGGGATGGGTAAGAGCGGGGTTATTTATGCTGTTGTTTTTTTGTTACTCGGGAAGGGCTTTACCTCTTCCGCATAAACGCTTCCATCAGCGTTTATAGTTAAAAAAATCTTTCGGCCTGCATGAATGGCCTTGTTGATCGCGCTTTGATATACGCCGAGATCTTTAGCTGTCTTGGTTTGCCCAAAGCGCATTGCATAATCTTTCAGGGTTATGCGTTGTTCCATACAACCTCCTTAGTACATGCAACTATTATCACCGCCAGAGGTAAAATAGTCAACACGCACGGTGTTAGATATTTATCCCTTGCGGTGATAGATTTAACGTATGAGCGCAAAAAAGAAACCATTAACACAAGAGCAGCTTGAGGACGCACGTCGCCTTAAAGCTATTTATGAAAAAAAGAAAAATGAACTTGGCTTATCCCAGGAATCTGTCGCAGACAAGATGGGGATGGGGCAGTCAGGCGTTGGTGCTTTATTTAATGGCATCAATGCATTAAATGCTTATAACGCCGCATTGCTTGCAAAAATTCTCAACGTTAGCGTTGAAGAATTTAGCCCTTCAATCGCCAGAGAAATCTACGAGATGTATGAAGCGGTTAGTATGCAGCCGTCACTTAGAAGTGAGTATGAGTACCCTGTTTTTTCTCATGTTCAGGCCGGGATGTTCTCGCCTGAGCTTAGAACCTTTACCAAAGGCGATGCGGAGAAATGGGTAAGCACAACCAAAAAAGCCAGTGGCTCTGCATTCTGGCTTGAGGTTGAAGGTAATTCCATGACCGCACCAACAGGATCCAAGCCCAGCTTTCCTGACGGGATGTTAATTCTGGTTGACCCTGAGCAGGCTGTTGAGCCAGGCGATTTCTGTATAGCCAGACTTGGTGGTGATGAGTTTACCTTCAAGAAACTGATCAGGGATAGCGGTCAGGTGTTTCTACAGCCACTAAACCCACAATACCCAATGATCCCATGCAATGAGAGTTGTTCCGTTGTGGGGAAAGTTATCGCTAGTCAGTGGCCTGAAGAGACGTTTGGGTGATAGGAAGTAAGTTTTATGTTGACGGCACAGTCAACTTGGCATAGATTAATTAAACCAAGCCCAGCCCCGTTCGCAGACAATTGTTAATATCTGCATAACGGCTCTGGGCTATTTTTTTGGGACTCTTATGAAGAAAGCAGCAATTTTAATTGATGCGGGTTTTTTCATGCAGCGTGTTCATGCTACGCATCGTAAACACTTCGCCGAGCATGAACTGACTGCGCAATGCATAATGAAAGTAATATGGTCAATGGTTCTTTCCCATCTTAATGGAAAACGTCAATCACAAGAACGTAGGGAACCGCTTGAGCTTTATAGAATTTACTTCTATGACTGTCCACCACTCGACATTCAAACACGCCTTCCACTTCCTGAGCCTGGCAATAAGACGCCTGGTCGCAAGAATTTCAAACTCGAAAAATCATATATTCTGAGAACGGAGCTGCATGAAGAGTTAAGAAAAACTCGAAAAACAGCCTTAAGATTAGGGAATCTTGTTGATAATAAGCGATGGCAACTAACTACATTCTCCCTTGATGCTCTGATGAAAGGAACGAAAAAATGGGATGAACTGACAAATGATGATTTTTACTATGACATCAAACAAAAACAAGTTGATATCAAGCTAGGGATGGATATCACGACTTTAGCTTATGAAAAACTTGTTGATGTAATTGTCCTTGTTGCTGGGGACTCAGACTTTGTGCCTGCCGCCAAACACGCCAGAATTAAAGGTATTGATTTTATTCTTGATCCACTAAGACAGAATGTTACCCCATCACTGTCAGAGCACATTGATGGAGTTCAGTCATACAGCTTGATATCAGGACTTGCCGATGCTTTACATGTTGAGCCAGACCCGGCACCTGACTGGTGGGAAGATCGAAAAAAAGGCAAGCCTAGGGGAAAAAACAATAGCGGTAAACGCAGGTATGGAAACACTCAAGCTGAGTCTGCAAAGAAACATCAAAGAAATAAACGATAATCCCATCAACCCGGCCACCGAGCCGGGTTTTCTTTGCCTCACGATCGCCCCACCTAAAAACACATAACCAATTGTATTTATTTTAAAATTAATAGGTGCAACTCACTAAACAACGCAATTCTGATCTCTCGATCACCTCCCAAGCCACACAACCCTGCAAAAAATAAATCTATATAAAAAACATACAGATAACCATCTGCGGTGATAAATTATCTCTGGCGGTGTTGACACAAATACCACTAGCGGTGATACTAAACACACCAGCAGGACGCACTAACCCACATGAAGGTGATGCTCTTAAAAATTAAGCCCTGAAGAAGGGCAGCATTCAAAGCAGAAGGCTTTGGGGTGTGTGATACGAAACGAAACATTGGCCGGAAGTGCGAATCCGGATTAGCTGCAAATGAGCCAATCGTGGGGTGTTTTCGTTCAGGACTACGACTCCCACACACCACCAAAGCTAACTGACAAGAGAATCCAGATGGATGCACAAACACGCCGCCGCGAACGTCGCGCAGAGAAACAGGCTCAATGGAAAGCAGCAAATCCCCTGTTGGTTGGGGTAAGCGCAAAACCAGTTAACCGCCCTATTCTCTCGCTGAATCGCAAACCGAAATCACGAGTAGAAAGCGCACTGAATCCGATAGACCTTACGGTGCTGGCTGAATACCACGAACAGATTGAAAGCAACCTGCAACGTATTGAGCGCAAGAATCAGCGCACATGGTACAGCAAGCCTGGCGAACGCGGCATAACATGCAGAGGACGCCAGAAAATTAAAGGTAAATCTATATCACTTATTTAGAAAATGCAGATTTAGGGAACAGATAGGAGGCGTTACACCTATGGCATCTCATCCTATGGTTAGAAGGTGGTGCAAATCCTTCGTATTGAAGTATGGATTTCACAGAAGATTCATAGCATTGAGCGCAAAGATAGTGCATTGGCTGACCGGTATTTGCCGATTTTTTGAGACGATAAACCACCGTAGCAACAGTAGGTGTATACATCTCATAGTTTTTCTTTTCCTCTTCCCACTTAGAGGCTCGATTTATCTTTTCTTCAAGCTCAATAATCTTGTCCTTAGAAATCATCAAAAGCTCATTAAGTGACATTTGCTGCTGTTGGGCATCCATGAGCTTATCGACAAGTTCGTATGTTTTTTCTTTTACTGAGTAGTCTATTTGCATTTTCTGGATTTCCTTTACTGCGCCAACAGCACTCATCAGAGCACCTCCGGCACCAGAAACTGCATCTGTAATCCTACTTATTATTCCTTTTTCATCAGACATATAAATCACTCTCTTACTGTAGGGGTAAGAGGATTTTACTATTTTTCTCGCTGTAGGGGTACACGAGAACCACCGAGCCTGATGTGGTTAAAAGACAGGCATACTAATAAACACTGCACTGTGTATTTATTCCAACGAGTGAATACACGGAGCAATGTCGCTCGTAACTAAACAGGAGCCGACTTGTTCTGATTATTGGAAATCTTCTTTGCCCTCCAGTGTGAGGGCGATTTTTTATCTATGAGGATATGAATAGGTGTCAAACATCAAAAAATACATCATTGATTACGACTGGAAAGCATCAATAGAAATTGAAATCGACCATGACGTAATGACAGAGGAAAAACTTCACCAGATTAATAATTTCTGGTCAGACTCTGAATACCGACTCAATAAACACGGCTCTGTATTAAATGCTGTATTAATCATGCTGGCGCAACATGCTCTGCTTATAGCAATTTCAAGCGACTTAAATGCATATGGTGTTGTGTGTGAGTTCGACTGGAATGATGGAAATGGTCAGGAAGGATGGCCTCCAATGGATGGTAGCGAAGGAATAAGAATTACCGATATCGATACATCAGGAATATTTGATTCAGATGATATGGCTATCAAGGCCGCCTGAGTGCGGCTTTACCGCATACCAATAACGCTTCACTCGAGGCGTTTTTCGTTATGTATAAATAAGGAGCACACCATGCAATATGCCATTGCAGGGTGGCCTGTTGCTGGCTGCCCTTCCGAATCTTTACTTGAACGAATCACCCGTAAATTACGTGACGGATGGAAACGCCTTATCGACATACTTAATCAGCCAGGAGTACCCAAAAATGGATAAAAAACTTATGGCTATCCAGACAAAATTCACTATTGCCACTTTTATTGGCGATGAAAAGATGTTTCGTGAGGCCGTCGACGCTTATAAAAAATGGATATTAATACTGAAACTGAGATCAAGCAAAAGCATTCACTAACCCCCTTTCCTGTTTTCCTAATCAGCCCGGCATTTCGCGGGCGATATTTTCACAGCTATTTCAGGAGTTCGGCCATGAACGCTTATTACATTCAGGATCGTCTTGAGGCTCAGAGCTGGGCGCGTCACTACCAGCAGATAGCCCGTGAAGAGAAAGAGGCAGAACTGGCAGACGACATGGAAAAAGGCCTGCCCCAGCACCTGTTTGAATCGCTATGCATCGATCATTTGCAACGCCACGGGGCCAGCAAAAAAGCCATTACCCGTGCGTTTGATGACGATGTTGAGTTTCAGGAGCGCATGGCAGAACACATCCGGTACATGGTTGAAACCATTGCTCACCATCAGGTTGATATTGATTCAGAGGTATAAAACGGATGAGTACAGCACTCGCAACGCTGGCAGGGAAGCTGGCTGAACGTGTCGGCATGGATTCTGTCGACCCACAGGAACTGATCACCACTCTTCGCCAGACGGCATTTAAAGGCGATGCCAGCGATGCGCAGTTCATCGCATTGTTGATCGTCGCCAACCAGTACGGCCTTAATCCGTGGACGAAAGAAATTTACGCCTTCCCTGACAAGCAGAACGGCATCGTTCCGGTGGTGGGCGTTGATGGCTGGTCCCGTATCATCAATGAAAACCAGCAGTTTGATGGCATGGACTTTGAGCAGGACAATGAATCATGTACATGCCGGATTTACCGCAAGGACCGTAATCATCCGATCTGCGTTACCGAGTGGATGGATGAATGCCGCCGCGAACCATTCAAAACCCGCGAAGGCAGAGAAATCACCGGACCGTGGCAGTCGCATCCCAAACGGATGTTACGGCATAAAGCCATGATTCAGTGTGCCCGTCTGGCCTTCGGATTTGCTGGTATCTATGACAAGGATGAAGCCGAGCGCATTGTCGAAAATACCGCATACACTGCAGAACGTCAGCCGGAACGCGACATCACTCCGGTTAACGATGAAACCATGCAGGAGATTAACACTATGCTGATTGCCCTGGACAAAACATGGGATGACGACTTATTGCCGCTCTGTTCCCAGATATTTCGCCGCGACATTCGCGCATCGTCAGAACTGACACAGGCCGAAGCAATGAAAGCTCTTGGATTCCTGAAACAGAAAGCCTCTGAACAGAAGGTGGCTGCATGACACCGGACATTATCCTGCAGCGTACCGGGATCGACGTGAGAGCTGTCGAACAGGGGGATGATGCATGGCACAAATTACGGCTCGGCGTCATCACCGCTTCAGAAGTTCACAACGTGATAGCAAAGCCCCGCTCAGGAAAGAAGTGGCCTGACATGAAAATGTCCTACTTCCACACCCTGCTGGCTGAGGTTTGCACCGGTGTGGCTCCGGAAGTTAACGCTAAGGCGCTGGCCTGGGGAAAACAGTACGAGAACGACGCCAGAACCCTCTTTGAGTTCACTTCCGGCGTTAATGTTACTGAATCCCCGATCATCTATCGCGACGAAAGTATGCGCACCGCCTGCTCTCCCGATGGTTTATGCAGTGACGGCAACGGCCTTGAACTGAAATGCCCGTTTACCTCCCGGGATTTCATGAAGTTCCGGCTCGGTGGTTTCGAGGCCATAAAATCGGCTTACATGGCCCAGGTGCAGTACAGCATGTGGGTGACGCGAAAAGATGCCTGGTACTTTGCCAACTATGACCCACGAATGAAGCGTGAAGGCCTGCATTATGTCGTGGTTGAGCGGGATGAAAATTACATGGCGAGTTTTGACGAGATGGTGCCGGAGTTCATCGAAAAAATGGACGAGGCACTGGCTGAAATTGGTTTTGTATTTGGGGAGCAATGGCGATGACGCATCCTCACGATAATATCCGGGTAGGCGCGATCACTTTCGTCTACTCCATTACAAAGCGAGGCTGGGTATTTCCCGGCCTTTCTGTTATCAGAAATCCCCTGAAAGCACAGCGGCTGGCTGAGGAGATAAATAATAAACGGGGAGCTGTATGCACAAAGCATCTCCCGTTGAGTTAAGAACGAGTATCGAGATGGCACATAGCCTCGCTCAAATTGGAGTCAGGTTTGTGCCAATACCAGTAGAAACAGACGAAGAATTTCATACGTTAGCCGCATCCCTTTCACAAAAGCTGGAAATGATGGTGGCGAAAGCAGAAGCAGATGAGAGAGACCAGGTATGACAACCACTGAATGCATTTTTCTGGCAGCGGGCTTCATATTCTGTGTGCTTATGCTTGCCGACATGGGGCTTGTTCAATGACACCTCAGCAAGAAAACGCCCTTCGCAGCATTGCCCGTCAGGCTAATTCTGAAATCAAAAAAGCCAGACAGCAGTTTCCGGATAAAAACGTCGATGACATTTGCCGTAGCGTACTAAAGAAGCACCGCGAAACGGTAACGCTGATGGGATTCACACCGACTCATTTAAGCCTGGCGATCGGCATGTTGAACGGCGTCTTTAAGGAACGGTGAACATGAAAAGCAAAATCATCAGGGAGCTACAGGCTCCTTTTTTATTATTCGCATTCACCCTCAAGCGTATTAACCAACAATTCAGGGATTAATGAAAGATGGCAGACATCATTGATTCAGCATCAGAAATTGAAGAATTACAGCGCAACACAGCAATAAAAATGCGCCGCCTGAACCACCAGGCTATATCTGCCACTCATTGTTGTGAGTGTGGCGATCCGATAGATGAACGAAGACGCTTGGCCGTTCAGGGTTGTCGGACTTGTGCAAGTTGCCAGGAGGATCTGGAACTTATCAGTAAACAGAGAGGTTCGAAGTGAGCGAAATTAACTCTCAGGCACTGCGTGAGGCGGCAGTAGCAATTGAAACAGTAGCAACACCTCAAAAATTGCTGGCATTTCGTATGAAAGTCACACCTCAGGTTGTGCTGGCTCTACTGGATGAACGAGATGCATTAAATGAACGCCTAGCCGAACTGGAGGCTGATTTAGCAGGGCTGGCCGAAGACCACCAGAAAGCGACTGAGTCAATTAAGCAGGCTGATGCAGCTGTTAAGTTGGCACACGAGAAGTTTTCGGCGCTGGCGGCGGAGAATGAGCTGGCTCGTAAAGCAGTTCAGGAATTCTGCGATGTTGTTGGCGACAGCACCGAGGTTATCTGCGAGGAGATTGGGAGAGATGGTGTTCTGGTTATTTTGGAGGCCATGAAGGCAACAGGAAATATGCCAGCCACCGATGCTTTCCTGGCTGAAGTACGGGCGCAGGGGGTGGAGATGATGCGCGAACATCCATCAATCAAACTTTGCTCTTTGACGCACATATGTGATGAGTTAGCCGCCCAGCTTCGCAAAGGAGGCAACCAGTGACTGGACATGCAGCAATCCTCGACATGTGCTGTGGCAGTCGCATGTTCTGGTTAGATAAGAATGACGAACGGGCGAGATAAGCGATCGGTTAAGTGCTATAGTAATGCGCTTTTGTATTTATGGAGTGAATATGAAAAATATCCTACTGGCATCATTGTTAGTGGCATCGCCGGGTGCATTTGCAGCCAGCTTTGACTGCCAAAAGGCTTCGACAGCAATCGAACATAAAATCTGCGATAACGAACGTCTGTCAAAATTAGACGAACAGCTTAGCTCTGCCTATTCTAGTGCCCTCAAAGGAAACCCAGAGAACGCAGACACCCTAAAAATGGTTCAACGTCAGTGGGTAAATATGCGTGGAAAACTCACTGATAATAAGGCTCTGGAGCTGGCTTATCTTATCCAAATTAATGGCCTCAAAGGTTTGGGGAGTTCAGTCAGCGTAACAGCGGCCAATGACATACCCACGTCGGCGCAGAAACATTCTAAAGAGCAGGAAGATACAAGTAAGGCAGAAGCTAAGTCGGTCAAGAACGGCAATAAGCTAACCTTAGAGTCATTCCGAGCTAAATATGTAGAAGTAGATGGTGAGTATTACAGCACGACATCCATTCCTAGAGGCAGTTCGTTCTTGTTCACTTGCGCTAGTCGTATTGCTGATGACCAAGTGAATATTTGGAAGAAACAGGCAGCCAAAGAGGGCAAAATCGACCTATTCTTTGAGGTTGAGAATCACTTACACACGGCTATGTTGAACGCCAATTTTCAGAAGTTGAATTCAGACCCTGCCAAAAGAGGTATTTGTAATCTGATTAACGCAGTGCCGTAAGTAAATTTAGGGCCACAGTTGTGGCCTTAAATATTTTTTTCAGCCTTTTCTTATTTGTAATAAGCAGTACTTGGTAGTGCTTATAAAACAGAATAAAAAATATATGACTTTGGCGATTACCCAGTAAAGATATTCGAAATAAATGTAAATATCGACAATGAATAACTATCCTCGCACTCGCGGGGATTTCTTTTATCTGAACTCGCTACGGCGAGTTTTTTTTATGGAGATGATAAATGCACTTCCGAGTCACAGGTGAATGGAATGGAGAACCATTCAACAGAGTTATCGAAGCCGAGAACATCAGCGACTGCTATGACCACTGGATGCTGTGGGCGCAGATAGCACATGCAGACGTAACCAATATTCGAATTGAAGAACTGAAAGAACACCAAGCCGCCTGATGGCGGTTTTTTCTTGCGTGTAATTGCGGAGACTTTGCGATGTACTTGACACTTCAGGAGTGGAACGCACGCCAGCGACGCCCAAGAAGCCTTGAAACAGTTCGTCGATGGGTACGCGAGTGCAGGATATTCCCTCCTCCGGTTAAGGATGGAAGAGAGTATCTGTTCCACGAATCAGCGGTAAAGGTTGACTTAAATCGACCAGTAACAGGTAGCCTTTTGAAGAGGATCAGAAATGGGAAGAAGGCGAAGTCATGAGCGCCGGGATTTACCCCCTAACCTTTATATAAGAAACAATGGATATTACTGCTACAGGGACCCAAGGACGGGTAAAGAGTTTGGATTAGGCCGAGACAGGCGAATCGCAATCACTGAAGCTATACAGGCCAACATTGAGTTATTTTCAGGACACAAACACAAGCCTCTGACAGCGAGAATCAACAGTGATAATTCCGTTACGTTACATTCATGGCTTGATCGCTACGAAAAAATCCTGGCCAGCAGAGGAATCAAGCAGAAGACACTCATAAATTACATGAGCAAAATTAAAGCAATAAGGAGGGGTCTGCCTGATGCTCCACTTGAAGACATCACCACAAAAGAAATTGCGGCAATGCTCAATGGATACATAGACGAGGGCAAGGCGGCATCAGCCAAGTTAATCAGATCAACACTGAGCGATGCATTCCGAGAGGCTATAGCTGAAGGCCATATAACAACAAACCCGGTCGCAGCCACTCGCGCTGCAAAATCAGAGGTAAGGAGATCAAGACTTACGGCTGACGAATACCTGAAAATTTATCAAGCAGCAGAATCATCACCATGTTGGCTTAGACTTGCAATGGAACTGGCTGTTGTTACCGGGCAGCGAGTTGGTGATTTATGCGAAATGAAGTGGTCTGATATCGTAGATGGATATCTTTATGTCGAGCAAAGCAAAACAGGCGTAAAAATTGCCATCCCAACAACATTGCATGTTGATGCTCTCGGGATATCAATGAAGGAAACACTTGATAAATGCAAAAAGATTCTTGGCGGAGAAACCATAATTGCATCTACTCGTCGTGAACCGCTTTCATCCGGCACAGTATCAAGGTATTTTATGCGCGCACGAAAAGCATCAGGTCTTTCCTTCGAAGGGGATCCGCCTACCTTTCACGAGTTGCGCAGTTTGTCTGCAAGACTCTATGAGAAGCAGATAAGCGATAAGTTTGCTCAACATCTTCTCGGGCATAAGTCGGACACCATGGCATCACAGTATCGTGATGACAGAGGCAGGGAGTGGGACAAAATTGAAATCAAATAA
Protein sequences of DBSCAN-SWA_8 >CP029164|4030203:4084519|4043344_4043779_-|AWH71552.1|tail|DBSCAN-SWA MFDGELSFALKLAREMGRPDWRAMLAGMSSTEYADWHRFYSTHYFHDVLLDMHFSGLTYTVLSLFFSDPEMHPLDFSLLNRREADEEPEDDVLMQKAAGLAGGVRFGPDGNEVIPASPDVADMTEDDVMLMTVSEGIAGGVRYG >CP029164|4030203:4084519|4055372_4055558_-|AWH71570.1|DBSCAN-SWA MLKTFRVFARAVNPIGHTIGIAQNVKAVNVQTAIAAVRSESSEYGLSQVIISAVYELKEVH >CP029164|4030203:4084519|4078595_4079012_+|AWH71605.1|DBSCAN-SWA MDINTETEIKQKHSLTPFPVFLISPAFRGRYFHSYFRSSAMNAYYIQDRLEAQSWARHYQQIAREEKEAELADDMEKGLPQHLFESLCIDHLQRHGASKKAITRAFDDDVEFQERMAEHIRYMVETIAHHQVDIDSEV >CP029164|4030203:4084519|4082110_4082809_+|AWH71613.1|DBSCAN-SWA MKNILLASLLVASPGAFAASFDCQKASTAIEHKICDNERLSKLDEQLSSAYSSALKGNPENADTLKMVQRQWVNMRGKLTDNKALELAYLIQINGLKGLGSSVSVTAANDIPTSAQKHSKEQEDTSKAEAKSVKNGNKLTLESFRAKYVEVDGEYYSTTSIPRGSSFLFTCASRIADDQVNIWKKQAAKEGKIDLFFEVENHLHTAMLNANFQKLNSDPAKRGICNLINAVP >CP029164|4030203:4084519|4074375_4074576_-|AWH71600.1|DBSCAN-SWA MEQRITLKDYAMRFGQTKTAKDLGVYQSAINKAIHAGRKIFLTINADGSVYAEEVKPFPSNKKTTA >CP029164|4030203:4084519|4055550_4056162_-|AWH71571.1|DBSCAN-SWA MPICSTLAKASSEDTRGFSPLPSAWRRAISPRMAVTINPALLSPSSLTDSIPCITSSGTLTVVSCDFAFLLAVAISETPNHRCVSVYTKKEIQKALTCVSCAHNMKHTERIVEIQRATPRSAGNTYGASNQQRKLGAVMVAVNHIPHLVHTQTAFVWRFLALSAGESQIIHVTAWTEREARSRCPSGCVAVFAAKIRQGVSHA >CP029164|4030203:4084519|4059470_4060790_-|AWH71576.1|DBSCAN-SWA MTAELRNLPHIASMAFNEPLMLEPAYARVFFCALAGQLGISRLTDAVSGDSLTAQEALATLALSGDDDGPRQARSYQVMNGIAVLPVSGTLVSRTRALQPYSGMTGYNGIIARLQQAASDPMVDGILLDMDTPGGMVAGAFDCADIIARVRDIKPVWALANDMNCSAGQLLASAASRRLVTQTARTGSIGVMMAHSNYGAALEKQGVEITLIYSGSHKVDGNPYSHLPDDVRETLQSRMDATRQMFAQKVSAYTGLSVQAVLDTEAAVYSGQEAIDAGLADELVNSTDAITVMRDALDARKSRLSGGRMTKETQSTTVSATASQADVTGVVQATEGENASAAQPDVNAQITAAVAAENSRIMGILNCEEAHGREEQARVLAETPGMTVETARRILAAAPQSAQARSDTALDRLMQGAPAPLAAGNPASDAVNDLLNTPV >CP029164|4030203:4084519|4079799_4080480_+|AWH71607.1|DBSCAN-SWA MTPDIILQRTGIDVRAVEQGDDAWHKLRLGVITASEVHNVIAKPRSGKKWPDMKMSYFHTLLAEVCTGVAPEVNAKALAWGKQYENDARTLFEFTSGVNVTESPIIYRDESMRTACSPDGLCSDGNGLELKCPFTSRDFMKFRLGGFEAIKSAYMAQVQYSMWVTRKDAWYFANYDPRMKREGLHYVVVERDENYMASFDEMVPEFIEKMDEALAEIGFVFGEQWR >CP029164|4030203:4084519|4054642_4054942_-|AWH71567.1|DBSCAN-SWA MEMKNSGFIASGPARPEFMNGDIYRDKYGGTVTIKGVAERRITYRREGYSYDCVMPVYQFRRDFSLVYAAPRSKPISREKARGNIQKMKSMINAFRGKK >CP029164|4030203:4084519|4065961_4066255_+|AWH71583.1|DBSCAN-SWA MKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHHFFVSGIGQEKTVDAAKICGGAENVVKTETQQTFVNGFLGFITLGIYTPLEARVYCSQ >CP029164|4030203:4084519|4060770_4062372_-|AWH71577.1|portal|DBSCAN-SWA MKTSTIPTLLGPDGMTSLREYAGYHGGGSGFGGQLRAWNPPGESVDAALLPNFTRGNARADDLVRNNGYAANAIQLHQDHIVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQATWDTSSSRLFRTQFRMVSPKRISNPNNTGDSRNCRAGVQLNDSGAALGYYVSEDGYPGWMPQKWTWIPRELPGGRASFIHVFEPVEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGANSKEQRDKLTGWIGEIAAYYAAAPVRLGGAKVPHLMPGDSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQMSYSTARASANESWAYFMGRRKFVASRQASQMFLCWLEEAIARRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAIDGLKEVQEAAMLIEAGLSTYEKECAKRGDDYQEIFAQQVRETMERRAAGLKPPAWAAAAFESGLRQSTEEEKSDSRAA >CP029164|4030203:4084519|4057429_4058665_-|AWH71574.1|DBSCAN-SWA MAGENKLSDKALKGYLGKPREKQITIADGKGLSIRVSTKGAVSFVFFYRLAGGRAAPVWLTLGKYPDMSLKQAREKRDECRGWLADKRDPRIQIKIQAEERLKPVTVEDALNYWYENYCKVRRKTHAVTLGRFRKHIFPYIGHLPVNDTHLYEWLDCFDRIKRNAPVMAAYVFSDTKLALRFCRVRQYATCDALKDLRMSDVGQIAGKRDRVLDEAELGQLWKAIFVEPDLKLMSEYTRKMFVLCTVFGCRMSEARLSEWSEWDLESWVWTVPKDHSKTGVEIVRPVPEILRQWVTDVHEETKHTGYVLGSLRIRESVSKIGGKIGKRLGHEKQWSLHDLRRTLSTHLSDLGVEFYVVEQLLGHALPGVAGVYNRSKFMAKKLDALELWTTYLNSIAGADSKVTILKQKVG >CP029164|4030203:4084519|4050642_4051683_-|AWH71563.1|capsid|DBSCAN-SWA MVDLYSPTQLVQVVNAVDVQKQLNALFTSLFFTRSVMFESRDIILDTIDDPNIPIAAFCSPMVGSKVSRDEGYESKTIRPGYMKPKSSIDPNKLAVRPAGVSPEQYNAFGARNIKVKQAIVNQAKAIRARIEWLAVQAITTGKNIIEGDGIERYELDWNIKPQNIITQSGGAEWSGKDKETFDPNDDIESYAEFSEGVTNIIIMGGNVWKKYRSFRAIKEALDTRRGSNSELETALKDLGDSVSFKGYMGDVAIVVYSGRYTDEDGTEKHFLDPDLMVLGNTALQGIVAYGGIQDPELIRMGLTKAELAPKNYIVPGDPAIEYVQTHSAPQPIPARINRFVTVRIG >CP029164|4030203:4084519|4040790_4043352_-|AWH71551.1|tail|DBSCAN-SWA MAEPVGDLVVDLSLDAARFDEQMARVRRHFSGTESDAKKTAAVVEQSMNRQALAAQKAGISVGQYKAAMRMLPAQFTDVATQLAGGQSPWLILLQQGGQVKDSFGGMIPMFRGLAGAINLPMVGATSLAVATGALAYAWYQGNSTLSDFNKTLVLSGNQSGLTADRMLVLSRAGQVAGLTFNQTSESLSALVKAGVSGEAQIASISQSVARFSSASGVEVDKVAEAFGKLTTDPTSGLTAMARQFHNVTAEQIAYVAQLQRSGDEAGALQAANEAATKGFDDQTRRLKENMGTLETWADRTARAFKSMWDAVLDIGRPDTAQEMLIKAEAAFKKADDIWSLRKDDYFVNDEARARYWDDREKARLALEAARKKAEQQSQQDTNAQQQSDTEASRLKYTEEAQKAYERLQTPLEKYTARQEELNKALKDGKILQADYNTLMAAAKKDYEATLKKPKQSGVKVSAGDRQEDSAHAALLTLQAELRTLERHAGANEKISQQRRDLWKAESQFAVLEEAAQRRQLSAQEKSLLAHKDETLEYKRQLAALGDKVTYQERLNALAQQADKFAQQQRAKRAAIDAKSRGLTDRQAEREATEQRLKEQYGDNPLALNNVMSEQKKTWAAEDQLRGSWMAGLKSGWSEWEESATDSMSQVKSAATQTFDGIAQNMAAMLTGSEQNWRSFTRSVLSMMTEILLKQAMVGIVGSIGSAIGGAVGGGASASGGTAIQAAAAKFHFATGGFTGTGGKYEPAGIVHRGEFVFTKEATSRIGVGNLYRLMRGYAEGGYVGGAGSPAQMRRAEGINFNQNNHVVIQNDGTNGLPGPQMMKAVYDMARKGARDEIQTQMRDGGLFSGGGR >CP029164|4030203:4084519|4072046_4072337_-|AWH71596.1|DBSCAN-SWA MTGKEAIIHYLGTHNSFCAPDVAALTGATVTSINQAAAKMARAGLLVIEGKVWRTVYYRFATKEEREGKMSTNLIFKECRQSAAMKRVLAVYGVKR >CP029164|4030203:4084519|4071008_4071536_-|AWH71594.1|DBSCAN-SWA MTIKSNTPAHDKDCWQTPLWLFDALDIEFGFWLDSAASDKNALCAHWLTEADDALNSEWVSHGAIWNNPPYSNIRPWVEKAAEQCIQQRQTVVMLVPEDMSVGWFSKALESVDEVRIITDGRINFIEPSTGLEKKGNSKGSMLLIWRPFISPRRMFTTVSKAALMAIGQGVRRAA >CP029164|4030203:4084519|4048843_4049392_+|AWH71559.1|DBSCAN-SWA MEDMGYTSIESMFNNYKCYYDFLLTHNEISFANDYKSQFSKVMLLACASYFETLVVTKIHCMLNPSQCNLTHDFIDNKALTRQYHTLFDWKKRNANQFFSFFGPKFKEFMIEKVKSSTELTKSISDFMEIGELRNKLAHNNYATFVLESTAEEIYNKFLNAHSFVSQLDTFSTQFREQIGEQ >CP029164|4030203:4084519|4069316_4069700_-|AWH71588.1|DBSCAN-SWA MRDIQMVLERWGAWAANNHEDVTWSSIAAGFKGLITSKVKSRPQCCDDDAMIICGCMARLKKNNSDLHDLLVDYYVVGMTFMSLAGKHCCSDGYIGKRLQKAEGIIEGMLMALDIRLEMDIVVNNSN >CP029164|4030203:4084519|4065156_4065258_-|AWH71581.1|DBSCAN-SWA MIIIITLRVLSGDPTGYGAATSRVFAIYENFPV >CP029164|4030203:4084519|4083045_4083213_+|AWH71614.1|DBSCAN-SWA MHFRVTGEWNGEPFNRVIEAENISDCYDHWMLWAQIAHADVTNIRIEELKEHQAA >CP029164|4030203:4084519|4052894_4054646_-|AWH71566.1|DBSCAN-SWA MKLAPNLKKQPRDRLTEIIIFAGSDAWSHAKEWQEWAGKHIAADDVPPVVLADEQLKNITDYRIIDEDRQCVRVYRAGHITEHSMTQIVTLLAVAGVKTVHEYAGITDTSPVDLSDQLPRLKEECERGESLVLNLPTKQKAQLSQMADSERAQLLAERFDGVCVHPESEIVHVWRGGVWCPISTMELSREMVAIYSEHRATFSKRVINNAVEALKVIAEPMGEPSGDLLPFANGALDLKTGEFSPHTPENWITTHNGIEYTPPAPGENIRDNAPNFHKWLEHAAGKDPRKMMRICAALYMIMANRYDWQMFIEATGDGGSGKSTFTHIATLLAGKQNTVSAEMTSLDDAGGRAQVVGSRLIVLADQPKYTGEGTGIKKITGGDPVEINPKYEKRFTTIIRAVVLATNNDPMIFTERAGGVSRRRVIFRFDNIVREDEKDKELPEKIAAEIPVIIRRLLANFADPEKARALLLEQRDGDEALAIKQQTDPVVELCAALEFLEEARGLMMGGGGDTVKYTTRNSLYRVYMAFMAYTGKGKCLSVNEFGKAMRSAAKVYGYEYITRKVKGVTQTNATTTDDCDAFL >CP029164|4030203:4084519|4045928_4046282_-|AWH71556.1|tail|DBSCAN-SWA MADFDNLFDAAIARADETIRGYMGTSATMTSGEQSGAVIRGVFDDPENISYAGQGVRVEGSSPSLFVRTDDVRQLRRGDTLTIGEENFWIDRISPDDGGSCHLWLGRGVPPAVNRRR >CP029164|4030203:4084519|4044946_4045342_-|AWH71554.1|tail|DBSCAN-SWA MKHTELRAAVLDALEKHDTGATFFDGRPAVFDEEDFPAIAVYLTGAEYTGEELDSDTWQAELHIEVFLPAQVPDSELDAWMESRIYPVMSDIPALSDLITSMVASGYDYRRDDDAGLWSSADLTYVITYEM >CP029164|4030203:4084519|4049896_4050280_-|AWH71561.1|DBSCAN-SWA MQNDYNDLKPIAEMMYPNPAVEELKAIADKMCLSERLVDMNQVMEITTLSRRTLLNLEASGEFPERVQVTEGRKAWYLSEVIDWINNIPRASEYCRVPVPKKPDAALCLKIERVRRNARDGRYKLIG >CP029164|4030203:4084519|4039766_4040465_-|AWH71549.1|tail|DBSCAN-SWA MQDIRQETLNECTRAEQSASVVLWEIDLTEVGGERYFFCNEQNEKGEPVTWQGRQYQPYPIQGSGFELNGKGTSTRPTLTVSNLYGMVTGMAEDMQSLVGGTVVRRKVYARFLDAVNFVNGNSDADPEQEVISRWRIEQCSELSAVSASFVLSTPTETDGAVFPGRIMLANTCTWTYRGDECGYHGPAVADEYDQPTSDITKDKCSKCLSGCKFRNNVGNFGGFLSINKLSQ >CP029164|4030203:4084519|4065405_4065600_+|AWH71582.1|DBSCAN-SWA MSTKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRLTSEKRAYTSIQSDDE >CP029164|4030203:4084519|4080476_4080659_+|AWH71608.1|DBSCAN-SWA MTHPHDNIRVGAITFVYSITKRGWVFPGLSVIRNPLKAQRLAEEINNKRGAVCTKHLPLS >CP029164|4030203:4084519|4080631_4080823_+|AWH71609.1|DBSCAN-SWA MHKASPVELRTSIEMAHSLAQIGVRFVPIPVETDEEFHTLAASLSQKLEMMVAKAEADERDQV >CP029164|4030203:4084519|4083252_4083471_+|AWH71615.1|DBSCAN-SWA MYLTLQEWNARQRRPRSLETVRRWVRECRIFPPPVKDGREYLFHESAVKVDLNRPVTGSLLKRIRNGKKAKS >CP029164|4030203:4084519|4034920_4038418_-|AWH71546.1|DBSCAN-SWA MGKGSSKGHTPREAKDNLKSTQLLSVIDAISEGPVEGPVDGLKSVLLNSTPVLDSEGNTNISGVTVVFRAGEQEQSPPEGFESSGSETVLGTEVKYDTPITRTITSANIDRLRFTFGVQALVETTSKGDRNPSEVRLLVQIQRNGGWVTEKDITIKGKTTSQYLASVVVDNLPPRPFSIRMRRMTPDSTTDQLQNKTLWSSYTEIIDVKQCYPNTALVGVQVDSEQFGSQQVSRNYHLRGRILQVPSNYNPQTRQYSGIWDGTFKPAYSNNMAWCLWDMLTHPRYGMGKRLGAADVDKWALYVIGQYCDQSVPDGFGGTEPRITCNAYLTTQRKAWDVLSDFCSAMRCMPVWNGQTLTFVQDRPSDKVWTYNRSNVVMPDDGAPFRYSFSALKDRHNAVEVNWIDPDNGWETATELVEDTQAIARYGRNVTKMDAFGCTSRGQAHRAGLWLIKTELLETQTVDFSVGAEGLRHVPGDVIEICDDDYAGISTGGRVLAVNSQTRTLTLDREITLPSSGTTLISLVDGSGNPVSVEVQSVTDGVKVKVSRVPDGVAGYSVWGLKLPTLRQRLFRCVSIRENDDGTYAITAVQHVPEKEAIVDNGAHFDGDLSGTVNGVTPPAVQHLTAEVSADSGEYQVLARWDTPKVVKGVSFMLRLTVTADDGSERLVSTARTTETTYRFRQLALGNYRLTVRAVNAWGQQGDPASVLFRIAAPATPSRIELTPGYFQITATPHLAVYDPTVQFEFWFSEKRITDIRQVETTARYLGTALYWIAASINIKPGHDYYFYVRSVNTVGKSAFVEAVGRASDDAEGYLDFFKGEIGKTHLAQELWTQIDNGQLAPDLAEIRTSITNVSNEITQTVNKKLEDQSAAIQQIQKVQVDTNNNLNSMWAVKLQQMQDGRLYIAGIGAGIENTPDGMQSQVLLAADRIAMVNPANGNTKPMFVGQGDQIFMNDVFLKRLTAPTITSGGNPPAFSLTPDGKLTAKNADISGSVNANAGTLNNVTVNENCTIKGMLEATQVRGDFVKAVSKSFPKQAGTWGNTETPNGTVTVTISDDHNFDRQIIIPPIIFNGIAYSDPGSGNNPGGTRYTGYGFEVRKNGVLIASRETKGAIPGSYSAVIDMPSGRGSVTLEFKVFHKGNQWAGNITDCTVIVTKKAASGISIR >CP029164|4030203:4084519|4069785_4069926_-|AWH71589.1|DBSCAN-SWA MMFEFNMAELLRHRWGRLRLYRFPGSVLTDYRILKNYAKTLTGAGV >CP029164|4030203:4084519|4057200_4057428_-|AWH71573.1|DBSCAN-SWA MKKMAIVDKKGLEYIPNIDRMIREKECRELTTLANSTRWKLEKEGKFPKRIKIGSTAVAYRLSEVQAWIRGEWVV >CP029164|4030203:4084519|4040464_4040794_-|AWH71550.1|tail|DBSCAN-SWA MKTFRWKVKPGMDVASAPSVRKVRFGDGYSQRAPAGLNANLKTYSVTLSVPREEATVLESFLEEHGGWKAFLWTPPYEWRQIKVTCAKWSSRVSMLRVEFSAEFEQVVN >CP029164|4030203:4084519|4073963_4074257_-|AWH71599.1|DBSCAN-SWA MVRANKRNEALRIESALLNKIAMLGTEKTAEAVGVDKSQISRWKRDWIPKFSMLLAVLEWGVVDDDMARLARQVAAILTNKKRPAATERSEQIQMEF >CP029164|4030203:4084519|4071532_4071991_-|AWH71595.1|DBSCAN-SWA MGEREVMKKLTFEIRSPAHQQNAIHAVQQILPDPTKPIVVTIQERNRSLDQNRKLWACLGDVSRQVEWHGRWLDAESWKCVFTAALKQQDVVPNLAGNGFVVIGQSTSRMRVGEFAELLELIQAFGTERGVKWSDEARLALEWKARWGDRAA >CP029164|4030203:4084519|4066744_4067242_-|AWH71585.1|DBSCAN-SWA MPPSLRKAVAAAIGGGAIAIASVLITGPSGNDGLEGVRHNPYKDIVGVWTVCHGHTGKDIIPGKTYTEAECKALLNKDLATVARQINPYIKVDIPETTRGALYSFVYNVGAGNFRTSTLLRKINQGDIKGACDQLRRWTYAGGKQWKGLMTRREIEREICLWGQQ >CP029164|4030203:4084519|4055181_4055373_-|AWH71569.1|DBSCAN-SWA MQEITLHEAAERAHQTEIICRLLEVYPNKITDADISALVSLLARLSGSVASFLIEEESKLVGD >CP029164|4030203:4084519|4070835_4071012_-|AWH71593.1|DBSCAN-SWA MRRQRRSFTDIICENCKYLPTKRSRNKRKPIPKESDVKTFNYTAHLWDIRWLRYRARK >CP029164|4030203:4084519|4030784_4034186_-|AWH71544.1|DBSCAN-SWA MAVRISGVLKDGAGKPIQNCTIQLKARRNSTTVVVNTVASENPDEAGRYTMDVEYGQYSVSLLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSAGDAGTSAREAATHATDAAGSARAASTSAGQAASSAQSASSSAGTASAKATEASKSAAAAESSKSAAATSAAAAKTSETNAAASQQSAATSASTATTKASEAATSARDASASKEAAKSSETSAASSASSAASSATATANSAKAAKTSETNAKASETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEAAVQASAAARSASAAKTSETNAKASETRAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSAAAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGVVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGRFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCEFNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFYVDGAAFPVFAFIEDGLSISAPGADLVVNDTTYKFGATNPATECIAADVILDFKSGRGFYESHSLIVNDNLSCKKLFATDEIVARGGNQIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGWNTLRPFAIDNATGELIIGTKLSASLNGNALTATKLQTPRLVSGVEFDGSKDITLTAAHVAAFARRATDTYADADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNPSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA >CP029164|4030203:4084519|4043760_4044183_-|AWH71553.1|tail|DBSCAN-SWA MFLKTESFEHNGVTVTLSELSALQRIEHLDLMKRQAEQAESDSNRKFTVEDAIRTGAFVVAMSLWHNHPQKTKLPSMNEAVKQIEQEVLTTWPTEAISHAENVVYRLSGMYEFVVNDAPEQAEDAGPAVPVSAGKCSTVS >CP029164|4030203:4084519|4070304_4070499_-|AWH71591.1|DBSCAN-SWA MLSPSQSLQYLKGSIERASMCTEWILSRFSAYRRLPVKGMPSKSMLHMQKNARWKVWREHRLSG >CP029164|4030203:4084519|4062571_4064497_-|AWH71579.1|terminase|DBSCAN-SWA MNISNSQVNRLRHFVRAGLRSLFRPEPQTAVEWADANYYLPKESAYQEGRWETLPFQRAIMNAMGSDYIREVNVVKSARVGYSKMLLGVYAYFIEHKQRNTLIWLPTDGDAENFMKTHVEPTIRDIPSLLALAPWYGKKHRDNTLTMKRFSNGRGFWCLGGKAAKNYREKSVDVAGYDELAAFDEDIEQEGSPTFLGDKRIEGSVWPKSIRGSTPKVRGTCQIERAASESPHFMRFHVACPHCGEEQYLKFGDKETPFGLKWTPDDPSSVFYLCEHNACVIRQQELDFTDARYICEKTGIWTRDGILWFSSSGEEIEPPDSVTFHIWTAYSPFTTWVQIVKDWMKTKGDTGKRKTFVNTTLGETWEAKIGERPDAEVMAERKEHYSAPVPDRVAYLTAGIDSQLDRYEMRVWGWGPGEESWLIDRQIIMGRHDDEQTLLRVDETINKTYTRRNGAEMSVSRICWDTGGIDPTIVYERSKKHGLFRVIPIKGASVYGKPVASMPRKRNKNGVYLTEIGTDTAKEQIYNRFTLTPGGDEPLPGAVHFPNNPDIFDLTEAQQLTAEEQVEKWVDGRKKILWDSKKRRNEALDCFVYALAALRISISRWQLDLSALLASLQEEDGAATNKKTLADYARALSGEDE >CP029164|4030203:4084519|4038478_4039120_-|AWH71547.1|tail|DBSCAN-SWA MAATHTLPLASPGMARICLYGDLQRFGRRIDLRVKTGAEAIRALATQLPAFRQKLSDGWYQVRIAGRDTGETELSARLNEPLANGAVIHIVPRLAGAKSGGVFQAVLGAAVMAVAIWMPGVGIMASNLLFSLGASMTLGGVAQMLAPKPKTPSTQTTDNGKQNTYFSSLDNMVAQGNVLPVLYGEMRVGSRVVSQEISTADEGDGGQVVVIGR >CP029164|4030203:4084519|4045338_4045917_-|AWH71555.1|tail|DBSCAN-SWA MAIKGLEQAVENLSRISRTAVPGAAAMAINRVASSAISQSASQVARETKVRRKLVKERAWLKRATVKNPQARIKVNRGDLPVIKLGNARIVLSRRRRRKKGQRSALKGGGSVLVVGNRRIPGAFIQQLKNGRWHVMQRVAGKNRYPIDVVKIPMAVPLTTAFKQNIERIRRERLPKELGYALQHQLRMVIKR >CP029164|4030203:4084519|4034250_4034850_-|AWH71545.1|DBSCAN-SWA MRKVCAAILSAAICLAVSGAPAWASEHQSTLSAGYLHASTNVPGSDDLNGINVKYRYEFTDTLGMVTSFSYAGDKNRQLTRYSDTRWHEDSVRNRWFSVMAGPSVRVNEWFSAYAMAGVTYSRVSTFSGDYLRVTDNKGKTHDVLTGSDDGRHSNTSLAWGAGVQFNPTESVAIDIAYEGSGSGDWRTNSFIVGVGYKF >CP029164|4030203:4084519|4068029_4069127_+|AWH71587.1|DBSCAN-SWA MKKLTVAISAVAASVLMAMSAQAAEIYNKDSNKLDLYGKVNAKHYFSSNDADDGDTTYARLGFKGETQINDQLTGFGQWEYEFKGNRAESQGSSKDKTRLAFAGLKFGDYGSIDYGRNYGVAYDIGAWTDVLPEFGGDTWTQTDVFMTQRATGVATYRNNDFFGLVDGLNFAAQYQGKNDRSDFDNYTEGNGDGFGFSATYEYEGFGIGATYAKSDRTDTQVNAGKVLPEVFASGKNAEVWAAGLKYDANNIYLATTYSETQNMTVFADHFVANKAQNFEAVAQYQFDFGLRPSVAYLQSKGKDLGVWGDQDLVKYVDVGATYYFNKNMSTFVDYKINLLDKNDFTKALGVSTDDIVAVGLVYQF >CP029164|4030203:4084519|4062368_4062575_-|AWH71578.1|head,tail|DBSCAN-SWA MTRQEELAAARAALHDLMTGKRVATVQKDGRRVEFTATSVSDLKKYIAELEVQTGMTQRRRGPAGFYV >CP029164|4030203:4084519|4077923_4078292_+|AWH71603.1|DBSCAN-SWA MSNIKKYIIDYDWKASIEIEIDHDVMTEEKLHQINNFWSDSEYRLNKHGSVLNAVLIMLAQHALLIAISSDLNAYGVVCEFDWNDGNGQEGWPPMDGSEGIRITDIDTSGIFDSDDMAIKAA >CP029164|4030203:4084519|4070491_4070875_-|AWH71592.1|DBSCAN-SWA MGYPVAKISCEEMTMDYSQLSDFEINRMVGDIIFKGLWASKPETSGNNTNKWYYGNADTTFEPLNHLPDYCNDPSASWPIIEKYRISILDQLTEWCVDAKGVSPIFDARPLRAAMIVFLLMQEANNA >CP029164|4030203:4084519|4075556_4076375_+|AWH72573.1|DBSCAN-SWA MQRVHATHRKHFAEHELTAQCIMKVIWSMVLSHLNGKRQSQERREPLELYRIYFYDCPPLDIQTRLPLPEPGNKTPGRKNFKLEKSYILRTELHEELRKTRKTALRLGNLVDNKRWQLTTFSLDALMKGTKKWDELTNDDFYYDIKQKQVDIKLGMDITTLAYEKLVDVIVLVAGDSDFVPAAKHARIKGIDFILDPLRQNVTPSLSEHIDGVQSYSLISGLADALHVEPDPAPDWWEDRKKGKPRGKNNSGKRRYGNTQAESAKKHQRNKR >CP029164|4030203:4084519|4080833_4081115_+|AWH71610.1|DBSCAN-SWA MHFSGSGLHILCAYACRHGACSMTPQQENALRSIARQANSEIKKARQQFPDKNVDDICRSVLKKHRETVTLMGFTPTHLSLAIGMLNGVFKER >CP029164|4030203:4084519|4077172_4077667_-|AWH71602.1|DBSCAN-SWA MSDEKGIISRITDAVSGAGGALMSAVGAVKEIQKMQIDYSVKEKTYELVDKLMDAQQQQMSLNELLMISKDKIIELEEKINRASKWEEEKKNYEMYTPTVATVVYRLKKSANTGQPMHYLCAQCYESSVKSILQYEGFAPPSNHRMRCHRCNASYLFPKSAFSK >CP029164|4030203:4084519|4074676_4075390_+|AWH71601.1|DBSCAN-SWA MSAKKKPLTQEQLEDARRLKAIYEKKKNELGLSQESVADKMGMGQSGVGALFNGINALNAYNAALLAKILNVSVEEFSPSIAREIYEMYEAVSMQPSLRSEYEYPVFSHVQAGMFSPELRTFTKGDAEKWVSTTKKASGSAFWLEVEGNSMTAPTGSKPSFPDGMLILVDPEQAVEPGDFCIARLGGDEFTFKKLIRDSGQVFLQPLNPQYPMIPCNESCSVVGKVIASQWPEETFG >CP029164|4030203:4084519|4067241_4067457_-|AWH71586.1|holin|DBSCAN-SWA MKSMDKLTTGVAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDKRKAARGG >CP029164|4030203:4084519|4052362_4052605_-|AWH71565.1|DBSCAN-SWA MHTSGKLNKHIKPHYRALDMAEHWLRVAIKAIDRNAGEGYAKAHPELISAFMTTAAANFATLTEREIAEAEEVTTINIKS >CP029164|4030203:4084519|4059128_4059461_-|AWH71575.1|head|DBSCAN-SWA MTSKETFTHYQPLGNSDPAHTATAPGGLSAKAPAMTPLMLDTSTRKLVAWDGTTDGAAVGILAVDADQTSTTLTFYKSGTFRYEDVLWPEAASDETKKRTAFAGTAISIV >CP029164|4030203:4084519|4081431_4081980_+|AWH71612.1|DBSCAN-SWA MSEINSQALREAAVAIETVATPQKLLAFRMKVTPQVVLALLDERDALNERLAELEADLAGLAEDHQKATESIKQADAAVKLAHEKFSALAAENELARKAVQEFCDVVGDSTEVICEEIGRDGVLVILEAMKATGNMPATDAFLAEVRAQGVEMMREHPSIKLCSLTHICDELAAQLRKGGNQ >CP029164|4030203:4084519|4064471_4065017_-|AWH71580.1|terminase|DBSCAN-SWA MEVNKKQLADIFGASIRTIQNWQEQGMPVLRGGGKGNEVLYDSAAVIKWYAERDAEIENEKLRREVEELRQASETDLQPGTIEYERHRLTRAQADAQELKNARDSAEVVETAFCTFVLSRIAGEIASILDGIPLSVQRRFPELENRHVDFLKRDIIKAMNKAAALDELIPGLLSEYIEQSG >CP029164|4030203:4084519|4072333_4073035_-|AWH71597.1|DBSCAN-SWA MKNIAAQMVNFDREQMRRIANNMPEQYDEKPQVQQVAQIINGVFSQLLATFPASLANRDQNELNEIRRQWVLAFRENGITTMEQVNAGMRVARRQNRPFLPSPGQFVAWCREEASVIAGLPNVSELVDMVYEYCRKRGLYPDAESYPWKSNAHYWLVTNLYQNMRANALTDAELRRKAADELVHMTARINRGEAIPEPVKQLPVMGGRPLNRAQALAKIAEIKAKFGLKGASV >CP029164|4030203:4084519|4044198_4044939_-|AWH72571.1|tail|DBSCAN-SWA MPVPNPAIPVKGAGTTLWVYKGSGDPYANPLSDVDWSRLAKVKDLTPGELTAESYDDSYLDDEDADWTATGQGQKSAGDTSFTLAWMPGEQGQQALLAWFNEGDTRAYKIRFPNGTVDVFRGWVSSIGKAVTAKEVITRTVKVTNVGRPSMAEDRSTVTATTGMTVTPASASVVKGQSTTLTVAFQPDGATDKSFRAVSADKTKATVSVSGMTITVKGVAAGKVNIPVVSGNGEFAAVAEINVTAS >CP029164|4030203:4084519|4049456_4049651_-|AWH71560.1|DBSCAN-SWA MEVNKDQLADILGKSCAILLSDVNIFEEHYYIDVIRDMFDRTRNKADALVMSIAEIIKENSNDK >CP029164|4030203:4084519|4078371_4078641_+|AWH71604.1|DBSCAN-SWA MPLQGGLLLAALPNLYLNESPVNYVTDGNALSTYLISQEYPKMDKKLMAIQTKFTIATFIGDEKMFREAVDAYKKWILILKLRSSKSIH >CP029164|4030203:4084519|4056594_4057131_+|AWH71572.1|DBSCAN-SWA MNLKKIATNTKNKITETFNKLILEASKTPTQDEIKILERRSKKFNYSFFSYAVTGAIIVFCSQPLIKYANPILILLSGLLLSIIIIILRMIYISQANASWTTKKRSHVLVHFLSACFIASTLTLLYQAYDNNITHKLYCKNIQQLIEKRIETEKNISIFSGMQCTPVYDYSLFGFNLL >CP029164|4030203:4084519|4073031_4073931_-|AWH71598.1|DBSCAN-SWA MTNTAKILNFGRGNFAGQERNVADLDDGYARLSNMLLEAYSGADLTKRQFKVLLAILRKTYGWNKPMDRITDSQLSEITKLPVKRCNEAKLELVRMNIIKQQGGMFGPNKNISEWCIPQNEGQSPKTRDKTSLKLGDCYPSKQGDTKDTITKEKRKDYSSENSGESSDQPENDLSVVKPDAAIQSGSKWGTAEDLTAAEWMFDMVKTIAPSARKPNFAGWANDIRLMRERDGRNHRDMCVLFRWACQDNFWSGNVLSPAKLRDKWTQLEINRNKQQAGVTASKPKLDLTNTDWIYGVDL >CP029164|4030203:4084519|4083448_4084519_+|AWH71616.1|integrase|DBSCAN-SWA MGRRRSHERRDLPPNLYIRNNGYYCYRDPRTGKEFGLGRDRRIAITEAIQANIELFSGHKHKPLTARINSDNSVTLHSWLDRYEKILASRGIKQKTLINYMSKIKAIRRGLPDAPLEDITTKEIAAMLNGYIDEGKAASAKLIRSTLSDAFREAIAEGHITTNPVAATRAAKSEVRRSRLTADEYLKIYQAAESSPCWLRLAMELAVVTGQRVGDLCEMKWSDIVDGYLYVEQSKTGVKIAIPTTLHVDALGISMKETLDKCKKILGGETIIASTRREPLSSGTVSRYFMRARKASGLSFEGDPPTFHELRSLSARLYEKQISDKFAQHLLGHKSDTMASQYRDDRGREWDKIEIK >CP029164|4030203:4084519|4054959_4055181_-|AWH71568.1|DBSCAN-SWA MNTEREVFFKLLACAESSLTLNNSAKVILNMWLDCINDNEDANIAYGLLSLIDEAAEKLNDAINSALLSNKSS >CP029164|4030203:4084519|4058797_4059067_-|AWH72572.1|capsid|DBSCAN-SWA MYTTAQLLAANEQKFKFDPLFLRLFFRESYPFTTEKVYLSQIPGLVNMALYVSPIVSGEVIRSRGGSTSEFTPGYVKPKHLAWLSEAFV >CP029164|4030203:4084519|4081213_4081435_+|AWH71611.1|DBSCAN-SWA MADIIDSASEIEELQRNTAIKMRRLNHQAISATHCCECGDPIDERRRLAVQGCRTCASCQEDLELISKQRGSK >CP029164|4030203:4084519|4050291_4050633_-|AWH71562.1|head|DBSCAN-SWA MATHYTELMSGTEALVTTLGIFSANKGVIPAFTPLMQEDATGALVVWDGTSAGKAVYVSAVQIDTAKKTQAQVYKTGVLNVDALNWPESVRELSAKVAAFVGSGISVQPLARV >CP029164|4030203:4084519|4046293_4046689_-|AWH71557.1|DBSCAN-SWA MTKDELIARLRSLGEQLNRDVSLTGTKEELALRVAELEEELDDTDDAAGQDTSVSPENALTGHENEVVSAQPDTVTDTADLVTVVALVTLHTDALHATRDEAVAFVLPGTAFRVSAGVAAEMTERGLARMQ >CP029164|4030203:4084519|4039017_4039761_-|AWH71548.1|tail|DBSCAN-SWA MTQTESAILAHARRCAPAESCGFVVSTPEGERYFPCVNISGEPEAYFRMSPEDWLQAEMQGEIVALVHSHPGGLPWLSEADRRLQVQSDLPWWLVCRGTIHKFRCVPHLTGRRFEHGVTDCYTLFRDAYHLAGIEMPDFHREDDWWRHGQNLYLDNLEATGLYQVPLSSAQPGDVLLCCFGSSVPNHAAIYCGDGELLHHIPEQLSKRERYTDKWQRRTHSLWRHRAWRASAFTGIYNDLVAASTFV >CP029164|4030203:4084519|4076856_4077180_+|AWH72574.1|DBSCAN-SWA MDAQTRRRERRAEKQAQWKAANPLLVGVSAKPVNRPILSLNRKPKSRVESALNPIDLTVLAEYHEQIESNLQRIERKNQRTWYSKPGERGITCRGRQKIKGKSISLI >CP029164|4030203:4084519|4051901_4052351_-|AWH71564.1|DBSCAN-SWA MTAQIAAYGRLVDDPQVKHTSKGTPMTLAWMAVSLPCSQADDGTATMWLSVLAFGRQADTLAKHHKGELLSVAGNMQVSQWTGQNGETRQGWQVIADSVISARTVRPGGKKDQQGQATDALNRAKQQADQQGSHPPVGDNEQWGDDIPF >CP029164|4030203:4084519|4030203_4030785_-|AWH71543.1|tail|DBSCAN-SWA MAFRMSEQPRTITIYNLLAGTNEFIGEGDAYIPPHTGLPANSTDIAPPDIPAGFVAVFNSDEASWHLVEDHRGKTVYDVASGDALFISELGPLAENVTWLSPEGEFQKWNGTAWVKDTEAEKLFRIREAEETKNSLMQVASEHIAPLQDAVDLEIATEEEISLLEAWKKYRVLLNRVNTTTAPDIEWPVAPIG >CP029164|4030203:4084519|4079017_4079803_+|AWH71606.1|DBSCAN-SWA MSTALATLAGKLAERVGMDSVDPQELITTLRQTAFKGDASDAQFIALLIVANQYGLNPWTKEIYAFPDKQNGIVPVVGVDGWSRIINENQQFDGMDFEQDNESCTCRIYRKDRNHPICVTEWMDECRREPFKTREGREITGPWQSHPKRMLRHKAMIQCARLAFGFAGIYDKDEAERIVENTAYTAERQPERDITPVNDETMQEINTMLIALDKTWDDDLLPLCSQIFRRDIRASSELTQAEAMKALGFLKQKASEQKVAA >CP029164|4030203:4084519|4066286_4066748_-|AWH71584.1|lysis|DBSCAN-SWA MNRVTAIISALVICIIVGLSWAVNHYRDNAITYKAQRDKNARELKLANAAITDMQMRQRDVAALDAKYTKELADAKAENDALRDDVAAGRRRLHIKAVCQSVREATTASGVDNAASPRLADTAERDYFTLRERLITMQKQLEGTQKYINEQCR >CP029164|4030203:4084519|4069922_4070285_-|AWH71590.1|DBSCAN-SWA MNTYSITLPWPPSNNRYYRHNRGRTHVSAEGQAYCDNVARIIKNAMLDIGLAMPVKIRIECHMPDRRRRDLDNLQKASFDALTKAGFWLDDAQVVDYRVVKMPVTKGGRLELTITEMGNE >CP029164|4030203:4084519|4047675_4048839_+|AWH71558.1|DBSCAN-SWA MQPEDYEEKQYEDEPESYPIDEFQLTTTPNDFNIITIISFIKSKVFKIPNFQRHYVWDIKRASKLIESLLIGLPIPQIFLYEQDKNEFLVIDGQQRLMTLYYFVNGVFPRKEKRSELRKIFEDNGNIPENILHNDEYFTKFNLKLDGLSDTQKNKFNGKNYETLNEFQTTLNLATIRNMVIKPVAQDSEDGAMFEIFNRLNSGGMNLSPQEIRMSLYHSDFLSNLVSLNENKTWRKILSKNVVDMRLSDIEAILRTFAMSLFTSQYKSSVSGFLNNFSNYAKNYDTKDIILFSNIWNEFMDSVDGIDEINFRTGGNRMSITLFESIFYAATYDSFKDKDLKIRQVTVNYIDKLKNDPEFLTFSTDKTTRREHVIGRLERARTILEGM |
78 | Enterobacteria_phage(56.25%) | portal,terminase,holin,tail,integrase,lysis,head,capsid | attL 4036870:4036886|attR 4092465:4092481 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|