Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
CP029164 | Escherichia coli strain 104 chromosome, complete genome | 4 crisprs | WYL,cas3,csa3,PD-DExK,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,DinG | 1 | 15 | 8 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP029164_1 | 136157-136296 | Orphan |
NA
Consensus repeat of CP029164_1
|
1 spacers
spacers of CP029164_1
>1.1|136201|52|CP029164|CRISPRCasFinder GCACTGAACTCGTAGGCCTGATAAGACGCGACAGCGTCGCATCAGGCAAGGC |
CRISPR arrays and Neighbor proteins around CP029164_1
The CRISPR arrays of CP029164_1 >merge|CP029164|1|136157-136296|CRISPRCasFinder TGTTATTGTCGGATGCGGCGTGAACGCCTTATCCGACCTACACAGCACTGAACTCGTAGGCCTGATAAGACGCGACAGCGTCGCATCAGGCAAGGCTGTTATTGTCGGATGCGACGTGAACGCCTTATCCGACCTACACA >CP029164|1|1|136157-136296|CRISPRCasFinder TGTTATTGTCGGATGCGGCGTGAACGCCTTATCCGACCTACACA GCACTGAACTCGTAGGCCTGATAAGACGCGACAGCGTCGCATCAGGCAAGGC TGTTATTGTCGGATGCGACGTGAACGCCTTATCCGACCTACACA
>CP029164.1|AWH67990.1|134628_136140_+|cytosol-aminopeptidase MEFSVKSGSPEKQRSACIVVGVFEPRRLSPIAEQLDKISDGYISALLRRGELEGKPGQTLLLHHVPNVLSERILLIGCGKERELDERQYKQVIQKTINTLNDTGSMEAVCFLTELHVKGRNNYWKVRQAVETAKETLYSFDQLKTNKSEPRRPLRKMVFNVPTRRELTSGERAIQHGLAIAAGIKAAKDLGNMPPNICNAAYLASQARQLADSYSKNVITRVIGEQQMKELGMHSYLAVGQGSQNESLMSVIEYKGNASEDARPIVLVGKGLTFDSGGISIKPSEGMDEMKYDMCGAAAVYGVMRMVAELQLPINVIGVLAGCENMPGGRAYRPGDVLTTMSGQTVEVLNTDAEGRLVLCDVLTYVERFEPEAVIDVATLTGACVIALGHHITGLMANHNPLAHELIAASEQSGDRAWRLPLGDEYQEQLESNFADMANIGGRPGGAITAGCFLSRFTRKYNWAHLDIAGTAWRSGKAKGATGRPVALLAQFLLNRAGFNGEE >CP029164.1|AWH67989.1|133261_134362_-|LPS-export-ABC-transporter-permease-LptF MIIIRYLVRETLKSQLAILFILLLIFFCQKLVRILGAAVDGDIPANLVLSLLGLGVPEMAQLILPLSLFLGLLMTLGKLYTESEITVMHACGLSKAVLVKAAMILAVFTAIVAAVNVMWAGPWSSRHQDEVLAEAKANPGMAALAQGQFQQATNGSSVLFIESVDGSDFKDVFLAQIRPKGNARPSVVVADSGHLTQLRDGSQVVTLNQGTRFEGTALLRDFRITDFQDYQAIIGHQAVALDPNDTDQMDMRTLWNTDTDRARAELNWRITLVFTVFMMALMVVPLSVVNPRQGRVLSMLPAMLLYLLFFLIQTSLKSNGGKGKLDPTLWMWTVNLIYLALAIVLNLWDTVPVRRLRASFSRKGAV >CP029164.1|AWH67988.1|132179_133262_-|LPS-export-ABC-transporter-permease-LptG MQPFGVLDRYIGKTIFTTIMMTLFMLVSLSGIIKFVDQLKKAGQGSYDALGAGMYTLLSVPKDVQIFFPMAALLGALLGLGMLAQRSELVVMQASGFTRMQVALSVMKTAIPLVLLTMAIGEWVAPQGEQMARNYRAQAMYGGSLLSTQQGLWAKDGNNFVYIERVKGDEELGGISIYAFNENRRLQSVRYAAAAKFDPEHKVWRLSQVDESDLTNPKQITGSQTVSGTWKTNLTPDKLGVVALDPDALSISGLHNYVKYLKSSGQDAGRYQLNMWSKIFQPLSVAVMMLMALSFIFGPLRSVPMGVRVVTGISFGFVFYVLDQIFGPLTLVYGIPPIIGALLPSASFFLISLWLLMRKS >CP029164.1|AWH67987.1|130516_132019_+|DUF853-domain-containing-protein MSEPLLIARTPDTELFLLPGMANRHGLITGATGTGKTVTLQKLAESLSEIGVPVFMADVKGDLTGVAEEGTSSEKLLARLKNIGVNDWQPHTNPVVVWDIFGEKGHPVRATVSDLGPLLLARLLNLNDVQSGVLNIIFRIADDQGLLLLDFKDLRAITQYIGDNAKSFQNQYGNISSASVGAIQRGLLSLEQQGAAHFFGEPMLDIKDWMRTDANGKGVINILSAEKLYQMPKLYAASLLWMLSELYEQLPEAGDLEKPKLVFFFDEAHLLFNDAPQVLLDKIEQVIRLIRSKGVGVWFVSQNPSDIPDNVLGQLGNRVQHALRAFTPKDQKAVKAAAQTMRANPTFDTEKAIQELGTGEALISFLDAKGSPSVVERAMVIAPCSRMGPVTEDERNGLINHSPVYGKYEDDVDRESAYEMLQKGFQASIEQQNNPPAKGKEVAVDDGILGGLKDILFGTTGPRGGKKDGVVQTMAKSAARQVTNQIVRGMLGSLLGGRRR >CP029164.1|AWH67986.1|129440_130439_+|LacI-family-DNA-binding-transcriptional-regulator MRNHRISLQDIATLAGVTKMTVSRYIRSPKKVAKETGERIAKIMEEINYIPNRAPGMLLNAQSYTLGILIPSFQNQLFADILAGIESVTSVHNYQTIIANYNYDRDSEEESVINLLSYNIDGIILSEKYHTIRTVKFLRSATIPVVELMDVQGERLDMEVGFDNRQAAFDMVCTMLDKRVRRKILYLGSKDDTRDEQRYQGYCDAMMLHNLSPLRMNPRAISSIHLGMQLMRDALSANPDLDGVFCTNDDIAMGALLLCRERNLAVPEQISIAGFHGLEIGRQMIPSLASVITPRFDIGRMAAQMLLSKIKNNDHNHNTVDLGYQIYHGNTL >CP029164.1|AWH67985.1|128054_129374_+|gluconate-permease MPLIIIAAGVALLLILMIGFKVNGFIALVLVAAVVGFAEGMDAQAVLHSIQNGIGSTLGGLAMILGFGAMLGKLISDTGAAQRIATTLIATFGKKRVQWALVITGLVVGLAMFFEVGFVLLLPLVFTIVASSGLPLLYVGVPMVAALSVTHCFLPPHPGPTAIATIFEANLGTTLLYGFIITIPTVIVAGPLFSKLLTRFEKAPPEGLFNPHLFSEEEMPSFWNSIFAAVIPVILMAIAAVCEITLPKTNTVRLFFEFVGNPAVALFIAIVIAIFTLGRRNGRTIEQIMDIIGDSIGAIAMIVFIIAGGGAFKQVLVDSGVGQYISHLMTGTTLSPLLMCWTVAALLRIALGSATVAAITTAGVVLPIINVTHADPALMVLATGAGSVIASHVNDPGFWLFKGYFNLTVGETLRTWTVMETLISIMGLLGVLAINAVLH >CP029164.1|AWH67984.1|127225_127990_+|gluconate-5-dehydrogenase MNDLFSLAGKNILITGSAQGIGFLLATGLGKYGAQIIINDITAERAELAVEKLHQEGIQAVAAPFNVTHKHEIDAAVEHIEKDIGPIDVLVNNAGIQRRHPFTEFPEQEWNDVIAVNQTAVFLVSQAVTRHMVERKAGKVINICSMQSELGRDTITPYAASKGAVKMLTRGMCVELARHNIQVNGIAPGYFKTEMTKALVEDEAFTAWLCKRTPAARWGDPQELIGAAVFLSSKASDFVNGHLLFVDGGMLVAV >CP029164.1|AWH67983.1|126170_127202_+|L-idonate-5-dehydrogenase MQVKTQSCVVAGKKTVAVTEQTIDWNNNGTLVQITRGGICGSDLHYYQEGKVGNFMIKAPMVLGHEVIGKVIHSDSSKLHEGQTVAINPSKPCGHCKYCIEHNENQCTEMRFFGSAMYFPHVDGGFTRYKMVETSQCVPYPAKADEKVMAFAEPLAVAIHAAHQAGELQGKRVFISGVGPIGCLIVSAVKTLGAAEIVCADVSPRSLSLGKEMGADVLVNPQNDDMDHWKAEKGYFDVSFEVSGHPSSVNTCLEVTRARGVMVQVGMGGAMAEFPMMTLIGKEISLKGSFRFTSEFNTAVSWLANGVINPLPLLSAEYPFTDLEEALRFAGDKTQAAKVQLVF >CP029164.1|AWH67982.1|125390_125954_-|thermosensitive-gluconokinase MAGESFILMGVSGSGKTLIGSKVAALLSAKFIDGDDLHPAKNIDKMSQGIPLSDEDRLPWLERLNDASYSLYKKNETGFIVCSSLKKQYRDILRKGSPHVHFLWLDGDYETILARMQRRAGHFMPVALLKSQFEALERPQADEQDIVRIDINHDIANVTEQCRQAVLAIRQNRICAKEGSASDQRCE >CP029164.1|AWH67981.1|124367_125387_+|NAD(P)-dependent-alcohol-dehydrogenase MSMIKSYAAKEAGGELEVYEYDPGELRPQDVEVQVDYCGICHSDLSMIDNEWGFSQYPLVAGHEVIGRVVALGSAAQDKGLQVGQRVGIGWTARSCGHCDACISGNQINCEQGAVPTIMNRGGFAEKLRADWQWVIPLPENIDIESAGPLLCGGITVFKPLLMHHITATSRVGVIGIGGLGHIAIKLLHAMGCEVTAFSSNPAKEQEVLAMGADKVVNSRDPQALKTLAGQFDLIINTVNVSLDWQPYFEALTYGGNFHTVGAVLTPLPVPAFTLIAGDRSVSGSATGTPYELRKLMRFAARSKVAPTTELFPMSKINDAIQHVRDGKARYRVVLKADF >CP029164.1|AWH67991.1|136397_136841_+|DNA-polymerase-III-subunit-chi MKNATFYLLDNDTTVDGLSAVEQLVCEIAAERWRSGKRVLIACEDEKQAYRLDEALWARPAESFVPHNLAGEGPRGGAPVEIAWPQKRSSSPRDILISLRTSFADFATAFTEVVDFVPYEDSLKQLARERYKAYRVAGFNLNTATWK >CP029164.1|AWH67992.1|136840_139696_+|valine--tRNA-ligase MEKTYNPQDIEQPLYEHWEKQGYFKPNGDESQESFCIMIPPPNVTGSLHMGHAFQQTIMDTMIRYQRMQGKNTLWQVGTDHAGIATQMVVERKIAAEEGKTRHDYGREAFIDKIWEWKAESGGTITRQMRRLGNSVDWERERFTMDEGLSNAVKEVFVRLYKEDLIYRGKRLVNWDPKLRTAISDLEVENRESKGSMWHIRYPLADGAKTADGKDYLVVATTRPETLLGDTGVAVNPEDPRYKDLIGKYVILPLVNRRIPIVGDEHADMEKGTGCVKITPAHDFNDYEVGKRHALPMINILTFDGDIRESAQVFDTKGNESDVYSSEIPAEFQKLERFAARKAVVAAVDALGLLEEIKPHDLTVPYGDRGGVVIEPMLTDQWYVRADVLAKPAVEAVENGDIQFVPKQYENMYFSWMRDIQDWCISRQLWWGHRIPAWYDEAGNVYVGRNEDEVRKENNLGADVALRQDEDVLDTWFSSALWTFSTLGWPENTDALRQFHPTSVMVSGFDIIFFWIARMIMMTMHFIKDENGKPQVPFHTVYMTGLIRDDEGQKMSKSKGNVIDPLDMVDGISLPELLEKRTGNMMQPQLADKIRKRTEKQFPNGIEPHGTDALRFTLAALASTGRDINWDMKRLEGYRNFCNKLWNASRFVLMNTEGQDCGFNGGEMTLSLADRWILAEFNQTIKAYREALDSFRFDIAAGILYEFTWNQFCDWYLELTKPVMNGGTEAELRGTRHTLVTVLEGLLRLAHPIIPFITETIWQRVKVLCGITADTIMLQPFPQYDASQVDEAALADTEWLKQAIVAVRNIRAEMNIAPGKPLELLLRGCSADAERRVNENRGFLQTLARLESITVLPADDKGPVSVTKIVDGAELLIPMAGLINKEDELARLAKEVAKIEGEISRIENKLANEGFVARAPEAVIAKEREKLEGYAEAKAKLIEQQAVIAAL >CP029164.1|AWH67993.1|139742_140930_-|DUF898-domain-containing-protein MNDVNIGKDNSRHSFVFTGKGGEYFLICLVNFSLTIITLGIYGPWALIKCRRYIYQHVTLKGQPFSYKGTGGAIFVSMLLIVVVYLLSISCFAGQHFALGLFLFALLICGIPCMAVKSLQYQANMTSLNDIRFGFNCSMMRAWWVMLGLPVLLALVFWFALYLIAQVTTSIGGLFFNLVALSLLSAIGLGVVHGITYSKWMPLLGNNATFGIHKFSIQVNVKECIKGCMLAILTMVPFIIVIGIMIAPVFQQLMMMTMLGRSDAGSEFVLQYYPQIMASYFLYFVAILVFASYLYVTLRNLFLNNLTLANGTIRFHSSVTAIGMLLRMLAVLMGSSITCGLAYPWLKMWMVSWIANNTHVQGDLDSLELTNDDKPQDSGSLMWISRGIMPYVPFI >CP029164.1|AWH67994.1|141122_141626_+|N-acetyltransferase MNVVASPALRLRKLTVADNPAIAHVIRQVSAEYGLTADKGYTVADPNLDELYQVYSQPGHAYWVVEYEGEVVGGGGIAPLTGSESDICELQKMYFLPAIRGKGLAKKLALMAMEQAREMGFKRCYLETTAFLKEAIALYEHLGFEHIDYALGCTGHVDCEVRMLRKL >CP029164.1|AWH67995.1|141802_142219_-|ribonuclease-E-inhibitor-RraB MANPEQLEEQREETRLIIEELLEDGSDPDALYTIEHHLSADDLETLEKAAVEAFKLGYEVTDPEELEVEDGDIVICCDILSECALNADLIDAQVEQLMTLAEKFDVEYDGWGTYFEDPNGEDGDDEDFVDEDDDGIRH >CP029164.1|AWH67996.1|142380_143385_+|ornithine-carbamoyltransferase MSGFYHKHFLKLLDFTPAELNSLLQLAAKLKADKKSGKEEARLTGKNIALIFEKDSTRTRCSFEVAAYDQGARVTYLGPSGSQIGHKESIKDTARVLGRMYDGIQYRGYGQEIVETLAEYAGVPVWNGLTNEFHPTQLLADLLTMQEHLPGKAFNEMTLVYAGDARNNMGNSMLEAAALTGLDLRLVAPQACWPEAALVAECSALAQKHGGKITLTEDIASGVKGADFIYTDVWVSMGEPKEKWAERIALLRDYQVNSKMMALTGNSQVKFLHCLPAFHDEQTTLGKKMAAELGLYGGMEVTDEVFESPASIVFDQAENRMHTIKAVMVATLAK >CP029164.1|AWH67997.1|143430_143883_-|YhcH/YjgK/YiaL-family-protein MIVGNIHHLQSWLPEELREAIEYIKSHVSDETAKGKHAIDGDRLFYLISEDTTEPGELRRAEYHARYLDIQIVLKGQEGMTFSTQPAGVPETDWLADKDIAFIGQGIDEKTVILNEGDFVVFYPGEVHKPLCAVGAPAQVRKAVVKLLKS >CP029164.1|AWH67998.1|144560_145781_+|arginine-deiminase MEKHYVGSEIGQLRSVMLHRPNLSLKRLTPSNCQELLFDDVLSVERAGEEHDIFANTLRQQGIEVLLLTDLLTQTLDIPEAKSWLLETQISDYRLGPTFATDVRTWLAEMSHRDLARHLSGGLTYSEIPASIKNMVVDTHDINDFIMKPLPNHLFTRDTSCWIYNGVSINPMAKPARQRETNNLRAIYRWHPQFAGGEFIKYFGDENINYDHATLEGGDVLVIGRGAVLIGMSERTTPQGIEFLAQALFKHRQAERVIAVELPKHRSCMHLDTVMTHIDIDTFSVYPEVVRPDVNCWTLTPDGHGGLKRTQESTLLHAIEKALGIDQVRLITTGGDAFEAEREQWNDANNVLTLRPGVVVGYERNIWTNEKYDKAGITVLPIPGDELGRGRGGARCMSCPLHRDGI >CP029164.1|AWH67999.1|145791_146736_+|carbamate-kinase MENKPTLVIALGGNALLKRGEPLEAEIQRKNIDLAAKTIAQLTQHWRVVLVHGNGPQVGLLALQNSAYAHVAPYPLDILGAESQGMIGYMLQQALKNQLPQREISVLLTQVEVDANDPAFSNPTKYIGPIYDHAQTQVLQAEKGWVFKADGHSFRRVVPSPQPKRIVERDAIQTLIAHDHLVICNGGGGVPVVEKADGYHGIEAVIDKDLSAALLASQIHADALLILTDADAVYLDWGKPTQRPLAQVTPELLNEMQFDAGSMGPKVTACAKFVSQCRGIAGIGSLADGPEILAGDKGTLIRLDTPITTLDPFL >CP029164.1|AWH68000.1|146746_147751_+|ornithine-carbamoyltransferase MATSLKNRNFLKLLDYTPAEIQYLIDLAINLKAAKKSGNEKQTLVGKNIALIFEKSSTRTRCAFEVAAFDQGAQVTYIGPSGSQIGHKESMKDTARVLGRMYDGIEYRGYGQNIVEELGEFAGVPVWNGLTNEFHPTQILADLMTMLEHAPGKTLPELSFAYLGDARNNMGNSLMVGAAKMGMDIRLVAPKSFWPDEALVTQCREIASVTGARITLTEDVEEGVYDVDFLYTDVWVSMGEPKEAWAERVSLMTPYQINQQVITATRNPEVKFMHCLPAFHNEHTTVGREIEMAYGLKGLEVTDEVFESAHSIVFDEAENRMHTIKAVMVATLGD |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP029164_3 | 2002885-2003401 | Unclear |
I-E
Consensus repeat of CP029164_3
|
8 spacers
spacers of CP029164_3
>3.1|2002914|32|CP029164|PILER-CR,CRISPRCasFinder,CRT ACGTAACAAAACAACAGCAAAATATTATCGAC >3.2|2002975|32|CP029164|PILER-CR,CRISPRCasFinder,CRT GCTCCGCCGGTTTGATCTCCGGTTTGCGCTGT >3.3|2003036|32|CP029164|PILER-CR,CRISPRCasFinder,CRT GCAATTAATTTAGTTCCAGATGCTGCGAAAGA >3.4|2003097|32|CP029164|PILER-CR,CRISPRCasFinder,CRT TTGGTCACTCGTCAAAAGTCGAGACGGTCGAA >3.5|2003158|32|CP029164|CRISPRCasFinder,CRT AATGATTGATATAAATCTGTGTACGGTGTCCG >3.6|2003219|32|CP029164|CRISPRCasFinder,CRT,PILER-CR CTAGGATAAATTAAAAGACAAAATTGCAGCAA >3.7|2003280|32|CP029164|CRISPRCasFinder,CRT,PILER-CR GAGCGACCAGTATCAAGATCGACAGGTTTTGC >3.8|2003341|32|CP029164|CRISPRCasFinder,CRT,PILER-CR ATCGATATGTACGTTAGCGAGGGGATCACGCA |
CRISPR arrays and Neighbor proteins around CP029164_3
The CRISPR arrays of CP029164_3 >merge|CP029164|3|2002885-2003401|PILER-CR,CRISPRCasFinder,CRT,PILER-CR GAGTTCCCCACGTCAGCGGGGATAAACCGACGTAACAAAACAACAGCAAAATATTATCGACGAGTTCCCCACGTCAGCGGGGATAAACCGGCTCCGCCGGTTTGATCTCCGGTTTGCGCTGTGAGTTCCCCACGTCAGCGGGGATAAACCGGCAATTAATTTAGTTCCAGATGCTGCGAAAGAGAGTTCCCCACGTCAGCGGGGATAAACCGTTGGTCACTCGTCAAAAGTCGAGACGGTCGAAGAGTTCCCCACGTCAGCGGGGATAAACCGAATGATTGATATAAATCTGTGTACGGTGTCCGGAGTTCCCCGCGCCAGCGGGGATAAACCGCTAGGATAAATTAAAAGACAAAATTGCAGCAAGAGTTCCCCGCGGCAGCGGGGATAAACCGGAGCGACCAGTATCAAGATCGACAGGTTTTGCGAGTTCCCCGCGCCAGCGGGGATAAACCGATCGATATGTACGTTAGCGAGGGGATCACGCAGAGTTCCCCGCACCAGCGGGGATAAACCA >CP029164|3|1|2002885-2003157|PILER-CR GAGTTCCCCACGTCAGCGGGGATAAACCG ACGTAACAAAACAACAGCAAAATATTATCGAC GAGTTCCCCACGTCAGCGGGGATAAACCG GCTCCGCCGGTTTGATCTCCGGTTTGCGCTGT GAGTTCCCCACGTCAGCGGGGATAAACCG GCAATTAATTTAGTTCCAGATGCTGCGAAAGA GAGTTCCCCACGTCAGCGGGGATAAACCG TTGGTCACTCGTCAAAAGTCGAGACGGTCGAA GAGTTCCCCACGTCAGCGGGGATAAACCGAATGATTGATATAAATCTGTGTACGGTGTCCGGAGTTCCCCGCGCCAGCGGGGATAAACCG CTAGGATAAATTAAAAGACAAAATTGCAGCAA GAGTTCCCCGCGGCAGCGGGGATAAACCG GAGCGACCAGTATCAAGATCGACAGGTTTTGC GAGTTCCCCGCGCCAGCGGGGATAAACCG ATCGATATGTACGTTAGCGAGGGGATCACGCA >CP029164|3|3|2002885-2003401|CRISPRCasFinder GAGTTCCCCACGTCAGCGGGGATAAACCG ACGTAACAAAACAACAGCAAAATATTATCGAC GAGTTCCCCACGTCAGCGGGGATAAACCG GCTCCGCCGGTTTGATCTCCGGTTTGCGCTGT GAGTTCCCCACGTCAGCGGGGATAAACCG GCAATTAATTTAGTTCCAGATGCTGCGAAAGA GAGTTCCCCACGTCAGCGGGGATAAACCG TTGGTCACTCGTCAAAAGTCGAGACGGTCGAA GAGTTCCCCACGTCAGCGGGGATAAACCG AATGATTGATATAAATCTGTGTACGGTGTCCG GAGTTCCCCGCGCCAGCGGGGATAAACCG CTAGGATAAATTAAAAGACAAAATTGCAGCAA GAGTTCCCCGCGGCAGCGGGGATAAACCG GAGCGACCAGTATCAAGATCGACAGGTTTTGC GAGTTCCCCGCGCCAGCGGGGATAAACCG ATCGATATGTACGTTAGCGAGGGGATCACGCA GAGTTCCCCGCACCAGCGGGGATAAACCA >CP029164|3|1|2002885-2003401|CRT GAGTTCCCCACGTCAGCGGGGATAAACCG ACGTAACAAAACAACAGCAAAATATTATCGAC GAGTTCCCCACGTCAGCGGGGATAAACCG GCTCCGCCGGTTTGATCTCCGGTTTGCGCTGT GAGTTCCCCACGTCAGCGGGGATAAACCG GCAATTAATTTAGTTCCAGATGCTGCGAAAGA GAGTTCCCCACGTCAGCGGGGATAAACCG TTGGTCACTCGTCAAAAGTCGAGACGGTCGAA GAGTTCCCCACGTCAGCGGGGATAAACCG AATGATTGATATAAATCTGTGTACGGTGTCCG GAGTTCCCCGCGCCAGCGGGGATAAACCG CTAGGATAAATTAAAAGACAAAATTGCAGCAA GAGTTCCCCGCGGCAGCGGGGATAAACCG GAGCGACCAGTATCAAGATCGACAGGTTTTGC GAGTTCCCCGCGCCAGCGGGGATAAACCG ATCGATATGTACGTTAGCGAGGGGATCACGCA GAGTTCCCCGCACCAGCGGGGATAAACCA >CP029164|3|2|2003190-2003401|PILER-CR ACGTAACAAAACAACAGCAAAATATTATCGAC GAGTTCCCCACGTCAGCGGGGATAAACCG GCTCCGCCGGTTTGATCTCCGGTTTGCGCTGT GAGTTCCCCACGTCAGCGGGGATAAACCG GCAATTAATTTAGTTCCAGATGCTGCGAAAGA GAGTTCCCCACGTCAGCGGGGATAAACCG TTGGTCACTCGTCAAAAGTCGAGACGGTCGAA GAGTTCCCCACGTCAGCGGGGATAAACCGAATGATTGATATAAATCTGTGTACGGTGTCCGGAGTTCCCCGCGCCAGCGGGGATAAACCG CTAGGATAAATTAAAAGACAAAATTGCAGCAA GAGTTCCCCGCGGCAGCGGGGATAAACCG GAGCGACCAGTATCAAGATCGACAGGTTTTGC GAGTTCCCCGCGCCAGCGGGGATAAACCG ATCGATATGTACGTTAGCGAGGGGATCACGCA GAGTTCCCCGCACCAGCGGGGATAAACCA
>CP029164.1|AWH69691.1|2001873_2002545_+|7-carboxy-7-deazaguanine-synthase-QueE MQYPINEMFQTLQGEGYFTGVPAIFIRLQGCPVGCAWCDTKHTWEKLEDREVSLFSILAKTKESDKWGAASSEDLLAVIGRQGYTARHVVITGGEPCIHDLLPLTDLLEKNGFSCQIETSGTHEVRCTPNTWVTVSPKLNMRGGYEVLSQALERANEIKHPVGRVRDIEALDELLATLTDDKPRVIALQPISQKDDATRLCIETCIARNWRLSMQTHKYLNIA >CP029164.1|AWH69690.1|1999347_2000646_+|enolase MSKIVKIIGREIIDSRGNPTVEAEVHLEGGFVGMAAAPSGASTGSREALELRDGDKSRFLGKGVTKAVAAVNGPIAQALIGKDAKDQAGIDKIMIDLDGTENKSKFGANAILAVSLANAKAAAAAKGMPLYEHIAELNGTPGKYSMPVPMMNIINGGEHADNNVDIQEFMIQPVGAKTVKEAIRMGSEVFHHLAKVLKAKGMNTAVGDEGGYAPNLGSNAEALAVIAEAVKAAGYELGKDITLAMDCAASEFYKDGKYVLAGEGNKAFTSEEFTHFLEELTKQYPIVSIEDGLDESDWDGFAYQTKVLGDKIQLVGDDLFVTNTKILKEGIEKGIANSILIKFNQIGSLTETLAAIKMAKDAGYTAVISHRSGETEDATIADLAVGTAAGQIKTGSMSRSDRVAKYNQLIRIEEALGEKAPYNGRKEIKGQA >CP029164.1|AWH69689.1|1997623_1999261_+|CTP-synthetase MTTNYIFVTGGVVSSLGKGIAAASLAAILEARGLNVTIMKLDPYINVDPGTMSPIQHGEVFVTEDGAETDLDLGHYERFIRTKMSRRNNFTTGRIYSDVLRKERRGDYLGATVQVIPHITNAIKERVLEGGEGHDVVLVEIGGTVGDIESLPFLEAIRQMAVEIGREHTLFMHLTLVPYMAASGEVKTKPTQHSVKELLSIGIQPDILICRSDRAVPANERAKIALFCNVPEKAVISLKDVDSIYKIPGLLKSQGLDDYICKRFSLNCPEANLSEWEQVIFEEANPVSEVTIGMVGKYIELPDAYKSVIEALKHGGLKNRVSVNIKLIDSQDVETRGVEILKGLDAILVPGGFGYRGVEGMITTARFARENNIPYLGICLGMQVALIDYARHVANMENANSTEFVPDCKYPVVALITEWRDENGNVEVRSEKSDLGGTMRLGAQQCQLVDDSLVRQLYNAPTIVERHRHRYEVNNMLLKQIEDAGLRVAGRSGDDQLVEIIEVPNHPWFVACQFHPEFTSTPRDGHPLFAGFVKAASEFQKRQAK >CP029164.1|AWH69688.1|1996604_1997396_+|nucleoside-triphosphate-pyrophosphohydrolase MNQIDRLLTIMQRLRDPENGCPWDKEQTFATIAPYTLEETYEVLDAIAREDFDDLRGELGDLLFQVVFYAQMAQEEGRFDFNDICAAISDKLERRHPHVFADGSAGNSREVLARWEQIKTEERAQKAQHSALDDIPSSLPALMRAQKIQKRCANVGFDWTTLGPVVDKVYEEIDEVMYEARQVVVDQAKLEEEMGDLLFATVNLARHLGTKAEIALQKANEKFERRFREVERIVAARGLEMTGVDLETMEEVWQQVKRQEIDL >CP029164.1|AWH69687.1|1994274_1996509_+|GTP-pyrophosphokinase MVAVRSAHINKAGEFDPEKWIASLGITSQKSCECLAETWAYCLQQTQGHPDASLLLWRGVEMVEILSTLSMDIDTLRAALLFPLADANVVSEDVLRESVGKSVVNLIHGVRDMAAIRQLKATHTDSVSSEQVDNVRRMLLAMVDDFRCVVIKLAERIAHLREVKDAPEDERVLAAKECTNIYAPLANRLGIGQLKWELEDYCFRYLHPTEYKRIAKLLHERRLDREHYIEEFVGHLRAEMKAEGVKAEVYGRPKHIYSIWRKMQKKNLAFDELFDVRAVRIVAERLQDCYAALGIVHTHYRHLPDEFDDYVANPKPNGYQSIHTVVLGPGGKTVEIQIRTKQMHEDAELGVAAHWKYKEGAAAGGARSGHEDRIAWLRKLIAWQEEMADSGEMLDEVRSQVFDDRVYVFTPKGDVVDLPAGSTPLDFAYHIHSDVGHRCIGAKIGGRIVPFTYQLQMGDQIEIITQKQPNPSRDWLNPNLGYVTTSRGRSKIHAWFRKQDRDKNILAGRQILDDELEHLGISLKEAEKHLLPRYNFNDVDELLAAIGGGDIRLNQMVNFLQSQFNKPSAEEQDAAALKQLQQKSYTPQNRSKDNGRVVVEGVGNLMHHIARCCQPIPGDEIVGFITQGRGISVHRADCEQLAELRSHAPERIVDAVWGESYSAGYSLVVRVVANDRSGLLRDITTILANEKVNVLGVASRSDTKQQLATIDMTIEIYNLQVLGRVLGKLNQVPDVIDARRLHGS >CP029164.1|AWH69686.1|1992925_1994227_+|23S-rRNA-(uracil(1939)-C(5))-methyltransferase-RlmD MAQFYSAKRRTTTRQIITVSVNDLDSFGQGVARHNGKALFIPGLLPQENAEVTVTEDKKQYARAKVVRRLSDSPERETPRCPHFGVCGGCQQQHASVDLQQRSKSAALARLMKHEVSEVIADVPWGYRRRARLSLNYLPKTQQLQMGFRKAGSSDIVDVKQCPILVPQLEALLPKVRACLGSLQAIRHLGHVELVQATSGTLMILRHTAPLSSADREKLERFSHSEGLDLYLAPDSEILETVSGEMPWYDSNGLRLTFSPRDFIQVNAGVNQKMVARALEWLDVQPEDRVLDLFCGMGNFTLPLATQAASVVGVEGVPALVEKGQQNARLNGLQNVTFYHENLEEDVTKQPWAKNGFDKVLLDPARAGAAGVMQQIIKLEPIRIVYVSCNPATLARDSEALLKAGYTIARLAMLDMFPHTGHLESMVLFSRVK >CP029164.1|AWH69685.1|1990112_1992869_-|signal-transduction-histidine-kinase MTNYSLRARMMILILAPTVLIGLLLSIFFVVHRYNDLQRQLEDAGASIIEPLAVSTEYGMSLQNRESIGQLISVLHRRHSDIVRAISVYDENNRLFVTSNFHLDPSSMQLGSNVPFPRQLTVTRDGDIMILRTPIISESYSPDESPSSDAKNSQNMLGYIALELDLKSVRLQQYKEIFISSVMMLFCIGIALIFGWRLMRDVTGPIRNMVNTVDRIRRGQLDSRVEGFMLGELDMLKNGINSMAMSLAAYHEEMQHNIDQATSDLRETLEQMEIQNVELDLAKKRAQEAARIKSEFLANMSHELRTPLNGVIGFTRLTLKTELTPTQRDHLNTIERSANNLLAIINDVLDFSKLEAGKLILESIPFPLRSTLDEVVTLLAHSSHDKGLELTLNIKSDVPDNVIGDPLRLQQIITNLVGNAIKFTENGNIDILVEKRALSNTKVQIEVQIRDTGIGIPERDQSRLFQAFRQADASISRRHGGTGLGLVITQKLVNEMGGDISFHSQPNRGSTFWFHINLDLNPNIIIEGPSTQCLAGKRLAYVEPNSAAAQCTLDILSETPLEVVYSPTFSALPPAHYDMMLLGIAVTFREPLTMQHERLAKAVSMTDFLMLALPCHAQVNAEKLKQDGIGACLLKPLTPTRLLPALTEFCHHKQNTLLPVTDESKLAMTVMAVDDNPANLKLIGALLEDMVQHVELCDSGHQAVERAKQMPFDLILMDIQMPDMDGIRACELIHQLPHQQQTPVIAVTAHAMAGQKEKLLGAGMSDYLAKPIEEERLHNLLLRYKPGSGISSRVVTPEVNEIVVNPNATLDWQLALRQAAGKTDLARDMLQMLLDFLPEVRNKVEEQLVGENPEGLVDLIHKLHGSCGYSGVPRMKNLCQLIEQQLRSGTKEEDLEPELLELLDEMDNVAREASKILG >CP029164.1|AWH69684.1|1988541_1989882_+|glucarate-dehydratase MSSQFTTPVVTEMQVIPVAGHDSMLMNLSGAHAPFFTRNIVIIKDNSGHTGVGEIPGGEKIRKTLEDAIPLVVGKTLGEYKNVLTLVRNTFADRDAGGRGLQTFDLRTTIHVVTGIEAAMLDLLGQHLGVNVASLLGDGQQRSEVEMLGYLFFVGNRKATPLPYQSQPDDKCDWYRLRHEEAMTPDAVVRLAEAAYEKYGFNDFKLKGGVLAGEEEAESIVALAQRFPQARITLDPNGAWSLNEAIKIGKYLKGSLAYAEDPCGAEQGFSGREVMAEFRRATGLPTATNMIATDWRQMGHTLSLQSVDIPLADPHFWTMQGSVRVAQMCHEFGLTWGSHSNNHFDISLAMFTHVAAAAPGKITAIDTHWIWQEGNQRLTKEPLEIKGGLVQVPEKPGLGVEIDMDQVMKAHELYQKHGLGARDDAMGMQYLIPGWTFDNKRPCMVR >CP029164.1|AWH69683.1|1987180_1988521_+|glucarate-dehydratase MTTQSSPVITDMKVIPVAGHDSMLLNIGGAHNAYFTRNIVVLTDNAGHTGIGEAPGGEVIYQTLVKAIPMVLGQEVARLNKVVQQVHKGNQAADFDTFGKGAWTFELRVNAVAALEAALLDLLGQALNVPVCELLGPGKQRDAITVLGYLFYIGDRTKTDLPYLENTSGNHEWYQLRHQKAMNSEAVVRLAEASQDRYGFKDFKLKGGVLPGEQEIDTVRALKKRFPDARITVDPNGAWLLDEAISLCKGLNDVLTYAEDPCGAEQGFSGREVMAEFRRATGLPVATNMIATNWREMGHAVMLNAVDIPLADPHFWTLSGAVRVAQLCDDWGLTWGCHSNNHFDISLAMFTHVGAAAPGNPTAIDTHWIWQEGDCRLTKNPLEIKNGKIAVPDAPGLGVELDWEQVQKAHEAYKRLPGGARNDAGPMQYLIPGWTFDRKRPVFGRH >CP029164.1|AWH69682.1|1985826_1987179_+|MFS-transporter MSSLSQAASSVEKRTNARYWIVVMLFIVTSFNYGDRATLSIAGSEMAKDIGLDPVGMGYVFSAFSWAYVIGQIPGGWLLDRFGSKRVYFWSIFIWSMFTLLQGFVDIFSGFGIIVALFTLRFLVGLAEAPSFPGNSRIVAAWFPAQERGTAVSIFNSAQYFATVIFAPIMGWLTHEVGWSHVFFFMGGLGIVISFIWLKVIHEPNQHPGVNQKELEYIAAGGALINMDQQNTKVKVPFSVKWGQIKQLLGSRMMIGVYIGQYCINALTYFFITWFPVYLVQARGMSILKAGFVASVPAVCGFIGGVLGGIISDWLMRRTGSLNIARKTPIVMGMLLSMVMVFCNYVNVEWMIIGFMALAFFGKGIGALGWAVMADTAPKEISGLSGGLFNMFGNISGIVTPIAIGYIVGTTGSFNGALIYVGVHALIAVLSYLVLVGDIKRIELKPVAGQ >CP029164.1|AWH69692.1|2004038_2005517_-|sugar-kinase MSKKYIIGIDGGSQSTKVVMYDLEGNVVCEGKGLLQPMHTPDADTAEHPDDDLWASLCFASHDLMSQFAGNKEDIVGIGLGSIRCCRALLKADGTPAAPLISWQDARVTRPYEHTNPDVAYVTSFSGYLTHRLTGEFKDNIANYFGQWPVDYKTWAWSEDTAVMEKFNIPRQMLFDVQMPGTVLGHITRQAALATHFPAGLPVVCTTSDKPVEALGAGLLDDETAVISLGTYIALMMNGKALPKDPVAYWPIMSSIPQTLLYEGYGIRKGMWTVSWLRDMLGESLIQDAKAQDLSPEDLLNKKASCVPPGCNGLMTVLDWLTNPWEPYKRGIMIGFDSSMDYAWIYRSILESVALTLKNNYDNMCNEMNHFAKHVIITGGGSNSDLFMQIFADVFNLPARRNAINGCASLGAAINTAVGLGLYPDYATAVDKMVRVKDIFMPVESNAKRYDAMNKGIFKDLTKHTDVILKKSYEVMHGELGNADSIQSWSNA >CP029164.1|AWH69693.1|2005543_2006821_-|MFS-transporter MQHNSYRRWITLAIISFSGGVSFDLAYLRYIYQIPMAKFMGFSNTEIGLIMSTFGIAAIILYAPSGVIADKFSHRKMITSAMIITGLLGLLMATYPPLWVMLCIQVAFAITTILMLWSVSIKAASLLGDHSEQGKIMGWMEGLRGVGVMSLAVFTMWVFSRFAPDDSASLKTVIIIYSVVYILLGILCWFFVSDNNNLRSTNNEEKQSFQLSDILAVLRISTTWYCSMVIFGVFTIYAILSYSTNYLTEMYGMSLVAASYMGIVINKIFRALCGPLGGIITTYSKVKSPTRVIQILSVLGLLALTALLVTNSNPQSVAMGIGLILLLGFTCYASRGLYWACPGEARTPSYIMGTTVGICSVIGFLPDVFVYPIIGYWQDTLPAAEAYRNMWLMGMAALAMVIIFTFLLFQKIRTADSAPAMASSK >CP029164.1|AWH69694.1|2007139_2007925_+|KR-domain-containing-protein MSIESLNAFSMDFFSLKGKTAIVTGGNSGLGQAFAMALAKAGANIFIPSFVKDNGETKEMIEKQGVEVDFMQVDITAEGAPQKIIAACCERFGTVDILVNNAGICKLNKVLDFGRADWDPMIDVNLTAAFELSYEAAKIMIPQKSGKIINICSLFSYLGGQWSPAYSATKHALAGFTKAYCDELGQYNIQVNGIAPGYYATDITLATRSNPETNQRVLDHIPANRWGDTQDLMGAAVFLASQASNYVNGHLLVVDGGYLVR >CP029164.1|AWH69695.1|2007994_2009449_+|FAD-binding-oxidoreductase MSLSRAAIVDQLKEIVGADRVITDETVLKKNSIDRFRKFPDIHGIYTLPIPAAVVKLGSTEQVSRVLNFMNAHKINGVPRTGASATEGGLETVVENSVVLDGSAMNQIINIDIENMQATAQCGVPLEVLENALREKGYTTGHSPQSKPLAQMGGLVATRSIGQFSTLYGAIEDMVVGLEAVLADGTVTRIKNVPRRAAGPDIRHIIIGNEGALCYITEVTVKIFKFTPENNLFYGYVLEDMKTGFNILREIMVEGYRPSIARLYDAEDGTQHFTHFADGKCVLIFMAEGNPRIAKATGEGIAEIVARYPQCQRVDSKLIETWFNHLNWGPDKVAAERVQILKTGNMGFTTEVSGCWSCIHEIYESVINRIRTEFPHADDITMLGGHSSHSYQNGTNMYFVYDYNVVDCKPEEEIDKYHNPLNKIICEETIRLGGSMVHHHGIGKHRVHWSKLEHGSAWALLEGLKKQFDPNGIMNTGTIYPIEK >CP029164.1|AWH69696.1|2009542_2010880_+|MFS-transporter MNTSPVRMDDLPLNRFHCRIAALTFGAHLTDGYVLGVIGYAIIQLTPAMQLTPFMAGMIGGSALLGLFIGSLVLGWISDHIGRQKIFTFSFLLITLASFLQFFATTPEHLIGLRILIGIGLGGDYSVGHTLLAEFSPRRHRGVLLGAFSVVWTVGYVLASIAGHHFISESPEAWRWLLASAALPALLITLLRWGTPESPRWLLRQGRFAEAHAIVHRYFGPHVLLGDEVATATHKHIKTLFSSRYWRRTAFNSVFFVCLVIPWFVIYTWLPTIAQTIGLEDALTASLMLNALLIVGALLGLVLTHLLAHRKFLLGSFLLLAATLVVMACLPSGSSLTLLLFVLFSTTISAVSNLVGILPAESFPTDIRSLGVGFATAMSRLGAAVSTGLLPWVLAQWGMQVTLLLLAAVLLVGFVVTWLWAPETKALPLVAAGNVGGANEHSVSV >CP029164.1|AWH69697.1|2010857_2011637_+|electron-transfer-flavoprotein-subunit-beta/FixA-family-protein MNILLAFKAEPDAGMLAEKEWQAAAQGNSGPDVSLLRSLLGADEQAAAALLLAQRKNGTSMSLTALSMGDERALHWLRYLMALGFEEAVVLETAADLRFASEFVARHIAEWQRQNPLDLIITGCQSSEGQNGQTPFLLAEMLGWPCFTQVERFTLDAPFITLEQRTEHGLRCCRVRLPAVIAVRQCGEVALPVPGMRQRMAAGKAEIIRKTVAAELPAMQCLQLARAEQRRGATLIDGQTVAEKAQKLWQNYLRQRMQP >CP029164.1|AWH69698.1|2011633_2012494_+|electron-transfer-flavoprotein-subunit-alpha/FixB-family-protein MNITIVTINQENAAIASWLAAQDFSGCTLAHWQIEPQPVVAEQVLDALVEQWQRTPADVVLFPPGTFGDELSTRLAWRLHGASICQVTSLDIPTVSMRKSHWGNALTATLQTEKRPLCLSLARQAGALKNATLPSGMQQLNIVPGAPPDWLISVENLKNVTRDPLAEARRVLVVGQGGEADNQEIAMLVEKLGAEVGYSRARVMNGGVDAEKVIGISGHLLAPEVCIVVGASGAAALMAGVRNSKFVVAINHDASAAVFSQADVGVVDDWKAVLEALVTTIHADCQ >CP029164.1|AWH69699.1|2012640_2013216_-|glycerol-3-phosphate-responsive-antiterminator MPLLHLLRQNPVIAAVKDNASLQLAIDSECQFISVLYGNICTISQIVKKIKNAGKYAFIHVDLLEGASNKEVVIQFLKLVTEADGIISTKASMLKAARAEGFFCIHRLFIVDSISFHNIDKQVAQSNPDCIEILPGCMPKVLGWVTEKIRQPLIAGGLVCDEEDARNAINAGVVALSTTNTGVWTLAKKLL >CP029164.1|AWH69700.1|2013232_2013493_-|ferredoxin-family-protein MSVARNLWRVADAPHIVPADSVERQTVQRLINACPAGLFSLTPEGDLRVDYRGCLECGTCRLLCDESTLQQWRYPPSGFGITYRFG >CP029164.1|AWH69701.1|2013483_2014755_-|FAD-dependent-oxidoreductase MEDDCDIIIIGAGIAGTACALRCARAGLSVLLVERAEIPGSKNLSGGRLYTHALAELLPQFHLTAPLERRITHESLSLLTPDGATTFSSLQPGGESWSVLRARFDPWLVAEAEKEGVECIPGATVDALYEENGRVCGVICGDDILRARYVVLAEGANSVLAERHGLVTRPAGEAMALGIKEVLSLETSAIEERFHLENNEGAALLFSGGICDDLPGGAFLYTNQQTLSLGIVCPLSSLTQSRVPASELLARFKTHPAVRPLIKNTESLEYGAHLVPEGGLHSMPVQYAGNGWLLLGDALRSCVNTGISVRGMDMALTGTQAAAQTLISACQHREPQNLFALYHHNVERSLLWDVLQRYQHVPALLQRPGWYRAWPALMQDISRDLWDQGDKPVPPLRQLFWRHLRRHGLWHLAGDVIRSLRCL |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP029164_4 | 2029103-2030107 | TypeI-E |
I-E
Consensus repeat of CP029164_4
|
16 spacers
spacers of CP029164_4
>4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT CATTGAAAACATTGCCTTTATTTTATTTTTTG >4.2|2029193|32|CP029164|PILER-CR,CRISPRCasFinder,CRT GTGCCGCCGCTGGGCACTTCCTTCCCGTGAGT >4.3|2029254|32|CP029164|PILER-CR,CRISPRCasFinder,CRT CGGACGGTGGGAATATAGAAAATCCGTCCACC >4.4|2029315|32|CP029164|PILER-CR,CRISPRCasFinder,CRT TTATACTCTTTTCATCGACTAAGGAGGGGAGG >4.5|2029376|32|CP029164|PILER-CR,CRISPRCasFinder,CRT CAATAACGCAGCATCCAGGAAGCTGTTTCCGC >4.6|2029437|32|CP029164|PILER-CR,CRISPRCasFinder,CRT CGATCGGTGAAGAGGTCCGCGAAATACTCACT >4.7|2029498|32|CP029164|PILER-CR,CRISPRCasFinder,CRT GCGATAGTTGATTCAGCCGCGCCAGCGAATGT >4.8|2029559|32|CP029164|PILER-CR,CRISPRCasFinder,CRT GGGCCGCCGCGAATTTACACACGATTCAATAC >4.9|2029620|32|CP029164|PILER-CR,CRISPRCasFinder,CRT AACTGGTGCGCGACGGCTGGCTAACAAAACGA >4.10|2029681|32|CP029164|PILER-CR,CRISPRCasFinder,CRT CGTGGCTGCGCTGGCCGTTGCAGCAGTTTGAT >4.11|2029742|32|CP029164|PILER-CR,CRISPRCasFinder,CRT CGTAAACGCCCCGTCGCCATTAATTTCGGGGT >4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT TGGGATGAGCAAATAACGTCGTTTCCTAGAAA >4.13|2029864|32|CP029164|PILER-CR,CRISPRCasFinder,CRT CCGCCGTGCCAGTGATCCTCATACGGCCTGTT >4.14|2029925|32|CP029164|PILER-CR,CRISPRCasFinder,CRT AAATTAAGAACGGCGTAAACGACGGCAGCATG >4.15|2029986|32|CP029164|PILER-CR,CRISPRCasFinder,CRT GTGATGATTCAGAGCAGACATTAGCCCGCGCT >4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT GGTAAAAACACGGTCTGAACCGACATTCATGT |
cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas3 |
CRISPR arrays and Neighbor proteins around CP029164_4
The CRISPR arrays of CP029164_4 >merge|CP029164|4|2029103-2030107|PILER-CR,CRISPRCasFinder,CRT GTGTTCCCCGCGCCAGCGGGGATAAACCGCATTGAAAACATTGCCTTTATTTTATTTTTTGGTGTTCCCCGCGCCAGCGGGGATAAACCGGTGCCGCCGCTGGGCACTTCCTTCCCGTGAGTGTGTTCCCCGCGCCAGCGGGGATAAACCGCGGACGGTGGGAATATAGAAAATCCGTCCACCGTGTTCCCCGCGCCAGCGGGGATAAACCGTTATACTCTTTTCATCGACTAAGGAGGGGAGGGTGTTCCCCGCGCCAGCGGGGATAAACCGCAATAACGCAGCATCCAGGAAGCTGTTTCCGCGTGTTCCCCGCGCCAGCGGGGATAAACCGCGATCGGTGAAGAGGTCCGCGAAATACTCACTGTGTTCCCCGCGCCAGCGGGGATAAACCGGCGATAGTTGATTCAGCCGCGCCAGCGAATGTGTGTTCCCCGCGCCAGCGGGGATAAACCGGGGCCGCCGCGAATTTACACACGATTCAATACGTGTTCCCCGCGCCAGCGGGGATAAACCGAACTGGTGCGCGACGGCTGGCTAACAAAACGAGTGTTCCCCGCGCCAGCGGGGATAAACCGCGTGGCTGCGCTGGCCGTTGCAGCAGTTTGATGTGTTCCCCGCGCCAGCGGGGATAAACCGCGTAAACGCCCCGTCGCCATTAATTTCGGGGTGTGTTCCCCGCGCCAGCGGGGATAAACCGTGGGATGAGCAAATAACGTCGTTTCCTAGAAAGTGTTCCCCGCGCCAGCGGGGATAAACCGCCGCCGTGCCAGTGATCCTCATACGGCCTGTTGTGTTCCCCGCGCCAGCGGGGATAAACCGAAATTAAGAACGGCGTAAACGACGGCAGCATGCTGTTCCCCGCGCCAGCGGGGATAAACCGGTGATGATTCAGAGCAGACATTAGCCCGCGCTGTGTTCCCCGCGCCAGCGGGGATAAACCGGGTAAAAACACGGTCTGAACCGACATTCATGTGTGTTCCCCGCGTCAGCGGGGATAAACCG >CP029164|4|3|2029103-2030107|PILER-CR GTGTTCCCCGCGCCAGCGGGGATAAACCG CATTGAAAACATTGCCTTTATTTTATTTTTTG GTGTTCCCCGCGCCAGCGGGGATAAACCG GTGCCGCCGCTGGGCACTTCCTTCCCGTGAGT GTGTTCCCCGCGCCAGCGGGGATAAACCG CGGACGGTGGGAATATAGAAAATCCGTCCACC GTGTTCCCCGCGCCAGCGGGGATAAACCG TTATACTCTTTTCATCGACTAAGGAGGGGAGG GTGTTCCCCGCGCCAGCGGGGATAAACCG CAATAACGCAGCATCCAGGAAGCTGTTTCCGC GTGTTCCCCGCGCCAGCGGGGATAAACCG CGATCGGTGAAGAGGTCCGCGAAATACTCACT GTGTTCCCCGCGCCAGCGGGGATAAACCG GCGATAGTTGATTCAGCCGCGCCAGCGAATGT GTGTTCCCCGCGCCAGCGGGGATAAACCG GGGCCGCCGCGAATTTACACACGATTCAATAC GTGTTCCCCGCGCCAGCGGGGATAAACCG AACTGGTGCGCGACGGCTGGCTAACAAAACGA GTGTTCCCCGCGCCAGCGGGGATAAACCG CGTGGCTGCGCTGGCCGTTGCAGCAGTTTGAT GTGTTCCCCGCGCCAGCGGGGATAAACCG CGTAAACGCCCCGTCGCCATTAATTTCGGGGT GTGTTCCCCGCGCCAGCGGGGATAAACCG TGGGATGAGCAAATAACGTCGTTTCCTAGAAA GTGTTCCCCGCGCCAGCGGGGATAAACCG CCGCCGTGCCAGTGATCCTCATACGGCCTGTT GTGTTCCCCGCGCCAGCGGGGATAAACCG AAATTAAGAACGGCGTAAACGACGGCAGCATG CTGTTCCCCGCGCCAGCGGGGATAAACCG GTGATGATTCAGAGCAGACATTAGCCCGCGCT GTGTTCCCCGCGCCAGCGGGGATAAACCG GGTAAAAACACGGTCTGAACCGACATTCATGT GTGTTCCCCGCGTCAGCGGGGATAAACCG >CP029164|4|4|2029103-2030107|CRISPRCasFinder GTGTTCCCCGCGCCAGCGGGGATAAACCG CATTGAAAACATTGCCTTTATTTTATTTTTTG GTGTTCCCCGCGCCAGCGGGGATAAACCG GTGCCGCCGCTGGGCACTTCCTTCCCGTGAGT GTGTTCCCCGCGCCAGCGGGGATAAACCG CGGACGGTGGGAATATAGAAAATCCGTCCACC GTGTTCCCCGCGCCAGCGGGGATAAACCG TTATACTCTTTTCATCGACTAAGGAGGGGAGG GTGTTCCCCGCGCCAGCGGGGATAAACCG CAATAACGCAGCATCCAGGAAGCTGTTTCCGC GTGTTCCCCGCGCCAGCGGGGATAAACCG CGATCGGTGAAGAGGTCCGCGAAATACTCACT GTGTTCCCCGCGCCAGCGGGGATAAACCG GCGATAGTTGATTCAGCCGCGCCAGCGAATGT GTGTTCCCCGCGCCAGCGGGGATAAACCG GGGCCGCCGCGAATTTACACACGATTCAATAC GTGTTCCCCGCGCCAGCGGGGATAAACCG AACTGGTGCGCGACGGCTGGCTAACAAAACGA GTGTTCCCCGCGCCAGCGGGGATAAACCG CGTGGCTGCGCTGGCCGTTGCAGCAGTTTGAT GTGTTCCCCGCGCCAGCGGGGATAAACCG CGTAAACGCCCCGTCGCCATTAATTTCGGGGT GTGTTCCCCGCGCCAGCGGGGATAAACCG TGGGATGAGCAAATAACGTCGTTTCCTAGAAA GTGTTCCCCGCGCCAGCGGGGATAAACCG CCGCCGTGCCAGTGATCCTCATACGGCCTGTT GTGTTCCCCGCGCCAGCGGGGATAAACCG AAATTAAGAACGGCGTAAACGACGGCAGCATG CTGTTCCCCGCGCCAGCGGGGATAAACCG GTGATGATTCAGAGCAGACATTAGCCCGCGCT GTGTTCCCCGCGCCAGCGGGGATAAACCG GGTAAAAACACGGTCTGAACCGACATTCATGT GTGTTCCCCGCGTCAGCGGGGATAAACCG >CP029164|4|2|2029103-2030107|CRT GTGTTCCCCGCGCCAGCGGGGATAAACCG CATTGAAAACATTGCCTTTATTTTATTTTTTG GTGTTCCCCGCGCCAGCGGGGATAAACCG GTGCCGCCGCTGGGCACTTCCTTCCCGTGAGT GTGTTCCCCGCGCCAGCGGGGATAAACCG CGGACGGTGGGAATATAGAAAATCCGTCCACC GTGTTCCCCGCGCCAGCGGGGATAAACCG TTATACTCTTTTCATCGACTAAGGAGGGGAGG GTGTTCCCCGCGCCAGCGGGGATAAACCG CAATAACGCAGCATCCAGGAAGCTGTTTCCGC GTGTTCCCCGCGCCAGCGGGGATAAACCG CGATCGGTGAAGAGGTCCGCGAAATACTCACT GTGTTCCCCGCGCCAGCGGGGATAAACCG GCGATAGTTGATTCAGCCGCGCCAGCGAATGT GTGTTCCCCGCGCCAGCGGGGATAAACCG GGGCCGCCGCGAATTTACACACGATTCAATAC GTGTTCCCCGCGCCAGCGGGGATAAACCG AACTGGTGCGCGACGGCTGGCTAACAAAACGA GTGTTCCCCGCGCCAGCGGGGATAAACCG CGTGGCTGCGCTGGCCGTTGCAGCAGTTTGAT GTGTTCCCCGCGCCAGCGGGGATAAACCG CGTAAACGCCCCGTCGCCATTAATTTCGGGGT GTGTTCCCCGCGCCAGCGGGGATAAACCG TGGGATGAGCAAATAACGTCGTTTCCTAGAAA GTGTTCCCCGCGCCAGCGGGGATAAACCG CCGCCGTGCCAGTGATCCTCATACGGCCTGTT GTGTTCCCCGCGCCAGCGGGGATAAACCG AAATTAAGAACGGCGTAAACGACGGCAGCATG CTGTTCCCCGCGCCAGCGGGGATAAACCG GTGATGATTCAGAGCAGACATTAGCCCGCGCT GTGTTCCCCGCGCCAGCGGGGATAAACCG GGTAAAAACACGGTCTGAACCGACATTCATGT GTGTTCCCCGCGTCAGCGGGGATAAACCG
>CP029164.1|AWH69713.1|2028712_2029006_+|type-I-E-CRISPR-associated-endoribonuclease-Cas2 MSMVVVVTENVPPRLRGRLAIWLLEVRAGVYVGDTSKRIREMIWQQISQLAGCGNVVMAWATNTESGFEFQTWGENRRIPVDLDGLRLVSFLPVDNQ >CP029164.1|AWH69712.1|2027792_2028716_+|type-I-E-CRISPR-associated-endonuclease-Cas1 MTFVPLSPIPLKDRTSMIFLQYGQIDVLDGAFVLIDKTGIRTHIPVGSVACIMLEPGTRVSHAAVHLAATVGTLLVWVGEAGVRVYSSGQPGGARADKLLYQAKLALTEDLRLKVVRKMYELRFREPPPARRSVEQLRGIEGSRVRQTYALLAKQYGVKWNGRKYDPKDWEKGDVVNRCISAATSCLYGISEAAVLAAGYAPAIGFIHSGKPLSFVYDIADIIKFDSVVPKAFEIAARQPAEPDKEVRLACRDIFRSSKLTGKLIPLIEEVLAAGEIEPPQPAPDMLPPAIPEPETLGDSGHRGRGG >CP029164.1|AWH69711.1|2027145_2027796_+|type-I-E-CRISPR-associated-protein-Cas6/Cse3/CasE MYLSRITLHTGQLSPAQLLHLVDRGEYVMHQWLWDLFPGGKERQFLYRREELQGAFRFFVLSQERPAESDIFTIECRPFAPELRTGQSLCFNLRANPTICKAGKRHDLLMEAKRQVRGQVEGSDVWLHQQQAALDWLAAQGERSGFTLLDTSVDAYRQQQLRRENSRQLIQFSSVDYTGMLTVTDPGLFLQRLSQGYGKSRAFGCGLMLIKPGAEA >CP029164.1|AWH69710.1|2026417_2027164_+|type-I-E-CRISPR-associated-protein-Cas5/CasD MSQYLIFQLHGPMASWGVDAPGEVRHTHELPSRSALLGLLAAGVGIRRDDTERLNAFNRHYSLVVCASRNPRWARDYHTVQMPKEVRKARYFSRREELSAPDLLSAIISRRDYYTDAWWMVAVATTPDAPYSLEQLQDGLRHPVFPLYLGRKSHPLALPLAPLLLEGNASDVLRNAYQQYQDSFRELKVSLPKLQDECWWEGEHDGLVASKILRRRDVPLNRQQWLFGERTINQGPWLSKEEPCTSQE >CP029164.1|AWH69709.1|2025351_2026407_+|type-I-E-CRISPR-associated-protein-Cas7/Cse4/CasC MTTFIQLHLLTAYPAANLNRDDTGAPKTVVLGGATRLRVSSQSLKRAWRTSALFEQALAGHIGIRSGRIAREAATILIEKGIEEKKAIEWAAKIADYLGKAKNDKKPKDPLTNAETEQLVHISPAEFDAVKALAHQLAEEKRAPKEEDLALLRKDRMAVDIAMFGRMLANKPEFNVEAACQVAHAFGVSETIVEDDFFTAVDDLRQASEDAGAGHLGETGFGSALFYTYICIDKDLLVENLGGDEALANQTLRAFTEAALKVSPTGKQNSFASRAYASWALAEKGTEQPRSLAAAFYEPINGTLQLDVAVQRITTLRENMNTVYEQKTECASFDVMNKQGSMKDVLDFICA >CP029164.1|AWH69708.1|2024800_2025337_+|type-I-E-CRISPR-associated-protein-Cse2/CasB MSIVKEEHKATLRKWHEELQEKRGNRASLRRSTTVNDVCLSEGFRSLLMQTHTLWKIESQEWRFTALALVAAVAANVKAIDERQPFAAQLAAVMSEGRFTRLSAVKTPDELLRQLRRAVKLLNGSVNLISLADDIFRWCQESDDLLNHHRRQQRPTEFIRIRWALEYYQAGDADNEQN >CP029164.1|AWH69707.1|2020486_2023144_+|CRISPR-associated-helicase/endonuclease-Cas3 MTFFYFWGKTRRGEKDGGEDYHLLCWHSLDVAAMGYLMVKSNCFGLAGYFRQLGFADTELAAQFFAWLLCWHDIGKFARSFQQLYLHPQLKVPEGARKNYEKISHSTLGYWLWNHYLSECQELLPSSSLSPRKLKRVMEMWMPMTTGHHGRPPERMDELDNFLPEDKGAARDFLLAIKVLFPLIEIPTFWDDDEGVELIKQLSWYISATVVLADWTGSSTRYFPRVAQAMDIEDYWQKTLVQAQNALTVFPPKAESAPFTGINTLFPFIENPTPLQQKVLDLDISQPGPQLFILEDVTGAGKTEAALILAHRLMAAGKAQGLFFGLPTMATANAMYDRLVKTWLAFYSPESRPSLVLAHSARTLMDRFNESLWSGDLVGSEEPDEQTFSQGCAAWFADSNKKALLAEIGVGTLDQAMMAVMPFKHNNLRLLGLSNKILLADEIHACDAYMSCILEGLIERQARGGNSVILLSATLSQQQRDKLVAAFARGIEGQQEAPLLEKDDYPWLTHVTKSDVHSHRVATRKDVERSVSVGWLHSEQECIARIESAVSQGKCIAWIRNSVDDAIQVHRQLLARGVIPASSLSLFHSRFAFSDRQRIETETLARFGKEDCSQRAGKVLICTQVLEQSVDCDLDEMISDLAPVDLLIQRAGRLQRHIRDINGLLKRDGKDERSPPEFLILAPVWDDSPGDEWFGSAMRNSAYVYPDHGRIWLTQRVLREQGAIQMPHAARLLIESVYGEDVAMPEGFARSEQEQVGKYYCDRAMAKKFVLNFKPGYAANINDYLPEKLSTRLAEESVSLWLATCIDGVVKPYATGAHAWEMSVVRVRRSWWKKHRDEFSLLEGDAFRQWCIEQRQDPEMANVILVTDDESCGYSAMEGLTGKVG >CP029164.1|AWH69706.1|2020096_2020249_+|Hok/Gef-family-protein MLTKYALVAIIVLCCTVLGFTLMVGDSLCELSIRERGMEFKAVLAYESKK >CP029164.1|AWH69705.1|2019098_2019833_+|phosphoadenosine-phosphosulfate-reductase MSKLDLNALNELPKVDRILALAETNAELEKLDAEGRVAWALDNLPGEYVLSSSFGIQAAVSLHLVNQIRPDIPVILTDTGYLFPETYRFIDELTDKLKLNLKVYRATESAAWQEARYGKLWEQGVEGIEKYNEINKVEPMNRALKELNAQTWFAGLRREQSGSRANLPVLAIQRGVFKVLPIIDWDNRTIYQYLQKHGLKYHPLWDEGYLSVGDTHTTRKWEPGMAEEETRFFGLKRECGLHEG >CP029164.1|AWH69704.1|2017312_2019025_+|assimilatory-sulfite-reductase-(NADPH)-hemoprotein-subunit MSEKHPGPLVVEGKLTDAERMKLESNYLRGTIAEDLNDGLTGGFKGDNFLLIRFHGMYQQDDRDIRAERAEQKLEPRHAMLLRCRLPGGVITTKQWQAIDKFAGENTIYGSIRLTNRQTFQFHGILKKNVKPVHQMLHSVGLDALATANDMNRNVLCTSNPYESQLHAEAYEWAKKISEHLLPRTRAYAEIWLDQEKVATTDEEPILGQTYLPRKFKTTVVIPPQNDIDLHANDMNFVAIAENGKLVGFNLLVGGGLSIEHGNKKTYARTASEFGYLPLEHTLAVAEAVVTTQRDWGNRTDRKNAKTKYTLERVGVETFKAEVERRAGIKFEPIRPYEFTGRGDRIGWVKGIDDNWHLTLFIENGRILDYPGRPLKTGLLEIAKIHKGDFRITANQNLIIAGVPESEKAKIEKVAKESGLMNAVTPQRENSMACVSFPTCPLAMAEAERFLPSFIDNIDNLMAKHGVSDEHIVMRVTGCPNGCGRAMLAEVGLVGKAPGRYNLHLGGNRIGTRIPRMYKENITEPEILASLDELIGRWAKEREAGEGFGDFTVRAGIIRPVLDPARDLWD >CP029164.1|AWH69714.1|2030188_2031226_-|aminopeptidase MFSALRHRTAALALGVCFILPVHASSPKPGDFANTQARHIATFFPGRMTGTPAEMLSADYIRQQFQQMGYRSDIRTFNSRYIYTARDNRKSWHNVTGSTVIAAHEGKAPQQIIIMAHLDTYAPLSDADADANLGGLTLQGMDDNAAGLGVMLELAERLKNTPTEYGIRFVATSGEEEGKLGAENLLKRMSDTEKKNTLLVINLDNLIVGDKLYFNSGVKTPEAVRKLTRDRALAIARSHGIAATTNPGLNKNYPKGTGCCNDAEVFDKAGIAVLSVEATNWNLGNKDGYQQRAKTAAFPAGNSWHDVRLDNQQHIDKALPGRIERRCRDVMRIMLPLVKELAKAS >CP029164.1|AWH69715.1|2031477_2032386_+|sulfate-adenylyltransferase-subunit-2 MDQKRLTHLRQLEAESIHIIREVAAEFSNPVMLYSIGKDSSVMLHLARKAFYPGTLPFPLLHVDTGWKFREMYEFRDRTAKAYGCELLVHKNPEGVAMGINPFVHGSAKHTDIMKTEGLKQALNKYGFDAAFGGARRDEEKSRAKERIYSFRDRFHRWDPKNQRPELWHNYNGQINKGESIRVFPLSNWTEQDIWQYIWLENIDIVPLYLAAERPVLERDGMLMMIDDNRIDLQPGEVIKKRMVRFRTLGCWPLTGAVESNAQTLPEIIEEMLVSTTSERQGRVIDRDQAGSMELKKRQGYF >CP029164.1|AWH69716.1|2032387_2033815_+|sulfate-adenylyltransferase MNTALAQQIANEGGVEAWMIAQQHKSLLRFLTCGSVDDGKSTLIGRLLHDTRQIYEDQLSSLHNDSKRHGTQGEKLDLALLVDGLQAEREQGITIDVAYRYFSTEKRKFIIADTPGHEQYTRNMATGASTCELAILLIDARKGVLDQTRRHSFISTLLGIKHLVVAINKMDLVDYSEETFTRIREDYLTFAGQLPGNLDIRFVPLSALEGDNVASQSESMPWYSGPTLLEVLETVEIQRVVDAQPMRFPVQYVNRPNLDFRGYAGTLASGRVEVGQRVKVLPSGVESNVARIVTFDGDREEAFSGEAITLVLTDEIDISRGDLLLAADEALPAVQSASVDVVWMAEQPLSPGQSYDIKIAGKKTRARVDGIRYQVDINNLTQREVENLPLNGIGLVDLTFDEPLVLDRYQQNPVTGGLIFIDRLSNVTVGAGMVHEPVSQATAAPSEFSAFELELNALVRRHFPHWGARDLLGDK >CP029164.1|AWH69717.1|2033814_2034420_+|adenylyl-sulfate-kinase MALHDENVVWHSHPVTPQQREQHHGHRGVVLWFTGLSGSGKSTVAGALEEALHKLGVSTYLLDGDNVRHGLCSDLGFSDADRKENIRRVGEVANLMVEAGLVVLTAFISPHRAERQMVRERVGEGRFIEVFVDTPLAICEARDPKGLYKKARAGELRNFTGIDSVYEAPESAEIHLNGEQLVTNLVQQLLDLLRQNDIIRS >CP029164.1|AWH69718.1|2034469_2034793_+|hypothetical-protein MRNSHNITLTNNDSLTEDEETTWSLPGAVVGFISWLFALAMPMLIYGSNTLFFFIYTWPFFLALMPVAVVVGIALHSLMDGKLRYSIVFTLVTVGIMFGALFMWLLG >CP029164.1|AWH69719.1|2034986_2035298_+|cell-division-protein-FtsB MGKLTLLLLAILVWLQYSLWFGKNGIHDYTRVNNDVAAQQATNAKLKARNDQLFAEIDDLNGGQEALEERARNELSMTRPGETFYRLVPDASKRAQSAGQNNR >CP029164.1|AWH69720.1|2035316_2036027_+|2-C-methyl-D-erythritol-4-phosphate-cytidylyltransferase MATTHLDVCAVVPAAGFGRRMQTECPKQYLSIGNQTILEHSVHALLAHPRVKHVVIAISPGDSRFAQLPLANHPQITVVDGGEERADSVLAGLKAAGDAQWVLVHDAARPCLHQDDLARLLALSETSRTGGILAAPVRDTMKRAETGKNAIAHTVDRNGLWHALTPQFFPRELLHDCLTRALNEGATITDEASALEYCGFHPQLVEGRADNIKVTRPEDLALAEFYLTRTIHQENT >CP029164.1|AWH69721.1|2036026_2036506_+|2-C-methyl-D-erythritol-2,4-cyclodiphosphate-synthase MRIGHGFDVHAFGGEGPIIIGGVRIPYEKGLLAHSDGDVALHALTDALLGAAALGDIGKLFPDTDPAFKGADSRELLREAWRRIQAKGYALGNVDVTIIAQAPKMLPHIPQMRVFIAEDLGCHMDDVNVKATTTEKLGFTGRGEGIACEAVALLIKATK >CP029164.1|AWH69722.1|2036502_2037552_+|tRNA-pseudouridine(13)-synthase-TruD MIEFDNLTYLHGKPQGTGLLKANPEDFVVVEDLGFEPDGEGEHILVRILKNGCNTRFVADALAKFLKIHAREVSFAGQKDKHAVTEQWLCARVPGKEMPDLSAFQLEGCQVMEYARHKRKLRLGALKGNAFTLVLREVSNRDDVEQRLIDICVKGVPNYFGAQRFGIGGSNLQGALRWAQTNTPVRDRNKRSFWLSAARSALFNQIVAERLKKADVNQVVDGDALQLAGRGSWFVATTEELAELQRRVNDKELMITAALPGSGEWGTQREALAFEQAAVAEETELQTLLVREKVEAARRAMLLYPQQLSWNWWDDVTVEIRFWLPAGSFATSVVRELINTTGDYAHIAE >CP029164.1|AWH69723.1|2037532_2038294_+|5'/3'-nucleotidase-SurE MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFENGDIAVQMGTPTDCVYLGVNALMRPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLDGHKHYDTAAAVTCSILRALCKEPLRTGRILNINVPDLPLDQIKGIRVTRCGTRHPADQVIPQQDPRGNTLYWIGPPGGKCDAGPGTDFAAVDEGYVSITPLHVDLTAHSAQDVVSDWLNSVGVGTQW |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP029164_5 | 3135822-3135945 | Orphan |
NA
Consensus repeat of CP029164_5
|
1 spacers
spacers of CP029164_5
>5.1|3135865|38|CP029164|CRISPRCasFinder CGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAA |
CRISPR arrays and Neighbor proteins around CP029164_5
The CRISPR arrays of CP029164_5 >merge|CP029164|5|3135822-3135945|CRISPRCasFinder CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTACGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAACGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA >CP029164|5|5|3135822-3135945|CRISPRCasFinder CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA CGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAA CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA
>CP029164.1|AWH70716.1|3135381_3135687_-|monooxygenase MATLLQLHFAFNGPFGDAMVEQLEPLAESINQEPGFLWKVWTESEKNHEAGGIYLFTDEKSALAYLEKHTARLKNLGVEEVVAKVFDVNEPLSQINQAKLA >CP029164.1|AWH70715.1|3133645_3135256_-|FAD-NAD(P)-binding-protein MKKIAIVGAGPTGIYTLFSLLQQQTPLSISIFEQADEAGVGMPYSDEENSKMMLANIASIEIPPIYCTYLEWLQKQEASHLQRYGVKKETLHDRQFLPRILLGEYFRDQFLRLVDQARQQKFAVAVYESCQVTDLQITNAGVMLATNQDLPSETFDLAVIATGHVWPDEEEATRTYFPSPWSGLMEAKVDACNVGIMGTSLSGLDAAMAVAIQHGSFIEDDKQHVVFHRDNASEKLNITLMSRTGILPEADFYCPIPYEPLHIVTDQALNAEIQKGEVGLLDRVFRLIVEEIKFADPDWSQRIALESLNVDSFAQAWFAERKQRDPFDWAEKNLQEVERNKREKHTVPWRYVILRLHEAVQEIVPHLNEHDHKRFSKGLARVFIDNYAAIPSESIRRLLALREAGIIHILALGEDYEMEINESRTVLKTEDNSYSFDVFIDARGQRPLKVKDIPFPGLREQLQKTGDEIPDVGEDYTLQQPEEIRGRVAFGALPWLMHDQPFVQGLTACAEIGEAIAQAVVKPASRARRRLSFNQM >CP029164.1|AWH70714.1|3132827_3133640_+|hypothetical-protein MIITRADLREWRIGAVMYRWFLRHFPRGGSYADIHHALIEEGYTDWAESLVEYAWKKWLADENFAHQEVSSMQKLATDPGERAFCSQFARSDDHARIGCCEDNARIATAGYAAQIASMGYSVRIGSVGFNSHIGSSGERARVAVTGNSSRISSAGDSSRIANTGMRVRVCTLGERCHVASNGDLVQIASFGANARIANSGDNVHIIASGEDSTIVSTGVVDSIILGPGGSAALAYHDGERVRFAVAIEGENNIRAGVRYRLNEQHQFVEC >CP029164.1|AWH70713.1|3132038_3132824_+|thiosulfate-reductase-cytochrome-B-subunit MNPSQHAEQFQSQLANYVPLFTPQFWPVWLIIAGLLLVGMWLVLGLHALLRARGVKKSANDHGEKVYLYSKAVRLWHWSNALLFVLLLASGLINHFALVGATTVKSLVAVHEVCGFLLLACWLGFVLINAVGGNGHHYRIRRQGWLERAAKQTRFYLFGIMQGEEHPFPATTQSKFNPLQQVAYVGVMYGLLPLLLLTGLLCLYPQAVGDVFPGVRYWLLQAHFALAFISLFFIFGHLYLCTTGRTPHETFKSMVDGYHRH >CP029164.1|AWH70712.1|3131373_3132042_+|4Fe-4S-dicluster-domain-containing-protein MSFTRRKFVLGMGTVIFFTGSASSLLANTMQEKKVRYAMIHDESRCNGCNICARACRKTNHVPAQGSRLSIAHIPVTDNDNETQYHFFRQSCQHCEDAPCIDVCPTGASWRDEQGIVRVEKSQCIGCSYCIGACPYQVRYLNPVTKVADKCDFCAESRLAKGFPPICVSACPEHALIFGREDSPEIQAWLQENKYYQYQLPGAGKPHLYRRFGQHLIKKENV >CP029164.1|AWH70711.1|3130662_3131310_+|hypothetical-protein MGEMNHQDELPLAKVSEVDEAKRQWLQGMRHPVDTVTEPEPAEILAEFIRQHSAAGQLVARAVFLSPPYSVAEEELSVLLENIKQNGDHADIACMTGSQDDYYYSTQAMSENYAAMSLQVVEQDICRAIAHAVRFECQTYPRPYKVAMLMQAPYYFQEAQIEAAIAAMDIAPEYADIRQVESSTAVLYLFSERFMTYGKAYGLCEWFEVEQFQNP >CP029164.1|AWH70710.1|3128556_3130659_+|aldehyde-ferredoxin-oxidoreductase MANGWTGNILRVNLTTGNITLEDSSKFKSFVGGMGFGYKIMYDEVPPGTKPFDEANKLVFATGPLTGSGAPCSSRVNITSLSTFTKGNLVVDAHMGGFFAAQMKFAGYDVIIIEGKAKSPVWLNIKDDKVSLEKADLLWGKGTRATTEEICRLTSPETCVAAIGQAGENLVPLSGMLNSRNHSGGAGTGAIMGSKNLKAIAVEGTKGVNIADRQEMKRLNDYMMTELIGANNNHVVPSTPQSWAEYSDPKSRWTARKGLFWGAAEGGPIETGEIPPGNQNTVGFRTYKSVFDLGPAAEKYTVKMSGCHSCPIRCMTQMNIPRVKEFGVPSTGGNTCVANFVHTTIFPNGPKDFEDKDDGRVIGNLVGLNLFDDYGLWCNYGQLHRDFTYCYSKGVFKRVLPAEEYAEIRWDQLEAGDVNFIKDFYYRLAHRVGELSHLADGSYAIAERWNLGEEYWGYAKNKLWSPFGYPVHHANEASAQVGSIVNCMFNRDCMTHTHINFIGSGLPLKLQREVAKELFGSEDAYDETKNYTPINDAKIKYAKWSLLRVCLHNAVTLCNWVWPMTVSPLKSRNYRGDLALEAKFFKAITGEDMTQEKLDLAAERIFTLHRAYTVKLMQTKDMRNEHDLICSWVFDKDPQIPVFTEGTDKMDRDDMHASLTMFYKEMGWDPQLGCPTRKTLQRLGLEDIAADLAAHNLLPA >CP029164.1|AWH70709.1|3127909_3128536_+|4Fe-4S-dicluster-domain-containing-protein MTPVDRPLLNIGLTRLEFLRISGKGLAGLTIAPALLSLLGCKQEDIDSGTVGLINTPKGVLVTQRARCTGCHRCEISCTNLNDGSVGTFFSRIKIHRNYFFGDNGVGSGGGLYGDLNYTADTCRQCKEPQCMNVCPIGAITWQQKEGCITVDHKRCIGCSACTTACPWMMATVNTESKKSSKCVLCGECANACPTGALKIIEWKDITV >CP029164.1|AWH70708.1|3127244_3127454_+|fumarate-hydratase-FumD MGNRTKEDELYREMCRVVGKVVLEMRDLGQEPKHIVIAGVLRTALANKRIQRSELEKQAMETVINALVK >CP029164.1|AWH70707.1|3125275_3126688_-|pyruvate-kinase MKKTKIVCTIGPKTESEEMLAKMLDAGMNVMRLNFSHGDYAEHGQRIQNLRNVMSKTGKTAAILLDTKGPEIRTMKLEGGNDVSLKAGQTFTFTTDKSVIGNSEMVAVTYEGFTTDLSVGNTVLVDDGLIGMEVTAIEGNKVICKVLNNGDLGENKGVNLPGVSIALPALAEKDKQDLIFGCEQGVDFVAASFIRKRSDVIEIREHLKAHGGENIHIISKIENQEGLNNFDEILEASDGIMVARGDLGVEIPVEEVIFAQKMMIEKCIRARKVVITATQMLDSMIKNPRPTRAEAGDVANAILDGTDAVMLSGESAKGKYPLEAVSIMATICERTDRVMNSRLEFNNDNRKLRITEAVCRGAVETAEKLDAPLIVVATQGGKSARAVRKYFPDATILALTTNEKTAHQLVLSKGVVPQLVKEITSTDDFYRLGKELALQSGLAHKGDVVVMVSGALVPSGTTNTASVHVL >CP029164.1|AWH70717.1|3136259_3137516_+|hypothetical-protein MGSDAKNLMSDGNVQIVKTGEVIGATQLTEGELIVEAGGRAENTVVTGAGWLKVATGGIAKCTQYGNNGTLSVSDGAIATDIVQSEGGAISLSTLATVNGRHPEGEFSVDKGYACGLLLENGGNLRVLEGHRAEKIILDQEGGLLVNGTTSAVVVDEGGELLVYPGGEASNCEINQGGVFMLAGKANDTLLAGGTMNNLGGEDSDTIVENGAIYRLGTDGLQLYSSGKTQNLSVNVGGRAEVHAGTLENAVIQGGTVILLSPTSADENFVVEEDRAPVELTGSVALLDGASMIIGYGADLQQSTITVQQGGVLILDGSTVKGDSVTFSVGNINLNGGKLWLITDAATQVHLKVKRLRGEGAICLQTSAKEISPDFINVKGEVTGDIHVEITDASRQTLCNALKLQPDEDGIGATLQPA >CP029164.1|AWH70718.1|3137556_3138930_-|multidrug-resistance-protein-MdtK MQKYISEARLLLALAIPVILAQIAQTAMGFVDTVMAGGYSATDMAAVAIGTSIWLPAILFGHGLLLALTPVIAQLNGSGRRERIAHQVRQGFWLAGFVSVLIMLVLWNAGYIIRSMENIDPALADKAVGYLRALLWGAPGYLFFQVARNQCEGLAKTKPGMVMGFIGLLVNIPVNYIFIYGHFGMPELGGVGCGVATAAVYWVMFLAMVSYIKRARSMRDIRNEKGTAKPDPAVMKRLIQLGLPIALALFFEVTLFAVVALLVSPLGIVDVAGHQIALNFSSLMFVLPMSLAAAVTIRVGYRLGQGSTLDAQTAARTGLMVGVCMATLTAIFTVSLREQIALLYNDNPEVVTLAAHLMLLAAVYQISDSIQVIGSGILRGYKDTRSIFYITFTAYWVLGLPSGYILALTDLVVEPMGPAGFWIGFIIGLTSAAIMMMLRMRFLQRLPSAIILQRAAR >CP029164.1|AWH70719.1|3139144_3139786_+|riboflavin-synthase MFTGIVQGTAKLVSIDEKPNFRTHVVELPDHMLEGLETGASVAHNGCCLTVTEINGNHVSFDLMKETLRITNLGDLKVGDWVNVERAAKFSDEIGGHLMSGHIMTTAEVAKILTSENNRQIWFKVQDSQLMKYILYKGFIGIDGISLTVGEVTPTRFCVHLIPETLERTTLGKKKLGARVNIEIDPQTQAVVDTVERVLAARENAMNQPGTEA >CP029164.1|AWH70720.1|3139825_3140974_-|cyclopropane-fatty-acyl-phospholipid-synthase MSSSCIEEVSVPDDNWYRIANELLSRAGIAINGSAPADIRVKNPDFFKRVLQEGSLGLGESYMDGWWECDRLDMFFSKVLRAGLENQLPHHFKDTLRIAGARLFNLQSKKRAWIVGKEHYDLGNDLFSRMLDPFMQYSCAYWKDADNLESAQQAKLKMICEKLQLKPGMRVLDIGCGWGGLAHYMASNYDVSVVGVTISAEQQKMAQERCEGLDVTILLQDYRDLNDQFDRIVSVGMFEHVGPKNYDTYFAVVDRNLKPEGIFLLHTIGSKKTDLNVDPWINKYIFPNGCLPSVRQIAQSSEPHFVMEDWHNFGADYDTTLMAWYERFLAAWPEIADNYSERFKRMFTYYLNACAGAFRARDIQLWQVVFSRGVENGLRVAR >CP029164.1|AWH70721.1|3141264_3142476_-|Bcr/CflA-family-multidrug-efflux-MFS-transporter MQPGKRFLVWLAGLSVLGFLATDMYLPAFAAIQADLQTPASAVSASLSLFLAGFAAAQLLWGPLSDRYGRKPVLLIGLTIFALGSLGMLWVENAATLLVLRFVQAVGVCAAAVIWQALVTDYYPSQKVNRIFATIMPLVGLSPALAPLLGSWLLVHFSWQAIFATLFAITVVLILPIFWLKPTTKARNNSQDGLTFTDLLRSKTYRGNVLIYAACSASFFAWLTGSPFILSEMGYSPAVIGLSYVPQTIAFLIGGYGCRAALQKWQGKQLLPWLLVLFAVSVIATWAAGFISHVSLVEILIPFCVMAIANGAIYPIVVAQALRPFPHATGRAAALQNTLQLGLCFLASLVVSWLISISTPLLTTTSVMLSTVVLVALGYMMQRCEEVDCQNHGNAEVAHSESH >CP029164.1|AWH70722.1|3142588_3143521_+|LysR-family-transcriptional-regulator MWSEYSLEVVDAVARNGSFSAAAQELHRVPSAVSYTVRQLEEWLAVPLFERRHRDVELTAAGAWFLKEGRSVVKKMQITRQQCQQIANGWRGQLAIAVDNIVRPERTRQMIVDFYRHFDDVELLVFQEVFNGVWDALSDGRVELAIGATRAIPVGGRYAFRDMGMLSWSCVVASHHPLALMDGPFSDDTLRNWPSLVREDTSRTLPKRITWLLDNQKRLVVPDWESSATCISAGLCIGMVPTHFAKPWLNEGKWVALELENPFPDSACCLTWQQNDMSPALTWLLEYLGDSETLNKEWLREPEETPATGD >CP029164.1|AWH70723.1|3143517_3144543_-|PurR-family-transcriptional-regulator MATIKDVAKRANVSTTTVSHVINKTRFVAEETRNAVWAAIKELHYSPSAVARSLKVNHTKSIGLLATSSEAAYFAEIIEAVEKNCFQKGYTLILGNAWNNLEKQRAYLSMMAQKRVDGLLVMCSEYPEPLLAMLEEYRHIPMVVMDWGEAKADFTDAVIDNAFEGGYMAGRYLIERGHREIGVIPGPLERNTGAGRLAGFMKAMEEAMIKVPESWIVQGDFEPESGYRAMQQILSQPHRPTAVFCGGDIMAMGALCAADEMGLRVPQDVSLIGYDNVRNARYFTPALTTIHQPKDSLGETAFNMLLDRIVNKREEPQSIEVHPRLIERRSVADGPFRDYRR >CP029164.1|AWH70724.1|3144530_3144755_-|hypothetical-protein MISVFTTSPFRQDRPKFHAYTICVLAIDPFLTLRVVFPAYRNTFVVRKVCKGKRLPCDFAGAEVRVWSEMEWQQ >CP029164.1|AWH70725.1|3144841_3144931_+|YnhF-family-membrane-protein MSTDLKFSLVTTIIVLGLIVAVGLTAALH >CP029164.1|AWH70726.1|3145096_3146266_+|MFS-transporter MKINYPLLALAIGAFGIGTTEFSPMGLLPVIARGVDVSIPAAGMLISAYAVGVMVGAPLMTLLLSHRARRSALIFLMAIFTLGNVLSAIAPDYMTLMLSRILTSLNHGAFFGLGSVVAASVVPKHKQASAVATMFMGLTLANIGGVPAATWLGETIGWRMSFLATAGLGVISMVSLFFSLPKGGAGARPEVKKELAVLMRPQVLSALLTTVLGAGAMFTLYTYISPVLQSITHATPVFVTAMLVLIGVGFSIGNYLGGKLADRSVNGTLKGFLLLLMVIMLAIPFLARNEFGAAISMVVWGAATFAVVPPLQMRVMRVASEAPGLSSSVNIGAFNLGNALGAAAGGAVISAGLGYSFVPVMGAIVAGLALLLVFMSARKQPEAVCVANS |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CP029164_2 | 2.1|917746|59|CP029164|CRISPRCasFinder | 917746-917804 | 59 | CP029164.1 | 1712076-1712134 | 2 | 0.966 |
1. spacer 2.1|917746|59|CP029164|CRISPRCasFinder matches to position: 1712076-1712134, mismatch: 2, identity: 0.966
gaaggcagagggagacagtctgcgcggtgagataggcggtgtatacagagatgcccgtg CRISPR spacer gaaggcagagggagacagtctgcgtggtgagataggtggtgtatacagagatgcccgtg Protospacer ************************.***********.**********************
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_CP038506 | Escherichia coli strain 28Eco12 plasmid p28Eco12, complete sequence | 11835-11866 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_CP019283 | Escherichia coli strain 13P484A plasmid p13P484A-3, complete sequence | 42147-42178 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | CP042641 | Escherichia coli strain NCYU-24-74 plasmid pNCYU-24-74-3, complete sequence | 36520-36551 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_CP034821 | Salmonella sp. SSDFZ54 plasmid pTB502, complete sequence | 1557-1588 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_CP027443 | Escherichia coli strain 2013C-3252 plasmid unnamed1, complete sequence | 9506-9537 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_CP027575 | Escherichia coli strain 2013C-4081 plasmid unnamed2 | 94756-94787 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_CP027223 | Escherichia coli strain 2015C-3101 plasmid unnamed2 | 65534-65565 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_CP030188 | Salmonella enterica strain SA20094620 plasmid pSA20094620.3, complete sequence | 61916-61947 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_CP027590 | Escherichia coli strain 2014C-3011 plasmid unnamed2 | 64165-64196 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_CP039862 | Salmonella enterica subsp. enterica serovar 1,4,[5],12:i:- strain PNCS014880 plasmid p16-6773.2, complete sequence | 20286-20317 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_CP033632 | Escherichia coli isolate ECCNB12-2 plasmid pTB-nb1, complete sequence | 119284-119315 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_AP018798 | Escherichia coli strain E2855 plasmid pE2855-2, complete sequence | 85099-85130 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | CP012494 | Escherichia coli strain CFSAN004177 plasmid pCFSAN004177G_03, complete sequence | 46688-46719 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | CP027320 | Escherichia coli strain 2014C-3084 plasmid unnamed1 | 42116-42147 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NC_013370 | Escherichia coli O111:H- str. 11128 plasmid pO111_2, complete sequence | 87237-87268 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_CP026475 | Escherichia coli strain KBN10P04869 plasmid pKBN10P04869B, complete sequence | 81853-81884 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | MN510445 | Escherichia coli strain JIE250 plasmid pJIE250_3, complete sequence | 70500-70531 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | MN510447 | Escherichia coli strain TZ20_1P plasmid pTZ20_1P, complete sequence | 47861-47892 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_CP012491 | Escherichia coli strain CFSAN004176 plasmid pCFSAN004176P_03, complete sequence | 62997-63028 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | MH422554 | Escherichia phage P1, complete genome | 87194-87225 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NC_050152 | Enterobacteria phage P7, complete genome | 93995-94026 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | MH445380 | Escherichia virus P1 isolate transconjugant 2(L-II), complete genome | 58645-58676 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NC_031129 | Salmonella phage SJ46, complete genome | 77363-77394 | 0 | 1.0 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | MH445381 | Escherichia virus P1, complete genome | 56870-56901 | 0 | 1.0 |
CP029164_6 | 6.1|3774268|40|CP029164|CRISPRCasFinder | 3774268-3774307 | 40 | NZ_CP041417 | Escherichia coli strain STEC711 plasmid pSTEC711_1, complete sequence | 47951-47990 | 1 | 0.975 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NZ_CP032392 | Salmonella enterica subsp. enterica serovar Dublin strain CVM 34981 plasmid p34981_2, complete sequence | 28190-28221 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NZ_CP022965 | Salmonella enterica subsp. enterica serovar Pullorum strain QJ-2D-Sal plasmid pQJDsal2, complete sequence | 29440-29471 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NC_007208 | Salmonella enterica OU7025 plasmid pOU1113, complete sequence | 19749-19780 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NZ_HG970001 | Salmonella enterica subsp. enterica serovar Gallinarum str. 287/91 plasmid pSG, complete sequence | 38393-38424 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NZ_CP032386 | Salmonella enterica subsp. enterica serovar Dublin strain CVM N53043 plasmid pN53043_2, complete sequence | 3653-3684 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NZ_CP032388 | Salmonella enterica subsp. enterica serovar Dublin strain CVM N45955 plasmid pN45955_1, complete sequence | 42554-42585 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NZ_CP030208 | Salmonella enterica strain SA19992307 plasmid pSA19992307.1, complete sequence | 4890-4921 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NZ_CP032450 | Salmonella enterica subsp. enterica serovar Dublin strain USMARC-69838 plasmid pSDU1-USMARC-69838, complete sequence | 68762-68793 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NC_011204 | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 plasmid pCT02021853_74, complete sequence | 31020-31051 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NC_019106 | Salmonella enterica subsp. enterica serovar Dublin plasmid pSD_77, complete sequence | 75580-75611 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NC_019112 | Salmonella enterica subsp. enterica serovar Pullorum plasmid pSPUV, complete sequence | 20322-20353 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NZ_CP012348 | Salmonella enterica subsp. enterica serovar Pullorum str. ATCC 9120 plasmid pCFSAN000725_01, complete sequence | 42630-42661 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NZ_CP019180 | Salmonella enterica subsp. enterica serovar Dublin str. ATCC 39184 plasmid pATCC39184, complete sequence | 59477-59508 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NZ_CP032394 | Salmonella enterica subsp. enterica serovar Dublin strain CVM 22453 plasmid p22453_2, complete sequence | 17037-17068 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NC_010422 | Salmonella enterica subsp. enterica serovar Dublin plasmid pOU1115, complete sequence | 19784-19815 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NZ_CP032381 | Salmonella enterica subsp. enterica serovar Dublin strain USMARC-69807 plasmid pSDU2-USMARC-69807, complete sequence | 47366-47397 | 2 | 0.938 |
CP029164_4 | 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029803-2029834 | 32 | NZ_CP032447 | Salmonella enterica subsp. enterica serovar Dublin strain USMARC-69840 plasmid pSDU1-USMARC-69840, complete sequence | 58031-58062 | 2 | 0.938 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_KP453775 | Klebsiella pneumoniae strain ST11 plasmid pKP12226, complete sequence | 17440-17471 | 2 | 0.938 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_CP017632 | Escherichia coli strain SLK172 plasmid pSLK172-1, complete sequence | 122310-122341 | 2 | 0.938 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_CP030285 | Escherichia coli strain E308 plasmid pLKSZ04, complete sequence | 6029-6060 | 2 | 0.938 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | NZ_CP036204 | Escherichia coli strain L725 plasmid punnamed2, complete sequence | 15121-15152 | 2 | 0.938 |
CP029164_4 | 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2030047-2030078 | 32 | MN510446 | Escherichia coli strain SvETEC plasmid pSvP1_F, complete sequence | 131402-131433 | 2 | 0.938 |
CP029164_5 | 5.1|3135865|38|CP029164|CRISPRCasFinder | 3135865-3135902 | 38 | NZ_CP043437 | Enterobacter sp. LU1 plasmid unnamed | 113727-113764 | 2 | 0.947 |
CP029164_4 | 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029132-2029163 | 32 | NZ_CP022140 | Salmonella enterica subsp. salamae serovar 55:k:z39 str. 1315K plasmid unnamed1, complete sequence | 27735-27766 | 6 | 0.812 |
CP029164_4 | 4.5|2029376|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029376-2029407 | 32 | NZ_CP015205 | Rhodococcus sp. 008 plasmid pR8C2, complete sequence | 47221-47252 | 7 | 0.781 |
CP029164_4 | 4.5|2029376|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029376-2029407 | 32 | NZ_CP025960 | Rhodococcus qingshengii strain djl-6-2 plasmid pDJL1, complete sequence | 25136-25167 | 7 | 0.781 |
CP029164_3 | 3.1|2002914|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2002914-2002945 | 32 | MK448716 | Streptococcus phage Javan249, complete genome | 37558-37589 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_LT904880 | Salmonella enterica subsp. enterica serovar Typhi strain ty3-193 genome assembly, plasmid: 3 | 100861-100892 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_LT904895 | Salmonella enterica subsp. enterica serovar Typhi strain ERL12960 genome assembly, plasmid: 2 | 32455-32486 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP012254 | Cronobacter sakazakii strain NCTC 8155 plasmid pCS1, complete sequence | 90639-90670 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029647 | Salmonella enterica subsp. enterica serovar Typhi strain 311189_217186 plasmid pHCM2, complete sequence | 24048-24079 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_LT904853 | Salmonella enterica subsp. enterica serovar Typhi strain TY585 genome assembly, plasmid: 2 | 2158-2189 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NC_003385 | Salmonella enterica subsp. enterica serovar Typhi str. CT18 plasmid pHCM2, complete sequence | 69550-69581 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029859 | Salmonella enterica subsp. enterica serovar Typhi strain 343077_285138 plasmid pHCM2, complete sequence | 33422-33453 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029847 | Salmonella enterica subsp. enterica serovar Typhi strain 343078_273110 plasmid pHCM2, complete sequence | 64890-64921 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029916 | Salmonella enterica subsp. enterica serovar Typhi strain 343076_202113 plasmid pHCM2, complete sequence | 3001-3032 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029893 | Salmonella enterica subsp. enterica serovar Typhi strain 343076_252143 plasmid pHCM2, complete sequence | 30210-30241 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029876 | Salmonella enterica subsp. enterica serovar Typhi strain 343076_227128 plasmid pHCM2, complete sequence | 21803-21834 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029884 | Salmonella enterica subsp. enterica serovar Typhi strain 311189_268186 plasmid pHCM2, complete sequence | 5429-5460 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029961 | Salmonella enterica subsp. enterica serovar Typhi strain 343078_251131 plasmid pHCM2, complete sequence | 76887-76918 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029849 | Salmonella enterica subsp. enterica serovar Typhi strain 343078_211126 plasmid pHCM2, complete sequence | 9632-9663 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029857 | Salmonella enterica subsp. enterica serovar Typhi strain 343077_286126 plasmid pHCM2, complete sequence | 17697-17728 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029929 | Salmonella enterica subsp. enterica serovar Typhi strain 311189_216103 plasmid pHCM2, complete sequence | 73527-73558 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029865 | Salmonella enterica subsp. enterica serovar Typhi strain 343077_228157 plasmid pHCM2, complete sequence | 87633-87664 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_LT883154 | Salmonella enterica subsp. enterica serovar Typhi strain ERL12148 genome assembly, plasmid: 2 | 32810-32841 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029905 | Salmonella enterica subsp. enterica serovar Typhi strain 311189_231186 plasmid pHCM2, complete sequence | 30728-30759 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029851 | Salmonella enterica subsp. enterica serovar Typhi strain 343078_203125 plasmid pHCM2, complete sequence | 44207-44238 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP040567 | Salmonella enterica subsp. enterica serovar Typhimurium strain SAP17-7299 plasmid pCFSAN059543, complete sequence | 17468-17499 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029889 | Salmonella enterica subsp. enterica serovar Typhi strain 343076_294172 plasmid pHCM2, complete sequence | 17968-17999 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029891 | Salmonella enterica subsp. enterica serovar Typhi strain 343076_253155 plasmid pHCM2, complete sequence | 18575-18606 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029898 | Salmonella enterica subsp. enterica serovar Typhi strain 343077_213147 plasmid pHCM2, complete sequence | 21136-21167 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029951 | Salmonella enterica subsp. enterica serovar Typhi strain 311189_205186 plasmid pHCM2, complete sequence | 47286-47317 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029867 | Salmonella enterica subsp. enterica serovar Typhi strain 343077_228140 plasmid pHCM2, complete sequence | 33258-33289 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029901 | Salmonella enterica subsp. enterica serovar Typhi strain 343076_232188 plasmid pHCM2, complete sequence | 32901-32932 | 8 | 0.75 |
CP029164_3 | 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT | 2003158-2003189 | 32 | NZ_CP029921 | Salmonella enterica subsp. enterica serovar Typhi strain 311189_282186 plasmid pHCM2, complete sequence | 100446-100477 | 8 | 0.75 |
CP029164_4 | 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029132-2029163 | 32 | NZ_KP453775 | Klebsiella pneumoniae strain ST11 plasmid pKP12226, complete sequence | 70246-70277 | 8 | 0.75 |
CP029164_4 | 4.10|2029681|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029681-2029712 | 32 | NC_009717 | Xanthobacter autotrophicus Py2 plasmid pXAUT01, complete sequence | 288380-288411 | 8 | 0.75 |
CP029164_4 | 4.10|2029681|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029681-2029712 | 32 | NZ_CP010657 | Phaeobacter piscinae strain P71 plasmid pP71_a, complete sequence | 57627-57658 | 8 | 0.75 |
CP029164_4 | 4.13|2029864|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029864-2029895 | 32 | NZ_CP042263 | Litoreibacter sp. LN3S51 plasmid unnamed2, complete sequence | 173595-173626 | 8 | 0.75 |
CP029164_3 | 3.1|2002914|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2002914-2002945 | 32 | NZ_CP044347 | Escherichia coli strain P225M plasmid pP225M-CTX-M-55, complete sequence | 121862-121893 | 9 | 0.719 |
CP029164_3 | 3.1|2002914|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2002914-2002945 | 32 | NZ_CP032533 | Bacillus megaterium NCT-2 plasmid pNCT2_5, complete sequence | 17401-17432 | 9 | 0.719 |
CP029164_3 | 3.6|2003219|32|CP029164|CRISPRCasFinder,CRT,PILER-CR | 2003219-2003250 | 32 | AP014399 | Uncultured Mediterranean phage uvMED isolate uvMED-CGF-C110A-MedDCM-OCT-S26-C20, *** SEQUENCING IN PROGRESS ***, 2 ordered pieces | 35411-35442 | 9 | 0.719 |
CP029164_3 | 3.6|2003219|32|CP029164|CRISPRCasFinder,CRT,PILER-CR | 2003219-2003250 | 32 | AP013383 | Uncultured Mediterranean phage uvMED DNA, complete genome, group G15, isolate: uvMED-CGR-C110A-MedDCM-OCT-S24-C13 | 29583-29614 | 9 | 0.719 |
CP029164_3 | 3.7|2003280|32|CP029164|CRISPRCasFinder,CRT,PILER-CR | 2003280-2003311 | 32 | NZ_CP021994 | Cryobacterium sp. LW097 plasmid unnamed1 | 33387-33418 | 9 | 0.719 |
CP029164_4 | 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029132-2029163 | 32 | NZ_CP011404 | Lactobacillus salivarius str. Ren plasmid pR1, complete sequence | 100759-100790 | 9 | 0.719 |
CP029164_4 | 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029132-2029163 | 32 | NZ_CP020859 | Lactobacillus salivarius strain ZLS006 plasmid unnamed1, complete sequence | 48192-48223 | 9 | 0.719 |
CP029164_4 | 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029132-2029163 | 32 | NZ_CP020859 | Lactobacillus salivarius strain ZLS006 plasmid unnamed1, complete sequence | 248214-248245 | 9 | 0.719 |
CP029164_4 | 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029132-2029163 | 32 | NZ_CP017108 | Lactobacillus salivarius strain CICC23174 plasmid pLS_1 sequence | 140326-140357 | 9 | 0.719 |
CP029164_4 | 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029132-2029163 | 32 | NZ_CP017110 | Lactobacillus salivarius strain CICC23174 plasmid pLS_3, complete sequence | 20889-20920 | 9 | 0.719 |
CP029164_4 | 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029132-2029163 | 32 | NZ_CP029460 | Clostridium novyi strain 150557 plasmid pCN2, complete sequence | 45232-45263 | 9 | 0.719 |
CP029164_4 | 4.10|2029681|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029681-2029712 | 32 | NZ_CP014942 | Rhodococcus sp. BH4 plasmid, complete sequence | 418665-418696 | 9 | 0.719 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_AP023208 | Escherichia coli strain TUM18781 plasmid pMTY18781-3, complete sequence | 12113-12164 | 10 | 0.808 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | MT230402 | Escherichia coli strain DH5alpha plasmid pESBL87, complete sequence | 272-323 | 10 | 0.808 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_AP023207 | Escherichia coli strain TUM18781 plasmid pMTY18781-2, complete sequence | 31252-31303 | 10 | 0.808 |
CP029164_3 | 3.1|2002914|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2002914-2002945 | 32 | NZ_CP045561 | Acinetobacter nosocomialis strain AC1530 plasmid pAC1530, complete sequence | 141094-141125 | 10 | 0.688 |
CP029164_3 | 3.1|2002914|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2002914-2002945 | 32 | NZ_CP014478 | Acinetobacter pittii strain AP_882 plasmid pNDM-AP_882, complete sequence | 140150-140181 | 10 | 0.688 |
CP029164_3 | 3.8|2003341|32|CP029164|CRISPRCasFinder,CRT,PILER-CR | 2003341-2003372 | 32 | NZ_CP024940 | Paraburkholderia hospita strain mHSR1 plasmid pmHSR1_P, complete sequence | 1163456-1163487 | 10 | 0.688 |
CP029164_4 | 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029132-2029163 | 32 | NC_018511 | Bacillus thuringiensis HD-789 plasmid pBTHD789-5, complete sequence | 60-91 | 10 | 0.688 |
CP029164_4 | 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029132-2029163 | 32 | NC_001446 | Bacillus thuringiensis sv israelensis HI4 plasmid pTX14-3, complete sequence | 4548-4579 | 10 | 0.688 |
CP029164_4 | 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029132-2029163 | 32 | NZ_CP009345 | Bacillus thuringiensis HD1002 plasmid 6, complete sequence | 1165-1196 | 10 | 0.688 |
CP029164_4 | 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029132-2029163 | 32 | NZ_CP053657 | Bacillus cereus strain CTMA_1571 plasmid p.1, complete sequence | 105460-105491 | 10 | 0.688 |
CP029164_4 | 4.6|2029437|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029437-2029468 | 32 | NZ_CP031468 | Paraburkholderia caffeinilytica strain CF1 plasmid p1, complete sequence | 166221-166252 | 10 | 0.688 |
CP029164_4 | 4.6|2029437|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029437-2029468 | 32 | NZ_CP014580 | Burkholderia sp. OLGA172 plasmid pOLGA1, complete sequence | 2849-2880 | 10 | 0.688 |
CP029164_4 | 4.10|2029681|32|CP029164|PILER-CR,CRISPRCasFinder,CRT | 2029681-2029712 | 32 | NZ_CP048816 | Caulobacter rhizosphaerae strain KCTC 52515 plasmid unnamed | 34863-34894 | 10 | 0.688 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 4115-4166 | 11 | 0.788 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 4114-4165 | 11 | 0.788 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 4114-4165 | 11 | 0.788 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 4114-4165 | 11 | 0.788 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | MT230312 | Escherichia coli strain DH5alpha plasmid pESBL31, complete sequence | 152-203 | 11 | 0.788 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 208982-209033 | 12 | 0.769 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 44-95 | 12 | 0.769 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 219476-219527 | 12 | 0.769 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 43-94 | 12 | 0.769 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP044307 | Escherichia coli strain C27A plasmid pC27A-2, complete sequence | 18332-18383 | 12 | 0.769 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 210310-210361 | 12 | 0.769 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 43-94 | 12 | 0.769 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 191265-191316 | 12 | 0.769 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 43-94 | 12 | 0.769 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP044147 | Escherichia coli O157 strain AR-0428 plasmid pAR-0428-2 | 7393-7444 | 12 | 0.769 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | CP044351 | Escherichia coli strain 194195 plasmid p194195_1, complete sequence | 84217-84268 | 12 | 0.769 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 3373-3424 | 13 | 0.75 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 3372-3423 | 13 | 0.75 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP044307 | Escherichia coli strain C27A plasmid pC27A-2, complete sequence | 15000-15051 | 13 | 0.75 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 3372-3423 | 13 | 0.75 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 3372-3423 | 13 | 0.75 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_AP023209 | Escherichia coli strain TUM18781 plasmid pMTY18781-4, complete sequence | 70-121 | 14 | 0.731 |
CP029164_1 | 1.1|136201|52|CP029164|CRISPRCasFinder | 136201-136252 | 52 | NZ_CP010208 | Escherichia coli strain M11 plasmid B, complete sequence | 30159-30210 | 14 | 0.731 |
1. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP038506 (Escherichia coli strain 28Eco12 plasmid p28Eco12, complete sequence) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
2. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP019283 (Escherichia coli strain 13P484A plasmid p13P484A-3, complete sequence) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
3. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to CP042641 (Escherichia coli strain NCYU-24-74 plasmid pNCYU-24-74-3, complete sequence) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
4. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP034821 (Salmonella sp. SSDFZ54 plasmid pTB502, complete sequence) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
5. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP027443 (Escherichia coli strain 2013C-3252 plasmid unnamed1, complete sequence) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
6. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP027575 (Escherichia coli strain 2013C-4081 plasmid unnamed2) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
7. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP027223 (Escherichia coli strain 2015C-3101 plasmid unnamed2) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
8. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP030188 (Salmonella enterica strain SA20094620 plasmid pSA20094620.3, complete sequence) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
9. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP027590 (Escherichia coli strain 2014C-3011 plasmid unnamed2) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
10. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP039862 (Salmonella enterica subsp. enterica serovar 1,4,[5],12:i:- strain PNCS014880 plasmid p16-6773.2, complete sequence) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
11. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP033632 (Escherichia coli isolate ECCNB12-2 plasmid pTB-nb1, complete sequence) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
12. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_AP018798 (Escherichia coli strain E2855 plasmid pE2855-2, complete sequence) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
13. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to CP012494 (Escherichia coli strain CFSAN004177 plasmid pCFSAN004177G_03, complete sequence) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
14. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to CP027320 (Escherichia coli strain 2014C-3084 plasmid unnamed1) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
15. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NC_013370 (Escherichia coli O111:H- str. 11128 plasmid pO111_2, complete sequence) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
16. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP026475 (Escherichia coli strain KBN10P04869 plasmid pKBN10P04869B, complete sequence) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
17. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to MN510445 (Escherichia coli strain JIE250 plasmid pJIE250_3, complete sequence) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
18. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to MN510447 (Escherichia coli strain TZ20_1P plasmid pTZ20_1P, complete sequence) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
19. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP012491 (Escherichia coli strain CFSAN004176 plasmid pCFSAN004176P_03, complete sequence) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
20. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to MH422554 (Escherichia phage P1, complete genome) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
21. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NC_050152 (Enterobacteria phage P7, complete genome) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
22. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to MH445380 (Escherichia virus P1 isolate transconjugant 2(L-II), complete genome) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
23. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NC_031129 (Salmonella phage SJ46, complete genome) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
24. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to MH445381 (Escherichia virus P1, complete genome) position: , mismatch: 0, identity: 1.0
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctgaaccgacattcatgt Protospacer ********************************
25. spacer 6.1|3774268|40|CP029164|CRISPRCasFinder matches to NZ_CP041417 (Escherichia coli strain STEC711 plasmid pSTEC711_1, complete sequence) position: , mismatch: 1, identity: 0.975
gcgctgcgggtcatttttgaaattacccccgctgtgctgt CRISPR spacer gcgctgcgggtcattcttgaaattacccccgctgtgctgt Protospacer ***************.************************
26. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP032392 (Salmonella enterica subsp. enterica serovar Dublin strain CVM 34981 plasmid p34981_2, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
27. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP022965 (Salmonella enterica subsp. enterica serovar Pullorum strain QJ-2D-Sal plasmid pQJDsal2, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
28. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NC_007208 (Salmonella enterica OU7025 plasmid pOU1113, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
29. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_HG970001 (Salmonella enterica subsp. enterica serovar Gallinarum str. 287/91 plasmid pSG, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
30. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP032386 (Salmonella enterica subsp. enterica serovar Dublin strain CVM N53043 plasmid pN53043_2, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
31. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP032388 (Salmonella enterica subsp. enterica serovar Dublin strain CVM N45955 plasmid pN45955_1, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
32. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP030208 (Salmonella enterica strain SA19992307 plasmid pSA19992307.1, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
33. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP032450 (Salmonella enterica subsp. enterica serovar Dublin strain USMARC-69838 plasmid pSDU1-USMARC-69838, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
34. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NC_011204 (Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 plasmid pCT02021853_74, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
35. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NC_019106 (Salmonella enterica subsp. enterica serovar Dublin plasmid pSD_77, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
36. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NC_019112 (Salmonella enterica subsp. enterica serovar Pullorum plasmid pSPUV, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
37. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP012348 (Salmonella enterica subsp. enterica serovar Pullorum str. ATCC 9120 plasmid pCFSAN000725_01, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
38. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP019180 (Salmonella enterica subsp. enterica serovar Dublin str. ATCC 39184 plasmid pATCC39184, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
39. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP032394 (Salmonella enterica subsp. enterica serovar Dublin strain CVM 22453 plasmid p22453_2, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
40. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NC_010422 (Salmonella enterica subsp. enterica serovar Dublin plasmid pOU1115, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
41. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP032381 (Salmonella enterica subsp. enterica serovar Dublin strain USMARC-69807 plasmid pSDU2-USMARC-69807, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
42. spacer 4.12|2029803|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP032447 (Salmonella enterica subsp. enterica serovar Dublin strain USMARC-69840 plasmid pSDU1-USMARC-69840, complete sequence) position: , mismatch: 2, identity: 0.938
tgggatgagcaaataacgtcgtttcctagaaa CRISPR spacer tgggatgagcgaataacgtcttttcctagaaa Protospacer **********.********* ***********
43. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_KP453775 (Klebsiella pneumoniae strain ST11 plasmid pKP12226, complete sequence) position: , mismatch: 2, identity: 0.938
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtgaaaacacgttctgaaccgacattcatgt Protospacer ***.******** *******************
44. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP017632 (Escherichia coli strain SLK172 plasmid pSLK172-1, complete sequence) position: , mismatch: 2, identity: 0.938
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtgaaaacacgttctgaaccgacattcatgt Protospacer ***.******** *******************
45. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP030285 (Escherichia coli strain E308 plasmid pLKSZ04, complete sequence) position: , mismatch: 2, identity: 0.938
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctaaaccgtcattcatgt Protospacer ****************.***** *********
46. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP036204 (Escherichia coli strain L725 plasmid punnamed2, complete sequence) position: , mismatch: 2, identity: 0.938
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacggtctaaaccgtcattcatgt Protospacer ****************.***** *********
47. spacer 4.16|2030047|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to MN510446 (Escherichia coli strain SvETEC plasmid pSvP1_F, complete sequence) position: , mismatch: 2, identity: 0.938
ggtaaaaacacggtctgaaccgacattcatgt CRISPR spacer ggtaaaaacacgttatgaaccgacattcatgt Protospacer ************ * *****************
48. spacer 5.1|3135865|38|CP029164|CRISPRCasFinder matches to NZ_CP043437 (Enterobacter sp. LU1 plasmid unnamed) position: , mismatch: 2, identity: 0.947
cggacgcaggatggtgcgttcaattggactcgaaccaa CRISPR spacer cagacgcagaatggtgcgttcaattggactcgaaccaa Protospacer *.*******.****************************
49. spacer 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP022140 (Salmonella enterica subsp. salamae serovar 55:k:z39 str. 1315K plasmid unnamed1, complete sequence) position: , mismatch: 6, identity: 0.812
cattgaa-aacattgcctttattttattttttg CRISPR spacer -attaaatcacattccctctattttattttttc Protospacer ***.** ***** ***.*************
50. spacer 4.5|2029376|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP015205 (Rhodococcus sp. 008 plasmid pR8C2, complete sequence) position: , mismatch: 7, identity: 0.781
caataacgcagcatccaggaagctgtttccgc CRISPR spacer cgcgagcgcagcatcgaagaagctgtttccac Protospacer *. *.********* *.************.*
51. spacer 4.5|2029376|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP025960 (Rhodococcus qingshengii strain djl-6-2 plasmid pDJL1, complete sequence) position: , mismatch: 7, identity: 0.781
caataacgcagcatccaggaagctgtttccgc CRISPR spacer cgcgagcgcagcatcgaagaagctgtttccac Protospacer *. *.********* *.************.*
52. spacer 3.1|2002914|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to MK448716 (Streptococcus phage Javan249, complete genome) position: , mismatch: 8, identity: 0.75
acgtaacaaaacaacagcaaaatattatcgac CRISPR spacer taataacaaaacgacagcaaaagattatagtt Protospacer .*********.********* ***** * .
53. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_LT904880 (Salmonella enterica subsp. enterica serovar Typhi strain ty3-193 genome assembly, plasmid: 3) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
54. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_LT904895 (Salmonella enterica subsp. enterica serovar Typhi strain ERL12960 genome assembly, plasmid: 2) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
55. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP012254 (Cronobacter sakazakii strain NCTC 8155 plasmid pCS1, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttatca Protospacer ******** *****.******** ** .*.
56. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029647 (Salmonella enterica subsp. enterica serovar Typhi strain 311189_217186 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
57. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_LT904853 (Salmonella enterica subsp. enterica serovar Typhi strain TY585 genome assembly, plasmid: 2) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
58. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NC_003385 (Salmonella enterica subsp. enterica serovar Typhi str. CT18 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
59. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029859 (Salmonella enterica subsp. enterica serovar Typhi strain 343077_285138 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
60. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029847 (Salmonella enterica subsp. enterica serovar Typhi strain 343078_273110 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
61. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029916 (Salmonella enterica subsp. enterica serovar Typhi strain 343076_202113 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
62. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029893 (Salmonella enterica subsp. enterica serovar Typhi strain 343076_252143 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
63. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029876 (Salmonella enterica subsp. enterica serovar Typhi strain 343076_227128 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
64. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029884 (Salmonella enterica subsp. enterica serovar Typhi strain 311189_268186 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
65. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029961 (Salmonella enterica subsp. enterica serovar Typhi strain 343078_251131 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
66. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029849 (Salmonella enterica subsp. enterica serovar Typhi strain 343078_211126 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
67. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029857 (Salmonella enterica subsp. enterica serovar Typhi strain 343077_286126 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
68. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029929 (Salmonella enterica subsp. enterica serovar Typhi strain 311189_216103 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
69. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029865 (Salmonella enterica subsp. enterica serovar Typhi strain 343077_228157 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
70. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_LT883154 (Salmonella enterica subsp. enterica serovar Typhi strain ERL12148 genome assembly, plasmid: 2) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
71. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029905 (Salmonella enterica subsp. enterica serovar Typhi strain 311189_231186 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
72. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029851 (Salmonella enterica subsp. enterica serovar Typhi strain 343078_203125 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
73. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP040567 (Salmonella enterica subsp. enterica serovar Typhimurium strain SAP17-7299 plasmid pCFSAN059543, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
74. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029889 (Salmonella enterica subsp. enterica serovar Typhi strain 343076_294172 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
75. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029891 (Salmonella enterica subsp. enterica serovar Typhi strain 343076_253155 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
76. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029898 (Salmonella enterica subsp. enterica serovar Typhi strain 343077_213147 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
77. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029951 (Salmonella enterica subsp. enterica serovar Typhi strain 311189_205186 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
78. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029867 (Salmonella enterica subsp. enterica serovar Typhi strain 343077_228140 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
79. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029901 (Salmonella enterica subsp. enterica serovar Typhi strain 343076_232188 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
80. spacer 3.5|2003158|32|CP029164|CRISPRCasFinder,CRT matches to NZ_CP029921 (Salmonella enterica subsp. enterica serovar Typhi strain 311189_282186 plasmid pHCM2, complete sequence) position: , mismatch: 8, identity: 0.75
aatgattgatataaatctgtgtacggtgtccg CRISPR spacer aatgattgttataagtctgtgtagcgttgttg Protospacer ******** *****.******** ** ..*
81. spacer 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_KP453775 (Klebsiella pneumoniae strain ST11 plasmid pKP12226, complete sequence) position: , mismatch: 8, identity: 0.75
cattgaaaacattgcctttattttattttttg CRISPR spacer tcgctaaaacattggctttatttaatttttta Protospacer . . ********* ******** *******.
82. spacer 4.10|2029681|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NC_009717 (Xanthobacter autotrophicus Py2 plasmid pXAUT01, complete sequence) position: , mismatch: 8, identity: 0.75
cgtggctgcgctggccgttgcagcagtttgat CRISPR spacer cctggccgcgcttgccgttgcagcaggaccgt Protospacer * ****.***** ************* . .*
83. spacer 4.10|2029681|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP010657 (Phaeobacter piscinae strain P71 plasmid pP71_a, complete sequence) position: , mismatch: 8, identity: 0.75
cgtggctgcgctggccgttgcagcagtttgat-- CRISPR spacer tgtggcggcgctggcagttgcagcg--ctggttc Protospacer .***** ******** ********. .**.*
84. spacer 4.13|2029864|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP042263 (Litoreibacter sp. LN3S51 plasmid unnamed2, complete sequence) position: , mismatch: 8, identity: 0.75
ccgccgtgccagtgatcctcatacggcctgtt CRISPR spacer cggtgcagccagtgatcgtcatacagcctgtg Protospacer * *. ********** ******.******
85. spacer 3.1|2002914|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP044347 (Escherichia coli strain P225M plasmid pP225M-CTX-M-55, complete sequence) position: , mismatch: 9, identity: 0.719
acgtaacaaaacaacagcaaaatattatcgac CRISPR spacer taagaacaaaacaagagcgaaatattatgcat Protospacer . ********** ***.********* *.
86. spacer 3.1|2002914|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP032533 (Bacillus megaterium NCT-2 plasmid pNCT2_5, complete sequence) position: , mismatch: 9, identity: 0.719
acgtaacaaaacaacagcaaaatattatcgac CRISPR spacer atgtaacaaaaaaacagtaaaatatggtgaga Protospacer *.********* *****.******* .* ..
87. spacer 3.6|2003219|32|CP029164|CRISPRCasFinder,CRT,PILER-CR matches to AP014399 (Uncultured Mediterranean phage uvMED isolate uvMED-CGF-C110A-MedDCM-OCT-S26-C20, *** SEQUENCING IN PROGRESS ***, 2 ordered pieces) position: , mismatch: 9, identity: 0.719
ctaggataaattaaaagacaaaattgcagcaa CRISPR spacer gtcagataaattaaaagagataattgcaaatt Protospacer * .************** * *******.
88. spacer 3.6|2003219|32|CP029164|CRISPRCasFinder,CRT,PILER-CR matches to AP013383 (Uncultured Mediterranean phage uvMED DNA, complete genome, group G15, isolate: uvMED-CGR-C110A-MedDCM-OCT-S24-C13) position: , mismatch: 9, identity: 0.719
ctaggataaattaaaagacaaaattgcagcaa CRISPR spacer gtcagataaattaaaagagataattgcaaatt Protospacer * .************** * *******.
89. spacer 3.7|2003280|32|CP029164|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP021994 (Cryobacterium sp. LW097 plasmid unnamed1) position: , mismatch: 9, identity: 0.719
gagcgaccagtatcaagatcgacaggttttgc CRISPR spacer gagcgaccagcagcaagatcgaccagacctca Protospacer **********.* ********** .* ..*
90. spacer 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP011404 (Lactobacillus salivarius str. Ren plasmid pR1, complete sequence) position: , mismatch: 9, identity: 0.719
cattgaaaacattgcctttattttattttttg CRISPR spacer tttaataaacattgccttaattttattattaa Protospacer . * . ************ ******** ** .
91. spacer 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP020859 (Lactobacillus salivarius strain ZLS006 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.719
cattgaaaacattgcctttattttattttttg CRISPR spacer tttaataaacattgccttaattttattattaa Protospacer . * . ************ ******** ** .
92. spacer 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP020859 (Lactobacillus salivarius strain ZLS006 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.719
cattgaaaacattgcctttattttattttttg CRISPR spacer tttaataaacattgccttaattttattattaa Protospacer . * . ************ ******** ** .
93. spacer 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP017108 (Lactobacillus salivarius strain CICC23174 plasmid pLS_1 sequence) position: , mismatch: 9, identity: 0.719
cattgaaaacattgcctttattttattttttg CRISPR spacer tttaataaacattgccttaattttattattaa Protospacer . * . ************ ******** ** .
94. spacer 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP017110 (Lactobacillus salivarius strain CICC23174 plasmid pLS_3, complete sequence) position: , mismatch: 9, identity: 0.719
cattgaaaacattgcctttattttattttttg CRISPR spacer tttaataaacattgccttaattttattattaa Protospacer . * . ************ ******** ** .
95. spacer 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP029460 (Clostridium novyi strain 150557 plasmid pCN2, complete sequence) position: , mismatch: 9, identity: 0.719
cattgaaaacattgcctttattttattttttg CRISPR spacer aaaaaagaacattgtatttattttatttttaa Protospacer * .*.*******. ************** .
96. spacer 4.10|2029681|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP014942 (Rhodococcus sp. BH4 plasmid, complete sequence) position: , mismatch: 9, identity: 0.719
cgtggctgcgctggccgttgcagcagtttgat CRISPR spacer ggtggcggcgctggccgctgcagcggcgggca Protospacer ***** **********.******.*. *
97. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_AP023208 (Escherichia coli strain TUM18781 plasmid pMTY18781-3, complete sequence) position: , mismatch: 10, identity: 0.808
gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer acgggcgactcgtaggcctgataagacgcgccagcgtcgcatcaggcaccga Protospacer .*. .*********************** ***************** *
98. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to MT230402 (Escherichia coli strain DH5alpha plasmid pESBL87, complete sequence) position: , mismatch: 10, identity: 0.808
-gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer ggtaacg-gtttgtaggcctgataagacgcgacagcgtcgcatcaggcattga Protospacer *.* .* ..*.************************************* *
99. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_AP023207 (Escherichia coli strain TUM18781 plasmid pMTY18781-2, complete sequence) position: , mismatch: 10, identity: 0.808
--gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer tggcaacgg--ctgtaggcctgataagacgcgacagcgtcgcatcaggcattga Protospacer *** .*. ..************************************* *
100. spacer 3.1|2002914|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP045561 (Acinetobacter nosocomialis strain AC1530 plasmid pAC1530, complete sequence) position: , mismatch: 10, identity: 0.688
acgtaacaaaacaacagcaaaatattatcgac CRISPR spacer taataaaaaaacaacagcaaaattttgaaaat Protospacer .*** **************** **. .*.
101. spacer 3.1|2002914|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP014478 (Acinetobacter pittii strain AP_882 plasmid pNDM-AP_882, complete sequence) position: , mismatch: 10, identity: 0.688
acgtaacaaaacaacagcaaaatattatcgac CRISPR spacer taataaaaaaacaacagcaaaattttgaaaat Protospacer .*** **************** **. .*.
102. spacer 3.8|2003341|32|CP029164|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP024940 (Paraburkholderia hospita strain mHSR1 plasmid pmHSR1_P, complete sequence) position: , mismatch: 10, identity: 0.688
atcgatatgtacgttagcgaggggatcacgca CRISPR spacer ggcgatatttacgttagcgacgggactgacct Protospacer . ****** *********** ****... *
103. spacer 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NC_018511 (Bacillus thuringiensis HD-789 plasmid pBTHD789-5, complete sequence) position: , mismatch: 10, identity: 0.688
cattgaaaacattgcctttattttattttttg CRISPR spacer tcgttcttacatttcttttattttattttttt Protospacer . * ***** *.***************
104. spacer 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NC_001446 (Bacillus thuringiensis sv israelensis HI4 plasmid pTX14-3, complete sequence) position: , mismatch: 10, identity: 0.688
cattgaaaacattgcctttattttattttttg CRISPR spacer tcgttcttacatttcttttattttattttttt Protospacer . * ***** *.***************
105. spacer 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP009345 (Bacillus thuringiensis HD1002 plasmid 6, complete sequence) position: , mismatch: 10, identity: 0.688
cattgaaaacattgcctttattttattttttg CRISPR spacer tcgttcttacatttcttttattttattttttt Protospacer . * ***** *.***************
106. spacer 4.1|2029132|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP053657 (Bacillus cereus strain CTMA_1571 plasmid p.1, complete sequence) position: , mismatch: 10, identity: 0.688
cattgaaaacattgcctttattttattttttg CRISPR spacer accttaaaacatttcctttgttttatttagat Protospacer .* ******** *****.********
107. spacer 4.6|2029437|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP031468 (Paraburkholderia caffeinilytica strain CF1 plasmid p1, complete sequence) position: , mismatch: 10, identity: 0.688
cgatcggtgaagaggtccgcgaaatactcact CRISPR spacer taatcggtgacgacgtccgcgaaatcggagat Protospacer ..******** ** *********** . *
108. spacer 4.6|2029437|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP014580 (Burkholderia sp. OLGA172 plasmid pOLGA1, complete sequence) position: , mismatch: 10, identity: 0.688
cgatcggtgaagaggtccgcgaaatactcact CRISPR spacer taatcggtgacgacgtccgcgaaatcggagat Protospacer ..******** ** *********** . *
109. spacer 4.10|2029681|32|CP029164|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP048816 (Caulobacter rhizosphaerae strain KCTC 52515 plasmid unnamed) position: , mismatch: 10, identity: 0.688
cgtggctgcgctggccgttgcagcagtttgat CRISPR spacer cgtggctgcgctggccgtggccgcttgccagg Protospacer ****************** ** ** ....
110. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 11, identity: 0.788
-gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer ggcac-aagtttgtaggcatgataagacgcgccagcgtcgcatcaggcatctg Protospacer **** .*..*.****** ************ *****************
111. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 11, identity: 0.788
-gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer ggcac-aagtttgtaggcatgataagacgcgccagcgtcgcatcaggcatctg Protospacer **** .*..*.****** ************ *****************
112. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 11, identity: 0.788
-gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer ggcac-aagtttgtaggcatgataagacgcgccagcgtcgcatcaggcatctg Protospacer **** .*..*.****** ************ *****************
113. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 11, identity: 0.788
-gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer ggcac-aagtttgtaggcatgataagacgcgccagcgtcgcatcaggcatctg Protospacer **** .*..*.****** ************ *****************
114. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to MT230312 (Escherichia coli strain DH5alpha plasmid pESBL31, complete sequence) position: , mismatch: 11, identity: 0.788
gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer gctcgggtctngtaggcctgataagacgcgtcagcgtcgcatcaggcttcaa Protospacer ** * *. ** ******************* **************** .
115. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 12, identity: 0.769
----gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer aaaaacagtgg----gtaggcctgataagacgcgtcagcgtcgcatcaggcatctg Protospacer .** **. ******************* *****************
116. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 12, identity: 0.769
-gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer cgtac-agatttgtaggcatgataagacgcgtcagcgtcgcatcaggcatcag Protospacer *.** ..*.*.****** ************ ***************** .
117. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 12, identity: 0.769
----gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer aaaaacagtgg----gtaggcctgataagacgcgtcagcgtcgcatcaggcatctg Protospacer .** **. ******************* *****************
118. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 12, identity: 0.769
-gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer cgtac-agatttgtaggcatgataagacgcgtcagcgtcgcatcaggcatcag Protospacer *.** ..*.*.****** ************ ***************** .
119. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP044307 (Escherichia coli strain C27A plasmid pC27A-2, complete sequence) position: , mismatch: 12, identity: 0.769
-gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer cgtac-agatttgtaggcatgataagacgcgtcagcgtcgcatcaggcatcag Protospacer *.** ..*.*.****** ************ ***************** .
120. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 12, identity: 0.769
----gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer aaaaacagtgg----gtaggcctgataagacgcgtcagcgtcgcatcaggcatctg Protospacer .** **. ******************* *****************
121. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 12, identity: 0.769
-gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer cgtac-agatttgtaggcatgataagacgcgtcagcgtcgcatcaggcatcag Protospacer *.** ..*.*.****** ************ ***************** .
122. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 12, identity: 0.769
----gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer aaaaacagtgg----gtaggcctgataagacgcgtcagcgtcgcatcaggcatctg Protospacer .** **. ******************* *****************
123. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 12, identity: 0.769
-gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer cgtac-agatttgtaggcatgataagacgcgtcagcgtcgcatcaggcatcag Protospacer *.** ..*.*.****** ************ ***************** .
124. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP044147 (Escherichia coli O157 strain AR-0428 plasmid pAR-0428-2) position: , mismatch: 12, identity: 0.769
---gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer agtgcagt---tttgtaggcatgataagacgcgccagcgtcgcatcaggcatccg Protospacer *** * .*.****** ************ *****************
125. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to CP044351 (Escherichia coli strain 194195 plasmid p194195_1, complete sequence) position: , mismatch: 12, identity: 0.769
---gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer agtgcagt---tttgtaggcatgataagacgcgccagcgtcgcatcaggcatccg Protospacer *** * .*.****** ************ *****************
126. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 13, identity: 0.75
gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer ttcgcgggctcgtaggcatgataagacgcgccagcgtcgcatcaggcacctg Protospacer . .*..********* ************ *****************
127. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 13, identity: 0.75
gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer ttcgcgggctcgtaggcatgataagacgcgccagcgtcgcatcaggcacctg Protospacer . .*..********* ************ *****************
128. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP044307 (Escherichia coli strain C27A plasmid pC27A-2, complete sequence) position: , mismatch: 13, identity: 0.75
gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer ttcgcgggctcgtaggcatgataagacgcgccagcgtcgcatcaggcacctg Protospacer . .*..********* ************ *****************
129. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 13, identity: 0.75
gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer ttcgcgggctcgtaggcatgataagacgcgccagcgtcgcatcaggcacctg Protospacer . .*..********* ************ *****************
130. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 13, identity: 0.75
gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer ttcgcgggctcgtaggcatgataagacgcgccagcgtcgcatcaggcacctg Protospacer . .*..********* ************ *****************
131. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_AP023209 (Escherichia coli strain TUM18781 plasmid pMTY18781-4, complete sequence) position: , mismatch: 14, identity: 0.731
gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer ggttcaggtccgtaggcatgataagacgcgtcagcgtcgcatcaggcatcgg Protospacer * .......******* ************ ***************** *
132. spacer 1.1|136201|52|CP029164|CRISPRCasFinder matches to NZ_CP010208 (Escherichia coli strain M11 plasmid B, complete sequence) position: , mismatch: 14, identity: 0.731
gcactgaactcgtaggcctgataagacgcgacagcgtcgcatcaggcaaggc CRISPR spacer gggtgcagattgtaggcatgataagacgcgtcagcgtcgcatcaggcatcag Protospacer * .. *. *.****** ************ ***************** .
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
108496 : 123827
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP029164|108496:123827|DBSCAN-SWA AATGAGCAGAAAAACCCAACGTTACTCTAAAAAGTTCAAAGCCGAAGCTGTCAGAACGGTTCTTGAAAATCAACTTTCGATCAGTGAAGGCGCTTCCCGATTATCTCTTCCTGAAGGCACTTTAGGACAATGGGTTACCGCCGCCAGAAAAGGGCTCGGTACTCCTGGTTCCCGCACGGTGGCTGAACTGGAATCTGAAATTCTGCAACTGCGTAAGGCGTTAAATGAAGCTCGCCTTGAGCGAGATATATTAAAAAAAGCAACTGTAGATTTTGCACAGGAGTCGCTGAAAAATACGCGTTAATCGAACAATGGCGACAACAATTTCCCATTGAAGCGATGTGTCAGGTATTTGGTGTATCCAGGAGCGGTTATTACAACTGGGTACAGCATGAACCCTCAGACAGAAAACAAAGTGATGAGCGGCTAAAACTGGAGATTAAGGTGGCACATATCCGCACTCGCGAAACATATGGAACCCGGCGGCTCCAGACGGAGCTGGCAGAGAATGGCATCATCGTTGGTCGTGACCGACTGGCACGTCTTCGTAAGGAGCTAAGGCTACGCTGTAAGCAGAAACGCAAGTTCAGAGCGACTACGAACTCGAACCACAATCTGCCAGTTGCGCCAAATCTGCTGAACCAGACGTTCGCTCCTACAGCACCAAATCAGGTCTGGGTGGCGGACCTGACGTATGTTGCCACACAGGAGGGATGGTTGTACCTCGCTGGCATCAAAGATGTTTATACGTGCGAAATTGTCGGCTACGCCATGGGAGAGCGCATGACAAAAGAGCTGACAGGTAAAGCCCTGTTTATGGCGCTCAGGAGCCAGCGCCCACCTGCCGGGCTAATCCACCACTCTGATCGAGGTTCACAGTACTGCGCATACGATTACCGGGTCATACAGGAGCAGTTTGGTCTGAAAACATCAATGTCGCGTAAAGGTAACTGTTACGACAACGCTCCGATGGAAAGCTTCTGGGGAACGCTGAAAAATGAGAGCCTGAGCCACTATCGTTTTAATAACCGGGATGAAGCCATCTCAGTAATACGGGAATACATTGAGATTTTCTACAATCGTCAGCGTCGTCACTCTCGTCTGGGGAATATCTCCCCGGCAGCCTTCAGGGAAAAATATCATCAGATGGCTGCTTAAAAAAAGAACAAATGGTAGTGTCCGCTATTGCCAGTACACCTCAACATTCCACCATGCATTCCGATTAACGCCGCATAGCCAGTTGAACTTTGCTACTTTGTGAGAGGTAGTACCTTCTATCCAGTGCGAATTTAATTAATGGAATAAATGATTATGAGTGAAAATGATACAATCCCAAAGAAGTCTACAAGTCAGATTAACAAAGCGGTATTCTTTACATCTGCTTTGCTAATTTTCCTTCTTGTCGCCTTTGCCGCCGTATTCCCGGATGTCGCCGACAAAAATTTTAAACTACTTCAGCAACAAATCTTCACGAATGCCAGCTGGTTCTACATCCTTGCTGTGGCCCTGATTTTACTGAGTGTCACGTTCCTTGGACTCTCACGCTACGGTGATATCAAGCTGGGCCCGGACCATGCGCAGCCTGATTTCAGCTACCACTCCTGGTTTGCGATGCTTTTTTCGGCAGGGATGGGGATCGGCCTGATGTTCTTTGGCGTTGCCGAACCTGTAATGCATTATCTTTCGCCACCCGTTGGCACTCCAGAAACCGTTGCGGCAGCCAAGGAAGCAATGCGTCTGACCTTTTTCCACTGGGGACTGCACGCATGGGCAATTTATGCCATTGTGGCGCTGATTCTGGCGTTCTTCAGTTACCGTCACGGTCTGCCTTTAACTCTGCGCTCCGCACTCTATCCCATTATTGGCGATCGCATATACGGACCTGTAGGACATGCGGTTGATATTTTCGCTGTTATAGGCACGGTCTTTGGCGTTGCGACATCACTGGGTTACGGTGTTTTGCAGGTGAATGCCGGTTTGAACCATCTTTTCGGGGTGCCCATCAATGAAACGGTGCAGGTAATTCTGATCGTGGTCATCACGGGGTTAGCGACGATTTCAGTGGTGTCCGGTCTGGATAAGGGAATACGTATCCTGTCTGAACTCAATCTGGGTCTGGCTTTGTTGCTCCTGGCGCTGGTCCTGTGTCTGGGACCAACCGTGCTTCTGCTGAAGTCATTTGTGGAAAATACGGGCGGTTATCTTTCGGAACTGGTGAGTAAAACGTTCAACCTTTACGCGTATGAGCCCAAGTCGAGCAACTGGCTGGGGGGCTGGACATTACTGTACTGGGGATGGTGGCTTTCATGGTCGCCGTTTGTGGGGATGTTCATCGCACGGGTCTCCCGCGGGCGAACCATTCGCGAGTTTGTCACCGGCGTGCTGTTTGTTCCCGCGGGTTTTACGCTAATGTGGATGACGGTGTTTGGTAACAGCGCGATCTATCTCATTATGAACCAGGGGGCCACAGACCTCGCCAATACTGTTCAGCAGGATGTGTCGCTGGCCCTGTTTAATTTCCTGGAGCATTTCCCGTTCTCTTCTGTGCTGTCATTCATTGCAATGGCGATGGTCATCGTCTTCTTTGTAACGTCTGCTGATTCGGGGGCAATGGTTGTGGATACTCTGGCATCAGGTGGAGTGGCAAACACACCCGTCTGGCAGCGAATATTCTGGGCCTCGCTCATGGGCATTGTTGCAATTGCGCTTCTCCTTGCCGGAGGGCTAAGTGCGCTGCAAACGGTGACAATAGCGAGTGCATTGCCCTTCTCAGTGATCTTACTAATATCCATATACGGACTTTTAAAAGCTTTGCGCCGGGATTTGACCAAGCGTGAAAGCCTGAGCATGGCGACAATTGCTCCTACGGCTGCACGTAACCCAATTCCTTGGCAGAGAAGGTTACGCAATATCGCGTATCTGCCGAAGCGATCTCTTGTGAAACGTTTTATGGACGACGTTATCCAGCCCGCCATGACGCTGGTTCAGGAGGAACTGAACAAGCAGGGGACGATAAGCCACATTAGTGATGCAGTCGACGATCGTATTCGTCTTGAAGTCGATTTGGGCAACGAGCTGAATTTTATATATGAAGTGAGGCTTCGCGGGTATATCTCACCGACCTTCGCGCTCGCCGCAATGGATAATGATGAGCAGCAGAGTGAACAACATCGATATTATCGCGCTGAGGTTTATCTCAAAGAAGGCGGTCAAAATTATGATGTGATGGGCTGGAACCAGGAACAGCTGATTAATGACATACTGGACCAGTACGAAAAACACCTGCACTTCCTGCACCTGGTTCGTTAATAGCAACATGCCGTCCTGGGGGCGGCAATTATTATCTCGGCCGCAATATGAGGGAATGCAGAATGATTTCACGCTGGAAATGGATGCTGAAGCAGACAATTAAAAAACTATGGTTCAGGGCAACGTTATTCGCAATTGTCGCGATAATAACGGCCCTTTTATCAATTCTTTTTAAATCAATGATACCTGAGTCGGTTTCCGTGAAGGTTGGTGCGGAAGCAGTCGATAACATTCTGAACATACTGGCATCGAGTATGCTGGCAGTGACCACATTTTCGCTGAGTATCATGGTCACAGCCTACGGTTCAGCCACTACTAATGTGACTCCCAGAGCTACGCGTTTAGTTGTTGAAGACGTCACCACACAAAATGTACTGGCCACCTTCATCGGTTCTTTTCTCTTGAAGTGGTCAACAAAAACTGGCCACCGCGTTAGAGTTTTTCCAGTATCGATTTTCCGATTCGTTTGGGGTAACCCACCGTTATATTCGTGCGGTCTTAGTGCGCTGTAATATCCAACGATATAGTCCGTTATGGCGTGAGCTGCCTCGCTGAAGCTTACGTAACCCACCACCGGCATCCATTCGTTCTTCAGACTCCTGAAGAAGCGTTCCATTGGGCTGTTATCCCTGCAGTTTCCGCGCCGGCTCATACTCTGTCTGATCTGGTATCGCCACAATAACTGCCGGAACTGCCTGCTCGTATAATGACTGCCCTGATCGCTGTGGAACATCACCCCGCCGGGCTTACCACGGGTTTCCCATGCCATTTCCAGCGCTTTCATGGTGAGCCTGCTGTCCGGCGAGAACGACATGGCCCAGCCCACTGGTTTTCTTGCGAACAGGTCGAGAACAACGGCGAGGTACGCCCAGCGCTTACCCCTCCAGATATAGGTCACATCACCGCACCACACCTGATTTGGCTCGGTCACGGCGAACTGCCTTTCAAGGTAGTTAGGGATAGCAACATGTTCATGACCACCACGTTTATACCGGTGAGTCGGCTGCTGACAGCTGACCAGCCCCAGCTCTTTCATGAGCCTGCCAGCAAGCCAGCGTCCCATCTGGTAGCCTCTCCGGGTTGCCATTGTGGCGATGCTTCTTGCTCCGGCCGAACCGTGGCTGATGCCATGTAGCTCAAGTACCTGACTGCGTAATACAGCCCGTCTGCCGTCTGGTTTTTCAGGACGGTTTTTCCAGTATCTGTAGCTGCTGCGATGAACCCCGAACACATGGCAGAGTGTGACCACAGGATAATGCGCTCTGAGTTTCCCGATTATCGAGAACTGTTCAGGGAGTCTGACATCAAGAGCGCGGTAGCCTTTTTAATATTTCATTCTCCATTTCAATGCGTTGTAGCTTTTTCCTCAGCTTACGTATTTCGATTTGTTCTGGTGTTATCGGAGAGGCTTTTGGTGTTTTGCCCTGACGCTCATCACGCAGTTGTTTGACCCATCTTGTCATTGTGGAAAGGCCAACATCCATAGCTTTCGCGGCATCTGCCACCGTGTATTTCTGGTCAACAACCAGTTGAGCGGATTCGCGTTTAAACTCTGCGCTGAAATTTCTTTTTTTCATTGGAGCACCTGTGTTGTTCTGAGGTGAGCATATCACCTCTGTTCAGGTGGCCAAATTCAGTGTGCCACTTCAGCCCTGATTCGACTTTTACAATGACTTATCTGGACGCATGAAGAACGTCAGGAATATACCAATAACAAGCAATGCGACGGAACCGTAGAATGGTAAACTCCAGTTGCCTGTTTTGTCGATAATAATGCCAAAGGCGATAGGTGAAATAATGTCGGCGACAGCGGAACCGGCGTTCATCAATCCGCTGGCAATTCCCACATACTTCGGCGTAATATCCATCGGGACAGCCCGGATAGGCCCAATGGTCAGCTCATCGGCCAGCAATGTCGTGAACTGTGAGACGAAACAGCGTACACAGTTTGAATGTATCTACTTTTCCCAATACTGGGCGAAGGGAGATTTTATTGCGAAGCGCGCGCCAATTGGTCAGTGGGAACCTTATTCGGAAGAGTCACTACTTGGCATCATCGTCACCAGCGTCTGTCGTATTAAAGTCGCGATGCTAAAACCTGAACCTCCCAGAGATCCCCATATCCCGCTGATGGGGGATTTTAACTAAAACTTAGCGCTATCGCTGCGTATCGTAACTATCCCGTATTACGTAGCGTAAATTATGACGCCATATGGGGAACAAATACACAGAGGAAGAAAAAGATTAGATTGGTTTTAAGGGATGTTTTTATCATAAATGGCACTGCTAAAATTATGAGATTATCGAAGATAAAATTCTTGATCATACATAAAAGGCCTCCTGAATTAAATCAAGAGGCGTATAGTTTTTACATTACTGATGTATGTTTAATGATCTCACTGATTTTCACTTTCCTGTCCTGCGATCTTGAAAGAGTGGCAGCATCAGCAGTAGCAATCGCCGACATGGCCGCCTCACCATTGAGAAGGTCAATATAATCTTCTTCAGGTTTTGCACCACAGAGGATATTATGGAGGAATAACGTCTCCTTTCTTATTAAACTGGCAAGCCATAATGGTGTTTTTTTTCCTGGATGACCATATGCTATAGCGCCATCCATTTCTGAGGTCATATTGCCTTTCCGACGATCATCATCTTCTTCTTGTGTTTCATGGACCAAAAAATGCTTTGTCTGACCGCCAATCCTAAGTGACCCTGCTGTTTCTTGCATATCAATTTTAATAGAGCCTTTAGTTCCATTGATGATGACATAATGTTCCGGCCAGTTAAATGCACTCCCCCACTCTAAGGTTGCTAGTTTTCCTGACGGGAATTCCAAGGTCATAAATAACATATCATCTTCATTGCCAAATCCTGGACCAGAATGGGCCAAATTTCCACCAATCATAGTAACCGTCTCTGGTATTTCTCCAAGTAAATGCTGAACACAATCTAACTCATGTATATGATGATATAGATGTCCACCAGATTGTTCTTTCATCTTTTTCCAGGAAAGTCTCTCTTGTTTGTTTTCCCAGCCATTTCTCTTAGTATGACATGATAATATTTCGCCGATAACACCTTCTTTAATTAACTTCCGTGCATATTGAACCCCATTGAAAAAATTCATAATATGCCCGGCCATAAAGGTCACACCAGCTTCTTTACACGCTTTGACCATATCCACACAATCTTCATAACTTAATGCAATTGGTTTTTCACAAAAAACATGCTTCTTATTCTTTGCTGCTTTAATTACTGGTTCTTTATGCAGATAATTTGGGGTGGCTACGATCACGCAATCGACTAATTTACTTGAGACTAAAGCATCCAAGCTTGACATATTGATACACTGCAATTCACGGGCAATATTTTCTCCATTTTCAGGATCGTATACACATGTAATTTTTGCATTATCATGCATATTCATAAAACGAGCTAATTCAGCGCCAAAGTATCCAACACCAACAACGCCATAATTAATCATAAAGCCTCCAATCATTTAGCCACGGATAGTTTATAAATTTTACCTGGAATATCAAAACCAACTAACAAAATTAATAAGGCAGAGAATGCAACCGTAACAATGAATAATGAAACACCTAAGCCATAATATCCTGAAATGTATGTAGCTAATACAGGTGCGGCCATTCCTCCAGTTGCCCCTAAGTTATAAATAAGACCGGTCCCTAATCCTCTTAATTTTGTTGGAAAGTAATCATATATAAATTTTGGAACCAACCCTGCAATACCTAAATTTGTAAACATTAATCCAAAGAGACATAATCCTATAAGAGAAGAGTTTTTCACAGAAATAAAAAAAAGAGGACAAAGGAAAATAAATGAAGTTATTAGACCGACTACAAAGGCTTTTTTTACACCAATCTTATCACCAACAAAACCAAAAAATATTGTACCTGTCAGTGTTCCTAAACCTGCTATTGTCATCAGAGTTGAAATGACCACTGTATTAACTCCATTATCTGCCAGGTAGGAAGGAAGTAGTCCGTTTATCGGCCAGTTTGCACCAAATAGACAAAAACAGACGAGGAAAACGATCATAGAGATTGAAAGATGTGGTTTTCTGAAGACAGACAAAAATGTTGATTTATCCTTATATTTATCTTCAATCCACTCCTGACTTTCTGGAGCACTTTTTCTGATCCAAAGAACTAGTAAAACTGGTAACAGGCCTATAAAAAAAGAGTTTCTCCATCCATATACTTCAGCAAACTGAGGGATTATTTGTGCCGCAATAATATTTCCAACAGAAAAACCACTTACCAAAAAAGCACTAGCTTTAGATTGAAGATTTTTAGGCCAACTTTCTACCGCATAAGTTGAAGCACATGCATATTCACCAGACATCCCTAAGCCAACAATAAAACGGCAAACTGCGAGCATATATAAGTTTGTAGCAATACCGCTAAGGCCTGTTCCGACTGAGTAAATGAAAATTGCCCACATCATCATTGGCTTACGACCATATTTATCAGCCATGGCACCAAAAAAACCACCTCCAATAGGTCTGGCTATGAAGGCCACTGTCCCTATTAAAGTAGCCTGAATATCCGTAATGCCAAGATCTGCTTTTATAATATGAAGAATGTAAAATATCATCATAAAATCAAAGCCATCAAATACATATCCAAGCCATGCGGAAAAAAGAGCTTTCCGTTGTGGTGGATTAACTTGTTTATACCATGCTGTTGCCATATTTGTTTCCTGTGTTATTCTAATATAGCCCTTAAAGTAAATATTAGGGAGCCTCAACAATATCCAGAAATAATTATATTGTCTTCAAATACGCCAGTACTATTGATTGTTATCAATACCTAATCAGTACAATCTTTTCCTCCATTTCTTCTTAACATATTTCACTCTATGAAAGATAAAACTGATACCCACCTACAGTTTTGGGTCACCAACAAAAAACAATGTTCGCCAGATATTCCACCACTGGCACAGTTCAACCCTGCTTCAAAAATTGACTTATTCATTGTGCTCATTTTCGTAGTCCGTTCAGGTTGAATAAAACTCCCTTCGTCAAATGAGGAAAGACTGAGAGGTTATTACTTTAGCTTACGTTATAGCTGTTTTCCTTTGCAGTATTTCTTTGCTTTTCTGTATACCTTTATACCTGTTATACCAGATCAAAAAACAAGCAATCCACATAACAAAACGCGTTTTGTTACTGATGTCACAAATTGAGCATATTTTTGTAGCTGATAGTTTGTTACAAACACCAGAACCAAGATTAATGCCGATCAGTTAAGGATCAGTTGACCGATCCAGTGGCTGTGTAAGAATCCGGAAACGCTCACTTGTTTCCGGATTTTTTTATGCACATTGGACAGGCTCTTGATCTGGTATCCCGTTACGATTCTCTGCGTAACCCACTGACTTCTCTGGGGGATTACCTCGACCCCGAACTCATCTCTCGTTGCCTTGCCGAATCAGGTACTGTAACGCTACGCAAGCGCCGTCTTCCCCTCGAAATGATGGTCTGGTGTATTGTTGGCATGGCGCTTGAGCGTAAAGAACCTCTTCACCAGATTGTGAATCGCCTGGACATCATGCTGCCGGGCAATCGCCCCTTCGTTGCCCCCAGTGCCGTTATTCAGGCCCGCCAGCGCCTGGGAAGTGAGGCTGTCCGCCGCGTGTTCACGAAAACAGCGCAGCTCTGGCATAACGCCACGCCGCATCCGCACTGGTGCGGCCTGACCCTGCTGGCCATCGATGGTGTGTTCTGGCGCACACCGGATACACCAGAGAACGATGCAGCCTTCCCCCGCCAGACACATGCCGGGAACCCGGCGCTCTACCCGCAGGTCAAAATGGTCTGCCAGATGGAACTGACCAGCCATCTGCTGACGGCTGCAGCCTTCGGCACGATGAAGAACAGCGAAAATGAGCTTGCTGAGCAACTTATAGAACAAACCGGCGATAACACTCTGACGTTAATGGATAAAGGTTATTACTCACTGGGACTGTTAAATGCCTGGAGCCTGGCGGGAGAACACCGCCACTGGATGATACCTCTCAGAAAGGGAGCGCAATATGAAGAGATCAGAAAACTGGGTAAAGGCGATCATCTGGTGAAGCTGAAAACCAGCCCGCAGGCACGAAAAAAGTGGCCGGGACTGGGAAATGAAGTGACTGCCCGCCTGCTGACCGTGACGCGCAAAGGAAAAGTCTGCCATCTGCTGACGTCGATGACGGACGCCATGCGCTTCCCCGGAGGAGAAATGGGGGATCTGTACAGTCATCGCTGGGAAATCGAACTGGGATACAGGGAGATAAAACAGACGATGCAACGGAGCAGGCTGACGCTGAGAAGTAAAAAGCCGGAGCTTGTGGAGCAAGAGCTGTGGGGTGTCTTACTGGCTTATAATCTGGTGAGATATCAGATGATTAAAATGGCGGAACATCTGAAAGGTTACTGGCCGAATCAACTGAGTTTCTCAGAATCATGCGGAATGGTGATGAGAATGCTGATAACATTGCAGGGCGCTTCACCGGGACGTATACCGGAGCTGATGCGCGATCTTGCAAGTATGGGACAACTTGTGAAATTACCGACAAGAAGGGAAAGGGCCTTCCCGAGAGTGGTAAAGGAGAGGCCCTGGAAATACCCCACAGCCCCGAAAAAGAGCCAGTCAGTTGCTTAACTGACTGGCATTACAGAACCAAGATGCCTATTTCGTTTCAGCATCAACTGTTAAATTATCTGGTTGCCAACGAGAATACTGAGCTTTTTTGTCCCCAAGACTGACCGTCAACGCCACAATCTTTGACTGTTTGACATTGTCATGCCCGCAGGCCAGCGGATCTAAAGACATTAGACTGCTTGCATCAGCCAGTTGCGTATCTGGCTATGTCCCTTAATTGGTAAGGGTTCGAGGAACTCAACATTAACGCATTTTTTAGCGCATCACCGACAGTGTATATGAACGATTGCACTCTGACGATGGTTGATTTATTTGGTTGCGTAGGGGCTAGCTAGTCTGGTACCGGAGGCTAGATCTGAAGTACTGACCCCAGGAATAACGCCGGGCTAACTCATATTCTGACCCATTGTCAGCCCGCGTGTTTACTGCTCTAGCTCATTTCCTGTTGTCAGGCGTTCAATTCCCAACTGCAGGTTATGGCCCGGTGCACAAACACCTGACGGCAGGTATCACTTACCCGAAGAAGGTAAAGCCCTCAAGAGGCAGTAACTCAGCCAACCTCTCCTCCGGCCACTCCGGCAGACGCGTCAGGACGTCCGTCAGCCAAGCATGTGGCTCCAGACTGCGGTTCCCAGAAGGCTCATTATCTGTGCCGCGCGGTTCCCGGCCACCAGTGAACCAGCGAACAACCAGGCCTTTCGTCCCATAACGACCGGACGGATAGCACGCTCGCAGATATTGTTGTGTGTGCCTTGAAATGCACGATCTTTCCGTCGGTGAAAGTCCGCCCCGGCTAACTCTCGCTCCGACTGGAAGGACAAGAAGCGGTCCAGGAGGTAACAAAAGGGCTGAATCTCTCCCTGAAACGACTTCTTCAGGAAGTCAACGGGTTATGCGGGCCGTAACGTGAGTGAACACTGAGCAGCCATCGAAATGCCGTCGAGGGTGCTGACACGCCACAATAACGGGGAAGGCCGACGATGACCGGGAAGAGAGTGAGAAAGGCATCGGTCATACCCACCGGGGTTGTGGTGTCAGCATGCATAACAATTGATCGATCGTGCAACAAGAGAAATCCATAACGGTGTCTGTCCGGGTGACGGGCAACAACGCCTCGCGAGGGGGTGAATCGGTTGTTATGGATGGCAGATGGGGTGTAGTAGTGAAAATACTGAGTAATGTCAGAGGAGCGAAGGCCCCCTCTTGCAGTACCAGTGTAACATGCAGGAAGAGCCCGGCGATTGCCATCAGCCTCACAACCCCGCTAGAACCGGTTCAGAACCTACAGAAAACACTACAGGAGAAAGCAAAGTGAACCGCCCCGTGTTTCCTGTTTATCATTTTCTGGTCAGTGCCGCGATTCTGGTATTCGTGGTGATTTTCTGGCGGACACACCATCGCGACCACCGTAACTGGCTGGCTCTACGGCTATTCGTATTATGCTCTGTAAACCGTTGGCCATTGCGGATGGTTAAAGGAACAGTTGTGGGGACAACTGACACATTACGTGAAATGCAGCGTTATGAACAACTGAGTAGCACGGGGCTGACAACTGGAAAATCCAGCCGGGAGCACCGTTATATGATACGATTGTTATTGTCACTGGTGAGAGTGTGCGCAGGGATTATATGTCAGTGTATGACTATCCCGTACCAACCACACCGTGGCTGAATACGGCACCCGGTTTATTTATTGACGATTATACCTCGACAGCCTCCAGTACAGTGTCTTCCCTGAGCCGGACACTGATTTATGACTATGAGCAGAACCCTGATTCCGGCAACAATGTGGTGGCGCTGGCAGCAAAAGCAGGATACAGCACATGGTGGATATCCAATCAAGGAAAACTGGGGGGCATGACACACGCATCTCTGTTATTGCTTCTGATGCGGAGCATGCCACTTTCCTCAAGAAAGGCAGCTTCGCTTCCCGTAAAACAGATGACAAACTGTTGTTACAGGAAACAGAACGTGCGCTGGCGGATACATCCTCTCCGAAGATAATTTTCCTGCACATGATGGGTTCTCATCCAAATCCGTGTGACAGCCTTAACTCCTAGCCGAATAATTACCTGGAGCAGTATCCCCGAAAAATTGCCTGTTACCTCGCCAGCATCAGTAAACTGGATAACTTTCTTGGCCAGCTTGATGGTATCCTTCGCCGGTACTCACGTCATTTCGCCATGCTTTACTTTTCTGGCCTTGGGCTGTCGGTCAGCGACAGTGCCAATCCTGTTCATCATTATGGTCATGTGCAGGGAGGCTACAGTGTGCCACTGATTATTACCGCCAGTGACATAACGTCTCATCAGCCCGTCAGTAGAAAAATCAGTGCCCGTCATTTCGCAGGTATTTTTCAGTGGATGACCGGTATTTGTACTGAAAATATACCGCCATTCAATCCGCTGACAGACGAAGATAACTAACCTGTTATGGTTTTTAAGGGAGAGAGGAATATACCGGCAGACAGTTTGAAACCTCAGCCACTTATTCTTCCTGATCACAGGTAATCATATATGGCAGACTGTAATACAGTCGCTTGTACTGAAAAGTATTCTGTATAAAATATCCGTATTCATATCAGCACAAGGGACCATCAAATCCCTGCAGCACTCCTTACGTCAGTTATCCCGCATCACCCTGTGAGTACAGGATTTTTTATGAGGTTACTAGACTGGCCCCCTGAATCTCCAGACAACCAATATCACTTATTTAAGTGATAGTCTTAATACTAGTTTTTAGACTAGTCATTGGAGAGCAGATGATTGATGTCTTAGGACCGGAGAAACGCAGACGGCGTACCACACAGGAAAAGATCGCAATTGTTCAGCAGAGCTTTGAACCAGGGATGACGGTCTCCCTCGTTGCCCGGCAACATGGTGTAGCAGCCAGCCAGTTATTTCTCTGGCGTAAGCAATACCAGGAAGGAAGTCTTACTGCTGTCGCCGCCGGAGAACAGGTTGTTCCTGCCTCTGAACTTGCTGCCGCCATGAAGCAGATTAAAGAACTCCAGCGCCTGCTCGGCAAGAAAACGATGGAAAATGAACTCCTCAAAGAAGCCGTTGAATATGGACGGGCAAAAAAGTGGATAGCGCACGCGCCCTTATTGCCCGGGGATGGGGAGTAAGCTTAGTCAGCCGTTGTCTCCGGGTGTCGCGTGCGCAGTTGCACGTCATTCTCAGACGAACCGATGACTGGATGGATGGCCGCCGCAGTCGTCACACTGATGATACGGATGTGCTTCTCCGTATACACCATGTTATCGGAGAGCTGCCCACGTATGGTTATCGTCGGGTATGGGCGCTGCTTCGCAGACAGGCAGAACTTGATGGTATGCCTGCGATCAATGCCAAACGTGTTTACCGGATCATGCGCCAGAATGCGCTGTTGCTTGAGCGAAAACCTGCTGTACCGCCATCGAAACGGGCACATACAGGCAGAGTGGCTGTGAAAGAAAGTAATCAGCGATGGTGCTCTGACGGGTTCGAGTTCCGCTGTGATAACGGAGAAAAACTGCGAGTCACGTTCGCGCTGGACTGCTGTGACCGTGAGGCACTGCACTGGGCGGTCACTACGGGCGGCTTCAACAGTGAAACAGTACAGGACGTCATGCTGGGAGCGGTGGAACGCCGCTTCGGCAACGAGCTTCCGGCGTCTCCAGTAGAGTGGCTGACGGATAATGGTTCATGCTACCGGGCTAATGAAACACGGCAGTTTGCCCGGATGTTGGGGCTTGAACCGAAGAGCACGGCGGTGCGGAGTCCGGAGAGTAACGGCATAGCAGAGAGCTTCGTGAAAACGATAAAGCGTGACTACATCAGTGTCATGCCCAAACCAGACGGGTTAACGGCAGCAAAGAACCTTGCAGAGGCGTTCGAGCATTATAACGAATGGCATCCGCATAGTGCACTGGGTTATCGCTCGCCACGGGAATATCTACGGCAGCAAGCCAGTAATGGGTTAAGTGATAACAGGTGTCTGGAAATATAGGGGCAAATCCAGTTACTTTTTTATTGAATGTATATAAATAATGAATAACCGCGGATTCTGATAATCCCCCCTGCAGGGAGCGGTAAATAGTAAATACATCAGTTAATACCGTTTGTCTTTTTTAACAAAGAAAATAATCCTAGAATAATTCCCGGGCATTTGCCCGGGATGATTACTGTTTTAATGGATTATTAATCTTTGCATATTCAAATGGACTGATAAACCTTTCTCTGTTTACATCCAGAAAATCGGCCCACCACTGTAGCATCAATCGCCGTTCTTCCAGATGCTCTGCTTTATGGATATACGCGGCCCTCACTGAATTTCGCGCCATGTGGCTCATCTGGCGTTCAACAGCATCACGAGACCACAGACCTGATTCGACCAATGAACTACAGGCCATTGTTCGAAAGCCATGACCACAAACCTCTACTTTTGTATCATACCCCATGACCCGTAACGCACTATTTACCGTATTCTCACTCATGGGTTTGTGCGAATCGTGATCACCAATAAATATCAAGTCATGGGCCCCATAAAACTGTTTTATCTGCTTTAAAATTGCAAGAGCTTGCGTTGAAAGAGGCACTAGATGCGTTGTACGCATTTTTGAGCCTCTATGGGAATGTTTCACTCCAGGAATAGGCTCCCGCTCCGGTGGGATAGTCCATATAGACGCTTCGAAATCGATCTCTGACCAACGAGCAAAACGCAGCTCACTGGACCGAATAAAGATCAGCAAAGTGAGTTCTATCGCCCATCGGGTTAGCGGCCTACCAGTATAGCTATCTATTTTTGTAAGCAACTCAGGGATGCGCTTTAATTCAAGCGCGGGACGATGTTGTCGATTACAGGAAGCAACCGCCCCAGCCATCTCTTGTGCCGGGTTATAATCAATTAACCCACTTTGCACTGCATAGCGCATGATGGCTGTAGTGCGCTGCTGAAGACGAGCGGCCACTTCAAGACGTCCAGACATTTCTACGGCCTTAATAGGTGCTAATAAATCTCGAGTTTTTAACTCAGCGATATTACGTTCACCAAGCGCTGCAAAAAGATTATCTTCAAGACTTTTTAGCACACGATGGGCGTGATCTTCAGACCACTTTTTATTGGTGCCATGCCACTCAATCGCGACTTCTTTAAAGGTTCGTGCTTTACTCTGTTCAACCTTATCATTTTTCTTTTTGTCTCCCGGATCGACGCCATTCGCAAGCAGCTTACGCGTCTCGTCACGACGTACTCTGGCATCCGCTAGTGTGATTTCAGGATAAACCCCAAGTGCCAGCATTTTTTGCTTTCCCTCATAACGGTACTGCAAACGCCAGTACTTAGAACCATTTGGATGGACAAGCAGATGCAT
Protein sequences of DBSCAN-SWA_1 >CP029164|108496:123827|115302_116520_-|AWH67973.1|DBSCAN-SWA MATAWYKQVNPPQRKALFSAWLGYVFDGFDFMMIFYILHIIKADLGITDIQATLIGTVAFIARPIGGGFFGAMADKYGRKPMMMWAIFIYSVGTGLSGIATNLYMLAVCRFIVGLGMSGEYACASTYAVESWPKNLQSKASAFLVSGFSVGNIIAAQIIPQFAEVYGWRNSFFIGLLPVLLVLWIRKSAPESQEWIEDKYKDKSTFLSVFRKPHLSISMIVFLVCFCLFGANWPINGLLPSYLADNGVNTVVISTLMTIAGLGTLTGTIFFGFVGDKIGVKKAFVVGLITSFIFLCPLFFISVKNSSLIGLCLFGLMFTNLGIAGLVPKFIYDYFPTKLRGLGTGLIYNLGATGGMAAPVLATYISGYYGLGVSLFIVTVAFSALLILLVGFDIPGKIYKLSVAK >CP029164|108496:123827|117146_118475_+|AWH67974.1|transposase|DBSCAN-SWA MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEEIRKLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPGGEMGDLYSHRWEIELGYREIKQTMQRSRLTLRSKKPELVEQELWGVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLITLQGASPGRIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >CP029164|108496:123827|114004_114130_-|AWH67972.1|DBSCAN-SWA MIKNFIFDNLIILAVPFMIKTSLKTNLIFFFLCVFVPHMAS >CP029164|108496:123827|122636_123827_-|AWH72403.1|integrase|DBSCAN-SWA MHLLVHPNGSKYWRLQYRYEGKQKMLALGVYPEITLADARVRRDETRKLLANGVDPGDKKKNDKVEQSKARTFKEVAIEWHGTNKKWSEDHAHRVLKSLEDNLFAALGERNIAELKTRDLLAPIKAVEMSGRLEVAARLQQRTTAIMRYAVQSGLIDYNPAQEMAGAVASCNRQHRPALELKRIPELLTKIDSYTGRPLTRWAIELTLLIFIRSSELRFARWSEIDFEASIWTIPPEREPIPGVKHSHRGSKMRTTHLVPLSTQALAILKQIKQFYGAHDLIFIGDHDSHKPMSENTVNSALRVMGYDTKVEVCGHGFRTMACSSLVESGLWSRDAVERQMSHMARNSVRAAYIHKAEHLEERRLMLQWWADFLDVNRERFISPFEYAKINNPLKQ >CP029164|108496:123827|113694_113952_+|AWH67971.1|DBSCAN-SWA MVSSSASNVVNCETKQRTQFECIYFSQYWAKGDFIAKRAPIGQWEPYSEESLLGIIVTSVCRIKVAMLKPEPPRDPHIPLMGDFN >CP029164|108496:123827|111748_111907_+|AWH67970.1|holin|DBSCAN-SWA MTYWTSTKNTCTSCTWFVNSNMPSWGRQLLSRPQYEGMQNDFTLEMDAEADN >CP029164|108496:123827|114172_115291_-|AWH72402.1|DBSCAN-SWA MINYGVVGVGYFGAELARFMNMHDNAKITCVYDPENGENIARELQCINMSSLDALVSSKLVDCVIVATPNYLHKEPVIKAAKNKKHVFCEKPIALSYEDCVDMVKACKEAGVTFMAGHIMNFFNGVQYARKLIKEGVIGEILSCHTKRNGWENKQERLSWKKMKEQSGGHLYHHIHELDCVQHLLGEIPETVTMIGGNLAHSGPGFGNEDDMLFMTLEFPSGKLATLEWGSAFNWPEHYVIINGTKGSIKIDMQETAGSLRIGGQTKHFLVHETQEEDDDRRKGNMTSEMDGAIAYGHPGKKTPLWLASLIRKETLFLHNILCGAKPEEDYIDLLNGEAAMSAIATADAATLSRSQDRKVKISEIIKHTSVM >CP029164|108496:123827|119678_120146_+|AWH67978.1|DBSCAN-SWA MQYQCNMQEEPGDCHQPHNPARTGSEPTENTTGESKVNRPVFPVYHFLVSAAILVFVVIFWRTHHRDHRNWLALRLFVLCSVNRWPLRMVKGTVVGTTDTLREMQRYEQLSSTGLTTGKSSREHRYMIRLLLSLVRVCAGIICQCMTIPYQPHRG >CP029164|108496:123827|119559_119790_+|AWH67977.1|DBSCAN-SWA MSVRVTGNNASRGGESVVMDGRWGVVVKILSNVRGAKAPSCSTSVTCRKSPAIAISLTTPLEPVQNLQKTLQEKAK >CP029164|108496:123827|121190_122463_+|AWH67979.1|transposase|DBSCAN-SWA MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEAVEYGRGKKVDSARALIARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFRCDNGEKLRVTFALDCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNELPASPVEWLTDNGSCYRANETRQFARMLGLEPKSTAVRSPESNGIAESFVKTIKRDYISVMPKPDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQQASNGLSDNRCLEI >CP029164|108496:123827|119349_119544_-|AWH67976.1|DBSCAN-SWA MLHDRSIVMHADTTTPVGMTDAFLTLFPVIVGLPRYCGVSAPSTAFRWLLSVHSRYGPHNPLTS >CP029164|108496:123827|118502_118646_-|AWH67975.1|DBSCAN-SWA MSLDPLACGHDNVKQSKIVALTVSLGDKKAQYSRWQPDNLTVDAETK >CP029164|108496:123827|108496_109652_+|AWH67968.1|transposase|DBSCAN-SWA MSRKTQRYSKKFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARKGLGTPGSRTVAELESEILQLRKALNEARLERDIFKKSNCRFCTGVAEKYALIEQWRQQFPIEAMCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >CP029164|108496:123827|109800_111804_+|AWH67969.1|holin|DBSCAN-SWA MIMSENDTIPKKSTSQINKAVFFTSALLIFLLVAFAAVFPDVADKNFKLLQQQIFTNASWFYILAVALILLSVTFLGLSRYGDIKLGPDHAQPDFSYHSWFAMLFSAGMGIGLMFFGVAEPVMHYLSPPVGTPETVAAAKEAMRLTFFHWGLHAWAIYAIVALILAFFSYRHGLPLTLRSALYPIIGDRIYGPVGHAVDIFAVIGTVFGVATSLGYGVLQVNAGLNHLFGVPINETVQVILIVVITGLATISVVSGLDKGIRILSELNLGLALLLLALVLCLGPTVLLLKSFVENTGGYLSELVSKTFNLYAYEPKSSNWLGGWTLLYWGWWLSWSPFVGMFIARVSRGRTIREFVTGVLFVPAGFTLMWMTVFGNSAIYLIMNQGATDLANTVQQDVSLALFNFLEHFPFSSVLSFIAMAMVIVFFVTSADSGAMVVDTLASGGVANTPVWQRIFWASLMGIVAIALLLAGGLSALQTVTIASALPFSVILLISIYGLLKALRRDLTKRESLSMATIAPTAARNPIPWQRRLRNIAYLPKRSLVKRFMDDVIQPAMTLVQEELNKQGTISHISDAVDDRIRLEVDLGNELNFIYEVRLRGYISPTFALAAMDNDEQQSEQHRYYRAEVYLKEGGQNYDVMGWNQEQLINDILDQYEKHLHFLHLVR |
14 | Acinetobacter_phage(25.0%) | transposase,holin,integrase | attL 103956:103970|attR 128728:128742 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
355346 : 412633
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >CP029164|355346:412633|DBSCAN-SWA TTCATTTTTGTGTCCTGCAAGCCCCGCGATAGCCAGAGTGTACCACGCCTCCCGTGAACAACGCCGCGCTGTCCAGGGCATCGGCTCTTTTCAGAGGAATTGTTTGATGGCTACCCTCACCACTGGCGTGGTTCTTCTTCGCTGGCAACTTCTTAGTGCCGTAATGATGTTTCTGGCCAGCACGCTCAACATCCGTTTTCGTCGGTCTGATTATGTCGGGCTTGCAGTGATCAGCAGCGGTCTGGGCGTGGTTTCTGCCTGCTGGTTCGCAATGGGGTTGCTTGGCATCACAATGGCGGATATCACCGCCATCTGGCACAACATCGAGTCGGTGATGATAGAAGAGATGAATCAGACACCGCCACAATGGCCAATGATTTTGACTTGATATGTAGAAGCCTCCAAAACGGAGGCTTCTTTTTTACGGCTGGCAGATGTTTTAATCATCCACCTTAAAACAATATAACCTATTGTTTTAATGAAAAATCAGAACGGAATATCGTCATCAAAGTCCATCGGCGGCTCATTAGACGGTGCTGCCGGAGCAGACTGCTGCGGGCGAGACTGCGCGCCGCCGCTGAACTGATTGCCGCCCTGCGGCTGCTGAGGCTGACCCCAACCGCCCTGCGGCTGACCACCACCGATATTGCCACCTGCAGGAGCGCCACCACCCTGACGACCACCCAACATCTGCATGGTGCCGCCAACGTTCACCACGACTTCTGTGGTGTAGCGATCCTGACCGGATTGATCGGTCCATTTGCGGGTACGCAGCTGACCTTCGATATAAACCTGAGAACCTTTACGCAGATATTCGCTGGCCACTTCTGCCAGTTTGCCGAACAGCACAACGCGGTGCCATTCAGTCTGCTCTTTCATCTCGCCGGTCGCTTTATCACGCCAGGATTCGGAAGTAGCCAGCGTAATGTTGGCAACTGCGCCACCATTTGGCATGTAGCGTACTTCCGGGTCCTGACCCAGATTACCAACGAGAATAACCTTGTTTACGCCTCTGCTGGCCATGTTCGTGTCTCCTGAAAAAAAATCGTTCTGAATAAGTGTAAACGCGCGATTGTACCATTACCAATAGCGCTTTTACTATGTTGTGACCTCGGTTCCGGGAAACAAACCTGGCCAGACATTGTTACACAACACTCCGGGTAATGCATTCCAATACTGTATATTCATTCAGGTCAATTTGTGTCATAATTAACCGTTTGTGATCGCCGGTAGCACCATGCCACCGGGCAAAAAAGCGTTTAATCCGGGAAAGGTGAATGGATAAGATCGAAGTTCGGGGCGCCCGCACCCATAATCTCAAAAACATCAACCTCGTTATCCCCCGCGACAAACTCATTGTCGTGACCGGGCTTTCGGGTTCTGGCAAATCCTCGCTCGCTTTCGACACCTTATATGCCGAAGGGCAGCGCCGTTACGTTGAATCCCTTTCCGCCTACGCGCGGCAGTTTCTGTCACTGATGGAAAAGCCGGACGTCGATCATATTGAGGGGCTTTCTCCTGCCATCTCGATTGAGCAGAAATCGACGTCTCATAACCCGCGTTCTACGGTGGGGACAATCACCGAAATCCACGACTATTTGCGTTTGTTGTTCGCCCGCGTCGGTGAGCCGCGCTGCCCGGACCACGATGTACCGCTGGCGGCGCAAACCGTCAGTCAGATGGTGGATAACGTGTTGTCCCAGCCGGAAGGTAAACGCCTGATGCTGCTCGCGCCAATCATTAAAGAGCGCAAAGGCGAACACACCAAAACGCTGGAAAATCTGGCAAGCCAGGGTTACATCCGTGCTCGTATTGATGGCGAAGTCTGCGATCTTTCCGATCCGCCGAAACTGGAACTGCAAAAGAAACATACCATTGAAGTGGTGGTTGATCGCTTCAAGGTGCGTGACGATCTCACCCAACGTCTGGCAGAGTCGTTTGAAACCGCGCTGGAGCTTTCCGGTGGTACCGCGGTAGTTGCCGATATGGACGATCCAAAAGCGGAAGAGCTGCTATTTTCCGCCAACTTCGCCTGCCCAATTTGCGGCTACAGTATGCGTGAACTGGAACCACGCCTGTTTTCGTTTAACAACCCGGCGGGTGCCTGCCCGACCTGTGACGGCCTTGGCGTACAGCAATATTTCGATCCTGACCGCGTGATCCAGAATCCGGAACTGTCGCTGGCAGGTGGTGCGATCCGAGGCTGGGATCGCCGCAACTTCTATTACTTCCAGATGCTGAAATCGCTGGCAGATCACTATAAGTTCGACGTCGAAGCGCCGTGGGGCAGCCTGAGCGCGAACGTGCATAAAGTGGTGTTGTACGGTTCTGGCAAAGAAAACATTGAATTCAAATACATGAACGATCGTGGCGATACCTCCATTCGTCGTCATCCGTTCGAAGGCGTGCTGCACAATATGGAGCGCCGTTATAAAGAGACGGAATCCAGTGCGGTACGTGAAGAATTAGCCAAGTTTATCAGCAATCGTCCGTGCGCCAGCTGCGAAGGAACCCGTCTGCGTCGGGAAGCACGCCACGTGTATGTCGAGAATACGCCGCTGCCCGCCATCTCCGACATGAGCATCGGTCATGCGATGGAATTCTTCAACAATCTCAAACTCGCAGGTCAGCGAGCGAAGATTGCGGAAAAAATTCTTAAAGAGATTGGCGATCGCCTGAAATTCCTCGTTAACGTAGGCCTGAATTACCTGACACTTTCCCGCTCGGCAGAAACGCTTTCTGGCGGTGAAGCACAGCGTATCCGTCTGGCGAGCCAGATTGGTGCGGGCCTGGTTGGCGTTATGTATGTGCTGGACGAGCCGTCTATCGGCCTGCACCAGCGCGATAACGAGCGCCTGTTGGGTACGCTTATCCATCTGCGCGATCTCGGTAATACCGTGATTGTGGTGGAGCACGACGAAGACGCGATTCGCGCCGCTGACCATGTGATCGACATTGGCCCGGGCGCAGGTGTACACGGCGGTGAAGTGGTCGCAGAAGGTCCACTGGAAGCGATTATGGCAGTGCCTGAGTCGTTGACCGGGCAGTACATGAGCGGTAAACGCAAGATTGAAGTGCCGAAGAAACGCGTTCCGGCGAATCCGGAAAAAGTGCTGAAGCTGACAGGCGCACGTGGTAATAACCTGAAAGACGTGACGCTGACGCTGCCAGTCGGTCTGTTTACCTGCATCACAGGGGTTTCAGGTTCCGGTAAATCGACGCTGATTAACGACACACTGTTCCCGATTGCCCAACGCCAGTTGAATGGTGCGACCATCGCCGAACCGGCACCGTATCGCGATATTCAGGGGCTGGAGCATTTCGATAAAGTGATCGATATCGACCAAAGCCCAATTGGTCGTACTCCGCGTTCTAACCCGGCGACCTATACCGGCGTGTTTACGCCTGTGCGCGAACTGTTTGCGGGCGTACCGGAATCCCGTGCGCGCGGCTATACGCCGGGACGTTTCAGCTTTAACGTCCGTGGCGGGCGCTGCGAAGCCTGTCAGGGCGACGGCGTGATCAAAGTGGAGATGCACTTCCTGCCGGACATTTACGTACCGTGCGACCAGTGCAAAGGTAAACGCTATAACCGTGAAACGCTGGAGATTAAGTACAAAGGCAAAACCATCCACGAAGTGCTGGATATGACCATCGAAGAGGCGCGTGAGTTCTTTGATGCGGTGCCAGCTCTGGCGCGTAAGCTGCAAACGTTGATGGACGTTGGCCTGACGTACATTCGCCTCGGGCAGTCCGCAACCACGCTTTCTGGTGGTGAAGCCCAGCGCGTGAAGCTGGCGCGTGAGCTGTCAAAACGCGGCACCGGGCAGACGCTGTATATTCTTGATGAGCCGACCACCGGTCTGCACTTCGCCGATATTCAGCAACTGCTCGACGTACTGCATAAACTGCGCGATCAGGGCAACACCATTGTGGTGATTGAGCACAATCTCGACGTGATCAAAACCGCTGACTGGATTGTCGACCTGGGACCGGAAGGCGGAAGTGGCGGCGGCGAAATCCTCGTCTCCGGTACGCCAGAAACCGTCGCGGAGTGCGAAGCTTCGCATACGGCGCGCTTCCTCAAGCCGATGCTGTAATCGTTAAGGCCGCTTTCTGAGCGGCCTTTTCCTTTCAGAGTTGCACCAGCAATTTACGTTTTTCTTCCGGCAGCAAATTCACCGCCTGCTGATAAGACGCATCCACCAGATAATAGATTTGCGAATCCGGCAGCGAACCGTCGAGATAAACAGTGCTCCAGTGCGCTTTGTTCAGATGGCGGCTCGGACGCACATCACTGTGCTGCTGACGTAACAACTCTGCCAGCTCCGGGCTGGTTTTCAGCGAAACAGCTGGGCGATTTTCAACCTCTTTCACCATCGCAAAAAGCACATCTTCAACTTTGATCTGCGTCGCTTTCCAGTCGCTGTGAACGCTCTGTTCCGCGCCAGCTTTAGCCATGCAATATTGTAGCAACTCCGAAATGGTCATTTTTTACTCCCCTTGTAGTGTCGCGATAATGCGACGCGATCCGCCGTGGATGCGATGTTCCCCCAGCCAGATGCCTTGCCAGGTGCCGGTCTGAATACGCCCTTTATGCACCGGCAATACAAGCGATGTTCCCAGCATTGAGGATTTGATATGAGAAGGCATATCGTCTGCTCCCTCATAGTCATGCTCATAATTTCCGTTGTCGGGAACGGTGCGGAGGAAAAAACGCTCCATGTCGTGGCGTACGGTGGGATCGCAGTTCTCATTAAGTGTCAGAGAGGCGGAGGTATGTTGCAGCAACAGATGCAGTAAACCGATGTTAACGCGCGGCATATCAGCCAGCTGATTCAGAATTTCATCCGTTACCAGATGAAACCCACGAGATTTGGCGCTAAGCGTGAGCGTCTTTTGATACCACATGTGCTGCTCCTTGATAAAACTCTCTTAATCAGTTTGCAGCAAGACAGCGAAAGGATAAAGGTGTGATTAAAAAAACAGCATTCGGGAGAGTGTCACGCTCTCCCGCTCTGTCAGTATTCTGAATTGACGATCACCTCTTCACCAAACGCACCCGCTTGTGGCAAGGGTTTGTAGGTAGAGTTGGAGGCGCGCAGAATGCGGATACCACGAGCGCCGACATCGCGTGCGGCGGTAATATCGTTATCGGAATCGCCATAAAAAATTCGGATATTTTTATCCTGCAGCCATTGCGATTTTGTATTTTGCCCTGGTTTATCACCCGCAAAGATCACCGGATTCATGTTGGTGGCAGGAATATGAAAATTATCCGCCAGCGTTTTTGAAACCGTTTCTGTTTTCGTCGGGCTACGACCAGTCACAAAGAAGATCGCGTCACCGCGGCGTACATGCATATCAATCAGCTGGCGAGCGACCTCTTTTGGAATGCTAAATTCATCCCAGCCATTGTTCATTTTTTCCCAGAACACAGGATTTTTCAGATAATCTTCGCTTTCTGGCGAGAAGTTTTTTTTGCCGCGCCAGAAGCCCGGACTGGAAAAAAGTACCGTGTCATCGATATCAAACCCCACCGCCATTGGCGGACGCCCTGCGAGGCTATTTTCAATTTGTGCGACCGAAACCCAATGAATTGGTGCCTGTTCAGCAAGCCTGGCAACGTTAGTACCAGGGTTAAGCGGTGAAGGAGATGAGGCCAGGGCAACAGCGGAACTGTTTAGAGCGAACAATAAGCAAACGGCACTGATTGCCTGTGTGATCTTGCGCATATTTTTCCCTAAATAGTCAGTATGTTGAAACTTTTTATTGTGAGATTTGTTGCAAAAACCATCTGACCATAACGCCAGCAGCAGGGGAAAGGAAGGATTATCTGCGGTTTTTGTGATGAAAAGCAGGATGATTATATTTAACCCAGTAAATACGTTCCTGTCGTTTTTTGTAAAAAAAATGTTTATTACGCGATGTAAGAATACATTAATTTTGCGTTTATTATGATGAATCCTCTGCAAATGAGCGAAATTTGATAGAACATTTATCTGCTGTACGATTTAATAAACTTAATGATACAGCATTACAAAAACAATCGAAGTTTATAAAGATGATTTCTGATTGACCAGCCCCCTACCCTACAATGGTGACTAATTTTATTACTCCCGAAGGAGATGATGATATGAATATATCAAATGTGAATAGCAACAACACGACATCTTTACCTGTAGAGCTAGATACACTAAATAACAAAGGTATCTCTTATGACAAAGATTTCTCTTATGCCAAAGATCTCTTTTTATATATAGAAACACAGTTAAAAATTGCAAAAGATTTTTGTAGACCTGGAGAAGAAGTATCAAGTTCTATTGCAAGTACAGTTTTTCACGCATTTATTGATTTAGTTAACAAAATCAGGGGTAAGAAAGATTGTATGTATATTTTCACGCTTTGCTGTTTTGCCGAGGAGGTTAAAGGTGATTATTCTCATTACAGGACCTTTTTATTTGATATTGGTAATCAATATAAGGTTAAACTTACACAAAGCGGAAAAAAAGAGTTCTCTTTAACTTTAGAATTTAACGATACTATAATTGAATCTCAGATAGTCACAGGCAATAAAGCAAAGCATATTCTTGAGGATATAGAAAAATTCTATCGTAACAAACCCGATACTTATTATTAAATTTAATAATGGTTAGATGTGCAAAAATTATTATTCCACACGGTAAAACAGGGAAAAGCACTCCAGAACATTGGGATAAACTTAACAAGTTAGAACTAACAACCTTTTATTACCTCTCACGTAGAACGATGGCATCAAAAAAATCATCTATAAAGCAAGAGTGAAAGCGACGACAGTTTAATTTCACTGCAGGCTGGGTAACTCCAGCCTGCTTTCCTGCATTACATCACCGCAGCAAACGCCTTTGCCACACGTTGTACATTTGCCGTATTTAACCCAGCGACACACATGCGACCGCTGGCGATGAGATAGACACCAAATTCTTCACGTAGTCGGTCAACCTGAGCGGCACTTAAACCGGTATAACTGAACATACCGCGCTGATTAAGCAGATAATCGAAATTGCGTTCTGGTATCTCTGTGCTCAATACCTTCACCAGTTCCTGACGCATCGCCAGAATGCGAGTACGCATCTCTTCTACTTCCGCCAGCCAGCTGGCTTTCAATGCCTCGTCATTCAGCACCGCAGCCACCACCTGCGCACCAAAATTCGGCGGGCTGGAGTAGTTGCGGCGAACTGTTGCTTTCAATTGCCCCAGTACGCGGCCTGCGGCTTCGGCATCTTCACACAGAACAGAAAGTCCGCCGACGCGCTCGCCGTAAAGGGAGAAAATTTTCGAGAACGAATTGCTCACCAGAGCGGGTAATCCAGCGCTGGCAATGGCGCGAATGGCGTAGGCATCCTCTTCCATACCGGCACCAAATCCTTGATAGGCAATATCAAGAAATGGGATAAGCTCGCGGGCTTTGAGAATTTCAATCACCGCATCCCATTGGTCATTAGTGAGATCGGCACCCGTTGGGTTGTGGCAACATGGATGCAGCAACACAATACTGCGGGCAGGTAATGTTTTCAGCGTCACCAACAGGTCATTAAAGCGCACGCCGTTAGTCGCTTCGTCATACCAGGGGTAAGTACTTACTTCGAATCCAGCCCCGGCGAATATTGCTACGTGGTTTTCCCATGTAGGATCGCTGACCCAGACGCCTGATTCCGGGAAGTAGCGTTTCAGGAAATCCGCGCCCACTTTCAATGCCCCTGAGCCGCCAAGGGTTTGAATGGTTGCTACGCGCTGTTGTTGCAGTACCGGATGGTCGGCACCAAACAGCAGCGGCGCAATAGCATGGCGATAGCTGTTAAGCCCTTCCATCGGTAAATAAAGCGAAGCGCCATGAGGCTGCGCATTCAGGCGCGCTTCCGCATCCGCCACGGCTTTCAGTTGTGGAATAATTCCGTCTTCGTTGTAGTACAGACCGATACTTAAATTCACTTTGTCGCTGCGAGGGTCTTCTTTAAAACGCTCCATAAGCGTAAGAATCGGGTCGCCAGCGTAGGCGTCAACTTTTTGAAACACGCGATGGTTCTCCAGGTTTACGGGCAGGTGGTTAAAACACAATAAACCGGAAGAAGGCGAAGATCGAGTGGATGTTCAGGGGCGAACGGCAATTAGCAACAGAGTGAGACTCATGACAAACGTACATCCGCCAGAGGCACGGCCTTCATAAGAGCATGAAAGAATATCAACTTATTGAATTGGTAGGATTTTATTGGCCGGATAAGGCATTCACGCCGCATCCGGCACAGACAATCAAATATTACAGAACGATTAATCCACGTATTTCATCGCGACCCTTGAAGTCAGGCGCGTAATAAGTTCGTAAGCGCTCACTTTTGTCATTTCAGCGATACGTTCTACGGGCAAACCTTCGCCCCATAAAATGACCGGGTCCCCGGCTTTGTCCTGCGCCTGTGGACCTAAGTCTACGCAGATCATATCCATCGCGACGCGCCCGACAATCGGCACTTCGCGACCGTTCACCAGCACTGGCGTACCGGACGGCGCGGCGCGCGGATAACCATCGCCATAGCCCATCGCGACTACGCCAAGGCGGGTATCACGTTCGCTTACCCAGGTTCCACCATAACCGACAGGCTCTCCGGCTTTATGCTCACGCACAGCAATCAGGCTGGAGGTCAATGACATTACCGGCTGACAGCCAAAATCGGCACCGGTGGAACGATCTTCCAGCGGCGAGACGCCATAAAGAATGATGCCCGGGCGCACCCAGTCAAAATGCGACTGTGGCCACAGCAGAATGCCACCCGACGCGGCAATGGAACGTTGACCTGGTTTGCCTTCACAAAAGGTATTAAAGATAGCGAGTTGTTTCTCGGTTGCGCCGCATTTTGGTTCATCCGCGCGCGCAAAATGGCTGACGATATTCACCGGCTGACGAACGTTTTTGCACTGGGTCAGGCGATGATAAAACGCCTCAGCCTGTTCCGGCCTTACGCCCAGACGGTGCATACCGGTATCGAGTTTCATCCAGACGGTAACCGGCTCGTCCAGGCTAGCCTCTTCCAGCGCAGCCAGCTGTTCTTCGTTATGCACGGCGGTATGAAAATGTTGCGCAGAAATCGTCGGCAGATCTCTGGCATCAAAAAAGCCTTCGAGTAACAGTACAGGTTTGGTGATTCCCCCCGCACGCAGTCGCAGAGCTTCTTCGAGACGGGCTACGCCAAAGGCGTCAGCATCGGGGAGCGTTCGCGCGGTCTCAAGAAGACCGTGACCATAAGCGTTCGCTTTCACCACCGCAACCATTTTACTGGCAGGCGCCAGTTCACGAAGACGTTGCAGGTTGTGTCGCAGAGCGCGGCGGTTAATCACAACAGTTGCCGCTTGCATTTGTGTTCCTTGATAAGTGTTTGCTTTAATTACCTAATTCATAAAATAATTATTATTCGTCGTCGTACTGCGGCCCCGCATAGTTGTCGAAGCGCGACCATTGACCGTTAAAGGTCAGGCGTACCGTCCCGATTGGGCCGTTACGTTGTTTACCGATAATAATTTCCGCGATGCCTTTTAAATCACTGTTTTCGTGATACACCTCATCACGATAGATAAACATGATCAAGTCCGCATCCTGCTCGATAGAGCCAGATTCACGCAGGTCGGAGTTGACCGGGCGTTTGTCGGCACGTTGTTCCAGAGAACGGTTCAACTGGGACAGCGCCACCACCGGCACGTTCAGTTCTTTCGCCAGTGCTTTCAGCGAGCGGGAGATTTCTGCAATTTCCAGCGTACGGTTATCGGAAAGCGCCGGTACGCGCATCAGTTGCAGGTAGTCGATCATGATAAGCCCGATGCCGCCGTGTTCACGGGCAATACGGCGTGCGCGGGAACGCACTTCCGTTGGCGTCAAGCCGGAGGAATCATCGATATAGATATTACGTTTTTCGAGCAAAATACCCATGGTGCCGGAAATGCGTGCCCAGTCTTCATCATCGAGCTGACCGGTACGGATTTTAGTCTGGTCAACGCGCGACAGCGACGCCAGAGAACGCATCATGATCTGTTCTGATGGCATCTCCAGCGAGAAGATAAGCACCGGTTTATCCTGCAACATCGCCGCGTTTTCGACGAGGTTCATCGCAAATGTCGTTTTACCCATCGACGGACGCGCGGCAACGATGATCAAATCCGACGGCTGCAAGCCAGCGGTTTTTTTGTTGAGATCGTCATAACCGGTATTTACCCCGGTAACGCCATCGTGTGGCTGCTGAAACAACTGCTCAATACGCGCCACGGTTGCGTCGAGCACATCGGCGATGTTCTTCGGCCCTTCGTCTTTGTTTGCACGACTTTCGGCAATTTTAAAGACGCGGGATTCAGCAAGGTCGAGCAGATCTTCACTGGTACGCCCTTGTGGATCAAAACCAGCTTCGGCAATCTCGTTAGCAACGGAGATCATCTCACGAACAACGGCACGTTCGCGCACGATATCCGCATATGCACTGATGTTCGCCGCACTTGGCGTATTTTTTGACAGCTCTGCCAGATAAGCAAAACCACCGACGCTATCGAGTTGCCCCTGGCGTTCCAGCGATTCCGCAAGGGTAATCAGATCGATAGGACTACCGCTTTCCTGCAAACGCGCCATTTCAGTAAAGATATGACGGTGTGGGCGGGTGTAAAAATCATCTGCCACCACACGCTCAGCTACATCATCCCAGCGTTCGTTATCCAGCATTAAACCGCCCAACACCGACTGCTCCGCTTCGATCGAGTGCGGAGGCACTTTCAGCCCGGCAACTTGTGGATCGCGTTCGCGGGGTTCAGCCTGCTGTTTGTTGAAGGGTTTATTTCCTGCCATAGTGAATGGAGTTACCGAGATAAAGAATGGGTCGAAACTTTACCATATGAAGCAGACCCTGACGATACGTTCTGGAGGACACATGGCAACACGAATTGAATTTCACAAGCACGGTGGCCCGGAAGTACTTCAAGCCGTAGAGTTCACTCCTGCCGATCCGGCGGAGAATGAAATCCAGGTCGAAAATAAAGCCATCGGCATCAATTTTATCGACACGTATATCCGCAGCGGCCTTTACCCGCCGCCATCGCTACCCAGCGGATTAGGCACCGAAGCGGCAGGCATCGTGAGTAAAGTCGGCAGTGGTGTAAAGCATATTAAGGCAGGCGATCGCGTAGTCTATGCACAATCGGCATTAGGCGCTTACAGCTCTGTGCATAACATTAATGCGGATAAAGCGGCGATTCTGCCTGCGGCAATTTCTTTTGAGCAAGCTGCGGCATCCTTCCTGAAAGGCTTAACGGTTTATTATCTGCTGCGCAAAACCTATGAAATTAAACCCGATGAGCAGTTCCTGTTCCACGCAGCGGCTGGCGGCGTTGGCTTGATTGCCTGCCAGTGGGCAAAAGCCCTTGGCGCGAAACTTATCGGCACCGTAGGAACCGCGCAAAAAGCGCAGAGTGCGCTAAAAGCGGGCGCGTGGCAGGTTATTAACTATCGTGAAGAGAATCTGGTCGAGCGGTTAAAAGAGATCACCGGTGGCAAGAAAGTGCGCGTGGTGTACGATTCCGTGGGCAGAGACACCTGGGAACGGTCGCTGGATTGCCTGCAACGCCGCGGCTTAATGGTGAGTTTTGGCAACTCATCAGGTGCGGTTACCGGTGTGAACTTAGGCATTCTCAATCAGAAAGGCTCGTTGTATGTGACACGCCCTTCCCTGCAAGGCTATATCACCACGCGGGAGGAATTAACCGAGGCCAGTAATGAACTGTTCTCTTTGATTGCCAGCGGTGTGATTAAGGTCGATGTCGCCGAGCAGCAGAAATATCCGCTGAAGGATGCGCAGCGTGCGCATGAGATTCTGGAAAGCCGGGCGACGCAAGGTTCCAGCCTGTTGATTCCATAAAAGAAATAGGGCTTCCACCTGGGAAGCCCTTTCTTTTTATAGTTCGGCTGTATGTAGGGTACAGCACGATGAATCTGTTAGAGGCGCAATAGTGACAGATTTGATTATCAATTCCTATTTTGTTCTAAGGATAAAACCTTAGGTTGTGATCGTCCGCACAATCCCTTAGTAACGCCAACGGTCATAGCGCTGATATTTCGGCACTTTTGGTGCTTTAATCACCTTAATAACCCACACTACCGCAATCGCCAATAACAGCCACGGCAGCAACTTAATCATCAATGCCAGCATACCGCCGAGGAACATAATGGCCGTCGCCACAACCAGCGCAGCGATAATGCCCAGCAACGAAACGCCGGTGACCATCAGCATGACAAAAAAGCCAATCACAAAAAGTAGTTCCAGCATGATGCTCTCCCAAATATGAAATCTCTTGCAGGCATTACAAGAATCATGCCAAAAATAATCTATTGATTTAACAGCAAAACGCCCCGCGACGGTGCGCAGGGCGTGGTGAATTTGACTACTTTTTGGTGAAAAGTTAACGCTTATCCGCCACCAGTTTGAGCGCGTGTTCCAGCACATTAATGTCTGCACCCGCTTTATGGGCATTTTCACTTAAATAACGCCGCCACTGCCGCGCGCCAGGAATACCCTGGAACAAGCCCAGCATATGCCGGGTAATATGGCCGAGATACGTACCCTGGCTGAGTTCACGCTCAATGTACGGATACATGGCGCGCACTACCGCCACCGGATCGGCATCAATATCCGAGGAACCAAAAATCTCTCGGTCTACCGCCGCCAGAATACCCGGATTCTGATACGCCTCGCGCCCGACCATCACGCCATCCATATGTTGCAAATGCGCTTTAGCTTCTTCCAGCGACTTGATACCACCGTTAATCGACATCGTCAGATGCGGAAAGTCACGCTTCAGTTGATACACACGCGGATAATCGAGCGGCGGGATCTCACGGTTTTCTTTCGGACTTAACCCAGAAAGCCAGGCTTTACGTGCGTGGATGATGAACATCTCACACTCACCTTTGCCGGAAACGGTGTTGATGAAATCGCAGAGAAATTCATAGCTGTCCTGATCATCGATGCCAATACGCGTCTTCACCGTCACCGGAATCGACACCACATCGCGCATCGCTTTCACGCAGTCGGCAACCAGCTGCGCATTACCCATCAGACACGCACCAAACATGCCGTTCTGCACCCGGTCAGACGGGCAGCCGACATTCAGGTTGATCTCATCGTATCCACGCGCTTCTGCCAGCTTCGCACACTGTGCCAGCGCCGCCGGATCGCTACCCCCGAGTTGCAACGCTACCGGATGTTCTTCTTCACTGTACGCCAGGTAATCACCTTTACCGTGAATAATCGCCCCTGTGGTCACCATTTCGGTATACAGCAACGTATTGCGGGAAAGCAGACGCAGGAAATAACGGCAATGTCTGTCCGTCCAGTCAAGCATAGGAGCAATGCTAAACCGAGAATTCCAGTAAACACCAGTTTTTTCAGGCATCACGCTGGTTTGATTAATTTTTTGTGTTTCATGATTATCGTGCATTTTTGAACATTTCAGGCTATTTTTCTCGCGTTAGGTTCCCGCCAGGTTCCCACGTTTTATGGGAACCCGAAATAACGAGGTCGTGTAATGGCGTACTATAACATAGAGAAACGACTAAAATCCGATGGCACACCACGCTATCGCTGTAATGTGATTATCAAAGAAAAAGGTGTTATCACTTACAGGGAAAGCAAAACATTCCCTAAACATGCTCATGCCAAAACATGGGGCACACAGAAAGTGATGGAATTAGATCTATATGGCATTCCATCATCAAATGCAGTTGACGGACTTACAGTCCGTGACTTACTACACAAATATTTAAATGACCCAAATGCCGGAGGTAAAGCAGGCCGTACTAAAAGATATGTGCTGGAACTGCTTATGGATAGTGACATCTCCGCGATCAAACTATCTGAACTGACAGAAAATGACGTAATTGAACATTGCAGGCTAAGAAACAACGCTGGCGCAGGTCCAGCAACAGTCAGCCACGATGTTAGTTATCTTGGCAGTGTTCTGGATGCGGCCAAACCTGTATACGGAATTAATTACACATCAAACCCGGCGAAAAGTGCTCGTCCATATCTACTTAAACTTGGTTTGATTGGTAAATCAAACCGTCGTAATCGTAGACCAGCATCTGATGAACTGGACATGCTCATTGAAGGCCTTCAACAACGATCTACTCATAAATGCTCAAAAATTCCGTTCGTTGATATCCTCAAATTTTCTGTGTGGTCCTGTATGCGAATCGGAGAAGTATGCCGGTTACGATGGGAAGATCTCGACCAGGAACAAAAATCTATACTCGTAAGAGACAGGAAAGATCCACGCAAAAAGGAAGGCAACCACATGAAAGTAGCCTTGCTTGGGGAAGCCTGGGATATCGTCCAACGACAGCCCCAAAAATCGGAATTCATTTTTCCATATAACAGCACTTCTGTTACTGCGGGATTTCAGAGGGTAAGAAGCAAATTAGGTATTAAGGATCTGCGATACCATGATTTGCGTAGAGAAGGGGCAAGTCGCTTATTTGAGGCTGGTTTTAGTATTGAGGAAGTAGCCCAGGTTACAGGGCATCGTTCATTAAACGTGCTATGGCAGGTATATACCGAACTGTATCCGAAATCTTTACATAATCGTTTTGAAGAGCTCCAAAGGAGCAGAAATAAGACCCCTTGACACTGTTTATCCATACAGTTAAAAATAATACTGTATACAAACACAGTATAGAGGGACTTTTATGCGTATTGAAATCTGCATAGCCAAAGAAAAAATGACTAAAATGCCAACCGGTGCTGTGGATGCGTTAAAGGAAGAATTAACCCGACGCATCAGTAAACGTTATGACGATGTAGAGGTGATCGTAAAAGCCACCAGCAACGATGGCCTTTCTGTTACACGCACCGCAGATAAGGATTCTGCAAAAACTTTTGTTCAGGAGACTCTGAAAGATACCTGGGAATCTGCTGACGAGTGGTTTGTTCACTAATTAACACGTAAAATCGGTAACGGCTGGAAATCATTCAATACTCGCACTATCGAAAGTTCGCCAGCCAGCCGTGGCACGTTCTTGCATACGACGTGCTACGGTTTCATTTATCTCCGACTGGAAACTTCTTATACAAAGTCGATACGCCAACATCATAAATGATCGCCACCTTCTGGCGAGGAACTCCTGATGCAATTAGTCGCCCGGACTGCGCCCATTGTTCTGGTGAAAGTTTGGGACGGCGACCGCCAATTCGTCCCTGTGCGCGAGCAGCTTCCAGTCCTGCTTTTGTTCGTTCAACAATCAGTTCACGTTCCATTTCAGCCAGGGCACCCATCACATGAAAGAAAAAGCGCCCCATTGGGGTACTGGTATCAATTGAATCCGTCAGACTACGAAAGTTGATGCCTCGTTCGCGCAACTCCTCCACCAGCACGACAAGATGCCGCATACTGCGCCCCAGTCGGTCCAGTTTCCAGACCACCAGCGTATCACCTGCCGATAATGTCCTGAGCAGTTTTTTCAGTCCCGGCCTTTCGGACTTTGTACCGCTTATCTTGTCTTCAAAAATCAGCTCGCATCCTGCACAGTTCAGCGCATTACGTTGTAGATCTGTGTTCTGGTCATTTGTTGACACACGTACATAGCCAATAAGCATGGTAGATCTCCCTGACAAAAGCAGGAATGATGCCATTTGCTCGTTATTTCTGCATTTTCATAAACGTTGGTTTGGGAGAAGGCTCAGCGTTACCCGTTGGCGTACCTGTTCCGTGGCCTTCGGCCACACCGCCGACAGGCTGGTTGAAATGCAATGGCGCTGCCTTTGATAAGGTGAAATATCCCCATCTTGCTACAGCATATCCATCAGGGAAACTACCTGATCTCCGTGGTGAGTTTATTCGTGGATGGGATGACGGGCGTGGTATTGATGCAGGACGTGCTTTATTGAGCATTCAGACTGGGATGCTGGAAAAACACCGCCATATTGTTGTTGCCAACGATAGGTATGATTCAAAAGAGGAATGGGAACTGGCGACAATCTTCAGAAGAGCATATACGCAAGGCCGGGGGCTTGATGCTGCCGATGCCGGAGGAACTCTGATTCCATCACCAACGCTACATACACGAGGGAGTATTGGTAACACAGGTGGGAGCGAAACCCGTCCACGAAATATTGCATTTAACTATATCGTGAGAGCTGCATAATGGATAAAGCCGTATTAAATAGCGAACTTATTGCCACGAAGGCGGGGAATATTACCGTCTATAACTATGATGGTGAAACACGGGAATATATTTCCACTTCAAATGAATATCTTGCCGTTGGTGTCGGTATCCCGGCATATTCCTGTTTAGATGCTCCTGGCACATATAAGGCTGGTTATGCAATCTGCCGTTCAGTAGATTTAAACTCATGGGAATATATGCCAGACCATCGCGGTGAAATTATCTATAGCACCGAAACAGGAGAAGCAAAAGAAATCACAGTTCCGGGTGATTACCCTGAAAATACAACCACTATCGTCCCGTTAACGCCATATGATAAATGGGATGGTGAGAAATGGGTGACAGATTCTGAGGCACAACACGGTGCCGCAGTAGAAGCGGCAGAAGCACAGCGCCAGTCACTGATTGATGCTGCAATGGCTTCCATCAGTCTGATTCAGCTGAAATTGCAGGCCGCACGTAAACTGACGCAGGCAGAAACAACCCGACTTAACGCCGTGCTGGATTACATTGACGCGGTGACGGCAACAGATACCAGCACCGCGCCGGATGTCATCTGGCCTGAACTGCCGGAGGCGTAGGCCATTCAATATCTGGCGCACTGGAGGTATCAACCAGTTCCAGTGCGTCCAGATAATCCAGCCATAAATTATATTGCGCCAGTTCCTCACCTTTCAGACGACCAATAGCGGCTTTACCAGGCCATTGCCTGCTATTCATGTGCTCGTTGGCTTCATTAATAAGTTTTCCTTTTTTCAACTCTGCCAATGCAATAAGGTTTTCTTTTGATAAAGGTGGTTGCTCTGTCAAAACCGGATATCCCTCCTGATTGCTGACTATTTTCATGCCATTATCCTGACCATCCAGTAGTGACAGCCATTCATCCGTGGTTATCTCAACAGCATCTGAAGGTGCTTTATTTAAATCGGTAAAAAAACCATTTTCTTTTTGTGAATAGAAGTATCTGTCCATTTATCAATCTCCAAAAGCAATCCAGTAAGCAAAAGGATTTATTCCTTGCTCAGTCACAGATGACATCAGGGAAAATTGCGATGGTGAAACAGGTAATGCTGCAAAACTAACCATTGTTGACACACCTAACCGTGCATTATCGTACGATGCAACAACACAATAATTGGTATTGCTGAAAGATATCGGCAGGGTGATATTTACAGGTGAGCCTAATGGCCCTGATGCTGATATTCCCATTTGAATGATGGTTCCATCAGGCAATTTTCTCCAGCGATTAGAACTCGGATTTCTTTTCCAGGCTGACATATCCGGTATCTGATTTTCTCCTGTACCCACATCCCTTTTCGCCGCTTCTCCCAAACCAAGGTTTTCGAGAGCTGTTTTCACCGTGCCATCCGATTTGATATCGCCAAACGGATTCTTGCGGCTTAACAGCAGCGCACGAAGCGCGGTAAGCAACTGGTCGTGCCGCGCCTTCTCCAGGCTGGCACCGGATGCCTCCACCACGCTACAAAGTTCTTCCTGCAACATGTCAAAGTAGTCATCATCCAGATCGGTGGCAGGTGTGCCGGTCTGGGGGTTACCACGGGTAAAACCGTTCTTACCCGCGCCGAACTTATCCTTCTGCGCGGTTTTCGTGTCTATACGATGCATGGATTACTCCGGATATTTAAAAATTACGTAGGTATGCGAAGGGCAGAGTTTGTTAAGCACGCACTCGACAACGGTGTCGCCCCAGATACGCAGTGCGGAATCACAGGGATCGCCACATGTCATCCAGGTGGTGTTGGTGGCGGCTGGCATGTTGACCTGCCAGTAATACCGCCATTCCGGCGCATTCACCGCGTCAGTACAGGCCGATGAGCAGGTGAACGTACCTTTATCGTATCGCGTGATAGTGGCGTCTGGTCTGCCCAGGGCAGCAAGCTGTGCAAGGTAAAAATCCTCATTGATGCCGCCCGCCAGATTAACCTTCGCATCCAGCCGTTGCTGACGCTGGCGAAGGGTCTGTGTCCCTGCGGGAATACATTCATCCGGCAGACCGCACAGACGCTCCCAGCGGTTTATCAGTTCAGTGGTGGTGCGCGGATCCAGCTCCCGCATCAGGGCATCCGCACGCTGATGAACACGGGTTAATGACGGTGCCGCACCTGCAATCGCCGGATCGCTGGCTGACCACGCCGGACCGGGCGGCAGCAGTGCCGACAACAGACGGATGTAATCATCGTTTGTCACGTCCATGAAATCGTCCCCAGAACCGCCAGTTCATTTTTTGCAATGGAGATATTGTCTGCCGGTGCAAGCAACTGATGGCTGTATTCCCCGTTCGCACCGGAAATCGCCTCACTGATACGCGATACCTTCAGTTCTCCCTGCGGATAACCATCACGCAGCAGGAACGAACGCAACTCCGCGGTGATGGCAGCCCGTATTTCCGGTGTGTCCGGCGTCACACGGATATGAAAATCCACCGTATGTGCCACCGGCCTGAACACATACAAATCAGAGCCTGCCACCGGGGCCAGTGGCCCGATATGTTGTCTTGCCGCCGTTTCCGTTGATTCTTCCGGAATGGGATTAATCAGGTCACTGCTGGCAATCATCACACCGACAGTTCCCGTTCCCATCAAGTGACGGTATGTCCATGCGCGGGTAATGCCGGGCACTTCTTTAGCCCAGACGACATAGTCCCCGTCAGCCCCGCCCTGAGGCGTCCAGTAATACCGCTCAATGACGCGGGCGCGCCACGTTTCCAGCTCTTCAGTATCAAATCCGCCTGTCAGGGTGTCAGCCACACCGGAAGACGGCAGACCATTCACCGGCGTGACCAGGATTAATGCCGTACCGTCGTCAGCGTTACCGACCGCACCTGCACTTGAGCAGGCGATCGGCACGCGCAGGACACCACCGGAGCTGGTTGCATCGGCAGTTGCCGTGTACTGAACCAGGTCATCGCGCTGAATAACACTCCCGGCAGTCACCTTCAGGCCATCGCTGACACCTTCCCAGCGCATATACCCGCTGGCAGCCGTGGCCCCCTTGCGCGGACACCGTTTCATCGCAGCATGTCGCGCCAGCCAGGACTCATCGCACAGGTCAGGCAGCATGTTCATTGCCAGATAATCGATGTAACCGTAAACCGTATGCAGCGCCGCCGCATACACCTTTGCCCGCACGTCTTCATCCATGCGCCGGAGCGTGTCGCTGACGTCCAGCCTGGCGAATAAATCGTTACGGAGCATACTGATATTTTCTGCCAGCGTCGGGCGCTGAAATTCACTGTCCGCCATGCGTTATCGCACTCCACAGATCATCAAAAGAAATCATTACCGGTCCGTCACGACGCCAGAGAGTGATACTGTTACCCAGTTCATTAATCCCGGTGCGGCGGATATCCAGATCAATACGGGACACCACGCCGTCATCAATCATCCATTGCAGGCATTCGCGGATATACCCCCTTACCGTCTGCACCAGCTGATTGGTCAGTTTGCTGCGCTGAAGCAGCCACAGTCGGGAGCCGTAACGGTCATTCTGTACCGCAGGCCAGGTATCCCCCCACCATCCCATCGGGACGTCGGCATTGTCATCAGGCTCCGCCCGCCGCCAGGTAAACAGGGAAATCACCACGGCGCGGGTCAGCGGATCCAGCGGTGCGCTGGCGCAGGTGCGTTTACCGTTCACCGTCAGCCACAGTTCCATCATGCCTCCATCGCTTTATCAGGTTTGTCGGTGTTACTGCCCTGACCGTTCTCTCTGTGACGATGCCCGTTATAGGCAAGCCGCATCGCTGACATGGTGGTGCCGCCGGAGTCGCACAGGTCTTTCACCTGTCCTGTCACTTCCAGGTCCATTTCAAAACGTGCTTTAGGTGAATTGCGAAACGTGATCGTTTTACCAGCACCGTCCACCACGATCCCCTCCCGGGTCAGCGTCACGGACTGCCCCTGATCGTCATAGACAGCCACCTCACCCGTCTGCAGCCCTTTCAGGCGGTAGCGCCGGTCCGACACCGTAACAACCACCGCATGAGAACGGTCGCCATCCGGAAACAACACCACCGCTTCCGCACCGCTGTTTGCCCTTGCGGTAAAACCGTAGGGTTCAAGATGTTCAACCCCGGCTTTGGGTTCACCGGCAATCAGGGACACATCCACGGTCTGACATTTCGTGGCGGCACTGATGCTTTTCACCACTGCCCGCCCAATCAGGCCGAGGAGTTGTCGCTGCATGGCTTCAATCGTACTCATCAGAACGGGTCCTCCTGTACTCTGGCTTTTTTCTTTTTCCGCGCGCCGGGGGCTTCGGGTTCAGGCAGATAAGCATCAGGCGGGCCGACACGGATTTCCGTCAGGGTGCCGTTCTGGTCCTGAGTAAACGTGACTTCCGAGACAAGCAGTTCGGTATTGTCGAAACCACAGACCGGATCGAAGACAATCACCCGCTGGTTGGGCTGCCACAGCGTACCGTTACCCTGTCGCCAGCCCTGCACCACATAGGTGGTTTCATCCGTCCGCGCCGCCCGTTGTCGGGCTTCAAAGTCCGCACGGGCAATACAGCCTGCCCCCGTAGCCTGCCCTGTCTGCCTGATATACATCGGACGGTAACGGGCAATAAATGCGTCCTCTGTGCGGGCCCGCAGCGCGGTGGTGGTGGCCTCACCGAAATCATCGTCGTTTCCGGCACGCTGCCCCGCCACCTGGTAAACAGAAAACCGCTCCCGGATACTCTTCTCCGTATCGCAGGAAAGGATGTTTTCCCCGAGTACCAGCGCAGTATGTGCCCGCGTTGAGCCAATACCGCCAATCACCAGCCTGCCGTGCGGGTCGTCGTAAGCCAGTGCCTGCTGCTGACCGAGTATTTTGTTGATTACCTCAATCACCGTTTCACCGTGATCAGGCTGGACATCAGGAATAACACCCGACGGCGCACCGTTGTTCACCACCTCAATGCCGAAAGGCGCAGCAAGCGCCTGCGCAATCTGTACCAGCGATCGTCCATTAAACTGTGTCGGTTCGGCTGCACAGTCAATCAGGTCAGCGGTCAGACTGCGTCCGGCAATACCGGTGCTGACCGAACGGGCATCGTAACGAACGGGCGTCGCCTCCACCCAGCCGGTGATCACCAGCTCATCACCAATCAGCACCTCCACTTTTGAACCGTTTTTAATGCGCGGCTGAAGCGTGGTGATACCCTCATCTCCCGGCCACTGGCGGGTGATCTCCACACTGAAATCCCGCGCCAGCCGTTCAATACCGGCACCGATGCGCACCGATGTCCAGCCATTCCACTCCCGGCCATTTACCCGTAGCGTGACATTGTCGTTCATTGCACTGGCACCTTCAGAGGGATCACCGGCACAAAGCCGGGATGCGTAATGGCATTACGCCGGATAATGTCCGCGTCACGCGCCGCGTTATCAAACCAGGTCGCCGCCAGCACCAGCGCGGGTAAAACCTCATCCGGTGTGCGCTGAATGATCCGTGCAGACTGTTCAAGGCGCGTGTTGATATCCGCATTCAGATCTGCTTTCACCCGGCGCAGCGCCAGAAACAGCGCATCGCTGGTTGTACGGGACAACTCCTTATCAATTGCCGTATTCAGTGTGTCGCGAATGTCAGTCAGTTCTTCCCACGTCGGCAGGTCAACCGTGTTTTTCACCGCCGGTGCATTGTTCAGTGCCGGATGCGTGACGGAAGGCCAGCCAGTGCTCTGCGCGGGTGTTGTTGCCTGCCCCACTGCGGAATTCTGCATCACCGCGGAAGTTGTTGGCGCAGGCAATCGGGTGACGGCATACGCCGCTTCGCTGATTGCGGTCGTACGAAGGGTGCTGGCAACCACGTTACGCTGCTGCGTCGCCGTGGCGGTGGTTTTACTGTCCGTTTTCCAGACGCCGCGCGGTTGCAGATCGCTGCCGAGGCTGACACCGGAAAGCGTTTTGATCATGGTGACCAGGTCGCTGGCGTTACCATAAAGGCGTTTCCCGGTACGCCACATTTTCTGCACCTGCTCAACGAAATTTTTGCCTGACGATGGCGGCGGCAGAAGTACCGAGATATCCCCCTGCAACAGCCTGGCGGCATCCGATACGGCAGAATCCACCACTTTCATCGCATCAGAAACATACCCCAGCATTATGCTGGCATTACCGATAACGTCGTTCTGCACGAAATCCGCCACACCATCGATACTGAAACCGCTGAAGCTGTCACTGATGCAGTCATCCAGTGCAGAACAGGATGACATCAGCGTCTGCGCCGTCGCCGCACCTGATGTGGGGTAAGAGAGTTCTCCTGCTTCGACAAACTTCAGGTCAAAGCGGACAATACGCCCTTCACTTTTCGATGTGCTGACCCGAACTTCCCCGTCAACACAGACTTTCAGCTCACCATATGTCGGGTGGACAAGCGTGCCGGGACCGGGTTTATTCAGCGCGTCAATCAGGCGATCGCGCTGGTCAAAGCAGTCATCTCCCACCACATAAGCTGTGATGGACGGGCGGAAAGTGACTTTTCCCAGATCTTCGGTATAGGGCTTGTCGCGGTTCGGGTATTCATGTGTTTCCACACGGCGACCGGTTCCCGCACTTTCTTCTTCAACCTTAAACGGTACGCCGCGAAATGACGCATCCTGAAGCCTGTCTTTCCACGTCATATAAACTCCGGATACAAAAAACCCGCCAAATCTGCTTTGTCAGTTATTTACATCGCAGAAGATGTGGCGGGAACCTAATATTTTTAATTACTATCTGAGTTGAACATCAATGGAATAAATATCACCACTCTTTATAAATTTAGAATCTGTCCTTTCATCAAAAGATTCAAATGACTGTACCTTTAAAAACTTTTTCATTTTATTTTCAAAAATACTTTCATTAACACCAGTTAAATATTTGAACGCTCTACCAGCAAGGACCTCATTACTTAAATCCATTGTGTTTTTATTGTCTTTGAAAAACCAAACAATAACCTTTTGTGGGCATGATGGATTATAAACAGATATATAAAACTGCGGCTCATATTTTTCATCAGCGTCATCACTAAGCATTTCTTCAGAAGATAATTCTCTTCTGAATTCATATTGCCGCTTAGTTATTCCTTCATCCTTTATTATCTCTTGCTTAACTGGTGCAATACCTATAGAAGAGATTAATTCTGACTCATTAAAGCTGAACTTACACTCTTCCGCAGCCAAGTTAAAAGATAAAAGTGCAGATATAAAAAAAACAAAGATACGCATAATCATCCCTTCAATCATTTGTAAGGAATGATTATATTAACTACTTAAAGCTGAAAACCCAAATTATGCCAGACAAAAACACATTAATCATTTTGTACACTACCTGAACCGCGTATAGCCAACATCATGGCTGACATCAAAACCGCTGGATCGCGTTTCCATAACCCGCATACCCGGAGGCGAATTCACAAAAGATACCTTGATCTCACCATCAACTTTTGGCACAGAAGCTTTGTTAATCATGAAGGGATTCGAGCCTGTGGCATCGGAGGCGTTGTTTGACTGAGCCGGATCCACCGCCGGATAAGGTGTGTATCCCCGCGCCGGTATTCCCGTCCCATAAGCATCATAAGCACCCGCGCCCCACTGCGCAGAGTTAATGGCATCGACCGTGTCACCGGAACTGTCGGTAAACCACTCAATAATTGGCTTCAGCTTGTCCCACATATCCTGAAACCACTTAACAACCGGTCCCCAGTTATTGATCACCATCCCCAGCGGCGACCAGGCAAAAACCTTCTTCAGAAGTTCCCAACCTGCCTCAAAATAAGGACCAATGGTTTCCCAGAGCTTCTTGAAATAAGGTCCGACAACATCCCAGTTAGTGATAATTAATCCCGCAGCCAGAGCAATCGCCGTCGCAATCATGCCAATCGGCGTCATCGACATAATCCTGCTGACAATACTGATGGCACTGCCCACGCCCATCAATCCCAGTTTCAGAATCGCAAGACCGGCAGCAAGCCCGACGACGCCGCGAATAACCCGGGGATTTTCATCCGCAAACTTCGTGAATTTCTCCCCCAACTCCCCCAGCCATTGTGTGATATTTTTAGCGTCACCAGAAAATGCGCCGCCAATAGCCGCAAGGCCGTTAGTTGCGGTCCCTGTCATTGCCTCCCACAGGTTGGACAGCGTACCAAGCTGTGCCTGAACACGTTTATTCAGGCTGGCCTGTTTATTCATCTTCTGCTGGATCTGATCGTAGCCATCCTTTCCTTTATCGATTAGTGCATTGACCACCTGAAGGGTTTCGGCATCATCACCAAATATTGCCTTAAGTACACCTGTTCGCTTAACGTCGGTCAGTTTTCGCAGCTTTGCCAGTTGCCTGAACATGTTATCAAGACCGCCAAAACTTCCTTTGCCGTCAGTAAAATCGAGCTGTACCCCGAGTTTCTGGCGGGCCATAACTTTATTAACGTCCCTGATTTTCTTAACGCTTAATCCGGACTGGATAACTTTTCGCAGGGCATTACCTGCCGACTCCCCGTTCATCCCCATCTGATCCATCATGACGCTGATGGGGGCAAGGCTCTGTGCAGCCTGAAGACCATCCTTGTTCACCATCTTCAGAACAGAACTGGTTTTAGTGAAGAAGGACAACATGTTGGTATCGTCAACGCCCAGATAAAACGCCTTCTGGATAGTGTCGAACAGCCCCATCATGTCTTCTGACGCCGTTCCGGTAGCATCCTGCATCTTTGCAGCAAACTCAGCAGCCGCTTCCGGTGTTTTTTTCAGTTGTACCGCAAGATAAGCTGTCGCTTTACCCACACCACCCAGAATGTTTTCTGCCGGGATCCCCTGACGCACCAGCATCTGCATCATGTTCTGGAAATCAGCCGTTGTACCAGGTAGCTGGTTACCCAGGCCAATAGCCAGTTTATTGATGTCCTGAAAGCTCTTTCCAACCTCGCCGTTCGCATCCATCATGGCGACTTTCAGCCCGGTGGCGGCGTTTTCCTGATCGGCATAAGATTTCAGGGAAAGCGTCAGACCCGCTGCCAGTCCGCCCCCAAGCGCCAGCCCACCCTGTGACGCTTCTTCCGCCTGGCGTTTAAATCCCCGGATTTTCTTTTGCATTTTCGACAGCGCGGGAGAAAGCCTGTCGACACCGGTGATCAACGCCTTAAGCTCAAATTCAGCCATGTGTGCGTTTCTCCTGCTCTATCCTGTTTGCCTGACTGACCAGTAAGGGAATTTCACTGATCGGCATATTCAGCAATTCGAAAGGATTAATGCGCCAGTAGCTGGCGCAGTCAAAGAAGCGATCAGTGAGGTATTCAGCCGTCAGGCCTGGAGGAAAAAACCAGCCACAAGCCACGCCGCCGCATTCAGGTCTGCCGGAGACATCTGGTCGACAGAGCTTTGCGGCACTTTCGCCAGCCGCACAATGTATTTCGACACCACATGCGCCAGAAGTTTGACTGACTCATCCTGATTCATCTGGTAGGGATACCCCAGCTCGCGGACATCCTTCCCGGTGGGCTCATCAAACTCCAGTACGGAGAGTGTCTCGCCATGAGCAGTAATCGGTTTCTTTAACTCAAGCTCTTTCATTACTGGTAATCCCCTTCTTCACCGTGGAACTCAAGATCAACCGTGCCTTCTTCGGCATTATGGTTCGCTTCGCCGTGCAGCCAGGCAGACGACAGTACATAGACCTGACCGTTCGCCAGCTCGGCAGTGATAGTCATCTCATCAGACGAGGTGATTTTGTTCACCGGAAAATTCTTCGGCACCTTGAAAGTCCCTTTGACATAAGGCGCACGGTGAGTTTCCTTGCGGTCCACTGAACCGTCCAGGCCGATGATGTCATCATTGACCGTCCTGTTCATGGGCACCTCAATGCCGCCGGTCAGCGATAGCTGTTGACCGTCAATTTTGAAATAACAGGTTCCCCCGATACGGGCCATTATGCAGACTCCTCTGAATACTGAAGACGGAACTGGTTAACCACGGCAAAGACACGCAACTGGTTAACATAGTCAGGCGGGAACAGCGTGTTCAGGCGGTTCGGATCGCTGGCATCACGCTCCACAACCAGGTACTGCTTAAACAGTTCGTAGTTTTCCACGATCCCCGCACGCTCAAGCTGACGGTAGGTTGCCAGCAGTTCCCCTTTGATCACCGCCGGTGTGACAATCGCCTGACCGGGACCAAAGCGGGTACCGTCGCTGGCAAGCTTGTGACGCCCGTACTTACTGGTAATGACGGATTTCAGTTTGCGCAGCACATACGCGCTGGTATGCAGCGTCTCGCTGTCGAGGTAGCTGTTATCCGCAACACCGTAAGCGTTTTTCCTGTACGTGGTGACATCACGCTGAATGCGCAGCACCCCGCTTTCGACATACGCCGTTGCCACGCCATAAGACAGCAGGGTCTGTTGTTCGGTCATCGTGAACCGTTTCCCCTTCGGCGCAGGCAGCATACCCACCAGCTCACCGGTCTGCGTGGGACGTGCCGGATCGTTGCGAATAAACACCGCTGCGCGGGCGGTACGGCTTGCCGCCAGCTCGTCGGCAGGCGTCTGGGTCTCTTTTTCGTATCCCGCCAGGGTAATGTGCTGCTGGTTAAACTGGTCACCTGCGGTCACCAGTTCTGACAGCGTGCCGATCTTTGCCGTATACACATGACCATACAGCTGACGCGCATAGCTCCAGCGACCGCTGGTATCGTTCATCTCGGTCACCAGCGTGTTAACGGAGGCTGTGTCGTTGAACGGCAGGCCGATATAATCAAACGGCTCATCCGCCATTGCAGCCACCGCGCCGGTGAGAACAGGAGCGCCCGTTCCGGCGGTACCCGTCGCCACGGCAATCTGTACGCCCGCTGGCAGCACTTCGCCCCCACCAAAGCCGTAGTAATTGAGGCTGACAGGAATTTCATTCCCGCAAAGCCCCTTATGACGCGCGGTCAGTGTGACCACGCCTGCCGAAGATGAGGCAGTAAACGGCAGGGTCGGAACGGCATTGATGGCATCTTTGATACTGCTGGCAATCGTCGCGACGTTATCGCCGTTGGTCACCGGTGCCTGCACGCGGGTACGTCCCACATAAACATTCACCGTGCCGGTTTCGGTTGCCGCGCCGGTCACCGTCAGCGTAACCGTTGCCGCCGCGCCCGTGGCTTCCGGAACGGCAATCACATACAGTTCACCAAACGGGTCGGTCTGGCGATAAGCCTCGACCATACGCGCCAGCTGACTTCCCGCACCACAAATCTGGCGTGCATAGTCTGCCGATGGCATCAGCACCAGACTGTTGGCAACAATCTCTGCACCGTTATTGGCATGACCAATCAGCAGCGATGCTCCGCTGTCCTGTGCAGTATTCGCCGCCGAGTTATCCATTTCCGCATAAAAAATCGGAACCAGCGTATTCGACGGAATGGTGTTAAAGCTTATCGTCATCGGTATTCACCTTTTTATTCACGCGCCGGATATCACCCGCTGCTTCACGGCGCAGCCAGTAGTTGTTCTCATCAACATTTCGCCCCTCGGCGGGCAAAAGGTCGCCGCGGGCAGGGTCAGGAACTGACCGCCCTTTAACAGGTTTCACAAACATGAAGATTCTCAGGAAGGAAGGGTTATTTCGGTGTGATGTTCGATATCGCCGTCAGGCCCGTTACCGGGATCGAGATAATCAACATCAATCGCCAGCGTTCGCAGTTCATCCAGACTGTTCAGGTCATCCTGCTGGCGGGTATCGTCTTCGGTCAGCTCGCTGATGACCGAAAAATCGAACTGATAAATCAGCTCATGACGATTCAGATCCAGCAGCGTGCCGCCGTCATAGGTAATCGGGTTACCGCACGCTTCCGGGTTCCAGCCCAGCAGAGCCTTAAAGAGCATCTGCCGGACATCGTCCACCACATCATACGAGGCAAACTGACCGCGCTCATCACGCCCGTTACTCAGTATGACAACCACGGAGAAGCCCTCTTTCAGCTCCTGCCAGTAGTCGGTCTGGCTTTTGTTTTCTCCCGGAGAGTCATCACCCGGTACCACATACGCCGCCGGGAGTCTCAGCTTTCCGACCTCCGGCAGATTTTTGAACTGTGCCGCGCCTGCCACCCGTTTTTCAAAATACGGGCAGCGGGCACGCAGCGCAGCAATAACAGGCGTCAGTTTCATCTGTGTCGTCGCTCCGGCTTCAGTGATTTACGCAATTCCCGCGCCAGAAAATAGCGTGTCCAGCTGCGGTTCTTTTCAAGAGTTTCCACCATAAAGTTATTACGTGGAGCCAGCCGCCAGCCGCTGCCACCGGATGCACCACGATGATGACTACGACGACGTTTTGCTCCTCCCCGGACACCAAAAAACAGAAACGCCGGATAGAAGTCACCAGAGATCATCCGGTTCCCCTTCCCGTTGCGCTGGTTAGGGGCAATGCGTGTCATAAAACCGGCTCGCTTTTTACTGGCTCTCGGCACCATGTAACCAATCGAACGAGCCAGGCGTCCGGTCTGATAACCGGGGTTTTCACCCGGTGCCGACCGCGCACGGCGCATCACCAGCCGACGGGCATCACGCATATGACGCTGCCCAATCGTGACAAACGCCCTCCGGACACGGGCGCGGTTAAAGCGCATCTCCGCGGGCTGCTGAACATCAACGTGAAAAAAGGGAGTCGCCATTGCTGCCTCCGTGACTCTGCGTAAATTCGCCCAGTTCCGTACACTCCAGCAGCAGAAAGCGCCGCGCCCCGTTCAGATCGCGCTGACGTTTCACCCGGTACACACTGTCATCACAGACCACCTCATAATCAGCAGTGATCCCCCGGCGGTAGCGAATGGTGATGTAATGGGTGATGGCATCTCCGGTCTGCGCGGTTTCCTGCCAGGTGGTGGCACTGGTCTGGATAACCTTCGCCCATGCCCGGAACGCAACCGGGTATTGAGGCTCCACGCCAAAGTTATCCGCGGGCATATCCACCCGCTGGCGGATCAGGACGCGTTTATTCAGTTCGCCGGGGTCCGGCAGAATGTAGGTTGCGCTGGTCTGCGCCTGACGAATTTTCATAGTGGTATAAGGCGATAAGGAGCAACCAACCAGTTAAAACTCATTGGCAACTCCATTTTCTCAACGTCTGTAACCGTTGAGCGGTTTTCGTAGAAATGGCTGACAAGTAGCAGGAGCGCAAGCTTCACATCATCAGATATCACAAGCCCATCAGAATCATCCGCAGGCCTGTCATCTGCGGTTGCATACAACGTACGGTTAAGGAAGTTTTCCGTCCGACTCTGAGCGGCCTTCCCAAGCAGTTCAAGCAACTCATCTTCATCAGAGAAATCATCATCCAGACGAAGCTGAAGCTTAATCTCTTCCATTTTTAACAGCATAAAACCTCCTGTGCCCGCCAGAACGCGGGCACAAAAAAACCGCATTACGCGGCGTGCTGTATTACGTAAAAAGACTAATCAACCACCAACGCTACCTTTCCCCACCAGCGCTTTAATGGCAGAGGTGTCTTCCAGGATACAGTCAAAACGATGGAAGGCCAGAAAACCGGTCTGATCATATTCCGCGTAACGCTCAACCAGACGTTTAAGAATCATGTATCGCACACGACGGATAATGAAGCGATCAAAGTCACCACAGAACATGAATTTTTTACCCGCCCCGATATCATCAATTTCCTGATCAATGACATACGGTACATTCAACACTGAAGCAGGTGCCACACCAACAATATCCGGCAACCATAAAGGGCGTCCCTGACCGTCTTCCATCTCACTGATCAGTTTCAGCGTATTATCGTTAAACGCCAGGCGGAATTTCGGTCCGCGACGATATGCAGGATCAATGCTGTGTTTCAGAGCCAGAATTTCCTGCCATTTCACCGCATTTGCCGCGGCAGTCTGTGTTGTGCCGGTCACTGATGCTGCCAGCCCTTTGGGTTGTTTAGGCGTACCAGCACCAGTTCCCTGAATCAGATAACGGGCTTCACCACGACCAATACGTTCAGCAATGCGACGGGCAAGATAAGCTTCCATATCGATCGCGCTGTCCTGCAACAACTCATTAGACACACGAATGATTTTCGATGTCATTTTGAGCGCCCCAAGGCTTCCCATACCGAAATCGGTGTCTTCTTCACCGGCTTCTTCATTTTCGCCCAGCAGAACACCAACTTCGGAAGTACCATCAGCTGTTGCCCACTCCATAGTGCGACCGTCAGAAGTGGTCAGAATCTGCGCCACACTGGCGATGCCACCGTAGGATTTCATCTTCTCAACAACTTTCGCCAGGAATGTTTCTGGTACGGTATATCCGCCCTTTTCATCCTGAGCTACACCCTGAGCACGAAGTTCACGCAACGCCTTTCGTTCTTCTGATGTCAGCTCACTGGCACCGTGACGCATCCACTTATCAAAAACCTGAGCTCGTTTCTCATCCTGTTGCGGATTGTTTTCCGGATCAAGATTCTGACGCTGCTCTTCCTCATTGCTTTCAATGTACGCCTGATCCTGACGACGCAGTTCTTCTTCGCGTGCAATTCGTTCATCAAGCGCTTCCAGTTCGGATTTTGCTTTGTTCCACTCAGTGCGCTGCTCTTCCGTCCATGCGTTATCACCAATTTTTTCATTCAGGGCGCGCATGTCAGTTGCGATAGTATTACGTTTCTGTTTCAGTTCATGCAGTTTCATGATGTTTCCTTTACGCGTTAAGAAGGGTCAGGACGCGTTCACGCGCCATACGTTGATTAATGGCTTTCTGTAGCGCGCCGCTGTTGCGCGCCTCCTGCCATGCTTTCATGGAGCGAACAGCCGAGTCAGCCTCCTGATAGGCAGGATATGTCACAGGACTGACATCCAGCAGACGGGAAAAGCGGGTTATCTCGCGAATAACAACCCCATCCTCATCCTGATACCACTCCTCACCGTCACGGGCGACACGGAAAGCGAAAGATGACTGGTTAATATCTCCACGTTGCATCGGGGCCAGCACCAGATCACGAATGGTCTGTGTCTCCGGAGCCTGGATGTCATAGCGTAATCCGCGCTCATCAACTGAAAGATTCAGCGTGCCTGCTGCACTACGCCCAAGAATAAAATTAGGATCGTGATTAAACAGTGCGCGTACATCATCACCAAGCACATCGTCAAAAGCGCCGGGCCGGATGATTTCGCGGAATGAACCGAATATCAGCTCAGAACGACAGTCAAACACCGATCCATAACCGATAATGTGCGCCGGGTTATCGTCATGCCTCTCAGCACGCACCTCACCGCTGTAACAACGGATTTCACGGTCATTCATTGGTTTTTCCCTCATCATTTTTTGGGGGCTTAAAATCTCCTGCCGGGTTAGCAGCATTCACGCTTACCAGCATCTCATCCAGCCCTTCAACCGGATTCATATCCTCGAATGCGCGGGCCTCATTACGGCTCATCCATCCATCGGTAATAGCGAAGTGATAGAATTGCGCGCGCTCCTGCGGAGTTCCGCGTAAAAGCCCCGTCAGATTGAACCTGACGTAATACCCGGCGGCTAACTCAGCGCGGGTAAACAAGCGACGGTTAAGCTCCTGCTCCCAGTTCGTCACCCACGGCATCATCGTGTAGCGGACAAACTGAATCGCCTGCGCAGAAATATTGGAGAAGGTGGCTTTTTCGAGGTCATTAATCATGTGCGCAGGAATATTGAAAATACCGGCGATCATTGAACGGTTCAGCTTCATCATGTCAATGATCTGAGCGTCAACTGGCGACACAGTCAGTGCCTTGTAATCCAGATCGGCTGGCAGCAGCATGGTTTTGTTTTCCTGGCGGCGTAACGCCTGCGATGCCTTCTGCCACTGATCTTTAAGCCAGCCCCAGCTTTCCTTATTGAGTCCGCTTTTAACGGATACTATCCCCGCCGGACGGGCATTACCGCTGAAGAAGCTTTCTGTGTACTTCTGACCGCTCATCCCCATGCCTATTGTTTCGGCATGTTGCATAATCGGACTCAGCCCCATCTTCTGATTATTACCCAGCGCACGGATGTGGATCATATCGTCCGGACTGATCGCAAACGCCCCATATTCGTTGTACAAACCGTAGGTATATCGGCCACCAGTATTCATCAGCGTCGTTTCCCACGGCATACAGCAATCCAGGGATATGACTTCACCGCGACGATTACGTTTCACCCAGGTATACCCATTCCCCCAGCCAAGGATGTGACGTTGCTTCAGTTCGCGCCATTTGTAGCTGGTTTGCCAGGTATTGGGCTCATCATGAACCAGATAAAACGCCGGATGATCGCGTGCGGGTTCAACCTTCCCCTTGTGCCTGCGCATAACATGCAACGGCATCTGGGCAAGGCTGGAAGACAGGACATAGATACAGGAATACACCGCAGCCAGTTTCATCGCAGTCTCAGGACTGACATAAACGTCTGCCCGGAACAGCCCATCAGTATCAACGGCATCACCGGTTATCGGGGTGGAAGGATTCTCCAGTGATTTACTTCTGAACAGAGCATCAAGCAGCACGCGTCCCCCTTCTGGCCATAGCCAGTGCGCCCACCAGCAGTAAAGCACCGGACAAAATCAGAGCCGGAGCCATACCAAACTGCAGGTAAACCCCGCACGTAAGCAGGCCAAAACCAGCCAGCCCGATAACATCAGCAATTAGTGATTTCATAGAATTAAGAGATCATCGTCCGGATCAAGAGATGAGAGGAAATCGTCAGGTTCTTTGAGCATTGCCCGACCGATCGTCATAATCAGTGCAACCGCACCATCGATTTTGTTTTCCGCCTGCTCCTTGACGGGCTTCACTAAATCATCGTTACCAGGCATGTTTTTGCCGACCACATTGCCGATACACCAGGTCATGATGGGATTGCCGTCATGATGAAAGCGCCCCGATTCAATCGCTGCTTCCAGCTCTTTCATCGGGTCGGACATATTGGCGAAGTTCTGGACGATAGTAACGGGATTCAGGTCTTCATCAGCAAGGTCATGTGACAGCCCGGTCGCCCCGAAAGGGTCGATGGGTGACTCACTGACCGGGCTGATTTTGTTCGCCGCTTTGGCCTCTTCGAGGATGTAGCGATAATCCACCTCTGCACCATCGGTAACGGTCAGAACGCCCATTTCCACCCATTTCTGAAAGCGTTCGGCTGTCCGGCGATCTTCATTTTTCTCGACGCTGTACACCGTGTCATACGGTACCCAGAAACGCGGGGCCACACTGTAGTAATGCGTTTTACCGTCAATCTCGCGGGTATAAAGTCGCGCCATGCTGTTCATATCCAGCTTACGCGCCAGGTCAAAGGCCAGAATGCACGGCTGCCCCTCGAACTGCTCAAGAGTCAGTGATTTATCCTCGCAGCTCTGCCAGCTCACCAGGTTGAAATACGCCGAACGCGCCGACACCCAGATATTGAGGTGTTTTGTTTTAAAGACGTTTGCCAGACGGGCGTTATTTTTCGCACGTTGCTGCTGGCTTAACAAAAACTCGCGATAAACCGACACACCGATATTCGGGTTAGCTTTTTCAAGTACCTGCGGGTCGGTCCAGTCGTCGCCTTCGTCAACGGTATAGATGATCCCGAACAGTTCATCGTTGGGCACCGAGCCGTTGAGCATCTCGATGACTTCCCGCCGTTTGTCGTAGCACGGCCCCTCAATGTTGTACCCGGCGGTGGTAATGGCCCACATCAGTGGCTGGCGTCGCGCCCCCATCCCGGTAAGCATCGTGGTGTAAAGCGCATCTGTGGCGTGCTCGTGATATTCATCCACCACGGCACAGTGGGGTGATGAACCATCACCGGGGTTACCGATCAGCGGTTCAAACCGCGCACCATCCTCCGGACGGTTCATGTTTGAGGCGTTAACCTCAATCCCGAACGCTTCCGTCAGCATGGGTGTGCGTTTACACATCAGTCGTGCCGGACGAAAGACTTCCCATGCCTGTTTCTCCGTCGTGGCACCGGAATACACTTCCGCGCCGAACTCGTTATCACAGGCAAAACAATACAGGGCGACACCGGCAGAGATTGCCGATTTGCCGTTCTTACGGGGGATTTCGGTATACACCTCCCGGAAGCGGCGCAGCCGGGAGCCTTTATTGACCCAGCCAAACGCGCAGCAGATCACAAAGAGCTGCCACGGCTCCAGCGTGATGGGCATCCTCTTGAATGCCCACTCACCCTTGGTGTGCGGCAACAGCTGAATAAATTTGGCGGCCCGTTCAGCCAGGTCCTTGTCGAAGCGGTAACGAAACGACTTACTTTTTTCCGCCATCAGGTCATCAAGATGGCGCTGGCAGGCCTGAATCACAAACTGGCAGGCCACAATCTTTCCGCGCACGACATCACGGGCATACTGATTGGCAGCATTTACGTTGGGGTAAGATTTCCGGCTCATGATTCGATGATTTTCAGATTGTCAGAAACGGGTTAGTGGCTTTCTTCTTCCCCGCCAGGCCAATCAGACGCTGGCGGCTGCTGGGGTCGAGTCCGAGCATTGCCCCCGTGCTGCTCATCTCGGACTCCTGTTCTTTCTTGGCGGTCAGCTCCGGATTTTTGACCCTGCCGCCCATTGCACCGGTGATGGTGTTGCCCTGGCTGGCAATATTTTTCACGGCACGTCGCCAGAACTCATAGGCTACGCACCACCGCTCAAGCACCGCGAGGTCAGTCACGCACAGCAGGCCCTGACCGCAGAGTTCTTTGGTTGTCAGTTGCCACATGATCGTGGCGAGAGGGAGATTTTCTTCTGCGAACCACTCCGGTGGCTCAACACCTTTGATGGGCGTAAAAACAGGTTCATCTTTATTCAGGGCTCGCTTGCCGGGGTTTCCGGCCAGCGCCTTGCGCGCCGTTGGCTTGGGGCGACGCCCGGAACGCCCCGCCGTTCCAGCCATATGCGGCACTCCTGGTTAAATTTCATTTTTCGCGGGTATAAAAAAACGATGGGGCGGGCAGTCCGGAAGACGTCAGGTCACAGAGATTTGACCCGCCCCTCCCCTCAGACAGTTGAGAATTATTATCACTTAAGTCGTTCACGGGCCGTCTTCGCCTTATGACACGGCCAGCACAGACTCTGCAGATTACAGTCGGCATCAGTGCCGCCATGCGCTTTAGGGATGATGTGGTCAACGGTTTTCGCCTCACGCACCACACCAGCACGCAGACATAACTGACACAGGCCTTTGTCACGCTTCAGGACACCCGCGCGGATACTGTCCCACTTCGAACCATAACCGCGCTGATGACGGGATTGTCCTGGCTTGTATTGCTTCCAGCCCTCGCTTTTGTGGCTTTCGCAATAGCCTGACGGATCAGTCGTGGTATTGCGGCAACCGCGAAAACGGCAGGCTTTTGGGGTTCGTGGTGGCATTTATATCCCCTCTTTGGTGCACGCTTACAATGCGTAAAAAAGCCTCGCATTAGCGAGGCTCGTTTATATCTTGAAGGTGAATACTTATTGTCTTATCTATCCACGGGAAACATTAAGATTATTACACCCGTTAGTTGGGAAATAAAACAAAATGCAGGTGGTTTATTTATTCTTTGCTGATTGCTTTCTGAATGGCATCGGCTAATGGTTCAATTAAGCCAATAGCTGCTTTTAATTCCTGTTCAAATTTTGCACTTGTAGCCTGACCACCTGCAGAAGAAAGTGAGGCCTTAACTAATTCAAGGGCAGCTTGAACAGCTATAACCCTCTGTTTTTGAGCTTGAGTTACAGGCTGAGCACCTATTCCTGAAGGGAAATAATTCTCTAACATTACAACCTCCGTATTAAAAAGTGAGGTTACACATTACCCTTAAGATTACTCTTAGTGAAGAGGTATCTCATAATTATCACCCTTACCAATGACGCTTGATGAAATTTGTAATGGACTGGCTCTTATTTCAACGCAACCACTTACCGCGCGCCAGATGCTTAACCTCAAACATTAGCAATGAGATGTTTAATCTGAATCCACTCCAGAAGTAATCACCACTCTGTCTACAGGGCCTGATGTGAAGGATGATGAGTAAAATTATCGCTATCATCGAAGGCATTGCGTCCTGATGTACTCCTGCAGGTAGTTAACCTGTGCGGTTATCCTGTCGATTCCACTTCGGAGACGGTAATAATTGAGTTCAGCATCTGCTGTAAGTCCTGGGCTTTCTCCATCGCCCATGCCGCTGGCTCTGGTCGTTGATTTTGCACAGGTGGCGGCGACTTGCAGGCGCTTACGCCCAGAAGAAACATCAGCACGGAGACTTTCGATAGTCGCGTTAGCATCAGCAAGCTCCTTTGTGTATCTGGCGTCAAGTTCTGCTACGTCACGTTGACGCTTCTGCATATCAGAGATAGTCGCCATAGCCGAATCTAATGCCATAGCGTTTTCGTCGCGCTGTTTTTTGTATTCAATGGCTTTATTGTGGTAATGATTCGCTGACCAGACAAGACCACCAGCAATACAAGCAATAAACGTTAAAATGAGCGCCCAATAACTCATCTTCATACCAGCAGCGCCTCCCGCGCCTTGTTGTATCGAATCTTACGATCCTCAATACCGTTCAGACCGCCGTTAATGATGCGCGTAACACGAGTAATATCGGCACCGTAGACCATGCAGCCTTTAGATGTGTAGAACCATGCAGCTGAGCGCGCGGCCTGTAGCTCCTGCTCCAGTTGTTCAGGTGAAGTCACCAGATCTAACTTCAGCGCCGCGCCACAGATGCGATAATTATGGAGGCCAGTGATTTGAATTAATCCTCTACCGCGATATTTCCAGCCATCACCGGGTGCTTTGTTACCCAGCCGGTTGCTATACACCAGATTGGCAATAGCATCCTGACGAGCTGCATGTCCGGATGTTCTGCCAAGGGCATCAGCCTGCTGCTGTGTGATCCTCTTTCCGAACGTCGCCACCAGCGCAGATGGTGTGTAGTTAAAATTTTCAACTAAGGCGCGAAACCCCATCGACTCATGGCCTACCTGAGCGATAAACATCGCCTGATCCGCTGGTGCTGTAATGCCGAATTCCTTCATCGCCGCATCAATGTGCGGAAACCAGCGCGCAGCCAGCCCGGCGCTAATACCAGCCGCCTTTTGAAATAATTGTTGGTTCATTAGTGCCTCAGATGATCAACCAGACGTGCAACGTTGCCTCTGACGGCCACCAGCACGGAAAGAAAAATAGTGTTCGCCACGATAATGGGCCATGAGGAATGGGGATAAATCCCACAGAGATAGGCCAACGGAACAGCACTGTATGTAACAGTAATCAGCCAGGCTAAACGTGAAACCCAAGGACGATGCCGCGAATCACCACGACGATAAAACATCAGAGTAATAACAACACAAGCACATAACAGCGCATTTATAGTTGCTGTCGGGTCATTTAGCTCCACCCGAACCTCCCCGGCGCGTTATGAGCGCCACCAGCGAGCCGATATCCTGATTATTCAGGAACGTCAGGATTTTAACGGCTAAAGCAGAGACGATTACGGCACCAATAGCATCCAGAGGTTTATCACTGTATCCGGTCAAGTTCGCCAGCTTGGAGCCAACCAACCCAGAGCAAAGAATCCCGGCAATATATGACACGATAAAATATGCCAGTCGGCGCGATGCACTCAGATCTGCTGCTGTTGCTATGTAGAATACAGCCCCTGCAAATGCGCCAAATACAACGCCGTAATCAGTTCCGGTCAGCAGTCCATAAACACTGGCACCCGTCAGGGCACCACCAGCCAGCCCAGTACCGGAAATCGGATCGGACATTTAGCCCCCTCTTAATTGCTGTTGGTCCTCTCAGATATGAGGGGAAGGGATCTTAATGACAGTCTGTTTATTATTTCAGTCAAACACTACCCTGTTGATGATTTCTCAGAAGCGAACTTGACTCCCAGGGGAAACTCAACTTTCCGTTAAAACCACCAGCAGACATTCGTTCAATTTCCACAGAAATATCACTGAGCCGTTCTTCAAGCTCTGCTTTTTCTTTTACCAGACGGTTATAGCGGCTTAGATGAAGCTTTTGCTGCTCCAGCCAGTCTTCAAGCTGTTCAACAGTCATACCAGGGTTAAAAAAATATGGCTGCTGCTTTTCGCCCTGCATTATTGACCTCCAGAAAAGCAAAAACCCCGCCGAAGCGAGGTTTGTTATGATTTCGTTAACGGCAGACATACAAAGCCCATCGTTAGGAAAATCCTAACCAGATTTTTTGAAAAATGCAAGAATCATGTCGCTATCTTCGGCGAAAATCATTTATCTCGTCACTTTTCTTAATTGCGCCTCAGCATATGCTTCTTCCTGCCAGCACTTTGTCACCAGTTTATCAATGACATCTGCATATCCTTTGTACCACTGATAATCCGTCAGGTCTGGTACCAGCTTCTGGACATGATGCCGCGCCAGTGTGGTTGGTAAACGGCTAAACCGGTTTCCATTGCAACGCCCACAAATCTTATAAACAGGCGTGCCATGAAGCCGGGTCCTTTTTTCATCCAGGACAATACCTTTACCCTTACACCCTCTGCACGCTGTGCTGACTTCTCCCTTACCATGACAATGCTGACATAGTTCCTTCACCCACTCTTCCTTGATAACAGATTCACCGCTTCTGGAGTGTTTCACCACTTCGCGCAATACATTATGAAATCCAGTACCAGCACAATGCTCACAGCGAGCCTTACTTGCCGCAGACCTGGAATAATCAGCAAAGGCAAAATTCACAAGGTAAGGGATGAGCTGTAACCGGGTTTCTTCACTCAATTTATTCAATGTCGGATTATCCAGTGCCATCGCGTAATTGAGCAGACCTTCAATCGCAAACTGAGGATCCTGAACACCAACTTTTGCCAGGAATAAGGCAAACCCAAGCGGTGCTTTCGACTGCACCATCCCCTGCGCAGCCATCACATCCGTAATTGTTAAACCACCAGAGCCTGTCGCCGGTGCGTCATCGCTCAGTTTTGGAGATTTCGGGGAGTAATATTTCGGTAAGGCTTCAAGGTTCATGCTCGTTCTCCACTTACGCCAGTACGCCTATTGCCAGCGCACGATCGATAAAACGAAATATCAGCTCCAGCTGGGAGCCATACTTCTCTTCAAATGCCACGGTATCCGCATGCAGCTCGTCGTGATGCTTTCTGCACAAAGGCAACACAAAAAGGTCATGCGCTTTTGTTCCCATTCCACCCTGACCGTGACCTATCAGGTGGTGGGGATCATCAGCGGGCTTTCCACAACATGCACACGGCTGTGTCTTAACCCAGCGCGTGTACTTTTCATTAACCCAGCGGCGACGTTTTGGGCGTAACATAAAAGACTCCGGCGACTCCGGATCCACTTTCAGCGCCAGCACCTTTTTCGCCTTATCCTGGATGATGCTGGTGGCAGGAACCGAAGGCACAAGGTCACTTTCCCGGGTAACAGACGGCACAACAGGCTTCGGTAATCTCAGTGCCTTACGGGCTGCACTTTCCGGTAAGGCATCCGCCAGATCATTACGAATCAGCCACCAGCACAGTTCCGGCATTGTCACAACGTGACTGTCATCAAAACCGAGATCCCGACGCACAACAGACAACACCCAGCGGGCACAGTTATCCGTTGCCATTGATTCCAGCCGTTCCGTGAACTGATCGCGCAGTTGGTTATCGCAGTGCCAGCACAGACGGATTGCACCCGGAGCGTGTCGCATTGTGGTCATGTTCTCGCTGTGCCAGTCGGAATGAGGCCACTGGCAGCCTTTTTCACGAAGTAACCAGCTTTCAAGACATTCCACGCCACCAGCACGACGGATCACTGCCTCATTGCGGAACACGGCCCGAACGGCAGGATCATCCGCCAGCGGTTGTGATGCCGCCGGAACGGCACCACTGGCGAAAGATGAATAACGCTCCGGCTCAGGCTCCAGCAGGACACGCCCCTGCATAAACAGGGGCATCAGCTCTGAACCTGGCCTGAACAATACGATCCCCATACGCGGGGCAATTTCAGGGGTCAGTAGTGCTCTCACGGTCACCTCAATGAACGGTATCGAGCAGCTTTAACAGCTCAGGGAATCGGGATTCGAAGAAATGCGGCTGCGTCTCGCGCGGATTTGCGGGACTGGTGATGTTCTTGCCGAACATGCAGCCTTTCGCCGTCAGCGACCAGAATTTTTTGATGTTGTTAATCCCGGTACGGCTGTATCGTTCGCGCTGCTCGACGATCCCCAGCTTCACCATCTGGTGATATGCCTGATTAGCCGTCAGGCGGATACCATACTGTTTCAGCAGTGCACTCAGTGACAGTGTCGGGCGACTTGAGCCATCGTGTGCATCAGCAGGAGCATCAATGGCATAGCGCGGTGCCAGATTCGGTAAGCCAACAGCCTCCTGGAGTTTCTGACAGGCACCAAGCACTGAAGAGTTAGACAGGTTTAATTCCCGGCGCATAAAGTCCAGCAGAATCACACCAGCCTGCATCTTGTCAGCAGCCTGTCCGGATAATTTTTCCGGTGCGCTGGTTACCATGTCGAAAGTACGGATCACCTTCAGATGGAATGACGGGCTGATCCACATTGCATAGGCATACACCAGTTCCTTGCAGACATACGTTCCCCGTTCATTTCCCCCATGAATCACACTCACCGGGTCAACACCCAAATTCTGGGTGTTGGTCAATTCATGAACAAGCTCAACAGTTTGTTGGCTGGAAAGAAACTTTCCCGGCTCCTTGGTTCTGGCATTTGCACCAGATGCTACTGCTGCGCGATGCAGATCGTTCAGGCTGTAACGCCCATAAGCATCACGACGAACTTCAATACCATCAATGACCATCAGATTATTCATACTTCGTTTCTCCTCTTAATCAGGCAGCTGCACCCGCCGTTTTCTCGTACTTACTGATAGTGATCTCGACCTTCCCTTCCGGGATAACCGGTCCCCACTCCACCAGCATTCTTTTCACCTGACTGTCGTCTTCCCACACCCCCGCGTGGGTCAGGGCGTCAAACAGCGCCTTGTTATAGTTGTCCAGATCGCGGATCCGGTTATCCGGAGGAAACAACACGATCTCCACTGAAGCAGGTGCCGACGTTGGTTTCGGCAGACGACGTAACTGCTCAACTATTGCTGCGCACGCCGCGCTCTGGAATTTTCGCCCCGCCGCGCTTATCAGGCTCTTACCAGCAAACGCCCCTTTGTTGGGGTGTCGCCAGTACGTGTTCACGCTGGGCGGAAAAGGCAGGATCAGCTTCATACTTTCAGGCCTCTCTCATGTAACCAGTGGGCTGCACGCAGCCTTGCGTTTTCCTCACCGGCAAGCAGTGAGCGGATAATCCCGACCGCCTCGCTGTCGTCGTCCTTCACCGCGGTATGAAGCGTGATGCCCCGGGCCACGCCACGCTTTATCGTGATGACGCCTTTTTTCTCCAGTGCGCGAAGATGCTCCACCGCTGCATTCACTGAACGGTATCCCAGCATGGTTGCCACCTCCTGATTGGTTGGCGGGAAGCCACGTTCTTTCTGATAAGAAATCAGCATATCCAGCACCTGCTGCTGGCATTGAGTTAACGTCGTCATGCCGCCATCTCCCTGACCAGTTTTTCCGCCTGCTGGCGAACCTGCGCCAGAAACGCCTCACCACATGCCTCAAGTTCATCGCGCCCGATGTAGCTGATTGCCTGTCCCTTCCAGGTCTTGTCAAAAACAGCAATAGCACCAGCGAAAAAAGCTCCTGTCGGTACCTGCTTCTCGTCTTTCGGGATAAACCAGACAGGCAGTTCAAAACCAATACGCCCGCGAATAAAAGCAATATGATCTGCATCTTCCGGCCACCACACTTCGCTGGTGGCAGCTTTGATCAGGAAAACATAGCGCCCGCCCTTATCACGCATGGCACTGGCATGTTTCATGATGTAACGCATGCCGGTGATGTATTGCCCCTCATGCTGACTGGCGCGGCTGTATGGAGGATTACCAAAGGCAGCACCTTTAAGCTCCACAAGGCGTTCTGACCAGTCATGCGCCAGCGCGTTGTCTTCCGCCGTGTAATACGCGGCACATTTGGCGTTATCACCGTCAGTGAACAGATCCAGAACAAACGGGCCAAACAGGGTGTTAATTCCCCAGAAAATGTTGTCCGGCGTGCGCCACTGATCGCCCACTTCCTTCAGTTCATGGGCTGGTTTGTTCCGCAGTTCCACCAGCGCCTGGCAATATTTATTACTCATTAAGCCCCCACGTAATTCCCTGACAGATACCACTCATCACCCGATACAGCGCGCTTGCTGCTTTTCCGTAAACACTGCTCACGACGCGCCAGAAAATTGTTTCGTTCTGGCTGGGAGTGGCTTTCACGGAATGCCGCCATCCACACCGTTGCAGCACGACGGTATAAGCCCCTGGACTCCAGTTCTTCCGCCTGGCGGGTCAGGCACAAAATCACCCGGGGATCGTTAGTGCCGACATAGAAATTGCGCACAGGTCTGGTTTCACGAACTGGTTGTGGTTCCGGCTCCTGCGCTCTCTCAGTCAGGCGTGGGAAATGTCTGCGTGTATCTCCTTCACAACGGTGAGCCACACGCCCACTCTGACGTAACTTGCTTGCTGACTGCAGAACGCGCTGCCGTGAGTAACCTGCAAAAGCATCCGCAATGTCTCCGGAAGTACACCCCGGATGGGCTTCAATGAATTTCTGAACGTCATTCAAAAGACTCATGATCACCCCCTGAATCCTGCCGGGATCTGGCTGTAGTCCACGTTGTCGTAACTGGCTTTGAAGTACGGGTCTTCGCGTTTTTCGGTGTACGTGCTGACGGACGGCGATAAGCGCAGGGAAAGCTCATCCCATTTTTCCCGCAGCTTCGACGGGCTGAGCACGTTACGGCACCAGAACGGATCGCGACTGACGCGGCTGTACATCTCGCAGATTTGTTTGTGAGTACGACCATCCTGCACACACATCAGGCGAATTTCGTTTGCCCAGGCTGTCCAGTTAGGTTCTTTGGGACGAACCACCTCGCCGTCACATTCGGCGGCCTGCTCGTACAGGGCGATGATTTTTTTCCAGAGCCACTGTGCGCAGGTCAAATCATCCTGCGTTCCCCACTGGCGCTTTTTAGGGCTGAATACAACCGCATCAGGATAGCGAGTTAAAAAATCCTGTTCAGCCTTCTGCGTGTCCGGTTGCGAAGCGTCCGGACGAGAAGGTTTTTTATCTGACGGATCATGTTTTGATTTTACTGACGGATCCCCGCCAGATTCTGACGGGTGAAAACCCGCTTTTTTGCCAGATTTCGACGCATCAAATTTTGACGGGTCAGATTTTGATGCGTCAGATTTTGACGGATCAGAATCTGACAGTTGAGAAAATGCCGCTGCCTGAAGCTTCGCAACGTTAAGCTGATAAACATTCGACGCATTGCGGTTACCCTGGCGACGCGCCTTACGCGTTAACCAGCCTTCTGCTTCCAGCCGTGCGATAGCCGTTCTGACGGTACTCATCCCCGCGCCAATCTGGCGGGCAATGGTTTCAATTGATGGCCAGCACACACCTTCGTCATTACTGAAATCAGCCAGGCGGGCCATAATTGCCACACTGGATAATTTCATGCCTGACGCTGCGCAACCATCCCATACATAGCCGGTTAATTTAGTGCTCATGACCGACCTCTATTTCCCTGAATTTACGACGAAACTGCTCGAGCGGGCTGAAGCACTCATGCTCATAGCCTTCGCGGAGGTAGATAACCAGTTGTGTTTCCGGCTCCCAACGAATGACTCTGACGGGCACTCCGTAGTGATCTTTGAACCAGCGGTTAACTTCTCGCAAAGGACTGTCTCCTTCTGCCGGTTGAAATCACCCACAGCCCACTCTGCAAAGCTGTGGGTTACAATTTCCCTGTCACCTGGTACATTTACTGCATAGCAATACTCCACCTTCGCTTTTCCACCCGGTACAGGAAGTGCAATCAGTTGCGAGCGACGGTAGTGTGTTGTTAAACTGTTCATGCGTTAGTTTCTCCACAACCAGAAGCAATCGACGCCACGACGCCCGGAGCTGCACACTCGCGGGCGTTACTCTTTTCCGGCGCACAAAAAACACGAAATAACAGTGTTAAATGCTCCTGCCACTTCGCCATTACTTGGTAGCTGTTCTCTTCGATTTGCTCACGCTCAGCCCGGTCAATAACTCCATCAGCAGTTGCCTTGCGTAAGTACTGGGAATGCTTGCCAATCCATTCTATTGACTCCATCAGCCGCTGATTAATGTCACCATTGTCAATGTCATCAATGTCCACCAGCGGCACAAACACCCCATTACTACGACGCGCTATCGCATCCGTTACATGCCTGGTACCACTGGCATCCTGTAAAACCATGGCCCACTCAAGTGGAAAAATTTGATCCCCACCGCTACGCAGTCTGTTATGCAATTGATCTTTCGCTGGGGTGATATCATCAGATTTATACAAACCAAGAATTTCCGCTGCTTCCTCATAGCCATGAGGTAAATCAGCAATCGTTCTTCGTATTGCTGCCACCAGCCATGCTGGTTGCTTATCAACTTTCCATTCAGGTTCTTTACCCACGGTTAATTCCTCGTTTCTGTGGTTACGTTTACGCAGTTGAACCGCTAACTTTTGAATAGCACTCAGGTAATCCATCATTTGGATTGGGGTAAATATCAGGACGCAGTTCATGAGGGGTCACGGACCAGTTTCCCAACTCACAAAGTTGTAAAACCCGTTCTGACGGGACTTGATTGTTAATTACCCAATTGGCGACGGATTGAGTGGACTTAAAACCAAAGCGACGGGCTACTTCAGATAAAGATTTTCCCGCAGCCTTTACTGCTTTCTCTGTGTAGTTTTGAGATGACATACCTTTCTCCTCTGAAATTCATAGGGATGATGCTACTTAAAATAGCAAATTGCAACTACTTAAAATAGAAATGACTAGCGCGTGCGATGTGAGTAATCTTCTACCTATGGTAGAAGAACAGAAGTATCCAGATTTCGCCAAGAGACTAAACGAATTGATGACAATCAAAGGAATCTCTGTCACTCAACTCAAAAGTCTTGTGGGCGTTACATATGAAATGGCTCGTCGATACACAATCGGCGCAGCGAAGCCACGTGTCTCTGTCATGAGTAAACTTGCGTTGGCTCTTGGAGTATCAGCTTCATATCTAGAATATGGTGTTGGAGATAGAGAGGAATGTAAGGAAATGGCAAGCATCCCCAATCCAACAAAGCCCGATGTATACAGGATAGAAGTTTTGGATCTTAGCGTTAGCGCAGGACCTGGGACCTATATGCTTTCAGACTATGTTGATGTGCTCTACGCCATTGAGTTCACAACTGAACATGCCCGTTCTCTTTTCGGAAACCGTTCTCAGAATGATATAAAAGTTATGACGGTAAATGGAGATAGTATGTCCCCAACTCTTGTTTCCGGGGATCGATTGTTTGTCGACATTTCCGTTCGCCACTTCCAGACTGATGGAGTTTACTCTTTCGTTTACGGTAAGACTTTCCATGTTAAACGTCTACAAATGCAAGGCAACAAACTAGCTGTTCTTTCGGATAACCCCGCCTATGAGAAATGGTACATTGATGAGAAGTCGCAAGATCAGCTTTATGTTATGGGTAAGGCACTGATTCATGAGTCGATTAAATATAATCGACTTTAATTAGGCATCTAGCAATCGTTAAAGCAAACGAGCACAGATGATAAAAGATAAGGATAGAAAATCTTTATACAAATTTTATTTGGCACATTGGTCCATAAAAATGGATTAACACTGAGCTGTTACCCTAAGTAATAAAAGATAAAGACCATATGTTATCTTTGGGTGCGTTACTAACACAAAATAAAACATAAATAATAGACAATTCTATACAACAGGATACGATAATGATTAATGAACGTACTGAAGCGACAGATGGGGTAGCAGATATGATTTCCACCAATACAAAATATTTAGTATGGAACAACAAAGGTGGTGTAGGAAAAACTTTTCTTACATATAATCTTGCCGTTGAGTTTGCTATATCTCATCCGGATCAAGATGTTGTGGTTATTGACTCATGCCCTCAATCAAACGTTTCAGAAATTATTCTTGGTGGCAATGGTACCGGGGAAGAAAATCTAAATAAATTGCGAGACAGAAATGTTACAATCGCAGGTTATATCAAGGAGCGTTTTAGCAAATCTCCTTTGTCTCGTTTAGGAAATGAATCTTCTTACTTTGTACGAGCCCATGATGTTAATGCAAAAATGCCAGAGAACTTATATATTCTTCCTGGTGATGTCGATCTTGATATCTGTTCACGCTTAATATCTCACATTGGCTCATCCCCAGTAAAAGAAGCATGGAAGAAAAGCCGATCTTTGCTGGTAGATCTAATAGCATCTTTTGAAGCCGATAAAAACATCTCTGACAGAGCAAAAACATTTTTTATTGATTGTAATCCAAGTTTTGCCAGCTACACAGAATTGGGAGTAGTCGCGGCAAATAGAATAATTATCCCTTGCACTGCCGATGCTGCATCAATTCGCGGAATAAAAAACCTTGTTAAACTTATTTATGGAGTGTCTATTGACAAGTCAGAACAAGATGAAATGTTCTTAGATTTCAACAAAGAAGCAAAGCAAAACCTTATCGAACTACCTGAACTACACCTTTTCGTACAAAACCGCTCAAGAACTAATGAAAGTGATGCAGCAAAAGCATTCAAATCACATGCAGAAGAGATCAAAAGAATCACGGATGACCTGTTAAATACACATCCTCATCTGTTCACAAATGTGGCTACTTTCGAGAGAGTTCAAAATGTCAAAGATGGTAATACTCTTGCAGCAATAATAAACCATGAGGGATGCCCTTTAAGTAGGCTGCAGCATAAGAGTTACACTATCTATGGTATGGCGACCCAAGCTAATAGAGCACAAATTGAAGCACTAGAATCTGATGTTTCAACAGTAGTCAAGTGTCTGTAATACAATAACCATGATTTAGAACGAACATAATTCTCGCCCGACCAACCGGTCGGGTTTTATACTCGTAGCCTGCTTAATAACACTGTTCCTGACCCGTTGTATCCATAGTTCCCACTATTTTCCCCCAAATTATTACTATCTAATTAACCTAATAATTAAACCAAAACAAAAGACACTCGATATTTTCTATACGTAACTACACCACATCACCCCATCTGTGAAAAACTTTTTAATTCATAACGTTATATAGAAAACTAAAAATTAAATACACACTTCTACTTTTTGTTGTTGATTTTTGCTACTTTTAGTAGCATCATTATCTGAACAAACAACAGACTGTTGTTAACGAAAGGTTGGTTGTAACACGGCGTATGGCACATGCGTCGTTAGCGGTCTGGTGACGTTAAAGGGGACAATCCACTCCTTGCTCGGGCAAACAAACCAGGTAGCCGGAATGTGCAAGTCAATGATGATGCTGATAAGACGCCTAACCAGCGTGGCGATTCGGTTTGACGCCTGGGAAGAGACCAGGGTGCAACGATGAGGGCATTTATGGAACCGCGACAAAGTGTGGTGCCGTAACTGGCTAAGTGCTCTCAGCGTTGTGGTGAATGCGCAGGCTGATGCGCGAAAGACATTGCAGCTATTGCGGAAAAGAGCTGTTCGGCGGGGCAATTAAACGCCCGTGAGAGTCTGAAATAACCGCAAGCCGGAGATCAGCACCGGTCACCACAACAGCCACTGCTTTGGCGGTACCAGTTTGTACACTTGCTTCCGGCTGGTACCGCTCTTTTTACAAAACAGAGAAGAGCATCACCGGACGACGGGCTCATAACCCAATCCACCCGGGCGGCTGCCACCGCAGGTGTTCTTCTCTGTTTTGTGGAGAAACCAACCGACCTTGCAGGGTCGATATGATGAGGAGCAGCAAAATGGCTAGCGAACGCAGTACTGATGTGCAGGCATTTATCGGGGAGCTGGACGGCGGCGTATTTGAAACCAAAATCGGCGCAGTTCTCAGTGAAGTCGCTTCCGGTGTGATGAACACGAAAACCAAAGGTAAGGTCTCGCTCAACCTGGAAATCGAACCATTTGATGAGAACCGTGTGAAAATCAAACATAAACTCTCATATGTTCGCCCGACTAACCGCGGGAAAATTTCCGAAGAAGACACCACCGAAACGCCGATGTATGTCAATCGCGGTGGTCGCCTGACTATTCTGCAGGAAGACCAGGGACAATTACTGACTCTTGCCGGTGAACCTGACGGAAAACTCCGCGCAGCAGGTCATTAATATCGTTCTTAATTAACCGATTATTTATCTCATCACTGAATATCTTTATATAGTGAGGACTTATTATGTCTCAGAACTTAGACGCAACCGCAATTAATCAAATCCATGCCCTTATTTCTGCTCAGGGTGTTAATGAAATTATCAGTAAGATTGGTGCCGATGCTGTGGCATTGCCTGAGAATTTCCGCATTCATGATCTGGAAAAATTTAATTTAAATCGCTTCCGTTTCCGTGGTGCGCTTTCCACTGCCAGTATCGATGACTTTACCCGTTATTCTAAAGATCTTGCAGATGAAGGCACCCGCTGCTTTATCGATGCCGATAATATGCGTGCCGTCAGTGTGCTTAACCTGGGTACTATTGATGAACCAGGTCACGCAGATAACACCGCCACTCTCAAACTGAAAAAGACAGCACCGTTCTCTGCCCTGTTGTCTGTTAACAGCGAGCGTAACTCCCAGAAATCACTGGCAGAATGGATTGAAGACTGGGCCGACTACCTTGTGGGCTTTGATGCTAATGGTGACGCCATTCAGGCAACAAAAGCGGCGGCGGCAATCCGTAAAATCACGATTGAAGCAAACCAGACCGCTGATTTTGAAGATAATGACTTCAGCGGCAAACGCTCCCTGATGGAATCTGTCGAAGCGAAGACCAAAGACATTATGCCAGTGGCATTTGAATTTAAATGCGTTCCATTTGAAGGCCTGAAAGAACGTCCGTTTAAATTACGACTCAGCATTATCACTGGCGATCGTCCTGTACTGGTTCTGCGCATTATTCAGCTGGAAGCGGTGCAGGAAGAAATGGCTAACGAATTTCGTGATCTGCTTGTTGAGAAATTCAAAGACAGCAAAGTAGAAACCTTTATTGGTACTTTCACCGCCTGATTTCATTACTGCAAATGCCCCTGCGGGGGCATTTATGGAAACGTAATTAACTCAATAATCACCGGATGGTGAGAGCTTCCTTTTAGCAGAATTCAACGCGGTGCAGCGCATATAAAGTGGAGAACGAAATGTCATTTATTAAAACTTTTTCCGGGAAGCATTTTTATTATGACAAGATAAATAAAGACGACATCGTGATTAACGATATCGCAGTTTCCCTTTCAAATATCTGTCGCTTTGCAGGACATCTTTCACACTTCTACAGTGTCGCCCAGCATGCGGTGCTTTGCAGCCAGTTGGTGCCGCAGGAATTTGCTTTTGAAGCGTTAATGCATGATGCAACAGAAGCATATTGCCAGGACATCCCCGCACCACTGAAACGCCTTCTTCCTGACTATAAACGGATGGAAGAAAAAATAGACGCCGTAATCCGTGAGAAATACGGGTTACCTCCTGTTATGAGCACGCCAGTGAAATATGCCGATCTCATTATGCTGGCAACCGAACGCCGCGATCTCGGGCTTGATGATGGCTCTTTCTGGCCTGTACTGGAAGGTATCCCGGCGACAGAGATGTTCAAAGTTATTCCACTGTCACCAGGCCATGCCTACGGGATGTTTATGGAACGTTTTAACGAGTTATCGGAGTTACGCAAATGCGCATGAATGTTTTCGAAATGGAGGGGTTTCTTCGCGGGAAATGTGTACCGCGAGATCTGAAAGTGAACGAAACAAATGCTGAGTACCTGGTACGTAAGTTCGACGCGCTTGAAGCTAAATGCGAGACGCTGGCGGCGGAGAATGCGCGGCTGAATAAATTTATCGTACAGAGTTGCTATGTGTTTAATGGCGAGCAGGATGAAATATCTGATGCGTATATCTGCGCAACAGACGGAGGTATGCCGCAAATTCCAGCCACCGATGCTTTTCTGGCTGAAATTCGTGCGGCGGCTCGCAACGAGGGGATTAACTATACCGCAAGTCGTCTTGCTGCTGCTTTCAACCACGGATTTATCAATAAGTCTTTACGTGAAGTTTTCGACGTTACGCGCATGATTCTGTCAGCGAAAGAAGAGTTGGCTAATGAACCGCATCCGATTGATGGCCTGTCTGGTGAATATGCGGAAAAATCCCTAGAAGAATGGGCGGAACAGCTTCGCAAAGGAGTCATCCAGTGAGCAAGATTGACTATCAGGCACTGCGTGGGGCGGCAGTAGCAATTGAAACAGTAGCAACGCCTCAAAAATTGCTGGCATTTCGTATGAAAGTCACACCTCAGGTTGTGCTGGCACTGCTGGATGAACGGGAAAGAAACCAGCAATACATCAAACGCCGCGACCAAGAGAACGAGGATATTGCGCTTACTGTTGGGAAGCTGAGAGTTGAGCTTGAGGAGACAAAATCAAAACTCAACGAGCAGCGTGAGTATTACGAGGGCGTTATCTCGGATGGAAGTAAGCGCATAGCAGAGTTAGAAAGTGATTCTCAGGCACAAAAGTTAGTTGAAGCAATCATTGTTGCGATAGAAAACGAACAGGAACGTCTTTTTGATGAAGATTACCTAATGGATTCGAAAGAATGCATTGACGTAATTCGTGAAGAAGTAAAGCGATGGAATGATTCCCGCGCCGCTGGCATTCGCATCAAAGGAGAGTGAGATGGCAACTTTAACAAAAAAAGAACGGGCATGGTTGAACGAATTACAGGACGTTCTTGATCGCTGCCCATCACCGAAAAAAATTGGTTTTTACACCATTGGCGATAAAAGCATTTACCTGTATGACCTGCGCCGCATGGATGAAATCATGGAGGCTCTTGATAATCGTTCGTCAATGGATTGGTGTGTTGCTGTTCATGATATGAATGCCGGATTTGATGAAAAGATTTTATTCCCCTCATCAGTTGAAAGTACAGCAGGATAAGGACTAACACATGACCACTATTACCAAAGAGCGACTGCTGACAATCAGGCAGTGGCGCGAAACATACGGACCTGGTAGCAACGTTGTACTGCCAGCAGAAGAAGCGGAAGAACTGGCACGAATTGCTCTGGCATCGCTGGAAGCAGAGCCGATAGGTTTCCGTTGCAGGCGCAATGATAACCTTGGTGATTGGAGTTACGTATATCATCGAGAGCCAGATGATTTTGAGCGCAAACATTTAGTGATAGAGGGCATTTACGCCGCCCCTCCAGCACCGGTAGTGCCTGAAGAAGCAACTCCGGAAAACGTAGAAATGCTCTCTGGCTATGTTTCAACGTACAAATTAACCGATAGCGAGCGCGATATTGCTGCCGAAATATGGAACGCCTGCCGCGCCGCCATGCTTCAGTCCGGAAACTTTCGGGAAAACAAGAATTCGTCAACCAATAATTTTCGGGAAATCGCGGAAACGTCAACCAACTATCCGGCAATTCCTAGTGAGGTGTTGTCCGCAATCCTGAAGGTTGCCAGGATTCGTGCCGATTTCGATGATTTTGACGGTGACAGGCGAGGTATCGGTGATTGTCTGGATGAGGCTGAGCAAGAGCTTATCGTTACCATTAACAAATATGCCAGTCAGTTGGCAGCAGAACCTATAGCGCCTAATGACGTTCGAGAGCAGACAGCCATTCCACAAGTTCCGGTAACTCCGGATGGTTGGATAAGCTGTAGTGAGCGAATGCCGGAAAAGAATCAGAACGTGCTTATTTCGGTGAATTTCGATAGCTCTCTGGTTGAACCGCTAATATGCTCCGCACGCTATACCGGAAGCACCTTTCGGCGCGGAGATGCAACGATTAAGCCGGGTAATGGTATTGAGCAAGCAACTCACTGGATGCCGCTACCGGAACCGCCGCAGGAGGTGAAGTGATGAACAACTTAATGATCGACCTTGAGACGATGGGGAAAAATAAGGATGCACCGATCGTTTCCATTGGCGCGGTGTTCTTCACTCCAGAAACCGGAGACATCGGACAAGAATTCTATACGGTTGTTAGCCTGGAAAGTGCTATGGGGCAAGGAGCTACACCTGACGGCGATACCATCCTGTGGTGGTTGAAACAAAGCCCTGAAGCACGAGCTGCAATCTGTATTGATGATACTTTGTCGATCAGCGATGCTCTCTCAGAACTAAATCATTTCATTAACCGGCACGCAGACAATACGAAATATTTAAAAGTCTGGGGTAACGGAGCCACCTTCGACAACGTAATTTTACGTGGAGCTTATGAGCGAGCAGGACAAATCTGCCCGTGGGCATACTGGAATGACCACGATGTACGCACGATCGTTACGCTTGGGCGTTCCATCGGATTCGACCCCAAAATGGACATGCCTTTCGATGGCGAACGGCACAACGCCCTGGCTGATGCCCGTCATCAGGCAAAATATGTTTCCGCTATCTGGCAGAAATTAATTCCTGCCACCAGCACAGAATTATGATTTTCCCGGGTGCAGCCGGTTTTGATGGAGAAAATTATGAACACCTTGTTTTTACTGATGGCTGAATTCAATACCCCTAACATTGAACTCTCAGCAGTTAGCCAAAAGTACTTTGGCATGAGTCCAGCCACGGCAGAAGCAAAAGCAAACGCTTGTAAGTTGCCCGTTCCAACATATCGCATCGGCACATCACAAAAAGCAAAACGTTGCATCAATATTCAGGATCTTGCGGAATACATAGACAAAAGACGAGAAGAAGGACGTATCGAGTGGGAACAGGTCAGAACAAGCAAACAGAAGGGCAAAGAACATCACTAAAGAAAAAACCCGCCTAAAGGCGGGTTTTCAAAAAGCACCAGCTATGATCATGCTGCTTTGCGACGACGAAGCTTACCCTGCTGCTCTTTACCAGAGACAGTAGCGTGAGTGAACGCATTAGGAGCAGCCTTCATCAGAACTTCAACAGCAGCACCCATACCTGCGAATGCTTTCATTGTGTCGAACTTAACCTGTGGCTTGGTTGCTTTTTGATCTTTCATAGAAAACTCCCGAGACAGTAAAGGCGTCTCTAACCCTTTCTTTAAAGCTAGCTTGTTTCGCTAACTTATGCCAATCGATCATGTCGATTGGTGACATCGTTTCTTAGTAGTTTAAGCACAAAACGACTGCCATAGATGTACCTTTAAGGTAATCTGGACGGGTATCCTACAATTTGTAGACCCTTCTCGTCTATACCTACTGAGCAAATTTAAGAAAGATATCCTGCAGCTCATCAATGACTGCCGACATCACATAACCGCACTGTTCCATGCGGAAACCAAAAGACTCGTAATACTGCACCAGTTCTGGTACTGGCTCTACAATGTGGACAACTTTACATTCAACAGCTTTACAAAATATAAAAGCACTCATAAGAGTGAGTAAAACCATGCGCCCTTTCAATGGGTGAGATTCATCTTCTCTAGAAAACCTTTCGATCATATGGATACGAAAGATGTTTTCTTCAACCCCATAAACACAAATTGCTGCTCCTGATGGTATTCCCTGAACCCGACCTTGCTGAACAAGTTTTATGCAGAACTCATACTTTTCTCTGGAGTTGCCATAGGTACTTAACGCATAGTCCCATTCAAGCTCACCATAGCCACCACACAGAATCTTGTAATCATCATCACTGAGCGGACCAACAGCAAGAGGTAAGCCGACATGATCAATAATCAACTGGATATTGTTACGTACTGATTGACCTATCTCGTCCAGGGTAAGCATCATGGCCTCTCAAGCGGAACACTAAAAGTCGCATTATATCTCATTCTTAAGCCGCGTATGGATTACACCTTGAAATGAAAACGCCGGGTTCCCAATAGGCTCCCGCAGAGTGTATAACTACTTGTTTTTCAACAACGGTACATCCTATCAAGCATCGGGGCAATCGAGAAACGGTAAGCGGGATAAACAGTGTGTGATTCTGCTGGCATGGCGACTTTTTAAGCATCTGGTAAATTGGGGGCGCTACTATAGCATAACGAATATGCGATCAGGCAATGTGACAGCCATGACACCCTGTTCTCGGCGTTTAAGCGAGAACTATGGTAAAGTAAGGACATTCTTAACCCCACTGTCGAGGTGCCCAATGGAAAAGACCACAACGCAGGAGTTATTAGCGCAGGCTGAAAAAATCTGCGCACAGCGTAATGTGCGCCTGACCCCACAGCGCCTCGAAGTGTTGCGCCTGATGAGTCTGCAGGATGGCGCTATCAGCGCTTATGATCTGCTTGATTTGCTGCGCGAAGCTGAACCGCAAGCCAAGCCGCCAACGGTTTATCGCGCGCTGGAGTTTCTGCTTGAACAAGGTTTTGTGCATAAGGTGGAATCCACCAACAGTTATGTGCTCTGTCATCTGTTCGATCAGCCCACCCATACTTCAGCCATGTTTATTTGCGATCGCTGCGGCGCGGTGAAAGAAGAATGTGCTGAAGGCGTGGAAGACATCATGCATACGCTGGCGGCAAAAATGGGTTTTGCCCTGCGGCATAATGTTATTGAAGCACATGGATTATGTTCAGCATGTGTAGAAGTAGAAGCGTGTCGTCATCCTGAACAGTGCCATCATGACCACTCTATTCAGGTGAAAAAGAAACCTCGCTAAAAGGGTGTACATCCTTGTACATGTCGGGCAGGAGGGATTAATTACCAGCGGTAATCATGGCGTTTTTCCCAACTATCAACCTCTTTTTCTGCCTGATCTTTCTGATAACCGTAGCGTTCCTGGATTTTACCCACTAACTGATCACGTTTTCCTTCAATGATCGTCATATCATCATCGGTCAGTTTGCCCCATTGCTCTTTCACTTTACCTTTAAACTGTTTCCAGTTACCGCCGGCTTCATCTTTATTCATAATCAAGACCTCATCGTTAGGTTGTGAATGAGAGTACGTTCACTTTTCTTCTGAACGTGAGATTAAGTGTAGTCAGCAATTTGGCTTAGGATTATTTATTCAGAATTTTTAACCGTCACGTTGCGACAAACCAGGTATCGTTACGCCAGTGACGCCGCCAGATAGCCGCCAGAGAAAGCCCGCGTAACGCCAGAAAGACGGTTAACGCCAGCCACAAACCATGATTCCCCAGCCACGGCAGCGTAAGGAGCGTCAGCGCAAAACCTGCGGCGGCCACCGCCATACTGTTACGCATTTCGGCGGCACGCGTTGCGCCTATAAACATGCCGTCCAGCAGATAACACCAGACGCCGACCAACGGCAAAATCACCTGCCAGATAAGATAGCGGTCAGCCAGCTGCTGAATCTGGGTTAACGACGTCAGCAACGCAATGATGTGTTCCCCTGCCAGCAAATAAACCACCGAAAACAGTAACGCTACGATCCCCGACTGGCGGCACGCTGCCCGCCAGACATCCAGCAACTGGCTACCGTCGCGCGCACCATATGCCTGACCGGAATGTGCTTCAACCGCGTAGGCAAAACCATCCAGCGCATAGGCGGTAAAGGTGAGTAGCGTCATCAGAACCGCGTTAACAGCGATAATGTCACTCCCCAGTCGTGCGCCAAGTACGGTGATCGCGCCGAAACAGAGTTGCAACAACAGCGAGCGCAGCATGATATCGCGGTTAAGCGCCAGCAAGCGACGGAAATTTCCTCGCCAGGCAGTTTTCAGCATTTCGCCGGAGATTCCGCGTAGTTTGAGGATTTTACGCACCATTAGCAGACCAATCAGCAATGTTGCATATTCCGCAATAACCGTCGCCAGCGCCGCGCCCTGCACGTTCATATGCAGCCCCATCACCAGCCAGACATCCAGCACAATGTTGAGGATATTGCCGACCACTAACAAAATTACTGGCGCACGGGCATATTGCACGCCGAGTAACCAACCAAGTAATACCAGATTCGCCAGCGACGCCGGTGCGCTTAACCAGCGGATTTCAAGAAAGCGCCGCGCCTGTTCTAGCACCGCTTCGCTGCCGCCAACAATATGCAGCGCCAGATCGATAATCGGCGTACGCAGCAGCGCAATTAACGCCCCAGCCCCCAACGCCAACAGCAACGGTTGCACCAGCGCACGGGCTAATGCCTGAGGATTTTTGGCACCATAAGCCTGCGCAGTCAGCCCGGTGGTGCTCATGCGTAAAAACAGCAACAACATAAAGAGAAAGCTGGTCGCCGTTGCACCAACCGCCACGCCGCCCAAATAAACCGGACTATCAAGATGACCAATTACCGCCGTATCGACCAGTCCCAGCAACGGAACGGTGATATTGGAGAAAATCATGGGTAAGGCGAGATGCCAGAGTGCTTTATCAGATGAAGTGAGGAATGCCATGCAGACAAGCCTGATGAAGAGAGATGAAAAACAAACCGCGATACCAGGCGGCATCGCGGTCTCAGAGATATGTTACAGCCAGTCGCCGTTGCGAATAACCCCAACCGCCAGCCCTTCAATGGTGAAGCTCTGCTGACGAAGGTCAACGACAATTGGTTTAAACTCGCTGTTTTCTGGCAACAGTTCGACTTTATTGCCCTGTTTTTTCAGGCGCTTAACGGTAACTTCGTCATCAATACGTGCGACAACGACCTGACCGTTACGTACATCCTGAGTTTTATGCACTGCCAGCAAATCGCCATCCATAATGCCGATATCTTTCATCGACATCCCGCTGACACGCAGCAGGAAATCAGCATTCGGCTTGAACAAGGAAGGGTCGACCTGATAATGACCTTCAATATGCTGTTGCGCCAGAAGCGGTTCACCGGCAGCCACACGACCTACCAGCGGCAACCCTTCTTCCTCTTCCTGCAATAAACGAATCCCGCGTGATGCGCCGGAAACAATTTCAATAACGCCTTTGCGTGCCAGCGCCTTCAGATGTTCTTCAGCCGCGTTTGGGGAACGGAACCCCAAACGCTGCGCGATTTCCGCACGCGTCGGCGGCATACCTGTCTGGCTGATGTGATCACGGATGAGATCAAACACCTCTTGTTGCCTGGCCGTTAACGCTTTCAT
Protein sequences of DBSCAN-SWA_2 >CP029164|355346:412633|391067_391292_-|AWH68242.1|DBSCAN-SWA MLENYFPSGIGAQPVTQAQKQRVIAVQAALELVKASLSSAGGQATSAKFEQELKAAIGLIEPLADAIQKAISKE >CP029164|355346:412633|376865_378194_-|AWH68224.1|DBSCAN-SWA MTWKDRLQDASFRGVPFKVEEESAGTGRRVETHEYPNRDKPYTEDLGKVTFRPSITAYVVGDDCFDQRDRLIDALNKPGPGTLVHPTYGELKVCVDGEVRVSTSKSEGRIVRFDLKFVEAGELSYPTSGAATAQTLMSSCSALDDCISDSFSGFSIDGVADFVQNDVIGNASIMLGYVSDAMKVVDSAVSDAARLLQGDISVLLPPPSSGKNFVEQVQKMWRTGKRLYGNASDLVTMIKTLSGVSLGSDLQPRGVWKTDSKTTATATQQRNVVASTLRTTAISEAAYAVTRLPAPTTSAVMQNSAVGQATTPAQSTGWPSVTHPALNNAPAVKNTVDLPTWEELTDIRDTLNTAIDKELSRTTSDALFLALRRVKADLNADINTRLEQSARIIQRTPDEVLPALVLAATWFDNAARDADIIRRNAITHPGFVPVIPLKVPVQ >CP029164|355346:412633|362206_363400_-|AWH68206.1|DBSCAN-SWA MFQKVDAYAGDPILTLMERFKEDPRSDKVNLSIGLYYNEDGIIPQLKAVADAEARLNAQPHGASLYLPMEGLNSYRHAIAPLLFGADHPVLQQQRVATIQTLGGSGALKVGADFLKRYFPESGVWVSDPTWENHVAIFAGAGFEVSTYPWYDEATNGVRFNDLLVTLKTLPARSIVLLHPCCHNPTGADLTNDQWDAVIEILKARELIPFLDIAYQGFGAGMEEDAYAIRAIASAGLPALVSNSFSKIFSLYGERVGGLSVLCEDAEAAGRVLGQLKATVRRNYSSPPNFGAQVVAAVLNDEALKASWLAEVEEMRTRILAMRQELVKVLSTEIPERNFDYLLNQRGMFSYTGLSAAQVDRLREEFGVYLIASGRMCVAGLNTANVQRVAKAFAAVM >CP029164|355346:412633|384174_384585_-|AWH68234.1|head,tail|DBSCAN-SWA MKIRQAQTSATYILPDPGELNKRVLIRQRVDMPADNFGVEPQYPVAFRAWAKVIQTSATTWQETAQTGDAITHYITIRYRRGITADYEVVCDDSVYRVKRQRDLNGARRFLLLECTELGEFTQSHGGSNGDSLFSR >CP029164|355346:412633|359839_360256_-|AWH68202.1|DBSCAN-SWA MWYQKTLTLSAKSRGFHLVTDEILNQLADMPRVNIGLLHLLLQHTSASLTLNENCDPTVRHDMERFFLRTVPDNGNYEHDYEGADDMPSHIKSSMLGTSLVLPVHKGRIQTGTWQGIWLGEHRIHGGSRRIIATLQGE >CP029164|355346:412633|408147_408429_+|AWH68267.1|DBSCAN-SWA MNTLFLLMAEFNTPNIELSAVSQKYFGMSPATAEAKANACKLPVPTYRIGTSQKAKRCINIQDLAEYIDKRREEGRIEWEQVRTSKQKGKEHH >CP029164|355346:412633|370443_370998_-|AWH68214.1|DBSCAN-SWA MLIGYVRVSTNDQNTDLQRNALNCAGCELIFEDKISGTKSERPGLKKLLRTLSAGDTLVVWKLDRLGRSMRHLVVLVEELRERGINFRSLTDSIDTSTPMGRFFFHVMGALAEMERELIVERTKAGLEAARAQGRIGGRRPKLSPEQWAQSGRLIASGVPRQKVAIIYDVGVSTLYKKFPVGDK >CP029164|355346:412633|398218_399160_-|AWH68254.1|DBSCAN-SWA MSTKLTGYVWDGCAASGMKLSSVAIMARLADFSNDEGVCWPSIETIARQIGAGMSTVRTAIARLEAEGWLTRKARRQGNRNASNVYQLNVAKLQAAAFSQLSDSDPSKSDASKSDPSKFDASKSGKKAGFHPSESGGDPSVKSKHDPSDKKPSRPDASQPDTQKAEQDFLTRYPDAVVFSPKKRQWGTQDDLTCAQWLWKKIIALYEQAAECDGEVVRPKEPNWTAWANEIRLMCVQDGRTHKQICEMYSRVSRDPFWCRNVLSPSKLREKWDELSLRLSPSVSTYTEKREDPYFKASYDNVDYSQIPAGFRG >CP029164|355346:412633|355346_355493_-|AWH68197.1|integrase|DBSCAN-SWA MPAKKNHASGEGSHQTIPLKRADALDSAALFTGGVVHSGYRGACRTQK >CP029164|355346:412633|380703_380886_-|AWH68227.1|DBSCAN-SWA MACGWFFPPGLTAEYLTDRFFDCASYWRINPFELLNMPISEIPLLVSQANRIEQEKRTHG >CP029164|355346:412633|361442_361985_+|AWH68205.1|DBSCAN-SWA MVTNFITPEGDDDMNISNVNSNNTTSLPVELDTLNNKGISYDKDFSYAKDLFLYIETQLKIAKDFCRPGEEVSSSIASTVFHAFIDLVNKIRGKKDCMYIFTLCCFAEEVKGDYSHYRTFLFDIGNQYKVKLTQSGKKEFSLTLEFNDTIIESQIVTGNKAKHILEDIEKFYRNKPDTYY >CP029164|355346:412633|384983_386213_-|AWH68236.1|capsid|DBSCAN-SWA MKLHELKQKRNTIATDMRALNEKIGDNAWTEEQRTEWNKAKSELEALDERIAREEELRRQDQAYIESNEEEQRQNLDPENNPQQDEKRAQVFDKWMRHGASELTSEERKALRELRAQGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQILTTSDGRTMEWATADGTSEVGVLLGENEEAGEEDTDFGMGSLGALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGRGEARYLIQGTGAGTPKQPKGLAASVTGTTQTAAANAVKWQEILALKHSIDPAYRRGPKFRLAFNDNTLKLISEMEDGQGRPLWLPDIVGVAPASVLNVPYVIDQEIDDIGAGKKFMFCGDFDRFIIRRVRYMILKRLVERYAEYDQTGFLAFHRFDCILEDTSAIKALVGKGSVGG >CP029164|355346:412633|378878_380711_-|AWH68226.1|tail|DBSCAN-SWA MAEFELKALITGVDRLSPALSKMQKKIRGFKRQAEEASQGGLALGGGLAAGLTLSLKSYADQENAATGLKVAMMDANGEVGKSFQDINKLAIGLGNQLPGTTADFQNMMQMLVRQGIPAENILGGVGKATAYLAVQLKKTPEAAAEFAAKMQDATGTASEDMMGLFDTIQKAFYLGVDDTNMLSFFTKTSSVLKMVNKDGLQAAQSLAPISVMMDQMGMNGESAGNALRKVIQSGLSVKKIRDVNKVMARQKLGVQLDFTDGKGSFGGLDNMFRQLAKLRKLTDVKRTGVLKAIFGDDAETLQVVNALIDKGKDGYDQIQQKMNKQASLNKRVQAQLGTLSNLWEAMTGTATNGLAAIGGAFSGDAKNITQWLGELGEKFTKFADENPRVIRGVVGLAAGLAILKLGLMGVGSAISIVSRIMSMTPIGMIATAIALAAGLIITNWDVVGPYFKKLWETIGPYFEAGWELLKKVFAWSPLGMVINNWGPVVKWFQDMWDKLKPIIEWFTDSSGDTVDAINSAQWGAGAYDAYGTGIPARGYTPYPAVDPAQSNNASDATGSNPFMINKASVPKVDGEIKVSFVNSPPGMRVMETRSSGFDVSHDVGYTRFR >CP029164|355346:412633|386818_388045_-|AWH68238.1|portal|DBSCAN-SWA MLLDALFRSKSLENPSTPITGDAVDTDGLFRADVYVSPETAMKLAAVYSCIYVLSSSLAQMPLHVMRRHKGKVEPARDHPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKRNRRGEVISLDCCMPWETTLMNTGGRYTYGLYNEYGAFAISPDDMIHIRALGNNQKMGLSPIMQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKSGLNKESWGWLKDQWQKASQALRRQENKTMLLPADLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKATFSNISAQAIQFVRYTMMPWVTNWEQELNRRLFTRAELAAGYYVRFNLTGLLRGTPQERAQFYHFAITDGWMSRNEARAFEDMNPVEGLDEMLVSVNAANPAGDFKPPKNDEGKTNE >CP029164|355346:412633|366282_367266_+|AWH68209.1|DBSCAN-SWA MATRIEFHKHGGPEVLQAVEFTPADPAENEIQVENKAIGINFIDTYIRSGLYPPPSLPSGLGTEAAGIVSKVGSGVKHIKAGDRVVYAQSALGAYSSVHNINADKAAILPAAISFEQAAASFLKGLTVYYLLRKTYEIKPDEQFLFHAAAGGVGLIACQWAKALGAKLIGTVGTAQKAQSALKAGAWQVINYREENLVERLKEITGGKKVRVVYDSVGRDTWERSLDCLQRRGLMVSFGNSSGAVTGVNLGILNQKGSLYVTRPSLQGYITTREELTEASNELFSLIASGVIKVDVAEQQKYPLKDAQRAHEILESRATQGSSLLIP >CP029164|355346:412633|388192_389926_-|AWH68239.1|terminase|DBSCAN-SWA MSRKSYPNVNAANQYARDVVRGKIVACQFVIQACQRHLDDLMAEKSKSFRYRFDKDLAERAAKFIQLLPHTKGEWAFKRMPITLEPWQLFVICCAFGWVNKGSRLRRFREVYTEIPRKNGKSAISAGVALYCFACDNEFGAEVYSGATTEKQAWEVFRPARLMCKRTPMLTEAFGIEVNASNMNRPEDGARFEPLIGNPGDGSSPHCAVVDEYHEHATDALYTTMLTGMGARRQPLMWAITTAGYNIEGPCYDKRREVIEMLNGSVPNDELFGIIYTVDEGDDWTDPQVLEKANPNIGVSVYREFLLSQQQRAKNNARLANVFKTKHLNIWVSARSAYFNLVSWQSCEDKSLTLEQFEGQPCILAFDLARKLDMNSMARLYTREIDGKTHYYSVAPRFWVPYDTVYSVEKNEDRRTAERFQKWVEMGVLTVTDGAEVDYRYILEEAKAANKISPVSESPIDPFGATGLSHDLADEDLNPVTIVQNFANMSDPMKELEAAIESGRFHHDGNPIMTWCIGNVVGKNMPGNDDLVKPVKEQAENKIDGAVALIMTIGRAMLKEPDDFLSSLDPDDDLLIL >CP029164|355346:412633|370091_370340_+|AWH68213.1|DBSCAN-SWA MRIEICIAKEKMTKMPTGAVDALKEELTRRISKRYDDVEVIVKATSNDGLSVTRTADKDSAKTFVQETLKDTWESADEWFVH >CP029164|355346:412633|405342_405864_+|AWH68263.1|DBSCAN-SWA MRMNVFEMEGFLRGKCVPRDLKVNETNAEYLVRKFDALEAKCETLAAENARLNKFIVQSCYVFNGEQDEISDAYICATDGGMPQIPATDAFLAEIRAAARNEGINYTASRLAAAFNHGFINKSLREVFDVTRMILSAKEELANEPHPIDGLSGEYAEKSLEEWAEQLRKGVIQ >CP029164|355346:412633|360366_361080_-|AWH68203.1|DBSCAN-SWA MRKITQAISAVCLLFALNSSAVALASSPSPLNPGTNVARLAEQAPIHWVSVAQIENSLAGRPPMAVGFDIDDTVLFSSPGFWRGKKNFSPESEDYLKNPVFWEKMNNGWDEFSIPKEVARQLIDMHVRRGDAIFFVTGRSPTKTETVSKTLADNFHIPATNMNPVIFAGDKPGQNTKSQWLQDKNIRIFYGDSDNDITAARDVGARGIRILRASNSTYKPLPQAGAFGEEVIVNSEY >CP029164|355346:412633|356622_359445_+|AWH68200.1|DBSCAN-SWA MDKIEVRGARTHNLKNINLVIPRDKLIVVTGLSGSGKSSLAFDTLYAEGQRRYVESLSAYARQFLSLMEKPDVDHIEGLSPAISIEQKSTSHNPRSTVGTITEIHDYLRLLFARVGEPRCPDHDVPLAAQTVSQMVDNVLSQPEGKRLMLLAPIIKERKGEHTKTLENLASQGYIRARIDGEVCDLSDPPKLELQKKHTIEVVVDRFKVRDDLTQRLAESFETALELSGGTAVVADMDDPKAEELLFSANFACPICGYSMRELEPRLFSFNNPAGACPTCDGLGVQQYFDPDRVIQNPELSLAGGAIRGWDRRNFYYFQMLKSLADHYKFDVEAPWGSLSANVHKVVLYGSGKENIEFKYMNDRGDTSIRRHPFEGVLHNMERRYKETESSAVREELAKFISNRPCASCEGTRLRREARHVYVENTPLPAISDMSIGHAMEFFNNLKLAGQRAKIAEKILKEIGDRLKFLVNVGLNYLTLSRSAETLSGGEAQRIRLASQIGAGLVGVMYVLDEPSIGLHQRDNERLLGTLIHLRDLGNTVIVVEHDEDAIRAADHVIDIGPGAGVHGGEVVAEGPLEAIMAVPESLTGQYMSGKRKIEVPKKRVPANPEKVLKLTGARGNNLKDVTLTLPVGLFTCITGVSGSGKSTLINDTLFPIAQRQLNGATIAEPAPYRDIQGLEHFDKVIDIDQSPIGRTPRSNPATYTGVFTPVRELFAGVPESRARGYTPGRFSFNVRGGRCEACQGDGVIKVEMHFLPDIYVPCDQCKGKRYNRETLEIKYKGKTIHEVLDMTIEEAREFFDAVPALARKLQTLMDVGLTYIRLGQSATTLSGGEAQRVKLARELSKRGTGQTLYILDEPTTGLHFADIQQLLDVLHKLRDQGNTIVVIEHNLDVIKTADWIVDLGPEGGSGGGEILVSGTPETVAECEASHTARFLKPML >CP029164|355346:412633|397727_398222_-|AWH68253.1|DBSCAN-SWA MIMSLLNDVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTRRHFPRLTERAQEPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARREQCLRKSSKRAVSGDEWYLSGNYVGA >CP029164|355346:412633|375241_375790_-|AWH68222.1|plate|DBSCAN-SWA MSTIEAMQRQLLGLIGRAVVKSISAATKCQTVDVSLIAGEPKAGVEHLEPYGFTARANSGAEAVVLFPDGDRSHAVVVTVSDRRYRLKGLQTGEVAVYDDQGQSVTLTREGIVVDGAGKTITFRNSPKARFEMDLEVTGQVKDLCDSGGTTMSAMRLAYNGHRHRENGQGSNTDKPDKAMEA >CP029164|355346:412633|400117_400381_-|AWH68257.1|DBSCAN-SWA MSSQNYTEKAVKAAGKSLSEVARRFGFKSTQSVANWVINNQVPSERVLQLCELGNWSVTPHELRPDIYPNPNDGLPECYSKVSGSTA >CP029164|355346:412633|373771_374830_-|AWH68220.1|plate|DBSCAN-SWA MADSEFQRPTLAENISMLRNDLFARLDVSDTLRRMDEDVRAKVYAAALHTVYGYIDYLAMNMLPDLCDESWLARHAAMKRCPRKGATAASGYMRWEGVSDGLKVTAGSVIQRDDLVQYTATADATSSGGVLRVPIACSSAGAVGNADDGTALILVTPVNGLPSSGVADTLTGGFDTEELETWRARVIERYYWTPQGGADGDYVVWAKEVPGITRAWTYRHLMGTGTVGVMIASSDLINPIPEESTETAARQHIGPLAPVAGSDLYVFRPVAHTVDFHIRVTPDTPEIRAAITAELRSFLLRDGYPQGELKVSRISEAISGANGEYSHQLLAPADNISIAKNELAVLGTISWT >CP029164|355346:412633|394539_395529_-|AWH68248.1|DBSCAN-SWA MRALLTPEIAPRMGIVLFRPGSELMPLFMQGRVLLEPEPERYSSFASGAVPAASQPLADDPAVRAVFRNEAVIRRAGGVECLESWLLREKGCQWPHSDWHSENMTTMRHAPGAIRLCWHCDNQLRDQFTERLESMATDNCARWVLSVVRRDLGFDDSHVVTMPELCWWLIRNDLADALPESAARKALRLPKPVVPSVTRESDLVPSVPATSIIQDKAKKVLALKVDPESPESFMLRPKRRRWVNEKYTRWVKTQPCACCGKPADDPHHLIGHGQGGMGTKAHDLFVLPLCRKHHDELHADTVAFEEKYGSQLELIFRFIDRALAIGVLA >CP029164|355346:412633|374816_375245_-|AWH68221.1|tail|DBSCAN-SWA MMELWLTVNGKRTCASAPLDPLTRAVVISLFTWRRAEPDDNADVPMGWWGDTWPAVQNDRYGSRLWLLQRSKLTNQLVQTVRGYIRECLQWMIDDGVVSRIDLDIRRTGINELGNSITLWRRDGPVMISFDDLWSAITHGGQ >CP029164|355346:412633|367807_368845_-|AWH68211.1|tRNA|DBSCAN-SWA MHDNHETQKINQTSVMPEKTGVYWNSRFSIAPMLDWTDRHCRYFLRLLSRNTLLYTEMVTTGAIIHGKGDYLAYSEEEHPVALQLGGSDPAALAQCAKLAEARGYDEINLNVGCPSDRVQNGMFGACLMGNAQLVADCVKAMRDVVSIPVTVKTRIGIDDQDSYEFLCDFINTVSGKGECEMFIIHARKAWLSGLSPKENREIPPLDYPRVYQLKRDFPHLTMSINGGIKSLEEAKAHLQHMDGVMVGREAYQNPGILAAVDREIFGSSDIDADPVAVVRAMYPYIERELSQGTYLGHITRHMLGLFQGIPGARQWRRYLSENAHKAGADINVLEHALKLVADKR >CP029164|355346:412633|382957_383128_-|AWH68231.1|DBSCAN-SWA MFVKPVKGRSVPDPARGDLLPAEGRNVDENNYWLRREAAGDIRRVNKKVNTDDDKL >CP029164|355346:412633|371545_372148_+|AWH68216.1|tail|DBSCAN-SWA MDKAVLNSELIATKAGNITVYNYDGETREYISTSNEYLAVGVGIPAYSCLDAPGTYKAGYAICRSVDLNSWEYMPDHRGEIIYSTETGEAKEITVPGDYPENTTTIVPLTPYDKWDGEKWVTDSEAQHGAAVEAAEAQRQSLIDAAMASISLIQLKLQAARKLTQAETTRLNAVLDYIDAVTATDTSTAPDVIWPELPEA >CP029164|355346:412633|363652_364732_-|AWH68207.1|DBSCAN-SWA MQAATVVINRRALRHNLQRLRELAPASKMVAVVKANAYGHGLLETARTLPDADAFGVARLEEALRLRAGGITKPVLLLEGFFDARDLPTISAQHFHTAVHNEEQLAALEEASLDEPVTVWMKLDTGMHRLGVRPEQAEAFYHRLTQCKNVRQPVNIVSHFARADEPKCGATEKQLAIFNTFCEGKPGQRSIAASGGILLWPQSHFDWVRPGIILYGVSPLEDRSTGADFGCQPVMSLTSSLIAVREHKAGEPVGYGGTWVSERDTRLGVVAMGYGDGYPRAAPSGTPVLVNGREVPIVGRVAMDMICVDLGPQAQDKAGDPVILWGEGLPVERIAEMTKVSAYELITRLTSRVAMKYVD >CP029164|355346:412633|361126_361306_+|AWH68204.1|DBSCAN-SWA MLQKPSDHNASSRGKEGLSAVFVMKSRMIIFNPVNTFLSFFVKKMFITRCKNTLILRLL >CP029164|355346:412633|383693_384200_-|AWH68233.1|DBSCAN-SWA MATPFFHVDVQQPAEMRFNRARVRRAFVTIGQRHMRDARRLVMRRARSAPGENPGYQTGRLARSIGYMVPRASKKRAGFMTRIAPNQRNGKGNRMISGDFYPAFLFFGVRGGAKRRRSHHRGASGGSGWRLAPRNNFMVETLEKNRSWTRYFLARELRKSLKPERRHR >CP029164|355346:412633|406618_407539_+|AWH68265.1|DBSCAN-SWA MTTITKERLLTIRQWRETYGPGSNVVLPAEEAEELARIALASLEAEPIGFRCRRNDNLGDWSYVYHREPDDFERKHLVIEGIYAAPPAPVVPEEATPENVEMLSGYVSTYKLTDSERDIAAEIWNACRAAMLQSGNFRENKNSSTNNFREIAETSTNYPAIPSEVLSAILKVARIRADFDDFDGDRRGIGDCLDEAEQELIVTINKYASQLAAEPIAPNDVREQTAIPQVPVTPDGWISCSERMPEKNQNVLISVNFDSSLVEPLICSARYTGSTFRRGDATIKPGNGIEQATHWMPLPEPPQEVK >CP029164|355346:412633|390550_390901_-|AWH68241.1|DBSCAN-SWA MPPRTPKACRFRGCRNTTTDPSGYCESHKSEGWKQYKPGQSRHQRGYGSKWDSIRAGVLKRDKGLCQLCLRAGVVREAKTVDHIIPKAHGGTDADCNLQSLCWPCHKAKTARERLK >CP029164|355346:412633|395536_396346_-|AWH68249.1|DBSCAN-SWA MNNLMVIDGIEVRRDAYGRYSLNDLHRAAVASGANARTKEPGKFLSSQQTVELVHELTNTQNLGVDPVSVIHGGNERGTYVCKELVYAYAMWISPSFHLKVIRTFDMVTSAPEKLSGQAADKMQAGVILLDFMRRELNLSNSSVLGACQKLQEAVGLPNLAPRYAIDAPADAHDGSSRPTLSLSALLKQYGIRLTANQAYHQMVKLGIVEQRERYSRTGINNIKKFWSLTAKGCMFGKNITSPANPRETQPHFFESRFPELLKLLDTVH >CP029164|355346:412633|399149_399329_-|AWH68255.1|DBSCAN-SWA MREVNRWFKDHYGVPVRVIRWEPETQLVIYLREGYEHECFSPLEQFRRKFREIEVGHEH >CP029164|355346:412633|364784_366200_-|AWH68208.1|DBSCAN-SWA MAGNKPFNKQQAEPRERDPQVAGLKVPPHSIEAEQSVLGGLMLDNERWDDVAERVVADDFYTRPHRHIFTEMARLQESGSPIDLITLAESLERQGQLDSVGGFAYLAELSKNTPSAANISAYADIVRERAVVREMISVANEIAEAGFDPQGRTSEDLLDLAESRVFKIAESRANKDEGPKNIADVLDATVARIEQLFQQPHDGVTGVNTGYDDLNKKTAGLQPSDLIIVAARPSMGKTTFAMNLVENAAMLQDKPVLIFSLEMPSEQIMMRSLASLSRVDQTKIRTGQLDDEDWARISGTMGILLEKRNIYIDDSSGLTPTEVRSRARRIAREHGGIGLIMIDYLQLMRVPALSDNRTLEIAEISRSLKALAKELNVPVVALSQLNRSLEQRADKRPVNSDLRESGSIEQDADLIMFIYRDEVYHENSDLKGIAEIIIGKQRNGPIGTVRLTFNGQWSRFDNYAGPQYDDE >CP029164|355346:412633|371027_371546_+|AWH68215.1|tail|DBSCAN-SWA MPFARYFCIFINVGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFDKVKYPHLATAYPSGKLPDLRGEFIRGWDDGRGIDAGRALLSIQTGMLEKHRHIVVANDRYDSKEEWELATIFRRAYTQGRGLDAADAGGTLIPSPTLHTRGSIGNTGGSETRPRNIAFNYIVRAA >CP029164|355346:412633|392899_393286_-|AWH68246.1|DBSCAN-SWA MSDPISGTGLAGGALTGASVYGLLTGTDYGVVFGAFAGAVFYIATAADLSASRRLAYFIVSYIAGILCSGLVGSKLANLTGYSDKPLDAIGAVIVSALAVKILTFLNNQDIGSLVALITRRGGSGGAK >CP029164|355346:412633|392631_392913_-|AWH68245.1|holin|DBSCAN-SWA MELNDPTATINALLCACVVITLMFYRRGDSRHRPWVSRLAWLITVTYSAVPLAYLCGIYPHSSWPIIVANTIFLSVLVAVRGNVARLVDHLRH >CP029164|355346:412633|403862_404687_+|AWH68261.1|DBSCAN-SWA MSQNLDATAINQIHALISAQGVNEIISKIGADAVALPENFRIHDLEKFNLNRFRFRGALSTASIDDFTRYSKDLADEGTRCFIDADNMRAVSVLNLGTIDEPGHADNTATLKLKKTAPFSALLSVNSERNSQKSLAEWIEDWADYLVGFDANGDAIQATKAAAAIRKITIEANQTADFEDNDFSGKRSLMESVEAKTKDIMPVAFEFKCVPFEGLKERPFKLRLSIITGDRPVLVLRIIQLEAVQEEMANEFRDLLVEKFKDSKVETFIGTFTA >CP029164|355346:412633|408846_409380_-|AWH68268.1|DBSCAN-SWA MLTLDEIGQSVRNNIQLIIDHVGLPLAVGPLSDDDYKILCGGYGELEWDYALSTYGNSREKYEFCIKLVQQGRVQGIPSGAAICVYGVEENIFRIHMIERFSREDESHPLKGRMVLLTLMSAFIFCKAVECKVVHIVEPVPELVQYYESFGFRMEQCGYVMSAVIDELQDIFLKFAQ >CP029164|355346:412633|355451_355733_+|AWH68198.1|DBSCAN-SWA MATLTTGVVLLRWQLLSAVMMFLASTLNIRFRRSDYVGLAVISSGLGVVSACWFAMGLLGITMADITAIWHNIESVMIEEMNQTPPQWPMILT >CP029164|355346:412633|397074_397728_-|AWH68252.1|DBSCAN-SWA MSNKYCQALVELRNKPAHELKEVGDQWRTPDNIFWGINTLFGPFVLDLFTDGDNAKCAAYYTAEDNALAHDWSERLVELKGAAFGNPPYSRASQHEGQYITGMRYIMKHASAMRDKGGRYVFLIKAATSEVWWPEDADHIAFIRGRIGFELPVWFIPKDEKQVPTGAFFAGAIAVFDKTWKGQAISYIGRDELEACGEAFLAQVRQQAEKLVREMAA >CP029164|355346:412633|396751_397078_-|AWH68251.1|DBSCAN-SWA MTTLTQCQQQVLDMLISYQKERGFPPTNQEVATMLGYRSVNAAVEHLRALEKKGVITIKRGVARGITLHTAVKDDDSEAVGIIRSLLAGEENARLRAAHWLHERGLKV >CP029164|355346:412633|355831_356368_-|AWH68199.1|DBSCAN-SWA MASRGVNKVILVGNLGQDPEVRYMPNGGAVANITLATSESWRDKATGEMKEQTEWHRVVLFGKLAEVASEYLRKGSQVYIEGQLRTRKWTDQSGQDRYTTEVVVNVGGTMQMLGGRQGGGAPAGGNIGGGQPQGGWGQPQQPQGGNQFSGGAQSRPQQSAPAAPSNEPPMDFDDDIPF >CP029164|355346:412633|359479_359836_-|AWH68201.1|DBSCAN-SWA MTISELLQYCMAKAGAEQSVHSDWKATQIKVEDVLFAMVKEVENRPAVSLKTSPELAELLRQQHSDVRPSRHLNKAHWSTVYLDGSLPDSQIYYLVDASYQQAVNLLPEEKRKLLVQL >CP029164|355346:412633|393365_393623_-|AWH72414.1|DBSCAN-SWA MQGEKQQPYFFNPGMTVEQLEDWLEQQKLHLSRYNRLVKEKAELEERLSDISVEIERMSAGGFNGKLSFPWESSSLLRNHQQGSV >CP029164|355346:412633|403434_403797_+|AWH68260.1|DBSCAN-SWA MASERSTDVQAFIGELDGGVFETKIGAVLSEVASGVMNTKTKGKVSLNLEIEPFDENRVKIKHKLSYVRPTNRGKISEEDTTETPMYVNRGGRLTILQEDQGQLLTLAGEPDGKLRAAGH >CP029164|355346:412633|407538_408111_+|AWH68266.1|DBSCAN-SWA MNNLMIDLETMGKNKDAPIVSIGAVFFTPETGDIGQEFYTVVSLESAMGQGATPDGDTILWWLKQSPEARAAICIDDTLSISDALSELNHFINRHADNTKYLKVWGNGATFDNVILRGAYERAGQICPWAYWNDHDVRTIVTLGRSIGFDPKMDMPFDGERHNALADARHQAKYVSAIWQKLIPATSTEL >CP029164|355346:412633|381477_382974_-|AWH68230.1|tail|DBSCAN-SWA MTISFNTIPSNTLVPIFYAEMDNSAANTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGAGSQLARMVEAYRQTDPFGELYVIAVPEATGAAATVTLTVTGAATETGTVNVYVGRTRVQAPVTNGDNVATIASSIKDAINAVPTLPFTASSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIGLPFNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKIGTLSELVTAGDQFNQQHITLAGYEKETQTPADELAASRTARAAVFIRNDPARPTQTGELVGMLPAPKGKRFTMTEQQTLLSYGVATAYVESGVLRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVVERDASDPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA >CP029164|355346:412633|399504_400089_-|AWH68256.1|DBSCAN-SWA MGKEPEWKVDKQPAWLVAAIRRTIADLPHGYEEAAEILGLYKSDDITPAKDQLHNRLRSGGDQIFPLEWAMVLQDASGTRHVTDAIARRSNGVFVPLVDIDDIDNGDINQRLMESIEWIGKHSQYLRKATADGVIDRAEREQIEENSYQVMAKWQEHLTLLFRVFCAPEKSNARECAAPGVVASIASGCGETNA >CP029164|355346:412633|396365_396755_-|AWH68250.1|DBSCAN-SWA MKLILPFPPSVNTYWRHPNKGAFAGKSLISAAGRKFQSAACAAIVEQLRRLPKPTSAPASVEIVLFPPDNRIRDLDNYNKALFDALTHAGVWEDDSQVKRMLVEWGPVIPEGKVEITISKYEKTAGAAA >CP029164|355346:412633|372542_373193_-|AWH68218.1|DBSCAN-SWA MHRIDTKTAQKDKFGAGKNGFTRGNPQTGTPATDLDDDYFDMLQEELCSVVEASGASLEKARHDQLLTALRALLLSRKNPFGDIKSDGTVKTALENLGLGEAAKRDVGTGENQIPDMSAWKRNPSSNRWRKLPDGTIIQMGISASGPLGSPVNITLPISFSNTNYCVVASYDNARLGVSTMVSFAALPVSPSQFSLMSSVTEQGINPFAYWIAFGD >CP029164|355346:412633|378284_378797_-|AWH68225.1|DBSCAN-SWA MIEGMIMRIFVFFISALLSFNLAAEECKFSFNESELISSIGIAPVKQEIIKDEGITKRQYEFRRELSSEEMLSDDADEKYEPQFYISVYNPSCPQKVIVWFFKDNKNTMDLSNEVLAGRAFKYLTGVNESIFENKMKKFLKVQSFESFDERTDSKFIKSGDIYSIDVQLR >CP029164|355346:412633|410626_412006_-|AWH68271.1|DBSCAN-SWA MPPGIAVCFSSLFIRLVCMAFLTSSDKALWHLALPMIFSNITVPLLGLVDTAVIGHLDSPVYLGGVAVGATATSFLFMLLLFLRMSTTGLTAQAYGAKNPQALARALVQPLLLALGAGALIALLRTPIIDLALHIVGGSEAVLEQARRFLEIRWLSAPASLANLVLLGWLLGVQYARAPVILLVVGNILNIVLDVWLVMGLHMNVQGAALATVIAEYATLLIGLLMVRKILKLRGISGEMLKTAWRGNFRRLLALNRDIMLRSLLLQLCFGAITVLGARLGSDIIAVNAVLMTLLTFTAYALDGFAYAVEAHSGQAYGARDGSQLLDVWRAACRQSGIVALLFSVVYLLAGEHIIALLTSLTQIQQLADRYLIWQVILPLVGVWCYLLDGMFIGATRAAEMRNSMAVAAAGFALTLLTLPWLGNHGLWLALTVFLALRGLSLAAIWRRHWRNDTWFVAT >CP029164|355346:412633|410301_410511_-|AWH68270.1|DBSCAN-SWA MNKDEAGGNWKQFKGKVKEQWGKLTDDDMTIIEGKRDQLVGKIQERYGYQKDQAEKEVDSWEKRHDYRW >CP029164|355346:412633|404815_405352_+|AWH68262.1|DBSCAN-SWA MSFIKTFSGKHFYYDKINKDDIVINDIAVSLSNICRFAGHLSHFYSVAQHAVLCSQLVPQEFAFEALMHDATEAYCQDIPAPLKRLLPDYKRMEEKIDAVIREKYGLPPVMSTPVKYADLIMLATERRDLGLDDGSFWPVLEGIPATEMFKVIPLSPGHAYGMFMERFNELSELRKCA >CP029164|355346:412633|400487_401192_+|AWH68258.1|DBSCAN-SWA MVEEQKYPDFAKRLNELMTIKGISVTQLKSLVGVTYEMARRYTIGAAKPRVSVMSKLALALGVSASYLEYGVGDREECKEMASIPNPTKPDVYRIEVLDLSVSAGPGTYMLSDYVDVLYAIEFTTEHARSLFGNRSQNDIKVMTVNGDSMSPTLVSGDRLFVDISVRHFQTDGVYSFVYGKTFHVKRLQMQGNKLAVLSDNPAYEKWYIDEKSQDQLYVMGKALIHESIKYNRL >CP029164|355346:412633|383136_383697_-|AWH68232.1|DBSCAN-SWA MKLTPVIAALRARCPYFEKRVAGAAQFKNLPEVGKLRLPAAYVVPGDDSPGENKSQTDYWQELKEGFSVVVILSNGRDERGQFASYDVVDDVRQMLFKALLGWNPEACGNPITYDGGTLLDLNRHELIYQFDFSVISELTEDDTRQQDDLNSLDELRTLAIDVDYLDPGNGPDGDIEHHTEITLPS >CP029164|355346:412633|372119_372539_-|AWH68217.1|tail|DBSCAN-SWA MDRYFYSQKENGFFTDLNKAPSDAVEITTDEWLSLLDGQDNGMKIVSNQEGYPVLTEQPPLSKENLIALAELKKGKLINEANEHMNSRQWPGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPAVQAR >CP029164|355346:412633|393773_394526_-|AWH68247.1|DBSCAN-SWA MNLEALPKYYSPKSPKLSDDAPATGSGGLTITDVMAAQGMVQSKAPLGFALFLAKVGVQDPQFAIEGLLNYAMALDNPTLNKLSEETRLQLIPYLVNFAFADYSRSAASKARCEHCAGTGFHNVLREVVKHSRSGESVIKEEWVKELCQHCHGKGEVSTACRGCKGKGIVLDEKRTRLHGTPVYKICGRCNGNRFSRLPTTLARHHVQKLVPDLTDYQWYKGYADVIDKLVTKCWQEEAYAEAQLRKVTR >CP029164|355346:412633|381121_381478_-|AWH68229.1|tail|DBSCAN-SWA MARIGGTCYFKIDGQQLSLTGGIEVPMNRTVNDDIIGLDGSVDRKETHRAPYVKGTFKVPKNFPVNKITSSDEMTITAELANGQVYVLSSAWLHGEANHNAEEGTVDLEFHGEEGDYQ >CP029164|355346:412633|386223_386826_-|AWH68237.1|head,protease|DBSCAN-SWA MNDREIRCYSGEVRAERHDDNPAHIIGYGSVFDCRSELIFGSFREIIRPGAFDDVLGDDVRALFNHDPNFILGRSAAGTLNLSVDERGLRYDIQAPETQTIRDLVLAPMQRGDINQSSFAFRVARDGEEWYQDEDGVVIREITRFSRLLDVSPVTYPAYQEADSAVRSMKAWQEARNSGALQKAINQRMARERVLTLLNA >CP029164|355346:412633|409744_410260_+|AWH68269.1|DBSCAN-SWA MEKTTTQELLAQAEKICAQRNVRLTPQRLEVLRLMSLQDGAISAYDLLDLLREAEPQAKPPTVYRALEFLLEQGFVHKVESTNSYVLCHLFDQPTHTSAMFICDRCGAVKEECAEGVEDIMHTLAAKMGFALRHNVIEAHGLCSACVEVEACRHPEQCHHDHSIQVKKKPR >CP029164|355346:412633|392017_392632_-|AWH68244.1|DBSCAN-SWA MNQQLFQKAAGISAGLAARWFPHIDAAMKEFGITAPADQAMFIAQVGHESMGFRALVENFNYTPSALVATFGKRITQQQADALGRTSGHAARQDAIANLVYSNRLGNKAPGDGWKYRGRGLIQITGLHNYRICGAALKLDLVTSPEQLEQELQAARSAAWFYTSKGCMVYGADITRVTRIINGGLNGIEDRKIRYNKAREALLV >CP029164|355346:412633|389939_390434_-|AWH68240.1|terminase|DBSCAN-SWA MPHMAGTAGRSGRRPKPTARKALAGNPGKRALNKDEPVFTPIKGVEPPEWFAEENLPLATIMWQLTTKELCGQGLLCVTDLAVLERWCVAYEFWRRAVKNIASQGNTITGAMGGRVKNPELTAKKEQESEMSSTGAMLGLDPSSRQRLIGLAGKKKATNPFLTI >CP029164|355346:412633|406344_406608_+|AWH68264.1|DBSCAN-SWA MATLTKKERAWLNELQDVLDRCPSPKKIGFYTIGDKSIYLYDLRRMDEIMEALDNRSSMDWCVAVHDMNAGFDEKILFPSSVESTAG >CP029164|355346:412633|384581_384905_-|AWH68235.1|head,tail|DBSCAN-SWA MLLKMEEIKLQLRLDDDFSDEDELLELLGKAAQSRTENFLNRTLYATADDRPADDSDGLVISDDVKLALLLLVSHFYENRSTVTDVEKMELPMSFNWLVAPYRLIPL >CP029164|355346:412633|380852_381122_-|AWH68228.1|tail|DBSCAN-SWA MKELELKKPITAHGETLSVLEFDEPTGKDVRELGYPYQMNQDESVKLLAHVVSKYIVRLAKVPQSSVDQMSPADLNAAAWLVAGFFLQA >CP029164|355346:412633|412024_412633_-|AWH68272.1|DBSCAN-SWA MKALTARQQEVFDLIRDHISQTGMPPTRAEIAQRLGFRSPNAAEEHLKALARKGVIEIVSGASRGIRLLQEEEEGLPLVGRVAAGEPLLAQQHIEGHYQVDPSLFKPNADFLLRVSGMSMKDIGIMDGDLLAVHKTQDVRNGQVVVARIDDEVTVKRLKKQGNKVELLPENSEFKPIVVDLRQQSFTIEGLAVGVIRNGDWL >CP029164|355346:412633|368932_370030_+|AWH68212.1|integrase|DBSCAN-SWA MAYYNIEKRLKSDGTPRYRCNVIIKEKGVITYRESKTFPKHAHAKTWGTQKVMELDLYGIPSSNAVDGLTVRDLLHKYLNDPNAGGKAGRTKRYVLELLMDSDISAIKLSELTENDVIEHCRLRNNAGAGPATVSHDVSYLGSVLDAAKPVYGINYTSNPAKSARPYLLKLGLIGKSNRRNRRPASDELDMLIEGLQQRSTHKCSKIPFVDILKFSVWSCMRIGEVCRLRWEDLDQEQKSILVRDRKDPRKKEGNHMKVALLGEAWDIVQRQPQKSEFIFPYNSTSVTAGFQRVRSKLGIKDLRYHDLRREGASRLFEAGFSIEEVAQVTGHRSLNVLWQVYTELYPKSLHNRFEELQRSRNKTP >CP029164|355346:412633|375789_376869_-|AWH68223.1|plate|DBSCAN-SWA MNDNVTLRVNGREWNGWTSVRIGAGIERLARDFSVEITRQWPGDEGITTLQPRIKNGSKVEVLIGDELVITGWVEATPVRYDARSVSTGIAGRSLTADLIDCAAEPTQFNGRSLVQIAQALAAPFGIEVVNNGAPSGVIPDVQPDHGETVIEVINKILGQQQALAYDDPHGRLVIGGIGSTRAHTALVLGENILSCDTEKSIRERFSVYQVAGQRAGNDDDFGEATTTALRARTEDAFIARYRPMYIRQTGQATGAGCIARADFEARQRAARTDETTYVVQGWRQGNGTLWQPNQRVIVFDPVCGFDNTELLVSEVTFTQDQNGTLTEIRVGPPDAYLPEPEAPGARKKKKARVQEDPF >CP029164|355346:412633|391559_392021_-|AWH68243.1|lysis|DBSCAN-SWA MKMSYWALILTFIACIAGGLVWSANHYHNKAIEYKKQRDENAMALDSAMATISDMQKRQRDVAELDARYTKELADANATIESLRADVSSGRKRLQVAATCAKSTTRASGMGDGESPGLTADAELNYYRLRSGIDRITAQVNYLQEYIRTQCLR >CP029164|355346:412633|401416_402502_+|AWH68259.1|DBSCAN-SWA MINERTEATDGVADMISTNTKYLVWNNKGGVGKTFLTYNLAVEFAISHPDQDVVVIDSCPQSNVSEIILGGNGTGEENLNKLRDRNVTIAGYIKERFSKSPLSRLGNESSYFVRAHDVNAKMPENLYILPGDVDLDICSRLISHIGSSPVKEAWKKSRSLLVDLIASFEADKNISDRAKTFFIDCNPSFASYTELGVVAANRIIIPCTADAASIRGIKNLVKLIYGVSIDKSEQDEMFLDFNKEAKQNLIELPELHLFVQNRSRTNESDAAKAFKSHAEEIKRITDDLLNTHPHLFTNVATFERVQNVKDGNTLAAIINHEGCPLSRLQHKSYTIYGMATQANRAQIEALESDVSTVVKCL >CP029164|355346:412633|373196_373781_-|AWH68219.1|DBSCAN-SWA MDVTNDDYIRLLSALLPPGPAWSASDPAIAGAAPSLTRVHQRADALMRELDPRTTTELINRWERLCGLPDECIPAGTQTLRQRQQRLDAKVNLAGGINEDFYLAQLAALGRPDATITRYDKGTFTCSSACTDAVNAPEWRYYWQVNMPAATNTTWMTCGDPCDSALRIWGDTVVECVLNKLCPSHTYVIFKYPE >CP029164|355346:412633|367431_367674_-|AWH68210.1|DBSCAN-SWA MLELLFVIGFFVMLMVTGVSLLGIIAALVVATAIMFLGGMLALMIKLLPWLLLAIAVVWVIKVIKAPKVPKYQRYDRWRY |
77 | Shigella_phage(36.67%) | portal,tRNA,plate,protease,terminase,holin,tail,integrase,lysis,head,capsid | attL 349726:349741|attR 384062:384077 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
934714 : 943640
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >CP029164|934714:943640|DBSCAN-SWA TATGCAGAAAAATGTGACTCCCGGCAGGCGAAAAGGCTGCCCTAATTATCCTCCCGAATTTAAACAGCTGCTCGTTGCTGCCTCCTGTGAACCCGGGATATCCATCTCAAAACTTGCTCTTGAAAATGGCATTAACGCCAATCTGTTGTTCAAATGGCGACAACAATGGCGCGAGGGAAAGCTGCTATTACCTTCTTCAGAGAGCCCCCAGCTACTTCCTGTGACTCTCGATGCAGCTGCCGAACAGCCAGAATCGCTCGCAGAGGACCCGGAAACCCTCAGTATCAGCTGTGAGGTAACGTTCCGGCACGGGACGCTCCGCTTCAATGGCAATGTCAGCGAAAAACTCCTGACTCTGCTGATACAGGAACTGAAACGATGATCCCGTTACCTTCCGGGACCAAAATTTGGCTGGTTGCCGGTATCACCGATATGAGAAATGGCTTCAACGGCCTGGCTGCGAAAGTACAGACGGCGCTGAAAGACGATCCCATATCCGGCCATGTTTTCATTTTCCGGGGCCGCAGCGGCAGTCAGGTTAAACTGCTGTGGTCCACCGGTGACGGGCTGTGCCTCCTGACCAAGCGGCTGGAGCGTGGGCGCTTCGCCTGGCCGTCAGCCCGTGATGGCAAAGTGTTCCTTACGCAGGCGCAGCTGGCGATGCTGCTGGAAGGTATCGACTGGCGACAGCCTAAGCGGCTGCTGACCTCCCTGACCATGCTGTAAATCTCTTTATCCTGGTTGTCACAGAATAAGCCCGGTAAAATACGGGCTTATGAACGACATCTCTTGTAAGCGTCAACGGAGCACCGTATTGACGCTTATTTATTGGTGAGTACTACGTTCCATGGCAGGAGTTCGTCAACACAGTTGGAGGGCCATTCCGGCAGTACGCTCAGAATATGGCGCAGATACGCTTCCGGATCGATACCGTTCAGACGGCAGGTGCCGATCAGCCCGTACAGCAGTGCTCCACGCTCGCCGCCGTGATCGCTACCGAAGAACACGTAATTTTTCTTTCCGAGACAGACTGCACGAAGCGCTCTTTCCGCTGTGTTATTGTCCGCTTCTGCCAAACCATCATCACTGTAATAACAGAGGGCGTCCCACTGATTCAGTACATAGCTGAACGCTTCGCCCAGTCTGGATTTTTTCGACAGCGTGCCATTCTTCTCCACCATCCATTCATGCAGCGACGTCAGTAACGCTTTGCTTCGCTGCTGCCTGGCTGCAAGACGCTCTGACTCTGGTAATCCCCGTATTTCATCCTCGATGGCGTACAGTTCACTGATTCGCTTCAGGGCTTCTTCTGCCGTCGCACTTTTGCTGCTGATGTATACATCGTGGATTTTTCGCCGGGCATGGGCCCAGCACGCAACTTCTGTCAGTGCACCACCTTCACGTTCTGCACTGAACAACCTGTCGTAACCTGTGAACGCATCCGCCTGCAGGATACCCCGGAAGGGGCGAAGGTGTTGCTCCGGGTGTTTCCCCTGCCGGTTCGGCGAGTACGCGAACCAGACCGCTGGAGGAGATGACGAACCCACATTGCGATCATCCCGGACATACGTCCAGATACGCCCTGTTTTCGCCTTTTTCTGACCCGGTGCCAGTACCTTTACCGGTGTGTCATCAGTGTGAACCTTGCGGGTATTCATTACATAACGGTACAGGGCATCATTCACCGGTGTCATTAACTGGCAGCACGCGTCAACCCAGTTGGAGAGTAAGGCCCGGCTCAGTTCGACACCCTGGCGGGCAAAGATTTCACTCTGACGATACAGTGGCAGATGTTCGCAGTATTTTCCCGTTAACACGCGGGCAAGTAATCCGGGGCCCGCGATACCACGCTCTATCGGGCGGGACGGCGCCGGTGCTTCAACAATACAGTCACATTTTGTACAGGCTTTTTTTACCCGTTCTGTGCGGATCACTTTCAGGGCACTGCTCACCAGTTCCAGCTGTTCAGCGCTGACTTCCCCCAGATAATCCAGCTCACCGCCACACTCCGGGCAACAGCTTTCTTCTGGCTCCAGGCGGTGTATTTCACGGGGAAGGTGTGCCGGTAACGGACGACGATGGCGCGACTGTCGCAACTGGCGGGGAACCTGAGGATCGTCTTCCCGCCCACTGTAACGATCGCTGTCCTGTTCACGTTGTTTCAGCAGAGCCTCAGCCAGTTCAACTTCACGACGCAGTTTTTCAGAACGGGTACCGAACAGCATCCGGTGCAGTTTTTCTATCTGAGCCCGCAGATGTTCTATTTCCCGTTCATCTTCTTCGATCTTTTCTTCGGCACGTGCCAGTGCAGAGCGCAGGAAGGCCTCCGTCTCTTCAACCAGACTCAGTTGCTGGTCTTTCTGACGGAGCTGGCTTTCCAGTTCTGCAATGCGAATGAGGTATTTCTGACTCATGGCCGTTTTTATAATGCGGCCAGGCGTTTTTTACAACATTGTCAGTGCGTTAAGGCGGAATGTTTTTGGCTGACGCCAGTCCAGCTTATCGAGGAGCATTGCCAGTTGCGAGCGGGTAATGGATACCTTGCCGTCACGTACCGCAGGCCAGATAAACTGGCCTTCCTCCAGGCGTTTGGTGAACAGGCACAGACCATCAGCATCAGCCCAAAGAATTTTAACGGTGTCACCCCGTCGGCCACGGAAGATAAACAGGTGACCGGAGAAGGGATTATCATTCAGCACATGTTGTACCTGTTCTCCCAGTCCGTTGAAGGATTTACGCATATCGGTAACGCCGGCAACGAGCCAGATACGGGTACCTGATGGGAGTGAGATCATCTTCCCCTCCCGGTCAGTTCACGGATCAACACCGTGAGCAGCTCTGGCGATGGATTTTCCAGCGTCATGTTACCGTGACGGAATTCCACCTTGCAGGAACTGGCACTGACTCTGGTCTGAGTGGAAGTGGATAAAGACGGCGCAATGGCCGCCACAGGTTCTTTCTGCTCATCCGGCGTTATTTCTACAGGTAATAATTCAACGCCAGTGTCAGAAGAGGTCGTTACCGGAAGACGCCGCGAAACACGCCCTTCGTTCTGCCAGAGCCTGAGCCATTTGAAAATAACATTATCATTGACGCCATTTTCACGTGCAATCTGTGCAACACAAGCTCCAGGTTGTGATGCCAGTTCCACCATACGAAGTTTGAATTCATTCGAATAGTTTTTACGAGGTTCTTTTCGCCAGTCCTGTAATTCCATACTTAGATGTCCGTCTATATCAGATGGGCGTCTAAGTTACCAATTCTCGTCTGATGGCTACATACGGCGGTCAGTTTACGCTTACCTTTATCCCGACAGACGACGTATATTTACCAGTAAGGCCAACCCGGTTGCGCCCTTTGAACCAGAAGTCAAACGTCAGGCTGTCATGGCACTCTGTACACGACAAGTATCAGCCAGTGAAATCGCCAGGCGTATTGGTGTCAGTCGTGCGGTATTGTATAAATGGAAAGATAAAATTATCGGCAACAGTGCTTACCAGACTATGCGTAAACATAACGAACCTTCCCTGGAGGCAGAACGCGATGCGTTGCGGGAGGAAGTCGCCCGACTGAATCAGGAAATACGCCGCCGGCAGATGGAGCTGGATATTCTGAAAAAGGCGGAGGAAATCATAAAAAAAGACCCGGGCATCAGTATCAGTCACCTGAACAACAGAGAGAAAACGAAGATCGCTGATGCCCTGAGACAAACATATCCCCTGACAGAATTACTGCATGTTCTGGGCCTTGCCCGTAGCAGTTATTTTTATCACCGGGCTGCACTGAAAGCCGGTGATAAATACGCCACGATACGTACGATGCTGACAGATATATTTAACAGTAATTACCAGTGTTATGGCTATCGTCGCCTGCATGCGATGCTCAGGCATGAGGGGGGGCGGCTATCAGAAAAGGTTGTACGCAGACTTATGGTGGAAGAACAGCTTGTCGTCAGCCGTAACCGTCGTCGCCGCTACAGCTCATATTGCGGAGAAATCGGACCGGCTCCGGATAACCTTATCGCCAGAGATTTTAAGGCGGAGCAACCTAATCAGAAATGGCTGACAGATATCACGGAGTTCCACCTCCCTGCAGGTAAAGTCTGGCTATCATCGGTGGTGGACTGCTTCGATGGAAAAGTTGTGAGCTGGTCTCTCAGTACACGCCCCGATGCTGAACTGGTCAACACTATGCTGGATAGCGCTGTCGAAACGTTAAATGCTGGCGAACGACCGGTGATACACAGTGACAGAGGTGGGCATTATCGCTGGTCAGGCTGGCTGGAAAGAGTGAATGCAGAAGGTCTTATTCGCTCAATGTCCCGCAAAGGATGTTCACCTGATAATGCGGCATGCGAAGGCTTTTTCGGCAGACTGAAAACGGAAATGTATCATGGGCGTAAATGGTCGGGCATCACGCCAGAAAAGTTCATGCAGCAAGTGGATACTTACATCAGATGGTATAACGAGCGGCGTATAAAATTATCGCCGGGTGCAGTCAGCCCCAAAATGTACCGCCAACAATGCGGGCTGGAATGATAAAGCAGTCCAGGAAATCGTCCGCATCCCCAGAACGGTCAAACATCGTGGCGTTGACAACGGGTTCCTGAGTCCGGTGAAGCGCCGGTTACTGGAAAATGTTGTGTACGTCATAACAGAAACGGAGAGAAAAACGGCGATGCAAATCAGGCGGAGAAGGGTGTATCAGCGGTTACTGAAGGTTGACCCGCTGAAGTGCATCCTGTGCGGAGGTCAGATGCGGTTTACGGGGCTGAAGCGGGGCTACCGTCTGGCAGAGCTGGTCATGATGCATGAGCCACTGGCACGACAGCGGGTATACAGCTGAGAGCCGCAGAGGGGAAGTTGCGTCCATTTTTTGAGGAATGGAGCAAAAAATCATCACAGATATAAAAAATCAATCAATGAAAGCCATTTAATTGCCGGTGCATTGTGGATGCAGTCCTCATGGTACGTAACCAGAGTTGAATAAACATCCATTTTTTTGTGTTTTTTTAACCTCCCCGGGTTTTTAATTCCTGTCGATTAAAAGTGTTAGATTATTTGGTTAAGAGTTCTCTCGTCCTATTTATTCAGGCGTTATCACGGATATGCAGATCGCTAAGTCAGTTGTAAGAAAACCAGACAGTGCACATCATTATACCTAGGTCTTCTCTGGAAAAATACTTAGGAATAATTTATGTAAAACAGGTTGTCGTCTTCATGATTTAAACGCAGTACATATATATACTCTATAATAAATACAGGGAGTATCACGAGACGTCTGGATAAGAGGAATAACCAGAATAAATTGGTAAGGAATAAAGACACTTATTTCTATTGAGAAAAAACATTCATCCTGATGTGATTAGTATCAATGAAATAATGTGCTTGTATGGTTGATAAAAAATGCATCACTAAAGAAAAAACAGTATGAATAAGAATATACGAATTTTACAATTTCTGGTCAGTATACTTTATTCTGTGCAGTCTCATTTTTCTGGTGCGCAAACAATACAACTGAATGGTAATGGTATACCTGAAAGTATAACCAGGAGTATTACAGGTGTTGATGGAAACGCGGCGCTTAATATCAGCGTGCCGTATAAAACAAGTTATACTCAAAATATACTATCTGTTGAAAGCAGTATTAACATCAAAGGAGGAACAAGCAATACATCAATCGGTGGTGCAGGTGTCTACGGTGAAAACTTTACGCTAAATAATAATGGTAGTGTTTGGGGAGGAGATGGATATAATGGTGGAATTGCTGTTAGTGGCAACAAAATATCTATAAACAATTACAGAAATGTATATGGGGGTAATGGTCTTGGTGGCTCAGGAAGTAGCGGAGGTGCAGGGTTAAGCGGGGATGATATTATAGTTGATAATTACAGAAGTATATACGGAGGTGATGATGTAGGTGGGACAGGTGGTTCCGGTGTAACCGGTAGCAATATTACAGTGCATAATTCCGGAGGAATATTGGGCGGTAATGGCGTAAACGGTGGTGATGGTATTAATGGTAGTAATCTTTTCATTACTAACGACAACATGATATCTGGAGGATATGGAATAAAACAAGGGGGAGATGCTATTTCTGGAAATCAAATCACTTTGAATAATAACGGTATTGTTCAGGGGGGATATGGCCCCGACGGTGGTTGCTCTGTTTATGGAGAAGATATCCATATTAATAATCATGGTAATCTTTCAGGATTATATAATAGCCAAAAAGATGCTTATAATACATCAATAATTTTTTCTGGCGGGTATAATTCATTAGATATTTATTCAGATTCTGTGATTAATGGTGATATTAAACTAGCTAGTATACCTGTTAATGGTACAAATGAATTAATTATTAAAAACATCAATAACGCAACAGCAATTAATGGTGGGCTAATGATTGGGAATGGCTCATCTGTTTATCTGTCAGGCAAGAACTCCATTTTTAACGGAAATATAAGTATTGATGAAGACGCATCTATGAACCTGTCTGTAGGAAATGCTAATGTTCACGCAAATACTATTACATTAAAAAGTGATTCATGGCTTAATATAGACACATCAATTAAGAACTGGACTCAGGACTATTACACATTATTGTCGTCAGACACAGGTATCTCGATCGCTGATAATAGTCACATTGTACAATACAATGTATTACTGACAGAAGGTGCTGAAAGTTATGTTTATACGTCTTTAAATGACGACGATAACAAACTGATATCCATGCTGAGATGGAATAATACAAAAGGGATGGGATATGGAACCTTTAATATAGAAAAAGATGCCACTCTGAACATAGGCGTTTCTCTTTCCGATAATCTTTCACCTTTATTATATGATGGCTGGGACGGCAAAAGTCTGACAAAATCAGGTAATGGTACTCTTATACTTTCTGCAACAAACAATTATACAGGAAATACAGAGGTTAAATCTGGCGTATTAATTCTTGCTGCACCTGATGCTCTTGGTCGGACTGAGTATTTATATTTATCCCGTGGCGCAGAACTGGATATGAATGGGTATCCTCAGACAATAAGCAAACTACTGACGGCTGCAGGCTCTGTGCTGAACATTCATGGCGGAAGTCTGATACTGAATAATGGAGGAGAATCTGCAGGTACTATTGCAGGGGATGGTTCTCTGAACATAAATGGGGGAATGCTTGATATAACGGGTAATAATCGTAATTTTTCCGGTGTTTTTACCGTGAATAAGGGGGCTCATTTGGCTGTATCCACGGCTGATAATCTGGGGACAGCCTTTGTTGATAACTATGGCACATTAACTCTGAACAGTACATCAGCATGGCAGCTTACCAACAATATCAGTGGTTATGGTAATGTTCGCAAGACGGGAGCAGGTGCACTGAACATTAGCGATAACGCAAAATGGACCGGGATGACAGATATTATTCAGGGGACAGTGATACTGGGGAACGCAGATTCACCGGTGATGCTCGGCAGTAACCAGGTCATTGTTGAAGAGCAGGGCAAACTCTCCGGGTTTGGGGGCGTTGCAGGAAATCTGAGCAATAGTGGTATAGTCGATCTCACTACATATATGCCGGGTAATATACTGACTGTCGGAGGGAATTACACTGGCAGAAATGGACTTATTCTCCTCCAGACAGAAACAGGTGGTGACAATTCGAAAACAGATCGTCTGGTGATTAAAGGTAATGCCAGTGGCCGTACCCGTGTCGCTGTTACTCAGGCCGGTGGTACTGGTGCAGAGACACTTAATGGGATTGAAGTGATTCACGTCAGTGGCAATGCTGATAATGCTGAATTCATTCAGACGGAACGTATTACAGCCGGAGCTTATGATTACATACTGAAACGTGGTCAGGGGATTAACAGCACTAACTGGTATCTGATTAGCAGAAAAGACATTCCTGTACCACAACCTGAAGCTGTACCGGAAAGCCATGATAATAATTTGCGTCCTGAGGCAGGTAGCTATGTTGCCAGTATTGCTGCTGCAAATAATCTGTTTGTAACGAATCTGTATGAACGACAGGGACAGGAGTTGTATATCAGCCACATGACAGGAGAAGAAAATGAAGCAGGTATCTGGATGTATAATAAAGGAAAACATAATCGCTGGCGTGACAACAGTAGTCAGCTGAGAACCCGGGGGAATAGCTACGTTGTGTTAATAGGGGGAGATATAGCTCAGTGGAGCCTGAATGGTACCGATCGCTGGCATACAGGTATGATGGCTGGCTATGGTCATAATAATAACAGTACGAATGCCCTGAGCACCGGATACCATTCGGAAGGAAGAATGAATGGATACACAGCGGGTCTTTATGCAACATGGTATGCCAATGATGAAACACACAATGGTTCTTATCTTGATAGCTGGCTGCAATACAGCTGGTTTGATAATCATATAAATGGAGAACGGCTGCCTGCTGAGTCATGGAAGTCAAAAGGGTTTACGGTATCTCTGGAAGCAGGATATTCATGGAAGGCTGGAGAGTTTACCGACAATTACAAGGGAAGTCATGAATGGTATGTTCAGCCGCAGCTTCAGGTTGTCCGGATGAATGTAAAATCAGACAAATATCATGAAAGTAACGGAACCAGTATTGAAAATACCGGTAACGGAAATATTCTCACCCGCCTGGGAGCAAGAACATGGCTTACCAGCAAAAACGGTAAAAATACGCGGTATGCGGTTCCGTTCAGACCATTTGTGGAGGCACACTGGTTGCACAATAGTCGTGTTTTCGGCACCAGTATGAATGGTGTAAGTATATACCAGGATGGTGCGCGTGATATCGGAGAAATAAATGGTGGTGTTGTGGGAATGATAACACCAGAAGTAGCATTCCGGGCTGATGCAGGCATTCAACTTGGAGAACATGGATACCATAATACATCTGCCATGTTGAGTGTGGAATATCGTTTCTGA
Protein sequences of DBSCAN-SWA_3 >CP029164|934714:943640|940142_943640_+|AWH68717.1|DBSCAN-SWA MNKNIRILQFLVSILYSVQSHFSGAQTIQLNGNGIPESITRSITGVDGNAALNISVPYKTSYTQNILSVESSINIKGGTSNTSIGGAGVYGENFTLNNNGSVWGGDGYNGGIAVSGNKISINNYRNVYGGNGLGGSGSSGGAGLSGDDIIVDNYRSIYGGDDVGGTGGSGVTGSNITVHNSGGILGGNGVNGGDGINGSNLFITNDNMISGGYGIKQGGDAISGNQITLNNNGIVQGGYGPDGGCSVYGEDIHINNHGNLSGLYNSQKDAYNTSIIFSGGYNSLDIYSDSVINGDIKLASIPVNGTNELIIKNINNATAINGGLMIGNGSSVYLSGKNSIFNGNISIDEDASMNLSVGNANVHANTITLKSDSWLNIDTSIKNWTQDYYTLLSSDTGISIADNSHIVQYNVLLTEGAESYVYTSLNDDDNKLISMLRWNNTKGMGYGTFNIEKDATLNIGVSLSDNLSPLLYDGWDGKSLTKSGNGTLILSATNNYTGNTEVKSGVLILAAPDALGRTEYLYLSRGAELDMNGYPQTISKLLTAAGSVLNIHGGSLILNNGGESAGTIAGDGSLNINGGMLDITGNNRNFSGVFTVNKGAHLAVSTADNLGTAFVDNYGTLTLNSTSAWQLTNNISGYGNVRKTGAGALNISDNAKWTGMTDIIQGTVILGNADSPVMLGSNQVIVEEQGKLSGFGGVAGNLSNSGIVDLTTYMPGNILTVGGNYTGRNGLILLQTETGGDNSKTDRLVIKGNASGRTRVAVTQAGGTGAETLNGIEVIHVSGNADNAEFIQTERITAGAYDYILKRGQGINSTNWYLISRKDIPVPQPEAVPESHDNNLRPEAGSYVASIAAANNLFVTNLYERQGQELYISHMTGEENEAGIWMYNKGKHNRWRDNSSQLRTRGNSYVVLIGGDIAQWSLNGTDRWHTGMMAGYGHNNNSTNALSTGYHSEGRMNGYTAGLYATWYANDETHNGSYLDSWLQYSWFDNHINGERLPAESWKSKGFTVSLEAGYSWKAGEFTDNYKGSHEWYVQPQLQVVRMNVKSDKYHESNGTSIENTGNGNILTRLGARTWLTSKNGKNTRYAVPFRPFVEAHWLHNSRVFGTSMNGVSIYQDGARDIGEINGGVVGMITPEVAFRADAGIQLGEHGYHNTSAMLSVEYRF >CP029164|934714:943640|934714_935095_+|AWH72442.1|DBSCAN-SWA MQKNVTPGRRKGCPNYPPEFKQLLVAASCEPGISISKLALENGINANLLFKWRQQWREGKLLLPSSESPQLLPVTLDAAAEQPESLAEDPETLSISCEVTFRHGTLRFNGNVSEKLLTLLIQELKR >CP029164|934714:943640|935091_935439_+|AWH68713.1|DBSCAN-SWA MIPLPSGTKIWLVAGITDMRNGFNGLAAKVQTALKDDPISGHVFIFRGRSGSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKVFLTQAQLAMLLEGIDWRQPKRLLTSLTML >CP029164|934714:943640|937157_937508_-|AWH68715.1|DBSCAN-SWA MISLPSGTRIWLVAGVTDMRKSFNGLGEQVQHVLNDNPFSGHLFIFRGRRGDTVKILWADADGLCLFTKRLEEGQFIWPAVRDGKVSITRSQLAMLLDKLDWRQPKTFRLNALTML >CP029164|934714:943640|937504_937930_-|AWH68716.1|DBSCAN-SWA MELQDWRKEPRKNYSNEFKLRMVELASQPGACVAQIARENGVNDNVIFKWLRLWQNEGRVSRRLPVTTSSDTGVELLPVEITPDEQKEPVAAIAPSLSTSTQTRVSASSCKVEFRHGNMTLENPSPELLTVLIRELTGRGR >CP029164|934714:943640|935534_937127_-|AWH68714.1|transposase|DBSCAN-SWA MSQKYLIRIAELESQLRQKDQQLSLVEETEAFLRSALARAEEKIEEDEREIEHLRAQIEKLHRMLFGTRSEKLRREVELAEALLKQREQDSDRYSGREDDPQVPRQLRQSRHRRPLPAHLPREIHRLEPEESCCPECGGELDYLGEVSAEQLELVSSALKVIRTERVKKACTKCDCIVEAPAPSRPIERGIAGPGLLARVLTGKYCEHLPLYRQSEIFARQGVELSRALLSNWVDACCQLMTPVNDALYRYVMNTRKVHTDDTPVKVLAPGQKKAKTGRIWTYVRDDRNVGSSSPPAVWFAYSPNRQGKHPEQHLRPFRGILQADAFTGYDRLFSAEREGGALTEVACWAHARRKIHDVYISSKSATAEEALKRISELYAIEDEIRGLPESERLAARQQRSKALLTSLHEWMVEKNGTLSKKSRLGEAFSYVLNQWDALCYYSDDGLAEADNNTAERALRAVCLGKKNYVFFGSDHGGERGALLYGLIGTCRLNGIDPEAYLRHILSVLPEWPSNCVDELLPWNVVLTNK |
6 | Stx2-converting_phage(66.67%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
2046461 : 2053601
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >CP029164|2046461:2053601|DBSCAN-SWA ATTAACTCCTTAATTCCGCAATTTCACCTGCGGTCAGATAACGGATCGGGCGGTCACCGAGAATAAAAATCAGCTTTGCCGTTTCCTCCAGCTCTTCCATATTGTTGGCGGCTTCTTGCAGGCTTTCACCACAAACCACTGGGCCATGATTTGCCAGTAAAAAAGCCTGATTGTCTGCTGCCAGTTCCGCCAGATCCTGTGCGATGCGTTTATCGCCCGGTCGGTAATAAGGCACCAGCGGGATATTTCCCATCCGCATCACCACGTATGGTGTGAACGGACGAATAACGTTGCTGCTGTCCAGCCCTTGCAGGCAGGAAAGCGCCGTCGACCATGTGCTGTGCAAATGCACCACCGCTTTACAGCGCGGATTGTTGCGATACAGCGCCAGATGAAAGAGCACCTCTTTCGAGGGTTTGTCACCACTTAACCATTCGCCATCCGCGGCGACTTTGGAAAGCCGCTGCGGATCGAGATTGCCCAGGCATGAACCTGTCGGTGTCGCCAGTAAATTCCCGTCAGGTAAAAGCAGCGACAGATTGCCAGCCGAACCGGTTGCATAGCCGCGCTGAAAGAATGAACTGGCAATCCGCGTCATCTCCTCTCGCAAAGACTGCTCTACTTTTGCGAAATCGCTCATGATAAAAACTCTCTTTGGGCTCGTGAAAAAAAGGCTTCATCACCGAAGTTGCCAGATTTAAGGGCGAGTGAGACAGGCTTATCCAGTGCGTTTACCCACGGCACGCCGGGGGAAATGGTTGGGCCAATATGAAACCCTTTTATTCCCAGGCTCTGTGTGACTACGCCGGAGGTCTCACCGCCTGCGACAATAAAGCGTGTCACGCCTTCCGCTGCTAACCGCGCCGCTAGTTGAGAAAACAGTGTTTCTACTGCCTGACTGGCTTTTTGTGCACCGTATTGCTGTTGAATTGCTGCCAATGCGTCAGTGCTGGCGGTGGCAAAAACCAGTGGAGCAAGTACACTTTCCTGGCCCAGAACCCACTCTGCCAGCTCGTGTGCATAAGCGGCCAGAGTTTCGGTTGAGAGGCAGCGTGCCACATCAACCTCACGGGCTGGTGCAATTTGACGGTAATGTGCCACCTGGCGGTTGGTCATTTGAGAGCATGAACCGGAGAGCACTACGCCGCGCCCAGCGAGCGGACGCCCTGCTTCGCGAGCCTGGTTACCGTTTTCTTGCGCCCACTGCCGGGCCAGGCCAATCGCCAGACCAGAACCGCCCGTTACCAGTGGGGCATCGCGCAAGGCTTCCCCCTGAATTTCCAGATGGTGTTCGGTCAGCGCATCAAGCACCGCGTAGCGATAGCCCTCTTGCTGTAAGCGAGCCAGCTCCTGACGAACGGCATCCACACCTTGTTCGAAAACATGTGCCGAAACGACGCCGCAGCGCCCTGTGGATTGCGCTTCAACCAGACGGGGAAGATAGCTGTCGGTCATGGGATTTACCGGGTGATGGCGCATTCCGGATTCGGCCAGCAGTTGATTCATTACGAACAAATACCCCTGATAAACCGTACGTCCGTTGACCGGCAGGGCCGGAGAGAAGACCGTAAACGGCGTGTCGAGAGCATCCATTAATGCATCGGTAACCGGGCCGATATTACCTTTCGCAGTACTGTCGAAAGTAGAGCAGTATTTGAAATAGATCTGTTTGCAACCTTGCTGTTGCAACCAGCTCAGAGCCGCCAGCGATTGCTGTGTGGCTTCAACCACCGGACAGGAACGCGTTTTCAGGCTGATCACCAGTGCGTCGATTGCTTCCGGCATTTTACCTGTTGGAACACCGTTAATTTGTACCGTTGGTAGACCGTTTTCCACCAGAAAACTGGCGATATCCGTCGCGCCGGTAAAATCATCGGCGATAACGCCAATCTTGATCATGATTTCGCTCCCGGTAGGGTGATGCCAGAGAAAATCTTGATAACTGCGCTATCGTCTTCTTTCCCGTAACCCGCGTTACTGGCGCTGGTGAACATATTCAATGCTGTTGAGGCCAATGGCAGCGGGAAGTGCAGGGCTTTGGCGGTATCGGCAACCAGACCAAGATCCTTAACAAAAATATCGACGGCTGAATGCGGGGTGTAATCGCCATCCACCACATGACGCATCCGGTTTTCGAACATCCATGAATTTCCGGCGGCATTGGTCACGACGTCATACATCACATCCAGCGGGATCCCCGCACGGGCTGCAAGTGCCATCGCTTCGGCTCCGGCAGCAATATGTACGCCCGCTAACAACTGGTGAATAATTTTTACGGTCGAACCCAGTCCCGGTTCTGAACCGATGCGATAAACTTTTCCGGCAACGGCTTCCAGCACGGGTGCCAGTCGTTCAAAGGCAATATCGCTACCGGAGGCCATGACAGTCATTTCACCGTTAGCGGCTTTTACTGCACCACCAGAAACTGGCGCATCCAGCATTTCCAGATCGAATCCAGCCAGAGCGGTAGCAATTTCTTGCGCATCAGCACTGGCGATAGTGGAAGAAACCATTACTGCCGTACCGGGTTTCAGATGTTGTGCAACGCCTTTTCCACCAAACAGCACCTGTTTAACCTGGGTCGCATTGACCACCAGCACCAGCAGAGCGTCCAGTTTTTCGGCAAACGTCGCGGCGTTATCAGAAACCCCGCAAGCACCAGCCTCTTTCAACGTAGCGCAGGCATTGCTGTTCAGGTCTGCGCCCCAGGTAGAAAGACCTGCGCGGACACATGACAGTGCTGCTCCCATTCCCATTGACCCTAAGCCAACGATACCGACATGAAACTCCGATCCCGTTTTCATCTGCTCTCCTTGTTAATTTAAGTGATATTTTGTTTGATGTTGTGAATATAAGCGCGGAAAGATAACGATATGGTGAGCTGATTCACAAAAATTAACATTGTGTGTTATTTTATGTGAACTAAGCGTTAGTTGCGCCGCGCACGTTTCGCAGGCAAATAGCGTAGAATGTCAGCAGGACAAAGGAAGGAGCAAAAGTTGATACCCGTAGAGCGTCGCCAAATCATCCTTGAGATGGTAGCTGAAAAAGGCATTGTTAGTATTGCTGAACTAACGGACAGAATGAATGTGTCACATATGACCATTCGTCGGGATTTACAAAAACTGGAGCAGCAGGGAGCCGTTGTGCTGGTGTCCGGGGGCGTCCAGTCTCCGGGACGGGTGGCGCATGAACCTTCTCATCAGGTAAAAACTGCGCTGGCAATGACGCAAAAAGCGGCTATTGGCAAGCTGGCGGCAAGTCTTGTTCAGCCGGGACGTTGTATCTATCTGGATGCGGGAACGACCACGTTAGCGATTGCACAGCATCTGATTCACATGGAGCCACTCACAGTGGTTACTAACGACTTCGTTATTGCGGACTACTTGCTCGACAACAGTAATTGCACAATTATTCACACTGGTGGCGCAGTGTGTCGGGAAAACCGTTCCTGTGTTGGGGAAGCCGCTGCGACCATGCTGCGTAGCCTGATGATTGATCAGGCTTTTATTTCTGCATCGTCATGGAGTATGCGGGGGATCTCTACGCCAGCAGAAGATAAAGTCACGGTGAAAAGGGCGATTGCCAGTGCCAGCCGTCAGCGAGTTTTGGTCTGTGATGCGACGAAATATGGACAGGTGGCGACATGGCTGGCGTTACCATTAAGCGAGTTTGATCAGATCATTACTGACGACGGTCTGCCGGAGAGTGCCAGCCGCGCGCTGGCGAAGCTGGATCTCTCTTTGCTGATAGCGAAAAATGAATAATGACCTGCAATAACGCCGGGTTACTCATGCTTCACAGAAGAAGCATGAGACTACTTTATTTTATAAAATGACAGCCGCCCGCTTTTCGGCGATCCGGTATCTATATAAATCTGGTTAGCGAACGTCTGAATGTTATCAAACATCATATGTCCAAATATAAAATAATCAGCGCCGTTTATTTGTTGTAACTCGCCATTAAGCGATTTCTGCACACGATCAACAGGCCAGAGTAGTTCTCTCTCCGCTATTTCTTTGCCAAACTGATACTCCTTCCCCGGATAATCTGCATGTGCGATGACATATTTTATGGTGTCGTTAGTGATTTCAATAATATGTGGAAGGTGATGGAATTTCAGCAACAGATCTATTGCCTCTTGTTGCTCTGAATCATTTAAATCGAAAAACCAGTCACCACCGCTGGCAAGCCACATATTGCCATCGCCGGTCGCGAACGCATCCAACGCCATCGCTTCGTGGTTGCCTTTGACGGAGATAAACCAGGGCTGGTTTAACAGGCGCAGCACGTTAAGACTCTCGGGCCCACGATCGATGTTATCGCCGACAGAAATAAGCAAGTCGGTTTCAGGACAAAAAGAGAGTTGATGTAAGCGGGATTGTAATAATTGATAGTCACCATGAATATCACCAACGGCCCATATATGGTGATAGTGATGGGCATTGATTTTTTGGTAGCGTGTAGATGGCATGGTTTTACCCTGTAAAATAAGATTTCCTATTATACAGGGTATTTTTATTTGATTCGTCAGTTGTCGTTAATATTCCCGATAGCAAAAGACTATCGGGAATTGTTATTACACCAGACTCTTCAAGCGATAAATCCACTCCAGCGCCTGACGCGGGGTGAGTGAATCTGGGTCGAGATTTTCCAGTGCTTCGACCGCAGGCGACGTTTCTTCTGGCACTGACAGCAAAGACATTTGCGTACCATCCACTTGCGTCGCGGCGGCGTTCGGCGAAATGCTTTCCAGCTCACGCAGTTTTTGCCGTGCGCGCTTAATAACCTCTTTTGGCACGCCGGCCAGAGCTGCAACCGCCAGGCCGTAGCTTTTGCTCGCTGCGCCATCCTGCACGCTGTGCATAAAGGCAATGGTGTCGCCGTGCTCCAGTGCATCGAGATGCACGTTGGCGACGCCTTCCATTTTCTCCGGTAACTGGGTCAGCTCGAAATAGTGGGTAGCAAACAACGTCAACGCTTTAATCTTATTCGCCAGATTTTCCGCGCACGCCCACGCCAGCGACAGACCATCGTAAGTGGACGTTCCACGCCCAATCTCATCCATCAGCACCAGACTGTACTCGGTGGCGTTATGTAAAATATTGGCGGTTTCGGTCATTTCCACCATAAAGGTTGAACGCCCGGATGCCAGATCATCCGCTGCCCCTACGCGGGTAAAGATGCGATCGATAGGTCCAATCTCGACTTTTTGTGCCGGTACATAGCTGCCGATGTAGGCCATCAGCGCAATCAGTGCGGTCTGGCGCATATAGGTACTTTTACCGCCCATATTCGGACCGGTAATGATCAACATGCGACGCTGCGGCGACAGGTTCAGCGGGTTGGCGATAAACGGCTCATTCAGCACCTGTTCAACCACCGGATGGCGGCCTTCGGTGATGCGAATACCCGGTTTATCAATAAAGGTCGGGCAGGTGTAGTTCAGCGTATAGGCCCGTTCCGCCAGGTTCACCAGCACGTCGAGTTCCGCCAGCGCGCTCGCGCTCTGTTGCAACGCTTCCAGATGCGGCAACAGCAGGTCGAACAACTCTTCATAAAGCTGTTTTTCCAGTGCCAGTGCTTTGCCTTTTGAAGTGAGAACTTTGTCTTCGTACTCTTTTAGCTCTGGAATAATGTAGCGCTCGGCGTTTTTCAGCGTCTGGCGACGCATGTAATTGATGGGTGCCAGATGGCTTTGTCCACGGCTGATCTGAATGTAATAACCATGAACCGCGTTAAAGCCGACTTTCAGGGTGTCCAGGCCAGTACGTTCACGCTCGCGGACTTCCAGACGCTCCAGATAATCGGTCGCGCCGTCAGCCAGCGCGCGCCATTCATCCAGTTCTTCGTTATAACCAGTCGCAATAACACCACCGTCGCGTACCAGCACCGGCGGCGTGTCGATGATTGCTCGCTCCAGCAGGTCGCGCAGTTCGGCAAACTCGCCCATCTTCTCACGCAGTGCTTGTACCGGGGCGCTATCGACACCTTCCAGTTGCGCGCGCAGCTCCGGCAGTTGCTGGAAAGCGTGACGCATACGGGCCAGATCGCGTGGACGAGCAGTTCGTAAAGCCAGACGCGCCAGAATACGTTCCAGGTCGCCGACCTGACGCAGTACCGGCTGCAGCTCGGCGGTGAAATCCTGCAATGCGCCAATAGTTTGCTGGCGCTCAAGCAACACGCGGGTATCGCGCACTGGCATATGCAGCCAGCGTTTCAGCATACGACTACCCATCGGCGTTACAGTACAGTCGAGCACTGAAGCCAGCGTATTTTCCGCACCGCCGGCCAGGTTCTGAGTAATTTCCAGGTTACGACGCGTCGCGGCATCCATAATGATGCTGTCCTGCTGACGTTCCATAGTGATAGAACGAATATGCGGCAGGGTCGTGCGTTGGGTATCTTTCGCATACTGCAACAGACAACCGGCAGCACAAAGTCCGCGTGGTGCGTTCTCCACACCAAAACCGACCAGATCTCGGGTGCCAAATTGCAGGTTCAATTGCTGGCGAGCGGTGTCGATTTCAAACTCCCACAGCGGGCGACGACGCAGGCCGCGGCGACCTTCAATCAGCGACATCTCGGCGAAATCTTCCGCATACAGTAATTCCGCTGGATTCGTACGTTGCAGTTCTGCCGCCATCGTTTCGCGGTCCGCCGGTTCGCTCAGGCGAAAACGCCCGGAGCTGATATCCAGCGTGGCGTAGCCGAACCCCTTGCTGTCCTGCCAGATAGCCGCCAGCAGGTTGTCCTGACGTTCCTGCAACAGCGCTTCATCGCTGATGGTGCCCGGCGTAACGATACGGACAACTTTGCGCTCGACCGGCCCTTTGCTGGTCGCCGGATCGCCAATCTGTTCACAAATAGCAACCGATTCGCCCTGATTCACCAGTTTGGCGAGGTAGTTTTCCACCGCATGGTAGGGAATCCCCGCCATCGGGATCGGCTCTCCCGCCGAAGCACCGCGTTTGGTCAGTGAAATATCCAGCAGTTGCGATGCGCGTTTTGCGTCGTCATAAAACAGTTCATAAAAATCACCCATCCGGTAAAACAGCAGGATCTCGGGATGCTGGGCTTTCAGCTTGAGATACTGCTGCATCATGGGCGTATGGGCATCGAAATTTTCTATTGCACTCAT
Protein sequences of DBSCAN-SWA_4 >CP029164|2046461:2053601|2051039_2053601_-|AWH69738.1|DBSCAN-SWA MSAIENFDAHTPMMQQYLKLKAQHPEILLFYRMGDFYELFYDDAKRASQLLDISLTKRGASAGEPIPMAGIPYHAVENYLAKLVNQGESVAICEQIGDPATSKGPVERKVVRIVTPGTISDEALLQERQDNLLAAIWQDSKGFGYATLDISSGRFRLSEPADRETMAAELQRTNPAELLYAEDFAEMSLIEGRRGLRRRPLWEFEIDTARQQLNLQFGTRDLVGFGVENAPRGLCAAGCLLQYAKDTQRTTLPHIRSITMERQQDSIIMDAATRRNLEITQNLAGGAENTLASVLDCTVTPMGSRMLKRWLHMPVRDTRVLLERQQTIGALQDFTAELQPVLRQVGDLERILARLALRTARPRDLARMRHAFQQLPELRAQLEGVDSAPVQALREKMGEFAELRDLLERAIIDTPPVLVRDGGVIATGYNEELDEWRALADGATDYLERLEVRERERTGLDTLKVGFNAVHGYYIQISRGQSHLAPINYMRRQTLKNAERYIIPELKEYEDKVLTSKGKALALEKQLYEELFDLLLPHLEALQQSASALAELDVLVNLAERAYTLNYTCPTFIDKPGIRITEGRHPVVEQVLNEPFIANPLNLSPQRRMLIITGPNMGGKSTYMRQTALIALMAYIGSYVPAQKVEIGPIDRIFTRVGAADDLASGRSTFMVEMTETANILHNATEYSLVLMDEIGRGTSTYDGLSLAWACAENLANKIKALTLFATHYFELTQLPEKMEGVANVHLDALEHGDTIAFMHSVQDGAASKSYGLAVAALAGVPKEVIKRARQKLRELESISPNAAATQVDGTQMSLLSVPEETSPAVEALENLDPDSLTPRQALEWIYRLKSLV >CP029164|2046461:2053601|2048355_2049264_-|AWH69735.1|DBSCAN-SWA MKTGSEFHVGIVGLGSMGMGAALSCVRAGLSTWGADLNSNACATLKEAGACGVSDNAATFAEKLDALLVLVVNATQVKQVLFGGKGVAQHLKPGTAVMVSSTIASADAQEIATALAGFDLEMLDAPVSGGAVKAANGEMTVMASGSDIAFERLAPVLEAVAGKVYRIGSEPGLGSTVKIIHQLLAGVHIAAGAEAMALAARAGIPLDVMYDVVTNAAGNSWMFENRMRHVVDGDYTPHSAVDIFVKDLGLVADTAKALHFPLPLASTALNMFTSASNAGYGKEDDSAVIKIFSGITLPGAKS >CP029164|2046461:2053601|2050277_2050934_-|AWH69737.1|DBSCAN-SWA MPSTRYQKINAHHYHHIWAVGDIHGDYQLLQSRLHQLSFCPETDLLISVGDNIDRGPESLNVLRLLNQPWFISVKGNHEAMALDAFATGDGNMWLASGGDWFFDLNDSEQQEAIDLLLKFHHLPHIIEITNDTIKYVIAHADYPGKEYQFGKEIAERELLWPVDRVQKSLNGELQQINGADYFIFGHMMFDNIQTFANQIYIDTGSPKSGRLSFYKIK >CP029164|2046461:2053601|2046461_2047100_-|AWH69733.1|DBSCAN-SWA MSDFAKVEQSLREEMTRIASSFFQRGYATGSAGNLSLLLPDGNLLATPTGSCLGNLDPQRLSKVAADGEWLSGDKPSKEVLFHLALYRNNPRCKAVVHLHSTWSTALSCLQGLDSSNVIRPFTPYVVMRMGNIPLVPYYRPGDKRIAQDLAELAADNQAFLLANHGPVVCGESLQEAANNMEELEETAKLIFILGDRPIRYLTAGEIAELRS >CP029164|2046461:2053601|2047096_2048359_-|AWH69734.1|DBSCAN-SWA MIKIGVIADDFTGATDIASFLVENGLPTVQINGVPTGKMPEAIDALVISLKTRSCPVVEATQQSLAALSWLQQQGCKQIYFKYCSTFDSTAKGNIGPVTDALMDALDTPFTVFSPALPVNGRTVYQGYLFVMNQLLAESGMRHHPVNPMTDSYLPRLVEAQSTGRCGVVSAHVFEQGVDAVRQELARLQQEGYRYAVLDALTEHHLEIQGEALRDAPLVTGGSGLAIGLARQWAQENGNQAREAGRPLAGRGVVLSGSCSQMTNRQVAHYRQIAPAREVDVARCLSTETLAAYAHELAEWVLGQESVLAPLVFATASTDALAAIQQQYGAQKASQAVETLFSQLAARLAAEGVTRFIVAGGETSGVVTQSLGIKGFHIGPTISPGVPWVNALDKPVSLALKSGNFGDEAFFSRAQREFLS >CP029164|2046461:2053601|2049459_2050227_+|AWH69736.1|DBSCAN-SWA MIPVERRQIILEMVAEKGIVSIAELTDRMNVSHMTIRRDLQKLEQQGAVVLVSGGVQSPGRVAHEPSHQVKTALAMTQKAAIGKLAASLVQPGRCIYLDAGTTTLAIAQHLIHMEPLTVVTNDFVIADYLLDNSNCTIIHTGGAVCRENRSCVGEAAATMLRSLMIDQAFISASSWSMRGISTPAEDKVTVKRAIASASRQRVLVCDATKYGQVATWLALPLSEFDQIITDDGLPESASRALAKLDLSLLIAKNE |
6 | Escherichia_phage(83.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
2633863 : 2643308
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >CP029164|2633863:2643308|DBSCAN-SWA AATGATTGAATTTAGCCATGTCAGCAAACTGTTCGGCGCACAAAAAGCCGTTAACGATCTCAATCTCAATTTTCAGGAAGGGAGTTTTTCGGTGCTGATTGGCACATCTGGCTCCGGCAAATCCACCACCCTGAAAATGATTAACCGCCTGGTGGAACATGACAGCGGAGAGCTCCGCTTTGCCGGAGAAGAAATTCGCTCGCTGCCAGTACTGGAGTTACGCCGCCGTATGGGCTATGCCATTCAATCTATTGGCCTGTTTCCCCACTGGAGCGTGGCACAAAACATTGCCACCGTGCCGCAATTACAAAAATGGTCGCGGGCGAGGATTGACGATCGTATCGACGAATTAATGGCGCTACTGGGGCTGGAGTCAAATTTACGTGAGCGTTATCCACATCAGCTTTCCGGTGGTCAGCAGCAACGTGTGGGAGTAGCGCGCGCACTGGCTGCCGATCCGCAAGTCTTACTGATGGATGAACCTTTTGGCGCACTGGACCCGGTAACGCGCGGCGCGTTGCAACAAGAGATGACGCGCATTCACCGTTTGCTGGGGCGTACCATTGTGCTGGTCACGCATGATATTGATGAGGCGCTACGGCTGGCAGAACATCTGGTATTGATGGATCACGGTGAAGTGGTGCAGCAGGGGAATCCGCTGACGATGCTGACTCGTCCGGCGAATGATTTTGTCCGCCAGTTTTTTGGACGTAGTGAACTGGGTGTGCGCCTGCTTTCGTTACGTAGTGTGGCGGATTACGTGCGTCGCGAAGAACGGGCAGAAGGTGAGGCACTGGCAGAAGAGATGACGCTGCGCGATGCGCTCTCCCTGTTTGTCGCGCGGGGATGCGAGGTGCTGCCAGTGGTGAACACGCAGGGCCAGCCTAGCGGCACGCTGCATTTTCAGGATCTGCTGGTGGAGGCGTAAGCGTATGAAGATGTTGCGCGATCCGCTGTTCTGGCTCATTGCTTTGTTTGTGGCACTGATTTTCTGGCTGCCTTACAGCCAGCCGCTGTTTGCTGCCTTGTTCCCACAACTGCCACGACCCGTTTATCAGCAAGAAAGTTTTGCAGCTCTGGCACTGGCTCATTTCTGGCTGGTGGGAATTTCGAGTCTGTTTGCGGTGATCATTGGTACTGGTGCCGGAATTGCAGTCACTCGCCCGTGGGGCGCGGAATTTCGCCCACTGGTGGAAACTATTGCCGCCGTTGGGCAGACTTTTCCGCCCGTTGCGGTGCTGGCGATTGCTGTTCCGGTGATCGGCTTTGGTCTGCAACCAGCGATTATCGCCTTGATCCTATACGGCGTGCTGCCCGTCCTGCAGGCGACACTTGCCGGGCTGGGAGCGATTGATGCCAGCGTGACAGAAGTTGCGAAGGGTATGGGAATGAGTCGTGGTCAGCGACTGCGTAAGGTCGAGCTACCGCTGGCGGCTCCGGTGATTCTGGCGGGCGTGCGAACTTCGGTGATTATCAACATTGGTACGGCGACGATCGCCTCAACGGTAGGGGCCAGCACGCTGGGTACGCCCATCATCATCGGGCTTAGCGGATTTAATACCGCGTATGTGATCCAGGGGGCGTTACTGGTGGCACTGGCGGCGATCATCGCAGACCGCCTGTTTGAAAGGCTGGTGCAGGCGCTTAGCCAGCACGCAAAATAAAGGTATAACCTGCGAGCATGACGCCACCAATTCCGCCTAACGCCATAAACAGGAACAGGGCGATGACCCCAATTTTAGCTATGCGCATAATGCACTCCTTATGTTAACGAAAGGATTGTACAGTAAAGCGCATTTGTTAACGAATCATTAAATGCCGAGTGGGAAAATATCATGGCCTTGTTCCTGCCAACTGGTGAGTTGCTGCTGTTGGGCGGAGGTTCGATTTTCACCGCACCACACCAGCAATGTACGGCCTTCGAACAGTTCAGGGCGTAGTTGATTGAGCGAGTGGGCGAGGACATCAATGCGCCATCCTTGTTGACTGGCAATCCAGCCCTCCAGCCACAGACGGGTGGTATCCTGAATATTCCAGCCAACCACCAGCGCATCTTTACCCTGTTTTTTACGTGCCGAAGCCAGACAAATGGCGATGTAGTTGATCAGTACGCCGTCGAGGATCGCCAGCAGCGCCTGGAGAGTCGGTTGTTGGCACTGAAGCCGTCGGCGCAGAGGAATAAACAGATGTGTGGTGAGTGTCTGGGCGGGGTAATCCTGACCGCGCTCTTTGATCCACGTTCGCAGGCTATGCAGATTGCCGCTTTGCAGGTAGGTCAGTAATGTTTCTTGCTGATCGCGCCAGCCGTTCTGCACATCAACATTTTCATTACTGAGCAGCATTTTAACTTTGCTGACCTGCACGCCGTTGTCGATCCAGCGTTTGATCTCGCGGATCCGGTCAATATCGGCATCGTTGAACAGCCGATGACCGCCGTCTGTCCGTTGCGGTTTCAGCAATCCGTAACGCCTCTGCCACGCGCGTAACGTGACAGGGTTAATATCACAAAGCAACGCCACTTCACCAATTGTGTAAAGCGCCATCGTCTCACCCTTGCTCGCGAGGTCCCGGTTTAACTTTAGACGCAGTTTTGCGAACCAGGTAGTTTTGCCCGTTTTTTGTGCATTTATAGGGTGATTTTATTTTTGCCAGGCGATTTTGAGTGATCGTACTCACGAATTCTCATTTTTCTGCAAGAGTTCAAAGAAAGTTAAACGCAGGCAATGTATGTTACGCGTTTTAAAGGGAAGTGTGGTTTGCGGGTATGTACGATTTTAATCTGGTGTTGCTGCTGCTTCAGCAGATGTGCGTTTTTTTAGTCATTGCGTGGTTAATGAGTAAAACGCCATTATTCATACCGTTAATGCAGGTCACGGTTCGTCTGCCGCATAAATTTCTCTGCTACATCGTCTTTTCCATCTTTTGCATTATGGGCACCTGGTTTGGGTTGCACATTGACGATTCTATTGCCAATACCCGTGCGATAGGCGCGGTTATGGGCGGCTTACTCGGCGGTCCGGTCGTCGGTGGGCTAGTTGGCCTGACCGGCGGGTTACATCGATATTCGATGGGGGGCATGACCGCGCTAAGTTGCATGATCTCGACCATCGTTGAAGGATTGCTCGGCGGCCTGGTACACAGCATCCTGATCCGTCGCGGGCGCACTGATAAAGTCTTTAACCCCATTACCGCCGGTGCCGTCACGTTCGTCGCTGAAATGGTGCAAATGTTAATCATCCTGGCGATCGCCCGACCTTATGAAGATGCGGTGCGTCTGGTGAGTAATATTGCTGCGCCAATGATGGTCACCAATACCGTCGGCGCGGCGCTGTTTATGCGTATATTGCTCGATAAACGCGCGATGTTTGAAAAATACACTTCAGCTTTTTCTGCCACTGCGCTGAAAGTGGCGGCCTCGACGGAAGGCATTTTGCGCCAGGGGTTTAACGAAGTGAACAGCATGAAAGTGGCACAGGTGCTGTATCAGGAGCTGGATATTGGTGCAGTCGCGATTACCGATCGAGAGAAATTGCTGGCCTTTACCGGAATTGGTGACGATCACCATTTACCCGGCAAACCGATTTCTTCGACTTACACCTTAAAAGCGATTGAAACCGGTGAAGTGGTCTACGCTGATGGCAACGAAGTACCTTATCGTTGCTCTTTGCATCCGCAATGCAAACTGGGGTCGACGCTGGTAATTCCGTTGCGTGGTGAAAATCAGCGGGTGATGGGCACCATCAAATTGTATGAAGCCAAAAACCGTTTATTCAGTTCAATCAACCGCACGTTGGGTGAGGGGATTGCGCAATTGCTTTCGGCGCAGATCCTTGCCGGGCAATATGAGCGGCAAAAAGCGATGCTCACCCAGTCAGAGATCAAACTGCTTCACGCCCAGGTGAACCCCCATTTTTTGTTTAATGCGCTTAACACCATTAAAGCGGTGATCCGTCGCGACAGCGAACAGGCCAGCCAGCTGGTGCAGTACCTTTCCACTTTTTTCCGCAAAAACTTAAAGCGGCCTTCGGAGTTTGTTACTCTCGCCGACGAAATTGAACATGTGAACGCTTATCTGCAAATTGAAAAGGCGCGCTTCCAGTCGCGGTTGCAGGTCAACATTGCTATTCCGCAAGAATTATCCCAGCAGCAATTGCCCGCGTTTACCCTGCAACCGATAGTGGAAAACGCCATTAAACATGGGACATCACAACTGCTGGATACAGGGCGAGTGGCAATCAGCGCCCGACGTGAGGGGCAACATTTAATGCTGGAGATCGAAGACAATGCCGGTTTGTATCAACCGGTAACCAATGCCAGTGGGCTGGGGATGAATCTGGTGGATAAGCGTTTACGTGAACGGTTTGGCGATGACTATGGGATAAGCGTCGCCTGTGAGCCTGATAGTTACACCCGAATAACGTTACGACTACCATGGAGGGACGAGGCATGATTAAAGTCTTAATTGTCGATGATGAACCGTTAGCACGGGAGAACCTGCGCGTATTTTTGCAGGAGCAGAGCGATATTGAAATCGTTGGAGAGTGTTCAAACGCCGTGGAAGGGATCGGCGCGGTGCATAAACTGCGCCCGGATGTGCTGTTTCTCGATATCCAGATGCCGCGCATCAGTGGTCTGGAAATGGTGGGGATGCTTGACCCGGAACATCGTCCGTATATTGTTTTTCTCACCGCGTTTGACGAATACGCTATTAAAGCTTTTGAAGAACATGCCTTTGACTATCTGCTGAAGCCAATTGATGAAGCGCGACTGGAGAAAACGCTGGCGCGTTTGCGTCAGGAACGCAGCAAGCAGGATGTTTCGCTGTTACCGGAAAATCAACAGGCGCTGAAATTTATCCCTTGTACGGGGCATAGTCGGATTTATTTGCTGCAAATGAAAGATGTGGCATTTGTCAGCAGTCGGATGAGCGGTGTCTACGTTACCAGCCACGAAGGGAAAGAGGGCTTTACCGAATTGACATTACGTACCCTGGAAAGTCGTACACCACTACTGCGCTGCCATCGTCAGTATCTGGTTAACCTCGCGCATTTACAGGAGATTCGTCTGGAAGATAACGGTCAGGCCGAGTTGATTTTGCGTAATGGCTTAACCGTGCCGGTCAGCCGCCGTTATCTGAAAAGCTTAAAAGAGGCGATTGGCCTGTAAAAGACTGCTAAAATGGCTTTTTGCCTCATCAACACCTGAAGGCCTCATGCTAAGTAACGATATTCTGCGCAGCGTGCGCTACATTTTGAAAGCCAATAATAATGACCTGGTGCGTATACTGGCGCTGGGTAATGTCGAAGCCACCGCGGAACAGATTGCCGTCTGGCTACGTAAAGAAGACGAAGAGGGTTTTCAGCGTTGTCCGGACATTGTTTTGTCGTCATTCCTCAATGGCCTGATTTATGAAAAACGCGGCAAGGATGAGTCTGCTCCGGCACTGGAGCCGGAACGTCGCATTAATAACAACATCGTGCTGAAAAAATTACGCATCGCGTTTTCGCTGAAAACCGATGACATTCTGGCGATCCTCACCGAACAGCAGTTCCGCGTTTCGATGCCGGAAATTACAGCGATGATGCGCGCACCGGATCATAAAAACTTCCGCGAATGCGGCGATCAATTTTTACGTTATTTTCTGCGTGGACTGGCAGCGCGCCAGCATGTGAAGAAAAGCTAAGACGGGTATGGCGGCCATGCGAAACATGGCCGCCGACAGATTATTTCACTTCTTTAAAACCAGCGGCTTTCATCACCAGTTCCATTTGCGCCATAGTGATACCTTTTTTGGCATCTTCAGCAGAAACGTTGATTCCTGAAATACCCTGCAGGGCTTTAAAATCCACTTTTTCCATATCGATAGTCACGTTTTCCTGCGCGTAGGTATCGGTATAGGTTAATTTTTCTTCAACACCCGCGATGTTTTTGTATTTGGCGCTTAACGGCTCAAGTGTCTTGGCAGCATCTTCTTTGGTGGTTGCACCAATGGAGGCAAATTGAATTTTGGTTTCAGAAGATTGCTTAAGCACCTTGTCACCTTTGTAGACATAGGTAATGGCAATTTCAGTGCCGTTCAGATTGGCGCTGAATTTCTTCGATTCTTCTTTGTCACCGCAGCCAGCAAGAGAGAAAACCAGAACAGATGCAACAACAAGGGAAAACAGCTTATTGAAAGCCTTCATGTAAAACTCCATTTTATTTAATCAAGGAACTGGTGACTCTCACCAGGGGCTATATAGGATATGCCTAATACCGTGGCGTGAGCAGTCCGGAACTGGAGTAGAACTCTTAGTAAAAAGCACTATTTCATCCTTGTTGCTGAAGCATGGGGAATAATTGTTCGCACAGCAAAACACCGTTATTCATTGCTTCTACCCGTGCCTCGCTTTCTGTATTACGAAATTGTCCCAACATATGTGCCAGCCGATAAAAACCCACCGCGGTGAGGTCATTCGCCAGCAACTCTGCCTGACCAATAGCACTCTGTTCCTGATAGCGCCAGCCGTTATGGAGCAGTTGAATAAGTAATGCCTGGCAGCGCATCAGCAACTGATGAGCAGTAGACGGCACAGGCAGAACGCTGGCAGAAGGTAGTGATGCCACCACAGGCGCAGTTTCTGCGTCCAGCGCCCAGGCACGAGTCTTTGTCATCATTACCTGTGGTTCCAGTGTCAATTGCCCATCAACAAAACTGACAAAGCCAGAAACCAGACACACGGGGTCGTCTGTTTGTTGCAAAAGCGCCGCCATGCGTTCAACGGCATAAGGCGCGCTGGCTGAGGCTGGCAGTGATAACGTCAGCAGATTATCTTCACCTTCGCCGCTGATTACCTGCGCATCCAGCGTCTGCCGACTGCTGTCCCAACCGAGCGAAATGCACTCTTCTACCGGCAGAATAAATAAATTATCGACCTGGTTAAGCGGTCGAATGGCGGCTGGCGGACGCTGGCGTAAATATTCCCGCAAAGCCACAATGCCCGGCTGGCGCAGCGGCGTACTCAGCATTTTCCAGGCATCTGGCGACAGTGGAACAACACTACTCAAGCGGTTGCGGGTAGCTAACAGCAGTTCGCCTTCGGCACTGCGACGCGCTGCTTGTGAAACAATTTGCCCACCTGCCAGTGCGCCAGCCTGAAAGCTAAACAGCCGACGTTTTGCCGCGGGAGCGTTTTCTTGTTCGCTTCTCGGCCAACTACGCGAAAGATGCAAAATACTGCCGGTGTCGGGATCGGTAAACCAGATGCGTAAACCATAATGCTCAATATCCTGCCAGCAACGCATACCTAAAGACACCAGCCGCAGATGATCAAGCTTTGCTTCTCCGGCAATGCCAGAGCCAACGACCGTGCGCCACGGCACAGGAGGAACTTCACCAACACTGTCGCGCCGGGCCATCTCTTGTACGCAATTTAATCGACTGTTTAATGCCGCAAGCTGACGTAAGCATTCTCCGGCATGATAGTGGCTGGCGCGGGCGTGGAAGGCATCAACGCTGGCGCGCAGTTGCCGTAGCGATTCACTCACCCATCGCCAGTTGCAGGCCTCTGCCGCCTGCAATGCGCGGTTGAAAGCGGCCTCGTAGTGAATAAGCGGCTGGCTGATGCCGCTCAGCCATAATGCCTGGCTTAGTTGTTGAACATATTGACGACACGTTTTGCCTTCTTCGCTGGCAAACGGATCGTCAGATGATGTGACGTGTTCGCTGCGCATCTGCCAGATTAAATGAGTCAATTCCGCTTGCTGAGTTTTGGCCTCGACGAAGGCCTGCACAGCCAGTACGACATGTTCGCAAAGTGTGCCTTCAATACAATCACAACGGGCGAAACGAATACTGCTGCGGGAATAAAAACGCACATCACTCATCGGTAAGCGGGCAGAGGGAATTTCGCCAGGCGCACAGAACAACTCAATGGTGATGCCTTTAGCGACCAACGCCTGTGCGCGTTTGCGGGTGGCATCGGGAAGGGTAGCCAGTTCTTCCAGCCAGATTGCCGGATCCCATTCCTCTTCTTTTTCCGTGGGCTGGGCGGTAGCACAAAGTCGTTGATAACTTAACACCAGCATCACGCGATGACGGCACATGCCGCTGGCCCCGCAGGTGCACTGAGTCTCTTTCAGTGCCTGACCGTTCGCCAGCTGGGTACGGACACCGTCACTGAAGGTGGCGATTAAAGCGCCGTTCTCATGGCTGATTTCCGGGACGTTGCCATTTTCCAGTTCCTTAAGGCTGCGCTTAACAAAACCGGCATTGCTTAACGCCGTCAGTGCCTGTGGTGTCAGTTCTAATAATTCCGGACGTAGTGAATTCATGACTGAAGGTTCTCTGCAAGCCATGATGCCAGCTCGCCCGGCGTCATGGCGGCTATTTGTGCGCCGACATTAACCAGCGCCTGGGCCGTATCGCGGTCATAGCAAGGCGTTGCTGTGCTATCGAGCGCTGCCAGTCCCAGCACTTTGATGCCGCTCTGGACACACTTTTTCACCTGATGCGTCAGTAATGATGATGAACCCCCTTCGTAAAAATCGCTCACGAGGATAATGACGCTTTTCGCTGGTTGTTCAATAAGTTGCCGACCATACTCCACGGCACTGGCGATATTGGTTCCGCCGCCTAATTGCACTTTCATTAATAACTCTACCGGATCGGCAACGTCTGCCGTGAGATCAACCACACTGGTGTCAAAAGCGACCAGATGGGTACGAATGCCGGGTAACTGCCATAAACAGGCCGCCATCACCGCAGAGTGGATCACCGAATCGACCATCGAGCCGCTTTGATCAACCAGTAAGACCAGTTGCCATTCTTCGCTTTGGCGTTTAATGCGGCTGTTAAAGCGGGGGGATTCGATATACAACTTGCCGTATTGCGGGTGCCAGTGTTGCAGGTTGGCGCGAAGAGTACTTTTGAAATCAAAGTTTCGCGCCAGTGGAATAAATGAGCGGCGACGGCGATCGCGGACACCAGAAAAAGCCTGACGAACTTCCTTTGCCAGTCGAGCCATAATTTCTTCAACAACCTGGCGCACTATCCGGCGGGCGGCAGCCAGTACTTCGGGATTCATCAGATGTTTGGTGTGCAAAACGGCGCGTAGCAGGCTTTCAGAAGGCTGCATACGTTCCAGCACGTCGAGATTCGTCACCACATCTTCAATGTCGTAGCGCAGTACGGCATCGCTTTCCAGCCGCTCAATCACCTGTTGCGGAAACAGCGTGTGAATACTGTTGATCCACTCAGGGGTGGTGAGATTTGAGCTACCTAATCCACCGGAACGTTCACCACGCTGGAGCCGTTCAGGATCGCGCCCATACAGCCACTCCAGCGCGTGGTCTATCTGCCGGGCGTTGTCATCCAGCCCACAAAGCGTCGTTTCTGCCGCTTCGCCAAGAATTAATCGCCAGCGTTGTAGCTCACGGGTGGTCAGAAGATCGTTCAGTTCAGACAT
Protein sequences of DBSCAN-SWA_5 >CP029164|2633863:2643308|2636626_2638312_+|AWH70263.1|DBSCAN-SWA MYDFNLVLLLLQQMCVFLVIAWLMSKTPLFIPLMQVTVRLPHKFLCYIVFSIFCIMGTWFGLHIDDSIANTRAIGAVMGGLLGGPVVGGLVGLTGGLHRYSMGGMTALSCMISTIVEGLLGGLVHSILIRRGRTDKVFNPITAGAVTFVAEMVQMLIILAIARPYEDAVRLVSNIAAPMMVTNTVGAALFMRILLDKRAMFEKYTSAFSATALKVAASTEGILRQGFNEVNSMKVAQVLYQELDIGAVAITDREKLLAFTGIGDDHHLPGKPISSTYTLKAIETGEVVYADGNEVPYRCSLHPQCKLGSTLVIPLRGENQRVMGTIKLYEAKNRLFSSINRTLGEGIAQLLSAQILAGQYERQKAMLTQSEIKLLHAQVNPHFLFNALNTIKAVIRRDSEQASQLVQYLSTFFRKNLKRPSEFVTLADEIEHVNAYLQIEKARFQSRLQVNIAIPQELSQQQLPAFTLQPIVENAIKHGTSQLLDTGRVAISARREGQHLMLEIEDNAGLYQPVTNASGLGMNLVDKRLRERFGDDYGISVACEPDSYTRITLRLPWRDEA >CP029164|2633863:2643308|2635506_2635614_-|AWH70261.1|DBSCAN-SWA MRIAKIGVIALFLFMALGGIGGVMLAGYTFILRAG >CP029164|2633863:2643308|2642171_2643308_-|AWH70268.1|DBSCAN-SWA MSELNDLLTTRELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGRDPERLQRGERSGGLGSSNLTTPEWINSIHTLFPQQVIERLESDAVLRYDIEDVVTNLDVLERMQPSESLLRAVLHTKHLMNPEVLAAARRIVRQVVEEIMARLAKEVRQAFSGVRDRRRRSFIPLARNFDFKSTLRANLQHWHPQYGKLYIESPRFNSRIKRQSEEWQLVLLVDQSGSMVDSVIHSAVMAACLWQLPGIRTHLVAFDTSVVDLTADVADPVELLMKVQLGGGTNIASAVEYGRQLIEQPAKSVIILVSDFYEGGSSSLLTHQVKKCVQSGIKVLGLAALDSTATPCYDRDTAQALVNVGAQIAAMTPGELASWLAENLQS >CP029164|2633863:2643308|2634794_2635526_+|AWH70260.1|DBSCAN-SWA MKMLRDPLFWLIALFVALIFWLPYSQPLFAALFPQLPRPVYQQESFAALALAHFWLVGISSLFAVIIGTGAGIAVTRPWGAEFRPLVETIAAVGQTFPPVAVLAIAVPVIGFGLQPAIIALILYGVLPVLQATLAGLGAIDASVTEVAKGMGMSRGQRLRKVELPLAAPVILAGVRTSVIINIGTATIASTVGASTLGTPIIIGLSGFNTAYVIQGALLVALAAIIADRLFERLVQALSQHAK >CP029164|2633863:2643308|2635673_2636405_-|AWH70262.1|DBSCAN-SWA MALYTIGEVALLCDINPVTLRAWQRRYGLLKPQRTDGGHRLFNDADIDRIREIKRWIDNGVQVSKVKMLLSNENVDVQNGWRDQQETLLTYLQSGNLHSLRTWIKERGQDYPAQTLTTHLFIPLRRRLQCQQPTLQALLAILDGVLINYIAICLASARKKQGKDALVVGWNIQDTTRLWLEGWIASQQGWRIDVLAHSLNQLRPELFEGRTLLVWCGENRTSAQQQQLTSWQEQGHDIFPLGI >CP029164|2633863:2643308|2633863_2634790_+|AWH70259.1|DBSCAN-SWA MIEFSHVSKLFGAQKAVNDLNLNFQEGSFSVLIGTSGSGKSTTLKMINRLVEHDSGELRFAGEEIRSLPVLELRRRMGYAIQSIGLFPHWSVAQNIATVPQLQKWSRARIDDRIDELMALLGLESNLRERYPHQLSGGQQQRVGVARALAADPQVLLMDEPFGALDPVTRGALQQEMTRIHRLLGRTIVLVTHDIDEALRLAEHLVLMDHGEVVQQGNPLTMLTRPANDFVRQFFGRSELGVRLLSLRSVADYVRREERAEGEALAEEMTLRDALSLFVARGCEVLPVVNTQGQPSGTLHFQDLLVEA >CP029164|2633863:2643308|2638308_2639028_+|AWH70264.1|DBSCAN-SWA MIKVLIVDDEPLARENLRVFLQEQSDIEIVGECSNAVEGIGAVHKLRPDVLFLDIQMPRISGLEMVGMLDPEHRPYIVFLTAFDEYAIKAFEEHAFDYLLKPIDEARLEKTLARLRQERSKQDVSLLPENQQALKFIPCTGHSRIYLLQMKDVAFVSSRMSGVYVTSHEGKEGFTELTLRTLESRTPLLRCHRQYLVNLAHLQEIRLEDNGQAELILRNGLTVPVSRRYLKSLKEAIGL >CP029164|2633863:2643308|2639585_2640047_-|AWH70266.1|DBSCAN-SWA MKAFNKLFSLVVASVLVFSLAGCGDKEESKKFSANLNGTEIAITYVYKGDKVLKQSSETKIQFASIGATTKEDAAKTLEPLSAKYKNIAGVEEKLTYTDTYAQENVTIDMEKVDFKALQGISGINVSAEDAKKGITMAQMELVMKAAGFKEVK >CP029164|2633863:2643308|2639074_2639545_+|AWH70265.1|DBSCAN-SWA MLSNDILRSVRYILKANNNDLVRILALGNVEATAEQIAVWLRKEDEEGFQRCPDIVLSSFLNGLIYEKRGKDESAPALEPERRINNNIVLKKLRIAFSLKTDDILAILTEQQFRVSMPEITAMMRAPDHKNFRECGDQFLRYFLRGLAARQHVKKS >CP029164|2633863:2643308|2640171_2642175_-|AWH70267.1|DBSCAN-SWA MNSLRPELLELTPQALTALSNAGFVKRSLKELENGNVPEISHENGALIATFSDGVRTQLANGQALKETQCTCGASGMCRHRVMLVLSYQRLCATAQPTEKEEEWDPAIWLEELATLPDATRKRAQALVAKGITIELFCAPGEIPSARLPMSDVRFYSRSSIRFARCDCIEGTLCEHVVLAVQAFVEAKTQQAELTHLIWQMRSEHVTSSDDPFASEEGKTCRQYVQQLSQALWLSGISQPLIHYEAAFNRALQAAEACNWRWVSESLRQLRASVDAFHARASHYHAGECLRQLAALNSRLNCVQEMARRDSVGEVPPVPWRTVVGSGIAGEAKLDHLRLVSLGMRCWQDIEHYGLRIWFTDPDTGSILHLSRSWPRSEQENAPAAKRRLFSFQAGALAGGQIVSQAARRSAEGELLLATRNRLSSVVPLSPDAWKMLSTPLRQPGIVALREYLRQRPPAAIRPLNQVDNLFILPVEECISLGWDSSRQTLDAQVISGEGEDNLLTLSLPASASAPYAVERMAALLQQTDDPVCLVSGFVSFVDGQLTLEPQVMMTKTRAWALDAETAPVVASLPSASVLPVPSTAHQLLMRCQALLIQLLHNGWRYQEQSAIGQAELLANDLTAVGFYRLAHMLGQFRNTESEARVEAMNNGVLLCEQLFPMLQQQG |
10 | Enterobacteria_phage(85.71%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
2738115 : 2744445
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >CP029164|2738115:2744445|DBSCAN-SWA TATGCCATTTAAAAAAATCTCCCGACGCACCTTCCTGACGGCAAGCTCGGCGCTTGCCTTCCTCCATACCCCTTTCGCTCGCGCACTTCCCGCCCGACAAAGCGTTAACATTAACGACTACAACCCTCACGACTGGATCGCCTCATTTAAACAAGCCTTCAGCGAAGGGCAAACGGTCGTCGTGCCTGCTGGATTCGTTTGTGACAATATCAACACCGGCATCTTCATTCCTCCTGGAAAAACGTTACACATCCTTGGAAGCCTGCGCGGCAACGGCAGAGGGCGATTTGTCTTACAGGACGGCAGCCAGGTGACAGGGGAGGAGGGCGGCAGTATGCATAACATCACCCTGGATGTGCGTGGTTCTGACTGCACCATCAAAGGGCTGGCGATGAGCGGCTTTGGCCCGGTAACGCAGATTTATATCGGCGGCAAAAACAAACGGGTCATGCGCAACCTGACCATCGATAACCTCACTGTCAGCCACGCTAATTACGCCATCTTACGCCAGGGATTTCATAACCAGATAATTGGTGCCAACATCACCAACTGTAAATTTAGTGATTTACAGGGGGACGCTATTGAATGGAACGTGGCGATTAACGACAGTGATATTTTGATCTCCGACCATGTCATCGAGCGCATCAACTGTACCAACGGCAAAATCAACTGGGGCATTGGCATAGGCCTTGCGGGAAGCACTTACGATAATAATTACCCGGAAGATCAGGCAGTGAAAAACTTTGTCGTGGCGAATATCACGGGATCGGATTGTCGGCAGTTGATCCATGTTGAAAATGGCAAACATTTTGTTATCCGTAATATCAAAGCCCGCAATATCACGCCGGATTTCAGTAAGAAAGCAGGCATTGATAACGCGACAGTTGCTATTTACGGTTGTGACAATTTCGTGATTGATAATATTGAAATGATTAATAGCGCCGGGATGTTAATCGGCTATGGGGTAATTAAAGGCAAATATCTCTCGATACCGCAAAATTTCCGAGTGAATAATATTCAACTGGATAATACCCACCTTGCTTATAAATTGCGCGGTATCCAAATCTCCGCCGGGAATGCCGTCTCCTTTGTGGCGCTGACTAACATTGAGATGAAGCGTGCGTCGCTTGAGCTATACAATAAACCGCAACACCTTTTTATGCGTAATATCAATGTGATGCAGGAATCCTCAGTTGGACCCGCATTGAGCATGAACTTCGACATGCGCAAAGACGTTCGCGGCGTCTTTATGGCGAAAAAAGAAACACTGCTGTCTCTTGCAAATGTTCATGCGGTGAATGAGAAAGGACAAAGCTCCGTCGATATCGACAGGGTTAATCACCATATTGTTAATGTGGAAAAGATTAACTTTAGATTGCCGGAACGGAGAGAGTAGATTTGCGACCATTCCTGGAAAAATGGAGCCATACTTAGGAACAATGCTACTGCAATCCACAACGAAGCGGCGTAACATCACAAGTAATTCAGTAATCAATTCAGGGTAATTGATGCTGGCGAAAAAAATCGAACAAGCTATAATTCAGCAACCATTTTACAGGTGGATGAAATAATGACGAATTTAAAAGCAGTTATTCCTGTAGCGGGTCTTGGGATGCATATGTTGCCTGCCACTAAGGCGATTCCCAAAGAGATGCTACCGATCGTCGACAAACCAATGATTCAGTACATTGTTGACGAGATTGTGGCTGCAGGGATCAAAGAAATCCTCCTGGTAACTCACGCGTCCAAGAACGCGGTCGAAAACCACTTCGACACCTCTTATGAATTAGAATCTCTCCTTGAGCAGCGCGTGAAGCGTCAACTGCTTGCGGAAGTGCAGTCCATCTGTCCACCGGGCGTGACCATTATGAACGTGCGTCAGGGCGAACCTTTAGGTTTGGGCCACTCCATTTTATGTGCACGACCTGCCATTGGTGACAATCCATTTGTCGTGGTGCTGCCAGACGTTGTGATCGACGACGCCAGCGCCGACCCGCTGCGCTACAACCTTGCTGCCATGATTGCGCGCTTCAATGAAACAGGACGCAGCCAGGTGCTGGCAAAACGTATGCCGGGTGACCTCTCTGAATACTCTGTCATTCAGACCAAAGAACCGCTGGATCGCGAAGGTAAAGTCAGTCGCATCGTCGAATTTATCGAAAAACCGGATCAGCCGCAGACGCTGGACTCAGATATCATGGCCGTGGGCCGTTATGTGCTTTCTGCCGATATTTGGCCGGAACTTGAACACACTCAGCCTGGTGCATGGGGGCGTATTCAGCTGACTGATGCCATTGCCGAACTGGCGAAAAAACAGTCCGTTGATGCCATGCTGATGACTGGTGACAGCTACGATTGCGGTAAAAAAATGGGTTATATGCAGGCGTTCGTGAAGTATGGGCTGCGCAACCTCAAAGAAGGGGCGAAGTTCCGCAAAGGGATTGAGAAGCTGTTAAGCGAATAATGAAAATCTGACCGGATGTAACGGTTGATAAGAAAATTATAACGGCAGTGAAGATTCGTGGCGAAAGTAATTTGTTGCGAATATTCCTGCCGTTGTTTTATATAAACAATCAGGATAACAACGAGTTAGCAATAGGATTTTAGTCAAAGTTTTCCAGAATTTTCCTTGTTTCCAGAGCGGATTGGTAAGACAATTAGCGTTTGAATTTTTCGGGTTTAGCGCGAGTGGGTAACGCTCGTCACATCGTAGGCATGCATGCAGTGCTCTGGTAGCTGTAAAGCCAGGGGCGGTAGCGTGCATTAATACCTCTATTAATCAAACTGAGAGCCGCTTATTTCACAGCATGCTCTGAAGTAATATGGAATAAATTAAGTGAAAATACTTGTTACTGGTGGCGCAGGATTTATTGGTTCTGCTGTAGTTCGTCACATTATAAATAATACGCAGGATAGTGTTGTTAATGTCGATAAATTAACGTACGCCGGAAACCTGGAATCACTTGCTGATGTTTCTGATTCTGAACGCTATGTTTTTGAACATGCGGATATTTGCGATGCAGCTGTAATGGCACGGATTTTTGCTCAGCATCAGCCGGATGCAGTGATGCACTTGGCTGCTGAAAGCCATGTTGACCGTTCAATTACAGGCCCTGCGGCATTTATTGAAACCAATATTGTTGGTACTTATGTCCTTTTGGAAGCCGCTCGCAATTACTGGTCTGCTCTTGATAGCGACAAGAAAAATAGCTTCCGTCTTCATCATATTTCTACTGACGAAGTCTATGGTGATTTGCCTCATCCAGATGAAGTAAATAATACAGAAGAATTACCCTTATTTACTGAGACGACAGCTTACGCGCCAAGCAGTCCTTATTCCGCATCCAAAGCATCCAGCGATCATTTAGTCCGCGCGTGGAAACGTACCTATGGTTTACCGACCATTGTGACTAATTGCTCTAACAATTATGGTCCTTATCATTTCCCGGAAAAATTGATTCCATTGGTTATTCTCAATGCTCTGGAAGGTAAAGCATTACCTATTTATGGTAAAGGGGATCAAATTCGCGACTGGCTGTATGTTGAAGATCATGCGCGTGCGTTATATACCGTCGTAACCGAAGGTAAAGCGGGTGAAACTTATAACATTGGTGGGCACAACGAAAAGAAAAACATAGATGTAGTGCTCACTATTTGTGATTTGCTGGATGAGATTGTACCGAAAGAGAAATCTTATCGTGAGCAAATCACTTATATTGCCGATCGTCCGGGACACGATCGCCGTTATGCGATTGATGCTGAGAAGATTGGTCGCGAATTGGGATGGAAACCACAGGAAACGTTTGAGAGCGGGATTCGGAAGACAGTGGAATGGTACCTGTCCAATACAAAATGGGTTGATAATGTGAAAAGTGGTGCCTATCAATCGTGGATTGAAGAGAACTATGAGGGCCGCCAGTAATGAATATCCTCCTTTTTGGCAAAACAGGGCAGGTAGGTTGGGAACTACAGCGTGCTCTGGCACCTCTGGGTAATTTGATTGCTCTTGATGTTCACTCCACTGATTATTGTGGTGATTTTAGTAATTCTGAAGGTGTAGCTGAAACTGTCAAAAAAATTCGCCCTGATGTTATTGTTAATGCGGCTGCTCATACCGCGGTAGATAAGGCTGAGTCAGAACCCGAATTTGCACAATTACTCAATGCGACAAGTGTTGAAGCGATTGCAAAAGCAGCCAATGAGGTCGGTGCATGGGTTATTCACTACTCTACTGACTACGTATTTCCGGGAACCGGTGAAATACCATGGCAGGAGGCGGATGCAACCGCACCGCTGAACGTTTATGGTGAAACCAAGTTAGCCGGGGAAAAAGCATTACAAGAGCATTGTGCGAAGCATCTTATTTTCCGGACCAGCTGGGTCTATGCAGGTAAAGGAAATAACTTCGCCAAAACGATGTTGCGTCTGGCAAAAGAGCGTGAAGAATTAGCTGTTATTAACGATCAGTTTGGTGCGCCAACTGGCGCAGAGTTACTGGCTGATTGTACGGCACATGCCATTCGTGTGGCACTGAATAAACCGGAAGTCGCAGGCTTGTACCATCTGGTAGCTAGTGGTACCACAACGTGGCACGATTATGCTGCGCTGGTTTTTGAAGAGGCGCGCAAAGCAGGCATTCCCCTTGCACTCAACAAGCTCAACGCAGTACCAACAACAGCCTATCCTACACCAGCTCGTCGTCCGCATAACTCTCGCCTTGATACAGAAAAATTTCAGCAGAACTTTGCGCTTGTCTTGCCTGACTGGCAGATTGGCGTGAAACGCATGCTCAACGAATTATTTACGACTACAGCTATTTAATAGTTTTTGCATCTTGTTCGTGATGGTGGAGCAAGATGAATTAAAAGGAATGATGAAATGAAAACGCGTAAAGGTATTATTTTAGCGGGCGGTTCTGGTACTCGTCTTTATCCTGTAACTATGGCTGTCAGTAAACAGCTATTACCGATTTATGATAAACCGATGATCTATTATCCGCTCTCTACACTGATGTTAGCGGGTATTCGCGATATTCTGATTATCAGTACGCCACAGGATACTCCTCGTTTTCAACAACTGCTGGGTGACGGGAGCCAGTGGGGGCTAAATCTTCAGTACAAAGTGCAACCGACTCCAGATGGGCTTGCGCAGGCGTTTATTATCGGTGAAGAGTTTATCGGTGGTGATGATTGTGCTTTGGTTCTTGGTGATAATATCTTCTACGGCCACGATCTGCCGAAGTTAATGGACGTAGCTGTTAACAAAGAAAGTGGTGCAACAGTATTTGCTTATCACGTAAATGATCCTGAACGCTACGGTGTCGTTGAGTTTGATAAAAACGGTACGGCAATAAGCCTGGAAGAAAAACCGTTACAACCAAAAAGTAATTATGCGGTAACCGGGCTTTATTTCTATGACAACGACGTTGTCGAAATGGCGAAAAACCTTAAGCCTTCTTCCCGTGGTGAACTAGAAATTACCGATATTAACCGTATTTATATGGAACAGGGGCGTTTATCTGTTGCCATGATGGGCCGTGGTTATGCATGGCTGGACACGGGGACACATCAAAGCCTGATTGAGGCAAGCAACTTCATTGCCACCATTGAAGAGCGCCAGGGACTAAAGGTTTCCTGCCCAGAAGAAATTGCTTACCGTAAAGGATTTATTGATGCTGAGCAGGTGAAAGTATTAGCTGAACCGCTGAAAAAAAATGCTTATGGTCAGTATCTGCTGAAAATGATTAAAGGTTATTAATTAAATGAACGTAATTAAAACAGAAATTTCTGATGTGCTGATTTTTGAACCAAAAGTTTTTGGCGATGAGCGCGGCTTCTTTTTTGAAAGCTTTAACCAGAAAGTATTTGAAGAAGCTGTAGGCCGCAAAGTTGAATTTGTCCAAGATAACCATTCGAAGTCTAGTAAAGGTGTTTTACGCGGGCTGCATTATCAGTTAGAACCGTATGCGCAAGGGAAATTGGTACGTTGCGCTGTTGGTGAGGTTTTTGACGTAGCGGTTGATATTCGTAAATCGTCACCAACTTTTGGCAAATGGGTTGGGGTGAATTTATCTGCTGAGAATAAGCGCCAGTTGTGGATACCTGAAGGATTAGCACATGGGTTTTTGGTGCTGAGTGAGACGGCGGAGTTTTTATATAAAACGACCAATTATTATCATCCAGAGAGTGATAGAGGGATTATTTGGGATGATCCTGACATTGACGTAAAGTGGCCTTTAAGTATCCATAAACCGATTTTATCTATAAAAGATGAAAAACAAAAGATGTTTAAAGAAATGATAGCGTTGGAGAAATGGAATGGAGAATACTAA
Protein sequences of DBSCAN-SWA_6 >CP029164|2738115:2744445|2742035_2742935_+|AWH70347.1|DBSCAN-SWA MNILLFGKTGQVGWELQRALAPLGNLIALDVHSTDYCGDFSNSEGVAETVKKIRPDVIVNAAAHTAVDKAESEPEFAQLLNATSVEAIAKAANEVGAWVIHYSTDYVFPGTGEIPWQEADATAPLNVYGETKLAGEKALQEHCAKHLIFRTSWVYAGKGNNFAKTMLRLAKEREELAVINDQFGAPTGAELLADCTAHAIRVALNKPEVAGLYHLVASGTTTWHDYAALVFEEARKAGIPLALNKLNAVPTTAYPTPARRPHNSRLDTEKFQQNFALVLPDWQIGVKRMLNELFTTTAI >CP029164|2738115:2744445|2738115_2739510_+|AWH70343.1|DBSCAN-SWA MPFKKISRRTFLTASSALAFLHTPFARALPARQSVNINDYNPHDWIASFKQAFSEGQTVVVPAGFVCDNINTGIFIPPGKTLHILGSLRGNGRGRFVLQDGSQVTGEEGGSMHNITLDVRGSDCTIKGLAMSGFGPVTQIYIGGKNKRVMRNLTIDNLTVSHANYAILRQGFHNQIIGANITNCKFSDLQGDAIEWNVAINDSDILISDHVIERINCTNGKINWGIGIGLAGSTYDNNYPEDQAVKNFVVANITGSDCRQLIHVENGKHFVIRNIKARNITPDFSKKAGIDNATVAIYGCDNFVIDNIEMINSAGMLIGYGVIKGKYLSIPQNFRVNNIQLDNTHLAYKLRGIQISAGNAVSFVALTNIEMKRASLELYNKPQHLFMRNINVMQESSVGPALSMNFDMRKDVRGVFMAKKETLLSLANVHAVNEKGQSSVDIDRVNHHIVNVEKINFRLPERRE >CP029164|2738115:2744445|2742992_2743871_+|AWH70348.1|DBSCAN-SWA MKTRKGIILAGGSGTRLYPVTMAVSKQLLPIYDKPMIYYPLSTLMLAGIRDILIISTPQDTPRFQQLLGDGSQWGLNLQYKVQPTPDGLAQAFIIGEEFIGGDDCALVLGDNIFYGHDLPKLMDVAVNKESGATVFAYHVNDPERYGVVEFDKNGTAISLEEKPLQPKSNYAVTGLYFYDNDVVEMAKNLKPSSRGELEITDINRIYMEQGRLSVAMMGRGYAWLDTGTHQSLIEASNFIATIEERQGLKVSCPEEIAYRKGFIDAEQVKVLAEPLKKNAYGQYLLKMIKGY >CP029164|2738115:2744445|2743875_2744445_+|AWH70349.1|DBSCAN-SWA MNVIKTEISDVLIFEPKVFGDERGFFFESFNQKVFEEAVGRKVEFVQDNHSKSSKGVLRGLHYQLEPYAQGKLVRCAVGEVFDVAVDIRKSSPTFGKWVGVNLSAENKRQLWIPEGLAHGFLVLSETAEFLYKTTNYYHPESDRGIIWDDPDIDVKWPLSIHKPILSIKDEKQKMFKEMIALEKWNGEY >CP029164|2738115:2744445|2740614_2740878_-|AWH70345.1|DBSCAN-SWA MHATAPGFTATRALHACLRCDERYPLALNPKNSNANCLTNPLWKQGKFWKTLTKILLLTRCYPDCLYKTTAGIFATNYFRHESSLPL >CP029164|2738115:2744445|2739684_2740578_+|AWH70344.1|DBSCAN-SWA MTNLKAVIPVAGLGMHMLPATKAIPKEMLPIVDKPMIQYIVDEIVAAGIKEILLVTHASKNAVENHFDTSYELESLLEQRVKRQLLAEVQSICPPGVTIMNVRQGEPLGLGHSILCARPAIGDNPFVVVLPDVVIDDASADPLRYNLAAMIARFNETGRSQVLAKRMPGDLSEYSVIQTKEPLDREGKVSRIVEFIEKPDQPQTLDSDIMAVGRYVLSADIWPELEHTQPGAWGRIQLTDAIAELAKKQSVDAMLMTGDSYDCGKKMGYMQAFVKYGLRNLKEGAKFRKGIEKLLSE >CP029164|2738115:2744445|2740950_2742036_+|AWH70346.1|DBSCAN-SWA MKILVTGGAGFIGSAVVRHIINNTQDSVVNVDKLTYAGNLESLADVSDSERYVFEHADICDAAVMARIFAQHQPDAVMHLAAESHVDRSITGPAAFIETNIVGTYVLLEAARNYWSALDSDKKNSFRLHHISTDEVYGDLPHPDEVNNTEELPLFTETTAYAPSSPYSASKASSDHLVRAWKRTYGLPTIVTNCSNNYGPYHFPEKLIPLVILNALEGKALPIYGKGDQIRDWLYVEDHARALYTVVTEGKAGETYNIGGHNEKKNIDVVLTICDLLDEIVPKEKSYREQITYIADRPGHDRRYAIDAEKIGRELGWKPQETFESGIRKTVEWYLSNTKWVDNVKSGAYQSWIEENYEGRQ |
7 | Enterobacteria_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_7 |
3221809 : 3273222
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >CP029164|3221809:3273222|DBSCAN-SWA GTTATATTTTTTCGATCTCGACCAGATTAGTGTGCTGCGGGTTTCCCTTCGCCAGTGGTGAAGGGCGCAGAGTGGTTAGCGTATTCACACAGCCGCCATGGTCGATTTTATCGCCAGACATATTGGCCTCGTGCCAGGCTCCCTGGCCCATAGCGCTAACCCCAGGGAGAATGCGTGGTGTCACTTTGGCTGGTAGCCGAACTTCGCCACGATGGTTATACACCCGCACCATATCGCCGTTGGCAATCCCACGTTTCTGTGCGTCAATAGGGTTGATCCACACCTCCTGACGGCAGGCAGCCTTCAGGACATCAATGTTGCCGTAGGTCGAGTGAGTACGGGATTTGTAATGGAAACCAAACAGTTGCAGTGGAAAAGTACTACGTTCAGGGGAGTCCCAGCCTTCAAAGGTTGAGGCATAAACCGGCAGTGGGCTTATCACTTCATCTTTTTCCAGTTCCCAGGTACGGGCAATTTCCGCCAGCTTGCTGGAATAAATTTCAATCTTACCGGAAGGCGTTTTAAGTGGATTTGCCTCGGGGTCGTCACGAAATGCTTTGTAGGCGACAAAATGACCATTGGGATCTTTACGCTTATAGATACCCATTTTTTTCAGTTCGTCGTAAGACGGTAACGCCGGATCTTTGGCAAGCATTTTGGCGTACAGATGTTGTAACCATTGTTCCTGCGTGCGACCTTCTGTGAACTTTTGATAGACGTCAGGTCCAAGACGTTTCGCGACCTCACTCAGGATCCAGTAAATCGGTTTGCGTTCGAATTTTTCGCTGGTGACTGGCTGAAGGAAAATGAGATATCCCATGTTACCGGCGTAGTCGTTAGGAATAATATCTTCCTGCTCAACGGTCATCAGGTCTGGCAGCAGAATGTCGGCATATTTTGCCGATGAGGTCATAAAGTTTTCGATGACCACAATCATTTCGCATTTCGATTCGTCCTGCAGAATTTCATGCGTTTTGTTGATGTCAGAATGCTGATTAACCAGGGTATTTCCCGCGTAGTTCCAGATGAACTTAATGGGCACATCCAGTTTATCTTTGCCCCGGACGCCGTCGCGGATTGCCGTCATTTGCGGGCCATGATCGATAGCATCCGTCCAGCTGAAGCAGGAGATTGACGTTTTGACCGGATTATCCAGCACCGGCAGGCGTTCTATGGTAATGGTATAGGTCGATTCACGCGCGCCACTATTCCCGCCGCTGATGCCGACATTGCCCGTCAGAATAGGCAACATCGCAATAGCGCGTGCAGTCAGCTCGCCGTTTGCCTGGCGTTGCGGTCCCCAGCCCTGGCAGATATAAGCGGGTTTTGCTGTGCCAATTTCACGCGCCAGTTTGATGATACGGTCCACCGGGATACCGGTAATTTGCGAAGCCCACTGCGGCGTTTTCGCTGTTTTGTCGTCTCCTTCACCAAGAATATAGGCTTTATAGTGACCATTTTTTGGTGCATCTGCGGGTAAGGTTTTTTCGTCATAGCCGACGCAGTATTTATCGAGAAAAGGTTGATCAACGAGATTTTCGTTAATCAATACCCAGGCAATACCCGCAACCAGCGCGGCATCGGTGCCCGGGCGAATAGGAAGCCATTCGTCTTCACGACCGGCAGCCGTATCGGTATATCGCGGATCGATAACAATCATTTTTGCGTTCGATTTCTCGCGCGCTTTTTCAAGAAGATAAGTGATGCCACCACCGCTCATGCGGGTTTCTGCCGGGTTGTTACCAAACATCACGACCAGCTTGCTGTTTTCAATATCCGTTGTGCTGTTGCCATCATTACTGCCGTAGGTGTAGGGCATGGCACAGGAGATTTGCGCGGTGCTGTAGGAGCCATACTGGTTGAGTGAACCGCCGTAGCAGTTCATCAGGCGTTTGACCGCCGAGGCGGATGGCGAAGAGCGGGTCATATTGCCGCCAACAATCCCCGAAGAGTACTGAATATATACAGCCTCATTGCCATATTGTTCGACGGTTTTTTTCAGGCTACTGGCGATAGTATCCAGGGCTTCATCCCAGCTAATCCGTTCGAATTTGCCTTCGCCGCGTTTGCCCACGCGTTTCATTGGGTAATTCAAGCGGTCGGGATGATTAATACGCCGACGGATGGAGCGACCGCGCAAACAGGCGCGTACTTGATGATTGCCATACTCATCGCTGCCGGTGTTGTCAGTTTCCACCCAGGTCACTTCATTATCTTTAACATGTAGACGAAGTGCACAGCGGCTACCACAGTTGACGGAACAGGCACCCCAGACCACTTTTTCGCTGGCTTGTTGTACCGCTGCCGCTGCACTGCGCAGGGTGAACGGTAAAGAAAAACCGCCTGCAGCCAGCGCCAGAGAACCTATCGCGGTAGATTTAACGAGTGTTCTGCGGCTGATGCCCACCATTTGTTCATTTTTGGACATAACTCACTCCCTGTTCTTTATCGTTATATAAAAGTTTATATATTGAATATTTAGCGCGCTAACAATAGAGGGAGTCTACCTATTTTGGGTTAAGAATTATTAATCCATATCAATAGAAGGGTATGAGTAATAAGGTGGGATTATGTTGTATATTCAAATCGCCGGATGTGTCGTATCCGGCGTGCAGTCGATAATGTATTACTGCGGTTCGGCAGGCGCGCCATCCTGGGTAGACTGCGCGGGAGCAGAGACGTTACCGCTGGTGGTGCGGGTGTAGAGAATTTTATGCGTATCATTAGCGCAATGCCCGACGACCTGGGAATCAGGCTGATCAACCTGGTCATTGGGTACAATACTTAACGTGAAGCTGCTTTCGGGTACGCCATTATTGATAATGCGCTGTGATATATCGCTCTGTATGCGCTCACAGGATCCCGGCGCGGCGAGTACCGCGGGTGAGGCGAGGGCGAGCAGAAGCGCGGCACAGCAGGTTGAGCGTTTCATCATAAGCTCCTTACGCGAAGATAACTTCTTTAAGCATAGCATTTAACGTGTAAAGTACTGTATTTGCTACTATGATTGAGAATCATCTCTACTCTCTGGTGACTGTTGTGAAATACAAATTACTGCCATGCTTACTCGCGATATTCCTCACAGGATGTGACCGCACAGAGGTAACACTTTCATTTACCCCTGAGATGGCCAGTTTCTCTAATGAATTCGATTTTGATCCGCTGCGTGGTCCGGTAAAAGATTTCACTCAGACATTAATGGATGAGCAAGGTGAAGTGACGAAACGTGTTTCGGGGACTTTGTCGGAAGAAGGCTGTTTTGATTCACTCGAATTACTGGATCTGGAAAATAATACCGTGGTTGCCCTGGTACTGGACGCCAATTATTACCGTGATGCCCAGACACTGGAGAAGAGAGTTCGTTTGCAGGGAAAATGCCAGCTAGCAGAATTACCTTCTGCCGGGGTGAGCTGGGAAACCGATGATAATGGCTTTGTGATTAAAGCCAGCAGCAAACAGATGCAGATGGAATATCGCTATGATGATCAGGGGTATCCGCTGGGTAAAACCACGAAAAGTAACGACAAAACATTATCTGTCAGCGCTACGCCATCAACGGATCCGATCAAAAAATTAGATTACACAGCGGTTACTTTACTGAATAATCAAAGGGTTGGTAATGTAAAACAGAGCTGTGAATATGACAGTCACGCCAATCCGGTGGACTGTCAGCTAATCATTGTTGATGAAGGAGTAAAACCCGCCGTCGAACGGGTTTACACCATCAAAAATACTATCGATTATTATTAATGCTATTGTGCGGTCGGCTTCAGGAGAGTCTGACCCGGTGTTTTGTGCTCTGCCAGATACTGATGCTGGAATATGCACATGCGAATGGCATTACGATATTGACCATTAATAAAGAACTCATGCATCAATTCACCTTCAACCGTAAAGCCAAGCTTGCGGTAAATGTGAATCGCTTTTTCATTCTCTTTATCGACGATCAGATACAGCTTATAGAGATTGAGAACGGTGAAGCCATAGTCCATTGCTAATTTTGCGGCACGCGTTGCCAGACCTTTCCCCTGATATTCCGGGGAGATAATTATCTGAAATTCTGCGCGGCGATGAACATGGTTAATTTCCACCAGCTCCACCAGACCGGCTTTTTCGCCGTCACATTCCACCACAAAGCGCCGTTCGCTCTGATCGTGAATATGCTTATCGTACAGATCAGAGAGTTCAACAAAGGCTTCGTAGGGTTCCTCAAACCAGTAACGCATCACACTGGCGTTATTGTCGAGTTGATGTACATAGCGTAAATCTTCACGCTCCAGCGGGCGTAGCTTAACACTGTGGGCGCTTGGCATAACGTGTCCTTACATTCCTTAAATCAATAACAGGTTAGGGGGTAATAACGCGGCCAGTTCGACGGTCCAGGCAGCGCAAAGTATTTGGCTCCCAGTAGGCATTGATGTTGGCGCTTTGCTCACATTTATCGCGGTTATCAAAAGCAGCGTCGGCTTTATCCCACTCTTTTTCAGTGCGTTTATTCACTTTCTGGCGCAGATTACGCGTGTCATTCCATTGCTCTTTTTCCATAGCGGCGCGCTGGCGGCTTTGTGCACTGTCGCCAGACTCAATCACCAGTTTGTTATTTTCGGCATGAACAGTTGTGCTCAATGCCAGTGCGCAAGGCAGCAGAAAAGCGAGCAGGCCGATTCGTTTGCTGAGAGTGATTTTCATAATTCATTCCCTGTATGAATGATTAAAGCTGATTCTACACCATCCACTGCGAACGCAAAACGTTCAAGGAGGGTGTTTATATTGATGATATTATGTCGCCCTATAACTATACGTGATGTCAATAAGTGACAAAGATGATTAAAACAACGTTACTATTTTTTGCTACTGCGCTGTGTGAAATTATTGGATGCTTTCTGCCCTGGTTGTGGTTAAAACGAAACGCCAGTATCTGGCTGTTGCTTCCGGCGGGGATTTCACTGGCGCTGTTTGTCTGGTTGTTAACGTTGCATCCTGCGGCGAGTGGGCGTGTTTACGCCGCTTATGGTGGCGTTTATGTCTGCACGGCGTTGATGTGGCTGCGCGTTGTGGATGGCGTGAAACTGAGTCTTTATGACTGGACGGGGGCGTTGATTGCGCTTTGCGGCATGTTGATCATTGTTGCGGGCTGGGGGCGCACGTAGGAATATAAATCCCTTTTATCAATAAGATACGAGGATGTGTCAGCTGACAAAAGGTATTCTATTTCATCTTTTGTCAACCATTCACGGCGCAAATATACGCCTTTTTTTGTGATCACTCCGGCTTTTTTTGATCTTCATACTTGTATGGTAGTAGCTCAGTTGCGTAGATTTCATGCATCACGACAAGCGATGCAAGGAATCGAACATGAAGATCGTAAAGGCTGAAGTTTTTGTTACCTGTCCGGGGCGTAATTTCGTCACATTAAAAATCACCACTGAGGACGGTATTACGGGCCTTGGGGATGCCACCCTAAATGGACGTGAGCTTTCCGTGGCCTCTTATTTGCAGGATCACCTTTGTCCGCAGCTTATTGGTCGCGATGCACACCGTATCGAAGATATCTGGCAGTTTTTCTATAAAGGTGCTTACTGGCGTCGCGGCCCGGTGACGATGTCGGCCATTTCAGCGGTTGATATGGCGCTGTGGGATATTAAAGCCAAAGCTGCCAACATGCCGCTTTACCAGTTACTCGGCGGCGCGTCTCGTGAAGGGGTGATGGTTTATTGCCATACCACCGGTCACAGTATTGATGAAGCTCTGGATGATTATGCTCGTCATCAGGAGCTGGGATTCAAAGCCATCCGCGTGCAGTGCGGAATCCCTGGTATGAAAACCACCTACGGCATGTCGAAAGGTAAAGGTCTGGCTTATGAACCCGCAACTAAAGGACAGTGGCCGGAAGAGCAGCTGTGGTCGACGGAGAAATACCTCGATTTCATGCCGAAATTGTTTGACGCGGTACGTAACAAGTTTGGTTTTAATGAACATTTGCTGCATGACATGCACCATCGCTTAACGCCTATTGAAGCGGCGCGCTTTGGTAAAAGCATTGAAGATTATCGCATGTTCTGGATGGAAGACCCGACGCCTGCGGAAAACCAGGAATGCTTCCGTCTCATTCGCCAACATACCGTCACACCCATCGCGGTGGGTGAAGTCTTCAACAGCATCTGGGACTGCAAGCAACTGATTGAAGAGCAACTTATCGATTATATCCGTACCACGCTGACCCATGCAGGCGGAATTACCGGTATGCGCCGGATTGCCGATTTTGCTTCGCTGTATCAGGTACGTACTGGCTCACACGGTCCTTCCGATTTGTCACCAGTCTGCATGGCTGCGGCGCTGCACTTTGATCTGTGGGTCCCCAATTTCGGTGTCCAGGAGTACATGGGTTATTCCGAACAAATGCTTGAAGTCTTCCCGCACAACTGGACTTTCGATAACGGCTATATGCATCCGGGAGACAAACCGGGTCTTGGCATCGAATTCGATGAAAAGCTGGCGGCGAAATATCCCTATGAACCTGCTTATCTGCCAGTCGCACGTCTGGAAGATGGCACGCTGTGGAACTGGTAAGGAGTAAGATAATGAAAAGCATTTTAATTGAAAAACCGAATCAACTGGCAATTATCGAACGCGAAATACCCACCCCGTCAGCGGGTGAAGTACGAGTAAAAGTGAAACTTGCCGGAATTTGTGGTTCAGATAGCCATATTTATCGTGGGCATAATCCTTTTGCGAAATATCCGCGCGTCATTGGTCATGAATTCTTTGGCGTCATTGATGCAGTGGGTGAAGGCGTGGAAAGCGCCAGAGTCGGTGAACGCGTTGCTGTCGATCCGGTGGTCAGCTGTGGGCATTGCTATCCGTGCTCTATAGGTAAGCCGAACGTTTGTACGACACTGGCTGTATTAGGTGTGCACGCTGACGGTGGTTTCAGTGAATATGCCGTGGTTCCGGCAAAAAATGCGTGGAAAATTCCTGAAGCAGTGGCCGATCAATATGCGGTGATGATCGAACCTTTTACCATTGCGGCTAACGTTACCGGTCATGGTCAACCGACTGAAAATGATACCGTTCTGGTTTACGGTGCCGGTCCAATCGGCCTGACGATCGTTCAGGTATTAAAAGGCGTCTATAACGTTAAAAATGTGATTGTTGCCGATCGCATTGATGAACGACTGGAAAAAGCGAAAGAGAGCGGGGCAGACTGGGCGATTAATAACAGCCAGACACCGCTTGGCGAGAGTTTCGCTGAAAAAGGCATCAAGCCGACATTAATTATCGATGCGGCTTGTCATCCTTCTATCCTGAAAGAGGCTGTAACGCTGGCTTCTCCAGCGGCACGTATTGTATTGATGGGCTTCTCCAGTGAACCGTCTGAAGTGATTCAGCAAGGAATTACCGGAAAAGAACTCTCTATTTTCTCTTCACGCTTAAATGCAAATAAATTTCCGGTCGTTATCGACTGGTTAAGTAAAGGGTTAATTAAACCAGAAAAATTAATTACCCATACGTTTGATTTCCAGCATGTTGCTGATGCCATTAGTTTATTTGAACAGGATCAAAAACATTGCTGCAAAGTCTTACTCACTTTTTCTGAATAATACCAATAACGGCGAGTAAGTAGTACGCATCTTACCTCTTTTTTAGAGATAACCATTATGACAATAGAAAAACACGAAAGAAGCACTAAGGATTTGGTGAAAGCAGCAGTATCGGGATGGCTGGGCACTGCGCTTGAATTTATGGATTTCAAGAGTCATACGTGTCAACTATTTGATAAATATTAAATTAATTTTTCATTGCTTCGTTATGGGGCATGGTTGGGGCAAACTCGCTTAACTGTGTATTTAACAAAGCTACCTGTGCATTATTGTTTTCAGACATCCATTTTCCGTATACCTGAAATACCATTTGCGCATCTGCATGGCCCATCTGGTTTGCTATAAATGCCGGGTTAGCACCAGCTGTCAGCGACCAGCAGGCATAAGTATGTCTCGACTGATATGATTTTCGATGGCGGAGTCCGGCACGTTTTATCGCTGCGTCCCACATCTGCCTTATTGAGTCAACGGTAAAATGGTCACCATAATTTTTTACTCTCGCTGACACTTCAGGTTGAAAAACAAAGGTGCATTTTTGTTTTTCTGTTCTGCCATACTCTCTGAGGTGAACATCAATGATATGCTCTTTGCTCAGTCTCGTTAATGTCATCTGACTCCGGAGAGCGTCGATTGCTGGCTTAATAAGATGAATGACCCGATTGGTTCCCGCCTGTGTTTTTGGTACCGTGAAACGGTCTTTTGCTAAATTTCTCCTGATCATCATTGTTCCATTTTTCAGATCTATGTCCTCCCATCCAAGTGCACACAGCTCACCAGGGCGAACGCCAGTATAAACAGAAACACACCATAAATTTTTTGCTTGCTGATTTCTGCACGCATCGATAAGACGGATAAATTCCTCCCGCGAAAGAGGATCCGGAATGGTTCTTGATTCCTTTAATGGCGAGATCCCCTTAAACGGATTATCTGGCAGGTAACCGTTATCAACACCAAACTGGAACACGGCGTTAAGATTTGTCATGTAATTATTTACAGTTACAGCCGATCTCCCTGGTTGTGTAACAATATAGTTACTTTTGGGGATCTGGTATCCAGTCAGTAGCTCTTTACGAACCTCCAGTAATTTTTCTTTATTAATCGATGAGGCAAGATTTTTTTCACCGATTATGCTCAGGATATTTTTGATGACGGCACGGTATGTGTTGAGTGATGTTTTGGCGACTTCAGTTTCTTTCAGTGCCAGAAATTTTTCAGCCAGTTCTTTTATGGTTAAATCTTGTCGGGCCTCACCAAATTTTTCCAGATTGCGTGAGGAGGGAAACTGTTTTGCATAGTCGAAAACACCAGTTTTTATTGCGTAACAAACAGAGGAGCGTAGTTCACCTGCAACGCGCCTGTTTTTTGCTGTGTCAGGAACCCCCAGATTTTCCCTGACTCTTACGCCTTTATAAACAAACCAGATACGTAATTTCCCTCCATGGTTTTCCACGCCTGTCGGATATTTCATTTCAACTTCTCTCATTAGTTAGTGTGGCTTTTAGTCAAGTAAGATGACGTCTTGGTCTCGCTGATGCCTGGCGCTCAATCCAGCGATCAATTTCTTCCAGGTTGTAAAAGCATGGACTGTTATCCCATGGCATACTGTCATGAGCGACATGCTTATATTCCCTTCCTTCCATAAACGATTTTTCCCGGGCCTTTTTTAACGTACCTTTTTTTATTCCTTTCAGCGCAATTAACTGCTCTTCGGATACCCATTTGCCGGGAGAGACAATCATGATTACTTCGCTCATCGATTTCTTTATCTCTTACATCAGACGAGCGCCGGTTGCAGAATACCAGTCACAACCGGCGACAGTTGAACATTAAGAATCAGCCTGACTCGGGATCAGTTTTTGCCAGATAACTGAAACGTATTTTGCCTGGTAACGGGCGTCATCAAGTGCATTATGGCGCTCACCTTCGAATGGAATAGCCGTTCTGGCATCGAAGTCTATGGCTTTCCCCAGCTCAACGATTGTGCGTACATCGCGATCGTTGTAGTAACGCCACGGGCAGGGGATCCCCTGCCGTTCGTATGAACGGCGCAAAATCGTGTTGTCGAAGTTGGCTCCATTTCCCCAAACCTGAACAAAAAATTCACTGGAGTTTTCGTCGATAAATTCCCGCAATTGTAACAGTGCATCATCTAACGGGATTTCATCGGTCATAATGGCAGATTGCGCTTCGCGTGATTGCTTAAGCCACCATTTAATGGTGTCCCGATCAATGACTCCGCCAGCAGTTTCCAGATCGATAGTCTTACTAAATTCCGGTCCCATATCTCCGGTTTGCGGATCGAAAAATATTGCACCTATTGAGATGATCGGGGCATCAGGATTTTTTCCCATGGTTTCAAGGTCGATCATTAGATGGTCACACGTCCTGCTGGTGGATGTGATTTCGTGATGACCGTTCACCTTAATTGAGTGATCTGCCGTCTCGCCAGTTTCATTATCGCTATTGTGATGCTGATTGCCGCCAGTGTTCTCCTTGTGTGGATGTTCAGCGCCTTCCATTTCCTCCGGATCATCTTCCTGAACTTCAACCTGATACTCTTCATCGAATGTTTCTTGGTATGTTGCGTCGCCCATCACCGCGCCACAATCAGGGCAGTTGCCGCCGCCGGTCTGACCGCAGGCGGTGCAGACTTTTTCCACTTCCTGTTGCGCCACTGGTTCAGGCTGTTTCGTTTCTGGCTCGTTTTGTAACGCATTTGTGCTGTTTTGTTCCGCTTTTTGGTAGTTCCGTTCCGATTCATGCTGGTTCTGGTTCACAGAATCGCGGGTCTGGATCCCCTTAACCCATTTCGGATCATTCGGGTCACTAATCCCTTCAACAAATTCACCACGTGATGCAGCAAGCAACTTATCGGCGTCAGGCTGGCTGATATTGGCTGCCTGCATAATTTTGTTTACTTCGTCAGCGGTAACTTTTATCGGCTCTGGTTGTTCTGAATCTTCAGCGGTATCTACATTTTGCGGTAAGCCCGTGTATGTGCCATTTTTTCGGGCAAAATATTCTTCTTTTGTGATTTCAGTGGCGCCAGCAGCCAGTGCCTTATCCAGACCAGAAAGTTTGTTTGCGCGACCGTATTTTTCTCCGTCCTTATCTGCGAAGAGGAAATAGAACGGCCCCTCACGCTCTACAGATGGTTCAGCTTCCGGCGCGGTTTCATTTTTTGGGATATCAGATACCTCAGTTTCCACTGCATCAGTTTGTGTTTCTGATGACTGGAGAACATCAACAGTGCCCAGGTCTGTTTCTTCATTCTCAAACACGCCCTTTGTCGTCAGGTATTCGCAGATATATTTGTTCAGTGCTATGGGATCTTTGTGAATGTCGATCGGACGCTCACGGACAAGGCCAAAAATAGTCTGGCGGTCGTAGCGAAGGGCATCAGGCTGTTTGCGCATTGATGCCGAGATACGCTTCCAGTCTTCGCGGTCGTTGTCGATAACTTCTTTTTTTGCCCAGCGATGGATGCTGCCGTCAATGTTTCCGGCATCCACATCACCAGGCCAGAGAGCGTAGGCCAGTTCGTCATCCAGTGTTTTCCATGTCTGCTTGTATTCGCGATGAATGGCAGCAATGACCGGGCTGATTTTTCCTGTTGAATTTTCAGTGTACTGTTGATTGGCTCTGGCGCGGGCGAGATCAACAACAGACGTGTATTTTCCGGTTTCCTTGCGTTCACCTTCGCGACGTTTTTTCCAGATGCGCATCTCTGCCTGAATTTCGGGCCATTTAGCACCAGGAATACATTTATGCTTAACCCACCCAATGGCGTGCAACTTAAGCTCCGGATACATGGCGTTAACTTCTGGCATTTTCATCAACGCTTCAACGATATGTCCGTCGAATGTTGCCATGTCTTCCTGCAACAATTCCTGTGCGCTAATAACCATATCAACGGTGATGTTTTCACATGTGTCGAACTTAACCATGACAGCGTTCTGTACTTCAGGGGCCAGCTTGTCAAAAGTGACGTTCATCGGATCGGATTCAGTCTCAACCGGGACAAAAGAAGCAGACTCCTCATCCCAGCGGTTTTCCTGCATATATTCAGCATCCCATGAATCGAGGGCAGGGCGGGGTATGCCAGGTTTATCCTCGCAGACAATAAATTTATAAGCGCAGTCCTGAGCAGCCGGATAATGTTCCAGGAACTGCCAGTGAAATTTTGCTCGAGCACGGCGTTCGTCGCCAGCTTCAATGGCTGTGGCTACAGCCACAGCGCCTTCTTCCCTTGTTGCCAGTTCGTCAGGAATAGCGGCGCAAATAAAGACTTTACTCATTTTGTTTTAACCTCATGACAGATTTAAGGATGAACAAATCCCTGCCATTGCTGGCATATAAGAATGAAACCGGATATTTATTACGGAACTGTTTTAAAGACCTGCCGGGATTTCGATATTATCCTGGTGAATAACTTTATCGACCGGGTAACAGTTACCGGGAATTTTCTGTTCGGTTGCTGCTGTCATACACTCCTGCATTGTCCTGTGAACACTGACTGCAATATCAACTGGCTCTCCGGAAACAAGAAAAACTGTCAGAACAAGCGCAAATGCTGAATTCATTGTGCACATCCTTTTGGCATCAGACGTAAACGAGCCAGCATTGAAACAATGCATATTTTATTTAATAGCTCCCGTTCTTGTTTTCTCTTGTTAATGGCATCTTCAGTAAATACAGGGTTACTGATAGTGACACCAATTTCAAAACAATCTTCAGACGTATTAACGTTTGGTAATAACGTTTTCATTATCGCGTCCTCAACAATGAATTTTGTGATGCAGTGCCTGGTGCCTCCAGGTGACGTTAACCAGTTAACAATTAACGCCGGATACAGAGAATCCACCCATAACACTGTTTTTGGTTTTAACTGTTCCGCGTGCGCTTAGCCGCATTCACCGCATCACAAAATTCACTTTAAAAACGGCGGCAGAGCAGTCACGGAGTAAAACTGATACCGCCAAACGTCACCAGAAAATTGATAACAGAGGGCGTTGCAGCGGGGTTGTCACTTAAGCGTATGGTCAACCTGACAACTCGGTGTCCTCAACGGGGAAGGAATAACCCTGCCATACTTACCGCCGCGCCATTTCGCGGGTTGCCACAACCGGAAGCGCACGGTCGAATTAAATTTAACGACACCGTACAGTGAGACGAACTTCGCCGTGCGCTTTCGTGTTGTGTGCCTGCTTTTAACCACGTCAGGCGAGGTGGTATCCTTCTTATTCCGAATAACCAAGAAGGAAATCTATATGACTAAAGAAGAATTTGTCTCTTATATTTTTGATAAAACGGTTGAAATGTATGCCGCTACTTACGGGTCCTGTAATCCTCTGAATAAACCAGAGGGGAAAGATGATTTCGACAAAATTTACCGCTTCTTGGAGGACCGCTATATCAAAAGGTTAGAGGACGCAGGGATCAAATCCCCAGTGAAGTCACCATTGTCCTGAGAACTTGCAGGACGTCATGATCGTAACTTCCATCCAAACCGCGACGGCAAATTGCTTCTCGTATTACCGGAAGCAGTTCGCTGGAAATCTCGGTGCATATTTCACCTGATAATACGCCTGGCTCAAGTGAGAATATTGGTGAACTTACGGTCTTGGTCTCGACAGTTTCAGAGTCAGTGCCAACATTATAAAGCTCAACGAAAGCGGTCTTGATTTTCCGGGCCAGATCTTTTGCTGGCTCGCTTGCAATATCTTTCCCGATTTCTCGCAGCACAGAATGCAATGTATGAGCTGCTGTTTTCTGTACATCAGACGGTAAATCTTTAAATTCCATCGTCAGCCTCATCAGTCAGTGTTTCTGGCTAACCAGCAACGCGCGCCAGATTCGGTTTTAAACGTTTTGCTTTTGGTATATGTCATCGCGGTGAACGTACCGTCCTGGTTGGGGAACACGCCACATACCAGAGATTCGCTGTTGCCAAGATCGATAGTATCCATGCTGACCTCATTTCCCCTTAACGCCGGGGTCGCGGAACTGTTTGCTGAGAACACCGTGCGGTGTCTTGATGGAAAGTAATTTAGAATAACCTAACATGAGAGGCAAGTGTTTTTTGTTAGATTGATCTAACAAAAAGAGTGGGCGCAACTAATCACTTGAAAAGAATGTTATTTTATTGATTTATTTTTACGCGCTTTAAGCATTTCTTCGAAGAGTTTGTTGAAGTTTTCTACTCTTGCGCGCATTTCAGACAGCAAGGCTTCCTGCTCGGAAGATGGAAGAGCATCGAATAATTCGATCAATTCTTTGTGGTTGGGAGTTAGCTCTGTTTCCACATGAAGTTCTTGTGCAGGCACTGGTGCCTTGTCTTCGTCACCAAACATTAGCCATGTAGGTGAGCACTTCAGAGCATCCGCTAAAGCAAACAATCGTTTTCCGACTGGCTGGGTTTCGTCTCTTTCCCATTGTGAAATTGTGACGTGAGCAACTCCAGCGAGGCGCGCAGCTTCTCGTTGTGTTAAGCGTAATTCTTTTCGTCGCGCCAGAACTCGCTGGCCTAGGGTTCTTGTATCCATAGTTAGGTAATTCTAATTTTTCTTGACTTAGGTATCCCGCGCACAATAATGTTAGAAAAGTCTAACAAGAGGGGGCTTTGATGCTTAAAGTTGACGCAATTACTTTTTTTGGCAGCAAAACAAAGCTTGCCAATGCCGCAGGAGTGAGACTGGCAAGTGTTGCTGCTTGGGGGATACTGGTTCCTGAAGGTCGCGCGATGCGTCTACAGGAGGCATCTGGCGGGGAGCTTCAGTATGATCCCAAAGTTTATGACGAATATCGTAAGACGAAGCGGGCGGGGCGGTTGAACAATGAAAATCACTCCTGAACAGGCTCGTGAGGCTCTGGATGCCTGGATATGTCGACCAGGAATGACACAGGAGCAGGCGACGATATTAATCACTGAAGCATTCTGGGCTTTGAAAGAGCGCCCGAACATCGATGTTCAGCGTGTCACATATGAAGGTGGCGCGGTTGATCAGCGAGCGCTTAGCGTTAATCGAGTGAAGATATTCGAACGCTGGAAGGCTATCGACACCAGGGATAAGCGTGAAAAGTTCACAGCGCTAGTGCCTGCAATTATGGAGGCTATCCGGATTAGTGATTTCAGGTTGTATCGTGAGATCAGTGATGGAAAAAGTATTACGTACATGATCGCCGGGTTAAATAAAGAATATGGCGATGTGGTGGAATCCGGACTGCTTTTTGCAGATCCTGCCGTTGTAGATCGTGAAACTGACGAACTTATAGAAAAAGCAATTGCTTTCAAGCTTGCGTATCGACAGCAATACCAACAAAAAGCTGGATGGAATTATGAGTCTTCTTTTTGCTGAACGCCCACTGGTTATAAACACGCAGCTGGCAATGAAAATCGGCTTAAACGAAGCCATTGTTTTGCAACAACTGCACTACTGGTTGAGAGATACCAACTCCGGTATGGAATGTGATGGTGTTCGCTGGATTTATAACACAACGGAACAATGGCTGGAACAGTTCCCATTCTGGTCAGAGTCAACGTTAAAGCGCGCGTTTGCAAGTCTGAAAACGCTGGGGCTTTTGCGTTGTGAAAAGCTCAATAAATCAAAGCGCGATATGACCAATTTCTACACGATCAACTATGGGAGCGAGCTTTTAGATGGTGGCAAATTGAACGAATCCATCGGTTCAAAATGCGCCGCTCCATCAGGTCAAAATGACACGATGGAAGAGGTCAAAATGAAACGCTCCATTGGTTCAAAACGACCCAATGTCATCGGGTCAAAATGGCCTGATGATCTTACAGAGAATACAACAGAGATTACTACAGAGAATAAAAACACTTTTCGTCCGGAAGCTTCGCAACCGGACCCGCAGACGACTGAACAGGATTTTTTAACCCGGAACCCCGACGCGGTTGTGTTTAGTGTGAAAAAACGCCAGTGGGGTAGCAGGGAGGATCTGGCGTGTGCGCAGTGGATTTGGGGGCGGATCGTGAACCTTTACGAACAGGCTGCCAGCGACGATGGAGAGATCATGCGACCAAAAGAGCCTAACTGGACAGCCTGGGCCAATGACGTGCGCACAATGCGGATGCTGGATGGCAGAAGCCACAGACAAATTTGCGAAATGTTTGGTCGGGTACAGCGGGATCCATTCTGGGTAAAAAACATCATGAGCCCGTCAAAACTCCGCGAAAAATGGGACGAACTGGTCATCCGCCTGGGACGTTCACCTGTACAGCGTTGTGTGAATCATATTTCTGAACCGGATACAGAAATTCCGCCTGGTTTCAGGGGATAAGTGTTGATTTCAGGTCATGAGATAATTTTAAGGGGGACTTGTGGCAAAAGTTTTTACACAAGAAGAGCGGGAAAAAATTAAAGGGCAGGTTGTTGAACTCGTACGCCGGAGTGGGCGTGAGACGTTACGGCAACTGGAAGTCAAGACAGGTGCGACAAGATATCTGATGAGCGTTCTCGCAAGAGAGCTGGTTGCCAGCGGCGATGTATACAACTCTGGTTACGGGTTATTCCCGTCTGAACAGGCGCGTAAGGACTGGCAAAATGCCCGTAAAAAGCTCTCAAGGGCAAAGCTGAAGAAACCATCTGCGGTTGATCCGGACCTTATCTGGTCATTACCTGACGGAGAAATACGTCGCTACGACAGGCGTCAGAACATAATCTGCCGCGAGTGCCGGAAGAGTGAAGTTATGCAGCGCATACTGGCATTTTATCAGGGATAATGTTAGATATTTTAGACGTTACTAGATTAAAAAGCATTAGTTCAGGAGTGAATTGACATTCTCATTTTTCATGGCACAGGGTAGATCTGGCGTGGTTGTCCGCTTTGTGCCAACAGCGGACGTTACTAACGAAACTGTGAGTTAAAAACGGGAGCTGGTCACCTCCATCCCGCGAAATGGAATTCAAAAATGCCCTCATGTACGGTACTTCTAGTCTGATGTGAGCATTTTCGTGGATTTTTTCGAGCCACTCCAACGAATGATTACATTCAGTTTCGGGCACTAAAACGTCATAAATCCGCAAGTTGAGGAGTAGCGATCAGCCAATTGGTTAAAAAACAGCTTTAATTCCCTTTTTCAAGCAGGCTCCTTGTCGCACGTCTGGTGCATCCAATTCATTTCAGTTACCCTGATGCCATACTGGCACGGACTAGCTCTCTTAATTGGCGCTGGTTTGTTAACCTTGTCCGCTGTCGGGTCTTTGTAAGTTGTAATTCCTAAACTAGCATAATTTTATTATAAGGCCTCTTACTATGCACATTTTATCTGTTTCAAAAAAGGAAGTTGGAAGAGCTGGGGATAGATTAGTTTCTCAGTTTGGTACAGGTGAAAAATATAAAGAGGAAGATGTACAAATTCTTCATGAGTGGAGGATGTTGCATTTATATCCTTTAAGTAAAATACAATTTTATATGGAAAGAGAGGCTATTTCCTTAAACAAAAATGCGTTACTATCCTCTAGAATTAAGAGGATGCCTTCCATTGTTACTAAGCTTTCTAGATTTCCTGATATGAAGTTAAATAAGATGCAGGATCTTGGTGGGTGCCGGGCTATTTTAAATAATCTGGATCAAGTGTATGATTTAGTTAATAAAATAAAATCATCTAAATTCTCACATGAACTAGTTAGAATGGATGATTACATGATAGACGTGAAAGATTCTGGTTATAGAAGCTTTCATATGGTCTATTCATTCCAAAATAAAAAATTTCCATCTTTAAATGGGTTGCGCATTGAAATGCAAATAAGAACAGCTATTCAACATAGCTGGGCTACAGCAGTTGAAATGGTTGGTTTGTTTCGAAAGGAATCATTGAAATCTGGTTTTGGTGATGCAAGATGGCTTAGGTTTTTTGAATTGGTATCAGAGCTTTTTTATAAACTTGAGTATGAAAAAGAACCATCTGGGAGTTATATAAAAATATCAGAAGAATTAAGTTATTTATCCGTCGAGTTGAATGTTTTCGATATATTAGCTGCTTACAATGCCGTAGTTAGCCATATTGAGGGTAGCAAAAAATATGATAAAGGACTTTGCATAATTGTTGTTGATACCATAAAGCGGAATATAAATATAAAGAGCTTCGAAAATCACAATCATGCGAAGGCGGCAGAAGCTTATGTCGAGTCTGAAAAATACTGCGCGGAAAATAAAGGCTGTGAAGTGGCTATGGTTTCTGTTAGTTCAATTAGTGAATTAAAAAATGCATATCCTGCTTACTTTTTAGATACGAAAACTTTTTTAAATTATCTCAGTAGGTATGTTTTTATAAAATAAAAAATGTTATCTGGAGTGGTCTGAATTCTACTAACTTAGTTTTTTCAGACCGCTTCTGACCACAATGGCCTGTGAGGTTCTGCTGCTAGTCAACATAAAATAACAAAGCCGGCAGGTGGTGGTTGTTGTAAAGTTATTCAATGACATTCATGGAGGCTTAAGCGAATTATTTGTCCCTGGTCGGTATGGTGGCAACAGCTTGTTGCGAGAATTTGGCTAATTTGGACTGACTTCCGCTCCTTGCTCAAAGCGGACTAGAAGGTTAGCTTGTGTCGGACTTTGCGTATTAAAAGAAGTGCTGGTGGTGACTAGTGGTTGAGCCCCATTTCCACAGAAAAAATCAGAGAAACTATACCCAATAGTTGTATTGAATCACTGACGAGACAGCCTCATATTCATCAGGACTGGTGTACGTCCAATACAGGAGGTTGTGGTGCTGGTTCTCAAATGTGCGCTGGCTATTACGGCTGTAATGGCGATTTATTGTCTTGCTATTGTTCTTATGGATCGCCTTTCTGACTGATTTCATATTGGCGAGGTAAAGGTAGTTAAGTAGAATGGCTGCGGGTGCTTGAGGCTATCTGCCTCGGGCATGAACACCAAAGGCAGATAGAGAAAAGCCCCAGTTAACATTACGCGTCCTGCAAGACGCTTAACATTAATCTGAGGCCAATTTCATGCTAGACACATGTAGGTTAGCCTCTTACGTGCCGGAAGGCAAGGAGAAGCAGGCTATGAAGCAGCAAAAGGCGATGTTAATCGCCCTGATCGTCATCTGTTTAACCGTCATAGTGACGGCACTGGTAACGAGGAAAGACCTCTGCGAGGTACGAATCCGAACCGGCCAGACGGAGGTCGCTGTCTTCACAGCTTACGAACCTGAGGAGTAAGAGACCCGGCGAGGGAGAAATCCCTCGCCACCTCTGATGTGGCAGGCATCCTCAACGCACCCGCACTTAACCCGCTTCGGCGGGTTTTTGTTTTTATTTTCAACGCGTTTGAAGTTCTGGACGGTGCCGGAATAGAATCAAAAATACTTAAGTAGCGCGCAGGGATAAGAGGGATGGTCCCTTAAAGGGGAGAGCTAATTATCCGGAAGGATTCTGATGATGAACATCGAAGAACTGCGTAAAATTTTTTGTGAAGATGGCCTCTATGCTGTGTGCGTTGAAAATGGAAATCTTGTTAGTCATTACCGCATTATGTGTTTGCGAAAGAATGGGGCTGCGTTAATTAATTTTGTGGATGGTCGAGTGACAGACGGATTTATCTTGCGCGAAGGTGAGTTTGTCACTTCATTACAGGCACTGAAAGAGATCGGAATAAAAGCAGGCTTTTCAGCTTTTGCAGAAGAATAAACTCATCTACAATCTTGCGCGGGGCTGAACTCCCGCTGAGTAACACCGTGCCACCGGAGAAAACCGATGGCACGCAACGTAAAATATTACAATTCTGATAATTCGCCCGTTCTTGCCTGCACGCACGGGCGGTATTCTCACGCATTCAAGTCTGAATGGTTCCAGCACCCTCCATGCACTGCAGAACAGGCCGAATGGCTGATTCATTCTTACTGCAGGCGCGGGTTCGAGGTTAAGAAAGCTCTCAGTCTCGACTATCGGCACTGGATAATCTCTGTCAGGCTGCCTTATTCCGAACGCCCACCACGTGCGTCCCGCACTTTCCAGCAACGGATCTGGAGGTAACGTGCGGGTATTACTTAGACCTGTTCTGGTGCCTGAGCTTGGGCTGGTGGTCCTTAAGCCGGGCCGTGAATCCATACAGATATTTCATAATCCTCGAGTGCTGGTGGAACCGGAACCAAAAAGCATGCGTAATCTGCCATCCGGAGTCGTTCCTGCCGTTCGCCAGCCGCTGGCGGAAGACAAAACATTGCTGCCGTTTTTTAGTAACGAACGGGTGATTCGTGCTGCTGGCGGCGTTGGCGCATTGTCTGACTGGCTATTACGTCATATTACATCCTGCCAGTGGCCTAATGGCGATTACCATCACACTGAAACAGTCATTCACCGTTATGGTACCGGCGCAATGGTGTTGTGCTGGCACTGCGACAACCAACTGCGTGACCAGACATCGGAATCACTGGAGCTGCTTGCTCAACAAAATCTGACAGCATGGGTGATTGACGTCATCCGTCACGCAATAAGCGGTACGCAGGAGCGGGAATTATCTTTGGCTGAATTATCCTGGTGGGCGGTCTGCAATCAGGTGGTGGATGCACTACCTGAGGCTGTATCGCGTCGTTCGCTGGGATTACCAGCGGAAAAAATCTGCTCGGTGTACCGCGAAAGCGACATCGTACCGGGAGAGCAGACCGCCACCAGCATATTGAAACAACGCACAAAAAATCTTGCACCGTTGCCTTACGCCCACCAGCAACAAAAACCACCACAGGAAAAGACGGTGGTAAGCATCAACGTTGATCCAGAGTCTCCGGAATCTTTCATGAAGCTGCCTAAACGTCGCCGCTGGGTTAAGGAGAAATACACTCGTTGGGTTAAGACACAGCCGTGTGCTTGCTGCGGTATGCCAGCCGACGATCCGCATCATCTGATTGGTCACGGGCAGGGCGGAATGGGAACAAAAGCACATGATCTCTTTGTGTTGCCTTTGTGCAGAAAGCATCACAACGAGCTGCATACGGATACAGTGGCATTTGAAGAGAAGTATGGCTCCCAACTGGAGCTGATATTTCGTTTTATCGATCGCGCGCTGGCAACTGGCGTACTGGCGTAAGTGGAGAACGAGCATGAACCTTGAAGCCTTACCAAAATATTACTCCCCAAAATCTCCAAAATTGAGCGATGACGCTCCAGCGACAGGCACCGGTTGTTTAACAATTACGGATGTAATGGCAGCGCAGGGGATGGTGCAGTCGAAAGCACCACTTGGGTTGGCCTTATTTCTGGCAAAAGTTGGTGTTCAGGACCCTCAGTTTGCGATTGAAGGCCTGCTAAATTACGCGATGGCACTGGATAACCCGACATTGAACAAATTGAGTGAAGAAATCCGGTTACAGATTATTCCTTACCTCGTGAATTTTGCCTTTGCTGATTACTCCAGGTCTGCGGCAAGTAAGGCTCGCTGTGAGCATTGTTCAGGTACGGGATTTTATAATGTATTGCGCGAAGTGGTGAAACACTACAGACGCGGGGAATCTGTAATCAAGGAAGAATGGGTGAAGGAACTATGTCAGCATTGCCATGGTAAGGGCGAAGTCAGCACAGCGTGCAGAGGGTGTAAGGGTAAAGGGATTGTTCTGGATGAAAAAAGAACCCGGTTTCATGGCGTACCGGTATATAAGATTTGTGGGCGTTGTAATGGAAACCGATTTAGTCGTTTACCGACCACGCTGGCACGACGTCATGTCCAGAAGCTGGTACCAGACCTGACCGATTATCAGTGGTATAAGGGGTATGCGGACGTCATTGATAAACTGGTAACAAAGTGCTGGCAGGAAGAAGCATACGCGGAAGCGCAATTGAGGAAGGTGACGAGATAAATGATTTTTGCTGAAGATGGCGACATGATGTTTGCATTTTTCAAAAAACATGGATAAGATTCTCTCAACGATGGGCTTTGTGTATCCGACGTTTAGAAAAAAGTAGAAAACCCGCTTATAAGCGGGTTTTTGTGCTTTAAATGGGGCAATAGAGATATTGAATCTCATCCCGGGATAAACATTGGCAGTTGAAGGTCCACGCGAACCATTTATCCGGCAAAATTCCACGCGTAATCCTGTGGTAATTTCTTCTGCATCTCGAAGATTGAGAGCTGAAACGTGAAGCTGGGCATCGATACGCCATCGGATGGGAATATAAGACCTTTGCTGCTTTTGTAGTCAAAGTTTTTGACAATTCCTGTCATTTTAGGGGACAGAAAAACTCCTTAATACTGATAACCTGGTGCACCATACACACGTTCCTGGAGAAAACTACTTTTTTGATAGGGTTGAAGGTGGCTGGATGTCTAAAATAAACATTGCTTCATATGTTCAACTATGAGTTAATGACTGCGTCGGTTTGAAGAACAGACGATATACGAAGTAGTTTACTAAAGCAGTTCTCATTTCAGGTGTTATTCACTTATTCCTTCTTTGAGTCTCTCCAATTAAGTACGAAGTCGTTTCTGTTATGCAAACCATTTATGCCGAAAGGCTCAAGTTAAGGAATGTAGAATGTCAAATAAAATGACTGGTTTAGTAAAATGGTTTAACGCTGATAAAGGTTTCGGCTTTATTTCTCCTGTTGATGGTAGTAAAGATGTGTTTGTGCATTTTTCTGCGATTCAGAATGATAATTATCGAACCTTATTTGAAGGTCAAAAGGTTACCTTCTCTATAGAGAGTGGTGCTAAAGGTCCTGCAGCAGTAAATGTCATCATTACTGATTAAAATTCATCGCTCGTCTGTATACGATAACGAAGAAGGCTGATGCCTGAGTAGAGATACGGACAGAGTAGTGAATATTGGATCTCTTTAATAAAAAGTAAGGAGGTCCAATACATGAAACAATGGCCAGCATATTTGGCAAAATCTTAATCAGGAAAAGTATGCTAACCATTGTGGTGAAGTGCAGGTTTGCTGCATGAATAGTTTTACAGCAGAAGCTAACTGCTGGCATGGCAAAACAAAGTGCGTAAGTGGATGACTCCCACAAAAAGCACCACAATCTCAAACCCGCTCAGGCGGGTTTTTTATTATCTGCTTTAAATATATTATTAAAATATAAAAAATACTTGTTACTAATAAAATCAATCAGGCTACAGCTTTAAGATTTGTCTGGAATACTTTGTTGCAATGAGGGCAGATCAAAAGGGCACCTTTTTGTACTCTTGAAAAACTGTGTTCTGACTCTTGGGTGCAGTTTGGGCAGGAACATTTAACGAGATAATTACGGCGTGATTTTGAGTCTTTACGTTCTGACATAGGCTTTTCCTGTATAAATGGCCGTATACAGTACACTAAATATGAAAACATTTCTCGTATTATTATTTTATATATGACTTTCTTTCAAAATAATTACTCACATTTTTAATGTGTATGTTTCTTTAGCGCCGTTGAGAACAACGTGTGCTGTCAAAACTACCCCGTAGACTCCGATCTTTTCAAACATATTGCACCATCCGTGTACATCGGGGTGAGGATATGAAATCAATGGATAAGTTAACAACAGGTGTTGCCTATGGCACATCGGCGGGTAATGCTGGTTTCTGGGCATTGCAGTTACTCGATAAAGTAACTCCGTCACAGTGGGCTGCAATCGGTGTGCTGGGTAGCCTGGTTTTTGGCCTGCTGACGTATCTGACAAATCTTTATTTCAAGATTAAAGAAGACAGGCGTAAGGCTGCGAGAGGAGAGTAATCCAATGACTCAAGACTATGAACTGGTTGTGAAAGGAGTCCGTAATTTTGAGAATAAAGTTACGGTAACTGTAGCCTTACAGGACAAAGAACGCTTTGACGGTGAAATTTTTGACCTGGATGTCGCCATGGACCGTGTTGAAGGAGCTGCGCTGGAGTTTTATGAGGCAGCAGCCAGAAGGAGCGTCCGGCAAGTCTTCCTGGAAGTAGCAGAAAAATTGTCAGAAAAAGTTGAGTCTTATCTGCAGCATCAGTACTCCTTTAAGATTGAAAATCCTGCCAATAAGCACGAGCGTCCTCATCATAAATATCTATGAACACAAAAATCAGATACGGCCTGTCGGCTGCCGTTCTGGCGCTGATTGGTGCTGGCGCATCTGCTCCTCAGATACTTGACCAGTTTCTGGACGAAAAAGAAGGTAACCACACAATGGCATACCGCGATGGTTCTGGCATATGGACCATCTGTCGGGGTGCCACAGTGGTGGATGGAAAAACCGTTTTTCCCAATATGAAACTGTCGAAGGAAAAATGCGACCAGGTCAACGCCATTGAGCGTGATAAGGCGCTGGCATGGGTGGAGCGCAATATTAAAGTACCACTGACCGAACCACAAAAAGCGGGTATCGCGTCATTTTGTCCCTATAACATTGGCCCCGGTAAGTGTTTCCCGTCGACGTTTTATAAGCGGCTGAATGCTGGTGATCGTAAAGGTGCATGCGAAGCGATTCGCTGGTGGATTAAGGATGGCGGACGCGATTGCCGCATTCGTTCAAATAACTGTTACGGTCAGGTTATTCGTCGTGACCAGGAGAGCGCATTAACCTGCTGGGGGATAGAACAGTGAATCAGATATTCATGGTGATTTTTCTCGTGTTGTCAGGATTTATCGTCGGAAATGTCTGGAGCGACCGAGGATGGCAAAAAAAATGGGCGGAACGTGATGCTGCCGCATTATCACAAGAGGTAAATGCTCAATTTGCTGCTCGAATAATTGAACAGGGGCGAACTATAGCCCGTGATGAGGCTGTTAAAGATGCGCAACAGAAATCTGCTGAAATTTCTGCCAGGGCTGCTTATCTGTCTGATAGTGTTAACCAGTTGCGTGCCGAAGCAAAAAAATATGCCATACGCCTTGACGCAGCGAAGCATACCGCAGATCTTGCCGCTGCCGTCAGAGGCAAAACAACCAAAACCGCCGAAGGAATGCTCACCAACATGCTCGGAGATATTGCAGCAGAAGCTCAGCTTTATGCTGAAATTGCTGACGAACGCTACATCGCAGGAGTGACTTGTCAACAGATCTATGAATCTTTAAGAGATAAAAAGCATCAAATGTAGGGTAATATTAAATCGGAACATTTACATCGCGGAATGTAAAATTTAAATAAAAAGGACTCTTCCATGAGCCAAAATTCCTGAAATCTTAAGGGTAAGATAAAAGGTCTTAATCAGAATGACACGTTTTATTAATAAATAAAGCTATTCTTTCATTGCTGTGTTTTTCTTTACAAAAGTAATCCTTGCTATGGGTGGTTAATCATGCGTTAATGGTGTTCTGGTTTGTTACAAATTTATCTGAAGCAGTCATTTTTATAATTTTATTATTTGTACCTCTTGAGATTTCCTTGTTGGTTTTTCTCTCTGATATTTTTTTTTCGGACCATTCTGCCCAAGGGCTAATTTCTTCAAAAGGTAATAATTATGTCTAACAAAATGACTGGTTTAGTGAAATGGTTTAACCCTGAAAAAGGTTTTGGTTTCATCACGCCGAAAGATGGCAGCAAAGATGTGTTTGTCCATTTCTCAGCAATTCAGAGCAACGATTTCAAAACATTAACTGAGAATCAGGAAGTTGAATTTGGTATTGAGAACGGACCTAAAGGTCCTGCCGCTGTTCATGTAGTGGCGCTTTGAGGTAGACAATATTACAAACCATATTCACTTTAGATGCCCGTGTTGTCATGGTTCCCAGTATAGAACATCATCTTTTGATGTTTCTGACATGAATCCTTTCGGGGCAAAATGTATCTTTTGTAAATCAATGATGATTACATTTGATAATATTTCACAATACTTAAATGCCAGCCGTCTGTCGTTGGATTTAAAAAAGTGAAAATGAAGGCTCCTTCGGGAGCTTTTTTGCTTGGTGTCTATTCGATGGATACTCACATACTACGGTAACATCATGAAAAAAATCATAGTTTTTTTTAACTCTGAACCAGCAGTGGTAGTGTCAGCGATGACTGGAGTTAACACCATCATGCGTGAATATCCAAATGGCGAAAAAACACACCTTACTGTAATGGCCGCAGGGTTTCCATCTCTGACCGGAGATCATAAAGTCATTTATGTAGCCGCGGATCGACATGTTACTTCAGAAGAAATTCTGGAAGCAGCAATAAGGCTCTTGAGTTGATTTGATGCTATTGCATTGATAATTCAGGAAAATTCTCTTTGTCTGTTTGTGTAAAATTTAGACTATCGTATGTTGATTATTGCGATGTTTCATCTTATCTTTTACACGTTTGCACCATATAATCGACTTACTGTGTAACTGGAAAGTCATAACAGACTAAAAGAGGAAATGATGAATATTGAAGACTTAAAAACAAAAGCAGAAGCAGATATTTCTGAATATATAACAAAAAAAATTATTGAACTTAAGAAAAAGACCGGGAAAGAAGTTACCAGTATTCAGTTTACCGCACGGGAAAAAATGACGGGTCTTGAAAGCTATGATGTCAAGATTAATTTAATCTGATGTATTCAATAATAAAGTTTATCCATAAACCTCGTTTTTACGGGGTTTTGTTATATTTGAATGGTTCCGAATATCTAAATCACAATTGTTGATGGTTTTTATTAAACCAATGCAGTCCGGCTCAGGAGTGAGAGAAGCCGGACGTTATGGTTTAGCGTGGTAAGATCTGTGTAGTTTTCTGGATGCTTTCAGTAAATAGTAATGAGTTATCAAAGGCATAGTAATATCTTTGGTGTTCCTGGATATTTGTAACCCATCGGAAAACTCCTGCTTTAGCAAGATTTTCCCTGTATTGTTGAAATGTGATTTCTTTTGATTTCAACTTATTATAGGAGGTCTCTATAAGATGTTTGTTTCTGGAGAATTTAACATTTACAACCTTTTTGAGTCCTTTTACTAACACTATGTTGTCGTTTTCTAACACAATGTGAATATTATCTGTGGCTAAATAGTAAATATAAAGTGAGACATTGTGACGTTTTAGCTCAGAATAAAATAATTCACAGTTTAAATCTTTACGCACTTGATCGAATATTTCTTTAAAAATGGCAGCCTGAGCCATTGGTAAACCTTCCATGTGATACGATGGCGCGTAGTTTGCGTAACAAAGTTGAGCCTTGCTGGCATCCAGGAGGGATATGCAACCGACAGATGTATGTAAGGTCGATGTACTCAAACTTTCATACTTTTCCTCTTTTATGCAGAAAGATTTGAAGTAATATTTTAACCGCTAGATGACGAGCAAACGCATGGAGCGACAAAATGAATAAAGAACAATCTGCTGATGAACTCTCGTTGGATCTGATTCGTGTAAAAAATATGCTTAATAGCACCATTTCTATGAGTTACCCGGATGTTGTAATCGCATGTATAGAACATCAGGTGTCTCTGGAAGCATTCAGGGCAATTGAGGCAGCGTTGGTGAAGCACGATAAGAATTCGAAGGATTATTCCCTGGTGGTTGACTGAGCACCATAACTGCTAATCATTCAAACTATTTAACCTGTGACAGAGTCAATGTCGCATTCTGTCACTGTCAGGCTAATACAGAGCTGCAATTCAACTACTGCAATGTCCTCGTAATTAGGTGAATTTACATATCGTCCTGTTCGGATGCCGGCTGCATTGCTGAAGATGAGGCATTTATGGTTCGCATATTTTCCCCTCATGCTCGTCAGTCCTGTGCGTAGGAAGAAACAGGACACTCACACTAATTTGTGTGGGCATGCTGTGATGTCCTTCTGAATTATTCCTATGCCATTATGTAAAGCGCTGTATCAGATGCTCGTCACGGCTGTCAGGCTGTCGGGTCCTCCCGGTGGGGGCCCCTGCCACGGGGCGGGAGCGTCGCGGAAAAAGGCTAGTTTTTGAAATTTCATTCGTCATCACCACTACTGTAATAGATTGATATTACAGTGTTTTTATTTTTGTGGTGTCGATTTTGATTGTTTTTTGTTCATCACTAACACCGTTTGCCTAAAGTTGTTCGCAAGATGCATGTTTAAAACATTCTGGAGCGGGTATGGATCGAGAGTTAAAAAATCTGACGCTGAATATCAGTCAACTGGCGGCACTGTCAGGTGTACATCGCCAGACTGCTGCGGCAAGGCTGCAAAATCTACCCGTTGCAGGGGGGCATGAAAGCAACCTCAAGCTTTATCGGGTGGTTGATATTGTGTCGGCATTTCTGGCATTACCACCGCCGGTTGCAGAAAGCGAAATGGATGCGCATGAGCGCAAAGCCTGGTATCAGTCTGAACGTGAGCGTCTTAAGTTCGAACAGGAAACGGCACAACTCATTCCGGCCAGTGATGTCAGACGGGAGTTTGCCATCTGGGCAAAAGCGGTCGTGCAGGTGCTGGAGACATTACCGGATATTCTTGAACGTGACTGCGGTCTGCAGCCTGCCGCTGTGAGCCGTGTTCAGTCCATTATTGATGATCTGCGCGATCAGATAGCCCTGCGGGTGACTGAAGCAGGTGCGGATGATGAGGAGGAATTACAGCAGGAGGAGTAATGCTGAATCAGGAAACCGCAAAGGCAGCACGAACCGATTCAGGTTATATCCTTCGCGCACCGAGACGAATGCGGGTTGCTGATGCCGTTGCTCAGTATATGCGGGTGCCCCTGGGGGCAGGGAACTCAGTCCCGTGGGATCCGCTGGTGGCACCGTATGTTATTGAGCCGATGAACTGCCTGGCCTCGCGTGAATACGACGCAGTGATATTTGTTGGCCCGGCACGAACCGGCAAGACTATCGGCCTGATTGACGGCTGGGTGATTTACAACGTGATTTGCGATCCTGCTGATATGCTGATCATTCAGATGACGGAGGAAAAAGCCCGCGAACACTCCAAAAAACGACTCGCCAGAACGTTTCGCGTCAGCCCGGAAGTGGTCAGTCGCCTGAGTCCGAACAAAAATGACAACAACGTTTATGACAGAACATTCCTTGCTGGTAACTACCTGAAAATCGGCTGGCCGTCAGTCAATATCATGTCCTCATCAGATTATAAATGCGTCGCGCTGACGGATTATGACCGTTTTCCGGAAGATATTGATGGCGAGGGGGATGCTTTCTCTCTTGCCTCAAAACGTACCACAACATTTATGTCCAGTGGTATGACGCTGGTGGAGAGTTCCCCCGGCAGGGATGTGAAGGATGTGAAATGGCGACGGACTTCACCGCATGAGGCTCCACCAACCACGGGGATACTGTCGCTCTATAACCGTGGCGATCGCCGTCGCTGGTACTGGCCCTGTCCACACTGTGGTGAGTATTTTCAGCCCTGCGGCGATGTGGTTGCTGGTTTCCGTGATATTGCCGATCCCGTGCTGGCAAGTGAGGCGGCTTATATTCAGTGTCCTTCCTGTTCAGGACGGATTATGCCTGAACAAAAACGTGAGCTGAACGGACGTGGGGTCTGGTTGCGGGATGGTGAATCCATCAATGCGGATGGCAGTCGTTATGGTGATCCCCGACGCTCACGTATTGCGTCATTCTGGATGGAGGGTCCGGCAGCTGCTTACCAGACACTCTCGCAACTCGTTTACAAACTGCTTACTGCAGAACAGGAATACGAGACAACCGGAAGTGAAGAAACACTCAAGACGGTTATCAATACCGACTGGGGATTACCTTATCTTCCCCGCGCCAGCATGGAGCAACGAAAAAGTGAACTGCTTGAGCAGCGGGCAGAGCCAGTTCCTTCCCGCAGTGTGCCGGATGGCGTTAATTTTCTTGTGGCGACAGTGGATGTGCAGGCGGGACGTCATCGCCGTTTTGTGGTTCAGGTAACGGGCTATGGCAGCCGTGGCGAACGCTGGATTATTGATCGTTACAACATCACGCAGTCATTGCGCGGTGACAGCGACGGGGAGAGCCAGCGAATTGATCCGGCCAGCTATCCGGAAGACTGGGATGTCCTGCTGACGGATGTTTTTCATAAAAGCTGGCCGCTGGCCTCCGATCCTTCTCAACAAATGCGACTGATGGCAATGGCGGTGGACTCCGGCGGTGAAGACGGGGTCACTGATAATGCCTATAAATTCTGGCGTCGTTGCCGTCGTGATGGCCTTGGTAAACGTATTTACCTGTTTAAGGGCGACAGCATCCGGCGCGCAAAACTGATCACCCGTACATTCCCTGATAACACCGGACGAACGGGCCGACGGGCGCAGGCCGCAGGTGATGTGCCGCTCTGGCTTCTTCAGACGGATGCACTGAAAGACCGGGTGAATAACGCGTTATGGCGTGACTCGCCAGGTCCCGGCTATGTGCATTTCCCTGACTGGCTGGGGAGCTGGTTTTACGACGAACTGACGTATGAAGAGCGGAGCAGTGACGGGAAATGGAGTAAGCCGGGTCGCGGTGCCAACGAAGCTTTTGACCTGATGGTGTATGCCGAGGCTCTGGTCATTCTGCATGGATACGAAAAGATCCGCTGGCCGGATGCACCGGAGTGGGCGAGCCGGGAAACCTGGCTGGAGTGTGTCCCGGACAGTATCGAACCGTCACCCTCACCGGAACCGGTATCCACGCCTGTTAAAAAACAAAAACGGAAGAAAACAGTAACTGACGATGTTAACCCCTGGCTGACTTCCGGAGGATGGTTATGAACCAGAATGATATCGAAGCCATGATTCAGCGTTATACGGAAGCTGAAATGGCGGTGCTGGACGGAAAATCCGTCACCTTTAATGGTCAGCAGATGACCATGGAAAACTTATCTGAGATCCGGCAGGGACGGCAGGAGTGGGAGCGCCGCCTTGCGGCTCTGATTACACGACGACGGGGGCATCCCGGGTACCGGCTGGCGAGGTTCTGATGGCAATTCTTGATGATGTGATTGGCGTTTTTTCACCAGGATGGAAAGCGGCAAGGCTGCGTTCCCGGGCGGTGATCCAGGCTTATGAGGCCGTAAAAACGACGCGGACACACAAAGCCCGGCGGGAAAACCGAACTGCCGACCAGTTAAGCCAGTACGGGGCCGTGTCGTTACGTGAGCAGGCCCGTTACCTCGATAACAACCACGATCTGGTCATTGGTGTATTTGACAAGCTGGAAGAACGGGTGGTGGGGAAAAACGGGATTATTGTCGAGCCACATCCGGTATTACGCAATGGGGCCATTGCCCGTGATCTGGCTGCGGAGATTCGCACCCGATGGAGTGAATGGTCTGTCAGCCCGGAAGTCACCGGGCAGTTTACCCGTCCGATGCTGGAACGTCTGATGCTGCGTACCTGGCTGCGCGATGGTGAGGTGTTTGCCCAGATGGTTTCCGGGCGCATAAACAGCCTGACGCCTTCTGCCGGTGTTCATTTCTGGCTGGAGGCGCTCGAGCCGGACTTTATTCCCATGACCAGTGATGAGAGCAACAGGCTGAATCAGGGCGTGTTTGTTGATGACTGGGGGCGTCCCGAAAAATATCTGGTGTATAAAAGCCGTCCCGTATCCGGACGGCAGATGGAAACCAAAGAAGTGGATGCAGAGCGAATGCTGCATCTTAAATTTGTTCGCCGTCTGCACCAGATGCGCGGGACGTCTTTATTGTCCGGTGTGCTGATCCGCCTCAGTGCTCTGAAAGAGTATGAAGATTCTGAGCTGACTGCAGCAAGGATCGCCGCTGCTCTGGGGATGTACATCCGGAAAGGCGACGGGCAGAGCTATGAAGCGGATGGTAATGGCAGCAAGGATAAGGAACGCGAGCTTACCATTCAGCCAGGTATTATTTACGATGATCTGAAACCCGGCGAAGAAATCGGAATGGTGAAGTCGGATCGCCCCAATCCTAACCTTGAAACTTTTCGTAATGGTCAGTTGCGTGCCGTGGCGGCGGGCAGTCGTCTGAGTTTTTCCAGTACAGCGCGCAACTATAACGGCACTTACAGCGCCCAGCGTCAGGAGCTGGTTGAATCCACTGATGGCTACCTGATCCTGCAGGACTGGTTTATTGGTGCCGTCACCCGTCCGATGTATCGTGCCTGGCTGAAACAGGCTGTGGCATCCGGTGTTATCAGGCTACCCCGCGATCTTGACCGTTCTTCACTGTATACCGCGGTGTATTCCGGACCGGTGATGCCGTGGATTGACCCTGTTAAGGAGGCTGAGGCCTGGAAAATCCAGATTCGTGGTGGAGCGGCGACAGAATCAGACTGGGTACGTGCTGGTGGTCGTAATCCGGATGATGTCAAACGTCGGCGCAAGGCCGAAATTGATGAAAACCGCAAACTGGATCTGGTATTTGATACCGATCCGGCCAGTGATAAAGGAGGCAGCAGTGCCGCAACGAAACGACATGAGCCGCAGCACACCGACGACCAGTCCGAAGAATAATTCCTGGTTCAGGATGCAGGCTGGTCACCAGAGTGACGCGGATATTTATATTTATGACGAGATTGGTTTCTGGGGTGTTACAGCGAAGCAGTTTATCAGTGATCTGAATGCACTGGGCGATATCACCCACATTAATCTCCATATTAATTCACCGGGTGGCGATGTCTTTGAAGGCATCGCCATTTTTAATGCGCTGAAAACACATGGTGCGTCCATTACCGTTTATGTCGACGGTGTGGCGGCGTCAATGGCGTCGGTCATTGCGATGGTGGGAAACCCGGTCATTATGCCGGAAAACACCTTCATGATGATTCATAAACCATTTGGCTTTACGGGCGGTGATGCGGAGGACATGCGCACCTATGCCGACCTGCTCGATAAAGTTGAGGCGGTTCTGTTACCCGCTTATGCACAGAAAACCGGGAAAACCACCGATGAAATTGCTGCCATGCTGGCGGATGAGACCTGGATGTCCGGTGCCGAATGTCTGGCACATGGATTTGCTGATCAGGTAACGCCAGCCGTTAAGGCAATGGCATGTATTCAGTCAAAACGTACAGAGGAATTTAAAAAGATGCCGGAATCCATTCGAAACATGATTACTCCGCCACGCAACAGTGCTCCACGCGTACAGGATAATGAACCTGAAGCCTCCCGGACGCCAGTGCAGGCAGCAGCACCCGTGGTGGATGAAAACAGCATCCGTGCGCAGGTACTGGCAGAGCAAAAAGCGCGTGTAAACGGTATTAATGATCTGTTTGCCATGTTTGGCGGGCGTTATCAGACGCTGCAGGCTCAGTGTCTTGCCGATCCTGAATGTTCGCTGGAGCAGGCCCGCGAAAAGCTGTTGAACGAGATGGGGCGCGAGTCCACGCCATCCAATAAAAATACCCCGGCTCATATTTATGCCGGTAACGGTAATTTTGTGGGGGACGGGATCCGCCAGGCGCTGATGGCGCGTGCCGGATTTGAAAAAACCGAACGTGATAATGTCTACAACGGGATGACCCTGCGTGAATATGCCCGTATGTCACTGACTGAACGGGGTATTGGGGTTTCCAGTTATAACCCGATGCAGATGGTCGGTGCGGCGTTCACACACAGTACGTCTGACTTCGGTAATATTCTGCTGGATGTTGCGAACAAAGCCATTCTGCAGGGCTGGGAAGATGCCCCTGAAACCTATGAACAGTGGACGCGGAAAGGTCAGTTGTCTGATTTTAAAATTGCCCATCGTGTGGGTATGGGTGGCTTCAGTGCTCTGCGTCAGGTGCGTGAAGGGGCGGAATATAAATACGTCACCACCGGAGATAAACAGGCCACTATTGCACTGGCGACCTATGGCGAGCTGTTCAGTATCACCCGTCAGGCCATTATCAATGATGATCTGAATATGCTGACCGATGTCCCGATGAAACTGGGCCGTGCGGCGAAATCCACTATTGCCGATCTGGTTTATGCCATTCTGACGTCTAACCCGAAAATCTCCACAGATAATGTAAGTCTGTTCGATAAAGCGAAACATGCAAACGTACTGGAGAGCGCTGCAATGGACGTGGCATCGCTGGATAAAGCCCGCCAGTTGATGCGCGTTCAGAAAGAGGGGGAGCGTCATCTGAATATTCGTCCTGCGTTCGTACTGGTACCGACGGCGATGGAGTCTGTTGCTAACCAGGTCATTCGCTCCTCAAGTGTCAAGGGGGCTGACATTAACGCCGGTATTATTAACCCGGTGAAAGATTTTGCGACCGTTATTGCAGAGCCTCGTCTTGATGATAACAGCCAGACCACCTTCTACCTGGCTGCGTCAAAAGGCTCCGATACGATTGAAGTGGCTTATCTCAACGGTGTGGATACGCCATATATTGATCAGATGGAGGGCTTCAGTGTGGATGGCGTGACAACGAAAGTGCGTATTGACGCCGGTGTCGCGCCAGTTGATCACCGCGGTCTGGTGAAATGTACGGCGTAAACGTCGCAGACAACAACTCTGATGGCCCGTAAGGGCTTTTTTTGTACCTGAAATCAGCCCCTGAACGGGGCTGTGCGGAGACAGTTATGGCAAAGAATTTTGTAGAAGAAGGAAAAACGGTGGCGATTGTTGCCAGTGCAGCCATCAGCAGCGGAGATCTGGTGCAGGTGGGTGATGTTTTTGCGGTGGCGCTGACCGATATTCCACAGGGTGAAACAGGCGACGGCATGACCGAAGGTGTGTTTATCCTGCCTAAACTGAAAACGGATGACATGAAAACGGGTAAGAAGGTTTATCTGAAGTCCGGAAAAGTTCAGCTGACTAACAGCGGCTCTGATCCGCTGGTCGGGGTTGTCTGGGCAGATGCCGGAACCAGTGCAGAAGAAGTGCCGGTAAAACTCAATGTCTGATCCCTTTTCCCGGCTGGCAGCGCGTATGGATGCGATCACGGTCAGAAAGATGGGAAAGACAGCCTCGATTAATGATGCCGATATGACTGTGATCCCGGGCGAAACACTGGCAGAGCTGAATGCTCTGTCCGGACCTGCGGTCTCTCTGGTGGTGTTTTCTTCGGGATACCGCCCACGGCGCGGGGATCGCGTTGTTTATGACGGACAACAATGGACGGTCACACGGCATGAACGTTTTAACGGTAAGCCAATGATCTTTATTGAGTAAAGAGGTGTGGGATGAAGGGGCTTGAGAATGCCATCCGCAATCTGAACAGCCTTGATACCCGTATGGTGCCACAGGCCAGCGCATGGGCGATAAACCGTGTGGCACAGAAAGCGGTCTCGGTTGCCACCCGGCAGGTTGCCGGGAATACCGTTGCGGGAGATAACCAGGTGAAAGGGATCCCCCTGAAACTGGTACGTCAGCGTGTCCGGGTGTTTAAAGCCAGTCCGTCAGGAAAAATGACGGCCAGGATCCGCGTTAACCGGGGCAATCTGCCCGCCATTAAGCTGGGGACAGCCCGGGTCAGACTGGCCCGGCGTGGTGGAAAACTGCAGTACCGTGGCAGTGTGCTGAAGGTGGGTAAATATCTTTTCCGGGATGCGTTTATTCAGCAACTGGCGAATGGTCGCTGGCATGTGATGCGGCGTATTGATGGCAAAAATCGTTACCCCATTGATGTGGTGAAAATCCCGCTTTCCGGACCGCTGACACAGGCATTTGAAGATGCCCGCGACCGCATCATTGCTGCGGAAATGCCGAAACAGCTGGGGTATGCACTGAAACAACAACTGAGGTTATGGCTGACCCGATGAACCGACATACACAAATCCGCCAGGCCGTACTGGCACGCCTTCGGGAACAGTGTGGAGACAGCGCCACGTTTTTTGACGGGCTTCCGGCATTTATTGATGCGCAGGAACTGCCTGCCGTGGCGGTGTGGCTGAGTGATGCTCAGTACACCGGAAAAATGACGGATGAAGATGACTGGCAGGCTGTTCTGCATATTGCTGTCTTCATCCGGGCACAGGCACCGGATTCAGAGCTGGATATGTGGATGGAGAGCACCATTTTCCCGGCTCTGAATGATATACCGGCACTTTCCGGACTCATCGACACCCTGATCCCTCTCGGTTTTAACTATCAACGTGATAATGAGATGGCCACCTGGGCGATGGCGGAAATCACGTACCAGATCACGTACACGAATTAAAGGAGGTGGCAATGACCACACCAAATCCACTGGCAAAAACGAAAGGTGCGGGAACGACGTTCTGGATGTACACCGGCAAGGGCGATGCGTTTGCGAACCCTTTATCGGACACTGACTGGCTGCGTCTTGCGATGGTGAAGGATCTGCAACCTGGCGAAATGACCGCTGATGCAGAAGATGACACTTATCTCGATGATGAAGATGCAGACTGGAAAACGACAACACAGGGGCAGAAATCCGTCGGTGATACTTCGGCGACGCTGGCCTGGCGTCCGGGTGACAGCGGACAGAAAAAACTGGTTCAGTTGTTCGACTCCGGTGAAGTCTGCGCGTTTCGTATCAAATATCCCAACGGCACTGTTGATGTTTTCCGTGGCTGGCTGAGCTCACTGGGTAAAACCATTGCCTCAAAAGACGTGATGACCCGCACAGTGAAAATCAGCGGTGTGGGGCGTCCGTATCTGGCAGAGGAAGGCACTGAAACAGTGAGCGTTACCGGGCTGACGGTGGCACCGGCATCTGCCAGTGTAAAAGTGGGAGCAACCACCACGCTGACCTTTACAGTAAAACCTGACGGAGCCAGTGACAAAGCGATCAGTGTGCATTCGACAGATCCACAGACTGCCACGGTGACCCTGAACGGGCTTGTGGCCACGGTGAAAGGCGTGAAGCAGGGCAGTGTCAGCATTGTGGGCATGACTTCTGACGGCGATTTTGTGGCAGTGGCTGCGGTGGCTGTCAGCGCCGCAGGTTAACAGGACGATACTCATCATTTGCCCCGGTTATCCGGGGCTTTTTTGCAGGTGGAGAACATGATGTTTCTGAAACAGGGCACGTTTAATTATGAAAAGCAGTCCGTGGTGCTCAGTGAGCTGTCCGGGCTGCAGAGAATTGAATATCTGGCGTTTGTTCAGCAGCGAACGGCAAAGTTTGATGCCGAAGAGGGAGAACTGCCGGAGGCTGAACGACAGATTGCTTTTCTGCGGATGGGGATGGATATCAATGCCTGGCTGGTTTCCCGCTCACTGTGGAATGCGGAACAGTCTCAGGATGTTGAGACGCTTTGCGCATCCGTTATTACAACATGGTCGTATGATGCCCTGGGAGCGGGGGCGGAGATGGTTCTGTCGCTGAGCGGTATGGGAGCCATTGAGAATGCCGGGGATTTGGAGCATGAGGTGCTGACGCCGGAAAAGTCCTGACGCGGGAAATGCAGTTTGTCATGCGGCTTGCCCGGGAGTTCCGGCGGGCAGACTGGCGGCGGATGCTGTCGGAAATGTCGGCCACTGAGCTTGGTGAGTGGGGCGATTATTTCCGGATGCAGAGCTTCAGTGATGTGTGGATGGATGCGCAGTTTGCCTCGCTGAAGGCATTGATCGTGAGAATGGTGTCCGGTAGCAGTGATGCTGCGGTGGCTGATTTCAGCCTTTTACCGGAAGAGAACGGGATACCGGAGCGAACGGACGAAGAACTGATGCATCTTGGGGAAGGTATTTCCGGAGGTGTGCGTTATGGACCAGATAGCCAACCTGGTCATTGATTTGGGGATTGATGCGGCAGAGTTTAAAAATGAAATTCCCCGTATCAAAAACCTTCTGAATGGTGCAGCCAGCGATGCAGAACGGTCTTCTGCCCGTATGCAGCGTTTTATGGAGCGTCAGACTCAGGCCGCCCGGCAGACAATGCAGGCGGCTTCTTCGGCTGCAACAGCCGCATCCGTCCATGCGCAGACGGTGGAGAAGAGCGCACAGGCTCATGAACGCATGGCCCGCGAGGTGGAGCAAACCCGCCAGCGTATGGAGGCACTGAGCCAGAAAATGCGCGAGGAACAGGCGCAGGCCATGGCTCTGGCGGAGGCTCAGGATAAAGCGGCTGCCGCGTTTTATCGTCAGATTGACAGTGTGAAACAGGCCAGTGCGGGGCTGCAGGAATTACAGCGTATTCAGCAGCAGATCCGACAGGCCAGAAACAGTGGCGGGATTGGTCAGCAGGATTATCTGGCGCTGATTTCTGAGGTTACGGCGAAAACCCGTGTTCTTACGCAGGCTGAGGCAGAGGCTACCCGACAGAAAGTGGCGTTTATCCGTCAGCTTAAAGAGCAGGCAACCCGCCAGAATCTTTCTTCTTCTGAGTTGCTTCGTGCTAAGGCTGCCCAGCTGGGGGTAAGCAGTGCTGCAGAAGTGTATATCCGCAAAATGGAGCAGGCAGGAAAAGCCACGCATTCGCTGGGTCTGAAAAGTGCAGCGGCCCGCCAGGAGATAGGCGTTCTGATAGGTGAACTGGCTCGCGGCAATTTAGGTGCGCTGAGGGGATCCGGGATAACGCTGGCTAACCGTGCCGGATGGATAGACACACTGATGTCACCGAAAGGCATGATGCTGGGCGGGGTTATTGGCGGTATTGCCGCGGCCGTCTATGGTCTGGGTAAAGCCTGGTATGATGGTCAGAAGGAGGGGGAAGAATTTAACCGCCAGTTGTCGCTGACGGGGCATTATGCCGGAGTCACTGCCGGGCAGCTGTGGACGCTCAGTCGTGCTATTTCCGGGAATGGTATTACGCAACATGCTGCAGCCGGTGCGCTGGCTCAGGTGGTGGGGAGTGGTGCATTTCGTGGAAACGATATCGGTATGGTGGCGAGAGCTGCCGCACAGATGGAGCGATCGGTTGGCCAGTCGGTCAGCGATACCATAAATCAGTTTAAGCGGCTGAAGGATGATCCTGTAAATGCCGCGAAGGCTCTGGACAATGAGCTGCATTTTCTTACTGCCACTCAGCTTGAGCAGATACGCGTCCTTGGGGAACAGGGGCGGTCCAGTGATGCGGCACGGATAGCCATGTCTGCACTGGCAGAGGAAACCGGTCGGCGTACTGCGGATATTGATAATAACCTCAATGCGCTGGGCAGTACGCTGAAGTATCTGTCTGATTTATGGAGTAGTTTCTGGGATGCGGCCATGAATATTGGTCGTGAAGACTCGCTGGATGAACAGATTTCCGCTTTACAGGAGAAAGTGTCGCGGGCGAAAAGACTCCCCTGGACGGCATCATCTTCTCAGGTTGAGTACGATCAGCAGCGTCTTAACGAGCTTCAGGAGAAAAAACGCCAGAAGGATTTGCAGGATGCAAAAGAGCAGGCAGAGCGCAATTATCAGGAGCAACAGAAACGCCGTAATGCTGAAAATGCTGCACTGAACCGGATGAATGAAACGGAAGCAGCACGACATCAGCGTGAAATTGCGCGTATTAATGCCATGCAGTACGCCGATCAGGCTGTCAGGGATGCGGCGATACAACGTGAAAATGAACGTTACGAGAAAGCCCTGGCATCCGGTAAGAAAAAAACACGCGAACCCCGTAATGATGAGGCCACCCGGTTATTGCTGCAGTACAGTCAGCAACAGGCACAGGTGGAAGGACAGATTGCTGCTGCCAGACAGTCAGCAGGCATTGCCACGGAAAGGATGACAGAAGCGCATAAACAGCTTCTGGCTCTGCAGCAGCGCATCAGCGACCTGGACGGGAAAAAACTGACGGCAGATGAAAAGAGTGTGCTGGCCCGTAAAGATGAACTGATTCAGGCACTGACGCTGCTGGATGTAAAACAGCAGGAGCTTCAGAAACAGACGGCACTCAACGAGCTGAAGAAAAAAACAATTCAGCTGACCAGTCAACTGGCTGAAGAAGAGCGCGCTCAGCGTCAGCAACATGACCTGGATATCGCCACGGTGGGTATGGGTGATCAGCAGCGGCAGCGATATCAGGTACAACTGAGTCTTCGCCAGAAATACCAGCAACAGCTGGAGCAGTTGAGGCGGGATAGTGAGCAGAAAGGAACATATAACACGGATGACTACAGAAAGGCCGAGCAGGCGCTGACGGAGAGCCTGAACCGACAACTGAATGAGAATCGCCGTTACTGGCAACAGCTTGAAGTTGTGCAGGGTAACTGGAAAAACGGAGTCCTGCGTGCATTTCAGGATTTTACCGTGGATGCAGATAATACGGCAGGAACAGCAGAACAGGTGTTCTCGTCAGCCTTCAGCAACATGGGAAATGGCCTGGCAACTTTTGTCACTACCGGCAAACTCAATTTCAAATCCTTCACCTCTTCTGTGCTGTCAGATATGGCGAAAATCCTGGCGCAGGCAACCATGATGAAATCGATAAAAGGGATTGGCAGTGTACTGGGATTTGATCTCAGCAGCCTTTCCCTGAATGCCAATGGGGGGATTTATCAGTCTGCTGATTTGAGTCGTTACAGTGGCACGGTGGTTAACCGTCCGACGTTTTTTGCTTTTGCAAAAGGCGCGGGTGTGATGGGGGAAGCGGGACCTGAAGCCATTCTGCCACTGCGTCGTGATGCTGACGGTAAGCTGGGGGTTGTGGCGGATATTGGTGGTTCAGGTATGGCGATGTTTGCCCCGCAGTACAACATCGAGATCAATAACGATGGCACGAACGGGCAGATAGGTCCGGCTGCCCTGAAGGTGGTTTATGACCTCGGGAAAAAAGCAGCAGCGGACTTTATGCAACAGCAGGCCCGTGATGGTGGTCGATTAAGTGGAGCATATCGGTAATGGAGACGTTTCACTGGAAAGTGCGCCCGGATATGAATGTGGTATCAGAGCCGAAAGTGGTGACAGTGAAGCTGGGCGATGGTTATGAACAGCGTCGTGTGGCGGGACTGAATAACCAGTTGTCGACTTACAGCGTGACGATACGTGTTCGTAAATGTGAACACCCATCTTTAAAAGCCTTTCTGGAACGGCACGGTGGCGTCCGCGCATTTCAGTGGACGCCACCTTATGACTGGAAGCCGATCAGGGTGGTTTGTCGTAAATGGTCGGCGAGCGTAGGGGCGCTGTGGGTAACCATAACGGCAGATTTTGAACAGGTCGTGGCATAGGAGGCTCTGATGCAGGATATTCCACAGGAAACACATCATGAGACGACACGCCTCACTCAGTCAGCCCTGGTGGTGCTCTGGGAAATCGATCTGACAGAGGTCGGTGGAGAACGTTATTTTTTCTGTAATGAGCAGAACGAAAAAGGTGAGCCGGTTACCTGGCAGGGGCGGCAGTATCAGGCATACCCCATTCAGGGGACGGGATTTGAACTGAATGGCAAGGGCAGTGCTGCCCGTCCGACACTGACGGTTTCTAACCTGCACGGCATGGTCACCGGGATGGCGGAAGACCTGCAGAGTCTGGTCGGCGGAACGGTGGTCAGGCGTAAGGTTTACGCCCGTTTTCTGGATGCGGTGAACTTCGTCAACGGAAACAGCGACGCCGATCCGGAGCAGGAGGTGATCAGCCGCTGGCGCATCGAGCAGTGCAGCGAACTGAGTGCGGTCAGTGCCTCCTTTGTACTGTCCACACCGACGGAAACGGATGGTGCCGTTTTTCCGGGGCGCATCATGCTGGCTAATACCTGCACCTGGACCTATCGCGGTGATGAGTGCGGTTATCACGGTCCGGCTGTCGCGGATGAATATGATCAGCCGACGTCCGATATCACGAAGGATAAATGCAGCAAATGCCTGAATGGCTGTAAGTTTCGCAATAACGTCGGCAACTTTGGCGGCTTCCTTTCCATTAACAAACTTTCGCAGTAAATCCCATGACAGAGACAGAATCAGCGATTCTGGCGCACGCCCGGCGATGTGCGCCAGCGGAGTCGTGCGGCTTCGTGGTGAGAACGCCGGAAGGGGAAAGATATTTTCCCTGCGTGAATATCTCCGGTGAGCCGGAGGCGTATTTCCGGATGTCGCCGGAGGACTGGCTGCGGGCAGAGATGCAGGGTGAGATTGTGGCGCTGGTCCACAGCCACCCCGGTGGTCTGCCCTGGCTGAGTGAGGCCGACCGGCGGCTGCAGGTGCAGAGTGATTTGCCGTGGTGGCTGGTCTGCCGTGGGGAGATTCATAAATTCCGCTGTGTGCCGTATCTCACCGGGCGGCGCTTTGAGCACGGGGTGATGGACTGTTACACGCTGTTCCGGGATGCTTACCATCTGGCGGGGATTGAGATGCCGGATTTTCATCGCGAGGATGACTGGTGGCGTCACGGTCAGAATCTCTATCTGGATAATCTGGAGGCCACAGGGCTGTATCAGGTGCCGTTGTCAGCGGCGCAGCCGGGCGATGTGCTGCTGTGCTGTTTTGGTTCATCGGTGCCGAATCATGCCGCCATTTACTGCGGCGACGGTGAGCTGTTGCACCATATTCCTGAACAACTGAGCAAACGAGAGAGGTATACCGACAAATGGCAGCGACGCACACACTCCCTCTGGCGTCACCGGGCATGGCACGCATCTGCCTTTACGGGGATTTACAACGATTTGGCTGCCGCATCGACCTTCGTGTAAAAACGGGGGCTGAAGCCATCCGGGCGTTGTCCACACAGCTCCCGGCGTTTCGTCAGAAACTGAATGACGGCTGGTATCAGGTGCGCATTGCCGGGCGTGATGCAGGTGAAACCGAATTATCTGCCCGTCTTAATGAGCCGCTGGCAAATGGTGCCGTGATCCACATCGTGCCGCGTCTGGCGGGAGCTAAAAGTGGCGGTGTGTTTCAGGTGGTGTTGGGGGCGGCGCTGATTGCTGTGGCATGGTGGAACCCTGTGGGCTGGCTGGGTGCCGCGGCTGTATCGGGCATGTATGCGGCAGGGGCCAGTATGATCCTGGGTGGTGTGGCCCAGATGCTGGCACCGAAAGCCCGGACGCCCACAGCGACCAGCACGGATAACGGTAAGCAGAACACCTATTTTTCATCACTGGATAACATGGTTGCCCAGGGCAATGTTCTGCCTGTTCTGTACGGTGAAATGCGTGTGGGGTCTCGTGTGGTTTCTCAGGAGATCAGCACGGCAGATGAAGGGGACGGTGGTCAGGTTGTGGTGATTGGCCGCTGATGCAAAATGTTTTATGTGAAACCGCCTCCGGGCGGTTTTGTCGTTTATGGAGCATGACGAATGGGCAAAGGCAGCAGTAAGGGGCATACCCCGCGCGAAGCGAAGGACAACCTGAAATCCACGCAGTTACTGAGTGTGATCGATGCCATCAGCGAAGGGCCGGTTGAAGGTCCGGTGGATGGATTAAAAAGCGTGCTGCTGAACAGTACACCGGTGCTGGACAGTGAGGGGAATACCAACATCGCCGGTGTCACGGTGGTGTTCCGGGCAGGTGAACAGGAGCAGACACCGCCGGAGGGATTTGAATCCTCCGGATCCGAGACGGTGCTGGGTACGGAAGTGAAATACGACACGCCGATCACCCGGACCATCACGTCTGCAAACATCGACCGTCTGCGCTTTACCTTCGGTGTGCAGGCACTGGTGGAAACCACCTCAAAGGGGGACCGGAATCCATCGGAAGTCCGCCTGCTGGTTCAGATACAACGTAACGGTGGCTGGGTGACGGAAAAAGACATCACCATTAAGGGCAAAACCACCTCGCAGTATCTGGCCTCGGTGGTGGTGGGTAACCTGCCGCCGCGCCCGTTTAATATCCGGATGCGCAGGATGACGCCGGACAGCACCACAGACCAGCTGCAGAACAAAACGCTCTGGTCGTCATACACCGAAATCATCGATGTGAAACAGTGCTACCCGAACACGGCACTGGTCGGCGTGCAGGTGGACTCGGAGCAGTTCGGCAGCCAGCAGGTGAGCCGTAATTATCATCTTCGCGGACGCATTCTGCAGGTGCCGTCGAACTATAACCCGCAGACGCGGCAATACAGCGGTATCTGGGACGGAACGTTTAAGCCAGCATACAGCAACAACATGGCCTGGTGTCTGTGGGATATGCTGACCCATCCGCGCTACGGCATGGGGAAACGTCTTGGTGCGGCGGATGTGGACAAATGGGCGCTGTATGTCATCGGCCAGCATTGCGAGCAGTCGGTGCCGGACGGTTTTGGCGGCACGGAGCCGCGCATCACCTGTAATGCGTACCTGACCACACAGCGTAAGGCGTGGGATGTTCTCAGTGATTTCTGCTCGGCGATGCGCTGTATGCCGGTATGGAACGGGCAGACGCTGACGTTCGTGCAGGACCGACCGTCGGATAAGGTGTGGACCTATAACCGCAGTAATGTGGTGATGCCGGATGATGGCGCGCCGTTCCGCTACAGCTTCAGCGCCCTGAAGGACCGCCATAATGCCGTTGAGGTGAACTGGATTGACCCGAATAACGGCTGGGAGACGGCGACAGAGCTTGTGGAGGACACGCAGGCCATTGCCCGTTACGGTCGTAACGTCACGAAGATGGATGCTTTTGGCTGTACCAGCCGGGGGCAGGCACACCGCGCCGGGCTGTGGCTGATTAAAACGGAGCTGCTGGAAACGCAGACCGTGGACTTCAGCGTGGGCGCAGAAGGGCTTCGCCATGTACCGGGCGATGTCATTGAAATCTGCGATGATGACTATGCGGGCATCAGCATCGGCGGGCGCGTGCTGGCGGTGAACAGCCAGACGCGGACACTGACGCTCGACCGTGAAATCACGCTGCCATCCTCCGGTACCACGCTGATAAGCCTGGTTGACGGAAGTGGCAATCCGGTCAGCGTGGAGGTCCAGTCCGTCACCGACGGCGTGAAGGTAAAAGTGAGCCGTGTTCCTGACGGCGTTGCCGGATACAGCGTGTGGGGGCTGAAGCTGCCGACGCTGCGCCAGCGCCTGTTCCGCTGCGTGAGTATCCGTGAGAACGACGACGGCACGTATGCCATCACCGCCGTGCAGCATGTACCGGAAAAAGAAGCCATCGTGGATAACGGGGCGCACTTTGACGGCGACCAGAGCGGCACGGTGAACGGGGTCACGCCGCCAGCGGTGCAGCACCTGACCGCAGAAGTCACCGCAGACAGCGGGGAATATCAGGTGCTGGCGCGCTGGGACACGCCGAAGGTGGTGAAGGGGGTGAGTTTTATGCTTCGCCTGACCGTGGCCGCGGATGACGGCAGTGAGCGGCTGGTCAGCACGGCCCGGACGACGGAAACCACATACCGCTTCACGCAACTGGCGCTGGGGAACTACAGGCTGACAGTCCGGGCGGTAAATGCGTGGGGACAGCAGGGCGATCCGGCATCGGTATCGTTCCGGATTGCCGCACCGGCAGCGCCGTCACAGATTGAGCTGACGCCGGGCTATTTTCAGATAACTGCCACGCCGCATCTTGCGGTTTATGATCCGACGGTACAGTTTGAGTTCTGGTTCTCGGAAACGCGGATTACCGATATCAGGCAGGTTGAAACCACAGCCCGCTACCTTGGCACGGGGCTGTACTGGATAGCCGCCAGTATCAATATCAAACCGGGCCATGATTATTACTTTTATATCCGCAGTGTGAACACCGTTGGCAAATCGGCATTCGTGGAGGCCGTCGGTCGGGCGAGCGATGATGCGGAAGGTTACCTGGATTTTTTCAAAGGCAAGATAACCGAATCCCATCTCGGCAAGGAGCTGCTGGAAAAAGTCGAGCTGACGGAGGATAACGCCAGCAAACTGGAGGAGTTTTCGAAAGAGTGGCAGGACGCTAACGATAAGTGGAATGCCATGTGGGGCGTCAAAATTGAGCAGACCAAAGACGGCAAACATTATGTCGCGGGTATTGGCCTCAGCATGGAGGACACGGAAGAAGGCAAGCTGAGCCAGTTTCTGGTTGCCGCTAACCGTATCGCGTTTATTGACCCGGCAAACGGGAATGAAACGCCGATGTTTGTGGCGCAGGGCAATCAGATATTTATGAACGACGTGTTCCTGAAGCGCCTGACGGCCCCGACCATTACCAGTGGTGGAAATCCACCGGCATTTTCCCTGACGTCAGACGGAAAGCTGACCGCTAAAAATGCGGATATCAGTGGCAGTGTGAATGCGAACGCCGGGACGCTCAACAATGTCACGGTAAATGAAAACTGTACGATTAAGGGCATGCTGGAGGCGACTCAGGTCAGAGGTGACTTCGTTAAAGCTGTATCCAAATCATTCCCGAAACAGGCTGGTACGTGGGGTAACACGGAAACACCAAACGGGACGGTTACAGTCACCATCAGTGATGATCATAACTTTGACCGTCAAATCATTATTCCGCCCATTATCTTTAACGGAATAGCGTATAGCGATCCGGGAAGTGGTAATAACCCGGGAGGTACAAGATACACGGGTTATGGTTTTGAAGTTCGCAAAAACGGTGTATTAATCGCATCCAGAGAAACTAAAGGGGCCATTCCCGGTAGCTACAGTGCGGTTATTGATATGCCGAGTGGCAGGGGAAGCGTCACTCTGGAGTTTAAGGTTTTCCATAAAGGCAATCAGTGGGCAGGTAATATCACCGACTGTACGGTGATTGTGACCAAAAAAGCCGCTTCCGGCATCAGTATTCGTTGAAATTGTTATAACCCATATAAGGGCACCAGAAATGGTGCCTTTTTTATTGCAGAAAAGCGAGAGGTAATTATGCGTAAAGTTTGTGCAGCAATTTTGTCCGCAGCCATTTGTCTGGCCGTATCCGGTGCGCCTGCATGGGCATCTGAACATCAGTCCACGCTGAGCGCGGGGTATCTTCATGCCTCGACCAACGTTCCCGGCAGCGATGATCTGAACGGGATTAACGTGAAATACCGTTATGAGTTTACGGATACGCTGGGGATGGTGACGTCATTCAGCTATGCAGGAGACAAGAATCGCCAGCTGACCCGTTACAGCGATACCCGCTGGCATGAAGATTCTGTGCGTAACCGCTGGTTCAGCGTGATGGCGGGGCCCTCTGTGCGCGTGAATGAATGGTTCAGTGCGTATGCGATGGCGGGCGTGGCTTACAGCCGTGTGTCGACTTTTTCCGGGGATTATCTCCGCGTAACTGACAACAAGGGGAAAACGCATGATGTGCTGACCGGAAGTGATGACGGTCGCCACAGCAACACGTCTCTGGCGTGGTGGGCTGGCGTGCAGTTTAACCCGACCGAATCCGTGGCCATTGATATTGCTTATGAAGGTTCCGGCAGTGGTGACTGGCGCACTGATGGTTTCATCGTGGGTGTCGGTTATAAGTTCTGATTAGCCAGGTAACACAGTGTTATGACAGCCCGCCGTTTCAGGCGGGCTTTTTTGTGGGGTGAATATGGCAGTAAAGATTTCAGGTGTACTGAAAGACGGCACAGGAAAACCGGTACAGAACTGCACAATCCAGCTGAAAGCAAAACGTAACAGCACCACGGTGGTGGTGAACACGCTGGCATCTGAAAATCCGGATGAAGCCGGGCGTTACAGTATGGACGTTGAGTACGGTCAGTACAGCGTTATTCTGTTGGTGGAGGGATTCCCGCCGTCACATGCCGGGACCATCACCGTGTATGAAGATTCTCAACCCGGTACGCTGAATGATTTTCTCGGTGCCATGACGGAGGATGATGCCCGTCCGGAGGCACTGCGCCGTTTTGAGCTGATGGTGGAAGAGGTGGCGCGTAACGCGTCCGCAGTGGCACAGAACACGGCAGCCGCAAAGAAGTCAGCCAGCGATGCCGGCACATCTGCCCGTGAGGCGGCAACCCATGCGACTGATGCTGCAGACTCAGCACGCGCAGCCAGCACGTCAGCCGGACAGGCCGCGTCGTCGGCTCAGTCAGCGTCTTCCAGCGCAGGAACGGCATCAGCAAAGGCCACTGAAGCGGAAAAAAGTGCTGCCGCTGCAGAGTCCTCAAAAAGCGCGGCAGCTACCAGTGCCGGTGCCGCGAAAACGTCTGAAACGAATGCTTCAGCGTCACAACAATCAGCCGCCACTTCTGCATCCACCGCGACCACGAAAGCGTCAGAAGCAGCCACTTCAGCACGGGATGCGTCGGCCTCAAAAGAGGCAGCGAAATCATCAGAAACGAACGCAGCCTCGAGCGCCAGTAGTGCCGCTTCCTCGGCAACGGCGGCAGCAAATTCTGCGAAGGTGGCAAAAACGTCCGAGACGAACGCCAGGTCTTCTGAAACGGCAGCGGGACAGAGCGCCTCAGCTGCGGCAGGCTCAAAAACAGCGGCTGCATTATCTGCCAGTGCCGCGTCAACAAGTGCCGGGCAGGCCTCAGCCAGTGCCACCGCCGCCGGAAAATCGGCAGAAAGTGCTGCATCGTCTGCTTCAACAGCCACAACGAAGGCTGGCGAAGCCACTGAACAGGCCAGCGCAGCAGCGAGTTCTGCTTCCGCAGCGAAGACATCCGAAACGAACGCGAAAGCGTCGGAAACCAGCGCAGAATCCTCAAAAACGGCTGCCGCATCGTCAGCCAGTTCGGCGGCGTCATCGGCATCATCTGCGTCTGCTTCAAAAGATGAGGCGACCAGACAGGCGTCAGCAGCAAAGGGCAGCGCCACGACGGCATCCACGAAGGCGACAGAGGCAGCTGGCAGTGCGACGGCGGCAGCTCAGAGCAAAAGTACGGCGGAATCCGCGGCAACGCGCGCCGAGACAGCGGCAAAACGGGCAGAGGATATTGCATCCGCCGTGGCGCTTGAGGATGCGAGCACGACGAAAAAGGGGATAGTCCAGCTAAGCAGCGCGACCAACAGCACTTCCGAGTCACAGGCGGCAACGCCAAAAGCCGTTAAGGCCGCGTATGAGCTGGCTAACGGGAAATACACCGCACAGGATGCAACGACAGCACAGAAAGGGATAGTTCAGCTTAGCAACGCGACCAACAGCACATCTGAAATGCTGGCGGCAACGCCAAAGTCGGTAAAGGCAGCCTATGACCTTGCTAACGGGAAATATACTGCTCAGGACGCTACGACAGCACAAAAAGGAATTGTCCAGCTCAGTAGTGCAACCAACAGCGCATCTGAAACGCTTGCCGCGACACCGAAAGCAGTGAAAGCAGCTAATGATAATGCGAATGGTCGGGTACCTTCTGCCCGTAAGGTGAATGGTAAGGCGCTTTCAGCGGATATAACACTGACGCCGAAAGATATTGGTACGCTTAACTCAACAACAATGTCATTCAGCGGTGGTGCTGGTTGGTTCAAATTAGCAACGGTAACCATGCCACAGGCGAGTTCTGTTGTTTCAATTACGTTGATTGGTGGCGCGGGATTTAACGTGGGGTCACCTCAACAGGCAGGTATATCTGAACTTGTTTTGCGTGCAGGTAATGGTAATCCGAAGGGGATTACTGGTGCTTTATGGCAGCGCACATCGACAGGGTTTACAAATTTTGCCTGGGTCAATACATCTGGTGATACTTACGATATTTACGTTGCAATCGGAAATTATGCGACTGGTGTAAATATTCAATGGGATTATACCAGTAATGCCAGCGTGACGATTCATACGTCACCAGCATATTCTGCTAATAAGCCGGAAGGGTTAACGGACGGTACAGTTTATTCACTCTATACGCCATCAGAGCAGTTTTATCCGCCTGGCGCACCAATCCCGTGGCCATCAGATACCGTTCCGTCTGGCTATGCCCTGATGCAGGGGCAGGCTTTTGACAAATCTGCATACCCGAAACTTGCAGCGGCTTATCCGTCAGGCGTGATCCCTGATATGCGTGGCTGGACGATTAAGGGCAAACCTGCCAGTGGTCGGGCCGTATTGTCTCAGGAACAGGACGGCATTAAATCGCATACCCACAGCGCCAGCGCATCCAGTACGGATTTGGGGACGAAAACCACATCGTCGTTTGATTACGGCACTAAATCCACGAATAACACTGGTGCGCATACCCATAGTTTAAGTGGCAGCACGAATGCGGCTGGTAATCACAGCCATAGAGATGGCCGTCGATTTAACCCCAGTGTTTTTAAAGATACTTATCAATATGGTTATACAAGCTCAGGTCAAAATACCTGGGGTGTACAAGGCTCAGTAGGTATGTCTACGGGGTGGTTAGCGAATACCAGTACAGATGGTAATCATAGCCACTCACTGTCCGGCACAGCAGCATCTGCAGGTGCACACGCGCATACTGTCGGTATTGGTGCTCATACGCACTCCGTTGCGATTGGTTCACATGGACACACCATCACCGTTAACGCTGCTGGTAACGCGGAAAACACCGTCAAAAACATCGCATTTAACTATATTGTGAGGCTTGCATAATGGCATTCAGAATGAGTGAACAACCACGGACCATAAAAATTTATAATCTGCTGGCCGGAACTAATGAATTTATTGGTGAAGGTGACGCATATATTCCGCCTCATACCGGTCTGCCAGCAAACAGTACCGATATTGCACCGCCAGATATTCCGGCTGGCTTTGTGGCTGTTTTCAACAGTGATGAGGCATCGTGGCATCTCGTTGAAGATCATCGGGGAAAAACCGTCTATGACGTGGCTTCCGGCGACGCGTTATTTATTTCTGAACTTGGCTCGTTACCGGAAAATGTCACCTGGTTATCCCCGGAAGGGGAATATCAGAAGTGGAACGGCACAGCCTGGGTGAAGGATACGGAAGCAGAAAAACTGTTCCGGATCCGGGAGGCGGAAGAAACAAAAAACAGCCTGATGCAGGTAGCCAGTGAGCATATTGCGCCGCTTCAGGATGCTGTGGATCTGGAGATCGCAACGGAGGAAGAAACCTTGTTGCTGGAAGCCTGGAAAAAGTATCGGGTGTTGCTGAACCGTGTTGATACATCAACTGCACCGGATATTGAATGGCCAGCCCTACCGTATGGTAAGATATATAAATTCTATAATTAGAAGTATCTTTCCATTTAAGGCTAGGAAGGGGGGCTTGGAAAACGTAAGGAATCTCACACCGAGATTATTTTTATATATCAGGCGTCTGATTTTTTGCTTTAGCTCTTAAAATGGTTTGCCGCGAGGTTTTGAATTCTTGGGCAATGTCACTTATATTTATACCTGACTTAATTTGCTCTAGTACCCTCTGTATTTGTTCATCATTTAACACAGGTGGGCGACCAAATCGCTTTCCTGCGCCCCGTGCTCTTAATATCCCTGAATGAGTACGCTCAAGTAAAAGGTCTCGCTCAAATTCAGCGACTGCTGAAATTACTTGCATCATCATTTTTCCAGGTGGACTGGTCAGGTCAACGCCCCCCAATGCTAAGCAATGCACTTTGATACCTGCTTTGGTCAGTTGTTCCACCGTTTTACTGATATCCATTGGATTACAACCGAGGCGATCCAGTTTTGTGACAATCAATGCATCACCTCTTTTCAGTCGATCAAGCAACTGGTTAAAACCAGGACGCTCACTGGTTGCTGCTGAGCCGCTAATTTGTTCTTCAATTATTTGCTGAGATTTGATGTTAAAACCTGCACTTTCGATTTCCCGACGTTGATTTTTGGTTGTCTGTTCCAGAGTTGATACCCGACAGTAAGCATAAATTCGAGACATAGTGAGATCTTCTATACGAAATTGGTGTACATATCATAATGCATCTCAGAAAATAATTTTGATTATTTTTGTACATATTTGTATGTACACGTTCGAAAATAAACGAATGCGTATGCAACCCCGTAATTTTGGTGAGACCCAAAATCGATTTTGTGAAAAATGGCCTTAACTCGGTTTGTTTTTCGAGTTCCGGGCGGACTCAAGGAAGAAGAATAGTGTTGCGTGTTATTTTAACCAGATTTCAAGTTGTTTGGTCGTGGAAAAGTGGAGCAAAATGTTGTTAAAGTGGAAAAATGATAAAAAAGTAAGTTTATTATATTACATTTTACCATTTAAATTTTTGTTGTCTTTAAGAACTGATATCGCTGTTTGTAATAATTCTTTGTTATCCAGCCATGACTTTTTCTTTATGTTTCCTTCAATGTAATCAAGCAATGTTCTGGTATTGATAGGTCTTCCCTGTTTTGCTACTTCCACTACAGCATCCCCTAGGATAATTCTTACTTCAGGAAGCTGCGCAGGGAACCACTTTAGGGTGTCTTTTGATTTCAT
Protein sequences of DBSCAN-SWA_7 >CP029164|3221809:3273222|3264299_3267713_+|AWH70850.1|DBSCAN-SWA MGKGSSKGHTPREAKDNLKSTQLLSVIDAISEGPVEGPVDGLKSVLLNSTPVLDSEGNTNIAGVTVVFRAGEQEQTPPEGFESSGSETVLGTEVKYDTPITRTITSANIDRLRFTFGVQALVETTSKGDRNPSEVRLLVQIQRNGGWVTEKDITIKGKTTSQYLASVVVGNLPPRPFNIRMRRMTPDSTTDQLQNKTLWSSYTEIIDVKQCYPNTALVGVQVDSEQFGSQQVSRNYHLRGRILQVPSNYNPQTRQYSGIWDGTFKPAYSNNMAWCLWDMLTHPRYGMGKRLGAADVDKWALYVIGQHCEQSVPDGFGGTEPRITCNAYLTTQRKAWDVLSDFCSAMRCMPVWNGQTLTFVQDRPSDKVWTYNRSNVVMPDDGAPFRYSFSALKDRHNAVEVNWIDPNNGWETATELVEDTQAIARYGRNVTKMDAFGCTSRGQAHRAGLWLIKTELLETQTVDFSVGAEGLRHVPGDVIEICDDDYAGISIGGRVLAVNSQTRTLTLDREITLPSSGTTLISLVDGSGNPVSVEVQSVTDGVKVKVSRVPDGVAGYSVWGLKLPTLRQRLFRCVSIRENDDGTYAITAVQHVPEKEAIVDNGAHFDGDQSGTVNGVTPPAVQHLTAEVTADSGEYQVLARWDTPKVVKGVSFMLRLTVAADDGSERLVSTARTTETTYRFTQLALGNYRLTVRAVNAWGQQGDPASVSFRIAAPAAPSQIELTPGYFQITATPHLAVYDPTVQFEFWFSETRITDIRQVETTARYLGTGLYWIAASINIKPGHDYYFYIRSVNTVGKSAFVEAVGRASDDAEGYLDFFKGKITESHLGKELLEKVELTEDNASKLEEFSKEWQDANDKWNAMWGVKIEQTKDGKHYVAGIGLSMEDTEEGKLSQFLVAANRIAFIDPANGNETPMFVAQGNQIFMNDVFLKRLTAPTITSGGNPPAFSLTSDGKLTAKNADISGSVNANAGTLNNVTVNENCTIKGMLEATQVRGDFVKAVSKSFPKQAGTWGNTETPNGTVTVTISDDHNFDRQIIIPPIIFNGIAYSDPGSGNNPGGTRYTGYGFEVRKNGVLIASRETKGAIPGSYSAVIDMPSGRGSVTLEFKVFHKGNQWAGNITDCTVIVTKKAASGISIR >CP029164|3221809:3273222|3257342_3258086_+|AWH70843.1|tail|DBSCAN-SWA MTTPNPLAKTKGAGTTFWMYTGKGDAFANPLSDTDWLRLAMVKDLQPGEMTADAEDDTYLDDEDADWKTTTQGQKSVGDTSATLAWRPGDSGQKKLVQLFDSGEVCAFRIKYPNGTVDVFRGWLSSLGKTIASKDVMTRTVKISGVGRPYLAEEGTETVSVTGLTVAPASASVKVGATTTLTFTVKPDGASDKAISVHSTDPQTATVTLNGLVATVKGVKQGSVSIVGMTSDGDFVAVAAVAVSAAG >CP029164|3221809:3273222|3233988_3234177_-|AWH70810.1|DBSCAN-SWA MKTLLPNVNTSEDCFEIGVTISNPVFTEDAINKRKQERELLNKICIVSMLARLRLMPKGCAQ >CP029164|3221809:3273222|3267782_3268382_+|AWH70851.1|DBSCAN-SWA MRKVCAAILSAAICLAVSGAPAWASEHQSTLSAGYLHASTNVPGSDDLNGINVKYRYEFTDTLGMVTSFSYAGDKNRQLTRYSDTRWHEDSVRNRWFSVMAGPSVRVNEWFSAYAMAGVAYSRVSTFSGDYLRVTDNKGKTHDVLTGSDDGRHSNTSLAWWAGVQFNPTESVAIDIAYEGSGSGDWRTDGFIVGVGYKF >CP029164|3221809:3273222|3235225_3235378_-|AWH70812.1|DBSCAN-SWA MDTIDLGNSESLVCGVFPNQDGTFTAMTYTKSKTFKTESGARCWLARNTD >CP029164|3221809:3273222|3236245_3236767_+|AWH70815.1|DBSCAN-SWA MKITPEQAREALDAWICRPGMTQEQATILITEAFWALKERPNIDVQRVTYEGGAVDQRALSVNRVKIFERWKAIDTRDKREKFTALVPAIMEAIRISDFRLYREISDGKSITYMIAGLNKEYGDVVESGLLFADPAVVDRETDELIEKAIAFKLAYRQQYQQKAGWNYESSFC >CP029164|3221809:3273222|3258541_3258871_+|AWH70844.1|tail|DBSCAN-SWA MQFVMRLAREFRRADWRRMLSEMSATELGEWGDYFRMQSFSDVWMDAQFASLKALIVRMVSGSSDAAVADFSLLPEENGIPERTDEELMHLGEGISGGVRYGPDSQPGH >CP029164|3221809:3273222|3248619_3248826_+|AWH70834.1|DBSCAN-SWA MNKEQSADELSLDLIRVKNMLNSTISMSYPDVVIACIEHQVSLEAFRAIEAALVKHDKNSKDYSLVVD >CP029164|3221809:3273222|3224434_3224740_-|AWH70798.1|DBSCAN-SWA MKRSTCCAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQPDSQVVGHCANDTHKILYTRTTSGNVSAPAQSTQDGAPAEPQ >CP029164|3221809:3273222|3258842_3261908_+|AWH70845.1|tail|DBSCAN-SWA MDQIANLVIDLGIDAAEFKNEIPRIKNLLNGAASDAERSSARMQRFMERQTQAARQTMQAASSAATAASVHAQTVEKSAQAHERMAREVEQTRQRMEALSQKMREEQAQAMALAEAQDKAAAAFYRQIDSVKQASAGLQELQRIQQQIRQARNSGGIGQQDYLALISEVTAKTRVLTQAEAEATRQKVAFIRQLKEQATRQNLSSSELLRAKAAQLGVSSAAEVYIRKMEQAGKATHSLGLKSAAARQEIGVLIGELARGNLGALRGSGITLANRAGWIDTLMSPKGMMLGGVIGGIAAAVYGLGKAWYDGQKEGEEFNRQLSLTGHYAGVTAGQLWTLSRAISGNGITQHAAAGALAQVVGSGAFRGNDIGMVARAAAQMERSVGQSVSDTINQFKRLKDDPVNAAKALDNELHFLTATQLEQIRVLGEQGRSSDAARIAMSALAEETGRRTADIDNNLNALGSTLKYLSDLWSSFWDAAMNIGREDSLDEQISALQEKVSRAKRLPWTASSSQVEYDQQRLNELQEKKRQKDLQDAKEQAERNYQEQQKRRNAENAALNRMNETEAARHQREIARINAMQYADQAVRDAAIQRENERYEKALASGKKKTREPRNDEATRLLLQYSQQQAQVEGQIAAARQSAGIATERMTEAHKQLLALQQRISDLDGKKLTADEKSVLARKDELIQALTLLDVKQQELQKQTALNELKKKTIQLTSQLAEEERAQRQQHDLDIATVGMGDQQRQRYQVQLSLRQKYQQQLEQLRRDSEQKGTYNTDDYRKAEQALTESLNRQLNENRRYWQQLEVVQGNWKNGVLRAFQDFTVDADNTAGTAEQVFSSAFSNMGNGLATFVTTGKLNFKSFTSSVLSDMAKILAQATMMKSIKGIGSVLGFDLSSLSLNANGGIYQSADLSRYSGTVVNRPTFFAFAKGAGVMGEAGPEAILPLRRDADGKLGVVADIGGSGMAMFAPQYNIEINNDGTNGQIGPAALKVVYDLGKKAAADFMQQQARDGGRLSGAYR >CP029164|3221809:3273222|3245406_3245940_+|AWH70828.1|DBSCAN-SWA MNTKIRYGLSAAVLALIGAGASAPQILDQFLDEKEGNHTMAYRDGSGIWTICRGATVVDGKTVFPNMKLSKEKCDQVNAIERDKALAWVERNIKVPLTEPQKAGIASFCPYNIGPGKCFPSTFYKRLNAGDRKGACEAIRWWIKDGGRDCRIRSNNCYGQVIRRDQESALTCWGIEQ >CP029164|3221809:3273222|3244878_3245094_+|AWH70826.1|lysis|DBSCAN-SWA MKSMDKLTTGVAYGTSAGNAGFWALQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDRRKAARGE >CP029164|3221809:3273222|3224847_3225558_+|AWH72530.1|DBSCAN-SWA MKYKLLPCLLAIFLTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEVTKRVSGTLSEEGCFDSLELLDLENNTVVALVLDANYYRDAQTLEKRVRLQGKCQLAELPSAGVSWETDDNGFVIKASSKQMQMEYRYDDQGYPLGKTTKSNDKTLSVSATPSTDPIKKLDYTAVTLLNNQRVGNVKQSCEYDSHANPVDCQLIIVDEGVKPAVERVYTIKNTIDYY >CP029164|3221809:3273222|3241418_3242468_+|AWH70823.1|DBSCAN-SWA MRVLLRPVLVPELGLVVLKPGRESIQIFHNPRVLVEPEPKSMRNLPSGVVPAVRQPLAEDKTLLPFFSNERVIRAAGGVGALSDWLLRHITSCQWPNGDYHHTETVIHRYGTGAMVLCWHCDNQLRDQTSESLELLAQQNLTAWVIDVIRHAISGTQERELSLAELSWWAVCNQVVDALPEAVSRRSLGLPAEKICSVYRESDIVPGEQTATSILKQRTKNLAPLPYAHQQQKPPQEKTVVSINVDPESPESFMKLPKRRRWVKEKYTRWVKTQPCACCGMPADDPHHLIGHGQGGMGTKAHDLFVLPLCRKHHNELHTDTVAFEEKYGSQLELIFRFIDRALATGVLA >CP029164|3221809:3273222|3221809_3224236_-|AWH70797.1|DBSCAN-SWA MSKNEQMVGISRRTLVKSTAIGSLALAAGGFSLPFTLRSAAAAVQQASEKVVWGACSVNCGSRCALRLHVKDNEVTWVETDNTGSDEYGNHQVRACLRGRSIRRRINHPDRLNYPMKRVGKRGEGKFERISWDEALDTIASSLKKTVEQYGNEAVYIQYSSGIVGGNMTRSSPSASAVKRLMNCYGGSLNQYGSYSTAQISCAMPYTYGSNDGNSTTDIENSKLVVMFGNNPAETRMSGGGITYLLEKAREKSNAKMIVIDPRYTDTAAGREDEWLPIRPGTDAALVAGIAWVLINENLVDQPFLDKYCVGYDEKTLPADAPKNGHYKAYILGEGDDKTAKTPQWASQITGIPVDRIIKLAREIGTAKPAYICQGWGPQRQANGELTARAIAMLPILTGNVGISGGNSGARESTYTITIERLPVLDNPVKTSISCFSWTDAIDHGPQMTAIRDGVRGKDKLDVPIKFIWNYAGNTLVNQHSDINKTHEILQDESKCEMIVVIENFMTSSAKYADILLPDLMTVEQEDIIPNDYAGNMGYLIFLQPVTSEKFERKPIYWILSEVAKRLGPDVYQKFTEGRTQEQWLQHLYAKMLAKDPALPSYDELKKMGIYKRKDPNGHFVAYKAFRDDPEANPLKTPSGKIEIYSSKLAEIARTWELEKDEVISPLPVYASTFEGWDSPERSTFPLQLFGFHYKSRTHSTYGNIDVLKAACRQEVWINPIDAQKRGIANGDMVRVYNHRGEVRLPAKVTPRILPGVSAMGQGAWHEANMSGDKIDHGGCVNTLTTLRPSPLAKGNPQHTNLVEIEKI >CP029164|3221809:3273222|3262246_3262945_+|AWH70847.1|tail|DBSCAN-SWA MQDIPQETHHETTRLTQSALVVLWEIDLTEVGGERYFFCNEQNEKGEPVTWQGRQYQAYPIQGTGFELNGKGSAARPTLTVSNLHGMVTGMAEDLQSLVGGTVVRRKVYARFLDAVNFVNGNSDADPEQEVISRWRIEQCSELSAVSASFVLSTPTETDGAVFPGRIMLANTCTWTYRGDECGYHGPAVADEYDQPTSDITKDKCSKCLNGCKFRNNVGNFGGFLSINKLSQ >CP029164|3221809:3273222|3236034_3236262_+|AWH70814.1|DBSCAN-SWA MLKVDAITFFGSKTKLANAAGVRLASVAAWGILVPEGRAMRLQEASGGELQYDPKVYDEYRKTKRAGRLNNENHS >CP029164|3221809:3273222|3240029_3240233_+|AWH70819.1|DBSCAN-SWA MSPISTEKIRETIPNSCIESLTRQPHIHQDWCTSNTGGCGAGSQMCAGYYGCNGDLLSCYCSYGSPF >CP029164|3221809:3273222|3262950_3263694_+|AWH70848.1|tail|DBSCAN-SWA MTETESAILAHARRCAPAESCGFVVRTPEGERYFPCVNISGEPEAYFRMSPEDWLRAEMQGEIVALVHSHPGGLPWLSEADRRLQVQSDLPWWLVCRGEIHKFRCVPYLTGRRFEHGVMDCYTLFRDAYHLAGIEMPDFHREDDWWRHGQNLYLDNLEATGLYQVPLSAAQPGDVLLCCFGSSVPNHAAIYCGDGELLHHIPEQLSKRERYTDKWQRRTHSLWRHRAWHASAFTGIYNDLAAASTFV >CP029164|3221809:3273222|3229466_3229595_+|AWH70805.1|DBSCAN-SWA MTIEKHERSTKDLVKAAVSGWLGTALEFMDFKSHTCQLFDKY >CP029164|3221809:3273222|3237753_3238155_+|AWH70817.1|DBSCAN-SWA MAKVFTQEEREKIKGQVVELVRRSGRETLRQLEVKTGATRYLMSVLARELVASGDVYNSGYGLFPSEQARKDWQNARKKLSRAKLKKPSAVDPDLIWSLPDGEIRRYDRRQNIICRECRKSEVMQRILAFYQG >CP029164|3221809:3273222|3238688_3239714_+|AWH70818.1|DBSCAN-SWA MHILSVSKKEVGRAGDRLVSQFGTGEKYKEEDVQILHEWRMLHLYPLSKIQFYMEREAISLNKNALLSSRIKRMPSIVTKLSRFPDMKLNKMQDLGGCRAILNNLDQVYDLVNKIKSSKFSHELVRMDDYMIDVKDSGYRSFHMVYSFQNKKFPSLNGLRIEMQIRTAIQHSWATAVEMVGLFRKESLKSGFGDARWLRFFELVSELFYKLEYEKEPSGSYIKISEELSYLSVELNVFDILAAYNAVVSHIEGSKKYDKGLCIIVVDTIKRNINIKSFENHNHAKAAEAYVESEKYCAENKGCEVAMVSVSSISELKNAYPAYFLDTKTFLNYLSRYVFIK >CP029164|3221809:3273222|3245936_3246434_+|AWH70829.1|DBSCAN-SWA MNQIFMVIFLVLSGFIVGNVWSDRGWQKKWAERDAAALSQEVNAQFAARIIEQGRTIARDEAVKDAQQKSAEISARAAYLSDSVNQLRAEAKKYAIRLDAAKHTADLAAAVRGKTTKTAEGMLTNMLGDIAAEAQLYAEIADERYIAGVTCQQIYESLRDKKHQM >CP029164|3221809:3273222|3253541_3255665_+|AWH70839.1|DBSCAN-SWA MMSNVGARPKLMKTANWIWYLIPIRPVIKEAAVPQRNDMSRSTPTTSPKNNSWFRMQAGHQSDADIYIYDEIGFWGVTAKQFISDLNALGDITHINLHINSPGGDVFEGIAIFNALKTHGASITVYVDGVAASMASVIAMVGNPVIMPENTFMMIHKPFGFTGGDAEDMRTYADLLDKVEAVLLPAYAQKTGKTTDEIAAMLADETWMSGAECLAHGFADQVTPAVKAMACIQSKRTEEFKKMPESIRNMITPPRNSAPRVQDNEPEASRTPVQAAAPVVDENSIRAQVLAEQKARVNGINDLFAMFGGRYQTLQAQCLADPECSLEQAREKLLNEMGRESTPSNKNTPAHIYAGNGNFVGDGIRQALMARAGFEKTERDNVYNGMTLREYARMSLTERGIGVSSYNPMQMVGAAFTHSTSDFGNILLDVANKAILQGWEDAPETYEQWTRKGQLSDFKIAHRVGMGGFSALRQVREGAEYKYVTTGDKQATIALATYGELFSITRQAIINDDLNMLTDVPMKLGRAAKSTIADLVYAILTSNPKISTDNVSLFDKAKHANVLESAAMDVASLDKARQLMRVQKEGERHLNIRPAFVLVPTAMESVANQVIRSSSVKGADINAGIINPVKDFATVIAEPRLDDNSQTTFYLAASKGSDTIEVAYLNGVDTPYIDQMEGFSVDGVTTKVRIDAGVAPVDHRGLVKCTA >CP029164|3221809:3273222|3255706_3256075_+|AWH70840.1|DBSCAN-SWA MYLKSAPERGCAETVMAKNFVEEGKTVAIVASAAISSGDLVQVGDVFAVALTDIPQGETGDGMTEGVFILPKLKTDDMKTGKKVYLKSGKVQLTNSGSDPLVGVVWADAGTSAEEVPVKLNV >CP029164|3221809:3273222|3225560_3226121_-|AWH70799.1|DBSCAN-SWA MPSAHSVKLRPLEREDLRYVHQLDNNASVMRYWFEEPYEAFVELSDLYDKHIHDQSERRFVVECDGEKAGLVELVEINHVHRRAEFQIIISPEYQGKGLATRAAKLAMDYGFTVLNLYKLYLIVDKENEKAIHIYRKLGFTVEGELMHEFFINGQYRNAIRMCIFQHQYLAEHKTPGQTLLKPTAQ >CP029164|3221809:3273222|3258146_3258533_+|AWH72533.1|tail|DBSCAN-SWA MFLKQGTFNYEKQSVVLSELSGLQRIEYLAFVQQRTAKFDAEEGELPEAERQIAFLRMGMDINAWLVSRSLWNAEQSQDVETLCASVITTWSYDALGAGAEMVLSLSGMGAIENAGDLEHEVLTPEKS >CP029164|3221809:3273222|3252184_3253693_+|AWH70838.1|portal|DBSCAN-SWA MAILDDVIGVFSPGWKAARLRSRAVIQAYEAVKTTRTHKARRENRTADQLSQYGAVSLREQARYLDNNHDLVIGVFDKLEERVVGKNGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGRINSLTPSAGVHFWLEALEPDFIPMTSDESNRLNQGVFVDDWGRPEKYLVYKSRPVSGRQMETKEVDAERMLHLKFVRRLHQMRGTSLLSGVLIRLSALKEYEDSELTAARIAAALGMYIRKGDGQSYEADGNGSKDKERELTIQPGIIYDDLKPGEEIGMVKSDRPNPNLETFRNGQLRAVAAGSRLSFSSTARNYNGTYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAWKIQIRGGAATESDWVRAGGRNPDDVKRRRKAEIDENRKLDLVFDTDPASDKGGSSAATKRHEPQHTDDQSEE >CP029164|3221809:3273222|3234592_3234880_+|AWH70811.1|DBSCAN-SWA MRFRVVCLLLTTSGEVVSFLFRITKKEIYMTKEEFVSYIFDKTVEMYAATYGSCNPLNKPEGKDDFDKIYRFLEDRYIKRLEDAGIKSPVKSPLS >CP029164|3221809:3273222|3236693_3237713_+|AWH70816.1|DBSCAN-SWA MLSSLRIDSNTNKKLDGIMSLLFAERPLVINTQLAMKIGLNEAIVLQQLHYWLRDTNSGMECDGVRWIYNTTEQWLEQFPFWSESTLKRAFASLKTLGLLRCEKLNKSKRDMTNFYTINYGSELLDGGKLNESIGSKCAAPSGQNDTMEEVKMKRSIGSKRPNVIGSKWPDDLTENTTEITTENKNTFRPEASQPDPQTTEQDFLTRNPDAVVFSVKKRQWGSREDLACAQWIWGRIVNLYEQAASDDGEIMRPKEPNWTAWANDVRTMRMLDGRSHRQICEMFGRVQRDPFWVKNIMSPSKLREKWDELVIRLGRSPVQRCVNHISEPDTEIPPGFRG >CP029164|3221809:3273222|3272988_3273222_-|AWH70855.1|DBSCAN-SWA MKSKDTLKWFPAQLPEVRIILGDAVVEVAKQGRPINTRTLLDYIEGNIKKKSWLDNKELLQTAISVLKDNKNLNGKM >CP029164|3221809:3273222|3227163_3228378_+|AWH70803.1|DBSCAN-SWA MKIVKAEVFVTCPGRNFVTLKITTEDGITGLGDATLNGRELSVASYLQDHLCPQLIGRDAHRIEDIWQFFYKGAYWRRGPVTMSAISAVDMALWDIKAKAANMPLYQLLGGASREGVMVYCHTTGHSIDEALDDYARHQELGFKAIRVQCGIPGMKTTYGMSKGKGLAYEPATKGQWPEEQLWSTEKYLDFMPKLFDAVRNKFGFNEHLLHDMHHRLTPIEAARFGKSIEDYRMFWMEDPTPAENQECFRLIRQHTVTPIAVGEVFNSIWDCKQLIEEQLIDYIRTTLTHAGGITGMRRIADFASLYQVRTGSHGPSDLSPVCMAAALHFDLWVPNFGVQEYMGYSEQMLEVFPHNWTFDNGYMHPGDKPGLGIEFDEKLAAKYPYEPAYLPVARLEDGTLWNW >CP029164|3221809:3273222|3261907_3262237_+|AWH70846.1|tail|DBSCAN-SWA METFHWKVRPDMNVVSEPKVVTVKLGDGYEQRRVAGLNNQLSTYSVTIRVRKCEHPSLKAFLERHGGVRAFQWTPPYDWKPIRVVCRKWSASVGALWVTITADFEQVVA >CP029164|3221809:3273222|3228389_3229409_+|AWH70804.1|DBSCAN-SWA MKSILIEKPNQLAIIEREIPTPSAGEVRVKVKLAGICGSDSHIYRGHNPFAKYPRVIGHEFFGVIDAVGEGVESARVGERVAVDPVVSCGHCYPCSIGKPNVCTTLAVLGVHADGGFSEYAVVPAKNAWKIPEAVADQYAVMIEPFTIAANVTGHGQPTENDTVLVYGAGPIGLTIVQVLKGVYNVKNVIVADRIDERLEKAKESGADWAINNSQTPLGESFAEKGIKPTLIIDAACHPSILKEAVTLASPAARIVLMGFSSEPSEVIQQGITGKELSIFSSRLNANKFPVVIDWLSKGLIKPEKLITHTFDFQHVADAISLFEQDQKHCCKVLLTFSE >CP029164|3221809:3273222|3246797_3247010_+|AWH70830.1|DBSCAN-SWA MSNKMTGLVKWFNPEKGFGFITPKDGSKDVFVHFSAIQSNDFKTLTENQEVEFGIENGPKGPAAVHVVAL >CP029164|3221809:3273222|3268446_3271407_+|AWH70852.1|DBSCAN-SWA MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDAGTSAREAATHATDAADSARAASTSAGQAASSAQSASSSAGTASAKATEAEKSAAAAESSKSAAATSAGAAKTSETNASASQQSAATSASTATTKASEAATSARDASASKEAAKSSETNAASSASSAASSATAAANSAKVAKTSETNARSSETAAGQSASAAAGSKTAAALSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAASSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKGSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSESQAATPKAVKAAYELANGKYTAQDATTAQKGIVQLSNATNSTSEMLAATPKSVKAAYDLANGKYTAQDATTAQKGIVQLSSATNSASETLAATPKAVKAANDNANGRVPSARKVNGKALSADITLTPKDIGTLNSTTMSFSGGAGWFKLATVTMPQASSVVSITLIGGAGFNVGSPQQAGISELVLRAGNGNPKGITGALWQRTSTGFTNFAWVNTSGDTYDIYVAIGNYATGVNIQWDYTSNASVTIHTSPAYSANKPEGLTDGTVYSLYTPSEQFYPPGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSLSGSTNAAGNHSHRDGRRFNPSVFKDTYQYGYTSSGQNTWGVQGSVGMSTGWLANTSTDGNHSHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA >CP029164|3221809:3273222|3230911_3231163_-|AWH70807.1|DBSCAN-SWA MSEVIMIVSPGKWVSEEQLIALKGIKKGTLKKAREKSFMEGREYKHVAHDSMPWDNSPCFYNLEEIDRWIERQASARPRRHLT >CP029164|3221809:3273222|3272079_3272670_-|AWH70854.1|DBSCAN-SWA MSRIYAYCRVSTLEQTTKNQRREIESAGFNIKSQQIIEEQISGSAATSERPGFNQLLDRLKRGDALIVTKLDRLGCNPMDISKTVEQLTKAGIKVHCLALGGVDLTSPPGKMMMQVISAVAEFERDLLLERTHSGILRARGAGKRFGRPPVLNDEQIQRVLEQIKSGINISDIAQEFKTSRQTILRAKAKNQTPDI >CP029164|3221809:3273222|3247356_3247512_+|AWH72532.1|DBSCAN-SWA MREYPNGEKTHLTVMAAGFPSLTGDHKVIYVAADRHVTSEEILEAAIRLLS >CP029164|3221809:3273222|3249379_3249874_+|AWH70835.1|DBSCAN-SWA MDRELKNLTLNISQLAALSGVHRQTAAARLQNLPVAGGHESNLKLYRVVDIVSAFLALPPPVAESEMDAHERKAWYQSERERLKFEQETAQLIPASDVRREFAIWAKAVVQVLETLPDILERDCGLQPAAVSRVQSIIDDLRDQIALRVTEAGADDEEELQQEE >CP029164|3221809:3273222|3248008_3248419_-|AWH70833.1|DBSCAN-SWA MAQAAIFKEIFDQVRKDLNCELFYSELKRHNVSLYIYYLATDNIHIVLENDNIVLVKGLKKVVNVKFSRNKHLIETSYNKLKSKEITFQQYRENLAKAGVFRWVTNIQEHQRYYYAFDNSLLFTESIQKTTQILPR >CP029164|3221809:3273222|3256929_3257331_+|AWH72534.1|tail|DBSCAN-SWA MNRHTQIRQAVLARLREQCGDSATFFDGLPAFIDAQELPAVAVWLSDAQYTGKMTDEDDWQAVLHIAVFIRAQAPDSELDMWMESTIFPALNDIPALSGLIDTLIPLGFNYQRDNEMATWAMAEITYQITYTN >CP029164|3221809:3273222|3249873_3251976_+|AWH70836.1|terminase|DBSCAN-SWA MLNQETAKAARTDSGYILRAPRRMRVADAVAQYMRVPLGAGNSVPWDPLVAPYVIEPMNCLASREYDAVIFVGPARTGKTIGLIDGWVIYNVICDPADMLIIQMTEEKAREHSKKRLARTFRVSPEVVSRLSPNKNDNNVYDRTFLAGNYLKIGWPSVNIMSSSDYKCVALTDYDRFPEDIDGEGDAFSLASKRTTTFMSSGMTLVESSPGRDVKDVKWRRTSPHEAPPTTGILSLYNRGDRRRWYWPCPHCGEYFQPCGDVVAGFRDIADPVLASEAAYIQCPSCSGRIMPEQKRELNGRGVWLRDGESINADGSRYGDPRRSRIASFWMEGPAAAYQTLSQLVYKLLTAEQEYETTGSEETLKTVINTDWGLPYLPRASMEQRKSELLEQRAEPVPSRSVPDGVNFLVATVDVQAGRHRRFVVQVTGYGSRGERWIIDRYNITQSLRGDSDGESQRIDPASYPEDWDVLLTDVFHKSWPLASDPSQQMRLMAMAVDSGGEDGVTDNAYKFWRRCRRDGLGKRIYLFKGDSIRRAKLITRTFPDNTGRTGRRAQAAGDVPLWLLQTDALKDRVNNALWRDSPGPGYVHFPDWLGSWFYDELTYEERSSDGKWSKPGRGANEAFDLMVYAEALVILHGYEKIRWPDAPEWASRETWLECVPDSIEPSPSPEPVSTPVKKQKRKKTVTDDVNPWLTSGGWL >CP029164|3221809:3273222|3243909_3244125_+|AWH70825.1|DBSCAN-SWA MSNKMTGLVKWFNADKGFGFISPVDGSKDVFVHFSAIQNDNYRTLFEGQKVTFSIESGAKGPAAVNVIITD >CP029164|3221809:3273222|3241138_3241417_+|AWH70822.1|DBSCAN-SWA MARNVKYYNSDNSPVLACTHGRYSHAFKSEWFQHPPCTAEQAEWLIHSYCRRGFEVKKALSLDYRHWIISVRLPYSERPPRASRTFQQRIWR >CP029164|3221809:3273222|3256067_3256343_+|AWH70841.1|DBSCAN-SWA MSDPFSRLAARMDAITVRKMGKTASINDADMTVIPGETLAELNALSGPAVSLVVFSSGYRPRRGDRVVYDGQQWTVTRHERFNGKPMIFIE >CP029164|3221809:3273222|3231235_3233707_-|AWH70808.1|DBSCAN-SWA MSKVFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIVCEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDPMNVTFDKLAPEVQNAVMVKFDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHKCIPGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENSTGKISPVIAAIHREYKQTWKTLDDELAYALWPGDVDAGNIDGSIHRWAKKEVIDNDREDWKRISASMRKQPDALRYDRQTIFGLVRERPIDIHKDPIALNKYICEYLTTKGVFENEETDLGTVDVLQSSETQTDAVETEVSDIPKNETAPEAEPSVEREGPFYFLFADKDGEKYGRANKLSGLDKALAAGATEITKEEYFARKNGTYTGLPQNVDTAEDSEQPEPIKVTADEVNKIMQAANISQPDADKLLAASRGEFVEGISDPNDPKWVKGIQTRDSVNQNQHESERNYQKAEQNSTNALQNEPETKQPEPVAQQEVEKVCTACGQTGGGNCPDCGAVMGDATYQETFDEEYQVEVQEDDPEEMEGAEHPHKENTGGNQHHNSDNETGETADHSIKVNGHHEITSTSRTCDHLMIDLETMGKNPDAPIISIGAIFFDPQTGDMGPEFSKTIDLETAGGVIDRDTIKWWLKQSREAQSAIMTDEIPLDDALLQLREFIDENSSEFFVQVWGNGANFDNTILRRSYERQGIPCPWRYYNDRDVRTIVELGKAIDFDARTAIPFEGERHNALDDARYQAKYVSVIWQKLIPSQADS >CP029164|3221809:3273222|3235546_3235954_-|AWH70813.1|DBSCAN-SWA MDTRTLGQRVLARRKELRLTQREAARLAGVAHVTISQWERDETQPVGKRLFALADALKCSPTWLMFGDEDKAPVPAQELHVETELTPNHKELIELFDALPSSEQEALLSEMRARVENFNKLFEEMLKARKNKSIK >CP029164|3221809:3273222|3256354_3256933_+|AWH70842.1|tail|DBSCAN-SWA MKGLENAIRNLNSLDTRMVPQASAWAINRVAQKAVSVATRQVAGNTVAGDNQVKGIPLKLVRQRVRVFKASPSGKMTARIRVNRGNLPAIKLGTARVRLARRGGKLQYRGSVLKVGKYLFRDAFIQQLANGRWHVMRRIDGKNRYPIDVVKIPLSGPLTQAFEDARDRIIAAEMPKQLGYALKQQLRLWLTR >CP029164|3221809:3273222|3247683_3247857_+|AWH70832.1|DBSCAN-SWA MNIEDLKTKAEADISEYITKKIIELKKKTGKEVTSIQFTAREKMTGLESYDVKINLI >CP029164|3221809:3273222|3271406_3272009_+|AWH70853.1|tail|DBSCAN-SWA MAFRMSEQPRTIKIYNLLAGTNEFIGEGDAYIPPHTGLPANSTDIAPPDIPAGFVAVFNSDEASWHLVEDHRGKTVYDVASGDALFISELGSLPENVTWLSPEGEYQKWNGTAWVKDTEAEKLFRIREAEETKNSLMQVASEHIAPLQDAVDLEIATEEETLLLEAWKKYRVLLNRVDTSTAPDIEWPALPYGKIYKFYN >CP029164|3221809:3273222|3247020_3247209_+|AWH70831.1|DBSCAN-SWA MTNHIHFRCPCCHGSQYRTSSFDVSDMNPFGAKCIFCKSMMITFDNISQYLNASRLSLDLKK >CP029164|3221809:3273222|3251972_3252185_+|AWH70837.1|DBSCAN-SWA MNQNDIEAMIQRYTEAEMAVLDGKSVTFNGQQMTMENLSEIRQGRQEWERRLAALITRRRGHPGYRLARF >CP029164|3221809:3273222|3226155_3226497_-|AWH70800.1|DBSCAN-SWA MKITLSKRIGLLAFLLPCALALSTTVHAENNKLVIESGDSAQSRQRAAMEKEQWNDTRNLRQKVNKRTEKEWDKADAAFDNRDKCEQSANINAYWEPNTLRCLDRRTGRVITP >CP029164|3221809:3273222|3234848_3235214_-|AWH72531.1|DBSCAN-SWA MEFKDLPSDVQKTAAHTLHSVLREIGKDIASEPAKDLARKIKTAFVELYNVGTDSETVETKTVSSPIFSLEPGVLSGEICTEISSELLPVIREAICRRGLDGSYDHDVLQVLRTMVTSLGI >CP029164|3221809:3273222|3240391_3240604_+|AWH70820.1|DBSCAN-SWA MLDTCRLASYVPEGKEKQAMKQQKAMLIALIVICLTVIVTALVTRKDLCEVRIRTGQTEVAVFTAYEPEE >CP029164|3221809:3273222|3245098_3245410_+|AWH70827.1|DBSCAN-SWA MTQDYELVVKGVRNFENKVTVTVALQDKERFDGEIFDLDVAMDRVEGAALEFYEAAARRSVRQVFLEVAEKLSEKVESYLQHQYSFKIENPANKHERPHHKYL >CP029164|3221809:3273222|3226994_3227183_+|AWH70802.1|DBSCAN-SWA MSADKRYSISSFVNHSRRKYTPFFVITPAFFDLHTCMVVAQLRRFHASRQAMQGIEHEDRKG >CP029164|3221809:3273222|3240820_3241072_+|AWH70821.1|DBSCAN-SWA MMNIEELRKIFCEDGLYAVCVENGNLVSHYRIMCLRKNGAALINFVDGRVTDGFILREGEFVTSLQALKEIGIKAGFSAFAEE >CP029164|3221809:3273222|3242481_3243234_+|AWH70824.1|DBSCAN-SWA MNLEALPKYYSPKSPKLSDDAPATGTGCLTITDVMAAQGMVQSKAPLGLALFLAKVGVQDPQFAIEGLLNYAMALDNPTLNKLSEEIRLQIIPYLVNFAFADYSRSAASKARCEHCSGTGFYNVLREVVKHYRRGESVIKEEWVKELCQHCHGKGEVSTACRGCKGKGIVLDEKRTRFHGVPVYKICGRCNGNRFSRLPTTLARRHVQKLVPDLTDYQWYKGYADVIDKLVTKCWQEEAYAEAQLRKVTR >CP029164|3221809:3273222|3233800_3233992_-|AWH70809.1|DBSCAN-SWA MNSAFALVLTVFLVSGEPVDIAVSVHRTMQECMTAATEQKIPGNCYPVDKVIHQDNIEIPAGL >CP029164|3221809:3273222|3263591_3264239_+|AWH70849.1|tail|DBSCAN-SWA MAATHTLPLASPGMARICLYGDLQRFGCRIDLRVKTGAEAIRALSTQLPAFRQKLNDGWYQVRIAGRDAGETELSARLNEPLANGAVIHIVPRLAGAKSGGVFQVVLGAALIAVAWWNPVGWLGAAAVSGMYAAGASMILGGVAQMLAPKARTPTATSTDNGKQNTYFSSLDNMVAQGNVLPVLYGEMRVGSRVVSQEISTADEGDGGQVVVIGR >CP029164|3221809:3273222|3226631_3226958_+|AWH70801.1|DBSCAN-SWA MIKTTLLFFATALCEIIGCFLPWLWLKRNASIWLLLPAGISLALFVWLLTLHPAASGRVYAAYGGVYVCTALMWLRVVDGVKLSLYDWTGALIALCGMLIIVAGWGRT >CP029164|3221809:3273222|3229596_3230892_-|AWH70806.1|DBSCAN-SWA MREVEMKYPTGVENHGGKLRIWFVYKGVRVRENLGVPDTAKNRRVAGELRSSVCYAIKTGVFDYAKQFPSSRNLEKFGEARQDLTIKELAEKFLALKETEVAKTSLNTYRAVIKNILSIIGEKNLASSINKEKLLEVRKELLTGYQIPKSNYIVTQPGRSAVTVNNYMTNLNAVFQFGVDNGYLPDNPFKGISPLKESRTIPDPLSREEFIRLIDACRNQQAKNLWCVSVYTGVRPGELCALGWEDIDLKNGTMMIRRNLAKDRFTVPKTQAGTNRVIHLIKPAIDALRSQMTLTRLSKEHIIDVHLREYGRTEKQKCTFVFQPEVSARVKNYGDHFTVDSIRQMWDAAIKRAGLRHRKSYQSRHTYACWSLTAGANPAFIANQMGHADAQMVFQVYGKWMSENNNAQVALLNTQLSEFAPTMPHNEAMKN |
64 | Enterobacteria_phage(36.96%) | portal,terminase,lysis,tail | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_8 |
4030203 : 4084519
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >CP029164|4030203:4084519|DBSCAN-SWA TTTACCCTATAGGTGCTACTGGCCATTCAATATCCGGTGCAGTTGTTGTATTAACACGGTTCAGCAACACCCGATACTTCTTCCAGGCTTCCAGCAACGAGATTTCTTCCTCCGTTGCGATTTCCAGATCTACAGCATCCTGAAGTGGCGCAATATGCTCACTGGCTACCTGCATCAGGCTGTTTTTTGTTTCTTCTGCCTCCCGGATCCGGAACAGTTTTTCTGCTTCCGTATCCTTCACCCAGGCTGTGCCGTTCCACTTCTGAAACTCCCCTTCCGGCGATAACCAGGTAACATTTTCCGCTAACGGACCGAGTTCAGAAATAAATAACGCGTCGCCGGAAGCCACGTCATAGACCGTTTTACCCCGATGGTCTTCAACGAGATGCCACGATGCCTCATCACTGTTGAAAACAGCCACAAAGCCAGCCGGAATATCTGGCGGTGCAATATCGGTACTGTTTGCTGGCAGACCTGTATGAGGCGGAATATATGCGTCACCTTCACCAATAAATTCATTAGTTCCGGCCAGCAGATTATAAATTGTTATGGTCCGTGGTTGTTCACTCATTCTGAATGCCATTATGCAAGCCTCACAATATAGTTAAATGCGATGTTTTTGACGGTGTTTTCCGCGTTACCAGCAGCGTTAACGGTGATGGTGTGTCCATGTGAGCCAATCGCAACCGAGTGCGTATGAGCACCAATACCGACAGTATGTGCGTGTGCACCTGCGCTTGCAGCAGTGCCGGACAGTGAGTGGGTATGAGCACCATCTGATGATGTCTTCCCTGCATTACGAGTCTGGCCACTACCGCTTGTTGTGCTCATAATCCCCGCGCTTGGATTTGAAATCGCGGTATAACCATTAGGGAAAATGCTCGTGTTCGTGCCACCAAATGCACCGGAACTCTTGTGTTGGTGCGCACCGGCACTATTTGCGGTCCCGCTAATACTATGGGTATGCGCCCCGGTGTTATTCGTGGATTTGGTTCCGTAATCAAACGACGATGTGGTTTTCGTCCCCAAATCCGTACTGGATGCGCTGGCGCTGTGGGTATGCGATTTAATGCCGTCCTGTTCCTGAGACAATACGGCCCGACCACTGGCAGGTTTGCCCTTAATCGTCCAGCCACGCATATCAGGGATCACGCCTGACGGATAAGCCGCTGCAAGTTTCGGGTAGGCAGATTTGTCAAAAGCCTGCCCCTGCATCAGGGCATAGCCAGACGGAACGGTATCTGATGGCCACGGGATTGGTGCGCCGACTGGGTAGCTTTCTGGTGGAAGATTTTTCGAGGTATAAACTTCTGCCCAGTCTTCCTCAAAACCATAGCCATCTCTTGAGGAACGGTAGAACAGACCTCCATTTCTGTAATGCGCCTTCATCTGCAGGGTCCGGCAACTTCCGACTCCGGTATAGAAGTTAACCAGAATATAGCTGTCGCCAGAGCGGGTGACATTGTAAGCGCCTGATTCGGCATTCCAGGGAACGCCACCATCCGCATCGGCATATGTATCCGTTGCCCTTCTGGCAAAAGCAGCCACATGCGCGGCGGTTAAAGTAATATCTTTGGAACCATCAAACTCAACACCAGAAACCAGTCTTGGCGTTTGCAGCTTTGTTGCTGTTAATGCATTACCGTTCAGACTTGCGGACAGTTTGGTTCCAATAATCAGTTCGCCGGTTGCGTTATCAATAGCAAACGGTCTTAATGTATTCCAGCCACCATAAACATCACCTTGATTGGTAAGCAGCAGGTAAGTTTTAGCGCCATCATTACGCCATAATGCACCATACTCCCCACCTATCATTCGAATCTGATTACCACCACGCGCTACAATTTCGTCTGTGGCAAAAAGTTTTTTGCACGACAAGTTATCGTTAACGATTAACGAATGAGACTCATAAAAACCACGCCCACTCTTAAAATCAAGGATAACGTCCGCCGCGATACATTCAGTCGCCGGATTTGTTGCCCCAAACTTATAGGTCGTATCATTAACAACGAGATCAGCACCAGGTGCGGATATTGACAGGCCATCTTCGATAAACGCAAAAACAGGGAAAGCAGCGCCATCAACATAGAACACAGAGCGCAAATCATCGCCCTTATTACTCATCATTATTGAGTGGATGGCTCGTTCATTGTTTTGATATTGCCAGAACATTCCATAAGCATAACGCCCCCTGTCAGTCCAGCCACCAGGCATAACAAATCCGTTAAACTCGCAGTTATTCATCGGATCGCCTGCGGTTCGCGTTGCCGTGGTGATAATGACCCTTGATGCCAGTTCGCTTACTGAGCCAGCAGAACGCATAACAACAACAGGGTAATATTTTCCAGATGTTGCACCTGCAGGAGCGTTAACCCGCACATAACGCATACCACGCTTATCAGCAAAGTCTGTTTTACTGACCGCGTTAATGTTGTTCAGGAAGCGTCCCTTATCGGGTATATCAGCGCCGTTCTGGTCTTTCTGCAGACGTTTCTCTGCATTGTCATAGGCTGATTTTACTGCCTTTGGCGTTGCCGCCAGCGTTTCAGACGTACTGTTGGTCGCACTGCTGAGCTGTACTACCCCCTTTTTCGTCGTGCTCGCATCCTCAAGCGCCACGGCGGATGCAATATCCTCTGCCCGTTTTGCCGCTGTCTCGGCGCGCGTTGCCGCGGATTCCGCCGTACTTTTGCTCTGTGCTGCCGCTGCCGCACTGCCAGCTGCCTCTGTCGCCTTCGTGGATGCCGTCGTGGCGCTGCTCTTCGCTGCTGACGCCTGTCTGGTCGCCTCATCTTTTGAAGCAGACGCAGATGATGCCGATGACGCTGCCGAACTGGCGGACGATGCGGCTGCCGTTTTTGAGGATTCTGCACGGGTTTCCGACGCTTTCGCATTCGTTTCGGATGTCTTCGCTGCGGAAGCAGACCTCGCTGCTGCGCTGGCCTGTACAGCGGCTTCGCCAGCCTTCGTTGTGGCTGTTGAAGCGGACGATGCAGCACTTTCTGCCGATTTTCCGGCGGCGGTGGCACTGGCTGAGGCCTGCCCGGCACTTGTTGACGCGGCACTGGCAGACGACGCAGCCGCTGTTTTTGAGCCTGCCGCAGCTGAGGCACTCTGTCCCGCTGCCGTTTCAGAAGCCTTGGCGTTCGTCTCGGACGTTTTTGCCGCCTTCGCAGAATTTGCTGTCGCCGTTGCCGAGGAAGCTGCGCTGCTGGCGCTCGAGGCTGCGCTCGTTTCTGATGATTTTGCCGCCTCTTTTGAAGCCGACGCATCCCGGGCTGAGGTGGCAGCTTCTGACGCTTTCGTGGTCGCGGTGGATGCTGAAGTGGCTGCTGATTGTTGTGACGCTGCCGCATTCGTTTCTGACGTTTTTGCCGCAGCGGCACTGGTGGCCGCCGCGCTTTTTGAGGACTCTGCAGCGGCAGCACTTTTTGATGCTTCAGTGGCCTTTGCTGATGCCGTTCCTGCGCTGGAAGACGCTGACTGAGCCGACGAAGCGGCCTGTCCGGCTGACGTGCTGGCTGCGCGTGCTGAGCCTGCAGCATCAGTCGCATGGGTTGCCGCCTCACGGGCAGATGTGCCGGCATCGCCGGCTGACTTCTTCGCGGCTGCCGTGTTCTGTGCCACCGCGGACGCGTTACGCGCCACCTCTTCCACCATCAGTTCAAAACGGCGCAGAGCCTCAGGACGGGCATCATCCTCTGTCATGGCACCGAGAAAATCATTCAGCGTACCGGGTTGAGAATCTTCATACACGGTAATGGTCCCGGCATGTGATGGCGGGAATCCCTCCACCAACAGACTGACGCTGTACTGACCATACTCGACGTCCATTGTGTAACGCCCGGCTTCATCCGGGTTTTCTGAGGCCACTGTGTTCACCACCACCGTGGTGCTGTTGCGCCTGGCCTTTAGCTGAATGGTGCAGTTTTGTATCGGCTTACCTGCGCCATCTTTCAGTACACCTGAAATCCGTACTGCCATATTCCCCCCACAAAAAAGCCCGCCTGAACCGGCGGGCTGTCATAACACTGTGTTACCTGGCTAATCAGAATTTATAACCGACACCAACGATGAAACTGTTGGTACGCCAGTCACCGCTGCCGGAGCCTTCATAAGCAATATCAATGGCCACGGATTCGGTCGGGTTAAACTGCACGCCAGCTCCCCACGCCAGAGACGTGTTGCTGTGGCGACCGTCATCACTTCCGGTCAGCACGTCGTGCGTTTTCCCCTTGTTGTCAGTTACGCGGAGATAATCCCCGGAGAAAGTCGACACACGGCTGTAAGTCACACCCGCCATCGCATACGCGCTGAACCATTCATTCACGCGCACAGACGGCCCCGCCATCACGCTGAACCAGCGGTTACGAACGGAATCTTCATGCCAGCGGGTATCGCTGTAACGGGTAAGCTGGCGATTCTTGTCTCCTGCATAGCTGAATGACGTCACCATCCCCAGTGTGTCCGTAAACTCATAACGGTATTTCACGTTAATCCCGTTCAGATCATCGCTGCCGGGAACGTTCGTCGAGGCATGAAGATACCCCGCGCTCAGCGTGGACTGATGTTCAGATGCCCATGCAGGCGCACCGGATACGGCCAGACAAATGGCTGCGGACAAAATTGCTGCACAAACTTTACGCATAATTACCTCTCGCTTTTCTGCAATAAAAAAGGCGTCATTCCTGACGCCCTTTATTGGGGTTATAACAATTTCAACGAATACTGATGCCGGAAGCGGCTTTTTTGGTCACAATCACCGTACAGTCGGTGATATTACCTGCCCACTGATTGCCTTTATGGAAAACCTTAAACTCCAGAGTGACGCTTCCCCTGCCACTCGGCATATCAATAACCGCACTGTAGCTACCGGGAATGGCCCCTTTAGTTTCTCTGGATGCGATTAATACACCGTTTTTGCGAACTTCAAAACCATAACCCGTGTATCTTGTACCTCCCGGGTTATTACCACTTCCCGGATCGCTATACGCTATTCCGTTAAAGATAATGGGCGGAATAATGATTTGACGGTCAAAGTTATGATCATCGCTGATGGTGACTGTAACCGTCCCGTTTGGTGTTTCCGTATTACCCCACGTACCAGCCTGTTTCGGGAATGATTTGGATACAGCTTTAACGAAGTCACCTCTGACCTGAGTCGCCTCCAGCATGCCCTTAATCGTACAGTTTTCATTTACCGTGACATTGTTGAGCGTCCCGGCGTTCGCATTCACACTGCCACTGATATCCGCATTTTTAGCGGTCAGCTTTCCGTCCGGTGTCAGGGAAAATGCCGGTGGATTTCCACCGCTGGTAATGGTGGGGGCCGTCAGGCGTTTCAGGAACACGTCGTTCATGAATATCTGATCGCCCTGACCAACAAACATCGGTTTTGTGTTGCCATTCGCAGGATTAACCATCGCAATCCTGTCCGCCGCCAGCAGCACCTGACTCTGCATGCCGTCAGGGGTGTTCTCAATACCGGCACCAATACCCGCGATATAAAGGCGTCCGTCCTGCATCTGCTGCAGCTTCACAGCCCACATGCTGTTCAGGTTATTATTTGTATCAACCTGAACTTTCTGTATCTGCTGGATTGCCGCACTCTGGTCTTCCAGTTTTTTATTGACGGTCTGCGTGATTTCATTGCTGACATTCGTAATGGACGTCCTGATTTCAGCCAGGTCCGGCGCAAGCTGACCGTTATCAATCTGCGTCCACAGCTCCTGGGCCAGATGTGTTTTCCCGATTTCTCCTTTGAAAAAATCCAGGTAACCTTCCGCATCATCGCTCGCCCGACCGACAGCCTCCACGAATGCCGATTTGCCAACAGTGTTCACACTGCGAACGTAAAAATAATAATCATGACCCGGCTTGATATTGATACTGGCAGCTATCCAGTACAGCGCCGTGCCAAGATAGCGGGCTGTAGTTTCAACCTGCCTGATATCGGTAATCCGCTTTTCCGAGAACCAGAACTCAAACTGTACCGTCGGGTCATAAACGGCAAGATGCGGCGTTGCGGTTATCTGAAAATAGCCCGGCGTCAGTTCAATCCGCGACGGCGTCGCCGGTGCGGCAATCCGGAACAATACCGACGCCGGATCGCCCTGCTGTCCCCACGCATTTACCGCCCGGACTGTCAGCCTGTAGTTTCCCAGCGCCAGCTGCCTGAAGCGGTATGTGGTTTCCGTCGTCCGGGCTGTGCTGACCAGCCGCTCACTGCCATCGTCCGCTGTCACGGTCAGGCGAAGCATAAAGCTCACCCCCTTCACCACCTTCGGTGTGTCCCAGCGCGCCAGCACCTGATATTCCCCGCTGTCTGCGGAGACTTCGGCTGTCAGGTGCTGTACCGCTGGCGGCGTGACACCATTCACCGTGCCGCTCAAATCGCCGTCAAAGTGCGCCCCGTTATCCACGATGGCCTCTTTTTCCGGTACATGCTGCACGGCAGTGATGGCATACGTGCCGTCATCGTTCTCACGGATACTCACACAGCGGAACAGGCGCTGGCGCAACGTCGGCAACTTCAGCCCCCATACGCTGTATCCGGCAACGCCGTCAGGAACACGGCTCACTTTTACCTTCACGCCGTCGGTGACGGACTGGACCTCCACGCTGACCGGATTGCCACTTCCGTCAACCAGGCTTATCAGCGTGGTGCCGGAAGATGGCAGCGTGATTTCACGGTCGAGCGTCAGCGTCCGGGTCTGGCTGTTCACCGCCAGCACGCGCCCGCCGGTGCTGATACCGGCATAGTCATCATCGCAGATTTCAATGACATCGCCCGGTACATGGCGAAGCCCTTCAGCACCCACGCTGAAGTCCACGGTCTGCGTTTCCAGCAGCTCCGTTTTAATCAGCCACAGCCCGGCTCGGTGTGCCTGCCCCCGGCTGGTACAGCCAAAGGCATCCATCTTCGTGACGTTACGACCATAACGGGCAATGGCCTGCGTGTCCTCCACAAGCTCTGTCGCCGTCTCCCAGCCGTTATCCGGGTCAATCCAGTTCACCTCAACGGCATTATGGCGGTCCTTGAGGGCGCTGAAGCTGTAGCGGAACGGCGCGCCATCATCCGGCATCACCACATTACTGCGGTTATAGGTCCACACCTTATCCGACGGTCGGTCCTGCACGAACGTCAGCGTCTGCCCGTTCCATACCGGCATACAGCGCATCGCCGAGCAGAAATCACTGAGCACATCCCACGCCTTGCGCTGTGTGGTCAGGTACGCATTACAGGTGATGCGCGGCTCCGTGCCGCCAAAGCCGTCCGGCACTGACTGGTCGCAGTACTGGCCGATGACATACAGCGCCCATTTGTCCACATCCGCCGCACCAAGACGTTTCCCCATGCCGTAGCGTGGGTGGGTCAGCATATCCCACAGACACCAGGCCATGTTATTGCTGTATGCTGGCTTAAACGTTCCGTCCCAGATACCGCTGTATTGCCGCGTCTGCGGGTTATAATTCGACGGCACCTGCAGAATACGCCCGCGCAGATGATAATTACGGCTCACCTGCTGGCTGCCGAACTGCTCCGAGTCCACCTGCACACCGACCAGTGCCGTGTTCGGGTAGCACTGTTTCACATCGATGATTTCGGTGTATGACGACCAGAGCGTTTTGTTCTGCAGCTGGTCTGTGGTGCTGTCCGGCGTCATCCTGCGCATCCGTATACTGAACGGGCGCGGCGGCAGGTTATCCACCACCACCGAGGCCAGATACTGCGAGGTGGTTTTGCCCTTAATGGTGATGTCTTTTTCCGTCACCCAGCCACCGTTACGCTGTATCTGAACCAGCAGGCGGACTTCCGACGGATTCCTGTCACCCTTTGAGGTGGTTTCCACCAGTGCCTGTACACCGAAGGTAAAGCGCAGACGGTCGATGTTTGCCGACGTGATGGTGCGGGTGATCGGCGTGTCATATTTCACTTCCGTACCCAGCACCGTCTCGGAGCCGGAGGATTCAAATCCCTCCGGCGGTGACTGCTCCTGCTCACCGGCCCGGAACACCACCGTGACGCCGGATATATTGGTATTCCCCTCACTGTCCAGCACCGGCGTACTGTTCAGCAGCACGCTTTTTAATCCATCCACCGGACCTTCAACCGGCCCTTCGCTGATGGCATCGATCACACTCAGCAGCTGCGTGGACTTCAGGTTGTCCTTCGCTTCGCGCGGGGTATGCCCCTTACTGCTGCCTTTACCCATTCGTCATGCTCCATAAACGATAAAACCGCCCGGAGGCGGTTTCACATAAAACGTTTTTCATCAGCGACCAATCACCACAACCTGACCACCATCACCTTCATCTGCTGTGCTGATCTCCTGAGATACCACACGTGATCCCACGCGCATTTCACCGTACAGAACGGGCAGAACATTGCCCTGGGCAACCATATTATCCAGTGAAGAAAAATAGGTGTTCTGTTTGCCGTTATCCGTTGTCTGTGTGCTGGGGGTTTTGGGTTTAGGGGCCAGCATCTGTGCAACACCGCCAAGCGTCATACTGGCACCGAGAGAAAACAGCAGATTACTCGCCATAATTCCTACCCCCGGCATCCATATAGCAACCGCCATAACAGCCGCCCCCAGCACAGCCTGAAACACACCGCCACTTTTGGCTCCTGCCAGACGCGGCACGATATGGATCACAGCACCATTTGCCAGCGGTTCATTAAGACGGGCTGATAATTCCGTTTCACCTGTATCACGCCCGGCAATCCGTACCTGATACCAGCCGTCGCTCAGTTTCTGACGAAACGCCGGGAGCTGTGTGGCCAGCGCCCGGATGGCTTCAGCCCCCGTTTTCACACGAAGGTCGATGCGGCGACCAAATCGTTGTAAATCCCCGTAAAGGCAGATGCGCGCCATGCCCGGTGACGCCAGAGGGAGTGTGTGCGTCGCTGCCATTTGTCGGTATACCTCTCTCGTTTGCTCAGTTGTTCAGGAATATGGTGCAGCAGCTCGCCATCACCACAGTAAATGGCGGCATGATTCGGCACCGATGAACCAAAACAGCACAGCAGCACATCGCCCGGTTGTGCTGATGACAACGGCACCTGATACAACCCTGTGGCCTCCAGATTATCCAGATAGAGATTCTGACCGTGACGCCACCAGTCATCCTCGCGATGAAAATCCGGCATCTCAATCCCCGCCAGATGATAAGCATCCCGGAACAGCGTGTAACAGTCCGTCACCCCGTGCTCAAAGCGCCGCCCGGTGAGATGCGGCACACAGCGGAACTTATGAATCGTCCCCCGGCAGACCAGCCACCACGGCAAATCACTCTGCACCTGCAGCCGCCGGTCGGCCTCACTCAGCCAGGGCAGACCACCGGGGTGGCTGTGGACCAGCGCCACAATCTCACCCTGCATTTCTGCCTGCAGCCAGTCTTCCGGCGACATACGGAAATACGCCTCCGGCTCACCGGAGATATTCACGCAGGGGAAATATCTTTCCCCCTCCGGCGTGCTTACCACGAAGCCGCACGACTCCGCTGGCGCACATCGCCGGGCGTGCGCCAGAATCGCTGATTCTGTCTGTGTCATGGGATTTACTGCGAAAGTTTGTTAATGGAAAGGAAGCCGCCAAAGTTGCCGACGTTATTGCGAAACTTACAGCCACTCAGGCATTTGCTGCATTTATCCTTCGTGATATCGGACGTTGGCTGGTCATATTCATCCGCGACCGCCGGACCGTGATAACCGCACTCATCGCCGCGATAGGTCCAGGTGCAGGTGTTGGCCAGCATGATACGTCCCGGAAAAACAGCGCCATCCGTTTCCGTCGGCGTGGACAGTACAAAGGAGGCACTGACCGCGCTCAGTTCGCTGCACTGCTCGATGCGCCAGCGGCTGATCACCTCCTGCTCCGGATCGGCGTCGCTGTTTCCGTTGACGAAGTTCACCGCATCCAGAAAACGGGCGTAAACCTTACGCCGGACCACCGTTCCGCCGACCAGACTCTGCATATCTTCCGCCATCCCGGTGACCATACCGTACAGGTTAGAAACCGTCAGCGTGGGGCGTGTACTGGTGCCTTTGCCATTCAGTTCAAAACCGCTCCCCTGAATGGGATACGGCTGATACTGTCGCCCCTGCCAGGTGACCGGCTCACCTTTTTCGTTCTGCTCATTACAGAAAAAATAACGTTCTCCACCGACCTCTGTCAGGTCGATTTCCCAGAGCACCACGCTGGCCGACTGCTCCGCACGGGTGCATTCATTCAGTGTTTCCTGCCGGATATCCTGCATCAGTTCACCACCTGTTCAAACTCTGCGCTGAACTCAACACGCAGCATACTGACCCGCGACGACCATTTTGCGCAGGTCACCTTTATCTGCCGCCACTCATAAGGCGGCGTCCACAGAAAGGCTTTCCAGCCCCCGTGCTCTTCCAGAAACGACTCCAGTACCGTGGCCTCCTCACGGGGGACAGAAAGCGTCACGCTGTACGTTTTCAGGTTGGCATTCAGCCCGGCAGGCGCTCGCTGGGAATAGCCATCACCAAAGCGCACCTTTCTTACAGAAGGGGCCGAAGCCACATCCATACCGGGTTTCACTTTCCAGCGGAAGGTTTTCATCGTCCACCTCCGGAGAACAGGCCACCATCACGCATCTGTGTCTGAATTTCATCACGGGCACCCTTGCGGGCCATGTCATACACTGCCTTCATCATCTGTGGACCTGGCAGACCATTCGTACCGTCGTTCTGAATCACCACGTGATTGTTCTGATTAAAATTAATGCCTTCAGCCCGCCGCATCTGCGCCGGACTTCCGGCACCGCCCACATAACCACCTTCCGCATAGCCCCGCATCAGGCGGTACAGGTTGCCGACACCAATCCGGCTGGTTGCCTCCTTCGTGAAGACAAATTCACCACGGTGAACAATCCCCGCTGGCTCATATTTGCCGCCGGTTCCCGTAAATCCCCCGGTCGCAAAATGGAATTTCGCCGCAGCTGCCTGAATGGCTGTACCGCCTGACGCGGATGCGCCGCCACCAACAGCCCCGCCAATGGCGCTGCCGATACTCCCGACAATCCCCACCATTGCCTGCTTAAGCAGAATTTCTGTCATCATGGACAGCACGGAACGGGTGAAGCTGCGCCAGTTCTGCTCACTGCCGGTCAGCATCGCCGCCATATTCTGTGCAATACCATCAAAGGTCTGCGTGGCAGCACTTTTAACCTGCGACATACTGTCCGTGGCGCTCTCTTCCCACTCACTCCAGCCGGACTTCAGGCCTGCCATCCAGCTCCCGCGAAGCTGGTCTTCAGCCGCCCAGGTCTTTTTCTGCTCTGACATGACGTTATTCAGCGCCAGCGGATTATCGCCATACTGTTCCTTCAGGCGCTGTTCCGTGGCTTCCCGTTCTGCCTGCCGGTCAGTCAGCCCCCGGCTTTTCGCATCAATGGCGGCCCGTTTTGCCCGTTGCTGCTGTGCGAATTTATCCGCCTGCTGCGCCAGCGCGTTCAGGCGCTCCTGATACGTAACCTTGTCGCCAAGTGCAGCCAGCTGGCGTTTGTACTCCAGCGTCTCATCTTTATGCGCCAGCAGGGATTTCTCCTGTGCAGACAGCTGGCGACGTTGCGCCGCCTCCTCCAGTACCGCGAACTGACTCTCCGCCTTCCACAAATCCCGGCGCTGCTGGCTGATTTTCTCATTTGCTCCGGCATGCCTCTCCAGCGTCCGGAGTTCAGCCTGAAGCGTCAGCAGGGCAGCATGAGCACTGTCTTCCTGACGATCGCCCGCAGACACCTTCACGCCGGACTGTTTCGGCTTTTTCAGCGTCGCTTCATAGTCCTTTTTCGCCGCCGCCATCAGCGTGTTGTAATCTGCCTGCAGAATTTTCCCGTCCTTCAGTGCCTTGTTCAGTTCTTCCTGACGGGCGGTATATTTCTCCAGCGGCGTCTGCAGCCGTTCGTAAGCCTTCTGCGCCTCTTCGGTATATTTCAGCCGTGACGCTTCAGTATCGCTCTGCTGCTGCGCATTTGTGTCCTGTTGACTCTGCTGTTCAGCCTTCTTTCTCGCGGCTTCAAGCGCAAGACGGGCCTTTTCACGATCATCCCAGTAACGCGCCCGCGCTTCATCGTTAACAAAATAATCATCCTTGCGCAGACTCCAGATGTCGTCCGCTTTCTTAAACGCAGCCTCTGCCTTAATCAGCATCTCCTGCGCGGTATCAGGACGACCAATATCCAGCACCGCATCCCACATGGATTTGAATGCCCGCGCTGTCCTGTCTGCCCAGGTCTCCAGCGTACCCATGTTCTCTTTCAGGCGGCGGGTCTGGTCATCAAACCCTTTCGTCGCGGCCTCGTTCGCCGCCTGCAATGCCCCGGCTTCATCGCCGGAACGCTGCAACTGAGCAACATACGCAATCTGCTCCGCCGTCACGTTATGGAACTGGCGTGCCATCGCCGTCAACCCCGACGTCGGGTCTGTGGTCAGCTTCCCGAAGGCTTCAGCGACCTTGTCCACCTCCACACCGGATGCAGAGGAGAAACGCGCCACACTCTGGCTGATGGACGCAATCTGAGCCTCACCGCTTACTCCCGCCTTAACCAGTGCGCTGAGTGACTCGCTGGTCTGGTTAAACGTCAGCCCTGCCACCTGCCCGGCTCTGGACAGGACCAGCATGCGATCTGCCGTCAGACCCGACTGATTGCCGGAAAGGACCAGCGTTTTGTTGAAATCGGACAGGGTTGAGTTGCCCTGATACCAGGCATACGCCAGCGCACCGGTCGCCACCGCCAGCGAGGTGGCCCCCACCATCGGCAGGTTGATCGCACCGGCAAGCCCCCTGAACATGGGGATCATCCCGCCGAAGGAGTCCTTAACCTGCCCCCCCTGTTGCAGCAGGATCAGCCACGGACTTTGCCCGCCTGCAAGCTGCGTGGCCACGTCGGTGAACTGTGCAGGCAGCATACGCATGGCGGCTTTATACTGTCCGACGGAAATCCCCGCTTTCTGTGCAGCCAGCGCCTGCCGGTTCATTGACTGTTCAACGACTGCCGCTGTTTTTTTCGCATCACTTTCCGTACCGGAAAAATGACGCCTGACTCTGGCCATCTGCTCGTCAAATCTGGCCGCATCCAGACTTAAATCAACGACCAGATCGCCTACCGGTTCAGCCATACCGGACTCCTCCTGCGATCCCTTCTGATACTGTCATCAGCATTACGTCATCCTCCGTCATGTCCGCCACATCCGGGGAAGCGGGGATAACTTCATTCCCGTCCGGGCCAAAACGAACGCCTCCGGCAAGCCCTGCCGCTTTCTGCATCAGCACATCATCTTCAGGCTCTTCGTCAGCCTCGCGCCGGTTCAGCAGACTGAAATCCAGCGGATGCATCTCCGGATCGCTGAAAAACAGGCTGAGCACGGTGTACGTCAGCCCGGAAAAGTGCATATCCAGCAGAACATCATGAAAATAATGGGTACTGTAAAAGCGGTGCCAGTCGGCATACTCCGTGGATGACATCCCGGCAAGCATGGCGCGCCAGTCGGGTCGCCCCATCTCACGCGCCAGTTTCAGGGCAAAACTCAGCTCACCGTCGAACACTTTCCCGCAGAAACAGGCACTGCAGGCCCGGCGTCCTCTGCCTGTTCAGGAGCATCATTCACCACAAACTCATACATACCGGACAGCCGGTACACCACGTTTTCAGCATGAGAAATTGCCTCTGTGGGCCAGGTGGTAAGCACTTCCTGCTCAATCTGTTTAACGGCTTCATTCATGGAAGGCAGCTTTGTCTTCTGCGGATGGTTATGCCACAGGGACATCGCTACCACAAAAGCACCGGTTCTGATGGCGTCTTCCACAGTAAACTTCCGGTTGCTGTCTGACTCCGCCTGTTCTGCCTGCCGTTTCATCAGGTCGAGATGCTCAATACGCTGCAGGGCTGACAGTTCAGAAAGCGTGACGGTCACACCGTTATGTTCAAATGATTCGGTTTTCAGGAACATCGCTGACTCTCCGGATTAACTGGCGGTGACGTTGATTTCTGCAACCGCAGCAAACTCACCATTACCAGATACGACCGGAATGTTGACCTTGCCTGCAGCAACACCTTTCACGGTGATGGTCATACCACTGACCGACACGGTGGCTTTTGTTTTATCCGCAGACACCGCACGGAAGCTCTTGTCGGTTGCGCCATCCGGCTGGAATGCCACGGTCAGCGTGGTGCTCTGCCCTTTCACTACGGAAGCACTGGCGGGTGTCACCGTCATGCCGGTCGTCGCCGTCACCGTGCTGCGATCTTCTGCCATCGACGGACGGCCCACATTGGTGACCTTCACCGTGCGGGTAATCACTTCCTTCGCCGTCACCGCCTTACCGATACTGCTGACCCAGCCACGGAACACATCGACCGTGCCGTTCGGGAAGCGGATTTTATAGGCACGGGTATCACCTTCATTAAACCACGCCAGCAGCGCCTGCTGCCCCTGCTCTCCGGGCATCCACGCCAGCGTGAAGCTGGTATCTCCGGCAGATTTCTGCCCCTGCCCGGTCGCAGTCCAGTCTGCATCTTCATCATCGAGATAGCTGTCGTCATAGGACTCAGCGGTCAGTTCGCCGGGCGTCAGGTCTTTAACTTTTGCCAGACGCGACCAGTCAACGTCTGAAAGCGGGTTCGCATAAGGGTCACCGCTCCCCTTATAAACCCACAGGGTGGTTCCGGCACCTTTCACCGGTATTGCTGGATTTGGTACAGGCATATCGTCCTCACATTTCATAGGTAATGACATACGTCAGATCGGCAGAACTCCACAAGCCCGCATCATCGTCGCGCCGGTAGTCATAGCCACTGGCCACCATACTGGTGATCAAATCTGACAGTGCCGGGATATCGCTCATCACCGGATAAATCCGGGACTCCATCCACGCATCCAGCTCTGAATCCGGCACCTGAGCAGGCAGGAAAACTTCAATATGCAGCTCCGCCTGCCAGGTATCGCTGTCCAGCTCTTCGCCCGTGTATTCAGCGCCGGTGAGATAAACGGCAATTGCCGGAAAATCTTCCTCATCAAAAACAGCAGGGCGACCATCAAAAAACGTCGCCCCGGTGTCATGCTTCTCCAGTGCATCCAGTACGGCTGCACGGAGTTCAGTATGTTTCATCGCTTTATTACCATCCTCAGTTGATGCTGCAGCGCATAGCCCAGCTCTTTCGGAAGACGCTCACGCCGTATCCGCTCAATATTCTGTTTAAACGCCGTGGTCAGCGGCACCGCCATCGGGATTTTCACCACATCAATGGGGTAACGGTTTTTCCCGGCCACACGCTGCATGACATGCCAGCGGCCATTTTTCAGTTGCTGAATAAACGCGCCGGGAATACGACGGTTTCCCACCACAAGCACGCTGCCGCCACCTTTCAGGGCTGAACGCTGCCCCTTTTTACGACGCCTGCGTCGGGACAGGACAATCCGCGCGTTACCCAGCTTTATTACGGGCAAATCCCCCCGGTTAACCTTGATTCTGGCCTGCGGATTTTTGACCGTGGCCCTTTTCAGCCAGGCCCTTTCCTTTACCAGTTTCCGGCGTACCTTTGTCTCACGGGCAACCTGTGACGCCGACTGCGATATCGCGGATGAAGCAACGCGGTTAATGGCCATTGCGGCGGCACCGGGCACCGCCGTTCTGCTGATACGGCTGAGGTTTTCAACGGCCTGCTCAAGACCTTTTATGGCCATACATCCCCCTTTCAGCGGCGACGGTTAACGGCAGGCGGTACGCCCCGCCCAAGCCAGAGATGACAGCTTCCGCCATCATCCGGCGAAATCCGGTCTATCCAGAAGTTTTCCTCACCGATGGTCAGCGTGTCGCCGCGCCGCAGCTGCCGCACATCATCAGTCCGGACAAACAGGGACGGGCTGGAGCCTTCAACGCGTACGCCCTGTCCGGCATAGCTGATATTTTCAGGGTCATCAAAAACACCACGTATTACTGCGCCGGACTGCTCACCGGATGTCATGGTGGCTGACGTTCCCATGTACCCGCGTATCGTTTCATCGGCGCGGGCAATGGCAGCATCGAACAGGTTATCGAAATCAGCCACAGCGCCTCCCGTTATTGCATTCTGGCCAGGCCGCGCTCTGTCATTTCAGCTGCCACACCGGCAGAGACACGGAACGCCGTTCCCGGCAGCACAAATGCCACAGCCTCATCCCGCGTGGCGTGAAGTGCATCAGTATGCAGCGTCACCAGTGCCACAACCGTGACCAGATCAGCCGTATCAGTCACGGTATCCGGCTGCGCTGATACCACCTCATTTTCATGTCCGGTCAGCGCATTTTCCGGGCTGACAGATGTGTCCTGACCGGCAGCGTCATCCGTGTCATCAAGCTCCTCTTCCAGCTCTGCCACACGGAGCGCCAGTTCTTCTTTCGTCCCCGTCAGGCTGACATCACGGTTCAGTTGTTCACCCAGCGAGCGGAGACGGGCAATCAGTTCATCTTTCGTCATGGACTCCTCCACAGAGAAACAATGGCCCCGAAGGGCCATGATTACGCCAGTTGTACGGACACGAACTCATCAGGGTCAGCCAGCAGCATCAGCGGTGCTGACTGAATCATGGTGAACTCACGCGCCGGATCGCCGGTGGTCACCCAGTTTTTCGGGTAACGGGCAGAGGCGTTAATGCCTTCGCGCTGTGCGTCCGCATCCTGAATGCAGCCATAGGTGCGCAGACCGCGTGCATGAGTGTTACCCAGCACCATCGTGTTGTCCGGCAGGAAGTTCTTTTTGACGCCGTTTTCCACGTACTGTCCGGAATACACGACGATGGCCACATCGCCATACATCCCCTTATAGGACACCGCTTTACCCAGGTCTTTCACCGCTGTCTCCAGCTCGGAATGAGAGCCGCGACGGGTATCCAGCTTCTCCTTGACGGCGTTGAAGGAACGGAACAGCGCCCAGCCTTTCGGATCAAACACGATGATATTCACCACGCCGCTGGCGTTCAGCGCGTAGGCTTCGATATCGTCGGTCGGGTCATACGTGGACTTGTCACGCTTGCTCCACTCCGTGCCGCCGGACTGCGTGATGTTGTTCGCCGCACTGCGGCCCATATCCACCTCAACCGGATCGAAGGCTTCACCGGTCATGGTGTATTTGCCCTTGAGCACGGCAGAAACTGCCTGCATCTCTTCGACCTGAGCAATGGCCAGCTCTTCGTCACGCATGTTCTGCAGGATGATGCGACGGCGGCGGTAAGCCGGGTCCGCCAGATTCTGCGGATCTTCATCCGGCAGGCGACGCAGGGTCATCTGCGGATTCACTTCATGCTTGGGTTACATGAGTCAATCCCGAAAGAACTATTTGATTTCACTTAGCTTTGTGGTTTATATATCTACGCAACTCCTATCACTGCAATACTTTCTGCAATACAGATATGTGTAGTGAGGACTTGTCTAAAAACCATATTCACGAAGAAAAAGGATGACACGATGCAACCAGAAGATTATGAAGAAAAGCAATACGAGGATGAGCCAGAATCCTATCCAATTGATGAGTTTCAACTTACTACTACGCCCAATGATTTTAATATAATCACAATTATAAGTTTTATTAAATCAAAAGTTTTTAAAATCCCTAACTTTCAAAGGCATTATGTTTGGGATATCAAACGAGCATCAAAACTCATTGAATCTCTTTTAATAGGCCTTCCTATACCTCAAATATTTTTATATGAACAAGATAAAAATGAATTTTTAGTGATAGATGGTCAGCAGAGATTAATGACCCTTTATTACTTTGTAAATGGTGTATTCCCTAGAAAAGAGAAACGTTCTGAACTGAGAAAAATCTTTGAAGATAATGGAAACATTCCAGAAAACATTCTTCACAATGATGAATACTTCACAAAGTTCAACCTAAAACTTGATGGTCTATCAGACACCCAAAAAAACAAGTTTAATGGGAAAAACTATGAAACATTAAACGAATTTCAAACCACTTTAAATCTGGCAACCATACGAAATATGGTAATCAAACCGGTTGCACAAGATTCGGAAGATGGTGCAATGTTTGAAATATTTAACCGCCTAAATAGTGGGGGAATGAACCTTTCTCCTCAAGAAATTCGTATGAGTTTATATCACTCAGACTTTCTTTCAAATCTTGTTTCATTGAATGAAAACAAGACATGGAGAAAAATTCTTTCAAAAAATGTTGTTGACATGCGATTGAGTGATATTGAAGCGATATTGCGCACATTTGCTATGTCCCTTTTTACATCTCAATACAAAAGCTCAGTTAGCGGTTTTTTGAACAATTTCTCAAATTATGCAAAAAACTACGACACTAAAGACATAATTTTATTTAGTAATATATGGAATGAATTTATGGATAGTGTTGATGGAATTGATGAAATCAATTTCAGAACTGGTGGGAATCGTATGAGTATAACTTTATTTGAGTCAATTTTTTATGCTGCAACTTATGACTCATTTAAAGATAAAGATCTAAAAATAAGACAAGTGACGGTGAATTACATCGATAAGCTTAAAAATGATCCTGAATTTCTGACATTTAGTACTGATAAAACAACAAGAAGAGAGCATGTAATTGGACGTCTAGAACGTGCAAGAACAATTTTGGAGGGAATGTAAAATAATGGAGGATATGGGATATACTTCTATTGAAAGTATGTTTAATAACTATAAGTGTTATTATGATTTTCTTCTAACTCATAATGAGATTAGTTTTGCTAATGATTACAAATCACAATTTTCAAAAGTAATGCTATTAGCATGTGCTAGCTATTTTGAAACTTTAGTTGTAACTAAGATACATTGTATGCTTAACCCAAGCCAATGTAATCTTACACACGATTTTATCGATAATAAGGCCCTAACCAGGCAGTATCATACACTCTTTGATTGGAAAAAAAGGAATGCAAACCAGTTTTTTTCCTTTTTTGGCCCAAAATTTAAAGAGTTTATGATTGAGAAGGTAAAATCAAGCACAGAGTTGACCAAGTCTATCTCTGATTTTATGGAAATCGGAGAGCTAAGAAATAAATTAGCACATAATAATTATGCTACTTTTGTATTAGAAAGCACAGCTGAAGAGATTTATAATAAATTTTTAAATGCACATTCATTTGTCTCTCAACTAGATACGTTCAGTACACAGTTTAGAGAGCAAATTGGTGAACAGTAATAGTTTATCTCCAGCAAAAAAAATAACTTGCAAATAATAAAAGCGGTCGCGTGACCGCTTTTTTCTATTTATCATTGCTATTTTCCTTTATTATTTCAGCAATTGACATAACTAAGGCGTCTGCTTTATTTCTAGTCCTATCAAACATATCTCTAATTACATCAATATAATAATGTTCCTCAAAAATATTTACGTCTGACAAAAGTATTGCACAACTCTTGCCCAAAATGTCAGCCAGTTGATCTTTGTTGACTTCCATTGTTCATTCCACGGACAAAAACAGAGAAAGGAAACGACAGAGGCCAAAATGCCCGTTTTCAGCGCCTGTCATTTCCTTTCTTTTCAGGGGGTATTTTAAATAAAAACATAAAGTTACGGCGAAGAAGAACGGAAATGCCTTAAACCGGAAAATTTCCATAAATAGAGAAAAACTGCGCGCCTGACGCCCCGTAGCCTGTCAGATCGCCGGAAAGGACCCGCCAGCCAGAGCGGGCCCTAATTTCATCAACCAATCAGCTTATAGCGACCATCCCGTGCATTGCGGCGTACACGCTCAATCTTGAGGCATAGCGCCGCATCTGGCTTTTTTGGGACAGGTACGCGGCAATATTCAGAAGCGCGAGGAATATTATTTATCCAGTCGATCACTTCACTTAAATACCAGGCCTTACGCCCTTCCGTAACCTGCACACGCTCCGGGAACTCTCCACTAGCCTCAAGGTTTAGCAGTGTACGCCGACTCAGGGTTGTAATTTCCATCACCTGATTCATATCAACAAGGCGCTCGCTTAAACACATTTTGTCAGCGATAGCTTTTAATTCCTCTACAGCTGGATTCGGGTACATCATTTCGGCAATTGGCTTAAGGTCATTGTAATCATTCTGCATTGTATCCCCCTTTACACACGAGCCAGCGGCTGAACAGAAATACCTGAGCCAACAAACGCGGCAACCTTTGCCGACAGTTCTCTTACAGACTCAGGCCAGTTCAGAGCATCAACATTTAAAACACCTGTCTTATAAACCTGTGCCTGTGTTTTTTTCGCTGTGTCGATTTGTACAGCGGAAACATAAACCGCTTTGCCTGCGCTCGTTCCATCCCATACCACTAGCGCACCAGTTGCATCTTCCTGCATCAGTGGCGTAAACGCAGGTATTACCCCTTTATTGGCTGAAAATATCCCCAGCGTAGTAACCAGTGCTTCAGTGCCAGACATGAGTTCAGTGTAATGAGTAGCCATTGCTCCCCCTTAGCCAATGCGAACGGTAACAAAACGATTGATGCGGGCCGGTATTGGCTGTGGTGCTGAATGTGTCTGCACATATTCAATAGCCGGATCACCAGGCACAATATAGTTTTTCGGTGCAAGTTCGGCTTTAGTCAGCCCCATTCGGATTAGCTCCGGATCCTGAATACCGCCATAGGCGACAATCCCCTGAAGAGCCGTATTGCCAAGCACCATCAAATCAGGATCAAGGAAATGTTTTTCAGTTCCGTCCTCGTCGGTATAACGCCCGCTGTATACAACAATCGCAACATCGCCCATATACCCTTTAAAACTCACCGAATCACCAAGGTCTTTAAGGGCCGTTTCCAGTTCGGAATTAGAACCACGACGGGTATCCAGAGCCTCTTTTATCGCTCTGAATGAACGGTATTTCTTCCATACATTACCGCCCATAATGATGATATTAGTGACGCCCTCACTAAATTCTGCGTAGCTCTCAATATCATCATTTGGATCAAAAGTTTCTTTATCCTTACCTGACCACTCAGCACCGCCAGACTGAGTGATGATATTTTGTGGTTTAATATTCCAGTCCAGCTCATAACGTTCAATACCATCGCCCTCAATGATATTTTTCCCCGTTGTGATTGCCTGAACGGCAAGCCATTCAATACGTGCACGAATAGCTTTAGCCTGATTTACAATCGCCTGTTTAACTTTAATATTACGCGCTCCAAAAGCATTGTATTGCTCAGGTGATACACCAGCAGGGCGCACAGCTAACTTATTTGGATCAATGCTGCTTTTCGGCTTCATATAGCCTGGACGAATTGTTTTTGATTCGTACCCTTCGTCACGTGAAACTTTACTACCCACCATAGGAGAACAAAACGCTGCAATTGGGATATTTGGATCGTCGATTGTATCAAGAATAATATCGCGCGATTCAAACATTACCGAGCGAGTAAAAAACAAACTGGTAAACAACGCATTTAGTTGTTTTTGTACATCTACAGCATTAACCACCTGTACAAGCTGGGTAGGCGAATATAAATCAACCATACGCATCCTCTTTGCATTCATTAAAAATAATTGTGGATATATGCTATCACCGATATTTGTCATGCGAATACATGCAACCGAGTGCAATGTTGTATAACGTTTTGGGATGACAACTTCAGCACGGATAATTAGTGTTAATATCTTCACTCCCTTTGGTCTGGATTTATGTAGCATGCCGGAAAATTTATTTTTTTTCCGGCCTTTTTTATTGGCAATATTTAAAACGGAATATCATCTCCCCATTGCTCATTATCTCCCACTGGTGGATGGCTTCCTTGCTGATCTGCCTGTTGTTTTGCTCTGTTCAGTGCGTCAGTAGCCTGACCCTGTTGATCTTTTTTGCCGCCCGGTCGCACCGTTCGCGCACTGATTACGCTGTCTGCGATAACCTGCCAGCCCTGCCGCGTTTCACCGTTCTGTCCAGTCCACTGGCTCACCTGCATATTACCCGCCACGCTCAGGAGTTCGCCTTTGTGATGCTTTGCCAGCGTGTCGGCTTGTCTGCCAAACGCCAGCACAGATAACCACATCGTCGCCGTTCCGTCATCTGCCTGGCTGCATGGCAAAGATACCGCCATCCATGCCAGTGTCATGGGTGTTCCCTTGCTGGTATGTTTTACCTGTGGGTCGTCCACCAGCCGCCCGTAAGCTGCTATTTGCGCCGTCATGCTGCCTGCTCTCAGGACTTAATATTGATGGTTGTCACTTCCTCCGCTTCGGCAATCTCCCGTTCGGTCAGCGTGGCAAAGTTTGCAGCCGCCGTTGTCATGAATGCGCTTATCAGTTCGGGATGTGCTTTCGCGTATCCTTCCCCGGCGTTGCGGTCTATTGCCTTAATCGCCACCCTTAGCCAGTGTTCAGCCATATCAAGGGCGCGGTAATGTGGCTTTATATGTTTGTTCAGTTTTCCTGATGTGTGCATTTTTATTTTTACCCCCTCGTTTAAAAAGTTTTTTGTGCACCACCACCTTGTCTACCTTGTCTACCTGATTAGTTATCAGGCCAGTAATGGCGCGGGTTTCAGGGAGGTAGACAGCCCCAAATAGCTGTCTACCTCATCTCTACCCGTCTCCTTACCTGTCTACAAAAATGGGTAGATAAGGTAGATAACAGGTAGACAGTGAAAAATAGTTATCTACCTGCATTAATACATTGAAATAAAAGTATTTTATTTCAGTCAGGTAGACAAGGTAGATAACCATTGCCATTTTTTATAAAAACGCATCGCAATCATCAGTAGTCGTTGCGTTGGTCTGCGTGACTCCCTTAACTTTTCGCGTAATATATTCATATCCGTAAACTTTCGCCGCTGACCTCATAGCCTTTCCGAACTCATTCACGCTCAAACATTTCCCCTTTCCTGTATATGCCATGAAGGCCATATAGACACGGTAAAGGCTGTTTCTGGTCGTGTACTTCACGGTGTCACCACCACCGCCCATCATTAGCCCACGAGCTTCCTCCAGAAACTCCAGCGCCGCGCAAAGCTCAACAACCGGATCCGTTTGCTGCTTTATTGCCAGAGCTTCATCACCGTCACGCTGTTCCAGTAATAAAGCCCGTGCCTTTTCAGGGTCAGCAAAATTAGCCAGCAAGCGGCGGATAATTACGGGGATTTCTGCCGCTATCTTTTCCGGTAATTCCTTGTCTTTTTCGTCCTCCCTTACAATGTTGTCGAACCGGAAAATCACCCGACGGCGTGACACACCTCCGGCCCGTTCGGTAAAGATCATCGGGTCGTTATTGGTTGCCAGTACCACCGCCCTTATTATCGTCGTGAATCGCTTCTCATATTTCGGGTTAATTTCAACGGGATCGCCTCCCGTGATTTTCTTGATGCCCGTGCCTTCCCCCGTATATTTCGGCTGATCGGCAAGGACGATAAGACGACTCCCGACAACCTGCGCACGCCCTCCTGCATCATCGAGTGATGTCATCTCTGCGCTTACGGTGTTCTGTTTGCCAGCAAGCAGGGTGGCAATATGGGTAAATGTACTCTTACCGCTTCCCCCGTCTCCGGTGGCCTCAATGAACATCTGCCAGTCGTACCGGTTCGCCATAATCATGTACAGCGCGGCACATATACGCATCATCTTGCGCGGGTCTTTTCCGGCTGCGTGCTCAAGCCATTTATGAAAGTTTGGCGCGTTATCGCGGATGTTCTCCCCTGGTGCTGGTGGCGTGTACTCAATGCCGTTGTGCGTGGTGATCCAGTTCTCCGGCGTGTGCGGGGAAAATTCCCCCGTTTTCAGGTCAAGCGCACCATTGGCGAACGGCAGCAAATCGCCGGACGGCTCGCCCATTGGTTCGGCAATAACTTTTAACGCTTCAACGGCGTTATTGATCACGCGTTTGCTGAAAGTGGCCCTGTGCTCTGAATAGATCGCCACCATTTCGCGGCTCAGCTCCATTGTGCTGATCGGACACCATACCCCGCCGCGCCATACGTGAACGATTTCACTTTCCGGATGCACACAAACGCCATCAAAGCGCTCGGCAAGCAGCTGCGCGCGCTCACTGTCTGCCATCTGTGAAAGTTGCGCCTTTTGCTTCGTCGGAAGATTAAGCACCAGGCTTTCCCCACGCTCGCATTCCTCTTTGAGTCGCGGCAACTGGTCGGATAAATCCACTGGGCTGGTGTCAGTAATCCCCGCGTATTCGTGTACGGTCTTCACTCCAGCCACAGCCAGTAACGTAACAATCTGCGTCATGCTGTGCTCTGTGATATGTCCTGCGCGGTAAACACGCACACACTGACGATCTTCATCAATGATCCGGTAATCGGTGATGTTTTTCAGTTGCTCATCAGCCAACACGACAGGCGGCACATCGTCGGCGGCAATATGTTTACCCGCCCATTCCTGCCACTCTTTCGCATGGCTCCACGCATCACTACCTGCAAAGATGATTATCTCTGTCAGTCTGTCGCGTGGCTGTTTTTTTAAGTTCGGTGCCAGTTTCATTTTTTGCCCCTGAATGCGTTAATCATGCTTTTCATTTTCTGGATGTTTCCCCGCGCTTTTTCCCTGCTGATGGGCTTACTGCGGGGTGCGGCATATACCAGGGAAAAATCACGCCGGAACTGATAAACAGGCATCACGCAGTCATAGCTATACCCCTCACGACGGTAAGTGATGCGCCGTTCTGCCACGCCTTTAATCGTTACCGTGCCGCCGTATTTATCGCGGTAAATATCGCCGTTCATAAATTCAGGCCGAGCGGGGCCGCTGGCAATAAAGCCAGAATTTTTCATTTCCATATTATTTATTCCTCGACTTAACTCGACTTATTTGATAGCAGGGCACTATTTATTGCGTCATTGAGTTTTTCTGCTGCTTCATCAATAAGTGACAACAGGCCATAAGCAATATTTGCATCTTCATTGTCATTTATGCAATCAAGCCACATATTTAATATTACTTTTGCTGAATTATTTAAAGTTAATGAACTTTCTGCACATGCTAACAATTTAAAAAAGACTTCCCGTTCTGTATTCATTTAATCCCCCACCAGCTTACTTTCTTCCTCAATCAAAAAACTAGCGACACTTCCCGAAAGACGCGCCAGTAGGCTCACCAGTGCGGATATATCAGCATCTGTAATTTTGTTCGGGTATACCTCAAGAAGGCGGCAAATAATTTCTGTCTGGTGCGCACGTTCAGCGGCTTCGTGTAATGTGATTTCCTGCATTAATGCACCTCTTTTAATTCATACACTGCTGAAATAATGACTTGTGATAAGCCATATTCTGATGATTCGCTTCTCACCGCAGCAATAGCCGTCTGAACATTAACAGCCTTCACATTCTGAGCGATACCAATTGTGTGGCCTATTGGGTTAACAGCTCGGGCAAATACACGGAAGGTTTTAAGCATGACTCACTCCCTGGCGGATTTTTGCAGCGAATACAGCAACACAACCGGACGGGCAACGGCTACGCGCTTCGCGTTCCGTCCAGGCGGTTACGTGGATGATTTGAGATTCTCCGGCACTCAGTGCCAGAAAACGCCACACAAAGGCCGTTTGTGTGTGTACAAGGTGTGGTATATGATTTACAGCAACCATAACGGCTCCTAGTTTACGTTGTTGGTTAGACGCCCCGTATGTGTTCCCAGCACTGCGGGGCGTTGCTCTTTGTATTTCAACAATCCTTTCGGTGTGTTTCATGTTATGAGCGCATGAAACACACGTCAAGGCTTTTTGTATTTCTTTTTTTGTGTATACTGAAACACACCGATGATTAGGAGTTTCAGAAATGGCAACGGCTAACAAAAACGCAAAATCACAACTGACAACTGTCAGAGTCCCACTAGATGTTATGCAAGGGATGGAATCCGTTAAGCTGGACGGTGAAAGCAATGCCGGATTTATCGTAACCGCCATGCGCGGTGAAATCGCCCGCCGCCAGGCAGAAGGAAGCGGAGAAAATCCCCTCGTGTCTTCACTGGATGCCTTAGCTAAGGTCGAACAAATCGGCATCAAGGCAGCGGAGGAAATCGGGCAACTCGTAGCCGTCGCTCGTGAAGAACTCCAGCGGCGTAAAGCCAAAGAATCTGAATAATTAGTATCAGCGCCGTGATGTGAGTAACTACGGCGCATTGCTATGTAAATACTGGCAATAAACAGAAAAGGTAGTTCTACTCCGAATAATTTTATCTGACACTACTCCTGAACTAACATGCGCTTATCTTACAGGATATAAATATAAATCCATAAAATCACGATTAAATAAAGTCGCTCCAAACATAAACCACACCCAACGCTTAACAAGATAGCAACAAACAGGTAAATAACTTGCAGAAATATTTATCGCAAGGATTATCATTATTAATGACAAATCACTTTACCAAGTCATTCCCCTCTCTTATCATAAAGAGAAAGTAATAAATAAGTTAAGGGAGTTAGAATGCTATGAATCTAAAAAAAATAGCCACAAACACAAAAAACAAGATAACAGAAACATTTAATAAACTTATATTAGAGGCATCTAAAACCCCCACACAAGATGAAATTAAAATACTTGAGAGAAGGAGTAAGAAGTTTAATTACTCCTTTTTCTCATACGCAGTCACAGGAGCTATAATAGTTTTTTGCTCTCAACCATTAATTAAATACGCAAACCCAATACTTATTTTATTGAGTGGTCTGTTACTGTCTATCATCATTATCATTCTCAGAATGATTTATATTTCACAAGCGAATGCATCATGGACAACCAAAAAACGCTCACATGTACTAGTTCATTTTCTTTCTGCATGTTTCATAGCATCAACATTGACGTTGCTATATCAGGCTTACGATAATAACATCACACACAAATTGTACTGTAAAAATATACAACAACTTATTGAAAAAAGGATAGAAACAGAAAAAAACATCAGCATATTCAGTGGGATGCAATGCACCCCGGTATATGATTACTCTTTATTTGGATTTAATCTCTTATAAAGAATGTTATTACTGATTTGAGTACAAATTCTCAAATCAGTAATTCATAATATTTTATTCTGAGATAATTTAAACTACCCACTCACCTCGAATCCATGCCTGCACTTCTGAAAGACGATATGCAACAGCAGTGGAACCAATCTTGATCCGCTTAGGAAATTTTCCTTCCTTCTCCAGCTTCCAGCGTGTGCTGTTCGCAAGAGTGGTTAGCTCCCGACATTCTTTCTCACGGATCATTCGGTCAATGTTAGGAATGTACTCCAGACCCTTTTTATCAACAATTGCCATTTTTTTCATGTTAACCAACCTTTTGTTTGAGGATTGTCACTTTTGAATCAGCACCTGCGATGCTATTGAGATATGTAGTCCAGAGTTCCAGAGCATCCAGTTTTTTAGCCATAAACTTACTCCGGTTGTAAACACCTGCCACGCCAGGTAGCGCATGGCCTAACAGTTGTTCTACTACATAAAATTCAACACCGAGATCACTTAGATGAGTAGATAGCGTTCTTCTAAGGTCGTGTAGTGACCATTGTTTTTCATGGCCCAAACGTTTACCGATTTTCCCCCCAATCTTGCTTACGCTTTCTCTAATTCGCAGACTTCCCAGCACATAACCAGTATGTTTTGTCTCTTCGTGAACATCCGTTACCCACTGTCGTAGAATTTCAGGTACTGGTCTGACGATTTCAACACCAGTTTTTGAGTGATCTTTTGGTACAGTCCAAACCCAACTTTCGAGATCCCATTCGCTCCATTCTGATAATCGGGCTTCACTCATTCGACATCCAAATACTGTACAAAGCACAAACATTTTTCGCGTGTATTCAGACATTAGTTTTAAATCAGGCTCGACAAAAATTGCCTTCCAGAGCTGGCCGAGTTCGGCTTCATCCAGAACCCGATCCCGCTTACCTGCAATCTGCCCCACATCACTCATGCGCAAATCCTTTAAAGCATCACACGTCGCGTACTGGCGTACCCGACAAAAACGAAGAGCTAATTTAGTGTCAGAAAAAACATACGCCGCCATAACTGGTGCATTACGTTTAATTCGGTCAAAACAGTCCAGCCATTCATATAGGTGAGTGTCATTTACGGGCAAATGACCGATATAGGGAAAGATATGCTTTCGAAATCTGCCAAGCGTTACAGCATGAGTTTTACGACGCACCTTACAGTAATTTTCATACCAGTAATTTAGTGCATCCTCCACTGTAACCGGCTTTAAGCGTTCTTCAGCCTGAATCTTAATCTGGATACGCGGATCACGTTTGTCAGCCAACCAACCACGGCACTCGTCGCGCTTTTCCCTTGCCTGTTTGAGTGACATATCAGGATATTTACCCAACGTTAGCCAGACCGGAGCAGCCCGGCCACCTGCTAACCTGTAGAAGAAAACAAAGCTCACAGCCCCTTTAGTACTCACACGAATAGAAAGCCCCTTTCCATCAGCAATGGTGATCTGCTTTTCTCTGGGTTTCCCCAGATATCCTTTAAGCGCTTTGTCGCTCAGTTTGTTCTCGCCAGCCATTTTTAGCCCCAAAAAGCAATACAAGCTGCAATACAGAGATGATTGCAACACACAGATAACGAGGAAAATTCAGTGAAAGCACCAGATAAACTTATTCTTTATTATCAAAAGATTAAGTGTAAAAACCAGCAACTACACGAAAGCCTCAGAAAGCCATGCTAAGTGCTTGGGTTTGACATATCCCGGCGTAAATTCAGAGGTGGAGCCGCCACGGGAACGGATAACCTCACCGGAAACAATCGGCGAAACGTACAGCGCCATGTTTACCAGTCCCGGAATTTGTGAGAGATAGACTTTCTCCGTAGTGAAGGGATAGCTCTCACGGAAAAAGAGACGCAGAAACAGCGGATCAAACTTAAATTTCTGCTCATTTGCCGCCAGCAGCTGGGCGGTTGTGTACATCGACATAAAAAAATCCCGTAAAAAAAGCCGCACAGGCGGCCTTTAGTGATGAAGGGTAAAGTTAAACGATGCTGATTGCCGTTCCGGCAAACGCGGTCCGTTTTTTCGTCTCGTCGCTGGCAGCCTCCGGCCAGAGCACATCCTCATAACGGAACGTGCCGGACTTGTAGAACGTCAGCGTGGTGCTGGTCTGGTCAGCATCAACCGCCAGAATGCCAACGGCAGCACCGTCGGTGGTGCCATCCCACGCAACCAGCTTACGGGTGGAGGTATCCAGCATCAGCGGGGTCATTGCAGGCGCTTTCGCACTCAATCCGCCGGGCGCGGTTGCCGTATGTGCCGGGTCACTGTTGCCCAGCGGCTGGTAATGGGTAAAGGTTTCTTTGCTCGTCATAAACATCCCTTACACTGGTGTGTTCAGCAAATCGTTAACGGCATCAGATGCCGGGTTACCTGCAGCCAGCGGTGCCGGTGCCCCCTGCATCAGACGATCCAGCGCAGTGTCACTGCGCGCCTGTGCACTCTGTGGTGCAGCTGCCAGAATGCGGCGGGCCGTTTCCACGGTCATACCGGGGGTTTCGGCCAGCACGCGTGCCTGTTCTTCGCGTCCGTGAGCCTCCTCACAGTTGAGGATCCCCATAATGCGACTGTTTTCTGCCGCAACCGCTGCGGTGATCTGCGCGTTCACGTCCGGCTGCGCCGCGCTGGCGTTCTCGCCCTCCGTCGCTTGCACCACGCCAGTAACGTCAGCCTGCGAAGCAGTGGCTGAAACAGTTGTTGATTGAGTCTCTTTGGTCATTCGCCCTCCTGAGAGACGGGATTTACGTGCATCCAGTGCATCACGCATGACGGTGATCGCATCGGTACTGTTAACAAGTTCATCAGCCAGTCCGGCATCAATGGCCTCCTGACCGCTGTACACTGCAGCCTCGGTATCCAGCACAGCCTGCACGGACAGGCCGGTATATGCCGACACCTTCTGTGCAAACATCTGGCGGGTTGCGTCCATCCGGGACTGCAGTGTCTCCCGGACGTCATCCGGAAGATGGCTGTAGGGATTGCCATCCACCTTATGGCTGCCGCTGTAAATCAGCGTGATTTCCACACCCTGTTTCTCCAGGGCAGCACCGTAATTACTGTGAGCCATCATGACGCCGATGGAGCCTGTCCGGGCGGTCTGCGTGACCAGACGCCGGGAGGCGGCACTGGCAAGCAGCTGACCTGCACTGCAGTTCATGTCGTTGGCCAGCGCCCATACCGGTTTTATGTCACGCACACGGGCGATGATGTCAGCGCAGTCAAATGCCCCCGCCACCATTCCGCCTGGCGTGTCCATATCGAGCAGAATGCCGTCCACCATCGGGTCGCTGGCAGCCTGTTGCAGACGGGCGATAATGCCGTTGTAACCGGTCATCCCCGAGTACGGCTGCAGCGCCCGCGTCCGGCTGACCAGCGTGCCGGACACCGGCAGCACGGCGATGCCGTTCATGACCTGATAACTGCGGGCCTGTCGTGGTCCGTCATCATCACCGGATAATGCCAGCGTCGCGAGTGCCTCCTGGGCAGTCAGGCTGTCGCCGGACACCGCATCCGTCAGGCGGCTGATCCCAAGCTGGCCTGCAAGCGCACAAAAGAAAACCCGCGCATAGGCGGGTTCAAGCATCAGCGGCTCATTAAAGGCCATGCTGGCAATATGCGGGAGATTACGCAGCTCTGCTGTCACTCTTCTCCTCCTCTGTTGATTGTCGCAGCCCGGATTCAAATGCCGCAGCCGCCCAGGCGGGCGGTTTAAGACCAGCCGCGCGGCGCTCCATCGTTTCACGGACCTGCTGGGCAAAAATTTCCTGATAGTCGTCACCGCGTTTCGCGCACTCTTTCTCGTAGGTGCTCAGTCCGGCTTCTATCAGCATCGCCGCTTCCTGAACTTCTTTCAGACCATCGATGGCCATACGACCGGAGCCTATCCAGTCGCAGTTCCCCCAGGCACTGCGGGCTTCCTGAAAACTGAAGCGCGCTTTTGAAGGTAACGTCACCACGCGGCGGGCGATGGCCTCTTCCAGCCAGCACAGAAACATCTGGCTCGCCTGACGGGATGCGACGAATTTTCGCCGCCCCATAAAGTACGCCCACGACTCGTTCGCACTGGCCCGTGCCGTGGAGTAGCTCATCTGGGCGTAATTCCGGGAAAGCTGCTCATACGAGACACCCAGCCCGGCAGCGATATACCGCAACAGTGACTGCTCAAACACGGAGTAGCCGTTATCCGTGTCCTGAGCCGTCTGCAGGTTCAGTGAGTCCCCCGGCATCAGGTGCGGCACTTTTGCGCCTCCCAGACGGACCGGTGCTGCGGCGTAATACGCGGCAATTTCACCAATCCAGCCGGTCAGCTTGTCCCGCTGCTCCTTACTGTTCGCGCCCAGAATAAAATCCATCGCTGACTGCGTATCCAGCTCACTCTCAATGGTGGCGGCATACATCGCCTTCACAATGGCACTCTGCAGCTGCGTGTTCTGCAGCGTGTCGAGCATCTTCATCTGCTCCATCACGCTGTAAAACACATTTGCACCGCGGGTCTGCCCGTCCTCCACGGGTTCAAAGACGTGAATGAACGAGGCACGACCGCCGGGTAACTCACGGGGTATCCATGTCCATTTCTGCGGCATCCAGCCAGGATAGCCGTCCTCGCTGACGTAATATCCCAGCGCCGCACCGCTGTCATTAAGCTGCACACCGGCACGGCAGTTCCGGCTGTCGCCGGTATTGTTCGGGTTGCTGATGCGCTTCGGGCTGACCATCCGGAACTGTGTCCGGAAAAGCCGCGACGAACTGGTATCCCAGGTGGCCTGAACGAACAGTTCACCGTTAAAGGCGTGCATGGCCACACCTTCCCGAATCATCATGGTAAACGTGCGTTTTCGCTCAACGTCAATGCAGCAGCAGTCATCCTCGGCAAACTCTTTCCATGCCGCTTCAACCTCGCGGGAAAAGGCACGGGCTTCTTCCTCCCCGATGCCCAGATAGCGCCAGCTTGGGCGATGACTGAGCCGGAAAAAAGACCCGACGATATGATCCTGATGCAGCTGGATGGCGTTGGCGGCATAGCCGTTATTGCGTACCAGATCGTCTGCGCGGGCATTGCCACGGGTAAAGTTGGGCAGCAGGGCTGCATCCACACTTTCACCCGGTGGGTTCCACGCCCGCAACTGCCCACCAAATCCGCTGCCACCGCCGTGATAACCGGCATATTCACGCAGCGATGTCATGCCGTCCGGCCCCAGAAGGGTGGGAATGGTGGACGTTTTCATACATAAAATCCTGCAGGTCCCCTGCGTCGCTGTGTCATGCCGGTCTGCACTTCCAGCTCCGCAATGTATTTTTTCAGGTCAGACACGGAAGTGGCCGTAAACTCCACTCTCCGTCCGTCTTTCTGTACCGTTGCCACCCGTTTTCCTGTCATCAGGTCATGCAGTGCCGCACGGGCAGCGGCAAGTTCTTCCTGTCGCGTCATTCATCCTCTCCGGATAAGGCACGGGCGTAATCTGCCAGTGTTTTCTTGTTGGTTGCTGCACCATCCTCTTCCTGCAGGCTCGCCAGCAGTGCACTGAGATCCAGCTGCCAGCGGGAAATACTGATGCGCAGCGCCGCCAGCGCATAAACGAAGCAGTCGAGCGCCTCATTGCGTCGCTTTTTGCTGTCCCACAGTATTTTTTTCCTGCCATCCACCCATTTTTCGACCTGCTCTTCAGCAGTCAGTTGCTGCGCTTCGGTCAGATCAAAAATATCCGGGTTATTCGGGAAGTGAACGGCACCGGGAAGCGGTTCATCCCCTCCCGGCGTCAGTGTGAAGCGGTTATAAATCTGCTCTTTCGCGGTATCCGTACCGATTTCGGTAAGGTAAACCCCGTTTTTGTTTCGCTTACGTGGCATGCTGGCCACCGGCTTTCCGTAGACGGATGCCCCTTTAATGGGGATCACCCGGAACAGCCCATGTTTTTTCGAGCGTTCATACACAATGGTCGGGTCAATCCCGCCAGTATCCCAGCAGATACGGGATACCGACATTTCTGCACCATTCCGGCGGGTATAGGTTTTATTGATGGTCTCATCCACACGCAGCAGCGTCTGTTCATCGTCGTGGCGGCCCATAATAATCTGCCGGTCAATCAGCCAGCTTTCCTCACCCGGCCCCCATCCCCATACGCGCATTTCGTAGCGGTCCAGCTGGGAGTCGATACCGGCGGTCAGGTAAGCCACACGGTCAGGAACGGGCGCTGAATAATGCTCTTTCCGCTCTGCCATCACTTCAGCATCCGGACGTTCGCCGATTTTCGCTTCCCACGTCTCACCGAGCGTGGTGTTCACGAAGGTTTTACGTTTTCCCGTATCCCCTTTCGTCTTCATCCAGTCTTTGACAATCTGCACCCAGGTGGTGAACGGGCTGTACGCCGTCCAGATGTGAAAGGTCACACTGTCAGGCGGTTCAATCTCTTCACCGGATGACGAAAACCAGAGAATGCCATCACGGGTCCAGATCCCGGTCTTTTCGCAGATATAACGGGCATCAGTGAAGTCCAGCTCCTGCTGGCGGATGACGCAGGCATTATGTTCGCAGAGATAAAACACGCTGGAGGGATCATCCGGCGTCCATTTGAGGCCAAACGGCGTCTCTTTATCGCCAAATTTAAGGTACTGCTCCTCCCCGCAGTGCGGGCAGGCAACATGAAAACGCATAAAATGCGGGGATTCACTGGCTGCACGCTCAATCTGGCAGGTGCCTCTCACTTTGGGCGTGGAGCCACGGATGGACTTTGGCCAGACCGAGCCTTCAATACGCTTATCGCCCAGGAACGTCGGAGAGCCTTCCTGTTCAATATCCTCATCAAAGGCAGCAAGTTCATCATAACCCGCCACATCCACTGACTTTTCACGGTAGTTTTTTGCCGCTTTACCGCCCAGGCACCAGAAGCCACGACCATTGGAAAAACGCTTCATAGTGAGCGTGTTATCCCGGTGCTTTTTGCCATACCACGGAGCCAGCGCCAGCAGCGACGGAATATCGCGGATGGTCGGCTCAACGTGGGTTTTCATAAAGTTCTCGGCATCACCATCCGTCGGCAACCAGATAAGGGTGTTGCGCTGCTTATGCTCTATGAAGTAGGCATAAACACCCAGCAGCATTTTGGAATAACCAACACGGGCAGACTTCACCACATTCACCTCGCGGATGTAGTCACTGCCCATCGCATTCATGATGGCCCGCTGAAAGGGCAGTGTTTCCCAGCGCCCTTCCTGGTATGCGGATTCTTTTGGGAGATAGTAATTGGCATCCGCCCATTCAACGGCGGTCTGTGGCTCCGGCCTGAACAGTGAGCGAAGCCCGGCGCGGACAAAATGCCGCAGCCTGTTAACCTGACTGTTCGATATATTCACTCAGCAACCCCGGTATCAGTTCATCCAGCGCGGCTGCTTTGTTCATAGCTTTGATGATATCCCGTTTCAGGAAATCAACATGTCGGTTTTCCAGTTCCGGAAAACGCCGCTGCACCGACAGGGGGATCCCGTCGAGAATACTGGCAATTTCACCTGCGATCCGCGACAGCACGAAAGTACAGAATGCGGTTTCCACCACTTCAGCTGAGTCTCTGGCATTCTTCAGTTCCTGTGCGTCGGCCTGCGCACGCGTAAGTCGATGGCGTTCGTACTCAATAGTCCCTGGCTGCAGATCTGTCTCGCTGGCCTGCCGCAGTTCTTCAACTTCCCGGCGCAGCTTTTCGTTCTCAATTTCAGCATCCCTTTCGGCATACCATTTTATAACGGCGGCAGAGTCATAAAGCACCTCATTACCCTTGCCACCGCCTCGCAGAACGGGCATTCCCTGTTCCTGCCAGTTCTGAATGGTACGGATACTCGCACCGAAAATGTCAGCCAGCTGCTTTTTGTTGACTTCCATTGTTCATTCCACGGCCAAAAACAGAGAAAGGAAACGACAAAGGCCCAAAAGTTCGTTTTCAGCACCTGTCGTTTCCTTTCTTTTCAGGGGTTATTTTAAATAAAAACATTAAGTTACGACGAAGAAGAACGGAAACACCTTAAACCGGAAAATTTTCATAAATAGCAAAAACCCGCGAGGTCGCCGCCCCGTAACCTGTCGGATCGCCGGAAAGGACCCGTAAAGTGATAATGATTATCATCTACATATCACAACGTGCGTGGAGGCCATCAAACCACGTCAAATAATCAATTATGACGCAGGTATCGTATTAATTGATCTGCATCAACTTAACGTAAAAACAACTTCAGACAATACAAATCAGCGACACTGAATACGGGGCAACCTCATGTCAACGAAGAACAGAACCCGCAGAACAACAACCCGCAACATCCGCTTTCCTAACCAAATGATTGAACAAATTAACATCGCTCTTGAGCAAAAAGGGTCCGGGAATTTCTCAGCCTGGGTCATTGAAGCCTGCCGTCGGAGACTAACGTCAGAAAAGAGAGCATATACATCAATCCAGAGTGATGATGAATAAACATCCCGGTTTCTTCCACCATCGCACCGGAAAAGCGACTATGAGGGTAACCCTGCGTCTGTCAGCACAGTAAAACCCGGTGTGCATCGTTTTTGATTATTCCCGCACACTCACGCAGAAGGAATTCCCCGTCGGGCTACGGTCATGGTTAATGCGGGAATACGGCGACGATACAGCGCAGCTAAAAGGGTAATGGACAGAAAGAGCGGTTTATTTCATTCCACAGGATTCTGAGTGCCCCCCCCTCCTCCAATAGGCTGAGCATCCACCTATATAGTTTTAATTTTCATCAATCCATTTAACTATCGTTTAATTGTTGTCACATAGGATTCTGCCGTTTTTAACAATGCAGGATAATAAGATGAAAAAAATGTTGTTTTCTGCCGCTCTGGCAATGCTTATTACAGGATGTGCTCAACAGACGTTTACTGTTGGAAACAAACCGACAGCAGTAACACCAAAGGAAACCATCACCCATCATTTCTTCGTTTCGGGAATTGGACAGGAGAAAACTGTTGATGCAGCCAAAATTTGTGGCGGCGCAGAAAATGTTGTTAAAACAGAAACCCAGCAAACATTCGTAAATGGATTTCTCGGTTTTATTACTTTAGGCATTTATACTCCGCTGGAAGCGCGTGTGTATTGCTCACAATAATTGCATGAGTTGCCCATAGATATGGGCAGCTCTATCTGCACTGCTCATTAATATACTTCTGGGTTCCTTCCAGTTGTTTTTGCATAGTAATCAGCCTCTCTCTGAGGGTGAAATAATCCCGTTCAGCGGTGTCTGCCAGTCGGGGGGAGGCTGCATTATCCACGCCGGAGGCGGTGGTGGCTTCACGCACTGACTGACAGACTGCTTTGATGTGCAACCGACGACGACCAGCGGCAACATCATCACGCAGAGCATCATTTTCAGCTTTCGCATCAGCTAACTCCTTCGTGTATTTTGCATCGAGCGCAGCAACATCACGCTGACGCATCTGCATGTCAGTAATTGCCGCGTTCGCCAGCTTCAGTTCTCTGGCATTTTTGTCGCGCTGGGCTTTGTAGGTAATGGCGTTATCACGGTAATGATTAACAGCCCATGACAGGCCGACGATGATGCAGATAACCAGAGCGGAGATAATCGCGGTTACTCTGTTCATTGCTGACCCCACAAACAGATTTCACGCTCAATCTCACGACGAGTCATGAGACCTTTCCATTGCTTACCGCCAGCATATGTCCAGCGACGTAGCTGATCACATGCGCCTTTAATATCACCCTGGTTTATTTTGCGAAGAAGCGTCGATGTTCTGAAATTGCCAGCACCCACGTTGTAGACGAACGAGTAAAGAGCGCCGCGCGTTGTTTCCGGTATATCGACTTTGATGTACGGGTTAATTTGTCTGGCGACAGTGGCAAGGTCTTTATTCAGGAGGGCTTTGCACTCTGCTTCGGTATACGTTTTACCGGGAATGATGTCTTTTCCTGTATGCCCGTGACATACAGTCCATACACCAACTATGTCTTTGTAAGGATTATGTCTCACACCTTCCAGACCATCGTTACCACTTGGGCCAGTGATTAATACAGATGCTATAGCAATAGCCCCGCCACCAATAGCAGCAGCAACAGCTTTTCGTAATGATGGAGGCATTATCCACCTCTCGCAGCCTTGCGCTTATCTTCTTTAATCTTGAAATAAAGGTTTGTCAGGTACGTCAGCAGGCCAAATACCAGGCTACCCAGCACTCCAATTGCCGCCCACTGTGAGGGCGTGACTTTATCTAGCAGCTGTAAAAACCAGTACCCGGCACTACCTGCTGAGGTGCCATAGGCGACACCCGTTGTTAACTTATCCATGGATTTCATAACCCCACCTCGCAGACAAAGCGGGTGTAAATTGAGGGAATACTACGAAACGTAACAGACTCGGAGTCAGTGAATAACTCAGGTATTGGGTTATCAGCTAATATCGAGACTCAAAAAATGGAAAAACCCGCTCGACGGCGGGTTTAAGCTGTGTGACGAAGTAACCACTCTTAACAGCATAACCAATTTTTTACGTACGTAAACCACTAAATGATATTTGCGAGAATGCTACCGAGTATTGAAAACACCACTACAAATACATAAGCAAATCTCAACAAATAACCAACAAATAATTTCCAGTGTTATTTTTAGCTGGTTTAAATTGAACCTTCAAATTATAGAGCACTTATAAATAACAGCCGTTAATATAAATTGGCTAACAGATTTATTTTTATTCAGCCAAGAGCCATGAATAGGATTCGATAGAAAAAAGTTCAGATAAAAATAGAGATCTACTTCACAAATCAAACGAGAAACCAAAACTTACATCTTGAAATAATCACATTGATTAGATGAATATTTATCGCGCAGTGACATCATTTTTTAATAATAGTTCAAAAAAAGGGCTCACGATGAAAAAATTAACAGTGGCAATTTCTGCTGTAGCTGCATCAGTACTGATGGCGATGTCTGCTCAGGCAGCTGAAATTTATAATAAAGACAGTAACAAGCTGGATCTGTACGGGAAAGTTAATGCTAAGCACTACTTCTCCTCTAATGATGCAGATGATGGTGATACTACTTATGCCCGTCTTGGCTTCAAAGGTGAAACCCAAATCAACGATCAACTGACTGGTTTCGGTCAGTGGGAATATGAATTCAAAGGCAACCGCGCTGAATCTCAAGGTTCCTCCAAAGATAAAACCCGTCTTGCCTTCGCTGGCCTGAAATTCGGTGACTACGGCTCCATCGATTACGGCCGTAACTACGGTGTAGCATACGACATCGGTGCGTGGACTGACGTCCTGCCAGAATTCGGTGGTGACACTTGGACTCAAACCGACGTGTTCATGACTCAACGTGCAACTGGTGTTGCAACCTATCGTAACAACGACTTCTTTGGTCTGGTTGATGGTCTGAATTTTGCTGCTCAGTACCAAGGCAAAAACGATCGTAGCGATTTCGATAACTACACTGAAGGTAACGGTGATGGCTTCGGTTTCTCTGCTACCTATGAATACGAAGGATTCGGTATCGGTGCAACTTATGCGAAATCTGATCGTACCGACACTCAAGTTAATGCAGGGAAAGTTCTTCCTGAAGTATTTGCTTCCGGTAAAAATGCAGAAGTTTGGGCCGCAGGTCTGAAATATGACGCTAACAACATTTACCTGGCCACTACCTATTCTGAAACCCAGAATATGACTGTATTTGCTGATCACTTCGTTGCTAATAAAGCCCAAAACTTCGAAGCTGTTGCACAATATCAGTTCGATTTCGGTCTGCGTCCGTCCGTTGCTTACCTGCAATCTAAAGGTAAGGATCTTGGAGTATGGGGCGATCAGGACTTAGTCAAATATGTTGATGTAGGTGCAACCTATTACTTCAACAAAAATATGTCTACTTTCGTTGATTACAAAATCAACCTGCTTGACAAAAATGACTTCACTAAAGCACTCGGTGTAAGCACTGATGACATCGTTGCTGTAGGTCTGGTTTACCAGTTCTAATCTGATTACGAAAAAGATATGTTGCGGGAGGCGTTGCCTCCCCAACATATAAGTGGCTCCCTCAAGCCACTTCCTTTAGAAGCACAACCTTGCTTCTAACTATATAAACCTTCTGTTATATATTACCCTTTATTTTTGGGGGCGTCTCAACGCCCCATTTTTAATAATTTTTAGTAAACAATTGGCATATTAATTAGAGTTATTAACAACGATATCCATCTCTAACCGGATATCTAATGCCATTAACATCCCTTCAATTATACCCTCAGCCTTCTGTAACCTTTTCCCGATATAACCATCAGAGCAGCAATGCTTACCTGCCAGTGACATGAATGTCATACCGACTACATAATAATCTACTAATAAATCGTGCAAATCGCTGTTGTTCTTTTTCAGACGGGCCATGCACCCGCAAATGATCATCGCGTCATCGTCACAACATTGCGGGCGAGATTTTACTTTTGAAGTAATTAATCCCTTAAAACCGGCGGCAATGGACGACCAGGTCACATCTTCATGATTATTAGCCGCCCACGCTCCCCAACGCTCAAGAACCATCTGAATATCACGCATCAACTTACTCCACAAAAATCAGACCAGAACGCCAATTACAAGCAAAAATCAACAAAACAGTATTAGTTGATTGTTATCTCTGACTTCATACTCCTGCTCCTGTCAGGGTTTTGGCGTAATTCTTCAGTATTCGGTAATCGGTCAAAACAGAACCGGGGAAACGATATAAGCGCAGACGCCCCCAGCGGTGGCGAAGAAGTTCTGCCATATTAAACTCAAACATCATTCATTCCCCATTTCGGTGATGGTCAGTTCCAGCCTCCCACCTTTGGTAACAGGCATCTTCACAACGCGGTAATCAACGACCTGAGCATCATCCAGCCAGAAACCTGCTTTGGTGAGTGCGTCAAAAGAGGCTTTTTGCAGATTATCCAGGTCACGGCGACGGCGATCCGGCATGTGGCACTCAATGCGGATTTTCACAGGCATAGCCAGGCCGATATCCAGCATTGCGTTTTTAATGATTCGGGCGACGTTATCGCAGTATGCCTGCCCCTCTGCGCTAACGTGCGTGCGCCCGCGATTATGGCGGTAATAGCGATTATTGCTCGGAGGCCAGGGTAATGTGATGCTGTAGGTATTCACGCCTTAATAACCCCCTCTTTCAGCCAGATAACCTGTGTTCTCGCCATACCTTCCAGCGCGCATTCTTTTGCATATGCAGCATCGACTTGCTTGGCATGCCCTTTACCGGCAATCTTCTGTATGCGCTAAACCTAGATAGAATCCACTCTGTGCACATTGAAGCCCGCTCTATGCTTCCTTTCAGGTATTGAAGGGATTGAGATGGGCTAAGCATTATTGGCCTCCTGCATCAGGAGAAAGACAATCATGGCGGCGCGGAGAGGTCTGGCATCAAATATTGGGCTTACGCCTTTTGCATCCACACACCATTCAGTTAACTGGTCTAAGATAGAAATCCTGTATTTCTCAATAATCGGCCATGAAGCGCTCGGATCATTGCAGTAGTCAGGCAAATGATTTAATGGCTCAAAAGTTGTATCAGCATTTCCGTAATACCATTTGTTGGTGTTATTCCCTGATGTTTCCGGTTTACTTGCCCAAAGGCCTTTAAAAATTATGTCTCCTACCATTCTGTTAATTTCAAAATCACTTAACTGTGAATAATCCATTGTCATTTCCTCGCACGATATCTTAGCCACCGGATATCCCACAGGTGAGCTGTGTAATTGAAGGTTTTTACGTCAGATTCTTTTGGGATTGGCTTGCGTTTATTTCTGGAGCGTTTCGTTGGAAGGTATTTGCAGTTTTCGCAGATGATGTCGGTGAAACTTCGTCGCTGTCGCCTCATGCCGCCCTCCTGACGCCCTGCCCGATCGCCATCAATGCCGCTTTGGATACGGTAGTAAACATCCGTCGAGGACTGATGAACGGTCGCCAAATCAGCAGCATGGAGCCTTTGCTGTTTCCCTTCTTCTCCAGCCCTGTCGATGGTTCGATAAAATTAATCCGTCCATCAGTGATAATGCGAACTTCGTCGACACTCTCCAGAGCCTTGCTGAACCATCCGACTGACATATCCTCTGGCACAAGCATAACTACCGTCTGTCGCTGTTGTATGCACTGCTCAGCGGCTTTTTCCACCCACGGCCTGATATTGCTGTACGGTGGGTTATTCCAGATTGCACCGTGGCTTACCCACTCAGAATTGAGCGCGTCGTCGGCCTCAGTTAGCCAGTGAGCACACAGAGCATTTTTGTCGCTCGCTGCAGAATCCAGCCAGAATCCAAACTCAATATCCAGTGCATCAAAAAGCCAAAGCGGCGTTTGCCAGCAGTCCTTGTCGTGTGCTGGCGTATTTGATTTGATAGTCATGCAGCCCGATCTCCCCATCTCGCTTTCCACTCCAGAGCCAGTCTCGCTTCGTCTGACCACTTAACGCCACGCTCTGTACCGAATGCCTGTATAAGCTCTAATAGCTCCGCAAATTCGCCTACACGCATCCTGCTGGTTGACTGGCCTATTACCACAAAGCCATTCCCGGCAAGGTTAGGAACAACATCCTGCTGCTTTAATGCTGCGGTAAACACACACTTCCAGCTTTCTGCATCCAGCCAGCGACCATGCCATTCAACCTGACGAGAGACGTCACCTAAGCAGGCCCATAGCTTCCTGTTTTGGTCTAAGCTGCGGTTGCGTTCCTGAATGGTTACTACGATTGGTTTGGTTGGGTCTGGAAGGATTTGCTGTACTGCGTGAATAGCGTTTTGCTGATGTGCTGGAGATCGAATTTCAAAGGTTAGTTTTTTCATGACTTCCCTCTCCCCCAAATAAAAAGGCCTGCGATTACCAGCAGGCCTGTTATTAGCTCAGTGATGTAGATGGTCATCTTTTAACTCCATATACCGCCAATACCCGTTTCATCGCGGCACTCTGGCGACACTCCTTAAAAATCAGGTTCGTGCTCATCTTTCCTTCCCGTTCTTCCTTGGTAGCAAACCGGTAATACACCGTTCGCCAGACCTTACCTTCGATAACCAGAAGACCTGCCCGTGCCATTTTAGCCGCGGCCTGATTTATGCTGGTTACTGTTGCGCCTGTTAGCGCGGCAACGTCCGGCGCACAGAAGCTATTATGCGTCCCCAGGTAATGAATAATTGCCTCTTTGCCCGTCATACACTTGCTCCTTTCAGTCCGAACTTAGCTTTGATTTCTGCGATCTTCGCCAGAGCCTGTGCACGATTTAGAGGTCTACCGCCCATGACAGGAAGTTGTTTTACTGGTTCAGGGATCGCCTCACCACGGTTAATTCTCGCAGTCATATGGACAAGCTCATCTGCGGCCTTACGGCGTAATTCCGCATCAGTAAGCGCATTGGCCCGCATGTTCTGATACAGGTTGGTAACCAGCCAGTAGTGCGCGTTTGATTTCCACGGATAAGACTCCGCATCCGGATACAGGCCTCGCTTCCGGCAATACTCGTAAACCATATCAACCAGCTCGCTGACGTTTGGCAGTCCGGCGATAACGGATGCTTCTTCCCGGCACCATGCAACAAACTGCCCGGGTGATGGCAGAAATGGTCGATTCTGCCGACGGGCTACGCGCATTCCTGCGTTAACCTGTTCCATTGTGGTGATCCCGTTTTCCCGGAAAGCCAGCACCCACTGGCGGCGGATTTCGTTCAGTTCGTTCTGGTCCCGGTTAGCCAGGCTCGCCGGGAAAGTTGCCAGTAACTGGCTGAACACACCATTGATGATCTGCGCTACCTGTTGTACCTGCGGCTTTTCGTCGTACTGTTCCGGCATGTTGTTGGCGATCCGACGCATCTGCTCACGGTCAAAGTTAACCATCTGTGCGGCGATGTTTTTCATAAATCCACCCCGTAAATCCAGTCAGTGTTTGTCAGGTCGAGTTTTGGTTTGCTGGCTGTCACGCCTGCCTGTTGCTTGTTACGGTTGATTTCGAGCTGGGTCCACTTGTCGCGGAGTTTGGCCGGACTCAGCACGTTACCGGACCAGAAGTTGTCCTGGCAGGCCCAGCGGAAAAGCACACACATATCGCGGTGGTTACGTCCGTCACGTTCACGCATCAGGCGGATATCGTTAGCCCACCCAGCAAAATTCGGTTTTCTGGCTGATGGTGCGATAGTCTTCACCATGTCAAACATCCACTCTGCGGCGGTCAGGTCTTCTGCTGTCCCCCACTTGCTGCCGCTCTGAATTGCAGCATCCGGTTTAACCACAGAAAGATCGTTTTCTGGCTGGTCAGAGGATTCGCCAGAATTCTCTGACGAATAATCTTTTCTTTTTTCTTTTGTAATAGTGTCTTTTGTGTCCCCCTGTTTTGAGGGATAGCAATCCCCCAATTTGAGGGATGTTTTATCCCTCGTTTTAGGGGATTGTCCCTCGTTTTGAGGGATACACCATTCTGAGATGTTTTTATTTGGTCCAAACATGCCGCCTTGCTGCTTGATAATATTCATTCTGACGAGTTCTAACTTGGCTTCATTGCACCGTTTGACAGGTAACTTTGTAATCTCGCTAAGTTGAGAATCGGTGATTCTGTCCATTGGTTTATTCCACCCATAGGTTTTACGCAGAATGGCAAGCAGCACTTTAAACTGTCGCTTGGTCAGATCTGCGCCCGAATAAGCCTCAAGCAGCATATTTGATAGTCTGGCGTAACCATCATCGAGATCTGCCACATTACGCTCCTGTCCGGCAAAGTTACCTCTGCCGAAGTTGAGTATTTTTGCTGTATTTGTCATAATGACTCCTGTTGATAGATCCAGTAATGACCTCAGAACTCCATCTGGATTTGTTCAGAACGCTCGGTTGCCGCCGGGCGTTTTTTATTGGTGAGAATCGCAGCAACTTGTCGCGCCAATCGAGCCATGTCGTCGTCAACGACCCCCCATTCAAGAACAGCAAGCAGCATTGAGAACTTGGGAATCCAGTCTCTCTTCCACCTGCTGATCTGCGACTTATCAACTCCCACAGCTTCCGCTGTCTTCTCAGTTCCAAGCATTGCGATTTTGTTAAGCAACGCACTCTCGATTCGTAGTGCCTCGTTGCGTTTGTTTGCACGAACCATATGTAAGTATTTCCTTAGATAACAATTGATTGAATGTATGCAAATAAATGCATACACCATAGGTGTGGTTTAATTGGATGCCCTTTTTCAGGGCGGGGATGGGTAAGAGCGGGGTTATTTATGCTGTTGTTTTTTTGTTACTCGGGAAGGGCTTTACCTCTTCCGCATAAACGCTTCCATCAGCGTTTATAGTTAAAAAAATCTTTCGGCCTGCATGAATGGCCTTGTTGATCGCGCTTTGATATACGCCGAGATCTTTAGCTGTCTTGGTTTGCCCAAAGCGCATTGCATAATCTTTCAGGGTTATGCGTTGTTCCATACAACCTCCTTAGTACATGCAACTATTATCACCGCCAGAGGTAAAATAGTCAACACGCACGGTGTTAGATATTTATCCCTTGCGGTGATAGATTTAACGTATGAGCGCAAAAAAGAAACCATTAACACAAGAGCAGCTTGAGGACGCACGTCGCCTTAAAGCTATTTATGAAAAAAAGAAAAATGAACTTGGCTTATCCCAGGAATCTGTCGCAGACAAGATGGGGATGGGGCAGTCAGGCGTTGGTGCTTTATTTAATGGCATCAATGCATTAAATGCTTATAACGCCGCATTGCTTGCAAAAATTCTCAACGTTAGCGTTGAAGAATTTAGCCCTTCAATCGCCAGAGAAATCTACGAGATGTATGAAGCGGTTAGTATGCAGCCGTCACTTAGAAGTGAGTATGAGTACCCTGTTTTTTCTCATGTTCAGGCCGGGATGTTCTCGCCTGAGCTTAGAACCTTTACCAAAGGCGATGCGGAGAAATGGGTAAGCACAACCAAAAAAGCCAGTGGCTCTGCATTCTGGCTTGAGGTTGAAGGTAATTCCATGACCGCACCAACAGGATCCAAGCCCAGCTTTCCTGACGGGATGTTAATTCTGGTTGACCCTGAGCAGGCTGTTGAGCCAGGCGATTTCTGTATAGCCAGACTTGGTGGTGATGAGTTTACCTTCAAGAAACTGATCAGGGATAGCGGTCAGGTGTTTCTACAGCCACTAAACCCACAATACCCAATGATCCCATGCAATGAGAGTTGTTCCGTTGTGGGGAAAGTTATCGCTAGTCAGTGGCCTGAAGAGACGTTTGGGTGATAGGAAGTAAGTTTTATGTTGACGGCACAGTCAACTTGGCATAGATTAATTAAACCAAGCCCAGCCCCGTTCGCAGACAATTGTTAATATCTGCATAACGGCTCTGGGCTATTTTTTTGGGACTCTTATGAAGAAAGCAGCAATTTTAATTGATGCGGGTTTTTTCATGCAGCGTGTTCATGCTACGCATCGTAAACACTTCGCCGAGCATGAACTGACTGCGCAATGCATAATGAAAGTAATATGGTCAATGGTTCTTTCCCATCTTAATGGAAAACGTCAATCACAAGAACGTAGGGAACCGCTTGAGCTTTATAGAATTTACTTCTATGACTGTCCACCACTCGACATTCAAACACGCCTTCCACTTCCTGAGCCTGGCAATAAGACGCCTGGTCGCAAGAATTTCAAACTCGAAAAATCATATATTCTGAGAACGGAGCTGCATGAAGAGTTAAGAAAAACTCGAAAAACAGCCTTAAGATTAGGGAATCTTGTTGATAATAAGCGATGGCAACTAACTACATTCTCCCTTGATGCTCTGATGAAAGGAACGAAAAAATGGGATGAACTGACAAATGATGATTTTTACTATGACATCAAACAAAAACAAGTTGATATCAAGCTAGGGATGGATATCACGACTTTAGCTTATGAAAAACTTGTTGATGTAATTGTCCTTGTTGCTGGGGACTCAGACTTTGTGCCTGCCGCCAAACACGCCAGAATTAAAGGTATTGATTTTATTCTTGATCCACTAAGACAGAATGTTACCCCATCACTGTCAGAGCACATTGATGGAGTTCAGTCATACAGCTTGATATCAGGACTTGCCGATGCTTTACATGTTGAGCCAGACCCGGCACCTGACTGGTGGGAAGATCGAAAAAAAGGCAAGCCTAGGGGAAAAAACAATAGCGGTAAACGCAGGTATGGAAACACTCAAGCTGAGTCTGCAAAGAAACATCAAAGAAATAAACGATAATCCCATCAACCCGGCCACCGAGCCGGGTTTTCTTTGCCTCACGATCGCCCCACCTAAAAACACATAACCAATTGTATTTATTTTAAAATTAATAGGTGCAACTCACTAAACAACGCAATTCTGATCTCTCGATCACCTCCCAAGCCACACAACCCTGCAAAAAATAAATCTATATAAAAAACATACAGATAACCATCTGCGGTGATAAATTATCTCTGGCGGTGTTGACACAAATACCACTAGCGGTGATACTAAACACACCAGCAGGACGCACTAACCCACATGAAGGTGATGCTCTTAAAAATTAAGCCCTGAAGAAGGGCAGCATTCAAAGCAGAAGGCTTTGGGGTGTGTGATACGAAACGAAACATTGGCCGGAAGTGCGAATCCGGATTAGCTGCAAATGAGCCAATCGTGGGGTGTTTTCGTTCAGGACTACGACTCCCACACACCACCAAAGCTAACTGACAAGAGAATCCAGATGGATGCACAAACACGCCGCCGCGAACGTCGCGCAGAGAAACAGGCTCAATGGAAAGCAGCAAATCCCCTGTTGGTTGGGGTAAGCGCAAAACCAGTTAACCGCCCTATTCTCTCGCTGAATCGCAAACCGAAATCACGAGTAGAAAGCGCACTGAATCCGATAGACCTTACGGTGCTGGCTGAATACCACGAACAGATTGAAAGCAACCTGCAACGTATTGAGCGCAAGAATCAGCGCACATGGTACAGCAAGCCTGGCGAACGCGGCATAACATGCAGAGGACGCCAGAAAATTAAAGGTAAATCTATATCACTTATTTAGAAAATGCAGATTTAGGGAACAGATAGGAGGCGTTACACCTATGGCATCTCATCCTATGGTTAGAAGGTGGTGCAAATCCTTCGTATTGAAGTATGGATTTCACAGAAGATTCATAGCATTGAGCGCAAAGATAGTGCATTGGCTGACCGGTATTTGCCGATTTTTTGAGACGATAAACCACCGTAGCAACAGTAGGTGTATACATCTCATAGTTTTTCTTTTCCTCTTCCCACTTAGAGGCTCGATTTATCTTTTCTTCAAGCTCAATAATCTTGTCCTTAGAAATCATCAAAAGCTCATTAAGTGACATTTGCTGCTGTTGGGCATCCATGAGCTTATCGACAAGTTCGTATGTTTTTTCTTTTACTGAGTAGTCTATTTGCATTTTCTGGATTTCCTTTACTGCGCCAACAGCACTCATCAGAGCACCTCCGGCACCAGAAACTGCATCTGTAATCCTACTTATTATTCCTTTTTCATCAGACATATAAATCACTCTCTTACTGTAGGGGTAAGAGGATTTTACTATTTTTCTCGCTGTAGGGGTACACGAGAACCACCGAGCCTGATGTGGTTAAAAGACAGGCATACTAATAAACACTGCACTGTGTATTTATTCCAACGAGTGAATACACGGAGCAATGTCGCTCGTAACTAAACAGGAGCCGACTTGTTCTGATTATTGGAAATCTTCTTTGCCCTCCAGTGTGAGGGCGATTTTTTATCTATGAGGATATGAATAGGTGTCAAACATCAAAAAATACATCATTGATTACGACTGGAAAGCATCAATAGAAATTGAAATCGACCATGACGTAATGACAGAGGAAAAACTTCACCAGATTAATAATTTCTGGTCAGACTCTGAATACCGACTCAATAAACACGGCTCTGTATTAAATGCTGTATTAATCATGCTGGCGCAACATGCTCTGCTTATAGCAATTTCAAGCGACTTAAATGCATATGGTGTTGTGTGTGAGTTCGACTGGAATGATGGAAATGGTCAGGAAGGATGGCCTCCAATGGATGGTAGCGAAGGAATAAGAATTACCGATATCGATACATCAGGAATATTTGATTCAGATGATATGGCTATCAAGGCCGCCTGAGTGCGGCTTTACCGCATACCAATAACGCTTCACTCGAGGCGTTTTTCGTTATGTATAAATAAGGAGCACACCATGCAATATGCCATTGCAGGGTGGCCTGTTGCTGGCTGCCCTTCCGAATCTTTACTTGAACGAATCACCCGTAAATTACGTGACGGATGGAAACGCCTTATCGACATACTTAATCAGCCAGGAGTACCCAAAAATGGATAAAAAACTTATGGCTATCCAGACAAAATTCACTATTGCCACTTTTATTGGCGATGAAAAGATGTTTCGTGAGGCCGTCGACGCTTATAAAAAATGGATATTAATACTGAAACTGAGATCAAGCAAAAGCATTCACTAACCCCCTTTCCTGTTTTCCTAATCAGCCCGGCATTTCGCGGGCGATATTTTCACAGCTATTTCAGGAGTTCGGCCATGAACGCTTATTACATTCAGGATCGTCTTGAGGCTCAGAGCTGGGCGCGTCACTACCAGCAGATAGCCCGTGAAGAGAAAGAGGCAGAACTGGCAGACGACATGGAAAAAGGCCTGCCCCAGCACCTGTTTGAATCGCTATGCATCGATCATTTGCAACGCCACGGGGCCAGCAAAAAAGCCATTACCCGTGCGTTTGATGACGATGTTGAGTTTCAGGAGCGCATGGCAGAACACATCCGGTACATGGTTGAAACCATTGCTCACCATCAGGTTGATATTGATTCAGAGGTATAAAACGGATGAGTACAGCACTCGCAACGCTGGCAGGGAAGCTGGCTGAACGTGTCGGCATGGATTCTGTCGACCCACAGGAACTGATCACCACTCTTCGCCAGACGGCATTTAAAGGCGATGCCAGCGATGCGCAGTTCATCGCATTGTTGATCGTCGCCAACCAGTACGGCCTTAATCCGTGGACGAAAGAAATTTACGCCTTCCCTGACAAGCAGAACGGCATCGTTCCGGTGGTGGGCGTTGATGGCTGGTCCCGTATCATCAATGAAAACCAGCAGTTTGATGGCATGGACTTTGAGCAGGACAATGAATCATGTACATGCCGGATTTACCGCAAGGACCGTAATCATCCGATCTGCGTTACCGAGTGGATGGATGAATGCCGCCGCGAACCATTCAAAACCCGCGAAGGCAGAGAAATCACCGGACCGTGGCAGTCGCATCCCAAACGGATGTTACGGCATAAAGCCATGATTCAGTGTGCCCGTCTGGCCTTCGGATTTGCTGGTATCTATGACAAGGATGAAGCCGAGCGCATTGTCGAAAATACCGCATACACTGCAGAACGTCAGCCGGAACGCGACATCACTCCGGTTAACGATGAAACCATGCAGGAGATTAACACTATGCTGATTGCCCTGGACAAAACATGGGATGACGACTTATTGCCGCTCTGTTCCCAGATATTTCGCCGCGACATTCGCGCATCGTCAGAACTGACACAGGCCGAAGCAATGAAAGCTCTTGGATTCCTGAAACAGAAAGCCTCTGAACAGAAGGTGGCTGCATGACACCGGACATTATCCTGCAGCGTACCGGGATCGACGTGAGAGCTGTCGAACAGGGGGATGATGCATGGCACAAATTACGGCTCGGCGTCATCACCGCTTCAGAAGTTCACAACGTGATAGCAAAGCCCCGCTCAGGAAAGAAGTGGCCTGACATGAAAATGTCCTACTTCCACACCCTGCTGGCTGAGGTTTGCACCGGTGTGGCTCCGGAAGTTAACGCTAAGGCGCTGGCCTGGGGAAAACAGTACGAGAACGACGCCAGAACCCTCTTTGAGTTCACTTCCGGCGTTAATGTTACTGAATCCCCGATCATCTATCGCGACGAAAGTATGCGCACCGCCTGCTCTCCCGATGGTTTATGCAGTGACGGCAACGGCCTTGAACTGAAATGCCCGTTTACCTCCCGGGATTTCATGAAGTTCCGGCTCGGTGGTTTCGAGGCCATAAAATCGGCTTACATGGCCCAGGTGCAGTACAGCATGTGGGTGACGCGAAAAGATGCCTGGTACTTTGCCAACTATGACCCACGAATGAAGCGTGAAGGCCTGCATTATGTCGTGGTTGAGCGGGATGAAAATTACATGGCGAGTTTTGACGAGATGGTGCCGGAGTTCATCGAAAAAATGGACGAGGCACTGGCTGAAATTGGTTTTGTATTTGGGGAGCAATGGCGATGACGCATCCTCACGATAATATCCGGGTAGGCGCGATCACTTTCGTCTACTCCATTACAAAGCGAGGCTGGGTATTTCCCGGCCTTTCTGTTATCAGAAATCCCCTGAAAGCACAGCGGCTGGCTGAGGAGATAAATAATAAACGGGGAGCTGTATGCACAAAGCATCTCCCGTTGAGTTAAGAACGAGTATCGAGATGGCACATAGCCTCGCTCAAATTGGAGTCAGGTTTGTGCCAATACCAGTAGAAACAGACGAAGAATTTCATACGTTAGCCGCATCCCTTTCACAAAAGCTGGAAATGATGGTGGCGAAAGCAGAAGCAGATGAGAGAGACCAGGTATGACAACCACTGAATGCATTTTTCTGGCAGCGGGCTTCATATTCTGTGTGCTTATGCTTGCCGACATGGGGCTTGTTCAATGACACCTCAGCAAGAAAACGCCCTTCGCAGCATTGCCCGTCAGGCTAATTCTGAAATCAAAAAAGCCAGACAGCAGTTTCCGGATAAAAACGTCGATGACATTTGCCGTAGCGTACTAAAGAAGCACCGCGAAACGGTAACGCTGATGGGATTCACACCGACTCATTTAAGCCTGGCGATCGGCATGTTGAACGGCGTCTTTAAGGAACGGTGAACATGAAAAGCAAAATCATCAGGGAGCTACAGGCTCCTTTTTTATTATTCGCATTCACCCTCAAGCGTATTAACCAACAATTCAGGGATTAATGAAAGATGGCAGACATCATTGATTCAGCATCAGAAATTGAAGAATTACAGCGCAACACAGCAATAAAAATGCGCCGCCTGAACCACCAGGCTATATCTGCCACTCATTGTTGTGAGTGTGGCGATCCGATAGATGAACGAAGACGCTTGGCCGTTCAGGGTTGTCGGACTTGTGCAAGTTGCCAGGAGGATCTGGAACTTATCAGTAAACAGAGAGGTTCGAAGTGAGCGAAATTAACTCTCAGGCACTGCGTGAGGCGGCAGTAGCAATTGAAACAGTAGCAACACCTCAAAAATTGCTGGCATTTCGTATGAAAGTCACACCTCAGGTTGTGCTGGCTCTACTGGATGAACGAGATGCATTAAATGAACGCCTAGCCGAACTGGAGGCTGATTTAGCAGGGCTGGCCGAAGACCACCAGAAAGCGACTGAGTCAATTAAGCAGGCTGATGCAGCTGTTAAGTTGGCACACGAGAAGTTTTCGGCGCTGGCGGCGGAGAATGAGCTGGCTCGTAAAGCAGTTCAGGAATTCTGCGATGTTGTTGGCGACAGCACCGAGGTTATCTGCGAGGAGATTGGGAGAGATGGTGTTCTGGTTATTTTGGAGGCCATGAAGGCAACAGGAAATATGCCAGCCACCGATGCTTTCCTGGCTGAAGTACGGGCGCAGGGGGTGGAGATGATGCGCGAACATCCATCAATCAAACTTTGCTCTTTGACGCACATATGTGATGAGTTAGCCGCCCAGCTTCGCAAAGGAGGCAACCAGTGACTGGACATGCAGCAATCCTCGACATGTGCTGTGGCAGTCGCATGTTCTGGTTAGATAAGAATGACGAACGGGCGAGATAAGCGATCGGTTAAGTGCTATAGTAATGCGCTTTTGTATTTATGGAGTGAATATGAAAAATATCCTACTGGCATCATTGTTAGTGGCATCGCCGGGTGCATTTGCAGCCAGCTTTGACTGCCAAAAGGCTTCGACAGCAATCGAACATAAAATCTGCGATAACGAACGTCTGTCAAAATTAGACGAACAGCTTAGCTCTGCCTATTCTAGTGCCCTCAAAGGAAACCCAGAGAACGCAGACACCCTAAAAATGGTTCAACGTCAGTGGGTAAATATGCGTGGAAAACTCACTGATAATAAGGCTCTGGAGCTGGCTTATCTTATCCAAATTAATGGCCTCAAAGGTTTGGGGAGTTCAGTCAGCGTAACAGCGGCCAATGACATACCCACGTCGGCGCAGAAACATTCTAAAGAGCAGGAAGATACAAGTAAGGCAGAAGCTAAGTCGGTCAAGAACGGCAATAAGCTAACCTTAGAGTCATTCCGAGCTAAATATGTAGAAGTAGATGGTGAGTATTACAGCACGACATCCATTCCTAGAGGCAGTTCGTTCTTGTTCACTTGCGCTAGTCGTATTGCTGATGACCAAGTGAATATTTGGAAGAAACAGGCAGCCAAAGAGGGCAAAATCGACCTATTCTTTGAGGTTGAGAATCACTTACACACGGCTATGTTGAACGCCAATTTTCAGAAGTTGAATTCAGACCCTGCCAAAAGAGGTATTTGTAATCTGATTAACGCAGTGCCGTAAGTAAATTTAGGGCCACAGTTGTGGCCTTAAATATTTTTTTCAGCCTTTTCTTATTTGTAATAAGCAGTACTTGGTAGTGCTTATAAAACAGAATAAAAAATATATGACTTTGGCGATTACCCAGTAAAGATATTCGAAATAAATGTAAATATCGACAATGAATAACTATCCTCGCACTCGCGGGGATTTCTTTTATCTGAACTCGCTACGGCGAGTTTTTTTTATGGAGATGATAAATGCACTTCCGAGTCACAGGTGAATGGAATGGAGAACCATTCAACAGAGTTATCGAAGCCGAGAACATCAGCGACTGCTATGACCACTGGATGCTGTGGGCGCAGATAGCACATGCAGACGTAACCAATATTCGAATTGAAGAACTGAAAGAACACCAAGCCGCCTGATGGCGGTTTTTTCTTGCGTGTAATTGCGGAGACTTTGCGATGTACTTGACACTTCAGGAGTGGAACGCACGCCAGCGACGCCCAAGAAGCCTTGAAACAGTTCGTCGATGGGTACGCGAGTGCAGGATATTCCCTCCTCCGGTTAAGGATGGAAGAGAGTATCTGTTCCACGAATCAGCGGTAAAGGTTGACTTAAATCGACCAGTAACAGGTAGCCTTTTGAAGAGGATCAGAAATGGGAAGAAGGCGAAGTCATGAGCGCCGGGATTTACCCCCTAACCTTTATATAAGAAACAATGGATATTACTGCTACAGGGACCCAAGGACGGGTAAAGAGTTTGGATTAGGCCGAGACAGGCGAATCGCAATCACTGAAGCTATACAGGCCAACATTGAGTTATTTTCAGGACACAAACACAAGCCTCTGACAGCGAGAATCAACAGTGATAATTCCGTTACGTTACATTCATGGCTTGATCGCTACGAAAAAATCCTGGCCAGCAGAGGAATCAAGCAGAAGACACTCATAAATTACATGAGCAAAATTAAAGCAATAAGGAGGGGTCTGCCTGATGCTCCACTTGAAGACATCACCACAAAAGAAATTGCGGCAATGCTCAATGGATACATAGACGAGGGCAAGGCGGCATCAGCCAAGTTAATCAGATCAACACTGAGCGATGCATTCCGAGAGGCTATAGCTGAAGGCCATATAACAACAAACCCGGTCGCAGCCACTCGCGCTGCAAAATCAGAGGTAAGGAGATCAAGACTTACGGCTGACGAATACCTGAAAATTTATCAAGCAGCAGAATCATCACCATGTTGGCTTAGACTTGCAATGGAACTGGCTGTTGTTACCGGGCAGCGAGTTGGTGATTTATGCGAAATGAAGTGGTCTGATATCGTAGATGGATATCTTTATGTCGAGCAAAGCAAAACAGGCGTAAAAATTGCCATCCCAACAACATTGCATGTTGATGCTCTCGGGATATCAATGAAGGAAACACTTGATAAATGCAAAAAGATTCTTGGCGGAGAAACCATAATTGCATCTACTCGTCGTGAACCGCTTTCATCCGGCACAGTATCAAGGTATTTTATGCGCGCACGAAAAGCATCAGGTCTTTCCTTCGAAGGGGATCCGCCTACCTTTCACGAGTTGCGCAGTTTGTCTGCAAGACTCTATGAGAAGCAGATAAGCGATAAGTTTGCTCAACATCTTCTCGGGCATAAGTCGGACACCATGGCATCACAGTATCGTGATGACAGAGGCAGGGAGTGGGACAAAATTGAAATCAAATAA
Protein sequences of DBSCAN-SWA_8 >CP029164|4030203:4084519|4079017_4079803_+|AWH71606.1|DBSCAN-SWA MSTALATLAGKLAERVGMDSVDPQELITTLRQTAFKGDASDAQFIALLIVANQYGLNPWTKEIYAFPDKQNGIVPVVGVDGWSRIINENQQFDGMDFEQDNESCTCRIYRKDRNHPICVTEWMDECRREPFKTREGREITGPWQSHPKRMLRHKAMIQCARLAFGFAGIYDKDEAERIVENTAYTAERQPERDITPVNDETMQEINTMLIALDKTWDDDLLPLCSQIFRRDIRASSELTQAEAMKALGFLKQKASEQKVAA >CP029164|4030203:4084519|4030784_4034186_-|AWH71544.1|DBSCAN-SWA MAVRISGVLKDGAGKPIQNCTIQLKARRNSTTVVVNTVASENPDEAGRYTMDVEYGQYSVSLLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSAGDAGTSAREAATHATDAAGSARAASTSAGQAASSAQSASSSAGTASAKATEASKSAAAAESSKSAAATSAAAAKTSETNAAASQQSAATSASTATTKASEAATSARDASASKEAAKSSETSAASSASSAASSATATANSAKAAKTSETNAKASETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEAAVQASAAARSASAAKTSETNAKASETRAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSAAAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGVVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGRFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCEFNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFYVDGAAFPVFAFIEDGLSISAPGADLVVNDTTYKFGATNPATECIAADVILDFKSGRGFYESHSLIVNDNLSCKKLFATDEIVARGGNQIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGWNTLRPFAIDNATGELIIGTKLSASLNGNALTATKLQTPRLVSGVEFDGSKDITLTAAHVAAFARRATDTYADADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNPSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA >CP029164|4030203:4084519|4067241_4067457_-|AWH71586.1|holin|DBSCAN-SWA MKSMDKLTTGVAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDKRKAARGG >CP029164|4030203:4084519|4040790_4043352_-|AWH71551.1|tail|DBSCAN-SWA MAEPVGDLVVDLSLDAARFDEQMARVRRHFSGTESDAKKTAAVVEQSMNRQALAAQKAGISVGQYKAAMRMLPAQFTDVATQLAGGQSPWLILLQQGGQVKDSFGGMIPMFRGLAGAINLPMVGATSLAVATGALAYAWYQGNSTLSDFNKTLVLSGNQSGLTADRMLVLSRAGQVAGLTFNQTSESLSALVKAGVSGEAQIASISQSVARFSSASGVEVDKVAEAFGKLTTDPTSGLTAMARQFHNVTAEQIAYVAQLQRSGDEAGALQAANEAATKGFDDQTRRLKENMGTLETWADRTARAFKSMWDAVLDIGRPDTAQEMLIKAEAAFKKADDIWSLRKDDYFVNDEARARYWDDREKARLALEAARKKAEQQSQQDTNAQQQSDTEASRLKYTEEAQKAYERLQTPLEKYTARQEELNKALKDGKILQADYNTLMAAAKKDYEATLKKPKQSGVKVSAGDRQEDSAHAALLTLQAELRTLERHAGANEKISQQRRDLWKAESQFAVLEEAAQRRQLSAQEKSLLAHKDETLEYKRQLAALGDKVTYQERLNALAQQADKFAQQQRAKRAAIDAKSRGLTDRQAEREATEQRLKEQYGDNPLALNNVMSEQKKTWAAEDQLRGSWMAGLKSGWSEWEESATDSMSQVKSAATQTFDGIAQNMAAMLTGSEQNWRSFTRSVLSMMTEILLKQAMVGIVGSIGSAIGGAVGGGASASGGTAIQAAAAKFHFATGGFTGTGGKYEPAGIVHRGEFVFTKEATSRIGVGNLYRLMRGYAEGGYVGGAGSPAQMRRAEGINFNQNNHVVIQNDGTNGLPGPQMMKAVYDMARKGARDEIQTQMRDGGLFSGGGR >CP029164|4030203:4084519|4073031_4073931_-|AWH71598.1|DBSCAN-SWA MTNTAKILNFGRGNFAGQERNVADLDDGYARLSNMLLEAYSGADLTKRQFKVLLAILRKTYGWNKPMDRITDSQLSEITKLPVKRCNEAKLELVRMNIIKQQGGMFGPNKNISEWCIPQNEGQSPKTRDKTSLKLGDCYPSKQGDTKDTITKEKRKDYSSENSGESSDQPENDLSVVKPDAAIQSGSKWGTAEDLTAAEWMFDMVKTIAPSARKPNFAGWANDIRLMRERDGRNHRDMCVLFRWACQDNFWSGNVLSPAKLRDKWTQLEINRNKQQAGVTASKPKLDLTNTDWIYGVDL >CP029164|4030203:4084519|4081431_4081980_+|AWH71612.1|DBSCAN-SWA MSEINSQALREAAVAIETVATPQKLLAFRMKVTPQVVLALLDERDALNERLAELEADLAGLAEDHQKATESIKQADAAVKLAHEKFSALAAENELARKAVQEFCDVVGDSTEVICEEIGRDGVLVILEAMKATGNMPATDAFLAEVRAQGVEMMREHPSIKLCSLTHICDELAAQLRKGGNQ >CP029164|4030203:4084519|4070304_4070499_-|AWH71591.1|DBSCAN-SWA MLSPSQSLQYLKGSIERASMCTEWILSRFSAYRRLPVKGMPSKSMLHMQKNARWKVWREHRLSG >CP029164|4030203:4084519|4060770_4062372_-|AWH71577.1|portal|DBSCAN-SWA MKTSTIPTLLGPDGMTSLREYAGYHGGGSGFGGQLRAWNPPGESVDAALLPNFTRGNARADDLVRNNGYAANAIQLHQDHIVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQATWDTSSSRLFRTQFRMVSPKRISNPNNTGDSRNCRAGVQLNDSGAALGYYVSEDGYPGWMPQKWTWIPRELPGGRASFIHVFEPVEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGANSKEQRDKLTGWIGEIAAYYAAAPVRLGGAKVPHLMPGDSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQMSYSTARASANESWAYFMGRRKFVASRQASQMFLCWLEEAIARRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAIDGLKEVQEAAMLIEAGLSTYEKECAKRGDDYQEIFAQQVRETMERRAAGLKPPAWAAAAFESGLRQSTEEEKSDSRAA >CP029164|4030203:4084519|4049456_4049651_-|AWH71560.1|DBSCAN-SWA MEVNKDQLADILGKSCAILLSDVNIFEEHYYIDVIRDMFDRTRNKADALVMSIAEIIKENSNDK >CP029164|4030203:4084519|4080631_4080823_+|AWH71609.1|DBSCAN-SWA MHKASPVELRTSIEMAHSLAQIGVRFVPIPVETDEEFHTLAASLSQKLEMMVAKAEADERDQV >CP029164|4030203:4084519|4044946_4045342_-|AWH71554.1|tail|DBSCAN-SWA MKHTELRAAVLDALEKHDTGATFFDGRPAVFDEEDFPAIAVYLTGAEYTGEELDSDTWQAELHIEVFLPAQVPDSELDAWMESRIYPVMSDIPALSDLITSMVASGYDYRRDDDAGLWSSADLTYVITYEM >CP029164|4030203:4084519|4043760_4044183_-|AWH71553.1|tail|DBSCAN-SWA MFLKTESFEHNGVTVTLSELSALQRIEHLDLMKRQAEQAESDSNRKFTVEDAIRTGAFVVAMSLWHNHPQKTKLPSMNEAVKQIEQEVLTTWPTEAISHAENVVYRLSGMYEFVVNDAPEQAEDAGPAVPVSAGKCSTVS >CP029164|4030203:4084519|4055181_4055373_-|AWH71569.1|DBSCAN-SWA MQEITLHEAAERAHQTEIICRLLEVYPNKITDADISALVSLLARLSGSVASFLIEEESKLVGD >CP029164|4030203:4084519|4046293_4046689_-|AWH71557.1|DBSCAN-SWA MTKDELIARLRSLGEQLNRDVSLTGTKEELALRVAELEEELDDTDDAAGQDTSVSPENALTGHENEVVSAQPDTVTDTADLVTVVALVTLHTDALHATRDEAVAFVLPGTAFRVSAGVAAEMTERGLARMQ >CP029164|4030203:4084519|4065156_4065258_-|AWH71581.1|DBSCAN-SWA MIIIITLRVLSGDPTGYGAATSRVFAIYENFPV >CP029164|4030203:4084519|4059470_4060790_-|AWH71576.1|DBSCAN-SWA MTAELRNLPHIASMAFNEPLMLEPAYARVFFCALAGQLGISRLTDAVSGDSLTAQEALATLALSGDDDGPRQARSYQVMNGIAVLPVSGTLVSRTRALQPYSGMTGYNGIIARLQQAASDPMVDGILLDMDTPGGMVAGAFDCADIIARVRDIKPVWALANDMNCSAGQLLASAASRRLVTQTARTGSIGVMMAHSNYGAALEKQGVEITLIYSGSHKVDGNPYSHLPDDVRETLQSRMDATRQMFAQKVSAYTGLSVQAVLDTEAAVYSGQEAIDAGLADELVNSTDAITVMRDALDARKSRLSGGRMTKETQSTTVSATASQADVTGVVQATEGENASAAQPDVNAQITAAVAAENSRIMGILNCEEAHGREEQARVLAETPGMTVETARRILAAAPQSAQARSDTALDRLMQGAPAPLAAGNPASDAVNDLLNTPV >CP029164|4030203:4084519|4080833_4081115_+|AWH71610.1|DBSCAN-SWA MHFSGSGLHILCAYACRHGACSMTPQQENALRSIARQANSEIKKARQQFPDKNVDDICRSVLKKHRETVTLMGFTPTHLSLAIGMLNGVFKER >CP029164|4030203:4084519|4038478_4039120_-|AWH71547.1|tail|DBSCAN-SWA MAATHTLPLASPGMARICLYGDLQRFGRRIDLRVKTGAEAIRALATQLPAFRQKLSDGWYQVRIAGRDTGETELSARLNEPLANGAVIHIVPRLAGAKSGGVFQAVLGAAVMAVAIWMPGVGIMASNLLFSLGASMTLGGVAQMLAPKPKTPSTQTTDNGKQNTYFSSLDNMVAQGNVLPVLYGEMRVGSRVVSQEISTADEGDGGQVVVIGR >CP029164|4030203:4084519|4050291_4050633_-|AWH71562.1|head|DBSCAN-SWA MATHYTELMSGTEALVTTLGIFSANKGVIPAFTPLMQEDATGALVVWDGTSAGKAVYVSAVQIDTAKKTQAQVYKTGVLNVDALNWPESVRELSAKVAAFVGSGISVQPLARV >CP029164|4030203:4084519|4034920_4038418_-|AWH71546.1|DBSCAN-SWA MGKGSSKGHTPREAKDNLKSTQLLSVIDAISEGPVEGPVDGLKSVLLNSTPVLDSEGNTNISGVTVVFRAGEQEQSPPEGFESSGSETVLGTEVKYDTPITRTITSANIDRLRFTFGVQALVETTSKGDRNPSEVRLLVQIQRNGGWVTEKDITIKGKTTSQYLASVVVDNLPPRPFSIRMRRMTPDSTTDQLQNKTLWSSYTEIIDVKQCYPNTALVGVQVDSEQFGSQQVSRNYHLRGRILQVPSNYNPQTRQYSGIWDGTFKPAYSNNMAWCLWDMLTHPRYGMGKRLGAADVDKWALYVIGQYCDQSVPDGFGGTEPRITCNAYLTTQRKAWDVLSDFCSAMRCMPVWNGQTLTFVQDRPSDKVWTYNRSNVVMPDDGAPFRYSFSALKDRHNAVEVNWIDPDNGWETATELVEDTQAIARYGRNVTKMDAFGCTSRGQAHRAGLWLIKTELLETQTVDFSVGAEGLRHVPGDVIEICDDDYAGISTGGRVLAVNSQTRTLTLDREITLPSSGTTLISLVDGSGNPVSVEVQSVTDGVKVKVSRVPDGVAGYSVWGLKLPTLRQRLFRCVSIRENDDGTYAITAVQHVPEKEAIVDNGAHFDGDLSGTVNGVTPPAVQHLTAEVSADSGEYQVLARWDTPKVVKGVSFMLRLTVTADDGSERLVSTARTTETTYRFRQLALGNYRLTVRAVNAWGQQGDPASVLFRIAAPATPSRIELTPGYFQITATPHLAVYDPTVQFEFWFSEKRITDIRQVETTARYLGTALYWIAASINIKPGHDYYFYVRSVNTVGKSAFVEAVGRASDDAEGYLDFFKGEIGKTHLAQELWTQIDNGQLAPDLAEIRTSITNVSNEITQTVNKKLEDQSAAIQQIQKVQVDTNNNLNSMWAVKLQQMQDGRLYIAGIGAGIENTPDGMQSQVLLAADRIAMVNPANGNTKPMFVGQGDQIFMNDVFLKRLTAPTITSGGNPPAFSLTPDGKLTAKNADISGSVNANAGTLNNVTVNENCTIKGMLEATQVRGDFVKAVSKSFPKQAGTWGNTETPNGTVTVTISDDHNFDRQIIIPPIIFNGIAYSDPGSGNNPGGTRYTGYGFEVRKNGVLIASRETKGAIPGSYSAVIDMPSGRGSVTLEFKVFHKGNQWAGNITDCTVIVTKKAASGISIR >CP029164|4030203:4084519|4047675_4048839_+|AWH71558.1|DBSCAN-SWA MQPEDYEEKQYEDEPESYPIDEFQLTTTPNDFNIITIISFIKSKVFKIPNFQRHYVWDIKRASKLIESLLIGLPIPQIFLYEQDKNEFLVIDGQQRLMTLYYFVNGVFPRKEKRSELRKIFEDNGNIPENILHNDEYFTKFNLKLDGLSDTQKNKFNGKNYETLNEFQTTLNLATIRNMVIKPVAQDSEDGAMFEIFNRLNSGGMNLSPQEIRMSLYHSDFLSNLVSLNENKTWRKILSKNVVDMRLSDIEAILRTFAMSLFTSQYKSSVSGFLNNFSNYAKNYDTKDIILFSNIWNEFMDSVDGIDEINFRTGGNRMSITLFESIFYAATYDSFKDKDLKIRQVTVNYIDKLKNDPEFLTFSTDKTTRREHVIGRLERARTILEGM >CP029164|4030203:4084519|4074676_4075390_+|AWH71601.1|DBSCAN-SWA MSAKKKPLTQEQLEDARRLKAIYEKKKNELGLSQESVADKMGMGQSGVGALFNGINALNAYNAALLAKILNVSVEEFSPSIAREIYEMYEAVSMQPSLRSEYEYPVFSHVQAGMFSPELRTFTKGDAEKWVSTTKKASGSAFWLEVEGNSMTAPTGSKPSFPDGMLILVDPEQAVEPGDFCIARLGGDEFTFKKLIRDSGQVFLQPLNPQYPMIPCNESCSVVGKVIASQWPEETFG >CP029164|4030203:4084519|4039017_4039761_-|AWH71548.1|tail|DBSCAN-SWA MTQTESAILAHARRCAPAESCGFVVSTPEGERYFPCVNISGEPEAYFRMSPEDWLQAEMQGEIVALVHSHPGGLPWLSEADRRLQVQSDLPWWLVCRGTIHKFRCVPHLTGRRFEHGVTDCYTLFRDAYHLAGIEMPDFHREDDWWRHGQNLYLDNLEATGLYQVPLSSAQPGDVLLCCFGSSVPNHAAIYCGDGELLHHIPEQLSKRERYTDKWQRRTHSLWRHRAWRASAFTGIYNDLVAASTFV >CP029164|4030203:4084519|4059128_4059461_-|AWH71575.1|head|DBSCAN-SWA MTSKETFTHYQPLGNSDPAHTATAPGGLSAKAPAMTPLMLDTSTRKLVAWDGTTDGAAVGILAVDADQTSTTLTFYKSGTFRYEDVLWPEAASDETKKRTAFAGTAISIV >CP029164|4030203:4084519|4077923_4078292_+|AWH71603.1|DBSCAN-SWA MSNIKKYIIDYDWKASIEIEIDHDVMTEEKLHQINNFWSDSEYRLNKHGSVLNAVLIMLAQHALLIAISSDLNAYGVVCEFDWNDGNGQEGWPPMDGSEGIRITDIDTSGIFDSDDMAIKAA >CP029164|4030203:4084519|4050642_4051683_-|AWH71563.1|capsid|DBSCAN-SWA MVDLYSPTQLVQVVNAVDVQKQLNALFTSLFFTRSVMFESRDIILDTIDDPNIPIAAFCSPMVGSKVSRDEGYESKTIRPGYMKPKSSIDPNKLAVRPAGVSPEQYNAFGARNIKVKQAIVNQAKAIRARIEWLAVQAITTGKNIIEGDGIERYELDWNIKPQNIITQSGGAEWSGKDKETFDPNDDIESYAEFSEGVTNIIIMGGNVWKKYRSFRAIKEALDTRRGSNSELETALKDLGDSVSFKGYMGDVAIVVYSGRYTDEDGTEKHFLDPDLMVLGNTALQGIVAYGGIQDPELIRMGLTKAELAPKNYIVPGDPAIEYVQTHSAPQPIPARINRFVTVRIG >CP029164|4030203:4084519|4039766_4040465_-|AWH71549.1|tail|DBSCAN-SWA MQDIRQETLNECTRAEQSASVVLWEIDLTEVGGERYFFCNEQNEKGEPVTWQGRQYQPYPIQGSGFELNGKGTSTRPTLTVSNLYGMVTGMAEDMQSLVGGTVVRRKVYARFLDAVNFVNGNSDADPEQEVISRWRIEQCSELSAVSASFVLSTPTETDGAVFPGRIMLANTCTWTYRGDECGYHGPAVADEYDQPTSDITKDKCSKCLSGCKFRNNVGNFGGFLSINKLSQ >CP029164|4030203:4084519|4083448_4084519_+|AWH71616.1|integrase|DBSCAN-SWA MGRRRSHERRDLPPNLYIRNNGYYCYRDPRTGKEFGLGRDRRIAITEAIQANIELFSGHKHKPLTARINSDNSVTLHSWLDRYEKILASRGIKQKTLINYMSKIKAIRRGLPDAPLEDITTKEIAAMLNGYIDEGKAASAKLIRSTLSDAFREAIAEGHITTNPVAATRAAKSEVRRSRLTADEYLKIYQAAESSPCWLRLAMELAVVTGQRVGDLCEMKWSDIVDGYLYVEQSKTGVKIAIPTTLHVDALGISMKETLDKCKKILGGETIIASTRREPLSSGTVSRYFMRARKASGLSFEGDPPTFHELRSLSARLYEKQISDKFAQHLLGHKSDTMASQYRDDRGREWDKIEIK >CP029164|4030203:4084519|4074375_4074576_-|AWH71600.1|DBSCAN-SWA MEQRITLKDYAMRFGQTKTAKDLGVYQSAINKAIHAGRKIFLTINADGSVYAEEVKPFPSNKKTTA >CP029164|4030203:4084519|4066744_4067242_-|AWH71585.1|DBSCAN-SWA MPPSLRKAVAAAIGGGAIAIASVLITGPSGNDGLEGVRHNPYKDIVGVWTVCHGHTGKDIIPGKTYTEAECKALLNKDLATVARQINPYIKVDIPETTRGALYSFVYNVGAGNFRTSTLLRKINQGDIKGACDQLRRWTYAGGKQWKGLMTRREIEREICLWGQQ >CP029164|4030203:4084519|4068029_4069127_+|AWH71587.1|DBSCAN-SWA MKKLTVAISAVAASVLMAMSAQAAEIYNKDSNKLDLYGKVNAKHYFSSNDADDGDTTYARLGFKGETQINDQLTGFGQWEYEFKGNRAESQGSSKDKTRLAFAGLKFGDYGSIDYGRNYGVAYDIGAWTDVLPEFGGDTWTQTDVFMTQRATGVATYRNNDFFGLVDGLNFAAQYQGKNDRSDFDNYTEGNGDGFGFSATYEYEGFGIGATYAKSDRTDTQVNAGKVLPEVFASGKNAEVWAAGLKYDANNIYLATTYSETQNMTVFADHFVANKAQNFEAVAQYQFDFGLRPSVAYLQSKGKDLGVWGDQDLVKYVDVGATYYFNKNMSTFVDYKINLLDKNDFTKALGVSTDDIVAVGLVYQF >CP029164|4030203:4084519|4058797_4059067_-|AWH72572.1|capsid|DBSCAN-SWA MYTTAQLLAANEQKFKFDPLFLRLFFRESYPFTTEKVYLSQIPGLVNMALYVSPIVSGEVIRSRGGSTSEFTPGYVKPKHLAWLSEAFV >CP029164|4030203:4084519|4052894_4054646_-|AWH71566.1|DBSCAN-SWA MKLAPNLKKQPRDRLTEIIIFAGSDAWSHAKEWQEWAGKHIAADDVPPVVLADEQLKNITDYRIIDEDRQCVRVYRAGHITEHSMTQIVTLLAVAGVKTVHEYAGITDTSPVDLSDQLPRLKEECERGESLVLNLPTKQKAQLSQMADSERAQLLAERFDGVCVHPESEIVHVWRGGVWCPISTMELSREMVAIYSEHRATFSKRVINNAVEALKVIAEPMGEPSGDLLPFANGALDLKTGEFSPHTPENWITTHNGIEYTPPAPGENIRDNAPNFHKWLEHAAGKDPRKMMRICAALYMIMANRYDWQMFIEATGDGGSGKSTFTHIATLLAGKQNTVSAEMTSLDDAGGRAQVVGSRLIVLADQPKYTGEGTGIKKITGGDPVEINPKYEKRFTTIIRAVVLATNNDPMIFTERAGGVSRRRVIFRFDNIVREDEKDKELPEKIAAEIPVIIRRLLANFADPEKARALLLEQRDGDEALAIKQQTDPVVELCAALEFLEEARGLMMGGGGDTVKYTTRNSLYRVYMAFMAYTGKGKCLSVNEFGKAMRSAAKVYGYEYITRKVKGVTQTNATTTDDCDAFL >CP029164|4030203:4084519|4081213_4081435_+|AWH71611.1|DBSCAN-SWA MADIIDSASEIEELQRNTAIKMRRLNHQAISATHCCECGDPIDERRRLAVQGCRTCASCQEDLELISKQRGSK >CP029164|4030203:4084519|4062571_4064497_-|AWH71579.1|terminase|DBSCAN-SWA MNISNSQVNRLRHFVRAGLRSLFRPEPQTAVEWADANYYLPKESAYQEGRWETLPFQRAIMNAMGSDYIREVNVVKSARVGYSKMLLGVYAYFIEHKQRNTLIWLPTDGDAENFMKTHVEPTIRDIPSLLALAPWYGKKHRDNTLTMKRFSNGRGFWCLGGKAAKNYREKSVDVAGYDELAAFDEDIEQEGSPTFLGDKRIEGSVWPKSIRGSTPKVRGTCQIERAASESPHFMRFHVACPHCGEEQYLKFGDKETPFGLKWTPDDPSSVFYLCEHNACVIRQQELDFTDARYICEKTGIWTRDGILWFSSSGEEIEPPDSVTFHIWTAYSPFTTWVQIVKDWMKTKGDTGKRKTFVNTTLGETWEAKIGERPDAEVMAERKEHYSAPVPDRVAYLTAGIDSQLDRYEMRVWGWGPGEESWLIDRQIIMGRHDDEQTLLRVDETINKTYTRRNGAEMSVSRICWDTGGIDPTIVYERSKKHGLFRVIPIKGASVYGKPVASMPRKRNKNGVYLTEIGTDTAKEQIYNRFTLTPGGDEPLPGAVHFPNNPDIFDLTEAQQLTAEEQVEKWVDGRKKILWDSKKRRNEALDCFVYALAALRISISRWQLDLSALLASLQEEDGAATNKKTLADYARALSGEDE >CP029164|4030203:4084519|4082110_4082809_+|AWH71613.1|DBSCAN-SWA MKNILLASLLVASPGAFAASFDCQKASTAIEHKICDNERLSKLDEQLSSAYSSALKGNPENADTLKMVQRQWVNMRGKLTDNKALELAYLIQINGLKGLGSSVSVTAANDIPTSAQKHSKEQEDTSKAEAKSVKNGNKLTLESFRAKYVEVDGEYYSTTSIPRGSSFLFTCASRIADDQVNIWKKQAAKEGKIDLFFEVENHLHTAMLNANFQKLNSDPAKRGICNLINAVP >CP029164|4030203:4084519|4043344_4043779_-|AWH71552.1|tail|DBSCAN-SWA MFDGELSFALKLAREMGRPDWRAMLAGMSSTEYADWHRFYSTHYFHDVLLDMHFSGLTYTVLSLFFSDPEMHPLDFSLLNRREADEEPEDDVLMQKAAGLAGGVRFGPDGNEVIPASPDVADMTEDDVMLMTVSEGIAGGVRYG >CP029164|4030203:4084519|4072333_4073035_-|AWH71597.1|DBSCAN-SWA MKNIAAQMVNFDREQMRRIANNMPEQYDEKPQVQQVAQIINGVFSQLLATFPASLANRDQNELNEIRRQWVLAFRENGITTMEQVNAGMRVARRQNRPFLPSPGQFVAWCREEASVIAGLPNVSELVDMVYEYCRKRGLYPDAESYPWKSNAHYWLVTNLYQNMRANALTDAELRRKAADELVHMTARINRGEAIPEPVKQLPVMGGRPLNRAQALAKIAEIKAKFGLKGASV >CP029164|4030203:4084519|4078371_4078641_+|AWH71604.1|DBSCAN-SWA MPLQGGLLLAALPNLYLNESPVNYVTDGNALSTYLISQEYPKMDKKLMAIQTKFTIATFIGDEKMFREAVDAYKKWILILKLRSSKSIH >CP029164|4030203:4084519|4083045_4083213_+|AWH71614.1|DBSCAN-SWA MHFRVTGEWNGEPFNRVIEAENISDCYDHWMLWAQIAHADVTNIRIEELKEHQAA >CP029164|4030203:4084519|4052362_4052605_-|AWH71565.1|DBSCAN-SWA MHTSGKLNKHIKPHYRALDMAEHWLRVAIKAIDRNAGEGYAKAHPELISAFMTTAAANFATLTEREIAEAEEVTTINIKS >CP029164|4030203:4084519|4062368_4062575_-|AWH71578.1|head,tail|DBSCAN-SWA MTRQEELAAARAALHDLMTGKRVATVQKDGRRVEFTATSVSDLKKYIAELEVQTGMTQRRRGPAGFYV >CP029164|4030203:4084519|4030203_4030785_-|AWH71543.1|tail|DBSCAN-SWA MAFRMSEQPRTITIYNLLAGTNEFIGEGDAYIPPHTGLPANSTDIAPPDIPAGFVAVFNSDEASWHLVEDHRGKTVYDVASGDALFISELGPLAENVTWLSPEGEFQKWNGTAWVKDTEAEKLFRIREAEETKNSLMQVASEHIAPLQDAVDLEIATEEEISLLEAWKKYRVLLNRVNTTTAPDIEWPVAPIG >CP029164|4030203:4084519|4076856_4077180_+|AWH72574.1|DBSCAN-SWA MDAQTRRRERRAEKQAQWKAANPLLVGVSAKPVNRPILSLNRKPKSRVESALNPIDLTVLAEYHEQIESNLQRIERKNQRTWYSKPGERGITCRGRQKIKGKSISLI >CP029164|4030203:4084519|4057429_4058665_-|AWH71574.1|DBSCAN-SWA MAGENKLSDKALKGYLGKPREKQITIADGKGLSIRVSTKGAVSFVFFYRLAGGRAAPVWLTLGKYPDMSLKQAREKRDECRGWLADKRDPRIQIKIQAEERLKPVTVEDALNYWYENYCKVRRKTHAVTLGRFRKHIFPYIGHLPVNDTHLYEWLDCFDRIKRNAPVMAAYVFSDTKLALRFCRVRQYATCDALKDLRMSDVGQIAGKRDRVLDEAELGQLWKAIFVEPDLKLMSEYTRKMFVLCTVFGCRMSEARLSEWSEWDLESWVWTVPKDHSKTGVEIVRPVPEILRQWVTDVHEETKHTGYVLGSLRIRESVSKIGGKIGKRLGHEKQWSLHDLRRTLSTHLSDLGVEFYVVEQLLGHALPGVAGVYNRSKFMAKKLDALELWTTYLNSIAGADSKVTILKQKVG >CP029164|4030203:4084519|4054959_4055181_-|AWH71568.1|DBSCAN-SWA MNTEREVFFKLLACAESSLTLNNSAKVILNMWLDCINDNEDANIAYGLLSLIDEAAEKLNDAINSALLSNKSS >CP029164|4030203:4084519|4044198_4044939_-|AWH72571.1|tail|DBSCAN-SWA MPVPNPAIPVKGAGTTLWVYKGSGDPYANPLSDVDWSRLAKVKDLTPGELTAESYDDSYLDDEDADWTATGQGQKSAGDTSFTLAWMPGEQGQQALLAWFNEGDTRAYKIRFPNGTVDVFRGWVSSIGKAVTAKEVITRTVKVTNVGRPSMAEDRSTVTATTGMTVTPASASVVKGQSTTLTVAFQPDGATDKSFRAVSADKTKATVSVSGMTITVKGVAAGKVNIPVVSGNGEFAAVAEINVTAS >CP029164|4030203:4084519|4071008_4071536_-|AWH71594.1|DBSCAN-SWA MTIKSNTPAHDKDCWQTPLWLFDALDIEFGFWLDSAASDKNALCAHWLTEADDALNSEWVSHGAIWNNPPYSNIRPWVEKAAEQCIQQRQTVVMLVPEDMSVGWFSKALESVDEVRIITDGRINFIEPSTGLEKKGNSKGSMLLIWRPFISPRRMFTTVSKAALMAIGQGVRRAA >CP029164|4030203:4084519|4073963_4074257_-|AWH71599.1|DBSCAN-SWA MVRANKRNEALRIESALLNKIAMLGTEKTAEAVGVDKSQISRWKRDWIPKFSMLLAVLEWGVVDDDMARLARQVAAILTNKKRPAATERSEQIQMEF >CP029164|4030203:4084519|4055372_4055558_-|AWH71570.1|DBSCAN-SWA MLKTFRVFARAVNPIGHTIGIAQNVKAVNVQTAIAAVRSESSEYGLSQVIISAVYELKEVH >CP029164|4030203:4084519|4083252_4083471_+|AWH71615.1|DBSCAN-SWA MYLTLQEWNARQRRPRSLETVRRWVRECRIFPPPVKDGREYLFHESAVKVDLNRPVTGSLLKRIRNGKKAKS >CP029164|4030203:4084519|4056594_4057131_+|AWH71572.1|DBSCAN-SWA MNLKKIATNTKNKITETFNKLILEASKTPTQDEIKILERRSKKFNYSFFSYAVTGAIIVFCSQPLIKYANPILILLSGLLLSIIIIILRMIYISQANASWTTKKRSHVLVHFLSACFIASTLTLLYQAYDNNITHKLYCKNIQQLIEKRIETEKNISIFSGMQCTPVYDYSLFGFNLL >CP029164|4030203:4084519|4048843_4049392_+|AWH71559.1|DBSCAN-SWA MEDMGYTSIESMFNNYKCYYDFLLTHNEISFANDYKSQFSKVMLLACASYFETLVVTKIHCMLNPSQCNLTHDFIDNKALTRQYHTLFDWKKRNANQFFSFFGPKFKEFMIEKVKSSTELTKSISDFMEIGELRNKLAHNNYATFVLESTAEEIYNKFLNAHSFVSQLDTFSTQFREQIGEQ >CP029164|4030203:4084519|4054642_4054942_-|AWH71567.1|DBSCAN-SWA MEMKNSGFIASGPARPEFMNGDIYRDKYGGTVTIKGVAERRITYRREGYSYDCVMPVYQFRRDFSLVYAAPRSKPISREKARGNIQKMKSMINAFRGKK >CP029164|4030203:4084519|4045338_4045917_-|AWH71555.1|tail|DBSCAN-SWA MAIKGLEQAVENLSRISRTAVPGAAAMAINRVASSAISQSASQVARETKVRRKLVKERAWLKRATVKNPQARIKVNRGDLPVIKLGNARIVLSRRRRRKKGQRSALKGGGSVLVVGNRRIPGAFIQQLKNGRWHVMQRVAGKNRYPIDVVKIPMAVPLTTAFKQNIERIRRERLPKELGYALQHQLRMVIKR >CP029164|4030203:4084519|4064471_4065017_-|AWH71580.1|terminase|DBSCAN-SWA MEVNKKQLADIFGASIRTIQNWQEQGMPVLRGGGKGNEVLYDSAAVIKWYAERDAEIENEKLRREVEELRQASETDLQPGTIEYERHRLTRAQADAQELKNARDSAEVVETAFCTFVLSRIAGEIASILDGIPLSVQRRFPELENRHVDFLKRDIIKAMNKAAALDELIPGLLSEYIEQSG >CP029164|4030203:4084519|4077172_4077667_-|AWH71602.1|DBSCAN-SWA MSDEKGIISRITDAVSGAGGALMSAVGAVKEIQKMQIDYSVKEKTYELVDKLMDAQQQQMSLNELLMISKDKIIELEEKINRASKWEEEKKNYEMYTPTVATVVYRLKKSANTGQPMHYLCAQCYESSVKSILQYEGFAPPSNHRMRCHRCNASYLFPKSAFSK >CP029164|4030203:4084519|4080476_4080659_+|AWH71608.1|DBSCAN-SWA MTHPHDNIRVGAITFVYSITKRGWVFPGLSVIRNPLKAQRLAEEINNKRGAVCTKHLPLS >CP029164|4030203:4084519|4055550_4056162_-|AWH71571.1|DBSCAN-SWA MPICSTLAKASSEDTRGFSPLPSAWRRAISPRMAVTINPALLSPSSLTDSIPCITSSGTLTVVSCDFAFLLAVAISETPNHRCVSVYTKKEIQKALTCVSCAHNMKHTERIVEIQRATPRSAGNTYGASNQQRKLGAVMVAVNHIPHLVHTQTAFVWRFLALSAGESQIIHVTAWTEREARSRCPSGCVAVFAAKIRQGVSHA >CP029164|4030203:4084519|4071532_4071991_-|AWH71595.1|DBSCAN-SWA MGEREVMKKLTFEIRSPAHQQNAIHAVQQILPDPTKPIVVTIQERNRSLDQNRKLWACLGDVSRQVEWHGRWLDAESWKCVFTAALKQQDVVPNLAGNGFVVIGQSTSRMRVGEFAELLELIQAFGTERGVKWSDEARLALEWKARWGDRAA >CP029164|4030203:4084519|4049896_4050280_-|AWH71561.1|DBSCAN-SWA MQNDYNDLKPIAEMMYPNPAVEELKAIADKMCLSERLVDMNQVMEITTLSRRTLLNLEASGEFPERVQVTEGRKAWYLSEVIDWINNIPRASEYCRVPVPKKPDAALCLKIERVRRNARDGRYKLIG >CP029164|4030203:4084519|4040464_4040794_-|AWH71550.1|tail|DBSCAN-SWA MKTFRWKVKPGMDVASAPSVRKVRFGDGYSQRAPAGLNANLKTYSVTLSVPREEATVLESFLEEHGGWKAFLWTPPYEWRQIKVTCAKWSSRVSMLRVEFSAEFEQVVN >CP029164|4030203:4084519|4065961_4066255_+|AWH71583.1|DBSCAN-SWA MKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHHFFVSGIGQEKTVDAAKICGGAENVVKTETQQTFVNGFLGFITLGIYTPLEARVYCSQ >CP029164|4030203:4084519|4079799_4080480_+|AWH71607.1|DBSCAN-SWA MTPDIILQRTGIDVRAVEQGDDAWHKLRLGVITASEVHNVIAKPRSGKKWPDMKMSYFHTLLAEVCTGVAPEVNAKALAWGKQYENDARTLFEFTSGVNVTESPIIYRDESMRTACSPDGLCSDGNGLELKCPFTSRDFMKFRLGGFEAIKSAYMAQVQYSMWVTRKDAWYFANYDPRMKREGLHYVVVERDENYMASFDEMVPEFIEKMDEALAEIGFVFGEQWR >CP029164|4030203:4084519|4075556_4076375_+|AWH72573.1|DBSCAN-SWA MQRVHATHRKHFAEHELTAQCIMKVIWSMVLSHLNGKRQSQERREPLELYRIYFYDCPPLDIQTRLPLPEPGNKTPGRKNFKLEKSYILRTELHEELRKTRKTALRLGNLVDNKRWQLTTFSLDALMKGTKKWDELTNDDFYYDIKQKQVDIKLGMDITTLAYEKLVDVIVLVAGDSDFVPAAKHARIKGIDFILDPLRQNVTPSLSEHIDGVQSYSLISGLADALHVEPDPAPDWWEDRKKGKPRGKNNSGKRRYGNTQAESAKKHQRNKR >CP029164|4030203:4084519|4057200_4057428_-|AWH71573.1|DBSCAN-SWA MKKMAIVDKKGLEYIPNIDRMIREKECRELTTLANSTRWKLEKEGKFPKRIKIGSTAVAYRLSEVQAWIRGEWVV >CP029164|4030203:4084519|4045928_4046282_-|AWH71556.1|tail|DBSCAN-SWA MADFDNLFDAAIARADETIRGYMGTSATMTSGEQSGAVIRGVFDDPENISYAGQGVRVEGSSPSLFVRTDDVRQLRRGDTLTIGEENFWIDRISPDDGGSCHLWLGRGVPPAVNRRR >CP029164|4030203:4084519|4069316_4069700_-|AWH71588.1|DBSCAN-SWA MRDIQMVLERWGAWAANNHEDVTWSSIAAGFKGLITSKVKSRPQCCDDDAMIICGCMARLKKNNSDLHDLLVDYYVVGMTFMSLAGKHCCSDGYIGKRLQKAEGIIEGMLMALDIRLEMDIVVNNSN >CP029164|4030203:4084519|4069785_4069926_-|AWH71589.1|DBSCAN-SWA MMFEFNMAELLRHRWGRLRLYRFPGSVLTDYRILKNYAKTLTGAGV >CP029164|4030203:4084519|4066286_4066748_-|AWH71584.1|lysis|DBSCAN-SWA MNRVTAIISALVICIIVGLSWAVNHYRDNAITYKAQRDKNARELKLANAAITDMQMRQRDVAALDAKYTKELADAKAENDALRDDVAAGRRRLHIKAVCQSVREATTASGVDNAASPRLADTAERDYFTLRERLITMQKQLEGTQKYINEQCR >CP029164|4030203:4084519|4078595_4079012_+|AWH71605.1|DBSCAN-SWA MDINTETEIKQKHSLTPFPVFLISPAFRGRYFHSYFRSSAMNAYYIQDRLEAQSWARHYQQIAREEKEAELADDMEKGLPQHLFESLCIDHLQRHGASKKAITRAFDDDVEFQERMAEHIRYMVETIAHHQVDIDSEV >CP029164|4030203:4084519|4051901_4052351_-|AWH71564.1|DBSCAN-SWA MTAQIAAYGRLVDDPQVKHTSKGTPMTLAWMAVSLPCSQADDGTATMWLSVLAFGRQADTLAKHHKGELLSVAGNMQVSQWTGQNGETRQGWQVIADSVISARTVRPGGKKDQQGQATDALNRAKQQADQQGSHPPVGDNEQWGDDIPF >CP029164|4030203:4084519|4072046_4072337_-|AWH71596.1|DBSCAN-SWA MTGKEAIIHYLGTHNSFCAPDVAALTGATVTSINQAAAKMARAGLLVIEGKVWRTVYYRFATKEEREGKMSTNLIFKECRQSAAMKRVLAVYGVKR >CP029164|4030203:4084519|4034250_4034850_-|AWH71545.1|DBSCAN-SWA MRKVCAAILSAAICLAVSGAPAWASEHQSTLSAGYLHASTNVPGSDDLNGINVKYRYEFTDTLGMVTSFSYAGDKNRQLTRYSDTRWHEDSVRNRWFSVMAGPSVRVNEWFSAYAMAGVTYSRVSTFSGDYLRVTDNKGKTHDVLTGSDDGRHSNTSLAWGAGVQFNPTESVAIDIAYEGSGSGDWRTNSFIVGVGYKF >CP029164|4030203:4084519|4070491_4070875_-|AWH71592.1|DBSCAN-SWA MGYPVAKISCEEMTMDYSQLSDFEINRMVGDIIFKGLWASKPETSGNNTNKWYYGNADTTFEPLNHLPDYCNDPSASWPIIEKYRISILDQLTEWCVDAKGVSPIFDARPLRAAMIVFLLMQEANNA >CP029164|4030203:4084519|4070835_4071012_-|AWH71593.1|DBSCAN-SWA MRRQRRSFTDIICENCKYLPTKRSRNKRKPIPKESDVKTFNYTAHLWDIRWLRYRARK >CP029164|4030203:4084519|4065405_4065600_+|AWH71582.1|DBSCAN-SWA MSTKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRLTSEKRAYTSIQSDDE >CP029164|4030203:4084519|4069922_4070285_-|AWH71590.1|DBSCAN-SWA MNTYSITLPWPPSNNRYYRHNRGRTHVSAEGQAYCDNVARIIKNAMLDIGLAMPVKIRIECHMPDRRRRDLDNLQKASFDALTKAGFWLDDAQVVDYRVVKMPVTKGGRLELTITEMGNE |
78 | Enterobacteria_phage(56.25%) | portal,terminase,holin,tail,integrase,lysis,head,capsid | attL 4036870:4036886|attR 4092465:4092481 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|