Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NC_016026 | Micavibrio aeruginosavorus ARL-13, complete sequence | 4 crisprs | PrimPol,DinG,cas3,RT,csa3,DEDDh | 0 | 2 | 3 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_016026_1 | 473637-473780 | Orphan |
NA
Consensus repeat of NC_016026_1
|
1 spacers
spacers of NC_016026_1
>1.1|473680|58|NC_016026|CRISPRCasFinder ATTGACAACGGAGTGAGTACCGACTAAATACTCGCTCCCTCGGCAGATGGAAACGTCG |
CRISPR arrays and Neighbor proteins around NC_016026_1
The CRISPR arrays of NC_016026_1 >merge|NC_016026|1|473637-473780|CRISPRCasFinder GCCAGCATCTTCGAAAAACTTTGAAAAAAGATTGAAAAAGATGATTGACAACGGAGTGAGTACCGACTAAATACTCGCTCCCTCGGCAGATGGAAACGTCGGCCAGCATCTTGGAAAAACTTTGAAAAAAGATTGAAAAAGATG >NC_016026|1|1|473637-473780|CRISPRCasFinder GCCAGCATCTTCGAAAAACTTTGAAAAAAGATTGAAAAAGATG ATTGACAACGGAGTGAGTACCGACTAAATACTCGCTCCCTCGGCAGATGGAAACGTCG GCCAGCATCTTGGAAAAACTTTGAAAAAAGATTGAAAAAGATG
>NC_016026.1|WP_014102021.1|472982_473513_+|hypothetical-protein MSGYDGLRKFFGLIVTRRTLSDPVAAPPDLSVIVGHLDAVSAFNTVPIGRDFSISENGYMQRAWDCLLAERIGKPAGVRTLFAVRRVSGFMEGRGHCADCVTTSVVSEPLFGRPMALEAVIPEMETLLRALRLGDAGCFLDQVHFNQPTLAEMQALVGAYRALAPRKTGHLALVPK >NC_016026.1|WP_014102020.1|472428_472908_+|hypothetical-protein MPALDRTAVKHRAQIESLLQQINDIHGLYAMPLAKNFQTMPDYTVHYDWMCLLADRVGEPAGVRTAFRLRWAHGWASQRMGCAPYGQLGDPAYTELGIVLPLEKIFPVMMVLQDERRRVFGIGPCTMTFERTTIEDARQLVDAYRMLCPVKKDFLKLVK >NC_016026.1|WP_014102019.1|471716_472373_+|hypothetical-protein MDWKDLGGKLRRAFGLQTQPTPEPLPPPTDRYLSARELKTLLECVQHIKDITQTNALALKSLSQATPRPLGDAERDFQDAMCAQFRPLGGDPLQNSQMALGTCVAWLAQLNQAHNIAARNETVVTPNLALPVRSGSRKFAADIGVMARGATTAMGGICASLSSAAPFLDIPNIQRIHEEYAEHGAVMNQILEKITTILTDATYGVRDQTPKQTLSLKK >NC_016026.1|WP_148260498.1|471149_471599_+|DUF1489-family-protein MISNDPIHLIKLAVGVDDVGHLHALQSSRLFDFDGALATCAWTRRKPTRDGGLLNGGSIYWIIKGRIQARQAFLGFEMEDTDEGPYCRLVLDPALMLVAAMPHRAFQGWRYLDPAKAPPDLRFFDPDLAAEDDEMPADLAAQLRDAGLI >NC_016026.1|WP_041794216.1|468310_471007_-|SLBB-domain-containing-protein MARARKFKSTRFFMQSGLCTVWVLTGLAFGAPSARAQDFLPMQIAPWSTDTRNIARVTQARTDIQAGGHQQAAPAAASPPARKAGTLTPLTPQDYEPLLADLLARDEMLARMVGHHVSPLSSIEEFYAGRVVDPLEQFGYDLFENFSAPSTQARGKQTDGEPSPTPQAALPAGAVQDNFVLSTGDRLNITFRGQRRDQGIYTITTDGLLILDDLPPVSAAGRTIGQLREALAASADSLYNTDIYVSLESVRQVNVLVVGNVRKPGRQTLTVFHTALDALMQADGIDKNGSLRQIKLVRDGRTTMVDLYGLLIHGSSGMDLALRDGDRLIVPPLGPTIAVAGGVKRPGIYEILPALKGMKHAPEKSSEFLSMQEALDMAGGLVSPGNNRFMKLGLHRNGQETVETITDPFTPALNDGSILMVARADDKRAGLVELVGHTRQPGLHPLSSSKTLAALLSDRSMFGADIYPLIGAIERWDDERMARIFLDFPPILVAQGQYDQELKDGDIVHLFSRSQMMALQKQKFNPASIEPAAGSVDETDIDPADTVTGDPALSAFLAERTISVRGAVRDSGVWPVAAGTTLDSVLAVAGGLSLEANTSNIEVTRTHDASIPDTESSIPFRTAVNMNDTDPKTVAINPGDTVRVKQKFRKAEGQSVTIIGEVNNPGKYDLVPGDTLRDLFARAGGITDQAYPDGTIFSRESERKAEEERFRAAAREMERALATALHKEKDAPDMTQIAMVQDLAAELRNVEAVGRITVEADPTVLEVQPELDILLEAGDRIYVPRRPLTVRVEGEVLSPAALQFRNGKNPRDYIAEAGGPSHFADQDRAFVLYPDGSAQPLFISAWNHKASMIPPGSTIVVPRDPKPFDFIESAKDVSQILSNLAVTGIFLSDIRDDD >NC_016026.1|WP_081463057.1|467357_468311_-|glycosyltransferase MMFSIVTITRNDLAGLHATYKSVQSQTCTDYEWIVIDGASDDGTVAYLQNLSSQTPSSPHPTLPPLQEGRKEEEEILLPPEQEVRVEEEIPLPPQRGGRLGGGRENGTGGRAILWTSEPDAGLYDAMNKGLARATGDYIIFLNGGDQFADDNVLSNLSQLIGMASTKPGFIYGDALETLPDGQTAYKAARPFIKVDLGMFTHHQAMVYARNVIGDMHYDTRYKIAADYKFTLQTLGATRAIYYVPAPFCLFAHGGLSQTRTALGRREQFDIRRELGVVGPIRNRAITTLQMINMGVRRICPPLYWALKARRDNAAIR >NC_016026.1|WP_014102015.1|466557_467403_+|glycosyltransferase-family-2-protein MPDHPVIPVSVIIVTKGEGAFLSPTLAALSGFDQIIVVDSGGDADTFSVAQSFGADTVAYTWDGAYPKKRQWCLDHVTLAHDWVFFVDADEIVTPDLTRAIADLFVSGAPDADGYFVRGRYVWGGRVLRFGLTNNKLALFNRHAFMFPVVDDLGLPGMGEIEGHYQPVAKKSGARIGVLSPMLTHDAATDPARWYERHERYAQWEAGMNARNAWPVDPVAGRHRLKRIFRALPARGVVAFLYCYVWRCGFLDGWAGFDFARARGWYYHRIAALSRRAFNAQ >NC_016026.1|WP_014102014.1|465704_466565_+|chain-length-determinant-family-protein MSTHSQTMNAPEPDLIDLLRDWWRLRGWIMAGMVAGVLAAFAFLALAVPQYRVSMLIAPADRGTGTDIKALLPDNATFALQYLASSIGAQDTTDFSRLENIMRGADVAAIVMKNKDVADGVRASRSLRISAGADIRDPAELADWMARTIKIEPVGTTTMRRVVLNHPDREWATGFLTLVHDAADRLIRNDVRTRADARSAYLQDALRRTDHPDHRRALTNLLMEQEHVRMMLAMDEAFAAVIAESPSASARAVWPRKSIVLPAFVFAGAVLFYCLGLIFGRRDRHA >NC_016026.1|WP_014102013.1|464369_465701_+|glycosyltransferase-family-4-protein MSRPSVIFFNRVYPSDRGATGRVLRDLARAMARDGWAVTVVTTASVAREDRDGDVRVIRLKSNTKSRNLFTYGAAWVRMMIAGLKLPRPDLIVTMTDPPMMVVAGGIMARARKTKHIHWCQDLYPDLLPSIGIRLPDFMMSGLSALSFNAMRRCEKIVVIGRCMARQLTKTGLDPKRIAVIPNWPDQELTRTMTDAMNEAAVEADASIPAKPFEELFKDDGAPKFRVLYAGTIGRAHPIHTIVDAAAILQHQCPDIEFVFVGDGPGLDRLAHERARRGLENIRLLPRQPNRRLRPLMESGDVHIISMKHDAAGLLVPSKLYSALAVGRPCVFVGPMNSEVAKVISDFHAGAVVAQGEPETLAQTILTLRMDGNAWHNAHDGSAQAGRIFVPSESINAWIKRARDVVGRPLTPPSAKPKVTVPTPVAQDNAVQQPPSVTIHAAE >NC_016026.1|WP_014102012.1|463081_464362_+|hypothetical-protein MFSTYRDVQCAALCAAILVYALWGEPTPPAFGWPEILVGVLLTAAVGLRSFARAVTPVRGDAHPFWFRAGQFFLLYGLSVPLVGAVIAGASPGNIVRDIVPFLFMLLPVFMVDTVRDRVRWHFIVTACVVVLGVIFAARVVAPLALASSGMDHAAQLGMNLSGQDPRRLANAPSVLFAAMALLGAAGWIITRRINANAMISAAILSGVALIPLAAMALVLQRASLGLLALGLAFWIAIGIVKKPRRMVVPLLGLAVLCLMVWAPLADVVAGLAHKNVMVGANMRWQEAAAVRDALHGIGAILFGNGWGATVQSPAVGDSIVTFTHNLGTTLWLKSGLIGVGLGLAYFGGLALALIRFLPIHPILVVALGAPMVIDYLLYASFKSLDFGLILLLAALYSARTAAVSPGGQPGNPVVFKTIPNNADFK >NC_016026.1|WP_014102022.1|479195_479549_+|DMT-family-protein MSFSLPVPIATIGLLLASNIFMTFAWYGHLKFKTTPLLIVIFVSWGIAFFEYCLQVPANRMGHAVFNAAQLKTIQECLTLLVFMGFSIWYLKEPIQWNHLLGFGLIVLAAWVIFKKW >NC_016026.1|WP_014102023.1|479945_481268_+|PAS-domain-containing-protein MKNLAYALGFREKQFANDDDRFLSLKMGALRGLSTNIMMADKDYNIVYVNDAIIEFLRALEGDIKKDFPSFNVDQLIGTNIDMFHKSPSHQRGMLDRMSGEFDTSIKVGGIVFNLHAFPVFDDNKNRVGTVVEWQDSKQMDGVSQIASIHKSMAVIEFNMDGTIITANKNFLDTVGYGLDEVKGHHHRMFMEASEADSAEYRKFWDDLRAGQYQSSEYKRVGKGGREIWIQASYNPVFDLNGRPFKVVKFATDVTKQVIAKQNAGKMIESAAVGTEELSASVKEITESMTKSRATTEKAYGIVDQADQQTNKLADAAASMGGIVELINSIAGQINLLALNATIESARAGEAGKGFAVVANEVKNLAAQAKTATDKISLEINSMRDISSNVVSSLNAIKESIETVREYVNSTASAVEEQSAVANEIASNMQRVTREVNSMV >NC_016026.1|WP_014102024.1|481330_481861_-|hypothetical-protein MTSLIQTIQETSQAMGHVSPEKRAALVDSMAQLARRLADHVSALDDIPSHYQAFTMPAINDQIAAAEKRTEKAADQILTAAESIMKSLAKMKGDAAAEIQNQANIIFEATSFQDLVTQHLNEIRLRMKELNDDMLALQNCMTSISSGSGDAPLQKTRTRKSERPDAHLLNGPTTNF >NC_016026.1|WP_014102025.1|481874_482273_-|response-regulator MDINKDMKVLIVDDHKTMLRIVRNLLSQINISNVDEATDGQSALQKLAHNKYDLVLSDWNMMPMTGLQLLQFVRTDSTYEHKNVPFIMITAESRPENVMEAKQAGVDNYIIKPFNADTLETKIKSVMTKKQR >NC_016026.1|WP_014102027.1|482572_485236_+|chemotaxis-protein-CheW MDDLVTEFITETVESLSTLDLDLVRLEQEPENKDLLGNIFRLMHTIKGTCGFIGLPRLEKTAHAAENLLDNFRNDKMDVSERAMTLLFMCIDRVRFLVSEVSKSGAEPEGNDSDIIQVIEAEIEQSLHGGEKKESAVSNRDPVPEPPVSVDISPAQTVEKGPEYLRVQMNVLEDLINMVSELVLTRNQLSQLIRMEENSNLTTPFQRLNRIVSDLQDSVMKTRMQPIGNAWSKLPRIVRDLSTEMKKKIVLEMEGEETELDRQVLEQIKDPLTHMIRNSCDHGIERPADRLDAGKKEQGCIRLRAYHEGGFIVLQISDDGKGLDPAKIAEKAIEKGLADPDKIQAMSDKQILSYIMRPGFSTAEQITNVSGRGVGMDVVRANIEKIGGSIDMESTPGKGTCFTIQIPLTLAIISALIVEIDSYRYAIPQMNIQELVSINPTDSDMIEYINDKPVLRLRDRIIPLLDSEALFDFKSDQGQKPHNEKLICVISTGSSYYGILVDQIYDTEEVVIKSVSSVLKNAGIFSGNTILGDGRVIMILDPAAIARKFNVEKAVNQIEAENIMARQARESVKERASMLVFKAGDGALKAVPLALVSRIQVFPRGEITCSADKIVVRYNNTLMQLCFIDSTTQGLNDHEVMSLVLSDDMSDASMGLIIDHVVDIIEGDLDLTTATLRPGVLGSMILSDRTVDVIDIAHFLSLSRSDWFSKMAHQSAPYANYHIERVHERLDIVETGPHTTGRSATIGELEQTAMAHRPMAEYRGQKMRLLVVDDSPFFRSMLYPILTGAGYDVTLSEDPLHAIRLHDDGHMFDIVLSDIEMPHMDGYEFVERMRDDSSWKDVPFIAITSHNTREDIEYGYKKGFNKYIGKFDKDELIRSLVSIRNNE >NC_016026.1|WP_014102028.1|485257_485740_+|chemotaxis-protein-CheW MNNAKPNIVADTFKILILNIGNHYFGAPIESIQDVIQRNPTTPVPLTPPNIIGLLNLRGHIVTEIDVAYTLGIHNRDWLAGNNGYSIVINRGGEMYSLVFEGIGDVVDVMDSSIEKLPDTINRKWFSISRGVCRMGDKLVVLLDFNLMIDHLTPEPANMV >NC_016026.1|WP_014102029.1|485746_486895_+|chemotaxis-response-regulator-protein-glutamate-methylesterase MDISPVRVMLVDDSVVVRGLLRNIIEKHNDLDIVAAAADGQTALRDYRTHRPDIVLMDVEMPHMDGLSALREILVHDPDARVIMCSSLTQAGAETTYQALHIGAVDCLAKPSSKSIDRGLTFEQELLLKLRTLGRNGAKRKAVSITARSSGVPELVSLSTPYMHKLGGDVVLRRMPDHLPPNFPLALAIGASTGGPKALVEFLTSVDKNIMLPIFITQHIPPGFSRFLAENIERKTGFPAHEAEEGMLVSPGHVYIAPGQKHMGVQKGIPKRITLTDGPPVNFCKPSVDVMLDSLEHAYGGHLLTVILTGMGADGHQSSRRMVVDGTHNILIAQDEESSVVWGMPGAVAKDGICHAVLPLSRIGAAVNKLVRRESIGDHHAN >NC_016026.1|WP_014102030.1|486884_487700_+|protein-glutamate-O-methyltransferase-CheR MQIDDIEFHFFRNFLKESSGYHLTDDKRYLLESRLEDVLRSWKLNDHRAIISSIRNDYSSKMATDVIEAMTINETFFFRDQIPFDVFENQLLDRLAESAVANRVRIWSAACSTGQEPYSVAMIATEKRSVYPKLLCEVVGTDINSRVLSRARQGVFSDIEVHRGLPDHYRDKYFTRDGSNWKINDDIRAQVHFRQMNLKGDYDVEGPFDFVLLRNVLIYFDTALKENILRRIADRMRPGGYLLLGAAEGIYDLNHHFQRCPDIKGLYEYRG >NC_016026.1|WP_014102031.1|487734_489120_-|homospermidine-synthase MIGFGSIGRGMVPLLERHFKFDRDRFVIIDPEDIYRPVVEGLGIRFIHAELSLENYRDILTPLLRNGEGVGFCVNVSVDTSSRDIMRLCRELDCHYIDTVAEPWAGYYDNKNAHPGDRTNYDLRDDIIEEKALARGTRTAVSCCGANPGMVSWFVKQAVLNIARDTNTPFTEPQTREDWARLMQTLGIKGVHIAERDTQRARMPKEMNVFVNTWSVEGFVSEGYQPAELGWGTHENWMPPNASTHSFGCQAAIYLNQPGAATRVRTWCPTPGAQFGFLVTHNEAISISDYFTVGSGPKPEYRPTCHYAYHPCNNAILSWHELFGRDGKMQPRMHILGESEIVDGRDELGVLLFGHAKNAYWYGSRLTIEETRALAPYQNATALQVTSAIIAGMVWALENPMAGIVETEEMDHRRCLEVQSPYLGTVEGHYTDWTPLDNRPGLFPEDIDTSDPWQFRNILVH >NC_016026.1|WP_014102032.1|489286_489985_-|hypothetical-protein MTIKTATNDNAPLRSVFENANLRRAGALSTLGLAAAILSGCATPGLNTETCMGTDYNGIGFYGVGQSKWDKDCANAEFAETLLRKKNDPVGNALGFLLYLDQLPGARAQLEARLGQKGALRIEPETVATLLASPDVSRYTGVQLYAKMNEDDRATVNSLLQKRGIDPKVALTLNDDDRRAMIQAQEQATSRAEQDVAKTQATPPTTPENTPKQQCKPVVTNGRQIRFVCGGQ |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_016026_2 | 677987-678152 | Orphan |
NA
Consensus repeat of NC_016026_2
|
3 spacers
spacers of NC_016026_2
>2.1|678009|23|NC_016026|CRT GGAGCGGCTTTCTTAGCGGTTGC >2.2|678054|20|NC_016026|CRT ACTGTCTTCTTAGCAGCCGG >2.3|678096|35|NC_016026|CRT GGAGCAGCTTTTTTCACCGTCTTTTTCACTGCCGG |
DinG |
CRISPR arrays and Neighbor proteins around NC_016026_2
The CRISPR arrays of NC_016026_2 >merge|NC_016026|2|677987-678152|CRT TTTTTTAGCAGCAGCTTTTTTCGGAGCGGCTTTCTTAGCGGTTGCTTTTTTCGGAGCAGCTTTTTTCACTGTCTTCTTAGCAGCCGGCTTTTTAGCAGCAGCTTTTTTCGGAGCAGCTTTTTTCACCGTCTTTTTCACTGCCGGTTTTTTAGCAGCTACTTTTTTC >NC_016026|2|1|677987-678152|CRT TTTTTTAGCAGCAGCTTTTTTC GGAGCGGCTTTCTTAGCGGTTGC TTTTTTCGGAGCAGCTTTTTTC ACTGTCTTCTTAGCAGCCGG CTTTTTAGCAGCAGCTTTTTTC GGAGCAGCTTTTTTCACCGTCTTTTTCACTGCCGG TTTTTTAGCAGCTACTTTTTTC
>NC_016026.1|WP_014102240.1|674980_677785_-|ATP-dependent-DNA-helicase MSGHVAEQTQKRARVSMPAAAVITVTARTTTALSPDGEIKSYPHDQARMIFHKRPVIVCHAPYTRHRLGTDDLIAYDVLELFAFVHPAKFCVPTPFGIATALGIHVGDNSEDAPLALHESVRVLLSDLRACESDQEIRANLLGIADAMGAQGKGWVWTPYIFSALNAEYDPERQILTRTALNIWKNLPEWSEGAPPPPAGHDSVSGEEARERLRSMLTQGRSAEPRPQQMDYTGQIASIFAPAQNADQPHVLLAEAGTGVGKTLGYLAPSSVWAEKNGGPVWVSTYTRNLQRQIGQELERLYPDPMVRDRAVAIRKGRENYLCLLNYEDLANAAALARHATQIIAAGLMARWISATKDGDLSGGDFPGWLPALLGYGGTTSLADKRGECIHSACDHYHRCFVERSIRKARHADIVVANHALVMINTALAGTATPDDQPHRYVFDEGHHLFDAADGAFSGNLSARETADLRRWIFGQEGGKRSRARGLKRRMEDLIAGDGAAEEDLEKIIQAAHSLPGPGWSKRFKDNAAFGATEQFLALVYQQVYARADGRDGPYSLECTTLPLIDGLGDAARLLKSRLNDLRTPMMVLAGRLRRRLGEQSATLDTDTRRRLESVAASLQRRGEVTIGAWIAMLDTLESGNADDQFVDWMEIERIEGQAIDVGLYRHWVDPMIPFAAAMKTQAHGIAITSATLRDGTGDEEEDWRVAIERTGALSLTPSPHRFAVSSPYNYADQTRVYIITDVRKDDLDQVAGAYQALFEAAGGGGLGLFTAIQRLRAVRDRIAPKLEDNGIALYAQHVDEMDTGTLVDIFRADTHACLLGTDAVRDGVDVPGDSLRMLVYDRVPWTRPTILHKARREAFGKRRYDELLTRLKLRQAFGRLVRRADDRGVFVMLDSMLPTRLHGAFPEGVTIERVGLADAVKGIKEFWSTDSVG >NC_016026.1|WP_148260412.1|674814_675000_+|hypothetical-protein MEFIYDEQSTPIALLTPELKAEFERAGLFRMLDPYYQYPQTTSEPASRVRHLALVINPPNP >NC_016026.1|WP_014102238.1|674380_674626_-|hypothetical-protein MPKKSLKAAFNAACNRFFELAKEWENARSVPWDAVDDARQAVLNARDDLQGAQLTPDEWDDVIQDIIAVSDTHERLKPPSP >NC_016026.1|WP_014102237.1|674056_674389_+|hypothetical-protein MTASIMCVVFRPAHDKNIGDVSRMMPVIEQVIADCDGLNFSEDVYDSSAFRFFLPSGANVYDADRVVQCLNRLNDHDGGALLEAEQVSVPEKSVTLTRYPGIERNIVRLG >NC_016026.1|WP_014102236.1|673499_673925_+|ATP-synthase-F1-subunit-epsilon MADATTSNDVLTFELVSPERKLMSGTAYRVTIPGVEGDFGVLAGHASVLSTVRMGVVEILESASAAPVRIFITGGFADVTPVNCTLLAEEAVNVNDLDAAKLEQDIRNLSDDLSVAKDAFEKSKLQRRLDVTRAKLKAVAA >NC_016026.1|WP_014102235.1|673236_673488_+|hypothetical-protein MTTAPAHSSPMLQQVQTILRGMGYAPVAGDLGMYMRPIGSTPSISGTFYIVSQDDVQKVGELPVNEPVPHAVFVAQKAQYLFR >NC_016026.1|WP_014102234.1|671812_673234_+|F0F1-ATP-synthase-subunit-beta MTQAKGTITQILGAVLDVQFEEGNVPAILNALTTQNEGKTLVLEVAQHLGENTVRCIAMDTTDGLVRGQEVLDTGDAISVPVGPEVLGRILDVIGNPIDNLPAPSAKKRYPIHRPAPAFVDQSTEAEQLVTGIKVVDLLCPYLKGGKIGLFGGAGVGKTVTIQELINNIAKGHGGVSVFAGVGERTREGNDLYHEMMDAGVIKLDGESKVGLVFGQMNEPPGARARVALTGLSMAEYFRDEEGQDVLFFMDNVFRFTQAGAEVSALLGRIPSAVGYQPTLATDMGALQERITSTNKGSITSVQAVYVPADDLTDPAPATTFSHLDATTVLSRQIAELGIYPAVDPLDSTSRILDPRIVGEEHYKCAADVQKTLQTYKALQDIIAILGMDELSEEDKLIVARARKIQRFLSQPFHVAEVFTGSPGKFVQLEDTIKGFRAIVDGKYDHLPESAFYMVGTIEEAEEKAKKMAAEAA >NC_016026.1|WP_041794270.1|671380_671713_+|hypothetical-protein MAVAMQALERAERHIDAVLDPKAKASRDVKVKAFAELVRVLDVMQDAIATAPFLIARTAQDVVDDAFYRAFGLMNKEGIDEDEVRAHVHHFTSSTGGRMISPDFSRRFDA >NC_016026.1|WP_014102232.1|670315_671227_+|F0F1-ATP-synthase-subunit-gamma MPSLKEYRNRIASVKSTRKITSAMKMVAASKLKKAQEQAEASQPYAHAMAGMMSRVAKGVVVGPNSPKLLIGTGSDQVHMIVVVSSDRGLCGGFNGNLVRRVRNEVRGLLNAGKTVKLVCVGRKARDILRREFPKHITHSFTGLAGKNRIGFAEADEVSQYILSQFDAGEFDVCTLMYNEFKSVLTQRPVGAQLIPFRLPEVEAANQNVDAAEADKGATSPYSFEPDEAEILSALLPKNLSIQIFGALLDSAAGEQAARMTAMDNATRNAGEMIKKLSLQYNRARQAYITKELIEIISGAEAL >NC_016026.1|WP_014102231.1|668690_670241_+|F0F1-ATP-synthase-subunit-alpha MEIRAAEISEILKKQIAEFDAQADVAEIGQVLSVGDGVARVYGLDQVRAGEMVEFPGGIKGMALNLEADNVGVVIFGDDRSIKEGDIVKRTGEIVQVPVGKGLLGRVVDGLGNPIDGKGPIKNAEMRRVEVKAPGIIPRKSVHEPMQSGLKAIDALVPVGRGQRELIIGDRQTGKTAVALDTIINQKVINKSANEKDHLYCIYVAVGQKRSTVAQLVRQLEESGAMEYSIVVAATASDPAPMQFMAPYTGCTMGEFFRDNGMHALCVYDDLSKQAVAYRQMSLLLRRPPGREAYPGDVFYIHSRLLERAAKMNDEHGAGSLTALPVIETQAGDVSAYIPTNVISITDGQIFLETGLFFKGIRPAINVGLSVSRVGSAAQIKAMKQVAGTIKLELAQYREMEAFAQFASDLDASTQKLLARGARLTQLLVQPQYQPMPVEEQVLVIFAGTKGFLDSVPVASVREYERRLLEDVRANGKHILDAIRTEKALSDKLQKDLSDYLSQFGKGFEAVEKKAA >NC_016026.1|WP_014102243.1|678529_679234_-|hypothetical-protein MMPFLTPKTPVTPQSGQSGNALWFILLAIALLTALTIAITKTGDNVQQAGETERATVEATRIMRDGKAMQTAIQQMLARGGSENDICFDSDDWATNDYDFAACADAENRVFDPAGAGLGMPKTTATQKIIYTGSLAIDGVGTSAPDLVFILSGVGKADCLRINRMMKIDDASGNPPPISAVVSYTPFTGTYTAGNTVTAPQILQKSAGCVGGNGSDADELDQDFYHYYHVLIAR >NC_016026.1|WP_014102244.1|679192_680845_+|lysine--tRNA-ligase MGCNRGFGGQKGHHLRGYPGVEKGDRACIFVSHKRQNQDLKQLYHGDLKMGQNPAQNSETTVTTAGNPRLAKQVKLDALKAAGIDPYPHVFPRTHQNGTLQDMYKDLPNGTETDDHVAVAGRIMAIRNNGMFLDLMDPSGKMQVFCHKDSMSEEALSILDYFDIGDIIGAEGTVRRTPRGELSVRAKKVTMLTKSLMPLPEKYHGLTDVEQRYRQRYLDLIMNDESRQKLLMRSKIISTIRKFMEEHGAIEVETPMMHPILGGASAKPFVTHHNALDADFFLRIAPELYLKRLIVGGLADAVFEINRNFRNEGISYKHNPEFTMIESYHAYKDYYDVMDLIEKLVQAVAMAVHGTLEINFQGNVINLGSPWARKGMVELVQEETGVDFMSMDAAQAHAEAKKLGVHVDPKANWGQVVETIFGEKVEHKLIQPIHVIDHPLDISPLSKVHRNNPRLVERFESYINGWEMANAFTELNDPKIQHDRFMDQVAQREGGNEEAMMVDHDFVTALEYGLPPTGGWGMGIDRLTMIMTDSHNIREVIAFPTLKPEK >NC_016026.1|WP_014102245.1|680972_681251_+|type-II-toxin-antitoxin-system-HicA-family-toxin MPTLKHNDMVDILLHDGWKCVGQTGSHEQFKHDAKPNVVTVTNHGPKDIPCGTVRSILKTAGLDNVLKQLQHGASIKQLSKQMAKEMRAHLA >NC_016026.1|WP_014102246.1|681262_681568_+|type-II-toxin-antitoxin-system-HicB-family-antitoxin MSKSYVALIRKEDNTEYWIDIPDVPGCASCGETIDAAIANFEDALQFHLQGMKESGVFLQDPRSVQDVLRSEEDPFIESYMVEIDDMTPHLKFSFSRLSIV >NC_016026.1|WP_014102247.1|681539_682151_-|nitroreductase MTTITPQPATVQPDAIEFLLRRRSCKIKTLAAPGPDDQQLAIILQIAARVPDHGKLAPWSFVTFTGNARADFGKILAQAWKQDNPDAEPAKLDLESERFLRAPVVVAVLSHVREGKIPAWEQILSAGAACQNLILAATMMGFGAQWVTEWYATNNTVRTALGLKTDQDQVAGFIYLGTPSETPEERPRPEMDTIVTQWTGVKS >NC_016026.1|WP_041794273.1|682238_683018_+|hypothetical-protein MKYNLFENPFFQDKRRVALLVAAGVVLGGIVAVATMPDRVTNKSPVTHVLQPVPAGMMTLRGEGTLPDLHVAQQNDTVLKDMVTNFAAAGAVGLLSSVNDLDNRIMVLLFRWGGVDNINPDSYGGGMDGRIVALLQKAGQVPADVRPDMVIKADEVVRLTQRWNNGFNHFKIRLLAQAAGPEVFDGQIRYDVRSDRLDVTGGLSPAFMAQFARAVRDNPQSASIMAQFLDFIDSTRGFANLSEDDQDAIMALSAGPQGE >NC_016026.1|WP_014102249.1|683020_683290_+|hypothetical-protein MQIIKVWLKITAVLAVLGWVLHWASPDFAAWLDGVFSKKETAAVVEQVASPGSADVIPPSDAGAIRPGQQTDGQGDAKRVPRAPYQFNQ >NC_016026.1|WP_014102250.1|683304_683985_-|response-regulator-transcription-factor MRLLLIEDDELLSQFIAAGLHQAGYESDCAYTADEALALVRTQSYDLIITDLGLPDQDGLSLLKKIREHNKNIPVLILTARQGVDDKVKGLDLGADDYLPKPFEMPELTARVRALLRRPAQALDAVITVGNLALDTNAHTASVINAPMKLTRREIDLLEQLMRNSGKVVSKELIESRLYSYGEQGSSNSIEVLVHRLRKKLEDAGADVQIATLRGLGYVLAERTEE >NC_016026.1|WP_014102251.1|684000_684375_-|CBS-domain-containing-protein MYRYKTQYVAVMDRGDLVGIFTYASYLGNVLRSGKEAENTPLDEVMNTAPAPVDAEQSCRDVFQTVCQNGFPYVPVEQNGRFLGLVSDDILRLELSRELNAMRKKIGFSFFSPDDGSAMGGARP >NC_016026.1|WP_014102252.1|684899_686378_-|aminopeptidase MKIQSLGPYYEDQIKTIKALAKGSSLWKAEAEALLPRITQSYKRADADALLATLSDMQRLAYIVAAGLEHESSFRADLKDMPASGWAAFTAPPVEDKLNLSLAKKLYNVKPNNNDVATMRLGDTSRAIGSYLVQWCLRDKVPFSVYFQDSDFHALLLNHATPDGVKALAADYMRMVDGVNKSMIVRANTPNRKIVHAHPDKAKIYDHETAPFFQKAGTGEVFYTLTCIPTENDSKIDGITYNDYIKLFFEMCDQPWDAISDAHLKLIQEFNIATHVRITNNDGTDVSMELVDDDGSHFTFCNSLIAKNVPGSEIFSAPRKNSVNGVVVAKGKFTHGGALIEDLTMEFKNGELVKYEAKAGLDAFKRAVEMDEGARFVGELGIGTNPHLKQHVANGLLVEKIGGSFHLALGRPYSYTEYQGVQVKVDNGGRSKLHWDITTMLYGKDGIIYLDGRKVMENGLWIDPQYDVLNRGWAAIPRKDRPAYWKNYDPKL |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_016026_3 | 1544833-1544926 | Orphan |
NA
Consensus repeat of NC_016026_3
|
1 spacers
spacers of NC_016026_3
>3.1|1544865|30|NC_016026|CRISPRCasFinder TGTCATAAAAAAGGCCGCAATTTCTGGAAG |
CRISPR arrays and Neighbor proteins around NC_016026_3
The CRISPR arrays of NC_016026_3 >merge|NC_016026|3|1544833-1544926|CRISPRCasFinder AAATGGCGCGCCCGTTCGAACAACGTTCGAACTGTCATAAAAAAGGCCGCAATTTCTGGAAGAAATGGCGCGCCCGTTCGAACAAAGTTCGAAC >NC_016026|3|2|1544833-1544926|CRISPRCasFinder AAATGGCGCGCCCGTTCGAACAACGTTCGAAC TGTCATAAAAAAGGCCGCAATTTCTGGAAG AAATGGCGCGCCCGTTCGAACAAAGTTCGAAC
>NC_016026.1|WP_014103092.1|1543911_1544802_+|LysR-family-transcriptional-regulator MDKLANMQAFAMVGQTGSFAEAARRLNLAHSVVSKRIKDLEDYLGAQLLMRTTRKVSLTDAGYAYLDHVRKFLDEMDEIEGALRHKAQKPVGTIKLTAPLSFGLQYLGPAIASYLAQYPDVTVKTYLSDRRVDLVEEGYDLAIRVGALSDSSLIAKKLGACRRVVCATPAYFKQHGTPQTPDDLRNHNCLSYINLAEGKSWPFMVDGHKTWQPVTGNFLSDNGDLLYQAALANGGITLLPTFIVGDALNDGRLVPVLESYEETDFDIHAVYQHTRHLSAKIRTLIDHFGKVFGAGF >NC_016026.1|WP_014103091.1|1543106_1543805_-|pirin-family-protein MLRKRASGERGPTQTGWLSSKHSFSFGHYYDADHMGFGPLRVMNEDRVTPAAGFGTHPHANMEIISYVLDGELAHKDSMGNGSVIRPGDIQLMSAGTGVRHSEFNNSKDRGVHFLQIWIMPNVENATPSYQQQTFDPADMENKFRVVISPDGADNSLRVNQDARMMAGKFKKGSFSDVPTSSGRRYWLQMARGTANVNSVNLESGDGLAIMDEDKIFVQATSDAEILILDLP >NC_016026.1|WP_014103090.1|1542385_1543069_-|hydrolase MTAATNKLLTPDNSVFVFIDHQPQMAFGVTSIDRQLLKNNTIAMAKTAKLFNIPTILTAVETESFSGYIWPELMDVLQQEPIERTSMNSWEDKAFVDAVKKTGRKKLVMAALWTEACLIFPTICALDEGFEVIMNVDASGGTSKDAHDAAIRRGEQHGAESISTVQLLLEMQRDWSRKETYQGTTDIVREHFGAYGMGIDYAASMVHDYGQRAKFPHNVKKSGNKAA >NC_016026.1|WP_081463094.1|1540246_1542256_-|amidohydrolase MKSDFLIHKLTRRNFMQNTAMLGAGAVLGGAIGLLPKGAGAMTNAQIDTVFFNGKITTLDKDNPDVTAIALAGGMVAATGSDKEMRALAGPLARMIDLQGKRVIPGLNDSHTHLIRGGLNYNMELRWDGVASLSDALAMLKIQADRTPAPQWVRVVGGWSEFQFRERRMPTLAEINAVSPDTPVFILHLYDSALLNGAALRAVGLTKDSQDPPGGKIQRDANGNPTGMLIAEPNAMILYSTLAKGPTLPLSDQINSTRHFMRELNRLGITSVIDAGGGFQNYPDDYNVINKLHNDGHMTVRIAYNLFTQNKGVELADFQRWSGMVKPYSGDGFLRHNGAGEMLVFSAADFEDFLQPRPDMAANMEKELGAVVRHLIEQRWPFRLHATYDETISRALDVFEQVNADTPFNGVRWFFDHAETVSDKSLERIKKLNGGVAVQHRMAFQGEYFVDRYGAEAAKHSPPIRKMLDMGVPVGGGTDATRVASFNPFVSLYWLITGKTVGGLSLYGDENRLDREEALRLWTLGSAYKSNEETVKGALVPGMYADLAVLSHDFMTVPDEAIKDTVSIMTVVGGKIVYAADEFKSFDAPLPPVSPDWSPVRHFGGYQSGALEGAVKVASACAVHGCGHSHHHHASGHDGILSRWLGLDGSGNKPSFENPWAIGCGCFAY >NC_016026.1|WP_014103088.1|1538513_1540145_-|MFS-transporter MTKETKISAFSPFHHRVFAVLWGATLISNIGTWMFNVTSGWLMTDLAPSPLMVSLVQAATALPIFLFAIPAGAFGDLFDRRRLLIITQILSAVALFIFAGLLWVGAVGAWTLLFFTFLTGAMSAFAMPAWQAIVPRLVPKNELAPAIALNGVSVNIARAIGPALGGFILVAMGAVATVVLDAVSFLVIVAALLWWKTTTPQTTPNVPRERLVGAMQAGVRFSIRSLPLRHTLVRAFAFFVFASAYWALLPLLAKDVLQGGPGLYGILLTALGAGAIAGTFMLAPLKTKIGPNRILALASGLTALGMVVMAYGGTEIAGIAGAFIGGIGWILAVSSLNVSAQLSLPDWVRARGLAVFQMVFFGAMTLGSIVWGHVAGLVGLSETLAVAAAMIVVGIPLTWRFHLNRGEGEDYTPSHHWPEPMLVAPVDHDRGPVLVTLEYRIDDADREKFYHLMTELGDIRRRDGAIQWGFFEDVEDHGRFIEMFTAESWADHLRHHDRVTESDRVLQHKIHELHKGGKVKTMHAVMPGLSGGSLKKIPKNHKD >NC_016026.1|WP_014103087.1|1537561_1538494_-|ring-cleaving-dioxygenase MTHASGIHHITAIASDPKTNYDFYTKLLGLRFIKKTVNFDDPSTYHFYFGDKVGSPGTILTFFPYPGTPQGRPGLGQAVEVTFAIPKTAFSFWLDRFHQKGIQYQGPEDRFGDKVLRISDPDGLMLEFVGVDDLPSENVWTTDEISADVAIRGFHSVTLWVQGYEKTAALLNEHLGFHAVGNEESRFRFTTGKKGLGQTVDLRCLPEIWSGAPGAGTIHHVAWRIGGDKEEGHVRAALARQGLNLTPVIDRNYFHSVYFREPNGVLFELATDNPGFAVDEPVDTLGQDLKLPAQYEQHREAIVAVLPPLE >NC_016026.1|WP_014103086.1|1536928_1537555_-|alpha/beta-hydrolase MNFPESDFKHIFLPGDVEKPVLLLLHGTGGDENDLVPLGQAVAPDHAILSVRGRVLENGMPRFFRRLAEGVFDLDDLKFRTDELADFITAARDEYEIGKRPLVALGYSNGANIAASLFLKRPEVLDGAMLLRAMVPFEPDELPNLSGKKILMLTGMMDLIIPLDNSKKLAGMLADAGADLDFRAKPMAHGLGQSDLADMQNWFPAAFR >NC_016026.1|WP_081463073.1|1534785_1536747_-|type-II/IV-secretion-system-protein MRFSKDTGPEIMDDANKKAPRKPGADLELDDGAFALALDDDYIEAETVEVVTRDIAPQPSAQQPAPNAGSTAGASAPSGTRDLAVQNTGGVSEDLNRGRMGDRLVAMGIITEDQLNVALQEKKVTGKMLGSVLVDLGFIDEDLLSGFLAESSGFDVFDPKNTIFSGDALAMIDKATAKKHQLLPISIDDKEAAVAMCDPYDVMAMDTLRRFLPKNITIKPLVTTPKIIMEAIDAAYGYASSIAAILKELEEGEPTDLSTLSEDEAYSHPIVRLVNALVYDAVKIGASDLHFEPEENFVRLRYRLDGVLFTAQILHKQHWNGISQRLKIMSHMNIADKLSPQDGRFGLNIGGKLADFRVSSLPTVHGENIVLRVLDQSSNIIPLEQLGFSPHNLEKIRRAQARPEGIIIVTGPTGSGKTTSLYSMLNEINTVEVNIQTLEDPVEYSLPMIRQTPIREGVLEFADGIRALLRQDPDIIFLGEIRDGITAEKALQASMTGHQVYSTLHTNDSFGAIPRLLDLGLKPGMIAGAIVAVFAQRLVRKVCPHCREAYQPGPDECAILNVDPANPPTIYKAHQGGCQMCAGQGYKGRISIAEILLFDDELDEVIAQNGSKAELKRKAYEKGFKNMKDDGILKVLEGITTLESLATAVDVYK >NC_016026.1|WP_014103084.1|1533509_1534730_-|type-II-secretion-system-F-family-protein MAADRYKYRAINNKGRPVRGVISAANEVDLYNQLQSAGLELIQCQSLTKKKGMLSDLRAPKISTRDLIQLFMHMEQMQGAGVALLDALADIRDTTEHDRLRDVLSEVHRDVSDGSALSEAMGHHPKTFGSLYISLIAAGEETGDLTAAYRHLIKYLKWVDQMQAKVRKATRYPTILVVVVIATIVVMMSFVVPQIVGFIRNLDQELPWYTTSLMATSDFFVKYWWGVLATPPILFVVYKALVKSSEDFAYRMDRLFLEMPVAGPLIRKINIARFAQTFGAMFASGIDVLSALRAARNTVKNLALVEALEGVEEQVAAGSPLSEAFNASGEFPSMVVRMLKVGEESGNLTVVLDQVAEFYTNDVDEAVQGLIAMIEPFLTMFLGVMIMWIAVAVFGPIYASFENIDF >NC_016026.1|WP_148260541.1|1531739_1533458_-|hypothetical-protein MISDDALYVYDVGGKVRLVDTVPWATRDFEQTVSGLIRRECGGKSVMIVNDMTDQLFKGGQRIPKVGPMDKANVVARKLAVAFPNYPIRGALALKDVGPRKTGATAAKAGGGLYLFAAVPMSEPVQKTIGAVKTSMSSIAGFTLLPVESSDMVRTLAEKAAKREKTKSRWAVLIGQHQSGGLRQVITRDGQLAMTRMTPVTDLSTDPGAWVSEVAQEFKATISYLSRFGYSAGDGTEVFVITTPQAGEMLRQRIDVPCNIHNYTVGEAARELGFSIGIQENQYHADPLHASWIGRKSRLILPMVATDINKIYGPRQAATFAGLLLFCGAAYLGWQLAGNAQAWFTAKDDLVSQQRLRINVNQEYEIEVARMNALGVDIKLIQSSLETYKTLEAESLRPLPILRKVGEALGSELRLDTMKIERVVPKLPDDPYVVAEMTEDQKAPTLKASLQLSFPGTVDPLVAKREVDDLQARLRTALPGYDVAIPRPVGDLVYEEVEGGTGVPGAQGGVQAPLEDHVAELVITGPVQFHEPDVPVEDAPADTPADAPTDSPVDTNTEGQTAPDMTYEGAEQ >NC_016026.1|WP_014103095.1|1545422_1546658_+|hypothetical-protein MTRNVSTRRGASFSRFACNAAVFALAGLSAALINGAPADARTGGTMPPALVVTNEPLPNELQSKIYQQPRRAPVIDTQQVMGSQYWDEGSETIVSRKIDDLRKELFGLQGNVSGLSDRLNQLSLAGQNHSAEYYANVATISTQLQSGTTPGNPRLIKRLSVARNSLEQLAGNVASLNDLAVEISNAASMAGFLLESARTTYSLSGAVEEDHVRLSQLEDSISNTVVAIDRMLNTVNDDITRTVAYLSTERNNLRTLSLGITTGDLFGKSLGTRPFSGAPQTSMNGAPQAAPMGDDMAGGYVQPVSQPAPLASARPLVKIRFDKANVNYEQPVYMAVNEALQRYPNARFELVAVQPTGGNSAEATIESTRARRNAEKVLRSLTEMGVSLDRIDLSNLQSNEATTSEVHLYVR >NC_016026.1|WP_014103096.1|1547070_1547850_+|PRC-barrel-domain-containing-protein MRSLLFSIAFAVVTLIVLTLAFPAKPKAQEAPSPATSLFSSTTETSVMRINAHRGRAEEARYNNAQSIQNLLGQDVLDGRGQAVALVHDVIIVNGDDRNDNEAEFLILSDGTQFGMPGRMVALDYDDAVKSEPRRESLKRIDTSDSDDVVAFDYSFGPGMNDTRLLRLNEISVRNTIGTPIYGTQMDQVGMVADVTLKNGRADLIVFVPTPVMGMGIDPVALFYDQTLIISAADGHNAFQLSPEQNEALNSYRDALRTF >NC_016026.1|WP_014103097.1|1548053_1548803_+|electron-transfer-flavoprotein-subunit-beta/FixA-family-protein MKILVPVKRVIDAYVTIRVKADGTGVETANVKMSMNPFCEIAVEEAVRMKEAGKATEIVVVSAGPANVQETMRTAMAMGADRGIHIQTDEDIQPLAMAKLLKAVVEKEQPGLVLMGKQAIDGDNNQTGQMLAGLLNWAQGTFASKVELNGDHAIVTREIDGGLETLKLKMPCVVTTDLRLNEPRYAALPNIMKAKKKPLDTTTPADLGVTIEHKLKTLKVSEPPKRAAGIKVADVAELVSKLKNEAKVL >NC_016026.1|WP_014103098.1|1548865_1549801_+|electron-transfer-flavoprotein-subunit-alpha MTILVVAEHDNQTLNHATLCTIAAAQKLGSDIHVLVAGSGSASVADAVSKAAGVTKVLHADDAAYARELAENMGNLIAKIGGAYSHILAPASFFGKNILPRAAALLDVQQISDIVAIESADTFVRPVYAGNALATVQVTGSPIVVTVRPTAFDAVAETGGAGAVEALASAGDSGLSSFVGQEVTKSERPDLQTAKVVVSGGRGLGSGENYEKIITPLADKLGAALGASRAAVDAGYVPNDYQVGQTGKVVAPQLYIAVGISGAIQHLAGMKDSKVIVAINKDADAPIFQIADYGLVADLFEAVPELEKALG >NC_016026.1|WP_041793921.1|1549960_1550824_+|GNAT-family-N-acetyltransferase MSNPLLSAIAGSDKVSVRLAKTPAEIEAAQRLRYSIFYDEFGAKPDDTVAATKLDADKYDPVADHIIVVDTSGDAEKIVGTYRLIRKEPADSVGGFYTSNEYDISALQSCGMSILELGRSCVLPDYRTRPVLQLLWQGIANYVMVDHQIELLFGCASFHGTDPDKISEQLSYLYHYHLAPPGLRPTALPDRFVKMDLHPKESLNPKKIFNELPPLIKGYLRVGSMVGDGAVIDEQFNTIDVCIVLQTHLVTSRYKKHYERKTGQNMPIPEELAGQTDADAEALFRRD >NC_016026.1|WP_148260446.1|1550820_1551669_+|1-acyl-sn-glycerol-3-phosphate-acyltransferase MKDKNTMTRSLIAVIKALMFILWSLLVAPLQFVFLLFNRGPAAYILPHIWQRGVCRILGLRVVVEGTPDTARQVMFVSNHLSYLDIPVIASVLKASFIAKKDVSSWPVFGFLSTLQQTAFISRDRKDAKVEKNNLSSMIAAGKSLILFPEGTSTDGCDVVKFKSSLFSLAADPTTGAFLPVQPISLVMDRVDGRVPADGPNDVRDVYAWHGDMTMGPHLWNFVKSRGATIRLIFHPVLDPQVYNDRKLLAEAAWNQVRGGVAGPSLSAPATASTLAAAAIGG >NC_016026.1|WP_014103101.1|1551717_1551957_+|DUF3126-family-protein MSQAQAKLKMTGEESSKIQKFLEKTLKTPGLALRARPQAADSVEVLVNGEYVGLIHKDLDEGETSYIFTMTILDIDLDE >NC_016026.1|WP_014103102.1|1552165_1553020_+|hypothetical-protein MILFSGNKQSKSGPVSARSVIRMALTVCGAAFIAATISSAAQAQPASCDPAYWESMKQRGMLEAQREVQQNQNLIFKADSVLELTCFDRQLQALAQQAISLFSETTRWGVILSPTSMDAALNNLVATGLMNYIANNSFAHTYGGGRFPGDYTMQSSFGGSTVYNCNTMALVWEAAKCYNFAEESKDGFFTLADFVEPRVGVGHQGFTCSGDGRLGNMRSVASNSGDQYQTETYNSYAGLFASDSCSAPIPTGVRVSRVNMNPYNEHICINPGCFYNLSTCTNTP >NC_016026.1|WP_014103103.1|1553035_1553404_-|hypothetical-protein MTDSHMDPAALRRKQVLDDALTQLRQTRDQLDPALLARVRALIGDRTLLDLMEPVSDDRPSLPPGVKAWNPAADIKPEPAKPGYEAIDRRRNLQTIRLFLELQPQNKSVQTKVRTLMSEFFN >NC_016026.1|WP_014103104.1|1553569_1554859_+|3-deoxy-D-manno-octulosonic-acid-transferase MERIYRTLMRAGTPALRLLLATRVKRGKEDPARLNERMGVAGHARPDGPLVWFHAASVGEAQSTLILITALLDAHPDLNILVTTGTVTSAELMKNRLPPRAIHQFYPLDHPIWVERFVDHWQPDLVLWMESELWPNMLGTIRARNIPAVLVNARLSPRSMRRWKRMRSVITPMLSTFTTILTQTDEHAANYRALGATHVITTDNLKFASLPLPYNATDLHALKDAIGARPVWLYASTHDGEEGLACTLHRKLFIDFPELLTIIVPRHPERRAVIATTVQAEHLRVCMRGPNKALPSMDDDIYVADTLGELGLFYRLAPISCIGRSFSRDGGGGHNPVEAAQLGSAVLYGPMVQNQQALYDEMRDYGAAIALADPDMFAETLRDLMRNQGRLIEQQNRGQNFAREKNAVLDRVMAAITPLVPTPKKSDAA |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_016026_4 | 2270437-2270781 | Orphan |
NA
Consensus repeat of NC_016026_4
|
5 spacers
spacers of NC_016026_4
>4.1|2270461|51|NC_016026|CRISPRCasFinder AGGCAGAGACCAGGGATTGGGTGGCGATATAGCCGCTGGCAAGGCTGGCGT >4.2|2270536|27|NC_016026|CRISPRCasFinder CGCTGGATGGGGCGGCGGATCATCCAC >4.3|2270587|30|NC_016026|CRISPRCasFinder CGATGGAACCAACAACGGATCAACCACCAG >4.4|2270641|54|NC_016026|CRISPRCasFinder TGCGACATATGCCTGCGGCACATCGACCAACGGCGGGGCCTGTGGCGCCACACT >4.5|2270719|39|NC_016026|CRISPRCasFinder CGGCGACACGGTGACCCAGGGCAGTGCCGGGTCCGGCGG |
CRISPR arrays and Neighbor proteins around NC_016026_4
The CRISPR arrays of NC_016026_4 >merge|NC_016026|4|2270437-2270781|CRISPRCasFinder CCGGGGCGGCCCCGGTGGCGGCGGAGGCAGAGACCAGGGATTGGGTGGCGATATAGCCGCTGGCAAGGCTGGCGTTTATGGCGGCGGTGGTGGCGGCGACGCTGGATGGGGCGGCGGATCATCCACCTATGGCGGCGCAGGCGGCAGCGGCGATGGAACCAACAACGGATCAACCACCAGCTATGGCGGCGCAGGCGGCGCGGATGCGACATATGCCTGCGGCACATCGACCAACGGCGGGGCCTGTGGCGCCACACTGAATGGCGGCGGTGGCGGCGGCATCGGCGACACGGTGACCCAGGGCAGTGCCGGGTCCGGCGGCACAGGCGGCGCAGCCGCCAATGG >NC_016026|4|3|2270437-2270781|CRISPRCasFinder CCGGGGCGGCCCCGGTGGCGGCGG AGGCAGAGACCAGGGATTGGGTGGCGATATAGCCGCTGGCAAGGCTGGCGT TTATGGCGGCGGTGGTGGCGGCGA CGCTGGATGGGGCGGCGGATCATCCAC CTATGGCGGCGCAGGCGGCAGCGG CGATGGAACCAACAACGGATCAACCACCAG CTATGGCGGCGCAGGCGGCGCGGA TGCGACATATGCCTGCGGCACATCGACCAACGGCGGGGCCTGTGGCGCCACACT GAATGGCGGCGGTGGCGGCGGCAT CGGCGACACGGTGACCCAGGGCAGTGCCGGGTCCGGCGG CACAGGCGGCGCAGCCGCCAATGG
>NC_016026.1|WP_014103783.1|2267655_2268855_+|hypothetical-protein MKQRIKNIRKNTHLMAITSLMACAVLTGTLAKPAHAACSSPTALAGTLEWFSGTTEFKYCDGTNWLSMAGGTVTWVQSGSNIYYNTGNVAIGTTNSQGLKLAVNGGLRLADSGTACNATYKGVMRYSAAKNIEFCNGTSWKALAGPTIETCSVQEYTTPGSHSYTVLPGCEDLAIETYGAGGGGGYSTYGGGGGGSSRVQDESNTIIALGGGGGGGAGDSSAQGGGGGGGYGKKIVTLSAGDNLLVVVGEGGESGCGTNGGTGGNPDGGTFGNNSNGGNSTYGGGGGGDGGGYRGGASTYGGGGGGGDGVNNDGSTTDYGGAGGADAQYLCGTSTYGGPCGGEKSGGGGGSGIGDLVLRGLNGNSFQGGPAANNGPGQGATDSSSCARGGNGKVVIRPF >NC_016026.1|WP_014103782.1|2266499_2267636_+|hypothetical-protein MIAIAGFCALVPNAAHAACTSPAKGEGAIEWFTADQKFKYCDNTNWVVIGTGGSWVVSGSDIYYTTDMVGINRSNPAVMLDVGGSVRIGDNSVTCNTAREGAIRYSSVNTVDFCDGTSWKSFSGITPSTCPTTEYTTPGSYTYTVTAGCTNLILESYGAGGGGGWASYAGGGGGSSRIEYPASTIVSLGGGGGGGGGDAGGPGGGGGGGYGKKLMTLSVGNVLNIYVGQGGQNGCSNNGGAGGNPSGGTLGNSSNGGNSTYGGGGGGDGGYRGGTSTYGGGGGGGDGVDNNGSTTTYGGAGGADVAYLCGTSTYGGPCGGSKSGGGGGYGIGDVALQGSGGNSSAGGTAAGGGTGTGGAPASACNRGGDGKVVIRPQQ >NC_016026.1|WP_148260473.1|2266234_2266465_-|hypothetical-protein MEGTDAAASALKTWARPDTENPLLPEIQNGLLNSPKFSRPTIVMMLNLPAADLFYAIVLNKVLPVKSAVRTGLHIQ >NC_016026.1|WP_014103780.1|2265786_2266209_-|holo-ACP-synthase MIIGTGSDLIDIRRIEKTLARFGDRFILRCFTETERAKAESRRGAGTHIATYAKRFAAKEACSKALGTGFAEGVFMRDIGVVNDSFGRPTLHLTGGAAKRLAAMVPAGMRPVIHLTLTDEPPLGMAHVMIEARPQGEPDI >NC_016026.1|WP_014103779.1|2264866_2265712_-|signal-peptidase-I MSDQTPADQQANQKSAPKSPPLSASEEWSEFIKTAMIAVVLALLIRTFLYEPFNIPSGSMKPTLEVGDYLFVSKPAYGYSRYSFPFGLAPIEGRVWAKAPERGDVAVFKLPTNPRIDYIKRIVGMPGDTVQVIDGRLYINRQIVPRESVGLKRVDEDGSIVVMTEYLETLPNGVVHSIYEEGDDHPLDNTPEYTVPDGHYFAMGDNRDNSQDSRVMNHVGFIPYENIVGRASFLFFSTNGSASLAEVWKWPGAIRYSRLLMSVEPVKVEAPAAASTPVAAD >NC_016026.1|WP_081463086.1|2264162_2264867_-|ribonuclease-III MPMAGAPDMDQAMNDLQDRLNHRFSNPDLLRAALTHSSTGAAVNYERLEFLGDRVMGLALARFLFDIFPHENEGDLARRHAALVSGSTLARVAKGINLGDALHLSHAERAAGGAENDNILSDVVEAMIGALYLDAGLDPCMSAIQSLWGDLLQADLTPPRDPKTALQEWAQGQGHPLPRYTMIERSGPDHAPIFTVSVFVEGFDEVAEQGTSRRAAEKAAATRLLNIIEKDNRS >NC_016026.1|WP_014103777.1|2263242_2264166_-|GTPase-Era MTERCGFVAIIGAPNAGKSTLINRMVGAKVSIVNRKVQTTRINVRGIVMMDDDATQIILIDTPGIFSPKRRLDRAMVAAAWNGEADADITALLIDASKEGFDKDTRALLDTIEKRVKDGAVGDRKIILLLNKIDQMPADQLLKISAELNDRIPFTATFMISGLKGRGVQDVLDWISKNIPEGPHHYPGDQLSDLPERLLAAEITREKIYDNLHQELPYAATVETETWESFDDGSVKISQIIYLAREAHKPIILGKGGSRLKTIGMQSRKELESLLECRVHLKLFVKVKENWMDDPDRYSVWGLDPGA >NC_016026.1|WP_014103776.1|2262391_2262928_-|gcrA-cell-cycle-regulator-family-protein MSWTEERVSLLKQLWGEGKSAAEIAKALGGGLTRNAVIGKAHRLKLSNRVSPIQQNSKTPDAAPIAVKATVRVVEETAAPVRAAARVAIAIPQAANNGKGVSMVELKDRMCRWPVGDPKDSNFHFCGCSSEAGLPYCGAHAKIAYQAPSRSRQLNAEDFEREGSAVHAEEELKDVVRA >NC_016026.1|WP_014103775.1|2261641_2262214_+|CspA-family-cold-shock-protein MSHFVGNSEDGFQTDTLPAVRAKLKWFNGPKGFGFVVPDGEDIDAFLHVTTLQRAGATALGDGADLMCRIKRGPRGAMVTEVTEILDLGALPETAMPTSAPRMPQSGGPSISDHAGPEKGVTMDGTVKWYKPEKGFGFIIPEDQAKDVFIHKACLERHGLMGLEPGQRVRMQVRAVAKGREVIDFELMDG >NC_016026.1|WP_041794603.1|2259469_2261191_-|long-chain-fatty-acid--CoA-ligase MSVETISPASSSAQYPWLSHYPQGLNWGCDIDMGPVPAMLDKTVAAHGAWPGIDFMGKVWSWADIGAQVDALAKAFQDMGVVKGTRIGMFLPNCPTFIVGYYAALKAGATVVNFNPLYTPRELKHQIEDSGTTIMLTLDLQMLHQKMDEMLKTSSLQKLVVARFTDILPFPKSLLFPIFKAKDKAKIAPSDKIVWLHEITAGGGKPAPVSIDPMNDIAVLQYTGGTTGTPKGAALTHANVTANAHQCSLWLGGHGGDGQQRMMGVLPFFHVFAMTAVMNFSVRSAFEIIIPAPRFELDITLKAIDKKKPHYFPAVPAIYNGINNHPKLAEFDLKSLRYCISGGAPLPVEVKKAFERNTGCVVVEGYGLTESAPVVCVNPIVGANKAGSIGMPVPGTIVEIVSTEDGVSLVKQGERGELCVRGPQVMKGYWNKPEETDLVLKGGRLHTGDVATMDQEGYVYIVDRIKDLIITNGYNVYPRNVEEAIYLHTGVEECIVAGVPDDERGEAVKAWIKPKAGVTLTEKDMLAFLADKISKIEMPRHMEIRETPLPKTMIGKLSRKDILAEEKAKREAA >NC_016026.1|WP_014103785.1|2270856_2271453_-|DUF2062-domain-containing-protein MLFRRRTKLHPIKRLREILWPSMGWGRTWDYIRHRMFRRSDSSYSITAGLAAGVAVSFSPIMGTHIVQAAGVALVTRANVFAGAIGTLFGNPTTFPMIWWASYQLGAFIIGLFWDVRMVELPDHITFAFLMAHPYKIFLPMMVGGYTLALVSWPVAYLICYWPVKQMQKAYHAERLQKLRDKILHREHAAREKGDNTD >NC_016026.1|WP_014103786.1|2271473_2273645_-|bifunctional-(p)ppGpp-synthetase/guanosine-3',5'-bis(diphosphate)-3'-pyrophosphohydrolase MSLSSDLVDQIKTYNPDIDPALIERAIEYARVKHDGQVRASGEPYYTHPVEVAAILADMKMDPATIVTAILHDTLEDTDATMEELKKLFGDDVANMVNGVSKLSRIEGQTVEGKQAENFRKLVLAMSDDIRVLLVKLADRLHNMRTIHHIAKPEKQRRIARETLEIYAPLAERIGIHQIKEELEDRAFGVMNPEARESITNRLSYLRQEGTDMADTIIKALSKTLKDAGINGVVLGREKTRYSIWRKMQRKNVSFEQLSDIMAFRVLVDNVEQCYHVLGIIHSQYPTVPGRFKDYISTPKPNGYRSIHTTVIGPENQRIEVQIRTKDMNEEADLGVAAHWAYKGGASKADMKDARQFRWLRELLDLIENEQRPEEFLENTKLELFQDQVFVFTPKGDLMELPNGSTPVDFAYAIHSNVGDRCTGAKINGRIAPLNTKLQNGDQVDIITAKNQTPSPTWERFVATGKARSHIRRYVRQQQRDEYATLGRAMLQKVFQAEGYEYSEKGLAGILNQFRGAEVVDDILAGIGQGNFVARDVFRAIFPSHKAAPARKPNEMDVAEAGVTGRKAESSSRPMPIKGLIPGMAVHFARCCHPLPGDRIVGIVTTGKGVTIHTIDCETLENFADTPERWLDVSWGDGPDSPESHIGRIDVTIANVAGALGTISTVIGKNGGNITNLKITNRSLDFWDMILDVYVNDIKHLNNIIAALRATPQIASVQRSRGR >NC_016026.1|WP_014103787.1|2273755_2274172_-|DNA-directed-RNA-polymerase-subunit-omega MARVTVEDCVEKVANRFELVMLAAQRARKIGSGAALTLDRDNDKNPVVALREIAEETVGVEDLKEELIRNNQRVIEMDDSEDIIDQMDGEEEWNALAAQSAAMDLDRDSDDDDDFGDDDGEPSLEDLAGGVPDGDDDL >NC_016026.1|WP_081463095.1|2274554_2275193_+|NYN-domain-containing-protein MPFYPEEKLALFIDGSNLYAAARALEFDIDYRLLLKWAADQGRLVRALYYTALIEDQEYSPIRPLVDWLDYNGYTMVTKPTKEFVDAQGRRKIKGNMDIELAIDMMEMADNVDHIMLFSGDGDFRRLIEAVQRKGVRVTVVSSIKTSPPMVADELRRQADHFLELEMLANAIQRAGGPRTAANAQPATDGMDDEDEDDNFGNALPPSILGAE >NC_016026.1|WP_014103789.1|2275269_2276610_-|acetyl-CoA-carboxylase-biotin-carboxylase-subunit MFKKILIANRGEIALRIIRACREMGIQTVAVHSTADANAMAVRLADESVCIGPAPSRESYLNIPAILTAATVTGAEAIHPGYGFLSENEQFARMVEEHGFVFIGPKPEHIATMGDKVMAKKTVKALGLPVVPGSEGALESVEEGLAFAKEAGYPVLIKAASGGGGKGMKVVRSPEEFQEAYSTARSEAKANFGDDTVYVEKYLEKPRHIEIQVFGDTHGNAIHLGERDCSTQRRHQKLVEEAPSPVLSAEERDQIGSLAAEVIRKMGYRGAGTIEFLYENGQFFFMEMNTRIQVEHPVTEMITGIDLIAEQIRVAAGEPLSVNKDRIHLRGHAIEIRINAEDPDTFMPSPGTITQFHAPGGLGVRFDSAIYGGYRIPPYYDSMVGKLIVHGRNRDECIRRLRRAITETVVEGVKTTLPLQLWISEQPEFTSGEYNIHWLEKKLAER >NC_016026.1|WP_014103790.1|2276632_2277085_-|acetyl-CoA-carboxylase-biotin-carboxyl-carrier-protein MKIDEKAIRKLAELLDETHLTEIEVAEGEQVIRVARGGAVFSGSAPMPVSMASDPTIPQAANLSAPSTVAGNHPGAVVSPMVGTAYLQAEPGAPSFVQKGATVKAGDTLLIIEAMKVMNPIKAQKGGVVTQIAIENGQPVEYGDVLMVIE >NC_016026.1|WP_014103791.1|2277180_2277615_-|type-II-3-dehydroquinate-dehydratase MKKILVLNGPNLNMLGRREPDIYGTTTLGDIEALCRAAGAKAGHEIDFRQSNHEGVLVDWIQEVAHDPDLVGVVINAAAYTHTSVAIHDALKILHVPVVEVHLSDPSTREPFRHISYVEPVASAVFKGMGPQGYLLGIEHLLSN >NC_016026.1|WP_014103792.1|2277615_2278110_-|copper-chaperone-PCu(A)C MKMRSAALMALMLSVLSSAAYADVIVHDAYSFATVSGTKTGAVFLTVGADAADRLIGAETPVTKRAELHTHEDDNGVMKMRKTDGFDVGADAGLTLKPGGHHIMLLDLPQPLVKDQTFPLTLVFEKAGKVETTVLVRAAGDVPADHDHGHDHGAGHDDHAGHAH >NC_016026.1|WP_014103793.1|2278120_2278765_-|DsbA-family-protein MWGLGGFAILLVFLATSMTMRTLSWKKTQSHITQAVAGQVMHAAEQGDPVIRIVALTRYGSCDPCMQAHQALTQALADEAAGAGDIQVIVQPVPLSDPHNQRLARLALAAGLQDKFAPFHDALMNYDGALTDDVVKTLAMDAGVDFDRLNADMNDPRVDDALNAGRALMDAVKPPALPSFVFNDHLVFAPPKDGVFKSSDFLAFFNHVRTTPSR >NC_016026.1|WP_014103794.1|2278816_2279599_-|thioredoxin-domain-containing-protein MTLSSKLRLSAVAVSVLAVSVAGFALVPHAATKDVFTADQKAALNDIIYDYLMENPQVIMEAVAKHQVDQEQAQVDAMKELIVTKKDALFNDAGKPVAGNPKGTVVIAEFYDYNCGYCKHAFNDMAQILESDKDVKFVMIDFPILSEGSHMAAKYALAAGKQGKYFEMHSKLMKMSGQLREEQVQAMGKDLGLDVEQMKKDAESADVAKQIESNIALARELGISGTPGFIINETPVRGYLGLEGMQSIIAEERAKLAKKD |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NC_016026_4 | 4.2|2270536|27|NC_016026|CRISPRCasFinder | 2270536-2270562 | 27 | NC_013859 | Azospirillum sp. B510 plasmid pAB510e, complete sequence | 316910-316936 | 3 | 0.889 |
NC_016026_4 | 4.2|2270536|27|NC_016026|CRISPRCasFinder | 2270536-2270562 | 27 | NC_048876 | Gordonia phage Secretariat, complete genome | 19921-19947 | 4 | 0.852 |
NC_016026_4 | 4.2|2270536|27|NC_016026|CRISPRCasFinder | 2270536-2270562 | 27 | NZ_CP019586 | Sinorhizobium meliloti strain CCMM B554 (FSM-MA) plasmid pSymB, complete sequence | 670717-670743 | 5 | 0.815 |
NC_016026_4 | 4.2|2270536|27|NC_016026|CRISPRCasFinder | 2270536-2270562 | 27 | NC_019849 | Sinorhizobium meliloti GR4 plasmid pRmeGR4d, complete sequence | 1104891-1104917 | 5 | 0.815 |
NC_016026_3 | 3.1|1544865|30|NC_016026|CRISPRCasFinder | 1544865-1544894 | 30 | NZ_CP040048 | Acinetobacter baumannii strain VB1190 plasmid unnamed1, complete sequence | 388870-388899 | 6 | 0.8 |
NC_016026_4 | 4.2|2270536|27|NC_016026|CRISPRCasFinder | 2270536-2270562 | 27 | NZ_CP019586 | Sinorhizobium meliloti strain CCMM B554 (FSM-MA) plasmid pSymB, complete sequence | 1319803-1319829 | 6 | 0.778 |
NC_016026_4 | 4.2|2270536|27|NC_016026|CRISPRCasFinder | 2270536-2270562 | 27 | NC_019849 | Sinorhizobium meliloti GR4 plasmid pRmeGR4d, complete sequence | 383926-383952 | 6 | 0.778 |
NC_016026_4 | 4.2|2270536|27|NC_016026|CRISPRCasFinder | 2270536-2270562 | 27 | NC_014818 | Asticcacaulis excentricus CB 48 plasmid pASTEX01, complete sequence | 92671-92697 | 6 | 0.778 |
NC_016026_4 | 4.2|2270536|27|NC_016026|CRISPRCasFinder | 2270536-2270562 | 27 | NC_003078 | Sinorhizobium meliloti 1021 plasmid pSymB, complete sequence | 1357624-1357650 | 6 | 0.778 |
NC_016026_4 | 4.2|2270536|27|NC_016026|CRISPRCasFinder | 2270536-2270562 | 27 | NZ_CP021799 | Sinorhizobium meliloti strain USDA1106 plasmid psymB, complete sequence | 1423847-1423873 | 6 | 0.778 |
NC_016026_4 | 4.2|2270536|27|NC_016026|CRISPRCasFinder | 2270536-2270562 | 27 | NC_016624 | Azospirillum lipoferum 4B plasmid AZO_p5, complete sequence | 212382-212408 | 6 | 0.778 |
NC_016026_4 | 4.2|2270536|27|NC_016026|CRISPRCasFinder | 2270536-2270562 | 27 | NZ_CP021828 | Sinorhizobium meliloti strain KH35c plasmid psymB, complete sequence | 922579-922605 | 6 | 0.778 |
NC_016026_4 | 4.2|2270536|27|NC_016026|CRISPRCasFinder | 2270536-2270562 | 27 | NZ_CP021823 | Sinorhizobium meliloti strain KH46 plasmid psymB, complete sequence | 1043647-1043673 | 6 | 0.778 |
NC_016026_4 | 4.2|2270536|27|NC_016026|CRISPRCasFinder | 2270536-2270562 | 27 | NZ_CP021795 | Sinorhizobium meliloti strain USDA1157 plasmid psymB, complete sequence | 584256-584282 | 6 | 0.778 |
NC_016026_4 | 4.2|2270536|27|NC_016026|CRISPRCasFinder | 2270536-2270562 | 27 | NZ_CP021806 | Sinorhizobium meliloti strain T073 plasmid psymB, complete sequence | 636625-636651 | 6 | 0.778 |
NC_016026_4 | 4.2|2270536|27|NC_016026|CRISPRCasFinder | 2270536-2270562 | 27 | NZ_CP019484 | Sinorhizobium meliloti strain B401 plasmid pSymB, complete sequence | 1476077-1476103 | 6 | 0.778 |
NC_016026_4 | 4.2|2270536|27|NC_016026|CRISPRCasFinder | 2270536-2270562 | 27 | NZ_CP019487 | Sinorhizobium meliloti strain B399 plasmid pSym, complete sequence | 1252370-1252396 | 6 | 0.778 |
NC_016026_4 | 4.2|2270536|27|NC_016026|CRISPRCasFinder | 2270536-2270562 | 27 | NC_020560 | Sinorhizobium meliloti 2011 plasmid pSymB, complete sequence | 1357630-1357656 | 6 | 0.778 |
NC_016026_4 | 4.2|2270536|27|NC_016026|CRISPRCasFinder | 2270536-2270562 | 27 | NZ_CP013635 | Rhizobium sp. N324 plasmid pRspN324e, complete sequence | 414883-414909 | 6 | 0.778 |
NC_016026_4 | 4.2|2270536|27|NC_016026|CRISPRCasFinder | 2270536-2270562 | 27 | NC_017326 | Sinorhizobium meliloti SM11 plasmid pSmeSM11d, complete sequence | 377784-377810 | 6 | 0.778 |
NC_016026_4 | 4.2|2270536|27|NC_016026|CRISPRCasFinder | 2270536-2270562 | 27 | NC_017323 | Sinorhizobium meliloti BL225C plasmid pSINMEB02, complete sequence | 1620436-1620462 | 6 | 0.778 |
NC_016026_4 | 4.2|2270536|27|NC_016026|CRISPRCasFinder | 2270536-2270562 | 27 | NZ_CP009146 | Sinorhizobium meliloti strain RMO17 plasmid pSymB, complete sequence | 378929-378955 | 6 | 0.778 |
NC_016026_4 | 4.2|2270536|27|NC_016026|CRISPRCasFinder | 2270536-2270562 | 27 | NC_018701 | Sinorhizobium meliloti Rm41 plasmid pSYMB, complete sequence | 372700-372726 | 6 | 0.778 |
NC_016026_4 | 4.2|2270536|27|NC_016026|CRISPRCasFinder | 2270536-2270562 | 27 | NZ_CP021802 | Sinorhizobium meliloti strain USDA1021 plasmid psymB, complete sequence | 996998-997024 | 6 | 0.778 |
NC_016026_4 | 4.2|2270536|27|NC_016026|CRISPRCasFinder | 2270536-2270562 | 27 | NZ_CP021820 | Sinorhizobium meliloti strain M162 plasmid psymB, complete sequence | 724804-724830 | 6 | 0.778 |
NC_016026_4 | 4.2|2270536|27|NC_016026|CRISPRCasFinder | 2270536-2270562 | 27 | NZ_CP021831 | Sinorhizobium meliloti strain HM006 plasmid psymB, complete sequence | 628673-628699 | 6 | 0.778 |
NC_016026_4 | 4.2|2270536|27|NC_016026|CRISPRCasFinder | 2270536-2270562 | 27 | NZ_CP021814 | Sinorhizobium meliloti strain M270 plasmid psymB, complete sequence | 260285-260311 | 6 | 0.778 |
NC_016026_4 | 4.2|2270536|27|NC_016026|CRISPRCasFinder | 2270536-2270562 | 27 | NZ_CP021810 | Sinorhizobium meliloti strain Rm41 plasmid psymB, complete sequence | 1161494-1161520 | 6 | 0.778 |
NC_016026_4 | 4.2|2270536|27|NC_016026|CRISPRCasFinder | 2270536-2270562 | 27 | NZ_CP021218 | Sinorhizobium meliloti RU11/001 plasmid pSymB, complete sequence | 524964-524990 | 6 | 0.778 |
NC_016026_4 | 4.2|2270536|27|NC_016026|CRISPRCasFinder | 2270536-2270562 | 27 | NZ_CP026527 | Sinorhizobium meliloti strain AK21 plasmid pSymB, complete sequence | 382032-382058 | 6 | 0.778 |
NC_016026_3 | 3.1|1544865|30|NC_016026|CRISPRCasFinder | 1544865-1544894 | 30 | JF314845 | Cronobacter phage ES2, complete genome | 17836-17865 | 7 | 0.767 |
NC_016026_3 | 3.1|1544865|30|NC_016026|CRISPRCasFinder | 1544865-1544894 | 30 | NZ_AP014865 | Bacillus thuringiensis serovar tolworthi strain Pasteur Institute Standard strain plasmid pKK1, complete sequence | 235712-235741 | 8 | 0.733 |
1. spacer 4.2|2270536|27|NC_016026|CRISPRCasFinder matches to NC_013859 (Azospirillum sp. B510 plasmid pAB510e, complete sequence) position: , mismatch: 3, identity: 0.889
-cgctggatggggcggcggatcatccac CRISPR spacer gcgc-ggatgcggcggcggatcatccgc Protospacer *** ***** ***************.*
2. spacer 4.2|2270536|27|NC_016026|CRISPRCasFinder matches to NC_048876 (Gordonia phage Secretariat, complete genome) position: , mismatch: 4, identity: 0.852
cgctggatggggcggcggatcatccac CRISPR spacer cgaaggatggggcggcggatcctcctc Protospacer ** ***************** *** *
3. spacer 4.2|2270536|27|NC_016026|CRISPRCasFinder matches to NZ_CP019586 (Sinorhizobium meliloti strain CCMM B554 (FSM-MA) plasmid pSymB, complete sequence) position: , mismatch: 5, identity: 0.815
cgctggatggggcggcggatcatccac CRISPR spacer cgctggatcgggcggcggaacatcgga Protospacer ******** ********** **** .
4. spacer 4.2|2270536|27|NC_016026|CRISPRCasFinder matches to NC_019849 (Sinorhizobium meliloti GR4 plasmid pRmeGR4d, complete sequence) position: , mismatch: 5, identity: 0.815
cgctggatggggcggcggatcatccac CRISPR spacer cgctggatcgggcggcggaacatcgga Protospacer ******** ********** **** .
5. spacer 3.1|1544865|30|NC_016026|CRISPRCasFinder matches to NZ_CP040048 (Acinetobacter baumannii strain VB1190 plasmid unnamed1, complete sequence) position: , mismatch: 6, identity: 0.8
tgtcataa----aaaaggccgcaatttctggaag CRISPR spacer ----atagccgcaaaaggccgcaatttctgtaag Protospacer ***. ****************** ***
6. spacer 4.2|2270536|27|NC_016026|CRISPRCasFinder matches to NZ_CP019586 (Sinorhizobium meliloti strain CCMM B554 (FSM-MA) plasmid pSymB, complete sequence) position: , mismatch: 6, identity: 0.778
cgctggatggggcggcggatcatccac CRISPR spacer gattggatggggcgcccgatcatccaa Protospacer ..*********** * *********
7. spacer 4.2|2270536|27|NC_016026|CRISPRCasFinder matches to NC_019849 (Sinorhizobium meliloti GR4 plasmid pRmeGR4d, complete sequence) position: , mismatch: 6, identity: 0.778
cgctggatggggcggcggatcatccac CRISPR spacer gattggatggggcgcccgatcatccaa Protospacer ..*********** * *********
8. spacer 4.2|2270536|27|NC_016026|CRISPRCasFinder matches to NC_014818 (Asticcacaulis excentricus CB 48 plasmid pASTEX01, complete sequence) position: , mismatch: 6, identity: 0.778
cgctggatggggcggcggatcatccac CRISPR spacer agctggatggggcggcggctcatatcg Protospacer ***************** **** .
9. spacer 4.2|2270536|27|NC_016026|CRISPRCasFinder matches to NC_003078 (Sinorhizobium meliloti 1021 plasmid pSymB, complete sequence) position: , mismatch: 6, identity: 0.778
cgctggatggggcggcggatcatccac CRISPR spacer gattggatggggcgcccgatcatccaa Protospacer ..*********** * *********
10. spacer 4.2|2270536|27|NC_016026|CRISPRCasFinder matches to NZ_CP021799 (Sinorhizobium meliloti strain USDA1106 plasmid psymB, complete sequence) position: , mismatch: 6, identity: 0.778
cgctggatggggcggcggatcatccac CRISPR spacer gattggatggggcgcccgatcatccaa Protospacer ..*********** * *********
11. spacer 4.2|2270536|27|NC_016026|CRISPRCasFinder matches to NC_016624 (Azospirillum lipoferum 4B plasmid AZO_p5, complete sequence) position: , mismatch: 6, identity: 0.778
cgctggatggggcggcggatcatccac CRISPR spacer gtgcggatgcggcggcggatcatccgc Protospacer .***** ***************.*
12. spacer 4.2|2270536|27|NC_016026|CRISPRCasFinder matches to NZ_CP021828 (Sinorhizobium meliloti strain KH35c plasmid psymB, complete sequence) position: , mismatch: 6, identity: 0.778
cgctggatggggcggcggatcatccac CRISPR spacer gattggatggggcgcccgatcatccaa Protospacer ..*********** * *********
13. spacer 4.2|2270536|27|NC_016026|CRISPRCasFinder matches to NZ_CP021823 (Sinorhizobium meliloti strain KH46 plasmid psymB, complete sequence) position: , mismatch: 6, identity: 0.778
cgctggatggggcggcggatcatccac CRISPR spacer gattggatggggcgcccgatcatccaa Protospacer ..*********** * *********
14. spacer 4.2|2270536|27|NC_016026|CRISPRCasFinder matches to NZ_CP021795 (Sinorhizobium meliloti strain USDA1157 plasmid psymB, complete sequence) position: , mismatch: 6, identity: 0.778
cgctggatggggcggcggatcatccac CRISPR spacer gattggatggggcgcccgatcatccaa Protospacer ..*********** * *********
15. spacer 4.2|2270536|27|NC_016026|CRISPRCasFinder matches to NZ_CP021806 (Sinorhizobium meliloti strain T073 plasmid psymB, complete sequence) position: , mismatch: 6, identity: 0.778
cgctggatggggcggcggatcatccac CRISPR spacer gattggatggggcgcccgatcatccaa Protospacer ..*********** * *********
16. spacer 4.2|2270536|27|NC_016026|CRISPRCasFinder matches to NZ_CP019484 (Sinorhizobium meliloti strain B401 plasmid pSymB, complete sequence) position: , mismatch: 6, identity: 0.778
cgctggatggggcggcggatcatccac CRISPR spacer gattggatggggcgcccgatcatccaa Protospacer ..*********** * *********
17. spacer 4.2|2270536|27|NC_016026|CRISPRCasFinder matches to NZ_CP019487 (Sinorhizobium meliloti strain B399 plasmid pSym, complete sequence) position: , mismatch: 6, identity: 0.778
cgctggatggggcggcggatcatccac CRISPR spacer gattggatggggcgcccgatcatccaa Protospacer ..*********** * *********
18. spacer 4.2|2270536|27|NC_016026|CRISPRCasFinder matches to NC_020560 (Sinorhizobium meliloti 2011 plasmid pSymB, complete sequence) position: , mismatch: 6, identity: 0.778
cgctggatggggcggcggatcatccac CRISPR spacer gattggatggggcgcccgatcatccaa Protospacer ..*********** * *********
19. spacer 4.2|2270536|27|NC_016026|CRISPRCasFinder matches to NZ_CP013635 (Rhizobium sp. N324 plasmid pRspN324e, complete sequence) position: , mismatch: 6, identity: 0.778
cgctggatggggcggcggatcatccac CRISPR spacer ggctggaagcggcggcggatcatctgt Protospacer ****** * **************...
20. spacer 4.2|2270536|27|NC_016026|CRISPRCasFinder matches to NC_017326 (Sinorhizobium meliloti SM11 plasmid pSmeSM11d, complete sequence) position: , mismatch: 6, identity: 0.778
cgctggatggggcggcggatcatccac CRISPR spacer gattggatggggcgcccgatcatccaa Protospacer ..*********** * *********
21. spacer 4.2|2270536|27|NC_016026|CRISPRCasFinder matches to NC_017323 (Sinorhizobium meliloti BL225C plasmid pSINMEB02, complete sequence) position: , mismatch: 6, identity: 0.778
cgctggatggggcggcggatcatccac CRISPR spacer gattggatggggcgcccgatcatccaa Protospacer ..*********** * *********
22. spacer 4.2|2270536|27|NC_016026|CRISPRCasFinder matches to NZ_CP009146 (Sinorhizobium meliloti strain RMO17 plasmid pSymB, complete sequence) position: , mismatch: 6, identity: 0.778
cgctggatggggcggcggatcatccac CRISPR spacer gattggatggggcgcccgatcatccaa Protospacer ..*********** * *********
23. spacer 4.2|2270536|27|NC_016026|CRISPRCasFinder matches to NC_018701 (Sinorhizobium meliloti Rm41 plasmid pSYMB, complete sequence) position: , mismatch: 6, identity: 0.778
cgctggatggggcggcggatcatccac CRISPR spacer gattggatggggcgcccgatcatccaa Protospacer ..*********** * *********
24. spacer 4.2|2270536|27|NC_016026|CRISPRCasFinder matches to NZ_CP021802 (Sinorhizobium meliloti strain USDA1021 plasmid psymB, complete sequence) position: , mismatch: 6, identity: 0.778
cgctggatggggcggcggatcatccac CRISPR spacer gattggatggggcgcccgatcatccaa Protospacer ..*********** * *********
25. spacer 4.2|2270536|27|NC_016026|CRISPRCasFinder matches to NZ_CP021820 (Sinorhizobium meliloti strain M162 plasmid psymB, complete sequence) position: , mismatch: 6, identity: 0.778
cgctggatggggcggcggatcatccac CRISPR spacer gattggatggggcgcccgatcatccaa Protospacer ..*********** * *********
26. spacer 4.2|2270536|27|NC_016026|CRISPRCasFinder matches to NZ_CP021831 (Sinorhizobium meliloti strain HM006 plasmid psymB, complete sequence) position: , mismatch: 6, identity: 0.778
cgctggatggggcggcggatcatccac CRISPR spacer gattggatggggcgcccgatcatccaa Protospacer ..*********** * *********
27. spacer 4.2|2270536|27|NC_016026|CRISPRCasFinder matches to NZ_CP021814 (Sinorhizobium meliloti strain M270 plasmid psymB, complete sequence) position: , mismatch: 6, identity: 0.778
cgctggatggggcggcggatcatccac CRISPR spacer gattggatggggcgcccgatcatccaa Protospacer ..*********** * *********
28. spacer 4.2|2270536|27|NC_016026|CRISPRCasFinder matches to NZ_CP021810 (Sinorhizobium meliloti strain Rm41 plasmid psymB, complete sequence) position: , mismatch: 6, identity: 0.778
cgctggatggggcggcggatcatccac CRISPR spacer gattggatggggcgcccgatcatccaa Protospacer ..*********** * *********
29. spacer 4.2|2270536|27|NC_016026|CRISPRCasFinder matches to NZ_CP021218 (Sinorhizobium meliloti RU11/001 plasmid pSymB, complete sequence) position: , mismatch: 6, identity: 0.778
cgctggatggggcggcggatcatccac CRISPR spacer gattggatggggcgcccgatcatccaa Protospacer ..*********** * *********
30. spacer 4.2|2270536|27|NC_016026|CRISPRCasFinder matches to NZ_CP026527 (Sinorhizobium meliloti strain AK21 plasmid pSymB, complete sequence) position: , mismatch: 6, identity: 0.778
cgctggatggggcggcggatcatccac CRISPR spacer gattggatggggcgcccgatcatccaa Protospacer ..*********** * *********
31. spacer 3.1|1544865|30|NC_016026|CRISPRCasFinder matches to JF314845 (Cronobacter phage ES2, complete genome) position: , mismatch: 7, identity: 0.767
tgtcataaaaaaggccgcaatttctggaag CRISPR spacer tgtaataaaaaaggccgccatttggcgacc Protospacer *** ************** **** **
32. spacer 3.1|1544865|30|NC_016026|CRISPRCasFinder matches to NZ_AP014865 (Bacillus thuringiensis serovar tolworthi strain Pasteur Institute Standard strain plasmid pKK1, complete sequence) position: , mismatch: 8, identity: 0.733
tgtcataaaaaaggccgcaatttctggaag CRISPR spacer aacaataaaaaaggccgtcatttctggcgg Protospacer .. *************. ******** .*
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
508142 : 516390
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NC_016026|508142:516390|DBSCAN-SWA CATGCAGGCAAGCCCATCAGCCCTGATTACAGGGATCACCGGACAGGACGGCGCGCATCTGGCGGAATTTTTGCTGGGGCGCGGGTATGTGGTGCATGGCGTGCGCCTGTATTCCGCGACCGATGATACGCAACGCCTGCGCGATATTCTGGATCATCCACGCTTTCACCTGCATATCGGCGATTTGAATGATGGCGGGTCGCTGGCCCGCTTGATCCGCGATTGTGCGCCCGATGAAATTTATAACCTGGCCGCGCAAAGCCATGTTCACGCCAGTTTCAAAGTGCCTGAGGCGACCGCGCAGATCAACGCGCTGGGTCCGCTGCGCCTGCTCGAAGCCATTCGCCTGCTGGGGCGCGAGCATGAGATTAAGTTTTATCAGGCATCCAGCTCTGAAATGTTCGGCAATGCCCCGGCCCCGCAAAGCGAAGATACGCCGTTTACACCGTGCAGCCCCTATGCCGCCGCGAAACTCTACGCCTATTGGCTGGTGCGCAATTACCGCGATGCGTATGGTATGTTCGCGTGTAACGGCATCCTCTTTAACCATGAAAGCGCCCTGCGCGGTCAGGAATTTGTGACGCAGAAAATTGCGCGCGGCGTTGCGGCGCTGGCGTCTGGCCATGATGCCCCTGCGCTGGTTCTAGGCAATTTGAATTCCCGTCGCGATTGGGGCGATGCGCGTGATTATGTGCGTGGCATGTGGATGATGTTGCAACGGGATACGCCGGACGATTACGTCCTAGGCACCGGGCAATCCTACAGCGTACGCGATTTCGTCAATGCGGCATTTGATGCGGTGGGTTTTACCCTGACATGGACGGGGATTGGCGTGGGGGAAACCGCGCGCTGTGCCCGCACTGGCCGGTTATTGGTCAGCATTGACCCATCCCTGTTCCGCCCGACGGAAGTGAATAACCTGATCGCCGATGCGGCCAAAGCCCGAACCGTTCTGGGCTGGATGCCGGAAACGGATTTTCAAACGCTGGTGCGGGATATGGTTGCGGCGGCAATGGATAATACACAGAAATCGCACGGCGACGATGACTGGACGAACGATAACTATGCCCGCCTTGCGTAAATTACCGTATCGTCGGATCTGGATTGCCGGGCACCGTGGGCTGGTGGGCGCGGCATTGGTGCGTCATTTGCGGGACAACCATCCGGATTGTGAAATTTTGGCCGTATCACGCGATACACTCGATCTGCGGCGGCAGGATGAAACCGAACACTGGATCGCACAGAACAAGCCTGATGCCATCATTCTGGCGGCGGCGACCGTGGGCGGTATCGGGGCGAATGCGGCGCGCCCAGCAGATTTCCTCTACGACAATCTGGCCATTGCCACCAATGTCATTCATGCCGCCGCGGCCCAGAATGTGGGCAAGCTGCTGTTCCTGGGTTCATCCTGCATCTATCCGCGCGATTGCACGCAGCCGATTACCGAGGATGCGTTGCTGACCGGCGGGTTGGAGCCCAGTAATGAATGGTATGCGATTGCCAAAATTGCGGGGTTGAAATTGTGTCAGGCTTATCGCCGCCAAGGCGGGCATGATTTTATCGCGGCGATGCCGTGCAATTTGTACGGTCCCGGTGATCAGTTCGATCTGGAACAATCCCATGTTATTCCGGCCTTGATGATGCGGTTTGATAATGCCCGTCGCGCCGGTGATGCGTGCGTGACCTTGTGGGGAACGGGCCGTCCTTTGCGTGAATTTTTGTACGTTGATGATCTGGCCGATGCGCTGGTGACTCTGCTGGGCCATTATAGTGGTGAAAGTCCAGTGAATATTGGCGCGGGGGCGGATATATCAATTGCTGATCTGGCCTTGAAAATTGCACGTGTCACGGGGTATGGCGGACGCATTGAATGGGATTCGTCGAAACCCGATGGGACACCGCGGAAAATCATGGATTCATCGCGCATGCGTGCGCTGGGATGGGCCCCACAAACCGGTCTGGATGACGGTTTGGCTTTGGCGTGGGATTGGTACATCAATCATCGGGATCAACGGGCGGCCTGATCCATGTTGCACAATATTATTCCGGTTATTTTGTGCGGTGGTGTCGGGCGGCGTTTGTGGCCGTTATCGACGCCGCGTCGGCCCAAACCTTTCTTACGGGATATGTCCGGGCAATCTTTGTTGCAACAGACCATGGATCGTGCCCGTGGCATGAAGCCCCCCGTTATTGTGTGTAATAAAATCCATGCCGATTTGGTACGACGGGATATAGGAACGACGTCATCGGCTCTTCTTCTAGAGCCCTGTGGCCGGAATACGGCCCCGGCGATGGTGGCGGCGGCCCATTATATTCAGCGTGAATTTGGCAGCGATGCCGTCATGCTGATCATGCCATCCGATCATTTTATGGCCGATCCGGCGGCTGTGGGGCGGGCGGCGCTGACCTTGTGGCCGTATCTGGACCATGACATTGTTGGCGTGTTTGGGGTGCGTCCAACGCGCGCCGAAACGGGCTATGGCTATATTCAGGTCGATGTCGGTGCGCATGCGCGCCCGGGATCGCATGCTGTGCGCTCTTTTGTCGAGAAACCGGATCGGGAATTGGCGCAAACCTATCTGGATCAGGGGTGCTGGTGGTGGAACAGCGGGTTGTTTCTGGCCCGGGCGAAAACTCTTCTTGATCATGCGCAAACCCATGCCTCCGCCGCGTATGTGGCAACAGGGCGCGCCGTTGAAAACGGTGTCTGGGATCAGCAGGCTCTGCATCTGTCCAGCGATTTTGCCGACGCTCCGGCGGTGTCCTTTGACAAAGCGGTGATGGAGCGAATCGGCGGCGTGCGGGTGGCTGCGTTGGAAACCGTGTGGTCCGATTTGGGTACCTGGTCGGCATTGGCCAAAAACATGTGTGCCAACCTGTTTTCAACCGGTTGAATGTGCGGCGCATCATGGCGTAGTCTGGGGGGCTGTTAAACCCTTGGAGTCGCTCTTTCTATGACCACCCCGTCGGATGCGTTGTTACAGGTCGCCCTGATCCCCTATGTCTGTGGCGCGGGCGCACAAACCCCTGGATGTGAACAGGGGCCATTGGATTTCGAATTGCGTGGCTTGTCTGATGCGCTGCAGGCCTCAGGTCGCGATGTGTGGTGGAGCGTCGATCCCGAAGCATTACTGGCCGGGCCATACGGGTCCAGTGCCCACCGCGACTTGCCGCCATTGGGGTCCGATGAGCGCAACGAGATTGTGGTCTGGCACGTGCGCGGTCTGGCCGACCGTGTGGAAGAAGATGTTCGCAATGGGGCCTTCGTTGTGACGCTGGGGGGTGATCACAGCATGGCCGCCGGATCAATAACCGGTCTGGCCCGTGGGTTGAAGAAAACGATGGGCGCCGATGTTCGGCTGGGGTTGCTCTGGCTGGATGCGCATGCGGATTTGAACACACTGGCGACCACCCCGTCCAAAGCCTTGCACGGCATGCCGTTGTCACAGGTTTTGGGGCTGGATGTTGCGCATGATCCATTTGGTCTGGGGCATGACGCGGTTGTTGTCGCGCCCTCGCACCTGCTTTATGCGGGATTGCGTGATCTGGACCCGGGCGAGGTTGATTTTATCCGTGACATGAATATCCATTCATTTCCAATGCGGGATTTGGTCGGGAAAGACTTGGCTTCGACGTTAATCGCGGCTATGGCTGCGATGGATGTCGATGTGTGGGCCATTTCGTTGGATCTGGATGGGTTGGACCCGGCCTTTGCTCCGGCGGTGGGTACACCGGTTGCTGGCGGACTAAACCATAGCGATGTTCTTCAGGCCCTGCGCGCGATCATGGATCGTTTTGATGTGCGCCTGTTTGAAGTGGCGGAACATAACCCGACTCTGAAGGGGGCTGGGGTTAATTATCAAACCGCATTATCTACTTTGCAGGTGGTTTTGGATGGGGCCACACAGCGTTTTAAGATTCGATCCGACGCAGCGTGATAAAGGTCACGTTACCACCATATTCAACTTGGCCTGTCATTTGGATTTTGTCGCCCTTGCGCGCCAATTTTAATGTGGTGCGGTACGTGACCGGTTGGCGTGACGGTGACCCGTCCGGGCGCAGTAGAGAAAAATCTGAATCGGCATCGGTGGGATCGGCAATTGAAACCATTTCGACTTCGTTTTCATTCAGCACGGTGAATGTACTGGTCCACAGGCAATTGGCGGCGTCACGGCGTTCAGTTTTTCCGTCGTGGATGTGTGTGATGCCATCGGCTTTCATTTCCAGTGGGCCTTGATATGAGCTGGTGCTGCTGACTTGGTATTGTCCATCCAATTCATGGGTCATTTTTGTTTTTCCGTTTTATGTGCTGAAAACATGCAGTGTCGGGCCGTTGCGGTGCCGATAAATATCCGCCGTGAGCAGGCCGATATTGCGACCCACCGAATCCAGCAATTGCGGGCGCGGTTTTAACGAGGTCATCATCGTCAGTCGTTGTCCTTGTTTAAGCTCAAACGTCATCGGGGTTTTATTGCGGTCCATAATGGTAATGGTTTGACCCTGTTTAACCACCAATCTGTGTTCAACCATAGGGGCGCACGGTATCACGCTGGCCTCGTAGCCAAAGTAATCGGGCCCGACCTTCATCAGTTGCGCCCCCGGGACATTGTCCCGAATGACACGCATAATATTGATCATGAAAGGCTCAAGGCACTGATCATACGCCCGCAACAATGTGTTGATGTTGTCGTTTGAGTCGTATCCGATCAATATTCTCCCATTGGGCCCGGCGAGTTGTTGCAACCCTTGCATAAAAATTCGGGCTTTGACCTTAGGAAAAGCTTCTGTCGGCTTTACGTTTTCCAGATTTGATATCAATGATCCCGTGCATAAGACCGCAGTGTCCTCTCGTGGGGCAAGGAAATGCGCGGCAACACGAAAGTTCATTGTCAGATAAGACGCACGAGCGCCGATGCAATTGAAGATATAGGATCGATCAATCACCTTCTTCGCCGCATCATTAAACTTCTGGCTCAAGTCGATAAAAATGATCTCTTTCAAATGAGGTATATGCTTGAGGATATGCATTTCCTTCTGCATAAAGCTGTCTCGTGGGCCGGGGCCGACAATGATGGCACGGGTGCAAAAGCGGAAATTCCGGGACAGTTCGGAACAGTTTTTTTGAAACAGTTCAATTTCGTCGTTATACAGGTAGTAATTCGGGTTTTCATGGACAAATTTATTGAACAAAAATGCCCCGTTATCGAGATAGGCGATCCGGCCCATGTGGCCGTGCTCTATCCCGCTGAACAGGTTGATGGCCGCAGCAGTAACATCACTTTTTTGTTCGTCGTTTAGTTTTGGATGAACCATGATGTGCTTTTAAATCTATGCCGCAACTTTCACGGATGTTCCAACGGGGGTTGATCGGGTCCACATATCGGCAAACTGCATGCGATAGGCGCTGGCAATCGCCGGATAATCCATCAAAATCACGGTCACATTGGCATCAAACAACATGATGGCCAGTTTGTCACCGTAAACATAAAACGGAACCGATGCGAACAGCTCGCTGGGGATCCAGCGATATTCTGCGTAGGATTCGCCCGGAACAAAATCATCGCCCTCACGGATCAGGATTTTGTATTTGATTTTTTTCAAGGCGCGCATGCGGTCAATATGGGTTTGGGCGTATTGGCCCAGATGTTTGATGAACAAGCGTTCATCCACATTGCTGACCAAAATGTCGCCGGCATAATTGCGCACGGTGCTGTAAATATCTTCGTAAAAGTCGATCAGCCCGGCCTGGCCCGTGAACACGCGTACCTCGCCACTGCGCAGGCGGATGCCTGTGGTGCCCAGAAATTCAATCCCTGCGGATTCGAACGCCTTCTGAATGGTGTGCAGGGTGTTGTTGCGGGGTGTACTTTGATTATTTTCGATGCTACCAATGGACGTCGCAGACAATCCGCATCGATCGGCCAGATCTTGTTGGCTCCAGTTTAATACGGCGCGTGCGCCACGAATTTGTGCTGTGGTAATGGTCATTTATGATGTCCTGTCCTTTATAATTCGTTTTTTCTTAAGCGTTTCTTGGGAAATTCGTAAGTATTGGTAGCGCTCGAATGTAGCGGGCGCAACCAAAATTATCGATCTATCCATTGGAAATCCGGGTTGAAAAATCCCCTGTCAGCCAGAGGCCGCAGGGGATTTGTTAATTGTCTGTTTTTATTGATCTTTTATGCAACCTGCCCCGGGCTGGTCTGGGCGGGGTCCAGCAATCCTTCGCGGCGGAGCAGGGCATCCGGGTCGGCATCGCGGCCCCGGAACCGGACATACAGCACCTGCGGGTGCTCGCTGCCGCCGCGGGACAGGATTTCGGCCTTGTATGCCTTGGCCGTTTCCTGATCATACAAACCCTTTTCCTCAAACAGGGCGAAGGTATCGGCATCCAACACTTCGGCCCATTTGTAACTGTAATATCCGGCGGAATATCCACCCGCGAACAAATGGCTGAACGCGGTGGATTGCGGTCCGGCCAGACGCGGGAACAGGGACATATCCTTCATCACGCTGTCCTCAAACGTGGCCGTGTCTTTGATCGTCGATGGGTTGGCTGTGTGCCATGCCATGTCGAGTAGGGCAAAGCTGACCTGACGCAATCCGGCCCAGCCACCCATAAAGTTTTTGGCATCGTTCAGTTTCTTGACCAGATCGGCTGGAATCTTCTCACCCGTCTTGTAATGCGCGGCGAACAGGTCGAGCGTTTCCGATGTGTAGGCCCAGTTTTCCTGCACTTGGCTCGGTAGTTCCACGAAATCCCACAAAACATTCGTGCCCGCCATAGACATATACGTCACATCCGACAACATACCGTGAATGGCGTGGCCCATTTCGTGGAACAGGGTCAGCAATTCGTCGAAGGTGAGGAGCGATGGCTGATCCTTCGCCGGTTTGGTGAAGTTGCAGACAATCGCAATGACGGGACGGCGGACTTCACCGCGATACAGGCCCTGATCGCGGAACGCGGTCATCCATGCGCCATCCTTCTTCCCCGTGCGCGGGAAGAAATCAGCATAGAATGTGCCCATGAATTTATTGCTGTCATTGTCAAAGACGTCATAGGCGGTCACATCCGGGTGCCAGACGGGGTATTTGTCGTTGGCGACGAAGCGCAGATTGAACAATTTTGTAAAGTGCTGGAACACGCCCTTTAACACATTGTCCAATGGGAAATACGGGCGCAAATCTTCGGATGAAAATTTGAACAGGTCCTGTTTCAATTTTTCACCGTAATATGCCACATCCCACGGCTGGATCGGGTCCGGGCCGCCGGCCTTGCGGGCGAAATCCTGCAGCATCTTTAAATCTTTTTCGGCGGCCGGTTTGTATGCGTTTTTCAACTTGGTCAGAAACGCGCGCACTTCATCTTCGGTCCGGGCCATGCGCTGTTCCAGCACATAGGCCGCATGGTCGTTATAGCCCAGCAATTTGGCCCGGGCATCGCGCAAGGTTACAATGTCCAAAACCATCTGGCTGTTGTCAAATTCATCGCCATAGGCGCGGCTGGCAAAGGCGCGCCAGATTTTTTCGCGTAAACTCCGTTTGTCGGCATATTGCACAACAGGCAGGTAGCTGGGGAAATCCAGCGTGAACAGCCACGCGTCATTGCGGCCTTTTTCTTCGGCCATCTGGCGGGCCACGGCGCGGGCACCATCGGGGATGCCGGACAAATCGGATTCATCCGTAATCCACAATTCAAATTTTTCCGCCGATTTTTTAACGTTGTTTGAGAACGCCGGACCCATGGTCGACAGGCGTTCATTCATCTCACGCAATTTGGCTTTGTCAGCATCGTTTAATAATGCGCCGCCACGAACGAAACCCTTGTATGTTTCATCCAGCATACGGGATTGTTCCGGGGTCAGGGACAGGGTGTCACGTTGATCCCAGACGGCCTTGACGCGTGCGAATAAGTCCGGGTCCAGCGACACGTCGCTGGAAAAATTCGCACTTAAAGGGCCAATCTTGTCGGACAAGGCCTGCAGCCCATCCGTGCCATTGGCGGACAGCATGTTGTAAAACACACCCGATGCCGCGCCCAATGTTTCCGATGCGGTTTCCAACGCCACAATGGTATTTTCAAATGTCGGTGCGTCTTTATTGGCTTTGATCGCGGCAATGTTCTTGCGGGCTTCCTCAATCCCGGCCTCTACCGCGGGCAGGAAATGCTCCTCGCGAATTTTATCAAAGGCCGGGGCCATGTTCGGTACATCGGACGGGGTCAACAGCGGATTGGTATTGGTCAT
Protein sequences of DBSCAN-SWA_1 >NC_016026|508142:516390|508142_509222_+|WP_041793744.1|DBSCAN-SWA MQASPSALITGITGQDGAHLAEFLLGRGYVVHGVRLYSATDDTQRLRDILDHPRFHLHIGDLNDGGSLARLIRDCAPDEIYNLAAQSHVHASFKVPEATAQINALGPLRLLEAIRLLGREHEIKFYQASSSEMFGNAPAPQSEDTPFTPCSPYAAAKLYAYWLVRNYRDAYGMFACNGILFNHESALRGQEFVTQKIARGVAALASGHDAPALVLGNLNSRRDWGDARDYVRGMWMMLQRDTPDDYVLGTGQSYSVRDFVNAAFDAVGFTLTWTGIGVGETARCARTGRLLVSIDPSLFRPTEVNNLIADAAKARTVLGWMPETDFQTLVRDMVAAAMDNTQKSHGDDDWTNDNYARLA >NC_016026|508142:516390|509205_510165_+|WP_014102052.1|DBSCAN-SWA MPALRKLPYRRIWIAGHRGLVGAALVRHLRDNHPDCEILAVSRDTLDLRRQDETEHWIAQNKPDAIILAAATVGGIGANAARPADFLYDNLAIATNVIHAAAAQNVGKLLFLGSSCIYPRDCTQPITEDALLTGGLEPSNEWYAIAKIAGLKLCQAYRRQGGHDFIAAMPCNLYGPGDQFDLEQSHVIPALMMRFDNARRAGDACVTLWGTGRPLREFLYVDDLADALVTLLGHYSGESPVNIGAGADISIADLALKIARVTGYGGRIEWDSSKPDGTPRKIMDSSRMRALGWAPQTGLDDGLALAWDWYINHRDQRAA >NC_016026|508142:516390|514320_516390_-|WP_014102058.1|DBSCAN-SWA MTNTNPLLTPSDVPNMAPAFDKIREEHFLPAVEAGIEEARKNIAAIKANKDAPTFENTIVALETASETLGAASGVFYNMLSANGTDGLQALSDKIGPLSANFSSDVSLDPDLFARVKAVWDQRDTLSLTPEQSRMLDETYKGFVRGGALLNDADKAKLREMNERLSTMGPAFSNNVKKSAEKFELWITDESDLSGIPDGARAVARQMAEEKGRNDAWLFTLDFPSYLPVVQYADKRSLREKIWRAFASRAYGDEFDNSQMVLDIVTLRDARAKLLGYNDHAAYVLEQRMARTEDEVRAFLTKLKNAYKPAAEKDLKMLQDFARKAGGPDPIQPWDVAYYGEKLKQDLFKFSSEDLRPYFPLDNVLKGVFQHFTKLFNLRFVANDKYPVWHPDVTAYDVFDNDSNKFMGTFYADFFPRTGKKDGAWMTAFRDQGLYRGEVRRPVIAIVCNFTKPAKDQPSLLTFDELLTLFHEMGHAIHGMLSDVTYMSMAGTNVLWDFVELPSQVQENWAYTSETLDLFAAHYKTGEKIPADLVKKLNDAKNFMGGWAGLRQVSFALLDMAWHTANPSTIKDTATFEDSVMKDMSLFPRLAGPQSTAFSHLFAGGYSAGYYSYKWAEVLDADTFALFEEKGLYDQETAKAYKAEILSRGGSEHPQVLYVRFRGRDADPDALLRREGLLDPAQTSPGQVA >NC_016026|508142:516390|512053_512362_-|WP_148260499.1|DBSCAN-SWA MKADGITHIHDGKTERRDAANCLWTSTFTVLNENEVEMVSIADPTDADSDFSLLRPDGSPSRQPVTYRTTLKLARKGDKIQMTGQVEYGGNVTFITLRRIES >NC_016026|508142:516390|511095_512079_+|WP_014102054.1|DBSCAN-SWA MTTPSDALLQVALIPYVCGAGAQTPGCEQGPLDFELRGLSDALQASGRDVWWSVDPEALLAGPYGSSAHRDLPPLGSDERNEIVVWHVRGLADRVEEDVRNGAFVVTLGGDHSMAAGSITGLARGLKKTMGADVRLGLLWLDAHADLNTLATTPSKALHGMPLSQVLGLDVAHDPFGLGHDAVVVAPSHLLYAGLRDLDPGEVDFIRDMNIHSFPMRDLVGKDLASTLIAAMAAMDVDVWAISLDLDGLDPAFAPAVGTPVAGGLNHSDVLQALRAIMDRFDVRLFEVAEHNPTLKGAGVNYQTALSTLQVVLDGATQRFKIRSDAA >NC_016026|508142:516390|510168_511035_+|WP_014102053.1|DBSCAN-SWA MLHNIIPVILCGGVGRRLWPLSTPRRPKPFLRDMSGQSLLQQTMDRARGMKPPVIVCNKIHADLVRRDIGTTSSALLLEPCGRNTAPAMVAAAHYIQREFGSDAVMLIMPSDHFMADPAAVGRAALTLWPYLDHDIVGVFGVRPTRAETGYGYIQVDVGAHARPGSHAVRSFVEKPDRELAQTYLDQGCWWWNSGLFLARAKTLLDHAQTHASAAYVATGRAVENGVWDQQALHLSSDFADAPAVSFDKAVMERIGGVRVAALETVWSDLGTWSALAKNMCANLFSTG >NC_016026|508142:516390|513469_514129_-|WP_014102057.1|DBSCAN-SWA MTITTAQIRGARAVLNWSQQDLADRCGLSATSIGSIENNQSTPRNNTLHTIQKAFESAGIEFLGTTGIRLRSGEVRVFTGQAGLIDFYEDIYSTVRNYAGDILVSNVDERLFIKHLGQYAQTHIDRMRALKKIKYKILIREGDDFVPGESYAEYRWIPSELFASVPFYVYGDKLAIMLFDANVTVILMDYPAIASAYRMQFADMWTRSTPVGTSVKVAA >NC_016026|508142:516390|512443_513454_-|WP_014102056.1|DBSCAN-SWA MVHPKLNDEQKSDVTAAAINLFSGIEHGHMGRIAYLDNGAFLFNKFVHENPNYYLYNDEIELFQKNCSELSRNFRFCTRAIIVGPGPRDSFMQKEMHILKHIPHLKEIIFIDLSQKFNDAAKKVIDRSYIFNCIGARASYLTMNFRVAAHFLAPREDTAVLCTGSLISNLENVKPTEAFPKVKARIFMQGLQQLAGPNGRILIGYDSNDNINTLLRAYDQCLEPFMINIMRVIRDNVPGAQLMKVGPDYFGYEASVIPCAPMVEHRLVVKQGQTITIMDRNKTPMTFELKQGQRLTMMTSLKPRPQLLDSVGRNIGLLTADIYRHRNGPTLHVFST |
8 | Acanthocystis_turfacea_Chlorella_virus(16.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
563889 : 581959
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NC_016026|563889:581959|DBSCAN-SWA GCTAAATTTTGATTTCTTCAATGACTAGCGTTGCTTTTGAAACTCCGCCGAACATTCTTCCAGTAGAAATTCCATTGATGTAATACGTACCGGCGGAAACACCGATCCTGACCTTCAAATCAATTGTAGAAGCCCCCGGTGTGATTATTACTGGCGGTATACGAGTTTGAGAGAACCCACCAGCGGAACCAGATGTGAATCCAAATGAAATCGCGTTTGCAACGCTATCTCGGAAAATACCCGCATAAGAAGCCGTGGCCGTACTATGTGATCCTTGGCATTGGAAATAAACGCGCAGGTTATTGCCACTGTTCACCATCGTGTATGTCGTACTGATAATTTCAATCCCCTCAGTGTTTTGAGGGATCGTGTCGTCAAGAGGGAAAGTGGCAGTAATCGCTGCGGATGTTGCATATTCTACAATTGCAGTCTTTACAATCTGCCCTGCTAAAGGCTGTAACGCTGTGTCTGCTTTTGCGCCTTGGGTGGCGGTCGCAAATACGTTTCTTGGTGGGTTAACCATTTCAACCCAATTGTTGGCTTCACTATACCGACCAACCATGAGATCGCCAGTGTTGTAAGCACCTGATGTGAGATCAGATCCGCGCCACTTAAGGGTTTTTGCGCCCAATTCGTTGGGGTTTACCGTTGCCGATCCTGTGATGTTTCCGGCTGAATACCAGGCCATCAACATTCCATCGCTAAATCCTATTGGTGCAGGATAGAGCGTTAGTGTTTGAGCATTTGCTGTTCCTCCAGCGGTGCTGAATCCTTCTTTGCTGACAAGATTTTCGTCTCGCCAAACCATCGCGTGAGAGATTGCCCCTATGAATACGTTCCGTGTCCCGGCCCCCCAGTTTACAAGGGCATCGCTGTTGGAAGATTCAAGAACGGCATCACGGCTTAATGTGTCAGGAGTTCCGGCAGTAACAGTACCAATACCGGATTCCCAATCCGTGCCATCTTCAATAAAATACGCAACCTTTGCGCCATTCCCAACGCCGGAAACAAATGATTGTGCACCGATGGTTGGGCCAGCAAGGTTGATCGTTCCGGTTCCAGTGGTTGTGCTGGTTTCAAGCACGCGGTCGGCATACTTAATCATTCGATTATCTCCTCGATACGGTAGGATTTTTCAAAAAGTTGGAAATTTGCATTAACGATGGGAGCAAGGCCTTTCATAAGGCCGTAAATTGTTCGCTTTTGGAGTGTGTCTTTTTTAGAGGGATCATTGATAAACAGAACGTCCCGCGTTGTTCCGCGCAGGCGGTCGATTTCAAACGCGATGTCGTACATCTCTTGCTCTGATGCAAAGGACAGCTTCAGCTCGCAAAACCTGTATTTGGGGCGTTCGTTTGCAATGTTTTTTCCGGAAACCATCCGGGCGATTGTGGATGGGTCGATAAATCCATCTGCAACGCCATAATCCATGTTGGTGACGGGTTGGAATGCTTTGGCCATGTATAGGCGACCAACATCGAGATAGCTGATTTCGGTATCAACCATATCAATGCGCCAGTATCGGTAATTCTGGGCCGTATCCAGATGCAGGATGAAGTGGTTTGTATCCAGTGCTCCGTATTGTTCATCATCAACACCGGAGGCCCATGTTTCGTCGTATCCGGATTGGTGGCTGCGCAGGGGAAGGTTTCCGCTGTCATAATCTGGGTCGGCCAGCAGGTCGGCCTCAACGTCGGCGGCCCGGACACGGGCATACCCACGGCTTGATCCGTTATGGCCCAGCAGGGCGATTAGGTTGACGGCCTTTTCCGTACCAAGATCAATCTGGATAAAAGCATTCTCCGGATCAATAAACCGGCACACTTCGCCAATTGGCATCCGTTTCAAATTTCCGACAGGAAGGCTGGCCGGATATGTGGACGCGCTTAATGTCGCCGCATCCGAAAGGACGGGCGTTGCCAACAGCATGTTTGCCATTATCCCCAAAGCTCCAATGTGGTTTGATTGGTTTCCGCGTCCTCTGAAATGCCGACAACGGATAGGTTTTTGTTAATGTTGAAGCGTGGATATTCCAATTTCACTGGGTCACCAACATAAAGGCGGAAAAGTGTGCCATATGAAGGAACCCGGTATATTTTACGTTCCACGCCATAAATGCGAACCAAGCGGGCAAGTAATGACTCGGCATCTGCTTTATCTGCAAGCGTTGTATAAAAAACCCGCTCAACGGCCTGCGCGTTTCGGGTTCTGACCGACCGGTCTTCGCTGACAACAGTGCGGTAATCTTGCCCCACAAATGTGCGGTATCCGTCTGTGGCTGAACCGGCGAGTTCATCTTCCTTCTGGACAACCCATGCCGGCGCATATGCAACCGATATTCGCCATGCTGGTGATATTGGTGCAGGGGCTTCAATCCCTGACGTATCAATATTGCTGGCGTTGATTGTGGCGATTTCGACGCCAGGATCGTCAACATAACCGGCCGTCAGCATTCCGGTCCGGCCAAAGTTCCAATAGGCCCCACATGGGTTAATCAGTGCATCCAGAAAATTGCGGATAGGGGCTTTTTCTGTGATGTACAGCCCCATGGCCCCGGGGATTTCATCGTCCAGACGGTTAAATGCGGCGCCATCAATTTCTGTGGCTGAAAACGACTGGATGCCGAGCTTTGTCATCACAAGACGACGCGCAATGTCGGCGGCGCTGGAAACATAACCCCCCGCACTGTCCCCGTGTGCATCTGCCGTAATTCGGCCCGCTGGCGTTGATCCGAGCTTGATGAATCCACTGGATAAACTGGTTGCGTATTTTCCGGCGTCGGGCGCTGCTTCGGCAATGTCCGTCACGTCCTCATCAAAGGCCAGCAACGCACCTTTGTCGTAAACCGCGTCAACACCCATCATGGGCCCGGCGTGGATCTGGTAAATCAGATTTACCGGGTCAACCAGAACAGGCTCAATGTTGAACGCTTCGCCATAGAGCAGGGGTTTCATCTTCCCCGATAGATCGTCACCGCCCTCAAGTCCACCTGTCCCGTCATAGGTCGCGGAAACAATGTCCTGATCGATCTTCAAGCCATTATCCCGGATGGTAAGAGTTATGACATCATCATCAAATTCGATGGCGTTGCATAGACCGTCAAAAACCGTGGTGAATTGGCTGTATTTGAACGATGGCCCGCCAGCTTTGACAACAATACGGCGGCCTTTCCAGCTCAAACCGGCCAGATAATCAAAATCACCGTTTCCGTTCTGGATCTGGATCGCGCCAAAGGATGGGCTGCTGATCCCGAACTCTTCACCCCGGAGAATGGAAAAATCGAACTGCAACGGATTGTTGACCACCGGGGCAAAGTATTGGTTCGCCGGGGTGTCCGTCGGTTCGGTAATAAATTTCATGTCTGAAAGGTAAACCGTTTCGATTCCACCCAATGTTTTGGCCTTTGTATATCCGAACGGAAGCCAACCGATCGGCGCAGGCAACCACGTATGCTCCACGATATTCGATTCATCGTATGGGTGCAGTTCGATCAGGTACCGCAGCTGGATATCCGGCTCTGACAAAAGCGCGCGGAAAGGGTCCGGCACAGAATTGTCGTATTCTGGCGGAATGTACCCATATGGCAGGAACCCAAATGGCGCGAACATTACGACCTCGCCACAATCATCTGGTTGGCCACTCGGCTCAATTGTCGGGACATTTCGCGCAGGTCTTTACGCATGGCCGCGATCTGCTCGTTATTTTCGGCCATTTTTACTTCCATCTGTGCCGATGAGCGGCCAACCATTTCCATGGATTCCTGATTGGTGTGAACGCGCTCACCGCCCCTGAACCGGACCAACTCGGGCCCGCGCTCACCAACCCACGCCAACCCTGCCGGTGCACTGGCCGTGCCGGTGGCATAGCCGGGGATATCCAGCTCTTTGGCGATCTGGCCAATCGAAGACCGGACAAAGCTTTCCAAAGCGGCAAAGCTGGTTGAGGACGCATAAACACCCTGACCAACGGTCAGAAGTTGCTGGGCGGCCTTCAAAAGATCCTGAGTGACGCTGTAATCACCGCCTTGTGCCGTGCTGAGAAGCTTTCCGAACTGATCTTGTGCCAGTTTCAGTTTGTCCATGGGAGATGCGGACGAAACGCTGCTCATGGACTGTTCATATAACCAACCGTCGAATGTGGCCTTCATCGCCTCCATAGACTGGAACCCGGCCTGCATGGCCTTGACGGCTCCGATTGCGACGTCCCGCTGTTTTTCCAAAGCCTCCGTCACCTTATCCAAAGGCAGGCCTAATTGAGCGGCACGGGTGTACATGGCATCAAACTGTTCATTGATTTGTTTGATGGCCTCAGCCGCCGATTCTGCGGGCGTGGCGGTGTCCTCAAGGATCGATCGGGCGAATGTGATATCCGACACCAATTGTTCCACATCGCTGCCCAAGCTCAAGGAGCGGCGCGCAACGGCCATCACGTCACTGTTTGCTCCGGTCAGGTATGAATCGCTGTTCAGAACCTGGCGGATAACGGCGTTTATGTCTCCGGCCGTGCTGGAAATTTTGCTGCGGGAACCGCCCCAGAATGTTCCCGTGTCCTTTTTGCCGATGTTCGTCTCGATCAGCGGGATAGCCTGCGCAAATGTTGCATTCAGGGCGCTGGCCACGGCGTTCAGGCTCTTTGTGACACCTTCCGCGAACTGGTTGGCCTTGGACATGTCAACGCCTTTGGTGCTGACAGCATTGATGCCGAGCAGTCCATCTTTCCCGACGTTGTACGATGTTCCGATGGTTTCCCGCTTCGGTCCGCCGCTGAACAGCCCACCAACAGCGGTACCAAGGAATCCACCAATCGCGGCACCAATCGGGCCACCGAATCCGCCGCCAATCAGAGAGCCAACGGTGCCCAATCCGGTATCGACCAGGCCATTGCCCGATCCGAGACCCAACAGGTTTGCACCAAGGCCACCAAGAGCGCCGAACGGAAGTCCGCCGAATGCTTTACCAAGCATCCCGGAGGCCTCAAAGCTCAAGCCAAGGGAACTGCCCAGACCTGCGCCGGTGTTGTACAAACCTGTGAAGCCCTGCGCCATGGTGATGCTACCGCCTGTGAGCAGGTTCTTACCCAACCCCAACAGGTTACCGATGCCACCAAGACCACCACCACCGGTGCCGCCAATATCGCCCAGAATACTGGCCTGTGCCCCGGATGAAATTCCCATGGCGCCACCGATGCCACCAACAACGGACAGGATAACCGGACGTGCCAGTGCCATGTATGCCAGATCAGCCAGGAACGTCTTGAAGGTCGATTTCCAGCCTTCGATCAGTTTTTTGAATCCGCCATCGCTTTCGGTGAAACCGGTTCGGAAGGCATCGCGGAAACCGTCATCGATCTGGTTTGCAAGGCTTTCAAACGCCTTTGCCACGGGGCCATTGCGCTCCGCCTGCACGCGAACCTTTTCCAGTTCATCGCCTGCCGCCTTGATGCCACGCTCGATCGCGGTGATTTCCTCTGCGGTTTTCGCGTATCCGCGCAAGGCTTCCAGTTCTTTGATCTTGGCGTTGTATTTTTCCTGCTCTGTCGCCGTGGATTTCACCAGCGCGGCAAGTTTCTTTGCTGTCTCCTCGGCGGATTTAGCGTCTCCAGATTGTCCCCCACCGCCACCGCCCGATCCTTTGCCTGAAAGGGTGCCAAGCCATGTCTGATACGCCTTTTGTGCCTGTGTGCTGCTGGCGGCAACGCCGCGCAGGTGGGCTTCAAGTTTACCGCCATACTGAAGCTGGTTTTTGCTTGCTTCTTTCATGACGCGATCAGCCGCCCGGTCTGCAACAGATTCAGAGAATCCGCCGAATAAGGGGGTGTTTGAGCGCGATCTTATGCTTGTTTTGCTGTTTTGAATGACGCCCTGATCATAGGCGCTCAATTCACCGGGTAATACTTTTTTTAAGGTCAGGATTAATATCGATAATCCATCAATAAGGTCATAAACATTGTTTTTAACGCGCAACAGAACCTCGTCGGCTACATCACCGAACGTCGCCATAGGGTCAATCAGCGCGGCAAGATCGTCACGGAAAATTATGGCTGCCGTAGAGGCTCCGGCAAGGATGGTTGCCAATAACATGATCGGGTTTGCCAGAAGTGCCGCAGCAAGGGCGGCGAGCGCCGGGATCACAACGGATGATATGGCTGCGGCCAGAGCAGGGAGGGCCGAACCTGCTAAAATCAGGACCGTGTTTCCGACCACATCGATATTTTCAGCAAGGAACGCGACCGCTTTGGCCAAAGACTGCGTCCCGCCAATCGTGGCATCTGTTTCTCCGATGTATTTTTGAAGCGCATTGCTCAACATCTGCGTGGCTCGTGCCAAAGAAAGTGGCATATCAGCGAACCGTGCGGCAATTTCTGCGGACTGCCCAGCGAGGGAATCGAACACATCCTTGGCCAGAACCTCGCCCGCCAGCACGGCCTGGCGAAGGCTTCCAACTGTCAGGTTCATACCCTTGGCAATCGCGGCTCCGACCTCTGGGATGTTTTCCATGATCGAGTTGAATTCTTCGGCACGCAGGATGCCACCGGCCATTGCCTGCGAAAACTGCATGGTACCAGCCGCAAGCGCCATCGTGCTGGCACCGCCGAGTAATCCGAGCTGATTAATTGTTTCGGAAAGCGCCAAAACTTCGGAATTTGTTTTTTCTAATTCTTTTGCCCCGATATTGATCCTTTGGAACAGTTGGATATTTCCCTCAAGCGCCGATCCTGTGCGTTGGGAAATATCGAACAATTCACGGCTGACGCGGGTATAGTCACCGGTTGCCCGGGTTACACCCTGAATACGGGCCATCAGAACCTGGTACGAATCTGCCGATTTCAGGATTTGCTGGATGCCAAAAGCACCCAGCAGCCCCATCATGGTATTACGCAGAGCACCAGCAGCCCGATCAAGGCTCTGCATCTGGCGTTCTGCGCGTTGCGTTGCTGACGTTACCTTGTCACCGCTGTTGGCGATATCATCCAACGTGCGCTTGACTGTACGGCCACCATTGGTTTCACCACGAACGGCAACAACAATTTCAGCATCTGTGGTCATGCGTTGCCTTTCTTTTCATCATGTTCGCGGTCCGCCTGTTCCATTTTACGGATCAGGTGGTTGAGTTCGTGGAGGTCGTCGAGGTCGAGGGAGTGGATCTGTGCGAATGTCACAATGGACGACCATGGAATAGGGCCTATGGCCATGCCCATGGGGCGATCATTCTGCAGATCGATGTACGCGTTGAAATAAAAGTCCAGACCGGGGAGAAGGGCGGGGCGCTCCAATAAGGCCTTGGGCATCTGCCCCATGGCCACTACGGCTTCAAGATCATTGATCTTGTCGCCCCACTTTCTCCACCAAATCAGGACTTCGCAGAGTTTTTTGCGTCTTCTTCCAGCTGTTCGGCACGGAACAGAGCAAAATCATCGGCGGCCTGCTGGATGTTTACAAACAGCTCCGGCAAATCGATCAGCAATTTCACGCAATTTTCTTTGGTGAATTTCATGGGCTTGCCATCTGGGCCAGAAACCCCGCTCCAGCCGATAATGATGGATTCGGCATAAATCTCAGCCATCACGCGGTTAATCACGTCAACATCAGCCGTTTTTGCCTGAATTTGGCGCGCATACGGTTTCAGCTTTGCGCTTGCCACTTTTGCGTATTTTTGATTCGACCCACCAGCGCGATGGATGGTGATGATGCCAGAGGCACCGTAATCCAGATCCACGCCTTTGCCGCTTTCCAGATTTTTGTCCGTTGTAAAGTGCTTGTAGAAGCTCATTTTTTAGTCCTTGTTTTCGGGTTGTGCGCAGTAGTGGGGACAGAACCCGATGACTGTCCCCACCATCCTGCGCGGGATATGGGTATTCAAGCCGCCGAAGCGGCTACCGGGTTGTTAAATGGTTACGGGGTTTCGGGCACGCGGGTGATTTTCAACGTGCACTGCTCTGTCGGGTCGTAGAGAGCTTGGAACGGCAGGGTAATCATAACGTCCTGATCGTTCCCCGGTGTAGCAACATCGGCATCACTGAATTTCAGCCGAGGGATGACGAATTCATAATTCGTTTCCGATGCGCCGCCGATGGTGAATGTCAAATCGGTCGCCGTGCCGGCCAAGAACAGGTCGTACGCTTCCTTGTTCTCAAAATACATGGTCACGGTACCCGTCAGAACAAAGCGACCAGCGCCGACGCCGAGGCTTTCAACGGAACCGACAACGGGCTTCTGCCGCAGGTTGTTGGTGCCTTGCAGGGACAGGGCTGTGATTTTCGGGCTGGTTACGCCTGTGACAGCCAATGACGCAAAGTTTGTCCCGGCGTTCATGACGTCATAGGTCGGCGCATCTGTGTACGTGGCACCGGAAATGGCCGTTGTAGCTACACTTCCGCCTTTGCCCATAAATCCGAATGAACCGGTCACGATCTGCTGGGTTGAAATATCCAGATTGAAGGTGTTGGCGATCATTCCGGTGTAGCGGAAGAAATTGTCCGTTGCTCCGGTTTCGAATGTTTTTTCGATTGAGAACGATTTCGGTGTGATACCGTTCTTCAGGACGTTGCTGCTCCATGCGCTGTACATGAAGGATTCCAGAAAATCATCGAATGTACCGTAGGACAGTTCGAAGTTTACCCCACCTTCTGCGCCACCAGCCACCTGTACCAGATCGGCCACGTTGCGGTCTGGGCGGATTTCGTTGCTGGTAATGTTCTGGCGCGCGTGTTTCAGCGATTCGCCGGTATATCGGACCTTTTTAAAGGTCGGGGTGGCCGGGGTGGTGCCATATACGCTTTCGGCGATATAGGCGAGGCGTGTCTGCGAGGTATCAGCCATTTCGATGGTCTCCTTTTGGCATTAAAAAAGCCCCTGACGGGGCGGTGGGGCTTTATCGGGAATTAGGCAATCCGGTCGTACTTGAATGCGGCGCTGACCTTCCATTGGTAGGCGCCTGTCGCATCCTTTCCGATGTCCCTGGCATTGATGTTGCTGAACGTGATGCCGGGCAGGGTGTTAGCAAGGAAAATGTCGTGCGCGATATCGGCCTTTTTGCGGGCGTCGATACTGCCCTGACCAGCAGGCTGGAAAATGTCGATATAAACCATACCGACGCGGCGGAACCGGTTTGAACCGGGGCTTCCCATGCTGGCCTGATATCCGTCGTTATTCTTCATACTGAACCGCACCCACGTCGCATTGTTCGGCGGGGTAAAGGATACGTCCGGCCATGCAATAGGGGTCCGCGCGGCCCATCCTGTATTGAAATAGGCGCGCACCGCGGCTTCGGCTTGTTCAAATTTCATGATTGTTTCGCCAGTTCCTTGCCCTGCCGGACGCCAATCTGCACCGCGGCATCCACGAAACCAGCCGGGGCTTTTGTTGAGCTTCCATCGTTCAGGGGACGGATATAGGGCAGGTTGTTCGATATATAGATCGTGTCGTTCAGTTTGTACGAACCGATAACAGGCAAGGCCTTTTCGGTTCCGTCGTACGACCCGCCTTCCATGCCAGATGGATCGACCAGCGACACATCTACGGTGTTCAGGTCGATATTCCAGTTTGCACGGGCCCGGCCAGTATCGACCGGGGTATTTTCAACAATGTTATTGAAAACCTTGAGGGCGATCAGGCGAGTGACTTTTTCCTGCGTTCCCAAAACCTTTTCTTCATAGGCCTTGTTTAACTGTCTGCGGAAATCTGCGCTCATGCTGATTTTTGCACCTCATCCATGACACGCTTTGAAAGCCATCCACAGGCCCCATGCAACAGAACAGGGTTTTCGTCCGGTGATATGCTATAGGCGGTTTGCACATAGCCTTGGGCATTGATTCCGGCGATCGCAATACTGCGAAATTCTCCGGTTTTCGCCTGTTCTAATGCTTGCGTCAGGTACCGAACAGCATCGTAATTGATTGGGTACGGCATCGGCTCGACCACGGAAAGCTTTGGTGTGACTTTGCCGATGCGACAGCCTGTCATTTCCGTACCTGCACCATGTACAGCAGGACGGTGTCGCCGGGCTGCACAACGTCTGTGTTGATCACGTTGTAGGTGACGCCACCATCAATGATCTTGTGGTCCGTATCCGGAGCCATATCCAGCGATGCACCAGCAATCAGGACGCGTTTATCTGTGCGTTTGATCTTTTCCCCATCCACATCGTTTTTGGAAAAATTCGTGAACAGGGCCTTTACCGTTTCATCCACCGGGGTGGCCGCCGTAAAGGTATCGTTGGATGGGTCATAGACCTGGCCTTCGCCGGGGCGGCGAACGGTGATTTCGCGCCCTTTATCTGCGATCTGTTTCAGCGCAGTTTTGGCCATGTTTTCGTAAAATTCCGTCATGTGCGCACCAATGGGCGGTTATAGATAGATCCGGAAAGCAAGCCAGACAGCAAGCCATCGATCGCGGGGCGGGTTTTGCCGTTCCGGGCATTGTCCATGTATTCGACCTCGATCACATCGACCTTTTCCCGCTTAACGGCGCGGTCCTGCGTCGGGTTCAGGTCAGTGGCAATGGCCTCCAGGGCCAATTGGATGACGGCCTGTTTCAGTTTGGCCGGGACGACAGCCTCATCGCGAGGGAACGAAAGGGCTTGCTCTTCGGTCAGTGCACAGCCCTTCCAGCGGCCCGCATAGGCCTGTTCCACATAATCCGTGGCGCGGACGAGGGCGGCTTGCTTTACCGAATTGCTTCCGGTCCACGCGGCGTTACCGCGGTCCGTATGATATGCGTCTGCCTCAGCGACAGATGCATAGCTGTTTGCGTCCGGTAATCCTGTGCCGTCCTCGACCACGAATGCCATGGATTATTGACCTTCGCCTGTGGCCAGCTTTGCGGCCCGTGCCAGATCAGCCAAAGCGGCTTTATTCGCGTCTGTTTCGAAGGTAACCTTGTTGGCTGTGAGGTATTCGGCCAGTTGTTTTTTGGTCATCTGGTCGAATTCGTCGCCCTGTTTTTGATTGGGTTTTTGTTCTTCGAGATCTTTGAAGGCCTGCACTTTATCACCGTATGCGGCTTCGATTTTCTTGTGCGTTCCAAGGGTGATAACGGTATCACACGGTTCGATTTGATCTGCCGTGAACAGCCGGGCGGAGCGCAGGCTAACGTGTGCAAATTTGTCTTTTGCCCGGATGTCTTCGGACAGGTCCTTCATGGCTTTGTCGTCATCGCCATAAATCAGGGTGGTTGTTTTCTTGCTGCTCATTGCAATCTCCTTATTGCGTTTATGATAAATTCTATGGCCTCTCCGCTGCGCATTTCATCCAGCGTCCACTGGGAGTATGCCAAGCGAGAAAAGTACACTCGTTTCTGCTCTTCTGATGGCGCCATGGGCAATTCTGCGTAAGGCGCAGATGGTTCGCAGATGACACGAACGCCATTGATAATTGCCTCGTGGCCCACATTCGAATTTATGCAATGAACGGCATGTACAGAGGCCCAATCGATTGGGCCAGACGATATACCATCATGACCAGTTGCGTTCACATCTGGGCTATCAGGGTGTGGCCGCCACAGAACGGGCCTCGCCCCCTTTGCTTTCGCAATTTCTCGGGTGGCCCAAGCGGAAATTCCATCCGCATCAAGGCCATGTGACGGATCGCCCACATGCTGACCGCAGACCAAGACGTATTCACCATCATGTTGCGGTTTGAAAAAAATCCCGAGACGATCAAGCCTGTCGGATAGGCACTGGAAAGGAGGAACCCACCCCAATCGGTTAAGGCCCACCTGCCAATGTCCGGTGGCCCATGTTTTTATCCCAGAAACTCGGGACATGTAGCCGTAATCAATAACGATAACCGGTTTACCCTTTTTAGCGTATGCGTCCCGGATAACTGTGCCTTTATCGCGCAGCCCAGACACAACCACGACGTCGAAATCTTCGACTTGATCAGGGGTAAACGGGTTATGGTTACGATATTTGAAAGGAAGAGTGGCGGCTCTTAACCCTTGGGCAAAAGCCGCCACTTCCATATTCTCTTCTAAGGCATAAAGCCCCAGCATTATCGGGACTGAATGATGCAGCCAGCAAAATCCTTGCTGGATGTGAAGGCGGCGTCCCAGTTTGATCCGGTACCGAGGGCGGCAGCGTTCGGGTTTTTACCGCCGTTCGCCACATCCCATTTGAATCCTTTGACGCCGAGGTTGTACCCAAACTCACCCTGCATACGCATAAGGATCTGTTCTTTCCCGGTGATCTCTTGGAAGGTCATGTACTCTTCCTCGGTTTCGTCCACGATCAGGCCATCAGCGGTCAGGCCGAGGGTCATGTAGTTCGTGTATGCGCCGGTACCTGTGCCGCCTGTTTGCACCAGAGACGCACTGTCCGTTACCAGAATAGGGCGGTTCAGGGTCACCGGGCCACCGTTGACAATGGTGGTATTCGCAATCAGGTCGCCGTTGTTGGCTGGCGTGACCTGGTATTGGAACAGGTCGAAGAACACCTTCGAGTGCATTACGAAGGCCTGAATGCGGTTTGCCTGATCCCCGAACTTGGAAAGCCCTGAAACCATTGAGGACGTGTTCAGGGTTCCGTTTGTCGGAACCGTGTACTTTACCGCTGCTTGGTTGTTCAGGGCGGCAACACCAGCCAGCAAGGAACTGTTCAGCATGTCGGCCAACGCATCAGCGGCAGCCTGTTCACCTGCAGCAACTTTGATAGCGTCCATATCCAAGCCCGGCTTCAGAATTGCCGCTCGAGACCAGTCAACCGGGCCGATTTTACGGTTCAGTTTTACGCTGATAATCTGGTCTTGCGTCAGTTTAATGGACGTCGCATCCGAAACGGATGTTTGATCCTGACGGGAAACCAGACCACCAGCGTTCTTGAAGAACGCTGAATAATCATAATCACCCGGTTTGCGGTTGCTACGGAAAACAATGGTGCCCGCTGATTGGCCGTTAAAGCCATCGATCATTTGGGTGATTTTTTCTGTGTATCCAGTTTGGATCAGCGGATCAAAATATTTCATATCGGTAGGAAGGGATGTCGCCATGATGTTGTCCTTTCAGGTCTGACAATTAAAAACCCATCGTATGATGGGGGGTTTCTTCTCGTTATTCGGGAAGCGCGAGATACGCTTCTTGGCCGTGTTCTTTGATGTACGCCGCCTTATCAACATGGTTCATCGCAGAACGCGTTTGACCGCCAGCAGCCTTGCCGCCACCGTTTGAGCCACCGGCTCCGCCGCCACTATTTGCTGTGGCGGCAATGAAATGCTTCCCGTCCTCGCCTTGCGCGAACGATGAAACGAATTCTTGAATGGGCTTGCCGGAGATTGTGGCCACAGCAGCGCCATCCGCATCTGATACCTCAGCCTTGTTTTCCGCCAAAATCATGGCCTTGGCCGCTTTCAGATGCGCCGGATTGGTTACACCGGCCTTGGTCAGGGCCTCGGTCAGGCCGTTATCGACCAGAAGTTTGTGCAATTTCGCCTCGCGGTCAGCCAGCTTTGCGGACAGATCGCCCTTTTCTTTGGCATGGCGCGCTTCCAGCGTGGATTTTATTTTCTCCACATCGCCCCCTTTGGCGGCGGCTTCCTCTTCTGCGGCGTCCTTGGCGGCTTTAAGTTCGTCCAACTGTTCTTGGATTGATTTTGACGAATCCTTGAGTTTTTTAACCTCACCCAAAAGCTCATCTTTTTTAGCCTTCAGGCCCTCAGTTTCTTTGTCGATCATCTTTTGCAGTTCGGCCTTGCCTTCTGCGGTATCGGTATCAATTACGGGCATTCAATCATTCCCCTCGGGTTTGAGTTTCGCGGCCTTGCCGCATTAAAAACCCGCCAACGCGGCGGGTTATTCTGAAAATGTCTTGTTCCAGATATTGGCATCGCGCCGCTTCAGTTCCGCCAGCGTATATTCGCGCCCGGTATCGTCGGCGAACCGGTCAAGCGGTAGGCCACCATCGCGGAACAGTTTGGCTTTTTTGACGCCTAAGACCTCCTGCTGGACGGATAGAGGCTGGCGTTTAAGCCATTGCTCATAGGTCCAGTCATCGGGCACCGGGCCAAATGCGCTGGCCCGCGATCCCTTGATCGATGTTTCGCCCATATACGGAACCTTTCGGCTCCGGCACCGGAAGTGCGCCGGGATCGGTGGCGCGGATTTGACCGGGTACACGCGGCCATCGCGCTCCCGGCATGTGGCCGATGTGCGGCTGTCGAGGGTTGAAACCCATTTCAGGCCCTTGATTATGTCGGCGTTGGCTTCCCACACATACTGTCCTGATCGGTCAGCAACATGGGCAATGGCGGTGCGCACGATGGAATCCACATCGCGTCTGGATATTTCCAGCACACCGTCACGGTACCGCGCTGCGCGTGTCCCGCGTATGCGGCGCACGATCTGGTCCGTTGTCTGACCCTCGGCAATACCTATCCGGACAGCATCATTGATGCGGCGGGCGTCGGTCTGCGCCAGGCTGGAATACCATTCCTTGAGCAGGCGGCCCTGAAACGGCTGGGCCCGTACAATTGCCCGGAGCTGTGACGTTGCGGGGCGGTTCATTTCGATTACCGCGGCACCGATTGCTGATTTCTCAATCAGGACGGCCTGAAATTCTGCCTCATACTCCGCAAAATCCACCATACCTTCGGCGGATACGTCGGATAGGCCATCATAGAGCGTGGCCCGCATCTTTGCGATCGATGCACCCATGTCGCGGATGCGTTTCGTTGTCTCCGGCCCGAGGTCAACACCTCGTTCCTCGATCTTGTGAAGACGATCCAGAAGGGTTTTCTGCAGGCCCGCATCCGCCTTGTTCAGGAGGGCGATTATCTTCCGGACTTCACTGGCCTTGAACCGTTCCAGCCAAACCATGTGGCGAACGGAGGCATCAAGGATGCGCTCATTCGCTGTCGTCATCGACTGTCATCGTTCCGAGGGCAGGGGCGCTATCGGCTTTACGGTCCTGTTCCTCTTCAAAGGTCAGGCCTTCACTGATGACTTCACCAGATTTCAGGCCCTCAAAGAACGTCCTGTCGGACACGCCGCCCGATTGCCATGCACCAACCCATGCGGTCAGGTCTTGAGCCGACATAGGCGTAGGCAAATAATCCTTGTTCATTTCATAGACGGCTTTACCGCCGAGCCCGGCCCAGTCCACCATAAACTGTAGAGCCTTTCGGACCTGCATTTCAACGGAGCCGGCCAGCGTTGCCAGAACACTGTTTTCACCGCCGCGCTTTATGCCCAGAGCGTCCGCCGTTTCTGTTTGGCGCTTCTCCGGGGCCAGCATGCGGGCACCAAGGGAGGCCATTTGTGCGTCTTTGCGATCCATAAGTTTTTCAATGGACGAGAAACCTTCGGCACCACACTGCAGGAACCCGGCGGTGGCATCCCGCGGCAATGCCATACCGCTACCGCCCATGTTGAATGTAAAATTTCCAGGATCATCGACGCCAGTAACGTAGGGCGTCGGCAGGCCAGCCATGTGCGCACCATTCTCAAGATCGGCGCTGTTCCGATACTGGGACAGGTTCAGATCGGCCAGATCCTCGATCGGCGGTTTTTGAACGCTGCAATCCGGCTCTTTTGGGGCCAAAAAGTAAAACGGTATATCGGTCAGGGGCGTGCCCCCCTTAAGAGGCTGCGCCATGTCGGCCATTTCCCAACCCTTGCCAGTCTGACGCCAGATGATCTGGACGTAGTAGCCGTCATAAAGGGTCAGTTCGCGGATCTGCGGCTTTACGGTACCGTCCACATCCTCATAGATTTCTTCAAGCGCGACGCGGGAAAGCTTTGTGACATTCCCAATCCTGGCCATTTTCCAGTTCAGGATGGATTCGGCTTTGTATGCGGTCATGTACGGTCGGATTTCCAGACGCGACATGTCATGCAAGGTCAGCGCGGCCCCGGACGCAAGATTCGGTGAGGGCGGATAATCAACCAAAATCCCGGCGCGGCCAACCGTGATCACGTCCTCGGTCAAACCGCGCACAAATCCCTCCAGGCTCATTCCGGCCAGATTGATGTTGGAGGTCCAAGCGTCCTTTACCGACGCGGGCAGGTCAACCACAGGAGCCTTGCGGAATATCAGGCCTGTGTAGCCATCAACCGTCCTGCCAGTGGCATTGTAGAAAACCGCACGCAATTTGTACGCGTTATATGCTTCCTTGCTTTGGCCATCCAGTTTTGGCAGATAATTCTCACCGCCGGCATGAACCTTGCGTTGCCCACCAACCACATCGCGGCATTTTCTCCACACCGGTGCATACTCATCGTATTCCGGGTGTGTTTTGTCTGGTGTTGGTGCGGTCATTTACCTGTGACTTTGATGGTTTGGACTTTGTTGTGAGTAATCGGGAATAGATAGGCAATCGGGTAACCGCCTGCATCGTTGGCGTGGTCGTGTCCGGATGATTTGTCCGGCTCCCCATTTTCACCCCATATCTGCCGCTCAAGCGATTGGGTGAATTTGGGGCAGGCCGCTGTATTCACCAGATAGCGCCGATCTCCGCATGTGTTGCAGAACATTGCGTTCATGCTGTTGATGCGATCTTTAACCGAAGGGTTCCGCGAATTGACGCGGACGATAAACCCAGCCTGTTTCAGCAGGGCTATATCCGTTTCACTGGCATTGCTGCTCTTGCGGTTATCCCCGGAGGCATCAGGATAAACGATAATCTTGTGGCCCGGGTACCGTTCCTTGATCTTGGCGATCATGGCCGGCGTGTCAAACAGGTCAATCAGTTCAGCGGCGGCACGCGGCAATCCATCGCGCATGACATTGACGATCCCTGCCATTTTCCCGACGTTGAAGTCCATCCCGATATGGAGGTCTTCACCCGGCTGTACGGTATCCGTGCAATGGTTCAGGCGGCGGTCGAAACAGTAATAGATGACGCCCTGATAGTTTTCAAAACTGGCTTCGTATTCCTGCCGAAACGTCCGTGGATCCATTGTCCGGCGCGCGGAATCAATCTCATCCTCTGGGACGTTTCCACCCTGGAGCGAGGTGTAAAGCCAGCTTTTGTGATCTTGAAGCTGACCCTGTCCGTCCAGATAGGTGTCGTAACAGTGGTTAAACCCTTTGGGCGTTCCGATGCGCAAAGCATGGCCGCCGCGGTATTCGACCCCATCCACCGTATAACGGCAGGTGGACAGCATCGGACGCAGAACTTCTTCCCATGCCTCATAGGCGCAATCGGCCCATTCATCGACCAGCACAAAGAACAGGCCGGAGCCGCGGAGGTTATCGTAATTGTCCAGCCCCACAATCCGGATAACGTGCCCGGTGCGCAGGGTGATGCTGCATTCTGTTTCGTTCGGTTTTCCGGCGCGCCATTCGCGGGGTATCGCCTGCTTTAGCCTGCGCCAGAAAACCCGCTTAGCCTGTTTGAATGTCGGGGCCGCGTACCAGATTTCATCCTCGATACTGACGCCCCACTTCATGGCCAAACGGGCAGCGCGGCGCATTTCAGCCTTGCCAAGGTATGTCTTGCCAAACCGGCGGCCACAAACAGCGTCGCGAAAGCGGGCGCATTGTTGCCATCCCCAGACAAAGATGTTGGCCTGTTTGGGCGTCAGAGCCACCGGGCCATCAGAGGATGGGCGCATTCGGTACCGCCTCGTCAGGCTTCAAAGCGTATTCATCACCAATCGGCGCATCCGAGGGCGGAAGCGCCTCTGGTTTAGGTGCCCATGTATCGCGGCGGCGGTTTGTCAGCCAAACCGTACAGGCCTGCGTATCTGGCGGGGCCTGTTTGCGAATGGTCACGACCTTGACGTCTTCATACTCACGGGACTTTTTACCGCCCTCGTACTCAATGACCTTAACCTTGAACGCGACCTCTTCGGTCCATTCTGCACCATTGGCACGTCGGAACAGAGAATCCGCCACCTCGGTGTCAGCGGGCGTCTTGCCTGCGCGTATAGCCTCCGAGAATTCGGGATAATCGACTTTCCACTTATTGATCGTGCTTTCCGATTTTCCAAAGAATTCAGCAATATCTGTATCTGTGGCATCAGATTTAAGCAGGAACAATTTCCGGACTTGTGGCGCAAATTCTGGGCGGTAGTCACTTGGACGGCCTGCGCCAGTTCCTTTTTCTTTTTTTGGGGCCTTGCCCTGTGTTTTCTTTGCCAT
Protein sequences of DBSCAN-SWA_2 >NC_016026|563889:581959|573573_573849_-|WP_014102114.1|DBSCAN-SWA MTGCRIGKVTPKLSVVEPMPYPINYDAVRYLTQALEQAKTGEFRSIAIAGINAQGYVQTAYSISPDENPVLLHGACGWLSKRVMDEVQKSA >NC_016026|563889:581959|573169_573577_-|WP_014102113.1|DBSCAN-SWA MSADFRRQLNKAYEEKVLGTQEKVTRLIALKVFNNIVENTPVDTGRARANWNIDLNTVDVSLVDPSGMEGGSYDGTEKALPVIGSYKLNDTIYISNNLPYIRPLNDGSSTKAPAGFVDAAVQIGVRQGKELAKQS >NC_016026|563889:581959|576930_577602_-|WP_014102120.1|DBSCAN-SWA MPVIDTDTAEGKAELQKMIDKETEGLKAKKDELLGEVKKLKDSSKSIQEQLDELKAAKDAAEEEAAAKGGDVEKIKSTLEARHAKEKGDLSAKLADREAKLHKLLVDNGLTEALTKAGVTNPAHLKAAKAMILAENKAEVSDADGAAVATISGKPIQEFVSSFAQGEDGKHFIAATANSGGGAGGSNGGGKAAGGQTRSAMNHVDKAAYIKEHGQEAYLALPE >NC_016026|563889:581959|572768_573173_-|WP_014102112.1|DBSCAN-SWA MKFEQAEAAVRAYFNTGWAARTPIAWPDVSFTPPNNATWVRFSMKNNDGYQASMGSPGSNRFRRVGMVYIDIFQPAGQGSIDARKKADIAHDIFLANTLPGITFSNINARDIGKDATGAYQWKVSAAFKYDRIA >NC_016026|563889:581959|563889_564993_-|WP_014102105.1|DBSCAN-SWA MIKYADRVLETSTTTGTGTINLAGPTIGAQSFVSGVGNGAKVAYFIEDGTDWESGIGTVTAGTPDTLSRDAVLESSNSDALVNWGAGTRNVFIGAISHAMVWRDENLVSKEGFSTAGGTANAQTLTLYPAPIGFSDGMLMAWYSAGNITGSATVNPNELGAKTLKWRGSDLTSGAYNTGDLMVGRYSEANNWVEMVNPPRNVFATATQGAKADTALQPLAGQIVKTAIVEYATSAAITATFPLDDTIPQNTEGIEIISTTYTMVNSGNNLRVYFQCQGSHSTATASYAGIFRDSVANAISFGFTSGSAGGFSQTRIPPVIITPGASTIDLKVRIGVSAGTYYINGISTGRMFGGVSKATLVIEEIKI >NC_016026|563889:581959|570929_571175_-|WP_148260406.1|DBSCAN-SWA MPKALLERPALLPGLDFYFNAYIDLQNDRPMGMAIGPIPWSSIVTFAQIHSLDLDDLHELNHLIRKMEQADREHDEKKGNA >NC_016026|563889:581959|580128_581430_-|WP_014102123.1|DBSCAN-SWA MRPSSDGPVALTPKQANIFVWGWQQCARFRDAVCGRRFGKTYLGKAEMRRAARLAMKWGVSIEDEIWYAAPTFKQAKRVFWRRLKQAIPREWRAGKPNETECSITLRTGHVIRIVGLDNYDNLRGSGLFFVLVDEWADCAYEAWEEVLRPMLSTCRYTVDGVEYRGGHALRIGTPKGFNHCYDTYLDGQGQLQDHKSWLYTSLQGGNVPEDEIDSARRTMDPRTFRQEYEASFENYQGVIYYCFDRRLNHCTDTVQPGEDLHIGMDFNVGKMAGIVNVMRDGLPRAAAELIDLFDTPAMIAKIKERYPGHKIIVYPDASGDNRKSSNASETDIALLKQAGFIVRVNSRNPSVKDRINSMNAMFCNTCGDRRYLVNTAACPKFTQSLERQIWGENGEPDKSSGHDHANDAGGYPIAYLFPITHNKVQTIKVTGK >NC_016026|563889:581959|574678_575077_-|WP_014102117.1|DBSCAN-SWA MSSKKTTTLIYGDDDKAMKDLSEDIRAKDKFAHVSLRSARLFTADQIEPCDTVITLGTHKKIEAAYGDKVQAFKDLEEQKPNQKQGDEFDQMTKKQLAEYLTANKVTFETDANKAALADLARAAKLATGEGQ >NC_016026|563889:581959|571237_571657_-|WP_014102110.1|DBSCAN-SWA MSFYKHFTTDKNLESGKGVDLDYGASGIITIHRAGGSNQKYAKVASAKLKPYARQIQAKTADVDVINRVMAEIYAESIIIGWSGVSGPDGKPMKFTKENCVKLLIDLPELFVNIQQAADDFALFRAEQLEEDAKNSAKS >NC_016026|563889:581959|575876_576869_-|WP_014102119.1|DBSCAN-SWA MATSLPTDMKYFDPLIQTGYTEKITQMIDGFNGQSAGTIVFRSNRKPGDYDYSAFFKNAGGLVSRQDQTSVSDATSIKLTQDQIISVKLNRKIGPVDWSRAAILKPGLDMDAIKVAAGEQAAADALADMLNSSLLAGVAALNNQAAVKYTVPTNGTLNTSSMVSGLSKFGDQANRIQAFVMHSKVFFDLFQYQVTPANNGDLIANTTIVNGGPVTLNRPILVTDSASLVQTGGTGTGAYTNYMTLGLTADGLIVDETEEEYMTFQEITGKEQILMRMQGEFGYNLGVKGFKWDVANGGKNPNAAALGTGSNWDAAFTSSKDFAGCIIQSR >NC_016026|563889:581959|567495_570933_-|WP_014102108.1|DBSCAN-SWA MTTDAEIVVAVRGETNGGRTVKRTLDDIANSGDKVTSATQRAERQMQSLDRAAGALRNTMMGLLGAFGIQQILKSADSYQVLMARIQGVTRATGDYTRVSRELFDISQRTGSALEGNIQLFQRINIGAKELEKTNSEVLALSETINQLGLLGGASTMALAAGTMQFSQAMAGGILRAEEFNSIMENIPEVGAAIAKGMNLTVGSLRQAVLAGEVLAKDVFDSLAGQSAEIAARFADMPLSLARATQMLSNALQKYIGETDATIGGTQSLAKAVAFLAENIDVVGNTVLILAGSALPALAAAISSVVIPALAALAAALLANPIMLLATILAGASTAAIIFRDDLAALIDPMATFGDVADEVLLRVKNNVYDLIDGLSILILTLKKVLPGELSAYDQGVIQNSKTSIRSRSNTPLFGGFSESVADRAADRVMKEASKNQLQYGGKLEAHLRGVAASSTQAQKAYQTWLGTLSGKGSGGGGGGQSGDAKSAEETAKKLAALVKSTATEQEKYNAKIKELEALRGYAKTAEEITAIERGIKAAGDELEKVRVQAERNGPVAKAFESLANQIDDGFRDAFRTGFTESDGGFKKLIEGWKSTFKTFLADLAYMALARPVILSVVGGIGGAMGISSGAQASILGDIGGTGGGGLGGIGNLLGLGKNLLTGGSITMAQGFTGLYNTGAGLGSSLGLSFEASGMLGKAFGGLPFGALGGLGANLLGLGSGNGLVDTGLGTVGSLIGGGFGGPIGAAIGGFLGTAVGGLFSGGPKRETIGTSYNVGKDGLLGINAVSTKGVDMSKANQFAEGVTKSLNAVASALNATFAQAIPLIETNIGKKDTGTFWGGSRSKISSTAGDINAVIRQVLNSDSYLTGANSDVMAVARRSLSLGSDVEQLVSDITFARSILEDTATPAESAAEAIKQINEQFDAMYTRAAQLGLPLDKVTEALEKQRDVAIGAVKAMQAGFQSMEAMKATFDGWLYEQSMSSVSSASPMDKLKLAQDQFGKLLSTAQGGDYSVTQDLLKAAQQLLTVGQGVYASSTSFAALESFVRSSIGQIAKELDIPGYATGTASAPAGLAWVGERGPELVRFRGGERVHTNQESMEMVGRSSAQMEVKMAENNEQIAAMRKDLREMSRQLSRVANQMIVARS >NC_016026|563889:581959|571779_572706_-|WP_014102111.1|DBSCAN-SWA MADTSQTRLAYIAESVYGTTPATPTFKKVRYTGESLKHARQNITSNEIRPDRNVADLVQVAGGAEGGVNFELSYGTFDDFLESFMYSAWSSNVLKNGITPKSFSIEKTFETGATDNFFRYTGMIANTFNLDISTQQIVTGSFGFMGKGGSVATTAISGATYTDAPTYDVMNAGTNFASLAVTGVTSPKITALSLQGTNNLRQKPVVGSVESLGVGAGRFVLTGTVTMYFENKEAYDLFLAGTATDLTFTIGGASETNYEFVIPRLKFSDADVATPGNDQDVMITLPFQALYDPTEQCTLKITRVPETP >NC_016026|563889:581959|565825_567496_-|WP_014102107.1|DBSCAN-SWA MFAPFGFLPYGYIPPEYDNSVPDPFRALLSEPDIQLRYLIELHPYDESNIVEHTWLPAPIGWLPFGYTKAKTLGGIETVYLSDMKFITEPTDTPANQYFAPVVNNPLQFDFSILRGEEFGISSPSFGAIQIQNGNGDFDYLAGLSWKGRRIVVKAGGPSFKYSQFTTVFDGLCNAIEFDDDVITLTIRDNGLKIDQDIVSATYDGTGGLEGGDDLSGKMKPLLYGEAFNIEPVLVDPVNLIYQIHAGPMMGVDAVYDKGALLAFDEDVTDIAEAAPDAGKYATSLSSGFIKLGSTPAGRITADAHGDSAGGYVSSAADIARRLVMTKLGIQSFSATEIDGAAFNRLDDEIPGAMGLYITEKAPIRNFLDALINPCGAYWNFGRTGMLTAGYVDDPGVEIATINASNIDTSGIEAPAPISPAWRISVAYAPAWVVQKEDELAGSATDGYRTFVGQDYRTVVSEDRSVRTRNAQAVERVFYTTLADKADAESLLARLVRIYGVERKIYRVPSYGTLFRLYVGDPVKLEYPRFNINKNLSVVGISEDAETNQTTLELWG >NC_016026|563889:581959|564989_565826_-|WP_041793749.1|DBSCAN-SWA MANMLLATPVLSDAATLSASTYPASLPVGNLKRMPIGEVCRFIDPENAFIQIDLGTEKAVNLIALLGHNGSSRGYARVRAADVEADLLADPDYDSGNLPLRSHQSGYDETWASGVDDEQYGALDTNHFILHLDTAQNYRYWRIDMVDTEISYLDVGRLYMAKAFQPVTNMDYGVADGFIDPSTIARMVSGKNIANERPKYRFCELKLSFASEQEMYDIAFEIDRLRGTTRDVLFINDPSKKDTLQKRTIYGLMKGLAPIVNANFQLFEKSYRIEEIIE >NC_016026|563889:581959|578719_580132_-|WP_014102122.1|DBSCAN-SWA MTAPTPDKTHPEYDEYAPVWRKCRDVVGGQRKVHAGGENYLPKLDGQSKEAYNAYKLRAVFYNATGRTVDGYTGLIFRKAPVVDLPASVKDAWTSNINLAGMSLEGFVRGLTEDVITVGRAGILVDYPPSPNLASGAALTLHDMSRLEIRPYMTAYKAESILNWKMARIGNVTKLSRVALEEIYEDVDGTVKPQIRELTLYDGYYVQIIWRQTGKGWEMADMAQPLKGGTPLTDIPFYFLAPKEPDCSVQKPPIEDLADLNLSQYRNSADLENGAHMAGLPTPYVTGVDDPGNFTFNMGGSGMALPRDATAGFLQCGAEGFSSIEKLMDRKDAQMASLGARMLAPEKRQTETADALGIKRGGENSVLATLAGSVEMQVRKALQFMVDWAGLGGKAVYEMNKDYLPTPMSAQDLTAWVGAWQSGGVSDRTFFEGLKSGEVISEGLTFEEEQDRKADSAPALGTMTVDDDSE >NC_016026|563889:581959|573845_574214_-|WP_014102115.1|DBSCAN-SWA MTEFYENMAKTALKQIADKGREITVRRPGEGQVYDPSNDTFTAATPVDETVKALFTNFSKNDVDGEKIKRTDKRVLIAGASLDMAPDTDHKIIDGGVTYNVINTDVVQPGDTVLLYMVQVRK >NC_016026|563889:581959|577668_578736_-|WP_014102121.1|capsid|DBSCAN-SWA MTTANERILDASVRHMVWLERFKASEVRKIIALLNKADAGLQKTLLDRLHKIEERGVDLGPETTKRIRDMGASIAKMRATLYDGLSDVSAEGMVDFAEYEAEFQAVLIEKSAIGAAVIEMNRPATSQLRAIVRAQPFQGRLLKEWYSSLAQTDARRINDAVRIGIAEGQTTDQIVRRIRGTRAARYRDGVLEISRRDVDSIVRTAIAHVADRSGQYVWEANADIIKGLKWVSTLDSRTSATCRERDGRVYPVKSAPPIPAHFRCRSRKVPYMGETSIKGSRASAFGPVPDDWTYEQWLKRQPLSVQQEVLGVKKAKLFRDGGLPLDRFADDTGREYTLAELKRRDANIWNKTFSE >NC_016026|563889:581959|575073_575877_-|WP_041793751.1|DBSCAN-SWA MLGLYALEENMEVAAFAQGLRAATLPFKYRNHNPFTPDQVEDFDVVVVSGLRDKGTVIRDAYAKKGKPVIVIDYGYMSRVSGIKTWATGHWQVGLNRLGWVPPFQCLSDRLDRLGIFFKPQHDGEYVLVCGQHVGDPSHGLDADGISAWATREIAKAKGARPVLWRPHPDSPDVNATGHDGISSGPIDWASVHAVHCINSNVGHEAIINGVRVICEPSAPYAELPMAPSEEQKRVYFSRLAYSQWTLDEMRSGEAIEFIINAIRRLQ >NC_016026|563889:581959|574210_574675_-|WP_014102116.1|DBSCAN-SWA MAFVVEDGTGLPDANSYASVAEADAYHTDRGNAAWTGSNSVKQAALVRATDYVEQAYAGRWKGCALTEEQALSFPRDEAVVPAKLKQAVIQLALEAIATDLNPTQDRAVKREKVDVIEVEYMDNARNGKTRPAIDGLLSGLLSGSIYNRPLVRT >NC_016026|563889:581959|581413_581959_-|WP_014102124.1|DBSCAN-SWA MAKKTQGKAPKKEKGTGAGRPSDYRPEFAPQVRKLFLLKSDATDTDIAEFFGKSESTINKWKVDYPEFSEAIRAGKTPADTEVADSLFRRANGAEWTEEVAFKVKVIEYEGGKKSREYEDVKVVTIRKQAPPDTQACTVWLTNRRRDTWAPKPEALPPSDAPIGDEYALKPDEAVPNAPIL |
20 | Pseudomonas_phage(21.43%) | capsid | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
2325817 : 2372236
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NC_016026|2325817:2372236|DBSCAN-SWA CATGGACACAGCACAAAACCTGCAGCACCACACCGATTTACCCACCACGCCCGACGCTTTGTTCAAGCGGCTGGACGATCTGGGCATCCTGTATACGACGTGGCACCACCCGGCGTTCTTTACGGTGGAAGAAGGGCTGGAATTTGAAAAAGACATTCCTGGCCTGCATTGCCGCAATTTGTTCGTGCGCGATAAACGCGAAACCATGTTTCTGGTCTCTGCTGCCAATGAAACCAAGATTGATTTGAAGAAACTGTCGGCTCTGCTTGATTGCGGACGGTTGTCGTTTGGATCGCCGGAGCGTTTGTGGGCCAATCTGGGTGTGCGGCCGGGATCGGTGTGCCCATACGCGATTATCAACGATACCGCACAGGCCGTGACCATGGTGCTGGATGATACGATCATGCGGGCCACCACCGTCAATTTCCACCCGATGGTCAACACCATGACCATTGGCGTTGCGCCCCAAGACCTTGTACGCTTTATCGAATCAACCGGGCATACGCCGTTGATCCTTGATCTCAGTGCTACCGCTCCTGAAGGAGAATAAAGACCATGTTATTCGGCAATTCCAAAGACGCGCAAAAAAATGATGCGGCCACCAGCACCGCCGCGAACGATGCCATTTTCGATGTCGGCACAGAGGATTTCGAAGCCAACGTCATGCGCGCATCGATGGACACGCCAATCATCGTCGATTTCTGGGCCCCATGGTGCGGACCATGCAAACAATTGGGCCCCACATTGGAACAAGCCGTCATCGCCACCAAGGGCGTAGTGCGCATGGCCAAAGTGAATATCGACGAACACCCGGAGTTGGCGCAGGCGATGCGCGTGCAATCCATTCCGACCGTGTTTGCATTTTTCGGTGGTCAACCCATCACCGGTTTCACCGGCAACCGCCCGGCCAGCGACATTAAAAAGCTGATGGACCAATTGGTGCAGCTGGCCCGCGAAAGCAAACCGGATGCGGTGAATATTCCCGAAACACTGGAGGCCGCCAATCAGGCCCTGGCCGACAATAACCCGACACAGGCGCAGGTTTTATATTCCACCGTGCTGGAAGAAGATGAAAACAACGTCGCCGCGTTTGTTGGATTGGTGCGCGCCTTCATCGCCGATGGCGATGTTGAAACCGCCGCCGGGATGATTGAAAACGCACCCGACACCATCGCCAAAAATTCACAATTTTCCGCCGCCATCACCGCCGTTGAATTGGCGCGGCAGGCCGTGCAGGCCGGGCCGGATAATGGCACCAGCGATCTGGCCAAAGCCGTTGCACAAGCACCCGATAATCACGCCGCACGATTTGATTACGCCATGGCATTGTTCGCCGCAGGCGAGCGTGAAGAGGCCATGAATCAGTTGCTGGACATCATTCGCCGTGACCGTGCGTGGGAAGACGAAAAAGCGCGCAAGCAATTGCTGCAGTTTTTTGACGCCATTGGTCCGGCGGACAAGAGCGTGGCCGCCGCACGGCGGCAATTGTCATCGATTCTGTTTTCGTGATTCTGCTTTATGCGGCATCCCCGTGAAACGGGGATCCAAACGGCCAGACCCATGGATTCCCGCTTTCGCGGGAATGCTGCTGTTAGAAACTGCATAAAAACCCCGCATTTCCGGCCTTTTCCCGCCGCGATGCACCATAAAATTATAATAAAATGTCACCGCGCGGGCTTTTTCTTGTAACAAAAATGCCTATATGATTAACACTGTGTATGGTTAACCGGGTGGAATACGTGCGTACTCTCTGAATAACGGCTGTGGGGCTATGGGTTGTGACTTGCCACCAACATTGAACGATTTGCCCGCGGAAATTCCTGTATTTCCGCTGAGCGGCGTTTTGTTGTTGCCGCATGGCCAGTTGCCCCTGAACATCTTTGAGCCCCGTTATCTGAGCATGGTTGAAGATGCACTGAAATCCCATCGTATCATCGGCATGATCCAACCGCGCGGTGCGGACAATGCCCATCCGGCCTTGTTCGAAACCGGATGTGCGGGCCGTATTGTGAATTTTTCTGAAACCAATGATGGCCGCTATCTGGTCACGCTGAAGGGCGTGGCCCGGTTCCGCGTGAAATCCGAACTGGATCAGGGCCGCAACGGCTACCGCCGGGTGCAAGCCGATTGGGCTGATTTCTCCAGCGATCTGGACGCGGTGAGCTGTCTCAATCTCGACCGCCCGCGTTTGCGCGGTTTGCTGGAAAGCTATTTTGAACTGCACGGCCTGTCCTGTGACTGGAACGCGGTGGAAAGCGCGACGGATAACAAGCTGATCACCTGCCTGTCCATGATTTGCCCGCTGGATGCGGGCGAGAAACAAGCCTTGCTGGAAGCCTCCTGCTGCAAAACCCGGGCGGATTTGTTTATGACCATTCTGGATATGGCGGTGCGTCAATCCGGTGCATTGCACACAAGCTGCCGCCACGGGTCTGACACGTCACTCTGTCATTGATCCCGCCCGCATCACCGGGGTATAACCCCCGTATGACACAGACACATCAAATCGACCCGAAATTGCTGGAAATCCTGGTCTGTCCCCTGACCAAGGTGCCGTTGCGTTACGATGCAAAGGCGCAGGAATTGATTTCAGATCAGGCAAAACTGGCCTATCCCATCCGCGATGGCATTCCGATCATGCTGGTCGACGATGCGCGCAAGATTGACGAATAGCAAAAAGGATGACGACCATGGACCGCGTTATTATTTTCGACACGACCTTGCGCGACGGCGAACAATCCCCGGGTTGTTCCATGAACCATGATGAAAAACTGCGCATGGCCGCATTGCTGGATCAGATGGGCGTGGACGTGATCGAAGCCGGTTTCCCGATTGCCAGCAAGGGTGATTGGGAAGCGGTGAAGGCGATTGCCGGAACGGTCAAAAACGCGACCGTGGCGGGATTGTGCCGCGCCAAGCGCGGTGACATCGAATCCGCCGCCGAAGCGTTGGCCCCGGCGAAAAGCCGCCGCATCCATACCTTCCTGTCCACATCACCGCTGCACATGAAACATAAATTGCAGATGGAGCCGGAAGCGGTTCTGGATGCCATTCGCGACAGCGTCACGCTGGCCCGCCAATTCACCGACGATGTTGAATGGTCCGCCGAAGATGGCAGCCGCACCGAAAATGATTTCCTGTGCCGCGCCGTTGAAACGGCCATTGCCGCTGGTGCCACCACCATCAACATTCCCGACACGGTGGGCTATGCCCTGCCCGCAGATTATGCGGCGAAATTTACATTGCTGCTGAACAAGGTGCCGAATATCGACAAGGCGATCTTGTCCGTTCACTGCCACAACGATCTGGGGCTGGCGGTTGCCAACTCACTCGCCGGTGTGATGGCGGGCGCGCGCCAGATTGAATGCACGATCAACGGCATCGGCGAACGCGCCGGCAATGCCGCATTGGAAGAAATCGTGATGGCCATGCGCACACGCGCCGATTCATTGCCGTATAAGAACAATATCGATACGACGATGATTACCAAGCTGTCGCACGCCTTGTCCGACATCACCGGATTTTCGGTACAGCCGAACAAGGCGATTGTCGGCGCCAATGCGTTCGCGCATGAAAGCGGCATCCATCAGGACGGCATGCTGAAAAACGCGCAGACCTATGAAATCATGACGCCGGAATCCGTGGGCTTGAATAAATCGGAACTGGTGCTGGGCAAACATTCCGGCCGCCATGCGTTCCGCGCCCGGCTGGAACAGCTGGGCTTCGATCTGGGCGACAACGCGTTGCAGGACGCGTTCGTGCGCTTCAAGGATCTGGCCGACCAGAAGAAAAGCGTGAATGATGATGACCTGATCGCGCTGGTGGGTCCGGATGCACCCCGGGTCGAAGTCGGCCCGTAAGGGTATATTCAAAAGCGCTAAAGAAAAAACAGCCGCCCCGGACGGGGGCGGCGTTTCTTTTTAGATCTCCCGTTTACGGAAGTTTTTTTCCCTGTTCAATTTGTCTCGCCGGTTAAGGCCTAATAAAGTTACAATTAAGACGGATGGATTGTCAAAAAATTGCGCGGAACTTATCCACAGAAAGCGTGCACATTATGGAATGTATCGGTTGATAGAATGAGCGCAAACCCACCCATCACCATACAGGATGTTTTGACCGCCTATGCCATGGGCCTGTTCCCCATGGCCGAACGGGCGGATGAACAGGCCTTTTACTGGTATGACCCGCCCTTGCGCGGGCAGATGGATATTGTCGGCCTGCATGTTCCGCGCTCGTTGAAAAAATTCGTTTTGAAATCCCCGTTCACCATCACGGTGGATCAGGATTTCCCGGGCGTGATGGCCGGGTGTGCGGCACGCACCGATGACCGACCACAAACATGGATCAATGACGGCATCCGCACTTTGTTCACCGACTTGTACCGCGCGGGATTTGCCCATTCGGTGGAGGTGCGGGATGCGGGCGGTGCCTTGGTCGGCGGCCTTTACGGCCTGGCGATTGGCGCGGCGTTTTTTGGTGAAAGCATGTTTTCGCGCGAATCCGGGGCCAGCAAAACCGCCCTGATCCATCTGTGCGCGCGGCTGTGGCGCGGGGGCTTTACCCTGCTGGACACCCAGTATTTGAACCCGCATCTGGAACAATTCGGGGCCTATGAAATCCCCCGCGACCAGTATCTGGCCCGCCTGCATCAGGCCATCCCCCGCCCGGCAGATTTCAATTTGCGCCGGAATCCCGGGTTAAGCGAGGACCAGCTGGTCCGTGAGTGGTTCGCAATGCGCGCAACAAATCAGTGACTTGCCTGATATTTCCCATAAAAATTGACACTTCGTTAAGGATTTGATAGGCTGAGGTCAACTCAAAGACATCATTATTGTCGAGGACCGTGCCGATGAATCGTGACGTATTACACAACGAAGCATCCACCCAAAGCCTGAAAGGGGCGTGGACCCGGCATGCGACCGTCGATGGGGCCGTTGTGGGTGGAATGAGGGGCACCACCCTGTCGCGCGATGCGTTCAAGGCGCAAACCCGTTTCCTGCCCGTCTTCGCCACTGCGGCCGACCCGCACAACACCCCCCGTTACGTCGTCATGGCCGCCCGCCAATGGCAGGGCGAACACGCCGATGGCACGATCCGCACCCATGCGGTGGCGGATGTCCATGTTCTGGACCAGCCACATATTGACCGCCTGTTGACCCGCAAGCACATCACGCAGAACCAGATTTTTGACCGCGTGGGTGATGCAACACGCGCCGCCAAAGCCCAAAACGAATCCACCTTTGAAACCTGCGCCCGCGATTATACACAGGGCAGACACGCCGGGATCAAAGCCTGGCGTTTGGTACAACAGGGTGATGCGGCAACGCCGTCCCCCGCACAACACCCCACGACCAAATCGCGCGCGTTTTAATACAATAAATAACAACACGACGACCACGACGGATGCGACGATTATCGCGCAGACCTCGTCGTAGTCGTTGTGTTGAAATTATTCCGCTGACGGCGGGGGTTCCTCGACGGGAATATCCGTGTCGGCCGCAGGTGTATCCGACGCGACAGGGGCTGCCGGAGCATCCGTTGCTGCCGCACCTTCAGGCGCGGCGGAAGCGTCCGCAGGATCCGGCGCAGGTGGCTCTTCCCCGGTTTTATGGGTCAGGCAATCCAAAACCCACACATCATAAATCGGGTGATCCATTGGCGACAAAGCCGGGGATGACGCAAACATCCAGCCACTGAAAATCCATTGCGATTCTTCGGCCTTCGGCGTCAATTCCCACACCTGCAAGAAGGCGGCGGATTCCGGCGTTTCGATCGGCGGTGCTTTGCGGCAGGCCTGAATTTTAATGTAGAGCGTACCGAATTTAACAGTCGATCCGACATTCGCCTCGAATGTCATTGTGCGCGCCGTCACCTTGTCCAGCGATTGCAGCTTCACGACGGGGAAGTCGTCCATCGCGGCGCGCGCAGGAAGGGCAAACCCAAAAGCGCACAGCGCCAGAAGCGTCGATACGACCGGGCGGACGCGCTTACGGATGCGCATCAGCCGATTCCGCACCATTGCCATTCTGATCTTTCGGCGCGCCCTGCATGCTGAAGATAAACTTGCCGAGCAATTGCTCGAGGTTTTGCGGAGCCTGGGTATAGGAAATACGGTCGCCGTTTTTCATCATGTCTTCGGACGAACCCGGCTCCAGCGACAGGTAGCGGCCACCCATCAAACTCTCGCTGCTGATCAAAGCGGCGCTGTCGTCCGGGACTTTCACGGACGGATCAACCGACATGCTGACCTTGGCCAGATAGGTTTCCGGGTCCAGCTCGACCGCGGATACGGTGCCGACTTTCACACCGCTGATCTGCACATCGTCGCCCGCACGCAGGCCACCGACGGAGGAAAAATTCGCGCTGATTGTATAGCCGCTGACATCACCGACATTCGCGGCGCTGTAGCTGAAAATCAAAAATACGGCGGCGACCAGCAAAACAACCGCGCCCAAAACCGTTTCAATCAGACTGTGTTTCATTCTCTTAATTCACCCAAAAACGCGGTTTAAATTATACTATTGCGGCGGTGTCCAGGATTCGTAATCACCTGTTGCTGCATCACGCTGACCACCGGACAGAACATGCCCCGGCGGACGATACGCCAGATTGGTGCCGGTCATGTTCGGCATATGCGGTTTCTGCCACGGACGACGGAACGATTCCGCGTTTGACGCCGGGACCACATCGGTTTGATGGTGCAGCCAGCCATGCCATTCCGGCGGAACGTTCGATGCTTCGGGCACACCCTTGTACATCACCCAGCGGCGATCCAGCTTGTACCCCGGACGGGCCTTGGCGCTGTAATATCGGTTGCCATAGACGTCGCGGCCCACACTGCGCGCCCCGCTCAGCAATGTAACGAAACCAATATGCGCGGGTGACAGGCACCCCAGCATCTGACCGATTGCACGAAACAATCCCATGGACATCCATCCCGTCTTTTGAAGGAAAAAAGGAGCGTTCCCGCCCCGCCTCAAAGCCCCCCATGGATGCCGCAAAAGCCCATGAAAATCAAGCCTTTTGTCGGGGAAAGACGCAGGGCGCGCCCCGTCTGTTCCCGCCATCCCGGACCTGACCCGTGATCCGGAGCAACACGCGCTAACCCATCCCGGTCCTCGGGATCCCCACTGTCACGGGGATGACAAAGGGATGTGCGATAATATGAAAATCTAATCCGCGACTTTCCCGGACAGGCCCTGCCACATCCGCACCATGTCCTGAGCCTGGTCGTGTTGGAGCTGTTCATAGGCGACGGAGCGTTGGGCTATGGGCTTGGCCACTTCGCCCTTGATCCGGTCCTCATCATCCCCGCCCATCATCGCCGCAATTTCAGCATGGGTCAGGGCATAGCAGACCTTCTCAATCCCGGCGCGAATGACCAGCGCCTGTCCCAGCATCGTGGGTTCGGAACTGCAATACAGGGTACAGCCCTTTAATTCGGTGCGGCCCAATTTTTCCGTGGCCCGGCGAATGGCCAGCACCTCGGCATGGGCGGTGGGGTCGCAACGGGAGGTCACGCTGTTCACACCCTCGCCAATAATATGCCCATCAGCATCGGCCACAACCGCCCCGAACGGCCCGCCCAGACGCGACCGCGCGTTCTGGTGCGCCAGCGCAATGGCCCGCGCCATCAATTGATCGGGAGAATAGCCACGTTCAACACGGTGGGCACCGATGGCATCCGGATTTTTGCCGTGCGACCGCAAGACACGGCGCACCACTTCGACCAGACGGTTGGGTTCGACGGGTTTACGCAGAACTTGCTTGATTTCATAACGACTGATTTCCAGCAGCAGATCGGTGCTGGGTTCATCCGTAATCATAATGATACCGAATGTGTGACCCTTGACCTGACAATGACGGGCGAAATCGAAGCCGGTTTTGGGCTGCATCGCCTGATCGACAATGGCGATGTCGATATTGCGTGTATCGGCAATGCGCGCCGCGGCCCCGCCATCGGCGGCGGTGACAACCTCATACCCGTTCATATACAGAAGCCCCGCGAGGTGATCGCGGTCGTCTTCGTTTTGTTCGGCAATCAAGACCTGAAGATTGTTGGAAACGCGGCTCATGAGAAACACCTGAAAAAATTAGCGAGAAAAAGTGTAGTCCTTTTTATTTATTTGAAAAGCCTTGTTGAAACAGAAAGTCTCGGGTACCCTTTTTATAACAATGACGCACCCATCTTATAAAAAACCGGGGCGTAAACAGGGAGAAGTACATGCTAGAAGACCGCAAGGGCGCACACAGCGCCTTCGATGCCGTTGCCGGAAACGATAATCAATCCTCGGCCCCGGCCCCTACCACAGCGCCAGCCACACGCACCGCCTCGGCCCCCCTACCCACACCGCCCAATTATTTTAATCAGGTCGTATCCGCCGCGCCGCAGGAGCGTATCTTGATTTTGGAGGGGCCAATCACACAGGAACTGGCCATCACGCTGCGCCTGCATCTGCTGCGCATGGAAGCCGCCAGCCCGGATGAGCCGATCACCATTATGATCAATTCGGGTGGTGGCTTGGTCACCGCCGGCATGGCAATTTATGACACAATTCAATCGCTGGAATGCCCGGTAAACACTTATGTCACCGGCATGGCTGCGTCCATGGCTTCCATCCTGTTGGTTGCGGGTACGCCCGGTGAGCGTCGCGCTGCGCCGAATGCCCGTATTATGATCCACCAGCCGTCTGGGGGATCACAGGGCAACGCCACGGAAATGGGCATCAGCCAGAACGAAATCGAACACACCTATCGCCGCATGGCCTGGGTGTACGCCGCCCATGCGTTTGACGCGGCGAAAAGCCCGGCTTTTGACAAAAAAGTCGCGGACAAGATGGCCGATCTGCAAAAAACCGCCCCCACATCCGGTGGCACATGGACGCCGGATACATTGAAAATGACGGCGCTGGCGCATTTGTATCACGCCGTGATGAAGCAGGATTATTTCCTGTACGCCGAAGAAGCCAAGGATATGGGCCTGATCGACAAGATCGAATATCCGGATATGAACATCAAACCCACAATGGACCCGAAACGCGCCAAGATATTACACCGCATTGCCGAGGCCGAACGTCTGGCCAACAAGCATCACCGCGACAACCGCCCGGAACCGAGCGCGTTTTAAATCGCCGTTCAAACAAACACCCCAACGAAAAAACGCCGCCCGCATCAAACCGGGTGGCGTTTTTTATCGTCTCATTTGAAATCAGGCACGGCTGAACACGCTGTGCAGCATGCCGTACCACACCGCCGCCATCCAGAAGAGCAGGAACAATCCCGCCTGCACAAAAATCGAACTGGCCATCGGGATCAGGAACGGCACCACCACCATGGCCAGACACAAGCCCCGGGCACAATGCTTGAAATCGGCCGCAAAGAAACGGATTTGCGCCAAAAAAGCGCCCCACAATTGAAACGCCAGAACCGTGCAGAATCCGTATAAAATCAAACCCGGATCGGACACGGACACCAGAAACCCGCGCATGATCAGCGGTGCGATACACAGTAAAACAATACAAACAAACGCCAGAACAAGTGTCACCAGATGCGATGTGATATCGATCGGTGCGCGGCCGCCCTGATACCGCCCGGACGCGGACAGCGGCACGACATTTTCCGGCGCCGATTGTGTATGTGAAATCCCAGCCATGAGTAAAAATCCCCCCACAGGACAAGCATGAAGGACCACGCAGCGTGCGTGCCATTATAATTTTTTGAAATTCAGGACAGTCAGAAGAAAAATGGGAAAAAAGAAACGCGCAACACGGGAATGATTCTACCCCGAATTGCGCGTTCTGAATATAGCGTTTAAAAAATTATTCCTGTAACCCGTCCGTGCCGCGATGTTGGCGCAGGTAATCGCGCAGCGGCATGCCATCCAGCGACAATCCCCGGATCACAGGCGCGCCGAACGCCCCCGGCACCGCGATTTCCATCATGAAACGATCCGGTTCGGACACGACCAGCCGGTCCAGATCGGCCGCAAATTCCTCCGGGATGAAGTATTTTTGCGGCCCCGTGGCGGACGACGCGGGAATGCGGCTTTGGCACAAACGATCCGCATCCGGCGCATCACATTGCAGGCGCGTCACATTCATAAAAGTCTGATTGCCGTTCAGGCACAGGCAGCAATTGGGATCATTATCCGCGCACGCAGCATCCACGCCCGCATCCGCCTCCATCCAGTCATAACGAAACTGGATATACCGGCCATGCAGGAGATCCCGTGGATCGTATCCCGCGATCTTCACGGTCCAGACCGGACCCGCCGTCTGGTCGGTCGCCGCCTTGACCCACAGTAATCCGGGCACCAGCAACGGCAACACCAGCACCGCCAGCATGATTATTTTGCGCATGGGTTCGGAAATTTTATCGCAACAGGCACAAAAACCGATCATGACTGACCTCCCGAGGAATGAGACTGCGTGTTTGATAAAGACCGCACGTAACGATTGGCCCGGCGCAAGCCCCAAATCATGCCCAGCATGGCCACCCCGGCAATGATCAAGCCGAATCCCGTGGTCATCAACCCGGCAAAGGCCTCAAGGAAAATGACGTAGATGCGAATGCCGATCAGCGTCACCGACAGCGTCACCAGACGCATGGCCCCAGTCAAATGCCCATACCACCCCGCACCAATCCAGAACGCGATGAACGTCAGACCAGCCATAAAGCTGGAATCCGGTGACGGGATCAGGAACGGTAGCAAGACAGACACAACACATAATCCGCAATAGCCCAGCGTGGCACGATAGAACGCCCGGTCCTCAACACGGCTGTTGTACAGGGCGTAATGCACCGCCATGCCCGCCAGTGCGATCAAAGGCACAGCAACCGCAACCAGATAGGCCGTGCCATATTCCATACCGGAATCGCGTGCGATGGTGCCCAGTTCCTTGGCGCGGTCGACATACCACAGCATTGTCGCCATCATCCCGGTGAAGGTCAGCAAAATAAATCCGGCCCGCATCACAACAAACGCCCAGATGGGGCGGAAGGCACGGAACATGCGCGTAAATCCATCGGCACACATCGCCAGCGGCAACCACACAGCCACGGCCAAGCTGTAAAACAACATCCAGATATCGGGCAGATCGTGCAGATATTGATCCATGATCACCGGGATGCAGCCCAGAAATGCCAGGACCCAGGGAATGGCCGTCAGCATCGTGCGGCCATAAATCACCATAAACGGCGAAATCACGATCAGCCACAGCGCCAGCAACCCGCCCGCCCCGCCATTGAGGTGATAAATCTGGCCCACCAGCGCCATAAAGGTCAGTGTCAGCCCCGCCAGAACCAGCACGGCCCCCTCCCGCCACACATCCATCCCGCGGGCATGGAACCGCCACACGGCCACCGCCGCGCCAATGTTCAAAACGGCATGCACCACCAGTTTCGCCGCATCGGGAATATCGTTCCAGTTGGCCCCGATCACCATCAATACGCCCACCAGAATGGCGAACAACGACAGCCCCATCAGACCGCCCGCAAACCGACCCTTTTGCCGGTCGCGTTCAAACGCGGTGATGGATACAGCCTGATCGGCGGAGATTAACCCCACCTGTTCCCAGGCTTTAATCTTGCGATGGACACCAAACATATGATTTTCCCCTTTAAAACGGGCGGTCATTGTGGGATTTCAAAGACAGAAGGATGGGCGCTACGCAAACGCGAAAATCGGTCATTATCGTACCTTTTTTCAACGCCATGGACATGGTGTTTTTGATACTGGTAGAATCGTGCGTATGAAACCATTTATTCCACCGCACCGCGAATCAGAACGCGGCAGCGTTTTAATTTATATCTTTATCGCCATCGCCGTTCTGGCCGCGCTGAGCTTTGCCGTATCCAGGTCGGGGCGGGAAAGTGCGCAAACCATCAACAAGGAACGCGCCGATTTATGGGCTACGGAATTGTTCGATTATTCGAACATGCTGCGCCGTGCGGTCACGACATTGACGATTACGGGTCTGACGGAAAACGATGTGTGCTTCCATGATTCCGGGTGGGACGACAACAGTTATCAATTCTCCACCGCCTGCCCCACCGCCAATCTGGTGTTCAGCCAGCTGGGTGGCGGCGCGACATTCCAGAACCCGAATTACAATCTGCTAGATTCCAGCTATTCCACGGACCCCGCCTATAAAAAATGGCATATCACTGGCGCCAACAGTGTCGCCGGGGTCGGCACCGATTGTTCCCCGACGGGACCGTGTAATGAATTGCTGGCCGTGTTGCCCTTTGTCCGGCGTGAGGCCTGTATCGCCGTGAATTCCAAACTGGGCATCACCGATGATTTGACCGAGCCGCCGCAGGATGACGCCGGTTTTGACCTGAGCGTGCCGTTCAAGGGGTCATACCCAAATGGCGAATCCATCAACACCGCCACGCTGGATGGCAAGCGGGTCGGATGTTTTAAGGGCAATGGCGGGGTCATTGACGACGCCTATGTGTTTTACAGTGTCCTGATCGCACGATAGGCCAATAAAAAACCCGCTGTTCAAGCGGGTTTTTTATTCTGTCTGTCACCACTTTATTTCGGCGTCATTTCATTGCTACGACGGCGCGTTTTTCCCGGCTTGGCCGCCGCTGGATTATTGTCATTGGCCACAGCAGCATCGTTGAACAGGAAGCGCAGCTTTTTATCCTGGCCCTGTCCTTCGGTATCCACCAGAACCGCACCACCCTTGGCCAGCTCACCAAACAAAACCTGGCGCGCCAGCGGCGTGCTGATCTCGGTCTGGATCAGGCGGCCCATCGGACGGGCCCCCATGTCACGGTTATAACCCTTGGTGGCCAGATAGTCGCGAGCATCGGCGCTCAGGTCGATTTTGACATTCCGTTCCGCCAGCTGACCACCCAAGATCTTGACAAATTTATCAACGATCGAGCCCATCGTTTCCGGCTTCAGGTAATCAAAACGGATTTGCGCATCCATGCGGTTGCGGAATTCCGGCGGCAGGAATTGGCTCACCTCATCCGCGCGCAGAACTTCTTCGCGCGTTTCGGTATGGAACCCGATGCCCTGAACCTTTTTGACTTCGGCATCGCGCAAGTTGCTGGTCATGATCAAAATCACGTTGCGGAAATCCGTGGTCTTGCCATGGGAATCCGTCAAACGCCCATCATCCATCACCTGCAACAGGGCTTTCAGAACATCCGGGTGGGCTTTTTCAATTTCATCGAGCAACAGCACGCAGTTATTGTGCTTGGTCACGGCATTGGTCAGCTCGCCGCCCTCATCATGGCCGACATAGCCCGGGGGTGACCCGACCAGGCGCGCAACAGTATGTTTCTCCTGAAATTCAGACATATCAAAGCGCACCAATTGCGTGCCCAGCGTTGCCGCCAGCTGTTTCGCCATTTCGGTCTTGCCCACACCCGTTGGACCGGTGAACAGATAACTGCCCTTCGGTTTATTCAGGTCACGCAGGCCAGCCTGAGCCAGCAATACGGCATCGGTCAGGGCATCAATCGCGGCATCTTGCTGGAACACGGCCTGGCGCAGATCGGAATCCAGCGTGCGCAGTTTTTCCAGCCCGTCGCCCGACAATTCCTTTTTCGGCAGGCGCTTGATACGGGCGACAGTGTCTTCCATCTCATCCTTGTCGATCACGTTCGTGGCGCGTGGTTCAACGATACGGTGCGCGGCGGCCCCATCCAGCAAATCAATTGCCTTGTCCGGCAACTGGCGATCCGTCATATACCGGACCGACAATTTCACCGCGGCTTCAATCGCTTCATCGGTATAGGTCACACCGTGGAAGGTTTCGTAATGTTTCTTCAGACCTTTCAGAATTTCGATGGCCTGTGCCGGGGTCGGTTCGGCCACATCGATTTTCTGGAAGCGACGGCTCATCGCCGCGTCTTTTTCGAAATATTTGCTGTATTCGGCATACGTCGTCGCCCCAACGCAACGCACCCGACCACTGGACAGATACGGTTTCATAATGTTGGCCGCATCCATCTTGCTGTCACTGCCCGTACCGGCGCCGATCAGCATGTGGATTTCATCGACGAACAAGATAGCGCCCGGGGTCAATTCCAGCTGGGTCAAAACGGCCTTCAGGCGTTTTTCAAAATCACCACGATATTTCGATCCCGCGGTCATCGCCGTCAAATCGAGCGAATACAATGTCGCGCCCAGCAATTGGTCCGGCACATTGCCATTCACGATATCCATCGCCAGCCCTTCGGCCACCGCCGTTTTACCAACGCCGGGTTCACCCACCAGAATAGGGTTGTTTTTCTTACGACGGGCCAGGACCTCAATCGTCTGGTTAATTTCATTCTGACGCGCCAGGACTGGATCAATTTTGCCCGAGGCAGCCAGTTCGTTCAGGTTCACCGCAAACTGTTCCAACGGCGGACGTTCGTCCTCTTCTTCCTCACCCTGCTGTGACGCCGGGTTCTTGCGCTTGCTCAGGCCATAATTATCCGTCGCCGCGGCATCTTTCGGATTCACCGTCGTTCCGTGGGTCAGGAAATTATCCAGCGTTTCGGCGCTGCCGATGCCATGTTTTTCCAACAGATACGCGGCATTTGAACCGCGTTCGCGCATCAAGGCCAGCAACAGCGATTTGCTGTTGGCAATGGTGCCCGGGTTTTTGGTCGTGTTCTCGTAATAGACGCGATTCAAAATCGTGCCCATCGCGCTGCTTTTTTCCAGGTCGATATCCTGCAGCGGGCGCGGGTTGACATCGAAATGTTCGACGATGAAATGCTCCGCCTCTTCCTTGATCTTGGCAACATCCACTTTCAGTGCACTGAAAACGGTTTTGCAATCGGGATCATCCAGCAACGCCAACAACAAATGTTCCGGCATCATGATCTGGTTGCCGTGTGCGATGGCGATGTCCTGTGTGCGCAATAAGGTGAGGCGCATATTTTCAGATTGTTCCATGGCGATCCCTTGTCCCTACTCAAAAGAAGTTTCTTGTTTTTGAACCTGTTTGAAATTTATAGAGATTTTACATAGGGGTGCCAAGAAAATTATCTTATTTTATGGAAATAAATATTCTTGCTTTGGCTGATTTCAGCCATTTTTAACCCCCGTCCGGGGTTCCGGCGGGGGTCAAAGATGGTCCAGACCGCTGTAAATCGGCCAGCCGAACCCTGGTCTAGTCCTGGTCGATTTTCTCACGGGCACCGGATTTGCGGGCCTCTTCATAATCGAAAATCAGCTCACCGCCCTTGAAATCGACATGGGCGACGCCGCCCTTGGTCAATTTGCCGAATAACAATTCCTCGGCCAGCGGGCGTTTGATTTTTTCCTGAATGACACGCGCCAACGGACGCGCACCCATGGCCGGGTCATAACCGATTTCGGCCAGATGGGCGCGGGCCTCATCCGACAGGACGATGGTCACATCACGATCGGCCAATTGGGCTTCCAGTTGAATGACGAATTTATCAACGACACGCGCCACGGTTTCCGGCTTCAAATTCTGGAACGGCACAATGGCATCCAGGCGGTTGCGGAATTCCGGCGTGAACAGGCGACGGATTTCATCATTGTCACCATCAATTTCCATGGTGCGGCCAAACCCGATCGGCGATTTCGCCATCTGGGCCGCGCCCGCATTGGTCGTCATAATCAAAATGACGTTGCGGAAATCAACCGTTTTCCCGTTGTTATCGGTCAGTTTGCCGTAGTCCATGACCTGCAGCAAAATGTTGTACAGGTCCGGATGGGCCTTTTCAATTTCATCCAGCAGCAACACGCAATGCGGGTTCTGGTCAATCGCATCGGTCAGCAAGCCACCCTGTTCAAAGCCAACATAGCCCGGCGGCGTACCGATCAAACGGGACACCGAATGCTTCTCCATATATTCCGACATGTCGAAGCGTTTCAATTCAACGCCGAGCGATTTGGCCAGTTGCCGCGCCACTTCGGTTTTACCAACCCCGGTGGGGCCGCTGAACAGATACGACCCAATCGGTTTTTCCGCATCGCGCAGACCCGCGCGCGCCATTTTGATGGAATCGGACAGCACCGCAATCGCCTGATCCTGGTCGTACACCATGGTTTTCAGGTCGCGATCGAGATTCTTCAACAATGTGCGATCGTCTTTGTTCACCGCCTTGGCCGGAATGCGCGCGATCACCGCGATCACATCCTCAATATCCTTCACATTGATGGTTTTCTTGCGTTTTGACGCCGGCACCAGCATCTGGGCCGCACCAACTTCGTCGATAATATCAATCGCCTTGTCCGGCAATTTGCGATCACCAATATAGCGGGCAGAGAGCTCCACCGCCGCCTTGATCGCATCATTGGTGTATTTCACATTGTGGTGTTCTTCGTAATAAGGCTTCAGCCCTTTCAGAATTTTGATCGCATCCTCGATGCTCGGTTCCTTCACATCGATTTTCTGGAACCGGCGGACCAGCGCGCGGTCCTTTTCAAAATGATTGCGGTATTCCTTGTACGTCGTCGACCCGATGCAGCGCAAAGACCCGTTCGACAAGGCCGGTTTCAACAGGTTCGACGCATCCATCGCCCCGCCCGACGTGGCCCCAGCGCCAATCACGGTGTGAATTTCATCGATAAACAGAACGGCGTTATCAACCTTTTCAATTTCGCCCAGAACGGCCTTCAACCGCTCTTCGAAATCACCGCGATAACGCGTACCCGCCAGCAACGCACCCATATCCAGCGCATAAATCACGGCGGTTTTCAAAACGTCTGGCACTTCGCCATCGACAATGCGTTTGGCCAACCCTTCGGCAATGGCGGTTTTACCAACGCCCGGATCACCCACATAAAGCGGGTTGTTTTTCGACCGGCGACAGAGAATTTGCACCGTGCGATCAACCTCTTCATGGCGGCCAATCAGCGGGTCGATCTTGCCGCGCCGCGCCTTGTTGTTCAAATTGGTGCAATAGGTTTCCAGCGCATCGCGCCCCGGCTTGGCATTCTTGTCATCACCGGAAGCGGATTCACCCGGTTCGGTGCCCTCGGGCGGGCGGATTTCGCTCTGTCCCGGCACCTTGGCGATGCCGTGGGAAATGTAATTCACCGCATCAAAGCGGGTCATGTCCATTTCCTGCAGGAAATGGACCGCATTGGATTCACGTTCCGAGAACATGGCGACCAGCACATTGGCGCCGGTCACTTCCTCACGCCCCGAGGATTGCACATGGATCGCCGCGCGTTGCAAAACGCGCTGGAACGCCGTGGTCGGTTTGGATTCTTCGCCGTTGCGATTGACCAGATAGGTCAGTTCATTGTCCAGATAATGCGCCAGCTGATCCTTCAGATCCGGCAGCGAAATGCCGCAGGACCGCAATACCGCCATCGCATCCTGATCATCGGTCAGGGCCAGCAGTAAATGCTCCAGCGTCGCATATTCATGGCCGTGATCATTGGCCAGAGCCAGAGCGCGGTGGAGCGTTTTTTCAAGATTGCGTGAGAGCATGGCGAGGTCCGGGTGCGGAAAGGGTTGGCGTCTCTATCAATATAGTATTTTCGGGCCAAAGTTAACGACCCGCACCCTTGCACAGGGCTTATTTTGAGTATGGAGCGATTCGAATTGATAAAAGATGAAGAATGATCAATGAAATCAGGGTGTTTTCATATTCACCCTTATTCCTTCTCCATCGTGCATTGGAGGGGTTGTTCGCTGGCGCGGGCGAAGTCCATCACCTGGGCCACTTTGGTTTCGGCAACTTCGTACGTGAAGATGCCACAGATGCCGACACCGCGGCGGTGAACATGCAACATGATGTCGGTGGCTTCCTGACGGTTCTTGGAAAAAAACCGTTCCAGAACGTGGACGACAAATTCCATCGGCGTGTAATCGTCGTTCAACAACAACACCTTGTACATGGACGGCTTTTTCGTCTTTGGCCGGGGTTTGAGCAACAGGCCAGCCTGACGCCGTTCCTCGCCCTCACCATCATCGGGCAGATCGTCCCCATGATCGTCCCCCATACCATTGATCAGGTCGCCAAAATGCGGCCCGAAAAACGGTCGGGGATCGTTGGCATCATCATGGGCCATCAGCCAGAAATCCATATCGTCTTCAATCCAAAAACTGGTCATTGCTCACACGCTCTCTCAAAAGTCTTTTTTCTGTGGGGCGCGGAAACTGCCATCAATATTATAACGAATTATGAACAGAATCAGGAAAAATTAAAAATCATCAAAACAATCCCGAATCCTTTTGTTTTGCACCCTTTTACCCCTGCATTCTGTAACAAAATTGCGCAGCCTGCACAAAGATCAACGGCCCCGACAAAGAAAAATGCCCGTTCCCGCAAGGGAACGAGCAGTTGGGAAGGTCACCATTTGGAATGGGACTCGGGCAGGACGGTGGCGCGGGTCTTAGGCGGCCAGGGCGTCCGTAGCTTTTTTAATCGCCTTGCTGGCCTGCGCGTTCAGCGGCTCCAGCGTTTCGGTCGTAACCTTCAGGCCCAGTTCGGACAATTTGGTCGACGCGGCGACGAAATCTTCGAATTGCTGCTGCGCCAGTTTGGTCTGTGCGGCAGAGAAATCGTTGATCGTTTTGCAGCCCATCAGGGTTTTCAGGGCAGCCGCATTTTTCTCGGCCGCGTCCTGAGAGATGGACACGTAGGTTTTCAGGATGTCTTCAACACCCTTGGCGAACACGGACCCGGCCTGGGTCAGCGCTTCAACCTGTTCTTGACCCAGCGCCGACGCGTCCTTGGTCAGTTTTTCAAAATTGGCGGTGGTGGCTTTGGTCGTTGTCATGATTTTCTCCGTAATTTCAGTGGATTGCGTTTCAACAGAGGCCTTCAGCGCGCGGACATTCGTGGCGGCGGTTTTGGAGGTCTTCTTTTGGGGTGCTTTTGCTTTTTTCGCTGTCATGGTGGCTCTCCCTATCCTTTTGCTTATAATTTATCAAAGAATATGATGCATTGCACAATACCAAGGTAACACGCACCCCCTGGAGCGTCAAGGGAAAAAATGGTGCAACGCACAACGCTTCTCAAAATGAAAAAAATGCCCCGTGATTCGGGGTGTGGTCTATCCCTGAGGCTGTATGCATTTTTTCGAGCGGATTCATAGACTTTAAGGGTCTGCCCGCCTATCATGGTCTTATGAGTCGGACCCGCCGTGGTTGCGATCTGGCTCTGCCATGGGGCCAGACCATCGGGCCATGACGCGTACAAGAAACGAAAAGCATACAGGATCTCTTCCACAATGGGATACGACGTGGCCACTCTCCGTCAGCACGCTCGCACATTTTTTTCCACCGCCCTCGGGCTGGGCCTGATGGCCGGGCTGATGCTGGCCGCGACCCCGGCCCATGCCGCGAAGAAGAAATCACAAGACAACCCGCGTTACGCCTCAATCGTCATGGATGCCGATACGGGTGCAATCCTGCATGAACGCTATGCCGACAAAAGCCTGTACCCCGCATCGCTGGTCAAGATGATGACGCTGCTGATGGTGTTTGAAGCGATGGACCGGGGCGAGATGAACCTGAACACCCGCATCCGCATTTCCCAACACGCCGCCAGCATGCAACCCAGCAAGATCGGATTGAAAGCCGGGTCCACCATCCGGGTTGAAGATGCGATTCTGGCGCTGGTCACAAAATCCGCCAACGACATGGCCGCCGCATTGGGCGAAGCCGTGGGCGGGACGGAATCCAATTTCGCCAGCATGATGACCCGCCGCGCCCGTGAAATCGGGATGCGCAACACAACATTCCGCAACGCCTCGGGCCTGCACCATCCGTCACAGGTGTCAACCGTGCGGGACATGGCCATCCTGTCGCGTCACATCATCTACAACCAGCCACGGAACTTCCGCTTCTTCAGCACCAAGAATTTTCGTTATAACGGCGTGAATTATCACAACCATAACCGGCTGATGAGCACCTATGCCGGGATGGACGGCCTGAAAACCGGTTATACCGTTCCGGCGGGATTCAACCTGGCGGCCACGGCTGTGCGCAATGACCGCCGCCTGATCGCTGTCGTCTTCGGCGGCCGCACGACCCAGAGCCGAAACGCCCATGTGGCCGACCTGCTGGATCGCGGTTTCAAACAAATCGGCACGGGTCAGGTGATGATGGCGCAAACCAACGTCCGCGCCACAGCCCCGGCCCCGGTTCCAAACCGCAAACCGGGCACGGAAACAGGCACCACCGTTGTGGCCGCCACCCATTTTGAAGAGATGATGGGCGCAGGCGATTCCGACCCGACCGCATCCACACGGATTGAAAGCGGCATGGTCGCCGTCAATGCATTACGCAATGCTGGTGCCGTATCACGCGCAGAGCCACAGACCCCAACGCAGCAACCGGTTCAGCCCCAAGTCATTCGCACCCCCGCGGCGCAGGCCGCCCTGCAACCGCAAAGCATGACCACCACCGCCCCGATTGTGACATCCGGTACGGAACATACATGGTCGGTGCAAATTGGTGCGTTCACCAACCGCATGAAAACCGACCAGCTTTTGACCAGTTCACAGGGGAAATTGCCGCCGCAACTGGCCCAGGTGGCCCAGCCGGTGATCGTCCCCTTAAGCGCCGGAAGCAGCACCGTCTTCCGCGCCAGGATCAAGGGATTTTCCCGCAACCAGGCCATTGAAGCCTGTCGGTATTTCACGGATTGCATGACCATTTCCCCCCGCGCGTTCTGAGTAAGGATCTGCCGATGACGTCGCTGAAAGATAAAACGATCGTCATCACCGGGGCCAGCCGCGGGATTGGTGCCGCGATTGCCATACGGGCCGCGCGCGACGGAGCCAATATCGCCATTCTGGCCAAATCCGACACGCCCCACCCGACACTGGAGGGCACCATCCACACCACCGCCGACACCGTGGAAAAGGCAGGAGGGCGCGCTTTGCCGCTGGCCGTGGATATCCGCGATGAAAATGTGGTTGCGGGTGCGATCCAGACCATTGCCGCAACATTCGGCCGGATCGATATGGTGGTGAATAACGCCAGCGCCATCCACGCCGCCCCCACGCCCCACACGCCGATGAGCAAATATGATTTGATGATGGATGTGAATGCGCGCGGCACGTTTGCCGTGGTGCAAGCCGCCCTGCCCCATCTGCAAAAATCCGCCACGGCTGATTCCCCGGCACAAATCTTAACCCTATCCCCCCCGCTCAATCTGGGCAGCCAGTGGATCGGGCGGTGCCCGGCCTATAGCCTGTCCAAATATGGGATGAGCCTGCTGACCATGGGGTTTGCGGCGGAATTTAAGGACTGGCATATCCATGCCAACACGCTGTGGCCGCAAACATTGATCGCCACGGACGCCGTGCGCGTGTTTTTCGCCGATGCCTATAATGCGTCGCGCACACCCGAAATTGTTGCCGAGGCCGCCTATGCCATCCTGACCGGATGCAATGGACGATTTGAAACCGGTCAGCATTACACCGATGAAAGCGCCCTGCGCCTGGTCGGTATCAATGATTTTTCACAGTATAATACGACCCCCGGGCTTGATCCGGTGGATGATATTTTCCTGGATTAAACCATGCAGCAAAAACGCAACCCATCACGCCCCCAATCTGGCGAATCCGGTAACGTGATCTTCTTCATCTTGCTGGGGATTGCGCTGATTGGCCTGGTGACGGCCGCCTTGCGATCCGGCGGATCGGAAAGCGCGAATATCGACGCCGAACAAATGGTCATCAAGGTGTCCCAGGTGCAGCAAAATGCCAGCGCACTGGCGCGCGCCACGGAAGTTGCGTACCAGAATGTGCGCGAAGAATCCGCCATCAGCTTCGCCCATCCGGACGCACACAGCGATTACGGCACATACGGCACCACCCCATCCGCCGAAGTGTTTCATGTCCAGGGCGGTGGCGCGGATTATCTGGACCCGCCCGCCGGAATCAATGACGGTACGGCCTGGCAATTTTACGGCACGACTGCCGCGCCGGATGTGGGAACGGATGCTGCCGATCTGATCGCCGTTTTGCCCAATGTCACGCCAGAATTTTGCACAGCCATGAATAAATCACTGGGGCAATTGGGCGCGACCAATGATACGGGCACCTGTGTCTATGATTCATCGGGTGTGCGTTTCAGCGGCGTGTTTGCCGTCGCGCCGAATACGATGGATGGATCCAGCTTTACCTTTACCCCCGCACCGCAAGCGTGCGTTACCTGCACCGGGGCCGGCAATACGCAGCATTTTTACAAGGTTATTCTGGCGCGTTAAGGCCTGAGCCTTATTTCGTCAGCGGCATCGGCTGTGACGCGGTCACAGGCGCGGGCGGCGCATAAGGCGGTGTGGTCGCGGCCGACGGATAGATTGCTTTGGCCTGATGCGCGGGCTGGTTCGCCGCGGCCGTTTCCGGCGGCGGTGTGGTCGCGCTATAGGCCATGCCGGTCGCCAGTTGATCCCGCTCCGCTTTTGTCATGCTCATTTCACGCAACTGACGCGTTGCCGGAAGCGTGTAATCGTAGGCCGACAGAACGCGTCCGGTTTCCGCCTGAATCATCTGCAGCGAAATCATAATATGGTCTTTGCCGCGGGCATAGGACCCATACATCTGCACCAGACCGGGTTTACCCATGCCACTGGCGGATGATGGCGTGACGCCTTTCATCACAGGTTTCGGCGCACCGACCCCCACTTGCGGGGCCATAATGGTGGACGGCACGCTGTCCGGCGGCAGGCCGCCCGTATAGACGTTATATCCCAACTGCACAAACCGTGCGCCCAACTGGTTCGATACGGTTTGACCGAATGGGGTCAGTTCACCAGGAATGGCCACATCATAAAGCGGGAAGATTTGCACCGGTGTCGCCGCATTGACATGGGCGCGGGATTGTTGCGCCAGCATATCCGCCGCCGCATAAGACGAATCCGTCAGATTGATTTTTGAAGTATCGAACAATTCATCCTGCACCGCCAGCGCGACCGCACACGCGCTGAGCACACCACCAACCCCCAGCAAAGCGCAGGCCCCAAGAACCAGCGATGGCAATCGAACAGACATCATCGCATGCTCCTGCTTTTTAGAATCCGCTATCAACGACAATCGGGTCGCTGGACACATCAGCCACCGGAACCGGCGGATTATTCATGTAAACCGGCGCGGCAGAGACATCCACCGAACCATGCTGACGTGTGATTTCCTGTCCCTGAACGACCAGGAAGATTTCCGCGTCCAGCGTATAAGCCCCGGCGCCGTTCTGGGCCCGGTCAGCAATCGCGTAGCCCAAAGAGAACGGACCTTCGCCCGCCGCGGTCGACACTGGAATACCCTGATTCTGCATCGCGCGGCGCAGGGCCATTTCCATGGCCGCATCGCGGTTGGTGTTGGCCGGATGTGTAATCACCATCGTCGGCTCCATCGGACGGCCGAAATTGCGCACCAGTTTGGCCAGCAGATCTTCGGCCACAACGGTCAGGGAATCGATATCAGCATTGGTAAAGCCGGGCGGCAGACCTTCGGTTTGCTGCACCGGAGCCTGATCAACAACCATGGCCGGAGCCGTGCCCTGACCAACGACGTGGGCCGGTTTGGACGCTTCCTTGCCCGGAGGCGCCTTGTACACTTCATTATGATAGGTGTAACCCGTTGGCATTGTGGTGTCAGCGCAAGCCGCAAGCCCACCCGCCAGAACAACAGCAGAAGCCAATGTGAAAACGCGCACCAGACGGGACAAAGCAGGCATGGTCAAATTCCTATAAAGACAGTGGATGGCCGCAAAAAGAATCAAGAAGGGCGGCATGGCAGAACGCCAGACCGGTTCATGCTATCAGGGATTACCCAGAAAATACAGTCTTTGCAGGCGGAAAATTGCTTAGCCCTGCCCCTGCAACTTCTTCACGATCGCGGTCGTGCTTTTGCCCGCGACCACCGGAGCCAGCCAGACCACGCCGCCGCGGGATTGAACATAATCCGCACCATCGACATGCTTGCCTTCGTAATCCGCGCCTTTCACCAACACATCCGGGTTGATGCGGCGGATGATCAGCTCAGCCTTGTCATCCTCTTCCTTGGTCCGACCAAAGATCACCACCAGATCGACCGCGCCCAGCGCCGCCAGAACCTGCGCGCGGGAGTCTTCGTCATTGATCGGGCGGTCTTTGCTCTTATACCGGCGGATGGATTCGTCGCAGTTCAGGGCGACCACCAGACGGTCACAGCGCGAGCGCGCCTGCGCCAGATAGGTCACGTGACCCGGGTGCAGAATATCAAACGCGCCGTTGGTAAAGCCGATTTTCAGGCCCTGCGCCCGCCAGCCCTCAACGATTTTGGCGGCATCATCCCAATTGTCGCAAACGGGAGCGATCATGTCGCTGATGGCGTCAGCGGATTCCAGTTCATCCATCAAAATCGGCGCGGTGCCAACCTTGCCAACGACAATGCCAGCAGCCCGGTTGGCCAGATGGGCCGCATCATTCAGATCCATGCCCGCAGCCAGACCGGCGGCGATCGTGGCAATGACCGTATCGCCCGCACCGGAGACATCGAACACCGCGCGCGCCTGCGCCCGCAAATGCAAGGGTGTGCCATCGGCGCGCACCACGCTCATCCCGTCTTGGGACCGGGTGGCGATGACGCAATCAATCCCGGATGTTTTGATCAAATGCCGCGCAGCCACAACAATGTCGCTATCCGTGGCATTGGGCAGGTTGTTGGTGGCCTCGGCCAGTTCCTTGCGGTTTGGCGTCACGATGCTGGCCCCGCGATAGATAGAATAATCCCGCCCCTTCGGGTCCACCAGAACCGGAATATTCTTGGCCCGGGCCGCATCAATGACCCCTGCAATGACCGCAGGCGTCAACGTCCCCTTGCCATAGTCGGACAGGATCAGTGCGCGCGCGCCGTCATTCAATGCAGCCTGCACCTGCGCAATCAGGCGCGCAGCGACATCGTCGCCCACAGGCCGCGCATCTTCGGCATCCAGACGCAGGAGATGGTGATTGCCCGCCATAAACCGGGTTTTCACAATGGTCGGGCGATCGGTTGCCACCAGCAATCCGGCGGCATCCGCACCGATATCGGCCATCATGGCGCGCAACGCGCCCGCATCCGCATCATCGCCAACAACACCCGCCAGATGCACACGCACGCCCAGCCCGCGCAGATTGGCCAGAACGTTACCCGCGCCCCCCAAAACGGCCCGCGTTGTCGTGTTGGACAAAACCGGCACCGGGGCCTCGGGCGACAGGCGCACGGCATCGCCGGTCACATACCGGTCGAGCATGATGTCGCCAATCACGGCCACGACTGCATTTTGCAAAGAATTGCGCGTCATAATATCCAAAGGATCTACAGGGCAAGCCCCTGTTTTTTCAGCATTTTATCGGCGATTTGCGCCAGTCCGTCTCTTTTTGCCTGTTTGTTTAGGTTTCGTCAACGTTTCTCCGCTTTAATGGTAGGAACAAAAGACGGGTTTGGGCTTAATTTTCGGGGAATTTTCGGGGTTTTTGAGTGGGTTTTGAATCGGGCGGATCATCTTTAAAATCACACAGCCAGGGTGGCGTGCTGAACCGGATCAAAGTATCCGTTGACCGCAATCGTCTGGGCGAAGTGCTGGTTTACACCGGGGCCTTAACGCCGCAGGAATTGCGCTATGCGTTGGCCCGCCAAAAAACATCTGGCGCGTCTGAGCCTCTGGGTCGCGTTCTGCTCCGCGAACGCATGATCCGTCGTCAGGATTTGTACCGCGCCTTGGCCCAGCAAATGACACTGCGCCTGATGGCTGCATCCCTTGCCATTGCTTTGGGCTTTACGATGTTTGGCATCAAACGCGCCAACGCCGCATCCGTGCGCGACCTGCCGGCGCAGGTCACATTGGCCAGCGTTGCCAACAGCGCATTTACACCGGTGAATTATTATCCGAACCTGTTCGGTGCGAATGAAAAACAATCCACGAATTTGAAACCGTTCACAAAATGGACGGATATGTTCGTGCGGTTTGAAACCGCAATGGGCGGCGCATCCGCAAAATCCAGCATTAAAACCATCAAGGCCGAGATTGAGCCGTTGCAAGGTTTGCCGTTGGACGCCATGGCCGCCCGCGTGAATACCATCGCAAACGCTGTCCGTTATATCGAGGACAAGGATAACTACGGCAACAGCGATTACTGGGCCACCCCGGTTGAATTCTTTGCCCGGGGCGGTGATTGCGAAGATTACGCGATTGCCAAATACACCATGCTGCGCGCGTTGGGCGTACCGGAAAGCCGCCTGCGCATCGCCATTGTGCATGATTTGCAAAAAAACATTCCGCACGCCGTTTTGGTTGTGTATACGGATGACGGCGCATTGATCCTGGACAACCAGAACAAGAGCGTGCGCACCGCGAATTCGCTGACCAGCCGGTATCGCCCGATCTTCTCGATCAACCGCGATGCGTGGTGGCTGCACAACAAACCGCAAGGCACGGTTCTGGCATCAGCCCAGTAAGGTTTAGGTTTTGAAGTTTTAAGTTCCCTGTTCAACTAAAAAAGCCCGGATCAAAAATCCGGGCTTCTTTCTTTTTCGGTTATATCATCATGGGACCACGGCCCAGTTTCTTCCAGTCTGGCTTTTCCTGATCAGACTTACGAACCCACGCCCCCATGGCCGCCGAGCCCGTTTGCCCATCCGCATAGACAATATCCGCGCGCAGGATAACACCGTGATGATCGGGCGTATAACGGCCTACGAAATCCTTGCCGTTGTTAAAAACCTCGACCAATTCAAACCCGACATAACGGGCCGAATTGTGAACGATCACCTCTGGCCCCTGCATCGCCTTCGTGGTCCCACTTCCCGGATTCGCCTTGCGGCGAAAGCTCATCAGCACCGGTTCGTCCTGTGCCGGATATTCCGGATTTTGGCCATTGGCACGAACACGGAAAACGAAGCAGCCGTTATCCTGACGATCCAGCGATACATCAAAACCATCAATTTTCTTCTGAACAACCGGGCCACCAAACAGGTGCGCGGCAGCAGCGCGAAACAATGTTTTGAGGGTCATCACCAAATCCTTTTATTTAAATCGTCGCATCAGGCTGGTTTTGTTTTTGCGCCAGCGTGATCGTGGATTGGCGCGGAACCAGACGCAGATTTTCCTCGGTTCTGTATCCGCCAAAGGTCGTGATTTTCGCCCGCAGGACAACACCATGATAAGCGCGCGCGGAATCAAAAGCTTCGACCAGCTCAAAGGCTTTGTATTTCGTATTTTTCTCAACGAACAAACGGCCATCAACGAAGCCGCGGCGGCATTCCGTGTCATGGTCAACGCGCAGAACCATGTCCTGATCCGCGGCCGGATAGTCGGGATTTTTTCCATTCGCCCGGATACGCAATTCGAACGTATTGTTGTCCATCCGCTTGATCGCTGTGTCGAATCCATCAACATAACAATAGAATTGCGGCCCAGCCCCAAACAAGCGCGCGGCCTTTTTAAAGATATTCTGCAGTTTCATAACACACTCTCTCTTTTTTCGGTTTTTAAAATCTTCCGGGCCACGCGATCATCACCGCGGCCACGGGCCTTTTTTACTTTCGCACGAGCAGGAATCGCACGGGTTTATTGTACCGCCGAACGAAACATGGTCGAGACGCAACATGGTTTCTTTCGTCCCCTCAATCGCACAGGCCGGGCGCAAATCCTGCGCCAGCGGACGACCGACGGAATCAAACGTCAGTTGCAGGGATTCAAACACCATGGCCACGTTCGCGTGGATGGATTGATCAATCGCCTTTGGCAAGACAAACGTCGACACATGAATGGAACGGCCTTCCTTCATCCGGTCGCGCACTTTGTCATTATCGCCCACGCCCGACGGGAGATAAAAAACCGTGCCTTCGTAATCCTGACGCATGGCATATTCACCGCTGGCGGCCACATGGCTGGTCGCATAACGATTAAACGCCACAACCAGATTGCGCACAGTAAAGGGCCCCGGATTGGCAAAGCGATTCATGCCATCATCACAGGCACCCATGGCCATCACCGTCGTCTGGATCACTTGATACACGTCATTAGAAAACGCGCGATAATGTTCAAGAACCCCACCCGGAACAAACATCTTTTGCGCGCGGGCCATCAATTCATCGCGGCCAACCGCAACGCTCAGCCCATGGGACCAGATCGCCTCGCGCATCTTTTGTTCCAACGCCACATCACCGCGCGCAACACTGACGATATTTTGCGTCAGGGCGTCCTGCAGCAATTCCGTCGCATCCTGCGGAAGGATTGCATCCATCATGGTATCTGGCTCCTTGGGGTTTGCCTGATAAGGGTGAAGGCTATCTTTATCGAAAGCAATTTTTTATGTCAATAAATGTCTTTTCCGTTTTGTTTTCAAAAGGATACGAAAAAACAAAGGGCCCCTATTGGGGCCCTTTTATTCCGTCACGTCCGACTGTCAGGGTTTGTCCTGAAGCCTGATTACTGGCAAGCCAGGCATTCGTCGTATTTTTTGTCCTCACCCAGTTCCGGCTCGCGGTTGCCTTCGGGCTTTTGGCCTTCTTTGCTGCGCGCCCAGCTGGCGGCGGATTCGGCGCGCTGGATGGATTTGGAACGGCAGTAATACAGCGACTTCACGCCCTTCGCCCATGCTTCCCAGTGAATGCGGTGCAGGTCGCGTTTGTGCACGTTGCCCGGCAGGAAGACGTTCAGGCTTTGTGCCTGGCAGATGAACGGCGTGCGGTCCGCCGCATGTTCGATCAACCAACGCTGGTCGATTTCGAAGGCGGTTTTGAACACGTCTTTTTCATCCTGGCTCAGGAAGTCCAGATGCTGGACCGATCCTTCGTTGGTGAAGATGGACGACCAGATGTCATCATTGTTCAGGCCTTTTTCTTCCAGCAGTTTTTCCAGATACGGGTTTTTCACCGCAAAGGAGCCGGACAGAGTTTTGTGGTTGTACGCGTTGGCGGCGTTCGGCTCGATCCCCGGGGATGCACCGCCGGTGATGATGGAAATCGACGCCGTCGGCGCAATCGCCATCTTGTTCGAGAAACGCTCCATAATGCCGTAATCAGCGGCATCCGGGCAGGCCCCACGTTCGTGAGCCAGTTTCACGGAGGCGTCATCAGCCTGACGCTTAATGTGCTGGAACATGCGGCGGTTCCAGACCTTGGCCATCACCGATTCCATCGGGATCATTTTCGATTGCAGGAAGGAGTGATACCCCATCACGCCCAGACCAACAGAGCGTTCACGCATCGCGGAATATTTCGCCCGTTTCATCGAATCCGGCGCCTTGTTGATGAAGTCGGTCAGCACGTTGTCGAGGAAACGCATGACGTCTTCGATAAAGGTCGGGTGATCCTGCCACTGTTCGAACAGTTCGAGGTTCAGCGACGACAGGCAGCACACAGCCGAGCGTTCATTGCCCAGCGGGTCTTTCCCGGTCGGCAAAGTGATTTCCGAGCAGAGGTTGGACATCTTTACGCTGAGGCCCGCCATTTTATGGTGCGCCGGAATGCGGTCGTTCACATGATCGATAAACAGAATATAGGGCTCACCCGTTTCAATCCGGGCGGTCAGGATACGGATCCACAGATCGCGCGCCTTCACCCGGCCAACAACGTGGCCATCTTTCGGGGATTTCAGGGCCCATTCCTCGTCCTTCTCAACAGCGTGCATAAAGCTGTTCGGGATGACGACAGCGTTGTGCATGTTCAACGCTTTGCGGTTCGGATCACCCCCGGTCGGACGGCGGATTTCGATGAACTCCTCGATCTCCGGGTGCCAGACGGGCAGATAGATCGCGGCGGAGCCACGACGCAAGGACCCTTGGGAAATGGCCTGAGTCAGAGAATCCTGCACCTTAATGAACGGAACGATCCCGGACGTTTTGCCATTGCGGCCCACTTTTTCACCAATGGAACGCACATTGCCCCAGTAGGAGCCAATACCGCCGCCACGGGCCGCCAGCCAGACGTTTTCATTCCACAACGCCACGATGTCTTCGAGGCTGTCGCCGGATTCGTTGAGGAAGCAGGAAATCGGCAAGCCGCGGCTGGTCCCGCCATTGGACAGGATCGGCGTCGCCGGCATAAACCACAGCTTTGAAATATAATCATAAATTCGCTGGGCGTGGGCGGAATCGTCACCGTAATACATGGCGACGCGCGCAAACAGGTCCTGGAAAGATTCTTCGGGCAGAAGGTAGCGGTCAACCAGCGTGGCCTTGCCGAAATCGGTCAGGTTGGCGTCCCGGGAACGGTCAATCTGGATACGGTTGCCTTGGGGCACTTGGGCGACGTCAAACATGGCATGCTACTCCTGTAAAATTTCAAAAACTCTCCCTCGGCCACCTTGGGGGTGGACACGGCGGGGCGGGCTGTCGGGGCTTGTTAGATAGTGTGGATAAAACCACTATCCATTGCGGTCATCACAAATACAGCCTACTACATGTCGTGGTTGCCACCAAGATGACAAAATTCCTAGTGTCCGCGACTGACCCCCGTAAACACACGCTAAAAACGCCCGTGGCCGTGTGAGTTCAAGTCCTAAAATGAGTCTTATGCAGAGTTTTTTTATCCCCAAAAACGAAATCATCTTTCACGAATATCGAAAGGCTGCATTCCATATCGGCACCACATCTTGTGTCCATCTGTGCCGCCAGCCACCTTCCGGAAAAAGACCCTGCCCGCGACAGCGCGGGCCGCATACCCCCGCAAATGACCGCTTAAGTCCCTGATCCACCACAGGGAAATCATGATTGATATGGGGTCAGACCGCCCCCCAATCCACCCCTGATTCGCCCCCGATTCGCGGGCAAAAACATCAATCTTTTCAATGCGATACAAAAGGCACGATTCGTATTTCTTGCCGGGGTCCGCGCCATGGGGGTAAGGTGCGGCCAGCATAAAACACTGAACGGAACGTGACTCGGCTATGGAAAAACACGGCTTTATGATTGCGATTGGCGGGCTGTCAGGATCAGGCAAATCGACGCTGGCCGGACGATTGGCGGCGGAAACCGGCGCGGTCTGGCTGCGGTCCGATTCCATTCGCAAGGAATTGTGGGGCGTGGACCCGCTGACCAAATTGCCGCCCGAGGCCTATAGCCGCGATTTCAGCACCAAGACCTACGAAACGCTGGCCAGCCGGATGGAAGAGAATTTGCGCGCCGGACGGATCGTGATTGTCGATATGAGCTTTGCCAAGCCCGTCGAGCGCCGGAGCTTCGCCAACCGGGCCACCGCCTGTGGCGCGGGATTCCACGGGATCTGGCTGGATGCCACACCCGATACGCTGAAAACCCGCGTCGATGCCCGGGTGGGGGATGTATCGGATGCCGATTCCACAATCGTGACCATGCAATTGGGCTTTGATCTGGGCACCATTGAATGGAGCCGGATCAACACCGACCGCCCGGCGGACAGCGTTTACCGTCTGGCTTGTGCCCTGCTGGGCGTGGACGGTCCCACCCCCATCCGCGCGATCAAGCTGGAACCATAAGATTTTTACGCCAATCGGGCCAAAACTTGATATAAAGCGGGCGTGATCTATCCCTGCACGAACCCCGTAAGGCTTTCGCTATGACCAGCACTGAAATCTGCCCCCTGATCCGTAAACGCCAGACCAAGATCGTCGCCACACTCGGCCCCGCATCGGCGAATATTGAGATGATCGAGAAACTGGTTCTGGCCGGGGTTGATGTGGTGCGGTTGAACTTCAGCCACGGCGAACACGCCGACCATGCCGCACGCGTGAAAATTATTCGCGGACTGGAAACCAAACTGGCCCGCCCCATCGCCATCATCGCCGATTTGCAGGGTCCGAAATTACGCGTTGGCCGATTTAAAGACGGGTCCATCACGCTGACCGCCGGGCAAAAACTGCGCCTCGATCTGGATAAAACCGAGGGCGATGACACCCGCGTCAATTTGCCGCACCCGGAAATCATCAACACGCTGAGCCCCGGCGCGTTCATCCTGTGCGATGACGGCAAGGTGCGGATGAAGATTATTGATAAGGGCGCGGATTTCCTGATCGCGGAAGTGGTCAGCGGCACGAAATTGTCGAACAACAAGGGCGTGAACGTACCCGGTGTGATTTTGCCCATTCCGGCCCTGACCGAAAAGGATCGCAAGGATCTGGTCGCAGCCCTTGATATGGGTGTGGATTGGGTCGCACAAAGCTTCGTGCAGCGCCCGGAAGATGTCGCCGAAGCCAAAAAACTGATCGGTGGTCGCGCCGCGTTGATGGCAAAGATTGAAAAGCCCTCCGCGATTGAATTGTTCGCCGGCATCCTCGATCTGGTCGATGGCATTATGCTGGCGCGTGGTGATCTGGGCGTTGAAATTCCGCCGGAAGAAGTGCCCGCCCTGCAGAAAAAAATCGTGCGGCAGGTACGCCAGTCCGGCAAGCCGATTATCGTCGCGACCCAGATGCTGGAATCCATGATTGAAAGCCCAGCCCCGACCCGGGCGGAGGCCAGTGACGTGGCCACCGCCGTGTATGACGGCACCGACGCCGTGATGCTGTCCGCCGAAACCGCCGCCGGGAAATATCCGATTGAATCCGTATCCATCATGGACCGTATTTGCCAGCACGTCGAAGCCGACGATCTGTACCGCCGCATTATGGATGCCGACCATCCGGATGCGGATACCGATGCGTCGGACGCGATCACCATTGCGGCCTGTCAGGTGGCACAGACAATCAACGCCGCCTGCATCACCAATTACACATCATCGGGATCGACCACGCTGCGTACCGCGCGCCAGCGCCCGGCGATGCCGATTTTGTGCTTGTCGCATTCGGCACAGACCACACGCCGTTTGATGCTGTCCTATGGCGTGCATGCCGTGTACACGCCGGACGTCACATCCTTCGCCGACGCGGTCAAGATGGCCACGGAACAGGCCGCGATTCAGGGGTTGGCAAAGAAAGGCCAACGCCTGGTCCTGACCGCCGGGGTGCCGTTCGGCACGCCGGGATCGACCAACGCGTTGCGTGTGGCCTGGGTTGAATAAGGGGGGTTGAATAAGGGCATTGAATGAGGACATTAACGCCTCATTAACCAAACCCGGAAAAATCCGCAGAATAAGACGGGTTTTTCCAAAGCCCCAATCCACCATTTCCGCACCGCGCATAAAAAAGCGCGGGATTCCCTGTAAATTTTGACAAAAACCAGACGGACCCGTACCATAGTATATGGTTTGCATGATGAGAATCATGTGATCCGGGGGGCACGGAAATCCCCGTTTGAATCAGCGGCTGGCATCGGTCGTGACTGGTTCCGCTTCCCGCGAAGCGTATACCAGAGATGCCCTCTCTTTTTTCTCGGGAGTGTTCTCGCGTATTTTTAAAAGGCCGGCCACTTTTTCTGTTCTTAACGTCTTCGGCCCATTTAAATCTGATTTCAAACGGATCATCAAAAACGACCTTTTCGCCCCCGCACGCGCTATGGCTTTTTGCCATGACGCATCACACCGGGGGGATATTTTTTGATGCAACGTCGTTTTGTGCTGCTTGCTTTGGCCAGTGCCGCGCTTTTCGTTCTGCCGCAATCCGCATACGCACAGGAAGCCCAGCCGGGCGATGCGTGTACCACCAACGGTGCGGTGCGATCGACCGGTGGCCCCGAACAAATGCCGCGCCGCATGTTGATTTGTAACGGCACAACGTGGCAAAGCGCACTGGAACAAACCACGGCCGGTGCCTCTCTCCTGCAAATTGGCAACGATACCGGATCGTGCACCACGGCCAAGCTGGGCCGGATGCGCTATAACGGCACATCGACATGGGAATATTGCAACGGATCGACATGGGCGGCATTGGGGAGCGGCGGGTCAGCCGCAGGCGCGGACCGGGAAATTCAATTTAATTCAGGGGGCGCGTTTGGTACATCCAGCACATTCAAATTGATGGCGGATGGTGATTTATTGTTGGCCGGACCATCCACCACGGGAATCGCAAGCGTTCCCGTATCCGGAAACGGAACGCGGATGTTCTTTGACTTCCAAACATCGGCATTCCGCGTGGGATCTGTAAGCGGCACACAATGGGATAATGCCAATATCGGAATCTTTAGTGTCGCGATGGGATTCGACGCAACAGCCAGCAACACTTTCGGTGTCGCAATGGGGGCTCAAACAACAGCCAGCGGCGTTTCCAGTACCGCCATGGGGTATAACACAATCGCCGCAGGCTCTCATAGCTTCGCCGCCGGCAGAGACGTCAACATAACCACGACAGGCAGTGGATCGTTTGGCTTTGGGCTAACAAACACAGCCCCAGCCATAAAACCGCAAGTATCCGGCGCGCAATCCTTTGGCATTTTCATGGGCAACCAGAACGCCGTGAATTTTTCCGCCGCAAACACGATGGGATTATTCGGCGGCAAGATGGTGATTGATTCTACCATTCCCGCCACCAATCTGGTCGCGGACACAGAACTGGAAATCGATGGCACATTGAAAATCGGCAGTGGCGGCGAAGCATGCGATGCCAGCCGCGAGGGATCGATTCAATATCTGGCCGCATCCGATACATTCCAGGTTTGCGCCACAGCCGGAAGCTGGACCGCGTTGGGCGGCGGCGGGTCAGCCGCAGGATCCGACCGAGAAATTCAGTTTAATAGCGGTGGACTGTTCGGAGCCAGCGCATCGTTAACATTCGATGGCACAGGTTTGTATAGCAAGGCTTTTTACGCAGACGCGACAGGAACAACGTACGATACTGCCGTCTCGGGGATAAGTGATGCTGACTATGGTGTAGGTATTTATGGCGAATTCGCGCACCCAACAAACTTAGGAACTGGCGTTCGCGGGAACGCCAGCAGCACCAATGGCAGAGGCGTCCAGGGCACAGCCTCAGCATCGACCGGACTGACCTACGGCGGCTATTTTGAAAGTGACAGTTCCACGGGCGTGGGCGTTTTCGGGATAGGGGGCGGGTATGGCATTAGAGGGGAGGCCACGAGCGCCACTGGATACGCAGGATATTTCTTCAACTCCGCCGATGGCTGGGGCGTCTATTCTGAGAATGATATGGGTCTGGCCTCCGGAAAATATCTCAACTGGGGATCGACGCGCGGTAGTACCGGTTACGGTATTCGCGACAACGCGGGAACGATCGAATGTAAAAATTCTGGTGGAGCATGGGCCGCGTGCGCAGGAAGTGGCGGTGCATCGCTGTCCGGTCTGACCGCGGCAACGGCGACCAACACAATCGCCAATGCCAACTTCACGCAAACATGGAACTGGGACACGCTGACAACGGGCAATGGTTTGGTGTTGAACTCAACATCATTGACGGATGGCAATATCCTGCGCGTTATCAACAGCAACACCACAGGCACCGGCGGTCCGATTTATGCCCAAACGAGCAGCACTGGCGCATTAAGCTACGGTGTAGGCGGCTTTGCGACCTCCACCACCGGCTTGAGCATAGGCCTTTACGGTTCCAGCACCAGTTCAAGCGGTAACGGTGTTCGTGGCAGTGCGGGATCAGGCACCGGCACAACATCTGGCGTTTACGGTAGCGCCTCCAGCACCAGCGGGCGCGGCGTATATGGCTGGGCCGCCGCTGCAACCGGCACAACATACGGCGTCTATGGCACATCCGCCAGCTCCGCCGGATATGGTGGATATTTCACCAATACGTCCACGGGCGTGGCCCTGCGGGCACAGGGTGATTTGGAATATACCGGGTCATTGCGTGACATGTCGGATATTCGGTTGAAGGACAATGTGAAGCCGTTGGATTCATCACTGGAAAAAATCACGCAACTGCAGGGTATTTCATTCACCATGAAGGACAGCAATACGCACGACACCGAATATGGTTTTTCGGCCCAGGATGTTCAAAAAATCTATCCCAACCTGGTCCACAAAGCCAACGATGAAGATGGCACATTGTCGATGAATTACACCGGATTGATCGCCCCGCTGGTCGAAGCCATCAAGGAACAGCAAAAAGAAATTGAAGTGCTGAAGGCCGAAATCGAAGCGTTGAAAGCCCGGGAATAAGGCGCGATGAAAAAAACGATCCTGTCCGTTGCTATCATGCTGTGCGTGCTGTCTTCCGCCAAGCCATCATGGGCCGCTGGGGAACAACCGGGTGATGCCTGCACCGTGGCCGGGGCGGTTGTGCGTGCCGCCGGACCCGATCAGGTGCCATATCTCACCTTGGTGTGCAACGGATCGACATGGGTATTGGCGGATGAACGCACAACAGCCGGGCGTAGCTTGTTCAGAGTTGGCAGCGACGCCACCGCCTGTGACGCCACAAAGGTGGGGCGAATAAGTTTTGTTGGGGGGGCGTGGACCTATTGCCACGATTCCAGCTGGAAAGCGATTGGGCTTCCGAATTGTGCAGCAGGGCAGGGTTTAACGCAGGGTGTGGCTGGTCCGGAATGTTGCCCGACAATGGGCTGGGCTGTGACCGCTCCGCCAGAGGCCAACAGCTGGTCCAATGTAACGTATGGAAACGGTTTGTTTGTTGCGGTTTCGACAAACGGTACAAACCGGATCATGACATCCCCCGATGGCGTTAATTGGACTGCGCGTGCAGCCCCAGAAAATTTGGCGTGGAATAGGATTTTTTTTGGTAACAATCTGTTCATTGTTACGACATCGTCCACCAGTAACCGGATCATGACGTCCCCTGATGGGGTGACGTGGACAGCGCGGACCGTTCCGCAAAGCAATAGTTGGCAAGGTGTAACGTATGGCAATGGTTTGTATGTTGCGGTTTCGTCAAATGGTACGAACCGTGTCATGACATCCCCCGATGGGGTGACATGGACGTTGCGGACGGCAGCCGCAGATAATGCGTGGATGGCGATTACCTATGGTGATGGATTGTTTGTTGCGACTTCGTCGAATGGGACAAACCGTGTCATGACATCCCCCAATGGGATTACATGGACATCCCGGTTTTTGCCGGGAACGGACCCGGCTGTATATTCTATAGCGTATGGCAATGGGCGTTTTGCGGGCGTCAGTTCTGGTGGTCGTGTTTTTACATCGACAGACGGGATAAACTGGTCACAGGCCACATTATCCGAAACGAATTTCCTTCGCTCGATTACCTTTAATGGGACTAAGTTTATTACCGTATCGAATAACGGAACCAGCAGGATTGCAACATCTGTGGATGGGGTAACGTGGCAATTGTATCAGGCCCCCGAAGCAAGCAATTGGATGTCTGTGACCAATGGTGGCGGAAAGGTTGTGGCCACGGCCACGTCCGGAACAAATACGATAATGTACTCCATTGATGTACCGTGTCCGTAATGGGAAAAGCGATGAAAATATTATTGCATGCTACGATGATAATAAGCCTTGCAGGATTTTCTTTGCACGCATACGCGCAGGAAGAACAGCCAGGATCCGCCTGCACACCATCAGGAAAAATTGTAGTAACCGGCGGCCCCGAACAAATGCCGCGCCGGGTGATGATTTGTAATGGATCAACATGGCGCACGTTTATGGAACAGAACACGGATGGAAAAAGCCTGTTCCAGGTGGACAATGACACCGGCAGCTGCACAGCGGCGAAGGAAGGCCGCCTGCGCTTTAACGACAGCACCAACCTCTGGTCCTATTGCCGCAGCGGATCATGGACCAACTTCACCACCCTGCCCACATGTGCGGTTGGGCAAGGATATGTCATGACCGGATCGGGTTGGGGATGTTGCCCAGAGGGCGGAACCACATGGACCAGCGGCAGCAATATTCCGGGCAGCAACGATCTTTTCCCGGGCGCGTTGACATACGGCAATGGATTATTTGTTTTGGTAGGGGAATCCTTTGTTGATTCCAATCATTTCAGAACCTCCCCTGATGGCATCACATGGACACCGCGCACAGCCCCGGATCCCGGCTGGGGCTGGCGCACCGTGACATACGGGAACGGGACATTCGTTGCCATTGGCGGATCATGGTCCGTGACCGACCTGACCAGTTCAACCGATGGCATTACATGGACGCCCCGAACATCACCCGCCAACAATTACTGGGCCGGCATCGCCCATGGCAACGGTTTGTTTGTTGCGGTTGGCTATAGCGGAGGAACCGAAGTGATCACATCCCCCGATGGCATCACATGGACAGAACGCACCGGAATCGCAGAGGAATGGATCGACGTGACCTATGGCAACGGCCTGTTCGTCGCCGTCGGGCGGACGGGAACCAACCGCGTCATGACATCGCCCGATGGGATTACATGGACCGCGCGCACAGCCGCAGAAAACAATGAATGGACATCGGTCACATATGGCGGCGGACAATTTGTGGCCGTCGCACAAACGGGAACCAACCGCGTCATGACATCCCCCGACGGCATCACATGGACCGCACGCACAATCGCCGCAGAGCAATGGCAAGACGTGACGTATGCCGCAGGGAAATTTATCACTGTGGCTTGGGCCGCCGGGCAAGAGATTGCGACATCAGAAGACGGTATTACATGGACCCTGCGCAACACACCAGGATACGAAGACTTTTTCCAAGTGGCCAGCAACAGTAGCACGCTTGTTTCAATTGGCCTGTATGGGGCCACAATGCGGTCCCCCATAGCGATTTGCCCATAAGGACAAGTTCATGAAATTAAAATTTTTTCACATCGCCCTGTTTTTGTCCGTGATCGGCGCCAGTACGTTCACTCAAGCCCAAGAAGCCCAACCCGGTGCCGCATGTTCCGGGGCGGGAACATCCACATGGACCGGCGGGCCGGAGCAAATTCCCGGACGATTGTTGATTTGCAATGGATCAACATGGCAAGCGGTGCAGGAAACGGCCAGCACAGGCCGCACATTGTTCCAGACCGATTACGATTCCGGCGCATGCGACAGCGCAAAAGAAGGCCGCCTGCGTTACGATAGCGCAAGCAATGCATGGAGCTATTGCTACAACAACGCGTGGGCGAACTTCGCCTCGGTGCCCACTTGCGCGATCGGTCAGGGATTAACCATGGCCGCCACAGGATGGAGTTGCTGTCCAAAAGTTGAATCAACATGGACCGGGCATTTATCAGCGGAAAACAACGCCTGGCGATCGGCCGTCTATGGTAACGGCATATTTGTGGCCGTAGCCAGTAGCGGCACAAACCGCGTCATGACATCGCCCGACGGCATAACATGGACCGCACGCGCAGCCGCAGAAGCCAACGTTTGGCGTAGCGTAACATACGGAAACGGATTGTTTGTCGCCGTAGCCAGCAGCGGTACAAACCGTGTCATGACATCACCCGACGGCATCACGTGGACCGCGCGCACGGCAGCGCAAGCAAATCAATGGTATAGCGTCACGTACGGAAATGGATTGTTTGTCGCCGTTTCCATCGACGGCACAAACCGCGTCATGACATCGCCCGACGGCATAACATGGACCGCGCGCAGCGCCGCGCAGGCGAACGTCTGGCTGTCTGTCACATATGGCGGGGGCACATTTGTCGCCGTTTCAAACGGCGGCACAAATCGTATCATGACGTCCACAAATGGAACAACGTGGACGTCCGGAAACGCACCTGGAAGCAGTTTATGGACATCCATCACCTATGGTAACGGAAGATTTGTAGCCGTTGCTCAAGCAAGCACGGCCGCTATGTACTCAACAAACGCCATCAGCTGGGCCGCTGGTACTTTGCCAGAATCCAACAACTGGCAAGGGTTGGCGTATGGGAATGGTATATTTGTAGCGGTTGCAGCAAACGGCACAAATCGCATCGCATCATCCATGGACGGCATTACATGGACATCGCATGCCGCCCCGGAAGCAACGTTCTGGGAGGATGTCAGTTTTGGCAACGGCGTGTTTGTTGGATTGTCATACGGCGGTACAAATCAAGTTATGAGATCGCCAGTTACATCCTGTAACTAATAATCAGCGCGAGCCGCCGTTCAGTCGATCCGGAAATTTTGGCAAACGCCCATCCGCAAATAACTTCACCTGTGACAACGGGTTGTTATTCGCGGCCATGCGGGATTCGCCGTCATAAATGCGGGTGATGGTCCAGTTGGGTTTTGTATTGGCGCCATTGCATTGGCAGGGGTCGGGTTTTTCGATCACGAACACATCGCAATTCCCCGGCGTTTTCAGATCATCATACGCCGATAACGGCAAATGGAACCACCGCATCATAAAGGCCTTGATCACGCCGCCATGCGCCACGATCATAATATCGTCGGTATCGTTTTTATCGTGGGCGCGATGCAAACTACCGATAAAGTCGGATGTGCGCATTTGCACCGCCATCGGTGATTCCCCAAAGGGGGGCGCGGACAGATAGGCGCTGTGCTTATGCACCTGGGTCGATAAGTGGGCCAGGGCCTGGGCAAATTTACGGCGCAAGAACCCCTTTTGCGCATCGATATAGGCCAGCGCGCCAAAACTGTGTTCCACCAAACGGGCATCTTCGCGAATTGTGTAATCCCCCGCCAGCGCATCATCCCCCATGCCGGTCAGAATGCCGGACAAGGTTTGCCGCGTGCGCAGGAAGGAACTGACATATATATGGGGCCATGACGGCGCATCGGGGGATTCGGACCCAACCCCGGAATGCCAAACATGCGGCCAGCGCCCCGGGCGGTTGCCGCGCGCCGGGTCATTGAACCAATCGCGCAGGAACACGCCCGCATCGCGGGCCTGGGTCCAGCCATTATCGGTCAATGACACTTGTGGGTCACCGATGCGGGAATAGGTGTTCCAATCCACGTTGCCTTCGGATTCGCCGTGTCTGATCAAAATGATACGCATGGCGGGCAGGATAATGGGTCAAAGACACTATGAAAAGCGCTATCATCTTATCCCCCGTTCTATCCCCGTCTTTTCCCCAGAATGTCTATATATAGTGTTTATCAATCAAAAAATTCACTATATATTGATTTTCGTGATGTTGTGGACGAGCGGGTGAAAAGACTCGGATTCCACGGCCAATCACGTCACAACTGTTTCTACGTCTTCTATCCAAACCGCCAGACCCAAGGAGTACCCTCATGGCCCAGAATGTTGCCACTCGCGCGCCCATTGACGATGTGACCAATGCCGCATCCCCGTTGTTGCAGGAACGCCATGTGTACAAACCGTTTTTGTACCCCTGGTGCTATGAGGCGTGGTTGACGCAGCAGCGCATCCACTGGATCCCGGAAGAGGTGCCGCTGGCCGAAGACGTGCGCGATTGGAAGAACAAACTGACCGCAGCGGAAAAAACCCTGCTGACCCAGATTTTCCGTTTCTTCACCCAATCCGACGTGGAAGTGAATAACTGCTATATGAAGCATTACAGCCAGGTGTTCGGCCCGGTTGAGGTGCAGATGATGCTGTCCGCCTTCTCCAACATTGAAACGGTGCACATTGCCGCATACTCCCACTTGCTGGACACCATCGGCATGCCGGAGATCGAATATTCCGCCTTCATGAAATACAAGGAAATGAAGGACAAATTCGATTACATGCAGGGCGCATCCATGGCATCCCGCCGCGACATCGCCAAAACCATGGCCATGTTCGGTGCCTTTACCGAAGGGCTGCAATTGTTTGCGTCTTTCGCGATCCTGATGAACTTCCCGCGCTTCAACAAAATGAAGGGCATGGGCCAGATCGTCACATGGTCGGTGCGTGATGAAACCCTGCACTGCCTGTCCATGATCAAATTGTTCAACGCGTTCATCGCGGAAAACAAAGATATCTGGGACGATGAATTGAAACGCGAAATCACCGAGTCCTGCAAAACCATTGTCGGGTTCGAAGATGCGTTCATCGACCTGGCGTTCGAGGGCGGCGAGATCGAAGGCCTGACCCCGCAGGAAGTGAAAAACTACATTCGCTATATCGGCGACCGCCGTTTGCAGCAGCTGAATCTGGACCCGGTCTTCGGGATTGAGAAAAACCCGCTGCCGTGGATGGACATTATGCTGAACGGGGCGGAGCACGCCAACTTCTTCGAAAACCGTGCGACGGAATATTCCCGCGCATCGACCGAAGGCGCATGGGAAGACGTGTTTGACGATTCCGTCTTCTCCGGCAATTACGGCAAGAAAATTGGCGGCAGCGACGGCGAAAGCTCCGCCGCGTAAGCCACACACAACACCATCGAACCCCCGCCCCACCCGGTGCGGGGGTTCTTTTTTGGCCCAAAATATCCTTAGGAACAGGCCGTAGGCGCTTGAAAAACACCCCTCTACCCTCCATTTCACTGAAACTGTCTAAAGTATAGTTCCCCTACGACCCGCCTCGAAAATGGGCCGCATGGAGTTTTATGAGTGTGTTTCGTTTATTTGTTCTGAGTACTCTTTCTGTTCTGGCGTTCTCGGTGCACGCCTCCCCCGCACACGCAGAGGCCCAACCGGGCGATGCCTGTACCATCAATGGCGCGGTTCAGGAAACCGGCGGCCCGGAACAGATGCCCCGCCGTACCCTGATCTGTAATGGCACCACATGGCAAAACGCGCTGGAGCAGACCACAGCAGGCGCGTCGCTCTTTCAAGTTGGCAATGATACCGGATCCTGCACAGCGGCCAAATTGGGCCGCATTCGCTATAACGGGTCGGCCACATGGGAATTTTGTAACGGGTCCAGCTGGATCAATATGGTTGGGGGAACGGCCCTGTCCGGCATAACATCCGCAACGGCGACCAACACCATTAACAACACCACCTACACCCAAAGCTGGGGTTGGAATGGGATTACCACACAAAACGGCCTGGAAATGGGATCCAACAGCCTGACCAGCGGTAGCCTGCTGGGCGTAGCCGTCACCAACAGCGCATCCACAGGCAATGCGATTGGTGTAGCCACAAACGCTACAGGTAATGGTGCCACAGCCATTCGCGCGATCAGCACGGGCACCAGCGGCGTCACCGCCGCCATATACGGCGAAAACGCCAGCACTGGTGGGGCTGGCATTTCCGGCCAAGCCACAGCCACATCAGGGACCAATTACGGCGGATATTTCCTGAACACCAGCAGCGGTGGATACGGGGTCTATGCCGCAAACACGGCAGCATCGGGGACCGCCTTCGGCATTCGCGGGGCCACCAGCAGCACAACGGGAATCGGTGTATCCGGCGCCGCCGTGGCCACCACGGGATTGAACTACGGCGTCCATGGAACCACCGCCAGCACAGGCGGACGCGGCGTGTTCGGCTCCGCAACGGCTACGACTGGGGCCACCTATGCCGTGTATGGCGTGAATCTCAGCACGGAGGGTCACGCCGTTGTGGGTCATTCAACAGCCACAACTGGCACGACATTTGGCGTCTATGGTGTGAATGAGAGCACCGACGGACGCGCCATTAACGGCACAGCCACCGCGACATCCGGTGTCAATTATGGCGTCCATGGGCGCAGTGACAGCGCCAGTGGCTTTGGCGGATATTTCGCCAACACGGCAAACGGCTGGGGACTATATTCGACCAACAATGTCGGATTGGGGGCCGGGATGTACCTCAACTGGGGCACGACACAAGGCACCGGTGGATATGGGTTACGTGACAATGCGGGCACCCTGCAATACAAAAACAGCGGCGGGGCGTGGACCAATATTGCCAGCGGCGGAGGGGGTGCGGCCCTGTCTGGATTGACGGCTGCCACAGCCGCAAATTCGATCAACAATGCCGCCTATGCACAGGCATGGGCGTGGAACAGTTTGGGAAATAACAACGGCCTGACCCTATCAACAACGGCAACCACAGGAACAGGGCGTTTATTATACATCAACGCATCGGGTTCCACCGGCCAAAACACTGCCTTGTACGCCCGAAATGCGGGCACGAATGGAGTCGCAATATTTTGTGATTCATATGATGATTATGGATGCCGCGCGACCGTCCAGTGGACATCCACATCTGACGCTCGCCTGAAAAAGAATATCGAACCCCTTAAAAACGAATATGGGCTGGATGCCATTATGCAGCTCAAGCCCGTCACCTATAACTGGAAAGAGAAACCGGACGATAGTAGAAAATCTCTCGGATTTATCGCGCAGGATGTAGAAAAAATCATCCCTGAAATCGTCGGTGAAGATTCCGCACCAAAAGAAATCACACTACCGGATGGATCGACAGAAACAATCGAAAACCCCAAAGCTGTTCAATACTCCGCCATAGTCGTCCCCTTGGTCAAAGCGGTACAGGAATTAAAGGCCGAAAACGACGCCCTGCACGCTCTCAATGCGGATTTGTTACGCCGGGTAGAGGCGTTGGAGGCCGCCGCGGAAAAATAGACAAACCAGACCTTTGGCCCACCAAACCTAACCCAAAGATCAGGCCCTAACACCTTGATTTCGACGAAAAACATATCATTTCAGTAAAGCATCTTTTTTGTGAATCCCTACAACCCGCCTCGAAAATGGGCCGCATGGAGTTGAGTGTGTTTCGTTTATTTGTTCTGAGTGCTGTTTCTGTTCTGGCGTTCTCGGTGCACGCCACCCCCGCACACGCCGAAGCCCAGCCGGGCGATGCCTGTACCATCAATGGCGCGGTTCAGGAAACCGGCGGGCCGGAACAGATGCCCCGCCGTACCCTGATCTGTAATGGCACCACATGGCAAAACGCGCTGGAACAAACCAGCGCGGGCGCGTCCTTGTTTCAGGTCGGGAACGACACAGGATCATGCACAGCGGCCAAGTTAGGCCGCATTCGCTATAACGGGTCCACCACGTGGGAATTTTGCAATGGGTCCAGCTGGATCAACATGGCCGCAGGCACGGCGATTTCCGGCCTGACATCCGCAACGGCGACCAAAACCATCAACAACACCACCTACACCCAAAGCTGGGGCTGGAACGGGATTACGACCCAGAACGGGTTGGAAATCGGGTCCAGCACACTGACCAGCGGAACCCTTTTGGGCGTATCGGTCAGCAACACATCATCCACTGGGCAAGCCCTGGGCGTTATTAACGCTGGCACCGGCACAAACGCCATGGCCATCTATGGTGAGGCCTCTGGATCCAGCGGCACCACATACGGTATTTATGGCCGCAGCAATAGCACGACCGGCCGGGGCGTCTATGGTGAAGCGGCCGCGACCACTGGTGCAAATTACGGCGTCTTTGGCACAACAGCAAGCACGTCCGGGACAGGCGTTCGGGGTTCAGCCACAGCCGCCACGGGCACAAATTACGGCGGACTTTTTGCAAGCGCAAGCACGACCGGATATGGAATTTTTGCCGAAACAACCGCCGCCACAGGCGCAAATTACGGCGGATATTTTTCGAACGCTAGCACAACCGGGTATGGTCTTTACGGCTTGGCCTACGCCACCACGGGCGCAAACTACGGTGTCTATGGCCGATCCGCGAGCTCCACCGGATATGGCGGATATTTCATCAACACGCACGCCAGTGGCGGATGGGGCGTATATTCCGCAGACGATATTGGCCTGGCCGCCAACATGTACCTCAACTGGGGCACCACGCGTGGCAGTGGCGGTTACGGTATTCGCGACAACGCCGGGACCATCGAATGTAAAAATTCCGGCGGAGCATGGGCGAACTGCGTCCAGACCAGCCTGGCCCTGAGCGCATTGACCGCCGCCACAGCAGCCAACACCATCAACAACGCCGCCAACACCCAGACATGGCAATGGAACAGCATGACCACGGGTGACGGGATGGTCATAACCTCCTCATCGGTAACCAGCGGTCAAGTGCTGACTGTATCCGCCACCAACACCGCCAATACAGGGCAGTCGATTTATGCATCCAACAACAGTACAGCCAACAACGCATCAGCCCTCTATGGATACGCGTCGGGCGCATCAGGGGCCCATAATGCCGTGGCTGGCGTGAATAACAGCACGACAGGGCGCGGCGTGGTCGGCAATGCCACCGCGACGTCCGGCGCGACAACGGGCGTCTATGGCGTTGTATCCAGCACGGCAGGCAAAGCCATTCACGGCAATGCGGTGGCGACAACCGGCGCAAATTATGGCGGCTATTTTGAATCGGACAGCACCGGCGGAACGGGCATTTTTGCATATGCCACAGCCACCAGCGGCGTGACCGCCGGCGGATCTTTCACCACGGCCAGCCCCCTCTCCCGCGCGGTGACCGCCAATGCGGCGGCAACGACGGGCGAGAATTACGGTGTGTATGGCCGGTCCGGCAGTTCATCCGGATTTGGCATTTATTGCGAAGCCGCCGCCAACGCCAATGGATGCGGTGGAAACCGGGCCTGGTATAACGCGTCGGATGAACGACTGAAAAAAGACATCGTCCCGTTGAGCGCCGATGAAGGTCTGGCCGCCATCATGCAACTCAATCCCGTGCATTATAAATGGCGTGATGCGCAGGCCGAAGATCAAAGCGAGATGGGTTTCATCGCGCAGGAGGTCGAGCAGGTTCTACCTGAGCTGGTCGGGATTGGACCGGATACCGAAATCACATCCGAAGACGGGGCCAAGGAAACCATCGAAGACGCCAAATCCATGAGCTATGCCACCGTGGTGGTGCCGCTGGTCAAGGCGGTGCAGGAATTGAAGGCCGAAAACGATGAATTGCGTGCCCGACTGGAAAAACTGGAAGCGCAGCAAGCCGCAACACCTTGA
Protein sequences of DBSCAN-SWA_3 >NC_016026|2325817:2372236|2352269_2352746_-|WP_014103880.1|DBSCAN-SWA MTLKTLFRAAAAHLFGGPVVQKKIDGFDVSLDRQDNGCFVFRVRANGQNPEYPAQDEPVLMSFRRKANPGSGTTKAMQGPEVIVHNSARYVGFELVEVFNNGKDFVGRYTPDHHGVILRADIVYADGQTGSAAMGAWVRKSDQEKPDWKKLGRGPMMI >NC_016026|2325817:2372236|2370193_2372236_+|WP_148260569.1|tail|DBSCAN-SWA MPRRTLICNGTTWQNALEQTSAGASLFQVGNDTGSCTAAKLGRIRYNGSTTWEFCNGSSWINMAAGTAISGLTSATATKTINNTTYTQSWGWNGITTQNGLEIGSSTLTSGTLLGVSVSNTSSTGQALGVINAGTGTNAMAIYGEASGSSGTTYGIYGRSNSTTGRGVYGEAAATTGANYGVFGTTASTSGTGVRGSATAATGTNYGGLFASASTTGYGIFAETTAATGANYGGYFSNASTTGYGLYGLAYATTGANYGVYGRSASSTGYGGYFINTHASGGWGVYSADDIGLAANMYLNWGTTRGSGGYGIRDNAGTIECKNSGGAWANCVQTSLALSALTAATAANTINNAANTQTWQWNSMTTGDGMVITSSSVTSGQVLTVSATNTANTGQSIYASNNSTANNASALYGYASGASGAHNAVAGVNNSTTGRGVVGNATATSGATTGVYGVVSSTAGKAIHGNAVATTGANYGGYFESDSTGGTGIFAYATATSGVTAGGSFTTASPLSRAVTANAAATTGENYGVYGRSGSSSGFGIYCEAAANANGCGGNRAWYNASDERLKKDIVPLSADEGLAAIMQLNPVHYKWRDAQAEDQSEMGFIAQEVEQVLPELVGIGPDTEITSEDGAKETIEDAKSMSYATVVVPLVKAVQELKAENDELRARLEKLEAQQAATP >NC_016026|2325817:2372236|2347290_2347980_+|WP_014103874.1|DBSCAN-SWA MQQKRNPSRPQSGESGNVIFFILLGIALIGLVTAALRSGGSESANIDAEQMVIKVSQVQQNASALARATEVAYQNVREESAISFAHPDAHSDYGTYGTTPSAEVFHVQGGGADYLDPPAGINDGTAWQFYGTTAAPDVGTDAADLIAVLPNVTPEFCTAMNKSLGQLGATNDTGTCVYDSSGVRFSGVFAVAPNTMDGSSFTFTPAPQACVTCTGAGNTQHFYKVILAR >NC_016026|2325817:2372236|2357272_2358712_+|WP_014103886.1|DBSCAN-SWA MTSTEICPLIRKRQTKIVATLGPASANIEMIEKLVLAGVDVVRLNFSHGEHADHAARVKIIRGLETKLARPIAIIADLQGPKLRVGRFKDGSITLTAGQKLRLDLDKTEGDDTRVNLPHPEIINTLSPGAFILCDDGKVRMKIIDKGADFLIAEVVSGTKLSNNKGVNVPGVILPIPALTEKDRKDLVAALDMGVDWVAQSFVQRPEDVAEAKKLIGGRAALMAKIEKPSAIELFAGILDLVDGIMLARGDLGVEIPPEEVPALQKKIVRQVRQSGKPIIVATQMLESMIESPAPTRAEASDVATAVYDGTDAVMLSAETAAGKYPIESVSIMDRICQHVEADDLYRRIMDADHPDADTDASDAITIAACQVAQTINAACITNYTSSGSTTLRTARQRPAMPILCLSHSAQTTRRLMLSYGVHAVYTPDVTSFADAVKMATEQAAIQGLAKKGQRLVLTAGVPFGTPGSTNALRVAWVE >NC_016026|2325817:2372236|2332334_2332748_-|WP_041794092.1|DBSCAN-SWA MSMGLFRAIGQMLGCLSPAHIGFVTLLSGARSVGRDVYGNRYYSAKARPGYKLDRRWVMYKGVPEASNVPPEWHGWLHHQTDVVPASNAESFRRPWQKPHMPNMTGTNLAYRPPGHVLSGGQRDAATGDYESWTPPQ >NC_016026|2325817:2372236|2367944_2369909_+|WP_014103893.1|tail|DBSCAN-SWA MSVFRLFVLSTLSVLAFSVHASPAHAEAQPGDACTINGAVQETGGPEQMPRRTLICNGTTWQNALEQTTAGASLFQVGNDTGSCTAAKLGRIRYNGSATWEFCNGSSWINMVGGTALSGITSATATNTINNTTYTQSWGWNGITTQNGLEMGSNSLTSGSLLGVAVTNSASTGNAIGVATNATGNGATAIRAISTGTSGVTAAIYGENASTGGAGISGQATATSGTNYGGYFLNTSSGGYGVYAANTAASGTAFGIRGATSSTTGIGVSGAAVATTGLNYGVHGTTASTGGRGVFGSATATTGATYAVYGVNLSTEGHAVVGHSTATTGTTFGVYGVNESTDGRAINGTATATSGVNYGVHGRSDSASGFGGYFANTANGWGLYSTNNVGLGAGMYLNWGTTQGTGGYGLRDNAGTLQYKNSGGAWTNIASGGGGAALSGLTAATAANSINNAAYAQAWAWNSLGNNNGLTLSTTATTGTGRLLYINASGSTGQNTALYARNAGTNGVAIFCDSYDDYGCRATVQWTSTSDARLKKNIEPLKNEYGLDAIMQLKPVTYNWKEKPDDSRKSLGFIAQDVEKIIPEIVGEDSAPKEITLPDGSTETIENPKAVQYSAIVVPLVKAVQELKAENDALHALNADLLRRVEALEAAAEK >NC_016026|2325817:2372236|2343409_2343757_-|WP_049782190.1|protease|DBSCAN-SWA MGDDHGDDLPDDGEGEERRQAGLLLKPRPKTKKPSMYKVLLLNDDYTPMEFVVHVLERFFSKNRQEATDIMLHVHRRGVGICGIFTYEVAETKVAQVMDFARASEQPLQCTMEKE >NC_016026|2325817:2372236|2340941_2343242_-|WP_014103868.1|protease|DBSCAN-SWA MLSRNLEKTLHRALALANDHGHEYATLEHLLLALTDDQDAMAVLRSCGISLPDLKDQLAHYLDNELTYLVNRNGEESKPTTAFQRVLQRAAIHVQSSGREEVTGANVLVAMFSERESNAVHFLQEMDMTRFDAVNYISHGIAKVPGQSEIRPPEGTEPGESASGDDKNAKPGRDALETYCTNLNNKARRGKIDPLIGRHEEVDRTVQILCRRSKNNPLYVGDPGVGKTAIAEGLAKRIVDGEVPDVLKTAVIYALDMGALLAGTRYRGDFEERLKAVLGEIEKVDNAVLFIDEIHTVIGAGATSGGAMDASNLLKPALSNGSLRCIGSTTYKEYRNHFEKDRALVRRFQKIDVKEPSIEDAIKILKGLKPYYEEHHNVKYTNDAIKAAVELSARYIGDRKLPDKAIDIIDEVGAAQMLVPASKRKKTINVKDIEDVIAVIARIPAKAVNKDDRTLLKNLDRDLKTMVYDQDQAIAVLSDSIKMARAGLRDAEKPIGSYLFSGPTGVGKTEVARQLAKSLGVELKRFDMSEYMEKHSVSRLIGTPPGYVGFEQGGLLTDAIDQNPHCVLLLDEIEKAHPDLYNILLQVMDYGKLTDNNGKTVDFRNVILIMTTNAGAAQMAKSPIGFGRTMEIDGDNDEIRRLFTPEFRNRLDAIVPFQNLKPETVARVVDKFVIQLEAQLADRDVTIVLSDEARAHLAEIGYDPAMGARPLARVIQEKIKRPLAEELLFGKLTKGGVAHVDFKGGELIFDYEEARKSGAREKIDQD >NC_016026|2325817:2372236|2344990_2346439_+|WP_014103872.1|DBSCAN-SWA MGYDVATLRQHARTFFSTALGLGLMAGLMLAATPAHAAKKKSQDNPRYASIVMDADTGAILHERYADKSLYPASLVKMMTLLMVFEAMDRGEMNLNTRIRISQHAASMQPSKIGLKAGSTIRVEDAILALVTKSANDMAAALGEAVGGTESNFASMMTRRAREIGMRNTTFRNASGLHHPSQVSTVRDMAILSRHIIYNQPRNFRFFSTKNFRYNGVNYHNHNRLMSTYAGMDGLKTGYTVPAGFNLAATAVRNDRRLIAVVFGGRTTQSRNAHVADLLDRGFKQIGTGQVMMAQTNVRATAPAPVPNRKPGTETGTTVVAATHFEEMMGAGDSDPTASTRIESGMVAVNALRNAGAVSRAEPQTPTQQPVQPQVIRTPAAQAALQPQSMTTTAPIVTSGTEHTWSVQIGAFTNRMKTDQLLTSSQGKLPPQLAQVAQPVIVPLSAGSSTVFRARIKGFSRNQAIEACRYFTDCMTISPRAF >NC_016026|2325817:2372236|2335024_2335468_-|WP_014103862.1|DBSCAN-SWA MAGISHTQSAPENVVPLSASGRYQGGRAPIDITSHLVTLVLAFVCIVLLCIAPLIMRGFLVSVSDPGLILYGFCTVLAFQLWGAFLAQIRFFAADFKHCARGLCLAMVVVPFLIPMASSIFVQAGLFLLFWMAAVWYGMLHSVFSRA >NC_016026|2325817:2372236|2361718_2362981_+|WP_014103888.1|DBSCAN-SWA MKKTILSVAIMLCVLSSAKPSWAAGEQPGDACTVAGAVVRAAGPDQVPYLTLVCNGSTWVLADERTTAGRSLFRVGSDATACDATKVGRISFVGGAWTYCHDSSWKAIGLPNCAAGQGLTQGVAGPECCPTMGWAVTAPPEANSWSNVTYGNGLFVAVSTNGTNRIMTSPDGVNWTARAAPENLAWNRIFFGNNLFIVTTSSTSNRIMTSPDGVTWTARTVPQSNSWQGVTYGNGLYVAVSSNGTNRVMTSPDGVTWTLRTAAADNAWMAITYGDGLFVATSSNGTNRVMTSPNGITWTSRFLPGTDPAVYSIAYGNGRFAGVSSGGRVFTSTDGINWSQATLSETNFLRSITFNGTKFITVSNNGTSRIATSVDGVTWQLYQAPEASNWMSVTNGGGKVVATATSGTNTIMYSIDVPCP >NC_016026|2325817:2372236|2365570_2366431_-|WP_187287635.1|DBSCAN-SWA MIRHGESEGNVDWNTYSRIGDPQVSLTDNGWTQARDAGVFLRDWFNDPARGNRPGRWPHVWHSGVGSESPDAPSWPHIYVSSFLRTRQTLSGILTGMGDDALAGDYTIREDARLVEHSFGALAYIDAQKGFLRRKFAQALAHLSTQVHKHSAYLSAPPFGESPMAVQMRTSDFIGSLHRAHDKNDTDDIMIVAHGGVIKAFMMRWFHLPLSAYDDLKTPGNCDVFVIEKPDPCQCNGANTKPNWTITRIYDGESRMAANNNPLSQVKLFADGRLPKFPDRLNGGSR >NC_016026|2325817:2372236|2348783_2349446_-|WP_014103876.1|DBSCAN-SWA MPALSRLVRVFTLASAVVLAGGLAACADTTMPTGYTYHNEVYKAPPGKEASKPAHVVGQGTAPAMVVDQAPVQQTEGLPPGFTNADIDSLTVVAEDLLAKLVRNFGRPMEPTMVITHPANTNRDAAMEMALRRAMQNQGIPVSTAAGEGPFSLGYAIADRAQNGAGAYTLDAEIFLVVQGQEITRQHGSVDVSAAPVYMNNPPVPVADVSSDPIVVDSGF >NC_016026|2325817:2372236|2331265_2331730_-|WP_049782188.1|DBSCAN-SWA MDDFPVVKLQSLDKVTARTMTFEANVGSTVKFGTLYIKIQACRKAPPIETPESAAFLQVWELTPKAEESQWIFSGWMFASSPALSPMDHPIYDVWVLDCLTHKTGEEPPAPDPADASAAPEGAAATDAPAAPVASDTPAADTDIPVEEPPPSAE >NC_016026|2325817:2372236|2329835_2330570_+|WP_148260477.1|tRNA|DBSCAN-SWA MRGTYPQKACTLWNVSVDRMSANPPITIQDVLTAYAMGLFPMAERADEQAFYWYDPPLRGQMDIVGLHVPRSLKKFVLKSPFTITVDQDFPGVMAGCAARTDDRPQTWINDGIRTLFTDLYRAGFAHSVEVRDAGGALVGGLYGLAIGAAFFGESMFSRESGASKTALIHLCARLWRGGFTLLDTQYLNPHLEQFGAYEIPRDQYLARLHQAIPRPADFNLRRNPGLSEDQLVREWFAMRATNQ >NC_016026|2325817:2372236|2349575_2351036_-|WP_014103877.1|DBSCAN-SWA MTRNSLQNAVVAVIGDIMLDRYVTGDAVRLSPEAPVPVLSNTTTRAVLGGAGNVLANLRGLGVRVHLAGVVGDDADAGALRAMMADIGADAAGLLVATDRPTIVKTRFMAGNHHLLRLDAEDARPVGDDVAARLIAQVQAALNDGARALILSDYGKGTLTPAVIAGVIDAARAKNIPVLVDPKGRDYSIYRGASIVTPNRKELAEATNNLPNATDSDIVVAARHLIKTSGIDCVIATRSQDGMSVVRADGTPLHLRAQARAVFDVSGAGDTVIATIAAGLAAGMDLNDAAHLANRAAGIVVGKVGTAPILMDELESADAISDMIAPVCDNWDDAAKIVEGWRAQGLKIGFTNGAFDILHPGHVTYLAQARSRCDRLVVALNCDESIRRYKSKDRPINDEDSRAQVLAALGAVDLVVIFGRTKEEDDKAELIIRRINPDVLVKGADYEGKHVDGADYVQSRGGVVWLAPVVAGKSTTAIVKKLQGQG >NC_016026|2325817:2372236|2351212_2352190_+|WP_014103879.1|DBSCAN-SWA MGFESGGSSLKSHSQGGVLNRIKVSVDRNRLGEVLVYTGALTPQELRYALARQKTSGASEPLGRVLLRERMIRRQDLYRALAQQMTLRLMAASLAIALGFTMFGIKRANAASVRDLPAQVTLASVANSAFTPVNYYPNLFGANEKQSTNLKPFTKWTDMFVRFETAMGGASAKSSIKTIKAEIEPLQGLPLDAMAARVNTIANAVRYIEDKDNYGNSDYWATPVEFFARGGDCEDYAIAKYTMLRALGVPESRLRIAIVHDLQKNIPHAVLVVYTDDGALILDNQNKSVRTANSLTSRYRPIFSINRDAWWLHNKPQGTVLASAQ >NC_016026|2325817:2372236|2344150_2344654_-|WP_014103871.1|DBSCAN-SWA MTAKKAKAPQKKTSKTAATNVRALKASVETQSTEITEKIMTTTKATTANFEKLTKDASALGQEQVEALTQAGSVFAKGVEDILKTYVSISQDAAEKNAAALKTLMGCKTINDFSAAQTKLAQQQFEDFVAASTKLSELGLKVTTETLEPLNAQASKAIKKATDALAA >NC_016026|2325817:2372236|2328303_2328489_+|WP_014103853.1|DBSCAN-SWA MTQTHQIDPKLLEILVCPLTKVPLRYDAKAQELISDQAKLAYPIRDGIPIMLVDDARKIDE >NC_016026|2325817:2372236|2352762_2353197_-|WP_014103881.1|DBSCAN-SWA MKLQNIFKKAARLFGAGPQFYCYVDGFDTAIKRMDNNTFELRIRANGKNPDYPAADQDMVLRVDHDTECRRGFVDGRLFVEKNTKYKAFELVEAFDSARAYHGVVLRAKITTFGGYRTEENLRLVPRQSTITLAQKQNQPDATI >NC_016026|2325817:2372236|2325817_2326366_+|WP_014103849.1|tRNA|DBSCAN-SWA MDTAQNLQHHTDLPTTPDALFKRLDDLGILYTTWHHPAFFTVEEGLEFEKDIPGLHCRNLFVRDKRETMFLVSAANETKIDLKKLSALLDCGRLSFGSPERLWANLGVRPGSVCPYAIINDTAQAVTMVLDDTIMRATTVNFHPMVNTMTIGVAPQDLVRFIESTGHTPLILDLSATAPEGE >NC_016026|2325817:2372236|2336212_2337421_-|WP_014103864.1|DBSCAN-SWA MFGVHRKIKAWEQVGLISADQAVSITAFERDRQKGRFAGGLMGLSLFAILVGVLMVIGANWNDIPDAAKLVVHAVLNIGAAVAVWRFHARGMDVWREGAVLVLAGLTLTFMALVGQIYHLNGGAGGLLALWLIVISPFMVIYGRTMLTAIPWVLAFLGCIPVIMDQYLHDLPDIWMLFYSLAVAVWLPLAMCADGFTRMFRAFRPIWAFVVMRAGFILLTFTGMMATMLWYVDRAKELGTIARDSGMEYGTAYLVAVAVPLIALAGMAVHYALYNSRVEDRAFYRATLGYCGLCVVSVLLPFLIPSPDSSFMAGLTFIAFWIGAGWYGHLTGAMRLVTLSVTLIGIRIYVIFLEAFAGLMTTGFGLIIAGVAMLGMIWGLRRANRYVRSLSNTQSHSSGGQS >NC_016026|2325817:2372236|2364289_2365567_+|WP_014103890.1|DBSCAN-SWA MKLKFFHIALFLSVIGASTFTQAQEAQPGAACSGAGTSTWTGGPEQIPGRLLICNGSTWQAVQETASTGRTLFQTDYDSGACDSAKEGRLRYDSASNAWSYCYNNAWANFASVPTCAIGQGLTMAATGWSCCPKVESTWTGHLSAENNAWRSAVYGNGIFVAVASSGTNRVMTSPDGITWTARAAAEANVWRSVTYGNGLFVAVASSGTNRVMTSPDGITWTARTAAQANQWYSVTYGNGLFVAVSIDGTNRVMTSPDGITWTARSAAQANVWLSVTYGGGTFVAVSNGGTNRIMTSTNGTTWTSGNAPGSSLWTSITYGNGRFVAVAQASTAAMYSTNAISWAAGTLPESNNWQGLAYGNGIFVAVAANGTNRIASSMDGITWTSHAAPEATFWEDVSFGNGVFVGLSYGGTNQVMRSPVTSCN >NC_016026|2325817:2372236|2359189_2361712_+|WP_014103887.1|tail|DBSCAN-SWA MQRRFVLLALASAALFVLPQSAYAQEAQPGDACTTNGAVRSTGGPEQMPRRMLICNGTTWQSALEQTTAGASLLQIGNDTGSCTTAKLGRMRYNGTSTWEYCNGSTWAALGSGGSAAGADREIQFNSGGAFGTSSTFKLMADGDLLLAGPSTTGIASVPVSGNGTRMFFDFQTSAFRVGSVSGTQWDNANIGIFSVAMGFDATASNTFGVAMGAQTTASGVSSTAMGYNTIAAGSHSFAAGRDVNITTTGSGSFGFGLTNTAPAIKPQVSGAQSFGIFMGNQNAVNFSAANTMGLFGGKMVIDSTIPATNLVADTELEIDGTLKIGSGGEACDASREGSIQYLAASDTFQVCATAGSWTALGGGGSAAGSDREIQFNSGGLFGASASLTFDGTGLYSKAFYADATGTTYDTAVSGISDADYGVGIYGEFAHPTNLGTGVRGNASSTNGRGVQGTASASTGLTYGGYFESDSSTGVGVFGIGGGYGIRGEATSATGYAGYFFNSADGWGVYSENDMGLASGKYLNWGSTRGSTGYGIRDNAGTIECKNSGGAWAACAGSGGASLSGLTAATATNTIANANFTQTWNWDTLTTGNGLVLNSTSLTDGNILRVINSNTTGTGGPIYAQTSSTGALSYGVGGFATSTTGLSIGLYGSSTSSSGNGVRGSAGSGTGTTSGVYGSASSTSGRGVYGWAAAATGTTYGVYGTSASSAGYGGYFTNTSTGVALRAQGDLEYTGSLRDMSDIRLKDNVKPLDSSLEKITQLQGISFTMKDSNTHDTEYGFSAQDVQKIYPNLVHKANDEDGTLSMNYTGLIAPLVEAIKEQQKEIEVLKAEIEALKARE >NC_016026|2325817:2372236|2334040_2334943_+|WP_014103861.1|protease|DBSCAN-SWA MLEDRKGAHSAFDAVAGNDNQSSAPAPTTAPATRTASAPLPTPPNYFNQVVSAAPQERILILEGPITQELAITLRLHLLRMEAASPDEPITIMINSGGGLVTAGMAIYDTIQSLECPVNTYVTGMAASMASILLVAGTPGERRAAPNARIMIHQPSGGSQGNATEMGISQNEIEHTYRRMAWVYAAHAFDAAKSPAFDKKVADKMADLQKTAPTSGGTWTPDTLKMTALAHLYHAVMKQDYFLYAEEAKDMGLIDKIEYPDMNIKPTMDPKRAKILHRIAEAERLANKHHRDNRPEPSAF >NC_016026|2325817:2372236|2346453_2347287_+|WP_014103873.1|DBSCAN-SWA MTSLKDKTIVITGASRGIGAAIAIRAARDGANIAILAKSDTPHPTLEGTIHTTADTVEKAGGRALPLAVDIRDENVVAGAIQTIAATFGRIDMVVNNASAIHAAPTPHTPMSKYDLMMDVNARGTFAVVQAALPHLQKSATADSPAQILTLSPPLNLGSQWIGRCPAYSLSKYGMSLLTMGFAAEFKDWHIHANTLWPQTLIATDAVRVFFADAYNASRTPEIVAEAAYAILTGCNGRFETGQHYTDESALRLVGINDFSQYNTTPGLDPVDDIFLD >NC_016026|2325817:2372236|2330665_2331187_+|WP_014103856.1|DBSCAN-SWA MNRDVLHNEASTQSLKGAWTRHATVDGAVVGGMRGTTLSRDAFKAQTRFLPVFATAADPHNTPRYVVMAARQWQGEHADGTIRTHAVADVHVLDQPHIDRLLTRKHITQNQIFDRVGDATRAAKAQNESTFETCARDYTQGRHAGIKAWRLVQQGDAATPSPAQHPTTKSRAF >NC_016026|2325817:2372236|2363175_2364279_+|WP_148260478.1|DBSCAN-SWA MEQNTDGKSLFQVDNDTGSCTAAKEGRLRFNDSTNLWSYCRSGSWTNFTTLPTCAVGQGYVMTGSGWGCCPEGGTTWTSGSNIPGSNDLFPGALTYGNGLFVLVGESFVDSNHFRTSPDGITWTPRTAPDPGWGWRTVTYGNGTFVAIGGSWSVTDLTSSTDGITWTPRTSPANNYWAGIAHGNGLFVAVGYSGGTEVITSPDGITWTERTGIAEEWIDVTYGNGLFVAVGRTGTNRVMTSPDGITWTARTAAENNEWTSVTYGGGQFVAVAQTGTNRVMTSPDGITWTARTIAAEQWQDVTYAAGKFITVAWAAGQEIATSEDGITWTLRNTPGYEDFFQVASNSSTLVSIGLYGATMRSPIAICP >NC_016026|2325817:2372236|2347990_2348767_-|WP_014103875.1|DBSCAN-SWA MMSVRLPSLVLGACALLGVGGVLSACAVALAVQDELFDTSKINLTDSSYAAADMLAQQSRAHVNAATPVQIFPLYDVAIPGELTPFGQTVSNQLGARFVQLGYNVYTGGLPPDSVPSTIMAPQVGVGAPKPVMKGVTPSSASGMGKPGLVQMYGSYARGKDHIMISLQMIQAETGRVLSAYDYTLPATRQLREMSMTKAERDQLATGMAYSATTPPPETAAANQPAHQAKAIYPSAATTPPYAPPAPVTASQPMPLTK >NC_016026|2325817:2372236|2366682_2367762_+|WP_041794101.1|DBSCAN-SWA MAQNVATRAPIDDVTNAASPLLQERHVYKPFLYPWCYEAWLTQQRIHWIPEEVPLAEDVRDWKNKLTAAEKTLLTQIFRFFTQSDVEVNNCYMKHYSQVFGPVEVQMMLSAFSNIETVHIAAYSHLLDTIGMPEIEYSAFMKYKEMKDKFDYMQGASMASRRDIAKTMAMFGAFTEGLQLFASFAILMNFPRFNKMKGMGQIVTWSVRDETLHCLSMIKLFNAFIAENKDIWDDELKREITESCKTIVGFEDAFIDLAFEGGEIEGLTPQEVKNYIRYIGDRRLQQLNLDPVFGIEKNPLPWMDIMLNGAEHANFFENRATEYSRASTEGAWEDVFDDSVFSGNYGKKIGGSDGESSAA >NC_016026|2325817:2372236|2331803_2332298_-|WP_014103858.1|DBSCAN-SWA MKHSLIETVLGAVVLLVAAVFLIFSYSAANVGDVSGYTISANFSSVGGLRAGDDVQISGVKVGTVSAVELDPETYLAKVSMSVDPSVKVPDDSAALISSESLMGGRYLSLEPGSSEDMMKNGDRISYTQAPQNLEQLLGKFIFSMQGAPKDQNGNGAESADAHP >NC_016026|2325817:2372236|2354165_2355998_-|WP_014103883.1|DBSCAN-SWA MFDVAQVPQGNRIQIDRSRDANLTDFGKATLVDRYLLPEESFQDLFARVAMYYGDDSAHAQRIYDYISKLWFMPATPILSNGGTSRGLPISCFLNESGDSLEDIVALWNENVWLAARGGGIGSYWGNVRSIGEKVGRNGKTSGIVPFIKVQDSLTQAISQGSLRRGSAAIYLPVWHPEIEEFIEIRRPTGGDPNRKALNMHNAVVIPNSFMHAVEKDEEWALKSPKDGHVVGRVKARDLWIRILTARIETGEPYILFIDHVNDRIPAHHKMAGLSVKMSNLCSEITLPTGKDPLGNERSAVCCLSSLNLELFEQWQDHPTFIEDVMRFLDNVLTDFINKAPDSMKRAKYSAMRERSVGLGVMGYHSFLQSKMIPMESVMAKVWNRRMFQHIKRQADDASVKLAHERGACPDAADYGIMERFSNKMAIAPTASISIITGGASPGIEPNAANAYNHKTLSGSFAVKNPYLEKLLEEKGLNNDDIWSSIFTNEGSVQHLDFLSQDEKDVFKTAFEIDQRWLIEHAADRTPFICQAQSLNVFLPGNVHKRDLHRIHWEAWAKGVKSLYYCRSKSIQRAESAASWARSKEGQKPEGNREPELGEDKKYDECLACQ >NC_016026|2325817:2372236|2335634_2336216_-|WP_014103863.1|DBSCAN-SWA MIGFCACCDKISEPMRKIIMLAVLVLPLLVPGLLWVKAATDQTAGPVWTVKIAGYDPRDLLHGRYIQFRYDWMEADAGVDAACADNDPNCCLCLNGNQTFMNVTRLQCDAPDADRLCQSRIPASSATGPQKYFIPEEFAADLDRLVVSEPDRFMMEIAVPGAFGAPVIRGLSLDGMPLRDYLRQHRGTDGLQE >NC_016026|2325817:2372236|2328506_2329676_+|WP_014103854.1|DBSCAN-SWA MDRVIIFDTTLRDGEQSPGCSMNHDEKLRMAALLDQMGVDVIEAGFPIASKGDWEAVKAIAGTVKNATVAGLCRAKRGDIESAAEALAPAKSRRIHTFLSTSPLHMKHKLQMEPEAVLDAIRDSVTLARQFTDDVEWSAEDGSRTENDFLCRAVETAIAAGATTINIPDTVGYALPADYAAKFTLLLNKVPNIDKAILSVHCHNDLGLAVANSLAGVMAGARQIECTINGIGERAGNAALEEIVMAMRTRADSLPYKNNIDTTMITKLSHALSDITGFSVQPNKAIVGANAFAHESGIHQDGMLKNAQTYEIMTPESVGLNKSELVLGKHSGRHAFRARLEQLGFDLGDNALQDAFVRFKDLADQKKSVNDDDLIALVGPDAPRVEVGP >NC_016026|2325817:2372236|2337566_2338301_+|WP_014103866.1|DBSCAN-SWA MKPFIPPHRESERGSVLIYIFIAIAVLAALSFAVSRSGRESAQTINKERADLWATELFDYSNMLRRAVTTLTITGLTENDVCFHDSGWDDNSYQFSTACPTANLVFSQLGGGATFQNPNYNLLDSSYSTDPAYKKWHITGANSVAGVGTDCSPTGPCNELLAVLPFVRREACIAVNSKLGITDDLTEPPQDDAGFDLSVPFKGSYPNGESINTATLDGKRVGCFKGNGGVIDDAYVFYSVLIAR >NC_016026|2325817:2372236|2332988_2333891_-|WP_014103860.1|DBSCAN-SWA MSRVSNNLQVLIAEQNEDDRDHLAGLLYMNGYEVVTAADGGAAARIADTRNIDIAIVDQAMQPKTGFDFARHCQVKGHTFGIIMITDEPSTDLLLEISRYEIKQVLRKPVEPNRLVEVVRRVLRSHGKNPDAIGAHRVERGYSPDQLMARAIALAHQNARSRLGGPFGAVVADADGHIIGEGVNSVTSRCDPTAHAEVLAIRRATEKLGRTELKGCTLYCSSEPTMLGQALVIRAGIEKVCYALTHAEIAAMMGGDDEDRIKGEVAKPIAQRSVAYEQLQHDQAQDMVRMWQGLSGKVAD >NC_016026|2325817:2372236|2327587_2328271_+|WP_014103852.1|DBSCAN-SWA MGCDLPPTLNDLPAEIPVFPLSGVLLLPHGQLPLNIFEPRYLSMVEDALKSHRIIGMIQPRGADNAHPALFETGCAGRIVNFSETNDGRYLVTLKGVARFRVKSELDQGRNGYRRVQADWADFSSDLDAVSCLNLDRPRLRGLLESYFELHGLSCDWNAVESATDNKLITCLSMICPLDAGEKQALLEASCCKTRADLFMTILDMAVRQSGALHTSCRHGSDTSLCH >NC_016026|2325817:2372236|2356625_2357192_+|WP_014103885.1|DBSCAN-SWA MEKHGFMIAIGGLSGSGKSTLAGRLAAETGAVWLRSDSIRKELWGVDPLTKLPPEAYSRDFSTKTYETLASRMEENLRAGRIVIVDMSFAKPVERRSFANRATACGAGFHGIWLDATPDTLKTRVDARVGDVSDADSTIVTMQLGFDLGTIEWSRINTDRPADSVYRLACALLGVDGPTPIRAIKLEP >NC_016026|2325817:2372236|2338354_2340724_-|WP_014103867.1|DBSCAN-SWA MEQSENMRLTLLRTQDIAIAHGNQIMMPEHLLLALLDDPDCKTVFSALKVDVAKIKEEAEHFIVEHFDVNPRPLQDIDLEKSSAMGTILNRVYYENTTKNPGTIANSKSLLLALMRERGSNAAYLLEKHGIGSAETLDNFLTHGTTVNPKDAAATDNYGLSKRKNPASQQGEEEEDERPPLEQFAVNLNELAASGKIDPVLARQNEINQTIEVLARRKKNNPILVGEPGVGKTAVAEGLAMDIVNGNVPDQLLGATLYSLDLTAMTAGSKYRGDFEKRLKAVLTQLELTPGAILFVDEIHMLIGAGTGSDSKMDAANIMKPYLSSGRVRCVGATTYAEYSKYFEKDAAMSRRFQKIDVAEPTPAQAIEILKGLKKHYETFHGVTYTDEAIEAAVKLSVRYMTDRQLPDKAIDLLDGAAAHRIVEPRATNVIDKDEMEDTVARIKRLPKKELSGDGLEKLRTLDSDLRQAVFQQDAAIDALTDAVLLAQAGLRDLNKPKGSYLFTGPTGVGKTEMAKQLAATLGTQLVRFDMSEFQEKHTVARLVGSPPGYVGHDEGGELTNAVTKHNNCVLLLDEIEKAHPDVLKALLQVMDDGRLTDSHGKTTDFRNVILIMTSNLRDAEVKKVQGIGFHTETREEVLRADEVSQFLPPEFRNRMDAQIRFDYLKPETMGSIVDKFVKILGGQLAERNVKIDLSADARDYLATKGYNRDMGARPMGRLIQTEISTPLARQVLFGELAKGGAVLVDTEGQGQDKKLRFLFNDAAVANDNNPAAAKPGKTRRRSNEMTPK >NC_016026|2325817:2372236|2353248_2353983_-|WP_014103882.1|DBSCAN-SWA MMDAILPQDATELLQDALTQNIVSVARGDVALEQKMREAIWSHGLSVAVGRDELMARAQKMFVPGGVLEHYRAFSNDVYQVIQTTVMAMGACDDGMNRFANPGPFTVRNLVVAFNRYATSHVAASGEYAMRQDYEGTVFYLPSGVGDNDKVRDRMKEGRSIHVSTFVLPKAIDQSIHANVAMVFESLQLTFDSVGRPLAQDLRPACAIEGTKETMLRLDHVSFGGTINPCDSCSCESKKGPWPR >NC_016026|2325817:2372236|2358949_2359114_-|WP_187287634.1|DBSCAN-SWA MIRLKSDLNGPKTLRTEKVAGLLKIRENTPEKKERASLVYASREAEPVTTDASR >NC_016026|2325817:2372236|2326371_2327325_+|WP_014103850.1|DBSCAN-SWA MLFGNSKDAQKNDAATSTAANDAIFDVGTEDFEANVMRASMDTPIIVDFWAPWCGPCKQLGPTLEQAVIATKGVVRMAKVNIDEHPELAQAMRVQSIPTVFAFFGGQPITGFTGNRPASDIKKLMDQLVQLARESKPDAVNIPETLEAANQALADNNPTQAQVLYSTVLEEDENNVAAFVGLVRAFIADGDVETAAGMIENAPDTIAKNSQFSAAITAVELARQAVQAGPDNGTSDLAKAVAQAPDNHAARFDYAMALFAAGEREEAMNQLLDIIRRDRAWEDEKARKQLLQFFDAIGPADKSVAAARRQLSSILFS |
42 | Agrobacterium_phage(21.43%) | tRNA,tail,protease | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|