Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NC_021505 | Pseudomonas putida NBRC 14164, complete genome | 5 crisprs | DEDDh,csa3,cas3,DinG,WYL | 0 | 3 | 6 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NC_021505_1 | 1031276-1031370 | Orphan |
NA
Consensus repeat of NC_021505_1
|
1 spacer
spacers of NC_021505_1
>1.1|1031299|49|NC_021505|CRISPRCasFinder TGCCCAGTTCAAAGGCCTGAACGAGCAGGTTGCCGCGCTGAAGGCCGCG |
csa3 |
CRISPR arrays and Neighbor proteins around NC_021505_1
The CRISPR arrays of NC_021505_1 >merge|NC_021505|1|1031276-1031370|CRISPRCasFinder CAGGTTGATGGCGGCAAGCTGGATGCCCAGTTCAAAGGCCTGAACGAGCAGGTTGCCGCGCTGAAGGCCGCGCAGGTTGATGGCGGCAAGCTGGA >NC_021505|1|1|1031276-1031370|CRISPRCasFinder CAGGTTGATGGCGGCAAGCTGGA TGCCCAGTTCAAAGGCCTGAACGAGCAGGTTGCCGCGCTGAAGGCCGCG CAGGTTGATGGCGGCAAGCTGGA
>NC_021505.1|WP_016498087.1|1029670_1030666_+|NAD(P)-dependent-oxidoreductase MRILVTGASGFIGGRFARFALEQGLDVRVSGRRAEGVEHLVKRGAQFIPGDLGDPELARRLCQGMEAVVHCAGAVGNWGRYQDFYQGNVVVTENVVEGCLKEHVRRLVHLSSPSIYFNGRSRLDIREDQVPRRFHDHYGQTKHLAEQKVFGAQEFGLEVLALRPRFVTGAGDASIFPRLMQMQRKGRVAIIGNGLNKVDFTSVHNLNEALLSALFADDRALGQAYNISNGQPLPLWDVVNYVMRQMQLPQVTRYRSYGLAYSLAAVNEAACMLWPGRPQPTLSRLGMQVMSRDFTLDISRARQYLDYQPKVSLWTALDEFCGWWKHLPPGQ >NC_021505.1|WP_016498086.1|1028730_1029621_+|LysR-family-transcriptional-regulator-ArgP MFDYKLLAALAAVIEQGGFERAAQVLGLSQSAISQRIKLLEARVGQPVLVRATPPSPTEVGRQLLNHVQQVRLLERDLQRQVPALDEEGMPERLRIALNADSLATWWAGAVGNFCAQQNVLTDLVVEDQEVGLKRMRAGEVAACLCGSERPVAGARSLLLGAMRYRALASPGFMARHFPQGFVASRLARTPAIVYGPDDFLQHRYLASLGIEDGFLHHLCPSSEGFLRMTEAGLGWGLVPELQAREQLASGQLVEICSDTPIDVPLYWHHWRNGGQLLAQLTDHLRHTAQQWLVPL >NC_021505.1|WP_016498085.1|1028031_1028631_-|amino-acid-transporter MWQSYLNGMLVAFGLIMAIGTQNAFVLAQSLRREHHLPVAALCIVCDALLVAAGVFGLATVLAQNPTLLAVARWGGALFLIWYGAKALRSAFSKQSLQHQEGQGMRSRRAVLLSALAVTLLNPHVYLDTVLLIGSLGAQQTEPGAYVAGAASASLIWFSTLAIGAAWLAPWLARPATWRMLDLMVAVMMFAVAAQLIFN >NC_021505.1|WP_012312902.1|1027196_1027793_-|superoxide-dismutase-[Fe] MAFELPPLPYAHDALQPHISKETLEYHHDKHHNTYVVNLNNLVPGTEFEGKTLEEIVKSSSGGIFNNAAQVWNHTFYWNCLSPNGGGQPTGALAEAINAAFGSFDKFKEEFTKTSVGTFGSGWGWLVKKADGSLALASTIGAGCPLTSGDTPLLTCDVWEHAYYIDYRNLRPKYVEAFWNLVNWAFVAEQFEGKTFKA >NC_021505.1|WP_016497637.1|1025955_1026909_-|IS110-family-transposase MAMQVGKLIVGADVAKAELVIHHDDRDEIIKVKNTKPEIKKWLKQQPLNTAIAVEATNVYHLDLVELAHSLGFEVYVIDGFQLSNYRKSVGVRVKTDPTDARLLSRFLRNEGEDLRPWTPPPAVYGKLQSLLRRRAALVTARTAMTQSWANEALLKAAFTTFVKSIDRLDLLIQKKIKEVLREAGLHEQVARCQAVEGIGFLTATALVMAFMRGEFKSSDSYIAFLGMDLRVIDSGQKNGRRRLTKRGCSEIRRLLHNAAMSASRTATWKGLYEQHRNAGKATTQALVILARKLARVAFALMKNQDEYVTKGGKAPC >NC_021505.1|WP_016498084.1|1023639_1025694_-|GGDEF-and-EAL-domain-containing-protein 
MKLELRNSLSVKLLRVVLLSALAVGVVLSCAQIVYDTYKTRQAVNNDAQRILDMFRDPSTQAVYSLDREMGMQVMEGLFQDESVRMASIGHPNETMLAEKSRPLQDMSMRWLTDPILGQERTYTTQLVGRGPYSEYYGDLSITLDTSSYGEDFLINAVIIFISGVLRALAMGLVLYLVYHWLLTKPLSKIIEHLTQINPDRPSQHQIPLLKGHEKNELGLWVNTANQLLASIERNTHLRHEAENSLQRMAQYDFLTGLPNRQQLQQQLDKILVDGGRLQHRVAVLCVGLDDFKGINEQFSYQVGDQLLLALADRLRAHSGRLGALARLGGDQFALVQANIEQPYEAAELAQSILDDLEVPFDLDHHQQIRLRATIGITLFPEDGDSTEKLLQKAEQTMTLAKARSRNRYQFYIASVDSEMRRRRELEKDLREALPRNQLYLVYQPQISYRDHRVVGVEALLRWQHPELGMVPPDQFIPLAEQNGSIISIGEWVLDQACRQLREWHDQGFSELRMAVNLSTVQLHHSELPRVVNNLLQAYRLPPRSLELEVTETGLMEDISTAAQHLLSLRRSGALIAIDDFGTGYSSLSYLKSLPLDKIKIDKSFVQDLLDDDDDATIVRAIIQLGKSLGMQVIAEGVETAEQETYIVAQGCHEGQGYHYSKPLSARELTSFLKQAQRNQVSML >NC_021505.1|WP_016498083.1|1022143_1023481_-|imelysin MIRMPLASASLLAIAIALAGCGEKDDKAAAPQAQAPAAAASTTAAAPGAVDEAAGKAVVKHYAEIVYAVYSDSLSTAKALQTAVDAFLAKPNDETLKAAKEAWVAARVPYLQTEAFRFGNTIIDDWEGQVNAWPLDEGLIDYVDKSYEHALGNPAAGANIIANTEIQVGEEKVDVKDITPEKLASLNELGGSEANVATGYHAIEFLLWGQDLNGTGPGAGNRPASDYLEGQGATGGHNDRRRAYLKAVTDLLVKDLEEMVGNWAPNVADNYRATLEAEPVNDGLRKMLFGMGSLSLGELAGERMKVSLEANSPEDEQDCFSDNTHYSHFYDAKGIRNVYLGEYTRPDGTKVTGPSLSSLVAKADPAADATLKADLEATEAKIQVIVDHALKGEHYDQLIAADNAAGNQIVRDAIAALVKQTGAIEQAAGKLGIANLNPDTADHEF >NC_021505.1|WP_016498082.1|1020438_1021866_-|c-type-cytochrome MSSSLSRLSPLLLTLALAACDDAPRFTQAEPGEALSGGKATVQRSDRNAYSLPSANLSPERRLDFAVGNSFFRNPWVIAPSTTTARDGLGPLFNTNACQNCHVRDGRGHPPEPDDSNAVGMLVRLSIPDQPYLAKVIERLGVVPEPVYGTQLQDMAIPGVAPEGKVRVSYTQETVSFKDGHQVELRKPTLQITQLGYGVMHPDTRFSARVAPPMIGLGLLEAIPEADLLANEDPDDHNRDGIRGRANRVWDDAQGKTVVGRFGWKAGQPNVNQQNVHAFVGDMGLTSTLQPKDDCTPAQADCLAAPNGDGADGEKEVSDNILRLVTFYTRNLAVPARRDVNAPQVLAGKNLFYQAGCQGCHTPQFTTAADAAEPELANQVIRPYTDLLLHDMGPGLADERTEFAANGQDWRTPPLWGVGLNETVSGHSQFLHDGRARNLLEAVLWHGGEADAARNFVLTFNAEQRAALLAFLNSL >NC_021505.1|WP_016498081.1|1019353_1020418_-|hypothetical-protein 
MFRPKLLFTSLAALALGACSPQDPQAVTSAAIAKQVILPTYSRWVEADRALAASALAYCEGKEDLDKARADFLNAQKAWAELQPLLVGPLAEGNRAWQVQFWPDKKNLVGRQVEQLVNGDKPVDAASLGKSSVVVRGLSAYEYILFDSKPDIATAEQKARYCPLLVAIGEHQKVLAEEILKGWNSTDGMLSQMTKFPNQRYADSHEAIADLLRSQVTALDTLKKKLGAPMGRQSKGIAQPLQAEAWRSHNSLKSLEATLKAAETVWVGVDNQGLRGLLPSDQKALADKIDAAYATSLKLLADNQKTLGELLADDAGQQTLNQIYDSLNVVHRLHEGELAKALNIQLGFNANDGD >NC_021505.1|WP_016498080.1|1018253_1019351_-|DUF1513-domain-containing-protein MLRRQALKLGSVLLSALTLGGWSLFRNKGSEPLLLSARDDGDGKHYAVGFRLDGTEVFSTQVAQRCHAIIHHPEQPIALFVARRPGTESYLVDLRDGRLLQTVVSQPNRHFYGHAVVHKGGEWLYATENDTTDPGRGVLGVYRFEGERLVHTGEIPTHGIGPHEVAWLPDGETLIVANGGIRTEAESRVEMNLDAMQPSLVLMQRDGTLLSKETLAQQMNSVRHLAVGSDGTIAACQQFMGDADETAELLAIKRPGEPFKAFPVPERQLQAMAQYTASVAIHSELRLVALTAPRANRLFVWDLDSGAVRLDAPMPDCAGVGAVKDGFVVTSGQGRCRFYDCRKAELVGQPMNLPSGFWDNHLHLV >NC_021505.1|WP_016498089.1|1031753_1032803_-|alkene-reductase MTTLFDPITLGDLQLPNRIIMAPLTRCRADEGRVPNALMAEYYVQRASAGLILSEATSVSAMGVGYPDTPGIWNDEQVRGWNNVTKAVHAAGGRIFLQLWHVGRISHPSYLNGELPVAPSAIQPKGHVSLVRPLSDYPTPRALETEEINDIVEAYRSGAENAKAAGFDGVEIHGANGYLLDQFLQSSTNQRTDRYGGSLENRARLLLEVTDAAIEVWGANRVGVHLAPRADAHDMGDADRAETFSYVARELGKRGIAFICSREREADDSIGPLIKEAFGGLYIVNERFDKASANAALASGKADAVAFGVPFIANPDLPARLAADAPLNEARPETFYGKGPVGYIDYPRL >NC_021505.1|WP_016498090.1|1032820_1033123_-|helix-turn-helix-transcriptional-regulator MPLDLDEIIKALAHPVRREILSWLKDPATQFPDQYHSTENGVCAGQIDQRCGLSQSTVSAHLATLQRAGLISSQKIGQWHFFKRNEATIEAFLEQLRQAL >NC_021505.1|WP_016498091.1|1033273_1033855_+|DUF479-domain-containing-protein MNYLAHLHLGGPAPQQLLGSLYGDFVKGSLEGRFPPALEAAIRLHRHIDSYTDQHPLVLAALARFPRERRRFAGIVLDVFFDHCLVRDWGNYAEQPLEQFTGAFYRVLLAEPELPGRLARIAPFMAADDWLGAYGDFATLEHVFNGIARRLSRPEGMAGVMVELERLYEPLLADFREFYPQLQAFAAARMPDS >NC_021505.1|WP_016498092.1|1033987_1034776_-|1-acyl-sn-glycerol-3-phosphate-acyltransferase 
MPKLRVVARLSRLLLVLLLGMLMASLVALGERLGFKAPIERRQRWTCLFMKRLVAALPFDVRVVGELPQRPMLWVSNHVSWTDIPLLGMLLPLSFLSKAEVRHWPVAGWLAEKAGTLFIRRGGGDSQRLREQIAGQLGLARPLLIFPEGTTTSGRSLRTFHGRLLAGAIDRGVAVQPVAIQYLRDGQIDPIAPFIGDDDLVSHLMRLFAQPRGEVCIELLQPIGSVGKERAVLALQAQQAIHLALFGVEEVEAVPRRQARAA >NC_021505.1|WP_016498093.1|1034775_1035531_-|GNAT-family-N-acetyltransferase MTRIAHSGDNSTERRLQAERLVGAAALQEAQALRFKVFSAEFKAKLKGAEQGLDMDDYDVHCRHIGVRDLSTGELVATTRLLDHQAASSLGRFYSEEEFRLHGLLQLQGPILELGRTCVAPDYRNGGTIAVLWGELAEVLNEGRYSYLMGCASIPMQDGGIQAHAVMQRLRDRYLCNEHLRAEPKKPLPSLALPGNVIAEMPPLLKAYMRLGAKICGEPCWDEDFQVADVFILLKRDDLCPRYARHFKAAV >NC_021505.1|WP_016498094.1|1035688_1036576_+|hypothetical-protein MAWLQRLNDPLRHAPADTLGETYAALLERLGPVAPFELAALGGRAMATPGLAFLVGYQAALRVLWPSAPASLGALCATERRSVRPADMHTRLDGLRLSGSKDFVTAGLEAEWLLVAARSETAGAAPQLNLAVVYPGEPGVTLEPLPTLPLMPEVGHGRLLLEQATCELLAGDGWDAYVKPFRSLEDLYVLTALTAWLYGVGQESAWPQDLRLQLLGLLAGCAEGSRQCADSVSCHLLLGGLFAQFQALRGAIDAALAAGPVHWAQIWQRDQGVMTLAAAAREKRLNKAWAAAGLS >NC_021505.1|WP_016498095.1|1036764_1037847_+|serine-hydrolase MLKGLLLVLCLALATAEAEDWPDPAWQNDPATFDWQAVDAYAFPPRTAPDRSGIRTDALLIIRDGRILHERYTAPTTAATAHLTWSVSKSVLATLMGVAQGEGRFQLEDPVTRFYPAMRGHPGIRMADLLHWASGLDWQEDYEYAPLKSSVVAMLYTRGREDMAAYTAARGASASPGQRFLYSSGDSNLLAAALRGMLDAGQYPDYPWHALFTPLGIDSAVWERDRAGTYVGSSYLYLSARDLARIGLLMQRDGRWQGRQLLPKAWVAFNRTAFDHAEPVPGEATPGGHWWLNQPLAGAARPWPSAPEDTYAALGHWGQALYVLPAQKLVVVRYADDRDGSYQHDELLKRVLAAMAREGT >NC_021505.1|WP_016498096.1|1037843_1038146_+|hypothetical-protein MKRALLLLLVLLGLLLWAWQERQALADFPGILSAYSAKEYCSCRFVMGFEQAYCQGYVKQWLPLSLLEENSQQRLVTAEGLGRRNQAAWQGLREGCRLLP >NC_021505.1|WP_016498097.1|1038190_1038772_+|YceI-family-protein MFKLPRLLPALLLALCLPAHANWHLDGESSRLSFITGKNGDTAEVHRFLVLHGTVDRKGVAGLSIEMDSVSSGIPLRDEQMRDNLFEVGRFAEATVKAQIDLRPINDLADGAQIELRLPLTVTLHGQSHSYNALLLATRLDARRFQVVTLEPLLLRAEDFGLLPGLESLRKFAGLKSINPSVPVSAVLIFTAR >NC_021505.1|WP_016498098.1|1038773_1039931_+|phosphatidylserine/phosphatidylglycerophosphate/-cardiolipin-synthase-family-protein 
MPGPVFPWRDGNQFELLIDGPEFFPRMLEAIVGAEFQVDLELYLVEAGACAEAVVEVLEQAARRGVRVRCLFDDYGSLAFNSALRQRLLEAGVYLRWYNRLRWKRGLRNLYRDHRKLLLVDERWAVVGGTGVTDEFWKPGDATSEWHEVMVQIQGPVVTDWQLLFDRQWSANNRRTAWRPAEGFGLPRLPKLPAHGQGMGRVAYADARQHQDILHSLVRALNSGKQRVWLATPYFLPTWSVRRSLRRAASKGLDVRLLLTGPRTDHPSVRYAGHRYYPRLLRAGVRIFEYQPCFLHLKMAVVDDWVSVGSCNFDHWNLRFNLEANIEALDPPLTAAVVASFERDFALSQEVDLDHWHARPLWRRVKQRIWGWIDRLVVNVLDRRD |
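The record above can be sanity-checked by confirming that the repeat and spacer units tile the reported array span exactly. The following is a minimal sketch, not part of the database's own tooling: the function names are hypothetical, and the field order of the spacer header (spacer ID, start coordinate, length, contig, detection tool) and the inclusive, 1-based reading of the coordinates are assumptions inferred from the NC_021505_1 entry.

```python
# Illustrative sketch only; names are hypothetical and the header layout is
# an assumption inferred from the NC_021505_1 record.

def parse_spacer_header(header):
    """Split a header such as '>1.1|1031299|49|NC_021505|CRISPRCasFinder'.

    Assumed field order: spacer ID, start coordinate, spacer length,
    contig accession, detection tool.
    """
    spacer_id, start, length, contig, tool = header.lstrip(">").split("|")
    return {
        "id": spacer_id,
        "start": int(start),
        "length": int(length),
        "contig": contig,
        "tool": tool,
    }


def array_is_consistent(location, units):
    """Return True if the repeat/spacer units exactly fill the array span.

    `location` is 'start-end'; both coordinates are assumed inclusive.
    """
    start, end = (int(x) for x in location.split("-"))
    return end - start + 1 == sum(len(u) for u in units)


# Values copied from the NC_021505_1 record.
repeat = "CAGGTTGATGGCGGCAAGCTGGA"
spacer = "TGCCCAGTTCAAAGGCCTGAACGAGCAGGTTGCCGCGCTGAAGGCCGCG"
header = parse_spacer_header(">1.1|1031299|49|NC_021505|CRISPRCasFinder")

# Spacer length matches the length field in its header.
print(len(spacer) == header["length"])
# repeat + spacer + repeat covers the whole array location.
print(array_is_consistent("1031276-1031370", [repeat, spacer, repeat]))
# The spacer starts immediately after the first repeat.
print(1031276 + len(repeat) == header["start"])
```

For this record all three checks should hold. The same functions can be applied to the other arrays on the page by substituting their location, repeat, and spacer strings.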
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NC_021505_2 | 1964580-1964653 | Orphan |
NA
Consensus repeat of NC_021505_2
|
1 spacer
spacers of NC_021505_2
>2.1|1964604|26|NC_021505|CRISPRCasFinder CCTGCGAGAGATCACTGTCAATCCAC |
CRISPR arrays and Neighbor proteins around NC_021505_2
The CRISPR arrays of NC_021505_2 >merge|NC_021505|2|1964580-1964653|CRISPRCasFinder TCATCACCTTTACGTTAACGTAAACCTGCGAGAGATCACTGTCAATCCACTCATCACCTTTACGTAAACGTAAA >NC_021505|2|2|1964580-1964653|CRISPRCasFinder TCATCACCTTTACGTTAACGTAAA CCTGCGAGAGATCACTGTCAATCCAC TCATCACCTTTACGTAAACGTAAA
>NC_021505.1|WP_016498806.1|1963232_1964396_-|isovaleryl-CoA-dehydrogenase MHYPSLNFALGETIDMLRDQVRTFVAAELAPRAAQIDHDNLFPADMWRKFGDMGLLGITVSEEYGGAGLGYLAHVVSMEEISRASASVALSYGAHSNLCVNQINRNGTHEQKLKYLPKLISGEHIGALAMSEPNAGSDVVSMKLRAEKRGDHYVLNGSKTWITNGPDANTYVIYAKTDLDQGAHGITAFIVERDWKGFSRSNKFDKLGMRGSNTCELFFDDVEVPAENILGQLNGGVRVLMSGLDYERVVLSGGPTGIMQSCMDLVVPYIHDRKQFGQSIGEFQLIQGKIADMYTQLNASRAYLYAVAQACDRGETTRKDAAGVILYTAERATQMALEAIQILGGNGYINEFPAGRLLRDAKLYEIGAGTSEIRRMLIGRELFNETR >NC_021505.1|WP_016498805.1|1961605_1963213_-|3-methylcrotonyl-CoA-carboxylase-subunit-beta MATLHTQINPRSAEFAGNSAAMLEQVQALRGLLAQVAQGGGPKAQERHTSRGKLLPRERIDRLLDPGSPFLEIGQLAAHEVYGEDVPAAGVIAGIGRVEGVECMIVANDATVKGGSYYPLTVKKHLRAQTIAQQNRLPCIYLVDSGGANLPRQDEVFPDREHFGRIFFNQANMSAQGIPQIAVVMGSCTAGGAYVPAMADEAIMVRQQATIFLAGPPLVKAATGEVVSAEDLGGADVHCRTSGVADHYADNDEHALAIARRSVANLNWHKLGKLQRLAPVAPLYAADELYGVVPADAKQPFDVREVIARLVDGSVFDEFKALFGTTLVCGFAHLHGYPVAILANNGILFAEAAQKGAHFIELACQRGIPLLFLQNITGFMVGKKYEEGGIAKHGAKLVTAVACAQVPKFTVIIGGSFGAGNYGMCGRAYDPRFLWMWPNARIGVMGAEQAAGVLAQVKREQSERSGQPFSAEDEARLKQPILDQYEHQGHPYYSSARLWDDGVIDPAQTRDVLGLALSAALNAPIEQSRFGIFRM >NC_021505.1|WP_016498804.1|1960786_1961602_-|gamma-carboxygeranoyl-CoA-hydratase MSDFSTLEVIRDPRGFATLWLSREDKNNAFNAQMIRELIVAIDRLAEDASLRFVLLRGRGRHFSAGADLAWMQQSAQLDFNTNLDDAHELGELMYALHRLKTPTLAVVQGAAFGGALGLISCCDMAIGAEDAQLCLSEVRIGLAPAVISPFVVKAIGERSARRYALTAERFSGIRARELGLLAEVYPASELDAQVEAWVNNLLQNSPQALRATKDLLREVDDGELSPALRRYCENTIARIRVSAEGQEGLRAFLEKRRPAWQTDDKKEPRP >NC_021505.1|WP_016498803.1|1958837_1960790_-|acetyl/propionyl/methylcrotonyl-CoA-carboxylase-subunit-alpha 
MSRPVLTTLLVANRGEIACRIMRTAKAMGLTTVAVHSATDRDARHSREADIRVDLGGTKAAESYLLVDKLLAAAKASGAQAIHPGYGFLSENAGFARAIEQAGLIFLGPPATAIDAMGSKSAAKALMDAAGVPLVPGYHGEAQDLDTFRAAAERIGYPVLLKASAGGGGKGMKVVEEESQLADALASAQREAQSSFGDARMLVEKYVLKPRHVEIQVFADQHGNCLYLNERDCSIQRRHQKVVEEAPAPGLSPELRRAMGEAAVRAAQAIGYVGAGTVEFLLDARGEFFFMEMNTRLQVEHPVTEAITGLDLVAWQIRVACGEPLPITQEQVPLIGHAIEVRLYAEDPANEFLPATGTLALYRESAPGEGRRVDSGVSEGDVVSPFYDPMLGKLIAWGENREQARLRLLAMLDEFAIGGVKTNIAFLHRILAHPAFAAAELDTGFIPRHQDVLLPAPHALPAGFWEAAAEAWLQGQPGHKRDDDRSSPWGAHNGLRLGLPARSSLHLASAGQDQAVALERSAASTWQLAGEQLVHDQAGVRRQHLAIRRDGTLYLHWDGEMHAIQAFDPIAEAEAGHGHQGGLGAPMNGSIVRVLVEPGQVVEAGTALVVLEAMKMEHSIRAPHGGTVKALFCQEGDMVSEGTVLVELAE >NC_021505.1|WP_016498802.1|1957844_1958501_+|helix-turn-helix-domain-containing-protein MNMDRWIALVRKRMETLGLTQEQLAERVGVSQGSVGHWVNKRRQPKIESMNRTFVELGIPHYNVSLELRIHGLVEEVCREEGADNNLDLMQCIACFRYPVLAWSELGVEEREEPGVFEQTDYLAQGKAFWLTVENDAMSAVSGRSVPQGMRMLVDPGVEAEAGRLVIARQPGKPAIFRELAEEGGQRYLKALNGNYPALLCEEGCEILGVVVRVHGAF >NC_021505.1|WP_016498801.1|1957145_1957799_+|helix-turn-helix-domain-containing-protein MENWNAFLKRYKREHNLSQLKLAERLGMTQGGVGHWLRGTRRPTLETINEKLEKLGLVFLEAQVMVVERDILREAPGRYAVEQPVSAQALLYASFRFPVLTWADLQGPLPEAGKCHEQTDYMPAGNAFWLVVENDSMNAASGKSVPEGMRVLVDTGLQAEPGKLVIARQPGRPAVLRQLVEEGGDKMLKPLNTRYPTILCEEGCEFLGVVVRVHGAF >NC_021505.1|WP_016498800.1|1956703_1957015_-|hypothetical-protein MKPPKNDETDLDSEAARRALDYYLNPKPQRPTLDNKIWTLHESVTGDQAREHAIALLRCAAATAQETASHQHGSQRELTYALMHMVDMARALLEHKRAADPDF >NC_021505.1|WP_016498799.1|1955754_1956270_-|DUF4880-domain-containing-protein MTRLLLPLEHPTPVAEHDADHALLHALKRLPRRVQQVFLLNRLDQLDFATIAARLDLPLASIERNMDQALQAGRSRRDVLSSVAGQWYVRLQSPQVTACERIDFRRWLDADTANLQAFHETELRWRSLLAPARQLGHDGWYRQGRAALSLGGCSIALGLGVAALVLFGLWA >NC_021505.1|WP_016498798.1|1954757_1955735_+|LysR-family-transcriptional-regulator 
MTPTTTTLARDPEAKRFLNDRLDWNLLRTFLVIGQEGSISRAAARLHLSQPAVSQALRRLEEQLDSALVVRRGPRINLSKAGEEVMQIAAELYGTVSRLGPALDSPAETVTGKIRLLSISRIQSRAYDDFLAQFHSDYPQVELEIDVLRSSDVASGLLQKTASFGLSLCRTPQPRLEQRVLLEQRYAFFCGKRHRLFGRKNLTVADLQGENFVSFTSDQMGGNLSPLTVFRDQQGFTGRIVASSPSLDEILRLVGAGYGIGCLPEHIVAADVQANELWRLPPWEGVIDVNVYLLWNREQKLTQAESIFLERFQQMLMTTDPAERF >NC_021505.1|WP_016498797.1|1953291_1954602_-|MHS-family-MFS-transporter MKSNTPTPRRAAAAAFIGTTIEFYDFYIYAFAAALVLGQLFFPSENPMLSTMAAFGSFAVGFIARPFAGMVFGHLGDRLGRKKMLLVTIVLMGVATTCIGLLPTYAQAGIWAPIGLIFLRLLQGISVGGEWGGAVLMASEHAPKGRKVFFASFAQWGSPAGLLLALIAFRFITAMETEDLMSWGWRIPFLMSGLLMIVGLLIRFGVPESPEFAEVKDSDQTSDNPVREVLRNHWRNIVFAALAVTIGSGGFFFTNTFMITYVTQYQGIAKTTILDCLFVVTILQFLSQPCSAMLAERLGEGRFLKWVAALCMVVPYPMFLLVQTGNVVYMTAGIALAVVLLSALYAVIAGYMAEAFPARVRYSGISIAYQLGSGLTGGLTPMLGTFVAGQFAGQWLPLALFFSVLALMSLAGVLGLSHLRNASARPVALSTSQGIA >NC_021505.1|WP_016498807.1|1964716_1966399_+|AMP-binding-protein MSQPSYTRGRQDQALLTQTIGQAFDATVARCADAEALVSRHQGLRYSWRQLAEQVEVHARALMALGVNTGERVGIWSPNCAQWCILQLASAKVGAILVNINPAYRVGELEYVLRQSGCRWLVCADAFKTSDYHAMVQELVPELACASPGKLVSERLPDLRGVISLAANPPAGFLRWHALAEKAGQTTLEAFTARQQGLQFDQPVNIQYTSGTTGAPKGATLSHYNILNNGFMVGESLGLTACDRMVIPVPLYHCFGMVMANLGCITHGSTMIYPNDAFDAELTLRAVAEERASILYGVPTMFIAMLDHPSRAQMDLSTLRSGIMAGATCPIEVMRRVIDQMHMAEVQIAYGMTETSPVSLQTGPDDDLELRVTTVGRTQPQLENKLVDADGCIVVRGEIGELCTRGYSVMLGYWDNPQATADAIDPAGWMHSGDLAVMDEQGYVRIVGRNKDMIIRGGENIYPRELEEFFYTHPAVADAQVIGIPCSRYGEEIVAWIKLHPGHSATVEELQGWCKARIAHFKVPRHIRFVDEYPMTVTGKVQKFRMREISVAEISAVSAG >NC_021505.1|WP_016498808.1|1966598_1967459_-|RHS-repeat-associated-core-domain-containing-protein MAPSTPHTQSPQPVCFTPYGHSGAIGDAAIATGFNGVPFDIYANCYHLGQGYRAFFPGLMRFSSADSLSPFGVGGLNAYAYCKGDPVNYGDPQGTAGVPIGRTGNTPRSAVTARGQFGIGQSSREVKSRRNELRLERMIKQAQETHEQAQAQGVNLALDLLEHEEDLFKRVNQHLGVGVLHRAADKNFRVQERLSTAATRAYTDHMKDILKGQARHYADDGILNIAQSTYLSDLDRSLINAYAPFTENLSGKYSGGYEWDLRLLMAQWKIREAEKFLSGRHSHFSV >NC_021505.1|WP_115283702.1|1967874_1968066_-|hypothetical-protein MVDMLQDAGVVDSGPGTLLAHLDRDKCEFALMRLEARYSVRLRRDYLSVAEIADELYKALDGR 
>NC_021505.1|WP_016498810.1|1968209_1968371_-|hypothetical-protein MNIPIPPETPDPNIDDPSLPPPVPEEDPDELPIKPTMPPTVGDPPTQEPPVKA >NC_021505.1|WP_016498811.1|1968510_1970529_+|alpha-1,4-glucan--maltose-1-phosphate-maltosyltransferase MSRNEPFESVPTANDHPDQAISLSQALLAPRIVIEDTEPALEAGTFAAKAISGQPVAVSSKVYSDGHDRLAVMLNWRQANSRRWHCVPMQSAGNDLWLAEFTPTELGPHLFSIEAWVDPFATYSHDLEKKYNAGVEVKLELEEGRLMLGKGTELCSGALRDELEALQQRLAELDTDGQVALLLGPDVAHLISEAGHRSYLARSRELPVDVDRPAAQFASWYELFPRSITDDPQRHGTFNDVHQRLPMIRDMGFDVLYFPPIHPIGTQHRKGRNNALKAEPGDPGSPYAIGSAEGGHDAIHPQLGTREDFRRLVAAAAEHGLEIALDFAIQCSQDHPWLKEHPGWFSWRPDGTIRYAENPPKKYQDIVNVDFYAPDAVPSLWLALRDVVLGWVEEGVKTFRVDNPHTKPLPFWQWMIADVRARYPDVIFLAEAFTKPAMMARLGKVGYAQSYTYFTWRNTKQELREFYEQLNQPPWSQCYRPNFFVNTPDINPFFLHTSGRAGFLIRAALATMGSGLWGMYSGFELCESAPLPGKEEYLDSEKYQIRPRDFTQPGNIIAEIAQLNRIRRQNRALQTHLGVAFFNCWNDNILYFAKRTPERDNYILVAISLDPHNAQEAHFELPLWELGLDDNAETHGEDLMNGHRWTWHGKTQWMRIEPWHQPFGIWRIEKAR >NC_021505.1|WP_016498812.1|1970689_1974007_+|maltose-alpha-D-glucosyltransferase MAKRSRPAAFIDDPLWYKDAVIYQLHIKSFFDANNDGIGDFAGLISKLDYIAELGVNTLWLLPFYPSPRRDDGYDIAEYKAVHPDYGSLADARRFIAEAHKRGLRVITELVINHTSDQHPWFQRARHAKRGSKARDFYVWSDDEQKYDGTRIIFLDTEKSNWTWDPVAGQYFWHRFYSHQPDLNFDNPQVLNAVIKVMRFWLDLGVDGLRLDAIPYLIERDGTNNENLPETHTVLKAIRAEIDANYPDRMLLAEANQWPEDTRPYFGEGEGDECHMAFHFPLMPRMYMALAMEDRFPITDILRQTPEIPANCQWAIFLRNHDELTLEMVTDRERDYLWNYYAEDRRARINLGIRRRLAPLLQRDRRRIELLTSLLLSMPGTPTLYYGDELGMGDNIYLGDRDGVRTPMQWSPDRNGGFSRADPQRLVLPPIMDPLYGYQTVNVEAQSHDPHSLLNWTRRMLAVRKQQKAFGRGTLRTLTPSNRRILAYIREYTDADGHTEVILCVANVSRAAQAAELELSQYADKVPVEMLGGSAFPPIGQLPFLLTLPPYAFYWFLLAAHDRMPSWHIQATEGLPELTTLVLRKRMEELLEAPARDTLQTTILPQYLPKRRWFAGKEGPIDEVRLRYGVRFATATTPVLLSEIEVLSGGTANHYQLPFGLLPEDQINTALPQQLALSRVRRAHQVGLITDAFVLEPFIRAVLHACQDGLRLPCGSGAGELRFECTDLLAALGLNDESAVRYLSAEQSNSSVVIGDRVVLKLIRRVNPGIHPELEMSAYLTAAGFANISPLLAWVSRVDEQNAPHLLMIAQGYLSNQGDAWAWTQNTLERAIRDQMAPPSRDAEAHTDALLELTGFAALLGQRLGEMHLLLAAPTDDEAFRPRPSDADDSQRWGTQISAELNHALDLLAQHRDALDPDSQALVDDLQQQRDGLAQHITSLAEQAEGGLLMRVHGDLHLGQVLVVQGDAYLIDFEGEPSRPLQERRAKHSPYKDVSGVLRSFDYAAAMILRSASAVDLSGP
AQQARQRVARQYLHQSRHAFVEAYGLATAAMPHAWQQAEGERAALELFCLEKAAYEITYEAENRPSWLAVPLHGLHGLISTWGES >NC_021505.1|WP_016498813.1|1974007_1976218_+|1,4-alpha-glucan-branching-protein-GlgB MNATTRENGGLRQRDLDALARAEHADPFAVLGPHGDGQGGVFIRAFLPNALSARVLARHDGRVLAEMVQSAVPGLFTAHLDDAQAYLLQIGWAGGEQVTEDPYSFGPQLGDMDLHLFAEGNHRDLSGRFGAQPIQVEGVDGVCFSVWAPNARRVSVVGDFNNWDGRRHPMRLRHSAGVWELFVPRLGVGETYKYEVLGKDGILPLKADPLARATELPPSTASKVAGELSHAWQDQDWMAQRAQRHAYSAPLSIYELHPGSWRCELDEAGEVGRYYNWRELAERLVPYVQELGFTHIELLPIMEHPFGGSWGYQPLSMFAPTSRYGSAEDFAAFIDACHQGGIGVLLDWVPAHFPTDEHGLARFDGTALYEYDNPLEGYHQDWNTLIYNLGRNEVRGFMMASALHWLKHFHIDGLRVDAVASMLYRDYSRKAGEWVPNRHGGRENLEAIDFIRHLNGVAAHEAPGALIIAEESTAWPGVSQPTQQGGLGFAYKWNMGWMHDTLHYIQNDPVHRNHHHNEMSFGLIYAYSEHFILPISHDEVVHGKHSLIDKMPGDRWQKFANLRAYLTFMWAHPGKKLLFMGCEFGQWREWNHDSELDWYLLQYPEHQGVQRLVGDLNRLYREEPALHEQDCQPQGFQWLIGDDAQNSVYAWLRWSSSGEPVLVVANFTPVPREGYRIGVPFGERWQELLNSDAELYAGSNVGNLGAVASEALASHGQPLSLALNLPPLGVLIMKPA >NC_021505.1|WP_016498814.1|1976320_1977397_-|autotransporter-outer-membrane-beta-barrel-domain-containing-protein MKSTSNPLRFDSIFYAVSTSLLLATPVETFAYELQGDPTSPGFLQQPAMPQMSLDPVSASSLSIGTLSAFSQTMSARHGQTAPDLIASQWAQFFPTTSRSGTQPPDQLEAPSQQLTIGPDLFVRETAAGTVHRAGVFVGHNNLQSSFNGIRPLLGDKQRNAVNLSGESLGVYWSMTHEQGWHLDAVAMGSRIDVMGRGENGQRLDESGHAMTFSVEGGYPIRLGGNWVIEPQAQLINQQFFPGNQVQEETLQAFDSQPSWSGRVGAKLSGRYDVRGMPIEPYVRTNVWYDFSNPDEVKLDQVDKISSSRYSTTVELGLGLVARVTPSVALFVSADYSSDVDDNDLNGLIGSLGVRMRW >NC_021505.1|WP_016498815.1|1977716_1978511_-|endonuclease/exonuclease/phosphatase-family-protein MNPEAGSTGFASVNQAAAVKRLRVLTVNTHKGFTAFNRRFILPELREAVRSTQADIVFLQEVLGSHDRHAARYPGWPQTSQYEFLADSMWSDFAYGRNAVYPDGHHGNALLSKYPIIEHRNLDVSITGPERRGLLHCILDVPGQHQVHAICVHLSLLESHRQKQLQLLRKLLESLPADAPVIIAGDFNDWKSRGNRTLGLQPDLHEAFERHHGHLARTYPARLPLLPLDRIYLRNAESHGPRILGHKPWSHLSDHLPLAVEVRL >NC_021505.1|WP_041167663.1|1978692_1980846_-|glycogen-debranching-protein-GlgX 
MSPRTPKKTRSVAPSRIREGMPFPLGATWDGLGVNFALFSANATKVELCLFDSTGEQELERIELPEYTDEIYHGYLPDAHPGLVYGYRVYGPYEPENGHRFNPNKLLIDPYAKQLVGSLEWSEALFGYTVGHPDGDLSFDERDSAPFVPKCKVIDPAFTWGRDQRVLIPWERTIIYEAHTRGISMRHPAVPEELRGTFAGLANDELLKHIKDLGVSSIELLPIHAFVNDQHLLDKGLNNYWGYNSIAFFAPHPRYLASGKIAEFKEMVAHLHDAGLEVILDVVYNHTAEGNERGPTLSMRGIDNASYYRLMPDDKRYYINDSGTGNTLDLSHPCVLQLVTDSLRYWAGEMHVDGFRFDLATILGRYHDGYSERHGFLVACRQDPMLSQVKLIAEPWDCGPGGYQVGNFAPGWAEWNDRFRDTARAFWKGDEGQLADFAARLTASGDMFNNRGRRPYSSVNFITAHDGFTLRDLVSYNHKHNEDNDENNQDGTDNNLSWNCGAEGPTDDPDINALRMRQMRNYFATLLLAQGTPMIVAGDEFSRTQHGNNNAYCQDSEIGWVNWDLDQEGKELLAFVKRLTRLRLAYPVLRRSRFLVGDYNEAIGVKDVTWLAPDGNEMSVEQWEDPHGRCLGMLIDGRAQVSGIARPGSEATVLLIVNAHHDVVPFKLPTVPEGDYWSCLVDTDRPELRKGQHLQFDSTFEVKGRSMLLMVLQHEEE |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NC_021505_3 | 2671007-2671104 | Orphan |
NA
Consensus repeat of NC_021505_3
|
1 spacer
spacers of NC_021505_3
>3.1|2671030|52|NC_021505|CRISPRCasFinder ACAACCGGTAGTAGCTGCCGCGCAGACCGCGCCCGTCGCACAAGGTGGTCTT |
CRISPR arrays and Neighbor proteins around NC_021505_3
The CRISPR arrays of NC_021505_3 >merge|NC_021505|3|2671007-2671104|CRISPRCasFinder TCCTTTGACGACCTGCCGGACGAACAACCGGTAGTAGCTGCCGCGCAGACCGCGCCCGTCGCACAAGGTGGTCTTTCCTTTGACGACCTGCCAGACGA >NC_021505|3|3|2671007-2671104|CRISPRCasFinder TCCTTTGACGACCTGCCGGACGA ACAACCGGTAGTAGCTGCCGCGCAGACCGCGCCCGTCGCACAAGGTGGTCTT TCCTTTGACGACCTGCCAGACGA
>NC_021505.1|WP_144063131.1|2670114_2670996_+|hypothetical-protein MAQGRPGSSAQTLGQGGAQAVQMYQQQQHQRGLADYRNRLAQMQQDQLAMQQQAAQAKAQREQEYRARLADPNFLAGLSPTARMMAQLGVDPNALIRAQSADNLAQHRQAQLAQQQGQFDARQANHGSGGQPSGPRTPAPRPYIDQPIGNNQMQRYKFDPSIGDYAPWGEPFNQYSPGRKAKGGADGMVGAILNPEQPDAEAVPDASSLPGTGSMQSYMPRPQGPVGVLPMSAAGGNASQKPTTRVPNTRKPAVSGDLAAAKSAISAGKSRQAVVNRLMQAGYTAEQIKGAGI >NC_021505.1|WP_016499421.1|2669793_2669994_+|hypothetical-protein MNFADQILAADPNAASPFADDPELAAAVEADFQALLAQVLAAPPMPAVQWPANVVSLAAYKARQPE >NC_021505.1|WP_016499420.1|2669395_2669797_+|hypothetical-protein MKTNSAAVTLSQLADLVATDRSALCRNPRLPEGAPVHTGAAHRPEHAFALDALAAFAYDQTAHLTEAECRLRLALIGRTEPSRTSKRTCAKPGPFRLLDNGDGTHEMVPILEPDEFTLTEQRGLRAAVMEQSK >NC_021505.1|WP_016499419.1|2668680_2668884_-|hypothetical-protein MDKATVTADTLELILLNQQALRAGIEELSLWIKQRGSTATCDNVMIALQTMDANAEGIEQGIRVLRG >NC_021505.1|WP_016499418.1|2667976_2668528_-|recombinase-family-protein MAIIGYVRVSTGEQSVAAQKHSMRAHAIDKWFEDSGVSGAVPALERPGCAAMLAYAREGDTVVVAAVDRLGRDTIDVLSTVEALQAKAVSVISVREGFDLSTEIGRFMLAMLAAVAKLERSNIKARQMAGIEKARSEGKALGRKATVDPVEVSTWRRENSASIAQTAEQFGISTATVKRCCRV >NC_021505.1|WP_049824913.1|2666241_2667573_+|site-specific-integrase MPQRIPPVSDQAQPPKKRASSDDAARDINEPGKYPEGSVPGMYLHVKKASKVWSLKYRLHGVEGTYTIGGFPDIRHDRACELAQEARTWVAEGHKPKDMRDARIAAELAQKGETFEKVCSEWLSSRGDLAHKTLLNYRSTLDLHVLPALGQLPVHLVQYRHVKKVLVDLGQHPAAANHALVTIRAVLDYAVEAELIDDNVVARRGKGLIRKRKTVHHAAIERPDDLKEFLRRLEQQAPAGSPTTWALWMLVLLPVRPSELAKMRWQDIDMGEAEWRYTVPKTGKQHVVPLPDRAVSMLAMIREHRAQAAAATVPSPFGGIAKPEVPTGWVFASTRGAGKSICSTTMLNGIRALGYEKGELTSHGFRSTFRSLAHEKLGIEAIVLELCLGHRMPGVLGDTYARAHLLPQRREAMEKWAAYVEDLWFEVVHGVSRAQAEAALTGM >NC_021505.1|WP_016499416.1|2665689_2666070_-|RidA-family-protein MAHADIIYTPDPDAESISSDVAEFNGVLVTTQIPTHADGSLELGGIVEQSECTLQALKVALEKAGSGMDRVLHLTIYLTDMADRAAFNEVYQRFFSKPWPVRAAVGVASLAFEGMRVEVTAMAAKR >NC_021505.1|WP_016499415.1|2663483_2665484_-|U32-family-peptidase 
MSLPKNHLELLSPARDVAIAREAILHGADAIYIGGPSFGARHNACNEVGEIAELVEFARRYHARVFTTINTILHDNELEPARKLIHQLYDAGVDALIVQDLGVMELDIPPIELHASTQTDIRTLERAKFLDQAGFSQLVLARELNLQQIRAIAAETDAAIEFFIHGALCVAFSGQCNISHAQTGRSANRGDCSQACRLPYTLKDDQGRVVAFEKHLLSMKDNNQTANLADLVDAGVRSFKIEGRYKDMGYVKNITAHYRKELDAILEGRPEYARASSGRTEHFFLPDPDKTFHRGSTDYFVTDRKVDIGAFDSPTFTGLPVGVVEKVGKRDMQVVTEVPLTNGDGLNVLVKREVVGFRANIAEPRGEFEEDGQKRYRYRVEPNEMPEGLHKLRPNHPLSRNLDHNWQQALQRTSSERRVGVEWHAVLREQRLMLTLSSEEGVSVQVALDGPFGEANKPQQALDQLHDLLGQLGTTMYHASSIELDAPQAYFIPNSQLKALRREAIEALTEARIKAHPRGGRKAETTPPPVYPESHLSFLANVYNQKARDFYHRHGVQLIDAAYEAHEEHGEVPVMITKHCLRFSFNLCPKQAKGVTGVRTKVAPMQLIQGDEVLTLKFDCKPCEMHVVGKMKSHIIDLPTPGSAVAQVVGHISPEDLLKTIVRAPH >NC_021505.1|WP_016499414.1|2662141_2663467_+|sigma-54-dependent-Fis-family-transcriptional-regulator MLQPAAQRRLLIVDPCDDCHQLLPGLSSAGWDVDSCVLGAALDHPCDVGLLRLQASHLRHPDAVKDMIKRSNTEWIAVLSAEQLRMPAVGDFVCEWFFDFHTLPFDVSRVQVTLGRAFGMARLRGKGAAKVDDATHELLGESRPIRELRKLLGKLAPTESPVLIRGESGTGKELVARTLHRQSQRSDQPFVAINCGAIPEHLIQSELFGHEKGAFTGAHQRKTGRIEAAHGGTLFLDEIGDLPLELQANLLRFLQEKHIERVGGSQPIPVDARVLAATHVDLERAIEQGRFREDLYYRLNVLQVVTAPLRDRHGDLSMLASHFAHFYSLETGRRPRSFSDHALAAMGRHDWPGNVRELANRVRRGLVLAEGRQIEAQDLGLQTLDPGQQPLGTLEEYKQRAERQALCDVLNRHSDNMSVAAKVLGISRPTFYRLLHKHQIR >NC_021505.1|WP_016499413.1|2661533_2661800_-|hypothetical-protein MNAPLRVNEALLIADHAFEPFQCVAWDAPNGTGELSLAVIDRTNTRIGSKRVSSSTYSDPAQWASLIEEVRAELCEKGYDLQPWSMPK >NC_021505.1|WP_016499424.1|2674251_2674503_+|hypothetical-protein MTTEIEQAEAAVRQAEGEYRELERAAAMLRDRLPAAQQELESARAQLRRLKQEDGTAATLCQQREIDRLTAAIAAANEDRSHA >NC_021505.1|WP_016499425.1|2674495_2675056_+|hypothetical-protein MHDIEHLNNLRQSAEWHNVDTARAKLTTTVDGEYIITFDRPVVDLGPWTESAPERTVTARSAEGAEIALLHGLVKLHVCERRRVRTGVVVGWTCADIARKPLTAGEIAAFKARTNPARKIEKLQEELTEALARQAARAAAAQGAADLAERYGLAAAAPVQTSKPTGTVSGRPSKAKRASRQEVNHE >NC_021505.1|WP_016499426.1|2675048_2676038_+|hypothetical-protein 
MSDSIQVEPIDGPDEDVPVDGIAAVLTDSVGIPNDAGPDTPVPEADATAVEEYDAHAQQRDEHLSDDGQEDEQGPQGKQQQRVPLGALQEERRARQAAQEQARQLQAQLAQLQAQQDQFRAMQQQLAAQQQAQQIPAFEDDPEGHVAAKFQQIEQHIVGQQQAAVQRAQFEQAAAQVGQELQEMAPQVVAIEQEFAATHSDYHDAYAHLNAEVDRRIAQQHPHASPAEHAFAKQIALLAFVKDSQAKGLNPAQLIYGKAQELGYQAQHRAPAPARRQAPTSLSTLAADGKAPDQRGHLSASQVSNMSNEDFDALWSQMAADARTPGFGL >NC_021505.1|WP_016499428.1|2676470_2677256_-|hypothetical-protein MAGLWEPVGGVYWKVPPDDVMPFLHWATVLVFAAVAVVAVAVRINRKRSEPEWHLRQHMVEIGAAATLTYLVGMTVLTWGRIGSLGEMPLNEVGDFLAGAFGPVAFLWLVLGFLQQGYELRMQATELKNSVEQHKEMVKTTKQERDRALKALFTFDVGSFIHSPTERWVRRKIRARNEGKKALNVVLQSDPHVNDGEPLELGDMPEGHQVTVAFDFPLLEGVTTGKFWIEYDDAVGGRRKEAFWYSTSEFDLSVVRAKADD >NC_021505.1|WP_041167698.1|2677284_2677677_+|hypothetical-protein MASDDSGVDLLRELLQAVTVVLNFLGHPLEHAAMTPEQIRAFIEREYSHLVAEPRHNPDGWAFFLGAPRRGADSNRIFRAVQHSGGGPTRLKLAVTSRLKGEPVEIDFTGSEAALKELIDRELQRYSDGL >NC_021505.1|WP_016499430.1|2677689_2678721_-|hypothetical-protein MISNEDQVRATVEKLKQIADVLKSDIGPVQLTLELHERFENFRHELEIFESQLTSSHLLDLMGRANYVASRGEHLVSVTRAAKLNVRQELLDELLKQAARLSAMISVPNENGMISLRDLEEAEKDLRKQINIEQGRLTEIEIKINTLHDTASNEIQKISLAYEETRAVLDEKKAQIDELVGHASAQVVAGDYAKSSEVEKRMADMLRWGSIACMAFVVAILGVTAFKSLDAEIHWENFVIRITLALLLSVPAAYLARESAKHREQQYQHLQTSLDLKAITPFLASLPPEEQHKIKIDIASKIFAGRDFSRVGADPFPINAHELVMEIIKKLELPKGASRGPGQ >NC_021505.1|WP_144063132.1|2678857_2680240_-|hypothetical-protein MNIDTESREILRYISQDFVMLTANGEQRAYHIESGDIFGLKGFIKYCSKHYGEITIIQGDGKEERTASGAIWWSWEDPLQRVARRVVMEPTSKPEHEGNPEVFNLWYERKKTMCPPDLYATPESIEIFVKHLLYLADDDEVVVMYFLNWLAQLYQTPETKIPSAFLFYSKLGGVGKSTMFKLLAKVFGPCMVGSCSGRALTKSFDDVTEHKRLLMVNEMARSEKADGYENFKNMISEEQVSFEGKGRAAKDIKNITHYIVTTNNKDALPLMQGDRRIAVFMCNAAPKPDSYYVKLMDWMENEGPALVAGVLAQWRFPADWNPYAPVPQTAAARAMQDAAQGELYGVVKELIDQRREPFDKDIIVVGEAATKLNNMGLALTKPANITSLGKVLKVLCGEPEPLRILKRETGKSMPLNVYLIRNAEQWKAASAEQRMNHLDTGVHLFPVQDQSAESEVANHE >NC_021505.1|WP_016499433.1|2680636_2680912_-|AlpA-family-phage-regulatory-protein MGNPRIEVPAPIADALVHPTPLCQALGISRMTLDRWVEAGHFPAPIVIGQMRGSGKVSRTAFLKSEIDAWIEARKAERDAAKKLRTETLPA 
>NC_021505.1|WP_041168014.1|2681133_2681826_+|tRNA-(adenine-N(1))-methyltransferase MNEQTLSRRLERVAAHVPQGARLADIGSDHGYLPVALMLRGVIEAGVAGEVAQTPFASAQRNVRRNGLQDRLTVRLADGLAAVEPQDRISVVSICGMGGDTMCDILEAGKQRLGGVTRLVLQPNGGERELRQWLAGNGYQIVSEELLRENRFDYEIIVAEPGSVVYSVEQLYFGPVLMQEKSEAFLVKWRRMLRQKQQTLANFQRARDAVPQAKIDDFNQQVGWITQVLA >NC_021505.1|WP_016499435.1|2681827_2682088_-|DUF2790-domain-containing-protein MKRSIAVLAVAATLASFGAFADAGSTQPTSSNYEYGMPLDVAKVISITPASNAADCQVGTAHMVYVDHQGQKREIDYREMGNCSQQ |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NC_021505_4 | 3883434-3883567 | Orphan |
NA
Consensus repeat of NC_021505_4
|
2 spacers
spacers of NC_021505_4
>4.1|3883463|22|NC_021505|PILER-CR AGCCCCGGCCTTGGCGCCGGCG >4.2|3883514|28|NC_021505|PILER-CR TTCGCTTTGGGAAGCCTTGCCGCTCCCC |
CRISPR arrays and Neighbor proteins around NC_021505_4
The CRISPR arrays of NC_021505_4 >merge|NC_021505|4|3883434-3883567|PILER-CR CCGCACTTGCCTTCACCACACTTGCCCTCAGCCCCGGCCTTGGCGCCGGCGCCGCATTTACCTTCACCGCACTTGCCTTCGCTTTGGGAAGCCTTGCCGCTCCCCCCGCACTTGCCTTCACCACACTTGCCCTC >NC_021505|4|1|3883434-3883567|PILER-CR CCGCACTTGCCTTCACCACACTTGCCCTC AGCCCCGGCCTTGGCGCCGGCG CCGCATTTACCTTCACCGCACTTGCCTTC GCTTTGGGAAGCCTTGCCGCTCCCCCCG CACTTGCCTTCACCACACTTGCCCTC
>NC_021505.1|WP_016500468.1|3882299_3883133_-|DUF692-domain-containing-protein MFPAMLNIGLGLRRGLLPELLAMEAGAVDFLECAPENWIAVGGAYGKGLAQLAERFAVTCHGLSLSLGGSAPLDRHFLEQTRQFLDRYQVRLYSEHLSYCSDDGHLYDLMPIPFTDEAVHHVAARIRQAQEQLERRIAVENISYYAAPYQAMSELDFIRAVLEEADCDLLLDVNNVYVNACNHGYDAQQFLAGLPPARVTGMHVAGHYDEAPDLKVDTHGAAVKEDVWALYASACVRFGVKPTVLERDFNYPPLAELLAETARMRAVQCAAGGQADE >NC_021505.1|WP_041168093.1|3881611_3882307_-|hypothetical-protein MNETLRAQQYALARHLRDPQRHAPPPGVEARRLKVYRELFYGAIEGLLAGSFPVMRQVLGEQRWHARVRDFYASYRCQTPLFTEIAETFVDYLQRVALDAPWQLELAHYEWVEAQLYLSDAEDPAHDPGGDLLAGEPLLSCVARVLAYRWPVESIGVDYQPTAAPASPSLLLVYRDARLQVRFARLAPMAYRLLMEQGTGRERLQALGGDVQQGLVLLEALREQGVIVGTV >NC_021505.1|WP_016500466.1|3881056_3881578_+|DoxX-family-protein MKAFTLPSDWKLSTALAGLTTVDRLGSWSADVPLRVFLAWEFFESGLEKFNGSNWFADLQSSFPFPFNLLPAELNWQLSMWAELVLPLLLLLGLGTRLASLGLMVVTVVAIAAVHWPAHWSGLAELAQGYAITDQGFGNYKLPLIYLVALLPLLLKGAGRLSLDHWLGMRFGR >NC_021505.1|WP_016500465.1|3879600_3880887_+|HAMP-domain-containing-histidine-kinase MKWPRTLASRLAMIFFTGLVLAYGLSFGLQAYERYISSRSMMLSNLEQDVATSVAILDRLPAAERAAWLPRLERRTYRYRLDQGLAGEAMASSDPPMAAASIVKAIGSDYRLTFQEIPGPNAHFQAHLNLADGAPLTIDVTPAPVPVARWLPVVLLIQLAVLMLCTWLAVHLAIGPLTRLAQAVDNLDPDQPGVQLDESGPREVRYAAVAFNALQARIAAHLKERMQLLAAISHDLQTPITRMKLRVEVMDEGVEKDKFGSDLDEMEHLVREGVAYARSMDSSTEATCRVNLDAFLDSLVFDYQDSGAQVERHGSSGALLETRPHALRRVLVNLVDNALKFAGAAELQVSREGSTTIIRVLDNGPGIPGDELDEVLKPFYRVEGSRNRSTGGTGLGLAIAHQLIQAMGGRLTLSNREQGGLCAQIELS >NC_021505.1|WP_016500464.1|3878863_3879604_+|response-regulator MEHVDHILIVDDDREIRELVGNYLKKNGLRTSIVADGRQMRAFLEANSVDLIVLDIMMPGDDGLLLCRELRAGKHRNTPVLMLTARNDETDRIIGLEMGADDYLTKPFSARELLARINAVLRRTRMLPPNLTISESSRLLRFGQWRLDTTARHLLDNDGTLVALSGAEYRLLRVFLDHPQRVLSREQLLNLTQGREADLFDRSIDLLVSRLRQRLGDDAREPSCIKTVRSEGYVFSLPVQLLESPS >NC_021505.1|WP_162838442.1|3878419_3878593_-|hypothetical-protein MSQTYPSPAHRAHDRLCEQIEAFLQQGGHIQQVARGVSGSPDGVSKLQWHKARTAKP >NC_021505.1|WP_016500463.1|3877882_3878158_-|DUF2790-domain-containing-protein 
MNLKTFASASLFAMLSFGAIAAQAAAMPTQNTGAMQYHYGEHLDVKKVLSVQDDQSNACGLVNTRMDYLDSHGRPQSVEYRSYATGGCHDN >NC_021505.1|WP_041168090.1|3877356_3877872_-|thioredoxin-family-protein MLKVGSIALALSGLGLAGHLMARDSYGAMPSLSGASQWINSPPLDGPALKGKVVLVDFWTWDCINCQRSLPHVNDWARRYADQGLVVVGVHTPEYDYEHDVGTLRNKVAGLGVGYPVAVDNDHKVWNAWGNQFWPAHYFVDRKGQVRHVHFGEGDYGGQEQVIKALLDERG >NC_021505.1|WP_041167787.1|3876447_3877296_+|shikimate-dehydrogenase MSQHAILAGLIGRGIQLSRTPALHEHEGDAQALRYLYRLIDADQLQLEDSALPGLLDAAQHTGFTGLNITYPFKQSILPLLDELSDEARGIGAVNTVVLKDGKRVGHNTDCLGFAEGLRRGLPDVARRQVVQMGAGGAGSAVAHALLGEGVEQLVLFEVDAARAQALVDNLNGHFGAGRAVLGTDLAAAVAEADGLVNTTPVGMAKLPGTPLPVELLHAGLWVAEIIYFPLETELLRAARALGCRTLDGSNMAVFQAVKAFELFSGRQADAARMQAHFASFT >NC_021505.1|WP_041167786.1|3876001_3876451_+|type-II-3-dehydroquinate-dehydratase MKPLILVLNGPNLNMLGTREPAQYGRETLADLAQSCADTAHANGLEIEFRQTNHEGELIDWIHAARGRCAGIVINPGAWTHTSVAIRDALVASEVPVIEVHLSNVHKREPFRHLSFVSSIAVGVICGLGSHGYRMALSHFAELFQERTA >NC_021505.1|WP_016500470.1|3883873_3884434_-|DUF4174-domain-containing-protein MLVRSLTLATLLAVAGPLFAADSDAPLAKELGKARPLVVIAPSSADPTLRGLNKALEDPATQAAFKERNLVLYSVANMMGKREDKNLEQQTTMALIRELKLGASKGTKVILVGKDGERHMLKDDDTGEAIDPQVILKAVDELPASEKAVAAPEPVAAAPQAKDKDSKPAKPAKPAKPAAPPKPLED >NC_021505.1|WP_016500471.1|3884661_3889155_-|hypothetical-protein 
MDTTVPSDAQAAIDAFQDGIIGRRLPPWLRHAPAEQLPEIGKALANSLRCCEHVKAVLRGIEGLDSFVASALGKALDERYGLGRNPYLLRFLEGRREPVINSQPVGAHLTDVVYEEKPLLEVALRNFTAAQAQEGGQPRGNRLLLPRHGTVKPPTSIEFAGLCRELDLGERYQRHLDAVLSPTGSSERLVSQLVDATRYTMLVDAYKARHEGTLDASELNVMVAVCEKGELPRLAGDLVQARQLKLLGCRIEQVTVFVVVEQGVLFNTTRRVLLYVPGDPFSPWRAFESIDKLNRELGRRLRDKTYQRFFSRFVLRRDSQAFFAQVAERFDDLPGWAFRDLEPHLQAYPQPLFISLAQARIHQIKEDAAMIAVPVARLDREVQRQHDLRLEAEGWALLNLASFFLPGLGLALLAVTACELLGEVYHGAEAWQEGDSQEALDHLTHVATDLAVLATTVAGVGVARRVWARSAQVDAMVPARLEDGTEKLWQHDLTPFQSQAPVAASSRDALGIRRQDGQAWVEMDGHHYRVAEAGDDQWQLYPVDGHGPLLRHNGAGAWRLWSEQPARWTDKYRMFRRLGEPFNGLNDEQIDQVLLFHGLDGDDVRGLHVHAQAPSPGMIDSVERVRLDQRIRSMIGRLRNGEPVEDATVLDHARHLPGASGLTDQALAALAWTQRRTLLQHLFEALQPSDTPGSAALRRVFPGLSARTAQALVQAASSVDRMRLQSSARVALGLAEAARGSVLATRQARVFEALYLDTPQHADLARVTLGLLRYVPGGEQGVCWRLYEGCLGGPILAKTEQGQRAFDLVHLNGTFQLHGSQGTALGEAGELFEVIAPAYTEAQREAMGIGDPFAHNLRVIVAREAARHRDEISRLLGAVRPGAVRGPMRLADGRIGYPLGGGGVGGFASRGRALRATLRDLFPWLSDEQVETFADDARRSGHQIEQVLADLRNEFAVLRITLNTWVARGQGDVREDREALRQTLFNCWRRSVGVGELQINAQENLHVMFCNFRSSGLPNIPAQVSFRGVTSLSLLHLDLLEVPSSLLLAFPNLQTLDLGGNLLTRLPQPLLQITQLRHLSLTNNRIVLNTAQTATLASCTSLQSLDLSHNPLGRRFTLAGLAELRWLSLRDTQISQFPLGVFDNAQLVSVDLRDNRIRHIPEGFYQLPLWHRRRFRLNANPLGEAQTLRLQASLSSDDPALDEEQVLLRLQHAREVWGDSVAPEHRGLMLAAWDSLDSGQDAERFFRVLRQLLLSEDFRVNPRALGNRVMGVLQAMAITPELRQNLLSVANDEWGCQDGATWCLSNLELNLLVWQVEHAAKGGSERALLDLGRRLWRQDAVDMFATRWALQHGRTLEGSEVGLAFRVGLRERLDLPLQVGEMSFLAISGVLDADLAEAEAAVRDAETPEEIARSMVDREFWQAHLERSHPERFAAVDLPFRRQLETVLDDEALTEGAMIDQADAIRDAQRAARRGLMLDMTIHAMEVGPKGPAIDVR >NC_021505.1|WP_016500474.1|3893753_3896114_-|DNA-polymerase-II 
MELQQGFVLTRHWHDTPEGTCVEFWLATDQGPRQLRLAPQVSVAFIAQTHEAHARVLLANEPGVELRPLALKDFDQRPVLGLYCRQHRQLMQLEQRLRTAGVEVFEADIRPPERYLMERFITAPVQFTGQPDAQGVLCDAQLKPSPGYRPPLRLVSLDIETSERGELYSIALEGCGQRQVYMLGPANGDAAELDFDLEYCADRAALITRLNQWMARHDPDAIIGWNLVQFDLRLLHEHAKSLQVPLTLGRNGAAMTLRSHAGGGHVFADAPGRLLIDGIEALRSATWSFPSFSLENVAQTLLGEGKAIDTPYQRMDEINRRFVEDKPALAHYNLKDCELVTRILAHTRLLDFLLERASVTGLAVDRSGGSVAAFCHLYIPQMHRLGFVAPSLGSRPDEASPGGFVMDSRPGLYDSVLVLDYKSLYPSIIRTFLIDPVGLIEGLRLPDDAHSVEGFRGGRFSRTQHWLPAIVERVWQGREAAKREGNAPLSQALKIIMNAFYGVLGSSGCRFFDPRLASSITMRGHQIMRQTRSLIEACGYDVIYGDTDSTFVWLKGAHAEEDAARIGRELVAKVNQWWQAHLHETMNLQSALELQFEVHYRRFLMPTIRGTDEGSKKRYAGLVQRADGSQEMVYKGLESVRTDWSPLARQFQQELYGRVFRSEPYRDYVREYVRRTLAGEQDELLVYRKRLRRPLADYQRNVPPHVRAARLADEYNKRLGRPLQYQRGGWISYVITTAGPEPLENLQAPIDYDHYISRQLLPVADAILPFVGDDFVRLTDHQLLLF >NC_021505.1|WP_041167789.1|3896148_3897339_-|MFS-transporter MSRTSSPPLLDASSERLPLSGLLALAMTGFIAILSETLPAGLLDQIADGMHISQAMAGQWVTAYALGSLLTAIPLVTLTQGWYRRRALLLAILGFVLFNGLTALSGSNSLTLVLRFFTGAAAGLAWGLIAGHARRMVPAPLQGRAMALAMLGQPIALSLGLPIATWLGAGLGWRATFVLVTLVALLLVVWVLRSVPEYPGHAAGKRPAAMQVLRTPGVLIVLLVILTWILGHNILYTYLVPLLAAAGMAGEIGAVLMVFGLSALAGIGLVGMLVDRHLRKLVLLSLAGFALATLALGQASSWLMYLSIALWGLTYGGAPTLLQTACADAAGEGGDVAQSMLVTVWNSAIALGGIVGGALLVGSGTEAFGGVVLALIAVALLLTWAARRSGFVAGAR >NC_021505.1|WP_041168094.1|3897456_3898353_+|LysR-family-transcriptional-regulator MDSLSGFVVFNRVAETRSFVAAGQSLGITASAVGKRVARLESRLGVRLFHRSTRSITLTAEGTMFLERSRRILAEIEATEQELSQASETPRGRLRVSMPQVTRLVMPALAEFMALYPQVELDLDFSDRMVDIVGEGFDVVMRGGQPVDSRLSAKFLGHFQHRLVASPEYLRERGTPLHPRDLAAHTCLHYRFPSNGKLETWPLRQEHPEQAYDIPISMVCNHVETRVCFALNHRGITCLPDFNVGRELANGSLVSVLDDFMERRGSFYLLWPSGRQMPPKLRVFIDFMLERVFNRTGN >NC_021505.1|WP_016500477.1|3898333_3899254_-|LysR-family-transcriptional-regulator 
MSERIQALHALRAFEVASRYGSFTRAAEELALTQGAVSHHIKTLEALFGCDLFERRGPKLSLTEHGRLLSQELKVGFKIIENACALLRQDRYGLRLKAPSTLTMRWLLRALDAFKKADDNCSVQLSSVWMDIDTVDFYSEPYDCAILLASGRFPADIESFKLFDEWLIPVCQPDYMPQPQPALADLAQCEFLHPSPDRRDWRRWLARMGALEISIDQGQVFDTLDQGISAAQQGLGISVVDLVLASADLQAGRLVTPFKHAVATGDGYYMTWLKASPKARQMHKLRDFLLGQVPPLAYKDINYLYG >NC_021505.1|WP_016500478.1|3899413_3900202_+|YqcI/YcgG-family-protein MFTGYGNCYRLDALEQAVEHGCNTQHWTFKTIEHFRSILANPDFPCLFGRKAVNGETCHILFARAEQLADDIAQGLADYVGTVAPITPKQRIGSPLVVFLETAADYTLAEQQALAWKVLRGVHARDPHPWPQGMPTDPDDNGWSFCYAGMPLFINMNFPGHQQMKSRNLGHHITFVINPRANFDEVANANTESGKRIRERIRERVHHYNDGVMPDTLGFFGDTDNYEWKQYQLQETGSLNPSRCPFHAHAAHPATPDLLIEN >NC_021505.1|WP_016500479.1|3900204_3900831_+|LysE-family-translocator MNTALTFTYALTVLLLIATPGPVVALIVNTAAASGSRKAMFTAVGTNWASLVLIGAAAWIILTSAAIDKAWLSTMSLLGCLFIGYIAVGTLRDALQAPAPEAASEAPKAARGGLLQGFMVGISNPKDIIFFIAFFPQFIQITESFGKSMVVLSLLWVAIDFAVLSLYIFAIGKIASQRSNRVISLASGVALLLIAAGGLLYNLNELAA >NC_021505.1|WP_016500480.1|3900870_3901953_+|hypothetical-protein MTATDMSPPPSPLQQDYQRFLLLGSRRAPHTVHVHETGYRSGTINTDALGLRYSHCAGKRFSAAERGGASRINLLVGGSTALGIGASSDEHTVASHLSALTGEVWLSLAGCGLNASQELLMFLTHQHRLSQLGHVVVLSGLNSLAHEALSEVLGSPNNPLHAKAYQAFLNSFSEGLQPAAPPRRPSLWRRIGQALTTPAAQAPVIWPLSPPEKRLARAADSIGRTLRQWDRLLADSHATLTFILQPLLPWCRDTLPAGEQAMLAALEQQPANFDRLLDGAFDSQLHSAFFRRIKSQADPVPCYDMNGMLSSSPVFGADLFIDRLHLNDLGNNALAKVITAKLGLAQEKHAQRKVTPIKLV >NC_021505.1|WP_070100151.1|3902100_3902787_+|sel1-repeat-family-protein MRRLPPLLAALMPLAAHALEVRIDPHADLLYRQALPLLEQADNQGDDTSTLRTALGGDPELSRQGQAMAHTLPTAVALLKKSVELGHPVAQYRLALYYMTYLPAAQIPDAACPLLEASLKQGFAAPAPAIATWCRPYNASSEYRAALEAIPSMATVYAPYYPQPTTRLACSRSRPEGLQMLWGRQRDYQAEVYRLLGDLDPPHRLSLLQKAVDINGCMTAQQHLTRHP |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NC_021505_5 | 4570888-4570967 | Orphan |
NA
Consensus repeat of NC_021505_5
|
1 spacers
spacers of NC_021505_5
>5.1|4570914|28|NC_021505|CRISPRCasFinder AGCCAACTTCCACAGGGGCCCCACCTCT |
CRISPR arrays and Neighbor proteins around NC_021505_5
The CRISPR arrays of NC_021505_5 >merge|NC_021505|5|4570888-4570967|CRISPRCasFinder TTCAGGCCTGTGGTGATCCTGTGGGAAGCCAACTTCCACAGGGGCCCCACCTCTTTCAGGCCTGTGGTGATCCTGTGGGA >NC_021505|5|4|4570888-4570967|CRISPRCasFinder TTCAGGCCTGTGGTGATCCTGTGGGA AGCCAACTTCCACAGGGGCCCCACCTCT TTCAGGCCTGTGGTGATCCTGTGGGA
>NC_021505.1|WP_016501025.1|4569429_4570824_+|class-II-fumarate-hydratase MSRIETDSLGPVEVPDGAYWGAQTQRSLINFAIGKERMPLAVLHALALIKKAAARVNDRNGDLPADIARLIEQAADEVLDGQHDDQFPLVVWQTGSGTQSNMNVNEVIAGRANELAGKGRGGKAPVHPNDHVNRSQSSNDCFPTAMHIAAAQAVHEQLLPAVAELSSGLAELSARHQKLVKTGRTHMMDATPITFGQEVSAFVAQLDYAQRAIRATLPAVCELAQGGTAVGTGLNAPHGFAEAIAAELAAESGLPFVTAPNKFAALAGHEPLTSLAGALKTLAVALMKIANDLRLLGSGPRAGFAEVRLPANEPGSSIMPGKVNPTQCEALSMLACQVLGNDAAIGFAASQGHLQLNVFKPVIIHNLLQSIELLADGCRNFQLHCVAGIEPDAEQMAAHLERGLMLVTALNPHIGYDKAAEIAKKAYSEGKTLREAALELKYLTNEQFDQWVRPENMLAPGGKG >NC_021505.1|WP_016485344.1|4568750_4569275_-|DUF2059-domain-containing-protein MTRLRVLCAAVALACASGQVLAATASHNAAAEKFLTLANADKLGTPVYMQVQQMFAQRFAQTKAPAAKQPVLESYQAKANAALDNAIGWNKLKPKMVDLYTQTFTEQELKDLVKFYESPLGKKVLREMPKVTQQSAQLTQQSLEPAVPVVNKLLEDMTKELDPNAGKAAVPAKK >NC_021505.1|WP_016501024.1|4568443_4568740_-|BolA-family-transcriptional-regulator MTMQQRIEQQLGALAPQHLQVLDESHMHSRGQETHYKAVIVSEQFAGLNSVKRHQKVYATMGELMGQIHALAIHTYTAEEWAKVGVAPASPVCAGGGH >NC_021505.1|WP_016501023.1|4567290_4568223_-|rhodanese-related-sulfurtransferase MSQPIVVAALYKFVTLQDYVELREPLLKAMLDNDVKGTLLLAHEGINGTVSATREGIDGLLAWLRNDPRLADVDHKESYCDEQPFYRTKVKLKKEIVTLGVPGVDPNQAVGTYVEPKDWNALISDPEVLLIDTRNDYEVAIGTFKGAMDPKTETFREFPEYIKANFDPSKHKKVAMFCTGGIRCEKASSYMLGEGFEAVYHLKGGVLKYFEEVPQEESLWDGDCFVFDNRVTVRHDLSEGEYDQCHACRHPINAEERASEHYSPGVSCPHCWDTLSEKTRRSAIDRQKQIELAKARNLPHPIGYNYKAEA >NC_021505.1|WP_016501022.1|4566654_4567287_-|DsbA-family-protein MSARLIYVMDPMCSWCWGFAPVAAALIAQAREAGVVTRVVPGGLRTGGSALDSSTRKYILEHWQAVADATGQPFRFEGAMPEGFVYDTEPACRALVTARELDAERVWPLLALIQRSFYEQGVDVTTAPQLVELASQCGFDRTVFAETFARADTRAATAADFSWAQDLGIAGFPTLLAERNGQLALLTNGYQPLDRLQPLLGRWLQQAACA >NC_021505.1|WP_016501021.1|4564829_4566662_-|ABC-transporter-ATP-binding-protein 
MLDVPGSPDPVPGKPHAAGDRLSWAEIRRLALHHKKNLWSANCIAVLAACCSVPIPLLLPLLVDEVLLGHGDAALKWMNHLLPSNWQVAAGYIGLMLALTLCLRLAALAFNVIQAKLFAGLAKDIVYRLRIRLIERLKRISLKEYESLGSGTVTTHLVTDLDTLDKFVGETLSRFLVAMLTLTGTAAILIWMHWQLALLILLFNPLVIYFTVQLGKRVKHLKKLENDSTARFTQALTETLDAIQEIRASNRQGYFLGRLGLRAREVRDYAVDSQWKSDASGRASGLLFQFGIDIFRAAAMLTVLFSDLSIGQMLAVFSYLWFMIGPVEQLLNLQYAYYAAGGALSRLNELLARDDEPQYPAASDPFAGRETVGIEVRDLRFAYADEPVLENLDLSIAPGEKVAIVGASGGGKSTLVQLLLGLYSAQAGTIRFGGASLQEIGLETLRENVAVVLQHPSLFNDTVRANLTMGRDSSDDACWQALRIAQLDATIAALPQGLDSVVGRSGVRLSGGQRQRLAIARMVLAEPKVVILDEATSALDAATEYNLHLALARFLSGRTTLIIAHRLSAVKQADRVLVFDGGHVAEDGDHQQLIAEGGLYAKLYGHLQQS >NC_021505.1|WP_041167837.1|4562175_4564632_-|EAL-domain-containing-protein MKGHRTLEAPKLIGITWPFIAVVVFQVALGSLSLYTLSAVRAYVAGESLWSKAQKDAIYYLNLYADTRDESTYQRYRQAITVPQGDHQLREVLDQPSPDLAAARQAVLQGGNHPDDVERIIWFYRNFRNISYMHTAIDYWDIGDDYLAKLDVLAGEMRQSFASGPVDAQAVNAWKARIVTINEGVTPAAKAFSDALGEGSRMLLRVLLITNLLTAMFLIAIAWRRSSKLLVQRQAFATALQAEKERAQTTLQAIGDAVITADVDGCIGYMNPAAEQLTHWQSGQAQGLPLSALFSLVDEQAEEDGRSLVEQVLSGSLKGGAEHARLIQRLDGSTVSINLVGSPIVSDGQLSGIVLVLHDMTQERQYIANLSWQATHDALTGLANRREFEYRLEQALNGLARQAGRHSLMFLDLDQFKLVNDTCGHAAGDELLRHICAVLQSGLREGDTLARLGGDEFGVLLESCPPDQAERIAEQLRQMVQSLHFVWKGRPFVTTVSIGLVHIAQTPGTLETSLRAADMACYMAKEKGRNRVQVYHADDSELSMRFGEMAWIQRLHVALEENRFCLYAQEIAPLKTFEGPGHIEILLRLHDESGRTILPSSFIPAAERYGLMTALDRWVVRNVFQVVRQCLDEGREGPLSICAINLSGSSIGDDKFLDYLQRLFVEYAIPPRMICFEITETSAIANLGSAIRFINELKGLGCRFSLDDFCAGMSSFAYLKHLPVDYLKIDGSFVKDMLDDPVNRAMVEVINHIGHVMGKRTIAEFVETPLIEQALQEIGVDYAQGYLIERPQVFTCDSLQRQRIATRPLLQRAPGTFR >NC_021505.1|WP_016501019.1|4561359_4562145_-|hypothetical-protein MIDAFVRIGPLMDPASYPQWAQQLIEDCRESKRRVVEHEFYERLRDGQLKQSTIRQYLIGGWPVVEQFSLYMAHNLTKTRYGRHQGEDMARRWLMRNIRVELNHADYWVNWCQAHGVHLHELQAQEVPPELNGLNDWCWRVCATENLAISMAATNYAIEGATGEWSAVVCSTDTYAQGFPEDQRKRAMKWLKMHAQYDDAHPWEALEIICTLAGENPTLGLRTELRRAICKSYDCMYLFLERCMQLEGRQQGRLRPALAAG >NC_021505.1|WP_041168116.1|4560494_4561235_+|YciK-family-oxidoreductase 
MFDYTARHDLLQGRVILVTGAGRGIGAAAAKAYAAVGATVLLLGKTEANLNEVYDEIEAAGHPQPVVIPFNLETALPHQYDELAVMIEDQFGRLDGLLNNASIIGPRTPLEQLSGDNFMRVMHINVDATFMLTSTLLPLLKLSEDASVVFTSSSVGRKGRAYWGAYGVSKFATEGLMQTLADELEGVAPVRSNSINPGATRTAMRAQAYPSENPQNNPLPEEIMPVYLYLMGPDSKAVNGQALNAQ >NC_021505.1|WP_016501017.1|4559732_4560404_+|N-acetylmuramic-acid-6-phosphate-phosphatase-MupP MRLRAVLFDMDGTLLDTAPDFIAICQAMLAERGLPAIDDNLIRGVISGGARAMVATTFAMDPEAEGFEALRLEFLERYQRDCAVHSKLFDGMAELLADIEKGNLLWGVVTNKPVRFAEPIMQRLGLAERSALLICPDHVKNSKPDPEPLTLACTTLGLDPATVLFVGDDLRDIESGRDAGTRTAAVRYGYIHPEDNPNNWGADVVVDHPLELRKVIDSALCGC >NC_021505.1|WP_016501026.1|4571001_4571964_-|DMT-family-transporter MHTTSGRWSYGLFLALLTALLWGILPIKLKQVLQVVDPITVTWYRLLVSGGLLFAWLAARRRLPSFTRLAPKGKGLVVVAVLGLMGNYVLYLIGLNLLSPGTAQLVVQVGPVLLLVASVFVFRERFSLGQGVGLVILLAGFGLFFNQRLEELLTSLGTYTTGVLTILLATSIWVFYALSQKQLLTVWHSQQVMMVIYLSCAALLTPWVHPLEALQLTPVQGWLLLACCLNTLVAYGAFAEALAHWEASRVSATLALTPLVTFVAVALAALVWPEYVHAEDINALGYVGAVTVVSGSALVALGPSLVASWRARRARLAQVQ >NC_021505.1|WP_016501027.1|4572216_4573710_-|polyphosphate:AMP-phosphotransferase MFESAEIGHSIDKEAYDAEVPALREALLEAQYELKQQARFPVIVLINGIEGAGKGETVKLLNEWMDPRMIDVLTFDQQTDEELARPPAWRYWRALPPKGRMGVFFGNWYSQMLQGRVHGVFKDAVLDQAIMGAERLEQMLCDEGALIIKFWFHLSKKQMKARLKSLKDDPLHSWKISPLDWQQSQTYDRFVRFGERVLRRTSRDYAPWHIVEGVDPNYRSLAVGRILLDSLQAALANNPKGKHQGNVAPLGRSIDDRSLLGALDMTLRLDKADYQEQLVTEQARLAGLLRDKRMRRHALVAVFEGNDAAGKGSAIRRVAAALDPRQYRIVPIAAPTEEERAQPYLWRFWRHIPARGKFTIFDRSWYGRVLVERVEGFCSPADWMRAYSEINDFEEQLVNAGVVVVKFWLAIDQQTQLERFQEREQIPFKRYKITEDDWRNRDKWDEYAQAVGDMVDRTSSEIAPWTLVEANDKRWARVKVLRTINQALEAAFAKHKK >NC_021505.1|WP_016501028.1|4573846_4575811_+|bifunctional-tRNA-(5-methylaminomethyl-2-thiouridine)(34)-methyltransferase-MnmD/FAD-dependent-5-carboxymethylaminomethyl-2-thiouridine(34)-oxidoreductase-MnmC 
MSTLLQHAQIDWDDQGRPHSRQYDDVYFAVNEGIEETKHVFLGQTRLAERFANLTPHSCMVIGETGFGTGMNFFCAWQLFDQHAHSDARLHFVSVEKYPLGHDDMARAVRLWPELAAYTEPLLEQYVAVHPGFQQFTFANGRVTLTLLIGDVLEQLPQLDAQIDVWFLDGFAPAKNPDMWTPELFAQLARLSHPGTVLGTFTTTGWVRRSLVEAGFAMKKVPGIGKKWEVMSGAYVGPLPAPGAPWYARPAPSQGPREALVIGAGLAGSTTAASLARRGWQVTVLERHEAPAQEASGNPQGVLYLKLSAHGTALSQMILSGFGYTRRQLERLQRGRDWDACGVLQLAFDNKEAERQGKLAAAFDHDLLHALERADAEAIAGVALPAGGLFYPEGGWVHPPALCQQQLQHPGTRLVTHQEVLELRKVDQQWQAWAGDCLIASAPVVILAGAAEVRRFEPCAQLPLKRIRGQITRLPATAGSRALRTVVCAEGYVAPPRGDEHTLGASFDFHSEDLAPTLAEHQGNLALLDEISVDLAQRLGTAELAPEQLQGRAAFRCTSPDYLPIVGPVADAQAFAEAYAVLGRDARQVPDVACPWLDGLYVNSGHGSRGLITAPLSGELVAAWVCGEPLPLPRAVAEACHPNRFALRKLIRGK >NC_021505.1|WP_016501029.1|4576039_4577827_+|N-acetylglutaminylglutamine-amidotransferase MCGLAGELRFTPIDQAPRPADLAAVERITHHLAPRGPDAWGFHSQGPIALGHRRLKIMDLSDGSAQPMVDNTLGLSLAFNGAIYNFPELRQELQDLGYSFWSDGDTEVLLKGYHAWGAALLPKLNGMFALAIWERDNQRLFLARDRLGVKPLYLSRNSERLRFASTLPALLKGGDIDPMLDPVALNHYLNFHAVVPAPRTLLANVQKLEPGTWMRIDRHGEVERQTWWQLKYGANPDERELDLEGWTTRVLDATRDAVAIRQRAAVDVGVLLSGGVDSSLLVGLLREAGVDDLSTFSIGFEDAGGERGDEFQYSDLIAKHYGTRHHQLRIAEHEIIDQLPAAFRAMSEPMVSHDCIAFYLLSREVAKHCKGVQSGQGADELFAGYHWYPQVDGADDAFAAYRDAFFDRSHAEYRDTVQAPWALETDAAGDFVREHFARPGARDAVDKALRLDSTVMLVDDPVKRVDNMTMAWGLEARTPFLDYRLVELSARIPARFKLPDGGKQVLKQAARRVIPHEVIDRKKGYFPVPGLKHLEGATLGWVRELLTDPSQDRGLFNPAMLDRLLSNPHGQLTPLRGSKLWQLAALNLWLSEQGI >NC_021505.1|WP_016501030.1|4577830_4579576_+|N-acetylglutaminylglutamine-synthetase 
MKAHEIAYGQRLLRGQAPSYERLQARLAGDGSLPHDQPRAVHCGWGRLLIGHTYPDPASLAEALLDESPGERDIALYVAAPQQLLAQAPQQLFLDPSDTLRLWFTDYRPAQRVFRGFRVRRAQNPADWQAINTLYQARGMLPVDAELLTPRHLGGPVYWLAEDEDSGAVIGSVMGLNHAKAFDDPEHGSSLWCLAVDPHCTRPGVGEVLVRHLIEHFMSRGLAYLDLSVLHDNRQAKRLYEKLGFRNLPTFAVKRKNGINEQLFLGPGPQADLNPYARIIVDEARRRGIEVQVDDAAGGLFTLSLGGRRIRCRESLSDLTSAVTMTLCQDKRLTQHALGNAGLQVPAQQLAGNADDNLAFLDEHGAVVVKPVDGEQGQGVAVNLTCIDDITRAVAHARQFDSRVLLESFHAGLDLRIVVIGYEVVAAAIRHPAQVLGDGKHSVRQLIEAQSRRRQAATGGESRIPLDDETERTLRAAGFGYDDVLPASQRLAVRRTANLHTGGTLEDVTERLHPVLADAAVRAARALEIPVVGLDFMVRDAGQPEYVIIEANERAGLANHEPQPTAERFIDLLFPHSRPLT >NC_021505.1|WP_003252630.1|4579700_4580885_+|osmoprotectant-NAGGN-system-M42-family-peptidase MSERLPEPDLDYLKRVLLEMLAIPSPTGFTDTIVRYVAERLDELGIPFELTRRGTIRATLKGRQTSPDRAVSAHLDTIGASVRQLQDNGRLALAPVGCWSSRFAEGSRVSVFTDTGVFRGSVLPLMASGHAFNTAIDQMPVSWEHVEVRLDAYCATRADCEALGISIGDFVAFDPLPEFTESGHISARHLDDKAGVAALLAALKAVVESGRQPLIDCHPLFTITEETGSGAAGALPWDVSEFVGIDIAPVAPGQASSEHAVSVAMQDSSGPYDYHLSRHLLKLAGDHDLPVRRDLFRYYFSDAHSAVTAGHDIRTALVAFGCDATHGYERTHIDSLAALSRLLSAYLLSPPVFASDSQPANASLERFSHQLEHDAQMESDTRVPAVDSLVGNKG >NC_021505.1|WP_016501031.1|4580969_4581197_+|YheU-family-protein MLIPYDQLQAETLTLLIEDFVTRDGTDNGDDTPLETRVLRVRQALAKGQAFILFDPESQQCQLLAKHDVPRELLD >NC_021505.1|WP_016501032.1|4581249_4581429_-|carbon-storage-regulator-CsrA MLVIGREVGEIIVIDDNIRIMVVDVREGVVRFGVDAPRSVQVHRAEVYKRIKEAKQGEA >NC_021505.1|WP_003252624.1|4581719_4581872_+|DUF3309-domain-containing-protein MTTILIIILILLLIGGLPVFPHSRSWGYGPSGIIGVVLVILLVLLLLGMI >NC_021505.1|WP_016501033.1|4582025_4582835_-|SDR-family-oxidoreductase MHNRIMITGAGSGLGREIALRWAREGWRLALADVNENGLRETLELARAAGGEGFIQRCDVRDYSQLTALAQACTEQFGGIDVIVNNAGVASGGFFAELSLEDWDWQIAVNLMGVVKGCKAFLPLLERSKGRIINVASMAALMQGPGMSNYNVAKAGVLALSESLLVELRQLEVSVHVVCPSFFQTNLLDSFRGPNPAMKAQVGKLLEGSPISAADIAGYIHQQVAAGEFLILPHEAGRQAWQLKCQAPERLYDEMADMAVKMRAKAPSR |
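The coordinates reported for NC_021505_5 can be cross-checked against its repeat and spacer sequences. The sketch below assumes 1-based inclusive genome coordinates and an array laid out as repeat, spacer, repeat; the function name `spacer_coordinates` is hypothetical and is not part of CRISPRCasFinder.

```python
# Illustrative check only; coordinates and sequences are copied from the
# NC_021505_5 entry, and the function name is hypothetical.

def spacer_coordinates(array_start, units):
    """Walk an array given as [repeat, spacer, repeat, ...].

    Returns (spacer_starts, array_end) in 1-based inclusive genome
    coordinates, the convention assumed for the table above.
    """
    starts = []
    pos = array_start
    for index, unit in enumerate(units):
        if index % 2 == 1:  # odd list positions hold spacers
            starts.append(pos)
        pos += len(unit)
    return starts, pos - 1


REPEAT = "TTCAGGCCTGTGGTGATCCTGTGGGA"
SPACER = "AGCCAACTTCCACAGGGGCCCCACCTCT"

starts, end = spacer_coordinates(4570888, [REPEAT, SPACER, REPEAT])
print(starts, end)
```

Under these assumptions the computed spacer start and array end should agree with the `5.1|4570914|28` spacer header and the `4570888-4570967` array location.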
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NC_021505_4 | 4.1|3883463|22|NC_021505|PILER-CR | 3883463-3883484 | 22 | NZ_AP021845 | Azospira sp. I09 plasmid pAZI09, complete sequence | 80896-80917 | 2 | 0.909 |
NC_021505_4 | 4.1|3883463|22|NC_021505|PILER-CR | 3883463-3883484 | 22 | NZ_CP022367 | Azospirillum sp. TSH58 plasmid TSH58_p02, complete sequence | 861669-861690 | 2 | 0.909 |
NC_021505_4 | 4.1|3883463|22|NC_021505|PILER-CR | 3883463-3883484 | 22 | MH001451 | Mycobacterium phage Nairb, complete genome | 15429-15450 | 2 | 0.909 |
NC_021505_4 | 4.1|3883463|22|NC_021505|PILER-CR | 3883463-3883484 | 22 | MF155936 | Mycobacterium phage ZenTime222, complete genome | 15429-15450 | 2 | 0.909 |
NC_021505_4 | 4.1|3883463|22|NC_021505|PILER-CR | 3883463-3883484 | 22 | MK494089 | Mycobacterium phage Ibrahim, complete genome | 15429-15450 | 2 | 0.909 |
NC_021505_4 | 4.1|3883463|22|NC_021505|PILER-CR | 3883463-3883484 | 22 | NC_024135 | Mycobacterium phage Bernal13, complete genome | 15429-15450 | 2 | 0.909 |
NC_021505_4 | 4.1|3883463|22|NC_021505|PILER-CR | 3883463-3883484 | 22 | KM591905 | Mycobacterium phage RonRayGun, complete genome | 15429-15450 | 2 | 0.909 |
NC_021505_4 | 4.1|3883463|22|NC_021505|PILER-CR | 3883463-3883484 | 22 | MN735432 | Mycobacteriophage Whitty, complete genome | 15429-15450 | 2 | 0.909 |
NC_021505_4 | 4.1|3883463|22|NC_021505|PILER-CR | 3883463-3883484 | 22 | NZ_CP025613 | Niveispirillum cyanobacteriorum strain TH16 plasmid unnamed1, complete sequence | 126064-126085 | 3 | 0.864 |
NC_021505_2 | 2.1|1964604|26|NC_021505|CRISPRCasFinder | 1964604-1964629 | 26 | NZ_CP017076 | Novosphingobium resinovorum strain SA1 plasmid pSA1, complete sequence | 858639-858664 | 5 | 0.808 |
NC_021505_4 | 4.2|3883514|28|NC_021505|PILER-CR | 3883514-3883541 | 28 | NZ_CP039913 | Agrobacterium tumefaciens strain CFBP6625 plasmid pAtCFBP6625b, complete sequence | 23157-23184 | 6 | 0.786 |
1. spacer 4.1|3883463|22|NC_021505|PILER-CR matches NZ_AP021845 (Azospira sp. I09 plasmid pAZI09, complete sequence) position: 80896-80917, mismatch: 2, identity: 0.909
agccccggccttggcgccggcg CRISPR spacer ggccccggccatggcgccggcg Protospacer .********* ***********
2. spacer 4.1|3883463|22|NC_021505|PILER-CR matches NZ_CP022367 (Azospirillum sp. TSH58 plasmid TSH58_p02, complete sequence) position: 861669-861690, mismatch: 2, identity: 0.909
agccccggccttggcgccggcg CRISPR spacer ggccccggccttcgcgccggcg Protospacer .*********** *********
3. spacer 4.1|3883463|22|NC_021505|PILER-CR matches MH001451 (Mycobacterium phage Nairb, complete genome) position: 15429-15450, mismatch: 2, identity: 0.909
agccccggccttggcgccggcg CRISPR spacer ggccccggcctcggcgccggcg Protospacer .**********.**********
4. spacer 4.1|3883463|22|NC_021505|PILER-CR matches MF155936 (Mycobacterium phage ZenTime222, complete genome) position: 15429-15450, mismatch: 2, identity: 0.909
agccccggccttggcgccggcg CRISPR spacer ggccccggcctcggcgccggcg Protospacer .**********.**********
5. spacer 4.1|3883463|22|NC_021505|PILER-CR matches MK494089 (Mycobacterium phage Ibrahim, complete genome) position: 15429-15450, mismatch: 2, identity: 0.909
agccccggccttggcgccggcg CRISPR spacer ggccccggcctcggcgccggcg Protospacer .**********.**********
6. spacer 4.1|3883463|22|NC_021505|PILER-CR matches NC_024135 (Mycobacterium phage Bernal13, complete genome) position: 15429-15450, mismatch: 2, identity: 0.909
agccccggccttggcgccggcg CRISPR spacer ggccccggcctcggcgccggcg Protospacer .**********.**********
7. spacer 4.1|3883463|22|NC_021505|PILER-CR matches KM591905 (Mycobacterium phage RonRayGun, complete genome) position: 15429-15450, mismatch: 2, identity: 0.909
agccccggccttggcgccggcg CRISPR spacer ggccccggcctcggcgccggcg Protospacer .**********.**********
8. spacer 4.1|3883463|22|NC_021505|PILER-CR matches MN735432 (Mycobacteriophage Whitty, complete genome) position: 15429-15450, mismatch: 2, identity: 0.909
agccccggccttggcgccggcg CRISPR spacer ggccccggcctcggcgccggcg Protospacer .**********.**********
9. spacer 4.1|3883463|22|NC_021505|PILER-CR matches NZ_CP025613 (Niveispirillum cyanobacteriorum strain TH16 plasmid unnamed1, complete sequence) position: 126064-126085, mismatch: 3, identity: 0.864
agccccggccttggcgccggcg CRISPR spacer cagcccggccttggcgccggcg Protospacer . *******************
10. spacer 2.1|1964604|26|NC_021505|CRISPRCasFinder matches NZ_CP017076 (Novosphingobium resinovorum strain SA1 plasmid pSA1, complete sequence) position: 858639-858664, mismatch: 5, identity: 0.808
cctgcgagagatcactgtcaatccac CRISPR spacer gctgcgggcgatcactgtcaatccgg Protospacer *****.* ***************.
11. spacer 4.2|3883514|28|NC_021505|PILER-CR matches NZ_CP039913 (Agrobacterium tumefaciens strain CFBP6625 plasmid pAtCFBP6625b, complete sequence) position: 23157-23184, mismatch: 6, identity: 0.786
ttcgctttgggaagccttgccgctcccc CRISPR spacer ttcgctttgggaaagcttgccgcgatct Protospacer *************. ******** .*.
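The Mismatch and Identity values listed for each hit appear consistent with a simple ungapped comparison, where identity is (spacer length - mismatches) / spacer length, rounded to three decimals. The following sketch illustrates that reading; it is an assumption about how the values relate, not the database's documented procedure, and `mismatch_identity` is a hypothetical helper name.

```python
# Illustrative only: assumes an ungapped, equal-length comparison, which is
# an inference from the listed values rather than a documented method.

def mismatch_identity(spacer, protospacer):
    """Return (mismatch_count, identity) for two equal-length sequences."""
    if len(spacer) != len(protospacer):
        raise ValueError("sequences must have the same length")
    pairs = zip(spacer.lower(), protospacer.lower())
    mismatches = sum(a != b for a, b in pairs)
    identity = round((len(spacer) - mismatches) / len(spacer), 3)
    return mismatches, identity


# Hit 1: spacer 4.1 against NZ_AP021845
print(mismatch_identity("agccccggccttggcgccggcg", "ggccccggccatggcgccggcg"))

# Hit 10: spacer 2.1 against NZ_CP017076
print(mismatch_identity("cctgcgagagatcactgtcaatccac", "gctgcgggcgatcactgtcaatccgg"))
```

Under this assumption the two calls should reproduce the listed values for hits 1 and 10 (2 mismatches at 0.909, and 5 mismatches at 0.808).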
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
371081 : 421979
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NC_021505|371081:421979|DBSCAN-SWA CTCAACCGTGGTTCTTGCCGAGCAACGCGTGATAAAGCTCGGTATCCCCAAGAATCCCCACCACCCTGTCGTTATCCTGCAACACCAGCTTGTTGCCGGTCTGATAACGAATCTGCAGCGCCTCGCGCATGCCGATGTCGGCGTGCACCACGGTCGGCCTGCGCTCCAGCAGCTCCACATCCTGCCCCGGTGCCCAGTTCTGCATGTCCAGCCCGTTCTGGCCCTGGCGTGCACGCTTGAGCGCGCCACCTTCGCCCAGGTCCAGCCACGAATCGATACCCGGGTCCAGGCACACCGAGCCATTGACCCGCTTGCAGTTGTCCAGGCTGCGCATCAGGCTGCGGCCGCACAGCACGTTCAGCGGGTTGGTGTGGGCGACGAAGGTACGCACATACTCGTCGGCCGGGTTGAGCACGATCTCCTCCGGCTTGCTGTACTGGATGATCCGCCCGTCCTTCATGATCGCAATGCGGGTACCCAGCTTCAGTGCCTCGTCCAGGTCGTGGCTGACGAACACGATGGTCTTGCTCAGCTTGGCCTGCAGGCCCAGCAGCTCGTCTTGCAGGCCTTGGCGGATCAGCGGGTCCAGTGCCGAGAACGGTTCGTCCATCAGCAGGATGTCGGCATCCATCGCCAGCGCCCGGGCCAGGCCCACGCGCTGCTGCATGCCGCCAGACAACTCGTCCGGCTTCTTGTTGCGCCACTGGGTCAGGCCCACCAGCTCGAGCTTCTCGTCCACCAGCTTGCGCCGTTCCTTTTCAGGGCGGCCCTGCATTTCCAGGCCAAAGCTGATGTTCTCGCGCACCGTCAGCCAAGGCATCAGGGCGAACTTCTGGAACACCATGGCAATGCGCTTGGTGCGCATCATTTTCAGCTCGGCCGGGCTGCAGTGGGCAATGTCGATGTGCTTGCCTTCGTGCTCGACAAACAGCTTGCCGCGGCTGACGGTGTTCAGGCCGTTGATGCAGCGCAGCAGGCTCGATTTGCCCGAGCCGGACAAGCCCATCAGCACGCAGATCTCACCCTTGTTGATATCAAGGTTGGCCTTTTCGACGCCGACCACCAGGCCGGTCTGCTTGAGGATCTGTTCGCGGGTCTTGCCCTGGTCCAGCAGCGACAGTGCCTCGCGCGGCTTGTTGGAGAAAATGACGTCGACGTCTTCGAAACGAATGATGCTCATGCCTCACCCCTTACCGGCAGTTCCGGTTGCTTGCAGATACGGTCGAGCATGATTGCCAGCAGCACGATCGCCAGGCCCGCTTCGAAGCCCAGGGAAATATCGGCGGTGTTCAGTGCGTTGACCACAGGTTTGCCCAGGCCGTCAGCGCCCACCAGGGCGGCGATCACCACCATCGACAGCGACAGCATGATGCACTGGGTGACGCCGGCGGCGATGCTGGGCATCGCATGCGGCAGTTCGATACGGGTAAGCAATTGGCGACGCGAGCAGCCAAAGGCCTTGCCAGCGTCCATCAGCTCCTGCGGGACGTCGCAGATGCCCAGGTAGGTGAGGCGGATTGGTGCCGCAATGGCGAAGACCACGGTAGAGATCAGCCCCGGTACCACACCCAGGCCGAACAGGGTCAGGGTAGGGATCAGGTAGACGAAGGTGGGCACCGTCTGCATCAGGTCGAGTACCGGGCGCATGGCGGTATAGAACATTGGTTTGTGCGCAGCCAGGATGCCCAGGGGCACGCCAATCACCACGCACACCACCGTGGCGAAGGTCACCTGCGCCAGGGTTTCCATGGTTTCCTGCCAGTAACCCAGGTTGAAGATCAGCAGGAATGACAGGGCAACGAACGCGGTCAGCCCCCACTTGCGCTGGATCAGGTGTGCCAGGCCGGCGAACAGGGCGATGAGGACGAACGGGTTGAACCAGGTCAGGGCACTGGTGACCCCATGGATCATGAATTCCAGGCCTTGC
GCGATGGCGTCGAAGTAATTTGCGCCGTTCTGGGTCAGCCACTCGACGAACGACGCGATGTACTGGCCCAGGGGTATTTTCTGATCGATAAGCATGATAGCGAGCTTCCACCTGCATAGATTGAAATCAACCCGGGGCAGGCACGGTCCTGCCCCGAGGTAGCGCTATTGCGTGCTGCGTTATTGCGCCAGCTTGGCCTTGGCCGCCTCAAGGCCGGGTTTGCCATCAACAGTGGTAACGCCGGCAAGCCAGGTGTCCAGCTTGCCGGGGTTGTCTTTAAGCCACTTCTTGGCGGCTGCTTCGGGTTTCATCTTGTCGTCCAGGACATAGCCCATCATGGTGCTTTCATCCTTGAGCTCGAACGACAGGTTCTTGAGCAGTTGGCCAACGTTGCTGCATTCCTGCGCGTAGCCCTTGCGGGTATTGGTCAATACGGTCGCCTTGCCGAAATCGGGCCCGAAGAAATCGTCCCCGCCGGTCAGGTACTGCATCTTGAAGCGGGTGTTCATCGGGTGCGGTTCCCAGCCCAGGAACACCAGGGCCTCGCCGCGTTTTTGCGCGCGGTCGACCTGCGACAGCATGCCGGCCTCGCTCGATTGCACAATCTTGAAACCGGCGTCCTTCAGGCCGAAGGCGTTCTTGTCGATCATGCTCTGGATGGTGCGGTTGCCATCGTTGCCGGGCTCGATGCCGTAGATCTTGCCGTCCAGTTCCTTCTTGAACTTGGGAATGTCGGCGAAATCTTTCAGGCCCTTGTCGTACAGCGCCTGGGGCACGGCCAAGGTGTACTTGGCGTTTTCCAGGTTGGCGCGCACCGTTTCCACGGTGCCAGCGTCGCGGTACTGCTTGATGTCGTTCTCCATGGTCGGCATCCAGTTGCCGAGAAACACGTCCAGGTCCTTGCCGGTGGCCAGCGACTTGTAGGTCACGGGTACCGAGATCATGGTGGTGTGGGTTTTGTAACCCAGGGCTTCGAGCACAACGCTGGTGGTCGCGGTCGTGACTGTGATGTCGGTCCAGCCGACGTCGGAGAAACGTACCGTCTGACACTGCTCGGGTTCGGCGGCCTGGGCCAGCAAGGGCGTGGACAGCAACGCAACCAGCAACAGCGAGGGTGAACCTTTCATGGATGGACTCCTGAGATATTGTCTTCGGGTTCGCGTGGGTCGCCGGGCTTTCTGAGGGTCCGGTCGACCAGGCCTGCAGAGGGCAGGATGCGTGGCCGTGCTTTACGAGTCGCTTTCGATAATCCTCCAGCGCCGGGCAATCGCCTACTGGTAGGGTCGTATCCAGTACGGAACGGGTCGTATCCAGTAAGGGCGATGTCGCTTATGGGTTTATCCACCCCCTCGGCCGCATTTTTGCGTCACCAGGCCGCCGGAAAGCGGCGTGGTGAGGGCCGCTCCTTGAGCAAAAACAGCGCGGCGCGGCTGGACAAAAGTACGTCGGTTGCAGACGGCGAGGGGTGCGGTAAAAGCCCGATGATGCTGGCATTCAAGGCGGCCCGCTGTTTGAGCCTAGCAGTTGCCCCGCCCTGCGCATGGCGCGCAGAGACAAGCCCAGCAAGAGGTGATTCGACAATGGCCATCAGCGTGTTCGACCTGTTCAAGATCGGTGTCGGGCCTTCCAGCTCCCACACCGTCGGCCCCATGCGTGCTGGCGCCCTGTTTGTTCAGGGCCTTCGCGAACGCGGCGAGCTGGAACGTGTAAAACGGATCGAAGTGCGCCTGTATGGCTCGCTGTCGGCCACCGGCATCGGTCACGGCACCGACAACGCCACGGTCATGGGCCTGATGGGCGAATGGCCTGACGCCATTGACCCGACCCAGATCGTGCCGCGCATCGCCGACCTGCGCGAAACCAACACGCTGCAGCTCGATAGCCGCTTGCCCATCGAGTTTGTCTGGGCCCGCGACATGCTGCTGCTGGACGAGAACCTGCCGTACCACCCCAACGCCATGACCCTGATTGCCGAAGGCGAGCAGGGCGAGTTGCACCGC
GACACCTACTACTCGGTGGGTGGCGGCTTTGTGGTCGATGCCGCCCAGGCCGCCAGCGGTGTGCTGGATGCCGACCAGACGGTGCTGCCGTACGACTTCAACAGCGCCGCCGAACTGCTGCGCCTGTGCAAGCAAAACGACCTCAGCGTGTCGCAATTGATGATGGCCAACGAGAAGGTCTGGCGCAGCGAAGAAGAGATCCGCGCCGGCCTGCACAAGCTCTGGGAAGCCATGCAGGAATGCGTCAACAACGGGCTTAAATACGAAGGCACGCTGCCCGGCGGGCTGAACGTGCGCCGCCGCGCTGCCAAGCTGCACCGCAGCCTGCAGGAGATCGGCAAGCCCAACGTGATTGGCTCGACCATGAGCGCCATGGAATGGGTCAACCTGTTCGCCCTGGCGGTCAACGAAGAGAACGCCGCCGGTGGGCGCATGGTTACCGCACCCACCAATGGCGCGGCCGGCATCATCCCGGCGGTGCTGCACTACTACATGCGCTTCAGCGATGAGGTGAACGAGTCCAGCATTGTCGATTACTTCCTGGCTGCGGCTGCCGTGGGCATCCTGTGCAAGAAGAACGCGTCGATTTCCGGTGCCGAAGTGGGCTGCCAGGGTGAAGTCGGCTCAGCCTGCGCCATGGCGGCGGCAGGGCTTGCCGAAGTGTTGGGTGCCACCCCGCCACAGGTGGAGAACGCGGCCGAAATTGCCTTGGAACACAACCTCGGCCTGACCTGCGACCCGGTCGGCGGGTTGGTGCAGGTGCCGTGCATCGAGCGCAACGCGATTGCTGCGGTAAAAGCCATCAACGCGGTGCAAATGGCGCTGCGCGGTGACGGCGAGCACTTCATTTCCCTCGACCAGGTGATCCGCACCATGCGTGATACCGGGGCCGACATGCACGACAAATACAAAGAGACCTCGCGCGGTGGCCTGGCGGTCAGCGCTATCGAGTGCTGATCGTTACCAGATTTGCTGGCCTCTTCGCGGGTGAACCCGCTCCCACAGGCATTGCGCAGTACGTGTGGGAGCGGGTTCACCCGCGAAGAGGCCTTTGAAGGTTGATGCAGGTGCAACATAACCACCAATGCCTCCTTATGGTGCGCCCGCCGCCTACCGATCGTCGGGTGTCGCGATCAGTGTCGGCATTGTCGGTGACACTCAAGAGTGTCGTTTCTGGGCAACTTACGCTGCACGTGTCACCAAATTGCTCAGCCGTTACACGCACGGCGCCTAGCACGCCTGTTTTGCGACTATCTGCCAACTCGGCCGAAGCCCCTGATTTGCCTGATGCAGGTGTCGTTTTCAGGTTTTTTTGGATAGCCGTTCAATTTCAGGCACGGCGTTTGCGTTGAGATTGGCAACCTACCCCCGGAAAATGACCGAGGGGTCATAACAAGAAATCAGTGCCCGCCTGAGGCACACCCTGCATTTGTGTGAGGAGAAATCGCGATGACGTCGTACACCTCCGGGACCCCAACCCAGAACCGCACAGCACCCCAGTCCATCGGGTTTCTTCTACTGGACAATTTCACCCTGATTTCCCTGGCATCGGCCGTCGAGCCGCTGCGCATGGCCAACCAGCTTTCCGGCCGCGAGCTGTACCGCTGGCACACCCTGACTGTCGACGGCGGCCAGGTCTGGGCCAGCGACGGCCTGCAGATCACCCCCGATGCCGCCATGCACAGCGCGCCGCCCATCGATACCGTGATCGTCTGCGGCGGCGTGGGCATCCAGCGCACTGTTACCCGTGAGCATGTCACCTGGCTGCAGGCCCAGGCCCGTCAGTCGCGCCGCCTTGGCGCGGTGTGCACCGGCAGCTGGGCCCTGGCCTGTGCCGGGCTGCTCGACGGTTTCGATTGCAGTGTGCACTGGGAGTGCCTGGCGGCCATGCAGGAGGCCTACCCACGGGTGAACATGAGCACTCGGCTGTTCACCCTCGACCGTAACCGCTTCACCAGCTCCGGCGGCACTGCTCCGCTGGACATGATGTTGCACCTGAT
CAGCCGCGACCATGGCCGCGAGCTGTCGGCGGCCATCTCCGAGATGTTCGTCTACGAGCGCATCCGCAACGAGCAGGACCACCAGCGCGTGCCGCTCAAGCACATGCTCGGTACCAACCAGCCGAAGTTGCAGGAAATCGTCGCGCTGATGGAGGCCAACCTCGAAGAGCCGATCGACCTGGACGAACTGGCGGTGTACGTGGCGGTGTCGCGACGCCAGCTTGAGCGGCTGTTCCAGAAGTACCTGCACTGCTCGCCATCGCGCTACTACCTGAAACTGCGCCTGATCCGTGCGCGGCAGTTGCTCAAGCAGACACCGATGTCGATCATCGAAGTGGCGTCGGTGTGTGGTTTTGTGTCCACGCCGCACTTCTCCAAGTGCTACCGCGAGTACTTCGGCATTCCGCCGCGTGACGAGCGTGTGGGTTCCAACACCGCGCAGCAGGTAGCGATGATGCCGATTCCGCAGGCCATGACCCTGTCGCCGCACAGTGGGCCGATGGCCGCGCTTAGCCAGGCGCGCAACGAGTCGACCTTTGCCAGCGTAAGGCTCTGACACCGCGCCGGGCTTTTCGCGGGCACGCCCGCTCCCACAGGTACTGCACAGGCCTCGATCTGTGTGTAGTCACTGTGGGAGCTGGCTTGCAGGCGATAGGGCCAGTACAGGAAAAAGAATCAGGCTCGCTTGTTGAAACTCGCCAGCGCTGGCAGCAGCTGCTTGTCGATGGCCTGGCGTACAGCAGGCAGGATCGTCGCACTACCGGTGTACATCTGCTCCACCATCCCCTTCAGCGCCCGCGCATTGGGCTCGGTCAGCCCACGCACCACCGCTTCACACGCCTGTTCGGCACTGGCCCCGGCTGGAATATCGAAACCGCGGGCGCGCAGGTGGCCGGCGAGGTCGTCCTGGTCAATCAAATCCGCATGCATCATGGCTGTTGTCCCTTTTATCGCGAAGAGAGTGCTGCTGTGATTCAGGACCGATTCTGTGACGTGCGCAACAGCCTCGCAAGGACAGAATATCGCCATGGCTGCTTTGGCTAAGAACAGGTCGTTTACGGCTAAATCCCCCGCCCGGCAAGCACGCACACTGAAGCCATTCACGGTACCCGAGGGTTTGGTCACCCTCGCTCACGCACAGTGGATGTTGCTCGCTCTTGGCCCCGGTAGCTCACGGCTTTCCGGGGTTATTTTTTTGTGTTTCTCAACAGTTCCATCTTATAAAACTTCTAGCAGCCTAGTATTTCTGCCAGGTGACGGAAAGATCCTGATAGTGCTCTCCTGCGGGCGACGGTTTTCCACAGGAGCGGAAAAGTGAATACCACGATCTGGAACAGAATCCTGCTTGCCTCAATGGTGATGTTTTTTGGGGGATGTTCAATCTCTGAGCGCACCGTGGCTGCGGCCAACAGTACACCCCCTCAACTTTTCACACTTGAACCTACGCCTGAGCGCAACGTCCTGCAAAAAACGGCTAGCGGCTACCTGGCCACATTGTTGGGTGACCCGGCGAACGCGGAAGTTACTCTGGTCAAGATCGACCCTGCACGGGTGAGCCAGCAGACGCAGGATTTGGCGGTGAGCTTGCCGGACGGTAAAACCGCCCAGTTTCATCTCCGCGATTTCACCACCCTCAAATCGGGTATAGACAGCTGGGTTGGCTACAAGCCTTCGGCATGGAAAGCGACCCACGCGGCCTCCGCTTCGGAGATTGATTACGACCCATTTTATTATCTGTCCATGGTGCGTGAGGGTGACAAACTGGTGGGCAATCTCATTGTCGAGGGGCAGCGTTATCGACTGGACTCTATTGGTTCAGGGCGGTATGTCTTGATCAAAGTTGACGAGTCCAAACTTCCTCCAGACGGCGAACCCCTCGTAGCCCCTGCAGGGGGCGCGCGCGATGACACCAAAGTGAAATCGCCGCAATCCGCGCACAGCGTAATACGCTTGATGTTCTTGGCAACCAATCAACGAAAGGCCGCTAACCCCTGGTGGTG
GCTTGACCTTCTCCTGGCGATGAGAGATGCCAACCAATACATGAAAAACAGTGACGTGCAGATCACCTACCAATGGGCCGGGAATTTTTATGGGGATTACGACGAGACTGGAAGAAGCAGCTCCCAGCAACTGGGAGATATCGGTAATGCCCAGCCGTTCGCATCGCAAGTCCTGAGCATGCGAGAGGAATTAAGAGCGGATATGGTCTTAATGTATACTACAGAGCCTTCGGTGTGCGGCAGAGCCTACATTTCGACGGGCAAGAATTCGCCTCATTCGATAGTTACCTGTCCCAGATCGCTTGCGCACGAACTTGGTCATAACATCGGTGCACTGCATAAACAGGGCGAGACTGGAAATGTTCCTGAATATGCATACGGCTACGAGACCCCTCTTTATAACCTACACACTCAGATGGTAGCGTCTCGCGACGCTTTACCTAATTTCTCCAACCCCCGTGTGACCTACCTGGGCATGCCGTTAGGGGATAGTAAATTTAATGACGTTGCGAGGCGGTTTAACGAACGCCGGGAAACCGTCGAGAATTTTTACCCGCCACCAAATCCGTTGACGGTACTGGTGCAACTTTTCAGTGATTATAATCAACGGGGTGACATGTGCACCTTGAAAATCGAGCCGGGTGTAGATCAGCCGTCATGCAATAAGGTAAAGTCTATAAGGGTTTATGACTTTGCCCCAGGAATGAAACTTTGCTTTAGATCGGAGGAGTATAGAAGAGTTTGTTACGCTGGTTCGTATGCTGGTGACTTTGCGGTGCCTAATCTTAAAGCGGTAAGCGTGCTCCCATCCGGGTTGGTTCGCGAGGGGAGGGTTGGCGATGACGCATTTTACGGTGACGTTAAATATGTTGAATACGCAATGGATAACCAAGTGACGATTAAATTGTATTCGCTTGCGGACTTTAAGGGGGAAACCTGCGAGTTTCGCATGCCCACGAGTACCGTAGGCAGCCTGTCCGAGTGGCCGCAATGTGCGGCTCTCAGCGGCGGCAAGTCACGCTCTGCCAAAGTATTTTCTTTTTCCAGTCCCTATAACAAGCTGTGCTTTTTCAACGCAGATCACCGTCAAAGTCTCTGTTTTAACGGAAACTATCAAGGTAATTTCTCGATCCGAAACTGGGACGTGGGCACGGATTTGCCGTCGGGTCTGGTCAGGACGCATCGTGGTGGCGGCTATATGAACGGTTCTGTGCATCGCATTTCCTATGGGAATGACTCAGAGGATTGGCCTCCCGCACCTTGAGTGCTACCGTAGATTCGGGGCGTAATATGCGCCCCGGATTCCATTACTCACCCAATTATCGTCATAAAAGCCGTCGGAGATGAAATCGCGCAGTGCCTTGCGCACATGCAGCGGGTGCATGCCGGGCGTGGTGGAGAGGGTTTGGTTACCCTCGCTCACGCACACGCACAATGGAGGTGCTCGCTCTTGGCCTCGGTAGCTCACGGCTTTTAGGCATCATTTTTTGCAACTGTTATTTATGTCAGTTGTTGGGAGTCGATGTCCATGGTCAAGTAAGGCTCCCATAGCAGGAGCATGGACATGAACCGGAAACAATTGAAATTCTGCTTGCTTGGTGTTGTGTTTCTTTTTGGAGGATGCACAGCTTCGGACCCTGCAAACTACCTCGAAAGTTCGGCGGTTACGCTATTTCAGATCAAGCCAGACCAGAACCGCGCCGCCCTTGAGAGGCAATCCGGTACTGACCAGTATTTGAAAATTTTGTTGAATGTGGCAGAAGACGCTGAAGTTAAAGAAGTGCAGGTCAAGCCGGGGCTTGTATCGAAAGATACGACGATGCTATCAATGCCTCTGACTGACGGCAGAACGGTGAGCTTCAAGTTGTCAAGAAGCGATAATGTAGCTTCGGGAATGGTAGGGTGGGTTGGTGATATGCCTTCCAATCGTAGACAACTCTACCCCTCACCTGCGGAAATTAATATGGATCCGCTAAACTGGGTTTCGCTGGTGAGCGACG
GTAAGCTGGTCGTTGGGGATATCCGTGTGGAGGGGCAGCTCTACCGTTTAACGGCGGTTGGCAAAGGCCAGCAAGTTTTGGTCAAGGTTGATGAATCAAAACTACCACCAGAAGCAGCGCCGATAGCTGCTCCGGTGCAACCACAGGGCAATGCCCAGTTGGTCATTGCTCCTCTGTCGTCAAAAAGTACGATTAGGGTGCTTTTCGTGACAACCCGGCAGTCCAGGGCGCGATTTCCGAATTACAAGATAGAATTGGCGCAGGCGTTACAAAACGCCAACCAATACTTGATCAATAGCAAAGTCGATGCGGTGTACGAGCTGTCGGATATTTACGATTCTGACTACGATGAAACAGGCAAAGAACCCCAAACTCAGCTTAATGATATGATAGCTGATAAACCATTAGGGGCCAAAATTCATATCGAACGTGAAAAAGTACGCGCCGATCTGGTATCGATGCTTTCCACGTATAGTATTTACTGTGGTATAGCAAAAATGCCAGCGCGCAAGGAGACGGCTTTCTCGTCCATTAGTTGTTTTGGTGCGTTAGGCCACGAATTAGGGCATAACATGGGGGCTATGCATAGTGACGAGTTTCCCGATCAGGGCATACCCGCTTACGCTTATGGCTACAAGCATACAGCGCCAAATTTCCATACGCAGATGCGAACGTCGCATGGTGCCATTCCCTATCATTCAAACCCGCGGCTGCAGTACCAAGGGGTTCCTATGGGTACGGTAGATAAAAATGATGTGGCGCGGACTTTCAACGAAAATCGGGACACTGTTGCCAATTTTTATCCGGACCCTGCTCATCGCGTTCGTTTGTGGTTGTATGGGGCGAACCGGGGTTGTTTCATTGATTTAAAGCCGGGTGAACAAGCGCTGCTCTCTTGGTATACGGAATGCAAGGATAATGATGTCGATCCCGTACGGGTAGAGGTAAAAGACTTCTACAGCGGTTCGACCCCGAGAAAATTATGCTTCTCCAACATTTTTTACACAGTAAGGAGCTGTTATACGGGCAGTAACTTTTCTGGGGATTTTGTTATCACGAGCGTTCATTCTGGCGGTGGTAAACCAGAGGGGTTCAAATTTACAAGAAGACTGATAGGGCCAGTCTATAGGGTGTCATACGAGTGATCGTTTTGATTATCCGCTTCTACTCTGTCTGATTGTTGCGAGTGAAGGCGCCTAGGTGCTTCGTATCTATGGAGACCCTGGCGCCGGTTCTCTGGTTATTCAAGAAGGAAGTTTTATCGTACGCCCCATATATTGCGGCGCCTGCGCCCCCTCCTGTTGCTCTACCAAGTGATTCAATCGGCAAACAGTCGTGTGCCCAAAAGGCGCTGACTTCGGCCCCGCTAGGTCCACGTGCAGTAACATCTGCTCGCTCGCCGCCAGCGCCTCGTCGAACCCTGCCCGGTGCAGGCTGTGATACACGTGAAGACGCTTGCGATCAAAGCCGATGATCTGCGTCTGCACCCACACCTCGGTGCCCAGCTTCACCTCGTGCAGATAATTGATATGCGCCTCCAGCGTGAACAGTGAATTACCGCTTTGCCCGCGGCTGTCGGCGTCCAGGCCAATGCGCTCCATCAGGGCATCGGTCGCGTAGCTGAAGATCAGCAGGTAGAAGGCATCCCGCAGGTGGCCGTTGTAATCGACCCAGTCCTCCTGGACCGTGGTGCGGTAAGTGATCAGAGCGGGCATCTTGGTCTCCTTAATCGCTAAACGACATGCCGTGGCTGGCCTTGCTGGTTTTCACCGCTTCCAGTACCGCCAGCAGGGTGTCGTCACGGTAGCGCTCCAGTGCAGCGATACTGCGCTCGCCCTGTTGCTCAGAGGTACCGTTCACCACATCGTCGATCAGCTTGTCGGTCAGTTCCGGTGCTGGCAGGTAGGTCCAGGGCAGCTTCAGGGCAGGGCCGAACTGTGACATGAAGTGGCGCATGCCGGCATCGCCACCGGCCAGGGTGTAGGTAAGGAACGTGCCCA
TGAACGACCAGCGCAAGCCTGCGCCGAAGCGAATCGCGTCGTCGATTTCGCCCGTGGTGGCAACGCCGTCGTTGACCAGGTGCAAGGCTTCGCGCCAGAGCGCTTCGAGCAGGCGGTCGGCAATGAAGCCTGGCACCTCCTTGCGCACATGCAGCGGGCGCATGCCGAGTGCGGTGTAGATGGTTTTGGCAGCTTCAATGGCCTCGGGTGAGGTGCGGCTACCGCCAACGATCTCCACCAGCGGCAGCAGGTACACCGGGTTGAAGGGGTGGCCGACAACACAGCGTTCAGGGTGAGTCGACGATTCGTAGAATTCGCTGGGCAACAGGCCGGAAGTGCTTGAGCCGATGATTGCGTCAGGCTTGGCTGCGGCGCTGATTTTTGCGTGCAGGTCGAGCTTGAGGTCCAGGCGCTCTGGTGCGCTTTCCTGGATGAAATCGGCGTTGCGCACGCATTCTTCGATGGTGCTGACGAACTTCAGCCGGTCCTGGGAGGCACTTTGGGCCAGACCTTGTTTTTCAAGCGCCGGCCAGGCGTTGGCGATACGTTTGCGCAGCGCCTGTTCGGCACCGGGCGCCGGGTCCCAGGCGACCACATCCAGGCCGTGGGCGAGGGCGCGGGCGACCCAGCCGCTGCCGATCACGCCGCTACCCAGGGCGGCAAAGGTCTTTATCTCGGTGATGAAGGGCATATCGGTCTCCTGAATGGATCAGCGGCGCTTGAGGTTCATTTTTTCCCGGCCTTCTGCCGGGCTGAGCACGCGGCCACCCATGCGGGTGATGATCTCGCTTGCGCGTTCCACCAGTTGGCCATTGCTGGCCAGCACCCCACGGTCCAGATACAAGTTGTCTTCCAGGCCCACCCGCACATTGCCGCCGAGCAAAACGGCCTGGGCGGCCATGGGCATCTGCATGCGGCCGATGCCGAAGCCGGCCCAAGTGACGTTGGCCGGCAGGTTGTCCACCATCGCCTTCATGGTGGTGGTGTCGGCCGGTGCGCCCCACGGGATGCCCAGGCACAGCTGGAACAGCGGGTCTTCGAGAAGGCCTTCCTTCATCATTTGCTTGGCGAACCACAGGTGGCCGGTGTCGAAGATTTCCAGTTCGGCTTTCACGCCCAGTTCGGTGATGCGCTTGGCGCCGGCGCGCAGTTGGGCCGGGGTGGACACGTAGATCGAGTTGCCGTCGCCGAAGTTGAGGGTGCCGCAGTCGAGGGTGCAGATTTCCGGCAGCAGCGCTTCGACATGGGCCAGGCGCTCCAGCGGGCCGATCAGGTCGGTGCCCGGGCCGAACTCCAGCGGCGTTTCACCCGGGCCGATTTCCAGGTCGCCGCCCATGCCGGCAGTGAGATTGACGATGATGTCCACGTCAGCTTCACGGATGCGTTCCATGACTTCGCGGTACAGCGCCACATCGCGGCTGAAGCGGCCGGTCTGCGGGTCGCGGACGTGGCAGTGGACCACGGTGGCGCCGGCCTTGGCGGCCTCGACGGCGGATTCGGCGATCTGTTTGGGGGTGACCGGGACCAGGTGGCTTTTCGAGGCGGTGTCGCCGGCGCCGGTGAGGGCGCAGGTGATGATGACGTCGTGGTTCATGGCGGGGTTCCTTGTGTTGTCTGTGCTGGCCCTATCGCCGGCAAGCCAGCTCCCACAGGGTTCGCGTGAACCCTGGGCGTAGGCAGAACCTGTGGGAGCGGGCGTGCCCGCGAAAGGGCCGGATCAGTTGGCAGTAAGCTTGAGGTTGTCTGCGGCCGGCTTGCCATCGAAGGTGGTCACGCCCTCAAGCCAGCGGGCTTTGTCATCGGGGTGGTCCTTGAGCCACTGGCGGGCCGATTCCAGCGCATCCTTGTGGTCGAGCAGCGGCTGCATCATGCGGCTCTCGTCCTCGGCGCTGAAGTTCAGGTTGGCCAGCAGGCGGTGGGCGTTGGGGCAGCGTTCGGCGTAGTCCGGGGCAGTCACTGTCCACACCGTTGCGCGGCCTTCGTCCGGGCCCAGGGCGTCC
TGGCTCTCGCCCAGGTAGGCCATGTCGATGTTCACGTTCATCGGGTGCGGCGCCCAGCCGAAGAACACCACGGCCTCCTTGCGCCGCACGGCGCGGTCGACGGCAGCAAGCATGCCGGCCTCGCTGGACTCGACCAGCTGGAACTTGCCCAGGCCGAACTGGTTCTTGGTGATCATCGCCTTGATCTGGGTGTTGGCGCCGGAACCGGGCTCGATGCCGTAGATCTTGCCGCCCAGCTCCTTCTCGAACTTGTGGATGTCGGCGAAGGTCTTCAGGCCTTTGTCATACAGGTACTTCGGCACTGCCAGGGTCGCCCGGGCATCTTCCAGGCTGGGCTTTTCCAGCACCTTCACCTGCTGGGCATCGATGAACGGGGTGATGGTCTGGGTCATGATCGGGTTCCAGTAACCCAGGAACATATCCAGGCGCTTGTCGCGGATGCCGGCGAAGATGATCTGCTGCGAGGCGCTGGTCTGTTTGGTCTGGTAGCCCAGGCCGTCGAGCAGCACCTGCGCCATGGCGCTGGTGGCGATGACGTCGGTCCAGTTGACCACGCCCAGGCGGACATTCTTGCAGGCCGCAGGTTCGGCGGCGTAAAGCGGGGAAGTGGCGATGCTGCTCAGGGCAAGGGTCAACAGGCTGCGGCGGATCAAGCTGTGCATGGTGGCTCTCCATCGGCAGTGCATATTGTTATGGGGTTCGGTCTCTCTGGGCCGTGTGAACACGCTACGCTGCACGCAGGGTGAAAAATCGCACCCTGGCGACCAACACTTGCACGGAGGCGACCACCTTTGCCGTGGAGCATGCAATGCTGCAAAACATTCAATTTCTGCTGTTGCCCGGGTTCTCGGCCATGGGCTTCATCAGTGCCCTGGAGCCGCTGCGGGTAGCCAACCGGTTCAAGGGGCCGTCCTACCGTTGGCAGGTATTGAGCCTGGATGGCGGTGCGGTGCAGGCCAGCAACGGCATGTCGGTAAACGCCGATGCGGCGCTGGCCGCGGGCGAGCCTGCCGGCATCCTGCTGATCGTGGCCGGTTTCGATCCCCTGGCCTGTTATGGCCAGGCGCTGCAACAGGTGCTACGCCGCCTGGATCATGAAGGGGTGATACTCGGCGGCATCGACACGGGTGCAGTGGTACTGGCCGAGGCCGGCCTGCTCGACGGCCACCGCGCCACCGTGCACTGGGAGGCGCTGGAAGCGTTCAAGGAAAACTACCCGAGCCTGCAGGCGACCCAGGAGCTGTTCGAGATCGACCGGCGGCGCATCACTTGCGCTGGCGGCACGGCGTCCATTGACCTGATGCTCGACCTGATTGCACAGGCCCACGGCAGTGAACTGGCGGTGCAGGTGTCGGAGCAATTCGTGCTCGGGCGTATCCGCCAGCGACAGGACCACCAGCGCATGCAGATCGCCAGCCGCTATGGCATCAGCAACAAGAAGCTGGTGAAGGTGATTGGCGAAATGGAGCGCAACACCGAACAGCCGCTCAATACCCAGGTGCTGGCTGAAGCGGTGCAGGTGACCCGGCGCCAGCTGGAACGGCTGTTTCGTGTGCACTTGGACGACACGCCCAGTGGTTTCTATCTGCGACTGCGGCTGGACAAGGCGCGGCAGTTGTTGCGCCAGACCGACATGAGTGTGCTGGAGGTGGGGGTGGCTTGCGGGTTCGAGTCGGCTTCGTACTTTACCCGTTGTTACCGGGCGCGGTACCAGCGTTGCCCGCGGGAGGACAGGTTGGCCAGGGCAGTTTGATTTCCTCCTGCACTGGCCCTATCACCGGCAAGCCAGCTCCCACAGGTATTGCACAGCCTTTGAATGCCGTGGGGTACCTGTGGGAGCTGGCTTGCCGGCGATAGGGCCTGCGCAGGAAACAGGGATTTCACTGCTGGCCGATGGACTGCAAATACTCCGACCGGTCATCACTGCGCTGGGCCACGCAGGTATCCCAGGCCGCCTGGAATGCCTTGCTCCCCTGCTTCTCGGCATAGGTCTCAACCTT
GCAATCAGCATCGCGCAGTTGCATCCACAGCTTCTCTGCCGCGTCCATGCGCCCGATCAGGGCAGTGGCCTGGTCGCTTTCGTCGGCATATTGATCGCGAATGCGCTGGATCAAGTCGTCGTAAGCCGATTTCAGCTCCCGCTCGGCCGTCTGCTTGTTGAATGCAGCGCAGGCGTAGGTCTGCTGATCGGTGTCGACGTTGTCGCACGGGGTGCTTTCTTCCTCGCCGGCCTGGGCGCCCGATACGACGGCCAGCAGCGCCAGCCATGCCAATGATTTCATCCCTTTTCTCCTCAACAGGCTGACGAATCGCAGGGATTCTCGCTCAGACGAGGGCGCGTGGGTAGCCACCTGTCCAAATGTTCATAAAGCAGCCCGAAAGGTCGTTAACGCGACTGCCTCTCATCGCCAGACGGCGTGCCAGAGGGCGTGTTGTCGCGCATTGACGCTTTCGGCAAACCCCCTGTCGTTTTTGCATCGGGGCGCCTTTGGGCACAGGCATATGCTGGCCCCAAAGCGCCGGCACAAGGTTTGGCGCCAACCATTCAATAAAAGGGGACAGCCTGATGAGCCCAGCCGAACTTCACGCCGACAGCATCGTCATCGACGGCCTGATCATTGCCAAATGGAACCGCGAGCTGTTCGAGGACATGCGCAAAGGCGGGCTGACTGCGGCCAACTGCACGGTGTCGGTCTGGGAAGGCTTCAAGGCAACCGTCGACCAGATCGCCGCCAGCCAGAAGCTCATCCGCGACAACAGCGACCTGGTGATGCCGGTGCGTACCACCGCCGACATCCGCAAGGCCAAGGAACTGGGCAAGACCGGCATCCTCTTCGGCTTCCAGAACGCCCATGCGTTCGAAGACCAGATCGCCTATGTGGACGTGTTCAAGCAGCTGGGCGTGGGCATCGTGCAGATGTGCTACAACACCCAGAACCTGGTGGGCACCGGCTGCTACGAGCGTGACGGCGGGCTGTCGGGCTTCGGCCGCGAGATCGTGGCGGAAATGAACCGCGTCGGCATCATGTGCGACCTGTCCCACGTCGGCTCCAAGACCTCCGAAGAGGTCATCCTCGAATCGAAGAAACCGGTCTGCTACTCCCACTGCCTGCCGTCGGGCCTTAAAGAGCACCCGCGCAACAAGTCGGACGAAGAGCTGAAATTCATCGCCGACCACGGCGGCTTCGTCGGCGTGACCATGTTCGCGCCGTTCCTGGCCAAAGGCATCGACTCGACCATCGACGACTACGCCGAAGCCATCGAGTACACCATGAACATCGTCGGTGAAGACGCCATCGGTATCGGCACCGACTTCACCCAGGGCCATGGGCAGGACTTCTTCGAGTACCTGACCCACGACAAGGGCTACGCCCGTCGTCTGACCAACTTCGGCAAGATCATCAACCCGCTGGGCATCCGCACCGTGGGCGAATTCCCCAACCTCACCGAAACCTTGCTCAAGCGCGGCCACTCCGAGCGTGTGGTGCGCAAGATCATGGGCGAGAACTGGGTCAACGTCCTCAAGGACGTCTGGGGCGAGTAAGCCGCTCTCCAAGCCTGATAGCCCCTGCCAGCAACGCCGGGGCACCCACATAAAAATTTGTATGGAGTTGAGTTTCCATGGCCAAGATCGCCCCGCAATTGCCAATCGAAGTCGACAGCGAGACCGGTGTCTGGACCAGCGACGCCTTGCCGATGCTGTACGTGCCGCGCCATTTCTTCGTCAACAACCACATCGGTATCGAGGAAGTGCTGGGCGCCGACGCCTATGCCGAGATCCTCTACAAGGCCGGCTACAAGTCCGCCTGGCACTGGTGTGAAAAGGAAGCCGAGTGCCATGGCCTGGAAGGCGTGGCGGTGTTCGAGCACTACATGAAGCGCCTGAGCCAGCGTGGCTGGGGCCTGTTCGAGATCCAGGACATCGACCTGGACAAAGGCACCTGCAGCGTCAAGCTCAAGCATTCGGCGTTCGTGTACGTGTACGGCAAGGTTGGCCGCA
AGGTCGACTACATGTTCACCGGCTGGTTCGCCGGCGCAATGGACCAGATTCTCGCTGCCCGCGGCAGCTCGATCCGCACCGTGGCCGAACAGGTCTACGGCGGTTCGGAAGAAGGCCACGAAGATGGCCTGTTCGTTACAAAGCCGTTGTAAGCCGGAGATAGCGTCATGGCATTCGAAGCAATGTTCCAGCCGATCCAGATCGGCAAACTGACCATCCGCAACCGTGTGCTCAGCACCGCGCACGCCGAGGTCTACGCCACTGACGGCGGCATGACGACCGACCGCTACGTGAAGTACTACGAAGAGAAGGCCAAGGGCGGTATCGGCCTGGCGATCTGCGGCGGCTCGTCCGTGGTCGCCATCGACAGCCCGCAGGAATGGTGGGCGTCGGTCAACCTGTCGACCGACCGCATCATCCCGCACTTCCAGAACCTGGCCGACGCCATGCACAAGCATGGCGCCAAGATCATGATCCAGATTACCCACATGGGCCGTCGCTCGCGCTGGGACGGCTTCAACTGGCCGACCCTGATGTCGCCGTCGGGCATCCGCGAACCGGTGCACCGCGCCACCTGCAAGACCATCGAGGTGGAAGAGATCTGGCGTGTCATCGGCAACTACGCGCAGGCTGCGCGGCGTGCCAAAGAGGGCGGCCTGGACGGCGTGGAACTGTCGGCCGTGCACCAGCACATGATCGACCAGTTCTGGAGCCCGCGGGTCAACAAGCGTACCGATGAATGGGGCGGCACGTTTGAAGGCCGCATGAAGTTCGGCCTGGAAGTACTCAAAGCCGTGCGCGCCGAAGTGGGCGACGACTTCTGCGTGGGCATGCGCATCTGTGGTGACGAGTTCCACCCCGATGGCCTCAGCCACGAAGACATGAAGCAGATTGCCGCCTACTACGACGGCACCGGCATGCTCGACTTCATCGGCGTGGTCGGCTCGGGCTGCGACACCCACAACACCCTGGCCAACGTCATCCCCAACATGAGCTACCCGCCAGAGCCGTTCCTGCACCTGGCAGCCGGCATCAAGGAGGTGGTCAAGGTCCCGGTGCTTCACGCGCAGAACATCAAGGACCCGAACCAGGCCACGCGCATTCTTGAAGGCGGCTACGTGGACATGGTCGGCATGACCCGTGCGCACATGGCCGACCCGCACCTGATCGCCAAGATCAAGATGGGCCAGATTGACCAGATCAAGCAGTGCGTCGGTGCCAACTACTGCATCGACCGCCAGTATCAGGGCCTGGATGTGCTGTGCATCCAGAACGCCGCGACCTCCCGTGAATACATGGGTGTGCCGCACATCATCGAAAAAACCACCGGCGTCAAACGCAAGGTGGTGGTGGTTGGCGCCGGCCCTGCCGGCATGGAAGCAGCCCGCGTGGCTGCCGAACGTGGGCACGATGTGACCCTGTTCGAGAAGAAAGACCAGATCGGCGGGCAGATCACCATTGCCGCCAAGGCACCGCAGCGTGACCAGATCGCCGGCATCACCCGCTGGTACCAGCTGGAGCTGGCGCGCCTGAAAGTGGACCTGCGCCTGGGCACCGCTGCCAGCGTGGACGCCATCCAGGACCTGCGCCCGGACGTGATCGTGCTGGCGGTGGGCGGCCATCCGTTTGTCGAGCAGAACGAGCACTGGGGCGCTGCCGAAGGGCTGGTGGTCAGCAGCTGGGACGTGCTCGACGGCAAGGTTGCGCCGGGCAAGAACGTGCTGGTGTACGACACCATTTGTGAATTCACCGGCATGTCGGCAGCAGACTACATTGCCGACAAAGGCAGCCAGGTTGAAATCGTCACCGACGACATCAAGCCGGGCGTGGCCATGGGCGGCACGACCTTCCCGACCTACTACCGCAGCATGTACCCGAAAGAAGTGATCATGACCGGCGACATGATGCTGGAAAAGGTCTACCGCGAAGGCGACAAGCTGGTGGCGGTGCTGGAGAACGAGTACACCGGCGCCAAGGAAGAACGCGTGGTCGACCAGGTG
GTGATCGAGAACGGCGTGCGGCCTGATGAACAGCTGTACTACGCGCTGAAAGAGGGCTCGCGCAACAAGGGCCAGATCGATGTGGAGGCGCTGTTTGCCATCAAGCCACAGCCGATCCTCAGCCAGCCGGGCGAAGGTTACCTGCTGTACCGCATCGGCGACTGCGTGGCCCAGCGCAACGTGCATGCGGCGATCTACGACGCCTTGCGCCTGTGCAAGGATTTCTGATCGCACCGTCTTTCTAGAGGCGGGACTGGCCCCCCTGTAGGAGCGGCCTTGTGCCGCGAAAGGGTCGCGAAGCGGCCCCCAGCATTTCATCGGTATCAAGGATGCTGGGGCCGCTACGCAGCCCTTTCGCGGCACAAGGCCGCTCCTACAGGGGTCCGCGCTGGCCTTTGGGGTGGCGCAAGAATTTAGCTGTTGTGGGAGCCTCCCATGTTGAACACCCTTCTACCCATCCTGCTGTTCGCTGCCCTTGGCCTGGCGGTGCTCGGCGCCCTGCGCCGGGTGCGCATGTGGCGGCGTGGCCGGGCCTCCAAGGTCGACCTGATCGGCGGCCTGCTGGCCATGCCGCGCCGTTACCTGGTGGACCTGCACCACGTCGTCGAGCGCGACAAGTACATGTCCAAGACCCACGTGGCCACCGCTGGCGGCTTCGTGCTGTCGGCCGCGCTGGCGATCCTGGTGCATGGCTTTGGCCTGCAAAGCAAAGTCCTCGGCTACGCGCTGCTGGTGGCCACGGTGATCATGTTCACTGGCGCCATCTTTGTCTTCAAGCGCCGCCTCAACCCGCCTGCGCGCCTGTCCAAAGGCCCGTGGATGCGCTTGCCGAAGAGCCTGCTGGTATTTGCCGCAAGCTTCTTCATTGCCACCCTGCCGGTCGCCGGCATCCTGCCTGCCAACACGGGTGGCTGGGTGATGGTCGCAGTGCTCGGCCTGGGCGTGCTGTGGGGCGTGTCGGAGCTGTTCTTCGGCATGACCTGGGGTGGCCCGATGAAGCACGCCTTCGCCGGTGCGCTGCACCTGGCCTGGCACCGCCGCGCCGAGCGCTTTGGCGGCGGACGCTCCACCGGCCTCAAGCCGCTGGACCTGGAAGACCCCAACGCGCCGCTGGGCGTGGAAAAACCGGCTGACTTCACCTGGAACCAGCTGCTGGGCTTCGATGCCTGCGTGCAGTGCGGTAAATGTGAAGCCATGTGCCCGGCTTTTGCTGCTGGCCAGCCGCTGAACCCGAAAAAGCTCATCCAGGACATGGTCATCGGCCTGGCCGGTGGCACGGACGCCAAGTTCGCCGGTAGCCCGTACCCGGGCAAGCCGATCGGTGAACACGGCGGCCATCCGCACCAGCCTATCGTCAATGGCCTGGTCGACGCTGAAACGCTGTGGTCGTGCACCACCTGCCGTGCCTGCGTCGAGGAATGCCCGATGATGATCGAGCACGTCGATGCCATCGTCGACATGCGCCGCCACCTCACCCTGGAAAAGGGCGCCACCCCGAACAAGGGCGCCGAGGTGCTGGACAACCTGATCGCCACCGACAACCCCGGCGGCTTCGCCCCAGGCGGTCGCATGAACTGGGCTGCCGACCTCAACCTGAAACTGCTGTCGGAGGTGAAAACCACCGAAGTGCTGTTCTGGGTTGGCGACGGTGCCTTCGACATGCGCAACCAGCGCACCCTGCGTTCGTTTGTCAAAGTACTGAAGGCCTCGGGCGTGGACTTTGCCGTGCTCGGGCTGGAAGAACGCGACAGTGGCGACGTGGCCCGCCGCCTGGGCGACGAAGCGACCTTCCAGCAACTGGCCAAACGCAACATCCAGACCTTGGCCAAATACAAGTTCCAGCGCATCGTCACCTGCGACCCGCACAGCTTCCATGTGCTGAAGAACGAGTACGGCGCATTGGGCGGCGAGTACCAGGTGCAGCACCACAGTACCTACATGGCCGAACTGATCGCGGCCAACAAGCTCAACCTGGCCCAGCACAAGGGCGGCAGCGTCAC
CTACCACGACCCGTGCTATCTGGGCCGCTACAACGGTGAGTACGAAGCGCCGCGTGAAGTGCTCAAGGCGCTGGGTATCGAAGTGCGCGAGATGCAGCGCTCGGGCTTCCGTTCCCGTTGCTGCGGCGGTGGCGGCGGTGCACCGATCACCGACATCCCGGGCAAGCAGCGTATCCCCGACATGCGCATGGACGACATTCGCGAAACCGAGGCCGAGTTGGTGGCCGTGGGTTGCCCACAGTGCACCGCCATGCTCGAAGGCGTGGTCGAGCCGCGCCCACAGATCAAGGACCTGGCGGAGCTGGTGGCCGACGTGCTGATCGAAGAGGACGCGCCGTCTGCCGCAAAGCCGCAAACGGCCAAACGTGAACCTGCGGAGGTGCACTGATGAGCGACATTATCCGCCGCGACCCACGCGCCGAGTGGATCGCCCGTAACCGCCTGCACCCGCTGCACGCGGCCATGCAGACGCAACAAACCAGCTGGATGGGGCCCAATGGCCTCATCCGCAAGAACCCCCATGCGATTGCCGCAGGCTTCATCGGCCCGGCCGGCATCAAGCGCATCGACCGCAGCGGCGCCCAGCAGGGTACCGGTGTGGGCGGGCGGCGCACGGCGGCGGCAGAGGTCAAGCTGCCGCTGCACCAGGTACCGGCGCCGGCGTTCTACATCGCGGTGGTGCCGGACATGGTCGGTGGCCGCCTGAGCAGCCACGACCGCGACCTGCTCGGCCTGGCCCACAGCCTGGCCGGCAGTGACGGCGCAGTGCTGGCGGTGGTCTTTAACGAGCACAAGGAAAGCAACTTTTCCACCGCTGGCGTCGACCGCCTGCTGGTCGTCGAGGGCGAGGCCTTCGAGGGTTATGCACCGGAGCAACTGGTGCAGGGCCTGCGGGCTGTGGATAACCAGTTCACCCCGCGCCACTGGCTGCTGCCTGACAGCCGCACCGGTGGCGGCGAACTGGGCCGACGCCTGGGCGCGGCGCTGGGCGAGCGCCCGGCCACGCGCGTATGGCAGGTCAAGGACGGCCAGTGCATCGGCCGCGCCGGTGCTGGCCAGCAAGACCTGCAACGTGCGGTGCCACGGCTGATACTGGCGGCGGCGGAGTGCGCCGAGCCGGTCAGCGAAACCCGTCACGAAGCCTTGCCGGTGGAGTTGTCCACAAGCGTGGCGCGCAGCCTGTCGCGCATCGAAGACCTGGGCTCGGTGGCCGTGGACCCGGCCACCATTGCCATGGCCGAGGCCGAGTTCATCATCTCGGGCGGCAACGGGGTCAAGGACTGGGACCTGTACCACCAGGCCACCGCAGCCCTTGGCGCCACCGAAGGCGCCTCGCGGGTGGCGGTGGACGACGGCTTCATGCCGCGCAACCGCCAGGTGGGTGCTACCGGCACCTGGGTTACCGCGCGTGTGTACGTGGCTGTGGGTATCTCGGGTGCCATCCAGCACCTTCAGGGCATCGGCGCCTGCGACAAGGTGGTGGCGATCAACATGGACCCGGGTTGCGACATGATCAAACGGGCGGACCTGTCGGTGATTGGCGACAGTTCGGCGATTCTCAAGGCATTGATCGAGGCTGTGGACAACTACCGCAGCGGCGGCCAGCGCGACGCGGCATAAGGGCACGACCATGAGTACGAAAGTGATCAGCCTGGTTTCCATCGGTGCCCACCCAAGCTCCGGGCGCGCCCGGCGCGCCGAGCAGGATGCCCGCGCCGTGGAACTGGGGTTGCAGCTGGCTGGGGATAACTTGCAGGTGGTACATGCTGGCAATCCACAGGAAGAGGCTTTGCGCGCTTACCTGGGCATGGGCCTGGACCATCTTGACGTGCTGGAGCAGCCGGCCGGTGCCGATGTGCTGGGCGTGCTGGGGGATTACCTGCGCGACGCCGGAGCACAGCTGGTGCTGACCGGTAGCCAGGCCGAGACGGGTGAAGGGTCGGGCATGTTGCCATTCCTGCTGGCCGAAAAGCTTGGCTGGCCGTTGATCGTGGGGTTGG
CCGAGGTGGAGTCGATCGAGAACGGCACCGCTCAGGTATTGCAGGCCTTGCCGCGGGGCCAGCGGCGCCGGCTGAAAGTGCGCCTGCCGTTGCTGGCGACTGTGGATAACGCTGCGCCCAAGCCGCGCCAGAGCGCATTCGGGCCGGCGCGACGTGGTGTGCTGGCGGCGCGTAATGTGGCCATTGTCGAAGATGAGTTGCTGGCCGATGCCGAGTTGCAACCGGCCCGCCCTCGGCCCAAGCGCCTGAAGGTGATCAAGGCCAAGAGCGGTGCAGACCGCATGAAAGCAGCAACAGCCAAGGCCAGTGGTGGCGGTGGCAAGGTGCTGAAGGACGTTTCGCCACAGGAAGGTGCTGAGGCCATCCTCAAGCTGCTGGTGGAAGAAGGCGTGCTGCGCTAAGACCTTTACTGGCCTCATCGCCGGCAAGCCAGCTCCCACAGGATCACCACTGCCTACGGGTTCAGTGAAATACCTGTGGGAGCTGGCTTGCCGGCGATTGGCCCTGACAGACTAATGATGTTCCCTGGCTACATTGAACGCCCACTACCCCAATGCCCCGCCCTTCATCGAACCACTTCACAGTGCCCTCCTGGCGGCTGTGTATAAGTCACTCCTGGGTAATGCCCAAGTAGTAACCATTACCCCAGCACATCGCTGCATTTCACGAAACTGGCAGCACTATCAGGTACGCCTGTATTTTGCCCACGGAATCTGTTCGCCAAGGTGTGGATAAAGTGTTGGTGTATGGCTGGGGGCCATATATAAAGGGGCTTACAGAGTTTTGATCATAAACTGATCAACTCTGAGCCGGCTCCTCAGACACAGACATGCGTGCCTCCAACGGGTTTTGCCCACAATCGCTGTTAGCGGGTCTGTGGATAATCTGTTCGGCACTGCCTGAAAGCCGCGCAGGTCATGGCGTGTGAGGAGTTGATCAGAAAATGATCAAATAGGGGTGAATGCCTCCTGATCCATGATTATCAAGGCCTTGAGTCGTTTATCCACAGACCAGAGGCTGCATTGCCGCGTTTTGCTCACAAACTCTGTTGGTGTCTCTGTGGACAAGTTGTATGAACATCGCTGTAGCCCAGAAAGCATGAGGCCTTGACGTGATTGGTTGAATAGTGACCAGTTGCCGGCCTGCTATTTTTGCCAGTACGCAAACGCAATTTTCCTCATTACCCTGTCCACACGCTTCGTGACTTGTGGATAACCATGACGACATTCCTGCTAAGCATCGACCGCCAGCACTCTGTACTGGGTAGGGGGCGTCACGAGGCGCGCGCTTTTTCGCCTTACGGTGCCTTGAGCGCGGCACTGATACCAGGGTTGGCTTTCTGTGGTCAGCATCCGGATCCGCTGACAGGCTGTTATCCGCTGGGCAATGGTCGCCGCTTCTACAGTCCGAGCCTGAGGCGGTTCATCAGCTCTGACTCGCTAAGCCCGTTCGGCAAGGGAGGTATCCATGCATACGCCTATTGTGGCGGCGATCCCGTGAATCGCCATGACCCGAGCGGGGCGTTCTGGGGCGTCGTGCTCAGAATCGTTGGCGTGGCGTCCAGCGGTGCTACGTTGTTCGGCTCGTTGGCACGTACTGCCAAGAATGTAGTGGGGCGCAGGGCCGCTTTCTGGGCCAACAATAACCCTCCGGGGGGAGGCGGGCCTGTGCCCTCCGTGCGCCATCAGGAGCTGCCTCATGCATCCAGGGTCTCGAATCAGCAGTTTTTCATAACCGGGAGCGCTGGGGTAGCTGGGCAGTTGGCTGCTGCGATCTCTGGTGTGACGCCTGCCTTCCAGACAGCAACAGATGTATTGGGCGTGGTGAATTCAGTTACCAACCTGAGCGGCGGTTCGATAGGTAACTTTGCTGCAGCACGTGAGGTAGGGAGTTACTTATGGGCAAACCCACGTGAAATTCCCGTGGTCGCTCTGGAGACTTTCATGGATGTGACGATGGTGGACGAGGTGTTCGCTAACGTGGGCCGGGGGTTTACAGCA
GTAGCGGAAAGGATAAGGTCAGCGCGCCCGACCCCGCACGACGTCACCGTGTAGCGTCAGGCGCGGGTGTCCAAACGTCGGCCGACGCGCTCCAACTCGCCAGGCAAGTGGAGCGCGGCAGCCTAACGCTAAATACTCACTGCGCCTTCACCTCTTTCAGGTATGGAGCAGGCTCTGCACCCAAGTTGGCCAATACACGGTCGCTGTACCAGTCAATGAAGTTCACCACACCAAACTCGTAAGTCTTCGAGTAAGGCCCCGGCTGGTACGCCGTGGAGTTGATGCCACGCTGGTTTTCTTCCGCCAGGCGGCGGTCCTGGTCGTTGGTGGCGTCCCACACCTTGCGCATGCGCTCTGGGTCGTAGTCCACACCTTCCACGGCGTCCTTGTGCACCAGCCACTTGGTGGTGACCATGGTTTCCTGGGCGCTGATCGGCCACACGGTGAACACGATCATGTGGTCGCCCATGCAGTGGTTCCACGAGTGCGGCAGGTGCAGGATGCGCATCGAGCCCAGGTCCGGGTTCTTGATGCGGCCCATCAGCTTCTGGCAGGCCTGCTTGCCGTCCATGGTCATCGATACCGTGCCTTTGAGCAGCGGCATGCGCACGATACGGTTACGCAGGCCGTGGCTCTTGTGCAGGTACGGGATCTTTTCGGCTTCCCAGGCGGCGGCCGAAGCGGCCACATGGTCCTTGAATTCCTGGCTGGCGCGCGGGTCGTTGGTGTCGTCCCACTCCAGCAGGGTTTGCAGCAGCTCCGGGTGCGAACCGCTGCAGTGGTAGCACTCGCGATTGTTCTCCAGCACCAGCTTCCAGTTGGCCTTTTCCATCAAGGTGGTGTGCACCGCCACCTTGGTGTTCTCCATGTCGTACGGTTCCATGTAGTGTTCCAGGGTAGCCAGGAACTCGTCGATGGCAGGCGGGTTTTCTGCCAGGCTGATGAAGATATAGCCACCGGCCACCTTCACATGCACCGGCTTCAGGCCGTACTCCTTCATGTCGAAGTCGGCGCCCATTTCGGTGCCGGCGAACAGCAGGCGGCCGTCCAGTTCGTAGGTCCACTGGTGGTAATGGCAGACCAGCTTGGCCACCTTGCCTTTGTCGCTGACGCACAAACGCGAACCACGGTGGCGGCAGACGTTATGGAAGGCGTGCACCTTGCCTTCGGCACCACGCACCACCAGGATCGGGTTTTTGCCGATCTGCAAAGTGATGTAGTTACCTTTCGCCGGAATTTCGCAAGTCATGCCAGCGATCAACCATTCTTTCTGGAAGATCTCCTGCATGTCGATCTGGAACAGACGCTCGTCGGTGTAGAAAGGCTGGGGCAGCGAGTAGGTGCGCTCGCGGGTCTGCAGCATCTCGGCGGTAGCCTTGCGTGCAGGTTCAAGTGGATCGCCCAGGCTCAGGGTTGCGGTGACGTCCATCGTGTATTCCTCGGGGCCGTGTGCGGCCGGCAAAAGGTGGCTAATCGTTGTTGTTGCCGCAAGGCGGTTATCAGCGGCTTTTTGTTTAAACAGCAGGCTTGTTTATTTGCCGTGGAGTGTGCGTCCGAAGACGCCGCGAACCGTATCCATGGGCGACATGGCCGAATTGAATTACGACGCGCCAGCCCTTGTAGCGCGGGGCTGGTCGCGATAAGCACGCCGATGTCGCTGGCAGGAATGTACGTCGCTCTCAGCTTGCGCATTATCCAAGCCATAAAAAGCCCGATAGTCGGCTGCTGGAGATGAACATGTCCGATACCTTCCTCAATCCGGTCACCACCCAGACCTGGGCCAACGGCCGCCACATCGTGCGCTGCGTCAAGGTCATCCAGGAGACCTGGGACGTGCGCACGTTCTGCTTCATGGCCGATCAGCCGATCATGTTCTTCTTCAAGCCCGGGCAGTTCGTCACCCTGGAGCTGGAGATCGAAGGCAAGCCGGTGATGCGCTCCTACACCATCTCCAGTTCGCCGTCGGTGCCGTACAGCTTCTCGATCACCGTCAAGCGCGT
GCCGGGCGGCCTGGTGTCAAACTTCCTGCACGACACCATGCACGAAGGCGCCGAGCTGCCGGTGCATGGCCCGGTGGGGCTGTTCAACGCCATCGATTTCCCGGCGGGCAAGGCGCTGTACCTGTCGGGCGGTGTGGGCATTACCCCGGTGATGTCGATGGCGCGCTGGTTCTACGACACCAACGCCAATGTCGACATGGTGTTCGTGCACAGCGCCCGTTCGCCGAAAGACATCATCTACCACCGCGAGCTGGAACAGATGGCTTCGCGCATCCCCAACTTCAGCCTGCACATCATTTGCGAAAAGCATGGCCTGGGCGAGCCATGGGCGGGGTACCGCGGTTACCTGAACCAGCGGCTGATGGAGCTGATTGCTCCTGACTACATGGAGCGCGTGGTGTTCTGCTGCGGCCCGACGCCTTACATGACGGCAGTCAAGCGCATGCTCGAAGCGGCCGGCTTCGACATGAAGAACTACCACGAGGAGTCGTTCGGCGCGACGCCACCAGAAGCCAAGGCCGATGCAGTGGAGCACGCAGAGCAGGCGGCCGATGCACCGGAGCTGGATATTTCCGACCTCAACCTGGTGGAGTTCATCGGCAGCGACAAGAGCATCCGCGTTGCCCCGGGCGAGACCGTGCATGCTGCGGCAGCCAAAGTTGGCCTGATGATCCCGAAAGCTTGCGGCATGGGCATCTGCGGCACCTGCAAGGTGCTCAAACTGGGCGGCGAAGTAGAGATGGAGCACAACGGCGGCATTACCGAAGAGGATGAAGCCGAGGGCTACATCCTGTCGTGCTGCAGTGTGCCGAAAGGGGATGTGCGGATCGATTACTGATCCACTGATCCGAGATTGCCGGGGGCTGCTTCGCAGCCCATCGCCGGCAAGCCAGCTCCCACAGGTACTGCACACGCGGTCGATGTGGGAGCTGGCTTGCCGGCGATGGGCCGCAAAGCGGCCCCGACTTTTTCAGGTGCGGAAACGGGCTACCAGACCATTAAGGTCAACCGCCAGGCGCGACAGCTCTGCACTCGCCGCACTGGTCTGATGGGCACCGGTAGCGCTCTGCACCGACAGGTCATTGATGTTCACCAGGTTGCGGTCCACTTCCCGCGCCACCTGCGCCTGCTCTTCCGCCGCGCTGGCGATTACCAGGTTACGTTCGTTGATTTGCGCGACCGCACCGGCAATGGTGTCCAGCGCCATGCCGGCCCCTTTGGCGATGTTCAGCGTCGACTCGGCGCGCTCGGTGCTGGTGCGCATCGATTCCACAGCCTCTTCCGTACCGCCTTGAATGCTGCCGATCATCCGCTCGATTTCGCTGGTCGACTGCTGGGTGCGGTGCGCCAGTGCGCGCACCTCATCCGCAACCACGGCAAAACCACGGCCCGCTTCGCCGGCACGCGCCGCCTCGATGGCAGCGTTGAGCGCCAGCAGGTTGGTCTGGTCGGCCAGGCCACGGATCACGTCGAGTACCTTGCCAATGTCGCGTGACTGTTCAGCCAGGTGAGTGATCAGTTTGGCAGTGGCCTGCACATCACCGCTCATGCGCTCGATGGCGCCCACAGTCTCCATCACCAGGTCGCGACCGTCACCGGTGGAGCGGCTGGCTTCACTGGACGCTTCCGAGGTGCTCACGGCGTTGCGCGCCACTTCTTCCACAGCGCTGGTCATTTCGGTGACGGCGGTGGCGGCCTGTTCAATCTCGTTGTTCTGCTGTTGCAGGCCACGGGCACTTTCGTCGGTAACGGCATTCAGCTCTTCGGCGGCTGAGGCCAGCTGGGTGGCAGAGCCTGCTATCTGCTGCAGGGTGTCGCGCAGCTTGTCCTGCATGCGGGCCATGGCGCGCAGCAAACGCGCGGCTTCATCGGTACCCTCGGCGCGGATGACATGTGTCAGGTCACCGTCGGCGACTTGCTCTGCGCACTTGAGCGCCTCATCGATCGGCTTGACGATGCTGCGGGTCAGCAGGAAGGCGCAGGCGAAGGTCAGCACGGTG
GCGGCAACCAGCAGGCCGATAACCAAGGCAAAGGCCGCAGCGTATTGGCTGGCGGCTTTCTCGTTAGTGGCGCGGGTCTGGTCGGTGTTGATGCGCACCAGCGTGTCCATGACCTTGTTGATCTGTTCGGAGTTGGCCAACAGGTCGCGGTTGAGCAGGTCGCGCAGTTCGTCGACGCGGTCGGCCTGGCTCAGGCTGCGCATGCGCGATTCCAGTTGGCGGTACTGTTCAAGCAACTGGCCGTACTGGTCGAACGCGGCCTGTTCGTCGGCGGCGCCGATCATTGGCAGGTAGGCCTGGCGTGCGCGGTCGATCTGGCTGTTGCGCTGGGCCAGCAGGTTGAGGGTATCGCTTTGGATTGCAGGCTCGCGGTTGGTCAGCAGGCGGTAGGAGAGGGTGCGCATGCGCAGGTTAAGCGCGGTGATTTCGTCGAGGATCTTGATACTGGGCACACTGACTTGCTCGATGGCCACGCCCGCCTGACGGATGTTGCCCATCTGCACCAGCGAGAAGACCCCGAGGCCAAGCATCAGCAGGCCGATAAGCGCAAAGCCGAGCAGCGCGCGCGGGGCGATATTCATGTTACGTAGAGACATGGAGAGGAGGATCCAGGCAAGGGATAAAGAGTGCGCAAAGTGCGACAAAACATGACTTGCCGGGCTATCGGTCGTGACTGGCCAGTCTTGAGTGCAACCCGCAAAAATTGTGAGGTAGCTAACCTTTTCCTGACCGGCGGTTGACCCAGGTCGGAACAAGGGGGTGATTACCCTGTCAATCTGCGGGCCAGACACCTTGTGAATCCGATTTTTAATGGCGATGGGCCGCGCTTTTCTTTATCGTGTGCGCCCCTCGCAACAACCTGAGATAGACCCATGCTGGAAGCTTCCCTGAATCAAATCGAGCAACTGGTCAGCGACCTGATGCAGAAAAACGCTCAACTCACCGAGCAGAACACCACCCTGGGCACGCAACTGGCCCAGGCCAAGGAAGAGAACGAAACCCTGCAGCTGTCGCTGATGGAGCAGGAAGAGAAGAACGGCAGCACCGCCGCTCGCCTGCAGGCCCTGGTTGATCGCGCCAGCGCTGGCGTTGTTGGCGCATGAGATTGCAAGAGCAGCCGGTCAATGTCGTGTCGATTCTCGGCATCGACTATTCGATCAAGGCGCCCGAAGGCCAGGAAGAAACCCTGGCCCAGGCGGTGCGGATGCTCAACACCGCCCTGAACGAAACCAAGCGTCAGTACCCGACCCTGATCGGTGACAAATTGCTGGTGCTGGCTGCCCTTAACCTGTGCTCCAAACAGGTTGAACTGCAGAAAGAACACCAGCAGACCCTTGCGCGTACTCAGGCGCAGATCGACGCCACGGTGGACGCCATCGTGCGGACTATTGCAGAGTCCTGATCGATAGCTGAAAGGCCCAAGGCCGCTCCTCCAAGGGTTATGCCTTCCCCTGTAGGAGCAGCCTTGTGCTGCGAATGGGCTGCGCAGCAGCCCCAAAGCCCCAAAGCCCAACTCATTGTTCCAGGTCACTGGATTAACTGGATACGCAAGTGTATACACTTCTCGCAAAACAATAATGTGCGGGGGAGTAGGGGCATGCGTATCTGGCGTAAAAGCATCCAGTGGCAGTTGATCACCAGCATGGGCGCCGCCCTGCTGGCCAGCATCCTGGTGGTGGTCATCATCTTCACCGTGGCGCTCAACCGCCTCACCGACCGCTACCTGGTCGACACTGCCCTGCCCGCCAGTGTCGAGGCGATCCGCAACGACATCGAACGCATGCTCGGCCAGCCACTGGTGGCTGCGGCGGACATTGCCGGCAACACCCTGCTGCGCGACTGGCTGGCGGCCGGCGAAGACCCCGCCCAGGCTGCGGCATTCATCGAATACCTGGGCGCCGCCAAACAGCGTAACCGGGCCTTCACCACCTTGTTTGCCTCGACCGAAACCGGCCACTACTACAACGAGAATGGCCTGGACCGCACCCTCAGCCGCAGCA
ACCCCAAGGACAAATGGTTCTACGGCTACATCGACAGCGGCGCCGAACGGTTTATCAATATCGACATTGACGGTGCCACTGGCGAGCTGGCGCTGTTTATCGACTATCGCGTGGAAAAGGCCGGCAAGCTGGTGGGCGTGGCCGGCATGGGCCTGCGCATGACCGAGCTGTCGCAGCTGATTCACGACTTCAGCTTTGGCGAGCATGGCAAGGTCTTCCTGGTACGTAACGACGGCCTGATCCAGGTGCACCCGGACGCTGCTTTCAGCGGCAAGCGCCAGCTCGCCGAGCAGCTCGGCACGGATGCTGCCAAGGCCGTGATGACCGGCGGCGAAGGCCTGCGCAACAGCCGTTTCAGCCGTGACGGCGAGCGCTACCTGGCGCTGGGCCTGCCCCTGCGTGACCTCAACTGGACGCTGGTGGCCGAGGTGCCGGAGTCGGAAATCTACGCACAAATGCATCAGGCGGTATGGCTGACCAGCCTGATCGGTGGCGCCGTGGCGCTTGTGTCGCTGCTGCTGGTGGTGCTGCTGGCGCGAGGCCTGGTGCGCCCTGTGCGCCGCGTTACCGCTGCGCTGGTGCAGATTGGCAGTGGCGCAGGCGACCTTAGCCACCGCCTGGATGATTCGCGCCAGGACGAACTGGGTGACCTGGCCCGGGGCTTCAACCGCTTTCTCGACAGCCAGCGCAGCCTGATTGGCGAGGTGTTGCAGACTTCCGAACGGCTGCATCGGGCGGTTGAGCAGGTGACCCAGGTGGTGGAAAACACGGCCGAGCGCTCCGGGCGACAGCAGGAAATGACCGAAATGGTTGCCACTGCCGTACATGAAATGGGCCTGACCGTGCAGGACATTGCCCGCAATGCCGGTGATGCAGCTCAGGCCTCGCAGTCGGCACGGGATGAGGCCTTGCAGGCGCGCGAGGTGGTACGGCGTTCAATTCAGGGCATCGAGGGCATGTCGGGCGACATCGGCAAGGCAGCCGATGCAGTCAGCCAGTTGGCCAACGAAGTCGCCTCTATCGATGAAGTGCTGGCGGTTATCCGCAGTATTTCCGAGCAAACCAATTTGCTGGCGCTGAACGCCGCTATCGAGGCTGCGCGGGCAGGGGAAATGGGCCGCGGATTTGCTGTGGTGGCCGACGAAGTGCGCACGCTGGCGCGGCGCACGCAGCTATCCACCGACGAAGTGCAACAGATGATCCAGCGCCTGAAGCAGGGCGCGGGTTCGGCGGTGAGCTCGATGCAGGCGGGGCAGCAGGCGACCGGCAGTGGGGTGGAATCGAGCCAGCGTACCGGGGCGTCGCTGGGGGCGATTACCGACCAGGTGGAACACATCAGCGACATGAACCATCAGGTGGCCACGGCGACCGAAGAGCAGTCGGCGGTGACTGAGGAAATCAACCGCACGGTGCAGGGCATTTCCGACCTGGCGCGGGAGACGGCGGCAGAGGTGCAGGGCTGCCGTGAGGAGTGCCAGGCGTTGCGTGGCTTGGCTGATGACCTGGCTCGGCAGATGGGCGGGTTCAGGCTTTAGATCGGTGTGGGCTTCATCGCCGGCAAACCAGCTCCTACAGGATCACCGCAGGCCTGAGCCTTGTGTGGTCCATGTGGGAGCTGGCTTGCCGGCGATTGGGGCCTGTCAGGCGGTGATGATGCTGCGGATATCCGCCGCCAGCTCGCGCACCCGCTCTTCCTCGGTATCCCAGGAACACATGAACCGCGCCCCGCCGCTGCCGATAAAGGTATAGAACCGCCAGCCCTTGGCGCGCAGCGCTTCAATCGCGTGCTCCGGCATTTGCAGGAACACCCCGTTGGCCTCCACCGGGAACATCAGTTCCACCCCCGGCAAGTCACTCACCAGCAACGCCAGCAGCTGCGCGCAATGGTTGGCGTGGTTGCCATGGCGCAACCACGCGCCATCTTCCAGCAGGCCCACCCACGGCGCGGACAGGAAGCGCATTTTCGACGCCAGTTGCCCGGCCTGCTTGCAGCGGTAGTC
GAAGTCCTCGGCCAGTTGGCGATTGAAGAACAGGATCGCCTCGCCCACCGCCATGCCGTTCTTGGTGCCGCCAAAGCACAGCACGTCGACGCCGGCCTTCCAGGTCAGCTCAGCCGGGCTGCAGCCCAGGAACGCGCAGGCATTGGTAAAGCGCGCGCCGTCCATGTGCAGGTTCAGGCCCAGCTCCTTGCAGGTGGCGCTGATGGCCTTGAGCTCATCGGGGCGATATACGGTGCCCACTTCGGTGGCCTGGGTGATGGTCACCACGCGTGGCTTGGGGTAGTGGATGTCCTGGCGCTTCAGTGCCACTTCGCGGATCGACTGCGGGGTCAGCTTGCCGTTGACGCTGGCCGCCGTCAGCAGCTTGGAGCCGTTGGAGAAAAACTCCGGCGCGCCGCATTCGTCGGTTTCGACGTGGGCAGTCTCGGAACAGATCACGCTGTGGTAGCTCTGGCACAGCGAGGCCAGGGCCAGGGAGTTGGCTGCGGTACCGTTGAAGGCGAAAAACACCTCGCAGTCGGTTTCGAACAGGTTGCGGAAGTATTCCGAGGCGCGCTCGGTCCACTGATCGTCGCCATAGGCGCGGTCGTGGCCGTGGTTGGCCTTCTCCATCGCCGCCCAGGCTTCGGGGCAGATACCGGAATAGTTGTCGCTGGCGAATTGTTGGCTCTTATCTGTCATGACACGGTCCTGTGAACGACGCAGAACAGCACTCTAAACCATCGATTCAGCACTTGCCTATACAACCGGTACAGCGATTTCTGTTCATTCTGGGCCGGCCTATTCGCGGGCACGCCCGCTCCCACAATGGTCCTGCACACGCCAATCACTGTGGGAGCGGGCGCGCCCGCGAAGGGGCCAGCATGGGCGGCACAAAGAGAATGTCGTAAACGCGCCATTGCAAGGCGTGCGCAGGCATCTGACCCGCCCCCGCCAGTCATACCATCGCCACAAAGGGCCACAGTGCCCCACGACAAAAAACGATCGCTGCCGGGAGATACACGATGTTCAGCAAGCAAGACCAGATCCAGGGTTACGACGACGCACTGCTGGCGGCGATGAATGCCGAAGAACAGCGCCAGGAAGATCACATCGAGCTGATCGCCTCGGAGAACTACACCAGCCAGCGCGTCATGCAGGCCCAAGGCAGCGGCCTCACCAACAAATACGCCGAAGGCTACCCGGGCAAGCGCTACTACGGTGGCTGCGAACACGTGGACAAAGTAGAGGCCCTGGCCATCGAGCGCGCCAAGCAGCTGTTCGGTGCCGACTACGCCAACGTCCAGCCGCACTCCGGCTCGTCGGCCAATGGCGCCGTCTACCTGGCCCTGCTGCAAGCCGGTGACACCATCCTCGGCATGAGCCTGGCCCACGGCGGTCACCTGACCCACGGCGCAAAAGTGTCGTCCTCGGGCAAGCTGTACAACGCCGTGCAGTACGGCATCAACACCGACACCGGCCTGATCGACTACGACGAAGTCGAGCGCCTGGCGGTCGAGCACAAGCCGAAAATGATCGTTGCCGGTTTCTCGGCCTACTCCAAGACCCTCGACTTCCCACGCTTCCGCGCCATTGCCGACAAGGTCGGTGCGCTGCTGTTCGTCGACATGGCCCACGTAGCCGGCCTGGTTGCCGCTGGCCTGTACCCGAACCCGATCCCGTTCGCCGATGTGGTGACCACCACCACCCACAAGACCCTGCGCGGCCCTCGTGGCGGCCTGATCCTGGCCAAGTCGAACGAAGAGATCGAGAAGAAGCTGAACGCCGCTGTATTCCCGGGCGCCCAAGGCGGCCCGCTGATGCACGTGATTGCCGCCAAGGCCGTGTGCTTCAAGGAAGCGCTGGAGCCTGGCTTCAAGGCCTACCAGCAGCAAGTGATCGAAAACGCCCAGGCCATGGCCCAGGTGTTCATCGACCGCGGCTACGACGTGGTGTCCGGTGGCACCGACAACCACCTGTTCCTGGTCAGCCTGATCCGCCAGGGCCTCACCGGCAAAGAT
GCCGACGCCGCCCTGGGCCGCGCGCACATCACCGTCAACAAGAACGCCGTGCCGAACGACCCGCAGTCGCCGTTCGTCACCTCGGGCCTGCGCATCGGCACCCCGGCCGTCACCACCCGCGGCTTCAAGGTTGCGCAGTGCGTGGCCCTGGCCGGCTGGATCTGCGACATCCTCGACAACCTCGGTGACGCAGACGTCGAAGCCGATGTGGCGAAGAACGTCGCGGCGCTGTGCGCAGACTTCCCTGTTTACCGCTGAGTGGAGTAAACGACCATGCAACGCTACTCGGGCTTCGGCCTCTTCAAACACTCCCTCAGCCACCACGAAAACTGGCAGCGCATGTGGCGCACGCCAACCCCTAAAAAGGTTTACGACGTGGTTATCGTCGGCGGTGGCGGCCATGGCCTGGCCACGGCCTACTACCTGGCCAAAGAGCACGGCATCACCAACGTTGCCGTGATCGAGAAAGGTTACCTGGGCGGCGGCAACACCGCCCGTAACACCACCATTGTGCGTTCCAACTACTTGTGGGACGAGTCGGCGCAGCTGTACGAGCACGCCATGAAGCTGTGGGAAGGCTTGTCCCAGGACATCAACTACAACGTCATGTTCTCCCAGCGCGGCGTGTACAACCTGTGCCACACCCTGCAGGACATTCGTGACTCCGAGCGCCGCGTCAGCGCCAACCGCCTCAACGGTGTCGATGGCGAGCTGCTGAACACCGCCCAGGTCGCGGCCGAAATCCCGTACCTGGACTGCTCGAAGAACACCCGTTACCCGATCCTTGGCGCAACCGTTCAGCGCCGTGGTGGCGTGGCCCGCCACGACGCCGTGGCCTGGGGCTATGCCCGCGCTGCCGACGCCCTGGGCGTGGACCTGATCCAGCAGACCGAAGTGATCGGCTTCCGCAAGGAAAACGGCGCGGTCATCGGCGTGGAAACCAACAAAGGCTTCATCGGCGCCAAACGCGTCGGCGTGGTCACCGCGGGTAACTCCGGGCACATGGCCAAGCTGGCCGGTTTCCGCCTGCCGCTGGAATCGCACCCGCTGCAAGCGCTGGTATCCGAGCCGATCAAGCCGATCATCGACAGCGTGATCATGTCCAACGCCGTGCACGGCTACATCAGCCAGTCCGACAAGGGCGACCTGGTAATCGGTGCCGGTATCGACGGCTGGGTCGGCTACGGCCAGCGCGGTTCGTACCCGGTGATCGAGCACACCCTGCAGGCCATCGTGGAGATGTTCCCCAACCTCTCTCGCGTGCGCATGAACCGCCAGTGGGGCGGCATCGTCGACACCTCGCCGGACGCCTGCCCGATCATCACCAAGACCCCGGTCAAGAACATGTTCTTCAACTGCGGTTGGGGCACTGGCGGCTTCAAGGCGACCCCGGGTTCGGGCAACGTCTTCGCCGCGAGCCTGGCCAAGGGCGAAATGCACCCACTGGCCGCGCCGTTCTCCATGGACCGTTTCTACAACGGCGCACTGATCGACGAACACGGCGCCGCCGCCGTCGCCCACTAACCGGAGACACCGTCATGTTGCATATTTTCTGTCCCCACTGCGGCGAGCTGCGCTCCGAAGAAGAGTTCCACGCCTCTGGCCAGGCGCACATCGCCCGCCCGCTGGACCCTGCCGCCTGCTCCGACGAGGAGTGGGGTACCTACATGTTCTACCGTGATAACCCACGCGGTATTCACCATGAACTGTGGGACCACGTTGCCGGTTGCCGCCAGTACTTCAACGTCACCCGCGACACCGTGACCTACGAAATTCTGGAAACCTACAAGATTGGCGAAAAGCCGCAAGTGACCGCCAACGGTAAAGCTGCGAACGCACCGTCGACCGTCAAAGGCCAAGGGGAAAAAGTATGAGCCAGACCTATCGCCTCGCCAGCGGCGGCCGCATCGACCGCAGCAAGGTCCTGAACTTCACTTTCAACGGCAAGACCTACCAGGGTTATGCCGGTGACAGCCTGGCCGCCGCACTGCTGGCCAACG
GCGTAGACATTGTCGGCCGCAGCTTCAAGTATTCGCGCCCACGCGGCATCATCGCTGCCGGTACCGAAGAGCCGAACGCCATCCTGCAGATCGGTTCCAGCGAAGCCACCCAGATCCCCAACGTGCGCGCTACTCAACAAGCGCTGTACGCAGGCCTTGTCGCCACCAGCACCAACGGCTGGCCGAACGTCAACAATGACGTGATGGGCATCCTCGGCAAGGTGGGCGGCAGCATGATGCCGCCGGGCTTCTACTACAAAACCTTCATGTACCCGAAATCGTTCTGGATGACGTACGAAAAGTACATCCGTAAAGCGGCAGGCCTTGGCCGTGCACCGCTGCAGAACGACCCGGATAGCTACGACTACATGAACCAGCACTGCGACGTGCTGATCGTCGGCGCCGGCCCTGCTGGCCTGGCCGCTGCACTGGCTGCTGCGCGCAGCGGCGCCCGTGTGATCCTGGCTGACGAGCAGGAAGAGTTTGGCGGCAGCCTGCTCGACAGCCGCGAAACCCTCGACGGCAAGCCTGCCGCCGACTGGGTCAACGCCGTGATCAAAGAGCTGGAAGGCCTGCCGGAAGTGACCCTGCTGCCACGTGCCACGGTCAACGGCTACCACGACCACAACTTCCTGACCATTCACGAGCGCCTTACCGACCACCTCGGCGACCGCGCCCCGATCGGCCAGGTACGCCACCGTGTGCACCGTGTACGCGCCAAGCGCGTGGTACTGGCGCCCGGCGCCCACGAGCGCCCGCTGGTGTACGGCAACAACGACGTACCGGGCAACATGCTGGCTGGTGCTGTGTCCACCTACGTTCGCCGCTACGGCGTGGCACCGGGCCGCAAGCTGGTGCTGTCGACCAACAACGACCACGCCTACCGCGCTGCGCTTGACTGGCACGACGCTGGCCTGCAAGTGGTCGCCATCGCCGACGCCCGCCACAACCCACGTGGCTCGCTGGTTGAAGAAGCGCGTGCCAAAGGCATCCGCATCCTCACCTCCAGCGCCGTGGTCGAGGCCAAAGGCAGCAAGCATGTCACCGGCGCCCGCGTGGCGGCTATTGATGTGCAGGCGCACAAAGTCACCAGCCCAGGCGAAACCCTTGAGTGCGACCTGATCGCAACCTCCGGTGGCTACAGCCCGATCGTGCACCTGGCTTCGCACCTGGGCGGTCGCCCAGTGTGGCGTGACGACATCCTCGGCTTCGTACCGGGCGATGCGCCGCAGAAACGCGAGTGCGTGGGCGGTATCAATGGCGTCTACGCCCTCGGTGACGTGATTGCCGATGGCTTCGAAGGCGGCGTTCGCGCAGCCACCGAGGCGGGCTTCAAGGCCTCTGTCGGCACCCTGCCAAAAACCGTGGCGCGCAAGGAAGAGGCCACCGTGGCACTGTTCCAGGTGCCGCACGACAAAGGCAGCAAGGGGCCGAAGCAGTTCGTCGACCAGCAGAACGACGTGACCGCCGCAGGTATCGAGCTGGCCACCCGTGAAGGCTTCGAGTCGGTCGAGCACGTAAAACGCTACACCGCGCTGGGTTTCGGTACCGATCAGGGCAAACTGGGCAACATCAACGGCCTGGCCATCGCCGCCCGTTCGATCGGCATCACCATCCCGGAAATGGGCACCACCATGTTCCGCCCCAACTACACGCCGGTGACGTTCGGCGCGGTAGCGGGCCGTCACTGTGGCCACCTGTTCGAGCCCGTGCGCTTCACCGCCCTGCATGCCTGGCACGTGAAGAACGGCGCCGAGTTCGAAGACGTCGGCCAGTGGAAGCGCCCTTGGTACTTCCCCAAAGCCGGTGAAGACATCCATGCCGCCGTGGCCCGCGAGTGCAAGGCCGTGCGCGACAGCGTGGGCCTGCTGGACGCCTCGACCCTGGGCAAGATCGACATCCAGGGCCCGGACGCCCGTGAGTTCCTCAACCGCATCTATACCAATGCCTGGACCAAGCTGGACGTGGGCAAGGCCCGCTACGGCCTGATGTGCAAGGAA
GACGGCATGGTCTTCGACGACGGCGTAACCGCCTGCGTTGGCGACAACCACTTCATCATGACCACCACCACCGGTGGCGCCGCCCGTGTGCTGCAGTGGATGGAGCTGTACCACCAGACCGAATGGCCGGAGCTGAAGGTGTACTTCACCTCGGTTACCGACCACTGGGCCACCATGACCTTGTCCGGCCCCAACAGCCGCAAGCTGCTCAGCGAGCTGACCGACATCGACATGGACAAGGAAGCCTTCCCGTTCATGACCTGGAAGGAAGGCAACGTCGGCGGCGTGCCGGCCCGTGTGTTCCGTATCTCGTTCACCGGTGAGCTGTCGTACGAAGTCAACGTGCAGGCCAACTACGCCATGGGCGTGCTGGAACAGATCATCGAGGCGGGCAAGAAGTACAACCTGACCCCGTACGGCACCGAGACCATGCACGTACTGCGTGCCGAGAAGGGCTTCATCATCGTGGGCCAGGACACCGACGGTTCGATGAACCCGGACGACCTGAACATGAGCTGGTGTGTGGGCCGCAACAAACCATATTCGTGGATCGGCCTGCGTGGCATGAACCGCGAAGACTGCGTGCGCGAGAACCGCAAGCAGCTGGTAGGCCTGAAGCCGGTCGACCCGACCAAGTGGCTGCCGGAAGGCGCCCAGCTGGTGTTCGACCCCAAACAGCCGATCCCGATGGACATGGTTGGCCACGTTACCTCCAGCTACGCGTCCAACTCCCTGGGCTACTCGTTCGCCATGGGGGTGGTCAAAGGCGGCCTCAAGCGCATGGGCGAGCGTGTCTACTCGCCGCAGGCAGATGGCAGCGTGATCGAGGCGGAAATCGTGTCTTCGGTGTTCTTCGATCCGAAGGGTGAGCGGCAGAACGTTTGACTCCGGGCCCTGTGGGGGGCAGGCCAGGCCTATCGCCGGCAAGCCAGCTCCCACAGGATCGATGCAGCTTTGAAGGTGGGTGAAATCCCTGTGGGAGCTGGCTTGCCGGCGATAGGGCCGGTAGCCACCACACAGAAGCTGCGTCGCATCGACGAACAAGAATTCAAGGCAGGTAAGAAATGAGCGCTATCAACGTCTTCCAGCAAAACCCCGGCGCCGAGGCCAAGGCCCAGTCGCCACTGCACCACGCCGACCTGGCCAGCCTGGTTGGCAAAGGCCGCAAGAACGCAGGCGTGACCCTGCGTGAACGCAAGTTCCTTGGCCACCTGACCCTGCGTGGCGACGGCCACAACCCGGAATTCGCCGCCGGCGTGCACAAGGCCCTGGGCCTGGAGCTGCCAGTGGCCCTGACCGTGGTCGCCAACAACGACATGTCGCTGCAATGGGTTGGCCCCGACGAGTGGCTGCTGATCGTGCCCGGTGGCCAGGAACTGGCGGTCGAGCAAAAGCTGCGCGCGGCCCTTGATGGCCAGCACATCCAGGTGGTCAACGTCAGCGGCGGGCAAAGCCTGCTGGAACTGCGCGGCCCGAACGTGCGCGAAGTGCTGATGAAATCCACCAGCTATGATGTTCACCCGAACAACTTCCCGGTGGGCAAGGCCGTGGGCACCGTGTTCGCCAAGTCGCAACTGGTGATCCGCCGTACCGCCGAAGACACCTGGGAGCTGGTGATTCGCCGCAGCTTCGCCGACTACTGGTGGCTGTGGCTGCAGGACGCTTCGGCCGAATACGGCCTGAGCATCGAAGCTTAAGGAGAGCAGAACATGAGTCGGGCACCGGATACCTGGATTCTCACCGCCGACTGCCCGAGCATGCTCGGCACCGTCGACGTGGTGACGCGTTACCTCTTCGAGCAGCGCTGCTACGTGACGGAGCACCACTCCTTCGATGACCGGCAGTCGGGGCGTTTCTTCATTCGCGTCGAATTCCGCCAGCCGGATGATTTCGACGAAACGGGCTTCCGTGCCGGCCTGGCCGAGCGCAGCGAAGCGTTCGGCATGGCCTTCGAGCTGACCGCACCCAATCACCGCCCCAAGGTGGTGATCATGGTGTCC
AAGGCTGACCACTGCCTGAATGACCTGCTGTACCGGCAGCGCATTGGCCAGCTGGGCATGGACGTGGTAGCGGTGGTTTCCAACCACCCCGATCTCGAGCCCCTGGCGCACTGGCACAAGATTCCTTATTACCACTTCGCCCTTGACCCCAATGACAAGGCGGGGCAAGAGCGCAAGGTGCTGCAGGTGATCGAGGAGACAGGTGCCGAGCTGGTTATCCTCGCCCGTTACATGCAGGTGTTGTCACCCGAGCTGTGCCGGCGCCTGGATGGCTGGGCGATCAACATTCACCACTCGCTGTTGCCAGGGTTCAAAGGCGCCAAGCCTTACCACCAGGCGTACAACAAGGGCGTGAAAATGGTCGGTGCCACCGCGCACTACATCAACAACGACCTGGACGAAGGGCCGATCATTGCCCAGGGCGTCGAGGTGGTGGACCACAGCCACTATCCGGAAGACCTGATTGCCAAGGGGCGGGATATCGAATGCCTGACCCTGGCGCGGGCTGTGGGTTATCACATCGAGCGGCGGGTGTTCCTCAACGCCAACCGCACGGTCGTGCTCTGACATCCTGTACCGGCCTCTTCGCGGGACAAGCCCGCTCCCACAGGTTCATCGCCATGTTCGAAAGAGCGGTGAGCCTGTGGGAGCGGGCTTGCCCGCGAATGGCCTCACACCTTTGTACCTGCCGCTAAAAGACATCCCCCTGTCGCTTCCAGTCAGCGCACCCCGGCCTATCAGGCCCACAATTCCCTGCATTCCAGTAACACAAGCCGCCCATGGATGGCGGCGCGTTCAACCACCGCTGCATAAATAAAAATCAAGCGAGGTAAGAGCATGTCTGGCAATCGTGGAGTGGTATATCTCGGCGCCGGCAAGGTCGAAGTGCAACACATCGACTACCCGAAAATGCAGGACCCGCGTGGCAAGAAGATCGAGCACGGCGTCATCCTGAAGGTGGTCTCCACCAACATCTGCGGCTCTGACCAGCACATGGTCCGTGGCCGCACCACTGCCCAGGTCGGCCTGGTTCTGGGCCACGAAATCACCGGTGAAATTGTCGAGATCGGGCGTGACGTCGAACGCTTGAAAATCGGTGACCTGGTGTCGGTACCGTTCAACGTTGCCTGCGGCCGCTGCCGCTCCTGCAAAGAGATGCACACCGGCGTCTGCCTCACCGTCAACCCGGCCCGCGCCGGTGGTGCCTACGGCTACGTCGACATGGGCGACTGGACCGGCGGCCAGGCTGAGTACGTGCTGGTGCCATACGCCGACTTCAACCTGCTGAAACTGCCGGACCGCGACAAGGCCATGGAAAAGATCCGTGACCTGACATGCCTGTCCGACATTCTGCCTACCGGCTACCACGGTGCCGTGACGGCTGGCGTAGGCCCAGGCAGCACCGTTTACGTTGCGGGCGCTGGCCCGGTCGGCCTGGCCGCTGCTGCCTCGGCCCGCCTGCTGGGCGCGGCCTGCGTCATCGTCGGTGACCTGAACCCGGCCCGCCTGGCCCACGCCAAGTCGCAAGGCTTTGAAGTGGTCGACCTGTCCAAGGACACCCCGCTGCACGAGCAGATCATCGACATCCTCGGTGAGCCGGAAGTGGACTGCGCCGTCGACGCCGTCGGCTTCGAGGCCCGTGGCCATGGTCACGAAGGTGCCAAGCACGAGGCCCCGGCCACCGTGCTGAACTCGCTGATGCAAGTGACCCGCGTGGCCGGCAACATCGGTATCCCGGGCCTGTACGTAACCGAAGACCCAGGCGCGGTAGATGCCGCTGCCAAGATCGGCGCGCTGAGCATCCGCTTCGGCCTGGGCTGGGCGAAGTCGCACAGCTTCCACACCGGCCAGACCCCGACCATGAAGTACAACCGCCAGCTGATGCAGGCGATCATGTGGGACCGTATCAACATTGCTGAAGTGGTGGGGGTGCAGGTGATCAACCTGGATCAGGCGCCAGAAGGGTATGGCGAGTTTGATGCGGGTGTACCGAAGAAGTT
TGTGATTGACCCGCACAAGATGTGGGGCGCGGCGTGATTTAATTCACTTGGACAGAAAGCCCCTCTCCAGAGGGGCTTTTTTACGGCTGCATTACCGACTTTAAAGTCTGCCTCGCCCATTGTAGGAGCGGCCTTGTGTCGCGAAAGGGGTGCGAAGCAGCCCCCGACGGTACGAGCACTTGCACCAAGTTCCTGGGGAAGCTTCGCAGTAGATCTGCCCGCGGAACATTAACGCCTGATACGACTAGGTTCTGGAGCTGGTGGCGAAATTTCATTAATGGCAGTTGAATACTTTGGCGAGAAATTGACGGGTTGCGTTAGATCGCTGCGTAAGCGACTGCGAAGGCGCAACGATAATTCACCGAGACGCTTCAGTTCTTCCTTCATGTGCTCAGGGGCGTTGTCGGGATGTATTCTCATATATCTCAAGTCAGCATGGATTATTCTTCTACGCTCTCGAATCTTGGAAAGGCCTTGCTGTATCCGTGGAGGAAAGTTGTCTTCAAAATACTGATAGTTCGGAAGCAGTAGATTCTCCGATTCAGCTTGAGCAAAGGATGGGGGCTGTGTGCCCTGTTGTCGAGCAGGTTGTAGTTCTTGTGGGGTGAGTTCGGGTTGCAGTTGTGCTTGGGCTTGAGCTCTATCACGACTTCTGGTCCGCCCGAATAGGTTGGCGATGCCTTTAATTGGACTCAGCCAACTTTGACCTGATGGATCTAACCGATTGACTGGATCGCCACTACAGTAGGCATATGCATTAATACCACCTGCGTCAAACGGACTTAATCTGTCAGCTGAGTGAAATCGCTGCAGGACGGGATTGTAAATGCGATGACCGTTACCAAGGGCATAGAAACTTGTAAATGCATCTCGGAGCTGGCCATTGAACGCCAAGAGACGTTGCTCTGCGGGTGCATTGATTGCATAACCGTACGGCGTGTAGGCAGAGGAATGAGGTGTGCGGTCAGAATCAGGGTGCTGCTTCATCATCTTGGCCTGCTTGATGTGAATTGTTGTTACCAGTATTAGGATGTCGAACGCATAGCGCTACTGATAGAAATGACAGGTTGGTACCACTGACCTGGGCTGTTTGATAGCCTCGACAACCGGCCCGGAACAAACCGTGAAACCTTAAGTCAAAACCCTCGTTTCCCAGGCCATCCCGGCCCACGAGGTCCCTCCCGATGAAAGCATTCACCCTGGCAACCCTTATGACCCTGGCCGCAAGCCCGGTGTTTGCCTTCAACCTCAGCGATGCTGCCAACGCCGTGTCCGCCATGCAAACTCAGAAGCAGCAGGGCCAAGTACAGGCTCCCGAGGCACAGGCCAATCTGCTGAACACCTTGGGCAGCGAATTGAAGATCACCCCCGAACAGGCCGTGGGCGGAGCTGGGGCGATGTTGGGGCTGGCGCGCAACAACCTGAGCAGCGATGACTACGGCCAACTGACCAAGGCAGTGCCAGGGCTGGACCTGCTGGCCGGTGCGAATGCGTTGGGCGGGTTAAGTGGGTTCGGCGATTTGCTGGGCAAGAACAGCGAAAGCGAGTCGGCCCTGAGCAACGCGCTGGGCAACAACGTGGAAAACCGCAGCGACCTGGACAGTGCGTTCAAGGCGCTGGGGATGGATACCGGGATGATCGGGCAGTTTGCGCCACTGATTCTGCAGTATTTGGGGCAGCAGGGGATTGCTGGGTCATTGTTGCAGAACCTGAGTAGCCTGTGGACCGCGCCGGCATCTACACCAGCCCCTTCGGTGTAAAAAATAGAGGGCACTGAAACGTATTCATAGTCAGCGCTTAGGCGCGAGATCAGTGCCCTTTTCTTTAGTCTCATATTTAAATTGAGCGGGTTTTACGGATCTCCTTTCTGAGTTGATCAAATCGTTGTTTCGTATTTTCAAGAATTCCAAATCTTTCTTCATCTGCTACGTCAATCTCAATTCCATTGGACGCGCGCATTTCGATTTTTTTGCGGAGGTCAGCGAACTTATTTAGCAATG
GCTCATTTTTCATGAGCTCTCTTTCGTAATATTTTATAAGTTCTGCCCGGTATCGAAACTGGTCAAGTCTAGTCAGCTCTGCTTCATCTGTATAGTTTATGCGTAAGGGTAGATTACTGAATCGTGTTATTTTTTTGATCGCTGTGACCGGCGTGGTCAGAGTGCCCGGTATTACCTCGACCCCAACAGTAGGAGGAGCGGGTCGATCAAAAATCGATATCAGGCGTTTGAACGCTTTCTTTGGAGTTAGCCATATACTCGCCCCTCGCATATGGCCTGTGGGGTCTTGATAATTGACCGGGTCACCTTCGCAAAAAGCGTAAGCGTTGAGCCCGCCCTGCTCAAACGGGCTGAGGTGGTCCGGAGCATGGAAGCGCATCAGAACAGGACTATAGCTGCGGTAGCCTTGGCCCAACGCATAGGCGCGGGCGGCAAGATCATATCGCTCACCATTAAAGCCAAGCACAGCGTTCTGGTTTGTGTAGTTATTACTCTGCCCATATGGGCAGTACGTAAACATTGCTACTGACACAGGGCTGGCCTCGCAAGACGGTCTGCCTCAGTCTTGATGTTTGCAAGGCCATGCACAACTAGCAGAATTATCAGTACCCGTCTTTGAGGTCAGCCGGCTTCGGAGAAGTCCACCATACGCACTGGGCGGCGGAAACCGGCAGTCAGCGCCGCCAGATACGCCAACCCCAGCGCAAACCAGCACAACCCGATCACCAGCGTCAGCGCCGACAGGCTCGTCCACAGCCACAGTGTCAGCCCCAATCCCACCAGTGGCACCACGCCATAGCTCAGCAGCCCCTTGGCATCCCGCTGGCTTGCATCGTCCATCAGGTGCGTCTTCACCACCGCCAGGTTCACCGCCGAGAACGCCACCAGCGCGCCGAAGCTGATCAGCGAGGCGAGGGTAGCCAGGTCGATCACCAGCGCCAGCAGCGAGAAGGCTGACACCAGCAAGATAGCGAACACTGGCGTACCAAAGCGCGGCGACAGGTAGCCGAAGCTACGGCGTGGCAGCACGTTGTCGCGGCCCATGGTGAACAGGATGCGCGACACCGCCGCCTGCGAGGCCAGCGCCGAACCCAGGCTGCCTGCAACATAAGCGGCGGTGAAGAAGTTGGTAAGGAACTGCCCGCCAGCCTTGAACATCACTTCGTTGGCCGCCGCATCGGCATTGGCGAAGCTGCTGCCCGGCAGCACCAGTTGGCTGACGTAGGCCAGCAGGGTGAACAGCAAGCCGGCAAACAGGGTGGTGAGGATGATCGCCCGTGGCACGTCGCGGCGTGCGTCGCGGCATTCTTCGGCCAGGGTCGATACGGCGTCGAAGCCGAGGAACGACAGGCACAGCACCGCTGCGCCAGCCATCAAGTGACCGAAGCCTGGTTTGCTGCCGTCACCCAGCAGCGGCGACAGCAGATCGAGCGGCTGGCCGGCGAGTGTCTGGCAGGACAGCGCGACAAACACGCCGATAAATACGATCTGCGCGCCGACGATCAGGTTGCTGGTCTTGGCCACCGAGTTGATGCCGACCACGTTCAGCACTGTCACCAGGGCGATGCAGGCGAGTACGAAGGCCCAGGCGGGCACTGCCGGGAACGCGATGTTGAGGAACAGGCCGATCAGCAGGTAGTTGATCATCGGCAGGAACAGGTAGTCGAGCAGCAGCGACCAGCCGGCAAGGAAGCCGATATTGGGGCCGAAGGCCATGTTGGTGTAGGAATACGCCGAGCCCGCCACCGGGAAGCGTTTGACCATGAAGCTGTAGGACGCAGCGGTGAACAGCATGGCCACCAGGGTGACCAGGTAGGCGCCGGCGGTGCGGCCACCGGTGAGTTCGGTGACGATGCCGTAGGTGGTGAAGATGGTCAGCGGGACCATGTAGACCAGGCCGAAGAACACCAGGGCGGGGAGGCCGAGAACACGGCGAAGCTGGGCGGGGGTAGAGCTAGGTTGGTTGGCCATTGATCGATCCAGGCTTTTTGTAATTGTGC
TTTGATCGATTTTTATCCACTTAGGTGGGGTGTGACAAAGAAGTAATCCTAATGTCGATTATTAGCCAGGGTTTTAATCGGGCTTTACAGTGGCTTCACCGGCCTTATCGCCGGCAAGCCAGCTCCCACAGAATGCCCGCATTGGCCAATGCTTGTGCAGTCCCTGTGGGAGCTGGCTTGCCGGCGATAGGGCCAGTACAGATCAGCCCAGCTCTCCCCGCAGCTCCTCGATCCGCTGGTCCTTCGCCGCCCACAGCTGGTTCACCCAGGCCTGTACGGTCTGCCGGAACTCGGGGTCATTCTCGTAATCGCCCGCCCACAATGCCGGGTCCAGCTCCCGCACCTGAATGTCGATAATCACCCGGCTAATGCTGCCATTGAGCAATGCCCAGAACCCCGGTGCCTGGTTGCCGGGGTACACGATGGTCACGTCCAGCAGTGCATCCAGTTGTTCACCCAACGCCGCCAGCACAAAGGCCACGCCGCCAGCCTTGGGCTTGAGCAGGTAGCGATACGGTGACTGCTGTTCCTGGCGCTTGGTTTCGGTAAACCGGGTGCCTTCCAGGTAGTTGACCACGGTCACTGGCTGACGCTTGAACAGCTCGCAGGCCGCCTTGGTGATCTCCAGGTCCTTGCCCTTGAGCTCGGGGTGCTTTTCCAGAAACGCCTTGCTGTAGCGCTTCATGAACGGGTAATCCAGCCCCCACCAGGCAAGGCCCAGCAGCGGAACCCAGATCAATTCCTTCTTGAGGAAGAACTTGAAGAAAGGCGTGCGGCGGTTGAGGCTTTCGATCAACGCCGGGATATCGACCCAGGTCTGGTGGTTGCTTACCGCAAGATAGGAGGTGTCCTTACGCAGGTTCTCTACGCCGCGGATGTCCCACTCTGTAGGAATGCACAGGGCGAAGATGGCCTTGTCGATTTCCGACCAGGTTTCGGCCACCCACATCACTGCCCACGAGGCATAGTCGCGGCCACGGCCCGGCAGCACCAGCTTGAGCAGGGCAAAGACCAGCAGCGGGCAGATCAACACAACAGTATTGAGCAGCAGCAGAGTGGTGGTAAGAATGCCGGTCAGCAGGCGACGCATAAAGAGACTCTTGTTGTGGGAAATAATCGGCTGGCAATGATAAGCAGGGCAGTGGCCTACGCCAAATGTCCAATGCCCTGCGGCGGCAACAAATGTTTCACATTATGTTGATTAGGCTTGGGACAGCGAACCTATCCTGCCAGTGCAGTCTAAGCACTGACCCTTCTCAAGGAAGCCTGCTTGTGAAATCTCTGTTTGCCATGTTGTCACTGTTGGCGCTGCCAGTAATGGCCGCCGAACCTACAATCTACGGCCGGTACGAGAACATCGCCCTGCCTGAGCTGGGCGAAACCCTGAAGGCCAAGATGGACACCGGTGCCTTCACCGCGTCGCTGTCGGCCAAGGACATTGAGCTGTTCAACCGCGATGGCGACGAGTGGGTACGTTTCCGCTTGGCGACCAAGGACTCGGACGGCAAGGTGTACGAGCACAAGGTCTCGCGCATCAGCAAGATCAAGGGCCGTGCCGATGAAGAGGAAGAGGGCGATGCGCCGGAAATTTCCAAGCGCCCGGTGGTGGACCTGGAGCTGTGCCTGGGTGACGTGAAGCGCACCGTGGAAGTGAACCTGGTGGACCGCAGCAGCTTCAACTACCCGCTGCTGGTGGGGTCCAAGGCGCTGCGTGAGTTCAAGGCCGCGGTCAACCCGGCCAAGAAGTTTACCGCTGGCAAGCCCGATTGCTGATTCAGGCTGCCTGCACTGGCCTCTTCGCGGGCACGCCCGCTTGTATGGTTAGACTTGAAGGGGTGGTGGGACTAAATGCCCGATTCTGACTGTTCACAGCAGTACAGACCGTGGGAGACATCGCCCCACCCCTTCCACCAAAGCGCCGATAAAGAATGCATCGTTTGCAAACGACGATAGAAGCAAGCCAGCGCCTCTTGGTGAAACCCCTTCAAGCCAAAAAACCA
TAACGTGAGGAGGCTCCCGTGGCAATGCAGGTTGGCAAATTGATCGTCGGTGCGGATGTCGCGAAAGCTGAGTTAGTGATTCATCACGATGATCGCGATGAGATCATCAAGGTGAAAAATACCAAACCAGAAATCAAGAAATGGCTGAAGCAACAGCCTCTCAACACGGCAATTGCTGTTGAGGCGACCAATGTTTACCACCTGGACTTGGTTGAGCTGGCCCATAGCCTGGGTTTCGAGGTCTATGTCATTGATGGATTCCAACTGAGCAACTACCGCAAAAGCGTGGGTGTACGGGTAAAAACGGACCCCACTGATGCTCGGTTGTTGTCCCGTTTTTTGAGAAACGAGGGGGAAGACCTCCGCCCTTGGACTCCCCCTCCCGCCGTCTACGGCAAGCTTCAGAGCCTTCTGCGACGCCGAGCGGCCTTGGTGACTGCCCGCACGGCGATGACTCAGAGCTGGGCTAATGAAGCCCTTTTGAAAGCCGCCTTCACAACCTTTGTAAAATCGATAGACCGGCTGGATTTGTTGATCCAAAAGAAAATTAAAGAAGTGCTGCGCGAAGCAGGGCTGCACGAGCAAGTTGCTCGCTGCCAGGCGGTAGAGGGTATTGGGTTTCTCACGGCCACTGCCTTGGTAATGGCTTTTATGCGGGGCGAGTTCAAGAGCAGTGATTCGTACATTGCATTCCTGGGAATGGATCTACGAGTGATTGATTCTGGGCAGAAGAATGGACGTCGTCGCCTTACCAAGCGAGGCTGCTCAGAAATCCGTCGCCTGCTGCATAACGCGGCGATGTCAGCCAGCCGGACGGCCACTTGGAAAGGGCTCTACGAACAGCATCGCAATGCGGGTAAAGCAACAACCCAGGCGTTGGTAATCCTGGCCAGGAAGCTTGCACGAGTGGCATTCGCCCTGATGAAGAATCAGGACGAATATGTCACCAAGGGTGGGAAAGCGCCTTGCTGA
Protein sequences of DBSCAN-SWA_1 >NC_021505|371081:421979|382251_382722_-|WP_016497612.1|DBSCAN-SWA MPALITYRTTVQEDWVDYNGHLRDAFYLLIFSYATDALMERIGLDADSRGQSGNSLFTLEAHINYLHEVKLGTEVWVQTQIIGFDRKRLHVYHSLHRAGFDEALAASEQMLLHVDLAGPKSAPFGHTTVCRLNHLVEQQEGAQAPQYMGRTIKLPS >NC_021505|371081:421979|372256_373102_-|WP_016484510.1|holin|DBSCAN-SWA MLIDQKIPLGQYIASFVEWLTQNGANYFDAIAQGLEFMIHGVTSALTWFNPFVLIALFAGLAHLIQRKWGLTAFVALSFLLIFNLGYWQETMETLAQVTFATVVCVVIGVPLGILAAHKPMFYTAMRPVLDLMQTVPTFVYLIPTLTLFGLGVVPGLISTVVFAIAAPIRLTYLGICDVPQELMDAGKAFGCSRRQLLTRIELPHAMPSIAAGVTQCIMLSLSMVVIAALVGADGLGKPVVNALNTADISLGFEAGLAIVLLAIMLDRICKQPELPVRGEA >NC_021505|371081:421979|376456_377563_+|WP_016497607.1|DBSCAN-SWA MTSYTSGTPTQNRTAPQSIGFLLLDNFTLISLASAVEPLRMANQLSGRELYRWHTLTVDGGQVWASDGLQITPDAAMHSAPPIDTVIVCGGVGIQRTVTREHVTWLQAQARQSRRLGAVCTGSWALACAGLLDGFDCSVHWECLAAMQEAYPRVNMSTRLFTLDRNRFTSSGGTAPLDMMLHLISRDHGRELSAAISEMFVYERIRNEQDHQRVPLKHMLGTNQPKLQEIVALMEANLEEPIDLDELAVYVAVSRRQLERLFQKYLHCSPSRYYLKLRLIRARQLLKQTPMSIIEVASVCGFVSTPHFSKCYREYFGIPPRDERVGSNTAQQVAMMPIPQAMTLSPHSGPMAALSQARNESTFASVRL >NC_021505|371081:421979|419209_420097_-|WP_016497635.1|DBSCAN-SWA MRRLLTGILTTTLLLLNTVVLICPLLVFALLKLVLPGRGRDYASWAVMWVAETWSEIDKAIFALCIPTEWDIRGVENLRKDTSYLAVSNHQTWVDIPALIESLNRRTPFFKFFLKKELIWVPLLGLAWWGLDYPFMKRYSKAFLEKHPELKGKDLEITKAACELFKRQPVTVVNYLEGTRFTETKRQEQQSPYRYLLKPKAGGVAFVLAALGEQLDALLDVTIVYPGNQAPGFWALLNGSISRVIIDIQVRELDPALWAGDYENDPEFRQTVQAWVNQLWAAKDQRIEELRGELG >NC_021505|371081:421979|399977_401603_-|WP_016497625.1|DBSCAN-SWA 
MSLRNMNIAPRALLGFALIGLLMLGLGVFSLVQMGNIRQAGVAIEQVSVPSIKILDEITALNLRMRTLSYRLLTNREPAIQSDTLNLLAQRNSQIDRARQAYLPMIGAADEQAAFDQYGQLLEQYRQLESRMRSLSQADRVDELRDLLNRDLLANSEQINKVMDTLVRINTDQTRATNEKAASQYAAAFALVIGLLVAATVLTFACAFLLTRSIVKPIDEALKCAEQVADGDLTHVIRAEGTDEAARLLRAMARMQDKLRDTLQQIAGSATQLASAAEELNAVTDESARGLQQQNNEIEQAATAVTEMTSAVEEVARNAVSTSEASSEASRSTGDGRDLVMETVGAIERMSGDVQATAKLITHLAEQSRDIGKVLDVIRGLADQTNLLALNAAIEAARAGEAGRGFAVVADEVRALAHRTQQSTSEIERMIGSIQGGTEEAVESMRTSTERAESTLNIAKGAGMALDTIAGAVAQINERNLVIASAAEEQAQVAREVDRNLVNINDLSVQSATGAHQTSAASAELSRLAVDLNGLVARFRT >NC_021505|371081:421979|393394_394627_+|WP_016497622.1|DBSCAN-SWA MSDIIRRDPRAEWIARNRLHPLHAAMQTQQTSWMGPNGLIRKNPHAIAAGFIGPAGIKRIDRSGAQQGTGVGGRRTAAAEVKLPLHQVPAPAFYIAVVPDMVGGRLSSHDRDLLGLAHSLAGSDGAVLAVVFNEHKESNFSTAGVDRLLVVEGEAFEGYAPEQLVQGLRAVDNQFTPRHWLLPDSRTGGGELGRRLGAALGERPATRVWQVKDGQCIGRAGAGQQDLQRAVPRLILAAAECAEPVSETRHEALPVELSTSVARSLSRIEDLGSVAVDPATIAMAEAEFIISGGNGVKDWDLYHQATAALGATEGASRVAVDDGFMPRNRQVGATGTWVTARVYVAVGISGAIQHLQGIGACDKVVAINMDPGCDMIKRADLSVIGDSSAILKALIEAVDNYRSGGQRDAA >NC_021505|371081:421979|373186_374134_-|WP_016497605.1|holin|DBSCAN-SWA MKGSPSLLLVALLSTPLLAQAAEPEQCQTVRFSDVGWTDITVTTATTSVVLEALGYKTHTTMISVPVTYKSLATGKDLDVFLGNWMPTMENDIKQYRDAGTVETVRANLENAKYTLAVPQALYDKGLKDFADIPKFKKELDGKIYGIEPGNDGNRTIQSMIDKNAFGLKDAGFKIVQSSEAGMLSQVDRAQKRGEALVFLGWEPHPMNTRFKMQYLTGGDDFFGPDFGKATVLTNTRKGYAQECSNVGQLLKNLSFELKDESTMMGYVLDDKMKPEAAAKKWLKDNPGKLDTWLAGVTTVDGKPGLEAAKAKLAQ >NC_021505|371081:421979|391442_393395_+|WP_016497621.1|DBSCAN-SWA 
MLNTLLPILLFAALGLAVLGALRRVRMWRRGRASKVDLIGGLLAMPRRYLVDLHHVVERDKYMSKTHVATAGGFVLSAALAILVHGFGLQSKVLGYALLVATVIMFTGAIFVFKRRLNPPARLSKGPWMRLPKSLLVFAASFFIATLPVAGILPANTGGWVMVAVLGLGVLWGVSELFFGMTWGGPMKHAFAGALHLAWHRRAERFGGGRSTGLKPLDLEDPNAPLGVEKPADFTWNQLLGFDACVQCGKCEAMCPAFAAGQPLNPKKLIQDMVIGLAGGTDAKFAGSPYPGKPIGEHGGHPHQPIVNGLVDAETLWSCTTCRACVEECPMMIEHVDAIVDMRRHLTLEKGATPNKGAEVLDNLIATDNPGGFAPGGRMNWAADLNLKLLSEVKTTEVLFWVGDGAFDMRNQRTLRSFVKVLKASGVDFAVLGLEERDSGDVARRLGDEATFQQLAKRNIQTLAKYKFQRIVTCDPHSFHVLKNEYGALGGEYQVQHHSTYMAELIAANKLNLAQHKGGSVTYHDPCYLGRYNGEYEAPREVLKALGIEVREMQRSGFRSRCCGGGGGAPITDIPGKQRIPDMRMDDIRETEAELVAVGCPQCTAMLEGVVEPRPQIKDLAELVADVLIEEDAPSAAKPQTAKREPAEVH >NC_021505|371081:421979|394637_395408_+|WP_016484524.1|DBSCAN-SWA MSTKVISLVSIGAHPSSGRARRAEQDARAVELGLQLAGDNLQVVHAGNPQEEALRAYLGMGLDHLDVLEQPAGADVLGVLGDYLRDAGAQLVLTGSQAETGEGSGMLPFLLAEKLGWPLIVGLAEVESIENGTAQVLQALPRGQRRRLKVRLPLLATVDNAAPKPRQSAFGPARRGVLAARNVAIVEDELLADAELQPARPRPKRLKVIKAKSGADRMKAATAKASGGGGKVLKDVSPQEGAEAILKLLVEEGVLR >NC_021505|371081:421979|402604_404542_+|WP_016497627.1|DBSCAN-SWA MRIWRKSIQWQLITSMGAALLASILVVVIIFTVALNRLTDRYLVDTALPASVEAIRNDIERMLGQPLVAAADIAGNTLLRDWLAAGEDPAQAAAFIEYLGAAKQRNRAFTTLFASTETGHYYNENGLDRTLSRSNPKDKWFYGYIDSGAERFINIDIDGATGELALFIDYRVEKAGKLVGVAGMGLRMTELSQLIHDFSFGEHGKVFLVRNDGLIQVHPDAAFSGKRQLAEQLGTDAAKAVMTGGEGLRNSRFSRDGERYLALGLPLRDLNWTLVAEVPESEIYAQMHQAVWLTSLIGGAVALVSLLLVVLLARGLVRPVRRVTAALVQIGSGAGDLSHRLDDSRQDELGDLARGFNRFLDSQRSLIGEVLQTSERLHRAVEQVTQVVENTAERSGRQQEMTEMVATAVHEMGLTVQDIARNAGDAAQASQSARDEALQAREVVRRSIQGIEGMSGDIGKAADAVSQLANEVASIDEVLAVIRSISEQTNLLALNAAIEAARAGEMGRGFAVVADEVRTLARRTQLSTDEVQQMIQRLKQGAGSAVSSMQAGQQATGSGVESSQRTGASLGAITDQVEHISDMNHQVATATEEQSAVTEEINRTVQGISDLARETAAEVQGCREECQALRGLADDLARQMGGFRL >NC_021505|371081:421979|407280_408531_+|WP_016484536.1|DBSCAN-SWA 
MQRYSGFGLFKHSLSHHENWQRMWRTPTPKKVYDVVIVGGGGHGLATAYYLAKEHGITNVAVIEKGYLGGGNTARNTTIVRSNYLWDESAQLYEHAMKLWEGLSQDINYNVMFSQRGVYNLCHTLQDIRDSERRVSANRLNGVDGELLNTAQVAAEIPYLDCSKNTRYPILGATVQRRGGVARHDAVAWGYARAADALGVDLIQQTEVIGFRKENGAVIGVETNKGFIGAKRVGVVTAGNSGHMAKLAGFRLPLESHPLQALVSEPIKPIIDSVIMSNAVHGYISQSDKGDLVIGAGIDGWVGYGQRGSYPVIEHTLQAIVEMFPNLSRVRMNRQWGGIVDTSPDACPIITKTPVKNMFFNCGWGTGGFKATPGSGNVFAASLAKGEMHPLAAPFSMDRFYNGALIDEHGAAAVAH >NC_021505|371081:421979|384724_385669_-|WP_016497615.1|holin|DBSCAN-SWA MHSLIRRSLLTLALSSIATSPLYAAEPAACKNVRLGVVNWTDVIATSAMAQVLLDGLGYQTKQTSASQQIIFAGIRDKRLDMFLGYWNPIMTQTITPFIDAQQVKVLEKPSLEDARATLAVPKYLYDKGLKTFADIHKFEKELGGKIYGIEPGSGANTQIKAMITKNQFGLGKFQLVESSEAGMLAAVDRAVRRKEAVVFFGWAPHPMNVNIDMAYLGESQDALGPDEGRATVWTVTAPDYAERCPNAHRLLANLNFSAEDESRMMQPLLDHKDALESARQWLKDHPDDKARWLEGVTTFDGKPAADNLKLTAN >NC_021505|371081:421979|401879_402110_+|WP_016497626.1|DBSCAN-SWA MLEASLNQIEQLVSDLMQKNAQLTEQNTTLGTQLAQAKEENETLQLSLMEQEEKNGSTAARLQALVDRASAGVVGA >NC_021505|371081:421979|406011_407265_+|WP_016497629.1|DBSCAN-SWA MFSKQDQIQGYDDALLAAMNAEEQRQEDHIELIASENYTSQRVMQAQGSGLTNKYAEGYPGKRYYGGCEHVDKVEALAIERAKQLFGADYANVQPHSGSSANGAVYLALLQAGDTILGMSLAHGGHLTHGAKVSSSGKLYNAVQYGINTDTGLIDYDEVERLAVEHKPKMIVAGFSAYSKTLDFPRFRAIADKVGALLFVDMAHVAGLVAAGLYPNPIPFADVVTTTTHKTLRGPRGGLILAKSNEEIEKKLNAAVFPGAQGGPLMHVIAAKAVCFKEALEPGFKAYQQQVIENAQAMAQVFIDRGYDVVSGGTDNHLFLVSLIRQGLTGKDADAALGRAHITVNKNAVPNDPQSPFVTSGLRIGTPAVTTRGFKVAQCVALAGWICDILDNLGDADVEADVAKNVAALCADFPVYR >NC_021505|371081:421979|420279_420780_+|WP_016497636.1|protease|DBSCAN-SWA MKSLFAMLSLLALPVMAAEPTIYGRYENIALPELGETLKAKMDTGAFTASLSAKDIELFNRDGDEWVRFRLATKDSDGKVYEHKVSRISKIKGRADEEEEGDAPEISKRPVVDLELCLGDVKRTVEVNLVDRSSFNYPLLVGSKALREFKAAVNPAKKFTAGKPDC >NC_021505|371081:421979|421025_421979_+|WP_016497637.1|transposase|DBSCAN-SWA 
MAMQVGKLIVGADVAKAELVIHHDDRDEIIKVKNTKPEIKKWLKQQPLNTAIAVEATNVYHLDLVELAHSLGFEVYVIDGFQLSNYRKSVGVRVKTDPTDARLLSRFLRNEGEDLRPWTPPPAVYGKLQSLLRRRAALVTARTAMTQSWANEALLKAAFTTFVKSIDRLDLLIQKKIKEVLREAGLHEQVARCQAVEGIGFLTATALVMAFMRGEFKSSDSYIAFLGMDLRVIDSGQKNGRRRLTKRGCSEIRRLLHNAAMSASRTATWKGLYEQHRNAGKATTQALVILARKLARVAFALMKNQDEYVTKGGKAPC >NC_021505|371081:421979|412071_412704_+|WP_012270121.1|DBSCAN-SWA MSAINVFQQNPGAEAKAQSPLHHADLASLVGKGRKNAGVTLRERKFLGHLTLRGDGHNPEFAAGVHKALGLELPVALTVVANNDMSLQWVGPDEWLLIVPGGQELAVEQKLRAALDGQHIQVVNVSGGQSLLELRGPNVREVLMKSTSYDVHPNNFPVGKAVGTVFAKSQLVIRRTAEDTWELVIRRSFADYWWLWLQDASAEYGLSIEA >NC_021505|371081:421979|382732_383698_-|WP_016497613.1|DBSCAN-SWA MPFITEIKTFAALGSGVIGSGWVARALAHGLDVVAWDPAPGAEQALRKRIANAWPALEKQGLAQSASQDRLKFVSTIEECVRNADFIQESAPERLDLKLDLHAKISAAAKPDAIIGSSTSGLLPSEFYESSTHPERCVVGHPFNPVYLLPLVEIVGGSRTSPEAIEAAKTIYTALGMRPLHVRKEVPGFIADRLLEALWREALHLVNDGVATTGEIDDAIRFGAGLRWSFMGTFLTYTLAGGDAGMRHFMSQFGPALKLPWTYLPAPELTDKLIDDVVNGTSEQQGERSIAALERYRDDTLLAVLEAVKTSKASHGMSFSD >NC_021505|371081:421979|415235_415997_-|WP_080642752.1|DBSCAN-SWA MMKQHPDSDRTPHSSAYTPYGYAINAPAEQRLLAFNGQLRDAFTSFYALGNGHRIYNPVLQRFHSADRLSPFDAGGINAYAYCSGDPVNRLDPSGQSWLSPIKGIANLFGRTRSRDRAQAQAQLQPELTPQELQPARQQGTQPPSFAQAESENLLLPNYQYFEDNFPPRIQQGLSKIRERRRIIHADLRYMRIHPDNAPEHMKEELKRLGELSLRLRSRLRSDLTQPVNFSPKYSTAINEISPPAPEPSRIRR >NC_021505|371081:421979|404647_405688_-|WP_016497628.1|DBSCAN-SWA MTDKSQQFASDNYSGICPEAWAAMEKANHGHDRAYGDDQWTERASEYFRNLFETDCEVFFAFNGTAANSLALASLCQSYHSVICSETAHVETDECGAPEFFSNGSKLLTAASVNGKLTPQSIREVALKRQDIHYPKPRVVTITQATEVGTVYRPDELKAISATCKELGLNLHMDGARFTNACAFLGCSPAELTWKAGVDVLCFGGTKNGMAVGEAILFFNRQLAEDFDYRCKQAGQLASKMRFLSAPWVGLLEDGAWLRHGNHANHCAQLLALLVSDLPGVELMFPVEANGVFLQMPEHAIEALRAKGWRFYTFIGSGGARFMCSWDTEEERVRELAADIRSIITA >NC_021505|371081:421979|378360_380271_+|WP_115283709.1|DBSCAN-SWA 
MVMFFGGCSISERTVAAANSTPPQLFTLEPTPERNVLQKTASGYLATLLGDPANAEVTLVKIDPARVSQQTQDLAVSLPDGKTAQFHLRDFTTLKSGIDSWVGYKPSAWKATHAASASEIDYDPFYYLSMVREGDKLVGNLIVEGQRYRLDSIGSGRYVLIKVDESKLPPDGEPLVAPAGGARDDTKVKSPQSAHSVIRLMFLATNQRKAANPWWWLDLLLAMRDANQYMKNSDVQITYQWAGNFYGDYDETGRSSSQQLGDIGNAQPFASQVLSMREELRADMVLMYTTEPSVCGRAYISTGKNSPHSIVTCPRSLAHELGHNIGALHKQGETGNVPEYAYGYETPLYNLHTQMVASRDALPNFSNPRVTYLGMPLGDSKFNDVARRFNERRETVENFYPPPNPLTVLVQLFSDYNQRGDMCTLKIEPGVDQPSCNKVKSIRVYDFAPGMKLCFRSEEYRRVCYAGSYAGDFAVPNLKAVSVLPSGLVREGRVGDDAFYGDVKYVEYAMDNQVTIKLYSLADFKGETCEFRMPTSTVGSLSEWPQCAALSGGKSRSAKVFSFSSPYNKLCFFNADHRQSLCFNGNYQGNFSIRNWDVGTDLPSGLVRTHRGGGYMNGSVHRISYGNDSEDWPPAP >NC_021505|371081:421979|398744_399845_+|WP_016497624.1|DBSCAN-SWA MSDTFLNPVTTQTWANGRHIVRCVKVIQETWDVRTFCFMADQPIMFFFKPGQFVTLELEIEGKPVMRSYTISSSPSVPYSFSITVKRVPGGLVSNFLHDTMHEGAELPVHGPVGLFNAIDFPAGKALYLSGGVGITPVMSMARWFYDTNANVDMVFVHSARSPKDIIYHRELEQMASRIPNFSLHIICEKHGLGEPWAGYRGYLNQRLMELIAPDYMERVVFCCGPTPYMTAVKRMLEAAGFDMKNYHEESFGATPPEAKADAVEHAEQAADAPELDISDLNLVEFIGSDKSIRVAPGETVHAAAAKVGLMIPKACGMGICGTCKVLKLGGEVEMEHNGGITEEDEAEGYILSCCSVPKGDVRIDY >NC_021505|371081:421979|408877_411892_+|WP_016497630.1|DBSCAN-SWA 
MSQTYRLASGGRIDRSKVLNFTFNGKTYQGYAGDSLAAALLANGVDIVGRSFKYSRPRGIIAAGTEEPNAILQIGSSEATQIPNVRATQQALYAGLVATSTNGWPNVNNDVMGILGKVGGSMMPPGFYYKTFMYPKSFWMTYEKYIRKAAGLGRAPLQNDPDSYDYMNQHCDVLIVGAGPAGLAAALAAARSGARVILADEQEEFGGSLLDSRETLDGKPAADWVNAVIKELEGLPEVTLLPRATVNGYHDHNFLTIHERLTDHLGDRAPIGQVRHRVHRVRAKRVVLAPGAHERPLVYGNNDVPGNMLAGAVSTYVRRYGVAPGRKLVLSTNNDHAYRAALDWHDAGLQVVAIADARHNPRGSLVEEARAKGIRILTSSAVVEAKGSKHVTGARVAAIDVQAHKVTSPGETLECDLIATSGGYSPIVHLASHLGGRPVWRDDILGFVPGDAPQKRECVGGINGVYALGDVIADGFEGGVRAATEAGFKASVGTLPKTVARKEEATVALFQVPHDKGSKGPKQFVDQQNDVTAAGIELATREGFESVEHVKRYTALGFGTDQGKLGNINGLAIAARSIGITIPEMGTTMFRPNYTPVTFGAVAGRHCGHLFEPVRFTALHAWHVKNGAEFEDVGQWKRPWYFPKAGEDIHAAVARECKAVRDSVGLLDASTLGKIDIQGPDAREFLNRIYTNAWTKLDVGKARYGLMCKEDGMVFDDGVTACVGDNHFIMTTTTGGAARVLQWMELYHQTEWPELKVYFTSVTDHWATMTLSGPNSRKLLSELTDIDMDKEAFPFMTWKEGNVGGVPARVFRISFTGELSYEVNVQANYAMGVLEQIIEAGKKYNLTPYGTETMHVLRAEKGFIIVGQDTDGSMNPDDLNMSWCVGRNKPYSWIGLRGMNREDCVRENRKQLVGLKPVDPTKWLPEGAQLVFDPKQPIPMDMVGHVTSSYASNSLGYSFAMGVVKGGLKRMGERVYSPQADGSVIEAEIVSSVFFDPKGERQNV >NC_021505|371081:421979|380571_382152_+|WP_016497611.1|DBSCAN-SWA MNRKQLKFCLLGVVFLFGGCTASDPANYLESSAVTLFQIKPDQNRAALERQSGTDQYLKILLNVAEDAEVKEVQVKPGLVSKDTTMLSMPLTDGRTVSFKLSRSDNVASGMVGWVGDMPSNRRQLYPSPAEINMDPLNWVSLVSDGKLVVGDIRVEGQLYRLTAVGKGQQVLVKVDESKLPPEAAPIAAPVQPQGNAQLVIAPLSSKSTIRVLFVTTRQSRARFPNYKIELAQALQNANQYLINSKVDAVYELSDIYDSDYDETGKEPQTQLNDMIADKPLGAKIHIEREKVRADLVSMLSTYSIYCGIAKMPARKETAFSSISCFGALGHELGHNMGAMHSDEFPDQGIPAYAYGYKHTAPNFHTQMRTSHGAIPYHSNPRLQYQGVPMGTVDKNDVARTFNENRDTVANFYPDPAHRVRLWLYGANRGCFIDLKPGEQALLSWYTECKDNDVDPVRVEVKDFYSGSTPRKLCFSNIFYTVRSCYTGSNFSGDFVITSVHSGGGKPEGFKFTRRLIGPVYRVSYE >NC_021505|371081:421979|402106_402409_+|WP_016484532.1|DBSCAN-SWA MRLQEQPVNVVSILGIDYSIKAPEGQEETLAQAVRMLNTALNETKRQYPTLIGDKLLVLAALNLCSKQVELQKEHQQTLARTQAQIDATVDAIVRTIAES >NC_021505|371081:421979|388628_389159_+|WP_016497619.1|DBSCAN-SWA 
MAKIAPQLPIEVDSETGVWTSDALPMLYVPRHFFVNNHIGIEEVLGADAYAEILYKAGYKSAWHWCEKEAECHGLEGVAVFEHYMKRLSQRGWGLFEIQDIDLDKGTCSVKLKHSAFVYVYGKVGRKVDYMFTGWFAGAMDQILAARGSSIRTVAEQVYGGSEEGHEDGLFVTKPL >NC_021505|371081:421979|371081_372260_-|WP_012270095.1|holin|DBSCAN-SWA MSIIRFEDVDVIFSNKPREALSLLDQGKTREQILKQTGLVVGVEKANLDINKGEICVLMGLSGSGKSSLLRCINGLNTVSRGKLFVEHEGKHIDIAHCSPAELKMMRTKRIAMVFQKFALMPWLTVRENISFGLEMQGRPEKERRKLVDEKLELVGLTQWRNKKPDELSGGMQQRVGLARALAMDADILLMDEPFSALDPLIRQGLQDELLGLQAKLSKTIVFVSHDLDEALKLGTRIAIMKDGRIIQYSKPEEIVLNPADEYVRTFVAHTNPLNVLCGRSLMRSLDNCKRVNGSVCLDPGIDSWLDLGEGGALKRARQGQNGLDMQNWAPGQDVELLERRPTVVHADIGMREALQIRYQTGNKLVLQDNDRVVGILGDTELYHALLGKNHG >NC_021505|371081:421979|374587_375964_+|WP_016497606.1|DBSCAN-SWA MAISVFDLFKIGVGPSSSHTVGPMRAGALFVQGLRERGELERVKRIEVRLYGSLSATGIGHGTDNATVMGLMGEWPDAIDPTQIVPRIADLRETNTLQLDSRLPIEFVWARDMLLLDENLPYHPNAMTLIAEGEQGELHRDTYYSVGGGFVVDAAQAASGVLDADQTVLPYDFNSAAELLRLCKQNDLSVSQLMMANEKVWRSEEEIRAGLHKLWEAMQECVNNGLKYEGTLPGGLNVRRRAAKLHRSLQEIGKPNVIGSTMSAMEWVNLFALAVNEENAAGGRMVTAPTNGAAGIIPAVLHYYMRFSDEVNESSIVDYFLAAAAVGILCKKNASISGAEVGCQGEVGSACAMAAAGLAEVLGATPPQVENAAEIALEHNLGLTCDPVGGLVQVPCIERNAIAAVKAINAVQMALRGDGEHFISLDQVIRTMRDTGADMHDKYKETSRGGLAVSAIEC >NC_021505|371081:421979|408545_408881_+|WP_016484537.1|DBSCAN-SWA MLHIFCPHCGELRSEEEFHASGQAHIARPLDPAACSDEEWGTYMFYRDNPRGIHHELWDHVAGCRQYFNVTRDTVTYEILETYKIGEKPQVTANGKAANAPSTVKGQGEKV >NC_021505|371081:421979|383716_384601_-|WP_016497614.1|DBSCAN-SWA MNHDVIITCALTGAGDTASKSHLVPVTPKQIAESAVEAAKAGATVVHCHVRDPQTGRFSRDVALYREVMERIREADVDIIVNLTAGMGGDLEIGPGETPLEFGPGTDLIGPLERLAHVEALLPEICTLDCGTLNFGDGNSIYVSTPAQLRAGAKRITELGVKAELEIFDTGHLWFAKQMMKEGLLEDPLFQLCLGIPWGAPADTTTMKAMVDNLPANVTWAGFGIGRMQMPMAAQAVLLGGNVRVGLEDNLYLDRGVLASNGQLVERASEIITRMGGRVLSPAEGREKMNLKRR >NC_021505|371081:421979|385815_386760_+|WP_016497616.1|DBSCAN-SWA 
MLQNIQFLLLPGFSAMGFISALEPLRVANRFKGPSYRWQVLSLDGGAVQASNGMSVNADAALAAGEPAGILLIVAGFDPLACYGQALQQVLRRLDHEGVILGGIDTGAVVLAEAGLLDGHRATVHWEALEAFKENYPSLQATQELFEIDRRRITCAGGTASIDLMLDLIAQAHGSELAVQVSEQFVLGRIRQRQDHQRMQIASRYGISNKKLVKVIGEMERNTEQPLNTQVLAEAVQVTRRQLERLFRVHLDDTPSGFYLRLRLDKARQLLRQTDMSVLEVGVACGFESASYFTRCYRARYQRCPREDRLARAV >NC_021505|371081:421979|413844_415044_+|WP_016497631.1|DBSCAN-SWA MSGNRGVVYLGAGKVEVQHIDYPKMQDPRGKKIEHGVILKVVSTNICGSDQHMVRGRTTAQVGLVLGHEITGEIVEIGRDVERLKIGDLVSVPFNVACGRCRSCKEMHTGVCLTVNPARAGGAYGYVDMGDWTGGQAEYVLVPYADFNLLKLPDRDKAMEKIRDLTCLSDILPTGYHGAVTAGVGPGSTVYVAGAGPVGLAAAASARLLGAACVIVGDLNPARLAHAKSQGFEVVDLSKDTPLHEQIIDILGEPEVDCAVDAVGFEARGHGHEGAKHEAPATVLNSLMQVTRVAGNIGIPGLYVTEDPGAVDAAAKIGALSIRFGLGWAKSHSFHTGQTPTMKYNRQLMQAIMWDRINIAEVVGVQVINLDQAPEGYGEFDAGVPKKFVIDPHKMWGAA >NC_021505|371081:421979|417628_418978_-|WP_016497634.1|DBSCAN-SWA MANQPSSTPAQLRRVLGLPALVFFGLVYMVPLTIFTTYGIVTELTGGRTAGAYLVTLVAMLFTAASYSFMVKRFPVAGSAYSYTNMAFGPNIGFLAGWSLLLDYLFLPMINYLLIGLFLNIAFPAVPAWAFVLACIALVTVLNVVGINSVAKTSNLIVGAQIVFIGVFVALSCQTLAGQPLDLLSPLLGDGSKPGFGHLMAGAAVLCLSFLGFDAVSTLAEECRDARRDVPRAIILTTLFAGLLFTLLAYVSQLVLPGSSFANADAAANEVMFKAGGQFLTNFFTAAYVAGSLGSALASQAAVSRILFTMGRDNVLPRRSFGYLSPRFGTPVFAILLVSAFSLLALVIDLATLASLISFGALVAFSAVNLAVVKTHLMDDASQRDAKGLLSYGVVPLVGLGLTLWLWTSLSALTLVIGLCWFALGLAYLAALTAGFRRPVRMVDFSEAG >NC_021505|371081:421979|412716_413574_+|WP_016484540.1|DBSCAN-SWA MSRAPDTWILTADCPSMLGTVDVVTRYLFEQRCYVTEHHSFDDRQSGRFFIRVEFRQPDDFDETGFRAGLAERSEAFGMAFELTAPNHRPKVVIMVSKADHCLNDLLYRQRIGQLGMDVVAVVSNHPDLEPLAHWHKIPYYHFALDPNDKAGQERKVLQVIEETGAELVILARYMQVLSPELCRRLDGWAINIHHSLLPGFKGAKPYHQAYNKGVKMVGATAHYINNDLDEGPIIAQGVEVVDHSHYPEDLIAKGRDIECLTLARAVGYHIERRVFLNANRTVVL >NC_021505|371081:421979|416843_417527_-|WP_080642753.1|DBSCAN-SWA MFTYCPYGQSNNYTNQNAVLGFNGERYDLAARAYALGQGYRSYSPVLMRFHAPDHLSPFEQGGLNAYAFCEGDPVNYQDPTGHMRGASIWLTPKKAFKRLISIFDRPAPPTVGVEVIPGTLTTPVTAIKKITRFSNLPLRINYTDEAELTRLDQFRYRAELIKYYERELMKNEPLLNKFADLRKKIEMRASNGIEIDVADEERFGILENTKQRFDQLRKEIRKTRSI 
>NC_021505|371081:421979|386887_387289_-|WP_016497617.1|DBSCAN-SWA MKSLAWLALLAVVSGAQAGEEESTPCDNVDTDQQTYACAAFNKQTAERELKSAYDDLIQRIRDQYADESDQATALIGRMDAAEKLWMQLRDADCKVETYAEKQGSKAFQAAWDTCVAQRSDDRSEYLQSIGQQ >NC_021505|371081:421979|389174_391235_+|WP_016497620.1|DBSCAN-SWA MAFEAMFQPIQIGKLTIRNRVLSTAHAEVYATDGGMTTDRYVKYYEEKAKGGIGLAICGGSSVVAIDSPQEWWASVNLSTDRIIPHFQNLADAMHKHGAKIMIQITHMGRRSRWDGFNWPTLMSPSGIREPVHRATCKTIEVEEIWRVIGNYAQAARRAKEGGLDGVELSAVHQHMIDQFWSPRVNKRTDEWGGTFEGRMKFGLEVLKAVRAEVGDDFCVGMRICGDEFHPDGLSHEDMKQIAAYYDGTGMLDFIGVVGSGCDTHNTLANVIPNMSYPPEPFLHLAAGIKEVVKVPVLHAQNIKDPNQATRILEGGYVDMVGMTRAHMADPHLIAKIKMGQIDQIKQCVGANYCIDRQYQGLDVLCIQNAATSREYMGVPHIIEKTTGVKRKVVVVGAGPAGMEAARVAAERGHDVTLFEKKDQIGGQITIAAKAPQRDQIAGITRWYQLELARLKVDLRLGTAASVDAIQDLRPDVIVLAVGGHPFVEQNEHWGAAEGLVVSSWDVLDGKVAPGKNVLVYDTICEFTGMSAADYIADKGSQVEIVTDDIKPGVAMGGTTFPTYYRSMYPKEVIMTGDMMLEKVYREGDKLVAVLENEYTGAKEERVVDQVVIENGVRPDEQLYYALKEGSRNKGQIDVEALFAIKPQPILSQPGEGYLLYRIGDCVAQRNVHAAIYDALRLCKDF >NC_021505|371081:421979|396224_397061_+|WP_041167495.1|DBSCAN-SWA MTTFLLSIDRQHSVLGRGRHEARAFSPYGALSAALIPGLAFCGQHPDPLTGCYPLGNGRRFYSPSLRRFISSDSLSPFGKGGIHAYAYCGGDPVNRHDPSGAFWGVVLRIVGVASSGATLFGSLARTAKNVVGRRAAFWANNNPPGGGGPVPSVRHQELPHASRVSNQQFFITGSAGVAGQLAAAISGVTPAFQTATDVLGVVNSVTNLSGGSIGNFAAAREVGSYLWANPREIPVVALETFMDVTMVDEVFANVGRGFTAVAERIRSARPTPHDVTV >NC_021505|371081:421979|377682_377940_-|WP_016497608.1|DBSCAN-SWA MMHADLIDQDDLAGHLRARGFDIPAGASAEQACEAVVRGLTEPNARALKGMVEQMYTGSATILPAVRQAIDKQLLPALASFNKRA >NC_021505|371081:421979|387573_388551_+|WP_016497618.1|DBSCAN-SWA MSPAELHADSIVIDGLIIAKWNRELFEDMRKGGLTAANCTVSVWEGFKATVDQIAASQKLIRDNSDLVMPVRTTADIRKAKELGKTGILFGFQNAHAFEDQIAYVDVFKQLGVGIVQMCYNTQNLVGTGCYERDGGLSGFGREIVAEMNRVGIMCDLSHVGSKTSEEVILESKKPVCYSHCLPSGLKEHPRNKSDEELKFIADHGGFVGVTMFAPFLAKGIDSTIDDYAEAIEYTMNIVGEDAIGIGTDFTQGHGQDFFEYLTHDKGYARRLTNFGKIINPLGIRTVGEFPNLTETLLKRGHSERVVRKIMGENWVNVLKDVWGE >NC_021505|371081:421979|397143_398436_-|WP_016484527.1|DBSCAN-SWA 
MDVTATLSLGDPLEPARKATAEMLQTRERTYSLPQPFYTDERLFQIDMQEIFQKEWLIAGMTCEIPAKGNYITLQIGKNPILVVRGAEGKVHAFHNVCRHRGSRLCVSDKGKVAKLVCHYHQWTYELDGRLLFAGTEMGADFDMKEYGLKPVHVKVAGGYIFISLAENPPAIDEFLATLEHYMEPYDMENTKVAVHTTLMEKANWKLVLENNRECYHCSGSHPELLQTLLEWDDTNDPRASQEFKDHVAASAAAWEAEKIPYLHKSHGLRNRIVRMPLLKGTVSMTMDGKQACQKLMGRIKNPDLGSMRILHLPHSWNHCMGDHMIVFTVWPISAQETMVTTKWLVHKDAVEGVDYDPERMRKVWDATNDQDRRLAEENQRGINSTAYQPGPYSKTYEFGVVNFIDWYSDRVLANLGAEPAPYLKEVKAQ >NC_021505|371081:421979|416191_416767_+|WP_016497632.1|DBSCAN-SWA MKAFTLATLMTLAASPVFAFNLSDAANAVSAMQTQKQQGQVQAPEAQANLLNTLGSELKITPEQAVGGAGAMLGLARNNLSSDDYGQLTKAVPGLDLLAGANALGGLSGFGDLLGKNSESESALSNALGNNVENRSDLDSAFKALGMDTGMIGQFAPLILQYLGQQGIAGSLLQNLSSLWTAPASTPAPSV |
42 | Bacillus_virus(20.0%) | protease,holin,transposase | NA |
Homologous phage analysis in the prophage region
Bacterial proteins shown in color are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
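The keyword rule described under "Homologous phage analysis in the prophage region" can be illustrated with a minimal Python sketch. It assumes the pipe-delimited header layout seen in the protein records above (contig, region, `start_end_strand`, protein ID, optional annotation, tool name) and plain case-insensitive substring matching. The function names `parse_header` and `matched_keywords` are hypothetical, and the keyword tuple holds only the examples quoted in the text, so the tool's actual list and matching logic may differ.

```python
# Keywords quoted in the report text; the tool's real list may be longer (assumption).
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def parse_header(header):
    """Split a protein header such as
    '>NC_021505|371081:421979|420279_420780_+|WP_016497636.1|protease|DBSCAN-SWA'
    into its fields. The layout is inferred from the records in this report."""
    fields = header.strip().lstrip(">").split("|")
    contig, region, location, protein_id = fields[:4]
    # Anything between the protein ID and the trailing tool name is
    # treated as the functional annotation (empty when absent).
    annotation = "|".join(fields[4:-1])
    start, end, strand = location.split("_")
    return {
        "contig": contig,
        "region": region,
        "start": int(start),
        "end": int(end),
        "strand": strand,
        "protein_id": protein_id,
        "annotation": annotation,
    }


def matched_keywords(annotation):
    """Return the quoted keywords found in the annotation (case-insensitive
    substring match; a simplification, not the tool's confirmed behavior)."""
    lowered = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in lowered]


if __name__ == "__main__":
    record = parse_header(
        ">NC_021505|371081:421979|421025_421979_+|WP_016497637.1|transposase|DBSCAN-SWA"
    )
    print(record["protein_id"], matched_keywords(record["annotation"]))
```

Substring matching is the simplest reading of the rule. It would also flag unrelated annotations that merely contain a keyword (for example 'lysine' for 'lysin'), so a stricter word-boundary match may be preferable in practice.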
DBSCAN-SWA_2 |
2037360 : 2046174
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NC_021505|2037360:2046174|DBSCAN-SWA TATGCTCGATTCCAAACTGTTACGCGGCCAACTTCAGGAAGTGGCGCAACGCCTGGCCTCCCGTGGCTATATCCTGGATGTCGCGCGCATCGAATCACTGGAAGAGCGCCGCAAGGCGGTGCAGACCCGCACCGAGCAGCTGCAGGCTGAGCGTAACGCTCGTTCCAAGTCTATCGGCCAGGCCAAGGCCAAGGGCGAAGACATCGCGCCACTGATGGCCGACGTCGAGCGCATGGCCAACGAACTGGCGGCCGGCAAGACTGAGCTGGACGGCATTCAGGCCGAACTGGACGGTATCCTGCTGACCATCCCCAACCTGCCGGACGCCAGCGTACCGGTCGGTGCCAGCGAAGACGACAACGTCGAAGTGCGCCGTTGGGGTACGCCGACCGCCTTTGACTTCGCAATCAAGGACCACGTCGCCCTCGGTGAAGTCAGCGGTGGCCTGGACTTCGAGGCCGCAGCCAAGCTGTCTGGCGCCCGCTTTGCCGTGCTGCGTGGCCCGATTGCCCGCCTGCACCGCGCCCTGGCGCAGTTCATGATCAACCTGCACACCGGCGAGCACGGCTACGAGGAGCACTACACCCCGTACCTGGTGCAAGCGCCTGCCTTGCAGGGCACTGGCCAGCTGCCGAAGTTCGAGGAAGACCTGTTCAAGATCACCCGCGAAGGTGAAGCTGACTTCTACCTGATTCCGACTGCCGAAGTGTCGCTGACCAACCTGGTAGCGGGTGAAATCCTCGACGCCAAGCAGCTGCCGCTGAAACTGGTTGCCCACACCCCTTGCTTCCGCAGTGAAGCCGGTGCTTCTGGCCGTGACACCCGCGGCATGATCCGCCAGCACCAGTTTGACAAGGTCGAGATGGTGCAGGTGGTGGAGCCGGCCAAGTCGATGGAAGCCCTGGAAGGCTTGACCGCCAACGCCGAGCGCGTGCTGCAGCTGCTGGAGCTGCCGTACCGCGTGCTGGCCCTGTGCACCGGCGACATGGGCTTTGGCGCCGTTAAAACCTACGACCTGGAAGTGTGGGTACCGAGCCAGGACAAGTACCGCGAAATCAGCTCGTGCTCCAACTGTGGCGACTTCCAGGCACGCCGCATGCAGGCCCGCTGGCGCAACCCGGAAACCGGCAAGCCAGAGCTGGTGCACACCCTCAACGGCTCCGGCCTGGCCGTAGGCCGTACCTTGGTTGCCGTGCTGGAAAACTACCAGCAGGCTGACGGCTCGATCCGTGTGCCAGACGTGCTGAAGCCGTACATGGGCGGCGTCGAGGTCATCCGCTAAATGGATTACCTGCCGCTGTTCCACAAGCTGCAGGGCGGCCGCGTGCTGGTCGTCGGTGGCGGTGAGATCGCCTTGCGCAAGGCGCGCCTGCTGGCGGATGCCGGTAGCGAGCTGCGCGTGGTGGCGCCGGATGTCGACGGCGAGTTGGCTGCGTTGGCCCGGGAAGGTGGCGGTGAGGTGCTGGTGCGTGGCTATCAGGCAGCCGACCTGGTCGGTTGCCGGCTGGTGATCGCGGCTACCGACGACCCTGGGCTGAATGCCCAGGTGTCGGCGGATGCCCAGGCACTCAGCCTGCCAGTCAACGTGGTGGATGCCCCGGCCCTGTGCACGGTGATCTTCCCGGCGATTGTCGACCGTTCACCGCTGGTGATTGCGGTATCCAGTGGCGGTGACGCCCCGGTGTTGGCGCGTTTGATCCGCGCCAAGCTGGAGGCCTGGATTCCGTCGGCCTATGGTGAGCTGGCCGGGCTGGCTGCGCGTTTCCGGCACAAGGTCAAAACCTTGTACCCGGACGTCAACCAGCGTCGCGGCTTCTGGGAAACCGTGTTCCAGGGGCCGATTGCCGAGCGCCAGCTGGCCGGGCAGGGCGCTGAGGCCGAACGCCTGTTACAGGCGATGGTCGATGGCGCGCCGGTGCAGCAGG
GTGGTGAGGTGTACCTGGTAGGTGCGGGCCCGGGCGATCCGGACTTGCTCACCTTCCGTGCCTTGCGCCTGATGCAGCAGGCCGACGTGGTGCTGTACGACCGCCTGGTGGCACCTGCAATCATCGAGATGTGCCGCCGTGATGCCGAGCGCATCTACGTGGGCAAGCGCCGCGCCGATCATTCCGTGCCGCAGGACCAGATCAACCGCCTGCTGGTCGATCTGGCCAAGCAGGGCAAGCGTGTACTGCGACTGAAGGGTGGCGACCCGTTCATCTTCGGCCGGGGTGGCGAAGAGATCGAAGAGCTGGCCGAGCATGGCATCCCTTTCCAGGTAGTGCCGGGCATTACGGCGGCCAGTGGCTGCTCCGCTTATGGCGGCATACCGCTGACTCACCGCGACTACGCCCAGTCGGTGCGCTTCGTCACCGGGCACCTGAAGGATGGCACCAGCAACCTGCCGTGGAATGACCTGGTGGCGCCGGCGCAGACCTTGGTGTTCTACATGGGGTTGGTCGGTTTGCCGACCATCTGTGCCGAGCTGATTCGCCATGGGCGGGCGGCAAGTACCCCGGCGGCGTTGGTGCAGCAGGGGACTACGCGTAACCAGCGGGTGTTTACCGGTACTTTGGCGGACTTGCCAGAGCTGGTGGCCCGGCATGAAGTGCATGCGCCGACCCTGGTGATTGTGGGGGAAGTGGTGCAGTTGCGTGACAAGCTGGCCTGGTTCGAAGGTTCGCAGAACAGCTGAAATTGTCGGGGCCGCTTTGCGGCCCATTCGCAGCACAAGGCTGCTCCTACAGGGGTCGCGTGAAGCCCTGCGTGATTCCTGTAGGAGCAGCCTTGTGCTGCGATGGGCTGCAAAGCAGCCCCGGCTATCTCAAGCCTGCCAGATCCCTTTCCCCACCAACTTCTCTCGATCATGCGGCAAGCCAAAGTCCTGCAACGGCCCCTTAGGCACGATCCCGGTCGGGTTGATGGTCTTGTGGCTCATGTAATAGTGCTTCTGGATATGCTCCATATCCACCGTCCCCGCCACCCCTGGCCACTGGTACAGCTCACGCAGCCAGTTCGACAGGTTGTGATAGTCACTCAGGCGGCGCAGGTTGCACTTGAAGTGCCCGTGGTACACCGCATCGAAGCGCACCAGCGTGGTAAACAGGCGTACATCGGCCTCGGTCAGGTACTCACCGGCCAGGTAGCGGTTACGGCTCAGCACGTCTTCCAGATGATCCAGCTCGTTGAACACGTCGTCGAAGGCTGCCTCGTAGGCGTCTTGCGTGGTGGCAAAGCCTGCGCGGTACACGCCATTGTTCACCGCCGGGTAAATGCGCTCGTTCAATGCCTCGATGGTCGGCCGCAGTGGCTCGGGGTAGAGGTCCAGCATATTGCCGGTCAGCTCGTTGAACGCGCTGTTGAAGATACGGATGATCTCCGACGACTCGTTGTTGACGATGCGTTTCTCTTGCTTGTCCCACAACACGGGCACGGTGACGCGGCCGGTGTAATGCGGGTCATCCTGGGTATAGCGCTGGTGCAGGTACTGCAGGCCGTCAAGGTGGTCACCGCTGGACCCTTGCTGCTGGTCGAAGGTCCAGCCATTGTCTTGCATCAACCAGCTGACCACCGACACATCGATCAGTGGCTCAAGGCCTTTCAGGGCACGCAAGATCAGGGTGCGGTGGGCCCATGGGCAGGCCAGCGAGACGTAAAGGTGGTAACGGCCAGCCTCCGGGGCGGGCAGCTGATTGCGGCGCTGAGCGTTTTCGCGTTTGAACGTACCGTCCTTGCCGTTTTCATACCACTGGTCGTGCCAGCGGCCGTCGATCAAAAGGCCCATGGTGAGCTCCTGAAACAAAGTTGTTTGTCGTTGGAGCCCAGTCTAGGCCAAATCGTTCGAACAAATAGCGCAAATATCGGGGTGCTATTATCGGCTAAATCGATCTATTCCGCGTATCCCAGTAGCCCTGCGCTGTCGTGAATGCCTGTTCGCGGTCTTGCC
CCAGGCCACGCAGGGCCAGGGCCATGGTCGCGATCAGGGCCTTTTCGCCATAACTGTCTTCCACTTCACCGCGCCAGAACGCGCACAGGTGCTCGGCCTGCAGGCTGCCCGGTTTGACATGGCGGCGCTCGCTCAGTGCTGGCCACTCTTCGTCCCACGCTTCGCCCGCCGTGGTGCCGTAGAGGTGGCTGATGACATCGGGGTTGACCTCGATCTCGCCGCCATCGCCCTTGATCACCACAGCATGGTCGCCCAGCAGGCGGCTGGCCTCACGGTGCACGGCCTGGTAACCAGGGTGGAAAATGCTTTGCAGGCCACAGCGTGCGCCCAGCGGGTTAAGTACCCGGGCCAGCGAATGAATGGGCGAGCGCAGGCCCAAGGTGTTGCGCAGGTCGATCATGCGCTGCAACTGCGGCGCCCAGTCGTGCAGCGGGAAGAACGCCAATTGGTGCTGGTCGAGGGCGTTGCCGACGGCGGCCCAGTCGCGGCATAGCGGGATTTGCAGCAGGTCGAGCAGTTGCTCGCTGTACATGCGCCCGGCAGTGTGTGCGCCGCCGCCGTGCATCAGGATGCGCACGCCATTGCCGGCCAGGCACTTGGCCGCCAGCAGGTACCAGGGCAGGTGGCGCTTCTTGCCGGCGTAGGTGGGCCAGTCCAGGTCCACCGCAATGCTCGGTGCCTGCAGGTGGGCGCGCAGGGCCTCGGTGAAGCCGGCCAGCTCTTCGGCGCTTTCTTCCTTGTGGCGCAGCAGCATGAGGAAGGCCCCCAGCTGGGTGTCTTCGACCTTGCCTTCCAGCAGCAGGGTCATCGCCGCGCGGGCTTCTTCGCGGGTCAGGCCGCGCGCGCCGCGCTTGCCTTTGCCAAGAATGCGCACGAATTCAGCGAACGGGTGTTCGGCGGGGGTCTCCAGGGTAAGGGGACGAACTTCATTCAAGAGGGGCTGGGTCATATGCAGTTGGTCGGCTTGGGCAGGCCCGCCAGCTTGGCGGCAAGTTTGGCGGGGGTGCCATTGAACAGGCGGTTCAGGTGCGGGCTGTTGCCCTTTTCCGGGCCCAGCTTCAGGGCCGCGTACTTGATCAGCGGGCGCGTGGCCGGGGACAGCTGGTATTCCTGGTAGAACTGGCGCAGCAGTTCGAGGATTTCCCAATGGTCCGCCGTCAATGGGATACCCTCGCGCTCGGCCAAAGCCTCGGCGGCGGCGCGGGACCAGTCTTGCAGGTCGACCAAAAAACCGTCCTTGTCCAGGGCGATCGCCTGATCGCCAACGTTGAGCGTACTCATAGCCAGCTGTTGACCTTGTCGTAATGCAGCGACAGTTCGACGAACCCGGCGTAGTCCACGGCCTTGGCCAAGTCGTTGTCGACCGCGCGGGCTTGCACATCCTCGTCCAGGGCGAACAGGCGCTGGGCCAGGCCGGCAGCTTGCAGCTGGCGGTGCGGCTCGCTGCCGCTGCGCAGGGCATACACCGCATCGCCGCATAGCAGCAACGCGTCGTCGGTGCCGAGCAGGCGCAGGCAGCTGGCCAGGCGCTCGTCGCCAAACGGGGAGTGGGCAATTACATGCAGGGTTGTCATCAGAGCGTTACCACCTGGTCGAAACGGGCGATCAGCGCGGCAAGGGCCGCGTCATCCAGCACTTGCACCGGCAGCGCCAGGGCATCGACTGCCAGGCCGCGGCGGGTGAGGCTGTGGCTGCAGGCGAACAGCTCCTCGACACCGAACATCGGCAGCGCCTGCAGGTTGGCGGCCAGGTTCTTCTGTTGCACGGCGGTGGGTTGCTGGCCGGGAGCGAGCTGGAACACCCCGTCGTCGAGAAACAGCATGCCCAGCGGCAGGTCGAAGGCCCCGCCGGCCAGGGCGATGTCCAGCGCTTCGCGGGCTGATGGGCCATTCCAGGGGGCCTGACGGCTGATGATCAACAAGGATTTGGCCATTTCAGTCGCCTCCAAAGCAGACAAGGCGGTCGGCAACCTGCGCCGCTTCATGCAGCTGGCCAAGCCC
CGACAGTTCCCAAGGTTTGGGCAGGTTCACCGCCGGGCGTTGGTAACGGTTGGCTTCGGCCTCGTCGAGCACGCCACGGCGCAGGGCGGCGGCGATGCACACCACGGCGTCCAGCTGGTGAGTCTCGATAAAGGCACGCCATTGGCTGGCCACATCCAGTTCGTCCTGGGGCGCGACCACGTTGGCCGAGGCACTGTGCACCCCGTCCTGATAAAAGAACAGCCGGGCAATCTCATGCCCGCCGGCCAGCACCGCCTCGGCGTAGCGCAAGGCGCGCCGCGAGGAGGGCGCATGGGCCGGGGAGAACACCGCGATAGCGAATTTCATGGCTAACTCATGCAAAGGAATGCCGCCATGATAAAGCAAAAAAGCCCGCGCCCGCTTGTGCAAGCGGCGCGGGCTTTCTATGTATCCGTGTGCCCTACGCGCCCCTGTGGGAGCGGCCTTGTGTCGCGATGGGCTGCAAAGCAGCCCCATGGCCCTGTCAGAAGGACACAAGACTCAAGCCTGCTCCTTGCTCTCCGGCAAGAACCAGTTCAGCACCAGCGCACAGATCCCCCCGGTAGCCACACCCGACTCCAGCACATTGCGAATCGCCGCCGGCATATGCGCCAGGAACTCCGGCACCTGCGCCACGCCCAGGCCCAGCGCCAACGACACGGCAATGATCAGCAGCGCACGGCGGTCCAGCCGGGTGCTGGCCAGAATATTGATCCCCGAAGCCGCAACCGCCCCGAACATCACCATGGCCGCACCACCCAGCACCGGCTCCGGCACGGCCTGGATCACCCCGGCAACGCTCGGGAACAGCCCCAGCAGCACCAGCATCACGGCAATCCACATACCAATGTGACGGCTGGCAATGCCGGTCAGCTGAATCACCCCGTTGTTCTGGGCAAAGATCGAGCTAGGGAAGGTATTGAACACACCGGCCAGCAGCGAGTTGGCACCGTTGACCAGCACGCCACCCTTGATCCGCTGCATCCATACCGGCCCTTCGACCGGCTGGCGCGACACCTTGCTGGTGGCGGTAACGTCACCGATGGCTTCCAGCGAGGTCACCAGGTAAATCACCAGCATCGGAATGAACAGCGCCCAAGAGAAGCCCAGGCCGAAGTGCAGCGGGGTCGGCACCTGGAACAACGCGGCTTCGTGCATGCCGGTAAAGTCCAGGCGGCCCATGTAGCCGGCCAGTGCATAGCCAACCGCCAGGGCAATGACGATGGCGCAGCTGCGCATCCACACCACGGGTACGCGGTTGAGGATCACGATGATCGCCAGCACCACGCCCGACAGCAGCAGGTTATCGCCGTTGGCAAAGGTGCCATTGGCCATGGCGCCGAAGCCACCGCCCATGCTGATCAGGCCGACCTTGATCAAGGTGAGGCCGATCATCAGCACCACGATACCGGTCACCAGCGGAGTGATCAGGCGTTTGACGAACGGCAGGATGCGCGACACGCCCATCTCCACGAACGAGCCCGCGATCACCACGCCGAAGATGGCTGCCATCACACCCTCTACCGGCGTGCCTTGCTTCACCATCAGCGCGCCGCCCGCGATCAGCGGGCCGACGAAGTTGAAGCTGGTGCCCTGCACAATCAGCAGCCCGGCGCCGAATGGCCCAAAGCGCTTGCACTGCACGAAGGTGGCGATACCCGAGATCACCAGCGACATGGAAACGATCAGGTTGGTATCGCGCGCAGAAACGCCCAGCGCCTGGCAGATCAGCAGGCCAGGGGTAACGATCGGCACGATGATCGCCAGCAGGTGCTGCAGGGCAGCCAGCAGGCCGATCAGCAGCCGTGGCTTGTCCTCCAGGCCAAGCACCAGTTCATTGGCAGGCGCCGCTGCGCCTGGGCTGTGTTCGTGTGAGCTCATTGCTGAAAGCCGCCCCGGAAGAAAAAAGGAGCGCATTCTACGGGGTGAAGAGGGATTGCGGTAGAGAAAAGCATGATGGCAGCATGATCGGCGAGCATTCGCCTGATTATCTGACTTTT
TAGTCAGTTATTTGATGCTGACTTATCAGGGAAAACAAAGTCCCGGCCTACACCGTTGCCTTTTGGAATTTGCGCAATACCTGTGGGAGCGGGCAAGCCCGCGAAGAATCCACCGCGGTGCGAGGCAATGGCTAAACTGGCGTTCGCGACCAGGCGCCCCCAATTTCCCGCGCCCACAAAAAAGCCCGCCGAAGCGGGCTTTCTCACACAGCAATCAATCAGTCATCGCGACCCATGATGCCAAACAGCTGCAGCAGGCTGACAAACAGGTTGTAGATCGACACATACAGGCTGATGGTCGCCATGATGTAGTTACGCTCGCCACCATGAATGATCGCGCTGGTCTGGAACAGGATGCAGACCGACGAGAACAGCACGAAGCCAGCGCTGATCGCCAGTTGCAGGCCGCTGATCTGGAAGAAGAAGCTGGCAACGACAGCGCCCAGCAACACGAAGAAGCCCGCAGTGATGAAGCCACTGAGGAAGCTCATGTCCTTGCGGGTGATCAGCACATAGGCCGACAGACCACCAAACACCAGCGCAGTCATGGCAAACGCCGAGCTGACCACTTCAGCGCCACCGGCCATGCCCAGGTAACGGTTGAGGATCGGGCCGAGAATGAAGCCCATGAAGCCGGTGAGGGCGAAGGTGGACACCAGACCCCAAGCCGAATCACGCAGCTTGTTGGTCAGGAAGAACAACCCGTAGAAGCCGATCAGCACCACGAACACGTTCGGGTAGCCGACGCGCATCTGCTGGGCAACGAATGCCATGACACCGCTGAAGGCGAGGGTAAGCGCCAGCAGGCTGTACGTGTTGCGCAGGACCTTGCTGATCTCCTGCTGCTCGACCTGCTGGCCGTGATGGACGGCGTAATCCTGTTCGCGCAT
Protein sequences of DBSCAN-SWA_2 >NC_021505|2037360:2046174|2042229_2042565_-|WP_016498865.1|DBSCAN-SWA MSTLNVGDQAIALDKDGFLVDLQDWSRAAAEALAEREGIPLTADHWEILELLRQFYQEYQLSPATRPLIKYAALKLGPEKGNSPHLNRLFNGTPAKLAAKLAGLPKPTNCI >NC_021505|2037360:2046174|2043218_2043611_-|WP_016498868.1|DBSCAN-SWA MKFAIAVFSPAHAPSSRRALRYAEAVLAGGHEIARLFFYQDGVHSASANVVAPQDELDVASQWRAFIETHQLDAVVCIAAALRRGVLDEAEANRYQRPAVNLPKPWELSGLGQLHEAAQVADRLVCFGGD >NC_021505|2037360:2046174|2042857_2043217_-|WP_016498867.1|DBSCAN-SWA MAKSLLIISRQAPWNGPSAREALDIALAGGAFDLPLGMLFLDDGVFQLAPGQQPTAVQQKNLAANLQALPMFGVEELFACSHSLTRRGLAVDALALPVQVLDDAALAALIARFDQVVTL >NC_021505|2037360:2046174|2042561_2042858_-|WP_016498866.1|DBSCAN-SWA MTTLHVIAHSPFGDERLASCLRLLGTDDALLLCGDAVYALRSGSEPHRQLQAAGLAQRLFALDEDVQARAVDNDLAKAVDYAGFVELSLHYDKVNSWL >NC_021505|2037360:2046174|2037360_2038641_+|WP_016498861.1|tRNA|DBSCAN-SWA MLDSKLLRGQLQEVAQRLASRGYILDVARIESLEERRKAVQTRTEQLQAERNARSKSIGQAKAKGEDIAPLMADVERMANELAAGKTELDGIQAELDGILLTIPNLPDASVPVGASEDDNVEVRRWGTPTAFDFAIKDHVALGEVSGGLDFEAAAKLSGARFAVLRGPIARLHRALAQFMINLHTGEHGYEEHYTPYLVQAPALQGTGQLPKFEEDLFKITREGEADFYLIPTAEVSLTNLVAGEILDAKQLPLKLVAHTPCFRSEAGASGRDTRGMIRQHQFDKVEMVQVVEPAKSMEALEGLTANAERVLQLLELPYRVLALCTGDMGFGAVKTYDLEVWVPSQDKYREISSCSNCGDFQARRMQARWRNPETGKPELVHTLNGSGLAVGRTLVAVLENYQQADGSIRVPDVLKPYMGGVEVIR >NC_021505|2037360:2046174|2045502_2046174_-|WP_003251184.1|DBSCAN-SWA MREQDYAVHHGQQVEQQEISKVLRNTYSLLALTLAFSGVMAFVAQQMRVGYPNVFVVLIGFYGLFFLTNKLRDSAWGLVSTFALTGFMGFILGPILNRYLGMAGGAEVVSSAFAMTALVFGGLSAYVLITRKDMSFLSGFITAGFFVLLGAVVASFFFQISGLQLAISAGFVLFSSVCILFQTSAIIHGGERNYIMATISLYVSIYNLFVSLLQLFGIMGRDD >NC_021505|2037360:2046174|2041216_2042218_-|WP_041167993.1|DBSCAN-SWA MNEVRPLTLETPAEHPFAEFVRILGKGKRGARGLTREEARAAMTLLLEGKVEDTQLGAFLMLLRHKEESAEELAGFTEALRAHLQAPSIAVDLDWPTYAGKKRHLPWYLLAAKCLAGNGVRILMHGGGAHTAGRMYSEQLLDLLQIPLCRDWAAVGNALDQHQLAFFPLHDWAPQLQRMIDLRNTLGLRSPIHSLARVLNPLGARCGLQSIFHPGYQAVHREASRLLGDHAVVIKGDGGEIEVNPDVISHLYGTTAGEAWDEEWPALSERRHVKPGSLQAEHLCAFWRGEVEDSYGEKALIATMALALRGLGQDREQAFTTAQGYWDTRNRSI 
>NC_021505|2037360:2046174|2038641_2040033_+|WP_016498862.1|DBSCAN-SWA MDYLPLFHKLQGGRVLVVGGGEIALRKARLLADAGSELRVVAPDVDGELAALAREGGGEVLVRGYQAADLVGCRLVIAATDDPGLNAQVSADAQALSLPVNVVDAPALCTVIFPAIVDRSPLVIAVSSGGDAPVLARLIRAKLEAWIPSAYGELAGLAARFRHKVKTLYPDVNQRRGFWETVFQGPIAERQLAGQGAEAERLLQAMVDGAPVQQGGEVYLVGAGPGDPDLLTFRALRLMQQADVVLYDRLVAPAIIEMCRRDAERIYVGKRRADHSVPQDQINRLLVDLAKQGKRVLRLKGGDPFIFGRGGEEIEELAEHGIPFQVVPGITAASGCSAYGGIPLTHRDYAQSVRFVTGHLKDGTSNLPWNDLVAPAQTLVFYMGLVGLPTICAELIRHGRAASTPAALVQQGTTRNQRVFTGTLADLPELVARHEVHAPTLVIVGEVVQLRDKLAWFEGSQNS >NC_021505|2037360:2046174|2043785_2045165_-|WP_016498869.1|DBSCAN-SWA MSSHEHSPGAAAPANELVLGLEDKPRLLIGLLAALQHLLAIIVPIVTPGLLICQALGVSARDTNLIVSMSLVISGIATFVQCKRFGPFGAGLLIVQGTSFNFVGPLIAGGALMVKQGTPVEGVMAAIFGVVIAGSFVEMGVSRILPFVKRLITPLVTGIVVLMIGLTLIKVGLISMGGGFGAMANGTFANGDNLLLSGVVLAIIVILNRVPVVWMRSCAIVIALAVGYALAGYMGRLDFTGMHEAALFQVPTPLHFGLGFSWALFIPMLVIYLVTSLEAIGDVTATSKVSRQPVEGPVWMQRIKGGVLVNGANSLLAGVFNTFPSSIFAQNNGVIQLTGIASRHIGMWIAVMLVLLGLFPSVAGVIQAVPEPVLGGAAMVMFGAVAASGINILASTRLDRRALLIIAVSLALGLGVAQVPEFLAHMPAAIRNVLESGVATGGICALVLNWFLPESKEQA >NC_021505|2037360:2046174|2040162_2041122_-|WP_016498863.1|DBSCAN-SWA MGLLIDGRWHDQWYENGKDGTFKRENAQRRNQLPAPEAGRYHLYVSLACPWAHRTLILRALKGLEPLIDVSVVSWLMQDNGWTFDQQQGSSGDHLDGLQYLHQRYTQDDPHYTGRVTVPVLWDKQEKRIVNNESSEIIRIFNSAFNELTGNMLDLYPEPLRPTIEALNERIYPAVNNGVYRAGFATTQDAYEAAFDDVFNELDHLEDVLSRNRYLAGEYLTEADVRLFTTLVRFDAVYHGHFKCNLRRLSDYHNLSNWLRELYQWPGVAGTVDMEHIQKHYYMSHKTINPTGIVPKGPLQDFGLPHDREKLVGKGIWQA |
10 | uncultured_Caudovirales_phage(75.0%) | tRNA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_3 |
2090114 : 2097019
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NC_021505|2090114:2097019|DBSCAN-SWA TTTAGTCGTCGAAGCTGTCGTGGTCAGGCGGCGCTGACTGCTGGTTCTGCTGCTGGCGGGCGGGGCGTTGTTGACGTGGCTGCCTATCGGGGATTTGCCCTGTTTGATGACCCTGCGGCCGGCTACCGAGTAGCTGCATGGTCCCGTTGATGTCCACGATGATTTCCGTGGTGTAGCGCTTGATGCCGTCCTTCTCCCACTCGCGGGTCTGCAGCTTGCCCTCGATGTAGCACTGGGAGCCCTTTCGCAGGTATTCGCCGGCAATCTCGGCGACCTTTCCGAACAGCGACACCCGGTGCCACTCTGTGCGCTCGACCTTCTGCCCCGACTGGTTGTCAGTCCACTGCTCGCTTGTGGCCAGGCTGAGGTTTGTGACGGCGTTGCCGTTCGGCAGGTAGCGCACGTCAGGGTCTTGGCCGCAGGTTCCTACCAAACTGACCTTGTTGACCCCGCGACTCATGACTGAGCAACCGCTGCCGCAACGATGAACAGTAGGGCAAGGAGTGAGCACCAGCGGGTCGCGCGCTCGCCGACCTGGCGGGCCCTCACCAGTGCGACAACAGGCAGGGCTTTGGCCTCGATGGCGCGCTCCAGACTTTCCGCATAGCGAGCTGCCTGTGGATAGCTGGTGTTGCGGCCGTAGAACCGCACGCTGCGCAAGGCGAGCCAAGGGGTAGGCAATGGCTGATCCATCTCTGGCTCTGCAGGAGGCAGTCTTTGCCAGGCTTCAGGCGGAGGTAAGCTGCCCGATTTACGACGGCGCGCCGCTGAACGCCGAAATGCCGTACCTCTCCATCGATCGAGAGGTTTCGGTCAACAGCAGCCCGATATCGGGCCGCAAGCGCGAAACGCGCCTGCTGTACCTGTCGGTCTGGTCCGATGCCGTGGGCCAGGCCGAGGTCAAGCGCATCAACGGCGAAGTTATCGCCGCCCTCGATGAGCGCCGGCTGCCGCTGGAGGTGGGGCGCGCGGTTTCCGTTCGGGTCGAACAGTCCGACGCCGACGGCATCACTTACCAGGGCTCGCTCACCGTCCGCGTGATCACCACCCACTAAAAAACCTACCGGCCGCCCCGCGGCCTCTATCCAATGCGGCTTTGGAGGAATACCCATGCCTGCAGCAGACAATTTGAACACAGCCGCCGGCTGCCGACTCTTCATCGGCGGGAAAACAGGCGCGGACACCGAGACCGACTACAAGGCCGACACCTACGTCGAAGTGGGCGAGATCGAGGACCTTGGCGAATTCGGCGACACCTTCAGCAGCGTAAACTTCACCTCGCTGAAAGACGGTCGCGTGCGAAAGTACAAGGGCACCGCTGACGCCGGAGACCTGACCGTCACTGTCGGCATGGACAGTGGTGATGCCGGGCAGCGCGCAGTCAAGACCGCGCACAAGGACCGCAGCAAGGGCGACTACAACCTCAAGATCACCCTGAATGACGGTGACCCGACCGCCACCCCGGTGATCAACCCGACCACCTTCTACATGCGCGTCAAGGTGATGAACAACACCGTTGCACCAGGTGCTGCTGACAACGTGGTGCGCCGGAACATCACCATGGGCATCAACTCCGACGTGCTGGAAATTCCAGCCGCCGCCGCTGCTTGATAGGGGGCTTCAGTGAGCAAAACCCTGCACGGTACTACCGAGGTCACCGTAGGCGGTCGGGTGCTTGTCCTGTCGCCGACGCTCAAAGCGGTTCGTACCATCGAGGCCTACTTTGGTGGGCTCCGAGGCGCGTCCGAGCGGCTGCGTGCTGTAGGCGTTGATGCTGTCGTCGTCGTTTTCGCTGCCGGCTCCGGGATGGAAGACAAGGGCGCCGTCGAAGAACTTGCCGAGGAGATCTGGCAGCAGGGCGTTGCTGACCTGGTGCCAGCCGCGACGGCCTTCCTTTATGCACTTTACAACCCGCGGGGCGGTGA
CCCGGGGAAGCCGCCAGCACCGACGGGGTCAGCGCCGTAGAGAACGGGAGCTACGTCGACCGGATGTTCTCCGTCGTCGTAGGCTGGCTGGGCTGGTCGCCGCAAGTGGCCTGGACAACGCCGCTGCAGGAGCTCTTCCTGGCCATCGATGCACGGGTAGAGTGGGCTCGCATGACGCATCCATTTGCTGTCAGCTCGAAACCGGAGGGGCAAGGATCGAAGTCGAAATCTTCGAACGTCGCCGAGAAAATTCGGCAGGCGTTGTCTGGGCGAAAGGCACAGTGACGTTTTTGCCCAACTCACCTGTGAGAACATTTTTGTTGGCGTTGATCAACGATGGTAGATTTGCGCCAACCACAGGGAGAGTCGCATGAAACGTGTATTGTTGGTAATGGCAATGGGAGTTGGAGTCCTTTCTGGATGTAAGTCGCCTGAGGATAAGATGGTCTCGGTTTGCACAGATATTGCAAAAATGTCTGTCGCTGATCCTTCGTCATTGTTGGTTAATTCCGGGTCTGTTGTTCAGGCCATCCCTACAAAAGAAAGTTTGTTCAGGTTCGCCTCTTTGAATTTCGATGGTGAGCTAACTGGCGATAGTAAGGCTTGGTACGATATTCAAGTAAAAGAACTTGATAAGCTGAAGGAGAGCTATGCGAGTATTGATTACACGGATAAGTCTTCTTTAGCTCGAAGGGGGCAGGCAATTTGTTGGTTTATGGATAGGGGGGATGGACCTGTCCTAGGCTCCGTGTCAGTTGCCGGGAAAAGCTATTCGGGTACTGACTTGTTAATGGTTTTTGTCAGCAATCCAAGGCCTAAATATTTGAGCTCGTCTAACGCAATCGAGTAGCGAGACGTGAATTAATTTTTAAGCCGATCTGTCTACAGCCCGCTAGCGCGGGCTTTTTGTTGAGGCAAATATGGCTGAAACCGACATTCAGGGTATGCTGGTCAGGATTGAGGCAACCACGGCTCAGCTCCGGCAAGAGATGGCTCGCGCTGAGTCCAGCGTGTCGCAAACCAGTGGGAAAATCGACAGCAGCCTAAAGCAGGTGGATGAAGCGTTCGATAGGGTTGAGGCCAACTCGGTTGCTCTCCGGGAGGGGGTGGGACGTGCATTCAATGGCATCGGACTCGCCGCCGCGGGTGCGGTTGCTGGCTTGGTGGCGTTGACCACATCCACTCTTACCTATGCCCAAGAAGTTCAGAATCTCGCCAGCCTCTCCAACACTTCTGTCGAGGACTTCCAGCGTCTCGCAGTAGGTGCGAAGACCGTAGGCGTCGAACAGGAAAAGCTCGCTGACATTTACAAGGACACTACTGATCGCGTAGGGGAATTTATTTCTCGTGGCGGCGGGGAGCTGCAAGATTTTTTCAAGGACATCGCACCGCAGGTTGGGGTGACTGTGGAGAGTTTCAAAAACCTCTCCGGCCCGGAGGCCCTGCAGCTGTATTACACGTCGCTGGAAAAAGCAGGGGCCAGCCAGCAGCAGATCACCAGCTACATGGAGCAGATGGCCGATGAAGCGACTGCTCTTGTTCCACTGCTCAAAAACTCCGGCCAAGGCTTCAAGGATGCTGGTGAGCGGGCGGATGAAACTGGCGCAATCATCTCTGCTTTCGACATTGGCCAGATGGTGAAGCTGAACAAGTCCGTTCATGACTTGGAAAATTCATGGAGCGGCGCAAGCCATCAGTTGGTTGCAGGGCTTGTCCCAGGCATTGAAAGTGTAACCAGCATGCTCCAGGGCATGACAGACAATGGTAGCTCTAAAGCCTTAGGCCAAGCATCGCGTTCCTTGCTGACAACGTGAACATCCTGCTTGGCGTGATCGGTACAAAGCTGACTGCAAGTTTCATCGGCTATGTGGCTGCGCTGGGCAAGGGTATCTACTCAACGGTTGAGGCTACTCGAGCGACAAATGCCCAGGCTCAAGCTGCGCTCATGGCGGCCAAAGCCGATCAGATCGCAGCAGCTTCGGCGGTAGTCCGCGCTCAGA
AAGAGGCTGACGCTGCTCGCGGCACCGCAGTTCAGACGGCGTTGTCGCTTGAGCTTGCGCAGGCAAGGATGGCAGAAACAGCAGCGACTGCCCGTCTTGGTGTGGCCCAAGCAGCAGTGAAAGCTTCGTCGGGCGTGCTAATGACCGTTCTCGGTGGTCCCGCAGGCCTGGCTGCTTTGGCTGTCGGCGCCGGTATCGCCTTCCTGACCATGGGCAGCAATGCCCAGCCGGCCGGCGCAGATCTCGATGACCTGAAAAGGCCAATCGAGGAACTGCGCAAAGCATTTAGAGAGTTGGACAAGGATCAGCGCGGAGCTGCGCTGGTCGGCGCAATGCGCGTGCAGGAACAAGCAGCCGCCGATGCCGATAAGTCCTACCAGGACTTTCTCATCACGGTGAAGAGGGGGGTGGGTTCAACAGTCGCCGCTCGTATTTCCGGCGAGGCGGACGAAGCACGGAAGGCGGGAAAGGGCCTATCCGAGTGGCTGGATGATCTCGGTAAGCGCTTCAACATCCCCCAAGAAGCGATGCGCAGCATGCGCGAAGCCGCCGGCAGTTATTCGACCGTGAGCCAGAGCGCGCGCAAAGCTGGTGAGCGTGTTTCGCTTTACAACAAAGAGATGGACAGCAAGGCCGAGTCAACCAATGCCGCCGGTGATGCAGACGCCAAGGCCACATCGGCCGGCCATACTTAGCCGCTTACCAAGCAGGGAGAGTGGTAATGCAGTTGCGTGAAAGACTTCGGGAGTTCCAGCTATGGTTCAACCCGAAGCGCAGACGGTGGGCTGGCGTGACCCTGATCGCGCTTGGGGTGGTTGGGATGTTTCTCAACCCGCAAAGCCGGTGGACCCTGGTGCTGGGAACTGGAATCTACTGGTTCTTCACGGCCCTGCCGCCCACACTCGGCGGCAAGCGTTGAGGTGCTGGCGGGCAGCGCCGGAGGGGCAGGCGGCGATCGGAAACTCTTTGCCCAGGGCCTGCTGAACTGTCGCGATGGCGCGGCAGAGGTAAGCCCAGTCAGGGTTCGGCTCCATGGCGCCAGGTCGGCGAGGTACACACTGATCGGGTCGAGGCCGTGCACCTCGGTGATCAGCAGCTTGGTGACGGTCGATGTCTCGATGTTCATGGCTTTGTCCATGCATGCGCCGCCCTCCGTGGCCGGATGCGGCATGGTGGCAATGTGAATGGCGAAAAATTGATTTGGTCTGTGGCTGTTAGAGGTGAACAAGACTGCGTCGTGTTAGATTGCGTGCATCAAACAAGGAGGTTTAGATGAGCCAGAGCAGAGACAGCAAGATCGATAATATTGAGTTCAATGTGGCGAGCATTGAGCGCAGCAAGGCTTTCTACGGTGAGGCCTTCGGCTGGAGCTTTGTGGACTATGGCCCCAGCTACACAGAATTTAGCGATGGGCGTCTTACTGGTGGTTTTACGACCGGGGAGCCCGTGCAGCCCGGAGGCCCGCTGGTGATCTTGTATTCAACTGATCTCGAAGAGTCGCAACGTAAACTTGTTTCGGCGGGCGCCCGTATCAGTCGAGAAATATTCTCTTTCCCTGGAGGGCGGAGATTCCACTTCACCGACCCCGATGGATACGAACTGGCTGTTTGGAGTACCGATATTCCAGAGGTTTAGATTTGCCCCCGATGGTTCTATGTATGCGCCGCCCTCCGTGGCCGGATGCGGCATGGTGGCAATTTGGTTTGGTTTGGGATGGGGTATTAGGGGTGACCGGCATGGGGCCGGGCATCGTCAGAGCCATAATCGCCTCCATTGCGTACGAAGGTGGCGCCAGGCCCGATGTCGAGTAGGTCGCATACTCGACAAACATCGCCAAAATGCCCAGCCATAGAGCTGGGCTTGTTCGTTTAGGGATTTACTTGAGGTGGGGCCAGACTGGTAGAATTTGCGCCCCTTGAACCTTGAGATGGATCTTATGCGACAGTTTTTGGCAATGACTTTTGGGGGGCTAACCCCGAGCTACTACGCTCGGCAGTTAT
TCTTCGGCGCGCTGTTTGGGGCTTTTTTCATTTACATGAAATCACGCGCTCCACTAGGTATTGATTTCGCAACGGTAGCCATCTCGGTGGTGAGTACCTTGCTCTATCCCTATTCTCGATTTGTGTACGAGAGCGTGGTGGGCTTCATCATGGGCCGCAACATGTTCTTCGTGAATGCGCTGCTGATGCTGTTTATCAAAGTGATGACGATGTTCATCTGCTGGTTCCTGGCGATCTTCATCGCGCCACTGGGATTGCTGTACCTCTACTGGCATCACAGTAGACAGCCTTCCAACTAAATCCTCCGAGCCCAGCCTAGCGCTGGGCTTTTTCGCTTATGTAGATGACGCTGAAAGTGCAGTGTGAGAGGGGCAAGGCGGCCGGAGCGAAAAGCGAAGAGGTCGCAACGCTGTTTAAAGGCCCGTCCCCAGCAGATCACAGGCACCCTGACACGGCTTCAGCGCCGGTCATCCGCACCCATTCATGAGCGCGGCCAAGTACCGGGATTTTTATTGCCCAAACAGGCCCTCAAGAGGGCCGGTGAGAGTTGCGCGATCAAGCAGCCTGCAGCGCGGTAGGTATCACCTGGAGTCGAACTCCGAACTTGGCCAGAGCTTCTTCCAGGCTCTCGAGCTTCGAGGTGTGCTCGAAGTCGACCAAGCGCCCGGCTGCGGTAGGGGAGATGCCGAGCATCGAGGCTAGGTCAGCCCGGGTCTTGCCGGAGCGGACCAGCTCATTCCAGAGTGCGATTTTTGCGACCGTTACGCCAGAAAGTCGAACGATGTGATCGCCGGCTTCGGTGGCAGGTGGAATTGCTCGCTTCTGGTCCACGTAGATTGACAGCGCCAGGGTGAGCCCGTCCACCGCATTTGCCAGAAGCTCCTCCAGGCTATCGCCCACGCTGTGAGCCTCAGGGATGTCTGGGCAAGACGACCAGAAGTGATCGTTTTCTTCATGAACCACGATTTTGTAGTCGTACAT
Protein sequences of DBSCAN-SWA_3 >NC_021505|2090114:2097019|2095310_2095673_+|WP_016498925.1|DBSCAN-SWA MSQSRDSKIDNIEFNVASIERSKAFYGEAFGWSFVDYGPSYTEFSDGRLTGGFTTGEPVQPGGPLVILYSTDLEESQRKLVSAGARISREIFSFPGGRRFHFTDPDGYELAVWSTDIPEV >NC_021505|2090114:2097019|2090114_2090573_-|WP_016498916.1|DBSCAN-SWA MSRGVNKVSLVGTCGQDPDVRYLPNGNAVTNLSLATSEQWTDNQSGQKVERTEWHRVSLFGKVAEIAGEYLRKGSQCYIEGKLQTREWEKDGIKRYTTEIIVDINGTMQLLGSRPQGHQTGQIPDRQPRQQRPARQQQNQQSAPPDHDSFDD >NC_021505|2090114:2097019|2090793_2091168_+|WP_016498918.1|DBSCAN-SWA MADPSLALQEAVFARLQAEVSCPIYDGAPLNAEMPYLSIDREVSVNSSPISGRKRETRLLYLSVWSDAVGQAEVKRINGEVIAALDERRLPLEVGRAVSVRVEQSDADGITYQGSLTVRVITTH >NC_021505|2090114:2097019|2091736_2092078_+|WP_016498920.1|DBSCAN-SWA MSKTLHGTTEVTVGGRVLVLSPTLKAVRTIEAYFGGLRGASERLRAVGVDAVVVVFAAGSGMEDKGAVEELAEEIWQQGVADLVPAATAFLYALYNPRGGDPGKPPAPTGSAP >NC_021505|2090114:2097019|2096593_2097019_-|WP_016498927.1|DBSCAN-SWA MYDYKIVVHEENDHFWSSCPDIPEAHSVGDSLEELLANAVDGLTLALSIYVDQKRAIPPATEAGDHIVRLSGVTVAKIALWNELVRSGKTRADLASMLGISPTAAGRLVDFEHTSKLESLEEALAKFGVRLQVIPTALQAA >NC_021505|2090114:2097019|2091223_2091724_+|WP_016498919.1|DBSCAN-SWA MPAADNLNTAAGCRLFIGGKTGADTETDYKADTYVEVGEIEDLGEFGDTFSSVNFTSLKDGRVRKYKGTADAGDLTVTVGMDSGDAGQRAVKTAHKDRSKGDYNLKITLNDGDPTATPVINPTTFYMRVKVMNNTVAPGAADNVVRRNITMGINSDVLEIPAAAAA >NC_021505|2090114:2097019|2092958_2093852_+|WP_016498922.1|DBSCAN-SWA MAETDIQGMLVRIEATTAQLRQEMARAESSVSQTSGKIDSSLKQVDEAFDRVEANSVALREGVGRAFNGIGLAAAGAVAGLVALTTSTLTYAQEVQNLASLSNTSVEDFQRLAVGAKTVGVEQEKLADIYKDTTDRVGEFISRGGGELQDFFKDIAPQVGVTVESFKNLSGPEALQLYYTSLEKAGASQQQITSYMEQMADEATALVPLLKNSGQGFKDAGERADETGAIISAFDIGQMVKLNKSVHDLENSWSGASHQLVAGLVPGIESVTSMLQGMTDNGSSKALGQASRSLLTT >NC_021505|2090114:2097019|2093848_2094733_+|WP_016498923.1|DBSCAN-SWA 
MNILLGVIGTKLTASFIGYVAALGKGIYSTVEATRATNAQAQAALMAAKADQIAAASAVVRAQKEADAARGTAVQTALSLELAQARMAETAATARLGVAQAAVKASSGVLMTVLGGPAGLAALAVGAGIAFLTMGSNAQPAGADLDDLKRPIEELRKAFRELDKDQRGAALVGAMRVQEQAAADADKSYQDFLITVKRGVGSTVAARISGEADEARKAGKGLSEWLDDLGKRFNIPQEAMRSMREAAGSYSTVSQSARKAGERVSLYNKEMDSKAESTNAAGDADAKATSAGHT >NC_021505|2090114:2097019|2094759_2094957_+|WP_016498924.1|DBSCAN-SWA MQLRERLREFQLWFNPKRRRWAGVTLIALGVVGMFLNPQSRWTLVLGTGIYWFFTALPPTLGGKR >NC_021505|2090114:2097019|2095974_2096337_+|WP_016498926.1|DBSCAN-SWA MRQFLAMTFGGLTPSYYARQLFFGALFGAFFIYMKSRAPLGIDFATVAISVVSTLLYPYSRFVYESVVGFIMGRNMFFVNALLMLFIKVMTMFICWFLAIFIAPLGLLYLYWHHSRQPSN >NC_021505|2090114:2097019|2090569_2090794_-|WP_016498917.1|DBSCAN-SWA MPTPWLALRSVRFYGRNTSYPQAARYAESLERAIEAKALPVVALVRARQVGERATRWCSLLALLFIVAAAVAQS >NC_021505|2090114:2097019|2092408_2092888_+|WP_016498921.1|DBSCAN-SWA MKRVLLVMAMGVGVLSGCKSPEDKMVSVCTDIAKMSVADPSSLLVNSGSVVQAIPTKESLFRFASLNFDGELTGDSKAWYDIQVKELDKLKESYASIDYTDKSSLARRGQAICWFMDRGDGPVLGSVSVAGKSYSGTDLLMVFVSNPRPKYLSSSNAIE |
12 | Pseudomonas_phage(66.67%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_4 |
3935972 : 3971477
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NC_021505|3935972:3971477|DBSCAN-SWA GTCAAAACTGCACATCCAGTATCACGCTGTCAGAGTAGGTTGCTGCCGGCGGCGTAGCCTGGTCGGTATAGACCTTGGCGTTGTAGTTGAACACCTGGCTACCCGTGCCGGTACCCGCACCGGGGTTTACGTCGGCATCCGTGCTGGAACGCCTGGCAGCCCCCGATGCCCCCCAACGCACCACCCCTGCACTCTTGAAAATGTCATAGGCCAGGTAGTTGTTGGCCGACGACTTCATCCGTCGCCGCCCGCCCGACACGTTCTGCCCGTCATCCAGCCCCACCGTGTAGTTGCTGCCCTTGGTGCACGACACGTTTATGCCCTGGCTCACCGTGCCAAACCCTGCCACCACAGGCGCACTGGCGAAGCTGATGTTGGGTGTGGTGATCTGGCAATCGTTGGTCACCGTCAGGGTCACCGTCAGCGGCTTGGTGCCCGTGCCTTGGTCAGTCCCGATACAGATGCCGCCTAACCCAATCCCATCGCAGTACTTCCAGGTCCACTCGACATTGAGGGTTTCCTGATACAGCCCCGCCGCAACGTTGGCGTTGATCAGAGTGCGCATGTAGATCGGCACGGCTTTGGGCGTACCGTTGAGCAGCCCCAGCAAGTCGAGGATGCCGGTGGTCCTGAATTCGAAGCCCGTACCGCGCGTGATCGGATAGGTAGTGGTGGCGTCAGCATACAGCGTGTAAGGGATGACATCTCCGGTGGGCCCTACCAGGCCGTTGGTGGCCGAGGTGATCTTCACCACGAATGAATCGGTACTACTCAGCACACTAAGCACCGAGCCGTTGCACTGCAAACCCGAGTTGGCACTGGAAGCCGTCTGCACCGTGGTTCGTACCTGGGTCGAGTTGAGCGAGCCAAATGCCGCTGCCGAAGTCGCCACTGACGTGCACTTGGCCCAGGCCGGGCCCGCACAAAGCAGCACGGTGCACGCCCATACCACGCCCCACCCGGCCGTCATTTGCACACCAAGGGGCCGACCAAGGGGATCGAGCCTTGGGCCTCAGGCAGATCGAACGCCACCTCGCAATGCCCACCACCCTCCAGCTCCACCAGCAGGCGGTTGTGGGCCGCCAGGTTTTCCAGATACACCAGGCCATCCCAGCCCACCACGGCCTGGCTGCCACTTTCTTCATGAGTGACACGGCTGCCCAGCTTGAGCACTTGCTGGTTGCCATCCACCAGTTCGATGCTGGCCGCCATTACGCGTTTGAGCGGGAACTCCAGCAAGTAGCCGCTACCCCGGCGCACGGCCACACGCTGCTCCACGTCCCCTGCAAGGATGTCCGGCGGCAGGTCCATGGGGTCGATCTCGTACTTGCCGCGGTAATAGCCACTGCTGTACGGCACCAGCAAGTGCCCCTTGGCATCCGTGCGGCCGACATCCTGGTTTTCGTAGCGCACCGGCACATCGGCGTAACCACCGGTACTGACCACCACGAAGGCATCGTCGATGCGGTTGGCGGCGAATACGCCGGCATCCATCCACACCAGCGAGCCACTGGCGTCGGCCCAGCGGGTCATCGCGCCGCTGCTGCCATACACACCGGCCTGCAACTGCACCGACTGCAAGCGCCAGGTGACGTCGGCCTGGCGGTAGGCATCGCGGTCGCTGCCTGCGGCATAGCCCAGGTTGTAACCAACACCGCCACCTACCGGCACGGCGTGGCTGTAGTTGACCCTTTGCAGGCTCTCGCCTTCCTTGCTGCGCTCCATGCCCAGGGCCAGGGTGCCGTGCAGGTCGAACGGGATCACCAGTTGCGCCTGCACGGCCCACTGGCTGTCACCCACTTCACGGTTGGCAGACAGGTACACACTGCTGCTGCCCCATAGCGGCCTGCTGTAGCTCAGGTTGATCAGCCGGGTACGCGTGCCATCACCGGCGCGCACATCGAAGTAACCGGCACCGATGCTGC
CGTATTCGTTCAGGTTGAGGCTAAGCGTGACCTGTTCGCTGCTTTTGCTCAGTTGCATGTCCGGGGAGTCCACCCGGGTCAGGTCGGCGTAGTCGCCGTGGCGTTGCAGGCGCTGGTAACTGAAGCCGATGCGCTGGCTGTTGTACTGGTAACCCAGGGCGACCTGGTGGCCCTTGTCGCCGTCGAACCGGCTTTGCGCCAGGGCGGCGTTGAGCACGCCGAAGTTGCCCAGGCGCATGTTCCCGCCCAAGCCGCCCAGCATCAGCGATTCGGCAGTTTCGGCGTGGGTTTCCAGGGTGAACATGTCGCTGAGGCCATAGCGCAGGCTGCCCGAGGCAACCCCGGGCCCATAGCTGAAATCGCGCACCGCGTAATCACGGCGCAGGCTACCGGCGGCCACCGAGTAGTCGGACAGGCCCTTCTGCAGCAGGCTGCTGGTGACATAGAACGGCAAGGTGGTCGACACCTGCCGGCCAAGGGCATCAGTGGTGACCACCACCGCCTCCCCGGCGCCGTTGATGAACGGCACGTTGGTCAGGGTGTACGGCCCAGGCTGCAGCTCGGTGGTGCTGGACTTGAAACCGTTGATGAACAGGTCGAGCGAGGTCGGCACCGCGGCTTCGCCGGCGAACGCCGGTAACGGGTAAGTGACCAAATCGGGGCGTGCAGCGAAGTCGCGTGACACTTGCAGGCCGCCCACCCGCACCGAACTGCTCCAGGGCAAGGCACCGGTCACGAAGTCACCGGCTTCGTAGGTGAGCAGGCGCTGCTCGTCGGTGAAGCGCCAGGTGGTGTCGTAACGCATGAAGCCCTGGCGGGTATCGTCCGCCTGTGCGCCATTGAATGACTGGCGCCATTGCCCGGTACTGGAGAACGTCCCCCAACTGTCGAACAGGCGCAGCTCGTTCCAGGCGGCCAGGTAGGTGCCGCCCTCATCGGTGTCGTTCAGATACAGGTCATAGTTGAACAAGGCGCCGAAACTGCTGCGCGCATCGCTGGCCGGGTACAGGTTGCGATCACCCACTTGCTGGTCTGGCAACCAGGCCGGCGGCACCTGCAGCAGCAGGCGCTGGTTCTGGCTGTCGTAGTCGGCGTGCAGCCCCGTGAAGCTTTCCAGGGCCACCTCGCCCTGGGGATTGCCCGGCAACGAGATACCGGCTGCCCGCAACACGTCGCTGCCCAGGTACAACTGCCCGGCCCGCTGCTGCACCGGCACCAGTTCAGCCCTGGGCATCTGGTTCACCACCAGGTCCAGGTACAGGGTAGCGTCGGCGATGGCGGACATCTCCGTGGGCGGCGGGGGCAGGTCGTCGGCCAGCGCCAGCACAGGGCTGGTCATGACCAGCCCCAGGCACCAGCGGCACGTCGTCGCCGAGAGCCATTTCACACACGGTTTCCAACCATTGCCGGGTCCTCCCTGGCCCCTGCACTGCATACAGTCGTTGACTCATTGGCCCTGGCGGATGGCGTCCGCCGCGCTCTGGCCGTTGACCCTGCCCTTCAGCACGCTGCCGCTGGTGGAGGCGACGGGAGCAGGCCAGCGCATGCTGGCGCCCGGCAACACATAACCCAGCAGGCCTTCAGCCAGTGGCTTGCTCTGGCTGCCTTGCTGCACCACCACATCGGTCAGCCGCGCATGCACGGGGCCTGTGTTGCGCATTTCCACGTAAGGTTTGCCCTGCACCGTCACCGGCCGCCAGCTGAGCTGCGGCATGCCCACACCTTCGGCATTGCGTTTGCCTTCAGGGTCGGCCTTGCCCAACAGCCCCTCGCCATAGACGAACAGCGGCACCGAATAACGCATCTGCAAACGGATGGCCGCAGTAGCGCCGGGCTCGGCCTTGTCGACCGGGATGGCGGGCGGGATCTCGTCGATGATGATGCGGTAGGCCTGCTCCTGGCCGGCAGGTGAAGGGCCGGTACGGGTCAAGCGGATCAGCTGCTTCTGCCCGGGGGCAATGTTGGCCACCGGCGGGCTGCCGATGATCTCGCGCTGCGCC
TGGAACTGTTCCTGGTAATCACCCTGGCGCCAGGCGAATACCCGTACCTGCAAATTGGCCGGCTCAGTGCCGCGGTTTTCCAGCCACAGTGCACCGGCCTTCTGGTCGGCCTCCAGCACCGGGTCGATCGGCCAGATCAGCACCGAGGTGGCCGCCCCGGCCGGCAGGCTTGCCAGCCACAACAATCCGATCAATCCACGCGCCCACTTCGCGCCTGCCCGCATGCAGCTCTCCTTAGACACTTGTCTTTGGTGCGCTGGCCTTCACCAGGTCACCGTCACTTGCACCACATCGGTGTAGGTCCCTGCCGGCAGCACGCCGGTCAGCTGGACTCGCCCGTACACCGGCAGCTTGATCGCCGTCGGGTCGGAATAGCTCACGGCCACGCTCTGGCCGATCCCCAAGACCTGGCTGTACGCCGCGTCCCGGTATAACTGGTAAGCCAGCACTTGCGTGCCGCTGGTACGTTTCAGGTTGCGTGTGCCACTGGCACTGTTCTGCCCGCCGTCCACGCTCATGCTCATGGCCACGCCCGGCGTGCACTGGAAGGTCACTGTCGAGCCACCCAGCGAGGTGCTGAGCAACGCTTTGGACAGTGCCGACTGCGAGCCATAATCGAGCGTGCCGTAACTGGTGACACCGCCCACCACCAGGCACCCGGCGACAATCTGCGCCGTTACCGTGAAGGTACTGGTGGTCACCGCCCCCAACGGGGCAGCCAACAGCATGCCGATTCCGGTCAGCCCACCAGCCAGCCAGCCGCGCATGGCTCAGAACGACAGTTCGACCGAAATCGTGTCGCTGTAGACCCCTGCAGGCAGGCCCGCCTTGCCCACGGCCTTGCCGTAAAGGTTGACGGTCTGGGCCACGCCAGTGCTGGTCGGCAGGGTGATGGTGCCATCGATGGCCAGCAGCGTCGTGCGCCCGGTGTCGGTGTACAGGTCATACGGCACAAAGTTGCCGGAACCATCGGCGAGCGCACGCGTGCCACCGCTCGACTGGCCGTCATGCAGGCCTGCGCGCACTTTGATCGCAGGCACCGTACCCGCCGAACACAAGATACTCATGGCACCGCCCCCGCCTCCCAGCACCTGGGCATTGGCAGTGACGAACAAGGCATCCTGGGTACCGAAGTTCAAGGCACCGAAGTTGAGCCCCGACGTGCCAGAACTCCCATTTACCTGGCACGCCGAGATCAGCGTCAGGGTCGAATTGATGCTGCCGGTCACCGTGGCAGCCTGGGCCTGGGAAGCCAAGGCCAGGCCCAGGCCTGCGAGCATGCAGCATGAAAGGTTCGTTCGCATCGGGGTCTCCTTGCGTAGTTACCAGTCCAGGGTCACTCTGAGGGTGTCGCGGTAAAGCCCGGCCGGCAGTGCGCGCGGTTGCGCCACCACCACACCGTAAATGGGGATCGGTACTTGCTGGGTGCTGTTGATGGTGAAGGCACGCGCCTGGCCGATGGCGTAGCGGCTGTTGCCGCCCGGGTCGACCGCCAGCTGATAAGGGATCAGTTCCCGGCCGTTGCTCAGCCGGCGCACGCCGTCGTCGCCATTCAGGCCGCCATTGATCCGCACATTGAAGGCCCTTACTTCGGGCGTGCAGCTGATCTGCAGGGAGCCCGCGCCGCCCGCTTCGTCAACCCGGCTGCGCAGCGGCTGGTCCCAGTTCGGGCCACGCTCGCCAAAGTCCAGCAGCCCCGGGTTGCCCAGCACAGCCGGTTGCGTGTCGTCGCTGCTGATCTGGCAGGCTGCGCTGATCACCAACCGCGCCTGGATGAAGCCGGTGGTGCTGCCATGCGCCGCACCACCAGGAACCAGCAACGGGCCCAGGGTAAGCAACAGGATCGAAGTGCGGTTCATGGCGTGGCGTCCTTGTGCATGGTGGCGTCCCTACCAGGTGACCGTGACTTTGAGCAGGTCGGCGTAAACCCCTGCATTCGGCACCCAGGCCAGTTTGTCGATGCGAGCGTACAACGGCAGCTCGACCGAGCCGCTGCTCGG
TACCCTTGCCGACTGGGCGACCCCTACCGCCAGAGGCTCACGCCAGGCTGCGTCGCGGTACAAGCGATAAGGGATGGGCCTAGCCGTGTGGTCGTCGCTGGCCAGGTAGCGCAGCTCGCCGACGCCACCATGCTGGCCGCCATCGACGCGCACCTGATAAGGGGTGTCCGGGTTGCACTCAAGGCGCGGTGGGCGCTGGCTTAGCAGAACGCCGCTCAACGGTGCGCCGGGGCCGTCCAGGCGTGCCGCCGTGCCCAGATCAATGCGTCCGAGGGCCTGGGCTCCCGCGTCGCGGGTCTGGTTGACCAGCATGCAACCGCGCTGCACCAGCACACGCACTTCCACGAGAAAATCCGCTGCCACGGTACTGCCGCTGAACAACAGGCCCAACAACGCGGCCATTAAGCGCTCCGTCACCTTTTTATCCTTGTCGAAACGTCCTGAACAGAGCGTAGCAGCGAGGCAGTGCAACCGTGGTGGCCCCAGGCCGTTTTTTGGTATTAGCGATAAAGCGATAGCGCCCCGTTAAAAAAAGCCGGAACGGCGCAGAACCCCAGGCCACCTCCGTGTCTCCAATGGCCGAGGGCCGGTAAATGTGCCCGGTTTCTGGTAGGGTAAAGCACGTTCCTCCAGCGCCGGGCGCCAATGCCATACATGGACGGACATGACAGGAGCGTTTTCATTGATGACTGCAGACACCACGCTGGCAGAAGCCATGGAGCGCTGTGCCCGGGAACCGATCCACGTGCCGAGCAGCATCCAGCCCCACGGTTTCCTGCTGGTGCTGGACGCCACCGACCTGTGCGTGCTGCAGGCCAGTGAAAACGTCGAGCACTGGCTGGGCTTGCCAGCCCGTGAACTGATCGGTTGCCCCTTCGCCAGCCTGGTCAGCGACGGTTTTGACCTGCACGCGCAGCTCGCAAGGCTGCCCGAAGATGAGGTCTTCCCCTTCCACATCGGTGACGTGCGCTTGCGCCAAAGTGCGCCGTACAGCACCCCGCTGCATTTGCTGGTGCACGGCCACGACGAAGTACTGATTGCCGAATTCGAACCCCCTCGGCTGTCGCCGGAACTGACCGGGCAAGGTGACTATTACCCGCTCGTGCGCAGCTTTGTCGGCAGCCTGCAACTGGCCAGCAGCCTCGAAGACCTGCTGCAGCAAACGGTGCTGCAGCTCAAGCGCATCACCGGTTTTGGCCGGGTCAAGGCGTACCGTTTCGATGCCGAGGGCAACGGCCAGGTGCTGGCCGAGTCCGCCGACCCAGGCTACCCCGCTTACCTGGGCCTGTGCTTCCCGGCTGCAGACATCCCGCGCCAGGCGCGCGAACTGTACCGGGTCAACCGCATCCGCGTGATCGAGGATGCCAACTACCAGCCCTCGCCCCTGCTACCGGTCATCAACCCGCGCACCGGCAAGGCACTGGACATGAGTTTTGCCGCACTGCGCAGTGTGTCGCCGGTGCACCTGCAGTACATGCGCAATATGGGCACACTCGCTTCGATGTCGCTGTCGATCGTGGTCGACGGTGAGCTGTGGGGGCTGATTTCCTGCCACCACCAACAGCCACGTGCGGTGGACCTGCGTACCCGCACGGCCTGCGAGTTGCTGGCCAGTGTGCTGTCGCTGCAGATCGAGTCCCGCGAGTCCCACGCCAGCACGCGCAAGCTGCTGGCGTTGCGCCAACACATCGTGCGCATGATCTCGTCCATGGCCGACCACGACAGCGTCAGTGACGGCCTGCGCGACCTGCCCCAGGTGTTGCTGGCCTTTGCCGGCGCCCAAGGCGCCGCAGTTATCTCGGCCGAACGCTGTGACCTGATCGGCCAGACCCCGCCTGAAGCCCAGGTGACCGCGCTGGTGCACTGGCTTGGCCAGCGCGGCGAAGACACGGTGTTCCACAGTGACAACGTGCGCCGCGACATCATTGACCTGCCCGAGCTGGCCACCCACGCCGGTGGTGTGCTGGCGGTGGCCATCTCGCAGATCCATTCGCACTACC
TGCTGTGGTTCCGGCCCGAACAGGTGCGCACGGTGAACTGGGCGGGCCAGCCGACCAAGCAGGTCGGGCCGCAGGGCAACCTCGACCCACGGCACAGTTTCGAACGCTGGCAGGAAGAGCTGCGCGGCTATAGCGAGCCTTGGGACCCGTTAGTGATCGACGGCGTGCTGGAACTGCGCACCGCCGTGCTTGGCATAGTCCTGCGCAAAGCCGAAGAACTGGCACAGCTGGCGGGTGACTTGCGCCGTTCGAACAAGGAGCTCGAAGCGTTTTCCTACAGTGTTTCCCACGACCTGCGCGCGCCTCTGCGGCACATCGCCGGCTATACCGAACTGCTGGGTGAAATGGAAGGCCAGGGCTTGACCGAGCGCGGCAAGCGGTTCTTGCAGCACATTGGTGAAGCCGCGCACTTTGCCGGCAGCCTGGTCGACAACCTGCTCAACTTCTCGCAGATGGGCCGCTCTGCCTTGCGCCTGTCCGACGTAGACCTTAACGCGCTGGTCGAGGCCATTCGCAGCGAACTGGCCCCCGATTACGAAGGCCGCGCAATCGTTTGGGACATCGCCCCGCTGCCCAAGGTGATCGGCGACCCGGCGTTCATCAACATGGCCTTGCACAACCTGATCGCCAATGCCATCAAGTACACACGTGGCCGTACGCCTGCCCACATCGGCATCAGTGCCGTGGAGCATCCTGGCGAAACCGAGATCTGCATCCGCGACAATGGCGTGGGCTTCGACATGGCCTATGCCAACAAGCTGTTCGGCGTATTCCAGCGCCTGCACCGCATGGAAGATTTCGAGGGCACAGGGATTGGCCTGGCAAGCGTGCGCCGCATCATCGAGCGCCATGACGGCCGGGTCTGGGCCGAAGGCCAGATCGACCAGGGCGCCAGCTTCCACTTCACCCTCCCCCGAAACACTGCTACATGAGGCACCGCTACCATCATGCTCAAACCCATCCTGCTGGTCGAAGACAACCCCCGGGACCTGGAACTTACCCTGCTGGCCCTGGAGCGCAGCCAGTTGGCCAACGAGGTCATCGTGCTGCGCGACGGCGCCGAGGCACTTGACTACTTGCTGCGCCGCAACACCTATGCCGACCGCGACGACGGCAACCCCGCCGTCTTGCTGCTGGACCTGAAGCTGCCCAAGGTCGACGGCCTTGAAGTGCTCAGGGCAGTACGCGCCACCGCCGAACTGCGCAGCATCCCGACGGTGATGCTGACCTCCTCGCGGGAGGAGCCCGACCTGTTGCGCGCCTATGAACTCGGGGTCAACGCCTACGTGGTCAAGCCGGTGGAATTCAAGGAATTCGTCGCCGCCATCTCTGACCTCGGGGTATTCTGGGCAGTGCTTAACGAACCGCCACCCGGCTCACTGCGGCTGAATCGTCGCGGTAGCAACTGAGGCCGCCGCAACGATGCAGCAAACGCCGTTGAAACTACTGATGGTCGAAGACAGCTCGATGGACGCCGAGCTTACCCTGATGCGTCTGGAGCGCAGCGGGCTGCATGTGCAGTCCCAGCTGGTATTCGACCATGTGGGTGTCGACCATGCCCTGCGCGAGGCCCGCTACGACCTGATCCTGTGCGACTGCGTGCTGCCGGGGTCGTCTGGCACCGAGGTGCTGGCCATTGCCCAACGCCTGGCACCGGACGTTCCGTTCATCTTCCTCTCCGGCATCTACGGCGAAGAGCACGCGGTGGAAATGATCCGCCTGGGCGCCACCGACTATGTGCTGAAGAAGAACCTGCCACTGCTGCCCAAGGCGGTACGCCGGGCGCTGACCGAAGTGCAGGAACGCCAGCGCCGGCGCCGCGCCGAAGAGGCTCTGATCGACGTCGAGGCGCGGGCACGTATCGCCATCGACGCCGCCGGCATGGGCACCTGGGACCTGCGCCCGCAGGAAGGCCTGCTGCTGTGGGACGACCGCTGCAAGACCTTGTTCGGCCTGCCTACCAGCACCGAAATGAGCCTTGAGGTATTCCTCGGTGGCAT
CTACCCCGATGACCTGCCAATGGTGCGTGAGGCGGTGGAATACGCCATGCGCCCGGAAAGCGGCGGCCGCTACCGCGTCGAATTTCGCATCGCGCAACCCAACGGCCTGGAGCCGCGCTGGTTGCTCAGCAGCGGCCAGAGCCAGTTCGTCGATGGCCAGTGCGTGCGCTTTTCCGGCGTGCTGCAGGACATCCACACCCAGCGCCTGGCCACACAGGCGCTGCGTCAGCTCAACGAGATGCTGGGGGAGAGGGTCGAACGCCGCACCCGCGAACGCGACCGTGCCTGGGAGCTGTCGCAGGACCTGCTGGCGGTGCTGAACAAGGACCTGACCCCAGTTGCACTCAACCCCGCCTGGGAAGCCAGCCTGGGCTTCTCCCGCGAGCGCCTGAGCCAGTCGTCGTTGCTGCATCTGCTGCCCGAAGCCGACCAGGAACTACTGCTGACCGAACTGGCTGCCCTCGCCCATGGCCGTACCAGCGTACGTTTCGTCGGCCGCATCCTGCATGCCGGCGGCCAGCAGCGCTGGCTATCGTGGGTGGTGGTGCCGGAAGACACGCTGCTGTATGTGGTGGCACGCGACATCACCAGCGAGCGCGAAGCCGCCCTGGGCTTGGCAGAAGCCAACGCCCGCCTGCGCGAACAGATCAACGAACGCGAGCGCATCGAGGCGGCGCTGCAGCAGATGCAGCGGCTCGAAGCCGTCGGTCAGCTGACCGCTGGCGTGGCCCACGACTTCAACAACCTGTTGACGGTGATCCTTACCGGCGCCAGCTTTCTTGAGCGCGACCTGGCCAAGGCCAACCTGGACAAGGCCCGTACGCGGCTCACCCACATCCGTGAGGCGGGCGAGCGTGGCGCCAAGCTGACCTCGCAGCTGCTGGCATTTTCCCGCCGCCAGCGCCTGGAGCCGGTAGCGCTCAACCTCAACCAGACCCTGGCCGGCCTGGAAGAGCTGTTGCGCCGCACACTGGGTGGCAACGTCTCGGTGCGCCTGGACCTGGACCAGGCCCTGTGGCACGCACTGACCGACCCCACCCAGACCGAGATGATCATCCTCAACCTGGCGATCAATGCCCGCGATGCCATGCCTGACGGCGGCCAGCTGACCCTGACCACCCGCAACACCCGCATTGACAACCGCCCGCAACGCCCGGAAGACCCGGACCCGGGCGAGTACGTGATGCTGAGCATCCGCGATACCGGCTGCGGCATGAGCGAAGACGTGCTGGCCAAGGTGTTCGAGCCGTTCTTTACCACCAAGGACATCGGCAAGGGCTCGGGCCTGGGCCTGGCCCAGGTGTTCGGCTTCGCCAAGCAGTCCGGGGGTGGTGTACGCATCGATACCTCCCCGGGCCGCGGCACGCAGGTGGCGGTGTACTTGCCCGCCGTAAAGGACCAGATCGTGAGCGAGCCGGTGATCCCGCCGCTCGGCCAGCCGGTCAGCGACAGCGGCCGCAACCGCACGGTGTTGCTGGTGGATGATGACCATCTGGTGCGGGACTTGCTTGGCGATGTGTTGCGCCAATATGGCTACCAGGTGCGACAGGCGCACAGCGGCGAACAGGCGTTGGCCTTGCTGGACGACGAGATCGACCTGCTGCTTACCGATTTCGCCATGCCCGAGTTCAACGGCGCGCAACTGGCGCTGGCGGCACGCGAACGGTACCCGCGCCTGCCGGTGGTGTTCCTGACTGGCTATGCGGAGTTGCAGGGGCTGGAGTTGCCGGGCAGCGTGGTGGTTCAGAAACCGGCGCAAGCGGATGAACTCGCCAGGGTGCTGAACGAGATGCTGGGCATTGCCGGGTAATTCGGGAAAGCACAAACCTTGGCAGGAGCGGCCCGGCAAGGCCGCTCCTGCAAGAATCGTGTGACGGAATTAAAAATACGGAAACGACAAAACCCAGCGCTTATCTTTCTAAGCACCGGGCGATATACGTTTGAAACTGGAAAATGGCAGGGGCGGCTGGATTCGAACCAACGCATGGCAGGATCAAAAC
CTGCTGCCTTACCGCTTGGCGACGCCCCTGTATCGGATGCAGCATTCGCTGCTCTGACAATTCCAACTGGGTGACAACTGGGTTGTTGTCTTGGAATGCGCGGCACTTTAACAACAAAGTTTTCATCTGTGAACCCCCTGATGAAAAAAAATTGTCAAAAAACAGCGGGTTAGTCATTTCGGCGGTTTGTGCAGTGACCATCCGTCGCTTTCCCCCCGCTTCGCTTCAACACAATCGCATTCCTCCCTTCAAATCTATTGAGCCAATTGCATTGACCAAGGAGGACCCCATGCCTGTCTCTCACGACCTGTACCAGGACCTGCACTACCCTCGCGAAATCGTCCAGCAGCGCCGCCAGCAGGACAAAGCGCTGGATCGGCTGCTGGACGAGTACATGGACATCGACAACCAGGTGCTAGCCGCCGAATCGATCTCGGCGGGCAACTTCGTTGACGAAGACCTGCGCCACCTCAAGGAGCGCCGCCTGGCGGTGAAATACATGATCGAGCGCCAACTGGAACGTAAAGCCTAGGCAATGCAGGTGTGCAGCCAGACCTATCGCATCAGGTGATCATTGCCGATCACTTTCTGTATTAGTCAAAATGGCTGCCCTCGGGCAAGCTGCGCGCTGCCCCGTCATCGCCCAAGGAGCTTTGCTTTTGTCTGCCTGGATCCCTGCCCCACTGCGCCAGCTTGGCCAGCGCCTGCGCGGCGCCACCGCCAACCAGAGCGAACTGCTCGACTGGTTCGAGGACAAAGCCCGAAGCCGCGGTTACCAGCTGAGTGACGGCCAGCGACGGGTGATCCACTGCATGGCCGAACAACTGGCACTGCTCGAACAAGGGCAGCCACGCAGCCTGTACCTGTACGGTTCGGTGGGGCGCGGCAAGAGCTGGCTGCTGGACGGGTTCTTCCAGGCGGTGCCGGTTGAAGCAAAGCGCCGCCTGCACTTTCATGATTTTTTTGCCCGCCTGCACCAGGGCATGCACCGCCACCGGGCGCTGGACGATGCGCTGGGCGCAACCCTCGACGAACTGGTGGGTGGCTGCCAGGTGTTGTGCTTCGACGAATTCCACGTGCATGACATCGGCGATGCCATGCTGCTTACTCGCCTGTTCAACGCCCTGTTCGCCCGCGGCGTGTTCCTGCTGGTGACCTCCAACTACGCGCCCGAAGGGCTGCTGCCCAACCCGCTGTACCACGAGCGTTTCCTGCCGGTGATCCGCCTGATCAACGGGCGCATGCAGGTTCTGGAAGTGGGTGGCGGCACCGATTTTCGCAGCTTGCCGGCCAACCGCGAGCACCAGCGGTTTACCCAAGGCCACTATGTATGGCCGGGGACGGCAGCGCAGCGCCAGGTACTTGGCGTGCCGGACGAGCAGCCGGTAATGCTTGAAGTGAACAAACGCCCGCTGCGCGCACTGGCCATCGATGGCCGGCGGGTGGTGCTTGGCTTTGACGACTTGTGCGAAAAGGCCACGGCGGTGATCGACTACCTGGTGCTGGCGCAGGAGTACGATGAATGGATCATCGATGGGCTGGATGACCTGGCGCAGTGTTCGCTGGCGGCGCAGCAGCGCTTCGTCAACCTGGTGGATGTGCTGTATGACCAGGACCGACAGGTGACAGTGGTTGGTAAACGGCCGCTGGAAGAGAGCCTGGGCGGGCCGCTTGCCGACCTGATGCGGACCCGCAGCCGGTTGGGGCAACTGCACCAGGTAGCCCCCTAGCACCGCATTGCCCGCCCCGGCCTCATCGCCGGCAAGCCGGCTCCCATAGGTACCGCGCAGGCTTCAGCGCCTGGGGTCAACCTGTGGGAGCTGGCTTGCCGGCGATGAGGCCCTGAAGGTCAAAGCATCAACCCGCCCTGGGCAAATCCGCCAGCACCGCCTCGATCTCCCCCAGCACCGCCGGGTCATCCAGGGTCGAAGGCGGTACATAGTCCTGCCCGTCGGCAATCTTGCGCAGCACCGCGCGCAAGATTTTCCCCGAGCGGGTCT
TGGGCAGCCGCTTCACCAGCCGAACCCGGTTGAAACAGGCCAGCGGGCCAATCGCTTCGCGCACGCTGCCCACCAGGTCTACCAGCAACTGCGCCTCGGCAATGCCCTCGCCGTCCTTGAGCACAACGAGCGCCAACGGCACTTGCCCCTTGATTTCATCGTGCACGCCAATCACCGCACACTCGGCCACCGCCGGGTGGCAGGCCACCAGGTCCTCCATTTCGCCCGTGGACAGCCGGTGGCCAGAGACATTGATCACGTCATCCGTGCGCCCCATGATGTAGACGAAACCGTCATCATCCAGAAAGCCGCCGTCACCCGTGTGGTAATACCCCGGGTAAGTGCGCAGGTAAGCCTGCAAATAACGCTCGTGATCGCCCCACAGGGTCTGGCTGCACCCAGGCGGCAATGGCAAGGCAATGACGATCGAGCCCTGGTGGTTGGGGCCCAGCAAATGGCCCTCGTCATCCACCACACGTACGTAATAGCCCGGAACCGCCCGGTTGCTTGAACCCGGTTTCGCCGCACTGCCGTCCAGCCCCACGCAGGGCGCGGTGACCGGCCAGCCGGTTTCAGTCTGCCACCAGTGGTCGTGCACCGGCTTGCCGCTGACCCGCTCCAGCCATTCATGGGTGCTGGAATCGAGCTTCTCACCCGCCAGGAACAGTTGGCGCAGCGAGCTGAGGTCGTGCTTGCGGATCAGTTCGCCCTCCGGGTCTTCCTTGCGGATGGCACGCATGGCGGTGGGCGCACAGAACAGCGCGTTGACCTTGTATTGTTCCACCACCCGCCAGTAGGCCGAGGCGTCCGGCGTGCGGATCGGTTTGCCTTCGTAGAACACCGTGGTGCAGCCGCTCATCAGCGGCCCATAGACGATCAGCGAATGGCCGACCACCCACCCCACATCGGAAATGCCCCACCACACGTCCCCGGCCTGCATGCCGTAGATATGGCGCATGGCATAGCATAGCGCCACGGCATTGCCGCCATTTTCGCGGACGATGCCCTTGGGTTTGCCGGTGGTGCCAGAGGTGTACATGATGTACAGCGGGTCGCCCGCATCCAGCTCGACCGGTGCCACCGGCTGGGCGCCGACCAAGGCTACCTGCCAGTCCAGGTCGCGGCCTGGCTGCAGCTGCGCCTGCGCCTGCGGCCGTTGCAGCACCAGCACATTGCGCGGCTGGTGGCGGGCCAGTTGCAGGGCGCGGTCGACCAGCGGCTTGTAGGCAATCACCTTGTCGAATTCCAGCCCGCAGGATGCCGTCAGCAGCAGCGTGGGCCGGGCGTCATCGATGCGCAGGGCCAGCTCGTTGGCGGCAAAGCCGCCGAACACCACCGAATGCACCGCGCCAATCCGCGCGCAGGCCAGCATGGCCATGGCCGCTTGCGGCACCATGGGCATGTAGATGATCACCCCATCGCCCTTTTCCACCCCCAGCTGGCGCAGCAAACCGGCCAGGCGCGCCACTTCGTCGCGCAAGTGGTTGTAGGTGTAGGCCTGCTGCACGCCGGTGACCGGCGAATCGTAGATCAGCGCCACCTGCTCGCCACGGCCCAGCTCGATCTGGTGGTCGAGGGCCAGGTAGCAACTGTTCAGACGGCCATCGGCGAACCAGCGGTGGGTACCGTCGGCATTGTCTTGCAGGGTCAGGGCAGGCTTGCGGTGCCAGGCCAGGTGCGCGGCCTGTTCCGCCCAGAAAGCGGCAGGGTCGGAAATGGAATGGGCGTAGCTGTGCTGGTAGCTCATATCGAAGGGAACCCGGTACTTGTTGTTATTGAAGGGCGAAACCTGAGTATGGACCGCTACACCCGCGCCGCCATGGGACTAAAGTCACAACCGACCTGCAAATTTGCAGACAACGCAAAAGCGCCCCATAACGGTGCAAAGCTTCTAGAATAGGCCACCCTTCAGCCACCGAGCAGCCCATGGATATCGATCAGGCCCGAACCTTCCTGGAAATCGTGCGTTGCGGCAGCCTGGTCGCCGCCGCCGAACGCCT
GTTCGTGTCACAGACGGCAATTACCGCCCGGGTGCAGCGTTTGGAGCAGCAACTGGGCTGCCAGCTATTTGTGCGCAGCCGCAACGGCGCCAGCCTGACCAGCGACGGTGAAGCATTCGTCAGTTACGCCAACCAGCTGGTACAAACCTGGGAAGCGGCGCGGCGCGACCTGCCGCTGCCCGAGGGCTGCCAGCAAGTGCTGCATGTGGGGGGAGAAGTGAGCCTGGGCAACCCGATGATGCTTGACTGGATCAGTGCCCTGCACCGTGAGCTGCCCAGCCATGCCATCCGCAGCGAGGTAAGCGACGGCGAGTCGCTGCTGCGCAAGGTCGAGATGGGTTTGCTGGACGCTGCGCTGGTGTATCAGCCGACCTACGGGCCAGGCCTGCAGGTGGAACAGTTGATGGAAGAAAAGCTGATCCGCGTGCGCCGGGTGGACCAGCCAGAGCCTTACATCTACATCGACTGGGGCGAGGCCTTCCGCCGCCAGCACGATGCCGCCCTGCCCGATTGCGCCCGCCCGGCGCTCAGCTTCAACCTGGGCCCCCTGGCCTTGCAGTTCATTCTCGATCAGGGCGGCAGCGGTTACTTCCGCACGCGCGTGGTGCAGGCGTACCTGGACAGCGGCGTGTTCGAGCGGGTACCGCAGGCGCCGGAGTTCACCTACCCGACCTTCCTGGTGTACCCACGCAAACGCGACAGCGAAGCCCTGCAACAGGCCTTCGTCATCCTGCGCCAGCTGGTAGCCGCCGGCGCCAGCGACTGGTCACAACGCTGGGACCCGGTTATCTGAAGCCTCGGCGTTGCTGAGCAGCTGGCGCTTGAGGCCAGCGATCAGGTCGGTCTGGGTGATTACCCCCACCAGCTTGCCGGCATCGAGCACCGGCAGGCAATGCAGGCCCTGCTCGCACAACAACGGCAACAGGCGCTCCAGCGGATGCTGGCTACTGACACTGACCACCCGGCGGCTCATCACCTGCTCCATGCGTACCGCTTTGCGGCCGAACAGCCCGCGCCAGCTGAAGCGGCCACGCTGCATGGCCGGGCCAACCAGGTCGCTGAGGCTGACGATGCCCACCAGCTTGCCCTGCTGCAATACCGGCAGGGTTTTCAGGTGGTGGCTGGCCAGCATTTTCCAGGCCTGCTCAAGGGTGGTTTCGGGCGTGGCGAACTGCACATCGCGCGACATCACCGAACCCGCGGTAATACCGCCAAGGCTGCGCTGCAGGGCGTGCTGCTCGGTGGCGAGGATGATGCGTTCCAGCTCGTCACGGGTGACGTCGACGAACTCACCCAGCTCTTCCAGGGCCTGATCCAGGTCCTCGCCACGGATGCCGACCCGCTCACTGGGCAGGGGGTCGTGGGTATGGTGCACATCCTTGCGCGGCGCTACGCCTTTCGGGTAGCGCACACCGGTCAGGCGATTGTAGAGCACCGCCACGCTCACCAGGATCAGCGCGTTGAGCAGAATGGGCTCAAGCAGGTGATCACCCATGGCGGTCAGCCCCGAATCAGCCAGCACTGCGCTCACTGCCACCCCGCCACCCGGCGGGTGCAGGCAGCGCAGCAGGCACATCACCAGGATCGAGACGCCCAGGGCCGCTGCGGCCACCCACAGCTCCGGGCCGAAGCCCTGGCGCATGGCCAGGCCGACCGCGCCGGCCAGCGCGTAGCTGCCCAGCACTGGCCAAGGTTGGGCCAACGGGCCGGAATGCACGGCGAACACCAGCACTGCCGTGGCCGCCAGCGGGCCCAGCAGGTGCAAGGCGATGCCGGGGCCATAGGCCATGCTGGTCAGCCAGCCAGCGAGGAACAGACCGAGCAACGCACCGATACCGGCACGCAACCATTCTTTTGGAGAGGTATTCAGGGGCGCCGGCAACAGGCGCTGCAGGCGGTTTTCGGAACGCGAGGCAGACATTGGGTAATCGGGGTCTTTGGTTTTTTCTGATTAAAAAGAAAGCCCAGCCGAGCTGGCCTGGGCCTCGTATTGCGCGTGC
AATACCACAAGGGGCTCCTTCCATGGAGAGATGGAAACCGCAGGGTTTCGTGGGGGGATTTTGCCGGGCGGGGGGGCATTTGGGCCAATTCAAAAAACTGCGGCTGTACTGCAATTTTTTTGCAGTACTGATACGGCCTAGGCGGATTGGGCACACCTGTGGGAGCGGCCTCGTGTCGCGATGGGCCGCCTAGCGGCCCCGGCATTACCGAGCCTTGCGCGGCAGGTCGATGCTCACACGCAATCCTCCCAGCGGGCTCTGCTCCAAGGCCAGACGCCCGCCCCAGGCGTCGACGATATCACGCACGATACCCAACCCCAGCCCATGCCCGTCCACCTGCTCGTCCAGCCGCGAGCCCCGCTCCAGCACCTGCAGCCGTTGGTTCTCAGGGATGCCTGGGCCATCGTCATCGACCCACAGCTGGTAACCCCCGGCATTCGGCGCAATGCTCAGCCGCACCTCGCTGTCCGCCCACTTGCAGGCGTTGTCCAGCAGATTACCCAGCAGCTCCAGGAAGTCCTCGCGGTCCCACGGCAGCAGCAACCCGGGCGGCACGTCCCTGGCCAGTAGCAAGCCCTCTCCGTGAATCATGCCCAACGTGCTCAGCAATCCGGGTAGCTCGGCGTCGCAATCGAACTGCGCGCCAGGCAGCGCATCCCCTGCCAGGCGCGCACGGTTCAGCTCCCGCGCCAGCCGTTGCTGAATTTGCTCCAACTGCTCACGCATCTGCGCGCTTACCTCCGGCAGATCCTTCAGGCGCGCGCTGGAAGCCAGGTTCAATAGCACCGCCAGCGGCGTCTTGAGTGCATGCCCAAGGTTGCCCAAGGCATTGCGCGACCGGCGCAAGCTGTCTTCGGTATGGCTGAGCAAGTGGTTGATCTGCCCCACCAGCGGCGCCAACTCGCTGGGCACCTGCTCATCCAGCTGCGAGCGCTGGCCCTGCTGCAGCTGGGCGATCTGCTGGCGTGCCCGCTCAAGCGGGCGTAACGAGCGGGTAACCGTGATGCGCTGCAGCACGAGAACCAGAATCAGCGCAACAAGGCCCATGCCCAGGCCGATCTGCTGCATGCGCCGGAAACCTTCGCGCACCGGCGAATAGTCCTGTGCCACGCTGATCGATATGTCCTGGCCCAGGCGCCGGTAGTCCGCGCGCAGGGCCAGCAGTTGCTGCCCTTCCGGGCCGAGTTCATGGCTATCGGAAAGCCCGGGCGCCGATGGTTTGGGCATGTCAAGGTCCCACAGCGAACGAGACCTCCAGGTGCCTTTGTCAAAATCGATGCGGAAGTAGTAACCGGAAAAAGGCCGTTGGTAGGCCGCCGAAATACGCCGCTCATCCAGCTGCAGGCCGGACGGCCCACGTACCAACGCCACCAGCAGGTTCTCGCTTTCCTTGCGCAGGCCGTTTTCCAGGTAGCGTTGCAGGCCAGCTTCGAACAGCCACAAGGTCAGTTGCGCCAGCACCACGCCGACCACCACCAGCACCGCCACCAGGCCCAGGCTCAGGCGCGCCTGGATCGACTTCACCCGGCGCTCCCGGCATAGATGTAGCCTTGCCCGCGACGGGTCTCGATCACGCTGCGGCCCAGCTTGCGCCGCAAGTGATTGACGTGCACTTCCAGCACGTTGGAGTCCCGCTCGGTTTCGCCGTCGTAAAGGTGTTCGGCCAGGTGGCTTTTGGACAGGATCTGCTGCGGGTGCAGCATGAAGTAACGCAGCAGCCGAAACTCGGCAGCGGTCAGTTGCACATCCATGCCGTCACGGCTCACACACTGACGGCTTTCATCCAGGTGCAGCCCCGCTGCCTCCAGTTTCGGCTGGTTGGCCAGGCCCCGGGCACGGCGCAGCAATGCCTGGATACGCAATTGCAGCTCTTCCGGGTGGAACGGTTTGCTCAGGTAGTCATCAGCCCCGGCCTTCAGCCCCTCGATGCGCTCGGCCCACGAGCCGCGGGCGGTGAGGATCAGCACCGGCGTGACCAGGCCGGCAGCACGCCACTGGGC
CAGCACATCCAGGCCCGGCAAGCCCGGCAGGCCAAGGTCGAGAATGATCAGGTCGTAGGGTTCGCTCTGGCCCTGGTAAACCGCGTCGCGGCCGTCGGCCAGCCAGTCCACGGCATAGCCCTGGCGCTGCAGGCCGGCGGTCAGCTCATCGGCCAGCGGTACATTGTCCTCGACAAGCAACAGGCGCATTCAGTCGTCTTCCTCGTCTTTGAGCAGGGCACCGGTGCTGGCATCGAGCTTGATCTCGCGTACCACACCTTCAACCGTCAGCAGTTCGACCTCATATTCGTAGCGATCGTCGTCTTCTTCCAGCTCGGCTTCCAGCAGGCGCGCCCCGGGGTGACGCCCCAGGGCGGTTTCCAGCAATTGCTCGAGCGGCAGGATGACGCCCTTCTGCCGCAGCTTCAGGGCTTCGTCCTGGTCAAGGTCGCGGGCTGCGGCCAGGGAGCACACGGCCAGCAGCGCCAGCGCGAGGTAGCGCGCCGGCCGCGGTAAGTGAATCATCAGTTGTCCCGCTCGTCCTTCAGCACTTCACCGGTCTTGGCGTCCAGCGCCACGTCCCACTCGACGTTCTGGGTGTCGCGCAACTCGACCTTGTAGATGTAGCGGCCGTACTCGTCCTCAAGCTCCGAGTCGGTGACGGTGGCGCCGGGGTGCTTGGCCACGGCGGTGGCTTTGAGATCATCCAGCGACTTGATGGTCTTGGCGTTGACCAGCTTGACCACTTCGTCAGGCTGTACATCCTTGGCGAAAGCGGCATTCGCGCCCAGGGCGAGGGCAGCGGCGGTGAACAGGGCAGTCAAGGTTTTCATCGTTCTTCTCTCCTGAGAACTGTGTAAGTTGTCTACGGGGTTAAGACTAACCACCGGTCCTTAATTCAACCTGAAAACAGCTGGCGCCATGATAGCGCAGTTACAACTCCAGGCAGGTTTGGTCTCCTGCAGGAGCAGCCTTGTGCTGCGAAAGGGCCCGTGCAGGCACAACAGTTGTGTAGTCTGTACGGGCCCTTTGCAGCACAAGGCTGCTCCTACAAGCCCAAGAGATCCGCCATGAGCGCCATCCACATCAAGTACCCTGCCCTCACCTTCAAGGCCGGTGACCGTGCCCTGCGGCTCATCCGCGAGCGCGGCCTGCAGGCGGCCGATGTCGGGGTGCTGCCAGGTGCGGCCGGTGGCCCGAAGCCGCTGGGTATCCAGGGCCTGGACCTGGCACTGTTCGGCGAGTGGCTGCCTTCGGCACCGCGCCCGCGCGCCCTGATTGGCGCTTCGATCGGTGCCTGGCGCTTCGCCAGCGCCTGCCTCGAAGACCCCATCGCGGGTCTGCGTCGCCTGGGCGAGTTGTACACCGAACTGGATTTCGCCAAGGGCGCCACCCCGGCCGAAATCAGCCACAGCTGCCAACGCATGCTCGACGACCTGCTGCAGGGCCGCGACGGCCAGCTGCTGGCCAACCCGCACTATCACCTGAATATCCTGGTAGTGAAAAGCCATGGCCAGCTAGCCCACGACCATCGCGGCCGCCTGGGCCTGGGCCTGGGCTCGGTGGTGGCCAGCAACCTGCTCGGCCGTTCGCGCCTGGCGCGCCACTTCGAGCGCATCATCCTGCACGACGTGCGCGCCACGCCGCCGCTCGACGCACTGACCGACTTCCCCTCACGCTACCTGCCGCTGGACCTGGCCAACCTGCGCCACGCCCTGCTGGCCTCGGGTTCGATCCCCATGGTCATGCAGGGTGTGAAAGACATCCCCGGCGTGGGCGCTGGCACCTACCGCGACGGCGGCCTGCTCGACTACCACCTCGACCTGCCCTACCGCGGCGACGACCTGGTGCTGTACCCGCACTTCACCGACAAGGTGGTACCGGGCTGGTTCGACAAAGCCTTGCCCTGGCGCAAAGGCGATGCCACCCGCCTGCAGAACGTGCTGCTGATGACCCCTTCGCCGCAGTACCTGGCCGCCCTGCCCTATGGCAAGCTGCCGGACCGCAACGACTTCAAGCGC
TTCATGGGTGATGCGCCAGGCCGCAAGCGCTATTGGTACAAGGCTATCGCTGAAAGCCAGCGCCTGGGCGACGAGTTGCTGGAGCTGGTCGCCACCGGGCGGCTGCATGAACGCCTGCAAGCCTTGTAACGGCCATGCTGGTAGAATGCGCGCATTCATTGATTCACACAGAGTTATGACAGCGTGGAAATTTTCAAAGAGTTCACATTCGAATCGGCCCACCGCCTGCCCCACGTCCCTGAAGGGCACAAATGCGGCCGCCTGCATGGCCACTCGTTCAAGGTCGGCCTGCACCTGACCGGCCCGCTCGACCCGCACACTGGCTGGATCCGCGACTTCGCCGAGGTCAAGGCGATCTTCAAGCCGATCTACGAGCAACTGGACCACAACTACCTGAACGACATCCCGGGCCTTGAAAACCCCACCAGCGAAGTGATCGCCAAGTGGATCTGGGACCAGGTCAAGCCGCTGATGCCGGAACTGTCGAAAGTGCGCATCCACGAAACCTGCACCAGCGGTTGCGAATACACCGGCGACTGATCCGCCCGCGCTACTGGCCCTGCGTGCTGCCCATGGCATCGCGCAGGAAGCTGGGGGCGATGTAGCGCTGGTAATGCGCTTCTGAAAGCAGGAAGAACTCGCGATCGATGGCATCGCGCAATTGCGGCAGTTCCCAGTCGCGAAACTCCGGCAGCAGCACCATGCCGTAGGCCTCCAGGTCACGGATCATTCGCGCCCCGCGGGCAATCAGCTGGTAAGCCCAGCAATATTCCGACTGCTGCTCGACAAAACGGATGGAGCGCTGCTCCAGTTGCTGACGCAGCAGGCTTTTGTCGAACACCTCCAGCTTGGCCATCATCACCTGCACCAGCAACTGCTCAAGGCGCAGCCACACCGCGCGTTTTTGCGCGTCGTCGTAGCCGTTCCAGTTGATCACTTCGTGGTGGAAACGCTTGCAGCCACGGCACACCGTGTCTCCGTACACCGTGGAGCACAGGCCGACGCAAGGGGTCTTGATGGACTTGTTGGACATGAAAAACAACAGTTTGGCGGAGGAACACGGGCCCATGTTAGCCCTTTGTCTAACCAGGGTCACTCGTTAAAGTCATCTTCTGCGCCTTACTTTCGGGCTTTTTTTGCCGTAGAATCATCCGGCCTTTTCAAAGGCAACAATGTCCGTTGGAAGCTGTTTTCAAAGCGTCACGAGCACAGTTAATCCGGTAGAACGGCGTTGGCCCGGGCCATGCTTCCCTCGCATGCCCGCGCCAGCCCTCATCAGCTCCCCGTTCTGCAGGCGTAAAACTTTGAAAGCAGCTTCTGTAAGGATTCTTTGCGACTCTGGCTGGGCGGCCCACAAAGCCGTGGCAGCGCATGGGTGCATCGAATGCTGGATAAGCGTCCCGGACCCCCTTTAGGGACCACTGATGAGGGAAATAACTGTGCTTGAAGCCTACCGCAAACACATCGAAGAGCGTGCCGCCCTGGGTATCGTGCCCCAGCCGCTGAACGCCGAACAAACCGCAGGCCTGGTCGAGCTGCTGAAAAACCCGCCGGCCGGCGAAGAAGCCTTCCTCGTAGACCTGATCACCAACCGCGTACCGCCAGGGGTCGACGAAGCTGCCTACGTCAAGGCTGCTTTCCTCTCTGCTGTTGCCAAGGGCGAAACCCAGTCGCCTCTGATCGACCGCAAGCACGCCACCGAACTGCTCGGCACCATGCAGGGCGGCTATAACATCGAAACGCTGGTCGCACTGCTGGATGACGCCGAACTGGGCGCCGTCGCGGCCGAACAGCTCAAGCACACCCTGCTGATGTTCGATGCCTTCCACGACGTGGCCGAAAAAGCCAAGGCCGGCAACGCTCACGCCAAGGCCGTGCTGGACTCCTGGGCTGCCGGCGAGTGGTTCACCGCCCGCCCGGCAATCGCCGAGAAGTACACCCTGACCGTGTTCAAGGTGCCAGGCGAAACCAACACCGACGACCTGTCGCCTGCTCCGGAC
GCCTGGTCGCGCCCTGACATCCCGCTGCACGCCCTGGCCATGCTGAAAATGGCCCGCGACGGCATCGAACCGCAGCAGCCTGGTTCGGTCGGCCCGCTGGCACAGATCGAAGCCGTCAAGGCCAAGGGCTTCCCGGTTGCCTATGTAGGTGACGTGGTAGGTACCGGTTCCTCGCGTAAATCCGCTACCAACTCGGTACTGTGGTTCTTCGGCGACGACATTCCGTACGTGCCGAACAAGCGCGCCGGTGGCTTCTGCTTCGGCACCAAGATCGCCCCGATCTTCTACAACACCATGGAAGACGCCGGCGCCCTGCCGATCGAGTTCGACTGCACCAACCTGGGCATGGGCGACGTCATCGACGTCTACCCGTACAAAGGTGAAGTACGCCGCCACGAAAGCGACGAGCTGGTCACCAACTTCGAGCTGAAAACCGAAGTGCTGCTGGACGAGGTCCGCGCTGGCGGCCGTATCCCGCTGATCGTTGGCCGTGGCCTGACCGAAAAAGCCCGCGCCGAGCTGGGCCTGGGTGCTTCCGACCTGTTCAAGAAACCAGAGCAGCCTGCCGACTCCGGCAAGGGCTTCACCCTGGCGCAGAAAATGGTCGGCCGTGCCTGTGGCCTGCCAGAAGGCCAGGGCGTGCGCCCAGGTGCCTACTGCGAGCCGAAGATGACCACCGTCGGTTCCCAGGACACCACTGGCCCGATGACCCGCGACGAGCTGAAAGACCTGGCGTGCCTGGGCTTCTCTGCCGACCTGGTCATGCAGTCGTTCTGCCACACTGCGGCGTATCCAAAGCCGATCGACGTCACCACCCACCACACCCTGCCAGACTTCATCCGCACCCGTGGCGGCGTGTCGCTGCGCCCAGGCGACGGCATCATCCACAGCTGGCTGAACCGCATGCTGATGCCTGACACCGTTGGTACCGGTGGCGACTCGCACACCCGCTTCCCGATCGGCATCTCGTTCCCGGCCGGTTCCGGCCTGGTGGCCTTCGCCGCCGCCACCGGCGTCATGCCGCTGGACATGCCAGAATCGATCCTGGTGCGCTTCAAGGGCAAACTGCAACCCGGCATCACCCTGCGCGACCTGGTGCATGCCATCCCTTACTACGCCATCCAGAAAGGCCTGCTGACCGTCGAGAAAAAAGGCAAGAAAAACGCCTTCTCCGGCCGCATCCTGGAAATCGAAGGCCTGGACGAACTGACCGTCGAGCAAGCTTTCGAGCTGTCCGATGCCTCGGCCGAGCGTTCCGCTGCCGGTTGCACCATCAAGCTGCCAGAGAAGGCCATTGCCGAGTACCTGACGTCCAACATCACCCTGCTGCGCTGGATGATCGGCGAAGGCTACGGCGATGCCCGCACCCTGGAGCGCCGCGCCCAGGCCATGGAAGCCTGGCTGGCCAACCCTGAGCTGCTGTCGGCTGACGCCGATGCCGAGTACGCTGAAATCATCGAAATCGACCTGGCCGACGTCAAAGAGCCTGTGCTCTGCGCGCCGAACGACCCGGACGATGCCCGCCTGCTGTCTTCGGTACAGGGCGAGAAGATCGACGAAGTGTTCATCGGTTCGTGCATGACCAACATCGGCCACTTCCGCGCTGCCGGCAAGCTGCTGGAGAAGGTCAAGGGTGGCATCCCTACCCGTCTGTGGCTGGCTCCGCCAACCAAGATGGATGCTCACCAGCTGACCGAAGAAGGCTACTACGGCATCTACGGCAAGGCCGGTGCGCGCATGGAAATGCCAGGCTGCTCGCTGTGCATGGGTAACCAGGCACGTGTGCAGACCGGTTCGACCGTGGTCTCCACCTCGACCCGTAACTTCCCGAACCGTCTGGGCGACGCGACCAACGTGTACCTGGCATCGGCCGAACTGGCTGCTGTCGCTTCGATCATCGGCAAGCTGCCGACCGTCGAAGAGTACATGCAGTACGCGAAAGACATCGACAGCATGGCTGCCGACGTTTACCGCTACCTGAGCTTCGACCAGATCGCCGA
GTTCCGCGAAGCGGCAGCCAACGCCAAGATCCCGGTGGTTCAGGCTTAAGCTGAGTTACAGGCAGTAAAGAAAGCCCCGGCAGCGATGCCGGGGCTTTTTGTTTTAGCCTGTACCGGCCACCTCAGATCACAAACAAATCCACAAACCTGTGCACCGCCATACCCTCCAGCTGCTGCTGGTCCTTGCATACCTCCACAATCTGCGCACACCGCTGACGAGCAAAACGCGTCCCCAGGTTTGCCCTGAACTTGGCCTCCAGCAACGGTATGCCCTCCCCTCGCCGCCGCCGATGCCCGATCGGGTATTCCACCACCACTTGGTCAGTGCTGCTGCCATCCTTGAAGAACACTTGCAGCGCGTTGGCGATCGAGCGCTTGTCTGGCTCCAGGTACTCGCGGCTGAAGCGCAGGTCTTCCACCACTTCCATCTTCTCGCGCAGGCGATCGATGCTTGGGTGGTGGGCATGGAAAGCATCTTCATAGTGCTCCGCCACCAAATGGCCGAAGATCAGCGGTACCGCCACCATGTACTGCAGGCAGTGGTCACGGTCGGCAGCATTGGCCAGCGGGCCGCTCTTGGAAATGATGCGGATCGCTGACTCCTGGGTGGTGATCACGATGCGGTCGATTTCATGCAGACGGTTGCGCACCAATGGGTGCAGGGTCACTGCCGCCTCACAGGCGGTCTGGGCATGAAACTCGGCAGGGAAACTGACCTTGAACAACACATTTTCCATCACATAGCTGCCCAAGGCCTGAGGCAGACGCAGCTCGTACTGCCCGGCAGGCTTGAGCGCCAAGTCTTTGTTGGTATGACTGAACGATACGTCGTAGAACCCCCACTGCGGCGCCGTGAGCACGCCCGGCACGCCCATCTCGCCGCGCAGGGCGATATCGGCCAGGCGCACGCCCCGGCTGGAAGCATCGCCCGCCGCCCAGGACTTGCGAGAACCGGCGTTGGGCGCATGGCGGTAAGTGCGCAGGGCCTGGCCGTCGACGAAGGCGTGGGACAAGGCCGAGAGCATCTGCTCGCGATTGGCCCCCATCAGCCTGGCACACACGGCGGTTGAGGCCACCTTCACCAGAATCACGTGATCGAGCCCGACACGGTTGAAAGAATTTTCCAGGGCCAGCACACCCTGGATCTCATGGGCCATGACCATTGCCTCAAGCACATCGCGCATCAGCAGCGGCGCCTCACCCGCTGCGACGTGCTTCTGCGACAGGTGATCAGCCACGGCCAGGATGCCGCCAAGGTTGTCCGAGGGATGAGCCCATTCAGCGGCCAGCCAGGTGTCGTTGTAGTCCAGCCAGCGCACCGTGCAGCCGATGTCCCAGGCGGCCTTGACCGGGTCGAGGCGATAGGAAGTGCCGGGGACGCGAGCACCATTCGGAACCAGCGTGCCTTCCACCTGAGGGCCGAGGAGTTTCGTGCATTCGGGGAAGCGCAGGGCCAGCAGGCCGCAGCCGAGGGTGTCCATCAGGCAGTTGCGCGCGGTGGCCAGGGCTTCGGCGGAATCGACCCGGTAGCCGAGGGCGTAATCGGCCAGGGTCTGTAGCACCCGGTCGTAATCCGGGCGGTCGTTGAGGTCTACGTTGGCGCTCATATCCAGCTCCCTTCAGAATTGCCGGAGCTGCTGTGCAGCCCTTCGCGGGCTCGCCCGCTCCCACAGGGATATCACAGCATTCAAACCCTGTGATTACCTGTGGGAGCGGGCAAGCCCGCGAAGGGCCGCAAAGCGGCCCCTCTTGAAGCCGAGCGCCAGTGGTGCTTTAGAAGCTATCGCCAGGCACGCGAACCCAGCCCTCCATCAGCACGCGAGCACTGCGGCTCATGATTGCCTTGGTCACTGTCCACTCACCCTCCACCCGGCGCGCCTCGGCCCCGACCCGCAAGGTGCCCGAAGGGTGGCCGAAACGCACAGCGCTGCGCTCGCCCCCGCCAGCGGCGAGGTTGACCAGCGTGCCCGGAATGGCCGCTGCGGTGCCGATCGCCACCG
CCGCCGTACCCATCATGGCATGGTGCAGCTTGCCCATCGACAAGGCCCGCACCAACAGGTCGACATCCCCTGCCTGTACCACCTTGCCACTGGACGCAGTGTAGGTGCTCGGTGGCGCAACGAACGCCACTTTTGGTGTGTGCTGACGCCCGGCAGCCTGGTCGACATTGTCGATCAAGCCCATGCGCACGGCGCCATAAGCACGAATGGTCTCGAAACGCAGCAGCGCCTGCGGGTCGCCGTTGATAGCGTCCTGCAGTTCGGTACCGGTGTAGCCGATATCCGCCGCGTTTACGAAGATGGTCGGAATGCCAGCGTTGATCAGGGTCGCCTTGAAGGTGCCGACACCCGGCACCTCCAGGTCATCGACCAGGCTGCCAGTGGGGAACATCGCACCGCCGTCGTCACCGTCGTCGGCCGCCGGGTCGAGGAACTCCAGTTGCACCTCGGCCGCCGGGAAGGTCACCCCGTCCAGCTCGAAGTCGCCGGTTTCCTGCACTTCACCTTCAGTGATCGGGACATGGGCGATGATGGTCTTGCCGATATTGGCCTGCCAGATGCGCACCGTGGCGATGCCGTTGCGCGGAATGCGCGCCGGGTCGACCAGGCCACTGCTGATGGCGAACGAACCGACCGCAGCAGACAGGTTGCCGCAGTTGCCGCTCCAGTCGACGAACGCCTTGTCGATGCTGACCTGGCCGAACAGGTAGTCGACATCGTGTTCGGGCTTGATGCTTTGCGACAGGATCACGGTCTTGCTGGTGCTGGACGTGGCGCCGCCCATGCCGTCGATCTGCTTGCCGTAGGGGTCGGGGCTGCCTATTACCCGCAGCAGCAGTGCGTCACGGGCGGGGCCGGGGATCTGCGCCTGCTCGGGCAGGTCTTGCAGGCGGAAGAACACGCCTTTGCTGGTGCCGCCACGAATGTAGGTGGCGGGGATTTTGATCTGGGGTACATGTGCCATTGTCTTGGGCATCCTGATAGTCCGGTGAAATACCCTGGGGCTGCTATGCAGCCCTTTCGCGACACAAGGCCGCTCCCACAGGTACTGCGCTGCCCGTTGTGGGAGCGACCTTGTGTCGCGATGGGCCGCAAAGCGGCCCCCGTTTCATGCTCAGGCGGTCGCTTCGAGGAAGTCCTGGGCAAAGCGTTGCAGCACCCCGCCCGCCTCGTAGATCGACACTTCCTCGGCGGTGTCCAGGCGGCAGGTCACCGGCACTTCCAGGCGCTCGCCATTGGCGCGGGTAACCACCAACGTGAGGGTCGCCCGTGGCGTACGTGCACCCAGCACGTCATAGGTCTCGCTGCCGTCCAGGCCCAGGGTTTTGCGGTCAGTGCCTGGCTTGAACTCCAGCGGCAACACGCCCATGCCCACCAGGTTGGTGCGGTGAATGCGCTCGAAGCCTTCGGCGACGATGGCCTCGACACCCGCCAGGCGCACGCCCTTGGCCGCCCAGTCGCGGGACGAACCTTGGCCGTAGTCGGCGCCGGCCACGATGATCAGCGGCTGTTTGCGCTGCATGTAGGTTTCGATCGCCTCCCACATGCGGGTCACCTTGCCTTCCGGCTCGATACGCGCCAGCGAGCCCTGCTTCACGCTGCCGTCTTCCTTGCGCACCATTTCGTTGAACAGTTTGGGGTTGGCGAACGTGGCACGCTGGGCGGTCAGGTGGTCGCCGCGGTGGGTGGCGTAAGAGTTGAAGTCCTCTTCCGGCAAGCCCATTTTCGCCAGGTATTCACCGGCGGCGCTGTCGAGCATGATGGCGTTGGACGGCGACAGGTGGTCGGTGGTGATGTTGTCCGGCAGCACCGCCAACGGCCGCATGCCGCGCAGGGTACGTTCACCGGCCAGCGCGCCTTCCCAATACGGCGGGCGGCGGATGTAAGTGCTCATCGGGCGCCAGTCATACAGCGGCGCCACCTTCGGCCCGCGGTCTTCCTCGATGGCGAACATCGGAATGTACACCTTGCGGAACTGCTCTGGCTTGACCGCGGCACGT
ACCACGGCGTCGATCTCTTCATCGCTCGGCCAGATGTCCTTGAGGCGGATTTCCTTGCCATCGACCACGCCCAGCACATCCTTCTCGATGTCGAAGCGGATCGTGCCGGCAATGGCATAGGCCACCACCAGCGGCGGCGAAGCCAGGAAGGCCTGCTTGGCATACGGGTGGATACGCCCGTCGAAGTTGCGGTTACCCGACAGCACGGCGGTGGCGTACAGGTCGCGGTCGATGATTTCTTGCTGGATCACCGGGTCCAGCGCGCCAGACATCCCGTTGCAGGTGGTGCAAGCGAAGGCGACGATACCAAAACCGAGTTGCTCCAGCTCCTTCTCCAGCCCGGCTTCTTCCAGGTAAAGCTGCACAGCCTTGGAACCTGGCGCCAGCGACGACTTGACCCACGGTTTGCGGGCCAGGCCAAGCCTGTTGGCATTGCGTGCAATCAGGCCAGCAGCTATCACGTTGCGCGGGTTGCTGGTGTTGGTGCAGCTGGTGATGGCGGCAATGATCACCGCACCGTCCGGCATCTGCCCCGGCACCTCTTCCCAGCTGCCGGCAATGCCTTTGGCTGCCAGGTCGCTGGTGGCCACGCGAGCGTGCGGGTTGGACGGGCCGGCCATGTTGCGCACCACGCTCGACAGGTCGAAGCTCAGGGTGCGCTCGTAGACGGCGCCGCCCAGGCTGTCGGCCCACAGGCCGGTCGCCTTGGCGTAGGTTTCCACCAGCTTGACCTGCTGCTCTTCGCGGCCGGTCAGGCGCAGGTAGTCGATGGTCTGCTGGTCGATGGCGAACATAGCGGCGGTGGCGCCGTATTCCGGGGCCATGTTGGAAATGGTGGCGCGGTCACCCAAGGTCAGCGCGCGGGCGCCCTCGCCGTGGAATTCCAGGTAGGCGCCGACGACTTTCTGTTTACGCAGGAATTCGGTCAGGGCCAGCACCAGGTCGGTGGCGGTGATGTTCGGTGCCAGCTTGCCGGTCAGCTCGACGCCGACGATTTCGGGCAGGCGCATCCACGAAGCGCGGCCGAGCATCACGTTTTCGGCTTCCAGGCCACCAACGCCAATGGCGATCACGCCCAGGGCGTCGACATGCGGGGTGTGGCTGTCGGTTCCGACGCAGGTGTCCGGGTAGGCCACGCCGCGGTCGCTATGAATGACAGGCGACATCTTCTCCAGGTTGATCTGGTGCATGATGCCATTACCCGGCTGGATCACGTCGACGTTCTTGAACGCCTTCTTGGTCCAGTTGATGAAGTGGAAGCGGTCTTCGTTGCGGCGGTCTTCAATGGCGCGGTTCTTTTCGAAGGCCTGTGGGTCGAAGCCACCGCACTCCACTGCCAGCGAGTGGTCGACGATCAGCTGCACCGGCACCACCGGGTTGACGGCGGCCGGGTCGCCACCCTTGTCGGCGATGGCATCACGCAGGCCGGCGAGGTCGACCAGCGCGGTCTGGCCGAGGATGTCGTGGCACACCACGCGGGCCGGGAACCACGGGAAGTCGAGGTCACGCTGGCGTTCGATCAGTTGGCCCAGCGAGGCGTCGAGGGTGGCCGGGTCGCAGCGGCGCACCAGGTTCTCGGCGAGCACGCGGGACGTGTAAGGCAGGCCGTCGTAGGCGCCGGGCTTGATCGCCTCGACCGCCGCACGGGCATCGAAGTAGTCCAGGTCGGTGCCTGGAAGGTGCTTGCGGAATGCTGTGTTCATATCGTTATCAGGCTCGGTCACGGTACGGTCGTGGATAGCGTTCTAATTGCTGCCCTGTATTGGCTGCTCCGGCCTCATCGCCGGCAAGCCGGCTCCCACAGGTACAGCACTAGCTACAAGGGCGGTGCAACACCTGTGGGAGCTGGCTTGCCGGCGATGAGGCCGGAACAGGCCACCCCACAGGTTCGGTTTCAGCGCTGCTCGATCGGCACGAACTGGCGCTGCTCGACGCCGACATACTCGGCGCTCGGACGGATGATGCGGTTGTTGGCACGCTGTTCGAAGACATGCGCCGCC
CAGCCGGTCAGGCGCGAGCAGACGAAGATCGGGGTGAACAGCTTGGTCGGGATACCCATGAAGTGGTACGCCGAGGCATGGTAGAAGTCAGCGTTGGGGAACAGCCGCTTCTGCTCCCACATGGTCTTGTCGATGGCTTCGGAAACCGGGTACAGCACCTTGTCGCCCACTTCGTCGGCCAGCTGCTTCGACCAGCCCTTGATCACCTCGTTGCGCGGGTCGGACTCTTTGTAGATGGCATGGCCAAAGCCCATGATCTTGTCCTTGCGCTCCAGCATGCGCAGCAGCTCGGCGGTGGCTTCCTGCGGGCTCTGGAAGCGCTCGATCAGCTCCATCGCCGCCTCGTTGGCACCGCCGTGCAGCGGGCCGCGCAGCGAGCCAATGGCAGCGGTGACGCAGGAGTACAGGTCGGACAGGGTCGAGGCGCAAACGCGGGCGGTGAACGTGGAGGCGTTGAACTCGTGCTCGGCGTACAGGATCAGCGACACGTTCATGACCTTGACGTGCAGCTCGCTCGGCTTTTTGCCGTGCAGCAGGTGCAGGAAGTGGCCGCCGAGCGTGTCTTCGTCACTGGTGCAATCGATGCGCACGCCATGGTGGGTGAAGCGGTACCAGTAGCACATCACTGCCGGGAACAGCGCCAGCAGGCGGTCGGTCTTGTCGCGCTGGGCTTCGAAAGTGAGCTCGGGCTCCAGGGTACCCAGTACCGAGCAACCGGTGCGCATCACGTCCATCGGGTGGGCGTCACGCGGGATACGCTCCAGCACTTCCTTCAATGCTTGCGGTAGGTCACGCAGGCCCTTGAGCTTGAGCTTGTAGTCGGCCAGCTCGGCCTGGGTTGGCAGCTCGCCGTACAGCAGCAGGTAAGCGACTTCTTCGAACTCGGCACCTGCCGCCAGGTCACGCACGTCGTAACCCCGGTAGGTCAGGCCGGCACCGGCCTGGCCAACGGTCGACAGTGCGGTCTGGCCGGCCACCTGGCCACGCAGGCCTGCGCCACTGAGTACTTTCGCTTCGGCCATGGTGTTTCTCCTTTCTTGAATTTGTTATGGATTTCTTGCTGGCTAATTCGGTTCGCCTGTGCCGGCCTCTTCGCGGGTAAACCCGCGAAGAGGCCGGCACAGGCGCCATCACTTACCCTTTCTTCTGGGCAAACAGCGCGTCGAGGCTCTGCTCGAAGGCGTGGTAACCAATGGCATCGTAGAGCTCCATGCGGGTCTGCATGGTGTCGATCACGTTTTTCTGTGTGCCGTCGCGGCGCAGCGCGGTGTACACGTTCTCGGCGGCCTTGTTCATGGCACGGAACGCCGACAGCGGGTACAGCACCAGCGACACGTCGACCGAGGCCAGCTCTTCGGTGGTGTACAGGGGCGTGGCGCCGAATTCGGTGATGTTGGCCAGAATCGGTGCCTTCACCCGATCAGCGAACGTCTTGTACATCTGCAGTTCAGTGATGGCTTCCGGGAAGATCATGTCGGCGCCAGCCTCGATGCAGGCCTCGGCGCGATCAAGGGCAGCGTTCAGGCCTTCGACGGCCAAGGCGTCAGTACGCGCCATGATCACGAAGCTGTCGTCGCTACGGGCATCGACAGCGGCCTTGATGCGGTCGACCATTTCCTGCTGGCTGACGATTTCCTTGTTCGGACGGTGGCCGCAGCGTTTGGCGCCAACCTGGTCCTCGATATGGATGGCAGCAGCGCCAAACTTGCTCATCGAACGTACGGTGCGGGCGACGTTGAAGGCCGACGCACCGAAGCCGGTGTCAACATCCACCAGCAACGGCACGTCGCACACGTCGGTAATGCGGCGCACATCGGTGAGCACGTCGTCCAGGCCGGTGATGCCCAGGTCCGGCAAGCCCAGCGAGCCTGCGGCCACGCCGCCGCCGGACAGGTAGATGGCCTTGAAGCCGGCACGCTTGGCCAGCAGGGCATGGTTGGCATTGATGGCGCCAACCACCTGCAGGGGATGTTCGGCAGCAACGGCGTCGCGGAAACGCT
GACCGGGGGTGCTCTTCACAGTCATTTCTCACCTCGTGGGCTGTTGTTGTGGGCGTCCAGATAGTGACGCTCGATGTTGCGCTTGGACGCGCCGATGTGGCGGCGCATCAGGAGTTCGGCCAGCTCGCCGTCACGGTCGGCGATGGCATCGAGGATGCGGTGGTGTTCGGCAAACGCCTGGCGTGGCCGATTGGGCGTGGCAGAGAACTGGATGCGGTACATGCGCACCAGTTGGTACAGTTCGCCGCACAGCATCTTGACCAGGGTCTGGTTGCCGCTGCCCTGGATGATCCGGTAATGGAAGTCGTAGTCGCCTTCCTGCTGGTAGTAACCCACACCAGCCTGGAATGCGGCATCGCGCTCATGGGTATCAAGCACACGGCGCAGTTCGTCGATGTCGCCCTGACTCATCCGTTCGGCTGCCAGGCGGCAGGCCATGCCTTCCAGCGACTCGCGGATTTCGTACAGTTCGATCAGTTCAGCGTGGTTCAGCGATACCACCCGCGCGCCCACATGCGGTACGCGCACCAGCAGGCGCTGGCCCTCCAGGCGGTGGATGGCCTCGCGCAACGGCCCACGGCTGATGCCGTAGGTGCGCGCCAGCTCCGGCTCGGAAATCTTGCTGCCGGGGGCAATTTCGCCCTTGACGATGGCCGCCTGAATGCGCCGGAAGACGTTTTCAGACAAGGTTTCCGTTTCGTCTGTCAGCACCGGGCTGGCGGTGGAAAGGTCCTGCATATTGTCGACACCTTGAAAATCTCTTTGCCAGAAATTAGCGAAAACACGCTCGCCAGTCAAAGACAAAATGAACATTGTCGACAATTGTCTAATAACGAACTAACCCCCGCCCGGTGCTGTCAGATCGCGCACCGCTGGCGTCAAGACGTCGGCGTGATAGAATGCCGCGCCTCCGATGGGGTATGGATAGCCTGTCTGTCATACATTCATGCTTGTTGACAGATAAAAGAGGAAGCTTGCGCAGTAGTTGCCAGTCGCCTTGACCAGAACCTTGCCTAGAACCTTGCACTGCACCGCGCCAGGATTATGAGACTTACACCCGTTTTATTGCTGCTATGCCTCACCCTGCTGCCGGCCCTTGGTCAGGCTGCGGGCAAGACCGTCTATGGTCTCAACGAATATGCACGGCTGGGCGACCTGGACCTGGAAGTGGCCGCCAAGCTCGACACCGGCGCCAAGACCGCCTCGCTCAGCGCCCGCGATATCAAGCGGTTCAAGCGCAACGGCGAAAGCTGGGTGCGCTTCTACCTGGCCATCGATGCTGCCCACTCGCACCCGATCGAGCGCCCGCTGGCACGCGTAAGCAAGATCAAGCGCCGGGCCGGCGACTATGATGCCGAATCGGGCAAGGCCTACACGGCACGCCCGGTCATCGAACTTGAAATCTGCATGGGCCAGGCCATGCGCACCATCGAAGTCAACCTCACCGACCGCAGCGCCTTCCAGTTCCCGCTGCTGATCGGCTCCGAGGCACTCAAGCACTTCGACGCGCTGGTCGACCCAAGCCTTAAATATGCGGCCGGCAAACCTGCCTGTGCCACCGACGCTCACAAAGCAGAGTAA
Protein sequences of DBSCAN-SWA_4 >NC_021505|3935972:3971477|3955416_3956085_-|WP_016500523.1|DBSCAN-SWA MRLLLVEDNVPLADELTAGLQRQGYAVDWLADGRDAVYQGQSEPYDLIILDLGLPGLPGLDVLAQWRAAGLVTPVLILTARGSWAERIEGLKAGADDYLSKPFHPEELQLRIQALLRRARGLANQPKLEAAGLHLDESRQCVSRDGMDVQLTAAEFRLLRYFMLHPQQILSKSHLAEHLYDGETERDSNVLEVHVNHLRRKLGRSVIETRRGQGYIYAGSAG >NC_021505|3935972:3971477|3956399_3956708_-|WP_016500525.1|DBSCAN-SWA MKTLTALFTAAALALGANAAFAKDVQPDEVVKLVNAKTIKSLDDLKATAVAKHPGATVTDSELEDEYGRYIYKVELRDTQNVEWDVALDAKTGEVLKDERDN >NC_021505|3935972:3971477|3956085_3956400_-|WP_016500524.1|DBSCAN-SWA MIHLPRPARYLALALLAVCSLAAARDLDQDEALKLRQKGVILPLEQLLETALGRHPGARLLEAELEEDDDRYEYEVELLTVEGVVREIKLDASTGALLKDEEDD >NC_021505|3935972:3971477|3942581_3944828_+|WP_016500515.1|DBSCAN-SWA MTADTTLAEAMERCAREPIHVPSSIQPHGFLLVLDATDLCVLQASENVEHWLGLPARELIGCPFASLVSDGFDLHAQLARLPEDEVFPFHIGDVRLRQSAPYSTPLHLLVHGHDEVLIAEFEPPRLSPELTGQGDYYPLVRSFVGSLQLASSLEDLLQQTVLQLKRITGFGRVKAYRFDAEGNGQVLAESADPGYPAYLGLCFPAADIPRQARELYRVNRIRVIEDANYQPSPLLPVINPRTGKALDMSFAALRSVSPVHLQYMRNMGTLASMSLSIVVDGELWGLISCHHQQPRAVDLRTRTACELLASVLSLQIESRESHASTRKLLALRQHIVRMISSMADHDSVSDGLRDLPQVLLAFAGAQGAAVISAERCDLIGQTPPEAQVTALVHWLGQRGEDTVFHSDNVRRDIIDLPELATHAGGVLAVAISQIHSHYLLWFRPEQVRTVNWAGQPTKQVGPQGNLDPRHSFERWQEELRGYSEPWDPLVIDGVLELRTAVLGIVLRKAEELAQLAGDLRRSNKELEAFSYSVSHDLRAPLRHIAGYTELLGEMEGQGLTERGKRFLQHIGEAAHFAGSLVDNLLNFSQMGRSALRLSDVDLNALVEAIRSELAPDYEGRAIVWDIAPLPKVIGDPAFINMALHNLIANAIKYTRGRTPAHIGISAVEHPGETEICIRDNGVGFDMAYANKLFGVFQRLHRMEDFEGTGIGLASVRRIIERHDGRVWAEGQIDQGASFHFTLPRNTAT >NC_021505|3935972:3971477|3935972_3936941_-|WP_016500508.1|coat|DBSCAN-SWA MTAGWGVVWACTVLLCAGPAWAKCTSVATSAAAFGSLNSTQVRTTVQTASSANSGLQCNGSVLSVLSSTDSFVVKITSATNGLVGPTGDVIPYTLYADATTTYPITRGTGFEFRTTGILDLLGLLNGTPKAVPIYMRTLINANVAAGLYQETLNVEWTWKYCDGIGLGGICIGTDQGTGTKPLTVTLTVTNDCQITTPNISFASAPVVAGFGTVSQGINVSCTKGSNYTVGLDDGQNVSGGRRRMKSSANNYLAYDIFKSAGVVRWGASGAARRSSTDADVNPGAGTGTGSQVFNYNAKVYTDQATPPAATYSDSVILDVQF >NC_021505|3935972:3971477|3941218_3941755_-|WP_016500513.1|coat|DBSCAN-SWA 
MNRTSILLLTLGPLLVPGGAAHGSTTGFIQARLVISAACQISSDDTQPAVLGNPGLLDFGERGPNWDQPLRSRVDEAGGAGSLQISCTPEVRAFNVRINGGLNGDDGVRRLSNGRELIPYQLAVDPGGNSRYAIGQARAFTINSTQQVPIPIYGVVVAQPRALPAGLYRDTLRVTLDW >NC_021505|3935972:3971477|3948185_3948428_+|WP_012271609.1|DBSCAN-SWA MPVSHDLYQDLHYPREIVQQRRQQDKALDRLLDEYMDIDNQVLAAESISAGNFVDEDLRHLKERRLAVKYMIERQLERKA >NC_021505|3935972:3971477|3962018_3963503_-|WP_016500529.1|DBSCAN-SWA MSANVDLNDRPDYDRVLQTLADYALGYRVDSAEALATARNCLMDTLGCGLLALRFPECTKLLGPQVEGTLVPNGARVPGTSYRLDPVKAAWDIGCTVRWLDYNDTWLAAEWAHPSDNLGGILAVADHLSQKHVAAGEAPLLMRDVLEAMVMAHEIQGVLALENSFNRVGLDHVILVKVASTAVCARLMGANREQMLSALSHAFVDGQALRTYRHAPNAGSRKSWAAGDASSRGVRLADIALRGEMGVPGVLTAPQWGFYDVSFSHTNKDLALKPAGQYELRLPQALGSYVMENVLFKVSFPAEFHAQTACEAAVTLHPLVRNRLHEIDRIVITTQESAIRIISKSGPLANAADRDHCLQYMVAVPLIFGHLVAEHYEDAFHAHHPSIDRLREKMEVVEDLRFSREYLEPDKRSIANALQVFFKDGSSTDQVVVEYPIGHRRRRGEGIPLLEAKFRANLGTRFARQRCAQIVEVCKDQQQLEGMAVHRFVDLFVI >NC_021505|3935972:3971477|3969030_3969921_-|WP_016485912.1|DBSCAN-SWA MTVKSTPGQRFRDAVAAEHPLQVVGAINANHALLAKRAGFKAIYLSGGGVAAGSLGLPDLGITGLDDVLTDVRRITDVCDVPLLVDVDTGFGASAFNVARTVRSMSKFGAAAIHIEDQVGAKRCGHRPNKEIVSQQEMVDRIKAAVDARSDDSFVIMARTDALAVEGLNAALDRAEACIEAGADMIFPEAITELQMYKTFADRVKAPILANITEFGATPLYTTEELASVDVSLVLYPLSAFRAMNKAAENVYTALRRDGTQKNVIDTMQTRMELYDAIGYHAFEQSLDALFAQKKG >NC_021505|3935972:3971477|3965010_3967599_-|WP_016500531.1|DBSCAN-SWA 
MNTAFRKHLPGTDLDYFDARAAVEAIKPGAYDGLPYTSRVLAENLVRRCDPATLDASLGQLIERQRDLDFPWFPARVVCHDILGQTALVDLAGLRDAIADKGGDPAAVNPVVPVQLIVDHSLAVECGGFDPQAFEKNRAIEDRRNEDRFHFINWTKKAFKNVDVIQPGNGIMHQINLEKMSPVIHSDRGVAYPDTCVGTDSHTPHVDALGVIAIGVGGLEAENVMLGRASWMRLPEIVGVELTGKLAPNITATDLVLALTEFLRKQKVVGAYLEFHGEGARALTLGDRATISNMAPEYGATAAMFAIDQQTIDYLRLTGREEQQVKLVETYAKATGLWADSLGGAVYERTLSFDLSSVVRNMAGPSNPHARVATSDLAAKGIAGSWEEVPGQMPDGAVIIAAITSCTNTSNPRNVIAAGLIARNANRLGLARKPWVKSSLAPGSKAVQLYLEEAGLEKELEQLGFGIVAFACTTCNGMSGALDPVIQQEIIDRDLYATAVLSGNRNFDGRIHPYAKQAFLASPPLVVAYAIAGTIRFDIEKDVLGVVDGKEIRLKDIWPSDEEIDAVVRAAVKPEQFRKVYIPMFAIEEDRGPKVAPLYDWRPMSTYIRRPPYWEGALAGERTLRGMRPLAVLPDNITTDHLSPSNAIMLDSAAGEYLAKMGLPEEDFNSYATHRGDHLTAQRATFANPKLFNEMVRKEDGSVKQGSLARIEPEGKVTRMWEAIETYMQRKQPLIIVAGADYGQGSSRDWAAKGVRLAGVEAIVAEGFERIHRTNLVGMGVLPLEFKPGTDRKTLGLDGSETYDVLGARTPRATLTLVVTRANGERLEVPVTCRLDTAEEVSIYEAGGVLQRFAQDFLEATA >NC_021505|3935972:3971477|3963669_3964860_-|WP_016500530.1|DBSCAN-SWA MAHVPQIKIPATYIRGGTSKGVFFRLQDLPEQAQIPGPARDALLLRVIGSPDPYGKQIDGMGGATSSTSKTVILSQSIKPEHDVDYLFGQVSIDKAFVDWSGNCGNLSAAVGSFAISSGLVDPARIPRNGIATVRIWQANIGKTIIAHVPITEGEVQETGDFELDGVTFPAAEVQLEFLDPAADDGDDGGAMFPTGSLVDDLEVPGVGTFKATLINAGIPTIFVNAADIGYTGTELQDAINGDPQALLRFETIRAYGAVRMGLIDNVDQAAGRQHTPKVAFVAPPSTYTASSGKVVQAGDVDLLVRALSMGKLHHAMMGTAAVAIGTAAAIPGTLVNLAAGGGERSAVRFGHPSGTLRVGAEARRVEGEWTVTKAIMSRSARVLMEGWVRVPGDSF >NC_021505|3935972:3971477|3951822_3952689_+|WP_016500520.1|DBSCAN-SWA MDIDQARTFLEIVRCGSLVAAAERLFVSQTAITARVQRLEQQLGCQLFVRSRNGASLTSDGEAFVSYANQLVQTWEAARRDLPLPEGCQQVLHVGGEVSLGNPMMLDWISALHRELPSHAIRSEVSDGESLLRKVEMGLLDAALVYQPTYGPGLQVEQLMEEKLIRVRRVDQPEPYIYIDWGEAFRRQHDAALPDCARPALSFNLGPLALQFILDQGGSGYFRTRVVQAYLDSGVFERVPQAPEFTYPTFLVYPRKRDSEALQQAFVILRQLVAAGASDWSQRWDPVI >NC_021505|3935972:3971477|3959335_3961945_+|WP_041167795.1|DBSCAN-SWA 
MLEAYRKHIEERAALGIVPQPLNAEQTAGLVELLKNPPAGEEAFLVDLITNRVPPGVDEAAYVKAAFLSAVAKGETQSPLIDRKHATELLGTMQGGYNIETLVALLDDAELGAVAAEQLKHTLLMFDAFHDVAEKAKAGNAHAKAVLDSWAAGEWFTARPAIAEKYTLTVFKVPGETNTDDLSPAPDAWSRPDIPLHALAMLKMARDGIEPQQPGSVGPLAQIEAVKAKGFPVAYVGDVVGTGSSRKSATNSVLWFFGDDIPYVPNKRAGGFCFGTKIAPIFYNTMEDAGALPIEFDCTNLGMGDVIDVYPYKGEVRRHESDELVTNFELKTEVLLDEVRAGGRIPLIVGRGLTEKARAELGLGASDLFKKPEQPADSGKGFTLAQKMVGRACGLPEGQGVRPGAYCEPKMTTVGSQDTTGPMTRDELKDLACLGFSADLVMQSFCHTAAYPKPIDVTTHHTLPDFIRTRGGVSLRPGDGIIHSWLNRMLMPDTVGTGGDSHTRFPIGISFPAGSGLVAFAAATGVMPLDMPESILVRFKGKLQPGITLRDLVHAIPYYAIQKGLLTVEKKGKKNAFSGRILEIEGLDELTVEQAFELSDASAERSAAGCTIKLPEKAIAEYLTSNITLLRWMIGEGYGDARTLERRAQAMEAWLANPELLSADADAEYAEIIEIDLADVKEPVLCAPNDPDDARLLSSVQGEKIDEVFIGSCMTNIGHFRAAGKLLEKVKGGIPTRLWLAPPTKMDAHQLTEEGYYGIYGKAGARMEMPGCSLCMGNQARVQTGSTVVSTSTRNFPNRLGDATNVYLASAELAAVASIIGKLPTVEEYMQYAKDIDSMAADVYRYLSFDQIAEFREAAANAKIPVVQA >NC_021505|3935972:3971477|3967790_3968918_-|WP_016500532.1|DBSCAN-SWA MAEAKVLSGAGLRGQVAGQTALSTVGQAGAGLTYRGYDVRDLAAGAEFEEVAYLLLYGELPTQAELADYKLKLKGLRDLPQALKEVLERIPRDAHPMDVMRTGCSVLGTLEPELTFEAQRDKTDRLLALFPAVMCYWYRFTHHGVRIDCTSDEDTLGGHFLHLLHGKKPSELHVKVMNVSLILYAEHEFNASTFTARVCASTLSDLYSCVTAAIGSLRGPLHGGANEAAMELIERFQSPQEATAELLRMLERKDKIMGFGHAIYKESDPRNEVIKGWSKQLADEVGDKVLYPVSEAIDKTMWEQKRLFPNADFYHASAYHFMGIPTKLFTPIFVCSRLTGWAAHVFEQRANNRIIRPSAEYVGVEQRQFVPIEQR >NC_021505|3935972:3971477|3970940_3971477_+|WP_016500534.1|protease|DBSCAN-SWA MRLTPVLLLLCLTLLPALGQAAGKTVYGLNEYARLGDLDLEVAAKLDTGAKTASLSARDIKRFKRNGESWVRFYLAIDAAHSHPIERPLARVSKIKRRAGDYDAESGKAYTARPVIELEICMGQAMRTIEVNLTDRSAFQFPLLIGSEALKHFDALVDPSLKYAAGKPACATDAHKAE >NC_021505|3935972:3971477|3958446_3958929_-|WP_151326450.1|DBSCAN-SWA MSNKSIKTPCVGLCSTVYGDTVCRGCKRFHHEVINWNGYDDAQKRAVWLRLEQLLVQVMMAKLEVFDKSLLRQQLEQRSIRFVEQQSEYCWAYQLIARGARMIRDLEAYGMVLLPEFRDWELPQLRDAIDREFFLLSEAHYQRYIAPSFLRDAMGSTQGQ >NC_021505|3935972:3971477|3940666_3941200_-|WP_016500512.1|coat|DBSCAN-SWA 
MRTNLSCCMLAGLGLALASQAQAATVTGSINSTLTLISACQVNGSSGTSGLNFGALNFGTQDALFVTANAQVLGGGGGAMSILCSAGTVPAIKVRAGLHDGQSSGGTRALADGSGNFVPYDLYTDTGRTTLLAIDGTITLPTSTGVAQTVNLYGKAVGKAGLPAGVYSDTISVELSF >NC_021505|3935972:3971477|3940159_3940663_-|WP_016500511.1|coat|DBSCAN-SWA MRGWLAGGLTGIGMLLAAPLGAVTTSTFTVTAQIVAGCLVVGGVTSYGTLDYGSQSALSKALLSTSLGGSTVTFQCTPGVAMSMSVDGGQNSASGTRNLKRTSGTQVLAYQLYRDAAYSQVLGIGQSVAVSYSDPTAIKLPVYGRVQLTGVLPAGTYTDVVQVTVTW >NC_021505|3935972:3971477|3944843_3945305_+|WP_016500516.1|DBSCAN-SWA MLKPILLVEDNPRDLELTLLALERSQLANEVIVLRDGAEALDYLLRRNTYADRDDGNPAVLLLDLKLPKVDGLEVLRAVRATAELRSIPTVMLTSSREEPDLLRAYELGVNAYVVKPVEFKEFVAAISDLGVFWAVLNEPPPGSLRLNRRGSN >NC_021505|3935972:3971477|3969917_3970634_-|WP_016500533.1|DBSCAN-SWA MQDLSTASPVLTDETETLSENVFRRIQAAIVKGEIAPGSKISEPELARTYGISRGPLREAIHRLEGQRLLVRVPHVGARVVSLNHAELIELYEIRESLEGMACRLAAERMSQGDIDELRRVLDTHERDAAFQAGVGYYQQEGDYDFHYRIIQGSGNQTLVKMLCGELYQLVRMYRIQFSATPNRPRQAFAEHHRILDAIADRDGELAELLMRRHIGASKRNIERHYLDAHNNSPRGEK >NC_021505|3935972:3971477|3939343_3940120_-|WP_016500510.1|DBSCAN-SWA MRAGAKWARGLIGLLWLASLPAGAATSVLIWPIDPVLEADQKAGALWLENRGTEPANLQVRVFAWRQGDYQEQFQAQREIIGSPPVANIAPGQKQLIRLTRTGPSPAGQEQAYRIIIDEIPPAIPVDKAEPGATAAIRLQMRYSVPLFVYGEGLLGKADPEGKRNAEGVGMPQLSWRPVTVQGKPYVEMRNTGPVHARLTDVVVQQGSQSKPLAEGLLGYVLPGASMRWPAPVASTSGSVLKGRVNGQSAADAIRQGQ >NC_021505|3935972:3971477|3941785_3942313_-|WP_041167793.1|coat|DBSCAN-SWA MTERLMAALLGLLFSGSTVAADFLVEVRVLVQRGCMLVNQTRDAGAQALGRIDLGTAARLDGPGAPLSGVLLSQRPPRLECNPDTPYQVRVDGGQHGGVGELRYLASDDHTARPIPYRLYRDAAWREPLAVGVAQSARVPSSGSVELPLYARIDKLAWVPNAGVYADLLKVTVTW >NC_021505|3935972:3971477|3945318_3947706_+|WP_016500517.1|DBSCAN-SWA 
MQQTPLKLLMVEDSSMDAELTLMRLERSGLHVQSQLVFDHVGVDHALREARYDLILCDCVLPGSSGTEVLAIAQRLAPDVPFIFLSGIYGEEHAVEMIRLGATDYVLKKNLPLLPKAVRRALTEVQERQRRRRAEEALIDVEARARIAIDAAGMGTWDLRPQEGLLLWDDRCKTLFGLPTSTEMSLEVFLGGIYPDDLPMVREAVEYAMRPESGGRYRVEFRIAQPNGLEPRWLLSSGQSQFVDGQCVRFSGVLQDIHTQRLATQALRQLNEMLGERVERRTRERDRAWELSQDLLAVLNKDLTPVALNPAWEASLGFSRERLSQSSLLHLLPEADQELLLTELAALAHGRTSVRFVGRILHAGGQQRWLSWVVVPEDTLLYVVARDITSEREAALGLAEANARLREQINERERIEAALQQMQRLEAVGQLTAGVAHDFNNLLTVILTGASFLERDLAKANLDKARTRLTHIREAGERGAKLTSQLLAFSRRQRLEPVALNLNQTLAGLEELLRRTLGGNVSVRLDLDQALWHALTDPTQTEMIILNLAINARDAMPDGGQLTLTTRNTRIDNRPQRPEDPDPGEYVMLSIRDTGCGMSEDVLAKVFEPFFTTKDIGKGSGLGLAQVFGFAKQSGGGVRIDTSPGRGTQVAVYLPAVKDQIVSEPVIPPLGQPVSDSGRNRTVLLVDDDHLVRDLLGDVLRQYGYQVRQAHSGEQALALLDDEIDLLLTDFAMPEFNGAQLALAARERYPRLPVVFLTGYAELQGLELPGSVVVQKPAQADELARVLNEMLGIAG >NC_021505|3935972:3971477|3952662_3953817_-|WP_016500521.1|DBSCAN-SWA MSASRSENRLQRLLPAPLNTSPKEWLRAGIGALLGLFLAGWLTSMAYGPGIALHLLGPLAATAVLVFAVHSGPLAQPWPVLGSYALAGAVGLAMRQGFGPELWVAAAALGVSILVMCLLRCLHPPGGGVAVSAVLADSGLTAMGDHLLEPILLNALILVSVAVLYNRLTGVRYPKGVAPRKDVHHTHDPLPSERVGIRGEDLDQALEELGEFVDVTRDELERIILATEQHALQRSLGGITAGSVMSRDVQFATPETTLEQAWKMLASHHLKTLPVLQQGKLVGIVSLSDLVGPAMQRGRFSWRGLFGRKAVRMEQVMSRRVVSVSSQHPLERLLPLLCEQGLHCLPVLDAGKLVGVITQTDLIAGLKRQLLSNAEASDNRVPAL >NC_021505|3935972:3971477|3948555_3949626_+|WP_016500518.1|DBSCAN-SWA MSAWIPAPLRQLGQRLRGATANQSELLDWFEDKARSRGYQLSDGQRRVIHCMAEQLALLEQGQPRSLYLYGSVGRGKSWLLDGFFQAVPVEAKRRLHFHDFFARLHQGMHRHRALDDALGATLDELVGGCQVLCFDEFHVHDIGDAMLLTRLFNALFARGVFLLVTSNYAPEGLLPNPLYHERFLPVIRLINGRMQVLEVGGGTDFRSLPANREHQRFTQGHYVWPGTAAQRQVLGVPDEQPVMLEVNKRPLRALAIDGRRVVLGFDDLCEKATAVIDYLVLAQEYDEWIIDGLDDLAQCSLAAQQRFVNLVDVLYDQDRQVTVVGKRPLEESLGGPLADLMRTRSRLGQLHQVAP >NC_021505|3935972:3971477|3936937_3939235_-|WP_016500509.1|DBSCAN-SWA 
MTSPVLALADDLPPPPTEMSAIADATLYLDLVVNQMPRAELVPVQQRAGQLYLGSDVLRAAGISLPGNPQGEVALESFTGLHADYDSQNQRLLLQVPPAWLPDQQVGDRNLYPASDARSSFGALFNYDLYLNDTDEGGTYLAAWNELRLFDSWGTFSSTGQWRQSFNGAQADDTRQGFMRYDTTWRFTDEQRLLTYEAGDFVTGALPWSSSVRVGGLQVSRDFAARPDLVTYPLPAFAGEAAVPTSLDLFINGFKSSTTELQPGPYTLTNVPFINGAGEAVVVTTDALGRQVSTTLPFYVTSSLLQKGLSDYSVAAGSLRRDYAVRDFSYGPGVASGSLRYGLSDMFTLETHAETAESLMLGGLGGNMRLGNFGVLNAALAQSRFDGDKGHQVALGYQYNSQRIGFSYQRLQRHGDYADLTRVDSPDMQLSKSSEQVTLSLNLNEYGSIGAGYFDVRAGDGTRTRLINLSYSRPLWGSSSVYLSANREVGDSQWAVQAQLVIPFDLHGTLALGMERSKEGESLQRVNYSHAVPVGGGVGYNLGYAAGSDRDAYRQADVTWRLQSVQLQAGVYGSSGAMTRWADASGSLVWMDAGVFAANRIDDAFVVVSTGGYADVPVRYENQDVGRTDAKGHLLVPYSSGYYRGKYEIDPMDLPPDILAGDVEQRVAVRRGSGYLLEFPLKRVMAASIELVDGNQQVLKLGSRVTHEESGSQAVVGWDGLVYLENLAAHNRLLVELEGGGHCEVAFDLPEAQGSIPLVGPLVCK >NC_021505|3935972:3971477|3949753_3951643_-|WP_016500519.1|DBSCAN-SWA MSYQHSYAHSISDPAAFWAEQAAHLAWHRKPALTLQDNADGTHRWFADGRLNSCYLALDHQIELGRGEQVALIYDSPVTGVQQAYTYNHLRDEVARLAGLLRQLGVEKGDGVIIYMPMVPQAAMAMLACARIGAVHSVVFGGFAANELALRIDDARPTLLLTASCGLEFDKVIAYKPLVDRALQLARHQPRNVLVLQRPQAQAQLQPGRDLDWQVALVGAQPVAPVELDAGDPLYIMYTSGTTGKPKGIVRENGGNAVALCYAMRHIYGMQAGDVWWGISDVGWVVGHSLIVYGPLMSGCTTVFYEGKPIRTPDASAYWRVVEQYKVNALFCAPTAMRAIRKEDPEGELIRKHDLSSLRQLFLAGEKLDSSTHEWLERVSGKPVHDHWWQTETGWPVTAPCVGLDGSAAKPGSSNRAVPGYYVRVVDDEGHLLGPNHQGSIVIALPLPPGCSQTLWGDHERYLQAYLRTYPGYYHTGDGGFLDDDGFVYIMGRTDDVINVSGHRLSTGEMEDLVACHPAVAECAVIGVHDEIKGQVPLALVVLKDGEGIAEAQLLVDLVGSVREAIGPLACFNRVRLVKRLPKTRSGKILRAVLRKIADGQDYVPPSTLDDPAVLGEIEAVLADLPRAG >NC_021505|3935972:3971477|3956945_3958025_+|WP_016500526.1|DBSCAN-SWA MSAIHIKYPALTFKAGDRALRLIRERGLQAADVGVLPGAAGGPKPLGIQGLDLALFGEWLPSAPRPRALIGASIGAWRFASACLEDPIAGLRRLGELYTELDFAKGATPAEISHSCQRMLDDLLQGRDGQLLANPHYHLNILVVKSHGQLAHDHRGRLGLGLGSVVASNLLGRSRLARHFERIILHDVRATPPLDALTDFPSRYLPLDLANLRHALLASGSIPMVMQGVKDIPGVGAGTYRDGGLLDYHLDLPYRGDDLVLYPHFTDKVVPGWFDKALPWRKGDATRLQNVLLMTPSPQYLAALPYGKLPDRNDFKRFMGDAPGRKRYWYKAIAESQRLGDELLELVATGRLHERLQAL >NC_021505|3935972:3971477|3954100_3955420_-|WP_016500522.1|DBSCAN-SWA 
MKSIQARLSLGLVAVLVVVGVVLAQLTLWLFEAGLQRYLENGLRKESENLLVALVRGPSGLQLDERRISAAYQRPFSGYYFRIDFDKGTWRSRSLWDLDMPKPSAPGLSDSHELGPEGQQLLALRADYRRLGQDISISVAQDYSPVREGFRRMQQIGLGMGLVALILVLVLQRITVTRSLRPLERARQQIAQLQQGQRSQLDEQVPSELAPLVGQINHLLSHTEDSLRRSRNALGNLGHALKTPLAVLLNLASSARLKDLPEVSAQMREQLEQIQQRLARELNRARLAGDALPGAQFDCDAELPGLLSTLGMIHGEGLLLARDVPPGLLLPWDREDFLELLGNLLDNACKWADSEVRLSIAPNAGGYQLWVDDDGPGIPENQRLQVLERGSRLDEQVDGHGLGLGIVRDIVDAWGGRLALEQSPLGGLRVSIDLPRKAR >NC_021505|3935972:3971477|3958079_3958436_+|WP_012271598.1|DBSCAN-SWA MEIFKEFTFESAHRLPHVPEGHKCGRLHGHSFKVGLHLTGPLDPHTGWIRDFAEVKAIFKPIYEQLDHNYLNDIPGLENPTSEVIAKWIWDQVKPLMPELSKVRIHETCTSGCEYTGD |
30 | Feldmannia_species_virus(33.33%) | coat,protease | NA |
Homologous phage analysis in the prophage region. Bacterial proteins shown in color are those whose annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
5582950 : 5592235
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NC_021505|5582950:5592235|DBSCAN-SWA GATGAGCGAGAGCGCATTCGCCGAGCGCATCGTGCACAACCTGCTCGACACTGACTTCTACAAACTCACGATGATGCAGGGCGTGTTGCACAACTACCCGGACGCCGACGTCGAATGGGAATTCCGCTGCCGTAACGGCGAGGACCTGCGCCCATACCTGGGCGAGATCCGCCACCAGCTCGAACTGCTCAGCGACCTCACCCTGGATGATGGCCAGCTGGCCTTCCTCGAACGCATCAGCTTCCTCAAGCCCGACTTCCTGCGCTTTCTGCGCCTGTTCCGCTTCAACCTTCGCTACGTGCGTATCGGCATTGAAAACGACCAGCTGTTCCTGCGCCTGAAAGGCCCGTGGCTGCATGTGATCCTGTTCGAAGTGCCACTGCTGGCCATCATCAGCGAAGTGCGCAACCGCCATCTGCACCCGCACATGCGCCTGGCCGAGGCTCGCGACCAGCTGTACCGCAAGTTCGACTGGCTGCGCGCGCATGCCAGCGATGACGAACTGGCCAACCTGCAGGTAGCCGACTTCGGCACCCGCCGGCGCTTTTCCAGCCGGGTGCAGGAAGACGTGGTGCGGGTGCTGCGCGACGACTTCCCGGCCCGCTTCGTCGGCACCAGCAACGTCGACCTGGCATGGAAACTGGATATCAAGCCGCTGGGCACCATGGCCCATGAGTGGATCATGGCTCACCAGCAACTCGGCCCGCGCCTGATCGACAGCCAGATCGCCGCGCTGGACTGCTGGGTACGCGAGTACCGCGGCCTGCTCGGCATCGCCCTGACCGACTGCATCACCATGGATGCCTTCCTCGGCGATTTCGACCTGTACTTCGCCAAGCTGTTCGACGGCCTGCGCCACGACTCGGGCGAGCCGGTGGCCTGGGCGGAGAAGGCCATTGCCCATTACCAGAAACTGGGTATCGACCCGATGACCAAGACCCTGGTGTTTTCCGACGGCCTCAACCTGACCCGCTCACTGGAGATCTTCCGTGCCCTGCGCGGTCGCATCAACGTCAGCTTTGGCATCGGCACCAACCTGACCTGCGACATACCGGGTGTGGCGCCGATGAACATCGTGCTTAAAATGACCGACTGCAACGGCGCGCCAGTGGCCAAGATCTCGGATGAGGCCGCCAAGACCCAATGCCGTGACGAGAACTTCGTCGCCTACATGCGCCACGTATTCAAAGTCCCCAGCAAGGAGTAACCCATGCAAGCGGTTCAGCAAGAGATTGCCCAGGCGCTGAAGGTACAGCCGCCGTTCGCCGACGCTGCAGCGCTCGAGGCCGAAGTCGCCCGGCGTGTGGCGTTCATCAAGGATTGCCTGGCCAACGCCCGGCTCAAGACCCTGGTGCTGGGCATCAGCGGCGGTGTCGACTCGCTGACTGCCGCCTTGCTCGCCCAGCGCGCCGTGAATGAACTGCGGGCCGAAACCGGCGACAAGGCATACACCTTCATTGCCGTGCGCCTGCCCTACCAGGTGCAACATGACGAGCATGACGCCCAGGCCTGCCTGGACGTGATCAAGGCCGATGAAGTGCACACGGTGGATATCGCCCCGGCGGTGCGGGCATTGGCCGCTGAAGTGGTGGAATTGAAGAACGGCTCGCCAACGCTGGTGGACTTTGTGGTGGGCAACGTCAAGGCACGTACCCGCATGGTTGCCCAGTACACCATCGCCGGGGCCCGCGCGGGCCTGGTGATCGGTACCGACCACGCCGCCGAGGCGGTAATGGGCTTCTTTACCAAGTTTGGTGATGGTGCCTGCGACCTGGCGCCCCTGAGCGGGCTGGTGAAGAACCAGGTACGGGCGATTGCGCGCAGCTTTGGCGCACCGGAGTCACTGGTGGAGAAGGTGCCGACGGCGGACCTTGAAGACCTGGAGCCGGGCAAGCCGGACGAAGCATCCCATGGCGTGACCTACC
AGCAGATCGACGCCTTCCTGCATGGGTTGCCGGTTGATCAGGCGGCGTTCGACATCATCGTCGCCACCTACCGCAAGACCCAGCACAAGCGCGAACTGCCGTTCGCCCCATAAAAGCTTCGCGGGCACGCCCGCTCCCACAGGATTTTCACTGCCCTTGAAAACAGTGCTTACCTGTGGGAGCGGACGTGCCCGCGAATAGGCCCGAGAGGCCTACACAGCTGTCATTACTTGACTACAACCTTGCCTTTCATCATCGAGATGTGGCCCGGGAATGTGCAGAAGAAGCTGTAGTCACCACCGGCTTCCAGCTTGGAAGTGTCGAACTTCACTTCGGTCTCTTTTTCCGGTGCGCCGATCATCGCGGTGTGGGCGATGATGTTGGCGTTATCTTCCTTCAGGTAGCCCTTGTCGATGCCCTGGGTCATGCCTTCAGTGGCAATGCCCTGCATGTCGGCAGTCTTGCTGATCACCAGGTTGTGGCCCATGACGTTCTTCGGCAGGTTGCCGGAGTGGGTCAGTTTGACGGTGAATTCCTTGCAGCTCTTGTCGACGGTGAATTCTTTGGTGGTGTAGGACATCTGGTCCGTCGATTCAACAGTCACCGAGCATTCGGCTGCAAAGACAGAGGCGCTGGCGAGGGTCAGCAGGGATACCGCTACAGCTTTCGCAAACATCATGAATCTCCTTGGCAGGGTTTTATCAATTGCGAGACTGCCTGAAACCGTTCAGCCCCTTGCTGATATGGGTCAAGGGGGTGTCAACAGGCCAGCCAGTAGAATTGTTCAATGGATTGTATACAACCAATCTAAGAGCACATCATGCCCTTTTAATAGACGATGGGACAACCTCTCATTTAGGAGCAATACCAAATGCCCATCTCTGCATTTATCAACAGCCTGCTCGCCGCCTATGCCCACGGGGCAACGGGCGCCATCTGCGAATTCTCCCGCCCGGAATGAACGTGGCCTTGGAAGCGACGGGCATGCGCCGTACCCTGTCCACCTGAGTGACAGGAGATGAGCAATGCCCGTACGTTCCGTTTGTGTGTTCTGCGGCGCCAGCATCGGCGCCACCCCTGCCTACCGTGAAGCAGCCATCGCACTTGGCCAGGCCATTGCGCGCCGCGGTCTGACCCTGGTCTATGGCGGCGGCGCTGTCGGCCTGATGGGCACTGTTGCCGACGCGGCCATGGCGGCCGGTGGCGAAGTGATCGGGATCATTCCCGAAAGCCTGATGAACGCCGAAATCGGCCACAAGGGCCTCAGCCGCCTGGAAGTGGTCGACGGCATGCATGCCCGCAAGGCACGCATGGCCGAGCTTAGCGATGCCTTCATCGCCCTGCCGGGTGGGCTGGGTACGCTGGAAGAATTGTTCGAGGTATGGACCTGGGGGCAACTGGGCTATCACGCCAAGCCGCTGGGGCTGCTGGATGTGAACGGCTTCTACGAGAAACTGGGCGGGTTCCTCGACCATATCGTCGAAGAAGGCTTTGTGCGGCCGCAACATCGGGCGATGTTGCTGCTGGGGCAGCAGCCGGACGCGCTGCTGGACGGGATGGACAGTTTTGTGGCGCCGGTGGTGCCGAAGTGGGTCGACAAGCAGCCTGACTAAACCCGATACGGGGGCTGCTTTGCAGCCCAATCGCCGGCAAGCCAGCTCCCACAGAGATTGCACAAAGCCCAAAGCTTGCACTGTACCTGTGGGAGCTGGCTTGCCGGCGATTGGGCCGCAAAGCGGCCCCCGACATTAGCGTGGGATAACTGGCTGGCGCGGCTTCTTGTTGCCCTTGCCGCCCTTGGCTGCTTCCTTGCGCTCCTTGGCCGCCTGCTGGTTACGCGCAAACGCCTCGGCCTTGGCCTTCTCACGCTTGTCCCACGGCTTGCTGCCATCGCTGCCACGCGGTGGCAGGCCGGTGTGCTGGGTCAGGATCTTCTGCTCCTTGCCCACCTTGTGGCTGCCCGCTGGCGTCGAGTTCTTGCGGCGCGCGCTCTGGTACGT
GTCGGATTGCGGCTGGTGCAGCGGGATCAGCTGGTGCTTGCCCGGCCCAATCAGGTCGGCGCGGCCCATACGCTCCAGCGCTTCACGCAGCATCGGCCAGCCCTTCGGGTCGTGGTAGCGCAAGAACGCCTTGTGCAGGCGGCGTTGCTCGTCGCTCTTGACGATCTCCACCCCTTCACTCTTGTAGGTCACTTTGCGCAGCGGGTTCTTGCCCGAGTGGTACATGGCCGTGGCCGAGGCCATCGGCGACGGGTAGAACGCCTGCACCTGGTCAGCGCGGAAGCCGTTACCCTTCAGCCACAGGGCCAGGTTCATCATGTCTTCGTCGGTGGTGCCCGGGTGCGCGGCGATGAAGTACGGGATCAGGTACTGCTCCTTGCCCGCCTCTTTCGAGTACTTCTCGAACATACGCTTGAAGCGGTCGTACGAGCCGATACCCGGCTTCATCATCTTGTCCAGCGGGCCACGCTCGGTGTGCTCCGGGGCAATCTTCAGGTAGCCGCCCACGTGGTGGGTAACCAGCTCCTTGACGTACTCCGGCGACTCCACGGCCAGGTCGTAGCGCAGGCCCGAGGCGATCAGGATCTTCTTCACACCTGGCAGGGCACGGGCCTTGCGGTACAGCTCGATCAGCGAGCTGTGGTCGGTGTTGAGGTTTTCGCAGATACCCGGGAACACGCACGACGGCTTGCGGCAGTGCTTCTCGATTTCATGGCTCTTGCAGGCGATGCGGTACATGTTGGCGGTCGGCCCGCCAAGGTCGGAGACCACGCCGGTGAAGCCCGGCACCTTGTCGCGCATCTCTTCGATCTCGTGCAGGATCGACTCGTGCGAGCGGTTCTGGATGATGCGGCCTTCGTGCTCGGTGATCGAGCAGAAGGTGCAGCCACCAAAGCAGCCACGCATGATGTTCACCGAGAAACGGATCATCTCGTAGGCCGGGATGCGCTCCTTGCCATAGGCCGGGTGTGGCACACGGGCGTAGGGCATGCCGAACACGTAGTCCATTTCTTCGGTGGTCATGGGGATGGGTGGCGGGTTGAACCACACATCCACTTCGCCATGCTTCTGCACCAGGGCGCGGGCGTTACCCGGATTGGTCTCCAGGTGCAGCACGCGGTTGGCGTGGGCATAGAGTACCGGGTCGTTACGTACTTTTTCGAACGACGGCAGGCGGATTACCGACTTTTCCCGGGTCACGCTCGGGCTGTCGAGAATCTGCACGACCTTGGCTTCGTTCGGGTCTTCCTGGTCGCCCTTGGCCTGCTCGATGGCGCAGGCCTGGGTGTCCTGGGTGTTTACGTACGGGTTGATGATCTTGTCGACGCGGCCCGGGCGGTCGATGCGGGTGGAGTCGATCTCGAACCAGCCCTGCGGGGTATCACGGCGCACGAACGCGGTGCCGCGGATGTCAGTGATGCTCTCGATCGTCTCGCCACTGGCCAGGCGCTGGGCCACTTCCACCACCGCACGCTCGGCGTTGCCGAACAGCAGGATGTCGGCGCTGGCGTCGATCAGGATCGAGTGACGCACCTTGTCCTGCCAGTAGTCGTAGTGGGCGATGCGGCGCAGCGAAGCCTCGATGCCGCCGAGTACGATCGGCACATGCTTGTAGGCTTCCTTGCAGCGCTGGCTGTACACCAGGCTGGCGCGGTCCGGACGGCTGCCGGCCAGGCCACCTGGGGTGTAGGCGTCGTCAGAACGGATCTTCTTGTCTGCGGTGTAGCGGTTGATCATCGAATCCATGTTGCCTGCGGCCACGCCGAAGAACAGGTTCGGCTCGCCAAGCTTCATGAAGTCGTCTTTCGACTGCCAGTTCGGCTGGGCGATGATGCCCACGCGGAAGCCCTGGGCTTCCAGGAGGCGGCCGATGATGGCCATGCCGAACGACGGATGGTCGACGTAGGCATCACCGGTCACGATGATGATGTCGCAGGAATCCCAGCCGAGCAGATCCATCTCCTGCCTGCTCATTGGCAGGAAAGGTGCTGGCCCGAAGC
ATTCGGCCCAGTACTTGGGATAGTCGTAGAGTGGTTTGGCTGCTTGCATGTCAGTGACCGGTTCTGGTGTGCAGGGAAATCGCGGGCGCGGAATATAGCACAAATTTTGACCAAATCCGACTGGATTCGTGGGGATTGGTGGGTGGGGCGACGGTTGGGTGGGCGGGGGTTCCGGATGGCTGGCTTGGGGGTTGTGGTGGTGGGGTTGTGGTGGGGTTGCCAGGTGTTCATGCATCCTGAAGCCAACACCTCACTGCTTCATCCCCAACTCAGCATCATCCATCAGCGCCTTGGCCATCGCACTCAGGTAATGGGCAGCCCAGATCATCATCGGCTTCTCATCCATCAACCCGGTGATGGTCAGTTCGCGCAAATACCCCATCAACTCCGAAGACTGCTCGCGGGCGTTTCGGCACGGGATACCCGGCTCAATGCGGAACAACGGGTGGGTGCCATTTTCGCCTTGATAGAACGTGGTCTTACCGACGGTGAATTGGGTGTCTTCTGTTGTCATTGTTCAATCCCCTCGGAATTCTCGACCGCCTGGCCGGGCCTCTTCGCGGGCACGCCCGCTCCCACAAGGTCACCACAATCTTCGCGACGTGCACCGTACCTGTGGGAGCGGGCATGCCCGCGAAAGGGCACATCCAGGCGACAACCAGGTCAAGAATCGTAGCGCGCTCCGCAAGACTTTCGTCTTGAGGCCTGCCAAGAGCTGTCGGGCGTACGCCGGCAGGTTTGCTGCGCCTGTCAGATTGAGTGCCGCCTTTGCGGAGCTTCGCAAGCGGAGCTCTCTATTTATCGGCATTTACGACGACATGTCCCCTGAGCGCACATCCAGAATTGATAGACAATTCTGCCTAGCATTTTTGTCAGTACCGCGAAGCCAACTCAAGAAGTACAGTAAAATTTTAAAGCTACTAGCAGCGAGGATGAAATATGCGATACTTTTATTGTGAAGATACATTGTCACATATTGCGCATGCCTCCACGAGCGAATCATTTTTCAGGTACGGTAGATACAACCTCTCGCTTAAAGTTCAGCAGAACACCCAGCGACTTCTGAGCGTTGATCTTTTTAACACTCCACTTCTCGCCACCACTCCAAACCAAGTTGTGTCTTACCGCTACAGCGCTTTTGGATTCAGCAGCCCCTCTAACAACGAATGGGGCTTCAAAGGTGAGCGAAAAGACCCTATCAGCAGCGGTTACCTACTCGGAAATGGTCGCAGACTCTACAACCCGTCCATCATGCGATTTACCAGCCCAGACCCCTTAAGCCCATTCTCCAAAGGTGGCCTTAACTACTACGCATTCACCCTGAACGACCCTATTAACGGCAGTGACCCCACCGGATTGGTTACTCAATCTATAATTAACTTTAATGCCAAGCTTCACCCGAAGAAAACGTATCGTGGCGATATACTGTGGCAACATGATGGGATAACTGCTTTCGCTGAAAAGCGTCGAACAGATGGCAATCTCGACACCCTTTACATCCTGAGTCACGGCGAAAAAGGTGTGTTATCGGGAAATGAACTCAACTATTCAGCTTTGGATATTTTCATCAGACTTAACCAAAAGGGTATTAAAATGCAGGGTAGACAAACACATTTTTTAGCTTGCTACTCTGCTACACCTGAGTATTACGGGGGCTATTCAGTCGCTGATGAAATGGCCAAACTAACCGGGGTACAATCCTCAGGTTACGATGGCCCCGTCTCCGTTGCCGATGAAATCGACAAGAGCGGAAAGTTTGTCGCTCACCGCATCAGCAATCCAATTAATAATTTTCTTTTCGGGTTTACCGCAACCAAAGTCAGGGGGGGCAACATTCGAAACCCCCACAAAACACAAAATACAGGGCCTGGCGGCAGACACTGGTAGCGGCAGACTCAGCACCTGTAATGTATTCGTAGTCTCCCACCAACTGGCCGACCACTTTGATTAACCTCAGCAGTCGTCACTCATCATCATCAAAGTTGTACATCCCCGGCGCC
AAGTTTTCGAAGCGGGTGTACTTACCAATAAACGCCAGGCGTACAAAGCCGATGGGGCCGTTACGCTGTTTGCCGATGATGATTTCGGCAATGCCCTTGTGCTCGGTCTCCGGGTGATACACCTCGTCACGGTAGACGAACATGATCACGTCGGCGTCCTGCTCGATTGCACCGGACTCACGCAAGTCGGAGTTGATCGGGCGCTTGTTTGGCCGTTGCTCAAGGGAGCGGTTCAGCTGGGACAGTGCTACGACCGGGCAGTTGAACTCTTTGGCAAGGGCCTTGAGGGAGCGAGAAATCTCGGAAATTTCGTTGGTCCGATTATCACCACCGGAACCTGGAATCTGCATCAACTGCAGGTAGTCGACCATGATCAGGCCGATTTCGCCGTGCTCACGCGCCAGGCGGCGAGTCCGCGAACGCATTTCCGAAGGGCTGATGCCCGCTGTATCGTCGATGAACAGCTTGCGGTCGTTGAGCAGGTTGACTGCCGAAGTCAGGCGCGGCCAGTCGTCGTCGTCCAGCTGGCCAGAACGCACCTTGGTCTGGTCAATACGGCCCAAGGACGAGAGCATACGCATGATCAGCGATTCACCTGGCATCTCGAGGGAGAACACCAGCACAGCCTTGTCGCTGCGCAGCACGGCATTCTCGACCAGGTTCATGGCAAACGTGGTCTTACCCATCGAGGGTCGGCCGGCGACAATGATCAAGTCCGCCGGCTGCAGGCCGCTGGTCTTCTCGTCCAGGTCGGTGTAACCGGTGGAAACACCGGTAATTTCGCTGTCAGAGTTGAACAGCGTATCGATGCGGTCGATGGCCTTGGTCAACAGCTCGTTGACGCCTACCGGGCCGCCGGTCTTTGGCCGCGCCTCGGCAATCTGGAAAATCTGCCGTTCGGCGTCGTCGAGAATTTCCTCGGCGTTGCGACCTTGGGGGTTGAAGGCGTTGTCGGCAATATCGGTGCTGATGCTGATCAGCTGGCGCAGCGTGGCGCGCTCGCGAATGATCGCGGCGTAGGCCTTGATGTTGGCCACCGATGGCGTGTTCTTGGCCAGCTCCGCCAAGTAAGCCAGGCCGCCGACCTGCGACGAAACGCCTTCCTTGTCCAACTGCTCGTGCAACGTGACCACGTCGAACGGGTGGTTAAGGTCCACCAGCTTATGGATGGCACGGTAGATCAGGCGATGGTCATGCCGGTAGAAATCGCCATCCGAAACCTGATCCAGCACCCGCTCCCAGGCGTTGTTGTCCAGCATCAGGCCACCGAGCACGGCCTGTTCGGCCTCGATGGAATGCGGCGGCACCTTCAGGGCGGCGGTTTGCAGGTCAAGCTGTTCGGAGGTTGTGATCTCGTTCAT
Protein sequences of DBSCAN-SWA_5 >NC_021505|5582950:5592235|5585100_5585550_-|WP_016501867.1|DBSCAN-SWA MFAKAVAVSLLTLASASVFAAECSVTVESTDQMSYTTKEFTVDKSCKEFTVKLTHSGNLPKNVMGHNLVISKTADMQGIATEGMTQGIDKGYLKEDNANIIAHTAMIGAPEKETEVKFDTSKLEAGGDYSFFCTFPGHISMMKGKVVVK >NC_021505|5582950:5592235|5584159_5584987_+|WP_016501866.1|DBSCAN-SWA MQAVQQEIAQALKVQPPFADAAALEAEVARRVAFIKDCLANARLKTLVLGISGGVDSLTAALLAQRAVNELRAETGDKAYTFIAVRLPYQVQHDEHDAQACLDVIKADEVHTVDIAPAVRALAAEVVELKNGSPTLVDFVVGNVKARTRMVAQYTIAGARAGLVIGTDHAAEAVMGFFTKFGDGACDLAPLSGLVKNQVRAIARSFGAPESLVEKVPTADLEDLEPGKPDEASHGVTYQQIDAFLHGLPVDQAAFDIIVATYRKTQHKRELPFAP >NC_021505|5582950:5592235|5589124_5589388_-|WP_016501870.1|DBSCAN-SWA MTTEDTQFTVGKTTFYQGENGTHPLFRIEPGIPCRNAREQSSELMGYLRELTITGLMDEKPMMIWAAHYLSAMAKALMDDAELGMKQ >NC_021505|5582950:5592235|5589813_5590761_+|WP_016501871.1|DBSCAN-SWA MRYFYCEDTLSHIAHASTSESFFRYGRYNLSLKVQQNTQRLLSVDLFNTPLLATTPNQVVSYRYSAFGFSSPSNNEWGFKGERKDPISSGYLLGNGRRLYNPSIMRFTSPDPLSPFSKGGLNYYAFTLNDPINGSDPTGLVTQSIINFNAKLHPKKTYRGDILWQHDGITAFAEKRRTDGNLDTLYILSHGEKGVLSGNELNYSALDIFIRLNQKGIKMQGRQTHFLACYSATPEYYGGYSVADEMAKLTGVQSSGYDGPVSVADEIDKSGKFVAHRISNPINNFLFGFTATKVRGGNIRNPHKTQNTGPGGRHW >NC_021505|5582950:5592235|5590837_5592235_-|WP_016501872.1|DBSCAN-SWA MNEITTSEQLDLQTAALKVPPHSIEAEQAVLGGLMLDNNAWERVLDQVSDGDFYRHDHRLIYRAIHKLVDLNHPFDVVTLHEQLDKEGVSSQVGGLAYLAELAKNTPSVANIKAYAAIIRERATLRQLISISTDIADNAFNPQGRNAEEILDDAERQIFQIAEARPKTGGPVGVNELLTKAIDRIDTLFNSDSEITGVSTGYTDLDEKTSGLQPADLIIVAGRPSMGKTTFAMNLVENAVLRSDKAVLVFSLEMPGESLIMRMLSSLGRIDQTKVRSGQLDDDDWPRLTSAVNLLNDRKLFIDDTAGISPSEMRSRTRRLAREHGEIGLIMVDYLQLMQIPGSGGDNRTNEISEISRSLKALAKEFNCPVVALSQLNRSLEQRPNKRPINSDLRESGAIEQDADVIMFVYRDEVYHPETEHKGIAEIIIGKQRNGPIGFVRLAFIGKYTRFENLAPGMYNFDDDE >NC_021505|5582950:5592235|5582950_5584156_+|WP_016501865.1|DBSCAN-SWA 
MSESAFAERIVHNLLDTDFYKLTMMQGVLHNYPDADVEWEFRCRNGEDLRPYLGEIRHQLELLSDLTLDDGQLAFLERISFLKPDFLRFLRLFRFNLRYVRIGIENDQLFLRLKGPWLHVILFEVPLLAIISEVRNRHLHPHMRLAEARDQLYRKFDWLRAHASDDELANLQVADFGTRRRFSSRVQEDVVRVLRDDFPARFVGTSNVDLAWKLDIKPLGTMAHEWIMAHQQLGPRLIDSQIAALDCWVREYRGLLGIALTDCITMDAFLGDFDLYFAKLFDGLRHDSGEPVAWAEKAIAHYQKLGIDPMTKTLVFSDGLNLTRSLEIFRALRGRINVSFGIGTNLTCDIPGVAPMNIVLKMTDCNGAPVAKISDEAAKTQCRDENFVAYMRHVFKVPSKE >NC_021505|5582950:5592235|5585899_5586487_+|WP_016501868.1|DBSCAN-SWA MPVRSVCVFCGASIGATPAYREAAIALGQAIARRGLTLVYGGGAVGLMGTVADAAMAAGGEVIGIIPESLMNAEIGHKGLSRLEVVDGMHARKARMAELSDAFIALPGGLGTLEELFEVWTWGQLGYHAKPLGLLDVNGFYEKLGGFLDHIVEEGFVRPQHRAMLLLGQQPDALLDGMDSFVAPVVPKWVDKQPD >NC_021505|5582950:5592235|5586622_5588923_-|WP_041167906.1|DBSCAN-SWA MQAAKPLYDYPKYWAECFGPAPFLPMSRQEMDLLGWDSCDIIIVTGDAYVDHPSFGMAIIGRLLEAQGFRVGIIAQPNWQSKDDFMKLGEPNLFFGVAAGNMDSMINRYTADKKIRSDDAYTPGGLAGSRPDRASLVYSQRCKEAYKHVPIVLGGIEASLRRIAHYDYWQDKVRHSILIDASADILLFGNAERAVVEVAQRLASGETIESITDIRGTAFVRRDTPQGWFEIDSTRIDRPGRVDKIINPYVNTQDTQACAIEQAKGDQEDPNEAKVVQILDSPSVTREKSVIRLPSFEKVRNDPVLYAHANRVLHLETNPGNARALVQKHGEVDVWFNPPPIPMTTEEMDYVFGMPYARVPHPAYGKERIPAYEMIRFSVNIMRGCFGGCTFCSITEHEGRIIQNRSHESILHEIEEMRDKVPGFTGVVSDLGGPTANMYRIACKSHEIEKHCRKPSCVFPGICENLNTDHSSLIELYRKARALPGVKKILIASGLRYDLAVESPEYVKELVTHHVGGYLKIAPEHTERGPLDKMMKPGIGSYDRFKRMFEKYSKEAGKEQYLIPYFIAAHPGTTDEDMMNLALWLKGNGFRADQVQAFYPSPMASATAMYHSGKNPLRKVTYKSEGVEIVKSDEQRRLHKAFLRYHDPKGWPMLREALERMGRADLIGPGKHQLIPLHQPQSDTYQSARRKNSTPAGSHKVGKEQKILTQHTGLPPRGSDGSKPWDKREKAKAEAFARNQQAAKERKEAAKGGKGNKKPRQPVIPR |
8 | Agrobacterium_phage(16.67%) | NA | NA |
Homologous phage analysis in the prophage region. Bacterial proteins shown in color are those whose annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
5816548 : 5822873
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NC_021505|5816548:5822873|DBSCAN-SWA CGTGAAAGCCCTCGACCAACTCACCTTCGACAACCGCTTCGCCCGCCTGGGCGATGCGTTCTCCACCCAGGTGCTGCCCGAGCCCATTGCAGACCCGCGCCTGGTGGTAGCCAGCGAGTCGGCCATGGCCTTGCTCGAGCTCGACCCCGCCCAGGCCGACCTGCCGGTGTTCGCCGAGCTGTTCAGCGGCCACAAGCTGTGGGAAGAAGCCGACCCACGGGCGATGGTCTATTCCGGCCACCAGTTCGGCTCGTACAACCCACGGCTGGGCGATGGCCGCGGCCTGCTGCTGGCCGAAGTACTCAACGACAAAGGCGAGCACTGGGACCTGCACCTCAAGGGCGCCGGCCAGACCCCTTACTCACGCATGGGCGATGGCCGCGCCGTGCTGCGCTCGTCGATACGCGAATTCCTTGCCTCGGAAGCCTTGCACGCACTGGGTATCCCAAGCAGCCGGGCGTTGTGCGTGATCGGCTCCAGCACCGCGGTGTGGCGTGAAACCCGCGAAAGCGCCGCCATGCTCACGCGCCTGGCGCAGAGCCATGTGCGTTTCGGCCATTTCGAGTATTTCTACTACACCAAGCAGCCCGAGCAGCAGCGCGTGCTGATCGACCATGTCCTTGAGCAGCACTACCCCGAATGCCGTGACGCCGAACAACCGTACCTGGCCATGTTCCACACCATCGTCGAGCGCAACGCCGAACTGATTGCCCGCTGGCAGGCCTATGGCTTCTGCCACGGGGTGATGAACACCGACAACATGTCGATCCTGGGCATCACCTTCGACTTCGGCCCTTACGCCTTCCTGGACGACTTCGACGCCAACTTCATCTGCAACCACTCCGACGACCGTGGCCGCTACAGCTACGCCAACCAGGTGCCCATCGCCCACTGGAACCTCAGCGCGCTGGCCCAAGCCCTGACCACCGTGATCGAAGTTGAGCCGTTGAAGGAAGCGTTGGGGCTGTTCCTGCCGCTGTACCAGGTCCATTATCTGGACCTGATGCGCCGGCGTCTGGGCCTGACCACTGCCGAGGACGACGACATGGCGCTGGTCGAGCGCCTGCTGCAGTGCATGCAGCGTGGCGGCGTGGACTACAGCCTGTTCTTCCGAAAACTCGGTGAGCAGCCGGTGGCCGATGCGCTGAAAGTGGTGCGCGACGACTTCATCGACCTGGCCGGCTTCGATGCCTGGGGTGCAGATTACCTGGCCCGCTGTGAACGTGAACCCGGCAATGCCGAAGGCCGCCGTGAACGGATGCATGCGGTGAACCCGCTGTATGTGCTGCGCAACTACCTGGCGCAGAAGGCCATCGAAGCGGCTGAAGCGGGGGACTACAGTGAAGTGCGGCAATTGCACCAGGTACTGACCAGACCCTTCGAGGAGCAGCCCGGGATGCAGGCCTACGCCGAGCGGCCGCCGGAATGGGGCAAGCACCTGGAAATCAGCTGCTCTTCCTGATTGATGATTGCTCTACCGGCCCTATCGCCGGCAAGCCAGCTCCCACAGGTAACGTGCCAGACTTGATGCAAGCGGTACCCCTGTGGGAGCTGGCTTGCCGGCGATAGGGCCCGAACAAACAACACAAGGAATTGATATGTCCGACCCACTGGTAATCCCCTGCCCCCACTGCAACGGCCTCAACCGCCTCCCCGCCGAGCGCCTGGGCGATGCCCCGAAATGCGGCCGCTGCAAACAGGAGGTGATGTTGAGCACCCCCTTCGAGCTCACCGAAGCCAGCTACGCCAGCCAGATCAAAGGCGACCTGCCGCTGCTGGTCGACATCTGGGCCGACTGGTGCGGCCCGTGCAAATCCTTCGCCCCCACCTTCGAACAGGCTGCCCGCCAACTTGCCGGCCGCTGCCGTCTGGCCAAGCTCGACAGCGAAGCCAACCGCAACCTCGCCGGGCAACTGGGCATACGC
TCGATCCCCAGCTTGCTGCTGTTCAAGAACGGGCGCGAAATCAGCCGCCAGGCCGGGGCGTTTCCACTCCAGTCGTTGCTGGAGTGGGTGCGTAGCCAAGGGGTCTAATTCTCAGGCATTTGCCAGTAGCGACTCATAGGGAATACGAAGGCCATCATGCAGGCGCTTGATCATCGAAAGGCTTAGCCCACGCTTCCCATTCAATACTTCCGATACTCGGCCGCTTGGGCCGATATATGGCTCAAGATCACGTGGGGTAAGGCCCTGCTGATCCATGCAGAATTTGATGGCCTCGATGGGATTGGCGGGATGAATGGGATAGTGCTTGTTTTCGTACACTTCAATCAGTGTCACAAGCACCTCCATTTCATCAGCTTCTGGCGTGCCCGCTTCTGCCTGGAAAATGGCCTCGAGCCGCTGGAACGCGGCTCGAAGGTCGTCATCATTGCGGATCGGCTTAATATTCACAGACTGTCTCCACGTCAATCTCGTCGTAGCGCTTGTGCGTACCGACGAACTTCACCCAGGCAATACCGGCTCGATACTGCATCTCGACCACAAGCCGATACTTGTTCCCCCCGATGTTGAAAACCACGCGGTTCTTGCCGCAGATGCTGGCATTGCCAATCTGATCTTTTACATCCTGTGGCGTTCGCCAGATAGCTTTGAGCGCCATATCGTGCCAGCTTTCCAAGGCCGCTTTGGCATCTTCATGCCCAGGCAATTCCCAGAACTTCACCAAGCTGCTTTTGGCAATGACGCGCATCCAAGCAAATCTCCCGTTTTGGGAGATTGTGCTCGGGACTTGGATAAAGTGCAAGCGTCAGGCAGGCGCTATTCGCTGCGCTCAAGCAAATCGTGCAACTCGACAAACTGCTGGGTCAGCTTGTGCCGCGGCTCCAGGTGAATCAGCGGCAGGCTGGCGTGGTGCGATTCGCGCATCTTCACCGAGCTGCCCAGGTACACCGGAAGCACCGGCAGGCCTTCGGCCAGCAGCTCGTCGAGCATCTGCTGCGGCAGGCTGGCACGCGACTGGAACTGGTTGACCACGATGCCTTCGACCATCAGGTCTTCGTTGTGGTCTTCCTTGAGGTCTTCGATCTCGGCCAGCAGGCCGTACAGGGCCTGGCGCGAGAAGCTGTCGCAGTCAAAGGGGATGAGCACACGATCAGCCGCGATCAACGCGGAAACCGCGTAGAAGTTAAGCGCCGGCGGGGTATCGATGTAGATACGCTCGTAGTCCTCGTCCAGCTCGTCGAGCAACTTACGCAGCTTGTTGATCTTGTGCTTGGCCTCAAGCTTGGGCTGCAGGTCGGCCAACTCGGCAGTGGCCGTGACCACGTGCAGGTTGTCGAACGGGGTTTCGTAGATGTCGACCTTGTTCTTCTTGCTGAACGGCCCGCTGGACAGGCTTTGCTTGAAGAAGTCGGCGATACCCATGGGGATGTCCTCGCCGGTCAGGCCGGTGAGGTACTGGGTCGAGTTGGCCTGGGCATCCAGGTCGATCAACAAGGTCCGATAGCCTTCATTGGCACTGACCGCCGCCAGGTTGCAGGCGATGCTCGACTTGCCCACGCCACCTTTCTGATTGAACACCACGCGCCGCATGATTGACCTCCGTGTTTCAACGAATGCCCGAGTATGTGGCGGCACTGCGGCAATGGCCAGCGCCATTTGGCCTGCATTACCCCTGGCCGACCACCTGCCCCGTCAACAGCTCGGCAAACGCCTTGGCCATCACCGAATTGCCGCGCTCGCGCTGCACCAGCCACACCGCCGACATCGCCCCCTCATCCAGCAGCGTGCGGTACACCACGCCTTCGATGCGCATGCGCTGGAACGACGCCGGTAACACCGACACCCCCAACCCCGCCGACACCAGGCCGATGATGGTCATCGCCTCCCCGGCCTCCTGGGCAAAGTGCGGGCTGAAGCCTGCCTGCCGCGCCAGGCTGAGCAACTGGGCATGCAGGCCGCTGCCATAACTGCGCGGGAAGAAT
ACGAACGGCTCGTGGGCCAGCGCCGCCATGTGCACCCCTTGCTCAGTGCTTTCAGCCAGCGGATGCGATGCGTTGATGACCGCCACCAGCGGCTCGCGGAACAACTCGGTGGCCACCAACCCTTCTGGCAACGGCATCGGCCGCATCAGCCCCACTTCGATGGACTCGTCGAACACCCCTTCGGCCACATCGCGGCTGCTCATTTCCTTGAGGTTCAGGTGCACCGCCGGGAAGCGCTGGCGGAAGGCATGAATGGCCTTGGGAATCTTCGAGGTGAACGGGGCCGACGAGGTAAAGCCGATCTTCATCTCGCCCAGCTCCCCCAACTGCGCGCGCCGAGCCACGTCGGCGGCCTTCTCCACCTGCGCCAGCACCTGACGGGCCTCTTCGAGGAACAGCCGCCCTGCCTCGCTCAGTTCCACCCGACGGTTGGTACGCTCGAACAGCCTGGCCCCCAGTTCCTGTTCCAGGGCCTGGATCTGCTGGCTCAGGGGCGGCTGGGAAATACCCAGTTGCTGGGCGGCACGACCAAAGTGCAGTTCTTCGGCCACGGCGATGAAGTAACGCAGATGACGCAGCTCCATGATCAGCTCCAAATGATTCGAAAAACGTCTTAAATAGGTCGAACAATATATTGGATCTAATCATTAGCCAGCTATATGCTTTTTTCATTGCGCCAGAGGTACCCGCCCGTGAAAACTGCTGTAGCCCCCCTTCCCGCCGAGCCAGAGCCTGCTCTACTGAACGAAATGTGGATTGAAAAAGGCACCCCGGCCTTCATGAAGACCGTGCTGGCCCTGTTCAGCGGCGGCTTCGCCACCTTCGCCCTGCTGTACTGCGTGCAGCCGATGATGCCGCTGCTGTCGAAGGAGTTTTCCATCAACGCGGCCCAGAGCAGCCTGGTACTGTCGGTGTCCACCGCCATGCTGGCGTTCGGCCTGTTGATCACCGGCCCCATTTCTGACCGTATCGGGCGCAAGCCGGTGATGGTCTTTGCCCTGGTTTGCGCCGCCCTCTCCACCTTGGCCAGCGCGGTGATGCCAAGCTGGGAACTGGTACTGGCCACCCGCGCCTTGGTTGGTCTGTCACTGAGCGGCTTGGCTGCCGTGGCCATGACCTACCTGAGCGAAGAAATCCACCCACAGCACATCGGCCTGGCCATGGGCCTGTACATCGGCGGCAATGCCATTGGCGGCATGAGCGGCCGGCTGATTACCGGCGTGCTGATCGACTTCGTCAGCTGGCACACGGCCATGCTGACCATCGGTGGCCTGGCCTTGGTCGCCGCACTGGTTTTCTGGAAGGTGCTGCCCGAATCGCGCAACTTCCGCCCGCAGATGATGAGCCCGCGCAGCCTGCTGGACGGTTTTGTCATGCACTTCAAGGATGCCGGGCTGCCTTGGCTGTTCCTTGAAGCCTTCCTGCTGATGGGCGCCTTCGTCACCTTGTTCAACTACATCGGCTACCGCTTGCTGGCCGAGCCTTACCATATGAACCAGGCGCTGGTGGGCTTGCTGTCGGTGGTCTACCTGTCCGGCATCTACAGCTCGGCACAAGTCGGTGCCCTGGCGGACAAGCTGGGCCGGCGCAAGGTGTTCTGGGCCAGTATCGTGGTGATGGCGGGTGGCTTGCTAATGACCCTGGCCAGCCCGCTGGCGATGGTGATCGTGGGCATGCTGGTGTTCACCTTCGGCTTCTTTGGCGCGCACTCGGTGGCCAGCAGCTGGATCGGCCGCCGGGCGCTGAAGGCCAAGGGGCAGGCATCGTCGCTGTACCTGTTCAGCTATTACGCAGGGTCCAGCGTGGCGGGTACGGCGGGCGGGGTGTTCTGGCACCAGTGGGGCTGGAACGGCATAGGACTGTTCATTGGCAGCTTGTTAGCCGTGGCGTTGCTGGTGGCGCTGCACCTGAGCAAGTTACCACCCAAGACAGCCTGAGCATGGGGCTGCTTCGCAGCCCAATCGCCGGCAAGCCAGCTCCCACAGGAACCCACAAGGCTC
ATGACCTGTGGAGTACTTGTGGGAGCTGGCTTGCCGGCGATTGGGCCATCAGATCAGTTGATGATCTCGACAATATCCACATCCACTTCCCGGCTCATCAGGTGCTTTTCCACCTCACCGGTCAGCTTCACCTTGGTCTTGTCGTTGAACGGCGTCGGTGGCAAGTCTTCGTCGTCGATTTCAACAGTGATGGTGCCGGTGTTGTCCTTGAACTCGTACTTGTCGTCGTTGTTGATCTTCTTGGTCACATACCCCTGCAGCACCACAGGCGTGTCATCGGCGGCATCGTTGGCAGCCGCAACAGTGGTGACAGACTGGGCGCCAGGGCCGGTATAGGTTGCGGCCAGAGCAGCGGTGCTGAACAGAGGGGCAAGGATCAGGGCGAGGTAACGGGCTTTCAT
Protein sequences of DBSCAN-SWA_6 >NC_021505|5816548:5822873|5818929_5819238_-|WP_016502032.1|DBSCAN-SWA MRVIAKSSLVKFWELPGHEDAKAALESWHDMALKAIWRTPQDVKDQIGNASICGKNRVVFNIGGNKYRLVVEMQYRAGIAWVKFVGTHKRYDEIDVETVCEY >NC_021505|5816548:5822873|5818583_5818940_-|WP_016502031.1|DBSCAN-SWA MNIKPIRNDDDLRAAFQRLEAIFQAEAGTPEADEMEVLVTLIEVYENKHYPIHPANPIEAIKFCMDQQGLTPRDLEPYIGPSGRVSEVLNGKRGLSLSMIKRLHDGLRIPYESLLANA >NC_021505|5816548:5822873|5819306_5820080_-|WP_016502033.1|DBSCAN-SWA MRRVVFNQKGGVGKSSIACNLAAVSANEGYRTLLIDLDAQANSTQYLTGLTGEDIPMGIADFFKQSLSSGPFSKKNKVDIYETPFDNLHVVTATAELADLQPKLEAKHKINKLRKLLDELDEDYERIYIDTPPALNFYAVSALIAADRVLIPFDCDSFSRQALYGLLAEIEDLKEDHNEDLMVEGIVVNQFQSRASLPQQMLDELLAEGLPVLPVYLGSSVKMRESHHASLPLIHLEPRHKLTQQFVELHDLLERSE >NC_021505|5816548:5822873|5816548_5818009_+|WP_016502029.1|DBSCAN-SWA MKALDQLTFDNRFARLGDAFSTQVLPEPIADPRLVVASESAMALLELDPAQADLPVFAELFSGHKLWEEADPRAMVYSGHQFGSYNPRLGDGRGLLLAEVLNDKGEHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRALCVIGSSTAVWRETRESAAMLTRLAQSHVRFGHFEYFYYTKQPEQQRVLIDHVLEQHYPECRDAEQPYLAMFHTIVERNAELIARWQAYGFCHGVMNTDNMSILGITFDFGPYAFLDDFDANFICNHSDDRGRYSYANQVPIAHWNLSALAQALTTVIEVEPLKEALGLFLPLYQVHYLDLMRRRLGLTTAEDDDMALVERLLQCMQRGGVDYSLFFRKLGEQPVADALKVVRDDFIDLAGFDAWGADYLARCEREPGNAEGRRERMHAVNPLYVLRNYLAQKAIEAAEAGDYSEVRQLHQVLTRPFEEQPGMQAYAERPPEWGKHLEISCSS >NC_021505|5816548:5822873|5818145_5818580_+|WP_016502030.1|DBSCAN-SWA MSDPLVIPCPHCNGLNRLPAERLGDAPKCGRCKQEVMLSTPFELTEASYASQIKGDLPLLVDIWADWCGPCKSFAPTFEQAARQLAGRCRLAKLDSEANRNLAGQLGIRSIPSLLLFKNGREISRQAGAFPLQSLLEWVRSQGV >NC_021505|5816548:5822873|5822525_5822873_-|WP_016502036.1|DBSCAN-SWA MKARYLALILAPLFSTAALAATYTGPGAQSVTTVAAANDAADDTPVVLQGYVTKKINNDDKYEFKDNTGTITVEIDDEDLPPTPFNDKTKVKLTGEVEKHLMSREVDVDIVEIIN >NC_021505|5816548:5822873|5821164_5822409_+|WP_041167928.1|DBSCAN-SWA 
MKTAVAPLPAEPEPALLNEMWIEKGTPAFMKTVLALFSGGFATFALLYCVQPMMPLLSKEFSINAAQSSLVLSVSTAMLAFGLLITGPISDRIGRKPVMVFALVCAALSTLASAVMPSWELVLATRALVGLSLSGLAAVAMTYLSEEIHPQHIGLAMGLYIGGNAIGGMSGRLITGVLIDFVSWHTAMLTIGGLALVAALVFWKVLPESRNFRPQMMSPRSLLDGFVMHFKDAGLPWLFLEAFLLMGAFVTLFNYIGYRLLAEPYHMNQALVGLLSVVYLSGIYSSAQVGALADKLGRRKVFWASIVVMAGGLLMTLASPLAMVIVGMLVFTFGFFGAHSVASSWIGRRALKAKGQASSLYLFSYYAGSSVAGTAGGVFWHQWGWNGIGLFIGSLLAVALLVALHLSKLPPKTA >NC_021505|5816548:5822873|5820156_5821056_-|WP_016502034.1|DBSCAN-SWA MELRHLRYFIAVAEELHFGRAAQQLGISQPPLSQQIQALEQELGARLFERTNRRVELSEAGRLFLEEARQVLAQVEKAADVARRAQLGELGEMKIGFTSSAPFTSKIPKAIHAFRQRFPAVHLNLKEMSSRDVAEGVFDESIEVGLMRPMPLPEGLVATELFREPLVAVINASHPLAESTEQGVHMAALAHEPFVFFPRSYGSGLHAQLLSLARQAGFSPHFAQEAGEAMTIIGLVSAGLGVSVLPASFQRMRIEGVVYRTLLDEGAMSAVWLVQRERGNSVMAKAFAELLTGQVVGQG |
8 | Microcystis_phage(16.67%) | NA | NA |
Homologous phage analysis in the prophage region. Bacterial proteins shown in color are those whose annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
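The keyword rule in the "Homologous phage analysis" notes of the prophage table can be illustrated with a short sketch. The keyword list is taken from this page. The matching rule (case-insensitive substring search) and the function name `matched_keywords` are assumptions made for illustration, since the page does not say how the tool actually compares annotations against keywords.

```python
# Keywords quoted in the prophage table notes on this page.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def matched_keywords(annotation):
    """Return the phage-related keywords found in a protein annotation.

    Assumption: a keyword counts as present when it occurs anywhere in
    the annotation, ignoring case. The real tool may use a stricter rule.
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in text]


if __name__ == "__main__":
    # Annotations copied from the neighbor-protein headers on this page.
    for name in ("IS110-family-transposase", "superoxide-dismutase-[Fe]"):
        hits = matched_keywords(name)
        print(name, "->", hits if hits else "no phage keyword")
```

Plain substring matching is deliberately simple and can over-match (for example, 'tail' would also match an annotation containing 'detail'), so a word-boundary check may be preferable if the annotations are free text.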
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|