Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
CP022040 | Prevotella melaninogenica strain FDAARGOS_306 chromosome 1, complete sequence | 4 crisprs | WYL,DEDDh,PD-DExK | 0 | 0 | 2 | 0 |
CP022041 | Prevotella melaninogenica strain FDAARGOS_306 chromosome 2, complete sequence | 12 crisprs | PrimPol,csa3,DEDDh,cas3 | 2 | 2 | 3 | 2 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP022041_1 | 171268-171358 | Orphan |
NA
Consensus repeat of CP022041_1
|
1 spacers
spacers of CP022041_1
>1.1|171293|41|CP022041|CRISPRCasFinder GTGAACAAGTTGCTTGAGAAGAAGACAAAGGAACTTTGAAA |
CRISPR arrays and Neighbor proteins around CP022041_1
The CRISPR arrays of CP022041_1 >merge|CP022041|1|171268-171358|CRISPRCasFinder AGTTGACGAGTAAACGAGTTGACAAGTGAACAAGTTGCTTGAGAAGAAGACAAAGGAACTTTGAAAAATTAACAAGTGAACAAGTTGACAA >CP022041|1|1|171268-171358|CRISPRCasFinder AGTTGACGAGTAAACGAGTTGACAA GTGAACAAGTTGCTTGAGAAGAAGACAAAGGAACTTTGAAA AATTAACAAGTGAACAAGTTGACAA
>CP022041.2|ASE18061.1|169777_170209_-|DoxX-family-protein MKCLELLFPKAESTKTSLILLASRLVFGLTFASHGLDKLQHFSETAAHFPAPFGLSGEVAVGLSIFGELVCGLAFVFGFLTRLALLPMIFTMLVAFTTVHGGSVSNGELAFLYLVIFVLSWFAGAGKFSVDGIIRSKISGGNS >CP022041.2|ASE18060.1|166535_167960_+|alanine:cation-symporter-family-protein MIELFSADGWLNQAIVSVNYFTWTYILVAGLVICALWFTWRTRFVQFRMIGEMVRLLGDSTGTHDEGEKHVSSFQAFAVSIASRVGTGNLAGVATAIAIGGPGAVFWMWVIALLGSATAFIESTLAQLYKRRHADSFIGGPAYYILHGMHCKWMAKLFAVLITMTFCMAYISIQSNTICGAMQKAFSIDPTWMGGALAILSLAIVFGGIQRIAKVSSVLVPLMAVSYVLLALVIIVMNIQLIPHVFRLIVENAFGFEQVAGGGLGATMMNGIKRGLFSNEAGEGSAPNIAATASTTHPVKQGLIQSLGVFTDTLLVCSCTAFIIIISGLYVNNSESGILLTQVALESEVGAAGPIFIAIAIFFFAFSSIIGNYYYGEANVRFLTQKPSAILALRVITGGVMVMFGAIASLDLVWSIGDFFMALITICNLIAILTLGKYAFRLLDDYRQQKRAGVKSPVFKRETMPDIAKDIECW >CP022041.2|ASE18059.1|165475_166393_-|YitT-family-protein MVRKRKNKFRDVREFLMIALAMLIGSFGWCAFLLPHHITIGGIAGIASVIQWGLDIPVQYTYLTINGILLFVALKILGWKFCVRTIFAVLVFAFSTSVLREVFAGHPLFSDEPFLACVVGGVLLGVGVSIALQYNASSGGSDVIAAMIHKYRDVSLGRVILACDLCIISSSYLVLENWEKVIYGYIVLFVMTYVVDYLINGMRGSVQFFVISEHWGEIGSAINNDVDRGCTVIEARGFYTGKKVGMLFVIARRSEAHSIYQVIDEIDPNAFVSQGAVNGVYGMGFDRMKVAHKKKTADEKVRTKE >CP022041.2|ASE18058.1|164370_165453_-|tRNA-2-thiouridine(34)-synthase-MnmA MNIQELEGKRIAVLLSGGVDSSVVVYEFARLGLHPDCFYIKIGPEEKEDWDCNSEEDLEMATLVTRRFGCKLEVIDCHKEYWDQVTRYTMEKVKAGFTPNPDVMCNRLIKFGAFDEKMGHDYDLIATGHYAQTEWIDGRKWLTTSPDPVKDQTDFLAQIYDWQLKKAIFPIGHYEKNEVREIAERENLINAHRKDSQGICFLGNIDYNEYVRRYLGEEIGDVIELETGKKIGEHKGLWFHTIGQRKGLGLGGGPWFVIKKDVTKNILYVSHGYDPATAYKKDFPLHDFHFLTEGITTLPEKITFKIRHTPEYHPATVEQLSDGRCIIHSTENIHGVAPGQFCVVYDEQHHRCFGSGEITL >CP022041.2|ASE18057.1|161450_163052_-|hypothetical-protein MKQVYYFIFLMLLAFSSVNVKADTTVRVKVDDINRVSVKVNYLPVVNLANGTNEITVPQYGALTIEAKQGYYLKSVLKTIDNDNSAPQTINNLTSCNIYVSDADNNKLFTVKSADLAKARTGSCTVNVDNASKVRVSRNESRTSVELQNGENTVKWIPNTEKTLVITNANYGDAPIYKVTLDGNDVTPSSGQYFVTLTAGCVVDIKADYPNVSYPVKFNFTDEKAKGVISKVMADGEEVKNYNDADFKLKAGTKLSLTFDQRNYALDAFKVNGTATTIYGTYECYVKDNLVFDIQAHKYATVKAVLTVDKAANITAYEGQSYNNKVITLQNGSNNIELGEKNNLIQIKPNSGCKIESIKANGTPVTANYEGAYEIRLTDGMTIEVTTSAIVRDQKATVFIDDISLANYGFNFYRSDHSTIKMQTGENTVMFSADDHHFMLGAYGNDLSKMVVKLNGTILAPSYPGGTSFEFDLKNNDRLDVLLKGDTAGIDAIEAVKQGKAVVYRLDGKRIEGTQLPNGVYIINGKKVIVNKR >CP022041.2|ASE18056.1|159078_161454_-|T9SS-C-terminal-target-domain-containing-protein MRKILLTVFCLCSVGLAYGQSKLDLQSQLELFKLRNTSIPTYNSRTRSFERPKSVPENTMAMVEMKDQNDRADLEAQGVKVLRVRGNIAIVVAPIKDIERIAGLKCVRRMELPRRVYQKMDVVRKEIGVDKIHKGIDLPQAYTGKGVVTGIVDGGIDPNHINFLKPDGSTRFGYISKITASQSNKDGYQFDNYYPRVVLDTLTNRDNAYAIEDFTTDSYTTFHGTHTTGIMAGGYKGNINYAKTNDNDRSYKVTGPNPFYGCATESELVASCGDLRDQYIAFGVDDVVQYAKLSGKKPKPCVINLSLGSNIGVHDSTSVMNRFLAEEGKHAIICVAAGNEANMGIALKKNFTAADETVKTFLTPMQPDTLRSGGKTYFNLRNGQIAAYSNDSTEFELQIVVTNTKRGNRVVSRIPLPNNTNGQPITYASGGEYSMSGAVINQGFAKAFDGYVTAASAIDPETGRYYAMAQIMTSDNQKDNKDGNYKLALLITSKKPGQRVEVYSDAQFIYFDSNKQEGFVSGTRNGSISDMACAANIVTVGSYNVRNHWSSLDGFVYGYNKRGDEDDFPEGEASRFSSFGTLADGRNLPHVCAPGASIISSVNTYAVENTDLGYTDMALQGKLEKGGKKYYWHQSLGTSMATPVVAGAIALWLEANPSLTVKDVIRIIQQTARKDDYVTKTGDPVQWGAGKFDAYAGLKQVLKEKETNGINGVRYAESQAVPVITMTGERSFSAFLAGAKQLNLRAYSLSGQLVHSLSAQGDELNVNASSWNKGVYLIQVNGGKAQRIVIY >CP022041.2|ASE18055.1|155251_158962_-|T9SS-C-terminal-target-domain-containing-protein MKHKFTFLPSFLLLVMAMLWSSLAAHAQDEVLIRFHTNVPEKAKNSGVAAQVSFVLGSKSDKADVSIDYGSGDTEVEINKAIVENDHGEQAIKGTLCTDVVSDAGWVTIKGNPDDLYFFNASGNEIDQIQFNDNLKLQVLNLEHNNLKSLNIDRLQSLNIIYLQDNPFSATTPLMIGRMPNLMVLEVPQIGHISPDFTLKNFPNLRSFDAYHTISLKTADPTGCPYLQRLSLDMTSVESVDLSKNSLLQILNVGDSRVKTLDLSHNPEITQLYISHSSGAVNTDVKFETIDVSHCPKLYYFYCGGNNLKELDLRNNPELFTLSCDRNLLRNLDVSQNPKLYSVNVRYNYMDFATLPEPGNWFEYYHEQNPMELNDTYKVGDVIDLSKRVLRANTTTQARLYRVPKDDPTKPVELDDSYYSYEGGKVTLKKELNEEVFVQFTNSRLRDYPIRTENFRVKTVAEFGKDVKAIQLASLGDAGSPLKMSVGILGATAANPVTVKVDLGDGNTTPFEIKDERPATANIVTTRTGAGDIIVYVPQDKYVTSLESDGQYIDNIDLSALTELRTLTLKRANLTTIDLSYNNKLEKLDLSYNQLNRVDLRGPSSYFNKSKLTDINISHNQVDSLLFNTIYGVTKLNVSHNKLNKLDLKDADNLRMLDISYNKFTRLLLNHSELIEDINVANNELTEVKIPPVAPVKKLNVSGNYFTLANMPNDFGLTRGNFIYAPQNVLQISTSSPGIDLSEQYITKDGATTNFVWKKKDGTPLQLGKDYTITNGSAKFTNLALDSIYCEMTHAAYPDFEGKNVFKTTNVHPIAFPKYELASFTTVNQTDSVVLSLASYVPGKSVYFEWGGNGNVTQYTLGEKYKIFQAKSKANTKVRVLVAEADDKVKVFSVANVKMTDVDLTGLKEAKLISITQAGLTSVKLPAAPNLTDLNFDGNELTDIDLSPFPKLFAVSLIGNKIKNFDLSKAPNLGIAYLSSNKMKEIKLDNPKLESLDLSDNDLENVSLDKLPQLEQLWLNANKLTKVDVSKNTNLRVLNVVGNRLKFSTMPLPNNNGKRFDRYSYNLQAPIDVKCVNGKVDLSSEAVVGGEMTTYHWFIGNVTYRDGELQGEKLEVNDEYTVENGVTTLKLTQSITNLVCVMSNDNFPNALIYTNYIAFTPATGIDAVTADKDVKIQFFDGAISVLGAQNSTVAIYSIDGKLVYQGKVADDSTRISLARGTYIVRVGNKAAKISVK >CP022041.2|ASE18054.1|152977_154678_-|putative-transporter MDWLIDIFSAEKQDTVAHIMLLYSIVIALGIYLGKIKIGGISLGVTFVLFVGILAGHIKFTGPIPVLTFVQDFGLILFVFMIGLQVGPGFFESFGKGGLKLNILSTVAILLNVLVMFACYYIFFDTQDKTNLPMMVGTLYGAVTNTPGLGAANEALHSVFKNGMNFDIASGYACAYPLGVVGIISATIAIRYICKVNLQEENEKLNEEEAENPHAKPYTMYLKVQNAYIAGRKLEEISEFLNRDFVCTRLMHEGVLSVPTLDNIFELGDEILVVSAEADAAAIRAFIGPEIEVDWHEEDQPQQLVSRRIVITNSKINGKTLGDIHFRSVYGVNVTRISRQGMDLFAGRNHRFIVGDRIMVVGPEENVNRVASMMGNSEKRLNAPNIATIFVGIIVGIIFGTLPIAIPHMPVPMKLGLAGGPLVIAILIGRFGYRMGLVTYTTTSANMMLREIGLALFLASVGIKAGATFWDTVVQGDGLKYVYTGFIITIVPILIVGTIARLKYKFNYFTIMGMLAGTYTDPPALAYANSICAGEAPAVGYSTVYPLSMFLRIFLAQVIVLFFCQI >CP022041.2|ASE18053.1|148663_151972_-|SusC/RagA-family-TonB-linked-outer-membrane-protein MEKRITLFLFGLILSLGTAFGQAKINGTVVSQDDGEAVIGASVMVQGTTTGTVTDIDGHFTIDVPAGKKLVVSYIGMVTQTVTAKDGMKVVLSNDNHQLTEVVVTGMTQQDKRLFSGAATKIDASKAKLDGMADVSRSLEGRAAGVSVQNVSGTFGTAPKIRVRGNTSIFGSSKPLWVVDGVIMEDIANVDASSLSSGDAKTLISSAIAGLNADDIESFQILKDGSATSIYGARAMAGVIVVTTKKGKAGQSHLSYTGEYTLRLKPSYKNFNIMNSQDQMEVYRELEKKGYLNYAEIANASTSGVYGRMYQLISEYDATKGQFGLENTQKARDAYLRAAEYRNTDWFDQLFSSSIMHNHSVSASGGTDKGQYYASLSAMYDPGWYKSSRVQRYTGNINTTYNINKKVGLNLIANASYRKQRAPGSLSRTIDPSTGAVSRAFDINPYSYSLNTSRTLDPTESYTRNYAPFNIFNELNNNYMDLNVHDFRIQTSIDYKPITKLKLTALVAVKSAASTMEHVVRENSNQALAYRAMGTTTIRDANPYLYTDPDNPFALPESILKEGGILDRTGNHFFGWDTRLSAAYNDVFNDTHIVNLYAGVESNSVDRKATFDREWGMMFDAGEIAKLNYRAFKMFQEKGSDYYGLSNTHIRSLAYFGTGTYSYKGRYQMTGTFRYEGTNYLGKATSARWLPTYNVSGAWNAHEEEWFNKVFKQALTHATFRLSYSLTGDRPPVTNSLPIFTATVPWRPFTSIQETGYDEQFGNKNLTYEKKREFNLGFDFGFLDNRINLTTDFYWRRNSDLIGYVNHPQLGSYNLANVASMKSNGMELSLNTHNIKTKDFSWESNFIFSWTHNEITSLFTHARVIDLVQGTGYSLVGYPVNSIFSIPFAGLDGEGLPTFMNEDGRRTISGLNLQERDQEKIKYLKFEGPADPTTTGSFGNVFRYKNWDLNVFVTYSFGNKVRLNHIFSNRYDDMDALPKEFRNRWTYSGDETLTTIPVIASRRQNRNNSQLDVAYNAYNYSDVRIAKGDFIRMKEISLGYTFPAAMIRTIGVSSLALKLQATNLFLFYSDKKLNGQDPEFFNTGGVAAPVPKQFTMTLRLGL >CP022041.2|ASE18052.1|146795_148508_-|RagB/SusD-family-nutrient-uptake-outer-membrane-protein MKIKNIIYKGSLMLASVAILASCSDQLDTLPDNRTTLDTPKKIAGLLVTAYPDRTPTLFNEWMSDNTDYMGAQNSQGNRGGDQYFFWQEQTEGGNDSPEQVWMLYYEGVYKANEALAAIEDQGGPKNDILRNSKGEALLIRAYDHFILANEFCRPYNGKTSTKDAGLYYATGIADFSAAAEQSNRGTVADVYAKIAADIEAGIPLLNDTYEVPKYHFNKQAAYAFATRFYLYYEKWEKAKEYADKLLGSNPAASLRDYRALQAMPLSKSEQAVKIAEAYCSASADCNLLVQTSVSNAGMALAPWLTSKRYTLTNYLAETELFQSNNIWGTSSNLIWKPFTVNSGESNFALLMKLPREFEIRNTTTGSGYLRTLNVDFTMDEALLNRAEAEIMLGQNDAACADMTIWMKNFFNTNVTLTPTSVQTYFKTVPYAYADAAKMVPSFKKHINPRFTIGAEGSVQESLLQCLLNFRRIETVHQGMRWMDIRRYNIEIPRRLIGANGRPSKNLDWLEKDDPRQVVQIPQSIREAGVAGNPTKALVAGAKLVDLSQYKLPVLSAVNQYSAPVSAHSF >CP022041.2|ASE18062.1|171447_173928_+|ferrous-iron-transport-protein-B MKLSELKTGETGVIVKVSGHGGFRKRIIEMGFIKGKTVEVLLNAPLQDPVKYKVMGYEVSLRHSEADQIEVLSDVKTHSVGNEEEQEDNQVEMDSTTDDSTDKELTPEKQSDAVRRKSHTINVALVGNPNCGKTSLFNFASGAHERVGNYSGVTVDAKVGRAEFDGYVFNLVDLPGTYSLSAYSPEELYVRKQLVDKTPDVVINVIDSSNLERNLYLTTQLIDMHIRMVCALNMFDETEQRGDHIDAQKLSELFGVPMIPTVFTNGRGVKELFRQIIAVYEGKEDESLQFRHIHINHGHEIENGIKEMQEHLKKYPELCHRYSTRYLAIKLLEHDKDVEQLVSPLGDSIEIFNHRDTAAARVKEETGNDSETAIMDAKYGFINGALKEANFSTGDKKDTYQTTHVIDHVLTNKYFGFPIFFLVLLVMFTATFVIGQYPMDWIEAGVGWLGEFISTNMPAGPVKDMIVEGVIGGVGAVIVFLPQILILYFFISYMEDCGYMSRAAFIMDRLMHKIGLHGKSFIPLIMGFGCNVPAVMATRTIESRRSRLITMLILPLMSCSARLPIYVMITGSFFALKYRSLAMLSLYIIGVLMAVAMSRLFSAFVVKGEDTPFVMELPPYRFPTWKAIGRHTWEKGKQYLKKMGGIILVASIIVWALGYFPLPDDPNMDNQARQEQSYIGRIGKAVEPVFRPQGFNWKLDVGLLSGMGAKEIVASTMGVLYSNDGSFSDDNGYSSETGKYSKLHNLITKDVATMHHISYEEAEPIATLTAFSFLLFVLLYFPCVATIAAIKGETGSWGWALFAAGYTTALAWIVSAVVFQVGMLFM >CP022041.2|ASE18063.1|174335_175280_-|type-IX-secretion-system-membrane-protein-PorP/SprF MKKIVFTLLLALFAVVIRAQESQTEYNFLRLPVSAHAAALGGENITIIEDDPSLMFSNPALASSVSDKTVGLSYMNYMRGAHYMGASYTKALGEKATLAGGVQYMNYGKMKEVDANNVQTGTFNASEIAVEGIFSYELARNLVGGITAKFITSYIGSYNSMAVGVDLGLNWYEPERQWSVSLVAKNLGGQIKAYEEEYGKMPIDVQVGVSKTFAALPVRVSATLVDLTHYDYRFINHLNLGAEVLLSESIWVGGGYNFRKADEMTIGKADNASAHGAGFSVGAGINLEQFKLNLAYGKYHAASNSILVNLAYSF >CP022041.2|ASE18889.1|175382_176219_+|energy-transducer-TonB MYNGRKRCLVYLILEIKKSNRADLENKRWVGFLLGIIVALSFFFVAMEYNATGSDDDSANTKAIKNVTLHDMDMLPAIDQQDLAKTQEDKKPTMEDLLNLKRRDIPNKVTPHDAGSMNSNDKKTGAPQVSNEPIVMPMVTTTTEPPKIKEEAKKEMEKMTDDNSDKVVERYDDKVSKRILSETPTPPGGWVEFMKWLTKTLQYPAAAKENKLQGTVNITFIINADGTVDDVRIKSGKVPVLNDEVLRVLKTMGKWKPGIEKNKPCRSLIEIPFVFQLA >CP022041.2|ASE18064.1|176284_177259_+|polyprenyl-synthetase-family-protein MYTANEILSKVNEYINNLTYDRKPQSLYEPIKYVLSLGGKRIRPTLMLLSYNLFKDDPETILSPACALETYHNYTLLHDDLMDDAPLRRGQQTVHVRWDANTAILSGDSMLVLAFERMAQCDSRHLSEVLRLFTVTALEIGEGQQYDMEFENRNDVKEEEYIEMIRLKTSVLLACAMKIGAILADAPAEDVENLYKFGEQIGLAFQLQDDYLDVYGDPKVFGKKIGGDIICNKKTYMLINAFNKANARQCKELEKWIGCENFNHEEKVAAVTELYNSIGVDKMAIERINYYFDEANKYIAAVNLPDERKAELLAYAQKMLHRKW >CP022041.2|ASE18065.1|177512_178328_+|TatD-family-deoxyribonuclease MIIDTHAHLDVEDFADDLPEVISHAHEAGVGKIFLPAIDLKSVDTVLAVCRQFPDTCYPMIGLQPEEVRDDWREVLDAMHERILLSLRQKAEGTAKPGETVIAIGEVGLDFYWTREYEKQQLAAFEEAVKWSVETRLPLMIHCRKAQNEMLHIMRPYEKELPGGVFHCFTGNQKEAEEFLRFDRFVLGVGGVSTFKSSHLREDLPAAVPLDRIVLETDSPYMAPVPHRGKRNESAFIVEVMRTLALSYGVDEAEFARQTNENVRRVFGVGC >CP022041.2|ASE18066.1|178474_179167_+|(d)CMP-kinase MKKITIAIDGFSSCGKSTMAKDLAKEIGYIYVDTGAMYRSVTLYALRHNLFNTDGTIREEELQAQMKDINISFQLNKETGRPDTYLNGENVENEIRTMEVSSHVSPIATLAFVRKALVEQQQRMGAEKGIVMDGRDIGTVVFPNAELKIFVTASAEVRAQRRYDELKAKGMEADFADILKNVQERDYIDSHRETSPLRKADDALELDNSQLTIAEQKQWLYNQYLKAAEA >CP022041.2|ASE18067.1|180163_180871_+|glycosyl-hydrolase-family-25 MKLKKIIMLIACIFSFAGLKAQYTIQCEDTCSHVHGLDMSHYQGDVWWETVAENSNHKLNYVYLKATEGGTRIDQRYLENIEAAQRYGMNVGSYHFYRPAIPQEEQLRNFRMQCRPQDQDLIPMVDIETTGGLSTEALRDSLQRFLVLMTQEYGVKPLVYTYTNFYNRYLSGALDGYKLFIAQYNGREPELNDGRDIFAWQYTGKGRINGVRGYVDKSRLMGNHSMRELRFRRKR >CP022041.2|ASE18068.1|180935_182321_+|zinc-ribbon-domain-containing-protein MIIKCPECGHQVSDKAPVCPSCGVEIAGHIIKCSHCGELYLKEESSCPNCHHTEHHVESSVTAAEHHTSEATNDSKVQEPVVLMSVDKGEVTGNDDVIIPVEETEEHETDNYDNKTQDTINEPNTEEEAVDADFIMDDNADEEVIANAEAIAEDEEESTPDKNNHLSLAVSLLIAAITAAVLLFLYNQGVGASKANNEQEAFAQAMSSSEPTVLKNYLKENPSASKAHRDSISARLKVLTTTTQNMQQSDNDLSVALTSNSKEVLQQFIAKYPDSKHRGELEAKIDEIDWAGAVAKNNENAYLGYKAQHPNGIHSKEADEKLKNILTPEMAEESAAAKVTDGERAKAVAAVRQLLQGINSKSTDKISGAVAPSLNFLGSGGATVKDIRRYMTDRLYQADVKTINWHLGSPTEVTKNSNEAGADIRLKIPATLDIDRKGGKSKRSYVISATIKNGRITHINW >CP022041.2|ASE18069.1|182719_183103_-|hypothetical-protein MKKKPTYKVPVSICAIVLVVGAIVLAVLMETREMAPPRKYEVDMSGEVIGIDNGPKIPVLTKEEEKDEEVKEEKKTEAPKKESSESEETADPENPVIVPNVPEGQTEPVVKAPVPEIKKPTIDQIEN >CP022041.2|ASE18070.1|183144_183627_-|hypothetical-protein MKKILLLLTVLLFAMGARAQKDIVSMADAIKIFQAKTLQVGKQVLEKQGYSYKGVSSDEFGKDYNWVKNMNLTSDFLPTAMGRGNSSMVLLAQNGKTVYVYVFNRTAFAGLQAQVKAMGYDMGNAVKGDKTTLICTKDNQPTISFLTLQQPLPYCVQITE |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP022041_2 | 298657-298793 | Orphan |
NA
Consensus repeat of CP022041_2
|
1 spacers
spacers of CP022041_2
>2.1|298707|37|CP022041|CRISPRCasFinder CTTTCCTCCTTAACTCCACATCAATCCTATGAGTCAG |
CRISPR arrays and Neighbor proteins around CP022041_2
The CRISPR arrays of CP022041_2 >merge|CP022041|2|298657-298793|CRISPRCasFinder CGGTGTTGTCTTAACTCCTTTAACTCCATTTAATTCCTTAACTCCACATCCTTTCCTCCTTAACTCCACATCAATCCTATGAGTCAGCGGTGTTGTCTTAACTCCTTTAACTCCATTTAACTCCTTAACTCCACATC >CP022041|2|2|298657-298793|CRISPRCasFinder CGGTGTTGTCTTAACTCCTTTAACTCCATTTAATTCCTTAACTCCACATC CTTTCCTCCTTAACTCCACATCAATCCTATGAGTCAG CGGTGTTGTCTTAACTCCTTTAACTCCATTTAACTCCTTAACTCCACATC
>CP022041.2|ASE18150.1|292713_294258_+|hypothetical-protein MRKQTFKVRLLSLIALCGIVMVFASCSNEDVAQSTVETNTEKNSNLTSFVTGAAATRTSLNYDNGAFFWEAGDHIYVKDDNGTWQKSRNAPTEKTASFKFMVPGKYTNSTTYKVYYPGKNGLNNNVTISGTQSQAQPNSTTHIGEAGDCGTADATRNGGVFNFRLDHQAAILVFQPFTNNTAVKNCYLTKIEVESDNDITDTYTLDTTTGELTGTGSGKTITMTTKGSGSYANGFSLNTSTADVKTNGAYVVIRPGTHALTVRYWIKDYVTNVEGAITKTYDSFKYEKNDYYDMTADINVTDYDGSKYYMWDAQKNYWNGHEWNSANPWQPVLANKSNPNYPVSGSSYFHRTGGFGREDAMNSSCKNLPNANELGWYVKNGDPRWDDDKLWTAMGHLYKGGIWLRNKAYINLITSFNSDQGPSYLDMRDNYARFEVTPIQGAPVGYFASRYFFLPALGKYSDGYLSKIGERGFYWSSSASPVDETWAYSLEFSKSQVIVSPTSSNDGYYIGTFE >CP022041.2|ASE18149.1|292435_292660_+|hypothetical-protein MKQNKIKIGYQAPYTEVIVINNETLLAENSFPGQHRPGNHKKGPNANNAKQALEWLEVGDEASSSNEHTSLWED >CP022041.2|ASE18148.1|289752_291369_-|ABC-F-family-ATPase MITLSNLAIQFGKRVLYKDVNLKFTPGNIYGVIGANGAGKSTLLRAISGDLEPNKGTVELGPGERLSVLEQDHFKYDEYKVMDTVLMGHDALWQNMKEREELYAKPEMTEEDGNRAADLELKFAEMNGWEAESNAAQLLQNLGVKEELHEKQMSQLSNTEKVRVMLAKALFGKPENLLLDEPTNDLDLETVEWLEDYLGEIDEHQTVLVVSHDRHFLDSVSTQTIDIDFGKVTVFAGNYSFWYESSQLALRQAQNQKMKAEEKKKQLEEFIRRFSANVAKSKQTTSRKKMLERLNVEEIRPSSRKYPGIIFSMEREPGNQILEVENLKAVDEDGTVLFDNVNFNIEKGQKVVFLSRNSKAMTALFEIINGNREPDAGTYNWGVTITTAYLPLDNTEFFDCDLNLVDWLSQFGPGNEVAMKGFLGRMLFKQEEVEKKVNVLSGGEKMRCMIARMQLQNANCLILDTPTNHLDLESIQAFNNNLIGFKGNILFSSHDHEFINTVADRIIELTPKGTIDKLMSYDDYIHDEQIKEQKAGMY >CP022041.2|ASE18147.1|289111_289690_-|nucleotide-exchange-factor-GrpE MSKKEKNIKIEGEELELNNEETTQNYAEAQAEDANGEETPAEEELDPLVAAQNDAEQWKDKYIRLVAEFENYKKRTLKEKSELILNGSEKTVAAILPILDDFERATADKTEDPQAIKEGYELIYKKFLKALETLGVNKIETDNADFDVDYHEAIAMVPGMGDDKKGKVIDCVQTGYTLNDKVIRHAKVAVGQ >CP022041.2|ASE18146.1|287933_289097_-|molecular-chaperone-DnaJ MAKRDYYEVLGVSKNASEDEIKKAYRKLAIKYHPDRNPDDPEAEAKFKEAAEAYDVLHDPQKRQQYDQFGFDAPGGGFGGGGPFGGGGGFSMDDIFSMFGDVFGGHGGGFGGFGGGGHQAPKYRGSDLRLKVRLSLQEVATGVTKKFKVRKDIPCEHCHGTGAEEGSGTETCQNCHGSGVEIRTQQSIFGMMQTQTTCHVCNGEGTIIKNKCTHCHGEGVVKGEEVVEINIPAGVAEGMVVNVPGKGNAGRHNGVAGNIQVYIEEEPNDTFIRDGQNVIYNLLLDFPTAALGGQVDIPTIDGSNVKIKIEPGTQPGKTLRLRGKGLPAVQGYGSGTGDLVVHISIYVPKELNKEEKKIIEDLRQSENFRGDNSTKRSIFENFKKLFS >CP022041.2|ASE18145.1|286398_287691_-|bifunctional-folylpolyglutamate-synthase/dihydrofolate-synthase MTYQETVEYLFNSTPVFEHVGASAYKEGLETTKALDEHFGHPHTHFLSIHVAGTNGKGSCSHTLAAILQAEGYKVGLYTSPHLVDFRERIRVNGKMISEQEVIDFVEQERNFFEPLHPSFFELTTALAFKHFAEQKVDIAIIEVGLGGRLDCTNIITPILSIITNISLDHTQFLGNTLGAIAAEKAGIIKHRVPVVIGESVPETQIVFKAKAQKEDALIVFAEDIPAILSSRPNPDGGIAYQTRFFGDIVGELGGSYQEKNANTVLTAVMQLYNKGVIKNAESIAKGFANVCELTGLMGRWQKLQANPLVICDTGHNVGGWTYLSQQIKRQQCKQKRIVFGMVDDKDLHAVMSMLPDDAIYYWTQPSTHRAFPAEKVAATADEYDLHGMVFPTVLDAYQAALHDAAQSDFIFVGGSSYVVADLLTSLQKK >CP022041.2|ASE18144.1|285410_286022_-|hypothetical-protein MTIQEQIKQRIDEAEPGTVFFVNDFAEFDNEYVSKLLSTMKGFGVLERLAKGIYYKPIVTNYGNVYPSAEKIVKLIAEHENAEILPTGEYALNALGLSTQVPMKAIYLTTGSPRTITIGNKKIRLKHRTPSTYSYHSKLMPLLVLALKAKGQSNITSADTNRIEKLISDSTEKQQIKTDLATAPVWVRKYIKPIIEKYEHLAR >CP022041.2|ASE18143.1|284374_285430_-|nucleotidyl-transferase-AbiEii/AbiGii-toxin-family-protein MSIWQDKTIEERIAIVQNTALRTNIEDLAIEKDWWVTITLKALFSTSFSEFLLFKGGTSLSKGKWENIDLRRFSEDIDISLSRSWFTETEEKQKLYPFAKCENNNQLKSLRKASREVIFERLSPELNEQLAKLGAKDFYVENVKSIIQDGVEIPIDTDRDPVVLNVVYPSILDETNEYIQPKVKIEISCMSMDEPFENRALTSLIYDTFNEVDNATQCFVPTVLPIRTFLEKALLLNEEYQKKSPRSERMSRHLYDLERLMDTYSETAINDSELYKSIIEHRKKFYHISSVDYDSDKRENIKIWPTGEIENLFRDDYKAMIESFIYNENPLTFDQLRERILLLEDKFRENT >CP022041.2|ASE18142.1|283027_284158_-|leucine-rich-repeat-domain-containing-protein MKQIYILLIALLMGLSANAETSGSCGPNLIWTLTGKGVLTISGKGKMYDYSNNNRAPWEGYGKVKRIKIGDGVTTIGDYAFYDCGMPTSVTIPNGVTKIGDYAFYNCTSLASVTIPNSVTTIGDYAFSHCIPLTSITIPNSVTKIGKYTFDYCSNLTSITIPNSVTEIGDYAFTYCLALTSVTIPNSVTKIGEYTFYECSHLTSVTIPNSVTEIGTSAFEDCRSLTSVTIPNSVTEIEEETFKNCYNLQKVNIGNSVKTIGVSAFENCTNITQISSEAVVPPTCESNAFFRIYKSECKLIVPKNSLDAYKQAPQWKDFLLIEESTTGITNTVYNNSGLADVYTIDGTKRLSKASTDEINALPKGVYIVNGKKIIIK >CP022041.2|ASE18141.1|281468_282737_-|leucine-rich-repeat-domain-containing-protein MKQIYILLIALLMGLSANAEESGTCGPNLKWHLTNDGVLTITGKGKMYDYSVPYNSAPWRYFGVKRIIVGDSVTRIGEYAFSDCSSLTSITIPNSVTTIKEYAFSNCSSLTSVTIPNSVTTIGDNAFNGCSSLTSVNIPNSVTTIGGWAFSDCSSITSVTIPNSVTTIREYTFDNCSSLTSVTIPNSVTTIGGWAFSGCGSLTSVTIPNSVTTIGGWAFSGCGSLTSVTIPNSVTTIGGWAFSICSSLPSVTIPNSVTTIGDNAFMGCSSLTSVTIPNSVTRIGSEAFSDCTNLQKVNIGNSVKTIGEFAFNKCTNITQISSEAVVPPTCESGVFFYVNTSKCKLIVPKNSLDAYKQAYQWEDFSLIEGSTTGITNTVNNKAGLVDVYTIDGAKRLSKASTDEINALPKGVYIVNGKKIIIK >CP022041.2|ASE18153.1|298813_300037_-|MFS-transporter MMKQKTNYLFPLAIIGLFFFSIGFALGINSYLMPVLEKSMHISGAASSLLLAATFIPFLLFGIPATHCIKAIGYKRTMALSFAIFAAAFGLFILAAKQNSLTWFLIASFVSGAANAVLQASVNPYVTILGPMDSAARRISCMGISNKLAWPVTTLFITLVIGKGIGDTQLTDLYMPFTIIIGIFLLLGVIALMAPLPDVKAAGEDESECGDEAAVSSYADGKTSILQFPHLLLGCLALFLYVGVETISLATATGYAQSLGLEGDNYGFIPSVGMIVGYICGVIFIPRYLSQAAAMRICAIIALVGSVAVAVVPDPVISVYCIFLMALGCSLMWPALWPLAMADLGKFTKSGASLLTMAIAGGAVMPWLRGVVQDCTSFQTSYWVSVPCFLFILYYGLAGYKIRTKKE >CP022041.2|ASE18154.2|300385_303865_-|beta-galactosidase MWTQSIANPNQQIYITKKGSGYKLSAVSARNGQTYYVTFGSISSVDGYYCGYENSEASAATLQFKEVPAVVIPEGADWENAKVYERNKERAHATYMPYPSTKAMKADGQRYDKPWLDPTGANYLSLNGTWKLRWSEGAKPVLLGKDDFWGDGVSTEGSAWNDITVPSCLEMNGYGLPMYVNVDYPFEDQQPYVRMKAGLKNSVGSYRRDFTLPAGWENKRVFLHFDGIYSAAYVYVNGNEVGYTEGANNVSEFDITKYLRTGKNNVAVQVIRWSDGSYLEGQDMWHMSGIHRDVYLVATPKTYLADHYIKATVTPGSTTVATGSAATSVDLTVCNRDKTAAKKTVTVTLFDPSGKEVKKLKSDFVFAAGDSLKTQTVDFGTLSNVKLWSAETPTLYTFTFSQSQDGKEEEAFSTKYGFRKIDLSKGYLEVNGRRTYLKGANTQDTDPLHGRSISTDLMLKDIAMMKQSNMNTVRTSHYPRHAKMMAMFDYYGLFVVDEADMELHKNWDGVKTIINNTDWTGAIVDRNVRNTLRDRNHPSVVFWSLGNESGSGLNIMAAYNAVKELDNRYIHYEGSTRDNAEGTDLHSVMYPAVDHSRGGTTGPVTSDANHPSTGKPYFMCEYAHAMGNAVGNLREYWEAMEGSLMGVGGCIWDWVDQSIYSYDAIKNNQLTKNGFPAYITGYDCPGPHQYNFVNNGLVNADRAWSAELDEVKRVYQWVGFNLNKDTHQVKLTNKYLDRNLNQFYLKWTLLADGKPVQDGIVKKLNCAAGGTETVDLKYNPTAFAGKELFLNIGLYTKEATNWCDRDYPVAEFQQQLAQRTEVLDKVDNTKADALHATKNSDGGYTYANGKQKVTFDGQGNITLWAYEGKDLFVQNEGPRFDRYRWIENDNPMEAYGNDPTDNGVKSQTATFQLSDDGKTATVNVTQNGNYGKATYKYTINANGTIDLASSFEAQGNGARRLGFSLNFPSDMSKVSYYARGPRASYIDRLDGEDFGIYETTVKDMYEPFAHPQSNGNRIGLRWLTLTNNEGNGVKVETSGDVAFSLTPWTEAELRTARHEWELPTSNRVVAHFDAIQQGLGNKSCGPGPLSKYEIQKGKTYSNIVRFIPFSETADDTANGISAVVNSATTMAQVYDLSGRRLPEPPAKGFYIQSGKVHAN >CP022041.2|ASE18155.1|304286_307778_-|hypothetical-protein MNICKRLCAAFLLSVVCVGQVFAQGYPRISTKGNEHWYYVKYLRSNNVLEDKGEKQKCLTAVPQVMNGGRQLWKVVAAENFGRNKKYQLVSKSGRTLYVNGTNPYSSRFMAATKATDITSFHIFQSMNTSFGTGAFELAPDSTNSHAMNQVGEIRAGQEIGLWDKGDINNVLTFVSKDDMEFPYYMPVISTAANPVYYYIQFQTGNWLLSAKGDKATCQTASLHNGNLDDMLWRVSEKDGKYSFVSKSGKILYISDSYVNAAKAHNVKDTLFTMVESNNSLGGFEIGKSTTGRNFFNMFQGAGEGRLISFWDLGDGGNVVRFVPAEALVPVSGITTFNPANKYTLWYTKPATNWMTSCLPIGNGQFGATLMGDVAIDDVQFNDKTLWSGKLGGLTSTAAYGYYLNFGNLYIRSRGMSKVTDYVRYLDINDAVAGVKYTMDGVAYSRTYFASNPDSCVVVRYTASQNGKINTTLTLKNQNGRNVSYTVDNNNQATITFDGQVARQDDHGATTPESYYCAARIVTDGGTITKNAKGIIEVNGANSMTVYLRGLTDFDPDAPTYVSGANLLAGRAAATVNDAQNKGYDALLAAHKADYKSLFDRCQLTLSDVKNNIPTPQLISSYRDNQHDNLFLEELYFNYGRYLLISSSRGVSLPANLQGIWNDNNTPAWHSDIHANINVQMNYWPAEPTNLSELHRPFLDYIYREACVKPTWRRFAQDMGHVNTGWTLPTENNIYGSGTTFANTYTVANAWYCQHLWQHYTYTMDKDFLRAKAFPAMKSAVDYWFKKLVKAADGTYECPNEWSPEHGPTENATAHSQQLVWDLFNNTRKAIKVLGDDVVSKAFRDSLATYFAKLDDGCHTEVNPADGQTYLREWKYSSQFNNPSKIGVNEYKAHRHISHLMGLYPCTQISEDADKTVFEAARQSLIARGDGHGTGWSLGHKINLNARAYEGQHCHNLIKRALQQTWDTGTNEAAGGIYENLWDAHAPYQIDGNFGYTAGVAEMLLQSHNDKLVILPALPTTFWQKGSVKGLKAVGNFTVDIDWAAAKATKVQIVSNMGTTCIVKYTNVAKDYKVTTADGKTVKAKRINDDEISFPTVKGGVYVIVSKTADAIAAIQKAQDGNIASVDYYSLNGTKTSQSQSRGVYIKQMKYSNGTTSTTKVIN >CP022041.2|ASE18156.1|308639_309038_+|pilus-assembly-protein-HicB MKKVTIIVEQASDGSYWCRTAEDIAGIGLNSCGDTVEQAKQDLIDCYQEAKEDLEEQGKTMPVVEFVYKYDLQSFFNYFSFLNVTEIAKRAGINPSLMRQYNSGIKNAGEKTYERLAACLDGIKAELQAASF >CP022041.2|ASE18157.1|309456_311157_-|hypothetical-protein MKKLFLFTLLCVITMTTQAQTNLSDYLTKRMPKRETRAVWLTTLASLDWPKNYARSEESIKLQKQELIDILDKYQKANINTVLLQARVRAATIYPSDIEPWDQCITGVEGRAPGYGYDPLSFAVEECHKRGMEIHAWIATIPVGAKNSLGCRTLMKKGFRIRNFSTGSYLDPADPSVAPYLASVCGEIVRKYDVDGINLDYIRYPDGWPRPSYRDGDTPDQRRSNITAIVRAIHDEVKAIKPWVKMSCSPIGKHADLSRYSSKNFNAHDRVSQEAQEWMRLGLMDQLYPMQYFRGDNYYPFVADWVENAYKREIVTGLGTYFLDPREGNWTLGDLTRQMYVSRDLGVGHAHFRSYFLTANKQGVYDFEKQFNATLSLPHKMQGVVSTAAMPYAVNSSLVERREDKSVILRWKAVTPYYNIYASYTYPVDTEDARNLLFARYTGQTLQLKNVNPNLYFAVRGMDRYGLETPALQENMKSTSLSKSPATLLANDGNTLTLPAAAKLTDADRYVILSLQGVILRIVNAKSVRNNQLYIGSLSDGMYSLKVYNHKKKSFTLGAFMVRRGS >CP022041.2|ASE18158.1|312099_312531_-|NUDIX-hydrolase MYTYNYPHPAVTADCLVFTRTDEGMKLLLIQRKNEPCKGKWAFPGGFMDIDETTIDAARRELKEETGLVVGELHRVGIFDAVDRDPRERIITVAYYTILDKPAEVSGLDDAAQAKWFSLTELPDLAFDHKEILQEAERVLGDG >CP022041.2|ASE18895.1|312547_312760_-|DUF3791-domain-containing-protein MDQKTLEFVTYCIGKLSVMLKLPQQEVYRRLKTSGILDEYIVPSYDVLHTFGSRYLMEDLTEYMKEKGVL >CP022041.2|ASE18159.1|312939_313752_+|GSCFA-family-protein MEFRTIVNIPRPTFELEPCERILFVGSCFADNIGKRFEEEKFRAMVNPFGVMYNPVSVLHTVKKVANHTFDTAVFTLGTNHVYVERATGEIVDNCQKRPQREFEERELTVEECADALREAITLLRQANPKVNVIITVSPIRYAKYGYHGSQLSKAVLLLATDKVIKEEGERIYYFPAYEIVNDELRDYRFYKADMLHPNEQAVEYIWEQLVATCFSAEAKQFLEEWRPIKEALAHRPFHPEAAAYQDFIKKTKEKAKMLELKYPNIELNL >CP022041.2|ASE18896.1|313763_314195_+|transcriptional-repressor MNDKQIEALLKAHGIRLTANRILIARTLSGLDNPASIKELEAKIQTIDKSNIFRTLSLFKQQHLVHQMEDGNDIVRYELCLSDDDEEDEDMHVHFYCERCHRTYCLNDIHIPQVELPAGYEQSSINYMIKGVCPKCAHRYYIK >CP022041.2|ASE18160.1|314278_316759_+|bifunctional-UDP-N-acetylmuramoyl-tripeptide:D-alanyl-D-alanine-ligase/alanine-racemase MNYTIEKVTTLIGARRYGDKDANISFVLTDSRSLCFPEETLFFALKTERNDGQNYIPELYARGVRNFVVEVVPEDWATRYPDSNFLKVVGSLEALQRLAERHRDEYLIPIVGITGSNGKTMVKEWLYQLLSPQMVVTRSPRSYNSQIGVPLSVLLLNENTQVGVFEAGISQPGEMMALRDIIQPTIGVFTTLGTAHQENFPSLEAKCHEKIKLFHDTEAIVYSADNEVMAQCLSQYDYKGQKLDWSVKNTEAAFYIKAIEKKDIETTVSYVWKGQTEGQYKLPFIDDASVENSITCAVVSLHLGLTPATISERMAQLEPVAMRLEVKEGQHGCTLINDSYNSDFNSLDIALDFMNRRPDHKGRRRTLILSDILQSGDTDKDLYNKVAFLCEKRGVEKFIGIGEGLLAQRSAFKHLGEKHFFATVNSFIHSDVFANLHDEVILLKGARQFGFDRLTELLVKKVHETVLEVNLNAVVDNLNWYRSFLKPTTKLVCMIKADAYGAGAVEIAKTLQDHRVDYLAVAVADEGVTLRKNGITSNIMIMNPEMTSFKTLFDYDLEPEVYSFRLMDALVKAAQKEGITGFPVHIKLDTGMHRLGFDPQKDMDELIKRLKQQNAIIPRSVFSHFVGSDADNFDEFSAHQFALFDEGSKKLQAAFSHKIIRHMDNSSGIEHFPERQMDMCRLGLGLYGINPRTNKTINNISTLKTTILQLRNVPAGDTVGYSRKGTIDRDSVIAAIPIGYADGLNRHLGNRHCYCLVNGQKAEYVGNICMDVAMIDVTGIDCKEGDSVEIFGDHLPVTVLSDTLDTIPYEVLTTISNRVKRVYFQD |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP022041_3 | 351616-351726 | Orphan |
NA
Consensus repeat of CP022041_3
|
1 spacers
spacers of CP022041_3
>3.1|351643|57|CP022041|CRISPRCasFinder ACTTATTGCTATATAATTCGTGTCATTCGCGTAATTCGCAGTCAACTTTTTGATTAG |
CRISPR arrays and Neighbor proteins around CP022041_3
The CRISPR arrays of CP022041_3 >merge|CP022041|3|351616-351726|CRISPRCasFinder AACTACGAATTTCACTAATTACGCTAAACTTATTGCTATATAATTCGTGTCATTCGCGTAATTCGCAGTCAACTTTTTGATTAGAACTACGAATTTCGCTAATTACGCTAA >CP022041|3|3|351616-351726|CRISPRCasFinder AACTACGAATTTCACTAATTACGCTAA ACTTATTGCTATATAATTCGTGTCATTCGCGTAATTCGCAGTCAACTTTTTGATTAG AACTACGAATTTCGCTAATTACGCTAA
>CP022041.2|ASE18178.1|345764_349142_-|SusC/RagA-family-TonB-linked-outer-membrane-protein MKIFKESKQKHLYFSAAFALSLALAPTGVYAGTNTSTAVQAVQQNGNHKVTGRVVDSTGEPLIGATILVEGTTNGTVTDIDGNYTLNTTANAKLVFSYIGYAAQTIPVGGKGTIDVTLKEEANTMNEVVVTAMGIMRKEKSLTYATQQVKAEDLMKVQDPNAANSLEGKVAGITITPSAGGAGGASKIVLRGNRSILGNSSPLIVVDGVPMSNGIRGQQGMGAEGFGSTGTSEGSDPLSLINPDDIESINVLKGANAAALYGSRAANGVVMITTKRGREGKVDINVTSNITFDSPLLTPKIQKTYGAAYDKTTGALSLNNWGGKLADRADNDLVVRTPLDERWVGYPEEQIGTDASGNPIMARRHNVYLRNRAGNDVDNFFRTGVTTNNSISLSGGTDIARTYVSVANSHATGMMRNNSYNRNSISFRQTYNFFKRLHIDASMNYTESKTKNRPGGGTVGNPLYHLYTAPQNIDMDYYRDHYMNAEGKWLSNPGSYYKLNGSNFAWAAGQRTTLTGPQQEWAYLSHPNNNPYWLINSGNSQQKESRLFGTLQANVDIYDGLTFQARVNYSQIRFKNHATRFATTFLPASMEDYGRLWDSDEKTTEFYTDYLLSYNKTFGDYSVSATAGYVGHTIKGESKGTDAVATYYDRLMRKLPTMVNYFETSAGGYGVTSTSKSSNWDRSYLFTAQLGWKETVYFDASYRRDWYRPFRYFKELGKIDTDNYGYFGVGANAIVSQLVKLPDWFNFLKYRVSYSSVGNSIPNKAYGAMSRNLQTGALSGNKLLDFSPVPEETGSFETGIESLFLNNRLSFDFTFYNTIVRHLYMELGSLGGNTELLNSAKVRNTGFETTVGYDFKFGKDLRWRTSYNLSYNDNKILETGYDKDGTSRKYQQTVGEAKVIYQKGGAIGDIYAGDFMRDANGHIVLTAKGEPTFDKTGSNDRFLGNMNSKWQMGWTNTFNYKEFQLSFLINGRIGGKVLSLTESYLDYIGASERTEAARLSAERNNIVATNYGNVPGMELNDGSGRIVPIQSYYQALGASSNPSTYLYNGTNFRLRELSLGYTFRDLFGMNRNLTLSFIARNLFFIYKDAPTDPDVSLSTQNGLGAFESFNMPSSRSFGFSLKANF >CP022041.2|ASE18177.1|343775_345608_-|SusD/RagB-family-nutrient-binding-outer-membrane-lipoprotein MKSKHIIVLMAMALPTLGLQSCLDYDNPGDEFNSTTKNVEKVTSRGDVDKIPFRQATDAAAADEALNAMQDLLDAGVGGQFSMRGGKNGENPGPHAYQYQYSLGVDNYAEYTVIPHTFFQYSKIRLASSYAIDQKCYGGAWGSFTEMKTSLVPILNNEKVNAVPELKAAYLTLFNQQAVEVADVYGPMPYRELKTNLQVGPYTYNKVEDVYNDAVANLDTAIACFHYFDQKPAAYKQKIKSAFVDRFVVMTDGGAADGTLKAWARYANSLKLRIAMHMVKSDPVRAQKLAEEAVADGVIENEAQSVSIRPGVMGFSHPLPGVESWGDARMSATMEITLKTFNHPWLKYLFKKNDNVIKNNKTGETTPADSRICGIRTGTHPGEGQGYDENQYIAFSKLNEQYFNSAPLYLMKYAEVCFLRAEGALRGWNMGGSAQHFYEEGIRHGNCEDPEMKSMDGESPNGQQNVNWYDSWIDTYMAQENPVAYVYKDPTGDTPDAASPIHVGVKWNDSDSQETKLEKIITQKYLATYPNGFEAWVDLRRTGFPRMLPVLNIDEADGSLVPGDIMRRLPFPGTSDIATKQDVDNTGIPALGGPDKMATRLFWDKTTSNF >CP022041.2|ASE18176.1|340646_343676_-|T9SS-C-terminal-target-domain-containing-protein MNKKSTLSAWIIAAMMAMAPAGVTAQTYSSTASTQVFDLSKLGDQTLLEHFAELLDNGKKYPTDADLTAWGIKDEVEFIRSHVRKRAIESRADRLLQDTYENRNLFMNIPGGAGKNLGGYPSKTFANDNFSMWNYTNLFGAWNYGLFQAPGSWADAAHRNGTSIFAGIKFFDHTTGGAANSWASFIMKRNTDGSFRYTHPIINCMRFLGFDGINYNWESTNKYQDADNIAFHKELYKIAKSEGFNDFKIMYYTTSSRLTSYNSSYMWGQDKDNRICEVMLNYDNSDFSWSMGESVREAERTMGSADGLYAGVWIVSMDRRWNSLNNQDAKRCGICLWGEHAESRFWSYNTGGDAMSRMSNYQEYLERAFSGGNRNPLYRPEVSNRGNNVEAQGTTPPLARFAGLASWIPERTAISGNLPFATHFNTGNGERYNYKGKKTAGSWYNMSSQDVVPTYRWMVVKPETEVASTDVQPSFTNEDAYTGGAALRLKGVNNATATDVVLFKTNLTPSKGKVVAKVAIKTGKEGNNDSKLSLIVRVNGAWKAYALGNTENANWTEKKVELNDITAGQKIERIGLRVKDSDADYNVLVGKLELNDDVTATPANVKDLTVQVKEETKNSLSVKAVWGIDKDPGQNPTVYNDEANIDHFEILYKNGENGKVSEVGRTSQWATLVPNIQFTSVDDKPFIGVRSVSTDLKTYSKTKWIAVPRAQQSELPEAQEEGYGTVELDNAAAGAETARKIRYVQKFQTEGGSKNIDYTANGPAGNETNYVDATSQELEVAQGATVKVKIKGYEATQMKDQSNDDLRYCMGKAWMDFNGDKQFNPENLSENPNEGECVVFFGQVRKGVPAQVQQLNEYTFKVPEDAKPGQSRLRLVFCDAWFQGGLTPTGKFNKGFAIDFKVTITGSNAARGAKADTHDKGVADEPELLEGGSTNIISANVGGASQLTVVGGKVVFENVERAWVFSTDGQTVKSLVNPKSFNTNELPAGVYLVKMQNNNVIRTQKITIK >CP022041.2|ASE18175.1|336481_340219_-|T9SS-C-terminal-target-domain-containing-protein MRRINLLLFGCCLIPSTLMAQELKSNYIQWGFESQQFPGKLQSWSKSNPKINDDDNFFISRVKPKARFRYEGTQVRTDLTETNDKKLLAWLPWNVPSKNALPDGVFDSEVFSMWPYVTHWGDWNCGLGRIPAALLDAAHKNGVPVSSVAGIPYGGLSGAWSSALEQLATTDVQKAAAYMNYFGYDGLGYNSEYSEYFGGGRITRKLRDFHVDLNKAIRPTNPIYENIWYDGTNDKGRISFDHGLNEYNKSIFGAKGSEAANLFFNYNWNSSTLLQNTVEMAEEIERNPLDIYAGINMQGGEPRTGSRWTLLKNYPISIGLWGAHSQNMFFESRGEKGSDPETQQRTYMLRTERWFSSGSRNPVNSLQINNSLNYNADNTDFAGMSAMMTARSSMSWDLTQEPLITYFNLGNGKFFNYGGVRKNDRPWANVGVQDYLPTWRWWFASKLLGRTAVDVPATGLDAEFVWDDAYMGGSTVRVHGSTPKEYLHLFKTKYALKTGDVITFRYKVKGGKADASLVFATEDNVNAEKAYPVLTTADEADEDKWVEKTITVDGTLNGKTLALVALKMENAADLNLYLGEFSIVRGSFDAPAQPIDVKTTLLHAAKNGVDAKIIFNMPNTKGQGEPCYNTDVKTSLFKLWAKQEGKDPILMGVTTSWAGMFYSVPVDLKGQGKVKFGVSAVSLDMKKESAIAWGEDHEIFNSYEYSDEIKADKSVIKPNEAFTIAYVDPRHEAGNWKIEQNGATVASSNNANEIKVENGLSQTGFYDLVLTGAVNENGARVNKEVRYANYIQITSDAVGAVPHINKLTANNSETSIEVVANSEVTMKYEGKKADGSGSRGIKTLEKPVGVKVSELGLTSNNQAWTLAFWVKFNGFTGNTQIIDMRDPGTGWPQNNWGTMWTTYDPNTGVYEVTLREKNAGGAKEYKQRWEVDFVPGAWTHFVLAMEGNGTTTKPMVYINGKAAKAHNWEYDGHKGDGINPDGFANNAWWDNNVLGISLGRAGAAAINGTVDDVKFFNKALTAAEAAHTMMSTDANEAGLKAYWDFEADADASHYFTSKVGNAKLAHGELKAGEGEGVTTLVPDAPTYDAGSAFVSGTFQVKTTAEWTAKKATIVSQDGTDMAGTAKLKYAKAGDYEVTLTLKNAHGSDTRTFQVIKVKADPTGINGTETADMKVYAIDRDVLFDVETPGNYLVQVFSTNGQMVASKAVSVNGAESVRLHLGAQGVYVVNVKKDGKTLRTVKFICK >CP022041.2|ASE18174.1|332507_336281_-|T9SS-C-terminal-target-domain-containing-protein MKRITLFASLALASCLTVSAQRTPTHPLDIQDAKFENLPNYLEAWLKGEMKQPQGVSEIDDQFFISRVRPLERIKDGDYQVRQGVKRDRKMCLWTPLDDPTAQWKALPRYCFEGDNFSLWSYIDIHGNWTSPWIRSTAGLTDVAHKNGVSVGCVMSIGYGAYIYLNQWRPDTYSKTLYKLTKKEGGKFVYAAPLVRMMKYYGVNGIGFNSEFRTSSDVMATLTDFFVACHKEAEKINWKFEVHWYDGTGDDGSIHFDGGLGSHNQNIFGDKDHIATDMLFANYNWGPSHLRGSVSTAQRLGRSSFDYYAGFDIQGRGLRQPSWSALLNNDISIGFWGAHSQSLIHQSATDNGTSDIAIQKTYLKKQELMFSGGYNNPGLLPAINTDCNLANASLKNFHGLATFLSAKSTIQQVPFVSRFNLGNGLSFRKDGKVTFNHKWYNLNTQDYMPTWRWWITDGADQVNTGNINSLAKAELTFDDAYWGGSCLSISGQTDFSRVKLFKTMLEVQPDYEFSVTYKTIAGKDTHAKFFVALKDHVTEYKEVALPAVEAEGKWNTFTVKASELGLAAGDKVAMMGLVVENTPANYELRVGEMALRDNAKDFATATPTIKKVEILRGRYNACDFKMQYASKEESGWEKTYNDEVGTWYYEIYFQQKDQPQQLLTATTSWAAYVVDAPMVSGADKRDCRFGVRAVSPDGKKGSEIVWSDYQTVAYDQPLSDVVADRPVIKPNEEFTLKYLDEMVPAAQAWKLQDAVTGQVVAQGESGTSAKFTVAKEGTYDLVVVDNNGKESVVRGKVKITPEATGAVPQITDITADKTTEKVGQNVTYTYEGRLGEGKASRGLEITDPKMFRIPGDVQQGKSYSYALWFKADKFNHDKQGTNLINKNSIYDSWPHNNWGDLWVTIRPEWQGSRALHAANEISFNTMGWTAHDNPYEDVMTTGYSVTPGVWNHIVVTHTDGNIQKIYFNGRQVASHSFAASSRREDYGDRRINASKVADIFIGGGGVYKSGFNGIIDEVQVWSKPLSDDEVLRAMKGYKEGEVPADLKAYFTFEDVDGMKFKNIGSAGRTYDGSVVVVSGSGGENTSSAAYVDQQPNNNVLGFPGVVGTYEIKTVPTWALGDGQISSSADKTAVVTYAAPGKKNVTLKLKNGWGEAEKTVEEIVEITSEANAIDAVDAQLGFSVYPNPFVESVNMRFAENGRYTINVLGTTGALLQSNSFEAAQGQVVNVAITGTKGMYLVQVLKNGKVYKTVKVIKR >CP022041.2|ASE18173.1|331722_332238_-|hypothetical-protein MTTLPQSSYKTTTWKGGVTRQIFISPADGDLSARQFDVRISSAIIDDVQSVFSDFSGFTRYILPLEGEITLIKEGRRIVLSHNDLYEFEGDEKVSSENTQGAVDFNIIVRHGISVEVGIVEDAAFTDSRRTIVFALEDCCIEGKTVRKHDTALLDEPFSLKGKAVIARFME >CP022041.2|ASE18172.1|330792_331104_-|T9SS-C-terminal-target-domain-containing-protein MIKNIFTPFLFTLLLSVGFTAPVQARAAIDLIDLDVQTISISVVGNVLHVVGAENEQLAIYNVTGVRVMSVKVDGDDKHYTLNLPKGCYIVKVGNVVRKVSIR >CP022041.2|ASE18171.1|330198_330747_-|RNA-polymerase-sigma-factor MTPLDEAEVIRLLASFEGRQKVFPVIVDQYSQSLYWKIRSIVLTHEDADDVLQNTFLKAWKSLPTFQGKAKLSTWLYRIAINESLDFLRRQKAATLSSADADLSVANRLLADDYFDGDKSQALLQEAIATLPDVQRTVFTLRYYDEMKYSDISEILGTSEGSLKASYHIAVQKITDYVKRYE >CP022041.2|ASE18170.1|329738_330131_-|hypothetical-protein MESTNHLQKEYGTQRPFTVPENYFSELSSRVMAQIPAEEQKEAVVAVKPHRTMVHYLRPLAAAAMTIGVVLVGFLAYHEFDGEQGKHALAEGHLAQGAHETSASSEDEFDKAADYFMIDESDMYAYLASE >CP022041.2|ASE18169.1|329296_329719_-|hypothetical-protein MKHKRIILLFCIFFSVSLLHAQGKFDFNRIKAESHNFITKEAGLSGQEAARLFPVYDEMRGKQRVYFDKLRAIFSAKPSSEREASKTIEQADAYEIQLKQIEQRYHKEMLKVLPATKLLRVLEAERRFHRQTFRKMAGRR >CP022041.2|ASE18897.1|351907_352915_-|DUF1735-domain-containing-protein MKIKVFKSVLAASLVAATLASCQSEPEVGSTLYPTAEENYSAKAYLYTGTSDGNKLLLAGEKSASTVTLANDSAKFYVRLSSPAEKDVTVTLAATSDGVEANSSEEVMSTDAISLSKTSVTFAKGQQVSEPIVVKLVNGDALKNLAMLKNGVTSVVIKSVDGAETAKTNTKVLVTTNFTFNNINASGTLNADKQIALNEYQMSTTLSSSNAAKLNDGDNNTYVYSYTYYEPEFTMAFNSTKELIGVGILCNYTSYGYGVKKVAVATSLDGKTWTNMGTATAASTYDDDTPFPIVFNTPVTCKFVKLTILQSFDESENPRFLIGEIGAYEYLDWNL >CP022041.2|ASE18179.1|352934_354098_-|DUF1735-domain-containing-protein MMKYNISKLFIGAVAMASLTLLASCENAEYSPLSNQAYIAQTNTNGNSSQNITIGTSAVTSSVNVRLSDLATQDYTFEVVSDTTALAEYNQRNETSYKPLPASLYSLSSNEVKIEKGKSVSSDVTLTINPLTQALKDSGQKYAVALRLKSKDGKYDVLNSGSSMVYILDQVVYQAVPIINATHNIHFAMRQTYDLSQWTVEMNVNISKLGKGLGELNNQTLFGAWGNDGGEIYTRFGDAPIEGNRLQIKTQGTQMNSKQLFNENQWYHLAFVCTGTKLYLYVNGALDNSMDLPGKATNLSNRINFGNTDYLKANVKVSELRFWTVARTQAQIANNMYACDAKSTGLEAYWKLNEGQGDTFKDATDHGNTGKCVAVPTWEQNVRIDGK >CP022041.2|ASE18180.1|354111_355203_-|endoglycosidase MKNLIKIFLLSACAATAFTACSDWTETEAKDGADLTHTNKSEAYYAQLRDYKKTDHSVAFGWFGNWTGTGVTHENSLAGLPDSTDFVSLWGNWKNPTEAMLKDLRFVQKTKGTKVLISCLVFDIGDQITPTNTDSTLTWKEWRHKFWGWGNDEASQIAATEKYANAICDTIAKYGYDGFDLDAEPSYAQPFQTDKELWQNAKVMEAFVKTMGKRIGPKSGTDKMFVIDGEPDAMAAQYGEYFNYFILQAYSSSGNSDLNSRFTAQATHFQQYLTPEQVANKLIVCENFENYAGKGGVSFRLDNGTTLPSLLGMAYWNPVYNGVTYRKGGVGTYHMEYEYTVSGQTGNYPFLRKAIQIMNPSIQ >CP022041.2|ASE18181.1|355238_356789_-|SusD/RagB-family-nutrient-binding-outer-membrane-lipoprotein MKKINIYKAVTACTFASVLLCSMSSCTDGFQEANRPGTGASLEDLSRDNYQTSSFLVQMENEAFPEQENAYQMNEDLIGNYLGRYMTYANNGFAEKNFARLNAPNGWVRYPFKDSMTKTVSAFKAIDNVTKGEGPVYAWALILRAQSFMRLTDMYGPLPIGADATDGNAYSSQEDVYKSIIADLNKATDVIKPLVASNPNVTIAEELDKVYQGKMAKWLKYANSLKLRIAIRIRYVEPTLAKQLGEQAVQDGVITSNDDNCAIAYTPNGQYKTSVEWGDSRACADLESFLTGYNDPRLTKFFTPVEGGIRSVIGCRAAAKIGNKTTAGKAYSAANIKIDSKGVWLTASEMAFCRAEGALAGWSNMGGSAKDLYEEGVKLSFEQWGAGSATAYLADNTSTEKNYVDPISEYGGDVSAVSNITIKWDDSATDEQKMERLITQKWIAMFPNGQEGWSEIRRTGYPKVFPLAQSTDYSIQVANRIPFDIDEATNNKANYIKAVQLLKGNDDYATKMWWQR >CP022041.2|ASE18182.1|356820_359922_-|SusC/RagA-family-TonB-linked-outer-membrane-protein MRKGKILPVSVFLSCCSLTAFASSHVTSADNFGKNVVVAATKANTAKVLGTQQSSGDVKVTGTIVDNAGDPVIGATIRVKDSQHGTTTDLDGKFEIMTHKGATLIVSYIGMNTEEVKVSGDAPLNITLKAEAHQIEEVVVTALGIKRSEKALSYNVQKVGGENLTTVKNPNFMNSLSGKVAGVNINASSAGMGGAARVVMRGPKSITRSNQALYVIDGVPINNTSQGEISGGAFSSQPGSEGIADINPEDIESISVLSGPAAAALYGSAAAQGVIMITTKKGKEGKVSVTVSNSTQFANPFVMPEFQNSYVNRAGDVKSWGAKTPSVYGDYEPKDFFNTGTNVQNNVALTAGTDKNQTYISVGTTNAKGIIPNNSYDRYNFAFRNTTTFLHDKMTFDFNFNYIKEHDKNLTAQGQYFNPLTAVYLFPRGESFDAVRTYELYDVTRGINVQNWNFGDALSMQNPYWVANRMVRSNNRSRYMISASLKYKIFDWMDVVGRLRWDDAGTKQEDKRYASTTNLFAHSKYGFYGYDKVNDRSLYGDLMFNINKTLNDFSVSANLGGSFTRNKYNVTGFQGGLKAPSNLFTPNAIDYGSATADNRPIFTDHAHKINSLFANVELGWRSMLFMTVTGRNDWDSALDGTDNVSFFYPSVGMSAVISQMATLPSWISYMKVRGSWASVGSAISPNITSPWRYQYNPASGTYSTVTYKFPSNFRPERTNSWEAGLTSRFFNNALTLDVTVYQSNTRNQTFLRPITGSQGFSSEYVQTGNVRNRGIELSLGYGHTWGDFAWNTNFTYSANRNKIVELLDDPNEVINQGGLNGANIILKKGGTMGDLYMTSDFKRDAEGNVAIKDGNVSQVNLTNPSYRGSVLPKGNIGFSNDFSWKGFNFGFVVTARFGGIVMSQTQALMDAYGVSKASADARDKGGIAVNNGLVSAENYYAVVGGENPIWSEYIYSATNARIQEAHLAYTFPRRMLGGMELTLGLTANNLLMLYNKAPFDPEATASTGTYYQGFDYLMQPSLRTLGFNVKLKF >CP022041.2|ASE18183.1|360588_360789_+|aminotransferase MKKIYLEPETKVYEQKVEGHLLAGTGVEGSTTDPKQGGDPTSTSTTTIPNPFNSEAKEKSKWYTKE >CP022041.2|ASE18184.1|360895_362800_+|hypothetical-protein MEKLSKFAIVLCGTMAISLAACQTDELSDNTAGENTPKVYHVTVSADAAQGTGASTRALAVDPANGKRLISEWLVNDELAAFVNGDEGKQNNYFVLTTDRGGKSAKFIGDIAAAGRSMSTNDNISFLYPAVALKGANKTITPVERREESVAVGTVSKIISYVPSAKAQKYVSLNLSRQDGTAASLGSRFDYQCKSSHPQKVESDKLDIKIGKLQRLVSFWGLRFTDENNNKLSNIDSIYVSNIKASGILNITDNTFASDDKYEANKSIAVIPAKGTKFTSANNQYTYIALLPGNYTNVHIMVYSKGKMYEQEYANINFTADNVYHTDILHMTPVGPKPWVEVQGVKWATGNFIHYGPANGGYWGIAPAQWWISQRAVTLNNRRKVTNDKAGNTTSQFADFPTQKVEDVDLFRYGHIKEALDLRSSLLFRNSVAGKKLYYPASAGTYTDWDINGTGATRGDIAWYYTKDKHQHYRMPTGDEMKKLYTEARAIPAYCYTDKGTKVYGAYFTTSTSGGYHEFPAKRFVKSIDSYTNVTALIQANKGLFLPYTGLRDPGAVGMKMRDLSGDGNAYGQYMTTDDSKWNDRSADFFFGAAEWNWAEHPKTQAKAIRPVWDSGDNTLDSDYLNLRNSLGIK >CP022041.2|ASE18185.1|362792_363044_-|hypothetical-protein MLRVQKGVNKGLKGHLLQVKRALVASRLVVFIKLVCEKKADKGGRWFGFKYGDCEKENEFGNHAEPIIIKVYLKKTESSGTFT >CP022041.2|ASE18186.1|363441_363636_+|hypothetical-protein MRKSELMALETIATGMKVAYQKPLCAILSVEAESVICSGSVTGETENKNVSEDPSDAWYTGQDS >CP022041.2|ASE18187.1|363759_365589_+|hypothetical-protein MKKRILLPALCMAISIFTASCSSDDIESPEAKEQMVTLTVSAGTEDDAAATRAVLSEADAAKSPWKWEQGDKILLVSGNGSQKTVSTLSLKSMKRGGLGADFEGQVPASSVTNGSTYRFFYVGKTSDGRSDRSVDASTGSINIDLTQQTGNLADLKRNCVLTGEGKVVVSGNKATTEGSIKLANVFAVAHFAVTANNNTALTKIGLRGKGVYASANVDLSTGAVTGISEIGTEVDPEENIFFPDGKTDFYVTFVPGTVAPAFDGYYAGSHSNTEAITSTPEARNYAADKSFVYLNGQKAKFSVSATQKVAITNGNMQYVMPIATYTSAMNTKTLSANTLIRWKNSVPLKLKGSLTIHKGYYRLAPEQWEMAMPKNKRTGGSYSYSTIKINGKEYVSPETYGYFDLPSWGTIDNPTVINNTFSIRGTGTETQYDFGNKLYVGSKKTRVMTSDEWAYLMPQNTNNSNNRIWIENGKKYAKWARCFIDENGDGKRGTNPAELRGYLIFPDDMTIDEARNAFTKTPTFGGGNAINNPTTYEKIKSSGAVFIPLSAYRSRNNRTLSVWGEHGNYSTSSYVNGSMVHIRITATSSIFNDRSDPNQGCMSRLVQNL |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP022041_4 | 415025-415115 | Orphan |
NA
Consensus repeat of CP022041_4
|
1 spacers
spacers of CP022041_4
>4.1|415054|33|CP022041|CRISPRCasFinder TAAGCCTTCTTTGCCTTAAAAAAACTCTTAGAT |
CRISPR arrays and Neighbor proteins around CP022041_4
The CRISPR arrays of CP022041_4 >merge|CP022041|4|415025-415115|CRISPRCasFinder TGTGGCAAGAAATAGTAAACGGAAGCTAATAAGCCTTCTTTGCCTTAAAAAAACTCTTAGATTGTGGCAAGAAATAGTAAACGGAAACTAA >CP022041|4|4|415025-415115|CRISPRCasFinder TGTGGCAAGAAATAGTAAACGGAAGCTAA TAAGCCTTCTTTGCCTTAAAAAAACTCTTAGAT TGTGGCAAGAAATAGTAAACGGAAACTAA
>CP022041.2|ASE18220.1|413917_414274_+|hypothetical-protein MKIIRILYNRLKNNRIKQRSTVIKIRKRNKTYYQTDMNLTKISYPKPPTKNPQNHETEMKTRLKTLVLQAYSKTENACTAPKTYVRQKGRLRPKTLFKVFFPQRKSIYFLKTKHLFLT >CP022041.2|ASE18219.1|411927_412854_+|site-specific-integrase MKTKTIREISLAWKRDKQRYVKQSTYAAYVLVLENHILSSFGDCDSLSEKLVQEFVLQKLNAGLSIKTVKDILIVLKMVMKFGVKNGWMNYCEWDIKYPTTVINKEMEVLTVTHHKKILDFIKQNFTFRNLGIYISLTTGLRIGEVCGLKWSDINTDSGIITVNRTIERIYIVEGERRHTELVINSPKTKNSCREIPMNKELLAMVKPLKKVVNINFYVLTNEEKPTEPRTYRNYYHRLMKHLDIPRLKYHGLRHSFATRCIESNCEYKTVSVLLGHANITTTLNLYVHPNMEQKKRCITQMLKYLRK >CP022041.2|ASE18218.1|410662_411811_-|restriction-endonuclease-subunit-S MLDFYSTNSLSWDKLEYGTNAIQNLHYGLIHVGLPTMVDLDNDKLPNIVSGNTPKNYELCQEGDIAFADASEDTNEVAKAVEFYNLNGKDVICGLHTIHGRDNQHKTIVGYKGYAFSSTAFHQQIRRIAQGTKIYSINSKNFSECYIGIPSKGEQKKIATLLRLIDERISTQNKIIDKLESLIKGICNNYFLKLSHSQEMKSIRLRDILKERNEYCCKDGTFVHGTLSKDGLFPKTERWNRDFLVKEENKKYKITHLDDICYNPANLKFGVICRNIYGDLIFSPIYVTFEISKKVNIGFIELYLTNRNFIEKIRKFEQGTVYERMSVSPEDFLSYKIRIPSLSEQTFFYQKIQRLKNCSQNELEHLNLYKKLRHYLLRQMFI >CP022041.2|ASE18217.1|409446_410670_+|restriction-endonuclease-subunit-S MTTTIDNEQKTLNVPNLRFPEFEGEWQEERLSDIADISKGIGISKDQLSADGEPCILYGELYTKYKSETIKEVISKTNIDNTKLVKSKANDVIIPCSGETAEEIATARCVLKDDVLLGGDLNIIRLHGYDGSFMSYQLNGKRKYDIAKVAQGVSVVHLYGEHLKNIKTINPSLNEQKKIANLLSLLDERISTQNKIIDKLESLIKGIMVELQKQGQNKGNWRNVLLSKVLKERDERNTNLYQVFSVSVSQGVINQVDYLGRSYAARDTSKYNVVHYGDLVYTKSPTGAYPYGIVKQNFNQENVAVSPLYGVYIPNSLSVGRYLHEYFRSEINTHNYLHPLIQKGAKNTINITNQRFLENSVPIPVCANELLLISKLLLSFNDKIKYQQSILQQYQKQKQYLLRQMFI >CP022041.2|ASE18216.1|407855_409301_+|transcriptional-regulator MEYIDFKKLIDALTAKPKETEWLEFKHNFHSKEEIGERISALSNSSYLCNMPFGYIVFGVDDKSHDVVGTDLYGKQIMVGNEELESWLSTRLNPRIDFEIIDDFNYEDKGHVCIFKIPATINRPVSFLHEAYVRVGTITRKLKDFPAKEAKIWKGDRKSLEKIVLKKGLSGQDVFSYLSAETYFDMMHLPLPQDTNGILDRFLAENLITKDEIGYSITELGAILFAKHLSDFDGLKRKMVRVIVYKGKNKIETIREQTFDKGYAIGFEEMVAWINSQLPANEEIGMALRKDARMYPEISIRELAANMIIHQDFAVQGFPMIEIYSDRIEFSNPGQPLISVERFIDEYQSRNDTLADIMRRMGICEEKGSGMDKTIFYVELYQLPPVRFQLYESRTTATVFSYRKFADLDKSERVRACYQHACLKYVSNEKMNNQSLRTRLGIEDKNYPMASRIIKDALEAKLIKEENAEGGNRHNYIPYWA >CP022041.2|ASE18215.1|405740_407849_+|DUF262-domain-containing-protein MKGYAKPLYEFIEGNKIQFVIPVYQRNYDWLIDNCDQLLSDLVKLSRLNRYSHFFGSIVTSSADNNSYNRLVIDGQQRLTTISLLLLAGIKAVKDDAIEISDKKRVEEAYEVFLNAKFCNAERKIKLVPIENDRIAYDKIFNGEGSFNEESKITRNYHHFYEKLTRRPQLLSFDQLLDAIERLQIISIELDKDDDAQLIFESLNSTGLALTEADKIRNYLLMSLTPEEQLECFKNYWQKIELATESQPTRFLRDYLTIKQQLQRPVRLSNIYYEWKKYMEGHDRKEELIEMLDYAHYYQQVVEAKLFTAKLSEKMRHICNIETDVANVFFIQFLKYASTNELSEAEMYKVIDLVENYLARRIVCNMPGNALTQVFCALHKDVLKSLEEYASANIELDYSYSDILTFHIMRRDGNYQLPRDVQFVESIKTRDAYHMLKPFQIFLFERLENSVHGEYNDVAIDMKKKDATIEHIMPQTLSGEWKAMLGDNFEEIQEKYLHTFANLTLTGINSELSNKPFAIKRDGKAIGNEIYPGYRNSKYRLTKNVTLCEKWTETELQNRSNEIVTIFLRLYPLPQTTFKPLPKPVDEVSLEEESFTPTNRQLKGFRLFGNEYTETTWKEMLLRVVKMVEQQYTDIVDTLYDAEGFFWSAQQADTRYCTQIAPQKYLWTSMDNRSKLRCLRFLFEKCDIAESELVMLLEPIRE >CP022041.2|ASE18214.1|404165_405722_+|type-I-restriction-modification-system-subunit-M MSEELQQKLRDQLWEVANKLRGNMSASDFMYFTLGFIFYKYLSEKIEKHANDALVDDEVTFKELWSMEKDTDIEELQESVKTECIENIGYFIEPNFLFSSVIESIKKKENILPILERSLKRIEDSTLGQDSEEDFGGLFSDIDLASPKLGKTADDKNTLVSNVLLALDDIDFGVEASQEIDILGDAYEYMISQFAAGAGKKAGEFYTPQEVSRILAEIVTLGHARLRNVYDPTCGSGSLLLRAASIGHANEIFGQEKNPTTYNLARMNMLLHGIKFSNFRIENGDTLEADAFGDTQFDAVVANPPFSAEWSAADKFNNDDRFSKAGRLAPRKTADYAFILHMLYHLNEGGTMACVAPHGVLFRGNAEGVIRRFLIEKKNYVDAIIGLPANIFYGTSIPTCILVFKKCRKEDDSILFIDASKDFEKIKTQNKLRPQHIQKIVDTYRDRKEIEKYSHLATLEEIAENDYNLNIPRYVDTFEEEEPIDIHAVMKEIKDLEAKRADLDKEIEGYLKELGLVE >CP022041.2|ASE18213.1|403513_404164_+|hypothetical-protein MIKIIRGLDITALGVCVYNRKWQSVHLQQGDMDGACAVYSMMMNLIVLKVFTRNQVTNLNTSFKGNTAKGRLFKEFFVKEGLCRDGFYFSEIKEKLSHSFAKEVMSSARQYMISQSAQASYVEELKKGIDDNLPLVTAITFKGGAHAILAIGYEEEEGVIRKIFCLDPGHAISQTALWNSVIILNEGKGMYCHQYITDKDDENVFVSETLKIERKK >CP022041.2|ASE18212.1|400658_403502_+|type-I-restriction-endonuclease-subunit-R MPVQSEAALENGLIATLQQMNYEYVQIEEEKNLRTNFKSQLEKHNRKRLEEIGRTEFTEAEFEKILIYLEGGTRFEKAKKLRDLFPLELDNGERLWVEFLNRTHWCQNEFQVSHQITVEGRKKCRYDVTILINGLPLVQIELKRRGVELKQAYNQIQRYHKTSFHGLFDYVQLFVISNGVNTRYFANNPNSGYKFTFNWTDAANVPFNELEKFATSFFDKCTLGKIIGKYIVLHEGDKCLMVLRPYQFYAVEKILDRVKNSNNNGYIWHTTGAGKTLTSFKAAQLVAELDDVDKVMFVVDRHDLDTQTQSEYEAFEPGAVDSTDNTDELVKRLHGNSKIIITTIQKLNAAVSKQWYSRRIEEIRHARIVMIFDECHRSHFGDCHKNIVRFFDNTQIFGFTGTPIFVENAVDGHTTKEIFGNCLHKYLIKDAIADENVLGFLVEYYHGNADVDNANQNRMTEIAKFILNNFNKSTFDGEFDALFAVQSVSTLIRYYKIFKSLNPKIRIGAVFTYASNSSQDDALTGMNTGSYVSESTGEADELQAIMDDYNDMFGTSFTTENFRAYYDDINLRMKKKKTDMKPLDLCLVVGMFLTGFDSKKLNTLYVDKNMDYHGLLQAFSRTNRVLNEKKRFGKIVCFRDLKSNVDASIKLFSNSNNLEDIVRPPFNEVKKNYQELTTNFLEQYPTPSSIDLLQSEKDKKQFILAFRDVIKKHAEIQVYDEFEEDAADLGMTEQQFMDFRSKYLDIYDTFAGGCKPSEENQTPDEDTESTETSTESGIDDIDFCLELLHSDIINVTYILELIADLNPYSADYKEKRTYIIDTMIKDAELRNKAKLIDGFIQQNVDDDRDNFMARKQKFDGTSDLEERLNNYITTERNNAVDKLAKEEGLDVTVLNHYLSEYDYLQKEQPEIIQEALKEKHLGLIKKRKTLTRILDRLKSIIRTFSWE >CP022041.2|ASE18211.1|400372_400618_+|XRE-family-transcriptional-regulator MYLRILQYTMINQQKKAIYRIKAVLAEKQLSGKWLANEIGRTENTVSRWCSNKVQPFLENLLEIAKALKVDVRDLLRSTEE >CP022041.2|ASE18221.1|415426_417289_+|DUF4268-domain-containing-protein MGLNNRLQIGDVSNNGQVEIVDEDRLCYLVRSTSKGATGLRTISKSLLEEYVNYWSEHSDATSESARQALSGTSEIDKYEYGYTSTLSVMAQMVLQSTHKVKNNVSSLPRQQIFYGAPGTGKSFTINQEIKGEDVIRTTFHPDSDYSSFVGAYKPTTREITMRDLSGHPVVEHGQTLTEEKIVYEFVPQAFLQAYIGAWEKYAACDEGNPHRQFLVIEEINRGNCAQIFGDLFQLLDRNHSGFSDYPVKADTDMKRYVAKALKGLTIPQAGAINSLYGGRDVVSEVLEGNILLLPSNLFIWATMNTSDQSLFPIDSAFKRRWDWSYMPISDAKKGYVIDVAGSQYDWWQFLEEINEKIENTTNSEDKKLGYFFCKAHGGVISAETFVGKVVFYLWNDVFKDFDLVGPIFDDTVEGGRLTFAKFYTEGEMKTKVRAEKVAQFLGNLGLTPLEESEEEYNGQAESTDDSENPRATWSVTERKRYDFWQAFLAYAQKNDEFKTYFGGTKKAGKDHWKNFYVSGADFYMSVVLKLWERAIALQVYFDRTTDTYYHLATQKKEIEAEMDTTYEWRENPEKKSSTIVERIDDIDFEDKEHWTTIFDLIITRTLRMREVFVKYSKQQ >CP022041.2|ASE18223.1|419750_420530_-|hypothetical-protein MAKVNYPKGEKLARLLLDVLAKEPSYKEHSKGLPYYYISFEGQEYYLYFKCITHEGNPYPLEHRRSQLPQRPEFDKIKNDNIPFLFLGYDIDNDLFVCWEPAKVKPRLNNRSYVSFYSRLSIQESVEEGKIRDEYLTNGDKFVLFKRVDAVSFFQMIDTHFEELKHDDTQTDTRSVSEPPTDYNTQKKLGNHVQGRIVSVEEDVSIQLKIDSMAQNNSVLEIVADCMNEFSTYYPKMSFADWSQVIRKYLQNDNNLYKR >CP022041.2|ASE18224.1|420531_421500_-|PD-(D/E)XK-motif-protein MSNLYSTYSQLRERRKESGLYEVEDFIIGKPHKFGATEDGFPVIFVECCDDAVSTPIRLKAISVDFSQLCTLKDAGGETLTKKYTIIVLNSLEADLQSYFLEVFAMVLNKFSSTPSVSLLKAEISKVAKIFMMPPSFSAVVIQGLWAELFVIANAKSPEDLAKAWHVTAEDKYDFNDGKDKIEVKSTSNLDRVHTFALEQLNPNAGSELAIASVITVRSGQGVNVFDVLDTISQRGLSIEQMSKIQEIAYLTIGPHLEEAKKIKYDFTLALNSYMKFDYRDVPSIKSEHVPSGVTSVHFASCLKDVEPIDLSATNSGLLKFM >CP022041.2|ASE18225.1|421517_423731_-|endonuclease MSSTIQVINPTNKQNNQVVIGNNTLHFMDSQSKLDDEGKSIIIDEAMKILSHCVKPGTNDSITNIAVGYVQSGKTLSFTTLTALAADNGYRMIIYLTGTKTNLEKQTSDRLASDLDTDSSDVYNLMSGIDDNFTLDTSIKNFLIHTDDVILIPILKHYKHIQRLADTFVSPTLKSCLNNLGVIIIDDEADQSSFNTFAKKNTANPDWIEDDFSKTYASILALKKSLPNHSYIQYTATPQAAFLIDNSDILSPTYHTVLTPGKGYTGGKFFFKNKNYQLVHLVDDAEVYHHKRNPLTTTPKSLIESLQQFLVSVAIVVFIQKRKNVDFLSMMIHVDGRCDTNTLFANWTKNALQQWIDILTLDEKDPGRKLVCKKFKNAYDEMTRYIQNPPSFDEVMKNMVKVILRTKIHLVQSQGGSVGDDGISWKSAKANILIGADMLNRGFTIEKLSMTYMTRTTQGKSNADTIEQRCRFFGYKMDYADICRIYLSKKSLVEYNDYVEHEETLRANLSQCETLEEFSKHSHAMLLAETLNPTRTNILSSKLVRNKLSGWKQMPSLDCIDNNKILFESFLSNIPSTAYTDCENYGNNPIRNHRWVNIPINDFIDFFKLVKYEDAPNITRKIVTIQYLYYLRDSVNVDHIRLYEMAYKATVQSGDIRTRSIKDDKPNNLQAGRAANGSYPGDIKFCTDNEVCVQVHHIKIKQPLHRLNSKDLYNLCIYYPENLATSFVGLDSDDEDD >CP022041.2|ASE18226.1|423743_425309_-|ATP-binding-protein MPSNVSIATRPLVYSTFRYISNKVWNALAEYIDNSIQSFLDHQDVLSKINPDGIKLRVSINMDFENDTIIIEDNAFGITEENYQRAFELANIPLDNKGLNEFGMGMKVSSIWLSNVWKVETTAYGDDVLKTVTFDLNEVVENEELSLPVTEESCDKEAHFTRITLSHLSGNKPTPRQLSYIKKHLASIYTLHLRKQTLELIVNDEPLEYKELTILNAPKYNEPNGKPILWKKDINLTFGERYAISGFIALLDTMSTSIDNGFLLFRRGRVIGSSYDERYRPKELCGQEGSPLYKRVFGELYLTGFDVSFTKNSFQEDDDFAELIKLLREDLNKDKTFDLFAQGQHYTKPRTQKEIKNIGTKLVKQIISGFTKPIVHTPSTTISTPNDTTPKKPALVIPVTTPQTQAKVEQIKSTPTDLFDGIPVDILLDNNEKVELTIKTGEAVAGLYTFNEIADDKYEATINLKNNVFQRFASSLSTQEGQEQLSYMIEVMVASEISMIKGGSDAATKFRSTFNNLFGTI >CP022041.2|ASE18227.1|425298_426660_-|restriction-endonuclease MAKKKQLQFIDLFAGLGGFHLALSKLGCKCVFSSELKEDLRKLYQINYPGVRIEGDITKIAPKDIPAHDIICAGFPCQPFSQAGNRQGFNDEKGRGTLFDYIIDIVAYHKPKYIILENVSNLKGHDNGNTWRIIQEKLDEQEYSVKAEILSPHEFGIPQHRKRIYIVCIRKDLGLLDNFTFPKGNKPVCDVNDIIEANAKDITPIKEETHYQLNIWQEFIDKTIANGGTIPTFPIWAMEFGASYDFETVAPAFQSIEQLVEKKGKLGKIIRGTTLKECLAQLPNYSQTDKTRVFPVWKIRYIQQNRDFYNKHKSWLKGWMKKVVHFENSHLKMEWNCGVNVEPHIENKIVQFRASGIRVKKPTFVPALNLVGTQVPIFPWIELPKDMQKPEIGLTKGRYMTLHEAASVQGMRELSFGNDDFRLSLARSYEALGNAVNVELVKMIAKKLLDYAK >CP022041.2|ASE18228.1|426671_426869_-|XRE-family-transcriptional-regulator MEDINQIKLALVKSKKTNKWLAEQLKVNPTTFSKWCTNTTQLNLYTLKKIAGLLNIPVSELIVSE >CP022041.2|ASE18230.1|428062_428548_+|peptidase-M15 MINDSHSNEIDFEERLSPHFTVGEMMRSGKAVGMGIKNVPEENPAPGEASRAEVIENLRELCRCVLEPLRRRVGRVIVVGGYRCEAVNRAVHGAEHSQHLRGEAADIHVTGLEMCRKYAAILSQTDFDQMILEPQESIKKRWIHISYRRDGKNRHQILGAK >CP022041.2|ASE18231.1|429165_429687_+|DNA-binding-protein MSIKFRMYQDNRKNSKRKGYWYARAVSPDLVSVKDLALRISERCTVTEPDILAVISALVFEMNQVLKDGNRVKLDGLGTFRVGIHSQGVQKAEDFNAQRDIYGAHVLFSPTVTIDAMKRRVKTLISGLRIQEAVQYDAPKAAEKAKNKGKKKENKPSAGPEPGEATATTEGHA >CP022041.2|ASE18232.1|431479_432805_-|Na+/H+-antiporter-NhaA MRQRVEKKLNQHLMLPIKLFMGREKSGGIVLILSVTLAMILANSNIAESYFHFFEQEVGFIVNGEPYLNYSLHHWINDGLMAMFFFVVGLELKREFIGGELADIRNTILPIGAAIGGMIVPALIFLSLNIGTPQTMGWGIPMATDIAFALGVVYLLGDKVPASAKVFLTTLAIVDDLGAVLVIAFFYTSELSIASLLFGLGFLAVMFIGNRLGIKSLFFYAALGIGGVWVTFLLSGIHATIAAVLAAFMIPADAKINESVYLKRMKKLTRRFEKEEPNEVRTLEEGQVDVLTHIQHDTEIAIPLLQQLEHKMSPIVTFLIMPIFAIANAGISFTDLSLSDIFSTHVALGVTLGLLLGKPIGIIGATFLMVKMRWATLPSAITRRTLLGLGMLASIGFTMSMFISTLAFTDELLMTQAKLGIFLASILGGIGGYVLLNKKSK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP022041_5 | 479024-479203 | Orphan |
NA
Consensus repeat of CP022041_5
|
2 spacers
spacers of CP022041_5
>5.1|479076|37|CP022041|PILER-CR CTTTGCAAGCAACTGGTTCACTCGTCCACTCGTTGAC >5.2|479165|28|CP022041|PILER-CR AGCAACTTGTTAACTTGTCTACTCGTTG |
CRISPR arrays and Neighbor proteins around CP022041_5
The CRISPR arrays of CP022041_5 >merge|CP022041|5|479024-479203|PILER-CR CATGTTACTCTGTCTTTCTTTGCAAGCAACTGGTTCACTCGTCCACTCGTTGACTTGTCTACTTGTTAACCATGTTACTCTGTCTTTTATTTACCCTGTCTACAAGCAACTTGTTAACTTGTCTACTCGTTGACTTGTCTACTTACCATGTTACTCTGTCTTTTATTTACCCTATCTACA >CP022041|5|1|479024-479203|PILER-CR CATGTTACTCTGTCTTTCTTTGCAAGCAACTGGTTCACTCGTCCACTCGTTG ACTTGTCTACTTGTTAACCATGTTACTCTGTCTTTTA TTTACCCTGTCTACAAGCAACTTGTTAACTTGTCTACTCGTTGACTTGTCTA CTTACCATGTTACTCTGTCTTTTATTTA CCCTATCTACA
>CP022041.2|ASE18268.1|478282_478780_+|hypothetical-protein MVLKDVSYLKRGNYIYMYGHWLNMEFSQEFDTSYWKIFTENDFFLKLGFRKENNYYKLYIPYGRSYGYSAFYKYCYCKYQGGKYTPNTVVSDGNIILFPDLETQTCLLGFLDNGDHYIEIPYKKFIEEVTDIWEERTPIEGFKFDVEPIVYLKKDGVWLKEPKEQ >CP022041.2|ASE18267.1|477500_478010_+|hypothetical-protein MMKIEKKVGKRILINGKELRLDTYCNNDNMWFWYIYTKEEVDSRLFSKSGEYFKLFLKMDQKYPYSAYESRMYCIYLGYKYEVENIWHGLFILSPNERKTRRHLKLNDHDDSRIEVPYEEFIASSPIIWEERKPISDFVFDVEPLVYLFKDNSYIEENLHGAWYKRNIK >CP022041.2|ASE18266.1|476739_477201_+|guanylate-kinase MRTYKRGNFAIYLSQEYYFYKTDNPDMFELIDRKCQYEKLSKIGFLQHNNIISYKYVSKKDISSAFNTITFVKYKGFNFFVENSSEGKFILRPLEEAMKYFKDFPRHGYDPIYEAMEEEISDIWEERRPIEGFKFDVEPIVYLKKDGIWLVEE >CP022041.2|ASE18264.1|474840_475965_-|sugar-transporter MAIKLNRGITHAEKEIKEGDIFYVYNDYYKKYFFGKILVDISRLTKQVGKDSALDFFSDCYLVAVYKEISDTPELHSREFIIPGSFIYKSSFKRRNRQGFDWTHYAYEAVDFHTLDFPEFFLNYDDGVYLVRGELKFRTELSRQQEEEYKIRGSKSGSIDYSSALLLQGYKAYSDRINYHDLRLLPELRKSIYDMIGEDASMSYYNLALKYGKDTGRLFTDALPEEVQQIKPVETDQRTGFPKELLCGIAWSFRQQRYYSLAAFADELQAYNEEITGEYTPGVWTDELKLIGSRILVQYEHWDDELEESREEKVFLQADNGSYFTVSELIYKIHNHVCDKLVNDDNVFFEGLQLFERDDVNHPRTPYYFILQGS >CP022041.2|ASE18263.2|474433_474814_-|hypothetical-protein MFKKILGDKYTRANNIAKSVRSLQRVPEKERPRIVKKAVLNVYVIGSIIILISLWLYFFGNEIMAIPIGNNLEGETFRSRRANLYLHLLVPVFIPVIFIFGIPLLIRNYIIKRIVDKEFPKERRRM >CP022041.2|ASE18262.1|473805_474324_-|hypothetical-protein MNLKKILKEEATWCFLALAGLLILLYVGAYVIIDTYFYMISLNILIFLFSYIILKIKNKLHYYSYVVGCAFFVVWLIFYIICDLKSRSLKGYLTKQLPVLFYIPTGSGGRGSSSGIEVECKGNQHKISTTHESDSLYQIYGDSVINHIVVRYVLKEPFPHVYYIDRMQITYK >CP022041.2|ASE18261.1|473336_473741_-|hypothetical-protein MNMRIITFIGLITFIISCFLPCFSVRDSSFDYMGYTALFLGWAGVFINSKLTVYYISWYANILFLISLFIKRKYIGIILLTGSLILGCLFHGCPYIVINEAGKHSDIVSLQIGYYVWILSFFILLIGRILRYKE >CP022041.2|ASE18260.1|472426_473329_-|hypothetical-protein MSMKYLLLLVLFVGFSCIEPNVKDMLGDDFRLYKHTPAWSFAEAVEDEDTTEISKQVLQMHIPVDYRDPKYKQTLLMLATRTNKIESVEKLLELGANPNAHDDSTKYFGESAVLLACRFTRPSSKVLALLLKYRGDPNSTACGVKEDGLGEIVPMRDFALSAAVFSSFEKVKLLVDAGANINYATSTGNCAIENCMFHDRMDIMLYLLQKGADYRRKFTEIDVDKPDYPTFEVDILYKLRKCVYPIGSKAYKEKMKVVNFLKRKGLDYWKSPIPSGMYGVIMREIAPKNKADFDYYIKHY >CP022041.2|ASE18259.1|471908_472376_-|hypothetical-protein MNIEIIDYDTYKRLNYDSVFKDYHNDSYRIYGKIVEGDSYAKIAWSSDLLQPQFIEVFPKIFAIGIDQDFAIYDFDLKRRIMYLDLGFLFCEMAIFEKKILIATELEVIVIDTQQYKVIDTIPLQDTYDKMKINCGEVEIYCMDHSFEKCHIGNK >CP022041.2|ASE18258.1|471556_471922_-|hypothetical-protein MAINKGSSEIWSDKVSLLHIKGTQRGGEHGGESKQSHGGAEGTKSDRAVVYVLLIALVINVLRNLAKQMFCNEVDKNISKTRKHPLFLHAPTVPLCLTTSAIEKYAALFGYSKSCSYICEQ >CP022041.2|ASE18269.1|479485_482800_+|SusC/RagA-family-TonB-linked-outer-membrane-protein MSKKLLMCFAMLFMCVSAALAQTKISGTVVAADDNEPVIGATIMVVGTKSGAVTDVDGKFSLTTDVANPQVTVGYIGMASQTLKGTTDMKVVLKSSTQTLNEVVVTGLTRTDRRLFTGATDKVDAEKARLSGVADISRSLEGQAAGVSVQNVSGTFGTSPKIRIRGATSIYGSSKPLWVVDGVIMEDAANVGADDLASGNPETLISSAIAGLNADDIESFQILKDGSATSIYGARAMAGVIVVTTKKGKQGQAHISYTGEFTSRLVPSYSNFDILDSKEQMGIYRELADKGWLNFSEVLNGSEYGVYGKMYELINTYNARTGRFALENTTEARNRYLQRAEFRNTNWFKELFSSNIMQSHSISLSGGTQKSNYYASFSALLDPGWYKQSNVNRYTLNVNLTQHLSDKLSLNLIGGAAYRKQRAPGTLGQDVDVVGGEVKRDFDINPYSYASNTSRVLDPSATYVANYAPFNIFNELNNNYIDLNTLDARFQLELKYKPVKGLELSLLGAFKYMASTQEHFVKDESNQALAYRAMSNGIIRDANKYLYKDPNNPYVLPMTVLPYGGLYHRGDNRMSDYDIRATANYSHTFAEKHIMNLFGGMELTSIERQRNAFEGVGLRYDAGMVPFYIYQYFKRALESGNTYYTINPTNSRSVAFYGNTTYSYQGRYVFNGTLRYEGSNQMGRNSSARWMPTWNVSGAWNVHEENWFNKLSPLNKLTLRASYSLTGTPPDASYSNSTAIITASTPFRLFAEDQEPQLELSELANSTLTYEKKNELNLGFDASLWNNRLGITFDYYTRRNFDEIGPMVTAGLGGEIIRAANVAEMNSNGLELSISSVNIKKKNFSWTTSFIYSYATTEITKLFNQGNVMSLVSGNGFAKKGYPARALFSIPFMGLNSDGMPMVLNEKGQVTTDDINFQERTKTDFLKYEGPTDPPHTGSLGNMFTYRGFRLNVFFTYAFGNVVRLDPKFRARYNDLVSMTNAFKNRWMAAGEEKETNVPGILSKSQYMANTNVRLGYNAYNYSDARIAKGDFIRLKEISLGYDFPAQMFQSSMIKNVSLKLQATNLFLLYADKRLNGQDPEFYNVGGVASPMPKQFTLTVKLGL >CP022041.2|ASE18903.1|482812_484384_+|carbohydrate-binding-protein MKVKKYILYLPVAALALALTSCNDFLDKYPDSRMDLKNPTEVSQLLVSAYPQAHPAYLTEMYSDNTDEQLHSTWSAFDRFQEQAYQWKDIDDVSNTETPYQLWSAHYAAISSCNEAIAFINSVSNQDEYQEQLGEALLCRAFSMFQLSTVFCQAYDKTTAAKELGLPYPTEPEKVVGRLIERGTLAELYQKIEADMLKGISLVGTKYAKPKFHFTKQAAYAFATRFYLYAQQYDKAVKYANMVLGDQPADILRNWAEWNRLGPSGNVQPNAFVNASNNANIMLLPVPSQWGVISIPIQAGSKYAHGELISKNETLQAPGPWGDSGSTLNYTVLYNNGVSKYCLRKLPYVPKVIDATAEIGVPYGEYAVFSTDITLLERAEAYALLGQYDKALKDINTELTVFSKNRKQLTLADIQDFYGSMTYYTPTKPTPKKKLNPLFTVEATTQEPLLQCLLQLKRLVAIHEGFRLQDVKRYGITMYRRKVDVQSNVTAVTDSMKVGDPRLAIQLPQDVITAGVTPNPRNN >CP022041.2|ASE18270.1|484395_485283_+|hypothetical-protein MKKYLYLVAITLVSCGAVLSSCSDDKISGDSIFSTKAVERNAFDQWLYKNYTMPYNIDFQYRLKTEETEQAYNFVPADSAKTVKLAILTKYMWFDAYAETVGLDFIKENAPRIILVTGTPGYTRYRTEVIGSAEGGYKVRLGKVNALTDDQLKDYGSMNNYYFHTMHHEFMHILNQKKPYDESYDNISRSDYVSGNWTSIPDKKAQSMGFVSAYSMENPAEDIAELYSIYVTSTPEDWATIMKNAGNKGGTIINKKLKIIREYMSNSWNVDIELLRNAILRRGGKIGTLDLNTLE >CP022041.2|ASE18904.1|485419_486697_+|DUF4302-domain-containing-protein MKKVYLLFMAIVLTLSLQSCLHDDKTTFDLPAAERIEKKVADYKALLESSEDGWVMQYYTGKNYSYGGYTLLLKFKDGHVTAMGDVKDVEAQATSGYDVVKDLGPTLSFNEYNAVIHPLAETWLGSPDGAQGDYEFSILRATNDSIFLKGRKWHNEMVLTRLPKGTSWEEYMLGLVTVMEGMNVETYDFVLGNDTLAQGTLTQEVRRLTVTLGDKKWEMPYCTTNTGITLREPIVIGNKKYQHFTWNDADHSLTQDDLKIIQFLPKSHKNIDFWIGEWQLKTNLRKRIKLTLEMGSVANTLKGKLNINNINYEILLTYDPATGHLELPGQPVTDPTYKYPAGIVMIPASQKEGKLFGEGKGSLFFTWDEDMQRAKAEDSGQITGHAVDSFFGVAYGEDLQPVTDAQGNYVFAFTLPNIQYMTKIN >CP022041.2|ASE18271.1|486699_487764_+|hypothetical-protein MKKYISIFLLVAAAVLGTACSNDNNELPEYAAGLKVVKAQTAFNVIGGANEVKMASEPAQAYAQDAWLTVTKKAETLLLTATTNTSPQTRNTLLVIKDAKGDSITLNVQQEGITFGLPAGQDIFTDDKAVQKTLIATANVPVTYTTTGDWLSVAEQGSEITVKAAENTTGKARVGWVIAKAAGLVDSLKVVQASLADFVGEYKQTAKMRNADRTLSERTSDVRIEATGTNKANFIVDNKYTWAVDFIPGSGFKMTNGKVVAKNEVQTGVYEYFISVIVADDFSKEHETAINGTQESILLSIDDAGNLVFKEAQKLASEQTFSSYGWNRFSDSKPVMGAYRGIGEVYVQPKLTRK >CP022041.2|ASE18272.1|488076_488544_+|hypothetical-protein MSFRIVEMKDLSGAKAHIYSVKFDGYDETLLDCFFNENEDNKNLQEMLHKIIVMATKTGCLKQFFKEGEGSLADGVVALSVGNLRLYGIYFNNTVVLLGSGGEKNVRAYQHDPILNAKVEQIKYVAKKINKAIVERDIIVSEDGELNLDNFEVYE >CP022041.2|ASE18905.2|488527_488953_+|XRE-family-transcriptional-regulator MKCMSKMQVKKNVLSQLLSTIDEVALKKTSNRMMIAAKIGNALKDKGISQKEFAKKLKKSESEVSSWLSGDRNFTIDTLTEISLALDISLLDTEVQSVYSFPTRIFLPETGNVSNTKISVSSQWTCSLGYIDCRNKTQKVG >CP022041.2|ASE18273.1|488952_489396_+|hypothetical-protein MKNGDKIEFAIVGMQEDSYKVNYDIDFSKLNQEELEFQIEQRINVSAEPENIIISMRVHLMNGAEEIAMQGVRAIFKVKPFNSFVNDMQEDDLKVSNPALIDTFISVCIGAIRGMLVKNLKGTPLDNVVLPLIPMNVIRANSTKRTK >CP022041.2|ASE18274.1|489890_490322_-|hypothetical-protein MNTLKISIITCFSILYSLTSFAEVKSNVSIEIGSFDSIPSEIDGGCCVFYKYPNKMRNKSYIMVNDLATTAYMMINRHLEEFTLVSNQKDIFWYKNKRFTLKVTINHTQSKGDNECYKVKGILIVEDRNKNQRKLSFCGNCSW >CP022041.2|ASE18275.1|491727_492381_-|rubrerythrin MKKKFICTVCGYIHEGTEAPAECPVCHAKAAKFKEFNPEALKGTKTEQNLKNAFAGESQAHTKYLYYASKAKKDGYEQIAGFFEETARNEKEHAKIWFKFLHEGDIPTTTQNLADAAAGENYEWTDMYEQMAKDAMEEGFPELAVKFRSVGKVEKHHEERYRKLLKNIEDSVVFSREGDCIWQCRNCGHIVIGKKAPAVCPVCNHPQSFFQVEESNY |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP022041_6 | 507107-507208 | Orphan |
NA
Consensus repeat of CP022041_6
|
1 spacers
spacers of CP022041_6
>6.1|507144|28|CP022041|CRISPRCasFinder CGGAGTGAGAGTTAAATCAAAAGCAGGA |
CRISPR arrays and Neighbor proteins around CP022041_6
The CRISPR arrays of CP022041_6 >merge|CP022041|6|507107-507208|CRISPRCasFinder ACCTCCCCCGACCCCTCCAAAGGAGGGGAGTGCCTAACGGAGTGAGAGTTAAATCAAAAGCAGGAACCTCCCCCAACCCCTCCAAAGGAGGGGAGTGCCTAA >CP022041|6|5|507107-507208|CRISPRCasFinder ACCTCCCCCGACCCCTCCAAAGGAGGGGAGTGCCTAA CGGAGTGAGAGTTAAATCAAAAGCAGGA ACCTCCCCCAACCCCTCCAAAGGAGGGGAGTGCCTAA
>CP022041.2|ASE18282.1|506532_506802_+|hypothetical-protein MEESPSAYRWFGYMFVWMLACLLILDKGVSSELFLFILLLVAIVLNAYCAYKFALGKGTFLAILAFVVAMVLDFFPIVAYFVIIEIFMA >CP022041.2|ASE18281.1|505443_506262_-|KilA-N-domain-containing-protein MTKKGEIFVKDVAIKTMTKDGIDYICITDIARQKNTAEPKDVVKNWMRQKNTLEYLGLWEKLNNPNFKGVEFDPLLAEAGSNSFTMSPTRWVELTAAIGVFTKNGAGGGTFAQRDIAFKFANWVSVEFELYLVMEFQRLKAKEQELIGWTAKRELSKINYRIHTDAIKSHLIPEKVTPTQANIIYAEEADVLNVAMFGMTARQWRESNPDLKGNIRDYASVNELICLANMENINAVLINDDIPQGERLVRLNQIAIHQMQILERNSNRNLLR >CP022041.2|ASE18280.1|503476_505174_-|energy-dependent-translational-throttle-protein-EttA MATVDDKKIIFSMVGVSKIIPQNQKQILKNIYLSFFYGAKIGIIGLNGAGKSTLMKIIAGLVEPTQGEVVWSPGYSVGYLPQDPPLDEAKTVKENVMEGVQHIYDALKEYDDINVKFGLEEYYSDADKMDKLMQRQAELQDIIDATDAWNIDSRLERAMDALRCPKGDLPVTNLSGGERRRVALCRLLLQKPDVLLLDEPTNHLDAESIDWLEQHLQQYEGTVIAVTHDRYFLDDVSEWILELDRGEGIPWKGNYSSWLDQKTKRMEQEEKSASKRRKTLERELEWVRMAPKARQAKGKARLNSYEQMLNEEQKQREEKLEIFIPNGPRLGNKVIEAQHVKKAFGEKVLFNDLNFMLPPNGIVGVIGPNGAGKTTLFRLIMGLEQADGGTFEVGETVKLAYVDQQHKDIDPNKTVYDVVSQGNETIRMGGRDINSRAYLSRFNFSGTDQSKLCSVLSGGERNRLQLAMALKQEGNVLLLDEPTNDIDVNTLRALEEGLEAFAGCAVVISHDRWFLDRICTHILAFEGNGEVVYFEGGFSDYEINKARRLGNEEIKKGRYRKLMEE >CP022041.2|ASE18279.1|502367_503213_-|hypothetical-protein MFVLFFLCFSIYGKAQTAPKKLTAPSPERFQAINDSILAEGMWLYTYEKLAWRATDSLMKYNVKREEINSATAFEEGNLTWRYIFANLDKEQTVFELTLHLSNDTSFYVCSATPRKLKPTEIEQLKAKQIAPRKVIKEKGDSIFLANTSGLNWDLLPLEKGGYRLYLLHGTTKHGVIPIGDDYAFDFDKDMNILSWRRFHRSFLEQPITMNGEEITEVIHSHTPMTPYFTTTDIANYMLYGCDLYGIKRFSVLSTAFADTSYLTTFDVEKMKLTSTVYTVK >CP022041.2|ASE18278.1|501130_501901_-|NYN-domain-containing-protein MEKKLGTYLLILYFCSIKSIALLSCRVSATSEWLFLFNLHTMPKRVTFYFDGFNFYFALKRKKKISPEWKDFYWLDLVAFCESFLGPDQVLEKVIYFTASPLSPQKNSRQSAFLNANRILHSDKFEVIRGKYMSKQIECPYCKYSISKPEEKRTDVNISVRMMADCVQDKTDVIVLISADTDLIPPLNFIHTNYPNKKVKVFFPPGSHALELHNHLRTFHSKWVFLEKNERRFRNAVMPHTITVGNQSVSIPEKWK >CP022041.2|ASE18277.1|497629_500824_+|hypothetical-protein MLFERFLRNKSLMAAMALTAVMPSVAKASILSAAPTAEGENQQLKAGVYFIVNDKRQAGTDLHVYVDESGLHGKNYTSLAADYSNAFLLHAKGDKYTIQSLKDGKYVQNVNGNSVPYKTGNDAHKFQIVYQSQSSGTGKSYFNIYNDNVGGNQFCWHLDAQRNVVRWYPLTDNGRIALGPSEFRLDPVTSLSKQQVLDRLAELTKIVDPRKDLNKYYQIVSDTYGRAMREDYIVGELSTGGFVDTDYSYCWKLVKLGSGRYAFQNAVTGKYIAQQNGQTSRYYTTSEEQGNGFEFNLNESDPYVLTFEMVDAYNVGIHCAESQGYHPVGWYVNNEANKWVFKVATIDQAKLKEQQDAYKARVDLTRNVDRYATAVAKYFTNSAATEVTAATKAMTDEALKAQMTTEGVPQGLQEVVLKIKNQSWTVYPSGRNWEKQFRIADYKVYSENNYGAWARSMGIGYDYGSMTNPTGVTARDGEDLFVFLGDDIPQDATVQIELVPLGTRSAGKYHNLKKGLNIILNQGENNVFVNYIGRTFNNGKYLRDYKPMNIHIEGGKVNGYFDLTKGNTNEDWQKMQSDGLVWAKAFNMKGELVVMNMPSQACKDYTPVHMKELVEIWNSIVQREDDLMGFRAAKRDKCNNVLNATAVDHGYMYATTGGTYYNYNTLADVLNYDKMKWGNGTLWGPAHEFGHNHQQLFNTAGMTEISVNMYSNMVMFTSGRVTSRSEHCNYTDVDGQEHRGVCESAVSTYADRFANKKMWFEYGTWGTTQMYYKLYMMFHSTGLDDQFWHKCLDYLRTHRLEGQGTANCQGQNDYLLFAKACCVAANQDLSSFFEAWGHFYDVNGSVIGDYSNTTMYTTRAQWVEAKKFMQQFPKGPANNMIFIDDHIRPTPAIYPGAAPGTMREDFNGAVRVGTMGDFGSWDQFCPDSLGQGYAIIKTQSDANGRRTYTMEAKNSHVVGFKIYDSKGNLIYFANTKTFTIPKKVMEDAQNNIVIKVCGSDGSEVDPGTVPLGIKTFSAQANGGKVDVYSIYGVLVRSKVNPETALEGLPNGVYVVGGKKVVVGK >CP022041.2|ASE18276.1|493180_496975_+|DUF5110-domain-containing-protein MKKKKAKILLAALLLAVGETAFADGKAVSGIRQINPTTVEITYSDGGVMTVDFYGQNVFRLFRDPKGGIVRDPVSNPPARILLDNPRKNAGRLSVNATDADVAVATAEVRVRFDRNTGLMTVTDLRNGKEVIKEVKGVDFQKNRTTVTLANAAGEYFYGGGVQNGRFSHRGKRIEIVNTNSWTDGGVASPAPFYWSTGGYAAMPYTFAPGAYDFGSTDKNTVTISHDMPYLDLFLMVDTTPVSLLQDYYQLTGNPVLLPKFAFYEGHLNAYNRDYWKETTEEGKGILFEDGKRYVESQKDNGGIKESLNGEKQNYQFSARAVIDRYKKDDMPLGWILPNDGYGAGYGQTTTLDGNIANLKSLGDYARKNGVEIGLWTQSNLHPVDSIPALLQRDIVKEVRDAGVRVLKTDVAWVGAGYSFGLNGIADVANIMPYYGSDARPFIITLDGWAGTQRYGGVWSGDQTGGEWEYIRFHIPTYIGSGLSGMGNITSDMDGIFGGKNLPVNIRDFQWKTFTPMQLNMDGWGSNPKYPQALGEPATSINRSYLKLKSILLPYTYSCAHEAVTGKPLMRAMFLDDSNDYTHSSATRYQYMYGPSFLVAPVYQNTAADKEGNDVRNGIYLPKGTWYDYFTGATYEGGCILNDYFAPIWKLPVLVKSGAIIPMVNPNNNPSEIDKNRRVFELYPDGKTEFTLYDDDGTTQKYLANEKATTRITSDLNDKQVLTVSIDKTEGSFDGMVKNQSTTFYLNTNAKPKKLTAVIGGKKIRLTEAEQGDNTWQYVQASNINRFSTANSEMERLLVTKNAQIIVRLSSCDITKEAVELRVEGFVRLNKSNETLRKKGALTAPELLEADIQSYSVTPKWKPVQNADYYEIAFNGQTYTTIRHNSLLFDDLQPATDYDFKVRAVNSEGASEWTPLHVKTAVNPLEYAIQGLTATSTARDMEGFEIHRLVDFSTTGDIWHTYYYTKAVPFDFTVDLHSTNTLDKLQYVPRANGGNGTITKCDIAVSKDGRNWTEIGPQQWARDGRTKEVTLSTHPVARYVKVSVKEAVGNFGSGREFYVFKVPGTKTILPGDINLDGKVDENDFTSYMNYTGLKKGDADFDGYISGGDINGNGLIDAYDISNVAMHLEDGWNDDDVAPVGGKVYYEYNRKSYAAGDDVIIKVKGKDLQSVNAFNLIFPYSPKELQFVKVETDPSMVMRNLTYDRHHSDGSQVLYPTFVNVGDHHTINGDADLLVIRMKALKPFTVKNTTAKGLLVDKQLREVEL >CP022041.2|ASE18275.1|491727_492381_-|rubrerythrin MKKKFICTVCGYIHEGTEAPAECPVCHAKAAKFKEFNPEALKGTKTEQNLKNAFAGESQAHTKYLYYASKAKKDGYEQIAGFFEETARNEKEHAKIWFKFLHEGDIPTTTQNLADAAAGENYEWTDMYEQMAKDAMEEGFPELAVKFRSVGKVEKHHEERYRKLLKNIEDSVVFSREGDCIWQCRNCGHIVIGKKAPAVCPVCNHPQSFFQVEESNY >CP022041.2|ASE18274.1|489890_490322_-|hypothetical-protein MNTLKISIITCFSILYSLTSFAEVKSNVSIEIGSFDSIPSEIDGGCCVFYKYPNKMRNKSYIMVNDLATTAYMMINRHLEEFTLVSNQKDIFWYKNKRFTLKVTINHTQSKGDNECYKVKGILIVEDRNKNQRKLSFCGNCSW >CP022041.2|ASE18273.1|488952_489396_+|hypothetical-protein MKNGDKIEFAIVGMQEDSYKVNYDIDFSKLNQEELEFQIEQRINVSAEPENIIISMRVHLMNGAEEIAMQGVRAIFKVKPFNSFVNDMQEDDLKVSNPALIDTFISVCIGAIRGMLVKNLKGTPLDNVVLPLIPMNVIRANSTKRTK >CP022041.2|ASE18283.1|507552_507756_+|hypothetical-protein MKKSYVKPCSTCVIMAVEQAMLAGSKGLRVTNQNLERVDHTIIGSSTPSTPSAGGNAKENPFQYDEE >CP022041.2|ASE18284.1|507783_509490_+|hypothetical-protein MNFKKIYGLILGVIVAGLTSCSSDLNNEEKAPVGPNETAVRSLSIATGDAKTRSEVKIDADNKWVTGDRFMAFNRTFTGSSSESRYGVLTASSTGTRTTLDGVIACKDNDELGIFYPGSYVTGFDQGKMPVVMTASYINDNKGQDGSKENLKYFDYSYGKGKVTVNGASASGSVDMKKLYSVLELDFTAGGVKLTNIKKLVLSNVLTEAVYNIQSNQLESLETGKIEVNSPVALEKVYVAILPQNHFSPTFEVYTTDNKSYRFAVSTPNFNLVAAKVYPFTVQVKEFTPNPPYIEIGGVKWGKYNLQYSTGTKVNGWVDGYHLAENPWDYYMYTPSKITEPLSDLQMSLPSYNPNDVKFDHFRWGDIEYAYDYTKTGQFWTERRDIQGVISSDKKHGDLAAYASNNKWKLPSATDFNNLMKATAEYLGYFIDDNGNKVYGVLFDPNVAEGLKGKVLDKNNKVLGSSNTAAIINAGKNLRQFVKSDFDNALFFPMAGIYYTYTGAIDKPGSQGGYWTSTSNPSNNNAAAFMPQMMSNSLGVYQGFSGTTQKALINKNNMHSIRPIYVGQ >CP022041.2|ASE18285.1|510145_513265_+|TonB-dependent-receptor MSNFMKVSQSRRKHPPFVSGRLAFSFALGLMAFAPSPVLANVDANTSMSVQQQKQSINGVVKDANGDPVIGASILANGTPVGVTDMDGRFSVSVAPGTELKISYVGFATQSVMVRSGVTNYNITLKDENSALSEVVVVGYGTQKKANLSGSVAQLDSKALENRPISNVSSGLQGLLPGITVTGADGAPGLDNGSILVRGVGTLNSASPYILIDGVEAGTLNSLDPEDIASISVLKDASSAAIYGSKASNGVILVTTKRGQNGAPKVSYSGYFGIQNATALMERMNSADAAYYYNKALERSGKAARFSDEAIKKFRDGSDPYNYPNTDWYDLAFKTAWQNRHSVNITGGNEYVKYLASAGYLKQSSILPNAGREQFNGRANLDMVLSKRITAHLNLAYIQNNYRDASSAYAGGSSDQIIRQLNIIAPWIVYKYEDGTYGTVSDGNPMAWLESGMTVNRNNRNFTGMIGLDYQILKDLKLTLQGAYVDASQRYSYFQKFIQYNPNKASDPNKLEIAHYDWHRTTFDAFLNYDKSFAKHNFKAMLGWHTERYKYLPDWMYRKNFPNNELTDMNAGDASTQQNAGNTRELSMVSYFGRLNYDYAGRYLFEANFRSDASSRFAEAHRWGFFPSFSAAWRISEEPFMESSKSWLNNLKLRASWGQLGNQDALNDYYPWMNTYNLNAKYPFGGQLTPGYYQGSYHLETISWERSTTWGVGLDFTLFGGLTGSLDYYNRKTTGIIMNVSAPAEFALGAYKDNIGALRNQGVELSLAYAKQLNKDWTINVGANFAYNKNKILNLGEGTEYIGSGNRRTAVGQQYNSFFMYKATGKFFNSQQEADDYTAKYGNPFGRKFMAGDLIYEDTNGDDKLDSNDRIYTKHTDIPAITYGFNLGATWKNIDLSMIWQGVGAVSHIYNREVLGEFSGDASHPSTLWKDSWTDDNHNAKLPRVFETGNSPSDMTRAMSTFWLWNTAYLRLKTLQLGYTLPKSALKAIGLEKVRIYYAGENLLTFDALPFNIDPEVTSERGSSYPLLRSHSIGINITF >CP022041.2|ASE18286.1|513287_514901_+|RagB/SusD-family-nutrient-uptake-outer-membrane-protein MKTVRTYILAGVAAFALTSCNDYLTTVPKDAMSPSTTWKTGDDAEKFLVGCYDGWEDGGALLYWDAGSDFAYNNFPWEGFTNIGNGSLSPSSPGWSFYDYTIIGRCNTFLENVDKCVFSSDAVKKDLVAQVKAIRAYNYFRMGFLYGGVPIVKPFTSAQEARVPRNTEQEVKDLVFKDLDEAIADINTSPAARGRIAKGAALAMKMRAALYWGDYQKAKDAAQAIIDLGKYELDPDYTNLFKLAGVDSKEIILAVQYKSGTRPLGTIGQLYNNGDGGWSSVVPTQKCVDNYEMSNGMTITEAGSGYDATHPFHGRDPRMAMTILYPGCDWEGTIFNTLDENVNGKKNPNYPTNAANSSKTALTWRKYLDPKTQYADVWDTEACPIVFRYAEVLLTWAEAENELNGPSANVYAMIDKVRTRVGMPAVDQSKYNTKDKLRELIRRERGSEFAGEGLRRADILRWTSNGKMVAETVLNGPLNRITGTINTSATDPTMRAVVSGSSKVEDRTFQTFNRYLPIPQWNISDNPKLEQNPGYAK >CP022041.2|ASE18287.1|515849_517628_+|M6-family-metalloprotease-domain-containing-protein MKKIITLLVSVLLATSSFAIPAMRMWRSFKQADGTILKVMTVGDEHFNYALTEDNIPVLPHNGSYYYARIEDNQLVPSSVLAHDKALRKGKEELVAAAIQQVRQLQKQHEMHVNSKPFGEGLGMTWEGKKKGLVILVEFEDVAFKDPKNVLTLKPREKDVKTLYENMLNKVGYTNDNGAIGSVHDYFLDQSNGKFDLTFDVVGPVKLKHPHQFYGERTANMNDANAPQMIIDACNAIQGQVDFSKYDWDDDGEVEQVYVIYAGEGEATGGESSTIWPHKYSLTDAGLDALTFNGQTINTYACSNEIIRAKVNEKSRIYYSGIGTICHEFSHCLGLPDFYDTRGGSNIGSGRYDLMCGGSYNGGPESLINVYGGTGIGTVPAGYDAYEKAYMGWLKPITLGDEAVEVKNMKGLAEGGDAYFLYNPDTKNEYYIFENRTPHRWDAELPGHGLMVFHVDFDAYSWRMNNLNAASAQRHPRFTIVSADGRLDHDTQNSDPFPTDLNNSLTKSTDPRLSFYTNYNVSSQAGVKQIVRNNDNTISFHFTPLKAATGINNLSADHEQLAETYTLSGVKVADNQNLHNQIVIVKGKKVRK >CP022041.2|ASE18289.1|518931_519450_-|N-acetyltransferase MEKKQYVEVRLRAMEPEDLDMLYHIENDRSLWNISATNVPYSRYALHNYIADAKNDIYIDGQLRMMIENREQEIVGVIDLVNFDPKHQRAEMGIIIMKPFRQKGYAKAAISALIDYTRNGLHLKQIYAVVDVDNEVSIRCLSSIGFTNGSILKEWLYCDGQYKDARVMQLFI >CP022041.2|ASE18290.1|519453_520656_-|glycosyltransferase-family-2-protein MKLSVVIVNYNVKYYLQQCLESLQRALKGVEAEVFVVDNHSHDGSVAYLRSRFPDVHFIASAHNLGFAGGNNIAIRQSKGEYVLLLNPDTVVGEEVIHASIDFMDSHLTAGGHGVQMLTHCGERALESRRGLPSPMVSFYKMVGLCKHFPQSGRFAHYYMGSLSWDVPGKIEVISGAYCFLRRTALDKVGLLDEDFFMYGEDVDLSYRLLKGGFENWYLPVRILHYKGESTQKSSFRYVHVFYDAMLIFFRKHYGGMNVLWRLPIKTAIYVKAFGSLIGTTIRATRKKLGFRTSKAKSFPHYIFVAGEDVMGKCQRLATDNALVAEYRVADKDSLQHMHADLLKEFGGKSRAYCIVYDTDLFSYQDILNVFAEQPKQNIHIGFYHRKENRVVTMMEVIGD >CP022041.2|ASE18291.1|520656_521109_-|hypothetical-protein MDKTRKILLIEFFGSCLITLLIIAVYELELILPGAWADVESSNMVTVQFLMQLLTLATIPLALFLFKIGYVHSDLHTDESHVSRKLLFWGSVRIMMLCVPMILNTFFYYAFGDSVSFFYLAVILALSLFFVFPNKKRCEHECSMDNSEQA >CP022041.2|ASE18292.1|521116_521725_-|recombination-protein-RecR MQQYPSQLLERAVEAFSQLPGVGRKTALRLVLHLLRQSTEDVDSFADAVIRVKHDVKYCKVCHNISDNEVCSICSDPRRDASVVCVVENIQDVMAIENTQQFHGLYHVLGGIISPMDGIGPHDLEIESLVERVEEGTVKEIILALASTMEGDTTNFYISRKLKDTGVKLSVIARGISVGDELEYTDEVTLGRSILNRTPFES >CP022041.2|ASE18293.1|521908_522388_-|hypothetical-protein MLLVAAMVVSISASAQFQEGKGYLGASLTGLDLHYNGHDGMNIGVQAKAGYFPWDNLMVLATFDAVHNGSEAVADHISVGVGGRYYITQNGLYLGAGVKLLHANHNYNDLMPGVEVGYAFFINRSVTIEPALYYDQSFKTHNYSTVGLKVGLGIYLFDD |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP022041_7 | 509810-509892 | Orphan |
NA
Consensus repeat of CP022041_7
|
1 spacers
spacers of CP022041_7
>7.1|509834|35|CP022041|CRISPRCasFinder ACGGGGGGTGGGGCCAGTTGCTTTTGTTACATTTT |
CRISPR arrays and Neighbor proteins around CP022041_7
The CRISPR arrays of CP022041_7 >merge|CP022041|7|509810-509892|CRISPRCasFinder ACTCCCCTCCCTATGGGGGAGGGGACGGGGGGTGGGGCCAGTTGCTTTTGTTACATTTTACTCCCCTCCCTATGGGGGAGGGG >CP022041|7|6|509810-509892|CRISPRCasFinder ACTCCCCTCCCTATGGGGGAGGGG ACGGGGGGTGGGGCCAGTTGCTTTTGTTACATTTT ACTCCCCTCCCTATGGGGGAGGGG
>CP022041.2|ASE18284.1|507783_509490_+|hypothetical-protein MNFKKIYGLILGVIVAGLTSCSSDLNNEEKAPVGPNETAVRSLSIATGDAKTRSEVKIDADNKWVTGDRFMAFNRTFTGSSSESRYGVLTASSTGTRTTLDGVIACKDNDELGIFYPGSYVTGFDQGKMPVVMTASYINDNKGQDGSKENLKYFDYSYGKGKVTVNGASASGSVDMKKLYSVLELDFTAGGVKLTNIKKLVLSNVLTEAVYNIQSNQLESLETGKIEVNSPVALEKVYVAILPQNHFSPTFEVYTTDNKSYRFAVSTPNFNLVAAKVYPFTVQVKEFTPNPPYIEIGGVKWGKYNLQYSTGTKVNGWVDGYHLAENPWDYYMYTPSKITEPLSDLQMSLPSYNPNDVKFDHFRWGDIEYAYDYTKTGQFWTERRDIQGVISSDKKHGDLAAYASNNKWKLPSATDFNNLMKATAEYLGYFIDDNGNKVYGVLFDPNVAEGLKGKVLDKNNKVLGSSNTAAIINAGKNLRQFVKSDFDNALFFPMAGIYYTYTGAIDKPGSQGGYWTSTSNPSNNNAAAFMPQMMSNSLGVYQGFSGTTQKALINKNNMHSIRPIYVGQ >CP022041.2|ASE18283.1|507552_507756_+|hypothetical-protein MKKSYVKPCSTCVIMAVEQAMLAGSKGLRVTNQNLERVDHTIIGSSTPSTPSAGGNAKENPFQYDEE >CP022041.2|ASE18282.1|506532_506802_+|hypothetical-protein MEESPSAYRWFGYMFVWMLACLLILDKGVSSELFLFILLLVAIVLNAYCAYKFALGKGTFLAILAFVVAMVLDFFPIVAYFVIIEIFMA >CP022041.2|ASE18281.1|505443_506262_-|KilA-N-domain-containing-protein MTKKGEIFVKDVAIKTMTKDGIDYICITDIARQKNTAEPKDVVKNWMRQKNTLEYLGLWEKLNNPNFKGVEFDPLLAEAGSNSFTMSPTRWVELTAAIGVFTKNGAGGGTFAQRDIAFKFANWVSVEFELYLVMEFQRLKAKEQELIGWTAKRELSKINYRIHTDAIKSHLIPEKVTPTQANIIYAEEADVLNVAMFGMTARQWRESNPDLKGNIRDYASVNELICLANMENINAVLINDDIPQGERLVRLNQIAIHQMQILERNSNRNLLR >CP022041.2|ASE18280.1|503476_505174_-|energy-dependent-translational-throttle-protein-EttA MATVDDKKIIFSMVGVSKIIPQNQKQILKNIYLSFFYGAKIGIIGLNGAGKSTLMKIIAGLVEPTQGEVVWSPGYSVGYLPQDPPLDEAKTVKENVMEGVQHIYDALKEYDDINVKFGLEEYYSDADKMDKLMQRQAELQDIIDATDAWNIDSRLERAMDALRCPKGDLPVTNLSGGERRRVALCRLLLQKPDVLLLDEPTNHLDAESIDWLEQHLQQYEGTVIAVTHDRYFLDDVSEWILELDRGEGIPWKGNYSSWLDQKTKRMEQEEKSASKRRKTLERELEWVRMAPKARQAKGKARLNSYEQMLNEEQKQREEKLEIFIPNGPRLGNKVIEAQHVKKAFGEKVLFNDLNFMLPPNGIVGVIGPNGAGKTTLFRLIMGLEQADGGTFEVGETVKLAYVDQQHKDIDPNKTVYDVVSQGNETIRMGGRDINSRAYLSRFNFSGTDQSKLCSVLSGGERNRLQLAMALKQEGNVLLLDEPTNDIDVNTLRALEEGLEAFAGCAVVISHDRWFLDRICTHILAFEGNGEVVYFEGGFSDYEINKARRLGNEEIKKGRYRKLMEE >CP022041.2|ASE18279.1|502367_503213_-|hypothetical-protein MFVLFFLCFSIYGKAQTAPKKLTAPSPERFQAINDSILAEGMWLYTYEKLAWRATDSLMKYNVKREEINSATAFEEGNLTWRYIFANLDKEQTVFELTLHLSNDTSFYVCSATPRKLKPTEIEQLKAKQIAPRKVIKEKGDSIFLANTSGLNWDLLPLEKGGYRLYLLHGTTKHGVIPIGDDYAFDFDKDMNILSWRRFHRSFLEQPITMNGEEITEVIHSHTPMTPYFTTTDIANYMLYGCDLYGIKRFSVLSTAFADTSYLTTFDVEKMKLTSTVYTVK >CP022041.2|ASE18278.1|501130_501901_-|NYN-domain-containing-protein MEKKLGTYLLILYFCSIKSIALLSCRVSATSEWLFLFNLHTMPKRVTFYFDGFNFYFALKRKKKISPEWKDFYWLDLVAFCESFLGPDQVLEKVIYFTASPLSPQKNSRQSAFLNANRILHSDKFEVIRGKYMSKQIECPYCKYSISKPEEKRTDVNISVRMMADCVQDKTDVIVLISADTDLIPPLNFIHTNYPNKKVKVFFPPGSHALELHNHLRTFHSKWVFLEKNERRFRNAVMPHTITVGNQSVSIPEKWK >CP022041.2|ASE18277.1|497629_500824_+|hypothetical-protein MLFERFLRNKSLMAAMALTAVMPSVAKASILSAAPTAEGENQQLKAGVYFIVNDKRQAGTDLHVYVDESGLHGKNYTSLAADYSNAFLLHAKGDKYTIQSLKDGKYVQNVNGNSVPYKTGNDAHKFQIVYQSQSSGTGKSYFNIYNDNVGGNQFCWHLDAQRNVVRWYPLTDNGRIALGPSEFRLDPVTSLSKQQVLDRLAELTKIVDPRKDLNKYYQIVSDTYGRAMREDYIVGELSTGGFVDTDYSYCWKLVKLGSGRYAFQNAVTGKYIAQQNGQTSRYYTTSEEQGNGFEFNLNESDPYVLTFEMVDAYNVGIHCAESQGYHPVGWYVNNEANKWVFKVATIDQAKLKEQQDAYKARVDLTRNVDRYATAVAKYFTNSAATEVTAATKAMTDEALKAQMTTEGVPQGLQEVVLKIKNQSWTVYPSGRNWEKQFRIADYKVYSENNYGAWARSMGIGYDYGSMTNPTGVTARDGEDLFVFLGDDIPQDATVQIELVPLGTRSAGKYHNLKKGLNIILNQGENNVFVNYIGRTFNNGKYLRDYKPMNIHIEGGKVNGYFDLTKGNTNEDWQKMQSDGLVWAKAFNMKGELVVMNMPSQACKDYTPVHMKELVEIWNSIVQREDDLMGFRAAKRDKCNNVLNATAVDHGYMYATTGGTYYNYNTLADVLNYDKMKWGNGTLWGPAHEFGHNHQQLFNTAGMTEISVNMYSNMVMFTSGRVTSRSEHCNYTDVDGQEHRGVCESAVSTYADRFANKKMWFEYGTWGTTQMYYKLYMMFHSTGLDDQFWHKCLDYLRTHRLEGQGTANCQGQNDYLLFAKACCVAANQDLSSFFEAWGHFYDVNGSVIGDYSNTTMYTTRAQWVEAKKFMQQFPKGPANNMIFIDDHIRPTPAIYPGAAPGTMREDFNGAVRVGTMGDFGSWDQFCPDSLGQGYAIIKTQSDANGRRTYTMEAKNSHVVGFKIYDSKGNLIYFANTKTFTIPKKVMEDAQNNIVIKVCGSDGSEVDPGTVPLGIKTFSAQANGGKVDVYSIYGVLVRSKVNPETALEGLPNGVYVVGGKKVVVGK >CP022041.2|ASE18276.1|493180_496975_+|DUF5110-domain-containing-protein MKKKKAKILLAALLLAVGETAFADGKAVSGIRQINPTTVEITYSDGGVMTVDFYGQNVFRLFRDPKGGIVRDPVSNPPARILLDNPRKNAGRLSVNATDADVAVATAEVRVRFDRNTGLMTVTDLRNGKEVIKEVKGVDFQKNRTTVTLANAAGEYFYGGGVQNGRFSHRGKRIEIVNTNSWTDGGVASPAPFYWSTGGYAAMPYTFAPGAYDFGSTDKNTVTISHDMPYLDLFLMVDTTPVSLLQDYYQLTGNPVLLPKFAFYEGHLNAYNRDYWKETTEEGKGILFEDGKRYVESQKDNGGIKESLNGEKQNYQFSARAVIDRYKKDDMPLGWILPNDGYGAGYGQTTTLDGNIANLKSLGDYARKNGVEIGLWTQSNLHPVDSIPALLQRDIVKEVRDAGVRVLKTDVAWVGAGYSFGLNGIADVANIMPYYGSDARPFIITLDGWAGTQRYGGVWSGDQTGGEWEYIRFHIPTYIGSGLSGMGNITSDMDGIFGGKNLPVNIRDFQWKTFTPMQLNMDGWGSNPKYPQALGEPATSINRSYLKLKSILLPYTYSCAHEAVTGKPLMRAMFLDDSNDYTHSSATRYQYMYGPSFLVAPVYQNTAADKEGNDVRNGIYLPKGTWYDYFTGATYEGGCILNDYFAPIWKLPVLVKSGAIIPMVNPNNNPSEIDKNRRVFELYPDGKTEFTLYDDDGTTQKYLANEKATTRITSDLNDKQVLTVSIDKTEGSFDGMVKNQSTTFYLNTNAKPKKLTAVIGGKKIRLTEAEQGDNTWQYVQASNINRFSTANSEMERLLVTKNAQIIVRLSSCDITKEAVELRVEGFVRLNKSNETLRKKGALTAPELLEADIQSYSVTPKWKPVQNADYYEIAFNGQTYTTIRHNSLLFDDLQPATDYDFKVRAVNSEGASEWTPLHVKTAVNPLEYAIQGLTATSTARDMEGFEIHRLVDFSTTGDIWHTYYYTKAVPFDFTVDLHSTNTLDKLQYVPRANGGNGTITKCDIAVSKDGRNWTEIGPQQWARDGRTKEVTLSTHPVARYVKVSVKEAVGNFGSGREFYVFKVPGTKTILPGDINLDGKVDENDFTSYMNYTGLKKGDADFDGYISGGDINGNGLIDAYDISNVAMHLEDGWNDDDVAPVGGKVYYEYNRKSYAAGDDVIIKVKGKDLQSVNAFNLIFPYSPKELQFVKVETDPSMVMRNLTYDRHHSDGSQVLYPTFVNVGDHHTINGDADLLVIRMKALKPFTVKNTTAKGLLVDKQLREVEL >CP022041.2|ASE18275.1|491727_492381_-|rubrerythrin MKKKFICTVCGYIHEGTEAPAECPVCHAKAAKFKEFNPEALKGTKTEQNLKNAFAGESQAHTKYLYYASKAKKDGYEQIAGFFEETARNEKEHAKIWFKFLHEGDIPTTTQNLADAAAGENYEWTDMYEQMAKDAMEEGFPELAVKFRSVGKVEKHHEERYRKLLKNIEDSVVFSREGDCIWQCRNCGHIVIGKKAPAVCPVCNHPQSFFQVEESNY >CP022041.2|ASE18285.1|510145_513265_+|TonB-dependent-receptor MSNFMKVSQSRRKHPPFVSGRLAFSFALGLMAFAPSPVLANVDANTSMSVQQQKQSINGVVKDANGDPVIGASILANGTPVGVTDMDGRFSVSVAPGTELKISYVGFATQSVMVRSGVTNYNITLKDENSALSEVVVVGYGTQKKANLSGSVAQLDSKALENRPISNVSSGLQGLLPGITVTGADGAPGLDNGSILVRGVGTLNSASPYILIDGVEAGTLNSLDPEDIASISVLKDASSAAIYGSKASNGVILVTTKRGQNGAPKVSYSGYFGIQNATALMERMNSADAAYYYNKALERSGKAARFSDEAIKKFRDGSDPYNYPNTDWYDLAFKTAWQNRHSVNITGGNEYVKYLASAGYLKQSSILPNAGREQFNGRANLDMVLSKRITAHLNLAYIQNNYRDASSAYAGGSSDQIIRQLNIIAPWIVYKYEDGTYGTVSDGNPMAWLESGMTVNRNNRNFTGMIGLDYQILKDLKLTLQGAYVDASQRYSYFQKFIQYNPNKASDPNKLEIAHYDWHRTTFDAFLNYDKSFAKHNFKAMLGWHTERYKYLPDWMYRKNFPNNELTDMNAGDASTQQNAGNTRELSMVSYFGRLNYDYAGRYLFEANFRSDASSRFAEAHRWGFFPSFSAAWRISEEPFMESSKSWLNNLKLRASWGQLGNQDALNDYYPWMNTYNLNAKYPFGGQLTPGYYQGSYHLETISWERSTTWGVGLDFTLFGGLTGSLDYYNRKTTGIIMNVSAPAEFALGAYKDNIGALRNQGVELSLAYAKQLNKDWTINVGANFAYNKNKILNLGEGTEYIGSGNRRTAVGQQYNSFFMYKATGKFFNSQQEADDYTAKYGNPFGRKFMAGDLIYEDTNGDDKLDSNDRIYTKHTDIPAITYGFNLGATWKNIDLSMIWQGVGAVSHIYNREVLGEFSGDASHPSTLWKDSWTDDNHNAKLPRVFETGNSPSDMTRAMSTFWLWNTAYLRLKTLQLGYTLPKSALKAIGLEKVRIYYAGENLLTFDALPFNIDPEVTSERGSSYPLLRSHSIGINITF >CP022041.2|ASE18286.1|513287_514901_+|RagB/SusD-family-nutrient-uptake-outer-membrane-protein MKTVRTYILAGVAAFALTSCNDYLTTVPKDAMSPSTTWKTGDDAEKFLVGCYDGWEDGGALLYWDAGSDFAYNNFPWEGFTNIGNGSLSPSSPGWSFYDYTIIGRCNTFLENVDKCVFSSDAVKKDLVAQVKAIRAYNYFRMGFLYGGVPIVKPFTSAQEARVPRNTEQEVKDLVFKDLDEAIADINTSPAARGRIAKGAALAMKMRAALYWGDYQKAKDAAQAIIDLGKYELDPDYTNLFKLAGVDSKEIILAVQYKSGTRPLGTIGQLYNNGDGGWSSVVPTQKCVDNYEMSNGMTITEAGSGYDATHPFHGRDPRMAMTILYPGCDWEGTIFNTLDENVNGKKNPNYPTNAANSSKTALTWRKYLDPKTQYADVWDTEACPIVFRYAEVLLTWAEAENELNGPSANVYAMIDKVRTRVGMPAVDQSKYNTKDKLRELIRRERGSEFAGEGLRRADILRWTSNGKMVAETVLNGPLNRITGTINTSATDPTMRAVVSGSSKVEDRTFQTFNRYLPIPQWNISDNPKLEQNPGYAK >CP022041.2|ASE18287.1|515849_517628_+|M6-family-metalloprotease-domain-containing-protein MKKIITLLVSVLLATSSFAIPAMRMWRSFKQADGTILKVMTVGDEHFNYALTEDNIPVLPHNGSYYYARIEDNQLVPSSVLAHDKALRKGKEELVAAAIQQVRQLQKQHEMHVNSKPFGEGLGMTWEGKKKGLVILVEFEDVAFKDPKNVLTLKPREKDVKTLYENMLNKVGYTNDNGAIGSVHDYFLDQSNGKFDLTFDVVGPVKLKHPHQFYGERTANMNDANAPQMIIDACNAIQGQVDFSKYDWDDDGEVEQVYVIYAGEGEATGGESSTIWPHKYSLTDAGLDALTFNGQTINTYACSNEIIRAKVNEKSRIYYSGIGTICHEFSHCLGLPDFYDTRGGSNIGSGRYDLMCGGSYNGGPESLINVYGGTGIGTVPAGYDAYEKAYMGWLKPITLGDEAVEVKNMKGLAEGGDAYFLYNPDTKNEYYIFENRTPHRWDAELPGHGLMVFHVDFDAYSWRMNNLNAASAQRHPRFTIVSADGRLDHDTQNSDPFPTDLNNSLTKSTDPRLSFYTNYNVSSQAGVKQIVRNNDNTISFHFTPLKAATGINNLSADHEQLAETYTLSGVKVADNQNLHNQIVIVKGKKVRK >CP022041.2|ASE18289.1|518931_519450_-|N-acetyltransferase MEKKQYVEVRLRAMEPEDLDMLYHIENDRSLWNISATNVPYSRYALHNYIADAKNDIYIDGQLRMMIENREQEIVGVIDLVNFDPKHQRAEMGIIIMKPFRQKGYAKAAISALIDYTRNGLHLKQIYAVVDVDNEVSIRCLSSIGFTNGSILKEWLYCDGQYKDARVMQLFI >CP022041.2|ASE18290.1|519453_520656_-|glycosyltransferase-family-2-protein MKLSVVIVNYNVKYYLQQCLESLQRALKGVEAEVFVVDNHSHDGSVAYLRSRFPDVHFIASAHNLGFAGGNNIAIRQSKGEYVLLLNPDTVVGEEVIHASIDFMDSHLTAGGHGVQMLTHCGERALESRRGLPSPMVSFYKMVGLCKHFPQSGRFAHYYMGSLSWDVPGKIEVISGAYCFLRRTALDKVGLLDEDFFMYGEDVDLSYRLLKGGFENWYLPVRILHYKGESTQKSSFRYVHVFYDAMLIFFRKHYGGMNVLWRLPIKTAIYVKAFGSLIGTTIRATRKKLGFRTSKAKSFPHYIFVAGEDVMGKCQRLATDNALVAEYRVADKDSLQHMHADLLKEFGGKSRAYCIVYDTDLFSYQDILNVFAEQPKQNIHIGFYHRKENRVVTMMEVIGD >CP022041.2|ASE18291.1|520656_521109_-|hypothetical-protein MDKTRKILLIEFFGSCLITLLIIAVYELELILPGAWADVESSNMVTVQFLMQLLTLATIPLALFLFKIGYVHSDLHTDESHVSRKLLFWGSVRIMMLCVPMILNTFFYYAFGDSVSFFYLAVILALSLFFVFPNKKRCEHECSMDNSEQA >CP022041.2|ASE18292.1|521116_521725_-|recombination-protein-RecR MQQYPSQLLERAVEAFSQLPGVGRKTALRLVLHLLRQSTEDVDSFADAVIRVKHDVKYCKVCHNISDNEVCSICSDPRRDASVVCVVENIQDVMAIENTQQFHGLYHVLGGIISPMDGIGPHDLEIESLVERVEEGTVKEIILALASTMEGDTTNFYISRKLKDTGVKLSVIARGISVGDELEYTDEVTLGRSILNRTPFES >CP022041.2|ASE18293.1|521908_522388_-|hypothetical-protein MLLVAAMVVSISASAQFQEGKGYLGASLTGLDLHYNGHDGMNIGVQAKAGYFPWDNLMVLATFDAVHNGSEAVADHISVGVGGRYYITQNGLYLGAGVKLLHANHNYNDLMPGVEVGYAFFINRSVTIEPALYYDQSFKTHNYSTVGLKVGLGIYLFDD >CP022041.2|ASE18294.1|522803_523061_-|acyl-phosphate-glycerol-3-phosphate-acyltransferase MKRIKIIRVLATFICHDPFAYSPIWTWDGFPPIIYTERERILPVLKEWEHKGYLTLIYDEKIAFILNVEKLPSKEKLIEESRNIK >CP022041.2|ASE18906.1|523116_523323_-|peptidase MSPIVDWNLLDVLNKNIRDNYERIRPILLKWQENGYIKLIEDNEIAFSFIPEKLPSKEKLIEESLNFK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP022041_8 | 515345-515467 | Orphan |
NA
Consensus repeat of CP022041_8
|
1 spacers
spacers of CP022041_8
>8.1|515392|29|CP022041|CRISPRCasFinder AGTAACAGATGAACAGGTGTTTAGTGGAC |
CRISPR arrays and Neighbor proteins around CP022041_8
The CRISPR arrays of CP022041_8 >merge|CP022041|8|515345-515467|CRISPRCasFinder GAGTGAACGAGTGGACAAGTGAACAAGTTACTTGCAAGAGAAGACAAAGTAACAGATGAACAGGTGTTTAGTGGACGAGTGAACGAGTGGTCAAGTGAACAAGTTACTTGCAAGAGAAGACAA >CP022041|8|7|515345-515467|CRISPRCasFinder GAGTGAACGAGTGGACAAGTGAACAAGTTACTTGCAAGAGAAGACAA AGTAACAGATGAACAGGTGTTTAGTGGAC GAGTGAACGAGTGGTCAAGTGAACAAGTTACTTGCAAGAGAAGACAA
>CP022041.2|ASE18286.1|513287_514901_+|RagB/SusD-family-nutrient-uptake-outer-membrane-protein MKTVRTYILAGVAAFALTSCNDYLTTVPKDAMSPSTTWKTGDDAEKFLVGCYDGWEDGGALLYWDAGSDFAYNNFPWEGFTNIGNGSLSPSSPGWSFYDYTIIGRCNTFLENVDKCVFSSDAVKKDLVAQVKAIRAYNYFRMGFLYGGVPIVKPFTSAQEARVPRNTEQEVKDLVFKDLDEAIADINTSPAARGRIAKGAALAMKMRAALYWGDYQKAKDAAQAIIDLGKYELDPDYTNLFKLAGVDSKEIILAVQYKSGTRPLGTIGQLYNNGDGGWSSVVPTQKCVDNYEMSNGMTITEAGSGYDATHPFHGRDPRMAMTILYPGCDWEGTIFNTLDENVNGKKNPNYPTNAANSSKTALTWRKYLDPKTQYADVWDTEACPIVFRYAEVLLTWAEAENELNGPSANVYAMIDKVRTRVGMPAVDQSKYNTKDKLRELIRRERGSEFAGEGLRRADILRWTSNGKMVAETVLNGPLNRITGTINTSATDPTMRAVVSGSSKVEDRTFQTFNRYLPIPQWNISDNPKLEQNPGYAK >CP022041.2|ASE18285.1|510145_513265_+|TonB-dependent-receptor MSNFMKVSQSRRKHPPFVSGRLAFSFALGLMAFAPSPVLANVDANTSMSVQQQKQSINGVVKDANGDPVIGASILANGTPVGVTDMDGRFSVSVAPGTELKISYVGFATQSVMVRSGVTNYNITLKDENSALSEVVVVGYGTQKKANLSGSVAQLDSKALENRPISNVSSGLQGLLPGITVTGADGAPGLDNGSILVRGVGTLNSASPYILIDGVEAGTLNSLDPEDIASISVLKDASSAAIYGSKASNGVILVTTKRGQNGAPKVSYSGYFGIQNATALMERMNSADAAYYYNKALERSGKAARFSDEAIKKFRDGSDPYNYPNTDWYDLAFKTAWQNRHSVNITGGNEYVKYLASAGYLKQSSILPNAGREQFNGRANLDMVLSKRITAHLNLAYIQNNYRDASSAYAGGSSDQIIRQLNIIAPWIVYKYEDGTYGTVSDGNPMAWLESGMTVNRNNRNFTGMIGLDYQILKDLKLTLQGAYVDASQRYSYFQKFIQYNPNKASDPNKLEIAHYDWHRTTFDAFLNYDKSFAKHNFKAMLGWHTERYKYLPDWMYRKNFPNNELTDMNAGDASTQQNAGNTRELSMVSYFGRLNYDYAGRYLFEANFRSDASSRFAEAHRWGFFPSFSAAWRISEEPFMESSKSWLNNLKLRASWGQLGNQDALNDYYPWMNTYNLNAKYPFGGQLTPGYYQGSYHLETISWERSTTWGVGLDFTLFGGLTGSLDYYNRKTTGIIMNVSAPAEFALGAYKDNIGALRNQGVELSLAYAKQLNKDWTINVGANFAYNKNKILNLGEGTEYIGSGNRRTAVGQQYNSFFMYKATGKFFNSQQEADDYTAKYGNPFGRKFMAGDLIYEDTNGDDKLDSNDRIYTKHTDIPAITYGFNLGATWKNIDLSMIWQGVGAVSHIYNREVLGEFSGDASHPSTLWKDSWTDDNHNAKLPRVFETGNSPSDMTRAMSTFWLWNTAYLRLKTLQLGYTLPKSALKAIGLEKVRIYYAGENLLTFDALPFNIDPEVTSERGSSYPLLRSHSIGINITF >CP022041.2|ASE18284.1|507783_509490_+|hypothetical-protein MNFKKIYGLILGVIVAGLTSCSSDLNNEEKAPVGPNETAVRSLSIATGDAKTRSEVKIDADNKWVTGDRFMAFNRTFTGSSSESRYGVLTASSTGTRTTLDGVIACKDNDELGIFYPGSYVTGFDQGKMPVVMTASYINDNKGQDGSKENLKYFDYSYGKGKVTVNGASASGSVDMKKLYSVLELDFTAGGVKLTNIKKLVLSNVLTEAVYNIQSNQLESLETGKIEVNSPVALEKVYVAILPQNHFSPTFEVYTTDNKSYRFAVSTPNFNLVAAKVYPFTVQVKEFTPNPPYIEIGGVKWGKYNLQYSTGTKVNGWVDGYHLAENPWDYYMYTPSKITEPLSDLQMSLPSYNPNDVKFDHFRWGDIEYAYDYTKTGQFWTERRDIQGVISSDKKHGDLAAYASNNKWKLPSATDFNNLMKATAEYLGYFIDDNGNKVYGVLFDPNVAEGLKGKVLDKNNKVLGSSNTAAIINAGKNLRQFVKSDFDNALFFPMAGIYYTYTGAIDKPGSQGGYWTSTSNPSNNNAAAFMPQMMSNSLGVYQGFSGTTQKALINKNNMHSIRPIYVGQ >CP022041.2|ASE18283.1|507552_507756_+|hypothetical-protein MKKSYVKPCSTCVIMAVEQAMLAGSKGLRVTNQNLERVDHTIIGSSTPSTPSAGGNAKENPFQYDEE >CP022041.2|ASE18282.1|506532_506802_+|hypothetical-protein MEESPSAYRWFGYMFVWMLACLLILDKGVSSELFLFILLLVAIVLNAYCAYKFALGKGTFLAILAFVVAMVLDFFPIVAYFVIIEIFMA >CP022041.2|ASE18281.1|505443_506262_-|KilA-N-domain-containing-protein MTKKGEIFVKDVAIKTMTKDGIDYICITDIARQKNTAEPKDVVKNWMRQKNTLEYLGLWEKLNNPNFKGVEFDPLLAEAGSNSFTMSPTRWVELTAAIGVFTKNGAGGGTFAQRDIAFKFANWVSVEFELYLVMEFQRLKAKEQELIGWTAKRELSKINYRIHTDAIKSHLIPEKVTPTQANIIYAEEADVLNVAMFGMTARQWRESNPDLKGNIRDYASVNELICLANMENINAVLINDDIPQGERLVRLNQIAIHQMQILERNSNRNLLR >CP022041.2|ASE18280.1|503476_505174_-|energy-dependent-translational-throttle-protein-EttA MATVDDKKIIFSMVGVSKIIPQNQKQILKNIYLSFFYGAKIGIIGLNGAGKSTLMKIIAGLVEPTQGEVVWSPGYSVGYLPQDPPLDEAKTVKENVMEGVQHIYDALKEYDDINVKFGLEEYYSDADKMDKLMQRQAELQDIIDATDAWNIDSRLERAMDALRCPKGDLPVTNLSGGERRRVALCRLLLQKPDVLLLDEPTNHLDAESIDWLEQHLQQYEGTVIAVTHDRYFLDDVSEWILELDRGEGIPWKGNYSSWLDQKTKRMEQEEKSASKRRKTLERELEWVRMAPKARQAKGKARLNSYEQMLNEEQKQREEKLEIFIPNGPRLGNKVIEAQHVKKAFGEKVLFNDLNFMLPPNGIVGVIGPNGAGKTTLFRLIMGLEQADGGTFEVGETVKLAYVDQQHKDIDPNKTVYDVVSQGNETIRMGGRDINSRAYLSRFNFSGTDQSKLCSVLSGGERNRLQLAMALKQEGNVLLLDEPTNDIDVNTLRALEEGLEAFAGCAVVISHDRWFLDRICTHILAFEGNGEVVYFEGGFSDYEINKARRLGNEEIKKGRYRKLMEE >CP022041.2|ASE18279.1|502367_503213_-|hypothetical-protein MFVLFFLCFSIYGKAQTAPKKLTAPSPERFQAINDSILAEGMWLYTYEKLAWRATDSLMKYNVKREEINSATAFEEGNLTWRYIFANLDKEQTVFELTLHLSNDTSFYVCSATPRKLKPTEIEQLKAKQIAPRKVIKEKGDSIFLANTSGLNWDLLPLEKGGYRLYLLHGTTKHGVIPIGDDYAFDFDKDMNILSWRRFHRSFLEQPITMNGEEITEVIHSHTPMTPYFTTTDIANYMLYGCDLYGIKRFSVLSTAFADTSYLTTFDVEKMKLTSTVYTVK >CP022041.2|ASE18278.1|501130_501901_-|NYN-domain-containing-protein MEKKLGTYLLILYFCSIKSIALLSCRVSATSEWLFLFNLHTMPKRVTFYFDGFNFYFALKRKKKISPEWKDFYWLDLVAFCESFLGPDQVLEKVIYFTASPLSPQKNSRQSAFLNANRILHSDKFEVIRGKYMSKQIECPYCKYSISKPEEKRTDVNISVRMMADCVQDKTDVIVLISADTDLIPPLNFIHTNYPNKKVKVFFPPGSHALELHNHLRTFHSKWVFLEKNERRFRNAVMPHTITVGNQSVSIPEKWK >CP022041.2|ASE18277.1|497629_500824_+|hypothetical-protein MLFERFLRNKSLMAAMALTAVMPSVAKASILSAAPTAEGENQQLKAGVYFIVNDKRQAGTDLHVYVDESGLHGKNYTSLAADYSNAFLLHAKGDKYTIQSLKDGKYVQNVNGNSVPYKTGNDAHKFQIVYQSQSSGTGKSYFNIYNDNVGGNQFCWHLDAQRNVVRWYPLTDNGRIALGPSEFRLDPVTSLSKQQVLDRLAELTKIVDPRKDLNKYYQIVSDTYGRAMREDYIVGELSTGGFVDTDYSYCWKLVKLGSGRYAFQNAVTGKYIAQQNGQTSRYYTTSEEQGNGFEFNLNESDPYVLTFEMVDAYNVGIHCAESQGYHPVGWYVNNEANKWVFKVATIDQAKLKEQQDAYKARVDLTRNVDRYATAVAKYFTNSAATEVTAATKAMTDEALKAQMTTEGVPQGLQEVVLKIKNQSWTVYPSGRNWEKQFRIADYKVYSENNYGAWARSMGIGYDYGSMTNPTGVTARDGEDLFVFLGDDIPQDATVQIELVPLGTRSAGKYHNLKKGLNIILNQGENNVFVNYIGRTFNNGKYLRDYKPMNIHIEGGKVNGYFDLTKGNTNEDWQKMQSDGLVWAKAFNMKGELVVMNMPSQACKDYTPVHMKELVEIWNSIVQREDDLMGFRAAKRDKCNNVLNATAVDHGYMYATTGGTYYNYNTLADVLNYDKMKWGNGTLWGPAHEFGHNHQQLFNTAGMTEISVNMYSNMVMFTSGRVTSRSEHCNYTDVDGQEHRGVCESAVSTYADRFANKKMWFEYGTWGTTQMYYKLYMMFHSTGLDDQFWHKCLDYLRTHRLEGQGTANCQGQNDYLLFAKACCVAANQDLSSFFEAWGHFYDVNGSVIGDYSNTTMYTTRAQWVEAKKFMQQFPKGPANNMIFIDDHIRPTPAIYPGAAPGTMREDFNGAVRVGTMGDFGSWDQFCPDSLGQGYAIIKTQSDANGRRTYTMEAKNSHVVGFKIYDSKGNLIYFANTKTFTIPKKVMEDAQNNIVIKVCGSDGSEVDPGTVPLGIKTFSAQANGGKVDVYSIYGVLVRSKVNPETALEGLPNGVYVVGGKKVVVGK >CP022041.2|ASE18287.1|515849_517628_+|M6-family-metalloprotease-domain-containing-protein MKKIITLLVSVLLATSSFAIPAMRMWRSFKQADGTILKVMTVGDEHFNYALTEDNIPVLPHNGSYYYARIEDNQLVPSSVLAHDKALRKGKEELVAAAIQQVRQLQKQHEMHVNSKPFGEGLGMTWEGKKKGLVILVEFEDVAFKDPKNVLTLKPREKDVKTLYENMLNKVGYTNDNGAIGSVHDYFLDQSNGKFDLTFDVVGPVKLKHPHQFYGERTANMNDANAPQMIIDACNAIQGQVDFSKYDWDDDGEVEQVYVIYAGEGEATGGESSTIWPHKYSLTDAGLDALTFNGQTINTYACSNEIIRAKVNEKSRIYYSGIGTICHEFSHCLGLPDFYDTRGGSNIGSGRYDLMCGGSYNGGPESLINVYGGTGIGTVPAGYDAYEKAYMGWLKPITLGDEAVEVKNMKGLAEGGDAYFLYNPDTKNEYYIFENRTPHRWDAELPGHGLMVFHVDFDAYSWRMNNLNAASAQRHPRFTIVSADGRLDHDTQNSDPFPTDLNNSLTKSTDPRLSFYTNYNVSSQAGVKQIVRNNDNTISFHFTPLKAATGINNLSADHEQLAETYTLSGVKVADNQNLHNQIVIVKGKKVRK >CP022041.2|ASE18289.1|518931_519450_-|N-acetyltransferase MEKKQYVEVRLRAMEPEDLDMLYHIENDRSLWNISATNVPYSRYALHNYIADAKNDIYIDGQLRMMIENREQEIVGVIDLVNFDPKHQRAEMGIIIMKPFRQKGYAKAAISALIDYTRNGLHLKQIYAVVDVDNEVSIRCLSSIGFTNGSILKEWLYCDGQYKDARVMQLFI >CP022041.2|ASE18290.1|519453_520656_-|glycosyltransferase-family-2-protein MKLSVVIVNYNVKYYLQQCLESLQRALKGVEAEVFVVDNHSHDGSVAYLRSRFPDVHFIASAHNLGFAGGNNIAIRQSKGEYVLLLNPDTVVGEEVIHASIDFMDSHLTAGGHGVQMLTHCGERALESRRGLPSPMVSFYKMVGLCKHFPQSGRFAHYYMGSLSWDVPGKIEVISGAYCFLRRTALDKVGLLDEDFFMYGEDVDLSYRLLKGGFENWYLPVRILHYKGESTQKSSFRYVHVFYDAMLIFFRKHYGGMNVLWRLPIKTAIYVKAFGSLIGTTIRATRKKLGFRTSKAKSFPHYIFVAGEDVMGKCQRLATDNALVAEYRVADKDSLQHMHADLLKEFGGKSRAYCIVYDTDLFSYQDILNVFAEQPKQNIHIGFYHRKENRVVTMMEVIGD >CP022041.2|ASE18291.1|520656_521109_-|hypothetical-protein MDKTRKILLIEFFGSCLITLLIIAVYELELILPGAWADVESSNMVTVQFLMQLLTLATIPLALFLFKIGYVHSDLHTDESHVSRKLLFWGSVRIMMLCVPMILNTFFYYAFGDSVSFFYLAVILALSLFFVFPNKKRCEHECSMDNSEQA >CP022041.2|ASE18292.1|521116_521725_-|recombination-protein-RecR MQQYPSQLLERAVEAFSQLPGVGRKTALRLVLHLLRQSTEDVDSFADAVIRVKHDVKYCKVCHNISDNEVCSICSDPRRDASVVCVVENIQDVMAIENTQQFHGLYHVLGGIISPMDGIGPHDLEIESLVERVEEGTVKEIILALASTMEGDTTNFYISRKLKDTGVKLSVIARGISVGDELEYTDEVTLGRSILNRTPFES >CP022041.2|ASE18293.1|521908_522388_-|hypothetical-protein MLLVAAMVVSISASAQFQEGKGYLGASLTGLDLHYNGHDGMNIGVQAKAGYFPWDNLMVLATFDAVHNGSEAVADHISVGVGGRYYITQNGLYLGAGVKLLHANHNYNDLMPGVEVGYAFFINRSVTIEPALYYDQSFKTHNYSTVGLKVGLGIYLFDD >CP022041.2|ASE18294.1|522803_523061_-|acyl-phosphate-glycerol-3-phosphate-acyltransferase MKRIKIIRVLATFICHDPFAYSPIWTWDGFPPIIYTERERILPVLKEWEHKGYLTLIYDEKIAFILNVEKLPSKEKLIEESRNIK >CP022041.2|ASE18906.1|523116_523323_-|peptidase MSPIVDWNLLDVLNKNIRDNYERIRPILLKWQENGYIKLIEDNEIAFSFIPEKLPSKEKLIEESLNFK >CP022041.2|ASE18295.1|524521_526729_-|sodium-translocating-pyrophosphatase MEHIPQVFWLIPIASVCALGMAWYFFKSMMKAEEGTPRMVEIAEYVRRGAMAYLKQQYKVVLIVFVVLAIVFAIMAYGFNAQNEWVPFAFLTGGFFSGLAGFFGMKTATYASARTANAARNGLNDGLKIAFRSGAVMGLVVVGLGLLDIAIWFIVLTWFYSDKMTTSEMLITITTTMLTFGMGASTQALFARVGGGIYTKAADVGADLVGKVEANIPEDDPRNPATIADNVGDNVGDVAGMGADLYESYCGSILSTAALGATAFAASSGDMQLKAVIAPMLIAAVGVFLSLFGIFLVRTKEGATMKDLLHALGLGTNTAAVLIAAVSFLILYLLGLENWLGVSFSVIAGLAAGVIIGQATEYYTSQSYMPTKAISEASHTGAATVIIKGIGTGMISTCVPVLSISVAIMLSYLCANGFDMSMSALSIQHGLYGIGIAAVGMLSTLGITLATDAYGPIADNAGGNAEMSELGAEVRQRTDALDALGNTTAATGKGFAIGSAALTALALLASYIEEIKIAMARVGTQMTNVAGETIDATKATIPDFMNFFQVNLMNPKVLVGAFIGAMAAFLFCGLTMGAVGRAAGKMVEEVRRQFREIKGILEGTGTPDYGRCVEISTQSAQHEMIIPSLLAIIIPVVVGLLLGVAGVLGLLVGGLAAGFTLAVFMSNAGGAWDNAKKYVEEGNFGGKGSEAHKATIVGDTVGDPFKDTSGPSLNILIKLMSMVSIVMAGLTVACL >CP022041.2|ASE18296.1|527410_528142_-|NADPH-dependent-oxidoreductase MKTINTRKTIRKYTNKDVSEDLLRTLLEKAERTPTMGNLQLYSVIITRNEEKKAQLAPAHFNQPMVMGAPVVLTFCADFRRTTLWAENRKATPGYDNFLSFLNAATDALLYCQTFCNLAEEEGLGTCFLGTTIYNPKTIIEVLQLPRLVMPVATITLGWPAEDPALTDRLPIDSIIHHETYEDYTPDRIDAFYTPKEQLEENKHFVEINNKETLAQVFTDLRYTKEANEAISKALLETLKGQW |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP022041_9 | 526884-526967 | Orphan |
NA
Consensus repeat of CP022041_9
|
1 spacers
spacers of CP022041_9
>9.1|526909|34|CP022041|CRISPRCasFinder GAAATGTAACGAAAGCAACTGGCCCCACCCCCCA |
CRISPR arrays and Neighbor proteins around CP022041_9
The CRISPR arrays of CP022041_9 >merge|CP022041|9|526884-526967|CRISPRCasFinder ACCCCTCCCCCAAAGGGAGGGGAGTGAAATGTAACGAAAGCAACTGGCCCCACCCCCCAACCCCTCCCCCATAGGGAGGGGAGT >CP022041|9|8|526884-526967|CRISPRCasFinder ACCCCTCCCCCAAAGGGAGGGGAGT GAAATGTAACGAAAGCAACTGGCCCCACCCCCCA ACCCCTCCCCCATAGGGAGGGGAGT
>CP022041.2|ASE18295.1|524521_526729_-|sodium-translocating-pyrophosphatase MEHIPQVFWLIPIASVCALGMAWYFFKSMMKAEEGTPRMVEIAEYVRRGAMAYLKQQYKVVLIVFVVLAIVFAIMAYGFNAQNEWVPFAFLTGGFFSGLAGFFGMKTATYASARTANAARNGLNDGLKIAFRSGAVMGLVVVGLGLLDIAIWFIVLTWFYSDKMTTSEMLITITTTMLTFGMGASTQALFARVGGGIYTKAADVGADLVGKVEANIPEDDPRNPATIADNVGDNVGDVAGMGADLYESYCGSILSTAALGATAFAASSGDMQLKAVIAPMLIAAVGVFLSLFGIFLVRTKEGATMKDLLHALGLGTNTAAVLIAAVSFLILYLLGLENWLGVSFSVIAGLAAGVIIGQATEYYTSQSYMPTKAISEASHTGAATVIIKGIGTGMISTCVPVLSISVAIMLSYLCANGFDMSMSALSIQHGLYGIGIAAVGMLSTLGITLATDAYGPIADNAGGNAEMSELGAEVRQRTDALDALGNTTAATGKGFAIGSAALTALALLASYIEEIKIAMARVGTQMTNVAGETIDATKATIPDFMNFFQVNLMNPKVLVGAFIGAMAAFLFCGLTMGAVGRAAGKMVEEVRRQFREIKGILEGTGTPDYGRCVEISTQSAQHEMIIPSLLAIIIPVVVGLLLGVAGVLGLLVGGLAAGFTLAVFMSNAGGAWDNAKKYVEEGNFGGKGSEAHKATIVGDTVGDPFKDTSGPSLNILIKLMSMVSIVMAGLTVACL >CP022041.2|ASE18906.1|523116_523323_-|peptidase MSPIVDWNLLDVLNKNIRDNYERIRPILLKWQENGYIKLIEDNEIAFSFIPEKLPSKEKLIEESLNFK >CP022041.2|ASE18294.1|522803_523061_-|acyl-phosphate-glycerol-3-phosphate-acyltransferase MKRIKIIRVLATFICHDPFAYSPIWTWDGFPPIIYTERERILPVLKEWEHKGYLTLIYDEKIAFILNVEKLPSKEKLIEESRNIK >CP022041.2|ASE18293.1|521908_522388_-|hypothetical-protein MLLVAAMVVSISASAQFQEGKGYLGASLTGLDLHYNGHDGMNIGVQAKAGYFPWDNLMVLATFDAVHNGSEAVADHISVGVGGRYYITQNGLYLGAGVKLLHANHNYNDLMPGVEVGYAFFINRSVTIEPALYYDQSFKTHNYSTVGLKVGLGIYLFDD >CP022041.2|ASE18292.1|521116_521725_-|recombination-protein-RecR MQQYPSQLLERAVEAFSQLPGVGRKTALRLVLHLLRQSTEDVDSFADAVIRVKHDVKYCKVCHNISDNEVCSICSDPRRDASVVCVVENIQDVMAIENTQQFHGLYHVLGGIISPMDGIGPHDLEIESLVERVEEGTVKEIILALASTMEGDTTNFYISRKLKDTGVKLSVIARGISVGDELEYTDEVTLGRSILNRTPFES >CP022041.2|ASE18291.1|520656_521109_-|hypothetical-protein MDKTRKILLIEFFGSCLITLLIIAVYELELILPGAWADVESSNMVTVQFLMQLLTLATIPLALFLFKIGYVHSDLHTDESHVSRKLLFWGSVRIMMLCVPMILNTFFYYAFGDSVSFFYLAVILALSLFFVFPNKKRCEHECSMDNSEQA >CP022041.2|ASE18290.1|519453_520656_-|glycosyltransferase-family-2-protein MKLSVVIVNYNVKYYLQQCLESLQRALKGVEAEVFVVDNHSHDGSVAYLRSRFPDVHFIASAHNLGFAGGNNIAIRQSKGEYVLLLNPDTVVGEEVIHASIDFMDSHLTAGGHGVQMLTHCGERALESRRGLPSPMVSFYKMVGLCKHFPQSGRFAHYYMGSLSWDVPGKIEVISGAYCFLRRTALDKVGLLDEDFFMYGEDVDLSYRLLKGGFENWYLPVRILHYKGESTQKSSFRYVHVFYDAMLIFFRKHYGGMNVLWRLPIKTAIYVKAFGSLIGTTIRATRKKLGFRTSKAKSFPHYIFVAGEDVMGKCQRLATDNALVAEYRVADKDSLQHMHADLLKEFGGKSRAYCIVYDTDLFSYQDILNVFAEQPKQNIHIGFYHRKENRVVTMMEVIGD >CP022041.2|ASE18289.1|518931_519450_-|N-acetyltransferase MEKKQYVEVRLRAMEPEDLDMLYHIENDRSLWNISATNVPYSRYALHNYIADAKNDIYIDGQLRMMIENREQEIVGVIDLVNFDPKHQRAEMGIIIMKPFRQKGYAKAAISALIDYTRNGLHLKQIYAVVDVDNEVSIRCLSSIGFTNGSILKEWLYCDGQYKDARVMQLFI >CP022041.2|ASE18287.1|515849_517628_+|M6-family-metalloprotease-domain-containing-protein MKKIITLLVSVLLATSSFAIPAMRMWRSFKQADGTILKVMTVGDEHFNYALTEDNIPVLPHNGSYYYARIEDNQLVPSSVLAHDKALRKGKEELVAAAIQQVRQLQKQHEMHVNSKPFGEGLGMTWEGKKKGLVILVEFEDVAFKDPKNVLTLKPREKDVKTLYENMLNKVGYTNDNGAIGSVHDYFLDQSNGKFDLTFDVVGPVKLKHPHQFYGERTANMNDANAPQMIIDACNAIQGQVDFSKYDWDDDGEVEQVYVIYAGEGEATGGESSTIWPHKYSLTDAGLDALTFNGQTINTYACSNEIIRAKVNEKSRIYYSGIGTICHEFSHCLGLPDFYDTRGGSNIGSGRYDLMCGGSYNGGPESLINVYGGTGIGTVPAGYDAYEKAYMGWLKPITLGDEAVEVKNMKGLAEGGDAYFLYNPDTKNEYYIFENRTPHRWDAELPGHGLMVFHVDFDAYSWRMNNLNAASAQRHPRFTIVSADGRLDHDTQNSDPFPTDLNNSLTKSTDPRLSFYTNYNVSSQAGVKQIVRNNDNTISFHFTPLKAATGINNLSADHEQLAETYTLSGVKVADNQNLHNQIVIVKGKKVRK >CP022041.2|ASE18286.1|513287_514901_+|RagB/SusD-family-nutrient-uptake-outer-membrane-protein MKTVRTYILAGVAAFALTSCNDYLTTVPKDAMSPSTTWKTGDDAEKFLVGCYDGWEDGGALLYWDAGSDFAYNNFPWEGFTNIGNGSLSPSSPGWSFYDYTIIGRCNTFLENVDKCVFSSDAVKKDLVAQVKAIRAYNYFRMGFLYGGVPIVKPFTSAQEARVPRNTEQEVKDLVFKDLDEAIADINTSPAARGRIAKGAALAMKMRAALYWGDYQKAKDAAQAIIDLGKYELDPDYTNLFKLAGVDSKEIILAVQYKSGTRPLGTIGQLYNNGDGGWSSVVPTQKCVDNYEMSNGMTITEAGSGYDATHPFHGRDPRMAMTILYPGCDWEGTIFNTLDENVNGKKNPNYPTNAANSSKTALTWRKYLDPKTQYADVWDTEACPIVFRYAEVLLTWAEAENELNGPSANVYAMIDKVRTRVGMPAVDQSKYNTKDKLRELIRRERGSEFAGEGLRRADILRWTSNGKMVAETVLNGPLNRITGTINTSATDPTMRAVVSGSSKVEDRTFQTFNRYLPIPQWNISDNPKLEQNPGYAK >CP022041.2|ASE18296.1|527410_528142_-|NADPH-dependent-oxidoreductase MKTINTRKTIRKYTNKDVSEDLLRTLLEKAERTPTMGNLQLYSVIITRNEEKKAQLAPAHFNQPMVMGAPVVLTFCADFRRTTLWAENRKATPGYDNFLSFLNAATDALLYCQTFCNLAEEEGLGTCFLGTTIYNPKTIIEVLQLPRLVMPVATITLGWPAEDPALTDRLPIDSIIHHETYEDYTPDRIDAFYTPKEQLEENKHFVEINNKETLAQVFTDLRYTKEANEAISKALLETLKGQW >CP022041.2|ASE18297.1|528239_530840_+|adenosylcobalamin-dependent-ribonucleoside-diphosphate-reductase METKQTYSFDEAFQASLAYFGGDELAARVWVNKYAMKDSFGNIYEKSPEQMHWRIANEIARIENKYKNPLTAQEVFDLLHHFKYIIPAGSPMTGIGNNYQVASLSNCFVIGLDGNADSYGAIMRIDEEQVQLMKRRGGVGHDLTHIRPKGSPVNNSALTSTGLVPFMERYSNSTREVAQDGRRGALMLSVSIKHPDSEAFIDAKMEEGKVTGANVSVKITDEFMQAVVEDKTYVQQFPTNSSEPSVTKEISAKELWEKIVHNAWKSAEPGVLFWDTIIRESIPDCYADLGFQTVSTNPCGEIPLCPYDSCRLLSLNLYSYVIDPFTDHARFDMELFKRHAQLAQRLMDDIIDLEMEKIDLIMSKIKTDPQVDEVKSAEYHLWEKIKKKSCQGRRTGVGITAEGDMIAAMGLRYGTQEATDFSVDIHRTLALNAYRSSVTMAQERGAFEIYDAKREEKNPFILRLKEADGQLYEDMKKYGRRNIACLTIAPTGTTSLMTQTTSGIEPVFMPVYKRRRKVNPNDTDVHVDFVDEVGDSFEEYIVYHRKFLTWMEVNGIDTQKKYSQEEIDELVKRSPYYKATANDVDWLMKVRMQGEIQKWVDHSISVTVNLPNQVDEELVNKLYVEAWRSGCKGCTIYRDGSRSGVMISVSKKDKTKDEKPVDEEKVKDLNSAEEHHEHICNHPNVIEVRPKELECDVVRFQNNKEKWVAFVGLLEGYPYEIFTGLQDDEEGIALPKSVTKGKIIKQTAEDGSHRYDFQFENKRGYKTTVEGLSEKFNPEYWNYAKLISGVLRYRMPIDHVIKLVGSLQLKNESINTWKNGVERALKKYVVDGTSASGLKCPVCGQETLVYQEGCLICTNCGASRCG >CP022041.2|ASE18907.1|530978_531362_+|dihydroneopterin-aldolase MKLMTSYILLQGLHFHACIGVGEQERVVGNEYVLDLRLGYPFVTAMKSDDVADTLNYAEVFNVIREVMKQPVRLLESVAGSIVEALFAVFPMISSIDLKLVKLNPPMGADSDGAGVELHLINDKTEV >CP022041.2|ASE18298.1|531414_532746_+|N-acetylmuramoyl-L-alanine-amidase MFKKISLLLVFLVLSVSLSWAADGRFTLVIDPGHGGHDAGAIGAISKEKDINLNIALAFGRYVERNLPDVNVIYTRKTDVFIPLHQRADIANKAKADLFISVHTNSVASGRYVKGFQVYTLGMHRAKANLDVAMRENGVISMEKGYQQTYQGFDPNSSESYIMFEFMQNANMERSVELARMIQNSVCSSAGRIDKGVHQAGFLVLRESYMPSCLIELGFITAADEEEYLNSPAGIDAMAKGIYNAFVQYKNAYDTRIVVPYRPVENKRIVIDRVVPTAPTTKPHPVTPIERPRSVAPVQRSRSTVPVEQPRSSTPAPPARPMNKVQEAKKRISDAIRELLPCNDSEAETAKSVPVFKVQVLASNRQLRSGSELFRGHTDIDCVQEGNFYKYCIGSSTNYNDISRLRGKLLKDFSQACIIAYKNGARMDVNQAIAEFLKNKKNK >CP022041.2|ASE18299.1|532757_533738_+|MCE-family-protein MKKFFTPQVRIAIVAILAIVVLFFGIQFLRGISLFSNDAHYKIKFNDITGLSTSTPVYARGFKVGIVRNIDYDYDKLGESITVDIDVEKTLRIPEGTTAEIVSDIMGNVKVVLQFGKSTKLLEPNGWIDGVINDGTLGDLKSMVPSIQKMLPKLDSILGSVNTLLGDPALQSSVHNIDKITANLTTSTRELNTLLAQVNGSLPVVAAKAGRVMDNANGMMVNANRGVTEARGAIRGANTMMSNLNNKVNGLDVEATVAKVNATLDNMNSLTAKLNSNEGTMGLMLNDASLYNNLNSTMRSADSLLTNLKAHPKRYVHFSIFGRKDK >CP022041.2|ASE18300.1|534114_535521_+|chromosomal-replication-initiator-protein-DnaA MNVSPKNLWDSCLQLIKENVTEQQFDTWFRPIVLQSYKPASKTLLVQVPSQFVYEYLEGHYVDLLRKVLTRVFGQGVQLTYRVMVDQENHLSQDLEQDTVEDISSQRPTARANQSPTVLDTVPQDLDSQLDPHKSFSNYVEGDSNKLPRSIGLSIAEHPNTTQFNPMFIYGPSGCGKTHLVNAIGLKAKQLYPQKRVLYVSARLFQVQYTDSVRQNTTNDFINFYQTIDILIVDDIQEWVTATKTQDTFFHIFNHLFRNGKRIILASDRPPVDLKGMNDRLLTRFSCGLIAELEKPNVQLCVDILHSKIKRDGLNIPEDVVRFIAETANGSVRDLQGVINSLLAYSVVYNSNIDMRLAERVIKRAVKIDDEPLTIDDILDKVCTHYNVTMSAVNSRSRKKDIVMARQVSMYMAQKYTKMPASRIGKLVGNRDHSTVIHSCSKIEDRLKVDKGFHAEIASIENSFKLKA >CP022041.2|ASE18908.1|536637_538239_+|Na+/H+-antiporter MEHQLVLIIALILATCLLIMVSQRVKVAYPIMLVLGGVAMSFIPGMPRFNINPDLIFLVFLPPILYEAAYYNSWKELWRWRRIISSFAFIVVFITALVVGFIANTFIPGFSVALGFLLGGIVSPPDAVSAAAIMKFVKVPRRISAILEGESLFNDASSLIIVKFALIAIGTGQFVWYQATASFIWMVIGGAGVGVLLSYAIIKLHKWLHKWLPIDENINTMFTIFSPYVMYIAAETVEASGVLAVVSGGLYFSYRRLQIIGSSSRLRSEHVWNFLIFLLNGMAFLLIGLDLPEIMTGLKEDSVSLWMATAYGLLITMALVVIRMGAAFSAVYITRFMSKFITVADSRKQSYAGPLVLGWTGMRGVVSLAAALSIPLYIPGTQIAFPERSLVLYITFMVITLTLIFQGLTLPVLLKIVKLPNYDDHMSHAEAQRIIRIGMAQASLDFLERNNMSENITHSAVLNNLGNHWNELLESEGSATLYDEVARKTYREILEEQRLWLNKLNNENERVDEELVRHYIHRIDLEEERLLKE >CP022041.2|ASE18301.1|538767_539934_-|NADH-dependent-alcohol-dehydrogenase MVNFDYWTPTRLVFGKDVVREKLVETMRPLGKRVLMTYGGGSIKKIGLYDLVKDLLKDFEIFELPGIEPNPKYDPSVLEGVRICKEQKIDVILAVGGGSVLDCSKAIAAGAYYDGEAWDLISYKVKAQKALPIVDIITLAATGSEYDCGGVITNTAINDKRGYMDALLYPVASFMDPTYTFTVPAKHTAAGTADAINHMMEQYFCASPNDISDGFLETLIKTLMKWVSVAMKEPDNYEARAELMYACTFGCNGILAMGTGGSGWPMHAIEHALSAYYDITHGVGLAIITPRWMKHVLSEKTMERFVKFGKNIYGITEGTDKEIAEQVIDRTYKFFESIDIPMHLREVGIDESRVGEMAHHVASIDHLENCPFAPLSEQDIAEIVTASL >CP022041.2|ASE18302.1|539957_540746_-|cupin-domain-containing-protein MATAVMGIQAQKNMERKEIKQTAGRATLGEFAPEFAHLNDDILFGEVWNRQEEMSLHDRSLTTILSLVAQGITDSSLKYHLNTAKANGVTRQEFSEMITHAAFYIGWPKAWAVFNMAKEVWTSDIQTKEEFQASTPYPIGEPNTGYAKYFIGLNYLAPMEADKGGVVNVTFEPRCHNNWHIHHKSVQVLICVAGRGWYQEWGKEAVEMKPGTVIAIPEGVKHWHGAARDGWMQHLTYNTHVEDSSSNEWLEPVGDDVYDTLK >CP022041.2|ASE18909.1|540760_540955_-|hypothetical-protein MINSSDGSDILVFNCYLCHVSFLLTCYVCSTKIGEISGISKCFENFVSQALGVLTKIYAAELRE |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP022041_10 | 740687-740792 | Orphan |
NA
Consensus repeat of CP022041_10
|
1 spacers
spacers of CP022041_10
>10.1|740714|52|CP022041|CRISPRCasFinder GGTAGTCCTTTCGCCTTCCTCGAACCAGCAGTGAGCGTCGGTTTAGAACGGG |
CRISPR arrays and Neighbor proteins around CP022041_10
The CRISPR arrays of CP022041_10 >merge|CP022041|10|740687-740792|CRISPRCasFinder GGGGAGTGAACAGCGTGTTATCCCTATGGTAGTCCTTTCGCCTTCCTCGAACCAGCAGTGAGCGTCGGTTTAGAACGGGGGGGAGTGAACAGCGTGTTATCCCTAT >CP022041|10|9|740687-740792|CRISPRCasFinder GGGGAGTGAACAGCGTGTTATCCCTAT GGTAGTCCTTTCGCCTTCCTCGAACCAGCAGTGAGCGTCGGTTTAGAACGGG GGGGAGTGAACAGCGTGTTATCCCTAT
>CP022041.2|ASE18442.1|740167_740512_+|50S-ribosomal-protein-L20 MPRSVNHVASKAKRTRILKQTKGYYGARKNVWTVAKNTYEKGLTYAYRDRRNKKRNFRALWIQRINAAARLYDMSYSQLMGALHKAGIEINRKVLADLAVNNQEAFKAIVDKVK >CP022041.2|ASE18441.1|739934_740132_+|50S-ribosomal-protein-L35 MPKQKTNSGAKKRFTFTGTGKIKRHHAYHSHILTKKTKKQKRNLVHQTLVDGTNLKQVRDLLRLR >CP022041.2|ASE18440.1|739140_739821_+|translation-initiation-factor-IF-3 MKNDKMKMKYRVNEQIRVREVRVVSDDGAEVMPTRKALELAQKEGVDLVEISPNAQPPVCRIIDYSKFLYQQKKHQKEMKQKQVKQEVKEIRFGPQTDEHDYKFKLKHAQEFLNAGNKVRAYVFFRGRSILFKEQGEVLLLRFANDLEELAKVEQLPKLEGKKMFLYLAPKKAGVAKKSQQKRDREEAEAGAKAGAEGVSEEPKTDGGLFANAKNGADALKKLNID >CP022041.2|ASE18439.1|736805_738755_-|threonine--tRNA-ligase MVKITFPDGSVREYEQGVTGFQIAESISPALARDVVSCGVNGETTELNRPINEDATIALYKFDDEEGKHTFWHTSAHLLAEALQELYPGIQFGFGPAVENGFFYDVMPAEGQTISENDFPKIEEKMRELAKKNEQVVRRDVSKADAVKEFTADGQEYKVEHIVEDLEDGTISTYSQGNFTDLCRGPHLVSTGAIKAVKITSVAGAFWRGDAKREQMTRIYGITFPKKKMLDEYLEMLEEAKKRDHRKIGKEMELFMFSDRVGKGLPIWLPKGTQLRLRLQELLRRLLRPYNYQEVITPGIGGKNLYVTSGHYAHYGKDAFQPIHTPEEDEEYMLKPMNCPHHCEIYAHKPRSYKDLPLRIAEFGTVFRYEKSGELHGLTRVRTFTQDDAHIFVRPDQVKAEFENNIDIILKVFKTFGFDNYEAQISLRDPEDKEKYIGSDEVWEESEAAIKEACAEKGLNARVELGEAAFYGPKLDFMVKDAIGRRWQLGTIQVDYNLPNRFKLEYTAEDNSKKTPVMVHRAPFGSLERFTAVLIEHTAGHFPLWLTPDQVAILPISEKYNDYARKVKAYFDAHDVRSIMDDRNEKIGRKIRDNELKRVPYMVIVGEKEAAEGLVSMRQQGGGEQATMTMEAFAERINNEVAEQLKGLD >CP022041.2|ASE18918.1|731399_735416_-|hybrid-sensor-histidine-kinase/response-regulator MLMLSNLNTDNGLSSARVYSIVEAEDGAMWISTKRGVDRYNGQQVRNYTLATDMQFSDASGRNIKLTKNSQQHIYAYDNKGKIYIYNKVKDTFQLQVNLMNVLGGSVVLNDLLVDDKGCFWMALDRGVYMMAPLSIAGSKDKTAYSSSNKGKYILKNTYVNHIRFFGKQLLIGSPSGVFAYSQNAAQPRLLLRGYSVLSSYHDVKAHQIWLGTFHRGILLLDDRTWKPLSSLTQFSSLSDIPLIPVRSIIPYDAATILMAVDGAGVYAYDKKTCKTNLLLNTDGRPENVLHGNGVYALCKDHLGDLWMGSYSGGVDMAIPMEHTLEFVRHENLNSQSLINNGVNDVMQSVDAQGRNSKIWYATDKGVSIYDEQTHLWHHSLYNKVALTLCQTADGKVLVGTYGDGIFQVHADGSSSRAYSVSNGKLKTDYVYSIFKDSEGGLWIGCLDGDLVHIPSSLEKNGNVDNSVNYLPINEVQSIVESPDKKIIAVGTTHGCYRIDKRSLRVSRFFYPSEFPSVDYNFFVNAMDFQDARHIWIATDGGGLYLYDWQKHQVKNYTISQSLPSNTVYALVKTPIGQVWMSTDKGLAFIDHGKVINLNFFKGLEREYKRMAVVRTQDGRMIFGSSEGAVVLASKFARGLNYKAPLHITGVEVEGKTFDEADQLEDWHAELFEMLQEGKMSFSHDEKTLIVHFESINYPYQHDIQYQYYLAGYDHQWSVPSAYQQARFANLPPGSYTLHVKAMGRSNGRILGESTLRIHIAQPWWNSWWAWIAYLCILGVIVYFGWDYYRERLHRKYYDDKINFFVNTAHNIRTPLSLVLAPLADLAKDTTLGDKSRKFLEMALCNGDKLLRMVTELLDFQKIAVVKTQVHMQKVELSVLLRQQVDKFQMSAQEKHLNLRLATCGQHTFSTDLSMMDLIFENLLSNAIKYTKVGGTITISASIDEIGKQVSIQVSDTGIGIPKIEAKHIFQSFFRASNAVNSQEMGSGLGLMLTRQLVQKLGGKLTFESEEGKGSTFLVVLPDNGYVDVSVSSKPSSLPETSDNSMSTDEKIKSEESLKDTLLFVDDNEDFRQYIRMAFADQYHVVDVESGEAALKYLSENGECDIVVSDVMMPGMQGDELCRSIKENKETSWLPVILLTAKAGRDFMIEGLDLGADDYIAKPFDSAILASKITSMLKNRCRLSQYYMERSLAIVRGEDSGQSSQKSLLPSEPSMSSASDEKNKSSDEKELGLDPMDQAFVEKATRLILDNLSDTDFTIDRLCREMAMSRTLFYGRLKTLTGQGPQDFIRLIRLEQAAQYLKQGDSVLDVSMKTGFVNVKYFSTVFKKHFGVSPSKYD >CP022041.2|ASE18438.1|729490_730708_+|IS4-family-transposase MGKSTHFIGQPVYNQVIKLLDKQQIKQISLETPRSEAYVKRLDGWTHLVIMLFVVLKHFDSLREVEIGMKAEVNKFHHLGIDYIVRRSTLADANKRRPQEFFASVYAYLLERYGSFLSDSRPKGEQKTWEKLLYMMDSTTITLFDNILKGVGRHPKSGKKKDGMKVHTVMKYHVGVPMVVQLTSAATHDHYLLKEVHLPKDATLTMDRAYVDYAQFQRLTEEGVCYVTKMKKNLTYTELSSVTYVSPDGLVTHTDKKIVFEKGEIRHQARRVELWSDNSHKSVVLLTNNLELDVKDLEEIYKRRWAIESLYKQLKQNFPLHFFYGDSVNAIQIQTWVVLIANLLCTVISRMIKRHVSFSQLVTMLRLTLMYYTDFISFMENPQNDELIIIAEKANSPPKELDLFD >CP022041.2|ASE18437.1|727775_728990_-|endoglucanase MKRNLLKICLLFVSMFTVFSCSASPSVVDTPDTSESIPTAKQWNKDVVGWNLGNEFECSAPGQDGESMQIGNPDGSIHAETAWGNPVVTKKMIQAVKKAGFNAIRIPIRWQCHITNAQAMSIDKAWIARIKEVVGWCLDNDLKVIINAHHEKWLEGRPTYQYKDENCQKLALLWMNIASEFANYDSRLAFAGTNEVHVRDNWSKPTAENLEVQNAYNQIFVDMVRATGGNNAKRHLILQTYVCNPWFGIENGGFVIPKDAEGNGNNYMSVEFHYYQPWSYAGDCAYDYWGDAYKDVGKIPAENEKTMTDFFDKAMNTWSNKGLGIVIGEWGVTDHYKSNSVKVHENMTYYCKFLTTEARKRGFSTFVWDNNHFGNGSEKYGIFDRFKSMKVNAPWILEGIFGKE >CP022041.2|ASE18436.1|724571_727748_-|SusC/RagA-family-TonB-linked-outer-membrane-protein MRKKTMFLGMLGAGLMWMPTSAILAAQMNNAAMSVQQNQGIKGTVVDATGETLIGASVKVTGTTNGVVTDLDGNFTLNCKPGATLEVSYVGYKTMTVKAVNGMKIKMQVDSKALNEVVVTALGIKRDRKALGYGLEEVKGEELSKAKETNVINSLSGKVAGLVVQNTAGGASGSTRVLLRGNTEMAGNNQPLYVVDGVPLDNTNFGSAGESGGYDLGDGISAINPDDIETMTVLKGPAASALYGSRASHGVILITTKKAEKDKISVEYNGSYTVDTQLAKWDDIQEIYGAGYNGELPASSTSGTNTSWGPKADDFMFKYFDGEERPFLMHPNNASDFFRTGFTTQNSAILSVNSGKTGMRFSVTDMRNKDILPNTNMSRDNFNLRVNTSAGPVDFDFTANYTREKVKNRPALGDSQSNVGKNLMTLAGTYDQAWLKHYEDADGNYSNWNGNDQYNKNPYWDLYKNSNTSDKDVFRFTGKAIWNIDNHLKLQGTIGTDINSMNFEDFIAKTTPGTPAGKLTDQIFNNRTFNAEILALYNNSWGDFDVNATAGGNIFKVNNKTTTNVGLNQQMNGIQNIMNYQEQNTRESMYKKQISSLYASASLGYKHTYYLEGTLRGDRSSTLPTNNNTYVYPSVSGSLVFSEFIKNKKFINYGKIRASWAKVGSDTDPYLLALNYTTGKYSYSGYTIGMIANSTQPNKDLKPTMTDSYEVGLEMKFFNGRLGLDATYYNQNSKDQILSLASTTTSGYAYRLINAGEIQNQGIEIALNARALQIKDFAWDLGVNFSKNTNKVKSLTNGMDYFELAKAYWCGVSVGAKVGENYGAIRGHDFLYNDKGQVVVDAATGLPKVDQKIKTIGNSTWDWTGGFYSTFSYKNFRLSAAFDVKVGADIYSMSMRSAYQTGKAKGTLAGREEWYTSEEARKASGMDLAAWRETGNCKGFVVEGVIDNGDGTYRKNDIAVNPEDYWKHVANGVQSAFVYDNSYVKCREITFGYTFPESILGKYVKGLTVSFVARNPFIIWKNIPNIDPDSSYNTSGLGLEYGSLPSRKSYGLNVNVKF >CP022041.2|ASE18917.1|722737_724513_-|SusD/RagB-family-nutrient-binding-outer-membrane-lipoprotein MNNIIKKYIGKSSLMMAFALMTTGTAMTSCSDETLSNINTDKTKVSELDPNAQLTTALLQTYGDFSLMDTYRNYISGFPQYFAGGWNVTNYAGSNFREDDIARRVWDRYYEVSIKNLVDAIHNSADKANLNAALRIHRVYLTAVLADTYGDVPCLEAGLGYISGISTPKYDTVEELYSWFFEELDACEKQLGTGTDRISGDVTSMGGDVAKWKKYANALRMRYAMRISDVNPQKAKEEFEKAVAAGAIASAADDAYIRYADTPYTYYDGANDYDFRTNALGEILYGQDATSPTMVCSTLFYQLQNTNDPRLYRICRHYYNIKRSQVKPDKEQNIDLTDDFLAYFRSKNLGEEPCNPGATWYTDWMSPATVDDLPTLKKYAEIDKNTYANSDYIARAGRPCLNIDFEMPSCPGDLMSYAEVEFLKAEAATKGWNVGGGDAESHYEAGVRASMELLNNYYLTSNKISKEEIDAFIANNPLGDNPKETINTQAWILHMMNPSEGWANMRRSDYPAILNRDLLTKNGFTYTDSNWSMPVRLQYPELEGQYNSANYKAAIDRMGGTDDWHKRLWWDKADVNLQPNFNPPFGKGYSK >CP022041.2|ASE18435.1|721672_722671_-|hypothetical-protein MNKTMKQYKGKVLKAMMMLFAMSFALAGTSTLSSCSSDDDPYFTVSENDDPRILNTDLADSKIDRKTNYKLEIKVTPVHYTTVTWLLDGTQIAEGTTIDQTLPLGNHELKIVATTTKGKTTSRTLNVTVTPAADDPALGTNAIELRVAPGETTTIHECKNLGLVQKVLIADKEVAFEVLDEGTTLKVTAPSDLANGDYDITLVDSNGVQFAGGTIKVTTEARQSVENTIWEGEFAVTWGTPFNALKDTFLSKVKAGTILRVYVNGNGQGTAATAWWNNILTGKGDPERGDIMVDGPAKWEFKLTDLSIQLLTEQDGFLLVGDGYTVKKVTIE >CP022041.2|ASE18443.2|741003_741597_+|hypothetical-protein MTKQDSLLSDNFGQKVDGQLKGSYRADTVSLIDKYFRRGMSRKGILMIPKGKRPAPETYLKRRYIRRHLKNFKAGASCIVSKELLERYHGDSIGKADNSQFIMMKSEMDSVLMRSHGDLSRIEHELGIPAGAWKHRVLVRIDIPKPKKLKLRMPSGNEIGANVLWLPGGLLPTGYKEAVIDRIPKGKYKASLIEITQ >CP022041.2|ASE18444.1|741647_742082_-|transcriptional-regulator MHDIIGGFIGLTLVHIGAALRFVYHRFIIRDNYSYHSLITESPVFDCFKESYKEQFKRWKQRQIQRNQAYDIDLNEEQQQTLEMFLKEGRSKKEIIQGMIETGELKLIDVDIYPRNPEYFSNRVLDGIIGLCFLIILILIIHYI >CP022041.2|ASE18445.1|742146_742644_-|hypothetical-protein MNQTKKILAYLGCIAFTALGVWLLTIENPTTTYPLWTIRVTGILSIIFFGGGGLFMAYKKVKLLINHKKEIEFTDRGISICGAEEILWNEIADFSLIRFKGNRLITIQMKNPEKVIANEPSWIKRKTMEYNLKTINALYSFPAYMMDGRAEEALTLCKLNLAKHK >CP022041.2|AVV27050.1|742650_745284_-|RNA-binding-transcriptional-accessory-protein MTVGSGEPEGASGASFISHSLSLPLQSVSAVLTLLNEGCTIPFISRYRKERTGGLDEVQITDISELYDRLKELGKRKETILRTIREQEKLTPELEAKILACMDSTELEDIYLPYKPKRRTRAQIAREQGLEPLALAIMREAQKPTAPPDLPEGGGDKLASILQKYQGRAKESLSSRVRIGTPPLSGRSGGALALDIIAEIVSENQQARNTVRTAYQRGAVITSKVIKKMKDTDEAQKFADYFDFSEPLRRCNSHRLLAMRRGEDQGILRVSITIDGEECIARLTRQFVRGHGVCQTLVSQAVEDSFKRLINPSIENEFAALSKGRADEEAIKVFTENLRQLLLSPPLGQKRVLALDPGFANGCKIACLDEQGNLLHHEIIYPHPPRNQVRQATEALQRMINTYNIEAIAIGNGTASRESKEFVENITTETTTGPSPSPLPHREGSDYRHLPKSKQQFTDNTSPINSKPQSAGHTTPLPLGEGSGEGPVRPVGPVGSASSLFIFLVSEDGASIYSASPVAREEFPDEDVTTRGAISIGRRLMDPLAELVKIDPKSIGVGQYQHDVDQSKLKHSLDQTVMSCVNQVGVNLNTASLHLLTYVSGLGPALARNIIDYRREHGPFTSRAQLKKVKRLGDTAYQQCAGFLRIPDAKNPLDNSAVHPESYHIVEQMAKDLKCTIKDLIGNKKLLAEIDVKRYLTPQPPLRRERGSEASPNPSERRGGAPPNLPEKGGVPMRTREDKGALNLPQHLSNVSTSLPPLLSEGSGEATLRDILTELEKPGRDPRGEVEVFEFDKNVHTLSDLIIGMELPGIVTNITNFGAFVDIGVHQDGLVHISQLSDRFVTDPTQVIRLHQHVRVRVVEVDMRRKRIALSMKNIKQ >CP022041.2|ASE18448.1|745425_745956_-|50S-ribosomal-protein-L19 MELNLKRTLTCIILTVLTTLSTHAQTLCVIDGTPLPDSLLHVTIDEMRSDSAKEIVAKRLGLIPPQAIESIQTFAVEEQIRQGKNIIFCKPPKDIIIMRTNSLAELQWVINGKLRKPRKKLTIIDYKLSPQRITEALPKGIKPTDIGSVNILTYINDPRQEKHPTIVIKTKSLTTK >CP022041.2|ASE18449.2|745979_746552_-|twin-arginine-translocation-pathway-signal-protein MLLSLLTLNFGVKGYWKIKSYSFQSYFKDVWEICHEKGYNEDYCILVDFSRPSGEDRMAIIDLKTLSVLDTGPCAHGKGKGNSAWKPVFSNKEGSRCSSLGAFKIAEKGYSATVGLRFALDGLDASNSNARRRNILIHSSRYVGIMHHLTSYLPLSDASWGCFTTSPAMLKKIEALCDKSKKPILLYAYK >CP022041.2|ASE18919.1|746769_748911_-|M3-family-peptidase MKQNLKNVVLAAGLACTALTGQAQKARPKTTQTVSNSLMKQSTLPFNAPDFSRIKGEDYLPAIKAAIAEQRAEIKKITDNKQKPTFANTILAYERSGKDLERISNIFYALVSADKTPEIEKAQGSIVPLMTEFENEIKFNQKFFQRIKYVYDHEYKTLKGEDKKLLEVVYKDFTHAGALLPKEKMARMQEINKELAKLQQEFGDMLPKAANEATVWVSDVKELAGLSETDIAQCKKDAESRGGKAPYCIVITNTTQQPILASLENRGLRERVYNASIHRTDGTGAYNTFPVIVKIARLRAEKAQLMGYKNYASYSLSKTMAKNTDNVYAFLHQMIEAYKPKSEAQTKAIEEYAQKTEGADFRLQPYDRFYYSAKMKKDQYSFSDDDVKPYFNLDSVLVNGIFYAAHRVYGLSFRERKDIPTYHKDMKVFDVIDANGKQLALFYCDYFRRPTKRGGAWMSAFLKQSGDRHQKPLIYNVCNYAKAPEGQPTLLTWDETQTMFHEFGHALHGMLSNCKYNTLSGTAVSRDFVEMPSQFNESFASIPEVFNNYARHYKTNEPMPDALREKMLGSLNFLSAYALGENLSATSVDLAWHCLSPSEVPTVEEAPAFEKKVLADMGLLNNQIPPRYSTSYFNHIWGGGYAAGYYSYLWSEVLAANIADYFEAHGALTRKVGDDFRQKILSRGNTRDLMQIFSDFTGLKAPDTKGLLKARGM >CP022041.2|ASE18450.1|749617_750628_-|iron-ABC-transporter-permease MKRNILLFICLATSILLLFGLNLTTGSVQIPFADILDILCGRFIGKESWEYIILENRLPQTLTAILCGASLSVCGLMLQTAFRNPLAGPDVFGISSGAGLGVALVMLLLGGTVSTSIFTVSGFLAILTAAFVGAIAVTVLILFLSTLVRNSVLLLIVGIMVGYVSSSAVSLLNFFASEEGVKSYMVWGMGNFGAVSMNHIPPFSILCLIGIIASFLLVKPLNILLLGPQYAESLGISTRQIRNILLVVVGLLTAITTAFCGPISFIGLAIPHIARLLFRTENHQILLPGTVLSGAVIALLCNFICYLPGESGIIPLNAVTPLIGAPVIIYVIIQRR >CP022041.2|ASE18920.1|750649_751780_-|ABC-transporter-substrate-binding-protein MRQLLVFTLSVLLFLSCGNHQKQVADKSEKEEKGDKTELQYARNITIERTKDYVVVRLLNPWKAGTVLHTYYLVERGKDVNVPDDGTKVVIPLRKSVIFTTAHANLVEMLHAQKAIAGVADLKYMIIPDIQKRARKRGGIVDCGDAMKPDVERIIDLNADAILLSPFENNGGYGRLEQIGVPIIECADYMERSALGRAEWMKFYGILFGREHEADSLFAVVKQNYKSLSQKASQSKVTRSVLPDRKVGAVWYLPGGESSVGLLYKDAHGRYAYSNDKHSGSLAMPFETILDKFAQSDFWILSYNSNFNRRVLLAEYQGYAKLKPYQTKEIYGCKIDSKPYFEEVSWRPDWLLSDLIQLFHPDLKIAPLRYYQKLED >CP022041.2|ASE18451.1|751910_753377_+|dihydroorotate-dehydrogenase MAEEKTNELGNHIEMKKVWELLAILVVTAIIWNLPTSSFGIDGLTVVQQRIIAIFVFATLSWLTECIPAWATSLSIMSIMCVTVSKNAFGVFKGDGIGELLDSKEIMASFADPIIMLFLAGFILAIAASKSGLDTLLARNLIKPFGNKSENVLLGFLLITGLFSMFISNTATAALMLTFLTPVFAALPANGKGRIALTMSIPIAANLGGMGTPIGTPPNLIALKYLNDPAGLNMNIDFMHWMAFMAPLVIVLLLLSWRIILYFFPFTQKTIHLKIDGEVHRGWRMWVVIITFIVTILLWVIPKDVTGIDTNTVSMIPMAIFAITGVITAKDMQEINWSVIWMVAGGFAIGLGMNGSGLADAAIESIPFGNWSPIVILAISGLICYFLSNFISNTATAALLVPILAVVCRGMGDKLGGIGGTSTVLIGIAIAASTAMCLPISTPPNAIAYSTGLVKQNDMLKVGLTSGVVSLILGYILLYFIGQIHFLG |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP022041_11 | 1315103-1315206 | Orphan |
NA
Consensus repeat of CP022041_11
|
1 spacers
spacers of CP022041_11
>11.1|1315127|56|CP022041|CRISPRCasFinder TAAACAAGTTGCTTGTATAGTTGGCAAGTGAACGAGTTGACGAGTAAACAAGTTGC |
CRISPR arrays and Neighbor proteins around CP022041_11
The CRISPR arrays of CP022041_11 >merge|CP022041|11|1315103-1315206|CRISPRCasFinder TTGTCAAGTAAACAAGTTGACGAGTAAACAAGTTGCTTGTATAGTTGGCAAGTGAACGAGTTGACGAGTAAACAAGTTGCTTGTGTAGTTGACAAGTGAACAGG >CP022041|11|10|1315103-1315206|CRISPRCasFinder TTGTCAAGTAAACAAGTTGACGAG TAAACAAGTTGCTTGTATAGTTGGCAAGTGAACGAGTTGACGAGTAAACAAGTTGC TTGTGTAGTTGACAAGTGAACAGG
>CP022041.2|ASE18844.1|1312540_1313494_+|hypothetical-protein MNTKHIILMACAALLGATQANAQGQDLSILTANTDARTAAMGNASAAAEGMYLYNNPAAFFATDKKFTADASASLFEKAEGADGTFGIYALSAGYKLAKRHAVFVGFRYAGGLSLKGSDLLGNPTKDYKPYNWTLDLGYTYFLGKGFATYATGSLIYSHLSKNATGAVFSVGGAYQNNELTLANKPANLMLDAKVGAIGPQLDYGNKHKTTLPTYLAVGGALSVEVAEKHQVAAALSSRYFFQPSEAKLFMLGGGLEYTYNKMVSVRAGYEYGDHDLSHVTMGAGFKYHGLRLNGAYNLKTADTGSSYCTIGIGYDF >CP022041.2|ASE18843.1|1309772_1312529_+|T9SS-C-terminal-target-domain-containing-protein MRKSIIYLCACALSGMMVTTSCQDNLDSDAGVSNTRSVNIDKDLFAIKGCINVKLAKGTNQAIPTTRSGSVEMQSVPSAMTSAMQYSGAYKMERVFKPAGIYEARTVAEGLDRWYTIYFDKSKDVAAVLDQFKKAEGVECAEQVLPMARPTVKMTPYSPSGASMQATASTFDDPLLAKQWHYYNDGSVNARAKKGADCNVKPVWEKYTTGKKNVIVAVVDGGIDITHEDLKDNLYVNEKEKNGQPNVDDDGNGFVDDIYGYNFVTAKDVVGGTIEPDDGGHGTHVAGTVAARNNNGKGVAGIAGGDGSPDSGVRLLSCQIFRNKDEQGDAAAAIKYAADNGAVICQNSWGYSSTAGVTSMPQLLKEAVDYFIKMAGCDANGNQRPDSPMKGGVVMFAAGNENKEFSAYPACYAPTVSVAAMAWDFSKASYSNYAKWVTITAPGGDQDRFGTEAGVLSTVPKKKVASGYAYFQGTSMACPHVSGIAALIASYFGRQGFTNEELKSRLITAYRPYNIDEQNPTYKGKLGKGYIDAEAAFETDTKIAPEKVGTLTLKPDFVDINAEWSIAKDEDKTAAFYRLYIAQGELTADKLKDMTYREINGMGHSLGETLKYDFDDLKDNTTYSVAVVAVDRWGNLSEPMIQKCTTRLNHAPEATNFPTEAIEVMENDRKSFSFNVADPDGHNWDIKATGETKGVSYTVKGNTVTVNLVPVLEAGSYNCTFILSDDLGAKAEKSFTFKIVKYIPPQLTKPFENYIIGLDEGVVTIPLTGHYTHSGNTQLTYKATAANGSIATATISNDNLQLKGMAKGVTRISIAATDGRETSSDGSFQVRVVEKKSAPVYAVYPIPVQRDIHTLLNPEVKQAELVISSTVGERLMKATVTPDKNNVATLDLSKLNPGTYKLTVYTSKGNHTQMFIKR >CP022041.2|ASE18842.1|1308155_1309757_+|hypothetical-protein MRIINIHKLLQFSLLYLLLGIAVGCAENDSFDQPYLNVSEKEISFSNQIGEKTITVNTNCKEWMATTPKAWVHLSQSGNEIAVHVDPNTTGMERSSYILVDGGLAVQKIMVSQSAADITLNLNNGEVILPQAGGTTTVDLKMDATSYDLTQSEQPEWMQVIKKKHGLKFISKPNYSTTERTTKLTIAFAGKNHEVVVKQPGVATFILACNPGNPYSLHKMMDYEYRRGSFLTEYGGPDEVNGIFEESYFFKTPSPLFKDVVYVHDTKHSVPTRIYTRSLTREGVNAVKSQAFQEFMRANGYTRDEKDTNHYVNIKEAFTMDVDIREENNSVVLFFYQMHTQDRSYPTFSSLDLGPVDLLNKTDKKISDVEAYETGKNSEEMKRQMSKSNEVEAILYKTNDPTLIARTYFFYLHNDAAVPQEKAGSVEQYSLFYSQPNLGIWQYGNEWFVTHEFDKLLTANNFEFVGYNGKHHVYARRSDYLTLAISGGEYADVNNGKAVMQITVLYKPTVFAGSKEQRLAKVERMLKQYNPKK >CP022041.2|ASE18841.1|1306499_1308119_+|hypothetical-protein MELKKTLYSLMLGGILLCLFSCAKEDEFEMPTLVLSENSIAFDKGVSERNISVTTNQNSWIASSPQEGDWLSLVQDGNVLKVKVTENKIGTERTSYVLVNANGASGKIAVTQSAADVTLDVVPNAIYLPQTGGEKTIDITTNSSVYDVTTSEEVSWLKIVKSEEEIKLIAERNDTYQKREVKLYAKSGSVIREIVVSQSGIQRYLLPIHPGVPQDEHKIMDFELGRGSYLREYQAAMPAYGLEETYTFITASPIFTLIQYCSTDGINPSQIIMIGDGRKAIDAVKDKAFDKFLTDNGYVRSNSQSDREYTNDKDLLSLKVYISEKENNEGVNLTFTPIMKQNGEYKTFSKLPFYPLELLQKDNVKLAQIEQYEQKAGSTEEERSLNEHKNTEVSQIQYKLKASTDPSAAYGRIHIFYTTDKDGDAPDNLGSVQIGALLFKDTNLGVWKYGTKWVATKEIKKVLGDEGFSFLRTSGNNHFFVRESDHLVIDVTCVLDNAMPVLALLYSYDPSVSGASSKAIKTQAKMIRNFAAAKKALKF >CP022041.2|ASE18840.1|1304752_1305565_+|PepSY-domain-containing-protein MRKFCLKIHRWFALPLGVIMAILCFSGLAILLIKDLAPLFDMNAKEMPIYTIVVRLHRWLFMKPENAHEGGQSLGRILTAVSAICMSIVLLSGVVIWWPKTKKALKSRLTVSTNKGFRRFVYDSHVSLGIYVFIFLFLMALTGPVFSFGWYRAGMSKLFGQPMPPKEMKMQQPKDGMKQGGTNDKAFAPTDASQMKGQPQAHKEGAKDMKGDQHGKKPKGGKLFKQLHTGTWGGWFSRVLYAIAALIGGFLPISGYYLWWKRRSAKKKKA >CP022041.2|ASE18839.1|1303820_1304546_+|KR-domain-containing-protein MKKAIVVGASSGIGHEVARLLIAEGWAVGVAARRIDKLTDLQAMAPERVYTVQIDVTNEDAETSLLQLIERMNGIDLYFHAAGIGWQNPSLNADIELKTMETNAVGFTRMIGCAYRYFANKGGGHIACITSIAGTKGLGPAPAYSATKAMQNTYLQALEQLAACKHHNIHFTDIRPGFVDTPLLAGTSHLPMLMTTKKVARSIIKAINSRRHICVIDSRWCVLTYLWRHIPNWIWRRMKLC >CP022041.2|ASE18939.1|1303578_1303824_+|hypothetical-protein MELDTTNMCSHLQKKLFNEEGVYYPIWQAMQNDDEITAVIRSRQLHIYRNGKKVLVLPGKAAPKIIREDSLNELLPKDLLK >CP022041.2|ASE18838.1|1301171_1302782_-|CTP-synthase MAETKYIFVTGGVVSSLGKGIISSSIGKLLQARGYNITIQKFDPYINIDPGTLNPYEHGECYVTEDGMETDLDLGHYERFTGIKTTKANSMTTGRIYKSVIDKERRGDYLGKTIQVVPHITDEIKRNIKLLGQKYHYDFVITEIGGTIGDIESAPFLEAIRQLKWELGKRAINLHLTYVPYLKAAGELKTKPTQHSVKELQSVGIQPDVLVMRTEKHLDDDIRKKVAAFCNVDFDCVVQSEDLPSIYDVPVNMLEQGLDAAILRKCGEEVGPKPALGPWKEFLDRQRKATKEVHIGLVGKYDLQDAYKSIREGLLQAGTYNDRKTVITFINSEELTEENVAEKLKGQDGIVICPGFGQRGIEGKIVAAHYTRTHDIPTFGICLGMQMMVIEFARNVLGYKDANSREIDEKTTHNVIDIMEEQKNITNMGGTMRLGAYECVLRQGSHTFNIYKQEHIQERHRHRYEFNNDYEKEFEKHGMMCVGRNPESDLVEIVEIPGLKWYIGTQFHPEYQSTVLGPHPLFLDFVKTSIENQKNK >CP022041.2|ASE18837.1|1299239_1301171_-|membrane-protein-insertase-YidC MDKRTITGFVLIALILFGFAWWQQPSAEQVAQQRAEFVKDSIASAKKAQTAKLAAEKQAQQKAAQATDTTALFYAALNGKAQDIILKNSKVELTLSTKGGVVKKAVIKNYIGHNIAVKDGSQDQKNVTLFSGDDQSLNFMLAAKNSNIETKDLIFTPSNVTDSTVTLTAVAGEGKTLTLNYTLGKDYLLNMSLQAEGMGGLFAPNYNQIDINWQERCKQQERGFTFENRYATLTYKKHDGGTDYLSETSEKEETTEDSMDWVAFKNQFFSAVMIAKDNFATGAKLKSTPLEKSSHYLKHYEANMKAGFDPTGKRPSEFEFYFGPNDFRLLQSVETESKFAKELDMERLVYLGWPLFRIINRWFTLYVFDWLSKVFPMGVVLILITLLLKLITFPMVKKSYMSSAKMRVLKPKLDEATKQFNKPEDQMQKQQAMMQKYSEYGVSPLSGCLPMLIQMPIWIAMFNFVPNAIQLRGQSFLWMHDLSTFDPIFSWSHDVWLVGDHISLTCILFCGANLLYTWFTMQQQKDQMVGQQADQMKMMQWMMFGMPLFFFFMFNDYSSGLNFYYFISLFFSAAIMWALRKTTNEEKLLAILEARREERKNNPKNNMGSGLFARMQALQELQKQQQEELRRKQDELNKKKKGL >CP022041.2|ASE18836.1|1297077_1299237_-|S9-family-peptidase MNKNLTMAMTAALMMSGSVAQAQDVNIGRSNITLTSDLMTPETLWAMGRIGAAQASPDGKKIVYQVGYYSVKENKGHQVLRVMDADGKNDRLLTTSAKSEGDAAWVDNNTLAFLTGGQLWTMNADGTNRKQLTHSDIDIEGFRFSPDRKRVVLIKSIPYYGTIKQNPSDLPKATGMVITDMNYRHWDHYVTTNAHPFVADVTPEGIGAGIDVLEGEPYESPLAPFGGIEQIDWSKDSKFIAYTCRKKEGTQYAISTDADIYIYNVETRQTKNLCKPADYVEPKIDATKSMRNQAVNHQAGDMNVGYDVNPKFSPDGKYIAWQSMKNDGYESDRNRLCVYELATGKKTYVAESFDSNVDDYTWSLNSKDLYFIGVWHATVNVYQTNLKGEVKQLTEGDHNYVSISLLGDKKLLAIRQSISQANEIFAITPAKKEKASVQTQLSFENKHIYDQLALGDVKSRWVKTTDGKEMMEWVITPPHFDPNKKYPTLLFCEGGPQSPVSQFWSYRWNFQIMAANGYVIIAPNRRGLPGFGSAWNEEVSTDWTGQCMNDYLSAIDDAANNLSFVDKDRLGAVGASFGGFSVYYLAGIHNKRFKCFISHDGAFNLESMYTDTEEAWFSNWEYDDAYWNKDKSEAAKRTYANSPHLNVDKWDTPILCIHGEKDYRINANQGMGAFNAARLRGIPAELLLYPDENHWVLKPQNSVLWQRTFFNWLDRWLKK >CP022041.2|ASE18846.1|1316250_1317927_+|N-6-DNA-methylase MTSTELKDLEGRLWQSADMLRAGAHLAANKYSQPILGLIFLRYADVLFKQHKEAIDTAYNEYKGTRMERSYKDIAIEKCGFFLPECAYFDYLNDAPDDAQKALLVKAAMEAIEHENPRMDGVLPKEVYGQLVPEEEPELLSRIVRVFKDIPENISIDIFGQIYEYFLGNFALAEGQGGGAFYTPASVVQYMVEVLQPATGDKKFLDPACGSGGMFVQAARYMHRHNTSNEQMMNFRCYGVEKEPDTVKLAKMNLLLNNVRGEIMEANSFYSDPYNAVGQFDYVMANPPFNVDEVVVERVTDDARFNTYGVPRNKTKSAKKASDKKETVPNANYLWIGYFATALNEQGKAALVMANSASDAGGSELEIRKKMIEDGIISQMVTLPSNMFSTVTLPATLWFFNKKRPKKDEILFIDARNIFTQVDRAHRKFSDEQVKNLGIISRLYEGDSDAFWALVEEYKAEGKQSEADWLLERWPDGKYQDIVGLCKVAKLEGEDGIIDNDYSLNAGRYVGVVIEDDGMTEEEFRTEMLSLNSEFAKLSAEAKDLESEIEKNLKELLG >CP022041.2|ASE18847.1|1317940_1319164_+|restriction-endonuclease-subunit-S MEYVKFKDVIINSQYGYTATETSQTEGTYKYLRITDIVPYYVNFDTVPFCKITEKDVSKYIVKEGDILIARTGATTGYNYVVPSGISNTVYASYLIRFIVDKKLVLPLFMKYVLKTQSYYGFINNYIGGSAQPGMNAKVFTKFNIPKLSLVTQQKIASILSSYDRLIENNTRRIRLLEQMAENLYKEWFVRFRFPEHENVEIVNGLPKGWKTIHIKELAQLKSGYAFKSEWFVEEGEAVAKIKDIGNILMDTSNFSYVDKENCIKAKKFLLTTGDLTIALTGATIGKISIVPKHKGNIYTNQRLGKFFLGDNPMEKLPFLYCLFKQESMVSNIVNLSNSSSAQPNISPEQIEKIKILGNHDIISMYNKTCNPLFSNILALYSQNQLLTRQRDLLLPRLMSGKLEVKS >CP022041.2|ASE18848.1|1319179_1322380_+|type-I-restriction-endonuclease-subunit-R MKSFISEDDIEQTLCTRLSQPEFGWKRIECDPSVEAQDDVSKTGRRNSSECILPAVFLTALERLNPQIDKSILAQVVADFRKDYTGKDMMDTNYKFYNYLRNGINVKVKKNGKDDFDIVRLIDFDNVENNDFHCVNQMWIKGRIRYRRPDVLLFVNGLPMVFIELKNSTVKIKEAYEKNLVSYREDIPNIFALNQICVLSNGMQTKLGAWNSKYEFFFDWLKVNDEHEKLDREHIAEYGLSIINLIDSLFRKERLLDYIENFIFFDNKRKKIIAKNHQYLGVNNLMKSVECREELKGKLGVFWHTQGSGKSYSMVMFVRKVRRKLKGNFTFLVITDRDDLDTQIHKTFVRSEVIGEKEECQPKNAAQLREFLSGNKPMVFTLIHKFQYDKTKKYPLLSERNDIFVLVDEAHRTQYKQLAENMHTGLPNANYIAFTGTPLLGSKRLTNQWFGDYVSEYNFAQAIEDGSTVRLYYSRRVPEVGLENNWLDSDIDKIVEEEELNDRERELLENSSSRILEVIKRDGRLDRIAQDIAHHFPRRGFLGKGMVVSVDKYTAVKMYEKVQHYWGEEKKALIKERNAAKTQEERDELTARLDYMNKVEMAVIISEEADEVEKFKAQGLDITVHRNKMNEITPEGKDIEDRFKDKDDPLSLVFVCAMWLTGFDVPSLSTLYLDKPMKGHTLMQAIARANRVYPGKSCGIVVDYVNVFKYMQQALSDYASNGEEGAEFPAKDITQLIATIDGCIEECDSFLQGLGIRLDKIITDGDTLDKLESLRLAYDKILEKDESKNRFKVMSNLMMNLHDAAKPEIFELGWKNEKFSPLSYLNGLFCNRIDDEKLRRAKEKMSYTLDDSVTVMVAEDKPQYSIHQSKVIDLSKLDIESIRKSINATPYKSMEIDNLRTFIETALEQLINKNCTRVPFSQRYKNIIDTYNAGGTENEDYYEKLLQLIDELKKEQGRSADMGLREEELEIYDLLIQGRKLTKEEEKEVILASKNLYNKLVEEKERLLVVDWYKDPQPKTKVLGLIQRSLDKDLPKTYDREVFSNKTNLLLDHFVDMAVQGYGWIS >CP022041.2|ASE18849.1|1322765_1323911_-|DUF1735-domain-containing-protein MNKHIFKHIAVAFAACCVVSCQDSESDLLKQKVYFDSNLYKVEMPDSGSTLGVDITSRLSNKQDGAVDVSYSLADSSLVALYNSKYGTDYVALKHQNVTFSKASSTIAAGSIYADKVSLTLNNLDQLAEGKNYMLPIKLHSSSTPVIDGEDVEYIILAKPVKITKAGDFYNKYISVKFPAGTYFKSFTYEALVHSIWWGSNCTIMGSEGLMIFRVGDVGGGISSGILQAAGRQHYEAPEKLSINKWYHVALTYDQATGKTVMYLNGTKWAESAWNISGFDPNADVGFNIGKIPGFPWGERPFYGYMSEVRVWSVARSENQIKQNMLSVDPKSDGLELYFKLNGSETVNGNKIKDSAKGIECTTGGLEFTTLAQPLTMKDLQ >CP022041.2|ASE18850.1|1323951_1325070_-|endoglycosidase MKNFKYISSLVLLCAGVLFCGCSKMTEVENEPYDHIGGYNTMNNAESEKYYADLRAYKQQAVNYGRPVAFGWYSNWSPAGTYRRGYLTSMPDSMDVVSMWSGAPNRFNITPEQKKDKEFVQKVKGTKLLEVTLLSYIGKGRTPDSVYTAVEKQAEKEGWANDQARVEEAKKKARWKFWGYEGVAGSDNHKEALARFAKALCDSLVANDWDGYDIDWEIGSGVFDMDGTLSTNADLVYLVKEMNKYIGPKSDPEHKGHRLICIDGHFGGLTEDLDGYVDYWIDQAYGRTTHFDYYGVDPKTIITTDNFESSFKSGGQLLRQAKSMPSKGYKGGVGAYRFDNDYDNTPNYKWMRQAIQINQQVFKERMGQTTQP >CP022041.2|ASE18940.1|1325087_1326713_-|SusD/RagB-family-nutrient-binding-outer-membrane-lipoprotein MKTKAYKYIVGVLALSLFTACDFQKVNTNEFELLPEEGLMDGISIGGPITAMQKCVFPVGTQADGTSVANRYQTAYNLAADCWSGYFGQNNNWGGPNNLNYFLKDGWVASSYTESYSTVVPLWQDLKGKTETQFPEVFALAQILKISAWHKATDMFGPIPYKEAGKGLITVPYDSQEEVYKAMFKELSDAIEVLTKYADNGNSKLLPNADAVYAGDVHKWVVYANSLMLRLAMRVYYADAALSKKYALQAVNHSYGVMKTKDDEAKMERGASLEFKNNLDVLINQYNECRMGSSMLAYLGGYQDPRLPKYFNTSTVSQAVTVGTYGKYSGVPTGHDVSSNDAFRDSSRPAITSTTPTYWMRASEVYFLLAEAALHGFAVGGTAESLYEKGIEMSFEENGIASSEVADYMSSGLKPSAYSFHLTNPGVNVDVPAVTEATTAWSGTDEEKLEKIMIQKWIALYPNGQEAWTEYRRTGYPKLHSVVTNYSNGEIDSEVGIRRMRFPTNKSTSAEDIANLESARKLLRGGLDKAGTRLWWDNKNH >CP022041.2|ASE18851.1|1326719_1329830_-|SusC/RagA-family-TonB-linked-outer-membrane-protein MRMIHETKQKHLYFSVAFALSIALAPTSVYAVGNPVGSPDASMPQAVQQNGNHKVTGRVVDSAGEPLIGATIMVEGTKEGAVTDIDGNFTINTTSKAKLVISYVGYTTQTIPVGDKTTIDVTLKEVANTMNEVVVTALGIKRAEKALSYNVQSVGSNELTRNKDANFVNSLNGKVAGVSISKSASGVGGATRVIMRGAKSIEGDNNVLYVIDGIPIFNFSGGRDSGIMGEGRVSSEGIADLNPEDIESISVLAGPSAAALYGSNAANGAILITTKKGKEGRVDISFSSSADFSSPLLMPKFQNTYGNKLGSYESWGEKLATPSSYDPKKDFFRTGTNFINALTLNMGNEFNQTFASVATTNSRGIVPNNTYDRYNFTIRNTTRMFKNRVQLDLGASYIKQKDNNMVSQGEYWNPIVAAYLFPRGESFEGIKTFERYDNVRNFPTQYWPISDSRFANQNPYWTAYRNLAPDDKDRFMFNAGLTYNIFDWLSVAGRIRLDKTFITSERKIYASSFNYFAKEKGAYDYYDYKDHQTYIDAIANINKTFGKFSLAANVGYSYSDYASLTRGYGGNLVLVPNKFSLNNINPTDSKIREAGGDSKVRNVAAFASAELGWRSMVYLTLTGRNDWNSRLVNSSEESFFYPSVGLSGIISEMTKLPSFISYLKVRGSYTEVGSPVSRSGMTPGTITTPIVGGSLKSTDIYPFTDYKAERTKSYEFGLTARFWKKLSFDFTWYKSNTYNQTFIGELPESSGYKAVYLQAGNVENRGVEMALGYSDNFGGLQWNSSLVYSKNVNEIKEMVKDYHHPLSPKPINIPEVSKDNGRVLLKVGGSINDIYARKVLAKDNQGFVNVSPSGGMNLETVEPIYLGKTTPDFTMGWNNNFTYKNFGLSFLINARVGGIVTSSTQALLDRFGVSKASADARDAGGVMIPNQGLYDAKKYYTLVATGENDLAGYYTYSATNVRLQELTLSYKFNSKLFNNVIKDLTLSFVATNPWMIYCKAPFDPELTASTGTYGQGNDYFMQPSLKSYGFSVKFKF >CP022041.2|ASE18852.1|1330183_1331287_-|MRP-family-ATP-binding-protein MTLYPKLITDALEKVIYPGTKKNIIESEMLADTPSINGNKVSFTLIFPRETDPFLKSTIKAAEAQIHYSVGKEVEVTITTEFKNAPRPEVGKLLPQVKNIIAVSSGKGGVGKSTVSANLAIALARLGYKVGLLDTDIFGPSMPKMFGVEDARPYGVEKDGRQLIEPVEKYGVKLLSIGFFVNPDTATLWRGSMATSALKQLIADADWGELDYFILDTPPGTSDIHLTLMQTLAITGAVIVSTPQNVALADARKGIDMYRNDKVNIPILGLVENMAWFTPAELPENKYYIFGKDGCKNLAKELGCPLLAQIPIVQSICENGDNGTPAASQVDTITGQSFLSLAQSVVTVVNRRNKEQAPTKIVDVKNG >CP022041.2|ASE18853.1|1331417_1331795_-|hypothetical-protein MGRNKFAEDEIKEIAKLLRLKNAGNRAKQKLVRHDLRTIYEFNISDFNEPGKAFGEEELQGAIQRGAIQILDDATIADMKAKRARDKARDEATREQQAIEAGEMTDWKAVAKEWEEWENSQNNGN >CP022041.2|ASE18854.1|1333136_1333403_+|hypothetical-protein MRQAISIIVLYALINILSLLFNIISFSTDGKIDIGFPFIFIHLTSAKYDPTFPYNKLLIEPFLWDIIFLFTIVCLYLCCLKFIKRIRD |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP022041_12 | 1322540-1322693 | Orphan |
NA
Consensus repeat of CP022041_12
|
1 spacers
spacers of CP022041_12
>12.1|1322577|80|CP022041|CRISPRCasFinder TGTTCAAAGGATAAAAAACTCATATCATAAAGTTCAAAGTTCAAACATCAATGTTCAAAGGATAAAAGAGTTCAAAGGAT |
CRISPR arrays and Neighbor proteins around CP022041_12
The CRISPR arrays of CP022041_12 >merge|CP022041|12|1322540-1322693|CRISPRCasFinder AAAACTCATATCATAAAGTTCAAAGTTCAAACATCAATGTTCAAAGGATAAAAAACTCATATCATAAAGTTCAAAGTTCAAACATCAATGTTCAAAGGATAAAAGAGTTCAAAGGATAAAACTCATATCATAAAGTTCAAAGTTCAAACCTCAA >CP022041|12|11|1322540-1322693|CRISPRCasFinder AAAACTCATATCATAAAGTTCAAAGTTCAAACATCAA TGTTCAAAGGATAAAAAACTCATATCATAAAGTTCAAAGTTCAAACATCAATGTTCAAAGGATAAAAGAGTTCAAAGGAT AAAACTCATATCATAAAGTTCAAAGTTCAAACCTCAA
>CP022041.2|ASE18848.1|1319179_1322380_+|type-I-restriction-endonuclease-subunit-R MKSFISEDDIEQTLCTRLSQPEFGWKRIECDPSVEAQDDVSKTGRRNSSECILPAVFLTALERLNPQIDKSILAQVVADFRKDYTGKDMMDTNYKFYNYLRNGINVKVKKNGKDDFDIVRLIDFDNVENNDFHCVNQMWIKGRIRYRRPDVLLFVNGLPMVFIELKNSTVKIKEAYEKNLVSYREDIPNIFALNQICVLSNGMQTKLGAWNSKYEFFFDWLKVNDEHEKLDREHIAEYGLSIINLIDSLFRKERLLDYIENFIFFDNKRKKIIAKNHQYLGVNNLMKSVECREELKGKLGVFWHTQGSGKSYSMVMFVRKVRRKLKGNFTFLVITDRDDLDTQIHKTFVRSEVIGEKEECQPKNAAQLREFLSGNKPMVFTLIHKFQYDKTKKYPLLSERNDIFVLVDEAHRTQYKQLAENMHTGLPNANYIAFTGTPLLGSKRLTNQWFGDYVSEYNFAQAIEDGSTVRLYYSRRVPEVGLENNWLDSDIDKIVEEEELNDRERELLENSSSRILEVIKRDGRLDRIAQDIAHHFPRRGFLGKGMVVSVDKYTAVKMYEKVQHYWGEEKKALIKERNAAKTQEERDELTARLDYMNKVEMAVIISEEADEVEKFKAQGLDITVHRNKMNEITPEGKDIEDRFKDKDDPLSLVFVCAMWLTGFDVPSLSTLYLDKPMKGHTLMQAIARANRVYPGKSCGIVVDYVNVFKYMQQALSDYASNGEEGAEFPAKDITQLIATIDGCIEECDSFLQGLGIRLDKIITDGDTLDKLESLRLAYDKILEKDESKNRFKVMSNLMMNLHDAAKPEIFELGWKNEKFSPLSYLNGLFCNRIDDEKLRRAKEKMSYTLDDSVTVMVAEDKPQYSIHQSKVIDLSKLDIESIRKSINATPYKSMEIDNLRTFIETALEQLINKNCTRVPFSQRYKNIIDTYNAGGTENEDYYEKLLQLIDELKKEQGRSADMGLREEELEIYDLLIQGRKLTKEEEKEVILASKNLYNKLVEEKERLLVVDWYKDPQPKTKVLGLIQRSLDKDLPKTYDREVFSNKTNLLLDHFVDMAVQGYGWIS >CP022041.2|ASE18847.1|1317940_1319164_+|restriction-endonuclease-subunit-S MEYVKFKDVIINSQYGYTATETSQTEGTYKYLRITDIVPYYVNFDTVPFCKITEKDVSKYIVKEGDILIARTGATTGYNYVVPSGISNTVYASYLIRFIVDKKLVLPLFMKYVLKTQSYYGFINNYIGGSAQPGMNAKVFTKFNIPKLSLVTQQKIASILSSYDRLIENNTRRIRLLEQMAENLYKEWFVRFRFPEHENVEIVNGLPKGWKTIHIKELAQLKSGYAFKSEWFVEEGEAVAKIKDIGNILMDTSNFSYVDKENCIKAKKFLLTTGDLTIALTGATIGKISIVPKHKGNIYTNQRLGKFFLGDNPMEKLPFLYCLFKQESMVSNIVNLSNSSSAQPNISPEQIEKIKILGNHDIISMYNKTCNPLFSNILALYSQNQLLTRQRDLLLPRLMSGKLEVKS >CP022041.2|ASE18846.1|1316250_1317927_+|N-6-DNA-methylase MTSTELKDLEGRLWQSADMLRAGAHLAANKYSQPILGLIFLRYADVLFKQHKEAIDTAYNEYKGTRMERSYKDIAIEKCGFFLPECAYFDYLNDAPDDAQKALLVKAAMEAIEHENPRMDGVLPKEVYGQLVPEEEPELLSRIVRVFKDIPENISIDIFGQIYEYFLGNFALAEGQGGGAFYTPASVVQYMVEVLQPATGDKKFLDPACGSGGMFVQAARYMHRHNTSNEQMMNFRCYGVEKEPDTVKLAKMNLLLNNVRGEIMEANSFYSDPYNAVGQFDYVMANPPFNVDEVVVERVTDDARFNTYGVPRNKTKSAKKASDKKETVPNANYLWIGYFATALNEQGKAALVMANSASDAGGSELEIRKKMIEDGIISQMVTLPSNMFSTVTLPATLWFFNKKRPKKDEILFIDARNIFTQVDRAHRKFSDEQVKNLGIISRLYEGDSDAFWALVEEYKAEGKQSEADWLLERWPDGKYQDIVGLCKVAKLEGEDGIIDNDYSLNAGRYVGVVIEDDGMTEEEFRTEMLSLNSEFAKLSAEAKDLESEIEKNLKELLG >CP022041.2|ASE18845.1|1313538_1315113_+|hypothetical-protein MKRTLFSICALVLSLTASAQIIKDTPKGKLIENLYRSSKSWVKKGWTGVQPGRYEGLVSKIVIGEDGCIYIYNPLSGLDSKSWLKLERQPDGKYRAKLPQDILTDDYGGDDDEEESSERTISLTRLVSSDDGKNYEPIGANNYVDFTVEGRTLKMSGMGQKKQIWGASFNNKWERNYGGDWALTIEPLKEQLITPPATATKSQYTVTSKSDPSPRIVEVMTNNNDIYVKGLFKAEKLANVWVKLTKQGDKAVMPTNQYLGITKKTDFKKYDSDKSEYHTFAAAFESETKAAENLEFSIDATGKLTASKILRTSLGRASNDNITGEDYIESYEGLTLTPYVQKEVGAPATPEYFYLTSTPNYDNTSNEIKLAFYVKNADINGNVLDPEKMYYNVYINGSTEPFKFKKSASQYNDMHEEEMTNIPFNYKDKRNYDFKVIDNLRILHFYDSSITRLKVVMVYESDGKKYSSEPMVASLTTDGIESANFNKTTTEKYYTVDGRQIQKLQKGLNIIKSSDGTTRKVVVK >CP022041.2|ASE18844.1|1312540_1313494_+|hypothetical-protein MNTKHIILMACAALLGATQANAQGQDLSILTANTDARTAAMGNASAAAEGMYLYNNPAAFFATDKKFTADASASLFEKAEGADGTFGIYALSAGYKLAKRHAVFVGFRYAGGLSLKGSDLLGNPTKDYKPYNWTLDLGYTYFLGKGFATYATGSLIYSHLSKNATGAVFSVGGAYQNNELTLANKPANLMLDAKVGAIGPQLDYGNKHKTTLPTYLAVGGALSVEVAEKHQVAAALSSRYFFQPSEAKLFMLGGGLEYTYNKMVSVRAGYEYGDHDLSHVTMGAGFKYHGLRLNGAYNLKTADTGSSYCTIGIGYDF >CP022041.2|ASE18843.1|1309772_1312529_+|T9SS-C-terminal-target-domain-containing-protein MRKSIIYLCACALSGMMVTTSCQDNLDSDAGVSNTRSVNIDKDLFAIKGCINVKLAKGTNQAIPTTRSGSVEMQSVPSAMTSAMQYSGAYKMERVFKPAGIYEARTVAEGLDRWYTIYFDKSKDVAAVLDQFKKAEGVECAEQVLPMARPTVKMTPYSPSGASMQATASTFDDPLLAKQWHYYNDGSVNARAKKGADCNVKPVWEKYTTGKKNVIVAVVDGGIDITHEDLKDNLYVNEKEKNGQPNVDDDGNGFVDDIYGYNFVTAKDVVGGTIEPDDGGHGTHVAGTVAARNNNGKGVAGIAGGDGSPDSGVRLLSCQIFRNKDEQGDAAAAIKYAADNGAVICQNSWGYSSTAGVTSMPQLLKEAVDYFIKMAGCDANGNQRPDSPMKGGVVMFAAGNENKEFSAYPACYAPTVSVAAMAWDFSKASYSNYAKWVTITAPGGDQDRFGTEAGVLSTVPKKKVASGYAYFQGTSMACPHVSGIAALIASYFGRQGFTNEELKSRLITAYRPYNIDEQNPTYKGKLGKGYIDAEAAFETDTKIAPEKVGTLTLKPDFVDINAEWSIAKDEDKTAAFYRLYIAQGELTADKLKDMTYREINGMGHSLGETLKYDFDDLKDNTTYSVAVVAVDRWGNLSEPMIQKCTTRLNHAPEATNFPTEAIEVMENDRKSFSFNVADPDGHNWDIKATGETKGVSYTVKGNTVTVNLVPVLEAGSYNCTFILSDDLGAKAEKSFTFKIVKYIPPQLTKPFENYIIGLDEGVVTIPLTGHYTHSGNTQLTYKATAANGSIATATISNDNLQLKGMAKGVTRISIAATDGRETSSDGSFQVRVVEKKSAPVYAVYPIPVQRDIHTLLNPEVKQAELVISSTVGERLMKATVTPDKNNVATLDLSKLNPGTYKLTVYTSKGNHTQMFIKR >CP022041.2|ASE18842.1|1308155_1309757_+|hypothetical-protein MRIINIHKLLQFSLLYLLLGIAVGCAENDSFDQPYLNVSEKEISFSNQIGEKTITVNTNCKEWMATTPKAWVHLSQSGNEIAVHVDPNTTGMERSSYILVDGGLAVQKIMVSQSAADITLNLNNGEVILPQAGGTTTVDLKMDATSYDLTQSEQPEWMQVIKKKHGLKFISKPNYSTTERTTKLTIAFAGKNHEVVVKQPGVATFILACNPGNPYSLHKMMDYEYRRGSFLTEYGGPDEVNGIFEESYFFKTPSPLFKDVVYVHDTKHSVPTRIYTRSLTREGVNAVKSQAFQEFMRANGYTRDEKDTNHYVNIKEAFTMDVDIREENNSVVLFFYQMHTQDRSYPTFSSLDLGPVDLLNKTDKKISDVEAYETGKNSEEMKRQMSKSNEVEAILYKTNDPTLIARTYFFYLHNDAAVPQEKAGSVEQYSLFYSQPNLGIWQYGNEWFVTHEFDKLLTANNFEFVGYNGKHHVYARRSDYLTLAISGGEYADVNNGKAVMQITVLYKPTVFAGSKEQRLAKVERMLKQYNPKK >CP022041.2|ASE18841.1|1306499_1308119_+|hypothetical-protein MELKKTLYSLMLGGILLCLFSCAKEDEFEMPTLVLSENSIAFDKGVSERNISVTTNQNSWIASSPQEGDWLSLVQDGNVLKVKVTENKIGTERTSYVLVNANGASGKIAVTQSAADVTLDVVPNAIYLPQTGGEKTIDITTNSSVYDVTTSEEVSWLKIVKSEEEIKLIAERNDTYQKREVKLYAKSGSVIREIVVSQSGIQRYLLPIHPGVPQDEHKIMDFELGRGSYLREYQAAMPAYGLEETYTFITASPIFTLIQYCSTDGINPSQIIMIGDGRKAIDAVKDKAFDKFLTDNGYVRSNSQSDREYTNDKDLLSLKVYISEKENNEGVNLTFTPIMKQNGEYKTFSKLPFYPLELLQKDNVKLAQIEQYEQKAGSTEEERSLNEHKNTEVSQIQYKLKASTDPSAAYGRIHIFYTTDKDGDAPDNLGSVQIGALLFKDTNLGVWKYGTKWVATKEIKKVLGDEGFSFLRTSGNNHFFVRESDHLVIDVTCVLDNAMPVLALLYSYDPSVSGASSKAIKTQAKMIRNFAAAKKALKF >CP022041.2|ASE18840.1|1304752_1305565_+|PepSY-domain-containing-protein MRKFCLKIHRWFALPLGVIMAILCFSGLAILLIKDLAPLFDMNAKEMPIYTIVVRLHRWLFMKPENAHEGGQSLGRILTAVSAICMSIVLLSGVVIWWPKTKKALKSRLTVSTNKGFRRFVYDSHVSLGIYVFIFLFLMALTGPVFSFGWYRAGMSKLFGQPMPPKEMKMQQPKDGMKQGGTNDKAFAPTDASQMKGQPQAHKEGAKDMKGDQHGKKPKGGKLFKQLHTGTWGGWFSRVLYAIAALIGGFLPISGYYLWWKRRSAKKKKA >CP022041.2|ASE18839.1|1303820_1304546_+|KR-domain-containing-protein MKKAIVVGASSGIGHEVARLLIAEGWAVGVAARRIDKLTDLQAMAPERVYTVQIDVTNEDAETSLLQLIERMNGIDLYFHAAGIGWQNPSLNADIELKTMETNAVGFTRMIGCAYRYFANKGGGHIACITSIAGTKGLGPAPAYSATKAMQNTYLQALEQLAACKHHNIHFTDIRPGFVDTPLLAGTSHLPMLMTTKKVARSIIKAINSRRHICVIDSRWCVLTYLWRHIPNWIWRRMKLC >CP022041.2|ASE18849.1|1322765_1323911_-|DUF1735-domain-containing-protein MNKHIFKHIAVAFAACCVVSCQDSESDLLKQKVYFDSNLYKVEMPDSGSTLGVDITSRLSNKQDGAVDVSYSLADSSLVALYNSKYGTDYVALKHQNVTFSKASSTIAAGSIYADKVSLTLNNLDQLAEGKNYMLPIKLHSSSTPVIDGEDVEYIILAKPVKITKAGDFYNKYISVKFPAGTYFKSFTYEALVHSIWWGSNCTIMGSEGLMIFRVGDVGGGISSGILQAAGRQHYEAPEKLSINKWYHVALTYDQATGKTVMYLNGTKWAESAWNISGFDPNADVGFNIGKIPGFPWGERPFYGYMSEVRVWSVARSENQIKQNMLSVDPKSDGLELYFKLNGSETVNGNKIKDSAKGIECTTGGLEFTTLAQPLTMKDLQ >CP022041.2|ASE18850.1|1323951_1325070_-|endoglycosidase MKNFKYISSLVLLCAGVLFCGCSKMTEVENEPYDHIGGYNTMNNAESEKYYADLRAYKQQAVNYGRPVAFGWYSNWSPAGTYRRGYLTSMPDSMDVVSMWSGAPNRFNITPEQKKDKEFVQKVKGTKLLEVTLLSYIGKGRTPDSVYTAVEKQAEKEGWANDQARVEEAKKKARWKFWGYEGVAGSDNHKEALARFAKALCDSLVANDWDGYDIDWEIGSGVFDMDGTLSTNADLVYLVKEMNKYIGPKSDPEHKGHRLICIDGHFGGLTEDLDGYVDYWIDQAYGRTTHFDYYGVDPKTIITTDNFESSFKSGGQLLRQAKSMPSKGYKGGVGAYRFDNDYDNTPNYKWMRQAIQINQQVFKERMGQTTQP >CP022041.2|ASE18940.1|1325087_1326713_-|SusD/RagB-family-nutrient-binding-outer-membrane-lipoprotein MKTKAYKYIVGVLALSLFTACDFQKVNTNEFELLPEEGLMDGISIGGPITAMQKCVFPVGTQADGTSVANRYQTAYNLAADCWSGYFGQNNNWGGPNNLNYFLKDGWVASSYTESYSTVVPLWQDLKGKTETQFPEVFALAQILKISAWHKATDMFGPIPYKEAGKGLITVPYDSQEEVYKAMFKELSDAIEVLTKYADNGNSKLLPNADAVYAGDVHKWVVYANSLMLRLAMRVYYADAALSKKYALQAVNHSYGVMKTKDDEAKMERGASLEFKNNLDVLINQYNECRMGSSMLAYLGGYQDPRLPKYFNTSTVSQAVTVGTYGKYSGVPTGHDVSSNDAFRDSSRPAITSTTPTYWMRASEVYFLLAEAALHGFAVGGTAESLYEKGIEMSFEENGIASSEVADYMSSGLKPSAYSFHLTNPGVNVDVPAVTEATTAWSGTDEEKLEKIMIQKWIALYPNGQEAWTEYRRTGYPKLHSVVTNYSNGEIDSEVGIRRMRFPTNKSTSAEDIANLESARKLLRGGLDKAGTRLWWDNKNH >CP022041.2|ASE18851.1|1326719_1329830_-|SusC/RagA-family-TonB-linked-outer-membrane-protein MRMIHETKQKHLYFSVAFALSIALAPTSVYAVGNPVGSPDASMPQAVQQNGNHKVTGRVVDSAGEPLIGATIMVEGTKEGAVTDIDGNFTINTTSKAKLVISYVGYTTQTIPVGDKTTIDVTLKEVANTMNEVVVTALGIKRAEKALSYNVQSVGSNELTRNKDANFVNSLNGKVAGVSISKSASGVGGATRVIMRGAKSIEGDNNVLYVIDGIPIFNFSGGRDSGIMGEGRVSSEGIADLNPEDIESISVLAGPSAAALYGSNAANGAILITTKKGKEGRVDISFSSSADFSSPLLMPKFQNTYGNKLGSYESWGEKLATPSSYDPKKDFFRTGTNFINALTLNMGNEFNQTFASVATTNSRGIVPNNTYDRYNFTIRNTTRMFKNRVQLDLGASYIKQKDNNMVSQGEYWNPIVAAYLFPRGESFEGIKTFERYDNVRNFPTQYWPISDSRFANQNPYWTAYRNLAPDDKDRFMFNAGLTYNIFDWLSVAGRIRLDKTFITSERKIYASSFNYFAKEKGAYDYYDYKDHQTYIDAIANINKTFGKFSLAANVGYSYSDYASLTRGYGGNLVLVPNKFSLNNINPTDSKIREAGGDSKVRNVAAFASAELGWRSMVYLTLTGRNDWNSRLVNSSEESFFYPSVGLSGIISEMTKLPSFISYLKVRGSYTEVGSPVSRSGMTPGTITTPIVGGSLKSTDIYPFTDYKAERTKSYEFGLTARFWKKLSFDFTWYKSNTYNQTFIGELPESSGYKAVYLQAGNVENRGVEMALGYSDNFGGLQWNSSLVYSKNVNEIKEMVKDYHHPLSPKPINIPEVSKDNGRVLLKVGGSINDIYARKVLAKDNQGFVNVSPSGGMNLETVEPIYLGKTTPDFTMGWNNNFTYKNFGLSFLINARVGGIVTSSTQALLDRFGVSKASADARDAGGVMIPNQGLYDAKKYYTLVATGENDLAGYYTYSATNVRLQELTLSYKFNSKLFNNVIKDLTLSFVATNPWMIYCKAPFDPELTASTGTYGQGNDYFMQPSLKSYGFSVKFKF >CP022041.2|ASE18852.1|1330183_1331287_-|MRP-family-ATP-binding-protein MTLYPKLITDALEKVIYPGTKKNIIESEMLADTPSINGNKVSFTLIFPRETDPFLKSTIKAAEAQIHYSVGKEVEVTITTEFKNAPRPEVGKLLPQVKNIIAVSSGKGGVGKSTVSANLAIALARLGYKVGLLDTDIFGPSMPKMFGVEDARPYGVEKDGRQLIEPVEKYGVKLLSIGFFVNPDTATLWRGSMATSALKQLIADADWGELDYFILDTPPGTSDIHLTLMQTLAITGAVIVSTPQNVALADARKGIDMYRNDKVNIPILGLVENMAWFTPAELPENKYYIFGKDGCKNLAKELGCPLLAQIPIVQSICENGDNGTPAASQVDTITGQSFLSLAQSVVTVVNRRNKEQAPTKIVDVKNG >CP022041.2|ASE18853.1|1331417_1331795_-|hypothetical-protein MGRNKFAEDEIKEIAKLLRLKNAGNRAKQKLVRHDLRTIYEFNISDFNEPGKAFGEEELQGAIQRGAIQILDDATIADMKAKRARDKARDEATREQQAIEAGEMTDWKAVAKEWEEWENSQNNGN >CP022041.2|ASE18854.1|1333136_1333403_+|hypothetical-protein MRQAISIIVLYALINILSLLFNIISFSTDGKIDIGFPFIFIHLTSAKYDPTFPYNKLLIEPFLWDIIFLFTIVCLYLCCLKFIKRIRD >CP022041.2|ASE18855.1|1333821_1334337_+|hypothetical-protein MILRNTKRIINYCVLMIGLGVWGSCNSKSQHVLLPVDNVKSYKICENNDTTTIFEEREEESEEVIKFYKEGSEIFTSDYGGRKELLMSTTEMLDTVYSGNTYCREHRILIKKENSNLFSTSIYNIIIHPVLVLTIYYDHSYNIKAIRNWFAFTTYESEHISIPLISRPVKY >CP022041.2|ASE18856.1|1335329_1335629_+|hypothetical-protein MKIKNILAVFIIVGGISYALYFSLVTDVLLKYGETIHTKAIIEERLTGKTSDPVLRYRFLYENQAYIGFVSETSRLHVSDTINIVFLKSRPSINKPLIK >CP022041.2|ASE18857.1|1335636_1335981_+|hypothetical-protein MKKNLLFSFDDEITTGVYYDIDGHKIVVYFAAYYDNGRFVEKKCQLIIEKWEYAKSKLSVDNRYKDLEDHIGIISMILDMHIADKKLFLTVNTLDGQYVDLLFYNCNVKIEDIS |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CP022041_3 | 3.1|351643|57|CP022041|CRISPRCasFinder | 351643-351699 | 57 | CP022041.2 | 490724-490780 | 0 | 1.0 |
CP022041_3 | 3.1|351643|57|CP022041|CRISPRCasFinder | 351643-351699 | 57 | CP022041.2 | 490808-490864 | 0 | 1.0 |
CP022041_5 | 5.2|479165|28|CP022041|PILER-CR | 479165-479192 | 28 | CP022041.2 | 476356-476383 | 2 | 0.929 |
1. spacer 3.1|351643|57|CP022041|CRISPRCasFinder matches to position: 490724-490780, mismatch: 0, identity: 1.0
acttattgctatataattcgtgtcattcgcgtaattcgcagtcaactttttgattag CRISPR spacer acttattgctatataattcgtgtcattcgcgtaattcgcagtcaactttttgattag Protospacer *********************************************************
2. spacer 3.1|351643|57|CP022041|CRISPRCasFinder matches to position: 490808-490864, mismatch: 0, identity: 1.0
acttattgctatataattcgtgtcattcgcgtaattcgcagtcaactttttgattag CRISPR spacer acttattgctatataattcgtgtcattcgcgtaattcgcagtcaactttttgattag Protospacer *********************************************************
3. spacer 5.2|479165|28|CP022041|PILER-CR matches to position: 476356-476383, mismatch: 2, identity: 0.929
agcaacttgttaacttgtctactcgttg CRISPR spacer agcaacttgttcacttgtccactcgttg Protospacer *********** *******.********
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
CP022041_5 | 5.2|479165|28|CP022041|PILER-CR | 479165-479192 | 28 | MK552327 | Pseudomonas phage Psa21, complete genome | 200921-200948 | 7 | 0.75 |
CP022041_8 | 8.1|515392|29|CP022041|CRISPRCasFinder | 515392-515420 | 29 | NZ_CP020004 | Bacillus thuringiensis strain Bacillus thuringiensis L-7601 plasmid unnamed2, complete sequence | 38937-38965 | 7 | 0.759 |
CP022041_8 | 8.1|515392|29|CP022041|CRISPRCasFinder | 515392-515420 | 29 | NC_048762 | Bacillus phage vB_BtS_B83, complete genome | 5072-5100 | 7 | 0.759 |
CP022041_8 | 8.1|515392|29|CP022041|CRISPRCasFinder | 515392-515420 | 29 | KY290956 | Aeromonas phage L9-6, complete genome | 93770-93798 | 8 | 0.724 |
CP022041_8 | 8.1|515392|29|CP022041|CRISPRCasFinder | 515392-515420 | 29 | KY290951 | Aeromonas phage 31.2, complete genome | 93171-93199 | 8 | 0.724 |
CP022041_8 | 8.1|515392|29|CP022041|CRISPRCasFinder | 515392-515420 | 29 | NC_005135 | Aeromonas phage 44RR2.8t, complete genome | 93412-93440 | 8 | 0.724 |
CP022041_8 | 8.1|515392|29|CP022041|CRISPRCasFinder | 515392-515420 | 29 | AY962392 | Aeromonas phage 31, complete genome | 93174-93202 | 8 | 0.724 |
CP022041_8 | 8.1|515392|29|CP022041|CRISPRCasFinder | 515392-515420 | 29 | KY290958 | Aeromonas phage SW69-9, complete genome | 93306-93334 | 8 | 0.724 |
CP022041_8 | 8.1|515392|29|CP022041|CRISPRCasFinder | 515392-515420 | 29 | AY375531 | Bacteriophage 44RR2.8t, complete genome | 93412-93440 | 8 | 0.724 |
CP022041_8 | 8.1|515392|29|CP022041|CRISPRCasFinder | 515392-515420 | 29 | KY290948 | Aeromonas phage 44RR2.8t.2, complete genome | 93412-93440 | 8 | 0.724 |
CP022041_8 | 8.1|515392|29|CP022041|CRISPRCasFinder | 515392-515420 | 29 | KY290957 | Aeromonas phage Riv-10, complete genome | 94507-94535 | 8 | 0.724 |
1. spacer 5.2|479165|28|CP022041|PILER-CR matches to MK552327 (Pseudomonas phage Psa21, complete genome) position: , mismatch: 7, identity: 0.75
agcaacttgttaacttgtctactcgttg CRISPR spacer gtcaacttgctaacttgtctactactga Protospacer . *******.************* * .
2. spacer 8.1|515392|29|CP022041|CRISPRCasFinder matches to NZ_CP020004 (Bacillus thuringiensis strain Bacillus thuringiensis L-7601 plasmid unnamed2, complete sequence) position: , mismatch: 7, identity: 0.759
agtaacagatgaacaggtgtttagtggac CRISPR spacer agtaacagaggaacaagtgtttatgcctc Protospacer ********* *****.******* *
3. spacer 8.1|515392|29|CP022041|CRISPRCasFinder matches to NC_048762 (Bacillus phage vB_BtS_B83, complete genome) position: , mismatch: 7, identity: 0.759
agtaacagatgaacaggtgtttagtggac CRISPR spacer agtaacagaggaacaagtgtttatgcctc Protospacer ********* *****.******* *
4. spacer 8.1|515392|29|CP022041|CRISPRCasFinder matches to KY290956 (Aeromonas phage L9-6, complete genome) position: , mismatch: 8, identity: 0.724
agtaacagatgaacaggtgtttagtggac CRISPR spacer agtaacagaagaacaggtgttcgactgca Protospacer ********* ***********.... *
5. spacer 8.1|515392|29|CP022041|CRISPRCasFinder matches to KY290951 (Aeromonas phage 31.2, complete genome) position: , mismatch: 8, identity: 0.724
agtaacagatgaacaggtgtttagtggac CRISPR spacer agtaacagaagaacaggtgttcgactgca Protospacer ********* ***********.... *
6. spacer 8.1|515392|29|CP022041|CRISPRCasFinder matches to NC_005135 (Aeromonas phage 44RR2.8t, complete genome) position: , mismatch: 8, identity: 0.724
agtaacagatgaacaggtgtttagtggac CRISPR spacer agtaacagaagaacaggtgttcgactgca Protospacer ********* ***********.... *
7. spacer 8.1|515392|29|CP022041|CRISPRCasFinder matches to AY962392 (Aeromonas phage 31, complete genome) position: , mismatch: 8, identity: 0.724
agtaacagatgaacaggtgtttagtggac CRISPR spacer agtaacagaagaacaggtgttcgactgca Protospacer ********* ***********.... *
8. spacer 8.1|515392|29|CP022041|CRISPRCasFinder matches to KY290958 (Aeromonas phage SW69-9, complete genome) position: , mismatch: 8, identity: 0.724
agtaacagatgaacaggtgtttagtggac CRISPR spacer agtaacagaagaacaggtgttcgactgca Protospacer ********* ***********.... *
9. spacer 8.1|515392|29|CP022041|CRISPRCasFinder matches to AY375531 (Bacteriophage 44RR2.8t, complete genome) position: , mismatch: 8, identity: 0.724
agtaacagatgaacaggtgtttagtggac CRISPR spacer agtaacagaagaacaggtgttcgactgca Protospacer ********* ***********.... *
10. spacer 8.1|515392|29|CP022041|CRISPRCasFinder matches to KY290948 (Aeromonas phage 44RR2.8t.2, complete genome) position: , mismatch: 8, identity: 0.724
agtaacagatgaacaggtgtttagtggac CRISPR spacer agtaacagaagaacaggtgttcgactgca Protospacer ********* ***********.... *
11. spacer 8.1|515392|29|CP022041|CRISPRCasFinder matches to KY290957 (Aeromonas phage Riv-10, complete genome) position: , mismatch: 8, identity: 0.724
agtaacagatgaacaggtgtttagtggac CRISPR spacer agtaacagaagaacaggtgttcgactgca Protospacer ********* ***********.... *
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
220916 : 251291
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP022041|220916:251291|DBSCAN-SWA CATGAAGAATCACCAATTATTACGTTGCATCTTTCCAGATGTACTTGCTGACTACTTTGATGTTGTCGATATTCAAGAGAGTGTTTCGCAGATTGACTTTTGGCTTGACGAGCGTAACTTTATGGAAAAGTCAGATCATAAGTTAGGCACTGTAAGCAGTTATGGTTTTACCAGCGAGCGTGTTATTCAGGACTTCCCCCTTCGTGGCAAAGCAGTTTACCGCCATGTTCGTCGTCGCAAGTGGCGTGACAGTTCCAACGGAGAGATATTTACTTATTCATATGATGATTTGACGGCTGAGGGTAGTAGACTATCTCCCGAGTTCGTTTCTTTTTTAAAAGAATAGAATTGAGTCCACTGCAGAGAGCATCGCAAGCATTGGTTCGCACTATGGCGTAAATGGCAAGCTGTTATCCACACAGTACAAGGAACATTTCAGTGATTACCGTAGCTGGGATCAGTTGGATCATGCTCAAGACTGGCTATTGTTTGAAGATAACATAGGAGAGAGTCTAAGTATTGATGAGACCTGTCTAAGCTGTGGCGAGGTTTATACTTTTCTGACCAACAAGGCAGGAAAGGGCAGAGAAGGGACTTTGGTAGCTGTGGTTAAAGGGACCAAGGCAGAGGACGTCATCCAGATTCTCAAGAAGATAAACCTTTCTAAACGAAAGACTGTCAAGGAGATAACACTCGACTTATCATCTTCTATGATGCGCATAGCCCGTGCTGTTTTTCCCAAAGCACTTATTACCAATGATAGATTTCATGTGCAGAAACTATATTATGATGCTCTGGATGACATGCGTATCGCTTACCGATGGATGGCAAGAGATAAAGAGAATGAGGAGATAAAAGAAGCTAAATGTAAGGGTAAGGAATATATACCATTTAGATACAGCAATGGTGACACGCGTAAGCAGTTGCTTGCCAGAGCAAAGTTCATATTGACCAAGCACAAGACCAAGTGGACTGAAACACAGAAAGGTAGGGCAGAAATCATCTTTGAACATTATCCGACACTGAAAAAGGCATATGATTTGGCTATGAAACTTACCGATATTTATAACATCAAGAGCATCAAGGATGCTGCAAGGCTGAAGTTGGCAAAATGGTTTAATGAAGTTGAAGAGTTGGGAGTGGACAATTTCTACACAGTGATTGACACGTTTGAAAATCATTATCAAACCATACTCAATTTCTTTGTAAACAGAGCTACGAATGCAAATGCCGAGTCATTCAATGCTAAGGTTAAGGCATTCAGGGCGCAGTTCAGAGGAGTCACAGATATTCCTTTCTTTTTATATAGGCTTATGAAATTGTGTGCTTAAGAGTATAGGCTTGCAACTGGGTTTTAGGACTGACCCATAAAAAGTTCAAAGTATTCAACGCTCCTTGGTGGTGTGTGAGATTTAAATAACGCACATTCAATTAGTAAACACGAATGCTCTCGGTTAGAAATATATTTACAAAACACATATGAAAAGGGCTGGGGACTGAGCTGAAAACAAGAAAATTTACCACATTTGTAAGTCTTTTGTAATCAATTAGTTGCGGAAGTGTCTTCCAAAAGGTACCTAATAAGACTTCAAAAGGGCGTTAGTAAGGGGCTTAAAGGGCATCTTTTGCAAGCTAAAAGGGCGTTAATTGGAAGCCAATTAAGCCTTAATAAAAATCGAAGGAGGGAAAAATATTGACAAAACGAGGAGTAAACGGGTGGGGGAATACTCACCTTAAAGCCCTCTATCAATCCATAGGGACAGTTTTTCATTACCTCATGTTTTATCTTGTGTGTTCAGTCTTCAACTCTCCAATATTAGAAGCAATGGCGTTTAAGGAACTCCTCATACTGGTTGCGATAGCCATTGAAAACATTGATATCAACATCGCCATGGATGCCACGCACCTTACCATCAGGACTTAACTGCCAGTAAGCTAACTTGAAGTTGGGCTTATATTCGCCATAACGGGCTATCCATATGTGATAGTTATCGCCCAAATCGGGTGCCAATGGCATGTAACGGTTAGCAAACGACTGACTGATATAGAGTATCGGGCGATGACCCGTATGTTTAAATACCATAGACATCCATATCCTCACTTGTTTGAACAGCACTTCAGCACCTCCCATAGCATGAATTTGCGCATCAGTAGGCTCTACGTCGAGCACGGGGGGTAGGTCGCCTTTCCGGAATCGAGCTTTCTTTAAGAAGAAGTTAGCCTGCATATAGCCCGAAGACAAGGTTGAGAAGAAATGATAAGAGCCCACTCTAATCCCCTGCTGACGGGCTGCCAAGTAATCAGAATAGTAATAGCTGTTGAGTAAAGTTCGCCCTTCAGTTGCCTTAACATAGGCGAATGAGACTGGATAGTCAACTATGCCTTTCACCTTCTTATTGCTTAATGTACCAAGATGAGTGATTCTTAACTTGCTCCAAGCTATGCTGTAACGCTGCTCCCCCTTCTCATGTTGATGTTTAGAAAGGTCTATGCCATAGATACGATCAGCGTTGTAAGTGAGGCGTTCGCCCTTAAATACGTTGTTCTTCCACTCCCCTACCTTCAGAAAGTCATGTGGCGCAACATTGATACCAAAGCCATTTCGCTTGCCATCCTGCCAATGCCCATCATAAAAACTACCATCATTACCATAATACTGACCATATCCCTCAGCCTTTCCTCCTCGTTGCTCGCCAACGTAATAACCCGCAGAATCAATCAAAGCTCCTGCTGGAAAGCCAAGAACAGGATGCGAGTAAGCCTTGAAGTGACGCAAACTCTCTTGCATTCCTCGCTCTCTGCCGAAGCGTGAAGAGATCCTCCAAGGAGTCATACAATCAGATGTCTGCAAGCAAAGAGCCCATCGGTTTGATTCATCATTCAAGCGATTACAAACCATTATCTCGCCGGGCTTATACCATACACGTGCTGTGTATTGACTCAATTGACGGATTCGAAGACGTGCAGTAGGTTTTATATCTTGCAAAACGTGTATAATCCTTTGCAATGAATCTATCTGATGAACCACCGCTGTATGATAATCGGCAATCTTGTTATAACCATAGTCTTGCACACTATGAACACGCAAATAATAGGCAGTTTCATTGCGCTTGGACTGTAAACCCACTAACTTTATATCCATCTTCATCAATGCGTACATCAATACCTGACGCAACTTCTGATCCTTCAGATTGATAATTGTGGAAGGATAATGACGCCTTATCACAGCTATACGCCCAAAGCAAGAAGGCAAAATGGGGAGTTTATTCACCCAATAGCCCTCTGTTACGGACATACGTGAGCGTATAGAGTCACCTGGCAGACTACCATCTACAAAGGTGGAGTCGGTATAATTGGAGAAATAAGCTGCTGGTTTACCATCAACGTCCAACTGATAATAAGACTTTTCAGTAACATCTACGGCACATCTTTGTATATGTCTTCTTGTCAACGAGAGGTTACGCCTGGCAATAGATGACAATAAGAGGGCTACGAGAATAATACTTACACTCTCGTAAAAACTCCTGTTAAAGAACGAACGAAACATCTAAATAACCTTTAAAATATTTGCAAAGTTACAAATAATTATCTAACTTTGCGGCCAAGTTTATGAGAAAAACGACAGACATATTACTATCCCTTCTATTGTCAACGATGATAGTATGGATTGGTTCGGGCATCATGACAGTGGTGTGCGCACATACTGGCAATGTGTCTGTTGCTAAGATGGAGGATAAAGGTCATTGCAATGATAACGGTGTAAAAAATTGCATGAAGATTGAGGTAAAAAGCTTGTCGCCCACAGACATGGCGCACAATGACTTTTACCAGATTCAACCAATACAGCTTTCTCTTCTCCCACAATTCGTAAGTGATTGTCAATTATTGCCTTTGCCTGTTTTGACAAAGGCCCCAGAGAGGATATTGTCGTTGCTCTGGCATAGTCCGCCACGCCAATATCTCCGATTACTTACGACACTTATCATTTGATTTTTGGTTGTTTTCTGTTCCTCATGAAATGGGCAACAGAAGGTTTATCGTACCTGTTACACAAGTGTGTTCGGGTAAAAAAGCGTATCAGTATGTTTCCCATTCTGATACCAACACAGGCTTTATAAATTTTAAATAACAACTGGATATCAAACGAAATGAATAAAATTATATATTTAGCAGGACTGACCTTGCTGTCTGTCCTACCTGCTACCGCACAGGAAGCTGCTGTCGACACGACCGCAACCATGAAAGATCAGACATTGGATAACGTGACCGTTACAACGCGAAGAGCCTCTGTGCACTCATTAGCTGGAGCCATCAACGGTAAAGACATCCTACGTGACGAACTCTTCAAGGCAGCTTGCTGTAACCTTGGTGAGAGCTTTGTCAACAATGCTTCAGTAGATGTAAACTACTCGGATGCGACAACAGGAGCCAAACAAATCAAACTCCTTGGTCTTAGTGGTACGTATGTTCAGATGCTGACCGAGAACCTTCCAAACTTTCGTGGTGCTGCCCTCCCTTACGGTTTGGGCTATGTTCCAGGCAGTTGGATGAAGAGTATGCAGGTGAGTAAGGGTAATACATCTGTCAAGAATGGCTACGAAGCGATGACAGGTCAGATTAATGTAGAATACGTTAAACCTGAAGATCCAACAGGTATGAGCGTCAACCTCTATGGCAATACGGTGGGAAGATTCGAGGCAAATGCGGATGGTAACATTCATATCAATGGTAACAAGAACCTCAGTACGGAAATCCTTGCCCACTATGAGAACAACTGGCTACATCATGATGGCAACGGAGACGGATTCCAAGATGCCCCAAAGGTGAGACAATATAACCTGCAGAACCGTTGGTTTTGGAAGAAGGGCGATTATATCTTACATGCTGGTTTGTCGCTGTTGAAGGAAGATCGCGTGAGTGGTCAGACTCCTCATGCTGCTGCAACGAATCCTTATCGCATTGATATCGGCACCGACCGCTACGAGGCTTATATGAAACATGCCTTTGTTCTCAATAAGGAACACGCCACAAACATCGCTTTGATGGGTAATGTTTCTATGCACAAGCAGCAGGCGCAATATGGTATTAAGCAGTATGATGTCAACGAAAAGAATGCTTATGCATCACTTATGTTTGAGACAAACTTCACCGATGAGCACAATCTCTCTGCTGGTTTGAGTCTCAACCATGATTACTTGCATCAGACGCTGACACTCCCAGCAACGGCTGTACCTTCTGATTACGGTAATATCTACCCGCTCACACGTGGTATTGAGAGCGAAACAACACCGGGCGCATACGTTCAGTACACCTATAACTTGCATAGTCGCTTCATCGCTATGGCAGGATTGCGCGTCGACCATAGCAATGTTTACGGTACGTTCTTCACTCCTCGCTTCCACTTGAAATGGGTGCCAGCTGACTTCCTGACCCTTCGCCTCTCTGCAGGTAAGGGCTATCGCAGTCCGCACGCTTTGGCAGAGAACAATTACCTCATGGCTTCGGGTCGCCGACTCATCATCGATAAGCTTGATCAAGAGGCTGCATGGAACTATGGTGCGAGTCTTAATTTCAACATTCCAATAGGAAATAAGATGCTGAAAATCAATACTGATTACTATTACACGCACTTCCTTTCACAGATGTTGATTGACTATGACAGTAATCCACAGGAGCTACATATCACCAATCTCAATGGAAAGAGCTTCTCGCACACCTTCCAAATCGATGCTTCCTACCCTCTTTTCAAGGGCTTTGAACTGACGGCAGCCTACCGATACAACCTCGTGAAGGCGACGTATGGAGGAAAACTGATGTGGAAACCACTACAGAGTCGTTACAAAGGACTCTTAACGCTCAGCTATAAAACACCTTTAGGAGTTTGGCAGTTTGATGCTACTGCAGCCTTGAACGGAGGTGGTAGAATGCCAGAATCTTACACCTTAGCATCGGGCGAACGCTCATGGGCGGGCAGCTATAAAGCCTATGGACAGCTTAGCGGGCAGGTGACACGCTACTTCCGCCACTTCTCGCTTTACATAGGTGGGGAAAACCTTACCAATTATAAGCAGAAGAACCCAATCATTGGTTATCAAAATCCATGGGCTAACAGCTTCGAACCAACAATGGTCTATGGTCCTGTAGAGGGTGCAATGGCATATGTTGGTATCCGTCTGAACCTCGGTAAACGACAGTAAACATACATTTTTCACAATAGAATAACATAAACGACAACACAGAGAGACGTCTTGAAACACGCTTTCGGACGTCTCCCTCAAACAAAGAACAATTATGAAAAAAGCTATTTTCACAATGCTGATGCTCATGGTAGCCATGATTGCAACAGCCAAAGACATTAAGACAGTTATCTTCACAACTACTCCACAGATGCACTGTGCTAACTGTGAGGCGAAAGTAAAGAACAATCTCCGCTTTGCAAAGGGTGTGAAGGAGATTAAGACAAGTGTAGAAGAACAGAAAGTTTATGTAACCTACGATGCCCAGAAGACCAATGAGGAGAAGCTTCAGAAGGCTTTTGAGAAGTTCGGTTATAAGGCAGAAAAGACGACAAAGGAGGCTAAAATCCCTGTTCATGAGAACGAGGAGTGCGAGAATATGTAAGGGGCTGAACATGCAAATACTAAATTGTGATTCTCTGAACCCTATTATATAAATAGGAGAAAGGGGCAAACTTCAGCTGAGTAGTCTATCGCTACAGCATTCGAAGTTTGCCCCTTCGCTATTAATTTCTATCTGAAGAAATAATGACTGCTTCGTACTTTTGTAATAATAAAATCACAACTGTTTTTTATTCTACCCTCTTTTGCCTCTGAATTAACGCCCAATTAACGTGCAAAAGATGCCCTTTAAGCCTCTCACTAACGCCCTTTTGAACCCTTACTAAGCACCTTTTCAAATCGACCTTCACAACTACCTGACAGCAAACTACTTACAACAACACTTAAAACGTATATATTTTGCCCTTTTTCTCGCCTTATCGCCGAGAAGATTGTAATAATATTTCAAAGCTCAAAGTTCGATTTTTCAAACTTAAAGTTGCTGCACATTCTCTCATAGAGTTCACTGATAAACAGAAAAAAACAGCCCCGATTTGCTTGTATTGGAGACAAGGTAACATAAGAATGTGGTTTAGAGTAGAGCACAAGTTCCCAAGTTACTTGAAGCAATAAACCACACACCACGATAGTAATCATGATTCTGACCTACTGATGAGAACTCTGTTCGAATAATTATGAATTACAGATAATAACCTTCCCTTGTGAGCGATGATTAGCAACATCGCTCAGAGCCTCTTCTGCTTGTTCTAACGTATAGACATTACCTATCTCTGGGCGTATCTGACGAGCTTCAAGAATAGCTGTTGCGTTGCGTAACTGTGCGCCATCAGCGGTCACGAAGATGAAATGATAATGCTGATTTCGTTTGCTTGCCAACTTCTCATTCTTCATACCAACCAACTTAAACAGCCATTGCTTCCAGAGCGGCAAACCGAACGACTTAGCGAAACTGCCATTAGGCATACCTTTCAAAGAGACAAGCGTTGCACCTTCTCGAAGAACTGAGAAAAGTCTCGGTAGCTCTTTATCACCAAGCGAATCGATTGCACCATCAATATCCTTCAACTCTTTGGTGTAATCTTCTGTTTTGTAATCAAAATATTGGCTCACACCAAGTTCCATCGTGCGCTCTTTGGCTTTTGCACCACCACTGGTAATAACCTTCAATCCACGTGCTGCTGCCAAAGGGATAGCCATAGCACCGAAGCTACCAGAGCCACCAGAAATAAACAAACTATTCCCTGACTTTAACTTGAGAATGTCAAGCGCCTGCTCAGCTGTCAGTGCGGTCAATGGTACAGCAGCAGCCTCTATATCTGAAAGGTAATCTGGTGTTTTGGCAACTGCCGCAGCATCTACTGCAACATACTCAGCAAAGGCACCGATACGATTGATAGGAAGTCTACCGAACACACGATCGCCAACCTGAAAGTCCTTTACCGACGCTCCAACCTGCTCTACCACACCAGACATCTCATTTCCAATGGTCAATGGGAAGTGGTAAGGTACAACCATCTTCACATCTCCATGTGCAATGAGCAACTCTAATGGGTTGACAGCAGCAGCCTTTACCTTTACTAACAAGTCGTTATCGCCCACCTTTGGCATTTCTATATCACGTAAGGCAACCTTAACATCACCTTTCTTATATCCGTCTATCTGTATTGCTTTCATATCTTTATACTATTTAAAAAACTCTATTGCTTCTGGAACAAACACCTCATGGTATTGGAAAACACCTCCATGACCTGCATCTGGATAAATCGTCAGATGAGCATTGGGCAGTCTTTCTGCTAAGTCATACGAATTAGGCGTAGGGACCATTCTGTCATTATCACCATTGACAACCCATACAGGAATCTTGATTTTCGACAAATCGTCTTTCTCTCGGCTGCCCCATACGTCAATGGCTCTTAGCTGTCTTTGGAAAGAAGAAAGCTTTATCTTGTCATCTTTGTCCTCTGTACGTGCAATGCGCTTCAGAAAGTTATTCGCTGCTTTCTTGCCTTGCTCAATCATAGGGAAGAAGAGATAATACTTAGGGTCACGACCAGTGAGGAAAGCCCTGAACATATCCCAATACGTAATCTTCGCTACATTCTTTATGCCTACACCACCAGCGCCGCCTGTTCCTGCAAGAATAGCCTTCGTGGTTAAGGCTGGCGCACGAAGCAAAACCTGCTGTGCAACAAACCCACCCAATGAAAATCCAAGAAGACGAATGTCTTTAAGACCATAAGCCTCAGCAAAATCAATCACGTCGTCTGCCATTTCCTCAATCGTAAGACGTGACTTGCCAGTAGACTTACCAACGCCGACATAATCAATTCCAATGACATTGAAATGTTCTGCCAACCCATCCATTATTGCTGGGTCGAAATTATCTAATGTTGCTGCAAGGTGATTGAGACAAAGAAGTGGAGGATTTTTCACATCACCTAACTGTCTGTAGGCTACTTGCTCTGTCTTTACTTTTAATATTTGAGTCGTAGTGTTTGAATAACTTGTATTTACCATACCTATTTAATCTTTTACCATTTTATCCATATTCTTAGTTCCCCAGTCAAATAAAGCATCAAGCGGACCAAGAAACGAACGTCCTAATTCGGTTAGACCATACTCTACTCTCGGTGGAACTTCTGGATACACCTTGCGGTTTACCAGTTTGTTCTTCTCCAGATTCTTCAATGTTTGAGAGAGCATCTTCTGTGAACAGTCGAGCATATACGAATGAATCTCTGAATAACGCATCTTTTCCTTACCCGCTTTACCGAGTATATAGAGGGCTAACATTGACCATTTATCACCAATGCGAGTAATCACTTGACGAATTGGACAAGCCGAATATCGTGAATCAATCTCTTTACTAATCGTCTTCATACCTGTAAAGTTACACATTTCCAACGAATAGTTGCATACTTACCATTAGAAAGTTACTTAAAAACATTTAACCCTTATCGCTCATTAGAGCCTAAACAGATTTTTGTTTTATTATCGAACAGATTGACTGCCTTTGTAACGCAAGCGTAGCGGGCTACGTTAACTTACAAAGAAAGAAAAGATGACAACAAGAAAATAAAAGATGTTAGGAAGCAACATATAAATAAGAATTATGTTGTTTCCTAACAGATTATGTGTGTTAATTATTTCCTTTCTCTATTCTTACCCACAACTGGAACAGTGCTGGAACAAAGATACAGAGTGAGAAGAACAGACCTCCGTACATCATGCGGTCGTTGCCATGGAAGAAGTCGATGAGTTTTCCATCGTTCATCTGTGGCATGAGGTTGAACATTAGATAAGCCACCGTATTGTTCACCCAATGAAATACGAAACCTGGCAGAATACTCCCCGTTTTTGAATAGAGCCAACCCAATAGCAAACCAATCAAGAAAGCGTGTAAGCCCTGTGCTAAATTGAAATGAATAATACCGAAGATGAGGGCTGAAACGACAATGGCTATCCATCGTTTGCGTTCGCCTAACACGTTTTGCAGTACGCGCTGAATTGCACCTCGGAAGACTATCTCTTCGGCTATTGGCACCATTAAGCCCAAGGCAACATAGCCCCAAGACACCTTCATGATACTTTCAAACATCTCTTCTGTGCCCGCTGGCATCGTTAGATTAATCTTCTCGGCAAGGAATTGCAGCGGGAGTATCGAACCTAAAGCTATGAGAGCCGACCACATGAGGACAGCCCAAGGACGCGTCTTCAAATAGTTACCTGATACAGGTGCCCACTTTGTAGCGATGAAAAGAATTAAAGTTGCAAGACTACTGATGACTGCCGTTATGGCAAGAGTCTCACCATTTTTAGTCAGATAATCCCCATCAGCAGCCTCTGTCCCTCCAAGTAGGCTTACAAGCTCCTTACCTCCTTTGAAAGAAAGACTTTGTATCAAGATGAAAAGTGAAACAAATAAGATGACATAGAGCGTTGCGCTCAAAACGTTTTTATTGTTCATTTCTGCGAATCTTAGGGTGTATTAATAATATTCTCTTGTATGTGATAAGTCTCATCAAACAATCGATTGATTTGCTCTTTGTCTTGCTTCAGCTGTTGTGCCAAGGCTTCTAAAGAGTCAAACTTATGCTCGTCTCTTATCTTACTAATGAAACTTACAAGGAGTTCCTGTCCATAGAGATTTCCTGTGAAATTAAAGAGATTGACCTCTATCGAGGTGGTTGTGCCATTGAAAGTAGGCCGATGCCCAATGTTCATCATCCCTCGCTTCCATTCGACAGAATCCTTCAAGCGCACCATGACCGCATAGACACCACTTGCAGGGAGCAGCTGTTGGCAACCATTCACATCGAGATTGGCTGTAGGGAAACCCATCTTGCGCCCATTCTGATAGCCACTCACGATGCGACTCTCTATCGTATAAGGATAGCCTAAAAGGCGGTTGGCTGCCTCAATATTACCCGCGCGGAGGTCGTTTCTGATGACAGAAGAACTCACTTTCTCATCATTTGGCAGGAAGGCATCTGCACGAATGACCTCAATACCCATCTCCTTACCATACTGAACATAGTCTTCAAAAGTTTCAGAGCGGTTATGTCCGAATCGGTTGTCGTAGCCAATGATGAGTTTCTTTACATTCAGCTGTTTGCACAACACCTCTTGCATGAAGTCATGAGCTGAGAGCGCAGCCAACGAAGCATCGAAATGCAGTACGAAACTATTGTCAATATGTGTCTTTGAGAGCAGTAATAGCTTCTCATCGAGTGTTGACAGCAGTTCTGGTTGGTAGTCAGCCTGCAGCACCTGACGCGGATGTCGGTCGAAGGTGATGACTGCCGATGCCATTCCAGACCGTTCAGCCTCGTCAATGACACGACTAATCAGGAACTGATGTCCACGATGCACCCCATCAAAGAACCCTATGGTTGCAACCTGTTCTACTAATTCATGCGCCTCATCTCGCTTAATATATATTGTATTCAATCCTCTTGTCTTATAGTTCGTTTACTTCCTTGTAAAGGTATTTTTATTTCCCTGCAAGCGGTGTTAAGCTCTCTGCAAAGATAGTCTCACCTTGTTACGAAGCCATTTTTACTTTCTTGCAAAGGTACTCCAAATCACCCATAAACCCATGTTTGGGTAAACAAAATGTGTTAAAACATTAAATTTCCTTATCGTCCTTTCTCCTTTTTCCATTTATTTCATTATCTTTGCAGCCGAATTCACGAAATTCAGGACTTGTAGCTCAGTTGGTTAGAGCAACAGACTCATAATCTGGAGGTCCCTGGTTCAAGCCCAGGCTGGTCCACAAAGGGAAAAGAAGAGCTTTACACTATGCTGTAAGGCTCTTTTTTGTTGGTAATCAAGTAGTTTCATCAGTAGGTTATGATGGCTATAACGGTTCATTTGTTGTAAAATCCCTCTATCTTCTCCTTAGTTTAGAACGTACATATCACGCTATTTTGAGCAAAATTTCAGCTAATTTATCCACTTGTTGACCTCCTGTTGACCCGAGGTTGACCCGAGGTTGACCTCAAAACGAGAGATTATGCTGACAATTAAAGCAGAAATTCAGAGAGACAAGTTAAGGCAAGATGGTAGTTACAATGTCCGCATCAGATTTACGAAAGATCGTAAGGTAAAACGTATTTCCACAAGTCTGTTTGCAACAAAAGCCGATTTAACTGACAGGTTTACTATCAAGGAAGATTCTCTTATCAAACAGGAAGCGGACATACTTATCCTGCATTACCGTAAGATGTTTAACGAAATGCACTTGGAGACAGAGGCGCTTGATGTAAACGAGATTGTAGCCCGACTGACCAGCAGGGAGGAAAGCGATAAGCCTGTAGATTTTATCCAGTTCGCTAAGGGATGGATTGCCAATTCCACGTTAAAGGGAGCAGTCAACTACACCTCCGCACTTAACAGCCTCATCCGTTTCAACAAAAGTGATAAGCTGTACACACATCAAATTACAAGTGAATTCCTGCAAGAGTTTATGGCATTCTTACTCAATGAAAGCAAAGAACGAGCGGAACAATTGAAGAAGAAAGGCAAGCGTGTTCCCTCCACTCGCAGTACCTCCCTTTACTTAATGAGCATTCGTAGATTGTTTAAGGAGGCTGTAAAGCAATACAACAAGCCAGACCAAGGGCTAATCCGTATCAGGAACACCCCATTTGTTTATTTCCAAATCCCCAAACAGGAAGCAACACGAAAAAGAGCTATCACAGCAGAATTAATCCGTAAGATAGAGCAGTTACCGTATCAGACAGTTTATAAAGGCATACATCACACCAACCGCTTCAATCTCGCAAAGGATTGTTTTATCCTCTCATTTTGCATGATAGGAATAAATTCTGTGGATTTATTTAACGCAACCGAATATGATGGCAATACGATTACTTACTATCGTACTAAGACGAGAGACAGGAGAATGGATAAAGCTAAGATGATAGTAACAGTTCCTAAGATGTTGCATCCTTTGTTTGAGAAGTACAAGGATACAACAGGCAAACGGATATTCAATTTCTACCAGAATTATGTCAATGAGAAAGCCTTTAACAAAGCCATCAACAAGGGATTGAAAGAGATTGGTAGTATTCTTAAAATTGAGGATTTAGAATATTATGCTGCACGCCATTCTTGGGCTACCATTGCCCTTAACAAGGTCGGAGTAAGCAAATACGTGGTACACGAAGCCCTCAATCATATTGACGAATCTATGCGAGTAACCGACATTTATATTGAACGTGACTTCTCGAATGAGAATAAGGCAAATGCAAAAGTCGTAAAATATGTGTTTGGGAAATAATGTGACAGAATATAACGAAAAAAAGTTTTATACCCCTTGTTTAGAAAATGTCACCCAAAGAAAACTCCATCGGCTCTAATGCAGGTGGAGTCTTTTTATTACCTTTGCAGGGTATCAAAAGATAAAGGATATGACATTTGAGATTAATAAAACAGAATATATTGCTGTTATAGCTGGAATGGACTGTTTACCACGTATCAATGAGGACGAAATTATAGATCCCTCATATATCGAACTGGCTCAAAAGGCAGCGACAGATCATTTTGCTAATAGATTTTCCAAGTTATTCAAGGATAATCAAGAAAAGTACGAATTATTCCTTGGAAATAATCATATTCAAAAGCAACTATCGAAATGGCAACTTGACCCTATCAAATTCTGGTATCTACATTTGTTTATCATCGATTATGCAACAGATGCCTTTAGAAGTGTCACCATATTTGAGGAACTAAGCACAAAGCAGTATATTGAAAAACTTATCAAATCCTTAGAAAAAACAGATATAGAGAATTTCTGTCTCGATATAAGGATTAATAAAGAGAAGTTGTCAACTCGAAATCCTTGGGTAGGTGAAGCCTTACTTTCTGCTCTAAAACAAGATTTCCCATTTGACTGGGTAAATAATAATACGTTTGTTTCTAAAGATATCAAAGAGGATATAGAACGCTACACAACTGCAAGAATGAAATTTTCCACAGAATTATACAGCCACTTCTTTGATCAATATCTTCACAAATCACAACGAGGTAGAAAGTCCTTTATAGGTAAATTCCTTTACTTGGCTGATTTGACAAGGGATGAACGTTATTGGTATGGATGTAAACCAACATTAGAGAAAGTCTGTAAGAAATACGAAATTGCCCAATATCAAAAGATTATGGTTGAGGGAGTAGAACATATTGCCATACCATTTGATGTGGGAAAGGATTTATCAGACCACATAAAGAAATGTACCATGATTCCCCAAGTACGCCATTCTTTCTACTTTTCTCCAGTTGACGAGTAAGTAACTATCATCCATTATTTTAGGTTTTAGGGAATTTTCCATTGTAAAAAGTTCCCTAAAACCCTCAAGCCTTTTTTACTTTCTTTCTCCTTAATTTCCCCTTACCTTTGCATCGCAATCACGAGACAAGCGCGCTGTCTAAGTGATTGTGAGGTAACAGGTTCACCTCTGATGTAAGAACCTTAAAGACAAAGAAGATGAATAACAAAACTTTAGACCTAAGAATAGATAACGCAGAGACATCTGCAAGTACAGGAACTCAAAACAACAAAAGCATGAATAAAGGAACATTCACATTACCGACTTTACAGAAAGCACAAAACCAGTATGACAACATTGGAGCTTACAACCAACTGCAAGCAATGATGAAAGCAGGACAGAAATTATCTGTCAAGTTCTATACAGGGAAGGGCAACAAGCCATGTGCTTGGATTGAGAGTAAGCAAGTGACAGGATTTAGGTACGAAGTCAAAGACACAAGTTTTAAAGGTTTGATGAACTACCTCATATCAGGTGAGGTTGCTGATTTTGACACTTCACCTATGGAAGCTCAACCTTTGAAAGAGGAAACCAATTATGCCCTCTTAGTGCTGAAGTTGATGCTGGAAGAACACTGGGGTATCCAGTTCACACCTCTTTTTAGAGAAAGATCTAATTATATAACTGCCCAAAAGGTGTTGAGGAAAGGAACAATATACTTCCAAATCCCAAGAACTGATGAGAATGTAGAGCTACTAAGAGATTACGATGTAACTATCTAAATAATAACAGGAAGAGGGGGGAAGCTCCCTCCTCCATAAAACAAGAAATTATGAAAAAAGAAATTACTATTTCAATGAGCTATGGCAAGGTACAACATCTTTCAGAAGTCCTGCCACTGATTCCAACTAACACAATCCTTTGCAAGACACTTACTGGCATAGGAGCAACTTACGGAGAAATTCGGTCTGCCCGAAACTCTATCATCATAGAGCCGAACAGACCTGTTATATATGGGAAGTGTCGTGCCCCAAAACATAAAAATGACAATCTGTTTGGAGTTTACGAGGGCATCTATACCCAAGACATTGTTAACTACATCATCAAATCCCAAGACAGGAACAAGAAGATAAAGATACTGACCACTCCCGAGAGCTTCTACAAAGTGAGAACAGCATTTGAACAGATGGGGATAGATATTCGGACAGACGGTTATTTCCTATTGTTCGATGAGTGCCAGAAGATTGTTAGAGACTGTGGCTATCGTAAAAACATATCACTGCCAATGGATTTCTTCTTTGAGTGTCAAGACAAAGCTATGGTTTCTGCTACACCTCCATCAGAGTTTGCAGACTCACGGTTTGAGAATTTTGATATTCTCATCATCCATCCCGACTTCAACTACAAAAAAACGATAACGGTTTGTACCACCAACAATGTACTGGAGAGAACACGGCAGTTGTTACTACAACTGGACGAGCGACCAGTTTTCTTTTTCGTAAACTCCACAGACATTATCCTTGCTATGATGGAACAACTTGGATTAAGGGATCAATCAGCGGTGTTCTGTTCTGCCGACAGTGTGGATAAACTGAAATCACAGAAGTTTAGCAACGCACACGAAAACTGGGAGAAGAAATACATGGCAAAATACAACTGGATGACAAGCCGTTTTTACAATGCTATGGATATAGAGTTGAATGAAAATCCAAATGTTGTGATGATTACCGATTGTTACACCGCTGATTACACAATGATAGACCCTTATATGGATGCAGTGCAGATAACAGGGAGATTTCGTAATGGAACTAACGCCATCTACCATATCAGTAATTTCGATAAAAGAATCCCAATAAAAGACAAGAAGAGCATTATCATACGGTATCAATGCGATAAGGAAATATATGAGAACTTTAGAACATTCAAGAACTGTGAAACAGATATCAACCGAAGGGCGGCTTTCAATGATGCTATGAATGTGTTGCCTTACAACAGATTCCTTAACGATTACAGCAAAGAGGATGCTTTCAAGATAGACAACTATATCCACGAAGAATTGGTAAGGGTGATGTACCATGATTATACACGTCTCTATCAAGGATATAATGAATGTGGATATTTTGAAGTAACCACCTCAAACATAACTTATAGATACGGTGATTATGATCGATTGAAAATTACAAATAATACTTTACCTATCAAAGATATACGGAAACGAATTGTAGACCAGTTGGAAGCTCTAAAAGAAGACGAGACAGAAATGGCAATGCAATACCGAGATGAACTTCGTCACATAGATTCTTTCATTGTAAAAGCATGGGAGGTGTTAGGGAAACAGACGATTGAGGAACTTAATTATAATTCTAAATCAATTAGGGAGAAGATAATACTTACCTTACATGAGAAAAAGGCTAAATCTTCAGATGTAATTAGAATGATCTATAATTCGTTTATTTCAAGGCATTGGTATGCTTCTTCTTTTATAAAGCAAGAGCTAAAGCGAATTCACAATTTATTAGGAGTTCCCAAGAGTAAAGCTATCACTGCAAAAGATATTTGTCAATACTTCGAAGTTACAGAGAAACGGAAAGCAAAAGAAAGAGGTTATTACCTCATCAGTCCAAAATTTACTACAGAATGAAAACTATTAGATTAAGATTACTCTTTATAAAGAAACTATCATCCAAATACAATGTGCTCCCTCACATTGAGGATGGATAGAATCACATTGTGCAAGAAGAGATTATGACGAAGATTATGGTAGAGCCTATTATAGGTTTTGAAACATCTGAATTTACTGAATTTGTCAGAGAATACAACTCAAAAGGTTAACATAGAAAATAGGTGATAAAATGTCTAAAAGTGTACAAAGAGATTAGTAAAGTCAAGATAAGGAGAAATACACAGAGATAGTTCAAAAAGGCATAAGTAGCCAACAAGTGAACAGATATGGACCAGAGTCAAAAAGGACTTTTATGATATTGCAAACCATGTTCGTTACTAAATCTATATATAGCTGCCAAACGAGAACACCCCTCCTCCTCTTCTAAAATGTAGATGCTGTAATAAAGGGGATATCCCACTATATACGTCATCCTAAAATATGCCCCTCTTTTGCTATGTCCTCCCTTTATCATCAGAACATAGGTGCTTTATCATCCAAGCATCAGTTATGCAAAACTCACACACAAAGGCATACCCACTTTGTCTCCCCCTCCACACAAAACAAAGGGAAAAGAAAAATAGCAGAGGTTTCTACAGTGAAAAATAGGCTATGAATAAACCTCTATATAATCTTAGGGGATTTATGATGTGTAACAGGCTGGAAGTTGGAGTAATCCGCTTTCGGTCTGTTCTCATTTATGCCTATTTTCAATCAATATTCACTTAATCATTTACAGTTATGAACAACGTAACATTAATGCCTGCTGTTGTGGCACAAAACAACAGAAACTTTGATTTCAGCGATGCAGAGGAAGCCGTTATCATAGAGGACACCCCAACGGAGGGTAGCAGTGTTTCTTTCCTCGAAGCCAACACCAACGCAATCTCACTTGATGAACTTGCCAGTACGTGTGTCGTCCCAACTTGGGGTAATCAAGAACTGACCATTGCACATCAAGACTTTATTAACTGTGTACATGATGCTGCCAAAGACTTCTATCATGGCGAGCAGATAAATACTCCTGCTATCCGTGTTTCTCATATAGTAAGAGGTAGAACACCTAACGCATTAGGCAAGAAAGCGTCAGAACTCTTGGAGTGTGAAAAGACACAGTTCTATCAGCGTATGGCGTTTGCTTTTACTATCCCAACGATTTATGAAACCTTGAATGGTCAGAAGCTTGAACTTTGTGTGGGTGGAGTGCGTAATTACAACGACCTCAACCTTTATAGGGCAAGTAAAGGTGTAGAGAAGTTCTCTATTTTTGTAGGCTGGAGAGTAAACGTGTGCAGTAACCAAGTGCTAACAGGTGATGGTGTCAAACTATCTATCGAGGTAATGAGCATTCGTGACTTATATAAGAGTGTGATGGAGTTACTCTACCACTTTAATCCAGCTAAGGATATACACCTAATGCAGACTCTCTCAAATACTTCTCTCACAGAAACACAGTTTGCACAGATAATAGGCAGAATGAGAATGTACCAAGCCTTACCGCAAGGATATACCAAACAGATACCACGCTTGCTGATTACGGACAGCCAAGTGAACAGCGTATGTCGTGGCTATTACAGCAATCCTGATTTTGGTGCCAACGACGACAGTCTGTCTATGTGGGACTTCCACAATCTGCTTACGGAATCCAACAAGGGAAGTTATATTGACACCTACTTGCAGAGAGCGGTAAATACCACGGAGGTAGCGGTTGGCATCAACAATGCCCTACATGGTGATGAGAAATATAAATGGTTCATTGGATAAGTGAACTGTTGATTGAGAGCGGGGAGACATTCTGCAATGGGTGTCTCCCTTTTTAATTGGTAAACTATACACACAACACGATGAATAAAGAGATAGACAAGGCTCTACACCTTATATATACTTTACAGGAGCAGTTATGCAATATGCCACACAGAGAGGGCTGTCTTAAAATGCTATACCAGATAAGAAGCCTTATGGAGGATTTCGACAGTATGCTGACACCAGACCAACGCAAATACGGTATATACAGCAATGCTTATAAGGAGATATTCGCTAATGGCACAGGTCTTTCGCAATACGACAAGGTTTGTGCAACCATCTGCGAGGAAGAATATGCAGAACTGCCATTTTAACAACAACTTATTCATACATTACAACTATGGACATGGATTTCAAGGTAGCAGAGGTAACGCTACAATACAAACCGACCTGCAAGAAACAAAGCAAGGTTTTTGGTTCAGAGGAAGCATATAATATCTTACTTCCTACATTCAAAGAGGGAACAATAGAGTACAGGGAATATGCTAAGGTGTTATATCTCAACCAAGCAAATGAAGTTATTGCTTATAACACGATTTCAGAGGGAGGACTGACAGAAACTGCCGTTGACGTGAGGATAATACTACAAGGGGCATTACTGACCAATGCCACGCAGATAATATTTGCTCACAATCACCCAACTGGTAACTTAAAGCCCAGCCCACAAGATGATATGCTTACAAGAGAACTACAGAAAGCTTGTCAGATTATGAGGATACGGTTTACTGACCATATAATCATGTCAGTAGATAGCTATTACAGTTATCGTGAAGAAGGGAGGATATTGTGATGGAAACTAATTATGTCGTAATTCTTGATTTCAGTACAGCAACTGTAATTAAGATACATCTATCGAAAGAGCAGATTGAAGAATCCTACAGATATAATGATTTTGAAGAATTCCTCTTTACACTTGAACCCAAATACGGCTTTCAAGTAAAAGACTGCTGTTGGATGATGTGTGAGACATTGAAAGAGGAAAACTACCTCTGATTATATTCACTAATAGAGAAAGTGAACGTTATTCTGATAATCAATCACTTAACAAAGGGGCTGTCAGAATTATGTTCACTTTCCTTATAAACCCAACATAAAAGAATATTATTATGAGTTTGAAATATAGCAACACAACGGCAGACTATCTTGAATGGTCAGAAGCCATGAACTTAATACGTAGACTGACAAAAGACAAGAACTATAAGATTTCGCTTCTTATAGCGATTGGTTGTTTTACAGGATTAAGAATATCTGATATTCTAACCTTACGATGGAAGCAGATACTTGGTGTTTCAGAGTTTACTATAACCGAACGTAAGACAGCTAAGCAGCGAACTATACGCCTTAACAAAGAACTGCAATTGCATATTAAAGACTGTTACGAGCATATCAATCCTCTTGGCATTGCAGCTCCTGTCCTTGTTAGTCAGAAAGGTACAGTTTTCACCGTGCAACGCATCAACGTAATACTCAAAGAACTCAAACAGAGGTACAGACTGAAAATAAAACATTTCAGCTGCCATAGTTTGAGAAAAACATTTGGAAGACAAGTTTATAACATGAATAGTGACAATGCAGAACTGGCACTGATCAAGCTAATGGAGCTTTTCAACCATTCATCTGTAGCTATTACCAAACGGTATCTTGGATTACGTAAGGAAGAAATCTTGGAAACCTACGACTGTCTTACATTCTAAGCAATGACTTTTTTCAGAGAATAATTCTTTTATACACCCCATACTAAAAATGTCATCATATGCTGAATAACAAACAAAAAACACCCTTAAAATAGAAAAATCACTCTGTTATACAAACTTTTTGCAAGAAAGAAACATTTAATAAGAAAATTTATCTATCTTTGCAGTGAACTTCCAGTACGTCTGTGTGTAGATGTGCGTTATAAAGAGTTCAACTTTTAAAAGAGCGTTTTGATATTATCCTGTTGAGAAATCTGGACAATTTCTGTAACTGGGGATAGTGGCAATAACGCTCACGCATGGGATTATTGTACCCTTAACTAAGGGAATAATATACCTCAATGCGTGGGCTATTGTTTATTATTCAGTTACAGGCAGTCCAGAGCCTCTCGACAATAATAACAATAGTTCCCACGCTTTTTCCATTTACCTAAATATTAACCGCTTACTTCTGGGAAACAGTGAGCACAAGATACAAAATTATGAAGTGGTTGACTACCATTATTGTTGTAGCTATCATTGCTGGAATCTTAGGTGCACTAAACTCCAAGGATGGAGAAAAAGGAGAAGGATTCTTTTCTGGTGCTCTTGCTGGAGGAATGGGCTGTGGTTATGTTATCTTCGAAATATTTTTAGTCGTAGGTGGATTGATACTTTTATTTAAACTCTTCGGCTTTTTATTTGGATAAACAAATACAACAATTTCCTGCTATGAAAAAAGGGATATTTAACTTACTATCTATGAGTATATTATTCTTTGCCAACCAAGGGTATGCTCAACAGATTAATTCACATTTACAAGCTAAATCCACCTTACAAAATCATGTCAGTTCCATTTTAAAAGAGAATATCAAAAAGATAGATGCTATTAATGGACAGATAATAATAATGGATTGTAAGTCTGGAGAGATTAAGGCAATGGTAAACTTGATGAATACTAAATCAGGAATCAAGCCAGCAATAAGACAATTGTCAGAACCAACTGCCTTAATGCGTACAGTTTCATTATTAGCAGTATTGGAACAAGGTAAGGTAAACCTGAATGACAGTATTGATACTAAAGCAGGAATGCTTGATATAAACGGCTATCTGCTAAAAGATTACAACTGGCTTAGAGGAGGAAATGGAAAGACAAGCGTACAGCAAGGCTTTTTGTTATCATCAAACATCGCTACATACTTAACAGTTAAGCAAGCATTTGGCAAGGAGTATGACAGGTTCTATAAAGCCATTACCAATATGGGCTATGGTTTACCACAAAATAGCTTCAAAGCAATCTTACCAAATCCCTACAATGATATTTCTCTTGCCAAGTTTGTAAATGGAAAGAATCAGCAGATATCACCTTTACAGATACTTACATTTTATAATGCCATAGCCAATAATGGTAAGATGGTAAAGCCCACTTTTCATAAGAAGGATACTACCATTATAAAAGAACAAATAGCAAGCAAGGAGAATATTGCCACTATCCAGCAATTACTTATACAAAAAAGACTTACTACCAAAGTCTTCTCTGATAAAATCCCCATTGTGGGAGAACAAGGAGTTGCCATTGTAAAGGAAGATGGAGATAACACAGTTTACTGCTTACAGTTCTGTGGCTACTATCCATCAGACAATCCACAATACAGCATCATCGTTTCTTTGAATAAGAAAGGATTATCAGCAAGTGGTGGAATGGCAGGAGAAATAACAAAACAAATAGTAGAAACAAACAACTTTTTTTGATAGTATGAATATCTATTTAATAAACCCTAAGTGGATTGATTCCACTAAATTTCATAGTTTTCTTGAGATATTAAAGGTAAAAACAATAATCAGTTTTATAAACGAGGAAGATTACACTAAAAATTTACTGCCCAAATATACAGCAATAAAAGTTCTTGATTTCGCTAAGACTCCATTAAGGATGTATGGTAATAACTTTGAGTATAACAAAAAAATAGTTGATAAGTTCTTACAACAGGCTGTATATGAGTTTATTAGCCCATTCTCCCAATTAGCGAAAGATACTAAAGGGAAGATTTCAATATGCAAAGTAAACGATGCATATAAGCAAAGCACATTTAATAACCCTATCTACAATATTGAAAAACTTAATGATCCTAATATGTTGATTCTTTATAATAGTTATAAAAATGAGCATCCTTTGTATCGTGCTGTAGCAAGATTCAAAAGATATGCTTTTAGCATACTTGACAAATATTGTCCACAATTTGATTATAAACATAGAATCGCATTCGTCAGTTTTGATGAGTTCTTATTTTCTAATAGTGTATTCTCTAATAGTGTAGGAAATTCAGATTATTTCTTCAGGACAAAGTATTATTCTATATATTCAGCAAATAAAATAGATGATTTATTCTGTCCAATGCTTGTTTATCTAAATCTGAAGGCAAATCAAACCCACTATAAAGAACAAACACTACAAGAACTTTTCTTGCGGGATGGTGCGAAACGAGCTATGGAGAAATTTGTTATTGAAAGCCGTGATGCAGAATTACTAAGCAAAAGTTGTAGTTATGATTGCGATGATTCTAACGATTGGGGACAAGAGGGTGAGTTAGAATATATATATGAGAATGGCGGAGATTGGATATTAGATTAACAAATAATATAAAATATATGAAAAAGATAGAAACGCAAGAAGGTGTTTACATCATTCAAGAGAGGGAACACTATGGCATCATCAACGAAAATAATACCATAATAATTCCTTGCATATATGAGTCTATAGAATATATTCCTACTGACAACATATATCTTATAAGATTAAATAAAAAGTATGGTGTTCTGAATACCAATTTTAAGACTATAGTTCCACCTGTTTACTCTGGATTATACCTATGGGGGAAAAAGTTTATTGCAATCATTGCTGGTGGACTTTCTGAAAACTTCAACTGGCTTTGGATTTATCAAAAAAGAAAAAGTCACAACGCTTTTTATAATCCTAAATATTCACTGTTAGATAAAGATGGACAAGAAATCTTCCATCCACGCTATGACACAATTACACCTGTTGACGAATCATTTGCTACAGTAACACTTGGAGATTATATTGGCTTGATAAATAGTTGTGGAATAGAAATTATCCCTCCTATATATGATATGGAGAAAGATTCTATGATAGAAAAAATAGCAAATGGAATCGAAGAACTAAAGGGTCGAATTCATAATAAAAGCAATAGATATTATCGCTGGGAACTTACCGATGGAAAAATAAGAGAAAAGAGAATATTTTGTTTTTCCCAAGGGAATACTTTATACGTCATGGATGAAGAAGGCACTGTTTTTGGTAGTCAACAATACTTAATCATCCCAAGGTTTGATAAGACTGATAGTATTTGTAATAAAGACATCATAAATTTGTTAAGTAACAATGATACAGAAAAAATACGCAAACAGGATTTTGATTTTAGCAACATAGAACTATTCCGATAAGGATCTCCCAATTAAGAGGGTTATTTGCACCATTTGAGCAATCATTTGGTGCTTTTTATTTTATTCTCTTGCAACTCTCAAAAGAAAAGATTATATTTGCAATATAAACGAAAAAAAGGAGGACTAGTAATGAGTACAAATAACACCACAACAAGCACATCCTCTTTGGAAATCTCAAAGTGGTATCTTGCTTTAACTCACTCATGAAGAGCTCATAACCTACAAATATGTACTTGAAGCCAATAAGCAACAGCGTGTCAAAAATATGATAATGTTGATAAATTATTTGCTTCCTTATAATGGAATATAGGTTTAACACAATTCTAGTTTTGCCAACCTATTATTTCAACATAAAAGGAAATAAGGTAACACAAATATTGTTGGGAATGTTTTAATATAAACCTTTAGAGAAAAATGTCTAAGCAAAGAGTTTCAATTACAGATAATAGTATCAAAAATGGAGTGACGAATGATGCTAAAAAAGCTATATGTGAATTTATATGGAATGGTTTTGATGCAAAAGCTATTCATATTAATATTGAATACCAATCCACCGAATTAGGTGCAATAACTCATCTCTATATAATAGATGATGGCGAAGGAATAAATCGCAGTCAACTGAATGAGACATTTGGAAAATATCAAGACTCAATTAAAAAGCGTTCTTTCCAATGGAGTTCACAAGTAAAAGGACATAAAGGAAAGGGAAGATATGCTTTCAACTGTTTTGCAACTCGTGCTGATTGGACAAGTATTTATAGTGATGGTAAGAATTTGATTAAACACACAATATCAATCAATGCTGGAGACAATCATCATTTTAATGATCATAGCGACTCAGATAAAAATTATATTGTTCATAATGAAAAAACAGGAACCATTGTTTCATTTGCAAATGTTTCATTAAACCAATCCTTTTTTGAGTCAAAAGAATTTGTTGATTATCTAAAAAAAGAATATGCCGTTTTCCTAAAATTGAATGGAAACAAAGGTAAATCTATCATCATAAACGGTAAAGAATTAGATTATAATACAATTATTGCAGATACGGATAATAAGGAAATACAAATTAAGGATGAAACAAATAATAAAAACTATCATTTTGAATTAACTTTTATACGTTGGAAAGAAAAAATAAAAGAGAATTATAGTTATTATTTTCTAGATAATGGGCAGATTGAGCGCTTCGAAAAAACAACATCCCTTAATAGAAAAGATGTTGGTTTTCATCATAGTGTCTATATAATTTCATCGTATTTTGATGAATTTCATCCAACTTCGAGAAAAAGCCAAGATATAGATGAGGAATACTCACAACAGGAAATTACCTTTGATGAAACAAAATTTTCTAAGTCAGAAAAAGATAAAGTTTTTAAGCAGTTAATTAAAATAGCTGGTGTTTGGCTAAATGAAAAACAGAAAGAATATATTTCGTTAGTAGCAGGTGAGGAACTATGGCATAGATTTGAAAAGAAAGGGATTGTTGCTTATCCAAAAAATGAATATGAGCTTCCTCTCTATTTAGAATTAAAGAATACGGTAACAGGAATTTACTCTGTCCAACCAAAAATATTCGAAAATATTAGAGAGAAGCCAGCTAAAACACTGGTAGGCTGTTTAAAACTATTGTTACAGACAGATAAACGCGAAGATTTGATTTCTATAATAGATAGTGTTGTTAAAATGTCCGATGATGAGCGACATAAGTTAGCAGGTATTCTCAAAGTAACAGAGTTGTCGCATATTACTAGTACAATTGGACTCTTAGAAGATCGATTAAAAACAGTGTCTGCATTAAAAGCTATGCTTTTTGACAAAAGACTAAAGGCTTATGAAGTTAAAGATATTCAAAAGATAGTTTCGAGTGCCTTCTGGTTATTCGGTGAACAATACAATATAGTTACGGAAGCAGAGCCAGATTTTCAGCAGGCTTTAGAGGCATATTTAAATGCATTACACAATACTACAGCTGGCGTTGGAAAAAGTAAAGTTAGTGTGGATAAAATGCGTAGTCCAGATGTAAATAAAGAAATGGATATATTTGCTTTCCGCCAAACAAGAAACAGTAATACGGTAGAAAATATCATTATAGAATTGAAGAGACCATCAGTAAAACTAGGGGAGCTTGAATTGAGCCAAATAAAAACTTATATGAGGACTATATATCAAGAGCCACAGTTTTGTTCTGCAAGTGCAAAATGGACGTTCATTCTCGTAGGAAATGAATTAGATAATAGCGGAAGCATTGAAACAGAATACGATTCTAACAAGACATGGGGGAAGAGGGATTTAGTTTTACATGTTGATAAAAACTCTCAAAGATATGAGATTTTCGTTAAAACATGGAGCACTATTTTCGATGACTTTGAAATAAGACATGATTTTCTATTAAGGAGGTTAAATTTTCGACGTCAGGAACTGTCTGCACAATACTCTAATAAAGAAGATTTGCACAACATCGTAGATTCTGCCAAGGAGAAATAAAAAGACAACATCCTATGAATAAAATCATTTAGTGCTTTTTCGTTGATGACATTTTTACTCTGAAAATTATTATATATACCCAAATGTAAAAATGTCACTCCAAACGAGAAATTCTGATTTTAAGTACACTTTTACTCGATTTTTCAGAAAAGTGCGAAGCAGTTTGGATTTACAGAATAAGATACCTATATTTGTTTTAGATATTCAGGTAATCCCAACGATATAAAGGAAATATGGAGGTGCAACAATGTACCTCTTTATTTATTGCTTGAAGTGTCGATTTTTTATTTATAAGCCACTACAGATAAAATTGAAGTACAGATATACCTAATGAAAGAAAAACACATTACCAATGAGAGCCGTATGGGAATTGCCAATGAGAGCAATAGATACTACTAATAAAAAACTATTACTCATACAGCTCAAATCTTTCAGATTTTCGCTTATCTAAACTTATTGATTTAACAATGGAACATGATTATAAAAGTAACAACAGGAGAACAATTTGAAACCCGAAAAGAAGCCAAAGAACGTGTTGGAGGCAAATGGGCATTTGAGAGATTAGTAAAAAACGGAGAGATAAGATTTATAAAGGATGGTAATAACATAGCAAGCGATGGATTACACTACCCTAAGCAAGGAAATCGGAGCTTGTCTTAATCCACTGCAAGCATACATATACTTGGGATTATGCCTACATACAGATTTTAAGAATGATATTTCAAATATTAACCAAGATACGTTGGCAAAGTTCCTACAATGTGATGTTGGTGCTATACAAAGAGCCTTACGAGTGTTTCAAGAGAACAACTTGCTTACCATTCATCAAGACAAATATGGCTGGAGAGAGAAGAACAGATATAAGCTAATTAGAACTAACTGGTTTGGCGTTAAAAGAAAGATACTGGAAGAAAACATAACAAGAGAACAGATAGGCTTTCTCTTATTGTTGAAAAGTCTTTGCTACAACCACTGTAATTATACTGATTACTATGGCAAGGGCTTACAGGAAATAATGGCTCTTAAACGGAGCATGATTGACAATTATTTGAGGGTATTGGAAGCCAAACTATATATCAAGAGAGATAAGAAGAAGAAACGGATAACAATACTCCGAGATGATTTATTCCTAACAACAAAAGAATCAGAGAAAGAGAAAATTATGAAACTATGCCCAGAGTTAATGGGCGATGATGATTATATTGATGAACATGGACACTACCATTTTGTAGACTAAAATCAGCTATGCCCTACATCAAGTTTATAAAACTGGAAACAGGGTTACGAATAAAGAACTGAAAACCACCTTGCAAAAGATTTACACAGGATTTGGCTGTAAGAGAAAGGCAAAAGGAACATAGATTACAGAATATGGCTTCGCCACGCAAAGATGTAAAATCCCTACAGATAAGGGGAGAAAAGATGGAATGATATTATTCACAACATAACTACAGGCAATGATAACAAAAGAACACTTATAGGCACTCGTCAACGAGGCTAAAAGTCTTGTTGACATATTGAAGATACTGAATAAGAGGCAGAGTAAAACTAACATAGAAGAGCTTACTACTTTATTAGATAAATACGGTATAGATTATCATACTCTTCCCATTAGACTGATACAAGAGAAAATCCCCTTAAAAGATATACTTGTAGAAAACAGTACTTATCAATCTTCCAAACTAAAGAAGAGGCTGATTGAAGAGGGTATCAAAAAGGAACATTGCGAAATATGCGGACAAGGTAACACCTGGAATGACAAACTGCTTGTCTTGCAATTAGACCATATCAATGGTATACACACAGATAACCGATTGGAGAACCTGCGCATTGTCTGCCCAAACTGTCACACACAAACAGATACGTTCTGTACTCGAAAACTTAAACAACACAACTACTGTAAAGACTGTGGAAAAGAGATTACTCCTAAGTCTACTTGGTGTCCCAAATGTGCATTGAAACATAACCGTGTACACAAGGTTTCACCCTCTGATAAGCCCTCCAAGGAAGAGCTACTGCAGCTAATAAAAGAGAAACCATTTACCGAAATCGGTAGAATATACGGAGTAACTGACAATGCTATCCGTAAATGGTGTAAAAAGATGGGACTACCCTCCACGAAAAGAGAGTTAAACGCATTATATAAGAAAAACACAGATAGAGGATAAACAGCCATACCTCCTATTTCAAAGTGTGAGATTATTTTCACACAACTGACCTAAATATGTTAGTTTAGTACAGTATTATTCAAGATATTTTTGTAACTTTGTGATACGATAGAGCGTTGACCTATAGTTGACCTCTTATTCGCACAGAGTTCATTAACAAGCTGATAAATAAGCTCATATATAGTTTGTGTGGCGAGACTCATAATCTGGAGGTCCCTGGTTCAAGCCCAGGCTGGTCCACAAAGGGGTTTCAAAAGGATGTCAAACAACACGTCATTTTAGCCTTCAAAACAAGGATAATCGCTGTAAAATCAAGGTTTTACAGCGATTTTTTGTTTTTTATCCATATCCGTTTGACTGCCAAACTATACCTTTTGATAGTCATCATTCGGTCATTTTCTGCTACAGAAATTGCTACACAAAAGCAGGCTCTCAAAAAAATGTAGCAATGAATGGAAAATGACTATCAAATTGACGGCATGCAAAACTTTCGTTTTCAAGAACTTATGTCGAACTTTGTAGCCAAAAAGCTACACTAAAATGATGTGTTATGAAAACGACTTTCAAGGTATCCTACTACCTACGTTCCAACTATGAGAACAAAGAAGGAAAATCACCTGTAATGCTCCGAATATATCTCGGTGGCGAAAAGGCAAATCTTGGATCCACTAAAATTTTTGTGGATAAATCCAAGTGGAGTAACAAAACCAGCAGAATGATTGGCAGAACAGCAGAAGCACTCTCCATTAATGCTTCTATAGATGCACTGACAACAACACTAATGCAGATTTACAGGAAATATGAAACATCAGAGGAACTTTCCATAGACCTTATCAGGTCAGTATTCCTTGGTACAGACAAAGAATATACAACTTTTCTGCCAGTCTTTGACAAGTACATAGACTCCATCACACAACAAGTTGGTAAGACATTAACAAAGGGTACTTTTTATAAGTATAAGGTGGTGAGGCAGAACTTTCAGGATTTCCTACAAGCAAAGTATCATCGCAAGGATATTGGACAGACAGAACTGACAAGTGCCGTTGTTCAGGACTTCGAATTATACCTTACATCTGTTGTAGGTGGTGTGCATAACACAACCACCAAGAAATTGAGAAACCTGAAAACGGTCGTGAACTATGCGAGGAATAGAGGACTTATCATGCATGATCCATTTGCGAATCACAAGCTGCGCTATGAATTAGTAGATCGTGGCTACCTCACAGAAGAAGAGGTACTCCGTATTATGAAGAAGCACTTTGACATAGAACGGCTGGAGTTAGTAAAGAATATTTTTATTTTTTCCTGCTTCACAGGATTAGCCTACATTGACGTATATAATCTTACCTACGATAAAATTGTTACCGTAGAAGACAGACAATGGCTTATCACCAAGAGATACAAGACAAGCGTAGATGAAAACGTCATGTTACTTGACATTCCACTTGCCATTATAAGAAAGTACTACGACATCAATAGAAAAGGGGGGAAAGTCTTTCCTATGATGAGCAACCAGCGTATCAACTCATATTTGAAAGAAATTGCCGACCTTTGTGGCATAAAAAAGAATCTGACCTTCCACATGGCAAGGCATACTTTCGCCACTATGTCTATATCCAAAGGGGTACCAATGGAGTCTGTATCAAAGATGCTTGGACATACAAATATCAGAATAACTCAGATTTATGCACGAATAACAAACAAGAAAGTTGAACGTGACATGGAAGAACTGGCTGGAAAGCTCAGTAAGTTCAATACAGCCATGGGCATATAA
Protein sequences of DBSCAN-SWA_1 >CP022041|220916:251291|230632_231460_-|ASE18099.1|protease|DBSCAN-SWA MNNKNVLSATLYVILFVSLFILIQSLSFKGGKELVSLLGGTEAADGDYLTKNGETLAITAVISSLATLILFIATKWAPVSGNYLKTRPWAVLMWSALIALGSILPLQFLAEKINLTMPAGTEEMFESIMKVSWGYVALGLMVPIAEEIVFRGAIQRVLQNVLGERKRWIAIVVSALIFGIIHFNLAQGLHAFLIGLLLGWLYSKTGSILPGFVFHWVNNTVAYLMFNLMPQMNDGKLIDFFHGNDRMMYGGLFFSLCIFVPALFQLWVRIEKGNN >CP022041|220916:251291|248863_249517_+|ASE18115.2|DBSCAN-SWA MLKILNKRQSKTNIEELTTLLDKYGIDYHTLPIRLIQEKIPLKDILVENSTYQSSKLKKRLIEEGIKKEHCEICGQGNTWNDKLLVLQLDHINGIHTDNRLENLRIVCPNCHTQTDTFCTRKLKQHNYCKDCGKEITPKSTWCPKCALKHNRVHKVSPSDKPSKEELLQLIKEKPFTEIGRIYGVTDNAIRKWCKKMGLPSTKRELNALYKKNTDRG >CP022041|220916:251291|242162_243152_+|ASE18892.1|DBSCAN-SWA MSILFFANQGYAQQINSHLQAKSTLQNHVSSILKENIKKIDAINGQIIIMDCKSGEIKAMVNLMNTKSGIKPAIRQLSEPTALMRTVSLLAVLEQGKVNLNDSIDTKAGMLDINGYLLKDYNWLRGGNGKTSVQQGFLLSSNIATYLTVKQAFGKEYDRFYKAITNMGYGLPQNSFKAILPNPYNDISLAKFVNGKNQQISPLQILTFYNAIANNGKMVKPTFHKKDTTIIKEQIASKENIATIQQLLIQKRLTTKVFSDKIPIVGEQGVAIVKEDGDNTVYCLQFCGYYPSDNPQYSIIVSLNKKGLSASGGMAGEITKQIVETNNFF >CP022041|220916:251291|245279_247349_+|ASE18113.1|DBSCAN-SWA MSKQRVSITDNSIKNGVTNDAKKAICEFIWNGFDAKAIHINIEYQSTELGAITHLYIIDDGEGINRSQLNETFGKYQDSIKKRSFQWSSQVKGHKGKGRYAFNCFATRADWTSIYSDGKNLIKHTISINAGDNHHFNDHSDSDKNYIVHNEKTGTIVSFANVSLNQSFFESKEFVDYLKKEYAVFLKLNGNKGKSIIINGKELDYNTIIADTDNKEIQIKDETNNKNYHFELTFIRWKEKIKENYSYYFLDNGQIERFEKTTSLNRKDVGFHHSVYIISSYFDEFHPTSRKSQDIDEEYSQQEITFDETKFSKSEKDKVFKQLIKIAGVWLNEKQKEYISLVAGEELWHRFEKKGIVAYPKNEYELPLYLELKNTVTGIYSVQPKIFENIREKPAKTLVGCLKLLLQTDKREDLISIIDSVVKMSDDERHKLAGILKVTELSHITSTIGLLEDRLKTVSALKAMLFDKRLKAYEVKDIQKIVSSAFWLFGEQYNIVTEAEPDFQQALEAYLNALHNTTAGVGKSKVSVDKMRSPDVNKEMDIFAFRQTRNSNTVENIIIELKRPSVKLGELELSQIKTYMRTIYQEPQFCSASAKWTFILVGNELDNSGSIETEYDSNKTWGKRDLVLHVDKNSQRYEIFVKTWSTIFDDFEIRHDFLLRRLNFRRQELSAQYSNKEDLHNIVDSAKEK >CP022041|220916:251291|224524_224905_+|ASE18094.1|DBSCAN-SWA MRKTTDILLSLLLSTMIVWIGSGIMTVVCAHTGNVSVAKMEDKGHCNDNGVKNCMKIEVKSLSPTDMAHNDFYQIQPIQLSLLPQFVSDCQLLPLPVLTKAPERILSLLWHSPPRQYLRLLTTLII >CP022041|220916:251291|243156_244032_+|ASE18111.1|DBSCAN-SWA MNIYLINPKWIDSTKFHSFLEILKVKTIISFINEEDYTKNLLPKYTAIKVLDFAKTPLRMYGNNFEYNKKIVDKFLQQAVYEFISPFSQLAKDTKGKISICKVNDAYKQSTFNNPIYNIEKLNDPNMLILYNSYKNEHPLYRAVARFKRYAFSILDKYCPQFDYKHRIAFVSFDEFLFSNSVFSNSVGNSDYFFRTKYYSIYSANKIDDLFCPMLVYLNLKANQTHYKEQTLQELFLRDGAKRAMEKFVIESRDAELLSKSCSYDCDDSNDWGQEGELEYIYENGGDWILD >CP022041|220916:251291|241903_242110_+|ASE18110.1|DBSCAN-SWA MKWLTTIIVVAIIAGILGALNSKDGEKGEGFFSGALAGGMGCGYVIFEIFLVVGGLILLFKLFGFLFG >CP022041|220916:251291|221376_222246_+|ASE18092.1|transposase|DBSCAN-SWA MDHAQDWLLFEDNIGESLSIDETCLSCGEVYTFLTNKAGKGREGTLVAVVKGTKAEDVIQILKKINLSKRKTVKEITLDLSSSMMRIARAVFPKALITNDRFHVQKLYYDALDDMRIAYRWMARDKENEEIKEAKCKGKEYIPFRYSNGDTRKQLLARAKFILTKHKTKWTETQKGRAEIIFEHYPTLKKAYDLAMKLTDIYNIKSIKDAARLKLAKWFNEVEELGVDNFYTVIDTFENHYQTILNFFVNRATNANAESFNAKVKAFRAQFRGVTDIPFFLYRLMKLCA >CP022041|220916:251291|240516_240720_+|ASE18108.1|DBSCAN-SWA METNYVVILDFSTATVIKIHLSKEQIEESYRYNDFEEFLFTLEPKYGFQVKDCCWMMCETLKEENYL >CP022041|220916:251291|249702_249963_+|ASE18116.1|DBSCAN-SWA MCGETHNLEVPGSSPGWSTKGFQKDVKQHVILAFKTRIIAVKSRFYSDFLFFIHIRLTAKLYLLIVIIRSFSATEIATQKQALKKM >CP022041|220916:251291|222731_224462_-|ASE18093.1|DBSCAN-SWA MFRSFFNRSFYESVSIILVALLLSSIARRNLSLTRRHIQRCAVDVTEKSYYQLDVDGKPAAYFSNYTDSTFVDGSLPGDSIRSRMSVTEGYWVNKLPILPSCFGRIAVIRRHYPSTIINLKDQKLRQVLMYALMKMDIKLVGLQSKRNETAYYLRVHSVQDYGYNKIADYHTAVVHQIDSLQRIIHVLQDIKPTARLRIRQLSQYTARVWYKPGEIMVCNRLNDESNRWALCLQTSDCMTPWRISSRFGRERGMQESLRHFKAYSHPVLGFPAGALIDSAGYYVGEQRGGKAEGYGQYYGNDGSFYDGHWQDGKRNGFGINVAPHDFLKVGEWKNNVFKGERLTYNADRIYGIDLSKHQHEKGEQRYSIAWSKLRITHLGTLSNKKVKGIVDYPVSFAYVKATEGRTLLNSYYYSDYLAARQQGIRVGSYHFFSTLSSGYMQANFFLKKARFRKGDLPPVLDVEPTDAQIHAMGGAEVLFKQVRIWMSMVFKHTGHRPILYISQSFANRYMPLAPDLGDNYHIWIARYGEYKPNFKLAYWQLSPDGKVRGIHGDVDINVFNGYRNQYEEFLKRHCF >CP022041|220916:251291|239768_240041_+|ASE18106.1|DBSCAN-SWA MNKEIDKALHLIYTLQEQLCNMPHREGCLKMLYQIRSLMEDFDSMLTPDQRKYGIYSNAYKEIFANGTGLSQYDKVCATICEEEYAELPF >CP022041|220916:251291|250067_251291_+|ASE18117.1|integrase|DBSCAN-SWA MKTTFKVSYYLRSNYENKEGKSPVMLRIYLGGEKANLGSTKIFVDKSKWSNKTSRMIGRTAEALSINASIDALTTTLMQIYRKYETSEELSIDLIRSVFLGTDKEYTTFLPVFDKYIDSITQQVGKTLTKGTFYKYKVVRQNFQDFLQAKYHRKDIGQTELTSAVVQDFELYLTSVVGGVHNTTTKKLRNLKTVVNYARNRGLIMHDPFANHKLRYELVDRGYLTEEEVLRIMKKHFDIERLELVKNIFIFSCFTGLAYIDVYNLTYDKIVTVEDRQWLITKRYKTSVDENVMLLDIPLAIIRKYYDINRKGGKVFPMMSNQRINSYLKEIADLCGIKKNLTFHMARHTFATMSISKGVPMESVSKMLGHTNIRITQIYARITNKKVERDMEELAGKLSKFNTAMGI >CP022041|220916:251291|244049_244865_+|ASE18112.1|DBSCAN-SWA MKKIETQEGVYIIQEREHYGIINENNTIIIPCIYESIEYIPTDNIYLIRLNKKYGVLNTNFKTIVPPVYSGLYLWGKKFIAIIAGGLSENFNWLWIYQKRKSHNAFYNPKYSLLDKDGQEIFHPRYDTITPVDESFATVTLGDYIGLINSCGIEIIPPIYDMEKDSMIEKIANGIEELKGRIHNKSNRYYRWELTDGKIREKRIFCFSQGNTLYVMDEEGTVFGSQQYLIIPRFDKTDSICNKDIINLLSNNDTEKIRKQDFDFSNIELFR >CP022041|220916:251291|238665_239688_+|ASE18105.1|DBSCAN-SWA MNNVTLMPAVVAQNNRNFDFSDAEEAVIIEDTPTEGSSVSFLEANTNAISLDELASTCVVPTWGNQELTIAHQDFINCVHDAAKDFYHGEQINTPAIRVSHIVRGRTPNALGKKASELLECEKTQFYQRMAFAFTIPTIYETLNGQKLELCVGGVRNYNDLNLYRASKGVEKFSIFVGWRVNVCSNQVLTGDGVKLSIEVMSIRDLYKSVMELLYHFNPAKDIHLMQTLSNTSLTETQFAQIIGRMRMYQALPQGYTKQIPRLLITDSQVNSVCRGYYSNPDFGANDDSLSMWDFHNLLTESNKGSYIDTYLQRAVNTTEVAVGINNALHGDEKYKWFIG >CP022041|220916:251291|233008_234277_+|ASE18101.1|transposase|DBSCAN-SWA MLTIKAEIQRDKLRQDGSYNVRIRFTKDRKVKRISTSLFATKADLTDRFTIKEDSLIKQEADILILHYRKMFNEMHLETEALDVNEIVARLTSREESDKPVDFIQFAKGWIANSTLKGAVNYTSALNSLIRFNKSDKLYTHQITSEFLQEFMAFLLNESKERAEQLKKKGKRVPSTRSTSLYLMSIRRLFKEAVKQYNKPDQGLIRIRNTPFVYFQIPKQEATRKRAITAELIRKIEQLPYQTVYKGIHHTNRFNLAKDCFILSFCMIGINSVDLFNATEYDGNTITYYRTKTRDRRMDKAKMIVTVPKMLHPLFEKYKDTTGKRIFNFYQNYVNEKAFNKAINKGLKEIGSILKIEDLEYYAARHSWATIALNKVGVSKYVVHEALNHIDESMRVTDIYIERDFSNENKANAKVVKYVFGK >CP022041|220916:251291|247965_248586_+|ASE18114.1|DBSCAN-SWA MDYTTLSKEIGACLNPLQAYIYLGLCLHTDFKNDISNINQDTLAKFLQCDVGAIQRALRVFQENNLLTIHQDKYGWREKNRYKLIRTNWFGVKRKILEENITREQIGFLLLLKSLCYNHCNYTDYYGKGLQEIMALKRSMIDNYLRVLEAKLYIKRDKKKKRITILRDDLFLTTKESEKEKIMKLCPELMGDDDYIDEHGHYHFVD >CP022041|220916:251291|230016_230373_-|ASE18891.1|DBSCAN-SWA MKTISKEIDSRYSACPIRQVITRIGDKWSMLALYILGKAGKEKMRYSEIHSYMLDCSQKMLSQTLKNLEKNKLVNRKVYPEVPPRVEYGLTELGRSFLGPLDALFDWGTKNMDKMVKD >CP022041|220916:251291|236094_237903_+|ASE18104.1|DBSCAN-SWA MKKEITISMSYGKVQHLSEVLPLIPTNTILCKTLTGIGATYGEIRSARNSIIIEPNRPVIYGKCRAPKHKNDNLFGVYEGIYTQDIVNYIIKSQDRNKKIKILTTPESFYKVRTAFEQMGIDIRTDGYFLLFDECQKIVRDCGYRKNISLPMDFFFECQDKAMVSATPPSEFADSRFENFDILIIHPDFNYKKTITVCTTNNVLERTRQLLLQLDERPVFFFVNSTDIILAMMEQLGLRDQSAVFCSADSVDKLKSQKFSNAHENWEKKYMAKYNWMTSRFYNAMDIELNENPNVVMITDCYTADYTMIDPYMDAVQITGRFRNGTNAIYHISNFDKRIPIKDKKSIIIRYQCDKEIYENFRTFKNCETDINRRAAFNDAMNVLPYNRFLNDYSKEDAFKIDNYIHEELVRVMYHDYTRLYQGYNECGYFEVTTSNITYRYGDYDRLKITNNTLPIKDIRKRIVDQLEALKEDETEMAMQYRDELRHIDSFIVKAWEVLGKQTIEELNYNSKSIREKIILTLHEKKAKSSDVIRMIYNSFISRHWYASSFIKQELKRIHNLLGVPKSKAITAKDICQYFEVTEKRKAKERGYYLISPKFTTE >CP022041|220916:251291|229176_230010_-|ASE18098.1|DBSCAN-SWA MVNTSYSNTTTQILKVKTEQVAYRQLGDVKNPPLLCLNHLAATLDNFDPAIMDGLAEHFNVIGIDYVGVGKSTGKSRLTIEEMADDVIDFAEAYGLKDIRLLGFSLGGFVAQQVLLRAPALTTKAILAGTGGAGGVGIKNVAKITYWDMFRAFLTGRDPKYYLFFPMIEQGKKAANNFLKRIARTEDKDDKIKLSSFQRQLRAIDVWGSREKDDLSKIKIPVWVVNGDNDRMVPTPNSYDLAERLPNAHLTIYPDAGHGGVFQYHEVFVPEAIEFFK >CP022041|220916:251291|235480_236044_+|ASE18103.1|DBSCAN-SWA MNNKTLDLRIDNAETSASTGTQNNKSMNKGTFTLPTLQKAQNQYDNIGAYNQLQAMMKAGQKLSVKFYTGKGNKPCAWIESKQVTGFRYEVKDTSFKGLMNYLISGEVADFDTSPMEAQPLKEETNYALLVLKLMLEEHWGIQFTPLFRERSNYITAQKVLRKGTIYFQIPRTDENVELLRDYDVTI >CP022041|220916:251291|231471_232443_-|ASE18100.1|DBSCAN-SWA MNTIYIKRDEAHELVEQVATIGFFDGVHRGHQFLISRVIDEAERSGMASAVITFDRHPRQVLQADYQPELLSTLDEKLLLLSKTHIDNSFVLHFDASLAALSAHDFMQEVLCKQLNVKKLIIGYDNRFGHNRSETFEDYVQYGKEMGIEVIRADAFLPNDEKVSSSVIRNDLRAGNIEAANRLLGYPYTIESRIVSGYQNGRKMGFPTANLDVNGCQQLLPASGVYAVMVRLKDSVEWKRGMMNIGHRPTFNGTTTSIEVNLFNFTGNLYGQELLVSFISKIRDEHKFDSLEALAQQLKQDKEQINRLFDETYHIQENIINTP >CP022041|220916:251291|234407_235283_+|ASE18102.1|DBSCAN-SWA MTFEINKTEYIAVIAGMDCLPRINEDEIIDPSYIELAQKAATDHFANRFSKLFKDNQEKYELFLGNNHIQKQLSKWQLDPIKFWYLHLFIIDYATDAFRSVTIFEELSTKQYIEKLIKSLEKTDIENFCLDIRINKEKLSTRNPWVGEALLSALKQDFPFDWVNNNTFVSKDIKEDIERYTTARMKFSTELYSHFFDQYLHKSQRGRKSFIGKFLYLADLTRDERYWYGCKPTLEKVCKKYEIAQYQKIMVEGVEHIAIPFDVGKDLSDHIKKCTMIPQVRHSFYFSPVDE >CP022041|220916:251291|227206_227536_+|ASE18096.1|DBSCAN-SWA MKKAIFTMLMLMVAMIATAKDIKTVIFTTTPQMHCANCEAKVKNNLRFAKGVKEIKTSVEEQKVYVTYDAQKTNEEKLQKAFEKFGYKAEKTTKEAKIPVHENEECENM >CP022041|220916:251291|240067_240517_+|ASE18107.1|DBSCAN-SWA MDMDFKVAEVTLQYKPTCKKQSKVFGSEEAYNILLPTFKEGTIEYREYAKVLYLNQANEVIAYNTISEGGLTETAVDVRIILQGALLTNATQIIFAHNHPTGNLKPSPQDDMLTRELQKACQIMRIRFTDHIIMSVDSYYSYREEGRIL >CP022041|220916:251291|225063_227112_+|ASE18095.1|DBSCAN-SWA MNKIIYLAGLTLLSVLPATAQEAAVDTTATMKDQTLDNVTVTTRRASVHSLAGAINGKDILRDELFKAACCNLGESFVNNASVDVNYSDATTGAKQIKLLGLSGTYVQMLTENLPNFRGAALPYGLGYVPGSWMKSMQVSKGNTSVKNGYEAMTGQINVEYVKPEDPTGMSVNLYGNTVGRFEANADGNIHINGNKNLSTEILAHYENNWLHHDGNGDGFQDAPKVRQYNLQNRWFWKKGDYILHAGLSLLKEDRVSGQTPHAAATNPYRIDIGTDRYEAYMKHAFVLNKEHATNIALMGNVSMHKQQAQYGIKQYDVNEKNAYASLMFETNFTDEHNLSAGLSLNHDYLHQTLTLPATAVPSDYGNIYPLTRGIESETTPGAYVQYTYNLHSRFIAMAGLRVDHSNVYGTFFTPRFHLKWVPADFLTLRLSAGKGYRSPHALAENNYLMASGRRLIIDKLDQEAAWNYGASLNFNIPIGNKMLKINTDYYYTHFLSQMLIDYDSNPQELHITNLNGKSFSHTFQIDASYPLFKGFELTAAYRYNLVKATYGGKLMWKPLQSRYKGLLTLSYKTPLGVWQFDATAALNGGGRMPESYTLASGERSWAGSYKAYGQLSGQVTRYFRHFSLYIGGENLTNYKQKNPIIGYQNPWANSFEPTMVYGPVEGAMAYVGIRLNLGKRQ >CP022041|220916:251291|220916_221261_+|ASE18091.1|transposase|DBSCAN-SWA MKNHQLLRCIFPDVLADYFDVVDIQESVSQIDFWLDERNFMEKSDHKLGTVSSYGFTSERVIQDFPLRGKAVYRHVRRRKWRDSSNGEIFTYSYDDLTAEGSRLSPEFVSFLKE >CP022041|220916:251291|240833_241421_+|ASE18109.1|integrase|DBSCAN-SWA MSLKYSNTTADYLEWSEAMNLIRRLTKDKNYKISLLIAIGCFTGLRISDILTLRWKQILGVSEFTITERKTAKQRTIRLNKELQLHIKDCYEHINPLGIAAPVLVSQKGTVFTVQRINVILKELKQRYRLKIKHFSCHSLRKTFGRQVYNMNSDNAELALIKLMELFNHSSVAITKRYLGLRKEEILETYDCLTF >CP022041|220916:251291|228165_229167_-|ASE18097.1|DBSCAN-SWA MKAIQIDGYKKGDVKVALRDIEMPKVGDNDLLVKVKAAAVNPLELLIAHGDVKMVVPYHFPLTIGNEMSGVVEQVGASVKDFQVGDRVFGRLPINRIGAFAEYVAVDAAAVAKTPDYLSDIEAAAVPLTALTAEQALDILKLKSGNSLFISGGSGSFGAMAIPLAAARGLKVITSGGAKAKERTMELGVSQYFDYKTEDYTKELKDIDGAIDSLGDKELPRLFSVLREGATLVSLKGMPNGSFAKSFGLPLWKQWLFKLVGMKNEKLASKRNQHYHFIFVTADGAQLRNATAILEARQIRPEIGNVYTLEQAEEALSDVANHRSQGKVIICNS |
29 | unidentified_phage(33.33%) | transposase,integrase,protease | attL 246311:246325|attR 256553:256567 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
918244 : 925858
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >CP022041|918244:925858|DBSCAN-SWA CTTATAATGTATGTCCATCAAGATAATCTTGAAGAACTGTCTTATAACTATCTGAGATTGGAAGGATGGAATCTCCGATTACAACGCGGAAACGATCAATGCCATCAATCTTCTTCATGTGCACGATGTAGGAACGATGAATGCGCATGAATTCAGGTTTAGGAAGTGACTCCTCAATCTTCTTCATGTTCATTAGACTCATGATTGGCTTATGGTCGTTAGCAAGATAAATCTTAACATAATCCTTCAATCCTTCAATATAGAGGATGTCATCAAACATGATTTTAACCAGTTTGTACTCGCTCTTCACAAAGATAAAGCGGTCGTTCGACGCATTCTCTGTCTGACGGGTACTTTGGAACCAGTCTAAAGCCCTGTTTGCACCAGCTAAGAAGTCATCATAGCTAATTGGCTTCATAAGATAATCAATTGCATTTGCCTTGTAACCATCGATAGCATATTGGCTGAATGCAGTTGTGAAAATAATCTTAGTTTCCTTTGGTAGAATCTTTGCAAACTCCAGTCCACTAAGTTCAGGCATCTGTATATCTAAGAATATGAGGTCAACCCTATTCTCACGGATGGCTTTTACGCCTTCTACAGCACTATTAAAAACACCAATTAAGTTCAAAAAGAGTGTTTTCTTAGCATAACTCGCAAGCAAGTCTGCTGCTAAAGGTTCGTCGTCAATGATTATACAGTTTAGTGTCATAAATGATAATTTTAGATGAATATGTATTTCTATTACTATCAAATCCCTTCTCCCATTTGTACTTACCAGGGTAAGCAAGCTCAAGACGGCGCTCTACTTGTTTGAGACCTATGCCATGGCCGTTCCTTTCTGCCTCTCCCTTTGGGTTATTACTGTTTTGTATGGAGAATATTATTTGCTCATTATTTGCATCTAACTTTATGTGAATAAAGTTCTTTTGCTCTGGATTAATACCATGTTTAAAGGCATTCTCTAACAATGAGATAAATATCATTGGCGCAACATGGATGTTACAATTAGATGGGATGTTACACTCGCGTTTGATTTCAACATTCTCTGGTATGCGTATCATCATCAAGTCGACATAATTGTGTAAGAACTCTACTTCTTCCTTCAGGCAGACGTAAGGCTGTTGGTTGTCATAGAGAATATGTCGTAACATCTTTGAAAGGTCGATAATTGCCTTCTGCGCCTTTTCTGTATCGAAAGCTGTCAGCGCGTAGATATTATTCATTGTGTTGAGCAAAAAGTGTGGGTTAACTTGCTGACGTAAGTTTACCAACTCGGCTTTTGTCATTGCTGTTTCTGCCTCACGCTTTTCCTTTTCAGCCTTTGACCAACGCTGTGCCATTAGTATCGATGTTGCAACACCAGCACAGATACTTAGGTTAAAGACATTACGTATAATAAAAAAGATGTGTGGGTATACGTACTTGTTCTCTTTGTTTTCTACAAAGATTTGGTATGGATAAGTCATCAGGTTTTGCTCACCTGAGATGTAGTATAACCATTCGTGCTGTACAATAGATAAGAAAAGGATGAGCAGTGTATTGATTACCCAAAACTCTTTCTTCCGTCCCTTTAATAACATGGGGGCAAGTACAAAATAATTGGTATATGCAACTGCCATCATTGAAAGGGGAATGATATATATCGACAAGTCGTTTTGCCATGATGTATGTCCTTGCGGAGCGTACAAAATGGGTATGAGGAAAAGCGGAAGCCAAATGATAATTTGGTTCCAAAAGCTCTTAAATATGTCTGTTTGTTTCCTCATTTATTACTTTTATTTGTCTTTACAACTATATCTGCAAACAAAGGTAATAAAACTTTTTGTAAAATAAAATGTATTGATTGAATATGTGAGTTTTTTTTACATTAACCTATTGTTATATATATACAAAAGGTTTTTAATTAACTTATTTTTATAATAAAACAACCCTAAACAACACTGAAATAGCTTTCTTTACTCTATTTCCTACGTTGTTTAGGGTTGTTAGATACTATGATAATAAATATTTACAATATCTATTTTGTAAAGCTATTTAAGCGTGCTATATATTTGCCAAGAATGTCAAACTCGAGGTTTACAATAGTGCCAATATGAATGTCAGCAAAATTGGTATTCTCTCGTGTGTAAGGAATGATGGCTACCTTAAAGCTGTTATCTGTTGGTTCGCAAACAGTCAGTGAGACACCATTGACCGTAACTGACCCTTTATCAACAGTGAAATAGCCACGTTGTGCCATCTCTCGGTTGCACGGATATTCGAAGGTGAAATAGGTACTACCATCTGCATCTTCCATATTAACACACTTAGCTGTTTGATCAACGTGACCTTGTACGATGTGTCCATCTAATCTACCGTTCATAATCATTGAACGTTCTATATTCACACGATCTCCCACTTTGAGTAAGCCTAAGTTCGAGCGTTCAAGGGTTTCCTTCATGGCTGTCACCGTATAGGTATCGTCTTGGAAGCGAACGACTGTAAGGCACACTCCATTGTGTGCAATACTTTGGTCAATCTTCAATTCCTTTGTAAATGAACATTGAAAAGTGAAATGGATATTCTCCTGTTCATGTTCAATCCCTCTAAGGATTGCCATCTCTTCTACAATTCCTGAGAACATTGCTAAATTTCTTTTATTATCCCCTTTAATAAAGCCTAAATGATGGTATAAGTATTGATAGATAATCTTTTATAACTTCACAGGCTACTTATAGGAGGTTAAGTTTTACATAATTACTATACGTTTGAATAATTGAAAAAGGTGGGGAATCACTTGGAAACCCCACCTCAATCCCTCTTCTTTAAAGAAATGGGAATTAGAAAAGTCGTATCGACGATGTGATGAAAAGTCCGAGTATCAATGATAAGAGAGCTCCTCGTCTTTGTAATCCACTGACCTTGCGGCTTAGTCGCTGAAATGCGATTTATGTGGCCCGCCATTAACAAAAGAAGTCATCTCGGGTTTTCTAATTTATTTGATGACTATCCCGATCGAATCGATACGACTAAATCTGTATCCTTTCTGCGTGCAAAGTTAGCATAAAGATTTTTATCCTCCAAACATTTTATACTTAAAAGTATTTACTTCTGGAGGTGAATCCCTTTGTTCTTTGTGTTGAAAGGAGCAGAAATGATAATGTTCTTTTGTGTATAACTTTAATAGTTTATTGGTTTTATAACCTTCTTTTGATAACAATTTGCCCTCGTCATGAATACATATTTGCAGGTGTTTATCGCTCCGCACCATTGGTGCTAAGCATCAACACGACATGTGCTAAGCCTCCGCACCATACGTGCTAAATATTGACACGTTGATAAAATGAGGGAAGAAAGGCTCATTATTGTCATTTTAGAGAATGATACTAATATCCCTATTTTTAGAGAAGAGCATCTTGTAGAATTTCACTTAACGTTGGATGGATATGAATGATGTCTCGTATCTTATCCAGTTTAGCATCATAATTCATTAAAGCTGCAACTTCTTGAATAAGGTCGGCAGAGTGAGCACCATAGCAGTGTGCACCTAATATAGAGCCATCTTCAGCTATAAGCACCTTTATCATACCTTCAGTTTCTTCCATGCTTAAAGCCTTACCATTGGCACGATAGAAACCTTTTCGAGTAGAGTACTTGATTTCCTCAGCTTTACACTGGTCTTCAGTCTTACCTACGCAGGCTGCTTCTGGATATGTAAAAATCGCAGATGGCATGATATCAAGGCGGATGTTATCGTCCTTTCCAAGAATATGATTGACAGCACGGAAGCCTTGGAAGGTTGCTGCATGCGCTAACATTTGGCGTGCATTGACGTCACCGATAGCATATACACCCTTCACATTTGTTTCCATATTGTCATTGACGACAATACCCTTTGCGTTTACTTCTATACCAGTCGATTCAATACCGATATTGTCGAAGTTGGCTTGACGACCAGTAGCAATCAATACAAGGTCGGTATCAATTCTATCTTCTTTACCCTTTTTATCGAATATAACGGTTGTATACTCTTGTCCACTCTCAGCAGGTGATAGGATTTGTTTTACGGCACTCTGCATGTAGAAGGTCACACCTCTTTTCTCTAATGTCTTCCTTAAGCGTTTAGCTATATCGCTGTCAATAGGAGGGAGGCATTCCTTCATAAACTCGATAACTGTTACTTCACTACCGAAAGCTGAGAAAGCTGAGGCAAATTCCATTCCGATAACTCCTGCCCCAATGATTGTGAGGCGTTGAGGAACCTTCGCAATAGAAAGTAATTCCGTGGATGTGACAATGTTTTGTGCTGTTTCAGACTGGCTCAAGAAGTCTTCTTCACTCATGAAAGGTGGCATCTTTGAACGTGAACCAGTCGCAATGATGATATGCTCAGCCTCTATTTGTTCGCCATTTACTTCAACAACATGGTCAGAAACAAAACGAGCCTCACCTATAATAAAGTCTATACCAGGCTGACTAAGTAAAGTGCTTACGCCCTCTCTAAGTTGATTGATAACTCCTTCTTTCCTCTCCATTACCTTCGCAAAATCGAGTGGGGGAGTCGTTTCGTAAAGAGAAGAAGTCGTAAGGCGTAGCTCAGCATCATGTGCAAGGCATTTCGTAGGAATACAACCAGCATTCAGACAGGTTCCACCAGGCTGTGCTTTCTCAATGATTGTCACTTCCAAACCATTCTGAGCAGCATAGGAAGCGGTTCGGTAACCACCAGGTCCCGAACCGATTATTAATAGATTTGTCTTTTTCATATTCAATCTCTTATAGAGTTACTGTGGAGATACGCTATGCATTCTCCTCTACGTCGTTGACAAGCTGACGATAAGTAAGGATTGGATGCTTAGCTGCAAGAACATCATCTACACGTCCAATAGGTGTGTTGTATGGTGCTCCCTTAACAAGCTCTGGGTTCTCAAGAGCCTCTTTAGCAATGGTATGCATCACCTCTATAAATCCATCAATAGTGTCCTTACTTTCTGTTTCTGTTGGTTCAATCATCATAGCCTCATGGAACAGAAGTGGGAAATATATTGTTGGAGCATGATAGCCATAATCGAGTAGGCGTTTGGCAACATCCATTGTTGTTACACCGGTACTCTTATCTTTCAAGCCATCGAATACAAACTCGTGTTTACAAAGTGTGTCAATAGGAAGCTCGTAATCATCCTTTAGACATTCCTTGATATAGTTCGCATTTAGAGTTGCAAATGGTCCAACCTCCTTGAGATGCTTCTTGCCAAGTGTGAGAATATAAGTGTAGGCACGTAGAATAACGAGGAAATTGCCGAGGTAGCCACTAATACGAATATTATCAGATGAGAATTCTCCTGTCGTATCTGGATTGTCAATAACAAAACCATCCTTCGTCTTCTTTACATGTGGCTTTGGTAGGAATGGAATTAGTTTCTCACCGACACCGACAGGACCAGCACCAGGACCGCCTCCACCATGTGGTGTAGAGAATGTCTTGTGTAGATTGAGGTGAATAACATCGAATCCCATGTCTCCAGGACGTGCTGCACCTAATAGTGGATTGAGGTTGGCACCATCATAGTACAGCAGACCGCCACAGTCATGTATGAGCTTAGCAATCTCTGGAATATCTTTCTCAAAGAGACCTAAGGTGTTAGGATTGGTCATCATCATACCTGCAATGTCATCACCCAAGAGCGGTTTTAAGTCGTTGACATCAACAAGTCCCTCAGCTGTACTCTTTACTTCTACAATCTCTAAACCGCAAACAGCAGCAGAGGCAGGGTTAGTACCATGAGCAGAGTCAGGTACGATAACCTTTGTTCGCTTTGTGTCACCACGCTGTTGATGGTAAGATGCTATGAGCATCAAACCTGTTAACTCGCCATGTGCACCAGCGTATGGGTTAAGCGTGACTTCTGCCATACCCGTAATAGAAGCAAGAGCACGTTGAATATTGTATTCAACCTCCAATGCACCTTGGACTGTTTCAATAGGTTGGTGAGGGTGAAGGGCTGTAAAGCATGGCATTGAAGCTATCTCTTCGTTGATGACCGGATTGTACTTCATCGTACAAGAACCCAGAGGATAGAAACCGTTATCAACGCCAAAGTTATTTTCACTATGATTAGTATAATGACGTACAACCGTCAACTCATCACATTCTGGTAGCTCTGCATCTTTCTCTCGTTTGCAGAAGTCGGGTAAAGGATGATGGCCAAAACGATTCTCAGGGAGGCTGTAAGCACGTCTTCCTGGATGTGACAACTCAAATATCAAATTGCCATATAGTTTATTATTCATTGTTGCGTAGATTTAGAAAAACTTTGTGTGACTATAATAATCCGACGAGTGTATCAATCTCCTCTTTTGTACGTTTCTCTGTAACGGCAATAAGGAGTTTGTCATCATCAACCTTGATACCTGGGAGAATACCCTGTTTGATAGCCTTGTCGAAGAAGGTATCGCGTTCCTTTATTTGAATAAGGAACTCATTAAAGAAAGGCTTGTTGTGAACAAGCTTAACTTTACCTGTACTAAGGAGTTGCTCGCAAAGGTAATGTGCACCATCATAGCCCATCTGAGCAGCCTCTTTTATACCCTCTTTTCCCATCATACTCATGTAGATTGTAGCATAAAGCGCCATCAAACTCTGGTTAGAACAGATGTTTGAAGTTGCTTTCTGACGACGGATATGCTGTTCACGTGCTTGTAGTGTCAGTGCGAATACACGCTGTCCACGGCTATCGCATGTCTTACCAACGATACGACCTGGGAGTTTTCGCATTAGCTTTTCAGTTGAACACATGTAGCCAGCGTATGGTCCACCAAATGCCATTGGAAGTCCAAGACTCTGAATATCGCCTATAGCAATGTCTGCACCCCATTCTCCTGGTGTCTTTAATAAAGCAAGGTCAGCTGCAACACTGTTAATAATGAACAGTGCCTTTTCTTCATGGCAAGTATCAGCAAAGCCTGTAAAGTCTTCAATAATACCATGTCGATTAGGCTGTTGTACAATAACTCCAGCCACACCACCTGCCTGTAATTGGTTTTTTAAGTCTTCGTGTGAAGTCTCTCCATCAACCGCCTTGATTGCCTTGAGCTTAATGCCATGAAAATGTGCGTAAGTCTTTAATACGCCAATGATATTCTTGCATAAAGTCTCAGAATATAATACAGTGTCAGCCTTCTTTGCATTATCAAAAGCCACCATCATAGCTTCAGCAGTTGCTGTAGTGCCTTCATACATAGAAGCGTTGGCGATATCCATGCCAGTAAGTTCAGCCATCATACTTTGGAACTCAAAGATATAATGGAGTGTTCCTTGTGATATTTCAGCTTGATAGGGGGTATATGATGTAAGAAACTCCGAACGACTGAGAAGATTCTGAATAACACTTGGCGCATAGTGGTCATATACACCTCCACCTGCGAAGCAAGTAAGCTTGTCATTCTTTTGTCCAAGTTTCTCAAAGAAAGCGCGTATTTCTAACTCGCTCATCGTCTCAGGAATATCATAATCTCCCTTGAAACGGATGCTCTCAGGTACTTCAGCATATAGGTCTTCAAGCTTCTTTATGCCAATACGGTCAAGCATCTGTCGTATATCCTCACCTGTGTGAGGTAAAAACTTATGAATCAT
Protein sequences of DBSCAN-SWA_2 >CP022041|918244:925858|924547_925858_-|ASE18576.1|DBSCAN-SWA MIHKFLPHTGEDIRQMLDRIGIKKLEDLYAEVPESIRFKGDYDIPETMSELEIRAFFEKLGQKNDKLTCFAGGGVYDHYAPSVIQNLLSRSEFLTSYTPYQAEISQGTLHYIFEFQSMMAELTGMDIANASMYEGTTATAEAMMVAFDNAKKADTVLYSETLCKNIIGVLKTYAHFHGIKLKAIKAVDGETSHEDLKNQLQAGGVAGVIVQQPNRHGIIEDFTGFADTCHEEKALFIINSVAADLALLKTPGEWGADIAIGDIQSLGLPMAFGGPYAGYMCSTEKLMRKLPGRIVGKTCDSRGQRVFALTLQAREQHIRRQKATSNICSNQSLMALYATIYMSMMGKEGIKEAAQMGYDGAHYLCEQLLSTGKVKLVHNKPFFNEFLIQIKERDTFFDKAIKQGILPGIKVDDDKLLIAVTEKRTKEEIDTLVGLL >CP022041|918244:925858|920263_920869_-|ASE18573.1|DBSCAN-SWA MFSGIVEEMAILRGIEHEQENIHFTFQCSFTKELKIDQSIAHNGVCLTVVRFQDDTYTVTAMKETLERSNLGLLKVGDRVNIERSMIMNGRLDGHIVQGHVDQTAKCVNMEDADGSTYFTFEYPCNREMAQRGYFTVDKGSVTVNGVSLTVCEPTDNSFKVAIIPYTRENTNFADIHIGTIVNLEFDILGKYIARLNSFTK >CP022041|918244:925858|921626_922991_-|ASE18574.1|DBSCAN-SWA MKKTNLLIIGSGPGGYRTASYAAQNGLEVTIIEKAQPGGTCLNAGCIPTKCLAHDAELRLTTSSLYETTPPLDFAKVMERKEGVINQLREGVSTLLSQPGIDFIIGEARFVSDHVVEVNGEQIEAEHIIIATGSRSKMPPFMSEEDFLSQSETAQNIVTSTELLSIAKVPQRLTIIGAGVIGMEFASAFSAFGSEVTVIEFMKECLPPIDSDIAKRLRKTLEKRGVTFYMQSAVKQILSPAESGQEYTTVIFDKKGKEDRIDTDLVLIATGRQANFDNIGIESTGIEVNAKGIVVNDNMETNVKGVYAIGDVNARQMLAHAATFQGFRAVNHILGKDDNIRLDIMPSAIFTYPEAACVGKTEDQCKAEEIKYSTRKGFYRANGKALSMEETEGMIKVLIAEDGSILGAHCYGAHSADLIQEVAALMNYDAKLDKIRDIIHIHPTLSEILQDALL >CP022041|918244:925858|918929_920012_-|ASE18572.1|DBSCAN-SWA MRKQTDIFKSFWNQIIIWLPLFLIPILYAPQGHTSWQNDLSIYIIPLSMMAVAYTNYFVLAPMLLKGRKKEFWVINTLLILFLSIVQHEWLYYISGEQNLMTYPYQIFVENKENKYVYPHIFFIIRNVFNLSICAGVATSILMAQRWSKAEKEKREAETAMTKAELVNLRQQVNPHFLLNTMNNIYALTAFDTEKAQKAIIDLSKMLRHILYDNQQPYVCLKEEVEFLHNYVDLMMIRIPENVEIKRECNIPSNCNIHVAPMIFISLLENAFKHGINPEQKNFIHIKLDANNEQIIFSIQNSNNPKGEAERNGHGIGLKQVERRLELAYPGKYKWEKGFDSNRNTYSSKIIIYDTKLYNH >CP022041|918244:925858|918244_918955_-|ASE18571.1|DBSCAN-SWA MTLNCIIIDDEPLAADLLASYAKKTLFLNLIGVFNSAVEGVKAIRENRVDLIFLDIQMPELSGLEFAKILPKETKIIFTTAFSQYAIDGYKANAIDYLMKPISYDDFLAGANRALDWFQSTRQTENASNDRFIFVKSEYKLVKIMFDDILYIEGLKDYVKIYLANDHKPIMSLMNMKKIEESLPKPEFMRIHRSYIVHMKKIDGIDRFRVVIGDSILPISDSYKTVLQDYLDGHTL >CP022041|918244:925858|923025_924516_-|ASE18575.1|DBSCAN-SWA MNNKLYGNLIFELSHPGRRAYSLPENRFGHHPLPDFCKREKDAELPECDELTVVRHYTNHSENNFGVDNGFYPLGSCTMKYNPVINEEIASMPCFTALHPHQPIETVQGALEVEYNIQRALASITGMAEVTLNPYAGAHGELTGLMLIASYHQQRGDTKRTKVIVPDSAHGTNPASAAVCGLEIVEVKSTAEGLVDVNDLKPLLGDDIAGMMMTNPNTLGLFEKDIPEIAKLIHDCGGLLYYDGANLNPLLGAARPGDMGFDVIHLNLHKTFSTPHGGGGPGAGPVGVGEKLIPFLPKPHVKKTKDGFVIDNPDTTGEFSSDNIRISGYLGNFLVILRAYTYILTLGKKHLKEVGPFATLNANYIKECLKDDYELPIDTLCKHEFVFDGLKDKSTGVTTMDVAKRLLDYGYHAPTIYFPLLFHEAMMIEPTETESKDTIDGFIEVMHTIAKEALENPELVKGAPYNTPIGRVDDVLAAKHPILTYRQLVNDVEENA |
6 | Prochlorococcus_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1279773 : 1354453
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >CP022041|1279773:1354453|DBSCAN-SWA CTTAAGCACACAATTTCATAAGCCTATATAAAAAGAAAGGAATATCTGTGACTCCTCTGAACTGCGCCCTGAATGCCTTAACCTTAGCATTGAATGACTCGGCATTTGCATTCGTAGCTCTGTTTACAAAGAAATTGAGTATGGTTTGATAATGATTTTCAAACGTGTCAATCACTGTGTAGAAATTGTCCACTCCCAACTCTTCAACTTCATTAAACCATTTTGCCAACTTCAGCCTTGCAGCATCCTTGATGCTCTTGATGTTATAAATATCGGTAAGTTTCATAGCCAAATCATATGCCTTTTTCAGTGTCGGATAATGTTCAAAGATGATTTCTGCCCTACCTTTCTGTGTTTCAGTCCACTTGGTCTTGTGCTTGGTCAATATGAACTTTGCTCTGGCAAGCAACTGCTTACGCGTGTCACCATTGCTGTATCTAAATGGTATATATTCCTTACCCTTACATTTAGCTTCTTTTATCTCCTCATTCTCTTTATCTCTTGCCATCCATCGGTAAGCGATACGCATGTCATCCAGAGCATCATAATATAGTTTCTGCACATGAAATCTATCATTGGTAATAAGTGCTTTGGGAAAAACAGCACGGGCTATGCGCATCATAGAAGATGATAAGTCGAGTGTTATCTCCTTGACAGTCTTTCGTTTAGAAAGGTTTATCTTCTTGAGAATCTGGATGACGTCCTCTGCCTTGGTCCCTTTAACCACAGCTACCAAAGTCCCTTCTCTGCCCTTTCCTGCCTTGTTGGTCAGAAAAGTATAAACCTCGCCACAGCTTAGACAGGTCTCATCAATACTTAGACTCTCTCCTATGTTATCTTCAAACAATAGCCAGTCTTGAGCATGATCCAACTGATCCCAGCTACGGTAATCACTGAAATGTTCCTTGTACTGTGTGGATAACAGCTTGCCATTTACGCCATAGTGCGAACCAATGCTTGCGATGCTCTCTGCAGTGGACTCAATTCTATTCTTTTAAAAAAGAAACGAACTCGGGAGATAGTCTACTACCCTCAGCCGTCAAATCATCATATGAATAAGTAAATATCTCTCCGTTGGAACTGTCACGCCACTTGCGACGACGAACATGGCGGTAAACTGCTTTGCCACGAAGGGGGAAGTCCTGAATAACACGCTCGCTGGTAAAACCATAACTGCTTACAGTGCCTAACTTATGATCTGACTTTTCCATAAAGTTACGCTCGTCAAGCCAAAAGTCAATCTGCGAAACACTCTCTTGAATATCGACAACATCAAAGTAGTCAGCAAGTACATCTGGAAAGATGCAACGTAATAATTGGTGATTCTTCATGATGCAAAGGTAATAAATACTTATGAAATTATGTGCTTAAGAGTATAGGCTTGCAACTGGGTTTTAGGACTGACCCCAATTAACCACCTTTTAAAATCCTTATTCACAACTATCTGATAGTAAACAACTTACAAATGCACTAAAATAATTCACTTTCATGCTATTTTTCCACCTTTTCTTCGAATATTTTGTCAATATATTTCCTTGCCTTTTTAAAGTAAATATCACACGAAGTCACGGAGATAAAAGCGTATTAATTTAAAGATACGTAATCACAGAGGCACCAATAGTGTAAGAAGTAAAGGAATATAACAACAATATTTATTGAGAAGAATACGCCATCAACCCTTAGTTAGCAATGTAAACACATTATATAATCTGATGATATATCAATCTTTATAACATTTTTTATTAATTCCTATTACGTAAAAAGACGAATTAGGTTGTTATAAGAAGTGAGTAGCAGGAAAACTGCTGACTAAAATATTAAAATACAATACGCCTCAGTTGAAACAAAACAGAATGAGTGAAATAAATAACAAGACCATTAAAGACTATCTTATTGGCAAAGCAACAGCCAAGGAGATGGAGCAGCTGGCTGAATGGCTGGCGGTCTCGGAGGAAAACCAAAAGGAATTCTTCGAGATGGAGTTGGCTTTCCATTTAGGAAAGAACAACTCGTTGGTTACATCTAAGAAAATTGAAGAAGCAGAGACTAAACTATTTGACCAAATCACTGAATACGAAGAACAGAACAGAAATAAGAATAAACTTTATTTCCTCCGCTATGCTGCAGCAATTATTGTTGCCGTATTACTGATTGGAGGAGGACTCTTTGCATATCTTCACCAGTCTGCAGAAACTATCACTGTTGCAGCGATGAACGAGGTAAAGAAAGTCGTTTTACCTGATAATTCTACCGTTTGGCTCAACAAGGGAGCTACAATTAGTTATGCAGGTAACTTTGAGGGCGACGAGCGTAAGGTAAACTTAAAGGGCGAAGCATTGTTCCACGTAACAAAAAATGCTGAAAAACCATTCATTGTAAACAGTGATGGAGCATCAGCAAAGGTATTGGGAACAACCTTCAACTTTAAAGATCAAGCTGCTGATGGAAAAGAAGTTATCAGCTTGATAGAAGGTAGATTGGAAGTAACAGGACTGAATGGAGAAGGAAAGGTTGTCCTTCATCCCAACCAAAAGGCTACTGTCAGCAAGGATTCCAAGACCATTAAGACAGAAAATAGTTATGCACTTGCCGACGCTGTATGGCGTGACGACATAATCCCATTTAACAATATGCGAATCAATGAGATTGCTAAAATCCTTGAGCAGCTCTACGATTACAAGATTATTGTTGACGCTAAGTTAGACCACACAAAAACTTACACAGGTGTTATCAAGAGAAACAAGGATATAAGAAATGTCTTAGACGGACTTTCTTATACCATCTCTTTCCACTACAACATTCACGACAAAGAGATAACCCTCTCAGAATAAGTATCTGAGAGAGCTATCTTTTTGCCACGAAACGTCTAAATCAGTTTCTTAAATTTTCATTACCTAACATAGACATGAGAAAACTTTACGCGCTTCTTATAGGAGCACTCGCCCCTTTATGTTCGTATGCTCAGACCACTTTCCCTGAGTGGCAGAGTCAGTACACCACTGGTTTGAACAAGATTGAGCCACATGCCTACGTCCTTCCTTATAGCTCGGTTGATAAGTTACAACAACCAGGTGGCTATGAACAGTCAGATTATTATATGTCGCTCAATGGCAAATGGAAATTCCATTGGACTAAGAATCCAGATAATAGACCAAAGGATTTCTTCCAAAACGACTTTGCTGTGCAGGGTTGGAATGACATCAACGTACCAGGTAACTGGGAGCGTCAGGGCTATGGGCTGCCTATCTATGTAAACGAAACATACGAATTTGATGACAAACTCTTCCAATTTAAGAAGAATCCTCCCCTCGTTCCATATGCACAGAACGAAGTTGGCTCTTATCGTCGCACCTTTAAGATTCCTTCAACATGGAAGGGTCGTCGTGTAGTTCTTTGTTGCGAGGGTGTTAAGTCATTCTTCTATCTTTGGATAAACGGCACTTACGTTGGTTATAACATGGGTGCTAAGATGCCTTCAGAGTGGGATATTACAAAGTATCTGAACGATGGTGAGAACACTATCGCTATGGAAGTCTATCGTTGGGCATCAGGTTCTTACCTTGAGTGTCAGGACTATTGGCGCATAAGTGGTATCGAGCAGGACGTTTACCTCTACTCTACTCCAACACAATATATCGCTGATTACAATGTAGCAGCAGGTGTTGACAAGCAGGACAACGGCTCTTTCAATGGTAATTTTAGCCTTACTACAAATATTAAAGGTGCAGGAAGCGGTAGTGTAAGCTATGTTCTAACCGACGAAAAGGGCAATACTATCCTCTCAGAAGATAAGGCTGTCAGCGCAAAGAACAATGATGAAATGGTGAATTTCACCACAAAGACCATCCAGAACGTTGCTGCTTGGAGTGCTGAACATCCTAACCTCTATACCCTTCTTATCACTTTAAAAGACAAGAATGGTAATGTTACCGAACAAACTGGTAGTAAGGTAGGATTCAGAACAATTGAAATTAAGAACAAACAGCTCTGCGTGAACGGAACACCTATCCTCGTAAAGGGTGCCAACAGACATGAGCATTCAGAGTTAGGACGCACCGTGAGCAAGGAACTCATGGAACAGGATATCCGCTTAATGAAGCAGAATAACATCAATCTTGTACGTTGTTCTCACTATCCTAACGACTCTTATTGGTACCAACTTTGCGATAAATACGGCTTGTATGTTATCGATGAGGCAAATATTGAGTCTCACGGAATGGGTTATGGCAAGGAGTCTTTGGCAAAGGATAGCACATGGCTGACTGCTCATATGGACAGAACGCGTCGTATGTATGAGCGTTCAAAGAACCACCCAAGTATCATCATCTGGTCATTGGGTAATGAGGCGGGTAATGGTGTGAACTTTGAACACACCTACCGGTGGTTGAAGAATGCTGACAAGACACGTCTTATCCAGTATGAACGTGCTGAGGAGAATTTCAACACAGACATCTATTGTCCTATGTATCAGTCGCTCGACCACATGAAGGCATATGCAAAACGTACCGACATAACACGTCCTTACATCATGTGTGAGTATCTTCATGCTATGGGTAACAGTTGTGGTGGCATGAAAGACTACTGGGATTTGATCGAGTCAGAGCCTATCTTGCAAGGTGGTAGCATATGGGACTGGGTGGATCAGTCTTTCCGTGAGCTTGACAAGAATGGTAACTGGTATTGGAGTTACGGTGGTGACTACGGTCCAAAGGATGTACCAAGCTTCGATAACTTCTGTACAAACGGACTCATCGCTGCTGACCGCACTCCTCATCCACACCTTTCTGAGGTGAAGAAAATCTATCAGAACATCAAGTCTGAGCTCGTTTCAAACGAGAAGGGCATTACAATAAAGGTAAAGAACTGGTTTGATTTCACCGATCTGAATGCTTACCAGCTCAACTGGCAGATTGTTAATGAGAACGGAAAAGTTATTGCGAAGGGAACACAACACGTAGATTGTGCTCCACATGAGACAACAACAATCACACTCCCTGCTGTTTCTGCAACTACAGATAAGGAGCAGTACCTCAACCTTGAGTGGTATCCTGCTAATGATGCATTGATTCTTACAAAGAATGATGTTGTAGCATACGACCAGTTTGTACTGAAGAAAGCAAGCGATTGCACGACCTTCCTCCCACGTGAGAAGCAACGCATGACGTATAAGGTGAACGAGAACACAGGTGAGTTGACCTCGCTGAAGAGTGGCAACCAGGAGTTCCTCGAAGCGCCACTGAGCCTAAGTTTCTACCGCCCAGCTACCGACAACGATGGTCGTGACCAGTTCGGTGTAAGAGTATGGCGCAAGGATGGTATCGATTCTATCAGTCAGAAGGTTACAAAGATTACCCGTACTAAGGACGTGACACGTGCAGAAGCCGACATTATCGGTAAGAAGGGCAATGTAATCGGTAAGGCTGTCTTCACCTATCAGCCACAGAAGAATGGCGCATTGGCAGTGAAGGTTGATTTTACTCCAGACACAGCAGCTGTCAAGGCACTCGCACGTCTGGGGCTTACCTTCTGCGTAAAGGATACATTCGGCAAGGTTGAATACAATGGTCGTGGTGATATTGAGACTTACAACGACCGTAAAGCCGCTGGTTTTATTGGACACTACAAGACAACTGCAGAGGCTATGTTCCATTATTATGTGAAGCCACAGGCAACGGGTAACCGCACGGATGTACGTTGGGTATCAATCTCTGACACACGTAATCGCCTTATGGTTGCTGCAAAGAGTCCATTCCAGTTCTCTGTAACTCCTTTCTCTGACAGCGTTATCGACCGTGCGACGCATATCAACCAGCTCTCACGTGACGGTCTGCTTACCGTACACTTAGATACCAATCAGAGTGGTGTTGGTACCGCTACTTGTGGTCCTGGTGTTGCTGAGAAGTATCGTGTATCTGTTAAGCCAACAAGCTTCGAGTTTGTACTCTACCCTGCTATGGCTAAATAAATAGCACATAGTAAGAAGCTATAACAAAGCCCTGTTCCCTTTGCTTACAATCAGTAAGGCAGAAAGGAACAGGGCTTTTTGCGCTCTATAGACTTCAACTTGAAACGCAATTAGCGGTCAGCACCATTGGTGCTTACCATCCGCACGACATGTGTTAGGCATCAACACATCAGCTAAAGATAGCAAGAAAGGAATATTAAGGTCGTTATAACAAAGAATGTAAGATGCTAACCTCTAACAAAAAAGAGAGAAGTTAACCAGATAACACTTATAAAGAAATAATTATTTACGTAAAAAAGAAACATCTATTTTCACGTAAATAATTATAAACCGACCAGCTAAAGAATATGTGAACCAGTCAAAGCCCAGTAAGCAACACCAAGAACTACCGCTGCAATGATAATACGGAAGATGCAACCAGTTATCTTCTTAATGAGATAGATACCGATAACCAAGACAGCAAGTAAGCCGATAAAATATTGTAAATTCTCCATTGGGTATTATTTGATTACTTAAATAAATGTCTGAACACTTGAGGCACTGGCGTCTCGAACTCCATACGCTGATGTGTAACAGGATGATGAAAACAGAGTACGTATGCATGTAGACAAAGACGATGAAGTGGATCATCACCATTACCATACTTTATGTCACCGCAAACAGGATGTCCCATATCGGCAGAATGTACACGAATCTGATTCTTACGACCTGTCTCTAACTGATACTCAACAAGGCTGTGCGCTACCGTACGATCAAGTACATGGAAATGTGTAACGGCATATTTTCCACCATTATCTACTGGCGAACTATAGGTTACGTATGACTTATTATCCTTTAGCCAATTGGCAATCGTACCTTCATCATGCTCCATCTCACCACTGACTACAGCCACATAACGACGGTCATAAACGATATTATGCCAGTCATGTTCAAGTAACTGCTCCGTCTGTATGTCCTTAGCATAAATCATCAGTCCTGATGTATCACGGTCAAGTCGATGGACAACATGTGCCGTACACTTCTGCTTTGTCTTACGGAAATAATCGTCAAGAACAGTCTTCACATTCAATGAAGAATGCCCAGCAGCCATTGAAAGAATACCAATATTCTTTTCAATCACTACCAAAAATCTGTCTTCATAGACGATCTTCACATAGCGACTACGGAAGAGATCGTTCTTCTTGCTCTTGCTAACAGATATCTTGTCGCCAGCCTTCAGTGGATGGTCAAACTGTGTAACGCACTTACCATTCACCTTGATACCACGATTCTGTAGCGTTGCCTTAACCTTACTCTTACTCTCATTTAGGTTAGCCAGCAACCATTCAAGCAACGGAGCGGGGGCCGTCACCTTATAATGCTTATATTTTTCTTCTGTATATGGGTTCTTTCTTATCATACTACAAAAGTACAAAGAAATAATGATTTAAACTATTACACGCCTAATAAATCATCCGAATGACCTACGACAGAGATAAAACAGCTAATATATTGCTTTCCTATAATAAGGTTGTCAATACGCTCAATACCGTTTTAGAACATCTTAACCCACTCACGAAAGCGTGCAACACCTTCTTCGCCAAACTTGGAAAGACTTGCTATTGCCTGCTCAATCGTCTTCTCTTCATCAAGATGAGACTTAGAAAAAAACTTATCAGCATAACAAATTACCTTCTCTTCCATGGTTTCAGGGAGGAAGTCTTGCTGAGGAAGTGGCAGCTTTTGTGCAATGATTTGGCTACGAGTGATACCCGCTCCTGTATGACGTTCACAAACGCGAGCATGTCGAGGGAAGCCTTCTGCACGCAACATCTCTGCACCAATACGACCATGACAGATATAAGGCTCAGTGCCGAAACATTGTATACCTGGGGCATTGCAACGAACAATACCTATATCATGCAGCATAGCAGCTTCCTCTATAAACTGACGGTCAAGCGATAACTCTGGATGACGATCAGCAATAGCAAGTGCTTTATCCGCTACAGCACGACTGTGAACAAGAAGGATATGACGTAGTTCGTTATCCTCGGGATAGTATTTATTGATAATTACTTGGTAATCCATATCACGAAAAAAGCCACACCTCCTCTCCCAGTCGGCTTACACCGCCATGAATGAGGAGCCGTGGCCCTATCATTTGTAATAATGATTCTTTATTCTTCGTGTGCTCTCTCGATGTAAGCGATAACGTCGCCCTTGTCTACCTTAGCACCCTGCTTAGCGTTGATTTCCACTAACTTACCACCAAGGGCAGCAGGAACAGTTACAAACTCACCCCAAGGAGCCTGAACGTAGCAGAACACATCACCTTCTTTATACTCCTTACCGATGAATGGTTCGATAGCTGGAGCAGCCTCACCATCACCCTGGAATTCCCAGAAGAGCTGACCCTTAACAGGAGAAACAATAGCATCAGCCTTAGCATGCTTGAATGCTGCTGCCTCCTCTGGGCTTACCTTTGCACCGAGTTTAGCATCCTTTGCAGCCTGAAGGTCAGCAAGGAAGTTCTTCTTAGCCTGTCCGCTCTTGTAGTTACGATACTGCTCTGGGTGCATAGCCAACTCGAAGAGTTCCTCATCATCCTGTCCATAATCCCATCCGTTCTCATCCATTTCCTTACGGAAGTCGTCAAGAGCGTTTGCAAGTAAGGTGTGAGGATCGGCATCAGTGAACTCAAGACCTTTCTGCTTAGCAAGGTCTTTCAACTCCTGACATATCTCACCAGGAACACGACCACTCTTACCAAGAATCATACCCCACATAGAATCGTCCATCATTACGAAGCGACCCTTACCCTGCTCAAGGGTGAGGAGGTTCATAAGAGCAATGTTCTTTGTGTATTGTGAGAATGGAGTTACCAATGGAGGATAACCTACGCGTGGCCATACATATGCCACCTCATCGAACAACTTAACGAGCATATCATCTACAGAAAGCTCTGGTTCGCCCTTCTTCTTGCGGAGGTTATTGATAGTTGAACGAATACCACCAAGATCAGCCATCATAGAACCCATCATACCACCTGGCAAACCACACTCAAGAAGCAATGAACTCATAATCTTGTTCTGTGGGTTGATGAAGTAGCCGAGCCACTCATCAATAAACTCCTGAGTCATAGCACGTGCCTTCATGTAGGCATCCATGTTGATTTCTGGTACATCAAAGCCAGCGTTCTTCAACATGCTCTGTACAGAGATCACGTCTGGGTGAACCTTACCCCATGACAATGGCTCAATAGCAACATCAAGAATATCAATACCATTCTTGGCTACCTCAAGCATAGAAGCCATTGACAAACCTGGACCAGAGTGACCGTGATATTCAAGAATAATGTCTGGATGTTTCTCCTTAATGCGACGAGTCAACTCACCAAGGAATGCAGGCTGACCGATACCAGCCATATCCTTCAAACAAAGTTCCTCTGCACCAGCTGCAATTTCCTCATCAGCCAACTTACAGTAGTACTCAATTGTATGAACAGGAGAGGTAGTGATACAGAGTGTTCCCTGTGGAGTCATACCAGCTTCCTTAGCCCACTTCAATGCAGGCGCAATATTACGAATGTCATTCAGACCGTCGAAAAGACGAGTGATATCCACACCCTGAGCATGCTTTACACGATACATCAATGCACGAACATCATCAGGAACAGGATACATACGCAAAGCGTTCAGACCGCGGTCAAGCATGTGCGTCTTGATACCAGCTTCATGCAGAATCTTGGTGTAGGCACGTACAGACTCGTTTGGGTTCTCACCAGCTAAAAGATTAACCTGTTCAAATGCACCACCATTGGTTTCTACACGTGCAAAACAGCCCATCTCAACGAAGACTGGTGCGATACGAACTAACTCATCCTTACGTGGCTGGAACTTACCAGAGCTCTGCCACATGTCTCGATAAATGAGACTGAACTCAATTTTCTTTCCCATATATATTTTCAATATGTTACTAAATGAAGGGATTGCAGCATGCAATCTGACCTGCAAAATTAGATAAAAATTTCCAAAAAACGACTGACCGCTAATATTTTTTGCAATAATTGACTATCTTTGCAGTCTAAAAGTTAGTACATGAAGAAAATGTTTTTCCCTCTCTTCGGCTTTGCACTGCTTTTCCTTACAGCTTGTTACAACCAAGTATCGACTGGAGATCACAGTGCTATCGATGTTGAGGTTCAAATGAAAGGTGATAGTACACGCTACGGATTGGCGTGCGACGGTTGCTCTGATTCTATAATTGTTCTCCTACCAAATGAGGGAGGTGACCCTGTAAAGTTTGACATTGTAACAGCAAAACGCAATGGCATGGTTTATGGTACACCCGAGATAGGTGACGAATTAGCAATTGTTCCTAATCCTATTGACCCTTACGAGGCGGAAATGGTTATTAATCTTGAACAAATGAAAGGTACATGGACTTTCCAAGTCTTGCCAAAGTTAAAGCCTAACCCTACGAAGACAGAGGAGGAAATATTGGCTGGCATGAGCGATTCTATGAAGGCTGCCTTGTTTACACCGCGTGAGTATGGCTTTACACTGAAGAGTTATAACCAAGCTTCGCCAGTGGGTTATGTCATGAAAGCTAACTCTTTGGAAGATGAAAGTCCTGTGGAATATCCTAAGGTGACTGTCTATACGAGCTGGCACATCTTCAACGGTAGACTTTATATCTATAAAGACACACTTGACGAACAGGGACACCGCATCCCACAAGACTCTGTCGGCTTTGACTCTGGTTCTATGCGCTACCTTTCTGAAGACTCTATGGCTGCTCTTTTTGGTAAGAAAGTGATGCAGTATCACCGCAAGAAGAATGCTTTGGAAGCAAACAAAGAAGCTCAAAAAGCAGAAGAGAAGAACGCTATTATACAGAGTTTTAAGAAATAAATATGTTTCTTCCGTTAAATCTATAGATGATTCTTACATAGTTTTAACGGAAGGACTGAAGTAACTTATTAAACAAAATGATAGGATAAAAACTATCAAACTACATATTTTATTTAACTCTCCGACGCAAGATACCTTTTCTTATTAATACCTCAAAGATACGGATAGAGATTTGAAAAATCATTACATAATTTCAAATAAAACATCTATAAATAAGGACAAAAACACATACAAAAGGAAGACTTGTAACCAAAAGGATATCAATTAGTTAGAAAGCAGCATATCAAAAGATGCTTAATTGGACGTCAAAAGGGCGTTAGTAAGGGGCTTAAAGGGCACCTTTTGCAAGCCAAAAGGGCGTCTTTAAGAAGCCTAAAGGGCATGTATTAGACTTAAGTCGTCTGAAAATAGTTTACATATATTGCTTAGTCAAGGAATAAGTTGTTTGTAGAAAACGGTCAGACACCAAATCTATTCACATTTTACCTTGTATTTATTCCCCTTTGTGAAAGCATCTAATTGGGGATAGTTATCGTAAAAATATGAGCGATGTAATCTAATAAACCGATTTAGGTTTAATCTTTTCAAGCACAAGATTTGGCTGTTATTACAATTTTATATACCTTTGCGCAGATAATTATAGAGGTAATTCAGCCGTCATATACATAGGCTAAAAAGCCTTTCACAACCAAGGCAGCAATGCCGTTATAAAAAGGAGACCACTACAGGTTTTAAGAATATGAAGAAATTAATAGCGTTGTTTTCAATCATAACAATACTCTCCTTCTCTTGCTCAACCATCGTTTCTGCACAGGCACCAGCCAAAGCAGCGACCACAGCTACTGCTGACACACTTTCAGATGATGCACTGAACGAGACAGGTGTAGCCGATACAGCAACCGTTAAAACACCAACCGCACAAGATACGGGCATCCACCAATCCTTAAAGCGTAAGTTCATCGAGGGTAATGCTGGCTTCATGTCGTTGGTAGCGTTAGCTTTGGTGTTAGGCTTAGCATTCTGTATTGAACGCATTATCTATCTAAGTCTTTCAGAAATAGACGCCAAACGATTTGTTGGTAAGTTGGAGGATATGATTGTTGCAGGTGAGATTGAACAGGCAAAAGCACTGAGCCGTGACACACGTGGACCAGTAGCTTCCATCTGTTATCAAGGTCTGTTGCGTATAGACGATTCTATAGAAAATATCGAACGTAGCGTTACCTCATACGGTAGTGTTCAGAGTGCTAACCTTGAGAAGGGCTGTTCATGGATTACACTCTTCATTGCAATGGCTCCATCACTCGGTTTCCTTGGAACCGTTATCGGTATGGTTATGGCATTCGATCAGATTCAAGAGGCTGGCGATATTAGTCCAACGATTGTGGCATCAGGTATGAAGGTGGCTTTGATTACAACCATCTTCGGTATCATCGTAGCCCTCGTACTGCAGATATTCTACAACTATATCCTCTCAAAGATTGACCATATCACAGCACAGATGGAAGAATCAGCCATCACATTGTTGGATGCAATCATGAAGTATAAACTTAAATCAATTCATAATTCATAATTCAAAATTCATAATTATGATTACTTATGAGTCACAATACATTAACAAATACTCGAATGGGAGCTTATTAGTAAGAAAGAATTATGATATTCATATTCAACTATTCTTACCAAGTTTCCTTCTGTAACAAACATAGAAGACATCGGTAATCATAATTATGAATTCTGAATTATGAACTATGAATTAAAGAAATAATCATAATTATGAATTATGAATTTTGAATTCTGAATTAAAGAAGTAATCATAATTATGAATTGTGAATTATGAATTCTGAATTAAAAAGTGGTATGATGAAAATAAAAGGTATAGCAAAGATGGACGAGGAACGCATATCACAGCGTGTCCTCTACGTGATAGTAGCCTTATCAGCTATTGTCTTTTTGGCATTCTATCTCATTGGCTATGACACCCCTTTTACAGGCAACACAGCATTCAATGCACCTATGCTTACCGACGTTTTGCTTGGGTTTATGTGGGGATTGCTTGCTATCACAACTACAGCCTCTATCGTTGCCGTAGTTCGAGGCATACGAAGAGCCAACCGAAGTGAAGGAATGACGAATGGTATTCCTGCAAGAAGGATTACCTATACTACATATGGAATAACAGCACTCATCTTACTCTTGACCTTTGTTTTCGGTTCTACGCAGACAATGATGGTCAATGGAGAGAACTTCACAGATAGCTTCTGGCTTCGTATAACAGATATGTTTGTGAACAGTTCACTACTTTTACTTGTTCTTGCAGCAGGAGTAGTAGCCTTTGGAGCGACTCGTTATTACAGAAAGGGACGCGGAAAATGATGTTTCGTAAGAGAGAACGAAGGAAAGTTCCTATACTAAATACGACGTCAACGGCAGATATTTCGTTCATGCTGTTGATATTCTTCCTTGTAGCCTCTTCTATGGACCTCGACAAAGGACTGTCACGTCAGTTGCCTGCTATCGACAAAACAAAGACACCACCTGCTGCCGTTGATAGCAGAAAAGTAATGCGTATTGTTATCGATGCGAAGAATCAAGTGACACTTGATGGAAAGGCTGTGACGATGAAGGAGCTCTTACAGCGTGCAACACAGCTTATTCAAACTAACGGAAAGGGGCATCTTATCCAGTTACAGAGCAGTCGTAACGCTTCATACGATACTTATATCCACGTGCAAAACCAATTAGTTGCAGCGTATAACACCCTTCGCAACCAACGCGCACTCAATCTTTTTGGAAAAGAGTTTGAACTATGTAGCAACGAACAGCAGAAGCAAATAGCCGACGAATTACCTATGCGAATATCAGAGGTGTATACTGTTAGCAAGGCAGACAACACAGAGAAAGGGGGTACAGAATGAAACTCTATCGCGGAAGAAATCATGAGATCCCTGCATTGAACACAGCTTCTATGCCTGATCTTATCTTTTCTATTCTTTTCTTCTTCATGTTAGTTGTACACATGAGGAAAGCAAACGTACACGTAAAGTACCAAGTGCCAATGGCAACAGAGCTTTCACGCATGTATAATAATTCAACTATACAGCACATCTATATTGGTCGTCCCATCAATAGCCTTGGACAAGTAGAAGGAGAGAAGATGGTTGTACAATTAAATGATCATATCACAACGATTCCTGAAATAAGGAAATATCTGATTCAACTCTCTGCTGCTTTACCGCCAGAACAGCGCAAAGAGCTAAGCGTAAGCATCAAAGCAGACCGCCATGCAGATATGGGTACGATTATGGATCTCAAACAAGTGTTGAGAGAGGCTAACGTGTTGAATGTCAACTTTACAGCCACAATGAGTAGGAATAACAAACTGAAGTAGGTTATAAAGTTGTAATTTGTTGTAAAAAACGACTTTTTAACTTGCTTATTAAGAGTTTTTTTAATACCTTTGTAAGCAAAATCAAATTGTAAATAATGGCACATAGAAGTTTAGATCGTCTTGATAAGAAGATTCTGCACCTCATCTCTGAGGATGCAAGAATTCCATTCCTTGAGGTTGCACGTGCATGTAACGTCAGCGGTGCAGCCATTCATCAACGTATTCAGAAGTTAACAAATCTTGGAGTATTAAAGGGTTCACAGTTTATTATAGACCCAGAACGTATTGGTTATGAGACCTGTGCATTCATTGGTTTGAACCTTAAGAATCCTGAGAAATTTGATGATGTTGTTGAAGCTCTGCGTCAGATTCCAGAGATTGTGGAGTGTCACTACACCACAGGTGAGTATGACCTCTTCTTGAAGATTTACGCATACAACAACCATCACCTTATGTCTGTCATTCACGACAAGCTGATGCCACTTGGTCTATCACGCAGCGAGAGTTCAATCTCTTACAATGCTGTTATCGACCGCACCTTACCTATTGAGGGCATGAAGGTTGCTGATGTCGTTCTGGATGATGATGAAGAGGACGAGGAGTAAAAATATTTCTACCTATAATTAATAGCGTAGAATAAACTAATATCCTTTTATAAAGGATGACCAATACAAAGAAACCCGTACAACATTTGTGCGGGTTTCTTCATTTTCATACTCTCCTACCCTATTCAACCAATTTATCTATCGATTTTATCAGCTCTTATCATGTCACTTTTCACACATTATCGAGTGTAAAAAATATTGACACAGAAATCAGTAAAAAGCCTCTTTTTTATGCTCAAAACAAACAAAAGCAGTACCCTTGTAACTTGCTATTAACCAAATAGTTGCACAATAGCATTTCTAAAGGTACTTAACAAGGGTTCAAAAGGGCGTTAGTTAGACCTCAAAAGGGCATCTTTTAAAAGCCAATTGGGCGTTAATTCAAACACTGCTAAGCATCAATTAAAATTGGCTATTTGATTTTTCTTGACAGAACCAATAAATTTAAAGACTTTAAGAAGAACTTAATAAGGAGGTACCGAACTCACAGAAACTTTATCATTTTATAAATAAAAAGTCCTCACAAGCAGAAATGTGCTTGCGAGGACTTTCCAATATAATTTATTTTAAGCTGTACAACAACTTACAGGCGACTCATTACCACTTGAGTACATAGGCTTTCACCCGTTGTCATACAACTATTATTCCTTGATATTTGGCTCTTCCAAACCGAGGTTGCGCTTCTTATCTACTACCTTCAAAGTAAGACCAATGATAAGAGCTATCACACCCAAACCAGCAAGCATAACCAACGGAGCTGTATAGTTGAGCTGTGTTGGGTCAGTAACACCTACGTTTGTCTTGTCGAGAACCTTACCAATAAGGAGTGGGAACAACCACAAACCGATATTCTGAATCCAGAAGATAAGTGCGTAAGCAGAACCGATAATCTTTGCATCTACGAGCTTTGGAACACTTGGCCACAATGAAGCAGGCACAAGTGAGAAAGAAGCACCGAGAACGAGGATGGTTACATAAGCTACGATAACACCACCCACCTGACTTGACTTAAACTGTGGAAGAACAAAGGCAAAGGTGAGGTGACAAACAATCAGAAGCAATGAACCTAAAACGAGCATAGAGGCAGCCTTACCCTTATGGTCAACATAGCTACCAAGGATTGGTGTAATACCTACTGCGAGCAGTGGGAATACAGCAAAGATAGACTCTGCAGACTGACGCATATAACCCATGTAGCAGTAAGCAACAAGGAAGAATATAGAAAAGAAGAGCATAAGATTCTTCAAAGCCTTATTCTTCATGAAGTTGAACATGAATGCAGTAGCAGCAACAACGAGCATGATAACATACTGTATGATAGTCACACTTGATGAAGCCCAGTATGAACCTGAAGGAAGTTCGTGGAAGGTAAGGTTACACTGAAGCATATTTACAGCGTACTTCTGGAATGGGAAGATAGCTGAGTAATAAAGCACACAGAGCAATGCTACGAGCCAGAAGCCCATACTTGAAAGAATCTTACCGAGGTCGCTAATCTTAAATGGATCTTCTTTTTCCTCTGCCTCACCAGTCTGTGAGTCGAGCTTCTTATCCATGAAGAAGTAAACGATGAACATCATCAATGCAATACAGATAAGTACTACACCGAAAGCTACAGAACGAGAAACATTAATATCGCCACCGAGTTTTGCAAAGAATGGCGAGAAAATCATACAGGTAGCAACGCCAAGACGAGCTAAAGCCATCTCTGAACCCATTGCCAATGCCATCTCACGACCCTTGAACCACTTTACAATACCACGAGAAACAGTGATACCTGCCATCTCAACGCCACAACCAAATATCATGAAACCACAAGCTGCAACCTTTGCAGAGGCTGGCATACCCTGATAGAAAGGAGATACACCCAACTGCTCGAAGACAGGAATGTGGTTGAGGTTATTGGTAAACCATACGTCAAGTGCACTACCAGCAAAGCTGTCGCTGATGGCATAGTACTTGATTAAAGCACCAGCTAACATCACTGCACCAGACAAGATAGCAGTGAAACGAACGCCCATCTTATCGAGGATGATACCTGCGAAGATGAGGAAGAACACGAAGACATTAAGGAATGTCTCTGAACCCTGCATAGTACCGAATGCGGTTGAGTCCCAACCTCTCTGCTCCTGCATCAACTCCTTGATTGGAGAAAGGATATCAACAAAGATGTAGGCACAGAACATTGCCAGGGATAAAAGGAGGAGAGCCGTCCAGCGCATCGCTGCTGAATCTCTCAATGATGTTTGAATTTTATCAGTCATGATGATTTTTTTAGTTAACAAGTTTACGAGAGGACAAGTTGACAAGTTTGACAAGTTGACAAGTAAACAAGTTGTTTGTTAGATTATAAGTTGAAAGTGAACGAGTGAACATGTTGACGAGTGAACGAGTGGATACTATCAAGCAACTTGTCTACTTGTAAACTCGTCAACACGTTAACTTGCAAAAGCTACTTCTTCAGCCAGCGGTCGAGCCAATTAAAGAATGTACGCTGCCAGAGAACACTGTTTTGTGGTTTCAATACCCAGTGGTTCTCATCAGGATAGAGAAGTAACTCAGCTGGGATACCACGCAGACGAGCTGCATTGAAAGCACCCATTCCTTGATTAGCATTAATACGATAGTCTTTCTCGCCGTGGATACAAAGGATTGGAGTATCCCACTTATCAACATTCAAATGAGGACTGTTTGCGTAAGTACGCTTTGCAGCTTCGCTCTTGTCCTTATTCCAATATGCATCGTCATACTCCCAGTTGCTGAACCAAGCCTCCTCTGTATCTGTGTACATACTCTCAAGATTGAAAGCACCATCATGAGAGATGAAACACTTGAAGCGCTTGTTGTGAATACCAGCAAGATAATAAACAGAGAAGCCACCAAATGAAGCACCAACAGCACCTAAGCGGTCTTTGTCTACAAATGATAGATTGTTAGCTGCATCGTCAATCGCTGAAAGGTAGTCGTTCATACACTGACCAGTCCAATCGGTACTAACTTCCTCGTTCCATGCACTACCAAAGCCTGGCAAGCCACGACGGTTTGGTGCAATGATGACATAGCCATTGGCTGCCATAATCTGAAAGTTCCAACGATAGCTCCAGAACTGAGAAACAGGACTTTGTGGTCCGCCTTCACAGAAGAGCAATGTAGGGTACTTCTTATTTGGATCGAAGTGTGGAGGAGTAATTACCCATTCCATCATTTCCTTACCATCAGTTGTCTTTACCCAACGAGACTTTACATCACCCAAAGCTAACTGGTCATAGATGTGCTTATTCTCAAAGCTGAGCTGTGTCTGAACGCTTGCTTTCTCCTTCTTAGCAGGAGTTATAGCGAAGATTTCATTAGCTTGAGAGATGCTCTGACGGATAGCAAGCAACTTCTTATCACCAAGGAGTGAGATACTTACATAGTTATGATCGCCTTCTGTCAGCTGCTTTACTTCGCCCTTAAGATTCGTCTGATAGACGTTCACCGTTGCATGCCATACACCAATGAAATAAAGGTCCTTTGAATTAAGGCTCCATGTGTAGTCGTCTACATTTGAATCAAAGCTTTCAGCAACGTATGTCTTCTTGCCTGTTGCCAATTCATAGACACAGAGGCGGTTGCGATCACTCTCATAACCATCGTTCTTCATACTTTGCCATGCAATGTACTTGCCGTCAGGTGAGAACTTAGGGTTAACATCGTAACCTACATTCATATCGCCAGCCTGATGATTTACAGCCTGATTACGCATACTCTTTGTAGCATCAATCTTTGGCTCAACATAGTCGGCTGGCTTGCAGAGATTCTTTGTTTGGCGAGTCTCCACATTATATATATATATGTCGGCATCGGTAGAGATGGCATATTGTGTACCTTCCTTCTTACGGCAGGTATAAGCAATGAACTTAGAATCCTTGCTCCAATCAATCTGCTCAATACCACCAAATGGTGCCAACGGACTCTCGTAAGGTTCGCCTTCGAGTACATCAATACCTGCACCGATACCCTCAGGAGTAACATCTGCAACAAATGGATGTGCATTGGTTGTAACGTAATGGTCCCAGTGACGATAGTTCATATCGGTGATGACCATACCCGTAGCCTTTGGCAAGTCGCTTGGGTTCTGCTTAATAGTGCCATAGTAAGGGATGCTCTTGATGAGAACAACACGCTTACGGTCTGGAGAGAATCGGAAACCTTCGATGTCGATATCCGAGTGTGTCAACTGCTTACGGTTAGTACCGTCTGCATTCATTGTCCACAACTGTCCACCTGTAAGGAATGCCAAGGTGTTGTTGTCGACCCATGCTGCATCACCCTCACTCTTTGCAGAGGTAGTCAGCAGACGGTCGTTCTTACCATCGGCATCCATCACGCGTAGTACCTGGTGTCCTTTGTTTTCTTTGACACTGTAATAACCCACCTGATAGACAATCTTCTTGCCATCAGGTGAGGCTTGTGCAGCACCGATACGTCCCATAGCCCAAAGTGTCTCAGGAGTCATGAGGTCACTTGTCAGCGTAATGTTGCTACGTCCGATGTTCACATCCTGTGCTTGTGCAACGCTACCACTCATCATCAGAGCCGCAGTCATTGCCATGGTTAAGTTTTTGTTCATAGTTATAGACCTTTCTTTTTCTTATTTAGCTCATCTTGCTTGCGTCGAAGTTCTTCTTGCTGCTGCTTCTGTAACTCCTGTAAAGCCTGCATACGAGCGAAGAGTCCACTACCCATATTGTTCTTAGGGTTGTTCTTTCGCTCTTCACGACGTGCTTCAAGGATAGCCAGAAGTTTCTCCTCATTGGTTGTCTTGCGCAATGCCCACATGATAGCAGCTGAGAAGAAGAGAGATATAAAGTAATAGAAGTTCAAACCTGATGAATAGTCATTAAACATGAAGAAGAAGAACAATGGCATACCAAACATCATCCACTGCATCATCTTCATCTGATCAGCCTGCTGTCCAACCATCTGATCCTTCTGCTGCTGCATGGTAAACCATGTGTAAAGCAAGTTAGCACCACAGAAGAGAATACAAGTCAATGAGATATGATCACCAACGAGCCAAACGTCATGGCTCCATGAGAAGATTGGATCGAAAGTACTGAGGTCGTGCATCCAAAGGAAGCTCTGACCACGAAGCTGAATAGCATTCGGTACGAAGTTGAACATCGCAATCCAGATTGGCATCTGTATCAACATAGGAAGACAGCCTGACAAAGGACTCACTCCATACTCAGAATACTTCTGCATCATAGCTTGCTGTTTCTGCATCTGGTCTTCTGGCTTATTGAATTGCTTCGTTGCTTCATCCAACTTTGGCTTCAACACACGCATCTTAGCAGAGCTCATATAGCTCTTCTTCACCATTGGGAAGGTAATAAGCTTTAATAATAAGGTAATGAGAATCAATACAACTCCCATTGGGAACACCTTACTCAACCAATCGAATACATAAAGAGTAAACCAACGGTTGATAATGCGGAACAATGGCCAACCAAGATATACGAGGCGTTCCATGTCAAGTTCCTTTGCAAACTTACTCTCTGTTTCAACAGACTGAAGCAGACGGAAATCGTTAGGACCAAAGTAGAATTCGAACTCTGAAGGGCGCTTACCTGTTGGATCAAAGCCAGCCTTCATGTTAGCCTCATAGTGCTTGAGGTAATGAGAAGACTTCTCAAGTGGTGTACTCTTAAGCTTTGCACCTGTTGCAAAGTTATCCTTTGCAATCATCACAGCAGAGAAGAACTGGTTTTTGAAAGCTACCCAATCCATTGAGTCCTCTGTTGTCTCTTCCTTTTCAGAAGTCTCACTCAAGTAATCAGTACCCCCATCATGCTTCTTATAGGTAAGTGTTGCGTAGCGGTTCTCAAATGTGAAACCACGTTCCTGCTGCTTACAACGCTCCTGCCAGTTAATATCTATCTGGTTATAATTTGGAGCAAAGAGTCCACCCATACCCTCTGCCTGCAAAGACATATTGAGCAAATAGTCCTTACCAAGTGTATAGTTCAAGGTAAGTGTCTTACCCTCTCCTGCCACAGCAGTCAGCGTAACAGTAGAATCTGTCACATTTGAAGGAGTGAAGATAAGGTCTTTTGTCTCAATATTACTATTCTTTGCAGCCAACATAAAGTTAAGGCTCTGGTCGTCACCACTAAACAAAGTAACATTCTTCTGGTCCTGTGAACCATCCTTAACAGCTATATTATGACCGATATAGTTTTTGATAACAGCCTTCTTTACAACACCACCCTTGGTACTCAAAGTCAGCTCAACCTTACTATTTTTCAATATAATATCCTGCGCCTTGCCATTCAAAGCAGCATAAAATAGAGCTGTAGTATCAGTAGCCTGCGCAGCCTTCTGCTGTGCCTGCTTCTCAGCAGCAAGTTTCGCGGTCTGTGCTTTCTTAGCAGAAGCGATAGAATCCTTTACAAATTCAGCTCTCTGCTGTGCTACCTGTTCAGCTGATGGCTGCTGCCACCAAGCAAAACCAAAGAGAATCAGGGCGATAAGTACAAACCCTGTGATAGTTCTTTTATCCATTTATTTGTTTTTCTGATTTTCAATTGAAGTCTTTACGAAGTCGAGGAAGAGCGGATGTGGACCGAGTACAGTTGACTGATACTCTGGGTGGAACTGTGTTCCAATATACCACTTCAAGCCTGGTATCTCAACGATTTCAACCAAGTCGCTCTCTGGGTTGCGACCAACACACATCATACCATGCTTCTCAAACTCCTTTTCATAGTCGTTATTGAACTCATAACGATGACGATGGCGTTCCTGTATATGCTCTTGCTTATAGATATTAAAGGTATGTGAACCCTGACGCAGTACGCACTCATAGGCTCCAAGGCGCATCGTACCACCCATATTGGTGATATTCTTCTGCTCCTCCATAATATCAATCACATTGTGTGTAGTCTTCTCATCAATCTCTCGTGAGTTAGCATCCTTATACCCAAGGACATTACGAGCAAACTCAATCACCATCATCTGCATACCAAGACAGATACCGAAGGTTGGGATATCGTGTGTACGTGTATAGTGAGCAGCAACAATCTTACCCTCAATACCACGCTGACCAAAGCCCGGACAAATCACAATACCATCCTGTCCCTTAAGTTTCTCAGCTACATTCTCCTCTGTCAGTTCCTCAGAATTGATAAAGGTAATAACCGTTTTACGGTCATTATAGGTTCCAGCCTGCAACAAGCCTTCACGAATACTCTTATAAGCATCCTGCAAGTCATATTTACCAACAAGTCCAATGTGTACTTCCTTAGTTGCTTTACGCTGACGATCAAGGAACTCCTTCCAAGGACCGAGTGCAGGCTTAGGACCAACTTCCTCACCGCACTTACGCAGAATAGCAGCATCCAAGCCCTGCTCAAGCATATTCACAGGAACATCATAGATACTTGGGAGGTCCTCACTCTGTACAACGCAGTCGAAATCAACATTACAGAAAGCAGCAACCTTCTTACGGATATCATCATCAAGATGCTTTTCTGTACGCATCACGAGCACATCTGGCTGAATACCTACACTCTGCAATTCCTTTACGCTGTGCTGAGTTGGCTTTGTCTTCAACTCACCTGCAGCCTTAAGATAAGGAACATATGTCAAGTGAAGGTTGATGGCACGCTTACCCAATTCCCACTTCAACTGGCGAATAGCCTCAAGGAATGGAGCTGACTCAATGTCGCCAATAGTACCACCAATCTCAGTAATCACAAAGTCATAGTGATACTTCTGCCCCAAAAGCTTAATATTACGTTTGATTTCGTCCGTAATATGAGGAACAACCTGAATAGTCTTACCCAAGTAGTCGCCACGACGCTCCTTGTCAATAACGCTCTTATAGATACGTCCCGTAGTCATAGAGTTGGCTTTTGTGGTCTTAATACCTGTGAAGCGCTCATAGTGACCGAGGTCGAGGTCGGTCTCCATACCATCCTCTGTTACGTAGCACTCACCATGCTCGTAAGGATTCAGCGTACCTGGGTCAATGTTGATGTACGGATCAAATTTCTGAATGGTAATGTTGTAACCTCTTGCTTGAAGAAGCTTACCGATTGATGACGAGATAATTCCTTTACCAAGTGAAGAAACTACGCCGCCCGTAACGAAAATGTACTTTGTTTCAGCCACGATGTATGTATGTTTAATTATTATCTTCTAAAACGAAAATCGTTGGTTCTAAACTAAAATGCAAAGATACTAATTTTCCATGAATTAGGCGGGATATAAAGATAAAAAAAACATAGTATATTTCATTTAATTGCCACAAGACCTATGAACAGTGCTATCATCTATGATTACAATGAGATTTTATCTACCTTATGGCAGATTTTATTATTACAAAACTAAGAGACATTTTAATAAAATATTTTGCAATAAAACACCCCTTTCACAAGACTAAATCTTATATTTCAAACAGTTACTATTACATAAGAAAACTACAATTTAGCAATAACGAACTAAGAAGAATATCACAAGAGAAGACTACTCTAAAACTTTAAAGAAACTTAAAATCCTCTTTACTGCCGCATTATTGTAAAACTAAAGTTCTATTTTTGCAAACCAATTTGAGGCTAACATATAGACGATATACGTTAGGATTCACAAGGAAAGATGCTCGAGTGGTTGAAGAGGCACGCCTGGAAAGCGTGTATACCCCTAAAGGGTATCGGGGGTTCGAATCCCCCTCTTTCCGCAAGAAAGGTGTAAACCAATGGATTACGTGGCTTGCACCTTTTTCTTTTCTGACAACTTAATATAGAAAGGGAAACATTCTAATAACCACAACTACACATCCTTACACAAAACACGTATTTTCCAATTCATTGCTTGTTATACGGCTTTATTTTCCTAACTTTGCAGAAAGTAAGATTAGGAAACGGATATTGTATCTACGTAATAACATCCAAATTTAACGAAAGAAATGATAGAACTCGATACTACCAATATGTGTTCACACCTGCAGAAGAAGCTCTTTAACGAAGAAGGTGTCTACTATCCTATATGGCAGGCGATGCAGAATGATGATGAGATAACAGCTGTGATTCGTTCACGACAGTTGCATATCTACCGCAATGGTAAAAAAGTTTTGGTGCTTCCCGGCAAGGCTGCACCGAAGATTATTCGCGAAGATAGCCTCAACGAACTCTTACCTAAGGACTTACTGAAATGAAAAAAGCAATCGTCGTAGGCGCCAGCAGCGGTATCGGACATGAAGTAGCACGACTGCTCATCGCAGAGGGATGGGCGGTTGGTGTGGCTGCTCGTCGTATAGACAAGTTGACAGACTTGCAAGCTATGGCACCAGAGCGGGTCTACACAGTCCAAATTGATGTAACTAACGAAGATGCAGAGACTTCCTTACTGCAACTTATAGAACGTATGAACGGCATCGACCTCTATTTTCATGCCGCAGGAATCGGTTGGCAAAACCCAAGTCTTAATGCTGACATAGAACTCAAAACAATGGAAACCAACGCTGTTGGATTCACCCGAATGATTGGCTGTGCCTATCGCTATTTTGCCAATAAGGGAGGTGGACACATCGCTTGTATCACTTCTATCGCAGGAACAAAAGGGCTCGGACCGGCTCCTGCTTATAGTGCAACAAAGGCGATGCAGAACACCTATCTACAAGCGTTGGAACAACTCGCAGCTTGTAAACATCATAACATCCACTTCACAGATATTCGTCCCGGCTTTGTCGACACTCCCCTACTCGCTGGTACCTCTCACCTCCCGATGCTGATGACCACAAAAAAGGTGGCACGCAGTATTATAAAGGCTATTAACAGCCGACGACACATCTGCGTCATCGATAGCCGTTGGTGCGTACTCACCTACTTATGGCGACATATCCCTAACTGGATATGGAGGCGAATGAAGCTATGTTAACCTCTTTTTCATACAAAGAAAACAAACAAGATTTCCACAACAACATAAACAAACACTAATTTCTTTCCTCATAAAATAAACTTTCAGGGTTTTCAATCATCTAAGCATATAAGTCTTAAAACTGTCCATAACGAAAGGGAAGGAAAAGGATAAGTAAAAAAGGAAACAACAGATAAACAAGTAACAAAACAATAATTATTTAGGATATGAGAAAATTCTGTTTAAAGATTCACCGTTGGTTTGCTTTACCATTAGGCGTAATAATGGCTATTTTGTGCTTCAGCGGTCTGGCAATACTGCTAATAAAAGACCTTGCACCCCTCTTTGACATGAATGCCAAAGAGATGCCAATCTATACAATAGTAGTACGACTGCACCGTTGGCTCTTTATGAAACCTGAGAATGCACATGAGGGTGGACAGTCACTTGGACGCATCCTTACAGCCGTGTCAGCAATCTGTATGTCAATCGTATTGTTATCGGGTGTTGTCATCTGGTGGCCAAAGACCAAGAAGGCGTTGAAAAGTCGTTTGACCGTCAGCACTAACAAGGGTTTCCGTCGCTTTGTTTACGACTCTCACGTGTCATTAGGTATCTATGTCTTTATCTTCCTTTTCCTCATGGCACTCACTGGTCCTGTCTTCTCTTTCGGCTGGTACAGAGCGGGAATGTCTAAACTCTTCGGTCAGCCTATGCCACCAAAGGAAATGAAGATGCAACAGCCTAAAGATGGAATGAAGCAGGGCGGAACAAACGACAAAGCTTTTGCACCTACGGACGCAAGTCAGATGAAAGGACAACCACAAGCACACAAGGAGGGAGCAAAAGATATGAAAGGCGATCAGCATGGTAAGAAGCCAAAGGGTGGAAAGCTCTTTAAGCAGTTACACACAGGAACATGGGGCGGATGGTTCTCACGCGTTCTCTATGCTATCGCAGCCCTTATCGGTGGTTTCCTCCCTATTAGTGGTTACTATCTGTGGTGGAAGAGAAGAAGTGCTAAGAAGAAAAAAGCATAAAACATAATCATCATATATCACAAGTCCATTGATTGTATTCGGTTGATTAACCTACAGAATACCATCAATGGGCTTTTACATTTATCCTTCAACGAAGCACTTACCTAAATGTACTGACAAACTCCCAACTACATTCATCCACTCACAGAGGATTAAACAACCAATATTATGTAAAGATTATTCAAACTACTAAAAAACAACTAACGCCCAATTAGACTCGAATTAAGCCTTAATTGGCTTTCAAAAGATGCCCTTTTGAGGTCTAACTAACGCCCTTTAAGAGTCTAAAAGATGCCCTTTTCAAATCGTATTTGTAAGAGTCTGATTATCTGACCGTTACAAAGCCCCATTTTTAGAGCGGTTTTCTATAAACAAAAGGCTTACAGAAAGCGTATTCTGTAATTATTTTTCACAGTCCACAAACCCAATAAACCATAACTAATAATCAAAATTATGTAATCTTTATTAGGGATAAGAAAACCTAATCACACAAAAAATCAACAATAAATTACTTATAAATGAGTATTGCCACCTTAACAAGAAGCCAGTACCTTTGCACTCGAAAAATTTTACAGACCATAATAACAATAATGATGTTATATAGAATATTAAATAAGGTGATAAGTCTGGTGGAAGACTTGACTTGCCGAAAAGTAAATCAGAAATACGTGTTAAAAGAACATATTTGTATTTATTTACATGCGTGTATAAGATGAGAAATAAATAACAAATGATTAGTTTATTTTAACCAAATCGACTTGCTTTTGTTCTTATCAAATAATATATTTGCGACAGATAATCTAATTTGGCGCGAATAAGATAAATACCATCATTTAAAGCTTTTACTCATTTTTAAATACCAGACAAAGAGATTTTTCTCTTTTAAAAAAGTCTAATCAATATTATATTTTATCTAATAAAACCCTTGAAGAGTATGGAATTAAAAAAGACACTCTACTCCCTAATGTTAGGTGGCATTCTTCTTTGCTTGTTCTCATGTGCAAAGGAAGACGAATTTGAAATGCCAACACTGGTGCTTTCTGAAAACAGTATTGCCTTTGATAAAGGCGTAAGTGAAAGAAACATTAGTGTAACAACAAATCAAAACAGCTGGATTGCATCGTCTCCACAGGAAGGCGACTGGCTCTCGCTGGTTCAGGACGGTAACGTACTGAAGGTGAAGGTAACTGAAAACAAGATTGGAACAGAGCGCACAAGCTACGTACTCGTTAATGCCAACGGCGCATCTGGTAAGATTGCTGTGACTCAGAGCGCAGCTGACGTAACGCTTGACGTGGTTCCAAATGCTATCTATCTGCCACAGACAGGTGGTGAGAAGACGATTGACATTACGACGAATTCGTCTGTCTATGACGTAACAACCAGCGAAGAAGTTAGTTGGTTAAAGATTGTCAAGTCGGAGGAAGAAATCAAACTGATAGCTGAGCGTAACGATACCTATCAGAAGCGTGAGGTGAAACTTTACGCCAAGAGCGGTAGTGTTATCCGCGAGATTGTTGTTTCTCAGTCGGGTATTCAACGCTACCTCCTCCCTATTCATCCAGGTGTACCACAGGACGAGCATAAGATTATGGACTTTGAGTTGGGTCGTGGTAGCTACTTGCGCGAGTATCAAGCAGCTATGCCAGCTTACGGCCTTGAGGAGACTTACACCTTTATCACTGCTTCGCCTATCTTTACTTTGATTCAGTATTGTAGCACTGATGGTATTAATCCGTCACAGATTATCATGATTGGCGATGGTAGAAAGGCTATTGATGCCGTGAAGGACAAGGCTTTCGATAAGTTCTTGACTGACAATGGCTATGTGCGCAGCAACTCTCAGTCTGACAGAGAATATACCAATGACAAGGATTTACTCTCTTTGAAGGTTTATATATCAGAAAAGGAGAATAACGAAGGAGTTAACCTCACTTTCACACCTATTATGAAACAGAATGGCGAATACAAGACTTTCAGCAAGTTACCATTCTATCCTCTTGAGCTTCTGCAGAAAGACAACGTGAAGTTGGCACAGATTGAACAATACGAGCAGAAGGCTGGTAGTACAGAAGAAGAGCGTAGCCTCAACGAACATAAGAATACTGAGGTTTCACAGATTCAATATAAACTGAAGGCAAGTACAGACCCTTCTGCAGCCTACGGACGTATTCACATCTTCTACACAACGGATAAGGATGGTGATGCACCAGACAATTTGGGTAGTGTTCAGATTGGTGCATTACTCTTCAAAGACACCAACCTCGGTGTATGGAAGTATGGTACTAAATGGGTTGCAACCAAAGAAATCAAGAAGGTATTGGGTGATGAGGGCTTCTCTTTCCTCCGCACATCAGGCAACAACCACTTCTTTGTACGTGAAAGCGACCACTTGGTTATTGATGTTACCTGCGTGTTAGACAACGCTATGCCAGTACTCGCATTGCTTTACAGCTACGATCCATCCGTATCGGGTGCAAGTAGTAAAGCCATAAAGACACAGGCTAAGATGATTAGAAACTTCGCTGCTGCCAAGAAGGCTTTGAAGTTCTAAGCAAATTGTTATACCTTATTTAAATAATTACATGATATGCGTATAATCAATATACATAAGCTGCTGCAGTTCTCTCTGCTATACCTATTATTAGGTATAGCGGTTGGATGCGCTGAGAATGACAGCTTTGACCAGCCCTATCTGAATGTTTCAGAGAAGGAGATTTCTTTCTCTAATCAGATTGGTGAGAAGACAATTACCGTAAACACGAACTGTAAGGAGTGGATGGCTACTACACCGAAGGCGTGGGTACACCTTTCACAAAGTGGAAACGAGATTGCTGTACACGTTGATCCTAATACAACAGGAATGGAGAGAAGTAGCTATATCCTTGTTGATGGTGGATTGGCAGTGCAGAAAATCATGGTAAGTCAGAGTGCTGCTGACATCACATTAAACTTAAACAATGGTGAGGTAATTCTGCCACAAGCAGGTGGTACGACAACTGTCGACTTGAAGATGGATGCTACTTCCTATGACCTTACACAGAGTGAGCAGCCAGAGTGGATGCAGGTTATCAAGAAAAAGCATGGTTTGAAGTTTATTTCTAAGCCTAATTATAGCACTACGGAAAGAACCACAAAACTGACAATAGCCTTTGCAGGAAAGAATCATGAGGTGGTTGTTAAGCAGCCGGGCGTAGCTACTTTTATCTTGGCTTGTAACCCAGGTAATCCTTATAGCTTGCATAAGATGATGGACTATGAGTATCGCCGTGGTAGTTTCTTGACGGAATACGGCGGCCCTGACGAGGTAAACGGCATCTTTGAAGAGAGTTACTTCTTCAAGACTCCATCGCCTTTGTTCAAAGATGTGGTTTATGTACACGACACAAAGCACTCTGTTCCAACGCGTATCTACACACGTTCACTGACAAGAGAGGGTGTGAACGCTGTTAAGTCGCAGGCTTTCCAAGAGTTTATGAGAGCCAATGGATATACAAGGGACGAGAAAGATACGAATCATTACGTCAATATAAAGGAAGCTTTCACGATGGATGTAGACATTAGAGAGGAGAATAATAGCGTTGTGTTATTCTTCTATCAGATGCACACACAAGACCGTAGCTACCCTACCTTTAGTAGTCTTGACCTTGGTCCTGTTGACCTTTTGAACAAAACTGACAAGAAGATAAGTGATGTCGAAGCTTATGAGACAGGTAAGAACAGTGAGGAAATGAAGCGACAAATGTCTAAGAGTAATGAGGTTGAAGCTATTCTCTACAAGACAAATGATCCTACATTGATTGCTCGTACTTACTTCTTCTATCTTCATAATGATGCCGCTGTACCACAAGAGAAGGCTGGAAGCGTTGAGCAATACAGTTTATTCTACAGCCAACCAAACTTAGGAATATGGCAGTATGGTAATGAATGGTTCGTAACCCATGAGTTCGACAAACTGCTTACTGCAAACAACTTTGAGTTCGTTGGCTACAATGGTAAGCATCATGTCTATGCGCGTCGTTCTGACTATCTGACCTTAGCAATCTCAGGTGGAGAATATGCTGACGTCAACAACGGTAAGGCAGTCATGCAAATTACTGTTCTATACAAGCCAACTGTCTTTGCAGGAAGTAAAGAACAGCGGTTGGCTAAGGTAGAACGTATGCTCAAACAATACAATCCAAAGAAATAAAATAAAGGAACATTGATGAGAAAATCGATTATATATCTATGTGCGTGTGCGCTCTCTGGCATGATGGTTACTACCTCATGCCAGGATAACTTGGACTCGGATGCGGGCGTTTCCAACACACGCTCTGTCAATATTGACAAAGACCTTTTTGCCATAAAAGGCTGTATTAACGTGAAGTTGGCAAAGGGTACAAACCAAGCTATACCTACAACGCGTAGTGGTAGTGTGGAAATGCAGAGCGTCCCATCGGCTATGACTTCTGCAATGCAGTATTCTGGGGCTTATAAAATGGAAAGAGTCTTTAAGCCAGCAGGTATCTATGAAGCACGAACCGTAGCTGAAGGACTCGACCGTTGGTACACAATCTATTTTGATAAGAGCAAAGATGTGGCTGCTGTCTTAGACCAATTCAAGAAGGCTGAGGGTGTTGAATGTGCAGAACAAGTACTACCAATGGCAAGACCAACGGTTAAGATGACTCCTTACAGCCCTTCTGGTGCAAGTATGCAGGCTACAGCAAGTACTTTTGATGACCCTCTCTTGGCTAAACAGTGGCATTACTACAATGACGGTTCAGTAAATGCACGTGCGAAGAAGGGTGCTGACTGTAATGTTAAACCTGTTTGGGAAAAGTACACAACAGGTAAGAAAAACGTTATTGTTGCCGTTGTTGATGGTGGTATTGATATCACTCACGAGGACTTGAAAGACAATCTTTATGTCAACGAAAAGGAGAAGAACGGTCAACCGAACGTCGATGATGACGGCAATGGCTTTGTTGATGACATCTATGGTTACAACTTTGTGACTGCTAAAGACGTCGTTGGCGGTACTATCGAACCCGATGATGGCGGACACGGAACCCACGTTGCGGGTACTGTCGCTGCACGCAATAATAATGGAAAGGGTGTGGCTGGTATCGCAGGTGGCGATGGTTCGCCAGATAGTGGTGTACGCTTGTTGAGCTGTCAGATATTTAGAAACAAAGACGAGCAGGGCGATGCGGCTGCTGCCATTAAGTATGCAGCTGACAACGGGGCTGTTATCTGTCAGAACTCATGGGGATATTCCTCTACTGCAGGTGTCACCTCTATGCCTCAGTTGCTGAAAGAAGCAGTGGACTACTTCATCAAGATGGCTGGTTGTGATGCTAACGGCAACCAACGTCCAGACTCTCCAATGAAGGGTGGTGTGGTAATGTTCGCTGCTGGTAATGAGAACAAAGAGTTCTCTGCTTACCCTGCTTGCTATGCTCCGACGGTTTCTGTCGCTGCAATGGCATGGGACTTCAGTAAGGCAAGTTATAGTAATTACGCTAAATGGGTGACCATTACGGCTCCTGGTGGTGATCAAGACCGCTTCGGAACAGAGGCTGGCGTATTGAGTACTGTACCAAAGAAGAAGGTTGCATCTGGTTATGCTTACTTCCAAGGAACATCAATGGCATGCCCACACGTGTCAGGTATCGCAGCACTCATCGCCTCTTACTTCGGTCGTCAGGGCTTTACAAATGAAGAATTGAAGTCACGTTTGATTACAGCTTATCGCCCTTATAACATCGATGAGCAAAACCCAACCTACAAAGGTAAGTTAGGTAAGGGTTATATTGATGCCGAAGCAGCCTTCGAAACTGATACAAAGATTGCCCCAGAGAAGGTGGGTACACTCACGCTCAAGCCTGATTTCGTGGACATCAATGCGGAGTGGAGCATCGCCAAGGACGAGGATAAGACTGCAGCCTTCTATCGACTCTACATTGCTCAGGGTGAATTGACAGCCGATAAACTCAAGGATATGACCTACAGAGAAATCAATGGCATGGGTCATAGCTTGGGTGAGACACTTAAGTATGACTTTGATGACTTGAAAGACAACACCACCTATAGTGTTGCAGTGGTAGCAGTAGACCGTTGGGGCAACCTTTCTGAGCCAATGATTCAGAAATGTACGACCCGACTCAACCATGCTCCAGAGGCGACAAACTTCCCAACAGAGGCAATAGAGGTCATGGAGAACGATCGTAAATCATTCTCTTTCAACGTTGCTGACCCTGATGGTCACAACTGGGACATCAAGGCTACGGGTGAGACAAAGGGCGTTTCTTACACTGTGAAGGGCAATACTGTGACCGTCAACCTCGTTCCTGTGCTCGAAGCTGGTAGTTACAACTGCACCTTTATACTCTCAGATGACTTAGGAGCAAAGGCAGAGAAGAGTTTTACATTTAAGATTGTAAAGTACATCCCACCACAGCTAACAAAGCCTTTCGAGAACTATATCATTGGATTGGACGAAGGAGTTGTAACTATTCCATTGACAGGTCATTACACACATAGTGGCAATACTCAGCTTACTTACAAAGCCACAGCTGCCAATGGTAGTATTGCTACAGCTACTATCAGCAATGACAACCTTCAGTTGAAGGGAATGGCAAAGGGTGTTACACGTATCAGCATCGCTGCTACAGACGGTCGTGAGACATCTTCTGACGGTTCTTTCCAAGTACGAGTTGTTGAAAAGAAGTCGGCTCCTGTCTATGCCGTCTACCCTATTCCAGTACAGAGAGACATCCATACCCTACTCAATCCAGAGGTGAAACAGGCTGAACTCGTCATCAGTTCTACTGTGGGTGAGCGTCTGATGAAGGCTACTGTGACTCCAGATAAGAACAACGTAGCCACACTTGACCTTTCTAAACTGAATCCTGGAACGTATAAGCTGACCGTATATACCAGTAAGGGCAACCATACCCAGATGTTCATTAAACGATAAACACAACAAGTATGAATACAAAACATATCATTCTCATGGCTTGCGCAGCACTTCTCGGTGCTACGCAGGCTAATGCTCAGGGACAAGATCTCTCGATATTAACAGCAAATACCGATGCCCGAACAGCCGCTATGGGTAATGCTTCGGCTGCTGCTGAGGGCATGTATCTATACAATAATCCTGCTGCTTTCTTCGCAACTGATAAGAAGTTCACCGCAGATGCTTCTGCCTCTCTCTTTGAAAAGGCAGAGGGTGCCGATGGAACCTTCGGTATTTACGCGCTTTCTGCTGGCTATAAATTGGCTAAGCGTCATGCTGTCTTTGTTGGTTTCCGCTATGCTGGAGGACTAAGTCTAAAGGGTTCCGACCTCTTGGGCAATCCAACCAAGGACTATAAGCCTTATAACTGGACCCTCGACTTGGGTTATACCTATTTCTTAGGTAAGGGATTCGCTACCTATGCAACAGGAAGTCTTATCTATAGTCACCTCTCTAAGAATGCTACAGGTGCTGTTTTCAGCGTGGGCGGAGCCTATCAAAACAACGAGTTGACCCTTGCCAACAAACCTGCCAACCTCATGCTCGATGCGAAGGTGGGTGCCATTGGTCCTCAACTCGACTATGGTAACAAGCACAAGACGACGCTCCCAACCTATCTTGCCGTAGGTGGTGCGCTGTCTGTAGAGGTAGCCGAGAAGCATCAAGTTGCTGCAGCCTTGTCTTCTCGTTACTTCTTCCAACCTTCAGAAGCCAAACTCTTTATGCTTGGTGGCGGACTTGAATACACCTATAATAAGATGGTATCAGTGCGTGCAGGCTATGAGTATGGTGACCACGACCTCAGCCATGTCACTATGGGTGCAGGCTTCAAGTACCATGGCTTACGTCTGAACGGAGCTTACAACCTGAAGACAGCAGACACAGGAAGTAGCTACTGTACCATAGGTATTGGCTATGATTTCTAAAACAAGGACAAACAAAACAATTCATTTAATCAAAAGTAACAGTTATGAAAAGAACTTTATTCTCAATTTGTGCATTAGTACTGAGCCTTACAGCTTCCGCTCAGATAATAAAAGACACACCTAAAGGTAAGCTGATTGAAAACCTGTATCGCTCAAGCAAGTCTTGGGTTAAGAAGGGTTGGACAGGTGTTCAGCCAGGTAGATATGAAGGCTTGGTATCTAAGATTGTTATCGGCGAAGATGGCTGTATTTACATTTACAATCCACTATCAGGACTCGACAGCAAGTCATGGTTGAAGCTTGAGAGACAACCTGATGGTAAGTATAGAGCAAAGCTACCACAGGATATCCTTACAGACGACTACGGTGGGGACGATGATGAGGAGGAAAGTAGCGAGCGCACAATCTCTCTTACTCGTTTGGTTTCTTCTGATGATGGAAAGAATTACGAGCCTATTGGTGCAAATAACTACGTAGATTTCACAGTGGAAGGAAGAACACTCAAGATGTCTGGTATGGGTCAGAAAAAGCAAATATGGGGTGCTTCCTTTAATAATAAGTGGGAAAGAAACTATGGTGGCGACTGGGCTCTGACGATTGAACCACTTAAAGAGCAGCTTATTACACCACCAGCTACAGCTACAAAGAGCCAGTACACCGTTACTTCTAAGTCTGATCCTTCACCACGTATTGTTGAAGTAATGACCAATAATAACGATATCTATGTCAAGGGCTTGTTCAAGGCTGAGAAACTTGCAAACGTTTGGGTGAAACTTACCAAGCAAGGCGACAAGGCTGTCATGCCTACTAATCAGTATTTGGGTATTACTAAGAAGACAGACTTTAAGAAATACGATAGCGATAAGTCGGAATATCACACCTTTGCCGCAGCTTTCGAGAGTGAGACCAAGGCTGCAGAGAACCTTGAGTTCAGCATTGATGCAACTGGTAAACTGACTGCTTCTAAGATTCTTAGAACTTCTTTAGGTAGAGCAAGTAATGATAACATTACTGGTGAAGACTATATAGAGAGCTATGAGGGTCTTACCTTGACTCCTTACGTTCAGAAGGAGGTTGGAGCCCCAGCGACACCAGAATACTTCTATTTAACATCTACTCCAAACTACGACAACACGTCTAATGAGATTAAGTTAGCATTCTATGTAAAGAACGCAGATATTAATGGCAACGTTCTCGACCCAGAGAAGATGTACTACAATGTCTATATTAATGGCAGTACAGAACCTTTCAAGTTCAAGAAGAGTGCAAGTCAATACAATGACATGCATGAGGAAGAAATGACTAATATTCCTTTCAACTACAAGGACAAGAGAAATTATGACTTCAAGGTTATCGATAACCTACGTATTCTTCATTTCTATGACAGTTCTATCACACGCCTCAAGGTCGTAATGGTTTATGAGTCTGACGGCAAGAAGTATTCAAGCGAACCAATGGTTGCCTCACTTACTACCGATGGTATCGAGAGTGCAAACTTCAACAAGACTACAACAGAAAAGTATTACACCGTTGATGGTCGACAGATTCAGAAGCTACAGAAGGGTCTTAACATCATCAAGTCTTCTGATGGTACTACACGTAAGGTGGTTGTCAAGTAAACAAGTTGACGAGTAAACAAGTTGCTTGTATAGTTGGCAAGTGAACGAGTTGACGAGTAAACAAGTTGCTTGTGTAGTTGACAAGTGAACAGGTTTACAAGATAATTGTATAGTTGGCAAGTGAACGAGAAAATAGTTATATCTATTATAACTAACAACTCGCTTACTTGCCAACTTTTGTATTTGAGAAAGACTGATATAATTCAGAATTCATAATTATGATTACTGCTTTAATTCAGAATTCATAATTCATAATCCATAATTATGATTACTGCTCTAATTCAGAATTCATAATTCAAAATCCATAATTATGATTACTAATATAATTCAGAATTCAGAATTCAAAATCCATAATTATGATTACTGCTTTAATTCATAATTCATAATCCATAATCCATAATTATGATTACCGACAAAGATAAAGTTTAAAGCTCAATATTCAAATCATAAAGCTCAATATTCAAATCATAAAGTTCAATGTTCAAAACTCAAAGTTCAAAGGTCTAAGAGTTCAAACATCAAAGGTCAAAGGACTAAGAGTTCAAACCTCAAAGGTCAAAGGACTAAGAGTTCAAACCTCAAAGGTCAAAGGCAGAGGGGTATCTCGCTATCTACTCCCCTTCCCTTGGGGGAGGGGTAAGGGGGAGGGGCTTTTTGTCACATTTTTTCGCACATTCCAAATCCAACACATGCTCTTTTAGCTTACAATTAAAGCCCAATTGGCTTGCAAAAGATGCCCTTTTGAGCTCCAACTAACGCCCTTTTGGACCCTTACTAACGCACTTTAAAAAAACAATCTTGCAAGTATTTGATTCTCTGTAAGTTACAAAACCACCAAGATTAGCCCTTTTAAGACCATTTTTCAACCCAAAACACACCGAATTTTGTAAACATATTTCACCCTCACCCTACATTATCAACTGGTTATCTCATCCCCTTCAAAACCTGTTAAACATGTTACCCTGTCTCCTTGTCCCCTTGTTACCCTGTCTCCCCTTACCCTATCCGCAAACAACTCGTTCACTCCTCAACCAGTCAACTCGTCAACTTATCCACAAAACCTAAGGTATTCTTGTATTATTTTAATATCTTTGCATAATCATTAAAGAGTCAATACGCAAAAAGAAAAATAAGACAATGACAAGTACCGAACTTAAAGACCTCGAAGGCCGTCTGTGGCAATCTGCCGACATGTTACGTGCTGGTGCACACCTTGCAGCCAATAAGTATAGCCAACCTATCCTTGGTCTTATCTTCCTCCGTTATGCCGATGTGCTTTTCAAACAGCATAAGGAGGCGATTGATACAGCCTATAACGAGTATAAGGGCACACGCATGGAGCGTAGCTATAAGGACATAGCCATTGAGAAGTGTGGCTTCTTCTTGCCTGAATGTGCTTATTTTGATTACCTCAATGATGCCCCCGACGATGCTCAAAAGGCGTTGTTGGTAAAGGCTGCGATGGAGGCTATTGAGCATGAGAACCCACGCATGGACGGCGTTCTGCCAAAGGAGGTCTACGGACAGTTGGTACCAGAGGAAGAACCGGAGCTACTGAGCCGTATTGTACGCGTATTCAAAGACATCCCTGAGAATATCAGTATTGACATCTTCGGACAGATTTACGAATACTTCCTTGGCAACTTCGCCCTTGCTGAAGGTCAGGGAGGAGGAGCCTTCTATACCCCTGCCAGTGTCGTACAGTATATGGTTGAGGTCTTACAGCCTGCCACTGGCGACAAGAAGTTTCTTGATCCTGCCTGCGGTTCGGGCGGTATGTTCGTTCAGGCAGCACGCTATATGCACCGCCACAATACCTCTAACGAACAGATGATGAACTTCCGTTGCTATGGCGTGGAGAAAGAGCCTGACACGGTGAAGTTAGCAAAGATGAACCTTCTACTCAACAATGTGCGTGGCGAGATTATGGAGGCTAACTCTTTTTATAGCGACCCTTACAATGCTGTCGGACAGTTTGATTATGTCATGGCTAACCCTCCGTTTAATGTCGATGAGGTGGTTGTTGAGAGGGTGACAGACGATGCACGCTTTAACACCTATGGTGTACCACGCAACAAAACAAAGTCTGCAAAGAAGGCTTCTGACAAGAAGGAGACCGTACCTAATGCCAACTATCTATGGATTGGCTATTTTGCCACAGCCCTCAACGAGCAAGGAAAGGCAGCACTTGTCATGGCAAACTCTGCCAGCGATGCAGGAGGCAGCGAACTCGAGATACGTAAGAAGATGATTGAGGACGGCATTATCAGTCAGATGGTGACCCTCCCCAGCAATATGTTCTCTACCGTGACCCTCCCTGCCACACTCTGGTTCTTCAATAAGAAACGACCAAAGAAAGACGAGATTCTCTTTATTGACGCTCGTAACATCTTTACACAGGTAGACCGTGCACACCGTAAGTTCTCTGATGAGCAGGTGAAAAACCTCGGAATCATCTCTCGCCTTTATGAGGGCGACAGCGATGCTTTCTGGGCATTGGTTGAGGAATATAAGGCAGAAGGCAAGCAGAGCGAGGCTGACTGGCTCTTAGAACGCTGGCCAGACGGCAAGTATCAAGACATTGTCGGGCTGTGCAAGGTAGCGAAACTTGAAGGTGAAGATGGCATCATCGACAACGATTATAGCCTTAACGCAGGACGATATGTGGGCGTAGTGATTGAAGACGACGGAATGACCGAAGAGGAATTCCGCACCGAAATGCTATCGCTAAACTCGGAGTTTGCCAAGCTGTCGGCTGAAGCTAAAGACTTGGAGAGTGAGATTGAGAAGAACTTAAAGGAGTTGTTGGGGTAAGGAGAGGAAAAATATGGAATACGTAAAATTTAAGGATGTAATTATAAATAGTCAATATGGTTATACTGCAACAGAGACTTCTCAAACAGAGGGAACATATAAGTACCTCAGAATTACAGATATTGTTCCTTATTACGTAAACTTTGATACAGTTCCTTTTTGCAAGATTACGGAAAAAGATGTTTCAAAATACATTGTAAAAGAGGGGGATATATTAATAGCGCGTACTGGTGCCACAACTGGATATAACTATGTAGTACCAAGTGGGATAAGTAATACTGTCTATGCTTCCTATCTAATTAGGTTTATAGTAGACAAAAAACTTGTACTCCCTTTATTTATGAAGTACGTTCTTAAGACTCAATCATATTATGGCTTTATAAATAACTATATAGGAGGTTCAGCCCAACCAGGAATGAATGCGAAAGTCTTTACGAAGTTCAATATACCTAAATTATCATTAGTAACCCAGCAGAAGATAGCCTCTATCCTCTCCTCCTATGACCGTCTTATCGAGAACAACACTCGTCGTATCCGTCTGTTAGAACAGATGGCAGAGAACCTTTATAAAGAGTGGTTCGTCCGTTTCCGATTCCCTGAACACGAGAATGTAGAAATAGTTAATGGATTGCCTAAAGGGTGGAAAACTATTCATATAAAAGAGTTAGCACAGTTAAAATCTGGATATGCTTTTAAAAGTGAATGGTTTGTAGAAGAAGGAGAAGCTGTAGCTAAAATTAAAGACATTGGGAATATCTTAATGGATACATCAAATTTTTCATATGTAGATAAAGAAAATTGTATCAAAGCTAAAAAGTTTTTATTAACTACAGGCGATTTAACAATAGCGCTAACAGGTGCAACTATTGGTAAAATAAGCATTGTACCTAAACATAAAGGCAATATTTATACAAATCAACGACTTGGAAAGTTCTTCCTTGGAGATAATCCTATGGAAAAGCTACCCTTTCTATATTGTCTTTTCAAGCAAGAATCTATGGTTTCAAATATAGTAAATCTTTCAAACTCAAGTAGTGCTCAACCAAATATAAGCCCTGAACAGATTGAAAAGATTAAGATATTAGGAAATCATGATATTATAAGTATGTATAACAAGACTTGTAATCCATTATTTTCAAACATATTAGCATTATATTCACAAAACCAACTCCTCACTCGCCAACGTGACTTACTCCTTCCACGTTTGATGAGTGGCAAACTTGAAGTTAAATCTTAATCAACCCATTGTATTATGAAATCGTTTATCAGTGAAGACGACATAGAGCAGACCCTTTGTACCCGACTATCGCAACCAGAGTTCGGATGGAAGCGTATTGAGTGTGACCCGAGTGTGGAGGCACAAGACGACGTTAGCAAGACGGGGCGTCGCAATTCGTCAGAGTGTATCTTACCTGCAGTGTTCCTTACTGCCTTAGAACGACTTAACCCACAGATTGACAAGAGCATCTTAGCGCAGGTTGTAGCCGACTTCCGTAAGGACTATACGGGTAAGGACATGATGGACACAAACTATAAGTTTTATAACTATCTGCGCAATGGTATCAATGTAAAGGTGAAGAAGAATGGCAAAGACGACTTTGACATCGTTCGGCTGATAGACTTTGATAATGTAGAGAACAACGACTTCCACTGTGTCAATCAGATGTGGATAAAGGGTAGAATTCGTTATCGTCGTCCCGATGTACTCTTGTTTGTCAATGGACTTCCAATGGTCTTCATCGAATTGAAGAACTCAACCGTAAAGATAAAGGAAGCCTATGAGAAGAATCTCGTTAGCTACCGAGAAGATATACCCAATATCTTTGCACTAAACCAGATATGCGTACTCTCAAATGGTATGCAGACGAAGTTGGGTGCGTGGAACTCTAAGTATGAATTCTTCTTCGACTGGCTAAAGGTCAACGACGAACATGAGAAACTCGACCGTGAGCATATAGCCGAGTATGGTCTTTCAATCATCAACCTCATTGACAGTCTTTTCCGCAAGGAACGCTTGTTAGACTATATCGAAAACTTTATCTTCTTTGACAATAAGCGCAAAAAGATTATCGCTAAGAACCACCAATACTTAGGCGTAAACAACCTCATGAAGAGCGTGGAATGCAGAGAAGAACTAAAAGGAAAGTTAGGTGTGTTCTGGCATACGCAGGGTTCGGGTAAGAGCTATTCGATGGTGATGTTTGTACGAAAGGTAAGACGCAAGCTAAAGGGTAACTTCACCTTCCTCGTCATTACCGACAGAGACGACCTCGACACACAGATACATAAGACCTTCGTGCGCAGTGAGGTCATTGGCGAGAAAGAAGAGTGCCAGCCAAAGAATGCAGCCCAACTACGTGAGTTCCTTAGCGGTAACAAACCGATGGTCTTTACGCTGATACATAAGTTTCAGTATGATAAGACAAAGAAATATCCCCTCCTCTCTGAGCGTAATGACATCTTTGTCTTAGTGGACGAGGCACATAGAACACAGTATAAGCAGTTGGCAGAGAACATGCACACGGGTCTGCCTAATGCCAACTACATCGCCTTTACAGGAACCCCTTTGTTAGGTTCAAAGCGATTGACCAATCAATGGTTTGGCGACTATGTATCAGAATATAACTTTGCACAAGCAATTGAAGACGGTAGTACGGTGCGCCTGTACTATAGCCGTCGAGTGCCAGAGGTGGGATTAGAGAATAACTGGCTTGACTCAGACATTGACAAGATAGTCGAAGAAGAAGAACTCAACGACAGAGAAAGAGAACTCTTAGAGAACTCCTCCTCACGTATCTTAGAGGTAATCAAGCGCGACGGACGTCTTGACCGTATTGCGCAAGACATTGCCCACCACTTCCCAAGACGAGGTTTCTTAGGAAAAGGTATGGTGGTCAGTGTGGATAAATACACGGCTGTGAAGATGTATGAGAAGGTGCAACACTACTGGGGAGAAGAGAAGAAAGCCCTCATCAAGGAGCGCAATGCAGCCAAGACCCAAGAAGAACGTGATGAGCTGACAGCACGATTAGACTATATGAATAAGGTTGAGATGGCTGTCATTATCAGTGAAGAAGCTGACGAAGTAGAGAAGTTCAAGGCACAAGGATTAGACATCACTGTTCATCGCAATAAGATGAACGAGATAACACCAGAGGGAAAAGACATTGAAGACAGATTTAAGGACAAGGATGATCCGCTAAGTTTGGTCTTTGTTTGTGCGATGTGGCTCACAGGCTTTGACGTCCCTTCGCTCTCAACCCTTTATCTTGACAAGCCGATGAAGGGACATACACTGATGCAAGCGATTGCTCGTGCCAACCGTGTGTATCCCGGTAAGAGCTGCGGTATCGTGGTTGACTATGTCAATGTATTTAAGTATATGCAGCAGGCACTGTCTGACTATGCCTCAAATGGTGAGGAAGGTGCAGAGTTCCCAGCAAAAGATATTACGCAGCTAATCGCAACCATTGACGGTTGTATTGAGGAGTGCGACTCGTTCTTACAAGGTTTAGGGATAAGGTTGGATAAGATTATCACCGATGGTGACACATTAGACAAACTGGAGTCTTTGCGATTGGCATACGACAAAATCCTTGAGAAGGACGAGAGTAAAAACAGGTTTAAGGTGATGAGTAACCTCATGATGAACCTGCATGATGCAGCAAAGCCAGAGATATTTGAACTTGGATGGAAGAATGAGAAGTTCTCTCCACTGAGTTATCTCAACGGACTGTTCTGTAACAGAATAGATGATGAGAAACTAAGAAGGGCAAAGGAGAAGATGAGCTACACACTGGACGATAGCGTTACGGTGATGGTTGCAGAAGACAAGCCACAATATAGTATTCATCAGAGTAAGGTGATTGACTTGAGTAAATTAGATATAGAAAGTATTCGTAAGTCTATCAATGCAACTCCTTATAAGTCAATGGAGATTGATAACCTACGCACCTTTATAGAAACAGCTTTGGAACAGTTGATTAATAAGAACTGTACGAGAGTGCCATTCTCTCAAAGATACAAGAATATCATTGATACCTATAATGCAGGTGGTACGGAGAATGAGGATTACTATGAGAAGTTGTTACAGCTGATTGATGAACTCAAGAAAGAGCAGGGACGGTCGGCAGATATGGGATTAAGGGAGGAAGAACTTGAAATCTACGACCTTCTTATCCAAGGTCGGAAGCTAACAAAGGAGGAAGAGAAGGAAGTTATCCTTGCTTCTAAAAACCTTTATAACAAGTTGGTGGAGGAAAAAGAAAGATTGTTGGTGGTTGACTGGTATAAAGACCCACAACCAAAGACAAAGGTGTTGGGACTGATACAACGGTCGCTGGATAAGGATTTGCCAAAGACCTATGATAGGGAGGTGTTTTCGAACAAGACAAACCTGCTGTTAGATCACTTTGTAGATATGGCTGTACAGGGGTATGGTTGGATATCATAATGGTTAGGTCTATGCCATGCCATAATGGTTGGGGCTATGCCTTCTCATTAAGATAAACGAACAGCCTACATATATCTGTACATAGATGTGCCTTGCATTCCTTTCATGAATTGATTACCTTTGCAACGGTAAAGTTCAAACCTCAATGCTCAAAGGATAAAAACTCATATCATAAAGTTCAAAGTTCAAACATCAATGTTCAAAGGATAAAAAACTCATATCATAAAGTTCAAAGTTCAAACATCAATGTTCAAAGGATAAAAGAGTTCAAAGGATAAAACTCATATCATAAAGTTCAAAGTTCAAACCTCAAAGTTCAAAGTAAAAGAAGAGTCGAACAAAAGCTGTCCGACTCTTCTTTATATTATTTATTTAAGGTGAAACCTTATTGAAGATCCTTCATTGTAAGTGGTTGAGCAAGGGTAGTAAACTCAAGACCACCTGTGGTACACTCGATACCCTTAGCAGAATCCTTAATCTTATTGCCATTTACGGTCTCTGATCCATTCAACTTGAAATAGAGTTCCAAGCCGTCAGACTTAGGATCTACACTCAGCATATTCTGCTTAATCTGATTCTCACTACGAGCAACACTCCATACACGAACCTCACTCATATAACCATAGAATGGACGTTCACCCCAAGGGAAGCCAGGGATCTTACCAATGTTGAAACCGACATCAGCGTTAGGATCGAAGCCAGAGATATTCCATGCAGACTCAGCCCACTTCGTACCATTAAGGTACATAACAGTTTTACCAGTTGCCTGATCGTAGGTCAATGCCACATGATACCACTTGTTAATCGAAAGTTTCTCTGGTGCTTCATAGTGCTGTCTACCTGCTGCCTGTAAGATACCAGAAGAAATACCACCACCAACATCACCTACGCGGAAGATCATCAGACCCTCTGAACCCATAATAGTACAGTTGCTGCCCCACCAAATAGAGTGAACCAATGCCTCATAGGTGAAGGACTTAAAGTAGGTTCCTGCTGGGAACTTCACAGAGATATACTTATTATAGAAGTCACCAGCCTTTGTAATCTTAACAGGTTTTGCTAAAATAATATACTCAACATCTTCCCCATCAATAACAGGAGTAGAACTTGAATGTAGCTTGATAGGCAACATATAGTTCTTTCCCTCTGCTAACTGGTCGAGATTGTTAAGCGTCAACGACACCTTATCAGCATAGATACTTCCAGCTGCAATCGTTGAAGAAGCCTTACTGAACGTAACATTCTGATGCTTTAAAGCAACGTAGTCTGTTCCGTACTTGCTATTATAAAGAGCAACCAATGATGAGTCGGCAAGTGAATAAGAGACATCTACTGCCCCATCTTGCTTGTTAGAAAGGCGAGAAGTAATGTCGACACCTAAGGTTGAACCAGAGTCAGGCATTTCGACTTTGTATAGATTGCTATCAAAATAAACTTTCTGCTTCAACAAATCGCTCTCGCTGTCTTGGCAACTCACCACACAGCATGCGGCAAACGCCACAGCAATATGTTTGAAAATATGCTTGTTCATATTTGTATTCTATTAAATTTATAATGTGAGATATGCAATGTTAAGGTTGAGTGGTCTGCCCCATTCTCTCCTTGAACACCTGTTGGTTAATCTGAATTGCCTGACGCATCCACTTGTAGTTAGGAGTATTGTCGTAGTCGTTATCGAAACGATAAGCACCTACTCCACCCTTGTAACCCTTAGAAGGCATACTCTTGGCTTGTCTTAACAGCTGACCACCACTCTTGAAAGAAGACTCAAAGTTATCGGTTGTAATAATTGTCTTTGGATCAACACCGTAGTAATCGAAGTGAGTTGTACGACCGTAAGCCTGATCAATCCAGTAATCAACATATCCATCCAAATCCTCAGTAAGTCCACCAAAGTGGCCATCAATACAGATAAGACGGTGGCCCTTGTGCTCTGGATCACTCTTAGGACCGATGTACTTGTTCATCTCCTTTACCAAGTAAACGAGGTCGGCATTGGTACTGAGTGTACCATCCATATCGAAGACACCACTACCAATCTCCCAGTCGATATCGTAACCGTCCCAGTCATTGGCTACCAATGAGTCGCAGAGAGCCTTTGCAAAACGAGCCAAAGCCTCTTTGTGGTTATCAGAACCAGCAACACCCTCGTAGCCCCAGAACTTCCAACGAGCCTTCTTCTTAGCCTCTTCAACGCGAGCTTGGTCGTTTGCCCAACCCTCTTTTTCAGCCTGCTTCTCAACAGCAGTATAGACAGAGTCAGGAGTTCTACCCTTACCAATGTAAGACAAGAGTGTAACTTCCAAGAGTTTTGTACCCTTCACCTTCTGCACAAACTCCTTGTCCTTCTTCTGCTCAGGTGTAATATTAAAGCGGTTAGGTGCACCACTCCACATGGAAACGACGTCCATACTATCTGGCATAGAGGTAAGATAGCCACGGCGATAGGTACCTGCTGGCGACCAGTTAGAATACCAACCAAAAGCAACAGGACGACCATAGTTAACGGCTTGCTGCTTATAAGCTCTCAAATCGGCATAATACTTCTCGCTCTCAGCATTGTTCATCGTATTATATCCACCGATATGATCGTAAGGTTCGTTCTCTACCTCTGTCATCTTGCTACATCCACAGAAAAGGACTCCTGCACAGAGCAATACCAACGAACTGATATATTTAAAGTTTTTCATATTCAAAATGCTTGATATTAATGATTCTTATTGTCCCACCACAAACGTGTACCAGCCTTATCCAGACCGCCACGAAGTAGCTTGCGTGCACTCTCTAAGTTAGCAATATCCTCTGCCGATGTAGACTTATTAGTTGGGAAACGCATACGGCGGATGCCTACCTCGCTATCAATCTCACCGTTGCTATAGTTTGTAACCACACTATGCAACTTAGGATAACCTGTACGACGATACTCTGTCCAAGCCTCTTGTCCGTTAGGATAGAGTGCAATCCACTTCTGTATCATAATCTTCTCAAGTTTCTCCTCATCGGTTCCACTCCATGCCGTAGTAGCCTCTGTTACTGCTGGTACGTCAACATTCACACCTGGATTGGTGAGATGAAAACTGTAAGCAGATGGCTTGAGACCAGACGACATATAGTCTGCTACCTCAGAACTTGCAATACCATTCTCTTCAAATGACATCTCGATACCCTTCTCATAAAGTGACTCAGCAGTGCCACCCACAGCGAATCCATGCAGAGCAGCCTCAGCCAAGAGGAAATAAACCTCAGAAGCACGCATCCAATAAGTTGGAGTTGTACTTGTGATAGCTGGGCGTGAAGAATCCCTGAAAGCATCGTTAGAGCTTACGTCATGACCTGTTGGTACACCAGAATACTTTCCGTATGTGCCCACAGTCACTGCCTGAGAAACAGTACTTGTATTGAAGTACTTTGGCAAACGTGGGTCTTGATAACCGCCTAAGTAAGCCAACATAGAAGAACCCATACGGCACTCGTTATACTGATTGATAAGAACATCGAGGTTGTTCTTGAACTCCAAGCTTGCGCCTCTCTCCATCTTAGCCTCATCATCCTTGGTCTTCATCACGCCATAAGAATGATTGACAGCCTGTAAAGCATACTTCTTTGAGAGTGCAGCATCAGCATAATAAACACGCATAGCAAGACGTAACATCAAAGAGTTAGCATAGACAACCCACTTGTGAACGTCGCCTGCATAGACAGCATCAGCATTTGGCAAGAGTTTGCTATTACCATTATCAGCGTACTTTGTCAACACTTCTATAGCATCAGAGAGTTCTTTGAACATTGCCTTATACACTTCTTCCTGACTATCGTATGGCACAGTGATGAGTCCTTTACCTGCCTCCTTATAAGGGATAGGTCCGAACATATCGGTAGCCTTATGCCAAGCAGAGATCTTTAGAATCTGTGCTAAGGCGAAAACCTCTGGGAACTGAGTCTCAGTCTTACCCTTTAAGTCTTGCCACAAAGGTACAACCGTTGAGTAAGACTCGGTGTAAGACGATGCTACCCAACCATCCTTGAGGAAATAGTTGAGGTTATTAGGACCGCCCCAGTTATTGTTCTGACCGAAATAACCACTCCAACAATCTGCTGCAAGGTTATAGGCTGTTTGATATCTATTAGCAACACTGGTACCATCTGCCTGTGTTCCTACTGGGAAGACGCATTTCTGCATGGCAGTTATAGGACCACCAATAGAAATACCATCCATCAATCCCTCCTCTGGGAGGAGTTCAAATTCGTTAGTGTTTACCTTTTGGAAGTCACAAGCCGTAAACAAGGACAGCGCAAGGACACCTACGATATATTTATATGCTTTTGTCTTCATGTTCAATTAGAATTTGAATTTAACACTGAAACCATAACTCTTCAGACTTGGCTGCATGAAATAATCATTGCCCTGTCCGTATGTACCTGTTGAAGCAGTAAGCTCTGGATCGAAAGGTGCTTTACAATAAATCATCCATGGATTTGTAGCCACAAATGAAAGTGTCAAATCCTTAATGACATTATTAAACAGCTTTGAATTAAACTTATAGCTCAAAGTAAGTTCCTGCAAACGAACGTTTGTTGCGCTGTAGGTATAGTAACCTGCGAGGTCGTTCTCACCTGTAGCAACAAGTGTATAATACTTCTTTGCATCATACAAGCCCTGATTTGGAATCATCACGCCACCAGCATCTCTTGCATCAGCTGATGCCTTAGACACACCGAAGCGGTCTAATAATGCCTGTGTTGAAGAAGTAACAATACCGCCAACACGTGCATTAATCAAGAAACTAAGCCCGAAGTTCTTGTAAGTGAAGTTGTTGTTCCAACCCATTGTGAAGTCTGGGGTAGTCTTACCTAAGTAGATAGGCTCAACAGTCTCTAAGTTCATACCACCTGATGGACTAACATTTACAAAGCCTTGATTGTCCTTTGCCAATACCTTTCTTGCATAAATATCATTGATAGAACCGCCCACCTTAAGCAGAACACGGCCATTATCCTTTGATACTTCAGGGATATTGATCGGCTTTGGACTCAATGGGTGATGGTAGTCCTTCACCATCTCCTTAATCTCATTGACGTTCTTAGAGTAAACCAATGATGAGTTCCATTGCAAACCACCAAAGTTATCGCTATAGCCTAATGCCATCTCGACACCACGGTTCTCTACATTACCTGCCTGCAAGTAAACAGCCTTGTAACCAGAGCTCTCTGGCAACTCACCGATAAATGTCTGATTGTAAGTATTAGACTTGTACCAAGTGAAGTCGAATGATAACTTCTTCCAGAAACGAGCAGTCAAACCGAACTCGTATGACTTTGTACGCTCTGCCTTATAATCAGTAAATGGATAGATATCAGTAGATTTCAAGCTACCACCCACGATTGGAGTTGTAATAGTACCTGGTGTCATACCTGAACGTGAAACAGGAGAACCAACCTCAGTGTAAGAACCACGAACCTTCAAATAAGAAATGAATGAAGGGAGTTTTGTCATCTCTGAGATAATACCAGATAAACCAACTGATGGATAGAAGAATGATTCCTCAGATGAGTTTACAAGACGTGAGTTCCAATCGTTACGACCTGTCAAAGTAAGATAAACCATACTTCTCCAACCCAATTCGGCACTTGCAAAAGCAGCAACGTTACGTACCTTTGAGTCACCACCAGCCTCACGAATCTTGCTATCAGTTGGATTTATGTTATTCAAAGAGAACTTATTTGGAACTAACACTAAGTTACCACCATAACCACGGGTGAGAGATGCATAGTCAGAATAGCTATAACCAACGTTGGCAGCAAGACTGAACTTACCAAAGGTTTTATTAATATTGGCAATAGCGTCAATATAAGTCTGATGATCCTTATAATCATAGTAATCATAAGCGCCCTTCTCCTTGGCAAAGTAATTGAATGAAGAAGCATAAATCTTACGCTCACTTGTGATAAACGTCTTGTCGAGACGAATACGACCTGCTACACTCAACCAATCAAAGATATTATAAGTAAGACCAGCATTGAACATGAAGCGATCCTTATCGTCTGGTGCTAAGTTACGATAAGCCGTCCAATAAGGGTTCTGATTAGCAAAACGGCTATCACTGATTGGCCAATACTGAGTTGGGAAGTTTCTCACGTTATCGTAACGCTCAAAGGTCTTGATTCCCTCAAATGACTCACCGCGTGGGAAGAGATAAGCTGCGACGATTGGGTTCCAGTACTCACCCTGTGACACCATGTTATTGTCTTTCTGCTTGATATAAGATGCACCAAGGTCGAGTTGCACCCTATTCTTAAACATACGCGTAGTGTTACGGATGGTGAAGTTGTAACGATCGTAAGTATTATTTGGTACGATACCACGTGAATTGGTGGTTGCTACAGAGGCAAAAGTCTGATTAAACTCATTACCCATATTCAATGTCAAAGCATTGATGAAGTTCGTTCCTGTACGGAAAAAGTCCTTCTTGGGATCGTATGAACTTGGTGTTGCGAGTTTCTCACCCCAACTCTCATAAGAACCAAGCTTGTTGCCATACGTATTCTGAAACTTTGGCATCAACAATGGTGAGCTAAAGTCTGCAGAACTTGAGAAAGAAATGTCTACACGTCCTTCTTTACCCTTCTTTGTGGTGATGAGGATAGCACCATTAGCAGCATTACTACCATAGAGGGCTGCAGCAGAAGGACCAGCCAAGACGCTAATGCTCTCGATATCCTCTGGGTTGAGGTCGGCAATACCTTCACTGCTTACACGGCCTTCTCCCATGATACCGCTATCGCGTCCACCACTGAAATTGAAGATTGGAATACCATCAATAACATACAATACGTTGTTATCACCTTCGATAGACTTGGCACCACGCATGATTACACGTGTTGCACCACCGACACCACTGGCACTCTTAGAAATGCTAACACCAGCTACTTTTCCGTTTAATGAGTTCACAAAGTTAGCATCTTTGTTACGTGTCAACTCATTTGAGCCTACTGACTGTACATTGTAACTCAAAGCTTTCTCAGCACGTTTAATACCCAAGGCTGTCACAACAACCTCATTCATTGTGTTTGCAACTTCTTTAAGCGTAACATCGATGGTGGTTTTGTCGCCCACAGGGATTGTCTGGGTAGTATAGCCAACATAAGAGATAACCAACTTTGCCTTTGACGTGGTATTGATAGTAAAGTTACCATCAATATCCGTTACAGCACCTTCTTTGGTTCCTTCCACCATAATAGTGGCACCAATCAGTGGCTCACCAGCAGAGTCAACAACGCGACCAGTTACCTTGTGATTGCCGTTTTGTTGCACAGCCTGAGGCATAGAAGCATCAGGTGAGCCTACGGGATTTCCTACTGCATAAACACTCGTTGGAGCTAACGCAATAGAAAGGGCAAAAGCGACTGAGAAATACAGATGCTTCTGCTTTGTTTCATGAATCATTCTCATTAGGTTCAGGGTTTAAATATTTATATTTTAGGTTCCTAATTGTTACTTATTAATAACAGCTGCATGAACAACTGCATTACAACACTATCACCTTATTATAGTAAGGTAATGATGAATAAAGAAACATTTTTTATTTACACAGGTTGTTTAACGTATAAACGTAAAGATTATTAATGAATAATAGGTTAGGTCAATTAACTCATAATAACGAGTGCAAAAGTATAAAATAAATTTGTAAACAGCCCTCCTTTTTATAAAAAAAAGTAAATAATATATTTGTTATTGCAAAAAATAGAGATATACGCCAACACGTATATCTCTATTTTTTTTATATATAAACACGTAAATGCCGTTTACCCGTTCTTTACATCAACTATCTTTGTTGGAGCCTGTTCCTTATTGCGACGATTGACAACAGTGACTACACTCTGCGCAAGACTGAGAAACGACTGACCTGTAATGGTGTCAACCTGTGACGCTGCTGGTGTTCCGTTATCACCATTCTCGCAAATACTCTGTACAATTGGAATCTGAGCAAGTAGCGGACATCCAAGTTCTTTTGCTAAGTTCTTGCAACCATCTTTTCCAAAAATGTAATATTTATTCTCAGGAAGTTCAGCTGGAGTAAACCATGCCATATTCTCAACCAAACCGAGAATTGGGATATTCACCTTGTCATTACGATACATATCAATACCCTTACGTGCATCTGCCAATGCAACATTCTGAGGAGTTGACACGATTACTGCACCCGTAATAGCAAGTGTCTGCATGAGTGTCAGGTGAATATCGCTGGTTCCTGGTGGTGTATCGAGGATAAAGTAATCGAGTTCGCCCCAATCAGCATCAGCAATAAGCTGCTTAAGAGCTGAGGTTGCCATACTTCCTCGCCAAAGGGTTGCTGTGTCTGGATTGACAAAGAAACCGATAGAAAGCAACTTCACACCATACTTCTCCACAGGTTCGATAAGCTGGCGTCCATCCTTTTCTACTCCGTATGGACGAGCATCTTCTACACCGAACATCTTTGGCATACTTGGACCGAAGATATCGGTATCGAGCAAACCTACCTTATAACCAAGGCGTGCCAAGGCTATAGCGAGGTTAGCTGAGACGGTGCTCTTTCCTACTCCACCCTTTCCTGAACTAACAGCAATGATATTCTTCACCTGAGGCAACAACTTACCCACCTCTGGGCGTGGCGCATTCTTAAACTCTGTTGTAATGGTTACTTCAACTTCCTTACCAACAGAATAGTGAATCTGAGCCTCAGCAGCCTTAATAGTAGACTTGAGGAATGGGTCGGTCTCACGTGGGAAAATAAGCGTGAAACTTACTTTATTACCATTGATGCTGGGTGTATCTGCCAACATCTCACTTTCAATAATATTCTTTTTCGTTCCTGGATAAATTACTTTCTCCAGTGCATCTGTAATGAGTTTCGGATATAATGTCATTGTTTTATTATTTTATAATCTTTGGAGAATGAAGCTATCTCACTGCTATAAAAGCTTATATTCCTTCTTATTAGAATTTCACTTACTTCTTTAAGTTAACAAAAAGAATGCCACAAAATAATCGGACACATTAGTTACCGTTATTTTGCGAGTTCTCCCATTCTTCCCATTCCTTTGCAACAGCTTTCCAGTCGGTCATCTCTCCCGCTTCAATGGCTTGCTGCTCACGTGTAGCCTCATCACGCGCCTTATCACGCGCACGCTTTGCCTTCATATCAGCGATAGTGGCATCATCAAGAATCTGAATAGCTCCTCGTTGGATAGCACCCTGTAACTCCTCTTCACCGAAAGCCTTACCGGGCTCATTGAAGTCAGAGATATTAAATTCATAAATTGTACGGAGGTCATGACGAACCAATTTCTGTTTCGCCCGATTACCCGCATTCTTCAGTCGGAGTAGCTTTGCTATTTCTTTTATCTCGTCTTCTGCGAATTTATTTCTTCCCATGTTTTGATTCTTATTCAGATGCCATTGATGGCTTTCGTGTCTGCGAAGATAGCAACAAAATCCCAATACTGCAAGTTTTTTCTTAAGGAATACATTACGTAAAGTATGATAAATTGGCTTCATTCGTTAAACCTAAATCTGTCATTAGTCCACAAAATGTATAGTTTTTATTACTATGGTATTGTTTCCTAACATCTTTTATTTTTTCGGTATCGCCGCTCTTTGTATCTTTGATACCTCCATGATATGGTTCTTCCGAAATATGGAGTGATAAACTAAAAGTCTTATATTATAAGGCACACAGAGTCACGGAGAGGACGGAGGTAACGCAATGGCACGGAGGTGTAGCATACAGATTCTTGATAACTATTCTGGGATAAACGCTTTGTATATAAATTGCTAATAGAATTGGCTAACAAGCTCCGTCGCTCTATGCCCGACGGTGCCTCCGTGACTTCCATTGCTTTGTTTTAATAATCCTCCGTCCTCTCCGTGTCTCCGTGTGACATTTCTTCAGAAACCTAACCTTAAATAACAAACCTTATCACTCCAAAATCCAGAAGAACCCATGATATTACGTTTACCTCTGTCTCTCTATGTCTCCGTGTGCCTTATTAGATAAGAACTTTAGCCTATTATTCTATCTTAGGAAGCGACCTCTTTTTACACAACCCACCTATATTTGTAAAATTGTTTCAAACGCTTAATTATCAATACATACTCTTTAAGCTTCTTAAAGATGCCTAATTAGCTTCCAATAGGTGCCCTTTAGAAGGCTTACTAACGCCCTTTTGAAGTCCAATTAAGCATCTTTTAAAACACTTCATTATAACAAACTGACTTACTGACAGTTACAAACCATCTGGTTATACTTATTTTTCTCCTTTATTTTAAGGCTCTTATTCAAAAACATGTAAGTATTTTTCGCAAACCAATACACGCCCAACAACCTACTCTTTTCAAGGTATCAACTCATTAAGTTACAAACAATGTTATAGATTGGCTGCCCGATAGTTGACATGGTAGGTTTTCCTTTCTATTGATTCCTTGGCACTATCCACAACTATCTCCGCATGAAAAACACTTTTGGGCAGCAATAAACAGGGACGTCAGAGCGTTATTGTTGACAGAATAGGTTACGACAAAAGACGGTCATACGACCTATACGAAGCTTGGTAATGGCACCGAGACTACCTATACATATGACGAGCAGCGTGAACGTCTGCAGGTGATGAACCTTACAGCAGGCGGTCAGACTGCAATGGAGAACTGATATTAATCCAGAGTAGATTAGCATAATTCTCAAAAAATAGTAGTAATAAAGAAAACAGATAAGAAATTATGAGGCAGGCTATATCGATTATTGTCCTATATGCTTTGATAAATATTCTATCATTGTTATTCAATATAATTTCTTTTTCTACTGATGGGAAGATAGATATTGGCTTCCCTTTTATTTTTATCCATTTAACGAGTGCAAAGTATGATCCTACTTTTCCATACAATAAATTACTCATAGAGCCTTTTTTATGGGATATTATTTTTTTATTTACGATAGTTTGTTTATATTTGTGCTGCCTAAAATTTATCAAAAGGATTCGTGATTGAGAAAACCTAAGAATGAAAAATAAATGATGACCTACGATGCGGTGAGCAACCTCACGGCATAGTTGAGAAAGTCTATCGCTAAGAAGGAGTTTGCATACGCATGAGTATGATAATTTGAACACCCTTATCCATGCAAACGGTAAGGCGAACTGTTATACATCCAACACTGCTGGCGAACGCATCATGAAGAAATGCAAATATTAGGAGATAAATATGAGTTGCAGGCATAGAGAAAGCCAGATGTCTTCTTCCCTGCATTTACTGAGCTTAAACAAACGATTTATGATTATAGCACATCGACTGACAAAGAACATCATGATTTAATACAAAAGTACCCTAATTTACAGGGAACTAAGTTGTATAGAGCAATTAAATCCGAGTTAATTGAACAAAACAGTAATTAAAAAAACATTTGATCATGATTTTAAGGAATACAAAAAGAATTATTAATTATTGTGTGTTGATGATAGGGCTTGGCGTATGGGGATCATGTAACTCTAAATCACAACATGTTTTACTCCCTGTAGATAATGTCAAATCTTATAAAATATGTGAGAATAATGACACAACAACAATTTTCGAAGAAAGAGAAGAAGAGTCTGAAGAAGTAATTAAATTTTATAAAGAGGGCAGTGAAATTTTTACTTCCGACTATGGTGGACGCAAAGAATTACTGATGTCCACAACAGAGATGCTGGATACTGTATATTCAGGAAACACTTATTGTAGGGAGCATCGAATTCTGATAAAAAAAGAAAATTCCAATCTGTTTTCTACATCTATATATAATATTATTATACACCCTGTATTGGTCTTAACGATTTATTATGACCATTCATATAATATAAAAGCTATTCGTAATTGGTTTGCATTTACGACTTATGAATCGGAACACATTAGTATCCCGCTAATATCGAGACCAGTTAAATATTAATCAAATAACCCCGGAAAGAGAGGACAATCGAGTTCTCTCTTGCAAAAGACTGTAGTTCAATTAATTCAAAGAGCTATTAGTCCTCTTTAACGGGACAGTAGTAATCGGAACATGAAGAAAAAGAGCTCTTTTGACAATGGACTCAGACTCCGCATCCTAACACGACACCAGGCACCAGCCTTGGGTGCGCCACTCTCCATATTATCTACTGGAAAGATGGATGTAAGACAAGGTGTAAATTTGAAACTATCTGGTGTAGAACGGAAATGCCGATTGCATACTGTTTTCGATTTTGCTGTCATTCCTGTCACTTCACTTTTGACATTAACCTACTATCTATAAACCCATTACACGAAGTGTTAAAAGTGACAGCAACTAAAAATAAAACTAAATTAGAGGTTTTACCCAATTCGGTCATTAGAGCCTAAACATTTTTGGTTTTAGTATCGAGTAGATTGTTTGGCTTTGGAACGCAGGCGTAGCGGGCTACGTCAAGTTACAAAGACAGACAAAATGTCGATAAAAAAGATGTTAGGAAACAATACCATACTATCGTAAGAATGTACATATTGTAGTCTAATGATCGATTTAGGTTAAATGATTCCCTCTTGGTTACATTTACACATACAATACCCTCCATCTTTTCTCCTTTTCCCATGTTTTTAAGACCCTTTTACAAAATAAAGTTGTATATTTGCCTTTGAAATTGCTATGCTCGATATGACTGTTCTTTCCAAAAAGAGACTAATCCTGAAAAGTATAGCCACAACATAACACATAACAAAGGACGTCTGCAGGCACAAGACCTCACAGTGGATGTCGGGTTGTGGTGTAGGAAAACTTCTCCTCAATGCTGTAATGAACGGCAAAGAAGGGGAATAATGTCCCTGCCTAACATACATTTCCTACCCACACATTATTCCTATTCTCACCACTGAATTCCGATACCGTCTCTTACAGACAGTTCAAAAAAGCATCAACCAACGTTTAAAATGAAAATAAAGAATATTTTAGCTGTTTTTATAATAGTAGGCGGCATATCTTATGCTCTTTACTTTTCATTAGTTACAGATGTTTTATTAAAGTATGGTGAAACAATTCACACAAAGGCTATTATTGAAGAGAGATTAACAGGAAAGACTTCCGATCCAGTGCTGCGCTATAGATTCTTATATGAAAATCAAGCTTATATCGGTTTTGTGTCAGAAACATCACGTTTACATGTGTCTGATACTATTAATATTGTATTCTTAAAAAGTAGACCGTCTATAAACAAACCATTAATAAAATAAATTTTATATGAAAAAGAACTTGTTATTCTCATTTGATGATGAGATTACGACTGGGGTTTATTATGACATTGATGGTCATAAAATTGTAGTTTATTTTGCGGCTTATTATGATAATGGGAGGTTCGTAGAGAAGAAATGCCAACTTATTATTGAGAAATGGGAGTATGCAAAAAGTAAATTATCAGTGGATAACCGATATAAAGATTTAGAAGATCATATTGGAATTATAAGTATGATTTTAGATATGCATATTGCAGATAAAAAATTGTTTTTAACGGTGAATACATTAGATGGTCAATATGTGGATTTACTGTTTTATAATTGCAATGTTAAAATAGAAGATATTTCTTAAGTAGGACTTGTATAATACAAAATGTGACCTTATGAATTTAAAAATAAAACCGACTCATAGTACCCTTTTGAAAAAGATAAGATGACACCTTCAAGGTTTCTTATGGTTATACAACTTATAAAAGGTTTACTTCTAATGGCAAAAGATTTTAATATATATACATATCCTACCTTTTTTTCATTGTTACTGATAATGCTAATACCATTTGCATTATCAGTAGATGATATTATGAATAATCACCAAAGCAATATGAAAATAATACTTTGGATTTCAGTATTTATAATCATCATTCTTGTTTATGGATTAATAAAAACTCGCAAAAGAAAAATAGTCATAAATGAGTTAGGCTTACTAATTACAGAAAAGGAAGGAAATATAAAGATTCCATGGGAGGATATATCGCACATATTTTTTTGTTCAACACCAATTTGGGGGCTATATGTAAAAATAACTTTTACTAATCGAAGGAATCCTTTAATTATTGACTTTGGGAAAAGCAGTTTGTGGAGTGTCAATTTTTATCGTTTTCGGATAGCCATATTGGCGTTTTCACATAAAAAGGATATAATAGTTGTAAAATCCAATCAATGGTATCTAAAGCTAATATAAAAATAAATACACAGATAGAGAAAAGGCCACATGTCTATATTTGGCAACAGCAAGACAAGCTGCTACACCTATAACACAGGTGGTCAGTGCATCATGAAGTGTTATGGTACGATGGAGGGTGTCTTTACCAATGGAGCATCATAGGGTATTGCCTTCTAAGAGTAACAAGAAGAGGTATGAAGGCGATTCTCGTTATAGAACAATAATAATCATTCAAGAAATGAAAAAGAAACTCATTTTTATTCAGCTCTGTATTGTATTGATTTTATTTTGTATAAATGTAAAAGGGCAGAATGCTTCTCTAAACAAAGCATTGGAAATCTCATACTATAATTTCTTAAGAACAGTACCTGACAGGTGGAAAGATAAATTTGACAGAAAAGATTTCTATCTCGTCAATAATTATAATCCATATGGTTTTAATGTTCAAAGAATAAAGGGCTGGAATAATTTACACCTTAAGAACGGTAAACTTATAATAGGAAAGAATCTTGTAGGATATCCGCTTTTACAAATGATAGGCAGGGATACACTGCTAATAGAGATAGGGCAGATCTTTTGCAGACAAGATAAAATGTGGGCTTTTACAGACGGATATACATCACTCTTTAATGTGGATAATAAAACTTTTAAGGCAATAAAAATCTCTCAGAAAGACTTATATTACAAGCCTGATAATGCTAAGAAGCAAGGAAAACTTATAGACTTTGACAGTATTTATGATAAGGCCGTTAATAGAGCGATTGGCTATTTGGAAAGATCAGGAGTCCCTCGGCATTTAATATATATAAATAAGGAATATTTTCCAAGTTGGTTTTTCAAAAACCAACAAGAGAATGTCATTGAAGGTTCTGAATATAAAAAATACATCAAAAAGAATTATTACGTTATCGGTTGGCCTATTTTATTTATAAAAAAAGGGAATATAATCGTCAGGTTGCATTGTTCATACAAATCTGGTATAAGAAAGAAAATTATTTCAGAAGTCCAATATACATTTGACGAAAAGAAAAATAAATGGACACTTCATAATGAAAATTCCACTACTATGTCTATTAAAATGAAATTAGTTGATGAGCATAGTAGCAGTGAGGACCTGCCATATAAGTTCAATGGCAAACAGGTCGATGAGAAAACAGGTCTGTACTATTATGGTGCAAGATAACACCTTCAATGTTTTCTTACGTTAAACAATTTATAAAAGGTTTACTTCTAATGGCAAAAGATTTTAATATACAGACATATCCTACATTCTTCTTATTGTTATTGATAATGTTAATACCATTCGCATTATCAGTAGATGATATTATGAATAATGATCAAAGCAATATGAAAATAATACTTTTGATTTCAGTATTTATAATCATCATTCTGGTTTATGGATTAGTAAAAACTCGCAAAAGAAAAATAGTCATAAATGAGTTGGGCTTACTAATTACAGAAAAGGAAGGAAATATGAAGATTTCATGGGAGGAGATATCGCACATATTTTTTTGTTCAACACCAATTTGGGGGCTATATGTAAAAATAACTTTTACTAATCGAAGGGATCCTTTAATTATTGACTTTGGGAAAAGCAGTTTGTGGAGTGTCAATTTTTATCGTTTTCGGATAGCCATATTAGCGTTTTCACATAAAAAGGATATAATCGTTGTAAAATCTAATCAATGGTATCTAAAACTAATATAAAAATAGGTAGTTCAATTGTTGTTCAAGGTGCTTCGTTAAAAGCTAATAAAATAATTTGGTTTGGCTTTACAACATTGTAACTTTACATCTGATGGTCAATAGCTACACGAAAACAAAGGATTACAACCATGAAACCATTTAGTGTTCAAACGGCAGAGAAACATCTTATCAATATAATTTATTCGGTCGAAAGCTTATATTCTTTTAGAGAATCATGGTTAGGATAAGGAGGAGTAAAACATTTAATTTAAAAAAGTAAGCTATGTTAGAATATAAAAGATATCAAATTTTGAACCTAAATTTTATTAATGAACTTGATACTGTCATTTTAGATTCAAAGCTTAAATATATTTCTGATTTTATTTTCCGTCAAGTACAAGGAAAATCTGTTTTCCTTTTTGAGATGAAGCCAACGAAGACATCAATTCTCTATGATATAGACCCAGAAAGCAAATATCATAGTATTAACTCAAGAAAAATAATTTATACTAAGAATTTAAATCTAAGCAAAAAGGTAATTTCTATGATAGTTGAAGATGAAAGCTTCTATTTGGGACATCTTTTAATTATTATAGGGACTTTCTCAAAAAGTAAGATTTCTAATATTTCAAACATGTGGGAAAATTTATTAGAAACAGGAATAGAGTATTTTAAGATGGGAAATGATGGAGAATCTTTTTATTGGTGCAACCCTCAGATAAATTCAGCAGAGGAAGAGTTCAAATTATTTATTAATGAATTGAACACCTTTGCCAATGGTATCAAACATATTAATAATGAAAACGGATAAGAATATTATTGGGGTTATAGCATTACATGTAATAACTTTTACATTTGTTATAATTTCGTGGAATCATTCATTTAATAGCATAGGTTTTAATATAGTCTCTGGTATTTCATGTATGTTATTGGGAGGCAGTTATTTACATCTCGAATTATCGAAATATTCTGAAGAAGGGAAGGTGACATGGTTCAGTATTACGCATACGATTATACCTTTAATCTTCTTGTTATTCTGGGCTGGTATTGCTCTTATATTTTATCATTATATCGTAATAATGTCTTTATTACAATGTCTGGTCACAAGTTTAATTCCCCTCAGTTTTTTGTACCTTAGGAGTAAGCGTAAAGAAAATAAATATATAACGCTATTACTCATAATTATAAGCGGCATCTTCATAGTTCTACCCTTTATAGTTTTATCTATTTTGTAACTATTCAGCGGTATCGATCAGTCTGTATTCTCTCTTCTAAAATCTTCGTATAAAGACTATAAAGATGCTTATTTTTCTTGCGTATTTCAGATTTTCTTTGTACCTTTATACCTATGAAAGAGCAAGTTACGGACATATCAAAAGTATTACAAGGCATAACGGAAGATATGCGATTGTTGCGTGAAACTATCAATCAGCAGTATACTGAGATTATCAAATCGAACCGTAACATAAATGCTCTGAACCTTGAAATTCGCAAGAAAGATACGAAACTTATAAACTTACGGAGTGTGCTTGATTATGTGACGCAGGTTATTTCCATTCCAGAGTTGAAGCCCGTAATCAAGGAAATCCGACACTATGTGATGATATGCAAGAACTGTGGTGAACGTATTCGGACGGTACTGAGACGGCGGTCAAACAACGTGGTATATGATTCAAGCGTAAAGACCTTAGTGGTTTATCTGAGTGTCGTTCAATTTCTTCCTTACGGGTCGCATAGCAAGTTTTTTGCGTGAGGTATTTGGACTCACTCCAAGCGAAGGTTCGCTGGTGAACTGGGTAAATGAGGCTAAGAGAAATGCGCAACCTGTGATTGATAAGATTAAAGAATATATCAAGTCATCAGCAGTTGTTGGTTTCGATAAGAGCGGCTTGTACTGTAACAAAAGACTCGACTGGGCATGGATTGCACAGACTGTTTATTACACATTGCTTTTCCGTGCTGATGGAAGAGGATCGAAGGTATTAGCAGACAAATTTGGCGATAGCTTGGAACGAATGACTGCCGTTACCGACCGCCATAGCGCATACTTTGCACTCCATTTTGGGACTTGCAGCCCTAAATGCCTTAACTTTAGCATTGAATGACTCGGCATTTGCATTCGTAGCTCTGTTTACAAAGAAATTGAGTATGGTTTGATAATGATTTTCAAACGTGTCAATCACCGTGTAGAAATTGTCCACTCCTAACTCTTTAACCTCATTAAACCATTTAGCCAGCTTCAGCCTTGCAGCATCCTTGATGCTCTTGATGTTATAAATATCGGTAAGTTTCATAGCCAAATCATATGCCTTTTTCAGTGTCGGATAATGTTCAAAGATGATTTGTGCCCTACCTTTCTGTGTTTCAGTCCACTTGGTCTTGTGCTTGGTCAATATGAACTTTGCTCTGGCGAGCAACTGCTTACGCGTGTCACCATTGCTGTATCTAAAGGGTATATATTCCTTACCCTTACTTTTAGCTTCTTTTATCTCCTCATTCTCTTTATCTCTTGCCATCCATCGGTAAGCGATACGCATGTCATCCAGAGCATCATAATATAGTTTCTGCACATGAAATCTATCATTGGTAATAAGCGCTTTGGGAAAAACAGAGTGGGCTATGCGCATCATAGAAGATGACAAATCGAGTGTTATCTCCTTGACAGTCTTCCGTTTAGAAAGGTTTATCTTCTTGAGAACCTGAATAACGTCCTCTGCCTTGGTCCCTTTAACCACAGCTACCAAAGTCCCTTTTCTGCCCTTCCCCGCCTTGTTGGTCAGAAAAGTATAAACCTCGCCACTGCTTAGACAGGTTTCATCAATACTTAGACTCTCACCTATGTTATCTTCAAACAATAGCCAGTCTTGAGCATGTCCCAACTGATCCCAGCTACGGTAATCACTGAAATACTCCTTGTACTGTGTGGATAACAGCTTGCCATTTACGCCATAGTGAGCACCGATACTTGCGATGCTCTCTGCGGTGGACTCTATTCTATTCTTTTAAAAAAGAAACGAACTCGGGGGATAGTCTACTGCCCTCAGCCGTCAAGTCATCATATGAATAAGTAAATATCTCTCCGTTGGAACTGTCACGCCACTTGCGACGGCGAACATGGAGGTAAACTGCTTTGCCACGAAGGGGGAAGTCCTGAATAACACGCTCGCTGGTAAAACCATAACTGCTTACAGTGCCTAACTTATGGTCTGACTTTTCCATAAAGTTACGCTCGTCAAGCCAAAAGTCAATCTGCGAAACACTCTCTTGAATATCGACAACATCAAAGTAGTCGGCAAGTACATCTGGAAAGATACAACGTAATAATTGGTGATTCTTCATGATGCAAAGGTAATAAATACTTATGAAAATATGTGCTTAAGAGTATAGACTTGCAACTGGGTTTTGGGACTGACCCGTATTTACTCCCTCTGTACGTACTCAGAGAAGAAGAATGTATATAAAAAAGAGGGTGTGTCAAAATTGATACATCCTCTTTTTATTCTTCCTTTACCTTGTCTACCACATAATTCGTTATTTACTGCAGCTATGCTGCTAATTTCATATAAAATGATTTATTGGGCTTTAAATAACCTATATAAACTTGTATAAAGTATCTGAAGGTGACTAAAAGTAGCTCCATAACGGCTTTTAGTTCCTTGAGATTGGTTTTTCTACACATTTTCCCTATGTTAAAGGCAATAGCGAAGAAGGCAAAGTCCATATTGACCTTGTCTTTCCCAAAATGCCTAAACCTCTTGTAGGCTTTATTATATTTTGTTTGTCCAAAAACAGCTTCAGGTTCTATGCACCTTCGTCCTCGATGCTTGATGCCTTCTTCCGAGGTCAGTAGTTCCCGTGCCTTTTGTTTATAGTGCTGTAGTTGGTGGTTTACTTCTATAATTCTGTTCCCTCTTGCCTTAAAACATGAGCCTCTCAAGGGACAACCATCACAGCGTTCTGCCTGATAACGTACGCTGTAAGTAACGAATCCATTAGAGGTTAGGGAACGCTTCATACCTATACGCTTCATATGCTGTCCCATAGGGCAGACGTAGAAATCCTGTTCCTTATTATAATAAAGGCTTGCGGGACTGAATGGGTTAGGTGTGTAGCGTGGACGCTGCTCTTTATGGAAGTAGTTATACTTCACATAGGCTTCCATATTATGTACGTCCATGAACAGATAGTTCTCCTCGGAGCCATACCCCGAATCGGCTACGACTGTCTTGGTATAACGATGATAGCGTGATTTGAAAGACTCCAAGAAAGAAGGTAGTGTGAGTGTATCGGTACGATTGGCATAGAGTGCAAAGTCGGTGATGAACTGGTTCTCCGTTGCTATCTGCAGATTATATCCAGGCTTAGTCTGTCCGTTCCGCATAGCGTCCTCCTTCATGTGCATAAACGTGGCATCAGGGTCGGTCTTGCTGTAGGAGTTTCTCTCTCCCATAATCTCAAGGTGTTGGTCATACTCCTGGAGTTTATTACGTTTCTTCTCAAGTTCTTTAAGCTGTTTTTTCTTGGTTCTAACAGCCTGCTTATCTTCTTTAGTCTTAGGCTCAGGGGCTGATTCCAAAGACTTGTTCAATTCCTCGGATATCTCATCGAGCAGAGCTGCAGTAAACTCGACACCTTCTGTTTTAGCGGCATTGTCCTGCGCTATGACATCGTCAACCTGAAGCAAAAGTGTGCGTATTTGTTCCTGCAACTTGGCGCGGTTCTTTTCCACGGTTCTCTTCCAAACGAAGGTATACTTGTTGGCTTTCGATTCTATTTTTGTGCCGTCAATATATTCAACGTCAAGACTTATCAAACCTTTGGCTGCAAGTACTAATACGACTTGTGTGAATATATTGTTGATTTCCTTTTTCACACGATTACGAAAACGATTGATGGTAATAAAATCAGGCTGCTCATATCCTGCAAGATAGATGAAATGAATGTCACGCTTGAGAAGCGACTCTATACGACGGCAGGAATAGATATTATTCATATAGGCGTAAAGAATCACCTTGAGCATCATTTGTGGATGATAAGGCTTGCGACCACTGGGCTTATAAAGTTTGTAGACATTATCCAACATAAGATTATCCACCAAGGCGTCTAAAAGACGAACAGGGTCATCTTCTGCGATATCCTTATCGATTCTTTGAGGAAAAAGAATCATTTTCTTGTGTATGTAAGAACGAAAGTGTACCTTTGTCATAATGAGATTTTTTGTATGCCTAAAGGTACAAAAACTTTAGGAAATAGCAAAGCCCTGGCTTAAGAAAGTCGGGGCTTTGTGCAGAAAAAAGAGGATGTGTACATTTTGACACACCCTCTTACGTTTTGTTAGGATTTCTCTGTTGGTCTGTTCAGAGTACCTAAAGCGGAGAGAGGGGCTCTTTCTCTTGTTATGTCATAAAACATCACGAGATATCAACCTGCTGTTTTTTAGTGTATTGAGTTCATTTCAAGTAAAATGAAAAATCATAGAATTTTCTTGGGTATATCAGAATTTGGGTGTAATTTTGGGTGCATAATATCAATACAAAAAATTCAATATGGCTATTACCAACGCATCTAAGGAGGGTCTGATTCCATTCATTATAGGAGGAGCAGAAAGTATATGGAAAAATAGGACCGGAAAAGATCTTGTGTCTCTCTTTCAAGCATATGGTTTTAGAGATGATGTTTATGATAATGGGTTGCCAAAACTTAATGGATCTAATCTAAACACCTCTAAAACACAATACTCAAGAGCTAGGCTACTTCAACTTGATGAAGAAAGTTTAAAGAAGTTGATAGAACAACTTTTAATAGAGTCAGCAAATATCAATGAAGCAATAGAAGAGATAAATAACATTTTACTTCCGGATAGAGTAAAGATAACTAATAAAGGGGATCAGGGCTTCATGTGGGAGGGGGTAATCGTTAATGAGGATGTTCAAAACCTAGCTTTATTTCGGCAAAACGAAAAGCAGGTAATCAATGCTATTAAAGGAGCAAGAGTATCTATTTTGGTAGCAATGGCTTGGTTTACCAATGAAAAAATTAAAGAAGCTCTAGAGGAAAAACGTCTTGAGGGATTAAGAATTGAGATTGTGACTTTTAAGGATGGGGTAAATGCTCACCATGGAGTGGATTTGTCAAACTTTGATCATAAAGAGATTCGAGGAACACGTGGAGGAATAATGCACAATAAATTTTGTGTTATAGATAACTTAATTGTAATAACAGGAAGCTATAACTGGTCTACAAATGCAGAGTGTAGAAATGATGAGAATATTCTAATTACATGTGACCATAAAACAGCCACGAAGTATTCAGTGGAGTTTAGAAGTCTAAAATCTCTATTATAACATATGAATATCAAACGCAATATCATCTTTGCTCTTGAGAGTCGTATGAAAGCAGGCATTCCCATCGTGGAGAATGTCCCTATTCGCATGCGTGTGAACTTCTCCTCCCAGCGCATCGAGTTTACTACGGGTTATCGCATTGATGCTGCCAAGTGGGATGCCGACAAACAGCGGGTCAAAAATAGCTGCACCAACAAGTTCAAACAGTCGGCAGCCGAAATCAATGCTTCGCTCTTGGAGTATTACACGGAAATACAGTCAATCTTTAAGAGGTTTGAGGTAGAAGACGTAATACCAACGCCCGAACAGATAAAGGAGGCTTTCAATACTTTGCACAAACCTGCGAGCGAAGAGCCTAATCCCAAAAAGGAAGCACTGTCTTGCGATTTTTTCAGATGTTTGATGATTTCGTAGAGGACTGTGGACGTCAGAACGATTGGACAGATTCCACGTATGAGAAGTTTGCAGCCGTGAAAAATCATTTGATCAACTTTCGTGAAAGACTTACTTTTGAGTTCTTTGATGAGCGAGGCTTGAATGACTATGTTATTTATCTGCGCGATATAAAGGACCTGTACGCCAAACTTATTATAGAGGTAATGAACGCATTGACGAAGTTACACCTAAATATGCCTTGCTTGGAACACACGCTGGTCGTAGAACGTTTATCTGTAATGCGCTTGCATTGGGCATTCCACCGCAAGTAGTTATGAAATGGACTGGGCATAGCGATTACAAGGCTATGAAACCTTACATAGATATTGCGGACGACATCAAGGCGAATGCCATGAGTAAATTCAATCAATTATAACACCGAAACAATATGAGTCAAGAAATAAATAAAGCAGAGAAACAATATTTATCACTCATTGTTCGAGCCATCGGTTCTGACCTTGAGCATACGCAAGTACGCATGATAGCCTCTGCAAATGCAGATATGCTTTTCCACTATTGGAAAGTAGGGCACTTTATTCTCTACCTCCAAAAGAAAGAAGGTTGGGGCAGCAAAGTCATAGATAATCTTTTATTGCTGTCCGGTAACCCTTTTTCACATATAAAAGTTAAGGGCTAAAATCCGTATTTGGATTTCAGCCCTTTTTGTTACTTTTGTTTCTGCGATAAAACTTCAATAACATGGGCAAAAGTACACATTTTATCGGACAGCCGGTCTATAATCAGTTAATAAAATTACTCGATAAGCAACAAATCAAGCAAATCAGCCTTGAAACACCCCGAAGTGAAGCTTATGTGAAGCGTCTTGATGGGTGGACTCACCTTGTCATAATGCTTTTCGGTGTTCTCAAACACTTTGATTCTCTTCGGGAAGTGGAGATAGGCATGAAGGCTGAAGTAAACAAACTGCATCATCTTGGCATCGACTATGTCGTCCGTCGTAGTACGCTTGCTGATGCCAACAAGCGTCGCCCACAAGAGTTCTTTGCAAGCGTTTATGCGTACCTCTTAGAGCGTTATGGTTCTTTTTTATCGGACAGCCGTCCTAAAGGTGAACAGAAGACATGGGAAAAGCTTCTGTATATGATGGATTCAACTACGATTACCCTTTTTGACAACATACTCAAGGGTGTAGGCAGACACCCCAAGAGTGGTAAGAAGAAAGGCGGTATGAAGGTTCATACGGTGATGAAGTATCATGTTGGCGTTCCCATGGTCGTACAGCTTACCTCTGCCGCCACACATGATCATTATCTACTTAAAGAAGTGCATCTTCCTAAGGATGCTACCCTTACCATGGACAGAGCATACGTTGATTATGCACAGTTCCAGCGACTTACAGAGGAAGGGGTATGCTATGTGACCAAGATGAAGAAGAATCTCACCTATACGGAGTTGTCCTCAGTAACCTATGTAAGCCCTGATGGATTGGTTACACATACAGACAAGAAGATTGTCTTTGAGAAGGGGGAGATAAGACACCAGGCAAGACGAGTGGAACTGTGGAGTGACAACTCACATAAGTCAGTTGTCCTGTTGACCAATAACTTTGAGCTTGATGTAAAGGACCTTGAGGAGATTTACAAGCGAAGATGGGCTATAGAGTCCCTTTACAAGCAGCTCAAACAGAACTTCCCACTGCATTTCTTCTATGGTGACAGTGTAAATGCCATACAGATACAGACATGGGTGGTACTCATTGCCAATCTGCTTTGTACCGTTATCTCCAGAATGATAAAAAGACACGTATCCTTCTCGCAGCTGGTAACCATGCTGAGGCTCACACTGATGTATTACACTGATTTCATCTCATTTATGGAAAATCCACAAAATGATGAACTCATCATAATAGCTAAAAAGGCAAATTCACCACCAAAAGAACTTGACTTATTTGATTAGGGGGCTTGGATTTCAAAAAAGAATGCGGAATTACCTATATAAAGGCTATTCCGCAAGGATTTATATGTTATTTAAGTTTTACCGGACAGCAATATTATAGAAGTAAACCACCAACTACAGCACTATAAACAAAAGGCACGGGAACTACTGACCTCGGAAGAAGGCATCAAGCATCGAGGACGAAGGTGCATTGAACCTGAAGCTGTTTTTGGACAAACAAAATATAATAAAGCCTACAAGAGGTTTAGGCATTTTGGGAAAGACAAGGTCAATATGGACTTTGCCTTCTTCGCTATTGCCTTTAACATAGGGAAAATGTGTAGAAAAACCAATCTCAAGGAAATAAAAGCCGTTATGGAGCTACTTTTAGTCACCTTCAGATGCTGTATACAAGTTTATATAGGTTATTTAAAGCCCAATAAATCATTTTATATGAAATTAGCAGCATAGCTGCAGTAAATAACGAATTATGTGGTACACAAAGTAAAGGAAGAATAAAAAAGAGGATGTATCAATTTTGACACACCCTCACCCTTTACCGCTGGCGCAAGAAGCAGCTAGTGCGTTACCGCTACACGGAGGGTGGCGATGTGCGCTACTTCTTCAAGTCTATCGTGATAGCCACGAAATGTAACCGACTCCGCGTATCAGGTATGAGAAACGATGAGGTTCTTGGGCGGCTCAACCGTTTCAAGGACAATCTCATCATGAGTTCATGTCTTAATCCTAAAAACCGACAATTATGATAGAAAAGGAACAGATTCTTTTACTTACACAAGGAGGTTTGAATGTGTTTTCCCATTTCCTTGGCTTTGAGGTGAACCTTCATCGCAACTTCCGCAGTCCATTCTATGACGACAGGCGGGCTTCCTGCCATATCTACTACGACAGGAAAACTTTCTCTTACAAATTCTATGATCATGGAGATACCACCTATTCTGAGGATTGCTTCTGGTTTGTGGCAACCCTCCGTAATTTGAACCTGAAAACAAGTTTTCCCGAAGTATTGGAAACGATTGTACAAGAACTTGGATTGTATTCTTTATGTAATGGTGGAAAGCATAGCAGCCATATCACGTCAACATATAAAAAAACTATTGTTCCCACTCCTAAGGCTGATATAACCAAGTGTACGGAAGAACATCCATATAGTTTTGAGATACAGCCATTTGACGATGGTCTGCTGAACTATTGGGCACATTATGGCATCCATGAGGATACACTCCGTCGCTTTCGTGTACGGAGTCTTAAACGCTATGAAAGTGTATCTGCTGAAGGCAGGAAGTTTGAACTTTATAGCTCACCTACGGAACCTATGTTTGCCTATATCGGAAACGGCTATGTAAAGATATACCGACCTCACAGTCCAAAAATCCGCTTTCTTTATGGTGGACGGATGCCTGCCACGTATTGCTTCGGAATGGAGCAGATTCCTGCCAAAGGAGATATGCTTTTCATTACAGGTGGAGAAAAGGATGTACTCTCATTGTATGCACACGGCTTCAATGCGATTTGCTTCAACAGTGAAACGGCACAGATACCGACAAGTATCATTGAGAGCCTTCAGCTTCGTTTTAGGCATATAATACTCTTATATGATGCGGATGAAACAGGTGTACGGGAGGCACATAAACAGTCTGAACATCTGGTGGAATACAAGGTCTTGAACCTTTCACTTCCGCTAAGTGGTACGAAGTCTGAAAAAGACATTTCTGATTTCTTTGCCTTAGGCAACGGGGCAAAGGAACTGAAAGAGCTGCTTGCCAAGATGTTCTCAGATCTATATAGCCAAACCATGATGATGTTACGTTCCTGTGAGATTGATTATGAGAATCCACCGGACATTTCCAAATCAGTAGTAGCAGTAAACGGTGTGCCACTCGGCACACAGGATAACCTGTTCTGCATTACTGGAGGTGAGGGGACAGGCAAGAGCAACTATGTGGGTGCCATCCTTGCCGGAGCGTTGGGAGAAAAACGATTGCCGATAGAGAAGACCCTGGGATTAGAGATTACCCCCAATCCCAAAGGCTTGGCGGTCCTACACTATGACACGGAACAGTCCGAGGCACAGTTGCACAAGAACTTGGGTAAGACACTGCGTAGGGCTTCTTTGACGGCAGTACCGGAGTTTTGCCATTCTCTGTACCTTGCCTCTCTGTCTCGTAAGGACAGACTGAACCTTATCCGTGAGAGTATGGACTTGTTCCATCACAGGCATGGAGGCATCCACCTTGTGGTGATTGACGGAATAGCCGACTTAATACGTTCTGCCAACGATGAAACGGAAAGTATTGCCATTGTGGACGAGCTTTATCGCTTGGCGGGGATTTATAATACCTGTATCATATGCGTGCTACACTTCGTACCGAATGGTATTAAACTCCGTGGGCATATCGGCTCAGAATTGCAGCGCAAAGCAGCTGGAATTCTTTCCATAGAGAAAGATGATAATCCCGAATACTCGGTCGTAAAAGCATTGAAAGTCCGTGACGGAAGTCCGTTGGACGTACCGATGATGCTTTTCGGCTGGGATAAGGCAGAGGACATGCACGTCTATCGTGGCGAGAAGTCTAAAGAGGACAAGGAAAAGCGCAAGACTGAAGAACTCATTGCCGTTGTCAAAGAAGCCTTCCGAAATTCTTTCAAGCTCACTTACCAAGAACTTTGTGAGGTTCTGATGCGCGAAATGGAAATCAAGGACAGAACTGCAAAGAAATACATCGCCTATATGAAAGAACAACGTATCTTGGCACAAGATACCAATGGTAACTATCAAAAAGGAGAACTATGCCGTACATAGATTATAAAACCGAAGACACTTGGCAAAAACGCCTGTTCGACAAGCTGATAAGCGTCGAGGATAAACTCGACCGCCTGCTTGTCCTGCAGGATCAATCTGTTGACACGACCGTCCATCCTCCCTTGAAACCCGAATACTTGGATATCATTGATGTATCCAAGATACTCAAAGTGGAGCAAAAGACCATCTACAACTGGGTTTGGGCAGGAAAAATTCCCTATCTTAAAGCCAATGGCAGGTTGCTTTTCCTTCGGGAAGAGATAGATGAAATGGTACGGAAGCGAGATGGTTGGTAATTGATTTTCTGATTAGATTATTTTTGTTGTGTAAATGCAAGTGTTTACACAACAAAATAATTACCTTCGCAAGTAAAAAGTTGGTTGCGGATTTACCGCACACAACGATAACAGTACTCGCTTATTTCTTTGATAAGGAATTTACCATATATTGTTTGAAACTATTGAACCGATAAAGATAGCGTTGCGGACAAAAAATATTTGTAATTTCTACATTTGATGCTACTTTTAATGGAAAATTCATCGCTTGCAATGATTTTTTTCGGCATTTTCCTTGCGCATTCAAAAAAAAGCAGTAATTTTGCAGTCAATATGACCTCCCACGCTTCTCATCAGAACAGCGCACCCGGGGAGGTCTTCGAGTTTATATACGGTTATGAGGTACGAAAAGAAGCCCATAGACACATCTGAGCAGGTTAAAAAGCTATGCGATCGAGGACTTAATATTGGAGACGAAAAACTTGCGTCCCAATATCTTTATAACATAAGTTATTATAGATTACGAGCATACACATACCCTTTTCAAAATAATGGGAAAGGTGCTAATCACGAATTTCTGAGAAAGGACATTTCTTTTCAAGATGTGATAGACCTTTATTGTTTTGATAGGCGTTTACGCTCTTTAATCTTCAATGCCATTGAAAAAATCGAGGTGGCATTAAGAACAAGAATCGCCTTAACTTATTCCGTTGATGAAAATGACGCCTTTTGGTTTCTCAATCATAAGTTATACTTTCACCACGATAAGTTTATCGCATTGACACACCCTGTTGTAGGGGATTTAATGAAAGAGGTCAAGCGAAGTAATGAGGATTTTATAGCTCATTATTATCAAAAATATAGCGAACCTGCTTTTCCGCCAGCGTGGATGACCTTAGAGGTTGTTTCTATGGGAACATTGAGTAAGTTGTTTTTTGCTTTGGATAAGAATAATCCTTCTAGTAAAACAATTTGCAGGGATTTGGGACTATACAATGTCGATATTCTGAAGAATTGGATGCATGGTTTATCGTCTTTACGCAATACTTGTGCCCATCATAGTCGTGTATGGAATCGCCGCTTTACAATAGGCTTGAAGTTTCCTTATAGAACGAATTATCCTTTCTTGTCAAAGCAAGAAGCTGCAACGATAAGGGACAATAAGTTGTTTGCCTACTTATCAGTAATTTTGTATTTGCAGCAAATAATCAGTCCTGATAGCTCATTTAGAAAGAGTTTGCTCACCCTATTGGACAAATCTCCCAAATTGGTAGTTTTGAAAGACATGGGCTTTCCTGAAAAATGGAGAGAATACTCTTTGTGGAAATAATATAATACGATGGTGTATCAATTATGACACACCATCGCTATTTATCTCAGTCTGGTTTTTACAATATATCCCTTACTTTCTTCTTGTTTGTTTCTATTATTCATAGAACTTGAATAAATTACCCCCTCCTCTTCCTTTTTAGTATTTGGACTATTACTACTATTTTGTTTTAAATACTCGGATTGGTCATTTTCGTTTGATTTTTCATTTTCTTCATTTGGTTCGTTCAATGTAAGTGCAATTTTTCTGTCAAGTTCTGCTGCCTCGCCTTTGAGTGAGCGAAGCTCGTCCTCTTTTTTCCAAGAACTGTTGGCAATGTTGGTGTAAACGTCTTTATTTGCCACAACTTTTGCCATTTCCTTCTCATGCGACTCTATTACCTTTGGAATACGCTCCAAGGCATTAACGAAGTTCTGACACGCAAGTTTCGGGTCTGTTGCCAACTTACCGTTATTATAGGTATAGTAGATACTCTCCTGCCCTTTCACAAAGAAGCGGTTCACCGAACAGTCAAACAAGTCTTTTGAAGTGCTCTCCGTCTTGACCATTATGGAAAAACCGTATATCTCACCGATTTTGTTGTACTCGCCCTTGGTTCGTGCCTTCTCGTCTATTTCTTGTAAACGTGCAGCAATAGCCTTGATATCGGTGCTGTCTTCCACACCTTTGATTGTCAGTTTATTTATAGGTGTACCTTCATCATCACGCTCCACACGCTTCTCAAAGAGAGCTAAGTCAGACTGTGCCTCCTTGATTTTATCCGTATGGAAGGATACAGAACTTTCAATCTCTGCCAACTTGCCCGTTGCGGCATCACGCTCACGGAGAAAGTTCTTGCGTTCTGATTCCAAGGTGGTAATCTTCTTATCCAATCTTGCTTTCTCTAAAAGCTCGGTATTACCTGAAAGCACTGCTACGTATTCTGAGAAGTTCATGCCGCTGTCCTCATCCATCGAACCCTCATCGATGGTACGACTGCCAAGCGTGTTTGTCTTCAGCTGATTGATGAAAAGCTGCTTATTATGCAGCAGGTTAAACTTGTAGCTATCCAAGGAACGTTCCACGGCATAGATAATCACATCAACCTTGTTGTCCGCAAATTCCTTGGCGATAAGATTTCCCTTGCGCACAGCTCGTCCGTTGCGCTGCTCCAAGTCCGAAGGTCGCCAAGGCGTGTCCAAATGATGGACCGCAACGGCACGCTGCTGTGCGTTTACGCCTGTTCCCAACATAGAGGTAGAACCGAAGATAATGCGAATGTCACCACGATTCATCGCATCTACCATCGCTTTCTTTGCCTTCTCGTTTTTACATTCCTGAATAAAGCGTATCTCGTAGGAAGGGATATGATAGTCCTCTACCAACTTACGCTTTATCTCAGAATAGACGTTCCATTCGCCAGGCTTGTAAGTCCCCAAATCAGAGAAAACAAACTGCGTTCCTTTTTGTGTATCGTACTTTTGATAATAGTCATTGAGCATCTTGGCACAGTGACTTGCCTTGTTGTCGATATGATCTGAGTACCCTTTTTCATCAATCATGCGTAAATCTAAGCTCATCTTGCGGGCATAATCAGTAGTTAAGAATCAAGGGAAACAGAGGGAAATACAGAGAATGTAACTATTTGAAATAGTACCATTTAGCATTTTCTTGCTGTTTTAAGGGTAAGCAAGTACGAGCATCAAACGGCAGGAGTTCCGTTACCAAATCGTAACCCATCAGGGAAAAAGCAAAAAGGGGTTACGAATTGAATGTAAACAACTGTGTCATAGGTTTTTATTCTTCATCTTTCATTTTTCTGCATCGCTCAGGAATACTTGTTCATCTGTACCTTTGCAAGCAAAGGAAATTTAGAAAAAACGACAGAAAAATGAAAGAAAACAAACTTAAAGTATCGTTCTTCGTTCAGGCGAAACGAACCGACAAGAAAGGACTTGTGCCTGTCATTGGGCGAATCTCCGTTGGCAGAACCCATTCGGGCTTCTCCACCAAGTGTAAGACTCCGCTCGCTCTTTGGGACAGCCGTAAGCAACGGCTCATCGGTAAGAGCAGCATGGCTGTGTCCGTCAATCAGAAACTCGGCGAATGCACCGCACTCATCCACGCACGCTTTCATGAACTCTGTGAAAGAGAAGAATCTTTTACCGCCACAGACGTGAGGGATGCCTATCAGGGGCAAATCCACTGTCAAGCCTTGCTCTTGGAGAGTTTCGGGGAGTATCTCACACAGACAAAGGAGCGCATAGGCATTGATAGAGCTTTAAAGACATTCAAACTCCGTACCTACCAGCTATCCCTGCTTCGTGAGTATGTGCAGAAGAAGCACAAGGTAAGCGACATTCCTCTTTCACAGCTGGACAAGTCCTTTATCGAGGGCTTCGAGTATTATCTCACCATCGACCGCAGACTGAAACGCAGCAGCATATCCAGTACCTTATCCACCTTGCAGACCATCGTCCGCATAGCGGTAAAGAAAGGTGCGCTGGACTTCTATCCGTTCTTGGGCTACAGTTACGAGCGACCAAAAGGCGAACCGAGAAGCATTACGCAGGAAGAACTTGAGCGCATCATCGAGTTGAAGATTGAATGGAAGAACTATCGTATTGTTCGTGATTTGTTCGTCTTCTCCTGCTTTTCAGGGCTTGCCATCTCCGATGTCCGCAATCTCAGAGAGGAAAACATCGTCCTTGAAGAGGGTGAACTCTGCATCAAGGGCAGGCGAATGAAAACCAAAACTCCGTATCGTGTACAGGTGCTTCCCCCTGCGCAGACTATCATGAATCGTTATAGAGGGATAAGGGCAGGCTTTGTCTTTGACGTTCCAACCACCGACGTTATCCTCAATGGCATGCACTATATACAGCGAAATATCGGTATGAAAACTCCGCTGACCTTTCACATGGCAAGACACACCTTTGCATCGCTTATCACGCTTTCGGCAGGTGTACCTATTGAAACGGTGAGCCGTATGCTCGGACACACCAACCTGAGAACAACACAGGTATATGCAGCGGTTTCCTCCGAAAGAATACATAGGGATATGCAGATAGTGCAGCAGCGAATACAAGATACATTCACCCTAAAACTTTGA
Protein sequences of DBSCAN-SWA_3 >CP022041|1279773:1354453|1289813_1290629_+|ASE18830.1|DBSCAN-SWA MKKMFFPLFGFALLFLTACYNQVSTGDHSAIDVEVQMKGDSTRYGLACDGCSDSIIVLLPNEGGDPVKFDIVTAKRNGMVYGTPEIGDELAIVPNPIDPYEAEMVINLEQMKGTWTFQVLPKLKPNPTKTEEEILAGMSDSMKAALFTPREYGFTLKSYNQASPVGYVMKANSLEDESPVEYPKVTVYTSWHIFNGRLYIYKDTLDEQGHRIPQDSVGFDSGSMRYLSEDSMAALFGKKVMQYHRKKNALEANKEAQKAEEKNAIIQSFKK >CP022041|1279773:1354453|1312540_1313494_+|ASE18844.1|DBSCAN-SWA MNTKHIILMACAALLGATQANAQGQDLSILTANTDARTAAMGNASAAAEGMYLYNNPAAFFATDKKFTADASASLFEKAEGADGTFGIYALSAGYKLAKRHAVFVGFRYAGGLSLKGSDLLGNPTKDYKPYNWTLDLGYTYFLGKGFATYATGSLIYSHLSKNATGAVFSVGGAYQNNELTLANKPANLMLDAKVGAIGPQLDYGNKHKTTLPTYLAVGGALSVEVAEKHQVAAALSSRYFFQPSEAKLFMLGGGLEYTYNKMVSVRAGYEYGDHDLSHVTMGAGFKYHGLRLNGAYNLKTADTGSSYCTIGIGYDF >CP022041|1279773:1354453|1293001_1293547_+|ASE18832.1|DBSCAN-SWA MMFRKRERRKVPILNTTSTADISFMLLIFFLVASSMDLDKGLSRQLPAIDKTKTPPAAVDSRKVMRIVIDAKNQVTLDGKAVTMKELLQRATQLIQTNGKGHLIQLQSSRNASYDTYIHVQNQLVAAYNTLRNQRALNLFGKEFELCSNEQQKQIADELPMRISEVYTVSKADNTEKGGTE >CP022041|1279773:1354453|1343804_1344602_+|ASE18862.1|DBSCAN-SWA MAITNASKEGLIPFIIGGAESIWKNRTGKDLVSLFQAYGFRDDVYDNGLPKLNGSNLNTSKTQYSRARLLQLDEESLKKLIEQLLIESANINEAIEEINNILLPDRVKITNKGDQGFMWEGVIVNEDVQNLALFRQNEKQVINAIKGARVSILVAMAWFTNEKIKEALEEKRLEGLRIEIVTFKDGVNAHHGVDLSNFDHKEIRGTRGGIMHNKFCVIDNLIVITGSYNWSTNAECRNDENILITCDHKTATKYSVEFRSLKSLL >CP022041|1279773:1354453|1333136_1333403_+|ASE18854.1|DBSCAN-SWA MRQAISIIVLYALINILSLLFNIISFSTDGKIDIGFPFIFIHLTSAKYDPTFPYNKLLIEPFLWDIIFLFTIVCLYLCCLKFIKRIRD >CP022041|1279773:1354453|1350429_1351362_+|ASE18867.1|DBSCAN-SWA MRYEKKPIDTSEQVKKLCDRGLNIGDEKLASQYLYNISYYRLRAYTYPFQNNGKGANHEFLRKDISFQDVIDLYCFDRRLRSLIFNAIEKIEVALRTRIALTYSVDENDAFWFLNHKLYFHHDKFIALTHPVVGDLMKEVKRSNEDFIAHYYQKYSEPAFPPAWMTLEVVSMGTLSKLFFALDKNNPSSKTICRDLGLYNVDILKNWMHGLSSLRNTCAHHSRVWNRRFTIGLKFPYRTNYPFLSKQEAATIRDNKLFAYLSVILYLQQIISPDSSFRKSLLTLLDKSPKLVVLKDMGFPEKWREYSLWK >CP022041|1279773:1354453|1325087_1326713_-|ASE18940.1|DBSCAN-SWA MKTKAYKYIVGVLALSLFTACDFQKVNTNEFELLPEEGLMDGISIGGPITAMQKCVFPVGTQADGTSVANRYQTAYNLAADCWSGYFGQNNNWGGPNNLNYFLKDGWVASSYTESYSTVVPLWQDLKGKTETQFPEVFALAQILKISAWHKATDMFGPIPYKEAGKGLITVPYDSQEEVYKAMFKELSDAIEVLTKYADNGNSKLLPNADAVYAGDVHKWVVYANSLMLRLAMRVYYADAALSKKYALQAVNHSYGVMKTKDDEAKMERGASLEFKNNLDVLINQYNECRMGSSMLAYLGGYQDPRLPKYFNTSTVSQAVTVGTYGKYSGVPTGHDVSSNDAFRDSSRPAITSTTPTYWMRASEVYFLLAEAALHGFAVGGTAESLYEKGIEMSFEENGIASSEVADYMSSGLKPSAYSFHLTNPGVNVDVPAVTEATTAWSGTDEEKLEKIMIQKWIALYPNGQEAWTEYRRTGYPKLHSVVTNYSNGEIDSEVGIRRMRFPTNKSTSAEDIANLESARKLLRGGLDKAGTRLWWDNKNH >CP022041|1279773:1354453|1322765_1323911_-|ASE18849.1|DBSCAN-SWA MNKHIFKHIAVAFAACCVVSCQDSESDLLKQKVYFDSNLYKVEMPDSGSTLGVDITSRLSNKQDGAVDVSYSLADSSLVALYNSKYGTDYVALKHQNVTFSKASSTIAAGSIYADKVSLTLNNLDQLAEGKNYMLPIKLHSSSTPVIDGEDVEYIILAKPVKITKAGDFYNKYISVKFPAGTYFKSFTYEALVHSIWWGSNCTIMGSEGLMIFRVGDVGGGISSGILQAAGRQHYEAPEKLSINKWYHVALTYDQATGKTVMYLNGTKWAESAWNISGFDPNADVGFNIGKIPGFPWGERPFYGYMSEVRVWSVARSENQIKQNMLSVDPKSDGLELYFKLNGSETVNGNKIKDSAKGIECTTGGLEFTTLAQPLTMKDLQ >CP022041|1279773:1354453|1351403_1352921_-|ASE18868.1|DBSCAN-SWA MSLDLRMIDEKGYSDHIDNKASHCAKMLNDYYQKYDTQKGTQFVFSDLGTYKPGEWNVYSEIKRKLVEDYHIPSYEIRFIQECKNEKAKKAMVDAMNRGDIRIIFGSTSMLGTGVNAQQRAVAVHHLDTPWRPSDLEQRNGRAVRKGNLIAKEFADNKVDVIIYAVERSLDSYKFNLLHNKQLFINQLKTNTLGSRTIDEGSMDEDSGMNFSEYVAVLSGNTELLEKARLDKKITTLESERKNFLRERDAATGKLAEIESSVSFHTDKIKEAQSDLALFEKRVERDDEGTPINKLTIKGVEDSTDIKAIAARLQEIDEKARTKGEYNKIGEIYGFSIMVKTESTSKDLFDCSVNRFFVKGQESIYYTYNNGKLATDPKLACQNFVNALERIPKVIESHEKEMAKVVANKDVYTNIANSSWKKEDELRSLKGEAAELDRKIALTLNEPNEENEKSNENDQSEYLKQNSSNSPNTKKEEEGVIYSSSMNNRNKQEESKGYIVKTRLR >CP022041|1279773:1354453|1338541_1339069_+|ASE18858.1|DBSCAN-SWA MLEYKRYQILNLNFINELDTVILDSKLKYISDFIFRQVQGKSVFLFEMKPTKTSILYDIDPESKYHSINSRKIIYTKNLNLSKKVISMIVEDESFYLGHLLIIIGTFSKSKISNISNMWENLLETGIEYFKMGNDGESFYWCNPQINSAEEEFKLFINELNTFANGIKHINNENG >CP022041|1279773:1354453|1335636_1335981_+|ASE18857.1|DBSCAN-SWA MKKNLLFSFDDEITTGVYYDIDGHKIVVYFAAYYDNGRFVEKKCQLIIEKWEYAKSKLSVDNRYKDLEDHIGIISMILDMHIADKKLFLTVNTLDGQYVDLLFYNCNVKIEDIS >CP022041|1279773:1354453|1330183_1331287_-|ASE18852.1|DBSCAN-SWA MTLYPKLITDALEKVIYPGTKKNIIESEMLADTPSINGNKVSFTLIFPRETDPFLKSTIKAAEAQIHYSVGKEVEVTITTEFKNAPRPEVGKLLPQVKNIIAVSSGKGGVGKSTVSANLAIALARLGYKVGLLDTDIFGPSMPKMFGVEDARPYGVEKDGRQLIEPVEKYGVKLLSIGFFVNPDTATLWRGSMATSALKQLIADADWGELDYFILDTPPGTSDIHLTLMQTLAITGAVIVSTPQNVALADARKGIDMYRNDKVNIPILGLVENMAWFTPAELPENKYYIFGKDGCKNLAKELGCPLLAQIPIVQSICENGDNGTPAASQVDTITGQSFLSLAQSVVTVVNRRNKEQAPTKIVDVKNG >CP022041|1279773:1354453|1279773_1280643_-|ASE18823.1|transposase|DBSCAN-SWA MDHAQDWLLFEDNIGESLSIDETCLSCGEVYTFLTNKAGKGREGTLVAVVKGTKAEDVIQILKKINLSKRKTVKEITLDLSSSMMRIARAVFPKALITNDRFHVQKLYYDALDDMRIAYRWMARDKENEEIKEAKCKGKEYIPFRYSNGDTRKQLLARAKFILTKHKTKWTETQKGRAEIIFEHYPTLKKAYDLAMKLTDIYNIKSIKDAARLKLAKWFNEVEELGVDNFYTVIDTFENHYQTILNFFVNRATNANAESFNAKVKAFRAQFRGVTDIPFFLYRLMKLCA >CP022041|1279773:1354453|1341802_1343464_-|ASE18861.1|transposase|DBSCAN-SWA MTKVHFRSYIHKKMILFPQRIDKDIAEDDPVRLLDALVDNLMLDNVYKLYKPSGRKPYHPQMMLKVILYAYMNNIYSCRRIESLLKRDIHFIYLAGYEQPDFITINRFRNRVKKEINNIFTQVVLVLAAKGLISLDVEYIDGTKIESKANKYTFVWKRTVEKNRAKLQEQIRTLLLQVDDVIAQDNAAKTEGVEFTAALLDEISEELNKSLESAPEPKTKEDKQAVRTKKKQLKELEKKRNKLQEYDQHLEIMGERNSYSKTDPDATFMHMKEDAMRNGQTKPGYNLQIATENQFITDFALYANRTDTLTLPSFLESFKSRYHRYTKTVVADSGYGSEENYLFMDVHNMEAYVKYNYFHKEQRPRYTPNPFSPASLYYNKEQDFYVCPMGQHMKRIGMKRSLTSNGFVTYSVRYQAERCDGCPLRGSCFKARGNRIIEVNHQLQHYKQKARELLTSEEGIKHRGRRCIEPEAVFGQTKYNKAYKRFRHFGKDKVNMDFAFFAIAFNIGKMCRKTNLKELKAVMELLLVTFRYFIQVYIGYLKPNKSFYMKLAA >CP022041|1279773:1354453|1313538_1315113_+|ASE18845.1|DBSCAN-SWA MKRTLFSICALVLSLTASAQIIKDTPKGKLIENLYRSSKSWVKKGWTGVQPGRYEGLVSKIVIGEDGCIYIYNPLSGLDSKSWLKLERQPDGKYRAKLPQDILTDDYGGDDDEEESSERTISLTRLVSSDDGKNYEPIGANNYVDFTVEGRTLKMSGMGQKKQIWGASFNNKWERNYGGDWALTIEPLKEQLITPPATATKSQYTVTSKSDPSPRIVEVMTNNNDIYVKGLFKAEKLANVWVKLTKQGDKAVMPTNQYLGITKKTDFKKYDSDKSEYHTFAAAFESETKAAENLEFSIDATGKLTASKILRTSLGRASNDNITGEDYIESYEGLTLTPYVQKEVGAPATPEYFYLTSTPNYDNTSNEIKLAFYVKNADINGNVLDPEKMYYNVYINGSTEPFKFKKSASQYNDMHEEEMTNIPFNYKDKRNYDFKVIDNLRILHFYDSSITRLKVVMVYESDGKKYSSEPMVASLTTDGIESANFNKTTTEKYYTVDGRQIQKLQKGLNIIKSSDGTTRKVVVK >CP022041|1279773:1354453|1331417_1331795_-|ASE18853.1|DBSCAN-SWA MGRNKFAEDEIKEIAKLLRLKNAGNRAKQKLVRHDLRTIYEFNISDFNEPGKAFGEEELQGAIQRGAIQILDDATIADMKAKRARDKARDEATREQQAIEAGEMTDWKAVAKEWEEWENSQNNGN >CP022041|1279773:1354453|1306499_1308119_+|ASE18841.1|DBSCAN-SWA MELKKTLYSLMLGGILLCLFSCAKEDEFEMPTLVLSENSIAFDKGVSERNISVTTNQNSWIASSPQEGDWLSLVQDGNVLKVKVTENKIGTERTSYVLVNANGASGKIAVTQSAADVTLDVVPNAIYLPQTGGEKTIDITTNSSVYDVTTSEEVSWLKIVKSEEEIKLIAERNDTYQKREVKLYAKSGSVIREIVVSQSGIQRYLLPIHPGVPQDEHKIMDFELGRGSYLREYQAAMPAYGLEETYTFITASPIFTLIQYCSTDGINPSQIIMIGDGRKAIDAVKDKAFDKFLTDNGYVRSNSQSDREYTNDKDLLSLKVYISEKENNEGVNLTFTPIMKQNGEYKTFSKLPFYPLELLQKDNVKLAQIEQYEQKAGSTEEERSLNEHKNTEVSQIQYKLKASTDPSAAYGRIHIFYTTDKDGDAPDNLGSVQIGALLFKDTNLGVWKYGTKWVATKEIKKVLGDEGFSFLRTSGNNHFFVRESDHLVIDVTCVLDNAMPVLALLYSYDPSVSGASSKAIKTQAKMIRNFAAAKKALKF >CP022041|1279773:1354453|1335329_1335629_+|ASE18856.1|DBSCAN-SWA MKIKNILAVFIIVGGISYALYFSLVTDVLLKYGETIHTKAIIEERLTGKTSDPVLRYRFLYENQAYIGFVSETSRLHVSDTINIVFLKSRPSINKPLIK >CP022041|1279773:1354453|1304752_1305565_+|ASE18840.1|DBSCAN-SWA MRKFCLKIHRWFALPLGVIMAILCFSGLAILLIKDLAPLFDMNAKEMPIYTIVVRLHRWLFMKPENAHEGGQSLGRILTAVSAICMSIVLLSGVVIWWPKTKKALKSRLTVSTNKGFRRFVYDSHVSLGIYVFIFLFLMALTGPVFSFGWYRAGMSKLFGQPMPPKEMKMQQPKDGMKQGGTNDKAFAPTDASQMKGQPQAHKEGAKDMKGDQHGKKPKGGKLFKQLHTGTWGGWFSRVLYAIAALIGGFLPISGYYLWWKRRSAKKKKA >CP022041|1279773:1354453|1326719_1329830_-|ASE18851.1|DBSCAN-SWA MRMIHETKQKHLYFSVAFALSIALAPTSVYAVGNPVGSPDASMPQAVQQNGNHKVTGRVVDSAGEPLIGATIMVEGTKEGAVTDIDGNFTINTTSKAKLVISYVGYTTQTIPVGDKTTIDVTLKEVANTMNEVVVTALGIKRAEKALSYNVQSVGSNELTRNKDANFVNSLNGKVAGVSISKSASGVGGATRVIMRGAKSIEGDNNVLYVIDGIPIFNFSGGRDSGIMGEGRVSSEGIADLNPEDIESISVLAGPSAAALYGSNAANGAILITTKKGKEGRVDISFSSSADFSSPLLMPKFQNTYGNKLGSYESWGEKLATPSSYDPKKDFFRTGTNFINALTLNMGNEFNQTFASVATTNSRGIVPNNTYDRYNFTIRNTTRMFKNRVQLDLGASYIKQKDNNMVSQGEYWNPIVAAYLFPRGESFEGIKTFERYDNVRNFPTQYWPISDSRFANQNPYWTAYRNLAPDDKDRFMFNAGLTYNIFDWLSVAGRIRLDKTFITSERKIYASSFNYFAKEKGAYDYYDYKDHQTYIDAIANINKTFGKFSLAANVGYSYSDYASLTRGYGGNLVLVPNKFSLNNINPTDSKIREAGGDSKVRNVAAFASAELGWRSMVYLTLTGRNDWNSRLVNSSEESFFYPSVGLSGIISEMTKLPSFISYLKVRGSYTEVGSPVSRSGMTPGTITTPIVGGSLKSTDIYPFTDYKAERTKSYEFGLTARFWKKLSFDFTWYKSNTYNQTFIGELPESSGYKAVYLQAGNVENRGVEMALGYSDNFGGLQWNSSLVYSKNVNEIKEMVKDYHHPLSPKPINIPEVSKDNGRVLLKVGGSINDIYARKVLAKDNQGFVNVSPSGGMNLETVEPIYLGKTTPDFTMGWNNNFTYKNFGLSFLINARVGGIVTSSTQALLDRFGVSKASADARDAGGVMIPNQGLYDAKKYYTLVATGENDLAGYYTYSATNVRLQELTLSYKFNSKLFNNVIKDLTLSFVATNPWMIYCKAPFDPELTASTGTYGQGNDYFMQPSLKSYGFSVKFKF >CP022041|1279773:1354453|1287270_1287804_-|ASE18828.1|DBSCAN-SWA MDYQVIINKYYPEDNELRHILLVHSRAVADKALAIADRHPELSLDRQFIEEAAMLHDIGIVRCNAPGIQCFGTEPYICHGRIGAEMLRAEGFPRHARVCERHTGAGITRSQIIAQKLPLPQQDFLPETMEEKVICYADKFFSKSHLDEEKTIEQAIASLSKFGEEGVARFREWVKMF >CP022041|1279773:1354453|1299239_1301171_-|ASE18837.1|DBSCAN-SWA MDKRTITGFVLIALILFGFAWWQQPSAEQVAQQRAEFVKDSIASAKKAQTAKLAAEKQAQQKAAQATDTTALFYAALNGKAQDIILKNSKVELTLSTKGGVVKKAVIKNYIGHNIAVKDGSQDQKNVTLFSGDDQSLNFMLAAKNSNIETKDLIFTPSNVTDSTVTLTAVAGEGKTLTLNYTLGKDYLLNMSLQAEGMGGLFAPNYNQIDINWQERCKQQERGFTFENRYATLTYKKHDGGTDYLSETSEKEETTEDSMDWVAFKNQFFSAVMIAKDNFATGAKLKSTPLEKSSHYLKHYEANMKAGFDPTGKRPSEFEFYFGPNDFRLLQSVETESKFAKELDMERLVYLGWPLFRIINRWFTLYVFDWLSKVFPMGVVLILITLLLKLITFPMVKKSYMSSAKMRVLKPKLDEATKQFNKPEDQMQKQQAMMQKYSEYGVSPLSGCLPMLIQMPIWIAMFNFVPNAIQLRGQSFLWMHDLSTFDPIFSWSHDVWLVGDHISLTCILFCGANLLYTWFTMQQQKDQMVGQQADQMKMMQWMMFGMPLFFFFMFNDYSSGLNFYYFISLFFSAAIMWALRKTTNEEKLLAILEARREERKNNPKNNMGSGLFARMQALQELQKQQQEELRRKQDELNKKKKGL >CP022041|1279773:1354453|1349744_1350053_+|ASE18866.1|DBSCAN-SWA MPYIDYKTEDTWQKRLFDKLISVEDKLDRLLVLQDQSVDTTVHPPLKPEYLDIIDVSKILKVEQKTIYNWVWAGKIPYLKANGRLLFLREEIDEMVRKRDGW >CP022041|1279773:1354453|1323951_1325070_-|ASE18850.1|DBSCAN-SWA MKNFKYISSLVLLCAGVLFCGCSKMTEVENEPYDHIGGYNTMNNAESEKYYADLRAYKQQAVNYGRPVAFGWYSNWSPAGTYRRGYLTSMPDSMDVVSMWSGAPNRFNITPEQKKDKEFVQKVKGTKLLEVTLLSYIGKGRTPDSVYTAVEKQAEKEGWANDQARVEEAKKKARWKFWGYEGVAGSDNHKEALARFAKALCDSLVANDWDGYDIDWEIGSGVFDMDGTLSTNADLVYLVKEMNKYIGPKSDPEHKGHRLICIDGHFGGLTEDLDGYVDYWIDQAYGRTTHFDYYGVDPKTIITTDNFESSFKSGGQLLRQAKSMPSKGYKGGVGAYRFDNDYDNTPNYKWMRQAIQINQQVFKERMGQTTQP >CP022041|1279773:1354453|1336116_1336590_+|ASE18941.1|DBSCAN-SWA MAKDFNIYTYPTFFSLLLIMLIPFALSVDDIMNNHQSNMKIILWISVFIIIILVYGLIKTRKRKIVINELGLLITEKEGNIKIPWEDISHIFFCSTPIWGLYVKITFTNRRNPLIIDFGKSSLWSVNFYRFRIAILAFSHKKDIIVVKSNQWYLKLI >CP022041|1279773:1354453|1341252_1341597_-|ASE18860.1|transposase|DBSCAN-SWA MKNHQLLRCIFPDVLADYFDVVDIQESVSQIDFWLDERNFMEKSDHKLGTVSSYGFTSERVIQDFPLRGKAVYLHVRRRKWRDSSNGEIFTYSYDDLTAEGSRLSPEFVSFLKE >CP022041|1279773:1354453|1286245_1287136_-|ASE18827.1|DBSCAN-SWA MIRKNPYTEEKYKHYKVTAPAPLLEWLLANLNESKSKVKATLQNRGIKVNGKCVTQFDHPLKAGDKISVSKSKKNDLFRSRYVKIVYEDRFLVVIEKNIGILSMAAGHSSLNVKTVLDDYFRKTKQKCTAHVVHRLDRDTSGLMIYAKDIQTEQLLEHDWHNIVYDRRYVAVVSGEMEHDEGTIANWLKDNKSYVTYSSPVDNGGKYAVTHFHVLDRTVAHSLVEYQLETGRKNQIRVHSADMGHPVCGDIKYGNGDDPLHRLCLHAYVLCFHHPVTHQRMEFETPVPQVFRHLFK >CP022041|1279773:1354453|1303578_1303824_+|ASE18939.1|DBSCAN-SWA MELDTTNMCSHLQKKLFNEEGVYYPIWQAMQNDDEITAVIRSRQLHIYRNGKKVLVLPGKAAPKIIREDSLNELLPKDLLK >CP022041|1279773:1354453|1287893_1289672_-|ASE18829.1|DBSCAN-SWA MGKKIEFSLIYRDMWQSSGKFQPRKDELVRIAPVFVEMGCFARVETNGGAFEQVNLLAGENPNESVRAYTKILHEAGIKTHMLDRGLNALRMYPVPDDVRALMYRVKHAQGVDITRLFDGLNDIRNIAPALKWAKEAGMTPQGTLCITTSPVHTIEYYCKLADEEIAAGAEELCLKDMAGIGQPAFLGELTRRIKEKHPDIILEYHGHSGPGLSMASMLEVAKNGIDILDVAIEPLSWGKVHPDVISVQSMLKNAGFDVPEINMDAYMKARAMTQEFIDEWLGYFINPQNKIMSSLLLECGLPGGMMGSMMADLGGIRSTINNLRKKKGEPELSVDDMLVKLFDEVAYVWPRVGYPPLVTPFSQYTKNIALMNLLTLEQGKGRFVMMDDSMWGMILGKSGRVPGEICQELKDLAKQKGLEFTDADPHTLLANALDDFRKEMDENGWDYGQDDEELFELAMHPEQYRNYKSGQAKKNFLADLQAAKDAKLGAKVSPEEAAAFKHAKADAIVSPVKGQLFWEFQGDGEAAPAIEPFIGKEYKEGDVFCYVQAPWGEFVTVPAALGGKLVEINAKQGAKVDKGDVIAYIERAHEE >CP022041|1279773:1354453|1345736_1346954_+|ASE18863.1|transposase|DBSCAN-SWA MGKSTHFIGQPVYNQLIKLLDKQQIKQISLETPRSEAYVKRLDGWTHLVIMLFGVLKHFDSLREVEIGMKAEVNKLHHLGIDYVVRRSTLADANKRRPQEFFASVYAYLLERYGSFLSDSRPKGEQKTWEKLLYMMDSTTITLFDNILKGVGRHPKSGKKKGGMKVHTVMKYHVGVPMVVQLTSAATHDHYLLKEVHLPKDATLTMDRAYVDYAQFQRLTEEGVCYVTKMKKNLTYTELSSVTYVSPDGLVTHTDKKIVFEKGEIRHQARRVELWSDNSHKSVVLLTNNFELDVKDLEEIYKRRWAIESLYKQLKQNFPLHFFYGDSVNAIQIQTWVVLIANLLCTVISRMIKRHVSFSQLVTMLRLTLMYYTDFISFMENPQNDELIIIAKKANSPPKELDLFD >CP022041|1279773:1354453|1294115_1294625_+|ASE18834.1|DBSCAN-SWA MAHRSLDRLDKKILHLISEDARIPFLEVARACNVSGAAIHQRIQKLTNLGVLKGSQFIIDPERIGYETCAFIGLNLKNPEKFDDVVEALRQIPEIVECHYTTGEYDLFLKIYAYNNHHLMSVIHDKLMPLGLSRSESSISYNAVIDRTLPIEGMKVADVVLDDDEEDEE >CP022041|1279773:1354453|1281625_1282603_+|ASE18825.1|DBSCAN-SWA MSEINNKTIKDYLIGKATAKEMEQLAEWLAVSEENQKEFFEMELAFHLGKNNSLVTSKKIEEAETKLFDQITEYEEQNRNKNKLYFLRYAAAIIVAVLLIGGGLFAYLHQSAETITVAAMNEVKKVVLPDNSTVWLNKGATISYAGNFEGDERKVNLKGEALFHVTKNAEKPFIVNSDGASAKVLGTTFNFKDQAADGKEVISLIEGRLEVTGLNGEGKVVLHPNQKATVSKDSKTIKTENSYALADAVWRDDIIPFNNMRINEIAKILEQLYDYKIIVDAKLDHTKTYTGVIKRNKDIRNVLDGLSYTISFHYNIHDKEITLSE >CP022041|1279773:1354453|1317940_1319164_+|ASE18847.1|DBSCAN-SWA MEYVKFKDVIINSQYGYTATETSQTEGTYKYLRITDIVPYYVNFDTVPFCKITEKDVSKYIVKEGDILIARTGATTGYNYVVPSGISNTVYASYLIRFIVDKKLVLPLFMKYVLKTQSYYGFINNYIGGSAQPGMNAKVFTKFNIPKLSLVTQQKIASILSSYDRLIENNTRRIRLLEQMAENLYKEWFVRFRFPEHENVEIVNGLPKGWKTIHIKELAQLKSGYAFKSEWFVEEGEAVAKIKDIGNILMDTSNFSYVDKENCIKAKKFLLTTGDLTIALTGATIGKISIVPKHKGNIYTNQRLGKFFLGDNPMEKLPFLYCLFKQESMVSNIVNLSNSSSAQPNISPEQIEKIKILGNHDIISMYNKTCNPLFSNILALYSQNQLLTRQRDLLLPRLMSGKLEVKS >CP022041|1279773:1354453|1295266_1296889_-|ASE18835.1|DBSCAN-SWA MTDKIQTSLRDSAAMRWTALLLLSLAMFCAYIFVDILSPIKELMQEQRGWDSTAFGTMQGSETFLNVFVFFLIFAGIILDKMGVRFTAILSGAVMLAGALIKYYAISDSFAGSALDVWFTNNLNHIPVFEQLGVSPFYQGMPASAKVAACGFMIFGCGVEMAGITVSRGIVKWFKGREMALAMGSEMALARLGVATCMIFSPFFAKLGGDINVSRSVAFGVVLICIALMMFIVYFFMDKKLDSQTGEAEEKEDPFKISDLGKILSSMGFWLVALLCVLYYSAIFPFQKYAVNMLQCNLTFHELPSGSYWASSSVTIIQYVIMLVVAATAFMFNFMKNKALKNLMLFFSIFFLVAYCYMGYMRQSAESIFAVFPLLAVGITPILGSYVDHKGKAASMLVLGSLLLIVCHLTFAFVLPQFKSSQVGGVIVAYVTILVLGASFSLVPASLWPSVPKLVDAKIIGSAYALIFWIQNIGLWLFPLLIGKVLDKTNVGVTDPTQLNYTAPLVMLAGLGVIALIIGLTLKVVDKKRNLGLEEPNIKE >CP022041|1279773:1354453|1297077_1299237_-|ASE18836.1|DBSCAN-SWA MNKNLTMAMTAALMMSGSVAQAQDVNIGRSNITLTSDLMTPETLWAMGRIGAAQASPDGKKIVYQVGYYSVKENKGHQVLRVMDADGKNDRLLTTSAKSEGDAAWVDNNTLAFLTGGQLWTMNADGTNRKQLTHSDIDIEGFRFSPDRKRVVLIKSIPYYGTIKQNPSDLPKATGMVITDMNYRHWDHYVTTNAHPFVADVTPEGIGAGIDVLEGEPYESPLAPFGGIEQIDWSKDSKFIAYTCRKKEGTQYAISTDADIYIYNVETRQTKNLCKPADYVEPKIDATKSMRNQAVNHQAGDMNVGYDVNPKFSPDGKYIAWQSMKNDGYESDRNRLCVYELATGKKTYVAESFDSNVDDYTWSLNSKDLYFIGVWHATVNVYQTNLKGEVKQLTEGDHNYVSISLLGDKKLLAIRQSISQANEIFAITPAKKEKASVQTQLSFENKHIYDQLALGDVKSRWVKTTDGKEMMEWVITPPHFDPNKKYPTLLFCEGGPQSPVSQFWSYRWNFQIMAANGYVIIAPNRRGLPGFGSAWNEEVSTDWTGQCMNDYLSAIDDAANNLSFVDKDRLGAVGASFGGFSVYYLAGIHNKRFKCFISHDGAFNLESMYTDTEEAWFSNWEYDDAYWNKDKSEAAKRTYANSPHLNVDKWDTPILCIHGEKDYRINANQGMGAFNAARLRGIPAELLLYPDENHWVLKPQNSVLWQRTFFNWLDRWLKK >CP022041|1279773:1354453|1292489_1293005_+|ASE18938.1|DBSCAN-SWA MMKIKGIAKMDEERISQRVLYVIVALSAIVFLAFYLIGYDTPFTGNTAFNAPMLTDVLLGFMWGLLAITTTASIVAVVRGIRRANRSEGMTNGIPARRITYTTYGITALILLLTFVFGSTQTMMVNGENFTDSFWLRITDMFVNSSLLLLVLAAGVVAFGATRYYRKGRGK >CP022041|1279773:1354453|1282677_1285737_+|ASE18826.1|DBSCAN-SWA MRKLYALLIGALAPLCSYAQTTFPEWQSQYTTGLNKIEPHAYVLPYSSVDKLQQPGGYEQSDYYMSLNGKWKFHWTKNPDNRPKDFFQNDFAVQGWNDINVPGNWERQGYGLPIYVNETYEFDDKLFQFKKNPPLVPYAQNEVGSYRRTFKIPSTWKGRRVVLCCEGVKSFFYLWINGTYVGYNMGAKMPSEWDITKYLNDGENTIAMEVYRWASGSYLECQDYWRISGIEQDVYLYSTPTQYIADYNVAAGVDKQDNGSFNGNFSLTTNIKGAGSGSVSYVLTDEKGNTILSEDKAVSAKNNDEMVNFTTKTIQNVAAWSAEHPNLYTLLITLKDKNGNVTEQTGSKVGFRTIEIKNKQLCVNGTPILVKGANRHEHSELGRTVSKELMEQDIRLMKQNNINLVRCSHYPNDSYWYQLCDKYGLYVIDEANIESHGMGYGKESLAKDSTWLTAHMDRTRRMYERSKNHPSIIIWSLGNEAGNGVNFEHTYRWLKNADKTRLIQYERAEENFNTDIYCPMYQSLDHMKAYAKRTDITRPYIMCEYLHAMGNSCGGMKDYWDLIESEPILQGGSIWDWVDQSFRELDKNGNWYWSYGGDYGPKDVPSFDNFCTNGLIAADRTPHPHLSEVKKIYQNIKSELVSNEKGITIKVKNWFDFTDLNAYQLNWQIVNENGKVIAKGTQHVDCAPHETTTITLPAVSATTDKEQYLNLEWYPANDALILTKNDVVAYDQFVLKKASDCTTFLPREKQRMTYKVNENTGELTSLKSGNQEFLEAPLSLSFYRPATDNDGRDQFGVRVWRKDGIDSISQKVTKITRTKDVTRAEADIIGKKGNVIGKAVFTYQPQKNGALAVKVDFTPDTAAVKALARLGLTFCVKDTFGKVEYNGRGDIETYNDRKAAGFIGHYKTTAEAMFHYYVKPQATGNRTDVRWVSISDTRNRLMVAAKSPFQFSVTPFSDSVIDRATHINQLSRDGLLTVHLDTNQSGVGTATCGPGVAEKYRVSVKPTSFEFVLYPAMAK >CP022041|1279773:1354453|1280758_1281103_-|ASE18824.1|transposase|DBSCAN-SWA MKNHQLLRCIFPDVLADYFDVVDIQESVSQIDFWLDERNFMEKSDHKLGTVSSYGFTSERVIQDFPLRGKAVYRHVRRRKWRDSSNGEIFTYSYDDLTAEGSRLSPEFVSFLKE >CP022041|1279773:1354453|1291368_1292202_+|ASE18831.1|DBSCAN-SWA MKKLIALFSIITILSFSCSTIVSAQAPAKAATTATADTLSDDALNETGVADTATVKTPTAQDTGIHQSLKRKFIEGNAGFMSLVALALVLGLAFCIERIIYLSLSEIDAKRFVGKLEDMIVAGEIEQAKALSRDTRGPVASICYQGLLRIDDSIENIERSVTSYGSVQSANLEKGCSWITLFIAMAPSLGFLGTVIGMVMAFDQIQEAGDISPTIVASGMKVALITTIFGIIVALVLQIFYNYILSKIDHITAQMEESAITLLDAIMKYKLKSIHNS >CP022041|1279773:1354453|1319179_1322380_+|ASE18848.1|DBSCAN-SWA MKSFISEDDIEQTLCTRLSQPEFGWKRIECDPSVEAQDDVSKTGRRNSSECILPAVFLTALERLNPQIDKSILAQVVADFRKDYTGKDMMDTNYKFYNYLRNGINVKVKKNGKDDFDIVRLIDFDNVENNDFHCVNQMWIKGRIRYRRPDVLLFVNGLPMVFIELKNSTVKIKEAYEKNLVSYREDIPNIFALNQICVLSNGMQTKLGAWNSKYEFFFDWLKVNDEHEKLDREHIAEYGLSIINLIDSLFRKERLLDYIENFIFFDNKRKKIIAKNHQYLGVNNLMKSVECREELKGKLGVFWHTQGSGKSYSMVMFVRKVRRKLKGNFTFLVITDRDDLDTQIHKTFVRSEVIGEKEECQPKNAAQLREFLSGNKPMVFTLIHKFQYDKTKKYPLLSERNDIFVLVDEAHRTQYKQLAENMHTGLPNANYIAFTGTPLLGSKRLTNQWFGDYVSEYNFAQAIEDGSTVRLYYSRRVPEVGLENNWLDSDIDKIVEEEELNDRERELLENSSSRILEVIKRDGRLDRIAQDIAHHFPRRGFLGKGMVVSVDKYTAVKMYEKVQHYWGEEKKALIKERNAAKTQEERDELTARLDYMNKVEMAVIISEEADEVEKFKAQGLDITVHRNKMNEITPEGKDIEDRFKDKDDPLSLVFVCAMWLTGFDVPSLSTLYLDKPMKGHTLMQAIARANRVYPGKSCGIVVDYVNVFKYMQQALSDYASNGEEGAEFPAKDITQLIATIDGCIEECDSFLQGLGIRLDKIITDGDTLDKLESLRLAYDKILEKDESKNRFKVMSNLMMNLHDAAKPEIFELGWKNEKFSPLSYLNGLFCNRIDDEKLRRAKEKMSYTLDDSVTVMVAEDKPQYSIHQSKVIDLSKLDIESIRKSINATPYKSMEIDNLRTFIETALEQLINKNCTRVPFSQRYKNIIDTYNAGGTENEDYYEKLLQLIDELKKEQGRSADMGLREEELEIYDLLIQGRKLTKEEEKEVILASKNLYNKLVEEKERLLVVDWYKDPQPKTKVLGLIQRSLDKDLPKTYDREVFSNKTNLLLDHFVDMAVQGYGWIS >CP022041|1279773:1354453|1333821_1334337_+|ASE18855.1|DBSCAN-SWA MILRNTKRIINYCVLMIGLGVWGSCNSKSQHVLLPVDNVKSYKICENNDTTTIFEEREEESEEVIKFYKEGSEIFTSDYGGRKELLMSTTEMLDTVYSGNTYCREHRILIKKENSNLFSTSIYNIIIHPVLVLTIYYDHSYNIKAIRNWFAFTTYESEHISIPLISRPVKY >CP022041|1279773:1354453|1316250_1317927_+|ASE18846.1|DBSCAN-SWA MTSTELKDLEGRLWQSADMLRAGAHLAANKYSQPILGLIFLRYADVLFKQHKEAIDTAYNEYKGTRMERSYKDIAIEKCGFFLPECAYFDYLNDAPDDAQKALLVKAAMEAIEHENPRMDGVLPKEVYGQLVPEEEPELLSRIVRVFKDIPENISIDIFGQIYEYFLGNFALAEGQGGGAFYTPASVVQYMVEVLQPATGDKKFLDPACGSGGMFVQAARYMHRHNTSNEQMMNFRCYGVEKEPDTVKLAKMNLLLNNVRGEIMEANSFYSDPYNAVGQFDYVMANPPFNVDEVVVERVTDDARFNTYGVPRNKTKSAKKASDKKETVPNANYLWIGYFATALNEQGKAALVMANSASDAGGSELEIRKKMIEDGIISQMVTLPSNMFSTVTLPATLWFFNKKRPKKDEILFIDARNIFTQVDRAHRKFSDEQVKNLGIISRLYEGDSDAFWALVEEYKAEGKQSEADWLLERWPDGKYQDIVGLCKVAKLEGEDGIIDNDYSLNAGRYVGVVIEDDGMTEEEFRTEMLSLNSEFAKLSAEAKDLESEIEKNLKELLG >CP022041|1279773:1354453|1293543_1294020_+|ASE18833.1|DBSCAN-SWA MKLYRGRNHEIPALNTASMPDLIFSILFFFMLVVHMRKANVHVKYQVPMATELSRMYNNSTIQHIYIGRPINSLGQVEGEKMVVQLNDHITTIPEIRKYLIQLSAALPPEQRKELSVSIKADRHADMGTIMDLKQVLREANVLNVNFTATMSRNNKLK >CP022041|1279773:1354453|1347696_1349757_+|ASE18865.1|DBSCAN-SWA MIEKEQILLLTQGGLNVFSHFLGFEVNLHRNFRSPFYDDRRASCHIYYDRKTFSYKFYDHGDTTYSEDCFWFVATLRNLNLKTSFPEVLETIVQELGLYSLCNGGKHSSHITSTYKKTIVPTPKADITKCTEEHPYSFEIQPFDDGLLNYWAHYGIHEDTLRRFRVRSLKRYESVSAEGRKFELYSSPTEPMFAYIGNGYVKIYRPHSPKIRFLYGGRMPATYCFGMEQIPAKGDMLFITGGEKDVLSLYAHGFNAICFNSETAQIPTSIIESLQLRFRHIILLYDADETGVREAHKQSEHLVEYKVLNLSLPLSGTKSEKDISDFFALGNGAKELKELLAKMFSDLYSQTMMMLRSCEIDYENPPDISKSVVAVNGVPLGTQDNLFCITGGEGTGKSNYVGAILAGALGEKRLPIEKTLGLEITPNPKGLAVLHYDTEQSEAQLHKNLGKTLRRASLTAVPEFCHSLYLASLSRKDRLNLIRESMDLFHHRHGGIHLVVIDGIADLIRSANDETESIAIVDELYRLAGIYNTCIICVLHFVPNGIKLRGHIGSELQRKAAGILSIEKDDNPEYSVVKALKVRDGSPLDVPMMLFGWDKAEDMHVYRGEKSKEDKEKRKTEELIAVVKEAFRNSFKLTYQELCEVLMREMEIKDRTAKKYIAYMKEQRILAQDTNGNYQKGELCRT >CP022041|1279773:1354453|1309772_1312529_+|ASE18843.1|DBSCAN-SWA MRKSIIYLCACALSGMMVTTSCQDNLDSDAGVSNTRSVNIDKDLFAIKGCINVKLAKGTNQAIPTTRSGSVEMQSVPSAMTSAMQYSGAYKMERVFKPAGIYEARTVAEGLDRWYTIYFDKSKDVAAVLDQFKKAEGVECAEQVLPMARPTVKMTPYSPSGASMQATASTFDDPLLAKQWHYYNDGSVNARAKKGADCNVKPVWEKYTTGKKNVIVAVVDGGIDITHEDLKDNLYVNEKEKNGQPNVDDDGNGFVDDIYGYNFVTAKDVVGGTIEPDDGGHGTHVAGTVAARNNNGKGVAGIAGGDGSPDSGVRLLSCQIFRNKDEQGDAAAAIKYAADNGAVICQNSWGYSSTAGVTSMPQLLKEAVDYFIKMAGCDANGNQRPDSPMKGGVVMFAAGNENKEFSAYPACYAPTVSVAAMAWDFSKASYSNYAKWVTITAPGGDQDRFGTEAGVLSTVPKKKVASGYAYFQGTSMACPHVSGIAALIASYFGRQGFTNEELKSRLITAYRPYNIDEQNPTYKGKLGKGYIDAEAAFETDTKIAPEKVGTLTLKPDFVDINAEWSIAKDEDKTAAFYRLYIAQGELTADKLKDMTYREINGMGHSLGETLKYDFDDLKDNTTYSVAVVAVDRWGNLSEPMIQKCTTRLNHAPEATNFPTEAIEVMENDRKSFSFNVADPDGHNWDIKATGETKGVSYTVKGNTVTVNLVPVLEAGSYNCTFILSDDLGAKAEKSFTFKIVKYIPPQLTKPFENYIIGLDEGVVTIPLTGHYTHSGNTQLTYKATAANGSIATATISNDNLQLKGMAKGVTRISIAATDGRETSSDGSFQVRVVEKKSAPVYAVYPIPVQRDIHTLLNPEVKQAELVISSTVGERLMKATVTPDKNNVATLDLSKLNPGTYKLTVYTSKGNHTQMFIKR >CP022041|1279773:1354453|1308155_1309757_+|ASE18842.1|DBSCAN-SWA MRIINIHKLLQFSLLYLLLGIAVGCAENDSFDQPYLNVSEKEISFSNQIGEKTITVNTNCKEWMATTPKAWVHLSQSGNEIAVHVDPNTTGMERSSYILVDGGLAVQKIMVSQSAADITLNLNNGEVILPQAGGTTTVDLKMDATSYDLTQSEQPEWMQVIKKKHGLKFISKPNYSTTERTTKLTIAFAGKNHEVVVKQPGVATFILACNPGNPYSLHKMMDYEYRRGSFLTEYGGPDEVNGIFEESYFFKTPSPLFKDVVYVHDTKHSVPTRIYTRSLTREGVNAVKSQAFQEFMRANGYTRDEKDTNHYVNIKEAFTMDVDIREENNSVVLFFYQMHTQDRSYPTFSSLDLGPVDLLNKTDKKISDVEAYETGKNSEEMKRQMSKSNEVEAILYKTNDPTLIARTYFFYLHNDAAVPQEKAGSVEQYSLFYSQPNLGIWQYGNEWFVTHEFDKLLTANNFEFVGYNGKHHVYARRSDYLTLAISGGEYADVNNGKAVMQITVLYKPTVFAGSKEQRLAKVERMLKQYNPKK >CP022041|1279773:1354453|1337805_1338279_+|ASE18942.1|DBSCAN-SWA MAKDFNIQTYPTFFLLLLIMLIPFALSVDDIMNNDQSNMKIILLISVFIIIILVYGLVKTRKRKIVINELGLLITEKEGNMKISWEEISHIFFCSTPIWGLYVKITFTNRRDPLIIDFGKSSLWSVNFYRFRIAILAFSHKKDIIVVKSNQWYLKLI >CP022041|1279773:1354453|1347047_1347404_+|ASE18864.1|DBSCAN-SWA MIEVNHQLQHYKQKARELLTSEEGIKHRGRRCIEPEAVFGQTKYNKAYKRFRHFGKDKVNMDFAFFAIAFNIGKMCRKTNLKEIKAVMELLLVTFRCCIQVYIGYLKPNKSFYMKLAA >CP022041|1279773:1354453|1301171_1302782_-|ASE18838.1|DBSCAN-SWA MAETKYIFVTGGVVSSLGKGIISSSIGKLLQARGYNITIQKFDPYINIDPGTLNPYEHGECYVTEDGMETDLDLGHYERFTGIKTTKANSMTTGRIYKSVIDKERRGDYLGKTIQVVPHITDEIKRNIKLLGQKYHYDFVITEIGGTIGDIESAPFLEAIRQLKWELGKRAINLHLTYVPYLKAAGELKTKPTQHSVKELQSVGIQPDVLVMRTEKHLDDDIRKKVAAFCNVDFDCVVQSEDLPSIYDVPVNMLEQGLDAAILRKCGEEVGPKPALGPWKEFLDRQRKATKEVHIGLVGKYDLQDAYKSIREGLLQAGTYNDRKTVITFINSEELTEENVAEKLKGQDGIVICPGFGQRGIEGKIVAAHYTRTHDIPTFGICLGMQMMVIEFARNVLGYKDANSREIDEKTTHNVIDIMEEQKNITNMGGTMRLGAYECVLRQGSHTFNIYKQEHIQERHRHRYEFNNDYEKEFEKHGMMCVGRNPESDLVEIVEIPGLKWYIGTQFHPEYQSTVLGPHPLFLDFVKTSIENQKNK >CP022041|1279773:1354453|1303820_1304546_+|ASE18839.1|DBSCAN-SWA MKKAIVVGASSGIGHEVARLLIAEGWAVGVAARRIDKLTDLQAMAPERVYTVQIDVTNEDAETSLLQLIERMNGIDLYFHAAGIGWQNPSLNADIELKTMETNAVGFTRMIGCAYRYFANKGGGHIACITSIAGTKGLGPAPAYSATKAMQNTYLQALEQLAACKHHNIHFTDIRPGFVDTPLLAGTSHLPMLMTTKKVARSIIKAINSRRHICVIDSRWCVLTYLWRHIPNWIWRRMKLC >CP022041|1279773:1354453|1353232_1354453_+|ASE18869.1|integrase|DBSCAN-SWA MKENKLKVSFFVQAKRTDKKGLVPVIGRISVGRTHSGFSTKCKTPLALWDSRKQRLIGKSSMAVSVNQKLGECTALIHARFHELCEREESFTATDVRDAYQGQIHCQALLLESFGEYLTQTKERIGIDRALKTFKLRTYQLSLLREYVQKKHKVSDIPLSQLDKSFIEGFEYYLTIDRRLKRSSISSTLSTLQTIVRIAVKKGALDFYPFLGYSYERPKGEPRSITQEELERIIELKIEWKNYRIVRDLFVFSCFSGLAISDVRNLREENIVLEEGELCIKGRRMKTKTPYRVQVLPPAQTIMNRYRGIRAGFVFDVPTTDVILNGMHYIQRNIGMKTPLTFHMARHTFASLITLSAGVPIETVSRMLGHTNLRTTQVYAAVSSERIHRDMQIVQQRIQDTFTLKL >CP022041|1279773:1354453|1339055_1339493_+|ASE18859.1|DBSCAN-SWA MKTDKNIIGVIALHVITFTFVIISWNHSFNSIGFNIVSGISCMLLGGSYLHLELSKYSEEGKVTWFSITHTIIPLIFLLFWAGIALIFYHYIVIMSLLQCLVTSLIPLSFLYLRSKRKENKYITLLLIIISGIFIVLPFIVLSIL |
52 | Staphylococcus_phage(22.22%) | transposase,integrase | attL 1337644:1337658|attR 1366005:1366019 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP022041.2|ASE18007.1|74110_74530_-|PcfK-like-protein |
74110_74530_-
Protein sequences of CP022041.2|ASE18007.1|74110_74530_-|PcfK-like-protein>CP022041.2|ASE18007.1|74110_74530_-|PcfK-like-protein MKGTEHFTRTIAEYLNQRAMTDPLFAPNLLKPNKNIEECITYILNEVQKSGCNGFDDDEIFSMAVHYYDEDDIEVGKAISCQVAVNHIVELTEEEKAEARQEAIKQYQREELAKIQSRNARVKKTENATTQVQPSLFDF |
139 aa aa |
40
gnl|BL_ORD_ID|40 information
|
NA | NA | No | NA | ||||||||
CP022041.2|ASE18124.1|255433_255853_+|PcfK-like-protein |
255433_255853_+
Protein sequences of CP022041.2|ASE18124.1|255433_255853_+|PcfK-like-protein>CP022041.2|ASE18124.1|255433_255853_+|PcfK-like-protein MKGTEHFTRTIAEYLNQRAMTDPLFAPNLMKPNKNIEECITYILNEVQKSGCNGFDDDEIFSMAVHYYDEDDIEVGKAISCQVAVNHIVELTEEEKAEARQEAIKQYQREELAKLQSRNARVKKTENIATQVQPSLFDF |
139 aa aa |
40
gnl|BL_ORD_ID|40 information
|
NA | NA | No | NA |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP022040_1 | 75267-75356 | Orphan |
NA
Consensus repeat of CP022040_1
|
1 spacers
spacers of CP022040_1
>1.1|75291|42|CP022040|CRISPRCasFinder GGCTCAACACCCAACACCCATCACCCAACACCGGTATAATTG |
RT |
CRISPR arrays and Neighbor proteins around CP022040_1
The CRISPR arrays of CP022040_1 >merge|CP022040|1|75267-75356|CRISPRCasFinder CCTCCAATAAGATGGTTTTATAAAGGCTCAACACCCAACACCCATCACCCAACACCGGTATAATTGCCCCCAATAAGATGGTTTTATAAA >CP022040|1|1|75267-75356|CRISPRCasFinder CCTCCAATAAGATGGTTTTATAAA GGCTCAACACCCAACACCCATCACCCAACACCGGTATAATTG CCCCCAATAAGATGGTTTTATAAA
>CP022040.2|ASE16654.1|74567_75062_-|sigma-70-family-RNA-polymerase-sigma-factor MDNLQNEQDFSRIVREHKSTIYTVCYMFSKDEDEVNDLFQEVLINLWKGLQNFRGESDVRTWIYRISLNTCISCDRKKRKRKTIPLSMNINPFTDSDDDSRQIQQLNRRISQLGPFDRAIILLWLENMSYEEIGEIVGISTKNVSVRLFRIKEKLKKTSETTKE >CP022040.2|ASE16653.1|73823_74447_-|hypothetical-protein METKHTEFEEMRQQLDILKNKLDNQTLINDKLIRQSMLNKMSFMKKYTWVSFLVLLFIYYVYNDIRITFNLSWWFYGATVIFMTFTVCFDAYINRFNKEAFLNGDLIATSLQMQRMKKLRKRSLLYGIFFMTIWVSWFVIELYNGSGAANGGENTSMFYGILVGGGVGLIAGVAIGIWLYLRMQRINSDIISQIDELTKDTKDSPIL >CP022040.2|ASE16652.1|70701_73362_+|DNA-mismatch-repair-protein-MutS MAKDDKGLTPMMKQFFSMKAQHPGALMLFRCGDFYETYGEDAVESARILGITLTRRNNGGNGDSIEMAGFPHHALDTYLPKLIRAGKRVAICDQLEDPKKKREAIKGKKGLSAMDKMVKRGITELVTPGVAMSDNVLNYKENNFLAAVHFGKGSCGVSFLDISTGEFLTGEGTFDYVEKLLGNFQPKEVLFDRAKKQDFERYFGTRLCTFEMDDWVFTDQTARQKLLKHFGTKNLKGFGVDHLNNGVVAAGAILQYLEITQHTQINHITSLARIEEDKYVRMDRFTIRSLELIAPMNEDGSSLLNVIDNTITPMGGRMLRRWMVFPLKDEKPINERLDVVDYLFREPDFRECINEQFHRIGDLERIISKVAVGRVSPREVVQLKNALMAIQPVKTVCLYTKSDTLKRIGEQLNLCESLRDRIEKEIQPDPPQLVNKGDVIALGFNQELDDLRSIRDNGKQYLLEIQEKEIAQTGITSLKIGFNNVFGYYLEVRNTFKDKVPENWIRKQTLAQAERYITPELKEYEEKILGADEKILALETQLYMELIQDMQEFIPQIQINANLIAHLDCLLSFMKVSQLQRYVRPVVDDSEVIDIKQGRHPVIETQLPIGEQYVPNDVLLDTEHQQIMMITGPNMAGKSALLRQTALIVLLAQIGCFVPAERARIGMVDKIFTRVGASDNISLGESTFMVEMTEASNILNNVTPRSLVLFDELGRGTSTYDGISIAWAIVEYLHEHSRAQARTLFATHYHELNEMEKNFPRIKNFNVSVKQVDGKIIFVRKLEKGGSEHSFGIHVAEIAGMPRSIVKRANVILKELEKDNSQVGSVGKAAVERLDQSREGVQLSFFQLDDPVLTQIRDEILGLDVNNLTPVEALNKLNDIKKIVKG >CP022040.2|ASE16651.1|70091_70508_+|hypothetical-protein MYKIITQLKIMKKYLLILLALLPMMLFTACSSDKEEETSNNLVGTTWQYLEKEGNKITSMTVLEFKSNTVVKIITDEDLDKNPTERDHDEGDATYVLNGKKISIKRDIYTAWGEVDGDKLTLQYVDDGEIITMIFTKR >CP022040.2|ASE16650.1|68672_69515_+|prolipoprotein-diacylglyceryl-transferase MNNLLYIAWQPSEVIFQLGSLPIRWYGMCWLVGLALGYFMMQWLFKRHKFPPSQFDPLFLYVFFGVLIGARLGHCLFYEPEEFLTSWKGIMTIFIPIREMADGSWKYVGYQGLASHGGVAGLLIALFLYIRRTKMNTWVVLDFFGIVSGITACFIRLGNLMNSEIIGKVTDVPWAFIFYNVDDKPRHPGQLYEAIAYLIIFLLIYFIYRKYPKKVGTGLYFGLCLTLIFTFRFFIEYTKEIQEAFEAGLPIDMGQILSIPLIALGVWSILRSRGKEGKLP >CP022040.2|ASE16649.1|67496_68600_+|redox-regulated-ATPase-YchF MALKCGIVGLPNVGKSTLFNCLSSAKAQAANFPFCTIEPNLGVITVPDERLNKLAEIVHPGRIVPATCEIVDIAGLVKGASKGEGLGNKFLGNIRECDAIIHVIRCFEDDNVVREGGMAVNPIEDKEIIDTELQLKDLETIEAQLAKQQKVAAAGNKDAKVMVSVLEAYKEVLEQGKNARSVEFESKEEQQAAHDLFLLTTKPVLYVCNVDESSAKTGNEYSQMIEKIAAEEGAEAMIIAAKTEEDIASLESYEDKKMFLDELGLEESGVSRLINKAYHLLNLQTFITAGEMEVKAWTFHKGWKAPQCAGVIHTDFEKGFIRAEVIKYDDYIKYGSEAAIREAGKLGIEGKEYVVQDGDIMHFRFNV >CP022040.2|ASE16648.1|66383_67193_+|DUF3108-domain-containing-protein MKKFKIYIVCLFAMIAVSTTAQCTFRNTAFSSGEYLTYNLYYNWKFIWVKAGTASWYTVSSTYKGIPAYRASLTTRGNGKLDDYFVLRDTLLTYNSKQMEPLYFRKGAREGKRYTVDEIFYTYPNGKCKLRQHRINNEGKHQWMENTYDDCCFDMMSIFLRARSFNPENWKKGEVVKFPIVDGNSRHAGRIVYGGKENIKADNNHKYRCLRLTYYEYEDDKWREIANFYVTDDSNHIPVRLDMSLKFGSAKAFLVSMKGINSPITSEVK >CP022040.2|ASE16647.1|64435_65482_+|ribonucleotide-diphosphate-reductase-subunit-beta MDNQLKRNTLFNPSGDIELRLRRMIGGNTTNLNDFNNMKYSWVSDWYRQAMNNFWIPEEINLSQDFKDYPRLEKAERTAYDKILSFLVFLDSLQSNNLPTLSEYITANEVNLCLHIQAFQECIHSQSYSYMLDTICSPEERNDILYQWKTDEHLLNRNKFIGDCYNEFHEKRDKFSLMKTLIANFILEGIYFYSGFMFFYNLSRNGKMSGSAQEIRYINRDENTHLWLFRSIILELKKEEPDMFTPEKIKVYEDMMREGVRQEIAWGQYVIGNDVQGLNAQMVSDYIRYLGNLRWSGLGFGFLYDDNQKEPENMKWVGQYSNANMVKTDFFEAKSTAYAKSTALIDDL >CP022040.2|ASE16646.1|61694_64214_+|ribonucleoside-diphosphate-reductase-subunit-alpha MNITKRNGEVEVYNNEKISIAIKKSFISTGKDISDNEIAGMVSEVEQFIKENPELRTVEDIQNRVEKCLMAHGHYDEAKNYILFRYQRNEQRQAINYIVWTADDRELADVLHSVAREYRERSYSMVTLQEKFSSFTKPGMSQKDSIDALIKAAVELTTPEAPAWEMISARILSYRSEKKITRLEEELGLKTFYRKVKYMTEEGLYGDYILQNYSEEEINEAATFIDPERNKLLNYSGLDLLLKRYVIKNYSGKVIERVQEMFLGIALHLAMPEKEDRLMWVRRIYDLLSKLEVTMATPTLSNSRKPSHQLSSCFIDTVPDSLDGIYRSLDNFSQVSKFGGGMGMYFGKVRATGGNIRGFKGVAGGVIRWMRLVNDTAVAVDQLGMRQGAVAVYLDVWHKDLPEFLQLRTNNGDDRMKAHDIFPAICYPDLFWKMAEEDMNQNWSLFCPNEIMRIKGYCLEDCYGEEWERKYLDCVNDQRLSRRVISIKDIVRLVLRSAVETGTPFTFNRDTVNRANPNAHKGMIYCSNLCTEIAQNMAPIETVSKEVETKDGDTVVVTTTRPGEFVVCNLASLSLGRLPLEDEEKMKEKVATVVRALDNVINLNFYPVPYAQLTNQRYRSIGLGISGYHHALAKRRIKWESEEHLEFMDKVFETINRAAILASSNLAKEKGSYQFFEGSDWQTGTYFDKRGYDSAEWQDVRKTVALQGMRNAYLLAVAPTSSTSIIAGTTAGLDPIMKRFFLEEKKGSMLPRVAPELSDETYWMYKSAYLINQKWSVRASGVRQRHIDQAQSMNLYITNDFTMRQILDLYLLAWKEGVKTIYYVRSKSLEVEECESCSS >CP022040.2|ASE16645.1|60712_61333_+|hypothetical-protein MKQTALFFVFLLTLLSSCCTKTCQKTTESNALDSLVLVDTTEMKSAFLGCIHRFIKQYPKDSTFILKCGYGYEDHGVYTNGVYINSDVFVIQPAYYDMFMGGEWSIDDMYPSHYFKIDNRIVFLCSRSDSFMKQEKYRKAYSQIVSDSLRVHYEDFAFILVEHKDNKATLLSSEEMRKRKISPISGFRTVVKFKAPKLTDESSDDE >CP022040.2|ASE16655.1|76097_77423_+|RNA-directed-DNA-polymerase MATRTINGREYTKVDFENKLRHLSDAQELAAMLNELKLPWYYSDFRGKQLSFLADTNNVQRRCKTFRLRKKHGGYREITAPKGSLRGILNALNILLQTYDEPTPWAFGFVCGRSVVDNARPHVGKRYILNLDLKDFFPTITRQQVADCLTAEPFGFSSLAVKLISGLATVRTKNNKEVLAQGFATSPTLSNFICREMDKEIAGVAAAQGITFTRYADDLTFSSDTDILRPQGELVQQVKAIVERYGFRLNEEKTHLQRRGRRQEVTGLMVTEKVNVSRRYVREIRSLLYIWERYGYEDACQAAWKSYRQQHGKTKGHQHCVPLNAVLRGKLNYMKMVRGADDPLYQRFVSRYTSLQQRSKGDIKEVAYKAYMGKYLSNSTEDRMTSANVLPNDNTSSRQLGATSSNPYDPRKKSKRILNFIVMVIILVAIILIKLFLKSLL >CP022040.2|ASE16656.1|77585_78179_-|chorismate-binding-protein MCQYIETIRVIDGCVCNLAYHEERLNRTRKEMLGLTEPLHIADLLKAVSLPMECSKLRFVYDKEGIHDITCTPYICKEINSLHLVYDNNISYPFKSTDRSALNELKKQQGDCDEILIVRDNHLTDTSYTNIALYDGEQWFTPSTPLLCGTMRQRLLDCGLLQEREIMVSDIPNYQYISLFNAMISLGEVILPVDKIK >CP022040.2|ASE16657.1|78162_79152_-|aminodeoxychorismate-synthase-component-I MILYDREHAIQRMNTLAKEGKDFIFIINYKADGAYIEEVADINPHELLFAFPSLSNIPEGESYSNEAVEWHTEPLTRDDYEQRINLVKQREREGDSYLANLTCKIPVRTNLSLHDIFMRSKALYRCWMKEKFVCFSPEIFVRINKEGLISSFPMKGTIDATRPEAEKELMENKKEAAEHATIVDLIRNDLSIIAEQVQVKRYRYIDHLTTNKGEILQTSSEITGQLPTDYRENIGTLLFHLLPAGSITGAPKPRTMEIIDEAEGYERDFYTGVMGCYSKGQLDSAVMIRFIDQDKDGQLHYKAGGGITAQSNNDDEYKEVIEKVYVPIY >CP022040.2|ASE16658.1|79193_79835_-|semialdehyde-dehydrogenase MRAIILGATGAIGKDLVQELINDDTIEQIAIFVRRDPGINNEKVTTHIVDFDQSDEWRLSVQGDVVFSCMGTTRKAAGSKENQYKIDYTYQYNFAKIAAEQGVPSFILVSAAMANANSHFFYTKMKGELEEAIKQLPFQHISILRPPALIRKNTTRSSEKLSVSILHFFNKIGLLQSQRPMKTEVVAHCMVELAKTKKSGVFEPKDIFKIGER >CP022040.2|ASE16659.1|80075_81134_-|sensor-histidine-kinase MKVLEDIKKYCLSRYNLSVLGAQVGIYALLVSIWSLVIMLLEHDVNAAKESMCVNAFVLFLLLIVFMANFYVLVPYLFEAKNKIKHWAFWVINLLFIVLWNHHIFSIYNADLPNAPIRIGFYQFGVMWMILNYAMVVAAIFVRYYIRHSTLRRQLREEKQKMTEAELAWLKNQLNPHFLFNTLNNIASLTQTSPNNAQKAIGQLSELLRYALYETQPKEVSLNGEIAFIKNYINLMTLRSGSNVEIKSQFIIHNTQLLIAPLVFLTPVENAFKHGISANKPSFIHISITEDNGKIVFLCENSNYPKNDTDKSGKGIGLENMYRRLELIYPDRYHIEQRITPEVYHLKIIIKP >CP022040.2|ASE16660.1|81595_83524_-|4-hydroxy-3-methylbut-2-en-1-yl-diphosphate-synthase MIDLFNFERRKTSVTHVGALNIGGENPVRVQSMTTTSTDDTEGSVAQAKRIIDAGGELVRLTTQGKREAENLKNINAQLRADNYMAPLCADVHFNANVADVAALYAEKVRINPGNYVDPARTFKKLEYTDEEYAQELQKIEDRLIPFINICKENHTAVRIGVNHGSLSDRIRNRYGDTPEGIVESCMEFLRIFRKYDFHDIVISIKSSNTVVMVRSVRLLVAEMDKEGMHYPLHLGVTEAGEGEDGRIKSAVGIGALLADGIGDTIRVSLSEEPECEIPVAKHLTWYIRRHEKHHLIPAEQYDGFDYLHPNRRETVAAGNIGGENVPVVIATRKADQATAEVSSPELPKPDYIYVQGELPEKRAKKQKYILDYDAYMNLANSGKQSLENVYPIFPVTGMPFISAINSDVKFLVLKFGTPSEEFLACLKAHPEVVVVCMTSHQNRLGDQRALAHQLMIAGVKNPIIFAQMYQHSTTEEKEESSNSQQAETTTAKEKFQLEAAADMGALMMDGLTDGIWLMNNGNLSQEDVEQTAFGILQAGRLRMVKTEYISCPGCGRTLYDLRTTIARIKEATKGMKGLKVGIMGCIVNGPGEMADADYGYVGAGPKKVSLYRKQVCVEHNIPEEEAVERLLALIKADQNKA >CP022040.2|ASE16661.1|83691_84198_-|5-(carboxyamino)imidazole-ribonucleotide-mutase MKPLVSIIMGSTSDLPVMEKACKWLEEQEIPFEVNALSAHRTPDAVETFAKEAKGRGVKVIIAAAGMAAALPGVIAASTPLPVIGVPIKGMLDGLDALLSIVQMPPGIPVATVGVNGAQNAAILAAEMIALGDEAIAKKIDNWKASLGQKIEKANKELAELKDYKFKC >CP022040.2|ASE16662.1|84326_84944_-|hypothetical-protein MNEKNIILTARVVSMVLTPFYLPVVGILAIFTFSYLSMFPWQAKLSYVFLVYAFTVLIPTLLIHLYRQYHGWTLIQLGQRERRMVPYVISILCYFTCFYIMNILHLPHMLTSILMVALIIQILCAIINVWWKISTHTAAIGGVTGSLIAFSLLFNFNPMWWLCLTLIVSGFVGSSRMILRQHSLEQVASGFFLGIICSFITIIVV >CP022040.2|ASE16663.1|84984_86499_-|RNA-polymerase-sigma-54-factor MAQEQVQIQTQKQQQVQRLSQQQMLQVKLLEMPLTELEESVNAELDDNPALEAGGEETDSIDDNDTVEHSEDDDFDTLQEREERQDALDSALERMRSDDDLPTYDSRQQRNNAEYEEIVYGDTTSFIDKLNEQVGERELTERQKSILEYLIGSLDDDGLLRKDLDSISDELAIYYGIDASTKELEEVLKILQDFDPAGIGARDLQECLLLQIDRKVENGEWEKDSHLYKYIYNILSHHFDAFKKKHWDKIQSALSLSDLQVEALQREIRKLNPKPGSSMGETQGRNLQQITPDFIIDTEDDGTVTFSLNHGNLPELHVSQTFNDMMETYRNNKANMNRQEKEALLYAKEKVEKAQGFIEAVKQRRHTLQVTMKAIIDIQRKFFQDGDEADLKPMILKDIADRTGLDISTISRVSNIKYAQTRWGTFPLRFFFTDSYTTEDGEEMSTRKIKLALKEVIDKEDKRKPLSDDALAKVMKEKGFPIARRTVAKYREQLGLPVARLRKE >CP022040.2|ASE16664.1|86545_87196_-|hypothetical-protein MATHPLWSDDYWLLLLQLYLKKPEGMKPMYSRALVDLSLELHIPPKNLYEQLFKLRHRDMPIINLIWETYGENTRKLNKDVKKLRSMKGFGQPKKFYDGVKVRETFEHDFLPVEGGATELKPFMLIMILDLYFRLTPITMAAETPEVIDLAKLMKIKPQMIVEVMDVFQLCDPYLNRDDLLISPLLMPCQEVWNRYGNDNPEKLSALAAQLKEYFT |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP022040_2 | 1282211-1282327 | Orphan |
NA
Consensus repeat of CP022040_2
|
1 spacers
spacers of CP022040_2
>2.1|1282240|59|CP022040|CRISPRCasFinder ACAACTTTCTGATGACATTTAGCTGAATGAGGTTAGAAAAATGTTCTGAAGGTCTTGGA |
CRISPR arrays and Neighbor proteins around CP022040_2
The CRISPR arrays of CP022040_2 >merge|CP022040|2|1282211-1282327|CRISPRCasFinder TTTGAATCCCAACGGAGTCACAAAAAGTGACAACTTTCTGATGACATTTAGCTGAATGAGGTTAGAAAAATGTTCTGAAGGTCTTGGATTTGAATCCCAACGGAGTCACGAAAAAGT >CP022040|2|2|1282211-1282327|CRISPRCasFinder TTTGAATCCCAACGGAGTCACAAAAAGTG ACAACTTTCTGATGACATTTAGCTGAATGAGGTTAGAAAAATGTTCTGAAGGTCTTGGA TTTGAATCCCAACGGAGTCACGAAAAAGT
>CP022040.2|ASE17520.1|1280838_1281876_-|aspartate--ammonia-ligase MSQLIKPEGYKAVLGKRQTEQGIKLIKEFFQQNLATELRLSRVTAPLFVLKGLGINDDLNGVERPVSFPIKDLGEAEAEVVHSLAKWKRLTLADYDIQPGYGVYTDMNAIRADEELDNLHSLYVDQWDWEAVITEGDRTRSFLENVVRRIYAAILRTEFLTCETYPQLEPFLPQTIHFIHAQDLLDMYPTMTAKEREDEVCKKYGAVFIEGIGCRLSNGEKHDGRAADYDDWSTVAEDGKTGLNGDILIWYPILDRSIELSSMGVRVDKTALLRQLTIEKQEERQQLFFHQQLLSDKLPLCIGGGIGQSRLCMIMLHKAHIGEIQASIWPEEMRRECEEAGMPLI >CP022040.2|ASE17519.1|1279753_1280140_+|hypothetical-protein MRMKMLILIGFITVSCTSVPNKKCIVQKNQVRVIKTEKSTAFKYLQGIWIIPHAADIRIVFKNDSTFEFHDYNSIKDSIEILKGIYVLNNDQLTLRYTDRPQQKFIYHFGRYEDERYIRKGKYYFVKQ >CP022040.2|ASE17518.1|1278327_1279212_+|acyltransferase MRELKIGFLQQHNVEDIKNNIERLAEGITNLAQRGAELVILQELHNSLYFCQTEDVNKFDLAETIPGPSTGFYGELARELGIVIVTSLFEKRAPGLYHNTAVVIEKDGSIAGKYRKMHIPDDPAYYEKFYFTPGDLGFHPIDTSVGRLGVLVCWDQWYPEAARLMALQGADMLIYPTAIGYESSDTDEEKQRQREAWTTVMRGHAVANGLPVIAVNRVGHEPDPSEQTQGIQFWGSSFVAGPQGELLYRACDNDEDSVILSINLDHSENVRRWWPFLRDRRIDEYGEITKRFID >CP022040.2|ASE17517.1|1277172_1278240_+|agmatine-deiminase-family-protein MNTNENHSFRLPAEWEPQSGVILIWPHEDTDWRPYLEEITEVYLQMADAITRYEALLITARETEMVSSLLKGRLTEEQMKRVTLFTCDNNDTWARDVAPISLIANKASKDSFHPLRLLDFCFNGWGEKFAAEKDNRINRQLHEAGLLQGVLENHKDFVLEGGSIESDGSHTLFTTTSCLMAPHRNQPLTQEDIDKQLRLFFPNVERVIWLDYGQLAGDDTDGHIDTIVRIAPNDTLLYIGCDDKEDEHYEDFQLLEEQLKQLRTQKGEPYRLLRLPMPDAIYDDDERLPATHANFLIINGAVLVPTYNQPEKDKEALDTIQEAFPDREIIGIDSRTIIRQHGSIHCLTMQLPANG >CP022040.2|ASE17516.1|1275942_1276221_-|hypothetical-protein MEEMNTPKPNNNLALAIFTTVCCCLPAGIYAIIRAMKVNELYMMKQYDEAVLAANDAKKWSIIGIVVGLIGSILYFVCFGGLAALSTMAGSH >CP022040.2|ASE17937.1|1275557_1275926_-|DUF2752-domain-containing-protein MKCIGWLCLGASLLILYYFYNPVTTLWAPKCLLKVATGLQCPGCGIQRALHALLQGRFSEAIHYNYFLLFSGPYILSFGVRALLPKGKAKDSLTKVIEDKRLIWLYIILFFIWFIVRNILKI >CP022040.2|ASE17515.1|1275155_1275497_-|TM2-domain-containing-protein MESEKVNHMLMMLSSKIPAGSIPSVRTRLENTDISESEILALQSQMKDPLLSILLSIFIGTLGVDRFYIGDVGLGIGKLLTGGGCGIWWLIDIFLIVDATKQKNLELLSYYLR >CP022040.2|ASE17514.1|1274303_1275050_+|peptidylprolyl-isomerase MATRLKIKTTEGDIIIRLYDETPKHRDNFLKLAKEGYFNGTLFHRVIKDFMIQGGDPDSKNAPKGKMLGTGGPDYTIPAEFVYPQYFHKRGALSAARTGDEVNPEKESSGSQFYIVWGKTFKPAELKQMEHQMAMQQEQQVFNQLTREHHEEIMNLRRNRDRVGLQELQDKLIEQTKTTCKQQGKPSFTEEQIEVYTNVGGTPFLDNQYTVFGEVEEGLGIVERIQNCDTDRNDRPTEDVKIETVALL >CP022040.2|ASE17513.1|1273406_1274216_+|molecular-chaperone-DjlA MAIGKWIGGALGWILSGSMLGGLVGYCIGTMLDEAFAGDNRGGDRQNGYGEQSHFGGTRPFEEDRNSFLFSMLVLSSYIIKADGKIMHSEMEYVRQFLRHNFGEQAVSQGESILLKLFDLQKQQGPYQFKETIRKSCVEIHFHTSVSQRLQLLNYLVIIAKADGIVSPEEVVALKEIASYLGLSAQDIESMLNLESGAKASSNIEDAYKVLGISPSATDDEVKAAYRKMALKHHPDRVSTLGDDIRKAAEKKFQEINDAKERIYKARGL >CP022040.2|ASE17512.1|1272818_1273268_+|hypothetical-protein MMQLNLPPYQIRVREENGRKQIFDVLRRKYIALTPEEWVRQHFIHYLIEHKSYPVTLLANEVPLQVGEKKVRADSVLYDNQLRPRMIIEYKAPTIPLTQKVFEQISVYNLLLHVDYLIVSNGLDTYICKMDYENQTYAFLETIPDYQNI >CP022040.2|ASE17521.1|1282573_1283761_+|dicarboxylate/amino-acid:cation-symporter MQKKIKIGLLPRVIIAILLGLFLGYYLPDPAVRVFLTFNSIFSQFLGFMIPLIIIGLVTPAIAGIGKGAGKLLLATVAIAYVDTIVAGGLSYGTGTWLFPSMIASTGGAIPHIDKATELTPYFTINIPAMVDVMSSLVFSFIAGLGIAYGGLRTMETLFNEFKTVIEKVIEKAIIPLLPLYIFGVFLSMTHNGQARQVLLVFSQIIIVILVLHVLILIYEFCIAGAIVKHNPFRLLWNMLPAYLTALGTSSSAATIPVTLKQTVKNGVSEEVAGFVVPLCATIHLSGSAMKITACALTICMLTDLPHDPGLFIYFILMLAIIMVAAPGVPGGAIMAALAPLSSILGFNEEAQALMIALYIAMDSFGTACNVTGDGAIALAVNKFFGKKKETTVLS >CP022040.2|ASE17522.1|1284085_1285309_+|peptidase-T MEIVERFINYTKFDTQSAEDSETVPSTPKQLIFAKYLKEELEREGLKDVEMDEMGYIYATLPANTKKKIPTIGFISHYDTALDASGANVNARIVENYDGGDIQLNPNMVSSPRMFPELLEHKGEDLIVTDGTTLLGADDKAGIAEIVQAMCFLRDHDEIEHGDIRIAFNPDEEIGMGAHHFDVEKFGCEWGYTIDGGDLGELEYENFNAAGAKVFIHGVSVHTGYAKGKMVNASRLACEFNNMIPETEIPEETEGYQGFYHLIGIESRCEEAKLSYIIRDHDREHFEDRKRFMENCVKKMNEKYGEGTVELKMNDQYYNMKEKIDPNMHVIELVLQAMQQANVAPKVQPIRGGTDGAQLSFKGLPCPNIFAGGVNFHGPYEFVSVQVMEKAMQVIINICRLTAEFND >CP022040.2|ASE17523.1|1285526_1287797_+|sodium:proton-antiporter MSELPELVQDLALILVVAGFVTLLFKKLKQPLVLGYIVAGFLVSPHMSYTMSVVDKDDIQTWADIGVIFLLFSLGLDFSIKKILKMGASPIIAACTIIFSMMLLGVIVGHSFGWKEMDCIFLGGMVAMSSTTIIYKAFSDMGLTQQGFASTVMSVLILEDILAIVMMVMLSTVASGNSPDGVQLLGSIMKIGFFLVLWFVVGLFAIPLFLRSVRKILNSETLLIVSLGFCCLMAVISTQVGFSAAFGAFVMGSILAETVEADKIIRLVDPVKNLFGAIFFVSVGMLVKPDVIVQYAIPILLLVITILVGQALFGTLGYLLGGQTLKNAMRCGFSMAQVGEFAFIIATLGKSLGVISEFLYPVVVAVSVITTFLTPYMIRAAEPCYNILIKHLPKRWVRRLTHIQTNSAGESASSDNHWKVLMKKMILNTLIYGILSAAVIAIMFSAALPICRNLSIKWTGSHWIGNAVCGFLTILFIAPFLRSIVMKQNHSEAFKALWTDRRINRLPLTATILARVLIALSFIFYICNYLTRFKNALMIAVAVGLLILMLLSRWLKKRSITLERLFIQNLQSRDIEAQKQGKKKPLFANHLIDRDIHIANLELPDDSLWAGKTLYSLKLRNRFGVHISSILRGSKHINIPNGGTILFPGDKLQAIGNDEQLTKLSKAMKAELQPTITDIEKHEMKLRSFTISKTSPFIGKTLKDSGIRDEYNCMVVGVDEGQKNLTLITPSRSLQAGDVLWVVGEEKDLERILALG >CP022040.2|ASE17524.1|1288239_1290954_-|DUF2726-domain-containing-protein MDAKQCMIVDLERRGDKQMFITDQVDFIKKADNGNWMIRFLKSPRIFQYNQARILYFTHGEPVNLHEKGLYIGNKHITSAVELLRFSNKHYTFYYVTYSNGYSENLDGNNVYVTRTPIDMCGGSTWDYLRKLADETGLLAEDEESILSKQYDLIDLKRDNVPLAQYLGDKTKLATYRLPQLVYYPFGCNASQKKAVEAALTHQASIIQGPPGTGKTQTILNIVSNLLVQKKTVLVVSNNNSAVENVAEKLEKEGLGFLVAQLGSVKNKEAFVESQSGCYPNMEEWYLDNSKEVRKIAKDSLAAVSLGFDGQTRLAQLKAEYDALVTEKKYDEKLKMASGFDNDWLSEKHSSKIMKLLNLCKIMQEHGKTPSLLFCFKWVFLLGPRAYSLLKNNLLIVIEQLESAYYLTRKFEIEQEVDSIEQQLLSVDVKESAEELRKSSLQVLKQAIAKQYGSGRRTLFTRQDIKPRTETFLKEYPIVLSTTYSAKSCISKDFVFDYMIMDEASQVDIKTGALALSCAANVVIVGDDMQLPNVVSSEEEKALNAIRTTYNVDDRYNAVTHSFLRSCTEILKDAPTTLLREHYRCHPKIIQFCNQRFYGGELLPMTIDKGEEDVLQVIQTVKGNHAREHFNQREIDVIVQEVMPLCAGKGSVGIITPYRTQAEAINRVLGKDIASTVHKYQGRECDTIIMSMVDNLPTSFSDDKNLLNVAISRAKSQLYIVTSGNEMAQDTNLAQLISYVKYNNFAVKNSKIHSVFDLLYQQYTTERLAYQSEHIMVSDYMSENLVYNLIVKVLEELSWKNLAVVCHYPLAKIISDWSLLSNQENDFAKNSLTHIDFLIYNSLTKQPLMAIEVDGWLYHKDKVVQQSRDRLKDQILTKYSLIPYRISTTDTITAESLKEVFVSL >CP022040.2|ASE17525.1|1291296_1292217_+|hypothetical-protein MIIQMKRNKLTALIALIGIVAFTSCEDVKRPTPVRERINVTPPSREAEAISQLTASIDEISANLDAISSQEAMLCKTTEHANKKSKIIQQIRGLGALLKEKQNQIDKLLNEKVKKVDAPAVSNPTIDNLYKVIDFLSSQLKEKGDRVTQLEQVASRKDVTVDQLKYIVMNQTNSVDAMRYRFNMAALEREYAQLKAKEKQRIKEDKESDKVYYIIANKETLKEKGLLKTSLFSKKVNNNNVTKDLFTEANGKDLKTLTINSSSPKLLSQNPEGSYTLTENEDGTTTLTITDAEKFWNVSRYLIIQE >CP022040.2|ASE17526.1|1292303_1293119_+|Nif3-like-dinuclear-metal-center-hexameric-protein MRSVKIKEVIDALERFAPLPLQESYDNAGLQVGLTEAEVSGALLCLDVTEKVVDEAIRRECNLIVAHHPLIFRKLAQVTDANYVQRTVIKAIKNDIVIAAMHTNLDSAVGGVNYKIAEKLGLKDLRFFGRSKQVVNPQTGESVTGGDGVIGEFEEPLAADDLILLLKKKFDAECVQTNELLRREIRTIALCGGSGAFLLQDAIAAGADAFMTGEMSYHEFFGHEQEIQICVIGHYQSEQFTIEVLRDVIERECPSVKCYLSEINTNPIGYF >CP022040.2|ASE17527.1|1293280_1294102_+|hypothetical-protein MAKKDPKELPVEEKLKALFQLQTTLSGIDEKRALRGELPLEVRDLEDELEGLHIRIEKIEQDIKDYQNAITQKKGNIVDAQASLERYNKQLDSVANNREYDTLTKEIEFQTLEIELCNKKIKEAQIKVEEKRKDLEANRALLEDRQHALEEKRNELDEIMQETREEEGLLKEKAAELETKIEPGLLRSFKRIRRGARNGLGIVYVQRDACGGCFNKIPPQRQLDVKMHKKIIVCEYCGRILIDPELAGVKVDKTEEKPKKRRATRKKKEEEGE >CP022040.2|ASE17528.1|1294511_1295027_-|diguanylate-cyclase MEALEALLTRRSVRAYEERMPEQDLIAKVMEAGLYAASGKNMQTAIIVEVTNKEVRDRLSAINAEIMGVTSDPFYGAPVVLAVLADKSSPNHVYDGALMMGNLMNAAHALGLGSCWINRAKQTFEREDGKQMLKEWGIKGDYEGVGFCILGYAAKEGKTAPRKANRIFYVK >CP022040.2|ASE17529.1|1295039_1296221_-|putative-C-S-lyase MMKTYNFDEIIDRSGSGDLKHEALLPRWGRNDLLPLWVADMDFACPDFVVEALKDRLSHPIFGYTVEPEDFRPAIIDWIRAHHDWEVKPEWLSFIPGIVRGIGFVVNVFTELDEKVIIQPPVYHPFRLTPEANHRKVVFNPLRLREDGYYDMDFDNLAEVCDDKCRVLILSNPHNPAGLCWSEDTLRRLADFCYEHNIIVISDEIHSDMALFGNRHIPFASVSERAAQISITFAAPTKTFNMAGIVSSFAIVPNEELRNRFYGWLKANELDEPTLFAPIATIAAYRKGEEWRKQMLAYVEENVRFVEDFCREYIPGIRPLRPQASFLVWLDCHGLGLKHKELLNLFIDKAHLALNDGRMFGPGGEGFMRLNVGTPRSILRQALEQLAEAVNEL >CP022040.2|ASE17530.1|1296217_1297393_-|PLP-dependent-transferase MKKQTQAIHQPYKRRDAYDALSMPIYNAVAFEFDNAKVMADAFCGRIDAPDYSRVENPTVTNLEQRVKALTGAENVIALNSGMAAISNTLFSVVEQGKNVITSRHLFGNTYSLLTSTLSRLGVEARLCDLTDVEAVERLIDDNTCCLFLEIMTNPQLEVVDVRALTSIAHQHGIPVIADTTLIPFTQFSAKDLGIDLEVVSSTKYISGGATSLGGLVIDYGTFPSIGKRLLNEMLFNLGAYMTPQVAYMQTLGLETLDVRYRAQAGNALELAQRLRTLKPIHKVNYVGLEDNPYHQLAVSQYGETAGAMVTIDLESQEACFRMLDNLKLIHRATNLFDNRTLAIHPASTIFGLFTAEERAAMDVQDTTIRLSIGLESVDDLFDDIKQALEA |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP022040_3 | 1513606-1513702 | Orphan |
NA
Consensus repeat of CP022040_3
|
1 spacers
spacers of CP022040_3
>3.1|1513636|37|CP022040|CRISPRCasFinder AAGCTTACATTCACTTTGTTGAGTGCAGCCTATCTTA |
CRISPR arrays and Neighbor proteins around CP022040_3
The CRISPR arrays of CP022040_3 >merge|CP022040|3|1513606-1513702|CRISPRCasFinder TGCAAAGATAGTGCAAATGAGAACAAAAGAAAGCTTACATTCACTTTGTTGAGTGCAGCCTATCTTATGCAAAGATAGTGCAAATGAAAACAAAAGA >CP022040|3|3|1513606-1513702|CRISPRCasFinder TGCAAAGATAGTGCAAATGAGAACAAAAGA AAGCTTACATTCACTTTGTTGAGTGCAGCCTATCTTA TGCAAAGATAGTGCAAATGAAAACAAAAGA
>CP022040.2|ASE17682.1|1512693_1513578_-|tRNA-threonylcarbamoyladenosine-dehydratase MQNQFSRTQLLLGKPAIDTLNGSRVAVFGVGGVGGYAVEVLARSGVGAIDVIDDDRVCLTNVNRQLLATTRTVGKHKVDVAEERIHSINPRCIVRKYQTFYLPDNADQFDFSHYDYVIDCIDTVTAKLDLIYRCHEMNIPLLSCMGAAYKLDATQFRVTDIFKTINDPLAKVIRKKLRKTKIKHLKVVYSPEEPLESIEQPEISCRFHCICPDKDMRKCTDRHTIPSSNAWVPAAAGLIAGGEAVKDLVNLANTMRIRPEDEATSEPARIAHERAAKMLEEHKRLKAEKAAADK >CP022040.2|ASE17681.1|1511616_1512210_-|TetR/AcrR-family-transcriptional-regulator MKNREQTESRILEAVASIVESEGFEKLGINTIASKANVSKMLIYRYFGGLEELIAQFIMQKDYWANTGTVIINPQSVGDSIKNMFRKQIEQLRNDVTLRRLCRWELSCNNTSIEQLRDKREENGCSLIKLVSTLTGCSNTEVASLASILSAAISYLALIEDQCLSYNGISLQTDKGWNEIMKGVGMIVDLWIKSIQE >CP022040.2|ASE17680.1|1510591_1511620_-|nitroreductase MRKVIIVFVIFMGFMDNIMAQNPDYLFMIENACKAPSGHNTQPWLFKIRESEIDIYPDLSKELPVVDPSHRELFVSLGCATENLCIAAQEKEYQTEVKVIKDSFIRVLLTKDKKKQFESSLFSQIAVRQTNRSVYDGKEIPKDSINLLKATSNDPSISIYFYKRGTVDYEKVANMVYAGNSLQMNNEAFKSELTKWMRYNKKYQNNTRDGLSYATFGAPNVPLFLAKFIMSKAINAKTQNKGDRKKIASASHFVLFTTKDNTVEQWVALGRTLERILLRSTQMGIANAYLNQPNEENDITKEMVKQLQISNEYPTILIRLGYGKKMPYSLRRDYRSCILPAD >CP022040.2|ASE17947.1|1509630_1510056_+|hypothetical-protein MQIQGTGLSKEFKPYELCCDLGYNITDNLYADLRYENAIALFKENGVKDYAHSHIVGLNLGYIVYHTENCNIGAQIGYGSNMKRKNSDWKYDYYEAMAFTDLGTCKIKPRFGLGIRHYNSRTNLYKDRTVVFVSIGFGINW >CP022040.2|ASE17679.1|1508866_1509358_+|hypothetical-protein MLSIPDEYRKDKQLQCSHCGEIFNNYMIKKKKQQSKVDKFLITFAIFLTLWGVKACIDSQPRSSKNSVDNQYESSISIGDNVEVIKTTIGLIDEASVDEFGNAAVAGDATGVDQLIITGKGRQIPAGEQGKVIKRTVSADRIRLNDGTAWWIPISADLKKINE >CP022040.2|ASE17678.2|1507639_1508824_+|hypothetical-protein MIQNVFDELTKQLRKNIEAQDYYKSIVTEIRIGYNANANVIQLMHPSQVLDLRTEDLIFPDREWDIDKEFEEAIAIYFTFSMKADKKELQLFQELDIRKYFKEIKDFGIRAYAIDCGKDYNKAATYVNIILEKVYNVSEGSTITIETSDFNNIESAKICKESLTVPYKITGIATRVKEQKEDTLSNETPLPEISDKTYNRLLFGAISIIILILIVFIVNMTQKETPTTDDIRAATQDTTLYVDSASTAPFGVVSSNSETSISTQETASEPEQNQVIGKWIETSYGNNGATWLLEKEASTGRLLLTSRLGDIIQSYNCTKRRIKGNNSYIFHDPNVSGNNGLFSMRKGTTIYYISGFDNNSDAILVPTQDGSAKIYSNDIDNRYYELFSDLSSVN >CP022040.2|ASE17677.1|1506657_1507317_+|hypothetical-protein MKLNKILSNLILLLLILLVSCSTDSFQDEDISLRNLHEGDLMFVVKETSNPITDATQGINGLKIDHVAIFHHTDSADYALEAYGKAVSLTPLTNFLNRTKGKEGKPLIAVGRVIVDCDMNTSMKRALSYLGRPYDRFYMPDDKEIYCSELIQKSFVDHHGLPIFSTIPMSFHDNNGKILDAWTQFYAFYHHEVPEGEPGTNPGQLSRDKAVKVNYKLEK >CP022040.2|ASE17676.1|1505154_1506546_+|asparagine--tRNA-ligase MKRTKIVDALACTDFGKDINVKGWVRSHRSSKAVDFIALNDGSTIKNIQVVVDPSTIDEDKLKSITTGACISVIGTLVESQGAGQTSEIQCKEIEIYGLCPSDYPMQKKGQSFEYMRKYGHMRLRTNTFGAVFRIRHNMAIAIHQYFHEHGFYYFHTPLITGSDAEGAGNMFQVTTLDLDRVAKGGEVDYSADFFGKRTNLTVSGQLEGELGATALGAIYTFGPTFRAENSNTPRHLAEFWMVEPEVAFIDKDELMDLEEDFIKYCVRWALENCKDDLEFLNKMIDKELIARLEGVLKEDFARLTYTEGFEILQKAAADGVKFEFPITHWGMDLSSEHERYLVEEHFKRPVIMTDYPSEIKSFYMKKNEDGKTMQGTDVLFPRIGEIIGGSVREESYEKLVEEIESRGMKRDIYDWYLDTRKYGTCPHGGFGLGFERLILFVTGMQNIRDVIPFARTPKNAEF >CP022040.2|ASE17675.1|1503469_1505014_+|pseudouridine-synthase MAEEFENKEVQSEQNENSRDGYSAAEQGGYQREYRGTGRTQRPRIHSQRAYSSDKANSSNDEGGFRPEGFGSGLQSAGRPQQGGYRPRQNSYGGGYNNNRGGYQSRPQQGGYRPRYNSNGEEGGYQPRQQGGYNRGGYQSRPQQGGYRPRYNNDENGYQPQAYRPRYNANNGAEGEENNNYQANQGGYQPRQGGYQPRQQGGYQSRGGYNNNRGGYNNNRGGYQSRGGYNNNRGGYNNRGGYNQGGYRQHSTDYDPNAKYSLKKRIEYKEENYDPNEPIRLNKYLANAGVCSRREADEFILSGAVTVNGEVVKELGSKVMRTDEVYFQDKLVSLEKKVYVLLNKPKDYVTTSDDPQQRKTVMDLVKGACPERIYPVGRLDRNTTGVLLLTNDGDLASKLTHPKFLKKKVYHVFLDKAITANDLQKISDGIELEDGEIKADAIEYADPQDQTQVGIEIHSGKNRIVRRIFESLGYRVVKLDRVQFAGLTKKNVRRGDWRFLTEKEVDMLRMGAFE >CP022040.2|ASE17674.1|1501983_1503330_+|adenylosuccinate-lyase MTLDALTAVSPIDGRYRSKTESLADYFSEYALIRYRVRVEIEYFITLCELPLPQLESFNSALFEQLRDIYRNFDEASAARVKEIESITNHDVKAVEYFIKEEFDKIGGLDDYKEFIHFGLTSQDINNTSVPLSVKEALEEVFYPQVEELIAQLKEYAEAWEDVPMLAKTHGQPASPTRLGKEVEVYVYRLSEQLATLRNCKMTAKFGGATGNFNAHHVAYPQHDWRAFGNRFVSEKLGLEREQWTTQISNYDHLGSVFDAIRRINTIIIDLDRDFWMYISMEYFKQKIKAGEVGSSAMPHKVNPIDFENSEGNLGVANAILQFLAQKLPVSRLQRDLTDSTVLRNVGVPVGHSVIAIQSTLKGLRKLILNEEKLREDLENTWAVVAEAIQTILRREAYPHPYEALKALTRTNEKMTEETIHAFVQTLNVSDSVKAELMAITPYNYTGI >CP022040.2|ASE17683.1|1513777_1514638_-|GntR-family-transcriptional-regulator MSKIKLGAYNTLTVLKIALREGNGDPFGIYLDGGPAGEILMPQKYVPEGTEIGDELEVFVYLDQDERPIATTEEPLAQVGDFAYLECSWVNEYGAFLSWGVMKDLFCPFREQKKRMTIGNSYIVYIHLDEESYRLVASAKVEHYLDEQPRGYKHGQEVDLLIWQKTDLGFKVIVDNKYPGLIYEDQVFQYVHTGDRLKGYISTVRRDGKIDCTLQPTGQQHAEGFAEVLLQYLKDNDGVCDLGDKSEAEDIKRRFQVSKKVYKRAVGDLYKRHLITVDPLSIRLVK >CP022040.2|ASE17684.1|1514734_1515442_-|ABC-transporter-ATP-binding-protein MIDIKNITKSFGSLQVLKGIDLRIEKGEVVSIVGPSGAGKTTLLQILGTLDKPDSGSVVVDGIDVGSLSAGKLSDFRNQHLGFVFQFHQLLPEFTALENIMIPAYIAGKKNKDARQRAEELLEFMGLSDRANHKPNELSGGEKQRVAVARALVNNPAVILADEPSGSLDSKNKQELHQLFFDLRDKFGQTFVIVTHDEGLAQITDRTIHLKDGLIESLPQPLRKEGSATIKIENE >CP022040.2|ASE17685.1|1515526_1517644_-|cation:proton-antiporter MLNLSQYFPITDPTLIFFVVLLMILLSPIIMGRLRIPHIIGMVLAGVLVGKYGLNILGRDASFELFGRVGLYYIMFLAGLEMDMEGLKKNRNRVMIFGMLTFLIPFAMTYFMGVSLLGYIPLASLLLAAIMASNTLIAYPIVGRYGLTRHTSSTLSVGSSMMALFMALIVMASIVNSFHGNGGILFWLLFILKFVAYCVGLIMVIPRVTRWFLRRYSDAVMQFIFILAVVFLSAALSDAVGLEGIFGAFMSGLILNRFVPKVSPLMNRIEFTGNALFIPYFLIGVGMLINVRLLFAGSKILWVVFCIVFFGTLGKAVAAYVAARIFRMSWLAGHMMFGLTSAHAAGAIAMVMVGRRLEVAPGQYLFGDEVLNGIVIMILFTCVISTVITERAAQRLRLQEKEDQNMMKNLDDEKILIPVKYPEYSDNLITMATLMRNPRLKRELVALNVVYDDVNMRHNQAEGQRLLDHLCHLASASDVPMVTQVRVAANIANGIKHAFKEFQASEILMGLHFHKEINRSFWGEFTRSLYNGLSRQIIVTRILQPLNTIRRIQVAIPSRAEFEPGFYRWLERLARMAGNLECRIAFHGRNETLQLVNEFIRNRFPSVRAEYEEMAHWKELPTLGSQVREDHLFVIVTARKGTISYKTAMERLPEELNKFIKGKTIMIIFPDQYGSEMDDMTFAQPQHTEERSAYEAVREWIHNKV >CP022040.2|ASE17686.1|1518382_1519396_+|aspartate-semialdehyde-dehydrogenase MKVAIVGASGAVGQEFLRILAERNFPMDDLVLFGSERSAGKKYTFKGKEYEVKLLQHNDDFKDVDIAFTSAGGGTSAEFAETITKYGAVMIDNSSQFRQDNDVPLVVPEINAEDALNRPRGIIANPNCTTIMMVVVLNPIDKLSHIKKIHVSSYQSASGAGAAAMAELQQQYKELVETGEVKTIEKFPHQLAYNVIPQIDKMTENDYTKEEVKMFNETRKIMHSDVRTSATCVRVSSLRSHSEAVWFETERPLSVEEIREALKAAPGVTVVDDPQNYVYPMPLESAGHDDIYVGRIRKDLADDNGNTLWLTGDQIRKGAALNAVQIAEYLIKVGDVK >CP022040.2|ASE17687.1|1519599_1519884_-|antibiotic-biosynthesis-monooxygenase MIRLNAFFKVKAGVTTAQVKALTDELVELSRKDEGNKGYDLFESTTQPGVFLFCETWADKACLTRHARSEHFTRIVPELEKLTDGGLSIEQFER >CP022040.2|ASE17688.1|1519912_1520320_-|FMN-binding-domain-containing-protein MLKKQFVSVAAVAVLTTGVAFAAVQQDKVMYKQADGTYVVNTTSLCSNVKGFKGATPVEVYIKNNKVIKVEALPNREGPKFYDKVKQGLFPKFNGMKLSKAAKAESLDGVTGATYTSRAVKENIAAAVAYYKKNK >CP022040.2|ASE17689.1|1520541_1522365_+|lipoyl-synthase MKYILLPKPDTIHQLPFYFAVEEYVARHYTDDDYFMGWRVNPTVMLGRNQLIDNEVNTDYCKEHKIDIFRRKSGGGCIYADKGCIQFSYISCAVNANEAFADYMQRMADLLKGLKIDAQLSGRNDILINGTKVSGCAFYQLSNRSVLHNSLLFDTQLDHLSNALTPAKEKLQSKGVASVRQRVTNVATYIQLDILAFMDYVRQEMCGTEVLELTEEDMKGVAEIEKELSSDDFVYGKNPKYSLVRKHRFEGVGTLEAHIELKNNIIGSINMVGDYFLLGDIDHDFLSLLKGCEFTREAVEERLENIDLSTIIRGLKLRQFLRLLFGREPHVMKPKWLKIDLTSKKSTGETAGILAKHHMNTICTSGLCPNRSECWMARTATLMIGGDICTRKCRFCNTLSGRPRLLNPDEPRRVAESVKALKLRYAVITSVDRDDLPDYGAAHWIKTIEEIRRLNPDTKIELLIPDFMGKADLIRQVMATHPHVAGHNMETVRRLTPSVRSVARYERSLEVLREIANCGITAKTGFMLGLGETHDEILETMDDILSTGCQRLTLGQYLQPTAEHLPVKAYITPEMFAEYKRIALEKGFKHVVSGPLVRSSYHAAEGV >CP022040.2|ASE17690.1|1522539_1523703_-|OmpA-family-protein MKKLLIVLALAGVSMTGFAQDEVLTEKYSVATNSFWSNWFVQLGADWNAWYSNQEHGRDAAISPLKDFRSKPGAAFAIGKWFTPGIGLRTKIQGIWGKRVGADSNPASQLDNSNKYWIAQEQVMFNLSNLLCGYNENRVWNLIPFAGAGVGRSMSANRYAMGLSAGLQSSWRVSKGMRVYLEAGWNRYESDLDGAAYANNERRGWESHDNNLYAEIGLNFNIGKGTWKKSPDMEAINTQHQAALDALNARLQDAEEENTRLRNELANQKPVETVSESVKQLVTTPVSVFFEINQSTIASQKDLVNVQALAKYAKDNNNNLLVTGYADSATGSADYNQKLSERRATVVANELVKMGIENNKITTVGKGGVETLSPISFNRRATVQITE >CP022040.2|ASE17691.1|1524231_1526124_-|DNA-mismatch-repair-endonuclease-MutL MSDIIQLLPDSVANQIAAGEVIQRPASVIKELVENAIDAGATHIDVLVVDAGRTSIQVIDDGKGMSETDARLSFERHATSKIRKADDLFSLRTMGFRGEALASIAAVAQIELKTRMESEDLGTHLSIAGSRFTGQEPCSCPVGSNFLVENLFFNVPARRKFLKSNTTELNNIITAFERIVLVYPQISFTLHSNGTELFNLRACSYRQRIVEVFGKRLNQDLLPIDVDTSLCHIHGFVGKPESARKKAPHQYFFVNDRYMKHPYFHKAVITAFDRLIPQGEQVPYFLYFDVPAENIDVNIHPTKTEIKFENEQAIWQILLAAVKEAVGRFNDIPAIDFDTEGKPDIPVFNPNVGMSAPKVDFNPAYNPFKQTSQPAKSSVPDGWEELYADLGSGGEIRQSKLFKQREDEMISSSLGTVSDTVEEGTIIPSAATQSAAESLIEDKAPSHYQYKGCYIMTAVKSGLMIIDQHRAHIRILYEEYLHQLSEHKVHSQKVLFPEMVQFSVSDQVVLDQILPEMAEMGFQLDSLGGGSYAVNGVPAGIEGLNVVALINDMVASAMESGTSAKEEIDQALALSLARNAAIPQGQVLSSMEMDNIVNELFACSNVNYTPSGEPVIAIMKQQDIEHLFDS >CP022040.2|ASE17692.1|1526132_1526423_-|ubiquitin-carboxyl-hydrolase MLFFQSSRPRRFHHEYMYVDERKELLNDIEQRARRELNGEEVPEGKYREELQRKISGSLKPEVLRHRGNRFTAMWVSLILSAGVIALLTLFLFFAL |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP022040_4 | 1623736-1623868 | Orphan |
NA
Consensus repeat of CP022040_4
|
1 spacers
spacers of CP022040_4
>4.1|1623782|41|CP022040|CRISPRCasFinder CAACTCCTTTAACTCCCAAATAGGAACAATGGAAAATCAAC |
CRISPR arrays and Neighbor proteins around CP022040_4
The CRISPR arrays of CP022040_4 >merge|CP022040|4|1623736-1623868|CRISPRCasFinder ATAAAGACACCCACAACGCATTTGTCTTAACTCCATTTCACTCCTTCAACTCCTTTAACTCCCAAATAGGAACAATGGAAAATCAACATAAAGACACCCACAACACATTTGTCTTAACTCCATTTCACTCCTT >CP022040|4|4|1623736-1623868|CRISPRCasFinder ATAAAGACACCCACAACGCATTTGTCTTAACTCCATTTCACTCCTT CAACTCCTTTAACTCCCAAATAGGAACAATGGAAAATCAAC ATAAAGACACCCACAACACATTTGTCTTAACTCCATTTCACTCCTT
>CP022040.2|ASE17760.1|1622239_1623577_+|HD-domain-containing-protein MNWQQLISNKRLGQEERHALRHDDRSEFKRDSDRLIYSAPFRRLQNKTQVFPLPGSVFVHNRLTHSLEVASLGKSLGDDVARRLIEIHPELRGTLFEEIGTIVQTACYAHDMGNPPFGHSGEKAMQAFFTEGPGISLKEKVSPHFWEDITHFEGNANAFRLLTHRFLGRREGGFVMTYSTLASIVKYPFSSTYASKHGKFGFFATEEETYKKIADELGIIQKDSSEAGICYVRHPLTYLMEAADDICYEIMDIEDSHKLKLLSFEETADLLLGFFDEETRKSIRKRIEEEGVTDQNEQVVYFRACAVGLLEAECVNVFVEHEEEILNGTFEGSLIKHISELPRQAYKHCTEVSVDRIYRSKAVLDVELSGYKIMETLMKALIGAAVEPEHFHSQQLIRSFSSQYDIQSPCLETRIMAVLDFISGMTDIYALDIYQKINGISLPLI >CP022040.2|ASE17952.1|1621576_1622014_-|dUTP-diphosphatase MIQIKVINKGHQQLPAYATSQSAGMDLRANIDAPIVLQPMERRLIPTGLHIALPVGFEAQVRPRSGLALKHGLTVLNSPGTIDADYRGEIMVLLINFSDEPFTINDGERIAQMVIARHEQAEFVEVEELDETERGEGGYGHTGVK >CP022040.2|ASE17759.1|1619818_1621555_-|tetratricopeptide-repeat-protein MFNNVILKVTKMRRTLLALALLGGSLAMHADREDSLQSYNYFFLEAIRQQEMGNLTAAFDLLCHARDLNPQAPEVYYQLAAFYVDMKKDAVAREYFEKAASLDPENSAYQEKLGKLYVSQKDYPNAIKAFERLYESNKTRSDVLQILYQLYGSQNDYKMMIKCLERLETLEGTNEQISLSKMQLYEQMGEKRKEYDELKALVDSHPLDLNYRVMFGNWLLQNGKKKEALQKYRDVLKEDPDNSLAKLSMMDYYNNIGDKATVKTILQELLQSPKTEKEAKLELLRQVITSSQKDNTPDSTEVMRLFSVALAVPQEDADIYMLKAAYMTLRKQPKAAVNRVYEQALEVEPDNSRARIALIQNIWDTQDYDKVIAICRPAIEYNPDEMAFYYFQGMAQFQKHDGDAALETFRKGVGQIKPDSDPSIVSDFYAIMGDILHEKGRNKEAFQAYDSCLQWKADNVAALNNYAYYLSEANENLTKAEQMSYKTIKAEPNNSTYLDTYAWILFQQKRYEEAKIYIEQAIRNDSTLSNVVKEHAGDIYAQTGDIEKALEFWRKALEGNKENATLRKKIELKKYIAE >CP022040.2|ASE17758.1|1618910_1619822_-|DUF4292-domain-containing-protein MKKTKILLVSMTALLLASCGTKKAVIQQKPVQDNKAAQQTAPATKDSKMQSVVVMQRVADNALYQKNLVSNLTFTLNDGHKDITVPGILRMRKDEVIRLQLLIPILRSEVGRIEFAKDYVLFIDRIHKQYVKASYDEVGFLRDNGINFYSLQSLFWNQVFIPRQQKVSEADLSQFAVDESKAQTEGSTLISLKDGKMDYKWSVNPKSNQIVLTTVTYNSNTHGASKLSWSYDDFKAFGSKLFPASQMLTINTPKFGQKPAKTLKASFDLESFSDISDWEVFTTPSDKYTKVSVEDILGKLMNF >CP022040.2|ASE17757.1|1616885_1618910_-|peptidase-M23 MKRLSLLFILSIMCMLSVSAQHSRKHRKQQKTAVAQSVEPQTTVKGKGKKAVNAINNNHAVAPATPVKGQKPIVEKLQTQPQPKGKQQQGKLQHQQQLKGQQPQLKGQPAQVKGQQVVRGKGAVRNAHVGKKGAKGPAYVTTEEIKGLQQQNLKLQKEISEHEEEMKVKQKDVDDRLQKIVRLDTEIGQHQRTIDTIATDIKGLDSNIGILKGQLASLEAQLGERRARFIRSMRYMARHRSIQDKLMFVFSAKNLTQMYRRLRFVREYAAYQRAQGEQLKAKQMQVDEKHTQLKQVRVNKSNLLYKDRQVHAQMERKRVEQQTVVSSLQNDQKVLQGVIAQRRQQQQALNAQIDRLIQIEIQKARERAIAEAKAQAAARAAAAKKRAEELARKKAAAEAAARENARRIAEAKEREAKAKAAARAAAEAAEKARQEAAARAAAARAAAEKARQEALQKERAAARERAIRKVEEQEAAAKAAQEKAEARAAAEKARADQMAREAEANRVAAERKADADRERAAREAEAARASAAENNDMLSSADRAITGNFANNRGRLPMPLSGQIVSHFGQYNVAGMSNIRLNNDGINIKGAPGSAVRSVFMGEVSGVFMAGGMSVVMIRHGIYISVYANLGSVGVSKGQKVGTGQTIGTVGKTGILQFQLRKETAKLNPEQWLR >CP022040.2|ASE17756.1|1614356_1616165_-|long-chain-fatty-acid--CoA-ligase MQTIGHLSVLIHEQAKKYGAKPAITFRNFGSLDWKTVSWNQFSMRVKEVSNALLNLGMKPQETIAVFSQNCIHHLYTDYGAYGVRVISIPFYATSSEQQIQYMIQDANVKFLFVGEQEQYDKARRIQSLCPTLERIIVFDSSVRLSQHDPNSIYFADFLKLGEGFPREAEVEERLAQANYDDICNILYTSGTTGESKGVILTYKMYQAAMDANRKSVPVNEKDRVINFLPFSHVFERGWACLSLAAGAELIVNTYPKEIQQSMRETHPTSMASVPRFWEKVYVAVKERMDKSSIVQRKLFYHALNVGRKRNIQYLARGKRVPLTLEMEYKFVNKTVLSLVRKQLGLENPHLFPTAGAYVSPEVEEFVHSIGINMIVGYGLTESLATVTCDHVGQPYTVGSVGRPLEGIDIKISDEGEVMLKGPTIMPGYFRRDTTNAEAFDKDGYFHTGDAGYMKDGELFLKERIKDLFKTSNGKYIAPQMVEAMLLVDKFIEQVSVIADQRKFVSALIVPEYSVLEEWARENHIEFKDREELCQDKRVNEMMRERIETLQQRLASYEKIKRFTLLAHHFSMENGELTNTLKLKRSVVNRRYHDVIEKMYEE >CP022040.2|ASE17755.1|1613069_1614185_-|peptide-chain-release-factor-2 MITADQLKDIQERTEALHRYLDIDKKRIEFEEEQLRTQAPDFWDDPARAQEQMKKVKDIEKWVKDYDKARALADEVQLAFDFYKDELVTEEEVDTAYAKVLKVIEGLELKNMLRQEEDPMECVMKINSGAGGTESQDWASMLMRMYMRWGEAQGYRVTISDIQEGDEAGIKSVTMKFEGGEYAYGYLKSENGVHRLVRVSPFNAQGKRMTSFASVFVTPLVDDTIEVYVDPARVSWDTFRSSGAGGQNVNKVESGVRLRYQYEDPDTGEQEEILIENTETRDQPKNRAKAMQLLKSQLYDRAMKKRMEEQAKIEAGKKKIEWGSQIRSYVFDDRRVKDHRTNYQTSDVDGVMDGKIDDFIKAYLMEFPTEE >CP022040.2|ASE17754.1|1612167_1612665_-|CYTH-domain-containing-protein MSGLEIERKFLVHKNMDWKKHASSCSHMQQGYFAAVNTVRVRIRDDKGYLTIKGPSRTGGLSRYEFEKEITLEEAQQLMLLCEPGVIDKHRYLVPFEGHTFEIDEFHGDNDGLVLAEVELGSEDESFDKPDFIGLEVTGNRHFYNSQMRRNPFKLWRDIVPEEYR >CP022040.2|ASE17753.1|1611815_1612208_+|antibiotic-resistance-protein-VanZ MSITKHILTKYPFSCIIVIGTWILCFMTIPETPLSSVRFIDKWTHSLIYLVLGLSISLEYLRNTKQPSPKFIIVWVWFLPIIMGGLIEVLQSYCTNGNRSGEWLDFFADAIGSTIAVLIGILLVRYRAKA >CP022040.2|ASE17752.1|1610381_1611743_-|sodium-dependent-transporter MSEQKRAKFGSKLGMILATAGGAVGLGNVWRFPYMTGQNGGAAFILIYIGCILLLGLPCMISEFIIGRHAASNTARAYTKLSNGSVWKWVGYLGVLTGFLITGYYAVVSGWCLQYGVASVMNHLHGTPDYFKSYFTDFSTNPWKPVLWTVMILLFTHYVIIHGVRNGIERASKILMPALFVLLVAIVVASCLLPGASKGVEFLLKPDFSKVTGDVFLGALGQSFYSMSIAMGCICTYASYYSRHTKLLNSAVQIGIIDTCVAILAGLMIFPAAFSVGVSPDSGPSLIFITLPNVFEQAFASMPIVGYIISMAFYLLLSMAALTSLISLHEVSTAFFQEELHISRPRAAMIVTAGCSLIGAVCSLSLGDWSFLKVAGVDLFDVFDFVTGQIFLPIGGLLTCLFIGWYVPKKLVKDEFTNWGTTRGIFFGAYYFLIRFVCPLAILAIFLHQLGVF >CP022040.2|ASE17761.1|1623940_1624936_+|esterase MENQYKKTFPDLMVGKKIIYVHGFMSAGSSHTVQILRDYMPEATVIAPDLPIHPEEAMELLRNLVNTEKPDLIIGTSMGGMYTEMLYGVDRICVNPAFQMGTTISETNMMGKQVFQNPRQDGVQEVIVTKALVKEYKEITEKCFSQVTEEEQQHVFGLFGDADPVVHTFDLFNEHYPQAIRFHGEHRLIEKAVFHYLMPVIRWIDDRQEGRERRTVLIDQNTLTDGYGKPKSSLNKAYEFLLDNYNVFFVCPAPTNNPSTITEQQAWIEDAFSAPAWNHTIFTNQPQLLYGDYFISSTEHDDFLGTSLLFGSEEFKTWEEIITFFERLGGQ >CP022040.2|ASE17762.1|1625203_1628650_+|DNA-helicase MAHELFSRIADILSAPSEAQALIMHETLVIACHEGLKNTRHGFGNLSSQVESLCRQHNIAPQDIVAIQKMRRHSNSNAPILPEDVAYDCRALAIFVSAVVQEAIPSFLVGKIPARGRTTENIQITNYRYIRCIVREWDDSTIQVAVTNQDSSEELLTVDYMNTPDYIDFSYLRPMLRERMQLNLLDCTVTRKKVVPRLIVVEPDYLIDISTIANCFETYGHHPLLFTVNRLTPRLSNKHIVLGNFAGSALDDIINHPAGYDIKETFRSNFKEKALDYATCPDFDAASFKQDAERQVENIKGIVDEIFQTFDREKAILEPSFVCERLGIQGRVDLMTTDLKLLVEQKSGKNTFIERKYKNPHGSLHVEKHYVQVLLYYGILQYNFQLSPKNAHIQLLYSKYPLPDGLLEVEPLQKLIREAIRFRNQAVATEFWMADNGFDRMLPLLTPQTLNVEKQNDNFYNRYLLPQLTETLAPLHQLNDLERAYFTRMMTFVIKEQLVSKVGVQEGVGNSNADLWNMPLAEKKETGNIYTELTIIEKGRSSSFNGYDTITLAVPQQGEDFLPNFRRGDMIYLYSYKKNEAPDVRQSILFKGSLQEIHGDSITVHLNDGQQNPDLISGDYFAIEHAGSDIGGTSAIRSLYTFITSNEERRQLLLGQRVPCVDKSLTLSHSYHPDYDEIILKAKQAQDYFLLIGPPGTGKTSQALQFLVREQLAGNIYSQPSSAYSAEDSKHNKPSETINTQHSTPNTQTAILLLAYTNRAVDEICNMLTENDIDYIRIGNEFSCDPKYSDHLLKEVLDDNATLNSIKSTLADAQIVVATTSTMNSNAALFNIKHFDLAIIDEASQILEPNIIGLLTSQHRGGRAIRKFILIGDHKQLPAVVQQSDTEVLIEDETLKAIHLNSCTNSLFERLILTERAAGRTEFVGTLHKQGRMHPDIADFANRKFYAREQLECVPLAHQLEQTLAYNETSEDETDDVLKAHRMIFIPSKPCRQLNISEKVNTEEARIITDLLRRLYRQLGKNFDPQKSVGVIVPYRNQIAMIRKEIEKLGIPELEEISIDTVERYQGSQRDIILYSFTIQSRYQLDFLTANTFYEDGQPIDRKLNVAITRARKQLILTGNEQTLRHNQLFAELIDYIKEKGGYYAEKA >CP022040.2|ASE17763.1|1628834_1629323_-|hypothetical-protein MRKSIQKWTYALVASVFALVMCLSLSACGSDDDNDVNNGISPVLYSDFGGEIGVNYPLGISGKFVGFSIPKSQAGKIVDLTKDGDWVAGGSVVGGLYRYDDHFFQKGSYVYLLRTGANEIELRYKYIWKEGTATRTIEGNYKNVKMTTHQDAIDWAHRQGLH >CP022040.2|ASE17764.1|1629955_1631776_+|arginine--tRNA-ligase MKIEEQITVAALAAVKELYGTEVPEKMIQLQKTRSDFEGNLTLVTFPLLKTSRKKPEDTAQDLGEYLKKNCKAVADFNVVKGFLNLVIAQAAWTELLNDINADEKFGEKRVTDESPLVMIEYSSPNTNKPLHLGHVRNNLLGWSLAQIMEANGNKVVKTNIVNDRGIHICKSMLAWQKWGNGITPEQAGKKGDHLIGDFYVLFDKHYKEECKQLQEQYEKEGLTAEEAKEKAEHEAPLIKEAHDMLVKWEANDPEIRALWEKMNNWVYAGFDETYKALGVGFDKIYYESNTYLVGKKKVEEGLAKGLFIRKEDNSVWADLTNEGLDQKLLLRKDGTSVYMTQDIGTAEMRFNDYPIDKMIYVVGNEQNYHFQVLSILLDRLGFKWGKDLVHFSYGMVELPNGKMKSREGTVVDADDLVASMIENAKSLSEDKVNKLEGITEEEKNEIARIVGMGALKYFILKVDARKNMLFNPEESIDFNGNTGPFIQYTYARIRSILRKAEAQNITLPASLNDDAPLNEKEIALIQKLNDFGAAVAQAGIDYSPSGIANYCYELTKEFNQFYHDYSILNADTEAEKITRLMIAKNVAKVIKNGMALLGIEVPERM >CP022040.2|ASE17765.1|1632006_1632474_+|ribonuclease-H MKELVNPPTNRNDTVLPLPLEVRCPSWAVDAACSGNPGPMEYQCVDLQTGARVFHFGPVMGTNNIGEFLAIVHALALMEKQGIKDKVIYSDSYNAILWVNKKRCKTTFVRNAETEELHQIIARAEHWLQTHKVTTPIIKWETKQWGEIPADFGRK >CP022040.2|ASE17766.1|1632809_1633667_-|enoyl-ACP-reductase MSYNLLKGKRGVIFGALNEMSIAWKVAERAVEEGATITLSNTPIAVRMGTVNALSEKLNCEVIPADATNVEDLENVFKRSMEVLGGKIDFVLHSIGMSPNVRKHRTYDDLDYKMLDTTLDISAVSFHKMIQSAKKLDAINDYGSILALSYVAAQRTFYGYNDMADAKALLESIGRSFGYIYGREKHVRINTISQSPTMTTAGSGVKGMDKLFDFADRMSPLGNASADECADYCIVMFSDLTRKVTMQNLYHDGGFSNVGMSLRAMATYEKGLDEYKDENGNIIYG >CP022040.2|ASE17767.1|1634128_1634749_-|DUF2238-domain-containing-protein MIDKTKLMLVLLVMIVTVITCIHPIYPNEQTLQHIGTVLLLIPLTMDVFRKQLPMSAFIGIVGFTLLHVIGARYIYSYVPYKEWAVSLGLVEKGFFHDPRNHYDRLVHFSFGALMFPYFVYLCRKWVKQQSFVAVVMAWMMIQTGSLIYELFEWLLTIVMTAEEADYYNGQQGDMWDAQKDMALALVGSTGMFLVYAVRSLIRRGK >CP022040.2|ASE17953.1|1635249_1638627_+|helicase MARIYDNIKTKFTEGLQGIITNVGVKRVDFCVGYFNLRGWNLVVDQMDTLTGNYVYENDKHTFRKCRLLIGMHRPTEELIRQLYTDQPLPDANYVSQCKLEIARDFRRQLQLGFPTKQDEFTLRRLSAQMKEEKVCVKLYLREPLHAKLYLAYRPDDNFNKIQAIMGSSNLTYSGLTKQGELNAEFGDSDSAEKLAYWFDERWEDKFCLDITKELIEIIDNSWAGDKDIPPYYIYLKTAYHLSEEARSGIKEFTIPAEFKNCLFDFQQTAVKIAARHLNNEKRGGAMIGDVVGLGKTITACAIAKMYENTFGSNTLIICPANLQDMWEKYRKQYDMKADIMSMAKPIDVDNARYYKLIIVDESHNLRNSQGVRYRNIKDLIQKQDCKVLLLTATPYNKQYKDLSSQLRLFIDDDTDLGIRPEAYIRSIGGERKFAEKHEDFIRSIKAFERSEFQEDWQELMKLFLIRRTRTFIKENYAKTDSKNGRKYLEFKDGHKSYFPDRIPKAIKFQTTEGDQYSRLYSEEMVSLMESLKLPRYGLIHYLDEKKAETASKYEGNLIDNLSRAGERMMGFCKSTFFKRVDSSGYAFLLTLYRHILRNAVYLYAIDNKLKLPVSDENTFPEDFIEDADINKITADSDDNKEFLSNKSLLTIPKKMKDYMERAETYYNSLIGKNNVQWIDSKYFKRTLKQGLKKDCDQLIAMINLCDDWNPQTDQKLNELEKLLSNTHKDDKIIIFTQYSDTAAYVYKQLQKRGIKNIEKVTGDTKNPTAIVERFSPISNRADITKENELRILIATDVLSEGQNLQDAHIIINYDLPWAIIRLIQRAGRVDRIDQSSEQIYCYSFFPADKVEEIIRLRTRLNERINENAGIVGSDEVFFEGNEQNLRDMYNENSSSLDEDEDDIEVDLGSQAYQIWKNATDANPDLKRIIPAIPNIAYSTKAANNINEDGVITYARTYNDFDVLTWYNSKGDIVSQSQKRILQTMACTIKEPCLPAQDVHLSLVEKAVKSIKNENTNVGGILGSRFSTKRKIYELLNHYYEQPLNLFNTQEKKDILKFAIDQVYNYPLLENSKFILGRMMRTGNTHDDIVDTVIEMYENANLCRVDEDKIKHKDPVIICSMGLKA >CP022040.2|ASE17768.1|1638630_1641915_+|SAM-dependent-methyltransferase MKRNIFNQYITASDFKGLFVSEMLWNNPTGATQLPEINIDNTTFHIEQIAERKGFQILHCQVEQIPSSAICKKIDHKIRKNAENYICIFILPETLHHLWIAPVKKVEKRDVVLIEYDSLDKAAFLFEKMESLSFSLDDNLTILDIIEKVQSAFLINSEKITKDFYAGFKKEHSNFAKFITGIDDHIDEKENKNKQWYASVMLNRLMFCYFIQKKEFLDGDVDYLRHKLEWTRNQEGEDRFFNKFYKGFLVNLFHDGLNTPKHNHEFEKIYGRIPYLNGGMFDVHQIEREYANLDIADEAFISLFDFFDKWHWHLDDRMTASGRDINPDVLGYIFEQYINDRAQMGAYYTKEDITEYIGRNTIVPYLMDAVKRKNEKHFRANSELWLYLKESGDKYIFDAMKKGVDQTIPEEIATGLDTTKPNLLERRCHWNERTSEAFALPTEIWRETIERLQRYNNIKEKITKGEITNINDFITYNLNIRQFVTDYLANTQDHLFVKHFYHALQHVTILDPTCGSGAFLFAALNILEPLYEVCINRMQEFNAKNPLLFKQELQEIEHKYRSNIQYFIYKSIILRNLYGVDIMVEATEIAKLRLFLKMVAVVEVDKRNPNLGLDPLPDIDFNIRCGNTLVGYATQEELERDLIEGDMYAREEFKEKVNDEMDKVSRTYEIFKNVQLHQAEDMAAFKKAKGELYQRLNTLNDLLNHKMYGAVESTKGYNAWYQSHQPFHWLAEFYEIINEHGGFDVIIGNPPYVEYNKKVKGVAVSDLYKLVGYKTLSCGNLYAYVLERSKNIMRQEGYISMIVPLSGHSTERMAPLVTNFYEKFGLHLHLNLSADANPQKLFEGVKFRLVIFTATNNGVGKYSTKYTRWLADERKNLFNALVRYNSIEDYTYQNIIPKIASPLFISIARKIKEEKVQYFVGIGNEQCLYHNAPVNWIRSHTFVPYFCSDRDGEGITTQLKSVSFDNTKQVKVGSCILNSSLFFIWWITNSDCYHLNKPEIVNFRYQYDKGIEKAICSVADRLAIDMKKKCIRRIYNYKTTGRVEYDEFYMKLSKPIINEIDKLLASHYGFTEEELDFIINYDIKYRMGDELNEE >CP022040.2|ASE17769.1|1641930_1642584_+|DUF4145-domain-containing-protein MNQKYVTPARDKDAFTCPHCHTLSLMKFRWHRHDEDVHFVTQRVGYGEYLNQLFIARCVNCGKKIIWINDDYIYPDIVAEDPNVDMPESVKQLYNEAGTIYNKSPRAACALLRLAIDRLCNELGETDRDINKNIGVLVKKGLPQAVQQALDVVRVVGNKAVHPGVISFDVDDKGTATMLMRLLNIITERMITEPKEIESLYEGLPETVKESVTKRDK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1079793 : 1132954
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP022040|1079793:1132954|DBSCAN-SWA CTTGAAAAACAAAACATTAATAGTCATCACTGGCCCAACAGGCGTTGGCAAGACAGAGACCACTCTCCGTATTGCAGAGCATTTTAATGTACCTGTTATCAATGCTGACTCCCGACAAATATTCTCTGAGATTCCTATAGGAACAGCAGCACCAACGGCAGAACAGCAGCAGCGTGTACAACACTATTTTGTTGGTAATCATCATTTGGAAGATTACTATTCAGCAAGCCTATACGAACAAGATGTACTTAATATTATTAACAGTCAGCACACACCTATCTCGTTACTCTCAGGTGGTTCAATGATGTATATTGACGCTGTATGCAATGGTATTGATGATATCCCGACAATTCTTCCTGAAATACGAGAGAAAATGATGAAACGCTTAGAAGCAGAAGGATTAGAACAGATGTGTAATCTGTTACGAGAACTGGACCCTGAGCACTGGAAGATAGTTGACAGGAATAATCCACGTCGTGTTATCCATGCCCTTGAGATATGTATCCAAACTGGAAAAACATATACATCTTTCCGTTCCAATACTATTAAAGATCGTCCTTTTAATATCATCAAGGTTGGATTAAACCGTGATAGAGACGAACTCTATAATAGAATCAATCAGCGAGTATTAGACATGATTGAAGAGGGAATGATAGAAGAAGCACTACAAGTTTATCCTAAGCGAACTCTGAATTCACTCAACACGGTTGGATACAAGGAGATATTTGAATACCTTGATGGCTTAACAACACTTGATGAAGCCATATTTAAAATACAGAGTAATACTCGAAGATACGCTCGCAAACAGCTCACATGGTATAAAAAAGATACCGCTTTTCAATGGTTTAATCCCGATAACATTGAAGAAATCTTAAATTATGTCCATACAATGATATCAAATACAAGTAAATAAGTACTTTTGTATTAAACGTATTATAGCAATATTATAAACATAGGCAAGTAGCTTTAAAACTTGAAGGTTGTCAATCAATGATAAAGAAAATATTTAAATTCATTAGAAGTATATTCCGAGGAATTTTCAATTTCTTCCCATGGTACGCAAAACTGTATAAAGGTCGTGCTTGGTACACGAAGATGGCTGTCGGAACAGTATCTTTCTTTGTAGCTATCTTCCTCTATTTAGGAATGGTAGACATTAACTTCCTTTGGCTCTTTGGAAAGTCACCAGGTTTTATTGATATAAAAACTCCTCCAACCTATGCTGCATCAGAGATTTATAGTGCAGACTCTGTCCTCATTGGTAGATTCTATAAAGAGAATCGTACACCAGTAAAGTATGAGGAGGTAACCCCTGCATTCTGGAATGCACTCATCAGCACAGAGGACGAGCGCTTCTATAGCCATAATGGTATAGACTTTATGGGTATCGGTGGTGCCATCAAGGATGCCGTAACAGGTAGTGGAGGTCGTGGTGCATCAACCATCACACAGCAGTTGGCGAAGAATATGTTCCGTGTTCGTACGCAATATTCTTCAGGTCTATTAGGTCATGTACCAGGCTTGCGAATGCTCATTATGAAAAGTAAGGAATGGATTATTGCTGTGAAGTTAGAGTTAATCTATTCAAAGAAAGAAATTCTGACAATGTATGCCAACACCGTTGACTTTGGCAACAATTCCTTTGGTGTAAAGACAGCTGCTAAGACTTATTTTAATACAAGTCCTTCAAAGTTATCTATTGATCAAGCTGCAACGTTGGTAGGTATGTTGAAAGCTACTACTTATTATAATCCAATCTTGCACCCAAAGAATTCTATTCGTCGTCGTAACACTGTGCTGTATAATATGGTTACACATAATGTATTACCACATGATGAGTACGCACTATACTCTAAGCGACCTATGAAGTTGGATATACACGTAGAAGAAAACTATGATGGTCAAGCTCAGTACTTCCGTGAATATATATCAGAATACTTCAAAGATTGGATGAAAGACAATGGTTACGACCTCTATAGCAGTGGTTTAAAGATCTATACCACCATTGACACTCGTATGCAGAAGTATGCTGAGCAAGCTGCAACAAAGCAGATGGAAAAGGTTCAACAAACTTTTGATAACCACTGGAGAGGTATGCAGCCTTGGCGCGATGCAAAGGGAAATGAGATTCCTGGCTTTATTGAGGGCATTGCTGAACGTCAGCCTTTCTATAAGAAGCTATTACAGAAATACCCTAACCAGCCTGATAGCGTTCTCTATTATCTCAATAAACCACATAAGGTGACTCTCTTCGACTATGAAAAGGGGCATATTGAGAAAGAAATGTCATCAATGGACTCTATCCGTTATATGGTTAAATTCATGCACTGTGCTATGGTTGCTATGGAACCAGAGACAGGTGCGGTAAGAGCTTGGGTGGGTGACATTGATTTCAAAACATGGAAATATGATAAGGTTGTTGCACAGCGTCAACCAGGTTCAACCTTCAAACTATTCGTCTACTCAGAGGCATTCAATCAAGGACTGACCCCTTGTGATAAACGTCGTGATGAATATATCAGTATGCAGGTTCTTGATAAGAAGACTGGTCAGATGAAGACTTGGACACCACACAATGCCAATGGTAGATTCTCTAATGATTCTATTACACTGAAGAGTGCTTTTGCTCGCAGTATAAATTCCATTGCTGTACGCTTAGGACAGGAGATGGGTATCAAGAATATTATCCGTACAGCTCAGGAGATGGGTATTAAGAGTCCGTTGGATGATGAGCCTTCATTGGCACTCGGTTCAAGCGACGTAAACTTATTAGAATTGGTAAATGCCTATAGTACTGTAGCAAATGATGGTGAATATCATGTTCCAGTTGTCGTTACACGCATCTTAGATAAAGATGGAAACGAGGTATATGTAGCACCAAAAGACCACGAGAGAGCATTACCTTACAAGACTGCATTCCTCATGCAGGAAATGTTGAAGGCTGGTGTGAACGAAGGTGGTGGTACAAGTCAAGCATTACGTCATTATACTTTTGGTGATACAGACTGGGGAGGAAAGACTGGTACAAGTAATAACCACTCTGATGCTTGGTTTATGGCAGTCAGCCCTAAACTTGTCGTTGGTGCATGGGTTGGTGGTGAATATCGCTCCATCCACTTCCGTACAGGAGCCTTAGGACAGGGTTCAAAGACTGCACTACCTATCTGTGGAGAGTTTATTTACAGCTTAATGCGCGACAAGGCTTTCCAAAAGTATCATGCTAAGTGGCAGTTAGATCCAGACGAGGACATCGACCCTTCAATGTACAACTGTCAACCTACAGTAGTTCGGCGTGCTGCTCCTGATTCTTTGCGTGACTTTACAACAGGACATAGACGTCATCAGGAGGAAGAGGAAGAACCTATCGACGGACACGAAAACACAGACGAGGGTGGCTTTATATTAGAACCAGCTCCACAACCTACTCCAGAACGGCAGGGCAATAGTTCTAATAATGAATCGCCTCAAATTATAAAGAAGGCGAAGAAGCCACGCAGTGAAGATATGGAAATCTAAGTAATTCATAATTCTTAAAATGGAACTTATAATTACTTTTGGTTGATAAAGAAATCAGAATACTTTGGTAATTGATTAATAACTATCAATTCTCTTCCTCAAACAACAAACTTCTATATATTCACAGAGACTTTTACCTCTGTGAGTATATATGAAGGTTTTTGTTTTATCTATAGCCTCTTGTCAGTAAACTTAACAATAGGTCGTTGTAATTATACAGAAAAGGCACAAGGCATAAATATAGCTTAGTAGCAAACATAACTATCAACTGCAAAGGGTAACTAAAGCCCCCCAATCAATGTCACAAAAGAAGGTAAACATAGACGTAAATACGATTGACTTGGATAATACAGAGTTGCAAAAAGCACTTCAGATTATTCAGTTTACAAACAATTCCCTATTCCTCACTGGAAAAGCAGGAACAGGAAAGTCTACATTTCTACGTTACATTGCTGCGACAACAAAGAAAAAACATATTATTCTTGCACCCACAGGCATAGCTGCTATCAATGCAGGTGGTAGTACTCTGCATAGCTTCTTCAAACTTCCATTCTATCCTCTTGTCCCAACTGACAAACGTTATTCTGCTCGTAATCTACGCAGCACAATGAAATACAATGGTGACAAGTGTAAACTTTTGCGGGAAGTCGAACTAATCATTATTGATGAGATTAGTATGGTTAGAGCTGACATCATTGACTTCATCGACAAAGTACTGCGCATATACAATCGTAATATGCGCGAGCCATTCGGCGGAAAACAACTTCTCTTGGTAGGCGACATTTATCAGTTAGAACCTGTTCTAAAGGAAGAGGATCGTCGACTCTTACAACCCTATTACCCAAGTAGCTACTTCTTTGATGCAAAGGTGTTCCAAGATTATCCACTTGTGAGTATTGAACTCAATAAAGTCTACCGTCAAAACGACCCTCACTTTATTTCCATTCTTGACCATATCCGTACGAATCAGGTCACTGACACTGATTTCAACCATATCAATGAACGTGTTGGTGCAAGTTTGGAAAATACAAACAAGCCAGAAGGAGATTTCACCATTACCCTATCGACTAAACGAGATACAGTAGACTGGATAAACAATGAAGGACTTGATAGCCTTGATGGAGATCCTGTTATGTTCTTAGGTGAAATCAAAGGAGAGTTTCCTGAGAGTAGCCTCCCTACTCCTATTGAGTTAAACCTCAAGGTGGGAGCACATATCATGTTTATCAAAAATGACATTGAGAAGCAATGGGTTAACGGAACCTTAGGTATTATTATCGGTATTGATGAGGAAGCAGGAATCCTCTATGTTCATACAGAAGAGGGAGATGACTTACAGGTGCAACGTGAGATGTGGGAGAATATAAGGTACCGATTCAATGAGGAAGAACAACGAATTGAAGAGGAACAAATAGGTACTTATATACAATTCCCCATCAAGTTGGCTTGGGCTATCACTGTACATAAGAGTCAAGGACTGACATTCAAGAATGTAAATATTGATTTCACAGGCGGTGTCTTTGCCGGTGGTCAGGCATATGTTGCTCTATCACGCTGTACAAGTCTTGAAGGTATTACACTCAAGGAACCACTACGTAGAAATGAGGTCTTTGTAAGAAGTGAGGTAACACATTTTGCCAGACACTACAACGATAATAATATTATATCGACAGTACTCAAACAGAGCAAGGCTGATAAAGAATACTACGATGCTGTATGTGCTTTCGACAAGGGAGACTTTGATGCTTTCCTGCGTAGCTTTTTTCTTGCTATTCATAGCCGTTATGATATTGAACGACCAGCAGCTAAGCGATTTATCAGACGAAAACTGGACCTCATTAACCAGCTACGTAATGAAAATGAGGAACTTAAGCGACAACAAGATAAGAAGAATGAGTATCTCAAAGAGCTGTCAGTAGAATATGTTATGATGGGTAAAGAGTGCGAGCGTGAAGAAATGAATGAGGCAGCTATTGCCAATTATGAGAAGGCTATTGCACTCTATCCTGATAATCCAACAGCACAGAAAAGATTGAAGAAATTAAAACCTTCAACCGAAAAAGATAATAAATAATGGAACAGAAAAAGAGTCTTTCAAAAATTATAGCAACAATCGTTTTTGGTATTGTTTTCATCATTATTGCTATTGTATCTTGTAATCAAAGCAAAACAATAGAAGGCAGTAACAATGGAGATACTGTTGCTGCAAAGGAGTTTACACCACATGTTGAGAATGGTAAACTAAATATTGATTTACAGAGTATTACCTCTATTCGCTTTCCTAAATATAAGACAACAAAAGCAATACCTTTTATACCCGACTCTGTTAGTTTAGCGGCTGATGAAGAATCGGTAGAAAGTGGCAACTATTCAGCAACACTACTTCTTGATACGATTCCAAATAAAGAGTTTTATCAGAGAATAGATCTTGCAGCACAGCATGATACGTGTTGGGATATTAACAAATTTGCTTATACTTACGAAAGAAAAGATAAAGCTGGAGGTGTGTATAAAGTCGTATTTAGTAAAGGTGGACAGCAGATATTTGTCACTCATCTGAATAAAGACATGATAAAAAATACGAATCAAACGAAGCCGAACAAATAAACATCACAATACACCCTAATTATGCTTAGGTTCTAAATGAACCAGTTTAGGTTTAATAAAACGCATAAAAAAAGTGTCAAAGCAAGTAGCTTCGACACTTTTTTTATATCCTTTTTAAGTCTTAACGAGCCTGCTCAAGCTTGCTATCAATATGTGCAATAGCATGGCTTTCCAAGTGAGCGAATGAATCGAGAATATCCTTAACATTCTTCTCTTCCTCTACTTGCTCATCAATATACTTTGCAATGAAGTTCTGTGATGCGCGGTCCTTCTCCTCATCAGCAACATCTGCCAACTTGTTGATAAGTTCTGTAACCTTCTGCTCATGCGCAAATGTATCAACAAAGGCCTCCTTAGCGTCTGTCCAACTTGTCTTAACGGCATCGATACTGGTAAGGATAACCTCACCACCACGATGAAGAACAAACTGAGCCATATCCATTGCGTGCTGTCTCTCCTCTTCAGCCTGCTTGTACATCCAACTTGAGAAGCCCTTCCAGCCTTCCTTACGGAACCAGCATGACATCTGTAAATAGAGATTTGATGACCACATTTCTGCCGCAATCTGCGCATTGAACGCATCCTGCATCTTCTTTGATATATTCATTTCTTGTTCTTTTTAATTAGTAATTAATTACTTACTTCGTTAGTCACTAATAATAATTCCGACTGCAAAATTACAACATTATTGTTATTTTTCCAAATTATCAGCCCAAAAATATGATTTTTTCACATGCTTATTAAACTCATATCACACATCTATTTTATCATCCCAGCTATATTTATACTCTTTTTTAGGTGATATGTTATTGCCCCGCACATGTCGTACTGATGATAAACACCAATGGTGCTGAGCACTAATTAAGTTGCTATATATTGCTATGAAAAAGTGTATTTTTATATTATTGGGATGAGGTCTACACCCAAACGGGAATACAAACGTATTTCATCTTACTTTTATATATCAAAAAAGATTTGCATCAAAGCGGTTACTAAAAGTAAACCTGAACTTTGATACAAATCTTTGTTTTCTATACTTATCTTTATAAGTGCATTCTCGTTCTACAAAAGATTATTCAATTACTACAGTTGTCCAACCATGCTTGTCTTCTATCGTGCCATACTGAATACTACGGAGAGTTTCATAAAGCTTGGTAGACCATGGTCCTGGTTTGTCACCAAAATTATAAACTTTACCCGTCTCCATATCATCAAGATGTGATATCGGAGAGATAACTGCTGCTGTACCACAAGCACCAGCTTCTTCAAAGCTATCCAATTCATCTTCTGAAATAGGACGACGCTCTACCTTCATTCCAAGGTCTTCAGCAATCTGCATCAGACTCTTGTTCGTTATAGATGGCAAGATAGAACTTGACTTAGGAGTAACATAAGTATCATCCTTGATACCAAAGAAGTTGGCTGCACCACACTCGTCAACATACTTCTTCTCCTTTGCATCCAAATAGAACTCTGAGGCATAACCTTGCTCATGTGCACGACGATTTGCTCTAAGTGATGCAGCATAGTTACCACCAACCTTATACATACCAGTTCCCAATGGAGCAGCACGGTCAACATCACGAACAATAACATAAGGATTAGCACAGAAGCCACCCTTAAAGTAAGGACCAACAGGAGTAACAAAGATTAAGAAGCAATATTCCTCTGCTGGACGAACACCCACCTGAGCAGATGTACCAATTAACAACGGACGAATATAGAGTGAAGCACCACTCTCGTAAGGTGGGATATATTCTTGATTAAGACGAACAACTTTTTTAACCATTTCTGCAAACAACTCTGTTGGTACTTCTGGCATTACAATACCACGACTTGTATTCTGCAAACGTTTTGCATTTTCTTCTACACGAAAAACACGCACTTTACCATCTGGACAACGATATGCCTTCAATCCCTCAAAAGCCTCTTGACCATAGTGAAGACAAGTTGCTGCCATGTGAAGTTTAAGATACTCATCTGAACTGACTTCTACCTCACCCCACTTTCCATTGCGATAGTAACAGCGAACATTATAATCAGTTGGCATATAGCCAAATGATAAACTTGACCATTCTATATCCTTCATATTATTTTCGTGTTAAGTTATTATTCATGACAATTATCTACAAAAGTAAACAAAATATTCTTACCATGCAACCAAATAACAAGTTAATTAAACTTACATTTCGTTTTCTAAATAAGAAAAAAGAGAATTATTGCTTTTCTAAAAGCTTTTGAATCTCGTTGTCGGTACGCTTAAGCTTTTGTTTACAGAGCTTTACTAACTCTTGAGCTTCTTTTAATTGGGCGGCCATAGAGTCTATATCAAGTTCTCCACGTTCCATTTTATCTACAATAGCTTCCAATTTATGAACTGCTTCTTCGTATTTTATTTCTTTCATTGCCTTTTTACATTTATAATATATGGAATAATATGCTCTTTTAGGATAAACGTTCCACCTTACTCTCTACCTCGCCAGTCTCAAAACGTGTTGTTATCGTATCACCTATCTTTAAGTCTTTAGCGTTACGAATTGTCCTACCATTACACAATGTAATACTGTACCCACGCTTTAAGAGTAAAGAAGGATCAAGTAACCTTGCTCGCTGCTCAAGTATATCGAGTCGATGCTGCTCAACAGACAACTTATTTTGCAGTGTCAGTAACATACAATTTTGTAACGTGCTTAAATAAAATTCTGCCTGTTTTATAGCTTCATTCATTCTCATCACCAGCCGTTGGGATAAGCCTTCAAGCCAAGCATCCTGCTTTGTACGAACCACAGAGAACAATACTGGAATATGTGAACCAATATGTTGAATACGCATTTTCTCAACCTCTAAGGCCTTCTTCACACCATCGATTATCGCAGCTTGCGCGTTTTCTACTCTCATCAACGTTGAAGCAATATGGTCTATGAGATAAGCAGCAGCGGCAGTTGGCGTTTTCATACGTTGAAACGATACCATATCCAAGATGCTTTCATCACGTTCGTGCCCAATACCTGTTATAATTGGTAATGGAAAGTTTGCCACATTCTCAGCAAGATTCAAGGTATCAAAACCGCTAAGGTCGGCTGTTGCGCCACCACCACGAATGATAACAACACAATCGAAATCTTCTTCACGACTATTGATTTGATTCAATGCGTTTATTATACTCTGTTCAACAGAGTCGCCCTGCATAATAGCAGGAAACAATTCTACGTGGAAATAGAGTCCATAATCATTAGTTTCTAACTGATTACAGAAGTCACCATAACCAGCAGCGGTCTCAGAAGAGATAACAGCTATACGCTGAGCAAACATCGGAAGACAAAGTTCTTTTTGCAAATCAAAGACTCCTTCTGCCTTTAATTGACGGATAATCTCTTGCCGCTTACGCATCATATCACCCATCGTATACTCAGGATTGATATCATCAACTATCCAAGAGAAGCCATACTGAGGATGAAATTGAGCATGAACTTGTAACATTACTTTCATTCCAGCTCGTATTTCCTGACCTGTAATACGAATAAAAGCTGGCTTAATAAGTGTCCAAATGTTGCTCCAACACTTTGCTGAAGCACGTGCTATAGGTACATTGCTCCCTTCGTTCTTCTCAATCAGCTCCATATAACAGTGTCCACGATTCTCACGGGCCTCAGACAATTCTGCTTCCACCCAATAAGAACGTGACAGGGTTGCGTCTATCACATCTGCGACAAGACTGTTTAATTCAAATAACGAAAGTGCTTTGGTCATCTTTTAGAAATATATGTTTCACTAACCATTACTGATTATAACGAGTTGCTGACTGATAGGCTGACCAAAAGTCAGGTATACCATATCCCATTATATTGTCAGGAGATTGACAATTATTTCCCGATTGACGAATAATATCTATTATCTCTTTTGCTGATTTATTAGGGCATGCCTGCCACAAACAGGCTACCATACCAGCTATGAGTGGACAAGCGAACGATGTTCCATTGTCTGGTATGATATATCCCCTACCCGATACGACATTTGTTGGACATCCATAAGCCATTACATCTGGCTTAATACGCCCATCAGCTGTAGGACCTATCGATGAAAAAGGAGCATTAACTTTATCAAGAGAAACTGCTCCCACGGTAAGAATATCATGAGCATCAGCAGGAACATTAATCTTCTTCCAAGTTCCCATACCATCGTTACCTGCACTATTTACAAGTATCATTCCCTTACTTGCCAAACGTGATGCTGCTTGAGAAATTGCTGCAGTTCGTCCATTTAAATCTGCGTAACGATAGTTCATAGATATGTCATCAAAGCCGTGATAGCCCAAAGATGAACTAATGATATCAACACCTACAGAATCAGAAAACTCAACTGCAGCTATCCAAAAATCTTCTTCTGCAGGACTTTCTGTCGAATAGTCTTCCGTACGAAGCAACCAGAAATCAGCTTTAGGAGCAGTACCCACAAACACCGTAGGCTGGTTCATCGCTATTGTTGACAAAGTCTTAGTACCATGATCCATCTCTTTATAGACATCATTATCGTACCCTGCAACAAAGTTATGTGTACCTTTTATATTGAGATTCCTAAAGGCAGCAATCTTATCCACATTCATAAAGCCTGCATCCAACACTGCGATAGTCATACCTTGTCCCATAAAACCAAGGTTGTGCAGCTTTCTACCATTCAAACTCTCAATTTGCCCCTTACCCATACCATAATAGTCGTGAACCGTAGAATCAAGAGATTGAAGTTCAGAATTATAACGAACACGTTGAGAAAGTGGACGGATAGAGTCTGGAGAGACAAATACCAGCTTACTATCTTTCACAAAAGACAATACTTTTAAGTCTTCTAACGTTTGGCGATTATTGCCACGCACAAGAACAGAGTTATTCCATTTACTCACAGCAACAACCTTACCACCTCTCTTCTTTATCTGTTGAATATAATCTGGTGATACCGGTAGGTCTGTCGAATCAAGCTTTAATCCCTGACGTTCACGTCGAAGAAGAGAAGCTTTTGATAGAAATTTCTCTGGTTTATCCAAGGAATAAGGTGTTCCATTCTTATCTTTTAAAGTAACACGGAACATATAACACTTTCCTCCTGGGTAACTTATCAAGCCACCACTCTTATTAGAAGCTAATAAAGTAGCGCTGACCAGTATAAATAGGAATATAATATAATTTCTGAGATAAGTTTGTTTCATAATCTGCAAACTTACACAAAATAATTGAATTATACCGCAGATATAAAAAGAAATTAGATAAATAAAAAAGGATTTCCTTTATTAGGAAATCCTTCTTGTACACCCTTAGGGATTCGAACCCTAGACCCACTGATTAAGAGTCAGTTGCTCTACCAACTGAGCTAAGGGTGCATGCTAAACGAAAAGACTTGGGTTAGAAATCAATTCTTGTACACCCTTAGGGATTCGAACCCTAGACCCACTGATTAAGAGTCAGTTGCTCTACCAACTGAGCTAAGGGTGCTCATTTCGTTTTTGCGAGTGCAAAGGTATTACTTTTTTGCGAATTAACCAAACTTTTCTATCTTTTTTTAGTCATAAATCAGTCGATAATATTATGCCATCAGGCTGAAAGTTTAGTATAACTACGTGATAATATGAGTATTACAAAATGACATGGTTCTAAAAGGAAACAATTATTTTTATGTATTTATAATTATCTTTCGCTCCAGATAAGTAAATTCAGACGCCTTTGATGCCTAATCAACTTATATAATTTAACCCAATTCGATCATTAGAGCCTAAACATATTTTGTTTTATTATCGAGCAGATTGTTTGGCTTTGGAACGCAGGCGTAGCGGGCTACGTCAAGTTAAAAAGACGGACAAGATGCCGATAAGAAAATAAAAGATGTTAGGAAACAACACAAAAACAAGAATTCCTGAAACATTGTGGTCTAATGACCGCATTTGGGTTAAACAAAATGAGGATGAAACCTACAACAGTTTCATCCTCATAATATCGTTTTATAATGATTTCTCTATTATATATTATACTGATGCAAATCAACGAACTTACCACGCTGGCGACGCTCATCAATCATACGTTGTAGCTTTGCCTTGCGATGTGGCGCAATCCATTTTTTCGCAAAAATGTTAGGAATAGCAACACCAAGAGCAATACAAACTAAGACTAACGAACCATTGATTGCAAAAGACATAGCATGCGTTACTTCTCCAACAACACCCTGCATCCCCATAAAACCATAAAGTGCACGATACATTAATACACCTGGAACAATAGGGATAACAGCAGGAATAGTGATACACTGATGTGGTGTATGGAGGATATGGACAGCCTTAATATTAATGATAGATATCAATGCCGAGCCACAAAGTGAACCAACAACTATTCCAAGTCCAAGACCAGCATTTCCTGTTGAAGGGTCAAGGAAAACGAAGTTACGCGTACAAACACAGATTATACCTCCAACCGCAATCCAAGGCATTAAACGATATGGAATATTATAAATCGTTGCAAAGCCCATAGCAGAAATTGCAGCAGCAACAGCATAGACATAGAAAGAGTGATGAGGAATAGTAGGGAGGTCTTTTGCAAAACCATCAAAACTACCACACTTAATTGCCAACATAATACCAAATGACATTGCTATGACAATAAGAAGCGTGTTCATTGCACGGACTAATCCCGTGTTTATATGATTGTCTAACAAGTCATTAACAGCGTTTATTAAAGGGACTCCTGGAACAATATAAAGGGCACAAGCTAAGAGTGGATGCCAAGGAGTTTCTGTAAACAGAATTGGACGAAGGAACTCTGGTAGCGCTGCTTGAACGGTAGGTGTAGAGAGAAAAGACGACAACCATGCAAGAATTGTACTAACAAAAGCCGCAACAGCAAAGTTAGCATAAAGATTAGAACCAGAATGATTCAAGAACATACGAAGACGATTACCCAATATAGCGGCAATAGAAGCATAGAAGAAAGCTGTCCAATCACAACCGAACTGAATACAGAATCCACCACAAGCAAGTCCTGCACCAATAGCAATTATCCAATCCTTGTAATAGTGTTTACCATTAGCAATCTTCTCAAGTTCCTCTTCATACTTGTCTAACGAATAGTCCTTTTGAATAGCACGCCATGAGAGTTTTGAAACTTCCTGAATAGCAAGCATATTAATTACATGCTTATCACAACGCTGCATCTTAGAAAAACTATGATATTCGTCACTAACATTCACCTGCAACATATAATAATCAATGTTCATGTGCAGATTCTCTTTGGGAAGTCCCAAATAAGCTGCTGTACGCTCCATATTTCGCTTCACACGACTCGTATCAGCAGAACTTTCCATAAGAATTTGCCCTGTACGAAGGAGCAAATCGAGTTTTCTACGAAGGGTTTTCTTGGAACAATCCATCTGTTTTTCGTCCATACAAAACAATCTTTAAACTTTAAAACTTTTGCAAAGGTACAATTTATTATCCGTATAGGAAAATTACGCTGCTAAAAATCAAATCTTTAGCTTAATAATTATAGAACTTTGTTCTCAATAGTAAAAAGCTCATCAGCCTTTTCGAGTAATACTACTATAGATTTTTATGACGCAAACCCCTACACTCTTTTAGACCTATTCGTAATAAACAAGTTTTCTTATGCTTAGTAAATCTGTACATCGTAACCGTCCGCACCATTGGTGTTCACCGCCCGCACAAGATGTGCTAACCCTCCGCACAATATGTGCTGTCCCTCTGCACCAAATCATTTCACAAACAGTAAAATACGGTTTAATAGAAGTTTGAGAAGGTATATATCAAAGAGTCGATGAATAAAAACAAACGGGTTTCTTTCCCTCTCCTTATTCTTCAATGCCCAAAGAATCTTCTATCAATGCCGCAACGATAACAATTTTCTCGTAGAAAACGAAGTGGACGAGAAGAATCCGAACTATAAACATTGAGTTTCTTACATCTTCGTTAGGGAATCACTTCAAAGCAGTTTTTCATCGAGGCGTTAATACCGACAACATTCTGGTAACCAAAACCTTTCTGTAGGATAAATCGATTTTAATAGAGTGTTGTAACTATGAAAATAAAGCTTCTCAAATAATTATAAAATGCAAGATGTACAATAAAACAAATAATCTTCAAAAATAAATTCCGATTTCTTTGGATTTCTCAAGAATGTTATATATTTTTGCTCAAATCGTTATGAGAAAATATATATTTACTGCCATTATAGTATCTGTAACTTTAACGAGTTGCTTTAGTTTCTTCCCTAATCGGTACGAGAAAGCTCTTGTATACAATGGTTTTCGAGCAAATACCAATACAGGGATAGCTACAAAAGTGAACATAAAAGGCTACTATTCTTTAGTTAATGATAGCACCGATTCTGTTACCGTAGGAGGTTCTTCTATCATTAGCAAGGGTGGCATGTCAACGCTTGCAGGGTATAATCCTTTTATCCTTTATGAGGACGGAACATACGGAAATATTCTTTTCAATGCTGTAGAAAAGGATTTTTATGCCAATGAGCATTATAAGAGAGCTAATGTGGAATTGTATAAGGAGTGTACTCCTTTGAATAACATATATCTTATGAGGAGTGGGTATTATAAGCTAAAAGAAGATACAATCTGTGTAACGACATATATATATTATTTACTACGCACAGAACTGGTATTACTGAGGTATAAAATCATTGATAGTAGTCATCTGCTACTTTTAGATGAGACTTATATTTCTGGAAATAAAAATGAAAACGATACTTGTGTTCAGAATAGGATATTTGAATTTATCCCAGCTAAAACACTTCCTTCTCCATCTTTGTTCCCAATTAAAAAGAAAAGGTGGGCTTGGGCTAACAAGAAGGATTGGAAACAATTTAAGCATAGTATTGCAAGAACATCAAAAAACAAATAAGAATATACTATATTTTTTTACAAGTTTATCAGAGATACATTTTATAGCAGTTATTGCGGGTCTTCAGGATTTTGGAGTGATAAGGTTTGTCACTTAAGGTTAGGTTTCTGACGAAATGTCACACGAGAAAAGGAGAGGACGGAGGATTATCAAAACAAAGCAATGGAAGTCACGGAGGCACCGTCGGTGCATAGAGTGACGGAGTCTGTTTGCCAATTCTATTAGCAATTTATATACAAAACGTTTATCCCCAAACAGTTATCAAGAACCTGTATGCTACACCTCCGTCGCTCTTTGTGCCGTCGGCACCTCTGTGACATTGTGTTTACCTCCGTCCTCCCCGTAGCTCTGTGTGCCTTATTATATAATCGTTTTAGTTTATCACTCCATATTTCGGAAGAACAAAAAAAACACTTGATTATGAAAAAAATCATCTTCTCAGTGAAATAAGTCTATCAATCCTTCGTTGTTACAATATAAATATAGTACCTTTGTAGATAATGAGTTTTATAAATACTGTAAATACAGAAGACCTTTACCACAAAGTTGCACGACTTATAGAAGATAGTCGCGCTCGTATGGTTACATCAATAAATTTAGCAGAGGTATACACAAAGTTTCGTATTGGACAATATATTGTAGAAGAAGAACAACGTGGTGAGACAAGAGCTCAGTATGGAAAACAAGTTCTCCAATCGCTTGCCACTAAACTTACCAAAAACTTTGGTAACGGATGGAGCTATTCTAATTTGCGCCAAATGCGACAATTCTTTGTAATATATAGCAATTTGACAACCACTGGCTGTCAAATTGACGATTTTACTCCAAAGTTCACATTGTCGTGGTCGCACTACTTGCTTCTAATGCGTGTTGAAGACCCCGATGCTCGCAGGTTTTACGAAATAGAAAGTTCACAACAACAGTGGTCCAAACGTCAACTATCAAGACAAATAGGAAGCAGTCTTTACGAAAGACTGGCTCTCAGCCGAGACAAGGAGGGAGTGATGCGGCTTGCTCAAAAGGGGCAAGTTGTAGAGAAGCCATCCGACATCATTAAGGATCCAATTACGTTGGAATTTTTAGGTTTAAAGCCCGACAGCCTATACTCTGAGTCAAAATTAGAGAATGCCATAATCGGACGTATGCAACAATTTTTATTGGAGTTAGGGAAAGGGTTTCTTTTTGAAGCACGGCAAAAGCGATTTACCTTTGAAGAAAGACATTTTTATGTGGACTTAGTCTTTTACAATCGATTATTACAATGTTATGTATTGATAGACCTTAAGACAGGAAGCTTATCCCATCAGGATTTGGGACAGATGCAGATGTATGTCAACTATTATGATCGCTATATCAGACAAGATTTTGAAAACCCAACAATCGGTATTTTGCTCTGCGAGAACAAAAACGATGCATTAGTAGAACTTACTCTGCCCCCAAATGCAAATATCTATGCTTCTGCTTATCAATTGTATCTTCCCGACAAAACTCTGCTACAATCAAAGGTCAAAGAATGGATTTCTGAATTTAGAGACAACGCTGAATGATATGAAACCAAATATAGATATTAGTGAAAACGAACTTCTACGGCAATCAGAAGAAATACTTGTAGAATTACTCAGAGACCACACGACGCAAAAGAATATCTTTTGGGCTACGGACGATTATGCCTCTTTAGGTGAGGCATATAGTTATCATGCACCTATCACAATACCTTGTATCACCGGTGATAATGGTTTTATCATACAACCGAGAGTATTAAAAACTCGAGAAGAACAAGCGAACCGAACCAAAGACAAAGCTGAGGTTTTTACGCCTTCTTGGGTGTGCAATGCGCAAAACAACCAGGTAGACGAGGCATGGTTTGGGCGCAAAGATGTTTTTAATCACGAGCACCCCGAAACCAAAACATGGACAGCCACAACTAAACCAATCTTGTTTCCTGACGGTAAAACATGGAAGGACTATGTTCGCTCCACCCGCATGGAAATCACATGTGGTGAAGCCCCCTATCTTGCAAGCCGATACGATACGACTACAGGAGCTTTTATTCCATTATCACAACGTATAGGTATGCTTGACCGTAAATTACGTGTCATATCGGAAAATACTACCACTACTGGAGAATGGCTCAAAATGGCACAAGAGGCATATAAGAATATCTATGGCTATGAATGGCAAGGAGACAATCTCCTGCTTGCTCGTGAAGCACTACTCATGACTTTTATTGAGTATTACACAGAAAAGTTCGGAGAGAAACCCCAAGAACGTTCGATAAAATACATTGCCTATATTATAGCTTGGAATATTTTTCAGATGGATGGGTTGAAAGGAGTAGTTCCCGATAGTTGTAGACATAATGAGGTCATAATTGAACAAACTCTTTTTGAAGCAATGGAGCGAACCGTGCTTTGTCCCGGATGTCAGAATGAGACCTATAAAGGACACAACGGGATATACTGCCTCATCCGTGACTGGGGACATAAAGACCCAGTAACAGGAGAGAACAATCGAAAAATACGTTTCATTGACTTAATAAAATAACAACTTAAAATATCAACCTATGGCTACTTTTGAATCATCACTCAAACCTCGATTGATATATGTTTTTGCTATTGCTGACAAACAGCATGAAGGTAGTTTGAAGATAGGAGAAACAACGCTTGGTGATGATACTGGAGATACGTTGGCAGCACCCAATAGCGATGTCCTTAACCAGGCTGCCAAGGCTCGTATTGATCAATATACGAAAACGGCAGGTATCAGTTACGAACTTCTGCACACCGAACTTACGTTTTACATCCGTGGAGGACATATCTGTTCCTTCAATGACAAACAAGTGCATAATGTCTTAGAGCGTTCGGGCGTAAAACGTAAGGAGTTTAAAGGAGCTACTGAATGGTATTCCTGTGACTTGGAAACCGTAAAACGTGCCATTGCTGCCATCAAGGAGGGCAAGGACAGTTTGGGGGCAGGCGAAGTGACTCACACTGAGAATCCGATTATTCTCCGACCAGAACAAAAGGACGCCGTGGAGCGTACGCTCAAACAGTTTCGTCGGGGCAATCAGATGTTATGGAATGCCAAAATGCGTTTTGGTAAAACCCTATGTGCCTTGCGTGTAGCTAAAGAGATGGGAGCCGTCCGTACTATTATTGTCACTCATCGTCCTGTGGTAGACGCCAGTTGGTTTGAAGACTTCGGAAAAACATTCCACGACAGTCCTGAGTGGCATTATGGTTCACATAACAAGGGGGAAAGCTTTGCCTCGCTCCAGCGACTGGCTGGTCAAGGAAAGAAATATGTCTATTTTGCTTCTATGCAAGATATGCGTGGCTCTAAGGAAGTTGGAGGAAAGTTTGACAAAAACAACGAAATATTTTCAACTACTTGGGACTTGGTTATTGTAGATGAGGCTCACGAAGGTACGCAAACCGAATTAGGAAAGGCTGTCTTAGAACAACTGATAAGCAAAAACACTAAGATATTGCGCCTATCAGGCACACCTTTCAATCTGTTGGACGACCACAAAGAAGAGGAGGTCTTTACGTGGGATTATGTTATGGAGCAAAAAGCCAAAATAGACTGGGAAATCAATCATCTTGGTGACACCAATCCATACGCTTCGCTACCCGCTATTCACATATACACCTACGATCTTGGGCGGTTGATGAGTGAATATAGCGACGAAGAAAAAGCCTTCAACTTCCGCGAGTTCTTCCGCACACGAGAAGATGGGAGCTTTGTACACGAGCGAGATATTGACCATTTCCTCACACTCCTGACGACTGATGATGAAGAATCGCTCTATCCTTACTCTAACGACAGCTTTAGACAAATTTTCCGACATACCCTTTGGATTCTGCCGGGTGTAAAAGCCGCCAAGGCTCTCAGCAGAAAACTGGCTAAGCACCCTGTCTTTGGACTCTTCAACGTGGTTAACGTGGCAGGCGATGGCGACGAAGAGGAAGAAAGCCGCGACGCACTTGAACTTGTTAATAAAGCCATTGGCAGTGACCCTGACCAATCTTATACAATAACCCTCTCCTGTGGACGTCTTACCACGGGTGTAAGCGTAAAGCCTTGGACTGGTGTGTTTATGATGGCTGGTGCTTACAGCACTTCGGCAGCAGGATATATGCAGACTATCTTCCGTGTTCAAACACCTTATACTCATAATGGACGTATGAAGACAGACTGCTACGCCTTCGACTTTGCTCCCGACCGCACGCTGCGGGTGCTTGCTGAGACTGCAAAAGTATCGCACAAGGCTGGAAAGCAAACCGAAGATGACCGTAAATTGCTTGGCGACTTCCTTAATTTCTGTCCTATTATTGCCATTGATGGTGGGCAGATGAAACAATACAAAGTGGAAACTATGCTTGCCCAACTCAAGCGAGCGCAGATAGAAAAGGTGGTGCAAGATGGTTTTGAAAATGGTGCACTCTACAATGACGAATTACTCAAACTCACAGATGTGGAACTCAAAGAGTTTGATGACCTCAAAGGCATCATTGGCAAAACCAAGGCAATGCCTAAGTCGGGAGACATCGACATCAACCGACAGGGACTGACCAACGAGCAATACGAGGAGAAAGAACAACTTGAAAAGAAAAAGAAGAAAGACCTTACGCCAGAAGAAAAAAAACGACTCGACGAGCTTAAAGCAAAGGGCGACCAACGGCGGGAGGCTATCTCCATTCTCCGTGGTATCTCTATCCGTATGCCGCTTATGCTTTATGGTGCCGAAATGGTGGACGAGGACAAGGAACTGACCATCGACAACTTTGCTAAGCTTATGGACGACCAGTCTTGGGAAGAGTTTATGCCTCGAGGCGTAACCAAACAAGTGTTTGCCCGTTTCAAGCGTTATTACGACCCCGACATCTTCCGCGAGGCTGGTAAGCGCATCCGTGAAATGGCGCGTATGGCAGACAAGTTTACCATTGAGGAACGTATTGCCCGTTTAGCAAGCATCTTTGCCACCTTCCGCAACCCCGATAAAGAAACGGTGCTCACGCCTTGGCGTGTGGTAAATATGCATCTTGGAGATAGCCTTGGAGGCTACTGCTTTATGAACGAGGACTTCACCTCCAACTTAGATATTCCTCGCTATATTGAGCATAAAGGTGTCACGACTGAGGTGTTTCATCCACAGAGCGTCATTCTGGAAATTAACTCCAAGAGTGGTCTCTATCCCCTCTATGCAGCTTACAACATCTATCGCACCAGATTAGAGCAGGCTCGTGAGAAGTATGGAGAAGTAAATCGTGCCACGGCACTTATGCTTTGGGACTTGACCTTAGAGGAAAACATCTTCGTCGTCTGCAAAACGCCTATGGCACGCTACATCACTATGCGCACCCTACGAGGTTTCCGCAATACGAATGTGCATACCAAATATTATCCCAACTTAATAGAAAGCATTATCACAGAACCCGATAGCGTGGTCAATATGTTGCGCTCGGGTAAGAGATTCTGGAAAATAAATAATGACGAAAATATGAAGATAGACGCTATTATAGGCAACCCACCGTATCAGGTGACATCCGAAAACACAAGCGATGCTCCTGTTTATCATTTATTCATCGATCTTGCAAGCCTTTTGGCGCAAAGAGTCTCGCTTCTGACTCCTGCTCGCTATCTGTTTAACGCAGGCAAAACGCCAAAGGATTGGAATACTAAGATACTTAATGACGAGCACTTTAAGGTGGTTGATTATTGGGCAAATAGTACAGATGTGTTCCCAACAGTAGATATAAAAGGAGGAGTGGCTGTTATGTACAGAGACTCTAAGCTGAATTTTGGAAAAATTGGTACGTTCACAGCTTACAAGAAATTGAACATTATTGCAAATAAAGTATGTAAGATAAGTGAAAACGGATTGTTTGCAGAACTAATATATGCGCCCGAAAGTTATAGATTGTCAGATAAATTACACGAAGACTATCCTTGGGCTAAAGAGCGTTTAAGTATGGGACATCCATACGATATAACAACCAATATTTTTGAAAAACTGCCCGAAATATTCAAAGAGACTTATCAAATAAAGGAAGAGGAAGTTAGATTCTATGGTAGATATAAAAACGAACGTTGCTATCGTTGGATAAAAAGGGAATATGTTGATTTCCACCCGAATTTGGATAAATATAAGGTGATTGTTCCTAAAAGCAACGGATCGGGAGCAATCGGAGAAGTACTGAGTTCACCGCTCATAGGAGAACCACTCATAGGAGTTACTCAGACATTTTTAACAATAGGAGCATTCGATACTCGGACAGAAGCAGAGGCTTGTCTCAAATACGTCAAGACAAAGTTTGCCAGAACGATGCTGGGGCTACTTAAAGCAACGCAACACAATCCCAAAGACACTTGGCGACTTGTACCATTGCAAGACTTTACCGCAGCGTCAGACATTGATTGGACACTTTCTGTTGCAGAGATAGACCAACAACTTTACCACAAATATGGGTTGGAAGCGGAAGAAATAGCCTTTATTGAAGAAAAGGTGCGAGCAATGGGGTGAAAAAAAGAGGCAAGTCATCATAAGGCTTGCCCTTTTTGGTTATCGTTCTGAGAGTGTTTGTTTAATGATTGCTTGCATCTCCTCGTCTATTCCAAATAGTCGATATAACTGTGTATTTACATCTTGTGCGTAATCTATTATTCCATTATCGTTCGTATAGTCCAATAAATCGGGTACAAGTTTCCCCAAAGAAGTTAAAGCTTCATCGGTTAGCAAGAATGCGTAGCGTATGAAGTCTGTTTGGCAGTAAGCATAGAAGTTTCGTGCTTCCTGCTCTGTTTCAAATGTTTTGAGGGCAACACGTGAACGGCCAAAGACAGAGTGATTGTCAAGCACAGCAATCTGATTGCTTCGTTTCTGTCCGCCAGCATTTGCGCTCGATACAATGACTTTCCACCGATGCAAATAGTCTAAGCCAGTTTTAATTACCGATTTATGAGCAATGAACCAACTTGCTCTCCCTGCCTTTCCTGCTTTGTCGTTCGTAAACAACTTTATTTCATCTTCGCCAAGTATTATTCCTTCTTGTAAAGGGCGTACTAAGGTCGGATTTTGCTCAACAAAATCGCTCTCTATAGAAAATAGACTGCGAGGAAGAACTGACCCATAGATATATTCAAAATGATGCTTAATGACAATGGCTTGAATCTTTTGACTGATTTGCTCGGCTATAGGATCAAGTGCCATTAAAATATCACCAGGGTTAGACATTGCAACTTCCATAGTTTTTCCTCCTTTGCTGTAAGCATAGATAAAGCCCTCTTTTTTCTTAGCATAATCCATAAATACAATAGAGAGTCCATCGGCAATGCCGACATCTTGGAAAATATCTGTAGAGTCAGGAAAGAAATGAAGACGGCAAAGCCTTCTATCGTTGATTTGATTATATTGGTGTCCGATAAAACTTTTTTTGCGTTCATATCTTTTTAATTCAGAATAATAAAAAAACGGGCTGATTGTTTTGCACATTCAGCCCTTCTTATTTCCTTTGTAGTTGTCATAATAAAACAATGAAATAAGATGAACAAAAGTACACATTTTATCGGACAGCCACTATATGTTCAACTGTTAAACTATTTTAATCGTGATAAAATTCTCTCTCTGAGCCAAGCTCAGGGAGGTGAACACTATATAAAGAAGTTTGATGCATGGCATCATCTTGTTGTCATGCTTTATGCAGTAATGCTGCGTTTAGACTCTCTGCGTGAGATAAAAGCCTCTCTCTTTGCTAATGTTAATCGCTTTAATCATCTTGGTTTAAAGCATTTCCCTTGTCGAAGTACCTTGTCAGATGCAAATAAACGCCGAGATTCCGAGATATTCGGTTCGATCTATATGAACCTATATGAGAAATACCGCCATGAGCTTTACTCGGACAGCCGAAATTGTGGACAGCCTAAATGGCTGAAGAATCTAAAGATAATAGATTCTACGACAATAAGTCTGTTTTCTAACTTGGTCTTTAAAGGTGTAGGACGTAATCCCAAAACTGGTAAGAAGAAGGGTGGAATAAAAGTACATACAGAGATATTTGCCAATGAGAATGTTCCAAGCGATATTAAGTTCACATCTGCAGCTAGTCATGATCAGTTTGCACTTATCCCAGAACGATACGCCAATGAGGACCTGATTGCTTTTGACCGAGCCTATATAAACTATGAAAAGTTCTCTGAACTGACGCAAAGAGGCGTTATATATGTAACTAAGATGAAAAATAATCTTAGCTTTGAAAGGATTGCTGATACAGATTACCAGATGACTACAGATTATGGAGCTGTACGCGTAGAAACCATTCTCTTCCATAAGCATACAAAGGAAAAAGATATTTACCATAAAGCAAGAAAAATCACATATCAGGATAAGACCAAGAAAGGGAAAATCAGATTCATATCTCTGCTGACCAATGATTTTCAGATGTCAGCAGAAGATATTATAGCTATCTATAAGAGACGATGGCAAATAGAAACCTTATTTAAACAAATAAAACAGAATTTCCCGCTAAGATACTTCTATGGAGAGAGTGCGAACGCTATAAAAATACAAATATGGATTACACTTATAGCCAATCTGCTTATAACCCTAGTGAAGAACAAAATAAAGAGACCTTGGAGTTTCTCAGGCTTGGCAACAATGATAAGAATTCTACTTATGAGTTATGTCTCAATACAGAGTTTCTTTGAACGGCCACATAGAGACTGGGATAGATTGATTACCCAGGTAAAAGCCCCACCAGAAGAGTTGTCATTATTCTAGTGGGGGGGCTTGGAATTTGAAATAAGAACAAATCATGCCTATTTCAGCAGGATTGAGAGAGGATATTGCAGTATATAGAGGTTTTATCGGACAGCAATAATTTGATTATAGCCAAAATTTGCCAGTCCCTTGCCTGAACGATGTATCCACCGCCCAGCAGGATAAATGAGCGAGGAAAAACGTGGAGCAAGTCTGTCGGCAAGTAATTGGAAATGTTGGAATATATTTACAACAGCTTTTTGTCCATTATCAGTATCTTTCTTTGCTACAGTCAGCTGATACGGTGGGTTGCCTATAATAGCGTAAATCGGCGAGATTTTTGCAATATTACTTAGATACAAAATCAAATAATATGGTCAAATCATTTAAGCAACAACTCAAGAAAGAAGAATTATCGGACAATACGATTACAGCTTATCTGTATGCTGTAGAAGACTTTGAGCGAAAGTATTCTTGCTTCAATCGAGAAAATCTTTTGCTTTACAAAGCAGATCAAATAGAACAATTTAAACCCAAAACTGTAAACCTTCGCATTCAGGCTCTCAACAAATACTTAGAGTTTATCGGTAAACCTCGCCTTCGTCTCAAATCTCTCAAAATACAACAGAAAACTTATCTCGAAAATGTGATTAGCAATGCGGACTACCAGTATCTCAAGTCAAAACTCAAAGAAGGAGAAGAGGAAGTATGGTATTTCCTCGTTCGTTTCTTAGGTGCTACAGGAGCTCGAGTAAGTGAACTTATACAATTCAAGGTCGAGCATGTGAGAGTGGGTTACTTTGACATCTATACCAAAGGAGGGAAGGTTCGTCGGATCTTCATTCCGCAAGATCTCTGCATGGAGACAAAAGCATGGTTGAAGCGTGCCAATATAGATCTTGGATATCTATTTTTAGACAAACACGGCAAGCAGATTACTCCACGAGCCATTCGACACAAATTGCAACGCTTCGCCCATCAATGGGGTATAAATTCCAAGGTGATGCACCCTCATTCCTTCCGTCATAGGTATGCAAAGAATTTCTTAGAATCGTTCAATGATGTTGTTCTATTGGCAGACTTAATGGGGCATGAGAGTATTGAGACCACTCGAATTTACCTGCGTCGTACTTCCAATGAGCAACAGGCTATTGTAGACAAGATAATTACTTGGTAACCTCTTTGACTTGATAAACTATTGTTAGTTGTAGCAGTCTCTCTGTGCAATGATTTCGTTCTCAACATAGTTCATTTACTTTTTGCGCTCAATAAATCCGTAATGTCTCTCTCTTTATAATAACAATTGTGTTCTATCATTGAATGAGGAATTTTTACCCATATCTCTGTATGATTGCAAGGTACACTTACTGATAATCAAATGCTTACATGAATCTTCGTTATCAATCCACCCCTCACTATCGGGCGGATTGCCGAAGAGTTGTTCGATACGATGAATAAAGTTTGCAGAACAAAACAATTTATTCAAAATGTATTTTCCTGTCACTGATATCATCAAAAAGAGCCTATAAACAATTGATTTGCAGCACCTTTATATGAAATGTTAAATGTGACAGCAAAATAAATTTAAAACAACAATGCTTATATCACCAACTTTAGTCTGCTTTATTACTCCGCATATCGGAAACCTTAATTTAACTGTATAATAGCTTTTCGAATATTCTCTTTTGCAACGGTAGGTTTCATTGCTTCTGTAAGCATCATATCTGTTGGTGTAATACATTGGACTTGTGAGAAGCCTGCTTTAAGTAAAGCTGCCGCATCCTTTACCTTCCCAGCTATAAGCATAACAGGAACATTCTTACGAAGACCATATTCTAAAACCCTTATCGGTATCTTTCCCATTAATGTCTGACTATCTGCACTACCCTCCCCTGTTATAATCAAATCTGCATCTTCTATAAGTGAATTAAAATTGACAGTTTCTAAAAGAACATCTGCACCAGAGCTCATCTTGGCATTCATAAACTGCATAAAAGCATAACCCAATCCACCTGCAGCTCCAGCACCGTTATTCAAAGAACAATCAAAACCTAACTGAGCAGCAGCCATACGTGCAAAAGTACGTGCTCGACGATCCAAACAAACTATCATTTCTGGTGTCGCTCCCTTCTGTGGACCAAAAATGACAGCAGCACCACGTTCACCAAAGAGAGGATTATTTACATCTGAAGCTAAGGTTATATCTAAATCTCGTAAAAATTTATCACGCCAGTTCTTACCAAAAATGTCTTTCAAAGCTGCCAACATACCAAGTCCGCAATCGCTTGTAGCTGACCCTCCAAGTCCTACAACAAACTTTCTATATCCTCTTTGAAGTGCATCAGCAAAGAGTTCACCTAATCCATAAGTGGTTGCACGTAAAGGATTCAATTCCTGTTGCTTAAGAAAATTTATGCCACACGACAAAGCCGTCTCTATAACGACCGTATTATCTGCACATACGGCATAGCTTGCTTGAATGGGACGCATGAGTGCATCGTGGCAATTAATAGTAATCTCCTTACAGTCGAATAACTGTAAAAAGACTTCTAACATTCCATCTCCCCCATCCGTGACAGGAACCTTTACAACCTTAACTTCCTGCCACCGCTCACGAAGACCCTGTTCAGCAGCATCTTCAGCTTCAACAGAAGAGAGACAACCTTTAAAACTATCAATAGCTAAAATAATATGCTTCATATTGCTTACAAAGTTATTAAGAATTCGTGAATAATTTGGATTTTACATCTATTATATATATCTTTGCAAGATAAACTGTAATTATATACCTTTATATGAAACAATTTTTCAAATTCGTTTTTGCTTCTTTCTTTGGAATGATGTTGTTTAGCATCGTTACAGGACTCTTTGCACTTTTCACTATTGTTGGTATGATTGCATCGCAGGATACAACCAAAGAACCTGAAGACAACTCAATACTTGTATTGAATCTTTCAGGACAGATGTCAGAGAGAAGTGAAAATAATTTTCTTAGTCAGCTACAAGGTTCTCAAATAAATAGTTTGGGTCTTGACGATATGCTTGAAGGAATCAGAAAAGCTAAAGACAACGATAAAATAAAGGGTATATACATAGAAGCAGGTGCGTTTGCGTCTGATTCATATGCTTCTATGCAGGCATTGCGTAAAGCACTACTCGATTTCAAGAAGAGTAGAAAGTGGATTATTGCTTACGCTGACACCTATACACAGGGTACATACTACTTGTCGTCTGTTGCTGATAAAGTTTATCTTAATCCACAAGGACAGATTGACTGGCACGGATTAGCTTCTGAACCAGTTTTCATTAAGGACCTCTTGGCAAAATTCGGTGTAAAGATGCAGGTAGTAAAGGTTGGTGCTTACAAAAGCGCAACCGAGATGTTCACTGGCGACAAGATGAGTGATGCTAATCGTGAGCAAACATCAGCCTATTTAAACAGTATCTGGGGCAATATTACAAAGGAAGTTGGAGCAAGCAGAGGTTTGTCAGTAGCACAACTGAACGCATACGCTGACAGTATGATAACCTTTGCTGACCCACAAGAATATGTAAAATTAAAGCTCGTTGATGGCTTAGTTTATACAGACCAGATAAAAGGAATCGTCAAAAAGCAATTAGGTATTGAGGCTGACAAAGACATCAATCAGGTTACTATTGCTGACATGGTGAACACTGAGGACAAAAACCAAGGTGATAAGGAAAATGAAGTTGCAGTCTACTATGCTTATGGTGATATTGTTGATGGTGTTGTAGGAGGTCTCTTCTCACAAGGTCATCAGATTGACGCACAAGTTGTTTGCAAAGATTTGGAAGAGTTAGCAAAAGATAAAGACGTTAAGGCTGTTGTCGTACGTGTCAACTCTGGCGGCGGCTCCGCTTATGCATCAGAGCAAATCTGGCATCAGATTATGGAACTGAAAAAGTTGAAACCTGTCGTTGTTAGTATGGGTGGAATGGCTGCTTCTGGTGGCTATTATATGTCAGCTCCAGCCAACTGGATTGTTGCTGAGCCTACTACAATTACAGGCTCTATAGGAATCTTTGGTATGTTCCCTGATGTTAGCGGTCTGCTAAGAGAAAAACTGGGTTTAAAGTTTGATGAAGTTAAAACCAACAAATATGCTGACTTCGGCACACGCGCTCGTCCTTTCACAGAAGAAGAAATGTCATATCTTAGCCAATATGTAAACCGTGGCTATAAACTCTTCCGTCACCGTGTGGCAGAAGGACGTAAGATGACTGAAAAACAGGTTGAGAAGGTAGCACAGGGGCATGTGTTTACTGGTCAAGATGCACAAAAGATAGGACTTGTTGATCAGCTTGGTGGATTGGATGTTGCTGTAGCAAAAGCTGCACAACTTGCTAAACTACCAAATTATAGAAAATGTGCTTACCCTAAGGAGCCTAACTTCTTAGAGCAAATGATGGAGCAAACGAATCCTAATAATTACCTCAGCCAACAGTTACGTGCTAACTTAGGTGACTATTATGAGCCATTCACACTCTTAAAGACCATCGATCAACAGAGTGCAATTCAGGCACGTTTGCCATTTTATCCTAATATCCATTAATCATATATGGAAGGAGACCACATCAAAATAAACAAATGGCTATTGCCTTTCAGCTGGCTCTATGGGCTTGGAGTAAGACTGCGCAATGAGTTATTTGAACTGAACATTCTCAAATCCCGACAGTTTGACATTCCAGTCATATCTGTTGGTAACATTACTGTGGGTGGTTCAGGGAAAACTCCCCACGTAGAATATCTTATCCGATTACTGAAGGATAAAATGAAGGTGGCTGTTCTCTCACGTGGCTATAAACGTAAGAGTTGTGGATACGTATTAGCAAACGAGAATACTCCTATGCGGGAGATTGGTGATGAACCCTATCAAATGAAGACAAAGTTCCCTGATATTCGTGTTGCAGTTGATAAGAAACGCTGTGAAGGAATAGATCGGCTAACATCTGATGAAGAAACAAAAGATACAGATGTTATTCTCTTAGATGATGCTTTTCAGCACCGATATGTACATCCCGGTATCAATATCCTGTTAGTTGATTATCATCGACTTATCATCTATGACAAACTTCTTCCTGCAGGAAGACTACGTGAGCCACTCTCTGGAAAAAATCGTGCAGATATTGTTATCATCACGAAGTGCCCAAAGAGTCTAAACCCAATAGACTATCGTGTATTGAGTAAAGCTATGGAACTCTACCCTTTCCAACAGCTTTATTTTACGACGTTAGACTATTGTGATTTGGAACCTATCTTCAGTAAAGGAAGAAATATACCACTCACAGAAATAAGAGGAAAAAATATCTTGTTGCTTGCTGGTATCATGTCACCAAAGCAATTGGAGTTGGATCTAAACTCTTTCACAGGAAACAATGCACTGACAACACTATCGTTCCCAGACCATCATGCATTCACAACAAAGGATATTCATCGTATTAACGAGACCTTTGCTAAAATGCCTGAACCAAAATTGATTGTCACAACAGAGAAAGATAAGGCACGTCTTGTTGATATTGATAAATTATCAGACGATGTAAAAGAAAACATTTATGCACTTCCTATCAAGGTAAGCTTTATGCTTGACAAGGAAGAGGTATTCAATAAAAAAATAATATCCTATGTACGAAAAAATTCAAGAAACAGCATCTTGGCTAAAAGAGAGGATGACCACAAGTCCAAAGACAGCCATCATTCTGGGCACAGGCCTCGGACAATTAGCTTCAGAGATAACCGATAGCTACTCATTTTCTTATCAAGATATACCAAATTTTCCAGTGTCAACAGTAGAAGGTCACGCTGGTAGCCTTATCTTTGGACGACTTGGTGGTAAGGATATCATGGCTATGAAAGGTCGCTTCCACTTCTATGAAGGATATAACATGAAGGATGTTACCTTCCCTATTCGTGTAATGCACGAGTTAGGCATTGAAACATTGTTTGTTTCAAACGCTTCTGGCGGTATGAATCCATCGTTCAAGATTGGTGACCTCATGATTATTACAGATCATATTAATATGTTCCCAGAACATCCGCTACGTGGTCGTAACTTCCCTACAGGTCCTCGCTTTCCAGATATGCACGAGGCATACGACCATAAATTGGTTGATTTAGCAGACTCTATTGCTAAGGAGAAGAACATCGAAGTTCAGCATGGAGTTTACATGGGCGTACAAGGACCAACCTTTGAAACACCTGCAGAATACCGTATGTACCACAAGATGGGAGGTGATGCTGTTGGCATGAGTACCGTACCAGAGGTTATCGTTGCTCGCCATAGTGGTATTAAGGTGTTCGGCATCAGTGTAATCACTGACCTTGGTGGCTTTGATGTTCCTGTAAAAGTTAGCCACGAAGAGGTTCAGGAAGCTGCAAACGCTGCACAGCCACGTATGACAGAGATTATGCGTGAGATGATTAAACGCTCATAAGGCATATAAATATAAAAAGAAACTAAGGCGTTCTTTATTCTATAATTCATAGATAAAGGACGTCTTAGACATTATATCAATAAGCTATTAATAAAGTAAATGGAAATAAAAGACCTTGGTGAATTCGGACTAATCAACCGTCTCACAAAAGACATACAACCAATCAACAACTCTACCATTATGGGCGTGGGTGATGATGCTGCTGTTTTACACTATTCAGATAAAGAAACACTTGTTTCTTCACAGATGTTCATGGAGGGTGTACAGTTCGATTTAACTTACATAGACATGGAACATCTTGCCTACAAGGTGGCTATGATAGCTATGAGTAACATATTCGCTATGAATGGACAGCCACGGCAACTCATCGTTTCATTAGGGCTTGGAAAACGTTTCAAAGTAGAAGACCTTGATCAGTTCTATGCTGGATTGAACAAAGCTTGTGCTAAATGGAATGTAGACATTGTTGGAGGTGACACGACTTCTTCATATACGGGACTTGCTATTAACCTCACTTGTATTGGTGAAGCTGCAAAAGATGATATCGTCTATCGTAGTGGAGCAAATGAGACCGATCTTATCTGCGTTACCGGTGACCTTGGTTCGGCCTACATGGGCTTGCAAATTCTCGAACGTGAAAAGACTGTTTATTACCAACAAGTGCAAGAGTATAATAATAAGGTAAAGGAAGCGCAAAGCAATAAAGACGAGAAACGACTTGAAGCATTACGCCAGGAACGAGCAGCAATAGAGGATTTCCAACCAGACTTTGCAGGAAAAGAATACCTTATTGACCGCCAGTTGAAACCTGAAGCACGTGGTGCTGTACTCAGTCAACTGCGTACAGCAGGCATACATCCAACCTCTATGATTGATATTTCAGATGGATTGGCAAGTGAATTAAAACATATTTGCGAAAAAAGTCACTGCGGTTGTAGAATTTATGAAAAGAATATTCCTATCGATTATCAAACTGCAGCAACCTGCGAAGAGTTCAATATGAACCTTACAACAGCTGCTTTAAATGGTGGAGAAGACTATGAACTTCTTTTTACAGTACCTATTGGCGATCATGAGAAGATTGATAAGATGGAGAATATCAGACAGATAGGTTATATAACAAAGGAAAGCTTGGGTGCATTCCTTATTGCTCGTGATGGTAACGAGTTTGAGTTGAAGGCACAAGGTTGGCCTAAAAACGAGAAATAGCCAACTTGTTAAAGGATGTAAATAGAAACATTTTTTTTTTGAAAAACTTGGAGGAATAAAATAAAAGTACTACCTTTGCATCGCAATTGAGAAAACAACACAAACGTTACAAAAATTGATTGCAAATAACGATGGTGCCATAGCTCAGTTGGTAGAGCAAAGGACTGAAAATCCTTGTGTCCCCGGTTCGATTCCTGGTGGTACCACTCCTCCTTTCATCAGGGCGGTTCTCGCAAGAGGCCGCTCTTTTTTTTACTCATTATAACGGGGCAGTCCTAAAACCGGGACAGTAGATATTTAGTGGGGATAGGCATATATTTTGGCAAGCCTAAAAAGAAAGAAACTTTTATCTACAACTCCTCTTAAGTTAGCCCTAAATAGTTTTATTTTTGCATTGAAGGATTCGGCAGCAGCGTTTGTAGATCTGTTAATATAAAAGTTAAGTATGTCTTCGTAATGCTCGTAGAAAGTTGCAGCAATGACATTAAAGGAATGAAAGCCAGCTTCTGCGACGTTATTATACCATTTTGCCAATGATAGTCTGGCTGCATTTTTGATAGTATTCTTAGCAAAAATCATTCTTAGCGAATGTGATAAACCGTAAGCCTTCTTAATGTCAGGATATTCTCTGAACAGTATTTTGGCTCTTAGCTTCTGTTCATCAGTCCATTTCTCTGATGATTTGAACAACAGGTACCTACTTCTTACCAGCAGTTCACTACGTGTATCTCCATTTTCAAAAGTCAGTGGTTGATACGCTATCTTTTCTAACTTCGCATTCTCTTTCTCGTCATTTGCTTGCTGTAAAGCTGCCCAGCGATACTCTATTCTCATCTGTTGCACGGCATCACTTGCAAGTTTCTGTATATGAAAGCGATCAATCACTCGTTTAGCCTTTGGAAAACAATGCCTTACAATCTTACGCATACTATCGGATAAGTCCAAGGTGACCTCTTCTACGGCTTCCCGTTTCTTTTCATCTATCTTATCCAATACCTTGCAGACATCCAAGAATTTGGTTCCAGCAACAATGGCTACAAGACATCCTTGCTTGCCGTGTTTGTCCCTATTAGTGATAATCGTATAGAGTTCACCATTGGACAAGGATGTCTCGTCTATTGCTATGTTCTTACCAATATTATCCTCAAAGAGAAGCCACTTCTGAGCATGAGACAACTGATCCCAAGACCTGTAACCACTTAATGTTTCCTTGTATTGTTTCTCAAAACTACGTCCATTGATATGATAGAATTCCTCAAGCGTACGGCAGGTCACTGGGGATGTCTCCATACGTTTCTTTTAAAAAAGCTCCGAACTCTTTGGAATAACGAGTGCCGGAGGCTGTAATATCTATCTGCAAGGGAAGAGAGAAACTCTTACCTGTACGAATATCTATCCAGCGACGTCGACGAAGAACCAAAATAACCTTATGGTCACGAATCGGAAAGTCCGTTACCTCGACAGCTTCCATAAAACCTTTTGATTCAAAATGAAGATCATCGGAAAGTTCTTTCTCCATCTTCTCATCAAGATAAATACGTAACAATGAAGTATCAGACTCTATCTTAACAATAGAGAAATAATGTAATACATCACTGGGTAAAACGAATTCGGCTAACTGATATAAATATTTTTCTTCCATGATACAAAGGAAAGAAAAATATATTAATTAAGAAAATTTCCCCACCTTTTTTCTACTGAGCCCCTAAAACGGGTCAGTCATAAAACCCAGTTGCAAGTCTATACTCTTAAGCACACAATTTCATAAGTATTTATTACCTTTGCATCATGAAGAATCACCAAAACGGCTCAGTAGAAATTGTCTTTATAACTTGACGTAGCCCGCTACGTCTGCACCAAAAACAAACAAGATGCTCGATAATAAAACACAATATGTTTAGGCTCTAATGAACTATTTGGGTTAAAAAGAAAAAGCGATGACCTTCACAGGTGATCGCTTTCTTTACTCAATAGGTATTAGTCTTTTAAAGGTAAAAGATTGTTTTCAGGATAACAGTCACAAAGATATATTATTTTCGCGTCACATGCAAACTTTTAGTAGATTATTTTTTCATTATAATGATGTGTGCGAACACATGTTTTATACAAAATCTATTTTTTACGAAATAAGTCTCAAACCTCTTGTTTCATAAGGTTTTTTGTTATTTAAAAGTTCAGAAATTTAAGTTATTCCACTTGTATATTAATAGCCTTTTGTAATGCTATATGAATAGTTGTGCGTCTATACTTTATCATAACACTACCGATCCTTTTGTATAATAATTTATCCATTAAAATGCTTTTGTTATCCATAGTGTCTCCTTTATTCATAGATTATAATTAACTTTGCAAAGAATTTATGTATAAGTTAAATATTAAGTAAAAGCAAAACCTTCCATTACTACATTTATAAATGATTATGAAGAAAAGAACAATCGTGTTGTTTGGAATGCTCCTTATGGTTTTGACTGCTGTTGCTGGCGGCAAAATCAAGGTAGCATGTGTAGGAAATAGTGTGACATGGGGAATGACAATTATCGATAGAGAAAAGAATTGTTATCCAGCACAACTACAGAAGATGTTAGGAGATAAATATGAAGTCAGAAACTTCGGACACTCTGGAACTACACTACTACAACATGGACATCGTCCCTATGTCGACCAACAAGAGTACCAAGACGCTTTGAACTTCAAAGCAGACCTTGTCATTATCCACCTTGGACTCAATGATACCGACCCACGAAACTGGCCTGAGTACAGTGAAGAATTTAATGCTGATTACATCCGTCTGATTGATAGTTTCCGTCAGGCGAATCCTAAAGCAAAGATTTGGATTTGTCTTATGACCCCTATCTTTGAGCGTCATCCACGTTTTGAGAGTGGTACACGCGACTGGCATGCGCAGATTCAGAAACATATACGACAAGTGGCGACTGCGACACGAGTTCCACTCATTGACTTAAATACACCTCTATATAGTCGTCCCGATCTTCTTGCCGATGCTATTCACCCGAATGCTGAGGGCGCAAAGATTATTGCAGAGACGGTCTATGGAGCCTTGACAGGCAACTATGGTGGGCTTGCCCTGTCTCCATTGTATACCGATGGTATGGTTATTCAGCGCAACAAACCTATTGTTTTCCGTGGAAAAGCAAATGCAGGTGAAACTGTTAAGGTCAACTTCAATGGGCATATGCTTTCAGCTATAACAAATGATGCAGGAAAATGGAAGATTACCTTCCCTGCTGAGAAAGCTGGCGGACCTTATAAAGCCCAAATAAGTACGAAGAAAGAAAAGCTTACTATTAAAGACATATACGTTGGTGAAGTGTGGCTTTGCTCAGGACAATCAAACATGGAGTTACCTGTAAATGCTGTACAGAGCAGGACACAAGACTTGAATGAGGCTGACAGTCAGACACACTTACATCTTTTTAATATGTCAGCTATCTACCCAACGACTGCCATAGCATGGTCTGCTAATGCTTGCGACTCTGTCAATCGTCATCAGTACCTCCATATTGGACCATGGCGTAACTGCTCTCGAGAATCTTTGGGTGGCTTCTCAGCTGTAGCTTATCACTTTGGAAAGAAGTTAGCAGACAGTTTGCAGGTCCCAGTAGGTGTCATCTGTAATGCTGTTGGTGGTACCACTACAGAGTCATGGATTGACCGCCATACGTTAGAACAACGTATGCCAGCTATTCTTCGTGATTGGTATCATGGAGATTTCGGTATGAAATGGGCACGTGAGCGAGCATTGCAAAACATCAGCGTAAGTAAGAACCCACTGCAACGTCATCCTTATGCACCAGCCTATATGTTCGAGACAGGTATGCTTCCACTAAAAGGATATAGTATCAAGGGAATTGTTTGGTACCAAGGTGAGTCCAATGCACATAACATGGAACTACACGAACGTCTCTTCCCGATGTTACAGAAGAGTTGGCGTAATTTCTTCCATGACCCAGAGCTACCATTTTATTTCGTACAGCTCTCCAGCCTGAATCGTCCTTCATGGCCACGTTTCCGTGATTCTCAGCGTCGTATGGCATCAAGACTACGTAATACATGGATGGCTGTTACAACAGATGTGGGTGATTCCTTAGATGTACATTACACTAACAAAAAACCTGTCGGTGAGCGACTTGGCTTACAGGCTTTACACCATAGTTATGACTACAACATAGAATCTGATGGCCCTATCTGTCATTCTGTATCAGCAAAAGACAATGGAATTGAATTACAGTTTATTCACGCAAAGTCACTATCAGCTAAGGGCAGCCGCCTTATTGGCTTTGAGGTAGCAGGTGCAGATGGCATCTATTATCCAGCAGAAGCACAAATAACATCATCTAACACAATTCTTGTTAAGTCGTCTTCTGTAACACGTCCACTCTATGTTCGTTATGGATGGCAACCTTTTACACGTGCCAACCTTGTGAATGAGGTAGGATTACCATGTAGTACATTCCAATGGGCAGTAAAGAAATAAAGGTGTTTATAACCATCGTTCTTTGTTTAGAAAAATGATACAATTAAGTCAGACTGCCCACAGTCTGACTTTTTTTTATTAAGACCATTCCAGACAGAATCCTATTTACTATATTACACAAATAGATTTTCTCCCTTTATGCCCCTTGATGTATCCCTATATATTTGGTAGTTTAACCATATCATGCATCTACTATAGGACACTCTCAATAACTTTATAGCAAAAGATTTTAAGAACTTCATCATATTTATGTGGTGATTAGCAACTTTCTTTTTAACTTTGTACTCTATCTAAAAACCCCTATCAATGATTGATACAACAATGAATAACTATACCTTATATAATTCAAAGCACTCTTCTACCCTATTGATTACAACTTCAATAGCTGCTCTCGGCATTTCCTTCCCAGCCAAAGCACAACAAGTAAATACACAACCCAATATTATCTTGTTCATGGTAGACGACATGGGATGGCAAGACACATCTCTTCCCTTTGCCGATTCGATTACTGCCAATAATCGGAAATACGATACACCTAACATGGAAAGACTTGCTTCCGAGGGGATGATGTTTACAGATGCTTATGCAACTCCTATCAGTTCGCCCTCCAGATGTAGTCTGATGACAGGTATGAATATGGCTCGTCATCGGGTAACGAATTGGACTTTACATCGCGATAAGATGACAGATGGGAAACGAGATGGCGTAACATTACCTGATTGGAATTATAATGGTATTGCGCAAAGCGGTAATGTCGCACATACTACAAAGGCTATATCCTTTGTACAACTCCTAAAAAATGTAGGCTATCATACCATACATTGCGGAAAAGCACACTGGGGAGCTATTGACACTCCGGGTGAGAATCCGTGCCATTTTGGCTTTGACGTAAACATTACAGGTACGGCAGCTGGTGGATTAGCTACCTATTTAAGCGAACGTAACTATGGTTTTGCAAAAGATGGCAAACCCACATCTCCATTCGCTATCCCAGGCTTGGAACGTTATTGGGGTACTGGTATCTTTGCCACAGAAGCCCTTACACAAGAGGCAATAGCATCATTAGAGAAGGCTAAGAAATACGACCAACCTTTCTATCTCTATATGTCTCATTACGCTGTGCACGTACCAATCGATCGAGATATGCGTTTTTACCCTACGTATCGTGCACGTGGTCTTTCGGAAAAGGAAGCTGCTTACGCTTCATTGATTGCAGGAATGGATAAAAGTTTGGGAGACCTCATGGATTGGGTCGCAAAGGCAGGACTTAAGCGAGAGACCATCATCATCTTTATGAGCGATAATGGAGGACTCGCCTCATCATCCTATTGGCGGGATGGAGAACTTTACACGCAGAATGCACCACTCAAGAGTGGTAAAGGTTCTCTGTATGAAGGAGGTATAAGAGTTCCCTTTATTGTAAAATGGAATAATATTGTAAAACCAAATACTCGCTCTCATGCACCTATCATCATAGAGGACCTCTACCCTACCTTACTCTCCATGGCTGGAATTAAGAACTATCATGTACCACAAAAAATAGACGGACAAGACATTACACCTATTCTTCGTGGTAAACAACAAGGTGATAAGAAGCGACAACTGATATGGAACTATCCTAACATTTGGGATGGAGAGGGATTGGGAATCAGTCTTAATTGTGCCATTCGTGAAGGGCAATGGAAATTGATTTATTCGTATCTCACTGGTCAAAAAGAACTATACGATCTATCCAGCGACCTGTCTGAGAAGAATAATCTTGCTTCTTCTCACCCTCAACTCGTTGAACGCCTTTACAGACATCTCACATCTAAATTACATAAAATGAATGCACAGAAACCAATCGTAGAGGGTGAAAAAAGAAAATAAGAGTGGTTAGTTTAAGTAGACATAAGTTATTCATTGATTTTATTTCAACTGCATCATTCTACTCTTATAGGATAAAAGAGAGAATAGTATCTCCTTGAAGTGATATCTTATCAACTTTACGAAATAAGCTACTGACCCAACAGACTAAGTAAGTTCTTTGTAGTACAACTTGTTTATAACAATAAGTTGAACCTACTTTGGTGACATTTTGCGAGATATTTAGCAGGCAGCACCATTGGTGTTTACCAACAACACAACACGTGCGGAGCATCAACACGCTATGTGCGGAATAACAATACATCAGCTGAAGATTGTCAAAGAATCTCGTTTTGGTTATTATAAAGAAAACTGTAAAGTGCTTTATAAAGAACTAATATAGGACAACTCTACCAAACAGCACTATAAAAACGTCTTATTGATATAACGCTTTATCATAGTCAATAGTCTTCAAAACAAGACGACGACGATTGCTGTTGAAACTTGGAGAGATGACATTCCTCTCTTCGAAGTAGTCTTTTATCTTTATTCCAGCCAGATAAAGCAACAGATGTGGTAGATCATCTGTCATAAACTTATTACTTCTTGCCATCAATGTTTCCGTAATAATCTTGCGATGTCTCTGCTTATATATAGGCGAACACCATATCCAAAAAGGTATTTCAAACTCTTCGTGATACTTCCTGAGATTGATTTGTTCGACTTCAGTTAACCTACCAGCCATATTTACATCTTTACCATAGCAATCCTCTCCATGGTCAGACAAGTAGACAATAATGGCATCCTTGTTACGGAACTGTTCTACTATCTTGTTCAATACATAATCATTATAAAGCGTTGCATTATCGTAGTCTGCAATAGTTTGTTTCTCTTTATCTGTCAAGTCCATACGCTTATAATCCATTATTCCAAACTTCTTCATATTACTTTTACAACGCAATGAGTATTGAAAATGCTGTCCTAAAAGATGAAAGATTATCAGTTGTGGCTTCTTATAGCTAATGATATCTTTATAATCATTTAATAGGTCCTCATCATAGTTATGAATCGTAACATTTCTATAATCAAACATTTGCTTGTTTAATTGTGGATGATTTAAGAAGAAACCACCTACAAGATTATTCGTCCAATCAGGTGTATAGTTAATACCATACGGAAATTGATTAGACAAGAAACTAACATGATACCCCGCTTTCTTAAACACTGCTGGAAAGAGTACATACTTACTCCAATCACCTTTCTCATCAACGGATTGCAATGAGAATATTTGCTTGAACACTTTACTCGTTAGATTCCATGGTGAAACAACATTAGTAAACACTGCTAACGAATCTTTACCATTCATCATGGCAAGTTGATAAGGCGTTGTTGGCAAGGGATAGCCATACAGTTGAGAATGATGTCGGTTGGCACTTTCACCTATAACAAGTACAATTGTAGGAGATGTGAAAGAACATGAATCTACTTTAATTTGCTGATTAGCCATAATGACTCCATCAACCTGCTTTGCTATCAAGTGGTTTGAATAAAGTCCATAAACAATACGTTCGACTGGATGATAAAGATGTGCAAAACCATTAGTAACCGCAACCTCAAGTTCTGAGAGATTCTTAACTCTATAAAGCTGAACCTTATCATAAACACTTAAAGCTATTCCTACAAAGATTGTAAACATTAAAGCAAAGGCAACAAAAGGTTGCTTGAGATAAGAGGTTGGAAATGTCATCTTCTTTACCGCCATAACAATATGACAGAGTGCAAGAAAAAGAATAATATCAGCTGCGGAAAAGAAAAGTTTTAGATTAAGATATTGCAAGAAAAACTCTGTAGCCTCTCTGCCTGTAGTTTCTTGCGCTAAAAGAAGCATTGTCGGTGTAATTGGAGTATCAAAGAGTGTCTTACAACAAGTATCTATTATGGCTACCACATAAACAACACTACTCAACATAATCACCAACCCTTGACGTATTGTTTGTTGACAAAGACTCAGTAAAAAACAAATAATATATAGATCCGCAATCAATTCTATATAACCAAAAACATTACGATGACTCACATTCGTAAAGGCACCAACCAACAGCATAAACATAAAGAAAAGAGAATTCTCTTCTATTGGCTTTACCATCTTAGTTACAAAGGTACTTTTCATTTTCGTATTATTGATTTGCTTCATACTGTTTCATCCAATCATGATCTTAACATGCTATAATAATCAGGATTAGTTACAAAGGTACAAAATAGCATCCGTATTTTCAAATAATTGTTATTTACATAAGCGTTATAAATATATACATTATACCATAATTCTATATTGATCGTCTATTTATGTAACTTAACATAGCTCGCTATACCTTTAACCCAAAGATAGATAAGCTGCTCAATTATAAAACAAAAGCAGTTTAGGCTTTAACAGGCAGTTTAGCTTAAAAGGATAATTATAACTACTAAGCCTAATCATAAGAACGGTGAGCACATACCAACATGTAGCTCACCGCTCAAAATAGTTGTTTATTCTTCAATTGATTGGTTTCAACATAATAAGAAGAAACATCAAAAGAAAATAACGATAGAAGTTGGCAAATGGCTTAAGAATTATTTTCTTTTCTGTTTGCCAAGAAGTTCTTATATATTTTCTTTACCTGAGGTGCAAGGAAAATAATACCAATAAGGTTAGGAATTACCATCAGACCATTGAAGGTATCTGCCAATTCCCATACCATATCTGCTGCAAAAAGACAGCCAAGAAAGACAAAGAGAACAACAAGTGCACGATAAGAAAGAATACCCTTCTTACCAAACATAAACTTGATATTCATCTCAGCAAACATATACCACCCAATAATTGTTGTGAAAGCAAAGAAGATTAGACTGATGGCAAGAAAGATAATTCCACCATTTCCAAAAGCTATCTCGAAACCTTCTTGCGTAATAGCAACCGATTTTAAGCTTAGATTTTGGAAAGAACCTGTCAGCATAATGACAAAAGCTGTTGATGTACAGATAAGAACGGTATCAACAAAAACACCTGCCATAGCTACAAATCCCTGCTCAGAAGGGTCTTTAACATCAGCCAAAGCGTGAGCATGAGGTGTTGAACCCATACCAGCCTCATTCGAAAACAGACCACGAGCAACACCATAGCGAATGGCATATTTCATCACCGTACCAGCGGCTCCACCTGCTGCTGACTTCATAGAGAAAGCATCTTGGAAGATTGTACGGATAACATGAGGCAACTGATCAGCAAACATATAAATGATAACAAGCGAACCCAAGATATATACAACCGCCATGAAAGGGACTAAAAGTTCGGCTATAGCGGTGATTCGGCGTTGTCCTCCAATAATTACTACACCCACAACTGCTGCTAAAACAATACCTATAATATAAGAAGGTATATGGAAAGCATTATTCAAAGCTATTGAGATAGAGTTTGCTTGTACCATATTTCCAACGAATCCCAACGCAAGAACAATAGCAACAGAAAAACAAACAGCAAGCCACTTACTACCAAGTCCATAATAAAGATAATAGGCTGAGCCTCCGACCGTCTCGCCATGATACTCTTTCTTATACTTCTGGGCAAGTACTGCCTCAGAGAAAATGGTGCTCATTCCTAACAATGCAGATACCCACATCCAGAAAATAGCTCCCATACCACCACTTGCAATAGCTGTTGCCACACCACCAATATTACCAGTACCAACTTGAGCGGCTACAGCTGTGGCTAATGCTTGGAATGAATTAACTTTTGTTTCACCAGGCTTTCGCTTCTTGAATATTGGACCAAAAGCATACTTAAACGCCAAATTTAGATGGCGAATCTGTGGGAAACCAAGGTAGACAGTATAGAAAATGCCTACTCCTAACAATGCGTACATGAGAAAACTATTCCACAAAAACTCATTGAACGCTTTTACATAGTCTAATACATTCATCGGCTACATAGTTTAGATATTAGTAATAGATTAGTGGATAATACCACATTAGATTGTTTAGTAACAGATTAAATAATCAATCTTACGCCGCAAATATAGTATTTTATCTTTTACAAAGCAACGATAAAAGCAACTTTTTGAGTTAATCACAAAATTACTTTTGTATTAACATTTTCATCATGCAGAGTTCAATCTGCTGCTATTTTATGATCCAATTAACGGTGTAATTATCATTCATAAACTCATTTATATCTCTCATCTGCCTACATTGAAAATCTCATCATGAAATACTTCAACTATACATACCGCATTGAAAAGACTTTACATAAATCTAAATGAACCTATTAAAATAAAGACAAAAAACACACATAAATATCATCTTTCTAACCAACAGGGAATCAACCACTTAAGTTAAGTCATTTCAAAAGATGCTTAATAGGCTTCTTAAAGGGCGTTAGTAAGGCCTCAAAAGGGCGTCTTTTGAAAGCCTATTGGACGTTAATTGAATGCTAAAAGAGCATATATTAATAACGAGATAACTCAAAATATTTTACAAATATAAAATAAACAGAAAATAAACTACTCTAACTAAGATTGTATCGGATGATATGTTAGTTTAACCCAAATCAGTCATTGGAGCCTAAACATATTTTGTGTTTCTTATCGAGCAGATTGCCTGTCGTTATAACGCAGGCGTAGCGGGCTACGTCAAGTTACAAAGACACACAAGATACCGATAAGAAAATAAAAGATGTTTAGAAACAACACTATAGTAATAAGAATTATACATAACCGATTTGGGGTTAACTCTTATCCTGCTGAAAATCATCATGTCTAACACTAAAAGATTTCATCAAATAGCCATGTATTAATAAAAAGGCACAAGAAAAGTTATAAATAATTTGGAATTATCAGAAATTCTTGTTACCTTTGCACTCGCTTTATAAAAGCATAGGGAGATTCGCTAGCTCAGCTGGTAGAGCACAACACTTTTAATGTTGGGGTCATGGGTTCGAGCCCCATGCGAATCACTTGAAACAACAAGAGGAATATCCGCAAGGATATTCCTCTTTTGTCTTAGTATCTGTGTGCTATAATCCCCCCAATGATAGACATCAGGGCTCAGTAGAAAAAAGGTGGGGAAATTTTCTTAATTAATATATTTTTCTTTCCTTTGTATTATGGAAGATACTTATCTATATCATTTAGCATCGTTGGTTCTCCCTAAGGATGTATTGAAATACTTCTCTGTAGTTAAAATAGAACCTAGTCCTTCATTATTGCGTATCCATCTTGACGAGAAGATGGAGAAAGAACTTTCCGATGATCTTCATTTTGAATCAAAAGGCTTTATGGAAGCTGTCGAGGTGACGGACTTTCCGATTCGTGACCATAAGGTTATTTTGGTTCTTCGTCGACGTCGCTGGATAGATATTCGTACAGGTAAGAGTTTCTCTCTTCCCTTGCAGATAGATATTACAGCCTCCGGCACTCGTTATTCCAAAGAGTTCGGAGCTTTTTTAAAAGAAACGTATGGAGACATCCCCAGTGACCTGCCGTACGCTTGAGGAATTCTATCATATCAATGGACGTAGTTTTGAGAAACAATACAAGGAAACACTAAGTGGTTACAGGTCCTGGGATCAGTTGTCTCATGCTCAGAAGTGGCTTCTCTTTGAGGATAATATTGGTAAGAATATAGCAATAGACGAGACATCCTTGTCCAATGGTGAACTCTATACGATTATCACTAATAGGGACAAACACGGCAAGCAAGGATGTCTTGTAGCCATTGTTGCTGGAACCAAATCCTTGGATGTCTGCAAGGTATTGGATAAGATAGATGAAAAGAAACGGGAAGCCGTAGAAGAGGTCACCTTGGACTTATCCGATAGTATGCGTAAGATTGTAAGGCATTGTTTTCCAAAAGCTAAACGAGTGATTGATCGCTTTCATATACAGAAACTTGCAAGTGATGCCGTGCAACAGATGAGAATAGAGTATCGCTGGGCAGCTTTACAGCAAGCAAATGACGAGAAAGAGAATGCAAAGTTAGAAAAGATAGCGTATCAACCACTGACTTTTGAAAATGGAGATACACGTAGTGAACTGCTGGTAAGAAGTAGGTACCTGTTGTTCAAATCATCAGAGAAATGGACTGATGAACAGAAGCTAAGAGCCAAAATACTGTTCAGAGAATATCCTGACATTAAGAAGGCTTACGGTTTATCACATTCGCTAAGAATGATTTTTGCTAAGAATACTATCAAAGATGCAGCCAGACTATCATTGGCAAAATGGTATAATAACGTCGCAGAAGCTGGCTTTCATTCCTTTAATGTCATTGCTGCAACTTTCTACGAGCATTACGAAGACATACTTAACTTTTATATTAACAGATCTACAAACGCTGCTGCGGAATCCTTCAATGCAAAAATAAAACTATTTAGGGCTAACTTAAGAGGAGTTGTAGATAAAAGTTTCTTTCTTTTTAGGCTTGCCAAAATATATGCCTATCCCCACTAAATATCTACTGTCCCGACATCAGCAGTAATAATCGAAGCTACCTTACAATATTTAGTTGGGGTGTATGCCGCTTGTCATTAGAAAACTACAATAAAAAAGTAGGCTTCAAACTTTGATTTATAAAATTAGTTTTTTATATTTGCAGAAACTAATAATTTAAAATACAAACAAATAGAACAATAACCTTTTAGACTGGCAGGACGTAAACTCTCTGTCACCTATAAAACCTCGAAGACTAACCTCTTTATACCTATGGGCAGCATCGTTCCGCCTCAATCTACTAATGTGGCTTTAACGTTGAAAACCGATTACTGTGGAAATATGATTTATGAGAATGGACAATTAAGCAAGATTCTTACCGACATGGGATACATCACACTTGCAAATTCCACTCCAACATACCATTACTACTTGTAGGACCACCTTGGAAACAACCGTGTAGTGATAGATGAGCATGGAAAAGTGGAGCAAGTGAATCATTACTATGCCTTTGGTGGACTGATGAGTGAGAGTACAGGCGGAGGCGCACAGCCTTACAAGTACAACGGCAAGGAACTTGACTGTATGCACGGATTGGATTGGTATGATTATGGTGCACGCCACTATGATGCTGTGCTGGGAAGATGGATGTGTGTGTGGATCCGTTGGCAGAAAATCAACTTGGGAGCAAATAACATTTTATGTAGAAACTATAAAGCAATCAACAACCATAAAGGGACTTGAAAATTAGGGTTTAAGCTTTAGTGAAGCTATGGGAGCAGAAGGTGTTCATGAATCTGTGCATGCAGCTGACCCTGTAGAGGTTAATCGAGATATACGTTCCAGTCAGCCAGGTCAGAAAAAGCTTTCAGAATTTATTTATGAAAGGAAGGCAAGGCTAATGGAAGCTAAGTTTAGAGAAGAACTAAAGAAAAATAAAGACAATCAAAATGAGAAAAGAAAATTATAAGATAAAGCACAAATACCTATTGGGGATATTTATCATTAGTATGTTAATATTCTCAGGACCTTTATACTCTCAGAATAATATTATTCCGAGTAATTTAAGAAAGAGTTTATATAAGCTGTATCATTATAATACAACTTGGAAATCAACTATCTCAAAATTAAGTTGTGTTGAAGGAAAAAAACATGATGGTGTGTACACTTTTCGATTAAATTATCAACCCCATTACCCAACGAGAGTTTTCTTTATAAGTAATAATACACCTTATACAATAGAAAGCTTAGGCTTCGAGAATACAACAGGTGTATTACTTGAAGCTTGTCATTTTCTATCTTCAAAACATCAAACTAACAGTTATATAAAGTCCATTTTATCTGGTGTCTACAAGTATTTATTTGCAGAATACGGATTAACTTATGGTAGTGATTTCTTCGATAAGAAACCTGACTTCTCTAATGAAAGAGATAAGATGAGGTTTATTATTCATAGGGTTAACAATGTTGATAAACTTAGAAAACAAGCATTATATCTGAGAACTGACAGAAAAATCATTCCAGATAGTATTACGAATTATATTAAAGGTAATAATTTAAATGATGAAGAAGCCTTATTATTTTTGCGTTATATTATTTTGAATAATTGATTGTTACCAAAGTAATATTAGGAGAAATTCCCTAAGCTAACGAAGCTCCAATAAGCAGTGTTAATCAAAACTACCTTACAATATTTAACTGGGGTGTATATAGCTTGTTATTAGGAAATCGCAATAAATAGTAAGCTTCAAACTTTGATTTCTAAAATTAGTTTTTTATATTTGCAGAAACTAATAATTTAAAATACAAACAAATAGAATAATTACTCTTTGGACAGATAGGGCGTAAACTCTCTGTCACCTATAGAACCTCGAAGATTAACCTCTTTGTACCTATGGGCAGCATCGTTCCGCCTCTATCTACTAATGTCGCTTTAACATTGAAAACCGATTACTGTGGAAATATGATTTATGAGAATGGACAATTAAGCAGGATTCTTACCAACGTAAGATACTGGAAAGTCGAAAGTTTTTTGTTTTACTGATTATAAATAAGATTTTTTGTTATGTGCGAGTATCTAAACAAAAATGATTTAAACTCATTTGATTGTTATATAGGTCTTGGGATATCAAAGGGCTTATCTGTAGGAAACTTGGGACTTTTATATGGGACAAGTGCTGTACAGGGTGAAAATGGATATAATAAAACTGATTTTTCAAGGTCGTATGGAGTGAGTCTAAATGCCAATCCAGCAAGGTTTGATATGTCAAAAGGAGGAGGACGCACATGGATACCATTTTTTTAAGAAGACGTATTTTATGGCCAATGTTCTGTGTTATTCTCTTTTTTATTTTTATTAAACAAGATGATCTGAGTAATAACTCTTATAATCAATACGTTAAAGGGCAATCATATTGTGTGAAAGTCTTGAGCATAAAGAAAGAAGCACGCGAATATTATGTTTATGGTATCGATAGAAAGGGACAGATTCTTAAATTAGATATTTCTTCAAAGTGGGATCTACATACAGTTCAGGTAGGAGATAGTTTAATAAAAAAAAGAAATAGTTTTGACATAAAACTCGTATCGAAGCATAAATCAAGAATCCTAATCCCTGAACTTCCTCTATAATTATTGGCTTACGAGCAATAAGGAAGGGAATGAAAGCGTTATTGTTGACAGGATAGGTTACAATAAGAATGGTCAAACAATCTACATAAATCTTGACAATGATACGGAAACTATCCATACCTACGACAAGCAGCGTAAGCGTCTGTTATCAATGAACCTTACGTTAGACGGTCTATGCGAATCCTATTAGAATCTTAGATTTCATGGGATTGATTCCAATGAGGAAAGAAGCTAAACGTTATAGGGTGGAACACAATATAAATGATCGAGTACTTAACTGGTAATAAGATAGAATCAGATGATTATAAATTATATATTTTTTAGGATACATATGGCTTATAAAGCAAAACATGATTCTGCTATGTTAAATAGTATTTTGTATTTATCATGCGTACTTATGTTTGTTCTTCTTCCGATAGCAGGTGTCGTACTCGAGATAGCACGGAAAGGTGGTAGAATTAATACCGCGTTTTTTATTTTATATTTCATTTCAATCTTAGGTTTTGTTACAATGAAATATGGAAATAAGAAGATGGTAGAGCGTTTGTATAAGAAATATTCCCAGCATAAATTTAATAGGATAATCCCAACTTATTGCTTCTTCTTCATACTACCGATATGCATTATATTGGGAATGTCTATCTATATCATAATATTAAAAAACTTTGTTGACATTGACAAAATAAGAGATATTATATACAACATTCTAACAAGCTCATATTAAAATTGACGTATAATTGCCTTAAATATCCTTACTGCCAAACATCTATACACGCTACGGATCGACATATCTTTACAATTCAAAATACAAAAGAGAACTACACACCTTTTCTATTGTGCTTTCACTCCTATCACCTCTTTTCTCCAACTAAGCATCTAACAATAAACAATTTACGCAGAATGTTAAAAATGACAGCAATTAAAATTAGATTAGGTTTTTCATAGTAATTCCATATGTAACATTTGGAAAATAACACATAACGTACCAGCCTATTTTCTTATTGCTTAAAAGTATATTTACGAAATAAAGTTGTATATTTGCCCTTGAAAAATAATATCTTTAAGCATAAATAAGTAATGGGGTTTACACTATTTGGAAATCAGACAAGCGTATCTTCTAAATATAAGAACTCTCTGCCGACTTTTACGAAAATAGGAGTTGTCATTGTAAGTGTGGAAGATATTAGGACTTATGTCCATGGTGCTACTTACGAAAGCTGGACCGTATACGAATAATGAGTAAAAACAAAATAAGATGTTTAGTCTTTTTAATAAAAATAAAATAGAGGAGCAGGATTTCTACTTTCTTAAAAATGTAATTTGTATCTTGCCAACTAAATGGGACTTTTTAATAAAGCAAATAAATAGCAGATTCATTATAGGAAAATGTAAAAATACACTTTATGGGAAGGGATTCTACAACTTGGTTCTCAATAGAGAATACTATGACTATAGTAATTATAAATATCCAGAATTAGTAACATTATCAGGTATTTATATTTGGAACAAGAAAAAACGAGAATATGTAGAAGTACAACTATATATTTCGTTTGGTACAATAATTGGCTATTATTTTAATTCTAAATATAACCATTTAGATTGGCATAAAGTTTCTCTTAATACACTGAAAGAAAATAATTATGCAAATCATTCAAACGGAAAGAAAGACATCATACAAATGCTTTCGCAAAAACTTTCGCCAGAAGAACTAAAGAAAATAGATATAGGTGATATTAATGAACTTCAGTTCGAAGGAAATACATATTATACGATTAAGAATTTAAATGATGGAGATTATATGGCTATTAATAATACAGGCGAAGTCTTCATTATAACTCATGCTCCTTTTGAAGTAAAGAAACTATACTCTTCTATTAGGGCTTTTTTGCATCAAACGCTATAACTTTTATTCTGTGCAGGAAATATCATGTCACTTTTATCTATCTTTAAAAAATAGAACTGTCTAAAAGGAATATATATTATTGAGACAATTAAAAAAATAAAAGGAAGACGCTATAAATGTCTTCCTTTTATTTTATAATATCTTGATAAAAAGCTTATTACATCATACCACCCATACCTGGAGCAGCTGGCATAGCTGGAGTATCCTCAACCTTGTCTACAATAAGACACTCAGTTGTCAGGAACATACCTGCAATTGAAGCTGCATTCTCAAGTGCAACACGAGAAACCTTAGCTGGGTCGATAACACCTGCAGCACGAAGGTCCTCATAAACATCCTTGCGAGCATTGTAACCATAGTCACCCTTACCCTCACGAACTTTATTTACTACAACAGCACCTTCACCACCTGCATTAGCGATAATCTGACGGAGAGGCTCCTCAATAGCACGACAAACAATATTGATACCTGTCTGCTCGTCAGCATTCTCACCCTTAAGGTCCTTCAATGCTTCTTGAGCACGGATGTAAGTGGTACCACCACCAACAACTACACCCTCTTCCATTGCAGCACGAGTAGCGCAAAGAGCATCGTCAACACGGTCCTTCTTCTCCTTCATCTCTACCTCAGAGTTAGCACCAACATAGAGAACAGCTACACCACCAGAGAGCTTAGCCAAGCGCTCCTGCAACTTCTCCTTATCGTATGAGCTTGTTGAAGCTGCAATCTCGTTCTTAATTTGAGCCACACGGTCCTTGATTGCTTCCTTCTCACCAGCACCATCAACGATTGTTGTATTGTCCTTAGAAATGGTAACCTTCTTAGCTGTACCCAACATCTCAAGAGTTGCCTTATCAAGTGAAAGACCCTTCTCCTCGCTGATTACCACACCACCTGTCAACACAGCGATATCCTCAAGCATTGCTTTGCGACGGTCGCCAAAGCCTGGAGCCTTAACAGCACAAATCTTCAAACCTGCACGAAGACGGTTTACAACCAATGTTGTCAATGCCTCTGAGTCAACATCCTCTGCAATGACCAATAATGGACGACCACTCTCAGCAGCTGGCTGTAGGATTGGAAGGAAGTCCTTAACGTTTGAAATCTTCTTATCGTAGATGAGGATATATGGGTTCTCCATATCACACTCCATCTTATCTGTATCTGTTACGAAGTAACCAGAGAGGTAACCACGATCAAACTGCATACCCTCAACAACACCGATACTTGTCTCACGAGTCTTGCTCTCTTCAATAGTGATAACACCATCCTTAGATACCTTACGCATAGCATCTGCAAGCAACTTACCTATTTCAGGATCGTTGTTAGCACTTACAGTAGCTACCTGCTCTATCTTGTCATAGTTGTCACCAACTACCTCTGCAGAAGCCTTAATATGATCAACAACCTTAGCTACAGCCTTGTCGATACCACGCTTAAGGTCCATTGGGTTTGCACCTGCTGTAACATTCTTCAATCCTTCAGTAACAATTGCCTGTGTGAGGATAGTAGCAGTTGTTGTACCGTCACCAGCATCATCACCAGTCTTGCTTGCTACACTCTTAACAAGCTGTGCACCTGCATTCTCAAAGTTATCCTCAAGCTCTACCTCCTTAGCAACGGTAACACCGTCCTTAGTAATCTGTGGAGCACCAAACTTCTTACCGATAACAACATTACGCCCCTTAGGACCGAGTGTTACCTTCACTGCATTTGCCAACTGATCAACACCACTCTTCAAGAGTTCACGTGCGTCTGAATTGAATTTTATCTCTTTTGCCATTTTCTTTATTCCTTATTTTGATTGTTTATTTATGCGTTGTTTGCTATTCTTTTTTATCTCTTATATGTATAACATCATTCTACAAGATATAACAGAATAGCCAAACTACAGCTATTTTAAGACCTTTTTATAAAGAGAGAATAGGAGAAGTGTTACTCAACAACTGCCAACACGTCGCTCTGACGCATCATCAAATACTTCTCACCCTCATTCTCAAGTTCAGTACCAGCGTACTTACCATAGAGAACCTCATCGCCAACCTTGAGAATCATCTCCTCATCCTTAGTACCATTACCAACGGCAACAACTTTACCACGCTGTGGTTTTTCCTTGGCTGTGTCTGGGATGATAATACCACCAACTTTCTCTTCTGCCTGTGCTGGAAGCACGAGGACTCTGTCTGCTAAAGGTTTAATTGTCATAATTGTATATGTTTTAAGATTTTAATTTTCATTCTGTATATGACCTGTCACCAAGTCATTTCGCCTTCAATTATACGAAAACTATGCCAAAAGGACTTACTGACAATCTGTCAGTTCTTCTTACCTATTCGACTAAAAACAAAAGAAAATAATTATCTTTGCATTAGAAAAGAAGTTGATAAATTAATAAAGGGTTAACAATGAAGCAACTTACCAAGACTATAACAAATAAACTTCAAGCACTATCAGATGCGGAGAAGCGAGAGATATTCCCTAAGTTCTTTAAGGCTGGCAAAGGAGAATATGGTGAAGGCGACCGTTTCTTAGGTGTTACCGTACCCAATATCAGAGCTATTGCCAAGTTACACAAAGACATATCCATAGAGGAGATACGGGAGCTGATACAGTCAGAATGGCATGAAGTGCGCCTTTGTGCCTTAATCATAATGGTAGAGAATAGTAAGAAAAAAGACGAAGCTTTACGCAAAGAGCTATTCAATCTTTACCTTTCTCAAACTAAGCGAATCAATAACTGGGACCTTATTGACCTATCTTGTCGCTTCATCATAGGCGAATACTTACTTGACAAATCACGTGACATTCTTTATCATTTAGCTCAAAGTCCACTACTATGGGATAATCGTATCGCTATCGTATCAACATACGCATTCATTCGTAAAGGACAATTAGAAGACACCTATGCACTTAGCGACCTCATGATGCAGCACCCACACGACCTCATGCACAAAGCTATTGGTTGGATGCTTCGCGAAGCTGGGAAGCGTGATTCTGAGCGACTTTATGATTACGTGATGAGCCATCGAGCAGACATGCCCCGTACCATGCTACGTTATGCAATTGAGAAGTTCTCACCCAAAGAGCGCGCTATTCTCATGAAACGTGCCTAACACTTCCCCACTCTATTACTTCCATTTGGACGCTAATCATACGGTCAGAGAAAGAATAAAAACAACCCTTTTGTCATATTTTTTCATATAATCATTTTTAAAAGAGCACAAATTGCATTCAAATTAACGCCTAATTGACTTGCAAAAGATGCCCTTTTGAGGTCTTACTAACGCCCTTTTGAAAGCCAAGTAAGCACCTTTTAAAATCTAACCTTGTAACTAATTCATAACAAGAGAGTTACAAAAGCACTCAAAATAGTATTTTTTGGCGTAATGGACATTTGGTAGGCTGAAAACCTCTCGGAATTCTTTTATTTACTACCTTTGCAGCATTTGAAAATCTATGAATAAAAAAACTGTATCTTCTGTAATAGCTCAAATGTTAAAAAGAATGGGCATCGACGTTCTGTTCAGATGTATTCCTGCAAGGATTGTGGTAGACAATTTCAAGGTGGTCTGCGTATAAATAATATTTCTCTATGGAACGACTATCTGACGGCAAATCGAACGATATCTGATCTATCCACTCTTTATAAATGTTCAGAACGAACCATACGACGTAGGTTGAGCTTAGTAGTAGATAGCTTTACTGCTACTTACCCCAAATCTGCAGTAATAATATTGGATACAACATACTTTTCCAAGACATTTGGTGTGATGCTGTTTCAGGATGCTTCATCAGGCAAAATACTCTATCGCAAGTTTGTCAAAAACGAAACTAACAGAGATTATCTTGATGGACTTCGGTATATTACGGAGCGTGGAACTATGATAAAAGCAGTGGTGTGTGATGGGCATGTAGGACTTTTACAAGCTATAAGTTTCTGTCCCGTACAAATGTGTCAATTTCACCAATTTCAGATAGTTAGAAGACTCCTTACTAACAACCCACATTTGCCTGCAGGCGTTGAACTGTTGGCATTAATGAGAAGGATGTTCTCTATGAGAAAAGAAGAGTTTATAACCGCTTTTGATAAATGGTGTGATAAATGGAAAGAGTTCCTAAACGAACGAACTCTCCTAATCTCGGGCAAGACAACTTATACACACAGAAGGCTGAGAACGGCAAGACGTTCTATTAAGACACATCTGCCATGGATCTATACGTGTGAGGAGTATCCGGATATGCAAATACCTAATACAACAAACCTGTTGGAAGGATTTAACTCACAACTTAAAAGAGCACTACATAATCATAATGGATTGAATGAAGCTAACAAGAAGAAGTTTATAGATGGATTCATAAATACAAAAAAGTAG
Protein sequences of DBSCAN-SWA_1 >CP022040|1079793:1132954|1126333_1126678_+|ASE17403.1|DBSCAN-SWA MDTIFLRRRILWPMFCVILFFIFIKQDDLSNNSYNQYVKGQSYCVKVLSIKKEAREYYVYGIDRKGQILKLDISSKWDLHTVQVGDSLIKKRNSFDIKLVSKHKSRILIPELPL >CP022040|1079793:1132954|1088360_1089665_-|ASE17378.1|DBSCAN-SWA MTKALSLFELNSLVADVIDATLSRSYWVEAELSEARENRGHCYMELIEKNEGSNVPIARASAKCWSNIWTLIKPAFIRITGQEIRAGMKVMLQVHAQFHPQYGFSWIVDDINPEYTMGDMMRKRQEIIRQLKAEGVFDLQKELCLPMFAQRIAVISSETAAGYGDFCNQLETNDYGLYFHVELFPAIMQGDSVEQSIINALNQINSREEDFDCVVIIRGGGATADLSGFDTLNLAENVANFPLPIITGIGHERDESILDMVSFQRMKTPTAAAAYLIDHIASTLMRVENAQAAIIDGVKKALEVEKMRIQHIGSHIPVLFSVVRTKQDAWLEGLSQRLVMRMNEAIKQAEFYLSTLQNCMLLTLQNKLSVEQHRLDILEQRARLLDPSLLLKRGYSITLCNGRTIRNAKDLKIGDTITTRFETGEVESKVERLS >CP022040|1079793:1132954|1110133_1111243_+|ASE17391.1|DBSCAN-SWA MEIKDLGEFGLINRLTKDIQPINNSTIMGVGDDAAVLHYSDKETLVSSQMFMEGVQFDLTYIDMEHLAYKVAMIAMSNIFAMNGQPRQLIVSLGLGKRFKVEDLDQFYAGLNKACAKWNVDIVGGDTTSSYTGLAINLTCIGEAAKDDIVYRSGANETDLICVTGDLGSAYMGLQILEREKTVYYQQVQEYNNKVKEAQSNKDEKRLEALRQERAAIEDFQPDFAGKEYLIDRQLKPEARGAVLSQLRTAGIHPTSMIDISDGLASELKHICEKSHCGCRIYEKNIPIDYQTAATCEEFNMNLTTAALNGGEDYELLFTVPIGDHEKIDKMENIRQIGYITKESLGAFLIARDGNEFELKAQGWPKNEK >CP022040|1079793:1132954|1106376_1108155_+|ASE17388.1|DBSCAN-SWA MKQFFKFVFASFFGMMLFSIVTGLFALFTIVGMIASQDTTKEPEDNSILVLNLSGQMSERSENNFLSQLQGSQINSLGLDDMLEGIRKAKDNDKIKGIYIEAGAFASDSYASMQALRKALLDFKKSRKWIIAYADTYTQGTYYLSSVADKVYLNPQGQIDWHGLASEPVFIKDLLAKFGVKMQVVKVGAYKSATEMFTGDKMSDANREQTSAYLNSIWGNITKEVGASRGLSVAQLNAYADSMITFADPQEYVKLKLVDGLVYTDQIKGIVKKQLGIEADKDINQVTIADMVNTEDKNQGDKENEVAVYYAYGDIVDGVVGGLFSQGHQIDAQVVCKDLEELAKDKDVKAVVVRVNSGGGSAYASEQIWHQIMELKKLKPVVVSMGGMAASGGYYMSAPANWIVAEPTTITGSIGIFGMFPDVSGLLREKLGLKFDEVKTNKYADFGTRARPFTEEEMSYLSQYVNRGYKLFRHRVAEGRKMTEKQVEKVAQGHVFTGQDAQKIGLVDQLGGLDVAVAKAAQLAKLPNYRKCAYPKEPNFLEQMMEQTNPNNYLSQQLRANLGDYYEPFTLLKTIDQQSAIQARLPFYPNIH >CP022040|1079793:1132954|1086036_1086522_-|ASE17375.1|DBSCAN-SWA MNISKKMQDAFNAQIAAEMWSSNLYLQMSCWFRKEGWKGFSSWMYKQAEEERQHAMDMAQFVLHRGGEVILTSIDAVKTSWTDAKEAFVDTFAHEQKVTELINKLADVADEEKDRASQNFIAKYIDEQVEEEKNVKDILDSFAHLESHAIAHIDSKLEQAR >CP022040|1079793:1132954|1086987_1088004_-|ASE17376.1|DBSCAN-SWA MKDIEWSSLSFGYMPTDYNVRCYYRNGKWGEVEVSSDEYLKLHMAATCLHYGQEAFEGLKAYRCPDGKVRVFRVEENAKRLQNTSRGIVMPEVPTELFAEMVKKVVRLNQEYIPPYESGASLYIRPLLIGTSAQVGVRPAEEYCFLIFVTPVGPYFKGGFCANPYVIVRDVDRAAPLGTGMYKVGGNYAASLRANRRAHEQGYASEFYLDAKEKKYVDECGAANFFGIKDDTYVTPKSSSILPSITNKSLMQIAEDLGMKVERRPISEDELDSFEEAGACGTAAVISPISHLDDMETGKVYNFGDKPGPWSTKLYETLRSIQYGTIEDKHGWTTVVIE >CP022040|1079793:1132954|1127933_1128581_+|ASE17405.1|DBSCAN-SWA MFSLFNKNKIEEQDFYFLKNVICILPTKWDFLIKQINSRFIIGKCKNTLYGKGFYNLVLNREYYDYSNYKYPELVTLSGIYIWNKKKREYVEVQLYISFGTIIGYYFNSKYNHLDWHKVSLNTLKENNYANHSNGKKDIIQMLSQKLSPEELKKIDIGDINELQFEGNTYYTIKNLNDGDYMAINNTGEVFIITHAPFEVKKLYSSIRAFLHQTL >CP022040|1079793:1132954|1101388_1102318_-|ASE17383.1|DBSCAN-SWA MCKTISPFFYYSELKRYERKKSFIGHQYNQINDRRLCRLHFFPDSTDIFQDVGIADGLSIVFMDYAKKKEGFIYAYSKGGKTMEVAMSNPGDILMALDPIAEQISQKIQAIVIKHHFEYIYGSVLPRSLFSIESDFVEQNPTLVRPLQEGIILGEDEIKLFTNDKAGKAGRASWFIAHKSVIKTGLDYLHRWKVIVSSANAGGQKRSNQIAVLDNHSVFGRSRVALKTFETEQEARNFYAYCQTDFIRYAFLLTDEALTSLGKLVPDLLDYTNDNGIIDYAQDVNTQLYRLFGIDEEMQAIIKQTLSER >CP022040|1079793:1132954|1126976_1127402_+|ASE17404.1|DBSCAN-SWA MIINYIFFRIHMAYKAKHDSAMLNSILYLSCVLMFVLLPIAGVVLEIARKGGRINTAFFILYFISILGFVTMKYGNKKMVERLYKKYSQHKFNRIIPTYCFFFILPICIILGMSIYIIILKNFVDIDKIRDIIYNILTSSY >CP022040|1079793:1132954|1130516_1130786_-|ASE17407.1|DBSCAN-SWA MTIKPLADRVLVLPAQAEEKVGGIIIPDTAKEKPQRGKVVAVGNGTKDEEMILKVGDEVLYGKYAGTELENEGEKYLMMRQSDVLAVVE >CP022040|1079793:1132954|1126112_1126352_+|ASE17402.1|DBSCAN-SWA MCEYLNKNDLNSFDCYIGLGISKGLSVGNLGLLYGTSAVQGENGYNKTDFSRSYGVSLNANPARFDMSKGGGRTWIPFF >CP022040|1079793:1132954|1089693_1091115_-|ASE17379.1|protease|DBSCAN-SWA MKQTYLRNYIIFLFILVSATLLASNKSGGLISYPGGKCYMFRVTLKDKNGTPYSLDKPEKFLSKASLLRRERQGLKLDSTDLPVSPDYIQQIKKRGGKVVAVSKWNNSVLVRGNNRQTLEDLKVLSFVKDSKLVFVSPDSIRPLSQRVRYNSELQSLDSTVHDYYGMGKGQIESLNGRKLHNLGFMGQGMTIAVLDAGFMNVDKIAAFRNLNIKGTHNFVAGYDNDVYKEMDHGTKTLSTIAMNQPTVFVGTAPKADFWLLRTEDYSTESPAEEDFWIAAVEFSDSVGVDIISSSLGYHGFDDISMNYRYADLNGRTAAISQAASRLASKGMILVNSAGNDGMGTWKKINVPADAHDILTVGAVSLDKVNAPFSSIGPTADGRIKPDVMAYGCPTNVVSGRGYIIPDNGTSFACPLIAGMVACLWQACPNKSAKEIIDIIRQSGNNCQSPDNIMGYGIPDFWSAYQSATRYNQ >CP022040|1079793:1132954|1088131_1088320_-|ASE17377.1|DBSCAN-SWA MKEIKYEEAVHKLEAIVDKMERGELDIDSMAAQLKEAQELVKLCKQKLKRTDNEIQKLLEKQ >CP022040|1079793:1132954|1097386_1101349_+|ASE17382.1|DBSCAN-SWA MATFESSLKPRLIYVFAIADKQHEGSLKIGETTLGDDTGDTLAAPNSDVLNQAAKARIDQYTKTAGISYELLHTELTFYIRGGHICSFNDKQVHNVLERSGVKRKEFKGATEWYSCDLETVKRAIAAIKEGKDSLGAGEVTHTENPIILRPEQKDAVERTLKQFRRGNQMLWNAKMRFGKTLCALRVAKEMGAVRTIIVTHRPVVDASWFEDFGKTFHDSPEWHYGSHNKGESFASLQRLAGQGKKYVYFASMQDMRGSKEVGGKFDKNNEIFSTTWDLVIVDEAHEGTQTELGKAVLEQLISKNTKILRLSGTPFNLLDDHKEEEVFTWDYVMEQKAKIDWEINHLGDTNPYASLPAIHIYTYDLGRLMSEYSDEEKAFNFREFFRTREDGSFVHERDIDHFLTLLTTDDEESLYPYSNDSFRQIFRHTLWILPGVKAAKALSRKLAKHPVFGLFNVVNVAGDGDEEEESRDALELVNKAIGSDPDQSYTITLSCGRLTTGVSVKPWTGVFMMAGAYSTSAAGYMQTIFRVQTPYTHNGRMKTDCYAFDFAPDRTLRVLAETAKVSHKAGKQTEDDRKLLGDFLNFCPIIAIDGGQMKQYKVETMLAQLKRAQIEKVVQDGFENGALYNDELLKLTDVELKEFDDLKGIIGKTKAMPKSGDIDINRQGLTNEQYEEKEQLEKKKKKDLTPEEKKRLDELKAKGDQRREAISILRGISIRMPLMLYGAEMVDEDKELTIDNFAKLMDDQSWEEFMPRGVTKQVFARFKRYYDPDIFREAGKRIREMARMADKFTIEERIARLASIFATFRNPDKETVLTPWRVVNMHLGDSLGGYCFMNEDFTSNLDIPRYIEHKGVTTEVFHPQSVILEINSKSGLYPLYAAYNIYRTRLEQAREKYGEVNRATALMLWDLTLEENIFVVCKTPMARYITMRTLRGFRNTNVHTKYYPNLIESIITEPDSVVNMLRSGKRFWKINNDENMKIDAIIGNPPYQVTSENTSDAPVYHLFIDLASLLAQRVSLLTPARYLFNAGKTPKDWNTKILNDEHFKVVDYWANSTDVFPTVDIKGGVAVMYRDSKLNFGKIGTFTAYKKLNIIANKVCKISENGLFAELIYAPESYRLSDKLHEDYPWAKERLSMGHPYDITTNIFEKLPEIFKETYQIKEEEVRFYGRYKNERCYRWIKREYVDFHPNLDKYKVIVPKSNGSGAIGEVLSSPLIGEPLIGVTQTFLTIGAFDTRTEAEACLKYVKTKFARTMLGLLKATQHNPKDTWRLVPLQDFTAASDIDWTLSVAEIDQQLYHKYGLEAEEIAFIEEKVRAMG >CP022040|1079793:1132954|1116044_1117607_+|ASE17395.1|DBSCAN-SWA MIDTTMNNYTLYNSKHSSTLLITTSIAALGISFPAKAQQVNTQPNIILFMVDDMGWQDTSLPFADSITANNRKYDTPNMERLASEGMMFTDAYATPISSPSRCSLMTGMNMARHRVTNWTLHRDKMTDGKRDGVTLPDWNYNGIAQSGNVAHTTKAISFVQLLKNVGYHTIHCGKAHWGAIDTPGENPCHFGFDVNITGTAAGGLATYLSERNYGFAKDGKPTSPFAIPGLERYWGTGIFATEALTQEAIASLEKAKKYDQPFYLYMSHYAVHVPIDRDMRFYPTYRARGLSEKEAAYASLIAGMDKSLGDLMDWVAKAGLKRETIIIFMSDNGGLASSSYWRDGELYTQNAPLKSGKGSLYEGGIRVPFIVKWNNIVKPNTRSHAPIIIEDLYPTLLSMAGIKNYHVPQKIDGQDITPILRGKQQGDKKRQLIWNYPNIWDGEGLGISLNCAIREGQWKLIYSYLTGQKELYDLSSDLSEKNNLASSHPQLVERLYRHLTSKLHKMNAQKPIVEGEKRK >CP022040|1079793:1132954|1111540_1112533_-|ASE17392.1|transposase|DBSCAN-SWA METSPVTCRTLEEFYHINGRSFEKQYKETLSGYRSWDQLSHAQKWLLFEDNIGKNIAIDETSLSNGELYTIITNRDKHGKQGCLVAIVAGTKFLDVCKVLDKIDEKKREAVEEVTLDLSDSMRKIVRHCFPKAKRVIDRFHIQKLASDAVQQMRIEYRWAALQQANDEKENAKLEKIAYQPLTFENGDTRSELLVRSRYLLFKSSEKWTDEQKLRAKILFREYPDIKKAYGLSHSLRMIFAKNTIKNAARLSLAKWYNNVAEAGFHSFNVIAATFYEHYEDILNFYINRSTNAAAESFNAKIKLFRANLRGVVDKSFFLFRLAKIYAYPH >CP022040|1079793:1132954|1080782_1083308_+|ASE17372.1|DBSCAN-SWA MIKKIFKFIRSIFRGIFNFFPWYAKLYKGRAWYTKMAVGTVSFFVAIFLYLGMVDINFLWLFGKSPGFIDIKTPPTYAASEIYSADSVLIGRFYKENRTPVKYEEVTPAFWNALISTEDERFYSHNGIDFMGIGGAIKDAVTGSGGRGASTITQQLAKNMFRVRTQYSSGLLGHVPGLRMLIMKSKEWIIAVKLELIYSKKEILTMYANTVDFGNNSFGVKTAAKTYFNTSPSKLSIDQAATLVGMLKATTYYNPILHPKNSIRRRNTVLYNMVTHNVLPHDEYALYSKRPMKLDIHVEENYDGQAQYFREYISEYFKDWMKDNGYDLYSSGLKIYTTIDTRMQKYAEQAATKQMEKVQQTFDNHWRGMQPWRDAKGNEIPGFIEGIAERQPFYKKLLQKYPNQPDSVLYYLNKPHKVTLFDYEKGHIEKEMSSMDSIRYMVKFMHCAMVAMEPETGAVRAWVGDIDFKTWKYDKVVAQRQPGSTFKLFVYSEAFNQGLTPCDKRRDEYISMQVLDKKTGQMKTWTPHNANGRFSNDSITLKSAFARSINSIAVRLGQEMGIKNIIRTAQEMGIKSPLDDEPSLALGSSDVNLLELVNAYSTVANDGEYHVPVVVTRILDKDGNEVYVAPKDHERALPYKTAFLMQEMLKAGVNEGGGTSQALRHYTFGDTDWGGKTGTSNNHSDAWFMAVSPKLVVGAWVGGEYRSIHFRTGALGQGSKTALPICGEFIYSLMRDKAFQKYHAKWQLDPDEDIDPSMYNCQPTVVRRAAPDSLRDFTTGHRRHQEEEEEPIDGHENTDEGGFILEPAPQPTPERQGNSSNNESPQIIKKAKKPRSEDMEI >CP022040|1079793:1132954|1124819_1125017_+|ASE17400.1|DBSCAN-SWA MGAEGVHESVHAADPVEVNRDIRSSQPGQKKLSEFIYERKARLMEAKFREELKKNKDNQNEKRKL >CP022040|1079793:1132954|1091918_1093334_-|ASE17380.1|DBSCAN-SWA MDEKQMDCSKKTLRRKLDLLLRTGQILMESSADTSRVKRNMERTAAYLGLPKENLHMNIDYYMLQVNVSDEYHSFSKMQRCDKHVINMLAIQEVSKLSWRAIQKDYSLDKYEEELEKIANGKHYYKDWIIAIGAGLACGGFCIQFGCDWTAFFYASIAAILGNRLRMFLNHSGSNLYANFAVAAFVSTILAWLSSFLSTPTVQAALPEFLRPILFTETPWHPLLACALYIVPGVPLINAVNDLLDNHINTGLVRAMNTLLIVIAMSFGIMLAIKCGSFDGFAKDLPTIPHHSFYVYAVAAAISAMGFATIYNIPYRLMPWIAVGGIICVCTRNFVFLDPSTGNAGLGLGIVVGSLCGSALISIINIKAVHILHTPHQCITIPAVIPIVPGVLMYRALYGFMGMQGVVGEVTHAMSFAINGSLVLVCIALGVAIPNIFAKKWIAPHRKAKLQRMIDERRQRGKFVDLHQYNI >CP022040|1079793:1132954|1108161_1109343_+|ASE17389.1|DBSCAN-SWA MEGDHIKINKWLLPFSWLYGLGVRLRNELFELNILKSRQFDIPVISVGNITVGGSGKTPHVEYLIRLLKDKMKVAVLSRGYKRKSCGYVLANENTPMREIGDEPYQMKTKFPDIRVAVDKKRCEGIDRLTSDEETKDTDVILLDDAFQHRYVHPGINILLVDYHRLIIYDKLLPAGRLREPLSGKNRADIVIITKCPKSLNPIDYRVLSKAMELYPFQQLYFTTLDYCDLEPIFSKGRNIPLTEIRGKNILLLAGIMSPKQLELDLNSFTGNNALTTLSFPDHHAFTTKDIHRINETFAKMPEPKLIVTTEKDKARLVDIDKLSDDVKENIYALPIKVSFMLDKEEVFNKKIISYVRKNSRNSILAKREDDHKSKDSHHSGHRPRTISFRDNR >CP022040|1079793:1132954|1102369_1103602_+|ASE17384.1|transposase|DBSCAN-SWA MNKSTHFIGQPLYVQLLNYFNRDKILSLSQAQGGEHYIKKFDAWHHLVVMLYAVMLRLDSLREIKASLFANVNRFNHLGLKHFPCRSTLSDANKRRDSEIFGSIYMNLYEKYRHELYSDSRNCGQPKWLKNLKIIDSTTISLFSNLVFKGVGRNPKTGKKKGGIKVHTEIFANENVPSDIKFTSAASHDQFALIPERYANEDLIAFDRAYINYEKFSELTQRGVIYVTKMKNNLSFERIADTDYQMTTDYGAVRVETILFHKHTKEKDIYHKARKITYQDKTKKGKIRFISLLTNDFQMSAEDIIAIYKRRWQIETLFKQIKQNFPLRYFYGESANAIKIQIWITLIANLLITLVKNKIKRPWSFSGLATMIRILLMSYVSIQSFFERPHRDWDRLITQVKAPPEELSLF >CP022040|1079793:1132954|1096362_1097367_+|ASE17930.1|DBSCAN-SWA MNDMKPNIDISENELLRQSEEILVELLRDHTTQKNIFWATDDYASLGEAYSYHAPITIPCITGDNGFIIQPRVLKTREEQANRTKDKAEVFTPSWVCNAQNNQVDEAWFGRKDVFNHEHPETKTWTATTKPILFPDGKTWKDYVRSTRMEITCGEAPYLASRYDTTTGAFIPLSQRIGMLDRKLRVISENTTTTGEWLKMAQEAYKNIYGYEWQGDNLLLAREALLMTFIEYYTEKFGEKPQERSIKYIAYIIAWNIFQMDGLKGVVPDSCRHNEVIIEQTLFEAMERTVLCPGCQNETYKGHNGIYCLIRDWGHKDPVTGENNRKIRFIDLIK >CP022040|1079793:1132954|1130986_1131694_+|ASE17408.1|DBSCAN-SWA MKQLTKTITNKLQALSDAEKREIFPKFFKAGKGEYGEGDRFLGVTVPNIRAIAKLHKDISIEEIRELIQSEWHEVRLCALIIMVENSKKKDEALRKELFNLYLSQTKRINNWDLIDLSCRFIIGEYLLDKSRDILYHLAQSPLLWDNRIAIVSTYAFIRKGQLEDTYALSDLMMQHPHDLMHKAIGWMLREAGKRDSERLYDYVMSHRADMPRTMLRYAIEKFSPKERAILMKRA >CP022040|1079793:1132954|1123075_1124068_+|ASE17399.1|transposase|DBSCAN-SWA METSPVTCRTLEEFYHINGRSFEKQYKETLSGYRSWDQLSHAQKWLLFEDNIGKNIAIDETSLSNGELYTIITNRDKHGKQGCLVAIVAGTKSLDVCKVLDKIDEKKREAVEEVTLDLSDSMRKIVRHCFPKAKRVIDRFHIQKLASDAVQQMRIEYRWAALQQANDEKENAKLEKIAYQPLTFENGDTRSELLVRSRYLLFKSSEKWTDEQKLRAKILFREYPDIKKAYGLSHSLRMIFAKNTIKDAARLSLAKWYNNVAEAGFHSFNVIAATFYEHYEDILNFYINRSTNAAAESFNAKIKLFRANLRGVVDKSFFLFRLAKIYAYPH >CP022040|1079793:1132954|1112498_1112885_-|ASE17393.1|DBSCAN-SWA MEEKYLYQLAEFVLPSDVLHYFSIVKIESDTSLLRIYLDEKMEKELSDDLHFESKGFMEAVEVTDFPIRDHKVILVLRRRRWIDIRTGKSFSLPLQIDITASGTRYSKEFGAFLKETYGDIPSDLPYA >CP022040|1079793:1132954|1132108_1132954_+|ASE17409.1|transposase|DBSCAN-SWA MYSCKDCGRQFQGGLRINNISLWNDYLTANRTISDLSTLYKCSERTIRRRLSLVVDSFTATYPKSAVIILDTTYFSKTFGVMLFQDASSGKILYRKFVKNETNRDYLDGLRYITERGTMIKAVVCDGHVGLLQAISFCPVQMCQFHQFQIVRRLLTNNPHLPAGVELLALMRRMFSMRKEEFITAFDKWCDKWKEFLNERTLLISGKTTYTHRRLRTARRSIKTHLPWIYTCEEYPDMQIPNTTNLLEGFNSQLKRALHNHNGLNEANKKKFIDGFINTKK >CP022040|1079793:1132954|1083606_1085382_+|ASE17373.1|DBSCAN-SWA MSQKKVNIDVNTIDLDNTELQKALQIIQFTNNSLFLTGKAGTGKSTFLRYIAATTKKKHIILAPTGIAAINAGGSTLHSFFKLPFYPLVPTDKRYSARNLRSTMKYNGDKCKLLREVELIIIDEISMVRADIIDFIDKVLRIYNRNMREPFGGKQLLLVGDIYQLEPVLKEEDRRLLQPYYPSSYFFDAKVFQDYPLVSIELNKVYRQNDPHFISILDHIRTNQVTDTDFNHINERVGASLENTNKPEGDFTITLSTKRDTVDWINNEGLDSLDGDPVMFLGEIKGEFPESSLPTPIELNLKVGAHIMFIKNDIEKQWVNGTLGIIIGIDEEAGILYVHTEEGDDLQVQREMWENIRYRFNEEEQRIEEEQIGTYIQFPIKLAWAITVHKSQGLTFKNVNIDFTGGVFAGGQAYVALSRCTSLEGITLKEPLRRNEVFVRSEVTHFARHYNDNNIISTVLKQSKADKEYYDAVCAFDKGDFDAFLRSFFLAIHSRYDIERPAAKRFIRRKLDLINQLRNENEELKRQQDKKNEYLKELSVEYVMMGKECEREEMNEAAIANYEKAIALYPDNPTAQKRLKKLKPSTEKDNK >CP022040|1079793:1132954|1094082_1094823_+|ASE17381.1|DBSCAN-SWA MLYIFAQIVMRKYIFTAIIVSVTLTSCFSFFPNRYEKALVYNGFRANTNTGIATKVNIKGYYSLVNDSTDSVTVGGSSIISKGGMSTLAGYNPFILYEDGTYGNILFNAVEKDFYANEHYKRANVELYKECTPLNNIYLMRSGYYKLKEDTICVTTYIYYLLRTELVLLRYKIIDSSHLLLLDETYISGNKNENDTCVQNRIFEFIPAKTLPSPSLFPIKKKRWAWANKKDWKQFKHSIARTSKNK >CP022040|1079793:1132954|1085381_1085915_+|ASE17374.1|DBSCAN-SWA MEQKKSLSKIIATIVFGIVFIIIAIVSCNQSKTIEGSNNGDTVAAKEFTPHVENGKLNIDLQSITSIRFPKYKTTKAIPFIPDSVSLAADEESVESGNYSATLLLDTIPNKEFYQRIDLAAQHDTCWDINKFAYTYERKDKAGGVYKVVFSKGGQQIFVTHLNKDMIKNTNQTKPNK >CP022040|1079793:1132954|1113656_1115738_+|ASE17394.1|DBSCAN-SWA MIMKKRTIVLFGMLLMVLTAVAGGKIKVACVGNSVTWGMTIIDREKNCYPAQLQKMLGDKYEVRNFGHSGTTLLQHGHRPYVDQQEYQDALNFKADLVIIHLGLNDTDPRNWPEYSEEFNADYIRLIDSFRQANPKAKIWICLMTPIFERHPRFESGTRDWHAQIQKHIRQVATATRVPLIDLNTPLYSRPDLLADAIHPNAEGAKIIAETVYGALTGNYGGLALSPLYTDGMVIQRNKPIVFRGKANAGETVKVNFNGHMLSAITNDAGKWKITFPAEKAGGPYKAQISTKKEKLTIKDIYVGEVWLCSGQSNMELPVNAVQSRTQDLNEADSQTHLHLFNMSAIYPTTAIAWSANACDSVNRHQYLHIGPWRNCSRESLGGFSAVAYHFGKKLADSLQVPVGVICNAVGGTTTESWIDRHTLEQRMPAILRDWYHGDFGMKWARERALQNISVSKNPLQRHPYAPAYMFETGMLPLKGYSIKGIVWYQGESNAHNMELHERLFPMLQKSWRNFFHDPELPFYFVQLSSLNRPSWPRFRDSQRRMASRLRNTWMAVTTDVGDSLDVHYTNKKPVGERLGLQALHHSYDYNIESDGPICHSVSAKDNGIELQFIHAKSLSAKGSRLIGFEVAGADGIYYPAEAQITSSNTILVKSSSVTRPLYVRYGWQPFTRANLVNEVGLPCSTFQWAVKK >CP022040|1079793:1132954|1095323_1096370_+|AVV27025.1|DBSCAN-SWA MSFINTVNTEDLYHKVARLIEDSRARMVTSINLAEVYTKFRIGQYIVEEEQRGETRAQYGKQVLQSLATKLTKNFGNGWSYSNLRQMRQFFVIYSNLTTTGCQIDDFTPKFTLSWSHYLLLMRVEDPDARRFYEIESSQQQWSKRQLSRQIGSSLYERLALSRDKEGVMRLAQKGQVVEKPSDIIKDPITLEFLGLKPDSLYSESKLENAIIGRMQQFLLELGKGFLFEARQKRFTFEERHFYVDLVFYNRLLQCYVLIDLKTGSLSHQDLGQMQMYVNYYDRYIRQDFENPTIGILLCENKNDALVELTLPPNANIYASAYQLYLPDKTLLQSKVKEWISEFRDNAE >CP022040|1079793:1132954|1103954_1104758_+|ASE17386.1|integrase|DBSCAN-SWA MVKSFKQQLKKEELSDNTITAYLYAVEDFERKYSCFNRENLLLYKADQIEQFKPKTVNLRIQALNKYLEFIGKPRLRLKSLKIQQKTYLENVISNADYQYLKSKLKEGEEEVWYFLVRFLGATGARVSELIQFKVEHVRVGYFDIYTKGGKVRRIFIPQDLCMETKAWLKRANIDLGYLFLDKHGKQITPRAIRHKLQRFAHQWGINSKVMHPHSFRHRYAKNFLESFNDVVLLADLMGHESIETTRIYLRRTSNEQQAIVDKIITW >CP022040|1079793:1132954|1103685_1103949_-|ASE17385.1|DBSCAN-SWA MILYLSNIAKISPIYAIIGNPPYQLTVAKKDTDNGQKAVVNIFQHFQLLADRLAPRFSSLIYPAGRWIHRSGKGLANFGYNQIIAVR >CP022040|1079793:1132954|1120189_1121542_-|ASE17397.1|DBSCAN-SWA MNVLDYVKAFNEFLWNSFLMYALLGVGIFYTVYLGFPQIRHLNLAFKYAFGPIFKKRKPGETKVNSFQALATAVAAQVGTGNIGGVATAIASGGMGAIFWMWVSALLGMSTIFSEAVLAQKYKKEYHGETVGGSAYYLYYGLGSKWLAVCFSVAIVLALGFVGNMVQANSISIALNNAFHIPSYIIGIVLAAVVGVVIIGGQRRITAIAELLVPFMAVVYILGSLVIIYMFADQLPHVIRTIFQDAFSMKSAAGGAAGTVMKYAIRYGVARGLFSNEAGMGSTPHAHALADVKDPSEQGFVAMAGVFVDTVLICTSTAFVIMLTGSFQNLSLKSVAITQEGFEIAFGNGGIIFLAISLIFFAFTTIIGWYMFAEMNIKFMFGKKGILSYRALVVLFVFLGCLFAADMVWELADTFNGLMVIPNLIGIIFLAPQVKKIYKNFLANRKENNS >CP022040|1079793:1132954|1122723_1123110_+|ASE17398.1|DBSCAN-SWA MEDTYLYHLASLVLPKDVLKYFSVVKIEPSPSLLRIHLDEKMEKELSDDLHFESKGFMEAVEVTDFPIRDHKVILVLRRRRWIDIRTGKSFSLPLQIDITASGTRYSKEFGAFLKETYGDIPSDLPYA >CP022040|1079793:1132954|1118019_1119753_-|ASE17396.2|DBSCAN-SWA MKSTFVTKMVKPIEENSLFFMFMLLVGAFTNVSHRNVFGYIELIADLYIICFLLSLCQQTIRQGLVIMLSSVVYVVAIIDTCCKTLFDTPITPTMLLLAQETTGREATEFFLQYLNLKLFFSAADIILFLALCHIVMAVKKMTFPTSYLKQPFVAFALMFTIFVGIALSVYDKVQLYRVKNLSELEVAVTNGFAHLYHPVERIVYGLYSNHLIAKQVDGVIMANQQIKVDSCSFTSPTIVLVIGESANRHHSQLYGYPLPTTPYQLAMMNGKDSLAVFTNVVSPWNLTSKVFKQIFSLQSVDEKGDWSKYVLFPAVFKKAGYHVSFLSNQFPYGINYTPDWTNNLVGGFFLNHPQLNKQMFDYRNVTIHNYDEDLLNDYKDIISYKKPQLIIFHLLGQHFQYSLRCKSNMKKFGIMDYKRMDLTDKEKQTIADYDNATLYNDYVLNKIVEQFRNKDAIIVYLSDHGEDCYGKDVNMAGRLTEVEQINLRKYHEEFEIPFWIWCSPIYKQRHRKIITETLMARSNKFMTDDLPHLLLYLAGIKIKDYFEERNVISPSFNSNRRRLVLKTIDYDKALYQ >CP022040|1079793:1132954|1124997_1125657_+|ASE17401.1|DBSCAN-SWA MRKENYKIKHKYLLGIFIISMLIFSGPLYSQNNIIPSNLRKSLYKLYHYNTTWKSTISKLSCVEGKKHDGVYTFRLNYQPHYPTRVFFISNNTPYTIESLGFENTTGVLLEACHFLSSKHQTNSYIKSILSGVYKYLFAEYGLTYGSDFFDKKPDFSNERDKMRFIIHRVNNVDKLRKQALYLRTDRKIIPDSITNYIKGNNLNDEEALLFLRYIILNN >CP022040|1079793:1132954|1079793_1080705_+|ASE17371.1|tRNA|DBSCAN-SWA MKNKTLIVITGPTGVGKTETTLRIAEHFNVPVINADSRQIFSEIPIGTAAPTAEQQQRVQHYFVGNHHLEDYYSASLYEQDVLNIINSQHTPISLLSGGSMMYIDAVCNGIDDIPTILPEIREKMMKRLEAEGLEQMCNLLRELDPEHWKIVDRNNPRRVIHALEICIQTGKTYTSFRSNTIKDRPFNIIKVGLNRDRDELYNRINQRVLDMIEEGMIEEALQVYPKRTLNSLNTVGYKEIFEYLDGLTTLDEAIFKIQSNTRRYARKQLTWYKKDTAFQWFNPDNIEEILNYVHTMISNTSK >CP022040|1079793:1132954|1105228_1106281_-|ASE17387.1|DBSCAN-SWA MKHIILAIDSFKGCLSSVEAEDAAEQGLRERWQEVKVVKVPVTDGGDGMLEVFLQLFDCKEITINCHDALMRPIQASYAVCADNTVVIETALSCGINFLKQQELNPLRATTYGLGELFADALQRGYRKFVVGLGGSATSDCGLGMLAALKDIFGKNWRDKFLRDLDITLASDVNNPLFGERGAAVIFGPQKGATPEMIVCLDRRARTFARMAAAQLGFDCSLNNGAGAAGGLGYAFMQFMNAKMSSGADVLLETVNFNSLIEDADLIITGEGSADSQTLMGKIPIRVLEYGLRKNVPVMLIAGKVKDAAALLKAGFSQVQCITPTDMMLTEAMKPTVAKENIRKAIIQLN >CP022040|1079793:1132954|1109224_1110034_+|ASE17390.1|DBSCAN-SWA MYEKIQETASWLKERMTTSPKTAIILGTGLGQLASEITDSYSFSYQDIPNFPVSTVEGHAGSLIFGRLGGKDIMAMKGRFHFYEGYNMKDVTFPIRVMHELGIETLFVSNASGGMNPSFKIGDLMIITDHINMFPEHPLRGRNFPTGPRFPDMHEAYDHKLVDLADSIAKEKNIEVQHGVYMGVQGPTFETPAEYRMYHKMGGDAVGMSTVPEVIVARHSGIKVFGISVITDLGGFDVPVKVSHEEVQEAANAAQPRMTEIMREMIKRS >CP022040|1079793:1132954|1128738_1130364_-|ASE17406.1|DBSCAN-SWA MAKEIKFNSDARELLKSGVDQLANAVKVTLGPKGRNVVIGKKFGAPQITKDGVTVAKEVELEDNFENAGAQLVKSVASKTGDDAGDGTTTATILTQAIVTEGLKNVTAGANPMDLKRGIDKAVAKVVDHIKASAEVVGDNYDKIEQVATVSANNDPEIGKLLADAMRKVSKDGVITIEESKTRETSIGVVEGMQFDRGYLSGYFVTDTDKMECDMENPYILIYDKKISNVKDFLPILQPAAESGRPLLVIAEDVDSEALTTLVVNRLRAGLKICAVKAPGFGDRRKAMLEDIAVLTGGVVISEEKGLSLDKATLEMLGTAKKVTISKDNTTIVDGAGEKEAIKDRVAQIKNEIAASTSSYDKEKLQERLAKLSGGVAVLYVGANSEVEMKEKKDRVDDALCATRAAMEEGVVVGGGTTYIRAQEALKDLKGENADEQTGINIVCRAIEEPLRQIIANAGGEGAVVVNKVREGKGDYGYNARKDVYEDLRAAGVIDPAKVSRVALENAASIAGMFLTTECLIVDKVEDTPAMPAAPGMGGMM |
41 | Lysinibacillus_phage(18.18%) | integrase,tRNA,protease,transposase | attL 1091372:1091391|attR 1111384:1111403 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1273406 : 1281876
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >CP022040|1273406:1281876|DBSCAN-SWA TATGGCAATAGGCAAATGGATTGGTGGCGCATTAGGATGGATACTTAGCGGAAGTATGCTGGGTGGTTTAGTTGGCTATTGTATAGGAACAATGCTTGACGAAGCATTTGCAGGCGATAATAGAGGTGGTGATAGGCAGAACGGATATGGCGAGCAAAGTCATTTTGGAGGCACGCGTCCGTTTGAAGAAGACCGCAACTCCTTTTTGTTCTCTATGCTTGTCCTTTCCTCTTACATTATAAAAGCCGATGGAAAGATAATGCACTCTGAGATGGAATATGTACGCCAATTCCTTCGTCATAACTTCGGCGAGCAGGCTGTAAGCCAAGGAGAATCAATTTTGCTAAAGTTATTTGATTTACAGAAGCAACAAGGCCCTTACCAATTTAAAGAAACAATCCGTAAGAGCTGTGTTGAAATACATTTTCATACAAGTGTCAGTCAACGCCTACAACTACTAAACTATCTTGTCATCATAGCAAAAGCTGATGGCATCGTAAGCCCAGAAGAAGTTGTTGCACTGAAAGAGATTGCCTCATATCTTGGGCTTTCTGCGCAAGACATAGAGTCCATGCTCAACTTAGAAAGTGGAGCAAAAGCAAGTAGCAACATTGAAGATGCTTATAAGGTGTTAGGCATATCTCCATCAGCAACTGATGATGAAGTCAAGGCTGCTTACCGCAAGATGGCACTTAAACATCATCCAGACCGTGTTTCTACGCTTGGAGACGATATCCGAAAGGCTGCTGAAAAGAAGTTCCAGGAGATTAACGACGCCAAAGAAAGAATCTATAAAGCAAGAGGACTGTGAGGAAAGTGGACAGTTTTATAATTGACAAGTAAATGCTAATCAATTCACAACAAACATCTTCATTAAGAAACAATCATAAATTGCACTATGGCAACAAGACTTAAAATAAAAACAACAGAAGGTGATATCATTATTCGCCTTTATGACGAGACTCCAAAACATCGTGATAACTTTTTAAAGCTTGCAAAGGAAGGGTATTTCAATGGTACACTCTTCCACCGAGTAATAAAGGATTTTATGATACAAGGTGGAGATCCCGATAGTAAGAATGCTCCGAAGGGTAAGATGTTAGGTACAGGTGGACCAGACTATACCATTCCTGCTGAGTTTGTGTATCCACAATACTTTCACAAGCGTGGTGCATTGAGTGCAGCCCGCACAGGTGATGAAGTTAATCCTGAGAAAGAAAGCAGTGGTAGTCAGTTTTACATTGTATGGGGAAAGACCTTTAAACCTGCAGAACTGAAACAAATGGAGCATCAGATGGCAATGCAACAAGAGCAGCAGGTATTCAATCAGCTTACAAGAGAACATCACGAGGAGATAATGAATTTAAGACGTAATCGTGACCGTGTAGGACTGCAAGAGCTACAAGACAAACTAATAGAACAAACAAAAACAACTTGTAAGCAACAAGGTAAGCCTTCCTTTACAGAAGAACAAATAGAGGTTTATACTAATGTTGGTGGTACTCCTTTCCTCGATAATCAGTACACCGTCTTCGGTGAGGTGGAGGAAGGCCTTGGTATCGTTGAACGTATACAAAACTGCGATACAGACCGCAACGACCGTCCAACAGAAGATGTTAAGATAGAAACTGTAGCCTTATTGTAAGCCTACATAATGAAACATAAAATGACATAATATCCTTGTATAGTTATTCCAGCAAGAAATAACTATACAAGGATATCTTATTTAACCCAGCAGGTTTAATCTTTCTTAGCGAAGATAGTATGACAGAAGTTCCAAGTTCTTCTGCTTAGTAGCATCTACTATCAAGAAGATATCTATAAGCCACCAGATACCACAACCACCACCTGTCAACAACTTACCAATACCTAAACCAACATCACCAATATAAAAACGGTCAACACCGAGTGTCCCGATAAATATTGACAAAAGAATTGATAGCAAAGGATCTTTCATCTGAGACTGGAGAGCCAATATCTCTGACTCACTAATATCAGTGTTTTCCAATCTTGTACGAACACTCGGAATACTTCCTGCAGGGATTTTTGATGAAAGCATCATCAACATATGATTCACCTTTTCTGATTCCATAATGATTATCTTAGGTTTAATAATTTACATAACTTCACATGCTCTCAAAACCCATATCCACTATATCTTTAGGATGTTTCTTACAATAAACCAAATGAAAAAGAGGATTATATACAACCATATAAGCCGCTTATCTTCTATTACCTTTGTGAGACTATCCTTAGCCTTCCCTTTTGGAAGTAAGGCACGTACTCCAAACGATAATATGTATGGTCCAGAAAATAGAAGAAAATAATTATAGTGAATTGCTTCTGAAAACCGCCCTTGCAATAAAGCATGCAACGCACGTTGAATGCCACAACCAGGACATTGTAACCCTGTTGCGACCTTAAGGAGACATTTAGGTGCCCATAAAGTCGTAACAGGGTTATAGAAATAATACAGAATTAATAGGGAGGCTCCCAAACAGAGCCAACCTATACACTTTATAGTACGTTTCAAGGATTTAGTGAGATCCAGCCATTGTGCTCAATGCAGCCAAACCACCGAAGCAAACAAAATAAAGGATAGAACCAATTAGACCCACAACAATACCAATAATACTCCACTTTTTAGCGTCATTAGCAGCTAAAACAGCTTCATCGTATTGTTTCATCATATAAAGCTCATTCACCTTCATAGCGCGAATGATTGCATAAATACCTGCTGGCAAGCAGCAACATACAGTTGTGAAAATCGCAAGAGCCAAATTGTTATTTGGCTTAGGAGTGTTCATTTCTTCCATAACTTTTTTTGAGCCGACTCCCCATGGCATTCTTTTTTTGTTTTATTATTTGAACCACGTTGCTACTATGGTGGGGAAGTCTTAAGTGACACCTCTTCACAAACAGGTTATTGTTTTCAGCTATTACTTAACCGAGGACAAATTTAAGTAATTTAGATTAAATATACAAGCTAAAGAAAAACTTTTCTTTCTATCATAAACAAAAAAATTACATTTTAAACAAATATAACATTTACAAATAAACCAAGAAAAGCGAGTAAACCTATATTAGTTCATCTATCCACAAATACATCAGGGATTCTTTAACATGTTAAAACTATCACAAAACTTTATACATTCTTCCCTAAGTTCATAAACGAAGTCAAATAAACCGAGAAACTTCCATAAACAATAATCGGATTAAAGTCAATTGACTCTAACCCGATTCTTTTTTATATTCTCTCAATTATTCTTCTTAACTACATGCGAATCGTGCAAGAATAAGTCTGTTTCTGATTTATGCCAAACAGTTTACACGACCTATATTGTATTCACGAATGACATTTGTCATATAATGAAACATCTTACGCATAACACTCATATTCTTTACTTTCTCTTTCATAACGTTTCCTTTCCTCTATTGAGTTATACAATCTTTTACCTATGTCCCTATTGAGACTGAAGCCCTTTCAGTGGCTACCTCTTTGTTATCCTATTGAGGTGCCGTTTGTGTTGTTTTGTGCTTGCAAAGTTAGTACACTTTCCAAAATCTCACAAAGGAAACTTACACTTTCACTTTGTTTTTCCATTAAATTTTGACCTGCGTCAATAAGATTGACACGTATCAATCCCGCTATTTGACATATATCAACCCATGCGTGCCTTTATATCAACATCCTCTAAACGAATATATTATAATAAAAAAGAAGACGGAAAGACAAGTATCACATTTTCTTTTCTTATCTTTGCACTATGAACACAAACGAAAATCATAGCTTTCGATTACCTGCAGAGTGGGAACCACAAAGCGGAGTTATACTGATATGGCCGCACGAAGATACTGATTGGCGTCCTTATCTTGAGGAGATTACTGAAGTTTATCTCCAGATGGCAGATGCTATTACCCGATACGAGGCATTGCTTATCACAGCACGTGAAACAGAAATGGTAAGTTCCTTACTGAAAGGAAGGCTGACAGAGGAACAGATGAAACGTGTAACGCTCTTTACTTGTGACAATAATGACACTTGGGCACGAGACGTTGCACCGATTTCGCTCATAGCAAACAAAGCATCGAAAGATAGTTTCCATCCCCTTCGTCTACTTGACTTCTGTTTCAATGGATGGGGTGAGAAGTTTGCAGCAGAAAAGGATAATAGAATCAATCGTCAGCTTCACGAGGCAGGACTCCTCCAAGGAGTATTAGAAAACCACAAGGATTTTGTATTGGAAGGTGGTTCGATAGAAAGCGATGGAAGCCATACACTCTTTACAACAACGAGTTGTCTGATGGCACCACATCGTAATCAACCTCTCACACAAGAGGATATAGACAAGCAATTACGCCTTTTCTTTCCAAATGTAGAGCGAGTTATATGGTTAGATTATGGTCAGTTAGCAGGTGATGATACAGACGGACATATTGATACTATCGTACGTATTGCACCAAATGACACCTTATTATATATAGGATGTGACGATAAGGAGGATGAGCATTATGAGGACTTCCAACTCTTAGAGGAACAGCTAAAACAGCTCCGCACACAGAAAGGTGAGCCTTATCGCCTACTTCGTCTCCCTATGCCTGATGCTATCTATGATGATGACGAGCGTTTGCCTGCAACACACGCAAACTTCCTTATCATCAATGGAGCTGTACTTGTTCCAACCTACAACCAACCCGAAAAGGACAAAGAAGCTTTGGACACAATTCAAGAAGCTTTCCCTGATCGTGAAATCATAGGTATTGACAGCCGTACTATTATCAGACAGCATGGTTCCATACACTGTCTGACAATGCAGCTACCAGCAAACGGATAAAACCACTCTTCGTTTGAGTTGGTAACAAATGGAATAGTAATAAAAGAATACAGAACAATAAACATAAACAGGGCTAACACCTTTATTATGAGAGAACTTAAAATAGGCTTTCTACAACAACATAATGTTGAGGATATCAAAAACAATATAGAGCGATTGGCTGAGGGAATTACAAACTTAGCACAACGTGGTGCGGAACTTGTTATCCTCCAAGAACTGCACAATTCACTTTATTTCTGTCAGACAGAAGATGTGAATAAGTTTGACTTGGCTGAGACTATACCGGGTCCCTCTACTGGCTTTTATGGTGAGTTGGCACGTGAACTGGGTATTGTCATTGTGACATCACTTTTTGAAAAACGTGCACCAGGACTTTATCATAACACAGCCGTAGTGATAGAGAAGGATGGTAGTATTGCTGGTAAATATCGCAAGATGCATATTCCAGATGACCCAGCTTACTATGAGAAGTTCTACTTCACACCAGGTGACCTCGGATTTCATCCAATTGATACAAGCGTAGGACGCCTTGGTGTACTTGTATGCTGGGATCAGTGGTATCCCGAAGCAGCCCGTCTAATGGCATTGCAAGGTGCAGATATGCTTATCTACCCTACAGCTATTGGCTACGAAAGTAGTGATACAGACGAGGAAAAACAGCGTCAGCGTGAAGCTTGGACAACAGTCATGCGTGGTCATGCCGTTGCTAACGGTTTGCCAGTAATCGCTGTAAACCGTGTCGGTCATGAACCCGACCCAAGCGAGCAAACTCAAGGTATTCAGTTCTGGGGAAGCAGTTTTGTTGCTGGACCACAGGGAGAACTACTCTATCGCGCTTGTGACAACGATGAGGACAGTGTTATTCTCAGCATCAACCTCGACCATAGCGAGAATGTACGCCGTTGGTGGCCTTTCTTGCGTGACAGAAGAATCGATGAGTATGGGGAGATAACTAAAAGGTTTATAGATTAAAACCTTCGCCTAACTCTTCCGAAGGAGGGGAAAAGCTCCACCACTCTCCAGAATCAACGGTCACTCACTGAAGGGAAGTAATGAGAGAGGGAGTCTTAACGGAATAAAAGATTCTCTCTTTATTAATAATTACAGCTTGATAGGACTTAAATAAGGTTCATCCGACATTTCGAGTGATGAACCATAAGCTCTTATATTCATATAGTTACGAAGTAATAGCCTCCACTAAAAGCCTATAAAAAGCCATTTTATAAAAACAAAAAATCATAATTCAAGCTTTTCTAAAGTCGACGTTGCCGAACTGCTGTGCGACATCGTTAAATCCTTTTATATAAAGCTGTTATTAATGGAGATATGATGCCATTTGCTCATTTGCATTTAATATCGGAGTTGGAAGAGGATTTCCTTCTTCCGAATTTTTGAAAGAACTAAATAAAGGGCACTATGATGGGACACTGATGCTCCATTGGAGACGTCCAAGCGAAATAATAGGGAGAAGGAAAAAAGAGGTTGAATTGTTTAATCATGGAGTATACTAGAAATGAGAATGAAAATGTTAATTTTAATTGGCTTCATCACCGTAAGTTGTACATCAGTACCTAACAAGAAGTGTATTGTACAAAAAAATCAAGTAAGAGTCATAAAAACAGAGAAATCTACAGCATTCAAGTATTTACAAGGTATTTGGATAATACCACATGCAGCTGATATACGAATTGTTTTCAAAAATGACTCTACTTTTGAGTTTCATGATTATAACTCAATCAAAGATTCTATAGAGATTTTAAAAGGTATATATGTATTAAATAATGATCAACTTACACTTAGATATACTGATAGACCTCAACAGAAGTTTATCTATCACTTTGGCAGATATGAGGACGAGCGTTATATTAGAAAAGGAAAGTACTATTTTGTAAAACAATAAGTCACGACTTGAGAGTGATTTAATCCCCCCCCTTTCGGGGCAGGGGTATTTGTGGTACAAATGGTAATTTGTGGTACAAAGACACACCTTGCCAGTATAAAGAAAGAGAAAGTTTGTTATGTTCCATAAAGTAACCTTTCTGGGGTGCTTTTAATAGACGAGAGTTCAAAGTTCAAAAAGTAAGTCAAGAGATAAGTTTGGGCCTTCAGTATTTTGGAGTGATAGTGTTTAATATCTGATTATCAATAACTTTTATAATTTAAACACTTTTATATCCCCTTTGTTTACTGATATTAAAGAATACAAAGTACTAATTTAAAGGTTACGAAAGTTGTTGTTGTATCACATGGGTTATACACGCTATAAAACGCTTATAATGAATTATTTACAAATCTTCTTTACTAACAAGTGGCATAAAGGAAATTATCGCATCACTCCAAATTTCGGAAGAGCCTTAAATAACATCTCCTTCCACACGGAATATGATAATAAGTAGAACTTATCTAACAAGATAGTGCGACTTAATATGCCATCCGCACCATTGGTGTTTACCATGAGCACCACACGTGCTGAGCATCAACACATAGGCTGGATATGGATTGAGAGTTGCAATTGTTATACAAAAAGAGCCATACCAAACTTATTTTTAGAGTTTGATATGGCTCTTTTTTAGTTCTATAAATATAATATGTAAGCTTCTAAATCAACGGCATTCCTGCCTCTTCGCACTCTCTTCTCATTTCTTCTGGCCAGATACTTGCTTGTATCTCACCAATATGCGCCTTGTGCAACATTATCATACAAAGACGGCTCTGTCCGATACCGCCACCAATACATAACGGCAGTTTGTCTGAGAGTAACTGCTGATGGAAGAAGAGTTGCTGACGTTCTTCTTGCTTCTCTATCGTCAACTGACGGAGTAAGGCAGTCTTATCCACACGGACACCCATAGACGAAAGTTCTATACTACGATCTAAGATAGGGTACCAAATGAGAATATCACCATTCAAGCCAGTCTTACCATCTTCTGCAACAGTTGACCAGTCATCATAATCAGCAGCACGTCCATCGTGCTTCTCACCATTACTTAATCTACAACCAATACCCTCAATAAACACCGCACCATACTTCTTACAAACCTCATCTTCACGTTCTTTTGCTGTCATTGTAGGATACATATCCAACAAATCTTGCGCATGAATAAAATGAATTGTCTGTGGAAGGAAGGGTTCTAACTGAGGATAGGTCTCACAAGTAAGGAACTCTGTACGAAGGATAGCAGCATAAATACGACGAACGACATTCTCTAAAAAGCTACGTGTTCTATCGCCTTCGGTGATAACCGCCTCCCAATCCCATTGGTCAACATAAAGTGAATGAAGGTTGTCTAATTCTTCATCGGCTCGAATAGCATTCATATCTGTATAGACACCATAGCCCGGTTGAATGTCATAGTCAGCCAACGTCAGTCGTTTCCACTTTGCCAGAGAGTGTACTACCTCAGCCTCAGCTTCTCCCAAGTCCTTAATCGGAAAGGAAACAGGACGTTCAACACCATTCAAGTCATCATTGATACCTAATCCTTTCAACACGAACAACGGTGCTGTAACACGACTTAGTCGTAATTCGGTAGCTAAGTTCTGCTGAAAAAACTCCTTTATAAGTTTGATACCCTGCTCGGTTTGACGTTTGCCAAGTACTGCTTTATATCCTTCTGGCTTTATCAGTTGACTCAT
Protein sequences of DBSCAN-SWA_2 >CP022040|1273406:1281876|1274303_1275050_+|ASE17514.1|DBSCAN-SWA MATRLKIKTTEGDIIIRLYDETPKHRDNFLKLAKEGYFNGTLFHRVIKDFMIQGGDPDSKNAPKGKMLGTGGPDYTIPAEFVYPQYFHKRGALSAARTGDEVNPEKESSGSQFYIVWGKTFKPAELKQMEHQMAMQQEQQVFNQLTREHHEEIMNLRRNRDRVGLQELQDKLIEQTKTTCKQQGKPSFTEEQIEVYTNVGGTPFLDNQYTVFGEVEEGLGIVERIQNCDTDRNDRPTEDVKIETVALL >CP022040|1273406:1281876|1275942_1276221_-|ASE17516.1|DBSCAN-SWA MEEMNTPKPNNNLALAIFTTVCCCLPAGIYAIIRAMKVNELYMMKQYDEAVLAANDAKKWSIIGIVVGLIGSILYFVCFGGLAALSTMAGSH >CP022040|1273406:1281876|1275557_1275926_-|ASE17937.1|DBSCAN-SWA MKCIGWLCLGASLLILYYFYNPVTTLWAPKCLLKVATGLQCPGCGIQRALHALLQGRFSEAIHYNYFLLFSGPYILSFGVRALLPKGKAKDSLTKVIEDKRLIWLYIILFFIWFIVRNILKI >CP022040|1273406:1281876|1273406_1274216_+|ASE17513.1|DBSCAN-SWA MAIGKWIGGALGWILSGSMLGGLVGYCIGTMLDEAFAGDNRGGDRQNGYGEQSHFGGTRPFEEDRNSFLFSMLVLSSYIIKADGKIMHSEMEYVRQFLRHNFGEQAVSQGESILLKLFDLQKQQGPYQFKETIRKSCVEIHFHTSVSQRLQLLNYLVIIAKADGIVSPEEVVALKEIASYLGLSAQDIESMLNLESGAKASSNIEDAYKVLGISPSATDDEVKAAYRKMALKHHPDRVSTLGDDIRKAAEKKFQEINDAKERIYKARGL >CP022040|1273406:1281876|1275155_1275497_-|ASE17515.1|DBSCAN-SWA MESEKVNHMLMMLSSKIPAGSIPSVRTRLENTDISESEILALQSQMKDPLLSILLSIFIGTLGVDRFYIGDVGLGIGKLLTGGGCGIWWLIDIFLIVDATKQKNLELLSYYLR >CP022040|1273406:1281876|1277172_1278240_+|ASE17517.1|DBSCAN-SWA MNTNENHSFRLPAEWEPQSGVILIWPHEDTDWRPYLEEITEVYLQMADAITRYEALLITARETEMVSSLLKGRLTEEQMKRVTLFTCDNNDTWARDVAPISLIANKASKDSFHPLRLLDFCFNGWGEKFAAEKDNRINRQLHEAGLLQGVLENHKDFVLEGGSIESDGSHTLFTTTSCLMAPHRNQPLTQEDIDKQLRLFFPNVERVIWLDYGQLAGDDTDGHIDTIVRIAPNDTLLYIGCDDKEDEHYEDFQLLEEQLKQLRTQKGEPYRLLRLPMPDAIYDDDERLPATHANFLIINGAVLVPTYNQPEKDKEALDTIQEAFPDREIIGIDSRTIIRQHGSIHCLTMQLPANG >CP022040|1273406:1281876|1279753_1280140_+|ASE17519.1|DBSCAN-SWA MRMKMLILIGFITVSCTSVPNKKCIVQKNQVRVIKTEKSTAFKYLQGIWIIPHAADIRIVFKNDSTFEFHDYNSIKDSIEILKGIYVLNNDQLTLRYTDRPQQKFIYHFGRYEDERYIRKGKYYFVKQ >CP022040|1273406:1281876|1278327_1279212_+|ASE17518.1|DBSCAN-SWA MRELKIGFLQQHNVEDIKNNIERLAEGITNLAQRGAELVILQELHNSLYFCQTEDVNKFDLAETIPGPSTGFYGELARELGIVIVTSLFEKRAPGLYHNTAVVIEKDGSIAGKYRKMHIPDDPAYYEKFYFTPGDLGFHPIDTSVGRLGVLVCWDQWYPEAARLMALQGADMLIYPTAIGYESSDTDEEKQRQREAWTTVMRGHAVANGLPVIAVNRVGHEPDPSEQTQGIQFWGSSFVAGPQGELLYRACDNDEDSVILSINLDHSENVRRWWPFLRDRRIDEYGEITKRFID >CP022040|1273406:1281876|1280838_1281876_-|ASE17520.1|DBSCAN-SWA MSQLIKPEGYKAVLGKRQTEQGIKLIKEFFQQNLATELRLSRVTAPLFVLKGLGINDDLNGVERPVSFPIKDLGEAEAEVVHSLAKWKRLTLADYDIQPGYGVYTDMNAIRADEELDNLHSLYVDQWDWEAVITEGDRTRSFLENVVRRIYAAILRTEFLTCETYPQLEPFLPQTIHFIHAQDLLDMYPTMTAKEREDEVCKKYGAVFIEGIGCRLSNGEKHDGRAADYDDWSTVAEDGKTGLNGDILIWYPILDRSIELSSMGVRVDKTALLRQLTIEKQEERQQLFFHQQLLSDKLPLCIGGGIGQSRLCMIMLHKAHIGEIQASIWPEEMRRECEEAGMPLI |
9 | Catovirus(16.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|