Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP043449 | Mucilaginibacter gossypii strain P4 chromosome, complete genome | 9 crisprs | csa3,DEDDh,RT,cas3,WYL,PD-DExK,PrimPol | 0 | 0 | 2 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP043449_1 | 162220-162303 | Orphan |
NA
Consensus repeat of NZ_CP043449_1
|
1 spacers
spacers of NZ_CP043449_1
>1.1|162243|38|NZ_CP043449|CRISPRCasFinder GGCAGTGGCAGTTTGTATAGAGCGGCAATTTTTCAGTA |
CRISPR arrays and Neighbor proteins around NZ_CP043449_1
The CRISPR arrays of NZ_CP043449_1 >merge|NZ_CP043449|1|162220-162303|CRISPRCasFinder TGCAGTAAGTAGCAGTAGCAGTTGGCAGTGGCAGTTTGTATAGAGCGGCAATTTTTCAGTAAGTAGCAGTTGGCAGTGGCAGTT >NZ_CP043449|1|1|162220-162303|CRISPRCasFinder TGCAGTAAGTAGCAGTAGCAGTT GGCAGTGGCAGTTTGTATAGAGCGGCAATTTTTCAGTA AGTAGCAGTTGGCAGTGGCAGTT
>NZ_CP043449.1|WP_112653577.1|161854_162214_+|hypothetical-protein MNQERVCLDCGTPLQGRADKKFCNDLCRNNYNNQLNSNSYNLVRNINNILRRNRRVLEELNPTGKTKTTRKKLAAKGFDFDHITSIYQTKTGSTYFFNYEYGYLLLDNDEVLLVKREGE >NZ_CP043449.1|WP_112658804.1|161266_161683_-|hypothetical-protein MDLTDEQVLAQLKVRRVKLKMELDRVEVAIKAFENIGEINILDAMPYMMEDLEVDEDLLISTLMYNPKMTAEKKVIFTLSKIGKGDASDITEYILRIDGHIKDTKRAFERITYVCSRMFKSGKITAERVGKKNVYMLR >NZ_CP043449.1|WP_022833677.1|160815_161133_-|hypothetical-protein MSDNASNQPSLDEVVNSLSKAVVYLIKDITATRKTLSDGLNKIDDNFKEINKKIDALSKDTGENFVDVHMNLKSIKSEISKINKVTGYEETMKNLSIVHNKSKQQ >NZ_CP043449.1|WP_112658802.1|159688_160702_+|IS1595-family-transposase MEQEHPPLGKTLPFRTINDIAIHFRDKAVCIEYLTQLRWAGNVKCAHCGHDKVYELKGAYKGYKCAKCRKKFTAIKGTIFENSPIELSKWFMAIFILSTHRKGISSVQIARDIGVTQKTAWFMMQRVRYAFKMKSFESNEKIGKSTFDNKGNEVKAVVEVDETYIGGKVANMHKHKAEAIEKKGSSSKIGVIGAIERGGKVKLQPLKATDHENVIPFLVKSVHQGTKLMTDEHVAYNTMNRVYEHQTIKHMLKEYVRGEVHTNTIENFWSLLKRGVYGTYHFISPKHVHQYLEEFAFRFNSRELTEAQRFDKLISLSNYKITYKVLTYEPKETQANA >NZ_CP043449.1|WP_022833680.1|159004_159484_+|hypothetical-protein MYNNQFISELRAQRLNVLEQLKHIDAMLKLYGVNLDEEEIPAYSGIEALVYERPYKKDASNKEKIAGLLKLTNRFLSINEMTSLVMEFEPKSKVEEVKASLSSAKNILLKDGSIVKVQVGTNNSNTFYGSPTWIDEQGFPLPEHKYSDDAVQLKTKIVI >NZ_CP043449.1|WP_112683133.1|157685_157919_-|hypothetical-protein MGLPINVIRAIVKRKTANNLPSSSQTEILIPIESIAAYKHLSGVDYQLYLKKDCEINLGYEIESITGKIKSPHIQFL >NZ_CP043449.1|WP_112658800.1|157522_157714_+|hypothetical-protein MGNEIVRTIDQGAIELSNKVEKIANSILPLLNGLTAYECERVLSKAKAIVFEEIPIKIECEEI >NZ_CP043449.1|WP_112657835.1|156987_157326_-|single-stranded-DNA-binding-protein MNSLRNSVRLVGNLGMDPEVKVFDSNKKMVRLSIATNESYKNDKGEKITDTQWHNLIFWGTQAKLAEDLLKKGDEVAVEGKLANRNYTDKDGIKRYVSEVIVNEFLKVGVKG >NZ_CP043449.1|WP_112657833.1|156599_156851_-|hypothetical-protein MALHFVAEWIGAARLMPTPQQRILRDHCRSRLFYMRQQNKTQNRPLKIFESNNIKLRSVSSIKKKPAMDMIRMFATGATDKEQ >NZ_CP043449.1|WP_112657831.1|155686_156304_+|class-I-SAM-dependent-methyltransferase MKENRQGHWEHVYATKSSNEVSWTQTVPQTSLNFIHSFNLPKDAKIIDIGGGDSNLVDHLLNEGYRNISVLDISEAALNRAKMRLGPKANLVTWIVSEITDFKPSEIYDVWHDRATFHFLTTQQQIASYISIARNAVKRNLVVGTFSENGPEKCSGLSVNRYSKLQLEQVMSNGFQKLKCINEDHITPFNTVQNFTFCSFERCYT >NZ_CP043449.1|WP_112653578.1|162518_164075_+|amino-acid-permease MKLFIKKPIAQLMAASAETEKSLKRTLGVGSLIALGIGAIIGAGIFVRTAAAAGEHAGPAVTISFLIAAAGCALAGLCYAEFASMIPIAGSAYTYSYATMGEFIAWIIGWDLVLEYALGAATVAIGWSQYFNEFLTTFFNVHIPYAWSHSFMEVSNTTAGMYAAEMGTRGIVNLPAILILFLLTLLLIRGTAESAVVNNIIVIVKVAIVLMIIGLGWHFINPAFHTPYTIPADAGKIKVSAGVVDYADTFNHGWLGVLRGASVVFFAFIGFDAVSTAAQEAKNPQRDMPKGILISLVFCTALYILFSHVLTGLVSYKDFLIQGKEASVSYAIKTAMPGYGWLASFVTVSILAGFSSVILVMLMGQTRVFYTMSTDGLIPKVFSKLHPKFRTPYKSQWLFFVFVSLFAGFIPDKYVGDMVSIGTLFAFVLVCIGIFILRRTDPGIERPFKTPAYMIVCPLGALICLCMIASEGWENWARLIVWLLIGFAVYFGYSIKRSHVRHGKVEGANNPINPKFVE >NZ_CP043449.1|WP_112653579.1|164172_165336_+|MFS-transporter MKGDKNLWVLVFVCIINSLGFGIIVPILYSYGKTFGVTGETLGILTASFSIAQFFATPVLGSLSDKWGRKPLLVISLAGTCISFILFAEARSMIMLFAARILDGLTGGNVSVAQAMVSDTARPDNRARRFGILSSAFGFGFVIGPAIGGFLNSYGMQVPFYFAAGISLIGTLCSLFFLKETNPPDKSKKDSEKTKFSFVALITTLKRPVIGTAVFTGFMLTMAQFTMIIAFQTFTVDVLKINPTQIGILYAGFGVSGIIMQLCVPLFTKWYSSKSTILTLSTSLCFVAMFVTGLTNHFIAFVIGICIYGLFNGLRNPMLNAIIADHIDHQEQGKILGINQSYASIGQTLGPVTAGFAALLSVHAIFFLSSCYILAALLLSIRLKKKE >NZ_CP043449.1|WP_090528400.1|165348_165921_+|MarC-family-protein MPHPFIFKEIISVTMILFAIIDILGAIPVIIQLRQRVGHIESEKASIAVLVLMVTFLFIGDELLAVIGLDISSFAIAGSLVIFIIAMEMILGVDFFKEELPQAASIVPLAFPLIAGAGTMTTLLSLKSQYQTQNILVGIVLNTLVVYLVLKNVKWLERLLGPIGLSVLRKAFGIILLAIAIKLFRSNTHL >NZ_CP043449.1|WP_112653580.1|166076_167249_-|RsmB/NOP-family-class-I-SAM-dependent-RNA-methyltransferase MKAINQLKTFQRILGEYPADTPLSKFLPGFYRQNKQMGSTDRRVANRLVYNYFRLGRALPDVSEDERLLVAEFLCNTQTNSYLQHFKPEWAVCVGFSDDDKLALVKTAYPDFKLADVFPWSSQLSEGIDKEAFLKSFFCQPDLFIRVRNGYDHLVKAELTKAQVVFKDEGNGCYSLPNGTRLETIFPKQHWFEVQDYSSQQTGNYFKPQRWDSWWDACAASGGKSLLLHEDEPNIKLVVSDIRESILANLDERFQLAGLTKYQKKALDLTQNIDSVMHDYAFDGIILDAPCSGSGTWGRTPEMIAQFDVHKIEFFQKLQKSIAQNVVKYLKPGKPLIYITCSAFKGENEDVVDYLVNELGLKLEEKAVLKGYERKADTMFVARLSPSPIV >NZ_CP043449.1|WP_146750442.1|167245_168499_-|amino-acid-ABC-transporter-substrate-binding-protein MISVQNHRPLLSGNKWLPFFCIALLLAACSPKTRPVATTVKKPTDTEKKPDNTSEKPVKAPEQKVATIAMILPLNLEHLNPAQKYSPIQLSQANIAVEYYQGFKLALDSLTAYGNNYRLQIFDSKDEAMQAHDLALNAFIRSSDLIVGPVFPDGVKSFSAALSYSKGPILSPLSPANPSTIKSKNLITAIPPLEYHAWGAAEYINRTVKPKKIFVLRSGFNQESDYAINFKKAIDSLSKKKVKVTNVYVIRGKLSSLLPQLSKTEKNVFVIPATDQAFLGVTLRSLDTLNKHYPVMVFGHPSWEKFSFLKPQLLQRLNTHITSTEKINYKAGATITFLRNYRRAYHVEPTEYAIKGFDEGLYFGKLLFMDKGMQSIEETDFTGLHNGFHFVKKPGQGWINTHVNILMYTNFELKQVE >NZ_CP043449.1|WP_090529629.1|168455_169985_-|glutamine-hydrolyzing-GMP-synthase MQEKILILDFGSQFTQLIARRVRELNIYCEIHPFNHYPEIDSTVKGIILSGSPYSVRQEDAPHFEFEKFHTTRPILGVCYGAQYVAHFHGGEVLPSSTREYGRANLEYIKQDNPLFKDVPGGSQVWMSHGDTIATIGDNFEVIASTDSVKVAAYQVTGTQTYGIQFHPEVTHSIDGKQLLQNFLVDICGCKQDWTPDSFIETTVAALREKLGDDKVVLGLSGGVDSSVAAVLLHHAIGKNLHCIFVDNGLLRKDEFEQVLDSYQHMGLNIKGIDAKQRFYDALAGLTDPEKKRKAIGRVFIEVFDDAAHEVQDVKWLGQGTIYPDVIESVSVKGPSATIKSHHNVGGLPDFMKLKVVEPLNTLFKDEVRKVGKALGIDPNILGRHPFPGPGLAIRILGDITPEKVAILQEADAIYINNLRAAGVYDKVWQAGAIFLPVQSVGVMGDERTYENVICLRAVESVDGMTADWCHLPYDLLAKISNEIINNVKGINRVVYDISSKPPATIEWE >NZ_CP043449.1|WP_112653582.1|170314_171280_-|hypothetical-protein MTNNETVFTKDLQNKRLNVVRTFDAPLNLVWQAWTESEILDQWWAPHPYRTETKTQDFREGGYWLYQMVGPEHTEHPTWCKEEYKTIVVPQKIANAVSFCDENAVTNTNFPVMNWEKNFTGEGEHTTVNIDIYFDKVEDMQTIVGMGFQEGFTAGLSNLDHYLSTAFRIRKDLKPGNAARVTTYLNFPGNTEEALTFYKEVFKGEFTGKKLTRFSDIELPAEVRMNEADKKMIIHGELTIMGGHVLMATDAPESMGFKLQTGNNMHINVEPESREETERLFNELSVGGVVTMPLSDMFFGAYFAELTDKFGINWMLNYQNV >NZ_CP043449.1|WP_090528388.1|171276_171597_-|winged-helix-turn-helix-transcriptional-regulator MRRDVFQAIADPTRRAIISLLALQAMTPNAIAEHFQSSRQAVSKHIQILSECQLVNQKQTGREIYYHFNAQKMKEVDVWMDQFRALWETRFSQLDNVLQNLKNKQS >NZ_CP043449.1|WP_112653583.1|171884_172439_-|DUF1572-family-protein MENDYLTSVKKQFAYYKSLGEKTFEQLTDEQLYWQYNPESNSIAMIVKHMSGNMISRWTDIFTTDGEKPTRNREAEFTPSTPTRQTITETWEQGWQCLFDTLDKLTADDLGKIVYIRNQGHTAMEAINRQLAHYPYHVGQIVFLGKMLCNENWHSLSIPRGQSENYNADKFAQEKRKVHFTDEK >NZ_CP043449.1|WP_112653584.1|172549_173047_-|hypothetical-protein MKKYIDSGILEVFVMGIATDEEVRELMYMKAKHPEVEEALKQLETDMEKLAGEMAIAPPPHMWEKIEDEIDGLIHQGNPAQPIKFRTSGDGHNKHKKTSPEDQFIPIESESNHMRLHKSWRWIFAAVFVIGKIFLGFAIYFYLENRQSQQQLQELKTEVRELKKR |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP043449_2 | 458725-458817 | Orphan |
NA
Consensus repeat of NZ_CP043449_2
|
1 spacers
spacers of NZ_CP043449_2
>2.1|458751|41|NZ_CP043449|CRISPRCasFinder TGCGAACTGTGCAGGGCAAACGTGCAAAGCCCCCTCACCAC |
CRISPR arrays and Neighbor proteins around NZ_CP043449_2
The CRISPR arrays of NZ_CP043449_2 >merge|NZ_CP043449|2|458725-458817|CRISPRCasFinder CCCGTCATTGCGAGGGACGAAGCAATTGCGAACTGTGCAGGGCAAACGTGCAAAGCCCCCTCACCACCCCGTCATTGCGAGGCACGAAGCAAT >NZ_CP043449|2|2|458725-458817|CRISPRCasFinder CCCGTCATTGCGAGGGACGAAGCAAT TGCGAACTGTGCAGGGCAAACGTGCAAAGCCCCCTCACCAC CCCGTCATTGCGAGGCACGAAGCAAT
>NZ_CP043449.1|WP_112652628.1|458489_458648_-|GIY-YIG-nuclease-family-protein MQRGGCVYIITNKNNSVLYTGVTSDIIGRIFDHKNKTYPQSFTAKYNCNKLT >NZ_CP043449.1|WP_112653556.1|457570_458482_+|IS982-family-transposase MLTSDKIIEIFVKVDDFCKECEEQIAKHKLDAGNYKVRDRKASLADSEIITIVIAFHSGHFTNLKHFYITHICSHYKDFFPGLVSYNRFVELQQRVAVPMMLFLKTHCLGRSRGINFIDSTHIKVCHNRRIHNHKVFAATAERGQCSIGWFYGFKLHLIINDKGEILSFYLTKGNVDDRNVKLMTSMTEEIFGKLFGDKGYISKALADLLWGNGIQMITKPRKNMKDFNISQADKIMLRKRAIIECVYDELKNICKLQHTRHRSVNNFLMNIMGSLCAYHFFPKKPSLNIVFEEQDNQLLLAA >NZ_CP043449.1|WP_112652920.1|456098_457070_+|hypothetical-protein MKYQLQPYQGMQSRYSCPVCNHHKCFVRYIDIQTGQHLAPHVGRCGREDKCGYHLTPRNYFATLPGYKPYQPKRSRYMPGKSPAQPAVNPARSPRPEPKYIINPYWVSATLYNYQDNNFVQYLIKRLGRDITQAAIKRYHIGTHNHWPGACVFWQYDTEGDVRTGKIMLYNKETGKRVKVPFNHITWAHTLAIKEAATAGDDTTFILQQCLFGEHLLSANPAMPVAIVESEKTAIIASALIPDFIWLASGSLQGLNPAKCGVLKGRRVMLFPDVNAYDKWKLKARELHTALPNTAFSVSAVLEDIATDEDRQNGIDIGDVVGW >NZ_CP043449.1|WP_112652919.1|454735_455815_+|AAA-family-ATPase MITQNLNNSTATCITAGSLAQQSLQQMDEQQLPQNNDDVLMVRHADHWMAEAHERAIPLMLFGKFWHQGEVCILFADSNLGKSIVAVQVADGVSKGSGKYPFDVEAPAQPVLYCDFELTDKQFEARYSVDYEYHYHFGKNFYRAELNPDMELPHEFADFDDYLIYSLERSVLQTGAKVLVIDNLTYLRSETERAKDALPLMKQLKALKNKYNLSILVLAHTPKRDMAQPITRNDLQGSKMLMNFCDSAFAIGESKTDVNMRYLKQIKQRNTEQLYGEGNVCLCQIGKPYNFLKYEFVSFGKEWEQLSPQNDPEQEQIMANANELKQQGLTLRQIGQKLGISHQKADRLLKAYAKLNANV >NZ_CP043449.1|WP_112652918.1|452800_454051_+|DUF5103-domain-containing-protein MKKLYIILFILLSLNSFAQSPYNNNVYSPAIKSVEFYNTAKQGTFPVINLGTDEKVLLTFDDLRGGSRNYYYTIEHCDANWNSSNLSSAEYLQSFTDDRLYNYSYSTGTMQKYTHYEISLPNNNIAPKISGNYVLKVYEDGDQSKMVLTRRLYVLGKRVSIAADLVASANNATRQTNQKINFTVDYSGLVVQNPAYALRTFIMQNARTETAVLNGQPTYIRGSQLIYNDVSVNDFPGRNEFRLFDTRTLKLNSQRVAKIYKDSTNVVVLLGDPVRDQPNYIFQYDNDGKFYILNNDGTTPATDADYAHVYFTLSTNKDPKEGSPYVVGQFNNYRLDDSNKLHPLDNGRYTVNMLLKQGVYDYEYVWVDAKTGKADDIPFEGSHFETENEYQVLTYYRPPAARWDELVGFRELVTKR >NZ_CP043449.1|WP_091175129.1|450322_452245_+|ABC-F-family-ATP-binding-cassette-domain-containing-protein MIAINNLTFEIGARALYDEANWHIKPGEKIGLIGANGTGKTTLLKIIVGDYKPTSGTVSMAKDLTMGYLNQDLLSYSSDKNIVHVAMEAFERQNQLHDEIENLLKKLETDYSEELLNKLSDKQHEFELLDGYNIEYKAHEILAGLGFSDEDCKRKLSTFSGGWRMRVMLAKILLQAPDILLLDEPTNHLDLPSIQWLEDYLKSFPGAIIIVSHDRWFLDKVINRTVESRKGKLTVYAGNYSFYLEEKALREEIQRGEFKNQQSKIKQEERLIERFRAKASKAKMAQSRIKMLDKMERIDDVDDDNPSVNFAFRFSKQSGRHVITLEDITKKYPAIDILDHAEAVIEKGDKIALIGANGKGKSTLLRIIASADKDYTGTVTTGHNVTTTFFAQHQLESLHLENQILQELQSFAPKHTDTELRTILGSFLFTGDDVFKKIKVLSGGEKSRVALAKALTADANFLVLDEPTNHLDMQSVNILIQALDQYEGTFIVVSHDRYFLDNVANKIWFIEDQKIKIYPGTYAEFDEWYAKRKLEPKAAAPAPQPKKEEKKPEPVKQPQGENKHQQLKKLNQDLAKMEQQIADLEKEVKHFETQLADEKIYSDNGKLKQTNAAYSAKQTELKQMQDKWEALAEQILELES >NZ_CP043449.1|WP_112652917.1|449347_450121_+|SGNH/GDSL-hydrolase-family-protein MKDTKQNYRRHFLKTTAVGTLAAMGIPSIVSSALAAEKPAKKLTFNQGDVVLFQGDSITDWGRDHSKTEPNTTSALGSGYALLTASQLLLKHADKGLKIYNKGISGNKVYQLAERWDIDCLALKPNILSIHIGVNDFWHTLTSGYKGTIDTYIADYRALLTRTKQALPDIKLVICEPFAEKNVKAVDDKWYPTFDLFRKAAKDIAAEFDAVFVPYQSAFDKAEQTAPATYWNLDGVHPSVAGEALMAQTWLKAVGAL >NZ_CP043449.1|WP_112652916.1|448865_449225_+|hypothetical-protein MIKAYRLYTGDDGHSHIQKGMVDLGTLNEALAVRFQESEPHAFYDYHNAPTNQYVITLTGTLEFETYPGEKFILKPGEILIAQDTTGTAHKWRLMDDEPWKRVYVTFDPAKPINFVADK >NZ_CP043449.1|WP_112652915.1|448209_448707_-|tryptophan-rich-sensory-protein MSVAVSTKRFQFFPYLISLLIVLFIGFVASLVTRPEIAGWYSTLKKPSFNPPPWLFAPVWTAIYIMIATAAYLVWKHRSRKPVYIIARSIYFIQLILNFSWSIVFFGMHQIAAAAVVIILLWLSIVVNINWFNKFSRTASWLLVPYLLWVSFASILNMSIYFLNR >NZ_CP043449.1|WP_091175120.1|447609_448143_-|hypothetical-protein MKKLLLIFCLITAAHSFAFADKTAINNFVVKENPFAVDEVAVVATDTAGVIQENVNGVFTFVMNGFTEELKFDKGTAFYRHKLDRSTFLYAKHMNDSGTHAILYYIYKHDSKLSPFHISWVLLIAIPLLLVLLAYMFKRFIIIAVVIFCIFLYFNYHNGLSIPTFFESIIDGLKNMF >NZ_CP043449.1|WP_112652614.1|459090_460182_+|hypothetical-protein MLKVEYAKHEDDQTYYLVVNDIPYYQSSYNDRTYRSAYINEIELGELLASYSSKELSEFFDSLNMGDYDFDAWPLGVDISFSFKKTYKSSDYPNFNVELNVDTEDWASGWSIKSFSEALKIIIKDRDNKNVRYFQLDDDFVSNGLGIAVAINDLDTPIGTLIDNAFPEFESIINDANLYLASVVDNQSVISFFNFPDSIKGPCQQYLMYFAQFLKDLGIEAETEIKEQAHSTLFKITPNNKDEALDKIKDALEIYTNAPALNDLQFQGMNNGDIAFMQLQANVMHLKSQIMLNNAALQMKDATIEALQLSNYQLKAIVVESNEKLKQEEEIIPGIMSIKKYDGEWFSLNLPEMLNRLKRRFIK >NZ_CP043449.1|WP_112652615.1|460985_461981_-|type-I-glyceraldehyde-3-phosphate-dehydrogenase MKIGINGFGRIGRLAFRAAIERPDIEVVGINDLVEPDYMAYMLKYDSTHGQFNGTIAVEGGHLVVNGKTIRVTAEKDPANLKWNEVGAEVVIESTGLFLTQETAQKHIDAGAKKVVMSAPAKDDTPTFVMGVNHKALKADQNIVSNASCTTNCLAPIAKVLDDKFGIEEGLMTTVHAVTATQKTVDGPSAKDWRGGRGAYQNIIPSSTGAAKAVGLVLPQLKGKLTGMSLRVPVADVSVVDLTVRLKNGASYEAIKAAMKEASEGELKGILGYTEDEVVSEDFKGDSRTSIFDAKAGIGLNENFVKVVSWYDNEWGYSNKLIDLVQELGKL >NZ_CP043449.1|WP_112652616.1|461987_462836_-|hypothetical-protein MIAVVYSGSYFAHWRLTDKGRTVASFKTNGINPYFNDEKHILQLLNKNINLIHHAEVIRRIYFFGAGASSDERKKIVHSAFSTFFKFGKISIEHDIAGAAIACCKNEPGIVSICGSGSNAAWYDGKRVWPNNYGLGYILADEGSGNWLGRQLIKEFMNDTLPLSIRKKFIHKYDADRKNLLEKVYRQKQPALFLSSFTDFYLDNKNDHHLQNVIKKGFSKLISTYLLPLYQQHPGTSVHFAGSVAFNFQEHLYEAAAEADLQITNIIKEPINNLLTYYSSKN >NZ_CP043449.1|WP_091175145.1|462843_463830_-|6-phosphofructokinase MRKISKIGVLTSGGDAPGMNPCIRAVVRTALYNGLEVVGIRQGYKGLIENDMYEMDKRSVSNILNLGGTILKTARCLPFKTDEGMEIAYQNAKARGIDALVVIGGDGTFTGALRFSRKYPDIAVMGVPGTIDNDLCGSTYTLGFDTATNTVIQAIDKIRDTADAHDRLFFIEVMGRDSGAIALRAGISCGAEAILLPERATAIDDLIVNLKEGHMNKKSSSIVIVAEGDKNGGVYDVAKAVQQEVKNYDIKVTILGHLQRGGAPSSFDRILGSRLGFAAVNALVAGESQKMVGLQANQIMMTDLEAALNHHEFKLEEDLLQMMDILSI >NZ_CP043449.1|WP_090527735.1|463968_464652_+|NUDIX-hydrolase MLPKFDSVFSIDCVIFGFEAGELKILLIERNEEPYKDWLALPGYIVEQDESIDDAAERILYELTGLRDLHMQQFHTFGEVNRHPQGRVITVAYYALIRINGQKELRPVTQYAKKAFWHPVSELPKLAFDHSEIFKTGFNKIKRRLHYQPIAFELLPEKFTLTQLQSLYEAVLDKKLDKRNFRKKMLSYGFLKELDEKQKGVSYRAAKLYKFDKRKYGKIFQGEMNLV >NZ_CP043449.1|WP_112652617.1|465157_466018_-|N-acetylglucosamine-kinase MIIIADGGSTKTNWCLVTEEGKKVYFNTEGYNPYFSSTEYIIQSLNESLPTDLEKNLITEVNYYGAGCSTPEMRKIVEEAMKVVFVGAKVNIGHDLLAAARALLGNTEGFAAILGTGTNTCIYDGKEVVHNIDSGAYILGDEGSGCYIGKKLLTDYLRGYMPEPVRALFWETYKLTPDDINEQVYTQPRANRFCASFSKFVYDNNVHIEYSRNLVRTSFEDFFRNLVTHYPDYQKYTFNCIGSVGYNFRNVLEEVVTENGMVVGNIIRSPIDNLVKYHLELAPSSL >NZ_CP043449.1|WP_112652618.1|466347_467277_-|NGG1p-interacting-factor-NIF3 MNSNLSPINHNPDRRKFITQLSALAGTAALLSTPFAVDAITFTNPDEHITVGQIIDLFMKQVPGAPFPNTVDTLKSGNRDIVVTGIITTMFATIGIIEKAISLGANFIIAHEPTFYNHADETAWLASDDIFQYKKQLLDKHNIVIWRNHDTIHSLKPDGVGIGLLKQLDWVSYYKPETGNLLTIPSTSLSSLIETLKKKLKIEKVRYIGDPSQSCQKVLLLPGAAGGKRQITEMSTKKPDVLICGEISEWETAEYVRDAQAKGDKLSLIVLGHIASEEPGSEFMTGWLKQNVPGIKATHIHPGNSLAFM >NZ_CP043449.1|WP_112652619.1|467384_468539_+|DUF2029-domain-containing-protein MQKLAKLITNKPFVYSLWFGLSLFLVIKGVLTHQGFNNYTIFKYNFLNTIHQHNLYAYQPEHYYDLNHYGPVFSIIMAPFAILPDSIGVILWVLFNAFILFKAIQLLPLKKDQYVIVLLLCAHELMTASANVQSNPMIAALIILGFNFIKREQDFWAALMIALGAFIKLYGIVGLAFFFFSTNKPKFVLSFIFWSAVLFVLPMAISSPSFIIQTYHDWYTDLVIKNSDNQQSYMQEICVTGFIRRAFHYQDLKNMYVIGPALVLFGLSYLRVKAYKVLEYQRLILSSVLIFAVIFSSSAESSTYIIAFVGVAVWFMNLNRPVTGFEIFLLVLALLITSLSPSDLFPSFIRTQYIVPYKLKSIPCFLIWIKIIYETLTRDFVNEK >NZ_CP043449.1|WP_112652620.1|468841_469810_+|glycosyltransferase-family-2-protein MTYLADKKISIVIPSHNEEKNISYLIEQLRETLSPTGYAYELIFVDDGSRDNTLNELKINAELHPNVFYVELSRNFGKDYALKAGIAMAQGDAVITMDADLQHPPQLILKMLNLWENGYDIVYTYREGENPHGKGYQKVTSKLFYKGLNMLSDIKMENGTADFRLIDEKVVKQLKLIDEYEIFFRGIIKWAGYKQVGIPYVPSKRHTGEASYSFSKLVKLAVGSIVAFSARPLYIVSIIGLLVSSLAILYIPYVLVSYFLGYAVSGWASIIATIAFFGGLQLLVMSVIGVYVGKIFMQSKHRPHYIIRSSNVVIVDNDFIRV >NZ_CP043449.1|WP_112652621.1|469790_470597_+|polysaccharide-deacetylase-family-protein MILLGFDVEEFDMPFEYGKSIPFDEQLEISTRGTNAILKLLEQKNIKVTFFCTANYAINRPDVIKQMVTEGHEVASHGYYHSDFKVEHLMQSKLALENISGTEVTGFRMARMMPVDEAEIAKAGYEYNSSINPTWLPGRYNNFDKPRTWFYDHDVLQIPASVSPVIRFPLFWLSFHNLPLSLLKRMASATLKKDGYLNLYFHPWEFTNLHDKEKFGFPGYVSRNSGEAFARRIADFIDWASDKGYIFRRTDGFCEIIKNKIKQEAVLH |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP043449_3 | 464867-464954 | Orphan |
NA
Consensus repeat of NZ_CP043449_3
|
1 spacers
spacers of NZ_CP043449_3
>3.1|464892|38|NZ_CP043449|CRISPRCasFinder GGGAGGGGCTATGCAAGTAGCCGCCCTGTAAAGTTCGC |
CRISPR arrays and Neighbor proteins around NZ_CP043449_3
The CRISPR arrays of NZ_CP043449_3 >merge|NZ_CP043449|3|464867-464954|CRISPRCasFinder GATTGCTTCGTGCCTCGCAATGACGGGGAGGGGCTATGCAAGTAGCCGCCCTGTAAAGTTCGCGATTGCTTCGTACCTCGCAATGACG >NZ_CP043449|3|3|464867-464954|CRISPRCasFinder GATTGCTTCGTGCCTCGCAATGACG GGGAGGGGCTATGCAAGTAGCCGCCCTGTAAAGTTCGC GATTGCTTCGTACCTCGCAATGACG
>NZ_CP043449.1|WP_090527735.1|463968_464652_+|NUDIX-hydrolase MLPKFDSVFSIDCVIFGFEAGELKILLIERNEEPYKDWLALPGYIVEQDESIDDAAERILYELTGLRDLHMQQFHTFGEVNRHPQGRVITVAYYALIRINGQKELRPVTQYAKKAFWHPVSELPKLAFDHSEIFKTGFNKIKRRLHYQPIAFELLPEKFTLTQLQSLYEAVLDKKLDKRNFRKKMLSYGFLKELDEKQKGVSYRAAKLYKFDKRKYGKIFQGEMNLV >NZ_CP043449.1|WP_091175145.1|462843_463830_-|6-phosphofructokinase MRKISKIGVLTSGGDAPGMNPCIRAVVRTALYNGLEVVGIRQGYKGLIENDMYEMDKRSVSNILNLGGTILKTARCLPFKTDEGMEIAYQNAKARGIDALVVIGGDGTFTGALRFSRKYPDIAVMGVPGTIDNDLCGSTYTLGFDTATNTVIQAIDKIRDTADAHDRLFFIEVMGRDSGAIALRAGISCGAEAILLPERATAIDDLIVNLKEGHMNKKSSSIVIVAEGDKNGGVYDVAKAVQQEVKNYDIKVTILGHLQRGGAPSSFDRILGSRLGFAAVNALVAGESQKMVGLQANQIMMTDLEAALNHHEFKLEEDLLQMMDILSI >NZ_CP043449.1|WP_112652616.1|461987_462836_-|hypothetical-protein MIAVVYSGSYFAHWRLTDKGRTVASFKTNGINPYFNDEKHILQLLNKNINLIHHAEVIRRIYFFGAGASSDERKKIVHSAFSTFFKFGKISIEHDIAGAAIACCKNEPGIVSICGSGSNAAWYDGKRVWPNNYGLGYILADEGSGNWLGRQLIKEFMNDTLPLSIRKKFIHKYDADRKNLLEKVYRQKQPALFLSSFTDFYLDNKNDHHLQNVIKKGFSKLISTYLLPLYQQHPGTSVHFAGSVAFNFQEHLYEAAAEADLQITNIIKEPINNLLTYYSSKN >NZ_CP043449.1|WP_112652615.1|460985_461981_-|type-I-glyceraldehyde-3-phosphate-dehydrogenase MKIGINGFGRIGRLAFRAAIERPDIEVVGINDLVEPDYMAYMLKYDSTHGQFNGTIAVEGGHLVVNGKTIRVTAEKDPANLKWNEVGAEVVIESTGLFLTQETAQKHIDAGAKKVVMSAPAKDDTPTFVMGVNHKALKADQNIVSNASCTTNCLAPIAKVLDDKFGIEEGLMTTVHAVTATQKTVDGPSAKDWRGGRGAYQNIIPSSTGAAKAVGLVLPQLKGKLTGMSLRVPVADVSVVDLTVRLKNGASYEAIKAAMKEASEGELKGILGYTEDEVVSEDFKGDSRTSIFDAKAGIGLNENFVKVVSWYDNEWGYSNKLIDLVQELGKL >NZ_CP043449.1|WP_112652614.1|459090_460182_+|hypothetical-protein MLKVEYAKHEDDQTYYLVVNDIPYYQSSYNDRTYRSAYINEIELGELLASYSSKELSEFFDSLNMGDYDFDAWPLGVDISFSFKKTYKSSDYPNFNVELNVDTEDWASGWSIKSFSEALKIIIKDRDNKNVRYFQLDDDFVSNGLGIAVAINDLDTPIGTLIDNAFPEFESIINDANLYLASVVDNQSVISFFNFPDSIKGPCQQYLMYFAQFLKDLGIEAETEIKEQAHSTLFKITPNNKDEALDKIKDALEIYTNAPALNDLQFQGMNNGDIAFMQLQANVMHLKSQIMLNNAALQMKDATIEALQLSNYQLKAIVVESNEKLKQEEEIIPGIMSIKKYDGEWFSLNLPEMLNRLKRRFIK >NZ_CP043449.1|WP_112652628.1|458489_458648_-|GIY-YIG-nuclease-family-protein MQRGGCVYIITNKNNSVLYTGVTSDIIGRIFDHKNKTYPQSFTAKYNCNKLT >NZ_CP043449.1|WP_112653556.1|457570_458482_+|IS982-family-transposase MLTSDKIIEIFVKVDDFCKECEEQIAKHKLDAGNYKVRDRKASLADSEIITIVIAFHSGHFTNLKHFYITHICSHYKDFFPGLVSYNRFVELQQRVAVPMMLFLKTHCLGRSRGINFIDSTHIKVCHNRRIHNHKVFAATAERGQCSIGWFYGFKLHLIINDKGEILSFYLTKGNVDDRNVKLMTSMTEEIFGKLFGDKGYISKALADLLWGNGIQMITKPRKNMKDFNISQADKIMLRKRAIIECVYDELKNICKLQHTRHRSVNNFLMNIMGSLCAYHFFPKKPSLNIVFEEQDNQLLLAA >NZ_CP043449.1|WP_112652920.1|456098_457070_+|hypothetical-protein MKYQLQPYQGMQSRYSCPVCNHHKCFVRYIDIQTGQHLAPHVGRCGREDKCGYHLTPRNYFATLPGYKPYQPKRSRYMPGKSPAQPAVNPARSPRPEPKYIINPYWVSATLYNYQDNNFVQYLIKRLGRDITQAAIKRYHIGTHNHWPGACVFWQYDTEGDVRTGKIMLYNKETGKRVKVPFNHITWAHTLAIKEAATAGDDTTFILQQCLFGEHLLSANPAMPVAIVESEKTAIIASALIPDFIWLASGSLQGLNPAKCGVLKGRRVMLFPDVNAYDKWKLKARELHTALPNTAFSVSAVLEDIATDEDRQNGIDIGDVVGW >NZ_CP043449.1|WP_112652919.1|454735_455815_+|AAA-family-ATPase MITQNLNNSTATCITAGSLAQQSLQQMDEQQLPQNNDDVLMVRHADHWMAEAHERAIPLMLFGKFWHQGEVCILFADSNLGKSIVAVQVADGVSKGSGKYPFDVEAPAQPVLYCDFELTDKQFEARYSVDYEYHYHFGKNFYRAELNPDMELPHEFADFDDYLIYSLERSVLQTGAKVLVIDNLTYLRSETERAKDALPLMKQLKALKNKYNLSILVLAHTPKRDMAQPITRNDLQGSKMLMNFCDSAFAIGESKTDVNMRYLKQIKQRNTEQLYGEGNVCLCQIGKPYNFLKYEFVSFGKEWEQLSPQNDPEQEQIMANANELKQQGLTLRQIGQKLGISHQKADRLLKAYAKLNANV >NZ_CP043449.1|WP_112652918.1|452800_454051_+|DUF5103-domain-containing-protein MKKLYIILFILLSLNSFAQSPYNNNVYSPAIKSVEFYNTAKQGTFPVINLGTDEKVLLTFDDLRGGSRNYYYTIEHCDANWNSSNLSSAEYLQSFTDDRLYNYSYSTGTMQKYTHYEISLPNNNIAPKISGNYVLKVYEDGDQSKMVLTRRLYVLGKRVSIAADLVASANNATRQTNQKINFTVDYSGLVVQNPAYALRTFIMQNARTETAVLNGQPTYIRGSQLIYNDVSVNDFPGRNEFRLFDTRTLKLNSQRVAKIYKDSTNVVVLLGDPVRDQPNYIFQYDNDGKFYILNNDGTTPATDADYAHVYFTLSTNKDPKEGSPYVVGQFNNYRLDDSNKLHPLDNGRYTVNMLLKQGVYDYEYVWVDAKTGKADDIPFEGSHFETENEYQVLTYYRPPAARWDELVGFRELVTKR >NZ_CP043449.1|WP_112652617.1|465157_466018_-|N-acetylglucosamine-kinase MIIIADGGSTKTNWCLVTEEGKKVYFNTEGYNPYFSSTEYIIQSLNESLPTDLEKNLITEVNYYGAGCSTPEMRKIVEEAMKVVFVGAKVNIGHDLLAAARALLGNTEGFAAILGTGTNTCIYDGKEVVHNIDSGAYILGDEGSGCYIGKKLLTDYLRGYMPEPVRALFWETYKLTPDDINEQVYTQPRANRFCASFSKFVYDNNVHIEYSRNLVRTSFEDFFRNLVTHYPDYQKYTFNCIGSVGYNFRNVLEEVVTENGMVVGNIIRSPIDNLVKYHLELAPSSL >NZ_CP043449.1|WP_112652618.1|466347_467277_-|NGG1p-interacting-factor-NIF3 MNSNLSPINHNPDRRKFITQLSALAGTAALLSTPFAVDAITFTNPDEHITVGQIIDLFMKQVPGAPFPNTVDTLKSGNRDIVVTGIITTMFATIGIIEKAISLGANFIIAHEPTFYNHADETAWLASDDIFQYKKQLLDKHNIVIWRNHDTIHSLKPDGVGIGLLKQLDWVSYYKPETGNLLTIPSTSLSSLIETLKKKLKIEKVRYIGDPSQSCQKVLLLPGAAGGKRQITEMSTKKPDVLICGEISEWETAEYVRDAQAKGDKLSLIVLGHIASEEPGSEFMTGWLKQNVPGIKATHIHPGNSLAFM >NZ_CP043449.1|WP_112652619.1|467384_468539_+|DUF2029-domain-containing-protein MQKLAKLITNKPFVYSLWFGLSLFLVIKGVLTHQGFNNYTIFKYNFLNTIHQHNLYAYQPEHYYDLNHYGPVFSIIMAPFAILPDSIGVILWVLFNAFILFKAIQLLPLKKDQYVIVLLLCAHELMTASANVQSNPMIAALIILGFNFIKREQDFWAALMIALGAFIKLYGIVGLAFFFFSTNKPKFVLSFIFWSAVLFVLPMAISSPSFIIQTYHDWYTDLVIKNSDNQQSYMQEICVTGFIRRAFHYQDLKNMYVIGPALVLFGLSYLRVKAYKVLEYQRLILSSVLIFAVIFSSSAESSTYIIAFVGVAVWFMNLNRPVTGFEIFLLVLALLITSLSPSDLFPSFIRTQYIVPYKLKSIPCFLIWIKIIYETLTRDFVNEK >NZ_CP043449.1|WP_112652620.1|468841_469810_+|glycosyltransferase-family-2-protein MTYLADKKISIVIPSHNEEKNISYLIEQLRETLSPTGYAYELIFVDDGSRDNTLNELKINAELHPNVFYVELSRNFGKDYALKAGIAMAQGDAVITMDADLQHPPQLILKMLNLWENGYDIVYTYREGENPHGKGYQKVTSKLFYKGLNMLSDIKMENGTADFRLIDEKVVKQLKLIDEYEIFFRGIIKWAGYKQVGIPYVPSKRHTGEASYSFSKLVKLAVGSIVAFSARPLYIVSIIGLLVSSLAILYIPYVLVSYFLGYAVSGWASIIATIAFFGGLQLLVMSVIGVYVGKIFMQSKHRPHYIIRSSNVVIVDNDFIRV >NZ_CP043449.1|WP_112652621.1|469790_470597_+|polysaccharide-deacetylase-family-protein MILLGFDVEEFDMPFEYGKSIPFDEQLEISTRGTNAILKLLEQKNIKVTFFCTANYAINRPDVIKQMVTEGHEVASHGYYHSDFKVEHLMQSKLALENISGTEVTGFRMARMMPVDEAEIAKAGYEYNSSINPTWLPGRYNNFDKPRTWFYDHDVLQIPASVSPVIRFPLFWLSFHNLPLSLLKRMASATLKKDGYLNLYFHPWEFTNLHDKEKFGFPGYVSRNSGEAFARRIADFIDWASDKGYIFRRTDGFCEIIKNKIKQEAVLH >NZ_CP043449.1|WP_112652622.1|470917_471391_+|hypothetical-protein MKPYFKVITTLFLISVVKVSFAQTPSFGNYKTKIFIGRAAKLKIKGNALAERYKTAISNSYNDDPYIRKFHGKGGLNFAGHYCFAYWGCGSDCQQSAIVDLQTGKVYDGPTAARQFEYRRWSRLLIVNRPGDKSDCAVCQPEYWILNEQTKHFVKIK >NZ_CP043449.1|WP_112652623.1|471794_472580_-|helix-turn-helix-transcriptional-regulator MITSSLQSCHLGPGMSPEQVISDHFFLYLLKGSMLAYSGDKHYHFHPGDSCIARKNHLLRYTKQQQDGDFKKIVIVLDEAFLKRFLARHRVDPSIATSDYSIRPVKDDQLLTSFIHSLEPYYRGEAEIEEAFADLKREELLLILLKNDPGLAAVLFNFGAPQKIDIEAFMTRNFRFNVPLERFAFLTGRSLSAFKRDFQQIYNDTPGRWLTKKRLEEAYFLIHQQSQKPKDIYLELGFENLSHFSFAFKKQFGLAPTAVLY >NZ_CP043449.1|WP_112652624.1|472592_473561_-|aldehyde-reductase MENNIKQPTVLVTGGTGFIGIYCILQLLQQGYTVKTTLRSLKRKDEVINLLKGAGIKSFENLAFVEADLTNDLNWNWAADGCTYVLHVASPFPAEEPEDANDLIIPARDGALRVLKAARHAGVKRVVLTSSFAAIGYSKDPKGYTFTEEDWTDPELTDRAYIKSKTIAEKAAWDWINSEGDGMELTVINPVGVFGPALGKDFSTSIGFVKGVLDGQIKETLPFTFGVVDVRDVAYIHLKAMTHPAAAGERFLATATGVMSLYDVAELIRKERPEYAGNIANLKPLDESFYIAISNEKAQRVLNWYPRSKEEVILASVDSLLN >NZ_CP043449.1|WP_090534923.1|474042_474630_+|sigma-70-family-RNA-polymerase-sigma-factor MKPIELNHTDDVLLLQQIEQGSQHAFNLLYEKHWGNAYSEAYKRLKDSDQAKDIVQEVFTHIWLKKESLRIHNLPAYLTVAIRNKVFKLVEKQKIIHPFFDVIDDLPASCQQADDNLLWKEFLISYEALLNTLPPKRQIIFRLHYQNDLPTKEIAAQLGLTRKTVQNQLLKAIEKLKVSLLPFLSLLIILFGAIK >NZ_CP043449.1|WP_112652625.1|474720_475713_+|FecR-family-protein MDKQYFLELLHKYLNNEATDEEQQFLVKYYELFSAEPDIISLLSDEQKNEIKEEINASIWENIDKHTGVDKKIIRLKTWVNKIAAAAAIIGVCGIGYLFLHNKSVVKPPQSYSAHRPKPNLFLVLPDGSRVILSYGSKLSYASSFDGLTKREVYLTGEAFFDIKHNNLKPFVVHTGKIKTTVLGTAFDVKAVPGDKTITVTVTRGKVKVSNNNKLLGIIVPNQQITFDRQKSISMQTNINAKNYTIWTAKDDLYFEDVTFGEAAKVLEDRFKVKILFTDQLVRSKHFTSTFNKSANLDQALKSICEFNDATYSYDKAKTTITITTKSQTN |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP043449_4 | 1032455-1032584 | Orphan |
NA
Consensus repeat of NZ_CP043449_4
|
1 spacers
spacers of NZ_CP043449_4
>4.1|1032494|52|NZ_CP043449|CRISPRCasFinder GCGGTGGCACGGGGACTTTCATTTTTAACTGTCACTGGCGCGAGTATGCAGT |
CRISPR arrays and Neighbor proteins around NZ_CP043449_4
The CRISPR arrays of NZ_CP043449_4 >merge|NZ_CP043449|4|1032455-1032584|CRISPRCasFinder GCAGGAATAGTGGGTGGAGTACTCGTGACTTCATCTTCAGCGGTGGCACGGGGACTTTCATTTTTAACTGTCACTGGCGCGAGTATGCAGTGCAGGAATAGTGGGTGGAGTACTCGTGACTTCATCTTCA >NZ_CP043449|4|4|1032455-1032584|CRISPRCasFinder GCAGGAATAGTGGGTGGAGTACTCGTGACTTCATCTTCA GCGGTGGCACGGGGACTTTCATTTTTAACTGTCACTGGCGCGAGTATGCAGT GCAGGAATAGTGGGTGGAGTACTCGTGACTTCATCTTCA
>NZ_CP043449.1|WP_112654135.1|1031386_1032280_+|flavin-reductase-family-protein MKLRTIDASSLTPAEMQAYLHYAIAPRPICFVSTIDKNGGVNLSPFSFFNVFSINPPICVFSPTSRARDNTTKHTLENVLEVPECVINIVNYDMVQQTYLTSMDYKKGVNEFAKAGFTELASDTVKPPRVAESTVQLECAVNDVISLGKNGGAGNLVIAEVKRIHISEAILDANDKIDPHKIDLVARLGGDWYCRVTGDNLFKISKPTGSAGIGIGVDAFPIDVKYSKVLTGNDLGVLGLVETLPSYEEVSAFSKTDEMKELVDAATSDTRTLHLHLKAKQLLDNGRVMDAWKVLLM >NZ_CP043449.1|WP_112654136.1|1029756_1031301_+|carboxylesterase-family-protein MRRLIFMLASVLTVVAANAQPAPVKVNEGLLQGTFENSLTVYKGIPFAAPPVGKLRWCPPQPAAKWDGVRLADKFAPEPMQGGNPVSGKSEDCLYLNVWTPAKSPDSKIPVLVWIYGGAFNAGGTAEPAYNGANLAKKNVVLVSIAYRVGQLGFLAHPELSAESPNHVSGNYGLLDMIAALKWIKQNIAAFGGDPGKVTIFGESAGGIAVSMLCASPLAKGLFRGAISESGGSFGPTRTTTFPGENMKKLHDAEAAGEAYAKGAGYPSIDSLRKVDADKLPAVRGLAWPIVDGWVIPDDQYKLYEAGKYNDIPILVGYNSDEGASFSPPKTTDDYIAAVKNRYGKFADELIKAYPPGTGTVAKTARDLTRDAAFGWHTWSWATLQAKRGKSKVFYYYFDRHPEYPAGSPQAGYGSPHAQEVAYVFGLLNASGAQAKADLDISDAMSTYWTNFAKYGDPNGAGKPQWPAYSPARPVVMYFSQTAHTGPVPDLASLKVLDAYFRWRRSPDGEADVK >NZ_CP043449.1|WP_112654138.1|1026195_1027335_+|XdhC-family-protein MKEIIDIVAAYDEAHAQGKKTALATVVLVEGSAYRRAGARMLITEDGQLTGAISGGCLEGDALRKARLVILQQQPLLVTYDTTDDDDAKLGVGLGCNGIIHILIEPITSDINNPISLLKNIVSNGQHAVLATVFSVKDRKATQPGTCLCLTEDKLTVSHSGQLPYQVALVSDAERVLEEQRSEIRFYQADTEYTAFVEHIKPMISLVIVGAGNDAIPLTRIAAVLGWNITVIDGRPNYAVKQRFPFAQNIVTAKPADVLSHIKTNDRTAIVMMTHNYNYEVALLKELLPTSLPYIGILGPKKKLDRMLAEVEHAGISISEAQMNNIYGPVGLDIGAEGAEEIALSIMSEIKAVLSARQGYSLKYKPAPIHISNLQYLVK >NZ_CP043449.1|WP_112654139.1|1024048_1026181_+|xanthine-dehydrogenase-family-protein-molybdopterin-binding-subunit MKKDAIGDSLSRVDGRLKVTGGAKYSGEYKVPNLTYGVLVSATIASGTVTALDTRAAERAPGVLAVITPFNAPKVPGYQAGAERPVRGLKLFNDNKIYFNAQPIALVVADTFERATYAASLVKATYNTEPFETDFHKNIDKGVTPQKGNYKDYVRGEANAYKNAPVVVEEEYQLPTEVHNPMELHVTTAFWDGDDKVTLYTKSQGVKGSQRSIAAAFGLNPDNVQINSRFVGGAFGSSLRTWPHEIAAAQAAKLVKRPVKLTLTREQMFTQVGYRPLTIQKIGLGATADGKLIGITHESHSQTAVYEEFTEGAVNVSQFLYNSPNVNTLYKVVPLNVGVPAPMRGPGEATGSFALESALDELSYKLNLDPIELRLRNYTDTDPERNKPWSSKYLKECYQKGAEAIGWADRKAQPGANKEGEWLVGYGIGCGAFGAYRGNAVAKIKLTADGSVNIQSATSDIGPGTGTSMVLIAADTLGIPADKITFELGNSAFPNAPTQGGSATVSSVGSAVYDVCVALKQKLYTMAGKPADSMEPIDYVAVLKQNNMPSVELTQESKGNPEAQKYSMYSFSAHFAKVHVHPLTGQVKIKKIVACVDAGKIVNHKTASSQMIGGAVGGVGMAMTEEAVFDDRYGRYINGNFADYHVPVNADIQQIEAIFIDKPDPVLNPVGTKGIGEISLIGVAPALANAIYNATGKRVRELPITPDKLI >NZ_CP043449.1|WP_112654140.1|1023015_1024008_+|xanthine-dehydrogenase-family-protein-subunit-M MNQFQYTRPAETAVAIKSLAKEPNGYFLAGGTNLVDMMKMGLVVPDKLIDINRLPLKKIERTPTGIHIGALASNSEVAEHAYIKAQYPLLALAINAGASPQLRNMATVGGNLMQRTRCPYFFDTAMPCNKRTPGSGCGALQGINRMHALFGASDKCIAVNPSDMNVALAALDATIHVTGVKGPRAINIGDFHRLPGNHPELDNTLQKGELITSVDLPAASSAYNKHVYYLKIRDRTSYAFALVSVAAALHIENNTITGARLAMGGVAHKPWRLTAAEIFLKGKAITEDNFKQAAQIAMQGAKAYEYNKFKLKLAPNAIVQSLKLATGLAS >NZ_CP043449.1|WP_112654141.1|1022383_1023019_+|(2Fe-2S)-binding-protein MSTTKPCTPTDGEDNSNGTRRDFLKQTSLLTAIALTPGTVVKAAENQWDEKLAGVFEKQALHLEVNGVKHELMVEPRVTLLEVLREHLDLTGTKKGCDRGQCGACTVHVNGVRVNSCLSLALTNDGKKIDTIEGLAKEEELHPMQEAFIKHDGFQCGYCTSGQIMSAVALLKEGHAGSETEIREFMSGNICRCGAYPNIVKAIQEVKGGMV >NZ_CP043449.1|WP_112654142.1|1021851_1022223_-|hypothetical-protein MKMIPLFKCRDLRQAVGFYTNVLDFRLKYPEATADDGVIDLVSEFGELQLTIYESDRLFGSVVNVWIDDVDSEFKRYISRGLDTSVKKESPVHQGPTDQTWGAREFYVTDTDGNTLRFCQRQR >NZ_CP043449.1|WP_112654143.1|1020086_1021847_+|family-78-glycoside-hydrolase-catalytic-domain MKKIIGILILTIIYFICKAQKLPPVFDAKRSAEAQSTETVRKYLSPIRILWKSPDAATNIINAEKLLKQGDGQADLSGNELCILQSNEKGKPGLLLDFGKELHGGLQLVTDQSRGGKPVRVRIRFGESASEAMSDIDTIKGATNDHAMRDMIISLPWLGKLEIGNTGFRFVRIDLVDDNSQLKLKEARAIFVYRDIPYLGSFKCSDTLLNKIWLTGAYTVHLNMQDYLWDGIKRDRLVWVGDMHPETSTIAAVFGDNPVVSKSLDLARDITPLPGYMNGMVSYSMWWILIQRDWYMHTGNLKYLQQQKAYLVKLLNQYAVQVDANGSEKLDGAGRFLDWPSSENKPAIHAGLQAMLLMTLNAGAELCKILNDQATAKKCEAAIAKLKNNVPDASGSKQAAALLCLSGLLPAEKANDILSKDGVHNYSTFFGYYMLLTKAKAGDYQGGIDAIRNFWGPMLNLGATTFWEDFNIDWLPNASRIDELVPDGKKDIHGDYGAYCYKGFRHSLSHGWASGPTPWLTEYVLGVKIMAPGCKVIKIEPHLGDLSFAEGTYPTPYGIVKIKHVKQADGKVKTIINAPAGVKVVQ >NZ_CP043449.1|WP_112654144.1|1019094_1019568_+|DNA-starvation/stationary-phase-protection-protein MKTNIGINEADRQAVSDQLAKLLADEFVLYTKTRNAHWNIEGPDFHSMHVFFEQQYNELDEIMDSVAERIRKIGHYAPATLTQLLQLTHLTEKLDHKNDSAGFLKELLEDHESIIEFIRGNINPFANQFNDAGTSDFITGLMETHEGMAWMLRSHFR >NZ_CP043449.1|WP_112654195.1|1018296_1018746_+|GNAT-family-N-acetyltransferase MNYTQICKAFNDLTVTELYQLLKLRSEVFVVEQNCVFLDTDDKDYACHHLLLFDNDQELVAYARIVPAGKSYAEASIGRIVSSKKVRGTGVGKIITQAAIDQTKKIYGDVPIRIGAQYYAVKFYEQSGFKIDGKIYDEDGIDHIEMILS >NZ_CP043449.1|WP_112654133.1|1032651_1033221_+|transposase MSTKYKFRKQEQLYFISFSVINWIDLFIRTEYKQIMLESWKYCQQNKGLEIYAWCIMTSHIHMIIGSEEEKLENIMRDMKKHTSLALKAAIKQHPSESRREWMLWMMERAGKKNSQNIDFQLWQQDNHPIELYDNRILNQKLDYIHNNPVIAGFVEKPEDYLNYLYSSARDYSGMPGLVDVILVSPVVL >NZ_CP043449.1|WP_167516081.1|1033254_1033428_-|hypothetical-protein MTLICVNTVFAQKTSLHSVKIEWESFSTESFRDVSCDDFEYSFLDTPPTGASMQRWQ >NZ_CP043449.1|WP_112654132.1|1033498_1033975_-|hypothetical-protein MLIHKEVKDRELYVYMNGKLIYKRWLDTGASKVFDVMAYDKYTLASIREIKQEEHQLISVKALIKLKATKDGGRRTGILSGYRPNHVFEYDKDGNRFETYIGDIRWDDGFTIEPGEEKAVTVRFFLGWKIERYLNIGRKWWIHEGPRCVGEAELIEFM >NZ_CP043449.1|WP_112654131.1|1034406_1035396_+|ParA-family-protein MQSIVVFNNKGGVGKTTLMCNIAAYLKIKKRKKVLIVDADPQCNATAYMFPYPQIEDIYSKSESTIFEIVKPLQRGKGYISNKLPILKSPYFEVDVIPGDTQLSLSEDFLSKDWLDGKAGDFRGLQTTLLFKDLLLRLDKYDYVFFDVGPSLGALNRSVLAASDFFIVPMSSDIFSLQALENISKSLKDWEKQLSRGLSDFKTREQEPFQIDGQTISWHLQFGGYVTQQYTAKTVNGKKQPVNAYERIIKKIPSTIQKHLLTLNKISITYPQIGEITNLHSLVPLSQNSSVPIFNLKSEHGVVGAHFNKVREYEATLSEMVEKLITNLN >NZ_CP043449.1|WP_112654130.1|1035401_1036265_+|SIR2-family-protein MINWPEELIDDIARRRCVIVLGAGVSKNSTNAAGARPKDWKEFLISASEDINGKTEIRKQIGSGDFLTACELIKKELGRDDFNSLMRREFLTPQFQPADIHKFIYNLDSRFVITPNFDKIYDTYANTTSHGSIIVKKFTENDIADCIRRPEHLIIKIHGSVESPDNLIFTRKDYSESRTKYRDFYHLIDALSITHTFVFVGCGTNDPDIRLILEDYSFKFPQNKKHYIIMPKGAMNSKVREIISETMSLKALLYDSSDYHRILTSSIADLVSKVEIRRSDLCLTMDW >NZ_CP043449.1|WP_112654129.1|1036426_1037329_-|ribose-phosphate-pyrophosphokinase MKKLLFAITDYEYLAEKVLALGHCERGEIEVSHFTDGERYQRILSNVEGRDVLLIGGTVNDSATLELYDLASSLVSYGADSLTLVIPYFGYSTMERAVKAGEIVTAKTRARLLSAIPKSNRGNKVMLFDLHSEGIQYYFEQDLYPVHVYCKDIVIEAATRYGGDNFVMASTDAGRAKWVESLANDMGVNAAFILKRRLKGDHTEVSAINADVAGKTVIIYDDMIRSGGSIVNAAMTYKNAGAGDIYVITTHGLFVNDGIGKLKACGAIKKLICTDTHVNCKDLEGDDFVEVRTVAGLICG >NZ_CP043449.1|WP_112654128.1|1037395_1038814_-|nicotinate-phosphoribosyltransferase MKKENLILLADAYKYAHHKFYYPGTTHIYSYLESRGGMFNETVFFGLQYFLKEYLQGPAFNQVDLDEADEFLKQVFGRDDVFDCSKFQYILDKYNGHLPVRIKAVAEGSSVPIGNVLMTIENTDPECYWLTNFLETLLMQVWYPCTVATLSHEVKKTVTQYYEETATPEAFGGIGFVLNDFGFRGVSSVESAKIGGAAHLLSFAGSDNLAGSGMAITYYHAEKVYGLSIPATEHSICTLLGQEGELEVFKHVLRSFPTGVIACVSDSYNIFRACSEYWGEDLKQEILKRDGTLVIRPDSGDPVMTLLEIFNILFDKFGFVTNARGYKVLPPQVRVIQGDGVNYTEIGVIYKALKENGISAENLVLGMGGALLQKVDRDTQKFALKCSSAVIDGKEVAVEKSPAEMDASGNISTSFKKSKGGRLKLVKTAEGYKTIQHDEQPELADQLQTVFENGHIIKDFTFEQLIDTLQHQ >NZ_CP043449.1|WP_112654127.1|1038818_1039715_-|NUDIX-domain-containing-protein MKTGVIIARFQTPYLHEGHRELIAQVKQNHAKLIILLGVSPIKGSRKNPYDYYTREKMIKKDYPEIVVLPISDNPSDKVWSDNLDNLLKSVFNAEQFCLYGSRDSFIPYYSGKFETIELPEHGDYNATELRKQYADKVFDSNDFRAGILYAYYNQYPKVYPTVDVALFRNNRSEILLGKKAINNKWRFVGGFTDPEDTCYEDAAKRELAEECGEMQTTAMEYETSAKINDWRYRSEADKIITLLFSSDFIEGEPKAQDDIADLAWFKLTDLPLMIKDGSISEEHVELFNFITGKYLKN >NZ_CP043449.1|WP_112654126.1|1040159_1040846_-|NUDIX-hydrolase MSVAQNIKVAVDAVVFGYTSKEGLSVLLIKRNIEPFKNSWALPGGLVADHESLEEAIQRELREETGVNITYLEQLYSFGQPGRDPRNRVISITYYGLVRPDAFVVKAATDASDVNWFNIKKLPALAFDHTTIISVARERLKSKMLYQPVGFELLEEKFPFSELEKLYLAVLDRPIDRRNFKKKITKYGFLEETTEKQALEGAGRPGNLFRFNEEKYFQLKKEGISFEI >NZ_CP043449.1|WP_149354028.1|1041095_1041644_+|RNA-polymerase-sigma-70-factor MEQLRLDDRKAFEILYHKYSSKLFYAAYNLFRDKDVCEDLVQELFIDLWTKRNQLNITSLEAYLKVAIRHRVLFYLRTKKASVDLAVIETLVEKYSADSKLFQDDIAHLLEDGVAQLPEKCRQIFTLSRKEYLSNKEIATRLNISIKTVENQITIALRYLRTGLTDYLPSVVALVLLHMFGK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP043449_5 | 2975975-2976076 | Orphan |
NA
Consensus repeat of NZ_CP043449_5
|
1 spacers
spacers of NZ_CP043449_5
>5.1|2975999|54|NZ_CP043449|CRISPRCasFinder TAGCTTTCTTTGCTTTTGGAGCTTCTTCAGCAGCAGGAGCTTCAACAGCTTTAG |
CRISPR arrays and Neighbor proteins around NZ_CP043449_5
The CRISPR arrays of NZ_CP043449_5 >merge|NZ_CP043449|5|2975975-2976076|CRISPRCasFinder CTTCTTTTTTAGCTTTTGGTGCAGTAGCTTTCTTTGCTTTTGGAGCTTCTTCAGCAGCAGGAGCTTCAACAGCTTTAGCTTCTTTTTTAGCTTTTGGTGCAG >NZ_CP043449|5|5|2975975-2976076|CRISPRCasFinder CTTCTTTTTTAGCTTTTGGTGCAG TAGCTTTCTTTGCTTTTGGAGCTTCTTCAGCAGCAGGAGCTTCAACAGCTTTAG CTTCTTTTTTAGCTTTTGGTGCAG
>NZ_CP043449.1|WP_112655049.1|2975453_2975870_+|hypothetical-protein MRYLLVFGLIILPVALFAQGKAKREHVKPDTTVYKTVDVQPEFPGGTEKWTNYLFKVPIPKDYDKENTQASFLIQMIIETDGSVTHASVRRKINEAMCKAFIAHVNKSPKWKPGRINGKPVRVLYSSPISCFMLQSDE >NZ_CP043449.1|WP_112655048.1|2973856_2975449_+|M1-family-metallopeptidase MKKFSFILIILLASASLYAQTLTSGGKLKPEQAIMDVRHYTISLAVDPVQKTINGFTTIDVIMEKPTRVLLFDLLDSLTISKVLVNGKQEAFEYKNNLITINTAKELPAGKASVKVIYGGKPHVARRPPWDDGFIWTRDSTGHQWMAITAEGTGGKLYFPCKDHPSDEPNDGVDMFITVPKDLVVAGPGLLKSVSKQKGTATFHWQTKYTINNYSILFNAGDYTVVTRPYTTVDGHNVPLQFYVLKEHASKAEHHLDIFVKTIKEQEKYFGEYPWVKEKIGIVETPHLGMEHQTMNAYGAKFKYTKVWGEDYDGLMHHEFGHEWWGNKVTAKDWADYWIHEGICTYGDALYVREFEGEKAYIKFFQNSALSFGNKIPIVIGKDIDEESAYNGDIYGKGAFFMHTLRYIMGDSIFFPTLRGFVTDPRYTYSNLASTDDVIQYFSKAAGQDLKPLFDLYIYSINKLEIHIKAQRGDKYQVQLLNIDMPLPVDITTDGVTKRYTLDKKGITVTSKTIPVIDPDTYYLKKLIIE >NZ_CP043449.1|WP_112655047.1|2973269_2973683_+|energy-transducer-TonB MKKIFVIISLSFWPILVKAQTETSKIDTEEYKCNCGIKVDKQPEFPGGTNNFFIFVRKNLRWPVKSQEIEGRVIVEVTITKNGKLTDPIVKRGLSREQDKEALRLINKSPKWEPAMLNGKAIDFKYYIIISFKRDIE >NZ_CP043449.1|WP_112655046.1|2972907_2973267_+|energy-transducer-TonB MKKILPILIILMVSLSAKAQKLAPPHFRGGDKAFHEFLDQNLKWPKDSAVKQGIVKVSFYVESNGLLSDIKLVQGFAHEFDKEALRVINLSPRWVPATRDGKFIKSKYSVPILYESIEL >NZ_CP043449.1|WP_112655045.1|2971919_2972894_+|polyprenyl-synthetase-family-protein MKQLTELQLLINDAVGKLSYPAYPADLYEPISYILSIGGKRMRPALLLLACDLFGGDVDKAIEPALAIEVFHNFTLMHDDIMDKAPLRRGKATVHEKWNANVAILSGDAMMVEANRLMMKVDDSILRNVLDVFNDTATGVCEGQQIDMSFEQRNNVSIEEYINMIRLKTAVLLGGTLKIGSIIGGAALTDADLIDSFGVNLGIAFQLQDDILDVYGDPEKFGKQVGGDIISNKKTFLLIRALELAKDGQAQTLNQWLCAAEFDTAEKVKAITNIYNELDIRQHAEKAMQTYADKAFVALDAINLPEDHKQYLRDFADGLLVREN >NZ_CP043449.1|WP_112655044.1|2969691_2971830_+|ribonuclease-R MSKRKKNNSSIHQVLTQMVLDIFEQNGNTPLNYKQVSAKLNVRDPESREIIYDILKDEVKKSVLKEIAPGKFQLLELKTFIEGVVDLTNDGSAFIVTDDEFESDIFIAPRKLRTALNGDRVKVYVYAKSKGKHKEGEVIEILQRAKMEFTGIVKLSERYAFFIPDDRKMMHDIFIPISELNGAKNGIKAVAEITDWPTEAKNPIGRIKHILGAQGENDTEMNAILAEYGFPLSFPAEVEHDAEEIPDVITPEEIAKRRDFRNITTFTIDPFDAKDFDDALSYRVLHNGNYEVGVHIADVSHYITPDSALDKEALDRATSVYLVDRVIPMLPERLSNGLCSLRPKEEKLCFSAVFEMDENANIITEWYGKTIIYSDRRFTYEEVQEVIETGKGDFKEEIFKLNALAYKLRDRKFKNGAISFETTEVKFKLDENGKPTGVYVKERKDAHKLIEDFMLLANRKVAERVSKMGKGKHKYTFVYRVHDSPKPDALANFAQFAARFGYKINTKSDKETAKSLNYLMEDVEGKKEQNVLTHLAIRSMAKAIYTTKSSSHYGLAFDHYTHFTSPIRRYPDVMVHRLLFHYLSGGQSANAEFYEKLCSHSSLMEKKAADAERSSVKYKQAEYLRDQVGNTFMGIISGVTEWGMYVEIIENKCEGMIRLRDISDDFYTLDEKNYAIIGQRKKKIYQLGDEVKIKVKQVDLTKKQIDFILVQE >NZ_CP043449.1|WP_112655043.1|2967915_2969613_+|ABC-transporter-ATP-binding-protein MLKVSDLSVSFKNGKNQFTAVKGISFTLNKGETIGIVGESGSGKSVTSLALMRLLNEDQAVIGGSVLLNGVCLCKLSEDEMRHVRGNQVAMIFQEPMTSLNPVLTCGFQLTEAIRLHLGSSKAEAKQKTIELFKEVQLPRPEAIFNSYPHQISGGQKQRVMIAMALACNPEILIADEPTTALDVTVQKTIIELLHKLKAERHMSLIFISHDLGVIKEIADRVLVMYKGEIIEEAAVKDLFANPRHPYTKGLLACRPSPQQHLKKLPVVADFLDEARPAVTIESIRELYHYPDTEIAERKRKLYEQQPLLKADKLNTWFPTDTGFFKRKDHVVKAVNNVSFDVYPGETLGLVGESGCGKTTLGRSILRLIEPTSGRVIFGGTELQGLKKNELRQIRKDIQIIFQDPYSSLNPKLTVGQSLMEPLQVHQFYSNDTTRKRKVLELLERVNLQPAHFNRYPHEFSGGQRQRIVIARALALQPKFIICDESVSALDVSVQAQVLNLIRELQDELKLTYIFISHDLAVIKHISDRMMVMNKGEIVETGYPDDIYYRPKEEYTKRLIASIPG >NZ_CP043449.1|WP_112655042.1|2967205_2967916_+|3'-5'-exonuclease MLEQYDLHNLLVIDIETVPQYSTHEQLPENLQVLWELKTRHQRKDEPADIHYERAGIWAEFGKIVCISVGIFIAGKNIGLRVKSFASHDEKELLTKFCNLLVSQPPTLILCAHNGKEFDFPYLCRRLLVNGIPIPPQLQIAGKKPWEIVHLDTMELWKFGDHKHYTSLNLLTTIFNIPTSKDDIDGSDVGRVYWHENQLERICAYCQKDVIATAQLLRRYRGEELIADEFITIVGS >NZ_CP043449.1|WP_112655041.1|2964714_2967105_+|penicillin-binding-protein-1C MQFVLKRAKSYLKKPKVTVSLFFLFVLTLIFWFCLPNPLFNSPTSYVIDDDQGQLLGASIANDGQWRFPYNPTVPEKFKQCIITFEDKRFEHHPGFDIVAFSRAIKQNLSSKKVSSGGSTLTMQVIRLATRHKRNIWNKLKEIFMAMRLEVTHSKSEILALYTSNAPFGTNVIGLDAASWRYFGRSPDKLSWGEMAAMAVLPNSPSLVHPGRNRAILLRKRNSLLDKLHKAGIIDSTTAALARLEPVPDRPMALPQLAPHLLQRFKADHQAKPEGDTRITSSIKSSLQQQVNNILEQHHSLLKANDINNIAAIVLDVETGATLAYAGNISHREDPQMESDVDVIDAPRSPGSTLKPLLYAAMLHDGLILPNSLMPDVPTMIAGYHPENFDLGYDGAVPASRALSRSLNVPAVKMLQQYKYERFYDFLHKAGITTLTKPADHYGLSLILGGGENTLWELSGAYADMARVLNHYNKNNGKYDPADFHNPVYEKKAAAKPELEKSGLLDAASIYYTFQAMEEVMRPGEEMLWQQFSSSQRVAWKTGTSFGFRDGWAIGVTPKYVVGVWVGNTDGEGRPGLTGINTAAPALFEIFRLLPVSRDWFEMPMGEMVKINVCKQSGYRAGQYCQDADEQYVPKSGLKALVCPYHQLVHLSADAKWQVNGNCEPPDNILNKSWFVLPPSMEYYYKARNYQYHVLPPFRPDCTQAENGNTMEVIYPKNGAKIYVPLEADGTRGRMICNAAHRQPGMKIFWHLDDQYVGETKDFHQVALNPPPGKHILTLVDGNGNTISIEFEVLKK >NZ_CP043449.1|WP_112655040.1|2963906_2964632_-|glycosyltransferase-family-2-protein MTIQKLSIIIPAYNEGKTIHLILDKIKEVNLINDIEKEVIIVNDCSKDDTEAAIYKYKLANPEVNIQYFKHESNKGKGAALHTGIAKATGDYLIIQDADLEYDPAEYNDLLKPVVAGFADVVYGSRFMGSNPHRILFFWHTIGNRWLTFASNMFSNLNLTDMETCYKLFNTKVIQSIKLTEKRFGFEPEVTQKISRVHRIRIYEVGISYYGRTYEEGKKIGWKDGVRAIYCILKYGLFKSK >NZ_CP043449.1|WP_090467954.1|2976408_2976717_-|50S-ribosomal-protein-L21 MYAIVSIAGQQFKVAKDQQIFVHRLQGDEGASIEFDSVLLAENEGKFKLGSDLKGAKVSAKIVSHLKGDKVIIFKKKRRKGYKKKNGHRQQFTKIEITGITL >NZ_CP043449.1|WP_091168300.1|2976792_2977215_-|hypothetical-protein MIKNAPYRFVDFESISFQYGHKDSLVNKYDSRTGMFQYLDRRDSLVKEHLRLTKDDLLYLHRKAADLGFWDFPSKETGDTSKVADGKAVRYIIEFKYKEKTKRVIFDTDYFGNPKLIDANQRLIAEIQKKLTDVENRGKK >NZ_CP043449.1|WP_112655051.1|2977722_2979114_-|dicarboxylate/amino-acid:cation-symporter MKKSRLTLFIFIALVLGVIAGYIYNTYVFADLNKQLSSAGAAIKSIDKKIEALPDTTVAAYKDFKLQRIALVKLQSQATDAREDKLELYNILSKIFLNLIKMIVAPLVFTTLVVGVAKVGDIKAVGRIGGKTMLWFISATLVSLLLGMLLVNLFEPGKTMHLPLPDSHLSTGIKKSALSLTEFVGHVFPKSFIEAMANNEILQIVVFSLFFGVATAAIGEQGKIVIKAMDAFAHVIMKITGYVMKMAPLAVFGAITAVVAKQGIGVLSTYGIFISEFYFSLIVLWSVIILAGYIVLRKPVFRLINRIKDAMLIAFSTSTSEAAYPKVLEELERFGCSNKIVSFVLPLGYSFNLDGSMMYMTFASLFLAQSYDIHLSFGHQLSMLLVLMLTSKGVAGVPRASLVVIAGTLAMFNIPEAGLFLLIGIDPLLDMGRSATNVLGNAMATAVVSKWEGEEVGTQIIRE >NZ_CP043449.1|WP_112655052.1|2979280_2979808_+|hypothetical-protein MTTTAYGLQHIKKEIQHLPNEHLAELMLRLARYKKENKELLAYLLFEAHDEAAFIEKVKAEAGFMFSQLSSLSYNAAKGMRKILRLLSKYTKFMASKGAEIELLINFCENYLEYADRRTSYKPLRLILIRQVEKIRGLINKLHEDLQFDYQDSYNKLISDAESKLGWFKKNDHLL >NZ_CP043449.1|WP_090525007.1|2979849_2980347_+|RNA-polymerase-sigma-factor MANKEAAFKQIYEANSKKIFHLCYGYTGDDDAANDLLQETFLKVWQNLEKFRNQAMISTWIYRIAVNTCLTYLRSEKRQAKDELTPQLAETKREELSDKNEQVALLYKCISKLEESERIIITMVLDEVPYPEIAEISGISEGNLRVKIYRIKQKLTELYNQYERL >NZ_CP043449.1|WP_112655053.1|2980333_2980924_+|hypothetical-protein MKDFDHLMSVWQGQPKPDQLSVDEVLKQVKKGIRSITQKLYWSIVAMVVTVAFAFVVTFFLAFKSAVTTIGILIVLVTMLMYLSLMVRHYHILSKRDATLNPAEYLDSLKAYQKNRSKVIGWFYYTYILLLSAGLAMYFIEVLEHSSLTFKIVTYTSIGVWFLFTTFYLKPRMFKNEEEKLNLMIDRLVRLKEQFD >NZ_CP043449.1|WP_112655054.1|2980949_2981501_+|GNAT-family-N-acetyltransferase MAISETVTIQKLTLADADVLLELSKKTFFDFFAHLNKPEDMEAYASVAFTPQKIQAELSDPNSHFFFAMLDGEITGYLKLNYCHAQTEFQDPAAVEIERIYVLAEYHGKKIGHQFIDFTLKAATDKHLQYVWLGVWEHNLKAIAFYEKHGFEVFSSHEFTLGSDKQTDLLMKKAILSSSKPKA >NZ_CP043449.1|WP_149354087.1|2981554_2982469_-|hypothetical-protein MKNLLTILIETLQVLNENIHLSLTGKLVSKCEEWLERISGYITTNDELMLDTFLHNEINPFLEHFRNNYPAERETIDRYFNAMNEETGASFENRRKLETSMQLINTSINQYLEQAQTEVQESFPCYFEKFRTDGVEYDIYIGQSIAPQRVFDMLYLKNIRLWQLRSMAEIARMTNDLGDQLSRPLQTTHLIFIHSNAIDISFRNDERRFDVEGAYNIRYEVVKKRIDKVLIAGTFERLTQPGKIAMVYFNPTEAAEYDEYIRYLQVQGYLLDDLEYLELEELQGVTGLKALRVGVNYQLPLLNS >NZ_CP043449.1|WP_149354088.1|2982465_2983911_-|hypothetical-protein MHTEVLNISKNECTICQVETCLTFNPFVAHLKERIATEKTLKSEFYRYVLERFEHDICIDLDMRPTDAEKYREMLELIYSILTPPIANEKEFHWALSTPVPDKIFFSTEAFYDFHSSHHSNLYALNVSKNEMFSDRQKRFIYNLILERMYGFSSAIKNELLFSYEDPETGLSRYFNIQTDARFVEIELNGELPELSFDTIEPYLHSHTSLELIEKVIPLNIFKFKGFSIITLTDVTLTHALENIRTELVNHSANEEEQYAHIISSLKSLAESPGIEFGLMPFLTINNEPIFDNDECSRSILLKAAKDFNLAEETFDAIIDDYNQNPRPIFFNSITDEKIGKFPFLQVLKQAGIKSYGIFPVYYNKKNVGIMEVFSYKEIVFYEKLLSKLQATIPLIAQLLQNSIDQFEARISSVIKNKFTSLQPSVQWKFNEVAWNYLKEKRKKKKSPEIETVSFSHVYPLFGLSISAIPQLNATAPCRRI >NZ_CP043449.1|WP_112655056.1|2983935_2987541_-|BamA/TamA-family-outer-membrane-protein MIKKLPLLFVSLLGMLKTQAQDSVQYRMILIGDAGEMDLQQSAVLKHAAANVLKNKTSVFYLGDNIYPRGMGLPGSPEEAETQKILQSQYQPMRAAVAPVYFIPGNHDWDKMGPLGLAKIKRQWEYLDQQGDKLLKMVPENGCPDPVEINLTNDLTVIAFDSEWWLYPFNKSNPGAECDCKNKDDVADKIKLIFERNKDKVIILASHHPFQSYGTHGGYFSLKDHIFPLTVANKDLWVPLPVIGSLYPFLRSAFSNPEDLGHPLYKDMIKKVDAVFGSFPNLIHVAGHEHGLQFIKSKQLQVVSGAGAKQTYAIKGKYSLFADATQGYVTADLLIDKRMRFTYYIDSDKGVKQVFTYTQPYTSVKNIPDTLHTPIKADSIMVRVHPKYDSVSNFHRRWFGENYRKEWAAETKLPVIRLSEIHGGLKAESIGGGFQTHSLRLIDKDGKEWVLRSVEKQPEKILPDEFQETFAKDWVNDAMSAQHPFSALIVPPLADAARVPHANPIIGVVSADPVLGQYANIFANKVCLLEEREPGGKSDNTVKMIGKMLEDNGNTVDGDEFLRARLLDLLIGDWDRHGDQWRWRDEDKGKGKFYTAIPRDRDQVFYTNQGILPTIAAQPWIAPNLQGFSGEIYSVKYSLWKTMFMQRWPSVQYSHEQWTKIVNDFVAAETDEVLEAGLRRLPESSYKLRHDVLLKQLKERRANIPAEMEYYYKFINKIVDIHTSNKNELVTVKDEPNGSLNVVINKINKDGNIKDTLMNQNYHPDITKELRIYVSGGDDKVVLDNATSPIKLRFIDSTGTKTYNIVKSLNTVQLYDRGKKLSITGDESKVNKYISKDSANTSYLRVNLMNVIAPLANISYNPDEGVLLGAGFKYTHQEGFQKLPYNDVQTVFASHSFTTKAFKIKYNGEWIHAFGKADFLMDAQLDFPDNINFFGRGNETPFVKVGDYRKYYRTRFDNFTFSPAFRWRNESGTSVTVGPAFRYYKLDVDDNIGRFITYPLLTNAPDSYTFDKTKMHAGIVVNFTSDKRNDKVLPAWGSFVNIKLQEFNGLNNNSTSFAQLIPQVALYKSLNAKSTIVLADRVGGGITIGKTAFYQSLFLGGQDNLLGYRLYRFAGQHSFYNNLELRIKLTDFASYIVPGQIGVIGFYDIGRVWENGQSSDKWHNGTGAGFYVAPARLAVIKIIAGYSEEGWYPYLSTSFRF |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP043449_6 | 4342439-4342556 | Orphan |
NA
Consensus repeat of NZ_CP043449_6
|
1 spacers
spacers of NZ_CP043449_6
>6.1|4342468|60|NZ_CP043449|CRISPRCasFinder GCTCAGGCTTGCCCAATAATTCCCCTTCCAAATATTATTAATTTATCAAAAGCTACTAAC |
CRISPR arrays and Neighbor proteins around NZ_CP043449_6
The CRISPR arrays of NZ_CP043449_6 >merge|NZ_CP043449|6|4342439-4342556|CRISPRCasFinder TTCTACAAACCTTTTGCCCCTCTGGGGCAGCTCAGGCTTGCCCAATAATTCCCCTTCCAAATATTATTAATTTATCAAAAGCTACTAACTTCTACAAATCTTTTGCCCCTCTGGGGCA >NZ_CP043449|6|6|4342439-4342556|CRISPRCasFinder TTCTACAAACCTTTTGCCCCTCTGGGGCA GCTCAGGCTTGCCCAATAATTCCCCTTCCAAATATTATTAATTTATCAAAAGCTACTAAC TTCTACAAATCTTTTGCCCCTCTGGGGCA
>NZ_CP043449.1|WP_112654776.1|4341913_4342381_+|IS200/IS605-family-transposase MASNNTYSQLYIHIVFAVKYRMALIEDTWAERLRMYITSIIQNQGHKLIAINNMPDHLHLFIGLNPNQSISEIVRIMKSDSSEWINKQKLANGGFQWQEGYGAFSNSRSQIDKVVNYIANQQEHHRKITFLDEYRKMLNDFNIEFDEQYIFKLPQ >NZ_CP043449.1|WP_112654775.1|4340847_4341645_-|DUF3050-domain-containing-protein MANYSNRIAQLKNEIHPLRDQLINHELYKNITSLDELTVFMEHHVFAVWDFMSLLKALQQKLTCTVTPWMPTGNANTRYLINEIVAGEESDIDERGNRASHFELYLRAMQQAGSQAEGINNLFNELNFGKHIDEALIIANIPVAARNFVQHTFDVIDTNKDHLQAAVFTFGREDLIPDMFVSIVKELSQQLPGKVDILLYYLERHIEVDGDHHSQLAYQMTAELCGNDDSKWAEATVAVKEALRARIALWDGILEAIKVQEISSL >NZ_CP043449.1|WP_112654774.1|4339968_4340763_+|FHA-domain-containing-protein MAFSLFKRSGEKQPWDVKSLREAILRFIKESLQKIEGGEGGHIKELKLFIVADAEDKPIYEGAVYVHDKALFKNEVQKIADDFDINLPPDWTLDVEFTEELPAEAKRIPDLDAAFIMNTRKQVAHNAASFTAYLRILSGEAEKEEYLIKATDPKLYIGRDKKSVTENGSFRLNQFVFPGESRDESNKYISRQHAHIEWDGESERFMIFADEGGVPPRNKTKIHIAADGKMIKLNSTQIGHPLSEGDQVILGESAVFLYTTIANR >NZ_CP043449.1|WP_112654773.1|4335867_4339944_+|FtsW/RodA/SpoVE-family-cell-cycle-protein MAQENPKVPGRRMERLFLLLTGILLAVLFVKLFGVLQLKFTDVDKRLKDGTIVNLNSPNTAQNVKALLKKGYYFDDPKDIDYIESVIASRKTTGEQVDNTGELNKRKYYVNADEAFEKGGETFKKRVLTSRTLLGYTGDDSIRFEQELKNPPALGAQTDLNLGEYSIKGTIAHKGEPVPGVLVKLTMILPRDSIFSDEETGAKTSYSENASSYKKLYVLNDQKKKVLQSLTAFARTDEQGRFVFAKLPTGKAFSVLPMQPGFEFGRSQGVDELDKDVSFKFSQAPHSIKLLSTRDFNILKKEGAFIVRYQEEFNMWYWIIAGSFFAGFIIVHLLLSARYPDADQIILPLIMMLTGISFLTLLSLQDPLRDRFLAKDTLVYLGIGMAGICIIQLFNLRRLNPDSGFYRLLVFKWSRSAANGWPWAIVAMGILFSTILFGTGPEGSGVKVNLLGFQPSEIVKYLIVIFLAGFFATNEKFISQYASWSKRWSFFSFALIATIITLLLFLVLGDLGPAMVICFTFIILFSFSRGDFLYMAGFVVLFVLTTWFFDNIWLSAGITFFSLGSVVFFRRRRLSESAIMALVVITAFLTIDKIPGLDKIIPGPVERLVERKAIWQDAWNNEVYGGDQVANGLWAMASGGVNGQGVGQGFAKTIPEAHTDMILPSIGEEFGWAGMAAVFILFLLYLHRSIIIGRQTGMPLLFYLSAGIGVCTFVQFLLIAGGSIGALPLSGVSLPFESYGGSSLVINLLAAGFLLSVSSVRGTAVQMDYITKQQDKNLVPALAAALAGVVLLVVNVSRYTTDNKQWVVKPALVADKSGLRMFSYNPRIAILMNRLQAGTIYDRNGLILATSKPELIEKQKNKLSASGMLHYDLDSAMHKRLDRFYPFEEQTFFWTGDQNTGVFNGSTNGYFAEYEHAAELRGFHMPITNYNVKASRYQEDRFLPRGMKEMTVAKKDYSALANLLVSDINGPEVEAFKNKNRDVKLTMDADLQTSIQQSIASDTSLYDNRVSVVIMESNTGDVLTSAQYPLPPVHNWDQLTMPLADQNKLATWLTTTDLGFTYASQPGSTAKVLTAMSAFNKLGIAASAIQYHVSTQERIRTKGIEPDETGMITMERAIAKSNNVYFIKLANQEHLEEYMATLYLKTGMFLHGVGGYYYNKPVLNATQEDKWRTLWRKTEFNTKPRYDPNNIHKTRAKGISGMAWGQGELIATPAAVARLVSGVANDGILLPNRYALKIADSTVAVKSGIKLAEDPRYAALLKQYMIEQSAPKTPILGIKVAGKTGTPERIVRNQSVNDGWYVFFAPKEKGSGYLVVCIRVESTRGSSDAVHLAGNHVIPFLLKKGYIKSMETETTTEE >NZ_CP043449.1|WP_112654772.1|4334430_4335864_+|serine/threonine-protein-phosphatase MANNFFGITDTGRQRQNNEDVFIAEKSGDGNFIIACVVDGVGGYAGGEIAAEIARATILEQLQYIAGDIVPLLVNTFTIANQRIYDEKVQNKDLENMACVLTLAVVDMINNKFYYAHVGDTRLYLLRDYSLIKISKDHSFVGFLEDSGRLTEEAAMDHPKRNEINKALGFAGQIGQDPDFVETGNSPFLPGDILLICSDGLTDLVDKSKITNILTSSDNLPEKGKKLIDAANNRGGKDNVTVVLVHNDKERKQHSATKPVVAAVKAAEQPEAITPAPAKRQEEPNPVVKTKGNGGTVAVLTLLCFIFLGGFIWQFKKNADQAAVPKKTDTLIAQHIKNAVELKLQDTINKLKGHTLLLSAADFQQPIVLSDTLHINKDSLYIKTKGAVVFKKDSTYSGPAIALAANCKYVVLDSVAFDGFGTAIVTHNDALVLKNVQFNNCLTPVQALYMFPNKKYISGRLFGSMFKTDSVPTKATH >NZ_CP043449.1|WP_112654771.1|4333373_4334339_+|hypothetical-protein MEPKSTFWKRIGLQDWFLPNGKPVNEEAVKIKALTPDDVYLYIIEKFKESIKQLSFADRIVFYHEFIISFNEEDYQDFVNNRSGLFGIIVNESVKKFYELLREHQEVGKKVEPSSSKWVFRLVSHPDYKRGDKGFIGKLLPGTSAKKEENLRVTFIPRQTGVAQTLDISNEVLKGFTYYSEGYYELPYANDLHYNEKDVAKPGTKVLARLETIMPDKQFVGRKVEYLIKDDDIVVSGSDEEREEQAVFKVPSEWVNTPHLRIRLNKADGKFYMASFGERTLINELEVAGSDVNSPQWVELPFNSRILLNGIVGINIFKPEP >NZ_CP043449.1|WP_112654770.1|4331927_4333355_+|serine/threonine-protein-kinase MSKVFTITEGLENMGALRTGGQGSVYKGRRYGPIITAVKLLPTPIHTESTDDKNFRNFQNEVEKLKKVNEEPNPNVVKILNSGITESGSFPFIEMEYIDGPDLEDLLKPPHEAIFSIKEIIKLADQLANALSHCHKVSVKHGDIKSNNVKFNVHTGNYVLLDFGLSAMSDDQRRTSIRHAGAIEFMAPEQNEGLMYFQTDVYSYGIILYELIGGQVPFPLKDNGETARNAVMLAHMESEIPDVMELRRKNLPESWSDEKKEMQMQVPAWLLQIVAKCLQKDINNRYANGIELQEALMQGSIGAISPTHPDESWNTEVLLKENERLQGLVLYYQENENKQPAQVVNSEPVDNKAVRMSKPIFVLFMILLCGFTVFSAVVMDKFGGRIYNGVVSRLFKPSKKATDSAASNKIILPQKKDSVQQPKASDYKDESNIPPEVDSTADSILRNIQRAKQQKEDTQFYRDSVKKADTSNLNF >NZ_CP043449.1|WP_112654769.1|4329375_4331685_+|family-20-glycosylhydrolase MSSYNLFFSRPIKIIVFILCVAFALPGKAQIYKQGVIPQPVKIKSNDITYAFPREFVIGLGPSIKASNVTFFRHYINLARDIHETEPFVNHKMAASNLWLQLDPKSISQPEGYTLVVKPHQITITGHDEAGVFYGLQSLIQLLDIGKDKITVKGYTITDYPRFAYRGMHLDVSRHFFKPEAIKKWIDLLALYKINTFHWHLTDDQGWRIEIKKYPLLQSISAYRDETIIGHKKDSPHKFDGVRYGGYYTQDEVKEIVKYATQRHITVIPEIEMPGHALAALAAYPQLGCTDGPYKTATFWGIFDDVYCAGNDETFAFLQNVMDEVLPLFPSKYIHIGGDECPKTKWKVCPKCQQRIKDEHLKDEKELQSYFIGRMEKYLNSKGRQIIGWDEILEGGLTSGATVMSWTGEEGGIAAAKQHHDAIMTPEKYVYLDYYQSLYPAEPLAGGGYTPLSKVYNYEPITGDLSGEEAKYIKGVQANAWSEYMTSPAQAERQLFPRMLALAEVAWSPKQSKNYDDFLKRLRYHQPLSNLDINAAKVFDEITDSVIETANHQVALNLQTTLPGAKIFYTTDGTEPGLNSKGYISAITIASSGIIKAAVFNNGRQQGRTYEKSFSIHKAIGKTVALKNQPQGGFNPGNTFSLVNGIFGSKLYNDGQWYGFAGDDLEAVVDLGSMQNVSKLGINILKYHWQKMWEPTLLTFEVSADGSNYTEVYRQTDFPDNGINAVRANIKTQQARYIRVKATNKEIIPPGEYIAGAKAWLMVDEIVIQ >NZ_CP043449.1|WP_112654768.1|4328258_4329194_-|ring-cleaving-dioxygenase MENTINGIHHITAIAGNAKKNYDFYTRVLGLRLVKKTVNFDDPGTYHLYYGDGNGTPGSILTFFPWEGIATGRRGARQVTEIGYSVPEGSLDFWLKRFEDNNVIYNKPAEKFGEQYLTFLDPDGLKFELIVPKKADNRTPWETAEVTAANATKGFHSITITSNKIEATAKILTGVLGYRLLEQHVNRYRFITDAVDNAAIVDLVEVPGEVAGHVAGGSVHHVAFRVPNEKVLMEYREKIANLGLHITDKIDRNYFYSLYFREPGGVLFEIATDNPGFAVDEPAELLGTGLKLPAQYENLRGELEKTLPSLV >NZ_CP043449.1|WP_112654767.1|4327626_4328250_-|phospholipase MYRHTKQVVSAGVPAEQAKKAIIMLHGRGASASSMISLKDHLELDGYAIYAPEANQHSWYPYSFMAPVQNNQPALDSALEVIDELVEDLRQKGIAKENIYFLGFSQGACLTLEYTGRNAGRYGGIIAFTGGLIGEELVKENYKGDFNNTPVLITTGDPDPHVPVSRVNDSVEILKELNADLTLKIYKGRQHTISHEEIVLANEILKN >NZ_CP043449.1|WP_112654777.1|4342651_4343383_-|DUF2071-domain-containing-protein MAKSEFLKAQWKNLVMINYEVDAAILKPYTPAGTVLDLWEGKALVSMVGFMFSDTRVLGIKWPWHVNFEEVNLRFYVRYFNGTEWKRGAVFISEIVPKSMIVLIANNLYKEHYRALPMRSSITSAADNHTQFLYEWKLNGRWNKLGATASNELVDIKAGSAEEFILEHYWGYNSLSPIKTMEYQVEHVSWQTGLVREYVFDADVAALYGEAFRPFLEKEPVSAFYALGSDIVVRMGEKIVVGK >NZ_CP043449.1|WP_112654778.1|4343429_4344233_-|M48-family-metallopeptidase MKKFKPLLVLIAIMAVFSCSTVPLTGRKQLSLVGDAEVNQSAAASYKQLLSDPKTKVVASGADAQRVKTIGNRLAVAIEKYLKENGYGDQYSFNWEFNLIQSSEVNAWCMPGGKVAVYSGLLPVANTDAYLAVVMGHEIGHAIARHSAERISQEMLVQGGGQLVGAATSQQSQATQTAISTLYGVGSQLKLLAYSRKQESEADRLGLTFMAMAGYDPHNAIAFWQRMAAQNKGGSPPEFLSTHPADATRIADIQNLIPEAMKYYKKY >NZ_CP043449.1|WP_090531854.1|4344238_4344892_-|glycerol-3-phosphate-1-O-acyltransferase-PlsY MITVYSVTALIMAYLCGSIPTAVWIGMAFYNVDVREYGSGNAGATNTFRVLGKKAGIPVMLLDIFKGWAATNFAYFIGASATGAINSTAYTNYELALGIAAVMGHLFPIFAGFRGGKGVATLFGMILAIHFHAALLCIVVFITVLLISKYVSLSSIAAAFTYPIGVTFVFPTPIRSIVIYGMCICVLVLVTHQKNIERLIRGKESKVNFFKKKTTAA >NZ_CP043449.1|WP_112654779.1|4344995_4345841_-|carboxylating-nicotinate-nucleotide-diphosphorylase MDKELIHQFINNALSEDVGDGDHTSLSTIPADATGKAKLLVKDEGILAGIELAAEIFHVVDPNLKLNVFLQDGAPVKYNDIAFEVEGNSRSILTAERLVLNCMQRMSGIATKTRQIVDLLKGTNTKVLDTRKTTPGLRYLEKWAVRIGGGVNHRFGLYDMILIKDNHVDYAGGIRQAIESANQYLTDSGKKLAIEIEVRNLDELEQVLQTGRVNRILLDNFNFDDLRQAVGIIQGRYITEASGGITIDNIREYADCGVDYISVGALTHSVKSLDLSLKAVK >NZ_CP043449.1|WP_091172493.1|4346002_4346398_-|DUF4783-domain-containing-protein MKLIYLPSFIFLLLLPYVSSADAIDNVANLLKTGNTKELSKLFANNVEITIMEDENVYSQNQATVILDKFFARNKPKSIKLLHKINSGGNYHFGVYILNTDKGEFRVAITLKDAGKGTNVVELKIEDEKVK >NZ_CP043449.1|WP_112654780.1|4346594_4348118_+|2,3-bisphosphoglycerate-independent-phosphoglycerate-mutase MENKKKLALIILDGWGYGRNDQSNAILAANTPFVDGLLKQYPNSKLEASGTAVGLPAGQMGNSEVGHMNLGAGRVVYQELGRIHKAVDDNELPTIPVLKDAFEYAKQNNKDVHFIGLVSDGGVHSHIRHVKGLCDTAKQLDVNNVYIHAFLDGRDTDPKSGLGFVTELEEHIAGAGAKIASAIGRYYAMDRDNRWERVKLAYDLMVNGIGTPTQNVTDLIKHSYLEDVTDEFVKPIVAVDDAGKPLAVIKDGDVVICFNFRTDRGREISIALTQKSFPEYNMHPLAIRYITMTPYDETFKNVQVVFNKEDLTKTLGEILQNAGKSQIRIAETEKYPHVTFFFSGGREKEFANEKRLLVPSPKVATYDLQPEMSAAGIRDAIIPELETGWPDFVCLNFANTDMVGHTGVFSAVVKAAETADSCTKAVVEAGLANGYSFIILADHGNADYMINEDGSPNTAHTTNLVPCIVIDKDVKEVKDGKLGDVAPTILSILGVAIPPEMTGNVLV >NZ_CP043449.1|WP_149466841.1|4348146_4348671_+|hypothetical-protein MGGFLLLIANLFSCRPDNRQSGAKLVYFDLKEFFRADSARLTRLNPAVNKTVTHNGVTETKVVHIGNWNQELNLFIQSDINRPAWKNSYTVSTSDSAIIYKARTPELKTRRIIIKKAGDKVAWILIYNHTKNLLYETNEKLSYFPDSLYLIQKTQHVKLMGRNDYKVQGTLPKR >NZ_CP043449.1|WP_090531867.1|4348692_4349343_-|hypothetical-protein MKLSVILPLLIMSSLESAIAQNAYVRLGQQALMDGDFRSAVSHLEKACITDSTNANAMWMLGYSYYHSDNYKKSILAYTKVIAVKPADATAYYYRARAKSYLGRDSQASAADKELYLLGAIVDLTKAISINSDLRDNKYYQNRGIAYRDYGMFKLQATSRFYDKARGINSLKASVADLEKVLADNPGRMDISALIDQSKEKLIQATTGVNTLVKQH >NZ_CP043449.1|WP_112654782.1|4349711_4352186_+|endopeptidase-La MNFDPFDFKNALPVINEDSEFFPLMSSEDEEEMNNEELPDVMPILPLRNTVLFPGVVIPITVGRDKSIKLIRDANKGSRMIGVVSQQDVGIEDPTFNQLNKVGTIALIIKMLQMPDGNTTVIIQGKKRFYLKEEVQSEPYIKATVEPFHEIKIKEDKEFKAMVSSIKDMAMNIIQLSPNIPSEAGIAIRNIESTSFLINFISSNMNADMTAKQHLLEIANLRERANLVLEHLTLDLQMLELKNQIQTKVRVDLDKQQRDYFLNQQLKTIQEELGGNTPDLEIESLRQRGIKKKWAKEVKDHFNKELEKLSRTNPAAADYSVQINYLELLLDLPWNEFTKDNFDLKRAQRILDKDHFGLDKVKQRIIEYLAVLKLKHDMKAPILCLVGPPGVGKTSLGKSIAKALGRKYVRMALGGIRDEAEIRGHRKTYIGAMPGRIIQSIKKAGASNPVFILDEIDKVGNDFRGDPSSALLEVLDPEQNGTFSDHYVEMDYDLSNVMFIATANSLSTIQPALLDRMEIIEVNGYTIEEKIEIAKQHLVPKQREAHGLKIKDVSLKADVIEKVIVDYTRESGVRSLEKKIGSVVRGVAKNIAMEEPYNSVVSKKDIEKILGAPIFDKDLYEGNDVAGVVTGLAWTSVGGDILFIEASLSPGKGRLTLTGSLGDVMKESVTIALAYLRAHAADFDINPKLFDQWDVHVHVPAGATPKDGPSAGVTMLTALVSAFTQRKVKPNLAMTGEITLRGRVLPVGGIKEKILAAKRANIKEIILCKSNQKDILEIKEDYIKDLSFHYVTDMRDVITLALLNEKVKNPINLTVKEDEKAAIN >NZ_CP043449.1|WP_112654783.1|4352319_4352751_-|DUF1801-domain-containing-protein MNVQQQTEEYIASQPEPKRGDMQTLHRHILQILPGCKLWFEDGRNAEGRIVSNPNVGYGSYTIKYANGTTREFFQVGMSANTTGISIYILGIKDKKYLAQTYGKEIGKANVTGYCIKFNNLKDINIDILEAAIRDGVEITNEN |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP043449_7 | 4699953-4700050 | Orphan |
NA
Consensus repeat of NZ_CP043449_7
|
1 spacers
spacers of NZ_CP043449_7
>7.1|4699978|48|NZ_CP043449|CRISPRCasFinder CTATATTTGTCACAGGCTTAACAAAGAGCCCTTTCTTTTTAATTATAA |
CRISPR arrays and Neighbor proteins around NZ_CP043449_7
The CRISPR arrays of NZ_CP043449_7 >merge|NZ_CP043449|7|4699953-4700050|CRISPRCasFinder ACTTTGAAATACAAAGTACTTTATACTATATTTGTCACAGGCTTAACAAAGAGCCCTTTCTTTTTAATTATAAACTTTGAAATACAAAGTACTTCATA >NZ_CP043449|7|7|4699953-4700050|CRISPRCasFinder ACTTTGAAATACAAAGTACTTTATA CTATATTTGTCACAGGCTTAACAAAGAGCCCTTTCTTTTTAATTATAA ACTTTGAAATACAAAGTACTTCATA
>NZ_CP043449.1|WP_112652680.1|4699128_4699710_+|TetR/AcrR-family-transcriptional-regulator MEADKIKDSIKRAAQELFRKFGYHKTSVNEIAKKAKIAKATIYKYFDSKEAVLHVLLMDYIKASVDDLVQVNANDIDEEAYLNNLIMKTCRLSYTVCNEFIGWDFIRESTNSQDFLKNLSNELEELLMASFIQLPGIRKHETYQQRLRFLIKCSKSIVFSFAFTSVSDSDVRKNFVSFQKEILPYLVKAAVSV >NZ_CP043449.1|WP_091172165.1|4698715_4699129_+|hypothetical-protein MRKLTIILLVVIGCFSCGKATDNVPSVPVNFQAALGTPALSPLNVAGGAVAINGYGVSGLIIYRKVNGTYAAYDRCSSYQPEKKCAVTIDNTGFTVTDPCSGSKFSLEDGTPVKAPATKALRTYRVAVTQFEIQVTN >NZ_CP043449.1|WP_112652679.1|4697681_4698719_+|phenylalanine--tRNA-ligase-subunit-alpha MMQAQIDQYTAEINAFSPANADELEAFRIKFLGTKGIIKDLFEQFKSVSPEEKRTFGKVLNEFKQLTEAKYNELKENIVTGTQSKGNDLDLTLPGDGFTVGSRHPLSLVRNEIIDIFKRLGFVVAEGPEIEDDWHNFSALNFPEEHPARDMQDTFFIKKNNGKDDIALRTHTSSVQVRMMEAGKPPFRAIMPGRVYRNEAISARAHCFFHQVEGLYVDENVSFSDLKQTLYHFVQELYGEGTRVRFRPSYFPFTEPSAEMDISCTICGGAGCNMCKHSGWVEILGCGMVDPNVLENCGIDSKKYTGFAFGMGMERIANLKYVIRDLRLFSENDVRFLKQFQTEML >NZ_CP043449.1|WP_112652678.1|4697268_4697685_+|PIN-domain-containing-protein MAYKRLFLDSDVLLDMFLKREPFFFHTQILLIECIKRNIELRTSSLVIANIVYVLRKQAGILKAKENLKNLFNSAKVLPFEFDAIETAILSDITDFEDAIQFHIAQKHNCDAIITRNIKDYKNSTIPVLTAEQFLNTL >NZ_CP043449.1|WP_112652677.1|4696997_4697279_+|hypothetical-protein MESTKLTLSVKADSLSLVKEYAKRQHTSVSKLVQEFLDGIAEQEKKDDPIKEKYKNVEIPEWITQLTGIVKDPNPDMSYDDMKQEYFKEKYGL >NZ_CP043449.1|WP_112652676.1|4695765_4696956_+|glycine-C-acetyltransferase MYNTLKPVLQQELTEIENAGLYKKERIITSPQGADITVQGGAEVINFCANNYLGLSGNAKVVQAAKDAMDTHGYGLSSVRFICGTQDIHKELEKKIAEFLGTEDTILYAAAFDANGGVFEPLFNDQDAIISDELNHASIIDGVRLCKAQRYRYKHDDMADLEEKLKATQELRHRIIVTDGAFSMDGTIAQLDKICDLADKYNALVMIDESHCSGFMGKNGRGTHEHHNVMGRIDIITGTLGKALGGASGGFTSGRKEIIDMLRQRSRPYLFSNTLAPAITGASIAVLDMLSETTELRDKLESNTQYFRQKMTEAGFDIKPGVHPIVPVMLYDAKLAQEFAAKMLDEGIYVIGFYYPVVPQGKARIRVQISAAHDMHHLDKAIAAFTKVGKQLGVLK >NZ_CP043449.1|WP_112652675.1|4693155_4695636_-|carboxypeptidase-like-regulatory-domain-containing-protein MRKYILLLITAFSVITASAQQSLLTGKITDKNGQVIPFVSIYIRNSTYGTTANENGIYQFKLAPGTYNVIYRYVGYTEKIEEVTIADHDQEHNVQMADEVFATNRVAETYRKNRDAADTIMKQVLKKRKYYIEEATSYSCAVYIKGVQKLLSVPKSLLGQEVRKTLDLDTNGRGILYQSESLSEYNFQKPNKVREITIANRMAGQNTAFGYKKASDLQANFYQNVFTINGLATRGFVSPVASYGPRFYNYKLLGTSVENGHTIHKIQVIPKRGHGQYFQGDIYIVDGDWRIYSVDLFIENKTSNLNLVDTLKIRQQYVAITDSVWMPASTQYSFKGAVFGFKFGGYYAAVYNNYKINPTFPDNFFTGEILKIDTVANIKKPGYWADARPIPLTAFEDRDYKKKDAFEEYKKTDTYLDSLQHHKNHINYPGYLIFGYAASNKSNRDSLYIFPFIQTFYYNTVEGFGINAKVSYIRTIDDFHSLTITPALRYGFSNKIFSANMGFEYKNDPFHNAKFYADFGSDVLDLNNVGTRSLYFNTLSTLLSENNYVKYYRSHYGDFGYQREVLNGVFLKGGLSYSSRSQLYNTSFSKIKDIKDRQFTSNNPLAPPGTPADDHSFLFPDNQALVFNASATFTFDQRYETRPTGKFNLPSKYPTLTVNYRKGFKNIIGSDVDYDFASVDLSQDHIRVGLSGYSSFKVSGGGFFNNNNLYYMDYNHFLGNQGTTFDPTYVGSFHFLPFYTYSTNGAFLEAHYQHNFAGSLFNHIPLLRKWKLEEIIGANYLTTKNNRNYREFYVGVQRLIFRVDYGISYAGDKKYIQGFRIFYGIR >NZ_CP043449.1|WP_112652674.1|4692054_4693011_-|rhodanese-related-sulfurtransferase MKMYNTLLYYCYSTIANAEQFAADHLKFCKSLGLTGRIIVADEGLNGTVSGTVEACKTYMDTVHADERFAGIDFKIDEVDTPSFVKMHVRYKSEIVHSGLRDPNVIDPKQKTGKHLEPKEFLAMKDRDDVVVLDVRSNYEHSLGKFKNAVTLDIENFRDFPAMINELAQYKDKKILTYCTGGIKCEKASALLLHEGFPEVYQLHGGIIKYGKEAGGEDFEGKCYVFDNRLSVDVNSVNPVVISTCLNCGKTTPKMINCANPECNEHFTQCDECGTAMDGCCSDACKEHPRKRVYDGTGYYVKVPQPVNVSKNKLQPIA >NZ_CP043449.1|WP_112652673.1|4689057_4691952_-|transcriptional-regulator MRKTLCIIIVLAFISVWPAWSVDIKSVGVPYVQNYTKALYQFGNQNWSVTRDEHDIMYFGNAEGLLTFDGKYWQQYHMPNGLIVRSVSADGKGRVYAGGYGEFGYWHNDGKGILKYTSLISLVPKNFLPVTEEIWKIYCDNNRVLFQSFGAIYIYSAGKIEVVKTHEPYLFLFKCGSRYFAEQLTKGLFELKGSRLEYIEGSNILGASGVLSILPFQQGKYLIGTAKNGLFIYDGKTVKPWVSQANDFLKTYQLNNGAAIADKYFAYGTILNGIVIVDTAGRVVQHINKASGMQNNTVLSLYTDASQNLWAGLDNGIDRIEVNSPLYFYFDKTGKFGTVYSSIIFDKKIYLGTNQGLFYSDWLPDNHNSPFQTFDFKLIPGSQGQVWDLSLQDNRLLCGHNDGTYQVNGASLKKISDVTGGWTIKKMAPDMLMQGTYTGLVIYRKDAAGNWQFSHKLAGFSEPSRYVERDAKGQIWVSHAYKGIYKLTLSADQRTVVSHVYYDQKQGLPGSYNINVFDLDNRTVFSSDLGFYVYDDITDRFYKYQQLNSKLGTFATSSKIIKAIGKKYWFINQGRVGLADMSVTGKLTIDTNRFSILNGQMVQHYETINRINNSTYLISVDDGFVILNDADAQLPNRIKIPDVLIRRIENVTDKVSLITEAAEQSNNIEIPYAENNIRISYSLPYYTQAKIRFQYYLEGYSHQWSEWMPQSQKEFTNLNQGTYNFKVRAKINDQYQSAVSTITFTVLPPWYAGKIALVFYVLLAVLLFYVIRYYYGLKLKKHQQQIQQKLQKEKEEFLKQEAIANEQHIINIKNEQLQADLASKSRELANSAMNIVYKNELLQKISDELTHLKGGDGKKLADEQLRRIQKVIDEGMSDERDWNIFETSFNEAHENFFKKLKAGHPDLVPNDLKLCAYLRMNMSSKEMASLLNISLRGVEIRRYRLRKKLNLEHDKNLTEFLIEL >NZ_CP043449.1|WP_112652672.1|4685769_4688796_-|TonB-dependent-receptor MKRIFTISGLMLLLLFSFDAAFAQNVTVKGKITDAATGEALIGVSVQEKGTTNGTQTDVNGLYSIKASKNGILTITYIGYATKSVPVNEQTTLNVTLQAQANELAQVVVVGYGTQRKLDVTGSVASVKGTEISKQASVNPISGLQGKVAGVQITNSGSPGASPAVSIRGLGTVFGNVSPLFVVDGVWYDDISFLNPQDIENISILKDASSTAIYGIRAANGVVLVTTKRGAKGKPVINYNGYAGWQSVTNQVKMANATEYATAINELYTSNNVSPVLFSDPASYGKGTDWYGQILRNAFVTNHELSVSGGTEKVTYHLSFGYLDQDGLAKTNNYRRYTLHLSNDFKPVKGLKLGYTLSGLSGKSADVNGGIFHQLFGVAPTLPVYYKDGAYGDPNDYHTGDGNNYNPQATLDFFNQKSRNMRFTYNGYGEVSFLKNFKFKTSFGGDIGQAEVRGYTPVYAATFAQKSLVSNLDVNHSETRNWIWENTLTYDVKIKDHKITALLGYSAQNYRTKQLDAHAQNVPYVSSGSQRGSFPDTAKVTYYATPGSQVYTRALSQFARVNYSFKDKYLLNASIRRDGASQFYGDHTYGYFPSVGGGWVITNEDFMKDQKVFNTLKLRASWGKVGNSGVPINPSILTVSADPYLTAIFGTPQTTFPGASVNTVVPPTIFWEKTQSTDFGIEGAILNSKLSFEADYYDRKTKDAIYPLPILGSLGTNGGVVLGNQATIQNRGVEFLLSWKDQATKDFYYSISANLGINTNKVLNVLSGNIPIYQGGNGIANGQLATRTVVGQPIGEFYGYQVTGIFQTPQEVAASKQTSAKPGDFKYQDTNNDGVIDSKDRVVLGSPLPKYNYGINTSFTYRNFDLALDFQGTADVSVYNANIAYRFGNENFTKDFFDHRWHGPGTSNTYPSVNVGSTDNAKPNSFYVESGAYFRLRNAQLGYTLSGSILSKWKIQKVRLFANAQNALNFFGYKGFSPEVGGSIGNMGIDANVYPLYATYNFGVNVTF >NZ_CP043449.1|WP_112652681.1|4700061_4701396_+|cell-envelope-integrity-protein-CreD MIEEQSPKQTTLGWLRESATFKLIFIGLLALLLLIPSAFVQNLVTERAVRQGETAKEVSESWSASQIIKGPILVIPYKKGINMTDTAKQAPIENLYILPDNLHIKAGLTTQLRHRGIFDVAVYNTQVKVSGNFARLDLSSLGININQLLLNKARFEFSVSDLKGLKSNPVIKTTQPILGAEPSLESVFGNGLQAGVNLSAINNNEIPFDFTLDLKGSEGLSFLQMGKTTDVRVNGNWSSPSFDGNFLPDDAKVDTGGFSASWRMMYYNRPFPQQWTGQQKTLDNDKKLEEATFGVKLRLPVDQYQKTMRTSKYAIFIILLTFISLFLTEVIRKQPIHMFNYILIGAAMVIYYTLLLSFSEQVGYNMAYLIASVSTIALISVFISSLLKNGKAALLFAFILAVFYTFIFVIIQLEDLALMVGSIALFIIIAVLMYFSRKINWDKN >NZ_CP043449.1|WP_112652682.1|4701729_4703136_+|23S-rRNA-(uracil(1939)-C(5))-methyltransferase-RlmD MSKANKPKFFENVQIIDIAEEGKGVGKADDFVLFVDKAVPGDVADVQLYRSKKNFGEGKITELKQASEYRTQAFCEHFGTCGGCKWQHMTYEAQLKFKQKSVVDALSRLAKINVEGIMPIVPSPADRYYRNKLEFTFSNKRWLYDGENKEDGTLNMNALGFHIPGRFDKILDVNHCYLQAEPSNSLRNEIRDFTIQQGYTYYDLRNHSGMLRNLVVRTSSTGEIMVIVVFAYAEQSEIDSLMSHIDARFPEITSLLYIVNQKKNDTIFDQDVVAFKGPEYIHEEMNGIKFRIGPKSFYQTNSIQALRLYEITRDFADFKGDELVYDLYTGAGTIANFVAGHVREVVGVEYVPTAIEDAKVNSAINNITNTKFYAGDMKDVLVADFVAEHGKPDVIITDPPRAGMHPDVVARLMEIEAPKIVYVSCNAATQARDLLVLKEKYDTVKIQPVDMFPHTQHVENVVLLLLRD >NZ_CP043449.1|WP_112652683.1|4703308_4703704_+|hypothetical-protein MNPEELLNEGNERNKPAAESPLISLERDLKYFNDSIKEIAEEIINEGLSSYPIFIAHQHELSLGELILDRHDLNSEWSIHASTLEEFVERDVIKPVLKERFVNSYKDPYQFMCVFVVVPEGANFVFFPYAK >NZ_CP043449.1|WP_112652721.1|4703812_4704325_+|phosphoribosyltransferase MPEKKLLILNKQQIQQKLDRMAYQILEDNFDEDEILIAGILPRGNHIAERLKTILDGIAPFKSRIITIELEKQSSSLSANIDFEVEECSNKVVILVDDVLNSGKTLAYGFGVFLDVPLKKLRTAVLVDRNHKSFPITTDFAGVALSTVIKEHVDVVLDEEDGEEDAVYLR >NZ_CP043449.1|WP_112652684.1|4704441_4704945_-|shikimate-kinase MKYFIVGFMGCGKTTWGRKLAAKWGYEFIDLDHVLEAKAGMSIAEYFSSFGEDAFRKLESQVLKETEYAENTVVSTGGGLPCFFDNMDWMNANGKTLYIKLSPKTLADRLENSKTIRPVLQGKKGDELIEFITGKLAEREGFYLQASNIVEGIDMSVEKLEEALGYN >NZ_CP043449.1|WP_112652685.1|4704955_4706014_-|ABC-transporter-permease MIPYLLRKLMYGLAVMLGVVFVVFFLFNILPVDPARMTQGQRADVQSLQAVRKEFGLDKPVPVQFAYYLNDLSPLGIHLNTADEQQRYGYVKLFPVSKSKVLALKWPYLRRSYQTRKDVASLLMEVIPNTLVLAAAAMIFAIIIGVFLGVASAVNKDTWIDKLAISFSTLGISAPSFFAGIIIAWTFGFVLSNYTHLNMSGSLYSYDPFKGEVITLKNLVLPVITLGLRPLAIIVQLTRNAMLDVLGQDYIRTAKAKGLSNRTIIYRHALKNAMNPVITAIANWFASLLAGSFFVEYIFGYNGLGKATVDALEMSDFPVVMGSILFIAFIFVVISILVDVIYVWIDPRVKLS >NZ_CP043449.1|WP_112652686.1|4706018_4707182_-|DoxX-family-protein MKNTSSNSAVIWIPRLLVGLLFIFSGAIKANDPLGFSYKLVEYFEVFHITFLNGLALTMAIVLCALEMLLGFALLIGARAVKVAWGLLLLIIFFGFLTFYSAFFKVVQTCGCFGDAIPLTPWQSFSKDMVLLALVLVLFVKRKEIKPLFSAKVGDKWLICAAVVSVGFGVYTYNFSPVIDFLPYKIGANLPDEMKIPPGAPLDEFELTYHLKNKKTGATKVMNDKEYLKSNIWKDASWEVVGDPENRLVKKGYEPKIRDLAIQDEQRNDYTKELLSSPFYSLFIVAYDLSETDKDAINRLNALAINLTDNYNIRTVLLTSNSAADAKAFAKEHKLISEIFYADGVPLKSMVRSNPGVLLIKNGTVINKWHYHSVPKYEDIVKEYLQK >NZ_CP043449.1|WP_090534301.1|4707187_4707679_-|DUF1599-domain-containing-protein MKKTRDYGTAWRILRPQSITDQIFIKAQRIRTLEEKKISKVGDDITGEYIGIVNYCVIAMMQLECGPEMSTELNPDHVSQMFDEKVNETKELMFAKNHDYGEAWRDMRISSLTDLILMKLLRVKQIEDNQGLTEASEGVKANYQDMLNYAVFALIKLNIHLGK >NZ_CP043449.1|WP_167516351.1|4707937_4708708_+|dihydropteroate-synthase MGIINITPDSFFADSRKPGVDEALQQAEKMLTDGATFLDIGAYSSRPGAVDISAQEETDRLLPVVEAIAAQFPEVIMSIDTFRANVAEAAVKGGAHIINDISGGMLDADMFATVARLQVPYILMHMKGTPQTMNQLAKYDDVFGEVFDYFISKYSELKRLGVHDVILDPGFGFAKKAEHSYELMSRMNEFNILGLPVLTGISRKRMIYGLLGNTAEEALNGTTALNTIALTKGTNILRVHDVKEAAEAVQIWEACQ >NZ_CP043449.1|WP_112652688.1|4708717_4709203_+|GNAT-family-N-acetyltransferase MEILPATTNDIDEITNVEIRSKMASFPSLVEPHDIDFETRQYRWKTWFAAQSPATSKPQRVLFKAVADNSIIGYIAVHLTTRYEKDAEIQSFYVLKEYQRKGIGTGLLLNAVNWLETQHTKSLCVGIAKNTPYRAFYIKYGGGHLNEHWICWEDVAAIIMS |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP043449_8 | 4976906-4977004 | Orphan |
NA
Consensus repeat of NZ_CP043449_8
|
1 spacers
spacers of NZ_CP043449_8
>8.1|4976931|49|NZ_CP043449|CRISPRCasFinder TTGAGATATGAGATTTTTTTCTCTAACTTGAATACTCGCTTCTTCAAAG |
CRISPR arrays and Neighbor proteins around NZ_CP043449_8
The CRISPR arrays of NZ_CP043449_8 >merge|NZ_CP043449|8|4976906-4977004|CRISPRCasFinder AAAAAATAGATATGAGACGTGAGATTTGAGATATGAGATTTTTTTCTCTAACTTGAATACTCGCTTCTTCAAAGAGATATGAGACGTGAGATATGAGAT >NZ_CP043449|8|8|4976906-4977004|CRISPRCasFinder AAAAAATAGATATGAGACGTGAGAT TTGAGATATGAGATTTTTTTCTCTAACTTGAATACTCGCTTCTTCAAAG AGATATGAGACGTGAGATATGAGAT
>NZ_CP043449.1|WP_112656084.1|4974956_4976027_+|3-isopropylmalate-dehydrogenase MKKYILVIPGDGIGPEVTTWGKAVLEKIGRDFGHEFTFDEALMGHAGIEATGNPLPDETLAKAKASDAILFGAIGHIKYDNDPSAKVRPEQGLLKIRKELGLYANLRPIMLFDELLDASSLKPEILKGTDILFFRELTGDVYFGEKKRSEDRNTASDLMIYSRYEVERIAIKAYEAARVRGKRLCSVDKANVLEASRLWREVVQEIAKKYPDVETEHMFIDNAAMQLVKNPKKFDVVLTANLFGDILTDEASQIAGSMGMLASASVGDGTGFFEPIHGSAHDIAGQDKANPLASILSVALMLEISFGLKEEAKKITDAIDKALKDGYRTGDIADANTDKVKILGTTAMGQKVLEYL >NZ_CP043449.1|WP_112656086.1|4974101_4974899_+|methyltransferase-domain-containing-protein MNQQKHIQREGKGTAKLFDERSLANDYATLAPLLRPGLKVLDVGCGTGAISKDIAALIGESGHVTGIDNTEYFIQSGKETYASVQNMELIYTDLFSFDPEEKYDLIVSARVLQWLSNPVEALKKMYSLLKPGGTVSILDYNHEALQWQPQPPASMQRFYATFLRWRGDAGMNNHIAEDLPEYLQEAGFENIEVFNADEVYQKGEYNFEGKAGIWAKVAQSKQMVEEGYIDDESRLLAIDEYTNWVENEAEQMVMKLKEVRGVKPE >NZ_CP043449.1|WP_112656078.1|4973423_4974014_+|3-isopropylmalate-dehydratase-small-subunit MATKIFKHVQTSVVPLPIENIDTDQIIPARFLKATTRDGFGNNLFRDWRFDENDNPKADFVLNHPTFSGKVLVAGKNFGCGSSREHAAWAISDYGFDAVVSSFFADIFKGNALNNGLLPVQVSDDFLKKIFDAVYADHKAEVEIDLESQTITISSTGEKESFEINPYKKACLINGYDDIDYILSQQGKIEEFELAR >NZ_CP043449.1|WP_112656080.1|4972948_4973335_+|GxxExxY-protein MEKDRLTYDIIGCAMRVRNTLGNGFQEVIYQKCLAIELEKAGISFVRELEHPIFYDGIEVGKRRADFVIEGKLSVEIKALINLEDVHLAQAKNYTVAYDFPIGLLINFGSQSLQYKLIFNPKYNIKLN >NZ_CP043449.1|WP_112656082.1|4971477_4972905_+|3-isopropylmalate-dehydratase-large-subunit MGQTLFDKIWDAHVVSSSEGFPDILYIDTHFIHEVTSPQAFDGLRQRGLPVFRPKQTVATADHNVPTIDQHLPIKEELSRYQVDMLTKNCKEFGVELYGLGHPYQGIVHVIGPELGITRPGGTYVCGDSHTSTHGAFGAIAFGIGTSQVEQVLATQCLLQSRPKRMKIEVNGKLQKGVGAKDIILYIIAQISAAGGTGYAVEYAGDTIRSLSMEGRMTICNMSIEMGARCGLIAPDETTINYVKGREFAPKGKEWDKAVAYWKTLYSDADASFDEVLYFKAEDIEPMITYGTNPGMGIGVTQHVPETASFEAKEQGSYKKALDYMGLHDDETLLGKPIDYVFIGSCTNSRIEDLRQVAEFVKGKHKADNVTVWVVPGSKQVQQQAIAEGLDKIFDAAGFPLREPGCSACLGMNEDKIPAGKYCVSTSNRNFEGRQGPNSRTFLASPLTAAASAITGVVTDIREMLSESEFSELKN >NZ_CP043449.1|WP_090531289.1|4970255_4971314_+|ketol-acid-reductoisomerase MAKLNFGGTEENVVTREEFPLSKAQEVLKDEVVAVIGYGVQGPGQALNQKDNGINVIVGQRKGTKTWDKAISDGFVPGETLFEIEEALQRGTVICYLLSDAAQIALWPTVKKHLTPGKALYFSHGFGITFNEQTGIVPPADVDVFLVAPKGSGTSLRRMFLQGRGLNSSYAIFQDATGKAFDRVIALGIAVGSGYLFETNFKKEVYSDLTGERGTLMGCVQGIFAAQYDVLRSHGHSPSEAFNETVEELTQSLMPLVAENGMDWMYANCSTTAQRGALDWWKKFRDATKPVFEELYESVATGKESQRSIDSNSQPDYREKLDAELKELRESELWQAGKTVRSLRPENQVVEA >NZ_CP043449.1|WP_090530083.1|4969416_4970019_+|acetolactate-synthase-small-subunit MSEAEKKQEFNITIYTENQIGLLSRIAIIFTRRKINIDSLNTSPSEIESIHRFNIVINEYEEVVRKLTRQIEKQVEVLKAYYHTNEDVIWQELALYKVSTDVIAEKVSVERLLRENGARAVVIRKDYTVFETTGHREETDNLINILQPYGLIEFVRSARVAIIKDSEGFNSKLREFERLEPGEEVIENEYLNQGEKVFTM >NZ_CP043449.1|WP_091174116.1|4967658_4969395_+|biosynthetic-type-acetolactate-synthase-large-subunit METAQETLTAPAATETVNVSGSVALLEALIAEGTDTIFGYPGGAIMPIYDALFDYNDKLNHILVRHEQGGIHAGQGYARTSGKVGVVFATSGPGATNLVTGLADAQIDSTPLVCITGQVFAHLLGTDAFQETDVINITTPVTKWNYQVTDATEIPEVIAKAFYIARSGRPGPVLIDITKNAQIQLFDFAGYKPCDHIRSYRPKPIVRPQYIEQAAELINSAKKPFILFGQGVILGGAEQEFKAFVEKSGIPAAWTVLGAGAIPSDHPLNVGMLGMHGNYGPNVLTNECDVLIAIGMRFDDRVTGRLDKYAKQAKVVHLDIDPAEIDKNVKSTVPVWGDCKETLPLLTKAIEKKEHTEWLAKFNDYTRQEVEAVIHNELNPTTPEMTMGEVIKQLNEITKGEAVIVTDVGQHQMVACRYAKFNNTRSNVTSGGLGTMGFALPAAIGAKFGAQDRTVVAIIGDGGFQMTCQELGTIMQSGIDVKIIILNNRFLGMVRQWQELFNQRRYSFVDIQSPDFVALAAAYRIPGKLVDDRADLTAALNEMLTAPGSFLLEIMVTKENNVFPMVPQGCSVSEIRLK >NZ_CP043449.1|WP_090531285.1|4965843_4967520_+|dihydroxy-acid-dehydratase MELNKYSKTFTQDPTQPAAQAMLYGIGLTDDDMRKAQVGVASMGYDGNTCNMHLNDLAKLVKQGIWDEDMVGLIFHTIGVSDGMSNGTEGMRYSLVSRDIIADSIEAVTGAQYYDGLITLPGCDKNMPGSIMAMGRLNRPSIMVYGGTIKPGHWKGEDLNIVSAFEALGKKIAGQIDDVDFMGVIKNACPSAGACGGIYTANTMAAAIEALGMSLPYSSSNPALSAEKKAECLAAGKAIKVLLEKDIKPSDIMTREAFENAIVVIMVLGGSTNAVLHLIAMAKSVDVKLTQDDFQAVSNRIPVLADMKPSGKYMMEDLHNIGGVPAVMKYCLEQGWLHGDCLTVTGKTIAENLAEIPALEFETQKIIKPKENPIKATGHLQILYGNLAEGGSVAKITGKEGERFTGPARVFDGEFELIAGIQSGRVKKGDVVVIRNVGPKGAPGMPEMLKPTSAIFGAGLGSSVALITDGRFSGGTHGFVVGHITPEAYDGGFIAMVKDDDIINIDAVANTINVSLPQEEIAARRAAWQKPALKVTKGVLYRYAKNVTTAAEGCVTDE >NZ_CP043449.1|WP_091174059.1|4965108_4965360_+|ATP-synthase-F1-subunit-epsilon MTLEILTPDKKVYEGEATSVTLPGALGLFEILNNHAPIISTLQDGKLTVRGGAAKEEVFFIKGGVVEALNNKVTVLAEGIQHK >NZ_CP043449.1|WP_112656074.1|4977121_4978288_+|2-isopropylmalate-synthase MLHDPNRVYVFDTTLRDGEQVPGCQLTTPEKIEIAKELELLGVDIIEAGFPVSSPGDFQSVVEISKAVKEPTVCALTRANKGDIDAAVASLQYAKRPRIHTGIGSSDMHIKHKFNSTREEILERAVEAVKYAKKSVEDIEFYAEDAGRADVVYLAQMVEAVIAAGATVVNIPDTNGYCLPDQYGSKIKFLKENVKNIDKAIISVHCHNDLGLATANSIAGLQNGARQIEGTINGIGERAGNTSIEEVVMILKTHHTLGLHTNIDSKRFYELSQMIRTQMRMPVQPNKAIVGANAFAHSSGIHQDGFLKMRENYEIIRPEDVGFPSATIVLTARSGRHALKFHLERLGYTLDKEELGFVYNNFLTLADSKLDINDQDLQSLMAHRLVKN >NZ_CP043449.1|WP_112656070.1|4979680_4980703_+|proline-iminopeptidase-family-hydrolase MKKLFFILIAACFCACKNPSKPAADTATESSNTPYEIKTGGNKLIKVAGKYNVWTKKVGDGKIKVLLLHGGPGFSHDYMECFEDFLPKEGIEFYYYDQLGCGNSDAPADTSLWNIPRYVEEVEEVRKGLGLDNFYILGHSWGGMLAMEYLHKYQSHVKGAVLSNMTAGIKGYVAYAAELKKKFFTPRDITVFDSLDRLKQYDSPQYNDLLMNKLYTQVICRLPLENWPEPLWRAFKKANHTIYIQMQGVDEFHVTGNFKGWEFWDKLQNIKTPTLVLGGVHDEMNPEDMKKEGRLLPNSRTYLCPNGSHMSMYDDQQNYFKNLIAFLKDVDAGTFNADKK >NZ_CP043449.1|WP_112656068.1|4980757_4981597_+|DUF2911-domain-containing-protein MKKLFTCIITTMIFTAVNVYAQLTPQPSSTQSIVQDFGLGKISLVYSRPDVRSRKIFGGMEPYGKVWRTGANSATVIKFTDEVSMEGNKIPAGEYGLFSIPGENEWTIILSKQPKQWGAYNYKEADDFLRFKVKTEHLKALTETMTLAFSNVTATTCDLQMMWEHSGFTIHMTTDIDVKVMARIDSAMNTDKKPYYEALIYYYNNNKDMDKALAWATELEKDKNFPPFVPKLWKARILLKKGDKAAAIATAQEGVKMATDMKTDEYVRLNNELIAQAKK >NZ_CP043449.1|WP_112651380.1|4981818_4982544_-|DUF2490-domain-containing-protein MQLKKQLLIFAVLLLAAPARLLAQDNQFSGWAAIFHSQKLSEHWGYSFDGQLRSHDEVSYLKHILLRPSVNYYFAKNKVGALGYAYIATYGRTPSNETTFRPEHRIWQQYTYTHKLTKHVQLAHRFRLEQRFLGNTADNKNDRYFAQRFRYFARAVIPMKPDSDVFTEGTFVALQNEAFVNVQNKNKVNKHFFDQNRAYVAVGYRFSKSFDAEAGYLNQYIKQADAYVVNHVAQLAFYTRF >NZ_CP043449.1|WP_112651382.1|4982685_4983711_-|hypothetical-protein MFKRIYLFVFFSIITATAALAQGSLDIHFNGLGFMDNREYKDFVARSRTYSGVRTTLDLGLNVDSLNHFIVGVNGIHEFGAKPYFLKVNPVAYYSFTGKNWLFNAGAFPREGLLDDYPRALLNDTLRYYRPNVEGLLTRFHNAHFTETAWIDWVSRQTVTDREQFLFGFSGKYRPSLTGPFYVSHYFLLMHDAGAEVLLPNDHIQDNGGGQIRLGLDLSHKTILDSLSIEAGGMMSFERVRGVDGFHKPAGFVANAYLSWKRFALFDEFYKGQGSHIIYGDAFFEKKTYNRLDIIYTPFLYKRVKGQFILSLHQTPGYSSNQEAFRVTVDLGRQTLVRFKD >NZ_CP043449.1|WP_112651384.1|4983721_4985443_-|glycosyltransferase-family-39-protein MQDTSLNSAPVKYNKPIIYFLLLWALLNAVQAFTLEIHADEAYYWVYSRFLDWGYYDHPPMVAVFIKAGYSLIHNEFGARLFTVISTTASLYLMWMMLKRYRVDAINFILVISGIFVFHIYGFTTTPDAPLLFFTVLFLYFYQQYIEEDSLKLAIILGVVIACLLYSKYHGILLVAFTLVSNIKLLRRGAFYGIVLLALALYAPHILWQVNHDYPSISYHLSERSADDYQLDNTYLYPLGQLIMAGPLIGWFLFYKGFTTKIQDVFTRTLLVNSIGILAFFFLTSFKGEVQLHWTLIAYVPLSMLVLINFARPGGKPVWFNRLAVINVSLILLVRICIIWGPPFLLKIDAMKSFFGFKDWAHQIKQKAGDNYVIFYEGFQDPSKYNFYNNTTRGLAYDSRHYRRTQYDIWPIEDSLQHKKTYYVLDVWLPGVTTDSINVFAGKWYGGWVNDTRTYQRVEFETKAHKETVSPGQKIDFDLTLKNPYPFAIDFSNKNQKHPVFFEACFFKKTDQISNQNADDSFYNIALKPGESTHFKFNVTAPEQPGRYQLIFSLRTEPFFGGRNSKSINITVK >NZ_CP043449.1|WP_112651386.1|4985593_4986073_+|DinB-family-protein MKQEFEVIKKPRLMLLNVVKDLSPEQLNHIPAGFNNNLIWNLAHMISGQQGICYTRAGVPIVVDDKYYTPYRPETKPQSFINADDIAEVKELLISTIDKMEEDYQTRIFSNYQPMTTRYGVTLSNIEEAIRFLPFHDGLHTGYIMALKRAVLEEMSKLV >NZ_CP043449.1|WP_146750324.1|4986069_4987338_-|hypothetical-protein MQSKAQTTTQIINEIKAINSKVTNYSNSTGHGMLISNKSMNIFLSNKVSSYLSGNEDLSLYKNYVNINAEEGMISINHNFHQPVDSDDWVRSFVVAGARVNIANAYSAKSANRYYDNQLGFTIQKTWMGKPRTYYAANGDFKKEMDAARALIVNTIAQSINKKAEEFEQSLNALKQEEVPGQNLNEVKSKLRKTFYASLRADYLQQFSEQQSELLVNTGSYDLIADNWTSLGVYIPVIPQKFMVSNDVKAQVNRYYNYPLELFVSHTRFWESPKLGRFFLTFASKGFVNNAVQSGSLFSADVTGAQGADGINIVTINKGDRYIGQYKNFITPVAAGKLVYIPGTSHVGISFRIEKNWGTYKALNSIIGIPIVLIDKKGVPTINFEAQLLLQDMNNSLKNTRLPYNKTAIGLTVGIPFSKIVY >NZ_CP043449.1|WP_112651390.1|4987415_4988924_-|FAD-binding-protein MKVIKTGVSSWENRHETFSEQIKDLYELGNEDNLDALEGYNDATKGLQNLIKEAIETGTPLRSLGAGWSWTKIATVKDGVMLDTKPLNTRFTVADTAVNPAYAGNKDHLLFAQSGNGIWELGAFLKNRGLSLKTSGASNRQTIAGAVSTGTHGSAFDFGATPDFVVGLHIVVSPDRHIWLERASAPVVAQRFVDLLQTELVQDDELFNAALVSFGSFGIIHGVLMETEPLFLLETYVQRLPYDTELQGMMATLDFSDTDKLPCSNERPFHFSVLLNPYDLDKGAFVTTMYKRPYRTNYQPPVDNAAGIGPGDDAASFIGTITDAVPALVPVVVTKVLNISMTTNTDPHFGTLGEIFSNTTLRGKLLSSAIGFPAELSPRVADLMLKINKDIGPFSGVFSFRFVKQTKATLGFTRFVHTCIMELDAPLSDKAYNFYSQVWLMLEHENIPFTFHWGKANEITPQRIQRMYGDAATRWINARKTMLNADCQKVFTNQITQQWGLA >NZ_CP043449.1|WP_112651392.1|4988930_4989287_-|hypothetical-protein MLPSRLNRLLHFYRKLIPAVLISLMLCSCYTARVETKAQAGSEVSHQNVNFFFWGAIQSPKRIVTPICDSLGSNGMAEVTVKNNFGYSLLTVVTLGIWSPARVEWKCGKPCAKDGVIK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP043449_9 | 6146197-6146355 | Unclear |
NA
Consensus repeat of NZ_CP043449_9
|
2 spacers
spacers of NZ_CP043449_9
>9.1|6146221|33|NZ_CP043449|CRISPRCasFinder GGGGAGGGGGAAAAGGCATGGAAAGTTCTACCC >9.2|6146278|54|NZ_CP043449|CRISPRCasFinder TCTACCTTTCTTCCCCCGCGTTGAACATATACTTTGGATAACCTCGGTAAAAAA |
cas3 |
CRISPR arrays and Neighbor proteins around NZ_CP043449_9
The CRISPR arrays of NZ_CP043449_9 >merge|NZ_CP043449|9|6146197-6146355|CRISPRCasFinder AGTTGTCGTGAGGACACGACAACCGGGGAGGGGGAAAAGGCATGGAAAGTTCTACCCAGTTGCCGTGTCCCCACGGCAACATCTACCTTTCTTCCCCCGCGTTGAACATATACTTTGGATAACCTCGGTAAAAAAAGTTGCCATGTCCCCACGGCAACA >NZ_CP043449|9|9|6146197-6146355|CRISPRCasFinder AGTTGTCGTGAGGACACGACAACC GGGGAGGGGGAAAAGGCATGGAAAGTTCTACCC AGTTGCCGTGTCCCCACGGCAACA TCTACCTTTCTTCCCCCGCGTTGAACATATACTTTGGATAACCTCGGTAAAAAA AGTTGCCATGTCCCCACGGCAACA
>NZ_CP043449.1|WP_112656417.1|6142777_6146125_+|transcription-repair-coupling-factor MNIRDILDRYKADDRIKTLATALNASKNPRVQLRGLVGSSDSAMAVALYFLQHKHMVFVLPDREEAGYFQADLENLTGKEALLFPSSYRKPFEFTQPDSSNVLARAEVLNELNHSSEYGQLIVTYPEALAEKVIDRSSLEKNTLEIAVSNKLSIDFINEFLIEYDFERVDFVYEPGQFSIRGGIVDIFSFSHMLPYRVEFFGDFIESIRTFEIESQLSVEHVKSITIVPNVQSKFLTENNISLLEYVEAGTQVWIKDVQFTLDIIQTGYKKAVNLWKALSADEKAQNPDWIDPKFGFTDEKLIGDQLHDFPVIEFGKQFFYHDATAINFDMRPQPSFNKDFTLLIHNFKNNEADKIENFIFTDSAKQVERLYAILDDLDKTVKFTPVSISIREGFIDHEQKIACYTDHQIFDRYYKYKTRKGYQRSQAITLKELRDLKPGDYVTHIDHGIGKYAGLEKVDVNGKQQEMIRLIYADNDLLYVNINSLNRISKFSGKEGSVPKMNKLGTDTWERLKKTTKKKVKDIARDLIKLYALRKAQHGNAFSPDSYLQTELEASFLYEDTPDQEKATADFKKDMESPHPMDRLICGDVGFGKTEVAVRAAFKAVADSKQVAILVPTTILAAQHYKTFTDRLKGFPANIDYVNRFKSTRQIKDTLEKLKEGKVDIIIGTHRLVSKDVKFKDLGLMIIDEEQKFGVSTKEKLKQMRANVDTLTLTATPIPRTLHFSLMGARDLSIISTPPPNRQPVVTELHVFNDKLIKEAVEFEIDRGGQVFFIHNRVADLPQLGGMIRKLVPKARIGIAHGQLEGDDLEDVMLKFVNHEYDVLVATTIIEAGLDIPNANTIIINYAHMFGLSDLHQMRGRVGRSNKKAYCYLLSPPLSTLTSEARKRLSAIEEFSDLGSGFNVAMRDLDIRGSGNLLGAEQSGFIAEIGFEMYHKILDEAIQELKEDEFKGVFPEDKPRPYISFTQIDTDLEILIPDEYVTNLSERYNLYTELSKLENEVELQAFQQKLHDRFGPVPAQVDGLLNTLRLQWLGKAIGFEKISLKKNVLRGYFITNQQSSYFETEAFRNVLDFVKNNPRRTNLKEVKNTLRLGIEGIDSVDEALRMLSEVAGII >NZ_CP043449.1|WP_112656419.1|6142210_6142621_-|DUF4199-domain-containing-protein MKNAVLSGGIIGVLSIIWIFAMPRLGVMPQKDVVAPVEYFSFIIPAIGLFFGIMSYRKNECNGQMGFLEALFQSFKILIVGGIIAVFGSILYISYVSSSEANIKDFSERIFGALIVGVLLAFAVSLLFTNKANKLD >NZ_CP043449.1|WP_091176158.1|6141909_6142164_-|DUF2752-domain-containing-protein MLSLAMTSPTEASHFTLCPLKLMGIGWCPGCGLGHSIIYLFHGDISNSFRAHWLGIPAVAVIFNRIYVLTKARLLERNQFKSLT >NZ_CP043449.1|WP_112656421.1|6141031_6141841_-|AraC-family-transcriptional-regulator MQIAPHPLLSDIVKHYLIIAHDQRVALNYRLFSDGNPGMVFHLKAPLLQYNQQHTVASKQPGSFVYGQITNYNDIVSVGELAMLIVVLQPNTLLSLLGVAACELNNNTVPLKDLFGQETFDLEDQIANAANLPAATVITEQFLLNKMASKRKRADITDRAINIIHANKGIINVKNLLDVMPVTERQLERKFDEEVGISPKKYIDVVKFQNYLKQLQKLSSIKELSSLSYACGYYDQAHLNNFFRKHTGLTPLQYKANHHLLAINFMPLV >NZ_CP043449.1|WP_112656534.1|6140018_6140960_-|nitrilase MENLTIATAQFENRSGDKAYNLSVIEKMTADAAAQGAQAIAFHECSITGYTFARKLDKAQMLELAEEIPCGPSIAALTEYARRYDIAILAGLFEKDKDDNLFKAYLCADKTGIVAKHRKLHPFINPHLLPGTAYTVFDLYGWKCGILICYDNNIIENVRATTLLGAQIIFMPHVTMCTPSTRPGAGFVDPELWKDKEANAVLLRQEFDSLKGRQWLMKWLPARAYDNAIYAVFANPIGMDDDQLKNGCSMILDPYGDIIAECRSLNNELVLAELTADKLTKAGGYRYIQARKPELYRDIIGREHESFQKVVWL >NZ_CP043449.1|WP_090533497.1|6138660_6139932_+|serine--tRNA-ligase MLQVSYIRDNREQVLERLAVKNFKQPQLVDEIIELDDKRRSTQTSMDNVSAEANAAAKQIGELMRAGKKEEAEGLKGKTGAWKEEIKKLGDLLTITEEELYQKLVLLPNLPHSSVPKGLTPEDNEVVLENGTRPELPADALPHWELAAKYNLIDFELGVKITGAGFPVYKNKGAKLQRALINYFIDEAEKAGYSEVSVPLMVNEASGFGTSQLPDKEGQMYHVGVDNLYLIPTAEVPITNLYREVILKEDQLPVRNCGHTPCFRREAGSYGAHVRGLNRLHQFDKVEIVTIAHPDKSYEILELMSSHVQGLLQKLGLPYRVLRLCGGDMGFGSALTYDMETWSAAQQRWLEVSSVSNFETFQSNRLKLRFRNADGKTQLAHTLNGSALALPRIVATLLENNQTEKGIKVPEVLVPYTKFEWID >NZ_CP043449.1|WP_112656423.1|6137812_6138541_-|16S-rRNA-(cytidine(1402)-2'-O)-methyltransferase MNNPIGKLYLVPTPIGNLEDMTFRAIRVLKEVDLILAEDTRTSAPMLKHFEIHQKVFAHHQHNEHQSSNEIIKFLLQGKNIALISDAGTPAISDPGFFLVREALKFNIAVECLPGATAFVPALVNSGFPTDKFCFEGFLPLKKGRQTRYKFLAEEERTIILYESPHRLLKTLDEMATYFGADRQISVSRELTKMFEETVRGTVVEVKQYFETHPMKGEFVMCVAGAAAKPAKGKYERDEEED >NZ_CP043449.1|WP_112656425.1|6136025_6137660_-|apolipoprotein-N-acyltransferase MKKNLPLAILSGLFLWIAWPPTPYTTFLLFIGFVPMLLAIENIINDDKPKKGKRVFNVTFIGFFIWNSLSVYWVYNALKIVGEIVAIPITLIPYSLGPLLMATAIWLYYRFRLVAPRWVALIGLVCFWIGYEYLHQSWDLYFPWMTLGNGFAVSHQWVQWYEYTGVYGGTLWIWVVNILAFLIYTSLREGQTKRHRMALIMAIVIVVTVPLGYSLSVYHNYVEEVNPSNIVIAQPNIDPYEKDGTIPPASQLDILIQLSRQVAQPNTEFFIWPETAIPAPVYINEEQIGQNDFIKQAQIFLRKYPNGNLVTGAETYRLYNNRATPTAIPSPWGGDQFADFYSTALNIENGDRIQTYHKSRLVPGAESLPFGDALSFLKPVFEHLGGATGNYAPEKDAKVLYSQSGIGVDPVICYESIWGGYIARSVKKGAQFIAIITNDGWWENTSGKDQHLDYAKLRAIETRRWVCQSANTGISGFINQRGDVVKHTEWWTKTSIKQDINLNSELTFYVKHGDYIPQAGSIFAGIGILFLLGMRLRKKQTLTV >NZ_CP043449.1|WP_112656427.1|6135424_6136006_+|TetR/AcrR-family-transcriptional-regulator MARSKDFDEAEVLSKAVCIFWHKGYNGTSMQDLVDGLGISRSSLYDTFGDKHALYIKALDSYQKAGGNQMCDIINNSASAKEAIQKLLELTMRDLLNDEQRKGCFMVNAEIELAPHDVEVKNVVCRNEQQFEDAILQAIKKGQASGEVRNSQDSLALARFIMNAVRGMQVSAKATADKAFFDDIIKTTLSVLD >NZ_CP043449.1|WP_112656429.1|6134574_6135324_+|glucose-1-dehydrogenase MKKLENKVAVVTGASKGIGAGIAKSLASAGAAVVVNYASDKNGADKVVAEITAEGGKAIAVQGSVAKKADVDRLFAETKEAFGGVDVLVNNAGVYQFTPIEAVTEEEFHRQFDINVLGLLLATQGAVNSFGDKGGSIINISSTVTRITPPQSAIYTGTKGAVDSITQVLSKELGPKKIRVNAINPGMVETEGTHTAGFIGSDFQAQIESTTPLGRIGQPDDIAPVAVFLASDDSRWLTGEIILASGGVR >NZ_CP043449.1|WP_112656415.1|6146370_6146892_-|transposase MEFDEIYFYTATINKWIPLLQSDKFKHIVLNSLIHLVKQRKIEIYGFVIMPNHIHLIWSGSEMNGKEKPFASFIKFTGHQFLDELRATDNPLLVKFKTDLKNRNYLFWQTNSLPIRVFDRKMLEQKLDYIHLNPLQAHWNLTDDPNNYYFSSCSFYEQEDKKFDWLIHYMDVM >NZ_CP043449.1|WP_112656413.1|6147090_6147591_-|hypothetical-protein MKRIFIAFSVLLLIIAVGMSLTGYTLAIPLSQINSDRLNTPLPKARSDQNLQPLSDCDFSKGNWTAYIVISTDDFNDLNPLIGKRVCWKTNSKALLMKMKKDWVFKYRENSDMGTVNSSFYLVQDGVMVFESGIVLDKNNQGLQNSKYGWMQPVNGMAFCKYLQGL >NZ_CP043449.1|WP_112656411.1|6147687_6148188_-|hypothetical-protein MKYTSLLLSGCIAASFLFSSCDKKGPATSTVAITITDGQTGAASVGATVKLYDDVNKPNTGEAPSYTLTTDASGKATAVVAYIGEYYIVAEKGTQKSYYNGLIPIGIFKTQADIDSSPKQTPAAIIGSVKFKDTNNDGVINDSDKAKAPNLFLQEGQTLNYSLAVY >NZ_CP043449.1|WP_112656409.1|6148303_6148843_-|hypoxanthine-phosphoribosyltransferase MTKQIADLEFEILLTADKIEERVKAIGAQLNEDYNNSVPVFIGVLNGSFLFIADLIKQVSIPCEINFTKLASYYGGTSSTLKIREDIDLTVDIKGRDVLIIEDIVDTGNTAHYLIQKLKEREPASLRLCSLLLKPAALQKKIEELKYVGFEIENEFVVGYGLDYKEMGRNLKDIYKKVG >NZ_CP043449.1|WP_112656407.1|6148990_6150223_+|insulinase-family-protein MIDYQLYTLPNGIRILYKHWPSAITHCCFIVNAGSRDEAPGQGGLAHFIEHLLFKETERRNTSQILNRLELVGADLNAYTTKEYTCIHASLLNQHLDRTMDLFEDILFHSTFPDDEQEKERGVILDEIASYLDQPEEAIQDDFEELLFKEHPIGQNILGTPETVGRLNGDDIRGFIAANYNTTEMIFAVHGNYEFRKLAAMSEKYFGHVPLNELKKNRVKPVQGTGSIHIVNKPISQTHCIIGTQAYSSSHEHKWGLLLLNNLLGGVGMSSRLNLEIREKHGIAYTVESNYTPLTDTGIFSIYFGTDTEKANKASKLIHKELKKLREQKLGSLQLHQARQKFIGQIALAEENRMSLIIAMAKSMIDFNRVDTLEEIFAKINLVSAEQLLTISNEIFDNNRLITLLFEPKQ >NZ_CP043449.1|WP_112568308.1|6150240_6150813_+|peptide-deformylase MKYPIIAYGDPVLRKKATAIEPDEYPHIKELVENMFETMYAARGVGLAAPQVGMSMRLFVVDATPFDDDEPELKDFKKAFINATILEETGEEWGFNEGCLSIPDIREDVYRKPVVRMSYYDADWKHHEETFKGMAARVIQHEYDHIEGKLFTDKLSPLRKRLIEKKLNDISKGMVDVDYKMKFPNVKKGR >NZ_CP043449.1|WP_090533817.1|6151007_6151892_+|sugar-phosphate-isomerase/epimerase MTTRRSFLKTSALLSAGLLAAPNLFAYDKKYIGLQLYTVRDAMAADPVAALAKVAKTGFTSVEGATYTGTELFYGMRPGDFANVLKQNGLIMPSAHYRLGEELVNGEQQKGTIMNDWKKAVDDAAEAGVQYMVCAYLSQSERGNLDHYKNVANMLDIAGETCKGAGIQLCYHNHDFEFIQENGKYPYEILLENTDKDLVKMEMDLYWVNKANQDPIALIDKHPGRFPLWHVKDMDKTPEKKFTEVGNGVIDFKKIFTQAKKSGLKYFFVEQDVCPGDPFVSIAQSISYIKKNLV >NZ_CP043449.1|WP_112656405.1|6152149_6153517_-|MFS-transporter MPANKLPLSKQLAYACGMIGWSIMTNIIIVMLPYFYLPPNNAGLTTLVPQLLLFGLFNIMSVITASGRLVDAFFDPFIASLSDKSENRRGRRIPFMQWAILPAALFCGLTFYPMVKGESIHNAYWLTFTLICFFMGATAYIIPYNALLPELTRTGSERVKLSSLQQVGFVIGIILSAMVNNFADWVQRFAGTPNRDTAVQYTIWGLAVFAGLVMLVPVFAIDEKRYSNGHPSHLSLLPAIQKTFQNRNFKYYLISDFSYYMALSIISSGLLFFLKSLLNLPESMGGELMATMVLVSLLFYPLVNYLSQKIGKKPIVLFSFGLLSLIFVAIFFLGKLPFTPQVQIYTLVISASFPLASLGILPNAILAEIAQNEAKRTGENREGMFFAVKYLFVKLGQTLGIALFAFLTIYGKDPGNDYGLRLNGVCGCVLCLLAFVFFNRFRERRNRGKKRTAHH >NZ_CP043449.1|WP_112656403.1|6153571_6154381_-|molybdopterin-dependent-oxidoreductase MKLTKKLNKIFSKNKGSKKELTVEQKISRRNFISFGSFLVLGGAAYGGWRWLYNSPNEAPGITGEAHKPLRAVLNANEKVVRQLYSNKNLVKTYPKEMAAKVVRHNSDIGSEGLIDVDAWKLSVKRQSGEMLSIGIDELKKLPKTEIIYDFKCVEGWDQISHWGGVKFSDFIAHFKLDAETRFEYVGMETPDKEYYVGVDMPSAMHPQTLLAYEVNEKPLPPKHGAPLRLIIPVKYGIKNLKRIGSITFSNSRPRDYWAEQGYDYYAGL >NZ_CP043449.1|WP_167516287.1|6154355_6155063_-|thiosulfate-reductase MKIIKEKHPLLMRWTHWVNFPILTIMIWSGLLIYWANDTYTFTLFGVTFIRFFPQGFYDALHIPRRLAEGMAFHFLFMWFFALNGLLYVSYTIISGQWRELVPNRHSFKEAWLVVLHDLHIRKMAPPQNKYNAAQRIAYTAIIVMGFGSLITGLAIYKPVQFNYLAWICGGYHLARIWHFVLTIAYVLFFLVHVVQVVLAGWNNFRSVISGFEVIDEKPLPAIQIQNDETNEKTE |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
3487943 : 3498358
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP043449|3487943:3498358|DBSCAN-SWA ATCAGGTTTGAAGGGTGTAATACTGGTATTGCAGCTCAAAGCTTTCAATTGCGATGCTGTTTTGCTCGGCATTCAAATCGCTCAGGCTCCATTTTTTGGGCCAGGCCTGCACTACGTTCCAGGTTAAAAGCGGGTCATGTTCTTCGTTAAGCAGGCTTACAGTCAGGTCTTTGGGTTCGATAATAAAATTATTGAACACGTTATTGCACCAGTCCATCAGCAGTTTTGATTCGGTAATGAGCCCTCGCTTTAGCACCAGTGCCGGAAACTTGGCCCTTACCGGCAGCGCGTGTTCAAAACGATTTTCGCCGCCCTCTTTTTTGGATTCTGTTTCCATCTCCATGGATAGGCCCGATACGTTCTGGAACCGGCTGTCGATTTCTTCATCGCCAATGCCCGTAAACACCACCTTAAAGTGAAAACCTAAAGGCGGATAACCCGATTTAGCGTCGAAGGCCATGATTAGCCGTTTTTGATGGTTAAACCTTCGTGTGCAACCTCAAGCGTTTCAATTGCAGCCTCGTTGGCATCTGCTTTCAAATCGGCCGACTGGATCTTAACCGGGAAGGCATTTTTAACCGACCAGGTAACTACCGGCTGATGCTCTTCGTTAAGCAGGTTAATGGTAAGGTCGCGGCGATCTATCTGGTTTAGTTTTACGGTGTTCATCCAGTCATAAAACTCGTTATCATTAGCAAATGTGCCGCGTTTTAGGGTGATGTTACCATATTTGCGCATGCCGGGCATTTTTATCTTATTGTATTCCTTACTGCTACCTTCGCGGTATTCAACCACTTCGGTTTCAATTTGTAAGCCGGTTACTTCGGTAAAACCGATGCGTGTGCCACCCCAGTCAACCTGGAAATGGAATTTGGGTAGAGGATAATTTGCCATTTTATTTGTTTTAAATGAAATTCATTGATTTATATAGAACCTATTGATTAGTTTCACATAAAGCGATTAATAATAAAGCCGTTAGCTTTAGGCTTTGTGCATTCTGCTTTAAGCTTTTATATTATTAACTCTCCTGCATTTTGTGCGAGAAGCGGAGGATGATGAACTCGGCAGGCCTTACTACAGCCATACCAATTTCAACGATCATGCGGCCCTCCAAAATATCGAGTGCGGTCATGGTTTCGTTAAGGCCAACGTGTACAAAAAATGCGTCTTCGGGTTTAGCGCCCTGTAAGGCCCCGGCACGCCATTGCAGCGTTAAAAAGTTCTGGATCATCACCCGCACCTTTGTCCAGGTATTGGCATCATTGTTCTCGAATACAAAAGGATAAGTAGCCTTTTTAACCGATTCTTCAACCATAATAAAGAAACGCCTGACAGGGATATAACGCCATTCATTATCATTACCTGCTAAGGTACGCGCGCCCCAAACCAAAGTGCCCCGGCCCGCAAAGGCTCTGATAGCATTGATTGATTTACCAGTGGCAGTAACATTCATATCATCCTGGTCGCTGTTATCAATTTTCACAGCGGGGCCTATGACATTGTTAAGGCTTACGTTGGCAGGTGCTTTCCAAACACCCCGGGTACCGTCAACATTGGCATACACCCCTGCTATTGCCCCAGACGGCGGAATAGTGCTCATTTTAAGCGTTATTGCCGAGTTAGCAGCGCTGAAAAATACGTTGCCCGAGAATAAAGCCTGCTCAGCAAGGTTTTCATAAAACAGGGCACCATTTATTAATGCGCCAAAACCAGTTAAAATGGTGGCAACTGTATCCTGTAATGCCTGTACAATGGAAAGTGCACCGGCTGCATCTTTGGTAAAATTATCTGCATTGGCAGCTATCGCCGCATAGGTTGAATTGGTTATCCATGCTGTATTATCCAATACCGAATAAACAGGCGTTGGATTACGACCAGCGGTTGTATTAGCAGCCACCTTGACATTTTTTTCGTAGGCAATGAGGTTGGTGACGGCAGCGGAAAGCTTTGCATCTTTGCTCAATGCATCAATATCTTTGCCATATAACGCACCAACAGCGGGCGCATCTTTAGCCTTTTTACCCGCGGTAACAATGGCGGCCAACAAATCGAGGTAACTGGTTAGCTGTACATCGGGCGTTATATTTTTGGCAATATTATTTGCCAATGCGGTTAACTTTGACTTAACTGCATCAACCCCGCCTACCCTCAACGCAGTTGCATCGGCAGCGTTGCTAAATACAGATCCGATTACTTTATCAGTATCTTTGATAGCATTTTGCAGGTTGGTGACCAGCGCCTTTTCATCAGTGCTTTTTGAAAAGCCATCATAATTGGTGATCTTGGCCGGTACCCCCGCATTGTTCCAAAACTCCATCTGCCTGAAATCAACATTGACATTATAAGTGGTTATGATCCAGGGATAATAAGCCGCGCCGTAATTGAGGTTGTTGATCCCTATCGAGCTGCGGAAGTTGTCAATAGGCTTATTGCTAACATCCTGGCGTTTATCGCCCTGCAGCACATCAAGTATTGAGAACCTGTCTTGCATAGAAGCGCACAAGCCCAACGCTTTGGTTTGCAAACCGCTAAAAGCAACAAGGTCGGGGTTACCGCCGCTTACCAGCTCAACCGCATCAGGAAACAGGATCAGTGTAGGCTCATCCTCGGCCGATAGGGTATCGATACCTCCTGATATGGGTGGCGCTGTTACTGCAGATGAGAAATCTCCCACCGAAATGATATAACATGGCCCGCCGCCATTATCAAAGTACTGCCTGAGAGATACATACAGGTGAAACCGCTTATCGGGTACCACGTTCAGGATCTGGTTTGTAGCCGGGTTAACAGTAACCTTAATGGTTGCCGGTACATAGCCGCCGCCAAATAAGCTTTCAAACTCCAGCAACGATTTTATGCGGGTTGGTACATTTACCAATGATTTGCTTTCAGGAGTTAATGCAAAAGCTGTAAAACCTATAAAGGCGGGTATAGCGGTTTCAACCTGGGCAACTGATGGCGGAAACAGTGCAATCTCCTGCACGTAAACTCCGGGTGTTTTATAAGCCATGACTAATTGATTTAAGATTGAACCATTTTATGACTGAAACGAAGAATGATAAACTCGGCCGGACGGACCACAGCCATGCCTACCTCAACAATAAGCCTGCCTTCCAGTATATCCAGCGAGGTCATCGTTTCACCAAGACCAACGTGTACAAAGAAGGCATCTTCAGGCTTGGCGCCCTGTAAAGCACCTGCACGCCATTGCAGTATTAAAAAGTTGGCTATCATTCCACGTACCTTTGCCCAGGTATTACCATCATTAGGCTCAAATACAAAGGGCTGGGTTGAGTTTTTAACCGATTCTTCGACCATGTTAAAGAAACGTCTTACCGATATGTAGCGCCATTCGTTGTCATTACCTGCCAGCGTACGTGCTCCCCAAACCAGTACCCCCTTGCCCGAAAATGCCCTGATAGCATTTACTGATTTACCGGTACTGGTAACATTCATATCTTCCTGGTCGGCATCATCAATTTTTACCATTGGGGCTGCTACCATGTTAAGGCTCACATTTGCCGGCGCTTTCCAAACACCCCGCGTATTATCAACAGAAGCGTATACACCTGCTATTGCCGCTGCGGGAGGTAAAATAACCGAGAACAGGCTGGTTTGCTGTTTGATATAATTATACAGATCACTGTTTAAGTGATAGATAGAATTGGCTGTGTTGGCATCATCGCGCAGAACGGCAGTGGCCGGAAAGTCCGTTGGTGTAACACCTGTTCCGGAAACGATCAGCTGTTTGCTTTCATTGCTGAAATAGTTGATGCTGGTTTGCAGCCATGGATAATAGGCAGCACCATAACTAAGATTATTAACCCCGATAGCTGTGCGGAACGCAGTTGCGATATCGCCAAACGCTTTTTTCCCTTTATCGGCTACTGCCACATAGGGGTCAATAAGTACTACCCTATCCTGCAGAAATGCAGCCTGGGCCAATGCCGCAGTAAACAAGCCATAATAATCTGCAGGTGCGGTAAGGGTTACGGCATCAGTAAACAGCAAGATTGTGGGCTCATCTTCCTTTTGAACCGCGGCAAGCGCGTTTCCAAGCAAAGCTTTGTCAACATTAACAGCTGCACCTGTTTGACCTGCCGAAACAATATAACAAGGGCCGCCGCCATTGGCAAAGTAAAGCTGCAGGCAATAGTACATCCTGAATTTCAGGGCTGCGTAAGGATCGTCAACCGCGGCATCAAATGAAAATACACCGGCCGCCTTTTTGATTTTGATACCGGTTGCCGTTCCGGTTGTGGTGAACAGGGTATTCTCGGCAAAACCAAAAATCTGCTCATACTCTTTAAGCGATTTGATCCTGACAGGGTTGTACTGCAGCGATGCCCCGTTATAGCTAATATTTTGTGTATAGCCTATAAAGGCCGGGATAGCCGTTTCAACCTCGGCTACCGATGGAGGTAGGAGCGGAATTTCTTCCACGTAAACACCCGGAGTTTTGTATGTTGTTGCCATATTACTGTTTTTTACTTCCTAAAAAAATATGTCTGAAAAAATCTGGTTGGTTTCTGGTTTAATAAGGGATGCCGTTGCATTGGGCAGGTCGATAGTGGTTGGATGATTATTTTTATCCTTCACCTTCACCGATATAAAACCCTGCCTTGTAAGCGGTAGTGCGTTATCGCTTACTGATGCCGCACCAAAAGCTGTACCTATATACCGCCAATAAGTGGCCCTGTTTTTGAACCGTAAACTAAACACCGGCGATTTAAGTGTGGTATCCGCCTGTAGCAGGTTATAAGTTGCACCGCTTGTTTTAACACAGATATCAATAAAAGCCCAGGCCGATGAGTCCATACTGTAATAAAACGGAAAATCGTCAGTATAACCTGCGGTTGCCAGGTGGGCAGTATATAAGCCTTCGGGCAAATTGCCCATATCAGCTTTGATGATGTTCAAAGGCCCGGTTTCCACACTGCCGGGCACGTTTACAACATTGCCAAAGCGATCTTTTATAGTGAGTGCCGGAGTCTTATTAGCCACCGTAACCTGGTAAGTAAATATGCGTCCAAACTGGCGGATCAAATCATTGTTACTTGCATACGCCAGTGGTTTTCCGCCTACCAGCGGATCAGTTACCCAGTTGCCCGCCGGGGTAGCGCCCGCTGCCGGGTCAAGGTTATTCTTTTTCTGGGCTATAAATAATTTAGTTGGCGCAGCCTGGTTGTCAACCAGCATATCGCCAGGGTAATAATCGGTAGCAGCCTTGTACACATTCGGTGTCGCTGTTAAAAACGGCGCCACACGCGGCGAATACGAAGGATCATTAGTAAAAAAGTATACCATACCATCCGGCGCTTTTAAAGGCAGTGAGCTATAGTTGGCAAAAAGCCCGTCGTTAATGGCCAAACTAAAGCTAAATACGGTGTTATCCGGTGGTACTATCTGCGGCTGTTCCTTGGTGCCGTTCGGGGTATAAGGAATACCGGTAATAATGCCGGTTTCTGTTTGTTTATAAACCATGTGCAAGCCTGCGAACTTTGCCCTTGTAGCTTTAGTAGGCGTTACGCTCATGAAACTGCCTATGTCATAACGCTCCAGCATGGCCAGGCGTGCATCGGCGCTCAGCGTGTCAAAGAGTGTGATCCCGTCATCCAGGTGATAGTGATGACGGATGTCAACCTCAAAAAGCGTTTCGTAGTTCAGGTCCAGTACCATCGGGTTTATAATTTACAGGTTAATCGCCGGCACTAATCGTTAGCTTAGTTTATCGCTCAGTTCTATCTTTTTAATGATGCCCCGTGTTTCGGTCACATTATCGCGCAGTATTTCAATCAGCCTGACGCGGTACAATACATATGGAAATTGTTTACCACCCAACGTGCTCCACAGGTAATTGGCCTGCTCAAATGTGGGCGAGTAGATATCAAGTATAAGGCGAAAATCGCCGAGGCCTACTTTCGTGGTCGGGCTGTTTTGTTTGGTAAAAGTGCTCTTGCCCTGAAAAAATTCGACGATATGCGACAGGAAGATCAGTGATTTTTCATAAGAAGAATGCAAACAGGCCGAAAACAAAACATACAGGTTAAGGTACATCGGCGCGTTACGATAGTTAATATCATCGCCGTTGCGTACATAATTGGGTATATTTTTCATGGTAAACTCCTCAGATATATTTACCAGCGTTACCAGGATATTCTCAAGCCCTTTGATCTCATCATCGCTCAGGTGGGCTACGTTTTTAAGCTGGGTGAAGCTTGAGCTGAGCCCGCTGTTCCCGTCGGTTGTTTTGAGATAGTTATTTAACTGCTCGGCTAAAAAACTTACGGTTTGGTAAATCATGATAATTAGATGTTTATAGGTGGTTGCAGGTTAGCTGCTGATGTTAATCCAAAATTGGGCATCAGGCTTTAGTAACCTGCAAGTGTATGCACGTTAATTAACACGTATTTATACGCAATTAGGGCACGTATAAATACGTGGTAATGAACAGGTTAAAACAACACAAAACCTGCAAATTTCATCAAAAACGAACTTTTTAAAATGGATAACCAGTTGTCACGGCGCACTCTTCTTCAAAGTCCATAATTTAAGCGCGGCGTAACATCGACAAAACCGGAGAACCTCATATCTAATTGATGTTCAAACAAATATCTCAAAATCAGGATACTGCTATCCTAAAGGATACCGGTATCCTTTAGGATAGTAGTATCTTTTTGATATCTGTTAGGGGTAATTTTGCTGCTACAAAACATACAGCAATGAAAATATTACATCTAATCAGCAGCCCGAGGGGCGAAGCATCATTCAGCGTAAAATTAGGCAGGGCCATCGTAAGTGAATTACAATCGGCCCACCCGGATCATACTTTAATCACACATGATTTAACAGCTACCCCCTTCCCGCACCTGGAAGAAGCACACATAACATCTTTTTTCACTCCTGAAGAAAAAAGGACACCCGAGCTTGTTGAAGCCGTTAAACATTCGGACGAGGCAATTGCCGAACTAATGAATGCCGATGCCGTAGTTATTGATGCACCAATGTACAACTTCGGCATTCCATCAACCTTAAAAACATGGATAGACCATATTGCCCGTGCGGGCAAAACTTTCAAATACGATGGAAAAACACCTGAGGGATTAGTTACAAATAAAAAAGTATACCTTGCCATTTCATCTGGCGGAGTATATTCTGAAGGATTCATGAAAGCTTATGATTTCACCGAGTCCTATTTAAGGGCTGTACTGGGCTTTATGGGGATGACGGATATTAAAGCATTTAGAGTTGAAGGTGTAAAGGTACCGGGCATGCAGGAAACTGCTTTAGATAAAGCGATACAGGCAATTGAGCTTTAATATTAAAACATCCGGAGGAAGTAGTTGCTTTCTTCGGGTGTTTTTGTTTTAAATTTTAACAACAAAGCAGTCTGTCAAAACAGATGTATTTTCGTCCTCATCAGCTACGGTGTAATATTCACCGGTTAAGGTGAGCCCATTGTTAATCTTTTCAACAGCGATCTTTAAAAAGCCGTGGCGGGTGTCGCAACTGTATTCAAGGCTAACCCCATCAAACCTGGAATCATCGGGAATAAAACCCGTGTTGTCGGTCAAAGCAATTGGATGCAATTCGTCAAAACCACCTGTACCGGCTACAATATAAGGCATCATGGTTAACTCATCATATTGCTTACTGAAACGCTGATAATTGTGTACGTGCCCGCTAAAAACTATATCAGGCCGGATACCGGTTTCTTCAAAAACACCTTCCAGGAAATCAATCATTGGTAAACTGGCTCCATGATTAATATCTGCCGAGTAAGGGGCATGGTGCATACAAAGGATAATTGCCTTATCTGGCCGCTGCATATCGGCAGATCGCAGTTCCTCCATAAACCATTCACGCTGCTCGCGGGTAACAACGCCGTACTTAGGTACATTGGTATGTAAACCGATGATGGTTGCCAGCGGGGTGTCCATTGTCCAGTAAATATTGGGTTGCGGCATGCTTTTGCGGTCTATCCTTCGGCTTAGCATTACCGGGCGCGAAACAGTATCACAAAACACAGTTGTAAAAGCATCAAGACTTTGATACGGCATTGCAGCAGCAGGGTTTACATCACTATCATGGTTACCGGCTATCGCGAATATCGGTCCCGGATAAAGGCTATAGGGTTCAAAAAACTGTTCATAATAGTTTTGAGCCTCTCCGTAGTTGTAAACAACATCACCTAAATGGTACAAAAACTTTGGTTTACAAGCGCCGGAAAGCACTTTATACTGTGCAGCCATTAAAGCTGCTATGCGCCTTTGCGAACCCGGATTACGGATGCTGCCGGTATCCCCTACCATATGAAAAACCATTTTGTCATCAGCAATGCCGGGCACTACATCATGTATGAATAGATGATAGGGATAATTGCCGGAGGGTTCGGGCAGGGGCTGAAATTTATAGCTGTCATCCGGCTGATCTTTTTTGATGACCGGGCCTGTGTGTTTTAACGAGGAAATTAAAGCCATCTGCTCCATGATGTCAACAAAATTACGCAACCTAATATTAAGGCAGGGTTAAATTGGGGTTTAGGGTTTAACAGGGCCAGCCTGAAATCACGGGTTCAAAACAGGGCTGACGTATAAGAAACGGTAAAAATGAGTAGTGTTACCAACAAAATGTTGGTAAAGAGGGAATAAGTTTAAATGTTCGGGGGGGGTAAAGGCTTTTAGAGTTTTTTCTTTGTAAAAAGAATATGACAACCTCACTCCATGATCATTTCAGATGGATACCCGATCCGCGCACGGGCAATAATAAGAAACATAATTTACTGGAAGTAATAATCCTATCGGTATTGGCGGTTGTATGCGGTGCGGAAAGCTGGTACGAAATGGAGGAGTTTGGCAAGGAAAAAGAAGACTTTTTAAAACAGTTGCTACCGCTTGAAAATGGCATACCGAGTCATGATACGATCAATCGCGTTTTTATGATGATCGATGCGGATGTTTTTGAACGCTGTTTCCGCGCCTGGACGGCGGAACTTGGCCAGAGCCTTCAAACAACAGGTACATCTGGCGAAAGGGAACTGATTGCTATCGATGGAAAGAGCGTCTGCAATAGCGCCTGCAAGCATCAGGGATTGGGGGCATTACATTTGGTAAGTGCCTGGTCAGGCCGTAATCAGTTGGTACTTGGGCAACAAAAGGTGGCTGACAAGAGTAATGAGATTAGTGCCATCCCGGCGTTACTATCTTTATTGAACATTAAGGGGGCAGTAGTCAGCATCGATGCGATGGGCACCCAAAAAGCAATCGCTGAAAAGATTATCGAAAGTCAGGGGGATTATATCCTGGCCCTGAAGCAAAACCATGAAACGTTTTATGACCAGGTAACCAACCAGTTTAACTTCAGGGAAGACAGTTACAGCCAGCATCTGGATAAAGGGCACGGGCGGTCTGAGATCAGAACCTGCAAGGTCATACATGAACTTAACTGGATTGACGAAAAGGAAAATTGGAAAGGAATAAAGAGTATTATCAAAATCACCTCCGAACGCATAATTGGGGACAGCCGTACTACACAAAATCGTTATTACATCTCCAGCCTTCAGGCAGACGCTGCCTATTTTAACCAAGCCATCCGCACACATTGGGGAATAGAAAACCAGTTGCACTGGCAACTGGATGTTGGTTTTGCCGAAGATTATAATACCACACGAAATAGTCAGGCTGCACAAAATCTTGCTGTAGTCAGAAAGATAGCTTTAAATATTCTAAAGGCTGACAAAACCAGCAAAGCCAGCATAAAGGCCAAAAGAAAAATGGCTGGGTGGAATCACAAGTTTCTCCTAACACTTATAGCTAACAAAAATTTCTAATGCGTCGGCCCTGGGGTTCAAAATTCTATCAGGCCAATGGTAATTGAAGTGCTTACCTTTGCGGCATGAAAAACTACAAAATAATCTTATCTGCAACCCTGCTTACTTTATCGGCATGTAGCGTTAATCATGTTAAAAACACATCTGCCGGTTCTGATTCTGCTAAAAATCAAAGTTCTGCCCATGCTGTAAGCGCGAGCCTGTTTGCCAAAATGCAAATAAAGCCCACAATTAAAATTGGAGACGCCGTTGAGCTAAAATTCACCGTTTACAACGATGCCGACACTGCCCGTCAATTTTGCAAATGGCATACCCCATTTGAGCCGTTGATGAGCAAATATCTTGACGTTAAAGATGCAAGCGGTCAGGAAATACTATACAAAGGCGCAATGGCCAAAAGAATCATGCCGCCCCCGCCCAGCAGCTATATTAAGGTTAATCCTAAAGATAGCCTATCGGCCACCATTGACCTGCTAAAGGCTTATGACATCAGTAAACCGTCGAAATATACTGTTATTTACGTAGGGCAAAATATGAGCGGGCTTACCGTGAAAGACAGTGTAGCTTTTGTTTATGAAGGGAAGTAA
Protein sequences of DBSCAN-SWA_1 >NZ_CP043449|3487943:3498358|3492436_3493621_-|WP_112652164.1|DBSCAN-SWA MVLDLNYETLFEVDIRHHYHLDDGITLFDTLSADARLAMLERYDIGSFMSVTPTKATRAKFAGLHMVYKQTETGIITGIPYTPNGTKEQPQIVPPDNTVFSFSLAINDGLFANYSSLPLKAPDGMVYFFTNDPSYSPRVAPFLTATPNVYKAATDYYPGDMLVDNQAAPTKLFIAQKKNNLDPAAGATPAGNWVTDPLVGGKPLAYASNNDLIRQFGRIFTYQVTVANKTPALTIKDRFGNVVNVPGSVETGPLNIIKADMGNLPEGLYTAHLATAGYTDDFPFYYSMDSSAWAFIDICVKTSGATYNLLQADTTLKSPVFSLRFKNRATYWRYIGTAFGAASVSDNALPLTRQGFISVKVKDKNNHPTTIDLPNATASLIKPETNQIFSDIFF >NZ_CP043449|3487943:3498358|3493660_3494245_-|WP_090524152.1|DBSCAN-SWA MIYQTVSFLAEQLNNYLKTTDGNSGLSSSFTQLKNVAHLSDDEIKGLENILVTLVNISEEFTMKNIPNYVRNGDDINYRNAPMYLNLYVLFSACLHSSYEKSLIFLSHIVEFFQGKSTFTKQNSPTTKVGLGDFRLILDIYSPTFEQANYLWSTLGGKQFPYVLYRVRLIEILRDNVTETRGIIKKIELSDKLS >NZ_CP043449|3487943:3498358|3490963_3492418_-|WP_112652163.1|tail|DBSCAN-SWA MATTYKTPGVYVEEIPLLPPSVAEVETAIPAFIGYTQNISYNGASLQYNPVRIKSLKEYEQIFGFAENTLFTTTGTATGIKIKKAAGVFSFDAAVDDPYAALKFRMYYCLQLYFANGGGPCYIVSAGQTGAAVNVDKALLGNALAAVQKEDEPTILLFTDAVTLTAPADYYGLFTAALAQAAFLQDRVVLIDPYVAVADKGKKAFGDIATAFRTAIGVNNLSYGAAYYPWLQTSINYFSNESKQLIVSGTGVTPTDFPATAVLRDDANTANSIYHLNSDLYNYIKQQTSLFSVILPPAAAIAGVYASVDNTRGVWKAPANVSLNMVAAPMVKIDDADQEDMNVTSTGKSVNAIRAFSGKGVLVWGARTLAGNDNEWRYISVRRFFNMVEESVKNSTQPFVFEPNDGNTWAKVRGMIANFLILQWRAGALQGAKPEDAFFVHVGLGETMTSLDILEGRLIVEVGMAVVRPAEFIILRFSHKMVQS >NZ_CP043449|3487943:3498358|3494664_3495261_+|WP_112652165.1|DBSCAN-SWA MKILHLISSPRGEASFSVKLGRAIVSELQSAHPDHTLITHDLTATPFPHLEEAHITSFFTPEEKRTPELVEAVKHSDEAIAELMNADAVVIDAPMYNFGIPSTLKTWIDHIARAGKTFKYDGKTPEGLVTNKKVYLAISSGGVYSEGFMKAYDFTESYLRAVLGFMGMTDIKAFRVEGVKVPGMQETALDKAIQAIEL >NZ_CP043449|3487943:3498358|3488960_3490952_-|WP_112652162.1|tail|DBSCAN-SWA MAYKTPGVYVQEIALFPPSVAQVETAIPAFIGFTAFALTPESKSLVNVPTRIKSLLEFESLFGGGYVPATIKVTVNPATNQILNVVPDKRFHLYVSLRQYFDNGGGPCYIISVGDFSSAVTAPPISGGIDTLSAEDEPTLILFPDAVELVSGGNPDLVAFSGLQTKALGLCASMQDRFSILDVLQGDKRQDVSNKPIDNFRSSIGINNLNYGAAYYPWIITTYNVNVDFRQMEFWNNAGVPAKITNYDGFSKSTDEKALVTNLQNAIKDTDKVIGSVFSNAADATALRVGGVDAVKSKLTALANNIAKNITPDVQLTSYLDLLAAIVTAGKKAKDAPAVGALYGKDIDALSKDAKLSAAVTNLIAYEKNVKVAANTTAGRNPTPVYSVLDNTAWITNSTYAAIAANADNFTKDAAGALSIVQALQDTVATILTGFGALINGALFYENLAEQALFSGNVFFSAANSAITLKMSTIPPSGAIAGVYANVDGTRGVWKAPANVSLNNVIGPAVKIDNSDQDDMNVTATGKSINAIRAFAGRGTLVWGARTLAGNDNEWRYIPVRRFFIMVEESVKKATYPFVFENNDANTWTKVRVMIQNFLTLQWRAGALQGAKPEDAFFVHVGLNETMTALDILEGRMIVEIGMAVVRPAEFIILRFSHKMQES >NZ_CP043449|3487943:3498358|3488404_3488836_-|WP_112652161.1|tail|DBSCAN-SWA MANYPLPKFHFQVDWGGTRIGFTEVTGLQIETEVVEYREGSSKEYNKIKMPGMRKYGNITLKRGTFANDNEFYDWMNTVKLNQIDRRDLTINLLNEEHQPVVTWSVKNAFPVKIQSADLKADANEAAIETLEVAHEGLTIKNG >NZ_CP043449|3487943:3498358|3496649_3497771_+|WP_149354031.1|transposase|DBSCAN-SWA MTTSLHDHFRWIPDPRTGNNKKHNLLEVIILSVLAVVCGAESWYEMEEFGKEKEDFLKQLLPLENGIPSHDTINRVFMMIDADVFERCFRAWTAELGQSLQTTGTSGERELIAIDGKSVCNSACKHQGLGALHLVSAWSGRNQLVLGQQKVADKSNEISAIPALLSLLNIKGAVVSIDAMGTQKAIAEKIIESQGDYILALKQNHETFYDQVTNQFNFREDSYSQHLDKGHGRSEIRTCKVIHELNWIDEKENWKGIKSIIKITSERIIGDSRTTQNRYYISSLQADAAYFNQAIRTHWGIENQLHWQLDVGFAEDYNTTRNSQAAQNLAVVRKIALNILKADKTSKASIKAKRKMAGWNHKFLLTLIANKNF >NZ_CP043449|3487943:3498358|3487943_3488402_-|WP_091171169.1|tail|DBSCAN-SWA MAFDAKSGYPPLGFHFKVVFTGIGDEEIDSRFQNVSGLSMEMETESKKEGGENRFEHALPVRAKFPALVLKRGLITESKLLMDWCNNVFNNFIIEPKDLTVSLLNEEHDPLLTWNVVQAWPKKWSLSDLNAEQNSIAIESFELQYQYYTLQT >NZ_CP043449|3487943:3498358|3497836_3498358_+|WP_112655407.1|protease|DBSCAN-SWA MKNYKIILSATLLTLSACSVNHVKNTSAGSDSAKNQSSAHAVSASLFAKMQIKPTIKIGDAVELKFTVYNDADTARQFCKWHTPFEPLMSKYLDVKDASGQEILYKGAMAKRIMPPPPSSYIKVNPKDSLSATIDLLKAYDISKPSKYTVIYVGQNMSGLTVKDSVAFVYEGK >NZ_CP043449|3487943:3498358|3495309_3496422_-|WP_112652172.1|DBSCAN-SWA MALISSLKHTGPVIKKDQPDDSYKFQPLPEPSGNYPYHLFIHDVVPGIADDKMVFHMVGDTGSIRNPGSQRRIAALMAAQYKVLSGACKPKFLYHLGDVVYNYGEAQNYYEQFFEPYSLYPGPIFAIAGNHDSDVNPAAAMPYQSLDAFTTVFCDTVSRPVMLSRRIDRKSMPQPNIYWTMDTPLATIIGLHTNVPKYGVVTREQREWFMEELRSADMQRPDKAIILCMHHAPYSADINHGASLPMIDFLEGVFEETGIRPDIVFSGHVHNYQRFSKQYDELTMMPYIVAGTGGFDELHPIALTDNTGFIPDDSRFDGVSLEYSCDTRHGFLKIAVEKINNGLTLTGEYYTVADEDENTSVLTDCFVVKI |
10 | uncultured_phage(50.0%) | transposase,tail,protease | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
5114225 : 5159919
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP043449|5114225:5159919|DBSCAN-SWA TATGCTTACTTCTGATAAAATTATTGAAATTTTCGTTAAAGTGGATGACTTCTGCAAAGAATGTGAAGAACAAATCGCTAAACACAAGCTTGATGCAGGGAATTACAAGGTCAGAGATAGAAAAGCATCACTTGCGGACAGCGAGATCATTACTATCGTAATCGCTTTTCACAGTGGGCACTTCACCAACCTCAAGCATTTCTATATAACGCATATCTGCTCGCACTATAAAGATTTCTTTCCCGGGCTGGTGTCATATAATCGTTTTGTTGAATTGCAGCAACGGGTAGCTGTTCCGATGATGTTATTCCTGAAAACGCACTGCCTTGGACGTTCCAGGGGCATCAATTTTATAGATTCCACGCATATCAAAGTATGTCATAACCGGCGCATTCATAACCATAAAGTCTTTGCAGCGACAGCAGAAAGGGGGCAATGTTCCATAGGTTGGTTCTACGGGTTTAAACTACACCTCATCATTAATGATAAAGGGGAAATACTGTCTTTTTATCTTACCAAAGGCAATGTAGATGACCGCAATGTCAAATTGATGACTTCAATGACAGAAGAAATATTCGGGAAACTGTTTGGTGACAAGGGCTATATATCTAAAGCGCTGGCCGATCTGCTTTGGGGAAATGGTATCCAGATGATTACTAAGCCGCGCAAAAACATGAAGGACTTCAATATTTCACAAGCCGATAAGATCATGCTCAGAAAAAGAGCGATCATCGAATGTGTCTATGATGAATTAAAGAATATCTGCAAACTTCAGCATACCCGACACCGCTCCGTCAACAACTTCTTAATGAACATAATGGGGTCCCTTTGCGCTTACCACTTCTTCCCTAAAAAACCTTCCTTAAATATTGTTTTCGAGGAACAAGACAATCAATTGCTTTTAGCTGCTTAAACCGAACTCACGTTAAAATAATATTAAAAGAAAAGTCGACTACTCTAAAGTAGCAACGATGAATGATTGTAAATAAAAAAAGGCGTCGAATTTGTTAAAAGCTTAATGACAGTTTTCATTAGGGCGTTGCCTACGGCCGGGCTATGCGCTCATAATGCACAGGCATTAGTTACAGCTGCTATCTAACGCGGGGTTGCGGCGAGTGGCTGATTGGGGGATGGGGAGAGGTTGACTTCTGCTTTCCCGATCAGCACAAGCGTATGTCGGCCTAAAAACCTCTCTGTTACTTCATTGCGAGGCACGACGGCACCGACCGAAGGGAGCTCATTAATACCTATAAACAACAAATTATCGTTATCAATAATCCCCAATGAGCAGAGCAGCTGTGCAGATTGGCCCTGTAAAGTTGGCGGTTGCTTCGTGCCTCCCAATGACGTGATGGCGAGTTTGGGGTGGGAAAGTTGTAACCCGAGAGTGGCATTCCCCCAACAGAACTTATAAAAAAAGCCCATAAACTTTTGTTCCCGATCAGCACAAGCGTATGTCACGCTAAGAGCAGAAAAACGGTGAGTGCCCTGCTTTCCGTCAAGTGTGTACCCACCTAAATGAATAGCTCTGCGTCTTAGCGCCTTTGCGAGAGTAATCTGTACGATATTCAAATAGGTACCAACGCAAAAAGCCTTAGTTATTAGCTAAGGCTTTGCGATTATTACACTATTGAAAGTGATAAAGATTGGCACCGACCTACTCTCCCACGTTTTACCGCAGTACCATCGGCTCTGGCGGGCTTGACTTCTCTGTTCGGAATGGGAAGAGGTAGACACCGCCGATATAGGCACCTGAATTTCTTTAGATTTGAGATGTTAGATATGAGATTTGAGACATATCTTCTTTCCAAACCATTTATTTTTTATTTAGATCTGAGGGTTAGATTTGAGTATTGAGATTTTTTATCTCACGTCTCATATCTATCGTCTCATATCTAAGACAATTATTGAAAGAAGTGATTGAATGTAAGAGAAAACAACAGCTTTTGTTGTTTATCTGTTTGTCTCGGATTAGAGGCAGCGACTAATCTCTTAATCTCTAACCTCTAATCTCTGCTTTGAAGAAAGCTTCGGGCAATTAGTATTACTCGGCTATGATGTCACCACCTTTATACCTGTAACCTATCAACGTAGTAGTCTCCTACGACCCTCAATGGAAGTCTCATCTTGTGGCTAGTTTCGCACTTAGATGCTTTCAGCGCTTATCTATTCCCAACGTAGCTACTCTGCAGTACACCTGGCGGCATAACAGATTCACCAGAGGTTAGTCCAACCCGGTCCTCTCGTACTAAGGTCAGCCCCACTCAAACTTCCTACGCCCACAACAGATAGGGACCGAACTGTCTCGCGACGTTCTGAACCCAGCTCGCGTGCCACTTTAATGAGCGAACAGCTCAACCCTTGGGACCTTCTCCAGCCCCAGGATGTGACGAGCCGACATCGAGGTGCCAAACCTCCCCGTCGATATGAGCTCTTGGGGGAGATCAGCCTGTTATCCCAGCGTACCTTTTATCCTTTGAGCGATGGCCCTTCCATGCAGAACCACCGGATCACTATATCCGTCTTTCGACCCAGCTCGACTTGTCTGTCTCACTGTCAAGCAAGCTTTTGCTATTGCACTCCACGTACGGTTACCAAGCGTACTGAGCTTACCTTTGAAAGCCTCCGTTACCTTTTTGGAGGCGACCACCCCAGTCAAACTACCCGCCAAACAATGTTCTCCTCTTTGAAGAGTTAGACACCAAATACAGAAAGGGTGGTATTTCAACGTTGACTAACCGACTCCTGGCGAAGCCGGATCACAGTCTCCCACCTATCCTACACATCCTGTATCCGATATCAATGTTAAGCTATAGTGAAGGTGCATGGGGTCTTTCCGTCCCGTTGCGGGTAACCGGCGTCTTCACCGATACCACAATTTCACCGAGCTCATGGCTGAGACAGCGCCCAGATCGTTACACCATTCGTGCAGGTCGGAACTTACCCGACAAGGAATTTCGCTACCTTAGGACCGTTATAGTTACGGCCGCCGTTTACTGGGGCTTCGATTCAATGCTTCGCCTTGCGACTAACATCCCTCTTAACCTTCCAGCACCGGGCAGGTGTCAGGCCATATACGTCATCTTGCGATTTTGCATAGCCATGTGTTTTTGTTAAACAGTCGCCTGGGCCTTTTCACTGCGGCTGACATTGCTGCCAGCGCCCCTTCTCCCGAAGTTACAGGGCCATTTTGCCGAGTTCCTTAGCCATGATTCACTCGAGCACCTTAGGATTCTCTCCTCGACTACCTGTGTCGGTTTACGGTACGGGTTTTTATTACCTGAAGCTTAGCGGGTTTTCTTGGAAGTCTGATTACCTGAACTATCACATCCCCGAAGGTTCTGTGTACTATCAGCTTTCAGCATAATCTGCGTACTTAACTACAGTTTATTTACCTACAGCCTTTAACGAACTATTCCGTCAGTTCGCGTCAGTGTCACTACTCCGTCACCGCATCGCAGTAATAAAAAGTACTGGAATATTAACCAGTTGTCCATCGGCTACGCCTGTCGGCTTCACCTTAGGCCCCGACTAACCCTGATCCGATTAGCGTTGATCAGGAAACCTTAGTCTTTCGGTGGGCGGGTTTCTCTCCCGCCTTATCGTTACTTATGCCTACATTTGCTTTTCTATTCCCTCCACAGTCAGTTACCTTCCTGCTTCTCCGACAAATAGAATGCTCCCCTACCAGTCACATTACTGTGAATCCATAGCTTCGGTATACTGCTTGATGCCCGTTTATTATCCATGCCCGATCGCTCGACTAGTGAGCTGTTACGCACTCTTTAAATGAATGGCTGCTTCCAAGCCAACATCCTAGCTGTCTGTGCAATCGGACCTCGTTAGTTCAACTTAGCAATAATTTAGGGACCTTAGCTGATGGTCTGGGTTCTTTCCCTCTCGGCCATGGACCTTAGCACCCATAGCCTCACTCCAGCGTATATTATAAAGCATTCGGAGTTTATCTGGATTTGGTAGGATTTGACTCCCCCGCACCCAATTAGTAGCTCTACCTCTTTATAACTCAACCGCCAGGCTGTTCCTAAAAACATTTCGGGGAGTACGAGCTATTTCCCAGTTTGATTAGCCTTTCACCCCTACCCACAAGTCATCCGGAAACTTTTCAACGTTTATCGGTTCGGTCCTCCAGTACCTGTTACGGCACCTTCAACCTGCCCATGGGTAGATCACAAGGTTTCGCGTCTACCTCCCCTGACTATACGCCCTTTTCAGACTCGCTTTCGCTTCGGATCCGTGTCTTAAACACTTAACCTTGCCAGGGAAGAGTAACTCGTAGGCTCATTATGCAAAAGGCACGCCGTCACCCGTTTCCAGGCTCCGACCGCTTGTAAGTACACGGTTTCAGGTTCTATTTCACTCCCCTGTTCGGGGTTCTTTTCACCTTTCCCTCACGGTACTGGTTCACTATCGGTCTCTCAGGAGTATTTAGCCTTACCGGATGGTGCCGGCAAATTCCCACAAGGCGTCTCCGACCTCGCGGTACTCAGGATACCACTATCCTACTATCAATTACCCGTACGCAGCTCTCATGCTCTATGGCCGGGTTTCCCACCCCGTTCCGGTTCTCTTTAATATTCATGTTGTGGTCCTACAACCCCTGCTATGCCGTAACATAACAGGTTTGGGCTTCTTCCATTTCGCTCGCCACTACTCTGGAAATCACTATTGTTTTCTCTTCCTCTGCTTACTTAGATGTTTCAGTTCGGCAGGTTAGCGCATTTATGCAGTTATTCTTCAAATAACTAGGTTTCCCCATTCGGAAATCTGCGGATTAATTCATATTTGCTAATCCCCGCAGCTTATCGCAGCTTATCACGTCCTTCTTCGCCTCTGAGAGCCAAGGCATCCCCCGTGTACCCTTTCTTACTTTCTTCTACTCTTGCGCCTTTTGCGCCGCAAGGTATGTCTTTATGTTAGATCAGTCGTATTTGTTCATTAGTTCATTGGTTCATTAGTATTGCTACTTATTTACCTTTGAAGCTCTTGATCAAGTACAACGTCTCTATCTGTTGTTTTCTCTTGTTTTCAATTACTTCTTCCAATATGTCAAAGAACGTTAGTTGTTAGCGGCAGTCTTTAGTTGCAGTGGCAGTTTCCTGCTTCTACTACTTATTACTGTTACTTACAAAGTGTGGAGAATAACGGATTCGAACCGTTGACCCCCTGCGTGCAAGGCAGGTGCTCTAGCCAGCTGAGCTAATCCCCCGTTGATGAATGATCAGAGATCGATGATCAGTGATTAGTTCCCTAACCTCTTATCTCTAACCTCTTTTCGCCTTAATAGTAGTCCCGAGCAGATTTGAACTGCTGACCCCTACATTATCAGTGTAGTGCTCTAACCAAGCTGAGCTACAGGACTATTTGTTTTTAGTTGCAGTTATCAGTTGCAGTTTGCAGGACTTTTCTGCCTACTGCTGCTGCCACTGCTTACTGCCACTTGTCCTGTGCTTACTTGCGTTTGCACATTCCACAGATGGCTTCATCTTCTGGGTTTCCTTTTGTCTTTTTGTTTGTGTTTTAAGAAATATCATGTAGGTAACAGTTCCGAGTCTATCAGAGATCCTGCTCCAGAAAGGAGGTATTCCAGCCACACCTTCCGGTACGGCTACCTTGTTACGACTTAGCCCCAGTTACCGACTTTACCCTAGGACGCTCCTTGCGGTTACGCACTTCAGGCACTTCCAGCTTCCATGGCTTGACGGGCGGTGTGTACAAGGCCCGGGAACGTATTCACCGCGTCATTGCTGATACGCGATTACTAGCGAATCCAACTTCACGGGGTCGAGTTGCAGACCCCGATCCGAACTGTGAATGGCTTTAAGAGATTGGCATCCTGTTGCCAGGTAGCTGCCCTCTGTACCATCCATTGTAGCACGTGTGTAGCCCCGGACGTAAGGGCCATGATGACTTGACGTCGTCCCCTCCTTCCTCTCTATTTGCATAGGCAGTCTGTTTAGAGTCCCCACCTTAAATGCTGGCAACTAAACATAGGGGTTGCGCTCGTTGCGGGACTTAACCCAACACCTCACGGCACGAGCTGACGACAGCCATGCAGCACCTAGTTTCGTGTTCCGAAGAACTGTGACGTCTCTGTCACATTCACTAACTTTCAAGCCCGGGTAAGGTTCCTCGCGTATCATCGAATTAAACCACATGCTCCTCCGCTTGTGCGGGCCCCCGTCAATTCCTTTGAGTTTCACCCTTGCGGGCGTACTCCCCAGGTGGAACACTTAACGCTTTCGCTTAGACGCTGACCGTATATCGCCAACATCGAGTGTTCATCGTTTAGGGCGTGGACTACCAGGGTATCTAATCCTGTTTGATCCCCACGCTTTCGTGCCTCAGCGTCAATCATACTTTAGTAAGCTGCCTTCGCAATTGGTGTTCTGTGACATATCTATGCATTTCACCGCTACTTGTCACATTCCGCCTACCTCAAGTACATTCAAGCTCTTCAGTATCAAGGGCACTGCGATAGTTGAGCTACCGTCTTTCACCCCTGACTTAAAAAGCCGCCTACGCACCCTTTAAACCCAATAAATCCGGATAACGCTTGGATCCTCCGTATTACCGCGGCTGCTGGCACGGAGTTAGCCGATCCTTATTCTCAAGGTACATTCAACCCGGTTCACGAACCGGGGTTTATTCCCCTGCAAAAGCAGTTTACAACCCGTAGGGCCGTCTTCCTGCACGCGGCATGGCTGGTTCAGACTTCCGTCCATTGACCAATATTCCTTACTGCTGCCTCCCGTAGGAGTCTGGTCCGTGTCTCAGTACCAGTGTGGGGGGTCATCCTCTCAGATCCCTAAACATCGTCGCCTTGGTATGCCGTTACCACACCAACTAGCTAATGTTGCGCATGCCCATCTTAGTCCTATAAATATTTGATATTAAAGCGATGCCACTTCAATATGTTATGCGGTCTTAATCTCTCTTTCGAGAGGCTATCCCCCTGACTAAGGTAGGTTACATACGTGTTACGCACCCGTGCGCCACTCTCATTAGGAGCAAGCTCCCAAATCCCGTCCGACTTGCATGTATTAGGCCTGCCGCTAGCGTTCATCCTGAGCCAGGATCAAACTCTCCATTGTAAAATGTTTTGTTTAACCACTGACCCTATTATTAATTAATAGAATCTGTATTATATATCTTTATACAGTCTCTCTTCTTTTTTGTAACTCTTCAGCTCAGTTTAACCTTGGTTAGACCTTGCCTCCCGTTACGCTACATGATTTCCATTTCTTAAAAGAACTTTTTCGCTTCATCATCCGAATCAGCTATTATGTCGTCGCAATTCGCATTGCTTAACTTGTCAGTTTGTTTCCCCGAAAATCGCTTTCGGCTCTTTTGAACTACTCATTAACTATTGTTAACTTTCTTCCTTCTTTTATTGACTGAACTCGCTCAGCCTCAACTTCTTTCGCTCTCTCAAGCTCCCTTCCGTTTGGGTCTGCAAAGGTAGAAAACCTTTTCTTATTTCCAAAACTTTTTTATTTTTTATTTTCTAAAGTTTTGCAGTGTTATATCTCTCCTCTCTTTCCCTCTTCAACACCGATTCTTCTACCCTTTGTATGAACTACCGCTCCTGTTTGAAGCGGCTGCAAAGGTGCGAATTTCATTTTGATTTAGCAAGTGAAAAGTTCACTTTTTTTCAATCTTTTTTTTCGCCCTTAATTTCAAAAAACAACGCCTCTTTTTCTAAAGCGGTCACAAATGTACACAGAAAACCAGCTACCAACCAAGATATATTTTCAAATAATTAAACCTATTTTACAACTGCCTGTAAGTTAAGGCAAAATACATCTCCAAGCACCCTCATAATTAAGTAAAGGCAGGCATTTAGCTAATGTGTATATTTGAAATGTATATGAATGAGCATAATTTCCCTCCGCATTTCATCGCATCATTGAGCGCGGAAAACGGTTTCGATGAAGAAAATTTTGTAAAAGCGCACCGGTTTGAGGAGGCTCCAACCTCTATAAGGGTTAATCCTTTTAAACCATCTGCCTTAAAAAGCACACAAAATGTACCCTGGTGCGCCAACGGATATTACCTTGAAAGCCGCCCGTCGTTCACTTTTGACCCGCTCTTTCATGCCGGCTGTTACTATGTGCAGGAAGCATCATCAATGTTTATTGATCATGTTTTTAAAACAATCAACCAAAATAATGATGATGCGGTAAAAGTACTTGACCTTTGCGCGGCACCCGGAGGTAAAAGCACCCTGCTTAATTCGGCCCTGCAACCTGCCGATTTATTAGTGGCAAACGAGATCATTAAAACACGTGTACCTATATTAACAGATAATTTGAACCGGTGGGGTACGGCCAATACCATTGTCACCAATAATGACCCTAAAGATTTCGGGAGGCTGAAAAGTTTTTTTGATATTATACTCGCTGATGCCCCCTGCTCGGGCTCGGGCATGTTCCGAAAAGATCCGGACGCGATGAGCGAATGGTCGGAAGGGAACGTGAACCTTTGCCACCAGCGACAGGAACGCATACTGGCTGATATTTACCCAGCCTTAAAAGAAGATGGGTACCTTATATATAGTACTTGCTCCTACTCACACCAGGAAAATGAAGATATCCTGGACTGGCTTTGCACCGAGTTTGAGCTGGAAAGCATCCGCATACCTATTAATAATGAATGGGGCATTGTTGAAACCCAGTCGCCGGAGAAAAAAGCCTGGGGCTACCGTTTTTACCCGGGTAAGATAAAAGGCGAAGGCTTATTTGCCGCCTGCCTGCAAAAAAAAGAAAACAGCGGCGATCTGGCCTCCTTTAAAAACAACACCCAGCAAAAATTGCCGGGCAAGGAGCTCGACCAGGTGCGCTCATATATTAATAACCCCGACGACCTTTATTATTTTAAGGTAAATGACGACTGGATGGCTATTAACCGCGCCCATAAAGAAAGCCTCAATATACTACAGCGACAGCTCTATATAAAAAAATCAGGCGTAATTATAGGTAAGCTTGCCGGTAAAGATCTGATCCCCGACCATGAACTGGCGCTAAGCCTTATAATTAATAAAGATGCAGTTTTGGAAACGCCCCTTAATAAAGATCAGGCTATCCAATACCTGCGCCGGGACAACATAGAACTATCCACAACAGATAAAGGCTGGACGCTCATGACCTATGAAGGGCATCCGCTTGGCTGGGCCAAATTATTGCCAAATCGGATCAATAATTATTATCCTAAAGAAATAAGGATCCTCTCGACGCATCCTCCGAGGGAATGGTAATATTATGCCTTTACGTTAGGGATTGAAGCGGATACCAGCCAATGTGGCTAAGGCCCTGCGCAGTATGAGCGAAAAGCCCGGCCGAAGGCAACGCCTAAATAAATAAAGATCGAACTTAAACTCACAAAAAAAGCTAAGCCGCAAAGTTTTGGAATCTCTTCCCCCTTTGCGGCTTAGCGTATTTGCATAAAAGTAAATGCTACATTAAAGTAGCTGTGATCAATTACCGTACTCCATTAGTTATTATTGAAATTTAAACCCTAAACCAATTTGCCAAACCTGGTTACGGTAATCGGGCAGGTTTTGATCAGCATAAGGACGATTTTCGCCAAGGTCGATATTATAACGGCCGCGGATCTCCACATTGCGGTTAATATCAAAGCTTACACCAACTACGCCGGCTACGTAGCTATGACTGGCATCGTTATCAATATGATCGACACTGGTGCCCAAACCGCTTTCGTAAGTGTTTTTGGTTGAAGTTAAAAAGGTAAGCTGTGGACCAACCAGCAGGTTAAAGCCCCGTACCACCCTAAATTTAGCCAGCAAAGGCACATCAATATAATTGGTGCGCTGTGTAAATTTCCCGTCCCGGTCATTCACCTCATATCCTTTTTGCGAAAACAATACTTCGGGAGCAAATGACAGCGGGTAGATAATAGGCACATCAAAGGTTAAACCTGCGTGCCAGCCCGCAAGTGAACTCGAACTATAATTGCTGTTATAAGCATCAACCGTGTTTGAAACGTTTAAACCAACCGCCAAACCAACTTTGGGGGTGTAAAAATCATCATAACGGCGTTGGGGCTGAGACCTTTCAACAACCCGGCGGGCGGGCGGCGATGACGCGGTCCATAGTAATACCCCTGCGCACTAACTGTCCCTACGGTAAAGAGGCACATTGCCAAGAAAAGTAATTTTTTCATCGTATTTGTCTGTTTTAAATAAGTGGCGGAAAAGTCGTTAGCCCGTTAGTTACCGGTTACCACCGGCTTTTCCTCGTATGTATGTTAATAACGTGATAGATTAATAAAAAACACAAAGGTTTATTAGCGGTTTGATTTTTATAATAGTTATAACATCATTTTCATCAAAAATTCAGAATAACGCTACATTTGGCCTATGCAAGCTTCTAAAACCCGCATTTTCTACCCGCTGATTATCATCTTCACATTTTTCACCGCCGATGTTTTTGCACAAAAAGAAAGTTATATCCAGTTGTTAGGCAAACAAAACCGCTGGGTCGATTCGGTGTATCATAAAATGAGCCGTAAGCAGCGGGTTGCACAATTATTATTTGTACGGGCGCATACCAATAAAGGGAAAGCCTATGAAGATTCGGTTGGGCAGGTGATTAAAGAGCAGCAATTGGGGGGCGTTGTGTTTTTTCAGGGCGGACCGGTGCGCCAGGCCAACCTAATCAATAAATACCAGAAACTGGCCAAAGTACCTTTAATTATAACTATGGACGGCGAATGGGGCCTGGGTATGCGCCTGGATTCTACGATATCATACCCATACCAAATGACCCTCGGTGCCATACAGGATGATAACCTGATTTACAAAATGGGCCAACAGGTAGCTTATGATTTTAAACGCCTGGGCATGCAAATGAACCTGGGGCCAGATATGGATGTGAACAACAATCCTAATAACCCGGTTATCAATTACCGCTCTTTTGGCGATAACATGTATAACGTGGCCAAAAAAGGCATTGCTTATTTTAAAGGCATGCAGGATGCAGGCTTGCTTACTTCGGCCAAGCATTTTCCGGGACATGGCGATACCAATGTCGATTCGCATTTTGATTTGCCGCTGCTCCCTTATAGCCGCAGTCGATTAGATTCATTGGAGATGTATCCCTTCCGCGAAGCCATCAATGCGGGTATCAGCGGGGTAATGATCGCTCATATGGGGATCCCTTCCTTAGATAATACGCCAAAACTACCCTCAACCCTTTCGCGCCCTATTATTACCGGAATCCTGAAAGATTCGTTAAACTTTAAAGGCCTGGTGATATCCGATGCGATGGAAATGAAAGGGGTAACCAAATATTTCCCCAACGGTGAAGCCGATGTAAAGGCCTTTATTGCAGGCAATGACATCATCGAGCTGTCTGAAAACTCCGACCGGGCCATAGGCCTGATCCGTAAAGCCGTAAGACATGGCAGTGTTAGTGCCGCCGAATTTGAGGCGAGAATCAAAAAGATCTTAACAGCAAAATATTGGGCAGGGCTTAATCAATACAAGCCAACTGCTTTAACCAATTTAAGCCAGGACATTAACAGCGATGCAGCCAAAGCACTGGTACAACAACTAAGCGATGCCGCGGTAACCCAATTGAAGAACAATAAGGTAAAGCTCAAGCCATTTTTAAAAACAGCCATCGTGAGTATAGGCGTAAGCTCGCCAACGCTTTTCCAGATGGAGGTTGTAAAAAGTTTCCCGAACAGTCGCATTTTTATAATTAATAAAGATGCACCGGCGATTGAAGTTACCAATATCCTGAACTGGCTTAAACAGTACCAGCAGATTATTGTAGGCATACATGATACCCGCCTCCGCCCGCAGAGCAAGCTTGACTACAGCAGCGATGTAAAACTGATGATTGCCGACCTGGCATCACGCAAAAACACGGCATTCAGTGTGTTTGCCAATGCTTATACTATAGCGGGTTTGCCGGGGCTTGAAAAAGCTGGCAGTTTATTGGTTTGCTACCAGATGTCGCCCGACTTGCAACAGTCGGCAGCGAAAGTGCTTAGCGGTCGGTTAAAGCCTACTGGTAAGCTGCCTGTAAGTGTAAATGCATTTTTTACTACGGGGATGGGATTGTAGGAAAGAGATTAGAGACTGGCGATTAGAGGTTAGGTGCGCCGCAGTTATAAATAATCTAACAGCTCATAACTGTTATTTAAACTGGCTTTCGGTTAATACTTTTTTACTTTTCATAACATTCAACAGGTATTGTGCCTCTGGCTCATAAAGATCTTCTCCAAAGCAGAAGTTTCGCATGCCGTAATCAAATGCTATAGTTCCCTTTTCTTTATTCCGGTAATAAAATACGAACCAGAATTGCCTGTCAGATTCTGATTCTATTTCCCTGATCCCAAACTCCCTGCATTCATTGAGATCGTAAATTTTTTCGCGGGTAAAAAGATCATTTTTACGATTGATGCTGAAGGTGCCCATTCCAATTTCGATGATCTCCTTGCCCCAAAACAACCAAATTATCGTTTTTACTGCCATAACTCCGCCAAGGGCCCAACCGCTTATCCATAGCAACATGAATACGCGGCCAAATTCGTTAGGGGAATTGAAAAGTTCACCAAATGCAGATGCTCCAAAATATCCCCAAAAAGTTACCCAGCAACAAATCCATATAATGGAGATATAATTTCGTTTTACAGGAATAGTAATTTTGAGATACTCGAAATTATCCTCAACTATCGCCCGCCCCTGATAAGGTTTTTCCATGGTTAATATTATAGCCTAAATATCATATTTATTATCCTGTTTGCAAATGCAGCAGCATCCTTTTTTATCTTTACTAAATATTATGCAATCTACATTCAATAAATATAACGCTAAATTTCAGGAAGCCTTAGCCGGTTTAAACCCCGAGCAATTGGCCGCGGTTAACAAAATGGACGGGCCTGTGCTTGTAGTAGCCGGCCCCGGTACGGGCAAAACCCAGATCCTGGCAGCGCGGATAGGAAAAATCCTGACCGATACAGACGCTTTACCCAGCGAAATCCTGTGCTTAACGTATACCGATGCCGGCGCAGTAGCCATGCGCAAACGCCTTTTTGAATTTATCGGCCCTGATGCTTACCGTATTAATATATATACGTTCCATGCGTTTTGTAATGAGATCATCCAGGAAAACCTGGAATATTTCGGCAAACTGAATTTGGAGTCGCTTTCTGATCTGGAATCGGCAATGTTATTCCGTGAGCTGGTTGATGAGTTCCCGAACGATCACCTGCTGAAGCGTTTTACGGGCGATATCTATTACGATGTTCCCCGGCTCAAAAGCCTGTTCAGCACTATGAAGCGGGAAAACTGGAACGCCGAAATGATTGAGCAGGCGGTAAATGACTATCTTGAAGACTTGCCTAACCGCGAGGAATTTATTTATAAACGGGCTAATGCCAAAGCGGGCATCAAGATTGGTGACCCGAAACAAAAAGATATTGACGCCGCGCATGATGTGATGAAAAAACTGCTGGCTGCTGTTGGAGAGTACCAAAACTATACAGAAAAAATGAAAACGCGCGGCCGCTATGATTATGACGACATGATCATTTGGGTGTTGCGCGCGTTCCGCGATAATGAAGAAATACTTCGTAAATACCAGGAGCGTTATCAGTATATTTTGGTTGATGAGTTCCAGGATACCAGCGGATCGCAAAATGAGTTACTGCGTTTCCTGCTTAATTATTGGGATACCCCGAACGTTTTTGTGGTAGGTGATGATGATCAGTCGATCTTTAAATTTCAGGGGGCCAATATGAAAAACATCCTTGATTTTGCCAATGACTATGTAGATACACTATACACGGTAGTGCTAAAACATAACTATCGCTCCAGTCAGCAGATCCTCGATATTTCAAAAGCGCTTATCGATAACAATTTTGAGCGATTGACCAGTCAGCTTAATTTAGATAAAGCTTTAAAATCGTCGCACTCCCGTTTTAATAACCTGTTGGTCGGGCCGGTGATCCGGGAGTACGAGAACCCTGATAAAGAACTGGTTGATGTATCGCTGCAGATAAAACAACTGGTTACTGACGGTGTACCTCCGGGCGAGATAGCGGTAATTTACCGCAACCACAACCAGGTTGAGGAGATGATCAAATTTTTGGATGTGCAAAAGATCCAGGTAAACACCAAACGCAAGATAGACGTGCTTACCCAACCCTTTGGTGAAAAGATCATTAATATCTTGCGTTACCTTGCCATGGAGCTCGATTCGCCATATAGCGGGGATGAATTACTATTTGAGATCATGCACTATGATTTCTTTAACATCCCCCCTATTGAAATTGCCAAGGCAAGCGTTGCCGTGTTTAAAGAAAACTTTGGTACCCTGGCTTACAATCAGCCTAAAACTTCTATCCGCCGTTACGTAAGCGAGATGAGGCCACCCAAACAAGCAGACCTTTTTCACCCCGACCAGCAACAGGAGATGCGGTACCTCGTAAGCAATATTGATTACCTGCTCAAAGAGTCGGTTAGCGTTACGCTGCAACAGCTTTTTCAAAGTGTGATAGCCAAAATGGGCATCCTGAAATATATTATGCAACAACAGGATAAGGGCAGCTACATGCAAATGCTTACCAGTTTTTTCGATTTCCTGAAAGATGAAAGCCGCAAAAGGCCAGACATCAACCTGCCTGATTTGATCTCGACCATCGAGATGATGAAAAAACATCAGATCCGCCTGGAACTTAACCAAAGTATCTTTTCGGAGAATGGTGTCAACTTCCTCACGGCACACGGTTCAAAAGGCCTTGAGTTTGAATATGTGTTTTTTATTGGCTGCGACAAACGTACCTGGGACAGTAAAGGCCGGAACAGCGGTTTCAGCTATCCCGACACACTCACCCGTTCCGCCGCTGATGAAATTGCACAGAAAGAGGAAGCGCGCCGCCTGTTTTATGTGGCGGTAACGCGTGCCAAGCAGCACCTTACGATCTCATACGCCGCTAAGGACAGGAAGGACAAAGACCAGGAGGCAAGCCAATTTATAGGCGAGATATTAGCCAGCACCAATATGCAGGTACAATACCCTAAAGTGGATGCCATGGATATGATCGAGTTCCAGGCAACCCAATTCAGCGAAGCCGAAAAACCAAAAGTTGAATTACTGGATAAAAACTACATCAACCAGTTGCTGCAAAATTACACGCTATCAGTAACCCACCTGAACAGTTACCTGGACTGCCCGCTGCGATTTTATTTTCAGTGTTTGATCAGGGTGCCATCGGGTAAAAGCCCGGCGGCTACTTTTGGGCAGGCAGTACACTGGGCGCTCAACAAAACCTACAGGCAGTTGAAGGATAATGACAACGAGTTTTTAAGTACGGAAAATTTCATGCGCGAGTTTAGATGGTATATGGCCCGCAATCGCGATTCATTTACTAAAGACGAGTTTAAACTGCGCTCGGCCTACGGCGAAAAGATCCTGCCGACATATTATGAACAAAACGTACCGCATTGGAACAAGATAGCTGTTACCGAACGATCTATAAAAAACATTGAAATAGCAGGTGTTCCTATTAAAGGTAACCTGGATAAAATAGAGTTTGACGGCAAGCAGGCTACCGTTGTTGATTACAAAACAGGCAAGCCTAAAAATGCGAAAGATAAGTTGTTACGACCAACAAACGACGCGCCAAACGGGGGCGATTACTGGAGGCAGGCCGTTTTTTATAAAATACTGATAGACCATGACCGCACTACCGACTGGCAGGTGATCAGCACCATTTTTGATTTTGTTGAACCCATAAGCGATGGTGAATATTACCGCGAAAAATTTGTGATAACACCGGAAGACGTAGAGACAGTAACCGGGCAGATTAAGGACACTTATGAGCAAATCATGGTCCACAATTTCAGCACAGGTTGCGGCAAAAAAGAATGCGATTGGTGCCATTTTGTAAAAAGCAATTTTAAACAAGCCGATGAGATTTTAGGGATAGTTGCGGAGGAGGAAGGAGAATAAAAGATGCTGATGTAGAGACGCATAATTGCGTCTCTCGCAGGACAGGAACAATTGCATTATAATCCTTACAATATAAATGGAAGAAAAATATCAAAATAAATATCGCACCTCATCACCTCGTTTAGCTGGATGGGATTATGGGGCTCATGTTTTATATTTTGTAACCATCTGTACCCTTAACCGTGTACCATATTTTGGAGATATCCTCCCAAATGAAAATAATGAAGCTCTATTTCTGCAATCCTCAATTATGGGTGAATTCGCTCATATCAACTGGTTAAAAATTCCAGAAATTAATCCATTTGTCGAATTGGATGAATTTGTGGTAATGCCAGACCATTTGCATGGGATATTGTTTTTAACAAGCCTGATAAAATCAATTGGGAAGCAAATAAATTCGGTGTTCAAAAAGATAGCCTGGCTTCTGTATTGCGTGGCTATAAATCATCCGTTAAGAAATATGCGAATGACAACAGCATCGATTTTGCTTGGCAGCCTCGTTATTATGATCGGGTAATTAGAAATGATCGTGAGTACAATAACATAAAGCAATACATTCATAACAATCCCAACAACTGGTTTACTGAAAGAGATAGGTTTGAAAATTTACATATCTGACCCAAATCATCAAGGTTGACCTGTATGGTCGGAGACGCAATTATGCGTCTCTACGGGAAAAAACAACATCATCATATGAAACACATTTTCTTATTCCTCCTCCTCATCCTGACCTTGCCTGCCACTGCCCAAACCATCATAAAACAGGACGCTGTCATTAAGCAAATGGTTGATGAGGTATCGGCCAAAAACATTGAGGCTACCATCCGCAAACTGGTTAGCTTTAAAAGCAGGCACACGCTGAGCGATACAACCAGTAAAACCATGGGCAGCGGTGCAGCGCGCAACTGGATCAAGGCCGAAATGGAGAAATATGCGACTGAGTCAAACGGGAGGATGACCGTGCAGTTTGATACGTTCACACAGCCTAAAGGTACACGTATCGATAAACCCATTAAACTCAAAAACGTACTGGCCACACTGAAAGGTACCGATCCGAATGATACACGGGTGTACCTGGTATCGGGACATTATGATTCACGCATCAATGACGTGATGGACGCCAACGGAGTTGAACCCGGTGCCAATGACGATGCTTCAGGCACTGCACTTTCGATGGAACTGGCCAGGGTGATGGCTAAAAGATCGTTCCCTGCCACTATTATTTTTATGACTGTGGTTGGCGAAGAACAGGGGCTTTACGGATCTGCCAACGTAGCCAAACGGGCCAAAGCCGAAAACTGGAATGTGGATGCCATGCTGAATAATGATATCGTTGGCAATACTTACGGCATGGAAACCGACCTAAAAGATAACCGCAGCGTACGTGTTTTTAGTGATGGTGTACCTACGGCAGCTACCGATAAACAAGTGGCAGCTTTGAAATCATTGGGAGGGGAAAATGATAGTCCGTCAAGACAACTGGCGCGGTACACCAAAGAAATTGGGGAGCGTTATGTCGATCAACTGGATGTGAAACTCATTTACAGGCGCGACCGTTACCTCCGCGGCGGCGATCACCTGCCATTTTTAGAGCAGGGTTTTACTGCAGTAAGGTTTACCGAAATGAACGAAAATTTTAACCGTCAGCACCAGAACATACGCACGGAAAACGGTGTTGAATACGGCGACCTGCCCGACTTTGTTGATTTTAATTATGTGCAAAAAGTAGCCCGCATGAACCTTGCAGTATTGGCTAATCTCGCTTCAGCCCCGGCCGGTCCGCAGAATGTAAGCGTGATCACCAGCGACTTAACCAATAAAACTAAACTAAAATGGGAAGCTCCCGCTACCGGCAAAAAGCCCGCCGGTTATTATGTACTGATGCGCGAAACCATCAGCCCCTATTGGGAAAAGAAATTCTATGTAACCGATACTATAGCTACGTTGAATTACTCAAAGGATAATTACTTCTTCGCGGTACAGTCTGTTGATGCTGAAGGGCACGAAAGTTTGCCGGTGTTTCCGAAGCCGGTGAGGTAGATAATCAAAAGATTATAGGTTAGATAATGGACATTAGATTTTGCACTCAAGAAAAATGAGGATTAACTAACCTCCAGCCTCCAATCACTAAAAACTACTTCACGCTCTTACGCTGCACAATAAAATATCCAGCAATAAATACCACAAGAGCAATAGCCATACTTACAAATACATCTTTAGTAAAAACATCTATTTGTAACCGGTTGGATGGATCGCCATGATTGTTTTTTAACATGCTGCTGATGGCGGCCATGGGGTGTGCATATGGAAACAGGTAGCTATATTCCCAGCCTTTTGAAGCTGCCGCAACTCCGACTATAGTACATACAAACCCTATCCCCATCGGCTTCAAAAAATCAGCCCAAAATAAACTCAGTAAAAACTGGATGGAGAGGATACCTAATGATGATAAAAACAGTTTGAAATAGATCTGTGCCAGTTCTTTCTCCATGTGGAAATCGTCGAACCGTAACTCCGGTTTTACTATGCTTAACAAATTGCCGAAACCAATGGTAAACAGCACAAACAATGCAAGACAAACAAATACAAGGAAAACCGCATAAAAAAATTTAGCCGCATAAACCGACCAGCGGGGGATTGGCAGATTAAACAAGGTTTTCCAGGTATCGGCCCGGTGTTCGATACTGTTTACGGAGTATGCTATAAATATGATATACATTGGCAGCACCAATGAGCCCATGATACCCAAAATAGCACCAGCAAACTGTAGCCAAAGCATCATCGGCGGCATTGACACCAACTTTTCACTTTTGGTATAAAACCCTACGAACAACAAAACACCTATAAATAAAGGAAGCATTACCGAAGCCCAGAAACCAAGCGTTTTACGGCTTTTGTAAAACTCCGAACGGAATGACAGTATAAATCCTTTCATGGCTTAGGCTCTTTGGGTGATATCTAAAAATAGTTTTTCCAGGTCTTTTTGCTGTTTACTGATGCTGTAAACCATTTGCCCGTTTTTATTGAGCAGGGCGTTGATCTCGCCCATTTGTTGTTTGGAGATGTAGGGTACCAGTAAATGCTCGTCGGTTACTTCGGTAACGTTAATATGGTGGCGGGTTAACAGGTTGGCGGCGTCAACGGTATTTTCGGCATCAACGCGCACCAGTGGCTTGCTGATGGCCTGCAGGTCGGCAACACTGCCCTGAAAAAGCATTGCGCCGTTATTTATAATGCCCACATGCGTGGCCATCCGTTCAATCTCACCAAGCAAATGGCTTGATATAAAAACAGTTTTGCCATGTTGGGCTACAAGCTTTTTCAAAAGCTCGCGGATCTCGATGATACCGTTTGGGTCGAGGCCGTTGGTGGGCTCATCTAAAATTAACAATTTGGGATTAGCAAGCAGGGCCAATGCAATACCAAGGCGCTGTTTCATGCCTAATGAATATTGCCCGGCCTTTTTATTCGCAGCAGCGGTTAGCTGTACCAGATCAAGCATATCTTCAACGCGCTGAGCAGGCACCTGCAATAATAAGGCCCTGTTCAATAAATTTTCCCTCCCGGTTAAGTGCTGGTATAAAGCAGGCTGCTCAATAAGCGAACCTATTTGCGAAAGTATGCTGATGCGGCTTGTTTTGATATCCTGCTCAAAAATGTGGATGTTACCTGCATCGGGTTTTAACAGGTTAAGCAGGAGTTTGATGGTAGTGGTTTTCCCGGCACCGTTGGGACCGAGAAAACCATATATACTTCCTTCTGGAACTTGCAGCGCGAGTGATTTAACTACCTGCTGCTTACCAAAGGTGAAGGAAAGCCCTTCGGTTTTAATTACCATATATTATTTTTTGCTACTGAGCAAACGGTTTTAACCATTACAAAACCTTTGTGAAACAATGAGGTAGCATGTACTGCTGTGCGGCCAATTATTGCTGTTTCCTTTCTTTGCTCGAACTGGTAGCTTACACCCAAAATAAGGGCAAACATTACCGGGATCAATAACATGATAAAAGGATTTGAATTTTTGATAAGCGTTTTCATTGTGAGTGGTGATTTTATACAACAAAGGAATATTAAAAAAATAACCTGCTAAAATCATTTAGACCAACACACTTTGTTTATAGATGAACGCTTTTCTGCACCTTTTAACCGGTTCATCGATAATAAAAGCCAGTTCATCGAAAAAATGGGCCGCTATATATAAATAACAGACCGATATCATTATCAAAATATTAAATTAGATGCCACACCCGTAAACATAAAAGGGATTTAAAAACACCGAATGGAGGCTTCATTACCATTACCTAAATCCAGAACCGGAATAAAAACCGCCTGGATAATTTTTTGGCACCTGTTTTTCTGGGCAGCCATTATATCCTTTTTTGTTTTTTTGGCCCGGCTTAATGATGCCATGCTTAGTACCAAGGAGTTGCTGGTGATTTTTTTGCTTTATCCTACTATTAATATCAGCTTATTTTATCTTAACTACCTGGTTTTTATACCTAAATTTCTAAATAAAAAACGCTACTGGCAATACGCCACCTTTGTGCTTACAAGCATCATAGTTTACGGCGTAGCCAAATACGGCGTGGGGCTTTTATTTAAAGATATTGTACTGGTGCATCAAAAAGGTCATGTCACCAGTTTTGCTACTTATTTCTACAGCAGTATTTTCACCAGCCTCATATTTGTATTTTTGAGCACTGCACTAAAATTCAGTACAGACTGGTTTCTGAACGAACGCATCCAGCGCGACCTTGAAAATCAGCGCCTCATTGCCGAGCTGGCTTTTCTAAAATCGCAGATCAACCCGCATTTTTTATTTAACTCCTTAAACAGTATTTATTCACTGGCTTATCAGCGCTCAGAAACTACCCCCGAGGCTATTTTAAAGCTTTCGGAAATTATGCGTTATATGCTGTATGAATGTAACGAAAACAAAGTTGACCTGAGTAAAGAACTCCAGTACCTGCAAAACTACATCGATCTGCAAAAGATCCGCTTTGGCAGCAAGGCATTTATTGATTTTAAGGTAGATGGCAATATCACCGATCAAAAAATAGAACCTTTGCTACTGATCGCTTTTATCGAAAATGCTTTTAAGCATGGCGTTGCAAATGATATCAATTCGCCTATAAAGCTACTTATTAACGTGGCGGATGGTAAACTGCATTTCTACATCCAAAATAAAAAACATAACAATAACCGTGATGCTGTAGGTGGAATCGGCTTGATAAATGTGCAGCGCCGGCTCAACCTGCTTTACCCCAACAGGTACTCTTTAAAAATAAGGGATGAAGAATATACTTACACCTGCGAATTATCAATAGATTTATAGTTATAACCTATTATAACCATGACCTGTTTGAAAAAATTGCTCCCAGCCCTGTTTCTTGCAGCATCGGCCATTACCGCATCTGCACAAACCGATCCTAAAGCAACCGAAGTTTGGGACCCTGAGCCTGAAGTAGTTTTACCGGGCGCCGGCAACAAGCCGCCATCTGATGCTATTATTTTGTTTGATGGCAAAAATTTGGACAAATGGACAGATCAGAAAGGTAACAAACCGAGCTGGATAGTAAAGGATGGTATTGTAACCGTTAAGCCGGGCAGCGGCTCTATCATCACCAAACAAAACTTTGCAGATTGCCAGCTGCATATAGAATGGCGCACACCAGCTGTTGTTAAAGGCGAAGGACAGGAGCGTGGCAACAGCGGCGTAATTATGCAAAGCCGCTACGAACTGCAGATCCTGGATAGTTATAAAAACCGCACTTACTCAAACGGGCAGGCCGGCTCTGTTTATAAACAATACCTGCCACAGGTAAATGCCTCGCTTAAACCGGGCCAGTGGCAAAAGTATGATATCATTTATACTGCCCCGCGCTTTAATATCGACAGCTCGGTTAAATCACCGGCTTACATTACCGTATTACACAATGGCATACTGGTACAAAACCATGTAGCTATAAAAGGAACAGTGGCGCATGTTGGTCAGCCCAAATATCAAAAACATGCCTTCGCCTTACCCCTGTTGCTGCAGGAGCATGAGTTTCCGGTATCGTTCCGCAATATCTGGATCAGGGAGATTGGGGTTCAAAAATTGTTAAACGGCAAGGATAAAAAGGGTTGGTATACCTACCTGGATACGTTAGGCAAGGATAATGATGTACATAACAACTTTGCCATTGAAAATGGTATGGTGCATGTAATGGGCAAGTATTTTGGTTATATGGCCACCAAAAAATCGTACGATAATTATTACCTGAAAGTAGTATTTAAATGGGGGAGCAAACAATATCACCCGCGCGAAAAGGGCGTACGCGACGCAGGTATTTTATATCATTTTGGGGAGGGTGATAAAGATATCGTATGGCCAAGATCGATAGAATGCCAGATTCAGGAGGGTGATTGCGGCGATATTTGGTGCGTACAACATACTAATGTGGTTACGCCCAATAAATCGGCCATTGAGTGGGACCAGCAGCGTGTTTACCGCACCGCCAATTTTGAAAATCCGCGCGGCGAATGGAACACCATCGAAATTATATGCAACGGTAACCAGATTGAACATTATGTTAACGGGCATTTGGTAAACTGGGGAATAGCCTCATTATCGCACGGTCGCATCCTTTTACAATCTGAAGGGGCCGAAATCTGGTATAAATCTGTTGAATTGACGCCGTTATAACAAAACAGCACATTAAAAAATTTAGAATAACCATTATTACATTATTTTTACGCTATGATCAGATGTTTGGTAGTTGACGATGAGCCTTTGGCGCTGCATATTTTGGAAGACTATATTTCCAAGATGCCTTTTTTGCAATTGGTTAAAGCCACAACTAACCCCATCGAGGCGCTTACCATGGTACAGGCCGCCGAAGCCGACCTTGTGTTTTTAGATGTTCAGATGCCCGAGCTTACCGGCATTCAATTTTTGAAAATTGCTAACGGCAAAGCCAAGGTTATTTTAACTACAGCCTACCCGCAGTACGCGCTTGAAGGCTATGAGCTTGATGTAGTTGATTACTTGTTAAAGCCAATCGCATTTGATCGCTTTTTTAAATCGGTACAAAAAGCACAAAGCATTATCCAGCCTGTTGCAGCCAAACCACAGGTAGTGATGCAGGCCGAGCCCGTGCAGCAGGATGATTTTTCAACCGATTTTATCTTTGTAAAAACCGAACATAAGATTCAAAAGGTATACCTGCATGATATTATGTTTATTGAGGGTTTAAAGGATTATATAAGCATTTTTACCTCTGCAGAGCGGATTATTACCCTGCAGGGCATGAAAAAAATGGAAGATGCCTTGCCCGAAAAACATTTTGTACGTGTACATAAATCATACATTGTAGCCCTCAATAAAATAGACAGTATTGAACGGAGCCGCATCCAGATCGGCGACAAGATCATCCCTGTAGGCGATACCTATCGCGATGAGTTTTTCAGGATGATTGAGAATAAAAATATTTAGGGGTAACTTTCATTCGCAAATCATACCAACACCTATAAAAACATGTCATTGCTGAGCGCAACGTTGCAATATCGTGGCCAATGTATGTGCGCCCTGTATAGTTTTATATCTATGAGATTGCCACGTCGCTCCGACGCTTTTCTACCTCCGCTCCTCGCAATGAGATGTTTTATTTATTCGGAAAGTAACTCCTCCCCGCATTTGCTCGTGGTCGGTATTTCCATCGCCTCACTTCCCTAATTAACACGGTACTGTGATAAAACCAGGAATGGGCTTAATGACATAGTTTTATTAACAATCACCTTTCCTCTCAATCCTTACAACTTACCTATTTAAAAAATATATTTCATAAAAGTGTATTTTTTGTATTTTACGCGGTCATTCATCCTATCAAAAAACTATGAGCTACCAGGTAGACCTGACCAATTGCGACAGAGAACCAATACATATCCCGGGAAAGATCCAATCGCACGGTTTCTTGATCGCTGTAAACAGCAAATCTTACAATATTTCCTACATCAGTCAAAACGTTGAAAGCTATACCGGTATCAACGCAATCGGACTCTTAGGCAAAAATATTAGCGAACTGGAAGAACAATTAGCCATAGCAAATGATACCGCGTCATTGCAGCAGTTGTTAAAGTTGGCTAAAGGTGCAAAAAGTTCAGACACCATCAATCCGCTGGCTATTGATGTAAAGGGAACTAATTACAATCTTATCATCAGTCACTCGGTAAATGACCTGGTACTGGAGTTTGAGCCAACCGAATCAGATCTGGGTGGTGACATTCAAAAAACTATAGGCCGTTCGGTAGCCGAGATCCTGAGTGGTAAAAGCTTATCCCTTCTACTGCAAAAGGCTGCTTCTGAAATAAAAAAGATCATCAATTATGACCGGGTAATGATCTATAAATTTAACGAGGATGGTCACGGCGAAGTGACCGCCGAAGTGAAGAACGATGACCTGGAGCCGTTTTTAGGGCTGCATTATCCGGCATCGGATATTCCTAAACAAGCCCGCGAGCTTTATAAGATCAATCTTACCCGTATTATAGCCGATGTAAATTCAGAAAGCTCGGCTATTATTACCTATGAGGAGGGAGCCGCACCGCTTGACCTTACCCATTCGGTATTGAGGGCTGTGTCGCCTATCCATATCCAATATTTAAAAAATATGGGCGTCGACTCGAGCTTCAGTATCTCGCTTATTGCCCATGGCGAACTTTGGGGACTTATCGCCTGCCATAATTATTCGCCAAGGTTTATCGACTATAAATCGCGTGATGCATCGAAACTGATCGGTCAGATCCTTTCATCAGCATTAGAGTACCGCCAGGACGAGGAAGAATTGGCAAAACTTCATGAACTGGGCGAAGCTGCCAATACCATTGCCGATTATATTAAAAAGGACACTGATTTTACTTTCGCCTTAACCCGCAATAAAGTTACTATCAAAGATATTACCACGGCTACGGGTGTGGCGCTGGTGTACGATGAAGAGATAACTACCATCGGCCAGACACCCACCGAAGAACAAATAACCGAGATAGTTGAATGGCTTAAAGTAAATATGGTTGATACCATTTACAGCACTTACCGTTTCCCCGAAATATATCCGGCGGCAAAAAATTACAGTGATGTTGCCAGCGGAATTTTATCCTGCAGTTTATCGCGCGAGCTTGGGGAGATGATCATTTGGTTTAAGCCCGAGCAGGTAAAAGCTATCAACTGGGCGGGCAATCCGGAAAAGCCGGTTGAAGAAACTGCCGACGGCCTGTTGCAGCTCTCGCCACGAAAATCATTTGACAGCTGGACACAAATTGTTAAAAACACATCCGAAAAGTGGAAGCATGATGAGATCGCCGCGGTTTTAAAGGTTCGTGAAGATGTTATTTTTGCCATCAACCGCAAAGCCAACGAAATCAGGATCCTGAATGAACGTTTGCAATTGGCCTATGATGAGCTGGATACCTTCAGCTATACCATATCGCATGATCTGCGCACCCCGCTATCATCAATAAAAAATTACTCGGAGCTATTGCTGGCCAGTAATAAAAGCCTGGATGATTCGGCAAAAAAAATGCTCGAAAGGATCATCAAGGGCACCGATAAAATGCACATGCTCATTAAAGAGATCCTGCATTATTCGCGCGTGGGCCGTACCAACATAGAGGCCGCCCCAATTGATATGGGCCTGTTACTAAATGAAATTAAAAGGGAAGTACTATCGGCCCTTAAGCCCGAAAGCATTGAATTTACCATTGGAGATACCCCAGGTATTGACGGCGACAGTGTAATGATAACGCAGGTTTTCACCAACCTGATTAACAATGCTGTTAAATACTCTTCAAAATCAACGCCATCAAAAGTTAAGGTTGAGGGGATAGTTAGAAACAATGAGATCGTTTATTCGGTATCCGACAATGGTGTTGGTATCGACGTAAATTATTATAACCGCGTGTTTGAACTATTTAAACGCATGGATAACGCGCTCGAATTTGAAGGCACAGGGGTTGGCCTGGCCATTGTAAAACGCATTGTTGAAAAACACAAAGCGCGAATTTGGTTTGAGAGCAAATTAGGATTTGGCACAGTTTTTTATATATCTTTTAAAATTAGTTAGTTGTGAGTTCGCCTGATATTTTGTATGTTGAAGATGACGAAGATTTTGCATTCATAATGCAGCATGCCGTACGCGAAGTGAAAGACGGGCTTACGGTAAAGATCATTGATAATGGGAAGGATGCGCTTGAGCAGTTAAAACAGCTTACCGAAGCAAGGGTAAAACCGAAACTTATACTGCTCGACCTTAACCTGCCCGGGCTTTCGGGGCTCGACCTGGTAAAACGCATTCGTGAAATATCGTTTTTAAAATATGTTCCGGTGATATTCTTTTCAACGTCCGACAATCCAAAAGACGTTAAAGCATCACTTGAATTTGGCGCTAATGCTTATTTAACCAAGCCTGCGGGCTATTTAAACCTGGTAAATTGCGTTGAATCACTTTATAATTTTTGGTTCACTAAAAATCTGAATGTCAATTAAAACGCTGAAAAAAATATTAGTCATCGAGGATGATGGGGATATAGCCGATATTATGGGCATCGCATTATGCGATAAATATGAGGTTGAAGTAAAACGCGACGGCTACGAGGTTGTAGCGCAAATTGAACGCTTCTCTCCTGATTTGATCATACTCGATAACTATATTGGCCAGCGCCAGGCCCGTGAAATAATCAAAGAGATCCATGAGGTTGATGACCACAGAACTATACCTTTTGTATTATTTTCAGGCCATGACAACATTATGCAGCTTGCTGAAAATCTTAATGCTACATCTTACCTCGCAAAACCATTTGCACTTGACGAACTTTATAAGTGTATTGACGGTATTTTAGCGGCGTAAAAACGCACACCTATGCTTCATGAAAAAATAAAACAAGCCACAGCACACCTGCATGATCAGCTTGAACAAAAAATGTTTACCGGTCAGATCATGGATGGCTCATTTACCTTTAAACAATATCAAACCATACTGACAGTAAACTACGCTACGCACCTGGCGGTAGAAGATTTTTTATTTAACAATCTCAGCGATGAATTATGTCAAAAACTGAATATCGAAGCCAGGATAAAATTACCAGCGCTGCTTCGTGACCTTGAGGAGATTAACTCATCTGTAAGCAGCAGTTTGCCGGAAGCCCCCGAATATATTGACTTAAACAGTGACGCATCCATTCTTGGTGCAATGTATGTATTGGAAGGGGCAACACTTGGGGGCAACGTAATAGCAAAACGCCTAAAAACTAACGGGCAGTTGTTATCATACAATCTGAGTTATCACTATTACCAGGTTTACGGCGACCAGCTTGGCTTAAAATGGAAACAATTTTTAGAAGTACTCAATGCCATACCTGAGGCAGAACACGAAGCGGCGATAAAAAATGCTGTATGCTTGTTTGAGCACATGGCAAATACTGAAGTAGCAAGTACTACAGTAATACTTGTACCGCCGGAAGTATAGTTAAAAAAACATGTCATTGCGAGGAGGGACGACGAAGCAATCGCACAGAAGCAGGGTCGCCCTGTATAGCAAGCGATTGCCGCGCTTCGCTTACAGCAATGACATAAAATATTCTAATATCTTAATCCAGCGCATCGGCGGTATCCTTCTCTCCAATCTGCTGACGCCACATGGCGTAATACAAACCTTTTTGGGCTATCAGATCGGCATGTTTACCGCCTTCAATAATGCGGCCTTTTTCCAGCACATAAATACTATCAGCATGCATAATGGTTGATAAACGGTGAGCGATCAGGATGGTAATATGGTTTTCCTTTTCGGATACATTACGGATGGTTTCTGTAATCTCCTCTTCAGTGATAGAATCGAGGGATGATGTTGCTTCGTCAAATACCAGGATATCCGGTCTGCGTAATAGAGCTCGCGCTATAGACAGACGCTGTTTTTCACCACCCGATACTTTTACACCGCCCTCACCAATTACAGTGCTCAAACCTTTATCGGCCCGGGCCAGTAAGGTTTGGCAGGCTGCACGCTGTAACACATCCATACATTCTTCATCAGTAGCTCCGGGGCGAACGAACTGCAGGTTCTCGCGGATGGTACCTGAAAACAGTTGTGTATCCTGGGTTACAAAACCAATTTTCTCACGGAGCTGATCAAGGTCTATTTCTTTACTTAATGTGCCGTTGTATAAAATATCTCCCTGCAAAGGCTGATATAAACCAACCAATAATTTAACCAGTGTAGTTTTACCCGAACCTGACGGACCAACAAACGCTATCGTTTCGCCCGAATTAGTTTCGAAGCTGATATGGTTAAGGGCATTACGATTGGCTGTTAAATGCTTAAAGGTAACATCGTTAAAAGTGAGGGTTTCTACCTTTTCAAGCAGCACCGGTTTTTCTGGCTTTTTATCGATAGGAATGCTTAATATCCTGTTGAAGTTACTAAGGGAAACCTCAGCTTCACGCCAGGAAAGGATTACGTTGCCTAACTCCTGCAAAGGATTAAATAAGAAGAATGAGTAAAATAAAAAGCTGAAATATTGGCCCGGAGAGATAGTGCTTTTAAAAATGAGCATCAGCAGAACCACCACCATCACACTGCGCACAAAGTTTACAGTAGTACCCTGTACAAAGCTCATGCTGCGTACATACTTCACTTTTTTCAGTTCCAGGTCGAGGATTTTGTAAGTGGTATTATTCAACCTTGCTATTTCCTGTTTTGCCAAACCCAAACTTTTTACAAGCTCGATATTCCTTAGCGATTCGGTTGTTGAACCTGCCAGCGCTGTTGTTTCGGCTACAATTGTCTTTTGGATTTTTTTGATCCTACGACTCATGGCCATACTCACAAAAGTAATGACAGGTATCGCAGCGAAATATACCAGGGTAACTTTATAGCTTACGCTCACTGAGTAAACGATAACAAATACCATCCCGATAAGCGACACAAAAAGGATTCCGATAAAGGAGGTAATAAACTTTTCGCAATCTAAACGCACCTTTTGTAATATGCCTAAAGTTTCGCCGCTCCGCTGGTCTTCAAATACCTGGTAGGGCAACTCAAGCGAGTGCTTTAAACCATCGGCATACATTTCGGCCCCAACTTTTTGAGTAATGATATTGGTAAAATAATCCTGGAAGTTTTTAGCTATACGCGATACCATGGCAACACCAATAGCAGCGCCAACCAGCATCAGCACCTGGTTAAGGTATGCGTCATATTGTAGCTTACTGCGTTGTTCAATAACACGGTCAACAATACGCCCGGTTATCCATGGGTCGAGCAGTGAGAAGCCTATATTCATCGCCGCCAGGAAAAGTGCAAATACAACTATCCAGCGGTGTTTTTTTAAGTAAGAGATTAATACATTCATGTTTCAACGGCAAACATAAAAAAAGGGAATGGCTAAGTAGCTCATTCCCTTTTAATTTGACACAAGGTTAATATTAAACCGTCATGATATCTTTTTCTTTAGCTTCTGAAAGCTGATCAACCTTAGCAATGTAAGCATCAGTAAGTTTTTGAACTTCAGCTTCGCCTGTCTTTATTTCGTCTTCAGAAACACCTTCTGATTTTAATTTTTTGATCTTTTCGTTAGCATCTTTACGGATGTTACGAACAGCCACCTTACCGGTCTCAGCCTCGGCTTTAGCTTTTTTAACCAAATCACGGCGGCGCTCTTCGGTAAGCGGCGGTACATTAATACGGATAATGATACCATCGTTTTGAGGGTTAACACCTAAATTAGCTTCTTTAATGGCTTTTTCAATAGGGTTTAACAATGATTTTTCCCATGGCTGAACAACAATGGTACGGGCATCAGGGGTGTTAACACTACCTATCTGGCTTAGTGGTGTCGGGGTACCATAGTAATCAACCCTGATATCGTCAAGTAATGAAGGGCTTGCCTTACCTGCACGTATTTTATTTAATTCGCTATCTGCATGGTCAATGGCCTTTTCCATCAAGGCCTTTGCATCGTTCACTTGTTTTTTAATGAGTTCGCTCATCGGTTTTTAAATTTTCCACAAAAGTAATAAAATCTATCAGTTTATGGCTTTGGTATTTGGGTTCAATTCAAAAGCATTTAACATTTAATAACATTATTTAACATTTAAATATTATTGAATGATAGGTATATTAGTATAAAATAGTATCACCATGAAAAAAACCTTAATCCTTGCTGCTATCATAGCCTGTGCAGGCCAGCTTAAAGCACAAAACCTCAATAAAGCACCTAAAAACAATAACGCTGTCGATAAACTTTTTAATTTGAAACCATTACAGGTTGACAGCAACTTGTCTAAACTAATGCCGGTACTTCCGAAAAACGGACTTTTAAATGATAACAGGACATTGCTAAATACCCGTGAACTTGTTGATAATGTGACCGTTTATAGCCGGATGCCTGTTGTTAAAAATTACCCGAACGACAATATGCCCGTTGTTAAAACTGACGAACCCGGGATAAAATACCATATGCTGATTAAAAAAACAGATATAGTGAATCCAGATTCTGTCGGGGTAAAAAAGGAAAAAGTTACTCCGTAGCCAACTCGTTTGGCAGTAGTTCGCCGTCTCCTTTATACTTCACCTTTTCGCGCTTTTCAACCACCTGTTCAAAGCTCTTTTCATAATCGGGATGATTAGTGCCCATTAACCTGTCCCATATGTTGAAATATAAACCATAATTGCATTTTACCAACCGGTGGTGCATATTATGATGCGTTGAAGTATTGTGCCATTTAAACAGTTTATGCGTAGTAAACCCCTGGGGAAAAAGCTCATAGCCCAGATGACCGGTTACGTTTAATAACAACGAATACAAAGAAAATATAGTGAGTGCTGTACCGTGATAGGGAATAGTAAAGGCGATGAGCGGTACAATCCCAATCTCCACTACGGCCTCCAGCGGATGGAAAGCATAAGCGGCAAATGGTGTCGGATTAGTTGACAAATGATGCGTTTTGTGCATCAGCTTAAACAAAGGCTTCCAGTGCATAGCCCTGTGTGTCCAGTAAAAGTAGGTGTCGTGCATCAGGATCATCAGCCCTATACTCAAAAAACAATAAGGATATCCTTTATCGCTGATATTTAAATAGATCCTTGTTAAATGCTGCTTACTGGCATAGATCACCAGCATGATCACCGCACCAAATATCAATATGGTGAAGAATGAATAAACAATTTCCCTAACGAGGTGCTTATTTGCGGGATAGCGTTGCTGAATTTTTGCATACCAGTAAGCTCTTTTTCGCCACACATAGAAAAACAGGTAAAACGCACCGGCAAATACAAGATAACGCACTGCTATAGAAAGCAGAACCCGGGAAATTTCTGTTATTTTATCGATGGAAGGTTGCAGTGGCATTGTTAATTTAACGCTGATTATTTTAATTCCTTAGAGGGCACAAGTACGCCCCGCTCCTGCAACCACACAATACTTGCGCCAGTTAAAGGGCCAGTTAAGGTATCCCGTCTTATTTTTATCCGATTAAAAATCCTTATTTAAAACTTTTTCCAGCTCTGCGAAGCTGATATTCATTTTAATTCGGCCTTGTTTGGCATACGAGATCTCACCAGTTTCCTCAGATACAATGATCGCCGTGGCCTCATTTGCCTCTGTAACACCAATACCTGCCCTGTGGCGCAGGCCGAACTGCGCCGGCAGATCCGTTTTTTCGGTAAGCGGCAAAATGCAACTTGCCGATTTAATCTTATTCTCCGAAATCACTACCGCACCATCATGCAGCGGACTGGTTTTCTGAAAAATACTTTCCAATAATCGCTTTGAAATTTTGGCCTCCACCACCTCGCAACTGTTTTGATAAAACTGTTCGTCATAATACTTGGCGAAAACAATGAGCGCCCCTGTGCGGGTTTGCTTCAGGCTTTTGCAGGCATCAATTATTGGCTTGATCCTGGCGTAATTATTTTTCTCAACTTCCGATTTTCCGAAGAAATATTGCCACCAGGCTTTATTGCGTTGGAGTGAAGCATTTTTACCTACCAATAATAAAAATCGCCTTACTTCCTGTTGAAAAACCACGATAATGGCAATGATCCCCACATCAACAAATTTGCCCAGAATAATGGTAAGCAGCTTCATATCCAAAGCCTTCACCACAAAATTGAGAGCAAAGATCACTGCAAACCCAATGAAGATATTAGCTGCAATGGTTCCCCTGATAAGGTTATACAACTGGTAAATGATCAAAGCCACCAGCAATACATCAAGAATAGAAAAGAAACCTATTTTAGGGAAGAGGGATTGTATGGACTGCATTTACGAAGTTAATAAGTTTACCCGAACATAAATGCCGGAAGAGCAGTAATTATGATGTGCGGCCCTTGAACACCGGTCATTTTACATACGCGTTTTATTATTTCAGGGAATGTATGGCCCTCCCGGCACGCCCCAGCCCCATTTCGGCATAGGCGTATCCGGGCTTTCTGTAGTATTATCTGTTTGGGAGTTCAGCACTGCTATCCGCGTACCCCACTCAAGGTAAGCCATAAAAGCTGAGCGAAATTCCGGATCATCGGGCAGTGACAGTTCATCGGCAGTTTGCAGCAACAGTTCAATCCAGCGCTTGCGGTGCGCTTCGGTTAAATGCTTTTGCAAATGTTTATTGATCATGGCATAATGGCTGCCTTCAGTTTCGCTATAGGTTTTTGGGCCGCCAAAAACTTCACTTACAAAATGCGCCACATGCATCCGGTGCTGGGGCGACATATGCTTAAATACGGGTTCCAACAGTTCATCGTCCAAAACCTTATCATAAAACTCGTTGAAAAGCAGTTCAAAAGCAGGTGTACCACCCGCCCATTCAAACAATGTAGGGATTGTAGCTGTCATAAGTGTAAAAATCGGTTGTGATAAAGATAGGGAGTGCATCTTTAAAAAACCAGTTATTTATACGCAGCAATAAAATTGATTATTTGTCCCGTTCGGTAGTAAACCTAACCAGTTGCTCCAGCCCGCGGTGCGAATCACTGTCGGGGAAAGTGTTCAGGATTTCGAAAGCCTCCTCCTGGTACTTTTTCATCTGGGTTTCGGCATATTGTAAGCCACCGGTATCTTTCACAAATTTAATGATCTCAGCAATCTTTTTAGGGTCTTCATTATGGTTCTTCACCAGGTTGATCACCCTCTTTTTTTCTGAACTGCTGCAATTGGCCAGGGCGTAAATAAGCGGCAGGGTAACTTTTTTTTCTTTGATATCGATACCTAACGGCTTACCCACATCATCAGTGCCGAAATCAAACATATCATCCTTGATCTGGAATGCGATGCCTATTTTCTCGCCAAAGAGGCGCATTTTTTCAATCACCTCGTCACTGGCCCCTGCCGACGCAGCACCGCAGGCGCAGCAGGATGCAATAAGCGAAGCGGTTTTCTGCCTGATCACTTCATAATAAACAGGTTCACCAATATCCATGCGGCGCACCTTTTCAATTTGCATCAGTTCACCCTCGCTCATTTGCTTAACAGCATCCGAAACAATACGCAATAATTGAAAATCGTTGTTATCGATAGAAAGCAGCAAGCCTTTTGAAAGCAGGTAATCACCCACCAAAACGGCTATCTTATTTTTCCATAAAGCATTGATAGAGAAAAAGCCGCGGCGCTGATAAGAGTTATCAACCACATCATCGTGCACAAGAGATGCTGTATGCAACAGCTCAACCAATGCCGCCCCGCGGTGCGTAGCCTCATTGATGCCCCCGCAAATACTGGCCGAAAAGAACACGAACATAGGACGGATCTGCTTGCCTTTACGCTTTACAATATAATGTGTGATGCGGTCAAGCAGCGGAACGGAACTTTTCATTGAGTTCCTGAATTTCTCCTCAAACGCATCAATATCAGCAGCAATAGGTTTTTTAATGTCGTTGATGCTCAGCATTGCAACCAAAACAGTATTTTATGTTAACCACCGCCATGTGACAGTGCGCAGCAAACATAAGCTAAAATTATGTATCTGCATTGCAGCCTAATATTTTTTACTCAAATTAAACCAAAGCACACCATAACTGTCATACTTGCGCGAGTGCAATTTTTATAAGTAGAGTTAAGTTTTTGTGAAAGCGAAGCTGGTTAAAACGGGCTTCGCTTTTTTATTGATAGTAAGGGCAAATATTTTACCTTTGCAATCCGCATTTACTCGGAGAGGTGTCTGAGTGGTCGAAAGAGCACGCCTGGAAAGTGTGTATACTCCAAAAGGGTATCGAGGGTTCGAATCCCTCCCTCTCCGCCAAGTATTATTCAATTAGGATCAAAAGCGCTGTAAATGAATTATTTACGGCGTTTTCTGTTTTAGCCCTCACCAAAATAAACCAATTAATCGCTTTGGTCTGGTGCCCCGTGGGTGACCTTTTTCAAGGTAATCCTATCGGTCACCGGAATGCTTATAAGTCACTGTTATTGAGCATGTTCACTATTTAAACATCTTGATTTCAGCAGGATATGTAACTAATTTTATTCACTTAAAAGCTTCAGCGATGAACAAATCTTTTAACCTGCTCTTTTACGTAAAGAGATCAAAAACAAATGTCGAAGGCCTCGCCCCTGTTTATCTCCGTATCACTGTTGATGGCGTACGCATCGAAGTCTCCTCTAAACGCTATGTCAATCCCGACAAATGGAACACAAACGGACAAAAGTTAACCGGCAACAGTGAAGAAGTAAAAAGTATCAACGCTTACCTGAAAACTTTAGAACACCAGGTTTATGATGTTCACCGGGACATGATAGAGCGAAAGTTGTTGATCACTGCGACTAATTTGAAAGACAAGCTTCTTGGCGGCAAGCCTTCACCCGGAAAAATGCTGGTACCGATATTTCAGGAACATAACAGACAGGTTGCCACGCTTATCGGAAAAGAATATGCAAAGGGTACGCTTGACCGCTATGAAACAAGCTTGAAACATACACAGGCTTTTTTGCTTTGGAAATACAATTCGACAGATATCGACATCCGGGCAATTGATCATGAGTTTATCATGGCTTACGACTTTTATCTGCGCTCAGAAAGAAGCTGCAATAATAATTCGACAGTGAAATACCTGAAAAACTTTAAGAAGATCATTCTGATCTGTATCGCAAATGGCTGGCTTGATAAAGATCCATTTGTTAAATACAAACCCAAAGTAAAGGAAGTTAAACGCGATTTTCTAAATGCCGAAGAGCTGGAAGTAATGGCAAACAAGAAACTGGTTAGCGACCGGGTATCTCAGGTAAGGGATATTTTCCTATTCAGCTGCTACACCGGCTTAGCTTATGCCGATGTCAAGAAATTAAAGCGGACTGAAATCGTAACGGGCATTGACGGACAAAAATGGGTTTATACAAGCCGTCAGAAAACGGATACTTCTTCCCGCATTCCTTTATTGCAAGAGGCGATGGAACTAATGGTTAAGTATGAGGAGCATCCTCAATGCGTTAATGATGGCTTATTACTTCCCGTATTGAGCAACCAGAAAATGAATAGCTATCTGAAAGAAATCGCGGACGCCTGCGGTATCAATAAAGAATTAACCTATCACATAGCCCGTCATACATTCGCGACTACGGTAACACTTGCAAACGGAGTTTCCATAGAAAGCGTATCAAAAATGCTTGGCCATACCAATATCAAAACAACACAGCATTATGCTAAAATCCTTGATATGAAAGTGGCGCAGGACATGTCTAAGCTAAGGAAGCTTTATTAGACCGCCTATTTACACAACCATAGAGCCACCGTTGGCTCTATGGTTAATTATTACATTTTAACTAACATAAATGAACAATACAACTTCAATATCAGACGATATCATCATGACCAAGATCTATTATATCAGGGATCAAAAAGTAATGCTTGACAGCGATCTGGCTGAACTTTATCGCGTAGAAACACGGCGCTTGAACGAACAGGTAATCAGGAATATGGATAGGTTCCCGAATGACTTTATGTTCAGGTTGAATGAATCGGAGTTTGAAAGTTTGATGTCGCAAATTGCGACATCAAAACGTGGGGGAAGGCGAAAACTCCCCTATGTGTTCACTGAGCATGGCGTTTTGATGCTTTCAAGCGTATTGAATAGCAAACAGGCCATCCAGGTTAATATTCAGGTTATGCGGATATTCAACCGGATACGAAACATGTACCTCGATAATACGGAACTCCGTTTGGAAATTGAACAAATCAAAAGCAAACTGATCAGGCACGATAAAAGCCTTGAATTGGTATTCGGTTATCTCGATGAACTGATCGAAAAGAAGTCGCAGCCAATAGATAGAAAAAGAATCGGTTATATGCCGGATACTGATAGCTTTTAAAAAACAGACATCCCATTTATCTTTACGTATTTCAGGAACACTGGCGTGTAAATCCAAGCCAGTGCATCCTTTATTTTATCTATGCATCAATTGTTATCGTCTTGCCGCGATCAAAATTGCTGTGCTCATCCTGGTGTACATATCCGAGTTGATTAGTCAACAGCTTCGTTTTCCCGATATTAAATTCAGGAGTAGCATCGTGATGATGGCCGTATATCCAGTAATCAGCACCGCTATCCTCAATAAAAGCATCTAAGTCGGTTGCAAAAGCTTCATTTAAAACGCTGTCTGAATATTGTTTGGGATAATGCTGAAATGTCGGCACGTGATGTGACACAATGAGGTTCCTGACATCCTCATTCCGGTTATACCCTCCAGCATCTAATGCAGTTTCGAGAAATGCCAGGGCGTCAGAATGCATCTGGTTATATTTTTCCACCGAAAACCACTTCCCTTCATTTTTGATTACCCTGAAATCCATCATCCCGCGTTCAATATAACGGGCTTTGTATTTCGATATTTCCGACCATAGTGTCGTAAAATGGATTCTGATCTGGTTCAGTTCAATAGTAGAATTATTAAGCAGCGTCACATTGGATAGTATCGATTCCCGAAAAGCGCCCGACCTGTTGGCTATATCGAAATGGTAAAATTCGTGATTGCCGGGAACCCAATAAGTATGCTCATAATTGTCTGCAAGCAAACTGAAAAAATCACGATGACGGTCAATATCCTTCAATGGTATTATATCCCCCGCCAATAAAAGAATATCTCCATTTACCTGCAAAGGGTTTAAAGCCATATAATTCCGGTTTTCCGGGAACTCCAGGTGAAGATCAGAAGATATCTGTAATTGCATAATTGTAAATGTATTTAGTATTTATTTAACGATGGCACACTAAGGCTTCAGAAATCATTCTTATCCGCATCTTTCAACCATTCAACATTGCCCGGATCATAAGGGTCGAGCCAGTCTGCTTTGGCATTCGCCCAGGCAAACCATTGTGGATCAGGATCCGGCACGGCCAATGTATAATCACGGATCAGCTGAGCTCGCTTCCAGCGTTCGGCATCCCTCATCAATTCTTTATAAGCATCTTTTTCCTGCTTTTTTCGGGCATCAAATTCGGCTTTTTCTTTTCTTTCACGTTCCCAGTTTGCCTGCCAGATCCGGGCTTTTTCCAGGTAAAGTTCTTCCTCTTTTGCGGCCAGTTCCAGCTTTGCCAGTATTTTGGGTAATTGCTCTTCCAGCAAAACCGCTTTGAGGTCCTGCCATTCTGATTTCAGCCTCGTATCAATTCTGAAAACAACTTTCCCGTTCGGGTGCCATTCGTAATTACGATATGATTTATCATATACCCTCACCTTGGTTGTCCGCTCCCGAAATGTCACTTTTAACCTGATATCGCGTATCACCACATAATTGCCATCGTCATGCAGTTCAAAGAAATAGCCGCGTACCCTCACGCATTTGATCAGCGTAGGCAGCACCGATAGAGGTTTATGTGGTTAACCTTCAGGAAAGTAAATACAGGTTCTAGCTGTTCTTTTTCGTAAGCCAGATGGAGTGTTGCCTTGATTCTCCTGGATTCTGTTCTTTTTGCGGCCTGCAAATCCTCGTCGATCATTCTTGAAGCGAGATTAAAAAGTATGTTCAGCTCCAGCCGCCTGTTCATGTAATATGTTGCGGAAGAGCCGAAAAGCTGTAAGAACTCTTCAATCCTGGAGGTGGTCAATTCGCGGATCGACATTTCACGTCCTTCCGCTCCAAGGAACTCCATAAAATGGTTAAAGGCGTAGCGTAAATTGTTTTTGTGGTGCTTGCTTAGATCGGATTCGAGCTTTTTATTCAAGGCCTGCTGGAATAGCAGGCCGATTGAATCATTTTTTATTTTTTCGTTCATTAAGCAAAGTCTTTCACCGACAGTCAAAACGGCAATCACAGTGAAGCTGGCAGGGCCGTGGAATAATCCACGACCTCTGCCCGAACTTCAACTGCGGTCCTAGCGGTGCCCTGTGCGGTTTCTTTTTTTCCATCGGCAAAAAGATTAAGAGACTTAATCTATGTACACAACCGATCTGACTCATGGTCAATTTTTATTATCTCACAAACTTTTTTGTTTTGCTGGTCTGTAGGTATATGTAGTCTTTTTATAAGCCAGAAGCATTTATCGCTACATCATTTGCCGACATCACAAATGTCACTTAATTATATGTTCTAAACTATCCTTAAACTTTCTAAACTATTCTTCATCAAGAGCCAAACGACAGGTTTACTTTCTGACAAGAAACGCTCATCTTATATATAAGCATACAGTTACATCTTCTGCAAAAAATCAAGGGGAGCTTCAAAACCGTATCTTAGAGCCTTTTCTCTTCTATCAATGCTGTGTAAAGATCCCTAATAGCGGCATTTTCCCTTTTTAGTATATCAATTTCATGATAAGCTGCGTACAGCTCTTGGTGCGCCATAAGAGCCTCAGTACCCATTGCTACCATTTTAACGTGTATTTCTGGTGGCAAAAGCCCTGCACCCGGAAGTTTTAATAAAGATAAAACATCTTTCATCGTGTTAGCTCGCAGCGTAGCCTCTCTTTTTATGCTTATATAAAAATAATCCGCGGGAATATCAAAAATTTTGCGAAGCTTAAGACAGGTATCTTTATCCAGCAGTCTAACTTCGTTTTCAAGTTCTACATAGGTTTTTTCGTCCATGTTTAATAAACGTGCAAGCTGCCCTTCGGTAAATCCCTTAGCTTTACGTGCTATATATAATACGCTATTCATGATTTTTTTTACAAATAAGACGAGTTTGGAGCGTATATAAAATTCAGGAATGTGCTAAAATATTCTCTCACTATGTAATGAATTGGAGGTACCTCTCATTACCGCCCTATCTTTTAGGAGCTATCGTATACTATAATGTAGTATGAAAAGAATAATGTATTACTTTGTAAATCAGATGAATACCTAAACGGAAGGGCGATATATTTATCAAAAATTAGAGAATCTACATCTTAAGAGATCACGTAGTCATCAACTATTATTTCTATATTAGATAAACCAAGGCTTTTTATTATTTTGTTTACATTTGACCGAACTCTACATACTAACTTATGAACACGACTGATACACCAAAAAATATTCATCAAGGCCGTAACGTTAAACGCTTCCGTGAAATGCTGGGGCTTAAACAAGAGGCTTTGGCTATCGCACTTGGTGAAGAGTGGAGTCAAAAAAGGGTTTCATTACTGGAAAGTAAAGAAACTTTGGAGCCTGAAATTTTATCTCAAGTGGCGAAAGCATTAAAAGTGCCGGAAGATTCTATCAAAAACTTTGACGAGGAGGCCGCAATTAATATTATCAATTCAACATTTACCAGTAATGACAACTCAACGTCGATTGCATACCAGCCCAATTTCACATTCAATCCTATCGACAAACTTGTAGAATTGTTTGAGGAAAACAAAAAGCTCTATGAACGACTGCTACAAAGCGAACAAGAAAAAGTAGAGATTCTCAAAGACAAGCATAATAAATAAAGCTCTTTTGCAGCTTATAATTTTATTTTCTTAAGCCTTTAGCTAAGCAAACTGTTACTTCATCCGTAACCAACCTCGAACGCCATGAAAGTAACTATCAGAACGCATGCCCTTACCAAGGGTCGTCATCGCCTGTTCTTGGACTACTACCCCTAATCGTAAATCCAAAAACTAAAAAGCAAACCGGATATGAAAATCTCAAACTCTTCGTATATGACCATCCGACAACACCTGCGGAAAAGAATCATAACCGGACAACTATGGAGCTGGCAAATACCATCTGCGCCAAACGTCAACTCGAATTACAGGCTCAAAAGCATGGAATGTCGCCCAGTTTCAGGAAACATGAAAGTTTTATTACCTACTTCAGGAAACTTGCAGATCAGCAACGTGGCTTAAATTGGCACAATTGGGATAGCAGTGTCAGGTATTTCCAGCTTTTTGCAGATGGCGCAGATATTAGTTTTGCTGAACTTGACCTGTCTCTTTGCGAACAATTCAAGCGGTTTTTATTGGACGAACCTAAACTGCGCGAATCAAGGCGTGGTATCGGGCATAATAGTGCGCTAAGTTATTTCAATAAATTCCTACAGCACTAAAACAAGCCTGGCGCGAGAAACTGATCAGCGATGACTTACACGCGTTGTCGCCGGGACTTAAGGAAATCGAGGCGGAAGTTGAGTTTCTAACGATGAGTGAAGTGCGCCAGATGCTTACCATTCCAATAAAAGATGATTTATATAAAAGGGTCGTGTTGTTAGAATTTTAACGGGACTTAGGTTTTGTGATATAAAATACCTCAACTGGAAACAGGTTCGCGGCGAGCAGGATAATTACTATCTCCAATTTCGACAGCGTAAAACCTATAAGCCCCAACTCGTTTATATCTCTAACCAAGCCTTTTATCTCTTAGGCGAACGCGGAGATCCTGCCACACTTGTCTTTCCAAAAGTTAATTACAATTACACCAGGGATCTTTTAAAAGTCTGGCCTAAACAAGCCGGTATAAATAAGCATCTGACTTTTAGTTGTCTCCGGCATACCTATGCTACTTTACAACTTGACCACGGAACAGATATTTATACTATATCAAAGCTATTAGGACACCTGCACCTTAAAACCACGCAGCGGTATACCCGTGTTATGGATAAAGCAAAAAAAGAAGCCGCAAACCGGATCGTTCTCGACTTGTGATCTGCGCACACTATGTCATATCACCGCTTAAACAACCGGAGCTGTTAGCAGGAAAACTGTCAAGGGTCCCGATCCTATCATCGGGATTTATATACCCTTGACGGTTTTCCTGCGTAGCTAATTTTGTGACTGCGGTGCTGTGACTAACCTACAACTCGTGCGCGGGAAAGGCAGGTGTGAGGAACGAATACACTCCTGCGCGCGCGGCGGCAAGGCTGGCGGCGTAATGGAACCCGGAGCGAAGCGCGTGGTGCAATGGAGCCGCCTGCCGCAGCCGTATGGCCGTCCCCTCTCTCATCTCATAAGAAAACGAATAAAGGTTAATCACTTAGCCAATATTACTTTGTTCAGGTTATCCCTAGCCGTTTTCTGATCCTTGCCTGCCTGATCGGCAACAGCTTTAGCGGCAGCAATTGCCTGTTGTTTCTCCATTGCCTTTTCAGCCTCGGCAATAAAAGCACTCGGATCATCCTTTAATCTTGTAATAATTGTTTCGGCCGTTTTAGGATACATCGCCCTGACCATCCAGAAATTATAAGCTTCCTCGTTTAGCCGGTCATTTTGATAATTTAAATATAAAGCAGTACCAAATGATCCGGTCGTAATAACAAATACAATGACTATCCCAATGATCAGGCTTTTTGACCATGCCCCGAAATGATGGCTATTTTTGACACCGATCACTTTTGGTACTCTCGAAACTACATCTTTCAATTCATTGATCTGTTGCTGGATTTTTTTATCATCATAACGTTTATTGGCCGTTTCAATCCTGCCAGCTATCCGGTCTAATGTTTCCTTGTATTGATTTAAGAAACCCTTTAAAAATTCCGGCATTTCGGTTTCTACTATGGTTATCCTTTGAATGAACCCCTGTATCAGTTCCTCCAGCATCGTTACTTTTTCTTCCAGTTCTTCCTGTTGCATAAAACTTAGTTTTAATAATTAAATAATCTTTTTTGTTATCACGCTACCTGCTTATTCCTTGTGACTGGGAATGCTCCGCTTCATGTTTCTTTCGTTTCCTTCGAGCAATATCCGCATCTCCCATCGGGTCAGGCGGCCCGGCAACAAACTGCGGTTCCAACAAGGTTTCCAAGATGCCTTTACCGTGAATTGGAAGGTGTAACTGATCAGCCTGGCCATGCTGTCGGATCGCCTCCCTTAACTGATCGGCTAATGAAAGCGAATCTGGTCTTCTTATTTCTTGCTGCTGTTGCATCTGGCGATTGACTTTAAGCTGCGTATTAATCCCGGCATAACTCAAACTTCTGTCAATGGCAGAGCCTTTCATTTTGATGTCTCCTTTTTCAAAAGAGATACCCTGTACCTCGGTGCTTCCACTACGAAATTTATAAGCAATGCCTATACCCTGTTTATGCAGCTTCGCTTCCAGTTGTTTCCAGTCTGTTGATTGTTTAAGCGTTGCTTTAATAATATCAAAAAGTTCATACCGGGTTTTCTCCGTACCTTTTAATGCCTGCCTATTTACCTGATCCTTACCTTCACCTAAATGATAGCCATACTTCAGTGTGATCTCCTTACAGGCTTTCACATTCTTCGCAAAATTATTTTTATCGGTAATGGTGTTGCCATTATTGTCTACCCTGTTATAGATCACATGTAAATGCGGGTGTTCCCTGTCATGGTGACGGACGATAACAAATTGCGTATTACGGATACCTATTTTATCCATGTATTCTTTTGCTCGTTCCACCATAATTTGATTAGTCAGTTTGGTCAAGTCTTCTTTACTCCAGCTTAAAACCAAGTGACCCACCGCTTTACCTAACTCCGGCCGCATCTTTCGTTGCAGGTTGAAATCATGGGTGATCGTACTTGCATTCTGCATCCTCACTCCTTCACTAGCAATGATCTTTGCATCCTGCTTGTTTACCACATAGTGGACACAGCCGCCAAAGCTCCTCCCGGTAATTGGTTTACCTATCATTCCTGCTGATCTTAGTTAATAAATCTTCGATCTGTTTAAGCATAGCCTGACACTTAAAAGCCAAAGAAAAAAGCCCGGCCACATGGGCAAGGTGTGTTAGTTGGTTAAGGTTATTGGCAAGCCCCGCCAGCATCCTGAACCAACCGGTTTCCTCTACAGTAAATCGCGGGAAGACTTTGGCGCTTTTGGCAGCCCTGCGGAACCATTCGCTCGATCTCAAGCCAGCTTTCTTTGCCCGCCCATCGATCAAAACACGTTCCGTCGGCGTAAGCCGCACCATCAGGAAATCGCTTCGGCTTACTGTTTTCTTCGGTCTGCCGGGCACCCTTTTTTTCTGTATTTTCTCTTTTTCAGTGGATCCTGTCATAAAATTTATTTCCTAAACCTTCTTCCCAAACCGACCAACGGGAGCGAGTTACCGGATCACCCCGGCGGGGTGTATCCGGGTTTTTGTGAAGCAAAAACATTAACTCGCTCCCTTCGGGTCTTAGATTTCTCCACCATATATCCCGTTATCTGTATTTGTTCTGCATATCGCTCGCGGTATGCGATTCCAATGTTGCAAAACGAAGTTTTGCGGTAATTTAGACGGTATCCTTTTTCGCCGCTGCTTCTGCTTCCGGCCAGTAAACCAGGCTGCCCTGCTTGGGATGATAAGATGGCTGATACCGGATATAACTATACTCATCCAATTCCCTAATGCATTTGTGGTAAGTGGCAATTGATGCGACCTTGGAAAACGCCATAAGCGTTTTCCGGGTCACGCTAAATGGGCTGACAAAGCCGTTCCGCTGCCAGCAGACAAACAGACCGGTAAACAGGCTAAGATGGGTTGGCAACAATCGATTATCCTTTTCCATGCATTTAAGCAAGCTGGCATAGGCTGCCAGCTCCTTAACATATCCCTAACAGCTTTCTTACCTCCACACTCTTCAGCCATTGCTTGGTACTGGCCTGCCCCGGCTGTATGATCCTTTTGATCTCGTTTAGCAGATCGCTTTTAAATTCCAGTAAATCCCCTTTGGTGATCAGTTCCACATTCATCTGTTCCTCCTTGTTTAATCGTTAATGGTTTGTTGCATGGCATTCAGTACTTCACTTTTTCTGAAGTATACCCTTTTATGCAGGCGCAAATAGGGTAAGCCTTTTTTCATCCAGTCAGTAAGCGTAACGAGAGACACGCCGAGTTCAGCGGCAAGTTCCTGCTTGGATAAAAGTTTTTCTGAGTTGGGCAGCTGGCTGCCGCCCGTTTTAAAATGTTCCTGTAGTTTCGCCCTCACTGCCTGGCGTATGCACAGGCTAAGAGCTCCTTGATCAATTATCTGCATAGCATTTGTTTTGGAACAAATGTGTGCTGTAGAGAACAGATTTCTTGACCATGCAGAACTGCATGGTTAAAATGGAGTGGTTTGGAGAAAGCTAAAAGTCAGGGCTGACGTATAAGAAACGGTAAAAATGAGTAGTGTTACCAACAAAATGTTGGTAAAGAGGGAATAAGTTTAAATGTTCGGGGGGGGTAAAGCCTTTTAGAATTTTTTCTTTGTAAAAAGAATATGACAACCTCACTCCATGATCATTTCAGATGGATACCCGATCCGCGCACGGGCAATAATAAGAAACATAATTTACTGGAAGTAATAATCCTATCGGTATTGGCGGTTGTATGCGGTGCGGAAAGCTGGTACGAAATGGAGGAGTTGGCAAGGAAAAAGAAGACTTTTTAAAACAGTTGCTACCGCTTGAAAATGGCATACCGAGTCATGATACGATCAATCGCGTTTTTATGATGATCGATGCGGATGTTTTTGAACGCTGTTTCCGCGCCTGGACGGCGGAACTTGGCCAGAGCCTTCAAACAACAGGTACATCTGGCGAAAGGGAACTGATTGCTATCGATGGAAAGAGCGTCTGCAATAGCGCCTGCAAGCATCAGGGATTGGGGCATTACATTTGGTAAGTGCCTGGTCAGGCCGTAATCAGTTGGTACTTGGGCAACAAAAGGTGGCTGACAAGAGTAATGAGATTAGTGCCATCCCGGCATTACTATCTTTATTGAACATTAAGGGGCAGTAGTCAGCATCGATGCGATGGGCACCCAAAAAGCAATCGCTGAAAAGATTATCGAAAGTCAGGGGATTATATCCTGGCCCTGAAGCAAAACCATGAAACGTTTTATGACCAGGTAACCAACCAGTTTAACTTCAGGGAAGACAGTTACAGCCAGCATCTGGATAAAGGGCACGGGCGGTCTGAGATCAGAACCTGCAAGGTCATACATGAACTTAACTGGATTGACGAAAAGGAAAATTGGAAAGGAATAAAGAGTATTATCAAAATCACCTCCGAACGCATAATTGGGACAGCCGTACTACACAAAATCGTTATTACATCTCCAGCCTTCAGGCAGACGCTGCCTATTTTAACAAAGCCATCCGCACACATTGGGGAATAGAAAACCAGTTGCACTGGCAACTGGATGTTGGTTTTGCCGAAGATTATAATACCACACGAAATAGTCAGGCTGCACAAAATCTTGCTGTAGTCAGAAAGATAGCTTTAAATATTCTAAAGGCTGACAAAACCAGCAAAGCCAGCATAAAGGCCAAAAGAAAAATGGCTGGGTGGAATCACAAGTTTCTCCTAACACTTATAGCTAACAAAAATTTCTAA
Protein sequences of DBSCAN-SWA_2 >NZ_CP043449|5114225:5159919|5130301_5130742_+|WP_112658349.1|DBSCAN-SWA MEEKYQNKYRTSSPRLAGWDYGAHVLYFVTICTLNRVPYFGDILPNENNEALFLQSSIMGEFAHINWLKIPEINPFVELDEFVVMPDHLHGILFLTSLIKSIGKQINSVFKKIAWLLYCVAINHPLRNMRMTTASILLGSLVIMIG >NZ_CP043449|5114225:5159919|5134507_5135563_+|WP_112658339.1|DBSCAN-SWA MEASLPLPKSRTGIKTAWIIFWHLFFWAAIISFFVFLARLNDAMLSTKELLVIFLLYPTINISLFYLNYLVFIPKFLNKKRYWQYATFVLTSIIVYGVAKYGVGLLFKDIVLVHQKGHVTSFATYFYSSIFTSLIFVFLSTALKFSTDWFLNERIQRDLENQRLIAELAFLKSQINPHFLFNSLNSIYSLAYQRSETTPEAILKLSEIMRYMLYECNENKVDLSKELQYLQNYIDLQKIRFGSKAFIDFKVDGNITDQKIEPLLLIAFIENAFKHGVANDINSPIKLLINVADGKLHFYIQNKKHNNNRDAVGGIGLINVQRRLNLLYPNRYSLKIRDEEYTYTCELSIDL >NZ_CP043449|5114225:5159919|5141850_5143611_-|WP_112658329.1|DBSCAN-SWA MNVLISYLKKHRWIVVFALFLAAMNIGFSLLDPWITGRIVDRVIEQRSKLQYDAYLNQVLMLVGAAIGVAMVSRIAKNFQDYFTNIITQKVGAEMYADGLKHSLELPYQVFEDQRSGETLGILQKVRLDCEKFITSFIGILFVSLIGMVFVIVYSVSVSYKVTLVYFAAIPVITFVSMAMSRRIKKIQKTIVAETTALAGSTTESLRNIELVKSLGLAKQEIARLNNTTYKILDLELKKVKYVRSMSFVQGTTVNFVRSVMVVVLLMLIFKSTISPGQYFSFLFYSFFLFNPLQELGNVILSWREAEVSLSNFNRILSIPIDKKPEKPVLLEKVETLTFNDVTFKHLTANRNALNHISFETNSGETIAFVGPSGSGKTTLVKLLVGLYQPLQGDILYNGTLSKEIDLDQLREKIGFVTQDTQLFSGTIRENLQFVRPGATDEECMDVLQRAACQTLLARADKGLSTVIGEGGVKVSGGEKQRLSIARALLRRPDILVFDEATSSLDSITEEEITETIRNVSEKENHITILIAHRLSTIMHADSIYVLEKGRIIEGGKHADLIAQKGLYYAMWRQQIGEKDTADALD >NZ_CP043449|5114225:5159919|5138105_5140328_+|WP_112658335.1|DBSCAN-SWA MSYQVDLTNCDREPIHIPGKIQSHGFLIAVNSKSYNISYISQNVESYTGINAIGLLGKNISELEEQLAIANDTASLQQLLKLAKGAKSSDTINPLAIDVKGTNYNLIISHSVNDLVLEFEPTESDLGGDIQKTIGRSVAEILSGKSLSLLLQKAASEIKKIINYDRVMIYKFNEDGHGEVTAEVKNDDLEPFLGLHYPASDIPKQARELYKINLTRIIADVNSESSAIITYEEGAAPLDLTHSVLRAVSPIHIQYLKNMGVDSSFSISLIAHGELWGLIACHNYSPRFIDYKSRDASKLIGQILSSALEYRQDEEELAKLHELGEAANTIADYIKKDTDFTFALTRNKVTIKDITTATGVALVYDEEITTIGQTPTEEQITEIVEWLKVNMVDTIYSTYRFPEIYPAAKNYSDVASGILSCSLSRELGEMIIWFKPEQVKAINWAGNPEKPVEETADGLLQLSPRKSFDSWTQIVKNTSEKWKHDEIAAVLKVREDVIFAINRKANEIRILNERLQLAYDELDTFSYTISHDLRTPLSSIKNYSELLLASNKSLDDSAKKMLERIIKGTDKMHMLIKEILHYSRVGRTNIEAAPIDMGLLLNEIKREVLSALKPESIEFTIGDTPGIDGDSVMITQVFTNLINNAVKYSSKSTPSKVKVEGIVRNNEIVYSVSDNGVGIDVNYYNRVFELFKRMDNALEFEGTGVGLAIVKRIVEKHKARIWFESKLGFGTVFYISFKIS >NZ_CP043449|5114225:5159919|5158307_5158577_-|WP_112658299.1|DBSCAN-SWA MQIIDQGALSLCIRQAVRAKLQEHFKTGGSQLPNSEKLLSKQELAAELGVSLVTLTDWMKKGLPYLRLHKRVYFRKSEVLNAMQQTIND >NZ_CP043449|5114225:5159919|5140330_5140750_+|WP_112658333.1|DBSCAN-SWA MSSPDILYVEDDEDFAFIMQHAVREVKDGLTVKIIDNGKDALEQLKQLTEARVKPKLILLDLNLPGLSGLDLVKRIREISFLKYVPVIFFSTSDNPKDVKASLEFGANAYLTKPAGYLNLVNCVESLYNFWFTKNLNVN >NZ_CP043449|5114225:5159919|5154364_5154703_+|WP_167516243.1|DBSCAN-SWA MELANTICAKRQLELQAQKHGMSPSFRKHESFITYFRKLADQQRGLNWHNWDSSVRYFQLFADGADISFAELDLSLCEQFKRFLLDEPKLRESRRGIGHNSALSYFNKFLQH >NZ_CP043449|5114225:5159919|5122112_5123498_+|WP_112658364.1|DBSCAN-SWA MNEHNFPPHFIASLSAENGFDEENFVKAHRFEEAPTSIRVNPFKPSALKSTQNVPWCANGYYLESRPSFTFDPLFHAGCYYVQEASSMFIDHVFKTINQNNDDAVKVLDLCAAPGGKSTLLNSALQPADLLVANEIIKTRVPILTDNLNRWGTANTIVTNNDPKDFGRLKSFFDIILADAPCSGSGMFRKDPDAMSEWSEGNVNLCHQRQERILADIYPALKEDGYLIYSTCSYSHQENEDILDWLCTEFELESIRIPINNEWGIVETQSPEKKAWGYRFYPGKIKGEGLFAACLQKKENSGDLASFKNNTQQKLPGKELDQVRSYINNPDDLYYFKVNDDWMAINRAHKESLNILQRQLYIKKSGVIIGKLAGKDLIPDHELALSLIINKDAVLETPLNKDQAIQYLRRDNIELSTTDKGWTLMTYEGHPLGWAKLLPNRINNYYPKEIRILSTHPPREW >NZ_CP043449|5114225:5159919|5141123_5141729_+|WP_112658331.1|DBSCAN-SWA MLHEKIKQATAHLHDQLEQKMFTGQIMDGSFTFKQYQTILTVNYATHLAVEDFLFNNLSDELCQKLNIEARIKLPALLRDLEEINSSVSSSLPEAPEYIDLNSDASILGAMYVLEGATLGGNVIAKRLKTNGQLLSYNLSYHYYQVYGDQLGLKWKQFLEVLNAIPEAEHEAAIKNAVCLFEHMANTEVASTTVILVPPEV >NZ_CP043449|5114225:5159919|5140739_5141111_+|WP_091175516.1|DBSCAN-SWA MSIKTLKKILVIEDDGDIADIMGIALCDKYEVEVKRDGYEVVAQIERFSPDLIILDNYIGQRQAREIIKEIHEVDDHRTIPFVLFSGHDNIMQLAENLNATSYLAKPFALDELYKCIDGILAA >NZ_CP043449|5114225:5159919|5145732_5146524_-|WP_090530839.1|DBSCAN-SWA MQSIQSLFPKIGFFSILDVLLVALIIYQLYNLIRGTIAANIFIGFAVIFALNFVVKALDMKLLTIILGKFVDVGIIAIIVVFQQEVRRFLLLVGKNASLQRNKAWWQYFFGKSEVEKNNYARIKPIIDACKSLKQTRTGALIVFAKYYDEQFYQNSCEVVEAKISKRLLESIFQKTSPLHDGAVVISENKIKSASCILPLTEKTDLPAQFGLRHRAGIGVTEANEATAIIVSEETGEISYAKQGRIKMNISFAELEKVLNKDF >NZ_CP043449|5114225:5159919|5133163_5134063_-|WP_112658343.1|DBSCAN-SWA MVIKTEGLSFTFGKQQVVKSLALQVPEGSIYGFLGPNGAGKTTTIKLLLNLLKPDAGNIHIFEQDIKTSRISILSQIGSLIEQPALYQHLTGRENLLNRALLLQVPAQRVEDMLDLVQLTAAANKKAGQYSLGMKQRLGIALALLANPKLLILDEPTNGLDPNGIIEIRELLKKLVAQHGKTVFISSHLLGEIERMATHVGIINNGAMLFQGSVADLQAISKPLVRVDAENTVDAANLLTRHHINVTEVTDEHLLVPYISKQQMGEINALLNKNGQMVYSISKQQKDLEKLFLDITQRA >NZ_CP043449|5114225:5159919|5134056_5134266_-|WP_112658341.1|DBSCAN-SWA MKTLIKNSNPFIMLLIPVMFALILGVSYQFEQRKETAIIGRTAVHATSLFHKGFVMVKTVCSVAKNNIW >NZ_CP043449|5114225:5159919|5143684_5144248_-|WP_090530830.1|DBSCAN-SWA MSELIKKQVNDAKALMEKAIDHADSELNKIRAGKASPSLLDDIRVDYYGTPTPLSQIGSVNTPDARTIVVQPWEKSLLNPIEKAIKEANLGVNPQNDGIIIRINVPPLTEERRRDLVKKAKAEAETGKVAVRNIRKDANEKIKKLKSEGVSEDEIKTGEAEVQKLTDAYIAKVDQLSEAKEKDIMTV >NZ_CP043449|5114225:5159919|5126405_5126972_-|WP_112658351.1|DBSCAN-SWA MEKPYQGRAIVEDNFEYLKITIPVKRNYISIIWICCWVTFWGYFGASAFGELFNSPNEFGRVFMLLWISGWALGGVMAVKTIIWLFWGKEIIEIGMGTFSINRKNDLFTREKIYDLNECREFGIREIESESDRQFWFVFYYRNKEKGTIAFDYGMRNFCFGEDLYEPEAQYLLNVMKSKKVLTESQFK >NZ_CP043449|5114225:5159919|5144399_5144789_+|WP_112658327.1|DBSCAN-SWA MKKTLILAAIIACAGQLKAQNLNKAPKNNNAVDKLFNLKPLQVDSNLSKLMPVLPKNGLLNDNRTLLNTRELVDNVTVYSRMPVVKNYPNDNMPVVKTDEPGIKYHMLIKKTDIVNPDSVGVKKEKVTP >NZ_CP043449|5114225:5159919|5148743_5149955_+|WP_112658321.1|integrase|DBSCAN-SWA MNKSFNLLFYVKRSKTNVEGLAPVYLRITVDGVRIEVSSKRYVNPDKWNTNGQKLTGNSEEVKSINAYLKTLEHQVYDVHRDMIERKLLITATNLKDKLLGGKPSPGKMLVPIFQEHNRQVATLIGKEYAKGTLDRYETSLKHTQAFLLWKYNSTDIDIRAIDHEFIMAYDFYLRSERSCNNNSTVKYLKNFKKIILICIANGWLDKDPFVKYKPKVKEVKRDFLNAEELEVMANKKLVSDRVSQVRDIFLFSCYTGLAYADVKKLKRTEIVTGIDGQKWVYTSRQKTDTSSRIPLLQEAMELMVKYEEHPQCVNDGLLLPVLSNQKMNSYLKEIADACGINKELTYHIARHTFATTVTLANGVSIESVSKMLGHTNIKTTQHYAKILDMKVAQDMSKLRKLY >NZ_CP043449|5114225:5159919|5123741_5124263_-|WP_149354143.1|DBSCAN-SWA MAVGLNVSNTVDAYNSNYSSSSLAGWHAGLTFDVPIIYPLSFAPEVLFSQKGYEVNDRDGKFTQRTNYIDVPLLAKFRVVRGFNLLVGPQLTFLTSTKNTYESGLGTSVDHIDNDASHSYVAGVVGVSFDINRNVEIRGRYNIDLGENRPYADQNLPDYRNQVWQIGLGFKFQ >NZ_CP043449|5114225:5159919|5146626_5147097_-|WP_112658323.1|DBSCAN-SWA MTATIPTLFEWAGGTPAFELLFNEFYDKVLDDELLEPVFKHMSPQHRMHVAHFVSEVFGGPKTYSETEGSHYAMINKHLQKHLTEAHRKRWIELLLQTADELSLPDDPEFRSAFMAYLEWGTRIAVLNSQTDNTTESPDTPMPKWGWGVPGGPYIP >NZ_CP043449|5114225:5159919|5158978_5159206_+|WP_167516245.1|transposase|DBSCAN-SWA MLPLENGIPSHDTINRVFMMIDADVFERCFRAWTAELGQSLQTTGTSGERELIAIDGKSVCNSACKHQGLGHYIW >NZ_CP043449|5114225:5159919|5159377_5159527_-|WP_167516246.1|DBSCAN-SWA MYDLAGSDLRPPVPFIQMLAVTVFPEVKLVGYLVIKRFMVLLQGQDIIP >NZ_CP043449|5114225:5159919|5157238_5157616_-|WP_112658303.1|DBSCAN-SWA MTGSTEKEKIQKKRVPGRPKKTVSRSDFLMVRLTPTERVLIDGRAKKAGLRSSEWFRRAAKSAKVFPRFTVEETGWFRMLAGLANNLNQLTHLAHVAGLFSLAFKCQAMLKQIEDLLTKISRNDR >NZ_CP043449|5114225:5159919|5155622_5156225_-|WP_112658307.1|DBSCAN-SWA MQQEELEEKVTMLEELIQGFIQRITIVETEMPEFLKGFLNQYKETLDRIAGRIETANKRYDDKKIQQQINELKDVVSRVPKVIGVKNSHHFGAWSKSLIIGIVIVFVITTGSFGTALYLNYQNDRLNEEAYNFWMVRAMYPKTAETIITRLKDDPSAFIAEAEKAMEKQQAIAAAKAVADQAGKDQKTARDNLNKVILAK >NZ_CP043449|5114225:5159919|5130918_5132265_+|WP_112658347.1|DBSCAN-SWA MKHIFLFLLLILTLPATAQTIIKQDAVIKQMVDEVSAKNIEATIRKLVSFKSRHTLSDTTSKTMGSGAARNWIKAEMEKYATESNGRMTVQFDTFTQPKGTRIDKPIKLKNVLATLKGTDPNDTRVYLVSGHYDSRINDVMDANGVEPGANDDASGTALSMELARVMAKRSFPATIIFMTVVGEEQGLYGSANVAKRAKAENWNVDAMLNNDIVGNTYGMETDLKDNRSVRVFSDGVPTAATDKQVAALKSLGGENDSPSRQLARYTKEIGERYVDQLDVKLIYRRDRYLRGGDHLPFLEQGFTAVRFTEMNENFNRQHQNIRTENGVEYGDLPDFVDFNYVQKVARMNLAVLANLASAPAGPQNVSVITSDLTNKTKLKWEAPATGKKPAGYYVLMRETISPYWEKKFYVTDTIATLNYSKDNYFFAVQSVDAEGHESLPVFPKPVR >NZ_CP043449|5114225:5159919|5147176_5148148_-|WP_090530846.1|DBSCAN-SWA MLSINDIKKPIAADIDAFEEKFRNSMKSSVPLLDRITHYIVKRKGKQIRPMFVFFSASICGGINEATHRGAALVELLHTASLVHDDVVDNSYQRRGFFSINALWKNKIAVLVGDYLLSKGLLLSIDNNDFQLLRIVSDAVKQMSEGELMQIEKVRRMDIGEPVYYEVIRQKTASLIASCCACGAASAGASDEVIEKMRLFGEKIGIAFQIKDDMFDFGTDDVGKPLGIDIKEKKVTLPLIYALANCSSSEKKRVINLVKNHNEDPKKIAEIIKFVKDTGGLQYAETQMKKYQEEAFEILNTFPDSDSHRGLEQLVRFTTERDK >NZ_CP043449|5114225:5159919|5132359_5133160_-|WP_112658345.1|DBSCAN-SWA MKGFILSFRSEFYKSRKTLGFWASVMLPLFIGVLLFVGFYTKSEKLVSMPPMMLWLQFAGAILGIMGSLVLPMYIIFIAYSVNSIEHRADTWKTLFNLPIPRWSVYAAKFFYAVFLVFVCLALFVLFTIGFGNLLSIVKPELRFDDFHMEKELAQIYFKLFLSSLGILSIQFLLSLFWADFLKPMGIGFVCTIVGVAAASKGWEYSYLFPYAHPMAAISSMLKNNHGDPSNRLQIDVFTKDVFVSMAIALVVFIAGYFIVQRKSVK >NZ_CP043449|5114225:5159919|5158802_5158973_+|WP_149354145.1|transposase|DBSCAN-SWA MTTSLHDHFRWIPDPRTGNNKKHNLLEVIILSVLAVVCGAESWYEMEELARKKKTF >NZ_CP043449|5114225:5159919|5153678_5154104_+|WP_112658311.1|DBSCAN-SWA MNTTDTPKNIHQGRNVKRFREMLGLKQEALAIALGEEWSQKRVSLLESKETLEPEILSQVAKALKVPEDSIKNFDEEAAINIINSTFTSNDNSTSIAYQPNFTFNPIDKLVELFEENKKLYERLLQSEQEKVEILKDKHNK >NZ_CP043449|5114225:5159919|5157833_5158109_-|WP_112658301.1|DBSCAN-SWA MEKDNRLLPTHLSLFTGLFVCWQRNGFVSPFSVTRKTLMAFSKVASIATYHKCIRELDEYSYIRYQPSYHPKQGSLVYWPEAEAAAKKDTV >NZ_CP043449|5114225:5159919|5150641_5151421_-|WP_112658318.1|DBSCAN-SWA MQLQISSDLHLEFPENRNYMALNPLQVNGDILLLAGDIIPLKDIDRHRDFFSLLADNYEHTYWVPGNHEFYHFDIANRSGAFRESILSNVTLLNNSTIELNQIRIHFTTLWSEISKYKARYIERGMMDFRVIKNEGKWFSVEKYNQMHSDALAFLETALDAGGYNRNEDVRNLIVSHHVPTFQHYPKQYSDSVLNEAFATDLDAFIEDSGADYWIYGHHHDATPEFNIGKTKLLTNQLGYVHQDEHSNFDRGKTITIDA >NZ_CP043449|5114225:5159919|5144778_5145609_-|WP_112658325.1|DBSCAN-SWA MPLQPSIDKITEISRVLLSIAVRYLVFAGAFYLFFYVWRKRAYWYAKIQQRYPANKHLVREIVYSFFTILIFGAVIMLVIYASKQHLTRIYLNISDKGYPYCFLSIGLMILMHDTYFYWTHRAMHWKPLFKLMHKTHHLSTNPTPFAAYAFHPLEAVVEIGIVPLIAFTIPYHGTALTIFSLYSLLLNVTGHLGYELFPQGFTTHKLFKWHNTSTHHNMHHRLVKCNYGLYFNIWDRLMGTNHPDYEKSFEQVVEKREKVKYKGDGELLPNELATE >NZ_CP043449|5114225:5159919|5135581_5136916_+|WP_112658337.1|DBSCAN-SWA MTCLKKLLPALFLAASAITASAQTDPKATEVWDPEPEVVLPGAGNKPPSDAIILFDGKNLDKWTDQKGNKPSWIVKDGIVTVKPGSGSIITKQNFADCQLHIEWRTPAVVKGEGQERGNSGVIMQSRYELQILDSYKNRTYSNGQAGSVYKQYLPQVNASLKPGQWQKYDIIYTAPRFNIDSSVKSPAYITVLHNGILVQNHVAIKGTVAHVGQPKYQKHAFALPLLLQEHEFPVSFRNIWIREIGVQKLLNGKDKKGWYTYLDTLGKDNDVHNNFAIENGMVHVMGKYFGYMATKKSYDNYYLKVVFKWGSKQYHPREKGVRDAGILYHFGEGDKDIVWPRSIECQIQEGDCGDIWCVQHTNVVTPNKSAIEWDQQRVYRTANFENPRGEWNTIEIICNGNQIEHYVNGHLVNWGIASLSHGRILLQSEGAEIWYKSVELTPL >NZ_CP043449|5114225:5159919|5152037_5152466_-|WP_112658315.1|DBSCAN-SWA MNEKIKNDSIGLLFQQALNKKLESDLSKHHKNNLRYAFNHFMEFLGAEGREMSIRELTTSRIEEFLQLFGSSATYYMNRRLELNILFNLASRMIDEDLQAAKRTESRRIKATLHLAYEKEQLEPVFTFLKVNHINLYRCCLR >NZ_CP043449|5114225:5159919|5136970_5137705_+|WP_090530824.1|DBSCAN-SWA MIRCLVVDDEPLALHILEDYISKMPFLQLVKATTNPIEALTMVQAAEADLVFLDVQMPELTGIQFLKIANGKAKVILTTAYPQYALEGYELDVVDYLLKPIAFDRFFKSVQKAQSIIQPVAAKPQVVMQAEPVQQDDFSTDFIFVKTEHKIQKVYLHDIMFIEGLKDYISIFTSAERIITLQGMKKMEDALPEKHFVRVHKSYIVALNKIDSIERSRIQIGDKIIPVGDTYRDEFFRMIENKNI >NZ_CP043449|5114225:5159919|5159556_5159919_+|WP_149354147.1|transposase|DBSCAN-SWA MERNKEYYQNHLRTHNWDSRTTQNRYYISSLQADAAYFNKAIRTHWGIENQLHWQLDVGFAEDYNTTRNSQAAQNLAVVRKIALNILKADKTSKASIKAKRKMAGWNHKFLLTLIANKNF >NZ_CP043449|5114225:5159919|5150025_5150562_+|WP_112658319.1|DBSCAN-SWA MNNTTSISDDIIMTKIYYIRDQKVMLDSDLAELYRVETRRLNEQVIRNMDRFPNDFMFRLNESEFESLMSQIATSKRGGRRKLPYVFTEHGVLMLSSVLNSKQAIQVNIQVMRIFNRIRNMYLDNTELRLEIEQIKSKLIRHDKSLELVFGYLDELIEKKSQPIDRKRIGYMPDTDSF >NZ_CP043449|5114225:5159919|5156268_5157249_-|WP_112658305.1|DBSCAN-SWA MIGKPITGRSFGGCVHYVVNKQDAKIIASEGVRMQNASTITHDFNLQRKMRPELGKAVGHLVLSWSKEDLTKLTNQIMVERAKEYMDKIGIRNTQFVIVRHHDREHPHLHVIYNRVDNNGNTITDKNNFAKNVKACKEITLKYGYHLGEGKDQVNRQALKGTEKTRYELFDIIKATLKQSTDWKQLEAKLHKQGIGIAYKFRSGSTEVQGISFEKGDIKMKGSAIDRSLSYAGINTQLKVNRQMQQQQEIRRPDSLSLADQLREAIRQHGQADQLHLPIHGKGILETLLEPQFVAGPPDPMGDADIARRKRKKHEAEHSQSQGISR >NZ_CP043449|5114225:5159919|5127054_5130225_+|WP_112658362.1|DBSCAN-SWA MQSTFNKYNAKFQEALAGLNPEQLAAVNKMDGPVLVVAGPGTGKTQILAARIGKILTDTDALPSEILCLTYTDAGAVAMRKRLFEFIGPDAYRINIYTFHAFCNEIIQENLEYFGKLNLESLSDLESAMLFRELVDEFPNDHLLKRFTGDIYYDVPRLKSLFSTMKRENWNAEMIEQAVNDYLEDLPNREEFIYKRANAKAGIKIGDPKQKDIDAAHDVMKKLLAAVGEYQNYTEKMKTRGRYDYDDMIIWVLRAFRDNEEILRKYQERYQYILVDEFQDTSGSQNELLRFLLNYWDTPNVFVVGDDDQSIFKFQGANMKNILDFANDYVDTLYTVVLKHNYRSSQQILDISKALIDNNFERLTSQLNLDKALKSSHSRFNNLLVGPVIREYENPDKELVDVSLQIKQLVTDGVPPGEIAVIYRNHNQVEEMIKFLDVQKIQVNTKRKIDVLTQPFGEKIINILRYLAMELDSPYSGDELLFEIMHYDFFNIPPIEIAKASVAVFKENFGTLAYNQPKTSIRRYVSEMRPPKQADLFHPDQQQEMRYLVSNIDYLLKESVSVTLQQLFQSVIAKMGILKYIMQQQDKGSYMQMLTSFFDFLKDESRKRPDINLPDLISTIEMMKKHQIRLELNQSIFSENGVNFLTAHGSKGLEFEYVFFIGCDKRTWDSKGRNSGFSYPDTLTRSAADEIAQKEEARRLFYVAVTRAKQHLTISYAAKDRKDKDQEASQFIGEILASTNMQVQYPKVDAMDMIEFQATQFSEAEKPKVELLDKNYINQLLQNYTLSVTHLNSYLDCPLRFYFQCLIRVPSGKSPAATFGQAVHWALNKTYRQLKDNDNEFLSTENFMREFRWYMARNRDSFTKDEFKLRSAYGEKILPTYYEQNVPHWNKIAVTERSIKNIEIAGVPIKGNLDKIEFDGKQATVVDYKTGKPKNAKDKLLRPTNDAPNGGDYWRQAVFYKILIDHDRTTDWQVISTIFDFVEPISDGEYYREKFVITPEDVETVTGQIKDTYEQIMVHNFSTGCGKKECDWCHFVKSNFKQADEILGIVAEEEGE >NZ_CP043449|5114225:5159919|5114225_5115137_+|WP_112653556.1|transposase|DBSCAN-SWA MLTSDKIIEIFVKVDDFCKECEEQIAKHKLDAGNYKVRDRKASLADSEIITIVIAFHSGHFTNLKHFYITHICSHYKDFFPGLVSYNRFVELQQRVAVPMMLFLKTHCLGRSRGINFIDSTHIKVCHNRRIHNHKVFAATAERGQCSIGWFYGFKLHLIINDKGEILSFYLTKGNVDDRNVKLMTSMTEEIFGKLFGDKGYISKALADLLWGNGIQMITKPRKNMKDFNISQADKIMLRKRAIIECVYDELKNICKLQHTRHRSVNNFLMNIMGSLCAYHFFPKKPSLNIVFEEQDNQLLLAA >NZ_CP043449|5114225:5159919|5154866_5155298_+|WP_112658360.1|integrase|DBSCAN-SWA MLTGLRFCDIKYLNWKQVRGEQDNYYLQFRQRKTYKPQLVYISNQAFYLLGERGDPATLVFPKVNYNYTRDLLKVWPKQAGINKHLTFSCLRHTYATLQLDHGTDIYTISKLLGHLHLKTTQRYTRVMDKAKKEAANRIVLDL >NZ_CP043449|5114225:5159919|5124620_5126333_+|WP_112658353.1|DBSCAN-SWA MQASKTRIFYPLIIIFTFFTADVFAQKESYIQLLGKQNRWVDSVYHKMSRKQRVAQLLFVRAHTNKGKAYEDSVGQVIKEQQLGGVVFFQGGPVRQANLINKYQKLAKVPLIITMDGEWGLGMRLDSTISYPYQMTLGAIQDDNLIYKMGQQVAYDFKRLGMQMNLGPDMDVNNNPNNPVINYRSFGDNMYNVAKKGIAYFKGMQDAGLLTSAKHFPGHGDTNVDSHFDLPLLPYSRSRLDSLEMYPFREAINAGISGVMIAHMGIPSLDNTPKLPSTLSRPIITGILKDSLNFKGLVISDAMEMKGVTKYFPNGEADVKAFIAGNDIIELSENSDRAIGLIRKAVRHGSVSAAEFEARIKKILTAKYWAGLNQYKPTALTNLSQDINSDAAKALVQQLSDAAVTQLKNNKVKLKPFLKTAIVSIGVSSPTLFQMEVVKSFPNSRIFIINKDAPAIEVTNILNWLKQYQQIIVGIHDTRLRPQSKLDYSSDVKLMIADLASRKNTAFSVFANAYTIAGLPGLEKAGSLLVCYQMSPDLQQSAAKVLSGRLKPTGKLPVSVNAFFTTGMGL >NZ_CP043449|5114225:5159919|5152923_5153349_-|WP_112658313.1|DBSCAN-SWA MNSVLYIARKAKGFTEGQLARLLNMDEKTYVELENEVRLLDKDTCLKLRKIFDIPADYFYISIKREATLRANTMKDVLSLLKLPGAGLLPPEIHVKMVAMGTEALMAHQELYAAYHEIDILKRENAAIRDLYTALIEEKRL >NZ_CP043449|5114225:5159919|5151468_5152053_-|WP_112658317.1|DBSCAN-SWA MLPTLIKCVRVRGYFFELHDDGNYVVIRDIRLKVTFRERTTKVRVYDKSYRNYEWHPNGKVVFRIDTRLKSEWQDLKAVLLEEQLPKILAKLELAAKEEELYLEKARIWQANWERERKEKAEFDARKKQEKDAYKELMRDAERWKRAQLIRDYTLAVPDPDPQWFAWANAKADWLDPYDPGNVEWLKDADKNDF >NZ_CP043449|5114225:5159919|5158143_5158293_-|WP_167516244.1|DBSCAN-SWA MNVELITKGDLLEFKSDLLNEIKRIIQPGQASTKQWLKSVEVRKLLGIC |
44 | Bacillus_phage(25.0%) | transposase,integrase | attL 5137032:5137047|attR 5156391:5156406 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|