Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP023719 | Zymomonas mobilis subsp. mobilis ZM4 = ATCC 31821 plasmid pZM39, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
NZ_CP023715 | Zymomonas mobilis subsp. mobilis ZM4 = ATCC 31821 chromosome, complete genome | 4 crisprs | cas1,cas3f,cas8f,cas5f,cas7f,cas6f,DinG,csa3,PD-DExK | 1 | 6 | 2 | 0 |
NZ_CP023718 | Zymomonas mobilis subsp. mobilis ZM4 = ATCC 31821 plasmid pZM36, complete sequence | 0 crisprs | NA | 0 | 0 | 2 | 0 |
NZ_CP023717 | Zymomonas mobilis subsp. mobilis ZM4 = ATCC 31821 plasmid pZM33, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
NZ_CP023716 | Zymomonas mobilis subsp. mobilis ZM4 = ATCC 31821 plasmid pZM32, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP023715_1 | 113783-114178 | Orphan |
I-F
Consensus repeat of NZ_CP023715_1
|
6 spacers
spacers of NZ_CP023715_1
>1.1|113811|32|NZ_CP023715|CRT ATACCGAGAACGATTATTGCCCGTAGTGTAAA >1.2|113871|32|NZ_CP023715|CRT GTTTGCTGCCATAAAATGCTCCTGCCGTGCAA >1.3|113931|32|NZ_CP023715|CRT TGAAAAAGGGTGACCCTATCAAAGTAGGGGCT >1.4|113991|32|NZ_CP023715|CRT CAAAACTGTCATATCGTAGCTAATATCTTCAC >1.5|114051|32|NZ_CP023715|CRT ACAGCTAAAACCCTTTTACCTTTACTGTCGGC >1.6|114111|32|NZ_CP023715|CRT TGAAAAAGGGTGACCCTATCAAAGTAGGGGCT >1.7|114059|31|NZ_CP023715|PILER-CR ACAGCTAAAACCCTTTTACCTTTACTGTCGG >1.8|114127|23|NZ_CP023715|PILER-CR GGTGACCCTATCAAAGTAGGGGC |
CRISPR arrays and Neighbor proteins around NZ_CP023715_1
The CRISPR arrays of NZ_CP023715_1 >merge|NZ_CP023715|1|113783-114178|CRT,PILER-CR ATTTGATGCTGCCTGTGCGGCAGTGAACATACCGAGAACGATTATTGCCCGTAGTGTAAATTTCTAAGCTGCCTGTGCGGCAGTGAACGTTTGCTGCCATAAAATGCTCCTGCCGTGCAATTTCTAAGCTGCCTATGCGGCAGTGAACTGAAAAAGGGTGACCCTATCAAAGTAGGGGCTTTTCTAAGCTGCCTATGCGGCAGTGAACCAAAACTGTCATATCGTAGCTAATATCTTCACTTTCTAAGCTGCCTATGCGGCAGTGAACACAGCTAAAACCCTTTTACCTTTACTGTCGGCTTTCTAAGCTGCCTATGCGGCAGTGAACTGAAAAAGGGTGACCCTATCAAAGTAGGGGCTTTTCTAAGCTGCCTATGCGGCAGTGAACTAGAAGAG >NZ_CP023715|1|1|113783-114170|CRT ATTTGATGCTGCCTGTGCGGCAGTGAAC ATACCGAGAACGATTATTGCCCGTAGTGTAAA TTTCTAAGCTGCCTGTGCGGCAGTGAAC GTTTGCTGCCATAAAATGCTCCTGCCGTGCAA TTTCTAAGCTGCCTATGCGGCAGTGAAC TGAAAAAGGGTGACCCTATCAAAGTAGGGGCT TTTCTAAGCTGCCTATGCGGCAGTGAAC CAAAACTGTCATATCGTAGCTAATATCTTCAC TTTCTAAGCTGCCTATGCGGCAGTGAAC ACAGCTAAAACCCTTTTACCTTTACTGTCGGC TTTCTAAGCTGCCTATGCGGCAGTGAAC TGAAAAAGGGTGACCCTATCAAAGTAGGGGCT TTTCTAAGCTGCCTATGCGGCAGTGAAC >NZ_CP023715|1|1|114022-114178|PILER-CR CTTTCTAAGCTGCCTATGCGGCAGTGAACACAGCTAA AACCCTTTTACCTTTACTGTCGGCTTTCTAA GCTGCCTATGCGGCAGTGAACTGAAAAAGGGTGACCC TATCAAAGTAGGGGCTTTTCTAA GCTGCCTATGCGGCAGTGAACTAGAAGAG
>NZ_CP023715.1|WP_011240092.1|112712_113186_-|Cys-tRNA(Pro)-deacylase MTKKTRGTAFLEKAGIAFTVHPYDYDPKAPAAGLQAAEALQQPAEIVYKTLMTEVDGKPVCVVVQVNHEVSMKKLAAAAGGKSANMIKPVDAERMTGYHVGGISPFGQKKRVPVIFDESAFQAEKIFINGGQRGVLVALAPEDARRAVDGKIASVAN >NZ_CP023715.1|WP_012817485.1|111116_112700_-|ATP-binding-protein MAINKERDEPFSYLWQILLFLTDLPLQKRTSLAQRCRMTVMIDMGKDPKGQVVPMDLEELLATRLLVQGNSGSGKSHLLRRMLEKSARFVQQIVIDPEGDFVTLAERFPHVAVEAAAYNESEIRVLAQRIREHRVSVVLNLEGLDVDNQMKCAAWFLATLFDAPRDHWYPAIVVVDEAQIFAPAQAGEVSDEARRLSLAAMTNLMCRGRKRGLAGVIATQRLAKLAKNVAAEASNFLMGRTFLDIDMARAADLLGMDRRQAESIRDLERGHFLALGPALCRRPIAVKIGEVETKSRTGGFKLMPLPDSKNSNPEDLLFSEPEEPALPLASPEPPPPPSSSQLMELLQKEKEEEAKIAEAEAAENPVDNSQKEALIDALLSSIVEEEENAYRQANLLYPEFTIGCRMHGLSTPPLDLTAFTKRLTIAKAGLSNDDLQDELWQPALKAASILEDDIQAVFLFLAKTAKENAPCPDDEMIARIYGTRSAGRARRLIGYMENQGIIAIRTDFGGRRSITLPALGWTTSTAA >NZ_CP023715.1|WP_014848986.1|110517_110850_+|hypothetical-protein MSHKKLEVSAGIRHRMAAEIMDHMNYMVDDPDQLSVKPANALEHIFLYEDSGAVIAEIPVVFKGESGRVLYDISHNRLVHSDIKHDQLKEMVSSKMPDFKEELLEYFREN >NZ_CP023715.1|WP_011240089.1|108978_110328_+|replication-associated-recombination-protein-A MDDLFNSVEPLVFTENEKQPLPENRPLADILRPKHLSDVIGQAHVTGENGIIGRMVAAGRLSSLILWGPPGTGKTSIAQLLAESVGMRFEMVSAIFSGVADLKKIFLKAEHHRQQGRQTLLFIDEIHRFNKGQQDSFLPYIENGTFVLVGATTENPSFALNAALLSRAQVVTLNRLDEEALGLLLERAETVSGQLLPVDENARKALIASADGDGRFLLNQAEILLAMNLTKSLSVPELAQILQKRMAIYDKDRDGHYNLISALHKSVRGSDPQAALYYLSRMLVAGEDPLYLLRRLTRMANEEVGLADPRAMEQCIAAKETYQLLGSPEGELAIAQACVYVATAPKSNAIYKAYNQAMDLARESGSVLPPPNILNAPTEMMKQQGYGEGYHYDHDMPDAFSGDNYWPENLPPVTLYQPNIRGYEKHITERLAFWEHLRQERKKGKPSGK >NZ_CP023715.1|WP_011240088.1|107643_108711_+|tRNA-dihydrouridine-synthase MKTEIPYYNPELSYEDNYKEGPFGYFADIVEKPDPSFAVSVKKPVSFLGCSVDLPFGIPAGPLLNSRYIKAAFHAGFDLCVYKTVRTQEHKSHPLPNVLAIHPEGVLSADCEAVLADTRYNQPLSITNSFGVPSFNPDIWQPDMAEAVKAASDHQVMIGSFQGTRGKGKIEEDYALAARMVAETGAPVLEANLSCPNEGVNSLLCFDAPLVQKIVEAIKAAVPDRPLLIKTAYFKDNAKLADLVSRVGHLVSGFSTINTLSARPLDEKGQAALSPSRPEGGVCGDAIRWAGLEMVQRLAAFREEKSLDYAIVGVGGVNKPEHYKAYIEAGANAVMTATGSMWNPHLAEETKKFLA 
>NZ_CP023715.1|WP_011240087.1|106316_107483_+|Na+/H+-antiporter-NhaA MRFSIRRFFSAASGGAIILLLSALLGLLLSNSFLSESYFKVLHLKMPFSALDDAPNLAEFISIAPMSLFFFVVIAEIKEEIISGHLASFRRVILPLISALGGMMIPACLYGLITSGHLEVSRGWAIPIATDAAFTLPIILALGRHVSEGARVWLMALAIFDDLLGIVVIALFYASHLNGYALFAAGLITAVMIGLNKKSVQNLWVYASAGVVLWWALLVSGLHPTIAGVITGLALPSVADQPEKASPLERGKQIIAPWVTWLILPLFGFVSMGMSLSAMSFHVLLAPVPLGVALGLFLGKPIGVFGATIMATRLKIATLPKGTSLRMLFGLSLLCGIGFTISLFIAELAFSGSDFLVPAKYGILMGSLLSALAGWLWLRFLKFPAKGV >NZ_CP023715.1|WP_011240086.1|104520_106158_+|hydroxylamine-reductase MLCFQCEQTHSGTGCVIRGVCTKTPEVAAIQDLMIFASAGLSYVAKKLPDSCEAERKEAASLVIQALFSTVTNVNFDADVLTKALYHLVDFRDALKAKLPEDVELPLAATLDFSRDRETLVKQGESYGIASRQKTLGIDVTGLQELLTYGMKGMAAYAHHAAVLDYRDPDVDNFLLEGMAALTDHSLDIQALLAVVMRCGEASYKTLALLDKANTSSFGHPVPTNVKMGPSKGKAILVSGHDLLDMKELLEQTKDTGIKVYTHGEMLPAHGYPELNKYPHLAGHYGGAWMLQRQEFINFPGPIVMTTNCLMEPRKEYAGRVFTRDLVGWPGLTHLPDRDFSKVIEAALESEGFTEDQESRSHIAGFGHHTVLDSADAVVSAIKKGDIKHFMLVGGCDGIKSGRHYFTDIAEKAPKDWVILTLGCGKFRVTDLDLGKIGDLPRLLDMGQCNDSYSAIRVALALAEAFDTDVNSLPLSLVLSWYEQKAVCVLLALLHLGVKGIRLGPTLPAFITPNMLKILVDNFDIKPIGNSAEEDLQEILAAKAA >NZ_CP023715.1|WP_011240085.1|104016_104454_+|Rrf2-family-transcriptional-regulator MLSISQSTGYAVLALSAIHAESEKLTMARDIAEKANIPRPYLTKILGRLQEAGLITAKRGQNGGLRLNRPPETISLLEIVKAIDGKDWGCGCFLGLPGCSNEHPCPMHSFWLKTRPVIVKQLENMTLDKAKHFTEAGWKFRSEEG >NZ_CP023715.1|WP_011240084.1|102980_103817_+|aminotransferase-class-IV MPIWLNGVLANNAVAEFNLNDKGLLLGYGVFDTALVIADKVAYREAHLEKLTKSCAALSLPVASSFLSEMMEKAAKDLPLGVIRITVTGGVGPRGMAFSPEAKPNVIVSASPIAATIFCPEIRLVLTPLRRNESSFTARIKTLNYLDAIMAVTEARQKSFDDALFLNTAGHVACSSTANLFMIRDGCLITPPVSDGILAGIMRANILRFAKSRDIPVEERSIGYEELLEADDIFLTNSLRLISQVTHLGEVALPRRSAALMALLESMVFDEINYSRSQ >NZ_CP023715.1|WP_011240083.1|102403_103015_+|aminodeoxychorismate/anthranilate-synthase-component-II MLLMIDNYDSFVVNLARYCERLGRKVSLFRHDKITLEEIEVMSPKAIILSPGPCSPEEAGISLDVLRQFSGKIPILGVCLGHQAIGVAFGGVIARASYPLHGRAVEISHVGKRLFKDIPNPFKAARYNSLIIQKTEEMEQHLTVDALSPEGEIMALSHKSHPTYGIQFHPESVLTEYGDALLSRFFDLEEAFYADMAERCISQ 
>NZ_CP023715.1|WP_011240093.1|114592_115552_-|S1/P1-nuclease MTDNRVSKLFKKRLTKLAIVAAMLTLPQPLYAWGMEGHEAIAALAWKYMTPTTRKKVNAILAMDHDRLTEPDFMSRATWADKWRSAGHGETEPWHFVDIEIDNPNLVTACAAASNRSNPMKNGGAQPCVVSQLDRFERELSSKQTSDQDRVLALKYVLHFVGDLHQPLHAADHDDRGGNCVKVSINNARSLNLHSYWDTYVVKEIDPDPQHLADSLKKEISPEDKKSWVLGDSKQWAMESFQLGKRYAYSFNPPAGCDATRPPIPLSAGYDSAARKVAASQLKKAGVRLAYILNHRLRSIPLSYFLQAQKQDAAANNNG >NZ_CP023715.1|WP_011240094.1|115569_118062_-|TonB-dependent-receptor MKNLLSAKTKSFLSFKQSRLNWVILYSSLWVSGAMAQNTVPPQPATDDTSSQQHAVTDNSANHPAKNDGAIVVTGRSYADGVTRRAFGGGLMIKEDSPKSKSTLTRDYIEKQTPGLNPMQLIALLPGVNSSDSDPMGLTGGHTSVRGMNESSMGYILEGFPLNDVGSHAVYPQEIVDSENLSTIQVAQGSADLDSPTVSAAGGVVNMHMIDPAKKMGGRANFTYGSYNTFRGFARFDTGEIGNSGTRAYFSFSDTHEDLWRGPGTEKKLHGEMKVVKEWGKDNKASLLVIGNNLDNINMPSVNMASWQKYGKGIMGGPVSGIANTVYSSVYTGNTKANTTYYKLHPNPFTNIYVSAPVHLNLGHKMTLTETPYFLYSNGNGGGAYWQDMNKMSYGSQTMSGTVDGQNYGQTLLYEPSITKTYRSGSTTKWTWTSGINRLMIGYWFEYSSQRQTAPYSLLNDDGSPRDKWGGGSNVILANGEKAEYRDNLTRTFIHTPFIGDTISLLNDKLTIDGGVKISIINRQGRNYLPDTSTGKNINQTYREVLPSGSIRYKINDEHQVFFSVATNYRIPMNTSLFDSGSYVAGTGYSNQAVKDLKPETSISEEFGWRYHGKLINTSLTYFHYDFHNRLFSQTVIDPNNPTSYYSRSINGGNQTTNGVDFEIGTRAIYNIRPYVSAEYIDARNRSNLAASAAGVSAILPTKGKFAPQTPRYQVGFGLDYDNGHIFGNFSLKYVGSQYSTFMNDEQVPSYVRMNIGGGYRFKSWGGLKSPTIRFNLSNITNKHYLNYASGLQTNAQYAKALDGQMVKGSAPTYSIASPFSAMFSISSGF >NZ_CP023715.1|WP_011240095.1|118384_119179_-|phosphatase-PAP2-family-protein MIKVPRFICMIALTSGILASGLSQSVSAHTEKSEPSSTYHFHSDPLLYLAPPPTSGSPLQAHDDQTFNSTRQLKGSTRWALATQDADLHLASVLKDYACAAGMNLDIAQLPHLANLIKRALRTEYDDIGRAKNNWNRKRPFVDTDQPICTEKDREGLGKQGSYPSGHTTIGWSVALILAELIPDHAANILQRGQIFGTSRIVCGAHWFSDVQAGYIMASGEIAALHGDADFRRDMELARKELEKARTSAHTPDDLLCKIEQSAR >NZ_CP023715.1|WP_011240096.1|119336_120281_-|metallophosphoesterase 
MISFNRRRFLSLSAGATFAAATAPRLYAGVPNRPPLRSHQSFTFVFITDTHLQPELNGAEGCHEAFLKARQFPADFAIHGGDHVFDALGVNANRATMLADLYKRTADDLRLPVYNTMGNHDCFGIYKESGAQPTDPFYGKKYFQDNFGQTYYSFDHKGVHFVILDSIGITEDRSYEGRVDAEQFNWLSRDLAAQPVGTPIIVSTHIPIMNAIDYASVPLNKMKHHSLSVINAADILELFDHYNVIGVFQGHTHVVERVEWHGVPYITGGSVCGNWWHGTRYGTPEGFMVVKVEKGKVIPHYESYGFHTIDPRNT >NZ_CP023715.1|WP_017466460.1|120788_121607_+|sel1-repeat-family-protein MKKILLLWVVVFSFVASRTQMQIRKELESFQSKYAALIKKPVQEKSSGRRLVVEKDSIPPDPPLRYQILLHPQEAAKKGDAEAQMFLGKAYLTGRSDVPKNSKQAVFWFQKSANQGYAEGEVALADAYHNGTGVGRDEAKAAFWYQKAAAQNNIEAEARLGFIYHQGRGLPKDEKMSFFWFDKAAHQGSLLAQTMVGVAYYYGSGVPQDKGRAFMWYQKAAHQGDVMAQYLLGMAYLKGEGVARSKRDGVFWLQRAAAQGDYNAFKILQRLQ >NZ_CP023715.1|WP_017466461.1|121837_122560_+|sel1-repeat-family-protein MKRILVLTAALLPVFAQPAFARIGVGRVVKMTRDGLKSPLQKAAERGDAKAQYALGNAYSKGQDVSKSDEQAVSWYQKSASQGYAPAQAALGYAYSSGLGVTHDDQQAVSFFQKAANQGNASAQYNLGMAYSNGQGVPHSDEEAASWYQRAAHQGYAPAEFNLGAAYYHGEGVVQDYGQAVFWYQKAAEQGDAKAQTALGVAYITGRGVTKSRDNALIWIQKAADQGDVTAQKILPALKK >NZ_CP023715.1|WP_160327976.1|122579_122729_+|hypothetical-protein MRKGWGGLLWGCPAFIDIGDWGQASFPNDLFYHGWLSGGALSVVVIPFE >NZ_CP023715.1|WP_014500662.1|122738_124106_+|sel1-repeat-family-protein MKKILLLSVLLSSSVTPSMAAPEKPHVVESDQIPLKQAAEAGDIAAQSNLGLAYYVGAAVPKDAAMAAFWFEKAASKGFSAAQYNLAGLYATGEGVAQSDKQAAFWYEKAAEQGIDEAEYNLALAYEQGKGVEQNYERALFWLKKAADQNFFKAETHLGLAYQAGIMLPRDDKKAVALFMKADRQVYYAEAQMALGNAYRRGAGVKQDDQKAVSYYQKAADQGDGEALTALGVFYMTGRGVPQNYERGLDCFRKAADKDVSAAEDNLGNAYRHGYGVPKDDEKAVYWYQKAADKGDAEAEYNLGLAYRKGEGISQDDAKAAFWYKKAADQGHVKAQLNMGFAYYQARGVAQDYARGIFLYRKAAEQGDSKAEYNLAIAYYNGVGEPKDLAQSIYWFQRAASHGEMSAQYNLGAFYMRGEGVPKDRNEAIFWLEKAAAQGDVEAQSTLHNLDHYPL >NZ_CP023715.1|WP_017466287.1|124243_125068_+|sel1-repeat-family-protein MRKFLFFTAVFLPFVANPVQAQTAKSTKVVAGKTALSLEQKARAGNPKAQTDLGTAYYNGQGMAQDYKQAISWYQKAANQGYPLAQYYLGNACLQGIGLTQSDEQAVSWYQKAANQGLAEAQYSLAIAYYTGRGVTQNYEQASFWFQRSANQGFVPAQFYLGVMYRNGAGIPEDDDRALFWFHKAADKGYADAQYNLGLIYHEGKVVKKDEKQATFWYQQAANQGLVEAEFNLGIAYLKGQGVQKDKDKATFWLEKAADKGDSHAQDVLEMMNK 
>NZ_CP023715.1|WP_011240099.1|125250_125751_+|sel1-repeat-family-protein MKKRIILLGLLLSIGGQIVYSQVRKTPSIFSQKKLHELKLAADQGDAEAEAALGEAFDFGKITPQDYQKAFFWYQKAADQSVAEAQYNLGGLYYKGAGRPKDGEKAVYWYRKAADQGYIDAQRNLALLYAKGELVPQSDEQAVYWYQKAADQGDAEAQKLLAMLAR |
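
The spacer records listed in the Spacer_info cell for NZ_CP023715_1 use a pipe-delimited header followed by the spacer sequence. A minimal sketch of how such a cell could be parsed is given below. The field names (`spacer_id`, `start`, `length`, `contig`, `tools`) are assumptions inferred from the values shown on this page, not documented column names, and `parse_spacers` is a hypothetical helper, not part of any database tool.

```python
from typing import List, NamedTuple, Tuple


class Spacer(NamedTuple):
    # Field names are assumptions inferred from the records on this page.
    spacer_id: str           # e.g. "1.1"
    start: int               # e.g. 113811
    length: int              # e.g. 32
    contig: str              # e.g. "NZ_CP023715"
    tools: Tuple[str, ...]   # e.g. ("CRT",) or ("PILER-CR", "CRISPRCasFinder", "CRT")
    sequence: str


def parse_spacers(cell: str) -> List[Spacer]:
    """Parse a flattened Spacer_info cell into Spacer records."""
    spacers = []
    for chunk in cell.split(">"):
        # Drop surrounding whitespace and the trailing table-cell delimiter.
        chunk = chunk.strip().rstrip("|").strip()
        if not chunk:
            continue
        header, sequence = chunk.split()
        spacer_id, start, length, contig, tools = header.split("|")
        spacers.append(
            Spacer(spacer_id, int(start), int(length), contig,
                   tuple(tools.split(",")), sequence)
        )
    return spacers


# Two records copied from the NZ_CP023715_1 spacer list.
cell = (
    ">1.1|113811|32|NZ_CP023715|CRT ATACCGAGAACGATTATTGCCCGTAGTGTAAA "
    ">1.7|114059|31|NZ_CP023715|PILER-CR ACAGCTAAAACCCTTTTACCTTTACTGTCGG |"
)
spacers = parse_spacers(cell)
for s in spacers:
    # The third header field matches the sequence length for these records.
    print(s.spacer_id, s.start, s.length, len(s.sequence) == s.length, s.tools)
```

Checking the length field against the actual sequence length is a cheap way to detect truncated or mis-split records before any downstream comparison of spacers.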
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP023715_2 | 1245354-1245865 | Orphan |
I-F
Consensus repeat of NZ_CP023715_2
|
8 spacers
spacers of NZ_CP023715_2
>2.1|1245382|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT TAGCAGTGCCAGTGCTATCAAGAAAGAAATCC >2.2|1245442|33|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT AGATGGAACAAATGTGTGGTGGGAAAATACTTT >2.3|1245503|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT CCGTGCGTCATAGACACCGGAAGGTGCGCCGT >2.4|1245563|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT TTCTCAAAAAGGAAAAGAAATTGGAACAGATT >2.5|1245623|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT GTCGAATAATTTTAAGCGTGAACCCGTATCAG >2.6|1245683|33|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT ATCGAACTGCGTGCCTGATAGCCGATACGCTGA >2.7|1245744|34|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT GATCGCGGGCAACGGTTTATTCAGCTATCCGCGC >2.8|1245806|32|NZ_CP023715|CRISPRCasFinder,CRT ATTTCGTCAGTGTCCGGAGAAACGCCCGTCAG |
CRISPR arrays and Neighbor proteins around NZ_CP023715_2
The CRISPR arrays of NZ_CP023715_2 >merge|NZ_CP023715|2|1245354-1245865|PILER-CR,CRISPRCasFinder,CRT GTTCACTGCCGCACAGGCAGCTTAGAAATAGCAGTGCCAGTGCTATCAAGAAAGAAATCCGTTCACTGCCGCACAGGCAGCTTAGAAAAGATGGAACAAATGTGTGGTGGGAAAATACTTTGTTCACTGCCGCACAGGCAGCTTAGAAACCGTGCGTCATAGACACCGGAAGGTGCGCCGTGTTCACTGCCGCACAGGCAGCTTAGAAATTCTCAAAAAGGAAAAGAAATTGGAACAGATTGTTCACTGCCGCACAGGCAGCTTAGAAAGTCGAATAATTTTAAGCGTGAACCCGTATCAGGTTCACTGCCGCACAGGCAGCTTAGAAAATCGAACTGCGTGCCTGATAGCCGATACGCTGAGTTCACTGCCGCACAGGCAGCTTAGAAAGATCGCGGGCAACGGTTTATTCAGCTATCCGCGCGTTCACTGCCGCACAGGCAGCTTAGAAAATTTCGTCAGTGTCCGGAGAAACGCCCGTCAGGTTCACTGCCGCGCAGGCAGTTGCTTAG >NZ_CP023715|2|2|1245354-1245805|PILER-CR GTTCACTGCCGCACAGGCAGCTTAGAAA TAGCAGTGCCAGTGCTATCAAGAAAGAAATCC GTTCACTGCCGCACAGGCAGCTTAGAAA AGATGGAACAAATGTGTGGTGGGAAAATACTTT GTTCACTGCCGCACAGGCAGCTTAGAAA CCGTGCGTCATAGACACCGGAAGGTGCGCCGT GTTCACTGCCGCACAGGCAGCTTAGAAA TTCTCAAAAAGGAAAAGAAATTGGAACAGATT GTTCACTGCCGCACAGGCAGCTTAGAAA GTCGAATAATTTTAAGCGTGAACCCGTATCAG GTTCACTGCCGCACAGGCAGCTTAGAAA ATCGAACTGCGTGCCTGATAGCCGATACGCTGA GTTCACTGCCGCACAGGCAGCTTAGAAA GATCGCGGGCAACGGTTTATTCAGCTATCCGCGC GTTCACTGCCGCACAGGCAGCTTAGAAA >NZ_CP023715|2|1|1245354-1245865|CRISPRCasFinder GTTCACTGCCGCACAGGCAGCTTAGAAA TAGCAGTGCCAGTGCTATCAAGAAAGAAATCC GTTCACTGCCGCACAGGCAGCTTAGAAA AGATGGAACAAATGTGTGGTGGGAAAATACTTT GTTCACTGCCGCACAGGCAGCTTAGAAA CCGTGCGTCATAGACACCGGAAGGTGCGCCGT GTTCACTGCCGCACAGGCAGCTTAGAAA TTCTCAAAAAGGAAAAGAAATTGGAACAGATT GTTCACTGCCGCACAGGCAGCTTAGAAA GTCGAATAATTTTAAGCGTGAACCCGTATCAG GTTCACTGCCGCACAGGCAGCTTAGAAA ATCGAACTGCGTGCCTGATAGCCGATACGCTGA GTTCACTGCCGCACAGGCAGCTTAGAAA GATCGCGGGCAACGGTTTATTCAGCTATCCGCGC GTTCACTGCCGCACAGGCAGCTTAGAAA ATTTCGTCAGTGTCCGGAGAAACGCCCGTCAG GTTCACTGCCGCGCAGGCAGTTGCTTAG >NZ_CP023715|2|2|1245354-1245865|CRT GTTCACTGCCGCACAGGCAGCTTAGAAA TAGCAGTGCCAGTGCTATCAAGAAAGAAATCC GTTCACTGCCGCACAGGCAGCTTAGAAA AGATGGAACAAATGTGTGGTGGGAAAATACTTT GTTCACTGCCGCACAGGCAGCTTAGAAA CCGTGCGTCATAGACACCGGAAGGTGCGCCGT GTTCACTGCCGCACAGGCAGCTTAGAAA TTCTCAAAAAGGAAAAGAAATTGGAACAGATT 
GTTCACTGCCGCACAGGCAGCTTAGAAA GTCGAATAATTTTAAGCGTGAACCCGTATCAG GTTCACTGCCGCACAGGCAGCTTAGAAA ATCGAACTGCGTGCCTGATAGCCGATACGCTGA GTTCACTGCCGCACAGGCAGCTTAGAAA GATCGCGGGCAACGGTTTATTCAGCTATCCGCGC GTTCACTGCCGCACAGGCAGCTTAGAAA ATTTCGTCAGTGTCCGGAGAAACGCCCGTCAG GTTCACTGCCGCGCAGGCAGTTGCTTAG
>NZ_CP023715.1|WP_011241715.1|1244662_1244797_+|entericidin-A/B-family-lipoprotein MRILIHFLMASAVLALAACNTVQGFGQDLSSAGQSMSNSAERNK >NZ_CP023715.1|WP_011241035.1|1242642_1244295_+|sensor-histidine-kinase MKAFFKNSPSKTFHKKQLLTRRLFLLFVSMPTWFRMLVVLSLAQLPPSFVALGAFIASSHQYRLNHQKEAQLIATESARTIDDSVDILAYDVRNIFDEEGSYDFADMGECSLLIQHMQSLGFVPVDYALMDTSGRLLCKSDNFKVNTAVLLPFKPLGANNFYNVDILENEGQLQFSFLGGHYPLSKYLVVGQIGRDSLLKIIERKVVPDGSALFLAQKSHLLPLISPKKPLLKQSHPVVIPTFDNSINLIYDNNLPPFRKVDLLFVLLPVLIWFMTASIGWVVVHYMLLRPLRRTQRAIVDYGKTGEIQKIKPASIGAAREIRQVSKAFYHVAVRLAAHERALRNALQHQKMLTREVHHRVKNNLQVVSSLLSIHSRRAETAKEKSAYATIQRRVNALAIVQRQYYNELDKGQGLDLSLLIKELVNGLQTSLQTFEKNFQIETEIDQVFVPLEKAMPIAFILVEIVDFSLAVDPLLPITIILHRHSAEFENTPDHAGLEIRAEALKQLSSRKGNEVIRRIISAFGRQLGGKSAEKEEDGYYYYDIDILPD >NZ_CP023715.1|WP_011241034.1|1242403_1242616_+|hypothetical-protein MNGSMTAHNHKRPVSQELPASSGNAAHADESAKQAEDRKNAIGFALRNVYQQTVEEAVPDDLLALLAKLN >NZ_CP023715.1|WP_038259288.1|1240580_1242059_+|DEAD/DEAH-box-helicase MSFADLGLSKELLQAVAELGYEEPTPVQAAAIPSVLMMRDLIAVAQTGTGKTASFVLPMIDILAHGRCRARMPRSLILEPTRELAAQVAENFEKYGKYHKLSMSLLIGGVPMAEQQAALEKGVDVLIATPGRLLDLFERGKILLSSCEMLVIDEADRMLDMGFIPDIETICTKLPTSRQTLLFSATMPPAIKKLADRFLSNPKQIEISRPATANTLIDQRLIEVSPRSKKKKLCDMLRAEKDHTAIIFCNRKTTVRQLATTLEQQGFSVGQIHGDMSQPERGSELERFKNGQISVLVASDIAARGLDVKGISHVFNFDVPTHPDDYIHRIGRTGRGGASGEALTFVTPADEEAITAIEKLMGVEIPRLGNRKKNTYPSKSEETVSPKQAKPAQEKSATPSPRKAPRKTETPKKLPEDRLDSVESDFPKNQAPRNTASRPVSKARTQERALRPENLRRVKPVALETAIDWNGPIPPFLNYSVPKTTARNVKKD >NZ_CP023715.1|WP_011241032.1|1239985_1240411_+|hypothetical-protein MPPRHFPHSGVMEIHLNVRNANGRNQRPLPPPPPPPDSPYNAPPPPPEAVWHEKKGPKCVSPEDIGAAAVSEKDSVDLMLKGGSRIRAHLEHCPALDFYSGFYVHAGRDGQICAKRDPVYARWGGECLINRFRVVEGVFKH >NZ_CP023715.1|WP_011241031.1|1238130_1239654_-|glucose-6-phosphate-isomerase 
MARIANKAAIDAAWKQVSACSEKTLKQLFEEDSNRLSGLVVETAKLRFDFSKNHLDSQKLTAFKKLLEACDFDARRKALFAGEKINITEDRAVEHMAERGQGAPASVARAKEYHARMRTLIEAIDAGAFGEVKHLLHIGIGGSALGPKLLIDALTRESGRYDVAVVSNVDGQALEEVFKKFNPHKTLIAVASKTFTTAETMLNAESAMEWMKKHGVEDPQGRMIALTANPAKASEMGIDDTRILPFAESIGGRYSLWSSIGFPAALALGWEGFQQLLEGGAAMDRHFLEAAPEKNAPILAAFADQYYSAVRGAQTHGIFAYDERLQLLPFYLQQLEMESNGKRVDLDGNLIDHPSAFITWGGVGTDAQHAVFQLLHQGTRLVPIEFIAAIKADDTLNPVHHKTLLTNAFAQGAALMSGRDNKDPARSYPGDRPSTTILMEELRPAQLGALIAFYEHRTFTNGVLLGINSFDQFGVELGKEMAHAIADHPENSDFDPSTKALIAAALK >NZ_CP023715.1|WP_011241030.1|1236621_1237968_-|glutathione-disulfide-reductase MTDYDFDLFVIGAGSGGVRASRIAASHGASVAIAEEYRIGGTCVIRGCVPKKMLYYAADFAADLKKAQRFGWTLPEKKFDWATLRDVVLSDVTRLEGLYTQTLDNNHITHYKEHAVIDSANQIRLASGKKITARYILVAVGAEPAKLDILGAEYAVTSNEMFLLPSLPKRALVVGGGYIANEFAGILNSFGVETTIATHGDRILRGYDEEIAARLVEIGQGHGIDYRFNADIARIDKDSSGRLTTHFKDGSQIESDLVLFAIGRVAKSRDLGLDKADVKTNDRGAILVDEENRTSCPSIYAVGDVTDRVQLTPVAIREGQAFADRVFGHKAASVDYDTIPTAVFSHPPLASAGLTEEEAKKRYKNIKIYKSNFRPMRNALIDSPDRALYKMVVDGDSDKVLGLHLIGQDSPEIIQLAAVAIKAGLTKQAFNDTVALHPSSAEELVLMR >NZ_CP023715.1|WP_011241029.1|1234196_1236155_-|potassium-transporter-Kup MSNDTSPGTSSVDSKSSDPSYGVPGHSHSDKDLLKLSLGAIGIVFGDIGTSPLYALKECFKGHHQLPVDDFHIYGLVSLIFWTMGLVVTVKYVMFIMKADNKGEGGSMSLLSLIIRGANPKLSRWLIVLGVFATALFYGDSMITPAMSVLSAVEGLTVIEPSFDSWVPPVSVVILIGLFCIQARGTESVGRLFGPIMLVYFATLAILGAFNIITRSPAILLALNPYYAIHFFASDPLQGFWALGSVVLSVTGAEALYADMGHFGRQPISLGWYWVVFPALTLNYLGQCALLSADHEAIANPFYFLAPDFLRVPLIILATFAAVIASQAVITGAFSVTQQAIQLGYIPRLRVNHTSASTVGQIYIPSVNWVLMFMVMVLIAMFKNSTNLANAYGIAVTGTMFITSCMMGVLVHRVWHWKAWQSIPLVSFFLLIDGAFFLSNVTKIPEGGWFPLLVGFVVFTMLMTWSRGRHLMAERMRQVAMPIQLFIRSAAASAVRIPGTAIFLTPEDDGVPHALLHNLKHNKILHERVILLTVKIEDVPYVDPHYRASMSSLEDGFYRLIVRYGFMEEPDVPLALNKIEQSGPMLRMDDTSFFISRQTLIPSTHTSMAIWREKLFAWMLRNSESATEFFKLPSNRVVELGSQIELVGSNGK >NZ_CP023715.1|WP_011241028.1|1232276_1233266_+|carbon-nitrogen-hydrolase-family-protein 
MSCHRVAVIQAGTSLFDTEKTLDRMEALCRQAAEQNVELAVFPEAYIGGYPKGLDFGARMGTRTEAGREDFLRYWKAAIDVPGKETARIGSFAAKMKAYLVVGVIERSEATLYCTALFFAPDGTLIGKHRKLMPTATERLVWGQGDGSTIEILDTAVGKLGAAICWENYMPVLRQVMYAGGVNIWCAPTVDQREIWQVSMRHIAYEGRLFVLSACQYMTRADAPADYDCIQGNDPETELIAGGSVIIDPMGNILAGPLYGQEGVLVADIDLSDTIKARYDLDVSGHYGRPDIFEIKVDRQSHQVITDQFSRDQATEKKPVSDSEISQLD >NZ_CP023715.1|WP_011241027.1|1231256_1232207_-|LysR-family-transcriptional-regulator MIRKINISDITRFDFNLVITFLALWHERSVTKAAARLSLSQSAVSASLSRLRQAAGDLLFIRTRQGMEPTQRAIDMVKSLSEGATLIYNAFISENEFDPARCNRHFSIGMSDDFQLALGSEISKQIQAIAPDASVVYRQTNRYTAQQMLENNDIDLAIVTTSLPRRGLWQQVIGEGGYACLCDAQSCGFSENPTLEAYLSLPHILVSYSGREGLVDEILSIMGRSRKIQTALTHFAALPPFLLGSKSIATIPSHAAISLAGYTGLTIFEAPLELTAYPIIATMRLSSQKDTALLWLFQIIKQAIRVQQNILPPPQS >NZ_CP023715.1|WP_011241036.1|1245956_1246745_-|twin-arginine-translocase-subunit-TatC MSETDDIHDEVDESAAPLLDHLLELRKRLLISLVALGIAFLLCLHFSRSIFAFLVQPLLRAGQGRLIYTDIFEAFFVDIKVAFFAAIMLAFPIVAMQIWRFIAPGLYSNEKRAFLPFLVMTPLLFLVGASMAYYVAMPIALHFLLGYQGNIGGVQQTALPAVGNYLNFVTKFIFGFGVAFLLPLVLLLLERAGFVTRQQLVAGRRYAIVASVAIAAVLTPPDIVSQLLLGVPLILLYEMALLAMLFGEKRRKKETDLVVAED >NZ_CP023715.1|WP_011241037.1|1246741_1247200_-|twin-arginine-translocase-subunit-TatB MFDVAPSELLLVAVVALVVIGPKDLPRAMRVVGRWLGKARKLSRHFRSGIDEMIRQSEMEDMEKRWAEENAKLLAENQGQGNQTASTSSPATPSPVSDDPAEQNIVFTSPADLEVNTADTSHLAANHTETTATTAASTPAKPKEADQQEKQS >NZ_CP023715.1|WP_011241038.1|1247284_1247548_-|twin-arginine-translocase-TatA/TatE-family-subunit MGGMSITHWIVVAVVVMIFFGKGRFSDMMGDVAKGIKSFKKGMSEDDTTPPAAPPAPAPRLENQPLPPENTTQNVAQNVPNDIKNNQ >NZ_CP023715.1|WP_011241039.1|1247700_1248402_-|hypothetical-protein MAFFFIPLFSGGHLSIFKSGFAPVITLSLASLLLGGCVVHHGQFDEMGSMTVRRSSCPAVAVPDYTGDVTLFNPSDQRTASAIDIEAVITKLRPKCDDTASGPVVTHLTFTVQARRRHVGGARDITLPYFVAVMRAGTRLLSKEMGTVRIHFEPGQMATDTEVTTSSAIDHDSATLPRDIIRQLNKVRQVTDVDASVDPTNDPKVRAAMKEASFEMLVGFQLTEPQLAYNATR >NZ_CP023715.1|WP_011241040.1|1248514_1249252_-|3-oxoacyl-ACP-reductase-FabG 
MFDLNGLTAVVTGASGGIGSAIAKALADQGAQVALSGTRESALKEVAAILPNDPIILPANLGQKEDVEQLVPRALEKLGKIDILVNNAGITRDGLMMRMKDEDWADVIALNLESVFRLSRAVIRPMMKTRFGRIINISSVVGQTGNAGQANYAAAKAGMIGMSKSLAREVASRGITVNCIAPGFIETKMTEILSEQQKEAAKSQIPAGRFGDIQDIAAAAVYLASKEAGYMTGQTLSVNGGMSML >NZ_CP023715.1|WP_011241041.1|1249321_1250257_-|ACP-S-malonyltransferase MRAFIFPGQGSQSVGMGQALADASLAARHVFEEVDEALKQNLFRLMSQGPEEELRLTENAQPAIMAHSMAVLAMLEKEGNIRLTDAASFVAGHSLGEYSALAAADAFNLPTTAHLLKKRGQAMQAAVPVGEGGMAAILGLDFETVESIAQEAAENDICQAANDNAPGQVVISGSLAAIERAVALAKGKGARRAVMLDVSAPFHCSLMQPAADVMAKALQENRPRQPIVPVFANVSATAETDPTKIMNLLVEQVTGRVRWRESIAAMAAAGVEEFVEFGGKVLAPMIKRIAPDCKATSLIAPADIENFAASL >NZ_CP023715.1|WP_011241042.1|1250777_1251152_+|30S-ribosomal-protein-S6 MPLYEHVFLARQDLAQTQVDGLAATATSIIEEKSGKVVKTEIWGLRNLAYRIQKNRKAYYIMLEIDAPADAIQELERQMALNEDVIRYMTVRVDAHEQGPSAMMRRGDRDRSNRSDRRRDRDAA >NZ_CP023715.1|WP_011241043.1|1251172_1251397_+|30S-ribosomal-protein-S18 MARPFFRRRKSCPFAAKDAPKIDYKDVRLLQGFVSERGKIVPSRITAVSAKKQRELARAIKRARHIGLLPFIVK >NZ_CP023715.1|WP_011241044.1|1251411_1252041_+|50S-ribosomal-protein-L9 MDVILLERIEKLGHIGDVVAVKNGYARNFLLPRKKALRANEANRKIFEANRAQIEADNAARRTDAEKESEVVNGLTVTLIRQASNTGHLYGSVSARDLADAIVEAKPEAKVAKNQIVLDRPIKSIGISEVRVVLHPEVAVKIKVNVARSPEEAELQAEGVDVMNQMFERDGASFTEDYDPNAEPGLATEAEEAVADADDNAETNSEESL >NZ_CP023715.1|WP_011241045.1|1252120_1253116_-|dipeptide-epimerase MTATRSLSIMGESLPLKTPFRISRGVKNTIDTIVANISESGVTGRGEGIPYPRYGQTVESALIEANSVSHKITEHYGREALLTLLPPGPARNALDCALWDIEARISGQSVASMMGIAKLEPLATAVTISLDEPEVMAKAAAKLAYCPVIKVKVDEHNPEDCIKAVRDQAPKARLIVDPNESWSFDLLDKMQNFLADARVALLEQPLAAGADEALKGFSPAVPICADEVFHSADDLDHIADRYQVINIKLDKTGGLTAAIDIMKQARSLNLSIMVGCMVCSSLSLAPAFHLAAQADFVDLDGADWLVHDRNDGMLLDNGILHPPSATFWGGP |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP023715_4 | 1593316-1593404 | Orphan |
I-F
Consensus repeat of NZ_CP023715_4
|
1 spacer
spacers of NZ_CP023715_4
>4.1|1593344|33|NZ_CP023715|CRISPRCasFinder ATCCGCAATTTCTGGAACATGATCTTGCTGACT |
CRISPR arrays and Neighbor proteins around NZ_CP023715_4
The CRISPR arrays of NZ_CP023715_4 >merge|NZ_CP023715|4|1593316-1593404|CRISPRCasFinder GTTCACTGCCGCACAGGCAGCTTAGAAAATCCGCAATTTCTGGAACATGATCTTGCTGACTGTTCACTGCCGCATAGGCAGATTGTACT >NZ_CP023715|4|3|1593316-1593404|CRISPRCasFinder GTTCACTGCCGCACAGGCAGCTTAGAAA ATCCGCAATTTCTGGAACATGATCTTGCTGACT GTTCACTGCCGCATAGGCAGATTGTACT
>NZ_CP023715.1|WP_011241321.1|1590753_1592280_+|amidophosphoribosyltransferase MSPSLSSELFSDSISTLTTTTESLDNGASDDTLHEECGVFGIWGADTAAAVVALGLHALQHRGQEAAGITSWDGKNFHSRRAVGHVAGNFDRDDAIRSLPGSCAIGHVRYATTGASTLCNVQPLYAELVSGGFAIAHNGNISNAETLRHQLVRHGSIFQSTSDTETIIHLVATSSYRSLLDRFIDALKQVEGAYSLVCLTPEGMIACRDPLGIRPLVLGKVGETFVVASETVALDIIGGTYIRQVEPGELIIISEKGLQSIHPFKKQKPRPCIFEHVYFSRPDSLIGSTSVYSVRKSIGIELARENPVDADMVIPVPDSGTPAAIGYAQQSSLPFELGIIRSHYVGRTFIQPGDQVRHLGVKLKHNANRALIKGKKLVLVDDSIVRGTTSVKIIRMLRDAGAKEIHLRIASPPTRHSCFYGVDTPERAKLLAAKMTVEQMAEYIGADSLAFISMDGLYRAVGEEARNDAQPQYCDACFTGAYPTPLTDLGELGASEQLVRLSEQVAIA >NZ_CP023715.1|WP_011241320.1|1588942_1590331_+|glutamate--cysteine-ligase MSTRQTSSSQNHPIESRDDLLRIFQAGEKPKAQWRVGTEHEKLVYKKQNHQAPSYEEKGGICDLLQGFTRFGWQPIYENDKIIGLSGDDGAISLEPAGQFELSGAPRSTIHESYDEICRHIQQTQEVGDELGLGFLGLGLWPDKKRSDLPLMPKGRYKIMTEYMPKVGKLGLDMMLRTCTIQSNIDYGSEADMVKKFRVSLALQPLATALFANSPFLEGHPNGFSSYRSHIWTDTDPHRTGILPFVFDDDFGYERYIDYMLSVPMYFVYRDGRYIDASGQDFRAFLRGELPALPNEKPILSDWVDHLSTAFPEVRLKSYLEMRGADGASAMMSPALSAFWISILYDSELLDTASDIIKSWSMDDYRNLRNEVPKKGLKTLIGGRQSLLDLGRQLWPLMNDALKRRAILNDKGQDESRYLAPIGEILESGQSLSDRLLARYHQTGNLDFIYQECDWAQPHILS >NZ_CP023715.1|WP_011241319.1|1587866_1588637_+|16S-rRNA-(uracil(1498)-N(3))-methyltransferase MVAEPAWPVNTLPRLYVEEKLSLEAVIIPDRAQAHYLLSVMRFKMGSQLVLFDNLTGEWLGEVIEAGRKHLQLKITHHLNEKESIPDLWLLTAPIKKGRIDWIYEKACELGVARITPVITQRTIVDRVNLERLQAHIVEAAEQCGRTSLSEVTEACSLKSLLAEWPEDRALFFADETGGEPMIEALSKRKMAAAILVGPEGGFTDQERDMINAVKQAVPVSLGPRILRADTAAIAATALWMAAAGDWQKQPRQANL >NZ_CP023715.1|WP_011241318.1|1586592_1587558_+|thiamine-phosphate-kinase MSGREQAFITALRQIAGDPAARNLSDDAAVLPRPSGDLVLSHDIIVENVHYFPSDPPETVAQKLVGVNLSDLAAKGAKPIGALMGYSLGPDYKWDQAFLKGLESVCHQYNLPLLGGDTVAVPRHTGHFSAMTVIGLAPSCGVPDRRAAKEGDELWVTSPIGDAGFGLNLLKQKKNINHSAQEKLVQAYRSPEPRLKEGIWLAPHVHAMADISDGLLIDAERIANASGLAVRIRLDRVPLSSEAISCFGDTKSTRLQAVTAGDDYQLIMACAANKRQELLKLSKEKQFDLYRVGQLTAGSGLSLFYGAEPIKQPDRLGYLHG >NZ_CP023715.1|WP_011241317.1|1586100_1586574_+|transcription-antitermination-factor-NusB 
MAQTQKRPHKNARSAARLAAVQALYQREMEKTPLNILLDEFHQYRLGATIEDATYTKAEPSFFDDIVRGVGTRCEEIDRVISENLSERWSLDRLDRPMRQILRAGTYELLARPDVPTATVISEYIDVANAFYDRQEKNFVNGLLDTVAKKLRSSNNA >NZ_CP023715.1|WP_011241316.1|1584808_1586101_+|histidinol-dehydrogenase MLLKLDSRKADFQADFTRLVDERRESEGDVSRDVSAIIADVKKRGDVAIAELTQKFDRHDLNKGGWQLTQEEIKKACDSLPSELMDALKLAATRIRYCHENQLPESSEMTDAAGVRMGVRWQAVEAAGLYVPGGRAAYCSSVLMNAVPAKVAGVKRLVMVTPTPDGFVNPAVIAAAVISEVDEIWKIGGAQAVAALALGTEKIKPVDVVVGPGNAWVAEAKRQLYGQVGIDMVAGPSEIVVVADKDNDPEWLAADLLSQAEHDPTSQSILISDSEDLIEKTIEAVGRRLEKLETQKVARESWDKHGATILVQSLDEAPALVDRLAPEHLELAVADPDALFANVHHSGSVFLGRYTPEAIGDYVGGPNHVLPTGRRARFSSGLSVIDFMKRTTYLNCSQEALSKIGPAAVTLAKAEGLPAHAESVISRLNK >NZ_CP023715.1|WP_011241315.1|1584150_1584819_+|ATP-phosphoribosyltransferase MTKPLVFAIPKGRILKEALPMLEAAGIIPEPAFLDKESRLLRFKTNRPDIEIIRVRAFDVATFVAHGAAQMGIVGSDVIEEFSYPELYAPVDLDIGHCRLSIAEPKRLAKDDDPREWSHVRVATKYPHLTHRHFEARGVQAECIKLNGAMEIAPALGLAGRIVDLVSSGRTLEENGLVEVEKIMPISARLIVNRAAFKMRAGDIAPLVENFRRLVGVADNVA >NZ_CP023715.1|WP_011241314.1|1583756_1584062_+|BolA-family-transcriptional-regulator MNMTSPSDTNEGPVTRLMRERLEAAFSPETLVIEDDSNKHAGHAGHPHRSESHFTVTLVSQAFENESRISRERMVHKALSDLLPDRIHALRLKLDTPLRQE >NZ_CP023715.1|WP_181859167.1|1581229_1583392_+|squalene--hopene-cyclase MNSLSRLLMKKIFGAEKTSYKPASDTIIGTDTLKRPNRRPEPTAKVDKTIFKTMGNSLNNTLVSACDWLIGQQKPDGHWVGAVESNASMEAEWCLALWFLGLEDHPLRPRLGNALLEMQREDGSWGVYFGAGNGDINATVEAYAALRSLGYSADNPVLKKAAAWIAEKGGLKNIRVFTRYWLALIGEWPWEKTPNLPPEIIWFPDNFVFSIYNFAQWARATMVPIAILSARRPSRPLRPQDRLDELFPEGRARFDYELPKKEGIDLWSQFFRTTDRGLHWVQSNLLKRNSLREAAIRHVLEWIIRHQDADGGWGGIQPPWVYGLMALHGEGYQLYHPVMAKALSALDDPGWRHDRGESSWIQATNSPVWDTMLALMALKDAKAEDRFTPEMDKAADWLLARQVKVKGDWSIKLPDVEPGGWAFEYANDRYPDTDDTAVALIALSSYRDKEEWQKKGVEDAITRGVNWLIAMQSECGGWGAFDKDNNRSILSKIPFCDFGESIDPPSVDVTAHVLEAFGTLGLSRDMPVIQKAIDYVRSEQEAEGAWFGRWGVNYIYGTGAVLPALAAIGEDMTQPYITKACDWLVAHQQEDGGWGESCSSYMEIDSIGKGPTTPSQTAWALMGLIAANRPEDYEAIAKGCHYLIDRQEQDGSWKEEEFTGTGFPGYGVGQTIKLDDPALSKRLLQGAELSRAFMLRYDFYRQFFPIMALSRAERLIDLNN 
>NZ_CP023715.1|WP_011241312.1|1580632_1581199_+|TetR-family-transcriptional-regulator MARPRTIDRERVLKSAEQLVQRAGATAMTLEAVAKEAGITKGGLQYCFGSKDDLITALIDRWFAAFDCEVKEYSQSDDSPAGEARAYVQASSQIDDATSARMVGMLVTLLQSPNHLKKIQAWYARWMEKNLGQSEEARHIRTMLFAAEGAFFLRSLGFIKMSESEWATVFDDIKKLVPSAQAGRASFK >NZ_CP023715.1|WP_011241322.1|1593724_1594429_+|SDR-family-oxidoreductase MTHRPLSDQIALVTGASRGIGAATAKALAEAGAHVILVARTATDLDKVEEQIYQKGGSATIAPLDITNSGSCHHLAAAISGRWPALDIMVFAAARYEAQPSIAAASPALQQMLAVNALATQDLLSRFDPLIQESRSAHIIGLTLPKSQAPYPYNGSYYASKMAMEAILLSYGAENAERDTIKVALAELEAVATEGRKRAFPDEKADLLRSPDEVAKAIVTMIVQDYANGWQGKL >NZ_CP023715.1|WP_011241323.1|1594495_1594963_+|RNA-pyrophosphohydrolase MDNLEYRSGVGIMLLNKDNLVFAACRNDMKEEAWQMPQGGLEAKETPEVGVLRELEEETGIPPRMVAIISHTKEWLTYDFPADLQASFFKNKYRGQRQLWFLARYLGRDEDININTDKPEFRAWKWVEPKQLPDLIVAFKKPLYEKILSEFSASL >NZ_CP023715.1|WP_011241324.1|1594987_1595971_+|DUF481-domain-containing-protein MQSRTISPWLLWRISQGAVLLSLVPVSEVWAEEPPKLIQEMVTKALALDDPKTVKSIVLIAKKTVPDSAAEIDAMVADYNTKVEAREAEKKRKELRRVADSGMFENWTGSVELGGAKMTGNTRQTAIYGAVALERNGINWTHTVKARTDFQRTYGTTSAERFTASYQPHYKFDERLYMYGLALYERDMFLGYRTRITGGSGIGYKVFDQPNLSLAVEGGPAYRHTIFIDSSRPNGRRIRDTAAMRGSFTTKWVVSPLLTVSEDSSIFFESKDITASSTTSLETKLIGNLSTKLSFSVYYEKDVSASKNPVDTTSRITFAYALGKKKK >NZ_CP023715.1|WP_011241325.1|1596195_1596762_-|nicotinamide-mononucleotide-transporter MSVLEWLAVLTSLLGIVFSTRQIRICWLFYGISSLLYGKIFFSIKLYADCLLQIFFFFSSIYGWFHWHHYQKADKMTVITASHKSLLRDIAMAAALSAIFGFYLKNYTDDAFPWVDAILSCYSIVAQFWAARLYKANWFLWIVIDFCYTALFCYRGLWLTAWLYSVFMVMAVIGLKKWQNKNPAVACD >NZ_CP023715.1|WP_011241326.1|1597231_1597555_+|YnfA-family-protein MLALLYIPAALAEITGCFSFWAWIRLHKSPLWLLPGIASLLLFAWLLTFSPAENAGKAYAVYGGIYIIMSLLWSWKVEATPPDHWDLIGAAFCLVGAAIILWMPRSL >NZ_CP023715.1|WP_011241327.1|1597561_1598734_-|phosphotransferase 
MAIKDDGMTEAAHKAVHQFGVSGYQTERDWPYLTILEINAVLASFSGQGKAIKILSHSRRPYSAAALFETDQKQTFFIKRHHHKIRNKTELLKEHLFARHLAQKSFPISTPMMADHNQTVIEKEPWIYEIHPQAQGVDIYQDVMSWEPFFNRDHAYEAGRMLALFHQAAQGFDESPRHHALLVSAGDTLLHDDFIKALSEWITAQPELLKQLEGKNWQQDITENILPFHHQLQPLTADITPLWGHGDWHSSNLMWTGRDPKAKVSCVLDLGMADYSSAMFDLATAIERNVIAWLDMDSRQDIVIYDQLFALLRGYHHIKPLSQMDKQLLSAFMPLLHVEYALSEIVYFGALLQDKTSADIAYYDYLLGHSRWFSGQEGQQLLQKIIHFEA >NZ_CP023715.1|WP_017466250.1|1598746_1601170_-|TonB-dependent-receptor MTYQDMTASEWRKYYQHFLVTSVFLAGISGVFPIHPAHAETQESPKSSDKTSSKNDAIIVTGRPLFKTANGFSVNDIGGGLIQKETETRSVSHISTDFIQKQAPTANAFDLVAMLPGANVTSSDPLGFSTQTNITIRGLSGDAIGYVLEGMPLNDVAYYTGYPTQFADSENYQQIGLAQGSADLDSPVLNAAGGVMNLNFRKPADKMGGYADFSYDSYNTNRQFLRFDMGEIGHSGVKGFVSYSHARTDNWRGAGYDEKQHVDFKFLKEWGQDSHVSLLGTWNKGITSYYPQVDKQSWKENGISGSNNLASRYNVNNDAAGSDYWRLYRAPEEIFYLAAPIDVRLASNLKLKVTPYGQWDRGNVPAGSTLNNSGLWNGTEAIAGTINLPNATDGTATVRSNYTQRSARAGVNASATWSLKNHDLTLGYWFDYSADKEQNSFTPVDSNGYASNIWADRHSTLIKMPDGSPLLGTNNRTHTYVNAVYLGDHMTFLQNRLTFDIGFKEVIMTRHGYNYLPGSQHKANFSTSEPLPRLGLRYQIDSKSHVFFSASTNFRTADETALYNSYDPTSGDIIVNGNKNLKNEYSVSEELGYRYSDALVTGSLTLFNYNFSNRQLQTVIVQNGSHIQSTINAGNQISRGVDFEIGLAPWHHISPYLSGEYLYTRQTSDLTVGDDLLPTKGKRAVRSPAFQGSLGVTYDDHHFFGMASVKYTGSQYGSFMNDEKIPAYVTGNISVGYRFTQEAFLKHPEIRLNFINIGNNHYLSGIASPTANAQDTVGRNGTVISGSAPQYYIGGGFAVLASLSSAF >NZ_CP023715.1|WP_011241329.1|1601328_1602120_-|pyruvate-formate-lyase-activating-protein MALIIKRPAVTSLVEEAGCDNTLKGRIHSTEIGGAVDGPGVRFVLFLAGCALRCQYCHNPDSWFLKNGRAVTLAEMMEEVASYADFLKRAGGGITISGGEPLVQPEFTGALLKAAKYLGLHTAIDTAGFLGAQADDALLSNTDLVLLDIKAFNDKRYKALTGVELQPTLAFAKRLAALKKPVWLRYVLVPGLTDNFNEIANLADFAATLGNIERVDVLPFHKMGEYKWKASGLAYKLGDTQPPSPALVEDVRGIFRDNGLNLS >NZ_CP023715.1|WP_181859171.1|1602103_1604350_-|formate-C-acetyltransferase 
MDSALDPWRGFKGRKWQREIDTRDFILSNVTSYTGNSDFLAGITPKTTKLWEKLQVSLEAERKTQGGVLDVDTSTVSNITAHAPGFIEKDLEVVVGLQTDAPLKRAIMPFGGYRMVKKGLEAYGFKEDESLSKIFPALRKTHNDGVFDVYTPEIMACRRSGIITGLPDAYGRGRIIGDYRRVALYGVDCLIEDKKEQGKRLERNPFDEETIKLREEVAEQIKALHELAAMAKSYGYDISQPAVTAQQAVQWTYFAYLAAVKEANGAAMSLGRVSTFLDIYIERDLKEGRITEAEAQELVDQFVMKLRIVRFLRTPEYDQLFSGDPTWVTESLGGMAIDGRTLVTKSSFRFLHTLENLGPAPEPNLTVLWSENLPKGFKDYCAKISIDTSSIQYENDDLMRSYWGDDYGIACCVSAMRIGKQMQFFGARANLAKTLLYAINGGRDEKSGVQVAPAFAPVTGDILDYEDVKSRMVQMMEWLSSVYINALNAIHFMHDKYMYERVEMALHDLEILRTMACGIAGLSVAADSLSAIKHAKVKIIRDERGLATDFKIEGDYPAYGNNDDRADEIAIWLVETFMNMLRKQTTYRRSVPTQSILTITSNVVYGKKTGNTPDGRRAGEPFAPGANPMHGRDLKGPVASMASVAKLPYAHAQDGISNTFTIVPNALGMNKEERIDNLIGLLSGYFGAGAHHMNVNVFDRNTLLDAVDHPEKYPQLTIRVSGYAVNFVKLTREQQMDVIHRTFHGLDN >NZ_CP023715.1|WP_011241331.1|1605182_1606613_+|cytochrome-ubiquinol-oxidase-subunit-I MVPDATALMLARIQFAFTVGFHIIFPAFSIGLAAYLAVLEGLWLKTGRNVYLHLFKYWIKIFALVFGMGVVSGLVMAYEFGTNWSLFSQKAGAITGPLLGYEVLTAFFLEAGFLGIMLFGLGRVGKGLHFLATCLVSIGTLISMTWILASNSWMQTPAGYSIDPKTGHFLPKSWFEVIFNPSFPYRLVHMGMAAFICVAFVVGATAAFHMLRDRKNGKPVTEPVRVMFSMALWMAAIAAPFQLLAGDMHGLNTLKYQPAKIAAMEGDWESEGPASEILFGIPNMKTERTDYAIKIPYAGSLILTHSLNGKVPGLKDYPRDQRPPSPILFFSFRIMVGLGGLMILLGLWSLFLRFRGQLYNNKALQWATLLMAPSGFIALLCGWVTTEVGRQPYTVYGLLRTSDSVSPVMLPSMIFSMTAFVIVYFFVFGAGMLILFRMLSHQPSSHEKGADPENPLQNSHAKGATQLAQDLSGKRS |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP023715_3 | 1592755-1593146 | Orphan |
I-F
Consensus repeat of NZ_CP023715_3
|
6 spacers
spacers of NZ_CP023715_3
>3.1|1592783|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT
GCGCATCTTCTGATGCTTTTTTAGCTGCGGCC
>3.2|1592843|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT
TTGACGCTGTGAGCGTGACGATATGCTTTCAC
>3.3|1592903|33|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT
ATCGGGGCATAAAATAGCGACTTGCTCACCGAT
>3.4|1592964|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT
GTCTGGCTGAAATGAGGTCCGACGATTTGCAT
>3.5|1593024|33|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT
ACCCCTCTTGGTAGTCGTATCTGCAGACGCAAT
>3.6|1593085|33|NZ_CP023715|CRISPRCasFinder,CRT
ATATAGAAGATTTATCAGATACGTTGAGAATAA
|
CRISPR arrays and Neighbor proteins around NZ_CP023715_3
The CRISPR arrays of NZ_CP023715_3 >merge|NZ_CP023715|3|1592755-1593146|PILER-CR,CRISPRCasFinder,CRT GTTCACTGCCGCACAGGCAGCTTAGAAAGCGCATCTTCTGATGCTTTTTTAGCTGCGGCCGTTCACTGCCGCACAGGCAGCTTAGAAATTGACGCTGTGAGCGTGACGATATGCTTTCACGTTCACTGCCGCACAGGCAGCTTAGAAAATCGGGGCATAAAATAGCGACTTGCTCACCGATGTTCACTGCCGCACAGGCAGCTTAGAAAGTCTGGCTGAAATGAGGTCCGACGATTTGCATGTTCACTGCCGCACAGGCAGCTTAGAAAACCCCTCTTGGTAGTCGTATCTGCAGACGCAATGTTCACTGCCGCACAGGCAGCTTAGAAAATATAGAAGATTTATCAGATACGTTGAGAATAAGTTCACTGCCGCACAGGCAGCTTAGAAAA >NZ_CP023715|3|3|1592755-1593084|PILER-CR GTTCACTGCCGCACAGGCAGCTTAGAAA GCGCATCTTCTGATGCTTTTTTAGCTGCGGCC GTTCACTGCCGCACAGGCAGCTTAGAAA TTGACGCTGTGAGCGTGACGATATGCTTTCAC GTTCACTGCCGCACAGGCAGCTTAGAAA ATCGGGGCATAAAATAGCGACTTGCTCACCGAT GTTCACTGCCGCACAGGCAGCTTAGAAA GTCTGGCTGAAATGAGGTCCGACGATTTGCAT GTTCACTGCCGCACAGGCAGCTTAGAAA ACCCCTCTTGGTAGTCGTATCTGCAGACGCAAT GTTCACTGCCGCACAGGCAGCTTAGAAA >NZ_CP023715|3|2|1592755-1593146|CRISPRCasFinder GTTCACTGCCGCACAGGCAGCTTAGAAA GCGCATCTTCTGATGCTTTTTTAGCTGCGGCC GTTCACTGCCGCACAGGCAGCTTAGAAA TTGACGCTGTGAGCGTGACGATATGCTTTCAC GTTCACTGCCGCACAGGCAGCTTAGAAA ATCGGGGCATAAAATAGCGACTTGCTCACCGAT GTTCACTGCCGCACAGGCAGCTTAGAAA GTCTGGCTGAAATGAGGTCCGACGATTTGCAT GTTCACTGCCGCACAGGCAGCTTAGAAA ACCCCTCTTGGTAGTCGTATCTGCAGACGCAAT GTTCACTGCCGCACAGGCAGCTTAGAAA ATATAGAAGATTTATCAGATACGTTGAGAATAA GTTCACTGCCGCACAGGCAGCTTAGAAAA >NZ_CP023715|3|3|1592755-1593145|CRT GTTCACTGCCGCACAGGCAGCTTAGAAA GCGCATCTTCTGATGCTTTTTTAGCTGCGGCC GTTCACTGCCGCACAGGCAGCTTAGAAA TTGACGCTGTGAGCGTGACGATATGCTTTCAC GTTCACTGCCGCACAGGCAGCTTAGAAA ATCGGGGCATAAAATAGCGACTTGCTCACCGAT GTTCACTGCCGCACAGGCAGCTTAGAAA GTCTGGCTGAAATGAGGTCCGACGATTTGCAT GTTCACTGCCGCACAGGCAGCTTAGAAA ACCCCTCTTGGTAGTCGTATCTGCAGACGCAAT GTTCACTGCCGCACAGGCAGCTTAGAAA ATATAGAAGATTTATCAGATACGTTGAGAATAA GTTCACTGCCGCACAGGCAGCTTAGAAA
>NZ_CP023715.1|WP_011241321.1|1590753_1592280_+|amidophosphoribosyltransferase MSPSLSSELFSDSISTLTTTTESLDNGASDDTLHEECGVFGIWGADTAAAVVALGLHALQHRGQEAAGITSWDGKNFHSRRAVGHVAGNFDRDDAIRSLPGSCAIGHVRYATTGASTLCNVQPLYAELVSGGFAIAHNGNISNAETLRHQLVRHGSIFQSTSDTETIIHLVATSSYRSLLDRFIDALKQVEGAYSLVCLTPEGMIACRDPLGIRPLVLGKVGETFVVASETVALDIIGGTYIRQVEPGELIIISEKGLQSIHPFKKQKPRPCIFEHVYFSRPDSLIGSTSVYSVRKSIGIELARENPVDADMVIPVPDSGTPAAIGYAQQSSLPFELGIIRSHYVGRTFIQPGDQVRHLGVKLKHNANRALIKGKKLVLVDDSIVRGTTSVKIIRMLRDAGAKEIHLRIASPPTRHSCFYGVDTPERAKLLAAKMTVEQMAEYIGADSLAFISMDGLYRAVGEEARNDAQPQYCDACFTGAYPTPLTDLGELGASEQLVRLSEQVAIA >NZ_CP023715.1|WP_011241320.1|1588942_1590331_+|glutamate--cysteine-ligase MSTRQTSSSQNHPIESRDDLLRIFQAGEKPKAQWRVGTEHEKLVYKKQNHQAPSYEEKGGICDLLQGFTRFGWQPIYENDKIIGLSGDDGAISLEPAGQFELSGAPRSTIHESYDEICRHIQQTQEVGDELGLGFLGLGLWPDKKRSDLPLMPKGRYKIMTEYMPKVGKLGLDMMLRTCTIQSNIDYGSEADMVKKFRVSLALQPLATALFANSPFLEGHPNGFSSYRSHIWTDTDPHRTGILPFVFDDDFGYERYIDYMLSVPMYFVYRDGRYIDASGQDFRAFLRGELPALPNEKPILSDWVDHLSTAFPEVRLKSYLEMRGADGASAMMSPALSAFWISILYDSELLDTASDIIKSWSMDDYRNLRNEVPKKGLKTLIGGRQSLLDLGRQLWPLMNDALKRRAILNDKGQDESRYLAPIGEILESGQSLSDRLLARYHQTGNLDFIYQECDWAQPHILS >NZ_CP023715.1|WP_011241319.1|1587866_1588637_+|16S-rRNA-(uracil(1498)-N(3))-methyltransferase MVAEPAWPVNTLPRLYVEEKLSLEAVIIPDRAQAHYLLSVMRFKMGSQLVLFDNLTGEWLGEVIEAGRKHLQLKITHHLNEKESIPDLWLLTAPIKKGRIDWIYEKACELGVARITPVITQRTIVDRVNLERLQAHIVEAAEQCGRTSLSEVTEACSLKSLLAEWPEDRALFFADETGGEPMIEALSKRKMAAAILVGPEGGFTDQERDMINAVKQAVPVSLGPRILRADTAAIAATALWMAAAGDWQKQPRQANL >NZ_CP023715.1|WP_011241318.1|1586592_1587558_+|thiamine-phosphate-kinase MSGREQAFITALRQIAGDPAARNLSDDAAVLPRPSGDLVLSHDIIVENVHYFPSDPPETVAQKLVGVNLSDLAAKGAKPIGALMGYSLGPDYKWDQAFLKGLESVCHQYNLPLLGGDTVAVPRHTGHFSAMTVIGLAPSCGVPDRRAAKEGDELWVTSPIGDAGFGLNLLKQKKNINHSAQEKLVQAYRSPEPRLKEGIWLAPHVHAMADISDGLLIDAERIANASGLAVRIRLDRVPLSSEAISCFGDTKSTRLQAVTAGDDYQLIMACAANKRQELLKLSKEKQFDLYRVGQLTAGSGLSLFYGAEPIKQPDRLGYLHG >NZ_CP023715.1|WP_011241317.1|1586100_1586574_+|transcription-antitermination-factor-NusB 
MAQTQKRPHKNARSAARLAAVQALYQREMEKTPLNILLDEFHQYRLGATIEDATYTKAEPSFFDDIVRGVGTRCEEIDRVISENLSERWSLDRLDRPMRQILRAGTYELLARPDVPTATVISEYIDVANAFYDRQEKNFVNGLLDTVAKKLRSSNNA >NZ_CP023715.1|WP_011241316.1|1584808_1586101_+|histidinol-dehydrogenase MLLKLDSRKADFQADFTRLVDERRESEGDVSRDVSAIIADVKKRGDVAIAELTQKFDRHDLNKGGWQLTQEEIKKACDSLPSELMDALKLAATRIRYCHENQLPESSEMTDAAGVRMGVRWQAVEAAGLYVPGGRAAYCSSVLMNAVPAKVAGVKRLVMVTPTPDGFVNPAVIAAAVISEVDEIWKIGGAQAVAALALGTEKIKPVDVVVGPGNAWVAEAKRQLYGQVGIDMVAGPSEIVVVADKDNDPEWLAADLLSQAEHDPTSQSILISDSEDLIEKTIEAVGRRLEKLETQKVARESWDKHGATILVQSLDEAPALVDRLAPEHLELAVADPDALFANVHHSGSVFLGRYTPEAIGDYVGGPNHVLPTGRRARFSSGLSVIDFMKRTTYLNCSQEALSKIGPAAVTLAKAEGLPAHAESVISRLNK >NZ_CP023715.1|WP_011241315.1|1584150_1584819_+|ATP-phosphoribosyltransferase MTKPLVFAIPKGRILKEALPMLEAAGIIPEPAFLDKESRLLRFKTNRPDIEIIRVRAFDVATFVAHGAAQMGIVGSDVIEEFSYPELYAPVDLDIGHCRLSIAEPKRLAKDDDPREWSHVRVATKYPHLTHRHFEARGVQAECIKLNGAMEIAPALGLAGRIVDLVSSGRTLEENGLVEVEKIMPISARLIVNRAAFKMRAGDIAPLVENFRRLVGVADNVA >NZ_CP023715.1|WP_011241314.1|1583756_1584062_+|BolA-family-transcriptional-regulator MNMTSPSDTNEGPVTRLMRERLEAAFSPETLVIEDDSNKHAGHAGHPHRSESHFTVTLVSQAFENESRISRERMVHKALSDLLPDRIHALRLKLDTPLRQE >NZ_CP023715.1|WP_181859167.1|1581229_1583392_+|squalene--hopene-cyclase MNSLSRLLMKKIFGAEKTSYKPASDTIIGTDTLKRPNRRPEPTAKVDKTIFKTMGNSLNNTLVSACDWLIGQQKPDGHWVGAVESNASMEAEWCLALWFLGLEDHPLRPRLGNALLEMQREDGSWGVYFGAGNGDINATVEAYAALRSLGYSADNPVLKKAAAWIAEKGGLKNIRVFTRYWLALIGEWPWEKTPNLPPEIIWFPDNFVFSIYNFAQWARATMVPIAILSARRPSRPLRPQDRLDELFPEGRARFDYELPKKEGIDLWSQFFRTTDRGLHWVQSNLLKRNSLREAAIRHVLEWIIRHQDADGGWGGIQPPWVYGLMALHGEGYQLYHPVMAKALSALDDPGWRHDRGESSWIQATNSPVWDTMLALMALKDAKAEDRFTPEMDKAADWLLARQVKVKGDWSIKLPDVEPGGWAFEYANDRYPDTDDTAVALIALSSYRDKEEWQKKGVEDAITRGVNWLIAMQSECGGWGAFDKDNNRSILSKIPFCDFGESIDPPSVDVTAHVLEAFGTLGLSRDMPVIQKAIDYVRSEQEAEGAWFGRWGVNYIYGTGAVLPALAAIGEDMTQPYITKACDWLVAHQQEDGGWGESCSSYMEIDSIGKGPTTPSQTAWALMGLIAANRPEDYEAIAKGCHYLIDRQEQDGSWKEEEFTGTGFPGYGVGQTIKLDDPALSKRLLQGAELSRAFMLRYDFYRQFFPIMALSRAERLIDLNN 
>NZ_CP023715.1|WP_011241312.1|1580632_1581199_+|TetR-family-transcriptional-regulator MARPRTIDRERVLKSAEQLVQRAGATAMTLEAVAKEAGITKGGLQYCFGSKDDLITALIDRWFAAFDCEVKEYSQSDDSPAGEARAYVQASSQIDDATSARMVGMLVTLLQSPNHLKKIQAWYARWMEKNLGQSEEARHIRTMLFAAEGAFFLRSLGFIKMSESEWATVFDDIKKLVPSAQAGRASFK >NZ_CP023715.1|WP_011241322.1|1593724_1594429_+|SDR-family-oxidoreductase MTHRPLSDQIALVTGASRGIGAATAKALAEAGAHVILVARTATDLDKVEEQIYQKGGSATIAPLDITNSGSCHHLAAAISGRWPALDIMVFAAARYEAQPSIAAASPALQQMLAVNALATQDLLSRFDPLIQESRSAHIIGLTLPKSQAPYPYNGSYYASKMAMEAILLSYGAENAERDTIKVALAELEAVATEGRKRAFPDEKADLLRSPDEVAKAIVTMIVQDYANGWQGKL >NZ_CP023715.1|WP_011241323.1|1594495_1594963_+|RNA-pyrophosphohydrolase MDNLEYRSGVGIMLLNKDNLVFAACRNDMKEEAWQMPQGGLEAKETPEVGVLRELEEETGIPPRMVAIISHTKEWLTYDFPADLQASFFKNKYRGQRQLWFLARYLGRDEDININTDKPEFRAWKWVEPKQLPDLIVAFKKPLYEKILSEFSASL >NZ_CP023715.1|WP_011241324.1|1594987_1595971_+|DUF481-domain-containing-protein MQSRTISPWLLWRISQGAVLLSLVPVSEVWAEEPPKLIQEMVTKALALDDPKTVKSIVLIAKKTVPDSAAEIDAMVADYNTKVEAREAEKKRKELRRVADSGMFENWTGSVELGGAKMTGNTRQTAIYGAVALERNGINWTHTVKARTDFQRTYGTTSAERFTASYQPHYKFDERLYMYGLALYERDMFLGYRTRITGGSGIGYKVFDQPNLSLAVEGGPAYRHTIFIDSSRPNGRRIRDTAAMRGSFTTKWVVSPLLTVSEDSSIFFESKDITASSTTSLETKLIGNLSTKLSFSVYYEKDVSASKNPVDTTSRITFAYALGKKKK >NZ_CP023715.1|WP_011241325.1|1596195_1596762_-|nicotinamide-mononucleotide-transporter MSVLEWLAVLTSLLGIVFSTRQIRICWLFYGISSLLYGKIFFSIKLYADCLLQIFFFFSSIYGWFHWHHYQKADKMTVITASHKSLLRDIAMAAALSAIFGFYLKNYTDDAFPWVDAILSCYSIVAQFWAARLYKANWFLWIVIDFCYTALFCYRGLWLTAWLYSVFMVMAVIGLKKWQNKNPAVACD >NZ_CP023715.1|WP_011241326.1|1597231_1597555_+|YnfA-family-protein MLALLYIPAALAEITGCFSFWAWIRLHKSPLWLLPGIASLLLFAWLLTFSPAENAGKAYAVYGGIYIIMSLLWSWKVEATPPDHWDLIGAAFCLVGAAIILWMPRSL >NZ_CP023715.1|WP_011241327.1|1597561_1598734_-|phosphotransferase 
MAIKDDGMTEAAHKAVHQFGVSGYQTERDWPYLTILEINAVLASFSGQGKAIKILSHSRRPYSAAALFETDQKQTFFIKRHHHKIRNKTELLKEHLFARHLAQKSFPISTPMMADHNQTVIEKEPWIYEIHPQAQGVDIYQDVMSWEPFFNRDHAYEAGRMLALFHQAAQGFDESPRHHALLVSAGDTLLHDDFIKALSEWITAQPELLKQLEGKNWQQDITENILPFHHQLQPLTADITPLWGHGDWHSSNLMWTGRDPKAKVSCVLDLGMADYSSAMFDLATAIERNVIAWLDMDSRQDIVIYDQLFALLRGYHHIKPLSQMDKQLLSAFMPLLHVEYALSEIVYFGALLQDKTSADIAYYDYLLGHSRWFSGQEGQQLLQKIIHFEA >NZ_CP023715.1|WP_017466250.1|1598746_1601170_-|TonB-dependent-receptor MTYQDMTASEWRKYYQHFLVTSVFLAGISGVFPIHPAHAETQESPKSSDKTSSKNDAIIVTGRPLFKTANGFSVNDIGGGLIQKETETRSVSHISTDFIQKQAPTANAFDLVAMLPGANVTSSDPLGFSTQTNITIRGLSGDAIGYVLEGMPLNDVAYYTGYPTQFADSENYQQIGLAQGSADLDSPVLNAAGGVMNLNFRKPADKMGGYADFSYDSYNTNRQFLRFDMGEIGHSGVKGFVSYSHARTDNWRGAGYDEKQHVDFKFLKEWGQDSHVSLLGTWNKGITSYYPQVDKQSWKENGISGSNNLASRYNVNNDAAGSDYWRLYRAPEEIFYLAAPIDVRLASNLKLKVTPYGQWDRGNVPAGSTLNNSGLWNGTEAIAGTINLPNATDGTATVRSNYTQRSARAGVNASATWSLKNHDLTLGYWFDYSADKEQNSFTPVDSNGYASNIWADRHSTLIKMPDGSPLLGTNNRTHTYVNAVYLGDHMTFLQNRLTFDIGFKEVIMTRHGYNYLPGSQHKANFSTSEPLPRLGLRYQIDSKSHVFFSASTNFRTADETALYNSYDPTSGDIIVNGNKNLKNEYSVSEELGYRYSDALVTGSLTLFNYNFSNRQLQTVIVQNGSHIQSTINAGNQISRGVDFEIGLAPWHHISPYLSGEYLYTRQTSDLTVGDDLLPTKGKRAVRSPAFQGSLGVTYDDHHFFGMASVKYTGSQYGSFMNDEKIPAYVTGNISVGYRFTQEAFLKHPEIRLNFINIGNNHYLSGIASPTANAQDTVGRNGTVISGSAPQYYIGGGFAVLASLSSAF >NZ_CP023715.1|WP_011241329.1|1601328_1602120_-|pyruvate-formate-lyase-activating-protein MALIIKRPAVTSLVEEAGCDNTLKGRIHSTEIGGAVDGPGVRFVLFLAGCALRCQYCHNPDSWFLKNGRAVTLAEMMEEVASYADFLKRAGGGITISGGEPLVQPEFTGALLKAAKYLGLHTAIDTAGFLGAQADDALLSNTDLVLLDIKAFNDKRYKALTGVELQPTLAFAKRLAALKKPVWLRYVLVPGLTDNFNEIANLADFAATLGNIERVDVLPFHKMGEYKWKASGLAYKLGDTQPPSPALVEDVRGIFRDNGLNLS >NZ_CP023715.1|WP_181859171.1|1602103_1604350_-|formate-C-acetyltransferase 
MDSALDPWRGFKGRKWQREIDTRDFILSNVTSYTGNSDFLAGITPKTTKLWEKLQVSLEAERKTQGGVLDVDTSTVSNITAHAPGFIEKDLEVVVGLQTDAPLKRAIMPFGGYRMVKKGLEAYGFKEDESLSKIFPALRKTHNDGVFDVYTPEIMACRRSGIITGLPDAYGRGRIIGDYRRVALYGVDCLIEDKKEQGKRLERNPFDEETIKLREEVAEQIKALHELAAMAKSYGYDISQPAVTAQQAVQWTYFAYLAAVKEANGAAMSLGRVSTFLDIYIERDLKEGRITEAEAQELVDQFVMKLRIVRFLRTPEYDQLFSGDPTWVTESLGGMAIDGRTLVTKSSFRFLHTLENLGPAPEPNLTVLWSENLPKGFKDYCAKISIDTSSIQYENDDLMRSYWGDDYGIACCVSAMRIGKQMQFFGARANLAKTLLYAINGGRDEKSGVQVAPAFAPVTGDILDYEDVKSRMVQMMEWLSSVYINALNAIHFMHDKYMYERVEMALHDLEILRTMACGIAGLSVAADSLSAIKHAKVKIIRDERGLATDFKIEGDYPAYGNNDDRADEIAIWLVETFMNMLRKQTTYRRSVPTQSILTITSNVVYGKKTGNTPDGRRAGEPFAPGANPMHGRDLKGPVASMASVAKLPYAHAQDGISNTFTIVPNALGMNKEERIDNLIGLLSGYFGAGAHHMNVNVFDRNTLLDAVDHPEKYPQLTIRVSGYAVNFVKLTREQQMDVIHRTFHGLDN >NZ_CP023715.1|WP_011241331.1|1605182_1606613_+|cytochrome-ubiquinol-oxidase-subunit-I MVPDATALMLARIQFAFTVGFHIIFPAFSIGLAAYLAVLEGLWLKTGRNVYLHLFKYWIKIFALVFGMGVVSGLVMAYEFGTNWSLFSQKAGAITGPLLGYEVLTAFFLEAGFLGIMLFGLGRVGKGLHFLATCLVSIGTLISMTWILASNSWMQTPAGYSIDPKTGHFLPKSWFEVIFNPSFPYRLVHMGMAAFICVAFVVGATAAFHMLRDRKNGKPVTEPVRVMFSMALWMAAIAAPFQLLAGDMHGLNTLKYQPAKIAAMEGDWESEGPASEILFGIPNMKTERTDYAIKIPYAGSLILTHSLNGKVPGLKDYPRDQRPPSPILFFSFRIMVGLGGLMILLGLWSLFLRFRGQLYNNKALQWATLLMAPSGFIALLCGWVTTEVGRQPYTVYGLLRTSDSVSPVMLPSMIFSMTAFVIVYFFVFGAGMLILFRMLSHQPSSHEKGADPENPLQNSHAKGATQLAQDLSGKRS |
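The spacer start coordinates listed for NZ_CP023715_3 appear to follow directly from the array start, the repeat length, and the spacer lengths. The sketch below reproduces them under two assumptions that the page does not state explicitly: coordinates are 1-based, and every repeat preceding a spacer is 28 bp long (the length of the repeat `GTTCACTGCCGCACAGGCAGCTTAGAAA` shown in the array). The function name `spacer_starts` is illustrative and not part of any of the listed tools.

```python
def spacer_starts(array_start: int, repeat_len: int, spacer_lens: list[int]) -> list[int]:
    """Return the start coordinate of each spacer in a repeat-spacer array.

    Assumes the array begins with a repeat and that all repeats preceding a
    spacer share the same length (an assumption, not a documented rule).
    """
    starts = []
    pos = array_start
    for length in spacer_lens:
        pos += repeat_len   # skip the repeat in front of this spacer
        starts.append(pos)  # the spacer begins right after the repeat
        pos += length       # skip the spacer itself
    return starts


# Values taken from the NZ_CP023715_3 entry: array start and spacer lengths 3.1 to 3.6.
starts = spacer_starts(1592755, 28, [32, 32, 33, 32, 33, 33])
print(starts)
# [1592783, 1592843, 1592903, 1592964, 1593024, 1593085]
```

The printed values match the start positions in the spacer headers (3.1 through 3.6), which suggests the three tools differ only in how they call the final repeat.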
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
NZ_CP023715_2 | 2.6|1245683|33|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1245683-1245715 | 33 | NZ_CP023715.1 | 293330-293362 | 1 | 0.97 |
1. spacer 2.6|1245683|33|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to position: 293330-293362, mismatch: 1, identity: 0.97
atcgaactgcgtgcctgatagccgatacgctga CRISPR spacer atcgaactacgtgcctgatagccgatacgctga Protospacer ********.************************
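The mismatch count and identity reported for this hit can be checked by hand. The following is a minimal sketch, assuming an ungapped, position-by-position comparison and that identity is `1 - mismatches / length` rounded to three decimals; that formula is inferred from the values on this page rather than taken from the tool's documentation. The function name `compare_spacer` is hypothetical.

```python
def compare_spacer(spacer: str, protospacer: str) -> tuple[int, float]:
    """Return (mismatches, identity) for two ungapped, equal-length sequences."""
    if len(spacer) != len(protospacer):
        raise ValueError("ungapped comparison needs sequences of equal length")
    # Count positions where the two sequences differ (case-insensitive).
    mismatches = sum(a != b for a, b in zip(spacer.lower(), protospacer.lower()))
    # Assumed definition: fraction of matching positions, rounded to 3 decimals.
    identity = round(1 - mismatches / len(spacer), 3)
    return mismatches, identity


# Spacer 2.6 and its self-targeting protospacer at 293330-293362.
spacer = "atcgaactgcgtgcctgatagccgatacgctga"
protospacer = "atcgaactacgtgcctgatagccgatacgctga"
print(compare_spacer(spacer, protospacer))
# (1, 0.97)
```

Alignments that contain gap characters (`-`) would need a gapped comparison instead, which this sketch does not attempt.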
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP023715_3 | 3.2|1592843|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1592843-1592874 | 32 | NZ_CP021792 | Zymomonas mobilis subsp. mobilis strain NRRL B-1960 plasmid pZMO1960_1A, complete sequence | 1418-1449 | 0 | 1.0 |
NZ_CP023715_3 | 3.2|1592843|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1592843-1592874 | 32 | NC_019198 | Zymomonas mobilis subsp. mobilis NCIMB 11163 plasmid pZMO1A, complete sequence | 1479-1510 | 0 | 1.0 |
NZ_CP023715_3 | 3.2|1592843|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1592843-1592874 | 32 | NC_019210 | Zymomonas mobilis subsp. mobilis plasmid pZMO1B, complete sequence | 1479-1510 | 0 | 1.0 |
NZ_CP023715_3 | 3.2|1592843|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1592843-1592874 | 32 | NC_011363 | Zymomonas mobilis subsp. mobilis ATCC 10988 plasmid pZMO1, complete sequence | 1482-1513 | 1 | 0.969 |
NZ_CP023715_3 | 3.6|1593085|33|NZ_CP023715|CRISPRCasFinder,CRT | 1593085-1593117 | 33 | NC_001845 | Zymomonas mobilis ATCC10988 plasmid pZMO1, complete sequence | 653-685 | 1 | 0.97 |
NZ_CP023715_3 | 3.6|1593085|33|NZ_CP023715|CRISPRCasFinder,CRT | 1593085-1593117 | 33 | NC_009716 | Escherichia sp. Sflu5 cryptic plasmid pAK51 | 3617-3649 | 1 | 0.97 |
NZ_CP023715_3 | 3.6|1593085|33|NZ_CP023715|CRISPRCasFinder,CRT | 1593085-1593117 | 33 | NC_005701 | Zymomonas mobilis ATCC 10988 plasmid pZMO2, complete sequence | 8-40 | 1 | 0.97 |
NZ_CP023715_3 | 3.1|1592783|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1592783-1592814 | 32 | NC_013784 | Zymomonas mobilis subsp. mobilis ZM4 plasmid pZZM401, complete sequence | 12257-12288 | 2 | 0.938 |
NZ_CP023715_3 | 3.1|1592783|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1592783-1592814 | 32 | NZ_CP003712 | Zymomonas mobilis subsp. mobilis NRRL B-12526 plasmid pZM1252603, complete sequence | 22552-22583 | 2 | 0.938 |
NZ_CP023715_3 | 3.1|1592783|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1592783-1592814 | 32 | NZ_CP003718 | Zymomonas mobilis subsp. mobilis str. CP4 = NRRL B-14023 plasmid pZM1402303, complete sequence | 12257-12288 | 2 | 0.938 |
NZ_CP023715_3 | 3.2|1592843|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1592843-1592874 | 32 | NC_019019 | Zymomonas mobilis plasmid pZMN1-1, complete sequence | 1475-1506 | 2 | 0.938 |
NZ_CP023715_3 | 3.2|1592843|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1592843-1592874 | 32 | NC_009716 | Escherichia sp. Sflu5 cryptic plasmid pAK51 | 4439-4470 | 4 | 0.875 |
NZ_CP023715_3 | 3.2|1592843|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1592843-1592874 | 32 | NC_001845 | Zymomonas mobilis ATCC10988 plasmid pZMO1, complete sequence | 1513-1544 | 4 | 0.875 |
NZ_CP023715_3 | 3.1|1592783|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1592783-1592814 | 32 | MK448979 | Streptococcus phage Javan534, complete genome | 13215-13246 | 7 | 0.781 |
NZ_CP023715_3 | 3.1|1592783|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1592783-1592814 | 32 | MK448799 | Streptococcus phage Javan535, complete genome | 13215-13246 | 7 | 0.781 |
NZ_CP023715_3 | 3.1|1592783|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1592783-1592814 | 32 | MK448800 | Streptococcus phage Javan539, complete genome | 13216-13247 | 7 | 0.781 |
NZ_CP023715_3 | 3.1|1592783|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1592783-1592814 | 32 | MT939241 | Enterococcus phage 9183, complete genome | 69460-69491 | 7 | 0.781 |
NZ_CP023715_1 | 1.5|114051|32|NZ_CP023715|CRT | 114051-114082 | 32 | NZ_CP018236 | Rhizobium leguminosarum strain Vaf-108 plasmid unnamed8 | 109921-109952 | 8 | 0.75 |
NZ_CP023715_1 | 1.7|114059|31|NZ_CP023715|PILER-CR | 114059-114089 | 31 | NZ_CP018236 | Rhizobium leguminosarum strain Vaf-108 plasmid unnamed8 | 109921-109951 | 8 | 0.742 |
NZ_CP023715_2 | 2.4|1245563|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1245563-1245594 | 32 | NZ_LR214998 | Mycoplasma conjunctivae strain NCTC10147 plasmid 2 | 1218-1249 | 8 | 0.75 |
NZ_CP023715_2 | 2.4|1245563|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1245563-1245594 | 32 | NZ_CP049042 | Pseudohalocynthiibacter aestuariivivens strain RR4-35 plasmid pRR4-35_5, complete sequence | 37299-37330 | 8 | 0.75 |
NZ_CP023715_2 | 2.4|1245563|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1245563-1245594 | 32 | NZ_CP049700 | Bradyrhizobium sp. 1S5 strain 323S2 plasmid pB323S2a, complete sequence | 291573-291604 | 8 | 0.75 |
NZ_CP023715_3 | 3.2|1592843|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1592843-1592874 | 32 | CP053324 | Salmonella enterica subsp. salamae serovar 40:c:e,n,z15 strain 2013K-0524 plasmid unnamed, complete sequence | 9447-9478 | 8 | 0.75 |
NZ_CP023715_3 | 3.1|1592783|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1592783-1592814 | 32 | MT446411 | UNVERIFIED: Escherichia virus TH40, complete genome | 67504-67535 | 9 | 0.719 |
NZ_CP023715_3 | 3.1|1592783|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1592783-1592814 | 32 | MT446421 | UNVERIFIED: Escherichia virus TH55, complete genome | 80840-80871 | 9 | 0.719 |
NZ_CP023715_3 | 3.1|1592783|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1592783-1592814 | 32 | MT446396 | UNVERIFIED: Escherichia virus TH22, complete genome | 137515-137546 | 9 | 0.719 |
NZ_CP023715_2 | 2.4|1245563|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1245563-1245594 | 32 | MN692951 | Marine virus AFVG_117M12, complete genome | 19672-19703 | 10 | 0.688 |
NZ_CP023715_2 | 2.4|1245563|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1245563-1245594 | 32 | NC_019526 | Enterobacteria phage vB_KleM-RaK2, complete genome | 67317-67348 | 10 | 0.688 |
NZ_CP023715_2 | 2.4|1245563|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1245563-1245594 | 32 | MT708547 | Klebsiella phage Muenster, complete genome | 189204-189235 | 10 | 0.688 |
NZ_CP023715_2 | 2.4|1245563|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1245563-1245594 | 32 | AB897757 | Klebsiella phage K64-1 DNA, complete genome | 67288-67319 | 10 | 0.688 |
NZ_CP023715_3 | 3.1|1592783|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1592783-1592814 | 32 | NC_020292 | Clostridium saccharoperbutylacetonicum N1-4(HMT) plasmid Csp_135p, complete sequence | 13543-13574 | 10 | 0.688 |
NZ_CP023715_3 | 3.1|1592783|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1592783-1592814 | 32 | MN583270 | Pseudomonas aeruginosa strain NK546 plasmid pNK546b, complete sequence | 67159-67190 | 10 | 0.688 |
NZ_CP023715_3 | 3.1|1592783|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1592783-1592814 | 32 | MF360958 | Salicola phage SCTP-2, complete genome | 175798-175829 | 10 | 0.688 |
NZ_CP023715_3 | 3.1|1592783|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1592783-1592814 | 32 | JN231330 | UNVERIFIED: Uncultured phage contig03 MexF-like gene, complete sequence | 1910-1941 | 10 | 0.688 |
NZ_CP023715_2 | 2.4|1245563|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1245563-1245594 | 32 | KY549443 | Enterococcus phage EFP01, complete genome | 110490-110521 | 11 | 0.656 |
NZ_CP023715_3 | 3.1|1592783|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT | 1592783-1592814 | 32 | AP013403 | Uncultured Mediterranean phage uvMED DNA, complete genome, group G15, isolate: uvMED-CGR-U-MedDCM-OCT-S41-C7 | 29040-29071 | 11 | 0.656 |
1. spacer 3.2|1592843|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP021792 (Zymomonas mobilis subsp. mobilis strain NRRL B-1960 plasmid pZMO1960_1A, complete sequence) position: 1418-1449, mismatch: 0, identity: 1.0
ttgacgctgtgagcgtgacgatatgctttcac CRISPR spacer ttgacgctgtgagcgtgacgatatgctttcac Protospacer ********************************
2. spacer 3.2|1592843|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to NC_019198 (Zymomonas mobilis subsp. mobilis NCIMB 11163 plasmid pZMO1A, complete sequence) position: 1479-1510, mismatch: 0, identity: 1.0
ttgacgctgtgagcgtgacgatatgctttcac CRISPR spacer ttgacgctgtgagcgtgacgatatgctttcac Protospacer ********************************
3. spacer 3.2|1592843|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to NC_019210 (Zymomonas mobilis subsp. mobilis plasmid pZMO1B, complete sequence) position: 1479-1510, mismatch: 0, identity: 1.0
ttgacgctgtgagcgtgacgatatgctttcac CRISPR spacer ttgacgctgtgagcgtgacgatatgctttcac Protospacer ********************************
4. spacer 3.2|1592843|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to NC_011363 (Zymomonas mobilis subsp. mobilis ATCC 10988 plasmid pZMO1, complete sequence) position: 1482-1513, mismatch: 1, identity: 0.969
ttgacgctgtgagcgtgacgatatgctttcac CRISPR spacer ttgacgctgtgagcgtgacgatatgctttcgc Protospacer ******************************.*
5. spacer 3.6|1593085|33|NZ_CP023715|CRISPRCasFinder,CRT matches to NC_001845 (Zymomonas mobilis ATCC10988 plasmid pZMO1, complete sequence) position: 653-685, mismatch: 1, identity: 0.97
atatagaagatttatcagatacgttgagaataa CRISPR spacer atatagaagatttgtcagatacgttgagaataa Protospacer *************.*******************
6. spacer 3.6|1593085|33|NZ_CP023715|CRISPRCasFinder,CRT matches to NC_009716 (Escherichia sp. Sflu5 cryptic plasmid pAK51) position: 3617-3649, mismatch: 1, identity: 0.97
atatagaagatttatcagatacgttgagaataa CRISPR spacer atatagaagatttgtcagatacgttgagaataa Protospacer *************.*******************
7. spacer 3.6|1593085|33|NZ_CP023715|CRISPRCasFinder,CRT matches to NC_005701 (Zymomonas mobilis ATCC 10988 plasmid pZMO2, complete sequence) position: 8-40, mismatch: 1, identity: 0.97
atatagaagatttatcagatacgttgagaataa CRISPR spacer atatagaagatttgtcagatacgttgagaataa Protospacer *************.*******************
8. spacer 3.1|1592783|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to NC_013784 (Zymomonas mobilis subsp. mobilis ZM4 plasmid pZZM401, complete sequence) position: 12257-12288, mismatch: 2, identity: 0.938
gcgcatcttctgatgcttttttagctgcggcc CRISPR spacer gcgcttcttctgctgcttttttagctgcggcc Protospacer **** ******* *******************
9. spacer 3.1|1592783|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP003712 (Zymomonas mobilis subsp. mobilis NRRL B-12526 plasmid pZM1252603, complete sequence) position: 22552-22583, mismatch: 2, identity: 0.938
gcgcatcttctgatgcttttttagctgcggcc CRISPR spacer gcgcttcttctgctgcttttttagctgcggcc Protospacer **** ******* *******************
10. spacer 3.1|1592783|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP003718 (Zymomonas mobilis subsp. mobilis str. CP4 = NRRL B-14023 plasmid pZM1402303, complete sequence) position: 12257-12288, mismatch: 2, identity: 0.938
gcgcatcttctgatgcttttttagctgcggcc CRISPR spacer gcgcttcttctgctgcttttttagctgcggcc Protospacer **** ******* *******************
11. spacer 3.2|1592843|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to NC_019019 (Zymomonas mobilis plasmid pZMN1-1, complete sequence) position: 1475-1506, mismatch: 2, identity: 0.938
ttgacgctgtgagcgtgacgatatgctttcac CRISPR spacer ttgacgctgtgaccgtgacgatatcctttcac Protospacer ************ *********** *******
12. spacer 3.2|1592843|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to NC_009716 (Escherichia sp. Sflu5 cryptic plasmid pAK51) position: 4439-4470, mismatch: 4, identity: 0.875
ttgacgctgtgagcgtgacgatatgctttcac CRISPR spacer ttgacgctgtgagcgtgacgatatgcttatcg Protospacer **************************** .
13. spacer 3.2|1592843|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to NC_001845 (Zymomonas mobilis ATCC10988 plasmid pZMO1, complete sequence) position: 1513-1544, mismatch: 4, identity: 0.875
ttgacgctgtgagcgtgacgatatgctttcac CRISPR spacer ttgacgctgtgagcgtgacgatatgcttatcg Protospacer **************************** .
14. spacer 3.1|1592783|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to MK448979 (Streptococcus phage Javan534, complete genome) position: 13215-13246, mismatch: 7, identity: 0.781
gcgcatcttctgatgcttttttagctg--cggcc CRISPR spacer acccatcttctgatggttttttcgctgactgg-- Protospacer .* ************ ****** **** .**
15. spacer 3.1|1592783|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to MK448799 (Streptococcus phage Javan535, complete genome) position: 13215-13246, mismatch: 7, identity: 0.781
gcgcatcttctgatgcttttttagctg--cggcc CRISPR spacer acccatcttctgatggttttttcgctgactgg-- Protospacer .* ************ ****** **** .**
16. spacer 3.1|1592783|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to MK448800 (Streptococcus phage Javan539, complete genome) position: 13216-13247, mismatch: 7, identity: 0.781
gcgcatcttctgatgcttttttagctg--cggcc CRISPR spacer acccatcttctgatggttttttcgctgactgg-- Protospacer .* ************ ****** **** .**
17. spacer 3.1|1592783|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to MT939241 (Enterococcus phage 9183, complete genome) position: 69460-69491, mismatch: 7, identity: 0.781
gcgc-atcttctgatgcttttttagctgcggcc CRISPR spacer -cgctgtcttctgatgctttcttagccgcacca Protospacer *** .**************.*****.**. *
18. spacer 1.5|114051|32|NZ_CP023715|CRT matches to NZ_CP018236 (Rhizobium leguminosarum strain Vaf-108 plasmid unnamed8) position: 109921-109952, mismatch: 8, identity: 0.75
acagctaaaacccttttacctttactgtcggc CRISPR spacer tgacatcgaaccattttacctttcctgtcggc Protospacer * * .**** ********** ********
19. spacer 1.7|114059|31|NZ_CP023715|PILER-CR matches to NZ_CP018236 (Rhizobium leguminosarum strain Vaf-108 plasmid unnamed8) position: 109921-109951, mismatch: 8, identity: 0.742
acagctaaaacccttttacctttactgtcgg CRISPR spacer tgacatcgaaccattttacctttcctgtcgg Protospacer * * .**** ********** *******
20. spacer 2.4|1245563|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to NZ_LR214998 (Mycoplasma conjunctivae strain NCTC10147 plasmid 2) position: 1218-1249, mismatch: 8, identity: 0.75
ttctcaaaaaggaaaagaaattggaacagatt CRISPR spacer ccaacaaaaaggaaaataaaatggaacaaaat Protospacer .. ************ *** *******.* *
21. spacer 2.4|1245563|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP049042 (Pseudohalocynthiibacter aestuariivivens strain RR4-35 plasmid pRR4-35_5, complete sequence) position: 37299-37330, mismatch: 8, identity: 0.75
---ttctcaaaaaggaaaagaaattggaacagatt CRISPR spacer aggtcttc---aaggaaaagatattggaagagatg Protospacer *..** ********** ******* ****
22. spacer 2.4|1245563|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP049700 (Bradyrhizobium sp. 1S5 strain 323S2 plasmid pB323S2a, complete sequence) position: 291573-291604, mismatch: 8, identity: 0.75
ttctcaaaaaggaaaagaaattggaacagatt-- CRISPR spacer gactcaaacaggaaaagaaactgga--cgatcgg Protospacer ****** ***********.**** ***.
23. spacer 3.2|1592843|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to CP053324 (Salmonella enterica subsp. salamae serovar 40:c:e,n,z15 strain 2013K-0524 plasmid unnamed, complete sequence) position: 9447-9478, mismatch: 8, identity: 0.75
ttgacgctgtgagcgtgacgatatgctttcac CRISPR spacer gtcacgctgtgaacgtaacgatatgcgtaaat Protospacer * *********.***.********* * *.
24. spacer 3.1|1592783|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to MT446411 (UNVERIFIED: Escherichia virus TH40, complete genome) position: 67504-67535, mismatch: 9, identity: 0.719
gcgcatcttctgatgcttttttagctgcggcc CRISPR spacer tcgcatcttctgtttcttttttagacgccatg Protospacer *********** * ********* .** ..
25. spacer 3.1|1592783|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to MT446421 (UNVERIFIED: Escherichia virus TH55, complete genome) position: 80840-80871, mismatch: 9, identity: 0.719
gcgcatcttctgatgcttttttagctgcggcc CRISPR spacer tcgcatcttctgtttcttttttagacgccatg Protospacer *********** * ********* .** ..
26. spacer 3.1|1592783|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to MT446396 (UNVERIFIED: Escherichia virus TH22, complete genome) position: 137515-137546, mismatch: 9, identity: 0.719
gcgcatcttctgatgcttttttagctgcggcc CRISPR spacer tcgcatcttctgtttcttttttagacgccatg Protospacer *********** * ********* .** ..
27. spacer 2.4|1245563|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to MN692951 (Marine virus AFVG_117M12, complete genome) position: 19672-19703, mismatch: 10, identity: 0.688
ttctcaaaaaggaaaagaaattggaacagatt CRISPR spacer ggacgcaacaggaaaagaaattggaaaagaaa Protospacer . ** ***************** ***
28. spacer 2.4|1245563|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to NC_019526 (Enterobacteria phage vB_KleM-RaK2, complete genome) position: 67317-67348, mismatch: 10, identity: 0.688
ttctcaaaaaggaaaagaaattggaacagatt CRISPR spacer agtacaaaaaggacaagaaattggtactgcaa Protospacer . ********* ********** ** *
29. spacer 2.4|1245563|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to MT708547 (Klebsiella phage Muenster, complete genome) position: 189204-189235, mismatch: 10, identity: 0.688
ttctcaaaaaggaaaagaaattggaacagatt CRISPR spacer agtacaaaaaggacaagaaattggtactgcaa Protospacer . ********* ********** ** *
30. spacer 2.4|1245563|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to AB897757 (Klebsiella phage K64-1 DNA, complete genome) position: 67288-67319, mismatch: 10, identity: 0.688
ttctcaaaaaggaaaagaaattggaacagatt CRISPR spacer agtacaaaaaggacaagaaattggtactgcaa Protospacer . ********* ********** ** *
31. spacer 3.1|1592783|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to NC_020292 (Clostridium saccharoperbutylacetonicum N1-4(HMT) plasmid Csp_135p, complete sequence) position: 13543-13574, mismatch: 10, identity: 0.688
gcgcatcttctgatgcttttttagctgcggcc CRISPR spacer tattatcatctgatgcttctttagctgaattc Protospacer .*** **********.******** . .*
32. spacer 3.1|1592783|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to MN583270 (Pseudomonas aeruginosa strain NK546 plasmid pNK546b, complete sequence) position: 67159-67190, mismatch: 10, identity: 0.688
gcgcatcttctgatgcttttttagctgcggcc CRISPR spacer agtgaaggacttatgcttttttagctgaggcc Protospacer . * ** *************** ****
33. spacer 3.1|1592783|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to MF360958 (Salicola phage SCTP-2, complete genome) position: 175798-175829, mismatch: 10, identity: 0.688
gcgcatcttctgatgcttttttagctgcggcc CRISPR spacer catcatcttctggtgcttttttacctaagaaa Protospacer *********.********** **. *.
34. spacer 3.1|1592783|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to JN231330 (UNVERIFIED: Uncultured phage contig03 MexF-like gene, complete sequence) position: 1910-1941, mismatch: 10, identity: 0.688
gcgcatcttctgatgcttttttagctgcggcc CRISPR spacer ccccaacttcagatgcttttttagctaatcta Protospacer * ** **** ***************. .
35. spacer 2.4|1245563|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to KY549443 (Enterococcus phage EFP01, complete genome) position: , mismatch: 11, identity: 0.656
ttctcaaaaaggaaaagaaattggaacagatt CRISPR spacer cgtccaaaatggaaaagaaattgaaacgtgaa Protospacer . ..***** *************.***. .
36. spacer 3.1|1592783|32|NZ_CP023715|PILER-CR,CRISPRCasFinder,CRT matches to AP013403 (Uncultured Mediterranean phage uvMED DNA, complete genome, group G15, isolate: uvMED-CGR-U-MedDCM-OCT-S41-C7) position: , mismatch: 11, identity: 0.656
gcgcatcttctgatgcttttttagctgcggcc CRISPR spacer cggcaacttctgatgcttttgtagcaattaat Protospacer *** ************** **** .. . .
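The mismatch and identity values in the entries above look consistent with an ungapped, position-by-position comparison of the 32-nt spacer against the protospacer (for example, 10 mismatches gives 22/32, reported as 0.688). The sketch below reproduces that arithmetic for entry 35. The function name `spacer_identity` is hypothetical, and the assumption that the database scores matches this way is ours and is not stated in the report.

```python
from typing import Tuple


def spacer_identity(spacer: str, protospacer: str) -> Tuple[int, float]:
    """Count ungapped mismatches and return (mismatches, identity).

    Assumption: both sequences have the same length and are compared
    position by position, with no gaps allowed.
    """
    if len(spacer) != len(protospacer):
        raise ValueError("spacer and protospacer must have equal length")
    a, b = spacer.lower(), protospacer.lower()
    mismatches = sum(x != y for x, y in zip(a, b))
    identity = (len(a) - mismatches) / len(a)
    return mismatches, round(identity, 3)


# Entry 35: spacer 2.4 against Enterococcus phage EFP01 (KY549443)
spacer = "ttctcaaaaaggaaaagaaattggaacagatt"
protospacer = "cgtccaaaatggaaaagaaattgaaacgtgaa"
mismatches, identity = spacer_identity(spacer, protospacer)
print(mismatches, identity)
```

The same call can be applied to any other spacer/protospacer pair in the list to cross-check the reported mismatch count.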
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
383000 : 393484
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP023715|383000:393484|DBSCAN-SWA ATCAATGATGCACAGGACTGACACCATCTGACGCATGATGGTTCAAAGCTGTCGGCGGGAAAGCGGCCAGATCATCGGCTTCGCTCCAGTCGATAGGATCAAGCTTTTCAGTCAAAGCGAGTGCCAGAACCTCATCAACATGCGATACCGGAATAATCTTGAGACCCTTTTTAATGTTTTCCGGTAATTCAATCAGGTCTTTCTGATTTTCTTCCGGAATCAACACCGTTTTGATCCCGCCTCTTAGGGCGGCAAGAAGTTTCTCTTTCAATCCACCAATCGGCAGGACGCGACCTCTTAAGGTGACTTCACCGGTCATAGCGACATCGCGGCGCACTGGGACACCTGTCAGAACCGACGTAATCGCTGTCACAATACCGATACCCGCTGAAGGTCCATCTTTTGGAACAGCACCTTCCGGCAAATGGATATGGACATCTTTCCGAGTAAAGATACTGGGTTTAATGCCATAGAAGGGTGCACGTGATTTGACGAAGGACAACGCCGCTTGAATGGACTCCTTCATCACATCGCCAAGTTTACCCGTCGTGCGGATATTGCCTCTACCTGGGACAGTGACACTTTCGATCGTCAGCAATTCGCCGCCGACCTCTGTCCATGCCAGACCCGTGACGACACCAACCTGATCTTCTTTTTCGGAAATACCAAATTTATACCGCCGGACACCCGAAAAATCGCTCAGATTTTTAGCCGTGATCGTGATCGTTTTGGCTTTCTTTTCAAGAATACGCCGCAAGGCTTTACGGGCAATTTTAGCCAGTTCACGTTCCAGCGCACGAACACCCGATTCACGGGTATAATAGCGGATAAGATCACGGATAGCGTCTGTCTCAACCGTAAATTCGCCTTTTTTCAAACCATGGGATTTGATTTGGCGCGGTAAAAGATGGCTGGTCGCAATCTCGATTTTTTCGTCTTCGGTATAACCTTCAAGACGGATGATTTCCATGCGATCAATTAGAGGGGCGGGCAAATTCAAAGAATTGGCCGTTGCCACAAACATGACATCCGAAAGATCAATATCGGTTTCAAGATAATGATCCTGAAATTTATTATTCTGTTCGGGGTCTAAAACCTCAAGCAAGGCGGATGCCGGATCACCACGGGAATCCTGTCCCAATTTGTCAATTTCATCGAGCAGAAAGAGCGGATTAAAGCTTTCTGCTTTTTTGATATTGGTGACGATTTTACCCGGAAGCGAGCCAATATAAGTCCGACGATGGCCGCGTATTTCGGCTTCATCACGAACGCCGCCGAGCGATTGTCTGACAAAGTTACGACCCGTGGCCTTGGCAATAGAGCGTGCCAATGAGGTTTTACCCACACCCGGAGGCCCAACGAGGCAAAGGATAGGCCCTTTCAACTTGTTCGTACGAGCCTGAACCGCGATATATTCGACAATTCGTTCTTTAACCTTATCAAGGCCGAAATGTTCTTCACCAAGAATTTTTTCAGCCGCGATGATATCCTTTTTAACCCGTGTTTTCTTGCCCCATGGCATATTCAACAAGGTTTCAAGATAAGAACGCACCACCGTTGCTTCGGCAGACATAGCGCCCATCGTCCGCAATTTTTTTAATTCCGAAGTGGCCTTGGCCTTCACCTCTTTCGGAAGTTTCGGGTTATTCAGCTTTTGCGTGAATTCAGAGAGTTCGTCGCCGTCCTCTTCTTCTTCGCCATTCCCGCTGCTTTCGCCTGTGGAAAGTTCTCTCTGAATAGCTTTGAGCTGTTCATTGAGGTAATATTCCCGCTGGTTCTTTTCCATCTGGCGTTTGACACGGCTTCTGATCTTGCGTTCAACCTGTAAAACGCCAAGTTCGCCTTCCATCAGGCCAAACGTCATTTCCAGTCTTTTAAAGGGGTTCAGCTCTTCAAGAAGCGGCTGTTTGTCGGCTACTTTGACA
GCCAAATTAACAGCGATCGCATCAGCCAGTCGAGATGGGGCATCAATCTCTTTGATTTCATGGGCGATATCGCCCGGTAATTTGCGATTGAGTTTAGCGTAATGTTCGAACTGATCCTTGACCGAGCGCATCAGAGCCTCGACCTCGTCATTTTCGACGGAGGTATCCTCAAGGGGTTCTACTTCGGCGATGAGATGGCCACTGGAATCGTCCATGTCGCTTATTTTGGCGCGTTTCCGGCCTTCTACCAACACCCGCACGGTGCCGTCGGGTAATTTAAGAAGTTGTAACACATTCGCGACCACGCCAATGTCATAAAGCGCTTCACGATTAGGGTCTTCTTCAGCCGGGTCGCGTTGGGAAACGAGAAAAATAGTCTTCTCTGCTGCCATCACGCTTTCAAGCGCTGCTACCGATTTTTCACGACCGACAAAAAGCGGCGCGATCATATGAGGAAATACGACGATGTCGCGCAGCGGTAAAACGGGAAGGGTTTCTTTCATCACAACTCCACTGGGGCGGCAAAATGCTCACACCTTTTCATAGATATGGTGTCGATCTTTCTGGGTTCAATCACGGGGGTCTATCAGGAAAATAACTTATTTGGGAAAAGAATTTAAAATAACATAGAATAAAACTTTGTGCATGAGTAACAATAAAAATTTTTACTGATGATAGAAATATCGTGATTGGATAAGATGAAACAATCTTTGTGCCAACAGAATATTGAACAGAAGCTGGATCAATGGGCCGTCGAAAATCACTTGATACAACAAGGGCTGCATCAATTTGGTTATCCACTTTATCAAGCCGTCGAAAGAGGCGTTTCCAGTCTGGCTCGGGTGATTGTCGGGCAACAGCTTCATACCAAAGTCGCGGATGGGATATGGCAAAAACTAGTCTGTTCTATCGGAGACATTACGGCTGATCGTCTTTTGTCTGTTGACGAGGCGATTTTGCGGCAATGTGGATTGTCACCGTCTAAGATTGCTTATTTGAAAGATTTGGCGATGCGCTCTGTCTCGGGGTTGGATTTGTTCGCTTTGCCCGAAGGAGATGATGATGCGGTCGATCTTTTGATGTCCGTGCATGGGATTGGTCGTTGGACGGCTGAAAATTATCTGATCTTTGCCGAAGGGCGTTTGGATATTTGGCCTGCGGCTGATTTAGGGATAAGAATTGCCACCGGATATCTTTATCAATTATCCTATCGGCCTGATATGAAAGAAACCCGTGGGCTTGGGGATATATTCCGTCCTTATAGAAGTATAATGGCTTTGTTTTTATGGCATCAATATCGAAATAAAAATTTCTGCTGATTTTATTTTAACTTCAATTTTTTCTAATCTCTTTATTTCTGCCCACTCTTTGTTGTTGTAAATTCTGTTTTAATAGAATTTGCATGACTGCTTCTTCTACTTTTAAAATTGAATATATCATACAGGCGCGCAACCTTGCCAAATTAGGGGCAACCAATCGCGATATTGCCGATTTTTTTGGTGTGAGCGAACGCACGATCAATCGTTGGGTTTTGAAATATCCCGATTTTGCCAATGCTGTCAGAATTGGGAAAGCGGCGGCGGACAATCGGGTTGAGGCTTCGCTCTATCAAAGAGCCGTAGGCTATAGTTACGATTGCCAAAAAGTGCTGATGGTTTCCGGTAAGCCTGAAGTCGTGGAATTTGTTGAGCATCTTCCGCCCGATATTACAGCCTGTAGTTTTTGGTTACGGAACCGTCGTCCTAAACAATGGCGGGAAAAACAAACGGCAGAAATAACCGGCCAGAATGGTGCGCCTATTTTGGCGCGTATAGAACGCGTCATTGTCGGCGAAGAGCATAACGCATCTGGTCGGCCTATATCCTCTTCGGAATGGGATAACGATCCTGTTGATACGGAAAAGATGTGATGTCTTTTCCGGCTTTAAGAATTCCGACCGCCAAAGTCTTTAGACCTTTATTAAAACCGGCACGCTATAAAGGCGCTTATGGTGGAAGAGG
TTCCGGTAAATCCCATTTTTTCGCGGAAATGTTGGTTGAAGATTGTCTTCGCCTTCCGGGATTGCGGGCTGTCTGTATTCGAGAAGTTCAAAAATCCCTGAAGGATTCAGCGAAAAGGCTGATTGAGGCGAAATTATCGGCATATCATTTGGGACGCAATGTCGGTTTTCGGGTTTTTCGTGACCACATCCAAACCCCCGGCAATGGCGTTATCATCTTTCAAGGGATGCAAGACCATACCGCCGAAAGTATCAAGTCTTTGGAAGGCTTTTCACGAGCATGGGTCGAAGAAGCGCAGACCCTATCTCAACGATCTTTGGAGCTTTTAAGGCCAACCATCAGAACGATTGATTCAGAATTATGGTTTTCGTGGAATCCTCGTTTCAAAACTGATCCGGTTGACCGGATGTTGCGCGGCGAAACGCCACCGACCGGTGCGGTTGTGGTTCAAGCTAATTGGGAAAATAATCCGTGGTTTCCCTCGCCGTTGGATCAAGAAAGACGGGATTGCCTGACGAATGATCCTGATAAATACCGCCATATATGGGAAGGGGGATATGCGGAGATAACCGAGGGGGCTTATTATGCCCAAGCCTTGGCCAAAGCCCGATCTGAAAAACGGATAGCTGTCGTTGCCGCTGACCCGTTAATGACCTTGCGGGCGGTTTGGGATATCGGCGGCACGGGTGCCAAAGCTGATGCTACCGCGATATGGATTGTCCAATATGTCGGACGGGAAATCCGTTTCCTCGATCATTATGAAGCGCAAGGGCAGCCCTTATCTGCCCATCTTCATTGGTTGCGTTCGCATGATTATGGCGGGGCTTTATGTATCTTACCTCATGATGGCGCGCAGCATGATAAAATCGCCTCAACAACCTATGAAGGCGCGTTGCGTGAAGCCGGATTTTCGGTGCGGGTTATTCCTAATCAGGGAGCCGGAGCCGCAATGCAGCGGATTGAAGCAGCTCGTCGGTTATTTCCCCAGATGTGGTTTGATGAAAATCACTGCCGTGGCGGTTTGGAAGCCCTTGGATGGTATCACGAAAAACGGGACGAAATACGAGGGATAGGGTTAGGACCCGATCATGATTGGTCAAGTCATTCCGCGGATGCTTTCGGTTTGGCGGCTATTGCTTGGGAACCGCCCGTAACGAGCCGGAAAATCACCTATAGCAATAAAGGAATATTTTAATGAAACGGGCTGAAAAGAGCTTTCTGCCCCGAAGGGGGCGCTATGTCTTTGGATATTGATAGCAATATAGATTTGAAAGAAAAACAGACGGATTTCTTGCTTTCTAACCATGAGTTATTAGCGGTCTTGCACGAAAAAGCCGATCAAGCAGAAAGCTGGCATAATAGTTTACTGGCTGAAGATCAGGCCAATGCTATCGATTTTTATGAAGCGCGACCCTTTGGGGATGAAGAGGACGGTCGGAGTCAGGTCGTTAGTCCCGATGTCGCCGAGGTAGTCGATTATATGACGATCAGCCTGTTACGGACGATTGTGTCAGGCGATCGCGTGATTGAATTTGAGCCGATAGCCGCCGAGCAAGCGCAAGATGCCGATGATGCGACCGAGGCGGTCAGCTATGCTTTTATGCATGGGCAGGACGGCTATAAAATCCTGCATGACTGGATCCAGTCGGGATTGATTGAAAAAATAGGCATTATCAAAACCGCAGTCCTATCGGAAAGACGTGCTACAATCCGCCATATTACTGTCGATGATGATGCCTTGGCGGCTTTGTTGATGGAGGCCGAGGATAATCCCGATATTCAGATTACCTTGAATAATGACGATGGTAGCGGCCAATATGAGGTAACCGTTACCCGTTATCAGCTTCAAAAACGCTATGTCGATATGCCGATTCCGTCCGAAGAATATCGCGTTTCGGCCAGAACTCGCCATGAAGATGATGCTGATTATCAGGCGCATGTCAGTTATAAAACGCTGTCTGATCTTATTTCGATGGGGTTCGATCGCGATATTGTCGAGA
GCCTGCCAAGTGATAAGAGTTTTCCCAATAGCGATGGCCGTTCTGATGCCAGATGGCGGGATGAATCCTTTCTGTCCGGCAGTAGCGATCAAGCCAATCGCGAAGTTCTCTTATATGAAGAATATGTCCGCATCGATCGGGATGGCGATGGCATTGCTGAATTATTGCAGATTTTTCGGGTAAAAGATGTCTTACTTTCGATTGAAGAAGTAGACGAAGCGCCCTTTGTCGTTTGGACACCTTTCCCGCGCGCCCATCGGATGATTGGTAATTCTTTGGCCGAGAAGGTTATGGATATCCAGCGGGTTAAGTCAGTGCTGATGCGTCAGGCTTTGGATGGGGTTTACCAGACCAATGCGCCCCGTATGGCGGTAAATGTCGATGGTTTAACCGAAGATACTTTTGACGATTTATTGACAATTAGACCCGGGGCGATTGTCCGTTATCGGGGTGGCATTCCACCAACGCCGTTAAATGCCGGTTTCGATATCCAAAAATCTTTGGGTATGATCGAATATATGCAGTCGGCTCAGGAAAGCCGGACGGGGATTACCCGTCTTAATCAGGGATTGGATGCTGACAGTCTTAATAAAACCGCGACCGGTCAGGCCTTGCTTCAGGCGCAAGGGCAGCAAATGGAAGAATATGTTGCCCGCAATTTTGCACAAAGTCTTGGGCGGTTATTCCAAAAGAAATTATGGCTGATGATTGCATCGGGCGATCCGATGGCGATCAAGGTTGAAGGTCTGTATAAAACGGTTGATCCGGCTTTGTGGCCGCCGGATATGCGCGTGCGTGTCACGGTCGGATTGGGATCGGGGCGAAAAGATCAGCGTTTGGCCTATCGTCAGCAGTTGTTATCGATTCAGCAACAGGCGTTGGCCGTTGGTTTAACCGGTTCCAAGCAGATTTATAATAATATCGCCGCAATGATCCGAGATTGTGGTTTGGGTAATCCGACTGATTATTTGATTGATCCTGATATTCGCTTGGCAGGTAATCAGGCTGAAAATCCTGTGAATAATAATTCGGCTGCGGCGCAAAATTCTTCTGGCAGTGTAGGAAATAATCCCGATTATACAGAGTTGAAAGCCCGACAAGATATCAATCTTCAAGGGCAGAAAATGGCTGCTGATCAGGAACGGAGTATGGCCGAATTTGCTTTGAAAAAGCAGGAAACCGAGGCCAAGCTGGCGATGCAACAGGAAGAGCATAAACAGCGTTTGGCCTTGGTGCGCGAAAAAGCCGAAGAAGAGGCCATTTTAGCAAGGCAACGTTCCGATTTTGAAGCCTCGCTTGCCAAGGAAACAGCCGATCGTAATTACCAGATCGCTTTGAAATTGGCAGAAGCTGGGAAAAATATTCCAGCGGATAAAAAGGGGGATAGGGTGCCGCAAAACAAAGCAGGGGGCGCATTGGATAAATAATGGAGCAAGATCCCATTTTGCGGGCAAGGCGTTGGAAAGCTTTTTATGAAGAAAAGGGCGGGTTGAAAGCTATTTTGCAAGAAATCGGCACCCGTTATATCCAGCGTATGTCCGAGATTGCCCCATGGGAAGCCGAAGCTGACCGAAAATTATTACGTTTGGCGATGGCTAATCGAATTGTGGGGCAAATTGATAACTTGATTCAAGTGATTATTGCAGATGGGCAATTGGCTGATCAGGCCAAAGAACATGCCCGAAAAATAGAGAATTTACCCGAGCGAAAACGTCGCTGGCTGTAGGCCGAGCTATCAAGGTCTTCATTCTATTTATCGAGATAAAACCGGCTGGGTGGCTGGAAAGAGAGTCACCGGAGTGATCTTCTGCGCCCATTTTACCGGTTGGACATTCAGGAATATTGATAATGACAAATATAAAAGGCGCAGCCTTGAGGCGACCTCGTCGGTCTATTCAGGAAGCTGCCGAAGCATTGCAAGCCAATTTGGCACCGGATCAGGATGATCCGATGGCTGGAAATAATGCGGATATGCCGCCATCCGATCAGGCATCCGACGA
TCAGAGCAATATGGGCGCAACGGCAGCAGGAAATGGTGCCAATCAAGGAAATGGTCAGCCGGATATGAGTGCGGCTTGGCAAGAACATTGTCGGGCCTTGCATTACCATTATGCCGACCAGCTTGCTAAATATGCCGAAGCGATTACCCCGAAAAAACCTGATCCGCAGTTATTGGTCAGTGATCCGGCAAGCTATGCCGCACAATTGGCAAGTTATGAAGACTTAACGGCCAAGCGTGACCAAATTGTTCAGGAAGTTATCCAGATTTCTCGCCAGAATGAAATGGCTGAACTGGCTGCCAGAAGGGCATGGGCGCAGGGCGAACATCAGCGTTTGATATCGCTTTTGCCTGAATGGGGCGATGATAATCAGAGACCGGCTATCTTGGCGGCTTTTGAAGAAACGGGACGGCATCTCGGTTATCCAGATCATGTTTTGGCCGAGGCTGATTCCAACGACATTATGGCTTTGAAAAAAGCCCATGAATGGCGAAGAAAATCCGAAAAATGGGATGCCCTACAACAGGGTAAGGCGGCGGCTATCAAATCAGCCAAAACATCGAGAAAAACCGCAGTTCCAGGAACGTCCCAGCCTTATGGCGCGGCCAAAAGCCGGAAACTAAATGAATCATTGGGGCAGCTTCGCGAAACTGGCGATGTCCGGTCAGCCGCTGCGGCGATGAACGCCCTTTTTAAATAATCTACTTTTCAAGGAATTTTGAATTATGTCTGTTGCCTCTAATACCGTCCAAACCTATTCGCGTGTCGGTATTCGTGAAGATCTTTCCGATATTATTTATAATATCAGCCCGACTGGAGTCTTGGTCCGTTAATGTAGTTTATACGCTGCATTAAATGAAAAGGCGGTGAATTCGGGGGACGTCCAACCGTCATAGACGAGGATAATCCCGAGCCAAGCTTGAGGTGAAAATCTCTTGAAGGTGTAACGACTAACAGCCGAGCCTACCGACAAAAAACGATAAAACTGATGCCGTTATTTCTGAAAAAGAAGCGGGTATGATTCTTTGCATTTTTACTGAAAATGCCGGAAATTTCGGAAGAAATATTTCAGTCTTTTTTGTCCATGGCAGTAATGCTGACACGAGTGCCGCCCGCGAAAGCGATGATATAGTCTGAGCCTTGCAGGAATGTAAGGAAGCAAGGGATAAAGAGCCTTTGCGGTAACAATCTGGAAACACCTTTTGTAACAGCTATCGGTCAAACGACAGCCAAAAATACCTATACCGAATGGCAGACCGACAATCTTGCCAGCGCTAATGCCCAGAATAAACAGGTCGAAGGAGCTGATCTTGCCAATGAAAGCCGCCAGCCAACGGTTCGGGTCGGCAATTATACCCAGATCATGACCAAAGTTGTCGGGACATCGACGACCGATCGGGCGGTGCATAATGCCGGACGCGGTGATGAACATGCCTATCAGTTGGCACGTGCTGGTCAGGAATTGAAACGCGATATTGAGGCGCGCTTTACTGGTAATTTTGCAGCCATTCCCGGAGATGGGGCAGTCGTCGCGCGTGAAACAGCAGGAGCTTTGGCATGGCTGCGCAGTAATGCCCATCGTGGCGACGGCGGTGCCAATCCGGTGATGTCCGGTGGCGACAATGGCAGCGGTTATCCGACAACAGCGGCGACCACAGGTAAGGCGCGGCTTTATACCGAGGCTTTGTTAAAAGAAGTTTTGGGCGATATCTGGGTCAGTGGTGGTCAGCCAAATATGGTGATTACCTCTTTGAAACTGAAACAGACAGCGGCAGCCTTTCCGGGGTTGGCTTCCAACCGGCGCGATACGGGCGACCAGAAAGCGCGTATTATTGCCGGTGCGGATATCTATGTTTCCGATGTCGGCGAAGTCCAGTTTGTACCTGATCGTTTTTGCGACAATAGCAGCGCTTTGGTCATTGACCCTGAATATTGGTCGGTTGCGACCTTAGATCCGATTCAGAAACGGTCTTTGGCAACAACGGGGCTGGCTGATCGTG
ATGCCCTTTATACTGAAATTGCCTTGCGCTGCCATAATGAAGCAGCCTCCGGTGTTTTGGCAGATTTAAGCGCTGCCTGATTTTCGTGATCGGGAGGATGTCTTCTTGCAAGACATCTTCCCGTAAAAATTCGGAAAGGGAATATTCCATGGCTTTTTCTTCTGCCTTGGTCGCCTTTTTCAAAAAAAAGGTCGGGCTGGGCTTGAAATGCGGCTTTGATATTCGAGGGCATGATGGCTGCTGAACGCTTGTTATCCTATAATCCGGTGACGGGGCTTAAAAGTTGGTTTTCCTCTTCAGAGGAGAATCAGGGCAGTTGGCATATCCGCTATGAACAGGATTGTTCTGCGGCTTTGGAAGCCAATAAACAGGCACAAAATGAAGATTTTGATCGGCGTTCTTCGATGTGGCACGCCGCCCATGTTCCAGCCGTCGTTTTGATGGAATGGCTGGTGAAATATGGTGTCCGATATTGGGACAAAAATCACGCGCCTGCCGTTCGCCGTTTACTCAATCATCCTGATTATCGCTATTTGCGCGTTAATCACTTTATTATGTGA
Protein sequences of DBSCAN-SWA_1 >NZ_CP023715|383000:393484|392089_393004_+|WP_011240305.1|DBSCAN-SWA MKSLCGNNLETPFVTAIGQTTAKNTYTEWQTDNLASANAQNKQVEGADLANESRQPTVRVGNYTQIMTKVVGTSTTDRAVHNAGRGDEHAYQLARAGQELKRDIEARFTGNFAAIPGDGAVVARETAGALAWLRSNAHRGDGGANPVMSGGDNGSGYPTTAATTGKARLYTEALLKEVLGDIWVSGGQPNMVITSLKLKQTAAAFPGLASNRRDTGDQKARIIAGADIYVSDVGEVQFVPDRFCDNSSALVIDPEYWSVATLDPIQKRSLATTGLADRDALYTEIALRCHNEAASGVLADLSAA >NZ_CP023715|383000:393484|390772_391624_+|WP_011240302.1|DBSCAN-SWA MTNIKGAALRRPRRSIQEAAEALQANLAPDQDDPMAGNNADMPPSDQASDDQSNMGATAAGNGANQGNGQPDMSAAWQEHCRALHYHYADQLAKYAEAITPKKPDPQLLVSDPASYAAQLASYEDLTAKRDQIVQEVIQISRQNEMAELAARRAWAQGEHQRLISLLPEWGDDNQRPAILAAFEETGRHLGYPDHVLAEADSNDIMALKKAHEWRRKSEKWDALQQGKAAAIKSAKTSRKTAVPGTSQPYGAAKSRKLNESLGQLRETGDVRSAAAAMNALFK >NZ_CP023715|383000:393484|385622_386243_+|WP_011240297.1|DBSCAN-SWA MKQSLCQQNIEQKLDQWAVENHLIQQGLHQFGYPLYQAVERGVSSLARVIVGQQLHTKVADGIWQKLVCSIGDITADRLLSVDEAILRQCGLSPSKIAYLKDLAMRSVSGLDLFALPEGDDDAVDLLMSVHGIGRWTAENYLIFAEGRLDIWPAADLGIRIATGYLYQLSYRPDMKETRGLGDIFRPYRSIMALFLWHQYRNKNFC >NZ_CP023715|383000:393484|386832_388119_+|WP_011240299.1|terminase|DBSCAN-SWA MSFPALRIPTAKVFRPLLKPARYKGAYGGRGSGKSHFFAEMLVEDCLRLPGLRAVCIREVQKSLKDSAKRLIEAKLSAYHLGRNVGFRVFRDHIQTPGNGVIIFQGMQDHTAESIKSLEGFSRAWVEEAQTLSQRSLELLRPTIRTIDSELWFSWNPRFKTDPVDRMLRGETPPTGAVVVQANWENNPWFPSPLDQERRDCLTNDPDKYRHIWEGGYAEITEGAYYAQALAKARSEKRIAVVAADPLMTLRAVWDIGGTGAKADATAIWIVQYVGREIRFLDHYEAQGQPLSAHLHWLRSHDYGGALCILPHDGAQHDKIASTTYEGALREAGFSVRVIPNQGAGAAMQRIEAARRLFPQMWFDENHCRGGLEALGWYHEKRDEIRGIGLGPDHDWSSHSADAFGLAAIAWEPPVTSRKITYSNKGIF >NZ_CP023715|383000:393484|388161_390351_+|WP_011240300.1|DBSCAN-SWA 
MSLDIDSNIDLKEKQTDFLLSNHELLAVLHEKADQAESWHNSLLAEDQANAIDFYEARPFGDEEDGRSQVVSPDVAEVVDYMTISLLRTIVSGDRVIEFEPIAAEQAQDADDATEAVSYAFMHGQDGYKILHDWIQSGLIEKIGIIKTAVLSERRATIRHITVDDDALAALLMEAEDNPDIQITLNNDDGSGQYEVTVTRYQLQKRYVDMPIPSEEYRVSARTRHEDDADYQAHVSYKTLSDLISMGFDRDIVESLPSDKSFPNSDGRSDARWRDESFLSGSSDQANREVLLYEEYVRIDRDGDGIAELLQIFRVKDVLLSIEEVDEAPFVVWTPFPRAHRMIGNSLAEKVMDIQRVKSVLMRQALDGVYQTNAPRMAVNVDGLTEDTFDDLLTIRPGAIVRYRGGIPPTPLNAGFDIQKSLGMIEYMQSAQESRTGITRLNQGLDADSLNKTATGQALLQAQGQQMEEYVARNFAQSLGRLFQKKLWLMIASGDPMAIKVEGLYKTVDPALWPPDMRVRVTVGLGSGRKDQRLAYRQQLLSIQQQALAVGLTGSKQIYNNIAAMIRDCGLGNPTDYLIDPDIRLAGNQAENPVNNNSAAAQNSSGSVGNNPDYTELKARQDINLQGQKMAADQERSMAEFALKKQETEAKLAMQQEEHKQRLALVREKAEEEAILARQRSDFEASLAKETADRNYQIALKLAEAGKNIPADKKGDRVPQNKAGGALDK >NZ_CP023715|383000:393484|386326_386833_+|WP_011240298.1|DBSCAN-SWA MTASSTFKIEYIIQARNLAKLGATNRDIADFFGVSERTINRWVLKYPDFANAVRIGKAAADNRVEASLYQRAVGYSYDCQKVLMVSGKPEVVEFVEHLPPDITACSFWLRNRRPKQWREKQTAEITGQNGAPILARIERVIVGEEHNASGRPISSSEWDNDPVDTEKM >NZ_CP023715|383000:393484|390350_390650_+|WP_011240301.1|DBSCAN-SWA MEQDPILRARRWKAFYEEKGGLKAILQEIGTRYIQRMSEIAPWEAEADRKLLRLAMANRIVGQIDNLIQVIIADGQLADQAKEHARKIENLPERKRRWL >NZ_CP023715|383000:393484|383000_385427_-|WP_011240296.1|DBSCAN-SWA 
MKETLPVLPLRDIVVFPHMIAPLFVGREKSVAALESVMAAEKTIFLVSQRDPAEEDPNREALYDIGVVANVLQLLKLPDGTVRVLVEGRKRAKISDMDDSSGHLIAEVEPLEDTSVENDEVEALMRSVKDQFEHYAKLNRKLPGDIAHEIKEIDAPSRLADAIAVNLAVKVADKQPLLEELNPFKRLEMTFGLMEGELGVLQVERKIRSRVKRQMEKNQREYYLNEQLKAIQRELSTGESSGNGEEEEDGDELSEFTQKLNNPKLPKEVKAKATSELKKLRTMGAMSAEATVVRSYLETLLNMPWGKKTRVKKDIIAAEKILGEEHFGLDKVKERIVEYIAVQARTNKLKGPILCLVGPPGVGKTSLARSIAKATGRNFVRQSLGGVRDEAEIRGHRRTYIGSLPGKIVTNIKKAESFNPLFLLDEIDKLGQDSRGDPASALLEVLDPEQNNKFQDHYLETDIDLSDVMFVATANSLNLPAPLIDRMEIIRLEGYTEDEKIEIATSHLLPRQIKSHGLKKGEFTVETDAIRDLIRYYTRESGVRALERELAKIARKALRRILEKKAKTITITAKNLSDFSGVRRYKFGISEKEDQVGVVTGLAWTEVGGELLTIESVTVPGRGNIRTTGKLGDVMKESIQAALSFVKSRAPFYGIKPSIFTRKDVHIHLPEGAVPKDGPSAGIGIVTAITSVLTGVPVRRDVAMTGEVTLRGRVLPIGGLKEKLLAALRGGIKTVLIPEENQKDLIELPENIKKGLKIIPVSHVDEVLALALTEKLDPIDWSEADDLAAFPPTALNHHASDGVSPVHH >NZ_CP023715|383000:393484|393157_393484_+|WP_011240306.1|DBSCAN-SWA MAAERLLSYNPVTGLKSWFSSSEENQGSWHIRYEQDCSAALEANKQAQNEDFDRRSSMWHAAHVPAVVLMEWLVKYGVRYWDKNHAPAVRRLLNHPDYRYLRVNHFIM >NZ_CP023715|383000:393484|391649_391757_+|WP_011240303.1|DBSCAN-SWA MSVASNTVQTYSRVGIREDLSDIIYNISPTGVLVR |
10 | Sinorhizobium_phage(33.33%) | terminase | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those that match specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_2 |
1209063 : 1222271
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP023715|1209063:1222271|DBSCAN-SWA GATGAAAAGACAACCGCAGACGAATAAAGGCAATATCAATCTTCTTTCTTATAAGAAGAAACGGTATTTTAAAACAGCCTGTAGTTTGGCGATGTCTTTTTTGTTATTCCCTCCTACACCCTCCGCGTTTGCAGCAACAGTAACAAATAAAAATCCTGTTATTGAACCTGGTTTTCTTTCTGATGATGATATTGCATTAGAAAATAGCGAAACCGATTTTTCGCAAGTTGTTGTGTCCCGTGTGGTACCAATAGATAATGTAAAAGAAGTTTTATCTCCACGAGAGATACAGGGATATAAGGCTTGCCTTTCGGCCATACGCCATGGCAGTTGGCAAGCTGCACAGCAATGGATATCTTCGAATCCAGACGGCCTACTAACGGATTTTGTAACAGCTGAACTTTATTTGGCAAAAAATTCACCTAAAGTCAGCCCAGATGATATTATCAATTTAATTCAGAAGTCACCTTACTTACCTCAGGGTGAGGCTTTGGCACGTCTTGCTTACAACCACGGTGTAAAAACACTTCCAGCACTGCCAGAAAAAAATAATTTATTCTTTTTATCAGGCGCGCCACAAAGGCCTGTTAATCACCACAATGCTACAAACCGTTCAGCTATGGTTTTTGAAACCAAAGCACTGCCTTTAATAAAAGCCGATGCTTCTGATCAGGCTGAAAATTTATTTCAAGATTCTCAACAAAATTTATCTGATGATATCAAAACCGAATGGGCTCAACGTATTGCATGGAGTTACTATTTAAATGGTAATGATGATGCCGCGCGCCGCTTAGCTGTTCAAGCCGAAAATGGCACTGGGCGCTGGACAACCGATGCAAGCTGGGTTGAAGGCTTAGCCGATTGGCAAAAAGAAGATTTCTCAGCGGCTATGCAGGCTTTTTCTCGTGTTGCATCCTATAGCGAAAATAAAGAAATGAAGGCTGCAGGTCTATTCTGGGAAGCCAGAGCCGCGATGGCATCAGGGCACCCTGAATATACTCAAAATTTATTACGTTCCGCTAGCCATATGCCAGAAACTTTTTATGGTCTCTTGGCAGAAAAAGCCCTAGGAAAAACTGTTAAGTCCTCTGTTCGACCTGTCGCCTTATCTGAAGACGAGCTGAAATGTTTGACAACACAGCCTAATATCAGAATAGCCATTGCGTTGAACCAAATTGGCGAATATCAGCTTGCCAGTGAAACTTTAAAACACCAAGCACGACTAGGTGCCCCTAAAGACCATAACAATTTAATTCATTTAGCATCTTATCTTCGTTTACCAGAGGCCCAATTATGGTTAGCCCATCATGGTCCCGCGGGTTTTTCTGCAGATGTTGATGCACGTTATCCTTCTCCAAATTGGGCTCCGGCAAGTGGTTGGCATGTAGATCCCTATCTTGTTTATGCACATGCCTTACAAGAATCCCAGTTCCGTAGCCGAGCCGTTAGCAACAAAGGTGCGCGAGGCGTTATGCAAATTACACCTGCAACAGCTCGTTTGGTCGCTAAACAACATGCTACAAATATTGATATAGAAGATCTTGATAAACCAACTATCAGCTTTGAATATGGTCAAGCCTATATCGAATGGCTACGGGATACGAGCTACACAGGCGGGTTGTTGCCCAAAGTCATTGCAGCCTATAACGCGGGTCCAGCTGCCTTGCCCCGCTGGAAAGACCGTGATCATGGCGACCCACTGCTTTTTATTGAATCTATACCATATGCAGAAACCCGCGCCTATGTTGCGACTGTATTGCGTAATTATTGGATATATCAGCAAAAAGCATCTTCATCTTCTCAAAGCTTGAATGCGATGGCGCAAGGGATGTGGCCTAAATTTCCAGGAATGCCCGGCGATAATGCTGTCCGGTTAAATAATAAGGCAAAAGAAGTTAATAAATTACTTGCATTAAATAATAT
AAATAAACAAAATATGTAAATATTGTATGGTTATTTTTGACATGAAGAACACAAAAAAGTTGAACGACCGCCTAGGGTGTAACGCTCGATCGTATGACCACATTCACATTGTTCACCCTCTTTGCCGTATACTTTGAATTTAGTCGAAAAATAACCTAGCTCTCCATTAGGTCGTGCATAGTCTTTTAAGGTTGACCCGCCCTCAGCTATGGCCTTTTGCAAAATATTTTTGATAGAAAAAACAAGTGAAGTTATTTCATCAAAATTTAAATTTTTGCTAGGCCGTTGAGGATGTATCTTGGCTTGGTGCAAAGCCTCACAAGCATAAATATTACCAATGCCAGCAACTACTTTTTGATCTAATAATATTTTTTTTATGGGAGCAGAAGAAGAGAATAACTTCTTCTGTAAATATTCTGGATTAAAATTCCCAGTTAAAGGTTCTGGCCCGATATTTCTAAAATAAGACCATTCTAACAATTGATTTTTTTTTACTAAATCAAGAGAGCCAAAGCGACGCGGGTCATAAAGTGAAACTATAAAATTATTTTTTGTTTGAAGAACGAAGTGGTCGTGTTTCTCAAAATTTTCAGGATTTATTTTCCATCGACCGGACATCCCAAGATGGAAAATTAAAGCATCATCCCGATCATTGACAATAATGCCATATTTTGCGCGTCGACTAAGAGAAATGATAGTTGATCCTATTAGCCTTTCTTGAATATCAGAGGGTATTGGCCGTCTTAAAGATGCCCGGCGCACTTTCACATCTATAATTTTTTCACCCATCAGGACTTCAGATAGTCCACGAATGGTAGTTTCAACTTCTGGTAGCTCTGGCAACAGGATCTTCCGTTTAAATTGTCGAATATATCCAGAAATGGTTAATAGACAAAATTATTCAAGTATTCTGTTTTTGTGTAAACATAACATTGACCAAAAGATATTAGTTGTGGCAAAGAGCCTTTGATTTCAAGATCAAAATAATCCTAGCGGGTGTAGCTCAATGGTAGAGCAGGAGCTTCCCAAGCTTAAGACGAGGGTTCGATTCCCTTCATCCGCTCCATTCTCAACACCTAAAATAAGAAAAAACAGCAGCATAAAGTTTTTTACGCTTCAATGTTCTTTTGATTAGCATTAAAACTTTTTCATATGGAAAAAGTTTCTTTCGGTTTTTCAGACGTCTCGCCTCAGGAAAAAACACATCTTGTTGGCGATGTCTTTCGTCGTGTAGCATCGCGCTATGATTTAATGAATGATGCTATGTCAGGTGGACTCCATCGGTTATGGAAAGATGATTTTGTTCGTCTGGTTCAACCAAAGGCCAGTGAACATATTCTAGATATGGCGGGCGGCACCGGAGATATTGCCTTTCGATTGGCAAAATACGGTACAAATGTCACGATTGCAGATATCAATCCAGCTATGCTTGAAGTCGGAAAAAAGAGAGCTATAGCCCGCTCGATAGAAAACCTTACATGGAAAGAAGAAAATGCTGAAGCATTATCATTTAATGATAATGTTTTCGACGCTTACACTATAGCTTTTGGTATTAGAAACGTAACACATATACAAAAAGCACTAGACGAAGCGTGGCGTGTTTTAAAAGTAGGGGGACGTTTCTTTTGCATGGAATTTTCCCAGACGAAGTGGTCAGGATTTTCTAATCTTTATAAAATGTATTCAACACATATCGTACCTAAAATTGGTCAGCTATTAGCCAATGATGAGGACAGTTATCGCTATCTTATCGAATCTATCGAAAGATTCCCTAACATAGAAAAATTTTCTGATATGATTAAATCAGCTGGTTTTGTCCAAATACGAGCAAGGCCAATTTTAGGCGGTTTAGTGGCTATACATAGCGGTTGGAAAGTCTAATTAATGACATTAGCCGTTGTTCATTTTTGGCGTTTATTCCGCTGGAGCCGAATATTATCGAAACATGGCGTTTTAAAAACGGTAGAGAAATCTACAATTGTTCCGTTATCCCTTCGTTT
GATAATAAGATTACTTAGGATCGGCTATCGAGTTCCAAACCCACCCGATTATACAGGCGCTTTTGTCGCCTTAGGACCAGCTGCAATCAAATTTGGTCAGACTTTAGCGACACGACCAGATCTAATCGGAGAAGAAACAGCCGCTCAATTGGCATGTTTACAAGATTCTGTACCGCCTTTGCCTTTTAAAGATATTCGTATAGTTATCGAAAATGCCTTAGGATGTCCTATTGAGAAATCCTTCCGTTTTTTTAACGAAATTCCTATTGGTTCGGCATCTATTGCTCAAGTGTATCAGGCAGAAACACTTGATGGCGTTACAGTCGCGGTAAAAGTACTGCGTCCAGGAATAAAGCTGGCTTTTCGAAAGGCTACAGAAACATATGAATGGGCTGCAACGAAAATAGAATCATTAAATGATGAGTTTATCCGCCTACGTCCTAAATTAGTTGTAAAAACTTTCCGCCAATGGACAATACGTGAGCTTGATTTAAGGAGAGAAGCTGCGTCTGCCTCGGAGTTAGCGGAAAATATGGAAGCGGTTCCCGGTTTTAAAGTCCCTGTAATAGACTGGAATCGGACATCCCAGTCTATGATGGTAATGCAATGGATTGATGGCATTAAGCTTTCAGATCATCAAAAATTGCAGGAGGCCGGATATGATCTAAAATCTTTGGCAACTCGGCTTGTTCGCAGTTTCTTAAGGCAGGCTATAGCTGATGGTTTTTTCCATGCTGACCTCCATCACGGAAATTTATTTGCATTAAAAGATGGATCTCTTGCTGTTGTTGATTTTGGTATTATGGGTCGGATCGACAAAAAAGCACGGCGCTGGTTAGCTGAAATATTATACGGCCTTATTACAGGTAACTATTCCAGAATAGCGGAAATTCATTTCGAGGCAGGCTATGTTCCATCTCACCATGACATAGCAGAATTCACGACTGCTTTGCGGACTATTGGAGAACCTATCAGAGGATTATCTGTTCGCGATATTTCTATCAGTCATATGCTGGACGGGTTATTTTCTATAACCCGTGAATTTGAGATGCAGACCCAGCCACATCTTTTACTTTTGCAAAAAACGATGGTTGTAGTTGAAGGAGTGGCTACCTCTTTATATCCTGATATCAATTTATGGGATACAGCGGAGCCTTTTGTAAGAGAATGGTTACGCTCAGAGCTGGGACCCGAAGCTAAAATAGCTGAAGAATTTTATAAAACACTTCAAAGCATTAAGCGGTTACCAGAACTCATTGATCGGATAGATCAATATTATCCGCCTCTCACCATAGAGGAACAGCTCTATCCTGTAGCGAATAAGAATTCGATTAAATCTGCTGCTACAAAACATTATTTGCCTGTCTTTATCCTTAGCGCGTTTATTATCGGGATAGTTCTGGGCCATTATCATTTTTTTTAATTCTCTCCTCCAAATCGAGATAATCTATGTCTTCTTTTTTAAATGGAAAGCGAATCCTGTTAGTTATTTCTGGTAGTATTGCAGCTATCAAAGCCCCAGATATAATCCGCTTATTCAGAAAGAAAAAAGCTGACATCCGCTGTCTCATAACAAAAGGTGGAGCGAATTTTATAACGCCACTTGCTCTTGCTAGTTTATCAGGCAATCCAGTAGCGCAGGATATGTGGGATGAAAGCGAAGAGGCTTCAATTCGTCATATTCGTCTAGCCCGTGAAGCCGATATGATTATTGTCGCACCTGCATCGGCTGACTTTATTTCAAAAATGGCTCATGGATTGGCTAACGACTTGGCTTCTACTGTAGTTCTTGCAGCGGATAGTCCAATCTTAGTCGCGCCTGCAATGAATCATCGCATGTGGCATCACTCAGCAACTCAACGGAATATTCATCAACTAAAATCTGATGGCATATCTTTTGTTGATCCAGAAGCTGGAGCTATGGCTTGTGGAGAAACAGGAATTGGACGACTAGCAGCACCAGAAGACATTCTTTTATCTGCAGAGTCTCTCTTTGCAGA
AGAACAAAATCATCAATTTTTAAAAAATAGACATTTTATTGTGACAGCAGGACCAACACATGAACCAATTGATCCTGTCCGCTACCTAGCCAATCGCTCTTCTGGAAAACAGGGTTTCGCGATCGCTGAAGCCTTACAAAAATTAGGTGCAAAAGTTACACTTGTTACGGGGCCTGTTTATCAAAACACCCCAGACAGGGTAGAACGCGTGAATGTTGAAACGGCTATTGAAATGGAAAAAGCTGTCGAAAATGCTCTCCCAGCAGATGGTGCTATTTTTACAGCGGCTGTTGCTGACTGGAGGTTCAATTATTCTCCATCAAAATATAAAAAAGGTAGATCAGAACCTGAGCTTATACCGATAGAAAATCCAGATATCTTATCGAATATATCTCACAGGAAAATAGACCGTCCTTCTTTAGTTGTTGGTTTCGCTGCAGAATCAGAAAATCTATTAGAAAATGCAAAAGTAAAGTTACAACAAAAAGGTTGTGACTGGATAATAGCCAATCCTGTTATTGAGAGTGAAGATGCCCCCTCGGCAATGGGCGGAGACTATAATAAAATCTTCATTGCTAGCCATGCAGGTATAGAAAATTGGCCTCTTTTGTCCAAAAAAGAGGTTGCCTCTCGGTTAGCGAGTAAAATAGCCGATTACTTTAATCTTATAAATGAGCACCCATCATCAATTATATCAAAATAAAATTTTGCTTGGCATAAAGGGATATCATTTTATGAAAATAAAACTTGAAATAAAACGCCTACCCCATGGCGCAAACCTCCCCTTCCCTTCTTATGCTAGTGAGGGCGCTGCGGGAATGGATGTAGTGTCAGCAGAAGACGTAATTTTGCAACCTATGCAACGTTATCCGGTTAAAACTGGTTTCGCTGTAGCTATTCCTAACGGATATGAAATTCAGGTTCGTGCTAGATCAGGGTTAGCGTTAAAGCATGGTATAGCTTGCCCTAATGCTCCTGGAACTATTGATTCTGATTATCGTGGCGAAGTTAAAATTCTTCTTATTAATTTGGGCAGCGAAGCTTTTGAAATTAAAAGAGGTGATCGTATTGCGCAGCTGATATTGGCTTCAGTAACGCAAGCTGTATTTTGTGAGGTCACTGATTTAGATGATACCCAGCGTGGCCATAACGGATTTGGATCGACCGGAATATGATACTTACTGATAATCAGCTAGATCGTTATGCCCGTCATATTGTTTTACCTGAAATCGGCGGTAAGGGACAAAAAAAACTCTTATCAGCTCATGTCGCAATTGTTGGTGCTGGAGGAATAGGTTCTCCGGTTATTCAATATTTGGCGGCTGCTGGTGTCGGTCGTCTTACTATTGTTGATAATGACGAGGTTTCTCTCTCGAATTTACAGAGACAAACACTTTTTGCGACCCGTGATATCGGAGCACATAAAGTAGCTATGGCTGCTAATGTTGTCCAACGTCTGAATCCTGATGTTAAAGTATTGCCTTATGACCAGAAACTGGATGCAGAGAATGCTAAGAAGCTGATTGGTCAGGCTGACATTATTGTGGATGGCAGTGATAATTTTGGAACCCGTTTGGTTGTCAGCGATGTAGCAACAGAGTTGAAAATTCCTCTAGTGTCAGCTGCTGTTATTCGTTTTGAGGCGCAGTTAGCTGTTTTCAAAGGATATGAAGCAAGTAAACCTTGCTATCGCTGTTTTGTCGGTGATGATCCGGGTGAACCGGAGATGACTTGTTCATCGCAAGGTGTTCTCGGTGCATTGACTGGAACGATCGGAAGTTTAGCTGCGATTGAGACTATCCGTTTGATTACAAGATTTGGTTCCGAGAGCGGCGACAAATTGCTGCTGATTGATGCCAAAGATTTTCGTTTTAGAACTGTGTCTTTGAGTAAAGACCCGAATTGTCGCTGCAAGCATAGCGCGGAATAGCTTTTGATTGATTGGATAAATTAGATCGTAAGCGTATGATAAGACGATAACTAAAGAAATAAGAGTCATC
GCACGAGCCAAGGTAGCGTGGAATATATGCATAGATAAATAGAAATCTCTGCCTATCTTTGCAGGCGCCTTCTCGAGAGTGGCAAGGCTGTCTTTCGAAATGGATGCGGATGCTCGTGATCTTGATACTATCGACGCGAGTTTTTGGATCGCCTTTAATAATCGCTTTGCGGATTGAGATTAGAGGATCTGTAAAGATAGGGTGTGATTGAATCCGTTGTCGGCCTTGAGTGGAATAGATAGTCGGGCGGCTGAAAGCGAGGCTGGGAAGCTTGGGGTGTAAAAAAGCTAAGGATGTAAAGTAGAAACGGGATAGCCATCGCTATCCCGTTTTTGTTTTTATTCGGCCACTTCGTCTTTGGGGGTGGCTTTAGAAGTCTTTTTCGCTGCGGTCGTTTTGCTGGATGGAGCTTTTTTAGTCGCGCTTTTTGAAGCGGGCTTCTTTGCCGTTGTTTTTTTTGCGACAGCTTTCTTGGCGGGTGACTTGGCAGGAGTGGCATCGCCATCTTCTGAAGTCGCGTTGTCGGTAGTGGCCGATTTTGGGGTTGCAGTTTTTGTTGTCGCCTTTTTAGGGGCGGCTTTTTTTGTGGTGGTCTTCTTGGCGGCCGGTTTCTTGGCTGCTGCCTTTTTGGTTTTCTTCTTTTTCGGCATTTTGGCGCGAGCTTCTAATAACGCCAAAGCCTCTTCGAAAGTCAGGCTTTCGGGGGTTGTGTCTTTCGGTAAAGAGGCATTGGTTTCGCCATCGGTGACATAGGGCCCGTAGCGGCCATTCATCAGTTGAACTGGCTTTTCGGTAACAGGGCTATCGCCAAAAACCTTCAAAGGTTCGCGGGATAATGTCGCACCGGTGCGCCCCTTATTTTGGGCGGCTTCGGCCAGTTTTACAACAGCCCGATCGATGTCAATGTCCAACAGCTCTTCCGTGGAAGACAGGCGGGCATATTTGCCGTCACAATTCAGATAAGGACCGTAACGTCCGGTTCCGGCAATAATCATTTTGCCGCTTTCGGGATGAAGCCCAATTTCGCGAGGCAGTGCCAAGAGGCGGCTGGCTAATGAGAAATCAACCTCCCTTGGATTGACATCGCGGGGGACAGAAACCCGTTTAGCATTTTTGCCTTCACCAAGTTGGATATAAGGCCCAAAACGGCCTGTCCGGCGTGTAACCGCCCTTGCCTTTCTGGATTCTTCTTCCTCTTTGGCTTGTTCATCCGCTTTGCTAGCCGCCGTATCATTGTTTTCTGTTTCCAGAGCAACTTTTACCGAGGAGGTCGCAGAAGAAGGCGTGCTAGCCAGATGGGCGACATTATCTGAAGGGCTGTTATCAGACGTTATGGCATTGCCGACATCATGGCCGGTAGAAACGGCCTTGCTATCAATCTTTTTATCACTGTTAGAGACAGAGGCATCGTCAGAATTGCGGTCAGGAGAGAGCGCATGGCTGGCATTCTGATCGCTAGAACTGTCCTTGTGATCCGCAGGAATCGCGCGGCTATCAATGTCTCCATTACGATGAGATAGAAGCCTATTATCGGCATCCTGACCGGAAGAAGATGCCCCGCTGCCAATATCCCCATCAGGATCGGAGACCGTAGCATTATCGTCAGTAGAACCGCCTTTGCGCGCCGCAGCGATTGTGCTGCTGCTGTTATTTCCCTTGTTGTCGGAGACGGCGGTACGACCTGCGTTATTTTTACCGGCAAGCGCCTGATTGCTGATACCGGCTTGGCTTTCAGCATCGATTCCGGCAGTTGTCGAAACCCCATGAGCCGAGGAATCCCTGCTATTGCCAGAAGACGCCGTTTCGTCACCTGTCATGCGGGATTGTGAGGAGGCGACATTTCCGGTTGAAGAGGCGGGTTCGTCAGGATCAAAAGGTAAGTCATCCCTGTCTGTATGGGATGCGCTATTCCCCTTACCCGAAAAGGCCGCATTAGCCGCGATACCGCCGTCAGTAACGCCCGTTTCAGAGGCGGAAGCGGTGTTAGCGGGTAAAATATC
CCCTTGGTCTGAAAACGCCATATAGCCGTCATCATCCGAATTTTCGGGCATATAACCGAGCAGGACGGGTTCGTTATCGTCAACGCCGTCTTCACCCTGCCCAAAGCCACGGGTATATTTACATTCGGGATAATTCGAGCAGGCAATAAAGGCACCGAAACGGCCACCTTTTAACGCCAAACGACCCGTATGGCAGGAGGGACATTCACGGGGATCATGGCCGTCACCCTTGTCAGGGAAAAGCCAAGGTTCAAGGAATTTATCCAATTCTGCGGTGATAGCCGAGGGCTGTTGTTCCATGACTTCGGCGGTTTTGGGTTTGAAGTCATGCCAGAAAGCATCAAGGACTTTTTGCCAGTCAGCCCGACCTCCGGAAATTTCGTCAAGGCTGTCTTCCAAGCCCGCGGTAAAGTCATAGCTGACATAACGTTCGAAGAAACGTTCGAGAAAGGCCGTCAGCAAACGTCCGGCTTCGCTGGGAATGAAGCGGTTGCGTTCCAGCGTGACATAGGCACGGTCTTTTAGAACCTGAATAACCGAAGCGTAAGTGGATGGGCGGCCAATGCCCAAATCTTCCATTTTTTTGACCAAGCTCGCTTCCGAATAACGGGGCGGCGGCTGGGTGAAATGTTGATCTGCCCTGACTTCTTTTTTCAAAGGTGCATCGCCTGCCCGTAATAAAGGCAAACGGGCGCGGCCTTCTTTGCCATCGCGGTCATTGGCGTTATCGTCAGCGCTTTCTTCATAAAGCGTCAGAAAGCCAGAGAATAAGACGACCTGCCCCGTGACCCGCAAGACATTTTGCCCCGTGCCATCGGTTAAATCGACCGTTGTCCGTTCCAATTTGGCTGAGGCCATTTGGCTGGCAAGGGCGCGTTTCCAGATCAGTTCATAAAGACGGGCATGATCGCCGCTGGCCGCTTTGTCGCGCGAAAAATCGGTTGGGCGAATAGCCTCATGGGCTTCTTGGGCATTTTTAGCTTTGACCTGATATTGTCTCGGCTTGTCCGGCACATAGCCGCCGTCATAGCGTTGGGTGATGGTTTTTCGAGCCGCGGCAATCGCGCTTTCATCCATCTGAACCCCGTCCGTCCGCATATAGGTAATAAGCCCTTCTTCATAAAGGCTTTGAGCGAGGCGCATCGTATGGCTGGCCGCAAAACCCAATTTACGGGCGGCTTCCTGTTGCAAGGTCGAAGTTGTGAAAGGTGGTTGCGGATTACGAGAAACCGGTTTGGTTTCGACATTGGAAACCGTGAAATGACCGGATTCAACATCGGATTGAGCGGCCTCGGCCTGCGTCTTATCTGCGATAGACAGCTTTTCGAGTTTTTGGCCGCGCCATTGGATAAGTCGCGACGTAAAGCCGATACTGTCGGCTTCCATATCGGCTTCTACCGACCAATATTCTTGCGGTTTAAAGCCTTCTATTTCTTGTTCGCGGTCAACCACCAGCCGTAAGGCGACTGATTGCACCCGTCCGGCTGATTTTGCGCCCGGTAATTTCCGCCATAAAATCGGAGAGAGCGTGAAACCGACGAGATAATCCAAAGCCCGACGCGCCAGATAGGCATTGATTAAATCATTATCGAGTTGGCGAGGATGCGCCATAGCATCGGTAACAGCCGCTTTGGTAATAGCATTAAAGGTCACCCGTTCGACATCATCGGGTAACAGTTTTTTGGTCTTTAATAATTCGAGAATATGCCAGCTGATCGCTTCCCCTTCGCGGTCAGGGTCAGTCGCGAGAATAAGGCTGTCAGACTCTTTTACCGCGTCTTCAATCGCTTTCAGGCGGTGGGCTTTGTCAGGGTAATTTTGCCAGACCATGGAAAAGCCGTTGTCGGGATTAACAGAGCCATCCTTGGAGGGAAGATCGCGGATATGACCATAAGAGGCCAGCACCTGATAGCCCGACCCAAGATATTTTTCGATAGTTTTGGCCTTGGCCGGTGATTCGACAACAACGAGTTTCATGATACTCCTGACCATTCCGCGCA
ATTTACGGCGGAGAGTGAGCCTTTTTTTTCGTTATCAAAGGGAAAAATCCCTTTGAAAAAATGATAAACAGACAAGGGTTGATCTATTCTGTATTCGCGGGGCAGAGTTGAAAGCAACAAAAAATCATTCAAAAAAGCAGACTTACCCGTCCGCCTGCATGACTTTCGAGCCGTCCAGCTAGCTCCATTTCCAATAAGATGGTTTGGATTATTGCATTTTCTAGCCCTGATTGCCGGATAAGTTCATTTACCCCGACAGGGGCTGAACCAATCAGGCTTTGGATAATTTCGCGGTCCTTTGCCTTGACCTCGGCAATGGCCGGAGAAGAACGATAGCCAATCTGGCCATTTTCCAAGCCGGAAAAACGCGGTTCGAGGGGGCTTTTTTCGGTGGGTTCTTCAAAAAGCGCGGCCATGACAGAAACTGCCTCGATAATCTGATCACTATTTTGGATCAGTGTCGCACCTTCCCTAATAAGCTGGTTACAGCCTTGGCTGCGCGCATCCATCGGAAAGCCCGGAATGGCCATGACTTCCCGCCCGATTTCTGTCGCCAATTTGGCGGTAATAAGTGAACCGGATTTAGGAGAGGCTTCGACCACCAAAACGCCAAGGGCAATTCCGGCGATAATCCGGTTTCGTGCTGGGAAATGGCGGGCTTTTGGTTCGGTTTTGGGTGGCATTTCACTGACGACCAAGCCTTTTTCAGAAATAGCCTGCTGCAATTCTTTGTTTTCGGGTGGATAAAAGCTATCGAGACCCCCCGCAATAACCGCAATCGTTTTCTCTATCCCCGCCCCTTGATGCGCGGCGGTATCAATGCCGCGTGCCAGACCGGAGATAACAGGATAGCCTTTATTCGCTAATTCTTGTCCGAGATCATGGGCAAAACGGCAGGCGGCGGCCGAAGCATTTCTTGCCCCGACCATCGCAATCGCCGGTTTTTGCAATAAACGCGGATTCCCTTTCACCAAGAGAACCGGAGGGGCATTATCCAATTGATCGAGAAGCGGCGGATAATAAGGCGTATGGCGAAAAATATAACGCGCGCCTAAAGTTTTGACTTGCTCTATTTCTTCATTGACCGTTTTTTCCGAGGCGATTTTCGGCGCTTGGCCGCCGCCCCTTTTAGCCAGAAGCGGCACGGCTTTTAGCGCTTCTTGGGGTGAAGAATATCGGGTTATCAATCGGTTATAGGTTACAGCCCCGATACGGGGTGTTCGGATAAGCCGTAAAGAGGCGCGGTAATCTTCCTCGGTCATAGACGGGAAGCGAGATGAGGTCAT
Protein sequences of DBSCAN-SWA_2 >NZ_CP023715|1209063:1222271|1216156_1216915_+|WP_011241015.1|DBSCAN-SWA MILTDNQLDRYARHIVLPEIGGKGQKKLLSAHVAIVGAGGIGSPVIQYLAAAGVGRLTIVDNDEVSLSNLQRQTLFATRDIGAHKVAMAANVVQRLNPDVKVLPYDQKLDAENAKKLIGQADIIVDGSDNFGTRLVVSDVATELKIPLVSAAVIRFEAQLAVFKGYEASKPCYRCFVGDDPGEPEMTCSSQGVLGALTGTIGSLAAIETIRLITRFGSESGDKLLLIDAKDFRFRTVSLSKDPNCRCKHSAE >NZ_CP023715|1209063:1222271|1221113_1222271_-|WP_011241017.1|DBSCAN-SWA MTSSRFPSMTEEDYRASLRLIRTPRIGAVTYNRLITRYSSPQEALKAVPLLAKRGGGQAPKIASEKTVNEEIEQVKTLGARYIFRHTPYYPPLLDQLDNAPPVLLVKGNPRLLQKPAIAMVGARNASAAACRFAHDLGQELANKGYPVISGLARGIDTAAHQGAGIEKTIAVIAGGLDSFYPPENKELQQAISEKGLVVSEMPPKTEPKARHFPARNRIIAGIALGVLVVEASPKSGSLITAKLATEIGREVMAIPGFPMDARSQGCNQLIREGATLIQNSDQIIEAVSVMAALFEEPTEKSPLEPRFSGLENGQIGYRSSPAIAEVKAKDREIIQSLIGSAPVGVNELIRQSGLENAIIQTILLEMELAGRLESHAGGRVSLLF >NZ_CP023715|1209063:1222271|1215719_1216160_+|WP_011241014.1|DBSCAN-SWA MKIKLEIKRLPHGANLPFPSYASEGAAGMDVVSAEDVILQPMQRYPVKTGFAVAIPNGYEIQVRARSGLALKHGIACPNAPGTIDSDYRGEVKILLINLGSEAFEIKRGDRIAQLILASVTQAVFCEVTDLDDTQRGHNGFGSTGI >NZ_CP023715|1209063:1222271|1211015_1211828_-|WP_011241010.1|DBSCAN-SWA MPELPEVETTIRGLSEVLMGEKIIDVKVRRASLRRPIPSDIQERLIGSTIISLSRRAKYGIIVNDRDDALIFHLGMSGRWKINPENFEKHDHFVLQTKNNFIVSLYDPRRFGSLDLVKKNQLLEWSYFRNIGPEPLTGNFNPEYLQKKLFSSSAPIKKILLDQKVVAGIGNIYACEALHQAKIHPQRPSKNLNFDEITSLVFSIKNILQKAIAEGGSTLKDYARPNGELGYFSTKFKVYGKEGEQCECGHTIERYTLGGRSTFLCSSCQK >NZ_CP023715|1209063:1222271|1214434_1215688_+|WP_011241013.1|DBSCAN-SWA MSSFLNGKRILLVISGSIAAIKAPDIIRLFRKKKADIRCLITKGGANFITPLALASLSGNPVAQDMWDESEEASIRHIRLAREADMIIVAPASADFISKMAHGLANDLASTVVLAADSPILVAPAMNHRMWHHSATQRNIHQLKSDGISFVDPEAGAMACGETGIGRLAAPEDILLSAESLFAEEQNHQFLKNRHFIVTAGPTHEPIDPVRYLANRSSGKQGFAIAEALQKLGAKVTLVTGPVYQNTPDRVERVNVETAIEMEKAVENALPADGAIFTAAVADWRFNYSPSKYKKGRSEPELIPIENPDILSNISHRKIDRPSLVVGFAAESENLLENAKVKLQQKGCDWIIANPVIESEDAPSAMGGDYNKIFIASHAGIENWPLLSKKEVASRLASKIADYFNLINEHPSSIISK >NZ_CP023715|1209063:1222271|1212869_1214408_+|WP_011241012.1|DBSCAN-SWA 
MTLAVVHFWRLFRWSRILSKHGVLKTVEKSTIVPLSLRLIIRLLRIGYRVPNPPDYTGAFVALGPAAIKFGQTLATRPDLIGEETAAQLACLQDSVPPLPFKDIRIVIENALGCPIEKSFRFFNEIPIGSASIAQVYQAETLDGVTVAVKVLRPGIKLAFRKATETYEWAATKIESLNDEFIRLRPKLVVKTFRQWTIRELDLRREAASASELAENMEAVPGFKVPVIDWNRTSQSMMVMQWIDGIKLSDHQKLQEAGYDLKSLATRLVRSFLRQAIADGFFHADLHHGNLFALKDGSLAVVDFGIMGRIDKKARRWLAEILYGLITGNYSRIAEIHFEAGYVPSHHDIAEFTTALRTIGEPIRGLSVRDISISHMLDGLFSITREFEMQTQPHLLLLQKTMVVVEGVATSLYPDINLWDTAEPFVREWLRSELGPEAKIAEEFYKTLQSIKRLPELIDRIDQYYPPLTIEEQLYPVANKNSIKSAATKHYLPVFILSAFIIGIVLGHYHFF >NZ_CP023715|1209063:1222271|1217323_1220962_-|WP_011241016.1|DBSCAN-SWA MKLVVVESPAKAKTIEKYLGSGYQVLASYGHIRDLPSKDGSVNPDNGFSMVWQNYPDKAHRLKAIEDAVKESDSLILATDPDREGEAISWHILELLKTKKLLPDDVERVTFNAITKAAVTDAMAHPRQLDNDLINAYLARRALDYLVGFTLSPILWRKLPGAKSAGRVQSVALRLVVDREQEIEGFKPQEYWSVEADMEADSIGFTSRLIQWRGQKLEKLSIADKTQAEAAQSDVESGHFTVSNVETKPVSRNPQPPFTTSTLQQEAARKLGFAASHTMRLAQSLYEEGLITYMRTDGVQMDESAIAAARKTITQRYDGGYVPDKPRQYQVKAKNAQEAHEAIRPTDFSRDKAASGDHARLYELIWKRALASQMASAKLERTTVDLTDGTGQNVLRVTGQVVLFSGFLTLYEESADDNANDRDGKEGRARLPLLRAGDAPLKKEVRADQHFTQPPPRYSEASLVKKMEDLGIGRPSTYASVIQVLKDRAYVTLERNRFIPSEAGRLLTAFLERFFERYVSYDFTAGLEDSLDEISGGRADWQKVLDAFWHDFKPKTAEVMEQQPSAITAELDKFLEPWLFPDKGDGHDPRECPSCHTGRLALKGGRFGAFIACSNYPECKYTRGFGQGEDGVDDNEPVLLGYMPENSDDDGYMAFSDQGDILPANTASASETGVTDGGIAANAAFSGKGNSASHTDRDDLPFDPDEPASSTGNVASSQSRMTGDETASSGNSRDSSAHGVSTTAGIDAESQAGISNQALAGKNNAGRTAVSDNKGNNSSSTIAAARKGGSTDDNATVSDPDGDIGSGASSSGQDADNRLLSHRNGDIDSRAIPADHKDSSSDQNASHALSPDRNSDDASVSNSDKKIDSKAVSTGHDVGNAITSDNSPSDNVAHLASTPSSATSSVKVALETENNDTAASKADEQAKEEEESRKARAVTRRTGRFGPYIQLGEGKNAKRVSVPRDVNPREVDFSLASRLLALPREIGLHPESGKMIIAGTGRYGPYLNCDGKYARLSSTEELLDIDIDRAVVKLAEAAQNKGRTGATLSREPLKVFGDSPVTEKPVQLMNGRYGPYVTDGETNASLPKDTTPESLTFEEALALLEARAKMPKKKKTKKAAAKKPAAKKTTTKKAAPKKATTKTATPKSATTDNATSEDGDATPAKSPAKKAVAKKTTAKKPASKSATKKAPSSKTTAAKKTSKATPKDEVAE >NZ_CP023715|1209063:1222271|1209063_1211004_+|WP_011241009.1|DBSCAN-SWA 
MKRQPQTNKGNINLLSYKKKRYFKTACSLAMSFLLFPPTPSAFAATVTNKNPVIEPGFLSDDDIALENSETDFSQVVVSRVVPIDNVKEVLSPREIQGYKACLSAIRHGSWQAAQQWISSNPDGLLTDFVTAELYLAKNSPKVSPDDIINLIQKSPYLPQGEALARLAYNHGVKTLPALPEKNNLFFLSGAPQRPVNHHNATNRSAMVFETKALPLIKADASDQAENLFQDSQQNLSDDIKTEWAQRIAWSYYLNGNDDAARRLAVQAENGTGRWTTDASWVEGLADWQKEDFSAAMQAFSRVASYSENKEMKAAGLFWEARAAMASGHPEYTQNLLRSASHMPETFYGLLAEKALGKTVKSSVRPVALSEDELKCLTTQPNIRIAIALNQIGEYQLASETLKHQARLGAPKDHNNLIHLASYLRLPEAQLWLAHHGPAGFSADVDARYPSPNWAPASGWHVDPYLVYAHALQESQFRSRAVSNKGARGVMQITPATARLVAKQHATNIDIEDLDKPTISFEYGQAYIEWLRDTSYTGGLLPKVIAAYNAGPAALPRWKDRDHGDPLLFIESIPYAETRAYVATVLRNYWIYQQKASSSSQSLNAMAQGMWPKFPGMPGDNAVRLNNKAKEVNKLLALNNINKQNM >NZ_CP023715|1209063:1222271|1212137_1212866_+|WP_011241011.1|DBSCAN-SWA MEKVSFGFSDVSPQEKTHLVGDVFRRVASRYDLMNDAMSGGLHRLWKDDFVRLVQPKASEHILDMAGGTGDIAFRLAKYGTNVTIADINPAMLEVGKKRAIARSIENLTWKEENAEALSFNDNVFDAYTIAFGIRNVTHIQKALDEAWRVLKVGGRFFCMEFSQTKWSGFSNLYKMYSTHIVPKIGQLLANDEDSYRYLIESIERFPNIEKFSDMIKSAGFVQIRARPILGGLVAIHSGWKV |
9 | Pseudomonas_phage(12.5%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|

CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|

CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|

Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
2696 : 14768
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP023718|2696:14768|DBSCAN-SWA CATGGCGCAATCACAATATCAAAATGCAAATCTATCAAAAATTGACCTGTCACAACTACCGGCACCGACGATTATTGAACCGCTGGATTACGAAACAATTTATCGACAGCAGCTCATCAATTTTCGCTCCGATAATCCGCAATTTGTAGACAACGATATTGTTGAATCTGATGTTGTTGTAAAATTGCTAGAGACAACATCCTATCGTGAATTATTGCTTCGGCAACGCATTAATGATGCTGTAAAGGGAGTGCTCGTTTCATTTGCACATGGCTCTGATCTTGATCAGATTGCTGCTCGGTTTGGTGTTGTTAGACAAACCGTTACACCGGCAGATACGTTGAATAATGTTTCCGCTGTAATAGAAACAGATGATAGCTTGCTGAATCGTATTCTACTTGCACCATCTGGATTTTCAGTAGCAGGTCCCGCTGATGCTTACAGATTTCATGCTCTCTCAGCAGACGGCAGGGTCTTCGACGCTAGCGCCACATCTCCTGAAGCCGGTACGGTTCTAGTCTCTGTATTATCAAACGAAAACGATGGCACGGCATCTTCTGAGATATTGCAGAAGGTTAATGCACATCTGACAGCTGATGATATTCGTCCTCTCACTGATTATGTCATTGTAAAATCAGCTGAAATTATACCATTTTCTGTCGATGCAACGTTAAAATTTTACGCTGGGCCTTCTGCTGCAATTGTGCAGCAAAATGCTATAGATCGGTTAAATAATTATCTAAAAAATAGTCGAAAAATTGGGCGCGATATCACTATCGCGGCCCTCCATGCGGCTCTGACAGTCGAAGGGGTTCAGAACGTTATACTTAATAGTCCGACTGTAGACATCGTTATTAGCGATACACAAGCAGGATATTGCAAATCAATCAATATCAGTAACGGCGGGATTGCAGAATGATTACCTCACTGCTTCCGCCTAATTCGACAGACCTTGAACGCGCGCTCGAAAAAACCTCTGCCAGCAATTTTGCCCTGCCTGCTGAGGATGTCAAAGAAATTTGGAACCCTGACACTTGCCCAACTGACTTGCTACCCTATTGCGCCGTAAATAACGGACTCAATCAATGGTCAGATGACTGGCCAGAGAACGTCAAGCGCCAGCGTATCTCCACAGCAATCGAAATCGCTAGACACCAAGGGACGGTCAAATCAATTCGTGATGTTGTGTCTGCATTCGGCAGTGCTGTTGAAATGCGGGAATGGTTCGAAACTGAACCTCGAGGAAAACCACGTACATTTGATTTGGTTCTGACTGTCAATAGCCAAAGCGATGCTTCACCAAGCGCTAAATATGTCGACGATATTGTTCAGGCGATTGTGTCAGCCAAACCAGCAGCTGCCCAATTTCTATTCACCATGGGCATCGATACCGCTGGTCAAATAGGGGAAATTGGTTTCGCCCGTCCTGTAATTTCTATTCATCTTATTTCAGAGGATGCACGTTAGTGGCTCTTAAAATTACCATTACTGATGCTGGTCGCGCCGCTCTGATTAATGCAGACCACTCTGGTACAAGACTAGTCAGCATTTCCAGTGTCGGTGTTTCTGCCACAGCAATCACAGCCGATAAATCTGCTACCAGTCTAGCAGATGAGATCAAACGTTTAACAACAATTTCAGGGAAGATTACGGCTTCTGATGCAATCCATCTTATTGCCAAAGATGATGGTTCTGATGTCTATACCATTAAATCTTTTGCATTATATCTGGATGACGGAACGTTATTCGCGATCTACGGCCAGTCGTCGCCTATTCTTGAAAAATCAGCGGCTGCAATGCTTCTGCTACAGTTGGATATCCGTTTCGCTGATATCGACGCCACACAAATTCAATTCGGCAATCTCGACTTTATCAATCCAGCAGCGACAGATCAAACCGTAGGTGTCGCCCGTTTAACCGCGCTTGGC
GAGGCAGAACAGTCGACCGAGAATATTGCTGTCTCGCCCGCTGAATTGCGACGCTACGCAGAAAAGCTAGCCCGCCAAGCTGATATGATCGCAGCTCTGGCTACCAAATTTGATAAAACGGGCGGAACAGTTAATGGTTTCGTGAATGCAGATGCATTTAACAGCGGCGCATTTTACGCGAATAAAGAGAAAGCCGATTTTTCCTATGGCGGTCAGCATCTAGCTCTTCAGGCTGATGGTAATTGCGTTTATTATGATGGCACTGCTCCCTATGCTAGTATTAGTCCTACTCTTGCAAACTTTCCGGCGAATACACTCGTTACTGGCAATACTGTCTGGCACCGTGGTAACGATGGTGCGGGTTCTGGCTTGGATGCTGACTTATTAGATGGTCTTGATAGTTCGGCTTTCCTTCGTACTGGTGGATCTACTTTTACCGGAGATAGCTATGTAAATGGCTGGGCGTTTTATTGCTCAACTGGTGGCTCAAAAGCACGTATCAATTTCGAGACGATAGATCATATTTCCTTGTGGAATAATGCTGGAAATGTTGCAGCATATTTCGATTTATCCAGTGATAATAAAGACCTATATGTCGGCGGAAATATTAGAGCATCAGCTTTAAATACAGCAACATTCTCAGCAAATGGTACGCAAGTCGATTTTCTCTACGCGAACCGCCGGACATCATTTCAGCCCGATGGAAATAACGTTTGGTACGGGACGGATGGTGTAGCAAGATCGATAATCAAAGACGGTAATTTCTGGACGCACGGCAGTGTCTCAACAGATAATGTAGTCAATGCGACGGGTATGTTTCATGTTCAATCATGGAACGGCCTAAATGCTCTTGATATGCACTCTGACGATGATGGTATCGTTCGGATGTATGCGAAAAATGGAAATGTCGACTATCAGGGGTTTTTGGCTTTTAATCCACAGGGGAACATCGACTTTTATTCTCCTACTGGTCAGTGTTTTATCAATGGCCAGTTGATCTGGAATAAAAATAATGATGGAGCTGGCTCTGGTTTAGATGCCGATTTCTTGGACGGTATCGATAGTACCGGATTTGTAAAAGTTACTGGGTCTGCCATGCAGGGCGACCTCTTTATGAATGGCTGGGCAATCTATAACCAAGGTGGCTACGGTTCAGGTAGCACTCGTGTGCAATTTGAAAAGAATAATCACCTGACGTTGTTTAATGGAAATGGTGAGATAGCCGTATGGTTTGACTTGGATGGCTCTAATAAAGCTCTAAATTGTAACGATGCCATAAATACGCAGAAACTGTATTCTCCGACATTCGCAACAGATCAATCGTCAGTAACTATAAGCTATGGCAAAAGACATGTGGCCTTTCAGCCCGATGGAAATAATGTCTGGTATGGGACAGATAATGCCGTTTTAACGAGTATTCTGGATGGCAATTTCTGGACGCGTGGCTCTCTATCAGCAGCTAATAATATCAATGCGACAGGTATGTTGCATGTTCAATCATTCAATGGCCAAAATGGTTTAGACATGCATTCCGATGATGATGGTATCGTCAGAATGTATGGGAAGAACGCGAATGTTAATTGGCAGGGGTTTTTGGCTTTTAATCCACAGGGTAACATCGATTTCTATTCTCCTACTGGTCAGTGTTTTATTAACGGCCAGTTGATCTGGAATAAGAATAATGACGGGGCTGGCTCTGGTTTAGATGCTGATCTTCTAGACGGTCTCGATAGCTCTTATTTTGCACCCAATTCTCGAATACAGACGTGGGGTGTTCCCAACGGGCTCATGAAACGTGTCGATGATCATGTCGAACTACATCTGCGTTTAGATAAAATAAATGGCACGATCACGCTGCCTCATACCTTTAATAACATCGTTGATATTCGCGTTCAGCCCTTTGCCCATGATGGCGGTGATGCGACATTTGCAGCTCCGGTCAGCAATACAAATAATACAGTTACGATTGACGCTTGGGCTCGATGGAAAGGTGAAACCCA
TGACGTGGACGTGGGTGGATTTATCTACGTCTATGGCAATTAAATCTTTAAATTCAGGAGAATAAAATGAACGTATTTACGATGCCGACTATTAAGTCGGTTAGCGAACTCGACAGCAATACGCAGTCAGTCTCAGTCATTTTTGGTCTTGTCGATCGCACTGGCCAGAGCTTTGACTATGCTCGTGATGTCAATGCCGTCTTTGATACAAATAACAATTTTGATCAAATGGGAACAATGCAGCGCTGTTATCAGGTGCTGGCTGGTCTCATCACGAAATGCTCAAGCGGCGTAATCTCTAAAAAGCAGCAGGACGATTATATCGCTCAACAAGCCGCAGCTAAAAAAGCCGCTGAAGATGCCGAAATGAAGCGCAAGGCCGACGAAGAGGCTCAAGCAGCTGCATTGAAATCGGCTGCCGAAGCTGCCGCAAAAGAAGCAGCCGACACGGCATCATCTGATCAAGCAAATGCAACAGCGGCACAGTCTGATGCAACGACTACTTCGGCAACACCCGCCGCCTAATTTTATACAATAAGAAAGATCGGGGGCTGGCCAACATGGTCAGCCTATGAATGAATAGGTCGCTTTGGGAATAAAAATAAGAACAAAAGGAGAACAAAAATAGGCAAATCCTGTCTTTCTCCATGTTTTTTCTGTGTAGCGAGCGGAAAAATAGCGACTTTCCCATCGACACCTTTGATGGGAACAATCGACCTTTCTTTGTATGATTATTGCCGCCAATAATTCGGTATGCACAATTTAAATCGAAAAATAGCAAACATGATCGCATTCGGTGTGGTGAAAAGCCTCAGCGATCAAGGCGCTATTATTACTATTGCAGATATCGATACGCCAGAGCTTCCATGGTCGACTATAGCCAATAAAAACTTGAGCATTTGGTCTCCACCTCCTGTAGGTGCGCAAGTCATCGTGTTTGCACCTAATGGCGATTTGAATAGTGCTGCGATTATTGGCCAGCTACCATCCGATAGCCATCCGTTACCATCAAAAAGTGAAAGCGAGATTGTCATCGCATTTGGGGATGGGACGCTAATCAAATATGATTTGTCAGATAAGAAATTTACGGGAGAATTTGCAGGGGACGCAACGCTTAAATTCCCGCAAGGTCTCGAAATTATTGGAGACTTATCTGTCTCTGGAAAAGTGACGGCAGAAACAGTCAAAGCCGACACAATAATCGGTAGCTCTGACGTCTCCGGTGGCGGAATTAGCCTGAAAAATCATACGCATTCAACAAAAATGGGACCAACGAGTCCACCATCATGATGACAGAACATATCTTGGGCTGGTTTTTCGGTGGCCTGATTATAGCCGTCATGGTGTCTTTGGGAGCGATCGCGGGGCCACCCCATGACTAGTTTTGATCGTTACACCGGCCAGTCTATTTCAAACGATGCATCAATCTGTCAGTCAATTGGCGATATCATCACCACACCTGTTGGCAGCCGAGTTTGTCGCCGTGATTACGGCTCTCTTGTCCCAGAATTACTAGATCAGCCGCTGTCTGCTCGGACACAATTATTGCTCTACGCATCGACAGCAAACGCAGTCTCGGAATGGGAACCGCGTGTCACGCTGAAAACCGTTAATCTGACAGTCGATACCACTGGGAAATCGGTGCTCGCAATGCAGTATTCATACAAAAATCAGCCAAAAACAGGATCGCTCTCACTCTCTCTAGGAAACATATAATGTCCATTATTCACGGGATTTCAATTAATGAAACGTCTGATACATCCGGCGCGATTACAGTTACGTCTACGGCTGTCATCGGAATCGTCGCAACAGCAGATGACGCTGACAGTTCAATATTTCCGCTCGACACACCAGTCTTAATTTCCGATTTAACCGCCGCTATTGGTAAGTCCGGAAAAACGGGTACACTCCACAAAGCATTGATTGCAATCAATGCCAATGTCTCAGCTCCAGTTATTGTTGTCCGTGTTGCCGATAACAAAGATACAGATGCTCTTAAT
GCATCTGTGATCGGGCTTTACGATGGCAAGCGCACCGGAATTCAAGCCTTACTTTCAGCCGAAGCGGCGACAGGACTTCGTCCGACTATCATCGGAGCCCCCGACCTTGATACGCAGCCTGTTGTCACAGCCCTTGTTTCTATTGCCAAGAGTCTGCGGGCAATGGTCTACGCAAAGGCAATCGGAGATACCATCTCTGATGCCATTAAATATCGAGCCAACTTTGATGCGCGCGAACTAATGTTAATTTGGCCGAATGTCATTGCATATGATTCCGTCAATGCCAAAAATGATGTGTTTTCATCAGCGGCATTCGCTCTTGGTCAGCGAGCGGCTATCGACGAGAGTACCGGATTTAACAAGACGGTATCAAACGTCGCCGTCAGTGGCATTCTCGGATTAGAACATCCGATTTCTTTTGATCTTACAAGCATGAATTCTGATGCCGGTTTATTGAACCAGTCTCAGATTACCGCTGTTATTCGACATAATGGATTTCGTTTCTGGGGTAACCGTGCTGCAACCTCAGATCGTGACTACGCATTCGAAAGCGCGACACGGACACATTATACGATCATTGAATCGATTATCTCAGGTAGCGAGTGGGCAATTGATCAGCCGCTAACAACAGCGCTGATCCAAACGATTGTCGACGAAGTCAATAATCTCTTTCGAACACTCAAACTTAAAAATCAGATTATTGGGGCAAAATGCTGGTACGATACAGACAAGAATTCTGCCGCCAATCTCGCGGCTGGCCAGCTGTATCTCAGTTATAATTTTACGCCAACGGCTCCCAATGAAAACCTCAATATCACGGCCACAATTACAGACACATATTACGTCAACCTGAACTCTTCTCTGTCCAGCTAGACGAAATCCGGAGCATAAAATGAGTATACCAAAAACACTACGCTCGATGGTTCTATTCCACGGCAGTCAGACATGGCTCGGTGAGGTTAGCAATGTGACCCTCCCGAAATTAACCCGTAAAACTTCGGACTGGCGAGGTGGCGGAATGCCTGCTGGGGTACCTCTTGACCTTGGTATGGATAACCTCGGTGAACTATCATTCACTGCGGGGGGGCCGATGCAAAAGACCCTGTCTGCCTTTGCAGGTGGGATGCTTGGTCAGAATTTACGATTTGTCGGAGAATATCGTCAACAGGATACAGGGCTTGTCGATATAATCGAAGTATCCGTCCGTGGCCGCTATTCAGAAATTGATTTCGGTGAGCAAAAAGTCGGGGATGTCGGATCGTTCAAAGGAACAGTCAAGCTCTCTTATCTGAAAATTGTTTGGAATGGTTCAACGCTGATTGAACTTGATCCTCTTCTCGGAACCGAGATTGTTGACGGTATCAATATCGGATCAACATTGTTTAAAACGCTCGGCCTTATTTAATATTTATATAAAAATTTAAAAAATATAAATATAAAATTATTTTAATATATTTTTATTTTATCAATAAAAACAACGAGGTAATCATGGAAAACGAAAACACTATTGCCCCCGATCAATCTCAAAATGCGACTGTTAATGCAGTGACGCCTGCTGCCGATGTGGTAGCTTCTCCGGTCATTGATGCCTCGGCCAATAATACCATTAAACTCGGATATGCGTTGATGCGCGGCGAGCAAGAAATCAAAAATATTGAATTACGCAAACCGGTAACTGGTGATCTGCGTGGTCTCAGTGTCACTCAGCTGTTGAATACCGATGTTGACCAGATGTGCAAATTTTTACCACGCATCACTACGCCTGCATTGATGCCGTCTGAAATTGAAAAATTGCCTGTCGGTGACTTTCTGTCTCTGGCCATGAAATCGCTCGGTTTTTTTACGGAGAGCCAGTCCCAGACTGCATAGAGGATGTAGTGGCAGATCTGGCGGCCATATTCCACTGGTCGCCAGAACAGCTATTTGAAATGCCTGTCTCTGAATTGATGGATTGGAGAGAGCGGGCAATAAAACGATTTAATCAAATGTATGGTGGCGAG
AGTGGCGAGTAACGAACTCAAACTGAAAATTATTTTAGAAGCGTTCGACAAAGTCACTACGCCGCTCGAACGTATCCGAAAGTCTGGCGTCAAGACAAGTAAAGCGTTTCAAGATACCCAGAAAGCATTGAATGCACTCAAGCGGGCTCAATCGGCGACAGAAGCCTTCCAGCGCACACAAGATCAGATCGAGAAGACAAAAAAGCGCTTTGACCAATATAAAGAGAAGCTGAAAGAAACACAGGCCGCTATTGATGGAACCAAGAACCCTACGGATAAGCAGGTCAAAAAACTTCATGAATTGGCCGCTATCGTAGGGAATATGCCGGATAAGCTGAAAGCCCAGAATGAAAAACTGGATCAGCTTAAATCTAAACTGCAATCGGCTGGCGGTTCAGCGGATCATCTCGGACAATATCAAGACAAGCTCAAAAATCACATTGAAAAAACAAATCACGCGCTAGAGCGTGAAGCCGCTCATATTGAACGCGTCGAACACACGACAAAACGCTTGGTTGCTATCCGAGATAAAGCTGCCAAATTCGGATCGCTGGAAACAATTATGTCAGGTGCGGCTGCCGCAGCTCCGGCATTGGTTGTTGCACGGTCAGCCATCGAGCATGAAGATTCCATGGCGGAAATCAGCAAAGTGGCAAAGATGACTGCTGCACAAAAGGCAGATATTGATGCAGCTCTAAAAACGATGGCTAATAGTGGGCCAGCTACTTATGCCGAATTAGCGGAATCTGCAGCGACTGCTGCTCGTCAGTCGATCGGTATTAAACAAAATGCCGATGGTACGGCCACGATTGATACCGCTCAGTTAATGCATTTTACAGATAAGTCGAACAAAGCAGCCGTGGCACTTGGTATGGATCGTGAAGGCACTGGTGCAATGATCGGCCATATGCGGAACAATGATTATTCCGAAAAACAGATTGATACGACACTAGATCAGATGTCCGTTATCATGAATAAATTTGGTGGCCACGGCGAGTATATCAGAGAAGTTTTTGCTAAAAATCTTCCGATCGTCAAAAATGCCAATATGTCAGTTTCTGATCTCGCTGTGTTAGGCAATCTTAATGATACAGCGGGTCTTCAAGCCGATGAAGCATCGACAGGTATTAAGCATATGCTTAACGCGCTAACTGTCGGCGATAAAGGCGCTACAAAACGTCAGCAACTCTATTTCAAAGATACGGGCAAGACAGCTGGCCAGTGGCAGAAGTTGATGCAGGAGCAGGGGGGCGGCGAGACCATCCTTCAGTTTCTGGAACAGGTTAAAAAAATGCCGGTCATTAAGCGTAGCGCTTTGCTAAACGGTATTTTTGGGAAAGAGGGCGCTTCATCTGTCGCGACCATGGCGAACGAAAGCGACCATTTTAAGCAGCAAAGGGCGGCTGTTAATGATCCTAATATGCTCAAAAATGATGGCGTTGAAAACGAAAACAAAATACGAGCCGCAACGACAGCAAATCAGCTAAAAATGTTAAAAAATAACTTTGACACTTTAGCGGCTGATGTTGGTACTCAGCTTTTACCTCAAATCCAAGCGCTAGCGGCAAAACTGATTGAAATTAGCCGGAATGTAGATAATTTTGCCTTGCATCATAAAACGCTCATTAAAAATCTGGGAAAGACAGCGATGGTTGTCGGGCCTGCTATTGTTGCTATGGCAGGGCTTGGATTTGCTATCCGAACCGTAGCTAGCACTGGAATTGGATTATATAAAACTTTTCAATTTTTTAGAAAACTGAAAGACGCCAGCTTTCTTGTAAAAATGATTGAGCATTTTGGTAAGGCTGGAAAGGGAGTAAAAAAACTTTTCGGACTTTTTAAACTTTTCAAGAAGATCAATTTCGCATCGACGATTATCTCGGGCATCCAATTGATTGTTCGGGCAATCAGCGCTGCTGGAGCTTTATTAGCGGCCAATCCTGTTATCTTGGCGATTTCGGCAATCGTTATTGCTATCGCTGGTGCTGCCTATCTAATTT
ATCAGAACTGGGATTCGATCAAAAAATACACAGCAGAAGGTCTGTCTGCGATCAAACAAGCATGGAGCAGTGTGAGTCAGTTTTTTAGTGGACTCTGGAATGATTTCAAAGAAATGGGTGGTCATGTCATCCATGGACTAGTCGACGGTATTACCTCGGCAGGAAGTGCTGTCAAAGAAGCCGTTATAAATATGGGCAGCAATGTCATCGGCTGGTTCAAAGAAAAACTGGGGATACATTCTCCAAGTCGTGTTTTTCACAGCCTCGGAGGCTTCATTGTCGATGGCCTGAATAACGGTATCTCTGATAACGCGCATCACCCGATCAACCATATTCGTAATCTGGCCGAGCAAATTTCATCTGCATTCCGGCCAGATTTATCGGGATTGGCATTATCCAGTCATTCACCCCGCATTATTACGAAATCTGTTTTCGAAGAGGCTCGCGGTAGTGGCTCCAATCAGCCGAAAAACACTCGGCCTATAAGCCAAATTTTTCATTTCACAATTAATGCCGCGCCTAATCAAAGCCCGATGGATATTGGCCATTCTGTTCAGAAAATAGCTCAAGACGCTATTCGCGCGCCTCAGTATTCAGACACGCCAGATTGGGTTTACTAATGTTATTTGCTCTCGGAATGTTCACTTTTGAGTTATCGTCACTTGCTCCAGAAAATCTGGATAGAACGACGACATGGGAATTTGGCTCTAACAAACGCTTAGGCGCGCGTGCTGCTGCCCAGTTCACCGGACTAGGCGAAACAGTGACGTTATCTGGTACTGTTTATGCAGAAATAGCTAATGCTCTTGCATCAGAGCAGAATCAGCGGATTTCGCTTAAACAGCGCATCGATAAACTCATTCATCCACAAATTTCGAAATCCAAAATCTCTGATTTTCTGACTGGACGCCAAGATCAGCATAATCAAAGCGCAGAGATAGTCTCGATTGATGCACTCAGAGAAATGGCGGATCAGGGGCAGGACTGGAAATTGGTCGATGGCACCGGAAAAATATACGGCTCGTATATAGTCACATCCATAGTAGAAAAGATGAAATATCTTTGGTCAGACGGCAGACCCCGACAGATAGATTTCGAACTTCATCTACAGCGCGTCGATGACGACAATACAACAAACTCTAGTAGTCAGATTACATACGCATGAGCATTTTAACCCCAGATTTCCAGATCATTATCAATGGTAAAGATGTCAGCCCAAAAATCCGGCCAAGACTGATGCACCTCATTTTACGCGAATACGCAGGAGAGCAGGTCGATACGGCGACTATTGTATTGGATGATACAGATGACAAGTTGTCTATACCAGATATCGACGGACTGATTGAGGTTAAAATTGGCTTCAAAGGCCAGCCACTGGTGAATAAAGGTAAGTTTATTTTTGAGAATTTGCGTTATACAGAGCCACCTCGTCAGATAACGATTACGGCACATTCTGCTGCGGTGGAAAGCGTTATCAAAAAACGCCAAGATTATGCGTGGCATGATACAACACTGGGTGCGATCATAGAAACGGTAGCAGGTCGTAACAATCTGACAGCCCGTATAGATCCTATATTAGCCGCTTTAACCAGTGGCTCAGTACATCAACAAAATGAAAGCGACCTTGCCTTCATTCAACGCCTTGCCCGAGAGCATGACGCAACAGGTACGATTAAAAACAATTATTTGATCTTTGTCCCAAAAGGGACAGACAAGACTGCTACAGGCCAAATATTCCCCATATTAAATATCGAAAGAAATGCGGTTACCAATTACGAGTATATCCTTAATCATGTGGATGCAGGAGCCATCGAAGCAAAATGGCACTCTAAAAAGCAGGCACATTATCACATTGTAAAATTTGGCGATGGCGCGGCCACGAAGAGGCTGTCAAGTTGCTATTATAGCGCAGAATCTGCAATACAAGCTGCCCGTGCAGAATACAATCATCGTATGCGCGGAGGTCAGAAATTGAATATAAATGTCGCAACCGGTATGCC
TGCGTCATCTATTGGTCAAACTGTTAAAGTTACTGGATTAAAATCCAAGATAGACGATCAATTTTGGCAGATTAAAGGTATTTCTCACGAAATTTATGGTGGCCTAAAAACAAACTTTAATCTCGTGGTTTTGAATCAAGAATGA
Protein sequences of DBSCAN-SWA_1 >NZ_CP023718|2696:14768|2696_3614_+|WP_062339727.1|plate|DBSCAN-SWA MAQSQYQNANLSKIDLSQLPAPTIIEPLDYETIYRQQLINFRSDNPQFVDNDIVESDVVVKLLETTSYRELLLRQRINDAVKGVLVSFAHGSDLDQIAARFGVVRQTVTPADTLNNVSAVIETDDSLLNRILLAPSGFSVAGPADAYRFHALSADGRVFDASATSPEAGTVLVSVLSNENDGTASSEILQKVNAHLTADDIRPLTDYVIVKSAEIIPFSVDATLKFYAGPSAAIVQQNAIDRLNNYLKNSRKIGRDITIAALHAALTVEGVQNVILNSPTVDIVISDTQAGYCKSINISNGGIAE >NZ_CP023718|2696:14768|9515_10028_+|WP_012954683.1|tail|DBSCAN-SWA MSIPKTLRSMVLFHGSQTWLGEVSNVTLPKLTRKTSDWRGGGMPAGVPLDLGMDNLGELSFTAGGPMQKTLSAFAGGMLGQNLRFVGEYRQQDTGLVDIIEVSVRGRYSEIDFGEQKVGDVGSFKGTVKLSYLKIVWNGSTLIELDPLLGTEIVDGINIGSTLFKTLGLI >NZ_CP023718|2696:14768|10111_10492_+|WP_012954682.1|tail|DBSCAN-SWA MENENTIAPDQSQNATVNAVTPAADVVASPVIDASANNTIKLGYALMRGEQEIKNIELRKPVTGDLRGLSVTQLLNTDVDQMCKFLPRITTPALMPSEIEKLPVGDFLSLAMKSLGFFTESQSQTA >NZ_CP023718|2696:14768|8338_9496_+|WP_012954684.1|tail|DBSCAN-SWA MSIIHGISINETSDTSGAITVTSTAVIGIVATADDADSSIFPLDTPVLISDLTAAIGKSGKTGTLHKALIAINANVSAPVIVVRVADNKDTDALNASVIGLYDGKRTGIQALLSAEAATGLRPTIIGAPDLDTQPVVTALVSIAKSLRAMVYAKAIGDTISDAIKYRANFDARELMLIWPNVIAYDSVNAKNDVFSSAAFALGQRAAIDESTGFNKTVSNVAVSGILGLEHPISFDLTSMNSDAGLLNQSQITAVIRHNGFRFWGNRAATSDRDYAFESATRTHYTIIESIISGSEWAIDQPLTTALIQTIVDEVNNLFRTLKLKNQIIGAKCWYDTDKNSAANLAAGQLYLSYNFTPTAPNENLNITATITDTYYVNLNSSLSS >NZ_CP023718|2696:14768|7376_7913_+|WP_081094487.1|plate|DBSCAN-SWA MHNLNRKIANMIAFGVVKSLSDQGAIITIADIDTPELPWSTIANKNLSIWSPPPVGAQVIVFAPNGDLNSAAIIGQLPSDSHPLPSKSESEIVIAFGDGTLIKYDLSDKKFTGEFAGDATLKFPQGLEIIGDLSVSGKVTAETVKADTIIGSSDVSGGGISLKNHTHSTKMGPTSPPS >NZ_CP023718|2696:14768|13784_14768_+|WP_062339740.1|DBSCAN-SWA MSILTPDFQIIINGKDVSPKIRPRLMHLILREYAGEQVDTATIVLDDTDDKLSIPDIDGLIEVKIGFKGQPLVNKGKFIFENLRYTEPPRQITITAHSAAVESVIKKRQDYAWHDTTLGAIIETVAGRNNLTARIDPILAALTSGSVHQQNESDLAFIQRLAREHDATGTIKNNYLIFVPKGTDKTATGQIFPILNIERNAVTNYEYILNHVDAGAIEAKWHSKKQAHYHIVKFGDGAATKRLSSCYYSAESAIQAARAEYNHRMRGGQKLNINVATGMPASSIGQTVKVTGLKSKIDDQFWQIKGISHEIYGGLKTNFNLVVLNQE 
>NZ_CP023718|2696:14768|10500_10635_+|WP_012954681.1|tail|DBSCAN-SWA MADLAAIFHWSPEQLFEMPVSELMDWRERAIKRFNQMYGGESGE >NZ_CP023718|2696:14768|4161_6666_+|WP_062339731.1|DBSCAN-SWA MALKITITDAGRAALINADHSGTRLVSISSVGVSATAITADKSATSLADEIKRLTTISGKITASDAIHLIAKDDGSDVYTIKSFALYLDDGTLFAIYGQSSPILEKSAAAMLLLQLDIRFADIDATQIQFGNLDFINPAATDQTVGVARLTALGEAEQSTENIAVSPAELRRYAEKLARQADMIAALATKFDKTGGTVNGFVNADAFNSGAFYANKEKADFSYGGQHLALQADGNCVYYDGTAPYASISPTLANFPANTLVTGNTVWHRGNDGAGSGLDADLLDGLDSSAFLRTGGSTFTGDSYVNGWAFYCSTGGSKARINFETIDHISLWNNAGNVAAYFDLSSDNKDLYVGGNIRASALNTATFSANGTQVDFLYANRRTSFQPDGNNVWYGTDGVARSIIKDGNFWTHGSVSTDNVVNATGMFHVQSWNGLNALDMHSDDDGIVRMYAKNGNVDYQGFLAFNPQGNIDFYSPTGQCFINGQLIWNKNNDGAGSGLDADFLDGIDSTGFVKVTGSAMQGDLFMNGWAIYNQGGYGSGSTRVQFEKNNHLTLFNGNGEIAVWFDLDGSNKALNCNDAINTQKLYSPTFATDQSSVTISYGKRHVAFQPDGNNVWYGTDNAVLTSILDGNFWTRGSLSAANNINATGMLHVQSFNGQNGLDMHSDDDGIVRMYGKNANVNWQGFLAFNPQGNIDFYSPTGQCFINGQLIWNKNNDGAGSGLDADLLDGLDSSYFAPNSRIQTWGVPNGLMKRVDDHVELHLRLDKINGTITLPHTFNNIVDIRVQPFAHDGGDATFAAPVSNTNNTVTIDAWARWKGETHDVDVGGFIYVYGN >NZ_CP023718|2696:14768|10624_13243_+|WP_160327983.1|tail|DBSCAN-SWA MASNELKLKIILEAFDKVTTPLERIRKSGVKTSKAFQDTQKALNALKRAQSATEAFQRTQDQIEKTKKRFDQYKEKLKETQAAIDGTKNPTDKQVKKLHELAAIVGNMPDKLKAQNEKLDQLKSKLQSAGGSADHLGQYQDKLKNHIEKTNHALEREAAHIERVEHTTKRLVAIRDKAAKFGSLETIMSGAAAAAPALVVARSAIEHEDSMAEISKVAKMTAAQKADIDAALKTMANSGPATYAELAESAATAARQSIGIKQNADGTATIDTAQLMHFTDKSNKAAVALGMDREGTGAMIGHMRNNDYSEKQIDTTLDQMSVIMNKFGGHGEYIREVFAKNLPIVKNANMSVSDLAVLGNLNDTAGLQADEASTGIKHMLNALTVGDKGATKRQQLYFKDTGKTAGQWQKLMQEQGGGETILQFLEQVKKMPVIKRSALLNGIFGKEGASSVATMANESDHFKQQRAAVNDPNMLKNDGVENENKIRAATTANQLKMLKNNFDTLAADVGTQLLPQIQALAAKLIEISRNVDNFALHHKTLIKNLGKTAMVVGPAIVAMAGLGFAIRTVASTGIGLYKTFQFFRKLKDASFLVKMIEHFGKAGKGVKKLFGLFKLFKKINFASTIISGIQLIVRAISAAGALLAANPVILAISAIVIAIAGAAYLIYQNWDSIKKYTAEGLSAIKQAWSSVSQFFSGLWNDFKEMGGHVIHGLVDGITSAGSAVKEAVINMGSNVIGWFKEKLGIHSPSRVFHSLGGFIVDGLNNGISDNAHHPINHIRNLAEQISSAFRPDLSGLALSSHSPRIITKSVFEEARGSGSNQPKNTRPISQIFHFTINAAPNQSPMDIGHSVQKIAQDAIRAPQYSDTPDWVY 
>NZ_CP023718|2696:14768|7997_8339_+|WP_012954685.1|DBSCAN-SWA MTSFDRYTGQSISNDASICQSIGDIITTPVGSRVCRRDYGSLVPELLDQPLSARTQLLLYASTANAVSEWEPRVTLKTVNLTVDTTGKSVLAMQYSYKNQPKTGSLSLSLGNI >NZ_CP023718|2696:14768|6689_7148_+|WP_108128663.1|DBSCAN-SWA MNVFTMPTIKSVSELDSNTQSVSVIFGLVDRTGQSFDYARDVNAVFDTNNNFDQMGTMQRCYQVLAGLITKCSSGVISKKQQDDYIAQQAAAKKAAEDAEMKRKADEEAQAAALKSAAEAAAKEAADTASSDQANATAAQSDATTTSATPAA >NZ_CP023718|2696:14768|13242_13788_+|WP_062339739.1|tail|DBSCAN-SWA MLFALGMFTFELSSLAPENLDRTTTWEFGSNKRLGARAAAQFTGLGETVTLSGTVYAEIANALASEQNQRISLKQRIDKLIHPQISKSKISDFLTGRQDQHNQSAEIVSIDALREMADQGQDWKLVDGTGKIYGSYIVTSIVEKMKYLWSDGRPRQIDFELHLQRVDDDNTTNSSSQITYA >NZ_CP023718|2696:14768|3610_4162_+|WP_062339728.1|tail|DBSCAN-SWA MITSLLPPNSTDLERALEKTSASNFALPAEDVKEIWNPDTCPTDLLPYCAVNNGLNQWSDDWPENVKRQRISTAIEIARHQGTVKSIRDVVSAFGSAVEMREWFETEPRGKPRTFDLVLTVNSQSDASPSAKYVDDIVQAIVSAKPAAAQFLFTMGIDTAGQIGEIGFARPVISIHLISEDAR |
13 | Burkholderia_phage(27.27%) | tail,plate | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
DBSCAN-SWA_2 |
28793 : 36436
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP023718|28793:36436|DBSCAN-SWA CATGATAATAGGAACGCTCAATCAAAAAGGTGGGGCAGGAAAGACAACGGTAGCTGTCAATCTTGCCGCCTCTTTGCAGCAAGATCAGAAACGTGTGTTGCTGATTGATGCTGATCCGCAGGCCAGCGCGTCAGACTGGTCTATCGCCCGCAAGAGATCGCTGCCTGAGTTCGCTATCCCATGTGTGCAGCTTGCTACTGCCGATATGCACAGGCAGATTCAGAAATATAAATCTGAATATGACTTCATTATTGTTGATGGTGCACCAAGAACAACGGATCTCGCTCGTTCAGCGATAGCAGCCTCTGATATTATTGTTATCCCTGTCCAGCCAAGTGCCTTCGATATTTGGGCGGCTGATGCCATCGTGAAATTGATTGAGCAGGCCAGAAAAATAAAAACTGTGGCTGGTTGTTTCCTATTGAACCGAGTGAATAAGCGCGCTGCTATTTCTGTCAGTGTTGCTGAAGCCCTTGAAGGCCATTCACTACCCTTGCTTGAAACACAATTATCTCAGCGTGTCCGGTTCGCAGAAAGTGCATTAGACGGTCGGTCTGTGTTTGACTATCCGGCTGGACGTAATGCGGCTGCCGATGAAATTAAATCCCTAAAATCCGAAATTCTGAAATTGTATGAGGCATCTCATGGCTAAAAAATCGATCAGCTTAACCCGTCCTTCTGATGAACCAACTGCTCCTGCTGATCTTGATAAGTGGGTTGCCGATAACCGCAAAATAGATACCGAAAAAACGAAGCGTGTCGCTGCCGATATTCCGATTCAGGTTCATATGAATCTGAAAATGTTTTGCACAAAAGAGGATCGCAAGATCGGTGATGTCCTTCGTGAAATGATCGTGGAAAAGTTCGGTTAGGTTTTGAGAAACAAGCTATGTTTTTCTTAAAAAATTAGGTCTTTCATAAATGATCGAGGGTGCATAAATATGCATAATGGGTTTGTGGCCTTATTTTATCAAATAAAGCCATTCTAAGGTAGTATTCTGGGGCATTTACGACAGGTCTGCATAGGGACGAAAATGCATAAAAAAGCGACCTCTTTCCGCCGGGGGCTGGGTTAGGTTACCACGCCAAACAGGTCAGAACAACTTTTGGGGGAGTTCTGATAGATTTCTTTGAGTTTTGGGTTTTGAGACTGCTCTCGTGATGAGAGCCAGTGCATATAAAAAAGGGTGGCTAATGCCACCCTTTAACACATGGAGATAATAAGCTCGCTACTGAATAGCAGGCAGCTCGAAGTCGATGATATTTCTACCAACCATCTCATTGATCTCTGAGAACACAGACATCAGCGGTTCAATCTCTAGCTGATAAAATGCAGAAGTTGCTTTGCTAACATCGCCAAAGCCTGATCCATTAGGCGGCACCATCCCTAACAGCTGGGGTGGTACGCGGTGAGCTGCCAGAATGTCATCTCTTGTCACATTCTTGATATTAATAAACTCGTCCTTGGCACCGACCTCCGTGATCGGGATCAGTTGTACATCTCCTTTTTTTCCGTTGGGTGAGTGCAAGAACAGATTCTTGAAATTATTCCGGCCTTTTGAATTCTGTAGCTCCTGCTGGATCTGATCAGCATCTTTATCTGTGATGGTTCCTGACATAAAAAGAATGAAGCCAGCGTGACTGCCGTTTTTATAATATCGAATTCGGAACAGTGTTGCAGAATGATTCAAGGCGAGAGAATGCAGCGCCCCGAGGTATTCTGGTACTCCATAAATTTCCTGATTAATGTCAGGTTGCTTGATCTGCAAAACAGGATTGAGAAATGTCTGAAATTGGTTGAACAGCGGAGTCCAGAAGAACTCTCCATCGTTTACCCCTCTCCGAGTGTAAATAGCTGGAGCATGTTTGATCTGTAGCAAATCACCCATTCTGTTTGTGATTAGCTCACAATAGGCATTGCCAAAAA
CGAGGAAGTCTTGAACCATTGCGGCAAAGGTTTGACGCGACAGATATTTATTCGGCTTGAGCGCACGTGATAACAGGTTTCGTTTCAGGAATATCGCCGATGAATGATGAGGTGATATCTGGAATAGCCGCGCCAGTCCTTCTTGAATAATGGGTGGTTCATACCACTGACCATTCGAGTAACATTGCCCGAGATTAAGAAAGTCCATCCCACTGACCGGTTCGGCATCACTAACAGGATTTTCTGTCTCGAAAGAAAAGCAGTGCACATTTTCCTGAACAGGAGATGCTTTAGCGGTCTCATGCGTAATGACTTCATTCATTAGAAAAATTTCACTTTCATAGCTTTTGACATCGGATCAGATCCATCAAGCGGTTCATTGATCAACACATGCATAATTGCCCAGGCGAGGTCGGCATGGCCGGTGTCGCGGGCGCGGTCAGAACGAAAAGTGATAGCCTTGCCGCTATTGGTGATCGTCTTCTTGATAGAGATAAATGAGCTGGCAATATCGATGCATTCTGCATCCATCTCCAACCGTCCCAGCCGCAACAAATTCTGGGCTTTCATGACCATTCGCTGTTTTGTTTCCAGCGAATATTCGATGCGAACAATGCCATTGATTTCATTATCGATGCACTGGCAGACGGCATCGCCAATCCCGCTGCGTTCAATACCGAGATAGGTGCAATTATAGCGACTGAGCTTGTCTTTGATGAACGTGGCCTGTTGCTGGAAATCGAGACCTCTAAGTTGGCAACGTTCCAATATACGGAATTTCCCGCCTGTCTTTTCAGGTGGCGCCAGAATAACTAATGCAGCGTTATCACCTTCTTCACTATTCTGGGGGTCATATCCAGCCCAGACGGGTCTATCTCCAAAGGGGCGTTCAAGCTGAGGATTAAAGTCGTCCCAATCTGCCAGACTGTCGACCATCGCATTTTTAAGGGATTCAAAACTAAAGGCTGATAGACTATCGTCAACAAATTGGCATCCGAACAGGTTCGCAAATTCTTCAGCACTATATTCTGCTTTTAACTCTTCAATATCGAATAGATCACAGCCGCCTGCTGCTGCGTCATCAACTGTAACGACATTACGCCAAACCCGATCTGGTCCTAATGCTCCAAACCTAAGGGCTTCATGAGAGACATCGACATCGATTTGTTCCTGTTTTTTCTTTCTTTTGTTGTGACGTTCACCATTCCAATAGGCGTAAGCGGCATGAGCAATAGTTGATGGCGTTGAGAATAATGTTCTACGCCATTTCTTATGAGTAGCCATTCCAGATGCGACCTTGAAGAGTTCTTCAAAGCCGAATGTCCAGAAAAATTCATCGAAATAGAGGTTGCCATGCCGACCTTGGGCTGTCCTGAAATTCGTGCCAAGAAAATGGAACTCTGCAGCAGGCGTATTAGCTGGTAGAACCTCACTTGTGATGAGCATCGGGTCGCCAGTTAAATTAACACCGACTTGTTTGGCGAATGCTACGATATATGAGCGGAACTGGAGTGCCTGAGCGCGAGATGCCGACAAAAAGATTTCATTTCGCCCAGTTTTCAATGCATCTAAAAGAGCTTCAAAAGCGAAATAATAGGTCGCACCAATCTGGCGAGCTTTAAGGATCATCCGGTTGCGCTGATCTCGGTTGTCATACCAACGACGCTGATAATCGTATAAGCTGTTCTCGAAAATATCCCGTAATTGCTCGACCTGATCTACCGTAAAGTGATTGTTTTTTGGCTTCTTGCGAGGTTTTTCTCTAAGATCGGTTTCTTTCCCAGATTCTTCATATTTGTGGACGCGAGCAATAGAGGCGATTTGGCGCATAAGAAGGTCGATTTCTTTGAAATCCCGTCCTTCTTTCTTCTCTTTTAGAATGAGAGCTGACAGTCGATAATCAAGATTGCTCTCGATCCTTTTGATCATCGGCGTATCATCCCATTTATCCCGCGATTTCCACGAGTGAATCGTCGGCACTGGAACATCAAGATCGTCAG
AAATTTGGTTTACTGACCAGCCGCGCCAGTAGAGGCTTTTTGTCTCACGCTTCTTATCTTTCTTTCGACTGGGGTCGATTTCAATAATATCAGTCATAGGCTGATTATTCAGTATCTATGCCACTTCTAAAGTGACGAGTTTCCCATTGGGCAGGCCGATGGGAAGACCTGTGTTGAGAAATCGGTTTTTGCTCTGCAGTTTGCGAAGAAACATCGCAACGAGTAAAAAATGGCAACAAAAACCAGTAAACCGACCCGCATTTTAGTCTCGGGTGAGACTGTTGATGGGCGCGAAGTCAGTGACGCTATTATTCAAGAGATGGCGGACAGCTATGATCCAAAAATTTATGCTGCGCGTGTCAATTGTGAACATATCGATGGGTTAGTCCCTGAGAATAAAGGCGGCCAGTTCCCTGCCTATGGTACTGTCCTGTCGCTAGCCGCAGAGCCATTAGAAGTTGAAATATCTGGAAATAAAGAAATGGTTCTGGCGCTCAATGCGGTCATTGAAGTCAACGAAGACTACATGGAACTGAATGCGAAGGGTCAGAAAACTCTCTGGTCTGCCGAATTCGTTACAAATTTTCGTGATACGAAGAAAGCTTATCTTACTGGCTTGGCCATCACCGATAATCCAGCTTCGATTTGTACCGAAGTTATCAAGCTTTCAAAACAGCCTTTTCAGAAGCCTGAATTTGTATCGGAAAAGCCAGATTCTGAAGTCAAAAGCATGTTCAAAAAGATCATGGAAAAGCTGACCACTAACCCAGAAAAACCATCCAAAACCGAACCGGAAAAAACATTGGCTAAATCAGAAAAAGAGCCCGTTTTCAATCTGGATGCCGACAATGGTCTCGCGGAATTACTGTCAAAATATATGACTTCTGTCGAGTCTGATCGTGAGAAAGCGTCAGCAGAACTTTCTGAAATGAAAGAGCAATTCTCTGCGCTGAAGGGCAAGCTCGAAATTGAACCGGCTCACCAATTCAGCCGATCACCAGCAACAGGTGGGTCAGATAATCCTGCCGCTGGTTGGTAAGCGTCACCGTCCTATTTCCCGAATTTTAAACTGGAACTATTATGCAGCCCCATACGCGCAAATTATTCAATGATCTCAAAAACACGACGGCTCGGATAAATGGTCTATCCGGTGGAGCGCTGGCCGCAGCTGAGCAGTTTACCGTAGCCCCGAATGCTGCCCAGAAATTAGAACAGACTATTCAAGAGTCGAGCGAATTTCTGAGCCGGATTAACAGCAATGTTCCTGTTAATCAGCAAGTTGGCCAGACTATCGGTATCGGTACCATCTCATCTTTGGCTAGCCGGACGGATACGCGGAATGGTGAACGTAATCCGGCTAATCCGACCGATAGCAGTGAGAAATACCGTTATGAATGTGCCCAGACCAACTATGATTCTGCGCTGCCCTATCCGCTATTAGATGCATGGGCACATCGGCCTGAATTTCGGATAATCGTGAATACGGCCATTGCCGAACAGCAGGGTCGCGACCGTATCATGATTGGGTTTAATGGTATTCAGGCGGCTATCCAGACCGACCGTAGCAAAAATCCGCTTCTTCAGGATGTGAATATCGGTTGGCTCGAAAAGATCCGACAAAATGCCGAAAAACAGCATCTGCGGAAAGGCGCTGGGGTCAAAATTCGGGCAGATGGATCGGGCGATTATGTTAATCTGGATGCTTTAGTTTTTGATGCAAAAACGCTGCTTCAGCCGTGGTATCGTAACCGGACGGATCTGGTTGTTCTGATCGGAGATGGCTTGATCCATGATAAATATCTGCGTCTGATTTCGACCGCTGGTGATGATACCCAGAAGCAGGTTGCCCGGGACATTTTGTTAAGCAATGTCTCGCTTGGTGGATTGCCGACCTATCGTGTCCCGCATTTCCCCGAAAATGCATTGCTGATTACGACGTTCAAAAATCTGTCAATCTATGTTCAGAATAATTCGCGCCGTCGACAGATCGTGGATGCACC
GCGTCGTAATCAAATCGAAGACTATGAATCCGTCAATGAGGCCTTTGTTGTCGAAGATTACGGCCTCTGTTCCTTTGTCGAGCATATCGACATCGGCGATGCTGTCAGCACTATTAATGACAGTTCAAGCAACACTTCGGCGCCTACCACGCCTCCTACCACGCCTCCTACCACGCCTCCTACGGATAATAGCAATGCTGCTGGCACTGAAACCAGTAAAGCAGGCGGCTAATTGTGGCATTATCTCCGGCTCAACGATATCAGCTTGAGATAGCTGGACGGCAGTCTCCTGTCGTCCAGTTGGATAAACGAGCTCATTTCCAGAATATTGAAACGTCGCAAATCTCTATTCGTCTGATGCATGATCTCAGGCGATTGAAGTCAGTGCAATCAATAAAGAGAAAGGCTGCTATTAAACGCGAAATTTTGCCGTTTTACGCAGACTATGTTGCCGGAGTGTTAGCCCAGTCAGAACTGACCGGTCATGCTTCCAAGAATGATATTGTTGGTACTGTGTTGGTCTGGCGTCTAGATGCTGGAGACTATAAAGGCGCGCTCGAAATTGCGCAGCATATGCTCAAATATGATTTGCCCTTACCTGCCCGTTTCGAGCGTGATGTTGCGGCATTACTTGTGACAGAAGTTGCTAATGCTGCGCTCTATGCTGTTGATTTAAAAAAAGAATTTGATTTAGCTATACTTGATCAAGTTGCAGATATGACAGCTGATGCTGATATGTCTGACGATATGAGAGCCCAGCTATATAAAGCGCAAGGCCTCTTATGTCAGGATGATGGGCGAGCTTTAAAAACATTAAAAGCTGCACTCGAAATGAATCCGTCTGCAGGCGTGAAATCTTACATTAAATCTTTAGAAAAACGAGTTTCTCAATCTGAGAGTAAAGGTGCCCAACTGGCCGCGGGGGCAGATCAGCAGAACGACAAACCAGTATCTGATGCTGTCGCTCAGGCTGATCTCCACCCCCGTAAAAATAGCCGTTCTCGTAAATACAGGACAACGTCATGAGTACAGCTCAATTTGAAAGTGTTAGCGGCTCGATCGCTAATTACAAACCGGACGACCGTATTATTACGAATGACGGTTTTTTCCCGAATATCGACACAGCAGAAATTAGATCAGGTTTCCGTGTGTTAGATAGCATTACAGCCGATCGGTTGCGCCAAGCTGTTATCAGATCAATGATTTCTGTTAATCAGGATCTTTCCAGCTGGAAAGCCGCAAAGATCAAAGACGGTGCCTCTAAATTATCTGACGTGTCATCTGATCAAATTGATGGCCAGTCGATTCTTAAAATGCAGTATTTTCAAGCTGTTGGCTGTTATACGCGGGCAGATCTAATCGACAGCTATCGTGATGTTGATCGTACGCGTCATGGGCGGCACGAGACCGACGATTTAGAGCCAGCTTCAGGCGATCTGCGACGCGATGCTATCCACGCAATCCGCGATATTTTAGGCAAAACACGAACAACTGCGGTGCTGATATGAGCAGCATCCTTGCCGCACAAGAAGGCGATACGCTTGATCTTCTTCTCTGGCGCGATGCTGGCCTTGGATCTGAAAGCGTGGCTGCTGTTTTAGCTGCTAATCAGAATCTGGCGATAACCGGAGCCGTACTGCCGCAAGGCACTCAAATTATCATTCCAGACGATGTCCCAACTGCAAAAACATCTGATGTCGTCCAGCTTTGGGACTAA
Protein sequences of DBSCAN-SWA_2 >NZ_CP023718|28793:36436|32948_33758_+|WP_062339716.1|capsid|DBSCAN-SWA MATKTSKPTRILVSGETVDGREVSDAIIQEMADSYDPKIYAARVNCEHIDGLVPENKGGQFPAYGTVLSLAAEPLEVEISGNKEMVLALNAVIEVNEDYMELNAKGQKTLWSAEFVTNFRDTKKAYLTGLAITDNPASICTEVIKLSKQPFQKPEFVSEKPDSEVKSMFKKIMEKLTTNPEKPSKTEPEKTLAKSEKEPVFNLDADNGLAELLSKYMTSVESDREKASAELSEMKEQFSALKGKLEIEPAHQFSRSPATGGSDNPAAGW >NZ_CP023718|28793:36436|30021_31041_-|WP_012954703.1|portal|DBSCAN-SWA MNEVITHETAKASPVQENVHCFSFETENPVSDAEPVSGMDFLNLGQCYSNGQWYEPPIIQEGLARLFQISPHHSSAIFLKRNLLSRALKPNKYLSRQTFAAMVQDFLVFGNAYCELITNRMGDLLQIKHAPAIYTRRGVNDGEFFWTPLFNQFQTFLNPVLQIKQPDINQEIYGVPEYLGALHSLALNHSATLFRIRYYKNGSHAGFILFMSGTITDKDADQIQQELQNSKGRNNFKNLFLHSPNGKKGDVQLIPITEVGAKDEFINIKNVTRDDILAAHRVPPQLLGMVPPNGSGFGDVSKATSAFYQLEIEPLMSVFSEINEMVGRNIIDFELPAIQ >NZ_CP023718|28793:36436|28793_29444_+|WP_062339711.1|DBSCAN-SWA MIIGTLNQKGGAGKTTVAVNLAASLQQDQKRVLLIDADPQASASDWSIARKRSLPEFAIPCVQLATADMHRQIQKYKSEYDFIIVDGAPRTTDLARSAIAASDIIVIPVQPSAFDIWAADAIVKLIEQARKIKTVAGCFLLNRVNKRAAISVSVAEALEGHSLPLLETQLSQRVRFAESALDGRSVFDYPAGRNAAADEIKSLKSEILKLYEASHG >NZ_CP023718|28793:36436|33799_34951_+|WP_063630122.1|capsid|DBSCAN-SWA MQPHTRKLFNDLKNTTARINGLSGGALAAAEQFTVAPNAAQKLEQTIQESSEFLSRINSNVPVNQQVGQTIGIGTISSLASRTDTRNGERNPANPTDSSEKYRYECAQTNYDSALPYPLLDAWAHRPEFRIIVNTAIAEQQGRDRIMIGFNGIQAAIQTDRSKNPLLQDVNIGWLEKIRQNAEKQHLRKGAGVKIRADGSGDYVNLDALVFDAKTLLQPWYRNRTDLVVLIGDGLIHDKYLRLISTAGDDTQKQVARDILLSNVSLGGLPTYRVPHFPENALLITTFKNLSIYVQNNSRRRQIVDAPRRNQIEDYESVNEAFVVEDYGLCSFVEHIDIGDAVSTINDSSSNTSAPTTPPTTPPTTPPTDNSNAAGTETSKAGG >NZ_CP023718|28793:36436|35741_36227_+|WP_062339719.1|head|DBSCAN-SWA MSTAQFESVSGSIANYKPDDRIITNDGFFPNIDTAEIRSGFRVLDSITADRLRQAVIRSMISVNQDLSSWKAAKIKDGASKLSDVSSDQIDGQSILKMQYFQAVGCYTRADLIDSYRDVDRTRHGRHETDDLEPASGDLRRDAIHAIRDILGKTRTTAVLI >NZ_CP023718|28793:36436|29436_29664_+|WP_062339713.1|DBSCAN-SWA MAKKSISLTRPSDEPTAPADLDKWVADNRKIDTEKTKRVAADIPIQVHMNLKMFCTKEDRKIGDVLREMIVEKFG >NZ_CP023718|28793:36436|36223_36436_+|WP_012954697.1|tail|DBSCAN-SWA 
MSSILAAQEGDTLDLLLWRDAGLGSESVAAVLAANQNLAITGAVLPQGTQIIIPDDVPTAKTSDVVQLWD >NZ_CP023718|28793:36436|31040_32816_-|WP_062339714.1|DBSCAN-SWA MTDIIEIDPSRKKDKKRETKSLYWRGWSVNQISDDLDVPVPTIHSWKSRDKWDDTPMIKRIESNLDYRLSALILKEKKEGRDFKEIDLLMRQIASIARVHKYEESGKETDLREKPRKKPKNNHFTVDQVEQLRDIFENSLYDYQRRWYDNRDQRNRMILKARQIGATYYFAFEALLDALKTGRNEIFLSASRAQALQFRSYIVAFAKQVGVNLTGDPMLITSEVLPANTPAAEFHFLGTNFRTAQGRHGNLYFDEFFWTFGFEELFKVASGMATHKKWRRTLFSTPSTIAHAAYAYWNGERHNKRKKKQEQIDVDVSHEALRFGALGPDRVWRNVVTVDDAAAGGCDLFDIEELKAEYSAEEFANLFGCQFVDDSLSAFSFESLKNAMVDSLADWDDFNPQLERPFGDRPVWAGYDPQNSEEGDNAALVILAPPEKTGGKFRILERCQLRGLDFQQQATFIKDKLSRYNCTYLGIERSGIGDAVCQCIDNEINGIVRIEYSLETKQRMVMKAQNLLRLGRLEMDAECIDIASSFISIKKTITNSGKAITFRSDRARDTGHADLAWAIMHVLINEPLDGSDPMSKAMKVKFF >NZ_CP023718|28793:36436|34953_35745_+|WP_062339718.1|DBSCAN-SWA MALSPAQRYQLEIAGRQSPVVQLDKRAHFQNIETSQISIRLMHDLRRLKSVQSIKRKAAIKREILPFYADYVAGVLAQSELTGHASKNDIVGTVLVWRLDAGDYKGALEIAQHMLKYDLPLPARFERDVAALLVTEVANAALYAVDLKKEFDLAILDQVADMTADADMSDDMRAQLYKAQGLLCQDDGRALKTLKAALEMNPSAGVKSYIKSLEKRVSQSESKGAQLAAGADQQNDKPVSDAVAQADLHPRKNSRSRKYRTTS |
9 | Burkholderia_phage(37.5%) | tail,portal,head,capsid | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
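The protein FASTA headers in the prophage regions above appear to follow a pipe-delimited layout: contig, region `start:end`, gene `start_end_strand`, protein accession, an optional phage keyword, and the literal tag `DBSCAN-SWA`. The sketch below parses one such header under that assumption. The field layout is inferred only from the headers shown in this record, not from any documented format, and the names `ProteinRecord` and `parse_header` are hypothetical.

```python
from typing import NamedTuple, Optional


class ProteinRecord(NamedTuple):
    """One protein entry from a prophage region (hypothetical container)."""
    contig: str
    region_start: int
    region_end: int
    gene_start: int
    gene_end: int
    strand: str
    protein_id: str
    keywords: Optional[str]  # e.g. "tail" or "plate"; None when no keyword field


def parse_header(header: str) -> ProteinRecord:
    """Parse a header such as
    '>NZ_CP023718|2696:14768|2696_3614_+|WP_062339727.1|plate|DBSCAN-SWA'.

    Assumed layout (inferred from this record, not an official spec):
    contig | region start:end | gene start_end_strand | protein id |
    [optional keyword] | DBSCAN-SWA
    """
    fields = header.strip().lstrip(">").split("|")
    if len(fields) not in (5, 6) or fields[-1] != "DBSCAN-SWA":
        raise ValueError(f"unexpected header layout: {header!r}")

    contig, region, gene, protein_id = fields[:4]
    # The keyword field is present only for proteins flagged as phage-related.
    keywords = fields[4] if len(fields) == 6 else None

    region_start, region_end = (int(x) for x in region.split(":"))
    gene_start, gene_end, strand = gene.split("_")
    if strand not in ("+", "-"):
        raise ValueError(f"unexpected strand in header: {header!r}")

    return ProteinRecord(
        contig, region_start, region_end,
        int(gene_start), int(gene_end), strand,
        protein_id, keywords,
    )
```

Applied to the headers in a region, the `keywords` field can be used to list which proteins carry a phage-related keyword, which is what the Key_proteins column (for example `tail,portal,head,capsid`) seems to summarize.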
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|