Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
CP026101 | Paraburkholderia caribensis strain DSM 13236 chromosome 1, complete sequence | 3 crisprs | WYL,RT,DEDDh,csa3,DinG,cas3,PD-DExK,c2c9_V-U4,cas14j | 0 | 3 | 3 | 0 |
CP026103 | Paraburkholderia caribensis strain DSM 13236 chromosome 3, complete sequence | 1 crisprs | csa3,cas3 | 0 | 1 | 0 | 0 |
CP026104 | Paraburkholderia caribensis strain DSM 13236 chromosome 4, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
CP026102 | Paraburkholderia caribensis strain DSM 13236 chromosome 2, complete sequence | 4 crisprs | csa3,DinG,cas3,WYL | 3 | 2 | 0 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP026101_1 | 2302363-2302452 | Orphan |
NA
Consensus repeat of CP026101_1
|
1 spacers
spacers of CP026101_1
>1.1|2302389|38|CP026101|CRISPRCasFinder GAAAAAACCAAAACCCCGAACAGCGACGCGTGAAGCGA |
CRISPR arrays and Neighbor proteins around CP026101_1
The CRISPR arrays of CP026101_1 >merge|CP026101|1|2302363-2302452|CRISPRCasFinder GCTAACGAATCGCGGATGCCAGCGAAGAAAAAACCAAAACCCCGAACAGCGACGCGTGAAGCGAGCTAACGCATCGCGGATGCCAGCGAA >CP026101|1|1|2302363-2302452|CRISPRCasFinder GCTAACGAATCGCGGATGCCAGCGAA GAAAAAACCAAAACCCCGAACAGCGACGCGTGAAGCGA GCTAACGCATCGCGGATGCCAGCGAA
>CP026101.1|AUT53462.1|2301308_2301695_+|DUF4440-domain-containing-protein MTDDERAIRELIDTWLAASKAGETATVMSLMTDDAVFMVPGQKPFGKAAFGVAAEGQKNVDIDGKSEILELQVLGDWAFLRSQLEVTITLKDGSAPPVRHAGNTLTILRKETDGRWLVARDANLLVKG >CP026101.1|AUT52183.1|2300293_2301226_+|EamA-family-transporter MMGLSVVRPFKSMKNSTLAEPIAFDTAPVSSAPKLHVGMAALLGVLSMSCVQFGAALSAPTMAAFGAFSTTWLRLAWAALILALVVRPKLRSYSRAHWLAAGVLGVAMAGMTLCFFSAIERIPLGLAVAIDFLGPLAVATLGVRRLRALLWPLLAVAGVLLLAHDRSGWIGEPVGMLLAAGAACGWGSYIVLMKKTGALFDGLEGLSVSLIAAALVATPFGLVEHGLHIAPMQLAATAGLAVLVPLLPYALEMIALRHMPAASFGILMSVEPAIGAAAGFIVLHQPMTALQLAGTLFVVSASVGVIVTSK >CP026101.1|AUT52182.1|2299334_2300231_-|LysR-family-transcriptional-regulator MRDFDSSLLRAFVTVAETGAVSAAAVRLARTQAAVSMQLRRLEDDIGQRLLERSPRGVRLTDAGHRLLPYAHAILGAGEDARRALEVGDVAGTVRLGMLEDVAVGRLPRALRRFSAAHPQVALEIVVDASAALSQRLNEGALDVLVGDPAMVDATPLVTWTQPLFWVGARGYLADPQSPLPLVAFGGACLWQQQVMTALRRAGIAWRVVCTSTSLPAVQSAVEAGLGVSVLLDGNIRYDTMRVLGAADGLPEPPAADLGLFVRQAAGAQEAAVDALRTFLCEALDLDFIERTSRAPRR >CP026101.1|AUT53461.1|2298495_2299167_-|TetR/AcrR-family-transcriptional-regulator MPTTHPSRRQPKQARAEFALDSILEAAARTLESHGKAGLTTQRVADTAGFSIGAIYQYFPNKEGLVEALASRELERLTAMMKEALTQPAPFGTGLNARRMMRATAAFIGDRPRLYSILRAEWADAAPDTAIGEGMRRYFELIAGTLNRENPDLGKRIACDEARFVLFRAISGVLLATALERPHYFGTDAFEDEMVRLILGFLNYDLDPDIPRLAPEGGSFTSA >CP026101.1|AUT52181.1|2298072_2298276_-|hypothetical-protein MNHHQLESALDHLEHILARISGTDHLPLSYWRKRVDDVTAAARIPAQKNRARRLDEALSALESRAGA >CP026101.1|AUT52180.1|2297828_2297990_+|DUF1328-domain-containing-protein MLRYAAIFFVIAIIAAVFGFGGIAAGAAEIAKVLFFIFIVIFLVTLLMGVIRR >CP026101.1|AUT52179.1|2297187_2297679_-|DUF523-domain-containing-protein MKRILVSACLAGLPVRYDGSAKTLASMLLQTWRDEGRLVVVCPEVAAGFATPRRPAEIQLRRNGHDVLDGTARIRDNAGADVTALFIDGARHALQQALAHDCRYALLADGSPSCGSSFIHDGTFSRVAHEAVGVTAALLERHGIRVFAPDGIDELAASINVDG >CP026101.1|AUT52178.1|2296512_2297163_+|lysine-transporter-LysE MGLSLQQFAMVAGAHLLALLSPGPDFFLIARSALLRGWRKTGAVCFGIACANGVFIVLAVGGFAALHRHGIAFALVQAAGCAYLFYLGVLMLRHARAASIAAHVQDDSPASNTGAWPTRFAMGFASAILNPKNALFYASLFALLAARDAPFSAQIVYGVWMFAAVFGWDLLVAMGVGHPAVVARFTRHGAAIERVTGVVLLAIATSVLTMLAREWL >CP026101.1|AUT52177.1|2295608_2296424_+|AraC-family-transcriptional-regulator MNDQRYWCDPQLPFVESRRASHSRACYVPHTHETLSIGAVDSGHSNYACGGDRARLGPGSLVLIPAMRVHSCNPDAQSEWSYQMLHLDVAWASAVLRENGSADADTVLACPSINQNREAYLRYCALNRLLFSNADSGEKEAALILFVGERSWLGEARDLPPVPRIAGERLARITGLLHDAYGERLPIAQLAQMAGMSRYAFIRAFRAATGMSPHAYQLDLRINAARRLLRHGRALTAIAHELGFADQAHFQRAFKERVAMTPGAYRRAAVS >CP026101.1|AUT52176.1|2292232_2295106_-|sugar-ABC-transporter MPVGIQSIFEARQALDGVMQGDKPRMVRTTMAQPRALVRALTLVRMVALTSTAALALHGVAAHAQTQAQAQAPASTQLHAQASGGAQAPRVNLAFFYGSRVPVGELQAFDAVVIDPASGFDPAAHPLRHTVWLARTHADAAQATPDAFVAAQIETLWQRGYRGFLLDTPTAIAAVDAIRAAHPDARLVIGGDAALQAALPHAKALYAVIGPSLVRDAASGNVAAGERDARSAAAQQFTQTTGVPVVSIETCPADDRACARATAAQVLAAGVTPYVTNASLNAVGIGAIEVLPRKVLIVQDSDEDLPLDETPGVRDLATPLNYLGYDVEYANVHEPLPEGITPDRYAGVVAWLQGDETPNSGAWRAWVDARLAAHVPMVFLGQFGFDAAEDEGRALDLQAVAGPFADKIEVVSRDPMVGFEVDPKLGTRDLTGVQVGSASRSLLRVKSGEATLDQIAITPWGGFAMSPYTVVSLNGIGQERWAIQPIAFLREALRLQPMPSPSVTTENGRRLFMSHVDGDGFASRAEFPGADYSGEALFQQIFTRYKVPMTLSVIEGEVGPKGLYPQISPRLEEIARKMFALPYVAIGTHTYSHPFEWENVDAKTGERIDRGGGDTAFSLNIPNYTFNIDREVTGSIDYINSRLAPPGKKTTILQWPGNCEPPAIVVRKVYAAGVDNVNGGDTVITKSANSWTNIAPIGVLKGPGAYQVYAPNQDENVYTNDWLGPFYGFTRVLETFDMTDKPLRFKPIDIYYHMYSGTKVASLRALDQIFAAVLKQPVLPVHMTDYAHKVLDWRSFAVARTVQSEASNAKSSDWIVRGNGEVRELHWPLTSSPDLRASRGVTGYAAGPDGTYIHIADGAARVSFDPAGALSKADALPYIAEANGFVRDFKRDGKNMSFEFGSYYQPFVKLANAQTCSATVAGRAVPLQRDGAYVRFDTPALNALEAHYQPVEIRCER >CP026101.1|AUT52184.1|2302465_2305189_-|alpha-ketoglutarate-dehydrogenase MTDLSSGARPVLALTQARIDSDPQETAEWLAALDGVVQHVGLERAQYLFDRLAAHALGNGVATARANVTPYANTISVDQQPPYPGDLDTEEKLAAALRWNALAMVVRANRAYGELGGHIASYASAADLFEVGFNHFFRAASQSPGGHGGDLVYFQPHSSPGVYARAFLEGFLDETHLEHYRREIAGPGLCSYPHPWLMPDFWQFPTGSMGIGPINSIYQARFMRYLQNRGLQKTEGRKVWGFFGDGEMDEPESIGALSLAAREGLDNLVFVINCNLQRLDGPVRSNGRIIDELEAQFTGAGWNVIKVVWGSDWDGLFARDRTGALLRAFAHTVDGQFQTFSANDGAYNRERFFGQNPELAALAAHLSNDDIDRLRRGGHDVRKLHAAYDRALKHIGQPTVILAKTMKGFGMGAIGQGRMTTHQQKKLDVEQLKAFRDRFRLPLSDSDVEQLKFYKPAENSPEMQYLHARRAALGGYLPRRRKAASQTPTVPALSSWGQFALDANGKEMSTTMAIVRMLGSLLKDASLGPRVVPVVADEARTFGMANLFRQVGIYSPLGQLYEPEDMGSMLYYREDTGGQILEEGISEAGAVSSWIAAATSYSVHDLPMLPFYIYYSMFGFQRIGDLIWAAADQRARGFLIGATAGKTTLGGEGLQHQDGTSHLAASTVPNCRAYDPAFAYEVAMIVDEGMQEMIGRQRDVFYYLTVTNENYAQPSLPADSVDRVREGVLKGMYALDVASLETAQVQLLGSGAILGEVQAAARMLKDDWNIDAAVWSVTSFTELHRDGVASERAERLFGDHGTGTPYVTSALAASRGPVIAATDYVRAVPELIRAFVSRRYVTLGTDGFGRSDTRAALRAFFEVDRASIVIAALKALAEEGAVARGVVEEALARYGCHRDGRAAPWER >CP026101.1|AUT52185.1|2305347_2305887_+|Lrp/AsnC-family-transcriptional-regulator MSSEARPAARRLDRIDIAILQQLQQNARITNAELARAVNLSPTPCFNRVRALEKLGLFRQQVTLLDAGALGLRINVFIQVSLEKQVEDALRRFEQEVGERPEVMECYLMTGDADYLLRVVVPDMQSLERFIVQWLTKIPGVSNIRSSFALKQVRYKTALPLPVAGLTLPTEDDTPREWA >CP026101.1|AUT52186.1|2305913_2307548_-|AMP-dependent-synthetase MLPAADTYDGLVAAFEWRIPPQYNIGIDACDKWADGSGRLALICETRDGQATRYSFDQLKSLSDRFANALRRSGVKKGDRVGIFLAQSVETALAHLAVYKCGAIAVPLFALFGPDALQYRLSDSGAVALVTDLGGAQKIASVRASLPELRSIFCVDAEHADTALQVESFWSALDESPAAFDAEPTAADDPAVIIYTSGTTGKPKGALHAHRVLLGHLPGVEMPQAFFPNDARLMWTPADWAWIGGLFDVLLPSWHHGVAVLARRFEKFDGEAAFDLMQRHAVTHTFLPPTALKMMRAVEHPERWKLSLRAVASGGESLGAELIEWGRRALGVTINEFYGQTECNVVVSSCATLFDPCFGSIGKVVPGHRVAIVDDAGHTVPRGEPGNIAIHAPDPVMFLGYWRNESATRDKFRGDWLLTGDMGLMDADGFIRFVGRDDDVITSAGYRIGPAPIEDCLLRHPAVRMAAVVGAPDAQRTEIVTAFVVLNPGYQASDALVQTLQLHVKTHLAAHEYPRAIHFVDALPMTATGKVIRRELRERVTPPR >CP026101.1|AUT52187.1|2307632_2308163_-|MgtC/SapB-family-protein MGGWWHEVWLTMAREFSDLNDVKAITQVVMRLGLALLLGGALGFEREMAGRDAGLRTHMLVATGSALFVLVPLQAGFSQDNMSRVLQGLVSGIGFLGAGAIIKLSAQREVRGLTTAASLWLAAGVGVAAGLGREATAILSTVIALAILGGVRMIKPLVPPYTHDVPAQDESSKRVE >CP026101.1|AUT52188.1|2308336_2308609_+|hypothetical-protein MKKCILLAGIGVLAACTAVSGVDRKQNGYLSVTSRGRISLISWNSVRNAGIKHAKAYCREQNKELHTVEIHTNGVRSAGTQSVEVVFECI >CP026101.1|AUT52189.1|2308654_2309026_-|DUF2591-domain-containing-protein MRVSELEGALLDYWVARADNLPKPRVDDGFCWIEEPACDGDPAGALEAAFAPSTDWAQCGPIIERARIHLVPAAAGDRASWTGSVPAGASTIEQVGESPLIAAMRAFVASRFGDTVADEAGTH >CP026101.1|AUT52190.1|2309152_2309920_+|SGNH/GDSL-hydrolase-family-protein MTFDYRQASPCAKPSPRRSHALAAVLLGIAALQADAAAGRTRAAATADRPVIIDAQGDSTMFGYQTSDGFNKSWQTPDNPPALLQAALQARFGPRVIVQNNGVPGATLVDREKGINGYSQPYAQWAATSPAHIVIVNFALNDADNHVKEPPSAFRAHLMRFIEESQGAGRIVVLEEPNPVDYSVNKRIVPRYVAVVDEMAKHYRLALIRQYAYIGAMHDWRSLLIDGVHPTDALYRLKAERQRAVVAPIVAKLVE >CP026101.1|AUT52191.1|2309932_2310220_-|PAAR-domain-containing-protein MKRYLILNGDKTTVSGTVQAVSSTIQLEGRDVAHEGDNVICPACNTTGKIRCDGPREVMTAPDGRHAALSDDLCICKCEPPPKLVASQQTFSVGE >CP026101.1|AUT53463.1|2310478_2311834_-|malate-permease MKSSTQAALSATSSGRTQAPHAVVEWWWRVFDLRIGSLPLPVYVLMLGVLGAMAAKGKLAADLPTGIALVAVGGFTCAELAKRIPWIRHIGATSIFAAFIPSMLVYYKLMPEPVVKAVTTFTKTSNFLYLFIAAIIVGSVLSMDRQMLIKGFVRIFVPVAAGSVAAALVGTAVGTALGLDARHALFMVVVPIMAGGVGEGALPLSAGYAQIMGVEQGPLFAQVLSAVMLGNIAAICCAGLLSYLGTRRPEWTGNGRLTRAGDSDDDIAQRPASFEFDVGSVAAAGSTAIAFYLLGVLSHQLFGWPAPVVMLVLVVAAQLFQLVSPRVRGGARFMYGFFSTAVTYPLMFAISVAMTPWGEIVTAFHWVNIVTAVSTVLTLTVTGFFVGRLVGMYPVEAAIVNATHSGLGGTGDVAILTAANRMELMPFAQIATRIGGALTVMAALGVFAYWK >CP026101.1|AUT52192.1|2311872_2313237_-|serine--pyruvate-aminotransferase MKSTVAPTSVRKTRFTEATIGLFERTIPDPFVLAILITAIVAVLSAMFAPHASLGKLVGGWYKGFFDILTFAFQITLVLVTGHAFAHAPIVQRVFKSLVSVARTPVQAATLTFVLVAVASFCNWGLGLVVSALLAREVAKRMRVDFAWIVAAGFSGWVVWASGISSSIALAQSTPGSAMNVVQKITGEVLPFSATVFTGFNLVPTIAMLLAMPFVLAWLKPRDEDAVLLDTQKHPDAAPREKPTGKLSFARWIEYSWLGSAFIGATGIALLVLAQSEHIAFSGVNAVIFVMFIAGVILHGYPLAYADAVKNAARQTGSMMLQYPLYGGIMGMMDATGLPNVISHFFIAISNAHTLPFWSYVCSLIVTFFIPSGGGHWAVQGPFVVPAAVALHASVPATTMAVAMGEQVSNMMQPFWAAPVVAMAGIGVQRVLGFTVMTFIVGALVYGAALLLLV |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP026101_2 | 3300489-3300572 | Orphan |
NA
Consensus repeat of CP026101_2
|
1 spacers
spacers of CP026101_2
>2.1|3300512|38|CP026101|CRISPRCasFinder GAATCGCGGATGCCAGCGAAAAAACCAAAACACCGAAC |
CRISPR arrays and Neighbor proteins around CP026101_2
The CRISPR arrays of CP026101_2 >merge|CP026101|2|3300489-3300572|CRISPRCasFinder GGCGACGCTTGAAGCGAGCTAACGAATCGCGGATGCCAGCGAAAAAACCAAAACACCGAACGGCGACGCCTGAAGCGAGCTAAC >CP026101|2|2|3300489-3300572|CRISPRCasFinder GGCGACGCTTGAAGCGAGCTAAC GAATCGCGGATGCCAGCGAAAAAACCAAAACACCGAAC GGCGACGCCTGAAGCGAGCTAAC
>CP026101.1|AUT52963.1|3299209_3299794_+|hypothetical-protein MIVTSFAHVLRFLLAACCLMSLGGCAWTLITAADATGSVIQAGYAIASNYSSPTFINGRRAKISAVCIEVNQLVSVGDFVPALQLALDKRGIRSDVYNPGTSPAGCEARLVYNASVDYGRRSFSDEPTQYLSMIDLTLIQHGRILVTARYQTGGLGVDRYSSASVKLDGLIGKMVVDQIAELQPESIQTSSIGK >CP026101.1|AUT52962.1|3298658_3299156_+|hypothetical-protein MSFDRRLPSLSRLSLRLTQAGRDAALTATAALLFATAANAAQITPVKQAEPADVCPALSHIVSSADFKKLRDEPAATLPGVDPIDDCRANAHSYDCRWRAHWEADGFVNDPLEEIGADIAACFPNVVHDINTPTRQHFIVKTADRRVSVTASVQGQNELRLRIAR >CP026101.1|AUT52961.1|3297688_3298462_+|KR-domain-containing-protein MSDASFESVVLVTGAGSGIGAALARRIAAPRVALMLHARGADDASRARLDQVAATCAASGATCATVLADLAERGASEHAVHQTLARFGALDQIVANAGHAQRQTIGTLDFDALAESFAAMPAAFAALVKRAAPALETSKRGRVVAVSSFVAHRYRADSAFAGTAAAKAALESLAKTAAAELAQHGVTVNCVAPGYTRKDRGPSAENAPAWARAAEATPLGQVAEPDDVAALIAFLLSDSARHITGQVIHVDGGLTLG >CP026101.1|AUT53548.1|3295958_3297692_+|ABC-transporter-ATP-binding-protein/permease MIDNSKNPADITAWGLIKPYWVSEDRWKARGLLALVIAMNMTMVAANVWFNSWQRTFFDAIQQYNYPVFKYSLLQFTVIALALILLGSYRTYFRQMLEFRWRQWLTNRYLNDWLGDRAYYRIERDNLADNPDQRVSADLQGLASASLNLSLGLLSTTVTLFSFIVILWNLSGAFAFHMFGTEFSIPGYMVWAALIYAAVGSWVTHKVNHPLVSINYQQQRVEADFRFSLIRIRENADQIALYQGERSEEQQLKGVFSHIRENWRLIMRFTRRFNIVVISYSQLAIVFPYIAAAPKYFSKSISFGMYQQVTGAFGTVSDSFSWFINNYDSLAEWRATVNRLREFHRVMRSQHLHESVVEGTAHGGINVHVTDTDSIEVTNLRLQRPNGEPMANVGSFTIAPKTRWLVRGPSGAGKSTLMRTLAGLWPFGEGTIEKPADAKLLFIPQRSYLPIGTLKAALCYPSEASAYSDEACRDVLTVCRLPELAERLGESAHWERSLSPGEQQRLAAARALLQQPDFLFLDEATSALDPENESIIYNALIERLPNAAIVSVAHRKTLEAFHDHTLFIERAVEREAA >CP026101.1|AUT52960.1|3294160_3295300_-|FAD-dependent-oxidoreductase MRTSAQPDFAVIGGGLCGRLVAWQLAGEGHRVALYERGDAAGSQAAAWVAAAMLAPLAEAASAELLITRLGAASLETWPTLLAQLPEPVFFQRNGSLIVWHHSDRAEAPLFERRLRANAPAELLDGGLVALAGAQVGAAEPALAGRFTQGWLLPHEGQLDNRQVLSALAAGLAQRGVETHWNTSVDDGALPPAKVTIDCRGLGAKPVMPTLRGIRGEVARVHAPGIDLTRPVRLLHPRYPLYIAPKQDDLYVIGATEVEGEDMSPVSVRSALELLSAAFSVHPGFGEARILELNSQCRPTLPDHRPVLLWDGASTLRVNGLYRHGYMIVPEVAGEAVRLASALLDGRVADSDGFADWQRNARWSELFRLDREPAVTLNV >CP026101.1|AUT52959.1|3293952_3294150_-|thiamine-biosynthesis-protein-ThiS MDIQINQKPLSLPEGATVADALSAYGARPPFAVALNGNFVARGQHAARALQAGDKLDVVHPVAGG >CP026101.1|AUT52958.1|3293068_3293893_-|thiazole-synthase MNSHANAPADALTLYGETFQSRVLLGTSRYPSLQSLSDSIAASKPGMVTVALRRQMSEGGAEAGFFDLLKRHGVPLLPNTAGCQTVSEAVTTAHMAREVFDTDWIKLELIGDDYTLQPDPVGLIEAAAQLVKDGFKVLPYCTEDLVIGRRLLDAGCEALMPWGAPIGTGKGVVNPYGLRVLRERLPDVPLIVDAGLGVPSHACQVMEWGFDGVLLNTAVSQATHPETMARAFAMGVEAGREAYLAGPMAERETAHASTPVVGMPFWHQDGSAAA >CP026101.1|AUT52957.1|3291950_3293072_-|thiamine-phosphate-synthase MTETLKLAGRDLFWPPADELTEAAERIRAHLGDWPPTHVDWRICLTPPDDANGGDLIVFTDLKQSSAQHVEQIARWQTQGAGVIEAAEGRAVLHLGGVRYQLEGHLAEDWIAALAAFLDCGFDPHDALVLALAWRDGDETRSDDAWPCDMSHFPRVAGLPDAPAQAFAACPDALGLYAVLPTAEWVERVAGFGVKTLQLRRKTAEPEELKREIARSVAAGREHGACVFINDHWQAAIDAGAYGVHLGQEDVHTADLHALSKAGVRLGLSTHGYYEMLTALHFRPSYIALGAVFPTTTKVMPTAPQGLARLARYVKLLDGVVPLVAIGGISGDVLPQVLATGVKSAAVVRAITEAADPASAAATLQKAFLQQKV >CP026101.1|AUT52956.1|3291042_3291864_-|ABC-transporter-ATP-binding-protein MPSSSETLLELRDVDFGYGERLVLSNLNLRFKRGQVVAVMGGSGCGKTTVLRLIGGLVRAQRGQVMFHGQDIGAQTRDGLYALRRKMGMLFQFGALFTDMSVFENVAFALREHTDLPEELIRDLVLMKLNAVGLRGARDLAPSEISGGMARRVALARAIALDPELMMYDEPFAGLDPISLGITANLIRALNTALGATSILVTHDVPESFAIADYVYFLANGGVLAEGTPAELRASTDPTVRQFIDGTPDGPFKFHYPSNTPLAADFGIGGGRA >CP026101.1|AUT52955.1|3290278_3291046_-|ABC-transporter-permease MISFIGRSVICGLGQTGYATRMFLRLVLEFFPLLRRPRLVTKQIHFVGNYSLVIIAVSGLFVGFVLGLQGYYTLNRYGSEQALGLLVALSLVRELGPVVTALLFAGRAGTSLTAEIGLMKAGEQLTAMEMMAVDPLKVVVAPRMWAGIISMPILAAIFSAVGVLGGYVVGVLMIGVDAGAFWSQMQGGVDAYRDVGNGVIKSIVFGFAVTFIALYQGYEAKPTPEGVSRATTKTVVYASLAVLGLDFLLTALMFS >CP026101.1|AUT52964.1|3300777_3301524_+|DNA-binding-response-regulator MNSSIMVVDDDPVVRDIVRDYLQGRGFTVSVLENGMALQQALQHERPALVVLDIMMPELDGISALRALRLAGDNIPVILLTARADVIDRVIGLELGADDYLGKPFDPSELVARIRSVLRRRESAAPSAPENRAPYRFGRFEVNFPARELRRDGERIALRSSEFAMLKVFVSHAMTVLTRAQLLEKLHGGTDTHRNRSLDVSIWRLRRLIEVDPSEPRYVQTVWGKGYVFVPDGEIGAAERYDAPVANL >CP026101.1|AUT52965.1|3301624_3302311_-|flagellar-biosynthesis-protein-FlgH MNAMHMPRAAAVLTSAAVFYALAGCGSTKDSIVDTPMLPPLSTAPLNVNTQGAIFQAGTGILLYETPRAQHIGDVLTIRLSESYTGSNSTNAQASRASDITAEAADKSTGTAARLARLFNIGSASTTFKGQGSIADTSGMTGTLAVTVIGTMPTGNLVVSGEKLISMGGNRDRLRLSGIVNPKDIESGNYVASSKVANARIEQAGQGMLADSTTLGWLQRMFMSVLTF >CP026101.1|AUT52966.1|3302365_3303091_-|hypothetical-protein MQLTSIGRALTACAVAMCCALMPFAASAQNMLPPQQAAALRMSAIGAKKRAADKPFAFRGIPLGITLDEFRAVSRVRATPLGSVPVCETDNVAGSLGMRLKTSQSLTIACQWAHRVADGWEVSRAVVDGAPADEHVLRFVRVDGQSGFRLYEISFVIDEITADDLRDAFEDRYGAPRTATQVSSPTAGQLPVYIWENDVSSITLCLLPATHNATLIYLLKDPDAYMKSVVRQWQASSPDAG >CP026101.1|AUT53549.1|3303121_3303331_-|hypothetical-protein MPRSRFYTHVRRAATRLAMWSAVGAAVLPLDGCAVAALPCRLTSATLKILPVVGHVAATPFDACAAAID >CP026101.1|AUT52967.1|3303347_3303551_-|hypothetical-protein MTVQNNHFTTQLQSIARPSAAEKPSARGASTTAAATATGDKTGDKAGAAQTSGSPVGMVGNHVNTTA >CP026101.1|AUT52968.1|3303958_3304696_+|two-component-system-response-regulator-OmpR MNPQVLIVDDDPVVRDLLCRFLQSNGYDASVLHDGTHLQRRLERERPSVVVLDIMMPNTDGLRALTALRAAGDDIPVIFVTARGTVADRIIGLSLGADDYLTKPFDPRELLARIQTVLRRRGPATTSAPEARKRYRFGPFELDFATRTLSRDDTRVTLRDSEFALLKIFVNNPYKVLSRVLIHDLVHRDDLPFRDRSLDVPIWRLRRVIENDPSNPCYVQTVRGKGYVFVPDADPNGAPFAADPA >CP026101.1|AUT52969.1|3304637_3306017_+|HAMP-domain-containing-protein MCSSPTRTPTARPSPPIPRDARVTRIRNPLNTLFGRMALLSSAVLFAIQAGWFVLVVMQPPHHEVDGYARGILLALQAANGEPVNGADVAPALRVHLVPTWNMPATVHLEPPTRRPFVELTRHLRASLPVGTEIAVDDTHMPRLWVRFPRKSMWVVIPVDVPPRPRFVIESISMLLAALLLSLLAVWQMQRPLTRVAHAARAFGAGSRPEPVSEQGPRELRDLIGSFNDMMRRLNEAGDDQAVMLAGVAHDLKAPLTRLKLRASVLADENERAGLIRDVDSLTNIVQQFLEFAGQSAESGPMTEVDAFLREQFSSTDGNEGDEADSGDEAEAPLFRLDLQAGSRFTLPRTLLDRLVTNLVDNAFEHGAPPVEIATSRDEQQWLIDVRDHGPGIPEDRIAAAMKPFVRLDAARGGEGHCGLGLAIVVRLAHHRGGKCTVENHPEGGLHVRVALPVAMPEA >CP026101.1|AUT52970.1|3306226_3307291_-|protein-tyrosine-phosphatase MRLITLSIGSSNLHVTRVRLSHPRVALAPHLVDTTMTSSAKADISLAAAHDLYALPTVPAPSHARAQRASRRRFLKSTAGALLLSGMGSTLLTACGGNNAGSDQAPTPRLASLENFRDVGGTAAGYPTVDGRVVRRNAFYRSNALTQSAADAAVLDSLGIAAVCDLRTPGEIERASDALPANAAYVKINVTGREDVITPMLDNEASAVSSMERAQRLYVTDAVQRAAFGSLLSQLASTAGPQLIHSSAGKDRAGWAAALLLSIANVPFDIIMQDYLLSNTYMANAISARVEARRQQSGDLAASAEKPLASVQSSFLQASFDQVQSSYGTMSGYLTRGLGLTQSTVDTLRERLVL >CP026101.1|AUT52971.1|3307546_3308818_+|MFS-transporter MASFQWFTELSTRERRTLYAGFGGYAVDAFDFMIYSFLIPTLIATWGMSKSEAGMIATSSLISSAIGGWLAGILADRYGRIRVLQWTIATFALFTCLSGFTHNFWQLLTTRTLQGIGFGGEWSVVTIMMAETIRSPQHRAKAVGTVQSSWSFGWGAAAILYWAFFALLPEEYAWRACFWIGIVPALWIIYIRRNVSDPDIYLATRRARDNGFDTSHFLQIFSAAHLKTTILGSALCSGMLGGYYAITTWLPTYLKTVRHLSVFNTSGYLVVLIVGSFIGYIVGAILSDRIGRRASFVLFAIGSFVLGMIYTMLPITDGAMLLLGFPLGIVVQGIFAGVGAYLSELYPNAIRGSGQGFCYNLGRGVGSFFPILVGTLSQTMTLVKAIGIVAGSGYLLVVVAALALPETKGKSLAAESAESAEHV >CP026101.1|AUT52972.1|3308826_3310095_+|D-amino-acid-dehydrogenase MRTIVLGGGIIGVATAFYLRERGCDVTVIERESDVALATSFGNAGVIAPGYVTPWAAPGMPFKILKYLFKPASPLIFRPTFDLAQWRWIARWLRECDLARFRVNKQRMQRIAYYSRECLREFRGHHPFEYGRSQGYLQLFRTAFDVELAQPALAVLRDAGISHREVSAAECAEIEPGLRWARQAPLSGLYLPDDEAGDCARFTRELRAICEAHGVRFRFDTRVTALDVRGRSVHGVHVESAAGSETLVANAVVVAAGVDSADLLAPLGVKVPLYPVKGYSATLQIVDDEKSPRAALMDESLKTAITRFGPNLRVAGTAELGNRQTTLREQALQTLMKVLDDWFPHATAPSSAQFWVGRRPMTPDGAPLLGPSGIDGLWINLGHGSTGWAMSLGSGRVVADLITQREPEIDLDGLTLGRYRGS |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP026101_3 | 3310901-3310984 | Orphan |
NA
Consensus repeat of CP026101_3
|
1 spacers
spacers of CP026101_3
>3.1|3310924|38|CP026101|CRISPRCasFinder GTTTGGTGGTTTGGCCTTTGCGCTGGCATCCGCGAATT |
CRISPR arrays and Neighbor proteins around CP026101_3
The CRISPR arrays of CP026101_3 >merge|CP026101|3|3310901-3310984|CRISPRCasFinder GTTAGCTCGCTTCACGCGTCGCCGTTTGGTGGTTTGGCCTTTGCGCTGGCATCCGCGAATTGTTAGCTCGCTTCACGCGTCGCC >CP026101|3|3|3310901-3310984|CRISPRCasFinder GTTAGCTCGCTTCACGCGTCGCC GTTTGGTGGTTTGGCCTTTGCGCTGGCATCCGCGAATT GTTAGCTCGCTTCACGCGTCGCC
>CP026101.1|AUT52973.1|3310133_3310751_-|RNA-2',3'-cyclic-phosphodiesterase MQRTDPHPDQKESAGSASDPNLEATTNPHDYQRCFIALVPDTATRDALSSIDIPTTARRVPYEQLHLTVTFIGVLPQEKAAPLIESLTHETVSLKRTPITKIEHWPRASHPRLTVATLAMSDEFVALDWRVRSSMIALGLPVDARTFRPHVTLARYRQDAAAVGPAMDLQHELIACFDSLTLYSSTLARTGARYRSLASVPVVYG >CP026101.1|AUT52972.1|3308826_3310095_+|D-amino-acid-dehydrogenase MRTIVLGGGIIGVATAFYLRERGCDVTVIERESDVALATSFGNAGVIAPGYVTPWAAPGMPFKILKYLFKPASPLIFRPTFDLAQWRWIARWLRECDLARFRVNKQRMQRIAYYSRECLREFRGHHPFEYGRSQGYLQLFRTAFDVELAQPALAVLRDAGISHREVSAAECAEIEPGLRWARQAPLSGLYLPDDEAGDCARFTRELRAICEAHGVRFRFDTRVTALDVRGRSVHGVHVESAAGSETLVANAVVVAAGVDSADLLAPLGVKVPLYPVKGYSATLQIVDDEKSPRAALMDESLKTAITRFGPNLRVAGTAELGNRQTTLREQALQTLMKVLDDWFPHATAPSSAQFWVGRRPMTPDGAPLLGPSGIDGLWINLGHGSTGWAMSLGSGRVVADLITQREPEIDLDGLTLGRYRGS >CP026101.1|AUT52971.1|3307546_3308818_+|MFS-transporter MASFQWFTELSTRERRTLYAGFGGYAVDAFDFMIYSFLIPTLIATWGMSKSEAGMIATSSLISSAIGGWLAGILADRYGRIRVLQWTIATFALFTCLSGFTHNFWQLLTTRTLQGIGFGGEWSVVTIMMAETIRSPQHRAKAVGTVQSSWSFGWGAAAILYWAFFALLPEEYAWRACFWIGIVPALWIIYIRRNVSDPDIYLATRRARDNGFDTSHFLQIFSAAHLKTTILGSALCSGMLGGYYAITTWLPTYLKTVRHLSVFNTSGYLVVLIVGSFIGYIVGAILSDRIGRRASFVLFAIGSFVLGMIYTMLPITDGAMLLLGFPLGIVVQGIFAGVGAYLSELYPNAIRGSGQGFCYNLGRGVGSFFPILVGTLSQTMTLVKAIGIVAGSGYLLVVVAALALPETKGKSLAAESAESAEHV >CP026101.1|AUT52970.1|3306226_3307291_-|protein-tyrosine-phosphatase MRLITLSIGSSNLHVTRVRLSHPRVALAPHLVDTTMTSSAKADISLAAAHDLYALPTVPAPSHARAQRASRRRFLKSTAGALLLSGMGSTLLTACGGNNAGSDQAPTPRLASLENFRDVGGTAAGYPTVDGRVVRRNAFYRSNALTQSAADAAVLDSLGIAAVCDLRTPGEIERASDALPANAAYVKINVTGREDVITPMLDNEASAVSSMERAQRLYVTDAVQRAAFGSLLSQLASTAGPQLIHSSAGKDRAGWAAALLLSIANVPFDIIMQDYLLSNTYMANAISARVEARRQQSGDLAASAEKPLASVQSSFLQASFDQVQSSYGTMSGYLTRGLGLTQSTVDTLRERLVL >CP026101.1|AUT52969.1|3304637_3306017_+|HAMP-domain-containing-protein MCSSPTRTPTARPSPPIPRDARVTRIRNPLNTLFGRMALLSSAVLFAIQAGWFVLVVMQPPHHEVDGYARGILLALQAANGEPVNGADVAPALRVHLVPTWNMPATVHLEPPTRRPFVELTRHLRASLPVGTEIAVDDTHMPRLWVRFPRKSMWVVIPVDVPPRPRFVIESISMLLAALLLSLLAVWQMQRPLTRVAHAARAFGAGSRPEPVSEQGPRELRDLIGSFNDMMRRLNEAGDDQAVMLAGVAHDLKAPLTRLKLRASVLADENERAGLIRDVDSLTNIVQQFLEFAGQSAESGPMTEVDAFLREQFSSTDGNEGDEADSGDEAEAPLFRLDLQAGSRFTLPRTLLDRLVTNLVDNAFEHGAPPVEIATSRDEQQWLIDVRDHGPGIPEDRIAAAMKPFVRLDAARGGEGHCGLGLAIVVRLAHHRGGKCTVENHPEGGLHVRVALPVAMPEA >CP026101.1|AUT52968.1|3303958_3304696_+|two-component-system-response-regulator-OmpR MNPQVLIVDDDPVVRDLLCRFLQSNGYDASVLHDGTHLQRRLERERPSVVVLDIMMPNTDGLRALTALRAAGDDIPVIFVTARGTVADRIIGLSLGADDYLTKPFDPRELLARIQTVLRRRGPATTSAPEARKRYRFGPFELDFATRTLSRDDTRVTLRDSEFALLKIFVNNPYKVLSRVLIHDLVHRDDLPFRDRSLDVPIWRLRRVIENDPSNPCYVQTVRGKGYVFVPDADPNGAPFAADPA >CP026101.1|AUT52967.1|3303347_3303551_-|hypothetical-protein MTVQNNHFTTQLQSIARPSAAEKPSARGASTTAAATATGDKTGDKAGAAQTSGSPVGMVGNHVNTTA >CP026101.1|AUT53549.1|3303121_3303331_-|hypothetical-protein MPRSRFYTHVRRAATRLAMWSAVGAAVLPLDGCAVAALPCRLTSATLKILPVVGHVAATPFDACAAAID >CP026101.1|AUT52966.1|3302365_3303091_-|hypothetical-protein MQLTSIGRALTACAVAMCCALMPFAASAQNMLPPQQAAALRMSAIGAKKRAADKPFAFRGIPLGITLDEFRAVSRVRATPLGSVPVCETDNVAGSLGMRLKTSQSLTIACQWAHRVADGWEVSRAVVDGAPADEHVLRFVRVDGQSGFRLYEISFVIDEITADDLRDAFEDRYGAPRTATQVSSPTAGQLPVYIWENDVSSITLCLLPATHNATLIYLLKDPDAYMKSVVRQWQASSPDAG >CP026101.1|AUT52965.1|3301624_3302311_-|flagellar-biosynthesis-protein-FlgH MNAMHMPRAAAVLTSAAVFYALAGCGSTKDSIVDTPMLPPLSTAPLNVNTQGAIFQAGTGILLYETPRAQHIGDVLTIRLSESYTGSNSTNAQASRASDITAEAADKSTGTAARLARLFNIGSASTTFKGQGSIADTSGMTGTLAVTVIGTMPTGNLVVSGEKLISMGGNRDRLRLSGIVNPKDIESGNYVASSKVANARIEQAGQGMLADSTTLGWLQRMFMSVLTF >CP026101.1|AUT52974.1|3311025_3312465_-|hypothetical-protein MCIVRGMTKMPDIKELTPEQKDALIIDLVRRLNELEAKLEKNSHNSSKPPSSDGPKRKPKSLRNTSDARPGAQPGHKGKTLKRVAQADHIEIHPVARVCDKCGNRIAAASVAVLPEGRQVIDLPPTRFEVTEHRVQIAQCRCCGKQHSGAFPKGVSQAVQYGPQIRAAAVYLTQYQQLPVARTAQALEDLFGLHVSTGTVQHSIDQAAQLLAPCVDQIQQALRGQPVVHFDESCMRVGRESHWLHVASTHALSWYGAHSKRGSQALDSFGILPGFTGVAVHDGWRPYAGYECEHALCNAHHLRELVFVLESTQQPWAQQMIDLLRQAKREVELSRASGNNMLSPARQRYYTRRSRALIARARKLNPQQAREPLRQERRGRIRQSFTCNLLTRLHKYADEVWRFIADHRVPFDNNQAERDIRMPKLKQKISGCFRSESGMEAFCTIRSYLATLRKQNRSLINALALGFAGFVVSPLVTAE >CP026101.1|AUT52975.1|3313146_3314613_-|glutamate-synthase-subunit-beta MGKATGFLEFERRHEAYEAPLTRVKHYKEFVSALTDDEAKIQGARCMDCGIPFCNNGCPVNNIIPDFNDLVFRQDWKNAIDVLHSTNNFPEFTGRICPAPCEAACTLGINDDPVGIKSIEHAIIDKAWAEGWVAPQPPKHKTGKKVAVVGSGPAGLAAAQQLARVGHDVTVFEKNDRVGGLLRYGIPDFKLEKWLIDRRMRQMEAEGVTFRANVFVGKDPLPAHIGNTAKETITPEELKDQFDAVILTGGSETPRDLPVPGRELAGIHYAMEFLPQQNKVNAGDKVADQLLAKGKHVVVIGGGDTGSDCVGTSNRHGAKGVTQFELLPQPPEEENKPLVWPYWPVKLRTSSSHEEGCERDWAVATKRFEGKNGKVEKLIAARVEWKDGKMVEVPDSQFEMKADLVLLAMGFTQPVSPVLEAFGVDKDARGNVRASTEGDKAYYTSVEKVFTAGDMRRGQSLVVWAIREGRQCARSVDAYLMGHSELPR >CP026101.1|AUT52976.1|3314714_3319418_-|glutamate-synthase-subunit-alpha MNDHQQPLSTVPAAQGLYDPANEHDACGVGFVAHIKGKKSHEIIQQGLKILENLDHRGAVGADPLMGDGAGILIQIPDSFYREEMAKQGVTLPPEGEYGVGMIFLPKEHASRLACEQELERTVKAEGQVVLGWRDVPADHTMPISPTVKASEPLIRQIFIGRGKDIMVTDALERKLYVIRKTASHRIQALKLKHGKEYFVPSMSARTVVYKGLLLAGQVGVYYRDLQDERVVSALALVHQRFSTNTFPAWELAHPYRMIAHNGEINTVKGNVNWLNARTGAIASHVLGDDLPKLWPLIYPGQSDTASFDNCLELLVMAGYPLVHAVMMMIPEAWEQHTLMDDNRRAFYEYHAAMMEPWDGPAAIAFTDGRQIGATLDRNGLRPARYIVTDDDLVIMASEAGTLPIPESKIVKKWRLQPGKMFLIDMEHGRIIDDKELKDNLANAKPYKSWIDAVRIKLDEIEPNAEDVVTERREAAALLDRQQAFGYTQEDLKFLMAPMAQAGEEAVGSMGNDSPLAVMSNKNKTLYHYFKQLFAQVTNPPIDPIRENMVMSLVSFVGPKPNLLDTNNINPPMRLEVSQPVLDFKDIAKIRAIDQYTGGKFSSYELNICYPVSWGKEGIEARLASLCAEAVDAVKSGYNMLIVSDRKTDRDNVAIPALLATSAIHSHLVQQGLRTSTGLVVETGSARETHHFALLAGFGAEAVHPYLAMETLAQMAAGMKGDLSAEKAVYNFTKAIGKGLHKVMSKMGISTYMSYTGAQIFEAVGLAEDLVNKYFKGTASKVGGIGLFEVAEEAIRLHRDAFGDNPVLANMLDAGGEYAYRVRGEDHMWTPDAIAKLQHSARSNSYQTYKEYAHLINDQTKRHMTFRGLFEFKVDPSKAIPLDEVESAKEIVKRFATGAMSLGSISTEAHATLAVAMNRIGGKSNTGEGGEDENRYRNELRGIPIKNGDTMKSILGDEVVTDIPLKEGDSLRSKIKQVASGRFGVTAEYLASADQIQIKMAQGAKPGEGGQLPGHKVSEYIGKLRYSVPGVGLISPPPHHDIYSIEDLAQLIHDLKNANPAASISVKLVSEVGVGTVAAGVAKAKADHVVIAGHDGGTGASPLSSVKHAGTPWELGLAETQQTLVLNQLRGRIRVQADGQMKTGRDVVIGALLGADEFGFATAPLVVEGCIMMRKCHLNTCPVGVATQDPVLRAKFQGQPEHVVNFFFFIAEEVREIMAQLGVRKFDDLIGHSEYLDMKKGIEHWKAKGLDFSRVFYQPDVPASVARMHVDSQDHGLDRALDHTLIEKAKAAIEKGEHVSFIQPVRNVNRTVGAMLSGTIAKKYGHDGLPDDAIHIQLKGTAGQSFGAFLAKGITLDLVGDGNDYVGKGLSGGRIIIRPTNDFRGKSEENIICGNTVMYGAIEGESFFRGVAGERFCVRNSGATAVVEGTGDHGCEYMTGGTVVVLGETGRNFAAGMSGGLAYVYDVDGTFAAKCNKSMVALEPVLQQAEQERTVDKALWHMGQTDEALLKGLIERHFQFTGSPRAKALLENWDASRRQFVKVFPTEYKRALGEMGAKKAAKEVLAA >CP026101.1|AUT52977.1|3319728_3320436_-|transposase MARLARLYVPDQPQHVILRGLDQQPAFVDDQDYELFIDCLKAASRDHHLSVHAYALMPGAVQLLVTPTDESSLPKAMQAVGRRYVAHFNRRYSRRGTLWEGRYRATVIEGEKYFLLASRVVEMSPVRNQLVSTPEDYRWSSYRHHIGLTLDSLITDHRLYWSLGNTPFERQRAYRELCEQPLDEREASQLQQATLKGWVLGSDSYREWAARAANRRVSPLPRGRPRKVRETPQTQ >CP026101.1|AUT52978.1|3320623_3321346_+|hypothetical-protein MKLKQALGVAALACITTTAHAQSAGSFFVTTGWFHLAPQSSSDPLRETNVNGTPVNITVPNTGATLGSGDTIGFTGGYFVTDHIATEFVIGVPPQFDLHGSGAFQQYGKLGSAKQWSPTLLFKYYFNQPQAKFRPYLGLGVSRVSFTDEHITNGAFEANVLHGPTTVTTDSSWEPVFNAGFTYAFTDHWFAGFSISYLPLSTTAKLNTQAQTPIGTVNVQSETKIRLNPIVTYVNLGYRF >CP026101.1|AUT52979.1|3321678_3322062_+|DUF883-domain-containing-protein MTALPNTRDALGESWTTAGRRARRIARHSRHAAEDIASELRTLMTELENTLGDGTQADAAVLRTQMRKRLDEARTRLNDTRDAMRERAEAAIHDADDYVHENPWRTIAIVGGVALIAGALLARGGSR >CP026101.1|AUT52980.1|3322251_3323472_-|deoxyguanosinetriphosphate-triphosphohydrolase MSEIRSDPLSESLDAASVTPVTGVVSLPTIAALEAHLAPYAAHSSQSRGRRHHEAPPSARTEFQRDRDRIVHSTAFRRLEYKTQVFVNHEGDLFRTRLTHSLEVAQIARSVARNLRVNEDLVEAISLAHDLGHTPFGHAGQDALNECMRDYGGFEHNLQSLAVVDDLEEHYGAFDGLNLCFETREGILKHCSRENARRLGELGERFLQGRQPSIEAQIANLADEIAYNNHDVDDGLRSGLLTIEQLAEVELWHTHYDAARRDYPQIEGRRLIHETVRRIINTLIVDLIDTTTRNIAQHAPASLDDVRRAPPLVAHSDAVAAQATQLKRFLFKNLYRHYRVMRMANKAQRVIAGLFDAFIDDPRLLPPAYQTPDAAKQPRLIAHYIAGMTDRYASKEYQRLFIVDGD >CP026101.1|AUT52981.1|3323529_3324612_-|3-dehydroquinate-synthase MITVNVELGERAYPIHIGADLIGRSELFTPHIRGASVTIVTNTTVDPLYGDTLRKALAPLGKDVTTVVLPDGEAHKNWETLNLIFDALLGARADRKTTLIALGGGVIGDMTGFAAACYMRGVPFIQVPTTLLSQVDSSVGGKTGINHPLGKNMIGAFYQPQAVIADIGALRTLPPRELAAGVAEVIKTGAIADATFFDWIEANIEALNRREPEALAEAVKRSCEIKASVVAADEREGGLRAILNFGHTFGHAIEAGLGYGEWLHGEAVGCGMVMAADLSVRLGHLDEAARKRLVAVIEAAHLPVQAPTLGAARYVDLMRVDKKAEAGEIKFILLKRFGDTLITRAPDEAVLQTLDASVGT >CP026101.1|AUT52982.1|3324628_3325180_-|shikimate-kinase MQPRDAHANVFFVGLMGAGKTTVGRAVARRLDRPFFDSDHEIEARTGARIPVIFELEGESGFRDREAQVIAELTGRESIVLATGGGAVLRPENRDALRAHGIVVYLRANPHDLWLRTRRDKNRPLLQTEDPKGRLEALYEVRDPLYRECAHFVIETGRPSVNGLVNMVLMQLEMAGVAKPATS >CP026101.1|AUT52983.1|3325317_3326901_-|type-IV-pilus-secretin-PilQ MMRLNVMRSMLACAAFVAMAARASLPPLPADMPFDEALTPAGMPPLPRVVTTDVANPFTPDATDEAGAAGRETDPARDAAQPPSAARVETEPKRQDEARTEPLEGPPVPLPPAARLSTNASPSIPADSPITLHFQHAELGAVLGAFAKFTGLNIVASDKARGAVTLHLDNVPWRAAFDTLLDVNGLAMEQRSNVIWVAPLSELAARERQRFEAHARAAELEPLASRTFELHYAHAEELRKLLTASGNQRVLSKRGAAMADPRTNLLFVTDLDARLAQIAELIASLDRPTRQVLIEARIVEAEKGFSRNLGVKLSMLATNEDGKAIGVVGGKEGAIYDLSARPISGFDAATAGFTLFAAQATRLVNIELSALEAEGLGRIVSSPRVVTADRMKAIVEQGTELPYQAKVGQGVSGVQFRRASLKLEVEPQITPDGRVVLDLDVAKDSVGEQTASGPAINTKHVQTRVEVEDGGTVSIGGIYESDDRDDVTRVPLLGKIPLLGALFRHRAHRDLTSELVVFITPRVVQTN |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
CP026101_2 | 2.1|3300512|38|CP026101|CRISPRCasFinder | 3300512-3300549 | 38 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 63884-63921 | 2 | 0.947 |
CP026101_3 | 3.1|3310924|38|CP026101|CRISPRCasFinder | 3310924-3310961 | 38 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 1036124-1036161 | 2 | 0.947 |
CP026101_3 | 3.1|3310924|38|CP026101|CRISPRCasFinder | 3310924-3310961 | 38 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 1551786-1551823 | 3 | 0.921 |
CP026101_3 | 3.1|3310924|38|CP026101|CRISPRCasFinder | 3310924-3310961 | 38 | NZ_CP013104 | Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence | 214984-215021 | 3 | 0.921 |
CP026101_3 | 3.1|3310924|38|CP026101|CRISPRCasFinder | 3310924-3310961 | 38 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 1271522-1271559 | 4 | 0.895 |
CP026101_1 | 1.1|2302389|38|CP026101|CRISPRCasFinder | 2302389-2302426 | 38 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 1036168-1036205 | 5 | 0.868 |
CP026101_2 | 2.1|3300512|38|CP026101|CRISPRCasFinder | 3300512-3300549 | 38 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 1317562-1317599 | 5 | 0.868 |
CP026101_3 | 3.1|3310924|38|CP026101|CRISPRCasFinder | 3310924-3310961 | 38 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 1313541-1313578 | 5 | 0.868 |
CP026101_3 | 3.1|3310924|38|CP026101|CRISPRCasFinder | 3310924-3310961 | 38 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 1313683-1313720 | 5 | 0.868 |
CP026101_3 | 3.1|3310924|38|CP026101|CRISPRCasFinder | 3310924-3310961 | 38 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 290848-290885 | 5 | 0.868 |
CP026101_3 | 3.1|3310924|38|CP026101|CRISPRCasFinder | 3310924-3310961 | 38 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 1316845-1316882 | 5 | 0.868 |
CP026101_3 | 3.1|3310924|38|CP026101|CRISPRCasFinder | 3310924-3310961 | 38 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 2554516-2554553 | 5 | 0.868 |
CP026101_2 | 2.1|3300512|38|CP026101|CRISPRCasFinder | 3300512-3300549 | 38 | NZ_CP013104 | Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence | 2002081-2002118 | 7 | 0.816 |
CP026101_1 | 1.1|2302389|38|CP026101|CRISPRCasFinder | 2302389-2302426 | 38 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 508524-508561 | 8 | 0.789 |
CP026101_3 | 3.1|3310924|38|CP026101|CRISPRCasFinder | 3310924-3310961 | 38 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 535790-535827 | 8 | 0.789 |
CP026101_3 | 3.1|3310924|38|CP026101|CRISPRCasFinder | 3310924-3310961 | 38 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 271225-271262 | 8 | 0.789 |
CP026101_3 | 3.1|3310924|38|CP026101|CRISPRCasFinder | 3310924-3310961 | 38 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 515006-515043 | 8 | 0.789 |
CP026101_2 | 2.1|3300512|38|CP026101|CRISPRCasFinder | 3300512-3300549 | 38 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 1313714-1313751 | 9 | 0.763 |
CP026101_3 | 3.1|3310924|38|CP026101|CRISPRCasFinder | 3310924-3310961 | 38 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 289790-289827 | 9 | 0.763 |
1. spacer 2.1|3300512|38|CP026101|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 2, identity: 0.947
gaatcgcggatgccagcgaaaaaaccaaaac-accgaac CRISPR spacer gaatcgcggatgccagcgaaaacaccaaaacaaccgaa- Protospacer ********************** ******** ******
2. spacer 3.1|3310924|38|CP026101|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 2, identity: 0.947
gtttggtggtttggcctttgcgctggcatccgcgaatt CRISPR spacer gtttggtgttttggtctttgcgctggcatccgcgaatt Protospacer ******** *****.***********************
3. spacer 3.1|3310924|38|CP026101|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 3, identity: 0.921
gtttggtggtt-tggcctttgcgctggcatccgcgaatt CRISPR spacer -tctggttgttatggcctttgcgctggcatccgcgaatt Protospacer *.**** *** ***************************
4. spacer 3.1|3310924|38|CP026101|CRISPRCasFinder matches to NZ_CP013104 (Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence) position: , mismatch: 3, identity: 0.921
-gtttggtggtttggcctttgcgctggcatccgcgaatt CRISPR spacer cgtttgggcgttt-gcctttgcgctggcatccgcgaatt Protospacer ****** **** *************************
5. spacer 3.1|3310924|38|CP026101|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 4, identity: 0.895
gtttggtggtttggcctttgcgctggcatccgcgaatt CRISPR spacer gtttggttgttttgcctttgcgctggcatccgcgtttt Protospacer ******* **** ********************* **
6. spacer 1.1|2302389|38|CP026101|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 5, identity: 0.868
gaaaaaaccaaaaccccgaacagcgacgcgtgaagcga CRISPR spacer gcaaagaccaaaaccccgaacggcgacgcgtgaagcac Protospacer * ***.***************.**************.
7. spacer 2.1|3300512|38|CP026101|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 5, identity: 0.868
gaatcgcggatgccagcgaaaaaaccaaaacaccgaac CRISPR spacer atagcgcggatgccagcgaaaaagcaaaaacaccgaac Protospacer . * *******************.* ************
8. spacer 3.1|3310924|38|CP026101|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 5, identity: 0.868
gtttggtggtttggcctttgcgctggcatccgcgaatt CRISPR spacer gtttggtgttttggcctttgcgctggcatccgcgctgg Protospacer ******** *************************
9. spacer 3.1|3310924|38|CP026101|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 5, identity: 0.868
gtttggtggtttggcctttgcgctggcatccgcgaatt CRISPR spacer gtttggtgttttggcctttgcgctggcatccgcgctgg Protospacer ******** *************************
10. spacer 3.1|3310924|38|CP026101|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 5, identity: 0.868
gtttggtggtttggcctttgcgctggcatccgcgaatt CRISPR spacer ttttggtgttttggtctttgcgctggcatccgcgattc Protospacer ******* *****.******************** *.
11. spacer 3.1|3310924|38|CP026101|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 5, identity: 0.868
gtttggtggtttggcctttgcgctggcatccgcgaatt CRISPR spacer gttcggtgttttggcctttgcgctggcatccgcgtttc Protospacer ***.**** ************************* *.
12. spacer 3.1|3310924|38|CP026101|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 5, identity: 0.868
gtttggtggtttggcctttgcgctggcatccgcgaatt CRISPR spacer gtttggtgggttcgcctttgcgctggcatccgcgtgat Protospacer ********* ** ********************* . *
13. spacer 2.1|3300512|38|CP026101|CRISPRCasFinder matches to NZ_CP013104 (Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence) position: , mismatch: 7, identity: 0.816
gaatcgcggatgccagcgaaaaaaccaaaacaccgaac CRISPR spacer gcatcgcggatgccagcgaaaaagccaaagcaaaccac Protospacer * *********************.*****.** **
14. spacer 1.1|2302389|38|CP026101|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 8, identity: 0.789
gaaaaaaccaaaaccccgaacagcgacgcgtgaagcga CRISPR spacer caaaggccgataacaccgaacggcgacgcgtgaagcga Protospacer ***.. * * *** ******.****************
15. spacer 3.1|3310924|38|CP026101|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 8, identity: 0.789
gtttggtggtttggcctttgcgctggcatccgcgaatt CRISPR spacer tccggcttgttttgcttttgcgctggcatccgcgaatt Protospacer .. * * **** **.**********************
16. spacer 3.1|3310924|38|CP026101|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 8, identity: 0.789
gtttggtggtttggcctttgcgctggcatccgcgaatt CRISPR spacer gcgtcgggcgtttgactttgcgctggcatccgcgaatt Protospacer *. * * * ** * ***********************
17. spacer 3.1|3310924|38|CP026101|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 8, identity: 0.789
gtttggtggtttggcctttgcgctggcatccgcgaatt CRISPR spacer gcgttgggcgtttgactttgcgctggcatccgcgaatt Protospacer *. * * * ** * ***********************
18. spacer 2.1|3300512|38|CP026101|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.763
gaatcgcggatgccagcgaaaaaaccaaaacaccgaac CRISPR spacer aaatcgcggatgccagcgaaaaagcaaacgcccagcgc Protospacer .**********************.* ** .* * * .*
19. spacer 3.1|3310924|38|CP026101|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.763
gtttggtggt-----ttggcctttgcgctggcatccgcgaatt CRISPR spacer -----gcggccgcggtttgcgtttgcgctggcatccgcgaatt Protospacer *.**. ** ** **********************
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1978755 : 1998675
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP026101|1978755:1998675|DBSCAN-SWA GCTATTTCCCCGGTCGTTTCCTGTCGCGCAACAGCGTTGTCTGTTCATCAAGCTGTGATGCCTGCGCCTTGGCGAGATCGGCACGTTGGGTGGCGTGCGCGACATCCGCCTGTCGACGATCCTGGCGCAGCTGCTCTACCGTCGCGCGAAGTGACCTGATCTCTTGCAGCAACGCGTCGTTACCACGCGAGCGGTATCGCCCCAGGTCGACCGTCTGAGCGCCGCTATCCGGCAGTTCAATAGATTGCGCTTCGGCCGCCGTCAAGACCCGCTCGCCTTTGTGAAGCTCGGCGATGTAACCGTCGAATGGCACGCGATACAGCCCCGATGCATGCGAACCGTTGACTGCCGTCCAGCCCTGCAGAGCGGCGATCGCTTCGGCCACCGTCTGCACCGAATCATTCACGTCGACGATGCCCGCGACCATCTGCTTGAGCTGCGTGAGCTGCGCATCGGCCGACGAACTGGCGTAGTCCATCGACTGCATGACTTCCGACAGATCCGCCTGATACTGCCCGGAGCTCGCGTTGTACGCCTTCGACGCGGTCAGGAAATCCTGCGCTGCCGAGGTGAGATTCGACTGCGCATTAGCATCGCCCCCGATCGCTGCGCTGTACAGTTCATCGAAGCGCTGCTTCGTCGCGGCATACTGTTGCTCCGGCGACAGCGTCGACAGGTCGCCCGTCGTTAGCGACTGACGGAACTGGTCGACCTGCGACTTGAACGACTGCAGGGCCTGAGACTGCGTGCTGTACGCGGACGCCACTGCTGACTCATACGACGAGACGACGTTATCGAATGCCGGCGCGAGCCCCAGCAGCGCCACATACGTCCTCTGACCCGCGTCCGTTGAGAGATCGAGCGACTCGACCAGATCGCGAAACTGATCACGCGTGTGGATGCCGCTATAGCCGAGCGCGACGAGCTGATCATTCACCGACTTGGCCGAATCGTCCGCCTGCTGCCCCGCTGACGTGAAGTGCTGATAGAAGTACGTTGCGCTGGTGTTGAACGAATCTACGCCACCGGAAAGGTTCAGCACGGCGAGCTGCGCATCGGTTGACGCTTTCTTCAGGTTGTCAAAGTCGCTCCCGAGATGCTGGAACGAATCGAACAACGACTTGACAGCGTTGAGTTCCGAAAGCAGCGTCGTGATGTCGTTGGCCGACAGCTTCGACGCATCGACGCCCTGCAGAACTGCCGCATACGGCGCATCCAGATTTGCCTTCTGAAGGGAATCGATTACCGAGCGCTGCAGTTGCAGGCTGAAGTCGTTCAGGACGGTCGTGGAATCCTTTACGCCCCGGAGATCCTGTCGATCCGGATACCAGTCCGACCCTGTCGTGAAGCCGGCGGCAACAAACGAATTGCCCTTCTGCGGACTGATTTCGTAGCTCGCCTTGTACTGGCCGAGACCATCGATCGAGCCGCCCAACTGCGTCGCCATCGACTGGATCGTCTGAAAGGTCGAGTTGATCTGCTGGGTGACCTGATCAGCCTCCGGATCACCACCCGACGGCCCTGCAAATTTCGTAGCGCTCTGACCGTTGGAGACGTACGAGGCGCCGTATCGGGTCTCGCCACCACCAATGAACGAGCCGGCCAATGCGCCCAACGCCGCGCCGATGACCGCGCCAATCGGACCCGCCCATGACCCGAGTTCAGCCCCGAGGGCCGTCCCCGCGACCGCTGATGACGTGCCGACCGCCAGGCCCCCCATCGCACCGAGCGAGCCGCCAAGACTCGAATAGCCCTTGTTACCAAACAACGCACCACCCGCCAGACCGCCCAGCAATCCGGCACCTCCATACATGAGCCCTACTGACGACGAGCCGAGGGAACTGCCTACCCCAGAGCCGGTGAAGCCATAAGCATTGGACCCAACCATGGACGCTGTGCTCGACGCGACCCCGCTACCGATGCCAGAACCGAGACCGCCGAGCACCACACCGCCACTGCCGAGCGCACCTGACGACGCACCGGCGATGGCTGCAGAGCCGAGCGCGGTCGACGCACCCCCGTACCCTTGCAGCCACGACATGATCGTGTTGTACCCGTTCGACAGGTTGTTATACGTGCCGACGGGGTTCGAGAGCAGGTTGTTGACCGTGCTGTTGCCGCCAAGGCCGTAGCTCTGCAGCACCGAATTCTGCACACCGGTACCGTCCGTAAGGCCGGCGATGTTGGCGATCACGCTGACGATGAACGGCTTGGCGAACGCCTTGTAGATCTCGTCGATCACCGTGGCCTCGAAGGTGTTCTTCAACGCCTTCGTCCAGCTCGACCATCCTGCCGTGCCGTCGGTGAGCATTTGCAGGAATCCATTATGGAAATCCGAGCCAATGCCATCGATCGTGCTTTTCCACCCGCTGACAAGATCGTCCTGCTGCTTCTTGGCCTGCTCCAGATCATCGAGCGATGACTGATCGTCAGCAACGCTGTGTAACACTTTGGAAAGTGACTTTGCCTTTTCGATCGCGTCATCCCAAGGCTTCGTATCACCCACCTCGCCATTGAGCATGGCAATCGCACGGCCCTGTTCGAGCGCCGCCAGCGTTTCGTCCGCACGCGACGCACGCAGGGCGTCGACGGCACCCTTCGTCATGCCGAACGTATCGATCTGATCTTTCAGCGACTGGTCATTCTGCGTGGCGGAATCGATCTGCTTCTGCATTGCCTCCAGCTGGGACTTGCCATACTTGTCCCATACCTCGTCTTCCTGAGCAGCCGATACCGAGACCTTGGCAAAGAAGTCTTCAACGGCCTTGCTGCCGTCAGCCAGCGCCTTTTTCTCCTGATTGGCGAGCTGCGCACGTTGCGAAGCACTGAGCGCCCGGTTCTTGATGCCCTGTGCGATGAGAGCGGCTTCCCTATTCGCCGCATCGATCTGGTCGGACGCCGCCTGCGCAAGGAGGTCGCGCTGCTGCTGGTAATACTGACTTGCCGACAGCGTGCCGCTCTTGTACGAGGCGTCGAGCACCTTGCGGGCGCTATCGATCGCCGAAAGCTCGTCGGCGAGTGCATCCTTGACTGCCTGCACTTCGCCGGAAAGTTCAGCGCGATCGACGAGGCCCGTGCCGCCTTTCCTCTGCGTCTTATCCTTGTTGCGATCGTTGATCTTTTGCTGGTCTGCCAGCTGCTGTTCCGGCGAGAGGTTCAGCGCGGCCGTCCGGTCAAGGTATTCGTTGACCTCCTGCGTGCGCTTCTCAGCGGGCGTGGCAAACTGCTTGTTCCACGTGTCGTACCACGACTTAGCGTCGATCAGCTGTCGCTGCTGTTCCTCCTGCGATTGCTTGTCGCGTGCGGCCTTGACGGCGGCGGCCTGTTTCGCAATGGCATCCTGCAGATCGGCCTCATCACATGCATCCCATTGGCCTATCGGGAACCGTGCAGCCTTGTTCGCCTGCAGGCGCGCGACGACTTCGGCAGGACCGGCAGCAGCACCAAACGAGCCGATCGCATCGACCGCGCCGCTGATCATGTTTTTGATGTCACGCCAGCCAGCGAGAATGATGCCCTCGTTTTTTGCGATATCAACAGTACGCTGCTCCATGGCAGCCGAGAACGCTTCGACCGCGACTTGCGCCGCCCCCGTCGCATCACCCTGCTTTTCCAGCGCCGTGATCTGATCGTACGTCGACGCGGTCAGATAATGGTATTGCTCGTTCAGCTTGACCGATGCCTTGACGGGATCATCGGCCAGCTTCGTGAGGTCATCGACCATCTGCTTCGCAGAGGTCGACGTGTACGTCGCGACGTCCGCGACCGTGCGACCGAGGCTCGCGATCTCGTCGCCCGTCAACCGCCCGGTAGCAGCAAGCGCAGTCACAGCCTCAACTGCCGTGTCCACCGTCGCCCCGCCGGCCGTAGCCGCTACAGCAAGCTCGCGTAGTCCATCAGTCGTGAGTCCCACATAGCCGCCGGTCATCACCAGCGCCTCGTTCATCTGCGCATTCTGCTCGGCGACATGCACCATCGCAGCGCCCACGAGAACCAGCGGAGCGAGCACCGCTGCAATCTGCAATGCCACTCCCGACGCTGCGCCCTCAAGCAGGTCCATGCGCTCCGCGAGCACCATGATGGATCCGCCGAAGTTGCTCCAGCTACCGGTGAGCGCCTCGTGTGCCAGTACCAGGACCTCCCTACGCGCGCCCGCAGTCTTCAGACCGAAGCCTTCCATCTCCGAGCTCGCTGACTTCGCGCTGGATGCAACGGCGCCGAGCGCCTTCTGGCTCGCCGCAATATCCGCCATCGACGCCTTGTATTCGCCGATGGCGATCGCGCCCGACCTGAACGCCGCTTCGAGGTCTTTCTCCTGCTGCGCCAGTTGTCGGAGCTGGACGGATGCCTGGTCGACCCGGATGCCTGCGAGCGTGCGCTCATAGTCGTCGGTAGAGATCATCCCCGACCTGAAAGCGTCGTCGAGCAACGCCTGATCAGCGGCCAGCTTGCGCGTCGCCGCGCCGAGCGGATCGTACTTTGCTGCCAGCGCGGCGAGCTTGGCGAGGCGCGCATCCTCGTCCTTGCCGACGGCAGCGAGTGCTGCGTCGTAGTCCTGCAGTGAGAGTTTGCCCGTCGCCATCGCGCGATCAAGTCGGGCATATTGATCACCGATCGCCGCAAAGCTGGTTGAGCCCTGAGCGAGCGTCGTGCGCAGCTGCTGCATCTCGTCGTTCATCACTCGCGTCGACTGCGCCGCGTCGATCATGGCCTGCGTCTGGGCCGCAACGTTCGCGCGAACCTGATCAGCGGACGAGCCCAGCGATCCGATGCCGCCGGCGGCCGACTGCGCTGCCGAAGCCTGGGATGCCATCGCCTGAGCGGCCTCGAGCGAGCGGGCGACCATATCCTTGATACGAGCCGCTGCCTGTGCCTCCGCCTCGCTGACGATCGCAAGACTGTCCGACAGCCCCTTGCCGGCCGAGCCGGTTGCCTGCATCGCAGCCGCCAGATCGCGAGCACCGAAAGAACCGGCGGACAACGCCTGACTGATTGCTGTACTCGCTGCCGTCGTGCGACTCTGGCTCTCGACCAGCTCGGCCAGATACTGCGTCGAATCGAGCGTCAGCGTGACGTTGATTGATCCGGCCGTTGCTCCCATATTGCATTCCTCGAAAGCGCCGCGCTGGCGCGATTACGACCGCGAAGCGCGATCGGTGAACACCTGAAGCGCCGCGCGCTCCATGACGCGCACAGCATCGAACAGTTCACGATGGCGCTTTCGCCTGAAGCCGAACATACGCATCGCGGCCTCGACTGCGGGATAGTCGAGACCTTCATAAAAGACGCCGCCGCCGGTCAACGACGCGACGACTGACTTTCGCCACTGCGTGCCGCACGCCGCGAATAGCTCGACGGCATCCCAGTTCTCCGCCAGCACCTCGAAGTCGTCGGTGCTCCCACGCGAACGCGCTGCGGCCACTACATCAGTCGACGCACCGAACGTCGCAAGGGCATCGGCGACATCGGCGTCGGCAGCGAAGTCGTCGCGACGATCGCCCGCCCAGAAGCGGGCAGCGTCGATCAGTTTTTTTCTGGCAGATTCGCGAGGGTCGTCACGAACGCGCGCGAAATCGCACGGATGGTATGTGGATCGTCGATCTGCTCGGCGAGGTTCTGCTCGGAAAACGGCAGCGAGTCCCCGTTCCCGTCGAGCACTTCACCGTCCGGCCAGCCGACGATGACACTTTTGACCAGTTCGCCGGGCGGCTGCTTGATCAGTTCCTGAAATTCAGATTCCTTGACACGGCGCGCGATCAGTACGACCGACTGCTCCGTGATGTTGCCCTCCGCGTCGATTTGCGGAACGGTGACTTTGAGCTTGAACGTGTTGCTTTTCGCCTTGATCAGCGGCATGTTGGCTCTCCTGAGTGTAAAAAGAAACGGGCCACCTGGTCAGGCGGCCCGCGAGGTACTACGGTTGAGAAAAGTTACTTTGGTCAGGTCAGCGTGATCGTCAGTTCGTCGTTTCCATCTACCGGGTTGAGTGTGAGCGTCGCAGACATGCTCGCGATGCCGTTGTTGTCGGTGTATGACGGCGACGTGAGCTGCACGGCCGGCGCGGCGATCGTCACGATGTTGCCGGCCGTCTTGCCGTGCGTGAGCGAGAGGGCAAGATTCTGTGCGACGGCGATCGCGCTCCAGTAGTCCTTGTCTGCCACGCGCGCGAGCTCGAATGCAACGCTGCCGGTCGGCTTGCGATCGGTGATCAGGGCACCCGCTGCGCCAGGCAGGCTGCGGTAGGTGACCGTATTCGCCATGTCGATCTGCAGTGCATTGAGCGTCGCCGCGTAGCCGCCGAGAGAAATGGCAGGCGTGTTGTCGTTATTGACGACAAGCGGATCCTTGAACTTCGAGAAGTCTGTGGTGGGTAGCGCCTGATCGACAACGGGCGAATAGTCACCAGTGAACTTGAACTTGAATTTCGGAATCGCGTTCGAAGTGAGATCGAGCGCAACCGTGCCATACGCGTTCGCGATCTTATGCAGCAGGCCATCGAGAAAATAATAGATCGTCGCCGGTTTCGGATCGTCGCTGATCGGTGCGTACGCCACGCTCGTGTTGTCGTCGGTCGTCGCGGCAAACGAGCATGCCTGCAGCAAGGGATCCCACGCCGGCAACGTGCCGGCTGCCCCACTGCCCGCGATTTCGACACTGAACGACAGCTCAGCGTGCTTCTCGGACACCAGCTGCTGATCGTTGCCAAGGTACGGTTTCACGTTATTGCGCTGCGCATAGGTGGCCGCAACGGGCGTCGAACTGACGTCGCTGACGATCATCGCGTTGTCTGCGCCCGTCGGGACGACAGCAATGCCAAACGCCGTCTGCAGCGCACACAGCACGACGGTCTTTCGGGTTCGCTTCGTGGTCATATGCCTGATTCCTGAAAGTGGAAGTGAAAAGACAGGACGCCGATCGGGTCGACTGACTTATTCGGTCAGGCTGTTCCAGCGGGTCCGGTAGATGTAGACGTAGTGCACTGTCCGCAGACACGATTCGCCGTCGACGTTGGCAAAGATCGGCGGATCCGTCTGTCCCTCGCTAGCGTCAATTAACGATGGCGCATCAAAGCTCATGACAATTGGGTGGGCCAGCTCGAGCACAGAATCTGCTGCCTTGTCGGGTGCCGTATCGCGCGTTATGACCGTGAAAAGCAATTCGGTCTGTCGCGTGGCGAAGCCTACGGCAGACCGGTCGGGCGGCGCTTCACCGCCGAGATGCACAATTAGCGCAAGACTGTGCTGTGCATCGAGCGCGTCGACGACCGAGCGACCGATCTGCACATTGAGCTGCTGCACATCTGGATCGGCCTCCAGTTGCGCGAGCAGCGCCTCGACGAAGGTTTCTCGAAGGGTTGTCATAGCCGTTTGAGCGGAGCGAAACTGAAAAAACCCGCCTCGTCGCTCGGACGGCCGGCTCGCGTGAGCGCATACTGCGCGCCTGCGATCGCAACCACCGTTTGCTTCGGCAGGTCCGGCGCATCAGCCGTGCGGTACTCGATGCCATAATCGACCGCCTGCGCCATGTTCTGCAGGATGTCGGAACCGGCTTGCAGAAAGTCGGCATAGAACGGCTGCGCAACGTCGGCGCCGCTCACCACCGTCACCTCGGTGAGCATGCCGGCGTCGACGGCCGCATCGAAAACCGCACCCAGATCGAAAGCGCTCACAGCCAGAGACGAACGTTGACCGTCGCGTCCGATGCCGCCTTCGGCAACGCGAAATGGCCAATCGGGTCGTTGTCGGTCGCAACTGTGGTCACGACACCGTTCGTCTCGTCCCAATACGCCTTGTCGCCGACGACAGCCGTGCCCGTCGTTGCGCTTGGCAGCGAGAAGACACCGGTCGTGTCGTACTCGCCTTCTGTGTTTGCGGCATAGCTGCCCAAGGCAACGGCCGGCAGCTTGGCGTTACCGAGCAGAACGAGCTGGCCGGAAGTGACCGCAACGGCCAGCGTGGCCGCCAGAATGCGGCCCGGTTGAACAAAGTTTTTCATGGATCGATTTCCTTGAGAGATGTAGGTGCAAATGAAAACGCCGGCACGTGGCCGGCGTTTTCAACGCGTGCGAGCGGGGATTACTGACCGGGATTCTTGAACAGGCCGCGGAAGTCGATCGCCTTCGCGGCGAAGTCAAGACGGGCCTTCACCTTGACGCCGTCGACATCGAAGTCGATCGACTGCTCGGTATACAACCCCTCTTCCCCGTCGAGGTAGCAGTACTCGATCGTATCGATGGCGCCCGGATCCGCAACCACGTACCACGACTTCGCGCTGATCGCGTCCAGCCGCGACTCGACGATCGGCGTGAGAACCGACTGGAACGGGTTCTGCTGCGTCGCCTGCGTCGGCGTGTACGCATTGCTGGTGTACTGATATGCGACCGTTTCCAGCGCTGCCGGCACTAGCAGAAACTTCGGCGCGAGATTGAGCGCGGTACCATCGCCCGGTGCGCTCTGCGTACGCATTGCCGTGCGCGCTACTGAAAGCGAATCGATGCCGATCGCCGCAGCCGTACCGAGATTGCCGTGCGCTGCATGAAACAGCGCCTTCCCGTCGCTCATGGGCGCATTACCGGTAAGGGCCGCATACACCATGTCGGATTCGAGGTTTGCCGCTGCGCGACCGAAGAAAAGCGGCACGCGCTCGAGCGCCGACAGGTCGTCGTTGATGATCATCTGGCGGGTGAATGCGACGACCTTCCCGTACGTAGCGAGCTGAATCGTCTCTCCACCGTCGGTCAGCGTGCCGTACTTGTACTCGCCCGCTTCGTTGATCTTCTCCAGCTTCAGTGCACCATCCACCATCACGCGCGTCGCCGCGCGGAAGTCCGACAACGCGCCCTGACGCGCCCACGACTGGAACGTACGCGGCGCGGCGCCGTAGGCATCGCGAAGCGTGCGGTTGATCACGTTGCCGAAAACCACCGGAAGATCCGAAGTCGTGCCGTAACCACCACGCACGCCGAGCGCAGTGCCGGCCAGATCCATCACGCCCATGCCACGCGTATCGATGCCGGCGGCTTCGAGGCCCACCCGGCACAATTCGCGAAGCGTCAGCCCGCGATACTGGCGAGCCGCATCGGTCAGCTTGTGACGCGGGTTGACACGGTGCATCAGTGCATCGGTCATTGCTGCGCGACGCTGGACCGTCTCGTCTGACACCGTCACGACGTTGGCCGGGCCGCGCTGCGGATTTGCCGCGCTGCGCTCTGCCTGAAGGCGCAGGATCTCGGCACGAGCAGCGTCGACGGTAACACCGCGCGAAATGAAGCCATCAATCAGTTCCTGCTGATTGTCCAGCACACTAGCACGCACCGCAGCGCGCAGATCAGTCACGCGCTGCCGTTCCGCTTCGATACCAGCGTTGCGAGCCGCATCATTTTGGGACGCCGCGTCATCGCTTCGTGCCGTCGGCGCGGCGGGCGTAGCGGGCGCAGCAGCCGGAGCGGCAGGAGTGACGGAAGTTTGGGGTTGGTCATTTTCACCGGGCATCACAGCTCCTTGGTCAAGTTGCGAGGCACGAGCCCCGTCAGAAGAGCCCCCAGCGCTTCGATCATTGAAGACGCAGGGGAAGAATCGTTGAGAAGCGCCATCGTCGAGAGCTCGTTGGCCGCGCACGGTGGCATTTGGATCCGCTGGAATCGACACGAGCGAGATCTCGTAGGGTTCCCAATCGATCGCGCGATAGATCCACTGATCGTTGTCCTCCTGACCCGGAGGAATCATGTCGATCGCGTAGATCCGGTAGCCGAATGAGACATTGCGCAGGATGCCGTCCTGCACGTCCTGAAAGTAAGGCTGCACTGCGTCACGCTGGGAGAAACGCAAGCTCGCCTGGCCGGTTCCATTCGCGCTATCGAGCGACGCGCTCGACACGACACCGAGAACTGACTCGAGACCACCCCAGCGGTCGTGATCGTTCAGGACCGGTGCATTCCCGGAGGTCAGGCGATCCATCCGGACAGCACCGTCGCCAGTGCTTAATTCTTCGATATAGGGCCGGTCGCGCCAATAGTCGTAACGAAGCACGGTCGCGCCGGCCGTCCACTGGACGTCGACGGTGCGTGCCTCCGCATTGACCGATGACACCGGCTGCAGGCGGGTTTGCAGAGGCATGGATGCGTCAGGCGTAGCAGCACCGCCGCGACGGCCGTTTGCAGGTTGGGGCATGGTTCACTCCAATGAGAAACGCCCGCGCATGGCGGGCGTCGGAGATATCAAAGTTGCTTTGCAGATCAATGCTCAGCGCCATCTAGTAGGCGCTCAAGGCGATAGAGGAATGCGCGCGCGGCATTCGCATCAAAAGACCGGCCGGAAACCTGCACTTCGATCGCCAGCGGGTCTGCCGAAATTTCGGCGTCGGTCTCGTCAGGATCGTCGCCCAGGTCGCGAATGCTCTGGTGTCGGCTTTTCAGACGCGCGTTGATGAGCTCGATCATTCCGTTCGCTTCGCGTAACGGATCAATGAACTCGATCCGATGCGTCGACCACGTCACGTCAGCGATCGCCGATCTAGCCTGTCCCGCGAGGAATGCAGTAGAGACGAAGCGTTCCGCCACCTTATCGCAGAACATTGGAATGAATACGAGCCACATTTCCTGCTCGAGCATGCGTTTGAATTCCATCTTGCCCATGCGACCGCTGGTGAAGTTGACCTGCGAGTAATCGCCCGTTAGCTGCTCGTATGTCACATCGGTGCCGGCCGCGATCGCGCGCAGGTCGACGCGGACGCTCGCCTCGTATCCGTTGCTCGTAGCAGGCGCGGAAAACGTGACCTCCTCGCCCTGCCGAAGATATTCGATCATGCCGGGCGACAACGACTCGACGCGCGGGCCGCTCCCCTCCTGACGGACAGGGCCGGTCGTATAACCATCGTCGCTCGACGTCACAAAAGCCGCGAAGCATGCCTCTATCTTTTTGCGCACCCGCTCCGCGTCCTGATATTCGTCAAGATCGCGTGCGGCCCAGATAGCAGTCGCGAGCCATGGAAAGCCGCGAACTGAATTCGGCCGGTCGGTCGCATCGAACACGTGCAAAACCTCGGACGCGGGCACAAAGCGGCTTTGCAGATTCCTCGGCACCTGCGTGACTTCACCCGGATGCTGGTCGAAAAGCCAGTACCCGGTACGCTGTCCGATCAGATTAAACTGAACACCCGCGATGATGAATCCACCCGCCACCGGCCCGATCTTCAATGAGTCGAGGTAGTCGATCTCAAGCACCTGCAGCTGCAACGGCACTTCGAACCCGTCCGACGGCAGCCGAGTACGAAAGCGCACGAGCACCTCGCCCGACTCCTTCAATGCGCGGTAGACCTGAGCCTGCAAACCAAAAAAGTCGAGCAACCCGGCGGCGTCGCAATACTTCGTCCAGCGCTTGAAAACCTTCTGCAGATTCTTGTCGCTGAACTTCGCCTGCACTCCGGTGCCGATCGCGTTGGCCGCCATGATCTTGAGCGCGCGACGGATGTGCGGATTGTTGCGTACGAGGTCGCGAGCACGGTTGCGCAGAACGTTGAGTGCCGGCAGGACTTCCGCCGTTGCGCTCGCCCCCGAGGCTTTCCACCCACCCGATCGCGGGCCACGCTTCGCTCCATCGAAGCCACGCGCGGCCAGCATCGACATCCGTGCTCGCATCCGCGTTGCTGCATAGCGCGGCGCAACCCACTCGATCGCCCTGTCCAGAACGTTTGTTTTCATGTCAGAACCGTTTGTAGATTGCGACGCTCGAACGCGGCGCACGGGTTGCAGACTGGCTCGCAAGTTCGGACTTGATGACATCGCGAGCCTTCAACAGATCGCTCATCGAACGATAGGTAACACGCTTCCCGTTGAACTCAACCGACAGCGTGCCAGTCGCGATCGCTTTCTCGATCGCGTCGAGGTTTTGTTGTGTAAAGGCCATGATCAGTCCTCGATCTCAGGGAACTCTGGAAGCTCGACGGTTTGATTGGCGAGTGCATGCGTGCAATCCCCCAGGAACTGGATCCGTCCGTCGGTGATGAACGTATGGCAGCGGTGCATCGGGATGTCGTGATCACCGCCCCACCAAGAATTCACGCTCGGCGTGAAGGTCGGCTGCTCGAAATCCCCATTGAACCCCCAGTGCGGCTTACCGGTGACGTGCGGCGATTCCTCCGTCACACCAGGCGGCAACCAACTGACGGGCAGTACGCATCCCATCGGTGTGCCGTCGCTCCAGGTGCACCCTGGACAGAGGAACTTGACGCCGTAGAATCGGCCTTCGCTGTCGTTAACGATTTTCGCTTTCACGGGTCCGTCCCTTCCAATTATCATGACCGGCCTCGCAGCCAGTTGCTACGGCGCGGAATCCACGCTTGCCCTTGCGTCGACGCGGCTGGTTGCTCGGGATGCGCCGACTCGACAGGCGGCGCCAGAACCGCCTCTGCTGCGATCAACGCGTTGTCAGTGACTGCGATCGCTGCGTCAGACTCCGCCGCGCCAGCCTGAGAAGTTGCAACAGGCTCAGCCGGCAGAGCCGCAAAGAGGTCGTGGACGCGCGGCTCGATAACCGCCTCCAGCGCCGCCCAATCTGCATCCTGGTACGTGTTCAAACGCAGACGCGGGTGATATGCGCAGGCGAGGTTATAGACCTTCAGATCAAGCGCCTCATTTCGTTTTCGAAGCTTGTCCCATCGGTCTTTCGACGGGTTATACGCTTCTGCCGTCAACTGCTCGAAGTACTCGTCGTCGAGGTCGGTCGAAAAGTGCATACGCCGATCTGCAGGCTCCGCTTCCTCATCGGTGACGAGCGCGCCAAAGATCCTGCTCTTGGCCGTGTCGGTCCCGACCGGCCAGAGCTTCACGCCCTTCGTATATGTCTTGCCCTTGATCGTCACGTCAACGTCGGTCGGGCGGCCGATTATCGGTTTGTGTTTCTCGGATGCACCCTTGACCGCAAACACACCCTTGTGACGCCGCGTGCGGCAGTAGTCGTAAACGTCTTGCGTACGGCCGCCGCCAGAGTCGACTGCGCATAGCTCGATACGCATCGAGATTCCATAGGCGTTGACGAACGTCCGCTCGAGATAAGTGTCGAGCTGCTTCCACACGACTGGCTGCGCCGGATCGCCACGGAACACGACATGGTCGATCGTCCAGTTGCGCATTCCGCGCCCCCAGCCGTCGACCGACACCTCCAGACGATCATTCTGCGTATCCACCGCGCATGTGAGCTTGAGCACACCCGGCGGGATCTGCCGCAGCTTGTACGGCAATGCTCGGCGCTTGATGGTCTCCCACTTCAACTCCGCGCTCTTGTCTTCCCAGCATTCAGCGAGGGCGTTGTTCACGAACGCGATCATCTTGTCCGTGTCGGTTTGCGCGGCCTCCCAGTCGTCCATCAGATCGGACCATGGTCGCCATCCGAGCGGCGCATACAGAGCGCTCAGGTGGAAGCTCGCAGTCTTGCCATCGCCGGCCGCAGTGGGCATCCAGTAGGCGCCCTCGTAGCCCCGCGTTTTCCAGACGCTTTCAGGATTGCCGGCGCCGCAGCCGGTCTGGCAGTAGTACAGCACGACGCTCGGATCGTCGGGCGAACGGCGCATACCCTGGCGCCAGTCGAAAAACTGAGGCGAGCCGCAGTCCGGGCACCGCACGAAGTAGCGACGTTGATCGCCACTTTCGTAGAGCTTCTCGATCTGTGATCGCCGTTTAATCGTCGGTGTGCTGTTCGCGAAAATCTTCGCCCGACGGCCAAAGTTGCTTGTGCGGTTTTTGGCGAGGTCGATCGGATTGCCCTGACCGTCAACATTCAGCACATACTCGTCGATTTCCTCGAGCAGCACGTACCGCACGGTCGTCGATTTCAGACGACCAGCCTTGGTCGCGCTGACGAGGTTCATCAGCCCGCCAGGGAATTTTTTACGCAGCTTCGTGTTTTCGCTGCCTTTCTTCATCGCGTCACGCACCCGGCGACGCAAATCCCGCGTCGACACACGCATGGGCTCGAAGCGGTCCATCTCCCACTTCTCGGCGTCGTCGTACGTTGCGAATACGGCGAGGATGTTACCGGCAGCTGTCGTGATGCAGCGACCAATGAAGTTTTCGCCCAGCGCCGAGCCGCCGAGCTGGTGCCCCTTCATGAACGGGACAACAATTACGCGGCTGTTATCGAACGGCCGGTCATCGTCATGCGCGTAGCGCGTTACCGTGCTCGTCTGACCTGAGAGCGCATCCATGATGCCGACGAGGTATGGCGTGCGCTCATTACGCCACTTGCCGGGCTCAGGGCTGCTTTCCGGAAGGACGCGATGCTGCTCTGACCATTCGGCGATGCCGATGCGTTTGTCAGGCTGTATTGCCTCCGTAATCTTCTTCAGGAATGCTTCGGTCGCTCCCATCGTCGTCATCCGTATCGTTTTCGCGCAACAGCGCCGTCGCGTCGACGGATGCGAGCGCACGAGTGAGTTCCGTTTCGAGCATTGATTCGACCCTCGTCGGATCGGACTCCGCCGCGAGGGCATCTTTCAGACGAACAGGGATGTTCATCACGTTATCCCGCACGGTGCGAAACGCGGTGAAGGCGAGGCGCTGCGCGTCAGCGAGCGGCAGCGTCGTACCGCGTTCGCGCTCGAGGTCCATCCGCTCGCGCTCAAGCCTCGTCTGTTCTCTCGCCGCACGCGCAGCGCGATACGCGACCATTGATGGGTCTTCCTTGCTCGCGGCAGCAGGAAGGTCGTCTTCCTCAACGTCATCGTCGGGATTAGCCGGGGGAGACGGCATCGAGAACGCAGCATTCCCGAGCGACGGGCGGGATTGATCGGTGATCGAGCGTCGCGACTCGTCTGTGTTGCGACGCCAGGCCGCGACGGCTGTATCGGCATCTATCTTGCCGTCCGTGTCGACAGCAATTCGTCCCGATTGGATCGCCTTCTGCACAGCTCGCAGCGTGACGCCGACGTGCCGCGCGAACGCCCTTTGTCCGAGCTTCGCCATCCGACCTCCAGAAACGAGAAAGGGCAGTCGGTGACTACCCGACTACCCCTAAATTGACTACCTGACTACCCGACTGACTACCCTGAAAAGTTGCTTTGACGACGCGTGTGTTGGGGCTCGAATTACCCGCATACCCGTACTTCAGGGAAGGACCCGTGACCGTCGGCGGTCGCCCGCTGCACACCCGACCGGTCACCACCTCGCCGTCGCAAGCGCCATCAGGAACGCCCGCTGCATTGCAGGCTCCAGCGTCCCGTCGACGGTCTGCTCTGCTACATCGTAGAACGGATATCGTTCTTCGTAATGCGGCGTAGCACTGAACGCGAACACCGGGCGAATCGTCGAGCCGTGCCCGAACATGTATCGCGCCCATATGCCAAGTGCGAATCGCCCACCAGCTGGTTTACCCACGAAGTAGCGCACGCCCTTGACCGGATCGCGCGACTTGCGCTTTTTCGATCGGGCGGTCTCGTTCTGTGCGCTGTCCCGCGACGCGCGGATCTGCGAGGCAATCGACGAATACACGCCACGTGGCACATTGCCGTAGGCATCGCGCACGTTCGCATTCGTCGGCGTCGCATAGTCGCCGTTAGGCAAAACCCCGGACAACGCGAGCATCTGCTCGAATCGCGTGTACCCGCGCGATCCACCCGCCACCTGCGGTTGCAGGTACGTGCCGGCCGCCGTGCCCTTGAACGCGAACTGGCGAAAGCCGACTACGACACTCGGACTCGACGACGTCGCGCGCTGAAGCACGGAAACCGAGTTGAGCGTGTATGGCGTCGGACGGTCGAATGCATCGCGCATCTCCGCTTGCTGTGCGACAACGATGTTCTGCGCAACCATGTTCAGCGCTGCGCGCGCGGCGAACACGAACTGCTTGCCCGCTTTAGCCTCCAGCTGCTCGATCATCGGCAGCGACTTGACCTGTGCGCTCACCTGGTCCATACGAGCTCCCAATGCAAAAAGCCCCGAGGCTTGCGCGCTCAGGGCTTCAGATATTCCTGTTTCGTACGGGCGAACGCCCGCCGATCAATTCCCGACAGGCTGTTGGCATGTTATTCGTCGCGCCGCTCGCGCGATTCTGTATGACTGCCGGGACAAGGTTGCTCCACGAGTGTGCGGGGCTCACAACATCCAGTGACTCGGTAAAGGATGCATGGCGCGAATTGTAGACAGGAGTTTTCTTGAAATGCAAGTGTTTCATCGTTGAGCGTACCGACGCACTGTGTCATTCGCTGCACCGTCAAGTGCGTCCAGCAGCGCGTGCACACCATGAAAGTGCCGCTGCCAATGTGCCCGATATTGATCCAGCGGAATCGCGAGCGCGACGGAACGTGCTGCGTGATCTTCAGGGCGGCGGCCACTTCCCTGGCATGCACCGCAGATCTTCCGCCCTGCACGTGCACGCGATGCAGGCATAGCGACACGGCCTAATCCAAGACACTTTTCGCATCGCTCAATTTCCCGGTAGACAATCGGCCCACGTCCATCGCGCCGGACAGCAAAGGGGATCAGGAATTCATCGACACACATAGTCCCCACTCCATTGCACGAGGGGCAAACAACATGTACCACGGATTCCTGAACGCCTCGACCGCTCACGCCCCGGCCGTCGCATTCGACACAGAGGTCAGCAATCCATTCAACAATCACCTGACGTGCGAACCGCTCGACGATATCAGCAATCTGTCGCTCGACCTTTTTACCGGCCTTTGCCGCAGTCCTTTCTGCTGCGCGCGTCACCCCGGTAAATTTTGAGCGATCGAACTTTCCCGGTTCACGCAATCGCTTCGCGAGCAGCAATGTGGCCCGGTGCAAACCCGAACGCCTCAAATCCTGCCCGTATTTCATGCGCCACAACAGGCGGCCGAGCTCGTTTGCAAACGCGAGCGCACCCAAAGTTACTTGGGGATCGGCAATCGGATCGGAGAACTGTCCGCGAACATTCATCGCAATACCGGCTTGCTCTTTCAGGTCGATGTTCACCGCTCCTCCTCGTCGATCAGAACGTTCTTCCGTCCCAAGACCCAAACGTCCCAACGAATTCGGTTGTGTATGCGCGGGCGCGCATCCGCGACGTGCGCACCCCCATGCGCCCACACCCGCACGTCGCGCACATATGCGCCTACGCGCACGAGGCGCATGCGCTGGGACCTTGGGACAAGGGACGAGCAACGGCGCGCCGATGATTGGCATGTGGCGCGCCGTTTCGTGGCGGGAAAGCGGATGCGATCAGAGAGGCGCATCGTCGTCTTCGCAGCCGCCGCTCGACGCGGCGCGGGTCGGCATCTGGACTGGCTTTTCCCGAGGCGGCACGTAATACCACTCACGGTCACCTGTCGTCTCCCGCTGTCGCGCCCAGCCAAGCCGCTTCAGTGCTTTGCCAACGCGCCGCTGTTCGGCAGGCGTCTGCTTCGTGATCTCGATCTTGAGGATGTCGTCGAGGACTTCCTCCATCGTCACGCGCGGCCTGTACTTGAGTTCGCGCGCGATCTTCACCTCGAGCACGTCGCCCTCAAAGCGCGCTTCCTGCTCCTCCTGGAAGAGCGGCTTCTCCTCGTGTGTAACGCGCCACGGCGCCCAAATCTGGCCGCCTGCTTCGGCGTTCTCGCGCTGCCACCGGTGATACTCATGCACCGCCTCAGCCCAGATCTGATCGCGGTTTTCAGCCAAGCCGGAGATGTCGAGAACGTCGCCGCAGCGGACGGGCCAATAGCGGCGGCCGCCTGATTCATCCTTCAGGTACACATCGAAGTTGACCGTGCCGCCAAAGACACTCTGACGGGGCACGTCAACGGCACGCTTCGCATATGGCGGCCTATACGTATCCACTGCGACAGTGAAGAATCGCTTTGAGCTGGAGGAGTCGCTTTTGTTGAGAGCATCAAGCTCGGCAAGCTCGATGACCCACTTCCCAGCCATCACTGCGTAGGAATCCTTGTCGCCAATGACGATGTTGGCGTCGGTGAACCACCGTTCGCCAAATAGCGTGCGAAACGCGTTTGATTTGCCGCCATCCTGCGCGCCTTCGAGGATAAGCACGTTGTCCATCTTGCAACCGGGCTGCATGACGCGGCCGACAGCGCCGAGCAGGTACTTGAAGCCGACGAGCCTTGCATATTCGGTATCCTCGGCATGCAGCCATCGATGTAGCCAGAAGTGCAGACGCGGCGTGCTGTCCCACGTCAGGCCGTTGAGATAGTCGCGCACCTCGTGATAGCGGTTGCGATCGGCGACAAGGAAGACCGCCTGCGCGATGATGTCCGCACGCGGACTGAATCCCCAACCCTGACCCAGCCATAGCGCGAGGCGAGAATCGTCGGCATCAGTCCATTCACCGACCTCTCCGCCCTCGAAAGGTGGGGCACGACGCTTCACGATACGAAGCGCGAACTGTTCGAATGCAAGCACGCCCGACCAGCGCTTGTCATTCGAAAGGATCAGGAAAACGTTGTCGATCGTCGGCAGGATTGCGCCGGATTTCTCCGCACGACGCAGATCGCGCATCCAGGTGTATGCGCCGTTCTCCGCCTCGATATCAGGGGGATCGTCCTCTGTCTGGCCCGCAGCGCCAGCGGACGTTGGGTGTGCTGCCAATACTTTCCTGGATCCGCTTTGAGAAGCCGGTGAGACAATCGCGGCCTGCTCCATCGCGTCAAGCAATCTTGCGGCACGGTTGTAGCCGATCCGCAGACTGCGCTGAACGGCCGATATCGACGCCCGCTTCATCTCGGAGACGAGCCTGACCGCCTTGTCATAAAGCGGGTCCTGAGCGTCGTCGACGGCCAACTCGGCAGCAGGCTGTTTGATGTCAGCTGGCGTAAGGCCCGCCAGGATCGCTGCCGTAATCTGAGATTTGACGACGTGCAACCCCTCGTCGACGTGCAGGTCGTTGTAATCGGTTATTTTCCGGTCACCGCGTGCGCTGAATCTCGGATAGATGACGCTTGCGTTGTCGATCATCGACGCCGCATCGAATGCATGCTTAAGGCCAGTATTCTCAAAGCGACGCGTTCGCTCGGGGACCACGTCGTTGCCGAACGTCAATGCAAGGGACTCAACACCGTGGCGATCGGTGTCGCGTGTGACGCGTACCATGTACCACGTCTTGCGGGATTCGATGCGGATAGCCTCGCCGCCGATCTCGATCTCGCCGACGTAGCCGAATTCCTCGGCCAGATGATCACGCAGGCGCCGCTCGATCTGCCAGTCGTCGTCCGCACAGAACATCACGTGGATGCCCGCATACGTATCGCGCAGATAGCGGGCAGCGGCCATGATATTGCCAGCGTCAAAGCACACCATCACAGGTACCACGCCGTCCGTCGCCATGCGGATCGAACGTGCCGTTGCATAGCCTTCCGCGACAAGCACGACCTTGCTGTCGGCGTTGATGTCGCCGAGCAGGAACAGCGCGCCCTTCTTCTCCATGCCCTTGTTGAAGCGCTTCGCGCCCGTGGACGTGATCTTCTGCAGACCGACGAGCCGGTAACCGTCCGGATAGTTGAACATCGGCACGAGCACATCTCCGTCGGCGGCAAAACGCACGCCTTCGGCTGTGATCTGCTTGCGCGCGAGGTAGTCAGACTCGCCCACGTCGCTGCCTTCCTGCCACTGCTGCCGCGCGCGGTTAGCGGCCAGCTTCGCCGCATGCTGACGCTTTCGCTCCTCTTCACGCGCGAGCGCATCCTGACGGGCCCTCGTCTCTGCGATGTCTTCAGGCGAAAGCGCCTCGCCATGCCAGGCGAACGCCTGTGCGCCGTTGTCATTACCGGACCAGCAGCCGAACGCCCCCGTGTAGCCGATCACCTTGCCGGCCTTGACGACTTCGTGGAGCGAATACCAGTGCTTCTTCTTCGGGCCGTATCGGTGCGGCTTGCCGTCCGCTACAGGATGTCCATCAGGCAGTTTTGGATGCCCATGTGCCATCAATTGACCAACAATCCCCGCAAAGTCCGACAT
Protein sequences of DBSCAN-SWA_1 >CP026101|1978755:1998675|1990453_1990657_-|AUT51954.1|DBSCAN-SWA MAFTQQNLDAIEKAIATGTLSVEFNGKRVTYRSMSDLLKARDVIKSELASQSATRAPRSSVAIYKRF >CP026101|1978755:1998675|1991045_1993115_-|AUT53435.1|terminase|DBSCAN-SWA MGATEAFLKKITEAIQPDKRIGIAEWSEQHRVLPESSPEPGKWRNERTPYLVGIMDALSGQTSTVTRYAHDDDRPFDNSRVIVVPFMKGHQLGGSALGENFIGRCITTAAGNILAVFATYDDAEKWEMDRFEPMRVSTRDLRRRVRDAMKKGSENTKLRKKFPGGLMNLVSATKAGRLKSTTVRYVLLEEIDEYVLNVDGQGNPIDLAKNRTSNFGRRAKIFANSTPTIKRRSQIEKLYESGDQRRYFVRCPDCGSPQFFDWRQGMRRSPDDPSVVLYYCQTGCGAGNPESVWKTRGYEGAYWMPTAAGDGKTASFHLSALYAPLGWRPWSDLMDDWEAAQTDTDKMIAFVNNALAECWEDKSAELKWETIKRRALPYKLRQIPPGVLKLTCAVDTQNDRLEVSVDGWGRGMRNWTIDHVVFRGDPAQPVVWKQLDTYLERTFVNAYGISMRIELCAVDSGGGRTQDVYDYCRTRRHKGVFAVKGASEKHKPIIGRPTDVDVTIKGKTYTKGVKLWPVGTDTAKSRIFGALVTDEEAEPADRRMHFSTDLDDEYFEQLTAEAYNPSKDRWDKLRKRNEALDLKVYNLACAYHPRLRLNTYQDADWAALEAVIEPRVHDLFAALPAEPVATSQAGAAESDAAIAVTDNALIAAEAVLAPPVESAHPEQPAASTQGQAWIPRRSNWLRGRS >CP026101|1978755:1998675|1990659_1991049_-|AUT51955.1|DBSCAN-SWA MIIGRDGPVKAKIVNDSEGRFYGVKFLCPGCTWSDGTPMGCVLPVSWLPPGVTEESPHVTGKPHWGFNGDFEQPTFTPSVNSWWGGDHDIPMHRCHTFITDGRIQFLGDCTHALANQTVELPEFPEIED >CP026101|1978755:1998675|1984693_1985626_-|AUT51948.1|DBSCAN-SWA MTTKRTRKTVVLCALQTAFGIAVVPTGADNAMIVSDVSSTPVAATYAQRNNVKPYLGNDQQLVSEKHAELSFSVEIAGSGAAGTLPAWDPLLQACSFAATTDDNTSVAYAPISDDPKPATIYYFLDGLLHKIANAYGTVALDLTSNAIPKFKFKFTGDYSPVVDQALPTTDFSKFKDPLVVNNDNTPAISLGGYAATLNALQIDMANTVTYRSLPGAAGALITDRKPTGSVAFELARVADKDYWSAIAVAQNLALSLTHGKTAGNIVTIAAPAVQLTSPSYTDNNGIASMSATLTLNPVDGNDELTITLT >CP026101|1978755:1998675|1985683_1986115_-|AUT51949.1|DBSCAN-SWA MTTLRETFVEALLAQLEADPDVQQLNVQIGRSVVDALDAQHSLALIVHLGGEAPPDRSAVGFATRQTELLFTVITRDTAPDKAADSVLELAHPIVMSFDAPSLIDASEGQTDPPIFANVDGESCLRTVHYVYIYRTRWNSLTE >CP026101|1978755:1998675|1993062_1993710_-|AUT51956.1|DBSCAN-SWA MAKLGQRAFARHVGVTLRAVQKAIQSGRIAVDTDGKIDADTAVAAWRRNTDESRRSITDQSRPSLGNAAFSMPSPPANPDDDVEEDDLPAAASKEDPSMVAYRAARAAREQTRLERERMDLERERGTTLPLADAQRLAFTAFRTVRDNVMNIPVRLKDALAAESDPTRVESMLETELTRALASVDATALLRENDTDDDDGSDRSIPEEDYGGNTA >CP026101|1978755:1998675|1993902_1994658_-|AUT51957.1|DBSCAN-SWA MDQVSAQVKSLPMIEQLEAKAGKQFVFAARAALNMVAQNIVVAQQAEMRDAFDRPTPYTLNSVSVLQRATSSSPSVVVGFRQFAFKGTAAGTYLQPQVAGGSRGYTRFEQMLALSGVLPNGDYATPTNANVRDAYGNVPRGVYSSIASQIRASRDSAQNETARSKKRKSRDPVKGVRYFVGKPAGGRFALGIWARYMFGHGSTIRPVFAFSATPHYEERYPFYDVAEQTVDGTLEPAMQRAFLMALATARW >CP026101|1978755:1998675|1983888_1984281_-|AUT51946.1|DBSCAN-SWA MIDAARFWAGDRRDDFAADADVADALATFGASTDVVAAARSRGSTDDFEVLAENWDAVELFAACGTQWRKSVVASLTGGGVFYEGLDYPAVEAAMRMFGFRRKRHRELFDAVRVMERAALQVFTDRASRS >CP026101|1978755:1998675|1986111_1986423_-|AUT51950.1|DBSCAN-SWA MSAFDLGAVFDAAVDAGMLTEVTVVSGADVAQPFYADFLQAGSDILQNMAQAVDYGIEYRTADAPDLPKQTVVAIAGAQYALTRAGRPSDEAGFFSFAPLKRL >CP026101|1978755:1998675|1984277_1984610_-|AUT51947.1|DBSCAN-SWA MPLIKAKSNTFKLKVTVPQIDAEGNITEQSVVLIARRVKESEFQELIKQPPGELVKSVIVGWPDGEVLDGNGDSLPFSEQNLAEQIDDPHTIRAISRAFVTTLANLPEKN >CP026101|1978755:1998675|1994913_1995663_-|AUT53436.1|DBSCAN-SWA MNVRGQFSDPIADPQVTLGALAFANELGRLLWRMKYGQDLRRSGLHRATLLLAKRLREPGKFDRSKFTGVTRAAERTAAKAGKKVERQIADIVERFARQVIVEWIADLCVECDGRGVSGRGVQESVVHVVCPSCNGVGTMCVDEFLIPFAVRRDGRGPIVYREIERCEKCLGLGRVAMPASRARAGRKICGACQGSGRRPEDHAARSVALAIPLDQYRAHWQRHFHGVHALLDALDGAANDTVRRYAQR >CP026101|1978755:1998675|1986829_1988923_-|AUT51952.1|DBSCAN-SWA MPQPANGRRGGAATPDASMPLQTRLQPVSSVNAEARTVDVQWTAGATVLRYDYWRDRPYIEELSTGDGAVRMDRLTSGNAPVLNDHDRWGGLESVLGVVSSASLDSANGTGQASLRFSQRDAVQPYFQDVQDGILRNVSFGYRIYAIDMIPPGQEDNDQWIYRAIDWEPYEISLVSIPADPNATVRGQRALDDGASQRFFPCVFNDRSAGGSSDGARASQLDQGAVMPGENDQPQTSVTPAAPAAAPATPAAPTARSDDAASQNDAARNAGIEAERQRVTDLRAAVRASVLDNQQELIDGFISRGVTVDAARAEILRLQAERSAANPQRGPANVVTVSDETVQRRAAMTDALMHRVNPRHKLTDAARQYRGLTLRELCRVGLEAAGIDTRGMGVMDLAGTALGVRGGYGTTSDLPVVFGNVINRTLRDAYGAAPRTFQSWARQGALSDFRAATRVMVDGALKLEKINEAGEYKYGTLTDGGETIQLATYGKVVAFTRQMIINDDLSALERVPLFFGRAAANLESDMVYAALTGNAPMSDGKALFHAAHGNLGTAAAIGIDSLSVARTAMRTQSAPGDGTALNLAPKFLLVPAALETVAYQYTSNAYTPTQATQQNPFQSVLTPIVESRLDAISAKSWYVVADPGAIDTIEYCYLDGEEGLYTEQSIDFDVDGVKVKARLDFAAKAIDFRGLFKNPGQ >CP026101|1978755:1998675|1995945_1998675_-|AUT51958.1|DBSCAN-SWA MSDFAGIVGQLMAHGHPKLPDGHPVADGKPHRYGPKKKHWYSLHEVVKAGKVIGYTGAFGCWSGNDNGAQAFAWHGEALSPEDIAETRARQDALAREEERKRQHAAKLAANRARQQWQEGSDVGESDYLARKQITAEGVRFAADGDVLVPMFNYPDGYRLVGLQKITSTGAKRFNKGMEKKGALFLLGDINADSKVVLVAEGYATARSIRMATDGVVPVMVCFDAGNIMAAARYLRDTYAGIHVMFCADDDWQIERRLRDHLAEEFGYVGEIEIGGEAIRIESRKTWYMVRVTRDTDRHGVESLALTFGNDVVPERTRRFENTGLKHAFDAASMIDNASVIYPRFSARGDRKITDYNDLHVDEGLHVVKSQITAAILAGLTPADIKQPAAELAVDDAQDPLYDKAVRLVSEMKRASISAVQRSLRIGYNRAARLLDAMEQAAIVSPASQSGSRKVLAAHPTSAGAAGQTEDDPPDIEAENGAYTWMRDLRRAEKSGAILPTIDNVFLILSNDKRWSGVLAFEQFALRIVKRRAPPFEGGEVGEWTDADDSRLALWLGQGWGFSPRADIIAQAVFLVADRNRYHEVRDYLNGLTWDSTPRLHFWLHRWLHAEDTEYARLVGFKYLLGAVGRVMQPGCKMDNVLILEGAQDGGKSNAFRTLFGERWFTDANIVIGDKDSYAVMAGKWVIELAELDALNKSDSSSSKRFFTVAVDTYRPPYAKRAVDVPRQSVFGGTVNFDVYLKDESGGRRYWPVRCGDVLDISGLAENRDQIWAEAVHEYHRWQRENAEAGGQIWAPWRVTHEEKPLFQEEQEARFEGDVLEVKIARELKYRPRVTMEEVLDDILKIEITKQTPAEQRRVGKALKRLGWARQRETTGDREWYYVPPREKPVQMPTRAASSGGCEDDDAPL >CP026101|1978755:1998675|1978755_1983855_-|AUT51945.1|tail|DBSCAN-SWA MGATAGSINVTLTLDSTQYLAELVESQSRTTAASTAISQALSAGSFGARDLAAAMQATGSAGKGLSDSLAIVSEAEAQAAARIKDMVARSLEAAQAMASQASAAQSAAGGIGSLGSSADQVRANVAAQTQAMIDAAQSTRVMNDEMQQLRTTLAQGSTSFAAIGDQYARLDRAMATGKLSLQDYDAALAAVGKDEDARLAKLAALAAKYDPLGAATRKLAADQALLDDAFRSGMISTDDYERTLAGIRVDQASVQLRQLAQQEKDLEAAFRSGAIAIGEYKASMADIAASQKALGAVASSAKSASSEMEGFGLKTAGARREVLVLAHEALTGSWSNFGGSIMVLAERMDLLEGAASGVALQIAAVLAPLVLVGAAMVHVAEQNAQMNEALVMTGGYVGLTTDGLRELAVAATAGGATVDTAVEAVTALAATGRLTGDEIASLGRTVADVATYTSTSAKQMVDDLTKLADDPVKASVKLNEQYHYLTASTYDQITALEKQGDATGAAQVAVEAFSAAMEQRTVDIAKNEGIILAGWRDIKNMISGAVDAIGSFGAAAGPAEVVARLQANKAARFPIGQWDACDEADLQDAIAKQAAAVKAARDKQSQEEQQRQLIDAKSWYDTWNKQFATPAEKRTQEVNEYLDRTAALNLSPEQQLADQQKINDRNKDKTQRKGGTGLVDRAELSGEVQAVKDALADELSAIDSARKVLDASYKSGTLSASQYYQQQRDLLAQAASDQIDAANREAALIAQGIKNRALSASQRAQLANQEKKALADGSKAVEDFFAKVSVSAAQEDEVWDKYGKSQLEAMQKQIDSATQNDQSLKDQIDTFGMTKGAVDALRASRADETLAALEQGRAIAMLNGEVGDTKPWDDAIEKAKSLSKVLHSVADDQSSLDDLEQAKKQQDDLVSGWKSTIDGIGSDFHNGFLQMLTDGTAGWSSWTKALKNTFEATVIDEIYKAFAKPFIVSVIANIAGLTDGTGVQNSVLQSYGLGGNSTVNNLLSNPVGTYNNLSNGYNTIMSWLQGYGGASTALGSAAIAGASSGALGSGGVVLGGLGSGIGSGVASSTASMVGSNAYGFTGSGVGSSLGSSSVGLMYGGAGLLGGLAGGALFGNKGYSSLGGSLGAMGGLAVGTSSAVAGTALGAELGSWAGPIGAVIGAALGALAGSFIGGGETRYGASYVSNGQSATKFAGPSGGDPEADQVTQQINSTFQTIQSMATQLGGSIDGLGQYKASYEISPQKGNSFVAAGFTTGSDWYPDRQDLRGVKDSTTVLNDFSLQLQRSVIDSLQKANLDAPYAAVLQGVDASKLSANDITTLLSELNAVKSLFDSFQHLGSDFDNLKKASTDAQLAVLNLSGGVDSFNTSATYFYQHFTSAGQQADDSAKSVNDQLVALGYSGIHTRDQFRDLVESLDLSTDAGQRTYVALLGLAPAFDNVVSSYESAVASAYSTQSQALQSFKSQVDQFRQSLTTGDLSTLSPEQQYAATKQRFDELYSAAIGGDANAQSNLTSAAQDFLTASKAYNASSGQYQADLSEVMQSMDYASSSADAQLTQLKQMVAGIVDVNDSVQTVAEAIAALQGWTAVNGSHASGLYRVPFDGYIAELHKGERVLTAAEAQSIELPDSGAQTVDLGRYRSRGNDALLQEIRSLRATVEQLRQDRRQADVAHATQRADLAKAQASQLDEQTTLLRDRKRPGK >CP026101|1978755:1998675|1988988_1990452_-|AUT51953.1|portal|DBSCAN-SWA MKTNVLDRAIEWVAPRYAATRMRARMSMLAARGFDGAKRGPRSGGWKASGASATAEVLPALNVLRNRARDLVRNNPHIRRALKIMAANAIGTGVQAKFSDKNLQKVFKRWTKYCDAAGLLDFFGLQAQVYRALKESGEVLVRFRTRLPSDGFEVPLQLQVLEIDYLDSLKIGPVAGGFIIAGVQFNLIGQRTGYWLFDQHPGEVTQVPRNLQSRFVPASEVLHVFDATDRPNSVRGFPWLATAIWAARDLDEYQDAERVRKKIEACFAAFVTSSDDGYTTGPVRQEGSGPRVESLSPGMIEYLRQGEEVTFSAPATSNGYEASVRVDLRAIAAGTDVTYEQLTGDYSQVNFTSGRMGKMEFKRMLEQEMWLVFIPMFCDKVAERFVSTAFLAGQARSAIADVTWSTHRIEFIDPLREANGMIELINARLKSRHQSIRDLGDDPDETDAEISADPLAIEVQVSGRSFDANAARAFLYRLERLLDGAEH >CP026101|1978755:1998675|1986419_1986749_-|AUT51951.1|DBSCAN-SWA MKNFVQPGRILAATLAVAVTSGQLVLLGNAKLPAVALGSYAANTEGEYDTTGVFSLPSATTGTAVVGDKAYWDETNGVVTTVATDNDPIGHFALPKAASDATVNVRLWL |
16 | Pseudomonas_phage(11.11%) | portal,terminase,tail | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
2535972 : 2550195
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >CP026101|2535972:2550195|DBSCAN-SWA CTCAGTGGTCTCCAGCGCCGCCCCCCCCTGAGTCCGGGTCGGCGGGCGTTGCCGTCACAACCGTCGCCAATGACTCGCTAAGCCAGTCGTTGTATGCCGTAATGAAATAAGTTCCGCCTGCCGCGACCGTGCAGGTGGCTGCGGAGCCTTCATATACAACCGCGCCTGAGTTGTCGCGCACACGATACCCGGTGACGTTCGCGTAGCCCGCTACTGACATCGTCAAAGTCACTGTGGGTGCTGCGCCGGAAACGCTGACCGATGGAGCTGGCGGCGGCGCATTCTTGACAATGATCGTCGCGGGATTCCCGAGCCCGGCGATATTGCCTGCCTCGACCCGGACATCGAACGTATCTGCCAGTGCTTCATACTGCCGCGCCTGATCCAGCGTCCACGTGAAGTTCTGCGCAGTAATATCGGCTCCGTATTTGACGACACCGCCCACGACGACATCGACATGGAAATAATCCACGTCAGCGCGTCGCTGGATTTCGATCGACAGCTGGTTGCCGACGAAAGGCTCGCGCAGTGAGAGCGTCGGCGGCGCAGGCGGATATTTGAACTCCGTTACTGTACCCGTCCATGTCGTGTACGGCCCCGGCAAGCCACTCGCACCGAAAGCGCGCACGCGGAACCTCCATGTACCGATTGCAACGGGCACATCGAGCACGTTTGAACCCGTCGCGCCGAGCTTGGCCCACGTGACACCACCATCAGAACTGCCCCAGTATTCGTAAAGTAGCGCCCCCGCCGCCGGCGTAGCCGTGATGGACACGCGCCCCGGTGTGATCTGCCAGTTCGCAGAAACCTGGGCAATGATCGGCGCGTGTACCACGGTTGGCAGCAACGATGGTGACGTCGGAGCGGGCGGACTCAGGCCCGTCTCAGCCGTGTGCGGACTGGCTGCATTGTTGACAAACGTGAGTTCCACCTTCCCATCGCGCGGCACCGCCCTGATCAACCGGACATCCTGTGACACCTTGTCGATCGCCGGACCAAAGGAGAACCGGGTCGGCTCTTCCAGGTGCCCGTCACTGAACTCGAAAAACTCAGGCAGCGGATCGATCAGCAGCATCGAGAGACCATCGTCGGCGTCAGGCTGCACGACCCTCAACGCTGCCGTCGGCATCCCGTTGCGTTGCGCGAAGTAGACGTAATGCTGCGCACCCGCAGTCCATTCAAGCTGCTCGCTCACATAGAGATGCGAGCCCTCGACGTCCTCCACCACCCCAGAAATCCCCCACCTCGGCAGATCGTGCGAGATGCCGACGAGGTCTCCATACAGCGGAATCAGTCCGTCGAGCTCAGCGGTAATCGACACCGTCTTGCGCTGGTCACGGTTCGCGTACGCCATATAGATGCCTTCGCGAAATGCCTGCGCGCGTTTCGTCGGCCCGACCATCTGCACGCGTTTCTCGACGAGCGCGGGCGAGCCAGTCGGGATACAGGCGACATCCTGCCAGCTCCACGTGGTCTCATCCATATACTCGACCGTGACGTAATCCGGCGAATCGACGCTAAAGAAGCCGTAGTCCGCTTCAAACGAGCCGGCGACAATGTTGTCCGGCGTGAACATCGCCGTTCGCACCGTCTTCAGTTCGTCGCGGACAACGCTCACGACCCCGCCGAAGTACATCGGTATCGCGCGGCCCGCGCGGCAGATGGTGGTGAGCGCGTCCCAGAACGATCCCTTCGAGTCGAACACACCATTGAACTCGTCACCGCGCGACGCCCACACGGCATCCAGTCGAACCAGCGCGTCGAGATCGAGTCGCCGGTCTGGAAGACCGAGACTGTAATCCGTGTTGCGTACGGCATCCGCAATGGCCCATGCAATGCTCCTGGTCGGCTGCGGCGCACTCCATCCGTTGCCCGTCCAGACAGGCAGCTTTCGCGTTCCGATGACGTGCACGTCGTGCGCGGTGCTCGAATTCAGCGAGTTCGTCGCTCGGGCGCGCATCGCAAGCAGCGTGACATTGCCGTAGTAACGCGCGCTCGGCAGATAGGCACGCAGTGCTGCCCACTGGATGCGGTTCTGCGCCCGCGAGTCTTCATTGAGGTTGGATGTGCGCCGCATGCGCACCTCGTAGCGCCCGCCTGCGACAGGATAGCGATACGACACCATCTGCGGCTGCGCCGTCGCCATACTGACCGTGCGCACGTCGAGCGCGATCCAGTCCGCCAGCGCATGCCCTGCATCATCGATCAGCCGCGCCTGCGCCTCGAACGTCAGCGACAGCGACTGCAGCGAGCCGTCGTCGGCCGCATAAAACAGGCCGGACGGCAAGGTGATATCGACGCCGATAAAGTTGGCCGCGCTGCCCGCCGGATTGGCCACGAACGGCCCCACCCAGTCGCCACTGTCATTCGGTGCCTGCAACTCGATACCGCTTACCGCATTGCTCGTCACGACATTGTCCGGAAACAGCGTAACCGGCTGGTTCGGCCCAACGATTTCGTATTCGATCTCGCTGAAGTTGCCGATGTCAGTGTCACCGATCAGCACCTGTTCGACCTCCAGCTCGCCCTGCGTGATACAAAAGAGCTCATAGAGGTACAGCTCATTGCCCGCATATTCGGTGTACGACTGCGACGCGAGATCGGGCGTGAGCTTCATCCGGCCATACAGCACAGGAATCGCCTCAAGCAGACGGGCGGTGTTACCGCTCGATCCGAGCGCATAGGTCGGACTGGTTGTCGTGCCCTGTGCGACGTTCGCCGTCTTCGCCGGAACGATGGCATTCACGAGCATCGAGCCGGCGGCCATCACCGCCATGGACACGCCCGCCTGCACCGCCATCATGCCAGCCGTCACTGTGCCAGCCGCCACACCCGACGCCGCGCCATAGGCCGCCGCAGCAGCCCCGCCCGTATAGACGGCAGCCACTGCGATCGCGATCATCAGGACTGCCTGCAGGGGATTGGAGCCGCCACCTTGGGCAACCGGAATGAAATGAACGATGGCACCGTGAGGAATTGGCTTCGACCAGTCGGCTCGCAGCCAGAAGTCATTGTCGACGCATACGATGGATTGTTGCATGTCGACGCCGGCAAGTTTTCCCCACTGCGCGGGCGACATCGGCGATTCGATCTGACAGACCTGCCGCCCCTGCGCATGACAATATGGATTCACCGCGCGCACGATTGTCGCGCTATAGGAAGCTGTACCAGCTGGCACGTGGATACCCCATCATTCTGAGTTGTGGTCGCGGCGTAAAAACGACGCCAACGCCTTCGATCGCATGCAACACACCACCGCCATCGAGATCCAAATACACGCCGACATGCGGCCTGTCGCCACCTCGCAGCAACACACCGCATCCGTGCACTGGCTGGTCGACAACTCGCCATGCGCCGGCTTCCATCTGCGCGTGGTACAGATCACGCCTCTCGTCCGGAACATCGGGCAGGTCCGGCAACACGATGCCAAAATGATGCCGCTGCACCCACGCCAGCAGTCCCCAGCAGTCGAATGATTCAGGCCCACGTGCGCCGCTTGTCCAGGGCCGATTGATGTACTGGTTCGCGTCGTCTGCCGTCATCGCACGAGCCCCGGAAAAACGTTCGGGCTGTAGACCTTCGACGGGAACGGACTGTTATTGATATCCGTGAGGGTCGCCGTGGCCGTCACGCGCGTCAAAGTTGCTTTGGCGCTCGCGATATACAGCGTCAGCGGCTCACCGTCCTGCGGACCAGCGCCGGAATTGGATGCGAGGAAGCGCCGATGCGTGCATTCGATTTCCTCTGTTTGCGTCACCGCACTCTCGAGATGTCCAATGATCTCCGCGCTCGCACCATCAATGACAATATCGATCTGCGGCGCCTGCCCCTCTTCCATCGATGGCAGACTGAACTGGAACGCCACCGCAAGGAAGGTCACGTTTTTGCCGGCATCGAGCGGTGCATCGAACTCCAGCCGCGCCGTCAGATCGATATAGTCTGCAACGACACGGATGGGTGCCGGATTACCGCTCCCGTCGATAAACGATGAATGCCGCAGTTCGAGCGTATCGAGCACGACTTCGCCCTGCGGATTGCTGGCATACACCTCCGCCAGCGCCTCAGTAATGGTCGCCATCAATCCCACCTCTTTGGTCCAGGCAAGCTCTCGTGAATCAGCCGGTGTAGCGAATCGCCCATTGCGGCGATGTCCGCAGCCACATACGTTAGCAGCACGTCGTACTCGTCGCGCGACGGCAACGGCAGCGCCTTGACTTCCAGCGTCGCGGTCACGTCAAACCATGCCCGCGAGTCGCTGATCGCGCTTTTATATGGCGGGTCATCCATAAACCGCGCCTGCACCTGATTCACGCCCATGCCGTTGAGCAGGCCGACAGCAAACCAGTCGGTCCCCAGATTAATCTCGTAGGCAAGAAAGCCTTCGAAAAGGGCGTACTGCGAATGGGTGAATCGCCACGTCACCGATACGTGAGACGGAACGTTCGTGAAGCGACGTCGCTGCCGCGCGCGACCGTTATCCATGTCGGTGCGCGCGTAAGGCGTCACGGGCTGAACGCCATAACCGTTCGCACGCGGGTCAGGCAGCGTCGCCGGCCACGTTGGTAGCTCTGCCATCAGGGTTGCCCCAGGGTTCGGTTAAGACCGTACCGGCGTTGCATTGTGCGGGCCGTTGCGCCCCGCCCGGATGCAATACCGGACGCGATATGCTGGTCGACTTTTTCGAGAATCACATCCATGCGCAGGCCGCCATCTGGCAGGCGCGACTGCGTCACCTGCTCGACACCAGCCGGCGCGTTATGGATATGCAGATCGATGTTCGGGCCGGAGGCTGCGCCCGCGCCGGGCGAATCGTCGCCGACGTATCCGCCGCTGGCGAAGCGGTTGCGGCCGCTGACCCTGCCGTCCCCGTTCAGCTCTTCGAGGAAGGTGCGCATCCCCGGCTTCGACACGACGCTTTCCTTCACGACAAATTCGCCGTTTGAAAGCCGCGCATTGATGCTGTCGCTTGTACCCGTTCCCGGCCCCCATACCGACCCACCCGTCGCAAAGCCGAGGCCGGCACCCACGCCAAACAGCGAGCCGCTGCCCTGGATCGATGCCGGCATCGTGAAGCCGTAGGCCGCGCTGCCCGTGCCTGATGCCGAGCCAAACAGCGCACCGAAGAGACTGTTCGTAATGCTCGCGTACACCCTGTTTGCGAAGAGTTGAGCGAACGAAGAGAGCATGCTCGACACCATTGATTGCACGGCCTGCGCCGGCGTCCGCGTGCCCGACACGATGTCCGAAAACAGGCTCGAAAACGCCGTCTTGCCCGCGTCCGTGAAATCCTTCAGATAGCCCTGGCTGTCGAGCATCGACTGACGAATCCGGTCGCGCAACTGGTCGAGACTGCGCAACACGCCTTCGTCGCTGGTGGTCCACGACAGCCTGTTCACCTGGTCATAGATAGATTGCAACGACTGCACCGTCTCCGCCGAATTGGAACGCAACGCCGAGAAGGCATCAAGCAACCCGGTCAGCCCTTCCTGCTGATCCAGCGAGATACTGTTCTGGGCATCCTTCGCCCGCGCGACGATATCGTTGTACTGCGCGTTGAGAATCGTGAGCTGCCGGCTCTGGTCGAGAAACGCCGCACTGCCCAGATCCCCCGTCGTCGCCGCCTGCAACCTCGCCCCCCGGTTACGCTCGTCAAAATCGTGGGTGGCTTTCGGAACCGTGACGCCCGACTGTGCCAGCAGTGCATCGCGCGCATCCTCGATCGACTTCTGATACTTACGTTGCGCCTCCGTCTGCTGCGTCATATAGACCGCGTCCTTCGCGGCGTTGTCCTGACGCGCCTTCGCGATCTTCGTGTCGAGCTCACCGATCTCCTGGGTGATCCTGATCCGGTCCTGCAACGGCGCTTTCCAGTACGCCGCCTGCAGCGTCGACCTTTCCTGCTCATAGGCCGCGACAGTCTTCTGAGCCGCGTCATCGGCCAGTGTGATTTCCGCCGTGTAAAACGTCTGATCGGAAATCAAAGTCGCTTTGTGCAGTGCCTGCAGCTGATCATCGGCATTTTTATACGCCGACTGGATCAGTTCCAGACCGTTCTTCGTGTCCTGCAATGCGGCATCCAGCAGGCTTTTGCGGACCTTGTTCGCCTCCGTCGCCCCCGACGTATCCGTGTATTGCTTGCGCAGGTAAGCCTCGTCGGTTGCCTGCTGCTGTTTGGAGACGGCATTTCCCGGATTGGCCTGGTTGTATTCGGCGACCCTGCGCCGGTAGTCGTCGAGCGCATCGTTGACCCGGTTGATGCCCTTCTCTTCGTCGCGCAATTTTTTCAGAAAATCCGAAGCGGCGATGCCCGCCTGCTGAACCTGAGCCTGCTGCGATTTGTCGAGTGCGGCGTCCTGCTCGCGCAATGCGTCCCGATTGAGCGATTCCAGTCTTGCCTGTGCCGCCTGCAATTGCGGTTGCAGAAGGTCAGTGTTCATCGCACCAGAAGGCGCATTAAGCGCGTTCTGCAGCCGCTGCACCTCTGCCGTTGCGTTCGCAATCTGCTCGGCGGCCGTCTCCGCCCGCCCGATCGACTTCATCCATTCCCACGCGCTCTGGATTGCCGAACCGACACCGTGCCAGGCCGTCTGAAGGTAACCGAGGTTAGGCAGCGACTCCATGCGCAGGTGGTCGTCGAGTGCCTTCGCAACCACGAGCATCGCACCCTGCCTGTCACCTGCATCCTCGAGTTGACGGATGTAGTCGTAGGTCGACGTCGTGATGAAGTGCATGCTCTGATTGTGCTGCTCGGCCCACTTCGCCACGCCCTCCGGCATCCGCGCGTAATCCTTCGCGATGTCTTCGAGCTTCTCGCCGGTCAGGTCGTGCATGCGCACGACATCCTCACCGAGCACCTGCAGCGACTGTCCGGTGATCTGCCCCGACGATACAAGCGCCTGCAGCCCTTCACGTGCAGTCCCGAGACTGTTGCCCGTGCTTGCCGCGATCGCCTGGGTCAACGCAGCGAAGCTGCTCGCCGTCTGGCCAGCATAATTACCGGTTGTTTGCAGCGACTTCGTCAGCGTATCGGCTTCCTCGTGCCCCTTGTAGGCAGCAACCGCGAACAGGCCGAGACCGGCTACTACCGCCGAGATGGCCGCACCGACTGGCGAAAGCGCGAAGGCGAAGAAATCAACCTGCTCGGCGTAGACCATCAGCGAACCGCCGAAATTCTTGATGTTGCCTGTGGCAAGTTCGTGCGCCATGACAAGCAGTTCCCGACGCGCGCTCGCGGTGTGCGTGCCGATCTTGTCCATGCCTGCCGCGCCGTCGTCGCCCGCCTTGCCGACCGCTGAGCCGACGCCCTGGGCGTCTGATCGGAGACTGCGCGCAGCCGCCTGATAGGACGACGGGTCGGCGGTGATCTGTACAACGAGCTGCCCCAACGATCTAGACGTTCCGGCCATAGCGAACCCGCAATAAAAAAGCCCCGCGTGAGCGGGGCACTATCAAGCGGACTACGTCGGGAACAGCGCGTCGAGCGCTGCCTCCTCCATGATCCGCACTTTCTCGAAGATCGCGGCACGGCGTTTCGGCTTCACACCGGTCAGTTTCATGACCGGCTCGATAGCGCTGTAATCGAGGCCCGTCTGAATGAGTCTCGCGCTGCCCCATGTGGAGACAGCGACTGTGCGCCATTGCGTGGTCAGCGCGAGAAACACCTGAACGGTAGGCCAGTTCTCGGGATAGACCCCGAATTCGGGCTCGCTCTGCTGCGTGCGTGCCTGCGCGACATCCTCCGCACGCGCACCGAACGCGGCCAGTGCATCAGCAACACCGGCATCGATCGGATTCTCGTCGACATCGACGCCTGCCCATAGCCGTGCCGCGTCGATCAGTTTTTTCGCGTGGCTCCCGATGTACCTTGCAGGAAGGTGTCCCACAGTGCGACGACAGCGTGCGGGATCTGCAGCAGTGCATCGCGGTATTCGGCAGAGAACGGCAGCTCGCCACCGTCATCACCCTTGAGGCCAGACCAGCCGACCAGCACGTCGGCCAAGGCAACATAGACGGGCTGATTCGCCTTCATCAGCGTCTCTGCCTCGTCGCGTTTCAGGCGCTTGAACTCGGCGGTGAATTCACTTGCCTCCACTGCGCCATCGTCGGTGTTGCCAGGCTCGACGACCATAACCTTCGCCTTGAAGGTCGGATTTTTTGCCAGAACGTAGGGCATGCTGTGATCCTCAAATGACGACGGGCCGCCATCAGCGACCCGTTCCGGTTTGCAGCAAAGTTACTTGACCGTGATGACCAGCTCGTCGTTGCCGGTGAGCGGCGTGACCGTGAGCGTCGCATCGAGCATGACCTTGTTGTCCTGATCCGTGTATGACGGATCGGTCAGTTGCACCTGTGGCGCGTCGAATTGCACGATATTGCCAGCGCCGACACCGTGCGTAATCGTCAGCGTGCCGAGGAGCGCATCCTTCGTCGCCGTCCACCAGTCCTTGTCGGCCACCGACCCGAGTTGCATGGTGATCTTGCCCGTCGGCTTGCGATCCGTGACTTCCGCGCGCTCGTAGCCGATCAGTTGCGCCCAGTTGAGCGTATTGGCAACGTCGAGCGACAGAGCCTGCAGCGGCCCCGTAAAACCGTGCATCGACCACGTCGTGAACTGGGTGCTCGCGATCTTCGGCTGCAGGAACTTCGAGAAGTCCGTGCCGGCGGGAACCGGCGAATCCGTCACGGGGTTATAGACACCCATGAAATGGAACTTGAGCTTGGGAACCTGCTTGACCGTGAAGTCGACGGACACTGTGCCGTATGCATCGGTCAGCTTGTGCAGCAGTCCATCGAGGTAGTAATAGATGGTGAGCGGCGTCTGCGGCTGATCGCTGACCGGCGCATAGACCACACTCGTATCCTCGGTCACGGTTTCCGAGAAATAACACGGCACCAGCAGCCGGCCCCACGCGGGTGGCGTCCCGGCCGCACCCGATGCGGCAATCTCGATCTCGAAATCGAGCTCGGCATGACAGCCCGCCGGAAGTTGCTGATCGTTTCCAAAGTATGGCCGGATCGTATCGCGCGACACGTAATCGGCGGCGACCGGCTTCGCGGAGATATTGCTGACGAGCATCGCATCGGCCGCACTCGTCGGCACGGCGGCCGTGCCAATGGCGATCTGCAGCGCCGCCAGCACGACCGACTTCTTCATGGACTTTGCACCCATGTCTGCTCCTTGGCAAAAGGGCGCGAAACACCCTGCATGACGCACGGCGTACGCGTTGGATTTCGTTGGATTTACAGCAGGCTGTTGGGAGGCGTCTGATACTGGATGGTGTAGCGCATGGTCACGATGCCCACGCCGCCATCAATATCGGCGGTTTCCGGCTCGTCGGTTGTCACTTCTTCTACGCCGACAATTTCCGGGCCGTCGAACGCCATCACGATCGGATGCGAACGCTCGAAGATCACGTCCGCCGCCCGGTCAGGCGCCGGATCGCGGACGACCGCCGAGACGAGAAGATCGCACTGACGCGTTGTGCGCCCGATCGTCGACGCTAGGACGATATCGCGGCCCCTGCTCACGACGACGACCTTCGGCTCTTCGCGGTCCACCGCCTCATAGATCGACCGCTCGACCACGACGCCGCTCGACTGCAACTGCGCGTCAGATTTCAGCGCATCCATGATCGAAGACACGAAGGTTTCACGTATCGTCGTCATACAGCCTTCTCCAGATCGGCGACGCTGAAGTAACCGTCATCTTTCCTGCGGGGTGGATGCCGGACCCTGTAGCGTGTGCCATCAATTTCCAGCATGGAACCGCGAGCAAGCTCGGGCATATCGGCCGTCTGATACTCGACCCGATAGTCCGCAGTCGTCACCCGCCCGCCGAGGTCCAGCATGTCGGGCGTTTCGAAGCCGACCTGCACAGGTCGAACCGAGCCATCGCCCAGCTCGACCTGCGCATTTTTCAGCATCCCTGCCGCCGCGAATGCCGGCCAGAACACGCGGAGGTCAAGCAAGGTCAGACGGCGAGCAGAGCAGCACTCGCGCCGTAAGATCGTTTGCGGCCTTCGGTGCGGCATAGACGCCAGCCTTCGCGTTCGCCCCGACGGTCGATGTCACGACAGCATTCGCCGGGTCCCAGTATGCGAAGTCGCCAACGGCCCCCACCGAGGGACCATCCGCCTGCAGCTCGAACACACCATCGAGACGATATTCGCCGGGCATATTCGCCGCGAAATCGGCCGTCGCAACGGCGGGCAGCTTTGCATTGCCCAGTTGCACGAACTGGCCCGAAACAACAGCAGCGGCAAGCGTCACCGTCAGCGTCTTGCCGCTCTGGATAAAGTTCTTCATGAGAATTTTCCTGTGCTGAATTGTTGAATGAGAAACGGACGCGACGATCGCCCCCGCCTTACCTCACCGCCGTCTACTGACCCGGGTTCTGGTACAGGCCGCGATAGTCGGTCGCCTTCGCCGCGAAATCGAGTCGCGCCTTGACCTTCAGGCCGTCGACATCGAAGTCAAGCGACTGCTCGGTGTAAAGCCCCTGCTCGCCTTCGAGGTAGCAGTACTCGACCGTATCAACCATCGCCGGATCAGCAGCCAGGTACCACGACTTCGTGCTCTTCGCATCGAGGCGCGGCTCGACGACCGGGGTCAGTGTCCCGATGAAAGGGTTCTGCTGCGTGGCCTGCGTCGGCGTGTACTGGTTGCTCGTGTACTGATACGCGACCGTCTCCAGCGCTGCCGGCACCAGCAGGAACGTCGGTGTCAGGTTCAGCGGCGTGCCATCGCCCGGCGCGGCTTGCACGCGCATTGCCGCACGTGCCGTCGATAGCGTGTCGACGCCGATTGCACCGCCGGCGGCGGCGAGGTTCTTATGCGCCGCATGGAACAGTGCCTTGCCGTCGCCCATAACGGGGTTACCCGTGAGCGCGGCATACACGAGATCCGATTCCAGATTGGCGGCAGCACGACCGAAGAACAGCGGAACGCGCTCCAGCGCCGACAGGTCGTCGTTGATGATCATCTGGCGGGTAAAGGCGACAATCTTGCCGTACGTGCCGAGCTGGATCGTCTCGCCCGAGTCGACCAGCTGGCCGTATTTGTATTCACCGGACTCGTTGACCTTTTCCAGTTTGATCGCGCCGTCGACCATCACGCGCGTCGCCGCGCGAAAGTCCGTCAGCACGCCCTGCCGTGCCCACGACTGGAAGCTGCGGGGCGCTGCCGTGTAGGCGTCACGCAGCGTGCGATTGATCACGTTGCCGAATACCACGGGCAGATCCGACGTCGAGCTGAAGCCGCCACGCTGCTGCATACCCAGTGCGATACCAGCCAGCTCGCGCACATCGAGACCACGCACATCGACACCTACCGCTTCGAGGCCGACCCGGCAGAACTCTCGCAGGTTCAGTCCGCGATACTGTCGAGCGGCATCGTCGAGCTCGTGGCGCGGGTTGATACGATGCAACAGTGCGTCCGTCATCGCGGCACGACGCGTCTGCGCCTCGTCACGCACCGTCTGAATGTCCGCCGCCCCGCGCTGCGAATTCGCGTTCGAACGCTCCGCCTGCAAGCGCAGGATTTCAATACGGGCGGCGTCAGCCGTCACGCCGCGCTCGATGAAGCCGTCAATCAGTTGCTGCTGGTTTTCCAGCACGCTCGCACGGACTGCAGTGCGAATGTCGATCACACGCTGACGCTCCGACGCAACGGCTTCGGCACGAGCGGCGTCGGTTGCTTCGGTCGACGCTGCAGGCGACTGTGTCGCCGGCGGATTTTGAACAGGCGTGGTGCGGGTGGCGGACGTCGGTTGGTTGTTTTCATCGTCGGGCATTACGGCTCCTTGGTCAGTGTGCGCGGCGCGTGCGCCGTCAGAAGAACCCCCTGCACTGCGATCAGTGAAGGTGCAGGGGAAGAAACGTTGCTGCTGCGCCGCGACGGCTTGGCCGCCCTCGCCTCGCACAGTTGCGTTCGGGTCAGCGGGAATCGACACAAGCGAAATCTCGTACGGCTCCCAGTCGGTCGCGCGGTAGATCCACGTGTCGTTACCCTCCTGACCCGGCGGAATCATGTCGATCGCGTACACGCGGTAGCCGAACGAGATGTTCCGCAGGATCCTGTCGACGACGTCCTGAAAGTACGGCTGCACGTCGTCGCGCGCGGAAAACCGGCACATCGCGTCGCCCGTACCCGTCGCGGCGTCGAGCGTCGCGCTGTCCACAACACCGAGAACGGAATCAATGCCGTCCCACGTGTCGTGATCCCGCAGAAAAGGCGCTGCACCCGATTGCAACCGTCCCATCCGTACAGCAGACGGGTCCGGGCTTAACTCCTCGAGGTAGCTGCGTTCGCGCCACCAGTCGTAACGCTGCACCTGCGCGCCGGTCGTCCATGTCACGGCGATGGAACGGCTCTCCGCGTTGACCGACGAAACAGGTTGCAGACGCGTAAGTAGCGGCATCGAATCGGCGGTCGCCCCGGCGCCGCCGCGATGCCCCTGTGCTGGAACGGGCATGATTCACTCCAAATGGAAACGCCCGCGTGAAGCGGGCGTTATATGGTCAAAGTCACTTTGGCGGGCTATCGTTCTGCCGACATCACCAGCACCTCGAGGCGGCGAAGGCGGGCCATGACCCGCTCGAACATACGGCCAGCATCGGGAGTGCCGGCGGGCTCTGGCTCCGACGTATTGAGCGGATCGGCATTGATTTCTGCATCGGTTTCTTCCGGATCGTCGCCCAGATCGCGAATGATCTGATGCCGACTCTTCAGCCGCGCCTCGATCAATGCGATCATGCCGTTCGCTTCCCGCAGTGGGTCGATCATCTCGATCCGGTTCGGCGACCAGGTCACGTCATAGTCGGGCGACGATGTCACTCCGGCGAGATATGCTGTTGCAGCGAACTGCCCCGCGACTGCTTCGCAGAACATCGGAATGAAGATCAACCACAGTTCCTGCAGGAGCATCCGGTTGAACTCCATCTTGCCCATGCGTCCGCTGGTGAAATTGACCTGTGAGTAGTCGCCCGTGAGCTGTTCGTACGTCGTGTCGGTCCCTGCCGCGATCGCGCGCAAATCGATCCGCACGCCAGCCTCATAGCCATCGTTCGTCTGCGGCGCCGCAAACTGCACCTCCTCATCGTTTCGCAGATACTCGATCATGCCCGGTGAAAGAGATTCGACCCGACGGCCATCGCCGGGACTGGCCACACCCGGCATACCCGCGCGGAACTGCTCGTCGTTCGATTTGACGAACACCGCGAAGCACGCCTCGATCTTTTTCCGGATGCGCTCTGCGTCCTGATATTCGTCGAGATCGCGCGCAGCCCATATGGCAGACGCTAACCACGGGAAACCACGTACGGAGTTCGGCCTGTCGATCGCATCGAAGATATGCAGCACCTCGCTCGCCGGTACAAACCGACTCATCATGTTCCTGGGCACCTGAGCGACTTCGCCGGGATGCTGATCGAACAACCAGTATCCAGTGCGCTGCCCGATCAGATTGAACTGCACGCCGGCCAGAACGAACCCACCGTCGACCTCCCCGACCTTCAGCGAGTCGAGATAATCGATTTCGAGGATCTGGATTTGCAGCGGCACCTCGTAACCATCGCCCGGTCGCCGACGGCGATAGCGGACCAGCACTTCGCCCGACAGTTTCATCGCACGGTATGCCTTGGCTTGCAGGCCGAAGAAGTCGAGCAGGCCGTCCGCGTCGCAGTACTTCACCCAGCGCTTGAACACTTTCTGCTGACGCTTCGATGCGAACTTCGCCTGAATGCCAGTGCCGATCGCATTCGCGACCATGACACGCAGCGCACGCCGCAGATGCGAGTTGTTGACGACAAGGTCACGGGCGCGATTGCGCAGGGTCGCAAGCGACGGCATCAGATTTGCCAGCGAACTGGCCCCTGACGTCGGCCACCCGGCCGCTCGGGGGCCGCGCTTCGCGCCATCGAATCCACGCACCGCCGCCAGCGCCATGCGTGCCCGCATGCGCGCGGCGCCATGCCGAGGCGCGATCCATTCGATTGCATGATCGAGAATGTTTGCTTTCATATCAGTACGGCCTATAGGTGCCAATGCTCGACCGCGAACCACCAAGACCTGACCGGTTTTCGAGATCGGACTTGATCAGATTGCGCACGCGCATCAGATCCGCTGTTGACTGATACGTGATCCGCTTGCCGTTGTACTCGACCGTCAACGTACCAGACGAGATCGCCGCCTCGATCGCGTCAAGGTTCTGCTGCGTGAATGCCAT
Protein sequences of DBSCAN-SWA_2 >CP026101|2535972:2550195|2545633_2545894_-|AUT52380.1|DBSCAN-SWA MLKNAQVELGDGSVRPVQVGFETPDMLDLGGRVTTADYRVEYQTADMPELARGSMLEIDGTRYRVRHPPRRKDDGYFSVADLEKAV >CP026101|2535972:2550195|2535972_2539113_-|AUT52372.1|tail|DBSCAN-SWA MRAVNPYCHAQGRQVCQIESPMSPAQWGKLAGVDMQQSIVCVDNDFWLRADWSKPIPHGAIVHFIPVAQGGGSNPLQAVLMIAIAVAAVYTGGAAAAAYGAASGVAAGTVTAGMMAVQAGVSMAVMAAGSMLVNAIVPAKTANVAQGTTTSPTYALGSSGNTARLLEAIPVLYGRMKLTPDLASQSYTEYAGNELYLYELFCITQGELEVEQVLIGDTDIGNFSEIEYEIVGPNQPVTLFPDNVVTSNAVSGIELQAPNDSGDWVGPFVANPAGSAANFIGVDITLPSGLFYAADDGSLQSLSLTFEAQARLIDDAGHALADWIALDVRTVSMATAQPQMVSYRYPVAGGRYEVRMRRTSNLNEDSRAQNRIQWAALRAYLPSARYYGNVTLLAMRARATNSLNSSTAHDVHVIGTRKLPVWTGNGWSAPQPTRSIAWAIADAVRNTDYSLGLPDRRLDLDALVRLDAVWASRGDEFNGVFDSKGSFWDALTTICRAGRAIPMYFGGVVSVVRDELKTVRTAMFTPDNIVAGSFEADYGFFSVDSPDYVTVEYMDETTWSWQDVACIPTGSPALVEKRVQMVGPTKRAQAFREGIYMAYANRDQRKTVSITAELDGLIPLYGDLVGISHDLPRWGISGVVEDVEGSHLYVSEQLEWTAGAQHYVYFAQRNGMPTAALRVVQPDADDGLSMLLIDPLPEFFEFSDGHLEEPTRFSFGPAIDKVSQDVRLIRAVPRDGKVELTFVNNAASPHTAETGLSPPAPTSPSLLPTVVHAPIIAQVSANWQITPGRVSITATPAAGALLYEYWGSSDGGVTWAKLGATGSNVLDVPVAIGTWRFRVRAFGASGLPGPYTTWTGTVTEFKYPPAPPTLSLREPFVGNQLSIEIQRRADVDYFHVDVVVGGVVKYGADITAQNFTWTLDQARQYEALADTFDVRVEAGNIAGLGNPATIIVKNAPPPAPSVSVSGAAPTVTLTMSVAGYANVTGYRVRDNSGAVVYEGSAATCTVAAGGTYFITAYNDWLSESLATVVTATPADPDSGGGGAGDH >CP026101|2535972:2550195|2545931_2546276_-|AUT52381.1|DBSCAN-SWA MKNFIQSGKTLTVTLAAAVVSGQFVQLGNAKLPAVATADFAANMPGEYRLDGVFELQADGPSVGAVGDFAYWDPANAVVTSTVGANAKAGVYAAPKAANDLTARVLLCSPSDLA >CP026101|2535972:2550195|2540545_2543377_-|AUT52376.1|tail|DBSCAN-SWA MAGTSRSLGQLVVQITADPSSYQAAARSLRSDAQGVGSAVGKAGDDGAAGMDKIGTHTASARRELLVMAHELATGNIKNFGGSLMVYAEQVDFFAFALSPVGAAISAVVAGLGLFAVAAYKGHEEADTLTKSLQTTGNYAGQTASSFAALTQAIAASTGNSLGTAREGLQALVSSGQITGQSLQVLGEDVVRMHDLTGEKLEDIAKDYARMPEGVAKWAEQHNQSMHFITTSTYDYIRQLEDAGDRQGAMLVVAKALDDHLRMESLPNLGYLQTAWHGVGSAIQSAWEWMKSIGRAETAAEQIANATAEVQRLQNALNAPSGAMNTDLLQPQLQAAQARLESLNRDALREQDAALDKSQQAQVQQAGIAASDFLKKLRDEEKGINRVNDALDDYRRRVAEYNQANPGNAVSKQQQATDEAYLRKQYTDTSGATEANKVRKSLLDAALQDTKNGLELIQSAYKNADDQLQALHKATLISDQTFYTAEITLADDAAQKTVAAYEQERSTLQAAYWKAPLQDRIRITQEIGELDTKIAKARQDNAAKDAVYMTQQTEAQRKYQKSIEDARDALLAQSGVTVPKATHDFDERNRGARLQAATTGDLGSAAFLDQSRQLTILNAQYNDIVARAKDAQNSISLDQQEGLTGLLDAFSALRSNSAETVQSLQSIYDQVNRLSWTTSDEGVLRSLDQLRDRIRQSMLDSQGYLKDFTDAGKTAFSSLFSDIVSGTRTPAQAVQSMVSSMLSSFAQLFANRVYASITNSLFGALFGSASGTGSAAYGFTMPASIQGSGSLFGVGAGLGFATGGSVWGPGTGTSDSINARLSNGEFVVKESVVSKPGMRTFLEELNGDGRVSGRNRFASGGYVGDDSPGAGAASGPNIDLHIHNAPAGVEQVTQSRLPDGGLRMDVILEKVDQHIASGIASGRGATARTMQRRYGLNRTLGQP >CP026101|2535972:2550195|2540048_2540546_-|AUT52375.1|DBSCAN-SWA MAELPTWPATLPDPRANGYGVQPVTPYARTDMDNGRARQRRRFTNVPSHVSVTWRFTHSQYALFEGFLAYEINLGTDWFAVGLLNGMGVNQVQARFMDDPPYKSAISDSRAWFDVTATLEVKALPLPSRDEYDVLLTYVAADIAAMGDSLHRLIHESLPGPKRWD >CP026101|2535972:2550195|2539123_2539513_-|AUT52373.1|DBSCAN-SWA MTADDANQYINRPWTSGARGPESFDCWGLLAWVQRHHFGIVLPDLPDVPDERRDLYHAQMEAGAWRVVDQPVHGCGVLLRGGDRPHVGVYLDLDGGGVLHAIEGVGVVFTPRPQLRMMGYPRASWYSFL >CP026101|2535972:2550195|2539509_2540049_-|AUT52374.1|DBSCAN-SWA MATITEALAEVYASNPQGEVVLDTLELRHSSFIDGSGNPAPIRVVADYIDLTARLEFDAPLDAGKNVTFLAVAFQFSLPSMEEGQAPQIDIVIDGASAEIIGHLESAVTQTEEIECTHRRFLASNSGAGPQDGEPLTLYIASAKATLTRVTATATLTDINNSPFPSKVYSPNVFPGLVR >CP026101|2535972:2550195|2545211_2545637_-|AUT52379.1|DBSCAN-SWA MTTIRETFVSSIMDALKSDAQLQSSGVVVERSIYEAVDREEPKVVVVSRGRDIVLASTIGRTTRQCDLLVSAVVRDPAPDRAADVIFERSHPIVMAFDGPEIVGVEEVTTDEPETADIDGGVGIVTMRYTIQYQTPPNSLL >CP026101|2535972:2550195|2548508_2549990_-|AUT52383.1|portal|DBSCAN-SWA MKANILDHAIEWIAPRHGAARMRARMALAAVRGFDGAKRGPRAAGWPTSGASSLANLMPSLATLRNRARDLVVNNSHLRRALRVMVANAIGTGIQAKFASKRQQKVFKRWVKYCDADGLLDFFGLQAKAYRAMKLSGEVLVRYRRRRPGDGYEVPLQIQILEIDYLDSLKVGEVDGGFVLAGVQFNLIGQRTGYWLFDQHPGEVAQVPRNMMSRFVPASEVLHIFDAIDRPNSVRGFPWLASAIWAARDLDEYQDAERIRKKIEACFAVFVKSNDEQFRAGMPGVASPGDGRRVESLSPGMIEYLRNDEEVQFAAPQTNDGYEAGVRIDLRAIAAGTDTTYEQLTGDYSQVNFTSGRMGKMEFNRMLLQELWLIFIPMFCEAVAGQFAATAYLAGVTSSPDYDVTWSPNRIEMIDPLREANGMIALIEARLKSRHQIIRDLGDDPEETDAEINADPLNTSEPEPAGTPDAGRMFERVMARLRRLEVLVMSAER >CP026101|2535972:2550195|2543428_2543734_-|AUT53483.1|DBSCAN-SWA MAAFGARAEDVAQARTQQSEPEFGVYPENWPTVQVFLALTTQWRTVAVSTWGSARLIQTGLDYSAIEPVMKLTGVKPKRRAAIFEKVRIMEEAALDALFPT >CP026101|2535972:2550195|2549991_2550195_-|AUT52384.1|DBSCAN-SWA MAFTQQNLDAIEAAISSGTLTVEYNGKRITYQSTADLMRVRNLIKSDLENRSGLGGSRSSIGTYRPY >CP026101|2535972:2550195|2544204_2545125_-|AUT52378.1|DBSCAN-SWA MKKSVVLAALQIAIGTAAVPTSAADAMLVSNISAKPVAADYVSRDTIRPYFGNDQQLPAGCHAELDFEIEIAASGAAGTPPAWGRLLVPCYFSETVTEDTSVVYAPVSDQPQTPLTIYYYLDGLLHKLTDAYGTVSVDFTVKQVPKLKFHFMGVYNPVTDSPVPAGTDFSKFLQPKIASTQFTTWSMHGFTGPLQALSLDVANTLNWAQLIGYERAEVTDRKPTGKITMQLGSVADKDWWTATKDALLGTLTITHGVGAGNIVQFDAPQVQLTDPSYTDQDNKVMLDATLTVTPLTGNDELVITVK >CP026101|2535972:2550195|2543805_2544144_-|AUT52377.1|DBSCAN-SWA MPYVLAKNPTFKAKVMVVEPGNTDDGAVEASEFTAEFKRLKRDEAETLMKANQPVYVALADVLVGWSGLKGDDGGELPFSAEYRDALLQIPHAVVALWDTFLQGTSGATRKN >CP026101|2535972:2550195|2546349_2548443_-|AUT52382.1|DBSCAN-SWA MPVPAQGHRGGAGATADSMPLLTRLQPVSSVNAESRSIAVTWTTGAQVQRYDWWRERSYLEELSPDPSAVRMGRLQSGAAPFLRDHDTWDGIDSVLGVVDSATLDAATGTGDAMCRFSARDDVQPYFQDVVDRILRNISFGYRVYAIDMIPPGQEGNDTWIYRATDWEPYEISLVSIPADPNATVRGEGGQAVAAQQQRFFPCTFTDRSAGGSSDGARAAHTDQGAVMPDDENNQPTSATRTTPVQNPPATQSPAASTEATDAARAEAVASERQRVIDIRTAVRASVLENQQQLIDGFIERGVTADAARIEILRLQAERSNANSQRGAADIQTVRDEAQTRRAAMTDALLHRINPRHELDDAARQYRGLNLREFCRVGLEAVGVDVRGLDVRELAGIALGMQQRGGFSSTSDLPVVFGNVINRTLRDAYTAAPRSFQSWARQGVLTDFRAATRVMVDGAIKLEKVNESGEYKYGQLVDSGETIQLGTYGKIVAFTRQMIINDDLSALERVPLFFGRAAANLESDLVYAALTGNPVMGDGKALFHAAHKNLAAAGGAIGVDTLSTARAAMRVQAAPGDGTPLNLTPTFLLVPAALETVAYQYTSNQYTPTQATQQNPFIGTLTPVVEPRLDAKSTKSWYLAADPAMVDTVEYCYLEGEQGLYTEQSLDFDVDGLKVKARLDFAAKATDYRGLYQNPGQ |
14 | Achromobacter_phage(30.0%) | portal,tail | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
2972190 : 2984996
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >CP026101|2972190:2984996|DBSCAN-SWA CTCAAACCACTACAGTCTGCGCCTCGCCTTCCTTGCTATCACGCACGACCCCGATCTTCCAGACCTGCTCGCCAGCAGCGGAAAGCAACCCCGTCGCCGTATCCGCGTCAGCGGCAGAAACAATAACCGCCATCCCGATCCCGCAATTGAACACTCGATGCATCTCCGCATCAGCCACCCCGCCATGCTTCTGCAGCCAGGCGAACAGCGGCGGCAGCGGCCACGCACGATGATCCAGCTCGGCCGTCAGGCCTTCCCTGAGCACGCGCGGAATATTCTCCACGAGCCCGCCGCCCGTGATATGCGCCATGCCCTTCACTTCAAGCTGCTGCATCAACGCCAGCAAAGGCTTCACATAAATATGCGTCGGCGCCATCAGCGCATCAGCAAGCGAACGCCCGTCGAAATCGGCATTCAGATCCGGCTGGGCGCGCTCAATGACCTTCCGCACCAGCGAAAAACCATTCGAATGAATACCGCTCGACGCAAGGCCCAGCACCACATCGCCCGGCACGATCTTGCTGCCGTCGATGATCTTGCTCTTTTCCACCGCGCCGACCGCAAAACCCGCCAGGTCGTACTCGCCATCCGGATACATGCCCGGCATTTCGGCCGTTTCGCCGCCGATCAGCGCGCAACCAGCCAGCTCGCAGCCCTGCGCGATGCCCTTCACGACCGTCGCGGCCGTGCCGACATCCAGCTTGCCGCACGCAAAGTAGTCGAGGAAAAACAGCGGCTCGGCACCCTGCACCAGAATGTCGTTGACGCTCATCGCGACGAGATCCTGGCCAACTGTGTCATGCTTGTTCAGCTGGAACGCCAGCTTCAGCTTCGTGCCGACGCCGTCCGTGCCCGATACGAGCACCGGCTCCTTGTACTTCTTCGGCACCTCGAACAGTGCGCCGAACCCACCAATGCCGCCCAGCACGCCGTCGCGCAGCGTCTTCTTGGCAAAGGGCTTGATCGCGTCGACCAGGGCGTCGCCCGCGTCGATGTCCACGCCCGCGTCGCGATACGACAAACCTTGGGCCGAATCAGGGGAATTCGGGGCGGATTTCGGTTGATTCATGGGGGGAGAGCTCGAAGGTCGGTAAAATGCGATTTTACCCGATGCCGGCCGCACGGCTGGAATTTGAGACGCCGACACGACGCCTGGGAAACCCGCTTTGCAGCAAAACAGTCCGATCCTCACGCCGTATCAGCGCCGCGCCTTTATCTGGCTTGCCATTGCGCTCGCCGTCGGCATTCTGCTCTGGCTGTTGAGTCCCGTGCTCACGCCGTTCCTGCTCGGCGCGATCCTCGCGTATATCCTGCAGCCGGGCGTCGCGTGGATGACCCGCCGGCACGTCCCGCGCGGCATCGCCGCACTGCTGATGATCCTGTTCTTCGCCTTGATCGTCACGCTGCTGGGGCTGCTGGTGCTCGGCGTCATCCAGAAGGAAGTGCCGCAACTCAAGCAGCAGGTGCCGTCGTTCTTCTCGCATCTGCATGGCTGGCTGCAGCCCAAGCTGGCGTTGCTGGGCATCATCGATCCGCTCGATTTCGCGAGCATCCGCGACGTGGTGATGGGCCAGCTCGAAGGCAGCGCGCAGACGGTCGTGCTGTATGCGTGGACCTCGATCCGCACCAGCTCGAACGTGATGCTCACGGTGGTCGGCAACGTCGTGATGGTGCCGCTCGTGCTGTTCTATCTGCTGTACGACTGGAATGCGATGCTCGCGCGCCTGCGCGGCTTCGTCCCGCGACGCTTTTTTTCGAAGACCATTCATCTCGCGCGCGACATGGATCACATGCTGTCGCAGTACCTGCGCGGCCAGTTGCTCGTGATGGGCGTGCTCGCTGTGTTCTATGCGGCCGCGCTGTATGTCGCCGGGTTCGAGATCGCGCTGCCCGTCGGCATCTTCACGGGGCTCGCCGTGTTCATTCCGTATATCGGCTTCGCGACAGGCCTCGCGCTCGCGCTCCTCGCCGCCCTGCTGCAATTCGGCGACTGGTACGGGTTCGGCGCCGTCGCGGTGATCTACGGCGTCGGCCAGATACTCGAAAGTTTCTTCCTGACGCCGCGGCTGGTCGGCGAACGGATCGGCCTGCATCCGCTCGCGGTAATCTTCGCGCTGCTTGCGTTCGGCCAGTTGTTCGGTTTTTTCGGCGTGCTGCTCGCGCTGCCCGTCAGCGCGATACTGTCTGTCGCGTTCCGCGAGTTGCGGCAGAGCTATCTGTCCAGTTCGCTTTACAAGAACTGATCGCTTATCGGTTTGCATATCGGTCACTAACGGCTTATTTATTGTGTCGCGCCAATTGACGCTCGATCTCGGCACCCCGCCGCCATCGACATTCGACAACTTCTTCGCCGGCGCCAACGCCGAGCTGGTCACGCGCCTGCGCGAGCTGGACGCGGCGCTCTCGGCCGGCCCGGTCGCGGACCGCACGTTCTACGTCTGGGGCGAGACGGGTAGCGGGCGCACGCATCTGCTGGAGGCGCTCGTGCATGAAGCACCGCCGGGCCACGCGCGCTATGCCGGCCCGCAAAGCAGTCTCGCGGCTTTCGCGTTCGACCCCGCCGTCGCGCTGTATGCAATCGACGACTGTGACCGCCTGTCCGGCGCGCAGCAGATCGCGATGTTCAACCTGTTCAACGAAGTGCGCGCCCATCCGACCAGCGCGCTCGTCGCCGCGGGCAACGCCGCGCCGATGGGCCTCGACGTGCGTGAAGACCTGCGCACGCGGCTCGGCTGGGGCCTCGTGTTCCATGTCGCGCCGCTCGCCGACGACGGCAAGGCGGCCGTTCTGAAGCGCGCGGCGCGCGAGCGCGGCATCAATCTCGCCGACGACGTGCCCGCCTACCTGCTGACCCATTTCCGCCGCGACATGCCAAGTCTGATGGCATTGCTCGATGCGCTCGACCGCTTTTCGCTCGAGCAGAAGCGCGCCGTCACGCTGCCGCTTTTGCGCACCATGCTCGCGTCGCCCGACGGCGCCAGCACTGCGAGCGGCACGGTCGACGCGCCTCGTCGCGCGGCTGCCCAGGCCTCATCATCCGCTTCAAGTAAAATAGTCCCCCATGGCTAACCTCGCTCTCTTCGACCTCGATCACACGCTCATCCCCACCGACAGCGACCACGAATGGGGCCGCTTCATGGTGAAACTCGGCATCGTCGAAGCCGAAAGTTTTGCGCGTGAAAACGATCGCTTCTTTGCCGACTACAGGGCCGGCAAGCTCGACATTCATGCCTATCTGGTCGCGATGCTGACGCCGCTGGCGAAATACCCGCGCTCGCAGCTCAAAACGTGGCACGACCAGTACATGCATGAGGTCATCAAACCCGCCATCGTGCCCGCCGCGATGGAACTGGTGCGCAAGCATCGCGACGCGGGCGACCTGTGCTGCATGGTCACCGCGACCAACGAATTCATCACCGCGCCGATCGCCGAAGTGTTCGGCGTCGAGAAGCTGATCGCGTGCGAAGTGGAGACGGTCGACGGTCACCCCGCATCGGACTACACCGGCTACCCGAAAGGCACGCCGAGCTACCGTGAAGGCAAGATCGTGCGGACGGAAGAATGGCTCGCGTCGATCGGTAAAACATGGTCGGACTTCGAACGCAGCTATTTCTACAGCGATTCGCATAACGACATCCCGCTGCTCGAAAAGGTCACCGACCCGATCGCGACCAACCCGGACGACACGCTACGCGCACATGCCGAAAAGCATGGCTGGCGCATCCTCGAACTCTTTCAACCCTCGTGATCAAGAAACTCATTCGCAAGCTGTTCGGACAGGACGCAGAGCCCGCTGACGAAGTCGCGCCGCCCGCAGAAGCAGACGACTACGCCGTCCCGGAAGAGCGCGCGCACTCCAGCCGCGCAGCCCGTACGTCATCGAAAGGCGCAGCATCGTCGAAAGGCGGGACGCGCCGCAAGCCGGCCCCAGCCGCCGTACCCGAGCCGGACGCGCCCGTCATCATCTCGTCGGAGATTCACGGCATCGACCCGTCGCTGATTTCGCGCAACGCGATCCGCGTGACGGAAGGCCTGCAACAGGCGGGCTTTCGCGCGTTCATCGTGGGCGGCGCCGTGCGCGATCTGCTGCTCGGCATCGCGCCGAAGGACTTCGACGTCGCGACGGACGCCACGCCCGAACAGGTGCAAAAGCTGTTCCGCCGCGCGCGCATCATCGGGCGGCGCTTTCAGATCGTGCACGTGCAGTTCGGCCAGGAAATCATCGAGACATCGACCTTCCGTGCGCTCGTGGACGCCCCGCCCGCTGACGCCGACGCCCCGCCGCCGCGCCGCCTGAAGCGCGACGAACTGGACCGTCGCACGCATGCCGTCGATGCAAGCGGCCGCGTGCTGCGCGACAACGTCTGGGGCGAGCAGCACGAAGACGCCACGCGCCGCGACTTCACCGTCAACGCGATGTATTACGATCCCGCGACGCAAACCGTGCTCGACTATCACAACGGCATGGCCGATGTGCGCGCACGCCTGCTGCGCATGATCGGCGACCCGGCAACGCGCTATCGCGAAGACCCGGTGCGTATGCTGCGCGTCGTGCGCTTCGCCGCGAAGCTCGATTTCGATATCGACGAAGCCACCCGCGCGCCCATCACCGAACTCGCCGATCTCATCAACAACGTGCCCGCCGCGCGGCTGTTCGACGAGATGCTCAAGCTGCTGCTGTCGGGCCACGCGCTCGCGTGCCTGCAGCGTCTGCGCAAGGAAGGGCTGCATCATGGGCTGTTGCCGCTGCTCGACGTCGTGCTCGAACAGCCGCACGGCGAGAAGTTCATCACGCTCGCGCTGAACAACACCGACGCGCGCGTACGCGCCGGCAAGCCGGTTTCGCCGGGCTTCCTGTTCGCCACGCTGCTGTGGCACGACATGCAGCAACGCTGGCAGCAGTACGAGGCGAACGGCGAGTTCCCGGTGCCCGCGCTGCATCGCGCGATGGACGACGTGCTCGACATGCAGACCGAGAAGCTCGCCATTCACAAACGTTTCTCGTCGGACATGCGCGAGATCTGGGGCCTGCAGCATCGACTGGAAAAGCGCTCGGGCCGCAGCGCGCTGAAGTTGCTGGAACACCAAAGATTTAGAGCGGGGTATGATTTCCTCCTGTTGCGCTGCGAATCGGGCGAACTGGATGAGTCGGTCGGTTCGTGGTGGACGGAGTTCATCGAAGGAGACATCGCCGCGCGCGAAGCACTGCTTGCGCAAGGCGGGAAGGACCGGGCGCCCAGAAAACGACGGCGGCGTAGCAGCAACAGCCGAAACCGCAGCAAACAGGGCGACGGAATGGAGGGCGGCACGGCTTCAGGAAACCGCGCAACAGACGATGCGAGCCACGACGGCCCGCATGACGACTGACGCATTTCCGGCGCGCGCCGTCGAACGCAAGTTGCAGGAACTGATGCCATGACGGTTGCCTATCTCGGCCTCGGCGCGAATCTCGGGGATGCGCGCCAGACCCTGAAAGACGCGGTGGTGTGCCTGGCACAACAGCACACCATCACCGTGCTCGCGAAATCGAGCCTTTATCGAACCGCCCCGATCGATGCGAGCGGCGACGATTATCTGAATCTCGTCGTCAAGCTCGACACTACGTTGCCCGTTCGCCATCTGCTCGCGCTGTGTCACAAGATCGAGCATCACTTCGGCCGTGAGCGCCCCTTTCGCAACGCGCCGCGTACGATCGATATCGACATCCTGCTTTACGGCGAACACGCGATCGACGAACCCGATCTGATCGTGCCGCATCCGCGCATGACGGAGCGCGCGTTCGTGCTCGTGCCGCTCGTCGAAATCGAGCCCGCGCTGATCATCCCGCAACGCGGCCGCGCCGATGCGTTTCTGGCTGCCGTCGCCGGGCAGCGTATCGAAAAGATGAAATCCCCGTGCCAGTGCCTCGCGCCCAACACGGCCGAGGCGATCCGCACGCAAGACGCCAGCACCGATATGCCCAAGGGCGGCTGCCCATGAACTCCCTGCCGATGTCCGCCCCGCCGCTCACCGTCACGGCGCCCGACCTGCGTCCGCCGCACGGCTATCTGACGATCGAAGGACCGATCGGTGTCGGCAAGACGTCGCTCGCGCGCCGGCTCGCGCAACGCTGGTCGATGCACGAACTGCTTGAACGCCCGCAGGACAATCCGTTTCTCGAGCGCTTCTATCGCGACACGTCGCGCTATGCGCTGCCCGCGCAACTGAGCTTCGCGCTGCAGCGCGCGCAGCAGGTGCAGGACATCAGCGCGCTGCGCACGGCGGGCACGCCGCTCATCACCGATTTCATGACGCAGAAGAACGACATCTTCGCGCGCCTGACGTTACAGGACGACGAGTACCCGCTGTATCGCGCGCTCGCGGCGAAGCTCGATGTGCCCGGTCCGTCGCCGGATCTGATCATCTATCTGCAGGCCAGTCCCGAAGTGCTGTTCTCGCGCATTCAAAAGCGCGCGGTGCCAATGGAACTGCAGATTTCGGACGCGTACCTGCGCTCGCTGTGCGACGCGTACAACGACTTCTTCTATCACTACGACGGCGCGCCCGTTCTCACGGTCAGCGCCGAACACCTGAATCCGCTCGAATCCGACGCGGACCTTGCATTGCTCGTCGAACGCATCGAAGCGATGCGCGGGCGCAAGGAATTCTTTGTCAAAGGCGGCATGCTATAGCGTCTGGCTGTAGCGTGTAGCCGCATCCCAATGGATCACCCATGACCTATTTGCAGGAGACGAGCCGCAGCGCCGTCACCGTCCCGAAGCTCCAGGCGATGCGTGAAGCGGGTGACAAGATCGTGATGCTCACGTGCTACGACGCAAGCTTCGCGGCGCTGCTCGACCGCGCGGGTGTCGATTCGCTGCTGATCGGCGATTCGCTCGGTAACGTGCTGCAAGGCCAGAGCACGACGCTACCCGTCACGATCGACGAAATCGCGTATCACACGGCCTGTGTCGCGCGTGCGAAGCCCTCCGCGCTGATCGTCGCCGACATGCCGTTCGGCACCTACGGCACGCCCGCCGATGCATTCACGAATGCCGTCAAGCTGATGCAGTCAGGCGCGCAGATGGTGAAGCTCGAAGGCGGCGAGTGGCTCGCGGACACGGTGCGCTTTCTGGTCGAGCGCTCGGTGCCCGTGTGCGGTCACGTCGGGCTCACGCCGCAATCGGTGCACGCGTTCGGCGGCTTCAAGGTGCAAGGCAAGACGGAAGCGGGCGCGGCCCAGTTGCTGCGCGATTCGCGCGCGATGCAGGACGCGGGCGCGCAACTGCTGGTGATGGAAGCGATGCCGACGCTGCTCGCCGCCGAAGTCACGAAGCAGTTGCGCATCCCGACGATCGGCATTGGCGCGGGCGTCGAGTGCTCGGGTCAGGTGCTGGTGCTGCACGACATGCTCGGCATTTTCCCAGGCAAGCGTCCGCGTTTCGTGAAAGATTTCATGCAGGGACAGCCGAGCATTCTCGCAGCCGTCGAGGCGTATGTGCGGGCCGTCAAGGAACGTACGTTCCCGGGGCCTGAGCACACGTTCTGATTAAGCGTGTTCCTGCTGTGTTGCCGGCTCAACCATTCAAGCAATAATCCGCGCCGGCAACACACCGCGTAGCGCATTACAGAGCATCAGCGCTTGCGCATTCGACACATCTTCCCGGGTCAGCACCCGTTCTCCGGCCTGCATGTCGGGATCGTCCAGCAGCACACCACGCATCACGCCCGGCAACACGCCCGACGACAACGGCGGCGTCCACCAGCGTCCATCCAGCTTCACGAAAACGTTTGATCGACCGCCCTCGGTCAGTTCGCCGCGTTCGTTGAAGAACAACGTGTCGAACGCATCGTGCGATTCGGCTTCGCGCCAGCCGCGATCGTATTCCGCGCGACGCGTCGTCTTGTGCAACAGCAGCGGATCGGCTGCCTGCATCGTCGTGAAACCGTGATCGGGACCGAGCAGCACGCCGACCACTTCTTCCGTCAACGGCACGAGCGGCGCACAGGTGATTTCCGCCGCGCCAGCCTTGTCCAGCGTCAGCCGCATCCGGTGCGGCGCGGCGTTGCGCAGCGCCGCGCATTGCGCGACGAGTTGCGCGCGCAGCCTGTCTTCGTCGAATGCGAAGCCCAGCCACCGGGCGCTATGTTCAAGCCGCGCGAAATGCCGATCGAGATGTCGCACGCCGCCTTCGAGCGTCGCATACATCGTTTCGAACAGTTGAAAACCCGGGTCGGCGTCCGTCAGGAAACGTGCTTTCAATTTGCACTCCGCGTATTCGTCTTGCGCCACGCTATCGAGCACGATGCCCGCGCCAATGCCCATTTCGCCGTGCCGTGCTCCGTCGGGTGCGACGGCGCCAAGCGTCAGCGTGCGGATCGCCACCGACAGACAGAAATCGCCACACTGCTTGCCGACTGTCTCGCCGAGATCCGCATGCGCGTCGAGCCAGCCGATTGCGCCCGTGTACAGCCCGCGCGGGGTGGATTCGAGCTGCTCGATCAACTGCATGGTCTTGTGCTTGGGTGCGCCCGTGATGGAGCCGCACGGAAAGAGCGCACGCAGCACCTCGGCAAACGTCGTGCGCGCCTGCAAGGTCGCCGTCACGGTCGACGTCATCTGCCACAGCGACTGATACGGTTCGATCGAAAACAGCGCGGGCGTTTTCACCGTGCCGGTTTCCGCGATCCGTGACAGATCGTTGCGCAGCAGATCGACGATCATCACGTTTTCCGCGCGGTTCTTCGGGTCGCTCGCAAGGAACTCGGCGGCCGCGCGGTCCTGCTGCGCGTCCGTCGAACGCGGCGCAGTGCCCTTCATTGGCCGCGTGCGCAGCAAAGCGCCCTTCTTCTCGACGAACAGTTCCGGCGAGCACGATACGATCCATCGGTCGTCGGGCAACGCGATCAGCGCGCCGTAATGAACCGACTGCCGCGCGCGCAGCCGCCGGAACAACGCGGCGGGCGTGCCGAACACGTCGAAGTACAGCCGGTAGGTGTAGTTGATCTGATAGGAGTCGCCGGCACGCAGCGCGTCGTGAATCGCGCTGATGGCCTGCGTGAACTGCGCTTTGTCGACGCTCGCATGAACCCCGCCGATGCCTGCCACGGAAGGCTCGACGGCCGCGCTGTCGCGAAGCGCGAGCCATGTGTCGACCTCGTCGCGCGACATCTTCCTGCACGTGTCGAACAATAAAAAACGCAGCGTGCCACCGGCACGCTGCGTCTTCTGCTGTAACGCCTGTCCAAATTCGTAATCGGCCAGCACGACGGCGAACAGCCCGCTGTCGATATCGGCCGCAGCCGCCTCGCAGACGGCATCCAGCTGCGCGCGGTCAGTACAAACGTGCTCGCGACGAAACCCCGTGTACAGCCGGCTCGAACGGCGACTCGCAGTCGAATCGCAGTCGTCCAGCAACGCGAACACGGAGCTTCCTTCGTGAGCCGCTGTTTCCTGAGCCGCCATGCATGCCGCCAACTTCAGAGCGCGTTAATCGAAAAAGCTCTTGACGCGGTCGAACCAGCTCTTGCTCTGCGGGCTGTGACGCGCGCCGCCTTCCACGAGCGACTTCTCGAACTGCTGCAGCAGGTCGCGCTGCGCATCCGTGAGCTTGACGGGCGTTTCCACCTGAACATGCACGTACAGATCGCCCGCGATGCTCGAACGCAAACCCTTGATGCCCTTGCCGCGCAGCCGGAACGTCTTGCCCGACTGCGTGCCCTCCGGCACCGTGAAGCTGGCGCGGCCGGCAAGCGTCGGCACTTCGATCTCGCCGCCCAGCGCCGCCTTCGTGAACGGAATCGGCATCTGGCAATGCAGATCGTCGCCGTCGCGCTCGAACACGGCATGCGCCTTGATGTGAATCTCGACGTACAGATCGCCCGACGGACCACCGTTGATGCCCGGCTCGCCGTTGCCCGCCGAACGGATGCGCATACCGTCGTCGATGCCCGCCGGAATCTTCACTTCCAGCGTCTTGGTTTCCTTGGTCTTGCCCGCGCCGTGGCAGTGCGTGCAAGGCTCGGGAATGTAGGTGCCCGTGCCGTGGCACTTCGGACACGTCTGCTGGATGCTGAAGAAACCTTGCGACATCCGCACCGCGCCGGAGCCGTTACAGGTCGGGCAGGTTTCCGGCTTCGTGCCCGGCTTCGCGCCGGAGCCGTGACAGATTTCGCACGACACCCAGCTCGGCACGCGGATCTGCGTGTCGTAGCCGTGCGCGGCCTGCTCGAGCGTGATTTCCATGCTGTAGCGCAGATCCGCGCCGCGATACACCTGCGGACCACCGCGGCCGCTGCGCCCACCCGCCGCTGCCTGGCCGAAGATGTCGCCGAAGATATCGCCGAATGCGTCGGCAAAACCGCCGAAACCTTGGGCGCCCGCACCCGCCATGTTCGGATCGACGCCAGCGTGGCCGTACTGGTCGTACGCTGCACGCTTTTGCGAGTCCGACAACATTTCGTAGGCTTCCTTCACCTCTTTGAAATGCTCTTCCGCATCCTTGTTGCCCGGATTGCGGTCAGGGTGGTGCTTCATCGCGAGCTTGCGATAAGCCTTCTTGATTTCGTCGTCGCTCGCGTTCTTTGCAACGCCCAGAATCTCGTAGTAATCCCGTTTCGCCATATCGGTTCAACGCCATCCGCGCAATGCGGCGCGGCGGCTCCTCTTGAATGCTGGAGTCTCGCGACTCGTAAGGCTCTGGCTTCGACCACAATTACGGGTCAAAAGCCGCGCCCTCCATAAAACAAATGTGCCCGGAGAGCCGAAGGGCTCGCCAGGCGCGTGATCGGTTCAGACGCCGTGCAGGCGCTCAACCCTTTGCAGCCGGGCTGCGCATGGATGACATGCGCAGCCTTGCGGTCGGATAGCAACCCGGCTTAGTCCTTCTTCACTTCCTTGAACTCGGCGTCGACGACGTCGTCGGCCTGCTGGCTTTGGCCCGCCGCTGCTTCCGCACCGGCTGCGCCCGCACCTGCCGCGCCCGCTGCACCTTGCGCCTGCATGTCGGCGTACATCTTTTCGCCGAGCTTCTGCGACGCAGTCGCAACCGTCTCGATCTTGGCTTCGATCGCAGCCTTGTCGCTCGAGCCGCTCTTCAGCGTTTCCTCGAGGTCCTTCAGCGCGGCTTCGATCTTTTCCTTCTCGCCCGCGTCCAGCTTGTCGCCGTACTCGGTGAGCGCCTTCTTCGTGCTATGCACCAGCGCATCGCCCTGGTTGCGGGCGTCGGCCAGTTCACGCAGCTTGTGATCTTCTTCCGCGTTCGCTTCCGCGTCCTTCACCATCTTTTCGATTTCAGCTTCGGACAGACCCGAGTTCGCCTTGATCGTGATGCGGTTTTCCTTGCCCGTCGCCTTGTCCTTCGCGCCGACGTGCAGAATGCCGTTCGCGTCGATGTCGAAGCTCACTTCGATCTGCGGCACGCCGCGCGGTGCGGGAGGAATGCCTTCGAGGTTGAACTCGCCCAGCAGCTTGTTGCCCGCCGCCATTTCGCGCTCGCCCTGGAACACCTTGATCGTCACGGCCGACTGGTTGTCGTCCGCCGTCGAATACACCTGAGCGTGCTTCGTCGGGATCGTGGTGTTCTTGTTGATCATCTTCGTCATCACGCCGCCGAGCGTTTCAATGCCCAGCGACAGCGGCGTCACGTCCAGCAGCAGCACGTCCTTGCGGTCGCCCGACAGCACCTGACCTTGAATAGCTGCGCCGACAGCAACGGCTTCGTCCGGGTTCACGTCACGGCGCGGGTCCTTGCCGAAGAACTCCTTCACCTTTTCCTGCACCTTCGGCATACGCGTCTGGCCGCCGACGAGAATCACGTCGTCGATTTCACCGACCTTCACGCCTGCATCCTTGATCGCGACGCGGCACGGTTCGATGGTGCGCTCGATCAGGTCTTCGACCAGCGCTTCCAGCTTGGCGCGCGTGATCTTCAGGTTCAAGTGCTTCGGACCCGATGCGTCCGCCGTGATATACGGCAGGTTGATTTCCGTCTGCTGACCCGACGACAGCTCGATCTTCGCCTTTTCAGCCGCTTCCTTCAGGCGTTGCAGCGCGAGCACGTCCTTCGACAGATCGACGCCTTGCTCTTTCTTGAACTCGCCGATGATGTAGTCGATGATGCGCTGGTCGAAGTCTTCACCGCCGAGGAACGTGTCGCCGTTCGTCGACAGCACTTCGAACTGCATTTCGCCGTCGACGTCCGCGATTTCGATGATCGAAATGTCGAACGTGCCGCCGCCCAGGTCGAACACCGCGATCTTGCGGTCGCCCTTTTCAGCCTTGTCCAGACCGAAGGCGAGCGCAGCAGCCGTCGGTTCGTTGATGATCCGCTTCACGTCCAGACCGGCGATGCGGCCCGCGTCTTTGGTTGCCTGACGCTGGCTGTCGTTGAAGTACGCCGGGACCGTGATGACGGCTTCCGTGACCGTCTCGCCGAGATAGTCTTCGGCCGTTTTCTTCATCTTGCGCAGCACTTCAGCGGAGATCTGCGACGGCGCCAGCTTTTGACCGTGCGCTTCGACCCAAGCGTCGCCGTTGTCGTGCTTGACGATCTTGTAGGGCATCAGGCCGATGTCCTTCTGCACTTCTTTTTCTTCGAAGCGGCGGCCGATCAGGCGCTTGACCGCGTACAGCGTGTTCTTCGGGTTGGTGACCGATTGACGCTTGGCGGGCGCGCCGACGAGGACTTCGTTGTCGTCCATATAAGCGATGATCGACGGCGTGGTGCGCGCACCTTCCGAGTTCTCGATCACCTTGACCTGATTGCCTTCCATCAGCGCCACGCACGAATTGGTGGTGCCGAGGTCGATGCCGATGATTTTGCCCAT
Protein sequences of DBSCAN-SWA_3 >CP026101|2972190:2984996|2979753_2981634_-|AUT52698.1|DBSCAN-SWA MAAQETAAHEGSSVFALLDDCDSTASRRSSRLYTGFRREHVCTDRAQLDAVCEAAAADIDSGLFAVVLADYEFGQALQQKTQRAGGTLRFLLFDTCRKMSRDEVDTWLALRDSAAVEPSVAGIGGVHASVDKAQFTQAISAIHDALRAGDSYQINYTYRLYFDVFGTPAALFRRLRARQSVHYGALIALPDDRWIVSCSPELFVEKKGALLRTRPMKGTAPRSTDAQQDRAAAEFLASDPKNRAENVMIVDLLRNDLSRIAETGTVKTPALFSIEPYQSLWQMTSTVTATLQARTTFAEVLRALFPCGSITGAPKHKTMQLIEQLESTPRGLYTGAIGWLDAHADLGETVGKQCGDFCLSVAIRTLTLGAVAPDGARHGEMGIGAGIVLDSVAQDEYAECKLKARFLTDADPGFQLFETMYATLEGGVRHLDRHFARLEHSARWLGFAFDEDRLRAQLVAQCAALRNAAPHRMRLTLDKAGAAEITCAPLVPLTEEVVGVLLGPDHGFTTMQAADPLLLHKTTRRAEYDRGWREAESHDAFDTLFFNERGELTEGGRSNVFVKLDGRWWTPPLSSGVLPGVMRGVLLDDPDMQAGERVLTREDVSNAQALMLCNALRGVLPARIIA >CP026101|2972190:2984996|2978901_2979717_+|AUT52697.1|DBSCAN-SWA MTYLQETSRSAVTVPKLQAMREAGDKIVMLTCYDASFAALLDRAGVDSLLIGDSLGNVLQGQSTTLPVTIDEIAYHTACVARAKPSALIVADMPFGTYGTPADAFTNAVKLMQSGAQMVKLEGGEWLADTVRFLVERSVPVCGHVGLTPQSVHAFGGFKVQGKTEAGAAQLLRDSRAMQDAGAQLLVMEAMPTLLAAEVTKQLRIPTIGIGAGVECSGQVLVLHDMLGIFPGKRPRFVKDFMQGQPSILAAVEAYVRAVKERTFPGPEHTF >CP026101|2972190:2984996|2973355_2974432_+|AUT52692.1|DBSCAN-SWA MQQNSPILTPYQRRAFIWLAIALAVGILLWLLSPVLTPFLLGAILAYILQPGVAWMTRRHVPRGIAALLMILFFALIVTLLGLLVLGVIQKEVPQLKQQVPSFFSHLHGWLQPKLALLGIIDPLDFASIRDVVMGQLEGSAQTVVLYAWTSIRTSSNVMLTVVGNVVMVPLVLFYLLYDWNAMLARLRGFVPRRFFSKTIHLARDMDHMLSQYLRGQLLVMGVLAVFYAAALYVAGFEIALPVGIFTGLAVFIPYIGFATGLALALLAALLQFGDWYGFGAVAVIYGVGQILESFFLTPRLVGERIGLHPLAVIFALLAFGQLFGFFGVLLALPVSAILSVAFRELRQSYLSSSLYKN >CP026101|2972190:2984996|2978179_2978860_+|AUT53529.1|DBSCAN-SWA MSAPPLTVTAPDLRPPHGYLTIEGPIGVGKTSLARRLAQRWSMHELLERPQDNPFLERFYRDTSRYALPAQLSFALQRAQQVQDISALRTAGTPLITDFMTQKNDIFARLTLQDDEYPLYRALAAKLDVPGPSPDLIIYLQASPEVLFSRIQKRAVPMELQISDAYLRSLCDAYNDFFYHYDGAPVLTVSAEHLNPLESDADLALLVERIEAMRGRKEFFVKGGML >CP026101|2972190:2984996|2975250_2975937_+|AUT52694.1|DBSCAN-SWA MANLALFDLDHTLIPTDSDHEWGRFMVKLGIVEAESFARENDRFFADYRAGKLDIHAYLVAMLTPLAKYPRSQLKTWHDQYMHEVIKPAIVPAAMELVRKHRDAGDLCCMVTATNEFITAPIAEVFGVEKLIACEVETVDGHPASDYTGYPKGTPSYREGKIVRTEEWLASIGKTWSDFERSYFYSDSHNDIPLLEKVTDPIATNPDDTLRAHAEKHGWRILELFQPS >CP026101|2972190:2984996|2974475_2975258_+|AUT52693.1|DBSCAN-SWA MSRQLTLDLGTPPPSTFDNFFAGANAELVTRLRELDAALSAGPVADRTFYVWGETGSGRTHLLEALVHEAPPGHARYAGPQSSLAAFAFDPAVALYAIDDCDRLSGAQQIAMFNLFNEVRAHPTSALVAAGNAAPMGLDVREDLRTRLGWGLVFHVAPLADDGKAAVLKRAARERGINLADDVPAYLLTHFRRDMPSLMALLDALDRFSLEQKRAVTLPLLRTMLASPDGASTASGTVDAPRRAAAQASSSASSKIVPHG >CP026101|2972190:2984996|2981658_2982792_-|AUT52699.1|DBSCAN-SWA MAKRDYYEILGVAKNASDDEIKKAYRKLAMKHHPDRNPGNKDAEEHFKEVKEAYEMLSDSQKRAAYDQYGHAGVDPNMAGAGAQGFGGFADAFGDIFGDIFGQAAAGGRSGRGGPQVYRGADLRYSMEITLEQAAHGYDTQIRVPSWVSCEICHGSGAKPGTKPETCPTCNGSGAVRMSQGFFSIQQTCPKCHGTGTYIPEPCTHCHGAGKTKETKTLEVKIPAGIDDGMRIRSAGNGEPGINGGPSGDLYVEIHIKAHAVFERDGDDLHCQMPIPFTKAALGGEIEVPTLAGRASFTVPEGTQSGKTFRLRGKGIKGLRSSIAGDLYVHVQVETPVKLTDAQRDLLQQFEKSLVEGGARHSPQSKSWFDRVKSFFD >CP026101|2972190:2984996|2983046_2984996_-|AUT52700.1|DBSCAN-SWA MGKIIGIDLGTTNSCVALMEGNQVKVIENSEGARTTPSIIAYMDDNEVLVGAPAKRQSVTNPKNTLYAVKRLIGRRFEEKEVQKDIGLMPYKIVKHDNGDAWVEAHGQKLAPSQISAEVLRKMKKTAEDYLGETVTEAVITVPAYFNDSQRQATKDAGRIAGLDVKRIINEPTAAALAFGLDKAEKGDRKIAVFDLGGGTFDISIIEIADVDGEMQFEVLSTNGDTFLGGEDFDQRIIDYIIGEFKKEQGVDLSKDVLALQRLKEAAEKAKIELSSGQQTEINLPYITADASGPKHLNLKITRAKLEALVEDLIERTIEPCRVAIKDAGVKVGEIDDVILVGGQTRMPKVQEKVKEFFGKDPRRDVNPDEAVAVGAAIQGQVLSGDRKDVLLLDVTPLSLGIETLGGVMTKMINKNTTIPTKHAQVYSTADDNQSAVTIKVFQGEREMAAGNKLLGEFNLEGIPPAPRGVPQIEVSFDIDANGILHVGAKDKATGKENRITIKANSGLSEAEIEKMVKDAEANAEEDHKLRELADARNQGDALVHSTKKALTEYGDKLDAGEKEKIEAALKDLEETLKSGSSDKAAIEAKIETVATASQKLGEKMYADMQAQGAAGAAGAGAAGAEAAAGQSQQADDVVDAEFKEVKKD >CP026101|2972190:2984996|2972190_2973258_-|AUT52691.1|DBSCAN-SWA MNQPKSAPNSPDSAQGLSYRDAGVDIDAGDALVDAIKPFAKKTLRDGVLGGIGGFGALFEVPKKYKEPVLVSGTDGVGTKLKLAFQLNKHDTVGQDLVAMSVNDILVQGAEPLFFLDYFACGKLDVGTAATVVKGIAQGCELAGCALIGGETAEMPGMYPDGEYDLAGFAVGAVEKSKIIDGSKIVPGDVVLGLASSGIHSNGFSLVRKVIERAQPDLNADFDGRSLADALMAPTHIYVKPLLALMQQLEVKGMAHITGGGLVENIPRVLREGLTAELDHRAWPLPPLFAWLQKHGGVADAEMHRVFNCGIGMAVIVSAADADTATGLLSAAGEQVWKIGVVRDSKEGEAQTVVV >CP026101|2972190:2984996|2977604_2978168_+|AUT52696.1|DBSCAN-SWA MTVAYLGLGANLGDARQTLKDAVVCLAQQHTITVLAKSSLYRTAPIDASGDDYLNLVVKLDTTLPVRHLLALCHKIEHHFGRERPFRNAPRTIDIDILLYGEHAIDEPDLIVPHPRMTERAFVLVPLVEIEPALIIPQRGRADAFLAAVAGQRIEKMKSPCQCLAPNTAEAIRTQDASTDMPKGGCP >CP026101|2972190:2984996|2975933_2977556_+|AUT52695.1|DBSCAN-SWA MIKKLIRKLFGQDAEPADEVAPPAEADDYAVPEERAHSSRAARTSSKGAASSKGGTRRKPAPAAVPEPDAPVIISSEIHGIDPSLISRNAIRVTEGLQQAGFRAFIVGGAVRDLLLGIAPKDFDVATDATPEQVQKLFRRARIIGRRFQIVHVQFGQEIIETSTFRALVDAPPADADAPPPRRLKRDELDRRTHAVDASGRVLRDNVWGEQHEDATRRDFTVNAMYYDPATQTVLDYHNGMADVRARLLRMIGDPATRYREDPVRMLRVVRFAAKLDFDIDEATRAPITELADLINNVPAARLFDEMLKLLLSGHALACLQRLRKEGLHHGLLPLLDVVLEQPHGEKFITLALNNTDARVRAGKPVSPGFLFATLLWHDMQQRWQQYEANGEFPVPALHRAMDDVLDMQTEKLAIHKRFSSDMREIWGLQHRLEKRSGRSALKLLEHQRFRAGYDFLLLRCESGELDESVGSWWTEFIEGDIAAREALLAQGGKDRAPRKRRRRSSNSRNRSKQGDGMEGGTASGNRATDDASHDGPHDD |
11 | Pandoravirus(25.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP026103_1 | 1312920-1313005 | Orphan |
NA
Consensus repeat of CP026103_1
|
1 spacers
spacers of CP026103_1
>1.1|1312944|38|CP026103|CRISPRCasFinder AAAAGGCCGATAACACCGAACGGCGACGCGTGAAGCAC |
CRISPR arrays and Neighbor proteins around CP026103_1
The CRISPR arrays of CP026103_1 >merge|CP026103|1|1312920-1313005|CRISPRCasFinder GCTAACGTCACGCGGATGCCAGCGAAAAGGCCGATAACACCGAACGGCGACGCGTGAAGCACGCTAACGTAACGCGGATGCCAGCG >CP026103|1|1|1312920-1313005|CRISPRCasFinder GCTAACGTCACGCGGATGCCAGCG AAAAGGCCGATAACACCGAACGGCGACGCGTGAAGCAC GCTAACGTAACGCGGATGCCAGCG
>CP026103.1|AUT57097.1|1310925_1311915_+|cytochrome-d-ubiquinol-oxidase-subunit-II MDSSTHSLLAYVWFGLLGLMLVFYVVTDGFDLGVGILSLLRSRREDRDVMVESIGHVWDANETWLVVLGGGLFGAFPLAYAQLMQDLYLPIMALIAGLIMRGAAIEFRHSVEHGPLWDKVFGIGSLVAALAQGVVLGKIVTGLVPGEMSQGFVVVAAIGVVAGYALLGATYLVKKTVGMIEQWSRRLALLSAVLTVAAALLLTIATWFFSDVGLERWSQPGVMHVLIALGLAAALAFAFIMASLYMGASRGPFRGAVTLFIVSFAGLAVSLFPDFVPGKLGIVEAASDSHTLAFMLAGIGLIFPVMIGYNLYQYYIFRGKVVGEAHAGE >CP026103.1|AUT57096.1|1309494_1310895_+|cytochrome-ubiquinol-oxidase-subunit-I MLDHVANLSRAQFAMTAIFHILWPILTISLSAFLVLVEALWIKTGDVMYYRQARFWSKLLVLNFAVGVVSGIPMEFQFGTNWAGFSQYSGQFIGNILGFEGAMAFMLEAGFVGVMLLGWGRVPRGVHLFATGMVALGSSISAFWIMVANSWMQTPAGYAVVDGKIEVTNYLAAIFNPDMVWGVSHMWVAAIETGMFVIAGISAYNLFRKRHPEFFARSFKIALTVLVIAAPLQVWLGDSSGVSVFETQPAKGAAIEGHWHTNAPGTGASWSLLAWPDKQAQRNDWSLEVPGMLSVLGTHSLHGQVKGLTDFKPEDQPPMIPLLYYAFRVMAGIGFCFMLLAFWTVYALRKARGSLDALLARRKLLLAWVLCIPLPYVAVEAGWIVREVGRQPWVVYGLLRTSQAASTVAPSSVSLSMAMFFAFYVVLLVTFFVLARRWLRTGPDLTSVPPAIVTARAASTKSISGY >CP026103.1|AUT57095.1|1307144_1309502_+|formate-dehydrogenase MGKKKVIRIYSDPAGGWGALKATGEALTLQGIPVSGAKTLLHMNQPQGFDCPGCAWPDPKHTSSFEFCENGAKAVAWEATVNRCTPEFFAAHSVSELTAWDDYDLEMAGRLTHPMVYDASTDRYAPISWDDAFALVGRHLNALDHPDQADFYTSGRASNEAAFLYQLFVREFGSNNFPDCSNMCHEATSVGLPQSIGVGKGTVLLEDFEHADAIFIFGQNPGTNSPRMMSDLHSASRRGAKIVSFNPFRERALERFASPQNPVEMATLGYTPISTFLYQVKVGGDVAVLKGMMKAIVEADDAALAADKPRILDIEFIQGHTHGIDALLDDLRATSWDAIERHSGLSRADIENAANIYMQADNAILVYGMGITQHHRGTENVQQIANLALLRGNVGREGAGICPVRGHSNVQGNRTVGITEKPNKGLIEGIERAFGFRPPANHGNDVIATLEAMMRGDAKVFIGLGGNFAAAIPDWVRMQEAIRKLNLTVHIATKLNRSHLVHGKEALILPCLGRTEIDIQAGGPQSITVEDSMSMVHASAGRNEPASPHLMSEPAIVAGIARATLGEKSRVPWEQMVANYDHIRDAIEIVFPIFQAYNERIRVPGGFHLTSNARERVWDTPTGRANFLVFKGLDENPWHDDPDALWLTTMRSHDQYNTTLYSHSDRYRGVFGQRDVVFMNQHELHKRGLHPGERVDIVALSTDGIERVIRSFKVVEYSLPDGCCGAYYPEVNPLVPLYAFDPQSRTPSYKSVPVKIGRAAAVGPDSATRAIVMQAASHAGENSHA >CP026103.1|AUT57741.1|1306521_1306869_+|hypothetical-protein MRTIAFPFFRPSYAIAACSVALAATCAYQAPGVMQAILVVGANAQLPLERLAAEAGKPTPVSATDVSSPWKEHVPARARPAMLKLGSFGEPVRERHTHRGMGGYQRSVTDYKYWT >CP026103.1|AUT57094.1|1305547_1306522_+|LysR-family-transcriptional-regulator MDKILSMRIFSRVVESGSFSAVADHMNCSTGSVSRAVSSLEDHLHARLLQRTTRKVSLTEPGERYYRKCKKILADLEDAEAEAGDAHTSARGTLRIHCVTDLGLAQLTHSILEYRKRFPSVAVQVKFLPRMANLLEDDVDVSIVAAPALPDSRNVCKLIGHCERVLVASPAFLQTHRVETANDLDEHALTPMPFRVEPNGHPVKLSLVKPAGQSTGPEGARQFAINDTEATRIATLAGAGVAALPVHCVIDDLRNGRLLQLFPESRLQNTSVFAVYSSRHHIDAKIKTFIDFMTSHLKEALDTRVLTGHQPQTFSHVARVMENA >CP026103.1|AUT57093.1|1304316_1304958_-|HD-domain-containing-protein MNKNVAGVDIPDGVLARAAFEHVRGIEPELLLHHALRVFLFAALIGCKEALAFDMELLYVSALFHNAGLNERYAHSPNRFEIDSANAAREFLRCHRADESATAQVWTAIALHTTPGIPEHMPPLVALLSAGVQMDVRGARYHEFIAQQRNDIVQAFPRERGFKTKLIEAYARGMEHRPETTFGTVNADVLDRWDPDYRRLNFCGLVLGSEWPH >CP026103.1|AUT57092.1|1302527_1304273_-|FAD-dependent-oxidoreductase MNTPAINPPDLLSAQPDAGAPDLAMPYSSLEFRQHQMFPRLSAAQIASLRRFAQPMSFRAGELIFETGRIALGLFVLLHGRVRISSRDSFGRSTLVTEHDDGHFMAEMAQLSGKPALIDGVALTDCDTLVVSPDKLRALIVADAQLGEHIMRALILRRLGLIEQGLGPIIVGNGDDARLVRLQGFLRRNAYPATVIDARHDAEAATLLAGITTGPDDFPLVFCPNGSVLRAPDEAQLASCLGLVPTFERSHVYDVAIVGAGPAGLAAAVYAATEGLSVAVFDQRAPGGQAGASSRIENYLGFPTGISGQALAARAFQQALKFGAHLAIPGKVTCVDSEDGIHGLTLLDGQRVNARTVVVASGAAYRKPGIAGFDRFEGSGIYYWASPIEAKLVKGQDIVLIGGGNSAGQATVFLANFARSIRVLIRGADLNASMSKYLIDRIGSLPNVSLCTRCTLQALEGDEAGLTHVRVRREDEGDETIETRHLFLFIGADPKTDWLMSSGVELDSHGFVVTGFARRSQTPGGSGIHYPLETSLPGMFAVGDVRSESTKRVASAVGDGAAVVSQIHAYLAHCHAAAQNS >CP026103.1|AUT57091.1|1302261_1302531_+|hypothetical-protein MHGWKNPNDPFKKFEVVARASIAKVPYEVRIEAKACWRCAARAALAYPTQRWVSCNWIAAAGAISPDVTRTATCTRAQRVRRIAAASRG >CP026103.1|AUT57090.1|1300251_1302117_-|transcriptional-regulator MTRVGPLDIDLTRREARVDGMTVRIGNRAFDILELLIEAQGGLVSKETILERVWPDSVVGDNNLQVHMSALRKLLGDSRDLIKTIAGRGYRLVGSGACVQHEAGASLHDAPHGLVQSAVPNNLPACGSVLVGRDEATAHVSTVLRNARHVTLVGSGGIGKTRVAIEVARRLLEHAPGGVYFVSLGSASDMSCVLAMMASVIGVPPESGCSTRERIVEAIGGRRMLIVLDGCEHVIDGAAQLANHLLNACPHLRVLSTSREPLRIPSETLYWVPALDVPEPNDDTPRVRRCSAVSLFLIRARAIDARFATDDASLHVTGMVCRRLDGIPLAIELAAARAALLGIDTLAAHLDDRFGMLTGGTRTALPRHQTLKATLDWSHALLDEAERKTLRRVGIFADRFPLEAAVAVASDHETRELDVVAAMAALVEKSLVVASTGPGIASFRLLETTRMYARQKLDDNGERRVVALNHARYLSTVIDSNARAAGQCGGERWRSGMPALLDEVRAALGWVLSEDGDAALRETLPANAVFLFYELSLIDECCTWARRALAAIAPADESAHALPRQRARLRLLAALGAALVRVRGPNPETHAIWNEVLASAIASGDRPHADAGSRLISATPL >CP026103.1|AUT57089.1|1299297_1300218_+|LysR-family-transcriptional-regulator MNLSFEVLQALDAIDRTGTFAAAAEELHKVPSSLTYLVQKLEVDLGVKLFERTGRRAKLTHAGRVIVEEGRRLLEAARELELKAKRIEHGWESELRVAIDEIIPFDLIWPHVTEFYKLNLGTRLLLSKETLGGTWDALITRRADLVVGAAGEPPPIANLVAKPIGSLQHAFVMAPGHPLASAAEPLTMDAVARHRAVAISDTSRKLTPRTIALAANQEVLTVPTLETKLAAQIRGLGIGTVPECIAAGPLNRGQLVRKEVSGMRSVTHFYLAWRDDEAGKALRWWVDQLDRPDLIDDVAHRLVAMS >CP026103.1|AUT57098.1|1313187_1313574_-|DUF3331-domain-containing-protein MLANANVMDPWTQTIGLLGTASRLMAVAEAAAQPRHKTRSADEPVGAQVTLIDRPTPSTATIAWRDSTRGCFGDQVWRMARARMPGFCAMSGQAIRPGDAVYKPNPRPTPVNGDAMILASVLRDAATL >CP026103.1|AUT57099.1|1313644_1314610_-|AraC-family-transcriptional-regulator MTTTMLDLHAPLADVVSRRQASPIVAEQQWRRMTASASAHDAATQRDNVVVMRWTHNGDAPLEVSNEGSADDHCIGLNLKCAAMTFDHAGRRLVHGRLTAGAVQVTAPAVPTKAVFASSADVLHLFVSQQVLAECYQDLFQHSRDTGIVLDDPELIRDPVLERLGQALAVSQSNDAALGKMFTDSVSLAIVSHIVARHFAGATRRSREAAPLPQWRMNRVIEFVDAHLAEPIGLADIAASAGLTRMHFAAQFRRATGVRPHEYLLRRRVEHAQHLLVTSKHNVMDVALSCGFRSQAHFTTVFKKFVGETPHRWKEKTNDAR >CP026103.1|AUT57100.1|1314740_1314938_-|hypothetical-protein MPARRQTHEPNRRCAKAKRGCQRSGQNTPNQTKPNQTKPNQTKPNQTKPQPAPRPRRRPLNLHIG >CP026103.1|AUT57101.1|1315691_1316366_-|DNA-binding-response-regulator MSSDEMNDPNQSIVYVVDDDDSMRAAVTMLLRSVGLRVEAFASAQEFLSLDKPDIPSCLILDVRLKGQSGLAVQEQIAAGNVHVPIIFMTAHGDIAMSVKAMKAGAMDFLAKPFRDQDMLDAVATALAKDEERRKSERSVSDLRKRYESLTPREREVMAFVASGLMNKQIAAEMNLSEITVKIHRGQAMKKMESRSLADFVLKAEALGVKSLEGGASARTQRGV >CP026103.1|AUT57102.1|1316355_1318557_-|PAS-domain-S-box-protein MMFRQAAGMTSARDARVLFSLAGIVGVIVFVIDALTPLDIAIAVLYVVVVMLVASTGLRHATIATACACAALTVIAFLMSHDENYSGGSIARGIVSLLAIGTTSFLSLRNQANTARLQEQIQLLNLTHDAIVAYDMSDRITFWNQGAEELYGWTAEQAIGQRIHELTRTSSSIPVHELRDEVVRKGRWEGELERVRSDGSSVIVSSRFALWRDDKGRPRAILATNNDITMRKRMEAELQRQQEDLRATIDAIPGMVWSSSRDGELSYINRRWNELGITLTGGSGDVWTSIVHPDDWPAMHAAWRGAIATGKPFENVARIRQSNGSYRWMHIGADPLRDQNGQILRWYGVNTDIEERKQAEQALERSEAFLSDAQRLSRTGSIATRLPAGAMWWSDETYRIFEYSPDYTPGMELILARTHPDDLALVREAYESGRSGAPYVDVEHRLQMPDGRIKYVHYVAHLAVPQSASIEYVGALMDVTERHLAQDALDRSTAELAHVTRVTMLGELAASIAHEVTQPLAAIVTAGDAATRWLNRAKPDLGEVGQSISQMVRDAKRASDVIRQIRSMAQKRDPSQAVLDLNGIVRESIELVRRELDAARVELEASYAEPPPLVCGDRVQLQQVVINLVMNGVQAMAGITGQARRMCIATSRVDGHYGQVAVEDSGTGISEENVGRLFNAFFTTKADGMGMGLSICRSIVEAHGGRIWAESEEGRGATMQFVLPIDKGTCDEQ >CP026103.1|AUT57103.1|1318882_1319263_-|response-regulator MHNHPIASVIDDDESVRTAMSSLVRSLDWDVRLYASAEAFLASDVDQVACIISDVQMPGMSGLDMYRHLLDKGVTQPIIFISAFASDAVRRQALDLGAMCVLTKPVDGAEVSRCLARLEPDGSQGE >CP026103.1|AUT57104.1|1319558_1319846_+|DNA-binding-protein MHRLFANEENNVSAVIEHLEGEARERLIVWLKRRMQECNITLEALQHALQQDIDEAKRVRYRDASGNTWTGDGEHPEWLRRAVAAGQSVDHFLCE >CP026103.1|AUT57105.1|1320034_1320946_-|PilZ-domain-containing-protein MPLVPLSQREVTIGVPLPFSVYTADGRLLMARGHIIHSAAQCERLFVQGPFRQPFPGERREDARPEDADITPPSAGRARRGHEDQTLVGPFPVSGCIPEDFVITLANGPAISSRTRFVGALDDVSLLLAGAGVDPAFAPGEAVEGQFIAGRYRHAFESEVVGRHTSPFDVLYLRYPTEVRSRALRRHVRVGIDVTARLSQNDRPMAGTEVRAVDLSAAGVGLLVNANSNSLAPGEHFKLSLPLARAGRVRTAPLNCIARNRRTKDGETLVGAEFGNTSGDVRALVKEYVLDVLTGAVPPERHA >CP026103.1|AUT57106.1|1321260_1321686_+|glyoxalase MTTQALTSGIDHVGLAVRDLNLTRDFFVECLQWKQVGEKPDYPAAFVSDGHVMLTLWQVTNQANLVAFDRKTNVGLHHLALRVGSEEALSEIFRRVSQWPGVKVEFAPENLGAGPKRHTMIYEPGGIRLEFDFDPRLKAAG >CP026103.1|AUT57107.1|1321855_1322803_+|LysR-family-transcriptional-regulator MDQLYMLRAFVSAAQHQSFSKAAASLGVTTGSISKAIAKLETSIQTRVLHRTTRSVTLTEEAQSYYLSCCRLLEELDEANRRIMREREVDSGKLRLVIHPMLVSETFSQFLSSYRAVAPNVNLVVSVDEGAVNLYDGQFDMAMLPPHQVEQSAVIRRTLFKSSRSLVASADYLAQRGTPHRAADLAGHFLLLPSQSRQRSTNYVQVIENGQPVQVIPMSSMDGNDVLLRAAALAGAGIAELPEAMAREDVAMGKLVPVLPGCSISDSEVEICLFYSHRELLPARFRTFVDFCTEFFRLNSARRRAPLPDAQAQAA |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
CP026103_1 | 1.1|1312944|38|CP026103|CRISPRCasFinder | 1312944-1312981 | 38 | NZ_CP013104 | Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence | 768027-768064 | 0 | 1.0 |
CP026103_1 | 1.1|1312944|38|CP026103|CRISPRCasFinder | 1312944-1312981 | 38 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 508524-508561 | 3 | 0.921 |
CP026103_1 | 1.1|1312944|38|CP026103|CRISPRCasFinder | 1312944-1312981 | 38 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 1036100-1036137 | 5 | 0.868 |
1. spacer 1.1|1312944|38|CP026103|CRISPRCasFinder matches to NZ_CP013104 (Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence) position: , mismatch: 0, identity: 1.0
aaaaggccgataacaccgaacggcgacgcgtgaagcac CRISPR spacer aaaaggccgataacaccgaacggcgacgcgtgaagcac Protospacer **************************************
2. spacer 1.1|1312944|38|CP026103|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 3, identity: 0.921
aaaaggccgataacaccgaacggcgacgcgtgaagcac CRISPR spacer caaaggccgataacaccgaacggcgacgcgtgaagcga Protospacer ***********************************.
3. spacer 1.1|1312944|38|CP026103|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 5, identity: 0.868
--aaaaggccgataacaccgaacggcgacgcgtgaagcac CRISPR spacer ccaaaacacca--aacaccgaacggcgacgcgtgaagcac Protospacer **** .**. ***************************
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP026102_1 | 235430-235534 | Orphan |
NA
Consensus repeat of CP026102_1
|
1 spacers
spacers of CP026102_1
>1.1|235470|25|CP026102|CRISPRCasFinder GAAAGCAAAGGCCGAAACACCGAAC |
CRISPR arrays and Neighbor proteins around CP026102_1
The CRISPR arrays of CP026102_1 >merge|CP026102|1|235430-235534|CRISPRCasFinder GGCGACGCGTGAAGCACGCTAACGCATCGCGGATGCCAGCGAAAGCAAAGGCCGAAACACCGAACGGCGACGCCTGAAGCACGCTAACGCATCGCGGATGCCAGC >CP026102|1|1|235430-235534|CRISPRCasFinder GGCGACGCGTGAAGCACGCTAACGCATCGCGGATGCCAGC GAAAGCAAAGGCCGAAACACCGAAC GGCGACGCCTGAAGCACGCTAACGCATCGCGGATGCCAGC
>CP026102.1|AUT53750.1|234196_235336_+|IS481-family-transposase MPWDVKDIMNRREDFVREAATQALAFSELCRKYTITRQTGYKWLARHRAEGIKGLADRSRRPHHSPKRSAQTIEARVLEMRQAHGWGGRKIAQRLRDLGETQIPAPATITEILRRHGLIDEQASRQRQHWQRFEHEYPNSLWQMDFKGDFPTLESGRCAPLTVIDDHSRYNVVLSACSRTTTQVVQEALERAFRCYGLPSCINTDNGAPWGSPSAPGQLTELAVWLIRLGIHVSYSRPYHPQTNGKDERFHRSLKAEVLQRHAFTTHEHVQRELDRWRQVYNTERPHEALGMAVPLTRYACSLRRMPGRLPEPEYRCGDAVLRVNSSGVVRVRGEKLKLSIALKGLQVAARPSEDEDGVIDIWFAHQRVAKLDLKAAKP >CP026102.1|AUT53749.1|231977_233342_+|MFS-transporter MQQPKHTVDVQDFIDSQRFSPFQWTILVLCFLVVAADGFDTAAVGFIAPSLVQDWGVARSALGPVMSAALVGLGIGALGAGPLADRIGRKTVLVLSVFFFGLWSLAAARADSIESLTALRFMTGLGLGAAMPNAVTLMSEYAPARIRAVAVNAMFCGFSCGLAIGGVASAWLIPHFGWHSVLVAGGVGPIVLTLVLIMLLPESAQFMVTRRRGDARIAKVLSRIAKDVRFGECRFVTGEPVAEHRGSALRVVLSSRFRFGTLMLWLAYFMGLLIYYLLTNWLPTLFKDTGFSGQNAALMTSLFPLGGVLGNLSVGWLMDRFRANRVIACTYVVAAVLVMLVGRGLGHQVWLGLLIFLTGTVVTSAVTSMSALAASFYPTQGRATGVAWMLGVGRIGGVAGALVGAALMGLGWQFGSVFSLLAVPAMIAAVGVFAVAARVRESGVGAVELTPAVE >CP026102.1|AUT53748.1|231569_231908_+|NIPSNAP-family-protein MSSGKPFIDHRIYTIRPRGMAEFIEVFDRLAMPIQLKYLGAPVGFYMSDIGALNQVVHLWGYESIGDYDQRRTARDADPEWPAYLQASAHLIVGQESRIIRRVEFRTLTALR >CP026102.1|AUT53747.1|229800_231558_+|FAD-binding-protein MNSNTHSPAPLTCDVLVIGSGAGGLSTAITARKHGLDVVVIEKEAYFGGTTAFSGGVLWIPGNRHARANGVSDTREAAKTYMRNETGAFYDGAAVDAFLDTGSQMLDFFERETEVKFVPTLYPDYHPNVGGGVDIGRSVVAAPFDARGLGDDIARLRPPLKTITFIGMMFNSSNADLKHFFNATRSIKSAAYVAKRLASHLKDLALYRRGVQITSGNALAARLAKTALSLGIPIHTNTAAQELMVSDKRVTGAIVKGPQGEMRIAARRGVVLACGGFSHDVARIAQAYPHVKRGGEHCSPVPKGNTGDGARMAESVGARVPIRYPQPAAWMPVSRVPMRDGTFGVFPHLLDRYKPGIIGVTRKGKRFTNEANSYHDVGAAMIEACRDEKDTAMWLICDHATIRKYGLGYAKPAPVPLGPLLRNGYLVKGRTLAELAQRAGIDAEALEATVRIYNEGATRGEDPEFGRGSTSFNRYLADPECKPNPCVAPIARGPYYALKVVMGDLGTFDGITTAVTGEVLDARGAVIDGLYAVGNDRASVMGGNYPGAGITLGPIMTFGYITGRRLAGISDNATSAQQRRQSETV >CP026102.1|AUT55821.1|228936_229767_+|SDR-family-NAD(P)-dependent-oxidoreductase MTKQNESEWLALNDKVCVVTGAAGGIGSAIAKVLGESGARLALLDREAGKCEDLAQTLGANGIEAFSFACDIGDARSVEAAAASVEAKLGAADVLVNNAGLLRPGGIEDIALDAWNAMLQVNLTGYMLCSQAFGRAMLRKGTGSIVHVASVAAHHPQTWSGAYSPGKAAVAMLSKQIAAEWGPRGVRSNAVCPGMIRTPLSASFYEQGDVEQRRSAMTASRRIGEPVDIADVVAFLASPRAGYVNGTELVVDGGLECMLMDLVPRPGFDAKANPAR >CP026102.1|AUT53746.1|227897_228791_-|AraC-family-transcriptional-regulator MRKIPNYDLYGESARPPWFDAFNFEWIPERSRPNDWHIAAHRHDALLQVLYIRSGSGHVVIESEKHVLAPPCIVVLPAQTVHAFVFSPEIDGLVITAAQRALESISKAVSPGLLPIFQRAAVIPVKASAGDDILMPLFTLLEQEYRGNARGHIAAGMSLMIALFVQVARLGDAAAMPATNAVADRRSGQIKRFRELVAAHFREHRTVEFYAEKLGITTAQLSRICRDELGHSPMSLVNEHLIREAQRDLVYSGLTIKQIAHALGFEDAAYFSRFFRKQTGATPKEFQAAAHTDLSLN >CP026102.1|AUT53745.1|226713_227886_+|4-hydroxybenzoate-3-monooxygenase MRTQVGIIGAGPAGLLLSHLLHLQGIDSVVLESRSREQIESTIRAGVLEQGTMDLLTETGVGERMKAEGALHHGFELAFEGKRRRIDLTDLTGKSITVYAQHEVIKDLVTARVAAQGALKFEVSDVSLHGTDGTAPSIRYRHRGEAHELQCDFIIGCDGSQGISRNAIPEALRRDYQRVYPFGWFGILVEAPPSSDELIYARHERGFALVSTRSPNVQRMYFQCDPKDTVDNWSDDRIWAEMHARVDSDEGHQVVEGKIFQKNIVGMRSFVSTTMQHGRLFLAGDAAHIVPPTGAKGLNLAVSDVRILSDALRAFYKEDRNDLLNSYSETALKRIWRAEHFSYWMTRMMHRLDDASPFEQQLQVAELEHVTTSRSAAISMAENYVGAVPV >CP026102.1|AUT53744.1|226137_226494_-|transporter MKIRNVLRLLAVVGCVTLTSNVYAQASDSMSMASTPSAPSKKATPADKKLGRDVRKALSKAPGFNVSNVFVKARGGAVVLSGSVPDGSQIPQATEVAKGVAGVTSVSNKLTLYSHGNN >CP026102.1|AUT53743.1|225646_226108_+|isoquinoline-1-oxidoreductase MLKLNINGKTVEVHSDPATPLLWVLRCELKMTGTKFGCGVGVCGACTVHVGNEAKTSCQEKLSDIGTSRITTIEGLQGTQARALKEAWTHVDVVQCGYCQSAQLMAASALIRRNPTPSRKEIDCAMHGIICRCGTYPRIRQAILEATGQGKLT >CP026102.1|AUT53742.1|223352_225653_+|xanthine-dehydrogenase-family-protein-molybdopterin-binding-subunit MLKRRTFLLGGVGLAGALVVGWSALPPRQRLVGSEALPVRPGEAALNGYVKIAADNTITVLMCRTEMGQGVHTGLAMLVAEELDANWADIRVANAPLDQIYNNVESVVGDLPFRPDDDSVVKELAVWLTRKLARDFGTVMTGGSSTINDLWRPMREAGACARTMLIAAAAERWSVKAADCRIEKGIVVHDAGHRASFGQLAMAATRQPLPRNPALKDPAAFRLIGKPLTRIEAASKLDGSAIFGIDVVPDGLLYASIKMCPTPGGTVRDFDGAAAAALPGVRKVLAVDAYNGGTGGVAVIADNVFIAMNALDMLTIHWNDGPTRGLTNAEVDRRLVQALDEGEGHAWYRHGDVEDALNRAAHTLKATYRAPYLAHAPLEPVNCTAQVKDGKATVWAATQVPAVARMHVARLLGIGTDDVDLQQQMPGGAFGRRLEVDFIAQSVAIAREAGGRPVQTLWSRQEDMQHDFYRPACLSRFRAGLDAQGQLIAWHNTSVSQSVVATWLARNYRIPDLGLNLDKTVSEGAFDQPYEMPNVWIGQRVVELPMPVGFFRSVGHSHQAFFIESFIDELAALARKDPVAFRASLLRRHPRHLAVLQKVASMSGWRAPAVWSEKGVRHARGVALHEAFGSVVGQVADVSLEAGNTVKIDHVYCAIDCGLPVNPNLIRQQVEGAIVFGLSAAFKEAITLADGAVIEGLYTQFDVVRMDECPDISVEIMPSKDHPQGVGESAVPPVAPAVANALFALTGTRSYALPLNMKLYSRGATC >CP026102.1|AUT53751.1|235586_236354_-|ABC-transporter-ATP-binding-protein MIDVDAVTVRFKTAAGAVDAVRNASFHVAQGEVFGLVGESGSGKSTILRALSGLTPIAQGTMRIAQHEQAARKRDVQMVFQDPYGSLHPRFTVDQTLREPLRISGIDRHEERIVNALREVGLNASFRFRYPHQLSGGQRQRVAIARALIVEPRVLLLDEPTSALDVSVQAEILNLLKRLHQERNLTMILVSHNLAVVGFLCSRVAIMRNGEIVEELDIGRVRAQQVESEYSRSLLLATGGYRRKAVEVIGVDASL >CP026102.1|AUT53752.1|236350_237232_-|ABC-transporter-ATP-binding-protein MSTHDTTRALCEIDDLRIAFRAHDGTMNEAVRGLSLTLNKGERLGIVGESGSGKSLTGRALLGLLPPAAHCTAKTMRFDGSDLLDMRADQRRKLCGQQMGMILQDPKYSLNPVMTVAQQMREAFALHEPKLGRRAMREKIIAALEAVHIRNPERVVDSYPHELSGGMGQRVMIAMMVSTGPRLLIADEPTSALDVLVSMQVLAVLDEMIAKHDTGLIFISHDLPLVMSFCDRVVVMYAGRVVETCAARDLVHAQHPYTRGLLAANPPLANPPDELPVLSRDPAWLNDVQGASA >CP026102.1|AUT53753.1|237233_238148_-|D-ala-D-ala-transporter-subunit MNTPRPTLKEWLLTDTPASRRQAALGLAYRRWRRFRGNPLSVFGFSILVLLVIVAIIGPWIAPHDPLRQVLSDRLLPPGSASHWLGTDQLGRDILSRIIYGSRLTLSIAILVVVVVVPIGLLIGTTAGFFGGWVDNVLMRVTDIALAFPKIVLALAFAAALGPGVFNAVIAISITAWPAYARLARAETLRLVQTDFIHVARLQGASNLRILLRYIVPLCSSSVIVRATLDMAGIILTVAGLGFLGLGAQPPSPEWGFMVASGRNVLLDSWWVATIPGFAILLVSLAFNLLGDGLRDVFDPRHGD >CP026102.1|AUT53754.1|238206_239277_-|ABC-transporter-permease MSTPITPIDQIRAASARSAGLRWTLRVLRWVLTLAITFTGLLAVTFVIGRKVPIDPVLAILGDRASASAYAAARIQLGLDKPLAEQFFIYVSAVLHGDLGVSLLTANPVIDDIKRVFPATLELATLSTIIGVLVGVPLGVIAAVRHNRWIDHVARFIGLIGSSVPVFWLGLMGLLLFYAKLHWVSGPGRLDPVFDGMVEPRTGSLLIDSLMAGEWDVFFNALSHIALPAAILGYYSVAYLSRMTRSFMLDQLNQEYITTARAKGLSERRVVWVHAFGNIAVPLLTVIALSYSFLLEGSVLTEIVFAWPGIGSYLTGALLNADMNAVLGSTLVIGITFIALNLLTDALYRVFDPRAR >CP026102.1|AUT53755.1|239290_240898_-|ABC-transporter-substrate-binding-protein MNLVLRNALAAVAVVSALTLTMTTTSTALAATPKDMLVIATTLDEFSTLDPGEVYELVPEEYVANTYDRLVRVDLKDPSKFNGDVAQSWTVSPDGLTFTFKIRPDLKFHSGNPLTADDVAWSIQRCVLLDKGAAAVLQGIGLTKDNALQNVKKIDDSTVSITTDQKYAPTFVLNVLGAWPASVLDKKLLLSHQKGNDFGNEWLRTNEAGSGAYKLVKWTANDSIILQKYDGYRMPLAMKRIVMRHVPEASSQRLLLENGDADVARNLSPDDLATLTKGNKVTVTSVPQATLLYLGLNVKNPNLAKPEVQEAMKWLIDYDGIQKNVTKNTFKVHQTFLPEGFLGALNSNPYHQDVAKAKALLAKAGLPNGFNVTMDVRSAYPYNEIAQAVQANLAQGGIKVEIIPGDNKQTLAKYRARQHDIYIGEWSADYIDPHSNAQGYAWNPDNSDKSSYKMLAWRNSWDIPDLTKETNAALAESSPGKRAQLYQAMQKEMLAKSPFVIMFQQVSQVAMRPGVSGLEVGPINDLVSYLHVKKQ >CP026102.1|AUT53756.1|240951_241518_-|D-alanyl-D-alanine-dipeptidase MTDTPQLIHITPETHGVELDLAYATADNFTGKPIYKEAHCLLLAPAEAGLRKAVELAASIGMKLRIFDAYRPPQAQQVLWDFLPDPTYIAELGRGSNHSRGTALDLTLIDSHGEALDMGTGFDAMVKESEHFHNGLPQHVQRNRLLLLGIMHAAGFTHIASEWWHYEIPGSRALPIIDNSESGPLKLM >CP026102.1|AUT53757.1|241523_242417_-|MurR/RpiR-family-transcriptional-regulator MSTAFAHTVEASFATLTPTAKRIASYMLANLERLGLETADQIAQQTGTSGISVGRFLRSVGYRNLDDLKRELRGAQSRPWFITDRLDAYRSERDDINDNGANGNAAGDAGDPSARSLDLELDAIRYVYQLAQGEIFARIAQRIAEADAVFILGIQSTRGISNAFYSYLEYLRPRVFYSDGMSGSYVDSLNSEFASPYLIVTDTRAYSRIARRYCEAATRRELPFALVTDLYCPWAREFPCDLIQVKTDVGQFWDSLAPLTCLFNLLLTSIVERLGPAIDQRVARNRELQRELDQFDL >CP026102.1|AUT53758.1|242864_244328_+|catalase MNKLTTAFGAPVVDNQNIQTAGPRGPALLQDVWFLEKLAHFDREVIPERRMHAKGSGAFGTFTVTHDISKYTRAKIFSQIGKKTELFARFSTVAGERGAADAERDIRGFAVKFYTDEGNWDLVGNNTPVFFLRDPLKFPDLNHAIKRDPRSGLRSAESNWDFWTQLPEALHQVTIVMSDRGIPKSFRHMHGFGSHTFSFINADKERFWVKFHLHTQQGIQNLSDAEATALVGADRESSHRDLYESIERNEFPKWTMYVQVMPEADASKTSYNPFDLTKIWPKKDYPLIEVGVMELNRNADNHFADVEQSAFNPANVVPGISFSPDKMLQGRLFSYGDAQRYRLGVNHSLIPVNAPRCPVHSYHRDGSMRVDGNMGGATPYNPNTRGEWLDQPDFSEPPLSIEGAADHWNHRTDDDYFSQPGNLFRLMSPEQQQALFDNTARALAGVSEPIRKLHIEHCTKADPAYGQGVAAALESAGTAGATPNRAL >CP026102.1|AUT53759.1|244407_245352_-|LysR-family-transcriptional-regulator MDLNDVRIFVSVVQTGSLMNAASRMGVPLATISRRIRALEKELNVQLLERSARGTRLTDAGARLYQHASLGVEILKDGEEAVVSDQAMLKGRLRISLPPAFDIWWDLLHDFQRRYPDIRLHVYTTERRVDLIEEGIDVALRVGAIAHEAMVARRMLSYRHVLVASPQLIERFGMPREPAALSRLPCALWNRAPNDANTWQLGKTTVEPHIVLTTNDYAQLRHRALNGEFVTEIPPFLAADSIRQGRLVPLLPAYPLPDQQVNLLYPSHRHPSAIVRTYLEFCQSRIAWFVDQCAIDWKSSSQDASAPDSLAQSK >CP026102.1|AUT53760.1|245442_245931_-|GAF-domain-containing-protein MFDATINTALPKAEFYRELASQARSLLEGESNQIANAANLSALIFHSLPELNWAGFYFALDGELVVGPFQGKPACVRIPMGRGVCGRAAETRETQVVPDVDAFPGHIACDSASRSEIVIPLQKASGELVGVLDIDSPVLARFDDEDRRGLEEVARIFVASLH |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP026102_2 | 791753-791834 | Orphan |
NA
Consensus repeat of CP026102_2
|
1 spacers
spacers of CP026102_2
>2.1|791776|36|CP026102|CRISPRCasFinder ACGCTAACAATTCGCGGACGCCAGCGAAAAGGCAAA |
CRISPR arrays and Neighbor proteins around CP026102_2
The CRISPR arrays of CP026102_2 >merge|CP026102|2|791753-791834|CRISPRCasFinder CGCACAGGGGCGACGCCTGAAGCACGCTAACAATTCGCGGACGCCAGCGAAAAGGCAAACGCACAGGGGCGACGCCTAAAGC >CP026102|2|2|791753-791834|CRISPRCasFinder CGCACAGGGGCGACGCCTGAAGC ACGCTAACAATTCGCGGACGCCAGCGAAAAGGCAAA CGCACAGGGGCGACGCCTAAAGC
>CP026102.1|AUT54199.1|789530_791060_+|methylmalonate-semialdehyde-dehydrogenase-(CoA-acylating) MSAIPSTLADRKVDTVKLLINGEFVESSATEWRDIVNPATQEVLARVPFATKAEVDAAIRSAHAAFATWKNTPIGARMRIMLKYQALIREHMPRIAKTLTAEQGKTLPDAEGDIFRGLEVVEHACSIGTLQQGEFAENVAGSVDTYTLRQPIGVCAGITPFNFPAMIPLWMFPMAIVCGNTFVLKPSEQDPLSTMQLVELAIEAGVPKGVLNVVHGGKEVVDGLCTHDLVKAISFVGSTAVGTHVYNLGSQHGKRVQSMMGAKNHAVVLPDANREQTLNALVGAGFGAAGQRCMATSVVVLVGASKEWLPELVAKARTLKVNAGHEAGTDIGPVVSRSAKERILGLIEAGVKEGATLALDGRGVDVPGYEAGNFIGPTVFSDVSVEMDVYKTEIFGPVLCVMSVPTLDDAIALVNSNPMGNGVGLFTQSGAAARKFQSEIDVGQVGINIPIPVPVPFFSFTGSRGSKLGDLGPYGKQVVQFYTQTKTVTARWFDDVTVSDGVNTTISLR >CP026102.1|AUT55883.1|787729_789436_+|AMP-dependent-synthetase MTAAQSFFDARDLLLRHRTDYERAYREFRWPELGEFNWALDYFDVVAKGNDNPALWIVDDPAQEGLKLSYAQMSERSSRMANFLRGLGVARGDRLLLMLPNRVELWDVMLAAMKLGAIVLPATTQLSPDDVRDRVQIGGANFVVVDSAETGKFDALDTPLQRISVGAPREGWTDIAAAYAASPVFTPDGATRAADPMLLYFTSGTTSKPKLVEHTHESYPVGHLSTMYWIGLQPGDIHWNISSPGWAKHAWSCFFAPWNAQACVFVYNFARFAPKDTLDVLVRYQVTTLCAPPTVWRMLVQEPLASYPVKLREIVGAGEPLNPEIIERVRHAWNIVIRDGYGQTETTCQIGNPPGQPVVPGSMGRPLPGYRVELVDADDHPVTEGEIALPLGSRPLGLMTGYANNAKATEHAMRNGYYHTSDVAMRRDDGYLVYVGRSDDVFKSSDYRLSPFELESVLIEHEAIAEAAVVPSNDPLRLSVPKAFVTVRHGFEAGPELARDVFRFSREKLAPYKRIRRLQFSDLPKTISGKIRRVELRRRELERTAEPARLQDEYWEEDFPELRNGNGN >CP026102.1|AUT54198.1|786528_787662_+|acyl-CoA-dehydrogenase MDEFYTDEQRMIRDAARDFAVERLAPNAAQWDREGQLPADVVGQMGELGFLGMIVPPEWGGSYTDYIAYALALEEIAAGCAACATMMSVHNSVGCGPILNFGSDAQKDRYLADLATGKRIGAFCLTEPHAGSEANNIRTRAVLRDGQWVINGSKQFVTNGARASIAIVFAVTDPDAGKRGISAFIVPTDTPGFNVGRPESKLGIRASDTCPIALDDCAVPEANLLGAPGEGLRIALSNLEGGRIGIAAQAIGIARAAFDAARAYANERVQFGKALKEHQTIANMLADMATRLNAARLLVHHAARLRSAGRPCLSEASQAKLYASEMAEEICSKAIQIHGGYGYLEDYAVERHYRDARITQIYEGTSEVQRMVIARHV >CP026102.1|AUT55882.1|785201_786263_-|AraC-family-transcriptional-regulator MKTERGTISVSLVEETLALARARGVDVQPIVEAAGIAPQVLASAKSRVTPAQYGALWANIARTLDDEFFGQDAHAMKSGSFIAMTQMALTARNGGQALTRAVNFMRLVLDDMCAQIVTRDDRVRLQFAHRDGAPQPAMFAYATYFILVYGLVCWLVGRRIPLIEARFRCAEPPAAHEYRLMFCDDLSFGQSESYVDLAPDFLELPVVQTTKSIKPFLRDAPASFIIKYRNPGSLAARVRKTLRALPMPAWPGSDEMAQRLHVAEATMRRHLKQEGYTYQSIKDDLRRDIAISQLQRGGQSVADIAATLGFAEPSAFHRAFRKWTGMRPADYRAVNAHAAGAGISRRAERAESD >CP026102.1|AUT55881.1|783460_785116_-|serine-protease MGTYRLLPIRQAGSRRTPRRTRGAFVAAAPRAAWARTPRVIPGAAARLLQRAWLLVLAFCAAAAFAHSARAAQPPVTPGSVVVIPVAGAISPATADFIVRGLARAADDRAQLAVLQLDTPGGLDTSMRQIIKAILASPVPVATYIAPGGARAASAGTYITYASHIAAMAPGTNLGAASPVQLGIGGQDAPKPGQPPGLPGATPASGPAQKDNAASGALPLDSQSTELRKQLQDAQAYIRGLAQLRGRNVEWAERAVREAVSLSARDALEQKVVDLIARDIPDLLRQLDGRTYDTAAGAKHLTTAHAPVVTLEADWRSHFLAVITDPNVALILLMIGMYGLFFEFANPGFVLPGVAGAISLLLGLFALQLLPVNYVGLGLIFLGLAFLIAEAFLPTFGTLGFGGIVAFAIGALMLIDTDVPGYGVPLPMIAAVIVFSVLFIFGVSGMVLRSRRRPVVTGAEAMIGSVGVVLDDGLVADTAPGRADGSRDGPPDSLLHGEPDRVGWARVHGERWRVRSTSPLAAGHAVRVTGRRGLMLTVVPASNPSQEGEHT >CP026102.1|AUT54197.1|782684_783464_-|hypothetical-protein MIGFTFGFGSILILLAIVLIASAVRVFREYERGVVFMLGRFWKVKGPGLVLIIPVVQQVVRMDLRTVVFDVPPQDVITRDNVSVKVNAVVYFRVVDPERAVIQVARYFEATSQLSQTTLRAVLGKHELDELLSEREQLNTDIQRVLDAQTDAWGIKVSNVEIKHVDINETMIRAIARQAEAERERRAKVIHAEGELQASEKLLQAAQMLAQQPQAMTLRYLQTLTTIAADKNSTIVFPLPVDLLTAVIDRMSKPSQHMG >CP026102.1|AUT54196.1|782233_782485_-|DUF4148-domain-containing-protein MKLIPRMVLGALIGVAAVSSAFAQTSRVYDQNTPKTRAEVKADLVEWRKAGYDPLDWINYPANAIAAGRVVAQRRAQAQGTQQ >CP026102.1|AUT54195.1|780453_781755_-|guanine-permease MDSVKRYFGFDEAGTTLRVEVLAGVTTFLTMAYIIFVNPAILGDAGMPKDSVFVATCLVAALASLIMGFYANYPIACAPGMGLNAYFAYTVVKGMGFTWQAALGAVFISGCLFLIVTLFRVREVIVNGIPHSIRVAITGGIGLFLAIISLKTAGIVTGSPATLVTLGNLHDPHVVLAIIGFFVIVMLDVLRVRGAILIGIVGVTILSFFFGGNQFHGIVSMPPSISPTLFQLDVKAALSTGVLNVILVFFLVELFDATGTLMGVANRAGLLVHGKMHRLNRALLADSTAILAGSVLGTSSTTAYIESASGVQAGGRTGVTAITVAVLFLLALFFAPLAGVVPGYATAPALLYVSCLMLREMADLPWDDATEVVPAALTALMMPFTYSIANGVAFGFISYAGLKLLTGRARQVKLVVWVIAAVFLFRFFYLGAE >CP026102.1|AUT54194.1|778611_779199_-|molybdopterin-guanine-dinucleotide-biosynthesis-protein-MobA MAYASLATGVLLAAGYGSRFDPEGIHNKLLARLPDGTPVAFESAHRLLLVVPHVIAIVRPGSEMLARVLNDAGCHVIFSADAERGMGASLAAGIEASDDADGWIVALADMPRIATSSIEAVARAVDDGAPIVAPYYQGQRGHPVGFGIEHRDALLALDGDTGARALFATHPVKRIEVDDPGVLSDIDTPEDLRNV >CP026102.1|AUT54193.1|777646_778363_-|PIG-L-family-deacetylase MSETSPRLFIVSPHFDDAVFGCGALLAAHPDAAVCTVFAAPPAQDMRTDWDEKAGFASAYESVHARTLEDNDALAVLDAIPLRLPFRDAQYRDSPSIGQLAAALEEAIYGSTSNTLLMPLGLFHDDHGRVFEACCEILPRMSHLEWFAYEEAIYRPMPGLVQQRLVDLAGRGIVATPASPAAGHTLDRERQALLKREAVSAYESQLRAFGPHGYDDVYAEERYWRLTVDRQGARRARH >CP026102.1|AUT54200.1|791938_792832_+|3-hydroxyisobutyrate-dehydrogenase MKIGFIGLGNMGAPMALNLLKAGHTVTVFDLNPHAVQSLTEAGATAKRTPKEASTDVEYVITMLPAAAHVKAVLTGEEGILAGIAKNVTIIDSSTIDPASVKAFAALATQNGNTFVDAPVSGGTGGATAGTLTFMVGSTAETYEQVKPVLSAMGKNIVHCGETSTGQVAKICNNLVLGITMAGVSEAMALGEKLGIDPQVLGKIINTSTGRCWSSDTYNPFPGVIDTAPSTRGYTGGFGTDLMLKDLGLATDAAKLARQPVYLGALAQQLYQTMSTNGAGKLDFSAVIKLYRKDGDA >CP026102.1|AUT54201.1|792832_793630_+|enoyl-CoA-hydratase MIELDYAHDGSVALLTLKRPPANAFTPDGLLQLQHTIERLNGDAQVRAIVITGDGPKFFSAGADLNTFADGNKEIARQAASRFGSAFEALQNARPVVIAAINGYAMGGGLECALACDIRIAEQHAVMALPETAVGLLPCGCGTQTLPWLVGEGWAKRIVLTGERVDTATALRIGLVEEVVEKGAAREFALQMAARVAGLSPQAVTFSKDLIQQARNGVPRTAALAVERERFVDLFDGADQREGVNAFLEKRAPQWQGVQSKESQR >CP026102.1|AUT54202.1|793626_794796_+|enoyl-CoA-hydratase/isomerase-family-protein MNAVLNEASAHEPDVLFRVVNRVAIVTLNRPAALNALSHEMVRELAVLVERCRTDSEIVAIVLRGAGAKGFCAGGDVRALYGMRQRNETDWQQFFIDEYRLDYALHTFPKPVVALLDGIAMGGGMGLGQAARLRIVTERTKIAMPETRIGFLPDVGATRFLSVMPAEIELYVGLTGVTLTGAEALCFQLADLCVPSEWLDTFEERLLRIATADVAADELLRALRTVFEPPCNIVPHAGLGAFTQLILRHFDRRSGVERIVATLRQDLEREHVPQMGQREVRQWLQATYDALTSHSPTMLYVTRDALLRGRQMTLAECFRMELGIVTRAIEEGDFSEGVRAHLVDKDRKPRWAPATLAEVRPERVRHFLSSPWRTQAHPLADLGVEQALA >CP026102.1|AUT55884.1|794810_795824_-|hypothetical-protein MTVKNMKRKRRRRLKPLLLMGAACALLSARSASAQEIALYGGWLRGAGTNTYSWAIDYTEGFGRYLAGSITWLNEGHMPDHHRDGQAVQIWGRLPLAQNRFVIAVGVGPYRYFDTEAAEQGQGYSNTHGWGGLFSARATWYTSRRWTTSLQLNRVQVSNGPSTTAVLLGAGYQLDAPDEPGPRAWALPRTHDVTNNEVTVLVGQTILNSLESQTSIAESIEYRRGLTHWLDGTFGYLHEGGGLKARRDGLTAQLWLTRAFLDDQLTLGIGAGAYAAIHHGEDPDERSTGDGILSGLVSVSASYRFTQHWAARVTWNRVVTRYSRDTDVLMGGIGYRF >CP026102.1|AUT54203.1|796184_796721_+|(2Fe-2S)-binding-protein MPHTIARPEVTPPGDTPHAQPSSSVTIPVELNVNGTAYALALDPRTTLLDALREHLHLTGTKKGCDHGQCGACTVHVNGRRENACLSFAATHEGDTITTIEGIGEPDALHPMQAAFVECDGYQCGYCTSGQIMSAVALLDEAIGPDDADVREAMSGNLCRCGAYQNIVTAIQTVRGKR >CP026102.1|AUT54204.1|796730_797732_+|xanthine-dehydrogenase-family-protein-subunit-M MELFQLSRANDVRDAIVAGAASQTAQQGAQVRFLAGGTTLLDLMKLDVEKPARVVDIRRLPLDRVEVTDDGGVKIGALVRNADLALHPLIHEPYAVLSQALLAGASAQLRNMATTGGNLLQRTRCVYFRDTAMPCNKRAPGSGCAAITGFNRTMAILGTSDACIATNPSDMNVALAALGATVQIQGTKGARSVPIDDFYLLPGDTPERETVLEPGDLVTHVTLPPIPGSRSLYLKLRDRASYEFALASAAVVVNVVDGRITRARVALGGVGTKPWHAREAEAELAGAVPDAASFARAADAALANAKAQSQNGFKIELSRRCLIHALTQVMQSV >CP026102.1|AUT54205.1|797755_799990_+|xanthine-dehydrogenase-family-protein-molybdopterin-binding-subunit MSTVSDSLLSVIGQPQSRVDGPLKVSGRAQYTSDIDLPDMLYAVPVCATIASGRVTSLEFAAAQAMPGVRVILHRGNIGRFYRISGNSMETGFVDEARPPFDDDVIRYYGQYVAAVVAETFEAASNAAAAVKVGYDRTAHDVSDELEAKGEPHVQSERGDAASAFEAGEVTLDETYVTPVETHNPIELHATVAQWDGEGYTFYETTQAVSNHQGTLMQMLGLPKEKVRVISRYLGSGFGGKLWMWPHSLLAAAASRHTGQPVKLVVSRKMMFQNVGHRPTTQQRMRLSADRSGKLTSLRHDYLNHTAMADDYEESCGEITPFLYSVPNLRVTSGLVRRNVGSPTAMRGPGAVPGLYALESAMNELARKLDIDPVEFRLRNEPKVDESTGLPFSSRHFVECLTTGAEKFGWAQRTAEVGSMTRDGLTLGWGVGACGWPGLRFSAEASVDLRADGTARVVCGTQDIGTGTYTILAQLVAGHTGIPLDKIEVVLGDTMLPVGPISGGSAATASVIPAVLQAARAATEMVLARAAAVDESPFKGVDKDSLAFGAGRVHRKTEAAEKGVPFAQILQAAKMHAASGKGSAQGGFDDPLKKHYSIYSYGAHFAEVTWQPETARLRVNRVVTVIDAGRILNPRAGRNQIEGAVVMGVGMALFEHTMYDAQSGAPINSNLADYIVASHADTPALDVTFLDYPDPVFNELGARGIAEIGLAGVAAAITDAVHHATGVRVRRLPVMIEDLLLGSM >CP026102.1|AUT55885.1|800841_801780_-|LysR-family-transcriptional-regulator MGLTTMRTINHQRLRYFYAVLTQGSIRGAADDMNTSPSVITRQIRLLEEELGVTLFERGARGARPTEPAAHLLEFWEGCQSQQEKLEDQLHAFRGLRHGRVQLAVSEGFVDTLTEEVLAPFCAKYPALTIEMSMLARDGIVEEVAESRAHIGLAYNPPPHPRLQCLASSVQRAVLLLRREHPLAMRKRAATIDDLRAFPLAMMPQTFGIGHAVKMLEIAEGMQIEPAMTTNSLAVLKRMVAVENFVTLIGEFAARREVASGELTTVPVDHPVLQSTHARLLVKTSRLLSPGPMELLDWIRRRLSVFGDGVHG >CP026102.1|AUT54206.1|802102_802873_+|DNA-binding-response-regulator MKKLEHVLIVDDDSETRELVAIHLQRNGMRVSRASSGREMRAALGRDTPDLIVLELRLPDTDGLSLCRELRAGEFHAIPVVMLSARHDEADRIVALELGADDYMSKPFAIRELLARIRAVLRRTNMLPPGMRVAEAATVLRFGEWRLDTAARRLLDPEGTVVALSGAEYRLLRVFLDHPNRVLTRDQLLNLTQGRHADLLDRSIDLLVSRVRQRLHDGVRDGRYIKTLRNEGYLFSATVMRVESDVAHAPAMTCIA >CP026102.1|AUT54207.1|803301_804081_-|alpha/beta-hydrolase MTRSFRAILAFATMLCGLFALQTADAATPADLKGTNIVLVHGAFADGSSWNRVIPLLEAYGLHVVPVQNPLSSLADDVAATKRVIDQQTGPVVLVGHSWGGVVISQAGNDDKVKSLVYVAAFAPDANQSIADITQGMKPPAWANELRKDSARYLTLSDKAVRDDFALDLPAGQQRIVAATQGPWFSGCANDKVTQAAWHEKPSYFVIPGRDKMIDPHLQAKMATQIHAQVTRVDASHVAMLSQPEAVANAIIAAARHAH |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP026102_3 | 1048658-1048744 | Orphan |
NA
Consensus repeat of CP026102_3
|
1 spacers
spacers of CP026102_3
>3.1|1048681|41|CP026102|CRISPRCasFinder CATAACACCGAACGGCGACGACGCGTGAAGCAAGACAACAA |
CRISPR arrays and Neighbor proteins around CP026102_3
The CRISPR arrays of CP026102_3 >merge|CP026102|3|1048658-1048744|CRISPRCasFinder ATCGCGGATGCCAGCGAAAAGGCCATAACACCGAACGGCGACGACGCGTGAAGCAAGACAACAAATCGCGGATGCCAGCGCAAAGGC >CP026102|3|3|1048658-1048744|CRISPRCasFinder ATCGCGGATGCCAGCGAAAAGGC CATAACACCGAACGGCGACGACGCGTGAAGCAAGACAACAA ATCGCGGATGCCAGCGCAAAGGC
>CP026102.1|AUT54386.1|1046902_1047823_+|hypothetical-protein MTRLSCLRHARAATGRHAMYSARQSHDFMISELALLEQLRLAIARKTLEVFYQPVIRLADNTCVGVESLLRWRLHGQDISPEIVVGLAEQHQLMGPLTDLVLHKSLDDLACVLSSDRSFRVSINVGSDDLRSVRFLDVLAQALKWTGVRASQVGIEATERGFMHPDATRSIIAALRWAGHPVYIDDFGTGYSCLSYLGTFHVDALKLDKAFVNPVENAHASSVVAPHIIAIAHDLGMEIVAEGIESAAQAEYLMRKGVQYGQGWYFAKAMPVVELLPWLAEHAQSVKRDERRRCARSPRLQLLRTP >CP026102.1|AUT54385.1|1045049_1046471_+|sensor-histidine-kinase MNNRFLPRTLGARLTALIFFSTSVILALSGAALYEALRSRMSLAASGHMQATLEALQADLANVRATSRISNHPHVWTDQMEGHQNMDMAIYDMAGRRLVSTRGFQPFALLGDLPPDRRNAAFDTRGARFRYITALARLAGDCVSVRVVVQYDKSENLASLRAHAWTIMLIEVVGVGIAAAFAYAITVFGLSPLRRFVSYAEEMSTCRLAQPLSGFDTSQELKELEHAFNGMLERLNDSFTRLGQFSSNLAHDMRTPLTNLQAAAQVALSLPRSTEEYRDVIESSVEEYQRLSGMIEDMLFLARSEKAHTLVKVCLLDAVVEAGRVAGYYETVASDAGVTIELSGRAEVRADLLLFQRALSNLLSNALAHAPRGSVIFVTCREEAGAAEIAVTDTGEGIGREHVDRIFERFYRIDPSRHNRGSGTGLGLAIVKSIMENHGGQCGVESCPGVRTTFWLRFPFADVNREAGSGDTS >CP026102.1|AUT54384.1|1044319_1044934_+|hypothetical-protein MIKATTLAVAVIVSMSVAPWASAQTAQPASGVQEQSALAARVQALLATAPPVEIKARIADIDRASRIITLHGPKGHDIDVAIGPQVENFNQLRVGDHVEVLYKNALLVSADKVADADKGIRKRVDSSVYQSTPSGYGAARQIEVQATVLDMNASKREVRLRGAYQAVTLVVGPDIDFKTLKVGDTVHAVFVSAYATRVTPISAH >CP026102.1|AUT55915.1|1042780_1043533_-|MipA/OmpV-family-protein MRYLHAAAALTIGAAMVISSRFAHSAEASLGAQVNVMPKYDGAASYRALPLPLFAYDNGLFFVSGLSAGIRYPIGAGISTGLIAQFDFGRDADDSSRLAGTNDISNTARVGAFVDWRRGKWHASLNALQATHSGYGLKVRLAGSYAALATPKNTVHLAVGATFGNGDYMNTYFGVTEQESLASQSRLAGYSPSGGIKGVDASVTWKHQLNPHWSTAAVLGVSSLVGDAADSPVVEHKAAIFGSVGLAYRF >CP026102.1|AUT54383.1|1041286_1042447_+|porin MKQQIVLACAVGAFAVSAHAQSSVTLYGSIDAGITYANNVSGKSVWQQGSGNLSNNYFGLRGAEELGGGLKAVFTVESGFDLNNGGFHNNDDIFNRQAFVGLKSDRYGAVTLGRQYDSTSEYLGPLSAAGAGFGNNLAGHPFDNDNLAQTYSTKNAVKYTSPNYAGVEFGGMYGFSNDANGFANGRTWSLGARYGTGPLSVAAGYTQSDNSGGLGGANSAASASQNISATLQRTYGLGATYAFGPAQVGLVWTHSQIDGLASLSSGGAALPGLTGMNLHLDNYEINGQYRLTPALAIVSSYTFTDGTVTGSNSGNSPKWHTFVLGTDYSLSKRTDVYLAGVYQHASGSLGYDANGNGIANVASINLLSPSSTNNQAAATIGLRHRF >CP026102.1|AUT54382.1|1039523_1040921_+|HAMP-domain-containing-protein MLRYLPASLRIRLTVLIAFYASIAFAVSGFVVYEAMMSRVEANATDKMEQLMSALQVHLVEVKSTDGITRDPDAWTEHVHGREYVAFAMFDVAGKELLSTRGFRNYPPVLDVQTPRNPVNLSTPTTALRYLVAIVPLNGRDSPAVRVAVQYDSSEEHELVRSNAEIIFIMGTIGILLAAISAYGVTMLGLSPLRRIVTRAEQMSIDGLGQPLPKLTSSTELLELGQAFNGMLARLDDSFTRLSEFSSDLAHDLRTPLTNLRAAAQVALAQSRAAPEYREVIESSVDEYERLSRMIDDMLFLARAERADLSLSICEFDAAAQARRVSGFYESLAQAADIAIDVRGQGIIHADLLLYQRAVSNLLANAIVYAPRNSTIDIECWEQPDAVVVLVSDRGPGIAPPNAERIFERFYRADPPQGKAISHGEGLGLAIVKSIMNLHHGACGVKSDPAVGTTFWLQFPVEKTH >CP026102.1|AUT55914.1|1038398_1039082_+|DNA-binding-response-regulator MRILIVEDEGKTGLYLRKGLTEAGYVADWVEDGISGQHQAETEDYDLLIVDVMLPGQDGWTLLHNLRRSKSTPVLFLTARDDVGDRIRGLELGADDYLAKPFDFVELTARVKSILRRARPQDSNTLRVSDLELDLTRRKATRQGRVILLTAKEFALLWLLMRREGEILPRAIIASQVWDMNFSSDTNVVDSAIRRLRSKLDDPFESKLIHTVRGMGYVLEVRSQATP >CP026102.1|AUT54381.1|1036857_1038108_-|IS701-family-transposase MSTSQRFDEYLEYLSQGFRHKHHIAGLRDYCTGLMRPLERKSTNAIAEDLQPARAAAMRQALHHFVARAPWCDDELLRQVARWVTPQMAGLSRSGWWIIGCNTFPKRGSQPVGVARQNHEASGRYDKCQIAVSVSLACESASLPIGWRLYLPRAWADDPIRRRKAGVPADVQFATRPKLALQQVEKLLAGGTPSRPVLADVSYGMDPEFRQGLIDLGLPYVLGVTSQARIWRPQAEALPSTGYRETGRLPSQTWRTADHYPISVRALAMEMPAHALQTISWREGNGNLRSSRFGVARVQHADSHACWARLQPLQWLLLKWPLGEPEPVRYWLSTLPEDTSINDLVAAAHYHWRTDRDHEELRQDFGLDHYSGRGWRGFHHHTTLCTASYGFNLGERLASERDLATRRLSIYPESGA >CP026102.1|AUT54380.1|1035105_1036695_-|hypothetical-protein MSKRFRLLAGIATLIPAFTLLAQDLPANPKPNPYLAAEKYAITHFDSSQSDSFPYAVPRGTFEVDLRKEKRIVAGPVNIMTLASTSPSYMWGVSSEGVTYIDVSNGGFKEVARIAAPGQKIISAQLHDRVLGQHFSNAAQVQKAVTDIYGLDWTRAVNGVYSVVDKDNGVYYNTADGFLTKFSLIDEKNPSAGIKVIKTIDMRSVIGPDAYLVGTGITYDGKLVVASNFTVSVLDRSLEGKARTIRLAPGEVVTNSFAIDDQNGIYIASNKIMHKLVWTGTRLSDDPADGAWTSPYDTGDQPPTIKLGNGTGSTPTLMGFGRDQDQLVVITDGANRMHLVAFWRNKIPTGWQTPAGAKSRRIAGQIAVTAGLTPLPKFIQTEQSVVVKGYGAFVVNNISQSGEKDKLVDVLALGPVNQPGHGTERFEWDPKAHRWQSVWTRGDVISISMVPSVSSASGIVFVNGYYKKTGWELTGLDWDTGKTVQRVEFGKDNLGNGAYAIIQYAPNGDLIFNSIGGPVRVHLKDPART >CP026102.1|AUT54379.1|1033966_1034920_-|phenol-degradation-protein-meta MHKRRLIQYMTDSLLAILCVAGMTQNSSATETGVGRPITGQQVTPYGGIVPPNSEWIVSWATIYYDGSLSASKKVSTGNQITGGLDYQVVYTIANLVKTWGVNLGGWNFASSIGVPVQYSNASSFNGLLRPDSATQFADLFFAPVIAGYRLSPTDYTALSLQIYAPTGAYNPDRIANAGQNTWTFTPGIAYTRLFPSNNLELTINYGVEFYTTNSATNYHNAAVSVLDVLALKRFRSGWSVGVVGGWIQQLGNDTGPTADLIGGAKGYSLGMGPTIGWAGKIGKTPVSANLRWVNEFSAKARPSGNAVQLSLSAAFE >CP026102.1|AUT54387.1|1048809_1049829_-|GlxA-family-transcriptional-regulator MSPDRTASLSHFAFMPLPNFTMIAFTNAIEVLRMANYLSGQTLYRWSIISPDGGPVSASNGLSVDTGPADCVGTPDIVFVCGGIDVQRVTTPEHQSTLRRFARAGVALGSLCTGTYALAKSGLLAGYACAIHWENMSALKEEFPDTRFLKELFVIDRDRVTCTGGVAPLDMMLNLIAARVGTPRVTQIAEQFIVEHVRDNSAQQRMPLVARLGSANKSLFEVIALMENNIEEPLSREELARLANMSQRQLQRLFREHLGMTPTHYYLTLRLRRARELLLQTDMSIMHITMACGFQSACHFSKSYRDAFGTAPTRERRKQVAPLAHAVISNSIGGVSVHA >CP026102.1|AUT54388.1|1049860_1050160_-|hypothetical-protein MQVRFGQVTESTGRAYPWCQCSKVMRFRDGALQRPAVICSCKEHRMGARQARGQCQHCTAPVRSRECTAATRLSSQGLAVWHACCAHSQNKKGRFPTRP >CP026102.1|AUT54389.1|1050342_1051617_+|serine-hydroxymethyltransferase MSNANPFFSQSLAERDAAVRKSVLKELERQQSQVELIASENIVSRAVLEAQGSVLTNKYAEGYPGKRYYGGCEFVDEVEALAIERIKKLFNADFANVQPHSGAQANGAVMLALAKPGDTILGMSLDAGGHLTHGAKPALSGKWFNAVQYGVDRETLRIDYDQVEKLAHEHKPSLIIAGFSAYPRVLDFARFRAIADSVGAKLMVDMAHIAGVIAAGRHPNPIEHAHVVTSTTHKTLRGPRGGFVLTNEEDIAKKINSAVFPGLQGGPLMHVIAGKAVAFGEALEDNFKTYIDNVLANAQALGEVLKEGGVDLVTGGTDNHLLLVDLRPKGLKGTQVEQALERAGITCNKNGIPFDTEKPTVTSGIRLGTPAGTTRGFGVAEFRDIGRLILEVFDALRTHPDGDAATEQRVRREIFALCERFPIY >CP026102.1|AUT54390.1|1051660_1052632_+|membrane-dipeptidase MSNLHDSSIIIDGLNISKFDRSVFEDMRKGGVTAVNCTVSVWEDFQKTIDNIAEMKQQIREYSEILTLVRTTDDILRAKKENKTGIIFGFQNSYAFEDNLGYIEVFKELGVNVVQLCYNTQNLVGTGCYEPDGGLSGYGREVIQEMNRVGIMVDLSHVGGKTSSDAIACSKKPVTYSHCCPSGLKEHPRNKSDEQLKEIADANGFVGVTMFAPFLKRGPDATVEDYLEAIDYVINVIGEDKVGIGTDFTQGYSTEFFDWITHDKGRYRRLTNFGKVVNPEGIRTIGEFPNLTAAMEKAGWSESRIKKVMGENWLRVFGEVWNV >CP026102.1|AUT54391.1|1052665_1053211_+|4-vinyl-reductase MQPQLPIDVDPNTGVWTTDALPMLYVPRHFFTNNHAAVEEALGVEAYAEILYKAGYKSAYYWCDKEAKQHGISGMAVFEHYLNRLSQRGWGLFKIIEADPATAHAKIELRYSSFVLQQPEKSGKLCYMFAGWFAGAMDWVNDTTEGGKKAPRSLSKEAQCAGEHSDHKHDHCVFEVSPLAA >CP026102.1|AUT54392.1|1053310_1055374_+|FAD-dependent-oxidoreductase MRYPNLFKPLTLNQLTLRNRIVSTAHAEVYAEPGGLPGDRYIRYYEEKAKGGVGLAVCGGSSPVSIDSPQGWWKSVNLSTDKIIDPLSRLAEAMHRHGAKIMIQATHMGRRSAFHGEHWPHLMTPSGVREPVHRGNAKIIEVEEIRRIISDFAAAAKRVKDAGMDGIEISAAHQHLIDQFWSPRTNFRTDEWGGSLENRLRFGVEVLQAVREAVGKDFCVGLRMCGDEFHEDGLDHEQLKEIAQAMSEKGLIDYIGVIGSGADTHNTLANCMPPMALPPEPFVHLAAGIKSVVKLPVMHAQSIRDAGQAERLLANGMVDLVGMTRAQIADPHMVIKIRDGREDEIKQCVGANYCIDRQYNGLDVLCVQNAATSREATMPHVIEKTRGPRRKVVVVGAGPAGLEAARVARSRGHDVVLFEKSDAVGGQIMLAAKAPQREQMAGIVRWFDMETKRLGVDRRLGVEADEKMILAEKPDIIVLATGGSSFTQQVPAWGVEEGLAVSSWDILSGKVEPKQNVLVYDGVSTHAGAGVADFISSRGSKVEIVTPDVKVADDVGGTTFPIFYRRLYAQGVIHTPNYWLDRVYEEDGKKIAVIRNEYTEEQEERAVDQVIIENGSTPNDALYWKLKPESVNRGQVDVHKLFAAEPQPSLSEELGNGRFLLFRVGDCISMHNIHGAIYDALRLCKDF >CP026102.1|AUT54393.1|1055376_1057296_+|DUF3483-domain-containing-protein MSPAFLITALLWVSVAGLAFAVAKRSAYWRLGRATAAGAFGWTNLLTIPKRYFVDLHHVVARDPYIAKTHVATAGGAIAAFALVFINYGLAIYSPWLDRLIFLAALIMLVGAVFVWRRRHAKDVPARLSRGPWNTLPWLLGSFALGLLLYTLLPASAMSGGLAIIFALLIAAGAFAMTFGAARGGPMKHALAGLLHLAFHPRQERFAAQGDVRREAVVPPTALKAPVLEQNEYGVGKPVEFRWNQLLSFDACVQCGKCEAACPAFAAGQPLNPKKLIQDLVTGMVGGTDAAYAGSPTPGIKVGQHGGEPQRPIISSLIEADTVWSCTTCRACVHECPMLIEHVDAIVDMRRNQTLVHGTVPGKGPEVLANLRETGTMGGYDKAARYDWSVDLSSPVAQPGKAVDVLLVAGEGAFDMRYQRTLRSLVKVLNKAGVNYAVLGAEETDTGDVARRLGDEATFQRMAKQMMGTLATLDFKRIVTADPHVMHSLRNEYRALGGRYDVLHHTTFLAELVASGKLSPKAIAAFNDKTITYHDPCYLGRYNGETEAPRQLLKTIGIKVVEMERHGKRGRCCGGGGGAPLTDIPGKQRIPDIRIADARSIGADVVAVGCPNCTAMLEGVVGPRPEVLDVAELVAAALE >CP026102.1|AUT54394.1|1057299_1058478_+|electron-transfer-flavoprotein-subunit-alpha/FixB-family-protein MTTTIKRIDPRRPFVITAAGLKRITLGAEHVAGAANDHTLHAGAHGHAAVKTLRTTQAPQRCLLVVAHSDRGGLDDHARQALAAAALIADAATQVALLVLGELKDDAAALGADKVIELPSFDRRTFAPEREVQAVAACVAQLAPAHIFIPDNATGDGDLGRRYAALAGASIATHVVQIDAKQVSSYAQAKQAYATRALPDVILLAAGAVDTRLPFIGAGERLELTAFAQTSDASSSVYRDLGIEEIDAAQVALEEADFIVSAGNGVTDVAAFEKLASTFGAAIGASRVAVDNGMFTRDKQIGATGKTVEASVYIAFGISGAVQHLQGIKDCRHVIAVNLDGSAPIVKRANLTIIGDTQSTIASLIDAIDQARSGRGAGAAPAVKQIVEGVAA >CP026102.1|AUT54395.1|1058474_1059254_+|drug:proton-antiporter MNGKLEKIAVLVSVGKHPVSGVARYSRNDAAALEIGRQLSNQHAARLDVLHAGDPGNPALEEYLALGAERVEVLTCGDNGDAVSLLAARLKGYDLVLTGTCAEGAFDSGMLPYRLADALGVPLAGTAVDVTIAGGRATVRQFLPKGVRRRVEVALPAVVAVHPLATVTPRYAYARLRAGTIAPQRVEAGADAEAAQWTLAPVARKPVRLAAAEKRTGHARMLSATTTESRGGSVVIEGTSVEKAQVILDYLREHQLIEY >CP026102.1|AUT54396.1|1059295_1060576_+|aromatic-ring-hydroxylating-dioxygenase-subunit-alpha MKVSADIRALVDRRKKGYSLEAPFYLSDEIFALDMDAIFRQHWIQVAVEPDVPEPGDYVTVELGNDSILIVRDDDMQVRAFHNVCRHRGARLCNEDKGSVGNIVCPYHSWTYNLSGELMFAEHMGEKFDRCKHSLKSVHVENLAGLIFVCLAEQPPVDFAVMRAAMEPYLLPHDLPNCKIAAQIDIIEKGNWKLTMENNRECYHCVANHPELTISLYEYGFGYQRSPANAEGMDAFERTCIERAKQWEEMDLPSVEIDRLSDVTGFRTQRLPLDRSGESQTLDAKVASKKLLGEFQQADLGGLSFWTQPNSWHHFMSDHIVTFSVIPLSAGETLVRTKWLVHKDAVEGVDYDVANLTAVWNATNDQDRALVEFSQRGASSSAYEPGPYSPYTEGLVEKFSDWYVQRLAAHVESPVAEQRTINIKAV |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP026102_4 | 2008212-2008295 | Orphan |
NA
Consensus repeat of CP026102_4
|
1 spacers
spacers of CP026102_4
>4.1|2008235|38|CP026102|CRISPRCasFinder TGGTTTCGTGGTTTATGCTTCGCCGTTCGGTGTTCCGG |
CRISPR arrays and Neighbor proteins around CP026102_4
The CRISPR arrays of CP026102_4 >merge|CP026102|4|2008212-2008295|CRISPRCasFinder GTTTTGCGCTGGCATCCGCGGTTTGGTTTCGTGGTTTATGCTTCGCCGTTCGGTGTTCCGGGTTTTGCGCTGGCATCCGCGATT >CP026102|4|4|2008212-2008295|CRISPRCasFinder GTTTTGCGCTGGCATCCGCGGTT TGGTTTCGTGGTTTATGCTTCGCCGTTCGGTGTTCCGG GTTTTGCGCTGGCATCCGCGATT
>CP026102.1|AUT55118.1|2007433_2008171_+|GntR-family-transcriptional-regulator MASTSFALERAFRPKQIYEQVAERMRGEIRSGQFAPEARLPSERDLAARFGVGRPAVREALGALQNEGLVVTRRNSGTYVCADALQRLAAAPAAGEMPGDADFSPTSALDVRLILEPAIARRAAANAQRDELAEHYLAQMDSIDDVSDGVQRALWNDSDRLFHRQLAVMTGDALLVKIADEVAKAMDQPLWKRLKDDGIHDPGRIRLYVSEHRLIYEAIVDGDAEAAAFYVEQHIRRVRRDIAPK >CP026102.1|AUT55117.1|2006229_2007237_-|hypothetical-protein MKLNRMISTVEVHTGGEPFRIVTSGLPKMPGKTIVERRAWLKEHADHLRRALMLEPRGHADMYGGYLTDPVTEGADFGIIFVHNEGYSDHCGHGVIALATTAVSLGWVERTEPETRVGIDAPCGFIEAFVKWDGDHAGSVRFVNVPSFIWLRDVSVETPSFGTVRGDIAFGGAFYFYTSGEPFGLDVREDHVDRLIQFGAEVKRAANEAFKVQHPQIPEINHIYGTIIDNAPRHAGSTQANCCVFADREVDRSPTGSGTAGRVAQLYLRGQLRKDETLVNESIIGTVFRGRVLSETKLDRFDAVIPEIEGDAHICGFANWLVDERDPLTYGFLVR >CP026102.1|AUT55116.1|2005289_2006207_-|ornithine-cyclodeaminase MSSTPIFDAAGTARLIPFRALVDALKTAAADYAAGRVASPERLVVPLNDDGIMLSMPAAARDLAIHKLVNVCPRNGERALPTIHGQVMAFDPDTGKTLFILDGPTVTGRRTAAMSMLGIETLSRGAPNDILLIGTGTQAASHLRAIGELYPQARVRVLGTSAARAQAFCDAQRDTVRDLQALSGAAIPDSVDTVIALTTSRQAVYDEAPRAGRLVIGVGAFTPQMCEIGARTLAGSTLYADDLAGARHEAGDFIQAGIDWSTVTGIEGALASQPSFDTPIVFKTVGCAAWDLAAGRVARAALGAG >CP026102.1|AUT55115.1|2003587_2005072_-|glucose-6-phosphate-dehydrogenase MTTSTAPASPDRPLDMIIFGGAGDLSARKLLPALYMAHTHGNLPPETRILAIGRREWGRDDYLKWMDEQSRPFIESGAFDASAWDRFLSLFEYVRVDVDQAGDYERLAEASRPNALRVFYLSTSPELFTTICDNLSSHGLLDEHSRVVLEKPLGHDLASAQAINDSVGKHFSEHQIYRIDHYLGKETVQNLMVLRFGNAIFGPLWQAPYIRSVQITVAESVGVGTRAGFYDHTGAMRDMVQNHLLQLLCIVAMEPPVSLDADAVRDEKLKVLRSLRPMTAEDISRDTVRGQYTAGAVGGEPVKGYLEEANVPADSRAETFVALRAHINNWRWANVPFYLRTGKRMAKKLSEIVIEFADLPFSIMPNSPCGPRNCGNRLVIQLQPNESIQLQMLAKEPGSGMRTLPVNLNLDLEQAFTSRRAEAYERLLIDVVRGRLTHFMRRDELEAAWTWVDPIIEAWKRNGDKPRAYTAGTFGPGASTAMMARDNMVWSEES >CP026102.1|AUT55114.1|2002389_2003475_+|AraC-family-transcriptional-regulator MTIVDSLPLTSIDALPKERIRFGIVLLPNFTLTAFSGFVDMLRLSADEGDYSKPVRCSWSVIGDTLAPVRASCGIQITPWETFADAEPFDYVVVVGGLLHSGPQANDETLQFIRAAARGNTTLVGICTGVFALMRAGVLDEHRICVSWFHYWDFVERFPSVNPDALIADRLFVIDRRRITCSGGRASIDVAAAILLRHFETATVQKALRILLVGEMQKGNAPQPHPPGLEPATHPKVKRAILLMEQHVGRTLPLEELACKLDLSPRQLERLFKAETGKSPQAFAKQVRLRTAAWLLTSSDRTVADIASSCGFSDASHLGREFRKEFGMPPVMFREQRGGTPVEGDAAVAYEETFPGRVDVF >CP026102.1|AUT55113.1|2000631_2002020_-|L-serine-ammonia-lyase MNVSVFDLFKIGIGPSSSHTVGPMIAACRFASHIEDANLLAFVRRVKVELYGSLGATGKGHGTDKAVLLGLEGHLPDTIDPDLIEPRLADIRKGKRLALLGKHEIAFDEKEHIAFFRRLMSGTGSVVHPNGMRFQAFDENGQLLVEKEYYSVGGGFVVNREGDRVNGVRAGGEVPYPFRTGDDLMRVCRESGLSVAQVTFANECASRAPEDVREGLLTIWRTMAACVERGCKMHGELPGPMRVKRRAADLTVQLRTRSEESLRDPLSMLDWVNLYAMAVNEENAAGGRVVTAPTNGAAGVIPAVLHYYVKFVPGSNENGIVDFLLTAAAIGIIYKETASISGAEVGCQGEVGVACSMAAAALAAVMGGTPTQVENAAEIGMEHNLGMTCDPVGGLVQIPCIERNAMGAIKALNASRMALKGDGQHYVTLDNVIKTMRETGADMKTKYKETSRGGLAVNVIEC >CP026102.1|AUT55112.1|1999358_2000603_-|sarcosine-oxidase-subunit-beta-family-protein MSRYSIFSLFRNGLSYHENWERQWRSPEPKKEYDVVIVGGGGHGLATAYYLAKEHGVKNVAILEKGWIGGGNTARNTTIVRSNYLWDESAALYEKAMKLWEGLSQDLNYNVMFSQRGVLNLAHTLQDVRDTERRVNANRLNGVDAEFLTPEQIKEIEPTINLNSRYPVLGASIQRRAGVARHDAVAWGFARGADQAGVDIIQNCQVTGIRRDGGRVTGVDTVKGFIKAKKVAVVAAGNTTTLADMAGIRLPLESHPLQALVSEPIKPVVNSVIMSNAVHAYISQSDKGDLVIGAGVDQYTGFGQRGSFHIIEGTLQAIVEMFPVFSRVRMNRQWGGIVDVSPDACPIISKTDVKGLYFNCGWGTGGFKATPGSGWVFAHTIANDEPHPLNAAFSLDRFYTGHLIDEHGAAAVAH >CP026102.1|AUT55111.1|1999036_1999336_-|sarcosine-oxidase-subunit-delta MLTIECPWCGPRAESEFSCGGEADIARPLDTDKLTDKEWGDYLFMRKNPRGVHREQWLHTQGCRRWFMATRDTVSYEIQGYDTFKTGNTSADAQGGNKQ >CP026102.1|AUT55110.1|1996040_1999040_-|sarcosine-oxidase-subunit-alpha-family-protein MSQKNRLGAGGRINRAIPLTFTFNGRTYQGFQGDTLASALLANGVHFVARSFKYHRPRGIVTADVAEPNAVVQLERGAYTVPNARATEIELYQGLVATSVNAEPNLEHDRMAINQKFSRFMPAGFYYKTFMWPAKFWPKYEEKIREAAGLGKAPEVLDADRYDKCYAHCDVLVVGGGPTGLAAAHAAAVSGARVILVDDQRELGGSLLSSKTEIDGRAALSWVEKIEAELSRMADVTILSRSTAFGYQDHNLVTVTQRLTDHLPVSMRKGTRELLWKIRAKRVILATGAHERPIVFGNNDLPGVMMASAVSTYIHRFGVLPGRNAVVFTNNDAGYQCALDMKACGASVTVVDPRAQGNGALQAAARRHGVKIMNNAAVMTAHGKLRVTSVEVVAYANGKTGAKQADLPCDLVAMSGGYSPVLHLFAQSGGKAHWNDTKACFVPGKGMQPETSIGAAAGEFSLARGLRLAVDAGVEAVKSIGYAVTRVQVPQAAEVAESPLQPLWLVGSRTEAARGPKQFVDFQNDVSAADILLAAREGFESVEHVKRYTAMGFGTDQGKLGNINGMAILADALGKTIPETGTTTFRPNYTPVSFGTFAGRELGDLLDPIRKTAVHEWHVENGAMFEDVGNWKRPWYFPKSGEDLHAAVKRECLAVRNSVGILDASTLGKIDIQGPDAAKLLNWMYTNPWSKLEVGKCRYGLMLDENGMVFDDGVTVRLADQHFMMTTTTGGAARVLTWMERWLQTEWPDMKVRLASVTDHWATFAVVGPKSRKVVQKICSDIDFANEAFPFMSYRNGTVAGVKARVMRISFSGELAYEVNVPANMGRAVWEALMAAGAEFDITPYGTETMHVLRAEKGYIIVGQDTDGSITPHDLGMGGLVAKTKDFLGRRSLARSDTAKDGRKQFVGLLTDDPQLVLPEGSQIVAGPFQGETAPMLGHVTSSYYSPILNRSIALAVVKGGLNKMGQNVTIPLASGKQIAAKIASPVFYDTEGVRQHVE >CP026102.1|AUT55109.1|1995400_1996051_-|sarcosine-oxidase-subunit-gamma MWNEARNNAPGATSAVANRVAGQPWQESPLAGVGELVKKHAAAPSKKFHLREKAFCDLVNLRGDVSDAAFLGAVESVTGCRPPARPNTVVRGNGYDVLWLGPDEWLVRSQQPQAPVAEDKLVEALQGQFASAVDIGSGWTVLEVSGEKVRDVISRGCPLDLHPRVLAAGQCAQSHYFKASIVLVPIADDTYEIVVRRSFADYFVRIMLDAAEPLLS >CP026102.1|AUT55119.1|2009038_2010586_-|methyl-accepting-chemotaxis-protein MKSLTINARIATTIAFLGVLLIATGALGIFGMAKSNRAQRDGYEVNFASVVALGRSGTAMSRARFGLDWAMSNPHSPQLGEQLNRAKRLLGDADRAWAEFRALPKTPALQSLTDDLDAKRTAVLRDGIDQLIQAIGSGDTNWMDESRANHLIGLYSAMNASQGALEKYLDDAAQAAADHSSATFRTLLTACIASIAVGLGVAYLSWRALRRAIMSPMRDALGQFDAIASGELRTRVEIRSEDEMGTLLHGLATMQDKLGATITTVRKGSDSIAAATQQIAAGNLDLSQRTEEQAASLEQTAAAMDELTSTVQLNAENAQHASKLAEDASSMTAHGREAVGSLVETMHLIDAGSSKMTGIITAIEGIAFQTNILALNAAVEAARAGEEGRGFAVVAGEVRSLAQRSAAAAKEIGILIADSTSRVAHGAQIATGAGDTIRDIETAISRVAKIVGEIATASQQQSDGIKEVSLAVTQMDEVTQQNAALVEENAATAAALADEAKRLSELTAAFRVGVG >CP026102.1|AUT55120.1|2010766_2011717_-|AEC-family-transporter MLSTLEILLPVFALIFAGFFCRRRNLLGPTAASELNRFVVWLALPALLFDTMAHSTWHQLDQPAFIATFSIACAGVFVVVLLARLASGRHLADASVDAIAASYPNTGYIGFPLGLLAFGRASLTPTTIATIIVACVLFALAIVLIEIGLQTERTPHKLGAKVVWRLLRNPLIASPILGVLAASADVALPHSVETFLKLLSGAASPCALVSLGLFLAEKRTPAEQAAEPVTSFVLTAIKLIAQPALAWWIAARVFALPAPMVDMAVLLAALPTGTGPYMLAEFYEREAHITSQTILLSTLGSLVSLSLLLFYMHAPG >CP026102.1|AUT55121.1|2011824_2012775_+|LysR-family-transcriptional-regulator MLDVKPLRYFVTLAETRHFGRAAARLNLSQPPLSRQLAALEAALGVTLIERSPRSVTLTAAGERFYEDAKAILASIEQAARHARAAAAGDTGQLTVGFTMCAAYSVLPSYARAYGDAWPGVTLNLREVVSNDLAPQVLSGQIDAAIMFPGAQSKDLDTRAIFTEPLCVALSREHPLACAHQLKIAQLAREPFVMASEAVSPSLRATIVDHCAQGGFAPDVRFEVQLQQTVLSLVDEGVGIALVPESMRKAQLVGVVFRPLDDAPTISQMLVWSPSNRNPCLARFLEIAWKRRAERNGEESRASAHSGADAEKQRYR >CP026102.1|AUT55122.1|2012765_2013536_-|hypothetical-protein MVGKLLMRGMLAGIVAGLLTFAFARVAGEPLVDTAISFEEKMQTAHDHGDASGAHDHEEELVSRGTQAGLGLLTGVVAYGMAFGGLFALTFAYLHGRVGRLGARALSAWLAVGAYVAVVLVPTIKYPANPPSVGDPETIGMRTGLFFLMIVTSLVVAVFSMKVRKHLVSRLGVWNASIVGGIVFVAIIAAIQIALPTVNEVPEAFPAVVLWKFRFTALGMQAIMWATIGLLFGALVERSERIARASAASARNSAYL >CP026102.1|AUT56005.1|2013557_2013782_-|CbtB-domain-containing-protein MNDAVLDHAGQTDQPVITPIPLRELLPWILFGGLLMLLALYFVGAEQGATSLIPGMYVHEFVHDGRHLLGFPCH >CP026102.1|AUT55123.1|2014028_2014604_+|histidine-phosphatase-family-protein MRTRLLLISHPATAAQRKGTFPADDPLDTRAVEEATSFRASHAGLLNADAALSSPAACALDTARALGLAATIVPDLADADFGRWRGRRLLDVANEDTNALDTWTRDPSSAPHGGESFDALTLRVGGWLDAFEQRGTVIAVTHAGVIRAALMHVLQAPSARFARIEVPPLSVVELQRDQRGWTWWPAPDRRS >CP026102.1|AUT55124.1|2014619_2015018_-|hypothetical-protein MTIRAIAPLALWTVAQLAVAATGDASCGVLAGAGASGASSVSAASGFALRDGEPVDFIAGGKTVPGTLHVLKDGGIYRAYWQPQGRPERYVLANAGTDAVRLIATPAQGKPATDGMPGTTLNPQQVLSCPTL >CP026102.1|AUT55125.1|2015231_2015450_+|hypothetical-protein MATLEAFRSVLDDARTPEIIRNHIIDSLQYALRNHGQVFTSKEVEWLAKWDDARIPLAASRELQKRLTQTAD >CP026102.1|AUT55126.1|2015963_2016251_+|hypothetical-protein MTTQGIKTYKGYEIHPLIYPRRTANGVAHRNSIDSGYDASVRICRVGANAAADGRVFRLSYFRPFEGAGKARIACMEHAAQVIDGRVDGQTVSDL >CP026102.1|AUT55127.1|2016382_2016646_+|translation-initiation-factor-IF-1 MAKEELLELDGIVDEVLPDSRYRVTLDNGVVVGAYASGRMRKNHIRILAGDRVTLELSVYDLTKGRINFRHKDERSSGPRSAPMRRR |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CP026102_1 | 1.1|235470|25|CP026102|CRISPRCasFinder | 235470-235494 | 25 | CP026101.1 | 789622-789646 | 2 | 0.92 |
CP026102_1 | 1.1|235470|25|CP026102|CRISPRCasFinder | 235470-235494 | 25 | CP026101.1 | 2609896-2609920 | 2 | 0.92 |
CP026102_2 | 2.1|791776|36|CP026102|CRISPRCasFinder | 791776-791811 | 36 | CP026101.1 | 3236615-3236650 | 2 | 0.944 |
CP026102_2 | 2.1|791776|36|CP026102|CRISPRCasFinder | 791776-791811 | 36 | CP026102.1 | 355993-356028 | 1 | 0.972 |
CP026102_2 | 2.1|791776|36|CP026102|CRISPRCasFinder | 791776-791811 | 36 | CP026102.1 | 356637-356672 | 1 | 0.972 |
CP026102_2 | 2.1|791776|36|CP026102|CRISPRCasFinder | 791776-791811 | 36 | CP026102.1 | 2417525-2417560 | 2 | 0.944 |
CP026102_3 | 3.1|1048681|41|CP026102|CRISPRCasFinder | 1048681-1048721 | 41 | CP026102.1 | 2008947-2008987 | 2 | 0.951 |
1. spacer 1.1|235470|25|CP026102|CRISPRCasFinder matches to position: 789622-789646, mismatch: 2, identity: 0.92
gaaagcaaaggccgaaacaccgaac CRISPR spacer gaaagcaaagaccgacacaccgaac Protospacer **********.**** *********
2. spacer 1.1|235470|25|CP026102|CRISPRCasFinder matches to position: 2609896-2609920, mismatch: 2, identity: 0.92
gaaagcaaaggccgaaacaccgaac CRISPR spacer gaaagcaaaggccgacaccccgaac Protospacer *************** ** ******
3. spacer 2.1|791776|36|CP026102|CRISPRCasFinder matches to position: 3236615-3236650, mismatch: 2, identity: 0.944
acgctaacaattcgcggacgccagcgaaaaggcaaa CRISPR spacer acgctaacaactcgcggatgccagcgaaaaggcaaa Protospacer **********.*******.*****************
4. spacer 2.1|791776|36|CP026102|CRISPRCasFinder matches to position: 355993-356028, mismatch: 1, identity: 0.972
acgctaacaattcgcggacgccagcgaaaaggcaaa CRISPR spacer acgctaacaattcgcggacgccagcgaagaggcaaa Protospacer ****************************.*******
5. spacer 2.1|791776|36|CP026102|CRISPRCasFinder matches to position: 356637-356672, mismatch: 1, identity: 0.972
acgctaacaattcgcggacgccagcgaaaaggcaaa CRISPR spacer acgctaacaattcgcggatgccagcgaaaaggcaaa Protospacer ******************.*****************
6. spacer 2.1|791776|36|CP026102|CRISPRCasFinder matches to position: 2417525-2417560, mismatch: 2, identity: 0.944
acgctaacaattcgcggacgccagcgaaaaggcaaa CRISPR spacer acgctaacaattcgcggatgccagcgaaaaagcaaa Protospacer ******************.***********.*****
7. spacer 3.1|1048681|41|CP026102|CRISPRCasFinder matches to position: 2008947-2008987, mismatch: 2, identity: 0.951
cataacaccgaacggcgacgacgcgtgaagcaagacaacaa CRISPR spacer caaaacaccgaacggcgacgacgcgagaagcaagacaacaa Protospacer ** ********************** ***************
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
CP026102_1 | 1.1|235470|25|CP026102|CRISPRCasFinder | 235470-235494 | 25 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 1856731-1856755 | 2 | 0.92 |
CP026102_2 | 2.1|791776|36|CP026102|CRISPRCasFinder | 791776-791811 | 36 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 251067-251102 | 2 | 0.944 |
CP026102_2 | 2.1|791776|36|CP026102|CRISPRCasFinder | 791776-791811 | 36 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 1313582-1313617 | 2 | 0.944 |
CP026102_2 | 2.1|791776|36|CP026102|CRISPRCasFinder | 791776-791811 | 36 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 514354-514389 | 3 | 0.917 |
CP026102_2 | 2.1|791776|36|CP026102|CRISPRCasFinder | 791776-791811 | 36 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 427887-427922 | 3 | 0.917 |
CP026102_2 | 2.1|791776|36|CP026102|CRISPRCasFinder | 791776-791811 | 36 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 1856706-1856741 | 3 | 0.917 |
CP026102_1 | 1.1|235470|25|CP026102|CRISPRCasFinder | 235470-235494 | 25 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 1316845-1316869 | 4 | 0.84 |
CP026102_2 | 2.1|791776|36|CP026102|CRISPRCasFinder | 791776-791811 | 36 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 360003-360038 | 4 | 0.889 |
CP026102_2 | 2.1|791776|36|CP026102|CRISPRCasFinder | 791776-791811 | 36 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 1551778-1551813 | 4 | 0.889 |
CP026102_2 | 2.1|791776|36|CP026102|CRISPRCasFinder | 791776-791811 | 36 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 223-258 | 4 | 0.889 |
CP026102_1 | 1.1|235470|25|CP026102|CRISPRCasFinder | 235470-235494 | 25 | NZ_CP032828 | Sphingomonas sp. YZ-8 plasmid unnamed1, complete sequence | 305162-305186 | 5 | 0.8 |
CP026102_2 | 2.1|791776|36|CP026102|CRISPRCasFinder | 791776-791811 | 36 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 271159-271194 | 5 | 0.861 |
CP026102_2 | 2.1|791776|36|CP026102|CRISPRCasFinder | 791776-791811 | 36 | NZ_CP013104 | Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence | 123273-123308 | 5 | 0.861 |
CP026102_2 | 2.1|791776|36|CP026102|CRISPRCasFinder | 791776-791811 | 36 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 2242572-2242607 | 6 | 0.833 |
CP026102_2 | 2.1|791776|36|CP026102|CRISPRCasFinder | 791776-791811 | 36 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 2241930-2241965 | 6 | 0.833 |
CP026102_2 | 2.1|791776|36|CP026102|CRISPRCasFinder | 791776-791811 | 36 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 1036838-1036873 | 6 | 0.833 |
CP026102_2 | 2.1|791776|36|CP026102|CRISPRCasFinder | 791776-791811 | 36 | NZ_CP013104 | Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence | 1049514-1049549 | 6 | 0.833 |
CP026102_2 | 2.1|791776|36|CP026102|CRISPRCasFinder | 791776-791811 | 36 | NZ_CP013104 | Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence | 82434-82469 | 6 | 0.833 |
CP026102_2 | 2.1|791776|36|CP026102|CRISPRCasFinder | 791776-791811 | 36 | NZ_CP013104 | Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence | 1622761-1622796 | 6 | 0.833 |
CP026102_2 | 2.1|791776|36|CP026102|CRISPRCasFinder | 791776-791811 | 36 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 2239817-2239852 | 7 | 0.806 |
CP026102_2 | 2.1|791776|36|CP026102|CRISPRCasFinder | 791776-791811 | 36 | NZ_CP013104 | Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence | 526122-526157 | 7 | 0.806 |
CP026102_2 | 2.1|791776|36|CP026102|CRISPRCasFinder | 791776-791811 | 36 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 1314653-1314688 | 8 | 0.778 |
CP026102_2 | 2.1|791776|36|CP026102|CRISPRCasFinder | 791776-791811 | 36 | NZ_CP013104 | Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence | 214994-215029 | 8 | 0.778 |
CP026102_2 | 2.1|791776|36|CP026102|CRISPRCasFinder | 791776-791811 | 36 | NZ_CP013104 | Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence | 767357-767392 | 8 | 0.778 |
CP026102_2 | 2.1|791776|36|CP026102|CRISPRCasFinder | 791776-791811 | 36 | NZ_CP013104 | Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence | 83079-83114 | 8 | 0.778 |
CP026102_2 | 2.1|791776|36|CP026102|CRISPRCasFinder | 791776-791811 | 36 | NZ_CP013104 | Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence | 123918-123953 | 8 | 0.778 |
CP026102_2 | 2.1|791776|36|CP026102|CRISPRCasFinder | 791776-791811 | 36 | NZ_CP013104 | Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence | 1561890-1561925 | 8 | 0.778 |
1. spacer 1.1|235470|25|CP026102|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 2, identity: 0.92
gaaagcaaaggccgaaacaccgaac CRISPR spacer gaaagcaaagcccaaaacaccgaac Protospacer ********** **.***********
2. spacer 2.1|791776|36|CP026102|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 2, identity: 0.944
acgctaacaattcgcggacgccagcgaaaaggcaaa CRISPR spacer aggctaacaattcgcggatgccagcgaaaaggcaaa Protospacer * ****************.*****************
3. spacer 2.1|791776|36|CP026102|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 2, identity: 0.944
acgctaacaattcgcggacgccagcgaaaaggcaaa CRISPR spacer acgctaacaattcgcggatgccagcgaaaaagcaaa Protospacer ******************.***********.*****
4. spacer 2.1|791776|36|CP026102|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 3, identity: 0.917
acgctaacaattcgcggacgccagcgaaaaggcaaa CRISPR spacer acgctaacaattcgcggatgccagcgaaaagacaag Protospacer ******************.************.***.
5. spacer 2.1|791776|36|CP026102|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 3, identity: 0.917
acgctaacaattcgcggacgccagcgaaaaggcaaa CRISPR spacer acgctaacaattcgcggacgccagcgcaaaagcata Protospacer ************************** ***.*** *
6. spacer 2.1|791776|36|CP026102|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 3, identity: 0.917
acgctaacaattcgcggacgccagcgaaaaggcaaa-- CRISPR spacer acgctaacaattcgcggatgccagcgaaa--gcaaagc Protospacer ******************.********** *****
7. spacer 1.1|235470|25|CP026102|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 4, identity: 0.84
gaaagcaaaggccgaaacaccgaac CRISPR spacer cagcgcaaaggccaaaacaccgaac Protospacer *. *********.***********
8. spacer 2.1|791776|36|CP026102|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 4, identity: 0.889
acgctaacaattcgcggacgccagcgaaaaggcaaa CRISPR spacer gagctaacaattcgcggatgccagcgaaaacgcaaa Protospacer . ****************.*********** *****
9. spacer 2.1|791776|36|CP026102|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 4, identity: 0.889
acgctaacaattcgcggacgccagcgaaaaggcaaa CRISPR spacer acgctaacaattcgcggatgccagcgcaaaggccat Protospacer ******************.******* ****** *
10. spacer 2.1|791776|36|CP026102|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 4, identity: 0.889
acgctaacaattcgcggacgccagcga-aaaggcaaa CRISPR spacer acgctaacaattcgcggatgccagcgacaaaggccc- Protospacer ******************.******** ******
11. spacer 1.1|235470|25|CP026102|CRISPRCasFinder matches to NZ_CP032828 (Sphingomonas sp. YZ-8 plasmid unnamed1, complete sequence) position: , mismatch: 5, identity: 0.8
gaaagcaaaggccgaaacaccgaac CRISPR spacer tcgcgcaaaggccgaaacaccgatc Protospacer . ******************* *
12. spacer 2.1|791776|36|CP026102|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 5, identity: 0.861
acgctaacaattcgcggacgccagcgaaaaggcaaa CRISPR spacer acgctaacaattcgcggatgccagcgcaaagcaaac Protospacer ******************.******* **** **
13. spacer 2.1|791776|36|CP026102|CRISPRCasFinder matches to NZ_CP013104 (Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence) position: , mismatch: 5, identity: 0.861
acgctaacaattcgcggacgccagcgaaaaggcaaa CRISPR spacer acgctaacaattcgcggatgccagcgaaagcgaaag Protospacer ******************.**********. * **.
14. spacer 2.1|791776|36|CP026102|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 6, identity: 0.833
acgctaacaattcgcggacgccagcgaaaaggcaaa CRISPR spacer acgctaacaattcgcggatgccagcgcaaacacgag Protospacer ******************.******* *** .*.*.
15. spacer 2.1|791776|36|CP026102|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 6, identity: 0.833
acgctaacaattcgcggacgccagcgaaaaggcaaa CRISPR spacer acgctaacaattcgcggatgccagcgcaaacacgag Protospacer ******************.******* *** .*.*.
16. spacer 2.1|791776|36|CP026102|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 6, identity: 0.833
acgctaacaattcgcggacgccagcgaaaaggcaaa CRISPR spacer atgctaacaattcgcggatgccagcgcaaaaataaa Protospacer *.****************.******* ***...***
17. spacer 2.1|791776|36|CP026102|CRISPRCasFinder matches to NZ_CP013104 (Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence) position: , mismatch: 6, identity: 0.833
acgctaacaattcgcggacgccagcgaaaaggcaaa CRISPR spacer acgctaacaattcgcggatgccagcgaaagcaaaag Protospacer ******************.**********. . **.
18. spacer 2.1|791776|36|CP026102|CRISPRCasFinder matches to NZ_CP013104 (Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence) position: , mismatch: 6, identity: 0.833
acgctaacaattcgcggacgccagcgaaaaggcaaa CRISPR spacer acgctaacaattcgcggatgccagcgaaagcaaaag Protospacer ******************.**********. . **.
19. spacer 2.1|791776|36|CP026102|CRISPRCasFinder matches to NZ_CP013104 (Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence) position: , mismatch: 6, identity: 0.833
acgctaacaattcgcggacgccagcgaaaaggcaaa CRISPR spacer acgctaacaattcgcggatgccagcgaaagcaaaag Protospacer ******************.**********. . **.
20. spacer 2.1|791776|36|CP026102|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.806
acgctaacaattcgcggacgccagcgaaaaggcaaa CRISPR spacer acgctaacaattcgcggatgccagcgaaagcaaagg Protospacer ******************.**********. . *..
21. spacer 2.1|791776|36|CP026102|CRISPRCasFinder matches to NZ_CP013104 (Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence) position: , mismatch: 7, identity: 0.806
acgctaacaattcgcggacgccagcgaaaaggcaaa CRISPR spacer aggctaacaattcgcggatgccagcgaaagaaccac Protospacer * ****************.**********...* *
22. spacer 2.1|791776|36|CP026102|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 8, identity: 0.778
acgctaacaattcgcggacgccagcgaaaaggcaaa CRISPR spacer gcgagggcaaatcgcggatgccagcgaaaaggcaga Protospacer .** ..*** *******.***************.*
23. spacer 2.1|791776|36|CP026102|CRISPRCasFinder matches to NZ_CP013104 (Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence) position: , mismatch: 8, identity: 0.778
acgctaacaattcgcggacgccagcgaaaaggcaaa CRISPR spacer gcaaagccaattcgcggatgccagcgcaaaggcaaa Protospacer .*. . ***********.******* *********
24. spacer 2.1|791776|36|CP026102|CRISPRCasFinder matches to NZ_CP013104 (Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence) position: , mismatch: 8, identity: 0.778
acgctaacaattcgcggacgccagcgaaaaggcaaa CRISPR spacer gagctaacaattcgcggatgccagcgaaagcaaata Protospacer . ****************.**********. . * *
25. spacer 2.1|791776|36|CP026102|CRISPRCasFinder matches to NZ_CP013104 (Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence) position: , mismatch: 8, identity: 0.778
acgctaacaattcgcggacgccagcgaaaaggcaaa CRISPR spacer tagctaacaaatcgcggatgccagcgaaaaccaaac Protospacer ******** *******.*********** **
26. spacer 2.1|791776|36|CP026102|CRISPRCasFinder matches to NZ_CP013104 (Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence) position: , mismatch: 8, identity: 0.778
acgctaacaattcgcggacgccagcgaaaaggcaaa CRISPR spacer gcgctaacaattcgcggatgccagcaaaagcaaaag Protospacer .*****************.******.***. . **.
27. spacer 2.1|791776|36|CP026102|CRISPRCasFinder matches to NZ_CP013104 (Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence) position: , mismatch: 8, identity: 0.778
acgctaacaattcgcggacgccagcgaaaaggcaaa CRISPR spacer gagctaacaattcgcggatgccaacgaaagcgaaag Protospacer . ****************.****.*****. * **.
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|